This guide provides comprehensive setup instructions for the SWE-bench Verified High Score Project.
- Python 3.8+
- Git
- Docker
- 120GB+ free disk space
-
Clone the Repository
git clone [Your Repository URL] cd [Your Repository Name]
-
Set Up a Virtual Environment (Optional but recommended)
python -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install Dependencies
pip install -r requirements.txt
-
Install Docker
- Follow the official Docker installation guide
- Ensure Docker is running
-
Set Up Hugging Face Access (for dataset and model access)
pip install huggingface_hub huggingface-cli login
-
Configure Environment Variables Create a
.env
file in the project root and add:OPENAI_API_KEY=your_openai_api_key_here ANTHROPIC_API_KEY=your_anthropic_api_key_here
-
Verify Installation
python verify_setup.py
- If you encounter Docker-related issues, ensure your user has permissions to run Docker commands.
- For package conflicts, try creating a new virtual environment.
- If you're having trouble accessing the dataset, check your Hugging Face account permissions.
After completing the setup:
- Read through
dataset_info.md
to understand the SWE-bench Verified dataset. - Explore
evaluation_guide.md
for instructions on running evaluations. - Start developing your model improvements using
model_development.md
as a guide.
For any issues not covered here, please open an issue in the GitHub repository.