- Run the setup script: `./setup.sh`
- Activate the virtual environment: `source venv/bin/activate`
- Copy `.env.template` to `.env` and add your API keys
- Run data collection: `python collect_data.py`
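
The copy step above is easy to get wrong on a second run, since copying again would wipe keys already filled in. The following is a minimal sketch of a safer copy, not part of the repository: the helper name `ensure_env_file` is hypothetical, and it assumes `.env.template` sits in the directory you pass in.

```python
import shutil
from pathlib import Path


def ensure_env_file(project_dir: str = ".") -> bool:
    """Copy .env.template to .env unless .env already exists.

    Hypothetical helper for illustration only.
    Returns True if a new .env was created, False if one was already present.
    """
    root = Path(project_dir)
    template = root / ".env.template"
    target = root / ".env"

    if target.exists():
        # Never overwrite a file that may already contain real API keys.
        return False
    if not template.exists():
        raise FileNotFoundError(f"{template} not found; run from the repository root")

    shutil.copyfile(template, target)
    return True
```

Calling `ensure_env_file()` from the repository root creates `.env` on the first run and leaves it untouched afterwards.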

- `data/` - Collected datasets
- `notebooks/` - Jupyter notebooks for analysis
- `src/` - Source code modules
- `results/` - Model outputs and evaluation results
- `logs/` - Log files

- GitHub Token: Raises the GitHub API rate limit (optional but recommended). Get one at https://github.com/settings/tokens
- OpenAI API Key: Used for LLM classification. Get one at https://platform.openai.com/api-keys
- Anthropic API Key: Alternative LLM option. Get one at https://console.anthropic.com/
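
Before running data collection, it can help to confirm that `.env` actually contains the keys listed above. The sketch below is one way to do that under stated assumptions: the variable names `GITHUB_TOKEN`, `OPENAI_API_KEY`, and `ANTHROPIC_API_KEY` are guesses (check `.env.template` for the names the project really uses), and `parse_env` and `check_keys` are hypothetical helpers, not part of the repository.

```python
from pathlib import Path


def parse_env(text: str) -> dict:
    """Parse simple NAME=value lines, skipping blanks and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[name.strip()] = value.strip().strip('"').strip("'")
    return values


def check_keys(values: dict) -> list:
    """Return human-readable problems; an empty list means all keys are set."""
    problems = []
    # Assumed names: at least one LLM key is needed for classification.
    if not (values.get("OPENAI_API_KEY") or values.get("ANTHROPIC_API_KEY")):
        problems.append("set OPENAI_API_KEY or ANTHROPIC_API_KEY for LLM classification")
    # Assumed name: the GitHub token is optional but recommended.
    if not values.get("GITHUB_TOKEN"):
        problems.append("GITHUB_TOKEN is unset; GitHub API requests will hit a lower rate limit")
    return problems


if __name__ == "__main__":
    env_path = Path(".env")
    found = parse_env(env_path.read_text()) if env_path.exists() else {}
    for problem in check_keys(found):
        print(f"warning: {problem}")
```

The parser is deliberately small and handles only plain `NAME=value` lines; a project that already depends on a dotenv library would more likely use that instead.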