minicode

Install

Install uv.

Creating benchmark splits locally

Run all commands from the repository root directory.

CodeContests

uv run python -m minicode.setup_codecontests

Small repositories

uv run python -m minicode.setup_repos

Large repositories

uv run python -m minicode.setup_large_repos

Run agent baseline

Make sure that .env exists in the main directory with TOGETHER_API_KEY and OPENAI_API_KEY and ANTHROPIC_API_KEY.

CodeContests

bash scripts/codecontests/run_claude.sh
# or
bash scripts/codecontests/run_codex.sh

To get the evaluation results, run

uv run python scripts/codecontests/summarize_eval.py

Small repositories

bash scripts/small_repos/run_codex.sh
# or
bash scripts/small_repos/run_claude.sh

Large repositories

At the moment, we run the agents locally. To get <repo_name>, first run the setup script for large repos above, then check the large_repos directory.

bash scripts/large_repos/run_claude.sh <repo_name>

To get the evaluation results, run

uv run python scripts/large_repos/summarize_eval.py <repo_name>

Other

Repositories were synthesized via Claude 3.7 and Claude Code here

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
data		data
minicode		minicode
prompts		prompts
scripts		scripts
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

minicode

Install

Creating benchmark splits locally

Run agent baseline

Other

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

code-refactor/minicode

Folders and files

Latest commit

History

Repository files navigation

minicode

Install

Creating benchmark splits locally

Run agent baseline

Other

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages