@georgejwdeane georgejwdeane commented Nov 18, 2025

11 harder versions of the diagnostic evals

Asana Task

Comment on lines +594 to +597
# Set starting energy to 30 and no regen
agent = cfg.game.agent
agent.initial_inventory = dict(agent.initial_inventory)
agent.initial_inventory["energy"] = 60

Comment-code mismatch on energy configuration. The comment on line 594 states "Set starting energy to 30" but line 597 actually sets initial_inventory["energy"] = 60. This inconsistency could lead to incorrect difficulty tuning.

# Either update the comment:
# Set starting energy to 60 and no regen
# Or update the code:
agent.initial_inventory["energy"] = 30
Suggested change
- # Set starting energy to 30 and no regen
- agent = cfg.game.agent
- agent.initial_inventory = dict(agent.initial_inventory)
- agent.initial_inventory["energy"] = 60
+ # Set starting energy to 60 and no regen
+ agent = cfg.game.agent
+ agent.initial_inventory = dict(agent.initial_inventory)
+ agent.initial_inventory["energy"] = 60

Spotted by Graphite Agent
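
One way to keep a comment and a value from drifting apart like this is to name the value so the number appears only once. The following is a minimal sketch, not the PR's code: `STARTING_ENERGY` and `set_starting_energy` are hypothetical names, and a `SimpleNamespace` stands in for `cfg.game.agent`, whose real type is not shown in this diff.

```python
from types import SimpleNamespace

# Hypothetical constant: the single place where the starting energy is stated.
STARTING_ENERGY = 60


def set_starting_energy(agent, energy=STARTING_ENERGY):
    """Copy the agent's inventory, then set its energy entry."""
    # Copy first so a shared default inventory dict is not mutated in place,
    # mirroring the dict(...) copy in the diff above.
    agent.initial_inventory = dict(agent.initial_inventory)
    agent.initial_inventory["energy"] = energy
    return agent


# Stand-in for cfg.game.agent (an assumption; the real config class may differ).
shared_defaults = {"energy": 100}
agent = SimpleNamespace(initial_inventory=shared_defaults)
set_starting_energy(agent)
```

With this shape, the comment no longer needs to repeat the number, so there is nothing to fall out of sync when the difficulty is retuned.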


@daphnedemekas daphnedemekas added this pull request to the merge queue Nov 20, 2025
Merged via the queue into main with commit 71a1601 Nov 20, 2025
11 of 12 checks passed
@daphnedemekas daphnedemekas deleted the hard-evals branch November 20, 2025 21:53