Skip to content

Conversation

@ammar-agent
Copy link
Collaborator

@ammar-agent ammar-agent commented Dec 14, 2025

Update nightly benchmark models:

  • anthropic:claude-sonnet-4-5 β†’ anthropic:claude-opus-4-5
  • openai:gpt-5.1-codex β†’ openai:gpt-5.2

Recent Trends (last 5 days)

Date Claude Sonnet 4.5 GPT-5.1-codex
Dec 14 42.5% 31.25%
Dec 13 37.5% 30.0%
Dec 12 36.25% 28.75%
Dec 11 36.25% 28.75%
Dec 10 35.0% 26.25%

Generated with mux β€’ Model: anthropic:claude-opus-4-5 β€’ Thinking: high

@chatgpt-codex-connector
Copy link

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Repo admins can enable using credits for code reviews in their settings.

@ammar-agent ammar-agent force-pushed the tbench-models-upgrade branch from abeb849 to 79a12c9 Compare December 14, 2025 20:01
Update nightly benchmark models:
- anthropic:claude-sonnet-4-5 β†’ anthropic:claude-opus-4-5
- openai:gpt-5.1-codex β†’ openai:gpt-5.2

---
_Generated with `mux` β€’ Model: `anthropic:claude-opus-4-5` β€’ Thinking: `high`_
@ammar-agent ammar-agent force-pushed the tbench-models-upgrade branch from 79a12c9 to bdeed38 Compare December 14, 2025 20:02
@ammario ammario merged commit 9f4c41e into main Dec 14, 2025
20 checks passed
@ammario ammario deleted the tbench-models-upgrade branch December 14, 2025 20:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants