Skip to content

docs: update Codex model references and add auth setup#8

Merged
szjanikowski merged 1 commit into
mainfrom
docs/update-codex-models-and-auth
Mar 23, 2026
Merged

docs: update Codex model references and add auth setup#8
szjanikowski merged 1 commit into
mainfrom
docs/update-codex-models-and-auth

Conversation

@szjanikowski
Copy link
Copy Markdown
Contributor

Summary

  • Replace outdated OpenAI model names (o3, o4-mini, codex-mini-latest) with current Codex models (gpt-5.4, gpt-5.3-codex, gpt-5.3-codex-spark, etc.) across CLAUDE.md and skills
  • Add authentication setup section to benchmark-runner skill (OAuth token script + CODEX_API_KEY from .env)
  • Add critical warning: Codex silently accepts invalid model names (e.g. claude-sonnet-4-6) producing garbage results (0% pass rate)
  • Add Docker OOM warning for parallel Claude runs

Test plan

  • Smoke test: codex-vanilla with --model gpt-5.3-codex scored 87/100 (vs 0-45 with wrong model)
  • Grep confirms no remaining references to o3, o4-mini, gpt-4o, gpt-4.1, or codex-mini-latest

🤖 Generated with Claude Code

- Model resolution chain: --model flag > variant.toml model > nasde.toml
  [defaults] model. Error if no model found at any level.
- Remove hardcoded "claude-sonnet-4-6" default that silently broke Codex
  variants (Codex accepts invalid model names but produces garbage results).
- Add load_variant_config() to read full variant.toml including model field.
- Add model = "gpt-5.3-codex" to all Codex variant.toml files in examples.
- Replace outdated model names (o3, o4-mini, codex-mini-latest) with current
  OpenAI Codex models (gpt-5.4, gpt-5.3-codex, gpt-5.3-codex-spark).
- Add authentication setup section to benchmark-runner skill (OAuth token
  script + CODEX_API_KEY from .env).
- Add warnings about Docker OOM with parallel Claude runs.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@szjanikowski szjanikowski force-pushed the docs/update-codex-models-and-auth branch from 0395289 to 39f9a66 Compare March 23, 2026 22:45
@szjanikowski szjanikowski merged commit 27b6af9 into main Mar 23, 2026
5 checks passed
@szjanikowski szjanikowski deleted the docs/update-codex-models-and-auth branch April 22, 2026 12:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant