Skip to content

feat(sob): new Interfaze scores#11

Merged
Khurdhula-Harshavardhan merged 2 commits into
mainfrom
feat/interfaze-run-2
May 22, 2026
Merged

feat(sob): new Interfaze scores#11
Khurdhula-Harshavardhan merged 2 commits into
mainfrom
feat/interfaze-run-2

Conversation

@Abhinavexist
Copy link
Copy Markdown
Collaborator

@Abhinavexist Abhinavexist commented May 22, 2026

please squash and merge

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

🏆 Leaderboard preview

Built 32 models, top 10 by Overall:

Rank Model Overall Val. Acc. JSON Pass Perfect
1 GPT-5.4 0.870 0.798 0.993 0.469
2 gemini-3.1-pro-preview 0.869 0.820 0.966 0.542
3 z-ai_glm-5.1 0.866 0.806 0.975 0.498
4 claude-opus-4-7 0.864 0.787 0.993 0.424
5 GLM-4.7 0.861 0.804 0.965 0.508
6 Qwen3.5-35B 0.861 0.801 0.969 0.500
7 gpt-5.5 0.860 0.795 0.978 0.464
8 Gemini-2.5-Flash 0.860 0.796 0.972 0.498
9 Interfaze-Beta 0.860 0.805 0.966 0.507
10 Qwen3-235B 0.857 0.786 0.978 0.463

Generated at 2026-05-22T11:09:19+00:00 • full JSON in workflow artifacts

@Khurdhula-Harshavardhan Khurdhula-Harshavardhan merged commit 47d3b9f into main May 22, 2026
1 check passed
@Khurdhula-Harshavardhan Khurdhula-Harshavardhan deleted the feat/interfaze-run-2 branch May 22, 2026 20:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants