Actions: stanford-crfm/helm

Showing runs from all workflows
4,280 workflow runs

Daily Integration Tests
Daily Integration Tests #98: Scheduled
November 6, 2024 15:35 8m 48s main
Scenario tests
Scenario tests #80: Scheduled
November 6, 2024 15:35 11m 56s main
Add Vocal Sound, Multilingual Librispeech, and FLEURS scenarios (#3130)
Test #7529: Commit 137698a pushed by teetone
November 6, 2024 08:28 13m 38s main
Add Vocal Sound, Multilingual Librispeech, and FLEURS scenarios (#3130)
Update requirements.txt #75: Commit 137698a pushed by teetone
November 6, 2024 08:28 2h 2m 52s main
Release Image2Struct v1.0.2 (#3133)
Build Frontend #143: Commit 68651b9 pushed by yifanmai
November 6, 2024 03:23 50s main
Release Image2Struct v1.0.2 (#3133)
Frontend #649: Commit 68651b9 pushed by yifanmai
November 6, 2024 03:23 1m 5s main
Release Image2Struct v1.0.2
Frontend #648: Pull request #3133 opened by yifanmai
November 6, 2024 01:27 1m 2s yifanmai/release-image2struct-v1.0.2
Add SUMO Web Claims Summarization scenario
Test #7528: Pull request #3112 synchronize by yifanmai
November 6, 2024 00:28 13m 35s yifanmai/fix-sumosum
Add SUMO Web Claims Summarization scenario
Test #7527: Pull request #3112 synchronize by yifanmai
November 6, 2024 00:25 13m 25s yifanmai/fix-sumosum
Add wmt_14_no_prefix to output_format_instructions run expander (#3132)
Test #7524: Commit 1ee0cec pushed by yifanmai
November 5, 2024 23:17 13m 4s main
Add Legal Contract Summarization scenario
Test #7521: Pull request #3131 opened by yifanmai
November 5, 2024 21:29 12m 58s yifanmai/fix-legal-contract
Improvements to audio utilities (#3128)
Test #7520: Commit d1a9d14 pushed by yifanmai
November 5, 2024 19:31 13m 19s main
Improvements to audio utilities
Test #7519: Pull request #3128 synchronize by yifanmai
November 5, 2024 17:51 13m 40s yifanmai/fix-audio-utils
Daily Integration Tests
Daily Integration Tests #97: Scheduled
November 5, 2024 15:35 5m 6s main
Scenario tests
Scenario tests #79: Scheduled
November 5, 2024 15:35 11m 27s main
Adding the IFEval scenario
Test #7518: Pull request #3122 synchronize by liamjxu
November 5, 2024 04:11 13m 41s jialiang/ifeval
Adding the IFEval scenario
Scenario tests #78: Pull request #3122 synchronize by liamjxu
November 5, 2024 04:11 11m 12s jialiang/ifeval
GPQA Few shot CoT (#3096)
Test #7517: Commit 6f5ca60 pushed by yifanmai
November 4, 2024 22:13 13m 20s main
Comments addressed for MMLU-PRO Non COT (#3125)
Test #7516: Commit 0bed847 pushed by siyagoel
November 4, 2024 20:57 12m 53s main
Comments addressed for MMLU-PRO Non COT (#3125)
Scenario tests #77: Commit 0bed847 pushed by siyagoel
November 4, 2024 20:57 11m 54s main
Skip models column when summarizing mean (#3127)
Test #7515: Commit 119c555 pushed by yifanmai
November 4, 2024 16:30 13m 59s main