Skip to content

Actions: allenai/reward-bench

Actions

Quality

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
456 workflow runs
456 workflow runs
Event

Filter by event

Loading
Status

Filter by status

Loading
Branch
Actor

Filter by actor

Loading
Improve run_generative documentation + add to pip
Quality #382: Pull request #124 synchronize by natolambert
May 11, 2024 02:31 2m 34s package_generative
May 11, 2024 02:31 2m 34s
Improve run_generative documentation + add to pip
Quality #381: Pull request #124 opened by natolambert
May 11, 2024 02:31 2m 41s package_generative
May 11, 2024 02:31 2m 41s
[Add Model] Pairwise Preference Model
Quality #380: Pull request #123 synchronize by WeiXiongUST
May 10, 2024 03:00 2m 37s WeiXiongUST:pair_pm_dev
May 10, 2024 03:00 2m 37s
[Add Model] Pairwise Preference Model
Quality #379: Pull request #123 opened by WeiXiongUST
May 9, 2024 11:34 3m 59s WeiXiongUST:pair_pm_dev
May 9, 2024 11:34 3m 59s
Make RewardBench pip installable + runable! (#121)
Quality #378: Commit a7cf68b pushed by natolambert
May 5, 2024 18:32 2m 39s main
May 5, 2024 18:32 2m 39s
Make RewardBench pip installable + runable!
Quality #377: Pull request #121 synchronize by natolambert
May 4, 2024 14:11 2m 27s package
May 4, 2024 14:11 2m 27s
Make RewardBench pip installable + runable!
Quality #376: Pull request #121 synchronize by natolambert
May 4, 2024 14:01 2m 36s package
May 4, 2024 14:01 2m 36s
Make RewardBench pip installable + runable!
Quality #375: Pull request #121 synchronize by natolambert
May 4, 2024 14:00 2m 37s package
May 4, 2024 14:00 2m 37s
Make RewardBench pip installable + runable!
Quality #374: Pull request #121 synchronize by natolambert
May 4, 2024 13:54 2m 30s package
May 4, 2024 13:54 2m 30s
Make RewardBench pip installable + runable!
Quality #373: Pull request #121 synchronize by natolambert
May 4, 2024 00:15 2m 48s package
May 4, 2024 00:15 2m 48s
Make RewardBench pip installable + runable!
Quality #372: Pull request #121 opened by natolambert
May 3, 2024 23:36 2m 47s package
May 3, 2024 23:36 2m 47s
Ensemble pre-computed reward outputs (#120)
Quality #371: Commit 79e5943 pushed by natolambert
May 2, 2024 23:18 2m 46s main
May 2, 2024 23:18 2m 46s
Ensemble pre-computed reward outputs
Quality #370: Pull request #120 synchronize by natolambert
May 2, 2024 22:57 3m 0s ensemble
May 2, 2024 22:57 3m 0s
Ensemble pre-computed reward outputs
Quality #369: Pull request #120 synchronize by natolambert
May 2, 2024 22:44 2m 51s ensemble
May 2, 2024 22:44 2m 51s
Implement PoLL (LLM-as-a-judge ensembles) (#119)
Quality #368: Commit 12cf23d pushed by natolambert
May 2, 2024 22:42 7m 29s main
May 2, 2024 22:42 7m 29s
Ensemble pre-computed reward outputs
Quality #367: Pull request #120 opened by natolambert
May 2, 2024 22:42 2m 44s ensemble
May 2, 2024 22:42 2m 44s
Implement PoLL (LLM-as-a-judge ensembles)
Quality #366: Pull request #119 opened by natolambert
May 2, 2024 18:48 2m 41s poll
May 2, 2024 18:48 2m 41s
bon eval
Quality #365: Pull request #111 synchronize by yuchenlin
April 30, 2024 15:15 2m 48s bon_eval
April 30, 2024 15:15 2m 48s
bon eval
Quality #364: Pull request #111 synchronize by yuchenlin
April 30, 2024 15:12 3m 10s bon_eval
April 30, 2024 15:12 3m 10s
bon eval
Quality #363: Pull request #111 synchronize by yuchenlin
April 30, 2024 14:49 16m 33s bon_eval
April 30, 2024 14:49 16m 33s
bon eval
Quality #362: Pull request #111 synchronize by yuchenlin
April 30, 2024 14:21 2m 59s bon_eval
April 30, 2024 14:21 2m 59s
bon eval
Quality #361: Pull request #111 synchronize by yuchenlin
April 30, 2024 14:12 2m 57s bon_eval
April 30, 2024 14:12 2m 57s
Add pad_token_id from tokenizer to model config. (#117)
Quality #360: Commit d973dab pushed by natolambert
April 25, 2024 18:57 7m 7s main
April 25, 2024 18:57 7m 7s
Add pad_token_id from tokenizer to model config.
Quality #359: Pull request #117 synchronize by hank0316
April 25, 2024 17:00 2m 44s hank0316:main
April 25, 2024 17:00 2m 44s
bon eval
Quality #358: Pull request #111 synchronize by yuchenlin
April 25, 2024 14:09 2m 52s bon_eval
April 25, 2024 14:09 2m 52s