Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
4,895 workflow runs
Add various social bias tasks
Tasks Modified #2444: Pull request #1185 synchronize by oskarvanderwal
May 22, 2024 14:34 3m 45s winogender
Add chat template
Unit Tests #2415: Pull request #1873 synchronize by KonradSzafer
May 22, 2024 13:56 5m 25s huggingface:chat_template
Add chat template
Tasks Modified #2443: Pull request #1873 synchronize by KonradSzafer
May 22, 2024 13:56 2m 8s huggingface:chat_template
Add chat template
Unit Tests #2414: Pull request #1873 opened by KonradSzafer
May 22, 2024 13:46 5m 16s huggingface:chat_template
Add chat template
Tasks Modified #2442: Pull request #1873 opened by KonradSzafer
May 22, 2024 13:46 1m 34s huggingface:chat_template
mmlu-pro for the Italian language
Tasks Modified #2440: Pull request #1860 synchronize by giux78
May 22, 2024 11:35 1m 28s giux78:mmlu-pro-ita-2
mmlu-pro for the Italian language
Unit Tests #2412: Pull request #1860 synchronize by giux78
May 22, 2024 11:35 5m 18s giux78:mmlu-pro-ita-2
mmlu-pro for the Italian language
Tasks Modified #2439: Pull request #1860 synchronize by giux78
May 22, 2024 11:26 1m 40s giux78:mmlu-pro-ita-2
mmlu-pro for the Italian language
Unit Tests #2411: Pull request #1860 synchronize by giux78
May 22, 2024 11:26 5m 40s giux78:mmlu-pro-ita-2
Update polemo2_out.yaml (#1871)
Unit Tests #2410: Commit 70e1de0 pushed by lintangsutawika
May 22, 2024 09:16 5m 33s main
Update polemo2_out.yaml (#1871)
Tasks Modified #2438: Commit 70e1de0 pushed by lintangsutawika
May 22, 2024 09:16 1m 49s main
Update polemo2_out.yaml
Unit Tests #2409: Pull request #1871 opened by zhabuye
May 22, 2024 09:15 5m 44s zhabuye:0
Update polemo2_out.yaml
Tasks Modified #2437: Pull request #1871 opened by zhabuye
May 22, 2024 09:15 1m 34s zhabuye:0
Added tests for Anthropic LLMs
Tasks Modified #2436: Pull request #1868 opened by zafstojano
May 21, 2024 16:03 16m 16s zafstojano:test-coverage-anthropic
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Tasks Modified #2434: Pull request #1867 synchronize by maximegmd
May 21, 2024 12:53 Action required maximegmd:main
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Unit Tests #2406: Pull request #1867 synchronize by maximegmd
May 21, 2024 12:53 Action required maximegmd:main
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Tasks Modified #2433: Pull request #1867 opened by maximegmd
May 21, 2024 12:15 Action required maximegmd:main
fixed docs typos (#1863)
Unit Tests #2404: Commit cb22e50 pushed by lintangsutawika
May 21, 2024 09:56 5m 46s main
fixed docs typos (#1863)
Tasks Modified #2432: Commit cb22e50 pushed by lintangsutawika
May 21, 2024 09:56 16s main