fix: pin `lm-eval<0.4.9.1` for trust_remote_code issue #2168

bhimrazy · 2025-12-02T12:47:45Z

What does this PR do?

Fixes failing test_evaluate.py::test_evaluate_script test on master by pinning lm-eval<0.4.9.1.

ref: ci : https://github.com/Lightning-AI/litgpt/actions/runs/19823757338/job/56877042404?pr=2164

Also unblocks other prs

FAILED tests/test_evaluate.py::test_evaluate_script - ValueError: The repository for EleutherAI/logiqa contains custom code which must be executed to correctly load the dataset. You can inspect the repository content at https://hf.co/datasets/EleutherAI/logiqa.
Please pass the argument `trust_remote_code=True` to allow custom code to be run.

ref: #2102 (previously pin was added here)

Upstream issue: Datasets with loading scripts - no longer supported EleutherAI/lm-evaluation-harness#3171

Some lm-eval task datasets (e.g. EleutherAI/logiqa) require trust_remote_code=True which newer versions don't handle properly. See: EleutherAI/lm-evaluation-harness#3171 Related: #2102

lianakoleva

Agree < is better than != but we should eventually use trust_remote_code where needed to be compatible with newer versions as well.

fix: pin lm-eval<0.4.9.1 for trust_remote_code issue

a02b3b5

Some lm-eval task datasets (e.g. EleutherAI/logiqa) require trust_remote_code=True which newer versions don't handle properly. See: EleutherAI/lm-evaluation-harness#3171 Related: #2102

bhimrazy self-assigned this Dec 2, 2025

bhimrazy requested review from KaelanDt, andyland, k223kim, lantiga, lianakoleva and t-vi as code owners December 2, 2025 12:47

bhimrazy mentioned this pull request Dec 2, 2025

fix: pin lm-eval<0.4.9.1 for trust_remote_code issue #2167

Closed

lianakoleva approved these changes Dec 4, 2025

View reviewed changes

lianakoleva merged commit 299cfa4 into main Dec 4, 2025
44 of 60 checks passed

lianakoleva deleted the fix/pin-lm-eval-trust-remote-code branch December 4, 2025 09:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: pin `lm-eval<0.4.9.1` for trust_remote_code issue #2168

fix: pin `lm-eval<0.4.9.1` for trust_remote_code issue #2168

Uh oh!

bhimrazy commented Dec 2, 2025

Uh oh!

lianakoleva left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: pin lm-eval<0.4.9.1 for trust_remote_code issue #2168

fix: pin lm-eval<0.4.9.1 for trust_remote_code issue #2168

Uh oh!

Conversation

bhimrazy commented Dec 2, 2025

What does this PR do?

Uh oh!

lianakoleva left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: pin `lm-eval<0.4.9.1` for trust_remote_code issue #2168

fix: pin `lm-eval<0.4.9.1` for trust_remote_code issue #2168

lianakoleva left a comment •

edited

Loading