-
Notifications
You must be signed in to change notification settings - Fork 2.6k
Pull requests: EleutherAI/lm-evaluation-harness
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Leverage vllm's
tokenizer_info
endpoint to avoid manual duplication
#3185
opened Jul 25, 2025 by
m-misiura
Loading…
Remove generate_until (multiple_target and doc_to_choice indexing) logic in ConfigurableTask.process_results
#3169
opened Jul 21, 2025 by
baberabb
Loading…
Bugfix: set default SamplingParams based on
generation_config
#3160
opened Jul 18, 2025 by
cuttle-fish-my
Loading…
Feat/add permutation benchmark/task to lm-evaluation-harness
#3157
opened Jul 18, 2025 by
BeeGass
Loading…
Fix
mmlu_continuation
subgroup names to fit Readme and other variants
#3137
opened Jul 11, 2025 by
lamalunderscore
Loading…
Add support for OpenVINO text2text generation models
#3101
opened Jul 3, 2025 by
nikita-savelyevv
•
Draft
feat(api_models): add enable_thinking param in chat_template_kwargs
#3088
opened Jun 27, 2025 by
johnsonafool
Loading…
Refactor ConfigurableTask.process_results into modular helpers
#3085
opened Jun 25, 2025 by
mfisher35
Loading…
[Proposal] Change hyphens in n-shot and n-samples to underscores
#3084
opened Jun 24, 2025 by
kiersten-stokes
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.