-
Notifications
You must be signed in to change notification settings - Fork 1
Pull requests: criticalml-uw/TamperBench
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[low-priority] mmlu_pro: Allow using LLM-as-judge as parser
#101
opened Feb 24, 2026 by
tomtseng
Loading…
eval: added lm-eval
evaluation
Adds or modifies evaluation
#36
opened Oct 14, 2025 by
psyonp
Loading…
eval: GPQA Evaluation
evaluation
Adds or modifies evaluation
#30
opened Sep 22, 2025 by
MKowal2
Loading…
attack: Added refusal ablation attack
attack
Adds or modifies attacks
#27
opened Sep 2, 2025 by
NayeemaNonta
Loading…
attack: added Wanda Pruning (attack)
attack
Adds or modifies attacks
#26
opened Aug 29, 2025 by
esveee
Loading…
attack: added latent perturbation attack
attack
Adds or modifies attacks
#23
opened Aug 27, 2025 by
psyonp
Loading…
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.