PhD Student @ Mila, Université de Montréal
- Montréal, Canada
- nikihowe.com
- @__niki_howe__
Pinned
- reward-hacking-paper (Public, Python): Code for the paper `Defining and Characterizing Reward Hacking`
- AlignmentResearch/scaling-llm-robustness-paper (Public, Python): Code used for the paper `Scaling Trends in Language Model Robustness`