Pinned Loading
Repositories
Showing 10 of 81 repositories
- Edge-Pruning Public
[NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".
- ShortcutGrammar Public
EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
- unintentional-unalignment Public
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
- tree-of-thought-llm Public
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models