This repository contains code for the paper: Why Transformers Succeed and Fail at Compositional Generalization: Composition Equivalence and Module Coverage.
Our implementation builds on the code provided by Ramesh et al. (2024): https://github.com/rahul13ramesh/compositional_capabilities/
To reproduce the results (illustrative Python sketches of each step are given after this list):
- Generate data for different train-test split strategies.
./bash_scripts/data.sh
- Train model for a given train-test split strategy.
./bash_scripts/train_model.sh
- Evaluate trained model on a given test distribution.
./bash_scripts/evaluate_model.sh
- There is an option to do equivalence class analysis and final layer representation analysis during evaluation. See
./bash_scripts/evaluate_model.sh
for more details.
- Plot the compositional generalization performance of direct and step-by-step models for absolute and relative positional embeddings. This script generates the plots presented in the paper.
./bash_scripts/plotting.sh
- Generate t-SNE plots (Figures 12 and 13) by running the equivalence class and representation analysis during evaluation and then running the following bash script.
./bash_scripts/equivalence_analysis.sh
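
A minimal sketch of what the data-generation step might involve. This is not the repository's actual data generator; the module functions, composition length, split sizes, and file names below are illustrative. The idea is simply that some compositions of modules are held out of training so the test distribution contains unseen compositions.

```python
# Illustrative only: compose simple "module" functions and hold out some compositions.
import itertools, json, random

# Hypothetical modules: each maps an integer token to another integer token.
MODULES = {
    "inc": lambda x: (x + 1) % 10,
    "dbl": lambda x: (2 * x) % 10,
    "neg": lambda x: (-x) % 10,
    "sq":  lambda x: (x * x) % 10,
}

def apply_composition(names, x):
    """Apply the named modules left to right to the input x."""
    for name in names:
        x = MODULES[name](x)
    return x

# All length-2 compositions; hold a few out so the test set contains unseen compositions.
compositions = list(itertools.permutations(MODULES, 2))
random.seed(0)
random.shuffle(compositions)
held_out = set(compositions[:4])                     # never seen during training
train = [c for c in compositions if c not in held_out]

def make_examples(comps, n_per_comp=8):
    rows = []
    for comp in comps:
        for _ in range(n_per_comp):
            x = random.randrange(10)
            rows.append({"composition": list(comp), "input": x,
                         "output": apply_composition(comp, x)})
    return rows

with open("train.json", "w") as f:
    json.dump(make_examples(train), f)
with open("test_unseen.json", "w") as f:
    json.dump(make_examples(sorted(held_out)), f)
```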
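A minimal sketch of the training step, assuming PyTorch; the repository's model architecture, tokenization, and hyperparameters will differ. It shows a small causal transformer with absolute positional embeddings trained by next-token prediction on stand-in token sequences.

```python
# Illustrative only: tiny causal transformer with next-token prediction.
import torch
import torch.nn as nn

vocab, d_model, seq_len = 32, 64, 16
embed = nn.Embedding(vocab, d_model)
pos = nn.Embedding(seq_len, d_model)                  # absolute positional embeddings
layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
encoder = nn.TransformerEncoder(layer, num_layers=2)
head = nn.Linear(d_model, vocab)

params = (list(embed.parameters()) + list(pos.parameters())
          + list(encoder.parameters()) + list(head.parameters()))
opt = torch.optim.AdamW(params, lr=3e-4)
mask = nn.Transformer.generate_square_subsequent_mask(seq_len)   # causal attention mask

for step in range(100):
    tokens = torch.randint(0, vocab, (8, seq_len))    # stand-in for real task sequences
    positions = torch.arange(seq_len).unsqueeze(0)
    hidden = encoder(embed(tokens) + pos(positions), mask=mask)
    logits = head(hidden[:, :-1])                     # predict the next token at each position
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, vocab), tokens[:, 1:].reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
```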
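A minimal sketch of what the evaluation step computes; the `model` and `batches` interfaces here are assumptions, not the repository's API. It measures exact-match accuracy of greedy predictions on a held-out test distribution such as unseen compositions.

```python
# Illustrative only: exact-match accuracy over a held-out test distribution.
import torch

@torch.no_grad()
def exact_match_accuracy(model, batches):
    """batches yields (inputs, targets) LongTensors; model returns logits of shape (B, T, vocab)."""
    correct = total = 0
    model.eval()
    for inputs, targets in batches:
        preds = model(inputs).argmax(dim=-1)                  # greedy decoding per position
        correct += (preds == targets).all(dim=-1).sum().item()  # the whole sequence must match
        total += targets.size(0)
    return correct / max(total, 1)
```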
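A minimal sketch of the final-layer representation analysis option, assuming a PyTorch model; which module is passed as `final_layer` is an assumption and depends on the model definition. A forward hook captures last-layer hidden states so they can later be grouped by equivalence class.

```python
# Illustrative only: collect final-layer hidden states with a forward hook.
import torch

def collect_final_layer_states(model, final_layer, batches):
    """Return a (num_examples, d_model) tensor of last-position hidden states."""
    captured = []
    hook = final_layer.register_forward_hook(
        lambda module, inputs, output: captured.append(output.detach()))
    with torch.no_grad():
        for inputs, _ in batches:
            model(inputs)                                 # hook fires during the forward pass
    hook.remove()
    return torch.cat([h[:, -1, :] for h in captured])     # keep the last token position
```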
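A minimal sketch of the plotting step; the `results.json` layout and file names are hypothetical, and the paper's actual figures come from ./bash_scripts/plotting.sh. It compares direct and step-by-step models under absolute and relative positional embeddings.

```python
# Illustrative only: bar chart of compositional generalization accuracy.
import json
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical layout: {"direct": {"absolute": acc, "relative": acc}, "step-by-step": {...}}
with open("results.json") as f:
    results = json.load(f)

settings = ["absolute", "relative"]
x = np.arange(len(settings))
for offset, model_name in zip((-0.2, 0.2), ("direct", "step-by-step")):
    accs = [results[model_name][s] for s in settings]
    plt.bar(x + offset, accs, width=0.4, label=model_name)
plt.xticks(x, settings)
plt.ylabel("compositional generalization accuracy")
plt.legend()
plt.savefig("generalization_comparison.png", dpi=200)
```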
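A minimal sketch of the t-SNE step, assuming scikit-learn and matplotlib; the `.npy` file names are placeholders for whatever the evaluation analysis saves. It embeds final-layer representations in 2D and colors points by equivalence class.

```python
# Illustrative only: 2D t-SNE of final-layer representations, colored by equivalence class.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

reps = np.load("final_layer_reps.npy")       # hypothetical file: (n_examples, d_model)
labels = np.load("equiv_class_ids.npy")      # hypothetical file: one integer class id per example

coords = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(reps)
scatter = plt.scatter(coords[:, 0], coords[:, 1], c=labels, s=5, cmap="tab20")
plt.legend(*scatter.legend_elements(), title="equivalence class", fontsize="small")
plt.title("t-SNE of final-layer representations")
plt.savefig("tsne_equivalence_classes.png", dpi=200)
```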