Humans can reason compositionally whilst grounding language utterances to the real world. Recent benchmarks like ReaSCAN use navigation tasks grounded in a grid world to assess whether neural models exhibit similar capabilities. In this work, we propose simple and well-motivated modifications to a multi-modal transformer model that outperforms specialized architectures on ReaSCAN and a modified version of gSCAN. Moreover, we design a series of experiments to understand the underlying challenges in these tasks. We also demonstrate that the ReaSCAN split testing depth generalization is unfair, because the training data does not contain sufficient information for the models to correctly choose amongst competing hypotheses. On experimenting with a modified training distribution, we show that simple transformer-based models can successfully generalize to commands with greater depths. We introduce a new, simpler grounded compositional generalization task called RefEx to study how transformers ground and compose. Additionally, we conduct experiments to interpret, examine, and provide explicit constructions that show the transformer's computations corresponding to grounding and composition in the RefEx task setting.
- Compatible with Python 3.6
- Dependencies can be installed using code/reascan/requirements.txt
Install VirtualEnv using the following (optional):
$ [sudo] pip install virtualenv
Create and activate your virtual environment (optional):
$ virtualenv -p python3 venv
$ source venv/bin/activate
Install all the required packages:
at code/reascan/:
$ pip install -r requirements.txt
For the reascan module:
Download and extract the ReaSCAN dataset into code/reascan/data/ReaSCAN-v1.1 from https://reascan.github.io/
Then, to preprocess the data, run the following at code/reascan/src/utils/:
$ python preprocess.py --dataset reascan
The full set of supported arguments is listed in the respective args.py
file. Here, we illustrate training a model on reascan (usage is similar for refexp):
at code/reascan/:
$ python -m src.main --mode train --test_split custom_comp --train_fname train.json --val_fname dev_comp_3500.json --load_config default_12_6_enc_dec_layers_interleave_co_self_reascan.json --run_name run_1 --batch_size 32 --gpu 0 --lr 0.0001 --epochs 100 --seed 3420
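Since hyperparameters come from args.py defaults overridden by the JSON config passed via --load_config, a quick way to inspect a config before launching a run is to load it as a plain dict. This is only a sketch: the helper name `load_config` and the assumption that the config is a flat JSON object of hyperparameters are ours, not verified against the repo's actual schema.

```python
import json

def load_config(path):
    """Load a JSON config file into a dict of hyperparameters.

    Assumes the config is a flat JSON object (an assumption for
    illustration; check the actual files shipped with the repo).
    """
    with open(path) as f:
        return json.load(f)

# Hypothetical usage (path relative to code/reascan/; adjust as needed):
# cfg = load_config("default_12_6_enc_dec_layers_interleave_co_self_reascan.json")
# print(sorted(cfg.keys()))
```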
Note: Jupyter notebooks for generating custom splits based on ReaSCAN, gSCAN, and GSRR datasets are in code/reascan/
If you use our data or code, please cite our work:
@misc{sikarwar2022transformers,
doi = {10.48550/ARXIV.2210.12786},
url = {https://arxiv.org/abs/2210.12786},
author = {Sikarwar, Ankur and Patel, Arkil and Goyal, Navin},
keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences},
title = {When Can Transformers Ground and Compose: Insights from Compositional Generalization Benchmarks},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}
For any clarification, comments, or suggestions please contact Ankur or Arkil.