This repository contains the code to train the dual-stream transformer proposed in my work, along with its variations: dynamic gating, feature enhancements, auxiliary objectives, and data curriculum strategies.
Download the training data following the [BabyLM Challenge instructions](https://babylm.github.io) and place it in a folder named ./data/.
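For reference, the expected layout is a single data folder next to the code. The split names below are assumptions based on the public BabyLM release, so check the challenge page for the exact archive contents:

```
mkdir -p data
# After extracting the BabyLM archives you should end up with
# something like (directory names illustrative):
#   data/train_10M/  data/dev/  data/test/
```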
To initialise and train a model, run:
```
cd dual-stream-transformer
python main.py configs/<config.yaml>
```

The repository is organised as follows:

- config/ - All experiments defined in my work
- model/ - Base model architecture and variations
- trainers/ - Training code
- dataset/ - Dataset code
- evaluation/ - Gate-value testing against linguistic properties and statistical correlation code
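As a rough illustration only, a config file might look like the sketch below; every key and value here is hypothetical, so consult the files in config/ for the actual schema:

```yaml
# Hypothetical config sketch; the real keys live in config/.
model:
  variant: soft_gate_per_token   # illustrative variation name
  d_model: 512
training:
  batch_size: 32
  lr: 3.0e-4
  epochs: 10
data:
  path: ./data/
```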
The model/ and config/ folders contain implementations for each variation (a minimal gating sketch follows the list):
- Soft gate per feature
- Soft gate per token
- Hard gate per feature
- Hard gate per token
- No gate
- DyIntra modulation (text, image, cross-attention)
- FiLM (text, image, cross-attention)
- Channel attention
- CLIP and LCG contrastive learning
- MLP encoder
- No encoder
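As a minimal sketch of the gating idea (not the repository's actual implementation), the snippet below shows soft gating at the two granularities, assuming each stream produces hidden states of shape (batch, seq_len, d_model); a hard gate would instead binarize the gate values, e.g. with a straight-through estimator:

```python
import torch
import torch.nn as nn

class SoftGate(nn.Module):
    """Fuses the text stream with the other (e.g. visual) stream via a
    learned sigmoid gate, either one value per token or per feature."""

    def __init__(self, d_model: int, per_token: bool = False):
        super().__init__()
        # The gate is predicted from the concatenated streams; its size is
        # 1 (a scalar per token) or d_model (a vector per feature).
        self.proj = nn.Linear(2 * d_model, 1 if per_token else d_model)

    def forward(self, text: torch.Tensor, other: torch.Tensor) -> torch.Tensor:
        g = torch.sigmoid(self.proj(torch.cat([text, other], dim=-1)))
        # With per_token=True, g has shape (batch, seq, 1) and broadcasts
        # over the feature dimension; otherwise it gates each feature.
        return g * text + (1.0 - g) * other

# Toy usage: two streams of shape (batch=2, seq=8, d_model=64).
text, image = torch.randn(2, 8, 64), torch.randn(2, 8, 64)
print(SoftGate(64, per_token=True)(text, image).shape)  # torch.Size([2, 8, 64])
```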
The evaluation code tests gate values against the following linguistic properties (see the correlation sketch after the list):
- Part of speech
- Concreteness
- Imageability
- Familiarity
- Age-of-acquisition
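Illustratively (this is not the repository's evaluation code), the statistical test amounts to correlating gate values with a word-level norm; the numbers below are made up, and scipy's Spearman correlation is one reasonable choice:

```python
from scipy.stats import spearmanr

# Hypothetical inputs: the mean gate value per word type and the matching
# norm rating (e.g. concreteness on a 1-5 scale), aligned by index.
gate_values  = [0.82, 0.31, 0.57, 0.12, 0.91]
concreteness = [4.9, 2.1, 3.4, 1.5, 4.7]

rho, p = spearmanr(gate_values, concreteness)
print(f"Spearman rho={rho:.3f}, p={p:.3g}")
```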
To evaluate the model on the BabyLM Challenge benchmarks, clone the [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) repository, copy dual-stream-transformer/evaluation/lm_harness_wrapper.py into lm-evaluation-harness/lm_eval/models, and then follow the library's instructions to run the evaluation.
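The steps look roughly as follows; the final command is only a placeholder, since the model name and arguments depend on how lm_harness_wrapper.py registers itself with the harness:

```
git clone https://github.com/EleutherAI/lm-evaluation-harness
cp dual-stream-transformer/evaluation/lm_harness_wrapper.py \
   lm-evaluation-harness/lm_eval/models/
cd lm-evaluation-harness
pip install -e .
# Run as documented by the library, e.g.:
# lm_eval --model <wrapper_model_name> --model_args <args> --tasks <task>
```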