
Fairseq-CO2

This repository shows an example of using CO2, a distributed training method with full communication-computation overlap (see the citation below), within Fairseq.


Requirements and Installation

  • PyTorch version >= 1.10.0
  • Python version >= 3.8
  • For training new models, you'll also need an NVIDIA GPU and NCCL (a quick environment check is sketched after this list)
  • To install fairseq-CO2 and develop locally:
git clone https://github.com/weigao266/fairseq-CO2.git
cd fairseq-CO2
pip install --editable ./
  • The implementation of CO2 is integrated into Fairscale as fairscale-CO2. To install fairscale-CO2 and develop locally:
git clone https://github.com/weigao266/fairscale-CO2.git
cd fairscale-CO2
pip install --editable ./
  • For faster training, install NVIDIA's apex library:
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" \
  --global-option="--deprecated_fused_adam" --global-option="--xentropy" \
  --global-option="--fast_multihead_attn" ./
  • For large datasets, install PyArrow: pip install pyarrow
  • If you use Docker, make sure to increase the shared memory size, either with --ipc=host or --shm-size as command-line options to nvidia-docker run.
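
Before installing, you can verify the prerequisites with a short Python check. This is a minimal sketch based only on the requirements listed in this section; the packaging module is used for a robust version comparison.

import sys
import torch
from packaging import version

# Verify the requirements above: Python >= 3.8, PyTorch >= 1.10.0,
# an NVIDIA GPU, and NCCL for distributed training.
assert sys.version_info >= (3, 8), "Python >= 3.8 is required"
assert version.parse(torch.__version__.split("+")[0]) >= version.parse("1.10.0"), \
    "PyTorch >= 1.10.0 is required"
assert torch.cuda.is_available(), "an NVIDIA GPU is required for training new models"
assert torch.distributed.is_nccl_available(), "NCCL support is required for distributed training"
print("Environment looks suitable for fairseq-CO2.")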

Usage

Run the script run_co2_local.sh to train a GPT-2 (Medium) model with 355M parameters using CO2. The script sets co2_base_algorithm=localsgd and co2_outer_momentum=0.2; a sketch of the underlying wrapper follows the commands below.

cd co2_examples
bash run_co2_local.sh
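
Under the hood, these options configure a distributed data-parallel wrapper from fairscale-CO2. The sketch below is an orientation only: it uses upstream Fairscale's SlowMoDistributedDataParallel (the wrapper the CO2 fork builds on) and maps co2_outer_momentum onto its slowmo_momentum argument, which is an assumption; check the fairscale-CO2 source for the exact co2_-prefixed class and argument names.

import torch
import torch.distributed as dist
from fairscale.experimental.nn.data_parallel import (
    SlowMoDistributedDataParallel,
    SlowmoBaseAlgorithm,
)

dist.init_process_group(backend="nccl")  # one process per GPU, NCCL backend
net = torch.nn.Linear(1024, 1024).cuda()  # stand-in for the 355M GPT-2 model

# Mirror run_co2_local.sh: local SGD as the base algorithm,
# outer (slow) momentum coefficient 0.2.
net = SlowMoDistributedDataParallel(
    net,
    nprocs_per_node=torch.cuda.device_count(),
    slowmo_base_algorithm=SlowmoBaseAlgorithm.LOCALSGD,
    slowmo_momentum=0.2,
)
optimizer = torch.optim.SGD(net.parameters(), lr=0.1)

for _ in range(10):
    optimizer.zero_grad()
    loss = net(torch.randn(8, 1024, device="cuda")).sum()
    loss.backward()
    optimizer.step()
    net.perform_slowmo(optimizer)  # outer-momentum (slow) update step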

Citation

If you find our work useful, please cite the following paper:

@article{sun2024co2,
  title={CO2: Efficient Distributed Training with Full Communication-Computation Overlap},
  author={Sun, Weigao and Qin, Zhen and Sun, Weixuan and Li, Shidi and Li, Dong and Shen, Xuyang and Qiao, Yu and Zhong, Yiran},
  journal={arXiv preprint arXiv:2401.16265},
  year={2024}
}
