Fairseq on Amazon SageMaker

Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks.

This repository shows how to integrate Fairseq into an Amazon SageMaker training job using the PyTorch estimator. Instead of a custom Docker container, this example uses a shell script as the entry point of the PyTorch estimator. The script contains the dependency installation and data preprocessing commands.
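
To make the shell entry point idea concrete, here is a minimal sketch of what such a script could look like. It is not the repository's actual script. The channel names (training, pretrained), the checkpoint file name, the /tmp paths, and the DRY_RUN switch are assumptions made for illustration, and steps such as label generation are omitted. The fairseq commands follow the wav2vec example in the fairseq repository. DRY_RUN defaults to 1 so the sketch only prints the commands it would run.

```shell
#!/bin/bash
set -e

# SageMaker exports SM_CHANNEL_<NAME> and SM_MODEL_DIR inside a training job.
# The channel names and fallback paths below are assumptions for this sketch.
DATA_DIR="${SM_CHANNEL_TRAINING:-/opt/ml/input/data/training}"
PRETRAINED="${SM_CHANNEL_PRETRAINED:-/opt/ml/input/data/pretrained}/wav2vec_small.pt"
MODEL_DIR="${SM_MODEL_DIR:-/opt/ml/model}"
FAIRSEQ_DIR=/tmp/fairseq      # hypothetical checkout location
MANIFEST_DIR=/tmp/manifest    # hypothetical output directory for manifests

# Print the command when DRY_RUN=1 (the default here); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

main() {
    # 1. Dependency installation: install fairseq from source.
    run git clone https://github.com/facebookresearch/fairseq "$FAIRSEQ_DIR"
    run pip install --editable "$FAIRSEQ_DIR"

    # 2. Data preprocessing: build train/valid manifests from the audio files.
    run python "$FAIRSEQ_DIR/examples/wav2vec/wav2vec_manifest.py" "$DATA_DIR" \
        --dest "$MANIFEST_DIR" --ext flac --valid-percent 0.01

    # 3. Fine-tuning: checkpoints go to the directory SageMaker uploads to S3.
    run fairseq-hydra-train \
        task.data="$MANIFEST_DIR" \
        model.w2v_path="$PRETRAINED" \
        checkpoint.save_dir="$MODEL_DIR" \
        --config-dir "$FAIRSEQ_DIR/examples/wav2vec/config/finetuning" \
        --config-name base_100h
}

main
```

Setting DRY_RUN=0 would execute the commands instead of printing them. Keeping installation and preprocessing in the entry point means the stock PyTorch training image can be used unchanged, at the cost of repeating the installation on every job.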

Example notebooks

w2v_finetuning.ipynb: Fine-tune a pre-trained wav2vec 2.0 model on the LibriSpeech dataset

Local Mode

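When using local mode, we recommend running the following script as the startup script of the SageMaker notebook instance. It moves the Docker data directory (data-root) to /home/ec2-user/SageMaker/.container/docker and sets the default shared memory size.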

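#!/bin/bash

# Stop on the first error and print each command as it runs.
set -ex

DAEMON_PATH="/etc/docker"
MEMORY_SIZE=10G

# Check whether daemon.json already defines "data-root" (prints true or false).
FLAG=$(jq 'has("data-root")' "$DAEMON_PATH/daemon.json")

if [ "$FLAG" == true ]; then
    echo "Already revised"
else
    echo "Add data-root and default-shm-size=$MEMORY_SIZE"
    # Keep a backup of the original configuration.
    sudo cp "$DAEMON_PATH/daemon.json" "$DAEMON_PATH/daemon.json.bak"
    # Merge the new keys into the existing configuration and write it back.
    sudo cat "$DAEMON_PATH/daemon.json.bak" | jq --arg shm "$MEMORY_SIZE" '. += {"data-root":"/home/ec2-user/SageMaker/.container/docker","default-shm-size":$shm}' | sudo tee "$DAEMON_PATH/daemon.json" > /dev/null
    # Restart Docker so that the new settings take effect.
    sudo service docker restart
    echo "Docker Restart"
fi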
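
Before restarting Docker, it can help to preview what the merge would produce. The following is a minimal sketch and not part of the repository: preview_daemon_config is a hypothetical helper name, and the sample file content is made up. It applies the same kind of jq merge to a file passed as an argument and prints the result instead of overwriting anything.

```shell
#!/bin/bash
set -e

# Hypothetical helper: print the merged configuration without writing to disk.
# $1: path to a daemon.json file, $2: value for default-shm-size (e.g. 10G).
preview_daemon_config() {
    jq --arg shm "$2" \
        '. += {"data-root":"/home/ec2-user/SageMaker/.container/docker","default-shm-size":$shm}' \
        "$1"
}

# Demonstration on a throwaway file; existing keys are kept by the merge.
SAMPLE=$(mktemp)
echo '{"debug":true}' > "$SAMPLE"
preview_daemon_config "$SAMPLE" 10G
rm -f "$SAMPLE"
```

After the real script has run and Docker has restarted, the output of docker info should show the new location under "Docker Root Dir".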
