GitHub - dayyass/neural-machine-translation: Pipeline for training Stanford Seq2Seq Neural Machine Translation using PyTorch.

About

Pipeline for training Stanford Seq2Seq Neural Machine Translation using PyTorch.
Model trained on IWSLT'15 English-Vietnamese.
State-of-the-art on IWSLT'15 English-Vietnamese reference.

Usage

First, install dependencies:

# clone repo
git clone https://github.com/dayyass/neural_machine_translation.git

# install dependencies
cd neural_machine_translation
pip install -r requirements.txt

Data Format

Parallel corpora for Machine Translation.
More about it here.

Vocabulary

Before train any models, you need to create vocabularies for two languages.
More about it here.

Training

Train Neural Machine Translation:

python train.py

At the beginning of the script there is a list of parameters (written in uppercase) for training that can be changed.
Validation performed on every epoch, testing performed after the last epoch.

Validation

NotImplementedError: opened issue.

Inference

NotImplementedError: opened issue.

Models

List of implemented models:

Seq2SeqModel
Seq2SeqAttnModel

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
data		data
vocab		vocab
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
dataset.py		dataset.py
language.py		language.py
metrics.py		metrics.py
network.py		network.py
requirements.txt		requirements.txt
train.py		train.py
train_utils.py		train_utils.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Usage

Data Format

Vocabulary

Training

Validation

Inference

Models

About

Languages

dayyass/neural-machine-translation

Folders and files

Latest commit

History

Repository files navigation

About

Usage

Data Format

Vocabulary

Training

Validation

Inference

Models

About

Topics

Resources

Stars

Watchers

Forks

Languages