Sequence-to-Sequence Neural Machine Translation with Attention, implemented in PyTorch
Frameworks and Libraries: PyTorch, NumPy
Platform: Google Colaboratory (GPU)
- An English-to-French translation dataset of 137,861 English sentences and their 137,861 corresponding French translations
- Removing Outliers
- Tokenization and Removing Punctuation
- Padding the tokenized sentences to a fixed length (preprocessing sketched after this list)
- Splitting the dataset into Training (80%), Validation (10%), and Testing (10%) sets (split sketched below)
- Implementing the Encoder in PyTorch: an Embedding layer followed by GRU cells (sketched below)
- Implementing the Decoder with an Attention mechanism in PyTorch: an Embedding layer, GRU cells, and Dropout (sketched below)
- Implementing the sequence-to-sequence architecture in PyTorch with batch processing (sketched below)
- Implementing training and validation routines (sketched below)
- Hyperparameter tuning
- Using a cross-validation technique
- Best validation accuracy: 92.01% after 25 epochs
- Testing the trained model on the unseen test set resulted in an accuracy of 90.13%
- Implementing a prediction function that translates an input English sentence into French using the trained model (greedy-decoding sketch below)
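Illustrative sketches of the main steps follow; they are minimal examples, not the repository's exact code. The preprocessing sketch below covers tokenization, punctuation removal, and padding; the helper names (`tokenize`, `build_vocab`, `pad`) and the special tokens are assumptions.

```python
import re

def tokenize(sentence):
    """Lower-case, strip punctuation/digits, and split on whitespace."""
    sentence = sentence.lower()
    sentence = re.sub(r"[^a-zàâçéèêëîïôûùüÿñæœ' ]", " ", sentence)
    return sentence.split()

def build_vocab(tokenized_sentences):
    """Assign an integer id to every token; 0 is reserved for padding."""
    vocab = {"<pad>": 0, "<sos>": 1, "<eos>": 2, "<unk>": 3}
    for tokens in tokenized_sentences:
        for tok in tokens:
            vocab.setdefault(tok, len(vocab))
    return vocab

def pad(sequences, pad_id=0):
    """Right-pad every id sequence to the length of the longest one."""
    max_len = max(len(s) for s in sequences)
    return [s + [pad_id] * (max_len - len(s)) for s in sequences]
```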
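A minimal sketch of the 80% / 10% / 10% split, assuming the data is held as a list of (English, French) sentence pairs; `split_dataset` and the fixed seed are illustrative choices.

```python
import numpy as np

def split_dataset(pairs, seed=0):
    """Shuffle (English, French) pairs and split them 80% / 10% / 10%."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(len(pairs))
    n_train = int(0.8 * len(pairs))
    n_val = int(0.1 * len(pairs))
    train = [pairs[i] for i in indices[:n_train]]
    val = [pairs[i] for i in indices[n_train:n_train + n_val]]
    test = [pairs[i] for i in indices[n_train + n_val:]]
    return train, val, test
```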
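A sketch of an encoder matching the description above (embedding layer plus GRU cells); the class name and layer sizes are illustrative, not taken from the repository.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Embeds the English token ids and runs them through a GRU."""
    def __init__(self, input_size, embed_size, hidden_size, n_layers=1):
        super().__init__()
        self.embedding = nn.Embedding(input_size, embed_size, padding_idx=0)
        self.gru = nn.GRU(embed_size, hidden_size, n_layers, batch_first=True)

    def forward(self, src, hidden=None):
        # src: (batch, src_len) tensor of token ids
        embedded = self.embedding(src)                 # (batch, src_len, embed_size)
        outputs, hidden = self.gru(embedded, hidden)   # (batch, src_len, hidden_size)
        return outputs, hidden
```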
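A sketch of an attention decoder along the lines described above (embedding, GRU, dropout). The repository follows the batch-attention design of the referenced implementations; the dot-product attention used here is one simple variant and should be read as an assumption rather than the exact mechanism.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttnDecoder(nn.Module):
    """One decoding step: embed the previous French token, attend over the
    encoder outputs, and predict a distribution over the next token."""
    def __init__(self, output_size, embed_size, hidden_size, dropout=0.1):
        super().__init__()
        self.embedding = nn.Embedding(output_size, embed_size, padding_idx=0)
        self.dropout = nn.Dropout(dropout)
        self.gru = nn.GRU(embed_size + hidden_size, hidden_size, batch_first=True)
        self.out = nn.Linear(hidden_size, output_size)

    def forward(self, prev_token, hidden, encoder_outputs):
        # prev_token: (batch, 1); hidden: (1, batch, hidden);
        # encoder_outputs: (batch, src_len, hidden)
        embedded = self.dropout(self.embedding(prev_token))            # (batch, 1, embed)
        # Dot-product attention between the current decoder state and every
        # encoder output position.
        scores = torch.bmm(encoder_outputs, hidden[-1].unsqueeze(2))   # (batch, src_len, 1)
        weights = F.softmax(scores, dim=1)
        context = torch.bmm(weights.transpose(1, 2), encoder_outputs)  # (batch, 1, hidden)
        output, hidden = self.gru(torch.cat([embedded, context], dim=2), hidden)
        logits = self.out(output.squeeze(1))                           # (batch, output_size)
        return logits, hidden, weights
```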
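A sketch of how the two modules above could be tied into a batched sequence-to-sequence model with teacher forcing, assuming single-layer GRUs so the encoder's final hidden state can initialise the decoder directly; the `Seq2Seq` name and the teacher-forcing ratio are illustrative.

```python
import random
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    """Runs the encoder once, then decodes the whole batch one time step at a
    time, optionally feeding back the ground-truth token (teacher forcing)."""
    def __init__(self, encoder, decoder):
        super().__init__()
        self.encoder = encoder
        self.decoder = decoder

    def forward(self, src, trg, teacher_forcing_ratio=0.5):
        # src: (batch, src_len), trg: (batch, trg_len); trg[:, 0] is <sos>
        encoder_outputs, hidden = self.encoder(src)
        token = trg[:, 0:1]                                   # start with <sos>
        outputs = []
        for t in range(1, trg.size(1)):
            logits, hidden, _ = self.decoder(token, hidden, encoder_outputs)
            outputs.append(logits)
            if random.random() < teacher_forcing_ratio:
                token = trg[:, t:t + 1]                       # ground-truth token
            else:
                token = logits.argmax(1, keepdim=True)        # model's own prediction
        return torch.stack(outputs, dim=1)                    # (batch, trg_len - 1, vocab)
```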
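A sketch of a combined training/validation routine over `DataLoader` batches of padded id tensors; the function name, the masked token-accuracy computation, and the use of `nn.CrossEntropyLoss(ignore_index=0)` are assumptions. Passing an optimizer switches the function into training mode; omitting it gives a no-gradient validation pass.

```python
import torch
import torch.nn as nn

def run_epoch(model, loader, criterion, optimizer=None, device="cuda"):
    """One pass over a DataLoader of (src, trg) id tensors; weights are only
    updated when an optimizer is passed, so the same function serves both
    training and validation."""
    training = optimizer is not None
    model.train(training)
    total_loss, correct, tokens = 0.0, 0, 0
    with torch.set_grad_enabled(training):
        for src, trg in loader:
            src, trg = src.to(device), trg.to(device)
            logits = model(src, trg, teacher_forcing_ratio=0.5 if training else 0.0)
            loss = criterion(logits.reshape(-1, logits.size(-1)), trg[:, 1:].reshape(-1))
            if training:
                optimizer.zero_grad()
                loss.backward()
                optimizer.step()
            mask = trg[:, 1:] != 0                            # ignore <pad> positions
            correct += ((logits.argmax(-1) == trg[:, 1:]) & mask).sum().item()
            tokens += mask.sum().item()
            total_loss += loss.item()
    return total_loss / len(loader), correct / tokens

# Example usage (hypothetical names):
# criterion = nn.CrossEntropyLoss(ignore_index=0)
# train_loss, train_acc = run_epoch(model, train_loader, criterion, optimizer)
# val_loss, val_acc = run_epoch(model, val_loader, criterion)
```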
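A sketch of a greedy-decoding prediction function in the spirit of the last list item, reusing the `tokenize` helper and vocabularies from the preprocessing sketch and the `Seq2Seq` model from above; all names are illustrative, and any beam search the repository may use is not shown.

```python
import torch

def translate(model, sentence, en_vocab, fr_vocab, max_len=25, device="cuda"):
    """Greedily decode the French translation of one English sentence."""
    model.eval()
    id_to_word = {i: w for w, i in fr_vocab.items()}
    ids = [en_vocab.get(tok, en_vocab["<unk>"]) for tok in tokenize(sentence)]
    src = torch.tensor([ids], device=device)                  # batch of size 1
    with torch.no_grad():
        encoder_outputs, hidden = model.encoder(src)
        token = torch.tensor([[fr_vocab["<sos>"]]], device=device)
        words = []
        for _ in range(max_len):
            logits, hidden, _ = model.decoder(token, hidden, encoder_outputs)
            token = logits.argmax(1, keepdim=True)
            if token.item() == fr_vocab["<eos>"]:
                break
            words.append(id_to_word[token.item()])
    return " ".join(words)
```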
Based on the following implementations, with enhancements and bug fixes:
- PyTorch Tutorial
- spro/practical-pytorch
- @AuCson/PyTorch-Batch-Attention-Seq2seq