Solving the task of machine translation (from English to Vietnamese) with two models: a regular Seq2Seq network and a Seq2Seq network with global, dot-product attention. Both follow the encoder-decoder architecture.
Preprocessed data courtesy of the Stanford NLP group.
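For context, the dot-product score used by global (Luong-style) attention can be sketched as below. The function name, shapes, and toy tensors are illustrative only, not this repo's actual module:

```python
import torch
import torch.nn.functional as F

def dot_attention(decoder_state, encoder_outputs):
    """Global dot-product attention (Luong et al., 2015).

    decoder_state:   (batch, hidden)          current decoder hidden state
    encoder_outputs: (batch, src_len, hidden) all encoder hidden states
    Returns the context vector (batch, hidden) and weights (batch, src_len).
    """
    # score(h_t, h_s) = h_t . h_s for every source position s
    scores = torch.bmm(encoder_outputs, decoder_state.unsqueeze(2)).squeeze(2)
    weights = F.softmax(scores, dim=1)  # normalize over source positions
    context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)
    return context, weights

# Toy usage with made-up shapes
h_t = torch.randn(2, 8)       # batch of 2, hidden size 8
enc = torch.randn(2, 5, 8)    # source length 5
ctx, attn = dot_attention(h_t, enc)
print(ctx.shape, attn.shape)  # torch.Size([2, 8]) torch.Size([2, 5])
```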
TODO:
- [ ] Attention-based network: Compute the exact loss for each sequence (i.e., don't include padded positions in the loss). Look into utilizing `torch.nn.utils.rnn.pack_padded_sequence` (see the sketch after this list)
- [ ] Regular network: Bring it up to parity with the attention network; files are in `.src/TODO/`
- [ ] Training: Run for more epochs
- [ ] Training: Implement K-fold CV
- [ ] Training: Hyperparameter tuning
- [ ] Testing: Implement model evaluation for user inputs and test data
- [ ] Miscellaneous: More documentation + better organization
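As a reference for the padding item above, here is a minimal sketch of two ways to exclude padded positions from the loss, one via `ignore_index` and one via `pack_padded_sequence`. All shapes, names, and the PAD index are assumptions for illustration, not the repo's actual code:

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence

# Hypothetical decoder outputs and gold targets; PAD_IDX marks padding.
PAD_IDX = 0
batch_size, max_len, vocab_size = 4, 7, 100

logits = torch.randn(batch_size, max_len, vocab_size)    # decoder outputs
targets = torch.randint(1, vocab_size, (batch_size, max_len))
lengths = torch.tensor([7, 5, 4, 2])                     # true sequence lengths
for i, l in enumerate(lengths):
    targets[i, l:] = PAD_IDX                             # pad the tails

# Option 1: mask padded positions with ignore_index.
criterion = nn.CrossEntropyLoss(ignore_index=PAD_IDX)
loss_masked = criterion(logits.reshape(-1, vocab_size), targets.reshape(-1))

# Option 2: pack logits and targets so padded steps are dropped entirely
# (lengths must be sorted descending when enforce_sorted=True).
packed_logits = pack_padded_sequence(logits, lengths, batch_first=True)
packed_targets = pack_padded_sequence(targets, lengths, batch_first=True)
loss_packed = nn.functional.cross_entropy(packed_logits.data, packed_targets.data)

print(loss_masked.item(), loss_packed.item())  # both average over non-pad tokens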
Attention-based network [01/01/2021]: