Sequence To Sequence GRU for automatic Spelling Correction.

This repository contains code for an auto spell checker built using Pytorch as framework.

We've built encoder-decoder networks using GRU as the building block. Luong Attention mechanism is used to speeden up the training process and improve accuracy.

Characters (not words) were provided as input to the architecture while training and testing.
The model was trained on free cloud GPU available on google colab.

We used a billion word datasetreleased by Google. Artificial noise was injected generate spelling errors so as to train the model. The noise is the simulated spelling mistakes and the model tries to learn how to correct the input by comparing the output to the original text. The dataset can be found here

The trained weights file is available here

The model achieved 92% Test set accuracy on training on free Colab GPU within a few hours

Major Modifications

Used GRU instead of RNN as building block.
Introduced Attention Mechanism

Acknowledgment

This work is an extension to this repo by Tal Weiss.

Feel free to contact me in case of any query.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
LICENSE		LICENSE
README.md		README.md
deep_spell_GRU_attention_test.ipynb		deep_spell_GRU_attention_test.ipynb
deep_spell_GRU_attention_train.ipynb		deep_spell_GRU_attention_train.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sequence To Sequence GRU for automatic Spelling Correction.

Major Modifications

Acknowledgment

About

Releases

Packages

Languages

License

gaushh/Deep-Spelling

Folders and files

Latest commit

History

Repository files navigation

Sequence To Sequence GRU for automatic Spelling Correction.

Major Modifications

Acknowledgment

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages