Translator

English to Malayalam Sentence Translator

Model Summary

The model takes English Sentences as the input and uses a 256-unit LSTM network to convert the sentence to Malayalam. The input and output are 3D lists with shape = (1, maximum length of sentence, number of words), the optimizer used is adam, and the loss calculated is using categorical cross-entropy. The model uses early stopping and model checkpointing to get the model with the best validation accuracy.

Steps Involved

Extract English and Malayalam sentences from the input files.
Create a pandas dataframe with the sentences and convert them to lowercase. Also, prepend "START_ " and append " _END" for the Malayalam sentences.
Iterate through the sentences to calculate the maximum length and the number of unique words separately for the 2 languages and create a dictionary to map the words to indexes and vice-versa.
Split the data into training, validation, and test sets and create numpy arrays with the shape = (1, maximum length of sentence, number of words).
Write the data into TF Records and save the files along with the dictionaries to a local folder.
Read the TF Records and parse the data into the different train, validation, and test datasets.
Create a model with 2 LSTM layers having 256 units and set up Model checkpointing and early stopping.
Train the model and plot the loss and accuracy using matplotlib.
Use the trained model to convert the English sentences in the test dataset and compare them with the expected results to verify the accuracy.

Future Steps

Gather more English and Malayalam sentences for improving accuracy.
Clean the data thoroughly to ensure accurate translations.
Fine-tune the hyper-parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Code		Code
Dataset		Dataset
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Translator

Model Summary

Steps Involved

Future Steps

About

Releases

Packages

Languages

License

SauravSJK/Translator

Folders and files

Latest commit

History

Repository files navigation

Translator

Model Summary

Steps Involved

Future Steps

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages