Skip to content

Latest commit

 

History

History
8 lines (6 loc) · 367 Bytes

README.md

File metadata and controls

8 lines (6 loc) · 367 Bytes

This project will detect the language of the given data. With the help of training data the program will be trained to understand the language. Once the program is trained it can be used on any test data. The algorithm is degigned for bigrams and trigrams.

Command to get the result: $ python bigrams.py data/train/en/all_en.txt data/train/es/all_es.txt data/test/