A natural language processing project that detects the language of a text based on the frequency of bigrams and trigrams.

himanish532/Language-Detector

This project detects the language of a given text. The program is first trained on sample data for each language. Once trained, it can be run on any test data. The algorithm is designed for bigrams and trigrams.
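The idea can be illustrated with a minimal sketch. It is not the code in bigrams.py: it assumes character-level n-grams, add-one smoothing, and tiny inline training strings, and the function names (char_ngrams, train, log_score, detect) are hypothetical.

```python
import math
from collections import Counter


def char_ngrams(text, n):
    """Return all overlapping character n-grams of the lowercased text."""
    text = text.lower()
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def train(text, n):
    """Count n-gram frequencies in the training text for one language."""
    counts = Counter(char_ngrams(text, n))
    return counts, sum(counts.values())


def log_score(text, model, n):
    """Sum of log-probabilities of the text's n-grams under one model.

    Add-one smoothing (an assumption of this sketch) keeps unseen
    n-grams from producing a zero probability.
    """
    counts, total = model
    vocab = len(counts) + 1  # one extra slot for unseen n-grams
    return sum(
        math.log((counts[g] + 1) / (total + vocab))
        for g in char_ngrams(text, n)
    )


def detect(text, models, n):
    """Return the language whose model gives the text the highest score."""
    return max(models, key=lambda lang: log_score(text, models[lang], n))


# Toy training data standing in for the real training files.
TRAINING = {
    "en": "the cat sat on the mat and the dog ate the food in the house",
    "es": "el gato duerme en la casa y el perro come la comida en la cocina",
}

bigram_models = {lang: train(text, 2) for lang, text in TRAINING.items()}
trigram_models = {lang: train(text, 3) for lang, text in TRAINING.items()}

print(detect("the dog and the cat", bigram_models, 2))
print(detect("el perro y el gato", trigram_models, 3))
```

With real data, the inline strings would be replaced by the contents of the training files, and each file in the test directory would be passed to detect in turn.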

To get the result, run the script with the two training files and the test data directory:

$ python bigrams.py data/train/en/all_en.txt data/train/es/all_es.txt data/test/
