Skip to content
Detect the text language automatically using a bigram model, Support Vector Machines, and Artifical Neural Networks. The model is trained using the WiLI-2018 benchmark dataset, and the highest accuracy achieved on the test dataset is 99.7% with paragraph text.
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Generated Datasets
Trained Bigram Models
Trained NN Models
README.md
bigram_model.py Updated code. Dec 5, 2019
fetch_data.py
naive_bayes.py
neural_network.py

README.md

language-identification

Detect the text language automatically using a bigram model, Support Vector Machines, and Artifical Neural Networks. The model is trained using the WiLI-2018 benchmark dataset, and the highest accuracy achieved on the test dataset is 99.7% with paragraph text.

Live demo: http:diptanu.pythonanywhere.com

You can’t perform that action at this time.