Language detection model

This model uses an LSTM for character-level language detection. Given a sentence of text, the characters are fed into the model one at a time, and the output after the last character determines which of five languages the sentence was written in.
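To make the data flow concrete, here is a minimal sketch of the idea in plain Julia. It is not the code in model.jl: the LSTM cell is written by hand rather than taken from a library, the weights are random and untrained, and the alphabet, the language codes, and every function name (`onehot`, `init_params`, `lstm_step`, `predict`) are assumptions chosen for illustration.

```julia
using Random

# Hypothetical alphabet and language codes, for illustration only.
const ALPHABET = ['a':'z'; ' '; '_']   # '_' stands in for any other character
const LANGS = ["en", "it", "fr", "es", "da"]

sigmoid(x) = 1 / (1 + exp(-x))

function softmax(v)
    e = exp.(v .- maximum(v))   # subtract the max for numerical stability
    return e ./ sum(e)
end

# One-hot encode a character; characters outside ALPHABET map to the last slot.
function onehot(ch::Char)
    v = zeros(length(ALPHABET))
    i = findfirst(==(ch), ALPHABET)
    v[i === nothing ? length(ALPHABET) : i] = 1.0
    return v
end

# Random (untrained) parameters for one LSTM cell plus a linear output layer.
function init_params(rng, nin, nhidden, nout)
    return (Wx = 0.1 .* randn(rng, 4 * nhidden, nin),
            Wh = 0.1 .* randn(rng, 4 * nhidden, nhidden),
            b  = zeros(4 * nhidden),
            Wo = 0.1 .* randn(rng, nout, nhidden),
            bo = zeros(nout))
end

# One LSTM step: update hidden state h and cell state c from input x.
function lstm_step(p, h, c, x)
    H = length(h)
    z = p.Wx * x .+ p.Wh * h .+ p.b
    i = sigmoid.(z[1:H])          # input gate
    f = sigmoid.(z[H+1:2H])       # forget gate
    g = tanh.(z[2H+1:3H])         # candidate cell update
    o = sigmoid.(z[3H+1:4H])      # output gate
    c_new = f .* c .+ i .* g
    h_new = o .* tanh.(c_new)
    return h_new, c_new
end

# Feed the sentence character by character; only the final hidden state
# is passed to the output layer, giving one probability per language.
function predict(p, sentence::AbstractString)
    H = size(p.Wh, 2)
    h, c = zeros(H), zeros(H)
    for ch in lowercase(sentence)
        h, c = lstm_step(p, h, c, onehot(ch))
    end
    return softmax(p.Wo * h .+ p.bo)
end

rng = MersenneTwister(0)
params = init_params(rng, length(ALPHABET), 16, length(LANGS))
probs = predict(params, "the quick brown fox")
println(LANGS[argmax(probs)])   # arbitrary here, since the weights are untrained
```

With untrained weights the predicted language is meaningless; the sketch only shows the shape of the computation. Training would adjust the parameters so that the final output matches the labelled language of each sentence.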

First run scrape.jl to download a Wikipedia data set. model.jl contains the actual model and training code.

Training

cd text/lang-detection
julia scrape.jl
julia --project model.jl

References