Statistical language detector
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.


                    trilang -- Statistical language detector

trilang is a statistical language detector. To detect the language of a text it
divides it into trigrams (blocks of three letters) and compares their frequency
with reference values in its database. The database is initially empty and has
to be filled by learning from texts with known languages.

The statistical approach has been described in the article "A Statistical
Approach to the Spam Problem" by Gary Robinson, 1 Mar 2003, Linux Journal, .


Please email me with any comments or questions you have:
Hermann Schwarting <>