This repository is private.
All pages are served over SSL and all pushing and pulling is done over SSH.
No one may fork, clone, or view it unless they are added as a member.
Every repository with this icon (
) is private.
Every repository with this icon (
This repository is public.
Anyone may fork, clone, or view it.
Every repository with this icon (
) is public.
Every repository with this icon (
whatlang /
| name | age | message | |
|---|---|---|---|
| |
README | ||
| |
generate_models.rb | ||
| |
lid.rb | ||
| |
models/ |
README
NB: Requires ruby1.9. A module to identify which of any one of a number of human languages a given text is in. We use a simple similarity measure between frequency counts of bigrams to compare an unknown text to a set of models of known languages. The language models are built with samples from: http://www.unicode.org/udhr/downloads.html which is copied to models/ .








