anaisbetts/node-cld

 
 

node-cld


Language detection for JavaScript. Based on the CLD2 (Compact Language Detector) library from Google.

Highly optimized for space and speed. Runs about 10x faster than other libraries. Detects over 160 languages. Full test coverage. Runs on Linux, OS X, and Windows.

Installation

$ npm install cld

Linux users: make sure g++ 4.8 or later is installed. If upgrading is not an option, node-cld 2.4.4 should still install with an older g++ build (npm install cld@2.4.4).

Examples

Simple

require('cld').detect('This is a language recognition example', function(err, result) {
  // Detection can fail, so check err before using result
  if (err) return console.error(err);
  console.log(result);
});

Advanced

var text    = 'Това е пример за разпознаване на Български език';
var options = {
  isHTML       : false,
  languageHint : 'BULGARIAN',
  encodingHint : 'ISO_8859_5',
  tldHint      : 'bg',
  httpHint     : 'bg'
};

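require('cld').detect(text, options, function(err, result) {
  // Hints only guide detection; it can still fail, so check err first
  if (err) return console.error(err);
  console.log(result);
});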
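
Both examples above use a Node-style callback. If promises are more convenient, the call can be wrapped as in the sketch below. The detectAsync helper is hypothetical and not part of cld. It takes the detect function as an argument and supports only the two call forms shown above, detect(text, callback) and detect(text, options, callback).

```javascript
// Hypothetical helper, not part of cld: wraps a callback-style
// detect function in a Promise. The detect function is passed in
// explicitly so the sketch does not depend on how the module is loaded.
function detectAsync(detect, text, options) {
  return new Promise(function (resolve, reject) {
    function done(err, result) {
      if (err) {
        reject(err);
      } else {
        resolve(result);
      }
    }

    // Mirror the two call forms from the examples above.
    if (options) {
      detect(text, options, done);
    } else {
      detect(text, done);
    }
  });
}

// Usage, assuming the cld module is installed:
// detectAsync(require('cld').detect, 'This is a language recognition example')
//   .then(function (result) { console.log(result); })
//   .catch(function (err) { console.error(err); });
```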

Options

isHTML

Set to true if the string contains HTML tags

languageHint

Pass a LANGUAGES key or value as a hint

encodingHint

Pass an ENCODINGS value as a hint

tldHint

Pass top level domain as a hint

httpHint

Pass an HTTP "Content-Language" value as a hint
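
When the text comes from a fetched web page, the hints can often be derived from request metadata. The sketch below shows one way to assemble an options object. The buildHints helper and the shape of its page argument are hypothetical. It also assumes that httpHint takes a language tag such as 'bg', as in the Advanced example.

```javascript
// Hypothetical helper, not part of cld: derives detect() options
// from metadata about a fetched page.
function buildHints(page) {
  var options = { isHTML: Boolean(page.isHTML) };

  // tldHint: the last label of the hostname, e.g. "www.example.bg" -> "bg".
  if (page.hostname) {
    var labels = page.hostname.toLowerCase().split('.');
    options.tldHint = labels[labels.length - 1];
  }

  // httpHint: assumed to be a language tag taken from an HTTP header.
  if (page.contentLanguage) {
    options.httpHint = page.contentLanguage.trim().toLowerCase();
  }

  return options;
}

var hints = buildHints({
  isHTML: true,
  hostname: 'www.example.bg',
  contentLanguage: 'bg'
});
console.log(hints); // { isHTML: true, tldHint: 'bg', httpHint: 'bg' }
```

The resulting object can be passed as the second argument to detect. A single-label hostname such as localhost would yield a meaningless tldHint, so real code may want to filter those out.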

Warning

Once the module has been installed, the underlying C source remains in the deps/cld folder and continues to occupy considerable space. If you are under severe space constraints, you can delete this folder and reclaim more than 100 MB.

Copyright

Copyright 2011-2015, Blagovest Dachev.

License

Apache 2
