detect and normalize encodings of text
C++ JavaScript
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
COPYING
README.md
charset-detector.cc
package.json
unicoder.js
wscript

README.md

Unicoder

Unicoder is a simple node.js library for normalizing the encoding of text.

Unicoder takes a Buffer containing a bytestring representing a document with an unknown encoding, then guesses the encoding using a binding to ICU, and finally reencodes the Buffer into a JavaScript String.

Installation:

make install

Prerequisites:

  • node-iconv
  • libicu (including header files)