Skip to content

Embeddable C++ Unicode library offering utf-8 and utf-16 encoding, general category info, simple lowercasing and uppercasing and titlecasing, normalization forms and combining marks stripping.

License

Notifications You must be signed in to change notification settings

praveenmunagapati/unilib

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

UniLib: C++11 Embeddable Unicode Library

Build Status

UniLib is an embeddable C++11 Unicode library under MPL 2.0 license. It is versioned using Semantic Versioning.

If currently offers:

  • utf-8 and utf-16 encodings
  • Unicode General Category info
  • simple lowercasing, titlecasing and uppercasing
  • Unicode normalization forms
  • efficient stripping of combining marks

It is tested on the following compilers and platforms:

  • Linux using gcc
  • Windows using tdm-gcc
  • Windows using Visual C++ 2013
  • OS X using clang

Copyright 2015 Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic.

UniLib repository http://github.com/ufal/unilib is hosted on GitHub.

About

Embeddable C++ Unicode library offering utf-8 and utf-16 encoding, general category info, simple lowercasing and uppercasing and titlecasing, normalization forms and combining marks stripping.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 93.8%
  • HTML 4.1%
  • Prolog 1.7%
  • Makefile 0.4%