Uuseg — Unicode text segmentation for OCaml
Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.
The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.
Contact: Daniel Bünzli
Uuseg can be installed with
opam install uuseg opam install uutf uuseg # for support on OCaml UTF-X encoded strings
If you don't use
opam consult the
opam file for build
The documentation and API reference is automatically generated by
ocamldoc from the interfaces. It can be consulted online or
odig doc uuseg.
If you installed Uuseg with
opam sample programs are located in
opam config var uuseg:doc.
In the distribution sample programs are located in the
directory of the distribution, they can be built with:
topkg build --tests true