uunf

Unicode text normalization for OCaml
README

v14.0.0

Uunf is an OCaml library for normalizing Unicode text. It supports all
Unicode normalization forms. The library is independent from any
IO mechanism or Unicode text data structure and it can process text
without a complete in-memory representation.

Uunf has no dependency. It may optionally depend on Uutf for
support on OCaml UTF-X encoded strings. It is distributed under the
ISC license.

Home page: http://erratique.ch/software/uunf

Installation

Uunf can be installed with opam:

opam install uunf
opam install uutf uunf # for support on OCaml UTF-X encoded strings

If you don't use opam consult the opam file for build
instructions and a complete specification of the dependencies.

Documentation

The documentation and API reference can be consulted online or
via odig doc uunf.

Sample programs

If you installed Uuseg with opam sample programs are located in
the directory opam config var uuseg:doc.

A few test programs are in the test directory of the distribution.

  • test.native tests the library with the Unicode Normalization Test
    file available from:

    http://www.unicode.org/Public/%%UNICODEVERSION%%/ucd/NormalizationTest.txt

    Nothing should fail.

  • test_string.native tests the UTF-X OCaml string support.

  • unftrip.native inputs Unicode text on stdin and rewrites it on
    stdout in a given normalization form. Invoke with --help for more
    information. Depends on uutf
    and cmdliner.

Install
Published
17 Sep 2021
Maintainers
Sources
uunf-14.0.0.tbz
sha512=9aac01483abb8a8a5d68832d7f7692909d61559f2c9b0284c1da293f8115d1100df26e9e4cf7280bda6499ba866f5da2c72c4c21ca99a1d594b29d000e9bb051
Dependencies
uucd
dev & >= "14.0.0" & < "15.0.0"
topkg
build & >= "1.0.3"
ocaml
>= "4.03.0"
Reverse Dependencies