Unicode text normalization for OCaml
Module type
Class type
  • Unicode 15.0.0 support.

v14.0.0 2021-09-17 Zagreb

  • Unicode 14.0.0 support.

v13.0.0 2020-03-11 La Forclaz (VS)

  • Unicode 13.0.0 support.

  • Require OCaml >= 4.03.0.

v12.0.0 2019-03-08 La Forclaz (VS)

  • Unicode 12.0.0 support.

v11.0.0 2018-06-06 Lausanne

  • Unicode 11.0.0 support.

  • Fix bug when normalizer flushes at the end of stream: a spurious
    Await was returned before the final End.

v10.0.0 2017-06-20 Cambridge (UK)

  • Unicode 10.0.0 support

  • Fix bug in canonical composition algorithm (used by NFC and NFKC forms).
    Thanks to Stephen Dolan for the report.

  • Fix regression of Uucp.ccc introduced by f4c0363 which went into

v2.0.1 2016-03-07 La Forclaz (VS)

  • OCaml 4.05.0 compatibility (removal of Uchar.dump).

v2.0.0 2016-11-23 Zagreb

  • Support for Unicode 9.0.0.

  • OCaml standard library Uchar.t support.

    • Removes and substitutes type Uunf.uchar = int by the (abstract)
      Uchar.t type. Uchar.{of,to}_int allows to recover the previous

    • Removes Uunf.is_scalar_value. Uchar.is_valid can be used instead.

  • Safe string support.

  • Build depend on topkg.

  • Relicense from BSD3 to ISC.

v1.0.0 2015-06-17 Cambridge (UK)

  • Updated for Unicode 8.0.0

  • topkg support

  • Uunf.add now eventually returns `End whenever the latter was
    encoded and the character stream was entirely output. In most existing
    programs this will simply entail to add `End to the existing
    `Await case in pattern matches on the result of Uunf.add.

  • Adds the Uunf_string library that allows to directly normalize UTF-X
    OCaml encoded strings. This library depends on Uutf.

  • Rewrote the utility unftrip to use Cmdliner which is now
    an optional dependency of the package. The cli interface is
    incompatible with previous versions. Support for random
    Unicode scalar value generation was removed, use utftrip from
    the Uutf package for that.

  • Rewrote the module's data generation to essentially match what is done
    in Uucp. Much less ugly, no source file sedding.

v0.9.3 2014-06-16 Cambridge (UK)

  • Updated for Unicode 7.0.0

v0.9.2 2013-10-01 Lausanne

  • Updated for Unicode 6.3.0

  • OPAM friendly workflow and drop OASIS support.

v0.9.1 2013-01-04 La Forclaz (VS)

  • Updated for Unicode 6.2.0.

  • Fix Uunf.is_scalar_value always returning false.

  • Make the module completely safe for the client.

  • Change command line help of unftrip.

v0.9.0 2012-09-07 Lausanne

First release.