package uuseg
Install
Dune Dependency
Authors
Maintainers
Sources
sha512=355139aee2a72baddf3d811e522948456147546ee946b6eca20f57711865770d4b8d32ea01a7338b8e6cdedb4423ee65cee387704bb9c0c057bcbd65012679b8
Description
Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.
The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.
Uuseg is distributed under the ISC license. It depends on Uucp.
Homepage: http://erratique.ch/software/uuseg
README
Uuseg — Unicode text segmentation for OCaml
v16.0.0
Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.
The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.
Uuseg is distributed under the ISC license. It depends on Uucp.
Homepage: http://erratique.ch/software/uuseg
Installation
Uuseg can be installed with opam
:
opam install uuseg
opam install uuseg cmdliner uutf # For the usegtrip tool.
If you don't use opam
consult the opam
file for build instructions.
Documentation
The documentation can be consulted online or via odig doc uuseg
.
Questions are welcome but better asked on the OCaml forum than on the issue tracker.
Sample programs
The usegtrip
tool segments text provided on standard input.
See also the doc examples.
Dependencies (5)
-
uucp
>= "16.0.0" & < "17.0.0"
-
topkg
build & >= "1.0.3"
-
ocamlbuild
build
-
ocamlfind
build
-
ocaml
>= "4.14.0"
Dev Dependencies
None
Used by (10)
- fuzzy_compare
-
inquire
= "0.2.1"
- matita
-
notty
< "0.2.3"
-
ocamlformat
>= "0.10" & < "0.25.1"
- ocamlformat-lib
- ocamlformat-mlx-lib
-
ocamlformat-rpc
< "0.21.0"
- slug
-
zed
>= "3.2.0"