package uucp

  1. Overview
  2. Docs

Break properties.

These properties are for the Unicode text segmentation and line breaking algorithm.

References

Line break

type line = [
  1. | `AI
  2. | `AL
  3. | `B2
  4. | `BA
  5. | `BB
  6. | `BK
  7. | `CB
  8. | `CJ
  9. | `CL
  10. | `CM
  11. | `CP
  12. | `CR
  13. | `EX
  14. | `GL
  15. | `H2
  16. | `H3
  17. | `HL
  18. | `HY
  19. | `ID
  20. | `IN
  21. | `IS
  22. | `JL
  23. | `JT
  24. | `JV
  25. | `LF
  26. | `NL
  27. | `NS
  28. | `NU
  29. | `OP
  30. | `PO
  31. | `PR
  32. | `QU
  33. | `RI
  34. | `SA
  35. | `SG
  36. | `SP
  37. | `SY
  38. | `WJ
  39. | `XX
  40. | `ZW
]

The type for line breaks.

val pp_line : Format.formatter -> line -> unit

pp_line ppf l prints an unspecified representation of l on ppf.

val line : uchar -> line

line u is u's line break property.

Grapheme cluster break

type grapheme_cluster = [
  1. | `CN
  2. | `CR
  3. | `EX
  4. | `L
  5. | `LF
  6. | `LV
  7. | `LVT
  8. | `PP
  9. | `RI
  10. | `SM
  11. | `T
  12. | `V
  13. | `XX
]

The type for grapheme cluster breaks.

val pp_grapheme_cluster : Format.formatter -> grapheme_cluster -> unit

pp_grapheme_cluster ppf g prints an unspecified representation of g on ppf.

val grapheme_cluster : uchar -> grapheme_cluster

grapheme_cluster u is u's grapheme cluster break property.

Word break

type word = [
  1. | `CR
  2. | `DQ
  3. | `EX
  4. | `Extend
  5. | `FO
  6. | `HL
  7. | `KA
  8. | `LE
  9. | `LF
  10. | `MB
  11. | `ML
  12. | `MN
  13. | `NL
  14. | `NU
  15. | `RI
  16. | `SQ
  17. | `XX
]

The type for word breaks.

val pp_word : Format.formatter -> word -> unit

pp_grapheme_cluster ppf g prints an unspecified representation of g on ppf.

val word : uchar -> word

world u is u's word break property.

Sentence break

type sentence = [
  1. | `AT
  2. | `CL
  3. | `CR
  4. | `EX
  5. | `FO
  6. | `LE
  7. | `LF
  8. | `LO
  9. | `NU
  10. | `SC
  11. | `SE
  12. | `SP
  13. | `ST
  14. | `UP
  15. | `XX
]

The type for sentence breaks.

val pp_sentence : Format.formatter -> sentence -> unit

pp_grapheme_cluster ppf g prints an unspecified representation of g on ppf.

val sentence : uchar -> sentence

sentence u is u's sentence break property.

Low level interface

module Low : sig ... end

Low level interface.

OCaml

Innovation. Community. Security.