package orsetto
Install
dune-project
Dependency
Authors
Maintainers
Sources
sha512=9b654edb663ae697563f150824047052f3b1bf760398f24bce6350553f031f73c46b6337239a1acd871e61238597ea92046809e3358290ff14d6ba671b449085
doc/orsetto.ucs/Ucs_db_aux/index.html
Module Ucs_db_aux
Unicode character set properties.
Overview
This module provides an interface to the Unicode character set database.
Types
An alias for the abstract type representing a map of all Unicode code points to the value of its corresponding property.
The property index type. The full Unicode character database is large, and the portion required by the Orsetto Ucs library itself is small, so values of this type provide an abstraction of the relevant portion of the database available to the application.
type utyp += The core population of the extensible universal property type.
Functions and Constants
val create_index : (string * 'a) list -> 'a indexUse create_index s to compose an index from a sequence of pairs.
Use query m c to resolve the value property m for character c.
val search_index : 'a index -> string -> 'a optionUse search_index idx nym to query the index idx for the entry named by nym. Index keys are loosely matched.
Use search_property idx nym to query the property database index idx for the property named nym. Property names are loosely matched.
Use require_property idx nym to query the property database index idx for the property named nym. Property names are loosedly matched. Raises Not_found if no property named nym is indexed.
type blk = [ | `ASCII| `Adlam| `Aegean_Numbers| `Ahom| `Alchemical| `Alphabetic_PF| `Anatolian_Hieroglyphs| `Ancient_Greek_Music| `Ancient_Greek_Numbers| `Ancient_Symbols| `Arabic| `Arabic_Ext_A| `Arabic_Ext_B| `Arabic_Ext_C| `Arabic_Math| `Arabic_PF_A| `Arabic_PF_B| `Arabic_Sup| `Armenian| `Arrows| `Avestan| `Balinese| `Bamum| `Bamum_Sup| `Bassa_Vah| `Batak| `Bengali| `Bhaiksuki| `Block_Elements| `Bopomofo| `Bopomofo_Ext| `Box_Drawing| `Brahmi| `Braille| `Buginese| `Buhid| `Byzantine_Music| `CJK| `CJK_Compat| `CJK_Compat_Forms| `CJK_Compat_Ideographs| `CJK_Compat_Ideographs_Sup| `CJK_Ext_A| `CJK_Ext_B| `CJK_Ext_C| `CJK_Ext_D| `CJK_Ext_E| `CJK_Ext_F| `CJK_Ext_G| `CJK_Ext_H| `CJK_Radicals_Sup| `CJK_Strokes| `CJK_Symbols| `Carian| `Caucasian_Albanian| `Chakma| `Cham| `Cherokee| `Cherokee_Sup| `Chess_Symbols| `Chorasmian| `Compat_Jamo| `Control_Pictures| `Coptic| `Coptic_Epact_Numbers| `Counting_Rod| `Cuneiform| `Cuneiform_Numbers| `Currency_Symbols| `Cypriot_Syllabary| `Cypro_Minoan| `Cyrillic| `Cyrillic_Ext_A| `Cyrillic_Ext_B| `Cyrillic_Ext_C| `Cyrillic_Ext_D| `Cyrillic_Sup| `Deseret| `Devanagari| `Devanagari_Ext| `Devanagari_Ext_A| `Diacriticals| `Diacriticals_Ext| `Diacriticals_For_Symbols| `Diacriticals_Sup| `Dingbats| `Dives_Akuru| `Dogra| `Domino| `Duployan| `Early_Dynastic_Cuneiform| `Egyptian_Hieroglyphs| `Egyptian_Hieroglyph_Format_Controls| `Elbasan| `Elymaic| `Emoticons| `Enclosed_Alphanum| `Enclosed_Alphanum_Sup| `Enclosed_CJK| `Enclosed_Ideographic_Sup| `Ethiopic| `Ethiopic_Ext| `Ethiopic_Ext_A| `Ethiopic_Ext_B| `Ethiopic_Sup| `Geometric_Shapes| `Geometric_Shapes_Ext| `Georgian| `Georgian_Ext| `Georgian_Sup| `Glagolitic| `Glagolitic_Sup| `Gothic| `Grantha| `Greek| `Greek_Ext| `Gujarati| `Gunjala_Gondi| `Gurmukhi| `Half_And_Full_Forms| `Half_Marks| `Hangul| `Hanifi_Rohingya| `Hanunoo| `Hatran| `Hebrew| `High_PU_Surrogates| `High_Surrogates| `Hiragana| `IDC| `IPA_Ext| `Ideographic_Symbols| `Imperial_Aramaic| `Indic_Number_Forms| `Indic_Siyaq_Numbers| `Inscriptional_Pahlavi| `Inscriptional_Parthian| `Jamo| `Jamo_Ext_A| `Jamo_Ext_B| `Javanese| `Kaithi| `Kaktovik_Numerals| `Kana_Ext_A| `Kana_Ext_B| `Kana_Sup| `Kanbun| `Kangxi| `Kannada| `Katakana| `Katakana_Ext| `Kayah_Li| `Kawi| `Kharoshthi| `Khitan_Small_Script| `Khmer| `Khmer_Symbols| `Khojki| `Khudawadi| `Lao| `Latin_1_Sup| `Latin_Ext_A| `Latin_Ext_Additional| `Latin_Ext_B| `Latin_Ext_C| `Latin_Ext_D| `Latin_Ext_E| `Latin_Ext_F| `Latin_Ext_G| `Lepcha| `Letterlike_Symbols| `Limbu| `Linear_A| `Linear_B_Ideograms| `Linear_B_Syllabary| `Lisu| `Lisu_Sup| `Low_Surrogates| `Lycian| `Lydian| `Mahajani| `Mahjong| `Makasar| `Malayalam| `Mandaic| `Manichaean| `Marchen| `Masaram_Gondi| `Math_Alphanum| `Math_Operators| `Mayan_Numerals| `Medefaidrin| `Meetei_Mayek| `Meetei_Mayek_Ext| `Mende_Kikakui| `Meroitic_Cursive| `Meroitic_Hieroglyphs| `Miao| `Misc_Arrows| `Misc_Math_Symbols_A| `Misc_Math_Symbols_B| `Misc_Pictographs| `Misc_Symbols| `Misc_Technical| `Modi| `Modifier_Letters| `Modifier_Tone_Letters| `Mongolian| `Mongolian_Sup| `Mro| `Multani| `Music| `Myanmar| `Myanmar_Ext_A| `Myanmar_Ext_B| `NB| `NKo| `Nabataean| `Nag_Mundari| `Nandinagari| `New_Tai_Lue| `Newa| `No_Block_Assigned| `Number_Forms| `Nushu| `Nyiakeng_Puachue_Hmong| `OCR| `Ogham| `Ol_Chiki| `Old_Hungarian| `Old_Italic| `Old_North_Arabian| `Old_Permic| `Old_Persian| `Old_Sogdian| `Old_South_Arabian| `Old_Turkic| `Old_Uyghur| `Oriya| `Ornamental_Dingbats| `Osage| `Osmanya| `Ottoman_Siyaq_Numbers| `PUA| `Pahawh_Hmong| `Palmyrene| `Pau_Cin_Hau| `Phags_Pa| `Phaistos| `Phoenician| `Phonetic_Ext| `Phonetic_Ext_Sup| `Playing_Cards| `Psalter_Pahlavi| `Punctuation| `Rejang| `Rumi| `Runic| `Samaritan| `Saurashtra| `Sharada| `Shavian| `Shorthand_Format_Controls| `Siddham| `Sinhala| `Sinhala_Archaic_Numbers| `Small_Forms| `Small_Kana_Ext| `Sogdian| `Sora_Sompeng| `Soyombo| `Specials| `Sundanese| `Sundanese_Sup| `Sup_Arrows_A| `Sup_Arrows_B| `Sup_Arrows_C| `Sup_Math_Operators| `Sup_PUA_A| `Sup_PUA_B| `Sup_Punctuation| `Sup_Symbols_And_Pictographs| `Super_And_Sub| `Sutton_SignWriting| `Syloti_Nagri| `Symbols_And_Pictographs_Ext_A| `Symbols_For_Legacy_Computing| `Syriac| `Syriac_Sup| `Tagalog| `Tagbanwa| `Tags| `Tai_Le| `Tai_Tham| `Tai_Viet| `Tai_Xuan_Jing| `Takri| `Tamil| `Tamil_Sup| `Tangsa| `Tangut| `Tangut_Components| `Tangut_Sup| `Telugu| `Thaana| `Thai| `Tibetan| `Tifinagh| `Tirhuta| `Toto| `Transport_And_Map| `UCAS| `UCAS_Ext| `UCAS_Ext_A| `Ugaritic| `VS| `VS_Sup| `Vai| `Vedic_Ext| `Vertical_Forms| `Vithkuqi| `Wancho| `Warang_Citi| `Yezidi| `Yi_Radicals| `Yi_Syllables| `Yijing| `Zanabazar_Square| `Znamenny_Music
]Unicode code block
val show_blk : blk -> stringString representation
type gc = [ | `C| `Cc| `Cf| `Cs| `Co| `Cn| `L| `LC| `Lu| `Ll| `Lt| `Lm| `Lo| `M| `Mn| `Mc| `Me| `N| `Nd| `Nl| `No| `P| `Pc| `Pd| `Ps| `Pe| `Pi| `Pf| `Po| `S| `Sm| `Sc| `Sk| `So| `Z| `Zs| `Zl| `Zp
]The general category property value type.
val show_gc : gc -> stringString representation
val show_qc : qc -> stringString representation
type script = [ | `Adlm| `Aghb| `Ahom| `Arab| `Armi| `Armn| `Avst| `Bali| `Bamu| `Bass| `Batk| `Beng| `Bhks| `Bopo| `Brah| `Brai| `Bugi| `Buhd| `Cakm| `Cans| `Cari| `Cham| `Cher| `Chrs| `Copt| `Cpmn| `Cprt| `Cyrl| `Deva| `Diak| `Dogr| `Dsrt| `Dupl| `Egyp| `Elba| `Elym| `Ethi| `Geor| `Glag| `Gong| `Gonm| `Goth| `Gran| `Grek| `Gujr| `Guru| `Hang| `Hani| `Hano| `Hatr| `Hebr| `Hira| `Hluw| `Hmng| `Hmnp| `Hrkt| `Hung| `Ital| `Java| `Kali| `Kana| `Kawi| `Khar| `Khmr| `Khoj| `Kits| `Knda| `Kthi| `Lana| `Laoo| `Latn| `Lepc| `Limb| `Lina| `Linb| `Lisu| `Lyci| `Lydi| `Mahj| `Maka| `Mand| `Mani| `Marc| `Medf| `Mend| `Merc| `Mero| `Mlym| `Modi| `Mong| `Mroo| `Mtei| `Mult| `Mymr| `Nagm| `Nand| `Narb| `Nbat| `Newa| `Nkoo| `Nshu| `Ogam| `Olck| `Orkh| `Orya| `Osge| `Osma| `Ougr| `Palm| `Pauc| `Perm| `Phag| `Phli| `Phlp| `Phnx| `Plrd| `Prti| `Qaai| `Rjng| `Rohg| `Runr| `Samr| `Sarb| `Saur| `Sgnw| `Shaw| `Shrd| `Sidd| `Sind| `Sinh| `Sogd| `Sogo| `Sora| `Soyo| `Sund| `Sylo| `Syrc| `Tagb| `Takr| `Tale| `Talu| `Taml| `Tang| `Tavt| `Telu| `Tfng| `Tglg| `Thaa| `Thai| `Tibt| `Tirh| `Tnsa| `Toto| `Ugar| `Vaii| `Vith| `Wara| `Wcho| `Xpeo| `Xsux| `Yezi| `Yiii| `Zanb| `Zinh| `Zyyy| `Zzzz
]Unicode script identifier
val show_script : script -> stringString representation
module Quick : sig ... endThis module contains internal fast-path functions for property query.