Proposal-Lgr-Latin-20180531

This document is mechanically formatted from the XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.

LGR Version 3
Date 2018-05-31
Language(s) und-Latn
Scope(s) domain: .
Unicode Version 6.3.0

Table of Contents

  1. Description
  2. Repertoire
  3. Variant Sets
  4. Classes, Rules and Actions
    1. Character Classes
    2. Whole label evaluation and context rules
    3. Actions
  5. Table of References

Description

Repertoire

Summary

Number of elements in repertoire 220
Number of ranges in repertoire 0
Number of code point sequences 22

Repertoire by Code Point

The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name columns are extracted from the Unicode Character Database. Where the comment in the original LGR is equal to the character name, it has been suppressed.

For any code point or sequence for which a variant is defined, the link to the associated variant set, or if mapped to itself, the variant type of that mapping is provided in the Variants column.

# Code Point Glyph Script Name Tags Required Context Variants Comment References
1 U+0061 a Latin LATIN SMALL LETTER A Many languages
2 U+0061 U+0331 a ̱ Latin LATIN SMALL LETTER A COMBINING MACRON BELOW Nuer (4) [146], [129]
3 U+0062 b Latin LATIN SMALL LETTER B Many languages
4 U+0063 c Latin LATIN SMALL LETTER C Many languages
5 U+0064 d Latin LATIN SMALL LETTER D Many languages
6 U+0065 e Latin LATIN SMALL LETTER E Many languages
7 U+0065 U+0331 e ̱ Latin LATIN SMALL LETTER E COMBINING MACRON BELOW Nuer (4) [146]
8 U+0066 f Latin LATIN SMALL LETTER F Many languages
9 U+0067 g Latin LATIN SMALL LETTER G Many languages
10 U+0067 U+0303 g ̃ Latin LATIN SMALL LETTER G COMBINING TILDE Guarani (1) [142], [143]
11 U+0067 U+0304 g ̄ Latin LATIN SMALL LETTER G COMBINING MACRON Raga (Hano) (3) [200]
12 U+0068 h Latin LATIN SMALL LETTER H Many languages
13 U+0069 i Latin LATIN SMALL LETTER I Many languages
14 U+0069 U+0331 i ̱ Latin LATIN SMALL LETTER I COMBINING MACRON BELOW Nuer (4) [146]
15 U+006A j Latin LATIN SMALL LETTER J Many languages
16 U+006B k Latin LATIN SMALL LETTER K Many languages
17 U+006C l Latin LATIN SMALL LETTER L Many languages
18 U+006D m Latin LATIN SMALL LETTER M Many languages
19 U+006D U+0327 m ̧ Latin LATIN SMALL LETTER M COMBINING CEDILLA Marshallese (1) [213], [136], [214]
20 U+006E n Latin LATIN SMALL LETTER N Many languages
21 U+006E U+0304 n ̄ Latin LATIN SMALL LETTER N COMBINING MACRON Raga (Hano) (3), Marshallese (1) [200], [213], [136]
22 U+006E U+0308 n ̈ Latin LATIN SMALL LETTER N COMBINING DIAERESIS Malagasy (1) [230]
23 U+006F o Latin LATIN SMALL LETTER O Many languages
24 U+006F U+0327 o ̧ Latin LATIN SMALL LETTER O COMBINING CEDILLA Marshallese (1) [136]
25 U+006F U+0331 o ̱ Latin LATIN SMALL LETTER O COMBINING MACRON BELOW Nuer (4) [146], [129]
26 U+0070 p Latin LATIN SMALL LETTER P Many languages
27 U+0071 q Latin LATIN SMALL LETTER Q Many languages
28 U+0072 r Latin LATIN SMALL LETTER R Many languages
29 U+0072 U+0303 r ̃ Latin LATIN SMALL LETTER R COMBINING TILDE Hausa (2) [147]
30 U+0073 s Latin LATIN SMALL LETTER S Many languages
31 U+0074 t Latin LATIN SMALL LETTER T Many languages
32 U+0075 u Latin LATIN SMALL LETTER U Many languages
33 U+0076 v Latin LATIN SMALL LETTER V Many languages
34 U+0077 w Latin LATIN SMALL LETTER W Many languages
35 U+0078 x Latin LATIN SMALL LETTER X Many languages
36 U+0079 y Latin LATIN SMALL LETTER Y Many languages
37 U+007A z Latin LATIN SMALL LETTER Z Many languages
38 U+00DF ß Latin LATIN SMALL LETTER SHARP S German (1) [119]
39 U+00E0 à Latin LATIN SMALL LETTER A WITH GRAVE Italian (1), Galician (2), Wolof (4) [130], [131], [106], [132]
40 U+00E1 á Latin LATIN SMALL LETTER A WITH ACUTE Spanish (1), Czech (1), Icelandic (1), Faroese (2), Kirundi (1), Chuukese (2), Galician (2), Lule Sámi (2), Northern Sámi (2) [100], [101], [102], [103], [104], [105], [106], [107], [108]
41 U+00E2 â Latin LATIN SMALL LETTER A WITH CIRCUMFLEX Vietnamese (1), Romanian (1), Skolt Sami (2), Kirundi (1), French (1), Galician (2), West Frisian (1), Friulian (4), Xavante (4) [109], [110], [113], [104], [114], [106], [115], [116], [117]
42 U+00E3 ã Latin LATIN SMALL LETTER A WITH TILDE Umbundu (3), Guarani (1), Nauruan (3), Khoekhoe (4) [141], [142], [143], [144], [145]
43 U+00E4 ä Latin LATIN SMALL LETTER A WITH DIAERESIS German (1), Finnish (1), Turkmen (1), Estonian (1), Swedish (1), Lule Sámi (2), Yapese (2), Dinka (4), Kaqchikel (4), Bashkir (4), Alsatian (5), Nuer (4) [119], [120], [121], [122], [123], [107], [124], [125], [126], [127], [128], [129]
44 U+00E5 å Latin LATIN SMALL LETTER A WITH RING ABOVE Danish (1), Finnish (1), Chamorro (1), Swedish (1), Lule Sámi (2) [139], [120], [140], [123], [107]
45 U+00E6 æ Latin LATIN SMALL LETTER AE Danish (1), Icelandic (1), Faroese (2) [139], [102], [103]
46 U+00E7 ç Latin LATIN SMALL LETTER C WITH CEDILLA Turkish (1), Turkmen (1), Kurdish (2), French (1), Azerbaijani (1), Basque (1), Galician (2), Friulian (4), Bashkir (4) [157], [121], [158], [114], [159], [160], [161], [162], [116], [127]
47 U+00E8 è Latin LATIN SMALL LETTER E WITH GRAVE French (1), Italian (1), Afrikaans (1), Kirundi (1), Haitian Creole (1) [114], [130], [175], [104], [182], [183]
48 U+00E9 é Latin LATIN SMALL LETTER E WITH ACUTE French (1), Italian (1), Spanish (1), Czech (1), Icelandic (1), Kirundi (1), Chuukese (2), Galician (2), Wolof (4), Xavante (4), West Frisian (2) [114], [130], [100], [101], [102], [104], [105], [162], [132], [117], [171]
49 U+00EA ê Latin LATIN SMALL LETTER E WITH CIRCUMFLEX French (1), Tswana (1), Afrikaans (1), Vietnamese (1), Kurdish (2), Kirundi (1), West Frisian (1), Friulian (4) [114], [173], [174], [175], [109], [158], [104], [115], [116]
50 U+00EB ë Latin LATIN SMALL LETTER E WITH DIAERESIS Afrikaans (1), Kirundi (1), Albanian (1), French (1), Chuukese (2), Uyghur (2), Yapese (2), Wolof (4), Drehu (4), Kaqchikel (4), West Frisian (2), Nuer (4) [175], [104], [176], [177], [114], [178], [179], [124], [132], [180], [126], [171], [129]
51 U+00EC ì Latin LATIN SMALL LETTER I WITH GRAVE Italian (1), Kirundi (1) [130], [206], [207], [208]
52 U+00ED í Latin LATIN SMALL LETTER I WITH ACUTE Spanish (1), Czech (1), Icelandic (1), Faroese (2), Kirundi (1), Galician (2), Bashkir(4) [100], [101], [102], [103], [104], [162], [127]
53 U+00EE î Latin LATIN SMALL LETTER I WITH CIRCUMFLEX Afrikaans (1), Romanian (1), Kurdish (2), Kirundi (1), French (1), Friulian (4) [175], [110], [158], [104], [114], [116]
54 U+00EF ï Latin LATIN SMALL LETTER I WITH DIAERESIS Afrikaans (1), French (1), Kaqchikel (4), Dinka (4), West Frisian (2) [175], [114], [126], [125], [171]
55 U+00F0 ð Latin LATIN SMALL LETTER ETH Faroese (2), Icelandic (1) [103], [102]
56 U+00F1 ñ Latin LATIN SMALL LETTER N WITH TILDE Spanish (1), Pulaar (3), Chamorro (1), Filipino (1), Guarani (1), Chavacano (4), Basque (1), Galician (2), Iloco (3), Quechua (3), Cape Verdean Creole (4), Waray-Waray (3), Wolof (4), Nauruan (3), Lozi (4), Bashkir (4), Marshallese (1), Mandinka (5), Igbo (2) [221], [222], [142], [143], [223], [160], [162], [224], [225], [226], [227], [228], [132], [144], [229], [127], [136], [197], [205]
57 U+00F2 ò Latin LATIN SMALL LETTER O WITH GRAVE Italian (1), Haitian Creole (1) [130], [182], [183]
58 U+00F3 ó Latin LATIN SMALL LETTER O WITH ACUTE Spanish (1), Polish (1), Czech (1), Icelandic (1), Kirundi (1), Chuukese (2), Galician (2), Wolof (4) [100], [152], [101], [102], [104], [105], [162], [132]
59 U+00F4 ô Latin LATIN SMALL LETTER O WITH CIRCUMFLEX Tswana (1), Afrikaans (1), Vietnamese (1), Kirundi (1), French (1), Northern Sotho (1), West Frisian (1), Galician (2), Friulian (4), Xavante (4) [173], [174], [175], [109], [104], [114], [230], [115], [162], [116], [117]
60 U+00F5 õ Latin LATIN SMALL LETTER O WITH TILDE Estonian (1), Skolt Sami (2), Umbundu (3), Guarani (1), Nauruan (3), Xavante (4), Khoekhoe (4) [122], [113], [141], [142], [143], [144], [117], [235]
61 U+00F6 ö Latin LATIN SMALL LETTER O WITH DIAERESIS German (1), Finnish (1), Afrikaans (1), Turkish (1), Swedish (1), Uyghur (2), Yapese (2), Drehu (4), Kaqchikel (4), Dinka (4), Bashkir (4), Low German (5), Chechen (2) 1992 Version, West Frisian (2), Nuer (4) [119], [120], [175], [157], [123], [179], [124], [180], [126], [125], [127], [231], [232], [171], [129]
62 U+00F8 ø Latin LATIN SMALL LETTER O WITH STROKE Danish (1), Faroese (2) [139], [103]
63 U+00F9 ù Latin LATIN SMALL LETTER U WITH GRAVE Italian (1), Papiamento (1) [130], [206], [245], [246]
64 U+00FA ú Latin LATIN SMALL LETTER U WITH ACUTE Spanish (1), Czech (1), Icelandic (1), Faroese (2), Kirundi (1), Chuukese (2), West Frisian (1), Galician (2) [100], [101], [102], [103], [104], [105], [115], [162]
65 U+00FB û Latin LATIN SMALL LETTER U WITH CIRCUMFLEX Afrikaans (1), Kurdish (2), Kirundi (1), French (1), Miskito (2), West Frisian (1), Friulian (4), Zazaki (4) [175], [158], [104], [114], [243], [115], [116], [244]
66 U+00FC ü Latin LATIN SMALL LETTER U WITH DIAERESIS German (1), Spanish (1), Afrikaans (1), Turkish (1), Swedish (1), French (1), Azeri (1), Basque (1), Galician (2), Uyghur (2), Kaqchikel (4), Bashkir (4), Low German (5) [119], [100], [175], [157], [123], [114], [159], [161], [162], [179], [126], [127], [231]
67 U+00FD ý Latin LATIN SMALL LETTER Y WITH ACUTE Turkmen (1), Czech (1), Icelandic (1), Faroese (2), Guarani (1) [121], [101], [102], [103], [142], [143]
68 U+00FE þ Latin LATIN SMALL LETTER THORN Icelandic (1) [102]
69 U+0101 ā Latin LATIN SMALL LETTER A WITH MACRON Latvian (1), Tongan (1), Hawaiian (2), Marshallese (1) [133], [134], [135], [136]
70 U+0103 ă Latin LATIN SMALL LETTER A WITH BREVE Vietnamese (1), Romanian (1), Bavarian (5) [109], [110], [111], [112]
71 U+0105 ą Latin LATIN SMALL LETTER A WITH OGONEK Polish (1), Lithuanian (1) [137], [138]
72 U+0107 ć Latin LATIN SMALL LETTER C WITH ACUTE Croatian (1), Serbian (1), Polish (1) [150], [151], [152]
73 U+010B ċ Latin LATIN SMALL LETTER C WITH DOT ABOVE Maltese (1) [163]
74 U+010D č Latin LATIN SMALL LETTER C WITH CARON Croatian (1), Serbian (1), Latvian (1), Slovak (1), Northern Sámi (2), Lithuanian (1), Kabyle (5) [150], [151], [152], [133], [153], [108], [154], [155], [156]
75 U+010F ď Latin LATIN SMALL LETTER D WITH CARON Czech (1), Slovak (1) [101], [153]
76 U+0111 đ Latin LATIN SMALL LETTER D WITH STROKE Croatian (1), Serbian (1), Vietnamese (1), Northern Sámi, Brahui (5) [150], [151], [109], [108], [168]
77 U+0113 ē Latin LATIN SMALL LETTER E WITH MACRON Latvian (1), Hawaiian (2), Tongan (1), Minangkabau (5) [133], [135], [134], [184]
78 U+0115 ĕ Latin LATIN SMALL LETTER E WITH BREVE Bavarian (5) [111], [112]
79 U+0117 ė Latin LATIN SMALL LETTER E WITH DOT ABOVE Lithuanian (1) [138], [154]
80 U+0119 ę Latin LATIN SMALL LETTER E WITH OGONEK Polish (1), Palauan (2), Lithuanian (1) [152], [185], [138], [154]
81 U+011B ě Latin LATIN SMALL LETTER E WITH CARON Czech (1), Kirundi (1), Sorbian (4) [101], [104], [172]
82 U+011F ğ Latin LATIN SMALL LETTER G WITH BREVE Turkish (1), Tatar (2), Azeri (1), Bashkir (4), Zaza (5) [157], [201], [159], [127], [202]
83 U+0121 ġ Latin LATIN SMALL LETTER G WITH DOT ABOVE Maltese (1) [163]
84 U+0123 ģ Latin LATIN SMALL LETTER G WITH CEDILLA Latvian (1), Brahui (5) [133], [168]
85 U+0127 ħ Latin LATIN SMALL LETTER H WITH STROKE Maltese (1) [163]
86 U+0129 ĩ Latin LATIN SMALL LETTER I WITH TILDE Guarani (1), Cubeo (3), Khoekhoe (4), Kikuyu (5) [142], [143], [186], [145], [209]
87 U+012B ī Latin LATIN SMALL LETTER I WITH MACRON Latvian (1), Lithuanian (1), Hawaiian (2), Tongan (1) [133], [138], [135], [134]
88 U+012F į Latin LATIN SMALL LETTER I WITH OGONEK Lithuanian (1) [154]
89 U+0131 ı Latin LATIN SMALL LETTER DOTLESS I Turkish (1), Tatar (2), Azeri (1) [157], [203], [201], [159]
90 U+0137 ķ Latin LATIN SMALL LETTER K WITH CEDILLA Latvian (1) [133]
91 U+013A ĺ Latin LATIN SMALL LETTER L WITH ACUTE Slovak (1) [153]
92 U+013C ļ Latin LATIN SMALL LETTER L WITH CEDILLA Latvian (1), Marshallese (1), Brahui (5) [133], [213], [214], [168]
93 U+013E ľ Latin LATIN SMALL LETTER L WITH CARON Slovak (1) [153]
94 U+0142 ł Latin LATIN SMALL LETTER L WITH STROKE Polish (1) [152]
95 U+0144 ń Latin LATIN SMALL LETTER N WITH ACUTE Polish (1), Lule Sámi (2), Sorbian (4), Brahui (5) [152], [107], [172], [168]
96 U+0146 ņ Latin LATIN SMALL LETTER N WITH CEDILLA Latvian (1), Marshallese (1) [133], [136]
97 U+0148 ň Latin LATIN SMALL LETTER N WITH CARON Turkmen (1), Czech (1), Slovak (1) [121], [101], [153]
98 U+014B ŋ Latin LATIN SMALL LETTER ENG Inari Sami (2), Dagaare - Burkina Faso (4), Dagbani (Dagomba) (4), Northern Sami (2), Ewondo (3), Luganda (3), Wolof (4), Adzera (4), Nuer (4), Ga (4), Dinka (4), Duala (3), Ewe (3), Soga (5), Alur (5), Mandinka (5), Acholi (5), Bambara (4), Nuer (4) [188], [148], [189], [108], [190], [191], [132], [192], [146], [193], [125], [194], [170], [195], [196], [197], [198], [199], [129]
99 U+014D ō Latin LATIN SMALL LETTER O WITH MACRON Hawaiian (2), Marshallese (1), Tongan (1) [135], [136], [134]
100 U+0151 ő Latin LATIN SMALL LETTER O WITH DOUBLE ACUTE Hungarian (1) [233], [234]
101 U+0153 œ Latin LATIN SMALL LIGATURE OE French (1) [114], [253]
102 U+0155 ŕ Latin LATIN SMALL LETTER R WITH ACUTE Slovak (1), Brahui (5) [153], [168]
103 U+0159 ř Latin LATIN SMALL LETTER R WITH CARON Czech (1), Sorbian (4) [101], [172]
104 U+015B ś Latin LATIN SMALL LETTER S WITH ACUTE Polish (1) [152]
105 U+015D ŝ Latin LATIN SMALL LETTER S WITH CIRCUMFLEX Tswa (5) [217]
106 U+015F ş Latin LATIN SMALL LETTER S WITH CEDILLA Turkish (1), Turkmen (1), Kurdish (2), Tatar (2), Azeri (1), Bashkir (4), Brahui (5), Zaza (5) [157], [121], [158], [201], [159], [127], [168], [202]
107 U+0161 š Latin LATIN SMALL LETTER S WITH CARON Tswana (1), Croatian (1), Serbian (1), Latvian (1), Northern Sotho (1), Northern Sami (2), Lithuanian (1) [174], [150], [151], [133], [230], [108], [154]
108 U+0165 ť Latin LATIN SMALL LETTER T WITH CARON Czech (1), Slovak (1) [101], [153]
109 U+0167 ŧ Latin LATIN SMALL LETTER T WITH STROKE Northern Sami (2), Brahui (5) [108], [168]
110 U+0169 ũ Latin LATIN SMALL LETTER U WITH TILDE Umbundu (3), Guarani (1), Nauruan (3), Khoekhoe (4), Kikuyu (5) [141], [142], [143], [144], [145], [209]
111 U+016B ū Latin LATIN SMALL LETTER U WITH MACRON Latvian (1), Hawaiian (2), Lithuanian (1), Marshallese (1), Tongan (1) [133], [135], [138], [154], [136], [134]
112 U+016F ů Latin LATIN SMALL LETTER U WITH RING ABOVE Czech (1) [101]
113 U+0171 ű Latin LATIN SMALL LETTER U WITH DOUBLE ACUTE Hungarian (1) [233], [234]
114 U+0173 ų Latin LATIN SMALL LETTER U WITH OGONEK Lithuanian (1) [154], [138]
115 U+0175 ŵ Latin LATIN SMALL LETTER W WITH CIRCUMFLEX Chichewa (3) [247]
116 U+017A ź Latin LATIN SMALL LETTER Z WITH ACUTE Polish (1), Brahui (5) (Lower), Sorbian (4) [152], [252], [168], [172]
117 U+017C ż Latin LATIN SMALL LETTER Z WITH DOT ABOVE Polish (1), Maltese (1) [152], [163]
118 U+017E ž Latin LATIN SMALL LETTER Z WITH CARON Lithuanian (1), Croatian (1), Serbian (1), Turkmen (1), Latvian (1), Slovak (1), Northern Sami (2), Chechen (2) 1925 Version [154], [150], [151], [121], [133], [153], [108], [232]
119 U+0192 ƒ Latin LATIN SMALL LETTER F WITH HOOK Ewe (3) [170]
120 U+0199 ƙ Latin LATIN SMALL LETTER K WITH HOOK Hausa (2) [147]
121 U+01A1 ơ Latin LATIN SMALL LETTER O WITH HORN Vietnamese (1) [118]
122 U+01B0 ư Latin LATIN SMALL LETTER U WITH HORN Vietnamese (1) [109]
123 U+01B4 ƴ Latin LATIN SMALL LETTER Y WITH HOOK Dagaare - Burkina Faso (4) [148], [251], [149]
124 U+01CE ǎ Latin LATIN SMALL LETTER A WITH CARON Kirundi (1) [104]
125 U+01D0 ǐ Latin LATIN SMALL LETTER I WITH CARON Kirundi (1) [104]
126 U+01D2 ǒ Latin LATIN SMALL LETTER O WITH CARON Kirundi (1) [104]
127 U+01D4 ǔ Latin LATIN SMALL LETTER U WITH CARON Kirundi (1) [104]
128 U+01DD ǝ Latin LATIN SMALL LETTER TURNED E Kanuri (3) [240]
129 U+01E7 ǧ Latin LATIN SMALL LETTER G WITH CARON Skolt Sami (2) [113]
130 U+01E9 ǩ Latin LATIN SMALL LETTER K WITH CARON Skolt Sami (2) [113]
131 U+01EF ǯ Latin LATIN SMALL LETTER EZH WITH CARON Skolt Sami (2) [113]
132 U+0219 ș Latin LATIN SMALL LETTER S WITH COMMA BELOW Romanian (1) [110]
133 U+021B ț Latin LATIN SMALL LETTER T WITH COMMA BELOW Romanian (1) [110]
134 U+024D ɍ Latin LATIN SMALL LETTER R WITH STROKE Kanuri (3) [240]
135 U+0253 ɓ Latin LATIN SMALL LETTER B WITH HOOK Hausa (2), Dagaare - Burkina Faso (4), Pulaar (3) [147], [148], [149]
136 U+0254 ɔ Latin LATIN SMALL LETTER OPEN O Dagaare - Burkina Faso (4), Dagbani (Dagomba) (4), Lingala (2), Akan (3), Ewondo (3), Fon (3), Nuer (4), Ga (4), Duala (3), Ewe (3), Nuer (4) [148], [189], [236], [237], [190], [169], [146], [193], [194], [170], [129]
137 U+0254 U+0308 ɔ ̈ Latin LATIN SMALL LETTER OPEN O COMBINING DIAERESIS Dinka (4) [125]
138 U+0254 U+0331 ɔ ̱ Latin LATIN SMALL LETTER OPEN O COMBINING MACRON BELOW Nuer (4) [129], [146]
139 U+0256 ɖ Latin LATIN SMALL LETTER D WITH TAIL Fon (3), Ewe (3) [169], [170]
140 U+0257 ɗ Latin LATIN SMALL LETTER D WITH HOOK Hausa (2), Pulaar (3) [147], [166], [167]
141 U+0259 ə Latin LATIN SMALL LETTER SCHWA Azeri, Azerbaijani (1), Ewondo (3), Ewe (3), Bugis (3) [159], [190], [170], [241]
142 U+025B ɛ Latin LATIN SMALL LETTER OPEN E Dagaare - Burkina Faso (4), Lingala (2), Akan (3), Ewondo (3), Dagbani (Dagomba) (4), Fon (3), Mossi (3), Ga (4), Ewe (3), Duala (3), Bambara (4), Nuer (4) [148], [236], [237], [190], [189], [169], [212], [238], [193], [170], [194], [199], [129]
143 U+025B U+0308 ɛ ̈ Latin LATIN SMALL LETTER OPEN E COMBINING DIAERESIS Nuer (4), Dinka (4) [129], [146], [239], [125]
144 U+025B U+0331 ɛ ̱ Latin LATIN SMALL LETTER OPEN E COMBINING MACRON BELOW Nuer (4) [129], [146], [239]
145 U+025B U+0331 U+0308 ɛ ̱ ̈ Latin LATIN SMALL LETTER OPEN E COMBINING MACRON BELOW COMBINING DIAERESIS Nuer (4) [146], [239]
146 U+0263 ɣ Latin LATIN SMALL LETTER GAMMA Dagbani (Dagomba) (4), Nuer (4), Dinka (4), Ewe (3), Nuer (4) [189], [146], [125], [170], [129]
147 U+0268 ɨ Latin LATIN SMALL LETTER I WITH STROKE Cubeo (3), Dagbani (Dagomba) (4), Hixkaryána (4), Maasai (5) [186], [189], [210], [211]
148 U+0268 U+0303 ɨ ̃ Latin LATIN SMALL LETTER I WITH STROKE COMBINING TILDE Cubeo (3) [186]
149 U+0269 ɩ Latin LATIN SMALL LETTER IOTA Dagaare - Burkina Faso (4), Mossi (3) [148], [212]
150 U+0272 ɲ Latin LATIN SMALL LETTER N WITH LEFT HOOK Susu (4), Zarma (4), Bambara (4) [218], [219], [199]
151 U+0289 ʉ Latin LATIN SMALL LETTER U BAR Cubeo (3), Maasai (5) [186], [187], [211]
152 U+0289 U+0303 ʉ ̃ Latin LATIN SMALL LETTER U BAR COMBINING TILDE Cubeo (3) [186], [187]
153 U+028B ʋ Latin LATIN SMALL LETTER V WITH HOOK Dagaare - Burkina Faso (4), Mossi (3), Ewe (3) [148], [212], [238], [170]
154 U+0292 ʒ Latin LATIN SMALL LETTER EZH Skolt Sami (2), Dagbani (Dagomba) (4) [113], [189]
155 U+1E0D Latin LATIN SMALL LETTER D WITH DOT BELOW Mundari (5), Kabyle (5) [165], [155], [156]
156 U+1E13 Latin LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW Venda (1) [164]
157 U+1E25 Latin LATIN SMALL LETTER H WITH DOT BELOW Kabyle (5) [155], [156]
158 U+1E37 Latin LATIN SMALL LETTER L WITH DOT BELOW Marshallese (1), Mundari (5) [213], [214], [215], [216], [165]
159 U+1E3D Latin LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW Venda (1) [164]
160 U+1E43 Latin LATIN SMALL LETTER M WITH DOT BELOW Marshallese (1) [213], [136], [215], [216]
161 U+1E45 Latin LATIN SMALL LETTER N WITH DOT ABOVE Venda (1), Tswa (5) [164], [217]
162 U+1E47 Latin LATIN SMALL LETTER N WITH DOT BELOW Mundari (5), Marshallese (1) [165], [136], [215], [216]
163 U+1E49 Latin LATIN SMALL LETTER N WITH LINE BELOW Pitjantjatjara (4) [220]
164 U+1E4B Latin LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW Venda (1) [164]
165 U+1E5B Latin LATIN SMALL LETTER R WITH DOT BELOW Kabyle (5) [155], [156]
166 U+1E63 Latin LATIN SMALL LETTER S WITH DOT BELOW Yoruba (2), Kabyle (5) [181], [155], [156]
167 U+1E6D Latin LATIN SMALL LETTER T WITH DOT BELOW Mizo (4), Mundari (5), Kabyle (5) [242], [165], [155], [156]
168 U+1E71 Latin LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW Venda (1) [164]
169 U+1E8D Latin LATIN SMALL LETTER X WITH DIAERESIS Mam (4) [248], [249]
170 U+1E91 Latin LATIN SMALL LETTER Z WITH CIRCUMFLEX Tswa (5) [217]
171 U+1E93 Latin LATIN SMALL LETTER Z WITH DOT BELOW Kabyle (5) [155]
172 U+1EA1 Latin LATIN SMALL LETTER A WITH DOT BELOW Vietnamese (1) [109]
173 U+1EA3 Latin LATIN SMALL LETTER A WITH HOOK ABOVE Vietnamese (1) [118]
174 U+1EA5 Latin LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACUTE Vietnamese (1) [109]
175 U+1EA7 Latin LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRAVE Vietnamese (1) [109]
176 U+1EA9 Latin LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOOK ABOVE Vietnamese (1) [118]
177 U+1EAB Latin LATIN SMALL LETTER A WITH CIRCUMFLEX AND TILDE Vietnamese (1) [118]
178 U+1EAD Latin LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT BELOW Vietnamese (1) [109]
179 U+1EAF Latin LATIN SMALL LETTER A WITH BREVE AND ACUTE Vietnamese (1) [109]
180 U+1EB1 Latin LATIN SMALL LETTER A WITH BREVE AND GRAVE Vietnamese (1) [109]
181 U+1EB3 Latin LATIN SMALL LETTER A WITH BREVE AND HOOK ABOVE Vietnamese (1) [109]
182 U+1EB5 Latin LATIN SMALL LETTER A WITH BREVE AND TILDE Vietnamese (1) [109]
183 U+1EB7 Latin LATIN SMALL LETTER A WITH BREVE AND DOT BELOW Vietnamese (1) [109]
184 U+1EB9 Latin LATIN SMALL LETTER E WITH DOT BELOW Yoruba (2) [181]
185 U+1EB9 U+0300 ̀ Latin LATIN SMALL LETTER E WITH DOT BELOW COMBINING GRAVE ACCENT Yoruba (2) [254]
186 U+1EB9 U+0301 ́ Latin LATIN SMALL LETTER E WITH DOT BELOW COMBINING ACUTE ACCENT Yoruba (2) [254]
187 U+1EBB Latin LATIN SMALL LETTER E WITH HOOK ABOVE Vietnamese (1) [118]
188 U+1EBD Latin LATIN SMALL LETTER E WITH TILDE Umbundu (3), Guarani (1), Cubeo (3), Xavante (4) [141], [142], [143], [186], [187], [117]
189 U+1EBF ế Latin LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTE Vietnamese (1) [118]
190 U+1EC1 Latin LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRAVE Vietnamese (1) [118]
191 U+1EC3 Latin LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOOK ABOVE Vietnamese (1) [118]
192 U+1EC5 Latin LATIN SMALL LETTER E WITH CIRCUMFLEX AND TILDE Vietnamese (1) [118]
193 U+1EC7 Latin LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOW Vietnamese (1) [118]
194 U+1EC9 Latin LATIN SMALL LETTER I WITH HOOK ABOVE Vietnamese (1) [118]
195 U+1ECB Latin LATIN SMALL LETTER I WITH DOT BELOW Igbo (2) [205]
196 U+1ECD Latin LATIN SMALL LETTER O WITH DOT BELOW Igbo (2), Yoruba (2), Marshallese (1) [204], [205], [181], [136], [215], [216]
197 U+1ECD U+0300 ̀ Latin LATIN SMALL LETTER O WITH DOT BELOW COMBINING GRAVE ACCENT Yoruba (2) [254]
198 U+1ECD U+0301 ́ Latin LATIN SMALL LETTER O WITH DOT BELOW COMBINING ACUTE ACCENT Yoruba (2) [254]
199 U+1ECF Latin LATIN SMALL LETTER O WITH HOOK ABOVE Vietnamese (1) [118]
200 U+1ED1 Latin LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACUTE Vietnamese (1) [118]
201 U+1ED3 Latin LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRAVE Vietnamese (1) [118]
202 U+1ED5 Latin LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOOK ABOVE Vietnamese (1) [118]
203 U+1ED7 Latin LATIN SMALL LETTER O WITH CIRCUMFLEX AND TILDE Vietnamese (1) [118]
204 U+1ED9 Latin LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT BELOW Vietnamese (1) [118]
205 U+1EDB Latin LATIN SMALL LETTER O WITH HORN AND ACUTE Vietnamese (1) [118]
206 U+1EDD Latin LATIN SMALL LETTER O WITH HORN AND GRAVE Vietnamese (1) [118]
207 U+1EDF Latin LATIN SMALL LETTER O WITH HORN AND HOOK ABOVE Vietnamese (1) [118]
208 U+1EE1 Latin LATIN SMALL LETTER O WITH HORN AND TILDE Vietnamese (1) [118]
209 U+1EE3 Latin LATIN SMALL LETTER O WITH HORN AND DOT BELOW Vietnamese (1) [118]
210 U+1EE5 Latin LATIN SMALL LETTER U WITH DOT BELOW Igbo (2) [204], [205]
211 U+1EE7 Latin LATIN SMALL LETTER U WITH HOOK ABOVE Vietnamese (1) [118]
212 U+1EE9 Latin LATIN SMALL LETTER U WITH HORN AND ACUTE Vietnamese (1) [118]
213 U+1EEB Latin LATIN SMALL LETTER U WITH HORN AND GRAVE Vietnamese (1) [118]
214 U+1EED Latin LATIN SMALL LETTER U WITH HORN AND HOOK ABOVE Vietnamese (1) [118]
215 U+1EEF Latin LATIN SMALL LETTER U WITH HORN AND TILDE Vietnamese (1) [118]
216 U+1EF1 Latin LATIN SMALL LETTER U WITH HORN AND DOT BELOW Vietnamese (1) [118]
217 U+1EF3 Latin LATIN SMALL LETTER Y WITH GRAVE Vietnamese (1) [118]
218 U+1EF5 Latin LATIN SMALL LETTER Y WITH DOT BELOW Vietnamese (1) [118]
219 U+1EF7 Latin LATIN SMALL LETTER Y WITH HOOK ABOVE Vietnamese (1) [118]
220 U+1EF9 Latin LATIN SMALL LETTER Y WITH TILDE Vietnamese (1), Guarani (1) [118], [142]

Legend

Code Point
A code point or code point sequence.
Name
Shows the character or sequence name from the Unicode Character Database.
Glyph
The shape displayed depends on the fonts available to your browser.
Script
Shows the script property value from the Unicode Character Database. Combining marks may have the value Inherited and code points used with more than one script may have the value Common.
References
Links to the references associated with the code point or sequence, if any.
Tags
LGR-defined tag values. Any tags matching the Unicode script property are suppressed in this view.
Required Context
Link to the rule defining the required context a code point or sequence must satisfy. If prefixed by "not:", identifies a context that must not occur.
Variants
A link to the variant set of which the code point or sequence is a member, except where a code point or sequence maps only to itself, in which case the type of that mapping is listed.
Comment
If the comment in this row consists only of the code point or sequence name it is suppressed in this view.
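
As an illustration of the Code Point column, the sketch below turns a table entry such as "U+0061 U+0331" into the string it denotes and tells single code points from sequences. The function names are hypothetical and not part of the LGR; the normative data remains the XML file.

```python
def parse_code_points(field: str) -> str:
    """Convert an entry like 'U+0061 U+0331' into the string it denotes."""
    # Each token is 'U+' followed by hexadecimal digits.
    return "".join(chr(int(token[2:], 16)) for token in field.split())


def is_sequence(field: str) -> bool:
    """True when the entry lists more than one code point."""
    return len(field.split()) > 1


# Three entries copied from the repertoire table above.
entries = ["U+0061", "U+0061 U+0331", "U+025B U+0331 U+0308"]
for entry in entries:
    text = parse_code_points(entry)
    kind = "sequence" if is_sequence(entry) else "single code point"
    print(entry, "->", repr(text), f"({kind})")
```

A sequence is matched as a unit, so a consumer of the table would typically compare whole sequences before falling back to single code points.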

Variant Sets

Summary

Number of variant sets 0
Largest variant set 1
Ordinary Variants by Type

The following tables list each pair of variant mappings on one row.

In a properly specified LGR, all members of each variant set are variants of each other, a property called transitivity. Because of that, all variant sets are necessarily disjoint. In each set, shading is used to group mappings from the same source code point or sequence.
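
The transitivity property can be checked mechanically. The following sketch groups mappings into variant sets by connectivity and then verifies that every member maps to every other member. This LGR defines no variant sets, so the members "x", "y" and "z" are purely hypothetical placeholders, as are the function names.

```python
from collections import defaultdict


def variant_sets(mappings):
    """Group members into sets by following (source, target) mappings."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for source, target in mappings:
        root_s, root_t = find(source), find(target)
        if root_s != root_t:
            parent[root_s] = root_t

    groups = defaultdict(set)
    for member in list(parent):
        groups[find(member)].add(member)
    return list(groups.values())


def is_transitive(mappings):
    """True when every member of a set maps to every other member."""
    pairs = set(mappings)
    for members in variant_sets(mappings):
        for a in members:
            for b in members:
                if a != b and (a, b) not in pairs:
                    return False
    return True


# Hypothetical data: x<->y and y<->z are given, but x<->z is missing.
incomplete = [("x", "y"), ("y", "x"), ("y", "z"), ("z", "y")]
complete = [(a, b) for a in "xyz" for b in "xyz" if a != b]
print(is_transitive(incomplete), is_transitive(complete))
```

Because sets are built from connectivity, they are disjoint by construction; the transitivity check then confirms that the listed mappings actually cover each set.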

Classes, Rules and Actions

Character Classes

The following table lists all top-level classes with their definition and the regular expression defining their members.

Name Definition Count Members References Comment

Legend

Members or Ranges
Lists the members of the class as code points (xxx) or as ranges of code points (xxx-yyy). Any class too numerous to list in full is elided with "...".
Tag=ttt
An anonymous class implicitly defined based on tag value.
[: :] - named character set
Reference to a named character set [:name:].
(∩,∪,\,∆) - set operators
Sets may be combined by set operators (∩ = intersection, ∪ = union, \ = difference, ∆ = symmetric difference).
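
The four set operators behave like the corresponding operations on ordinary sets. The sketch below shows them on two small classes of code points; the class names and their membership are assumptions chosen for illustration, since this LGR defines no character classes.

```python
# Hypothetical classes built from code points in the repertoire.
basic_letters = set(range(0x0061, 0x007B))             # a to z
vowel_like = {0x61, 0x65, 0x69, 0x6F, 0x75, 0x00E1}    # a e i o u and a-acute

intersection = basic_letters & vowel_like   # members of both classes
union = basic_letters | vowel_like          # members of either class
difference = vowel_like - basic_letters     # in vowel_like only
symmetric = basic_letters ^ vowel_like      # in exactly one of the two


def show(code_points):
    """Format a set of code points in U+XXXX notation, sorted."""
    return " ".join(f"U+{cp:04X}" for cp in sorted(code_points))


print("intersection:", show(intersection))
print("difference:  ", show(difference))
```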

Whole label evaluation and context rules

The following table lists all the top-level, or named, rules defined in the LGR and indicates whether they are used as a trigger in an action or as a context (when or not-when) for a code point. Any use of context rules for variants is not indicated.

Name Regular Expression Used as Trigger Used as Context Anchor References Comment

Legend

Used as Trigger
This rule triggers one of the actions listed below.
Used as Context
This rule defines a required context for a code point.
Anchor
This has a placeholder for the code point for which it is evaluated.
Regular Expression
A regular expression equivalent to the rule, shown in the standard notation with some extensions as noted:
⚓ - context anchor
In a regex the ⚓ signifies a placeholder for the actual code point, when a context is evaluated. The code point must occur at the position corresponding to the anchor. Rules containing an anchor cannot be used as triggers.
(...)← - look-behind
If present, encloses required context preceding the anchor.
→(...) - look-ahead
If present, encloses required context following the anchor.
(: :) - rule reference
Non-recursive reference to a named rule.
[: :] - character set either named, implicit or property
Reference to a named character set [:name:], an implicit character set [:class tag=val:] or a given Unicode property [:class property:prop=val:]. A leading "^" before name or tag indicates the set complement.
(|) - choice operator
When there are various choices in a rule, choices are separated by the choice operator (|) and each choice is represented by a set enclosed in parentheses.
(∩,∪,\,∆) - set operators
Sets may be combined by set operators (∩ = intersection, ∪ = union, \ = difference, ∆ = symmetric difference).
Ø - empty set
Indicates that the following set is empty, either as the result of set operations or because none of its elements are part of the repertoire defined here.
An empty set that is not optional means that a rule can never match.
{m}, {m, n}, {m,} - count
Indicates that the preceding element is evaluated from m to n times. {m} alone means the preceding element is evaluated exactly m times (equivalent to {m,m}); {m,} means the preceding element is evaluated at least m times.
If no count is indicated, the element is evaluated once (equivalent to "{1}").
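
The anchor, look-behind, look-ahead and count notations can be illustrated with an ordinary regular expression engine. The sketch below uses Python's re module as a stand-in: the position of the code point under evaluation plays the role of the anchor, the text before it is tested against the look-behind and the text after it against the look-ahead. The rule shown is hypothetical, since this LGR defines no context rules, and the function name is an assumption.

```python
import re


def context_holds(label, index, look_behind=None, look_ahead=None):
    """Evaluate a context for the code point at label[index] (the anchor)."""
    before = label[:index]
    after = label[index + 1:]
    if look_behind is not None:
        # The look-behind must match the text ending right at the anchor.
        if re.search("(?:" + look_behind + r")\Z", before) is None:
            return False
    if look_ahead is not None:
        # The look-ahead must match the text starting right after the anchor.
        if re.match("(?:" + look_ahead + ")", after) is None:
            return False
    return True


# Hypothetical rule: one or two basic letters before the anchor ({1,2} is a
# count), and a single vowel after it (no count, so evaluated once).
BEHIND = "[a-z]{1,2}"
AHEAD = "[aeiou]"

print(context_holds("abßa", 2, BEHIND, AHEAD))  # sharp s between "ab" and "a"
print(context_holds("ßa", 0, BEHIND, AHEAD))    # nothing precedes the anchor
```

Splitting the label at the anchor avoids relying on engine-specific look-behind support, which in many engines is limited to fixed-width patterns.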

Actions

The following table lists the actions that are used to assign dispositions to labels and variant labels, based on the specified conditions. The order of actions defines their precedence: the first action triggered by a label is the one defining its disposition.

# Condition Rule / Variant Set   Disposition References Comment

Legend

{...} - variant type set
In the "Rule/Variant Set" column the notation {...} means a set of variant types.
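
The precedence rule for actions (the first action triggered by a label defines its disposition) amounts to a first-match scan over an ordered list. The sketch below shows that evaluation order under stated assumptions: the two actions and their conditions are hypothetical, because the action table of this LGR is empty in this view, and the function names are not taken from the LGR.

```python
import unicodedata


def starts_with_combining_mark(label):
    """Hypothetical trigger: the label begins with a combining mark."""
    return bool(label) and unicodedata.combining(label[0]) != 0


def always(label):
    """Hypothetical catch-all condition that every label satisfies."""
    return True


# Order matters: earlier entries take precedence over later ones.
ACTIONS = [
    (starts_with_combining_mark, "invalid"),
    (always, "valid"),
]


def disposition(label, actions):
    """Return the disposition of the first action whose condition holds."""
    for condition, result in actions:
        if condition(label):
            return result
    return None  # no action was triggered


print(disposition("\u0331a", ACTIONS))  # both conditions hold; the first wins
print(disposition("a\u0331", ACTIONS))  # only the catch-all holds
```

Reordering the list would change the outcome for labels that satisfy more than one condition, which is why the order of actions is part of the specification.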

Table of References

[100] https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-spanish-30aug16-en.html
[101] http://www.omniglot.com/writing/czech.htm
[102] http://www.omniglot.com/writing/icelandic.htm
[103] http://www.omniglot.com/writing/faroese.htm
[104] https://en.wikipedia.org/wiki/Burundi_Bwacu#Kirundi_.28with_tonal_diacritics_.E2.80.94_utw.C3.A2tuzo.29
[105] http://www.omniglot.com/writing/chuukese.htm
[106] http://www.webcitation.org/6siTI8ieQ
[107] http://www.omniglot.com/writing/lulesami.htm
[108] https://en.wikipedia.org/wiki/Northern_Sami
[109] http://www.omniglot.com/writing/vietnamese.htm
[110] http://www.omniglot.com/writing/romanian.htm
[111] https://www.omniglot.com/writing/bavarian.htm
[112] https://bar.wikipedia.org/wiki/Wikipedia:Boarische_Umschrift
[113] http://www.omniglot.com/writing/skoltsami.htm
[114] http://omniglot.com/writing/french.htm
[115] http://www.omniglot.com/writing/westfrisian.htm
[116] http://www.omniglot.com/writing/friulian.htm
[117] http://www.silbrasil.org.br/resources/archives/17019
[118] https://www.omniglot.com/writing/vietnamese.htm
[119] http://www.omniglot.com/writing/german.htm
[120] http://www.omniglot.com/writing/finnish.htm
[121] http://www.omniglot.com/writing/turkmen.htm
[122] http://www.omniglot.com/writing/estonian.htm
[123] http://www.omniglot.com/writing/swedish.htm
[124] http://www.omniglot.com/writing/yapese.htm
[125] https://www.omniglot.com/writing/dinka.php
[126] http://www.omniglot.com/writing/kaqchikel.htm
[127] http://www.omniglot.com/writing/bashkir.htm
[128] https://www.omniglot.com/writing/alsatian.htm
[129] https://en.wikipedia.org/wiki/Nuer_language
[130] http://www.omniglot.com/writing/italian.htm
[131] https://en.wikipedia.org/wiki/Italian_orthography
[132] http://www.omniglot.com/writing/wolof.htm
[133] http://www.omniglot.com/writing/latvian.htm
[134] http://www.omniglot.com/writing/tongan.htm
[135] http://www.omniglot.com/writing/hawaiian.htm
[136] http://www.omniglot.com/writing/marshallese.php
[137] http://www.omniglot.com/writing/polish.htm
[138] http://www.omniglot.com/writing/lithuanian.htm
[139] http://www.omniglot.com/writing/danish.htm
[140] http://www.omniglot.com/writing/chamorro.htm
[141] http://www.omniglot.com/writing/umbundu.htm
[142] http://www.omniglot.com/writing/guarani.htm
[143] https://en.wikipedia.org/wiki/Guarani_alphabet
[144] http://www.omniglot.com/writing/nauruan.htm
[145] https://www.omniglot.com/writing/khoekhoe.htm
[146] https://www.omniglot.com/writing/nuer.htm
[147] http://www.omniglot.com/writing/hausa.htm
[148] http://www.omniglot.com/writing/dagaare.htm
[149] http://www.omniglot.com/writing/fula.htm
[150] http://www.omniglot.com/writing/croatian.htm
[151] http://www.omniglot.com/writing/serbian.htm
[152] https://en.wikipedia.org/wiki/Polish_language
[153] http://www.omniglot.com/writing/slovak.htm
[154] http://www.evertype.com/alphabets/lithuanian.pdf
[155] http://www.omniglot.com/writing/kabyle.php
[156] https://en.wikipedia.org/wiki/Kabyle_language
[157] http://www.omniglot.com/writing/turkish.htm
[158] http://www.omniglot.com/writing/kurdish.htm
[159] http://www.omniglot.com/writing/azeri.htm
[160] http://www.omniglot.com/writing/basque.htm
[161] https://en.wikipedia.org/wiki/Basque_language#Writing_system
[162] http://scriptsource.org/cms/scripts/page.php?item_id=wrSys_detail_sym
[163] http://www.omniglot.com/writing/maltese.htm
[164] http://www.omniglot.com/writing/venda.htm
[165] https://www.omniglot.com/writing/mundari.htm
[166] https://en.wikipedia.org/wiki/Hausa_language
[167] http://phoible.org/inventories/view/809#tsource
[168] https://www.omniglot.com/writing/brahui.htm
[169] https://en.wikipedia.org/wiki/Fon_language
[170] http://www.omniglot.com/writing/ewe.htm
[171] http://www.geonames.de/alphfj.html
[172] https://www.omniglot.com/writing/sorbian.htm
[173] http://files.peacecorps.gov/multimedia/audio/languagelessons/botswana/Bw_Setswana_Language_Lessons.pdf
[174] http://omniglot.com/writing/tswana.php
[175] https://en.wikipedia.org/wiki/Afrikaans
[176] http://www.omniglot.com/writing/albanian.htm
[177] https://en.wikipedia.org/wiki/Albanian_alphabet
[178] http://www.jesuitvolunteers.org/wp-content/uploads/2015/08/So_you_want_to_learn_chuukese_-_only_for_Chuuk_JVs.pdf
[179] https://en.wikipedia.org/wiki/Uyghur_Latin_alphabet
[180] http://www.omniglot.com/writing/drehu.php
[181] http://www.omniglot.com/writing/yoruba.htm
[182] http://www.omniglot.com/writing/haitiancreole.htm
[183] https://en.wikipedia.org/wiki/Haitian_Creole#Orthography
[184] http://www.omniglot.com/writing/minangkabau.htm
[185] http://www.omniglot.com/writing/palauan.htm
[186] http://www.omniglot.com/writing/cubeo.htm
[187] https://www.sil.org/system/files/reapdata/10/58/27/10582785843693992331766506069073895620/40337_01.pdf
[188] http://www.omniglot.com/writing/inarisami.htm
[189] http://www.omniglot.com/charts/dagbani.pdf
[190] http://www.omniglot.com/writing/ewondo.php
[191] http://www.omniglot.com/writing/ganda.php
[192] http://www.omniglot.com/writing/adzera.htm
[193] http://www.omniglot.com/writing/ga.htm
[194] http://www.omniglot.com/writing/duala.php
[195] http://www.omniglot.com/writing/soga.htm
[196] http://www.omniglot.com/writing/alur.htm
[197] http://www.omniglot.com/writing/mandinka.htm
[198] https://www.omniglot.com/writing/acholi.htm
[199] http://www.omniglot.com/writing/bambara.htm
[200] http://www.omniglot.com/writing/raga.htm
[201] http://www.omniglot.com/writing/tatar.htm
[202] https://www.omniglot.com/writing/zazaki.htm
[203] https://en.wikipedia.org/wiki/Turkish_alphabet
[204] https://www.degruyter.com/downloadpdf/j/psicl.2007.43.issue-1/v10010-007-0009-0/v10010-007-0009-0.pdf
[205] http://www.omniglot.com/writing/igbo.htm
[206] https://www.italianpod101.com/italian-accents
[207] http://www.affaritaliani.it/blog/monica-la-pensa-cosi
[208] http://dictionary.reverso.net/italian-english/venerd%C3%AC
[209] http://www.omniglot.com/writing/kikuyu.htm
[210] http://www.omniglot.com/writing/hixkaryana.htm
[211] http://www.omniglot.com/writing/maasai.htm
[212] http://www.omniglot.com/writing/mossi.htm
[213] http://www.omniglot.com/babel/marshallese.htm
[214] https://en.wikipedia.org/wiki/Cedilla#Marshallese
[215] https://en.wikipedia.org/wiki/Marshallese_language#Display_issues
[216] http://www.trussel2.com/MOD/
[217] http://www.omniglot.com/writing/tswa.htm
[218] https://www.omniglot.com/writing/susu.htm
[219] https://www.omniglot.com/writing/zarma.htm
[220] https://www.omniglot.com/writing/pitjantjatjara.htm
[221] http://www.omniglot.com/writing/spanish.htm
[222] http://www.omniglot.com/writing/filipino.htm
[223] http://www.omniglot.com/writing/chavacano.php
[224] https://en.wikipedia.org/wiki/Ilocano_language#Modern_alphabet
[225] http://www.omniglot.com/writing/quechua.htm
[226] https://en.wikipedia.org/wiki/Quechua_alphabet
[227] http://www.omniglot.com/writing/kriol.php
[228] http://www.omniglot.com/writing/waray.php
[229] http://www.omniglot.com/writing/lozi.htm
[230] http://africanlanguages.com/northern_sotho/
[231] https://www.omniglot.com/writing/lowgerman.htm
[232] https://en.wikipedia.org/wiki/Chechen_language
[233] http://www.omniglot.com/writing/hungarian.htm
[234] https://en.wikipedia.org/wiki/Hungarian_alphabet
[235] http://www.omniglot.com/writing/khoekhoe.htm
[236] http://www.omniglot.com/writing/lingala.htm
[237] https://www.omniglot.com/writing/akan.htm
[238] https://en.wikipedia.org/wiki/Mossi_language
[239] https://www.sil.org/system/files/reapdata/10/06/46/100646256099282892829790816212446104791/OPSL_9.pdf (p. 75)
[240] http://www.omniglot.com/writing/kanuri.htm
[241] http://www.omniglot.com/writing/bugis.htm
[242] http://www.omniglot.com/writing/mizo.htm
[243] http://www.omniglot.com/writing/miskito.htm
[244] http://www.omniglot.com/writing/zazaki.htm
[245] https://en.wikipedia.org/wiki/Papiamento
[246] http://www.omniglot.com/writing/papiamento.php
[247] http://www.omniglot.com/writing/chichewa.php
[248] http://www.native-languages.org/mam_words.htm
[249] http://www.omniglot.com/writing/mam.htm
[250] https://en.wikipedia.org/wiki/Pulaar_language
[251] https://en.wikipedia.org/wiki/Fula_language#Writing_systems
[252] https://en.wikipedia.org/wiki/Polish_alphabet
[253] https://en.wikipedia.org/wiki/French_orthography
[254] https://www.omniglot.com/writing/yoruba.htm