Mirjana,

 

Since I promised to review the second-level LGRs to see whether any of those languages use characters that we have not included, I have now done that, and these are my findings (Bill and Meikal have at least looked at some of them):

 

U+00FF is included for French. Omniglot does not list it, but Wikipedia does.

 

U+0157 is included for Latvian. Omniglot does not list it. Wikipedia does, but says it is used today only in the diaspora (as Meikal pointed out).

 

The other missing code points from Latin Extended-A were not found in any of the second-level LGRs.

 

Code points I looked at:

 

00FF LATIN SMALL LETTER Y WITH DIAERESIS (French)

0109 LATIN SMALL LETTER C WITH CIRCUMFLEX

0125 LATIN SMALL LETTER H WITH CIRCUMFLEX

0135 LATIN SMALL LETTER J WITH CIRCUMFLEX

014F LATIN SMALL LETTER O WITH BREVE

0157 LATIN SMALL LETTER R WITH CEDILLA (Latvian)

0163 LATIN SMALL LETTER T WITH CEDILLA

0177 LATIN SMALL LETTER Y WITH CIRCUMFLEX
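
For anyone who wants to double-check the names in the list above, here is a minimal Python sketch that looks them up in the Unicode character database shipped with the standard library. The helper name `describe` and the list name `REVIEWED` are only illustrative, and the sketch says nothing about which LGR includes which code point.

```python
import unicodedata

# Code points reviewed in this message (hex values copied from the list above).
REVIEWED = [0x00FF, 0x0109, 0x0125, 0x0135, 0x014F, 0x0157, 0x0163, 0x0177]


def describe(cp: int) -> str:
    """Return 'U+XXXX <char> <UNICODE NAME>' for a single code point."""
    ch = chr(cp)
    return f"U+{cp:04X} {ch} {unicodedata.name(ch)}"


for cp in REVIEWED:
    print(describe(cp))
```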

 

The LGRs can be found at https://www.icann.org/resources/pages/second-level-lgr-2015-06-21-en

 

*

 

A comment to Bill and Meikal:

 

The LGRs for German and Spanish list U+014F (ŏ) LATIN SMALL LETTER O WITH BREVE as *excluded*, not as included. You have to go down to “repertoire by code point” to see what is included. They also have a concept of “extended code point”, which sits in between included and excluded, e.g. U+00E0 for Spanish.

 

ÿ (U+00FF) is included in neither the English nor the German LGR.

 

 

Mats

 

---

Mats Dufberg

DNS Specialist, IIS

Mobile: +46 73 065 3899

https://www.iis.se/en/

 

 

From: Latingp <latingp-bounces@icann.org> on behalf of Meikal Mumin <meikal.mumin@uni-koeln.de>
Date: Monday, 23 July 2018 at 00:11
To: Bill Jouris <bill.jouris@insidethestack.com>
Cc: ICANN Latin GP <latingp@icann.org>
Subject: Re: [Latingp] Repertoire and Latin Extended A

 

Dear colleagues,

 

On 22 July 2018 at 21:49, Bill Jouris <bill.jouris@insidethestack.com> wrote:

Hi Mirjana,



I've reviewed the repertoire we have (after adding Esperanto) and compared it to the Unicode table's Basic Latin, Latin-1 Supplement, and Latin Extended-A codepoints. 



The following entries from Latin-1 Supplement are included in MSR-2, but not included in our repertoire:

00FF    ÿ     Latin Small Letter Y with Diaeresis

 

This occurs rarely in personal names in German https://de.wikipedia.org/wiki/%C5%B8#Franz%C3%B6sisch

and in French in place names amongst others (https://fr.wikipedia.org/wiki/%C5%B8#Fran%C3%A7ais).



The following entries from Latin Extended-A are included in MSR-3 but not included in our repertoire:

014F     ŏ   Latin Small Letter O with Breve

0157     ŗ    Latin Small Letter R with Cedilla

 

FYI

ÿ is listed in the ICANN LGRs for German and for English (much to my amazement, as I have never encountered it previously), but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR, for either language.

 

A quick search did not yield any evidence for English, but it did for German (see above).

 

ŏ is listed in the ICANN LGRs for German and for Spanish, but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR, for either language.

 

A quick search did not yield any supporting evidence.

 

ŗ is listed in the ICANN LGR for Latvian, but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR.


This https://de.wikipedia.org/wiki/%C5%96 says it was used historically in Latvian. This https://en.wikipedia.org/wiki/Latvian_orthography clarifies that it is part of an older orthography still in use in diaspora communities.

 

 

 

LGR for language deu-Latn — German

 

This is way larger than the set of characters used in German, even taking into consideration loans and borrowings from other languages. I would be interested to know who developed this and on what basis. Some sources are 404.

 

 

 

 

LGR for language eng-Latn — English

 

 

LGR for language spa-Latn — Spanish

 

 

Bill Jouris
Inside Products
bill.jouris@insidethestack.com
831-659-8360
925-855-9512 (direct)



 

 

Best,

 

Meikal