Meikal,

 

I have the same question on the LGR for Swedish. It contains characters never used for writing Swedish words. They can be found in non-Swedish names in a text in Swedish, but if that is the criterion (that the letter appears in a Swedish text) then the list of characters should be much longer. Any place or person name written with Latin characters could be found in a Swedish text.

 

The Swedish second-level LGR cannot be used to support the inclusion of characters, and the it seems like it is the same for e.g. the German table.

 

 

Mats

 

---

Mats Dufberg

DNS Specialist, IIS

Mobile: +46 73 065 3899

https://www.iis.se/en/

 

 

From: Latingp <latingp-bounces@icann.org> on behalf of Meikal Mumin <meikal.mumin@uni-koeln.de>
Date: Monday, 23 July 2018 at 00:11
To: Bill Jouris <bill.jouris@insidethestack.com>
Cc: ICANN Latin GP <latingp@icann.org>
Subject: Re: [Latingp] Repertoire and Latin Extended A

 

Dear colleagues,

 

On 22 July 2018 at 21:49, Bill Jouris <bill.jouris@insidethestack.com> wrote:

Hi Mirjana,



I've reviewed the repertoire we have (after adding Esperanto) and compared it to the Unicode table's Basic Latin, Latin-1 Supplement, and Latin Extended-A codepoints. 



The following entries from Latin-1 Supplement are included in MSR-2, but not included in our repertoire:

00FF    ÿ     Latin Small Letter Y with Diaeresis

 

This occurs rarely in personal names in German https://de.wikipedia.org/wiki/%C5%B8#Franz%C3%B6sisch

and in French in place names amongst others (https://fr.wikipedia.org/wiki/%C5%B8#Fran%C3%A7ais).



The following entries from Latin Extended-A are included in MSR-3 but not included in our repertoire:

014F     ŏ   Latin Small Letter O with Breve

0157     ŗ    Latin Small Letter R with Cedilla

 

FYI

ÿ is listed in the ICANN LGRs for German and for English (much to my amazement, as I have never encountered it previously), but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR, for either language.

 

A quick search did not yield any evidence for English, but German - see above.

 

ŏ is listed in the ICANN LGRs for German and for Spanish, but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR, for either language.

 

A quick search did not yield any supporting evidence.

 

ŗ is listed in the ICANN LGR for Latvian, but does not appear in Omniglot, nor in the Wikipedia alphabet referenced in the LGR.


This https://de.wikipedia.org/wiki/%C5%96 says it was used historically in Lativian. This https://en.wikipedia.org/wiki/Latvian_orthography clarifies that it is part of an older orthography still in use in diaspora communities.

 

 

 

LGR for language deu-Latn — German

 

This is way  larger than the set of characters used in German, even taking into consideration loans and borrowings from other languages. I would be interested to know who developed this on what basis. Some sources are 404.

 

 

 

 

LGR for language eng-Latn — English

 

 

LGR for language spa-Latn — Spanish

 

 

Bill Jouris
Inside Products
bill.jouris@insidethestack.com
831-659-8360
925-855-9512 (direct)


_______________________________________________
Latingp mailing list
Latingp@icann.org
https://mm.icann.org/mailman/listinfo/latingp

 

 

Best,

 

Meikal