Thanks Mirjana, Thanks Mats, my point was whether you want theory-driven vs data-driven approach to script analysis. Our aim is to develop 'code point repertoire' and the task requires some theoretical criteria, that for sure. 

One way of allocating languages to individual members is to do it on the basis of 'linguistic typology' or by looking at 'language families' or on the basis of 'geographical areas' ie Eurasian, Australian, European etc. 



On Wed, May 24, 2017 at 11:52 AM, Mats Dufberg <mats.dufberg@iis.se> wrote:

Nebiye,

 

I do not fully agree. You will always have some theory in your mind, explicit or implicit. But I do agree that we should have an open mind to let the data be an input on the "theory" that we work from.

 

And I do agree that we should start working on languages to gain experience, and with the risk that we might have to redo the analysis of some languages.

 

 

Yours,

Mats

 

---

Mats Dufberg

DNS Specialist, IIS

Mobile: +46 73 065 3899

https://www.iis.se/en/

 

 

From: <latingp-bounces@icann.org> on behalf of Textual Solutions <textualsolutions@gmail.com>
Date: Wednesday 24 May 2017 at 07:51
To: Mirjana Tasić <Mirjana.Tasic@rnids.rs>, Latin GP <latingp@icann.org>


Subject: Re: [Latingp] Preliminary list of Languages using Latin script with EGIDS=1-4

 

Mirjana, there's nothing 'romantic' about language data, how you look at linguistic information depends on which scjhool of linguistics you come from! The training I had recommends that if you look at your data unbiased and without a theory in mind,, patterns will emerge.  My suggestion is we start by allocating a certain number of languages to each member.   

 

On Tue, May 23, 2017 at 11:48 PM, Mirjana Tasić <Mirjana.Tasic@rnids.rs> wrote:

Dear Nebiye,

 

You should explain to the group, what someone should look for. Your question is not clear to me, if I start looking at the languages, I have to have some idea, what to look for.

 

Regards

 

From: Textual Solutions <textualsolutions@gmail.com>
Date: Tuesday, May 23, 2017 at 22:28
To: Mirjana Tasić <Mirjana.Tasic@rnids.rs>
Cc: Latin GP <latingp@icann.org>
Subject: Re: [Latingp] Preliminary list of Languages using Latin script with EGIDS=1-4

 

Mirjana, thanks, it's now a matter of who's going to look at which languages? Could the members of the Repertoire group please state preferences?

 

23 May 2017 22:13 tarihinde "Mirjana Tasić" <Mirjana.Tasic@rnids.rs> yazdı:

Dear GP members

 

Here we have two links:

 

 

1.       Preliminary list of Languages using Latin script sorted by EGIDS (only 1-4) and then by ISO 639-3

 

 

https://docs.google.com/document/d/1YBb7wsy_hjBex6JXSIeJNOeK1KMIjU70pQRqLFR331s/edit?usp=sharing

 

I have granted access to this table for all GP members

 

 

 

2.       Procedure which has been used in measuring the time necessary for processing all letters for one language

 

 

https://docs.google.com/document/d/1NRYCtrgYqcK1USIE4AQ-oc5ypWGGq9Th6i9sHCAEF2Y/edit?usp=sharing

 

This is the information for people who are new to the group, so they can see how we processed languages in one of the previous steps.

 

 

Regards Mirjana

 

 


_______________________________________________
Latingp mailing list
Latingp@icann.org
https://mm.icann.org/mailman/listinfo/latingp