Dear Chris,
I will deeply study the ToDo list, and your comments, which may take some more time.
Before finishing my homework, I would like to make two points for consideration:
1. Up to now, what we are working on is limited to character (and its variants)- based label generation, we have not defined string/word based rule yet. Especial, we have not define the language and/or context sensitive string/word yet. It seems quite complicated, but is inevitable to work on it in the next stage. For examples,
(1) Exception treatment, such as ·¢-?l-?k and·¢/ ??
(2) C-J sensitive : ±õ-?I-亣¬ÒÕ-??-Ü¿£¬
(3) ͬÒå´Ê Synonym as label£¨word based variant string£©: ?µÎ»»?-Êý×Ö»¯ £¬¼¤¹â-ÀØÉ䣬±ãµ±-Û͵±
(4) Rules like: No simplified/traditional/variant mixing in labels. (important for CGP)
2. Some visible differences amongst Hanzi/Hanja/Kanji would be so-called Z-difference in Unicode/UniHan, say,
¡°I have also been looking for differences between Traditional Chinese characters and Korean hanja. So far I have found one: characters with the progression radical tend to start with two dots in hanja: ÌÓ and only one in Traditional Chinese: ÌÓ.¡±
Actually, both are encoded at U+9003, but rendered in different fonts.
If I may participate the coming Seoul meeting, we may discuss in detail.
Looking forward to seeing you there.
Regards,
Zhang
·¢¼þÈË: chinesegp-bounces@icann.org [mailto:chinesegp-bounces@icann.org] ´ú±í Dillon, Chris
·¢ËÍʱ¼ä: 2015Äê4ÔÂ27ÈÕ 20:43
ÊÕ¼þÈË: hotta@jprs.co.jp; KoreanGP@icann.org; ChineseGP@icann.org; JapaneseGP@icann.org
Ö÷Ìâ: Re: [ChineseGP] [Koreangp] Proposed Action items before Seoul meeting
Dear colleagues,
Here are some comments, as requested by Hiro.
I reckon I have now caught up after missing the Dallas meeting.
I believe Mr Yoneya¡¯s algorithm will work.
I have spent some amount of time looking for exceptions to various statements in it e.g. Slide 5 ¡°there exists at least one identical ideograph¡±. (No exception found.)
It is fortunate that ?C ¡¯machine¡¯ / »ú ¡¯desk¡¯ and ?k ¡¯send¡¯ / ?? ¡®hair¡¯ seem to be the only cases where (at least commonly used) different characters in Japanese are the same character in Simplified Chinese. (I haven¡¯t spent as much time with looking for characters that are separate in Chinese but brought together in Japanese. ÛÍ replaces at least three characters in Chinese, but I think none are common. I can imagine a . Û͵± TLD, so that may be good news for bento companies.)
I note the options for the disposition of variants not defined in the LGR-1s (Slide 6), i.e.:
- Blocked if the variant is not in the LGR-1 / Allocatable otherwise
- Blocked if the variant is not in the LGR-1 / Inherit its original disposition in the LGR-1 (Allocatable/Simp/Trad/Both)
Both case studies are most interesting. I note that there are some labels, e.g. ÓèÔ° (with the first character, I think used only in Japan and the second only in Simplified Chinese) that perhaps we would prefer not to see allocatable in the ideal world, but suspect that blocking them would involve adding horrendous complexity.
I note that it is difficult to understand Japanese LGR-1, as the characters are not visible.
I have also been looking for differences between Traditional Chinese characters and Korean hanja. So far I have found one: characters with the progression radical tend to start with two dots in hanja: ÌÓ and only one in Traditional Chinese: ÌÓ.
Looking forward to Seoul,
Regards,
Chris.
--
Research Associate in Linguistic Computing, Centre for Digital Humanities, UCL, Gower St, London WC1E 6BT Tel +44 20 7679 1599 (int 31599) www.ucl.ac.uk/dis/people/chrisdillon
-----Original Message-----
From: koreangp-bounces@icann.org [mailto:koreangp-bounces@icann.org] On Behalf Of HiroHOTTA
Sent: 25 April 2015 18:05
To: KoreanGP@icann.org; ChineseGP@icann.org; JapaneseGP@icann.org
Subject: [Koreangp] Proposed Action items before Seoul meeting
Dear colleagues in CGP/JGP/KGP,
If I may, in order for us to make our Seoul meeting efficient and fruitful, I'd like to propose what each of us is expected to prepare well before the meeting.
I know I am very pushy but I think at least we must not use our precious time just to understand the information in front of us for a long time.
Please give comments and let's discuss online about the ToDo's before Seoul meeting .
==
[[Premise]]
ToDo-1 <must> Each participant understands what RootLGR is and
what is expected for GPs to do.
ToDo-2 <must> Each participant understands Yoneya's algorithm
that was already sent to CGP/JGP/KGP by Yoneya and
also agreed by C and J in Dallas, which is attached
to this mail as well
ToDo-3 <must> Each participant understands MSS concept that was
already sent to CGP/JGP/KGP by Dr. Wang Wei, which
is attached to this mail along with HiroHOTTA's
response
ToDo-4 <expected> Participants agree on Yoneya's algorithm as a
framework and also agree on partial usage of MSS
to accelerate our discussion ("partial" means
"J doesn't need to be considered to be incorporated
into MSS") This is expected to be discussed and
finalized online before our meeting
[[Integration Algorithm]]
ToDo-5 <expected> J gets IP's feedback on Yoneya's algorithm
[[MSS/LGR-1]]
ToDo-6 <must> C prepares MSS repertoire, which may be
equivalent to Chinese LGR-1 repertoire (done?)
ToDo-7 <expected> C prepares Chinese variants within MSS, which may
be equivalent to Chinese LGR-1 (planned date is
expected to be declared, if not in time for the
meeting)
ToDo-8 <must> J prepares Japanese LGR-1 repertoire and variants
(there's no variants in Japanese LGR-1 : they
were already sent to CGP/JGP/KGP)
ToDo-9 <must> K prepares the basic idea of Korean LGR-1 repertoire
and variants
ToDo-10<expected> K prepares Korean LGR-1 repertoire and variants
(planned date is expected to be declared, ift
LGR-1 does not come in time for the meeting)
ToDo-11<expected> each of CGP/JGP/KGP assesses the repertoires and
variants that have already been provided by other
GPs as far as possible
[[Logistics/etc.]]
ToDo-12<must> each CGP/JGP/KGP Chair designates a person in charge
of ToDo-5 to ToDo-11 well in advance to the meeting
(expected to post the (names) in replying this mail
by May 1st) - this may accelerate the coordination
a lot
ToDo-13<must> convener fixes the agenda through consultation with
CJK colleagues (Hiro is pleased to behave as the
convener until someone will raise his/her hand)
Hiro
_______________________________________________
Koreangp mailing list