Dear NBGP members,

Kindly let me draw you attention to the issue of cross-script variant code points where there is only a single code point or there are only a few code points.

Background

Currently NBGP proposals include all cross-script variant code points which they can form well-formed cross-script variant labels without considering how many cross-script variant code points there are between two scripts.

Example1: Oriya ଠ (0B20) and Malayalam ഠ (0D20) are variant code points.

They are consonants and they can form such ഠഠഠ (0B20 0B20 0B20) and ଠଠଠ (0D20 0D20 0D20) cross-script variant labels

Oriya	Malayalam
ଠ (0B20)	ഠ (0D20)

Example2: Telugu ం (0C02) and Malayalam ം (0D02) are NOT variant code points. As they are combining marks and cannot form variant labels. The same applies or Telugu ః (0C03)and Malayalam ഃ (0D03)

Telugu	Malayalam
ం (0C02)	ം (0D02)
ః (0C03)	ഃ (0D03)

IP Feedback

With only a single consonant (or plus two combining marks) the overlap between scripts appears rather limited (case of Example 1 above) . The IP would recommend dropping the variants. This feedback applies for Telugu, Kannada, Sinhala, Oriya, Malayalam. However the GP decision will affect all NBGP proposals.

The IP suggest dropping following variant sets:

Telugu	Kannada	Sinhala
ం (0C02)	ಂ (0C82)	ං (0D82)
ః (0C03)	ಃ (0C83)	ඃ (0D83)
ర (0C30)	ರ (0CB0)	ර (0DBB)

Oriya	Malayalam
ଠ (0B20)	ഠ (0D20)

OPTIONS

OPTION 1: Do nothing.

OPTION 2: Drop the suggested variant sets.

Both options are valid. The final decision depends on NBGP. Whichever option selected, the proposals will be published for public comment period for 40 days. The community and experts will also have a chance to make a comment there. After the public comment period has ended. NBGP will consider all feedback and finalize proposals accordingly.

We’d like to request the NBGP to consider this issue prior to the NBGP-Sinhala call this evening and let’s aim to finalize the option during the call.

Regards,

Pitinan