Good points Gavin. Makes sense to me. Using your strategy then, we should only check in U-labels to the repo. But it might be nice to have a tool that could capture all of the files in the repo, and "compile" them into a single file of unique A-labels.

John Colosi
Senior Manager of Product Development
JColosi@Verisign.com
m: 703-967-4062 t: 703-948-3211
12061 Bluemont Way, Reston VA 20190
VerisignInc.com

-----Original Message-----
From: Gavin Brown [mailto:gavin.brown@centralnic.com]
Sent: Monday, January 13, 2014 9:48 AM
To: Colosi, John; gtld-tech@icann.org
Cc: Gould, James; Anderson, Marc
Subject: Re: [gtld-tech] Specification 5 - Country names... again..

On 10/01/2014 15:52, Colosi, John wrote:
> Hi Gavin, it looks like most of the files in the repo are using the utf8 format. But S5.4.3.txt seems to be in utf16. (It starts with a bunch of surrogate pairs.) I wonder if we can standardize on a single format.
I'll see what I can do about converting that file to UTF-8. iconv complains for me when I try to convert that file from utf-16 to utf-8, so I wonder if there has been some mixing of encodings when the file was assembled.
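One way to narrow down where a conversion like this trips is to check for a byte-order mark and then find the byte offset at which decoding fails. The following is only a rough Python sketch under stated assumptions: the helper names `sniff_bom` and `first_decode_error` are hypothetical, and the sample bytes are made up for illustration, not taken from S5.4.3.txt.

```python
def sniff_bom(data: bytes) -> str:
    """Guess an encoding from a leading byte-order mark, if there is one."""
    if data.startswith(b"\xef\xbb\xbf"):
        return "utf-8-sig"
    if data.startswith((b"\xff\xfe", b"\xfe\xff")):
        return "utf-16"
    return "unknown"


def first_decode_error(data: bytes, encoding: str):
    """Return the byte offset where decoding fails, or None if it succeeds."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError as exc:
        return exc.start
    return None


# Made-up sample: a UTF-16 (little-endian, with BOM) line followed by a
# UTF-8 line, imitating a file assembled from differently encoded pieces.
mixed = b"\xff\xfe" + "Österreich\n".encode("utf-16-le") + "日本\n".encode("utf-8")

print(sniff_bom(mixed))                     # what the BOM claims
print(first_decode_error(mixed, "utf-16"))  # where UTF-16 decoding breaks
print(first_decode_error(mixed, "utf-8"))   # where UTF-8 decoding breaks
```

If both decodings fail at different offsets, that would be consistent with mixed encodings, although it does not prove it.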
> I might even suggest using A Labels as copying and pasting and comparing is less error prone (for me). Maybe I just don't have the right tools. If we decide to standardize then I can help with conversions, but wanted to get some input from folks.
Using A-labels would be less error-prone, but also harder for people who speak the relevant languages. Under ideal circumstances, we'd have language experts reviewing the strings, and it would be a real pain for them to have to keep converting A-labels to U-labels and back again. The U-labels are the source code: the stuff that human beings work with.

G.

--
Gavin Brown
Chief Technology Officer
CentralNic Group plc (LSE:CNIC)
Innovative, Reliable and Flexible Registry Services for ccTLD, gTLD and private domain name registries
https://www.centralnic.com/

CentralNic Group plc is a company registered in England and Wales with company number 8576358. Registered Offices: 35-39 Moorgate, London, EC2R 6AR.
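
The "compile" tool suggested at the top of this thread, which reads the U-label files and emits one file of unique A-labels, could be sketched roughly as below. This is a minimal illustration, not an agreed design. It assumes one label per line in UTF-8 `.txt` files, and it uses Python's built-in `idna` codec, which implements the older IDNA2003 rules (RFC 3490); stricter IDNA2008 handling would need a different library. The function names and the `*.txt` pattern are hypothetical.

```python
from pathlib import Path


def to_a_label(u_label: str) -> str:
    """Convert one U-label to its A-label form.

    Assumption: the built-in "idna" codec (IDNA2003) is acceptable here.
    ASCII input passes through unchanged, so it is lowercased explicitly.
    """
    return u_label.strip().encode("idna").decode("ascii").lower()


def compile_a_labels(lines):
    """Return the sorted, de-duplicated A-labels for an iterable of lines."""
    labels = set()
    for line in lines:
        text = line.strip()
        if not text:  # skip blank lines
            continue
        labels.add(to_a_label(text))
    return sorted(labels)


def compile_repo(repo_dir, out_file, pattern="*.txt"):
    """Read every matching file in repo_dir and write the unique A-labels."""
    lines = []
    for path in sorted(Path(repo_dir).glob(pattern)):
        lines.extend(path.read_text(encoding="utf-8").splitlines())
    result = compile_a_labels(lines)
    Path(out_file).write_text("\n".join(result) + "\n", encoding="ascii")
    return result


print(compile_a_labels(["münchen", "MÜNCHEN", "", "example"]))
```

With this split, the U-label files stay as the human-edited source, and the A-label file is a generated artifact that can be rebuilt at any time.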