On 19/09/2013 12:48, Luis Muñoz wrote:
On Sep 19, 2013, at 4:33 AM, Gavin Brown wrote:
While it's very easy to extract useful data from ISO-3166, the two PDF documents I found are a different matter.
Copying and pasting text (especially non-ASCII text) out of a PDF is potentially risky: when I tried using Adobe Reader on Mac OS X I had very little luck, getting either nothing at all or a series of junk characters.
You'll find that some of these documents simply won't allow you to cut & paste what you want, because parts of the text have been converted to bitmaps inside the document.
So the text will be have to be manually transcribed - by someone competent in each of the six official UN languages (or six people competent in one of the languages). Multiply that effort by N registry operators/backend operators, and that's an awful lot of unreliable, lossy, duplicated work. G. -- Gavin Brown Chief Technology Officer CentralNic Group plc (LSE:CNIC) Innovative, Reliable and Flexible Registry Services for ccTLD, gTLD and private domain name registries https://www.centralnic.com/ CentralNic Group plc is a company registered in England and Wales with company number 8576358. Registered Offices: 35-39 Moorgate, London, EC2R 6AR.