[LINK] ICANN's non-Latin domain names

Marghanita da Cruz marghanita at ramin.com.au
Tue Jan 5 09:23:11 AEDT 2010


The character set selected in both 
thunderbird (email client) and firefox 
(browser) was Western (ISO-8859-1) but I 
tried changing firefox to ISO-8859-15 - 
but that doesn't seem to make any 
difference.

Firefox/Thunderbird does offer a large 
number of character sets.

Marghanita da Cruz wrote:
> Kim Davies wrote:
>> Quoting Marghanita da Cruz on Monday January 04, 2010:
>> | 
>> | If I cut the URL
>> | http://www.mañana.com/
>> | from Firefox and paste it here, I get
>> | http://www.xn--maana-pta.com/
>> | (that is www dot xn dash das maana dash pta dot com )
>>
>> That is likely correct behaviour... Applications that do not understand
>> the IDNA protocol should revert back to the ASCII format - that is one
>> of the reasons it is encoded this way as it gracefully degrades.
>>
> 
> Though what is curious is that I can cut 
> and paste
> http://www.mañana.com/ within 
> Thunderbird (my
> email client) and it remains intact. So, 
> I would guess
> it is the clipboard (in Linux) that is 
> doing the translation
> to www.xn...
> 
>> | While entering the URL
>> | http://www.mañana.com/ in firefox returns:
>> | Address Not Found
>> | Firefox can't find the server at
>> | www.ma%c3%b1ana.com.
>> | (that is www dot ma #c3%b 1ana dot com)
>>
>> The byte string for the enya encoded in UTF-8 is 0xC3B1. Why it is
>> showing up like that could be that you are cutting and pasting the URL
>> from an application that is putting the UTF-8 encoded ASCII string in
>> the OS's clipboard, rather than the Unicode string (U+00F1).
>>
> 
> The URL is intact - the ...%c... is in 
> the 404 error page.
> 
> when I click on the url 
> http://mañana.com/ (without www) it 
> trnslates to the "xn--maana-pta.com"
> 
> Marghanita


-- 
Marghanita da Cruz
http://ramin.com.au
Tel: 0414-869202





More information about the Link mailing list