[LINK] ICANN's non-Latin domain names
Marghanita da Cruz
marghanita at ramin.com.au
Tue Jan 5 09:23:11 AEDT 2010
The character set selected in both Thunderbird (email client) and
Firefox (browser) was Western (ISO-8859-1). I tried changing Firefox
to ISO-8859-15, but that doesn't seem to make any difference.
Firefox and Thunderbird both offer a large number of character sets.
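
A small Python sketch, offered as an illustration only and not as a
description of what Firefox does internally, suggests why switching
between these two character sets would not matter for this URL: the ñ
has the same single-byte code in ISO-8859-1 and ISO-8859-15, and only
UTF-8 yields the two bytes that show up in the error page. The helper
name encoded_hex is hypothetical.

```python
# Illustrative only: compare how "ñ" (U+00F1) is encoded in each charset.
def encoded_hex(text, charset):
    """Return the bytes of `text` in `charset` as a lowercase hex string."""
    return text.encode(charset).hex()

for charset in ("iso-8859-1", "iso-8859-15", "utf-8"):
    print(charset, encoded_hex("ñ", charset))
```
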
Marghanita da Cruz wrote:
> Kim Davies wrote:
>> Quoting Marghanita da Cruz on Monday January 04, 2010:
>> |
>> | If I cut the URL
>> | http://www.mañana.com/
>> | from Firefox and paste it here, I get
>> | http://www.xn--maana-pta.com/
>> | (that is www dot xn dash dash maana dash pta dot com)
>>
>> That is likely correct behaviour... Applications that do not understand
>> the IDNA protocol should revert back to the ASCII format - that is one
>> of the reasons it is encoded this way as it gracefully degrades.
>>
>
> Though what is curious is that I can cut
> and paste
> http://www.mañana.com/ within
> Thunderbird (my
> email client) and it remains intact. So,
> I would guess
> it is the clipboard (in Linux) that is
> doing the translation
> to www.xn...
>
>> | While entering the URL
>> | http://www.mañana.com/ in firefox returns:
>> | Address Not Found
>> | Firefox can't find the server at
>> | www.ma%c3%b1ana.com.
>> | (that is www dot ma %c3%b1 ana dot com)
>>
>> The byte string for the eñe (ñ) encoded in UTF-8 is 0xC3B1. Why it is
>> showing up like that could be that you are cutting and pasting the URL
>> from an application that is putting the percent-encoded UTF-8 string in
>> the OS's clipboard, rather than the Unicode string (U+00F1).
>>
>
> The URL is intact - the ...%c... is in
> the 404 error page.
>
> when I click on the url
> http://mañana.com/ (without www) it
> translates to the "xn--maana-pta.com"
>
> Marghanita
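
The two forms of the host name discussed in the quoted text can be
reproduced with Python's standard library. This is only a sketch: it
assumes Python's built-in "idna" codec (which follows the older IDNA
2003 rules) and urllib.parse.quote, and it does not claim to show how
Firefox or the clipboard actually performs the conversion. The
variable names are made up for the example.

```python
from urllib.parse import quote

host = "www.mañana.com"

# IDNA ASCII-compatible form: each non-ASCII label is converted to
# Punycode and given an "xn--" prefix.
ace_host = host.encode("idna").decode("ascii")

# Percent-encoded form: the UTF-8 bytes of ñ (0xC3 0xB1) written as %xx.
# quote() emits uppercase hex; lower() matches the error page's spelling.
escaped_host = quote(host).lower()

print(ace_host)
print(escaped_host)
```

The first form is what a resolver can look up in the DNS; the second
is not a valid host name, which would be consistent with the "Address
Not Found" page quoted above.
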
--
Marghanita da Cruz
http://ramin.com.au
Tel: 0414-869202