[LINK] ICANN's non-Latin domain names
Marghanita da Cruz
marghanita at ramin.com.au
Tue Jan 5 09:16:23 AEDT 2010
Kim Davies wrote:
> Quoting Marghanita da Cruz on Monday January 04, 2010:
> |
> | If I cut the URL
> | http://www.mañana.com/
> | from Firefox and paste it here, I get
> | http://www.xn--maana-pta.com/
> | (that is www dot xn dash dash maana dash pta dot com)
>
> That is likely correct behaviour... Applications that do not understand
> the IDNA protocol should revert back to the ASCII format - that is one
> of the reasons it is encoded this way as it gracefully degrades.
>
Though what is curious is that I can cut and paste
http://www.mañana.com/ within Thunderbird (my email client) and it
remains intact. So, I would guess it is the clipboard (in Linux) that
is doing the translation to www.xn...
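
For anyone wanting to reproduce the translation outside the browser, here
is a minimal sketch. It assumes Python 3 and its built-in "idna" codec
(which implements the older IDNA 2003 rules); the helper names to_ascii
and to_unicode are made up for illustration and are not what Firefox or
the clipboard actually call.

```python
# Sketch: convert a hostname between its Unicode form and its
# ASCII-compatible ("xn--") form using Python's built-in idna codec.
# Assumption: IDNA 2003 behaviour is close enough for this example.

def to_ascii(hostname: str) -> str:
    """Return the ASCII form of a Unicode hostname, label by label."""
    return hostname.encode("idna").decode("ascii")

def to_unicode(hostname: str) -> str:
    """Return the Unicode form of an ASCII ("xn--") hostname."""
    return hostname.encode("ascii").decode("idna")

if __name__ == "__main__":
    print(to_ascii("www.mañana.com"))           # www.xn--maana-pta.com
    print(to_unicode("www.xn--maana-pta.com"))  # www.mañana.com
```

Only the label containing the non-ASCII character changes; "www" and
"com" pass through untouched, which is why the ASCII form still works
in software that knows nothing about IDNA.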
> | While entering the URL
> | http://www.mañana.com/ in firefox returns:
> | Address Not Found
> | Firefox can't find the server at
> | www.ma%c3%b1ana.com.
> | (that is www dot ma %c3 %b1 ana dot com)
>
> The byte string for the enya encoded in UTF-8 is 0xC3B1. Why it is
> showing up like that could be that you are cutting and pasting the URL
> from an application that is putting the UTF-8 encoded ASCII string in
> the OS's clipboard, rather than the Unicode string (U+00F1).
>
The URL is intact - the ...%c... is in the 404 error page.
When I click on the URL http://mañana.com/ (without www) it
translates to "xn--maana-pta.com".
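
The %c3%b1 in the error page matches the UTF-8 bytes Kim mentions. A
small sketch to check this, assuming Python 3 and the standard
urllib.parse module (the function names are hypothetical, and this does
not claim to show what Firefox does internally):

```python
# Sketch: show that percent-encoding "ñ" (U+00F1) as UTF-8 gives %C3%B1.
from urllib.parse import quote, unquote

def utf8_hex(text: str) -> str:
    """Return the UTF-8 bytes of text as a lowercase hex string."""
    return text.encode("utf-8").hex()

def percent_encode(text: str) -> str:
    """Percent-encode non-ASCII characters using their UTF-8 bytes."""
    return quote(text)

def percent_decode(text: str) -> str:
    """Reverse percent-encoding, interpreting the bytes as UTF-8."""
    return unquote(text)

if __name__ == "__main__":
    print(utf8_hex("ñ"))                          # c3b1
    print(percent_encode("mañana"))               # ma%C3%B1ana
    print(percent_decode("www.ma%c3%b1ana.com"))  # www.mañana.com
```

quote() emits uppercase hex digits while the error page shows
lowercase; the two spellings are equivalent.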
Marghanita
--
Marghanita da Cruz
http://ramin.com.au
Tel: 0414-869202