[LINK] ICANN's non-Latin domain names
Kim Davies
kim at cynosure.com.au
Tue Jan 5 01:14:32 AEDT 2010
Quoting Marghanita da Cruz on Monday January 04, 2010:
|
| If I cut the URL
| http://www.mañana.com/
| from Firefox and paste it here, I get
| http://www.xn--maana-pta.com/
| (that is www dot xn dash das maana dash pta dot com )
That is likely correct behaviour... Applications that do not understand
the IDNA protocol should revert back to the ASCII format - that is one
of the reasons it is encoded this way as it gracefully degrades.
| While entering the URL
| http://www.mañana.com/ in firefox returns:
| Address Not Found
| Firefox can't find the server at
| www.ma%c3%b1ana.com.
| (that is www dot ma #c3%b 1ana dot com)
The byte string for the enya encoded in UTF-8 is 0xC3B1. Why it is
showing up like that could be that you are cutting and pasting the URL
from an application that is putting the UTF-8 encoded ASCII string in
the OS's clipboard, rather than the Unicode string (U+00F1).
kim
More information about the Link
mailing list