[LINK] ICANN's non-Latin domain names

Kim Davies kim at cynosure.com.au
Tue Jan 5 01:14:32 AEDT 2010


Quoting Marghanita da Cruz on Monday January 04, 2010:
| 
| If I cut the URL
| http://www.mañana.com/
| from Firefox and paste it here, I get
| http://www.xn--maana-pta.com/
| (that is www dot xn dash das maana dash pta dot com )

That is likely correct behaviour... Applications that do not understand
the IDNA protocol should revert back to the ASCII format - that is one
of the reasons it is encoded this way as it gracefully degrades.

| While entering the URL
| http://www.mañana.com/ in firefox returns:
| Address Not Found
| Firefox can't find the server at
| www.ma%c3%b1ana.com.
| (that is www dot ma #c3%b 1ana dot com)

The byte string for the enya encoded in UTF-8 is 0xC3B1. Why it is
showing up like that could be that you are cutting and pasting the URL
from an application that is putting the UTF-8 encoded ASCII string in
the OS's clipboard, rather than the Unicode string (U+00F1).

kim



More information about the Link mailing list