[LINK] ICANN's non-Latin domain names

Marghanita da Cruz marghanita at ramin.com.au
Tue Jan 5 09:16:23 AEDT 2010


Kim Davies wrote:
> Quoting Marghanita da Cruz on Monday January 04, 2010:
> | 
> | If I cut the URL
> | http://www.mañana.com/
> | from Firefox and paste it here, I get
> | http://www.xn--maana-pta.com/
> | (that is www dot xn dash dash maana dash pta dot com)
> 
> That is likely correct behaviour. Applications that do not understand
> the IDNA protocol should fall back to the ASCII format - that is one
> of the reasons it is encoded this way, so that it degrades gracefully.
> 
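
The ASCII fallback described above can be reproduced outside the browser. This is only a minimal sketch, assuming Python 3 and its built-in "idna" codec (which follows the older IDNA 2003 rules); the variable names are illustrative and not part of the original discussion.

```python
# Sketch: convert the Unicode hostname to its ASCII (Punycode) form and back.
# "\u00f1" is the letter ñ, written as an escape to avoid editor encoding issues.
unicode_host = "www.ma\u00f1ana.com"

# Each label containing non-ASCII characters is Punycode-encoded and
# prefixed with "xn--"; plain ASCII labels ("www", "com") are left alone.
ascii_host = unicode_host.encode("idna").decode("ascii")
print(ascii_host)   # www.xn--maana-pta.com

# The conversion is reversible, so an IDNA-aware application can display
# the Unicode form while an older one still sees a valid ASCII name.
round_trip = ascii_host.encode("ascii").decode("idna")
print(round_trip == unicode_host)   # True
```

Either form names the same host, which is why an application that shows the xn-- form is not necessarily misbehaving.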

Though what is curious is that I can cut and paste
http://www.mañana.com/ within Thunderbird (my email client) and it
remains intact. So I would guess it is the clipboard (in Linux) that
is doing the translation to www.xn...

> | While entering the URL
> | http://www.mañana.com/ in Firefox returns:
> | Address Not Found
> | Firefox can't find the server at
> | www.ma%c3%b1ana.com.
> | (that is www dot ma %c3 %b1 ana dot com)
> 
> The byte sequence for the eñe (ñ) encoded in UTF-8 is 0xC3 0xB1. It may
> be showing up like that because you are cutting and pasting the URL
> from an application that puts the UTF-8 encoded byte string on the
> OS's clipboard, rather than the Unicode string (U+00F1).
> 
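
To make the relationship between ñ, its UTF-8 bytes and the %c3%b1 in the error message concrete, here is a small sketch, assuming Python 3 and the standard urllib.parse module. It only illustrates the encoding; it does not show what Firefox or the clipboard actually do internally.

```python
from urllib.parse import quote

# U+00F1 is the code point for ñ.
enye = "\u00f1"

# In UTF-8 this single character becomes two bytes: 0xC3 0xB1.
utf8_bytes = enye.encode("utf-8")
print(utf8_bytes.hex())   # c3b1

# Percent-escaping writes each of those bytes as %XX, which gives the
# form seen in the error page (quote() emits uppercase hex digits).
escaped = quote("ma\u00f1ana")
print(escaped)            # ma%C3%B1ana
```

A hostname in this percent-escaped form is not the IDNA (xn--) form, so a DNS lookup on it would be expected to fail.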

The URL is intact - the ...%c... is in the "Address Not Found"
error page.

When I click on the URL http://mañana.com/ (without www) it
translates to "xn--maana-pta.com".

Marghanita
-- 
Marghanita da Cruz
http://ramin.com.au
Tel: 0414-869202




