[LINK] Diacritics and Search Engines

David Goldstein wavey_one at yahoo.com
Wed Jan 16 18:23:30 AEDT 2008


Roger,

My guess is that it has something to do with the search engine used and the preferences. I use search engines a lot for my work, and one at least has no problem in picking up characters not in the English language. Go to the preferences and change the language setting and see what happens.

I use alltheweb.com a lot and it picks up non-English characters perfectly. Alltheweb.com finds your paper at www.homelandsecurity.org.au/files/RNSA_Social_Implications07_TEXT.pdf but not at the link you gave below.

Google I find useless at trying to search for more than one language at a time, although it could just be I can't work out the box to check in the preferences!

Cheers
David

----- Original Message ----
From: Roger Clarke <Roger.Clarke at xamax.com.au>
To: link at anu.edu.au
Sent: Wednesday, 16 January, 2008 12:36:16 PM
Subject: [LINK] Diacritics and Search Engines

It's embarrassing to have to admit it (because I've done some work in 
this area), but I've just twigged to the obvious - diacritics such as 
umlauts, acutes and cedillas are not handled well by search-engines.

In the few languages that I'm familiar with, a letter with a 
diacritic is appropriately treated as a variant of the letter, e.g. 
u-umlaut is still a u (although in some languages the unadorned 
letter may not exist, or the two may be treated as different letters).

I tripped over the problem because people have reported that they're 
unable to find my paper from last September:

     What 'Überveillance' Is, and What To Do About It
     [Heaven knows what your email-client did with the u-umlaut ...]
     http://www.anu.edu.au/people/Roger.Clarke/DV/RNSA07.html

If linkers can point to sources that explain this to dubbos like me, 
and what to do about it, I'd greatly appreciate the assistance.

-- 
Roger Clarke                
  http://www.anu.edu.au/people/Roger.Clarke/
                        
Xamax Consultancy Pty Ltd      78 Sidaway St, Chapman ACT 2611
 AUSTRALIA
                    Tel: +61 2 6288 1472, and 6288 6916
mailto:Roger.Clarke at xamax.com.au              
  http://www.xamax.com.au/

Visiting Professor in Info Science & Eng  Australian National
 University
Visiting Professor in the eCommerce Program      University of Hong
 Kong
Visiting Professor in the Cyberspace Law & Policy Centre      Uni of
 NSW
_______________________________________________
Link mailing list
Link at mailman.anu.edu.au
http://mailman.anu.edu.au/mailman/listinfo/link





      Make the switch to the world's best email. Get the new Yahoo!7 Mail now. www.yahoo7.com.au/worldsbestemail






More information about the Link mailing list