[LINK] deep web

Eric Scheid eric.scheid at ironclad.net.au
Tue Feb 24 09:28:10 AEDT 2009


On 24/2/09 8:59 AM, "Michael Still" <mikal at stillhq.com> wrote:

> Imagine you're a first year computer science student... Surely you can
> think of a way of avoiding infinite loops?

Infinite loops? Sure. That's hypertext defined.

Infinite branching & linking? Googlebot has followed a trillion links so
far, and doesn't look to be stopping any time soon.

Imagine a company contacts directory, organised by name, by initial, by
department name, by department code. How many links to departments would you
follow before giving up? What if you were a spambot crawling for email
addresses, blithely ignoring robots.txt, and every page has oodles of email
addresses. Each page is different and juicy. So what if the directory
web-server appears to be a little slow.

http://en.wikipedia.org/wiki/Teergrube

e.




More information about the Link mailing list