[LINK] RFI: Bulk-Caching of a Web-Site down onto a PC

Craig Sanders cas at taz.net.au
Tue Oct 10 22:05:27 AEST 2006


On Tue, Oct 10, 2006 at 08:03:27PM +1000, Roger Clarke wrote:
> 5.  Am I missing something?

GNU wget[1] or similar website mirroring tool.  there are dozens of them.

wget is particularly good because you can make it change the links in the
downloaded pages to conform to the new location.

[1] http://www.gnu.org/software/wget/wget.html


note, however, that dynamically generated content (aside from simple
stuff like auto-adding headers and footers to each page) tends not
to mirror very well. as a general (but by no means certain) rule, if
there's a "?" in the URL (to delimit user-supplied data from the actual
url) then it probably wont cache well.


craig

-- 
craig sanders <cas at taz.net.au>           (part time cyborg)



More information about the Link mailing list