[LINK] RFI: How Does the Do-Not-Cache Instruction Work?
Avi Miller
avi.miller at gmail.com
Sun Aug 10 21:18:52 AEST 2008
On 10/08/2008, at 8:05 PM, Roger Clarke wrote:
> With some web-pages, I can't do a save of the page, nor of images
> within the page. If I want a copy, I have to do a screen-scrape.
There should be no pages that prevent you from saving the HTML source
content and images. Some do try to make your life more difficult by
using JavaScript to disable the right mouse button, but this is easily
bypassed by either disabling JavaScript or using the Menu -> View
Source option. Alternatively, sites may use Flash to store and display
content, which does prevent offline storage.
There are no-cache directives that are used by web applications to
tell proxy servers and browsers to always re-request content. I found
this page that has a fairly non-technical explanation of the various
caching directives that are available in HTTP/1.1:
http://palisade.plynt.com/issues/2008Jul/cache-control-attributes/
Are you able to provide an example URL for a page who's content you
are unable to save?
cYa,
Avi
More information about the Link
mailing list