[LINK] ABC (and other) RSS feeds
Ivan Trundle
ivan.trundle at alia.org.au
Thu Jun 24 09:34:16 EST 2004
To Howard and other Linkers
>From the author of the ANU website links that parse ABC (and other)
news content:
"My [screen-scraping] script just opens the normal web page and then
runs some rather esoteric regular expressions to extract the headlines.
I just viewed the source and examined it. It's not too hard. The ABC
site uses styles like: <h1 class="indexheadline">. Every now and then
they do an upgrade which breaks things."
He has no objections at all to people using his scripts, but warns that
they might break in future revisions of the ABC website. The hazards of
screen scrpaing...
Warmly
Ivan Trundle
--
Ivan Trundle
Manager, communications and publishing
Australian Library and Information Association
PO Box 6335 Kingston 2604 Australia
ph 02 6215 8232 fx 02 6282 2249
ivan.trundle at alia.org.au http://alia.org.au
More information about the Link
mailing list