[LINK] Publishing BBC Metadata on the Web
Tom Worthington
tom.worthington at tomw.net.au
Wed May 25 10:00:53 AEST 2011
Greetings from the opening of the Meta 2011 Conference at ANU University
House in Canberra <http://www.metalounge.org/meta-2011-conference>.
Tom Scott, from the BBC is the first speaker, on Publishing BBC
Metadata. Tom mentioned the Semantic Web in his first few words. He
asked "What is the web?", showing Tim Berners-Lee's original paper
"Information Management: A Proposal" ( CERN, March 1989)
<http://www.w3.org/History/1989/proposal-msw.html>.
Tom demonstrated the BBC Nature website, which in addition to ordinary
web pages, provides structured data, using RSS and RDF and semantic
mark-up using microformats. This data is available for others to use and
is also used by the BBC to create new stories:
<http://www.bbc.co.uk/nature/feedsanddata>.
Tom also mentioned dbpedia, an attempt to structure Wikipedia data:
<http://dbpedia.org/About>.
At this point he argued that there is no metadata and what is commonly
though of is data is actually metadata. In a reference to Stephen
Hawking, Tom said "Turtles all the way down":
<http://en.wikipedia.org/wiki/Turtles_all_the_way_down>.
This is an metaphor for infinite recursion, however, I would argue it is
"metadata all the way down". James Gleick argues in his book "The
Information: A History, a Theory, a Flood", that the ability to reason
abstractly came after writing ("if all horses are white ..."):
<http://blog.tomw.net.au/2011/05/james-gleick-on-information.html>.
That seems unlikely, as I am sure horse breeders reasoned on the nature
of a good horse, before written language. Data and metadata are
intertwined by their nature, not due to a human invention.
Tom argued that we needed to move from the document web to the data web,
the web of things, which is what the semantic web is for. However, after
spending many years trying to understand the semantic web and teach it
to university students, I think this is a concept which needs to be
further refined and simplified to be widely used. Tim Berners-Lee's key
contribution with the World Wide Web was to take an existing complex
electronic document standard (SGML) and simplify it to make something
easy enough to use (HTML). Ever since, information professionals have
argued that HTML is flawed, some tinkered with SGML and produced XML,
others tinkered with HTML to make XHTML, but lost was the simpliccity of
HTML. In my view the semantic web similarly needs simplification, even
if the purists then say it is incomplete.
Tom then explained that the BBC use metadata for program guides. The
importance is not the metadata but the information it describes. This is
the key point which information professionals tend to find so obvious,
that they forget to explain. While they may say metadata is data about
data, but do not say why this is useful. That is a topic I will explore
in my talk to the conference tomorrow, with Senator Lundy on "Designing
for Democratic Dialogue: More then Mating iPads" ( 11.00 am on Thursday
26th May, 2011).
Next on the program today we have Greg Stone, Chief Technology Officer,
Microsoft Australia and Professor John McMillan, Australian Information
Commissioner, who is launching the new government information policy:
<http://blog.tomw.net.au/2011/04/designing-for-democratic-dialogue.html>.
More in my blog at:
<http://blog.tomw.net.au/2011/05/publishing-bbc-metadata-on-web.html>.
--
Tom Worthington FACS CP HLM, TomW Communications Pty Ltd. t: 0419496150
PO Box 13, Belconnen ACT 2617, Australia http://www.tomw.net.au
Adjunct Senior Lecturer, School of Computer Science, The
Australian National University http://cs.anu.edu.au/courses/COMP7310/
Visiting Scientist, CSIRO ICT Centre: http://bit.ly/csiro_ict_canberra
More information about the Link
mailing list