This is a quick spot to post an OWL onotology to model entities mentioned in the Australian Hansard.
grokHansard.xsl - a GRDDL friendly stylesheet to extract out entities from hansard xml
Example input
Example output
It would be really neat to see the hansard.xsd published via the official website; or a hansard namespace (http://parlinfo.aph.gov.au/parlInfo/ns#).
That way, you could really easily add a GRDDL transformation in the XSD. Even if you didn't do that, you could publish
<hansard
xmlns:grddl='http://www.w3.org/2003/g/data-view#'
grddl:transformation="http://www.lauken.com/doconnor/politics/grokHansard.xsl">
<!-- content here -->
</hansard>
... and semantic web consumers can do their thing.
Also, mashing up news rss feeds with OpenCalaisis + linked data with this data would be really incredibly awesome.
See also: OpenAustralia.org, and the awesome Parlinfo search.