As part of my PhD I have produced several data sets, which are available for download from this page. I will continue to release more data as I produce it. All data is released under the Apache 2.0 License. Any questions, please e-mail me:seo01@docDOTicDOTac.uk.

Wikipedia groundtruth.
A manually annotated sample of 1000 Wikipedia pages.

Description

Citation

Geographic Co-occurrence model
An automatically generated geographic co-occurrence model extracted from Wikipedia.

Bibtex

More data has recently been released via my API.

For more information on how this data was generated please read my publications.

 
 
data_release.txt · Last modified: 2009/12/30 17:08 by simon
 
Recent changes RSS feed Driven by DokuWiki Page Impressions Valid XHTML 1.0 Valid CSS