DBpedia Spotlight NIF NER Corpus

Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems, 2011.

It contains 60 natural language sentences from ten different New York Times articles, with 249 annotated DBpedia entities overall. In the original annotation, the entities are not explicitly bound to mentions within the texts, which leaves it unclear which text span each entity refers to. We have therefore retroactively assigned the entities to their positions within the texts, to the best of our knowledge. The entities dbp:Markup_language and dbp:PBC_CSKA_Moscow could not be linked in the texts, because in each case a more specific entity was also listed and occupies the only position where they could have been placed. For example, "hypertext markup language" has been annotated with dbp:HTML rather than dbp:Markup_language.
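The assignment rule described above can be illustrated with a minimal sketch. It is not the procedure the corpus authors used (the positions were assigned by hand). It only shows, under stated assumptions, why a more general entity ends up unlinked once a more specific one occupies the only possible span. The function name `bind_entities`, the example sentence, and the surface forms paired with each entity are all hypothetical. The corpus itself lists only entities, and the character offsets mirror the begin/end indices used in NIF.

```python
def bind_entities(text, candidates):
    """Assign each entity to a character span of `text`.

    `candidates` is a list of (entity, surface_form) pairs. The surface
    forms are an assumption of this sketch, not part of the corpus.
    Returns (bound, unlinked): `bound` holds (begin, end, entity) triples,
    `unlinked` holds entities for which no free position was left.
    """
    taken = []     # spans already occupied, as (begin, end) pairs
    bound = []
    unlinked = []
    # Longer surface forms first, so the more specific mention wins.
    for entity, surface in sorted(candidates, key=lambda c: -len(c[1])):
        start = text.find(surface)
        while start != -1:
            end = start + len(surface)
            # Accept the span only if it overlaps no occupied span.
            if all(end <= b or start >= e for b, e in taken):
                taken.append((start, end))
                bound.append((start, end, entity))
                break
            start = text.find(surface, start + 1)
        else:
            # No occurrence at all, or every occurrence was occupied.
            unlinked.append(entity)
    return sorted(bound), unlinked


# Invented example sentence, not taken from the corpus.
text = "The page is written in hypertext markup language."
candidates = [
    ("dbp:Markup_language", "markup language"),
    ("dbp:HTML", "hypertext markup language"),
]
bound, unlinked = bind_entities(text, candidates)
print(bound, unlinked)
```

In this sketch dbp:HTML takes the span covering "hypertext markup language", and dbp:Markup_language is reported as unlinked because its only candidate position lies inside that span.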

Additional Info

Field          Value
Author         Magnus Knuth
Maintainer     Magnus Knuth
Last Updated   October 29, 2014, 16:26 (UTC)
Created        September 5, 2014, 08:01 (UTC)
homepage       http://www.yovisto.com/labs/ner-benchmarks/
links:dbpedia  325
triples        3425