6 datasets found

Organisations: AKSW; Formats: text/turtle; Tags: linguistic

  • KORE 50 NIF NER Corpus

    KORE 50[1] (AIDA) is a subset of the larger AIDA corpus, which is based on the dataset of the CoNLL 2003 NER task. The dataset aims to capture hard-to-disambiguate mentions of...
  • Brown Corpus in RDF/NIF

    RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). 1,014,312 words in 500 documents, taken from newspaper texts on diverse topics, non-fiction...
  • News-100 NIF NER Corpus

    This corpus comprises 100 German news articles from the online news platform news.de. All of the articles were published in 2010 and contain the word Golf. This word...
  • RSS-500 NIF NER CORPUS

    This corpus has been created using a dataset comprising a list of 1,457 RSS feeds as compiled in (Goldhahn et al. 2012). The list includes all major worldwide newspapers and a...
  • DBpedia Spotlight NIF NER Corpus

    Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems,...
  • Reuters-128 NIF NER Corpus

    This English corpus is based on the well-known Reuters-21578 corpus, which contains economic news articles. In particular, we chose 128 articles containing at least one named entity (NE)....
You can also access this registry using the API (see API Docs).