Zbiory danych
-
DBpedia abstract corpus
This corpus contains a conversion of Wikipedia abstracts in six languages (dutch, english, french, german, italian and spanish) into the I used the NLP Interchange Format (NIF).... -
KORE 50 NIF NER Corpus
KORE 50[1] (AIDA) is a subset of the larger AIDA corpus, which is based on the dataset of the CoNLL 2003 NER task. The dataset aims to capture hard to disambiguate mentions of... -
NeuroLex Wiki
NeuroLex.org is a freely editable semantic wiki for community-based curation of the terms used in Neuroscience. It is a joint project of the Neuroscience Information Framework... -
schema.org
Schema.org markup schema to provide structured metadata for websites and beyond. Linked data variant provided by http://schema.rdfs.org/ -
Brown Corpus in RDF/NIF
RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). 1,014,312 words in 500 documents, taken from newspapers texts on diverse topics, non-fiction... -
MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis
Sentence-layer annotation represents the most coarse-grained annotation in this corpus. We adhere to definitions of objectivity and subjectivity introduced in (Wiebe et al.,... -
SentimentWortschatz
SentimentWortschatz, or SentiWS for short, is a publicly available German-language resource for sentiment analysis, opinion mining etc. It lists positive and negative polarity... -
Wikilinks RDF/NIF
The Wikilinks corpus is a coreference resolution corpus of very large scale. It contains over 40 million mentions of over 3 million entities. Mentions are manually labeled links... -
News-100 NIF NER Corpus
This corpus comprises 100 German news articles from the online news platform news.de. All of the articles were published in the year of 2010 and contain the word Golf. This word... -
RSS-500 NIF NER CORPUS
This corpus has been created using a dataset comprising a list of 1,457 RSS feeds as compiled in (Goldhahn et al. 2012). The list includes all major worldwide newspapers and a... -
DBpedia Spotlight NIF NER Corpus
Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems,... -
Reuters-128 NIF NER Corpus
This English corpus is based on the well known Reuters-21578 corpus which contains economic news articles. In particular, we chose 128 articles containing at least one NE.... -
BundestagNebeneinkuenfte
This dataset is part of the project opendata-bundestag.de. It contains side jobs of the members of the German Parliament including their sources, places and amounts of money...