16 datasets found

Organizations: AKSW Formats: text/turtle

  • DBpedia abstract corpus

    This corpus contains a conversion of Wikipedia abstracts in six languages (Dutch, English, French, German, Italian and Spanish) into the NLP Interchange Format (NIF)....
  • Lidioms

    The LIDIOM dataset is a multilingual RDF representation of idioms in five languages. The data set was crawled and integrated from various sources. For assuring the...
  • JRC-Names-MLODE

    From their web site: JRC-Names is a highly multilingual named entity resource for person and organisation names (called 'entities'). It consists of large lists of names and...
  • aksw.org Research Group dataset

    This dataset contains projects, subgroups, people and pages of the Agile Knowledge Management and Semantic Web (AKSW) Research Group at the University of Leipzig.
  • KORE 50 NIF NER Corpus

    KORE 50[1] (AIDA) is a subset of the larger AIDA corpus, which is based on the dataset of the CoNLL 2003 NER task. The dataset aims to capture hard-to-disambiguate mentions of...
  • ORCID

    ORCID (Open Researcher and Contributor ID) is a nonproprietary alphanumeric code to uniquely identify scientific and other academic authors. This dataset contains RDF conversion...
  • Statbel Corpus

    This corpus contains an RDF conversion of datasets from "Statistics Belgium" (also known as Statbel), which aims at collecting, processing and disseminating relevant, reliable...
  • Global airports in RDF

    This corpus contains an RDF conversion of the Global airports dataset, which was retrieved from openflights.org. The dataset contains information about airport names, their locations,...
  • Lion's Den

    Lion's Den is an RDF repository of link specifications. Lion's Den is intended to be an open community-driven dataset that allows data publishers to also publish their...
  • Brown Corpus in RDF/NIF

    RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). 1,014,312 words in 500 documents, taken from newspaper texts on diverse topics, non-fiction...
  • SentimentWortschatz

    SentimentWortschatz, or SentiWS for short, is a publicly available German-language resource for sentiment analysis, opinion mining etc. It lists positive and negative polarity...
  • News-100 NIF NER Corpus

    This corpus comprises 100 German news articles from the online news platform news.de. All of the articles were published in 2010 and contain the word Golf. This word...
  • RSS-500 NIF NER CORPUS

    This corpus has been created using a dataset comprising a list of 1,457 RSS feeds as compiled in (Goldhahn et al. 2012). The list includes all major worldwide newspapers and a...
  • DBpedia Spotlight NIF NER Corpus

    Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems,...
  • Reuters-128 NIF NER Corpus

    This English corpus is based on the well-known Reuters-21578 corpus, which contains economic news articles. In particular, we chose 128 articles containing at least one NE....
  • BundestagNebeneinkuenfte

    This dataset is part of the project opendata-bundestag.de. It contains side jobs of the members of the German Parliament including their sources, places and amounts of money...
You can also access this registry using the API (see API Docs).