45 datasets found

Tags: linguistic

  • SSF

    Syntactic and semantic framework of the Croatian language
  • Bibliography of Linguistic Literature (BLL) Thesaurus

    The Thesaurus of the Bibliography of Linguistic Literature (BLL Thesaurus) represents a comprehensive bilingual vocabulary for indexing and documentation of linguistically...
  • associations

    A collection of associations and mappings to DBpedia entities. It currently consists of 780,000 human associations from the Edinburgh Associative Thesaurus (as RDF) and a verified...
  • GeoWordNet

    GeoWordNet is a semantic resource built from the full integration of WordNet, GeoNames and the Italian part of MultiWordNet. GeoWordNet Public Dataset contains 3,698,238...
  • DBpedia in Spanish

    These data correspond to the DBpedia ontology, version 2014.
  • PDEV-Lemon

    PDEV is a dictionary which provides insight into how verbs collocate with nouns and other words using an empirically well-founded apparatus of syntactic and semantic categories....
  • lexinfo

    Ontology of lexical categories
  • USAGE review corpus

    This corpus consists of sentiment annotations of Amazon reviews for different product categories in German and English. The reviews themselves are not part of this...
  • KORE 50 NIF NER Corpus

    KORE 50 (AIDA) is a subset of the larger AIDA corpus, which is based on the dataset of the CoNLL 2003 NER task. The dataset aims to capture hard-to-disambiguate mentions of...
  • Linguistic Metadata (LIME) vocabulary

    LIME (LInguistic MEtadata) is a vocabulary for expressing linguistic metadata about linguistic resources and linguistically grounded datasets. The metadata vocabulary has been...
  • MASC-BN-NIF

    This dataset contains the MASC 3.0 corpus, a large English corpus covering a wide range of genres of written and spoken text, enhanced with semantic annotations, both word...
  • EMN

    The Terminology of the European Migration Network in RDF
  • IWN

    This is the dataset corresponding to ItalWordNet as created at the Institute of Computational Linguistics "A. Zampolli" in Pisa. The resource contains single instances such...
  • EuroSentiment

    Gabriela Vulcu, Raul Lario Monje, Mario Munoz, Paul Buitelaar and Carlos A. Iglesias (2014), Linked-Data based Domain-Specific Sentiment Lexicons, In: Proceedings of the 3rd...
  • SIMPLE

    This dataset contains the conversion of the Italian SIMPLE lexicon in different formats including RDF, TTL and a Lemon version of lexical entries with their pointers to senses.
  • The IBL Corpus

    The IBL Corpus was collected by the University of Plymouth and the University of Edinburgh as part of the EPSRC-funded project IBL, Instruction-based Learning for Mobile...
  • eXtended WordNet

    From the website: WordNet is a lexical database for English that has been widely adopted in artificial intelligence and computational linguistics for a variety of practical...
  • Brown Corpus in RDF/NIF

    RDF version of the Brown Corpus (W. N. Francis, H. Kucera; Brown University; 1979). 1,014,312 words in 500 documents, taken from newspaper texts on diverse topics, non-fiction...
  • Wikilinks RDF/NIF

    The Wikilinks corpus is a coreference resolution corpus of very large scale. It contains over 40 million mentions of over 3 million entities. Mentions are manually labeled links...
  • News-100 NIF NER Corpus

    This corpus comprises 100 German news articles from the online news platform news.de. All of the articles were published in 2010 and contain the word Golf. This word...
  • RSS-500 NIF NER CORPUS

    This corpus has been created using a dataset comprising a list of 1,457 RSS feeds as compiled in (Goldhahn et al. 2012). The list includes all major worldwide newspapers and a...
  • DBpedia Spotlight NIF NER Corpus

    Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems,...
  • Reuters-128 NIF NER Corpus

    This English corpus is based on the well-known Reuters-21578 corpus, which contains economic news articles. In particular, we chose 128 articles containing at least one NE....
  • SweFN-RDF

    Swedish FrameNet (SweFN), a lexical-semantic resource in RDF.
  • SALDOM-RDF

    SALDO morphology, a morphological Swedish lexicon in RDF.
You can also access this registry using the API (see API Docs).
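
As a rough illustration of API access, the sketch below builds a tag-filtered search URL and reads dataset titles out of a JSON response. It assumes the registry exposes a CKAN-style `package_search` action, which this page does not confirm. The base URL, the function names, and the sample response are all hypothetical, so check the API Docs for the actual endpoint and response shape.

```python
import json
from urllib.parse import urlencode

# Hypothetical base URL: this page does not state the registry's API address.
BASE_URL = "https://registry.example.org"


def build_tag_search_url(tag, rows=45, base_url=BASE_URL):
    """Build a search URL filtered by tag (assumes a CKAN-style action API)."""
    query = urlencode({"fq": f"tags:{tag}", "rows": rows})
    return f"{base_url}/api/3/action/package_search?{query}"


def dataset_titles(response_text):
    """Return dataset titles from a CKAN-style package_search JSON response."""
    payload = json.loads(response_text)
    if not payload.get("success"):
        raise ValueError("registry reported a failed request")
    results = payload["result"]["results"]
    # Fall back to the machine name when a dataset has no title.
    return [item.get("title") or item.get("name") for item in results]


# Made-up sample response so the parsing step can be tried without network access.
SAMPLE_RESPONSE = json.dumps({
    "success": True,
    "result": {
        "count": 45,
        "results": [
            {"name": "lexinfo", "title": "lexinfo"},
            {"name": "swefn-rdf", "title": "SweFN-RDF"},
        ],
    },
})

if __name__ == "__main__":
    print(build_tag_search_url("linguistic"))
    print(dataset_titles(SAMPLE_RESPONSE))
```

To run this against a live registry, fetch the built URL with any HTTP client and pass the response body to `dataset_titles`. The parsing is kept separate from the request so it can be checked offline.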