-
dbnary
Extracts of wiktionary data for several languages, structured as an RDF graph, based mainly on the LEMON model. Bulgarian, Dutch, English, Finnish, French, German, Greek,... -
lemonUby
Export of UBY to lemon format -
FAO geopolitical ontology
The FAO geopolitical ontology provides a master reference for geopolitical information, as it manages names in multiple languages (English, French, Spanish, Arabic, Chinese,... -
KAIST silver standard corpus
KAIST silver standard corpus Availability: Freely Avalable Usage: Named Entity Recognition Status:Newly created-finished Description: We propose a novel method to... -
The JMdict (Japanese-Multilingual Dictionary) project
About Overview: The JMdict (Japanese-Multilingual Dictionary) project has at its aim the compilation of a multilingual lexical database with Japanese as the pivot language. The... -
xLiD-Lexica
Our xLiD-Lexica dataset in RDF (http://km.aifb.kit.edu/resources/xLiD-lexica.nt) contains about 300 million triples of cross-lingual groundings. It is extracted from Wikipedia... -
AcadOnto
An academic domain ontology populated using IIT Bombay organization corpus, web and the linked open data. Usage: Information Extraction, Information Retrieval Availability:... -
Manually Annotated Sub-Corpus (MASC) of the Open American National Corpus
The Manually Annotated Sub-Corpus (MASC) consists of approximately 500,000 words of contemporary American English written and spoken data drawn from the OPEN AMERICAN NATIONAL...