-
Automated Similarity Judgment Program lexical data
ASJP collects 40 words from 5500 languages in a simplified phonetic representation. More background can be found at http://email.eva.mpg.de/~wichmann/ASJPHomePage.htm -
Analysis of the blog http://www.beppegrillo.it/
Analysis of the blog http://www.beppegrillo.it/. The data cover January 2005 to February 2013. The package is divided into four datasets: - Data on individual posts - Data on... -
AcadOnto
An academic domain ontology populated using the IIT Bombay organization corpus, the web, and linked open data. Usage: Information Extraction, Information Retrieval. Availability:... -
Manually Annotated Sub-Corpus (MASC) of the Open American National Corpus
The Manually Annotated Sub-Corpus (MASC) consists of approximately 500,000 words of contemporary American English written and spoken data drawn from the OPEN AMERICAN NATIONAL... -
Phonetics Information Base and Lexicon (PHOIBLE)
Phonetics Information Base and Lexicon (PHOIBLE) is a data set of phonological inventories with additional linguistic and non-linguistic information. -
Linked Old Germanic Dictionaries
Lexical resources (word lists, etymological dictionaries) for Germanic languages in different historical stages: pre-1100 (incl. Gothic, Old High German, Old English),... -
Glottolog
Glottolog provides information about descriptive literature for all the world's languages. It also provides a language classification as well as knowledge bases for names,... -
Atlante Sintattico d'Italia (ASIt)
The Atlante Sintattico d'Italia, Syntactic Atlas of Italy (ASIt) enterprise builds on a long-standing tradition of collecting and analysing linguistic corpora, which has...