-
Lexvo
About Data exposed: Linguistic Data; Size of dump and data set: ~40 MB; Openness: download dump under CC-BY-SA 3.0 license. The web service additionally provides some parts that are not... -
Freebase
Description "Freebase is an open database of the world's information. It is built by the community and for the community, free for anyone to query, contribute to, built... -
OpenCyc
About Now it is even easier to use the rich and diverse collection of real-world concepts in OpenCyc to bring meaning to your semantic web applications! The full OpenCyc... -
Neurocommons text mining pilot
About The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by... -
MeSH pairs
Data exposed: NLM 2007 MeSH descriptor/qualifier pairs; Size of dump and data set: 13 MB; Openness: OPEN. See http://www.nlm.nih.gov/mesh/termscon.html (basically attribution with... -
Entrez Gene
About Data exposed: Select fields from Entrez Gene records; Size of dump and data set: 7.7 MB; Notes: NCBI Copyright and Disclaimers; Openness: Data appears to be in public domain.... -
Billion Triples Challenge Dataset 2008
Data exposed: various dumps; Size of dump and data set: 1 billion triples