-
OpenCyc
About: Now it is even easier to use the rich and diverse collection of real-world concepts in OpenCyc to bring meaning to your semantic web applications! The full OpenCyc... -
Neurocommons text mining pilot
About: The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by... -
MeSH pairs
Data exposed: NLM 2007 MeSH descriptor/qualifier pairs. Size of dump and data set: 13 MB. Openness: open. See http://www.nlm.nih.gov/mesh/termscon.html (basically attribution with... -
Linked ISO 3166-2 Data
About: ISO 3166-2 gives codes for countries and their principal subdivisions. Openness: Published under CC0. (Where is this specified?) -
Entrez Gene
Data exposed: Select fields from Entrez Gene records. Size of dump and data set: 7.7 MB. Notes: NCBI Copyright and Disclaimers. Openness: Data appears to be in the public domain.... -
Billion Triples Challenge Dataset 2008
Data exposed: various dumps. Size of dump and data set: 1 billion triples.