-
Neurocommons text mining pilot
About: The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by...
-
MeSH pairs
Data exposed: NLM 2007 MeSH descriptor/qualifier pairs
Size of dump and data set: 13 MB
Openness: OPEN. See http://www.nlm.nih.gov/mesh/termscon.html (basically attribution with...
-
Linked ISO 3166-2 Data
About: Linked ISO 3166-2 Data. ISO 3166-2 defines codes for the principal subdivisions (such as provinces or states) of the countries coded in ISO 3166-1.
Openness: Published under CC0. (Where is this specified?)
-
Billion Triples Challenge Dataset 2008
Data exposed: various dumps
Size of dump and data set: 1 billion triples