-
Lexvo
About Data exposed: Linguistic Data. Size of dump and data set: ~40MB. Openness: Download dump: CC-BY-SA 3.0 license. The web service additionally provides some parts that are not...
-
OpenCyc
About Now it is even easier to use the rich and diverse collection of real-world concepts in OpenCyc to bring meaning to your semantic web applications! The full OpenCyc...
-
Neurocommons text mining pilot
About The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by...