Web Tables

This page provides a large corpus of HTML tables for public download. The corpus was extracted from the 2012 version of the Common Crawl and contains 147 million relational Web tables. Below we give instructions for downloading the corpus, along with basic statistics about the tables' content.

Data and Resources

Additional Info

Author: Petar Ristroski
Last Updated: August 3, 2014, 10:10 (UTC)
Created: August 3, 2014, 10:09 (UTC)