Google Books Ngram

Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set).

Each of the links below will directly download a fragment of the given corpus. For instance, the first hundred links below collectively comprise the 1-gram (i.e., individual words) counts for English, as collected from Google's scanned books around July 15, 2009.

Data and Resources

Additional Info

Field Value
Author Google Books
Version 20090715
Last Updated October 10, 2013, 21:06 (UTC)
Created December 17, 2010, 14:31 (UTC)
comments powered by Disqus
comments powered by Disqus