-
English Wikipedia pageviews by second
This file contains a count of pageviews to the English-language Wikipedia from 2015-03-16T00:00:00 to 2015-04-25T15:59:59, grouped by timestamp (down to a one-second resolution... -
Wikipedia Clickstream
This project contains data sets of counts of (referer, resource) pairs extracted from the request logs of Wikipedia. A referer is an HTTP header field that identifies... -
Wikidata
The free knowledge base anyone can edit https://wikidata.org -
Wikimedia user agents
A dataset of parsed reader and editor browser agents from the Wikimedia web properties. The intent behind releasing the parsed agents is to make it easier for Wikimedia... -
Scholarly article citations in Wikipedia
About: This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia. License: All files included in this dataset are released under... -
Wikipedia new user registrations
Historical data on new user account registrations to the English Wikipedia and other large Wikipedias. -
Wikipedia user preferences
Data on user preferences set by active Wikipedia editors. Active editors are defined as registered users with at least 5 edits per month in a given project. The dumps were... -
Wikipedia Editor Engagement Experiments: Timestamp position modification
This experiment looks at the effects of linking to the revision history of Wikipedia articles with a prominent "last modified" timestamp. Currently, the only way for readers to... -
Wikipedia Banner Challenge: Votes file
This file has one row for each vote. For a more detailed file layout, see http://blog.allourideas.org/post/2739358388/download-your-data -
Wikipedia Banner Challenge: Non-votes file
This file has one row for each non-vote (e.g., a voter clicking "I can't decide"). For full file layout details, see:... -
Wikipedia Banner Challenge: Banner file
This file has one row for each banner. For a full file layout, see http://blog.allourideas.org/post/2739358388/download-your-data. -
Wikipedia article ratings
A complete anonymized dump of 11M article ratings collected over 1 year (July 2011 - July 2012) from the English Wikipedia. Read more... -
Wikimedia Research Newsletter corpus
A curated corpus of references on Wikipedia and Wikimedia research, reviewed in the monthly Wikimedia Research Newsletter. -
Wikimedia Fundraiser Public Data
Public data about the Wikimedia Fundraiser. Data is refreshed every 15 minutes and includes the complete historical series since 2006. -
Wikichallenge - Training
This is a non-random dataset containing the edit histories of about 47,000 editors. This can be used for machine learning purposes and the outcome variable is the number of... -
EPIC/Oxford Wikipedia quality assessment
This dataset comprises the full, anonymized set of responses from the blind assessment of a sample of Wikipedia articles across languages and disciplines by academic experts....