-
Wikipedia Clickstream
This project provides data sets of counts of (referer, resource) pairs extracted from the request logs of Wikipedia. A referer is an HTTP header field that identifies... -
Teahouse corpus
The Teahouse corpus is a set of questions asked at the Wikipedia Teahouse, a peer support forum for new Wikipedia editors. This corpus contains data from its first two years of... -
Scholarly article citations in Wikipedia
About: This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia. License: All files included in this dataset are released under... -
Wikipedia Article Feedback corpus
This dataset contains the entire corpus of feedback submitted on the English, French and German Wikipedias during the Article Feedback v.5 pilot (AFT). The Wikimedia Foundation... -
Wikipedia pageview stats
This is real, accurate hourly snapshot data on access to Wikipedia, captured from the Wikimedia Squid servers. Project counts show the total access in a time period to the... -
Wikipedia user preferences
Data on user preferences set by active Wikipedia editors. Active editors are defined as registered users with at least 5 edits per month in a given project. The dumps were... -
Wikipedia Banner Challenge: Votes file
This file has one row for each vote. For a more detailed file layout, see http://blog.allourideas.org/post/2739358388/download-your-data -
Wikipedia Banner Challenge: Non-votes file
This file has one row for each non-vote (e.g., a voter clicking "I can't decide"). For full file layout details, see:... -
Wikipedia Banner Challenge: Banner file
This file has one row for each banner. For a full file layout, see http://blog.allourideas.org/post/2739358388/download-your-data. -
Wikimedia Research Newsletter corpus
A curated corpus of references on Wikipedia and Wikimedia research, reviewed in the monthly Wikimedia Research Newsletter. -
Wikimedia Fundraiser Public Data
Public data about the Wikimedia Fundraiser. Data is refreshed every 15 minutes and includes the complete historical series since 2006.