- Teahouse corpus
The Teahouse corpus is a set of questions asked at the Wikipedia Teahouse, a peer support forum for new Wikipedia editors. This corpus contains data from its first two years of...
- Wikipedia Article Feedback corpus
This dataset contains the entire corpus of feedback submitted on the English, French and German Wikipedias during the Article Feedback v.5 pilot (AFT). The Wikimedia Foundation...
- Wikipedia new user registrations
Historical data on new user account registrations to the English Wikipedia and other large Wikipedias.
- Wikipedia Templates
This dataset lists the top 60 Wikipedia templates that editors, both new and experienced, receive on their Talk pages. It covers the period 2007 to 2011.
- Wikipedia Banner Challenge: Votes file
This file has one row for each vote. For a more detailed file layout, see http://blog.allourideas.org/post/2739358388/download-your-data
- Wikipedia Banner Challenge: Non-votes file
This file has one row for each non-vote (e.g., a voter clicking "I can't decide"). For full file layout details, see:...
- Wikipedia Banner Challenge: Banner file
This file has one row for each banner. For a full file layout, see http://blog.allourideas.org/post/2739358388/download-your-data