Wikimedia user agents

A dataset of parsed browser user agents for readers and editors of the Wikimedia web properties. The parsed agents are released to help Wikimedia developers decide how best to test their software for the group they are targeting.

The data collection and anonymisation process differed between readers and editors. For readers, a 1:1000 sampled log of pageviews from February 2014 was used. Any user agent with more than 500 sampled requests in a 24-hour period (equivalent to roughly 500,000 actual requests at that sampling rate), coming from at least 500 distinct IP addresses in the sample (nominally 500,000 once scaled), was extracted, along with a count of how many times the agent appeared. For editors, a 90-day global sample of user agents (December 2014 to February 2015) was used; any user agent used by at least 50 distinct users was extracted, along with a count of the associated edits.
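The two thresholding rules described above can be sketched as follows. This is only an illustration, not the code used to build the dataset: the function names, the row layouts, and the choice to treat "a 24-hour period" as a calendar day and to sum an agent's count over the whole sample are all assumptions.

```python
from collections import defaultdict


def filter_reader_agents(rows, min_requests=500, min_ips=500):
    """Keep reader agents that pass both thresholds on at least one day.

    rows: iterable of (day, user_agent, ip) tuples from the sampled log
    (a hypothetical layout; the real log format is not described).
    Returns {user_agent: total number of sampled requests}.
    """
    requests = defaultdict(int)   # (day, agent) -> sampled request count
    ips = defaultdict(set)        # (day, agent) -> distinct IP addresses
    for day, agent, ip in rows:
        requests[(day, agent)] += 1
        ips[(day, agent)].add(ip)

    # "More than" min_requests, and "no fewer than" min_ips distinct IPs.
    qualifying = {
        agent
        for (day, agent), n in requests.items()
        if n > min_requests and len(ips[(day, agent)]) >= min_ips
    }

    # Assumption: the published count is the agent's total over the sample.
    totals = defaultdict(int)
    for (day, agent), n in requests.items():
        if agent in qualifying:
            totals[agent] += n
    return dict(totals)


def filter_editor_agents(rows, min_users=50):
    """Keep editor agents used by at least min_users distinct users.

    rows: iterable of (user_agent, user_id) tuples, one per edit
    (again a hypothetical layout).
    Returns {user_agent: number of edits}.
    """
    edits = defaultdict(int)
    users = defaultdict(set)
    for agent, user in rows:
        edits[agent] += 1
        users[agent].add(user)
    return {a: n for a, n in edits.items() if len(users[a]) >= min_users}
```

Both functions drop the IP addresses and user identifiers once the thresholds have been checked, so only the agent string and an aggregate count remain in the output.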

Data and Resources

Additional Info

Field         Value
Author        Oliver Keyes
Maintainer    Oliver Keyes
Last Updated  March 6, 2015, 21:42 (UTC)
Created       March 6, 2015, 20:10 (UTC)
DOI           10.6084/m9.figshare.1326738