Global Database of Events, Language, and Tone

Data and Resources

Grand Data Challenge of 2014 International Conference on Social Computing, Behavioral-Cultural Modeling, & Prediction (SBP14)
Data challenge using GDELT
HTML
The Global Database of Events, Language, and Tone (GDELT)
Homepage. (Used to be http://gdelt.utdallas.edu/)
HTML
International Studies Association paper, April 2013
This paper describes the news sources and some of their characteristics, the various processing steps that are used in generating the data, some comparisons with the KEDS Levant/Reuters and ICEWS/Asia data sets, and some visualizations. We conclude with an outline of planned enhancements to the data in the near future: these include recoding with new WordNet-enhanced dictionaries, the extension of the CAMEO coding to incorporate codes for financial events, disease outbreaks and natural disasters, and the development of an open-source Python-based successor to Tabari which will use parsed input from existing natural language processing tools.
PDF
Historical 1979 - March 2013 Backfile (Events)
This is the full-resolution GDELT event dataset, running from January 1, 1979 through March 31, 2013 and containing all data fields for each event record. The years 1979 through 2005, inclusive, are available as yearly downloads containing all records for each year; from January 2006 onward, data is available as monthly downloads because of the larger number of records per month.
HTML
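The backfile layout described above (yearly downloads for 1979 through 2005, monthly downloads from January 2006 through March 2013) can be enumerated with a short sketch. The function name and the `YYYY` / `YYYYMM` period labels are assumptions made for illustration; they are not the actual download file names, which should be taken from the data page itself.

```python
def backfile_periods():
    """Return one label per backfile download period.

    Labels are 'YYYY' for the yearly downloads (1979-2005) and
    'YYYYMM' for the monthly downloads (2006-01 through 2013-03).
    The label format is hypothetical, not the real file naming.
    """
    # Yearly downloads: 1979 through 2005, inclusive.
    periods = [str(year) for year in range(1979, 2006)]
    # Monthly downloads: January 2006 through March 2013.
    for year in range(2006, 2014):
        last_month = 3 if year == 2013 else 12
        for month in range(1, last_month + 1):
            periods.append(f"{year}{month:02d}")
    return periods


periods = backfile_periods()
print(len(periods), periods[0], periods[-1])
```

Iterating over these labels is one way to check that a local mirror of the backfile has no missing periods.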
Data Dictionary Readme
This codebook provides a quick overview of the fields in the GDELT data file format and their descriptions. GDELT event records are stored in an expanded version of the dyadic CAMEO format, capturing two actors and the action performed by Actor1 upon Actor2.
PDF
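As a rough illustration of the dyadic structure the codebook describes (Actor1 performing an action upon Actor2), a record can be modeled as a small immutable type. This is a simplified sketch: the class name and its three attributes are placeholders chosen for the example, and the real records carry many more fields, which are listed in the codebook.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DyadicEvent:
    """Simplified stand-in for one event record (hypothetical field names)."""

    actor1: str      # code of the actor performing the action
    event_code: str  # event code, kept as a string to preserve leading zeros
    actor2: str      # code of the actor receiving the action

    def describe(self):
        # Render the directed relation: Actor1 acts upon Actor2.
        return f"{self.actor1} -[{self.event_code}]-> {self.actor2}"


event = DyadicEvent(actor1="USA", event_code="010", actor2="RUS")
print(event.describe())
```

The codes in the usage line are sample values only, not an event taken from the data.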
Daily Updates (Events)
This is the full-resolution GDELT event dataset containing all data fields for each event record, updated daily beginning April 1, 2013. Each record ends with one additional field that is not in the backfiles: the Source URL of the article in which the event was found. Each morning, seven days a week, the latest daily update is posted by 4AM CST.
HTML
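Because the daily update records differ from the backfile records only by the trailing Source URL field, code that reads both can peel that last field off. The sketch below assumes tab-delimited records; the function name and the sample line are made up for the example and do not reproduce a real record.

```python
def split_daily_record(line):
    """Split one daily-update record into (backfile_fields, source_url).

    Assumes a tab-delimited record whose final field is the Source URL.
    """
    fields = line.rstrip("\r\n").split("\t")
    return fields[:-1], fields[-1]


# Placeholder record: three stand-in fields followed by a Source URL.
sample_line = "field1\tfield2\tfield3\thttp://example.com/article\n"
fields, source_url = split_daily_record(sample_line)
print(len(fields), source_url)
```

The remaining fields then line up with the backfile layout, so the same downstream parsing can be reused for both.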
Code Lookups
The Historical and Daily Updates versions of GDELT event records store the raw CAMEO 3-digit actor codes and numeric event codes. These lookups give the textual labels for each of those fields, making the data easier to work with for those who have not previously used CAMEO.
CSV
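A lookup of this kind can be loaded into a dictionary and used to translate raw codes into labels. The sketch below assumes a two-column, delimiter-separated file with a header row; the function name, the delimiter default, and the sample rows are illustrative assumptions rather than the contents of the actual lookup files.

```python
import csv
import io


def load_lookup(handle, delimiter="\t"):
    """Read a two-column code/label lookup into a dict.

    Codes are kept as strings so that leading zeros in numeric
    event codes are not lost.
    """
    reader = csv.reader(handle, delimiter=delimiter)
    next(reader, None)  # skip the assumed header row
    return {row[0]: row[1] for row in reader if len(row) >= 2}


# Illustrative rows only, not copied from the real lookup files.
sample = io.StringIO("CODE\tLABEL\nUSA\tUnited States\n010\tExample event label\n")
labels = load_lookup(sample)

# Fall back to the raw code when no label is known.
print(labels.get("USA", "USA"), labels.get("XYZ", "XYZ"))
```

Passing `delimiter=","` covers a comma-separated variant of the same layout.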
GDELT Global Knowledge Graph (GKG)
In a single sentence, the Global Knowledge Graph attempts to connect every person, organization, location, count, theme, news source, and event across the planet into a single massive network that captures what's happening around the world, what its context is, who's involved, and how the world is feeling about it, every single day.
HTML
Snoozl Wiki Page
This page provides an overview and some interpretation / guidance for the data available at http://gdelt.umn.edu/data.html.
HTML

Tags

Additional Info

Author: #BiblioHack
Maintainer: #BiblioHack
Last Updated: 04 Jun 2014, 18:21:07 UTC
Created: 06 Jan 2014, 17:55:08 UTC