-
KAIST silver standard corpus
KAIST silver standard corpus Availability: Freely Avalable Usage: Named Entity Recognition Status:Newly created-finished Description: We propose a novel method to... -
Sistema de Informações Organizacionais do Governo Federal (SIORG)
O Sistema de Informações Organizacionais do Governo Federal (SIORG) contém informações organizacionais da administração pública federal, tais como: nomes, códigos e endereços de... -
Portal da Transparência
Sobre o Portal: (retirado da descrição do portal) O Portal da Transparência do Governo Federal é uma iniciativa da Controladoria-Geral da União (CGU), lançada em novembro de... -
JEL Classification
The Journal of Economic Literature (JEL) Classification System was created and is maintained by the American Economic Association. The AEA provides this widely used resource... -
xLiD-Lexica
Our xLiD-Lexica dataset in RDF (http://km.aifb.kit.edu/resources/xLiD-lexica.nt) contains about 300 million triples of cross-lingual groundings. It is extracted from Wikipedia... -
Syntactic Reference Corpus of Medieval French (SRCMF)
The SRCMF contains the 15 Old French texts with about 280000 words. It has a high-quality manual annotation, based on a linguistically adequate dependency grammar. Annotation... -
AlchemyAPI
AlchemyAPI is helping pioneer a computer’s ability to understand human language and vision. Our web services for real-time text analysis and computer vision give you the... -
ORCID Connecting Research and Researchers
ORCID provides a persistent digital identifier that distinguishes you from every other researcher and, through integration in key research workflows such as manuscript and grant... -
Prov-ONE: Provenance of scientfic workflows
Generated from VisTrails. -
Twitter-PROV: Provenance from Social Networks
Twitter-PROV is a large provenance dataset containing a synthetic representation of data from the popular social network Twitter. -
Provenance Data Sets Highlighting Capture Disparaties (TaPP '14)
This dataset has no description
-
Provenance Reconstruction 1: Version Controlled Documents
The ground truth ( groundtruth.ttl) for the first dataset was generated from a number of github repositories using the Git2PROV (http://git2prov.org) tool. As raw data, you... -
Provenance Reconstruction 2: Human Generated News
The ground truth for the second dataset was created using the sources mentioned in news articles from WikiNews. The link between news articles and their sources is modeled using... -
University of Southampton's Picaso
Provenance Interlinking and Collective Authoring for Scientific Objects Linking scientific objects from pre-built templates with ease using drag-and-drop. Currently supporting:... -
PROV Pings
PROV-Pings is a service that allows you to link publications to their provenance, and query and retrieve it afterwards. It combines a provenance query and pingback service, two... -
Wings workflow provenance dataset (ProvBench '13)
This dataset has no description
-
Global Database of Events, Language, and Tone
The Global Database of Events, Language, and Tone (GDELT) is an initiative to construct a catalog of human societal-scale behavior and beliefs across all countries of the world... -
Linked Open Computer Vision
Interconnecting computer vision datasets -
VAST Challenge 2013 - MC3 - Big Marketing
Background Big Marketing is an international marketing company employing a large staff of marketing executives who create and manage advertising and public relations campaigns... -
VAST Challenge 2010 - Grand Challenge
The dataset contains fictitious information and was created for testing and evaluation of visual analytic tools only. No part of this dataset should be taken as real. -
VAST Challenge 2008 - Grand Challenge
The dataset contains fictitious information and was created for testing and evaluation of visual analytic tools only. No part of this dataset should be taken as real. For the... -
VAST Challenge 2007
The dataset contains fictitious information and was created for testing and evaluation of visual analytic tools only. No part of this dataset should be taken as real. It is Fall... -
VAST Challenge 2011 - MC2 - Computer Networking Operations
The dataset contains fictitious information and was created for testing and evaluation of visual analytic tools only. No part of this dataset should be taken as real. The CEO of... -
VAST Challenge 2011- MC3 - Investigation into Terrorist Activity
The dataset contains fictitious information and was created for testing and evaluation of visual analytic tools only. No part of this dataset should be taken as real.... -
VAST Challenge 2011 - MC1 - Characterization of an Epidemic Spread
A major epidemic has started in the city of Vastopolis. The task is to analyze the data and determine how the epidemic is being spread and whether or not it is contained. The...