JRC-Names

JRC-Names is a highly multilingual named entity resource for person and organisation names developed by the European Commission’s Joint Research Centre (JRC). JRC-Names consists of large lists of names and their many spelling variants (up to hundreds for a single person), including across scripts (Latin, Greek, Arabic, Cyrillic, Japanese, Chinese, etc.).

The resource is the by-product of the Europe Media Monitor (EMM)) family of applications, which has been analysing up to 220,000 news reports per day, since 2004. EMM recognises names mentioned in the news in over twenty languages and decides automatically for each newly found name whether it belongs to a new entity or whether it is a spelling variant of a previously known entity. This resource allows EMM users to display news about people or organisations even if their names are spelt differently or if the news articles are written in different languages and scripts.

The new linked data edition, accessible through the European Union’s Open Data Portal, offers more information compared to the previously released resource and tool, including: titles and function names that have been historically found next to the person mentions; information about the time period during which name variants and their titles were found; various frequency counts; as well as links to other linked datasets such as DBPedia.

The JRC-Names RDF representation is based on lemon (Lexicon Model for Ontologies) and includes links towards DBpedia, New York Times Open Data and Talk of Europe.

Previous editions includes: the original release by the JRC since September 2011 (consisting of name variant lists and accompanying software) and a first linked data version produced during the MLODE 2012 workshop.

Data and Resources

Additional Info

Field Value
Source https://ec.europa.eu/jrc/en/language-technologies/jrc-names
Author EC Joint Research Center
Maintainer Guillaume Jacquet, Maud Ehrmann, Ralf Steinberger
Version 1.0
Last Updated January 25, 2016, 21:13 (UTC)
Created January 22, 2016, 08:56 (UTC)
links:DBpedia 95,437
triples 72586712
comments powered by Disqus
comments powered by Disqus