Our xLiD-Lexica dataset in RDF (http://km.aifb.kit.edu/resources/xLiD-lexica.nt) contains about 300 million triples of cross-lingual groundings. It is extracted from Wikipedia dumps of July 2013 in English, German, Spanish, Catalan, Slovenian and Chinese, and based on the canonicalized datasets of DBpedia 3.8 containing triples extracted from the respective Wikipedia whose subject and object resource have an equivalent English article. Based on our xLiD-Lexica dataset, we provide a SPARQL endpoint (http://km.aifb.kit.edu/services/xlike-lexicon/) using OpenLink Virtuoso6 as the back-end database engine.

Data and Resources

Additional Info

Field Value
Author Lei Zhang
Last Updated July 29, 2014, 10:20 (UTC)
Created May 16, 2014, 18:28 (UTC)
links:dbpedia unknown
triples 300000000
comments powered by Disqus
comments powered by Disqus