KAIST silver standard corpus

KAIST silver standard corpus

Availability: Freely Avalable

Usage: Named Entity Recognition

Status:Newly created-finished

Description: We propose a novel method to automatically build named entity corpus based on the DBpedia ontology. Our approach is language independent and easy to be applied to other languages where Wikipedia and DBpedia provided. Our corpus which called KAIST silver standard corpus, includes 6,796,274 sentences and 9,522,298 NEs. NE domain is now focus on just PER, ORG, and LOC. But we are now trying to build DBpedia ontology granularity NE corpus as a future work. You can download freely, and can participate refining corpus in our work flow.

Data and Resources

Additional Info

Field Value
Last Updated March 19, 2015, 15:29 (UTC)
Created May 16, 2014, 18:49 (UTC)
comments powered by Disqus
comments powered by Disqus