4 datasets found

Formats: .txt

Filter Results
  • ChineseSymptomNameRecognition

    This dataset contains the training data for distant supervision from Baidu Baike used in the paper named Combining Distant Supervision with Syntax-based Rules for Chinese...
  • Beijing Air Quality

    This dataset has no description

  • Symptoms in Chinese

    This is a dataset containing entities of symptoms and symptom-related facts. It is extracted from eight mainstream healthcare websites, three Chinese encyclopedia sites as well...
  • Teahouse corpus

    The Teahouse corpus is a set of questions asked at the Wikipedia Teahouse, a peer support forum for new Wikipedia editors. This corpus contains data from its first two years of...
You can also access this registry using the API (see API Docs).