View on GitHub

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Chinese NLP tasks

Entity linking

See here for more information about the task.

Datasets

AIDA CoNLL-YAGO Dataset

Disambiguation-Only Models
Model Micro-Precision Paper / Source Code
Sil et al. (2018) 84.4 Neural Cross-Lingual Entity Linking  
Tsai & Roth (2016) 83.6 Cross-lingual wikification using multilingual embeddings  

Go back to the README

reading comprehension

Dureader Datasets

See here to see the introduction.

Baidu DuReader Dataset

See here to download the Dataset.

Disambiguation-Only Models

See here to see the leaderboard.