Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.


Chunking and Part of Speech Tagging

| Model | Accuracy | Paper / Source | Code |
| --- | --- | --- | --- |
| Dalal, Aniket & Nagaraj, Kumar & Sawant, Uma & Shelke, Sandeep (2006) | 89.346 | Hindi Part-of-Speech Tagging and Chunking : A Maximum Entropy Approach | |

Machine Translation

| Model | BLEU | Paper / Source | Code |
| --- | --- | --- | --- |
| Anoop Kunchukuttan, Pratik Mehta, Pushpak Bhattacharyya (2018) | 12.83 | The IIT Bombay English-Hindi Parallel Corpus | |