Part-of-speech Tagging
Part-of-speech tagging (POS tagging) is the task of labeling each word in a text with its part of speech. A part of speech is a category of words with similar grammatical properties. Common English parts of speech include noun, verb, adjective, adverb, pronoun, preposition, and conjunction.
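To make the task concrete, here is a minimal sketch of a most-frequent-tag baseline in plain Python. It is a toy illustration and not the method behind any result on this page. The tag names (`DET`, `NOUN`, `VERB`), the hand-made training sentences, and the function names `train_most_frequent_tag` and `tag_words` are all assumptions introduced for the example.

```python
from collections import Counter, defaultdict


def train_most_frequent_tag(tagged_sentences):
    """Map each lowercased word to the tag it carries most often in training."""
    counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        for word, tag in sentence:
            counts[word.lower()][tag] += 1
    return {word: tags.most_common(1)[0][0] for word, tags in counts.items()}


def tag_words(words, model, default_tag="NOUN"):
    """Tag each word; unseen words fall back to default_tag (an assumption)."""
    return [(word, model.get(word.lower(), default_tag)) for word in words]


# Toy, hand-made training data (illustrative only, not taken from any corpus).
train = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("barks", "VERB")],
]

model = train_most_frequent_tag(train)
tagged = tag_words(["The", "cat", "sleeps", "quietly"], model)
print(tagged)
```

The unseen word `quietly` receives the fallback tag, which is the kind of error that stronger taggers reduce by using context and word shape.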
Linguistic Data Consortium: Indian Bengali
Indian Language Part-of-Speech Tagset: Bengali (Linguistic Data Consortium (LDC) catalog number LDC2010T16, ISBN 1-58563-561-8) is a corpus developed by Microsoft Research (MSR) India to support part-of-speech (POS) tagging and other data-driven linguistic research on Indian languages in general.
Model | Accuracy | Paper / Source | Code |
---|---|---|---|
Deep Learning (Fasihul et al. 2016) | 93.33 | Deep learning based parts of speech tagger for Bengali | — |
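The table does not say how accuracy is computed. A common convention, assumed here, is token-level accuracy: the percentage of tokens whose predicted tag matches the gold tag. The sketch below shows that calculation under this assumption. The function name `token_accuracy` and the example tag sequences are hypothetical.

```python
def token_accuracy(gold, predicted):
    """Return the percentage of tokens whose predicted tag equals the gold tag.

    gold and predicted are lists of sentences, each a list of tag strings.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same number of sentences")
    total = 0
    correct = 0
    for gold_tags, predicted_tags in zip(gold, predicted):
        if len(gold_tags) != len(predicted_tags):
            raise ValueError("sentence lengths differ between gold and predicted")
        total += len(gold_tags)
        correct += sum(g == p for g, p in zip(gold_tags, predicted_tags))
    if total == 0:
        raise ValueError("cannot compute accuracy on zero tokens")
    return 100.0 * correct / total


# Made-up tag sequences for illustration; 5 of the 6 tokens match.
gold = [["DET", "NOUN", "VERB"], ["PRON", "VERB", "ADV"]]
predicted = [["DET", "NOUN", "VERB"], ["PRON", "VERB", "ADJ"]]

score = token_accuracy(gold, predicted)
print(f"{score:.2f}")  # prints 83.33
```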