View on GitHub

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Sentiment Analysis

Sentiment analysis is the task of classifying the polarity of a given text.

RuSentRel

The RuSentRel dataset consisted of analytical articles from Internet-portal inosmi.ru. These are translated into Russian texts in the domain of international politics obtained from foreign authoritative sources. The collected articles contain both the author’s opinion on the subject matter of the article and a large number of references mentioned between the participants of the described situations. The dataset contains 73 large analytical texts, labeled with about 2000 relations.

Sentiment Attitude Extraction task: Given a subset of documents, in which every document includes: (1) text, (2) a list of mentioned named entities. For each document it is required to complete a list of labeled entity pairs [(s1->o1, l1), … (sn->on, ln)], for which text conveys the presence of the sentiment relation expressed by the subject towards the object (si->oi) with sentiment li ∈ [neg, pos] (see the example below).

Task paper: https://arxiv.org/pdf/1808.08932.pdf

The public leaderboard is available at github repository

Example
… Meanwhile Moscow has repeatedly emphasized that its activity in the Baltic Sea is a response precisely to actions of NATO and the escalation of the hostile approach to Russia near its eastern borders …
(NATO->Russia, neg), (Russia->NATO, neg)

RuSentNE-2023

The dataset for RuSentNE-2023 evaluation is based on the Russian news corpus RuSentNE having rich sentiment-related annotation. The corpus is annotated with named entities and sentiments towards these entities, along with related effects and emotional states. The dataset contains over 11000 sentences from 400+ large texts.

Entity-Oriented Sentiment Analysis task: Given a subset of sentences, in which every sentence includes one or several named entities. For each sentence all of the named entities should be classified into one of three sentiment classes: positive, negative or neutral within the context of a single sentence.

Task paper: https://arxiv.org/pdf/2305.17679.pdf

The public leaderboard is available at github repository

Example
Notorious figure in the country - Berlusconi has been repeatedly accused of financial fraud
(Berlusconi-> neg)