ChineseNLP

Chinese Emotion Classification

Background

Emotion classification identifies the emotional state of a writer or speaker. This is distinct from sentiment classification, which describes the viewpoint of writers toward their subjects.

Example input/output

Input:

讨厌!你骗我!

Output:

Angry

Standard metrics

Cheng emotion corpus.

Cheng et al 2017 introduce an emotion corpus for Chinese Microblogs. It consists of short posts marked with the following distribution of emotion tags: Joy (11.3%), Angry (3.5%), Sad (2.6%), Fearful (0.6%), Positive (8.2%), Neutral (4.4%), Negative (9.9%), Non-emotion (59.5%). Furthermore, the corpus identifies the sub-span of the post that is the cause of the emotion.

Chen et al 2018 reports that the corpus includes “~3,000 subtweets, ~11,000 instances for EClass, and ~13,000 instances for ECause.”

Test set Genre
Cheng emotion corpus Microblog

Metrics

Results

System F1
[Chen et al 2018] 62.4
[Cheng et al 2017] 58.2

Suggestions? Changes? Please send email to chinesenlp.xyz@gmail.com