Mapping Texts to Multidimensional Emotional Space: Challenges for Dataset Acquisition in Sentiment Analysis
The cornerstone for any sentiment analysis research is labeled data and its acquisition. Canonical corpuses for this task contain different reviews (movies, restaurants) where sentiment can be derived from reviewer’s explicit rating of a reviewed item. Ratings go with supplied comments, which are us...
Saved in:
Published in | Digital Transformation and Global Society pp. 361 - 367 |
---|---|
Main Authors | , , , |
Format | Book Chapter |
Language | English |
Published |
Cham
Springer International Publishing
|
Series | Communications in Computer and Information Science |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The cornerstone for any sentiment analysis research is labeled data and its acquisition. Canonical corpuses for this task contain different reviews (movies, restaurants) where sentiment can be derived from reviewer’s explicit rating of a reviewed item. Ratings go with supplied comments, which are used as text samples and ratings are converted into labels. Usually emotion labels come in binary form like “negative\positive”.
This simplistic approach works well when we are dealing with binary emotional model, but it turns to fail when we are dealing with more complex emotional models like “Pleasure-Arousal-Dominance (PAD)” or Lövheim’s Cube, when we collect data from various sources and of different types (fiction books, social networks conversations, blog posts etc.) or when we delegate labeling to external assessors.
In the article, we describe which methodological problems we faced while collecting dataset for sentiment analysis backed by Lövheim’s Cube - emotional model that represents an emotion as a point in three-dimensional space of balance of three monoamines (Dopamine, Serotonin and Noradrenaline).
These problems include the choice of necessary metadata to be collected along with text and labels, choice of tools used for labeling and survey design. |
---|---|
ISBN: | 9783030028459 3030028453 |
ISSN: | 1865-0929 1865-0937 |
DOI: | 10.1007/978-3-030-02846-6_29 |