Emotion-aware polarity lexicons for Twitter sentiment analysis

Anil Bandhakavi, Nirmalie Wiratunga*, Stewart Massie, P. Deepak

*Corresponding author for this work

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

Theoretical frameworks in psychology map the relationships between emotions and sentiments. In this paper, we study the role of such mapping for computational emotion detection from text (e.g., social media) with the aim of understanding the usefulness of an emotion-rich corpus of documents (e.g., tweets) for learning polarity lexicons for sentiment analysis. We propose two different methods that leverage a corpus of emotion-labelled tweets to learn word-polarity lexicons. The proposed methods model the emotion corpus using a generative unigram mixture model, combined with the emotion-sentiment mapping proposed in psychology, for automated generation of word-polarity lexicons that capture emotion-rich vocabulary. We compare the quality of the emotion-aware sentiment lexicons learned by the proposed mixture model with those generated using supervised latent Dirichlet allocation (sLDA) and word-document-frequency (WDF) statistics. Sentiment analysis experiments on benchmark Twitter data sets confirm the quality of our proposed lexicons. Further, a comparative analysis with sLDA, WDF-based emotion-aware lexicons, and standard sentiment lexicons that are agnostic to emotion knowledge suggests that the proposed lexicons lead to significantly better performance in both sentiment classification and sentiment intensity prediction tasks.
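The general idea of turning emotion-labelled tweets into a word-polarity lexicon can be illustrated with a minimal sketch. This is not the authors' mixture model: it swaps in a simpler per-emotion smoothed unigram model, and the emotion labels, the emotion-to-polarity mapping (EMOTION_POLARITY), the function name and the whitespace tokenisation are all assumptions made for illustration only.

```python
from collections import Counter, defaultdict

# Hypothetical emotion-to-polarity mapping (an assumption for illustration,
# not the mapping used in the paper).
EMOTION_POLARITY = {"joy": 1, "anger": -1, "fear": -1, "sadness": -1}


def build_polarity_lexicon(labelled_tweets, smoothing=1.0):
    """Return {word: score in [-1, 1]} from (text, emotion) pairs.

    Each emotion gets a Laplace-smoothed unigram model P(word | emotion).
    A word's score is the polarity-weighted sum of P(emotion | word),
    computed under a uniform prior over the emotions seen in the data.
    """
    counts = defaultdict(Counter)  # emotion -> word counts
    vocab = set()
    for text, emotion in labelled_tweets:
        tokens = text.lower().split()  # naive whitespace tokenisation
        counts[emotion].update(tokens)
        vocab.update(tokens)

    emotions = sorted(counts)
    totals = {e: sum(counts[e].values()) for e in emotions}
    lexicon = {}
    for word in vocab:
        likelihood = {
            e: (counts[e][word] + smoothing)
            / (totals[e] + smoothing * len(vocab))
            for e in emotions
        }
        norm = sum(likelihood.values())
        # Emotions missing from the mapping contribute zero polarity.
        lexicon[word] = sum(
            EMOTION_POLARITY.get(e, 0) * likelihood[e] / norm for e in emotions
        )
    return lexicon
```

Under these assumptions, a word that occurs mostly in tweets labelled with a positive emotion receives a score above zero, and a word concentrated in negatively mapped emotions receives a score below zero. The resulting scores could then be summed over a tweet's words as a crude polarity estimate.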

Original language: English
Article number: e12332
Journal: Expert Systems
Early online date: 11 Oct 2018
DOI: 10.1111/exsy.12332
Publication status: Early online date - 11 Oct 2018

Fingerprint

Sentiment Analysis
Polarity
Statistics
Mixture Model
Dirichlet
Emotion
Experiments
Generative Models
Social Media
Comparative Analysis
Leverage
Benchmark

Cite this

Bandhakavi, Anil ; Wiratunga, Nirmalie ; Massie, Stewart ; Deepak, P. / Emotion-aware polarity lexicons for Twitter sentiment analysis. In: Expert Systems. 2018.
@article{cd61b51e705144acaef66f0945cdbf03,
title = "Emotion-aware polarity lexicons for Twitter sentiment analysis",
abstract = "Theoretical frameworks in psychology map the relationships between emotions and sentiments. In this paper, we study the role of such mapping for computational emotion detection from text (e.g., social media) with the aim of understanding the usefulness of an emotion-rich corpus of documents (e.g., tweets) for learning polarity lexicons for sentiment analysis. We propose two different methods that leverage a corpus of emotion-labelled tweets to learn word-polarity lexicons. The proposed methods model the emotion corpus using a generative unigram mixture model, combined with the emotion-sentiment mapping proposed in psychology, for automated generation of word-polarity lexicons that capture emotion-rich vocabulary. We compare the quality of the emotion-aware sentiment lexicons learned by the proposed mixture model with those generated using supervised latent Dirichlet allocation (sLDA) and word-document-frequency (WDF) statistics. Sentiment analysis experiments on benchmark Twitter data sets confirm the quality of our proposed lexicons. Further, a comparative analysis with sLDA, WDF-based emotion-aware lexicons, and standard sentiment lexicons that are agnostic to emotion knowledge suggests that the proposed lexicons lead to significantly better performance in both sentiment classification and sentiment intensity prediction tasks.",
author = "Anil Bandhakavi and Nirmalie Wiratunga and Stewart Massie and P. Deepak",
year = "2018",
month = "10",
day = "11",
doi = "10.1111/exsy.12332",
language = "English",
journal = "Expert Systems",
issn = "0266-4720",
publisher = "Wiley-Blackwell",

}

Emotion-aware polarity lexicons for Twitter sentiment analysis. / Bandhakavi, Anil; Wiratunga, Nirmalie; Massie, Stewart; Deepak, P.

In: Expert Systems, 11.10.2018.

TY - JOUR

T1 - Emotion-aware polarity lexicons for Twitter sentiment analysis

AU - Bandhakavi, Anil

AU - Wiratunga, Nirmalie

AU - Massie, Stewart

AU - Deepak, P.

PY - 2018/10/11

Y1 - 2018/10/11

AB - Theoretical frameworks in psychology map the relationships between emotions and sentiments. In this paper, we study the role of such mapping for computational emotion detection from text (e.g., social media) with the aim of understanding the usefulness of an emotion-rich corpus of documents (e.g., tweets) for learning polarity lexicons for sentiment analysis. We propose two different methods that leverage a corpus of emotion-labelled tweets to learn word-polarity lexicons. The proposed methods model the emotion corpus using a generative unigram mixture model, combined with the emotion-sentiment mapping proposed in psychology, for automated generation of word-polarity lexicons that capture emotion-rich vocabulary. We compare the quality of the emotion-aware sentiment lexicons learned by the proposed mixture model with those generated using supervised latent Dirichlet allocation (sLDA) and word-document-frequency (WDF) statistics. Sentiment analysis experiments on benchmark Twitter data sets confirm the quality of our proposed lexicons. Further, a comparative analysis with sLDA, WDF-based emotion-aware lexicons, and standard sentiment lexicons that are agnostic to emotion knowledge suggests that the proposed lexicons lead to significantly better performance in both sentiment classification and sentiment intensity prediction tasks.

U2 - 10.1111/exsy.12332

DO - 10.1111/exsy.12332

M3 - Article

AN - SCOPUS:85054748701

JO - Expert Systems

JF - Expert Systems

SN - 0266-4720

M1 - e12332

ER -