Data augmentation for aspect-based sentiment analysis

Guangmin Li*, Hui Wang, Yi Ding, Kangan Zhou, Xiaowei Yan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

19 Citations (Scopus)

Abstract

In recent years, deep learning has been widely used in the field of natural language processing (NLP), achieving spectacular successes in various NLP tasks. These successes are largely due to its capability to automatically learn feature representations from text data. However, the performance of deep learning in NLP can be negatively affected by a lack of sufficiently large labeled corpus for training, resulting in limited improvement in performance. Data augmentation overcomes this small data problem by expanding the sample size for the classes of data in the training corpus. This paper introduces the data augmentation for aspect-based sentiment analysis (ABSA), a classical research topic in NLP that has been applied in various fileds. The study aims to enhance the classification performance of ABSA through various augmentation strategies. Two specific augmentation strategies are presented, part-of-speech (PoS) wise synonym substitution (PWSS) and dependency relation-based word swap (DRAWS), which augment data using PoS, external domain knowledge, and syntactic dependency. These strategies are evaluated through extensive experimentation on four public datasets using three representative deep learning models—aspect-specific graph convolutional network (ASGCN), content attention-based aspect-based sentiment classification (CABASC), and long short-term memory (LSTM) network. Compared with the results without data augmentation, our augmentation strategies achieve a performance gain of up to 11.49% on Macro-F1, with the lowest gain being 2.9%. The experimental results demonstrate that the proposed data augmentation strategies are very useful for training deep learning models on small data corpus.

Original languageEnglish
Pages (from-to)125-133
Number of pages9
JournalInternational Journal of Machine Learning and Cybernetics
Volume14
Early online date18 May 2022
DOIs
Publication statusPublished - Jan 2023

Bibliographical note

Funding Information:
We thank Xiang Dai[] for the great suggestion. This research was supported by Natural Science Foundation of Hubei Province of China (Grant No. 2020CFB828), Hubei Normal University Research Project on Teaching Reform (Grant No. XJ202001), Teaching Research Project of Hubei Normal University (Grant No. 2019030), Research Project of Young Teachers in Hubei Normal University (Grant No. HS2020QN029) and Science and Technology Research Project of Hubei Department of Education (Grant No. D20212503).

Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Keywords

  • Data augmentation
  • Deep learning
  • Dependency syntax
  • Sentiment analysis
  • Text classification

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Data augmentation for aspect-based sentiment analysis'. Together they form a unique fingerprint.

Cite this