Abstract
Online reviews have become critical in informing purchasing decisions, making the detection of fake reviews a crucial challenge to tackle. Many different Machine Learning based solutions have been proposed, using various data representations such as n-grams or document embeddings. In this paper, we first explore the effectiveness of different data representations, including emotion, document embedding, n-grams, and noun phrases in embedding format, for fake reviews detection. We evaluate these representations with various state-of-the art deep learning models, such as a BILSTM,LSTM, GRU, CNN, and MLP. Following this, we propose to incorporate different data representations and classification models using early and late data fusion techniques in order to improve the prediction performance. The experiments are conducted on four datasets: Hotel, Restaurant, Amazon, and Yelp. The results demonstrate that a combination of different data representations significantly outperforms any single data representation.
Original language | English |
---|---|
Title of host publication | Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023 |
Editors | Ruslan Mitkov, Galia Angelova |
Publisher | Incoma Ltd |
Pages | 730-738 |
ISBN (Electronic) | 9789544520922 |
Publication status | Published - 01 Sept 2023 |
Event | Recent Advances in Natural Language Processing Conference 2023 - Varna, Bulgaria Duration: 30 Aug 2023 → 08 Sept 2023 https://acl-bg.org/RANLP%202021.html |
Publication series
Name | Recent Advances in Natural Language Processing |
---|---|
ISSN (Electronic) | 2603-2813 |
Conference
Conference | Recent Advances in Natural Language Processing Conference 2023 |
---|---|
Abbreviated title | RANLP 2023 |
Country/Territory | Bulgaria |
City | Varna |
Period | 30/08/2023 → 08/09/2023 |
Internet address |