Abstract
The generation and spread of fake news within new and online media sources is emerging as a phenomenon of high societal significance. Combating them using data-driven analytics has been attracting much recent scholarly interest. In this computational social science study, we analyze the textual coherence of fake news articles vis-a-vis legitimate ones. We develop three computational formulations of textual coherence drawing upon the state-of-the-art methods in natural language processing and data science. Two real-world datasets from widely different domains which have fake/legitimate article labellings are then analyzed with respect to textual coherence. We observe apparent differences in textual coherence across fake and legitimate news articles, with fake news articles consistently scoring lower on coherence as compared to legitimate news ones. While the relative coherence shortfall of fake news articles as compared to legitimate ones form the main observation from our study, we analyze several aspects of the differences and outline potential avenues of further inquiry.
Original language | English |
---|---|
Title of host publication | CML PKDD 2020 workshops: workshops of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2020): SoGood 2020, PDFL 2020, MLCS 2020, NFMCP 2020, DINA 2020, EDML 2020, XKDD 2020 and INRA 2020: proceedings |
Editors | Irena Koprinska, Michael Kamp, Annalisa Appice, Corrado Loglisci, Luiza Antonie, Albrecht Zimmermann, Riccardo Guidotti, Özlem Özgöbek, Rita P. Ribeiro, Ricard Gavaldà, João Gama, Linara Adilova, Yamuna Krishnamurthy, Pedro M. Ferreira, Donato Malerba, Ibéria Medeiros, Michelangelo Ceci, Giuseppe Manco, Elio Masciari , Zbigniew W. Ras, Peter Christen, Eirini Ntoutsi, Erich Schubert, Arthur Zimek, Anna Monreale, Przemyslaw Biecek, Salvatore Rinzivillo, Benjamin Kille, Andreas Lommatzsch, Jon Atle Gulla |
Publisher | Springer Cham |
Pages | 591-607 |
ISBN (Electronic) | 9783030659653 |
ISBN (Print) | 9783030659646 |
DOIs | |
Publication status | Published - 02 Feb 2021 |
Event | 8th International Workshop on News Recommendation and Analytics (INRA 2020) - Duration: 14 Sept 2020 → 14 Sept 2020 https://www.ntnu.no/wiki/pages/viewpage.action?pageId=188580623 |
Publication series
Name | Communications in Computer and Information Science |
---|---|
Volume | 1323 |
ISSN (Print) | 1865-0929 |
ISSN (Electronic) | 1865-0937 |
Workshop
Workshop | 8th International Workshop on News Recommendation and Analytics (INRA 2020) |
---|---|
Period | 14/09/2020 → 14/09/2020 |
Internet address |