Supervised graph-based term weighting scheme for effective text classification

Niloofer Shanavas, Hui Wang, Zhiwei Lin, Glenn Hawe

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)
29 Downloads (Pure)

Abstract

Due to the increase in electronic documents, automatic text classification has gained a lot of importance as manual classification of documents is time-consuming. Machine learning is the main approach for automatic text classification, where texts are represented, terms are weighted on the basis of the chosen representation and a classification model is built. Vector space model is the dominant text representation largely due to its simplicity. Graphs are becoming an alternative text representation that have the ability to capture important information in text such as term order, term co-occurrence and term relationships that are not considered by the vector space model. Substantially better text classification performance has been demonstrated for term weighting schemes which use a graph representation. In this paper, we introduce a graph-based term weighting scheme, tw-srw, which is an effective supervised term weighting method that considers the co-occurrence information in text for increasing text classification accuracy. Experimental results show that it outperforms the state-of-the-art unsupervised term weighting schemes.

Original languageEnglish
Title of host publicationFrontiers in Artificial Intelligence and Applications
EditorsGal A. Kaminka, Frank Dignum, Eyke Hullermeier, Paolo Bouquet, Virginia Dignum, Maria Fox, Frank van Harmelen
PublisherIOS Press
Pages1710-1711
Number of pages2
ISBN (Electronic)9781614996712
DOIs
Publication statusPublished - 2016
Externally publishedYes
Event22nd European Conference on Artificial Intelligence, ECAI 2016 - The Hague, Netherlands
Duration: 29 Aug 201602 Sept 2016

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume285
ISSN (Print)0922-6389

Conference

Conference22nd European Conference on Artificial Intelligence, ECAI 2016
Country/TerritoryNetherlands
CityThe Hague
Period29/08/201602/09/2016

Bibliographical note

Publisher Copyright:
© 2016 The Authors and IOS Press.

Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.

Keywords

  • text classification
  • term weighting
  • graph
  • co-occurrence

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Supervised graph-based term weighting scheme for effective text classification'. Together they form a unique fingerprint.

Cite this