Abstract
The growing volume of electronic documents has made automatic text classification increasingly important, since manual classification is time-consuming. Machine learning is the main approach to automatic text classification: texts are represented, terms are weighted on the basis of the chosen representation, and a classification model is built. The vector space model is the dominant text representation, largely due to its simplicity. Graphs are emerging as an alternative text representation that can capture important information in text, such as term order, term co-occurrence and term relationships, which the vector space model does not consider. Term weighting schemes that use a graph representation have been shown to yield substantially better text classification performance. In this paper, we introduce tw-srw, an effective supervised graph-based term weighting scheme that uses the co-occurrence information in text to increase text classification accuracy. Experimental results show that it outperforms state-of-the-art unsupervised term weighting schemes.
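To illustrate what a graph representation adds over a bag of words, the following is a minimal sketch of a co-occurrence graph with a simple degree-based term weight. It is not the tw-srw scheme, whose details are not given in this abstract; the function names `cooccurrence_graph` and `degree_weights`, the sliding-window size, and the choice of degree as the weight are all assumptions made for illustration.

```python
from collections import defaultdict


def cooccurrence_graph(tokens, window=2):
    """Build an undirected co-occurrence graph (illustrative assumption).

    Terms are nodes; an edge links two distinct terms that appear together
    inside a sliding window of `window` consecutive tokens.
    """
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        graph[term]  # register the node even if it ends up isolated
        for other in tokens[i + 1 : i + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph


def degree_weights(graph):
    """Weight each term by its degree, i.e. its number of distinct neighbours.

    This is a stand-in for a graph-based weight, not the tw-srw weight.
    """
    return {term: len(neighbours) for term, neighbours in graph.items()}


tokens = "graph based term weighting for text classification with term graph".split()
weights = degree_weights(cooccurrence_graph(tokens, window=3))
for term, weight in sorted(weights.items(), key=lambda kv: (-kv[1], kv[0])):
    print(term, weight)
```

Unlike a raw term frequency, the degree depends on which terms occur near each other, so a term that appears in several different contexts receives a higher weight than one that always appears next to the same neighbours.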
Original language | English |
---|---|
Title of host publication | Frontiers in Artificial Intelligence and Applications |
Editors | Gal A. Kaminka, Frank Dignum, Eyke Hüllermeier, Paolo Bouquet, Virginia Dignum, Maria Fox, Frank van Harmelen |
Publisher | IOS Press |
Pages | 1710-1711 |
Number of pages | 2 |
ISBN (Electronic) | 9781614996712 |
Publication status | Published - 2016 |
Externally published | Yes |
Event | 22nd European Conference on Artificial Intelligence, ECAI 2016 - The Hague, Netherlands; Duration: 29 Aug 2016 → 02 Sept 2016 |
Publication series
Name | Frontiers in Artificial Intelligence and Applications |
---|---|
Volume | 285 |
ISSN (Print) | 0922-6389 |
Conference
Conference | 22nd European Conference on Artificial Intelligence, ECAI 2016 |
---|---|
Country/Territory | Netherlands |
City | The Hague |
Period | 29/08/2016 → 02/09/2016 |
Bibliographical note
Publisher Copyright: © 2016 The Authors and IOS Press.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
Keywords
- text classification
- term weighting
- graph
- co-occurrence
ASJC Scopus subject areas
- Artificial Intelligence