Support vector machine classification for large data sets via minimum enclosing ball clustering

J. Cervantes, X. Li, W. Yu, Kang Li

Research output: Contribution to journalArticlepeer-review

125 Citations (Scopus)

Abstract

Support vector machine (SVM) is a powerful technique for data classification. Despite of its good theoretic foundations and high classification accuracy, normal SVM is not suitable for classification of large data sets, because the training complexity of SVM is highly dependent on the size of data set. This paper presents a novel SVM classification approach for large data sets by using minimum enclosing ball clustering. After the training data are partitioned by the proposed clustering method, the centers of the clusters are used for the first time SVM classification. Then we use the clusters whose centers are support vectors or those clusters which have different classes to perform the second time SVM classification. In this stage most data are removed. Several experimental results show that the approach proposed in this paper has good classification accuracy compared with classic SVM while the training is significantly faster than several other SVM classifiers.
Original languageEnglish
Pages (from-to)611-619
Number of pages9
JournalNeurocomputing
Volume71
Issue number4-6
DOIs
Publication statusPublished - Jan 2008

ASJC Scopus subject areas

  • Artificial Intelligence
  • Cellular and Molecular Neuroscience

Fingerprint

Dive into the research topics of 'Support vector machine classification for large data sets via minimum enclosing ball clustering'. Together they form a unique fingerprint.

Cite this