Efficient divide-and-conquer classification based on parallel feature-space decomposition for distributed systems

Qi Guo, Bo Wei Chen, Seungmin Rho, Wen Ji, Feng Jiang, Xianyang Ji, Sun Yuan Kung

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

This paper presents a divide-and-conquer (DC) approach based on feature-space decomposition for classification. When large-scale data sets are present, typical approaches usually employed truncated kernel methods on the feature space or DC approaches on the sample space. However, this did not guarantee separability between classes, owing to overfitting. To overcome such problems, this paper proposes a novel DC approach on feature spaces consisting of three steps. First, we divide the feature space into several subspaces using the decomposition method proposed in this paper. Subsequently, these feature subspaces are sent into individual local classifiers for training. Finally, the outcome of local classifiers are fused together to generate the final classification results. We also propose a Cascade-TRBFKRR classifier to reweight training samples for data refinement. Experiments on large-scale data sets are carried out for performance evaluation. The results show that the error rates of the proposed DC method decreased compared with the state-of-the-art fast support vector machine solvers, e.g., reducing error rates by 10.53% and 7.53% on RCV1 and covtype data sets, respectively.

Original languageEnglish (US)
Article number7293604
Pages (from-to)1492-1498
Number of pages7
JournalIEEE Systems Journal
Volume12
Issue number2
DOIs
StatePublished - Jun 2018

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Information Systems
  • Computer Science Applications
  • Computer Networks and Communications
  • Electrical and Electronic Engineering

Keywords

  • Classification
  • Divide and conquer (DC)
  • Feature-space division
  • Featurespace decomposition
  • Fusion

Fingerprint

Dive into the research topics of 'Efficient divide-and-conquer classification based on parallel feature-space decomposition for distributed systems'. Together they form a unique fingerprint.

Cite this