On the equivalence of weak learnability and linear separability: New relaxations and efficient boosting algorithms

Shai Shalev-Shwartz, Yoram Singer

Research output: Contribution to journalArticle

22 Scopus citations

Abstract

Boosting algorithms build highly accurate prediction mechanisms from a collection of low-accuracy predictors. To do so, they employ the notion of weak-learnability. The starting point of this paper is a proof which shows that weak learnability is equivalent to linear separability with -1 margin. The equivalence is a direct consequence of von Neumann's minimax theorem. Nonetheless, we derive the equivalence directly using Fenchel duality. We then use our derivation to describe a family of relaxations to the weak-learnability assumption that readily translates to a family of relaxations of linear separability with margin. This alternative perspective sheds new light on known soft-margin boosting algorithms and also enables us to derive several new relaxations of the notion of linear separability. Last, we describe and analyze an efficient boosting framework that can be used for minimizing the loss functions derived from our family of relaxations. In particular, we obtain efficient boosting algorithms for maximizing hard and soft versions of the -1 margin.

Original languageEnglish (US)
Pages (from-to)141-163
Number of pages23
JournalMachine Learning
Volume80
Issue number2-3
DOIs
StatePublished - Sep 1 2010
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence

Keywords

  • Boosting
  • Linear separability
  • Margin
  • Minimax theorem

Cite this