Context-sensitive learning methods for text categorization

William W. Cohen, Yoram Singer

Research output: Contribution to journalConference articlepeer-review

182 Scopus citations

Abstract

Two recently implemented machine learning algorithms, RIPPER and sleeping experts for phrases, are evaluated on a number of large text categorization problems. These algorithms both construct classifiers that allow the 'context' of a word w to affect how (or even whether) the presence or absence of w will contribute to a classification. However, RIPPER and sleeping experts differ radically in many other respects: differences include different notions as to what constitutes a context, different ways of combining contexts to construct a classifier, different methods to search for a combination of contexts, and different criteria as to what contexts should be included in such a combination. In spite of these differences, both RIPPER and sleeping experts perform extremely well across a wide variety of categorization problems, generally outperforming previously applied learning methods. We view this result as a confirmation of the usefulness of classifiers that represent contextual information.

Original languageEnglish (US)
Pages (from-to)307-316
Number of pages10
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
StatePublished - 1996
EventProceedings of the 1996 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 96 - Zurich, Switz
Duration: Aug 18 1996Aug 22 1996

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Context-sensitive learning methods for text categorization'. Together they form a unique fingerprint.

Cite this