Augment and Reduce: Stochastic Inference for Large Categorical Distributions

  • Francisco J.R. Ruiz
  • , Michalis K. Titsias
  • , Adji B. Dieng
  • , David M. Blei

Research output: Contribution to journalConference articlepeer-review

Abstract

Categorical distributions are ubiquitous in machine learning, e.g., in classification, language models, and recommendation systems. However, when the number of possible outcomes is very large, using categorical distributions becomes computationally expensive, as the complexity scales linearly with the number of outcomes. To address this problem, we propose augment and reduce (A&R), a method to alleviate the computational complexity. A&R uses two ideas: latent variable augmentation and stochastic variational inference. It maximizes a lower bound on the marginal likelihood of the data. Unlike existing methods which are specific to softmax, A&R is more general and is amenable to other categorical models, such as multinomial probit. On several large-scale classification problems, we show that A&R provides a tighter bound on the marginal likelihood and has better predictive performance than existing approaches.

Original languageEnglish (US)
Pages (from-to)4403-4412
Number of pages10
JournalProceedings of Machine Learning Research
Volume80
StatePublished - 2018
Externally publishedYes
Event35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sweden
Duration: Jul 10 2018Jul 15 2018

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Augment and Reduce: Stochastic Inference for Large Categorical Distributions'. Together they form a unique fingerprint.

Cite this