Filling the gaps: Gaussian mixture models from noisy, truncated or incomplete samples

P. Melchior, A. D. Goulding

Research output: Contribution to journalArticle

2 Scopus citations

Abstract

Astronomical data often suffer from noise and incompleteness. We extend the common mixtures-of-Gaussians density estimation approach to account for situations with a known sample incompleteness by simultaneous imputation from the current model. The method, called GMMis, generalizes existing Expectation–Maximization techniques for truncated data to arbitrary truncation geometries and probabilistic rejection processes, as long as they can be specified and do not depend on the density itself. The method accounts for independent multivariate normal measurement errors for each of the observed samples and recovers an estimate of the error-free distribution from which both observed and unobserved samples are drawn. It can perform a separation of a mixtures-of-Gaussian signal from a specified background distribution whose amplitude may be unknown. We compare GMMis to the standard Gaussian mixture model for simple test cases with different types of incompleteness, and apply it to observational data from the NASA Chandra X-ray telescope. The PYTHON code is released as an open-source package at https://github.com/pmelchior/pyGMMis.

Original languageEnglish (US)
Pages (from-to)183-194
Number of pages12
JournalAstronomy and Computing
Volume25
DOIs
StatePublished - Oct 2018

All Science Journal Classification (ASJC) codes

  • Astronomy and Astrophysics
  • Computer Science Applications
  • Space and Planetary Science

Keywords

  • Density estimation
  • Missing at random
  • Multivariate Gaussian mixture model
  • Truncated data

Fingerprint Dive into the research topics of 'Filling the gaps: Gaussian mixture models from noisy, truncated or incomplete samples'. Together they form a unique fingerprint.

  • Cite this