The Sample Complexity of Multireference Alignment

  • Amelia Perry
  • , Jonathan Weed
  • , Afonso S. Bandeira
  • , Philippe Rigollet
  • , Amit Singer

Research output: Contribution to journalArticlepeer-review

Abstract

The growing role of data-driven approaches to scientific discovery has unveiled a large class of models that involve latent transformations with a rigid algebraic constraint. Three-dimensional molecule reconstruction in Cryo-Electron Microscopy (cryo-EM) is a central problem in this class. Despite decades of algorithmic and software development, there is still little theoretical understanding of the sample complexity of this problem, that is, the number of images required for three-dimensional reconstruction. Here we consider multireference alignment (MRA), a simple model that captures fundamental aspects of the statistical and algorithmic challenges arising in cryo-EM and related problems. In MRA, an unknown signal is subject to two types of corruption: a latent cyclic shift and the more traditional additive white noise. The goal is to recover the signal at a certain precision from independent samples. While at high signal-to-noise ratio (SNR) the number of observations needed to recover a generic signal is proportional to 1/SNR, we prove that it rises to a surprising 1/SNR3 in the low SNR regime. This precise phenomenon was observed empirically more than 20 years ago for cryo-EM but has remained unexplained to date. Furthermore, our techniques can easily be extended to the heterogeneous MRA model where the samples come from a mixture of signals, as is often the case in applications such as cryo-EM, where molecules may have different conformations. This provides a first step towards a statistical theory for heterogeneous cryo-EM.

Original languageEnglish (US)
Pages (from-to)497-517
Number of pages21
JournalSIAM Journal on Mathematics of Data Science
Volume1
Issue number3
DOIs
StatePublished - 2019

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Computational Mathematics
  • Applied Mathematics

Keywords

  • bispectrum
  • cryo-EM
  • method of invariants
  • multireference alignment
  • tensor decomposition

Fingerprint

Dive into the research topics of 'The Sample Complexity of Multireference Alignment'. Together they form a unique fingerprint.

Cite this