ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions

Fang Han, Han Liu

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

We present a robust alternative to principal component analysis (PCA)—called elliptical component analysis (ECA)—for analyzing high-dimensional, elliptically distributed data. ECA estimates the eigenspace of the covariance matrix of the elliptical data. To cope with heavy-tailed elliptical distributions, a multivariate rank statistic is exploited. At the model-level, we consider two settings: either that the leading eigenvectors of the covariance matrix are nonsparse or that they are sparse. Methodologically, we propose ECA procedures for both nonsparse and sparse settings. Theoretically, we provide both nonasymptotic and asymptotic analyses quantifying the theoretical performances of ECA. In the nonsparse setting, we show that ECA’s performance is highly related to the effective rank of the covariance matrix. In the sparse setting, the results are twofold: (i) we show that the sparse ECA estimator based on a combinatoric program attains the optimal rate of convergence; (ii) based on some recent developments in estimating sparse leading eigenvectors, we show that a computationally efficient sparse ECA estimator attains the optimal rate of convergence under a suboptimal scaling. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)252-268
Number of pages17
JournalJournal of the American Statistical Association
Volume113
Issue number521
DOIs
StatePublished - Jan 2 2018

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • Elliptical component analysis
  • Elliptical distribution
  • Multivariate Kendall’s tau
  • Optimality property
  • Robust estimators
  • Sparse principal component analysis

Fingerprint

Dive into the research topics of 'ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions'. Together they form a unique fingerprint.

Cite this