High-dimensional factor model and its applications to statistical machine learning

Zhao Chen, Jianqing Fan, Dan Christina Wang

Research output: Contribution to journalReview articlepeer-review

2 Scopus citations

Abstract

This paper reviews the recent developments on factor model and its applications to statistical machine learning. The factor model reduces the dimensionality of variables, and provides a low-rank plus sparse structure for the high-dimensional covariance matrices. Therefore, it attracts much attention in high-dimensional data analysis, and has been widely applied in many fields of sciences, engineering, humanities and social sciences, including economics, finance, genomics, neuroscience, machine learning, and so on. We elaborate how to use principal component analysis method to extract latent factors, estimate their associated factor loadings, idiosyncratic components, and their associated covariance matrices. These methods have been proven to effectively cope with the challenges of big data, such as high dimensionality, strong dependence, heavy-tailed variables, and heterogeneity. In addition, we also focus on the role of the factor model in dealing with high-dimensional statistical learning problems such as covariance matrix estimation, model selection, multiple testing, and prediction. Finally, we illustrate the innate relationships between factor models and modern machine learning problems through several applications, including network analysis, matrix completion, ranking, and mixture models.

Original languageEnglish (US)
Pages (from-to)447-490
Number of pages44
JournalScientia Sinica Mathematica
Volume50
Issue number4
DOIs
StatePublished - Apr 1 2020

All Science Journal Classification (ASJC) codes

  • General Mathematics

Keywords

  • Factor model
  • Factor-adjusted method
  • Model selection
  • Multiple testing
  • PCA
  • Structural covariance matrix

Fingerprint

Dive into the research topics of 'High-dimensional factor model and its applications to statistical machine learning'. Together they form a unique fingerprint.

Cite this