An Efficient Optimization Algorithm for Structured Sparse CCA, with Applications to eQTL Mapping

Xi Chen, Han Liu

Research output: Contribution to journal › Article › peer-review

48 Scopus citations

Abstract

In this paper, we develop an efficient optimization algorithm for solving canonical correlation analysis (CCA) with complex structured-sparsity-inducing penalties, including the overlapping-group-lasso penalty and the network-based fusion penalty. We apply the proposed algorithm to an important genome-wide association study problem, expression quantitative trait loci (eQTL) mapping. We show that, with the efficient optimization algorithm, one can easily incorporate rich structural information among genes into the sparse CCA framework, which improves the interpretability of the results. Our optimization algorithm is based on a general excessive gap optimization framework and can scale up to millions of variables. We demonstrate the effectiveness of our algorithm on both simulated and real eQTL datasets.
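
The two penalty families named in the abstract can be illustrated with a minimal sketch. It assumes the common textbook forms (a weighted sum of Euclidean norms over possibly overlapping index groups, and a weighted sum of absolute differences along signed network edges); the paper's exact formulation and weighting may differ. The function names, the toy vector, the groups, and the edges are all hypothetical, and the sketch only evaluates the penalties; it does not implement the excessive gap algorithm.

```python
import numpy as np


def overlapping_group_lasso(v, groups, weights=None):
    """Sum of weighted Euclidean norms over (possibly overlapping) index groups.

    Assumed generic form: sum_g w_g * ||v_g||_2. Not taken from the paper.
    """
    if weights is None:
        weights = [1.0] * len(groups)
    return sum(w * np.linalg.norm(v[list(g)]) for g, w in zip(groups, weights))


def network_fusion(v, edges):
    """Sum of w * |v[i] - s * v[j]| over edges given as (i, j, w, s).

    Assumed generic form; s is +1 or -1 (the sign of the edge relationship).
    """
    return sum(w * abs(v[i] - s * v[j]) for i, j, w, s in edges)


# Hypothetical toy data: a 4-variable canonical vector.
v = np.array([3.0, 4.0, 0.0, 0.0])
groups = [(0, 1), (1, 2), (2, 3)]          # indices 1 and 2 each sit in two groups
edges = [(0, 1, 1.0, 1), (2, 3, 0.5, -1)]  # (i, j, weight, sign)

group_penalty = overlapping_group_lasso(v, groups)
fusion_penalty = network_fusion(v, edges)
print(group_penalty, fusion_penalty)
```

Because groups may share indices, a single coefficient can contribute to several norm terms, which is what makes the overlapping penalty non-separable and motivates a specialized solver.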

Original language: English (US)
Pages (from-to): 3-26
Number of pages: 24
Journal: Statistics in Biosciences
Volume: 4
Issue number: 1
State: Published - May 2012

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry, Genetics and Molecular Biology (miscellaneous)

Keywords

  • Genome-wide association study
  • Group structure
  • Network structure
  • Optimization algorithm
  • Sparse CCA
  • Structured sparsity
  • eQTL mapping
