Skip to main navigation Skip to search Skip to main content

Communication-Efficient Distributed Estimation and Inference for Cox’s Model

Research output: Contribution to journalArticlepeer-review

Abstract

Motivated by multi-center biomedical studies that cannot share individual data due to privacy and ownership concerns, we develop communication-efficient iterative distributed algorithms for estimation and inference in the high-dimensional sparse Cox proportional hazards model. We demonstrate that our estimator, even with a relatively small number of iterations, achieves the same convergence rate as the ideal full-sample estimator under very mild conditions. To construct confidence intervals for linear combinations of high-dimensional hazard regression coefficients, we introduce a novel debiased method, establish central limit theorems, and provide consistent variance estimators that yield asymptotically valid distributed confidence intervals. In addition, we provide valid and powerful distributed hypothesis tests for any coordinate element based on a decorrelated score test. We allow time-dependent covariates as well as censored survival times. Extensive numerical experiments on both simulated and real data lend further support to our theory and demonstrate that our communication-efficient distributed estimators, confidence intervals, and hypothesis tests improve upon alternative methods. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Original languageEnglish (US)
Pages (from-to)1736-1746
Number of pages11
JournalJournal of the American Statistical Association
Volume120
Issue number551
DOIs
StatePublished - 2025

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • Communication efficiency
  • Cox’s proportional hazards model
  • Distributed inference
  • High-dimensional
  • Iterative algorithm

Fingerprint

Dive into the research topics of 'Communication-Efficient Distributed Estimation and Inference for Cox’s Model'. Together they form a unique fingerprint.

Cite this