Fast accelerated failure time modeling for case-cohort data

Sy Han Chiou, Sangwook Kang, Jun Yan

Research output: Contribution to journalArticlepeer-review

16 Citations (Scopus)


Semiparametric accelerated failure time (AFT) models directly relate the expected failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, challenges from AFT model inferences with full cohort are retained. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoids full blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.

Original languageEnglish
Pages (from-to)559-568
Number of pages10
JournalStatistics and Computing
Issue number4
Publication statusPublished - 2014 Jul 1

Bibliographical note

Publisher Copyright:
© 2013, Springer Science+Business Media New York.

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Computational Theory and Mathematics


Dive into the research topics of 'Fast accelerated failure time modeling for case-cohort data'. Together they form a unique fingerprint.

Cite this