Importance of events per independent variable in proportional hazards analysis I. Background, goals, and general strategy

John Concato; Peter Peduzzi; Theodore Holford; Alvan R. Feinstein

doi:10.1016/0895-4356(95)00510-2

Importance of events per independent variable in proportional hazards analysis I. Background, goals, and general strategy

John Concato(VA Connecticut Research and Education Foundation), Peter Peduzzi(Yale University), Theodore Holford(Yale University), Alvan R. Feinstein(Yale University)

Journal of Clinical Epidemiology

December 1, 1995

10.1016/0895-4356(95)00510-2

Cited by 797Open Access

Full Text

Abstract

Multivariable methods of analysis can yield problematic results if methodological guidelines and mathematical assumptions are ignored. A problem arising from a too-small ratio of events per variable (EPV) can affect the accuracy and precision of regression coefficients and their tests of statistical significance. The problem occurs when a proportional hazards analysis contains too few "failure" events (e.g., deaths) in relation to the number of included independent variables. In the current research, the impact of EPV was assessed for results of proportional hazards analysis done with Monte Carlo simulations in an empirical data set of 673 subjects enrolled in a multicenter trial of coronary artery bypass surgery. The research is presented in two parts: Part I describes the data set and strategy used for the analyses, including the Monte Carlo simulation studies done to determine and compare the impact of various values of EPV in proportional hazards analytical results. Part II compares the output of regression models obtained from the simulations, and discusses the implication of the findings.

Related Papers

Regression Models and Life-Tables

D. R. Cox|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1972|39.2k

Bootstrap Methods: Another Look at the Jackknife

B. Efron|The Annals of Statistics|1979|17.4k

EDF Statistics for Goodness of Fit and Some Comparisons

Michael A. Stephens|Journal of the American Statistical Association|1974|2.9k

Tests of statistical hypotheses concerning several parameters when the number of observations is large

Abraham Wald|Transactions of the American Mathematical Society|1943|2.2k

The Risk of Determining Risk with Multivariable Models

John Concato, Alvan R. Feinstein, Theodore R. Holford|Annals of Internal Medicine|1993|1.1k