Importance of events per independent variable in proportional hazards analysis I. Background, goals, and general strategy

John Concato(VA Connecticut Research and Education Foundation), Peter Peduzzi(Yale University), Theodore Holford(Yale University), Alvan R. Feinstein(Yale University)
Journal of Clinical Epidemiology
December 1, 1995
Cited by 797Open Access
Full Text

Abstract

Multivariable methods of analysis can yield problematic results if methodological guidelines and mathematical assumptions are ignored. A problem arising from a too-small ratio of events per variable (EPV) can affect the accuracy and precision of regression coefficients and their tests of statistical significance. The problem occurs when a proportional hazards analysis contains too few "failure" events (e.g., deaths) in relation to the number of included independent variables. In the current research, the impact of EPV was assessed for results of proportional hazards analysis done with Monte Carlo simulations in an empirical data set of 673 subjects enrolled in a multicenter trial of coronary artery bypass surgery. The research is presented in two parts: Part I describes the data set and strategy used for the analyses, including the Monte Carlo simulation studies done to determine and compare the impact of various values of EPV in proportional hazards analytical results. Part II compares the output of regression models obtained from the simulations, and discusses the implication of the findings.


Related Papers

Regression Models and Life-Tables
D. R. Cox|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1972|39.2k
EDF Statistics for Goodness of Fit and Some Comparisons
Michael A. Stephens|Journal of the American Statistical Association|1974|2.9k
The Risk of Determining Risk with Multivariable Models
John Concato, Alvan R. Feinstein, Theodore R. Holford|Annals of Internal Medicine|1993|1.1k