Extended Bayesian information criteria for model selection with large model spaces

Jie Chen(University of British Columbia), Zehua Chen(National University of Singapore)
Biometrika
August 29, 2008
Cited by 2,037

Abstract

The ordinary Bayes information criterion is too liberal for model selection when the model space is large. In this article, we re-examine the Bayesian paradigm for model selection and propose an extended family of Bayes information criteria. The new criteria take into account both the number of unknown parameters and the com-plexity of the model space. Their consistency is established, in particular allowing the number of covariates to increase to infinity with the sample size. Their performance in various situations is evaluated by simulation studies. It is demonstrated that the extended Bayes information criteria incur a small loss in the positive selection rate but tightly control the false discovery rate, a desirable property in many applications. The extended Bayes information criteria are extremely useful for variable selection in problems with a moderate sample size but a huge number of covariates, especially in genome-wide association studies, which are now an active area in genetics research.


Related Papers

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing
Yoav Benjamini, Yosef Hochberg|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1995|108k
Regression Shrinkage and Selection Via the Lasso
Robert Tibshirani|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1996|51.5k