Exploring Generalization in Deep Learning

Behnam Neyshabur (Toyota Technological Institute at Chicago), Srinadh Bhojanapalli (Toyota Technological Institute at Chicago), David McAllester (Toyota Technological Institute at Chicago), Nathan Srebro (Toyota Technological Institute at Chicago)
Neural Information Processing Systems
June 27, 2017
Cited by 489

Abstract

With a goal of understanding what drives generalization in deep networks, we consider several recently suggested explanations, including norm-based control, sharpness and robustness. We study how these measures can ensure generalization, highlighting the importance of scale normalization, and making a connection between sharpness and PAC-Bayes theory. We then investigate how well the measures explain different observed phenomena.

