Hidden technical debt in Machine learning systems

D. Sculley(Google (United States)), Gary D. Holt(Google (United States)), Daniel Golovin(Google (United States)), Eugene Davydov(Google (United States)), Todd Phillips(Google (United States)), Dietmar Ebner(Google (United States)), Vinay Chaudhary(Google (United States)), Michael Young(Google (United States)), Jean-François Crespo(Google (United States)), Dan Dennison(Google (United States))
Neural Information Processing Systems
December 7, 2015
Cited by 841

Abstract

Machine learning offers a fantastically powerful toolkit for building useful complex prediction systems quickly. This paper argues it is dangerous to think of these quick wins as coming for free. Using the software engineering framework of technical debt, we find it is common to incur massive ongoing maintenance costs in real-world ML systems. We explore several ML-specific risk factors to account for in system design. These include boundary erosion, entanglement, hidden feedback loops, undeclared consumers, data dependencies, configuration issues, changes in the external world, and a variety of system-level anti-patterns.


Related Papers

No related papers found

Powered by citation graph analysis