Accelerated enzyme engineering by machine-learning guided cell-free expression

Grant M. Landwehr(Northwestern University), Jonathan W. Bogart(Northwestern University), Carol Magalhaes(Northwestern University), Eric Hammarlund(Northwestern University), Ashty S. Karim(Northwestern University), Michael C. Jewett(Northwestern University)
Nature Communications
January 20, 2025
Cited by 92Open Access
Full Text

Abstract

Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel. While machine learning shows promise in expanding protein engineering efforts, its potential is limited by the challenge of gathering large datasets of sequence-function relationships. Here, authors introduce a platform that integrates cell-free DNA assembly and gene expression to accelerate enzyme engineering.


Related Papers

No related papers found

Powered by citation graph analysis