Global analysis of protein folding using massively parallel design, synthesis, and testing

Gabriel J. Rocklin(University of Washington), Tamuka M. Chidyausiku(University of Washington), Inna Goreshnik(University of Washington), Alex Ford(University of Washington), Scott Houliston(University of Toronto), Alexander Lemak(Princess Margaret Cancer Centre), Lauren Carter(University of Washington), Rashmi Ravichandran(University of Washington), Vikram Khipple Mulligan(University of Washington), Aaron Chevalier(University of Washington), C.H. Arrowsmith(University of Toronto), David Baker(Howard Hughes Medical Institute)
Science
July 14, 2017
Cited by 556

Abstract

Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Although these forces are "encoded" in the thousands of known protein structures, "decoding" them is challenging because of the complexity of natural proteins that have evolved for function, not stability. We combined computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for more than 15,000 de novo designed miniproteins, 1000 natural proteins, 10,000 point mutants, and 30,000 negative control sequences. This analysis identified more than 2500 stable designed proteins in four basic folds-a number sufficient to enable us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment and has the potential to transform computational protein design into a data-driven science.


Related Papers

No related papers found

Powered by citation graph analysis