Computational Design of Proteins Targeting the Conserved Stem Region of Influenza HemagglutininWe describe a general computational method for designing proteins that bind a surface patch of interest on a target macromolecule. Favorable interactions between disembodied amino acid residues and the target surface are identified and used to anchor de novo designed interfaces. The method was used to design proteins that bind a conserved surface patch on the stem of the influenza hemagglutinin (HA) from the 1918 H1N1 pandemic virus. After affinity maturation, two of the designed proteins, HB36 and HB80, bind H1 and H5 HAs with low nanomolar affinity. Further, HB80 inhibits the HA fusogenic conformational changes induced at low pH. The crystal structure of HB36 in complex with 1918/H1 HA revealed that the actual binding interface is nearly identical to that in the computational design model. Such designed binding proteins may be useful for both diagnostics and therapeutics.
Single-mutation fitness landscapes for an enzyme on multiple substrates reveal specificity is globally encodedOur lack of total understanding of the intricacies of how enzymes behave has constrained our ability to robustly engineer substrate specificity. Furthermore, the mechanisms of natural evolution leading to improved or novel substrate specificities are not wholly defined. Here we generate near-comprehensive single-mutation fitness landscapes comprising >96.3% of all possible single nonsynonymous mutations for hydrolysis activity of an amidase expressed in E. coli with three different substrates. For all three selections, we find that the distribution of beneficial mutations can be described as exponential, supporting a current hypothesis for adaptive molecular evolution. Beneficial mutations in one selection have essentially no correlation with fitness for other selections and are dispersed throughout the protein sequence and structure. Our results further demonstrate the dependence of local fitness landscapes on substrate identity and provide an example of globally distributed sequence-specificity determinants for an enzyme.
Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanningJustin R. Klesmith, J.P. Bacik, Emily E. Wrenbeck et al.|Proceedings of the National Academy of Sciences|2017 Proteins are marginally stable, and an understanding of the sequence determinants for improved protein solubility is highly desired. For enzymes, it is well known that many mutations that increase protein solubility decrease catalytic activity. These competing effects frustrate efforts to design and engineer stable, active enzymes without laborious high-throughput activity screens. To address the trade-off between enzyme solubility and activity, we performed deep mutational scanning using two different screens/selections that purport to gauge protein solubility for two full-length enzymes. We assayed a TEM-1 beta-lactamase variant and levoglucosan kinase (LGK) using yeast surface display (YSD) screening and a twin-arginine translocation pathway selection. We then compared these scans with published experimental fitness landscapes. Results from the YSD screen could explain 37% of the variance in the fitness landscapes for one enzyme. Five percent to 10% of all single missense mutations improve solubility, matching theoretical predictions of global protein stability. For a given solubility-enhancing mutation, the probability that it would retain wild-type fitness was correlated with evolutionary conservation and distance to active site, and anticorrelated with contact number. Hybrid classification models were developed that could predict solubility-enhancing mutations that maintain wild-type fitness with an accuracy of 90%. The downside of using such classification models is the removal of rare mutations that improve both fitness and solubility. To reveal the biophysical basis of enhanced protein solubility and function, we determined the crystallographic structure of one such LGK mutant. Beyond fundamental insights into trade-offs between stability and activity, these results have potential biotechnological applications.