Improving physical realism, stereochemistry, and side‐chain accuracy in homology modeling: Four approaches that performed well in CASP8

Elmar Krieger(Korea Institute for Advanced Study), Keehyoung Joo(Korea Institute for Advanced Study), Jinwoo Lee(Korea Institute for Advanced Study), Jinwoo Lee(Korea Institute for Advanced Study), Jooyoung Lee(Korea Institute for Advanced Study), Jooyoung Lee(Korea Institute for Advanced Study), Srivatsan Raman(University of Washington), James Thompson(Korea Institute for Advanced Study), Mike Tyka(Korea Institute for Advanced Study), David Baker(Korea Institute for Advanced Study), Kevin Karplus(Korea Institute for Advanced Study)
Proteins Structure Function and Bioinformatics
January 1, 2009
Cited by 1,436Open Access
Full Text

Abstract

A correct alignment is an essential requirement in homology modeling. Yet in order to bridge the structural gap between template and target, which may not only involve loop rearrangements, but also shifts of secondary structure elements and repacking of core residues, high-resolution refinement methods with full atomic details are needed. Here, we describe four approaches that address this "last mile of the protein folding problem" and have performed well during CASP8, yielding physically realistic models: YASARA, which runs molecular dynamics simulations of models in explicit solvent, using a new partly knowledge-based all atom force field derived from Amber, whose parameters have been optimized to minimize the damage done to protein crystal structures. The LEE-SERVER, which makes extensive use of conformational space annealing to create alignments, to help Modeller build physically realistic models while satisfying input restraints from templates and CHARMM stereochemistry, and to remodel the side-chains. ROSETTA, whose high resolution refinement protocol combines a physically realistic all atom force field with Monte Carlo minimization to allow the large conformational space to be sampled quickly. And finally UNDERTAKER, which creates a pool of candidate models from various templates and then optimizes them with an adaptive genetic algorithm, using a primarily empirical cost function that does not include bond angle, bond length, or other physics-like terms.


Related Papers