Highly accurate protein structure prediction with AlphaFold

John Jumper(Google DeepMind (United Kingdom)), Richard Evans(Google DeepMind (United Kingdom)), Alexander Pritzel(Google DeepMind (United Kingdom)), Tim Green(Google DeepMind (United Kingdom)), Michael Figurnov(Google DeepMind (United Kingdom)), Olaf Ronneberger(Google DeepMind (United Kingdom)), Kathryn Tunyasuvunakool(Google DeepMind (United Kingdom)), Russ Bates(Google DeepMind (United Kingdom)), Augustin Žídek(Google DeepMind (United Kingdom)), Anna Potapenko(Google DeepMind (United Kingdom)), Alex Bridgland(Google DeepMind (United Kingdom)), Clemens Meyer(Google DeepMind (United Kingdom)), Simon Köhl(Google DeepMind (United Kingdom)), Andrew J. Ballard(Google DeepMind (United Kingdom)), Andrew Cowie(Google DeepMind (United Kingdom)), Bernardino Romera‐Paredes(Google DeepMind (United Kingdom)), Stanislav Nikolov(Google DeepMind (United Kingdom)), Rishub Jain(Google DeepMind (United Kingdom)), Jonas Adler(Google DeepMind (United Kingdom)), Trevor Back(Google DeepMind (United Kingdom)), Stig Petersen(Google DeepMind (United Kingdom)), David Reiman(Google DeepMind (United Kingdom)), Ellen Clancy(Google DeepMind (United Kingdom)), Michał Zieliński(Google DeepMind (United Kingdom)), Martin Steinegger(Seoul National University), Michalina Pacholska(Google DeepMind (United Kingdom)), Tamas Berghammer(Google DeepMind (United Kingdom)), Sebastian W. Bodenstein(Google DeepMind (United Kingdom)), David Silver(Google DeepMind (United Kingdom)), Oriol Vinyals(Google DeepMind (United Kingdom)), Andrew Senior(Google DeepMind (United Kingdom)), Koray Kavukcuoglu(Google DeepMind (United Kingdom)), Pushmeet Kohli(Google DeepMind (United Kingdom)), Demis Hassabis(Google DeepMind (United Kingdom))
Nature
July 15, 2021
Cited by 44,494Open Access
Full Text

Abstract

Abstract Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort 1–4 , the structures of around 100,000 unique proteins have been determined 5 , but this represents a small fraction of the billions of known protein sequences 6,7 . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’ 8 —has been an important open research problem for more than 50 years 9 . Despite recent progress 10–14 , existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14) 15 , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm.


Related Papers

No related papers found

Powered by citation graph analysis