RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive

Yana Rose(San Diego Supercomputer Center), José M. Duarte(San Diego Supercomputer Center), Robert Lowe(Rutgers, The State University of New Jersey), Joan Segura(San Diego Supercomputer Center), Chunxiao Bi(San Diego Supercomputer Center), Charmi Bhikadiya(Rutgers, The State University of New Jersey), Li Chen(Rutgers, The State University of New Jersey), Alexander Rose(San Diego Supercomputer Center), Sebastian Bittrich(San Diego Supercomputer Center), S.K. Burley(San Diego Supercomputer Center), John Westbrook(Rutgers, The State University of New Jersey)
Journal of Molecular Biology
November 10, 2020
Cited by 243Open Access
Full Text

Abstract

The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interfaces (APIs), and FTP downloads. Herein, we present the architectural redesign of RCSB PDB data delivery services that build on existing PDBx/mmCIF data schemas. New data access APIs (data.rcsb.org) enable efficient delivery of all PDB archive data. A novel GraphQL-based API provides flexible, declarative data retrieval along with a simple-to-use REST API. A powerful new search system (search.rcsb.org) seamlessly integrates heterogeneous types of searches across the PDB archive. Searches may combine text attributes, protein or nucleic acid sequences, small-molecule chemical descriptors, 3D macromolecular shapes, and sequence motifs. The new RCSB.org architecture adheres to the FAIR Principles, empowering users to address a wide array of research problems in fundamental biology, biomedicine, biotechnology, bioengineering, and bioenergy.


Related Papers

No related papers found

Powered by citation graph analysis