MIBiG 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters

Barbara R. Terlouw(Wageningen University & Research), Kai Blin(Novo Nordisk Foundation), Jorge C. Navarro-Muñoz(Westerdijk Fungal Biodiversity Institute), Nicole E. Avalon(Scripps Institution of Oceanography), Marc G. Chevrette(University of Florida), Susan Egbert(University of Manitoba), Sang Hyeon Lee(Simon Fraser University), David Meijer(Wageningen University & Research), Michael J. J. Recchia(Simon Fraser University), Zachary L. Reitz(Wageningen University & Research), Jeffrey A. van Santen(Simon Fraser University), Nelly Sélem‐Mójica(Universidad Nacional Autónoma de México), Thomas Tørring(Aarhus University), Liana Zaroubi(Simon Fraser University), Mohammad Alanjary(Wageningen University & Research), Gajender Aleti(Tennessee State University), César Aguilar(Purdue University West Lafayette), Suhad A. A. Al-Salihi(University of Technology - Iraq), Hannah E. Augustijn(Leiden University), J. Abraham Avelar‐Rivas, Luis Alfredo Avitia-Dominguez(Leiden University), Francisco Barona‐Gómez(Leiden University), Jordan Bernaldo-Agüero(Universidad Nacional Autónoma de México), Vincent A. Bielinski(J. Craig Venter Institute), Friederike Biermann(Goethe University Frankfurt), Thomas Booth(The University of Western Australia), Víctor J. Carrión(Leiden University), Raquel Castelo‐Branco(Universidade do Porto), Fernanda O. Chagas(Universidade Federal do Rio de Janeiro), Pablo Cruz‐Morales(Novo Nordisk Foundation), Chao Du(Leiden University), Katherine Duncan(University of Strathclyde), Athina Gavriilidou(University of Tübingen), Damien Gayrard(John Innes Centre), Karina Gutiérrez-García(Carnegie Institution for Science), Kristina Haslinger(University of Groningen), Eric J. N. Helfrich(Goethe University Frankfurt), Justin J. J. van der Hooft(University of Johannesburg), AFIF PRANAYA JATI(Indonesian Pediatric Society), Edward Kalkreuter(Scripps Research Institute), Nikolaos Kalyvas(Westerdijk Fungal Biodiversity Institute), Kyo Bin Kang(Sookmyung Women's University), Satria A. Kautsar(Scripps Research Institute), Wonyong Kim(Sunchon National University), Aditya M. Kunjapur(University of Delaware), Yong‐Xin Li(University of Hong Kong), Geng-Min Lin(Massachusetts Institute of Technology), Catarina Loureiro(Wageningen University & Research), Joris J. R. Louwen(Wageningen University & Research), Nico L L Louwen(Wageningen University & Research), George Lund(Rothamsted Research), Jonathan Parra(Universidad de Costa Rica), Benjamin Philmus(Oregon State University), Bita Pourmohsenin(University of Tübingen), Lotte J. U. Pronk(Wageningen University & Research), Adriana Rego(Universidade do Porto), Rex Devasahayam Arokia Balaya(Yenepoya University), Serina L. Robinson(Swiss Federal Institute of Aquatic Science and Technology), Luis Rodrigo Rosas-Becerra(Leiden University), Eve Tallulah Roxborough(University of Nottingham), Michelle Schorn(Wageningen University & Research), Darren Scobie(University of Strathclyde), Kumar Saurabh Singh(Wageningen University & Research), Nika Sokolova(University of Groningen), Xiaoyu Tang(Shenzhen Bay Laboratory), Daniel Udwary(Joint Genome Institute), Aruna Vigneshwari(University of Szeged), Kristiina Vind(Wageningen University & Research), Sophie P. J. M. Vromans(Wageningen University & Research), Valentin Waschulin(University of Warwick), Sam E. Williams(University of Bristol), Jaclyn M. Winter(University of Utah), Thomas E. Witte(University of Ottawa), Huali Xie(Chinese Academy of Agricultural Sciences), Dong Yang(University of Florida), Jingwei Yu(Southern University of Science and Technology), Mitja M. Zdouc(Wageningen University & Research), Zheng Zhong(Wageningen University & Research), Jérôme Collemare(Westerdijk Fungal Biodiversity Institute), Roger G. Linington(Simon Fraser University), Tilmann Weber(Novo Nordisk Foundation), Marnix H. Medema(Leiden University)
Nucleic Acids Research
November 18, 2022
Cited by 445Open Access
Full Text

Abstract

With an ever-increasing amount of (meta)genomic data being deposited in sequence databases, (meta)genome mining for natural product biosynthetic pathways occupies a critical role in the discovery of novel pharmaceutical drugs, crop protection agents and biomaterials. The genes that encode these pathways are often organised into biosynthetic gene clusters (BGCs). In 2015, we defined the Minimum Information about a Biosynthetic Gene cluster (MIBiG): a standardised data format that describes the minimally required information to uniquely characterise a BGC. We simultaneously constructed an accompanying online database of BGCs, which has since been widely used by the community as a reference dataset for BGCs and was expanded to 2021 entries in 2019 (MIBiG 2.0). Here, we describe MIBiG 3.0, a database update comprising large-scale validation and re-annotation of existing entries and 661 new entries. Particular attention was paid to the annotation of compound structures and biological activities, as well as protein domain selectivities. Together, these new features keep the database up-to-date, and will provide new opportunities for the scientific community to use its freely available data, e.g. for the training of new machine learning models to predict sequence-structure-function relationships for diverse natural products. MIBiG 3.0 is accessible online at https://mibig.secondarymetabolites.org/.


Related Papers

No related papers found

Powered by citation graph analysis