antiSMASH 4.0—improvements in chemistry prediction and gene cluster boundary identification

Kai Blin(Novo Nordisk Foundation), Thomas Wolf(Leibniz-Institut für Naturstoff-Forschung und Infektionsbiologie e. V. - Hans-Knöll-Institut (HKI)), Marc G. Chevrette(University of Wisconsin–Madison), Xiaowen Lu(Wageningen University & Research), Christopher J. Schwalen(University of Illinois Urbana-Champaign), Satria A. Kautsar(Wageningen University & Research), Hernando G. Suárez Duran(Wageningen University & Research), Emmanuel L. C. de los Santos(University of Warwick), Hyun Uk Kim(Korea Advanced Institute of Science and Technology), Mariana Nave(University of Lisbon), Jeroen S. Dickschat(University of Bonn), Douglas A. Mitchell(University of Illinois Urbana-Champaign), Ekaterina Shelest(Leibniz-Institut für Naturstoff-Forschung und Infektionsbiologie e. V. - Hans-Knöll-Institut (HKI)), Rainer Breitling(University of Manchester), Eriko Takano(University of Manchester), Sang Yup Lee(Korea Advanced Institute of Science and Technology), Tilmann Weber(Novo Nordisk Foundation), Marnix H. Medema(Wageningen University & Research)
Nucleic Acids Research
April 13, 2017
Cited by 1,215Open Access
Full Text

Abstract

Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally synthesized and post-translationally modified peptides cluster products, reporting of sequence similarity to proteins encoded in experimentally characterized gene clusters on a per-protein basis and a domain-level alignment tool for comparative analysis of trans-AT polyketide synthase assembly line architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules.


Related Papers

No related papers found

Powered by citation graph analysis