Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor

S. M. Ashiqul Islam(University of California San Diego), Marcos Díaz‐Gay(University of California San Diego), Yang Wu(Duke-NUS Medical School), Mark Barnes(University of California San Diego), Raviteja Vangara(University of California San Diego), Erik N. Bergstrom(University of California San Diego), Yudou He(University of California San Diego), Mike Vella(Nvidia (United States)), Jingwei Wang(Wellcome Sanger Institute), Jon W. Teague(Wellcome Sanger Institute), Peter Clapham(Wellcome Sanger Institute), Sarah Moody(Wellcome Sanger Institute), S. Senkin(Centre international de recherche sur le cancer), Yun Rose Li(City of Hope), Laura Riva(Wellcome Sanger Institute), Tongwu Zhang(National Cancer Institute), Andreas Gruber(University of Konstanz), Christopher D. Steele(University College London), Burçak Otlu(University of California San Diego), Azhar Khandekar(University of California San Diego), Ammal Abbasi(University of California San Diego), Laura Humphreys(Wellcome Sanger Institute), Natalia Syulyukina(University of California San Diego), Samuel W. Brady(St. Jude Children's Research Hospital), Boian S. Alexandrov(Los Alamos National Laboratory), Nischalan Pillay(Royal National Orthopaedic Hospital), Jinghui Zhang(St. Jude Children's Research Hospital), David J. Adams(Wellcome Sanger Institute), Iñigo Martincorena(Wellcome Sanger Institute), David C. Wedge(University of Manchester), Maria Teresa Landi(National Cancer Institute), Paul Brennan(Centre international de recherche sur le cancer), Michael R. Stratton(Wellcome Sanger Institute), Steve Rozen(Duke-NUS Medical School), Ludmil B. Alexandrov(University of California San Diego)
Cell Genomics
September 23, 2022
Cited by 358Open Access
Full Text

Abstract

extraction of mutational signatures, and benchmark it against another 13 bioinformatics tools by using 34 scenarios encompassing 2,500 simulated signatures found in 60,000 synthetic genomes and 20,000 synthetic exomes. For simulations with 5% noise, reflecting high-quality datasets, SigProfilerExtractor outperforms other approaches by elucidating between 20% and 50% more true-positive signatures while yielding 5-fold less false-positive signatures. Applying SigProfilerExtractor to 4,643 whole-genome- and 19,184 whole-exome-sequenced cancers reveals four novel signatures. Two of the signatures are confirmed in independent cohorts, and one of these signatures is associated with tobacco smoking. In summary, this report provides a reference tool for analysis of mutational signatures, a comprehensive benchmarking of bioinformatics tools for extracting signatures, and several novel mutational signatures, including one putatively attributed to direct tobacco smoking mutagenesis in bladder tissues.


Related Papers

No related papers found

Powered by citation graph analysis