Comprehensive genome analysis and variant detection at scale using DRAGEN

Sairam Behera(Baylor College of Medicine), Severine Catreux(Illumina (United States)), Massimiliano Rossi(Illumina (United States)), Sean Truong(Illumina (United States)), Zhuoyi Huang(Illumina (United States)), Michael Ruehle(Illumina (United States)), Arun Visvanath(Illumina (United States)), Gavin Parnaby(Illumina (United States)), Cooper Roddey(Illumina (United States)), Vitor Onuchic(Illumina (United States)), Andrea Finocchio(Illumina (United States)), Daniel Cameron(Illumina (United States)), Adam C. English(Baylor College of Medicine), Shyamal Mehtalia(Illumina (United States)), James Han(Illumina (United States)), Rami Mehio(Illumina (United States)), Fritz J. Sedlazeck(Baylor College of Medicine)
Nature Biotechnology
October 25, 2024
Cited by 99Open Access
Full Text

Abstract

Research and medical genomics require comprehensive, scalable methods for the discovery of novel disease targets, evolutionary drivers and genetic markers with clinical significance. This necessitates a framework to identify all types of variants independent of their size or location. Here we present DRAGEN, which uses multigenome mapping with pangenome references, hardware acceleration and machine learning-based variant detection to provide insights into individual genomes, with ~30 min of computation time from raw reads to variant detection. DRAGEN outperforms current state-of-the-art methods in speed and accuracy across all variant types (single-nucleotide variations, insertions or deletions, short tandem repeats, structural variations and copy number variations) and incorporates specialized methods for analysis of medically relevant genes. We demonstrate the performance of DRAGEN across 3,202 whole-genome sequencing datasets by generating fully genotyped multisample variant call format files and demonstrate its scalability, accuracy and innovation to further advance the integration of comprehensive genomics. Overall, DRAGEN marks a major milestone in sequencing data analysis and will provide insights across various diseases, including Mendelian and rare diseases, with a highly comprehensive and scalable platform.


Related Papers