<i>De novo</i> Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing

Marcus H. Stoiber(Lawrence Berkeley National Laboratory), Joshua Quick(Czech Academy of Sciences, Institute of Microbiology), Rob Egan(Lawrence Berkeley National Laboratory), Ji Eun Lee(Lawrence Berkeley National Laboratory), S Celniker(Lawrence Berkeley National Laboratory), Robert K. Neely(University of Birmingham), Nicholas J. Loman(Czech Academy of Sciences, Institute of Microbiology), L Pennacchio(Lawrence Berkeley National Laboratory), James B. Brown(Lawrence Berkeley National Laboratory)
bioRxiv (Cold Spring Harbor Laboratory)
December 15, 2016
Cited by 308Open Access
Full Text

Abstract

Abstract Advances in nanopore sequencing technology have enabled investigation of the full catalogue of covalent DNA modifications. We present the first algorithm for the identification of modified nucleotides without the need for prior training data along with the open source software implementation, nanoraw . Nanoraw accurately assigns contiguous raw nanopore signal to genomic positions, enabling novel data visualization, and increasing power and accuracy for the discovery of covalently modified bases in native DNA. Ground truth case studies utilizing synthetically methylated DNA show the capacity to identify three distinct methylation marks, 4mC, 5mC, and 6mA, in seven distinct sequence contexts without any changes to the algorithm. We demonstrate quantitative reproducibility simultaneously identifying 5mC and 6mA in native E. coli across biological replicates processed in different labs. Finally we propose a pipeline for the comprehensive discovery of DNA modifications in any genome without a priori knowledge of their chemical identities.


Related Papers

No related papers found

Powered by citation graph analysis