Sequence-specific error profile of Illumina sequencers

Kensuke Nakamura(Nara Institute of Science and Technology), Taku Oshima(Nara Institute of Science and Technology), Takuya Morimoto(Nara Institute of Science and Technology), Shun Ikeda(Nara Institute of Science and Technology), Hirofumi Yoshikawa(Nara Institute of Science and Technology), Yuh Shiwa(Nara Institute of Science and Technology), Shu Ishikawa(Nara Institute of Science and Technology), Margaret C. Linak(Nara Institute of Science and Technology), Aki Hirai(Nara Institute of Science and Technology), Hiroki Takahashi(Nara Institute of Science and Technology), Md. Altaf‐Ul‐Amin(Nara Institute of Science and Technology), Naotake Ogasawara(Nara Institute of Science and Technology), Shigehiko Kanaya(Nara Institute of Science and Technology)
Nucleic Acids Research
May 14, 2011
Cited by 631Open Access
Full Text

Abstract

We identified the sequence-specific starting positions of consecutive miscalls in the mapping of reads obtained from the Illumina Genome Analyser (GA). Detailed analysis of the miscall pattern indicated that the underlying mechanism involves sequence-specific interference of the base elongation process during sequencing. The two major sequence patterns that trigger this sequence-specific error (SSE) are: (i) inverted repeats and (ii) GGC sequences. We speculate that these sequences favor dephasing by inhibiting single-base elongation, by: (i) folding single-stranded DNA and (ii) altering enzyme preference. This phenomenon is a major cause of sequence coverage variability and of the unfavorable bias observed for population-targeted methods such as RNA-seq and ChIP-seq. Moreover, SSE is a potential cause of false single-nucleotide polymorphism (SNP) calls and also significantly hinders de novo assembly. This article highlights the importance of recognizing SSE and its underlying mechanisms in the hope of enhancing the potential usefulness of the Illumina sequencers.


Related Papers

No related papers found

Powered by citation graph analysis