<i>De novo</i> assembly of the cattle reference genome with single-molecule sequencing

Benjamin D. Rosen(Agricultural Research Service), Derek M. Bickhart(U.S. Dairy Forage Research Center), Robert D. Schnabel(University of Missouri), Sergey Koren(National Institutes of Health), Christine G. Elsik(University of Missouri), Elizabeth Tseng(Pacific Biosciences (United States)), Troy N. Rowan(University of Missouri), Wai Yee Low(The University of Adelaide), Aleksey V. Zimin(Johns Hopkins University), Christine Couldrey(Livestock Improvement Corporation), Richard Hall(Pacific Biosciences (United States)), Wenli Li(U.S. Dairy Forage Research Center), Arang Rhie(National Institutes of Health), Jay Ghurye(University of Maryland, College Park), Stephanie McKay(University of Vermont), Françoise Thibaud‐Nissen(National Institutes of Health), Jinna Hoffman(National Institutes of Health), Brenda M. Murdoch(University of Idaho), W. M. Snelling(Roman L. Hruska U.S. Meat Animal Research Center), T. G. McDaneld(Roman L. Hruska U.S. Meat Animal Research Center), John A. Hammond(The Pirbright Institute), J.C. Schwartz(The Pirbright Institute), Wilson Nandolo(Gregor Mendel Institute of Molecular Plant Biology), Darren E. Hagen(Oklahoma State University), Christian Dreischer, Sebastian J. Schultheiß, Steven Schroeder(Agricultural Research Service), Adam M. Phillippy(National Institutes of Health), John B. Cole(Agricultural Research Service), Curtis P. Van Tassell(Agricultural Research Service), George E. Liu(Agricultural Research Service), Timothy P. L. Smith(Roman L. Hruska U.S. Meat Animal Research Center), Juan F. Medrano(University of California, Davis)
GigaScience
March 1, 2020
Cited by 903Open Access
Full Text

Abstract

BACKGROUND: Major advances in selection progress for cattle have been made following the introduction of genomic tools over the past 10-12 years. These tools depend upon the Bos taurus reference genome (UMD3.1.1), which was created using now-outdated technologies and is hindered by a variety of deficiencies and inaccuracies. RESULTS: We present the new reference genome for cattle, ARS-UCD1.2, based on the same animal as the original to facilitate transfer and interpretation of results obtained from the earlier version, but applying a combination of modern technologies in a de novo assembly to increase continuity, accuracy, and completeness. The assembly includes 2.7 Gb and is >250× more continuous than the original assembly, with contig N50 >25 Mb and L50 of 32. We also greatly expanded supporting RNA-based data for annotation that identifies 30,396 total genes (21,039 protein coding). The new reference assembly is accessible in annotated form for public use. CONCLUSIONS: We demonstrate that improved continuity of assembled sequence warrants the adoption of ARS-UCD1.2 as the new cattle reference genome and that increased assembly accuracy will benefit future research on this species.


Related Papers

No related papers found

Powered by citation graph analysis