A reference library for Canadian invertebrates with 1.5 million barcodes, voucher specimens, and DNA samples

Jeremy R deWaard(University of Guelph), Sujeevan Ratnasingham(University of Guelph), Evgeny Zakharov(University of Guelph), Alex Borisenko(University of Guelph), Dirk Steinke(University of Guelph), Angela C Telfer(University of Guelph), Kate Perez(University of Guelph), Jayme E Sones(University of Guelph), Monica R Young(University of Guelph), Valerie Levesque‐Beaudin(University of Guelph), Crystal N Sobel(University of Guelph), Arusyak Abrahamyan(University of Guelph), Kyrylo Bessonov(Public Health Agency of Canada), Gergin Blagoev(University of Guelph), Stephanie deWaard(University of Guelph), Chris Ho(University of Guelph), Natalya Ivanova(University of Guelph), Kara K S Layton(Dalhousie University), Liuqiong Lu(University of Guelph), Ramya Manjunath(University of Guelph), Jaclyn McKeown(University of Guelph), Megan Milton(University of Guelph), Renee Miskie(University of Guelph), Norm Monkhouse(University of Guelph), Suresh Naik(University of Guelph), Nadya Nikolova(University of Guelph), Mikko Pentinsaari(University of Guelph), Sean W. J. Prosser(University of Guelph), Adriana Radulovici(University of Guelph), Claudia Steinke(University of Guelph), Connor P Warne(University of Guelph), Paul D. N. Hebert(University of Guelph)
Scientific Data
December 6, 2019
Cited by 107Open Access
Full Text

Abstract

The reliable taxonomic identification of organisms through DNA sequence data requires a well parameterized library of curated reference sequences. However, it is estimated that just 15% of described animal species are represented in public sequence repositories. To begin to address this deficiency, we provide DNA barcodes for 1,500,003 animal specimens collected from 23 terrestrial and aquatic ecozones at sites across Canada, a nation that comprises 7% of the planet's land surface. In total, 14 phyla, 43 classes, 163 orders, 1123 families, 6186 genera, and 64,264 Barcode Index Numbers (BINs; a proxy for species) are represented. Species-level taxonomy was available for 38% of the specimens, but higher proportions were assigned to a genus (69.5%) and a family (99.9%). Voucher specimens and DNA extracts are archived at the Centre for Biodiversity Genomics where they are available for further research. The corresponding sequence and taxonomic data can be accessed through the Barcode of Life Data System, GenBank, the Global Biodiversity Information Facility, and the Global Genome Biodiversity Network Data Portal.


Related Papers

No related papers found

Powered by citation graph analysis