Characterization of polyploid wheat genomic diversity using a high‐density 90 000 single nucleotide polymorphism arrayShichen Wang, Debbie Wong, Kerrie Forrest et al.|Plant Biotechnology Journal|2014 High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array including about 90,000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence-absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat.
Durum wheat genome highlights past domestication signatures and future improvement targetsThe domestication of wild emmer wheat led to the selection of modern durum wheat, grown mainly for pasta production. We describe the 10.45 gigabase (Gb) assembly of the genome of durum wheat cultivar Svevo. The assembly enabled genome-wide genetic diversity analyses revealing the changes imposed by thousands of years of empirical selection and breeding. Regions exhibiting strong signatures of genetic divergence associated with domestication and breeding were widespread in the genome with several major diversity losses in the pericentromeric regions. A locus on chromosome 5B carries a gene encoding a metal transporter (TdHMA3-B1) with a non-functional variant causing high accumulation of cadmium in grain. The high-cadmium allele, widespread among durum cultivars but undetected in wild emmer accessions, increased in frequency from domesticated emmer to modern durum wheat. The rapid cloning of TdHMA3-B1 rescues a wild beneficial allele and demonstrates the practical use of the Svevo genome for wheat improvement.
Genebank genomics highlights the diversity of a global barley collectionA high‐density, <scp>SNP</scp>‐based consensus map of tetraploid wheat as a bridge to integrate durum and bread wheat genomics and breedingMarco Maccaferri, Andrea Ricci, Silvio Salvi et al.|Plant Biotechnology Journal|2014 Consensus linkage maps are important tools in crop genomics. We have assembled a high-density tetraploid wheat consensus map by integrating 13 data sets from independent biparental populations involving durum wheat cultivars (Triticum turgidum ssp. durum), cultivated emmer (T. turgidum ssp. dicoccum) and their ancestor (wild emmer, T. turgidum ssp. dicoccoides). The consensus map harboured 30 144 markers (including 26 626 SNPs and 791 SSRs) half of which were present in at least two component maps. The final map spanned 2631 cM of all 14 durum wheat chromosomes and, differently from the individual component maps, all markers fell within the 14 linkage groups. Marker density per genetic distance unit peaked at centromeric regions, likely due to a combination of low recombination rate in the centromeric regions and even gene distribution along the chromosomes. Comparisons with bread wheat indicated fewer regions with recombination suppression, making this consensus map valuable for mapping in the A and B genomes of both durum and bread wheat. Sequence similarity analysis allowed us to relate mapped gene-derived SNPs to chromosome-specific transcripts. Dense patterns of homeologous relationships have been established between the A- and B-genome maps and between nonsyntenic homeologous chromosome regions as well, the latter tracing to ancient translocation events. The gene-based homeologous relationships are valuable to infer the map location of homeologs of target loci/QTLs. Because most SNP and SSR markers were previously mapped in bread wheat, this consensus map will facilitate a more effective integration and exploitation of genes and QTL for wheat breeding purposes.
A Comparison of Mainstream Genotyping Platforms for the Evaluation and Use of Barley Genetic ResourcesBenoît Darrier, Joanne Russell, Sara G. Milner et al.|Frontiers in Plant Science|2019 GenBank hosted at IPK Gatersleben. Each platform revealed equivalent numbers of robust bi-allelic SNPs (39,733 and 37,930 SNPs for the 50K SNP-array and GBS datasets respectively). A small overlap of 464 SNPs was common to both platforms, indicating that the methodologies we used selectively access informative polymorphism in different portions of the barley genome. Approximately half of the GBS dataset was comprised of SNPs with minor allele frequencies (MAFs) below 1%, illustrating the power of GBS to detect rare alleles in diverse germplasm collections. While desired for certain applications, the highly robust calling of alleles at the same SNPs across multiple populations is an advantage of the SNP-array, allowing direct comparisons of data from related or unrelated studies. Overall MAFs and diversity statistics (π) were higher for the SNP-array data, potentially reflecting the conscious removal of markers with a low MAF in the ascertainment population. A comparison of similarity matrices revealed a positive correlation between both approaches, supporting the validity of using either for entire GenBank characterization. To explore the potential of each dataset for focused genetic analyses we explored the outcomes of their use in genome-wide association scans for row type, growth habit and non-adhering hull, and discriminant analysis of principal components for the drivers of sub-population differentiation. Interpretation of the results from both types of analysis yielded broadly similar conclusions indicating that choice of platform used for such analyses should be determined by the research question being asked, group preferences and their capabilities to extract and interpret the different types of output data easily and quickly. Access to the requisite infrastructure for running, processing, analyzing, querying, storing, and displaying either datatype is an additional consideration. Our investigations reveal that for barley the cost per genotyping assay is less for SNP-arrays than GBS, which translates to a cost per informative datapoint being significantly lower for the SNP-array.