Marker‐Assisted Selection in Plant Breeding: From Publications to PracticeThe volume of publications on the development and to a lesser extent the application of molecular markers in plant breeding has increased dramatically during the last decade. However, most of the publications result from investments from donors with a strategic science quality or biotech advocacy mandate leading to insufficient emphasis on applied value in plant breeding. Converting promising publications into practical applications requires the resolution of many logistical and genetical constraints that are rarely addressed in journal publications. This results in a high proportion of published markers failing at one or more of the translation steps from research arena to application domain. The rate of success is likely to increase due to developments in gene‐based marker development, more efficient quantitative trait locus (QTL) mapping procedures, and lower cost genotyping systems. However, some fundamental issues remain to be resolved, particularly regarding complex traits, before marker‐assisted selection realizes its full potential in public sector breeding programs. These include the development of high throughput precision phenotyping systems for QTL mapping, improved understanding of genotype by environment interaction and epistasis, and development of publicly available computational tools tailored to the needs of molecular breeding programs.
Association Analysis of Historical Bread Wheat Germplasm Using Additive Genetic Covariance of Relatives and Population StructureLinkage disequilibrium can be used for identifying associations between traits of interest and genetic markers. This study used mapped diversity array technology (DArT) markers to find associations with resistance to stem rust, leaf rust, yellow rust, and powdery mildew, plus grain yield in five historical wheat international multienvironment trials from the International Maize and Wheat Improvement Center (CIMMYT). Two linear mixed models were used to assess marker-trait associations incorporating information on population structure and covariance between relatives. An integrated map containing 813 DArT markers and 831 other markers was constructed. Several linkage disequilibrium clusters bearing multiple host plant resistance genes were found. Most of the associated markers were found in genomic regions where previous reports had found genes or quantitative trait loci (QTL) influencing the same traits, providing an independent validation of this approach. In addition, many new chromosome regions for disease resistance and grain yield were identified in the wheat genome. Phenotyping across up to 60 environments and years allowed modeling of genotype x environment interaction, thereby making possible the identification of markers contributing to both additive and additive x additive interaction effects of traits.
Genetic Characterization and Linkage Disequilibrium Estimation of a Global Maize Collection Using SNP MarkersA newly developed maize Illumina GoldenGate Assay with 1536 SNPs from 582 loci was used to genotype a highly diverse global maize collection of 632 inbred lines from temperate, tropical, and subtropical public breeding programs. A total of 1229 informative SNPs and 1749 haplotypes within 327 loci was used to estimate the genetic diversity, population structure, and familial relatedness. Population structure identified tropical and temperate subgroups, and complex familial relationships were identified within the global collection. Linkage disequilibrium (LD) was measured overall and within chromosomes, allelic frequency groups, subgroups related by geographic origin, and subgroups of different sample sizes. The LD decay distance differed among chromosomes and ranged between 1 to 10 kb. The LD distance increased with the increase of minor allelic frequency (MAF), and with smaller sample sizes, encouraging caution when using too few lines in a study. The LD decay distance was much higher in temperate than in tropical and subtropical lines, because tropical and subtropical lines are more diverse and contain more rare alleles than temperate lines. A core set of inbreds was defined based on haplotypes, and 60 lines capture 90% of the haplotype diversity of the entire panel. The defined core sets and the entire collection can be used widely for different research targets.
Association Mapping for Enhancing Maize (<i>Zea mays</i> L.) Genetic ImprovementABSTRACT Association mapping through linkage disequilibrium (LD) analysis is a powerful tool for the dissection of complex agronomic traits and for the identification of alleles that can contribute to the enhancement of a target trait. With the developments of high throughput genotyping techniques and advanced statistical approaches as well as the assembling and characterization of multiple association mapping panels, maize has become the model crop for association analysis. In this paper, we summarize progress in maize association mapping and the impacts of genetic diversity, rate of LD decay, population size, and population structure. We also review the use of candidate genes and gene‐based markers in maize association mapping studies that has generated particularly promising results. In addition, we examine recent developments in genome‐wide genotyping techniques that promise to improve the power of association mapping and significantly refine our understanding of the genetic architecture of complex quantitative traits. The new challenges and opportunities associated with genome‐wide analysis studies are discussed. In conclusion, we review the current and future impacts of association mapping on maize improvement along with the potential benefits for poor people in developing countries who are dependent on this crop for their food security and livelihoods.
African soil properties and nutrients mapped at 30 m spatial resolution using two-scale ensemble machine learningSoil property and class maps for the continent of Africa were so far only available at very generalised scales, with many countries not mapped at all. Thanks to an increasing quantity and availability of soil samples collected at field point locations by various government and/or NGO funded projects, it is now possible to produce detailed pan-African maps of soil nutrients, including micro-nutrients at fine spatial resolutions. In this paper we describe production of a 30 m resolution Soil Information System of the African continent using, to date, the most comprehensive compilation of soil samples ([Formula: see text]) and Earth Observation data. We produced predictions for soil pH, organic carbon (C) and total nitrogen (N), total carbon, effective Cation Exchange Capacity (eCEC), extractable-phosphorus (P), potassium (K), calcium (Ca), magnesium (Mg), sulfur (S), sodium (Na), iron (Fe), zinc (Zn)-silt, clay and sand, stone content, bulk density and depth to bedrock, at three depths (0, 20 and 50 cm) and using 2-scale 3D Ensemble Machine Learning framework implemented in the mlr (Machine Learning in R) package. As covariate layers we used 250 m resolution (MODIS, PROBA-V and SM2RAIN products), and 30 m resolution (Sentinel-2, Landsat and DTM derivatives) images. Our fivefold spatial Cross-Validation results showed varying accuracy levels ranging from the best performing soil pH (CCC = 0.900) to more poorly predictable extractable phosphorus (CCC = 0.654) and sulphur (CCC = 0.708) and depth to bedrock. Sentinel-2 bands SWIR (B11, B12), NIR (B09, B8A), Landsat SWIR bands, and vertical depth derived from 30 m resolution DTM, were the overall most important 30 m resolution covariates. Climatic data images-SM2RAIN, bioclimatic variables and MODIS Land Surface Temperature-however, remained as the overall most important variables for predicting soil chemical variables at continental scale. This publicly available 30-m Soil Information System of Africa aims at supporting numerous applications, including soil and fertilizer policies and investments, agronomic advice to close yield gaps, environmental programs, or targeting of nutrition interventions.