GENESPACE tracks regions of interest and gene copy number variation across multiple genomes

John T. Lovell(Lawrence Berkeley National Laboratory), Avinash Sreedasyam(HudsonAlpha Institute for Biotechnology), M. Eric Schranz(Wageningen University & Research), Melissa A. Wilson(Arizona State University), Joseph W. Carlson(Lawrence Berkeley National Laboratory), Alex Harkess(HudsonAlpha Institute for Biotechnology), David Emms(University of Oxford), David Goodstein(Lawrence Berkeley National Laboratory), Jeremy Schmutz(Lawrence Berkeley National Laboratory)
eLife
August 1, 2022
Cited by 401Open Access
Full Text

Abstract

The development of multiple chromosome-scale reference genome sequences in many taxonomic groups has yielded a high-resolution view of the patterns and processes of molecular evolution. Nonetheless, leveraging information across multiple genomes remains a significant challenge in nearly all eukaryotic systems. These challenges range from studying the evolution of chromosome structure, to finding candidate genes for quantitative trait loci, to testing hypotheses about speciation and adaptation. Here, we present GENESPACE, which addresses these challenges by integrating conserved gene order and orthology to define the expected physical position of all genes across multiple genomes. We demonstrate this utility by dissecting presence-absence, copy-number, and structural variation at three levels of biological organization: spanning 300 million years of vertebrate sex chromosome evolution, across the diversity of the Poaceae (grass) plant family, and among 26 maize cultivars. The methods to build and visualize syntenic orthology in the GENESPACE R package offer a significant addition to existing gene family and synteny programs, especially in polyploid, outbred, and other complex genomes.


Related Papers

No related papers found

Powered by citation graph analysis