A compilation of soybean ESTs: generation and analysis

Randy C. Shoemaker; Paul Keim; Lila O. Vodkin; Ernest Retzel; Sandra W. Clifton; R Waterston; David A. Smoller; Virginia H. Coryell; Anupama Khanna; John E. Erpelding; Xiaowu Gai; Volker Brendel; Christina Raph-Schmidt; Elizabeth Shoop; C J Vielweber; Matt Schmatz; Deana Pape; Yvette Bowers; Brenda Theising; John C. Martin; Michael Dante; Todd Wylie; Cheryl L. Granger

doi:10.1139/g01-150

A compilation of soybean ESTs: generation and analysis

Randy C. Shoemaker(Iowa State University), Paul Keim(Iowa State University), Lila O. Vodkin(Iowa State University), Ernest Retzel(Iowa State University), Sandra W. Clifton(Iowa State University), R Waterston(Iowa State University), David A. Smoller(Iowa State University), Virginia H. Coryell(Iowa State University), Anupama Khanna(Iowa State University), John E. Erpelding(Iowa State University), Xiaowu Gai(Iowa State University), Volker Brendel(Iowa State University), Christina Raph-Schmidt(Iowa State University), Elizabeth Shoop(Iowa State University), C J Vielweber(Iowa State University), Matt Schmatz(Iowa State University), Deana Pape(Iowa State University), Yvette Bowers(Iowa State University), Brenda Theising(Iowa State University), John C. Martin(Iowa State University), Michael Dante(Iowa State University), Todd Wylie(Iowa State University), Cheryl L. Granger(Iowa State University)

Genome

April 1, 2002

10.1139/g01-150

Cited by 139

Abstract

Whole-genome sequencing is fundamental to understanding the genetic composition of an organism. Given the size and complexity of the soybean genome, an alternative approach is targeted random-gene sequencing, which provides an immediate and productive method of gene discovery. In this study, more than 120000 soybean expressed sequence tags (ESTs) generated from more than 50 cDNA libraries were evaluated. These ESTs coalesced into 16928 contigs and 17336 singletons. On average, each contig was composed of 6 ESTs and spanned 788 bases. The average sequence length submitted to dbEST was 414 bases. Using only those libraries generating more than 800 ESTs each and only those contigs with 10 or more ESTs each, correlated patterns of gene expression among libraries and genes were discerned. Two-dimensional qualitative representations of contig and library similarities were generated based on expression profiles. Genes with similar expression patterns and, potentially, similar functions were identified. These studies provide a rich source of publicly available gene sequences as well as valuable insight into the structure, function, and evolution of a model crop legume genome.

K. Arumuganathan, Elizabeth D. Earle|Plant Molecular Biology Reporter|1991|3.2k

The Origins of Genomic Duplications in <i>Arabidopsis</i>

Todd Vision, Daniel G. Brown, Steven D. Tanksley|Science|2000|1k

An Integrated Genetic Linkage Map of the Soybean Genome

Perry B. Cregan, T. Jarvik, Arla L. Bush et al.|Crop Science|1999|837

Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence.

Adams, Anthony R. Kerlavage, Fleischmann Rd et al.|Nature|1995|799

Generation and analysis of 280,000 human expressed sequence tags.

L. Hillier, Gregory G. Lennon, Michael A. Becker et al.|Genome Research|1996|469

A compilation of soybean ESTs: generation and analysis

Abstract

Related Papers