A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants

Sarah M. Pilkington(Plant & Food Research), Ross Crowhurst(Plant & Food Research), Elena Hilario(Plant & Food Research), Simona Nardozza(Plant & Food Research), Lena G. Fraser(Plant & Food Research), Yongyan Peng(Plant & Food Research), Kularajathevan Gunaseelan(Plant & Food Research), Robert Simpson(Palmerston North Hospital), Jibran Tahir(Palmerston North Hospital), Simon Deroles(Palmerston North Hospital), Kerry Templeton(Plant & Food Research), Zhiwei Luo(Plant & Food Research), Marcus Davy, Canhong Cheng(Plant & Food Research), M.A. McNeilage(Plant & Food Research), Davide Scaglione(Istituto di Genomica Applicata), Yifei Liu(Chinese Academy of Sciences), Qiong Zhang(Wuhan Botanical Garden), Paul Datson(Plant & Food Research), Nihal De Silva(Plant & Food Research), Susan E. Gardiner(Palmerston North Hospital), Heather Bassett(Palmerston North Hospital), David Chagné(Palmerston North Hospital), John McCallum, Helge Dzierzon(Palmerston North Hospital), Cecilia Deng(Plant & Food Research), Yen-Yi Wang(Plant & Food Research), Lorna Barron(Plant & Food Research), Kelvina Manako(Plant & Food Research), Judith Bowen(Plant & Food Research), Toshi Foster(Palmerston North Hospital), Zoe Erridge(Palmerston North Hospital), Heather Tiffin(Palmerston North Hospital), Chethi Waite(Palmerston North Hospital), Kevin M. Davies(Palmerston North Hospital), Ella P. Grierson(Palmerston North Hospital), William A. Laing(Palmerston North Hospital), Rebecca Kirk(Plant & Food Research), Xiuyin Chen(Plant & Food Research), Marion Wood(Plant & Food Research), Mirco Montefiori(Plant & Food Research), David A. Brummell(Palmerston North Hospital), Kathy E. Schwinn(Palmerston North Hospital), Andrew Catanach, Christina Fullerton(Plant & Food Research), Dawei Li(Chinese Academy of Sciences), Sathiyamoorthy Meiyalaghan, Niels J. Nieuwenhuizen(Plant & Food Research), Nicola Read(University of Auckland), Roneel Prakash(Plant & Food Research), Don Hunter(Palmerston North Hospital), Huaibi Zhang(Palmerston North Hospital), Marian J. McKenzie(Palmerston North Hospital), Mareike Knäbel(Palmerston North Hospital), Alastair Harris(University of Auckland), Andrew C. Allan(Plant & Food Research), Andrew P. Gleave(Plant & Food Research), Angela Chen(University of Auckland), Bart Janssen(Plant & Food Research), Blue Plunkett(Plant & Food Research), Charles Ampomah‐Dwamena(Plant & Food Research), Charlotte Voogd(Plant & Food Research), Davin Leif(Plant & Food Research), Declan Lafferty(University of Auckland), Edwige Souleyre(Plant & Food Research), Erika Varkonyi‐Gasic(Plant & Food Research), Francesco Gambi(Plant & Food Research), Jenny Hanley(University of Auckland), Jia‐Long Yao(Plant & Food Research), J Cheung(University of Auckland), Karine David(University of Auckland), Ben Warren(Plant & Food Research), Ken Marsh(Plant & Food Research), Kimberley C. Snowden(Plant & Food Research), Kui Lin‐Wang(Plant & Food Research), Lara Brian(Plant & Food Research), Marcela Martínez-Sánchez(Plant & Food Research), Mindy Wang(Plant & Food Research), Nadeesha Ileperuma(Plant & Food Research), Nikolai Macnee(Plant & Food Research), Robert Campin(Plant & Food Research), Peter McAtee(Plant & Food Research), Revel Drummond(Plant & Food Research), Richard V. Espley(Plant & Food Research), Hilary Ireland(Plant & Food Research), Rongmei Wu(Plant & Food Research), Ross G. Atkinson(Plant & Food Research), Sakuntala Karunairetnam(Plant & Food Research), Sean Bulley, Shayhan Chunkath(University of Auckland), Zac Hanley(Plant & Food Research), Roy Storey, Amali Thrimawithana(Plant & Food Research), Susan Thomson, Charles David, R. Testolin(University of Udine), Hongwen Huang(South China Botanical Garden), Roger P. Hellens(Queensland University of Technology), Robert J. Schaffer(University of Auckland)
BMC Genomics
April 16, 2018
Cited by 222Open Access
Full Text

Abstract

BACKGROUND: Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. RESULTS: A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. CONCLUSIONS: Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.


Related Papers

No related papers found

Powered by citation graph analysis