IslandViewer 4: expanded prediction of genomic islands for larger-scale datasetsIslandViewer (http://www.pathogenomics.sfu.ca/islandviewer/) is a widely-used webserver for the prediction and interactive visualization of genomic islands (GIs, regions of probable horizontal origin) in bacterial and archaeal genomes. GIs disproportionately encode factors that enhance the adaptability and competitiveness of the microbe within a niche, including virulence factors and other medically or environmentally important adaptations. We report here the release of IslandViewer 4, with novel features to accommodate the needs of larger-scale microbial genomics analysis, while expanding GI predictions and improving its flexible visualization interface. A user management web interface as well as an HTTP API for batch analyses are now provided with a secured authentication to facilitate the submission of larger numbers of genomes and the retrieval of results. In addition, IslandViewer's integrated GI predictions from multiple methods have been improved and expanded by integrating the precise Islander method for pre-computed genomes, as well as an updated IslandPath-DIMOB for both pre-computed and user-supplied custom genome analysis. Finally, pre-computed predictions including virulence factors and antimicrobial resistance are now available for 6193 complete bacterial and archaeal strains publicly available in RefSeq. IslandViewer 4 provides key enhancements to facilitate the analysis of GIs and better understand their role in the evolution of successful environmental microbes and pathogens.
The genome of woodland strawberry (Fragaria vesca)The International Strawberry Sequencing Consortium reports the draft genome of the woodland strawberry (Fragaria vesca). The genome of this diploid species should serve as a reference genome for the Fragaria genus, as the cultivated strawberry (Fragaria × ananassa) is an octoploid where F. vesca is predicted to be a subgenome donor. The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria × ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted.
Complete Genome Sequence of Citrus Huanglongbing Bacterium, ‘<i>Candidatus</i>Liberibacter asiaticus’ Obtained Through MetagenomicsYongping Duan, Lijuan Zhou, David G. Hall et al.|Molecular Plant-Microbe Interactions|2009 Citrus huanglongbing is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with a low-titer, phloem-limited infection by any of three uncultured species of alpha-Proteobacteria, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. A complete circular 'Ca. L. asiaticus' genome has been obtained by metagenomics, using the DNA extracted from a single 'Ca. L. asiaticus'-infected psyllid. The 1.23-Mb genome has an average 36.5% GC content. Annotation revealed a high percentage of genes involved in both cell motility (4.5%) and active transport in general (8.0%), which may contribute to its virulence. 'Ca. L. asiaticus' appears to have a limited ability for aerobic respiration and is likely auxotrophic for at least five amino acids. Consistent with its intracellular nature, 'Ca. L. asiaticus' lacks type III and type IV secretion systems as well as typical free-living or plant-colonizing extracellular degradative enzymes. 'Ca. L. asiaticus' appears to have all type I secretion system genes needed for both multidrug efflux and toxin effector secretion. Multi-protein phylogenetic analysis confirmed 'Ca. L. asiaticus' as an early-branching and highly divergent member of the family Rhizobiaceae. This is the first genome sequence of an uncultured alpha-proteobacteria that is both an intracellular plant pathogen and insect symbiont.
Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and AnalysisA synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest.
Phylogeny of GammaproteobacteriaThe phylogeny of the large bacterial class Gammaproteobacteria has been difficult to resolve. Here we apply a telescoping multiprotein approach to the problem for 104 diverse gammaproteobacterial genomes, based on a set of 356 protein families for the whole class and even larger sets for each of four cohesive subregions of the tree. Although the deepest divergences were resistant to full resolution, some surprising patterns were strongly supported. A representative of the Acidithiobacillales routinely appeared among the outgroup members, suggesting that in conflict with rRNA-based phylogenies this order does not belong to Gammaproteobacteria; instead, it (and, independently, "Mariprofundus") diverged after the establishment of the Alphaproteobacteria yet before the betaproteobacteria/gammaproteobacteria split. None of the orders Alteromonadales, Pseudomonadales, or Oceanospirillales were monophyletic; we obtained strong support for clades that contain some but exclude other members of all three orders. Extreme amino acid bias in the highly A+T-rich genome of Candidatus Carsonella prevented its reliable placement within Gammaproteobacteria, and high bias caused artifacts that limited the resolution of the relationships of other insect endosymbionts, which appear to have had multiple origins, although the unbiased genome of the endosymbiont Sodalis acted as an attractor for them. Instability was observed for the root of the Enterobacteriales, with nearly equal subsets of the protein families favoring one or the other of two alternative root positions; the nematode symbiont Photorhabdus was identified as a disruptor whose omission helped stabilize the Enterobacteriales root.