The Ecoresponsive Genome of <i>Daphnia pulex</i>We describe the draft genome of the microcrustacean Daphnia pulex, which is only 200 megabases and contains at least 30,907 genes. The high gene count is a consequence of an elevated rate of gene duplication resulting in tandem gene clusters. More than a third of Daphnia's genes have no detectable homologs in any other available proteome, and the most amplified gene families are specific to the Daphnia lineage. The coexpansion of gene families interacting within metabolic pathways suggests that the maintenance of duplicated genes is not random, and the analysis of gene expression under different environmental conditions reveals that numerous paralogs acquire divergent expression patterns soon after duplication. Daphnia-specific genes, including many additional loci within sequenced regions that are otherwise devoid of annotations, are the most responsive genes to ecological challenges.
Splign: algorithms for computing spliced alignments with identification of paralogsBACKGROUND: The computation of accurate alignments of cDNA sequences against a genome is at the foundation of modern genome annotation pipelines. Several factors such as presence of paralogs, small exons, non-consensus splice signals, sequencing errors and polymorphic sites pose recognized difficulties to existing spliced alignment algorithms. RESULTS: We describe a set of algorithms behind a tool called Splign for computing cDNA-to-Genome alignments. The algorithms include a high-performance preliminary alignment, a compartment identification based on a formally defined model of adjacent duplicated regions, and a refined sequence alignment. In a series of tests, Splign has produced more accurate results than other tools commonly used to compute spliced alignments, in a reasonable amount of time. CONCLUSION: Splign's ability to deal with various issues complicating the spliced alignment problem makes it a helpful tool in eukaryotic genome annotation processes and alternative splicing studies. Its performance is enough to align the largest currently available pools of cDNA data such as the human EST set on a moderate-sized computing cluster in a matter of hours. The duplications identification (compartmentization) algorithm can be used independently in other areas such as the study of pseudogenes.
NCBI RefSeq: reference sequence standards through 25 years of curation and annotationReference sequences and annotations serve as the foundation for many lines of research today, from organism and sequence identification to providing a core description of the genes, transcripts and proteins found in an organism's genome. Interpretation of data including transcriptomics, proteomics, sequence variation and comparative analyses based on reference gene annotations informs our understanding of gene function and possible disease mechanisms, leading to new biomedical discoveries. The Reference Sequence (RefSeq) resource created at the National Center for Biotechnology Information (NCBI) leverages both automatic processes and expert curation to create a robust set of reference sequences of genomic, transcript and protein data spanning the tree of life. RefSeq continues to refine its annotation and quality control processes and utilize better quality genomes resulting from advances in sequencing technologies as well as RNA-Seq data to produce high-quality annotated genomes, ortholog predictions across more organisms and other products that are easily accessible through multiple NCBI resources. This report summarizes the current status of the eukaryotic, prokaryotic and viral RefSeq resources, with a focus on eukaryotic annotation, the increase in taxonomic representation and the effect it will have on comparative genomics. The RefSeq resource is publicly accessible at https://www.ncbi.nlm.nih.gov/refseq.
Eukaryotic Genome Annotation PipelineFrançoise Thibaud‐Nissen, Alexander Souvorov, Terence D. Murphy et al.|Plant and Animal Genome XX Conference (January 14-18, 2012)|2013 Microwave tomography: two-dimensional system for biological imagingSerguei Semenov, Robert H. Svenson, Alexander E. Boulyshev et al.|IEEE Transactions on Biomedical Engineering|1996 Microwave tomographic imaging is one of the new technologies which has the potential for important applications in medicine. Microwave tomographically reconstructed images may potentially provide information about the physiological state of tissue as well as the anatomical structure of an organ. A two-dimensional (2-D) prototype of a quasi real-time microwave tomographic system was constructed. It was utilized to reconstruct images of physiologically active biological tissues such as an explanted canine perfused heart. The tomographic system consisted of 64 special antennae, divided into 32 emitters and 32 receivers which were electronically scanned. The cylindrical microwave chamber had an internal diameter of 360 mm and was filled with various solutions, including deionized water. The system operated on a frequency of 2.45 GHz. The polarization of the incident electromagnetic field was linear in the vertical direction. Total acquisition time was less than 500 ms. Both accurate and approximation methods of image reconstruction were used. Images of 2-D phantoms, canine hearts, and beating canine hearts have been achieved. In the worst-case situation when the 2-D diffraction model was used for an attempt to "slice" three-dimensional (3-D) object reconstruction, we still achieved spatial resolution of 1 to 2 cm and contrast resolution of 5%.