Unipro UGENE: a unified bioinformatics toolkitUNLABELLED: Unipro UGENE is a multiplatform open-source software with the main goal of assisting molecular biologists without much expertise in bioinformatics to manage, analyze and visualize their data. UGENE integrates widely used bioinformatics tools within a common user interface. The toolkit supports multiple biological data formats and allows the retrieval of data from remote data sources. It provides visualization modules for biological objects such as annotated genome sequences, Next Generation Sequencing (NGS) assembly data, multiple sequence alignments, phylogenetic trees and 3D structures. Most of the integrated algorithms are tuned for maximum performance by the usage of multithreading and special processor instructions. UGENE includes a visual environment for creating reusable workflows that can be launched on local resources or in a High Performance Computing (HPC) environment. UGENE is written in C++ using the Qt framework. The built-in plugin system and structured UGENE API make it possible to extend the toolkit with new functionality. AVAILABILITY AND IMPLEMENTATION: UGENE binaries are freely available for MS Windows, Linux and Mac OS X at http://ugene.unipro.ru/download.html. UGENE code is licensed under the GPLv2; the information about the code licensing and copyright of integrated tools can be found in the LICENSE.3rd_party file provided with the source bundle.
Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analysesThe advent of Next Generation Sequencing (NGS) technologies has opened new possibilities for researchers. However, the more biology becomes a data-intensive field, the more biologists have to learn how to process and analyze NGS data with complex computational tools. Even with the availability of common pipeline specifications, it is often a time-consuming and cumbersome task for a bench scientist to install and configure the pipeline tools. We believe that a unified, desktop and biologist-friendly front end to NGS data analysis tools will substantially improve productivity in this field. Here we present NGS pipelines "Variant Calling with SAMtools", "Tuxedo Pipeline for RNA-seq Data Analysis" and "Cistrome Pipeline for ChIP-seq Data Analysis" integrated into the Unipro UGENE desktop toolkit. We describe the available UGENE infrastructure that helps researchers run these pipelines on different datasets, store and investigate the results and re-run the pipelines with the same parameters. These pipeline tools are included in the UGENE NGS package. Individual blocks of these pipelines are also available for expert users to create their own advanced workflows.
Program complex SNP-MED for analysis of single-nucleotide polymorphism (SNP) effects on the function of genes associated with socially significant diseasesN. L. Podkolodnyy, Д. А. Афонников, Yu. Yu. Vaskin et al.|Russian Journal of Genetics Applied Research|2014 We describe development and application of the new SNP-MED modular software system, designed to examine the influence of single-nucleotide polymorphisms (SNPs) on the function of genes associated with the risk of socially significant diseases. The SNP-MED system includes Genomics, Proteomics, and Gene Networks’ software components, and the Information Resource Database.
Shared Bioinformatics Databases within the Unipro UGENE PlatformIvan V. Protsyuk, German Grekhov, Alexei V. Tiunov et al.|Berichte aus der medizinischen Informatik und Bioinformatik/Journal of integrative bioinformatics|2015 Summary Unipro UGENE is an open-source bioinformatics toolkit that integrates popular tools along with original instruments for molecular biologists within a unified user interface. Nowadays, most bioinformatics desktop applications, including UGENE, make use of a local data model while processing different types of data. Such an approach causes an inconvenience for scientists working cooperatively and relying on the same data. This refers to the need of making multiple copies of certain files for every workplace and maintaining synchronization between them in case of modifications. Therefore, we focused on delivering a collaborative work into the UGENE user experience. Currently, several UGENE installations can be connected to a designated shared database and users can interact with it simultaneously. Such databases can be created by UGENE users and be used at their discretion. Objects of each data type, supported by UGENE such as sequences, annotations, multiple alignments, etc., can now be easily imported from or exported to a remote storage. One of the main advantages of this system, compared to existing ones, is the almost simultaneous access of client applications to shared data regardless of their volume. Moreover, the system is capable of storing millions of objects. The storage itself is a regular database server so even an inexpert user is able to deploy it. Thus, UGENE may provide access to shared data for users located, for example, in the same laboratory or institution. UGENE is available at: http://ugene.net/download.html.
New LTR Retrotransposable Elements from Eukaryotic GenomesOlga Novikova, Mikhail Fursov, E. Beresikov et al.|Kluwer Academic Publishers eBooks|2006 LTR retrotransposons have been identified in the genomes of many eukaryotic organisms. Several approaches can be used in order to identify new LTR retrotransposons from a wide range of organisms. One of them is a systematic search of genomic sequences generated from whole genome sequencing projects. Here, we performed a computer analysis of 14 genomes with the aim of extracting LTR retrotransposons using a software based on the hidden Markov models. Altogether, 24 new LTR retrotransposable elements from 4 subclasses Ty3/gypsy, DIRS, Bel, and Ty1/copia were identified in 9 different genomes. A new family of retroelements was obtained after the analysis of the genomes of ascomycetes and basidiomycetes fungi. Considerable differences in the amount and diversity of retroelements were detected in the distantly related nematodes B. malayi and C. briggsae. Finally, our analysis did not reveal the presence of LTR retroelements in the genomes of parasitic protozoan.