Exploring Massive, Genome Scale Datasets with the GenometriCorr Package

Alexander V. Favorov (Vavilov Institute of General Genetics), Loris Mularoni (Johns Hopkins University), Leslie Cope (Johns Hopkins Medicine), Yulia A. Medvedeva (Vavilov Institute of General Genetics), Andrey A. Mironov (Lomonosov Moscow State University), Vsevolod J. Makeev (Vavilov Institute of General Genetics), Sarah J. Wheelan (Johns Hopkins University)
PLoS Computational Biology
May 31, 2012
Cited by 231
Open Access

Abstract

We have created a statistically grounded tool for determining the correlation of genome-wide data with other datasets or with known biological features. It is intended to guide biological exploration of high-dimensional datasets rather than to provide immediate answers. The software enables several biologically motivated approaches to these data, and here we describe the rationale and implementation of each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data. Instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association, which is intended to suggest potentially relevant relationships between the datasets. AVAILABILITY AND IMPLEMENTATION: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the SourceForge repository. The package is pending submission to Bioconductor.
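
The abstract does not say which measures of spatial association the package computes. As a rough illustration of what a "spatial correlation between two sets of genomic intervals" can look like, the sketch below computes one simple overlap measure, the Jaccard index (bases covered by both sets divided by bases covered by either set). It is plain Python rather than R, it is not the GenometriCorr API, and it does not attempt the significance testing the abstract mentions. The function names, the half-open (start, end) coordinates, and the restriction to a single chromosome are all assumptions made for the example.

```python
# Illustrative only: a base-pair Jaccard index for two interval sets on one
# chromosome. All names here are hypothetical, not part of GenometriCorr.

def merge(intervals):
    """Merge overlapping or touching half-open intervals (start, end)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extend the previous interval instead of starting a new one.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def covered(intervals):
    """Number of bases covered by at least one interval."""
    return sum(end - start for start, end in merge(intervals))


def jaccard(query, reference):
    """Bases covered by both sets divided by bases covered by either set."""
    union = covered(list(query) + list(reference))
    if union == 0:
        return 0.0
    # Inclusion-exclusion: |A and B| = |A| + |B| - |A or B|
    intersection = covered(query) + covered(reference) - union
    return intersection / union


# Toy data (made up for the example, not taken from the paper).
query = [(100, 200), (400, 500)]
reference = [(150, 250), (600, 700)]
print(jaccard(query, reference))
```

A value near 0 means the two sets rarely cover the same bases, and a value of 1 means they cover exactly the same bases. On its own the number says nothing about significance, which would need a null model such as comparison against randomly repositioned intervals.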

