MetaCycle: an integrated R package to evaluate periodicity in large scale dataDetecting periodicity in large scale data remains a challenge. While efforts have been made to identify best of breed algorithms, relatively little research has gone into integrating these methods in a generalizable method. Here, we present MetaCycle, an R package that incorporates ARSER, JTK_CYCLE and Lomb-Scargle to conveniently evaluate periodicity in time-series data. MetaCycle has two functions, meta2d and meta3d, designed to analyze two-dimensional and three-dimensional time-series datasets, respectively. Meta2d implements N-version programming concepts using a suite of algorithms and integrating their results. AVAILABILITY AND IMPLEMENTATION: MetaCycle package is available on the CRAN repository (https://cran.r-project.org/web/packages/MetaCycle/index.html) and GitHub (https://github.com/gangwug/MetaCycle). CONTACT: hogenesch@gmail.comSupplementary information: Supplementary data are available at Bioinformatics online.
A database of tissue-specific rhythmically expressed human genes has potential applications in circadian medicineMarc D. Ruben, Gang Wu, David F. Smith et al.|Science Translational Medicine|2018 The discovery that half of the mammalian protein-coding genome is regulated by the circadian clock has clear implications for medicine. Recent studies demonstrated that the circadian clock influences therapeutic outcomes in human heart disease and cancer. However, biological time is rarely given clinical consideration. A key barrier is the absence of information on tissue-specific molecular rhythms in the human body. We have applied the cyclic ordering by periodic structure (CYCLOPS) algorithm, designed to reconstruct sample temporal order in the absence of time-of-day information, to the gene expression collection of 13 tissues from 632 human donors. We identified rhythms in gene expression across the body; nearly half of protein-coding genes were shown to be cycling in at least 1 of the 13 tissues analyzed. One thousand of these cycling genes encode proteins that either transport or metabolize drugs or are themselves drug targets. These results provide a useful resource for studying the role of circadian rhythms in medicine and support the idea that biological time might play a role in determining drug response.
Guidelines for Genome-Scale Analysis of Biological RhythmsGenome biology approaches have made enormous contributions to our understanding of biological rhythms, particularly in identifying outputs of the clock, including RNAs, proteins, and metabolites, whose abundance oscillates throughout the day. These methods hold significant promise for future discovery, particularly when combined with computational modeling. However, genome-scale experiments are costly and laborious, yielding "big data" that are conceptually and statistically difficult to analyze. There is no obvious consensus regarding design or analysis. Here we discuss the relevant technical considerations to generate reproducible, statistically sound, and broadly useful genome-scale data. Rather than suggest a set of rigid rules, we aim to codify principles by which investigators, reviewers, and readers of the primary literature can evaluate the suitability of different experimental designs for measuring different aspects of biological rhythms. We introduce CircaInSilico, a web-based application for generating synthetic genome biology data to benchmark statistical methods for studying biological rhythms. Finally, we discuss several unmet analytical needs, including applications to clinical medicine, and suggest productive avenues to address them.
Exploring phylogeny to find the function of sleepCYCLOPS reveals human transcriptional rhythms in health and diseaseRon C. Anafi, Lauren J. Francey, John B. Hogenesch et al.|Proceedings of the National Academy of Sciences|2017 Circadian rhythms modulate many aspects of physiology. Knowledge of the molecular basis of these rhythms has exploded in the last 20 years. However, most of these data are from model organisms, and translation to clinical practice has been limited. Here, we present an approach to identify molecular rhythms in humans from thousands of unordered expression measurements. Our algorithm, cyclic ordering by periodic structure (CYCLOPS), uses evolutionary conservation and machine learning to identify elliptical structure in high-dimensional data. From this structure, CYCLOPS estimates the phase of each sample. We validated CYCLOPS using temporally ordered mouse and human data and demonstrated its consistency on human data from two independent research sites. We used this approach to identify rhythmic transcripts in human liver and lung, including hundreds of drug targets and disease genes. Importantly, for many genes, the circadian variation in expression exceeded variation from genetic and other environmental factors. We also analyzed hepatocellular carcinoma samples and show these solid tumors maintain circadian function but with aberrant output. Finally, to show how this method can catalyze medical translation, we show that dosage time can temporally segregate efficacy from dose-limiting toxicity of streptozocin, a chemotherapeutic drug. In sum, these data show the power of CYCLOPS and temporal reconstruction in bridging basic circadian research and clinical medicine.