CODEX: a next-generation sequencing experiment database for the haematopoietic and embryonic stem cell communities

Manuel Sánchez-Castillo(Wellcome/MRC Cambridge Stem Cell Institute), David Ruau(Wellcome/MRC Cambridge Stem Cell Institute), Adam C. Wilkinson(Wellcome/MRC Cambridge Stem Cell Institute), Felicia Ng(Wellcome/MRC Cambridge Stem Cell Institute), Rebecca Hannah(Wellcome/MRC Cambridge Stem Cell Institute), Evangelia Diamanti(Wellcome/MRC Cambridge Stem Cell Institute), Patrick Lombard(Wellcome/MRC Cambridge Stem Cell Institute), Nicola K. Wilson(Wellcome/MRC Cambridge Stem Cell Institute), Berthold Göttgens(Wellcome/MRC Cambridge Stem Cell Institute)
Nucleic Acids Research
September 30, 2014
Cited by 134Open Access
Full Text

Abstract

CODEX (http://codex.stemcells.cam.ac.uk/) is a user-friendly database for the direct access and interrogation of publicly available next-generation sequencing (NGS) data, specifically aimed at experimental biologists. In an era of multi-centre genomic dataset generation, CODEX provides a single database where these samples are collected, uniformly processed and vetted. The main drive of CODEX is to provide the wider scientific community with instant access to high-quality NGS data, which, irrespective of the publishing laboratory, is directly comparable. CODEX allows users to immediately visualize or download processed datasets, or compare user-generated data against the database's cumulative knowledge-base. CODEX contains four types of NGS experiments: transcription factor chromatin immunoprecipitation coupled to high-throughput sequencing (ChIP-Seq), histone modification ChIP-Seq, DNase-Seq and RNA-Seq. These are largely encompassed within two specialized repositories, HAEMCODE and ESCODE, which are focused on haematopoiesis and embryonic stem cell samples, respectively. To date, CODEX contains over 1000 samples, including 221 unique TFs and 93 unique cell types. CODEX therefore provides one of the most complete resources of publicly available NGS data for the direct interrogation of transcriptional programmes that regulate cellular identity and fate in the context of mammalian development, homeostasis and disease.


Related Papers

No related papers found

Powered by citation graph analysis