Principles of metadata organization at the ENCODE data coordination center

Eurie L. Hong(Stanford University), Cricket A. Sloan(Stanford University), Esther T. Chan(Stanford University), Jean M. Davidson(Stanford University), Venkat S. Malladi(Stanford University), J. Seth Strattan(Stanford University), Benjamin C. Hitz(Stanford University), Idan Gabdank(Stanford University), Aditi K. Narayanan(Stanford University), Marcus Ho(Stanford University), Brian T. Lee(University of California, Santa Cruz), Laurence D. Rowe(Stanford University), Timothy R. Dreszer(Stanford University), Greg R. Roe(Stanford University), Nikhil R. Podduturi(Stanford University), Forrest Y. Tanaka(Stanford University), Jason A. Hilton(Stanford University), J. Michael Cherry(Stanford University)
Database
January 1, 2016
Cited by 48Open Access
Full Text

Abstract

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.


Related Papers

No related papers found

Powered by citation graph analysis