A hypothesis-based approach for identifying the binding specificity of regulatory proteins from chromatin immunoprecipitation data

Kenzie D. MacIsaac(Vassar College), D. Benjamin Gordon(Whitehead Institute for Biomedical Research), Lena Nekludova(Whitehead Institute for Biomedical Research), Duncan T. Odom(Whitehead Institute for Biomedical Research), Joerg Schreiber(Whitehead Institute for Biomedical Research), David K. Gifford(Vassar College), Richard A. Young(Whitehead Institute for Biomedical Research), Ernest Fraenkel(Vassar College)
Bioinformatics
December 6, 2005
Cited by 75

Abstract

MOTIVATION: Genome-wide chromatin-immunoprecipitation (ChIP-chip) detects binding of transcriptional regulators to DNA in vivo at low resolution. Motif discovery algorithms can be used to discover sequence patterns in the bound regions that may be recognized by the immunoprecipitated protein. However, the discovered motifs often do not agree with the binding specificity of the protein, when it is known. RESULTS: We present a powerful approach to analyzing ChIP-chip data, called THEME, that tests hypotheses concerning the sequence specificity of a protein. Hypotheses are refined using constrained local optimization. Cross-validation provides a principled standard for selecting the optimal weighting of the hypothesis and the ChIP-chip data and for choosing the best refined hypothesis. We demonstrate how to derive hypotheses for proteins from 36 domain families. Using THEME together with these hypotheses, we analyze ChIP-chip datasets for 14 human and mouse proteins. In all the cases the identified motifs are consistent with the published data with regard to the binding specificity of the proteins.


Related Papers

No related papers found

Powered by citation graph analysis