Inflammatory Bowel Disease and Mutations Affecting the Interleukin-10 ReceptorErik‐Oliver Glocker, Daniel Kotlarz, Kaan Boztuǧ et al.|New England Journal of Medicine|2009 BACKGROUND: The molecular cause of inflammatory bowel disease is largely unknown. METHODS: We performed genetic-linkage analysis and candidate-gene sequencing on samples from two unrelated consanguineous families with children who were affected by early-onset inflammatory bowel disease. We screened six additional patients with early-onset colitis for mutations in two candidate genes and carried out functional assays in patients' peripheral-blood mononuclear cells. We performed an allogeneic hematopoietic stem-cell transplantation in one patient. RESULTS: In four of nine patients with early-onset colitis, we identified three distinct homozygous mutations in genes IL10RA and IL10RB, encoding the IL10R1 and IL10R2 proteins, respectively, which form a heterotetramer to make up the interleukin-10 receptor. The mutations abrogate interleukin-10-induced signaling, as shown by deficient STAT3 (signal transducer and activator of transcription 3) phosphorylation on stimulation with interleukin-10. Consistent with this observation was the increased secretion of tumor necrosis factor alpha and other proinflammatory cytokines from peripheral-blood mononuclear cells from patients who were deficient in IL10R subunit proteins, suggesting that interleukin-10-dependent "negative feedback" regulation is disrupted in these cells. The allogeneic stem-cell transplantation performed in one patient was successful. CONCLUSIONS: Mutations in genes encoding the IL10R subunit proteins were found in patients with early-onset enterocolitis, involving hyperinflammatory immune responses in the intestine. Allogeneic stem-cell transplantation resulted in disease remission in one patient.
Faster sequential genetic linkage computations.Linkage analysis using maximum-likelihood estimation is a powerful tool for locating genes. As available data sets have grown, the computation required for analysis has grown exponentially and become a significant impediment. Others have previously shown that parallel computation is applicable to linkage analysis and can yield order-of-magnitude improvements in speed. In this paper, we demonstrate that algorithmic modifications can also yield order-of-magnitude improvements, and sometimes much more. Using the software package LINKAGE, we describe a variety of algorithmic improvements that we have implemented, demonstrating both how these techniques are applied and their power. Experiments show that these improvements speed up the programs by an order of magnitude, on problems of moderate and large size. All improvements were made only in the combinatorial part of the code, without restoring to parallel computers. These improvements synthesize biological principles with computer science techniques, to effectively restructure the time-consuming computations in genetic linkage analysis.
Database indexing for production MegaBLAST searchesMOTIVATION: The BLAST software package for sequence comparison speeds up homology search by preprocessing a query sequence into a lookup table. Numerous research studies have suggested that preprocessing the database instead would give better performance. However, production usage of sequence comparison methods that preprocess the database has been limited to programs such as BLAT and SSAHA that are designed to find matches when query and database subsequences are highly similar. RESULTS: We developed a new version of the MegaBLAST module of BLAST that does the initial phase of finding short seeds for matches by searching a database index. We also developed a program makembindex that preprocesses the database into a data structure for rapid seed searching. We show that the new 'indexed MegaBLAST' is faster than the 'non-indexed' version for most practical uses. We show that indexed MegaBLAST is faster than miBLAST, another implementation of BLAST nucleotide searching with a preprocessed database, for most of the 200 queries we tested. To deploy indexed MegaBLAST as part of NCBI'sWeb BLAST service, the storage of databases and the queueing mechanism were modified, so that some machines are now dedicated to serving queries for a specific database. The response time for such Web queries is now faster than it was when each computer handled queries for multiple databases. AVAILABILITY: The code for indexed MegaBLAST is part of the blastn program in the NCBI C++ toolkit. The preprocessor program makembindex is also in the toolkit. Indexed MegaBLAST has been used in production on NCBI's Web BLAST service to search one version of the human and mouse genomes since October 2007. The Linux command-line executables for blastn and makembindex, documentation, and some query sets used to carry out the tests described below are available in the directory: ftp://ftp.ncbi.nlm.nih.gov/pub/agarwala/indexed_megablast [corrected] SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
<i>STAT3</i> Mutations in the Hyper-IgE SyndromeBACKGROUND: The hyper-IgE syndrome (or Job's syndrome) is a rare disorder of immunity and connective tissue characterized by dermatitis, boils, cyst-forming pneumonias, elevated serum IgE levels, retained primary dentition, and bone abnormalities. Inheritance is autosomal dominant; sporadic cases are also found. METHODS: We collected longitudinal clinical data on patients with the hyper-IgE syndrome and their families and assayed the levels of cytokines secreted by stimulated leukocytes and the gene expression in resting and stimulated cells. These data implicated the signal transducer and activator of transcription 3 gene (STAT3) as a candidate gene, which we then sequenced. RESULTS: We found increased levels of proinflammatory gene transcripts in unstimulated peripheral-blood neutrophils and mononuclear cells from patients with the hyper-IgE syndrome, as compared with levels in control cells. In vitro cultures of mononuclear cells from patients that were stimulated with lipopolysaccharide, with or without interferon-gamma, had higher tumor necrosis factor alpha levels than did identically treated cells from unaffected persons (P=0.003). In contrast, the cells from patients with the hyper-IgE syndrome generated lower levels of monocyte chemoattractant protein 1 in response to the presence of interleukin-6 (P=0.03), suggesting a defect in interleukin-6 signaling through its downstream mediators, one of which is STAT3. We identified missense mutations and single-codon in-frame deletions in STAT3 in 50 familial and sporadic cases of the hyper-IgE syndrome. Eighteen discrete mutations, five of which were hot spots, were predicted to directly affect the DNA-binding and SRC homology 2 (SH2) domains. CONCLUSIONS: Mutations in STAT3 underlie sporadic and dominant forms of the hyper-IgE syndrome, an immunodeficiency syndrome involving increased innate immune response, recurrent infections, and complex somatic features.
Protein database searches using compositionally adjusted substitution matricesAlmost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long-standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein-protein version of blast.