ChEBI: a database and ontology for chemical entities of biological interestChemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on 'small' chemical compounds. The molecular entities in question are either natural products or synthetic products used to intervene in the processes of living organisms. Genome-encoded macromolecules (nucleic acids, proteins and peptides derived from proteins by cleavage) are not as a rule included in ChEBI. In addition to molecular entities, ChEBI contains groups (parts of molecular entities) and classes of entities. ChEBI includes an ontological classification, whereby the relationships between molecular entities or classes of entities and their parents and/or children are specified. ChEBI is available online at http://www.ebi.ac.uk/chebi/
Molecular evolution of P450 superfamily and P450‐containing monooxygenase systemsThis paper reviews the classification of the P450 superfamily which is mainly based on sequence homology. The widely accepted classification by Nebert et al. [(1991) DNA Cell Biol. 10, 1-14] as well as the results of a 'two-step' multiple sequence alignment technique show that the molecular evolution of P450s, in contrast to that of many protein families, does not follow phylogeny. The data suggest that during the evolution of P450s, gene duplications and gene fusions, horizontal gene transfer and intron loss events have occurred. 'Weak' and 'strong' hierarchies in the clustering of P450 sequences were revealed. A novel evolutionary tree of the P450 superfamily has been constructed using a multiple alignment of consensus sequences. The simple classification of known P450-containing monooxygenase systems into three-, two- and one-component systems is further discussed. Particularly, the multidomain enzyme, nitric oxide synthase (NOS), should be classified as an example of a eukaryotic one-component P450 system since its N-terminal (haem) domain exhibits similarity with microsomal P450s.
Bioinorganic motifs: towards functional classification of metalloproteinsKirill Degtyarenko|Bioinformatics|2000 UNLABELLED: The habitat of bioinorganic motifs (BIMs) is at the interface of biological inorganic chemistry and bioinformatics. BIM is defined as a common structural feature shared by functionally related, but not necessarily homologous, proteins, and consisting of the metal atom(s) and first coordination shell ligands. BIMs appear to be suitable for classification of metal centres at any level, from groups of unrelated proteins with similar function to different functional states of the same protein, and for description of possible evolutionary relationships of metalloproteins. However, they have not attracted wide attention from the bioinformatics community. Although their presence is appreciated, they are difficult to predict-therefore the current 'high-throughput' initiatives are likely to miss or ignore them altogether. The protein sequence databases do not distinguish between proteins containing different prosthetic groups (unless they have different sequences) or between apo- and holoprotein. On the other hand, the protein structure databases include data on 'hetero compounds' of various origin but these data are often inconsistent. A number of specialized databases dealing with BIMs and attempts to classify them are reviewed. SUPPLEMENTARY INFORMATION: The additional bibliography and list of Internet resources on bioinorganic chemistry are available at http://www.ebi.ac.uk/ approximately kirill/biometal/
Structural domains of P450-containing monooxygenase systemsKirill Degtyarenko|Protein Engineering Design and Selection|1995 All known P450-containing monooxygenase systems share common structural and functional domain architecture. Apart from P450 itself, these systems can comprise several fundamentally different protein components or domains, all of which are shared by other multicomponent/multidomain enzyme systems with various functions: FAD flavoprotein or domain, FMN domain, Fe2S2 ferredoxin, Fe3S4 ferredoxin, and cytochrome b5. Either FMN domain, ferredoxins or cytochrome b5 serve as the electron transport intermediate between the FAD domain and P450. The molecular evolution of both P450-containing systems and of each particular component does not follow phylogeny in general. Gene fusion and horizontal gene transfer events can lead to the appearance of novel redox chains in the same manner that artificial chimeric proteins can be constructed by humans. Recent studies using genetic and protein engineering techniques to investigate the separate domains and their interaction are described.
Novel developments with the PRINTS protein fingerprint databaseThe PRINTS database of protein family 'fingerprints' is a diagnostic resource that complements the PROSITE dictionary of sites and patterns. Unlike regular expressions, fingerprints exploit groups of conserved motifs within sequence alignments to build characteristic signatures of family membership. Thus fingerprints inherently offer improved diagnostic reliability by virtue of the mutual context provided by motif neighbours. To date, 600 fingerprints have been constructed and stored in PRINTS, representing a 50% increase in the size of the database in the last year. The current version, 13.0, encodes approximately 3000 motifs, covering a range of globular and membrane proteins, modular polypeptides, and so on. The database is accessible via UCL's Bioinformatics World Wide Web (WWW) server at http://www.biochem.ucl.ac.uk/bsm/dbbrowser / . We describe here progress with the database, its Web interface, and a recent exciting development: the integration of a novel colour alignment editor (http://www.biochem.ucl.ac.uk/bsm/dbbrowser++ +/CINEMA ), which allows visualisation and interactive manipulation of PRINTS alignments over the Internet.