UniProt: the Universal Protein Knowledgebase in 2025Alex Bateman, María Martin, Sandra Orchard et al.|Nucleic Acids Research|2024 The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest functional data and use machine learning techniques. We also encourage community curation to ensure key publications are not missed. We provide an update on the automatic annotation methods used by UniProtKB to predict information for unreviewed entries describing unstudied proteins. Finally, updates to the UniProt website are described, including a new tab linking protein to genomic information. In recognition of its value to the scientific community, the UniProt database has been awarded Global Core Biodata Resource status.
Aromatic amino‐acid residues at the active and peripheral anionic sites control the binding of E2020 (Aricept®) to cholinesterasesAshima Saxena, James M. Fedorko, C R Vinayaka et al.|European Journal of Biochemistry|2003 E2020 (R,S)-1-benzyl-4-[(5,6-dimethoxy-1-indanon)-2-yl]methyl)piperidine hydrochloride is a piperidine-based acetylcholinesterase (AChE) inhibitor that was approved for the treatment of Alzheimer's disease in the United States. Structure-activity studies of this class of inhibitors have indicated that both the benzoyl containing functionality and the N-benzylpiperidine moiety are the key features for binding and inhibition of AChE. In the present study, the interaction of E2020 with cholinesterases (ChEs) with known sequence differences, was examined in more detail by measuring the inhibition constants with Torpedo AChE, fetal bovine serum AChE, human butyrylcholinesterase (BChE), and equine BChE. The basis for particular residues conferring selectivity was then confirmed by using site-specific mutants of the implicated residue in two template enzymes. Differences in the reactivity of E2020 toward AChE and BChE (200- to 400-fold) show that residues at the peripheral anionic site such as Asp74(72), Tyr72(70), Tyr124(121), and Trp286(279) in mammalian AChE may be important in the binding of E2020 to AChE. Site-directed mutagenesis studies using mouse AChE showed that these residues contribute to the stabilization energy for the AChE-E2020 complex. However, replacement of Ala277(Trp279) with Trp in human BChE does not affect the binding of E2020 to BChE. Molecular modeling studies suggest that E2020 interacts with the active-site and the peripheral anionic site in AChE, but in the case of BChE, as the gorge is larger, E2020 cannot simultaneously interact at both sites. The observation that the KI value for mutant AChE in which Ala replaced Trp286 is similar to that for wild-type BChE, further confirms our hypothesis.
UniRule: a unified rule resource for automatic annotation in the UniProt KnowledgebaseMOTIVATION: The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result of genome sequencing and the prediction of protein-coding genes. Providing functional annotation for these proteins presents a significant and continuing challenge. RESULTS: In response to this challenge, UniProt has developed a method of annotation, known as UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) developed by the members of the UniProt consortium. UniRule uses protein family signatures from InterPro, combined with taxonomic and other constraints, to select sets of reviewed proteins which have common functional properties supported by experimental evidence. This annotation is propagated to unreviewed records in UniProtKB that meet the same selection criteria, most of which do not have (and are never likely to have) experimentally verified functional annotation. Release 2020_01 of UniProtKB contains 6496 UniRule rules which provide annotation for 53 million proteins, accounting for 30% of the 178 million records in UniProtKB. UniRule provides scalable enrichment of annotation in UniProtKB. AVAILABILITY AND IMPLEMENTATION: UniRule rules are integrated into UniProtKB and can be viewed at https://www.uniprot.org/unirule/. UniRule rules and the code required to run the rules, are publicly available for researchers who wish to annotate their own sequences. The implementation used to run the rules is known as UniFIRE and is available at https://gitlab.ebi.ac.uk/uniprot-public/unifire.
Computational analysis and identification of amino acid sites in dengue E proteins relevant to development of diagnostics and vaccinesUniRule: a unified rule resource for automatic annotation in the UniProt Knowledgebase