E

Eivind Hovig

Oslo University Hospital

ORCID: 0000-0002-9103-1077

Publishes on Cancer Genomics and Diagnostics, Gene expression and cancer classification, Genetic factors in colorectal cancer. 438 papers and 17.3k citations.

438Publications
17.3kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles
Cited by 1kOpen Access

JASPAR (https://jaspar.elixir.no/) is a widely-used open-access database presenting manually curated high-quality and non-redundant DNA-binding profiles for transcription factors (TFs) across taxa. In this 10th release and 20th-anniversary update, the CORE collection has expanded with 329 new profiles. We updated three existing profiles and provided orthogonal support for 72 profiles from the previous release's UNVALIDATED collection. Altogether, the JASPAR 2024 update provides a 20% increase in CORE profiles from the previous release. A trimming algorithm enhanced profiles by removing low information content flanking base pairs, which were likely uninformative (within the capacity of the PFM models) for TFBS predictions and modelling TF-DNA interactions. This release includes enhanced metadata, featuring a refined classification for plant TFs' structural DNA-binding domains. The new JASPAR collections prompt updates to the genomic tracks of predicted TF binding sites (TFBSs) in 8 organisms, with human and mouse tracks available as native tracks in the UCSC Genome browser. All data are available through the JASPAR web interface and programmatically through its API and the updated Bioconductor and pyJASPAR packages. Finally, a new TFBS extraction tool enables users to retrieve predicted JASPAR TFBSs intersecting their genomic regions of interest.

Database of p53 gene somatic mutations in human tumors and cell lines.
Cited by 863

A data base is described in which over 2,500 mutations in the p53 gene of human tumors and tumor cell lines are compiled from a systematic search of reports published before 1 January 1994. Data from 1994 are being added intermittently, with a systematic search and update scheduled for December, 1994. The compilation has been deposited with the EMBL Data Library and is available in electronic form free of charge. This report contains a rationale for the compilation, a brief summary of the major findings and a description of the data base.

Ten Simple Rules for Reproducible Computational Research
Geir Kjetil Sandve, Anton Nekrutenko, James Taylor et al.|PLoS Computational Biology|2013
Cited by 804Open Access

Replication is the cornerstone of a cumulative science [1]. However, new tools and technologies, massive amounts of data, interdisciplinary approaches, and the complexity of the questions being asked are complicating replication efforts, as are increased pressures on scientists to advance their research [2]. As full replication of studies on independently collected data is often not feasible, there has recently been a call for reproducible research as an attainable minimum standard for assessing the value of scientific claims [3]. This requires that papers in experimental science describe the results and provide a sufficiently clear protocol to allow successful repetition and extension of analyses based on original data [4]. The importance of replication and reproducibility has recently been exemplified through studies showing that scientific papers commonly leave out experimental details essential for reproduction [5], studies showing difficulties with replicating published experimental results [6], an increase in retracted papers [7], and through a high number of failing clinical trials [8], [9]. This has led to discussions on how individual researchers, institutions, funding bodies, and journals can establish routines that increase transparency and reproducibility. In order to foster such aspects, it has been suggested that the scientific community needs to develop a “culture of reproducibility” for computational science, and to require it for published claims [3]. We want to emphasize that reproducibility is not only a moral responsibility with respect to the scientific field, but that a lack of reproducibility can also be a burden for you as an individual researcher. As an example, a good practice of reproducibility is necessary in order to allow previously developed methodology to be effectively applied on new data, or to allow reuse of code and results for new projects. In other words, good habits of reproducibility may actually turn out to be a time-saver in the longer run. We further note that reproducibility is just as much about the habits that ensure reproducible research as the technologies that can make these processes efficient and realistic. Each of the following ten rules captures a specific aspect of reproducibility, and discusses what is needed in terms of information handling and tracking of procedures. If you are taking a bare-bones approach to bioinformatics analysis, i.e., running various custom scripts from the command line, you will probably need to handle each rule explicitly. If you are instead performing your analyses through an integrated framework (such as GenePattern [10], Galaxy [11], LONI pipeline [12], or Taverna [13]), the system may already provide full or partial support for most of the rules. What is needed on your part is then merely the knowledge of how to exploit these existing possibilities. In a pragmatic setting, with publication pressure and deadlines, one may face the need to make a trade-off between the ideals of reproducibility and the need to get the research out while it is still relevant. This trade-off becomes more important when considering that a large part of the analyses being tried out never end up yielding any results. However, frequently one will, with the wisdom of hindsight, contemplate the missed opportunity to ensure reproducibility, as it may already be too late to take the necessary notes from memory (or at least much more difficult than to do it while underway). We believe that the rewards of reproducibility will compensate for the risk of having spent valuable time developing an annotated catalog of analyses that turned out as blind alleys. As a minimal requirement, you should at least be able to reproduce the results yourself. This would satisfy the most basic requirements of sound research, allowing any substantial future questioning of the research to be met with a precise explanation. Although it may sound like a very weak requirement, even this level of reproducibility will often require a certain level of care in order to be met. There will for a given analysis be an exponential number of possible combinations of software versions, parameter values, pre-processing steps, and so on, meaning that a failure to take notes may make exact reproduction essentially impossible. With this basic level of reproducibility in place, there is much more that can be wished for. An obvious extension is to go from a level where you can reproduce results in case of a critical situation to a level where you can practically and routinely reuse your previous work and increase your productivity. A second extension is to ensure that peers have a practical possibility of reproducing your results, which can lead to increased trust in, interest for, and citations of your work [6], [14]. We here present ten simple rules for reproducibility of computational research. These rules can be at your disposal for whenever you want to make your research more accessible—be it for peers or for your future self.

Cancer risks by gene, age, and gender in 6350 carriers of pathogenic mismatch repair variants: findings from the Prospective Lynch Syndrome Database
Cited by 640Open Access

PURPOSE: Pathogenic variants affecting MLH1, MSH2, MSH6, and PMS2 cause Lynch syndrome and result in different but imprecisely known cancer risks. This study aimed to provide age and organ-specific cancer risks according to gene and gender and to determine survival after cancer. METHODS: We conducted an international, multicenter prospective observational study using independent test and validation cohorts of carriers of class 4 or class 5 variants. After validation the cohorts were merged providing 6350 participants and 51,646 follow-up years. RESULTS: There were 1808 prospectively observed cancers. Pathogenic MLH1 and MSH2 variants caused high penetrance dominant cancer syndromes sharing similar colorectal, endometrial, and ovarian cancer risks, but older MSH2 carriers had higher risk of cancers of the upper urinary tract, upper gastrointestinal tract, brain, and particularly prostate. Pathogenic MSH6 variants caused a sex-limited trait with high endometrial cancer risk but only modestly increased colorectal cancer risk in both genders. We did not demonstrate a significantly increased cancer risk in carriers of pathogenic PMS2 variants. Ten-year crude survival was over 80% following colon, endometrial, or ovarian cancer. CONCLUSION: Management guidelines for Lynch syndrome may require revision in light of these different gene and gender-specific risks and the good prognosis for the most commonly associated cancers.