JBrowse: A next-generation genome browserWe describe an open source, portable, JavaScript-based genome browser, JBrowse, that can be used to navigate genome annotations over the web. JBrowse helps preserve the user's sense of location by avoiding discontinuous transitions, instead offering smoothly animated panning, zooming, navigation, and track selection. Unlike most existing genome browsers, where the genome is rendered into images on the webserver and the role of the client is restricted to displaying those images, JBrowse distributes work between the server and client and therefore uses significantly less server overhead than previous genome browsers. We report benchmark results empirically comparing server- and client-side rendering strategies, review the architecture and design considerations of JBrowse, and describe a simple wiki plug-in that allows users to upload and share annotation tracks.
FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencingInflamed and non-inflamed classes of HCC: a revised immunogenomic classificationObjective We previously reported a characterisation of the hepatocellular carcinoma (HCC) immune contexture and described an immune-specific class. We now aim to further delineate the immunogenomic classification of HCC to incorporate features that explain responses/resistance to immunotherapy. Design We performed RNA and whole-exome sequencing, T-cell receptor (TCR)-sequencing, multiplex immunofluorescence and immunohistochemistry in a novel cohort of 240 HCC patients and validated our results in other cohorts comprising 660 patients. Results Our integrative analysis led to define: (1) the inflamed class of HCC (37%), which includes the previously reported immune subclass (22%) and a new immune-like subclass (15%) with high interferon signalling, cytolytic activity, expression of immune-effector cytokines and a more diverse T-cell repertoire. A 20-gene signature was able to capture ~90% of these tumours and is associated with response to immunotherapy. Proteins identified in liquid biopsies recapitulated the inflamed class with an area under the ROC curve (AUC) of 0.91; (2) The intermediate class, enriched in TP53 mutations (49% vs 29%, p=0.035), and chromosomal losses involving immune-related genes and; (3) the excluded class, enriched in CTNNB1 mutations (93% vs 27%, p<0.001) and PTK2 overexpression due to gene amplification and promoter hypomethylation. CTNNB1 mutations outside the excluded class led to weak activation of the Wnt-βcatenin pathway or occurred in HCCs dominated by high interferon signalling and type I antigen presenting genes. Conclusion We have characterised the immunogenomic contexture of HCC and defined inflamed and non-inflamed tumours. Two distinct CTNNB1 patterns associated with a differential role in immune evasion are described. These features may help predict immune response in HCC.
Molecular characterisation of hepatocellular carcinoma in patients with non-alcoholic steatohepatitisRoser Pinyol, Sara Torrecilla, Huan Wang et al.|Journal of Hepatology|2021 Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy changeBACKGROUND: Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs. RESULTS: Here, Dynalign, a program for predicting secondary structures common to two RNA sequences on the basis of minimizing folding free energy change, is utilized as a computational ncRNA detection tool. The Dynalign-computed optimal total free energy change, which scores the structural alignment and the free energy change of folding into a common structure for two RNA sequences, is shown to be an effective measure for distinguishing ncRNA from randomized sequences. To make the classification as a ncRNA, the total free energy change of an input sequence pair can either be compared with the total free energy changes of a set of control sequence pairs, or be used in combination with sequence length and nucleotide frequencies as input to a classification support vector machine. The latter method is much faster, but slightly less sensitive at a given specificity. Additionally, the classification support vector machine method is shown to be sensitive and specific on genomic ncRNA screens of two different Escherichia coli and Salmonella typhi genome alignments, in which many ncRNAs are known. The Dynalign computational experiments are also compared with two other ncRNA detection programs, RNAz and QRNA. CONCLUSION: The Dynalign-based support vector machine method is more sensitive for known ncRNAs in the test genomic screens than RNAz and QRNA. Additionally, both Dynalign-based methods are more sensitive than RNAz and QRNA at low sequence pair identities. Dynalign can be used as a comparable or more accurate tool than RNAz or QRNA in genomic screens, especially for low-identity regions. Dynalign provides a method for discovering ncRNAs in sequenced genomes that other methods may not identify. Significant improvements in Dynalign runtime have also been achieved.