Gabriela Balderrama-Gutierrez

Systematic phenotyping and characterization of the 5xFAD mouse model of Alzheimer’s disease

Stefânia Forner, Shimako Kawauchi, Gabriela Balderrama-Gutierrez et al.|Scientific Data|2021

Cited by 479|Open Access

Mouse models of human diseases are invaluable tools for studying pathogenic mechanisms and testing interventions and therapeutics. For disorders such as Alzheimer's disease in which numerous models are being generated, a challenging first step is to identify the most appropriate model and age to effectively evaluate new therapeutic approaches. Here we conducted a detailed phenotypic characterization of the 5xFAD model on a congenic C57BL/6J strain background across its lifespan, including a seldom-analyzed 18-month-old time point, to provide temporally correlated phenotyping of this model and a template for characterization of new models of late-onset Alzheimer's disease (LOAD) as they are generated. This comprehensive analysis included quantification of plaque burden, Aβ biochemical levels, and neuropathology; neurophysiological measurements; behavioral and cognitive assessments; and evaluation of microglia, astrocytes, and neurons. Analysis of transcriptional changes as a function of aging was conducted using bulk-tissue RNA-seq data from microdissected cortices and hippocampi, which can be explored at the MODEL-AD Explorer and AD Knowledge Portal. This deep-phenotyping pipeline identified novel aspects of age-related pathology in the 5xFAD model.

Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

Francisco J. Pardo-Palacios, Dingjie Wang, Fairlie Reese et al.|Nature Methods|2024

Cited by 198|Open Access

The Long-read RNA-Seq Genome Annotation Assessment Project Consortium was formed to evaluate the effectiveness of long-read approaches for transcriptome analysis. Using different protocols and sequencing platforms, the consortium generated over 427 million long-read sequences from complementary DNA and direct RNA datasets, encompassing human, mouse and manatee species. Developers utilized these data to address challenges in transcript isoform detection, quantification and de novo transcript detection. The study revealed that libraries with longer, more accurate sequences produce more accurate transcripts than those with increased read depth, whereas greater read depth improved quantification accuracy. In well-annotated genomes, tools based on reference sequences demonstrated the best performance. Incorporating additional orthogonal data and replicate samples is advised when aiming to detect rare and novel transcripts or using reference-free approaches. This collaborative study offers a benchmark for current practices and provides direction for future method development in transcriptome analysis.

A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification

Dana Wyman, Gabriela Balderrama-Gutierrez, Fairlie Reese et al.|bioRxiv (Cold Spring Harbor Laboratory)|2019

Cited by 192|Open Access

Alternative splicing is widely acknowledged to be a crucial regulator of gene expression and is a key contributor to both normal developmental processes and disease states. While cost-effective and accurate for quantification, short-read RNA-seq lacks the ability to resolve full-length transcript isoforms despite increasingly sophisticated computational methods. Long-read sequencing platforms such as Pacific Biosciences (PacBio) and Oxford Nanopore (ONT) bypass the transcript reconstruction challenges of short reads. Here we introduce TALON, the ENCODE4 pipeline for platform-independent analysis of long-read transcriptomes. We apply TALON to the GM12878 cell line and show that while both PacBio and ONT technologies perform well at full-transcript discovery and quantification, each displays distinct technical artifacts. We further apply TALON to mouse hippocampus and cortex transcriptomes and find that 422 genes found in these regions have more reads associated with novel isoforms than with annotated ones. We demonstrate that TALON is capable of tracking both known and novel transcript models, as well as their expression levels, across datasets in both simple studies and larger projects. These properties will enable TALON users to move beyond the limitations of short-read data to perform isoform discovery and quantification in a uniform manner on existing and future long-read platforms.

Generation of a humanized Aβ expressing mouse demonstrating aspects of Alzheimer’s disease-like pathology

David Baglietto‐Vargas, Stefânia Forner, Lena Cai et al.|Nature Communications|2021

Cited by 114|Open Access

The majority of Alzheimer's disease (AD) cases are late-onset and occur sporadically; however, most mouse models of the disease harbor pathogenic mutations, rendering them better representations of familial autosomal-dominant forms of the disease. Here, we generated knock-in mice that express wild-type human Aβ under control of the mouse App locus. Remarkably, changing 3 amino acids in the mouse Aβ sequence to its wild-type human counterpart leads to age-dependent impairments in cognition and synaptic plasticity, brain volumetric changes, inflammatory alterations, the appearance of Periodic Acid-Schiff (PAS) granules, and changes in gene expression. In addition, with exon 14, which encodes the Aβ sequence, flanked by loxP sites, we show that Cre-mediated excision of exon 14 ablates hAβ expression, rescues cognition, and reduces the formation of PAS granules.

The ENCODE Uniform Analysis Pipelines

Benjamin C. Hitz, Jin-Wook Lee, Otto Jolanki et al.|bioRxiv (Cold Spring Harbor Laboratory)|2023

Cited by 99|Open Access

The Encyclopedia of DNA Elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues, using a wide array of experimental techniques to study the chromatin structure and the regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as to allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/), is publicly available on GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access for a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections, a prerequisite for successful integrative analyses. Database URL: https://www.encodeproject.org/

