Karolinska Institutet
ORCID: 0000-0003-2892-673XPublishes on Neuroblastoma Research and Treatments, Cancer, Hypoxia, and Metabolism, Single-cell and spatial transcriptomics. 88 papers and 7.1k citations.
Add your photo, update your bio, and get notified when your ranking changes.
Expression from both alleles is generally observed in analyses of diploid cell populations, but studies addressing allelic expression patterns genome-wide in single cells are lacking. Here, we present global analyses of allelic expression across individual cells of mouse preimplantation embryos of mixed background (CAST/EiJ × C57BL/6J). We discovered abundant (12 to 24%) monoallelic expression of autosomal genes and that expression of the two alleles occurs independently. The monoallelic expression appeared random and dynamic because there was considerable variation among closely related embryonic cells. Similar patterns of monoallelic expression were observed in mature cells. Our allelic expression analysis also demonstrates the de novo inactivation of the paternal X chromosome. We conclude that independent and stochastic allelic transcription generates abundant random monoallelic expression in the mammalian cell.
The parts of the genome transcribed by a cell or tissue reflect the biological processes and functions it carries out. We characterized the features of mammalian tissue transcriptomes at the gene level through analysis of RNA deep sequencing (RNA-Seq) data across human and mouse tissues and cell lines. We observed that roughly 8,000 protein-coding genes were ubiquitously expressed, contributing to around 75% of all mRNAs by message copy number in most tissues. These mRNAs encoded proteins that were often intracellular, and tended to be involved in metabolism, transcription, RNA processing or translation. In contrast, genes for secreted or plasma membrane proteins were generally expressed in only a subset of tissues. The distribution of expression levels was broad but fairly continuous: no support was found for the concept of distinct expression classes of genes. Expression estimates that included reads mapping to coding exons only correlated better with qRT-PCR data than estimates which also included 3' untranslated regions (UTRs). Muscle and liver had the least complex transcriptomes, in that they expressed predominantly ubiquitous genes and a large fraction of the transcripts came from a few highly expressed genes, whereas brain, kidney and testis expressed more complex transcriptomes with the vast majority of genes expressed and relatively small contributions from the most expressed genes. mRNAs expressed in brain had unusually long 3'UTRs, and mean 3'UTR length was higher for genes involved in development, morphogenesis and signal transduction, suggesting added complexity of UTR-based regulation for these genes. Our results support a model in which variable exterior components feed into a large, densely connected core composed of ubiquitously expressed intracellular proteins.
Pluripotent embryonic stem (ES) cells can generate all cell types, but how cell lineages are initially specified and maintained during development remains largely unknown. Different classes of Sox transcription factors are expressed during neurogenesis and have been assigned important roles from early lineage specification to neuronal differentiation. Here we characterize the genome-wide binding for Sox2, Sox3, and Sox11, which have vital functions in ES cells, neural precursor cells (NPCs), and maturing neurons, respectively. The data demonstrate that Sox factor binding depends on developmental stage-specific constraints and reveal a remarkable sequential binding of Sox proteins to a common set of neural genes. Interestingly, in ES cells, Sox2 preselects for neural lineage-specific genes destined to be bound and activated by Sox3 in NPCs. In NPCs, Sox3 binds genes that are later bound and activated by Sox11 in differentiating neurons. Genes prebound by Sox proteins are associated with a bivalent chromatin signature, which is resolved into a permissive monovalent state upon binding of activating Sox factors. These data indicate that a single key transcription factor family acts sequentially to coordinate neural gene expression from the early lineage specification in pluripotent cells to later stages of neuronal development.