M

M Beer

Johns Hopkins University

ORCID: 0000-0001-9955-3809

Publishes on Magnetic confinement fusion research, Genomics and Chromatin Dynamics, RNA and protein synthesis mechanisms. 5k papers and 18.8k citations.

5kPublications
18.8kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Expanded encyclopaedias of DNA elements in the human and mouse genomes
Cited by 2.6kOpen Access

Abstract The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal ( https://www.encodeproject.org ), including phase II ENCODE 1 and Roadmap Epigenomics 2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis -regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org ) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.

Comparisons and physics basis of tokamak transport models and turbulence simulations
A. M. Dimits, G. Bateman, M Beer et al.|Physics of Plasmas|2000
Cited by 1k

The predictions of gyrokinetic and gyrofluid simulations of ion-temperature-gradient (ITG) instability and turbulence in tokamak plasmas as well as some tokamak plasma thermal transport models, which have been widely used for predicting the performance of the proposed International Thermonuclear Experimental Reactor (ITER) tokamak [Plasma Physics and Controlled Nuclear Fusion Research, 1996 (International Atomic Energy Agency, Vienna, 1997), Vol. 1, p. 3], are compared. These comparisons provide information on effects of differences in the physics content of the various models and on the fusion-relevant figures of merit of plasma performance predicted by the models. Many of the comparisons are undertaken for a simplified plasma model and geometry which is an idealization of the plasma conditions and geometry in a Doublet III-D [Plasma Physics and Controlled Nuclear Fusion Research, 1986 (International Atomic Energy Agency, Vienna, 1987), Vol. 1, p. 159] high confinement (H-mode) experiment. Most of the models show good agreements in their predictions and assumptions for the linear growth rates and frequencies. There are some differences associated with different equilibria. However, there are significant differences in the transport levels between the models. The causes of some of the differences are examined in some detail, with particular attention to numerical convergence in the turbulence simulations (with respect to simulation mesh size, system size and, for particle-based simulations, the particle number). The implications for predictions of fusion plasma performance are also discussed.

Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features
Mahmoud Ghandi, Dongwon Lee, Morteza Mohammad-Noori et al.|PLoS Computational Biology|2014
Cited by 573Open Access

Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.