M

Mei‐Ling Ting Lee

Mokwon University

ORCID: 0000-0001-5022-1061

Publishes on Statistical Methods and Inference, Statistical Methods and Bayesian Inference, Statistical Distribution Estimation and Applications. 155 papers and 7k citations.

155Publications
7kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Detection of microRNA Expression in Human Peripheral Blood Microvesicles
Cited by 1.4kOpen Access

BACKGROUND: MicroRNAs (miRNA) are small non-coding RNAs that regulate translation of mRNA and protein. Loss or enhanced expression of miRNAs is associated with several diseases, including cancer. However, the identification of circulating miRNA in healthy donors is not well characterized. Microvesicles, also known as exosomes or microparticles, circulate in the peripheral blood and can stimulate cellular signaling. In this study, we hypothesized that under normal healthy conditions, microvesicles contain miRNAs, contributing to biological homeostasis. METHODOLOGY/PRINCIPAL FINDINGS: Microvesicles were isolated from the plasma of normal healthy individuals. RNA was isolated from both the microvesicles and matched mononuclear cells and profiled for 420 known mature miRNAs by real-time PCR. Hierarchical clustering of the data sets indicated significant differences in miRNA expression between peripheral blood mononuclear cells (PBMC) and plasma microvesicles. We observed 71 miRNAs co-expressed between microvesicles and PBMC. Notably, we found 33 and 4 significantly differentially expressed miRNAs in the plasma microvesicles and mononuclear cells, respectively. Prediction of the gene targets and associated biological pathways regulated by the detected miRNAs was performed. The majority of the miRNAs expressed in the microvesicles from the blood were predicted to regulate cellular differentiation of blood cells and metabolic pathways. Interestingly, a select few miRNAs were also predicted to be important modulators of immune function. CONCLUSIONS: This study is the first to identify and define miRNA expression in circulating plasma microvesicles of normal subjects. The data generated from this study provides a basis for future studies to determine the predictive role of peripheral blood miRNA signatures in human disease and will enable the definition of the biological processes regulated by these miRNA.

Importance of replication in microarray gene expression studies: Statistical methods and evidence from repetitive cDNA hybridizations
Mei‐Ling Ting Lee, Frank C. Kuo, G. À. Whitmore et al.|Proceedings of the National Academy of Sciences|2000
Cited by 870Open Access

We present statistical methods for analyzing replicated cDNA microarray expression data and report the results of a controlled experiment. The study was conducted to investigate inherent variability in gene expression data and the extent to which replication in an experiment produces more consistent and reliable findings. We introduce a statistical model to describe the probability that mRNA is contained in the target sample tissue, converted to probe, and ultimately detected on the slide. We also introduce a method to analyze the combined data from all replicates. Of the 288 genes considered in this controlled experiment, 32 would be expected to produce strong hybridization signals because of the known presence of repetitive sequences within them. Results based on individual replicates, however, show that there are 55, 36, and 58 highly expressed genes in replicates 1, 2, and 3, respectively. On the other hand, an analysis by using the combined data from all 3 replicates reveals that only 2 of the 288 genes are incorrectly classified as expressed. Our experiment shows that any single microarray output is subject to substantial variability. By pooling data from replicates, we can provide a more reliable analysis of gene expression data. Therefore, we conclude that designing experiments with replications will greatly reduce misclassification rates. We recommend that at least three replicates be used in designing experiments by using cDNA microarrays, particularly when gene expression data from single specimens are being analyzed.

Relation of pooled logistic regression to time dependent cox regression analysis: The framingham heart study
Ralph B. D’Agostino, Mei‐Ling Ting Lee, Albert J. Belanger et al.|Statistics in Medicine|1990
Cited by 791

A standard analysis of the Framingham Heart Study data is a generalized person-years approach in which risk factors or covariates are measured every two years with a follow-up between these measurement times to observe the occurrence of events such as cardiovascular disease. Observations over multiple intervals are pooled into a single sample and a logistic regression is employed to relate the risk factors to the occurrence of the event. We show that this pooled logistic regression is close to the time dependent covariate Cox regression analysis. Numerical examples covering a variety of sample sizes and proportions of events display the closeness of this relationship in situations typical of the Framingham Study. A proof of the relationship and the necessary conditions are given in the Appendix.

The Wilcoxon Signed Rank Test for Paired Comparisons of Clustered Data
Cited by 526

The Wilcoxon signed rank test is a frequently used nonparametric test for paired data (e.g., consisting of pre- and posttreatment measurements) based on independent units of analysis. This test cannot be used for paired comparisons arising from clustered data (e.g., if paired comparisons are available for each of two eyes of an individual). To incorporate clustering, a generalization of the randomization test formulation for the signed rank test is proposed, where the unit of randomization is at the cluster level (e.g., person), while the individual paired units of analysis are at the subunit within cluster level (e.g., eye within person). An adjusted variance estimate of the signed rank test statistic is then derived, which can be used for either balanced (same number of subunits per cluster) or unbalanced (different number of subunits per cluster) data, with an exchangeable correlation structure, with or without tied values. The resulting test statistic is shown to be asymptotically normal as the number of clusters becomes large, if the cluster size is bounded. Simulation studies are performed based on simulating correlated ranked data from a signed log-normal distribution. These studies indicate appropriate type I error for data sets with > or =20 clusters and a superior power profile compared with either the ordinary signed rank test based on the average cluster difference score or the multivariate signed rank test of Puri and Sen. Finally, the methods are illustrated with two data sets, (i) an ophthalmologic data set involving a comparison of electroretinogram (ERG) data in retinitis pigmentosa (RP) patients before and after undergoing an experimental surgical procedure, and (ii) a nutritional data set based on a randomized prospective study of nutritional supplements in RP patients where vitamin E intake outside of study capsules is compared before and after randomization to monitor compliance with nutritional protocols.

Threshold Regression for Survival Analysis: Modeling Event Times by a Stochastic Process Reaching a Boundary
Mei‐Ling Ting Lee, G. À. Whitmore|Statistical Science|2006
Cited by 270Open Access

Many researchers have investigated first hitting times as models for survival data. First hitting times arise naturally in many types of stochastic processes, ranging from Wiener processes to Markov chains. In a survival context, the state of the underlying process represents the strength of an item or the health of an individual. The item fails or the individual experiences a clinical endpoint when the process reaches an adverse threshold state for the first time. The time scale can be calendar time or some other operational measure of degradation or disease progression. In many applications, the process is latent (i.e., unobservable). Threshold regression refers to first-hitting-time models with regression structures that accommodate covariate data. The parameters of the process, threshold state and time scale may depend on the covariates. This paper reviews aspects of this topic and discusses fruitful avenues for future research.