A

Ali S. Hadi

University of California, Riverside

ORCID: 0000-0002-1533-8333

Publishes on Advanced Statistical Methods and Models, Bayesian Modeling and Causal Inference, Advanced Statistical Process Monitoring. 211 papers and 11.3k citations.

211Publications
11.3kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Regression Analysis by Example
Samprit Chatterjee, Ali S. Hadi|Wiley series in probability and statistics|2006
Cited by 962

Praise for the Fourth Edition: "This book is . . . an excellent source of examples for regression analysis. It has been and still is readily readable and understandable." -Journal of the American Statistical Association Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. Regression Analysis by Example, Fifth Edition has been expanded

Influential Observations, High Leverage Points, and Outliers in Linear Regression
Samprit Chatterjee, Ali S. Hadi|Statistical Science|1986
Cited by 846Open Access

A bewilderingly large number of statistical quantities have been proposed to study outliers and influence of individual observations in regression analysis. In this article we describe the inter-relationships which exist among the proposed measures. An examination of these relationships leads us to conclude that only three of these measures along with some graphical displays can provide an analyst a complete picture of outliers (major discrepant points) and points which excessively influence the fitted regression equation. Illustrative examples based on real data are presented.

Identifying Multiple Outliers in Multivariate Data
Ali S. Hadi|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1992
Cited by 772

SUMMARY We propose a procedure for the detection of multiple outliers in multivariate data. Let X be an n × p data matrix representing n observations on p variates. We first order the n observations, using an appropriately chosen robust measure of outlyingness, then divide the data set into two initial subsets: A ‘basic’ subset which contains p +1 ‘good’ observations and a ‘non-basic’ subset which contains the remaining n - p - 1 observations. Second, we compute the relative distance from each point in the data set to the centre of the basic subset, relative to the (possibly singular) covariance matrix of the basic subset. Third, we rearrange the n observations in ascending order accordingly, then divide the data set into two subsets: A basic subset which contains the first p + 2 observations and a non-basic subset which contains the remaining n - p - 2 observations. This process is repeated until an appropriately chosen stopping criterion is met. The final non-basic subset of observations is declared an outlying subset. The procedure proposed is illustrated and compared with existing methods by using several data sets. The procedure is simple, computationally inexpensive, suitable for automation, computable with widely available software packages, effective in dealing with masking and swamping problems and, most importantly, successful in identifying multivariate outliers.