Estimating Optimal Transformations for Multiple Regression and Correlation

Leo Breiman(University of California System), Jerome H. Friedman(SLAC National Accelerator Laboratory)
Journal of the American Statistical Association
September 1, 1985
Cited by 1,595

Abstract

Abstract In regression analysis the response variable Y and the predictor variables X 1 …, Xp are often replaced by functions θ(Y) and Ø1(X 1), …, Ø p (Xp ). We discuss a procedure for estimating those functions θ and Ø1, …, Ø p that minimize e 2 = E{[θ(Y) — Σ Ø j (Xj )]2}/var[θ(Y)], given only a sample {(yk , xk1 , …, xkp ), 1 ⩽ k ⩽ N} and making minimal assumptions concerning the data distribution or the form of the solution functions. For the bivariate case, p = 1, θ and Ø satisfy ρ = p(θ, Ø) = maxθ,Øρ[θ(Y), Ø(X)], where ρ is the product moment correlation coefficient and ρ is the maximal correlation between X and Y. Our procedure thus also provides a method for estimating the maximal correlation between two variables.


Related Papers

An Analysis of Transformations
George E. P. Box, David R. Cox|Journal of the Royal Statistical Society Series B (Statistical Methodology)|1964|15k
Robust Locally Weighted Regression and Smoothing Scatterplots
William S. Cleveland|Journal of the American Statistical Association|1979|10.9k
Hedonic housing prices and the demand for clean air
David Harrison, Daniel L. Rubinfeld|Journal of Environmental Economics and Management|1978|1.8k
Consistent Nonparametric Regression
Charles J. Stone|The Annals of Statistics|1977|1.8k