Practical Statistics for Astronomers II

3.1 The Method of Least Squares: Regression Analysis

The method of least squares minimizes the sum of the squared residuals. There is formal justification for this when the residuals are Gaussian, and there is a long history and a vast literature (e.g. Williams 1959, Linnik 1961, Montgomery & Peck 1992).

For our particular example of fitting the "regression line", a straight line y = ax + b through N pairs of (x_i, y_i), minimizing the sum of the squared residuals in y yields

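    a = \frac{\sum_{i=1}^{N} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{N} (x_i - \bar{x})^2}    (6)

and

    b = \bar{y} - a\bar{x},    (7)

where \bar{x} and \bar{y} are the means of the x_i and the y_i.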
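The closed-form solution for the slope a and intercept b of the regression line can be sketched in a few lines of Python. This is a minimal illustration, not code from the original text: the function name fit_line and the sample data are assumptions chosen for the example, and only the standard library is used.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = a*x + b, minimizing squared residuals in y.

    fit_line is a hypothetical helper name used only for this sketch.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Sum of squared deviations in x, and of cross-products of deviations.
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    a = sxy / sxx             # slope
    b = y_mean - a * x_mean   # intercept
    return a, b


# Made-up data lying exactly on y = 2x + 1, so the fit should recover it.
a, b = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
print(a, b)
```

Writing the sums in terms of deviations from the means, rather than raw sums of x_i y_i and x_i^2, tends to lose less precision when the data sit far from the origin.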

When nothing is known about how or why the x_i and the y_i are related, many two-parameter curves may be fitted to the data pairs with the same expressions after a simple coordinate transformation. For example:

an exponential, y = b exp(ax): change y_i to ln y_i in the above expressions (the fitted intercept is then ln b);
a power-law, y = bx^a: change y_i to ln y_i and x_i to ln x_i (the fitted intercept is again ln b);
a parabola, y = b + ax²: change x_i to x_i².
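The three transformations listed above can be sketched as thin wrappers around a straight-line fit. The function names (fit_line, fit_exponential, fit_power_law, fit_parabola) and the test data are hypothetical, introduced only for this illustration. Note that the exponential and power-law fits return b after exponentiating the fitted intercept, and that they minimize residuals in ln y rather than in y.

```python
import math


def fit_line(xs, ys):
    """Least-squares straight line y = a*x + b (residuals in y)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, y_mean - a * x_mean


def fit_exponential(xs, ys):
    """Fit y = b*exp(a*x) via a line through (x, ln y); requires y > 0."""
    if any(y <= 0 for y in ys):
        raise ValueError("ln y is undefined for non-positive y")
    a, ln_b = fit_line(xs, [math.log(y) for y in ys])
    return a, math.exp(ln_b)


def fit_power_law(xs, ys):
    """Fit y = b*x**a via a line through (ln x, ln y); requires x, y > 0."""
    if any(x <= 0 for x in xs) or any(y <= 0 for y in ys):
        raise ValueError("ln x and ln y are undefined for non-positive values")
    a, ln_b = fit_line([math.log(x) for x in xs],
                       [math.log(y) for y in ys])
    return a, math.exp(ln_b)


def fit_parabola(xs, ys):
    """Fit y = b + a*x**2 via a line through (x**2, y)."""
    return fit_line([x * x for x in xs], ys)


# Made-up noise-free data for each model, so the inputs should be recovered.
xs = [1.0, 2.0, 4.0, 8.0]
a_pow, b_pow = fit_power_law(xs, [3.0 * x ** 1.5 for x in xs])
a_exp, b_exp = fit_exponential(xs, [0.5 * math.exp(0.25 * x) for x in xs])
a_par, b_par = fit_parabola(xs, [4.0 + 0.1 * x * x for x in xs])
```

With noisy data the logarithmic transforms reweight the points, so the result generally differs from a direct least-squares fit of the untransformed curve.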

(Note that the residuals cannot be Gaussian for all of these transformations: of course it is always possible to minimize the squares of the residuals, but it may well not be possible to retain the formal justification for doing so.)

There are many further variations available. Analogous algebraic expressions exist for weighted data pairs and for the fitting of polynomials of any order. For all of these, the residuals can be examined to determine which model best describes the relation in the data.
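As one illustration of these variations, the sketch below fits a weighted straight line and returns the residuals for inspection. It assumes that each pair carries a non-negative weight w_i (commonly 1/sigma_i^2 if the uncertainty sigma_i on y_i is known; that choice is an assumption here, not something stated in the text) and minimizes the weighted sum of squared residuals in y. The names fit_line_weighted and residuals are hypothetical.

```python
def fit_line_weighted(xs, ys, ws):
    """Weighted least-squares fit of y = a*x + b.

    Minimizes sum(w_i * (y_i - a*x_i - b)**2). The weights ws are an
    assumption of this sketch, e.g. 1/sigma_i**2 for known errors.
    """
    if not (len(xs) == len(ys) == len(ws)):
        raise ValueError("xs, ys and ws must have the same length")
    if any(w < 0 for w in ws):
        raise ValueError("weights must be non-negative")
    sw = sum(ws)
    if sw == 0:
        raise ValueError("at least one weight must be positive")
    # Weighted means replace the ordinary means of the unweighted fit.
    x_mean = sum(w * x for w, x in zip(ws, xs)) / sw
    y_mean = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - x_mean) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - x_mean) * (y - y_mean)
              for w, x, y in zip(ws, xs, ys))
    if sxx == 0:
        raise ValueError("weighted x values are identical; slope undefined")
    a = sxy / sxx
    return a, y_mean - a * x_mean


def residuals(xs, ys, a, b):
    """Residuals in y about the fitted line, for visual or numerical checks."""
    return [y - (a * x + b) for x, y in zip(xs, ys)]


# Made-up data: three points on y = 2x + 1 and one discrepant point
# that is given zero weight, so it should not influence the fit.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 100.0]
ws = [1.0, 1.0, 1.0, 0.0]
a, b = fit_line_weighted(xs, ys, ws)
res = residuals(xs, ys, a, b)
```

Setting all weights equal reduces this to the unweighted regression line. Systematic structure in the residuals, such as a trend or curvature, suggests that a different model or transformation should be tried.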