Notes on Statistics for Physicists, Revised

4. MAXIMUM-LIKELIHOOD METHOD

The preceding section was devoted to the case in which one had a discrete set of hypotheses among which to choose. It is more common in physics to have an infinite set of hypotheses; i.e., a parameter that is a continuous variable. For example, in the µ-e decay distribution

the possible values for alpha ₀ belong to a continuous rather than a discrete set. In this case, as before, we invoke the same basic principle which says the relative probability of any two different values of alpha is the ratio of the probabilities of getting our particular experimental results, x_i, assuming first one and then the other, value of alpha is true. This probability function of alpha is called the likelihood function, curlyL ( alpha ).

	(2)
The likelihood function, (), is the joint probability density of getting a particular experimental result, x₁, ... , x_n, assuming f (;x) is the true normalized distribution function:

The relative probabilities of alpha can be displayed as a plot of curlyL ( alpha ) vs. alpha . The most probable value of alpha is called the maximum-likelihood solution alpha *. The rms (root-mean-square) spread of alpha about alpha * is a conventional measure of the accuracy of the determination alpha = alpha * . We shall call this Delta alpha .

(3)

In general, the likelihood function will be close to Gaussian (it can be shown to approach a Gaussian distribution as N -> infty ) and will look similar to Fig. 1b.

Fig. 1a represents what is called a case of poor statistics. In such a case, it is better to present a plot of curlyL ( alpha ) rather than merely quoting alpha * and Delta alpha . Straightforward procedures for obtaining Delta alpha are presented in Sections 6 and 7.

Figure 1. Two examples of likelihood functions curlyL ( alpha ).

A confirmation of this inverse probability approach is the Maximum-Likelihood Theorem, which is proved in Cramer [4] by use of direct probability. The theorem states that in the limit of large N, alpha * -> alpha ₀; and furthermore, there is no other method of estimation that is more accurate.

In the general case in which there are M parameters, alpha ₁, ..., alpha _M, to be determined, the procedure for obtaining the maximum likelihood solution is to solve the M simultaneous equations,

(4)