Maximum Likelihood Comparisons of Tully-Fisher and Redshift Data: Constraints on Omega and Biasing

5.2.1. Using Residual Correlations to Identify Poor Fits Quantitatively

In order to compare the observed residual correlations with the results from the mock catalogs, we would like to define a single statistic that summarizes the deviation of psi ( tau ) from unity. Let us define ( tau ) ident N_p( tau ) psi ( tau ) (cf. eq. [25]). In Appendix C, we show that ( tau ) approximates a Gaussian random variable of mean zero and variance N_p( tau ), if indeed the VELMOD residuals are uncorrelated on scale tau . [This property was used to compute the error bars on psi ( tau ) above.] To the degree this approximation is a good one, the quantity

(26)

will be distributed approximately as a chi ² variable with M degrees of freedom, where M is the number of separate bins in which ( tau ) is calculated. In contrast, if the residuals are strongly correlated on any scale tau , chi ² will exceed its expected value significantly.

However, because a single galaxy will appear in many different pairs in the correlation statistic, both within and between bins in tau , the assumptions made above do not hold rigorously. In Appendix C, we explore this issue further. For now, we appeal to the mock catalogs to assess how closely the quantity chi ² follows chi ² statistics. We computed it for each of the 20 mock catalog runs (Section 3) with beta _I = 1. We carried out the calculation to a maximum separation of 6400 km s^-1, in bins of width 200 km s^-1, so that M = 32, and found a mean value < chi ²> = 27.83±1.82, which may be compared with an expected value of 32 for a true chi ² statistic. The rms scatter in chi ² was 8.15, which is the same as that expected for a true chi ². The difference between the mean and expected values is 2.3 sigma , indicating that chi ² is not exactly a chi ² statistic, for reasons discussed in Appendix C. However, because the departure from true chi ² statistics is small, chi ² is a useful statistic for measuring goodness of fit when calibrated against the mock catalogs.

Before presenting chi ² for the real data, we consider its variation with beta _I for the mock catalogs. In Figure 18, we plot the average value of chi ² over the 20 mock catalogs at each value of beta _I for which VELMOD was run. Although the minimum is at beta _I = 1, it is not nearly as sharp as is that of the likelihood as function of beta _I (e.g., Fig. 2); this statistic does not have the power that the likelihood does for measuring beta _I. Indeed, for a single realization (the open symbols), the statistic has several local minima. However, it is apparent that a chi ² value much greater than its expected true value of ~ 28 will indicate a poor fit of the model to the data.

Figure 18. Residual correlation statistic chi ², defined by eq. (26), plotted as a function of beta _I for the mock catalogs. The filled symbols show an average over 20 mock catalogs; the open symbols show the values obtained for a single mock catalog.

In Figure 19, we plot the statistic chi ² as a function of beta _I for the real data, with and without the quadrupole included. The horizontal lines indicate the expected value of chi ², and the 1 sigma and 3 sigma deviations from it. Note first that the no-quadrupole model does not provide an acceptable fit for any value of beta _I. This is not a conclusion we could have reached on the basis of the likelihood analysis alone. When the quadrupole is included, the only values of beta _I that are unambiguously ruled out are beta _I = 0.1, 0.2, and 1.0. The best-fit model according to VELMOD, beta _I = 0.5 plus quadrupole, also has the smallest value of chi ². Given the multiple minima seen for one mock realization in Figure 18, this is not necessarily deeply significant. The statistic chi ² is suitable for identifying models that do not fit the data, but it does not have the power of the likelihood statistic for discriminating among those models that do fit.

Figure 19. Residual autocorrelation statistic chi ², defined by eq. (26), plotted as a function of beta _I for the real data, with and without the quadrupole modeled. The heavy solid line shows the expected value of the statistic, which was determined by averaging the derived value for 20 mock catalogs. The dot-dashed line and the dashed line show 1 sigma and 3 sigma deviations from this value, respectively. Note that when the quadrupole is not modeled, highly significant residual correlations are detected for all values of beta _I. (The no-quadrupole points for beta _I = 0.1 and 0.2 are not shown because their chi ² values are too large.)

In summary, the VELMOD likelihood maximization procedure is the proper one for determining which value of _I is better than others, but it cannot identify poor fits to our model. The residual correlation statistic ² can identify unacceptable fits but does not have the power to determine which of the acceptable fits is best. We have found that the IRAS velocity field with _I = 0.5, plus the external quadrupole, is both the best fit of those considered and an acceptable fit. Values of _I > 0.9 and _I < 0.3 are strongly ruled out.