The Bayesian model comparison approach based on the evaluation of the evidence is being increasingly applied to model building questions such as: are isocurvature contributions to the initial conditions required by the data [46, 58, 60, 182]? Is the Universe flat [183, 58, 70]? What is the best description of the primordial power spectrum for density perturbations [51, 184, 58, 185, 165, 47, 70]? Is dark energy best described as a cosmological constant [186, 50, 51, 187, 188, 189, 190, 191]? In this section we review the status of the field.
6.1. Evidence for the cosmological concordance model
Table 4 is a fairly extensive compilation
of recent results regarding possible extensions to (or reduction
of) the vanilla
CDM concordance
cosmological model introduced in
section 5.1. We have chosen to compile
only results obtained using the full Bayesian evidence, rather than
approximate model comparisons obtained via the information criteria because
the latter are often not adequate approximations, for the
reasons explained in section 4.7. Of
course, the outcome depends on the Occam's razor effect brought
about by the prior volume (and sometimes, by the choice of
parameterization). Where applicable, we have show the sensitivity
of the result on the prior assumptions by giving a ballpark range
of values for the Bayes factor, as presented in the original
studies. The reader ought to refer to the original works for the
precise prior and parameter choices and for the justification of
the assumed prior ranges.
Competing model | ![]() |
ln B | Ref | Data | Outcome |
Initial conditions | |||||
Isocurvature modes | |||||
CDM isocurvature | +1 | -7.6 | [58] | WMAP3+, LSS | Strong evidence for adiabaticity |
+ arbitrary correlations | +4 | -1.0 | [46] | WMAP1+, LSS, SN Ia | Undecided |
Neutrino entropy | +1 | [-2.5, -6.5]p | [60] | WMAP3+, LSS | Moderate to strong evidence for adiabaticity |
+ arbitrary correlations | +4 | -1.0 | [46] | WMAP1+, LSS, SN Ia | Undecided |
Neutrino velocity | +1 | [-2.5, -6.5]p | [60] | WMAP3+, LSS | Moderate to strong evidence for adiabaticity |
+ arbitrary correlations | +4 | -1.0 | [46] | WMAP1+, LSS, SN Ia | Undecided |
Primordial power spectrum | |||||
No tilt (ns = 1) | -1 | +0.4 | [47] | WMAP1+, LSS | Undecided |
[-1.1, -0.6]p | [51] | WMAP1+, LSS | Undecided | ||
-0.7 | [58] | WMAP1+, LSS | Undecided | ||
-0.9 | [70] | WMAP1+ | Undecided | ||
[-0.7, -1.7]p,d | [185] | WMAP3+ | ns = 1 weakly disfavoured | ||
-2.0 | [184] | WMAP3+, LSS | ns = 1 weakly disfavoured | ||
-2.6 | [70] | WMAP3+ | ns = 1 moderately disfavoured | ||
-2.9 | [58] | WMAP3+, LSS | ns = 1 moderately disfavoured | ||
<-3.9c | [65] | WMAP3+, LSS | Moderate evidence at best against ns ≠ 1 | ||
Running | +1 | [-0.6, 1.0]p,d | [185] | WMAP3+, LSS | No evidence for running |
< 0.2c | [165] | WMAP3+, LSS | Running not required | ||
Running of running | +2 | <0.4c | [165] | WMAP3+, LSS | Not required |
Large scales cut-off | +2 | [1.3, 2.2]p,d | [185] | WMAP3+, LSS | Weak support for a cut-off |
Matter-energy content | |||||
Non-flat Universe | +1 | -3.8 | [70] | WMAP3+, HST | Flat Universe moderately favoured |
-3.4 | [58] | WMAP3+, LSS, HST | Flat Universe moderately favoured | ||
Coupled neutrinos | +1 | -0.7 | [192] | WMAP3+, LSS | No evidence for non-SM neutrinos |
Dark energy sector | |||||
w(z)= weff ≠ -1 | +1 | [-1.3, -2.7]p | [186] | SN Ia | Weak to moderate support for ![]() |
-3.0 | [50] | SN Ia | Moderate support for
![]() |
||
-1.1 | [51] | WMAP1+, LSS, SN Ia | Weak support for
![]() |
||
[-0.2, -1]p | [187] | SN Ia, BAO, WMAP3 | Undecided | ||
[-1.6, -2.3]d | [188] | SN Ia, GRB | Weak support for
![]() |
||
w(z) = w0 + w1 z | +2 | [-1.5, -3.4]p | [186] | SN Ia | Weak to moderate support for
![]() |
-6.0 | [50] | SN Ia | Strong support for
![]() |
||
-1.8 | [187] | SN Ia, BAO, WMAP3 | Weak support for
![]() |
||
w(z) = w0 + wa(1 - a) | +2 | -1.1 | [187] | SN Ia, BAO, WMAP3 | Weak support for
![]() |
[-1.2, -2.6]d | [188] | SN Ia, GRB | Weak to moderate support for
![]() |
||
Reionization history | |||||
No reionization
(![]() |
-1 | -2.6 | [70] | WMAP3+, HST | ![]() |
No reionization and no tilt | -2 | -10.3 | [70] | WMAP3+, HST | Strongly disfavoured |
d Depending on the choice of datasets. | |||||
p Depending on the choice of priors. | |||||
c Upper bound using Bayesian calibrated
p-values, see section 4.5.
Data sets: WMAP1+ (WMAP3+): WMAP 1st year (3-yr) data and other CMB measurements. LSS: Large scale structures data. SN Ia: supernovae type Ia. BAO: baryonic acoustic oscillations. GRB: gamma ray bursts. |
As anticipated, the 6 parameters
CDM concordance
model is currently well supported by the data, as the inclusion of extra
parameters is not required by the Bayesian evidence. This is shown
by the fact that most model comparisons return either an undecided
result or they support the
CDM model
(negative values for ln
B in Table 4). The only exception is the
support for a cut-off on large scales in the power spectrum
reported by
[185].
This is clearly driven by the
anomalies in the large scale CMB power spectrum, which in this
case are interpreted as being a reflection of a lack of power in
the primordial power spectrum. Whether such anomalies are of
cosmological origin remains however an open question
[193,
194].
If extensions of the
model are not supported, reduction of
CDM to simpler
models is not viable, either: recent studies employing WMAP 3-yr data find
that a scale invariant spectrum with no spectral tilt is now
weakly to moderately disfavoured
[184,
70,
58,
65].
Also, a Universe with no reionization is no longer a good
description of CMB data, and a non-zero optical depth
is indeed required
[70].
A few further comments about the results reported in Table 4 are in place:
Let us now turn to models that are not nested within
CDM— i.e.,
alternative theoretical scenarios.
Table 5 gives some examples
of the outcome of the Bayesian model comparison with the
concordance model. As above, we restrict our considerations to
studies employing the full Bayesian evidence (there are many other
examples in the literature carrying out approximate model
comparison using information criteria instead). The model
comparison is often more difficult for non-nested models, as
priors must be specified for all of the parameters in the
alternative model (and in the
CDM model, as
well), in order to compute the evidence ratio. The usual
caveats on prior choice apply in this case. From
Table 5 it appears that the
data do not seem to require fundamental changes in our underlying
theoretical model, either in the form of Bianchi templates
representing a violation of cosmic isotropy (see also
[206]),
or as Lemaitre-Tolman-Bondi models
or fractal bubble scenarios with dressed cosmological parameters.
The anomalous dipole in the CMB temperature maps is a fine example
of Lindley's paradox. When fitting a dipolar template to the CMB
maps, the effective chi-square improves by 9 to 11 units
(depending on the details of the analysis) for 3 extra parameters
[205,
207],
which would be
deemed a "significant" effect using a standard goodness-of-fit
test. However, the Bayesian evidence analysis shows that the odds
in favour of an anomalous dipole are 9 to 1 at best
(corresponding to ln B < 2.2), which does not reach the
"moderate evidence at best" threshold. Hence Bayesian model
comparison is conservative, requiring a stronger evidence before
deeming an effect to be favoured.
Competing model | Npar | ln B | Ref | Data | Outcome |
Alternatives to FRW | |||||
Bianchi VIIh | 5 to 8 | [-0.9, 1.2]d,p | [54] | WMAP1, WMAP3 | Weak support (at best) for Bianchi template |
5 to 6 | [-0.1, -1.2]p | [203] | WMAP3 | No evidence after texture correction | |
LTB models | 4 | -3.6 | [204] | WMAP3, BAO, SN Ia | Moderate evidence against LTB |
Fractal bubble model | 2 | 0.3 | [89] | SN Ia | Undecided |
Asymmetry in the CMB | |||||
Anomalous dipole | 3 | 1.8 | [205] | WMAP3 | Weak evidence for anomalous dipole |
< 2.2c | [65] | WMAP3 | Weak evidence at best | ||
d Depending on the choice of datasets. | |||||
p Depending on the choice of priors. | |||||
c Upper bound using Bayesian calibrated p-values, see section 4.5. |
6.2. Other uses of the Bayesian evidence
Beside cosmological model building, the Bayesian evidence can be employed in many other different ways. Here we presents two aspects that are relevant to our topic, namely the applications to the field of multi-model inference and model selection forecasting.
![]() |
(48) |
where p(,
| d,
i)
is the posterior within each model
i, and it is
understood that the posterior has
non-zero support only along the parameter directions
i
⊂
that are
relevant for the model, and
delta-functions along all other directions. Each term is weighted
by the corresponding posterior model probability,
![]() |
(49) |
The prior model probabilities
p(i) are
usually set equal,
but a model preference can be incorporated here if necessary. The
model averaged posterior distribution of
Eq. (48) then represents the parameter
constraints obtained independently of the model choice, which has
been marginalized over. Unless one of the models is overwhelmingly
more probable than the others (in which case the model averaging
essentially disappears, as all of the weights for the other models
go to zero), the model-averaged posterior distribution can be
significantly different from the model-specific distribution. A
counter-intuitive consequence is that in the case of dark energy,
the model-averaged posterior shows tighter constraints
around w = -1 than any of the evolving dark energy models by
itself. This comes about because
CDM is the
preferred model and
hence much of the weight in the model-averaged posterior is
shifted to the point w = -1
[208].
For further details on multi-model inference, see e.g.
[209,
210].
The procedure is as follows (see [211] for details and the application to dark energy scenarios). At every point in parameter space, mock data from the future observation are generated and the Bayes factor between the competing models is computed, for example between an evolving dark energy and a cosmological constant. Then one delimits in parameter space the region where the future data would not be able to deliver a clear model comparison verdict, for example | ln B | < 5 (evidence falling short of the "strong" threshold). The experiment with the smallest "model-confusion" volume in parameter space is to be preferred, since it achieves the highest discriminative power between models. An application of a related technique to the spectral index from the Planck satellite is presented in [212, 213].
Alternatively, we can investigate the full probability
distribution for the Bayes factor from a future observation. This
allows to make probabilistic statements regarding the outcome of a
future model comparison, and in particular to quantify the
probability that a new observation will be able to achieve a
certain level of evidence for one of the models, given current
knowledge. This technique is based on the predictive
distribution for a future observation, which gives the expected
posterior on for an
observation with experimental capabilities described by e (this
might describe sky coverage, noise levels, target redshift, etc):
![]() |
(50) |
Here, d are the currently available observations,
p(i
| d) is the current model posterior,
p(
|
i⋆,
e,
i) is
the posterior on
from a
future observation e computed assuming
i⋆
are the correct model parameters, while each term is weighted by the
present probability that
i⋆
is the true value of the parameters,
p(
i⋆
| d,
i).
The sum over i ensures that the
prediction averages over models, as well. From Eq. (50)
we can compute the corresponding probability distribution for ln
B from experiment e, for example by employing MCMC techniques
(further details are given in
[59]).
This method is called PPOD, for predictive posterior odds
distribution and can be useful in the context of experiment design and
optimization, when the aim is to determine which choice of e
will lead to the best scientific return from the experiment, in
this case in terms of model selection capabilities (see
[214,
215,
216]
for a discussion of performance optimization for parameter
constraints). For further details on Bayes factor forecasts and
experiment design, see
[217].