Planck 2015 results
Free Access
Issue
A&A
Volume 594, October 2016
Planck 2015 results
Article Number A20
Number of page(s) 65
Section Cosmology (including clusters of galaxies)
DOI https://doi.org/10.1051/0004-6361/201525898
Published online 20 September 2016

© ESO, 2016

1. Introduction

The precise measurements by Planck1 of the cosmic microwave background (CMB) anisotropies covering the entire sky and over a broad range of scales, from the largest visible down to a resolution of approximately 5′, provide a powerful probe of cosmic inflation, as detailed in the Planck 2013 inflation paper (Planck Collaboration XXII 2014, hereafter PCI13). In the 2013 results, the robust detection of the departure of the scalar spectral index from exact scale invariance, i.e., ns< 1, at more than 5σ confidence, as well as the lack of the observation of any statistically significant running of the spectral index, were found to be consistent with simple slow-roll models of inflation. Single-field inflationary models with a standard kinetic term were also found to be compatible with the new tight upper bounds on the primordial non-Gaussianity parameters fNL reported in Planck Collaboration XXVI (2014). No evidence of isocurvature perturbations as generated in multi-field inflationary models (PCI13) or by cosmic strings or topological defects was found (Planck Collaboration XXV 2014). The Planck 2013 results overall favoured the simplest inflationary models. However, we noted an amplitude deficit for multipoles ≲ 40 whose statistical significance relative to the six-parameter base Λ cold dark matter (ΛCDM) model is only about 2σ, as well as other anomalies on large angular scales but also without compelling statistical significance (Planck Collaboration XXIII 2014). The constraint on the tensor-to-scalar ratio, r< 0.12 at 95% CL, inferred from the temperature power spectrum alone, combined with the determination of ns, suggested models with concave potentials.

This paper updates the implications for inflation in the light of the Planck full mission temperature and polarization data. The Planck 2013 cosmology results included only the nominal mission, comprising the first 14 months of the data taken, and used only the temperature data. However, the full mission includes the full 29 months of scientific data taken by the cryogenically cooled high frequency instrument (HFI; which ended when the 3\hbox{${}^3$}He/4\hbox{${}^4$}He supply for the final stage of the cooling chain ran out) and the approximately four years of data taken by the low frequency instrument (LFI), which covered a longer period than the HFI because the LFI did not rely on cooling down to 100 mK for its operation. For a detailed discussion of the new likelihood and a comparison with the 2013 likelihood, we refer the reader to Planck Collaboration XI (2016) and Planck Collaboration XIII (2016), but we mention here some highlights of the differences between the 2013 and 2015 data processing and likelihoods: (1) improvements in the data processing such as beam characterization and absolute calibration at each frequency result in a better removal of systematic effects and (2) the 2015 temperature high- likelihood uses half-mission cross-power spectra over more of the sky, owing to less aggressive Galactic cuts. The use of polarization information in the 2015 likelihood release contributes to the constraining power of Planck in two principal ways: (1) the measurement of the E-mode polarization at large angular scales (presently based on the 70 GHz channel) constrains the reionization optical depth, τ, independently of other estimates using ancillary data; and (2) the measurement of the TE and EE spectra at ≥ 30 at the same frequencies used for the TT spectra (100, 143, and 217 GHz) helps break parameter degeneracies, particularly for extended cosmological models (beyond the baseline six-parameter model). A full analysis of the Planck low- polarization is still in progress and will be the subject of another forthcoming set of Planck publications.

The Planck 2013 results have sparked a revival of interest in several aspects of inflationary models. We mention here a few examples without the ambition to be exhaustive. A lively debate arose on the conceptual problems of some of the inflationary models favoured by the Planck 2013 data (Ijjas et al. 2013, 2014; Guth et al. 2014; Linde 2014). The interest in the R2 inflationary model originally proposed by Starobinsky (1980) increased, since its predictions for cosmological fluctuations (Mukhanov & Chibisov 1981; Starobinsky 1983) are compatible with the Planck 2013 results (PCI13). It has been shown that supergravity motivates a potential similar to the Einstein gravity conformal representation of the R2 inflationary model in different contexts (Ellis et al. 2013a,b; Buchmüller et al. 2013; Farakos et al. 2013; Ferrara et al. 2013b). A similar potential can also be generated by spontaneous breaking of conformal symmetry (Kallosh & Linde 2013b).

The constraining power of Planck also motivated a comparison between large numbers of inflationary models (Martin et al. 2014) and stimulated different perspectives on how best to compare theoretical inflationary predictions with observations based on the parameterized dependence of the Hubble parameter on the scale factor during inflation (Mukhanov 2013; Binétruy et al. 2015; Garcia-Bellido & Roest 2014). The interpretation of the asymmetries on large angular scales (Planck Collaboration XXIII 2014) also prompted a reanalysis of the primordial dipole modulation (Lyth 2013; Liddle & Cortês 2013; Kanno et al. 2013) of curvature perturbations during inflation.

Another recent development has been the renewed interest in possible tensor modes generated during inflation, sparked by the BICEP2 results (BICEP2 Collaboration 2014a,b). The BICEP2 team suggested that the B-mode polarization signal detected at 50 << 150 at a single frequency (150 GHz) might be of primordial origin. However, a crucial step in this possible interpretation was excluding an explanation based on polarized thermal dust emission from our Galaxy. The BICEP2 team put forward a number of models to estimate the likely contribution from dust, but at the time relevant observational data were lacking, and this modelling involved a high degree of extrapolation. If dust polarization were negligible in the observed patch of 380 deg2, this interpretation would lead to a tensor-to-scalar ratio of r=0.2-0.05+0.07\hbox{$r = 0.2^{+0.07}_{-0.05}$} for a scale-invariant spectrum. A value of r ≈ 0.2, as suggested by BICEP2 Collaboration (2014b), would have obviously changed the Planck 2013 perspective according to which slow-roll inflationary models are favoured, and such a high value of r would also have required a strong running of the scalar spectral index, or some other modification from a simple power-law spectrum, to reconcile the contribution of gravitational waves to temperature anisotropies at low multipoles with the observed TT spectrum.

The interpretation of the B-mode signal in terms of gravitational waves alone presented in BICEP2 Collaboration (2014b) was later cast in doubt by Planck measurements of dust polarization at 353 GHz (Planck Collaboration Int. XIX 2015; Planck Collaboration Int. XX 2015; Planck Collaboration Int. XXI 2015; Planck Collaboration Int. XXII 2015). The Planck measurements characterized the frequency dependence of intensity and polarization of the Galactic dust emission, and moreover showed that the polarization fraction is higher than expected in regions of low dust emission. With the help of the Planck measurements of Galactic dust properties (Planck Collaboration Int. XIX 2015), it was shown that the interpretation of the B-mode polarization signal in terms of a primordial tensor signal plus a lensing contribution was not statistically preferred to an explanation based on the expected dust signal at 150 GHz plus a lensing contribution (see also Flauger et al. 2014a; Mortonson & Seljak 2014). Subsequently, Planck Collaboration Int. XXX (2016) extrapolated the PlanckB-mode power spectrum of dust polarization at 353 GHz over the multipole range 40 << 120 to 150 GHz, showing that the B-mode polarization signal detected by BICEP2 could be entirely due to dust.

More recently, a BICEP2/Keck Array-Planck (BKP) joint analysis (BICEP2/Keck Array and Planck Collaborations 2015, herafter BKP) combined the high-sensitivity B-mode maps from BICEP2 and Keck Array with the Planck maps at higher frequencies where dust emission dominates. A study of the cross-correlations of all these maps in the BICEP2 field found the absence of any statistically significant evidence for primordial gravitational waves, setting an upper limit of r< 0.12 at 95% CL (BKP). Although this upper limit is numerically almost identical to the Planck 2013 result obtained combining the nominal mission temperature data with WMAP polarization to remove parameter degeneracies (Planck Collaboration XVI 2014; Planck Collaboration XXII 2014), the BKP upper bound is much more robust against modifications of the inflationary model, since B modes are insensitive to the shape of the predicted scalar anisotropy pattern. In Sect. 13 we explore how the recent BKP analysis constrains inflationary models.

This paper is organized as follows. Section 2 briefly reviews the additional information on the primordial cosmological fluctuations encoded in the polarization angular power spectrum. Section 3 describes the statistical methodology as well as the Planck and other likelihoods used throughout the paper. Sections 4 and 5 discuss the Planck 2015 constraints on scalar and tensor fluctuations, respectively. Section 6 is dedicated to constraints on the slow-roll parameters and provides a Bayesian comparison of selected slow-roll inflationary models. In Sect. 7 we reconstruct the inflaton potential and the Hubble parameter as a Taylor expansion of the inflaton in the observable range without relying on the slow-roll approximation. The reconstruction of the curvature perturbation power spectrum is presented in Sect. 8. The search for parameterized features is presented in Sect. 9, and combined constraints from the Planck 2015 power spectrum and primordial non-Gaussianity derived in Planck Collaboration XVII (2016) are presented in Sect. 10. The analysis of isocurvature perturbations combined and correlated with curvature perturbations is presented in Sect. 11. In Sect. 12 we study the implications of relaxing the assumption of statistical isotropy of the primordial fluctuations. We discuss two examples of anisotropic inflation in light of the tests of isotropy performed in Planck Collaboration XVI (2016). Section 14 presents some concluding remarks.

2. What new information does polarization provide?

This section provides a short theoretical overview of the extra information provided by polarization data over that of temperature alone. (More details can be found in White et al. 1994; Ma & Bertschinger 1995; Bucher 2015, and references therein.) In Sect. 2 of the Planck 2013 inflation paper (PCI13), we gave an overview of the relation between the inflationary potential and the three-dimensional primordial scalar and tensor power spectra, denoted as \hbox{${\cal P}_{\mathcal{R}}(k)$} and \hbox{${\cal P}_\mathrm{t}(k),$} respectively. (The scalar variable is defined precisely in Sect. 3.) We shall not repeat the discussion there, instead referring the reader to PCI13 and references therein.

Under the assumption of statistical isotropy, which is predicted in all simple models of inflation, the two-point correlations of the CMB anisotropies are described by the angular power spectra CTT,\hbox{$C^{\rm TT}_\ell,$}CTE,\hbox{$C^{\rm TE}_\ell,$}CEE,\hbox{$C^{\rm EE}_\ell,$} and CBB,\hbox{$C^{BB}_\ell,$} where is the multipole number. (See Kamionkowski et al. 1997; Zaldarriaga & Seljak 1997; Seljak & Zaldarriaga 1997; Hu & White 1997; Hu et al. 1998 and references therein for early discussions elucidating the role of polarization.) In principle, one could also envisage measuring CBT\hbox{$C^{BT}_\ell$} and CBE\hbox{$C^{BE}_\ell$}, but in theories where parity symmetry is not explicitly or spontaneously broken, the expectation values for these cross spectra (i.e., the theoretical cross spectra) vanish, although the observed realizations of the cross spectra are not exactly zero because of cosmic variance.

The CMB angular power spectra are related to the three-dimensional scalar and tensor power spectra via the transfer functions Δℓ,𝒜s(k)\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{s}(k)$} and Δℓ,𝒜t(k),\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{t}(k),$} so that the contributions from scalar and tensor perturbations are C𝒜ℬ,s=0dkkΔℓ,𝒜s(k)Δℓ,s(k)𝒫(k)\begin{equation} C^{\mathcal{AB},\mathrm{s}}_\ell =\int _0^\infty \frac{\mathrm{d} k}{k}~ \Delta _{\ell ,\mathcal{A}}^\mathrm{s}(k)~ \Delta _{\ell ,\mathcal{B}}^\mathrm{s}(k)~ {\cal P}_{\mathcal{R}}(k) \end{equation}(1)and C𝒜ℬ,t=0dkkΔℓ,𝒜t(k)Δℓ,t(k)𝒫t(k),\begin{equation} C^{\mathcal{AB},\mathrm{t}}_\ell =\int _0^\infty \frac{\mathrm{d}k}{k}~ \Delta _{\ell ,\mathcal{A}}^\mathrm{t}(k)~ \Delta _{\ell ,\mathcal{B}}^\mathrm{t}(k)~ {\cal P}_\mathrm{t}(k), \end{equation}(2)respectively, where \hbox{$\mathcal{A,B}=T,E,B.$} The scalar and tensor primordial perturbations are uncorrelated in the simplest models, so the scalar and tensor power spectra add in quadrature, meaning that C𝒜ℬ,tot=C𝒜ℬ,s+C𝒜ℬ,t.\begin{equation} C^{\mathcal{AB},\mathrm{tot}}_\ell = C^{\mathcal{AB},\mathrm{s}}_\ell +C^{\mathcal{AB},\mathrm{t}}_\ell. \end{equation}(3)Roughly speaking, the form of the linear transformations encapsulated in the transfer functions Δℓ,𝒜s(k)\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{s}(k)$} and Δℓ,𝒜t(k)\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{t}(k)$} probe the late time physics, whereas the primordial power spectra \hbox{${\cal P}_{\mathcal{R}}(k)$} and \hbox{${\cal P}_\mathrm{t}(k)$} are solely determined by the primordial Universe, perhaps not so far below the Planck scale if large-field inflation turns out to be correct.

thumbnail Fig. 1

Comparison of transfer functions for the scalar and tensor modes. The CMB transfer functions Δℓ,𝒜s(k)\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{s}(k)$} and Δℓ,𝒜t(k)\hbox{$\Delta _{\ell ,\mathcal{A}}^\mathrm{t}(k)$}, where \hbox{$\mathcal{A}=T,E,B$}, define the linear transformations mapping the primordial scalar and tensor cosmological perturbations to the CMB anisotropies as seen by us on the sky today. These functions are plotted for two representative values of the multipole number: = 2 (in black) and = 65 (in red).

To better understand this connection, it is useful to plot and compare the shapes of the transfer functions for representative values of and characterize their qualitative behavior. Referring to Fig. 1, we emphasize the following qualitative features:

  • 1.

    For the scalar mode transfer functions, of which only Δℓ,Ts(k)\hbox{$\Delta _{\ell,T}^\mathrm{s}(k)$} and Δℓ,Es(k)\hbox{$\Delta _{\ell,E}^\mathrm{s}(k)$} are non-vanishing (because to linear order, a three-dimensional scalar mode cannot contribute to the B mode of the polarization), both transfer functions start to rise at more or less the same small values of k (due to the centrifugal barrier in the Bessel differential equation), but Δℓ,Es(k)\hbox{$\Delta _{\ell,E}^\mathrm{s}(k)$} falls off much faster at large k and thus smooths sharp features in \hbox{$\mathcal{P}_{\mathcal{R}}(k)$} to a lesser extent than Δℓ,Ts(k).\hbox{$\Delta _{\ell,T}^\mathrm{s}(k).$} This means that polarization is more powerful than temperature for reconstructing possible sharp features in the scalar primordial power spectrum provided that the required signal-to-noise is available.

  • 2.

    For the tensor modes, Δℓ,Tt(k)\hbox{$\Delta _{\ell,T}^\mathrm{t}(k)$} starts rising at about the same small k as Δℓ,Ts(k)\hbox{$\Delta _{\ell,T}^\mathrm{s}(k)$} and Δℓ,Es(k)\hbox{$\Delta _{\ell,E}^\mathrm{s}(k)$} but falls off faster with increasing k than Δℓ,Ts(k).\hbox{$\Delta _{\ell,T}^\mathrm{s}(k).$} On the other hand, the polarization components, Δℓ,Et(k)\hbox{$\Delta _{\ell,E}^\mathrm{t}(k)$} and Δℓ,Bt(k),\hbox{$\Delta _{\ell,B}^\mathrm{t}(k),$} have a shape completely different from any of the other transfer functions. The shape of Δℓ,Et(k)\hbox{$\Delta _{\ell,E}^\mathrm{t}(k)$} and Δℓ,Bt(k)\hbox{$\Delta _{\ell,B}^\mathrm{t}(k)$} is much wider in ln(k) than the scalar polarization transfer function, with a variance ranging from 0.5 to 1.0 decades. These functions exhibit several oscillations with a period smaller than that for scalar transfer functions, due to the difference between the sound velocity for scalar fluctuations and the light velocity for gravitational waves (Polarski & Starobinsky 1996; Lesgourgues et al. 2000).

Regarding the scalar primordial cosmological perturbations, the power spectrum of the E-mode polarization provides an important consistency check. As we explore in Sects. 8 and 9, to some extent the fit of the temperature power spectrum can be improved by allowing a complicated form for the primordial power spectrum (relative to a simple power law), but the CTE\hbox{$C^{\rm TE}_\ell$} and CEE\hbox{$C^{\rm EE}_\ell$} power spectra provide independent information. Moreover, in multi-field inflationary models, in which isocurvature modes may have been excited (possibly correlated amongst themselves as well as with the adiabatic mode), polarization information provides a powerful way to break degeneracies (see, e.g., Bucher et al. 2001).

The inability of scalar modes to generate B-mode polarization (apart from the effects of lensing) has an important consequence. For the primordial tensor modes, polarization information, especially information concerning the B-mode polarization, offers powerful potential for discovery or for establishing upper bounds. Planck 2013 and WMAP established upper bounds on a possible tensor mode contribution using CTT\hbox{$C^{\rm TT}_\ell$} alone, but these bounds crucially relied on assuming a simple form for the scalar primordial power spectrum. For example, as reported in PCI13, when a simple power law was generalized to allow for running, the bound on the tensor contribution degraded by approximately a factor of two. The new joint BICEP2/Keck Array-Planck upper bound (see Sect. 13), however, is much more robust and cannot be avoided by postulating baroque models that alter the scale dependence of the scalar power spectrum.

3. Methodology

This section describes updates to the formalism used to describe cosmological models and the likelihoods used with respect to the Planck 2013 inflation paper (PCI13).

3.1. Cosmological model

The cosmological models that predict observables such as the CMB anisotropies rely on inputs specifying the conditions and physics at play during different epochs of the history of the Universe. The primordial inputs describe the power spectrum of the cosmological perturbations at a time when all the observable modes were situated outside the Hubble radius. The inputs from this epoch consist of the primordial power spectra, which may include scalar curvature perturbations, tensor perturbations, and possibly also isocurvature modes and their correlations. The late time (i.e., z ≲ 104) cosmological inputs include parameters such as ωb,ωc,ΩΛ, and τ, which determine the conditions when the primordial perturbations become imprinted on the CMB and also the evolution of the Universe between last scattering and today, affecting primarily the angular diameter distance. Finally, there is a so-called “nuisance” component, consisting of parameters that determine how the measured CMB spectra are contaminated by unsubtracted Galactic and extragalactic foreground contamination. The focus of this paper is on the primordial inputs and how they are constrained by the observed CMB anisotropy, but we cannot completely ignore the other non-primordial parameters because their presence and uncertainties must be dealt with in order to correctly extract the primordial information of interest here.

As in PCI13, we adopt the minimal six-parameter spatially flat base ΛCDM cosmological model as our baseline for the late time cosmology, mainly altering the primordial inputs, i.e., the simple power-law spectrum parameterized by the scalar amplitude and spectral index for the adiabatic growing mode, which in this minimal model is the only late time mode excited. This model has four free non-primordial cosmological parameters (ωb,ωc,θMC; for a more detailed account of this model, we refer the reader to Planck Collaboration XIII 2016). On occasion, this assumption will be relaxed in order to consider the impact of more complex alternative late time cosmologies on our conclusions about inflation. Some of the commonly used cosmological parameters are defined in Table 1.

Table 1

Primordial, baseline, and optional late-time cosmological parameters.

3.2. Primordial spectra of cosmological fluctuations

In inflationary models, comoving curvature () and tensor (h) fluctuations are amplified by the nearly exponential expansion from quantum vacuum fluctuations to become highly squeezed states resembling classical states. Formally, this quantum mechanical phenomenon is most simply described by the evolution in conformal time, η, of the mode functions for the gauge-invariant inflaton fluctuation, δφ, and for the tensor fluctuation, h: (ayk)′′+(k2x′′x)ayk=0,\begin{equation} (a y_k)'' + \left(k^2 - \frac{x''}{x} \right) a y_k = 0, \label{fluctuations:Evolution} \end{equation}(4)with \hbox{$(x,y)=(a \dot \phi/H , \delta \phi)$} for scalars and (x,y) = (a,h) for tensors. Here a is the scale factor, primes indicate derivatives with respect to η, and \hbox{$\dot \phi$} and H = ȧ/a are the proper time derivative of the inflaton and the Hubble parameter, respectively. The curvature fluctuation, , and the inflaton fluctuation, δφ, are related via \hbox{${\cal R} = H \delta \phi/\dot \phi$}. Analytic and numerical calculations of the predictions for the primordial spectra of cosmological fluctuations generated during inflation have reached high standards of precision, which are more than adequate for our purposes, and the largest uncertainty in testing specific inflationary models arises from our lack of knowledge of the history of the Universe between the end of inflation and the present time, during the so-called “epoch of entropy generation”.

This paper uses three different methods to compare inflationary predictions with Planck data. The first method consists of a phenomenological parameterization of the primordial spectra of scalar and tensor perturbations according to: 𝒫(k)=k32π2|k|2=𝒫t(k)=\begin{eqnarray} \mathcal{P}_{\cal R}(k) &=& \frac{k^3}{2 \pi^2} |{\cal R}_k|^2 \nonumber \\ &=& A_\mathrm{s} \left(\frac{k}{k_*}\right)^{n_\mathrm{s}-1 + \frac{1}{2} \, \mathrm{d}n_\mathrm{s}/\mathrm{d}\!\ln k \ln(k/k_*) + \frac{1}{6} \, \frac{\mathrm{d}^2n_\mathrm{s}}{\mathrm{d}\!\ln k^2} \left(\ln(k/k_*) \right)^2 + ...}, \label{scalarps}\\ \mathcal{P}_\mathrm{t}(k) &= &\frac{k^3}{2 \pi^2} \left(|h^+_k|^2 + |h^\times_k|^2 \right) = A_\mathrm{t} \left(\frac{k}{k_*}\right)^{n_\mathrm{t} + \frac{1}{2} \, \mathrm{d}n_\mathrm{t}/\mathrm{d}\!\ln k \ln(k/k_*) + ... } , \label{tensorps} \end{eqnarray}where As (At) is the scalar (tensor) amplitude and ns (nt), dns/ dlnk (dnt/ dlnk), and d2ns/ dlnk2 are the scalar (tensor) spectral index, the running of the scalar (tensor) spectral index, and the running of the running of the scalar spectral index, respectively. h+ , × denotes the amplitude of the two polarization states (+ , ×) of gravitational waves and k is the pivot scale. Unless otherwise stated, the tensor-to-scalar ratio, r=𝒫t(k)𝒫(k),\begin{equation} r = \frac{\mathcal{P}_{\mathrm t}(k_*)}{\mathcal{P}_{\cal R}(k_*)}, \label{tensortoscalar} \end{equation}(7)is fixed to −8nt, which is the relation that holds when inflation is driven by a single slow-rolling scalar field with a standard kinetic term2. We will use a parameterization analogous to Eq. (5) with no running for the power spectra of isocurvature modes and their correlations in Sect. 11.

Table 2

Conventions and definitions for inflation physics.

The second method exploits the analytic dependence of the slow-roll power spectra of primordial perturbations in Eqs. (5) and (6) on the values of the Hubble parameter and the hierarchy of its time derivatives, known as the Hubble flow functions (HFF): ϵ1 = −/H2, \hbox{$\epsilon_{i+1} \equiv \dot \epsilon_i/(H \epsilon_i)$}, with i ≥ 1. We will use the analytic power spectra calculated up to second order using the Green’s function method (Gong & Stewart 2001; Leach et al. 2002; see Habib et al. 2002; Martin & Schwarz 2003; and Casadio et al. 2006 for alternative derivations). The spectral indices and the relative scale dependence in Eqs. (5) and (6) are given in terms of the HFFs by: ns1=2ϵ1ϵ22ϵ12(2C+3)ϵ1ϵ2Cϵ2ϵ3,dns/dlnk=2ϵ1ϵ2ϵ2ϵ3,nt=2ϵ12ϵ122(C+1)ϵ1ϵ2,dnt/dlnk=2ϵ1ϵ2,\begin{eqnarray} \label{bs} &&n_\mathrm{s} - 1 = - 2 \epsilon_1 - \epsilon_2 - 2 \epsilon_1^2 -\left(2\,C+3\right)\,\epsilon_1\,\epsilon_2 - C \epsilon_2 \epsilon_3, \\ \label{eqn:bs1} &&\mathrm{d} n_\mathrm{s}/\mathrm{d}\! \ln k = - 2 \epsilon_1 \epsilon_2 - \epsilon_2 \epsilon_3, \\ \label{eqn:bs2} &&n_\mathrm{t} = - 2\epsilon_1 - 2\epsilon_1^2 -2\,\left(C+1\right)\,\epsilon_1\,\epsilon_2 , \\ \label{eqn:bt1} &&\mathrm{d} n_\mathrm{t}/\mathrm{d}\! \ln k = - 2\epsilon_1\epsilon_2 , \end{eqnarray}where C ≡ ln2 + γE−2 ≈ −0.7296 (γE is the Euler-Mascheroni constant). See the Appendix of PCI13 for more details. Primordial spectra as functions of the ϵi will be employed in Sect. 6, and the expressions generalizing Eqs. (8) to (11) for a general Lagrangian p(φ,X), where X ≡ −gμνμφνφ/ 2, will be used in Sect. 10. The good agreement between the first and second method as well as with alternative approximations of slow-roll spectra is illustrated in the Appendix of PCI13.

The third method is fully numerical, suitable for models where the slow-roll conditions are not well satisfied and analytical approximations for the primordial fluctuations are not available. Two different numerical codes, the inflation module of Lesgourgues & Valkenburg (2007) as implemented in CLASS (Lesgourgues 2011; Blas et al. 2011) and ModeCode (Adams et al. 2001; Peiris et al. 2003; Mortonson et al. 2009; Easther & Peiris 2012), are used in Sects. 7 and 10, respectively.3

Conventions for the functions and symbols used to describe inflationary physics are defined in Table 2.

3.3. Planck data

The Planck data processing proceeding from time-ordered data to maps has been improved for this 2015 release in various aspects (Planck Collaboration II 2016; Planck Collaboration VII 2016). We refer the interested reader to Planck Collaboration II (2016) and Planck Collaboration VII (2016) for details, and we describe here two of these improvements. The absolute calibration has been improved using the orbital dipole and more accurate characterization of the Planck beams. The calibration discrepancy between Planck and WMAP described in Planck Collaboration XXXI (2014) for the 2013 release has now been greatly reduced. At the time of that release, a blind analysis for primordial power spectrum reconstruction described a broad feature at ≈ 1800 in the temperature power spectrum, which was most prominent in the 217 × 217 GHz auto-spectra (PCI13). In work done after the Planck 2013 data release, this feature was shown to be associated with imperfectly subtracted systematic effects associated with the 4 K cooler lines, which were particularly strong in the first survey. This systematic effect was shown to potentially lead to 0.5σ shifts in the cosmological parameters, slightly increasing ns and H0, similarly to the case in which the 217 × 217 channel was excised from the likelihood (Planck Collaboration XV 2014; Planck Collaboration XVI 2014). The Planck likelihood (Planck Collaboration XI 2016) is based on the full mission data and comprises temperature and polarization data (see Fig. 2).

thumbnail Fig. 2

PlanckTT (top), high-TE (centre), and high-EE (bottom) angular power spectra. Here \hbox{${\cal D}_\ell \equiv \ell(\ell + 1)C_\ell/(2\pi)$}.

Planck low- likelihood

The Planck low- temperature-polarization likelihood uses foreground-cleaned LFI 70 GHz polarization maps together with the temperature map obtained from the Planck 30 to 353 GHz channels by the Commander component separation algorithm over 94% of the sky (see Planck Collaboration IX 2016 for further details). The Planck polarization map uses the LFI 70 GHz (excluding Surveys 2 and 4) low-resolution maps of Q and U polarization from which polarized synchrotron and thermal dust emission components have been removed using the LFI 30 GHz and HFI 353 GHz maps as templates, respectively. (See Planck Collaboration XI 2016 for more details.) The polarization map covers the 46% of the sky outside the lowP polarization mask.

The low- likelihood is pixel-based and treats the temperature and polarization at the same resolution of \hbox{$3\pdeg6$}, or HEALpix (Górski et al. 2005) Nside = 16. Its multipole range extends from = 2 to = 29 in TT, TE, EE, and BB. In the 2015 Planck papers the polarization part of this likelihood is denoted as “lowP”.4 This Planck low- likelihood replaces the Planck temperature low- Gibbs module combined with the WMAP 9-yr low- polarization module used in the Planck 2013 cosmology papers (denoted by WP), which used lower resolution polarization maps at Nside = 8 (about \hbox{$7\pdeg3$}). With this Planck-only low- likelihood module, the basic Planck results presented in this release are completely independent of external information.

The Planck low-multipole likelihood alone implies τ = 0.067 ± 0.022 (Planck Collaboration XI 2016), a value smaller than the value inferred using the WP polarization likelihood, τ = 0.089 ± 0.013, used in the Planck 2013 papers (Planck Collaboration XV 2014). See Planck Collaboration XIII (2016) for the important implications of this decrease in τ for reionization. However, the LFI 70 GHz and WMAP polarization maps are in very good agreement when both are foreground-cleaned using the HFI 353 GHz map as a polarized dust template (see Planck Collaboration XI 2016 for further details). Therefore, it is useful to construct a noise-weighted combination to obtain a joint Planck/WMAP low resolution polarization data set, also described in Planck Collaboration XI (2016), using as a polarization mask the union of the WMAP P06 and Planck lowP polarization masks and keeping 74% of the sky. The polarization part of the combined low multipole likelihood is called lowP+WP. This combined low multipole likelihood gives τ=0.071-0.013+0.011\hbox{$\tau = 0.071^{+0.011}_{-0.013}$} (Planck Collaboration XI 2016).

Planck high- likelihood

Following Planck Collaboration XV (2014), and Planck Collaboration XI 2016 for polarization, we use a Gaussian approximation for the high- part of the likelihood (30 << 2500), so that log(|C(θ))=12(C(θ))T-1(C(θ)),\begin{equation} -\mathrm{log}{\cal L}\Bigl(\hat{C} | C(\boldsymbol{\theta })\Bigr) = \frac{1}{2} \Bigl(\hat{C} - C(\boldsymbol{\theta })\Bigr) ^T {\mathcal{M}}^{-1} \Bigl(\hat{C} - C(\boldsymbol{\theta })\Bigr), \label{eq:basic-likelihood} \end{equation}(12)where a constant offset has been discarded. Here Ĉ is the data vector, C(θ) is the model prediction for the parameter value vector θ, and is the covariance matrix. For the data vector, we use 100 GHz, 143 GHz, and 217 GHz half-mission cross-power spectra, avoiding the Galactic plane as well as the brightest point sources and the regions where the CO emission is the strongest. We retain 66% of the sky for 100 GHz, 57% for 143 GHz, and 47% for 217 GHz for the T masks, and respectively 70%, 50%, and 41% for the Q, U masks. Following Planck Collaboration XXX (2014), we do not mask for any other Galactic polarized emission. All the spectra are corrected for the beam and pixel window functions using the same beam for temperature and polarization. (For details see Planck Collaboration XI 2016.)

The model for the cross-spectra can be written as Cμ,ν(θ)=Ccmb(θ)+Cμ,νfg(θ)cμcν,\begin{equation} C_{\mu,\nu}(\theta) = \frac{ C^{\mathrm{cmb}}(\theta) + C^{\mathrm{fg}}_{\mu,\nu}(\theta) }{\sqrt{{c}_\mu {c}_\nu}}, \end{equation}(13)where Ccmb(θ) is the CMB power spectrum, which is independent of the frequency, Cμ,νfg(θ)\hbox{$C^{\mathrm fg}_{\mu,\nu}(\theta)$} is the foreground model contribution for the cross-frequency spectrum μ × ν, and cμ is the calibration factor for the μ × μ spectrum. The model for the foreground residuals includes the following components: Galactic dust, clustered cosmic infrared background (CIB), thermal and kinetic Sunyaev-Zeldovich (tSZ and kSZ) effect, tSZ correlations with CIB, and point sources, for the TT foreground modeling; and for polarization, only dust is included. All the components are modelled by smooth C templates with free amplitudes, which are determined along with the cosmological parameters as the likelihood is explored. The tSZ and kSZ models are the same as in 2013 (see Planck Collaboration XV 2014), although with different priors (Planck Collaboration XI 2016; Planck Collaboration XIII 2016), while the CIB and tSZ-CIB correlation models use the updated CIB models described in Planck Collaboration XXX (2014). The point source contamination is modelled as Poisson noise with an independent amplitude for each frequency pair. Finally, the dust contribution uses an effective smooth model measured from high frequency maps. Details of our dust and noise modelling can be found in Planck Collaboration XI (2016). The dust is the dominant foreground component for TT at < 500, while the point source component, and for 217 × 217 also the CIB component, dominate at high . The other foreground components are poorly determined by Planck. Finally, our treatment of the calibration factors and beam uncertainties and mismatch are described in Planck Collaboration XI (2016).

The covariance matrix accounts for the correlation due to the mask and is computed following the equations in Planck Collaboration XV (2014), extended to polarization in Planck Collaboration XI (2016) and references therein. The fiducial model used to compute the covariance is based on a joint fit of base ΛCDM and nuisance parameters obtained with a previous version of the matrix. We iterate the process until the parameters stop changing. For more details, see Planck Collaboration XI (2016).

The joint unbinned covariance matrix is approximately of size 23 000×23 000. The memory and speed requirements for dealing with such a huge matrix are significant, so to reduce its size, we bin the data and the covariance matrix to compress the data vector size by a factor of 10. The binning uses varying bin width with Δ = 5 for 29 << 100, Δ = 9 for 99 << 1504, Δ = 17 for 1503 << 2014, and Δ = 33 for 2013 << 2509, and a weighting in ( + 1) to flatten the spectrum. Where a higher resolution is desirable, we also use a more finely binned version (“bin3”, unbinned up to = 80 and Δ = 3 beyond that) as well as a completely unbinned version (“bin1”). We use odd bin sizes, since for an azimuthally symmetric mask, the correlation between a multipole and its neighbours is symmetric, oscillating between positive and negative values. Using the base ΛCDM model and single-parameter classical extensions, we confirmed that the cosmological and nuisance parameter fits with or without binning are indistinguishable.

As discussed in Planck Collaboration XI (2016) and Planck Collaboration XIII (2016), the TE and EE high- data are not free of small systematic effects, such as leakage from temperature to polarization. Although the propagated effects of these residual systematics on cosmological parameters are small and do not alter the conclusions of this paper, we mainly refer to Planck TT+lowP in combination with the Planck lensing or additional data sets as the most reliable results for this release.

Planck CMB bispectrum

We use measurements of the non-Gaussianity amplitude fNL from the CMB bispectrum presented in Planck Collaboration XVII (2016). Non-Gaussianity constraints have been obtained using three optimal bispectrum estimators: separable template fitting (also known as “KSW”), binned, and modal. The maps analysed are the Planck 2015 full mission sky maps, both in temperature and in E polarization, as cleaned with the four component separation methods SMICA, SEVEM, NILC, and Commander. The map is masked to remove the brightest parts of the Galaxy as well as the brightest point sources and covers approximately 70% of the sky. In this paper we mainly exploit the joint constraints on equilateral and orthogonal non-Gaussianity (after removing the integrated Sachs-Wolfe effect-lensing bias), fNLequil=16±70\hbox{$f_\mathrm{NL}^\mathrm{equil} = -16 \pm 70$}, fNLortho=34±33\hbox{$f_\mathrm{NL}^\mathrm{ortho} = -34 \pm 33$} from T only, and fNLequil=3.7±43\hbox{$f_\mathrm{NL}^\mathrm{equil} = -3.7 \pm 43$}, fNLortho=26±21\hbox{$f_\mathrm{NL}^\mathrm{ortho} = -26 \pm 21$} from T and E (68% CL). For reference, the constraints on local non-Gaussianity are fNLlocal=2.5±5.7\hbox{$f_\mathrm{NL}^\mathrm{local}=2.5 \pm 5.7$} from T only, and fNLlocal=0.8±5.0\hbox{$f_\mathrm{NL}^\mathrm{local}=0.8 \pm 5.0$} from T and E (68% CL). Starting from a Gaussian fNL-likelihood, which is an accurate assumption in the regime of small primordial non-Gaussianity, we use these constraints to derive limits on the sound speed of the inflaton fluctuations (or other microscopic parameters of inflationary models; Planck Collaboration XXIV 2014). The bounds on the sound speed for various models are then used in combination with Planck power spectrum data.

Planck CMB lensing data

Some of our analysis includes the Planck 2015 lensing likelihood, presented in Planck Collaboration XV (2016), which utilizes the non-Gaussian trispectrum induced by lensing to estimate the power spectrum of the lensing potential, Cφφ\hbox{$C_\ell ^{\phi\phi}$}. This signal is extracted using a full set of temperature- and polarization-based quadratic lensing estimators (Okamoto & Hu 2003) applied to the SMICA CMB map over approximately 70% of the sky, as described in Planck Collaboration IX (2016). We have used the conservative bandpower likelihood, covering multipoles 40 ≤ ≤ 400. This provides a measurement of the lensing potential power at the 40σ level, giving a 2.5%-accurate constraint on the overall lensing power in this multipole range. The measurement of the lensing power spectrum used here is approximately twice as powerful as the measurement used in our previous 2013 analysis (Planck Collaboration XXII 2014; Planck Collaboration XVII 2014), which used temperature-only data from the Planck nominal mission data set.

3.4. Non-Planck data

BAO data

Baryon acoustic oscillations (BAO) are the counterpart in the late time matter power spectrum of the acoustic oscillations seen in the CMB multipole spectrum (Eisenstein et al. 2005). Both originate from coherent oscillations of the photon-baryon plasma before these two components become decoupled at recombination. Measuring the position of these oscillations in the matter power spectra at different redshifts constrains the expansion history of the universe after decoupling, thus removing degeneracies in the interpretation of the CMB anisotropies.

In this paper, we combine constraints on \hbox{$D_V(\bar{z})/r_\mathrm{s}$} (the ratio between the spherically-averaged distance scale DV to the effective survey redshift, \hbox{$\bar{z}$}, and the sound horizon, rs) inferred from 6dFGRS data (Beutler et al. 2011) at \hbox{$\bar{z} = 0.106$}, the SDSS-MGS data (Ross et al. 2015) at \hbox{$\bar{z} = 0.15$}, and the SDSS-DR11 CMASS and LOWZ data (Anderson et al. 2014) at redshifts \hbox{$\bar{z} = 0.57$} and 0.32. For details see Planck Collaboration XIII (2016).

Joint BICEP2/Keck Array and Planck constraint on r

Since the Planck temperature constraints on the tensor-to-scalar ratio are close to the cosmic variance limit, the inclusion of data sets sensitive to the expected B-mode signal of primordial gravitational waves is particularly useful. In this paper, we provide results including the joint analysis cross-correlating BICEP2/Keck Array observations and Planck (BKP). Combining the more sensitive BICEP2/Keck Array B-mode polarization maps in the approximately 400 deg2 BICEP2 field with the Planck maps at higher frequencies where dust dominates allows a statistical analysis taking into account foreground contamination. Using BB auto- and cross-frequency spectra between BICEP2/Keck Array (150 GHz) and Planck (217 and 353 GHz), BKP find a 95% upper limit of r0.05< 0.12.

3.5. Parameter estimation and model comparison

Much of this paper uses a Bayesian approach to parameter estimation, and unless otherwise specified, we assign broad top-hat prior probability distributions to the cosmological parameters listed in Table 1. We generate posterior probability distributions for the parameters using either the Metropolis-Hastings algorithm implemented in CosmoMC (Lewis & Bridle 2002) or MontePython (Audren et al. 2013), the nested sampling algorithm MultiNest (Feroz & Hobson 2008; Feroz et al. 2009, 2013), or PolyChord, which combines nested sampling with slice sampling (Handley et al. 2015). The latter two also compute the Bayesian evidence needed for model comparison. Nevertheless, χ2 values are often provided as well (using CosmoMC’s implementation of the BOBYQA algorithm (Powell 2009) for maximizing the likelihood), and other parts of the paper employ frequentist methods when appropriate.

thumbnail Fig. 3

Comparison of the marginalized joint 68% and 95% CL constraints on (ns) (left panel), (nsbh2) (middle panel), and (ns,θMC) (right panel), for Planck 2013 (grey contours), Planck TT+lowP (red contours), Planck TT+lowP+lensing (green contours), and Planck TT, TE, EE+lowP (blue contours).

Table 3

Confidence limits on the parameters of the base ΛCDM model, for various combinations of Planck 2015 data, at the 68% confidence level.

4. Constraints on the primordial spectrum of curvature perturbations

One of the most important results of the Planck nominal mission was the determination of the departure from scale invariance for the spectrum of scalar perturbations at high statistical significance (Planck Collaboration XVI 2014; Planck Collaboration XXII 2014). We now update these measurements with the Planck full mission data in temperature and polarization.

4.1. Tilt of the curvature power spectrum

For the base ΛCDM model with a power-law power spectrum of curvature perturbations, the constraint on the scalar spectral index, ns, with the Planck full mission temperature data is ns=0.9655±0.0062(68%CL,PlanckTT+lowP).\begin{eqnarray} \label{eq:TT_ns} n_\mathrm{s} = 0.9655 \pm 0.0062 \, \, (68\%\ \text{CL, \Planck\ TT+lowP}) . \end{eqnarray}(14)This result is compatible with the Planck 2013 constraint, ns = 0.9603 ± 0.0073 (Planck Collaboration XV 2014; Planck Collaboration XVI 2014). See Fig. 3 for the accompanying changes in τ, Ωbh2, and θMC. The shift towards higher values for ns with respect to the nominal mission results is due to several improvements in the data processing and likelihood which are discussed in Sect. 3, including the removal of the 4 K cooler systematics. For the values of other cosmological parameters in the base ΛCDM model, see Table 3. We also provide the results for the base ΛCDM model and extended models online.5

When the Planck high- polarization is combined with temperature, we obtain ns=0.9645±0.0049(68%CL,PlanckTT,TE,EE+lowP),\begin{eqnarray} n_\mathrm{s} = 0.9645 \pm 0.0049 \, \, (68\%\ \text{CL, \Planck~TT, TE, EE+lowP}), \end{eqnarray}(15)together with τ = 0.079 ± 0.017 (68% CL), which is consistent with the TT+lowP results. The Planck high- polarization pulls τ up to a slightly higher value. When the Planck lensing measurement is added to the temperature data, we obtain ns=0.9677±0.0060(68%CL,PlanckTT+lowP+lensing),\begin{eqnarray} n_\mathrm{s} = 0.9677 \pm 0.0060 \, \, (68\%\ \text{CL, \Planck\ TT+lowP+lensing}), \end{eqnarray}(16)with τ = 0.066 ± 0.016 (68% CL). The shift towards slightly smaller values of the optical depth is driven by a marginal preference for a smaller primordial amplitude, As, in the Planck lensing data (Planck Collaboration XV 2016). Given that the temperature data provide a sharp constraint on the combination e− 2τAs, a slightly lower As requires a smaller optical depth to reionization.

4.2. Viability of the Harrison-Zeldovich spectrum

Even though the estimated scalar spectral index has risen slightly with respect to the Planck 2013 release, the assumption of a Harrison-Zeldovich (HZ) scale-invariant spectrum (Harrison 1970; Peebles & Yu 1970; Zeldovich 1972) continues to be disfavoured (with a modest increase in significance, from 5.1σ in 2013 to 5.6σ today), because the error bar on ns has decreased. The value of ns inferred from the Planck 2015 temperature plus large-scale polarization data lies 5.6 standard deviations away from unity (with a corresponding Δχ2 = 29.9), if one assumes the base ΛCDM late-time cosmological model. If we consider more general reionization models, parameterized by a principal component analysis (Mortonson & Hu 2008) instead of τ (where reionization is assumed to have occurred instantaneously), we find Δχ2 = 14.9 for ns = 1. Previously, simple one-parameter extensions of the base model, such as ΛCDM+Neff (where Neff is the effective number of neutrino flavours) or ΛCDM+YP (where YP is the primordial value of the helium mass fraction), could nearly reconcile the Planck temperature data with ns = 1. They now lead to Δχ2 = 7.6 and 9.3, respectively. For any of the cosmological models that we have considered, the Δχ2 by which the HZ model is penalized with respect to the tilted model has increased since the 2013 analysis (PCI13) thanks to the constraining power of the full mission temperature data. Adding Planck high- polarization data further disfavours the HZ model: in ΛCDM, the χ2 increases by 57.8, for general reionization we obtain Δχ2 = 41.3, and for ΛCDM+Neff and ΛCDM+YP we find Δχ2 = 22.5 and 24.0, respectively.

4.3. Running of the spectral index

The running of the scalar spectral index is constrained by the Planck 2015 full mission temperature data to dnsdlnk=0.0084±0.0082(68%CL,PlanckTT+lowP).\begin{eqnarray} \frac{{\rm d} n_\mathrm{s}}{{\rm d}\!\ln k} = -0.0084 \pm 0.0082 \, \, (68\%\ \text{CL, \Planck\ TT+lowP}) . \end{eqnarray}(17)The combined constraint including high- polarization is dnsdlnk=0.0057±0.0071(68%CL,PlanckTT,TE,EE+lowP).\begin{eqnarray} \frac{{\rm d} n_\mathrm{s}}{{\rm d}\!\ln k} = -0.0057 \pm 0.0071 \, \, (68\%\ \text{CL, \Planck~TT, TE, EE+lowP}) . \end{eqnarray}(18)Adding the Planck CMB lensing data to the temperature data further reduces the central value for the running, i.e., dns/ dlnk = −0.0033 ± 0.0074 (68% CL, Planck TT+lowP+lensing).

The central value for the running has decreased in magnitude with respect to the Planck 2013 nominal mission (Planck Collaboration XVI 2014 found dns/ dlnk = −0.013 ± 0.009; see Fig. 4), and the improvement of the maximum likelihood with respect to a power-law spectrum is smaller, Δχ2 ≈ −0.8. Among the different effects contributing to the decrease in the central value of the running with respect to the Planck 2013 result, we mention a change in HFI beams at ≲ 200 (Planck Collaboration XIII 2016). Nevertheless, the deficit of power at low multipoles in the Planck 2015 temperature power spectrum contributes to a preference for slightly negative values of the running, but with low statistical significance.

thumbnail Fig. 4

Marginalized joint 68% and 95% CL for (ns,dns/ dlnk) using Planck TT+lowP and Planck TT, TE, EE+lowP. Constraints from the Planck 2013 data release are also shown for comparison. For comparison, the thin black stripe shows the prediction for single-field monomial chaotic inflationary models with 50 <N< 60.

Table 4

Constraints on the primordial perturbation parameters for ΛCDM+r and ΛCDM+r+dns/ dlnk models from Planck.

The Planck constraints on ns and dns/ dlnk are remarkably stable against the addition of the BAO likelihood. The combination with BAO shifts ns to slighly higher values and shrinks its uncertainty by about 30% when only high- temperature is considered, and by only about 15% when high- temperature and polarization are combined. In slow-roll inflation, the running of the scalar spectral index is connected to the third derivative of the potential (Kosowsky & Turner 1995). As was the case for the nominal mission results, values of the running compatible with the Planck 2015 constraints can be obtained in viable inflationary models (Kobayashi & Takahashi 2011).

When the running of the running is allowed to float, the Planck TT+lowP (Planck TT, TE, EE+lowP) data give: ns=0.9569±0.0077(0.9586±0.0056),dns/dlnk=0.011-0.013+0.014(0.009±0.010),(68%CL)d2ns/dlnk2=0.029-0.016+0.015(0.025±0.013),\begin{eqnarray} && n_\mathrm{s} = 0.9569\pm0.0077~~(0.9586 \pm 0.0056), \nonumber \\[2mm] && {\rm d} n_\mathrm{s}/{\rm d}\!\ln k = 0.011^{+0.014}_{-0.013}~~(0.009 \pm 0.010), ~~~~~~(68\%~\mathrm{CL})\\[2mm] & & {\rm d}^2 n_\mathrm{s}/{\rm d}\!\ln k^2 = 0.029^{+0.015}_{-0.016}~~(0.025 \pm 0.013), \nonumber \end{eqnarray}(19)at the pivot scale k = 0.05 Mpc-1. Allowing for running of the running provides a better fit to the temperature spectrum at low multipoles, such that Δχ2 ≈ −4.8 (−4.9) for TT+lowP (TT, TE, EE+lowP), but is not statistically preferred over the simplest ΛCDM model.

Note that the inclusion of small-scale data such as Lyα might further constrain the running of the spectral index and its derivative. The recent analysis of the BOSS one-dimensional Lyα flux power spectrum presented in Palanque-Delabrouille et al. (2015) and Rossi et al. (2015) was optimized for measuring the neutrino mass. It does not include constraints on the spectral index running, which would require new dedicated N-body simulations. Hence we do not include Lyα constraints here.

In Sect. 7 on inflaton potential reconstruction we will show that the data cannot accomodate a significant running but are compatible with a larger running of the running.

4.4. Suppression of power on the largest scales

Although not statistically significant, the trend for a negative running or positive running of the running observed in the last subsection was driven by the lack of power in the Planck temperature power spectrum at low multipoles, already mentioned in the Planck 2013 release. This deficit could potentially be explained by a primordial spectrum featuring a depletion of power only at large wavelengths. Here we investigate two examples of such models.

We first update the analysis (already presented in PCI13) of a power-law spectrum multiplied by an exponential cutoff: 𝒫(k)=𝒫0(k)1exp[(kkc)λc].\begin{equation} \mathcal{P_R}(k) = \mathcal{P}_{0}(k) \left\{ 1 - \exp \left[- \left(\frac{k}{k_\mathrm{c}} \right)^{\lambda_\mathrm{c}} \right] \right\}. \label{eq:cutoff} \end{equation}(20)This simple parameterization is motivated by models with a short inflationary stage in which the onset of the slow-roll phase coincides with the time when the largest observable scales exited the Hubble radius during inflation. The curvature spectrum is then strongly suppressed on those scales. We apply top-hat priors on the parameter λc, controlling the steepness of the cutoff, and on the logarithm of the cutoff scale, kc. We choose prior ranges λc ∈ [ 0,10 ] and ln(kc/ Mpc-1) ∈ [ −12,−3 ]. For Planck TT+lowP (Planck TT, TE, EE+lowP), the best-fit model has λc = 0.50 (0.53), ln(kc/ Mpc-1) = −7.98 (−7.98), ns = 0.9647 (0.9649), and improves the effective χ2 by a modest amount, Δχ2 ≈ −3.4 (−3.4).

As a second model, we consider a broken power-law spectrum for curvature perturbations: 𝒫(k)={\begin{eqnarray} \mathcal{P_R}(k) = \begin{cases} ~~A_\mathrm{low} \left(\frac{k}{k_*} \right)^{n_\mathrm{s}-1 + \delta} & \text{if } k \le k_\mathrm{b}, \\ ~~A_\mathrm{s} \left(\frac{k}{k_*} \right)^{n_\mathrm{s}-1} & \text{if } k \ge k_\mathrm{b}, \end{cases} \end{eqnarray}(21)with Alow = As(kb/k)δ to ensure continuity at k = kb. Hence this model, like the previous one, has two parameters, and also suppresses power at large wavelengths when δ> 0. We assume top-hat priors δ ∈ [ 0,2 ] and ln(kb/ Mpc-1) ∈ [ −12,−3 ], and standard uniform priors for ln(1010As) and ns. The best fit to Planck TT+lowP (Planck TT, TE, EE+lowP) is found for ns = 0.9658 (0.9647), δ = 1.14 (1.14), and ln(kb/ Mpc-1) = −7.55 (−7.57), with a very small χ2 improvement of Δχ2 ≈ −1.9 (−1.6).

We conclude that neither of these two models with two extra parameters is preferred over the base ΛCDM model. (See also the discussion of a step inflationary potential in Sect. 9.1.1.)

5. Constraints on tensor modes

In this section, we focus on the Planck 2015 constraints on tensor perturbations. Unless otherwise stated, we consider that the tensor spectral index satisfies the standard inflationary consistency condition to lowest order in slow roll, nt = −r/ 8. We recall that r is defined at the pivot scale k = 0.05 Mpc-1. However, for comparison with other studies, we also report our bounds in terms of the tensor-to-scalar ratio r0.002 at k = 0.002 Mpc-1.

5.1. Planck 2015 upper bound on r

The constraints on the tensor-to-scalar ratio inferred from the Planck full mission data for the ΛCDM+r model are: r0.002<0.10(95%CL,PlanckTT+lowP),r0.002<0.11(95%CL,PlanckTT+lowP+lensing),r0.002<0.11(95%CL,PlanckTT+lowP+BAO),r0.002<0.10(95%CL,PlanckTT,TE,EE+lowP).\begin{eqnarray} \label{erre_t} r_{0.002} & < & 0.10 \quad \text{(95\% CL, \Planck\ TT+lowP)} , \\ \label{erre_tpluslensing} r_{0.002} & < & 0.11 \quad \text{(95\% CL, \Planck\ TT+lowP+lensing)} ,\\ r_{0.002} & < & 0.11 \quad \text{(95\% CL, \Planck\ TT+lowP+BAO)} , \\ \label{erre_tplusp} r_{0.002} & < & 0.10 \quad \text{(95\% CL, \Planck~TT, TE, EE+lowP)} . \end{eqnarray}Table 4 also shows the bounds on ns in each of these cases.

These results slightly improve over the constraint r0.002< 0.12 (95% CL) derived from the Planck 2013 temperature data in combination with WMAP large-scale polarization data (Planck Collaboration XVI 2014; Planck Collaboration XXII 2014). The constraint obtained by Planck temperature and polarization on large scales is tighter than the PlanckB-mode 95% CL upper limit from the 100 and 143 GHz HFI channels, r< 0.27 (Planck Collaboration XI 2016). The constraints on r reported in Table 4 can be translated into upper bounds on the energy scale of inflation at the time when the pivot scale exits the Hubble radius using V=3π2As2rMpl4=(1.88×1016GeV)4r0.10·\begin{equation} V_* = \frac{3 \pi^2 A_{\mathrm{s}}}{2} \, r \, M_{\mathrm {pl}}^4 = (1.88 \times 10^{16}~{\mathrm{GeV}} )^4 \frac{r}{0.10}\cdot \end{equation}(26)This gives an upper bound on the Hubble parameter during inflation of H/Mpl< 3.6 × 10-5 (95% CL) for Planck TT+lowP.

These bounds are relaxed when allowing for a scale dependence of the scalar and tensor spectral indices. In that case, we assume that the tensor spectral index and its running are fixed by the standard inflationary consistency condition at second order in slow roll. We obtain r0.002<0.18(95%CL,PlanckTT+lowP),dnsdlnk=0.013-0.009+0.010(68%CL,PlanckTT+lowP),\begin{eqnarray} \label{erre_run_t} r_{0.002} & < \,0.18 & \text{(95\% CL, \Planck\ TT+lowP),} \\ \label{alpha_run_t} \frac{{\rm d} n_\mathrm{s}}{{\rm d}\!\ln k} & =& \,-0.013^{+0.010}_{-0.009} \text{(68\% CL, \Planck\ TT+lowP),} \end{eqnarray}with ns = 0.9667 ± 0.0066 (68% CL). At the standard pivot scale, k = 0.05 Mpc-1, the bound is stronger (r< 0.17 at 95% CL), because k is closer to the scale at which ns and r decorrelate. The constraint on r0.002 in Eq. (27) is 21% tighter than the corresponding Planck 2013 constraint. The mean value of the running in Eq. (28) is higher (lower in absolute value) than with Planck 2013 by 45%. Figures 5 and 6 clearly illustrate this significant improvement with respect to the previous Planck data release. Table 4 shows how bounds on (r,ns, dns/ dlnk) are affected by the lensing reconstruction, BAO, or high- polarization data. The tightest bounds are obtained in combination with polarization: r0.002<0.15(95%CL,PlanckTT,TE,EE+lowP),dnsdlnk=0.009±0.008(68%CL,PlanckTT,TE,EE+lowP),\begin{eqnarray} r_{0.002} &<& 0.15 \nonumber\\ \label{erre_run_tplusp} && \qquad \qquad \text{(95\% CL, \Planck~TT, TE, EE+lowP),} \\ \frac{{\rm d} n_\mathrm{s}}{{\rm d}\!\ln k} &=& -0.009 \pm 0.008 \nonumber\\ \label{alpha_run_tplusp} && \qquad \qquad \text{(68\% CL, \Planck~TT, TE, EE+lowP),} \end{eqnarray}with ns = 0.9644 ± 0.0049 (68% CL).

thumbnail Fig. 5

Marginalized joint confidence contours for (ns,dns/ dlnk), at the 68% and 95% CL, in the presence of a non-zero tensor contribution, and using Planck TT+lowP or Planck TT, TE, EE+lowP. Constraints from the Planck 2013 data release are also shown for comparison. The thin black stripe shows the prediction of single-field monomial inflation models with 50 <N< 60.

thumbnail Fig. 6

Marginalized joint confidence contours for (ns,r), at the 68% and 95% CL, in the presence of running of the spectral indices, and for the same data combinations as in the previous figure.

Neither the Planck full mission constraints in Eqs. (22)(25) nor those including a running in Eqs. (27) and (29) are compatible with the interpretation of the BICEP2 B-mode polarization data in terms of primordial gravitational waves (BICEP2 Collaboration 2014b). Instead they are in excellent agreement with the results of the BICEP2/Keck Array-Planck cross-correlation analysis, as discussed in Sect. 13.

5.2. Dependence of the r constraints on the low- likelihood

The constraints on r discussed above are further tightened by adding WMAP polarization information on large angular scales. The Planck measurement of CMB polarization on large angular scales at 70 GHz is consistent with the WMAP 9-year one, based on the K, Q, and V-bands (at 30, 40, and 60 GHz, respectively), once the Planck 353 GHz channel is used to remove the dust contamination, instead of the theoretical dust model used by the WMAP team (Page et al. 2007). (For a detailed discussion, see Planck Collaboration XI 2016.) By combining Planck TT data with LFI 70 GHz and WMAP polarization data on large angular scales, we obtain a 35% reduction of uncertainty, giving τ = 0.074 ± 0.012 (68% CL) and ns = 0.9660 ± 0.060 (68% CL) for the base ΛCDM model. When tensors are added, the bounds become r0.002<0.09(95%CL,PlanckTT+lowP+WP),ns=0.9655±0.058(68%CL,PlanckTT+lowP+WP),τ=0.073-0.013+0.011(68%CL,PlanckTT+lowP+WP).\begin{eqnarray} &&r_{0.002} < 0.09 \,\,\quad\quad\quad\quad \!\!\text{(95\% CL, \Planck\ TT+lowP+WP),} \\ &&n_\mathrm{s}= 0.9655 \pm 0.058\quad \text{(68\% CL, \Planck\ TT+lowP+WP),} \\ &&\tau= 0.073^{+0.011}_{-0.013} \,\,\,\,\quad \!\!\text{(68\% CL, \Planck\ TT+lowP+WP).}~~~~~~~~~~~~~~ \end{eqnarray}When tensors and running are both varied, we obtain r0.002< 0.14 (95% CL) and dns/ dlnk = −0.010 ± 0.008 (68% CL) for Planck TT+lowP+WP. These constraints are all tighter than those based on Planck TT+lowP only.

5.3. The tensor-to-scalar ratio and the low- deficit in temperature

As noted previously (Planck Collaboration XV 2014; Planck Collaboration XVI 2014; Planck Collaboration XXII 2014), the low- temperature data display a slight lack of power compared to the expectation of the best-fit tensor-free base ΛCDM model. Since tensor fluctuations add power on small scales, the effect will be exacerbated in models allowing r> 0.

In order to quantify this tension, we compare the observed constraint on r to that inferred from simulated Planck data. In the simulations, we assume the underlying fiducial model to be tensor-free, with parameters close to the base ΛCDM best-fit values. We limit the simulations to mock temperature power spectra only and fit these spectra with an exact low- likelihood for 2 ≤ ≤ 29 (see Perotto et al. 2006), and a high- Gaussian likelihood for 30 ≤ ≤ 2508 based on the frequency-combined, foreground-marginalized, unbinned Planck temperature power spectrum covariance matrix. Additionally, we impose a Gaussian prior of τ = 0.07 ± 0.02.

Based on 100 simulated data sets, we find a 95% CL upper limit on the tensor-to-scalar ratio of \hbox{$\bar{r}_{2\sigma} \approx 0.260$}. The corresponding constraint from real data (using low-Commander temperature data, the frequency-combined, foreground-marginalized, unbinned Planck high- TT power spectrum, and the same prior on τ as above) reads r< 0.123, confirming that the actual constraint is tighter than what one would have expected. However, the actual constraint is not excessively unusual: out of the 100 simulations, 4 lead to an even tighter bound, corresponding to a significance of about 2σ. Thus, under the hypothesis of the base ΛCDM cosmology, the upper limit on r that we get from the data is not implausible as a chance fluctuation of the low multipole power.

To illustrate the contribution of the low- temperature power deficit to the estimates of cosmological parameters, we show as an example in Fig. 7 how ns shifts towards lower values when the < 30 temperature information is discarded (we will refer to this case as “Planck TTlowT”). The shift in ns is approximately −0.005 (or −0.003 when the lowP likelihood is replaced by a Gaussian prior τ = 0.07 ± 0.02). These shifts exceed those found in Sect. 4.4, where a primordial power spectrum suppressed on large scales was fitted to the data.

thumbnail Fig. 7

One-dimensional posterior probabilities for ns for the base ΛCDM model obtained by excluding temperature multipoles for < 30 (“TTlowT”), while either keeping low- polarization data, or in addition replacing them with a Gaussian prior on τ.

Figure 8 displays the posterior probability for r for various combinations of data sets, some of which exclude the < 30TT data. This leads to the very conservative bounds r ≲ 0.24 and r ≲ 0.23 at 95% CL when combined with the lowP likelihood or with the Gaussian prior τ = 0.07 ± 0.020, respectively.

thumbnail Fig. 8

One-dimensional posterior probabilities for r for various data combinations, either including or not including temperature multipoles for < 30, and compared with the baseline choice (Planck TT+lowP, black curve).

5.4. Relaxing assumptions on the late-time cosmological evolution

thumbnail Fig. 9

Marginalized joint 68% and 95% CL for (ns,r0.002) using Planck TT+lowP+BAO (upper panel) and Planck TT, TE, EE+lowP (lower panel).

Table 5

Constraints on extensions of the ΛCDM+r cosmological model for Planck TT+lowP+lensing, Planck TT+lowP+BAO, and Planck TT, TE, EE+lowP.

As in the Planck 2013 release (PCI13), we now ask how robust the Planck results on the tensor-to-scalar ratio are against assumptions on the late-time cosmological evolution. The results are summarized in Table 5, and some particular cases are illustrated in Fig. 9. Constraints on r turn out to be remarkably stable for one-parameter extensions of the ΛCDM+r model, with the only exception the ΛCDM+r+ΩK case in the absence of the late time information from Planck lensing or BAO data. The weak trend towards ΩK< 0, i.e., towards a positively curved (closed) universe from the temperature and polarization data alone, and the well-known degeneracy between ΩK and H0/Ωm lead to a slight suppression of the Sachs-Wolfe plateau in the scalar temperature spectrum. This leaves more room for a tensor component.

This further degeneracy when r is added builds on the negative values for the curvature allowed by Planck TT+lowP, ΩK=0.052-0.055+0.049\hbox{$\Omega_K = -0.052^{+0.049}_{-0.055}$} at 95% CL (Planck Collaboration XIII 2016). The exploitation of the information contained in the Planck lensing likelihood leads to a tighter constraint, ΩK=0.005-0.017+0.016\hbox{$\Omega_K = -0.005^{+0.016}_{-0.017}$} at 95% CL, which improves on the Planck 2013 results (ΩK=0.007-0.019+0.018\hbox{$\Omega_K = -0.007^{+0.018}_{-0.019}$} at 95% CL). However, due to the remaining degeneracies left by the uncertainties in polarization on large angular scales, a full appreciation of the improvement due to the full mission temperature and lensing data can be obtained by using lowP+WP, which leads to ΩK=0.003-0.014+0.012\hbox{$\Omega_K = -0.003^{+0.012}_{-0.014}$} at 95% CL. Note that the negative values allowed for the curvature are decreased in magnitude when the running is allowed, suggesting that the low- temperature deficit is contributing to the estimate of the spatial curvature.

The trend found for ΛCDM+r+ΩK is even clearer when spatial curvature and the running of the spectral index are varied at the same time. In this case, the Planck temperature plus polarization data are compatible with r values as large as 0.19 (95% CL), at the cost of an almost 4σ deviation from spatial flatness (which, however, disappears as soon as lensing or BAO data are considered).

6. Implications for single-field slow-roll inflation

In this section we study the implications of Planck 2015 constraints on standard slow-roll single-field inflationary models.

6.1. Constraints on slow-roll parameters

We first present the Planck 2015 constraints on slow-roll parameters obtained through the analytic perturbative expansion in terms of the HFFs ϵi for the primordial spectra of cosmological fluctuations during slow-roll inflation (Stewart & Lyth 1993; Gong & Stewart 2001; Leach et al. 2002). When restricting to first order in ϵi, we obtain ϵ1<0.0068(95%CL,PlanckTT+lowP),ϵ2=0.029-0.007+0.008(68%CL,PlanckTT+lowP).\begin{eqnarray} \label{epsilon_1_TT_no} \epsilon_1 & < &\,0.0068 \text{(95\% CL, \Planck\ TT+lowP)} , \\ \label{epsilon_2_TT_no} \epsilon_2 & =& \,0.029^{+0.008}_{-0.007} ~~ \text{(68\% CL, \Planck\ TT+lowP)} . \end{eqnarray}When high- polarization is included we obtain ϵ1< 0.0066 at 95% CL and ϵ2=0.030-0.006+0.007\hbox{$\epsilon_2 = 0.030^{+0.007}_{-0.006}$} at 68% CL. When second-order contributions in the HFFs are included, we obtain ϵ1<0.012(95%CL,PlanckTT+lowP),ϵ2=0.031-0.011+0.013(68%CL,PlanckTT+lowP),0.41<ϵ3<1.38(95%CL,PlanckTT+lowP).\begin{eqnarray} \label{epsilon_1_TT} &&\epsilon_1 < 0.012 \,\,\,\quad\quad \text{(95\% CL, \Planck\ TT+lowP)} , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~\\ \label{epsilon_2_TT} &&\epsilon_2 = 0.031^{+0.013}_{-0.011} \,\,\, \text{(68\% CL, \Planck\ TT+lowP)} , \\ \label{epsilon_3_TT} &&\quad -0.41 < \epsilon_3 < 1.38 \quad\quad\quad \text{(95\% CL, \Planck\ TT+lowP)} . \end{eqnarray}When high- polarization is included we obtain ϵ1< 0.011 at 95% CL, ϵ2=0.032-0.009+0.011\hbox{$\epsilon_2 = 0.032^{+0.011}_{-0.009}$} at 68% CL, and −0.32 <ϵ3< 0.89 at 95% CL.

The potential slow-roll parameters are obtained as derived parameters by using their exact expressions as function of ϵi (Leach et al. 2002; Finelli et al. 2010): ϵV=Vφ2Mpl22V2=ϵ1(1ϵ13+ϵ26)2(1ϵ13)2,ηV=VφφMpl2V=2ϵ1ϵ222ϵ123+5ϵ1ϵ26ϵ2212ϵ2ϵ361ϵ13,ξV2=VφφφVφMpl4V2=1ϵ13+ϵ26(1ϵ13)2(4ϵ123ϵ1ϵ2+ϵ2ϵ32ϵ1ϵ22+3ϵ12ϵ243ϵ1376ϵ1ϵ2ϵ3+ϵ22ϵ36+ϵ2ϵ326+ϵ2ϵ3ϵ46),\begin{eqnarray} \label{epsilon_V} \epsilon_V &=& \,\frac{V_\phi^2 M_\mathrm{pl}^2}{2 V^2} = \epsilon_1 \frac{\left(1 - \frac{\epsilon_1}{3} + \frac{\epsilon_2}{6}\right)^2}{\left(1 - \frac{\epsilon_1}{3}\right)^2} ,~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \\ \label{eta_V} \eta_V &=& \,\frac{V_{\phi \phi} M_\mathrm{pl}^2}{V} = \frac{ 2 \epsilon_1 - \frac{\epsilon_2}{2} - \frac{2\epsilon_1^2}{3} + \frac{5\epsilon_1 \epsilon_2}{6} -\frac{\epsilon_2^2}{12} - \frac{\epsilon_2 \epsilon_3}{6}}{1 - \frac{\epsilon_1}{3}} , \\ \label{xi_V} \xi_V^2 &=& \frac{V_{\phi \phi \phi} V_\phi M_\mathrm{pl}^4}{V^2} = \frac{1 - \frac{\epsilon_1}{3} + \frac{\epsilon_2}{6}}{\left(1-\frac{\epsilon_1}{3}\right)^2} \left(4 \epsilon_1^2 -3 \epsilon_1 \epsilon_2 + \frac{\epsilon_2^{\phantom{2}} \epsilon_3}{2} - \epsilon_1 \epsilon_2^2 \right. \nonumber \\ &&\quad+ \left. 3 \epsilon_1^2 \epsilon_2 - \frac{4}{3} \epsilon_1^3 - \frac{7}{6} \epsilon_1 \epsilon_2 \epsilon_3 +\frac{\epsilon_2^2 \epsilon_3}{6} + \frac{\epsilon_2 \epsilon_3^2}{6} + \frac{\epsilon_2 \epsilon_3 \epsilon_4}{6} \right) , \end{eqnarray}where V(φ) is the inflaton potential, the subscript φ denotes the derivative with respect to φ, and Mpl = (8πG)− 1 / 2 is the reduced Planck mass (see also Table 2).

By using Eqs. (39) and (40) with ϵ3 = 0 and the primordial power spectra to lowest order in the HFFs, the derived constraints for the first two slow-roll potential parameters are: ϵV<0.0068(95%CL,PlanckTT+lowP),ηV=0.010-0.009+0.005(68%CL,PlanckTT+lowP).\begin{eqnarray} \label{epsilon_V_TT_no} \epsilon_V & <& \,0.0068 \text{(95\% CL, \Planck\ TT+lowP)} , \\ \label{eta_V_TT_no} \eta_V & =& \,-0.010^{+0.005}_{-0.009} ~\text{(68\% CL, \Planck\ TT+lowP)} . \end{eqnarray}When high- polarization is included we obtain ϵV< 0.0067 at 95% CL and ηV=0.010-0.009+0.004\hbox{$\eta_V = -0.010^{+0.004}_{-0.009}$} at 68% CL. By using Eqs. (39)(41) with ϵ4 = 0 and the primordial power spectra to second order in the HFFs, the derived constraints for the slow-roll potential parameters are: ϵV<0.012(95%CL,PlanckTT+lowP),ηV=0.0080-0.0146+0.0088(68%CL,PlanckTT+lowP),ξV2=0.0070-0.0069+0.0045(68%CL,PlanckTT+lowP).\begin{eqnarray} \label{epsilon_V_TT} \epsilon_V & <& \,0.012 \text{(95\% CL, \Planck\ TT+lowP)} , \\ \label{eta_V_TT} \eta_V & =& \,-0.0080^{+0.0088}_{-0.0146} ~ \text{(68\% CL, \Planck\ TT+lowP)} , \\ \label{xi_V_TT} \xi^2_V & =& \,0.0070^{+0.0045}_{-0.0069} ~ \text{(68\,\% CL, \Planck\ TT+lowP)} . \end{eqnarray}When high- polarization is included we obtain ϵV< 0.011 at 95% CL, and ηV=0.0092-0.0127+0.0074\hbox{$\eta_V = -0.0092^{+0.0074}_{-0.0127}$} and ξV2=0.0044-0.0050+0.0037\hbox{$\xi^2_V = 0.0044^{+0.0037}_{-0.0050}$}, both at 68% CL.

thumbnail Fig. 10

Marginalized joint 68% and 95% CL regions for (ϵ1,ϵ2) (top panel) and (ϵV,ηV) (bottom panel) for Planck TT+lowP (red contours), Planck TT, TE, EE+lowP (blue contours), and compared with the Planck 2013 results (grey contours).

In Figs. 10 and 11 we show the 68% CL and 95% CL of the HFFs and the derived potential slow-roll parameters with and without the high- polarization and compare these values with the Planck 2013 results.

6.2. Implications for selected inflationary models

The predictions to lowest order in the slow-roll approximation for (ns,r) at k = 0.002 Mpc-1 of a few inflationary models with a representative uncertainty for the entropy generation stage (50 <N< 60) are shown in Fig. 12. Figure 12 updates Fig. 1 of PCI13 with the same notation.

In the following we discuss the implications of Planck TT+lowP+BAO data for selected slow-roll inflationary models by taking into account the uncertainties in the entropy generation stage. We model these uncertainties by two parameters, as in PCI13: the energy scale ρth by which the Universe has thermalized, and the parameter wint which characterizes the effective equation of state between the end of inflation and the energy scale specified by ρth. We use the primordial power spectra of cosmological fluctuations generated during slow-roll inflation parameterized by the HFFs, ϵi, to second order, which can be expressed in terms of the number of e-folds to the end of inflation, N, and the parameters of the considered inflationary model, using modified routines of the public code ASPIC6 (Martin et al. 2014). For the number of e-folds to the end of inflation (Liddle & Leach 2003; Martin & Ringeval 2010) we use the expression (PCI13) N67ln(ka0H0)+14ln(V2Mpl4ρend)+13wint12(1+wint)ln(ρthρend)112ln(gth),\begin{eqnarray} \begin{aligned} N_* \approx & \; 67 - \ln \left(\frac{k_*}{a_0 H_0}\right) + \frac{1}{4}\!\ln{\left(\frac{V_*^2}{\Mpl^4 \rhoend}\right) } \\ &+ \frac{1-3w_\mathrm{int}}{12(1+w_\mathrm{int})} \ln{\left(\frac{\rhorh}{\rhoend} \right)} - \frac{1}{12} \ln (g_\mathrm{th} ) , \label{eq:nefolds} \end{aligned} \end{eqnarray}(47)where ρend is the energy density at the end of inflation, a0H0 is the present Hubble scale, V is the potential energy when k left the Hubble radius during inflation, wint characterizes the effective equation of state between the end of inflation and the thermalization energy scale ρth, and gth is the number of effective bosonic degrees of freedom at the energy scale ρth. We consider the pivot scale k = 0.002 Mpc-1, gth = 103, and ϵend = 1. We consider the uniform priors for the cosmological parameters listed in Table 6. We also consider a logarithmic prior on 1010As (over the interval [ (e2.5,e3.7 ]) and ρth (over the interval [ (1 TeV)4,ρend ]). We consider both the case in which wint is kept fixed at zero and the case in which it is allowed to vary with a uniform prior in the range −1 / 3 <wint< 1 / 3.

Table 6

Priors for cosmological parameters used in the Bayesian comparison of inflationary models.

We have validated the slow-roll approach by cross-checking the Bayes factor computations against the fully numerical inflationary mode equation solver ModeCode coupled to the PolyChord sampler. For each inflationary model we provide in Table 7 and in the main text the Δχ2 value with respect to the base ΛCDM model, computed with the CosmoMC implementation of the BOBYQA algorithm for maximizing the likelihood, and the Bayesian evidence with respect to the R2 inflationary model (Starobinsky 1980), computed by CosmoMC connected to CAMB, using MultiNest as the sampler.

thumbnail Fig. 11

Marginalized joint 68% and 95% CL regions for (ϵ1,ϵ2,ϵ3) (top panels) and (ϵV,ηV,ξV2)\hbox{$(\epsilon_V , \eta_V , \xi_V^2)$} (bottom panels) for Planck TT+lowP (red contours), Planck TT, TE, EE+lowP (blue contours), and compared with the Planck 2013 results (grey contours).

Power law potentials

We first investigate the class of inflationary models with a single monomial potential (Linde 1983): V(φ)=λMpl4(φMpl)n,\begin{equation} V(\phi) = \lambda M_\mathrm{pl}^4 \left(\frac{\phi}{M_\mathrm{pl}} \right)^n , \label{PowerLawPot:Eq} \end{equation}(48)in which inflation occurs for large values of the inflaton, φ>Mpl. The predictions for the scalar spectral index and the tensor-to-scalar ratio at first order in the slow-roll approximation are ns−1 ≈ −2(n + 2) / (4N + n) and r ≈ 16n/ (4N + n), respectively. By assuming a dust equation of state (i.e., wint = 0) prior to thermalization, the cubic and quartic potentials are strongly disfavoured by lnB = −11.6 and lnB = −23.3, respectively. The quadratic potential is moderately disfavoured by lnB = −4.7. Other values, such as n = 4 / 3, 1, and 2 / 3, motivated by axion monodromy (Silverstein & Westphal 2008; McAllister et al. 2010), are compatible with Planck data with wint = 0.

Table 7

Results of the inflationary model comparison.

Small modifications occur when considering the effective equation of state parameter, wint = (n−2) / (n + 2), defined by averaging over the coherent oscillation regime which follows inflation (Turner 1983). The Bayes factors are slightly modified when wint is allowed to float, as shown in Table 7.

Hilltop models

In hilltop models (Boubekeur & Lyth 2005), with potential V(φ)Λ4(1φpμp+...),\begin{equation} V(\phi) \approx \Lambda^4 \left(1 - \frac{\phi^{{p}}}{\mu^{{p}}} + ...\right) , \label{newinf} \end{equation}(49)the inflaton rolls away from an unstable equilibrium. The predictions to first order in the slow-roll approximation are r ≈ 8p2(Mpl/μ)2x2p−2/ (1−xp)2 and ns−1 ≈ −2p(p−1)(Mpl/μ)2xp−2/ (1−xp)−3r/ 8, where x = φ/μ. As in PCI13, the ellipsis in Eq. (49) and in what follows indicates higher-order terms that are negligible during inflation but ensure positiveness of the potential.

By sampling log 10(μ/Mpl) within the prior [ 0.30,4.85 ] for p = 2, we obtain log 10(μ/Mpl) > 1.02 (1.05) at 95% CL and lnB = −2.6 (−2.4) for wint = 0 (allowing wint to float).

An exact potential which could also apply after inflation, instead of the approximated one in Eq. (49), might be needed for a better comparison among different models. For μ/Mpl ≫ 1, hilltop models as defined in Eq. (49) by neglecting the additional terms denoted by the ellipsis lead to ns−1 ≈ −3r/ 8, the same prediction as for the previously discussed linear potential, V(φ) ∝ φ. By considering a double well potential, V(φ) = Λ4 [ 1−φ2/ (2μ2) ] 2, instead, we obtain a slightly worse Bayes factor than the hilltop p = 2 model, lnB = −3.1 (−2.3) for wint = 0 (wint allowed to vary). This different result can be easily understood. Although the double well potential is equal to the hilltop model for φμ, it approximates V(φ) ∝ φ2 for μ/Mpl ≫ 1. Since a linear potential is a better fit to Planck than φ2, the fit of the double well potential is therefore worse than the hilltop p = 2 case for μ/Mpl ≫ 1, and this partially explains the slightly different Bayes factors obtained.

In the p = 4 case, we obtain log 10(μ/Mpl) > 1.05 (1.02) at 95% CL and lnB = −2.8 (−2.6) for wint = 0 (allowing wint to float), assuming a prior range [−2,2 ] for log 10(μ/Mpl).

Natural inflation

In natural inflation (Freese et al. 1990; Adams et al. 1993) a nonperturbative shift symmetry is invoked to suppress radiative corrections leading to the periodic potential V(φ)=Λ4[1+cos(φf)],\begin{equation} V(\phi)=\Lambda^4 \left[ 1+\cos \left(\frac{\phi}{f} \right) \right], \label{NatInf} \end{equation}(50)where f is the scale which determines the curvature of the potential. We sample log 10(f/Mpl) within the prior [ 0.3,2.5 ] as in PCI13. We obtain log 10(f/Mpl) > 0.84 (> 0.83) at 95% CL and lnB = −2.4 (−2.3) for wint = 0 (allowing wint to vary).

Note that the super-Planckian value for f required by observations is not necessarily a problem for this class of models. When several fields φi with a cosine potential as in Eq. (50) and scales fi appear in the Lagrangian, an effective single-field inflationary trajectory can be found for a suitable choice of parameters (Kim et al. 2005). In such a setting, the super-Planckian value of the effective scale f required by observations can be obtained even if the original scales satisfy fiMpl (Kim et al. 2005).

D-brane inflation

Inflation can arise from physics involving extra dimensions. If the standard model of particle physics is confined to our 3-dimensional brane, the distance between our brane and anti-brane can drive inflation. We consider the following parameterization for the effective potential driving inflation: V(φ)=Λ4(1μpφp+...).\begin{equation} V(\phi) = \Lambda^4 \left(1 - \frac{\mu^p}{\phi^p} + ... \right) . \label{Dbrane_infl} \end{equation}(51)Sampling log 10(μ/Mpl) using a uniform prior over [−6,0.3 ] , we consider p = 4 (Kachru et al. 2003; Dvali et al. 2001) and p = 2 (Garcia-Bellido et al. 2002). The predictions for r and ns can be obtained from the hilltop case with the substitution p → −p. These models agree with the Planck data with a Bayes factor of lnB = −0.4 (−0.6) and lnB = −0.7 (−0.9) for p = 4 and p = 2, respectively, for wint = 0 (allowing wint to vary).

Potentials with exponential tails

Exponential potentials are ubiquitous in inflationary models motivated by supergravity and string theory (Goncharov & Linde 1984; Stewart 1995; Dvali & Tye 1999; Burgess et al. 2002; Cicoli et al. 2009). We restrict ourselves to analysing the following class of potentials: V(φ)=Λ4(1e/Mpl+...).\begin{equation} V(\phi) = \Lambda^4 \left(1 - {\rm e}^{-q \phi/M_\mathrm{pl}} + ... \right) . \label{exp_infl} \end{equation}(52)As for the hilltop models described earlier, the ellipsis indicates possible higher-order terms that are negligible during inflation but ensure positiveness of the potential. These models predict r ≈ 8q2e− 2/Mpl/ (1−e/Mpl)2 and ns−1 ≈ −q2e/Mpl(2 + e/Mpl) / (1−e/Mpl)2 with a slow-roll trajectory characterized by Nf(φ/Mpl)−f(φend/Mpl), with f(x) = (eqxqx) /q2. By sampling log 10(q/Mpl) with a uniform prior over [−3,3 ], we obtain a Bayes factor of −0.6 for wint = 0 (−0.9 when wint is allowed to vary).

thumbnail Fig. 12

Marginalized joint 68% and 95% CL regions for ns and r at k = 0.002 Mpc-1 from Planck compared to the theoretical predictions of selected inflationary models. Note that the marginalized joint 68% and 95% CL regions have been obtained by assuming dns/ dlnk = 0.

Spontaneously broken SUSY

Hybrid models (Copeland et al. 1994; Linde 1994) predicting ns> 1 are strongly disfavoured by the Planck data, as for the first cosmological release (PCI13). An example of a hybrid model predicting ns< 1 is the case in which slow-roll inflation is driven by loop corrections in spontaneously broken supersymmetric (SUSY) grand unified theories (Dvali et al. 1994) described by the potential V(φ)=Λ4[1+αhlog(φ/Mpl)],\begin{equation} V(\phi) = \Lambda^4 \left[ 1 + \alpha_{\rm h} \log\left(\phi/M_\mathrm{pl}\right) \right], \label{sbsusy} \end{equation}(53)where αh> 0 is a dimensionless parameter. Note that for αh ≪ 1, this model leads to the same predictions as the power-law potential for p ≪ 1 to lowest order in the slow-roll approximation. By sampling log 10(αh) on a flat prior [−2.5,1 ], we obtain a Bayes factor of −2.2 for wint = 0 (−1.7 when wint is allowed to vary).

R2 inflation

The first inflationary model proposed (Starobinsky 1980), with action S=d4xgMpl22(R+R26M2),\begin{equation} S = \int \mathrm{d}^4 x \sqrt{-g} \frac{M^2_\mathrm{pl}}{2} \left(R + \frac{R^2}{6 M^2} \right), \label{R2} \end{equation}(54)still lies within the Planck 68% CL constraints, as for the Planck 2013 release (PCI13). This model corresponds to the potential V(φ)=Λ4(1e2/3φ/Mpl)2\begin{equation} V(\phi) = \Lambda^4 \left(1 - {\rm e}^{- \sqrt{2/3} \phi/M_\mathrm{pl} }\right)^2 \label{R2_Einsteinframe} \end{equation}(55)in the Einstein frame, which leads to the slow-roll predictions ns−1 ≈ −2 /N and r ≈ 12 /N2 (Mukhanov & Chibisov 1981; Starobinsky 1983).

After the Planck 2013 release, several theoretical developments supported the model in Eq. (54) beyond the original motivation of including quantum effects at one-loop (Starobinsky 1980). No-scale supergravity (Ellis et al. 2013a), the large-field regime of superconformal D-term inflation (Buchmüller et al. 2013), or recent developments in minimal supergravity (Farakos et al. 2013; Ferrara et al. 2013b) can lead to a generalization of the potential in Eq. (55) (see Ketov & Starobinsky 2011 for a previous embedding of R2 inflation in F(ℛ) supergravity). The potential in Eq. (55) can also be generated by spontaneous breaking of conformal symmetry (Kallosh & Linde 2013b). This inflationary model has Δχ2 ≈ 0.8 (0.3) larger than the base ΛCDM model with no tensors for wint = 0 (for wint allowed to vary). We obtain 54 <N< 62 (53 <N< 64) at 95% CL for wint = 0 (for wint allowed to vary), compatible with the theoretical prediction, N = 54 (Starobinsky 1980; Vilenkin 1985; Gorbunov & Panin 2011).

α attractors

We now study two classes of inflationary models motivated by recent developments in conformal symmetry and supergravity (Kallosh et al. 2013). The first class has been motivated by considering a vector rather than a chiral multiplet for the inflaton in supergravity (Ferrara et al. 2013a) and corresponds to the potential (Kallosh et al. 2013) V(φ)=Λ4(1e2φ/(3αMpl))2.\begin{equation} V(\phi) = \Lambda^4 \left(1 - {\rm e}^{- \sqrt{2} \phi/\left(\sqrt{3 \alpha} M_\mathrm{pl}\right) }\right)^2 . \label{alpha} \end{equation}(56)To lowest order in the slow-roll approximation, these models predict r64/[3α(1e2φ/(3αMpl))2]\hbox{$r \approx 64/[3 \alpha (1-{\rm e}^{\sqrt{2} \phi/(\sqrt{3 \alpha} M_\mathrm{pl})})^2]$} and ns18(1+e2φ/(3αMpl))/[3α(1e2φ/(3αMpl))2]\hbox{$n_\mathrm{s} -1 \approx - 8(1+{\rm e}^{\sqrt{2} \phi/(\sqrt{3 \alpha} M_\mathrm{pl})}) /[3 \alpha (1-{\rm e}^{\sqrt{2} \phi/(\sqrt{3 \alpha} M_\mathrm{pl})})^2]$} based on an inflationary trajectory characterized by Ng(φ/Mpl)−g(φend/Mpl) with g(x)=(3α4e2x/3α6αx)/4\hbox{$g(x) = (3 \alpha^4 {\rm e}^{\sqrt{2} x/\sqrt{3 \alpha}} - \sqrt{6 \alpha} x)/4$}. The relation between N and φ can be inverted through the use of the Lambert functions, as carried out for other potentials (Martin et al. 2014). By sampling log 10(α2) with a flat prior over [ 0,4 ], we obtain log 10(α2) < 1.7 (2.0) at 95% CL and a Bayes factor of −1.8 (−2) for wint = 0 (for wint allowed to vary).

The second class of models has been called super-conformal α attractors (Kallosh et al. 2013) and can be understood as originating from a different generating function with respect to the first class. This second class is described by the following potential (Kallosh et al. 2013): V(φ)=Λ4tanh2m(φ6αMpl)·\begin{equation} V(\phi) = \Lambda^4 \tanh^{2 m} \left(\frac{\phi}{\sqrt{6 \alpha} M_\mathrm{pl}} \right) \cdot \label{alphaattractors} \end{equation}(57)This is the simplest class of models with spontaneous breaking of conformal symmetry, and for α = m = 1 reduces to the original model introduced by Kallosh & Linde (2013b). The potential in Eq. (57) leads to the following slow-roll predictions (Kallosh et al. 2013): r48αm4mN2+2Ng(α,m)+3αm,\begin{eqnarray} \label{exact1} &&r \approx {48 \alpha m \over 4 m N^2+ 2 N g(\alpha,m) + 3 \alpha m} , \\ \notag \\ &&n_{\rm s} - 1 \approx - \frac{8 m N + 6 \alpha m + 2 g(\alpha,m)}{4 m N^2 + 2 N g(\alpha,m) + 3 \alpha m} , \label{exact} \end{eqnarray}where g(α,m)=3α(4m2+3α)\hbox{$g(\alpha,m) = \sqrt{3 \alpha (4 m^2 + 3 \alpha)}$}. The predictions of this second class of models interpolate between those of a large-field chaotic model, V(φ) ∝ φ2m, for α ≫ 1 and the R2 model for α ≪ 1.

For α we adopt the same priors as for the previous class in Eq. (56). By fixing m = 1, we obtain log 10(α2) < 2.3 (2.5) at 95% CL and a Bayes factor of −2.3 (−2.2) for wint = 0 (when wint is allowed to vary). When m is allowed to vary as well with a flat prior in the range [ 0,2 ], we obtain 0.02 <m< 1 (m< 1) at 95% CL for wint = 0 (when wint is allowed to vary).

Non-minimally coupled inflaton

Inflationary predictions are quite sensitive to a non-minimal coupling ξRφ2 of the inflaton to the Ricci scalar. One of the most interesting effects due to ξ ≠ 0 is to reconcile the quartic potential V(φ) = λφ4/ 4 with Planck observations, even for ξ ≪ 1. Non-minimal coupling leads as well to attractor behaviour towards predictions similar to those in R2 inflation (Kaiser & Sfakianakis 2014; Kallosh & Linde 2013a).

The Higgs inflation model (Bezrukov & Shaposhnikov 2008), in which inflation occurs with V(φ)=λ(φ2φ02)2/4\hbox{$V(\phi) = \lambda (\phi^2-\phi_0^2)^2/4$} and ξ ≫ 1 for φφ0, leads to the same predictions as the R2 model to lowest order in the slow-roll approximation at tree level (see Barvinsky et al. 2008; and Bezrukov & Shaposhnikov 2009 for the inclusion of loop corrections). It is therefore in agreement with the Planck constraints, as for the first cosmological data release (PCI13).

We summarize below our findings for Planck lowP+BAO.

  • Monomial potentials with integral n> 2 are strongly disfavoured withrespect to R2.

  • The Bayes factor prefers R2 over chaotic inflation with monomial quadratic potential by odds of 110:1 under the assumption of a dust equation of state during the entropy generation stage.

  • R2inflation has the strongest evidence (i.e., the greatest Bayes factor) among the models considered here. However, care must be taken not to overinterpret small differences in likelihood lacking statistical significance.

  • The models closest to R2 in terms of evidence are brane inflation and exponential inflation, which have one more parameter than R2. Both brane inflation considered in Eq. (51) and exponential inflation in Eq. (52) approximate the linear potential for a large portion of parameter space (for μ/Mpl ≫ 1 and q ≫ 1, respectively). For this reason these models have a higher Bayesian evidence (although not at a statistically significant level) than those that approximate a quadratic potential, as do α-attractors, for instance.

  • In the models considered here, the Δχ2 obtained by allowing w to vary is modest (i.e., less than approximately 1.6 with respect to wint = 0). The gain in the logarithm of the Bayesian evidence is even smaller, since an extra parameter is added.

7. Reconstruction of the potential and analysis beyond slow-roll approximation

7.1. Introdution

In the previous section, we derived constraints on several types of inflationary potentials assumed to account for the inflaton dynamics between the time at which the largest observable scales crossed the Hubble radius during inflation and the end of inflation. The full shape of the potential was used in order to identify when inflation ends, and thus the field value φ when the pivot scale crosses the Hubble radius.

In Sect. 6 of PCI13, we explored another approach, consisting of reconstructing the inflationary potential within its observable range without making any assumptions concerning the inflationary dynamics outside that range. Indeed, given that the number of e-folds between the observable range and the end of inflation can always be adjusted to take a realistic value, any potential shape giving a primordial spectrum of scalar and tensor perturbations in agreement with observations is a valid candidate. Inflation can end abruptly by a phase transition, or can last a long time if the potential becomes very flat after the observable region has been crossed. Moreover, there could be a short inflationary stage responsible for the origin of observable cosmological perturbations, and another inflationary stage later on (but before nucleosynthesis), thus contributing to the total N.

In Sect. 6 of PCI13, we performed this analysis with a full integration of the inflaton and metric perturbation modes, so that no slow-roll approximation was made. The only assumption was that primordial scalar perturbations are generated by the fluctuations of a single inflaton field with a canonical kinetic term. Since in this approach one is only interested in the potential over a narrow range of observable scales (centred around the field value φ when the pivot scale crosses the Hubble radius), it is reasonable to test relatively simple potential shapes described by a small number of free parameters.

This approach gave very similar results to calculations based on the standard slow-roll analysis. This agreement can be explained by the fact that the Planck 2013 data already preferred a primordial spectrum very close to a power law, at least over most of the observable range. Hence the 2013 data excluded strong deviations from slow-roll inflation, which would either produce a large running of the spectral index or imprint more complicated features on the primordial spectrum. However, this conclusion did not apply to the largest scales observable by Planck, for which cosmic variance and slightly anomalous data points remained compatible with significant deviations from a simple power law spectrum. The most striking result in Sect. 6 of PCI13 was that a less restricted functional form for the inflaton potential gave results compatible with a rather steep potential at the beginning of the observable window, leading to a “not-so-slow” roll stage during the first few observable e-folds. This explains the shape of the potential in Fig. 14 of PCI13 for a Taylor expansion at order n = 4 and in the region where φφ ≤ −0.2. However, such features were only partially explored because the method used for potential reconstruction did not allow for an arbitrary value of the inflation velocity \hbox{$\dot{\phi}$} at the beginning of the observable window. Instead, our code imposed that the inflaton already tracked the inflationary attractor solution when the largest observable modes crossed the Hubble scale.

Given that the Planck 2015 data establish even stronger constraints on the primordial power spectrum than the 2013 results, it is of interest to revisit the reconstruction of the potential V(φ). Section 7.2 presents some new results following the same approach as in PCI13 (explained previously in Lesgourgues & Valkenburg 2007; and Mortonson et al. 2011). But in the present work, we also present some more general results, independent of any assumption concerning the initial field velocity \hbox{$\dot{\phi}$} when the inflaton enters the observable window. Following previous studies (Kinney 2002; Kinney et al. 2006; Peiris & Easther 2006a,b, 2008; Easther & Peiris 2006; Lesgourgues et al. 2008; Powell & Kinney 2007; Hamann et al. 2008; Norena et al. 2012), we reconstruct the Hubble function H(φ), which determines both the potential V(φ) through V(φ)=3MPl2H2(φ)2MPl4[H(φ)]2,\begin{equation} V(\phi)= 3 M_\mathrm{Pl}^2 \, H^2(\phi) - 2 M_\mathrm{Pl}^4 \left[H'(\phi)\right]^2\!, \label{eq:VfromH} \end{equation}(60)and the solution φ(t) through φ̇=2MPl2H(φ),\begin{equation} \dot \phi = - 2 M_\mathrm{Pl}^2 H'(\phi), \end{equation}(61)with H′(φ) = ∂H/∂φ. Note that these two relations are exact. In Sect. 7.3, we fit H(φ) directly to the data, implicitly including all canonical single-field models in which the inflaton is rolling not very slowly (ϵ not much smaller than unity) just before entering the observable window, and the issue of having to start sufficiently early in order to allow the initial transient to decay is avoided. The only drawback in reconstructing H(φ) is that one cannot systematically test the simplest analytic forms for V(φ) in the observable range (for instance, polynomials of order n = 1,3,5,... in (φφ)). But our goal in this section is to explore how much one can deviate from slow-roll inflation in general, independently of the shape of the underlying inflaton potential.

Table 8

Numerical reconstruction of the potential slow-roll parameters beyond any slow-roll approximation when the potential is Taylor expanded to nth order, using Planck TT+lowP+BAO.

7.2. Reconstruction of a smooth inflaton potential

Following the approach of PCI13, we Taylor expand the inflaton potential around φ = φ to order n = 2,3,4. To obtain faster-converging Markov chains, instead of imposing flat priors on the Taylor coefficients { V,Vφ,...,Vφφφφ }, we sample the potential slow-roll (PSR) parameters {ϵV,ηV,ξV2,ϖV3}\hbox{$\{ \epsilon_V, \eta_V, \xi_V^2, \varpi_V^3\}$} related to the former as indicated in Table 2. We stress that this is just a choice of prior and does not imply any kind of slow-roll approximation in the calculation of the primordial spectra.

The results are given in Table 8 (for Planck TT+lowP+BAO) and Fig. 13 (for the same data set and also for Planck TT, TE, EE+lowP). The second part of Table 8 shows the corresponding values of the spectral parameters ns, dns/ dlnk, and r0.002 as measured for each numerical primordial spectrum (at the pivot scales k = 0.05 Mpc-1 for the scalar and 0.002 Mpc-1 for the tensor spectra), as well as the reionization optical depth. We also show in Fig. 14 the derived distribution of each coefficient Vi (with a non-flat prior) and in Fig. 15 the reconstructed shape of the best-fit inflation potentials in the observable window. Finally, the posterior distribution of the derived parameters r0.002 and dns/ dlnk is displayed in Fig. 16.

thumbnail Fig. 13

Posterior distributions for the first four potential slow-roll parameters when the potential is Taylor expanded to nth order using Planck TT+lowP+BAO (filled contours) or TT, TE, EE+lowP (dashed contours). The primordial spectra are computed beyond the slow-roll approximation.

thumbnail Fig. 14

Posterior distributions for the coefficients of the inflation potential Taylor expanded to nth order (in natural units where 8πMpl=1\hbox{$\sqrt{8 \pi} M_\mathrm{pl}=1$}) reconstructed beyond the slow-roll approximation using Planck TT+lowP+BAO (filled contours) or TT, TE, EE+lowP (dashed contours). The plot shows only half of the results; the other half is symmetric, with opposite signs for Vφ and Vφφφ. Note that, unlike Fig. 13, the parameters shown here do not have flat priors, since they are mapped from the slow-roll parameters.

thumbnail Fig. 15

Observable range of the best-fit inflaton potentials, when V(φ) is Taylor expanded to the nth order around the pivot value φ in natural units (where 8πMpl=1)\hbox{$\sqrt{8 \pi} M_\mathrm{pl}=1)$} assuming a flat prior on ϵV, ηV, ξV2\hbox{$\xi^2_V$}, and ϖV3\hbox{$\varpi_V^3$} and using Planck TT+lowP+BAO. Potentials obtained under the transformation (φφ) → (φφ) leave the same observable signature and are also allowed. The sparsity of potentials with a small V0 = V(φ) is due to our use of a flat prior on ϵV rather than on ln(V0); in fact, V0 is unbounded from below in the n = 2 and 3 results. The axis ranges are identical to those in Fig. 20, to make the comparison easier.

thumbnail Fig. 16

Posterior distribution for the tensor-to-scalar ratio (at k = 0.002 Mpc-1) and for the running parameter dns/ dlnk (at k = 0.05 Mpc-1), for the potential reconstructions in Sects. 7.2 and 7.3. The V(φ) reconstruction gives the solid curves for Planck TT+lowP+BAO, or dashed for TT, TE, EE+lowP. The H(φ) reconstruction gives the dotted curves for Planck TT+lowP+BAO, or dashed-dotted for TT, TE, EE+lowP. The tensor-to-scalar ratio appears as a derived parameter, but by taking a flat prior on either ϵV or ϵH, we implicitly also take a nearly flat prior on r. The same applies to dns/ dlnk.

Figure 13 shows that bounds are very similar when temperature data are combined with either high- polarization data or BAO data. This gives a hint of the robustness of these results. For both data sets, the error bars on the PSR parameters are typically smaller by a factor of 1.5 than in PCI13.

Since potentials with n = 2 cannot generate a significant running, the bounds on the scalar spectral index and the tensor-to-scalar ratio and the best-fit models are very similar to those obtained with the ΛCDM+r model in Sect. 5 and Table 4. On the other hand, in the n = 3 model, results follow the trend of the previous ΛCDM+r+dns/ dlnk analysis. The data prefer potentials with Vφ and Vφφφ of the same sign, generating a significant negative running (as can be seen in Fig. 16). This trend for Vφφφ occurs because a scalar spectrum with negative running reduces the power on large scales, and provides a better fit to low- temperature multipoles. However, such a running also suppresses power on small scales, so ξV2\hbox{$\xi_V^2$} cannot be too large.

thumbnail Fig. 17

Primordial spectra (scalar and tensor) of the best-fit V(φ) model with n = 4, for the Planck TT, TE, EE+lowP data set, compared to the primordial spectrum (scalar only) of the best-fit base ΛCDM model. The best-fit potential is initially very steep, as can be seen in Fig. 15 (note the typical shape of the green curves). The transition from “marginal slow roll” (ϵV(φ) between 0.01 and 1) to “full slow roll” (ϵV(φ) of order 0.01 or smaller) is responsible for the suppression of the large-scale scalar spectrum.

thumbnail Fig. 18

Temperature and polarization spectra (total, scalar contribution, tensor contribution) of the best-fit V(φ) model with n = 4, for the Planck TT, TE, EE+lowP data set, compared to the spectra (scalar contribution only) of the best-fit base model. We also show the Planck low- temperature data, which is driving the small differences between the two best-fit models.

The n = 4 case possesses a new feature. The potential has more freedom to generate complicated shapes which would roughly correspond to a running of the running of the tilt (as studied in Sect. 4). The best-fit models now have Vφ and Vφφφ of opposite sign, and a large positive Vφφφφ. The preferred combination of these parameters allows for even more suppression of power on large scales, while leaving small scales nearly unchanged. This can be seen clearly from the shape of the scalar primordial spectrum corresponding to the best-fit models, for both data sets Planck TT+lowP+BAO and Planck TT, TE, EE+lowP. These two best-fit models are very similar, but in Fig. 17 we show the one for Planck TT, TE, EE+lowP, for which the trend is even more pronounced. Interestingly, the preferred models are such that power on large scales is suppressed in the scalar spectrum and balanced by a small tensor contribution, of roughly r0.002 ~ 0.05. This particular combination gives the best fit to the low- data, shown in Fig. 18, while leaving the high- temperature spectrum identical to the best fit base ΛCDM model. Inflation produces such primordial perturbations with the family of green potentials displayed in Fig. 15. At the beginning of the observable range, the potential is very steep [ϵV(φ) decreases from O(1) to O(10-2)], and produces a low amplitude of curvature perturbations (allowing a rather large tensor contribution, up to r0.002 ~ 0.3). Then there is a transition towards a second region with a much smaller slope, leading to a nearly power-law curvature spectrum with the usual tilt value ns ≈ 0.96. In Fig. 15, one can check that the height of the n = 4 potentials varies in a definite range, while the n = 2 and 3 potentials can have arbitrarily small amplitude at the pivot scale, reflecting the posteriors on ϵV or r.

However, the improvement in χ2 between the base ΛCDM and n = 4 models is only 2.2 (for Planck TT+lowP+BAO) or 4.3 (for Planck TT, TE, EE+lowP). This is marginal and offers no statistically significant evidence in favour of these complicated models. This conclusion is also supported by the calculation of the Bayesian evidence ratios, shown in the last line of Table 8 (under the assumption of flat priors in the range [−1, 1] for ξV2\hbox{$\xi_V^2$} and ϖV3\hbox{$\varpi_V^3$}): the evidence decreases each time that a new free parameter is added to the potential. At the 95% CL, r0.002 is still compatible with zero, and so are the higher order PSR parameters ξV2\hbox{$\xi_V^2$} and ϖV3\hbox{$\varpi_V^3$}. More freedom in the inflaton potential allows fitting the data better, but under the assumption of a smooth potential in the observable range, a simple quadratic form provides the best explanation of the Planck observations.

With the Planck TT+lowP+BAO and TT, TE, EE+lowP data sets, models with a large running or running of the running can be compatible with an unusually large value of the optical depth, as can be seen in Table 8. Including lensing information helps to break the degeneracy between the optical depth and the primordial amplitude of scalar perturbations. Hence the Planck lensing data could be used to strengthen the conclusions of this section.

Since in the n = 4 model, slow roll is marginally satisfied at the beginning of observable inflation, the reconstruction is very sensitive to the condition that there is an attractor solution at that time. Hence this case can in principle be investigated in a more conservative way using the H(φ) reconstruction method of the next section.

7.3. Reconstruction of a smooth Hubble function

In this section, we assume that the shape of the function H(φ) is well captured within the observable window by a polynomial of order n (corresponding to a polynomial inflaton potential of order 2n): H(φ)=i=0nHiφii!·\begin{equation} H(\phi) = \sum_{i=0}^n H_i \frac{\phi^i}{i!}\cdot \end{equation}(62)We vary n between 2 and 4. To avoid parameter degeneracies, as in the previous section we assume flat priors not on the Taylor coefficient Hi, but on the Hubble slow-roll (HSR) parameters, which are related according to ϵH=2Mpl2(H1H0)2,ηH=2Mpl2H2H0,ξH2=(2Mpl2)2H1H3H02,ϖH3=(2Mpl2)3H12H4H03.\begin{eqnarray} \epsilon_{\rm H} = 2 M_\mathrm{pl}^2 \left(\frac{H_1}{H_0} \right)^2, \eta_{\rm H} &=& 2 M_\mathrm{pl}^2 \frac{H_2}{H_0}, \\ \xi_{\rm H}^2 = \left(2 M_\mathrm{pl}^2\right)^2 \frac{H_1 H_3}{H_0^2}, \varpi_{\rm H}^3 &=& \left(2 M_\mathrm{pl}^2\right)^3 \frac{H_1^2 H_4}{H_0^3}. \end{eqnarray}This is just a choice of prior. This analysis does not rely on the slow-roll approximation.

thumbnail Fig. 19

Posterior distributions for the first four Hubble slow-roll parameters, when H(φ) is Taylor expanded to nth order, using Planck TT+lowP+BAO (filled contours) or TT, TE, EE+lowP (dashed contours). The primordial spectra are computed beyond the slow-roll approximation.

Table 9

Numerical reconstruction of the Hubble slow-roll parameters beyond the slow-roll approximation, using Planck TT+lowP+BAO.

thumbnail Fig. 20

Same as Fig. 15, when the Taylor expansion to nth order is performed on H(φ) instead of V(φ), and the potential is inferred from Eq. (60).

Table 9 and Fig. 19 show our results for the reconstructed HSR parameters. Figure 20 shows a representative sample of potential shapes V(φφ) derived using Eq. (60), for a sample of models drawn randomly from the chains, for the three cases n = 2,3,4.

Most of the discussion of Sect. 7.2 also applies to this section, and so will not be repeated. Results for Planck TT+lowP+BAO and TT, TE, EE+lowP are still very similar. The n = 2 case still gives results close to ΛCDM+r, and the n = 3 case to ΛCDM+r+dns/ dlnk. The type of potential preferred in the n = 4 case is very similar to the n = 4 analysis of the previous section, for the reasons explained in Sect. 7.2. There are, however, small differences, because the range of parametric forms for the potential explored by the two analyses differ. In the H(φ) reconstruction, the underlying potentials V(φ) are not polynomials. In the first approximation, they are close to polynomials of order 2n, but with constraints between the various coefficients. The main two differences with respect to the results of Sect. 7.2 are as follows:

  • The reconstructed potential shapes for n = 4 at the beginning of the observable window differ. Figure 20 shows that even steeper potentials are allowed than for the V(φ) method, with an even greater excursion of the inflaton field between Hubble crossing for the largest observable wavelengths and the pivot scale. This is because the H(φ) reconstruction does not rely on attractor solutions and automatically explores all valid potentials regardless of their initial field velocity.

  • The best-fit models are different, since they do not explore the same parametric families of potentials. In particular, for n = 4, the best-fit models have a negligible tensor contribution, but the distributions still have thick tails towards large tensor-to-scalar ratios, so that the upper bound on r0.002 is as high as in the previous n = 4 models, r0.002< 0.32.

Note that ϖH3\hbox{$\varpi_{\rm H}^3$} can be significantly larger than unity for n = 4. This does not imply violation of slow roll within the observable range. By assumption, for all accepted models, ϵH must remain smaller than unity over that range. In fact, for most of the green potentials visible in Fig. 20, we checked that ϵH either has a maximum very close to unity near the beginning of the observable range or starts from unity. So the best-fit models (maximizing the power suppression at low multipoles) correspond either to inflation of short duration, or to models nearly violating slow roll just before the observable window. However, such peculiar models are not necessary for a good fit. Table 9 shows that the improvement in χ2 as n increases is negligible.

In summary, this section further establishes the robustness of our potential reconstruction and two main conclusions. Firstly, under the assumption that the inflaton potential is smooth over the observable range, we showed that the simplest parametric forms (involving only three free parameters including the amplitude V(φ), no deviation from slow roll, and nearly power law primordial spectra) are sufficient to explain the data. No high-order derivatives or deviations from slow roll are required. Secondly, if one allows more freedom in the potential – typically, two more parameters – it is easy to decrease the large-scale primordial spectrum amplitude with an initial stage of “marginal slow roll” along a steep branch of the potential followed by a transition to a less steep branch. This type of model can accommodate a large tensor-to-scalar ratio, as high as r0.002 ≈ 0.3.

8. \hbox{$\mathcal{P}({\textit{k}})$} reconstruction

In PCI13 (Sect. 7) we presented the results of a penalized likelihood reconstruction, seeking to detect any possible deviations from a homogeneous power-law form (i.e., \hbox{${\cal P}_{\cal R}(k)\propto k^{n_\mathrm{s}-1}$}) for the primordial power spectrum (PPS) for various values of a smoothing parameter, λ. (For an extensive set of references to the prior literature concerning the methodology for reconstructing the power spectrum, see PCI13.) In the initial March 2013 preprint version of that paper, we reported evidence for a feature at moderate statistical significance around k ≈ 0.15 Mpc-1. However, in the November 2013 revision we retracted this finding, because subsequent tests indicated that the feature was no longer statistically significant when more aggressive cuts were made to exclude sky survey rings where contamination from electromagnetic interference from the 4 K cooler was largest, as indicated in the November 2013 “Note Added.”

In this section we report on results using the 2015 CTT\hbox{$C^{\rm TT}_\ell$} likelihood (Sect. 8.1) using essentially the same methodology as described in PCI13. (See Gauthier & Bucher 2012, and references therein for more technical details.) This method is also extended to include the EE and TE likelihoods in Sect. 8.1.2. As part of this 2015 release, we include the results of two other methods (see Sects. 8.2 and 8.3) to search for features. We find that all three methods yield broadly consistent reconstructions and reach the following main conclusion: there is no statistically significant evidence for any features departing from a simple power-law (i.e., \hbox{$\mathcal{P}_{\cal R}(k)\propto k^{n_\mathrm{s}-1}$}) PPS. Given the substantial differences between these methods, it is satisfying to observe this convergence.

8.1. Method I: penalized likelihood

8.1.1. Update with 2015 temperature likelihood

We repeated the same maximum likelihood analysis used to reconstruct the PPS in PCI13 using the updated Planck TT+lowP likelihood. Since we are interested in deviations from the nearly scale-invariant model currently favoured by the parametric approach, we replaced the true PPS \hbox{$\PR(k)$} by a fiducial power-law spectrum 𝒫(0)(k)=As(k/k)ns1\hbox{$\PR^{(0)}(k) = A_{\mathrm s} (k/k_{\ast})^{n_\mathrm{s}-1}$}, modulated by a small deviation function f(k): 𝒫(k)=𝒫(0)(k)exp[f(k)].\begin{eqnarray} \PR(k) = \PR^{(0)}(k) \exp \left[ f(k) \right] . \end{eqnarray}(65)The deviation function f(k)7 was represented by B-spline basis functions parameterized by nknot control points f={fi}i=1nknot\hbox{$\f = \{f_{i}\}_{i=1}^{n_\mathrm{knot}}$}, which are the values of f(k) along a grid of knot points κi = lnki.

Naively maximizing the Planck TT+lowP likelihood with respect to f results in over-fitting to cosmic variance and noise in the data. Furthermore, due to the limited range of scales over which Planck measures the anisotropy power spectrum, the likelihood is very weakly dependent on f(k) at extremely small and large scales. To address these issues, the following two penalty functions were added to the Planck likelihood: fTR(λ,α)fλdκ(2f(κ)κ2)2+ακmindκf2(κ)+ακmax+dκf2(κ).\begin{eqnarray} \begin{aligned} & \vec{f}^\mathrm{T} \vec{R}(\lambda,\alpha) \vec{f} \equiv \lambda \int \mathrm{d}\kappa~ \left( \frac{ \partial^2f(\kappa) }{ \partial \kappa^2 } \right) ^2 \\ &\hspace*{2cm} + \alpha \int _{-\infty }^{\kappa _{\mathrm{min}}} \mathrm{d}\kappa~f^2(\kappa) + \alpha \int ^{+\infty }_{\kappa _{\mathrm{max}}} \mathrm{d}\kappa~f^2(\kappa). \label{Priors} \end{aligned} \end{eqnarray}(66)The first term on the right-hand side of Eq. (66) is a roughness penalty, which disfavours f(κ) that “wiggle” too much. The last two terms drive f(κ) to zero for scales below κmin and above κmax. The values of λ and α represent the strengths of the respective penalties. The exact value of α is unimportant as long as it is large enough to drive f(κ) close to zero on scales outside [ κmin,κmax ]. However, the magnitude of the roughness penalty λ controls the smoothness of the reconstruction.

Since the anisotropy spectrum depends linearly on the PPS, the Newton-Raphson method is well suited to optimizing with respect to f. However, a maximum likelihood analysis also has to take into account the cosmological parameters, Θ ≡ { H0bh2ch2 }8. These additional parameters are not easy to include in the Newton-Raphson method since it is difficult to evaluate the derivatives C/Θ, 2C/Θ2, etc., to the accuracy required by the method. Therefore a non-derivative method, such as the downhill simplex algorithm, is best suited to optimization over these parameters. Unfortunately the downhill simplex method is inefficient given the large number of control points in our parameter space. Since each method has its drawbacks, we combined the two methods to draw on their respective strengths. We define the function as (Θ)=minfi[1,1]{2ln(Θ,f)+fTR(λ,α)f}.\begin{equation} \mathcal{M}(\boldsymbol \Theta) =\underset{f_{i} \in [-1,1]}{\textrm{min}} \left\{-2 \ln \mathcal{L}(\boldsymbol \Theta,\vec{f}) + \vec{f}^{T} \vec{R}(\lambda,\alpha) \vec{f}\right\}. \end{equation}(67)Given a set of non-PPS cosmological parameters Θ, is the value of the penalized log likelihood, minimized with respect to f using the Newton-Raphson method. The function is in turn minimized with respect to Θ using the downhill simplex method. In contrast to the analysis done in PCI13, the Planck low- likelihood has been modified so that it can be included in the Newton-Raphson minimization. Thus the reconstructions presented here extend to larger scales than were considered in 2013.

thumbnail Fig. 21

Planck TT+lowP likelihood primordial power spectrum (PPS) reconstruction results. Top four panels: reconstruction of the deviation f(k) using four different roughness penalties. The red curves represent the best-fit deviation f(k) using the Planck TT+lowP likelihood. f(k) = 0 would represent a perfectly featureless spectrum with respect to the fiducial PPS model, which is obtained from the best-fit base ΛCDM model with a power-law PPS. The vertical extent of the dark and light green error bars indicates the ± 1σ and ± 2σ errors, respectively. The width of the error bars represents the minimum reconstructible width (the minimum width for a Gaussian feature so that the mean square deviation of the reconstruction is less than 10%). The grey regions indicate where the minimum reconstructible width is undefined, indicating that the reconstruction in these regions is untrustworthy. The hatched region in the λ = 106 plot shows where the fixing penalty has been applied. These hashed regions are not visible in the other three reconstructions, for which κmin lies outside the range shown in the plots. For all values of the roughness penalty, all data points are within 2σ of the fiducial PPS except for the deviations around k ≈ 0.002 Mpc-1 in the λ = 103 and λ = 104 reconstructions. Lower three panels: ± 1σ error bars of the three non-PPS cosmological parameters included in the maximum likelihood reconstruction. All values are consistent with their respective best-fit fiducial model values indicated by the dashed lines.

thumbnail Fig. 22

Planck TT, TE, EE+lowP likelihood primordial power spectrum reconstruction results. Top four panels: reconstruction of the deviation f(k) using four different roughness penalties. As in Fig. 21, the red curves represent the best-fit deviation f(k) and the height and width of the green error bars represent the error and minimum reconstructible width, respectively. For all values of the roughness penalty, the deviations are consistent with a featureless spectrum. Lower three panels: ± 1σ error bars of the three non-PPS cosmological parameters included in the maximum likelihood reconstruction. All values are consistent with their respective best-fit fiducial model values (indicated by the dashed lines).

Figure 21 shows the best-fit PPS reconstruction using the Planck TT+lowP likelihood. The penalties in Eq. (66) introduce a bias in the reconstruction by smoothing and otherwise deforming potential features in the power spectrum. To assess this bias, we define the “minimum reconstructible width” (MRW) to be the minimum width of a Gaussian feature needed so that the integrated squared difference between the feature and its reconstruction is less than 1% of the integrated square of the input Gaussian, which is equivalent to 10% rms. Due to the combination of the roughness and fixing penalties, it is impossible to satisfy the MRW criterion too close to κmin and κmax. Wherever the MRW is undefined, the reconstruction is substantially biased and therefore suspect. An MRW cannot be defined too close to the endpoints κmin and κmax for two reasons: (1) lack of data; and (2) if a feature is too close to where the fixing penalty has been applied, the fixing penalty distorts the reconstruction. Consequently a larger roughness penalty decreases the range over which an MRW is well defined. The grey shaded areas in Fig. 21 show where the MRW is undefined and thus the reconstruction cannot be trusted. The cutoffs κmin and κmax have been chosen to maximize the range over which an MRW is defined for a given value of λ. The 1σ and 2σ error bars in Fig. 21 are estimated using the Hessian of the log-likelihood evaluated at the best-fit PPS reconstruction. More detailed MC investigations suggest that the nonlinear corrections to these error bars are small.

For the λ = 105 and 106 cases of the TT reconstruction, no deviation exceeds 2σ, so we do not comment on the probability of obtaining a worse fit. For the other cases, we use the maximum of the deviation, expressed in σ, of the plotted points as a metric of the quality of fit. Then using Monte Carlo simulations, we compute the p-value, or the probability to obtain a worse fit, according to this metric. For λ = 103 and 104, we obtain p-values of 0.304 and 0.142, respectively, corresponding to 1.03 and 1.47σ. We thus conclude that the observed deviations are not statistically significant.

8.1.2. Penalized likelihood results with polarization

In Fig. 22 the best-fit reconstruction of the PPS from the Planck TT, TE, EE+lowP likelihood is shown. We observe that the reconstruction including polarization broadly agrees with the reconstruction obtained using temperature only. For the Planck TT, TE, EE+lowP likelihood, we obtain for λ = 103, 104, and 105 the p-values 0.166,0.107, and 0.045, respectively, corresponding to 1.38,1.61, and 2.00σ, and likewise conclude the absence of any statistically significant evidence for deviations from a simple power-law scalar primordial power spectrum.

8.2. Method II: Bayesian model comparison

In this section we model the PPS \hbox{$\PR(k)$} using a nested family of models where \hbox{$\PR(k)$} is piecewise linear in the \hbox{$\ln (\mathcal{P})$}-ln(k) plane between a number of knots, Nknots, that is allowed to vary. The question arises as to how many knots one should use, and we address this question using Bayesian model comparison. A family of priors is chosen where both the horizontal and vertical positions of the knots are allowed to vary. We examine the “Bayes factor” or “Bayesian evidence” as a function of Nknots to decide how many knots are statistically justified. A similar analysis has been performed by Vázquez et al. (2012) and Aslanyan et al. (2014). In addition, we marginalize over all possible numbers of knots to obtain an averaged reconstruction weighted according to the Bayesian evidence.

The generic prescription is illustrated in Fig. 23. Nknots knots \hbox{$\{(k_i,\Pknotj{i})\,$}: i = 1,...,Nknots } are placed in the \hbox{$(k,\PR)$} plane and the function \hbox{$\PR(k)$} is constructed by logarithmic interpolation (a linear interpolation in log -log  space) between adjacent points. Outside the horizontal range [ k1,kN ] the function is extrapolated using the outermost interval.

Within this framework, base ΛCDM arises when Nknots = 2 – in other words, when there are two boundary knots and no internal knots, and the parameters \hbox{$\Pknotj{1}$} and \hbox{$\Pknotj{2}$} (in place of As and ns) parameterize the simple power-law PPS. There are also, of course, the four standard cosmological parameters bh2, Ωch2, 100θMC, and τ), as well as the numerous foreground parameters associated with the Planck high- likelihood, all of which are unrelated to the PPS. This simplest model can be extended iteratively by successively inserting an additional internal knot, thus requiring with each iteration two more variables to parameterize the new knot position.

thumbnail Fig. 23

Linear spline reconstruction. The primordial power spectrum is reconstructed using Nknots interpolation points \hbox{$\{(k_i,\Pknotj{i})\,$}: i = 1,2,...,Nknots }. The end knots are fixed in k but allowed to vary in \hbox{${\Pknot}$}, whereas the internal knots can vary subject to the constraint that k1<k2< ··· <kNknots. The function \hbox{$\mathcal{P}_\mathcal{R}(k)$} is constructed within the range [ k1,kNknots ] by interpolating logarithmically between adjacent knots (i.e., linearly in log -log  space). Outside this range the function is extrapolated logarithmically. The function \hbox{$\mathcal{P}_\mathcal{R}(k;\{k_i,\Pknotj{i}\})$} thus has 2Nknots−2 parameters.

We run models for a variety of numbers of internal knots, Nint = Nknots−2, evaluating the evidence for Nint. Under the assumption that the prior is justified, the most likely, or preferred, model is the one with the highest evidence. Evidences are evaluated using the PolyChord sampler (Handley et al. 2015) in CAMB and CosmoMC. The use of PolyChord is essential, as the posteriors in this parameterization are often multi-modal. Also, the ordered log-uniform priors on the ki are easy to implement within the PolyChord framework. All runs were performed with 1000 live points, oversampling the semi-slow and fast parameters by a factor of 5 and 100, respectively.

Table 10

Prior for moveable knot positions.

Priors for the reconstruction parameters are detailed in Table 10. We report evidence ratios with respect to the base ΛCDM case. The cosmological priors remain the same for all models, and this part of the prior has almost no impact on the evidence ratios. The choice of prior on the reconstruction parameters \hbox{$\{\Pknotj{i}\}$} does affect the Bayes factor. CosmoMC, however, puts an implicit prior on all models by excluding parameter choices that render the internal computational approximations in CAMB invalid. The baseline prior for the vertical position of the knots includes all of the range allowed by CosmoMC, so slighly increasing this prior range will not affect the evidence ratios. If one were to reduce the prior widths significantly, the evidence ratios would be increased. The allowed horizontal range includes all k-scales accessible to Planck. Thus, altering this width would be unphysical.

After completion of an evidence calculation, PolyChord generates a representative set of samples of the posterior for each model, P(Θ) ≡ P(Θ | data,Nint). We may use this to calculate a marginalized probability distribution for the PPS: P(log𝒫|k,Nint)=δ(log𝒫log𝒫(k))P(Θ).\begin{equation} P(\log\PR|k,\Nint) = \int \delta\left(\log\PR - \log\PR(k;\Theta)\right)P(\Theta)\:{\rm d}\Theta. \label{eqn:margPR} \end{equation}(68)This expression encapsulates our knowledge of \hbox{$\PR$} at each value of k for a given number of knots. Plots of this PPS posterior are shown in Fig. 24 using Planck TT data.

If one considers the Bayesian evidence of each model, Fig. 25 shows that although no model is preferred over base ΛCDM, the case Nint = 1 is competitive. This model is analogous to the broken-power-law spectrum of Sect. 4.4, although the models differ significantly in terms of the priors used. In this case, the additional freedom of one knot allows a reconstruction of the suppression of power at low . Adding polarization data does not alter the evidences significantly, although Nint = 1 is strengthened. We also plot a Planck TT run, but with the reduced vertical priors 2.5<ln(1010𝒫i)<3.5\hbox{${2.5<\ln\left(10^{10}\mathcal{P}_i\right)<3.5}$}. As expected, this increases the evidence ratios, but does not alter the above conclusion.

For increasing numbers of internal knots, the Bayesian evidence monotonically decreases. Occam’s razor dictates, therefore, that these models should not be preferred, due to their higher complexity. However, there is an intriguing stable oscillatory feature, at 20 ≲ ≲ 50, that appears once there are enough knots to reconstruct it. This is a qualitative feature predicted by several inflationary models (discussed in Sect. 9), and a possible hint of new physics, although its statistical significance is not compelling.

A full Bayesian analysis marginalizes over all models weighted according to the normalized evidence ZNint, so that P(log𝒫|k)=NintP(log𝒫|k,Nint)ZNint,\begin{equation} P(\log\PR|k) = \sum\limits_{\Nint} P(\log\PR|k,\Nint) Z_{\Nint}, \label{eqn:margPR_full} \end{equation}(69)as indicated in Fig. 26. This reconstruction is sensitive to how model complexity is penalized in the prior distribution.

8.3. Method III: cubic spline reconstruction

In this section we investigate another reconstruction algorithm based on cubic splines in the ln(k)-\hbox{$\ln \mathcal{P}_{\cal R}$} plane, where (unlike for the approach of the previous subsection) the horizontal positions of the knots are uniformly spaced in ln(k) and fixed. A prior on the vertical positions (described in detail below) is chosen and the reconstructed power spectrum is calculated using CosmoMC for various numbers of knots. This method differs from the method in Sect. 8.1 in that the smoothness is controlled by the number of discrete knots rather than by a continuous parameter of a statistical model having a well-defined continuum limit. With respect to the Bayesian model comparison of Sect. 8.2, the assessment of model complexity differs because here the knots are not movable.

thumbnail Fig. 24

Bayesian movable knot reconstructions of the primordial power spectrum \hbox{$\PR(k)$} using Planck TT data. The plots indicate our knowledge of the PPS \hbox{$P(\PR(k)|k,\Nint)$} for a given number of knots. The number of internal knots Nint increases (left to right and top to bottom) from 0 to 8. For each k-slice, equal colours have equal probabilities. The colour scale is chosen so that darker regions correspond to lower-σ confidence intervals. 1σ and 2σ confidence intervals are also indicated (black curves). The upper horizontal axes give the approximate corresponding multipoles via kDrec, where Drec is the comoving distance to recombination.

Let the horizontal positions of the n knots be given by kb, where b = 1,...,n, spaced so that kb + 1/kb is independent of b. We single out a “pivot knot” b = p, so that kp = k = 0.05 Mpc-1, which is the standard scalar power spectrum pivot scale. For a given number of knots n we choose k1 and kn so that the interval of relevant cosmological scales, taken to extend from 10-4 Mpc-1 to O(1) Mpc-1, is included. We now define the prior on the vertical knot coordinates. For the pivot point, we define \hbox{$\ln A_\mathrm{s} = \ln \mathcal{P}_{\cal R} (k_*)$}, where lnAs has a uniformly distributed prior, and for the other points with bp, we define the derived variable qbln(𝒫(kb)𝒫,fid(kb)),\begin{equation} q_b \equiv \ln \left( \frac{\mathcal{P}_{\cal R} (k_b)}{\mathcal{P}_{\mathrm{{\cal R},fid}}(k_b)} \right), \end{equation}(70)where \hbox{$\mathcal{P}_{{\cal R},\mathrm{fid}}(k) \equiv A_\mathrm{s}(k/k_*)^{n_{\mathrm{s}, \mathrm{fid}}-1}.$} Here the spectral index ns,fid is fixed. A uniform prior is imposed on each variable qb(bp) and the constraint −1 ≤ qb ≤ 1 is also imposed to force the reconstruction to behave reasonably near the endpoints, where it is hardly constrained by the data. The quantity \hbox{$\ln \mathcal{P}_{\cal R} (k)$} is interpolated between the knots using cubic splines with natural boundary conditions (i.e., the second derivatives vanish at the first and the last knots). Outside [ k1,kn ] we set \hbox{$\mathcal{P}_{\cal R}(k) = {\rm e}^{q_1} \mathcal{P}_{{\cal R},\mathrm{fid}}(k)$} (for k<k1) and \hbox{$\mathcal{P}_{\cal R}(k) = {\rm e}^{q_n} \mathcal{P}_{{\cal R},\mathrm{fid}}(k)$} (for k>kn). For most knots, we found that the upper and lower bounds of the qb prior hardly affect the reconstruction, since the data sharpen the allowed range significantly. However, for super-Hubble scales (i.e., k ≲ 10-4 Mpc-1) and very small scales (i.e., k ≳ 0.2 Mpc-1), which are only weakly constrained by the cosmological data, the prior dominates the reconstruction. For the results here, a fiducial spectral index ns,fid = 0.967 for \hbox{$\mathcal{P}_{{\cal R} , \mathrm{fid}}$} was chosen, which is close to the estimate from Planck TT+lowP+BAO. A different choice of ns,fid leads to a trivial linear shift in the qb.

thumbnail Fig. 25

Bayes factor (relative to the base ΛCDM model) as a function of the number of knots for three separate runs. Solid line: Planck TT. Dashed line: Planck TT, TE, EE. Dotted line: Planck TT, with priors on the \hbox{$\mathcal{P}_i$} parameters reduced in width by a factor of 2 (\hbox{$2.5<\ln(10^{10}\mathcal{P}_i)<3.5$}).

The possible presence of tensor modes (see Sect. 5) has the potential to bias and introduce additional uncertainty in the reconstruction of the primordial scalar power spectrum as parameterized above. Obviously, in the absence of a detection of tensors at high statistical significance, it is not sensible to model a possible tensor contribution with more than a few degrees of freedom. A complicated model would lead to prior dominated results. We therefore use the power law parameterization, \hbox{$\mathcal{P}_\mathrm{t}(k) = r A_\mathrm{s}(k/k_*)^{n_\mathrm{t}}$}, where the consistency relation nt = −r/ 8 is enforced as a constraint.

thumbnail Fig. 26

Bayesian reconstruction of the primordial power spectrum averaged over different values of Nint (as shown in Fig. 24), weighted according to the Bayesian evidence. The region 30 << 2300 is highly constrained, but the resolution is lacking to say anything precise about higher . At lower , cosmic variance reduces our knowledge of \hbox{$\PR(k).$} The weights assigned to the lower Nint models outweigh those of the higher models, so no oscillatory features are visible here.

thumbnail Fig. 27

Reconstructed power spectra applied to the Planck 2015 data using 12 knots (with positions marked as Δ at the bottom of each panel) with cubic spline interpolation. Mean spectra as well as sample trajectories are shown for scalars and tensors, and ± 1σ and ± 2σ limits are shown for the scalars. The fiducial tensor spectrum corresponds, arbitrarily, to r = 0.13. Top: uniform prior, 0 ≤ r ≤ 1. Middle: fixed, r = 0.1. Bottom: fixed, r = 0.01. Data sets: Planck TT+lowP+BAO+SN+HST+zre > 6 prior. Drec is the comoving distance to recombination.

Primordial tensor fluctuations contribute to CMB temperature and polarization angular power spectra, in particular at spatial scales larger than the recombination Hubble length, k ≲ (aH)rec ≈ 0.005 Mpc-1. If a large number of knots in \hbox{$\ln \mathcal{P}_{\cal R}(k)$} is included over that range, then a modified \hbox{$\mathcal{P}_{\cal R}$} can mimic a tensor contribution, leading to a near-degeneracy. This can lead to large uncertainty in the tensor amplitude, r. Once r is measured or tightly constrained in B-mode experiments, this near degeneracy will be broken. As examples here, we do allow r to float, but also show what happens when r is constrained to take the values r = 0.1 and r = 0.01 in the reconstruction.

Figure 27 shows the reconstruction obtained using the 2015 Planck TT+lowP likelihood, BAO, SNIa, HST, and a zre > 6 prior. Including these ancillary likelihoods improves the constraint on the PPS by helping to fix the cosmological parameters (e.g., H0, τ, and the late-time expansion history), which in this context may be regarded as nuisance parameters. These results were obtained by modifying CosmoMC to incorporate the n-knot parameterization of the PPS. Here 12 knots were used and the mean reconstruction as well as the 1σ and 2σ limits are shown. Some 1σ sample trajectories (dashed curves) are also shown to illustrate the degree of correlation or smoothing of the reconstruction. The tensor trajectories are also shown, but, as explained above, have been constrained to be straight lines. In the top panel r is allowed to freely float, and a wide range of r is allowed because of the near-degeneracy with the low-k scalar power. Two illustrative values of fixed r (i.e., r = 0.1 and r = 0.01) are also shown to give an idea of how much the reconstruction is sensitive to variations in r within the range of presently plausible values.

thumbnail Fig. 28

Reconstructed 12-knot power spectra with polarization included. Data sets in common: lowT+lowP+BAO+SN+HST+zre > 6 prior.

The reconstructions using the 2013 Planck likelihood in place of the 2015 likelihood are broadly consistent with the reconstruction shown in Fig. 27. To demonstrate robustness with respect to the interpolation scheme we tried using linear interpolation instead of cubic splines and found that the reconstruction was consistent provided enough knots (i.e., nknot ≈ 14) were used. At intermediate k the reconstruction is consistent with a simple power law, corresponding to a straight line in Fig. 27. We observe that once k drops, so that the effective multipole being probed is below about 60, deviations from a power law appear, but the dispersion in allowed trajectories also rises as a consequence of cosmic variance. The power deficit at k ≈ 0.002 Mpc-1 (i.e., kkDrec ≈ 30, where Drec is the comoving distance to recombination) is largely driven by the power spectrum anomaly in the ≈ 20−30 range that has been evident since the early spectra from WMAP (Bennett et al. 2011), and verified by Planck.

thumbnail Fig. 29

Reconstructed 𝒟TT\hbox{$\mathcal{D}_\ell^{\rm TT}$} power spectra with the base ΛCDM best fit subtracted. The mean spectra shown are for the floating r and the two fixed r cases with 12 cubic spline knots. These should be contrasted with the running best-fit mean (green) and the similar looking uniform ns case in which τ has been lowered from its best-fit base ΛCDM value to 0.04. Data points are the Planck 2015 Commander (< 30) and Plik ( ≥ 30) temperature power spectrum.

We also explore the impact of including the Planck polarization likelihood in the reconstruction. Figure 28 shows the reconstructed power spectra using various combinations of the polarization and temperature data. The < 30 treatments are the same in all cases, so this is mainly a test of the higher k region. What is seen is that, except at high k, the EE polarization data also enforce a nearly uniform ns, consistent with that from TT, over a broad k-range. When TE is used alone, or TE and EE are used in combination, the result is also very similar. The upper right panel shows the constraints from all three spectra together, and the errors on the reconstruction are now better than those from TT alone.

It is interesting to examine how the TT power spectrum obtained using the above reconstructions compares to the CMB data, in particular around the range ≈ 2030, corresponding roughly to k4 ≈ 1.5 × 10-3 Mpc-1. In Fig. 29 the differences in 𝒟TT\hbox{${\cal D}_\ell ^{\rm TT}$} from the best-fit simple power-law model are plotted for various assumptions concerning r. We see that a better fit than the power-law model can apparently be obtained around ≈ 20−30. We quantify this improvement below.

Due to the degeneracy of scalar and tensor contributions to 𝒟TT\hbox{${\cal D}_{\ell}^{TT}$}, the significance of the low- anomaly depends on the tensor prior and whether polarization data are used. For k < 10-3 Mpc-1, once more degeneracy appears: the shape of 𝒟TT\hbox{${\cal D}^{TT}_\ell $} also depends on the reionization optical depth, τ. In Fig. 29 we also show the effect of replacing the best-fit τ for tilted base ΛCDM with a low value, while keeping Ase− 2τ unchanged. A low τ bends 𝒟TT\hbox{${\cal D}_\ell^{\rm TT}$} downward at ≲ 10. For the 12-knot (or similar) runs, if τ is allowed to run into the (nonphysically) small values τ ≲ 0.04, a slight rise in \hbox{$\mathcal{P}_{\cal R}(k)$} at k ≈ 3 × 10-4 Mpc-1 is preferred to compensate the low-τ effect. This degeneracy can be broken to a certain extent using low-redshift data: zre > 6 from quasar observations (Becker et al. 2001), BAO (SDSS), Supernova (JLA), and HST.

It is evident that allowing ns to run is not what the 𝒟TT\hbox{${\cal D}_\ell^{TT}$} data prefer. The best-fit running is also shown in Fig. 29. The k-space \hbox{$\mathcal{P}_{\cal R}(k)$}-response in Fig. 27 shows that running does not capture the shape of the low- residuals.

Table 11

Reduced χ2 and p-values for low-k knots (5 knots) and high-k knots (6 knots, pivot knot excluded), with the null hypothesis being the best-fit power-law spectrum.

We have shown that the cubic spline reconstruction studied in this section consistently produces a dip in q4, corresponding to k ≈ 1.5 × 10-3 Mpc-1. We now turn to the question of whether this result is real or simply the result of cosmic variance. To assess the statistical significance of the departures of the mean reconstruction from a simple power law, we calculate the low-k and high-k reduced χ2 for the five qb values for scales below and six qb values (bp) for scales above 50 /Drec, respectively, indicating the corresponding p-values (i.e., probability to exceed), for various data combinations, in Table 11. The high-k fit is better than expected for reasons that we do not understand, but we attribute this situation to chance. The low-k region shows a poor fit, but in no case does the p-value fall below 10%. Therefore, even though the low-k dip is robust against the various choices made for the reconstruction, we conclude that it is not statistically significant. The plot for the knot position of the dip (corresponding to q4) in Fig. 30 does not contradict this conclusion.

thumbnail Fig. 30

The degeneracy between τ and the knot variables q3 and q4 in the 12-knot case shown in Fig. 27.

thumbnail Fig. 31

Slow-roll parameter ϵ for reconstructed trajectories using 12 knots (marked as Δ at the bottom of the figure) with cubic spline interpolation. The mean values are shown for floating r and r fixed to be 0.1 and 0.01. Sample 1σ trajectories shown for the floating r case show wide variability, which is significantly diminished if r is fixed to r = 0.1, as shown.

thumbnail Fig. 32

Reconstructed single-field inflaton potentials from the cubic spline power spectra mode expansion using 12 knots.

Because of the r degeneracy associated with the scalar power, it is best when quoting statistics to use the fixed r cases, although for completeness we show the floating r case as well. There is also a smaller effect associated with the τ degeneracy, and the values quoted have restricted the redshift of reionization to exceed 6. The value zre = 6.5 was used in Planck Collaboration XIII (2016). The significance of the low-k anomaly is meaningful only if an explicit r prior and low-redshift constraint on τ have been applied.

Finally, we relate the reconstructed \hbox{${\cal P}_{\cal R}(k)$} calculated above to the trajectories of the slow-roll parameter ϵ = −/H2 | k = aH plotted as a function of k (see Fig. 31). We also plot in Fig. 32 the reconstructed inflationary potential in the region over which the inflationary potential is constrained by the data. Here canonical single-field inflation is assumed, and the value of r enters solely to fix the height of the potential at the pivot scale. This is not entirely self-consistent, but justified by the lack of constraining power on the tensors at present.

8.4. Power spectrum reconstruction summary

The three non-parametric methods for reconstructing the primordial power spectrum explored here support the following two conclusions:

  • 1.

    Except possibly at low k, over the range of k where the CMB data best constrain the form of the primordial power spectrum, none of the three methods finds any statistically significant evidence for deviations from a simple power-law form. The fluctuations seen in this regime are entirely consistent with the expectations from cosmic variance and noise.

  • 2.

    At low k, all three methods reconstruct a power deficit at k ≈ 1.52.0 × 10-3 Mpc-1, which can be linked to the dip in the TT angular power spectrum at ≈ 20−30. This agreement suggests that the reconstruction of this “anomaly” is not an artefact of any of the methods, but rather inherent in the CMB data themselves. However, the evidence for this feature is marginal since it is in a region of the spectrum where the fluctuations from cosmic variance are large.

  • 3.

    We have verified that the power deficit at = 20–30 is not substantially modified (a) by removing from the CMB pattern the hottest and coldest peaks selected by the Kolmogorov-Smirnov test studied in Sects. 4.5.3 and 4.5.4 of Planck Collaboration XVI (2016) or (b) by substituting the anomalously cold region around the Cold Spot with Gaussian constrained realizations.

9. Search for parameterized features

In this section, we explore the possibility of a radical departure from the near-scale-invariant power-law spectrum 𝒫0(k)=As(k/k)ns1\hbox{$\mathcal{P}_\mathcal{R}^0(k) = A_\mathrm{s} (k/k_*)^{n_\mathrm{s}-1}$} of the standard slow-roll scenario for a selection of theoretically motivated parameterizations of the spectrum (see Chluba et al. 2015 for a recent review).

9.1. Models

9.1.1. Step in the inflaton potential

A sudden, step-like feature in the inflaton potential (Adams et al. 2001) or the sound speed (Achúcarro et al. 2011) leads to a localized oscillatory burst in the scalar primordial power spectrum. A general parameterization describing both a tanh-step in the potential and in the warp term of a DBI model was proposed in Miranda & Hu (2014): ln𝒫s(k)=exp[ln𝒫0(k)+0(k)+ln(1+12(k))],\begin{equation} \label{eq:step1} \ln \mathcal{P}_\mathcal{R}^\mathrm{s}(k) = \exp \left[ \ln \mathcal{P}_\mathcal{R}^0(k) + \mathcal{I}_0(k) + \ln \left(1 + \mathcal{I}_1^2(k)\right) \right], \end{equation}(71)where the first- and second-order terms are given by 0=[𝒜s𝒲1(0)(k/ks)+𝒜2𝒲2(0)(k/ks)+𝒜3𝒲3(0)(k/ks)]𝒟(k/ksxs),1=12{π2(1ns)+[𝒜s𝒲1(1)(k/ks)+𝒜2𝒲2(1)(k/ks)+𝒜3𝒲3(1)(k/ks)]𝒟(k/ksxs)},\begin{eqnarray} \mathcal{I}_0 &=& \left[ \mathcal{A}_\mathrm{s} \, \mathcal{W}_1^{(0)}(k/k_\mathrm{s}) + \mathcal{A}_2 \, \mathcal{W}_2^{(0)}(k/k_\mathrm{s}) \right. \nonumber\\ &&\quad \left. + \; \mathcal{A}_3 \, \mathcal{W}_3^{(0)}(k/k_\mathrm{s}) \right] \, \mathcal{D}\left(\frac{k/k_\mathrm{s}}{x_\mathrm{s}} \right), \\ \mathcal{I}_1 &=& \frac{1}{\sqrt{2}} \left\{ \frac{\pi}{2} (1 - n_\mathrm{s}) + \left[ \mathcal{A}_\mathrm{s} \, \mathcal{W}_1^{(1)}(k/k_\mathrm{s}) \right. \vphantom{\mathcal{D}\left(\frac{k/k_\mathrm{s}}{x_\mathrm{s}} \right)} \right. \nonumber\\ && \quad \left. \left. + \; \mathcal{A}_2 \, \mathcal{W}_2^{(1)}(k/k_\mathrm{s}) + \mathcal{A}_3 \, \mathcal{W}_3^{(1)}(k/k_\mathrm{s}) \right]\, \mathcal{D}\left(\frac{k/k_\mathrm{s}}{x_\mathrm{s}} \right) \right\}, \end{eqnarray}with window functions 𝒲1(0)(x)=12x3[(18x6x3)cos2x+(15x29)sin2x],𝒲2(0)(x)=32x3[sin(2x)2xcos(2x)x2sin(2x)],𝒲3(0)(x)=1x3[6xcos(2x)+(4x23)sin(2x)],𝒲1(1)(x)=1x3{3(xcosxsinx)[3xcosx+(2x23)sinx]},𝒲2(1)(x)=3x3(sinxxcosx)2,𝒲3(1)(x)=1x3[3+2x2(34x2)cos(2x)6xsin(2x)],\begin{eqnarray} \mathcal{W}_1^{(0)}(x) &=& \frac{1}{2 x^3} \left[ \left(18 x - 6 x^3 \right) \cos 2x + \left(15 x^2 - 9 \right) \sin 2x \right],\\ \mathcal{W}_2^{(0)}(x) &=& \frac{3}{2 x^3} \left[ \sin (2x) - 2x \cos (2x) - x^2 \sin (2x) \right],\\ \mathcal{W}_3^{(0)}(x) &=& \frac{1}{x^3} \left[ 6x \cos (2x) + (4x^2 -3) \sin (2x) \right],\\ \mathcal{W}_1^{(1)}(x) &=& -\frac{1}{x^3} \left\{ 3(x \cos x - \sin x) \left[3 x \cos x + \left(2 x^2 -3\right) \sin x\right] \right\},\\ \mathcal{W}_2^{(1)}(x) &=& \frac{3}{x^3} \left(\sin x - x \cos x \right)^2\!,~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \\ \mathcal{W}_3^{(1)}(x) &=& -\frac{1}{x^3} \left[ 3 + 2x^2 - \left(3-4x^2\right) \cos (2x) - 6x \sin (2x) \right], \end{eqnarray}and damping function 𝒟(x)=xsinhx·\begin{equation} \label{eq:step2} \mathcal{D}(x) = \frac{x}{\sinh x}\cdot \end{equation}(80)Due to the high complexity of this model, we focus on the limiting case of a step in the potential (\hbox{$\mathcal{A}_2 = \mathcal{A}_3 = 0$}).

9.1.2. Logarithmic oscillations

Logarithmic modulations of the primordial power spectrum generically appear, for example, in models with non-Bunch-Davies initial conditions (Martin & Brandenberger 2001; Danielsson 2002; Bozza et al. 2003), or, approximately, in the axion monodromy model, explored in more detail in Sect. 10. We assume a constant modulation amplitude and use 𝒫log(k)=𝒫0(k){1+𝒜logcos[ωlogln(kk)+ϕlog]}.\begin{equation} \mathcal{P}_\mathcal{R}^\mathrm{log}(k) = \mathcal{P}_\mathcal{R}^0(k) \left\{ 1 + \mathcal{A}_\mathrm{log} \cos \left[ \omega_\mathrm{log} \ln \left(\frac{k}{k_*} \right) + \varphi_\mathrm{log} \right] \right\}. \end{equation}(81)

9.1.3. Linear oscillations

A modulation linear in k can be obtained, for example, in boundary effective field theory models (Jackson & Shiu 2013), and is typically accompanied by a scale-dependent modulation amplitude. We adopt the parameterization used in Meerburg & Spergel (2014), which allows for a strong scale dependence of the modulation amplitude: 𝒫lin(k)=𝒫0(k)[1+𝒜lin(kk)nlincos(ωlinkk+ϕlin)].\begin{equation} \mathcal{P}_\mathcal{R}^\mathrm{lin}(k) = \mathcal{P}_\mathcal{R}^0(k) \left[ 1 + \mathcal{A}_\mathrm{lin} \left(\frac{k}{k_*}\right)^{n_\mathrm{lin}} \cos \left(\omega_\mathrm{lin} \frac{k}{k_*} + \varphi_\mathrm{lin} \right) \right]. \end{equation}(82)

9.1.4. Cutoff model

If today’s largest observable scales exited the Hubble radius before the inflaton field reached the slow-roll attractor, the amplitude of the primordial power spectrum is typically strongly suppressed at low k. As an example of such a model, we consider a scenario in which slow roll is preceded by a stage of kinetic energy domination. The resulting power spectrum was derived by Contaldi et al. (2003) and can be expressed as ln𝒫c(k)=ln𝒫0(k)+ln(π16kkc|CcDc|2),\begin{equation} \ln \mathcal{P}_\mathcal{R}^\mathrm{c}(k) = \ln \mathcal{P}_\mathcal{R}^0(k) + \ln \left(\frac{\pi}{16} \, \frac{k}{k_\mathrm{c}} \left| C_\mathrm{c} - D_\mathrm{c} \right|^2 \right), \end{equation}(83)with Cc=exp(ikkc)[H0(2)(k2kc)(kck+i)H1(2)(k2kc)],Dc=exp(ikkc)[H0(2)(k2kc)(kcki)H1(2)(k2kc)],\begin{eqnarray} C_\mathrm{c} &=& \exp\left(\frac{-i k}{k_\mathrm{c}}\right) \left[H_0^{(2)}\left(\frac{k}{2 k_\mathrm{c}}\right) - \left(\frac{k_\mathrm{c}}{k} + i \right) H_1^{(2)}\left(\frac{k}{2 k_\mathrm{c}}\right) \right],\\ D_\mathrm{c} &=& \exp\left(\frac{i k}{k_\mathrm{c}}\right) \left[H_0^{(2)}\left(\frac{k}{2 k_\mathrm{c}}\right) - \left(\frac{k_\mathrm{c}}{k} - i \right) H_1^{(2)}\left(\frac{k}{2 k_\mathrm{c}}\right) \right], \end{eqnarray}where Hn(2)\hbox{$H_n^{(2)}$} denotes the Hankel function of the second kind. The power spectrum in this model is exponentially suppressed for wavenumbers smaller than the cutoff scale kc and converges to a standard power-law spectrum for kkc, with an oscillatory transition region for kkc.

Table 12

Parameters and prior ranges.

9.2. Analysis and results

Table 13

Improvement in fit and Bayes factors with respect to power-law base ΛCDM for Planck TT+lowP and Planck TT, TE, EE+lowP data, as well as approximate probability to exceed the observed Δχ2 (p-value), constructed from simulated Planck TT+lowP data.

Table 14

Best-fit features parameters and parameter constraints on the remaining cosmological parameters for the four features models for Planck TT+lowP data.

We use MultiNest to evaluate the Bayesian evidence for the models, establish parameter constraints, and roughly identify the global maximum likelihood region of parameter space. The features model best-fit parameters and lnℒ are then obtained with the help of the CosmoMC minimization algorithm taking narrow priors around the MultiNest best fit. We assign flat prior probabilities to the parameters of the features models with prior ranges listed in Table 12. Note that throughout this section for the sake of maximizing sensitivity to very sharp features, the unbinned (“bin1”) versions of the high- TT and TT, TE, EE likelihoods are used instead of the standard binned versions.

Since the features considered here can lead to broad distortions of the CMB angular power spectrum degenerate with the late time cosmological parameters (Miranda & Hu 2014), in all cases we simultaneously vary primordial parameters and all the ΛCDM parameters, but keep the foreground parameters fixed to their best-fit values for the power-law base ΛCDM model.

We present the Bayes factors with respect to the power-law base ΛCDM model and the improvement in the effective χ2 over the power-law model in Table 13. For our choice of priors, none of the features models is preferred over a power-law spectrum. The best-fit power spectra are plotted in Fig. 33. While the cutoff and step model best fits reproduce the large-scale suppression at ≈ 20−30 also obtained by direct power spectrum reconstruction in Sect. 8, the oscillation models prefer relatively high frequencies beyond the resolution of the reconstruction methods.

In addition to the four features models we also show in Fig. 33 the best fit of a model allowing for steps in both inflaton potential and warp (brown line). Note the strong resemblance to the reconstructed features of the previous section. The effective Δχ2 for this model is −12.1 (−11.5) for Planck TT+lowP (Planck TT, TE, EE+lowP) data at the cost of adding five new parameters, resulting in a ln-Bayes factor of −0.8 (−0.4). A similar phenomenology can be also be found for a model with a sudden change in the slope of the inflaton potential (Starobinsky 1992; Choe et al. 2004), which yields a best-fit Δχ2 = −4.5 (−4.9) for two extra parameters.

thumbnail Fig. 33

Best-fit power spectra for the power-law (black curve), step (green), logarithmic oscillation (blue), linear oscillation (orange), and cutoff (red) models using Planck TT+lowP data. The brown curve is the best fit for a model with a step in the warp and potential (Eqs. (71)(80)).

As shown in Table 14, constraints on the remaining cosmological parameters are not significantly affected when allowing for the presence of features.

For the cutoff and step models, the inclusion of Planck small-scale polarization data does not add much in terms of direct sensitivity. The best fits lie in the same parameter region as for Planck TT+lowP data, and the Δχ2 and Bayes factors are not subject to major changes. The two oscillation models’ Planck TT+lowP best fits, on the other hand, also predict a non-negligible signature in the polarization spectra at high . Therefore, if the features were real, one would expect an additional improvement in Δχ2 for Planck TT, TE, EE+lowP. This is not the case here. Though the linear oscillation model’s maximum Δχ2 does increase, the local Δχ2 in the Planck TT+lowP best-fit regions is in fact reduced for both models, and the global likelihood maxima occur at different frequencies (log 10ωlog  = 1.25 and log 10ωlin = 1.02) compared to their Planck TT+lowP counterparts.

In addition to the Bayesian model comparison analysis, we also approach the matter of the statistical relevance of the features models from a frequentist statistics perspective in order to give the Δχ2 numbers a quantitative interpretation. Assuming that the underlying \hbox{$\mathcal{P_R}(k)$} was actually a featureless power law, we can ask how large an improvement to lnℒ the different features models would yield on average just by overfitting scatter from cosmic variance and noise. For this purpose, we simulate Planck power spectrum data sets consisting of temperature and polarization up to = 29 and unbinned temperature for 30 ≤ ≤ 2508, taking as input fiducial spectra the power-law base ΛCDM model’s best-fit spectra.

For each of these simulated Planck data sets, we perform the following procedure: (i) find the power-law ΛCDM model’s best-fit parameters with CosmoMC’s minimization algorithm; (ii) fix the non-primordial parameters (ωb,ωc,θMC) to their respective best-fit values; (iii) using MultiNest, find the best fit of the features models;9 and (iv) extract the effective Δχ2 between power-law and features models.

The resulting distributions are shown in Fig. 34. Compared to the real data Δχ2 values from Table 13, they are biased towards lower values, since we do not vary the late-time cosmological parameters in the analysis of the simulated data. Nonetheless, the observed improvements in the fit do not appear to be extraordinarily large, with the respective (conservative) p-values ranging between 0.09 and 0.50.

These observations lead to the conclusion that even though some of the peculiarities seen in the residuals of the Planck data with respect to a power-law primordial spectrum may be explained in terms of primordial features, none of the simple model templates considered here is required by Planck data. The simplicity of the power-law spectrum continues to give it an edge over more complicated initial spectra and the most plausible explanation for the apparent features in the data remains that we are just observing fluctuations due to cosmic variance at large scales and noise at small scales.

thumbnail Fig. 34

Distribution of Δχ2 from 400 simulated Planck TT+lowP data sets.

10. Implications of Planck bispectral constraints on inflationary models

The combination of power spectrum constraints and primordial non-Gaussianity (NG) constraints, such as the Planck upper bound on the NG amplitude fNL (Planck Collaboration XVII 2016), can be exploited to limit extensions to the simplest standard single-field models of slow-roll inflation. The next subsection considers inflationary models with a non-standard kinetic term (Garriga & Mukhanov 1999), where the inflaton Lagrangian is a general function of the scalar inflaton field and its first derivative, i.e., L= P(φ,X), where X = −gμνμφνφ/ 2 (Garriga & Mukhanov 1999; Chen et al. 2007). Section 10.2 focuses on a specific example of a single-field model of inflation with more general higher-derivative operators, the so-called “Galileon inflation”. Section 10.3 presents constraints on axion monodromy inflation. See Planck Collaboration XVII (2016) for the analysis of other interesting non-standard inflationary models, including warm inflation (Berera 1995), whose fNL predictions can be constrained by Planck.

10.1. Inflation with a non-standard kinetic term

This class of models includes k-inflation (Armendáriz-Picón 1999; Garriga & Mukhanov 1999) and Dirac-Born-Infield (DBI) models introduced in the context of brane inflation (Silverstein & Tong 2004; Alishahiha et al. 2004; Chen 2005b,a). In these models inflation can take place despite a steep potential or may be driven by the kinetic term.

Moreover, one of the main predictions of inflationary models with a non-standard kinetic term is that the inflaton perturbations can propagate with a sound speed cs< 1. We show how the Planck combined measurement of the power spectrum and the nonlinearity parameter fNL (Planck Collaboration XVII 2016) improves constraints on this class of models by breaking degeneracies between the parameters determining the observable power spectra. Such degeneracies (see, e.g., Peiris et al. 2007; Powell et al. 2009; Lorenz et al. 2008; Agarwal & Bean 2009; Baumann et al. 2015) are evident from the expressions for the power spectra. We adopt the same notation as Planck Collaboration XXIV (2014). At leading order in the slow-roll parameters the scalar power spectrum depends additionally on the sound speed cs via (Garriga & Mukhanov 1999) As18π2Mpl2H2csϵ1,\begin{eqnarray} \label{sps} A_\mathrm{s} \approx \frac{1}{8 \pi^2 M^2_\mathrm{pl}} \frac{H^2}{c_\mathrm{s} \epsilon_1}, \end{eqnarray}(86)which is evaluated at kcs = aH. Correspondingly, the scalar spectral index ns1=2ϵ1ϵ2s\begin{equation} \label{nnskt} n_\mathrm{s}-1=-2 \epsilon_1-\epsilon_2-s\, \end{equation}(87)depends on an additional slow-roll parameter s=cs˙/(csH)\hbox{$s = \dot{c_\mathrm{s}}/(c_\mathrm{s} H)$}, which describes the running of the sound speed. The usual consistency relation holding for the standard single-field models of slow-roll inflation (r = −8nt) is modified to r ≈ −8ntcs, with nt = −2ϵ1 as usual (Garriga & Mukhanov 1999), potentially allowing models which otherwise would predict a large tensor-to-scalar ratio for the Klein-Gordon case (Unnikrishnan et al. 2012).10

At lowest order in the slow-roll parameters, there are strong degeneracies between the parameters (As,cs,ϵ1,ϵ2,s). This makes the constraints on these parameters from the power spectrum alone not very stringent, and for parameters like ϵ1 and ϵ2 less stringent compared with the standard case. However, combining the constraints on the power spectra observables with those on fNL can also result in a stringent test for this class of inflationary models. Models where the inflaton field has a non-standard kinetic term predict a high level of primordial NG of the scalar perturbations for cs ≪ 1, (see, e.g., Chen et al. 2007). Primordial NG is generated by the higher-derivative interaction terms arising from the expansion of the kinetic part of the Lagrangian, P(φ,X). There are two main contributions to the amplitude of the NG (i.e., to the nonlinearity parameter fNL), coming from the inflaton field interaction terms \hbox{$\dot{\delta \phi}\, (\nabla \delta \phi)^2$} and \hbox{$(\dot{\delta \phi})^3$} (Chen et al. 2007; Senatore et al. 2010). The NG from the first term scales as cs-2\hbox{$c_\mathrm{s}^{-2}$}, while the NG arising from the other term is determined by a second parameter, ˜c3\hbox{$\tilde{c}_3$} (following the notation of Senatore et al. 2010). Each of these two interactions produces bispectrum shapes similar to the so-called equilateral shape (Babich et al. 2004) for which the signal peaks for equilateral triangles with k1 = k2 = k3. (These two shapes are called, respectively, “EFT1” and “EFT2” in Planck Collaboration XVII 2016). However, the difference between the two shapes is such that the total signal is a linear combination of the two, leading to an “orthogonal” bispectral template (Senatore et al. 2010).

thumbnail Fig. 35

(ϵ1,ϵ2) 68% and 95% CL constraints for Planck data comparing the canonical Lagrangian case with cs = 1 to the case of varying cs with a uniform prior 0.024 <cs< 1 derived from the Planck NG measurements.

The equilateral and orthogonal NG amplitudes can be expressed in terms of the two “microscopic” parameters, cs and ˜c3\hbox{$\tilde{c}_3$} (for more details see Planck Collaboration XVII 2016), according to fNLequil=1cs2cs2[0.2750.0780cs2(2/3)×0.780˜c3],fNLortho=1cs2cs2[0.01590.0167cs2(2/3)×0.0167˜c3].\begin{eqnarray} \label{meanfNL} f^{\mathrm{equil}}_{\mathrm{NL}} &=&\frac{1-c_\mathrm{s}^2}{c_\mathrm{s}^2} \left [-0.275 - 0.0780 c_s^2 - (2/3) \times 0.780\, \tilde{c}_3 \right], \\ f^{\mathrm{ortho}}_{\mathrm{NL}} &=&\frac{1-c_\mathrm{s}^2}{c_\mathrm{s}^2} \left[ 0.0159 - 0.0167 c_s^2 - (2/3) \times 0.0167\, \tilde{c}_3\right] .~~~~~~~~~ \end{eqnarray}Thus the measurements of fNLequil\hbox{$f^{\mathrm{equil}}_{\mathrm{NL}}$} and fNLortho\hbox{$f^{\mathrm{ortho}}_{\mathrm{NL}}$} obtained in the companion paper (Planck Collaboration XVII 2016) provide a constraint on the sound speed, cs, of the inflaton field. Such constraints allow us to combine the NG information with the analyses of the power spectra, since the sound speed is the NG parameter also affecting the power spectra.

In this subsection we consider three cases. In the first case we perform a general analysis as described above (focusing on the simplest case of a constant sound speed, s = 0), improving on PCI13 and Planck Collaboration XXIV (2014) by exploiting the full mission temperature and polarization data. The Planck constraints on primordial NG in general single-field models of inflation provide the most stringent bound on the inflaton sound speed (Planck Collaboration XVII 2016):11cs0.024(95%CL).\begin{equation} c_{\mathrm{s}} \geq 0.024\quad (95\%~\mathrm{CL}). \end{equation}(91)We then use this information on cs as a uniform prior 0.024 ≤ cs ≤ 1 in Eq. (88) within the HFF formalism, as in PCI13. Figure 35 shows the joint constraints on ϵ1 and ϵ2.Planck TT+lowP yields ϵ1< 0.031 at 95% CL. No improvement in the upper bound on ϵ1 results when using Planck TT, TE, EE+lowP. This constraint improves the previous analysis in PCI13 and can be compared with the restricted case of cs = 1, also shown in Fig. 35, with ϵ1< 0.0068 at 95% CL. The limits on the sound speed from the constraints on primordial NG are crucial for deriving an upper limit on ϵ1, because the relation between the tensor-to-scalar ratio and ϵ1 also involves the sound speed (see, e.g., Eq. (88)). This breaks the degeneracy in the scalar spectral index.

The other two cases analysed involve DBI models. The degeneracy between the different slow-roll parameters can be broken for s = 0 or in the case where sϵ2. We first consider models defined by an action of the DBI form P(φ,X)=f(φ)-112f(φ)X+f(φ)-1V(φ),\begin{equation} P(\phi,X)=- f(\phi)^{-1} \sqrt{1-2f(\phi) X}+f(\phi)^{-1}-V(\phi), \end{equation}(92)where V(φ) is the potential and f(φ) describes the warp factor determined by the geometry of the extra dimensions. We follow an analogous procedure to exploit the NG limits derived in Planck Collaboration XVII (2016) on cs in the case of DBI models: cs ≥ 0.087 (at 95% CL). Assuming a uniform prior, 0.087 ≤ cs ≤ 1, and s = 0, Planck TT+lowP gives ϵ1< 0.024 at 95% CL, a 43% improvement with respect to PCI13. The addition of high- TE and EE does not improve the upper bound on ϵ1 for this DBI case.

Next we update the constraints on the particularly interesting case of infrared DBI models (Chen 2005b,a), where f(φ) ≈ λ/φ4. (For details, see Silverstein & Tong 2004; Alishahiha et al. 2004; Chen et al. 2007, and references therein.) In these models the inflaton field moves from the IR to the UV side with an inflaton potential V(φ)=V012βH2φ2.\begin{equation} V(\phi)=V_0-\frac{1}{2} \beta H^2 \phi^2. \end{equation}(93)From a theoretical point of view a wide range of values for β is allowed: 0.1 <β< 109 (Bean et al. 2008). PCI13 dramatically restricted the allowed parameter space of these models in the limit where stringy effects can be neglected and the usual field theory computation of the primordial curvature perturbation holds (see Chen 2005a,c; Bean et al. 2008 for more details). In this limit of the IR DBI model, one finds (Chen 2005c; Chen et al. 2007) cs ≈ (βN/ 3)-1, ns−1 = −4 /N, and dns/dlnk=4/N2.\hbox{${\rm d}n_\mathrm{s}/{\rm d}\!\ln k=-4/N_*^2.$} (In this model one can verify that s ≈ 1 /Nϵ2/ 3.) Combining the uniform prior on cs with Planck TT+lowP, we obtain β0.31(95%CL),\begin{equation} \beta \leq 0.31 \quad (95\%~\mathrm{CL}), \end{equation}(94)and a preference for a high number of e-folds: 78 <N< 157 at 95% CL.

We now constrain the general case of the IR DBI model, including the “stringy” regime, which occurs when the inflaton extends back in time towards the IR side (Bean et al. 2008). The stringy phase transition is characterized by an interesting phenomenology altering the predictions for cosmological perturbations. A parameterization of the power spectrum of curvature perturbations interpolating between the two regimes is (Bean et al. 2008; see also Ma et al. 2013) 𝒫(k)=As(NeDBI)4[11(1+x)2],\begin{equation} \mathcal{P}_{\cal R}(k)= \frac{A_\mathrm{s}}{\left(N_{\rm e}^{\mathrm{DBI}}\right)^4} \left[1-\frac{1}{(1+x)^2} \right], \label{eq:DBIps} \end{equation}(95)where As = 324π2/ (nBβ4) is the amplitude of the perturbations which depends on various microscopic parameters (nB is the number of branes at the B-throat; see Bean et al. 2008 for more details), while x=(NeDBI/Nc)8\hbox{$x=(N_{\rm e}^{\mathrm{DBI}}/N_{\rm c})^8$} sets the stringy phase transition taking place at the critical e-fold Nc. (Here NeDBI\hbox{$N_{\rm e}^{\mathrm{DBI}}$} is the number of e-folds to the end of IR DBI inflation.) The spectral index and its running are ns1=4NeDBIx2+3x2(x+1)(x+2),dnsdlnk=4(NeDBI)2x4+6x355x296x4(x+1)2(x+2)2·\begin{eqnarray} n_\mathrm{s}-1 &=& \frac{4}{N_{\rm e}^{\mathrm{DBI}}} \frac{x^{2}+3x-2}{(x+1)(x+2)}, \\ \frac{{\rm d} n_\mathrm{s}}{{\rm d}\!\ln k} &=&\frac{4}{{\left(N_{\rm e}^{\mathrm{DBI}}\right)^2}} \frac{x^4+6x^3-55x^{2}-96x-4}{(x+1)^{2}(x+2)^{2}} \cdot \end{eqnarray}A prediction for the primordial NG in the stringy regime is not available. We assume the standard field-theoretic result for a primordial bispectrum of the equilateral type with an amplitude fNLDBI=(35/108)[(β2(NeDBI)2/9)1]\hbox{$f^\mathrm{DBI}_\mathrm{NL}=-(35/108)\, [(\beta^2\, (N_{\rm e}^{\mathrm{DBI}})^2/9)-1]$}. By considering the same uniform prior on cs, we obtain β< 0.77, 66<NeDBI<72\hbox{$66 < N_{\rm e}^{\mathrm{DBI}} < 72$}, and x< 0.41 at 95% CL, which severely limits the general IR DBI model and strongly restricts the allowed parameter space.

10.2. Galileon inflation

As a further example of the implications of the NG constraints on (non-standard) inflationary models we consider Galileon inflation Burrage et al. (2011; see also Kobayashi et al. 2010; Mizuno & Koyama 2010; Ohashi & Tsujikawa 2012). This represents a well-defined and well-motivated model of inflation with more general higher derivatives of the inflaton field compared to the non-standard kinetic term case analysed above. The Galileon models of inflation are based on the so-called “Galilean symmetry” (Nicolis et al. 2009), and enjoy some well understood stability properties (absence of ghost instabilities and protection from large quantum corrections). This makes the theory also very predictive, since observable quantities (scalar and tensor power spectra and higher-order correlators) depend on a finite number of parameters. From this point of view this class of models shares some of the same properties as the DBI inflationary models (Silverstein & Tong 2004; Alishahiha et al. 2004). The Galileon field arises naturally within fundamental physics constructions (e.g., de Rham & Gabadadze 2010b,a). These models also offer an interesting example of large-scale modifications to Einstein gravity.

The Galileon model is based on the action (Deffayet et al. 2009a,b) S=d4xg(Mpl22R+n=03n),\begin{equation} S=\int {\rm d}^4x\sqrt{-g}\left(\frac{M_\mathrm{pl}^2}{2}R+\sum_{n=0}^{3}\mathcal{L}_{n}\right), \end{equation}(98)where 0=c2X,1=2(c3/Λ3)XΛφ,2=2(c4/Λ6)X[(Λφ)2(μνφ)2]+(c4/Λ6)X2R,3=2(c5/Λ9)X[(Λφ)33Λφ(μνφ)2+2(μνφ)3]+6(c5/Λ9)X2Gμνμνφ.\begin{eqnarray} \mathcal{L}_0 & = & c_2X, ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\\ \mathcal{L}_1 & =& -2\left(c_3/\Lambda^3\right)X \Box\phi,\\ \mathcal{L}_2 & =& 2 \left(c_4/\Lambda^6\right)X \left[\left(\Box\phi\right)^{2}-\left(\nabla_{\mu}\nabla_{\nu}\phi\right)^{2}\right]+\left(c_4/\Lambda^6\right) X^2\,R,\\ \mathcal{L}_3 & = & -2 \left(c_5/\Lambda^9\right)X \left[\left(\Box\phi\right)^{3}-3\Box\phi\left(\nabla_{\mu}\nabla_{\nu}\phi\right)^{2} +2 \left(\nabla_{\mu}\nabla_{\nu}\phi\right)^{3}\right] \nonumber\\ &&\quad+ 6 \left(c_5/\Lambda^9\right)X^2 G_{\mu\nu}\nabla^{\mu}\nabla^{\nu}\phi. \end{eqnarray}Here X = −∇μφμφ/ 2, (∇μνφ)2 = ∇μνφμνφ, and (∇μνφ)3 = ∇μνφμρφνρφ. The coupling coefficients ci are dimensionless and Λ is the cutoff of the theory. The case of interest includes a potential term V(φ) = V0 + λφ + (1 / 2)m2φ2 + ... to drive inflation.

The predicted scalar power spectrum at leading order is (Ohashi & Tsujikawa 2012; Burrage et al. 2011; Tsujikawa et al. 2013; see also Kobayashi et al. 2011a; Gao & Steer 2011)12𝒫=H28π2MPl2ϵsFcs|csk=aH=H48π2A(φ0˙)2cs3,\begin{equation} \label{PSG} \mathcal{P}_\mathcal{R}=\frac{H^2}{8\pi^2M_{\mathrm{Pl}}^2\epsilon_\mathrm{s} F c_\mathrm{s}}\Bigg |_{c_\mathrm{s}k=aH}=\frac{H^4}{8\pi^2 A (\dot{\phi_0})^2 c_\mathrm{s}^3}, \end{equation}(104)where F=1+4(φ̇)20/(2H2MPl2)\hbox{$F=1+\bar c_4(\dot\phi_0)^2/(2H^2M_{\mathrm{Pl}}^2)$} and cs2=B/A\hbox{$c_\mathrm{s}^2=-B/A$} is the sound speed of the Galileon field. ϵs is different from the usual slow-roll parameter ϵ1 and at leading order related according to \hbox{$\epsilon_\mathrm{s} = -2 B/(1+6 \bar c_3+18\bar c_4+30 \bar c_5) \epsilon_1 .$} The scalar spectral index ns1=2ϵ1ηss\begin{equation} \label{nsG} n_\mathrm{s}-1=-2\epsilon_1-\eta_\mathrm{s}-s \end{equation}(105)depends on the slow-roll parameters ϵ1, \hbox{$\eta_{\mathrm{s}}=\dot\epsilon_\mathrm{s}/(H\epsilon_\mathrm{s})$}, and s = ċs/ (Hcs). As usual the slow-roll parameter s describes the running of the sound speed. In the following we restrict ourselves to the case of a constant sound speed with s = 0. The tensor-to-scalar ratio is r=16ϵscs=16ϵ1s,\begin{equation} \label{cG} r=16\epsilon_s c_{\mathrm{s}}=16 \epsilon_1 \bar c_{\mathrm{s}}, \end{equation}(106)where we have introduced the parameter \hbox{$\bar{c}_\mathrm{s}=-[2 B /(1+6 \bar c_3+18\bar c_4+30 \bar c_5)] c_\mathrm{s} $}, which is related to the Galileon sound speed. The parameter \hbox{$\bar c_{\mathrm{s}}$} can be either positive or negative. In the negative branch a blue spectral tilt for the primordial gravitational waves is allowed, contrary to the situation for standard slow-roll models of inflation. We introduce such a quantity so that the consistency relation takes the form \hbox{$r \approx -8 n_\mathrm{t} \bar c_\mathrm{s}$}, with nt = −2ϵ1, analogous to Eq. (88). The measurements of primordial NG constrain \hbox{$\bar{c}_\mathrm{s},$} which in turn constrains ϵ1 and ηs in Eq. (105). This is analogous to the constraints on ϵ1 and η of Eq. (87) in the previous subsection.

Galileon models of inflation predict interesting NG signatures (Burrage et al. 2011; Tsujikawa et al. 2013).13 We have verified (see also Creminelli et al. 2011) that bispectra can be generated with the same shapes as the “EFT1” and “EFT2” (Senatore et al. 2010; Chen et al. 2007) constrained in the companion paper (Planck Collaboration XVII 2016), which usually arise in models of inflation with non-standard kinetic terms, with fNLEFT1=17972(5cs4+30cs240css+15),fNLEFT2=1243(5cs4+30/A55cs2+40css320css30A+275225cs2+280cs3s)·\begin{eqnarray} \label{fnlG} f_\mathrm{NL}^\mathrm{EFT1}&=& \frac{17}{972}\left(-\frac{5}{c_{\mathrm{s}}^4}+\frac{30}{c_{\mathrm{s}}^2}-\frac{40}{c_{\mathrm{s}} \bar c_{\mathrm{s}}}+15 \right),\\ \label{fnlG2} f_\mathrm{NL}^\mathrm{EFT2}&=& \frac{1}{243} \left(\frac{5}{c_{\mathrm{s}}^4}+ \frac{30/A-55}{c_{\mathrm{s}}^2}+\frac{40}{c_{\mathrm{s}} \bar c_{\mathrm{s}}}-320 \frac{c_{\mathrm{s}}}{\bar c_{\mathrm{s}}}-\frac{30}{A}+275 \right. \nonumber\\ &&\quad- \left.225 c_{\mathrm{s}}^2+280 \frac{{c_{\mathrm{s}}^3}}{\bar c_{\mathrm{s}}}\right)\cdot \end{eqnarray}As explained in the previous subsection, the linear combinations of these two bispectra produce both equilateral and orthogonal bispectrum templates. Given Eqs. (104)–(108), we can proceed as in the previous section to exploit the limits on primordial NG in a combined analysis with the power spectra analysis. In Planck Collaboration XVII (2016) the constraint cs ≥ 0.23 (95% CL) is obtained based on the constraints on fNLequil\hbox{$f^{\mathrm{equil}}_{\mathrm{NL}}$} and fNLortho\hbox{$f^{\mathrm{ortho}}_{\mathrm{NL}}$}. One can proceed as described in Planck Collaboration XVII (2016) to constrain the parameter \hbox{$\bar{c}_{\mathrm{s}}$} modifying the consistency relation, Eq. (106). Adopting a log-uniform prior on A in the interval 10-4A ≤ 104 and a uniform prior 10-4cs ≤ 1, the Planck measurements on fNLequil\hbox{$f^{\mathrm{equil}}_{\mathrm{NL}}$} and fNLortho\hbox{$f^{\mathrm{ortho}}_{\mathrm{NL}}$} constrain \hbox{$\bar c_{\mathrm{s}}$} to be \hbox{$0.038 \leq \bar c_{\mathrm{s}}\leq 100$} (95% CL) (Planck Collaboration XVII 2016). We also explore the possibility of the negative branch (corresponding to a blue tensor spectral index), finding \hbox{$-100 \leq \bar c_{\mathrm{s}} \leq -0.034$} (95% CL) (Planck Collaboration XVII 2016). By allowing a logarithmic prior on \hbox{$\bar c_{\mathrm{s}}$} based on the fNL measurements, Fig. 36 shows the joint constraints on ϵ1 and ηs for the nt< 0 branch and for the nt> 0 branch. Planck TT+lowP+BAO and the NG bounds on \hbox{$\bar c_{\mathrm{s}}$} constrain ϵ1< 0.036 at 95% CL for nt< 0 (and | ϵ1 | < 0.041 for nt> 0).

thumbnail Fig. 36

Marginalized joint 68% and 95% CL for the Galileon parameters (ϵ1,ηs) for nt< 0 (left panel) and nt> 0 (right panel).

10.3. Axion monodromy inflation

10.3.1. Introduction

The mechanism of monodromy inflation (Silverstein & Westphal 2008; McAllister et al. 2010; Kaloper et al. 2011; Flauger et al. 2014b) in string theory motivates a broad class of inflationary potentials of the form V(φ)=μ4pφp+Λ04eC0(φφ0)pΛcos[γ0+φ0f0(φφ0)pf+1].\begin{equation} V(\phi)=\mu^{4-p} \phi^p +\Lambda_0^4\, {\rm e}^{- C_0 \left(\frac{\phi}{\phi_0}\right)^{p_\Lambda}} \cos\left[\gamma_0 +\frac{\phi_0}{f_0} \left(\frac{\phi}{\phi_0}\right)^{p_f+1}\right] . \label{powersform} \end{equation}(109)Here μ, Λ0, f0, and φ0 are constants with the dimension of mass and C0, p, pΛ, pf, and γ0 are dimensionless.

In simpler parameterizations used in prior analyses of oscillations from axion monodromy inflation (Peiris et al. 2013; Planck Collaboration XXII 2014; Easther & Flauger 2014; Jackson et al. 2014; Meerburg et al. 2014b,a; Meerburg & Spergel 2014; Meerburg 2014), one assumes pΛ = pf = 0, corresponding to a sinusoidal term with constant amplitude throughout inflation taken to be a periodic function of the canonically-normalized inflaton φ. Taking pΛ ≠ 0 and pf ≠ 0 allows the magnitude and frequency, respectively, of the modulation to depend on φ. For example, the frequency is always a periodic function of an underlying angular axion field, but its relation to the canonically normalized inflaton field is model-dependent.

The microphysical motivation for pΛ ≠ 0 and pf ≠ 0 is that in string theory additional scalar fields, known as “moduli,” evolve during inflation. The inflationary potential depends on a subset of these fields. Because the magnitude and frequency of modulations are determined by the vacuum expectation values of moduli, both quantities are then naturally functions of φ. The case pΛ = pf = 0 corresponds to when these fields are approximately fixed, stabilized strongly by additional terms in the scalar potential. But in other cases, the axion potential that drives inflation also provides a leading term stabilizing the moduli. The exponential dependence of the magnitude in the potential of Eq. (109) arises because the modulations are generated non-perturbatively, e.g., by instantons. For this reason, the modulations can be undetectably small in this framework, although there are interesting regimes where they could be visible.

Specific examples studied thus far yield exponents p, pΛ, and pf that are rational numbers of modest size. For example, models with p = 3, 2, 4 / 3, 1, and 2 / 3 have been constructed (Silverstein & Westphal 2008; McAllister et al. 2010, 2014), or in another case p = 4 / 3, pΛ = −1 / 3, and pf = −1 / 3. Following Flauger et al. (2014b), we investigate the effect of a drift in frequency arising from pf, neglecting a possible drift in the modulation amplitude by setting pΛ = C0 = 0. Even in this restricted model, a parameter exploration using a fully numerical computation of the primordial power spectrum following the methodology of Peiris et al. (2013) is prohibitive, so we follow Flauger et al. (2014b) to study two templates capturing the features of the primordial spectra generated by this potential.

The first template, which we call the “semi-analytic” template, is given by 𝒫(k)=𝒫(k)(kk)ns11+δnscos[φ0f(φkφ0)pf+1+Δφ].\begin{equation} \label{eq:semianalytic_mono} {\cal P}_{\cal R} (k) = {\cal P}_{\cal R} (k_*) \left(\frac{k}{k_*} \right)^{n_\mathrm{s} -1} \left\{1 + \delta n_\mathrm{s} \cos\left[ \frac{\phi_0}{f}\left(\frac{\phi_k}{\phi_0}\right)^{p_f+1} + \Delta \phi \right] \right\}. \end{equation}(110)The parameter f is higher than the underlying axion decay constant f0 of the potential by a few percent, but this difference will be neglected in this analysis. The quantity φ0 is some fiducial value for the scalar field, and φk is the value of the scalar field at the time when the mode with comoving momentum k exits the Hubble radius. At leading order in the slow-roll expansion, in units where the reduced Planck mass MPl = 1, φk=2p(N0ln(k/k))\hbox{$\phi_k = \sqrt{2p\,(N_0 - \ln(k/k_*))}$}, where N0=N+φend2/(2p)\hbox{$N_0 = N_* + \phi_\mathrm{end}^2/(2p)$}, and φend is the value of the scalar field at the end of inflation.

The second “analytic” template was derived by Flauger et al. (2014b) by expanding the argument of the trigonometric function in Eq. (110) in ln(k/k), leading to 𝒫(k)=𝒫(k)(kk)ns1×{1+δnscos[Δφ+α(ln(kk)+n=12cnNnlnn+1(kk))]}.\begin{eqnarray} \label{eq:analytic_mono} {\cal P}_{\cal R}(k)& =& {\cal P}_{\cal R} (k_*) \left(\frac{k}{k_*} \right)^{n_\mathrm{s} -1} \\ \nonumber && \times \left\{1 + \delta n_\mathrm{s} \cos\left[\Delta \phi + \alpha\left(\ln\left(\frac{k}{k_*}\right) + \sum_{n=1}^{2} \frac{c_n}{N_*^n} \ln^{n+1}\left(\frac{k}{k_*}\right)\right)\right] \right\}. \end{eqnarray}(111)The relation between the empirical parameters in the templates and the potential parameters are approximated by δns=3b2π/α\hbox{$\delta n_\mathrm{s} = 3b\sqrt{2\pi/\alpha}$}, where α=(1+pf)φ02fN0(2pN0φ0)1+pf,\begin{equation} \label{eq:alpha} \alpha = (1+p_f)\frac{\phi_0}{2 f N_0} \left(\frac{\sqrt{2 p N_0}}{\phi_0}\right)^{1+p_f}, \end{equation}(112)and b is the monotonicity parameter defined in Flauger et al. (2014b), providing relations converting bounds on cn into bounds on the microphysical parameters of the potential. However, the analytic template can describe more general shapes of primordial spectra than just axion monodromy.

As discussed by Flauger et al. (2014b), there is a degeneracy between p (or alternatively ns) and f. For both templates we fix p = 4 / 3 and also fix the tensor power spectrum to its form in the absence of oscillations. This is an excellent approximation because tensor oscillations are suppressed relative to the scalar oscillations by a factor α(f/MPl)2 ≪ 1. A uniform prior π< Δφ<π is adopted for the phase parameter of both templates as well as a prior 0 <δns< 0.7 for the modulation amplitude parameter.

thumbnail Fig. 37

Constraints on the parameters of the analytic template, showing joint 68% and 95% CL. The dotted lines correspond to the frequencies showing the highest-likelihood improvements (see text).

thumbnail Fig. 38

Constraints on the parameters of the semi-analytic template showing joint 68% and 95% CL. The solid lines on the left-hand panel mark the frequencies showing the highest-likelihood improvements (see text).

In order to specify the semi-analytic template, we assume instantaneous reheating, which for p = 4 / 3 corresponds to N ≈ 57.5 for k = 0.05 Mpc-1. We set φ0 = 12.38MPl with φend = 0.59MPl. We adopt uniform priors −4 < log 10(f/MPl) < −1 and −0.75 <pf< 1 for the remaining parameters. The priors 0 < ln(α) < 6.9 and −2 <c1,2< 2 specify the analytic template. The single-field effective field theory becomes strongly coupled for α> 200. However, in principle the string construction remains valid in this regime.

10.3.2. Power spectrum constraints on monodromy inflation

We carry out a Bayesian analysis of axion monodromy inflation using a high-resolution version of CAMB coupled to the PolyChord sampler (see Sect. 8.2). For our baseline analysis we conservatively adopt Planck TT+lowP, using the “bin1” high-TT likelihood. In addition to the primordial template priors specified above, we marginalize over the standard priors for the cosmological parameters, the primordial amplitude, and foreground parameters.

thumbnail Fig. 39

Frequency residuals for the ln(α) ≈ 3.5 likelihood peak, binned at Δ = 30. The ± 1σ errors are given by the square root of the diagonal elements of the covariance matrix.

The marginalized joint posterior constraints on pairs of primordial parameters for the analytic and semi-analytic templates are shown in Figs. 37 and 38, respectively.

The complex structures seen in these plots arise due to degeneracies in the likelihood frequency “beating” between underlying modulations in the data and the model (Easther et al. 2005). Parameter combinations where “beating” occurs over the largest k ranges lead to discrete local maxima in the likelihood. Fortuitous correlations in the observed realization of the C can give the same effect.

The four frequencies picked out by these structures, ln(α) ≈ {3.5,5.4,6.0,6.8}, show improvements of Δχ2 ≈ {−9.7,−7.1,−12.2,−12.5} relative to ΛCDM, respectively. These frequencies are marked by dotted lines in Fig. 37, and by solid lines in Fig. 38 using Eq. (112). The semi-analytic and analytic templates lead to self-consistent results as expected, with analogous structures being picked out by the likelihood in each template. There is no evidence for a drifting frequency, pf ≠ 0 or cn ≠ 0. Thus, these parameters serve to smooth out structures in the marginalized posterior.

The improvement in χ2 is not compelling enough to suggest a primordial origin. Fitting a modulated model to simulations with a smooth spectrum can give rise to Δχ2 ~ −10 improvements (Flauger et al. 2014b). Furthermore, as the monodromy model contains only a single frequency, at least three of these structures must correspond to spurious fits to the noise. Considering the two models defined by the two templates and the parameter priors specified above, the Bayes factors calculated using PolyChord favours base ΛCDM over both templates by odds of roughly 8:1.

Compared to previous analyses of the linear (p = 1) axion monodromy model for WMAP9 (Peiris et al. 2013) and the 2013 Planck data (Planck Collaboration XXII 2014; Easther & Flauger 2014) the common frequencies are shifted slightly upward. The lower frequency in common appears shifted by a factor of order p\hbox{$\sqrt{p}$} from α ≈ 28.9 to 31.8 and the higher frequency in common from α ≈ 210 to 223. Flauger et al. (2014b) suggest that the lower frequency (which had Δχ2 = −9 in PCI13) was associated with the 4 K cooler line systematic effects in the 2013 Planck likelihood. However, its presence at similar significance in the 2015 likelihood with improved handling of the cooler line systematics suggests that this explanation is not correct. The second frequency, which appeared with Δχ2 ≈ −20 in WMAP9 (Peiris et al. 2013) is still present but with much reduced significance, suggesting that the high multipoles do not give evidence for this frequency. Additionally, two higher frequencies are present, which if interpreted as being of primordial origin, correspond to a regime well beyond the validity of the single-field effective field theory. If one of these frequencies were to be confirmed as primordial, a significantly improved understanding of the underlying string construction would need to be undertaken.

In order to check whether the improvement in fit at these four modulation frequencies is responding to residual foregrounds or other systematics, we examine the frequency residuals. Figure 39 shows the residuals of the data minus the model (including the best-fit foreground model) for the four PLIK frequency combinations binned at Δ = 30 for the lowest modulation frequency, ln(α) ≈ 3.5. This plot shows no significant frequency dependence, and thus there is no indication that the fit is responding to frequency dependent systematics. Furthermore, the plot does not show evidence that the improvement for this modulation frequency comes from the feature at ≈ 800, as suggested by Easther & Flauger (2014). This feature and another at ≈ 1500 are apparent at all frequency combinations. Similar plots for the three other modulation frequencies also do not show indications of frequency dependence.

In order to confirm whether any of the frequencies picked out here is of primordial origin, one can exploit independent information in the polarization data to perform a cross-check of the temperature prediction, thus minimizing the “look-elsewhere” effect (Mortonson et al. 2009). Leaving a complete analysis of the independent information in the polarization for future work, we now check whether the temperature-only result remains stable when high- polarization is added in the likelihood. In Fig. 40 we show a preliminary analysis using the PLIK temperature and polarization (TT, TE, and EE) “bin1” likelihood plus low- polarization data. A comparison with the left-hand panels of Figs. 37 and 38 indicates slight differences from the T-only analysis. However, all the four frequencies identified in the temperature are present when high- polarization is added. There is a maximum Δχ2 ≈ −8.0 improvement over ΛCDM. We also repeat the analysis using only the EE polarization “bin1” likelihood plus low- temperature and polarization data. These results are presented in Fig. 41. The EE-only frequencies are offset with respect to the temperature-only frequencies: the best-fit EE-only frequencies are at ln(α) ≈ {3.8,5.0,5.4,5.8,6.2}. The maximum improvement over ΛCDM for this case is Δχ2 ≈ −12.5.

thumbnail Fig. 40

Constraints on the parameters of the analytic (top) and semi-analytic (bottom) templates with the addition of high- polarization data in the likelihood, showing joint 68% and 95% CL. The lines mark the frequencies showing the highest-likelihood improvements identified in the baseline temperature-only analysis.

thumbnail Fig. 41

Constraints on the parameters of the analytic (top) and semi-analytic (bottom) templates with EE-only high- polarization data plus low- temperature and polarization data, showing joint 68% and 95% CL. The lines mark the frequencies showing the highest-likelihood improvements identified in the baseline temperature-only analysis.

10.3.3. Predictions for resonant non-Gaussianity

The left-hand panel of Fig. 42 presents derived constraints on the parameters of the potential in Eq. (109) calculated using the analytic template. Another cross-check of primordial origin is available since the monodromy model predicts resonant NG, generating a bispectrum whose properties would be strongly correlated with that of the power spectrum (Chen et al. 2008; Flauger & Pajer 2011). Using the mapping fNLres=δns8α2,\begin{equation} f_\mathrm{NL}^\mathrm{res} = \frac{\delta n_\mathrm{s}}{8} \alpha^2 , \end{equation}(113)we use the analytic template to derive the posterior probability for the resonant NG signal predicted by constraints from the power spectrum, presented in the middle and right panels of Fig. 42.

Planck Collaboration XVII (2016) use an improved modal estimator to scan for resonant NG. The resolution of this scan is currently limited to ln(α) < 3.9, which potentially can probe the lowest frequency picked out in the power spectrum search. However, the modal estimator’s sensitivity (imposed by cosmic variance) of ΔfNLres80\hbox{$\Delta f_\mathrm{NL}^\mathrm{res} \approx 80$} is significantly greater than the predicted value for this frequency from fits to the power spectrum, fNLres~10\hbox{$f_\mathrm{NL}^\mathrm{res} \sim 10$}. Efforts to increase the resolution of the modal estimator are ongoing and may allow consistency tests of the significantly higher levels of resonant NG predicted by the higher frequencies in the future.

thumbnail Fig. 42

Derived constraints on the parameters of the potential, Eq. (109), as well as the predicted resonant NG, fNLres\hbox{$f_\mathrm{NL}^\mathrm{res}$}, using the analytic template, showing joint 68% and 95% CL. The dotted lines mark the frequencies showing the highest-likelihood improvements (see text).

10.3.4. Power spectrum and bispectrum constraints on axion inflation with a gauge field coupling

We now consider the case where the axion field is coupled to a gauge field. Such a scenario is physically well motivated. From an effective field theory point of view the derivative coupling is natural and must be included since it respects the same shift symmetry that leads to axion models of inflation (Anber & Sorbo 2010; Barnaby & Peloso 2011; Pajer & Peloso 2013). This type of coupling is also ubiquitous in string theory (see, e.g., Barnaby et al. 2012; Linde et al. 2013). The coupling term in the action is (Anber & Sorbo 2010; Barnaby & Peloso 2011; Barnaby et al. 2011) Sd4xg(α4fφFμν˜Fμν),\begin{equation} S\supset\int {\rm d}^4x\sqrt{-g}\left(-\frac{\alpha}{4f}\phi F^{\mu\nu}\tilde F_{\mu\nu}\right), \end{equation}(114)where Fμν = μAννAμ, its dual is ˜Fμν=ϵμναβFαβ/2\hbox{$\tilde F^{\mu\nu}=\epsilon^{\mu\nu\alpha\beta} F_{\alpha\beta}/2$}, and α is a dimensionless constant which, from an effective field theory perspective, is expected to be of order one. For the potential of the axion field, we will not investigate further the consequences of the oscillatory part of the potential, focusing on the coupling of the axion field to the U(1) gauge field (effectively setting Λ0 = 0).

The coupling of a pseudo-scalar axion with the gauge field has interesting phenomenological consequences, both for density perturbations and primordial gravitational waves (Barnaby & Peloso 2011; Sorbo 2011; Barnaby et al. 2011, 2012; Meerburg & Pajer 2013; Ferreira & Sloth 2014). Gauge field quanta source the axion field via an inverse decay process δA + δAδϕ, modifying the usual predictions already at the power spectrum level. Additionally, the inverse decay can generate a high level of primordial NG.

The parameter ξ=α|φ̇|2fH\begin{equation} \xi=\frac{\alpha|\dot\phi|}{2fH} \end{equation}(115)characterizes the strength of theinverse decay effects. If ξ< 1 the coupling is too small to produce any modifications to the usual predictions of the uncoupled model. For previous constraints on ξ see Barnaby et al. (2011, 2012) and Meerburg & Pajer (2013). Using the slow-roll approximation and neglecting the small oscillatory part of the potential, one can express ξ=MPlαfp8N+2p,\begin{equation} \xi=M_\mathrm{Pl}\frac{\alpha}{f}\sqrt{\frac{p}{8N+2p}},\label{xi} \end{equation}(116)where N is, as usual, the number of e-folds to the end of inflation. The scalar power spectrum of curvature perturbations is given by 𝒫(k)=𝒫(kk)ns1[1+𝒫(kk)ns1f2(ξ(k))e4πξ(kk)2πξϵ2],\begin{equation} \label{p1new} \mathcal{P}_{\mathcal {R}}(k)=\mathcal{P}_* \left(\frac{k}{k_*} \right)^{n_\mathrm{s}-1} \left[1+ \mathcal{P}_* \left(\frac{k}{k_*} \right)^{n_\mathrm{s}-1} f_2(\xi(k))\, e^{4 \pi \xi_*} \left(\frac{k}{k_*}\right)^{2 \pi \xi_* \epsilon_2} \right], \end{equation}(117)where (Meerburg & Pajer 2013) ξ(k)=ξ[1+ϵ22ln(kk)]·\begin{equation} \label{xik} \xi(k)=\xi_* \left[1+\frac{\epsilon_2}{2} \ln \left(\frac{k}{k_*} \right) \right]\cdot \end{equation}(118)Here an asterisk indicates evaluation at the pivot scale, k = 0.05 Mpc-1, and 𝒫=H4/(4π2φ̇2)\hbox{$\mathcal{P}_*=H_*^4/(4 \pi^2 \dot \phi_*^2)$} and ns−1 = −2ϵ1ϵ2 are the amplitude and spectral index, respectively, of the standard slow-roll power spectrum of vacuum-mode curvature perturbations (the usual power spectrum in the absence of the gauge-coupling). By numerically evaluating the function f2(ξ) (defined in Eq. (3.27) of Barnaby et al. 2011), we created an analytical fit to this function accurate to better than 2% for 0.1 <ξ< 7.14 In the following, unless stated otherwise, we fix p = 4 / 3 as in the previous subsection and assume instantaneous reheating so that N ≈ 57.5 and the slow-roll parameters ϵ1 and ϵ2 are fixed. For the tensor power spectrum we adopt the approximation (Barnaby et al. 2011) 𝒫t(k)=𝒫t(kk)nt[1+π22𝒫tft,L(ξ(k))e4πξ(kk)nt+2πϵ2ξ],\begin{equation} \mathcal{P}_{\mathrm t}(k)= \mathcal{P}_{\mathrm t}\left(\frac{k}{k_*}\right)^{n_\mathrm{t}} \left[1+\frac{\pi^2}{2} \mathcal{P}_{\mathrm t} f_{\mathrm t,L}(\xi(k)) {\rm e}^{4 \pi \xi_*} \left(\frac{k}{k_*}\right)^{n_\mathrm{t}+2 \pi \epsilon_2 \xi_*}\right], \end{equation}(119)where ft,L(ξ(k))=2.6×10-7ξ-5.7(k).\begin{equation} \label{fTL} f_{\mathrm t,L}(\xi(k))=2.6 \times 10^{-7} \xi^{-5.7}(k). \end{equation}(120)Here 𝒫t=2H2/(π2MPl2)\hbox{$\mathcal{P}_{\mathrm t}=2 H_*^2/(\pi^2 M_\mathrm{Pl}^2)$} and nt = −2ϵ1 are the “usual” expressions for the tensor amplitude and tensor tilt in standard slow-roll inflation.

The total bispectrum is (Barnaby et al. 2012) B(ki)=Binv.dec.(ki)+Bres(ki)=% subequation 15172 0 \begin{eqnarray} B(k_i) &=& B_{\text{inv.dec.}}(k_i) + B_{\mathrm{res}}(k_i) \\ &=& f_\mathrm{NL}^{\text{inv.dec.}}(\xi) \, F_{\mathrm{\text{inv.dec.}}}(k_i) +B_{\mathrm{res}}(k_i),\label{bispectrum} \end{eqnarray}where the explicit expression for Finv.dec.(ki) (Barnaby et al. 2011; see also Meerburg & Pajer 2013) is reported in Planck Collaboration XVII (2016). This shows that the inverse decay effects and the resonant effects (which arise from the oscillatory part of the potential) simply “add up” in the bispectrum. The nonlinearity parameter is fNLinv.dec.=f3(ξ)𝒫3e6πξ𝒫2(k)·\begin{equation} f_\mathrm{NL}^{\mathrm{inv.dec.}} = \frac{f_3(\xi_*) \mathcal{P}_*^3 e^{6\pi\xi_*}}{\mathcal{P}_{\mathcal{R}}^2(k_*)}\cdot\label{fNLequil} \end{equation}(122)The function f3(ξ) corresponds to the quantity f3(ξ;1,1) defined in Eq. (3.29) of Barnaby et al. (2011). We have computed f3(ξ) numerically and used a fit with an accuracy of better than 2%.15 We use the observational constraint fNLinv.dec.=22.7±25.5\hbox{$f_\mathrm{NL}^\mathrm{inv.dec.}=22.7 \pm 25.5$} (68% CL) obtained in Planck Collaboration XVII (2016) from an analysis where only the inverse decay type NG is assumed present. We omit the explicit expression for the resonant bispectrum Bres, since it will not be used here.

We carried out an MCMC analysis of constraints on the (scalar and tensor) power spectra predicted by this model with the Planck TT+lowP likelihood, marginalizing over standard priors for the cosmological parameters and foreground parameters with the uniform priors \hbox{$2.5 \leq \ln [10^{10} \mathcal{P}_* ] \leq 3.7$} and 0.1 ≤ ξ ≤ 7.0.

The power spectrum constraint gives 0.1ξ2.3(95%CL).\begin{equation} 0.1 \leq \xi_* \leq 2.3 \quad (95\%~\mathrm{CL}). \end{equation}(123)Given that fNLinv.dec.\hbox{$f_\mathrm{NL}^{\mathrm{inv.dec.}}$} is exponentially sensitive to ξ, this translates into the prediction (using Eq. (122)) fNLinv.dec.1.2\hbox{$f_\mathrm{NL}^{\mathrm{inv.dec.}} \leq 1.2$}, which is significantly tighter than the current bispectrum constraint from Planck Collaboration XVII (2016). Indeed, importance sampling with the likelihood for fNLinv.dec.\hbox{$f_\mathrm{NL}^{\mathrm{inv.dec.}}$}, taken to be a Gaussian centred on the NG estimate fNLinv.dec.=22.7±25.5\hbox{$f_\mathrm{NL}^\mathrm{inv.dec.}=22.7 \pm 25.5$} (68% CL) (Planck Collaboration XVII 2016), changes the limit on ξ only at the second decimal place.

We now derive constraints on model parameters using only the observational constraint on fNLinv.dec.\hbox{$f_\mathrm{NL}^\mathrm{inv.dec}.$} The constraints thus derived are applicable for generic p and also to the axion monodromy model discussed in Sect. 10.3, even in the case Λ0 ≠ 0. We follow the procedure described in Sect. 11 of Planck Collaboration XVII (2016). The likelihood for fNLinv.dec.\hbox{$f_\mathrm{NL}^{\mathrm{inv.dec.}}$} is taken to be a Gaussian centred on the NG estimate fNLinv.dec.=22.7±25.5\hbox{$f_\mathrm{NL}^\mathrm{inv.dec.}=22.7 \pm 25.5$} (68% CL) (Planck Collaboration XVII 2016). We use the expression of Eq. (122), where f3(ξ) is numerically evaluated. To find the posterior distribution for the parameter ξ we choose uniform priors in the intervals \hbox{$1.5\times 10^{-9}\leq\mathcal{P}_* \leq3.0\times 10^{-9}$} and 0.1 ≤ ξ ≤ 7.0. This yields 95% CL constraints for ξ (for any value of p) of ξ2.5(95%CL).\begin{equation} \xi_*\leq2.5\quad (95\%~\mathrm{CL}). \end{equation}(124)If we choose a log-constant prior on ξ we find ξ2.2(95%CL).\begin{equation} \xi_*\leq2.2 \quad (95\%~\mathrm{CL}). \end{equation}(125)For both cases the results are insensitive to the upper limit chosen for the prior on ξ since the likelihood quickly goes to zero for ξ> 3. As the likelihood for ξ is fairly flat, the tighter constraint seen for the log-constant case is mildly prior driven. The constraints from the bispectrum are consistent with, and slightly worse than, the result from the power spectrum alone.

Using a similar procedure and Eq. (116) one can also obtain a constraint on α/f. Adopting a log-constant prior162 ≤ α/f ≤ 100 and uniform priors 50 ≤ N ≤ 70 and \hbox{$1.5\times 10^{-9}\leq\mathcal{P}_*\leq3.0\times 10^{-9}$} we obtain the 95% CL constraints α/f48MPl-1forp=1,α/f35MPl-1forp=2,\begin{equation} \alpha/f\leq48M^{-1}_\mathrm{Pl}\,\,\,\mathrm{for}\,\,\, p=1,\qquad\alpha/f\leq35M^{-1}_\mathrm{Pl} \,\,\,\mathrm{for}\,\,\, p=2, \end{equation}(126)and α/f42MPl-1forp=4/3.\begin{equation} \alpha/f\leq 42M^{-1}_\mathrm{Pl} \,\,\,\mathrm{for}\,\,\, p=4/3. \end{equation}(127)For example, for a linear potential, p = 1, if α ~ 1 as suggested by effective field theory, then the axion decay constant f is constrained to be f0.020MPl(95%CL),\begin{equation} f\geq 0.020M_\mathrm{Pl} \quad (95\%~\mathrm{CL}) , \end{equation}(128)while for a potential with p = 4 / 3 we find f0.023MPl(95%CL).\begin{equation} f\geq 0.023M_\mathrm{Pl}\quad (95\%~\mathrm{CL}). \end{equation}(129)These limits are complementary to those derived in Sect. 10.3.

11. Constraints on isocurvature modes

thumbnail Fig. 43

Angular power spectra for the scale-invariant (i.e., nℛℛ = 1) pure adiabatic mode (ADI, green dashed curves) and for the scale invariant (nℐℐ = 1) pure isocurvature (CDI, NDI, or NVI) modes, with equal primordial perturbation amplitudes. The thick lines represent the temperature auto-correlation (TT) and the thin lines the E-mode polarization auto-correlation (EE).

In PCI13, we presented constraints on a number of simple models featuring a mixture of the adiabatic (ADI) mode and one type of isocurvature mode. We covered the cases of CDM density isocurvature (CDI), neutrino density isocurvature (NDI), and neutrino velocity isocurvature (NVI) modes (Bucher et al. 2000) with different assumptions concerning the correlation (Langlois 1999; Amendola et al. 2002) between the primordial adiabatic and isocurvature perturbations. Isocurvature modes, possibly correlated among themselves and with the adiabatic mode, can be generated in multi-field models of inflation; however, at present a mechanism for exciting the neutrino velocity isocurvature mode is lacking. Section 11.2 shows how these constraints have evolved with the new Planck TT+lowP likelihoods, how much including the Planck lensing likelihood changes the results, and what extra information the Planck high- polarization contributes. A pure isocurvature mode as a sole source of perturbations has been ruled out (Enqvist et al. 2002), since, as can be seen from Fig. 43, any of the isocurvature modes leads to an acoustic peak structure for the temperature angular power very different from the adiabatic mode, which fits the data very well. The different phases and tilts of the various modes also occur in the polarization spectra, as shown in Fig. 43 for the E mode.17

In Sect. 11.4 we add one extra degree of freedom to the generally-correlated ADI+CDI model by allowing primordial tensor perturbations (assuming the inflationary consistency relation for the tilt of the tensor power spectrum and its running). Our main goal is to explore a possible degeneracy between tensor modes and negatively-correlated CDI modes, tending to tilt the large-scale temperature spectrum in opposite directions. In Sect. 11.5, we update the constraints on three special cases motivated by axion or curvaton scenarios.

The goal of this analysis is to test the hypothesis of adiabaticity and establish the robustness of the base ΛCDM model against different assumptions concerning initial conditions (Sect. 11.3). Adiabaticity is also an important probe of the inflationary paradigm, since any significant detection of isocurvature modes would exclude the possibility that all perturbations in the Universe emerged from quantum fluctuations of a single inflaton field, which can excite only one degree of freedom, the curvature (i.e., adiabatic) perturbation.18

In this section, theoretical predictions were obtained with a modified version of the CAMB code (version Jul14) while parameter exploration was performed with the MultiNest nested sampling algorithm.

11.1. Parameterization and notation

A general mixture of the adiabatic mode and one isocurvature mode is described by the three functions \hbox{${\cal P}_{\cal R\cal R}(k),$}\hbox{${\cal P}_{\cal I\cal I}(k),$} and \hbox{${\cal P}_{\cal R\cal I}(k)$} describing the curvature, isocurvature, and cross-correlation power spectra, respectively. Our sign conventions are such that positive values for \hbox{${\cal P}_{\cal RI}$} correspond to a positive contribution of the cross-correlation term to the Sachs-Wolfe component of the total temperature spectrum.

As in PCI13, we specify the amplitudes at two scales k1<k2 and assume power-law behaviour, so that 𝒫ab(k)=exp[(ln(k)ln(k2)ln(k1)ln(k2))ln(𝒫ab(1))+(ln(k)ln(k1)ln(k2)ln(k1))ln(𝒫ab(2))],\begin{eqnarray} \begin{split} {\cal P}_{\mathrm{ab}}(k)&= \exp \Biggl[ \left( \frac {\ln (k )-\ln (k_2)} {\ln (k_1)-\ln (k_2)} \right) \ln\left({\cal P}^{(1)}_{\mathrm{ab}}\right) \\ &\quad + \left( \frac {\ln (k )-\ln (k_1)} {\ln (k_2)-\ln (k_1)} \right) \ln\left({\cal P}^{(2)}_{\mathrm{ab}}\right) \Biggr] , \label{matInter} \end{split} \end{eqnarray}(130)where a,b = ℐ,ℛ and ℐ = ℐCDI, NDI, or NVI. We set k1 = 0.002 Mpc-1 and k2 = 0.100 Mpc-1, so that [ k1,k2 ] spans most of the range constrained by the Planck data. The positive definiteness of the initial condition matrix imposes a constraint on its elements at any value of k: [𝒫ab(k)]2𝒫aa(k)𝒫bb(k).\begin{equation} \left[\mathcal{P}_{\mathrm{ab}}(k)\right]^2 \le \mathcal{P}_{\mathrm{aa}}(k) \mathcal{P}_{\mathrm{bb}}(k). \label{PD:Constraint} \end{equation}(131)We take uniform priors on the positive amplitudes, 𝒫ℛℛ(1),𝒫ℛℛ(2)𝒫ℐℐ(1),𝒫ℐℐ(2)(0,10-8).\begin{eqnarray} \mathcal{P}_{\cal R\cal R}^{(1)}, \mathcal{P}_{\cal R\cal R}^{(2)} & \in &(10^{-9},\, 10^{-8}), \label{eq:RRprior}\\ \mathcal{P}_{\cal I\cal I}^{(1)}, \mathcal{P}_{\cal I\cal I}^{(2)} & \in& (0,\, 10^{-8}). \end{eqnarray}The correlation spectrum can be positive or negative. For ab we apply a uniform prior at large scales (at k1): 𝒫ab(1)(10-8,10-8),\begin{equation} \mathcal{P}_{\mathrm{ab}}^{(1)} \in (-10^{-8},\, 10^{-8}),\label{eq:RIprior} \end{equation}(134)but reject all parameter combinations violating the constraint in Eq. (131). To ensure that Eq. (131) holds for all k, we restrict ourselves to a scale-independent correlation fraction: cosΔab𝒫ab(𝒫aa𝒫bb)1/2(1,1).\begin{equation} \cos\Delta_{\mathrm{ab}} \equiv \frac{\mathcal{P}_{\mathrm{ab}}}{\left(\mathcal{P}_{\mathrm{aa}} \mathcal{P}_{\mathrm{bb}} \right)^{1/2}} \in (-1,1). \label{eq:defCosDelta} \end{equation}(135)Thus 𝒫ab(2)\hbox{${\cal P}_{\mathrm{ab}}^{(2)}$} is a derived parameter19 given by 𝒫ab(2)=𝒫ab(1)(𝒫aa(2)𝒫bb(2))1/2(𝒫aa(1)𝒫bb(1))1/2,\begin{equation} \mathcal{P}_{\mathrm{ab}}^{(2)} = \mathcal{P}_{\mathrm{ab}}^{(1)} \frac{\left(\mathcal{P}_{\mathrm{aa}}^{(2)} \mathcal{P}_{\mathrm{bb}}^{(2)} \right)^{1/2} } {\left(\mathcal{P}_{\mathrm{aa}}^{(1)} \mathcal{P}_{\mathrm{bb}}^{(1)} \right)^{1/2}}, \label{eq:Pab2} \end{equation}(136)which in terms of spectral indices is equivalent to nab=12(naa+nbb).\begin{equation} n_{\mathrm{ab}} = \frac{1}{2}(n_{\mathrm{aa}} + n_{\mathrm{bb}}). \label{eq:ncor} \end{equation}(137)The conservative baseline likelihood is Planck TT+lowP. The results obtained with Planck TT, TE, EE+lowP should be interpreted with caution because the data used in the 2015 release are known to contain some low level systematics, in particular arising from T-to-E leakage, and it is possible that such systematics may be fit by the isocurvature auto-correlation and cross-correlation templates. (See Planck Collaboration XIII 2016 for a detailed discussion.)

In what follows, we quote our results in terms of derived parameters identical to those in PCI13. We define the primordial isocurvature fraction as βiso(k)=𝒫ℐℐ(k)𝒫ℛℛ(k)+𝒫ℐℐ(k)·\begin{equation} \beta _\mathrm{iso}(k)=\frac{\mathcal{P}_\mathcal{II}(k)}{\mathcal{P}_\mathcal{RR}(k)+\mathcal{P}_\mathcal{II}(k)}\cdot \label{PrimFrac} \end{equation}(138)Unlike the primordial correlation fraction cosΔ defined in Eq. (135), βiso is scale-dependent in the general case. We present bounds on this quantity at klow = k1, kmid = 0.050 Mpc-1, and khigh = k2.

We report constraints on the relative adiabatic (ab = ℛℛ), isocurvature (ab = ℐℐ), and correlation (ab = ℛℐ) components according to their contribution to the observed CMB temperature variance in various multipole ranges: αab(min,max)\begin{eqnarray} \alpha _\mathrm{ab}(\ell _\mathrm{min},\ell _\mathrm{max}) &\equiv&\frac {(\Delta T)^2_\mathrm{ab}(\ell _\mathrm{min},\ell _\mathrm{max})} {(\Delta T)^2_\mathrm{tot}(\ell _\mathrm{min},\ell _\mathrm{max})}, \label{eq:FracDef} \end{eqnarray}(139)where (ΔT)ab2(min,max)==minmax(2+1)Cab,ℓTT.\begin{equation} {(\Delta T)^2_\mathrm{ab}(\ell _\mathrm{min},\ell _\mathrm{max})}= \sum _{\ell =\ell _\mathrm{min}}^{\ell _\mathrm{max}} (2\ell +1)C_{\mathrm{ab}, \ell }^{\rm TT}. \end{equation}(140)The ranges considered are (min,max) = (2,20), (21,200),(201,2500), and (2,2500), where the last range describes the total contribution to the observed CMB temperature variance. Here αℛℛ measures the adiabaticity of the temperature angular power spectrum, a value of unity meaning “fully adiabatic initial conditions”. Values less than unity mean that some of the observed power comes from the isocurvature or correlation spectrum, while values larger than unity mean that some of the power is “cancelled” by a negatively-correlated isocurvature contribution. The relative non-adiabatic contribution can be expressed as αnon - adi ≡ 1−αℛℛ = αℐℐ + αℛℐ.

thumbnail Fig. 44

68% and 95% CL constraints on the primordial perturbation power in general mixed ADI+CDI a); ADI+NDI b); and ADI+NVI c) models at two scales, k1 = 0.002 Mpc-1 (1) and k2 = 0.100 Mpc-1 (2), for Planck TT+lowP (grey regions highlighted by dotted contours), Planck TT+lowP+lensing (blue), and Planck TT, TE, EE+lowP (red). In the first panels, we also show contours for the pure adiabatic base ΛCDM model with the corresponding colours of solid lines.

thumbnail Fig. 45

Constraints on the primordial isocurvature fraction, βiso, at klow = 0.002 Mpc-1 and khigh = 0.100 Mpc-1, the primordial correlation fraction, cosΔ, the adiabatic spectral index, nℛℛ, the isocurvature spectral index, nℐℐ, and the correlation spectral index, nℛℐ = (nℛℛ + nℐℐ) / 2, with Planck TT+lowP data (dashed curves) and TT, TE, EE+lowP data (solid curves), for the generally-correlated mixed ADI+CDI (black), ADI+NDI (red), and ADI+NVI (blue) models. All these parameters are derived, and the distributions shown here result from a uniform prior on the primary parameters, as detailed in Eqs. (132)–(134). However, the effect of the non-flat derived-parameter priors is negligible for all parameters except for nℐℐ (and nℛℐ) where the prior biases the distribution toward one. With TT+lowP, the flatness of βiso(khigh) in the CDI case up to a “threshold” value of about 0.5 is a consequence of the (k/keq)-2 damping of its transfer function as explained in Footnote 17.

11.2. Results for generally-correlated adiabatic and one isocurvature mode (CDI, NDI, or NVI)

Results are reported as 2D and 1D marginalized posterior probability distributions. Numerical 95% CL intervals or upper bounds are tabulated in Table 16.

Figure 44 shows the Planck 68% and 95% CL contours for various 2D combinations of the primordial adiabatic and isocurvature amplitude parameters at large scales (k1 = 0.002 Mpc-1) and small scales (k2 = 0.100 Mpc-1) for (a) the generally-correlated ADI+CDI; (b) ADI+NDI; and (c) ADI+NVI models. Overall, the results using Planck TT+lowP are consistent with the nominal mission results in PCI13, but slightly tighter. In the first panels of Figs. 44ac we also show the constraints on the curvature perturbation power in the pure adiabatic case. Comparing the generally-correlated isocurvature case to the pure adiabatic case with the same data combination summarizes neatly what the data tell us about the initial conditions. If the contours in the 𝒫ℛℛ(1)\hbox{$\mathcal{P}_{\cal R\cal R}^{(1)}$}-𝒫ℛℛ(2)\hbox{$\mathcal{P}_{\cal R\cal R}^{(2)}$} plane were shifted significantly relative to the pure adiabatic case, the missing power could come either from the isocurvature and postive correlation contributions, or the extra adiabatic power could be cancelled by a negative correlation contribution. We can see that these shifts are small. The low- temperature data continue to mildly favour a negative correlation (see in particular the bottom middle panel for each of the three models), since compared to the prediction of the best-fit adiabatic base ΛCDM model, the TT angular power at multipoles ≲ 40 is somewhat low. But the dotted grey shaded contours in the three middle top panels show that for Planck TT+lowP, the posterior peaks at values (𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)}$},𝒫ℐℐ(2)\hbox{$\,\mathcal{P}_{\cal I\cal I}^{(2)}$}) entirely consistent with (0, 0), i.e., the pure adiabatic case is preferred. The best-fit values of (𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)}$},𝒫ℐℐ(2)\hbox{$\,\mathcal{P}_{\cal I\cal I}^{(2)}$}) are (1.4 × 10-11, 4.7 × 10-13) for CDI, (1.2 × 10-12, 4.6 × 10-10) for NDI, and (1.6 × 10-12, 2.3 × 10-10) for NVI, while (𝒫ℛℛ(1)\hbox{$\mathcal{P}_{\cal R\cal R}^{(1)}$},𝒫ℛℛ(2)\hbox{$\,\mathcal{P}_{\cal R\cal R}^{(2)}$}) (2.4 × 10-9, 2.1 × 10-9). It may appear from the bottom-centre panels of Fig. 44 that there is nonzero posterior probability for 𝒫ℛℐ(1)0\hbox{$\mathcal{P}_{\cal R\cal I}^{(1)} \ne 0$} when 𝒫ℐℐ(1)=0\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)} = 0$}, which would violate the positivity constraint, Eq. (131). However, the leftmost pixels of the plots are actually evaluated at values of 𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)}$} large enough that the constraint is satisfied.

Including the Planck lensing likelihood does not significantly affect the non-adiabatic primordial powers, except for tightening the constraints on the adiabatic power (see the blue versus black contours in the first panels of Figs. 44ac). Including the lensing (Cφφ\hbox{$C_\ell^{\phi\phi}$}) likelihood constrains the optical depth τ more tightly than the high- temperature and low- polarization alone (Planck Collaboration XIII 2016). As there is a strong degeneracy between τ and the primordial (adiabatic) perturbation power \hbox{$\mathcal{P}_{\cal R\cal R}$} (denoted in the other sections of this paper by As), it is natural that adding the lensing data leads to stronger constraints on \hbox{$\mathcal{P}_{\cal R\cal R}$}. Moreover, replacing the low- likelihood Planck lowP by Planck lowP+WP constrains τ better (Planck Collaboration XIII 2016). In the ADI+CDI case the effect of this replacement was very similar to adding the Planck lensing data (see also Table 16). Although the Planck lensing data do not directly constrain the isocurvature contribution,20 they can shift and tighten the constraints on some derived isocurvature parameters by affecting the favoured values of the standard parameters (present even in the pure adiabatic model). However this effect is small as confirmed in Table 16. Therefore, in the figures we do not show 1D posteriors of the derived isocurvature parameters for Planck TT+lowP+lensing, since they would be (almost) indistinguishable from Planck TT+lowP, as we see in Fig. 44 for the primary non-adiabatic parameters.

In contrast, the high-polarization data significantly tighten the bounds on isocurvature and cross-correlation parameters, as seen by comparing the dotted grey and red contours in Fig. 44. The significant negative correlation previously allowed by the temperature data in the ADI+CDI and ADI+NDI models is now disfavoured. This is also clearly visible in the 1D posteriors of primordial and observable isocurvature and cross-correlation fractions shown, respectively, in Figs. 45 and 46. Note how the cosΔ and αℛℐ parameters are driven towards zero by the inclusion of the high- TE, EE data (from the dashed to the solid lines) in the ADI+CDI and ADI+NDI cases. We also observed that when the lowP data are replaced by a simple Gaussian prior on the reionization optical depth (τ = 0.078 ± 0.019), the trend is similar. The high- ( ≥ 30) Planck TT data allow a large negative correlation, while the high-Planck TE, EE data prefer positive correlation. This is clearly seen in Fig. 47 for the ADI+CDI case. The best-fit values show an even more dramatic effect. We find cosΔ = −0.55 with TT+lowP, and + 0.15 with TT, TE, EE+lowP.

Hence there is a competition between the temperature and polarization data that balances out and yields almost symmetric results about zero correlation (except in the ADI+NVI case). The isocurvature auto-correlation amplitude is also strongly reduced, especially in the ADI+CDI case. The best-fit values are slightly offset from (𝒫ℐℐ(1),𝒫ℐℐ(2))=(0,0)\hbox{$(\mathcal{P}_{\cal I\cal I}^{(1)},\,\mathcal{P}_{\cal I\cal I}^{(2)}) = (0,0)$}, but the pure adiabatic model still lies inside the 68% CL (for ADI+CDI and ADI+NDI) or 95% CL (for ADI+NVI) regions. In summary, the high- polarization data exhibit a strong preference for adiabaticity, although one should keep in mind the possibility of unaccounted systematic effects in the polarization data, possibly leading to artificially strong constraints. For example, the tendency for polarization to shift the constraints towards positive correlation may be due to particular systematic effects that mimic modified acoustic peak structure, as we discussed in Sect. 11.1.

thumbnail Fig. 46

Constraints on the fractional contribution of the adiabatic (ℛℛ), isocurvature (ℐℐ), and correlation (ℛℐ) components to the CMB temperature variance in various multipole ranges, as defined in Eq. (139), with Planck TT+lowP data (dashed curves) and with Planck TT, TE, EE+lowP data (solid curves). These are shown for the generally-correlated mixed ADI+CDI (black), ADI+NDI (red), or ADI+NVI (blue) models.

We also performed a parameter extraction with the Planck TT, TE, EE+lowP+lensing data, but this combination did not provide interesting new constraints. We found only a tightening of bounds on the standard adiabatic parameters as in the Planck TT+lowP+lensing case.

thumbnail Fig. 47

Constraints on the primordial correlation fraction, cosΔ, in the mixed ADI+CDI model with Planck TT+lowP data (dashed black curve) compared to the case where Planck lowP data are not used, but replaced by a Gaussian prior τ = 0.078 ± 0.019 (dashed red curve). The same exercise is repeated with Planck TT, TE, EE data (solid curves) demonstrating that to a great extent the preferred value of cosΔ is driven by the high- data.

We provide 95% CL upper limits or ranges for βiso, cosΔ, and αℛℛ in Table 16. With Planck TT+lowP, the constraints on the non-adiabatic contribution to the temperature variance, 1−αℛℛ(2,2500), are (−1.5%, 1.9%), (−4.0%, 1.4%), and (−2.3%, 2.4%) in the ADI+CDI, ADI+NDI, and ADI+NVI cases, respectively.21 With Planck TT, TE, EE+lowP these tighten to (0.1%, 1.5%), (−0.1%, 2.2%), and (−2.0%, 0.8%). In the ADI+CDI case, zero is not in the 95% CL interval, but this should not be considered a detection of non-adiabaticity. For example, as mentioned above, (𝒫ℐℐ(1),𝒫ℐℐ(2))=(0,0)\hbox{$(\mathcal{P}_{\cal I\cal I}^{(1)},\,\mathcal{P}_{\cal I\cal I}^{(2)}) = (0,0)$} is in the 68% CL region, and the best-fit values are (𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)}$},𝒫ℐℐ(2)\hbox{$\,\mathcal{P}_{\cal I\cal I}^{(2)}$}) = (1.0 × 10-13, 3.5 × 10-9). Moreover, the improvement in χ2 with respect to the adiabatic model is only 5.3 with 3 extra parameters, so this is not a significant improvement of fit. Indeed, for all generally-correlated mixed models the improvement in χ2 is very small. In particular, with Planck TT+lowP it does not even exceed the number of extra degrees of freedom, which is three (see Table 16).

Finally, we checked whether there is any Bayesian evidence for the presence of generally-correlated adiabatic and isocurvature modes. In all cases and with all data combinations studied, the Bayesian model comparison supports the null hypothesis, i.e., adiabaticity. Indeed, the logarithm of the evidence ratio is lnB = ln(PISO/PADI) < −5 (i.e., odds of much greater than 150:1 in favour of pure adiabaticity within Planck’s accuracy and given the parameterization and prior ranges used in our analysis), except for ADI+NDI with Planck TT+lowP+lensing, for which the evidence ratio is slightly larger, −4.6, corresponding to odds of 1:100 for the ADI+NDI model compared to the pure adiabatic model.

thumbnail Fig. 48

Constraints on selected “standard” cosmological parameters with Planck TT+lowP data for the generally-correlated ADI+CDI (black), ADI+NDI (red), and ADI+NVI (blue) models compared to the pure adiabatic case (ADI, green dashed curves).

11.3. Robustness of the determination of standard cosmological parameters

Another outcome of our analysis is the robustness of the determination of the standard cosmological parameters against assumptions on initial conditions. Figure 48 shows the 1D marginalized posteriors for several cosmological parameters (not all independent of each other) with the Planck TT+lowP data alone. For the first time, we observe that in the presence of one generally-correlated isocurvature mode (CDI, NDI, or NVI), predictions for these parameters remain very stable with respect to the pure adiabatic case. Except for the ADI+NDI case, the posteriors neither broaden nor shift significantly. A small broadening is only observed in the sound horizon angle θMC, which is naturally the most sensitive parameter to tiny disturbances of the acoustic peak structure. In the ADI+NDI case, the peak of the posterior distribution for some parameters shifts slightly, but the largest shift (for Ωch2) is less than 1σ.

It is striking that a scale-invariant adiabatic spectrum (nℛℛ = 1) is excluded at many σ even when isocurvature modes are allowed: at 4.7σ (ADI+CDI), 5.0σ (ADI+NDI), and 5.4σ (ADI+NVI). This illustrates how much the constraining power of the CMB has improved. With WMAP data, there was still a strong degeneracy between, for example, the primordial isocurvature fraction and the adiabatic spectral index (Valiviita & Giannantonio 2009; Savelainen et al. 2013). This degeneracy nearly disappears with Planck TT+lowP, and even more so with Planck TT, TE, EE+lowP, as shown in the upper panel of Fig. 49. Contours in the (nℛℛ, cosΔ) space also shrink considerably, with some correlation remaining between these parameters in the ADI+CDI and ADI+NVI cases (Fig. 49, lower panel).

11.4. CDI and primordial tensor perturbations

A primordial tensor contribution adds extra temperature angular power at low multipoles, where the adiabatic base ΛCDM model predicts slightly more power than seen in the data. Hence allowing for a nonzero tensor-to-scalar ratio r might tighten the constraints on positively-correlated isocurvature, but degrade them in negatively-correlated models. We test how treating r as a free parameter affects the constraints on isocurvature and how allowing for the generally-correlated CDI mode affects the constraints on r. These cases are denoted as “CDI+r”. For comparison, we examine the pure adiabatic case in the same parameterization, and call it “ADI+r”. We also consider another approach where we fix r = 0.1. These cases are named “CDI+r = 0.1” and “ADI+r = 0.1”.

thumbnail Fig. 49

Dependence of the determination of the adiabatic spectral index nℛℛ (called ns in the other sections of this paper) on the primordial isocurvature fraction βiso and correlation fraction cosΔ, with Planck TT+lowP data (dashed contours) and with Planck TT, TE, EE+lowP data (shaded regions).

In the pure adiabatic case (where the curvature and tensor perturbations stay constant on super-Hubble scales), the primordial r is the same as the tensor-to-scalar ratio at the Hubble radius exit of perturbations during inflation, which we call ˜r\hbox{$\tilde r$}. However, in the presence of an isocurvature component, \hbox{${\cal P}_{\cal R \cal R}$} is not constant in time even on super-Hubble scales (García-Bellido & Wands 1996). Instead, the isocurvature component may source \hbox{${\cal P}_{\cal R \cal R}$}, for example if the background trajectory in the field space is curved between Hubble exit and the end of inflation (Langlois 1999; Langlois & Riazuelo 2000; Gordon et al. 2001; Amendola et al. 2002). As a result, we will have at the primordial time 𝒫ℛℛ=˜𝒫ℛℛ/(1cos2Δ)\hbox{${\cal P}_{\cal R \cal R} = \tilde{\cal P}_{\cal R \cal R} / (1-\cos^2\Delta)$}, where ˜𝒫ℛℛ\hbox{$\tilde{\cal P}_{\cal R \cal R}$} is the curvature power at Hubble exit. That is, by the primordial time the curvature perturbation power is larger than at the Hubble radius exit time (Bartolo et al. 2001; Wands et al. 2002; Byrnes & Wands 2006). Thus we find a relation (Savelainen et al. 2013; Valiviita et al. 2012; Kawasaki & Sekiguchi 2008): r=(1cos2Δ)˜r,\begin{equation} r = \left(1-\cos^2\!\Delta\right) \tilde r, \label{eq:rvstilder} \end{equation}(141)i.e., the tensor-to-scalar ratio at the primordial time (r) is smaller than the ratio at the Hubble radius exit time (˜r\hbox{$\tilde r$}).

The derivation of Eq. (141) assumes that the adiabatic and isocurvature perturbations are uncorrelated at Hubble radius exit (cos˜Δ=0\hbox{$\cos\tilde\Delta = 0$}), and that all the possible primordial correlation (cosΔ ≠ 0) appears from the evolution of super-Hubble perturbations between Hubble exit and the primordial time. This is true to leading order in the slow-roll parameters, but inflationary models that break slow roll might produce perturbations that are strongly correlated already at the Hubble radius exit time. In these cases the correlation would depend on the details of the particular model, such as the detailed shape of the potential and the interactions of the fields. However, a generic prediction of slow-roll inflation is that, at Hubble radius exit, the cross-correlation ˜𝒫ℛℐ\hbox{$\tilde{\cal P}_{\cal R \cal I}$} is very weak, and indeed is of the order of the slow-roll parameters compared to the auto-correlations ˜𝒫ℛℛ\hbox{$\tilde{\cal P}_{\cal R \cal R}$} and ˜𝒫ℐℐ\hbox{$\tilde{\cal P}_{\cal I \cal I}$} (see, e.g., Byrnes & Wands 2006). Thus, for slow-roll models, |cos˜Δ|=𝒪(slow-rollparameters)1\hbox{$|\cos\tilde\Delta| = {\cal O}\mbox{(slow-roll parameters)}\ll 1$}.

In our analysis, we fix the tensor spectral index by the leading-order inflationary consistency relation, which now reads (Wands et al. 2002) nt=˜r8=r8(1cos2Δ)·\begin{equation} n_\mathrm{t} = -\frac{\tilde r}{8} = -\frac{r}{8\left(1-\cos^2\Delta\right)}\cdot \end{equation}(142)Assuming a uniform prior for r would lead to huge negative nt whenever cos2Δ was close to one. Therefore, when studying the CDI+r case we assume a uniform prior on ˜r\hbox{$\tilde r$} at k = 0.05 Mpc-1 (for details, see Savelainen et al. 2013).

Surprisingly, allowing for a generally-correlated CDI mode (i.e., three extra parameters) hardly changes the constraints on r from those obtained in the pure adiabatic model. In Fig. 50 we demonstrate this in a “standard” plot of r0.002 versus adiabatic spectral index.

From Table 16 we notice that, with Planck TT+lowP and TT, TE, EE+lowP, fixing r to 0.1 tightens constraints on the primordial isocurvature fraction at large scales. This is as we expected, since both tensor and isocurvature perturbations add power at low , and the data do not prefer this. However, the shapes of the tensor spectrum and correlation spectrum are such that negative correlation cannot efficiently cancel the unwanted extra power over all scales produced by tensor perturbations (at ≲ 70). Therefore, the correlation fraction cosΔ is almost unaffected. However, when we allow r to vary, the cancelation mechanism works to some degree when using Planck TT+lowP data, leading to more negative cosΔ than without r: with varying r we have cosΔ in the range (−0.43, 0.20), while without r it is in (−0.30, 0.20), at 95% CL. As there is now some cancellation of power at large scales, the constraint on βiso(klow) weakens slightly from 0.041 without r to 0.043 with r. On the other hand, the high- polarization data constrain the correlation to be so close to zero that with Planck TT, TE, EE+lowP the results for cosΔ with and without r are almost identical.

The mean value of cosΔ in the CDI+r cases is −0.071 (TT+lowP) and −0.076 (TT, TE, EE+lowP). Therefore, 1−cos2Δ ≈ 0.99, and so we do not expect a large difference between the primordial r and the Hubble radius exit value ˜r\hbox{$\tilde r$}. The smallness of the difference is evident in Table 15. To summarize, CDI hardly affects the determination of r from the Planck data, and allowing for tensor perturbations hardly affects the determination of the non-adiabaticity parameters.

thumbnail Fig. 50

68% and 95% CL constraints on the primordial adiabatic spectral index nℛℛ and the primordial tensor-to-scalar ratio r (more accurately, in the CDI+r model, the primordial tensor-to-curvature power ratio) at k = 0.002 Mpc-1. Filled contours are for generally-correlated ADI+CDI and solid contours for the pure adiabatic model.

Table 15

95% CL upper bounds on the tensor-to-scalar ratio (actually the tensor-to-curvature power ratio) at the primordial time, r, and earlier, at the Hubble radius exit time during inflation, ˜r\hbox{$\tilde r$}, at k = 0.05 Mpc-1.

Table 16

Constraints on mixed adiabatic and isocurvature models.

11.5. Special CDI cases

thumbnail Fig. 51

Uncorrelated ADI+CDI with nℐℐ = 1 (“axion”).

Next we study three one-parameter CDI extensions to the adiabatic model. In all these extensions the isocurvature mode modifies only the largest angular scales, since we either fix nℐℐ to unity (“axion”) or to the adiabatic spectral index (“curvaton I/II”). As can be seen from Fig. 43, the polarization E mode at multipoles ≳ 200 will not be significantly affected by this type of CDI mode. Therefore, these models are much less sensitive to residual systematic effects in the high- polarization data than the generally-correlated models.

11.5.1. Uncorrelated ADI+CDI (“Axion”)

We start with an uncorrelated mixture of adiabatic and CDI modes (\hbox{$\mathcal{P}_{\cal R\cal I}=0$}) and make the additional assumption that 𝒫ℐℐ(2)=𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(2)} = \mathcal{P}_{\cal I\cal I}^{(1)}$}, i.e., we assume unit isocurvature spectral index, nℐℐ = 1. Constraints in the (nℛℛ,βiso) plane are presented in Fig. 51. This model is the only case for which our new results do not improve over bounds from PCI13. At kmid = 0.050 Mpc-1, we find βiso< 0.038 (95% CL, TT, TE, EE+lowP; see Table 16), compared with βiso< 0.039 using Planck 2013 and low- WMAP data. This is not surprising, since fixing nℐℐ to unity implies that bounds are dominated by measurements on very large angular scales, ≲ 30, as can easily be understood from Fig. 43. Hence the results are insensitive to the addition of better high- temperature data, or new high- polarization data.

We summarized in PCI13 why an uncorrelated CDI mode with nℐℐ ≈ 1 can be produced in axion models under a number of restrictive assumptions: the Peccei-Quinn symmetry should be broken before inflation; it should not be restored by quantum fluctuations of the inflaton or by thermal fluctuations when the Universe reheats; and axions produced through the misalignment angle should contribute to a sizable fraction (or all) of the dark matter. Under all of these assumptions, limits on βiso can be used to infer a bound on the energy scale of inflation, using Eq. (73) of PCI13. This bound is strongest when all the dark matter is assumed to be in the form of axions. In that case, the limit on βiso(kmid) for Planck TT, TE, EE+lowP gives Hinf<0.86×107GeV(fa1011GeV)0.408(95%CL),\begin{equation} H_\mathrm{inf} < 0.86 \times 10^7\,\mathrm{GeV} \left(\frac{f_a}{10^{11}\,\mathrm{GeV}} \right)^{0.408} \quad \mbox{ (95\% CL)}, \end{equation}(143)where Hinf is the expansion rate at Hubble radius exit of the scale corresponding to kmid = 0.050 Mpc-1 and fa is the Peccei-Quinn symmetry-breaking energy scale.

11.5.2. Fully correlated ADI+CDI (“Curvaton I”)

Another interesting special case of mixed adiabatic and CDI (or BDI) perturbations is a model where these perturbations are primordially fully correlated and their power spectra have the same shape. These cases are obtained by setting 𝒫ℛℐ(1)=(𝒫ℛℛ(1)𝒫ℐℐ(1))1/2\hbox{$\mathcal{P}_{\cal R\cal I}^{(1)} = (\mathcal{P}_{\cal R\cal R}^{(1)} \mathcal{P}_{\cal I\cal I}^{(1)} )^{1/2}$}, which, by condition (136), implies that the corresponding statement holds at scale k2 and indeed at any scale. In addition, we set 𝒫ℐℐ(2)=(𝒫ℛℛ(2)/𝒫ℛℛ(1))𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(2)} = (\mathcal{P}_{\cal R\cal R}^{(2)} / \mathcal{P}_{\cal R \cal R}^{(1)}) \mathcal{P}_{\cal I\cal I}^{(1)}$}, i.e., nℐℐ = nℛℛ. From this it follows that βiso is scale-independent. Therefore, this model has only one primary non-adiabaticity parameter, 𝒫ℐℐ(1)\hbox{$\mathcal{P}_{\cal I\cal I}^{(1)}$}.

A physically motivated example of this type of model is the curvaton model (Mollerach 1990; Linde & Mukhanov 1997; Enqvist & Sloth 2002; Moroi & Takahashi 2001; Lyth & Wands 2002; Lyth et al. 2003) with the following assumptions. (1) The average curvaton field value \hbox{$\bar\chi_\ast$} is sufficiently below the Planck mass when cosmologically interesting scales exit the Hubble radius during inflation. (2) At Hubble radius exit, the curvature perturbation from the inflaton is negligible compared to the perturbation caused by the curvaton. (3) The same is true for any inflaton decay products after reheating. This means that, after reheating, the Universe is homogeneous, except for the spatially varying entropy (i.e., isocurvature perturbation) due to the curvaton field perturbations. (4) Later, CDM is created from the curvaton decay and baryon number after curvaton decay. This corresponds to case 4 presented in Gordon & Lewis (2003). (5) The curvaton contributes a significant amount to the energy density of the Universe at the time of the curvaton’s decay to CDM, i.e., the curvaton decays late enough. (6) The energy density of curvaton particles possibly produced during reheating should be sufficiently low (Bartolo & Liddle 2002; Linde & Mukhanov 2006). (7) The small-scale variance of curvaton perturbations, Δs2=δχ2s/χ̅2\hbox{$\Delta_s^2 = \langle \delta\chi^2 \rangle_s / \bar\chi^2$}, is negligible, so that it does not significantly contribute to the average energy density on CMB scales; see Eq. (102) in Sasaki et al. (2006). The last two conditions are necessary in order to have an almost-Gaussian curvature perturbation, as required by the Planck observations. Indeed, if they are not valid, a large fNLlocal\hbox{$f_{\mathrm{NL}}^\mathrm{local}$} follows, as discussed below. The conditions (6) and (7) are related, since curvaton particles would add a homogeneous component to the average energy density on large scales, and hence we can describe their effect by Δs2=ρχ,particles/ρχ̅,field\hbox{$\Delta_s^2 = \rho_{\chi,\,\mathrm{particles}} / \rho_{\bar\chi,\,\mathrm{field}}$}, where \hbox{$\rho_{\bar\chi,\,\mathrm{field}}$} is the average energy density of the classical curvaton field on large scales; see Eq. (98) in Sasaki et al. (2006). Then the total energy density carried by the curvaton will be \hbox{$\bar\rho_\chi = \rho_{\bar\chi,\,\mathrm{field}} + \rho_{\chi,\,\mathrm{particles}}$}.

The amount of isocurvature and non-Gaussianity present after curvaton decay depends on the “curvaton decay fraction” rD=3ρ̅χ3ρ̅χ+4ρ̅radiation\begin{equation} r_{\rm D} = \frac{3\bar\rho_\chi}{3\bar\rho_\chi + 4\bar\rho_{\mathrm{radiation}}} \label{eq:curvatondecayfraction} \end{equation}(144)evaluated at curvaton decay time. If conditions (6) and (7) do not hold, then the isocurvature perturbation disappears.22

thumbnail Fig. 52

Fully correlated ADI+CDI with nℐℐ = nℛℛ (“curvaton I”). Since the spectral indices are equal, the primordial isocurvature fraction βiso is scale-independent.

The curvaton scenario presented here is one of the simplest to test against observations. It should be noted that at least the conditions (1)(5) listed at the beginning of this subsection should be satisfied simultaneously. Indeed, if we relax some of these conditions, almost any type of correlation can be produced. For example, the relative correlation fraction can be written as cosΔ=λ/(1+λ)\hbox{$\cos\Delta = \sqrt{\lambda / (1+\lambda)}$}, where λ=(8/9)rD2ϵ(MPl/χ̅)2\hbox{$\lambda = (8/9)r_{\rm D}^2 \epsilon_\ast (M_\mathrm{Pl} / \bar\chi_\ast)^2$}. Therefore, the model is fully correlated only if λ ≫ 1. If the slow-roll parameter ϵ is very close to zero or the curvaton field value \hbox{$\bar\chi_\ast$} is large compared to the Planck mass, this model leads to almost uncorrelated perturbations.

As seen in Fig. 52 and Table 16, the upper bound on the primordial isocurvature fraction in the fully-correlated ADI+CDI model weakens slightly when we add the Planck lensing data to Planck TT+lowP, whereas adding high- TE, EE tightens the upper bound moderately. With all of these three data combinations, the pure adiabatic model gives an equally good best-fit χ2 as the fully-correlated ADI+CDI model. Bayesian model comparison strengthens the conclusion that the data disfavour this model with respect to the pure adiabatic model.

The isocurvature fraction is connected to the curvaton decay fraction in Eq. (144) by βiso9(1rD)2rD2+9(1rD)2\begin{equation} \beta_{\mathrm{iso}} \approx \frac{9(1-r_{\rm D})^2}{r_{\rm D}^2 + 9(1-r_{\rm D})^2} \label{eq:betaisocurvatonI} \end{equation}(145)(see case 4 in Gordon & Lewis 2003). We can convert the constraints on βiso from Table 16 into constraints on rD and further into the non-Gaussianity parameter assuming a quadratic potential for the curvaton and instantaneous decay23(Sasaki et al. 2006): fNLlocal=(1+Δs2)54rD535rD6·\begin{equation} f_{\mathrm{NL}}^\mathrm{local}=\left(1+\Delta_s^2\right)\frac{5}{4 r_{\rm D}} - \frac{5}{3} - \frac{5 r_{\rm D}}{6}\cdot \label{eq:fnlcurvaton} \end{equation}(146)If conditions (6) and (7) hold, i.e., Δs2=0\hbox{$\Delta_s^2 = 0$}, as implicitly assumed, e.g., in Bartolo et al. (2004a,b), then the smallest possible value of fNLlocal\hbox{$f_{\mathrm{NL}}^\mathrm{local}$} is −5 / 4, which is obtained when rD = 1, and Eqs. (145) and (146) yield for the various Planck data sets (at 95% CL):24TT+lowP:βiso<0.00180.9860<rD11.250fNLlocal<1.220,TT+lowP+lensing:βiso<0.00220.9845<rD11.250fNLlocal<1.217,TT,TE,EE+lowP:βiso<0.00130.9882<rD11.250fNLlocal<1.225.\begin{eqnarray} \mbox{TT+lowP: } & \beta_{\mathrm{iso}} < 0.0018 \Rightarrow 0.9860 < r_{\rm D} \le 1 \nonumber \\ & \Rightarrow -1.250 \le f_{\mathrm{NL}}^{\mathrm{local}} < -1.220, \\ \mbox{TT+lowP+lensing: } & \beta_{\mathrm{iso}} < 0.0022 \Rightarrow 0.9845 < r_{\rm D} \le 1 \nonumber\\ & \Rightarrow -1.250 \le f_{\mathrm{NL}}^{\mathrm{local}} < -1.217, \\ \mbox{ TT, \,TE, \,EE+lowP: } & \beta_{\mathrm{iso}} < 0.0013 \Rightarrow 0.9882 < r_{\rm D} \le 1 \nonumber \\ & \Rightarrow -1.250 \le f_{\mathrm{NL}}^{\mathrm{local}} < -1.225. \end{eqnarray}Thus the results for the simplest curvaton model remain unchanged from those presented in PCI13. In in order to produce almost purely adiabatic perturbations, the curvaton should decay when it dominates the energy density of the Universe (rD> 0.98), and the non-Gaussianity parameter is constrained to close to its smallest possible value (5/4<fNLlocal<1.21\hbox{$-5/4 < f_{\mathrm{NL}}^{\mathrm{local}}< -1.21$}), which is consistent with the result fNLlocal=2.5±5.7\hbox{$f_\mathrm{NL}^\mathrm{local}=2.5 \pm 5.7$} (68% CL, from T only) found in Planck Collaboration XVII (2016).

thumbnail Fig. 53

Fully anticorrelated ADI+CDI with nℐℐ = nℛℛ (“curvaton II”).

11.5.3. Fully anticorrelated ADI+CDI (“Curvaton II”)

The curvaton scenario or some other mechanism could also produce 100% anticorrelated perturbations, with nℐℐ = nℛℛ. The constraints in the (nℛℛ,βiso) plane are presented in Fig. 53. Examples of this kind of model are provided by cases 2, 3, and 6 in Gordon & Lewis (2003). These lead to a fixed, large amount of isocurvature, e.g., in case 2 to βiso = 9 / 10, and are hence excluded by the data at very high significance. However, case 9 in Gordon & Lewis (2003), with a suitable rD (i.e., rD>Rc, where Rc = ρc/ (ρc + ρb)), leads to fully-anticorrelated perturbations and might provide a good fit to the data. In this case CDM is produced by curvaton decay while baryons are created earlier from inflaton decay products and do not carry a curvature perturbation. We obtain a very similar expression to Eq. (145), namely βiso9(1rD/Rc)2rD2+9(1rD/Rc)2·\begin{equation} \beta_{\mathrm{iso}} \approx \frac{9(1-r_{\rm D}/R_\mathrm{c})^2}{r_{\rm D}^2 + 9(1-r_{\rm D}/R_\mathrm{c})^2}\cdot \label{eq:betaisocurvatonII} \end{equation}(150)We convert this to an approximate constraint on rD by fixing Rc to its best-fit value, Rc = 0.8437 (Planck TT+lowP), within this model. The results for the various Planck data sets are: TT+lowP:βiso<0.00640.8437<rD<0.86320.9379<fNLlocal<0.8882,TT+lowP+lensing:βiso<0.00520.8437<rD<0.86120.9329<fNLlocal<0.8882,TT,TE,EE+lowP:βiso<0.00080.8437<rD<0.85050.9056<fNLlocal<0.8882.\begin{eqnarray} &&\mbox{TT+lowP: } \beta_{\mathrm{iso}} < 0.0064 \Rightarrow 0.8437 < r_{\rm D} < 0.8632 \nonumber \\ & &\hspace*{3mm}\Rightarrow -0.9379 < f_{\mathrm{NL}}^{\mathrm{local}} < -0.8882, \\ &&\mbox{TT+lowP+lensing: } \beta_{\mathrm{iso}} < 0.0052 \Rightarrow 0.8437< r_{\rm D} < 0.8612 \nonumber \\ && \hspace*{3mm}\Rightarrow -0.9329 < f_{\mathrm{NL}}^{\mathrm{local}}< -0.8882, \\ &&\mbox{ TT, TE, EE+lowP: } \beta_{\mathrm{iso}} < 0.0008 \Rightarrow 0.8437 < r_{\rm D} < 0.8505 \nonumber \\ &&\hspace*{3mm} \Rightarrow -0.9056 < f_{\mathrm{NL}}^{\mathrm{local}} < -0.8882. \end{eqnarray}After all the tests conducted in this section, both for the generally-correlated CDI, NDI, and NVI cases as well as for the special CDI cases, we conclude that within the spatially flat base ΛCDM model, the initial conditions of perturbations are consistent with the hypothesis of pure adiabaticity, a conclusion that is also supported by the Bayesian model comparison. Moreover, Planck Collaboration XVII (2016) reports a null detection of isocurvature non-Gaussianity, with polarization improving constraints significantly.

12. Statistical anisotropy and inflation

A key prediction of standard inflation, which in the present context includes all single field models of inflation as well as many multi-field models, is that the stochastic process generating the primordial cosmological perturbations is completely characterized by its power spectrum, constrained by statistical isotropy to depend only on the multipole number . This statement applies at least to the accuracy that can be probed using the CMB given the limitations imposed by cosmic variance, since all models exhibit some level of non-Gaussianity. Nevertheless, more general Gaussian stochastic processes can be envisaged for which one or more special directions on the sky are singled out, so that the expectation values for the temperature multipoles take the form aℓmT(amT)=Cℓm;mTT,\begin{equation} \left< a_{\ell m}^T ~\left({a_{\ell'm'}^T}\right)^* \right> =C_{\ell m; \ell'm'}^{\rm TT}, \label{eq:gencovariance} \end{equation}(154)rather than the very special form aℓmT(amT)=CTTδℓ,δm,m,\begin{equation} \left< a_{\ell m}^T ~\left(a_{\ell'm'}^T\right)^* \right> =C_\ell ^{\rm TT}\, \delta _{\ell , \ell '}\, \delta _{m, m'}, \end{equation}(155)which is the only possibility consistent with statistical isotropy.

The most general form for a Gaussian stochastic process on the sphere violating the hypothesis of statistical isotropy in Eq. (154) is too broad to be useful, given that we have only one sky to analyse. For <max, there are O(max2)\hbox{$O(\ell _{\mathrm{max}}^2)$} multipole expansion coefficients, compared with O(max4)\hbox{$O(\ell _{\mathrm{max}}^4)$} model parameters. Therefore, in order to make some progress on testing the hypothesis of statistical isotropy, we must restrict ourselves to examining only the simplest models violating statistical isotropy, for which the available data can establish meaningful constraints and for which one can hope to find a simple theoretical motivation.

12.1. Asymmetry: observations versus model building

In one simple class of statistically anisotropic models, we start with a map produced by a process respecting statistical isotropy, which becomes modulated by another field in the following manner to produce the observed sky map: δTsky(Ω̂)=(1+M(Ω̂))δTs-i(Ω̂),\begin{equation} \delta T_{\mathrm{sky}}(\vec{\hat{\Omega}}) = \Bigl(1 + M(\vec{\hat{\Omega}})\Bigr) ~\delta T_{\text{s-i}}(\vec{\hat{\Omega}}), \label{ModulationAnsatz} \end{equation}(156)where \hbox{$\vec{\hat{\Omega}}$} denotes a position on the celestial sphere and \hbox{$\delta T_{\text{s-i}}(\vec{\hat{\Omega}})$} is the outcome of the underlying statistically isotropic process before modulation. Roughly speaking, where the modulating field \hbox{$M(\vec{\hat{\Omega}})$} is positive, power on scales smaller than the scale of variation of \hbox{$M(\vec{\hat{\Omega}})$} is enhanced, whereas where \hbox{$M(\vec{\hat{\Omega}})$} is negative, power is suppressed. We refer to this as a “power asymmetry.” If \hbox{$M(\vec{\hat{\Omega}}) = A\vec{\hat{d}}\cdot\vec{\hat{\Omega}}$}, we have a model of dipolar modulation with amplitude A and direction \hbox{$\vec{\hat{d}}$}, but higher-order or mixed modulation may also be considered, such as a quadrupole modulation or modulation by a scale-invariant field \hbox{$M(\vec{\hat{\Omega}})$}, to name just a few special cases. Alternatively, and more closely tied to physical models, we can consider modulations of the position- or k-space fluctuations.

In Planck Collaboration XXIII (2014) and Planck Collaboration XVI (2016), the details of constructing efficient estimators for statistical anisotropy, in particular in the presence of realistic data involving sky cuts and possibly incompletely removed foreground contamination, are considered in depth. In addition, the question of the statistical significance of any detected “anomalies” from the expectations of base ΛCDM is examined in detail. Importantly, in the absence of a particular inflationary model for such an observed anomaly, the significance should be corrected for the “multiplicity of tests” that could have resulted in similarly-significant detections (i.e., for the “look elsewhere effect”), although applying such corrections can be ambiguous. In this paper, however, we consider only forms of statistical anisotropy that are predicted by specific inflationary models, and hence such corrections will not be necessary.

Several important questions can be posed regarding the link between statistical isotropy and inflation. In particular, we can ask the following questions. (1) Does a statistically significant finding of a violation of statistical isotropy falsify inflation? (2) If not, what sort of non-standard inflation could produce the required departure from statistical isotropy? (3) What other perhaps non-inflationary models could also account for the violation of statistical isotropy? In this section, we begin to address these questions by assessing the viability of an inflationary model for dipolar asymmetry, as well as by placing new limits on the presence of quadrupolar power asymmetry.

For the case of the observed dipolar asymmetry examined in detail in Planck Collaboration XVI (2016), there are two aspects that make inflationary model building difficult. First is the problem of obtaining a significant amplitude of dipole modulation. In Planck Collaboration XVI (2016) the asymmetry was found to have amplitude A ≈ 6−7% on scales 2 ≤ ≤ 64. This compares with the expected value of A = 2.9% on these scales due to cosmic variance in statistically isotropic skies. One basic strategy for incorporating the violation of statistical isotropy into inflation is to consider some form of multi-field inflation and use one of the directions orthogonal to the direction of slow roll as the field responsible for the modulation. Obtaining the required modulation is problematic because most extra fields in multi-field inflation become disordered in a nearly scale-invariant way, just like the fluctuations in the field parallel to the direction of slow roll. What is needed resembles a pure gradient with no fluctuations of shorter wavelength. In Liddle & Cortês (2013) it was proposed that such a field could be produced using the supercurvature mode of open inflation. (See however the discussion in Kanno et al. 2013.) Also, in order to respect the fNL constraints, one must avoid that the modulating field leave a direct imprint on the temperature anisotropy.

The second aspect which makes model building difficult for dipolar asymmetry is that the measured amplitude is strongly scale dependent, and on scales ≳ 100 no significant detection of a dipolar modulation amplitude is made (Planck Collaboration XVI 2016), once our proper motion has been taken into account (Planck Collaboration XXVII 2014). On the other hand, the simplest models are scale-free and produce statistical anisotropy of the type described by the ansatz in Eq. (156), for which the bulk of the statistical weight should be detected at the resolution of the survey. To resolve this difficulty, Erickcek et al. (2009) proposed modulating CDI fluctuations generated within the framework of a curvaton scenario, because, unlike adiabatic perturbations, CDI perturbations entering the Hubble radius before last scattering contribute negligibly to the CMB fluctuations (recall Fig. 43).

The situation for the quadrupolar power asymmetry is different from the dipolar case in that no detection is currently claimed. Model building is easier than the dipolar case since no pure gradient modes are required, but also more difficult in that anisotropy during inflation is needed. While the isotropy of the recent expansion of the Universe (i.e., since the CMB fluctuations were first imprinted) is tightly constrained, bounds on a possible anisotropic expansion at early times are much weaker. Ackerman et al. (2007) proposed using constraints on the quadrupolar statistical anisotropy of the CMB to probe the isotropy of the expansion during inflation – that is, during the epoch when the perturbations now seen in the CMB first exited the Hubble radius. Assuming an anisotropic expansion during inflation, Ackerman et al. (2007) computed its impact on the three-dimensional power spectrum on super-Hubble scales by integrating the mode functions for the perturbations during inflation and beyond. Several sources of such anisotropy have been proposed, such as vector fields during inflation (Dimastrogiovanni et al. 2010; Soda 2012; Maleknejad et al. 2013; Schmidt & Hui 2013; Bartolo et al. 2013; Naruko et al. 2015), or an inflating solid or elastic medium (Bartolo et al. 2013).

12.2. Scale-dependent modulation and idealized estimators

The ansatz in Eq. (156) expressed in angular space may be rewritten in terms of the multipole expansion and generalized to include scale-dependent modulation by means of Wigner 3j symbols: aℓmTamT=L=0M=L+LC;;L,MTT(LmmM).\begin{equation} \left< a_{\ell m}^T~a_{\ell'm'}^T \right> = \sum _{L=0}^\infty \sum _{M=-L}^{+L} C_{\ell; \ell'; L, M}^{\rm TT} \left( \begin{matrix} \ell & \ell' & L\\ m & m' & M \end{matrix} \right) .\label{eq:gencovar} \end{equation}(157)Because of the symmetry of the left-hand side, the coefficients C;;L,MTT\hbox{$C_{\ell; \ell'; L, M}^{\rm TT}$} acquire a phase (− 1) + ′ + L under interchange of and . This is the most general form consistent with the hypothesis of Gaussianity. The usual isotropic power spectrum, which is the generic prediction of simple models of inflation, includes only the L = 0 term, where C;;0,0TT=CTT\hbox{$C_{\ell; \ell'; 0, 0}^{\rm TT}=C_\ell^{\rm TT}$} and the Wigner 3j symbol provides the δℓ,δm,m factor. The coefficients C;;L,MTT\hbox{$C_{\ell; \ell'; L, M}^{\rm TT}$} with L> 0 introduce statistical anisotropy.

If we assume that there is a common vector (corresponding to L = 1 on the celestial sphere) that defines the direction of the anisotropy of the power spectrum for all the terms of L = 1, we may adopt a more restricted ansatz for the bipolar modulation, so that C;;1,MTT=Cℓ,1XM(1),\begin{equation} C_{\ell; \ell'; 1, M}^{\rm TT} = C^1_{\ell,\ell'} X_M^{(1)}, \label{eq:dipolarcovar} \end{equation}(158)where we assume that XM is normalized (i.e., MXMXM = 1). In such a model, supposing that Cℓ,1\hbox{$C^1_{\ell,\ell'}$} is theoretically determined, but the orientation of the unit vector XM is random and isotropically distributed on the celestial sphere, we may construct the following quadratic estimator for the direction: XM(L)=ℓ,m,mwℓ,;L(2L+1)(C)1/2(C)1/2×()aℓmTamT,\begin{eqnarray} \begin{aligned} X_M^{(L)} &= \sum_{\ell,m}\sum_{\ell',m'} \frac{w_{\ell,\ell';L}}{(2L+1)(C_\ell)^{1/2}~(C_{\ell'})^{1/2}}\\ &\quad\times \left( \begin{matrix} \ell & \ell' & L\\ m & m' & M \end{matrix} \right) ~~ a_{\ell m}^T~a_{\ell 'm'}^T, \end{aligned} \end{eqnarray}(159)where the weights for the unbiased minimum variance estimator are given by wℓ,;L=Cℓ,L(ℓ,Cℓ,L)-1.\begin{equation} w_{\ell , \ell';L}= C_{\ell,\ell'}^L \left( \sum _{\ell,\ell'} C_{\ell,\ell'}^L \right) ^{-1}. \end{equation}(160)This construction, which for the L = 1 case may be found in Moss et al. (2011) and Planck Collaboration XVI (2016), may be readily generalized to L> 1 in the above way.

12.3. Constraining inflationary models for dipolar asymmetry

In this section, we confront with Planck data the modulated curvaton model of Erickcek et al. (2009), which attempts to explain the observed large-scale power asymmetry via a gradient in the background curvaton field. In this model, the curvaton decays after CDM freeze out, which results in a nearly-scale-invariant isocurvature component between CDM and radiation. In the viable version of this scenario, the curvaton contributes negligibly to the CDM density. A long-wavelength fluctuation in the curvaton field initial value σ is assumed, with amplitude Δσ across our observable volume. This modulates the curvaton isocurvature fluctuations according to Sσγ ≈ 2δσ/σ. The curvaton produces all of the final CDI fluctuations, which are nearly scale invariant, as well as a component of the final adiabatic fluctuations. Hence both of these components will be modulated, and the parameter space of the model will be constrained by observations of the power asymmetry on large and small scales, as well as the full-sky CDI fraction. In practice, the very tight constraints on small scale power asymmetry obtained in Planck Collaboration XVI (2016) imply a small curvaton adiabatic component, which implies that the CDI and adiabatic fluctuations are only weakly correlated. This model easily satisfies constraints due to the CMB dipole, quadrupole, and non-Gaussianity (Erickcek et al. 2009).

There are two main parameters that we constrain for this model. First, the fraction of adiabatic fluctuations due to the curvaton ξ is defined as ξΣσ2𝒫σ𝒫inf+Σσ2𝒫σ·\begin{equation} \xi \equiv \frac{\Sigma_\sigma^2{\cal P}_\sigma} {{\cal P}_{{\cal R}_{\mathrm{inf}}} + \Sigma_\sigma^2{\cal P}_\sigma}\cdot \end{equation}(161)Here,