Fast and easy super-sample covariance of large-scale structure observables

Fabien Lacasa; Julien Grain

doi:10.1051/0004-6361/201834343

Volume 624 (April 2019)

A&A, 624 (2019) A61

Issue		A&A Volume 624, April 2019


Article Number		A61
Number of page(s)		14
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/201834343
Published online		12 April 2019

Fast and easy super-sample covariance of large-scale structure observables^⋆

Fabien Lacasa¹ and Julien Grain²

¹ Département de Physique Théorique and Center for Astroparticle Physics, Université de Genève, 24 quai Ernest Ansermet, 1211 Geneva, Switzerland
e-mail: fabien.lacasa@unige.ch
² Institut d’Astrophysique Spatiale, CNRS (UMR8617) and Université Paris-Sud 11, Bâtiment 121, 91405 Orsay, France

Received: 28 September 2018
Accepted: 15 February 2019

Abstract

We present a numerically cheap approximation to super-sample covariance (SSC) of large-scale structure cosmological probes, first in the case of angular power spectra. No new elements are needed besides those used to predict the considered probes, thus relieving analysis pipelines from having to develop a full SSC modeling, and reducing the computational load. The approximation is asymptotically exact for fine redshift bins Δz → 0. We furthermore show how it can be implemented at the level of a Gaussian likelihood or a Fisher matrix forecast as a fast correction to the Gaussian case without needing to build large covariance matrices. Numerical application to a Euclid-like survey shows that, compared to a full SSC computation, the approximation nicely recovers the signal-to-noise ratio and the Fisher forecasts on cosmological parameters of the wCDM cosmological model. Moreover, it allows for a fast prediction of which parameters are going to be the most affected by SSC and at what level. In the case of photometric galaxy clustering with Euclid-like specifications, we find that σ₈, n_s, and the dark energy equation of state w are particularly heavily affected. We finally show how to generalize the approximation for probes other than angular spectra (correlation functions, number counts, and bispectra) and at the likelihood level, allowing for the latter to be non-Gaussian if necessary. We publicly release a Python module allowing the implementation of the SSC approximation and a notebook reproducing the plots of the article.

Key words: large-scale structure of Universe / galaxies: statistics / methods: data analysis / methods: analytical

^⋆

The Python module is available at https://github.com/fabienlacasa/PySSC and at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/624/A61

© ESO 2019

1. Introduction

The distribution of matter on large scales in the Universe is one of the main cosmological probes, shedding light on, for example, dark matter, dark energy, and gravity at cosmological scales. The current surveys of galaxies such as the Kilo-Degree Survey (KiDS; Hildebrandt et al. 2017) and the Dark Energy Survey (DES; Dark Energy Survey Collaboration 2018) recently provided cosmological constraints on the ΛCDM model from galaxy clustering and weak lensing, which are now competitive with constraints derived from the lensing of the Cosmic Microwave Background (CMB) and consistent with CMB primary anisotropies (for a recent comparison, see e.g., Planck Collaboration VI 2018). In the near future, large surveys such as the Large Synoptic Survey Telescope (LSST; LSST Science Collaborations 2009) and the Euclid satellite mission (Laureijs et al. 2011) will greatly improve our understanding of the structure of the Universe, the nature and the properties of dark energy, the potential modification of gravity at cosmological scales, and the initial conditions of cosmological perturbations (Amendola et al. 2013).

Unlike CMB primary anisotropies, however, late-time tracers of the large-scale structures (LSS) evolved through nonlinear dynamics, and as a result, the probability distribution function (pdf) of probes such as the galaxy distribution or weak lensing by LSS is no longer Gaussian, with deviation from a Gaussian distribution increasing at smaller scales. This first means that not all the information is compressed in the two-point statistics of the considered probes. Second, this means that the covariance of statistical observables built from LSS tracers (e.g., any n-point statistics) is increased by the presence of non-Gaussian contributions; as an example, the covariance on angular power spectra will be increased by contributions from a nonvanishing trispectrum. In the present context of preparing the cosmological interpretation of forthcoming datasets, and of forecasting the expected performance of future galaxy surveys that aim at precision cosmology from LSS tracers, it is now necessary to properly take into account the non-Gaussian contribution to the covariance for any inference of cosmological parameters from LSS observables.

Among the different non-Gaussian sources to the covariance (see Lacasa 2018 for a full derivation) is the super-sample covariance (SSC), first discovered for cluster counts by Hu & Kravtsov (2003) and to which a vast amount of literature has been devoted (e.g., Takada & Hu 2013; Takada & Spergel 2014; Takahashi et al. 2014; Li et al. 2018; Chan et al. 2018; Lacasa et al. 2018; Barreira et al. 2018a,b). This additional source of cosmic variance is inherent to all galaxy surveys due to the limited portion of the Universe that is observed, both in redshift depth and in sky fraction. SSC hence comes from the nonlinear impact of density fluctuations with wavelengths greater than the survey size. These super-survey modes modulate the local observables by making the background density averaged over the survey nonrepresentative of (either denser or less dense than) the average density of the Universe. Barring systematics, SSC is expected to be the dominant source of statistical error or cosmic variance for weak lensing (Barreira et al. 2018a) beyond the usual Gaussian covariance, although other terms may also be important for galaxy clustering (Lacasa 2018). It affects the whole set of statistical observables and correlates them. Contrary to intrasurvey sources of covariance, it can be shown that SSC cannot be reliably calibrated from the data itself or from classical simulations (Lacasa & Kunz 2017). This thus motivates the need for analytical or semi-analytical predictions of the effect, for use in the analysis of current and future galaxy surveys.

When analyzing such galaxy surveys, we usually deal with observables 𝒪_i that are line-of-sight integrals of the form 𝒪_i = ∫dV_i 𝔬_i, where 𝔬_i is the comoving density of the observable (including selection effects such as redshift binning) and $\mathrm{d}V = r^2(z) \frac{\mathrm{d}r}{\mathrm{d}z} \mathrm{d}z$ is the comoving volume per steradian. Then the rigorous super-sample covariance for such observables is given by (e.g., Lacasa & Rosenfeld 2016)

$\begin{aligned} \mathrm{Cov}_{\mathrm{SSC}}\left(\mathcal{O}_1,\mathcal{O}_2\right) = \int \int \mathrm{d}V_1 \, \mathrm{d}V_2 \, \frac{\partial \mathfrak{o}_1}{\partial \delta_b}(z_1) \, \frac{\partial \mathfrak{o}_2}{\partial \delta_b}(z_2) \, \sigma^2(z_1,z_2). \end{aligned}$ (1)

In the above, $\frac{\partial \mathfrak{o}_1}{\partial\delta_b}(z_1)$ is the response of the probe, which quantifies how a given probe varies with changes of the background density δ_b. The quantity σ²(z₁, z₂) reads (assuming full sky here for simplicity)

$\begin{aligned} \sigma^2(z_1,z_2) = \frac{1}{2\pi^2}\int k^2 \, \mathrm{d}k \ P_{\mathrm{m}}(k|z_{12}) \ j_0(k r_1) \, j_0(k r_2), \end{aligned}$ (2)

with P_m(k|z₁₂) the linear matter cross-spectrum between redshifts z₁ and z₂, j₀ the spherical Bessel function of order zero, and r₁, r₂ the comoving distances to z₁, z₂. It basically amounts to the (co)variance of the background density over a given survey volume due to modulations by super-survey modes.
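As an illustration of Eq. (2), the following minimal sketch evaluates σ²(z₁, z₂) by trapezoidal quadrature over a wavenumber grid. The function name `sigma2`, the grid, and the toy spectrum `pk_toy` are assumptions made for this example only; they are not taken from the released module and do not represent a real cosmology. In practice the cross-spectrum would come from a Boltzmann code.

```python
import numpy as np

def sigma2(r1, r2, k, pk):
    """Eq. (2) by trapezoidal quadrature on the wavenumber grid k.

    r1, r2 : comoving distances r(z1), r(z2)
    k      : 1D increasing wavenumber grid
    pk     : matter cross-spectrum P_m(k|z12) on that grid (caller-supplied)
    """
    # j0(x) = sin(x)/x; np.sinc(t) = sin(pi t)/(pi t), hence the division by pi
    j0_1 = np.sinc(k * r1 / np.pi)
    j0_2 = np.sinc(k * r2 / np.pi)
    f = k**2 * pk * j0_1 * j0_2
    integral = np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(k))
    return integral / (2.0 * np.pi**2)

# Toy inputs (hypothetical shape and amplitude, for illustration only)
k = np.logspace(-4, 0, 4000)
pk_toy = 1e4 * k / (1.0 + (k / 0.02)**2)**2
s11 = sigma2(1000.0, 1000.0, k, pk_toy)   # variance at a single distance
s12 = sigma2(1000.0, 1500.0, k, pk_toy)   # covariance between two distances
```

The grid must resolve the oscillations of j₀(kr) for the result to be quantitatively reliable; the grid used here is only meant to show the structure of the computation.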

Computing this SSC contribution to the covariance exactly, however, becomes costly very quickly. In practice, we take advantage of the separability in redshift (e.g., Lacasa et al. 2018; Barreira et al. 2018b) to reduce the cost of a covariance evaluation to that of an angular power spectrum evaluation. However, the covariance needs to be evaluated at every pair of multipoles. For future surveys doing angular power spectra analysis with ℓ_max of a few thousand, this induces a 𝒪(10³) slow-down of prediction pipelines, which can be increased by more orders of magnitude if we include tomography (⇒ pairs of redshift bins) and combine probes (⇒ pairs of probes).

Furthermore, an exact computation necessitates the knowledge of the probe’s response $\frac{\partial \mathfrak{o}}{\partial \delta_b}$, either through analytical means or through simulations, for every redshift and multipole, which is a barrier for analysts not already experts in the field of SSC. It is thus desirable to have instead simpler functions, if not fixed parameters, as we will find later on, with reference ansätze that can be easily implemented by the community.

The aim of this article is thus to present an approximation for the SSC that allows fast numerical computation and ease of use by the community, and to assess its accuracy in a forecast analysis using the Fisher matrix approach.

The article is organized as follows. Our approximation is presented in Sect. 2 for the case of angular power spectra as our statistical observables. This approximation basically abolishes the above-mentioned numerical burden, and makes the computation of the super-sample covariance matrix as fast as the computation of the involved angular power spectra. Furthermore, we show in Sect. 3 that the resulting matrix form enables fast application to common uses of the covariance (i.e., in a Gaussian likelihood or for computation of a signal-to-noise ratio, S/N, or a Fisher matrix) as a correction to the Gaussian case. Then in Sect. 4 we show numerical results validating the approximation and giving its range of applicability. Finally in Sect. 5 we generalize the approach to other statistics (number counts, correlation function, and bispectrum) and to the full likelihood, making the implementation of super-sample covariance feasible even if the likelihood is not Gaussian.

We publicly release a Python code that allows the easy implementation of SSC with our approach¹.

2. Approximating the SSC

We consider the case of the angular power spectra cross-correlating two LSS tracers, A and B. In the context of galaxy surveys, these two tracers typically are galaxy clustering and galaxy shear. This can, however, be extended to other LSS tracers such as lensing of the CMB or the integrated Sachs-Wolfe (iSW) effect. The signals are observed in some redshift bins indicated with indices i_z, j_z, etc., and with a given width. In full generality, the redshift bins may overlap².

We use the Limber approximation throughout the article, both for the power spectrum and the super-sample covariance. The approximation is accurate enough for the power spectrum on the range of scales of our later forecast (ℓ ≥ 50). Furthermore, it is even better suited to the super-sample covariance, because SSC impacts the covariance on small scales, ℓ ≳ 300, as we show in Sect. 4.

With the Limber approximation, the angular power spectrum between two signals can generally be written as

$\begin{aligned} C_\ell^{AB}(i_z,j_z) = \int \mathrm{d}V \ W^A_{i_z}(z) \, W^B_{j_z}(z) \ P_{AB}(k_\ell|z). \end{aligned}$ (3)

The weighting kernels $W^A_{i_z}(z)$ and $W^B_{j_z}(z)$ are nonzero over the width of the redshift bin, and are expressed in units of [probe unit] ⋅ sr/(Mpc/h)³. The quantity P_AB(k_ℓ|z) is the 3D power spectrum of the considered probe, evaluated at the Limber wavenumber k_ℓ = (ℓ+1/2)/r(z), with r(z) the comoving distance. Weighting kernels and power spectra for the different probes of interest (galaxy clustering and shear, CMB lensing, iSW effect) are given in Appendix A.

For an angular power spectrum, the comoving density of the observable, i.e., 𝔬_AB entering Eq. (1), is $\mathfrak{o}_{AB} = W^A_{i_z}(z) \, W^B_{j_z}(z) \ P_{AB}(k_\ell|z)$, i.e., the integrand of Eq. (3). Assuming that in Eq. (1) the responses, $\frac{\partial \mathfrak{o}_{AB}}{\partial\delta_b}$, vary slowly with redshift compared to σ²(z₁, z₂), we arrive at the approximation that is the basis of this article:

$\begin{aligned} \mathrm{Cov}_{\mathrm{SSC}}\left(C_\ell^{AB}(i_z,j_z),C_{\ell'}^{CD}(k_z,l_z)\right) \approx &\ R_\ell^{AB} \, C_\ell^{AB}(i_z,j_z) \ R_{\ell'}^{CD} \, C_{\ell'}^{CD}(k_z,l_z) \\ & \times S^{A,B;C,D}_{i_z,j_z;k_z,l_z}, \end{aligned}$ (4)

where the double integrals over redshift in Eq. (1) have been approximately performed. The matrix $S^{A,B;C,D}_{i_z,j_z;k_z,l_z}$ is the dimensionless volume-averaged (co)variance of the background matter density contrast

$\begin{aligned} S^{A,B;C,D}_{i_z,j_z;k_z,l_z} = \int \mathrm{d}V_1 \, \mathrm{d}V_2 \, \frac{W_{i_z}^A(z_1) \, W_{j_z}^B(z_1)}{I^{AB}(i_z,j_z)} \, \frac{W_{k_z}^C(z_2) \, W_{l_z}^D(z_2)}{I^{CD}(k_z,l_z)} \ \sigma^2(z_1,z_2), \end{aligned}$ (5)

with

$\begin{aligned} I^{AB}(i_z,j_z) = \int \mathrm{d}V_1 \ W_{i_z}^A(z_1) \ W_{j_z}^B(z_1). \end{aligned}$ (6)
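Equations (5) and (6) can be discretized on a redshift grid as a pair of weighted sums. The sketch below does this for the auto-spectra of a single probe (i_z = j_z and k_z = l_z), assuming that σ²(z₁, z₂) has already been tabulated on the grid. The function name `s_matrix` and its argument layout are hypothetical choices for this example and are not the interface of the released module.

```python
import numpy as np

def s_matrix(z, dv_dz, kernels, sig2):
    """Discretized Eqs. (5)-(6) for auto-spectra of one probe.

    z       : (n_z,) increasing redshift grid (float)
    dv_dz   : (n_z,) comoving volume per steradian per unit z, r^2 dr/dz
    kernels : (n_bins, n_z) weighting kernels W_i(z)
    sig2    : (n_z, n_z) sigma^2(z1, z2) tabulated on the grid
    returns : (n_bins, n_bins) matrix S_{i,i;j,j}
    """
    # Trapezoidal quadrature weights on a possibly non-uniform grid
    w = np.empty_like(z)
    w[1:-1] = 0.5 * (z[2:] - z[:-2])
    w[0] = 0.5 * (z[1] - z[0])
    w[-1] = 0.5 * (z[-1] - z[-2])
    dv = w * dv_dz                       # discrete volume element dV
    ww = kernels**2 * dv                 # W_i(z)^2 dV for each bin
    norm = ww.sum(axis=1)                # I(i, i), Eq. (6)
    return (ww @ sig2 @ ww.T) / np.outer(norm, norm)   # Eq. (5)
```

A quick sanity check of the normalization: if σ² is constant over the grid, every element of S equals that constant, whatever the kernels.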

The quantity R_ℓ is the effective relative response of the considered power spectrum. In the context of second-order perturbation theory (hereafter 2PT), the growth-only response of the matter power spectrum is $\frac{\partial P(k)}{\partial \delta_b} = \frac{68}{21} P(k)$ (e.g., Takada & Hu 2013), i.e., $R_\ell = \frac{68}{21}$. Other nonlinear terms are present, however, which increase the total response. In Appendix C we detail these terms, and present a full computation of the response. Our formalism is valid for a general scale-dependent response. In numerical applications later in this article, we test the approximation both with the full response and with the simpler ansatz R_ℓ = 5 ≡ R, which is the effective value found in Appendix C.

The key point is that starting from the approximation in Eq. (4) makes the computation of the SSC for angular power spectra have the same numerical cost as the computation of the power spectra themselves.
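To make the structure of Eq. (4) concrete, here is a minimal sketch that assembles the approximate SSC covariance from precomputed spectra, responses, and the S matrix. The flattening of each (probe pair, redshift-bin pair) combination into a single "spectrum index" and the function name `ssc_cov_approx` are assumptions of this example.

```python
import numpy as np

def ssc_cov_approx(cl, resp, s_mat):
    """Approximate SSC covariance of Eq. (4).

    cl    : (n_spec, n_ell) angular power spectra, one row per
            (probe pair, redshift-bin pair) combination
    resp  : scalar or array broadcastable to cl, the response R_ell
    s_mat : (n_spec, n_spec) S matrix between those combinations
    returns (n_spec, n_ell, n_spec, n_ell) covariance array
    """
    v = resp * cl                                  # R_ell * C_ell
    # cov[i, a, j, b] = v[i, a] * v[j, b] * s_mat[i, j]
    return np.einsum('ia,jb,ij->iajb', v, v, s_mat)

# Toy usage with two spectra, three multipoles, and the ansatz R = 5
cl = np.array([[4.0, 2.0, 1.0], [3.0, 1.5, 0.5]])
s_mat = np.array([[2e-3, 5e-4], [5e-4, 1e-3]])
cov = ssc_cov_approx(cl, 5.0, s_mat)
```

No redshift integral appears beyond those already needed for the spectra and for S, which is why the cost is comparable to that of the spectra themselves.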

3. Application to parameter constraints

In this section we examine the consequences for data analysis or forecasts of the SSC covariance given by Eq. (4) as an update to the covariance, i.e., the total covariance is 𝒞 = 𝒞_noSSC + 𝒞_SSC, where 𝒞_noSSC is the sum of all other contributions to the covariance matrix.

A common statistical use of a covariance matrix, 𝒞, is to compute scalar quantities of the form³

$\begin{aligned} I = X^T \cdot \mathcal{C}^{-1} \cdot Y, \end{aligned}$ (7)

where the dot “⋅” stands for matrix multiplication. For example, to compute the cumulative S/N we would have X = Y = (C_ℓ)_{ℓ = ℓ_min⋯ℓ_max} ≡ C. In the exponent of a Gaussian likelihood, we would need X = Y = Ĉ − C(p), where Ĉ ≡ (Ĉ_ℓ)_{ℓ = ℓ_min⋯ℓ_max} is the estimated/measured power spectrum and C(p) ≡ (C_ℓ(p))_{ℓ = ℓ_min⋯ℓ_max} is the predicted power spectrum with model parameters p. Finally for Fisher forecasts, computing the Fisher matrix, F_{α,β}, requires X = ∂C/∂p_α ≡ ∂_αC and Y = ∂_βC.

This last case is the primary aim of the article since it is a measure of the amount of information we have on cosmological parameters from the observables C. The Fisher matrix will be our figure of merit to gauge the quality of the approximation, with numerical results to be presented in Sect. 4.

Computing the scalar quantities given by Eq. (7) requires the inversion of the covariance matrix, 𝒞 = 𝒞_noSSC + 𝒞_SSC. Using the approximation Eq. (4), adding the SSC corresponds to a rank 1 update of the covariance matrix of the angular power spectrum C_ℓ. Furthermore, in Appendix D, we detail the way to introduce binned power spectra in this approach, and we show that adding the SSC to the covariance of the binned spectra is also a rank 1 update of the covariance.

We thus make use of the Sherman–Morrison formula (Sherman & Morrison 1950; Bartlett 1951), which gives the impact on matrix inversion of a rank 1 update,

$\begin{aligned} \left(A + U V^T\right)^{-1} = A^{-1} - \frac{\left(A^{-1} \cdot U\right) \left(V^T \cdot A^{-1}\right)}{1 + V^T \cdot A^{-1} \cdot U}, \end{aligned}$ (8)

where A is any invertible n × n square matrix, U and V are two n-dimensional vectors with 1 + V^T ⋅ A⁻¹ ⋅ U ≠ 0, and T denotes the transpose.
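The Sherman–Morrison formula of Eq. (8) can be checked numerically against a direct inversion. The sketch below is a generic illustration with arbitrary small inputs; the helper name `sherman_morrison_inverse` is chosen for this example.

```python
import numpy as np

def sherman_morrison_inverse(a_inv, u, v):
    """Inverse of (A + u v^T) from the inverse of A, following Eq. (8)."""
    au = a_inv @ u                    # A^{-1} U, a vector
    va = v @ a_inv                    # V^T A^{-1}, a vector
    denom = 1.0 + v @ au              # scalar 1 + V^T A^{-1} U, must be nonzero
    return a_inv - np.outer(au, va) / denom

# Arbitrary test inputs: a diagonal A and two vectors
a = np.diag([1.0, 2.0, 3.0, 4.0])
u = np.array([1.0, 0.5, -0.3, 0.2])
v = np.array([0.4, 0.1, 0.2, -0.5])

fast = sherman_morrison_inverse(np.linalg.inv(a), u, v)
direct = np.linalg.inv(a + np.outer(u, v))
print(np.allclose(fast, direct))
```

When A is diagonal, as for the full-sky Gaussian covariance, its inverse is trivial and the update costs only a few vector operations, with no need to invert the full matrix.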

3.1. Single probe and single redshift bin

If we neglect all non-Gaussian terms except SSC, the noSSC covariance reduces to the Gaussian term, which is diagonal in full sky:

$\begin{aligned} \left(\mathcal{C}_{\mathrm{G}}\right)_{\ell,\ell'} = G_\ell \ \delta_{\ell,\ell'} \qquad \mathrm{with} \qquad G_\ell = \frac{2 C_\ell^2}{2\ell+1}. \end{aligned}$ (9)

This can simplify the inversion of the noSSC covariance later on. In partial sky observations, the Gaussian covariance will not be diagonal due to mask-induced couplings between different angular scales (i.e., ℓ-to-ℓ′ couplings). It can nevertheless be made diagonal in practice by binning the power spectrum with bins wider than the typical width of the mask-induced couplings, as shown in Appendix D.

In the following we keep the covariance general throughout the derivation, and indicate when appropriate which expressions are simplified by the diagonal assumption. We also keep the same subscript, ℓ, to indicate either single multipoles or bins of multipoles.

The super-sample covariance has a separable form between the two multipoles so that we can write the total covariance as

$\begin{aligned} \mathcal{C} = \mathcal{C}_{\mathrm{noSSC}} + S_{i,i} \ \left(V V^T\right), \end{aligned}$ (10)

where V is a vector whose size is the number of multipoles, given by

$\begin{aligned} V_\ell = R_\ell \, C_\ell, \end{aligned}$ (11)

and S_{i,i} is just a number. The Sherman–Morrison formula Eq. (8) then gives the inverse covariance as

$\begin{aligned} \mathcal{C}^{-1} = \mathcal{C}_{\mathrm{noSSC}}^{-1} - \frac{S_{i,i} \ \left(\mathcal{C}_{\mathrm{noSSC}}^{-1} V\right) \left(V^T \mathcal{C}_{\mathrm{noSSC}}^{-1}\right)}{1 + S_{i,i} \ \left(V^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V\right)}, \end{aligned}$ (12)

where $V^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V$ is a scalar.

Thus, the scalar quantity defined in Eq. (7) is given by

$\begin{aligned} I = I_{\mathrm{noSSC}} - \frac{f^{\mathrm{SSC}}_X \ f^{\mathrm{SSC}}_Y \ S_{i,i}}{1 + V^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V \ S_{i,i}}, \end{aligned}$ (13)

where we defined the scalar

$\begin{aligned} f^{\mathrm{SSC}}_X \equiv X^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V \stackrel{\text{diagonal}}{=} \sum_\ell \frac{R_\ell \, C_\ell \ X_\ell}{G_\ell}. \end{aligned}$ (14)

The notation $\stackrel{\text{diagonal}}{=}$ in the above means that the assumption that the noSSC covariance matrix is diagonal has been used. In particular for Fisher matrices, SSC gives a negative correction to the noSSC case:

$\begin{aligned} F_{\alpha,\beta} = F^{\mathrm{noSSC}}_{\alpha,\beta} - \frac{f^{\mathrm{SSC}}_\alpha \ f^{\mathrm{SSC}}_\beta \ S_{i,i}}{1 + V^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V \ S_{i,i}}, \end{aligned}$ (15)

with

$\begin{aligned} f^{\mathrm{SSC}}_\alpha \equiv \partial_\alpha C^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot V \stackrel{\text{diagonal}}{=} \sum_\ell \frac{R_\ell \, C_\ell \ \partial_\alpha C_\ell}{G_\ell}. \end{aligned}$ (16)

We finally note that all the above expressions for the impact of the SSC are easily extended to the case of binned spectra by replacing the vector V_ℓ by its binned version, V_b (see Appendix D).
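As a sketch of how Eqs. (9), (11), (15), and (16) combine in practice, the function below computes the Fisher matrix with the SSC correction for a single probe and a single redshift bin, assuming a diagonal full-sky Gaussian noSSC covariance. The function name `fisher_with_ssc`, the argument layout, and the toy spectrum with its two derivatives are hypothetical and serve only to illustrate the algebra.

```python
import numpy as np

def fisher_with_ssc(cl, dcl, ell, resp, s_ii):
    """Fisher matrix including SSC, Eq. (15), for a diagonal noSSC covariance.

    cl   : (n_ell,) angular power spectrum
    dcl  : (n_par, n_ell) derivatives of cl with respect to each parameter
    ell  : (n_ell,) multipoles
    resp : scalar or (n_ell,) response R_ell
    s_ii : float, S_{i,i} for the bin
    """
    g = 2.0 * cl**2 / (2.0 * ell + 1.0)     # Gaussian variance, Eq. (9)
    v = resp * cl                           # Eq. (11)
    f_nossc = (dcl / g) @ dcl.T             # noSSC Fisher matrix
    f_ssc = (dcl / g) @ v                   # Eq. (16), one scalar per parameter
    vcv = np.sum(v**2 / g)                  # V^T C_noSSC^{-1} V
    correction = s_ii * np.outer(f_ssc, f_ssc) / (1.0 + s_ii * vcv)
    return f_nossc - correction

# Toy usage: power-law spectrum with "amplitude" and "tilt" derivatives
ell = np.arange(50, 60)
cl = (ell / 50.0)**-1.2
dcl = np.array([cl, cl * np.log(ell / 50.0)])
fisher = fisher_with_ssc(cl, dcl, ell, resp=5.0, s_ii=1e-3)
```

The full n_ℓ × n_ℓ covariance matrix is never built or inverted; only sums over multipoles are needed.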

3.2. Multi-probe and single redshift bin

A more complex case of interest is when we have spectra of different probes sharing the same redshift bin. One example is a multi-tracer analysis for galaxy clustering at a given redshift. Another is a weak-lensing analysis splitting galaxy types (e.g., red vs. blue galaxies) to better mitigate the effect of intrinsic alignments.

In the following we illustrate this with two probes (A and B) yielding three power spectra, although the results hold straightforwardly for more probes.

The three angular power spectra are generally correlated. Thus, even in the Gaussian case and in full sky, the covariance matrix is not diagonal as a function of probes; for example,

$\begin{aligned} \mathrm{Cov}_{\mathrm{G}}\left(C_\ell^{AA},C_\ell^{BB}\right) = \frac{2 \left(C_\ell^{AB}\right)^2}{2\ell+1} \ne 0. \end{aligned}$ (17)

However, it remains diagonal as a function of multipoles.

Given that they share the same redshift bin, and assuming the weighting kernels to have similar enough redshift dependence within the bin, the S matrix can be assumed independent of probes:

$\begin{aligned} S^{W,X;Y,Z}_{i,i;i,i} = \mathrm{cst} \equiv S_{i,i} \qquad \forall \ (W,X,Y,Z) \in \{A,B\}^4. \end{aligned}$ (18)

This property simplifies the super-sample covariance, allowing us to easily compute its impact on Fisher forecasts as we show below.

In the following we call n_c the number of spectra and n_ℓ the number of multipoles. It now becomes useful to arrange the n_c × n_ℓ data vector C grouping probes together before multipoles, for example with two probes A and B:

$\begin{aligned} \boldsymbol{C} = \left( \begin{matrix} \boldsymbol{C}_{\ell_{\mathrm{min}}} \\ \vdots \\ \boldsymbol{C}_{\ell_{\mathrm{max}}} \end{matrix} \right) \qquad \mathrm{with} \qquad \boldsymbol{C}_\ell = \left( \begin{matrix} C_\ell^{AA} \\ C_\ell^{AB} \\ C_\ell^{BB} \end{matrix} \right). \end{aligned}$ (19)

The covariance matrix has a size (n_c × n_ℓ)×(n_c × n_ℓ). With C arranged as above, the covariance matrix is thus partitioned in n_ℓ × n_ℓ blocks, each of these blocks having a size n_c × n_c.

In the Gaussian case, in full sky or with the f_sky approximation, the covariance matrix is block diagonal, and we call G_ℓ these blocks of size n_c × n_c on the diagonal (see Appendix D for the case of binned spectra).

The formalism of Sect. 3.1 for a single probe is then easily adapted with only slight changes. We have the total covariance matrix

$\begin{aligned} \mathcal{C} = \mathcal{C}_{\mathrm{noSSC}} + \mathcal{C}_{\mathrm{SSC}} = \mathcal{C}_{\mathrm{noSSC}} + \boldsymbol{V} \boldsymbol{V}^T \ S_{i,i}, \end{aligned}$ (20)

where V is an (n_c × n_ℓ)-dimensional vector, with blocks given by

$\begin{aligned} \boldsymbol{V}_\ell = R_\ell \ \boldsymbol{C}_\ell. \end{aligned}$ (21)

The inverse covariance then follows as

$\begin{aligned} \mathcal{C}^{-1} = \mathcal{C}_{\mathrm{noSSC}}^{-1} - \frac{S_{i,i} \ \left(\mathcal{C}_{\mathrm{noSSC}}^{-1} \boldsymbol{V}\right) \left(\boldsymbol{V}^T \mathcal{C}_{\mathrm{noSSC}}^{-1}\right)}{1 + S_{i,i} \ \left(\boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V}\right)}, \end{aligned}$

where $\boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V}$ is a scalar. For a block-diagonal covariance, it simplifies to

$\begin{aligned} \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \stackrel{\text{diagonal}}{=} \sum_\ell R_\ell^2 \ \boldsymbol{C}_\ell^T \cdot \boldsymbol{G}^{-1}_\ell \cdot \boldsymbol{C}_\ell, \end{aligned}$ (22)

with the inner matrix products being in the space of the n_c spectra, and the factor R_ℓ² following from Eq. (21). This reduces to the case of Sect. 3.1 when n_c = 1.

Thus, the scalar quantity defined in Eq. (7) is given by

$\begin{aligned} I = I_{\mathrm{noSSC}} - \frac{\boldsymbol{f}^{\mathrm{SSC}}_X \ \boldsymbol{f}^{\mathrm{SSC}}_Y \ S_{i,i}}{1 + \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \ S_{i,i}}, \end{aligned}$ (23)

where we defined the scalar

$\begin{aligned} \boldsymbol{f}^{\mathrm{SSC}}_X \equiv \boldsymbol{X}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \stackrel{\text{diagonal}}{=} \sum_\ell \boldsymbol{X}_\ell^T \cdot \boldsymbol{G}^{-1}_\ell \cdot R_\ell \ \boldsymbol{C}_\ell, \end{aligned}$ (24)

with again the inner matrix products in the space of the n_c spectra.

Finally, the total Fisher matrix is given by the noSSC Fisher matrix plus a negative SSC correction:

$\begin{aligned} F_{\alpha,\beta} = F^{\mathrm{noSSC}}_{\alpha,\beta} - \frac{\boldsymbol{f}^{\mathrm{SSC}}_\alpha \ \boldsymbol{f}^{\mathrm{SSC}}_\beta \ S_{i,i}}{1 + \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \ S_{i,i}}, \end{aligned}$ (25)

with

$\begin{aligned} \boldsymbol{f}^{\mathrm{SSC}}_\alpha \equiv \partial_\alpha \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \stackrel{\text{diagonal}}{=} \sum_\ell \partial_\alpha \boldsymbol{C}_\ell^T \cdot \boldsymbol{G}^{-1}_\ell \cdot R_\ell \ \boldsymbol{C}_\ell. \end{aligned}$ (26)
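The block-diagonal sums of this section can be sketched as follows for two probes A and B. The per-multipole Gaussian blocks are built here with the standard full-sky expression Cov_G(C_ℓ^{ab}, C_ℓ^{cd}) = (C_ℓ^{ac} C_ℓ^{bd} + C_ℓ^{ad} C_ℓ^{bc})/(2ℓ+1), which reproduces Eqs. (9) and (17) as special cases. The function names, array layout, and absence of noise terms are assumptions of this example.

```python
import numpy as np

PAIRS = [(0, 0), (0, 1), (1, 1)]   # spectra ordered as AA, AB, BB (Eq. 19)

def gaussian_block(c2x2, ell):
    """3x3 Gaussian covariance block G_ell from the 2x2 matrix of spectra."""
    g = np.empty((3, 3))
    for m, (a, b) in enumerate(PAIRS):
        for n, (c, d) in enumerate(PAIRS):
            g[m, n] = (c2x2[a, c] * c2x2[b, d]
                       + c2x2[a, d] * c2x2[b, c]) / (2.0 * ell + 1.0)
    return g

def quad_form_with_ssc(x, y, cvec, gblocks, resp, s_ii):
    """Scalar I of Eq. (23) for a block-diagonal noSSC covariance.

    x, y, cvec : (n_ell, 3) vectors ordered as in Eq. (19)
    gblocks    : (n_ell, 3, 3) Gaussian blocks G_ell
    resp       : scalar response R (or array of shape (n_ell, 1))
    s_ii       : float, S_{i,i} for the shared redshift bin
    """
    ginv = np.linalg.inv(gblocks)                        # batched 3x3 inverses
    v = resp * cvec                                      # Eq. (21)
    i_nossc = np.einsum('la,lab,lb->', x, ginv, y)
    f_x = np.einsum('la,lab,lb->', x, ginv, v)           # Eq. (24)
    f_y = np.einsum('la,lab,lb->', y, ginv, v)
    vcv = np.einsum('la,lab,lb->', v, ginv, v)           # Eq. (22)
    return i_nossc - s_ii * f_x * f_y / (1.0 + s_ii * vcv)
```

Setting x and y to parameter derivatives of the data vector gives the Fisher matrix element of Eq. (25); setting both to `cvec` gives the squared S/N.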

3.3. Multi-probe and multiple redshift bins

A first case of interest is when we have probes in different nonoverlapping bins, or when the overlap is small enough to be neglected. This happens for instance for galaxies, cluster counts, or power spectra in sufficiently wide bins, i.e., wider than the photo-z error bars and of redshift width ≥0.1, so as to be larger than the width of the SSC kernel σ²(z₁, z₂) (see Fig. 6 in Lacasa & Rosenfeld 2016).

In this case we can basically add up the bins independently,

$\begin{aligned} I = \sum_i I_{i,i} = I_{\mathrm{noSSC}} - \Delta I_{\mathrm{SSC}}, \end{aligned}$ (27)

where I_{i,i} is given by Eq. (23). The (negative) SSC correction is

$\begin{aligned} \Delta I_{\mathrm{SSC}} = \sum_i \frac{\boldsymbol{f}^{\mathrm{SSC}}_X \ \boldsymbol{f}^{\mathrm{SSC}}_Y \ S_{i,i}}{1 + \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \ S_{i,i}}. \end{aligned}$ (28)

In particular for Fisher forecasts, the (negative) SSC correction reads

$\begin{aligned} \Delta F_{\alpha,\beta}^{\mathrm{SSC}} = \sum_i \frac{\boldsymbol{f}^{\mathrm{SSC}}_\alpha \ \boldsymbol{f}^{\mathrm{SSC}}_\beta \ S_{i,i}}{1 + \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \ S_{i,i}}, \end{aligned}$ (29)

which is obtained as the sum over independent redshift bins of the SSC corrections derived for one single bin.

The second case of interest is when bins are overlapping. This happens for instance when analyzing galaxy shear, which integrates the signal from z = 0 to the sources, either alone or in combination with other probes. In that case, no simplifications can be carried out: the power spectra are correlated both as a function of multipoles and as a function of redshift bins. The covariance matrix must be built in full generality using Eq. (4), and then inverted numerically. We note that already at the Gaussian level inversion must be carried out numerically, due to the coupling between redshift bins.

3.4. Importance of SSC: an analytical rule of thumb

The importance of SSC can be gauged easily in an analytical way, if we assume a single redshift bin, and further approximate the response R_ℓ to be independent of scale, R_ℓ ≡ R. It is important to note that this scale-independent assumption is not a requirement for numerical application, and may be relaxed, as is done in Sect. 4.

In this section, we gauge the importance of SSC analytically first for the S/N, then for the Fisher constraints.

3.4.1. Impact on the signal-to-noise ratio

The S/N of a set of angular power spectra, collected in a single data vector C, is given by

$\begin{aligned} \left(S/N\right)^2 = \boldsymbol{C} \cdot \mathcal{C}^{-1} \cdot \boldsymbol{C}, \end{aligned}$ (30)

and we also introduce the noSSC version of it as

$\begin{aligned} \left(S/N\right)^2_{\mathrm{G}} = \boldsymbol{C} \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{C}. \end{aligned}$ (31)

We recall that these are cumulative S/N values, obtained as a summation over multipoles up to a maximum value ℓ_max; they are thus functions of the maximum multipole up to which we integrate our observables. Let us also introduce the scalar quantity

$\begin{aligned} Y \equiv \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V} \ S_{i,i}. \end{aligned}$ (32)

With the assumption of a single redshift bin and a scale-independent response, the scalar Y is shown to be proportional to the square of the noSSC S/N, i.e.,

$\begin{aligned} Y=\left(S/N\right)^2_{\mathrm{noSSC}} \, R^2 \, S_{i,i}. \end{aligned}$ (33)

Then the total (S/N) boils down to

$\begin{aligned} \left(S/N\right)^2 = \left(S/N\right)^2_{\mathrm{noSSC}} \ \left( 1 - \frac{Y}{1+Y}\right) = \frac{\left(S/N\right)^2_{\mathrm{noSSC}}}{(1+Y)}\cdot \end{aligned}$ (34)

It is thus obvious that the SSC decreases the S/N compared to the noSSC case as Y is by construction a positive number. The impact of the SSC is enhanced for higher values of Y, which is then an excellent indicator of its importance⁴.
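The chain from Eq. (31) to Eq. (34) can be checked numerically with a short sketch. The spectrum below is a toy power law with no physical meaning, the covariance is full-sky cosmic variance only, and the helper name `snr2_with_ssc` is ours; the values R = 5 and S_{i,i} = 6.2 × 10⁻⁷ are those quoted in Sect. 4. The analytic result is compared with a direct inversion of the full covariance, assuming the SSC term is R² S_{i,i} C Cᵀ.

```python
import numpy as np

def snr2_with_ssc(C, cov_nossc, R, S_ii):
    """Return ((S/N)^2_noSSC, Y, (S/N)^2) following Eqs. (31), (33), (34)."""
    snr2_nossc = C @ np.linalg.solve(cov_nossc, C)
    Y = snr2_nossc * R**2 * S_ii
    return snr2_nossc, Y, snr2_nossc / (1.0 + Y)

ells = np.arange(2, 1001)
C = (ells / 100.0) ** -0.5                         # toy spectrum, not a physical model
cov_gauss = np.diag(2.0 * C**2 / (2 * ells + 1))   # full-sky cosmic variance only
R, S_ii = 5.0, 6.2e-7

snr2_nossc, Y, snr2_tot = snr2_with_ssc(C, cov_gauss, R, S_ii)

# Brute-force check with the explicit rank-one SSC term.
cov_full = cov_gauss + S_ii * R**2 * np.outer(C, C)
snr2_brute = C @ np.linalg.solve(cov_full, C)

print(np.sqrt(snr2_nossc), np.sqrt(snr2_tot), np.sqrt(snr2_brute))
```

In this cosmic-variance-limited toy case the no-SSC (S/N)² is simply the sum of (2ℓ+1)/2 over the multipoles, whatever the shape of the spectrum.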

From Eq. (33), the impact of the SSC increases for higher S/N: the higher (S/N)_noSSC, the higher Y. Enlarging the set of power spectra to smaller angular scales (i.e., increasing ℓ_max to higher multipoles) increases the S/N, hence the impact of the SSC. By integrating to smaller scales, the entire S/N will thus reach a plateau at an asymptotic value

$\begin{aligned} \left(S/N\right)^2_{\mathrm{max}} =\frac{(S/N)^2_{\mathrm{G}}}{Y} = \frac{1}{R^2 \, S_{i,i}}\cdot \end{aligned}$ (35)

This saturation sets in when Y ∼ 1. For the case of a full-sky, cosmic variance-limited analysis of a single power spectrum up to a maximum multipole ℓ_max, and neglecting other non-Gaussian terms, we have $Y\sim \frac{\ell_{\mathrm{max}}^2}{2} R^2 S_{i,i}$. The typical multipole above which the (S/N) starts to saturate because of the SSC is defined by Y ≃ 1. We thus find that SSC becomes important when the analysis goes up to ℓ_max ≳ ℓ_SSC given by

$\begin{aligned} \ell_{\mathrm{SSC}} = \sqrt{\frac{2}{R^2 \ S_{i,i}}}\cdot \end{aligned}$ (36)

Generalizing to the case of partial sky coverage and several cosmic variance-limited probes (in the same redshift bin and neglecting other non-Gaussian terms), the analysis is affected as soon as it reaches multipoles of order

$\begin{aligned} \ell_{\mathrm{SSC}} = \sqrt{\frac{2}{N_p^\mathrm{eff} \ R^2 \ f_{\mathrm{sky}} \ S_{i,i}}}, \end{aligned}$ (37)

where

$\begin{aligned} N_p^\mathrm{eff} = \frac{2 \ (S/N)^2_\mathrm{G,joint}}{f_{\mathrm{sky}} \ \ell_{\mathrm{max}}^2} \end{aligned}$ (38)

is the effective number of probes⁵.

Finally, if the probes are not cosmic variance-limited, for example due to the presence of shot-noise (galaxy clustering) or shape noise (weak lensing), or if other non-Gaussian covariance terms are important, then we need a full computation of the noSSC S/N accounting for these additional sources of error; the criterion for the importance of SSC is $(S/N)_{\mathrm{noSSC}}^2 \gtrsim 1/(R^{2} S_{i,i})$. We note that this critical value is that of the maximum S/N with the full covariance (Eq. (35)), this plateau being the same for single- and multi-probe cases (as long as all probes have the same response R); in other words, it is the maximum amount of information that can be extracted from matter fluctuations in a finite volume of the universe with probes with a given response, regardless of the number of probes.

3.4.2. Impact on Fisher constraints

The S/N is highly impacted by SSC, because in the SSC dominated regime and with a constant R_ℓ, Eq. (4) shows that C_ℓ measurements are 100% correlated and thus all information is lost on the overall amplitude. However, one may question the impact on cosmological parameters, if they are sensitive to other features of the power spectrum.

We first recall Eq. (25) for the Fisher information on model parameters α and β,

$\begin{aligned} F_{\alpha,\beta} = F^{\mathrm{noSSC}}_{\alpha,\beta} - \frac{\boldsymbol{f}^{\mathrm{SSC}}_\alpha \ \boldsymbol{f}^{\mathrm{SSC}}_\beta \ S_{i,i}}{1+Y}, \end{aligned}$ (39)

rewritten here using Y. We further introduce two angles: first, the angle θ_α between the vectors C and ∂_αC,

$\begin{aligned} \cos\theta_\alpha &= \frac{\partial_\alpha \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V}}{\sqrt{\partial_\alpha \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \partial_\alpha \boldsymbol{C} \times \boldsymbol{V}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \boldsymbol{V}}} \\ &= \frac{\boldsymbol{f}^{\mathrm{SSC}}_\alpha}{\sqrt{F^{\mathrm{noSSC}}_{\alpha,\alpha} \times Y/S_{i,i}}}, \end{aligned}$ (40)

and second, the angle θ_αβ between ∂_αC and ∂_βC,

$\begin{aligned} \cos\theta_{\alpha\beta} &= \frac{\partial_\alpha \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \partial_\beta \boldsymbol{C}}{\sqrt{\partial_\alpha \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \partial_\alpha \boldsymbol{C} \times \partial_\beta \boldsymbol{C}^T \cdot \mathcal{C}_{\mathrm{noSSC}}^{-1} \cdot \partial_\beta \boldsymbol{C}}} \\ &= \frac{F^{\mathrm{noSSC}}_{\alpha,\beta}}{\sqrt{F^{\mathrm{noSSC}}_{\alpha,\alpha} \times F^{\mathrm{noSSC}}_{\beta,\beta}}}\cdot \end{aligned}$ (41)

Let us roughly interpret these angles. The second, θ_αβ, is easily interpreted as the noSSC correlation between the parameter α and the parameter β. The first angle, θ_α, can be interpreted as follows. We recall that V is proportional to the data vector C. Up to a normalization constant, C and thus V can be viewed as ∂_AC, where A is the normalization of the data vector. Since angles are obtained from normalized vectors, θ_α is thus a measure of the noSSC correlation between the parameter α and the normalization of the data vector.

The Fisher information matrix including SSC is then conveniently expressed as a function of the noSSC Fisher matrix, the parameters Y measuring the impact of the SSC on the S/N, and the angles θ_α, θ_β, and θ_αβ:

$\begin{aligned} F_{\alpha,\beta} = F^{\mathrm{noSSC}}_{\alpha,\beta} \left(1-\frac{\cos\theta_\alpha \cos\theta_\beta}{\cos\theta_{\alpha\beta}} \frac{Y}{1+Y}\right)\cdot \end{aligned}$ (42)

The impact of the SSC on the Fisher matrix is driven first by the impact of the SSC on the (S/N) through Y, and then by the angles θ_α, θ_β, and θ_αβ. In particular for the diagonal elements, the change of the Fisher matrix is

$\begin{aligned} \frac{\delta F_{\alpha,\alpha}}{F^{\mathrm{noSSC}}_{\alpha,\alpha}} = -\cos^2\theta_\alpha \left(\frac{Y}{1+Y}\right)\cdot \end{aligned}$ (43)

This has a negative value, showing that the SSC lowers the amount of information on a given parameter.

Two conditions have to be met for the impact of the SSC to be important: Y should be greater than one, and cos²θ_α should be close to one. Supposing Y ≫ 1 and if θ_α ≠ 0, π, i.e., cos θ_α ≠ ±1⁶, then the Fisher information keeps increasing with ℓ_max, but at a reduced rate compared to the noSSC case, with the asymptote

$\begin{aligned} F_{\alpha,\alpha} \sim F^{\mathrm{noSSC}}_{\alpha,\alpha} \left(1-\cos^2\theta_\alpha\right), \end{aligned}$ (44)

i.e., the unmarginalized error bar is increased as

$\begin{aligned} \sigma_\alpha \sim \frac{\sigma^G_\alpha}{|\sin\theta_\alpha|}\cdot \end{aligned}$ (45)

A maximum impact of the SSC is thus obtained for θ_α close to zero, that is when the parameter α is at the noSSC level highly correlated with the normalization of the data vector.

When we have several parameters, the situation becomes more complex and cannot be judged with only rules of thumb. For example, a parameter α may seem unaffected by SSC because cos θ_α ≪ 1, but it may be correlated (already at the noSSC level) with a parameter β which is affected by SSC, so that α will be affected indirectly through marginalization. Another possibility is that we have two parameters that are uncorrelated at the noSSC level, but through Eq. (42) they become correlated due to SSC; in that case a large error on one parameter would propagate to the other, which did not happen in the noSSC case.

Summary. To decide the importance of SSC on parameter constraints, first compute the multipole above which the SSC dominated regime is entered, i.e.,

$\begin{aligned} \ell_{\mathrm{SSC}} = \sqrt{\frac{2}{N_p \ R_\ell^2 \ f_{\mathrm{sky}} \ S_{i,i}}}\cdot \end{aligned}$ (46)

If the analysis is restricted to scales such that ℓ ≪ ℓ_SSC, then it will not be affected. If the analysis enters the SSC dominated regime, then for each parameter of interest it is necessary to compute the angle

$\begin{aligned} \cos\theta_\alpha = \frac{\left\langle \partial_\alpha \boldsymbol{C}, \boldsymbol{V}\right\rangle}{\Vert \partial_\alpha \boldsymbol{C}\Vert \ \Vert \boldsymbol{V}\Vert}, \qquad \mathrm{where} \quad \left\langle X, Y\right\rangle = X^T \cdot \mathcal{C}_{\mathrm{G}}^{-1} \cdot Y, \end{aligned}$ (47)

and the (unmarginalized) error bar on parameter α is increased asymptotically as

$\begin{aligned} \sigma_\alpha \sim \frac{\sigma^G_\alpha}{|\sin\theta_\alpha|}\cdot \end{aligned}$ (48)

In the case with several cosmological parameters and/or nuisance parameters that need to be marginalized over, a full computation is necessary.
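The recipe summarized above can be sketched as follows. This is only an illustration under stated assumptions: the toy spectrum, the two parameters (an overall amplitude and a tilt-like parameter about an arbitrary pivot ℓ = 100) and the function name `ssc_angles` are ours, the response is taken constant so that V = R C, and the SSC term of the covariance is assumed to be S_{i,i} V Vᵀ. The sketch computes cos θ_α from Eq. (47), checks Eq. (43) against a brute-force inversion, and evaluates the asymptotic error inflation of Eq. (48).

```python
import numpy as np

def ssc_angles(dC, V, cov_nossc):
    """cos(theta_alpha) of Eq. (47) and the no-SSC Fisher matrix."""
    cinv_V = np.linalg.solve(cov_nossc, V)
    F = dC @ np.linalg.solve(cov_nossc, dC.T)
    norm_V = np.sqrt(V @ cinv_V)
    cos_theta = (dC @ cinv_V) / (np.sqrt(np.diag(F)) * norm_V)
    return cos_theta, F

ells = np.arange(2, 501)
C = (ells / 100.0) ** -0.5                          # toy spectrum
cov_gauss = np.diag(2.0 * C**2 / (2 * ells + 1))    # full-sky cosmic variance only
R, S_ii = 5.0, 6.2e-7
V = R * C
# Rows: derivative w.r.t. an amplitude, and w.r.t. a tilt-like parameter.
dC = np.vstack([C, C * np.log(ells / 100.0)])

cos_theta, F = ssc_angles(dC, V, cov_gauss)
Y = S_ii * (V @ np.linalg.solve(cov_gauss, V))
F_diag_pred = np.diag(F) * (1.0 - cos_theta**2 * Y / (1.0 + Y))   # Eq. (43)

cov_full = cov_gauss + S_ii * np.outer(V, V)
F_diag_brute = np.diag(dC @ np.linalg.solve(cov_full, dC.T))

# Eq. (48) in the Y >> 1 limit; only meaningful when cos(theta) != +-1,
# so it is evaluated for the tilt-like parameter, not for the amplitude.
inflation_tilt = 1.0 / np.sqrt(1.0 - cos_theta[1] ** 2)
print(cos_theta, inflation_tilt)
```

The amplitude parameter has cos θ = 1 by construction, which is the fully degenerate case excluded from Eq. (48).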

4. Numerical application to Fisher forecasts

To test the accuracy of the proposed SSC approximation, and to illustrate its impact on a cosmological analysis, we perform here an application to forecast the cosmological constraints from photometric galaxy clustering with the following specifications:

– single redshift bin with a top-hat window 0.9 < z < 1, and galaxy numbers representative of Euclid: 28M galaxies in the bin, corresponding to a density of ∼2.5 gal arcmin⁻²;
– full sky coverage and an analysis in the multipole range 50 < ℓ < 2000, in bins of constant width Δℓ = 50 (a realistic range for a conservative photometric galaxy cosmological analysis relying on a constant bias model);
– flat wCDM model with fiducial cosmological parameters from Planck 2013 ΛCDM constraints (Planck Collaboration XVI 2014):

$\{\Omega_b h^2,\Omega_c h^2,n_s,\sigma_8,H_0,w\}=\{0.022,0.12,0.96,0.83,67,-1\};$
– other non-Gaussian covariance terms beyond SSC are neglected.

With these specifications, we compute the galaxy angular power spectrum $C_\ell^{\mathrm{gal}}$ using the halo model and halo occupation distribution as done in Lacasa & Rosenfeld (2016). On the one hand we find that the shot-noise level is completely negligible, so that we are signal-dominated over all the scales of interest. On the other hand, we find that the nonlinear part of the power spectrum is important on those scales, with the one-halo term dominating the two-halo term for ℓ > 800.

For the S matrix, computed following Appendix B.2, we find the value

$\begin{aligned} S_{i,i} = 6.2 \times 10^{-7}. \end{aligned}$ (49)

Following Sect. 3.4.1 and assuming a scale independent response R = 5, this translates into a knee multipole ℓ_SSC ∼ 360, and a plateau at S/N ∼ 250.
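These two numbers follow directly from Eqs. (35) and (36), as the short sketch below shows. The function names are ours, and the optional `f_sky` and `n_p_eff` arguments simply implement Eq. (37), with defaults corresponding to the full-sky, single-probe case considered here.

```python
import math

def ell_ssc(R, S_ii, f_sky=1.0, n_p_eff=1.0):
    """Knee multipole of Eq. (37); reduces to Eq. (36) with the defaults."""
    return math.sqrt(2.0 / (n_p_eff * R**2 * f_sky * S_ii))

def snr_plateau(R, S_ii):
    """Asymptotic S/N of Eq. (35)."""
    return 1.0 / math.sqrt(R**2 * S_ii)

knee = ell_ssc(5.0, 6.2e-7)
plateau = snr_plateau(5.0, 6.2e-7)
print(round(knee), round(plateau))   # prints "359 254"
```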

The cumulative S/N as a function of the maximum multipole of the analysis ℓ_max is shown in Fig. 1. The (S/N) is computed with three different covariance matrices: Gaussian only, Gaussian plus SSC through Eq. (34), and Gaussian plus a full SSC computation following Lacasa & Rosenfeld (2016). We see that the Gaussian covariance matrix strongly overestimates the significance of the angular power spectrum, by a factor of ∼5.7 at ℓ_max = 2000. However, the S_ij approximation with a constant response recovers the full SSC computation over the whole multipole range to a precision better than 7%. We also see that the value of the knee multipole and the (S/N) plateau mentioned previously indeed capture the features of the full SSC curve. The S_ij approximation is thus validated at the level of the S/N.

Fig. 1.

Comparison of the cumulative S/N values up to a multipole ℓ_max with different covariances: Gaussian only, with the S_ij approximation, and with a full SSC computation.

The impact of the SSC on the cosmological parameter estimation is depicted in Figs. 2 and 3. Figure 2 shows the angle cos²θ_α, defined by Eq. (40), for the main cosmological parameters of the wCDM model considered here. Interestingly, all curves are significantly different from zero. Following Sect. 3.4.2, this means that all parameters are going to be significantly impacted by SSC when reaching small scales (i.e., ℓ ≳ ℓ_SSC). In this specific case, a value of cos²θ_α significantly different from zero will thus matter for an ℓ_max greater than ∼400. Some parameters should be affected less (Ω_bh², Ω_ch², and h), and others more (σ₈, n_s, and w₀).

Fig. 2.

cos²θ for each cosmological parameter as a function of the maximum multipole ℓ_max. Parameters with cos²θ close to 1 are the most affected by SSC when it starts to dominate the covariance (i.e., for multipoles ℓ ≳ ℓ_SSC with ℓ_SSC ∼ 360 in this specific case).

Fig. 3.

Comparison of $\sqrt{F_{\alpha\alpha}}$ for an analysis up to a multipole ℓ_max with different covariances: Gaussian only (blue), S_ij approximation with a constant response (green), S_ij approximation with a scale-dependent response (cyan), full SSC computation (red).

Figure 3 confirms the qualitative conclusion of Fig. 2. It shows for each cosmological parameter the square root of the Fisher element as a function of the maximum multipole ℓ_max of analysis. This coefficient has an evolution with ℓ_max similar to the S/N⁷, and its inverse is the error bar on the considered cosmological parameter when the other parameters are fixed⁸. As in the case of the (S/N), on large scales (ℓ_max < ℓ_SSC) the impact of the SSC is negligible. This happens in spite of the angle cos²θ_α being close to one for all the parameters, simply because the impact of the SSC on the (S/N) is subdominant: Y ≪ 1. When we integrate the signal to smaller scales, however (ℓ_max ≳ ℓ_SSC), the Gaussian covariance significantly overestimates the strength of the angular power spectrum: the Fisher elements become significantly smaller with the full covariance compared to the Gaussian case. Furthermore, the parameters most affected by SSC are indeed those that were identified in Fig. 2 (σ₈, n_s, and w₀), although the other parameters are still significantly affected. We note that this finding is in agreement with the recent analysis of Barreira et al. (2018a) in the different case of weak lensing.

We now briefly discuss the specific case of h, which is paradigmatic of how the SSC has an impact on the estimation of cosmological parameters via the interplay between Y and cos²θ_α. For ℓ_max < ℓ_SSC, the Gaussian covariance and the full covariance give the same results since Y ≪ 1. Then for a range of multipoles roughly given by 500 ≲ ℓ_max ≲ 1500, the Gaussian covariance and the full covariance do not give the same S/N; however, the impact of SSC on h is small in this range of multipoles because cos²θ_h is close to zero, as shown in Fig. 2, which suppresses the impact of the SSC. Finally, for ℓ_max ≳ 1500, both Y ≫ 1 and cos²θ_h ∼ 1, and the impact of the SSC is clearly seen since the Gaussian covariance now overestimates F_h, h compared to the full covariance.

Comparing the S_ij approximation with a constant response R to the full SSC computation, we find that the former reproduces the Fisher elements of the less affected parameters (Ω_bh², Ω_ch², and h) to 3% precision, the Fisher on n_s to better than 8% precision, but it is less precise for two of the heavily affected parameters (σ₈ and w₀), where the precision is around 30% at the highest ℓ_max. This is potentially an issue for application to surveys such as Euclid, with requirements of 10% precision on marginalized errors on cosmological parameters, if the analysis is pursued to these small scales. We found that the approximation respects the 10% precision on SSC up to ℓ_max = 1000, but becomes less precise afterwards. We tracked the issue, and found that it originates from the assumption of a constant response R_ℓ = dlnC_ℓ/dδ_b. Using the proper scale-dependent response shown in Appendix C, we found that the S_ij approximation reproduces the Fisher elements of the full SSC computation to 5% precision over the whole multipole range.

The S_ij approximation is thus validated at the level of parameter constraints. In the constant response case, it reproduces the Fisher constraints to acceptable precision, except deep in the SSC-dominated regime for the most affected parameters. Accounting for the scale-dependence of the response allows us to recover all parameter constraints to sufficient precision, if it is necessary to pursue the analysis to small scales.

5. Generalizations of the SSC approximation

5.1. Generalization to other statistics

A first note is that for 3D statistics there is no need for an approximation like that in Eq. (4). In such cases, analyses commonly use the (often implicit) assumption of no redshift evolution within the volume, and this assumption means that the SSC covariance already takes the same form as Eq. (4) (see, e.g., Takada & Hu 2013 for the 3D matter power spectrum P(k)).

Number counts. The case of cluster number counts is where the S_ij approximation was first devised, by Hu & Kravtsov (2003). Among other counts of interest for LSS surveys are those of galaxies and shear peaks. Generally, we denote by N_α(i_z) the number counts, with an index α specifying the type of object as well as the bin of the considered property (e.g., mass, luminosity, color, shear S/N). The response of these counts is the first order bias,

$\begin{aligned} \frac{\partial N_\alpha}{\partial \delta_b} = b_\alpha \ N_\alpha, \end{aligned}$ (50)

i.e., R_α = b_α. The analog of Eq. (4) is then

$\begin{aligned} \mathrm{Cov}_{\mathrm{SSC}}\left(N_\alpha(i_z),N_\beta(j_z)\right) \approx \ b_\alpha(i_z) \, N_\alpha(i_z) \ b_\beta(j_z) \, N_\beta(j_z) \times S^{\alpha;\beta}_{i_z;j_z}. \end{aligned}$ (51)

The SSC approximation is also extended to the cross-covariance with an angular power spectrum

$\begin{aligned} \mathrm{Cov}_{\mathrm{SSC}}\left(N_\alpha(i_z),C_{\ell'}^{CD}(k_z,l_z)\right) \approx &\ b_\alpha(i_z) \, N_\alpha(i_z) \ R_{\ell'}^{CD} \, C_{\ell'}^{CD}(k_z,l_z) \\ &\times S^{\alpha;C,D}_{i_z;k_z,l_z}. \end{aligned}$ (52)

In the case of the angular power spectrum, we were able to consider the response as having weak scale dependence, and thus approximate R_ℓ = cst. In the case of counts this is generally not the case. For instance, for clusters the bias has a strong dependence on mass and will thus vary from bin to bin.
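A minimal sketch of Eq. (51) is given below. The function name, the array layout and all numbers are illustrative only, and we make the simplifying assumption that the S matrix depends on the redshift bins but not on the object type α, so that a single (n_z, n_z) matrix can be used.

```python
import numpy as np

def counts_ssc_cov(N, b, S):
    """SSC covariance of number counts following Eq. (51).

    N, b : (n_z, n_bins) counts and first-order biases per redshift/property bin
    S    : (n_z, n_z) matrix, assumed here to depend on redshift bins only
    Returns an array of shape (n_z, n_bins, n_z, n_bins).
    """
    resp = b * N                                     # dN/d(delta_b), Eq. (50)
    return np.einsum("ia,jb,ij->iajb", resp, resp, S)

# Made-up counts, biases, and S matrix for two redshift bins and two mass bins.
N = np.array([[1000.0, 100.0], [800.0, 50.0]])
b = np.array([[2.0, 4.0], [2.5, 5.0]])
S = np.array([[1.0e-6, 2.0e-7], [2.0e-7, 8.0e-7]])

cov = counts_ssc_cov(N, b, S)
cov_2d = cov.reshape(4, 4)       # flattened (redshift, property) ordering
print(cov_2d)
```

Because the bias differs from bin to bin, the rows of this matrix are not simply proportional to the counts, unlike the constant-response case of the power spectrum.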

Correlation function. The 2D correlation function is a linear transform of the angular power spectrum

$\begin{aligned} w(\theta) = \sum_\ell \frac{2\ell+1}{4\pi} \ C_\ell \ P_\ell(\cos\theta). \end{aligned}$ (53)

It is thus readily seen that its SSC covariance takes the form

$\begin{aligned} \mathrm{Cov}_{\mathrm{SSC}}\left(w^{AB}_{i_z,j_z}(\theta),w^{CD}_{k_z,l_z}(\theta')\right) \approx \ \tilde{w}^{AB}_{i_z,j_z}(\theta) \ \tilde{w}^{CD}_{k_z,l_z}(\theta') \times S^{A,B;C,D}_{i_z,j_z;k_z,l_z}, \end{aligned}$ (54)

with

$\begin{aligned} \tilde{w}(\theta) = \sum_\ell \frac{2\ell+1}{4\pi} \ R_\ell \ C_\ell \ P_\ell(\cos\theta) = (R * w)(\theta) \end{aligned}$ (55)

the convolution product (denoted *) of the original correlation function with the response R(θ). If the response R_ℓ ≡ R can be assumed constant, as in most of this article, then R(θ) reduces to a Dirac distribution R(θ) = R × δ(θ). The convolution product of R with w thus simplifies to a standard product, i.e., $\tilde{w}(\theta) = R \times w(\theta)$, so that Eq. (54) takes the same form as Eq. (4).
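Equations (53) to (55) can be illustrated with the sketch below, which evaluates the Legendre sums with `numpy.polynomial.legendre.legval`. The spectrum is a toy one, the function name `w_from_cl` is ours, and a single redshift bin and probe are assumed so that S is a scalar. With a constant response, the response-weighted correlation function is checked to equal R × w(θ).

```python
import numpy as np
from numpy.polynomial import legendre

def w_from_cl(cl, theta):
    """Eq. (53): w(theta) from C_ell given for ell = 0 .. len(cl) - 1."""
    ells = np.arange(len(cl))
    coeffs = (2 * ells + 1) / (4 * np.pi) * cl
    # legval evaluates sum_ell coeffs[ell] * P_ell(x)
    return legendre.legval(np.cos(theta), coeffs)

ells = np.arange(501)
cl = 1.0 / (ells + 10.0) ** 2                  # toy spectrum, not a physical model
theta = np.radians(np.linspace(0.1, 5.0, 20))  # angles in radians
R, S = 5.0, 6.2e-7

w = w_from_cl(cl, theta)
R_ell = R * np.ones_like(cl)                   # constant response; may be replaced
w_tilde = w_from_cl(R_ell * cl, theta)         # Eq. (55)
cov_ssc = np.outer(w_tilde, w_tilde) * S       # Eq. (54), single bin and probe

print(np.allclose(w_tilde, R * w))
```

Replacing `R_ell` by a scale-dependent array gives the general case of Eq. (55) with no other change.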

Bispectrum. Analogously to Eq. (4), the SSC covariance for bispectra coefficients will take the form

$\begin{aligned} &\mathrm{Cov}_{\mathrm{SSC}}\Big(b_{\ell_1\ell_2\ell_3}^{ABC}(i_z,j_z,k_z), b_{\ell'_1\ell'_2\ell'_3}^{DEF}(l_z,m_z,n_z)\Big) \approx \ R_{\ell_1\ell_2\ell_3}^{ABC} \, b_{\ell_1\ell_2\ell_3}^{ABC}(i_z,j_z,k_z) \\ &\qquad \times R_{\ell'_1\ell'_2\ell'_3}^{DEF} \, b_{\ell'_1\ell'_2\ell'_3}^{DEF}(l_z,m_z,n_z) \ S^{A,B,C;D,E,F}_{i_z,j_z,k_z;l_z,m_z,n_z}. \end{aligned}$ (56)

The 3D bispectrum SSC was studied extensively in Chan et al. (2018), who found that the growth-only response from perturbation theory is $R_{k_1 k_2 k_3} = \frac{433}{126}$, while the total response ranges between 4 and 6. Small BAO features are visible in these responses, but should be washed out in 2D projected quantities. A constant response R = 5 may thus give an acceptable first-order approximation, as we found in this article for C_ℓ.

5.2. Generalization to the likelihood

A usual assumption is that the likelihood of the observable vector 𝒪 (e.g., the power spectrum C = (C_ℓ)_{ℓ = ℓ_min⋯ℓ_max}, as in most of this article) given model parameters p, is a multivariate Gaussian

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p}) = \mathcal{N}(\bar{\mathcal{O}}(\boldsymbol{p}),\mathcal{C}_\mathrm{tot}), \end{aligned}$ (57)

where 𝒩(X, Σ) denotes the Gaussian distribution with mean X and covariance Σ.

Given that 𝒞_tot = 𝒞_std + 𝒞_SSC (e.g., for the power spectrum 𝒞_std = 𝒞_G) and using properties of Gaussian distributions, this can be rewritten (artificially for the moment) as the convolution

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p}) &= \mathcal{N}(\bar{\mathcal{O}}(\boldsymbol{p}),\mathcal{C}_\mathrm{std}) * \mathcal{N}(0,\mathcal{C}_{\mathrm{SSC}}) \\ &= P_\mathrm{std}(\mathcal{O}|\boldsymbol{p}) * \mathcal{N}(0,\mathcal{C}_{\mathrm{SSC}}), \end{aligned}$ (58)

where P_std is the standard (no-SSC) likelihood. The second pdf can be interpreted physically as the probability that super-survey modes induce a shift of the observable vector.

We can then reformulate the probability in a form similar to that found for cluster counts by Lima & Hu (2004) (see also Appendix E),

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p}) = \int \mathrm{d}(\delta\mathcal{O}) \ P_\mathrm{std}(\mathcal{O} | \bar{\mathcal{O}}(\boldsymbol{p})+\delta\mathcal{O},\mathcal{C}_{\mathrm{G}}) \ P_{\mathrm{SSC}}(\delta\mathcal{O} | 0,\mathcal{C}_{\mathrm{SSC}}), \end{aligned}$ (59)

where the shift δ𝒪 is a random variable with probability P_SSC, i.e., centered on zero and with covariance matrix 𝒞_SSC.

At first order, the observable reacts to the change of background δ_b induced by long wavelength modes, through the response $\frac{\partial \mathcal{O}}{\partial \delta_b}$ (e.g., the power spectrum response discussed in Sect. 2 and Appendix C). Noting that

$\begin{aligned} \bar{\mathcal{O}}(\boldsymbol{p},\delta_b) = \bar{\mathcal{O}}(\boldsymbol{p}) + \delta\mathcal{O} = \bar{\mathcal{O}}(\boldsymbol{p}) + \frac{\partial \mathcal{O}}{\partial \delta_b}\delta_b \end{aligned}$ (60)

is the (average) observable in a part of the universe with a background change δ_b, we can rewrite the likelihood as

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p}) = \int \mathrm{d}\delta_b \ \underbrace{P_\mathrm{std}(\mathcal{O} | \bar{\mathcal{O}}(\boldsymbol{p},\delta_b),\mathcal{C}_{\mathrm{G}}) \ P_{\mathrm{SSC}}(\delta_b|0,S)}_{\equiv P(\mathcal{O}|\boldsymbol{p},\delta_b)}, \end{aligned}$ (61)

where the S matrix defined in Eq. (5) appears. We note that δ_b is not a simple scalar; it depends on the pair of probes and redshift bins of the observable considered.

Because δ_b is the density field smoothed over very large scales (the whole survey area), it is safe to assume that it has a Gaussian distribution, i.e., P_SSC = 𝒩(0, S). However, the same may not be true of P_std(𝒪|p), where the observable may not have a Gaussian likelihood. For instance in the case of cluster counts studied in Lima & Hu (2004), the observable follows a Poissonian distribution if S = 0. In the case of the angular power spectrum, which is a quadratic quantity, the observable follows a Wishart distribution on the full sky (e.g., Hamimeche & Lewis 2008), which has an important impact on inference from large angular scales. For galaxy lensing, this has been shown to be of importance by Sellentin et al. (2018). In the case of the bispectrum, it is also known that the likelihood should not be Gaussian (Chan & Blot 2017), although no numerical or analytical form exists for it at the moment.

This is where the rewriting of Eq. (61) becomes useful in practice (beyond giving a nice physical interpretation) since we can now use for $P_{\mathrm{std}}(\mathcal{O} | \bar{\mathcal{O}}(\boldsymbol{p},\delta_b))$ a more realistic and possibly non-Gaussian pdf.

This means that SSC can be accounted for at the likelihood level through the hierarchical model

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p},\delta_b) = P_\mathrm{std}\left(\mathcal{O}|\bar{\mathcal{O}}(\boldsymbol{p},\delta_b)\right) \times \pi(\delta_b|\boldsymbol{p}), \end{aligned}$ (62)

where π is the prior on δ_b, i.e., 𝒩(0, S), where the S matrix depends implicitly on cosmological parameters (and potentially on other model parameters if they affect the weighting kernels). The value of δ_b then needs to be marginalized over to get constraints on the standard model parameters.

We note that in the separate universe approach, a region with a background change in a cosmology p can be simulated as a region with no background change but a different cosmology with parameters p′(p, δ_b) (Wagner et al. 2015). Thus, the likelihood Eq. (62) may be implemented with only small changes to current no-SSC likelihood pipelines,

$\begin{aligned} P(\mathcal{O}|\boldsymbol{p},\delta_b) = P_\mathrm{std}\left(\mathcal{O}|\bar{\mathcal{O}}\left(\boldsymbol{p'}(\boldsymbol{p},\delta_b)\right)\right) \times \pi(\delta_b|\boldsymbol{p}), \end{aligned}$ (63)

which avoids having to model or measure the observable’s response, and means that accounting for SSC is as easy as including extra nuisance parameters.
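To make the hierarchical formulation concrete, the sketch below marginalizes numerically over δ_b in the simplest setting and compares the result with the Gaussian likelihood of Eq. (57). Everything in it is an illustrative assumption: δ_b is treated as a single scalar (one probe, one redshift bin), P_std is Gaussian, the response is linear as in Eq. (60), the integral of Eq. (61) is done on a regular grid, and the function names and numbers are ours. For a non-Gaussian P_std, only `gauss_pdf` in the integrand would need to be replaced.

```python
import numpy as np

def gauss_pdf(x, mean, cov):
    """Multivariate Gaussian density N(mean, cov) evaluated at x."""
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)
    chi2 = d @ np.linalg.solve(cov, d)
    return np.exp(-0.5 * (chi2 + logdet + len(x) * np.log(2.0 * np.pi)))

def marginal_likelihood(obs, mean, resp, cov_std, S, n_grid=2001, n_sigma=8.0):
    """Eq. (61) for a scalar delta_b, integrated on a regular grid."""
    sigma_b = np.sqrt(S)
    delta_b = np.linspace(-n_sigma * sigma_b, n_sigma * sigma_b, n_grid)
    step = delta_b[1] - delta_b[0]
    prior = np.exp(-0.5 * (delta_b / sigma_b) ** 2) / (np.sqrt(2.0 * np.pi) * sigma_b)
    # Standard likelihood with the mean shifted by the response, Eq. (60).
    like = np.array([gauss_pdf(obs, mean + resp * db, cov_std) for db in delta_b])
    return np.sum(like * prior) * step

mean = np.array([1.0, 0.8, 0.5])          # toy model prediction
resp = 5.0 * mean                         # dO/d(delta_b) with a constant R = 5
cov_std = 0.01 * np.eye(3)                # toy no-SSC covariance
S = 1.0e-3                                # toy variance of delta_b
obs = mean + np.array([0.1, 0.05, 0.08])  # toy data vector

p_marg = marginal_likelihood(obs, mean, resp, cov_std, S)
p_direct = gauss_pdf(obs, mean, cov_std + S * np.outer(resp, resp))
print(p_marg, p_direct)
```

In an MCMC setting one would typically sample δ_b as an extra nuisance parameter with prior 𝒩(0, S) rather than integrate it on a grid; the grid is used here only to make the check explicit.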

6. Conclusion

We presented a fast and easy approximation for the super sample covariance of 2D projected statistics; the study was mainly focused on the angular power spectrum C_ℓ and generalization to other statistics given later. In addition to the considered probe, this S_i, j approximation relies on two ingredients:

– the S matrix, which is an integral of the (linear) matter power spectrum convolved with the survey window. In the flat sky limit, computable expressions are found in the literature (e.g., Aguena & Lima 2018; see Appendix B.2 for expressions for the full sky and partial sky cases).
– the probe’s response. We found the simple ansatz R = 5 to perform very well for the case of C_ℓ. It is sufficient for Euclid precision requirements on the constraints on cosmological parameters of the wCDM model, up to ℓ_max ∼ 1000. To push to smaller scales for σ₈ and w, it is necessary to account for the scale dependence of R_ℓ (see Appendix C, Table C.1).

Neither of these ingredients requires expensive computations or physical models additional to the usual cosmological tools necessary to predict the considered probes. The S_i, j approximation can thus readily be implemented in cosmological prediction codes. Furthermore, we showed that SSC can be included in cosmological pipelines (for significance quantification, Fisher forecasts, or MCMC parameter estimation) through a simple correction to the Gaussian covariance case, not even spoiling the speed-up induced by a diagonal covariance.
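The "simple correction to the Gaussian covariance case" can be illustrated for the signal-to-noise ratio of a single probe and redshift bin. The sketch below assumes a diagonal Gaussian covariance in the f_sky approximation, a constant response R = 5, and the scalar Y = S Vᵀ 𝒞_G⁻¹ V that the Sherman-Morrison formula yields for a rank-1 update; the function name, the toy spectrum, and the values of f_sky and S are hypothetical.

```python
import numpy as np

def snr2_with_ssc(c_ell, var_g, s_ii, response=5.0):
    """(S/N)^2 including SSC as a rank-1 correction to a diagonal covariance.

    c_ell : signal vector; var_g : diagonal of the Gaussian covariance;
    s_ii  : S-matrix element of the redshift bin; response : constant R.
    """
    snr2_g = np.sum(c_ell**2 / var_g)       # Gaussian (S/N)^2, O(n) operations
    v = response * c_ell                    # V_ell = R * C_ell
    y = s_ii * np.sum(v**2 / var_g)         # Y = S * V^T C_G^{-1} V
    return snr2_g / (1.0 + y), y

# Toy inputs, for illustration only
ells = np.arange(50, 1001)
c_ell = 1.0e-6 * (ells / 100.0) ** -1.2
var_g = 2.0 * c_ell**2 / ((2 * ells + 1) * 0.36)   # f_sky = 0.36 assumed
snr2, y = snr2_with_ssc(c_ell, var_g, s_ii=1.0e-4)
print(np.sqrt(snr2), y)
```

No dense matrix is built or inverted: the cost stays linear in the number of multipoles, which is how the speed-up of a diagonal covariance is preserved.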

The S_i, j approximation also allows us to easily identify which cosmological parameters are going to be affected by SSC and at what level through the fast computation of the cos θ_α coefficients (Eq. (40)) and the scalar Y (Eq. (32)).

To facilitate the use of the S_i, j approximation by the community, we publicly release a Python code that implements it, together with examples of applications⁹.

In the case of photometric galaxy clustering in a redshift bin 0.9 < z < 1 and with Euclid-like specifications, we found all cosmological constraints to be heavily impacted by SSC. This makes it necessary to include this effect in forecasts and analysis pipelines for future galaxy surveys, a task now made much easier by the S_i, j approximation. Furthermore, we showed how this approximation can be generalized beyond the angular power spectrum to other statistics such as number counts, the correlation function, and the bispectrum, where we indicated the corresponding probe’s response¹⁰.

Finally, we showed that the S_i, j approximation can be generalized at the likelihood level, which avoids having to assume a Gaussian likelihood, an assumption that is incorrect in many cases, for example cluster counts at high masses, C_ℓ at low ℓ or the correlation function on large scales, and the bispectrum. We will explore these likelihood developments in future works.

¹

https://github.com/fabienlacasa/PySSC

²

This is the case for the shear signals and the iSW effect since they are integrated signals from the redshift of the source plane to the observer.

³

For all these cases of interest, the matrix-vector products in Eq. (7) have to be understood as

$\begin{aligned} I=\sum_{\ell,\ell'=\ell_{\mathrm{min}}}^{\ell_{\mathrm{max}}} X_\ell \,\left[\mathcal{C}^{-1}\right]_{\ell\ell'}\,Y_{\ell'}, \end{aligned}$

i.e., it corresponds to a cumulative quantity over multipole.

⁴

In the case of many uncorrelated redshift bins, Eq. (34) is generalized by summing over the redshift bins, i.e.,

$\begin{aligned} \left(S/N\right)^2 = \sum_i \frac{\left(S/N\right)^2_{G,i}}{1+Y_i}. \end{aligned}$

⁵

It is exactly the number of probes if they are uncorrelated, but it goes down to 1 if they are totally correlated.

⁶

The case cos θ_α = ±1 corresponds to the parameter α being ± the amplitude of the power spectrum. In this case, up to a normalization, we go back to the case of the S/N studied in Sect. 3.4.1.

⁷

The unmarginalized S/N on a given parameter α is simply obtained by multiplying this coefficient with the input value of the parameter. It coincides with the S/N in the case of a model parameter being the amplitude A of the power spectrum: $C_\ell = A \, C_\ell^\mathrm{template}$.

⁸

We do not attempt any marginalization or production of realistic forecasts. Our framework is different from the usual cosmological analyses since we use the halo model and HOD; for instance, we cannot marginalize over galaxy bias.

⁹

https://github.com/fabienlacasa/PySSC

¹⁰

Namely the object’s bias for counts, and R = 5 for the correlation function and the bispectrum.

¹¹

This is easily related to the binning operator, P_bℓ, commonly used in the CMB context through P_bℓ = S_bℓ/Δ_b.

¹²

Typically ℓ(ℓ + 1) in the CMB context.

Acknowledgments

We thank Stéphane Ilić for his help with the integrated Sachs-Wolfe effect. F.L. acknowledges support from the Swiss National Science Foundation. J.G. acknowledges partial support from the ByoPiC project (https://byopic.eu/team) funded by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme grant agreement ERC-2015-AdG 695561.

References

Aguena, M., & Lima, M. 2018, Phys. Rev. D, 98, 123529
Amendola, L., Appleby, S., Bacon, D., et al. 2013, Liv. Rev. Rel., 16, 6
Barreira, A., Krause, E., & Schmidt, F. 2018a, J. Cosmol. Astropart. Phys., 10, 053
Barreira, A., Krause, E., & Schmidt, F. 2018b, J. Cosmol. Astropart. Phys., 6, 015
Bartlett, M. S. 1951, Ann. Math. Stat., 22, 107
Chan, K. C., & Blot, L. 2017, Phys. Rev. D, 96, 023528
Chan, K. C., Moradinezhad Dizgah, A., & Noreña, J. 2018, Phys. Rev. D, 97, 043532
Dark Energy Survey Collaboration (Abbott, T. M. C., et al.) 2018, Phys. Rev. D, 98, 043526
Hamimeche, S., & Lewis, A. 2008, Phys. Rev. D, 77, 103013
Hildebrandt, H., Viola, M., Heymans, C., et al. 2017, MNRAS, 465, 1454
Hu, W., & Kravtsov, A. V. 2003, ApJ, 584, 702
Kilbinger, M., Heymans, C., Asgari, M., et al. 2017, MNRAS, 472, 2126
Lacasa, F. 2018, A&A, 615, A1
Lacasa, F., & Kunz, M. 2017, A&A, 604, A104
Lacasa, F., & Rosenfeld, R. 2016, J. Cosmol. Astropart. Phys., 8, 005
Lacasa, F., Lima, M., & Aguena, M. 2018, A&A, 611, A83
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193]
Lewis, A., & Challinor, A. 2006, Phys. Rep., 429, 1
Li, Y., Schmittfull, M., & Seljak, U. 2018, J. Cosmol. Astropart. Phys., 2, 022
Lima, M., & Hu, W. 2004, Phys. Rev. D, 70, 043504
LSST Science Collaborations (Abell, P. A., et al.) 2009, ArXiv e-prints [arXiv:0912.0201]
Planck Collaboration XVI. 2014, A&A, 571, A16
Planck Collaboration XXI. 2016, A&A, 594, A21
Planck Collaboration VI. 2018, ArXiv e-prints [arXiv:1807.06209]
Sellentin, E., Heymans, C., & Harnois-Déraps, J. 2018, MNRAS, 477, 4879
Sherman, J., & Morrison, W. J. 1950, Ann. Math. Stat., 21, 124
Takada, M., & Hu, W. 2013, Phys. Rev. D, 87, 123504
Takada, M., & Spergel, D. N. 2014, MNRAS, 441, 2456
Takahashi, R., Soma, S., Takada, M., & Kayo, I. 2014, MNRAS, 444, 3473
Wagner, C., Schmidt, F., Chiang, C.-T., & Komatsu, E. 2015, MNRAS, 448, L11

Appendix A: Example of weighting kernels

Here we give the weighting kernels and 3D power spectrum needed to compute the angular power spectra, Eq. (3), in the convention used in this article for different LSS observables.

Galaxy clustering. The observable is the projected galaxy number density in a redshift bin. In this case the weighting kernel is

$\begin{aligned} W_{i_z}^\mathrm{GC}(z)= \frac{n_{\mathrm{gal}}^{(i_z)}(z)}{N_{\mathrm{gal}}(i_z)}, \end{aligned}$ (A.1)

with n_gal(z) the 3D comoving galaxy density and N_gal(i_z) the 2D number of galaxies per solid angle. The 3D power spectrum is the galaxy power spectrum, i.e., P_gg(k) = P_gal(k), which on large scales is linked to the matter power spectrum via $P_\mathrm{gg}(k)=b_\mathrm{g}^2 \, P_{\mathrm{m}}(k)$.

Weak lensing / shear. The observable is the galaxy shear averaged over a redshift bin. In this case the weighting is

$\begin{aligned} W^{\kappa_{\mathrm{gal}}}(z)= \frac{\mathcal{A}}{a(z) \, r(z)} \, q_{i_z}(z), \end{aligned}$ (A.2)

with $\mathcal{A}=\frac{3}{2}\Omega_m \left(\frac{H_0}{c}\right)^2$ and $q_{i_z}(z) = \int \mathrm{d}z' \, \frac{n_{\mathrm{gal}}^{(i_z)}(z')}{N_{\mathrm{gal}}(i_z)} \frac{r'-r}{r'}$ the lensing efficiency (e.g., Kilbinger et al. 2017). The 3D power spectrum is the matter power spectrum P_m(k).

CMB lensing. The observable is the distortion of the CMB temperature anisotropies. In this case the weighting is

$\begin{aligned} W^{\kappa_\mathrm{CMB}}(z)= \frac{\mathcal{A}}{a(z) \, r(z)} \, \frac{r_*-r}{r_*}, \end{aligned}$ (A.3)

with r_* the comoving distance to the CMB last scattering surface (Lewis & Challinor 2006), and the 3D power spectrum is the matter power spectrum P_m(k).

Integrated Sachs-Wolfe effect. The observable is the iSW contribution to the temperature anisotropies of the CMB. In this case the weighting kernel is (Planck Collaboration XXI 2016)

$\begin{aligned} W^\mathrm{iSW}(k,z) = - \frac{2 \mathcal{A}}{k^2} \, \frac{\mathrm{d}\, (1+z)\, G(z)}{\mathrm{d} z} \Big/ \left(G(z) \, r^2(z) \frac{\mathrm{d} r}{\mathrm{d} z}\right), \end{aligned}$ (A.4)

with G(z) the linear growth function. The 3D power spectrum is the matter power spectrum P_mm(k).

We note that this kernel does not depend only on redshift, but also on wavenumber k, but since the latter dependence is factorizable, it cancels out in the S matrix and thus does not impact the applicability of the approximation Eq. (4).

We also note that the Limber approximation, used throughout this article, is poorly adapted to the iSW signal because it peaks at low multipoles. Lacasa (2018) provided expressions for super-sample covariance without Limber’s approximation. We leave the generalization of the present SSC approximation to this no-Limber case to future work. However, as the SSC impact peaks on small scales, we expect that its impact on iSW constraints should be small and that the present approximation should thus be good enough to gauge its level.

Appendix B: Particular cases for σ²(z₁,z₂) and S matrix

σ²(z₁, z₂)

In full sky we have (Lacasa & Rosenfeld 2016)

$\begin{aligned} \sigma^2(z_1,z_2) = \frac{1}{2\pi^2}\int k^2 \, \mathrm{d} k \, P_m(k|z_{12}) \, j_0(k r_1) \, j_0(k r_2). \end{aligned}$ (B.1)
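A direct numerical evaluation of Eq. (B.1) can be sketched as follows. The power spectrum used here is a toy analytic shape (rising as k, falling as k⁻³) taken as redshift independent; it is an assumption for illustration, as are the function name, the k grid, and the comoving distances. A real application would use a linear P_m(k|z₁₂) from a Boltzmann code.

```python
import numpy as np

def sigma2_full_sky(r1, r2, k, pk):
    """Eq. (B.1) by trapezoidal integration on the supplied k grid."""
    # j0(x) = sin(x)/x, while np.sinc(t) = sin(pi t)/(pi t)
    j0_1 = np.sinc(k * r1 / np.pi)
    j0_2 = np.sinc(k * r2 / np.pi)
    integrand = k**2 * pk * j0_1 * j0_2
    integral = np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(k))
    return integral / (2.0 * np.pi**2)

# Toy inputs: the k grid must resolve the oscillations of period ~ 2 pi / r
k = np.linspace(1.0e-5, 1.0, 200001)
pk = 2.0e4 * (k / 0.02) / (1.0 + (k / 0.02) ** 2) ** 2   # hypothetical P(k)
r1, r2 = 2000.0, 2300.0                                  # hypothetical distances

s11 = sigma2_full_sky(r1, r1, k, pk)
s22 = sigma2_full_sky(r2, r2, k, pk)
s12 = sigma2_full_sky(r1, r2, k, pk)
print(s11, s22, s12)
```

Because the integration weights and P(k) are positive, the result behaves like a covariance: the diagonal values are positive and the off-diagonal value obeys the Cauchy-Schwarz bound.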

In the partial sky case, σ²(z₁, z₂) can be expanded in spherical harmonics (Lacasa et al. 2018) to get

$\begin{aligned} \sigma^2(z_1,z_2) = \frac{1}{\Omega_\mathrm{S}^2} \sum_\ell (2\ell+1) \, C_\ell(W) \, C_\ell^{\mathrm{m}}(z_1,z_2), \end{aligned}$ (B.2)

where Ω_S = 4π f_sky is the solid angle covered by the survey, C_ℓ(W) is the angular power spectrum of the survey mask, and $C_\ell^{\mathrm{m}}$ is the angular power spectrum of matter given by

$\begin{aligned} C_\ell^{\mathrm{m}}(z_1,z_2) = \frac{2}{\pi} \int k^2 \, \mathrm{d} k \; j_\ell(k r_1) \, j_\ell(k r_2) \; P_{\mathrm{m}}(k|z_{12}). \end{aligned}$ (B.3)

S matrix

Assuming that super-survey modes can be described by linear theory, the matter power spectrum can be written as P_m(k|z₁₂) = G(z₁)G(z₂)P(k), where G(z) is the growth function and P(k) denotes the power spectrum at z = 0.

It then results from its definition Eq. (5) that the S matrix is given by

$\begin{aligned} S^{A,B;C,D}_{i_z,j_z;k_z,l_z} = \frac{1}{2\pi^2}\int k^2 \, \mathrm{d} k \, P(k) \, \mathcal{I}^{AB}_{i_z,j_z}(k) \, \mathcal{I}^{CD}_{k_z,l_z}(k), \end{aligned}$ (B.4)

where

$\begin{aligned} \mathcal{I}^{AB}_{i_z,j_z}(k) = \int \mathrm{d} V \, \frac{W_{i_z}^A(z) \, W_{j_z}^B(z)}{I^{AB}(i_z,j_z)} \, G(z) \, \mathrm{Win}(k r), \end{aligned}$ (B.5)

where we recall

$\begin{aligned} I^{AB}(i_z,j_z) = \int \mathrm{d} V \, W_{i_z}^A(z) \, W_{j_z}^B(z), \end{aligned}$ (B.6)

and the angle-averaged survey window is

$\begin{aligned} \mathrm{Win}(k r) = \left\{ \begin{array}{ll} j_0(kr) & \text{full sky} \\ \frac{4\pi}{\Omega_\mathrm{S}^2} \sum_\ell (2\ell+1) \, C_\ell(W) \, j_\ell(k r) & \text{partial sky} \end{array} \right. \end{aligned}$ (B.7)

with C_ℓ(W) the power spectrum of the survey mask (Lacasa et al. 2018).
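The window of Eq. (B.7) is straightforward to evaluate once the mask spectrum is known. The sketch below is hypothetical in its function name and interface, and it assumes the mask spectrum is normalized such that a mask equal to one over the whole sky has C₀(W) = 4π and C_ℓ(W) = 0 for ℓ > 0; with that convention the partial-sky expression reduces to j₀(kr) for a full-sky mask, which serves as a consistency check.

```python
import numpy as np
from scipy.special import spherical_jn

def window(kr, mask_cl=None, omega_s=4.0 * np.pi):
    """Angle-averaged survey window Win(kr) of Eq. (B.7).

    mask_cl : C_ell(W) for ell = 0..ell_max (None selects the full-sky case)
    omega_s : survey solid angle in steradians
    """
    kr = np.asarray(kr, dtype=float)
    if mask_cl is None:
        return spherical_jn(0, kr)
    mask_cl = np.asarray(mask_cl, dtype=float)
    ells = np.arange(len(mask_cl))
    # One row of j_ell(kr) per multipole
    jl = np.array([spherical_jn(int(ell), kr) for ell in ells])
    weights = (2 * ells + 1) * mask_cl
    return 4.0 * np.pi / omega_s**2 * np.tensordot(weights, jl, axes=1)

# Consistency check with a full-sky mask (assumed normalization C_0 = 4 pi)
kr = np.linspace(0.0, 30.0, 61)
full_sky_cl = np.array([4.0 * np.pi, 0.0, 0.0, 0.0])
print(np.max(np.abs(window(kr, full_sky_cl) - window(kr))))
```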

There is a special case where the S matrix can be further simplified analytically by assuming a full sky survey where the weighting kernel is constant within the redshift bins, and by approximating the growth function at the center of the redshift bin. If the weighting kernel is constant, we have

$\begin{aligned} W_i(z) = \frac{3}{r_{\mathrm{max}}^3(i) - r_{\mathrm{min}}^3(i)} \, \mathbb{1}_{z\in [z_{\mathrm{min}}(i),z_{\mathrm{max}}(i)]}. \end{aligned}$ (B.8)

This happens, for instance, in the case of galaxy clustering with perfect redshift determinations and if the galaxy comoving density can be considered constant n_gal(z) = cst. Then Eq. (B.6) simplifies to

$\begin{aligned} I^{AB}(i_z,j_z) = \frac{3 \, \delta_{i_z,j_z}}{r_{\mathrm{max}}^3(i_z) - r_{\mathrm{min}}^3(i_z)}. \end{aligned}$ (B.9)

This gives for Eq. (B.5)

$\begin{aligned} \mathcal{I}^{AB}_{i_z,j_z}(k) &\approx G(z_\mathrm{mean}(i_z)) \, \frac{3 \, \delta_{i_z,j_z}}{r_{\mathrm{max}}^3(i_z) - r_{\mathrm{min}}^3(i_z)} \int_{r_{\mathrm{min}}(i_z)}^{r_{\mathrm{max}}(i_z)} r^2 \, \mathrm{d} r \, j_0(kr) \\ &= \frac{3 \, G(z_\mathrm{mean}) \, \delta_{i_z,j_z}}{k\left(r_{\mathrm{max}}^3 - r_{\mathrm{min}}^3\right)} \left[r_{\mathrm{max}}^2 \, j_1(k r_{\mathrm{max}}) - r_{\mathrm{min}}^2 \, j_1(k r_{\mathrm{min}})\right], \end{aligned}$ (B.10)

which can be fed into Eq. (B.4) for the S matrix, leading to

$\begin{aligned} S^{A,B;C,D}_{i_z,j_z;k_z,l_z} = &\ \delta_{i_z,j_z} \, \delta_{k_z,l_z} \, \frac{3 \, G\left(z_\mathrm{mean}(i_z)\right)}{r_{\mathrm{max}}^3(i_z) - r_{\mathrm{min}}^3(i_z)} \, \frac{3 \, G\left(z_\mathrm{mean}(k_z)\right)}{r_{\mathrm{max}}^3(k_z) - r_{\mathrm{min}}^3(k_z)} \\ &\times \frac{1}{2\pi^2}\int k^2 \, \mathrm{d} k \, P(k) \\ &\times \left[r_{\mathrm{max}}^2(i_z) \, j_1\left(k r_{\mathrm{max}}(i_z)\right) - r_{\mathrm{min}}^2(i_z) \, j_1\left(k r_{\mathrm{min}}(i_z)\right)\right]/k \\ &\times \left[r_{\mathrm{max}}^2(k_z) \, j_1\left(k r_{\mathrm{max}}(k_z)\right) - r_{\mathrm{min}}^2(k_z) \, j_1\left(k r_{\mathrm{min}}(k_z)\right)\right]/k. \end{aligned}$ (B.11)
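The closed form of the radial integral entering Eqs. (B.10) and (B.11) follows from d[x² j₁(x)]/dx = x² j₀(x). As a hedged check, the sketch below compares it with a direct quadrature; the function names and the values of k and of the bin edges are hypothetical.

```python
import numpy as np
from scipy.integrate import quad
from scipy.special import spherical_jn

def radial_integral_analytic(k, r_min, r_max):
    """Closed form of int_{r_min}^{r_max} r^2 j0(k r) dr used in Eq. (B.10)."""
    return (r_max**2 * spherical_jn(1, k * r_max)
            - r_min**2 * spherical_jn(1, k * r_min)) / k

def radial_integral_numeric(k, r_min, r_max):
    """Same integral by adaptive quadrature, for cross-checking only."""
    value, _ = quad(lambda r: r**2 * spherical_jn(0, k * r),
                    r_min, r_max, limit=200)
    return value

k, r_min, r_max = 0.01, 2000.0, 2300.0   # hypothetical values
exact = radial_integral_analytic(k, r_min, r_max)
approx = radial_integral_numeric(k, r_min, r_max)
print(exact, approx)
```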

We note that this expression has become independent of the considered probes (A, B, C, D). Furthermore, the Bessel function j₁ is a sum of sines and cosines, $j_1(x)=\frac{\sin x}{x^2} - \frac{\cos x}{x}$, and thus the S matrix can be expressed in terms of Fourier transforms of the matter power spectrum. Specifically defining

$I_{c,n}^{\pm}(r_1,r_2) \equiv \int \mathrm{d} k \, P(k)/k^n \, \cos(k(r_1\pm r_2)),$ (B.12)

$I_{s,n}^{\pm}(r_1,r_2) \equiv \int \mathrm{d} k \, P(k)/k^n \, \sin(k(r_1\pm r_2)),$ (B.13)

$\begin{aligned} F(r_1,r_2) \equiv &-I_{c,4}^{+}(r_1,r_2)+I_{c,4}^{-}(r_1,r_2)-(r_1+r_2)\,I_{s,3}^{+}(r_1,r_2) \\ & +(r_1-r_2)\, I_{s,3}^{-}(r_1,r_2) + r_1 r_2 \left[I_{c,2}^{+}(r_1,r_2)+I_{c,2}^{-}(r_1,r_2)\right], \end{aligned}$ (B.14)

and shortening

$\begin{aligned} r_{-,i} \equiv r_{\mathrm{min}}(i_z), \quad r_{-,k} \equiv r_{\mathrm{min}}(k_z), \quad r_{+,i} \equiv r_{\mathrm{max}}(i_z), \quad r_{+,k} \equiv r_{\mathrm{max}}(k_z), \end{aligned}$ (B.15)

we have the long expression

$\begin{aligned} S_{i_z,j_z;k_z,l_z} = &\ \delta_{i_z,j_z} \, \delta_{k_z,l_z} \, \frac{3 \, G\left(z_\mathrm{mean}(i_z)\right)}{r_{+,i}^3 - r_{-,i}^3} \, \frac{3 \, G\left(z_\mathrm{mean}(k_z)\right)}{r_{+,k}^3 - r_{-,k}^3} \times \frac{1}{4\pi^2} \\ &\times \big[ F(r_{+,i},r_{+,k})-F(r_{+,i},r_{-,k})-F(r_{-,i},r_{+,k}) \\ &+F(r_{-,i},r_{-,k}) \big]. \end{aligned}$ (B.16)
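The passage from Eq. (B.11) to Eq. (B.16) rests on product-to-sum trigonometric identities, and the factor 1/(2π²) becoming 1/(4π²) reflects the factor 1/2 they introduce. This can be checked pointwise in k, before multiplying by P(k) and integrating. The sketch below is such a check; the function names and the bin edges are hypothetical.

```python
import numpy as np
from scipy.special import spherical_jn

def bracket(k, r_lo, r_hi):
    """[r_max^2 j1(k r_max) - r_min^2 j1(k r_min)] / k, as in Eq. (B.11)."""
    return (r_hi**2 * spherical_jn(1, k * r_hi)
            - r_lo**2 * spherical_jn(1, k * r_lo)) / k

def f_integrand(k, r1, r2):
    """Integrand of F(r1, r2) in Eq. (B.14), without the common factor P(k)."""
    rp, rm = r1 + r2, r1 - r2
    return ((-np.cos(k * rp) + np.cos(k * rm)) / k**4
            + (-rp * np.sin(k * rp) + rm * np.sin(k * rm)) / k**3
            + r1 * r2 * (np.cos(k * rp) + np.cos(k * rm)) / k**2)

k = np.linspace(0.005, 0.2, 50)
rmi, rpi = 2000.0, 2300.0      # hypothetical edges of bin i_z
rmk, rpk = 2600.0, 2900.0      # hypothetical edges of bin k_z

lhs = k**2 * bracket(k, rmi, rpi) * bracket(k, rmk, rpk)
rhs = 0.5 * (f_integrand(k, rpi, rpk) - f_integrand(k, rpi, rmk)
             - f_integrand(k, rmi, rpk) + f_integrand(k, rmi, rmk))
print(np.max(np.abs(lhs - rhs)) / np.max(np.abs(lhs)))
```

The individual terms of `f_integrand` grow as negative powers of k, so the agreement degrades numerically at very small k; this is the cancellation between large numbers that motivates a lower cutoff.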

Formally, the $I_{c/s,n}^{\pm}$ are IR-divergent integrals since, when k → 0, we have $P(k)\propto k^{n_s}$ with n_s ∼ 1. However, for every $I_{c/s,n}^{+}$ there is an opposite $I_{c/s,n}^{-}$ that carries the same divergence, which is thus cancelled. Hence, when applying a lower cutoff k_min to all integrals, the full expression Eq. (B.16) is convergent when k_min → 0. Numerically, we need to apply this cutoff and not put it too low in order to avoid cases of large cancellations between large numbers where numerical errors could spoil the result. Inspecting Eq. (B.11) and recalling j₁(x)∝x when x → 0, we see that the integrand is $\propto k^2 P(k)\propto k^{2+n_s}$ in the IR, meaning that the integral is quickly converging. A conservative choice is thus to ensure that the start of the integral is at least one decade before the matter-radiation equality, and that the Bessel functions are in the small x regime. Hence, we take k_min = min{k_eq, 1/r_max}/10. Numerically, an upper cutoff k_max also needs to be taken. The integrals are convergent in this limit so this is a less pressing issue. With the same type of argument as for k_min, we can see that Eq. (B.16) is well converged if we take the value k_max = 10 × max{k_eq, 1/r_min}.
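The two cutoff rules quoted above translate directly into code. In this sketch the function name is hypothetical, r_min and r_max are taken to be the smallest and largest comoving distances of the redshift bins considered (an interpretation, since the text does not spell this out), and the value of k_eq is illustrative only.

```python
def k_cutoffs(r_min, r_max, k_eq):
    """Integration bounds k_min = min(k_eq, 1/r_max)/10, k_max = 10*max(k_eq, 1/r_min).

    r_min, r_max and 1/k_eq must share the same length unit.
    """
    k_min = min(k_eq, 1.0 / r_max) / 10.0
    k_max = 10.0 * max(k_eq, 1.0 / r_min)
    return k_min, k_max

# Illustrative values only (not from the article)
k_min, k_max = k_cutoffs(r_min=300.0, r_max=3000.0, k_eq=0.01)
print(k_min, k_max)
```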

Appendix C: Angular power spectrum response

Takada & Hu (2013) showed that the 3D matter power spectrum reacts to a change in background through two separate effects: a term from second-order perturbation theory (2PT) that dominates on large scales, and a term from the one-halo part of the spectrum called halo sample variance that dominates on small scales. For the galaxy power spectrum, it was shown that the reaction also contains terms from second-order galaxy bias and shot-noise, and that the contribution from second-order nonlocal bias vanishes (Lacasa & Rosenfeld 2016; Lacasa 2018),

$\begin{aligned} \frac{\partial P_{\mathrm{gal}}(k | z)}{\partial \delta_b} \equiv &\left(\frac{68}{21} \, b_1^{\mathrm{gal}}(k,z)^2 + 2 \, b_2^{\mathrm{gal}}(k,z) \, b_1^{\mathrm{gal}}(k,z)\right) \, P_{\mathrm{m}}(k|z) \\ &+ \frac{\partial P_\mathrm{1h}(k | z)}{\partial \delta_b} + b_1^{\mathrm{gal}}(k=0,z)/\overline{n}_{\mathrm{gal}}(z), \end{aligned}$ (C.1)

where in the halo model

$\begin{aligned} b_i^{\mathrm{gal}}(k,z) = \int \mathrm{d} M \, \frac{\mathrm{d} n_h}{\mathrm{d} M} \, \frac{\left\langle N_{\mathrm{gal}} \right\rangle}{\overline{n}_{\mathrm{gal}}(z)} \, u(k|M,z) \, b_i(M,z) \end{aligned}$ (C.2)

and

$\begin{aligned} \frac{\partial P_\mathrm{1h}(k | z)}{\partial \delta_b} = \int \mathrm{d} M \, \frac{\mathrm{d} n_h}{\mathrm{d} M} \, \frac{\left\langle N_{\mathrm{gal}}(N_{\mathrm{gal}}-1) \right\rangle}{\overline{n}_{\mathrm{gal}}(z)^2} \, u(k|M,z)^2 \, b_1(M,z) \end{aligned}$ (C.3)

with $\frac{\mathrm{d} n_h}{\mathrm{d} M}$ the halo mass function, b_i(M, z) the i-th order halo bias, u(k|M, z) the halo profile, and N_gal given by the halo occupation distribution.

We call the four terms in Eq. (C.1) respectively 2PT, b2, 1h, and shot. The reaction of the angular power spectrum then follows

$\begin{aligned} \frac{\mathrm{d} C_\ell^{gg}(i_z,j_z)}{\mathrm{d} \delta_b} = \int \mathrm{d} V \, \frac{n_{\mathrm{gal}}^{(i_z)}(z)}{N_{\mathrm{gal}}(i_z)} \, \frac{n_{\mathrm{gal}}^{(j_z)}(z)}{N_{\mathrm{gal}}(j_z)} \, \frac{\mathrm{d} P_{\mathrm{gal}}(k_\ell |z)}{\mathrm{d} \delta_b}, \end{aligned}$ (C.4)

and it defines the (relative) response through

$\begin{aligned} \frac{\mathrm{d} C_\ell^{gg}(i_z,j_z)}{\mathrm{d} \delta_b} = R_\ell^{gg} \, C_\ell^{gg}(i_z,j_z). \end{aligned}$ (C.5)

Figure C.1 shows this response and its different terms. On large scales the response is dominated by the 2PT and b2 terms, but quickly the 1h terms start to dominate. This switch between 2h and 1h terms appears at ℓ ∼ 650, i.e., earlier than the switch in C_ℓ which appears at ℓ ∼ 800. This happens because the nonlinear part of the power spectrum reacts more strongly to background change than the linear part: $\frac{\partial \ln P_{\mathrm{1h}}}{\partial \delta_b} > \frac{\partial \ln P_{\mathrm{2h}}}{\partial \delta_b}$.

Fig. C.1.

Power spectrum response R_ℓ and its different terms. The dashed line indicates the effective value taken in the analysis (see main text).

The total angular response shows some scale dependence over the range of multipoles considered, ranging from R_ℓ ∼ 4.2 on large scales to R_ℓ ∼ 5.5 on small scales. In this article for simplicity we took a constant effective value R_ℓ = 5 (dashed line in Fig. C.1). This has the advantage of allowing analytical calculations in Sect. 3.4, and we find in Sect. 4 that it reproduces adequately the S/N over all scales considered, and also reproduces the Fisher constraints on cosmological parameters, except deep in the SSC-dominated regime for the most affected parameters.

To go beyond this constant R_ℓ approximation, we need the scale dependence of the response. The redshift dependence is also needed if we want to work at redshifts other than the one studied in this article (0.9 < z < 1). To address both needs, we computed the full response numerically through Eqs. (C.1) and (C.5) on scales 50 < ℓ < 2000 over a wide range of redshifts (0.1 < z < 2 in bins of Δz = 0.1). We then fitted the responses in each redshift bin either with a constant model $R_\ell=\overline{R}$ or a linear model R_ℓ = R₀ + R₁ × (ℓ/ℓ₀) with ℓ₀ = 1000. The values of the fitted parameters are given in Table C.1.
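The two fitting models can be sketched with an ordinary least-squares fit. The weighting scheme used in the article is not stated in this section, so unweighted least squares is an assumption here, as are the function name and the synthetic response (whose coefficients are invented and are not the values of Table C.1).

```python
import numpy as np

def fit_response(ells, r_ell, ell0=1000.0):
    """Fit R_ell with a constant R_bar and with R0 + R1 * (ell / ell0).

    Unweighted least squares is assumed; returns (R_bar, R0, R1).
    """
    r_bar = float(np.mean(r_ell))                   # best-fit constant
    r1, r0 = np.polyfit(ells / ell0, r_ell, 1)      # slope first, then intercept
    return r_bar, float(r0), float(r1)

# Synthetic response with invented coefficients, for illustration only
ells = np.arange(51, 2000)
r_true = 4.2 + 1.3 * (ells / 1000.0)
r_bar, r0, r1 = fit_response(ells, r_true)
print(r_bar, r0, r1)
```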

Table C.1.

Fits to the redshift dependence of the response of the galaxy power spectrum.

Another numerical approach that we anticipate is to calibrate the response through dedicated simulations, similarly to the work of Barreira et al. (2018b).

Appendix D: Impact of binning

Indicating bins of multipoles by the b subscript, the binned angular power spectrum, C_b, is defined as

$\begin{aligned} C_b=\sum_{\ell \in b}\frac{S_{b\ell}}{\Delta_b}\times C_\ell, \end{aligned}$ (D.1)

where the summation is over multipoles within the bin b, Δ_b is the width of the bin, and S_bℓ is a reshaping operator usually chosen to flatten the C_ℓ within bins of multipoles¹¹. This reshaping operator is thus obtained assuming that within the bin b, the angular power spectrum is approximately given by $C_\ell\simeq \frac{C_b}{S_{b\ell}}$ with C_b a constant over the bin, and S_bℓ a (usually theoretically) known function of ℓ¹². Finally, the widths of the bins are usually chosen to be greater than the typical length in multipoles of the ℓ-to-ℓ′ coupling induced by the mask.

The covariance of the binned spectrum, C_b, is related to the covariance of the full spectrum, C_ℓ, as

$\begin{aligned} \mathrm{Cov}\left(C_b,C_{b'}\right)=\sum_{\ell \in b}\sum_{\ell' \in b'}\left(\frac{S_{b\ell}}{\Delta_b}\right)\left(\frac{S_{b'\ell'}}{\Delta_{b'}}\right) \, \mathrm{Cov}\left(C_\ell,C_{\ell'}\right). \end{aligned}$ (D.2)

By writing Cov(C_ℓ,C_ℓ′) = 𝒞_G + 𝒞_SSC, the covariance of the binned spectra is then given by the sum of its Gaussian contribution and its super-sample contribution.

Choosing bins that are wider than the typical width of the mask-induced couplings leads to a diagonal Gaussian covariance (𝒞_G)_b, b′ ≃ G_bδ_b, b′. An analytic expression for G_b can be obtained assuming the f_sky approximation, i.e.,

$\begin{aligned} G_b = \sum_{\ell \in b}\left(\frac{S_{b\ell}}{\Delta_b}\right)^2\times \left(\frac{2 C_\ell^2}{(2\ell+1)\,f_{\mathrm{sky}}}\right). \end{aligned}$ (D.3)

Since the reshaping function is chosen such that S_bℓC_ℓ is roughly constant, we can simplify the above to get $G_b=\frac{2C^2_{\ell_b}}{(2\ell_b+1)\,f_{\mathrm{sky}}\,\Delta_b}$ by defining the average multipole in the bin ℓ_b with the identification $\frac{1}{(2\ell_b+1)\Delta_b}\equiv\sum_{\ell\in b}\frac{1}{(2\ell+1)\Delta^2_b}$.

For the SSC, we first recall that for a single probe and a single redshift bin, the covariance of the spectra is given by

$\begin{aligned} \left[\mathcal{C}_{\mathrm{SSC}}\right]_{\ell,\ell'}=S_{i,i} \, V_\ell \, V_{\ell'}, \end{aligned}$ (D.4)

with V_ℓ = R_ℓC_ℓ. It is then straightforward to show that for the binned spectra, we obtain

$\begin{aligned} \left[\mathcal{C}_{\mathrm{SSC}}\right]_{b,b'}=S_{i,i} \, V_b \, V_{b'}, \end{aligned}$ (D.5)

with the binned version of the vector V_ℓ, i.e.,

$\begin{aligned} V_b=\sum_{\ell \in b}\frac{S_{b\ell}}{\Delta_b}\times V_\ell. \end{aligned}$ (D.6)

In the case where the response R_ℓ ≡ R is constant, this simplifies to V_b = RC_b. This shows that for a single probe and a single bin in redshift, adding the SSC to the covariance of binned power spectra still corresponds to a rank 1 update of the Gaussian covariance.
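The rank-1 structure of the binned SSC term can be verified numerically by writing the binning of Eq. (D.1) as a matrix. The sketch below assumes contiguous bins, takes Δ_b as the number of multipoles in the bin, and uses S_bℓ = 1 (plain averaging) by default; the function name, the toy spectrum, and the value of S are hypothetical.

```python
import numpy as np

def binning_matrix(ells, edges, shape=None):
    """Matrix with entries S_{b ell} / Delta_b for ell in bin b, zero elsewhere."""
    if shape is None:
        shape = np.ones(len(ells))          # S_{b ell} = 1: plain average
    n_bins = len(edges) - 1
    mat = np.zeros((n_bins, len(ells)))
    for b in range(n_bins):
        in_bin = (ells >= edges[b]) & (ells < edges[b + 1])
        mat[b, in_bin] = shape[in_bin] / in_bin.sum()   # Delta_b = bin size
    return mat

# Toy inputs, for illustration only
ells = np.arange(50, 1050)
edges = np.arange(50, 1051, 100)            # ten bins of width 100
c_ell = 1.0e-6 * (ells / 100.0) ** -1.2
s_ii, response = 1.0e-4, 5.0

binner = binning_matrix(ells, edges)
v_ell = response * c_ell
cov_ssc = s_ii * np.outer(v_ell, v_ell)     # Eq. (D.4)
cov_ssc_binned = binner @ cov_ssc @ binner.T   # Eq. (D.2) applied to the SSC term
v_b = binner @ v_ell                        # Eq. (D.6)
print(np.allclose(cov_ssc_binned, s_ii * np.outer(v_b, v_b), rtol=1e-10, atol=0.0))
```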

The above is easily generalized to the other cases where it is enlarged to multi-probes and more than one redshift bin since it is exactly the same binning in multipoles which has to be used for the entire set of multi-probe and multi-redshift auto- and cross-spectra. The data vector is now built from the multi-probe binned angular power spectra, and the vector V_b in the SSC is obtained by binning the vector V_ℓ. Only the Gaussian covariance is slightly amended, being partitioned into n_b × n_b non-diagonal blocks of size n_c × n_c (n_b is the number of bins and n_c the number of auto- and cross-spectra). Using the f_sky approximation, it becomes block diagonal 𝒞_G = G_bδ_b, b′. The blocks of size n_c × n_c read

$\begin{aligned} \left[\boldsymbol{G}_b\right]^{W,X;YZ}&\equiv \mathrm{Cov}_{\mathrm{G}}\left(C^{WX}_b,C^{YZ}_b\right) \\ &= \sum_{\ell\in b}\left(\frac{S_{b\ell}}{\Delta_b}\right)^2\times\left(\frac{C^{WY}_\ell C^{XZ}_\ell+C^{WZ}_\ell C^{XY}_\ell}{(2\ell+1)\,f_{\mathrm{sky}}}\right), \end{aligned}$ (D.7)

where (W, X, Y, Z) run over probes.
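A minimal sketch of Eq. (D.7) for one multipole bin is given below. It assumes that the spectra are supplied as an array `c_ell` of shape (Δ_b, n_p, n_p), symmetric in its last two indices, that Δ_b is the number of multipoles in the bin, and that the n_c spectra are ordered as the unordered pairs (W, X) with W ≤ X. The function name `gaussian_block` and this ordering are illustrative choices, not the conventions of the released module.

```python
import numpy as np
from itertools import combinations_with_replacement

def gaussian_block(ells, c_ell, s_weights, f_sky):
    """Gaussian covariance block G_b of Eq. (D.7) for one multipole bin.
    ells: multipoles in the bin; c_ell[l, W, X]: spectra C_ell^{WX};
    s_weights: reshaping function S_{b ell} on the same multipoles."""
    ells = np.asarray(ells, dtype=float)
    c_ell = np.asarray(c_ell, dtype=float)
    s_weights = np.asarray(s_weights, dtype=float)
    n_probes = c_ell.shape[1]
    # Assumed ordering of the n_c auto- and cross-spectra: pairs with W <= X.
    pairs = list(combinations_with_replacement(range(n_probes), 2))
    # (S_{b ell} / Delta_b)^2 / ((2 ell + 1) f_sky), with Delta_b = ells.size.
    prefac = (s_weights / ells.size) ** 2 / ((2.0 * ells + 1.0) * f_sky)
    block = np.empty((len(pairs), len(pairs)))
    for a, (w, x) in enumerate(pairs):
        for c, (y, z) in enumerate(pairs):
            wick = (c_ell[:, w, y] * c_ell[:, x, z]
                    + c_ell[:, w, z] * c_ell[:, x, y])
            block[a, c] = np.sum(prefac * wick)
    return block
```

For a single probe the block reduces to the scalar Σ_ℓ (S_bℓ/Δ_b)² 2C_ℓ²/((2ℓ+1) f_sky), which is the single-probe binned Gaussian variance.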

Appendix E: Likelihood of cluster counts

The purpose of this section is to recall the form of the full likelihood of cluster counts, a result that seems to be overlooked in the literature. Further, we extend the likelihood with the formulation developed in Sect. 5.2, which ensures that it is analytically well defined.

We call N = (N_{i_M, i_z})_{i_M, i_z} the vector of cluster counts in all bins of mass (indexed by i_M) and redshift (indexed by i_z). Then, without super-sample covariance (simply called sample variance in the cluster literature), the likelihood is a product of independent Poisson distributions in each bin of mass and redshift,

$\begin{aligned} P(N|\boldsymbol{p}) = \prod_{i_M,i_z}\mathrm{Poiss}\left(N_{i_M,i_z}\,|\,\overline{N}_{i_M,i_z}(\boldsymbol{p})\right), \end{aligned}$ (E.1)

where $\overline{N}_{i_M,i_z}(\boldsymbol{p})$ is the model prediction for parameters p.
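In practice, Eq. (E.1) is evaluated through its logarithm. The sketch below is one possible implementation, assuming that the observed counts and the model predictions are given as arrays of the same shape over the (mass, redshift) bins; the function name `poisson_loglike` is hypothetical.

```python
import math
import numpy as np

def poisson_loglike(counts, mean_counts):
    """ln P(N|p) for independent Poisson counts, Eq. (E.1):
    sum over bins of N ln(Nbar) - Nbar - ln(N!)."""
    counts = np.asarray(counts)
    mean_counts = np.asarray(mean_counts, dtype=float)
    # ln(N!) computed as lgamma(N + 1) to avoid overflow for large counts.
    log_fact = np.array([math.lgamma(n + 1.0) for n in counts.ravel()])
    log_fact = log_fact.reshape(counts.shape)
    return float(np.sum(counts * np.log(mean_counts) - mean_counts - log_fact))
```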

The above likelihood is sufficient to describe small counts, i.e., at high mass. However, for current and future surveys detecting an increasing number of clusters, it becomes necessary to account for the effect of sample variance (Hu & Kravtsov 2003). Lima & Hu (2004) found the full likelihood for cluster counts in different cells, which can be straightforwardly applied to our case with only one cell (the survey),

$\begin{aligned} P(N|\boldsymbol{p}) = \int \mathrm{d}^n\tilde{N} \ \left(\prod_{i_M,i_z}\mathrm{Poiss}\left(N_{i_M,i_z}\,|\,\tilde{N}_{i_M,i_z}\right)\right) \ \mathrm{Gauss}\left(\tilde{N}-\overline{N},S'\right), \end{aligned}$ (E.2)

where S′ is similar to the S matrix defined in Eq. (5) in the case of cluster counts, but also including the counts response, and was defined originally for a 3D survey neglecting redshift evolution (Lima & Hu 2004):

$\begin{aligned} S'_{i_M,i_z;j_M,j_z} = b_{i_M,i_z}\,N_{i_M,i_z}\ b_{j_M,j_z}\,N_{j_M,j_z}\times S_{i_z;j_z}. \end{aligned}$ (E.3)

The matrix S_{i_z; j_z} reads

$\begin{aligned} S_{i_z;j_z} = \int \frac{\mathrm{d}^3\boldsymbol{k}}{(2\pi)^3} \ \tilde{W}_{i_z}^*(\boldsymbol{k})\,\tilde{W}_{j_z}(\boldsymbol{k}) \ P(k), \end{aligned}$ (E.4)

where $\tilde{W}_{i_z}$ is the normalized ($\int \mathrm{d}^3\boldsymbol{x}\ \tilde{W}_{i_z}(\boldsymbol{x})=1$) window function in redshift bin i_z.

In the framework developed in Sect. 5.2, $\tilde{N}$ is interpreted as the average number counts in a region of the universe having a background change δ_b. Noting that the response of the cluster counts is $\frac{\partial N_{i_M,i_z}}{\partial\delta_b} = b_{i_M,i_z}\,N_{i_M,i_z}$, we can rewrite the likelihood as

$\begin{aligned} P(N|\boldsymbol{p}) &= \int \mathrm{d}^{n}\delta N \ \left(\prod_{i_M,i_z}\mathrm{Poiss}\left(N_{i_M,i_z}\,|\,\overline{N}_{i_M,i_z}+\delta N_{i_M,i_z}\right)\right) \\ &\quad\times \mathrm{Gauss}\left(\delta N,\frac{\partial N}{\partial\delta_b}^T S\,\frac{\partial N}{\partial\delta_b}\right) \end{aligned}$ (E.5)

$\begin{aligned} &= \int \mathrm{d}^{n_z}\delta_b \ \left(\prod_{i_M,i_z}\mathrm{Poiss}\left(N_{i_M,i_z}\,|\,\overline{N}_{i_M,i_z}+\frac{\partial N_{i_M,i_z}}{\partial\delta_b}\,\delta_b(i_z)\right)\right) \\ &\quad\times \mathrm{Gauss}\left(\delta_b,S\right). \end{aligned}$ (E.6)

Rigorously, we might be concerned about the bounds of the integral in this likelihood: it does not make physical sense for δ_b to go to −∞, as it corresponds to δN → −∞, i.e., $\tilde{N}$ becoming negative, which is impossible for a number of objects. In practice, this is unlikely to be a concern since for any reasonably sized survey the background change satisfies |δ_b| ≪ 1 at all redshifts, hence $\tilde{N} > 0$.
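For a single redshift bin, the integral of Eq. (E.6) is one-dimensional and can be evaluated numerically. The following sketch uses Gauss-Hermite quadrature, which is a choice made here for illustration and not necessarily the method of the article. It assumes a scalar variance `s_var` for δ_b and a response array equal to b N̄ in each mass bin, and it simply discards quadrature nodes where a Poisson mean would become non-positive, an ad hoc treatment of the integration bounds discussed above. The function names are hypothetical.

```python
import math
import numpy as np

def poisson_product(counts, rates):
    """Product over mass bins of Poisson probabilities, computed in log space."""
    log_p = sum(n * math.log(r) - r - math.lgamma(n + 1.0)
                for n, r in zip(counts, rates))
    return math.exp(log_p)

def ssc_marginalized_likelihood(counts, mean_counts, response, s_var, n_nodes=40):
    """P(N|p) of Eq. (E.6) for one redshift bin:
    integral over delta_b of prod_M Poiss(N_M | Nbar_M + response_M * delta_b)
    times a Gaussian pdf of variance s_var.
    Assumption: nodes with a non-positive Poisson mean are dropped."""
    mean_counts = np.asarray(mean_counts, dtype=float)
    response = np.asarray(response, dtype=float)
    # Nodes and weights for integrals of the form int f(x) exp(-x^2) dx.
    nodes, weights = np.polynomial.hermite.hermgauss(n_nodes)
    total = 0.0
    for x, w in zip(nodes, weights):
        delta_b = math.sqrt(2.0 * s_var) * x   # change of variable to N(0, s_var)
        rates = mean_counts + response * delta_b
        if np.any(rates <= 0.0):
            continue                            # unphysical region, skipped
        total += w * poisson_product(counts, rates)
    return total / math.sqrt(math.pi)
```

For s_var = 0 the result reduces to the Poisson likelihood of Eq. (E.1), and for a single mass bin the variance of the counts becomes N̄ + (∂N/∂δ_b)² S, i.e., the Poisson term plus the super-sample term.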

For the sake of rigor, let us nonetheless address this physical concern. When δ_b becomes of order 1, two approximations fail: (i) the Gaussianity of the pdf of δ_b, which cannot hold exactly since, for instance, δ_b ≥ −1, and (ii) the linear response ansatz $\tilde{N}=\overline{N}+\frac{\partial N}{\partial\delta_b}\,\delta_b$. Both failures can be cured formally with

$\begin{aligned} P(N|\boldsymbol{p}) = \int \mathrm{d}^{n_z}\delta_b \ \left(\prod_{i_M,i_z}\mathrm{Poiss}\left(N_{i_M,i_z}\,|\,\tilde{N}_{i_M,i_z}(\boldsymbol{p},\delta_b)\right)\right)\times P\left(\delta_b|\boldsymbol{p}\right), \end{aligned}$ (E.7)

where P(δ_b|p) is the pdf of the background change (which has support on δ_b ∈ [−1, ∞[) and $\tilde{N}_{i_M,i_z}(\boldsymbol{p},\delta_b)$ is the average cluster count in a region with background change δ_b of a universe with cosmological parameters p. For instance, in the separate universe approach this $\tilde{N}_{i_M,i_z}(\boldsymbol{p},\delta_b)$ could be computed thanks to a change of cosmological parameters p′(p, δ_b).

All Tables

Table C.1. Fits to the redshift dependence of the response of the galaxy power spectrum.

All Figures

Fig. 1. Comparison of the cumulative S/N values up to a multipole ℓ_max with different covariances: Gaussian only, with the S_{i,j} approximation, and with a full SSC computation.

Fig. 2. cos²θ for each cosmological parameter as a function of the maximum multipole ℓ_max. Parameters with cos²θ close to 1 are the most affected by SSC when it starts to dominate the covariance (i.e., for multipoles ℓ ≳ ℓ_SSC with ℓ_SSC ∼ 360 in this specific case).

Fig. 3. Comparison of $\sqrt{F_{\alpha\alpha}}$ for an analysis up to a multipole ℓ_max with different covariances: Gaussian only (blue), S_{i,j} approximation with a constant response (green), S_{i,j} approximation with a scale-dependent response (cyan), full SSC computation (red).

Fig. C.1. Power spectrum response R_ℓ and its different terms. The dashed line indicates the effective value taken in the analysis (see main text).

[1] Aguena, M., & Lima, M. 2018, Phys. Rev. D, 98, 123529

[2] Amendola, L., Appleby, S., Bacon, D., et al. 2013, Liv. Rev. Rel., 16, 6

[3] Barreira, A., Krause, E., & Schmidt, F. 2018a, J. Cosmol. Astropart. Phys., 10, 053

[4] Barreira, A., Krause, E., & Schmidt, F. 2018b, J. Cosmol. Astropart. Phys., 6, 015

[5] Bartlett, M. S. 1951, Ann. Math. Statist., 22, 107

[6] Chan, K. C., & Blot, L. 2017, Phys. Rev. D, 96, 023528

[7] Chan, K. C., Moradinezhad Dizgah, A., & Noreña, J. 2018, Phys. Rev. D, 97, 043532

[8] Dark Energy Survey Collaboration (Abbott, T. M. C., et al.) 2018, Phys. Rev. D, 98, 043526

[9] Hamimeche, S., & Lewis, A. 2008, Phys. Rev. D, 77, 103013

[10] Hildebrandt, H., Viola, M., Heymans, C., et al. 2017, MNRAS, 465, 1454

[11] Hu, W., & Kravtsov, A. V. 2003, ApJ, 584, 702

[12] Kilbinger, M., Heymans, C., Asgari, M., et al. 2017, MNRAS, 472, 2126

[13] Lacasa, F. 2018, A&A, 615, A1

[14] Lacasa, F., & Kunz, M. 2017, A&A, 604, A104

[15] Lacasa, F., & Rosenfeld, R. 2016, J. Cosmol. Astropart. Phys., 8, 005

[16] Lacasa, F., Lima, M., & Aguena, M. 2018, A&A, 611, A83

[17] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193]

[18] Lewis, A., & Challinor, A. 2006, Phys. Rep., 429, 1

[19] Li, Y., Schmittfull, M., & Seljak, U. 2018, J. Cosmol. Astropart. Phys., 2, 022

[20] Lima, M., & Hu, W. 2004, Phys. Rev. D, 70, 043504

[21] LSST Science Collaborations (Abell, P. A., et al.) 2009, ArXiv e-prints [arXiv:0912.0201]

[22] Planck Collaboration XVI. 2014, A&A, 571, A16

[23] Planck Collaboration XXI. 2016, A&A, 594, A21

[24] Planck Collaboration VI. 2018, ArXiv e-prints [arXiv:1807.06209]

[25] Sellentin, E., Heymans, C., & Harnois-Déraps, J. 2018, MNRAS, 477, 4879

[26] Sherman, J., & Morrison, W. J. 1950, Ann. Math. Stat., 21, 124

[27] Takada, M., & Hu, W. 2013, Phys. Rev. D, 87, 123504

[28] Takada, M., & Spergel, D. N. 2014, MNRAS, 441, 2456

[29] Takahashi, R., Soma, S., Takada, M., & Kayo, I. 2014, MNRAS, 444, 3473

[30] Wagner, C., Schmidt, F., Chiang, C.-T., & Komatsu, E. 2015, MNRAS, 448, L11
