Covariance matrices for halo number counts and correlation functions

P. Valageas; N. Clerc; F. Pacaud; M. Pierre

doi:10.1051/0004-6361/201117117

Free Access

Issue		A&A Volume 536, December 2011


Article Number		A95
Number of page(s)		36
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/201117117
Published online		16 December 2011

A&A 536, A95 (2011)

Covariance matrices for halo number counts and correlation functions^⋆

P. Valageas¹, N. Clerc², F. Pacaud³ and M. Pierre²

¹ Institut de Physique Théorique, CEA Saclay, 91191 Gif-sur-Yvette, France
e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
² Laboratoire AIM, CEA/DSM/IRFU/Sap, CEA Saclay, 91191 Gif-sur-Yvette, France
³ Argelander-Institut für Astronomie, University of Bonn, Auf dem Hügel 71, 53121 Bonn, Germany

Received: 20 April 2011
Accepted: 20 September 2011

Abstract

Aims. We study the mean number counts and two-point correlation functions, along with their covariance matrices, of cosmological surveys such as for clusters. In particular, we consider correlation functions averaged over finite redshift intervals, which are well suited to cluster surveys or populations of rare objects, where one needs to integrate over nonzero redshift bins to accumulate enough statistics.

Methods. We develop an analytical formalism to obtain explicit expressions of all contributions to these means and covariance matrices, taking into account both shot-noise and sample-variance effects. We compute low-order as well as high-order (including non-Gaussian) terms.

Results. We derive expressions for the number counts per redshift bins both for the general case and for the small window approximation. We estimate the range of validity of Limber’s approximation and the amount of correlation between different redshift bins. We also obtain explicit expressions for the integrated 3D correlation function and the 2D angular correlation. We compare the relative importance of shot-noise and sample-variance contributions, and of low-order and high-order terms. We check the validity of our analytical results through a comparison with the Horizon full-sky numerical simulations, and we obtain forecasts for several future cluster surveys.

Key words: cosmology: observations / large-scale structure of Universe / galaxies: clusters: general

^⋆

Appendices are only available in electronic form at http://www.aanda.org

© ESO, 2011

1. Introduction

The large-scale structure of the Universe is a key test of modern cosmological scenarios. Indeed, according to the standard cosmological model, the large-scale structures of the present Universe have formed through the amplification by gravitational instability of small almost-Gaussian primordial fluctuations (Peebles 1980). Then, from observations of the recent Universe, such as galaxy surveys (Cole et al. 2005; Tegmark et al. 2006), cluster surveys (Evrard 1989; Oukbir & Blanchard 1992; Pacaud et al. 2007), weak-lensing studies (Massey et al. 2007; Munshi et al. 2008), or measures of baryon acoustic oscillations (Eisenstein et al. 1998, 2005), one can derive constraints on the cosmological parameters (e.g., the mean matter and dark energy contents) and on the properties of the initial perturbations (e.g., possible deviations from Gaussianity). Moreover, one can check whether these structures have really formed through this gravitational instability process.

In the case of well-defined astrophysical objects, such as galaxies or X-ray clusters, which form a discrete population, standard probes are the abundance of these objects, that is, “number counts”, and their low-order correlation functions. Galaxies are governed by complex gas physics and star formation processes, which makes it difficult to relate their abundance as a function of optical luminosity to theoretical predictions. However, since they are rather common objects (one can reach almost 10⁶ galaxies in current surveys, e.g. Abazajian et al. 2009) it is possible to reconstruct halo density fields using subsamples of luminous red galaxies and to compare their power spectrum (i.e. the Fourier transform of their two-point correlation) with theory to derive constraints on cosmology (Reid et al. 2010). In contrast, as the largest nonlinear objects in the present Universe, galaxy clusters are much rarer (current cluster samples have a density of about ten per deg² at most, e.g. Adami et al. 2011), but their relationship with dark matter halo mass is controlled better. This means that their abundance is very sensitive to cosmological parameters (especially Ω_m and σ₈, Evrard 1989; Oukbir & Blanchard 1992), and has already been used to derive constraints on cosmology, but their clustering has not yet provided much cosmological information (because of low statistics). However, upcoming cluster surveys should allow the use of both number counts and spatial clustering to derive cosmological constraints (Pierre et al. 2011).

To compare observations with theory, one needs either to relate the observed properties of the objects (e.g., optical galaxy luminosity, X-ray cluster luminosity or temperature) to the quantities that are predicted by theoretical models (e.g., virialized halo mass) or to use semi-analytical models that attempt to build mock catalogs (Harker et al. 2007). For clusters, one can use scaling laws between luminosity, temperature, and mass, calibrated on observations of the local universe (Arnaud et al. 2005, 2010). Then, one must take the selection function of the survey being considered into account, since the probability of detecting the objects is usually more complex than a sharp cutoff on mass or luminosity (Pierre et al. 2011). Finally, one needs to estimate the error bars of the statistical quantities that are measured, in order to derive meaningful constraints on cosmology. In addition to the uncertainties associated with the relationship between observed quantities (e.g., luminosity) and theoretical quantities (e.g., halo mass) discussed above, two unavoidable sources of uncertainty are the “shot-noise” effects of the discrete character of the population and the “sample variance” due to the limited size of the survey. In the case of a full-sky survey, the latter is also known as the “cosmic variance”, due to the fact that we only observe “one sky” so that there is only a limited number of low-k modes to be measured.

In practice (Benoist et al. 1996; Maller et al. 2005; Norberg et al. 2009), one often estimates error bars from the data itself by subsampling the data and by computing the scatter between the means measured within each subsample (e.g., jackknife resampling). However, if one studies rare objects (e.g., clusters) or large scales, it is not possible to obtain reliable estimates from such subsamplings (because of low-quality statistics). Moreover, one often wishes to estimate the signal-to-noise ratio of future surveys, even before they have been approved by research agencies, in order to evaluate their scientific possibilities and to compare the efficiency of different probes. Then, one must use numerical simulations (Pierre et al. 2011; Croton et al. 2004; Kazin et al. 2010) or analytic methods. The former have the advantage of greater power (in the sense that one may explicitly introduce complex recipes for the formation of the objects, such as cooling processes and feedback, or intricate survey geometry), but are limited by finite resolution on large scales and for rare objects. Analytical approaches allow one to describe a wider range of scales and halo masses, and usually provide faster computations. Hence they remain a useful complementary method, which we investigate in this paper.

Thus, in this paper, we present a general analytical formalism for computing the shot-noise and sample-variance error bars of estimators of number counts and real-space two-point correlations for deep surveys that cover a significant range of redshifts. We consider both the 3D correlation function and the 2D angular correlation function on the sky. As explained above, this study is motivated by the need for such covariance matrices to compare any survey with theory. This extends over some previous works, which only considered the sample variance of number counts (Hu & Kravtsov 2003) or neglected high-order or non-Gaussian terms in the sample variance of estimators for two-point correlations or power spectra (Feldman et al. 1994; Majumdar & Mohr 2004; Eisenstein et al. 2005; Cohn 2006; Crocce et al. 2011). Indeed, while the sample variance of number counts (i.e., the mean number of objects per unit volume) only involves the two-point correlation of the objects, the sample variance of estimators of the two-point correlation itself also involves the three- and four-point correlations (and so on for estimators of higher order correlation functions) (Bernstein 1994).

Some previous studies have already included the contributions of such higher order correlations to covariance matrices (Szapudi & Colombi 1996; Meiksin & White 1999; Scoccimarro et al. 1999; Eisenstein & Zaldarriaga 2001; Smith 2009), mostly in the context of galaxy surveys. However, we extend these works by comparing the various contributions to expected error bars in detail, including all shot-noise and sample-variance terms, as well as high-order contributions, and by studying real-space two-point correlation functions instead of Fourier-space power spectra. Moreover, since we have the application to cluster surveys in mind, and more generally to deep surveys of rare objects, we consider quantities that are defined by integration over finite redshift bins. For instance, number counts may be associated with bins Δz = 0.1 while the two-point correlation functions are integrated over a significant redshift interval, such as 0 < z < 1, to accumulate enough statistics. Then, the statistical quantities that we consider involve integrations along the line-of-sight, rather than local power spectra or two-point correlations in a small box at a given redshift. Moreover, for number counts we consider arbitrary angular scales, from small angles, where the Limber approximation applies, to full-sky surveys. In these various respects, our study fills a gap in published works.

In view of the application to cluster surveys, we consider a population of objects defined by their mass M, and focusing on the case of dark matter halos, we use the halo mass function and bias measured in previous numerical studies for our numerical computations. To estimate the three- and four-point correlation functions (needed for the covariance of the two-point estimator), we use a simple hierarchical model (Peebles 1980; Bernstein 1994), which writes these higher order correlations as products of the two-point correlation. We give the explicit expressions of our results, which we also compare with numerical simulations, and we provide several realistic illustrations. In particular, we take advantage of our analytical formalism to compare the various contributions to the error bars and derive approximate scalings. Although we eventually apply our results to several future cluster surveys, our method is more general and could be applied to other objects (e.g., galaxies or quasars), defined by other quantities (e.g., luminosity), provided one has a model for their multiplicity function and their two-point correlation, and the three- and four-point correlation functions can be described reasonably well by a hierarchical model in the regime where they are relevant.

This paper is organized as follows. In Sect. 2 we first briefly describe the analytic models that we use to estimate the means and covariance matrices of numbers counts and halo correlations, as well as the numerical simulations that we use to check the accuracy of our results. Then, we study the halo number counts per redshift bins in Sect. 3. This allows us to introduce on a simple example our approach to evaluate the mean and the covariance matrix of various estimators. We consider the cases of both small angular windows and arbitrary angular windows (including full-sky surveys), and we estimate the accuracy of small-angle approximations and the decay in correlations between distant redshift bins. Next, we study the real-space 3D halo correlation function in Sect. 4. We consider both the Peebles & Hauser and the Landy & Szalay estimators and compare their covariance matrices. We also discuss the relative importance of different contributions to these covariance matrices (shot noise/sample variance, low-order/high-order terms). Then, we investigate the halo angular correlation in Sect. 5, using the same approach. Finally, we apply our formalism to several real survey cases in Sect. 6 and we conclude in Sect. 7.

We give details of our calculations in several appendices. We discuss shot-noise terms in Appendix A, in the simple case of number counts, and finite-size effects in Appendix B. Then, we describe our computation of the mean and covariance of the estimators of the 3D halo correlation in Appendices C and D, for the Peebles & Hauser estimator, and in Appendix E for the Landy & Szalay estimator. We give further details on high-order contributions to the covariance matrices in Appendix F, for the 3D correlation, and Appendix H, for the angular correlation.

2. Halo density fields

Before we describe our analysis of the covariance matrices for halo number counts and correlation functions, we present in this section the analytic models that we use for the underlying halo distributions (mass and bias function, etc.) and the numerical simulations that we use to validate our results.

2.1. Analytic models

2.1.1. Halo mass function and correlation

To be consistent with the numerical simulations, in Sects. 3.1 to 5, where we develop our formalism and compare our results with simulations, we use the WMAP3 cosmology (Spergel et al. 2007), that is, Ω_m = 0.24, Ω_de = 0.76, Ω_b = 0.042, h = 0.73, σ₈ = 0.77, n_s = 0.958, and w_de = −1. In Sect. 6, where we apply our formalism to obtain forecasts for current and future surveys, we use the more recent WMAP7 cosmology (Komatsu et al. 2011), that is, Ω_m = 0.274, Ω_de = 0.726, Ω_b = 0.046, h = 0.702, σ₈ = 0.816, n_s = 0.968, and w_de = −1.

In this paper, keeping in mind the study of X-ray clusters, we consider the number counts and correlations of dark matter halos defined by the nonlinear density contrast δ = 200. These halos are fully characterized by their mass, and we do not investigate the relationship between this mass and cluster properties such as the gas temperature and X-ray luminosity. These scaling laws can be added to our formalism to derive the cluster number counts and correlations, depending on the quantities that are actually measured, but we keep a more general setting in this paper.

We use the halo mass function, dn/dlnM, of Tinker et al. (2008), and the halo bias of Tinker et al. (2010). Thus, the two-point correlation function $ξ_{i,j}^{h}$ $Mathematical equation: \hbox{$\xih_{i,j}$}$ between two halos labeled “i” and “j” can be factored¹ in as $ξ_{i,j}^{h} = b_{i} b_{j} ξ (| x i - x j |; z),$ $Mathematical equation: \begin{equation} \xih_{i,j} = b_i b_j \, \xi(|\vx_i-\vx_j|;z) , \label{xij-bb} \end{equation}$ (1)where ξ is the matter density correlation, and the bias factors b_i and b_j do not depend on scale, $b_{i} = b (M_{i}, z_{i}) .$ $Mathematical equation: \begin{equation} b_i = b(M_i,z_i) . \label{b-def} \end{equation}$ (2)This approximation of scale-independent halo bias is valid to better than 10% on scales 20 < r < 130 h^-1 Mpc (Manera & Gaztanaga 2011), with a small feature on the baryon acoustic scale (r ~ 100 h^-1Mpc) of amplitude of 5% (Desjacques et al. 2010).

Throughout most of this paper we assume that correlations are negligible over cosmological distances (of order c/H₀), so that the redshift z on the right-hand side of Eq. (1) can be taken at will as z_i or z_j (or the mean (z_i + z_j)/2). In Sect. 3.2.2, where we consider the case of large angular windows (and go beyond the flat sky and Limber’s approximations), we do not use this approximation but replace the matter density correlation by its linear approximation. This yields the alternative factorization $ξ_{i,j}^{h} \propto b (M_{i}, z_{i}) b (M_{j}, z_{j}) D_{+} (z_{i}) D_{+} (z_{j}) ξ_{L 0} (x_{i}, x_{j})$ $Mathematical equation: \hbox{$\xih_{i,j} \propto b(M_i,z_i) b(M_j,z_j) D_+(z_i) D_+(z_j) \xi_{L0}(\vx_i,\vx_j)$}$ that allows one to handle arbitrary redshifts z_i and z_j.

For the nonlinear matter correlation ξ(x;z), and the Fourier-space nonlinear power spectrum P(k). defined by $ξ (x; z) = \int d k e^{i k \cdot x} P (k; z),$ $Mathematical equation: \begin{equation} \xi(x;z) = \int \dd\vk \, {\rm e}^{\ii\vk\cdot\vx} \, P(k;z) , \label{xi-Pk} \end{equation}$ (3)we use the popular fitting formula to numerical simulations of Smith et al. (2003).

Throughout this paper, all angular number densities are in units of deg^-2.

2.1.2. Three-point and four-point halo correlations

The covariance matrices of the estimators $Mathematical equation: \hbox{$\hxi$}$ for the halo two-point correlation ξ^h also involve the halo three-point and four-point correlation functions, ζ^h and η^h, so we must define a model for these quantities. On large scales, for Gaussian initial conditions, the three- and four-point correlation functions of the matter density field behave as ζ ~ ξ² and η ~ ξ³ at lowest order over ξ (Bernardeau et al. 2002; Goroff et al. 1986). On small scales, these scaling laws remain a reasonable approximation (Colombi et al. 1996), but with numerical prefactors that are different from the large-scale ones (and may slightly vary with scale). On the other hand, for rare massive clusters, using the standard approach of Kaiser (1984) where virialized objects are identified with overdense regions in the linear density field, Politzer & Wise (1984) obtain $Mathematical equation: \hbox{$1+\xih(\vx_1,..,\vx_N) = \prod_{i>j} [1+\xih(\vx_i,\vx_j)]$}$ . Since our goal is only to estimate the magnitude of high-order contributions we consider in this article a simple “hierarchical clustering ansatz”, where the N − point correlation function can be expressed in terms of products of (N − 1) two-point correlation functions through tree diagrams (Groth & Peebles 1977; Peebles 1980). This is the simplest model² that is in qualitative agreement with large-scale theoretical predictions and small-scale numerical results, as well as with observations.

Through a comparison with numerical simulations, we check that the accuracy of this model is sufficient for our purpose, which is to estimate signal-to-noise ratios and compare different survey strategies (while we would require a higher accuracy for the computation of the means themselves, that is, the number counts and two-point correlations that we wish to measure). The advantage of this simple model is that it describes all scales, through the scalings recalled above, and does not require additional free parameters.

Thus, as in Bernstein (1994), Szapudi & Colombi (1996), Meiksin & White (1999), we write the three-point halo correlation function as $ζ_{1, 2, 3}^{h} = b_{1} b_{2} b_{3} \frac{S_{3}}{3} [ξ_{1, 2} ξ_{1, 3} + ξ_{2, 1} ξ_{2, 3} + ξ_{3, 1} ξ_{3, 2}],$ $Mathematical equation: \begin{equation} \zetah_{1,2,3} = b_1 b_2 b_3 \; \frac{S_3}{3} \; \left[ \xi_{1,2} \xi_{1,3} + \xi_{2,1} \xi_{2,3} + \xi_{3,1} \xi_{3,2} \right], \label{zeta-def} \end{equation}$ (4)where we sum over all three possible configurations over the three halos labeled “1”, “2”, and “3”. This corresponds to the three tree-diagrams shown in Fig. 1. We use a linear bias model³, as in Eq. (1), and for the matter density normalization factor S₃, we take its large-scale limit, which is obtained by perturbation theory (Peebles 1980; Fry 1984; Bernardeau et al. 2002), $S_{3} = \frac{34}{7} - (n + 3),$ $Mathematical equation: \begin{equation} S_3 = \frac{34}{7} - (n+3) , \label{S3-def} \end{equation}$ (5)where n is the slope of the linear power spectrum at the scale of interest. Within the same “hierarchical clustering ansatz”, the four-point correlation function is expressed in terms of products of three two-point functions, as shown in Fig. 2. We have two possible topologies, and sixteen different diagrams for four distinct halos. For simplicity, in this work we give the same weight to all sixteen diagrams, independently of their topology, as $\begin{matrix} η_{1, 2, 3, 4}^{h} & = & b_{1} b_{2} b_{3} b_{4} \frac{S_{4}}{16} [ξ_{1, 2} ξ_{1, 3} ξ_{1, 4} + 3 cyc . \end{matrix}$ $Mathematical equation: \begin{eqnarray} \etah_{1,2,3,4} & = & b_1 b_2 b_3 b_4 \; \frac{S_4}{16} \; \left[ \xi_{1,2} \xi_{1,3} \xi_{1,4} + 3 \, {\rm cyc.} \right. \nonumber \\ && \left. + \xi_{1,2} \xi_{2,3} \xi_{3,4} + 11 \, {\rm cyc.} \right] , \label{eta-def} \end{eqnarray}$ (6)where “3 cyc.” and “11 cyc.” stand for three and eleven terms that are obtained from the previous one by permutations over the labels “1, 2, 3, 4” of the four halos. Again we take for S₄ its large-scale limit, $S_{4} = \frac{60712}{1323} - \frac{62}{3} (n + 3) + \frac{7}{3} (n + 3)^{2} .$ $Mathematical equation: \begin{equation} S_4 = \frac{60712}{1323} - \frac{62}{3} (n+3) + \frac{7}{3} (n+3)^2 . \label{S4-def} \end{equation}$ (7)This is the simplest possible model, where the angular dependence only comes from the decomposition over the terms in brackets in Eqs. (4) and (6). More complex models and exact computations at lowest order of perturbation theory would introduce homogeneous kernels Q_N(x₁,..,x_N) (in place of the numbers S_N) that also depend on the angles between the vectors x_i and x_j (Scoccimarro et al. 1999). Observations of galaxy clustering show, for instance, that Q₃ displays a weak dependence on the triangle shape, while remaining close to unity (Gaztanaga et al. 2005; Kulkarni et al. 2007). Here we simply take the constant value Q₃ = S₃/3.

Fig. 1

The “hierarchical clustering ansatz” for the three-point correlation function $ζ_{1, 2, 3}^{h}$ $Mathematical equation: \hbox{$\zetah_{1,2,3}$}$ of Eq. (4). Each solid line corresponds to a two-point correlation ξ, and ζ^h is written as the sum of these three diagrams, with a multiplicative factor b₁b₂b₃S₃/3.

Fig. 2

The two topologies of the four-point diagrams associated with the “hierarchical clustering ansatz” for the four-point correlation, as in Eq. (6). The numbers are the multiplicity factors of each diagram.

In terms of the halo two-point correlation, Eqs. (4) and (6) imply halo coefficients $S_{N}^{h}$ $Mathematical equation: \hbox{$\Sh_N$}$ that behave as $S_{N}^{h} ~ b^{2 - N} S_{N}$ $Mathematical equation: \hbox{$\Sh_N \sim b^{2-N} S_N$}$ . For b ~ 1 and n ≃ − 1 and from Eqs. (5) and (7), this gives the values $S_{3}^{h} ~ S_{3} ≃ 2.9$ $Mathematical equation: \hbox{$\Sh_3 \sim S_3 \simeq 2.9$}$ and $S_{4}^{h} ~ S_{4} ≃ 13.9$ $Mathematical equation: \hbox{$\Sh_4 \sim S_4 \simeq 13.9$}$ , which roughly agree with observations of galaxy clustering (Szapudi et al. 2001; Croton et al. 2004; Ross et al. 2006; Marin et al. 2007).

2.1.3. Flat-sky and Limber’s approximations

In this paper we often encounter quantities, such as the mean matter density correlation over a redshift bin j, z_j, − < z′ < z_j, +, with respect to some redshift z in a second bin i, integrated over some angular window of area (ΔΩ), $ξ_{con}^{(j)} (z) = \int_{χ_{j, -}}^{χ_{j, +}} \frac{d χ^{'}}{𝒟 (z)} \int \frac{d Ω d Ω^{'}}{(ΔΩ)^{2}} ξ (x, x'),$ $Mathematical equation: \begin{equation} \xiconzj(z) = \int_{\chi_{j,-}}^{\chi_{j,+}} \frac{\dd\chi'}{\cD(z)} \int \frac{\dd\vOm\dd\vOm'}{(\Delta\Omega)^2} \, \xi(\vx,\vx') , \label{xib-ij-def} \end{equation}$ (8)where $Mathematical equation: \hbox{$\vx=(\chi,\cD\vOm)$}$ and $Mathematical equation: \hbox{$\vx'=(\chi',\cD'\vOm')$}$ . Here χ(z) and $Mathematical equation: \hbox{$\cD(z)$}$ are the comoving radial and angular distances, and we introduce the factor $Mathematical equation: \hbox{$1/\cD(z)$}$ so that $ξ_{con}^{(j)}$ $Mathematical equation: \hbox{$\xiconzj$}$ is dimensionless. Equation (8) is a “conical” average, within the observational cone. However, for small angular windows it is possible to use a flat-sky approximation and to approximate this “conical” average $ξ_{con}^{(j)}$ $Mathematical equation: \hbox{$\xiconzj$}$ by a “cylindrical” average $ξ_{cyl}^{(j)}$ $Mathematical equation: \hbox{$\xicylzj$}$ . Thus, using Eq. (3) and assuming that the correlation ξ is negligible on cosmological scales, we write for circular angular windows of radius θ_s, for a redshift z that also belongs to the j-bin, $\begin{matrix} z_{j, -} < z < z_{j, +} : ξ_{cyl}^{(j)} (z) & = & \int \begin{matrix} χ_{j, +} \\ χ_{j, -} \end{matrix} \frac{d χ^{'}}{𝒟} \int \frac{d θ d θ^{'}}{(π θ_{s}^{2})^{2}} \int d k \end{matrix}$ $Mathematical equation: \begin{eqnarray} z_{j,-} \!<\! z \!<\! z_{j,+} : \;\; \xicylzj(z) & = & \int_{\chi_{j,-}}^{\chi_{j,+}} \! \frac{\dd\chi'}{\cD} \, \int \! \frac{\dd\vtheta\dd\vtheta'}{(\pi\theta_{\rm s}^2)^2} \int \! \dd\vk \, \nonumber \\ && \times \; {\rm e}^{\ii k_{\parallel}\cdot(\chi'-\chi) +\ii \vk_{\perp}\cdot\cD(\vtheta'-\vtheta)} \; P(k;z) , \label{Cij-3} \end{eqnarray}$ (9)and $ξ_{cyl}^{(j)} (z) = 0$ $Mathematical equation: \hbox{$\xicylzj(z) = 0$}$ if z does not belong to the j-bin. Here k_∥ and k_⊥ are the longitudinal and transverse components of k, with respect to the line of sight, while θ and θ′ are the 2D transverse angular vectors.

For a redshift binning that is not too small, $Mathematical equation: \hbox{$\Delta\chi \gg \cD\theta_{\rm s}$}$ , longitudinal wavenumbers above 1/(Δχ) are suppressed by integrating χ′ along the line of sight, and the integral is dominated by wavenumbers with k ≃ k_⊥ and $Mathematical equation: \hbox{$k_{\perp} \sim 1/(\cD\theta_{\rm s})$}$ . Thus, using the Fourier form of Limber’s approximation (Limber 1953), which is widely used in weak-lensing studies (Kaiser 1992; Munshi et al. 2008), the integration over χ′ yields a Dirac term (2π)δ_D(k_∥), and the integration over k_∥ gives $ξ_{cyl}^{(j)} (z) ≃ ξ_{cyl} (z),$ $Mathematical equation: \begin{equation} \xicylzj(z) \simeq \xicyl(z) , \label{flat-Limber} \end{equation}$ (10)with $ξ_{cyl} (z) = \frac{2 π}{𝒟} \int \frac{d θ d θ^{'}}{(π θ_{s}^{2})^{2}} \int d k ⊥ e^{i k ⊥ \cdot 𝒟 (θ^{'} - θ)} P (k_{⊥}; z),$ $Mathematical equation: \begin{equation} \xicyl(z) = \frac{2\pi}{\cD} \int \frac{\dd\vtheta \dd\vtheta'}{(\pi\theta_{\rm s}^2)^2} \int\! \dd \vk_{\perp} \, {\rm e}^{\ii \vk_{\perp}\cdot\cD(\vtheta'-\vtheta)} \, P(k_{\perp};z) , \label{Cij-4} \end{equation}$ (11)which does not depend on the size of the redshift bin j (because we have taken the limit of a very large redshift bin). Introducing the 2D Fourier-space circular window⁴, $\begin{matrix} ˜ \\ W_{2} \end{matrix} (k_{⊥} 𝒟 θ_{s}) = \int \frac{d θ}{π θ_{s}^{2}} e^{i k ⊥ \cdot 𝒟 θ} = \frac{2 J_{1} (k_{⊥} 𝒟 θ_{s})}{k_{⊥} 𝒟 θ_{s}},$ $Mathematical equation: \begin{equation} \tW_2(k_{\perp}\cD\theta_{\rm s}) = \int \frac{\dd\vtheta}{\pi\theta_{\rm s}^2} \, {\rm e}^{\ii\vk_{\perp}\cdot\cD\vtheta} = \frac{2J_1(k_{\perp}\cD\theta_{\rm s})}{k_{\perp}\cD\theta_{\rm s}} , \label{W-thetas} \end{equation}$ (12)we obtain $ξ_{cyl} (z) = π \int_{0}^{\infty} \frac{d k}{k} \frac{Δ^{2} (k,z)}{𝒟 k} \begin{matrix} ˜ \\ W_{2} \end{matrix} (k 𝒟 θ_{s})^{2},$ $Mathematical equation: \begin{equation} \xicyl(z) = \pi \int_0^{\infty} \frac{\dd k}{k} \frac{\Delta^2(k,z)}{\cD k} \tW_2(k\cD\theta_{\rm s})^2 , \label{I-thetas-def} \end{equation}$ (13)where we defined the 3D power per logarithmic wavenumber, Δ²(k,z), by $Δ^{2} (k,z) = 4 π k^{3} P (k,z) .$ $Mathematical equation: \begin{equation} \Delta^2(k,z) = 4\pi k^3 P(k,z) . \label{Delta2-def} \end{equation}$ (14)We will evaluate the accuracy of this approximation, based on the flat-sky and Limber’s approximation, in Sect. 3.2.3.

2.2. Numerical simulations

Our analytical formalism allows us to consider a broad range of scales and halo masses, from small angular windows up to full-sky surveys, and to compare the relative contributions to covariance matrices that arise from shot-noise and cosmic variance effects, and from low-order and high-order large-scale correlations. Numerical simulations do not easily allow such a detailed analysis; however, in order to validate our approach, we must check whether it agrees with estimates from simulations, wherever a comparison is possible.

We use the high-resolution full-sky Horizon simulation (Teyssier et al. 2009), based on the WMAP3 cosmology (Spergel et al. 2007). This is a 68.7 billion particle N-body simulation, featuring more than 140 billion cells in the AMR grid of the RAMSES code (Teyssier 2002). The simulation consists in a lightcone spanning the entire sky up to redshift ~1, with a mass resolution of 1.1 × 10¹⁰ M_⊙.

Halos in the (2 Gpc)³N-body simulation are found with the HOP algorithm (Eisenstein & Hut 1998). Their comoving positions are then converted into sky coordinates, taking their radial velocity into account when calculating redshifts. The physical effects of the baryons are neglected and the total mass is given by the number of particles inside the halo. We only consider halos at redshifts z ≤ 0.8, up to which the simulation is complete towards all directions.

We design simulated surveys by extracting rectangular fields in angular coordinates. To minimize the effect of intrinsic sample correlations, we impose a 10 (resp. 20) deg gap between consecutive fields when computing number counts (resp. correlation functions), which yields 138 (resp. 34) nonoverlapping fields that can be cut out in the simulation.

For the purpose of clustering analysis, auxiliary random fields are constructed by shuffling the angular coordinates of halos in the data fields, thus preserving the halo mass and redshift distributions. To gain in computational efficiency, the number of halos per random field is ten times the average number of halos in data fields. The Landy & Szalay estimator is then scaled accordingly to the ratios of pair numbers in data and random fields.

The mean and covariance of all quantities of interest are estimated by sample averaging over the extracted surveys. Because of the uniqueness of the simulation, a residual noise is expected whenever the area of individual fields becomes large and their number diminishes. Thus, there are about 41253 deg²/(ΔΩ) nonoverlapping fields of area (ΔΩ). (For instance, we cut 41 fields of 400 deg² for the analysis of the angular correlation function.)

3. Number density of halos

3.1. Mean number counts in redshift bins

We consider a population of objects defined by some property, such as their mass M, with a mean comoving number density per logarithmic interval of M written as dn/dlnM. Then, the mean number of objects in the redshift interval [z,z + dz] , within the solid angle dΩ on the sky, with a mass in the range [M,M + dM] , reads as $d Ω d N = d z d Ω | \frac{d V}{d z d Ω} | (z) \frac{d M}{M} \frac{d n}{d \ln M} (M,z),$ $Mathematical equation: \begin{equation} \dd\vOm \, \dd N = \dd z \, \dd\vOm \, \left|\frac{\dd V}{\dd z\dd\vOm}\right|\!(z) \, \frac{\dd M}{M} \, \frac{\dd n}{\dd\! \ln M}(M,z) , \label{dndz} \end{equation}$ (15)where |dV/dzdΩ| is the cosmological volume factor, which is given by $| \frac{d V}{d z d Ω} | (z) = 𝒟 (z)^{2} \frac{d χ}{d z},$ $Mathematical equation: \begin{equation} \left|\frac{\dd V}{\dd z\dd\vOm}\right|(z) = \cD(z)^2 \; \frac{\dd\chi}{\dd z} , \label{V-chi} \end{equation}$ (16)and χ(z) and $Mathematical equation: \hbox{$\cD(z)$}$ are the comoving radial and angular distances. In Eq. (15) and in the following we define N as the number density of objects per unit area on the sky, instead of the total number of objects within a given window (ΔΩ). This choice is more convenient for practical purposes because it allows a simpler comparison between different surveys that have different total areas.

We can split the interval of mass⁵ over several bins “α”, [M_α, −,M_α, +] and the observational cone over nonoverlapping redshift intervals “i”, [z_i, −,z_i, +] with z_i, + ≤ z_{i + 1, −}. In practice, one usually takes z_i, + = z_{i + 1, −} so as to cover a continuous range of redshifts. Then, the number of objects per unit area, in the bin (i,α), reads as $N̂ i,α = \int_{z_{i, -}}^{z_{i, +}} d z \frac{d χ}{d z} 𝒟^{2} \int_{ΔΩ} \frac{d Ω}{(ΔΩ)} \int_{M_{α, -} (z)}^{M_{α, +} (z)} \frac{d M}{M} \frac{d n̂}{d \ln M},$ $Mathematical equation: \begin{equation} \hN_{i,\alpha} = \int_{\zim}^{\zip} \!\! \dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \int_{\Delta\Omega} \! \frac{\dd\vOm}{(\Delta\Omega)} \int_{M_{\alpha,-}(z)}^{M_{\alpha,+}(z)} \! \frac{\dd M}{M} \, \frac{\dd\hn}{\dd\!\ln M} , \label{Ni-1} \end{equation}$ (17)where $Mathematical equation: \hbox{$\dd\hn/\dd\!\ln M$}$ is the observed density of objects. Here and in the following, we note observed quantities by a hat (i.e. in one realization of the sky) to distinguish them from mean quantities, such as the comoving number density of Eq. (15), that correspond to expectation values over many realizations. In practice, assuming ergodicity, these expectation averages are assumed to be identical to volume averages (in the case of statistically homogeneous and isotropic cosmologies).

To simplify notations we define the mean cumulative number density of objects observed at a given redshift, within the mass bin α (with boundaries that may depend on z), $n_{α} (z) = \int_{M_{α, -} (z)}^{M_{α, +} (z)} \frac{d M}{M} \frac{d n}{d \ln M} (M,z),$ $Mathematical equation: \begin{equation} \nb_{\alpha}(z) = \int_{M_{\alpha,-}(z)}^{M_{\alpha,+}(z)} \frac{\dd M}{M} \, \frac{\dd n}{\dd\!\ln M}(M,z) , \label{Nbz-def} \end{equation}$ (18)and we omit the explicit boundaries on mass in the following. Then, the mean number of objects per unit area in the redshift and mass bins (i,α) reads as $⟨ N̂ i,α ⟩ = \int_{χ_{i, -}}^{χ_{i, +}} d χ 𝒟^{2} n_{α} .$ $Mathematical equation: \begin{equation} \lag \hN_{i,\alpha} \rag = \int_{\chiim}^{\chiip} \dd\chi \, \cD^2 \, \nb_{\alpha} . \label{Ni-3} \end{equation}$ (19)

Fig. 3

The mean number density of dark matter halos per square degree, within redshift bins of width Δz = 0.1. We count all halos above the thresholds M_∗ = 2 × 10¹³,10¹⁴, and 5 × 10¹⁴h^-1 M_⊙, from top down to bottom. We compare our analytical results (solid lines) with numerical simulations (dashed lines).

We plot the mean number counts $Mathematical equation: \hbox{$\lag \hN_{i,\alpha} \rag$}$ of Eq. (19) in Fig. 3, per square degree, for redshift bins of width Δz = 0.1. Here we select all dark matter halos above a mass threshold M_∗, with M_∗ = 2 × 10¹³,10¹⁴, and 5 × 10¹⁴ h^-1 M_⊙. The error bars are the 3 − σ statistical errors obtained from the covariance matrices derived in Sect. 3.2.1, for 138 fields of 50 deg² as used in the simulations. We can check that our estimates agree reasonably well with the numerical results. The small discrepancies are probably due to the complex relation between theoretical halo masses and the actual halos found in the simulation box. In particular, it is well known that using different algorithms, such as spherical-overdensity, HOP or friends-of-friends algorithms, can lead to slightly different results (e.g. Eisenstein & Hut 1998; Tinker et al. 2008). However, this point is beyond the scope of this paper, as we only wish here to check that our analytical results provide reasonable estimates of the mean and covariance of halo number counts and correlations.

3.2. Covariance of number counts

As usual we define the covariance C_i,α;j,β of the statistical quantities $Mathematical equation: \hbox{$\hN_{i,\alpha}$}$ and $Mathematical equation: \hbox{$\hN_{j,\beta}$}$ by $C_{i,α; j,β} = ⟨ N̂ i,α N̂ j,β ⟩ - ⟨ N̂ i,α ⟩ ⟨ N̂ j,β ⟩ .$ $Mathematical equation: \begin{equation} C_{i,\alpha;j,\beta} = \lag \hN_{i,\alpha} \hN_{j,\beta} \rag - \lag \hN_{i,\alpha} \rag \lag \hN_{j,\beta} \rag . \label{C-ij-def} \end{equation}$ (20)As recalled in Appendix A, following Peebles (1980), it can be decomposed over “shot-noise” and “sample-variance” contributions, $C_{i,α; j,β} = C_{i,α; j,β}^{(s . n .)} + C_{i,α; j,β}^{(s . v .)},$ $Mathematical equation: \begin{equation} C_{i,\alpha;j,\beta} = C_{i,\alpha;j,\beta}^{\sn} + C_{i,\alpha;j,\beta}^{\sv} , \label{C-NiNj-sn-sv} \end{equation}$ (21)which write from Eq. (A.10) as $C_{i,α; j,β}^{(s . n .)} = δ_{i,j} δ_{α,β} \frac{⟨ N̂ i,α ⟩}{(ΔΩ)},$ $Mathematical equation: \begin{equation} C_{i,\alpha;j,\beta}^{\sn} = \delta_{i,j} \, \delta_{\alpha,\beta} \; \frac{\lag \hN_{i,\alpha} \rag}{(\Delta\Omega)} , \label{Cij-sn} \end{equation}$ (22)(for nonoverlapping mass binning), and $\begin{matrix} C_{i,α; j,β}^{(s . v .)} & = & \int i d χ 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \int_{α} \frac{d M}{M} \frac{d n}{d \ln M} \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,\alpha;j,\beta}^{\sv} & = & \int_i\dd \chi \, \cD^2 \int\frac{\dd\vOm}{(\Delta\Omega)} \int_{\alpha} \frac{\dd M}{M} \, \frac{\dd n}{\dd\!\ln M} \nonumber \\ && \hspace{0cm} \times \int_j\dd \chi' \, \cD'^2 \int\frac{\dd\vOm'}{(\Delta\Omega)} \int_{\beta}\frac{\dd M'}{M'} \, \frac{\dd n}{\dd\!\ln M'} \; \xih . \label{Cij-xi} \end{eqnarray}$ (23)Here we denote $\int_{i}$ $Mathematical equation: \hbox{$\int_i$}$ and $\int_{α}$ $Mathematical equation: \hbox{$\int_{\alpha}$}$ as the integrals over the redshift and mass bins i and α. The superscript “h” refers to the “halo” correlation function, which depends on the two redshifts, angular directions, and masses (or temperatures, etc.), ξ^h = ξ^h(M,x;M′,x′), with $Mathematical equation: \hbox{$\vx=(\chi,\cD\vOm)$}$ .

The first term $C_{i,α; j,β}^{(s . n .)}$ $Mathematical equation: \hbox{$C_{i,\alpha;j,\beta}^{\sn}$}$ is the shot-noise contribution and vanishes for nonoverlapping bins. As expected it decreases with the survey size as 1/(ΔΩ). (We recall that $Mathematical equation: \hbox{$\hN$}$ is the angular number density.) The second term $C_{i,α; j,β}^{(s . v .)}$ $Mathematical equation: \hbox{$C_{i,\alpha;j,\beta}^{\sv}$}$ is due to the “sample-variance” cross-correlation ξ^h between distant objects (Hu & Kravtsov 2003). Using the approximation (1), that is, the factorization of the dependence on mass and distance of ξ^h, we define the mean bias $b_{α}$ $Mathematical equation: \hbox{$\bb_{\alpha}$}$ at redshift z, for the mass bin α, through $b_{α} (z) n_{α} (z) = \int_{α} \frac{d M}{M} b (M,z) \frac{d n}{d \ln M} (M,z),$ $Mathematical equation: \begin{equation} \bb_{\alpha}(z) \, \nb_{\alpha}(z) = \int_{\alpha} \frac{\dd M}{M} \, b(M,z) \, \frac{\dd n}{\dd\!\ln M}(M,z) , \label{bb-def} \end{equation}$ (24)where $n_{α}$ $Mathematical equation: \hbox{$\nb_{\alpha}$}$ was defined in Eq. (18), so that Eq. (23) also writes as $\begin{matrix} C_{i,α; j,β}^{(s . v .)} & = & \int i d χ 𝒟^{2} b_{α} n_{α} \int_{j} d χ^{'} 𝒟^{' 2} b \begin{matrix} ^{'} \\ β \end{matrix} n \begin{matrix} ^{'} \\ β \end{matrix} \int \frac{d Ω d Ω^{'}}{(ΔΩ)^{2}} ξ (x, x') . \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,\alpha;j,\beta}^{\sv} & = & \int_i\dd\chi \, \cD^2 \, \bb_{\alpha} \nb_{\alpha} \int_j \dd\chi' \, \cD'^2 \, \bb_{\beta}^{\,'} \nb_{\beta}^{\,'} \int\! \frac{\dd\vOm \dd\vOm'}{(\Delta\Omega)^2} \, \xi(\vx,\vx') . \nonumber \\ && \label{Cij-sv-1} \end{eqnarray}$ (25)

3.2.1. Small angular windows

Using the approximation that the correlation function is negligible on cosmological scales, whence $Mathematical equation: \hbox{$\cD' \simeq \cD$}$ , $b^{^{'}} ≃ b$ $Mathematical equation: \hbox{$\bb^{\,'} \simeq \bb$}$ and $n^{^{'}} ≃ n$ $Mathematical equation: \hbox{$\nb^{\,'}\simeq \nb$}$ , we obtain $C_{i,α; j,β}^{(s . v .)} = \int_{i} d χ 𝒟^{5} b_{α} b_{β} n_{α} n_{β} ξ_{con}^{(j)},$ $Mathematical equation: \begin{equation} C_{i,\alpha;j,\beta}^{\sv} = \int_i \dd\chi \, \cD^5 \, \bb_{\alpha} \bb_{\beta} \, \nb_{\alpha} \nb_{\beta} \, \xiconzj , \label{Cij-2} \end{equation}$ (26)where $ξ_{con}^{(j)} (z)$ $Mathematical equation: \hbox{$\xiconzj(z)$}$ was defined in Eq. (8). It is a “conical” average, over objects “i” and “j” that are located at unrelated positions (χ,Ω) and (χ′,Ω′). To recall this “conical” integration along χ′ we have added the subscript “con”. This helps distinguishing quantities such as (8) from other averages of ξ over 3D spherical shells, which we encounter in Sect. 4 below. To further distinguish from the quantities encountered in Sects. 4 and 5, we put the label j, which refers to a redshift bin, as a superscript, whereas the labels i or j of the radial or angular bins studied in Sects. 4 and 5 appear as indices. (The parenthesis refer to the fact that in Limber’s approximation of wide redshift bins the dependence on the boundaries of the bin j disappears, because they are pushed to infinity, see Eq. (13).)

In the case where the nonoverlapping redshift bins are large enough to neglect the cross-correlation between different bins (we evaluate the accuracy of this approximation in Sect. 3.2.4), the integral (8) gives rise to a Kronecker factor δ_i,j. Moreover, for small angular windows and large enough redshift bins we can use the flat-sky approximation (9), where the “conical” average is approximated by a “cylindrical” average (and spherical harmonics are replaced by plane waves), and Limber’s approximations (10), where longitudinal wavenumbers are neglected over transverse wavenumbers. Substituting into Eq. (26) we recover the results of Hu & Kravtsov (2003), $C_{i,α; j,β}^{(s . v .)} = δ_{i,j} \int_{i} d χ 𝒟^{5} b_{α} b_{β} n_{α} n_{β} ξ_{cyl},$ $Mathematical equation: \begin{equation} C_{i,\alpha;j,\beta}^{\sv} = \delta_{i,j} \int_i \dd\chi \, \cD^5 \, \bb_{\alpha} \bb_{\beta} \, \nb_{\alpha} \nb_{\beta} \, \xicyl , \label{Cij-7} \end{equation}$ (27)where $ξ_{cyl} (z)$ $Mathematical equation: \hbox{$\xicyl(z)$}$ was defined in Eq. (13). (We evaluate the accuracy of the approximation (27) in Sect. 3.2.3.) Thus, while the shot-noise contribution (22) to the covariance matrix is diagonal, the sample-variance contribution (27) is only block-diagonal (for large redshift bins) since within the same redshift bin different mass bins are correlated.

Fig. 4

The variance σ_{N_i} of the halo angular number densities of Fig. 3, for redshift bins Δz = 0.1 and an angular window of 50 deg². We compare our analytical results (solid lines) with numerical simulations (dashed lines).

In Fig. 4 we compare with the numerical simulations our results for the variance σ_{N_i} of the halo angular number densities, where σ_{N_i} includes both the shot-noise and sample-variance contributions (21), $σ_{N_{i}} = \sqrt{C_{i,i}^{(s . n .)} + C_{i,i}^{(s . v .)}} .$ $Mathematical equation: \begin{equation} \sigma_{N_i} = \sqrt{C_{i,i}^{\sn}+C_{i,i}^{\sv}} . \label{sigma-Ni-def} \end{equation}$ (28)We consider an angular window of 50 deg², which corresponds for instance to the case of the XXL survey (Pierre et al. 2011). As for the mean densities of Fig. 3, we obtain a reasonable agreement with the simulations, and we correctly reproduce the dependence on halo mass and redshift.

Fig. 5

The shot-noise (dashed lines) and sample-variance (solid lines) errors for the angular number densities shown in Fig. 3, associated with a redshift binning of width Δz = 0.1, but up to z = 2, and an angular window of 50 deg².

Taking advantage of our analytical model, we compare in Fig. 5 the shot-noise and sample-variance contributions to the total error that was displayed in Fig. 4, but going up to redshift z = 2. Here we define $σ_{N_{i}}^{(s . n .)} = \sqrt{C_{i,i}^{(s . n .)}}, σ_{N_{i}}^{(s . v .)} = \sqrt{C_{i,i}^{(s . v .)}} .$ $Mathematical equation: \begin{equation} \sigma_{N_i}^{\sn} = \sqrt{C_{i,i}^{\sn}} , \;\;\; \sigma_{N_i}^{\sv} = \sqrt{C_{i,i}^{\sv}} . \label{sigma-Ni-sn-sv-def} \end{equation}$ (29)As expected, we can see that the error of observed number counts $Mathematical equation: \hbox{$\hN_i$}$ is dominated by the shot-noise contribution for rare halos (high mass or high redshift), where effects associated with the discreteness of the halo distribution are very important.

Signal-to-noise ratio

Fig. 6

The signal-to-noise ratios of number counts for an angular area ΔΩ = 50 deg², as in Figs. 3 and 4. We compare our analytical results (solid lines) with numerical simulations (dashed lines).

From the angular number density $Mathematical equation: \hbox{$\lag\hN_i\rag$}$ and its variance σ_{N_i} we define the signal-to-noise ratio as $\frac{S}{N} = \frac{⟨ N̂ i ⟩}{σ_{N_{i}}},$ $Mathematical equation: \begin{equation} \frac{S}{N} = \frac{\lag\hN_i\rag}{\sigma_{N_i}} , \label{SN-def} \end{equation}$ (30)which we compute from Eqs. (19) and (28). Thus, combining Figs. 3 and 4, we display in Fig. 6 this signal-to-noise ratio. In agreement with these previous figures, we obtain a good match to the numerical simulations. Thus, the analytical results are competitive with the numerical simulations since they appear to be no less reliable and much faster to compute.

Scalings with survey area and number of subfields

For practical purposes it is interesting to compare the signal-to-noise ratios associated with a different number of subfields at fixed total area ΔΩ, since this can help for choosing the best observational strategy, whether one should perform a single wide-field survey or several smaller scale surveys.

Fig. 7

The signal-to-noise ratios of number counts for a total angular area ΔΩ = 50 deg², divided over $Mathematical equation: \hbox{$\cN$}$ independent subfields. We show the results obtained for the numbers of subfields $Mathematical equation: \hbox{$\cN=1$}$ (solid lines), 2 (dashed lines), and 4 (dotted lines).

Therefore, let us consider a survey with a total angular window of area ΔΩ that can be split over $Mathematical equation: \hbox{$\cN$}$ angular subfields, which we assume to be independent and to have equal area $Mathematical equation: \hbox{$\Delta\Omega/\cN$}$ . For instance, the survey may be made of $Mathematical equation: \hbox{$\cN$}$ smaller regions that are well separated on the sky. Then, the total angular number density $N̂ \begin{matrix} tot \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hN^{\rm tot}_i$}$ of objects in the redshift bin [z_i, −,z_i, +] , summed over the $Mathematical equation: \hbox{$\cN$}$ smaller subfields of index α with angular number densities $N̂ \begin{matrix} (α) \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hN^{(\alpha)}_i$}$ , writes as $N̂ \begin{matrix} tot \\ i \end{matrix} = \frac{1}{𝒩} \sum_{α = 1}^{𝒩} N̂ \begin{matrix} (α) \\ i \end{matrix} .$ $Mathematical equation: \begin{equation} \hN^{\rm tot}_i = \frac{1}{\cN} \sum_{\alpha=1}^{\cN} \hN^{(\alpha)}_i . \label{Ntot-def} \end{equation}$ (31)Since all subfields are independent and have the same depth we have, for any α, $⟨ N̂ \begin{matrix} tot \\ i \end{matrix} ⟩ = ⟨ N̂ \begin{matrix} (α) \\ i \end{matrix} ⟩,$ $Mathematical equation: \begin{equation} \lag \hN^{\rm tot}_i \rag = \lag\hN^{(\alpha)}_i\rag , \label{Ntot-1} \end{equation}$ (32)which depends neither on (ΔΩ) nor $Mathematical equation: \hbox{$\cN$}$ . Without mass binning the covariance matrix remains diagonal, see Eq. (27), with a shot-noise contribution $C_{i,i}^{tot, (s . n .)} = \frac{1}{𝒩} C_{i,i}^{(α), (s . n .)} = \frac{⟨ N̂ \begin{matrix} tot \\ i \end{matrix} ⟩}{(ΔΩ)} \propto (ΔΩ)^{-1},$ $Mathematical equation: \begin{equation} C^{\rm tot,(s.n.)}_{i,i} = \frac{1}{\cN} \, C^{\rm (\alpha),(s.n.)}_{i,i} = \frac{\lag \hN^{\rm tot}_i \rag}{(\Delta\Omega)} \propto (\Delta\Omega)^{-1} , \label{CN-tot-sn} \end{equation}$ (33)while the sample-variance contribution is $C_{i,i}^{tot, (s . v .)} = \frac{1}{𝒩} C_{i,i}^{(α), (s . v .)} \propto ξ_{cyl, (α)} / 𝒩,$ $Mathematical equation: \begin{equation} C^{\rm tot,(s.v.)}_{i,i} = \frac{1}{\cN} \, C^{\rm (\alpha),(s.v.)}_{i,i} \propto \xicylalpha/\cN , \label{CN-tot-mf} \end{equation}$ (34)where $ξ_{cyl, (α)}$ $Mathematical equation: \hbox{$\xicylalpha$}$ is the typical value of the integral (13) within the small angular window $Mathematical equation: \hbox{$(\Delta\Omega)/\cN$}$ . Then, the signal-to-noise ratio scales as $\frac{S}{N} = \frac{⟨ N̂ \begin{matrix} tot \\ i \end{matrix} ⟩}{\sqrt{C_{i,i}^{tot,}}} \propto \sqrt{\frac{(ΔΩ)}{1 + \frac{(ΔΩ)}{𝒩} ξ_{cyl, (α)}}} \cdot$ $Mathematical equation: \begin{equation} \frac{S}{N} = \frac{\lag \hN^{\rm tot}_i \rag}{\sqrt{C^{\rm tot,}_{i,i}}} \propto \sqrt{\frac{(\Delta\Omega)}{1+\frac{(\Delta\Omega)}{\cN} \xicylalpha}} \cdot \label{SN-Ntot} \end{equation}$ (35)In the regime where the covariance is dominated by the shot-noise contribution, the signal-to-noise ratio grows as the square root of the total area and does not depend on the number of subfields, $(S / N) \propto \sqrt{ΔΩ}$ $Mathematical equation: \hbox{$(S/N) \propto \sqrt{\Delta\Omega}$}$ .

To estimate the scaling in the regime where the covariance is dominated by the sample-variance contribution, we assume that on the relevant scale $Mathematical equation: \hbox{$k \sim 1/(\cD\theta_{\rm s})$}$ , that is, $k ~ \sqrt{𝒩 / (𝒟^{2} ΔΩ)}$ $Mathematical equation: \hbox{$k \sim \sqrt{\cN/(\cD^2\Delta\Omega)}$}$ , the power spectrum behaves as P(k) ~ kⁿ, with − 2 < n < 1. (Wavenumbers where n < − 2 for CDM cosmologies would correspond to very small angular windows.) Then, from Eq. (13) we obtain $ξ_{cyl, (α)} ~ k^{n + 2} ~ (𝒩 / ΔΩ)^{(n + 2) / 2}$ $Mathematical equation: \hbox{$\xicylalpha \sim k^{n+2} \sim (\cN/\Delta\Omega)^{(n+2)/2}$}$ , and the signal-to-noise ratio scales as $Mathematical equation: \hbox{$(S/N) \sim \cN^{-n/4} \, (\Delta\Omega)^{(n+2)/4}$}$ . Thus, in this regime the signal-to-noise ratio still grows with the total survey area, but there is a weak dependence on the number of subfields, which may either increase or decrease with $Mathematical equation: \hbox{$\cN$}$ depending on the sign of n.

For illustration, we show in Fig. 7 the signal-to-noise ratios of the number counts obtained for a total angular area ΔΩ = 50 deg², divided over $Mathematical equation: \hbox{$\cN$}$ subfields with $Mathematical equation: \hbox{$\cN=1$}$ , 2, and 4. The case of a single field, $Mathematical equation: \hbox{$\cN=1$}$ , corresponds to Figs. 3 and 4. The curves in Fig. 7 are exact estimates of the signal-to-noise ratios, obtained from $⟨ N̂ \begin{matrix} tot \\ i \end{matrix} ⟩ / \sqrt{C_{i,i}^{tot,}}$ $Mathematical equation: \hbox{$\lag \hN^{\rm tot}_i \rag/\!\!\sqrt{C^{\rm tot,}_{i,i}}$}$ and not from the approximate scaling given in Eq. (35).

In agreement with the discussion above and with Fig. 4, we can check that at high redshift or for high mass, the signal-to-noise ratio does not depend on $Mathematical equation: \hbox{$\cN$}$ , since the error is dominated by the shot-noise contribution, which only depends on the total area as seen in Eqs. (33) and (35).

At low redshift and low mass, where the error is dominated by the sample-variance contribution, the signal-to-noise ratio increases slightly with $Mathematical equation: \hbox{$\cN$}$ . This can be understood from the fact that the local slope n of the power spectrum is slightly negative on the scales where the halo correlation is significant. For instance, within the ΛCDM cosmology that we consider in this paper, a window of area 50 deg² corresponds at z = 1 to a radius $Mathematical equation: \hbox{$\cD\theta_{\rm s}=164~h^{-1}$}$ Mpc, and the local slope n(k) of the linear power spectrum, at wavenumber k ≃ 2π/(164 h^-1 Mpc), is n ≃ − 0.6. This means that for the number counts at low redshift and low masses, it is slightly advantageous to choose a survey divided over several independent subfields.

As shown by Figs. I.1 and I.2 in Appendix I, these scalings are approximately satisfied by the results obtained from numerical simulations, for a wide variety of survey area and of number of subfields. Therefore, the scalings derived from Eq. (35) allow a reasonable estimate of the dependence of the signal-to-noise ratio of number counts with (ΔΩ) and $Mathematical equation: \hbox{$\cN$}$ .

3.2.2. Large angular windows

Expression (27) relies on the approximation of small angular windows for the sample-variance contribution. This allowed us to use the flat-sky approximation (9), where the observational cone over the redshift bin [z_i, −,z_i, +] is approximated as a cylinder of radius θ_s around the central line of sight. For large angular windows this approximation is no longer valid and we must decompose over spherical harmonics (Hu & Kravtsov 2003), rather than over the plane waves of Eq. (9).

Rather than using the Eqs. (25) or (26), with a new expression for $ξ_{con}^{(j)} (z)$ $Mathematical equation: \hbox{$\xiconzj(z)$}$ that would remain valid for large angles, it is more convenient to go back to the angular number densities $Mathematical equation: \hbox{$\hN$}$ , as in Eq. (15). Thus, for any redshift bin i we expand the observed distribution $Mathematical equation: \hbox{$\hN_i(\vOm)$}$ on the sky over the spherical harmonics, $N̂ i (Ω) = \sum_{ℓ,m} N̂ \begin{matrix} (ℓ,m) \\ i \end{matrix} Y_{ℓ}^{m} (Ω),$ $Mathematical equation: \begin{equation} \hN_i(\vOm) = \sum_{\ell,m} \hN_i^{(\ell,m)} \, Y_{\ell}^m(\vOm) , \label{Ni-lm} \end{equation}$ (36)and we define the angular power spectrum as $⟨ N̂ \begin{matrix} (ℓ,m) * \\ i \end{matrix} N̂ \begin{matrix} (ℓ^{'}, m^{'}) \\ j \end{matrix} ⟩_{c}^{(s . v .)} = δ_{ℓ, ℓ^{'}} δ_{m, m^{'}} C_{i,j; ℓ},$ $Mathematical equation: \begin{equation} \lag \hN_i^{(\ell,m)*} \hN_j^{(\ell',m')} \rag_c^{\sv} = \delta_{\ell,\ell'} \, \delta_{m,m'} \, C_{i,j;\ell} , \label{Cl-def} \end{equation}$ (37)where we only take the sample-variance contribution. To simplify notations we do not include mass binning, but this can be added without difficulty, as in Sect. 3.2.1. Then, writing the two-point correlation function again under the factored form (1), introducing the Fourier-space power spectrum as in Eq. (3) and expanding the plane-wave exponential factor over spherical harmonics, a standard calculation gives (Hu 2000; Hu & Kravtsov 2003) $\begin{matrix} C_{i,j; ℓ} & = & 4 π \int \frac{d k}{k} Δ_{L 0}^{2} (k) \int_{i} d χ χ^{2} b n D_{+} j_{ℓ} (kχ) \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j;\ell} & = & 4\pi \int \frac{\dd k}{k} \Delta^2_{L0}(k) \int_i \dd\chi \, \chi^2 \, \bb \, \nb \, D_+ \, j_{\ell}(k\chi) \nonumber \\ && \times \int_j \dd\chi' \, \chi'^2 \, \bb' \, \nb' \, D'_+ \, j_{\ell}(k\chi') , \label{Cl-noflat} \end{eqnarray}$ (38)where j_ℓ is the spherical Bessel function of order ℓ. Here we assumed for simplicity a flat background, which is sufficient for practical purposes, and we approximated the Fourier-space power spectrum by the linear power spectrum, P(k,z) ≃ D₊(z)²P_L0(k), where D₊(z) is the linear growth rate (normalized to unity at z = 0).

Fig. 8

The angular power spectrum of the distribution of halos in the redshift bin 0.95 < z < 1.05. We plot both the exact result (38) (solid line) and Limber’s approximation (39) (dotted line).

Limber’s approximation can be recovered in the limit of large ℓ, for slowly varying k-dependent prefactors, by using the property ^∫dk k²j_ℓ(kχ)j_ℓ(kχ′) = π/(2χ²)δ_D(χ − χ′), and the correspondence k ↔ (ℓ + 1/2)/χ (Hu & Kravtsov 2003; LoVerde & Afshordi 2008). This yields $C_{i,j; ℓ}^{Limber} = δ_{i,j} \int d χ χ^{5} b^{2} n^{2} \frac{2 π^{2}}{(ℓ + 1 / 2)^{3}} Δ^{2} (\frac{ℓ + 1 / 2}{χ}) \cdot$ $Mathematical equation: \begin{equation} C_{i,j;\ell}^{\rm Limber} = \delta_{i,j} \int \dd \chi \, \chi^5 \, \bb^2 \, \nb^2 \, \frac{2\pi^2}{(\ell+1/2)^3} \, \Delta^2\!\left(\frac{\ell+1/2}{\chi}\right) \cdot \label{Cl-flat} \end{equation}$ (39)Nevertheless, because the structures of Eqs. (38) and (26) are quite different (the order of the integrations over redshift and wavenumber is exchanged, and the large-angle expression keeps two integrations over redshift, while in the small-angle expression one integral over redshift has already been performed) it is more convenient to treat the small-angle and large-angle derivations separately.

We plot in Fig. 8 the angular power spectrum C_i,i;ℓ obtained for halos above three mass thresholds in the redshift bin 0.95 < z < 1.05. We can see that Limber’s approximation (39) significantly underestimates the power at low ℓ, while it slightly overestimates the power at high multipoles, ℓ > 20. It is already rather good at ℓ ~ 20, and becomes increasingly accurate at higher ℓ, although the difference remains on the order of 10% until ℓ ~ 80. These results agree with previous studies of the Limber approximation (LoVerde & Afshordi 2008; Crocce et al. 2011). As noticed in LoVerde & Afshordi (2008), the latter can be extended as a series expansion over (ℓ + 1/2)^-1, but higher orders behave increasingly badly at low ℓ. Therefore, we do not investigate further this approach here, since using the exact expression (38) is not more difficult (but slower) to compute and ensures a smooth behavior over all ℓ, while the usual Limber approximation (39) is sufficient for our purposes on small scales.

Next, the mean angular number densities $Mathematical equation: \hbox{$\hN_i$}$ of Eq. (17), smoothed over the angular window of radius θ_s and filter W₂(Ω), read as $N̂ i = \int d Ω N̂ i (Ω) W_{2} (Ω),$ $Mathematical equation: \begin{equation} \hN_i = \int\dd\vOm \, \hN_i(\vOm) \, W_2(\vOm) , \label{Ni-W2} \end{equation}$ (40)where W₂(Ω) = 1/(ΔΩ) within the angular window and vanishes outside (but we can choose more general filters). Then, the sample variance of these number counts writes as (Hu & Kravtsov 2003) $C_{i,j}^{(s . v .)} = ⟨ N̂ i N̂ j ⟩_{c}^{(s . v .)} = \sum_{ℓ,m} C_{i,j; ℓ} | {W_{2}^{(ℓ,m)}}_{˜} |^{2},$ $Mathematical equation: \begin{equation} C_{i,j}^{\sv} = \lag \hN_i \hN_j \rag_c^{\sv} = \sum_{\ell,m} C_{i,j;\ell} \; | \tW_2^{(\ell,m)}|^2 , \label{Cii-W2} \end{equation}$ (41)where C_i,j;ℓ is the angular power spectrum (38), while ${W_{2}^{(ℓ,m)}}_{˜}$ $Mathematical equation: \hbox{$\tW_2^{(\ell,m)}$}$ are the angular multipoles of the window W₂, ${W_{2}^{(ℓ,m)}}_{˜} = \int d Ω W_{2} (Ω) Y_{ℓ}^{m} (Ω)^{*} .$ $Mathematical equation: \begin{equation} \tW_2^{(\ell,m)} = \int \dd\vOm \, W_2(\vOm) \, Y_{\ell}^m(\vOm)^* . \label{tW2-lm-def} \end{equation}$ (42)For a top-hat window that is symmetric around the azimuthal axis, we have for ℓ ≥ 1, $\begin{matrix} {W_{2}^{(ℓ, 0)}}_{˜} & = & \frac{2 π}{(ΔΩ)} \int_{0}^{θ_{s}} d θ \sin θ Y_{ℓ}^{0} (θ) \\ = \end{matrix}$ $Mathematical equation: \begin{eqnarray} \tW_2^{(\ell,0)} & = & \frac{2\pi}{(\Delta\Omega)} \int_0^{\theta_{\rm s}} \dd \theta \; \sin\theta \; Y_{\ell}^0(\theta) \\ & = & \sqrt{\frac{\pi}{2\ell+1}} \, \frac{P_{\ell-1}(\cos\theta_{\rm s}) - P_{\ell+1}(\cos\theta_{\rm s})}{(\Delta\Omega)} \label{W2-l0} \end{eqnarray}$ and ${W_{2}^{(0, 0)}}_{˜} = \frac{1}{2 \sqrt{π}}, and (ΔΩ) = 2 π (1 - \cos θ_{s}),$ $Mathematical equation: \begin{equation} \tW_2^{(0,0)} = \frac{1}{2\sqrt{\pi}} , \hspace{0.3cm} \mbox{and} \hspace{0.3cm} (\Delta\Omega) = 2\pi (1-\cos\theta_{\rm s}) , \label{W2-00} \end{equation}$ (45)where P_ℓ are the Legendre polynomials, and ${W_{2}^{(ℓ,m)}}_{˜} = 0$ $Mathematical equation: \hbox{$\tW_2^{(\ell,m)} = 0$}$ for m ≠ 0.

Fig. 9

The shot-noise (dashed line) and sample-variance errors (29) for the angular number densities in the redshift bin 0.95 < z < 1.05, as a function of the radius θ_s of the angular window. The solid line is the exact sample variance, from Eqs. (38) and (41), while the dotted line is the result (27), which was used in Fig. 5 and involves both the flat-sky and Limber’s approximations.

In the limit of large ℓ, $Y_{ℓ}^{0} (θ) ≃ \sqrt{\frac{ℓ + 1 / 2}{2 π}} J_{0} [(ℓ + 1 / 2) θ]$ $Mathematical equation: \hbox{$Y_{\ell}^0(\theta) \simeq \sqrt{\frac{\ell+1/2}{2\pi}} J_0[(\ell+1/2)\theta]$}$ (Hu 2000), and we obtain $| {W_{2}^{(ℓ, 0)}}_{˜} |^{2} ≃ \frac{kχ}{2 π} \begin{matrix} _{˜} \\ W_{2} \end{matrix} (kχ θ_{s})^{2}$ $Mathematical equation: \hbox{$|\tW_2^{(\ell,0)}|^2 \simeq \frac{k\chi}{2\pi} \tW_2(k\chi\theta_{\rm s})^2$}$ , with k = (ℓ + 1/2)/χ and $\begin{matrix} ˜ \\ W_{2} \end{matrix}$ $Mathematical equation: \hbox{$\tW_2$}$ the 2D Fourier-space window (12). This shows that for small angles, where the covariance is dominated by large ℓ, the expression (41) goes to the flat-sky approximation (27), using the fact that Limber’s approximation (39) also applies in this limit (see Fig. 8).

We plot in Fig. 9 the shot-noise and sample-variance errors $σ_{i} = \sqrt{C_{i,i}}$ $Mathematical equation: \hbox{$\sigma_i=\sqrt{C_{i,i}}$}$ , as in Fig. 4 but as a function of the angular radius θ_s, for the angular number densities in the redshift bin 0.95 < z < 1.05. In agreement with Fig. 8, we can check that the combination (27) of the flat-sky & Limber’s approximations provides a good approximation to the exact result (41) on small angles, typically θ_s < 10 deg, while it underestimates the sample variance on large angles. In agreement with Fig. 5, the shot-noise contribution is dominant for massive and rare halos, and subdominant for small and numerous halos. In our case, the transition between the shot-noise and sample-variance dominated regimes takes place at M_∗ ~ 10¹⁴ h^-1 M_⊙.

3.2.3. Accuracy of the “flat-sky + Limber” approximation

Fig. 10

The ratio of the exact sample-variance error (41) to the approximation (27), which uses both the flat-sky and Limber’s approximations. We show this ratio as a function of the radius θ_s of the angular window, for several redshift bins, for halos above the mass threshold M > 10¹⁴ h^-1 M_⊙. Higher z corresponds to a higher ratio.

We plot in Fig. 10 the ratio of the exact sample-variance error (41) to the approximation (27), which used both the flat-sky and Limber’s approximations. As in Figs. 8 and 9, we can check that the approximation (27) is reliable for small angular windows but significantly underestimate the sample-variance error for wide angles, above 10 deg. In the extreme case of full-sky surveys (θ_s = 180 deg), it can underestimate the sample-variance error by a factor from 2 to 5. The effect is actually greater for higher redshift bins. This may seem somewhat surprising since the “flat-sky” approximation (9) is expected to be more accurate at higher redshifts, where large angles θ correspond to large distances $Mathematical equation: \hbox{$\cD\theta$}$ that are weakly correlated and should not significantly contribute (i.e., the CDM power spectrum itself yields more weight to pairs separated by small angles). However, Limber’s approximation (10) goes in the opposite direction because it relies on the assumption $Mathematical equation: \hbox{$\Delta\chi \gg \cD \theta_{\rm s}$}$ (i.e., longitudinal wavenumbers are more suppressed by the integration along the line of sight than transverse wavenumbers, which are only integrated over the smaller angular distance $Mathematical equation: \hbox{$\cD \theta_{\rm s}$}$ ). For instance, for an Einstein-de Sitter universe, where $Mathematical equation: \hbox{$\chi=\cD=2c/H_0[1-(1+z)^{-1/2}]$}$ , this constraint on the angle θ_s and the redshift bin width Δz writes as θ_s ≪ 29(Δz) [(1 + z)^3/2 − (1 + z)] ^-1 deg. At fixed Δz this upper bound on θ_s becomes stronger at higher z. Therefore, Fig. 10 shows that this second effect, associated with Limber’s approximation, dominates over the first effect, associated with the flat-sky approximation.

We checked that we obtain very close results for other mass thresholds (not shown in the figure). For instance, the curves obtained for the mass threshold 2 × 10¹³ and 5 × 10¹⁴ h^-1 M_⊙ cannot be distinguished from those plotted in Fig. 10. This is not surprising since within our bias model (1) the halo correlations are governed by the same matter density correlation function ξ(r;z).

Figure 10 shows that the small-angle approximation (27) that we used in Sect. 3.2.1 was legitimate, since we considered angular windows of 50 deg² or less (i.e. θ_s ≲ 4 deg).

3.2.4. Correlation between different redshift bins

Fig. 11

The correlation matrix $ℛ_{i,j}^{(s . v .)}$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}$}$ of Eq. (46) between redshift bins of width Δz = 0.1. We show $ℛ_{i,j}^{(s . v .)}$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}$}$ as a function of j, for four values of i. In each case, $ℛ_{i,j}^{(s . v .)} = 1$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}=1$}$ at j = i. We consider halos above 10¹⁴ h^-1 M_⊙ and an angular window of area (ΔΩ) = 50 deg².

We can use the expression (41) to compute the correlation between different redshift bins i and j. Thus, we show in Fig. 11 the correlation matrix $ℛ_{i,j}^{(s . v .)}$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}$}$ (also called normalized covariance matrix) defined as $ℛ_{i,j}^{(s . v .)} = \frac{C_{i,j}^{(s . v .)}}{\sqrt{C_{i,i}^{(s . v .)} C_{j,j}^{(s . v .)}}},$ $Mathematical equation: \begin{equation} \cR^{\sv}_{i,j} = \frac{C^{\sv}_{i,j}}{\sqrt{C^{\sv}_{i,i} C^{\sv}_{j,j}}} , \label{correlation-matrix} \end{equation}$ (46)where we only consider the sample-variance contribution. (The shot-noise contribution (22) is always diagonal for nonoverlapping redshift bins.) Thus, $ℛ_{i,j}^{(s . v .)}$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}$}$ is unity along the diagonal and elements { i,j } where $ℛ_{i,j}^{(s . v .)}$ $Mathematical equation: \hbox{$\cR^{\sv}_{i,j}$}$ is much smaller than one are weakly correlated. We can check that the decay is always rather fast and correlations between neighboring redshift bins, j = i ± 1, are already below 10%. This shows that it is appropriate to neglect cross-correlations between redshift bins of width Δz = 0.1, as we did in Sect. 3.2.1. We also checked that we obtain almost identical results for other angular windows, such as (ΔΩ) = 400 deg². (For very large or full-sky surveys we do not need the approximation of uncorrelated redshift bins since we use Eq. (41).)

4. Real-space two-point correlation function

In the previous section we have studied the covariance of the estimators $Mathematical equation: \hbox{$\hN_i$}$ , which measure the redshift distribution dn/dz of the population of interest (galaxies, clusters, etc.), over a set of finite redshift bins. This corresponds to one-point statistics. We now study estimators of the real-space two-point correlation function ξ(x₁₂;M₁,M₂;z) of these objects, which corresponds to two-point statistics, as a function of the comoving distance x₁₂. In this article we do not investigate redshift-space distortions, which we leave for future works, and we assume that a real-space map of the population under study is available or that redshift distortions can be neglected.

Estimators of 3D correlation functions, or power spectra, have already been studied in many works, mostly in view of their application to galaxy surveys. However, since we have in mind the application to cluster surveys and, more generally, to deep surveys of rare objects, we consider 3D correlation functions averaged over a wide redshift bin (in order to accumulate a large enough number of objects), rather than the usual local 3D correlation functions at a given redshift. This means that the quantities that we consider in this section, while being truly 3D correlations and not 2D angular correlations, nevertheless involve integrations along the line of sight or, more precisely, the observational cone, within a finite redshift interval. This is also why 3D Fourier-space power spectra may not be the most convenient tool for our purposes, since we do not have homogeneous and isotropic distributions since the radial direction plays a special role.

4.1. Mean correlation

4.1.1. Peebles & Hauser estimator

Following Peebles & Hauser (1974), a simple estimator $Mathematical equation: \hbox{$\hxi$}$ for the two-point correlation function of a point distribution is given by $ξ̂ = \frac{DD}{RR} - 1,$ $Mathematical equation: \begin{equation} \hxi = \frac{DD}{RR} - 1 , \label{hxi-DR} \end{equation}$ (47)where D represents the data field and R an independent Poisson distribution, both with the same mean density. More precisely, the estimator $Mathematical equation: \hbox{$\hxi_i$}$ introduced in (47) for the mean correlation over the radial bin [R_i, −,R_i, +] corresponds to counting all pairs “DD” in the data field that fall in this pair-separation bin i and all pairs “RR” in the auxiliary Poisson field that fall in the same bin, and to taking the ratio of these two counts.

Before appropriate rescaling, the mean number density of the actual Poisson process R is taken as much higher than the observed one, so that the contribution from fluctuations of the denominator RR to the noise of $Mathematical equation: \hbox{$\hxi$}$ can be ignored. The advantage of form (47) is that one automatically includes the geometry of the survey (including boundary effects, cuts, etc.), because the auxiliary field R is drawn on the same geometry.

In our case, we write the analog $Mathematical equation: \hbox{$\hxi_i$}$ of Eq. (47) for the mean correlation on scales delimited by R_i, − and R_i, +, integrated over some redshift range and mass intervals, as $\begin{matrix} 1 + ξ̂ i; α,β & = & \frac{1}{Q_{i; α,β}} \int d z \frac{d χ}{d z} 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \int_{α} \frac{d M}{M} \end{matrix}$ $Mathematical equation: \begin{eqnarray} 1+\hxi_{i;\alpha,\beta} & = & \frac{1}{\QQ_{i;\alpha,\beta}} \int \dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \int\frac{\dd\vOm}{(\Delta\Omega)} \int_{\alpha} \frac{\dd M}{M} \nonumber \\ && \times \, \int_{\Rim}^{\Rip} \dd\vr' \int_{\beta} \frac{\dd M'}{M'} \; \frac{\dd\hn}{\dd\!\ln M} \frac{\dd\hn}{\dd\!\ln M'} , \label{xi-1} \end{eqnarray}$ (48)with $\begin{matrix} Q_{i; α,β} & = & \int d z \frac{d χ}{d z} 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \int_{α} \frac{d M}{M} \int_{R_{i, -}}^{R_{i, +}} d r' \int_{β} \frac{d M^{'}}{M^{'}} \end{matrix}$ $Mathematical equation: \begin{eqnarray} \QQ_{i;\alpha,\beta} & = & \int\dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \int\frac{\dd\vOm}{(\Delta\Omega)} \int_{\alpha} \frac{\dd M}{M} \int_{\Rim}^{\Rip} \dd\vr' \int_{\beta} \frac{\dd M'}{M'} \nonumber \\ && \times \, \frac{\dd n}{\dd\!\ln M} \frac{\dd n}{\dd\!\ln M'}\cdot \label{QQi-def} \end{eqnarray}$ (49)Here we denoted ${}^{\int}R_{i, -}_{R_{i, +}} d r^{'}$ $Mathematical equation: \hbox{$\int_{\Rim}^{\Rip} \dd\vr'$}$ as the integral over the 3D spherical shell of radii R_i, − < R_i, +, and $\int_{α}$ $Mathematical equation: \hbox{$\int_{\alpha}$}$ and $\int_{β}$ $Mathematical equation: \hbox{$\int_{\beta}$}$ are the integrals over the mass bins α and β.

The redshift interval Δz is not necessarily small, and to increase the statistics we can choose the whole redshift range of the survey, such as [0,z_s] . If we bin the survey over smaller nonoverlapping redshift intervals, which are large enough to neglect cross-correlations between different bins (see for instance Fig. 11), we can independently study each redshift bin. For simplicity we do not explicitly write the redshift boundaries.

As in Eq. (47), the counting method that underlies Eq. (48) can be understood as follows (Peebles & Hauser 1974). We span all objects in the “volume” (z,Ω,lnM), and count all neighbors at distance r′, within the shell [R_i, −,R_i, +] , with a mass M′. We denote with unprimed letters the quantities associated with the first object, (z,Ω,lnM), and with primed letters the quantities associated with the neighbor of mass M′ at distance r′. Thus, with obvious notations, $Mathematical equation: \hbox{$\dd\hn/\dd\!\ln M$}$ and $Mathematical equation: \hbox{$\dd\hn/\dd\!\ln M'$}$ are the observed number densities at the first and second (neighboring) points. The difference between the quantities $Mathematical equation: \hbox{$(1+\hxi)$}$ and Q is that in the latter case we use the mean number densities dn/dlnM and dn/dlnM′. Therefore, Q is not a random quantity so it shows no noise. In practice, the mean number densities dn/dlnM may actually be measured from the same survey, as described in Sect. 3. However, since these measures do not involve a distance binning over r′, there are many more objects in a redshift bin than within a small interval [R_i, −,R_i, +] . Then, the one-point quantities $Mathematical equation: \hbox{$\lag\hN\rag$}$ are measured with much greater accuracy than $Mathematical equation: \hbox{$\lag\hxi\rag$}$ , so that we can indeed neglect their contribution to the noise of the estimator $Mathematical equation: \hbox{$\hxi$}$ . In terms of Eq. (47) this corresponds to neglecting fluctuations of “RR”. (This is achieved in practice by choosing a much higher density for the field R, which is later rescaled.)

In Eq. (48) we used a simple average over the shell [R_i, −,R_i, +] , because we count all pairs with a uniform weight in r′-space. Through the change to spherical coordinates dr′ = dr′r^′2dΩ′, this yields a geometrical weight r^′2 in terms of the radial distance r′. An alternative would be to add a weight r^′ − 2, instead of the simple 3D top-hat written in Eq. (48), to eventually obtain a uniform weight over the radial distance r′. For simplicity we only consider choice (48) in the following, but such alternative weights could be used with straightforward modifications in the expressions given below.

Thus, we focus on the behavior of the two-point correlation as a function of distance r′, measured through the binning over the intervals [R_i, −,R_i, +] . We assume that different bins do not overlap, R_i, + ≤ R_{i + 1, −}, and in practice one usually has R_i, + = R_{i + 1, −}, to cover a continuous range of scales. On the other hand, these intervals may depend on redshift, as long as R_i, +(z) ≤ R_{i + 1, −}(z) at each redshift.

Using Eq. (18), Eq. (49) also writes as $Q_{i; α,β} = \int d χ 𝒟^{2} n_{α} n_{β} 𝒱_{i},$ $Mathematical equation: \begin{equation} \QQ_{i;\alpha,\beta} = \int \dd\chi \, \cD^2 \, \nb_{\alpha} \nb_{\beta} \, \cV_i , \label{QQ-1} \end{equation}$ (50)where the volume $Mathematical equation: \hbox{$\cV_i$}$ of the i-shell is $𝒱_{i} (z) = \frac{4 π}{3} [R_{i, +} (z)^{3} - R_{i, -} (z)^{3}],$ $Mathematical equation: \begin{equation} \cV_i(z) = \frac{4\pi}{3} [\Rip(z)^3-\Rim(z)^3] , \label{Vi-def} \end{equation}$ (51)which may depend on z. In practice, one would usually choose constant comoving shells, so that $Mathematical equation: \hbox{$\cV_i$}$ does not depend on z. To obtain Eq. (50) we used that dn/dlnM and dn/dlnM′ have no scale dependence (because they correspond to a uniform distribution of objects) and we neglected edge effects. (These finite-size effects are discussed and evaluated in Appendix B.)

Because of the finite distance r′ between the two objects M and M′ in Eq. (48), there is no shot-noise contribution to the average of the quadratic term $Mathematical equation: \hbox{$(\dd\hn/\dd\!\ln M)\times(\dd\hn/\dd\!\ln M')$}$ . Within the framework presented for Eqs. (A.1), (A.2), the integration in Eq. (48) does not contain common small (infinitesimal) cells, because of the finite-size distance r′ > R_i, −. Therefore, the average of the statistical estimator (48) reads as $\begin{matrix} 1 + ⟨ ξ̂ i; α,β ⟩ & = & \frac{1}{Q_{i; α,β}} \int d χ 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \int_{α} \frac{d M}{M} \int_{i} d r' \int_{β} \frac{d M^{'}}{M^{'}} \end{matrix}$ $Mathematical equation: \begin{eqnarray} 1\!+\!\lag\hxi_{i;\alpha,\beta}\rag & \! = \! & \frac{1}{\QQ_{i;\alpha,\beta}} \int\!\! \dd\chi \, \cD^2 \int\!\! \frac{\dd\vOm}{(\Delta\Omega)} \int_{\alpha}\frac{\dd M}{M} \int_i \dd\vr' \int_{\beta} \frac{\dd M'}{M'} \nonumber \\ && \times \, \frac{\dd n}{\dd\!\ln M} \frac{\dd n}{\dd\!\ln M'} \, [1+\xih(r';M,M';z) ] , \label{xi-2} \end{eqnarray}$ (52)where ξ^h(r′;M,M′;z) is the two-point correlation function of the objects, as in Eq. (A.9). Here we denoted ^∫_idr′ as the integral ${}^{\int}R_{i, -}_{R_{i, +}} d r^{'}$ $Mathematical equation: \hbox{$\int_{\Rim}^{\Rip}\dd\vr'$}$ over the 3D spherical shell i. Comparing with Eq. (50) we clearly see that $Mathematical equation: \hbox{$\hxi_i$}$ is an unbiased estimator of the two-point correlation function ξ^h, averaged over the shell [R_i, −,R_i, +] (with a geometrical weight r^′2), whence the name “ $Mathematical equation: \hbox{$\hxi$}$ ”.

As in Eq. (50), in Eq. (52) and in the following we neglect finite-size effects, which arise because the integration over r′ should be restricted to the observational cone of the survey. This leads to a smaller available volume than the spherical shell [R_i, −,R_i, +] close to the survey boundaries. This does not affect the mean value of the estimator $Mathematical equation: \hbox{$\hxi_i$}$ , because this effect cancels out between the numerator of Eq. (52) and the denominator Q_i. However, it will have a small effect on our estimate of the covariance matrix. As described in Appendix B, at z = 1 for a circular survey area ΔΩ = 50 deg², and for a radial bin at r = 30 h^-1 Mpc, by geometrical counting we overestimate the number of pairs by 10% and the signal-to-noise ratio by 5%.

As in Sect. 3.2, in order to make progress we assume that the two-point correlation can be factored in as in Eq. (1), so that Eq. (52) reads as $⟨ ξ̂ i; α,β ⟩ = \frac{1}{Q_{i; α,β}} \int d χ 𝒟^{2} b_{α} b_{β} n_{α} n_{β} 𝒱_{i} ξ_{i^{'}}^{(r)} (z),$ $Mathematical equation: \begin{equation} \lag\hxi_{i;\alpha,\beta}\rag = \frac{1}{Q_{i;\alpha,\beta}} \int\dd\chi \, \cD^2 \, \bb_{\alpha} \bb_{\beta} \, \nb_{\alpha} \nb_{\beta} \, \cV_i \, \overline{\xir_{i'}}(z) , \label{xi-3} \end{equation}$ (53)with $ξ_{i^{'}}^{(r)} (z) = \int_{i} \frac{d r'}{𝒱_{i}} ξ (r^{'}; z) .$ $Mathematical equation: \begin{equation} \overline{\xir_{i'}}(z) = \int_i\frac{\dd\vr'}{\cV_i} \, \xi(r';z) . \label{xi-i-i-def} \end{equation}$ (54)We have introduced the superscript “(r)” to recall that Eq. (54) is the radial average of ξ, over the 3D spherical shell associated with the radial bin i, to distinguish it from the angular averages that we encounter in Sect. 5 below. The prime in the subscript “i′” also recalls that we integrate over a neighboring point r′, with respect to a given point $Mathematical equation: \hbox{$(\chi,\cD\vOm)$}$ of the observational cone, to distinguish it from the integration over an unrelated point within the observational cone as in the “cylindrical” average (8). We give in Eq. (C.3) in Appendix C the Fourier-space expression of $ξ_{i^{'}}^{(r)} (z)$ $Mathematical equation: \hbox{$\overline{\xir_{i'}}(z)$}$ , which is more convenient for numerical computations.

Fig. 12

The mean halo correlation, $⟨ ξ̂ \begin{matrix} LS \\ i \end{matrix} ⟩$ $Mathematical equation: \hbox{$\lag\hxiLS_i\rag$}$ , over ten comoving distance bins within 5 < r < 100 h^-1 Mpc, equally spaced in log (r). We integrate over halos within the redshift interval 0 < z < 0.8 and we compare our analytical results (solid lines) with numerical simulations (dashed lines).

4.1.2. Landy & Szalay estimator

As shown in Landy & Szalay (1993), a better estimator than (47) is given by $ξ̂ LS = \frac{DD - 2 DR + RR}{RR},$ $Mathematical equation: \begin{equation} \hxiLS = \frac{DD-2DR+RR}{RR} , \label{hxi-LS-DR} \end{equation}$ (55)which involves the product DR between the data and the auxiliary field. Within our framework, where the mean quantity Q plays the role of R, this second estimator reads as $\begin{matrix} ξ̂ \begin{matrix} LS \\ i; α,β \end{matrix} & = & \frac{1}{Q_{i; α,β}} \int d z \frac{d χ}{d z} 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \int_{α} \frac{d M}{M} \int_{i} d r' \int_{β} \frac{d M^{'}}{M^{'}} \\ \times \frac{d n̂}{d \ln M} \frac{d n̂}{d \ln M^{'}} - 2 \frac{1}{Q_{i; α,β}} \int d z \frac{d χ}{d z} 𝒟^{2} \int \frac{d Ω}{(ΔΩ)} \end{matrix}$ $Mathematical equation: \begin{eqnarray} \hxiLS_{i;\alpha,\beta} & = & \frac{1}{\QQ_{i;\alpha,\beta}} \int \dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \int\frac{\dd\vOm}{(\Delta\Omega)} \int_{\alpha}\frac{\dd M}{M} \int_i \dd\vr' \int_{\beta} \frac{\dd M'}{M'} \nonumber \\ && \times \, \frac{\dd\hn}{\dd\!\ln M} \frac{\dd\hn}{\dd\!\ln M'} - 2 \, \frac{1}{\QQ_{i;\alpha,\beta}} \int \dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \int\frac{\dd\vOm}{(\Delta\Omega)} \nonumber \\ && \times \, \int_{\alpha} \frac{\dd M}{M} \int_i \dd\vr' \int_{\beta}\frac{\dd M'}{M'} \, \frac{\dd\hn}{\dd\!\ln M} \frac{\dd n}{\dd\!\ln M'} + 1 . \label{xi-LS-1} \end{eqnarray}$ (56)The difference between the terms associated with DD and DR is that in the former we have a product of two observed number densities, $Mathematical equation: \hbox{$(\dd\hn/\dd\!\ln M)\times(\dd\hn/\dd\!\ln M')$}$ , while in the latter we have a crossproduct between the observed and the mean number densities, $Mathematical equation: \hbox{$(\dd\hn/\dd\!\ln M)\times(\dd n/\dd\!\ln M')$}$ .

As checked in Appendix E, the mean of this second estimator $ξ̂ \begin{matrix} LS \\ i; α,β \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_{i;\alpha,\beta}$}$ is equal to the mean of the estimator $Mathematical equation: \hbox{$\hxi_{i;\alpha,\beta}$}$ studied in Sect. 4.1.1, $⟨ ξ̂ \begin{matrix} LS \\ i; α,β \end{matrix} ⟩ = ⟨ ξ̂ i; α,β ⟩ .$ $Mathematical equation: \begin{equation} \lag\hxiLS_{i;\alpha,\beta}\rag = \lag\hxi_{i;\alpha,\beta}\rag . \label{xi-LS-2} \end{equation}$ (57)To simplify the notations, in the following we do not consider binning over mass (i.e., we independently consider the correlation functions of halos above some mass thresholds), so that Eq. (53) readily simplifies as $⟨ ξ̂ \begin{matrix} LS \\ i \end{matrix} ⟩ = ⟨ ξ̂ i ⟩ = \frac{1}{Q_{i}} \int d χ 𝒟^{2} b^{2} n^{2} 𝒱_{i} ξ_{i^{'}}^{(r)} (z),$ $Mathematical equation: \begin{equation} \lag\hxiLS_i\rag = \lag\hxi_i\rag = \frac{1}{Q_i} \int\dd\chi \, \cD^2 \, \bb^2 \, \nb^2 \, \cV_i \, \overline{\xir_{i'}}(z) , \label{xi-4} \end{equation}$ (58)and a similar simplification holds for Q_i. If needed, it is not difficult to include a mass binning in the expressions given in the following.

4.1.3. Comparison with simulations

We compare in Fig. 12 the mean correlation (58) with results from numerical simulations (which use the Landy & Szalay estimator) for halos above the thresholds M > 2 × 10¹³ and 10¹⁴ h^-1 M_⊙, within the redshift range 0 < z < 0.8. The error bars are the 3 − σ statistical errors obtained from the covariance matrices derived in Sect. 4.2.2 for 34 fields of 50 deg² as used in the simulations. We obtain reasonable agreement with the simulations, although we appear to underestimate the halo correlation of the most massive halos at small radius, r < 7 h^-1 Mpc. This may be due to a scale-dependent halo bias or to a small discrepancy in the definition of the halo mass, which depends on the halo-finder algorithm (Knebe et al. 2011).

4.2. Covariance matrices for the halo correlation

We now consider the covariance of the estimators $Mathematical equation: \hbox{$\hxi_i$}$ and $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ . As described in Appendix D, the covariance of the Peebles & Hauser estimator is given by $C_{i,j} = ⟨ ξ̂ i ξ̂ j ⟩ - ⟨ ξ̂ i ⟩ ⟨ ξ̂ j ⟩ = C_{i,j}^{(2)} + C_{i,j}^{(3)} + C_{i,j}^{(4)},$ $Mathematical equation: \begin{equation} C_{i,j} = \lag\hxi_i\hxi_j\rag - \lag\hxi_i\rag \lag\hxi_j\rag = C_{i,j}^{(2)} + C_{i,j}^{(3)} + C_{i,j}^{(4)} , \label{xii-xij-2} \end{equation}$ (59)with (see also Landy & Szalay 1993 for a computation of low-order terms) $\begin{matrix} C_{i,j}^{(2)} & = & δ_{i,j} \frac{2}{(ΔΩ) Q_{i}^{2}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} [1 + {ξ_{i, i^{'}}^{h}}^{]}, \\ C_{i,j}^{(3)} & = & \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [1 + ξ_{i, i^{'}}^{h} + ξ_{i, j^{'}}^{h} + ξ_{i^{'}, j^{'}}^{h} + {ζ_{i, i^{'}, j^{'}}^{h}}^{]}, \\ C_{i,j}^{(4)} & = & \frac{1}{Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \\ \times \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{j}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [4 ξ_{i; j}^{h} + 2 ζ_{i; j, j^{'}}^{h} + 2 ζ_{i, i^{'}; j}^{h} + 2 ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h} + {η_{i, i^{'}; j, j^{'}}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{(2)} & = & \delta_{i,j} \, \frac{2}{(\Delta\Omega) \QQ_i^2} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\\label{C2-def} && \times \, \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \left[ 1+\xih_{i,i'} \right] , \\ C_{i,j}^{(3)} & = & \frac{4}{(\Delta\Omega) \QQ_i\QQ_j} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \times \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \; \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\\label{C3-def} && \times \left[ 1+ \xih_{i,i'} + \xih_{i,j'} + \xih_{i',j'} + \zetah_{i,i',j'} \right] , \\ C_{i,j}^{(4)} & = & \frac{1}{\QQ_i\QQ_j} \int\!\! \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \! \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \nonumber \\ && \times \int \dd\chi_j \, \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_j} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\ \label{C4-def} && \times \left[ 4 \xih_{i;j} + 2 \zetah_{i;j,j'} + 2 \zetah_{i,i';j} + 2 \xih_{i;j'} \xih_{i';j} + \etah_{i,i';j,j'} \right] , \end{eqnarray}$ where ξ^h, ζ^h, and η^h, are the two-point, three-point, and four-point correlation functions of the objects. To make the expressions compact but easy to understand, we introduced the following notation in Eqs. (60)–(62). Variables associated with the object at the center of the $Mathematical equation: \hbox{$\cV_i$}$ -shell are noted by the label i (e.g., χ_i,M_i,...) and those associated with the object within the $Mathematical equation: \hbox{$\cV_i$}$ -shell are noted by the label i′ (e.g., r_i′,M_i′,...). This corresponds to the primed and unprimed variables in Eqs. (48) and (52), and we may speak of objects i,i′,j, and j′. Then, in the indices of the correlation functions, we separate with a semicolon, as in $ξ_{i; j}^{h}$ $Mathematical equation: \hbox{$\xih_{i;j}$}$ of Eq. (62), objects i and j that are located at unrelated positions (χ_i,Ω_i) and (χ_j,Ω_j) in the observational cone, whereas we separate with a comma, as in $ξ_{i, i^{'}}^{h}$ $Mathematical equation: \hbox{$\xih_{i,i'}$}$ of Eq. (60), objects that are located at a fixed distance r′. (More precisely, the distance r′ is restricted to a radial bin $Mathematical equation: \hbox{$\cV$}$ .)

The label C⁽ⁿ⁾ refers to quantities that involve n distinct objects. Thus, the contributions C⁽²⁾ and C⁽³⁾ arise from shot-noise effects (as is apparent through the prefactors 1/(ΔΩ)), associated with the discreteness of the number density distribution, and they would vanish for continuous distributions. However, they also involve the two-point and three-point correlations, and as such they couple discreteness effects with the underlying large-scale correlations of the population. In case of zero large-scale correlations, they remain nonzero because of the unit factors in the brackets and become purely shot-noise contributions, arising solely from discreteness effects.

More precisely, contribution (60) arises from the coupled identification i = j and i′ = j′ (or i = j′ and i′ = j), whereas contribution (61) arises from the single identification i = j (or either one of i = j′, i′ = j, i′ = j′). Thus, in Eq. (61) the object i is at the center of both shells $Mathematical equation: \hbox{$\cV_i$}$ and $Mathematical equation: \hbox{$\cV_j$}$ .

Contribution C⁽⁴⁾ is a pure sample-variance contribution and does not depend on the discreteness of the number density distribution (hence there is no 1/(ΔΩ) prefactor).

As shown in Appendix E, the covariance matrix of the Landy & Szalay estimator reads as (see also Szapudi 2001; Bernardeau et al. 2002) $C_{i,j}^{LS} = C_{i,j}^{LS (2)} + C_{i,j}^{LS (3)} + C_{i,j}^{LS (4)},$ $Mathematical equation: \begin{equation} C^{\rm LS}_{i,j} = C^{\rm LS (2)}_{i,j} + C^{\rm LS (3)}_{i,j} + C^{\rm LS (4)}_{i,j} , \label{Cij-LS-2} \end{equation}$ (63)where the first term is equal to Eq. (60), $C_{i,j}^{LS (2)} = C_{i,j}^{(2)},$ $Mathematical equation: \begin{equation} C_{i,j}^{\rm LS (2)} = C_{i,j}^{(2)} , \label{C2-LS-def} \end{equation}$ (64)and $\begin{matrix} C_{i,j}^{LS (3)} & = & \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [ξ_{i^{'}, j^{'}}^{h} + {ζ_{i, i^{'}, j^{'}}^{h}}^{]}, \\ C_{i,j}^{LS (4)} & = & \frac{1}{Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \\ \times \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{j}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [2 ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h} + {η_{i, i^{'}; j, j^{'}}^{h}}^{]} . \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS (3)} & = & \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \times \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \; \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\ \label{C3-LS-def}&& \times \left[ \xih_{i',j'} + \zetah_{i,i',j'} \right] , \\ \!C_{i,j}^{\rm LS (4)} & = & \frac{1}{\QQ_i\QQ_j}\! \int \!\! \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \! \int_i \! \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \nonumber \\ && \times \int \dd\chi_j \, \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_j} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\ \label{C4-LS-def}&& \times \left[ 2 \xih_{i;j'} \xih_{i';j} + \etah_{i,i';j,j'} \right] . \end{eqnarray}$ By comparison with Eqs. (61)–(62) we can see that many terms have been canceled (Landy & Szalay 1993; Szapudi & Szalay 1998). This confirms that the estimator (56) is more efficient than (48), since its covariance will be smaller.

4.2.1. Low-order terms

In this section we assume that the radial bins [R_i, −,R_i, +] are restricted to large enough scales to neglect three and four-point correlation functions, as well as products such as $ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h}$ $Mathematical equation: \hbox{$\xih_{i;j'} \xih_{i';j}$}$ . We compute these high-order terms in Sect. 4.2.2 and Figs. 15 and 16 show the range where they can be neglected. Along the diagonal, for halos above 10¹⁴h^-1M_⊙ this corresponds to the full range 5 < r < 100 h^-1 Mpc. For lower mass halos, M > 2 × 10¹³h^-1M_⊙, all scales receive significant contributions from high-order terms, but the low-order terms contribute to about 50% for r < 15 h^-1 Mpc. This is sufficient for our purposes in this section, which are to compare the Peebles & Hauser and the Landy & Szalay estimators, the shot-noise and sample-variance effects, and the scalings with survey area and number of subfields. Accurate computation of the covariance matrix requires taking all terms into account, which we do in Sect. 4.2.2.

Thus, in this section we only keep the contributions that are constant or linear over the two-point correlation function ξ^h of the objects, and we again assume that the two-point correlation function can be factored as in Eq. (1). Then, as shown in Appendix D, for the Peebles & Hauser estimator we obtain from Eqs. (60)–(62), at this order, $\begin{matrix} C_{i,j} & = & δ_{i,j} \frac{2}{(ΔΩ) Q_{i}} (1 + ⟨ ξ̂ i ⟩) + \frac{4}{(ΔΩ) Q_{i} Q_{j}} \\ \times \int d χ 𝒟^{2} n^{3} 𝒱_{i} 𝒱_{j} [1 + b^{2} (ξ_{i^{'}}^{(r)} + ξ_{j^{'}}^{(r)} + ξ_{i^{'}, j^{'}}^{(r)})] \\ + \frac{4}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{2} n^{4} 𝒱_{i} 𝒱_{j} ξ_{cyl}, \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j} & = & \delta_{i,j} \frac{2}{(\Delta\Omega)\QQ_i} (1+\lag\hxi_i\rag) + \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \nonumber \\ && \times \int \dd\chi \, \cD^2 \, \nb^3 \, \cV_i \cV_j \left[ 1 + \bb^2 \, \left(\overline{\xir_{i'}} \!+\! \overline{\xir_{j'}} \!+\! \overline{\xir_{i',j'}} \right) \right] \nonumber \\ \label{Cij-tot} && + \frac{4}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^2 \nb^4 \, \cV_i \cV_j \, \xicyl , \end{eqnarray}$ (67)where we introduced $ξ_{i^{'}, j^{'}}^{(r)} (z) = \int_{i} \frac{d r i^{'}}{𝒱_{i}} \int_{j} \frac{d r j^{'}}{𝒱_{j}} ξ (| r i^{'} - r j^{'} |; z) .$ $Mathematical equation: \begin{equation} \overline{\xir_{i',j'}}(z) = \int_i\frac{\dd\vr_{i'}}{\cV_i} \int_j\frac{\dd\vr_{j'}}{\cV_j} \; \xi(|\vr_{i'}-\vr_{j'}|;z) . \label{I3-ij-xi-def} \end{equation}$ (68)Following the notation explained earlier, below Eq. (62), the comma and the primes in $ξ_{i^{'}, j^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i',j'}}$}$ mean that this is a “spherical average”, more precisely the average over the two spherical shells $Mathematical equation: \hbox{$\cV_i$}$ and $Mathematical equation: \hbox{$\cV_j$}$ , in contrast to $ξ_{con}^{(j)}$ $Mathematical equation: \hbox{$\xiconzj$}$ in Eq. (8), which was a “conical” average within the observational cone. There are two indices, i′ and j′, because we integrate over the two shells $Mathematical equation: \hbox{$\cV_i$}$ and $Mathematical equation: \hbox{$\cV_j$}$ , whereas in $ξ_{i^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i'}}$}$ of Eq. (54) there was only one index i′ because we integrated over a single shell $Mathematical equation: \hbox{$\cV_i$}$ . The Fourier-space expression of Eq. (68), which can be convenient for numerical computations, is given in Eq. (D.10) in Appendix D.

Again, to obtain Eq. (67) we neglected finite-size effects, that is, we did not take the fact into account that close to the boundaries of the survey part of the shell $Mathematical equation: \hbox{$\cV_i$}$ is not observed. As explained in Appendix B, this only leads to an overestimate of 5% of the signal-to-noise ratio, for a radial bin of 30 h^-1 Mpc in a circular survey window of 50 deg². This error decreases for wider surveys or smaller radial bins.

For the Landy & Szalay estimator, at the same order the covariance matrix reads from Eqs. (64)–(66) as $\begin{matrix} C_{i,j}^{LS} & = & δ_{i,j} \frac{2}{(ΔΩ) Q_{i}} (1 + ⟨ ξ̂ i ⟩) + \frac{4}{(ΔΩ) Q_{i} Q_{j}} \\ \times \int d χ 𝒟^{2} b^{2} n^{3} 𝒱_{i} 𝒱_{j} ξ_{i^{'}, j^{'}}^{(r)} . \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS} & = & \delta_{i,j} \frac{2}{(\Delta\Omega)\QQ_i} (1+\lag\hxi_i\rag) + \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \nonumber \\ \label{Cij-LS-tot} && \times \int \dd\chi \, \cD^2 \, \bb^2 \, \nb^3 \, \cV_i \cV_j \, \overline{\xir_{i',j'}} . \end{eqnarray}$ (69)Again, as compared with Eq. (67) several terms have been canceled. Moreover, at this order only shot-noise terms, whether coupled to large-scale correlations or not, contribute to the Landy & Szalay covariance (69), as can be seen from the prefactors 1/(ΔΩ). In contrast, at the same order in the Peebles & Hauser covariance (67), we have two more shot-noise terms (coupled to the large-scale correlations through the means $ξ_{i^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i'}}$}$ and $ξ_{j^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{j'}}$}$ ) and one additional sample-variance-only contribution (i.e., the last term, without the prefactor 1/(ΔΩ)).

Comparison of Peebles & Hauser and Landy & Szalay covariance matrices

Fig. 13

The covariance matrices $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ (solid line) and C_i,j (dashed line) of the estimators $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ and $Mathematical equation: \hbox{$\hxi_i$}$ , for i = 4 associated with the distance bin 12.3 < r < 16.6 h^-1 Mpc, as a function of j. We show the results obtained for halos in the redshift range 0 < z < 0.8 with an angular window of 50 deg². Here we only consider the low-order terms given by Eqs. (67) and (69).

We show in Fig. 13 one row of the covariance matrices C_i,j and $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as a function of j at fixed i. We consider halos in the redshift range 0 < z < 0.8, for a window of 50 deg². The covariance is larger for the case of higher mass threshold. In agreement with Eqs. (67) and (69) and with standard results (Kerscher et al. 2000), the covariance of the Landy & Szalay estimator (55) is smaller than for the Peebles & Hauser estimator (47), especially for the lower mass threshold (the higher mass threshold case being more dominated by the common shot-noise contribution (64)).

As shown by Fig. 13, another advantage of the Landy & Szalay estimator is that its covariance matrix is much more diagonal than for the Peebles & Hauser estimator. This can be checked by comparing the left and middle panels of Fig. 19, where we show the correlation matrices ℛ_i,j defined as in Eq. (46), but where we include all shot-noise and sample-variance contributions of Eqs. (67) and (69).

Comparison of sample-variance and shot-noise effects

Fig. 14

The contributions C⁽²⁾ and C⁽³⁾ to the covariance of the Landy & Szalay estimator, along the diagonal i = j. As in Fig. 13, we only consider the low-order terms, given by Eq. (69).

We compare in Fig. 14 the contributions C⁽²⁾ (first term in Eq. (69)) and C⁽³⁾ (second term in Eq. (69)), again keeping only these low-order terms. We consider the same survey properties as in Fig. 13 but plot these contributions along the diagonal, i = j. Let us recall that both contributions C⁽²⁾ and C⁽³⁾ are shot-noise contributions (i.e., they arise from the discreteness of the halo distribution). However, they also involve the underlying large-scale correlations, as apparent through the factors ξ. In particular, C⁽³⁾, which arises from a single pair identification, vanishes if there are no large-scale correlations, whereas C⁽²⁾, which arises from two pair identifications, remains nonzero if ξ = 0 (the term associated with the factor 1 is thus a “pure shot-noise” contribution). Therefore, by comparing C⁽²⁾ and C⁽³⁾ we can assess the relative importance of shot-noise and sample-variance effects, C⁽²⁾ involving an extra degree of shot noise (one more pair identification). As expected, C⁽²⁾ is dominant for small distance bins, which correspond to small volumes, $Mathematical equation: \hbox{$\cV_i \propto r^3$}$ , and contain few halos. It also remains dominant up to larger scales in the case of more massive halos, which are rarer. Since the contribution C⁽²⁾ is diagonal, as shown by the Kronecker prefactor in Eq. (67), it implies that covariance matrices are more strongly diagonal for high-mass halos, as can be checked in Fig. 19 where we show the correlation matrices ℛ_i,j of small (upper row) and large (lower row) halos.

Scalings with survey area and number of subfields

As in Sect. 3.2.1, we consider the dependence of the signal-to-noise ratio on the total survey area ΔΩ and on the number $Mathematical equation: \hbox{$\cN$}$ of subfields. Thus, we define the estimator $ξ̂ \begin{matrix} LS, tot \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxi_i^{\rm LS,tot}$}$ as the mean of the estimators $ξ̂ \begin{matrix} LS, (α) \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxi_i^{\rm LS,(\alpha)}$}$ of Eq. (56) of the subfields, $ξ̂ \begin{matrix} LS, tot \\ i \end{matrix} = \frac{1}{𝒩} \sum_{α = 1}^{𝒩} ξ̂ \begin{matrix} LS, (α) \\ i \end{matrix} .$ $Mathematical equation: \begin{equation} \hxi_i^{\rm LS,tot} = \frac{1}{\cN} \sum_{\alpha=1}^{\cN} \hxi_i^{\rm LS,(\alpha)} . \label{xi-LS-tot-def} \end{equation}$ (70)Of course, the expectation value is independent of (ΔΩ) and $Mathematical equation: \hbox{$\cN$}$ , $⟨ ξ̂ \begin{matrix} LS, tot \\ i \end{matrix} ⟩ = ⟨ ξ̂ \begin{matrix} LS, (α) \\ i \end{matrix} ⟩ isindependentof (ΔΩ) and 𝒩 .$ $Mathematical equation: \begin{equation} \lag \hxi_i^{\rm LS,tot} \rag = \lag \hxi_i^{\rm LS,(\alpha)} \rag \; \mbox{is independent of} \; (\Delta\Omega) \; \mbox{and} \; \cN . \label{xi-LS-tot-1} \end{equation}$ (71)From Eq. (50) we can check that $Q_{i}^{(α)}$ $Mathematical equation: \hbox{$Q_i^{(\alpha)}$}$ does not depend on (ΔΩ) nor $Mathematical equation: \hbox{$\cN$}$ , so that for each subfield α, of area $Mathematical equation: \hbox{$(\Delta\Omega)/\cN$}$ , the covariance (69) scales as $C_{i,j}^{LS, (α)} \propto \frac{𝒩}{(ΔΩ)} \cdot$ $Mathematical equation: \begin{equation} C_{i,j}^{\rm LS,(\alpha)} \propto \frac{\cN}{(\Delta\Omega)} \cdot \label{Cij-LS-alpha} \end{equation}$ (72)Both terms in Eq. (69) scale in the same fashion, so that the structure of the covariance matrix does not change with (ΔΩ) nor $Mathematical equation: \hbox{$\cN$}$ (i.e. it does not become more or less diagonal), if we neglect boundary effects. Then, the covariance matrix of the averaged estimator (70) scales as $C_{i,j}^{LS, tot} = \frac{1}{𝒩} C_{i,j}^{LS, (α)} \propto \frac{1}{(ΔΩ)},$ $Mathematical equation: \begin{equation} C_{i,j}^{\rm LS,tot} = \frac{1}{\cN} C_{i,j}^{\rm LS,(\alpha)} \propto \frac{1}{(\Delta\Omega)} , \label{Cij-LS-tot-alpha} \end{equation}$ (73)so that the signal-to-noise ratio scales as $\frac{S}{N} = \frac{⟨ ξ̂ \begin{matrix} LS, tot \\ i \end{matrix} ⟩}{\sqrt{C_{i,j}^{LS, tot}}} \propto \sqrt{(ΔΩ)} .$ $Mathematical equation: \begin{equation} \frac{S}{N}=\frac{\lag \hxi_i^{\rm LS,tot} \rag}{\sqrt{C_{i,j}^{\rm LS,tot}}} \propto \sqrt{(\Delta\Omega)} . \label{SN-xi-tot} \end{equation}$ (74)Therefore, a single wide-field survey and a combination of several independent smaller surveys, with the same total area, show the same efficiency. This is because both terms in Eq. (69) scale in the same way with the survey geometry, as 1/(ΔΩ), because the sample-variance effects involved in these mixed contributions arise from the correlation between objects separated by a distance r < R_i, + + R_j, +, independently of the angular size of the survey. This is different from the sample-variance contribution (27) to the covariance of the number counts, which explicitly depends on the large-scale correlation over the survey angular size θ_s, see Eq. (13), because it arises from the correlation between objects located at any position in the survey cone. Of course, result (74) only applies to small length scales, $Mathematical equation: \hbox{$\Rip+\Rjp \ll \cD\theta_{\rm s}$}$ , where it is legitimate to neglect finite-size effects. For long wavelengths a wider survey is clearly more efficient, and the only possible choice for scales that are close to the larger survey diameter.

4.2.2. High-order terms for the covariance of $Mathematical equation: \hbox{$\hat{\xi}^\mathsf {LS}$}$

We now estimate the high-order terms for the covariance $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ of the Landy & Szalay estimator $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ that we neglected in Eq. (69), where we only kept terms of order zero or one over the two-point correlation function. To evaluate the contributions associated with the factors $ζ_{i, i^{'}, j^{'}}^{h}$ $Mathematical equation: \hbox{$\zetah_{i,i',j'}$}$ in Eq. (65) and $η_{i, i^{'}; j, j^{'}}^{h}$ $Mathematical equation: \hbox{$\etah_{i,i';j,j'}$}$ in Eq. (66), we use the model for the three- and four-point halo correlation functions described in Sect. 2.1.2. Then, as shown in Appendix F, the contribution associated with the product $ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h}$ $Mathematical equation: \hbox{$\xih_{i;j'} \xih_{i';j}$}$ in Eq. (66) is given by $C_{i,j}^{LS (ξξ)} = \frac{2}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{4} n^{4} 𝒱_{i} 𝒱_{j} ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)},$ $Mathematical equation: \begin{equation} C_{i,j}^{\rm LS (\xi\xi)} = \frac{2}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^4 \, \nb^4 \, \cV_i \cV_j \, \overline{\xir_{i;j'} \xir_{i';j}} , \label{CLS-xixi-1} \end{equation}$ (75)the term $ζ_{i, i^{'}, j^{'}}^{h}$ $Mathematical equation: \hbox{$\zetah_{i,i',j'}$}$ of Eq. (65) yields $\begin{matrix} C_{i,j}^{LS (ζ)} & = & \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ 𝒟^{2} b^{3} n^{3} 𝒱_{i} 𝒱_{j} \frac{S_{3}}{3} \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS (\zeta)} & = & \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi \, \cD^2 \, \bb^3 \, \nb^3 \, \cV_i \cV_j \, \frac{S_3}{3} \nonumber \\ && \times \, \left[ \overline{\xir_{i'}} \times \overline{\xir_{j'}} + \overline{\xir_{i',i} \xir_{i',j'}} +\overline{\xir_{j',i} \xir_{j',i'}} \right] , \label{CLS-zeta-1} \end{eqnarray}$ (76)and the term $η_{i, i^{'}; j, j^{'}}^{h}$ $Mathematical equation: \hbox{$\etah_{i,i';j,j'}$}$ of Eq. (66) gives $\begin{matrix} C_{i,j}^{LS (η)} & = & \frac{2}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{4} n^{4} 𝒱_{i} 𝒱_{j} \frac{S_{4}}{16} [ξ_{i^{'}}^{(r)} \times ξ_{i; j} ξ_{i; j^{'}}^{(r)} \\ + ξ_{j^{'}}^{(r)} \times ξ_{i; j} ξ_{j; i^{'}}^{(r)} + 2 ξ_{i^{'}}^{(r)} \times ξ_{j^{'}}^{(r)} \times ξ_{cyl} \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS (\eta)} & = & \frac{2}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^4 \, \nb^4 \, \cV_i \cV_j \, \frac{S_4}{16} \left[ \overline{\xir_{i'}} \times \overline{\xi_{i;j} \xir_{i;j'}} \right . \nonumber \\ && + \overline{\xir_{j'}} \times \overline{\xi_{i;j} \xir_{j;i'}} + 2 \, \overline{\xir_{i'}} \times \overline{\xir_{j'}} \times \xicyl \nonumber \\ && \left. + 2 \, \overline{\xir_{j';i}\xi_{i;j}\xir_{j;i'}} + \overline{\xir_{j';i}\xir_{i,i'}\xir_{i';j}} + \overline{\xir_{i';j}\xir_{j,j'}\xir_{j';i}} \right] \label{CLS-eta-1} \end{eqnarray}$ (77)where the various factors are given in Appendix F, and we used for Eqs. (76)–(77) the “hierarchical clustering ansatz”, described in Figs. 1 and 2 and given by Eqs. (4) and (6).

The terms $C_{i,j}^{LS (ξξ)}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS (\xi\xi)}$}$ and $C_{i,j}^{LS (η)}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS (\eta)}$}$ are “pure sample-variance” contributions. Thus, there is no prefactor 1/(ΔΩ) and they involve large-scale correlations among four halos, i,i′,j,j′. The term $C_{i,j}^{LS (ζ)}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS (\zeta)}$}$ is a coupled shot-noise and sample-variance contribution, as shown by the prefactor 1/(ΔΩ) and the fact that it involves large-scale correlations among three halos, i,i′,j′. (The discreteness of the halo distribution has led to the identification i = j, i.e. a shot-noise effect, which leaves three distinct halos.)

Fig. 15

The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ along its diagonal. We again consider halos in the redshift range 0 < z < 0.8, with an angular window of 50 deg², above two mass thresholds.

Fig. 16

The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 15, but along one row. This corresponds to the fixed bin i = 4, associated with the distance bin 12.3 < r < 16.6 h^-1 Mpc, as a function of j.

We compare in Figs. 15 and 16 the low-order contributions (69) with these high-order contributions (75)–(77). We can see that the latter can be non-negligible on these scales, 5 < r < 100 h^-1 Mpc. Along the diagonal, i = j, shown in Fig. 15, they are always significantly smaller than the low-order contribution (which includes both sample-variance and shot-noise effects) for massive halos, M > 10¹⁴ h^-1 M_⊙, but are close to it or larger for M > 2 × 10¹³ h^-1 M_⊙. On large scales the main high-order contribution is the term (75), associated with a product ξξ, while the terms (76) and (77), associated with the three- and four-point correlation functions, dominate on small scales. Indeed, the former does not increase much on small scales, whereas the latter are very sensitive to the smoothing scales R_i and R_j and show a steep growth on small scales, even though formally ζ is also of order ξξ within the model (4). This is because the term (75) involves the product of two correlations between two distinct lines of sight, as seen in Eq. (66), so that each ξ is averaged along the radial direction, while the term (76), which arises from one shot-noise contraction that has removed one line-of-sight integration, involves the product of two correlations between a central point and two points at distances R_i and R_j, as seen in Eq. (65).

As seen in Fig. 16, at fixed i the relative importance of these high-order contributions to $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ increases as the bin j shifts to smaller scales. Again, we can see that among these contributions the “ξξ” term (75) dominates on large scales and saturates on small scales, while the “ζ” and “η” terms (76) and (77) dominate on small scales and strongly depend on the smoothing scales.

That high-order contributions can become dominant as one of the bins i and j shifts to small scales agrees with expectations, as one probes deeper into the nonlinear regime where three- and four-point correlation functions become important, and with some previous studies (Meiksin & White 1999; Scoccimarro et al. 1999). This implies that the covariance matrix is less diagonal once we take these contributions into account, and it decreases the number of effectively independent modes. This can be checked in Fig. 19, where we show the correlation matrices $ℛ_{i,j}^{LS}$ $Mathematical equation: \hbox{$\cR_{i,j}^{\rm LS}$}$ without (middle panels) and with (right panels) these high-order contributions, for the mass thresholds M > 2 × 10¹⁴ h^-1 M_⊙ and M > 10¹⁴ h^-1 M_⊙. Therefore, for survey characteristics such as those of Figs. 15, 16, it is necessary to include high-order contributions to the covariance matrix of two-point estimators for moderate-mass halos that are not dominated by shot-noise effects.

4.2.3. Comparison with numerical simulations

Fig. 17

The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ of the Landy & Szalay estimator, along the diagonal i = j. We show our analytical results including all contributions (solid lines) or only low-order terms (dotted lines), and results from numerical simulations (dashed lines).

Fig. 18

The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 17, but along one row. This corresponds to the fixed bin i = 4, associated with the distance bin 12.3 < r < 16.6 h^-1 Mpc, as a function of j.

We display in Fig. 17 the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ of the Landy & Szalay estimator, along its diagonal. We show our results obtained when we include the high-order contributions of Sect. 4.2.2, see Eqs. (75)–(77), and when we only take the low-order terms of Eq. (69) into account. We obtain a good match to the numerical simulations, especially on the largest scales, which are also more reliable. In particular, we recover the strong dependence on radius and halo mass. We can see that, for moderate-mass halos, M > 2 × 10¹³ h^-1 M_⊙, the high-order contributions are not negligible (because the low-order shot-noise contribution is relatively smaller).

We show the same covariance matrix along its fourth row in Fig. 18. The results from the numerical simulations are somewhat noisy, especially for the rare massive halos at low radii. However, where they are reliable they show reasonably good agreement with our analytical results. In agreement with Sect. 4.2.2, it is clear that, even more than along the diagonal, the high-order contributions of Eqs. (75)–(77) cannot be neglected in order to obtain a good estimate of the off-diagonal terms of the covariance matrix (see also Fig. 19).

4.2.4. Correlation matrices

Fig. 19

Contour plots for the correlation matrix ℛ_i,j, defined as in Eq. (46) but for the full covariance matrix C_ij of the halo correlation. There are ten distance bins, over 5 < r < 100 h^-1 Mpc, equally spaced in log (r), as in previous figures. We consider halos in the redshift range 0 < z < 0.8, within an angular window of 50 deg², above the mass thresholds M > 2 × 10¹³ h^-1 M_⊙ in the upper row, and M > 10¹⁴ h^-1 M_⊙ in the lower row. Left panels: low-order contributions (67) for the Peebles & Hauser estimator. Middle panels: low-order contributions (69) for the Landy & Szalay estimator. Right panels: full correlation matrix, including the high-order contributions of Eqs. (75)–(77), for the Landy & Szalay estimator.

We show in Fig. 19 the correlation matrices, defined as in Eq. (46) but for the full covariance matrix C_ij of the halo correlation. (Although ℛ_i,j is a discrete 10 × 10 matrix, it is still possible to draw a contour plot by interpolation. This gives clear figures that are easier to read than a density plot where each cell is colored with a level of gray that depends on the entry ℛ_i,j.)

The left and middle panels of Fig. 19 clearly show the strong improvement associated with the use of the Landy & Szalay estimator in place of the Peebles & Hauser estimator. In agreement with Fig. 14 and the discussion in Sect. 4.2.1, the correlation matrix is more diagonal for massive halos, where the diagonal shot-noise contribution C⁽²⁾ of Eq. (60) is more important. Indeed, shot-noise effects become dominant for rare objects. For the same reason, high-order contributions to the covariance matrix, which are due to sample-variance effects, are more important for low-mass halos, as shown by the comparison between the middle and right panels. The slope of the contour lines in the right panels, especially in the low-mass case, shows that high-order terms are more important on small scales and also increase the correlation between small and large scales while making the matrix less diagonal.

Thus, for low-mass halos there are rather strong correlations between all scales in the range 5 < r < 100 h^-1 Mpc, and to obtain accurate estimates of error bars on cosmological parameters it is necessary to take off-diagonal entries and high-order contributions to the covariance matrix into account.

5. Angular correlation function

In the previous section we considered the real-space 3D correlation function, which requires knowledge of the radial position of the halos (or more generally of the objects of interest). If this information is not available (e.g., redshift estimates are too noisy or distance measures are highly contaminated by redshift-space distortions), it is still possible to derive some constraints on cosmology from the angular distribution of the objects on the sky (Peebles 1980; Eisenstein & Zaldarriaga 2001; Maller et al. 2005). Therefore, in this section we apply the formalism developed in Sect. 4 to the angular two-point correlation function w(θ).

5.1. Mean correlation

5.1.1. Peebles & Hauser estimator

As in Sect. 3.2.2, we write the observed number density of objects on the sky as $Mathematical equation: \hbox{$\hN(\vOm)$}$ , but we omit the index i of Eq. (40) since we consider a single redshift bin. As in Sect. 4, the width Δz is not necessarily small and may cover the whole redshift range of the survey. Then, using notations that are similar to Eq. (48), we can write the Peebles & Hauser estimator ŵ_i as $1 + ŵ i = \frac{1}{𝒬_{i}} \int \frac{d Ω}{(ΔΩ)} N̂ (Ω) \int_{θ_{i, -}}^{θ_{i, +}} d θ^{'} N̂ (Ω^{'}),$ $Mathematical equation: \begin{equation} 1+\hw_i = \frac{1}{\Qw_i} \int \frac{\dd\vOm}{(\Delta\Omega)} \, \hN(\vOm) \int_{\thetaim}^{\thetaip} \dd\vtheta' \hN(\vOm') , \label{wi-1} \end{equation}$ (78)with $𝒬_{i} = \int \frac{d Ω}{(ΔΩ)} N (Ω) \int_{θ_{i, -}}^{θ_{i, +}} d θ^{'} N (Ω^{'}),$ $Mathematical equation: \begin{equation} \Qw_i = \int \frac{\dd\vOm}{(\Delta\Omega)} \, \Nb(\vOm) \int_{\thetaim}^{\thetaip} \dd\vtheta' \, \Nb(\vOm') , \label{Qwi-def} \end{equation}$ (79)where $N (Ω)$ $Mathematical equation: \hbox{$\Nb(\vOm)$}$ is the mean angular number density on the direction Ω, given by $N = \int d χ 𝒟^{2} n (z) = \int d χ 𝒟^{2} \int \frac{d M}{M} \frac{d n}{d \ln M} (M,z) .$ $Mathematical equation: \begin{equation} \Nb = \int\! \dd\chi \, \cD^2 \, \nb(z) = \int\! \dd\chi \, \cD^2 \! \int\! \frac{\dd M}{M} \, \frac{\dd n}{\dd\!\ln M}(M,z) . \label{Nb-Om} \end{equation}$ (80)Here we used Eq. (18), and we assumed that the sky coverage is the same over the survey window (ΔΩ), so that $N (Ω)$ $Mathematical equation: \hbox{$\Nb(\vOm)$}$ is actually a constant that does not depend on Ω (but the formalism is readily extended to the more general case where we add a filter that depends on Ω).

Fig. 20

The mean angular correlation, $⟨ ŵ \begin{matrix} LS \\ i \end{matrix} ⟩$ $Mathematical equation: \hbox{$\lag\hwLS_i\rag$}$ , over eight angular bins within 1.25 < θ < 50 arcmin, equally spaced in log (θ). We compare our analytical results (solid lines) with numerical simulations (dashed lines).

Here and in the following, the index i of Eqs. (78), (79) refers to the angular bin [θ_i, −,θ_i, +] , over which we estimate the angular correlation w(θ). Again, we denote with unprimed letters the quantities associated with the first object, such as its position Ω on the sky, and with primed letters the quantities associated with the neighbor at distance θ′, such as its position Ω′. We use the flat-sky and Limber’s approximations, which are typically valid for angular radii below 10 deg, as seen in Fig. 10.

The quantity $Mathematical equation: \hbox{$\Qw_i$}$ introduced in Eq. (79) can be written as $𝒬_{i} = 𝒜_{i} N^{2},$ $Mathematical equation: \begin{equation} \Qw_i = \cA_i \, \Nb^2 , \label{Qw-1} \end{equation}$ (81)using that $N$ $Mathematical equation: \hbox{$\Nb$}$ defined in Eq. (80) does not depend on Ω in our case, and $Mathematical equation: \hbox{$\cA_i$}$ is the area of the i-ring, $𝒜_{i} = π (θ_{i, +}^{2} - θ_{i, -}^{2}) .$ $Mathematical equation: \begin{equation} \cA_i = \pi ( \thetaip^2 - \thetaim^2) . \label{Ai-def} \end{equation}$ (82)Then, we proceed as in Sect. 4. Substituting the observed 3D number density $Mathematical equation: \hbox{$\dd \hn/\dd\!\ln M$}$ as in Eq. (17), introducing the halo two-point correlation ξ^h when we take the average as in Eq. (52), and using the factorization (1), we obtain $⟨ ŵ i ⟩ = \frac{1}{N^{2}} \int d χ 𝒟^{5} b^{2} n^{2} ξ_{i^{'}}^{(θ)} (z),$ $Mathematical equation: \begin{equation} \lag\hw_i\rag = \frac{1}{\Nb^2} \int\dd\chi \, \cD^5 \, \bb^2 \, \nb^2 \, \overline{\xith_{i'}}(z) , \label{wi-2} \end{equation}$ (83)with $ξ_{i^{'}}^{(θ)} (z) = \int_{i} \frac{d θ^{'}}{𝒜_{i}} \int \frac{d χ^{'}}{𝒟} ξ (r^{'}; z) .$ $Mathematical equation: \begin{equation} \overline{\xith_{i'}}(z) = \int_i \, \frac{\dd\vtheta'}{\cA_i} \int \frac{\dd\chi'}{\cD} \, \xi(r';z) . \label{w-i-i-def} \end{equation}$ (84)The superscript “(θ)” recalls that Eq. (84) is an average over the angular ring $Mathematical equation: \hbox{$\cA_i$}$ , instead of the 3D spherical shell $Mathematical equation: \hbox{$\cV_i$}$ of Eq. (54). The prime in the subscript “i′” also recalls that we integrate over a neighboring point θ′, with respect to a given point $Mathematical equation: \hbox{$(\chi,\cD\vOm)$}$ of the observational cone. However, because the two points are only close in the 2D angular space (i.e., in the i-ring), we also integrate over the longitudinal coordinate χ′ along the full line of sight in Eq. (84).

Explicit expressions for $ξ_{i^{'}}^{(θ)} (z)$ $Mathematical equation: \hbox{$\overline{\xith_{i'}}(z)$}$ are given in Appendix G. In contrast to the number counts studied in Sect. 3, where, for large angles above a few degrees, it is necessary to go beyond Limber’s approximation, as found in Figs. 9 and 10, for our study of the angular correlation function Limber’s approximation is sufficient because we consider much smaller angular scales of a few arcmin.

5.1.2. Landy & Szalay estimator

As in Sect. 4.1.2, the measure of the angular correlation can be made more accurate by using the Landy & Szalay estimator instead of the Peebles & Hauser estimator (78) (Landy & Szalay 1993; Szapudi & Szalay 1998). As in Eq. (56), this reads as $\begin{matrix} ŵ \begin{matrix} LS \\ i \end{matrix} & = & \frac{1}{𝒬_{i}} \int \frac{d Ω}{(ΔΩ)} N̂ (Ω) \int d θ^{'} N̂ (Ω^{'}) \\ - \frac{2}{𝒬_{i}} \int \frac{d Ω}{(ΔΩ)} N̂ (Ω) \int d θ^{'} N (Ω^{'}) + 1, \end{matrix}$ $Mathematical equation: \begin{eqnarray} \hwLS_i & = & \frac{1}{\Qw_i} \int \frac{\dd\vOm}{(\Delta\Omega)} \, \hN(\vOm) \int \dd\vtheta' \, \hN(\vOm') \nonumber \\ \label{wi-LS-1} && - \frac{2}{\Qw_i} \int \frac{\dd\vOm}{(\Delta\Omega)} \, \hN(\vOm) \int \dd\vtheta' \, \Nb(\vOm') + 1 , \end{eqnarray}$ (85)and we can check that its mean is again equal to the average (83).

5.1.3. Comparison with simulations

We compare in Fig. 20 the mean correlation (83) with results from numerical simulations. The error bars are the 3 − σ statistical errors obtained from the covariance matrices derived in Sect. 5.2.2 for 41 fields of 400 deg² as used in the simulations. Above 5 arcmin we obtain a good match between our results and the numerical simulations. This could be expected from Sect. 4 since the angular correlation is a projection of the 3D correlation. On lower angular scales the discrepancy may be due to the finite size of the clusters. This implies that ξ^h = −1 at distances below the sum of the two cluster radii (exclusion effect), but we have not included this effect in our bias model (1). Since a typical cluster at z = 0.5 (with a size of 1 h^-1 Mpc) corresponds to an angle of ~ 2.5 arcmin and projection effects are rare (since clusters are rare objects with a surface density ~ 10 deg^-2), this exclusion effect indeed occurs at θ ≲ 5 arcmin and appears at slightly larger angles for more massive halos. This explains the behavior found on these scales in Fig. 20.

5.2. Covariance matrices for the halo angular correlation

The covariance matrices of the estimators ŵ_i and $ŵ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hwLS_i$}$ can be computed following the procedure used in Sect. 4 for the 3D correlation. Denoting again the covariance matrices as C_i,j and $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C^{\rm LS}_{i,j}$}$ , they decompose as in Eq. (59), $C_{i,j} = C_{i,j}^{(2)} + C_{i,j}^{(3)} + C_{i,j}^{(4)},$ $Mathematical equation: \begin{equation} C_{i,j} = C_{i,j}^{(2)} + C_{i,j}^{(3)} + C_{i,j}^{(4)} , \label{Cij-w-1} \end{equation}$ (86)where $C_{i,j}^{(4)}$ $Mathematical equation: \hbox{$C_{i,j}^{(4)}$}$ is a pure sample-variance contribution, whereas $C_{i,j}^{(2)}$ $Mathematical equation: \hbox{$C_{i,j}^{(2)}$}$ and $C_{i,j}^{(3)}$ $Mathematical equation: \hbox{$C_{i,j}^{(3)}$}$ are shot-noise contributions that arise when either one pair or two pairs of objects are identified. Again, the contributions $C_{i,j}^{(2)}$ $Mathematical equation: \hbox{$C_{i,j}^{(2)}$}$ and $C_{i,j}^{(3)}$ $Mathematical equation: \hbox{$C_{i,j}^{(3)}$}$ also involve the two-point and three-point correlations; i.e., they contain terms that couple discreteness effects with large-scale density correlations.

For the Peebles & Hauser estimator (78) we obtain, as in Eqs. (60)–(62), $\begin{matrix} C_{i,j}^{(2)} & = & δ_{i,j} \frac{2}{(ΔΩ) 𝒬_{i}^{2}} \int \frac{d Ω_{i}}{(ΔΩ)} d χ_{i} 𝒟_{i}^{2} \frac{d M_{i}}{M_{i}} \int d θ_{i^{'}} d χ_{i^{'}} 𝒟_{i^{'}}^{2} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i}^{'}} [1 + {ξ_{i, i^{'}}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{(2)} & = & \delta_{i,j} \, \frac{2}{(\Delta\Omega)\Qw_i^2} \int \!\! \frac{\dd\vOm_i}{(\Delta\Omega)} \dd\chi_i \, \cD_i^2 \frac{\dd M_i}{M_i} \int \!\! \dd\vtheta_{i'}\dd\chi_{i'}\cD_{i'}^2 \frac{\dd M_{i'}}{M_{i'}} \nonumber \\\label{C2-w-def} && \times \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_i'} \left[ 1+\xih_{i,i'} \right] , \end{eqnarray}$ (87) $\begin{matrix} C_{i,j}^{(3)} & = & \frac{4}{(ΔΩ) 𝒬_{i} 𝒬_{j}} \int \frac{d Ω_{i}}{(ΔΩ)} d χ_{i} 𝒟_{i}^{2} \frac{d M_{i}}{M_{i}} \int d θ_{i^{'}} d χ_{i^{'}} 𝒟_{i^{'}}^{2} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int d θ_{j^{'}} d χ_{j^{'}} 𝒟_{j^{'}}^{2} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [1 + ξ_{i, i^{'}}^{h} + ξ_{i, j^{'}}^{h} + ξ_{i^{'}, j^{'}}^{h} + {ζ_{i, i^{'}, j^{'}}^{h}}^{]}, \\ C_{i,j}^{(4)} & = & \frac{1}{𝒬_{i} 𝒬_{j}} \int \frac{d Ω_{i}}{(ΔΩ)} d χ_{i} 𝒟_{i}^{2} \frac{d M_{i}}{M_{i}} d θ_{i^{'}} d χ_{i^{'}} 𝒟_{i^{'}}^{2} \frac{d M_{i^{'}}}{M_{i^{'}}} \frac{d n}{d \ln M_{i}} \\ \times \frac{d n}{d \ln M_{i^{'}}} \int \frac{d Ω_{j}}{(ΔΩ)} d χ_{j} 𝒟_{j}^{2} \frac{d M_{j}}{M_{j}} d θ_{j^{'}} d χ_{j^{'}} 𝒟_{j^{'}}^{2} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{j}} \\ \times \frac{d n}{d \ln M_{j}^{'}} [4 ξ_{i; j}^{h} + 2 ζ_{i; j, j^{'}}^{h} + 2 ζ_{i, i^{'}; j}^{h} + 2 ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h} + {η_{i, i^{'}; j, j^{'}}^{h}}^{]} . \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{(3)} & = & \frac{4}{(\Delta\Omega)\Qw_i\Qw_j} \int \!\! \frac{\dd\vOm_i}{(\Delta\Omega)} \dd\chi_i \, \cD_i^2 \frac{\dd M_i}{M_i} \int \!\! \dd\vtheta_{i'}\dd\chi_{i'}\cD_{i'}^2 \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \times \int \dd\vtheta_{j'}\dd\chi_{j'}\cD_{j'}^2 \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\ \label{C3-w-def}&& \times \left[ 1+ \xih_{i,i'} + \xih_{i,j'} + \xih_{i',j'} + \zetah_{i,i',j'} \right] , \\ C_{i,j}^{(4)} & = & \frac{1}{\Qw_i\Qw_j} \int \!\! \frac{\dd\vOm_i}{(\Delta\Omega)} \dd\chi_i \, \cD_i^2 \frac{\dd M_i}{M_i} \dd\vtheta_{i'}\dd\chi_{i'}\cD_{i'}^2 \frac{\dd M_{i'}}{M_{i'}} \frac{\dd n}{\dd\!\ln M_i} \nonumber \\ && \times \frac{\dd n}{\dd\!\ln M_{i'}} \int \!\! \frac{\dd\vOm_j}{(\Delta\Omega)} \dd\chi_j \, \cD_j^2 \frac{\dd M_j}{M_j} \dd\vtheta_{j'}\dd\chi_{j'}\cD_{j'}^2 \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_j} \nonumber \\ && \times \frac{\dd n}{\dd\!\ln M_j'} \left[ 4 \xih_{i;j} \!+\! 2 \zetah_{i;j,j'} \!+\! 2 \zetah_{i,i';j} \!+\! 2 \xih_{i;j'} \xih_{i';j} \!+\! \etah_{i,i';j,j'} \right] . \nonumber \\ && \label{C4-w-def} \end{eqnarray}$ For the Landy & Szalay estimator (85) only a few of these terms remain, as in Eqs. (64)–(66), and we obtain $\begin{matrix} C_{i,j}^{LS (2)} & = & C_{i,j}^{(2)}, \\ C_{i,j}^{LS (3)} & = & \frac{4}{(ΔΩ) 𝒬_{i} 𝒬_{j}} \int \frac{d Ω_{i}}{(ΔΩ)} d χ_{i} 𝒟_{i}^{2} \frac{d M_{i}}{M_{i}} \int d θ_{i^{'}} d χ_{i^{'}} 𝒟_{i^{'}}^{2} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int d θ_{j^{'}} d χ_{j^{'}} 𝒟_{j^{'}}^{2} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}} \\ \times [ξ_{i^{'}, j^{'}}^{h} + {ζ_{i, i^{'}, j^{'}}^{h}}^{]}, \\ C_{i,j}^{LS (4)} & = & \frac{1}{𝒬_{i} 𝒬_{j}} \int \frac{d Ω_{i}}{(ΔΩ)} d χ_{i} 𝒟_{i}^{2} \frac{d M_{i}}{M_{i}} d θ_{i}^{'} d χ_{i}^{'} 𝒟_{i}^{' 2} \frac{d M_{i}^{'}}{M_{i}^{'}} \frac{d n}{d \ln M_{i}} \\ \times \frac{d n}{d \ln M_{i}^{'}} \int \frac{d Ω_{j}}{(ΔΩ)} d χ_{j} 𝒟_{j}^{2} \frac{d M_{j}}{M_{j}} d θ_{j}^{'} d χ_{j}^{'} 𝒟_{j}^{' 2} \frac{d M_{j}^{'}}{M_{j}^{'}} \frac{d n}{d \ln M_{j}} \\ \times \frac{d n}{d \ln M_{j}^{'}} [2 ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h} + {η_{i, i^{'}; j, j^{'}}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \begin{eqnarray} \label{C2-LS-w-def}C_{i,j}^{\rm LS (2)} &= &C_{i,j}^{(2)} , \\ C_{i,j}^{\rm LS (3)} & = & \frac{4}{(\Delta\Omega)\Qw_i\Qw_j} \int\!\! \frac{\dd\vOm_i}{(\Delta\Omega)} \dd\chi_i \, \cD_i^2 \frac{\dd M_i}{M_i} \int \!\! \dd\vtheta_{i'}\dd\chi_{i'}\cD_{i'}^2 \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \times \int \dd\vtheta_{j'}\dd\chi_{j'}\cD_{j'}^2 \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} \nonumber \\ \label{C3-LS-w-def}&& \times \left[ \xih_{i',j'} + \zetah_{i,i',j'} \right] , \\ C_{i,j}^{\rm LS (4)} & = & \frac{1}{\Qw_i\Qw_j} \int \!\! \frac{\dd\vOm_i}{(\Delta\Omega)} \dd\chi_i \, \cD_i^2 \frac{\dd M_i}{M_i} \dd\vtheta_i'\dd\chi_i'\cD_i'^2 \frac{\dd M_i'}{M_i'} \frac{\dd n}{\dd\!\ln M_i} \nonumber \\ && \times \frac{\dd n}{\dd\!\ln M_i'} \int \!\! \frac{\dd\vOm_j}{(\Delta\Omega)} \dd\chi_j \, \cD_j^2 \frac{\dd M_j}{M_j} \dd\vtheta_j'\dd\chi_j'\cD_j'^2 \frac{\dd M_j'}{M_j'} \frac{\dd n}{\dd\!\ln M_j} \nonumber \\ \label{C4-LS-w-def} && \times \frac{\dd n}{\dd\!\ln M_j'} \left[ 2 \xih_{i;j'} \xih_{i';j} + \etah_{i,i';j,j'} \right] , \end{eqnarray}$ see also Szapudi (2001), and Bernstein (1994) who considers (up to order $n^{-2}$ $Mathematical equation: \hbox{$\nb^{-2}$}$ over the inverse of the mean density) the additional terms associated with fluctuations of the denominator in the estimator (55), when the latter is normalized to the number counts in the same field.

5.2.1. Low-order terms

Keeping only the contributions that are constant or linear over the two-point halo correlation ξ^h, as in Eqs. (67) and (69), we obtain $\begin{matrix} C_{i,j} & = & δ_{i,j} \frac{2}{(ΔΩ) 𝒬_{i}} [1 + ⟨ ŵ i ⟩] + \frac{4}{(ΔΩ) N} + \frac{4}{(ΔΩ) N^{3}} \\ \times \int d χ 𝒟^{5} b^{2} n^{2} [ξ_{i^{'}}^{(θ)} + ξ_{j^{'}}^{(θ)} + ξ_{i^{'}, j^{'}}^{(θ)}] \\ + \frac{4}{N^{2}} \int d χ 𝒟^{5} b^{2} n^{2} ξ_{cyl}, \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j} & = & \delta_{i,j} \, \frac{2}{(\Delta\Omega)\Qw_i} \left[ 1+\lag\hw_i\rag\right] + \frac{4}{(\Delta\Omega)\Nb} + \frac{4}{(\Delta\Omega)\Nb^3} \nonumber \\ && \times \int\dd\chi \, \cD^5 \, \bb^2 \, \nb^2 \, \left[ \overline{\xith_{i'}} + \overline{\xith_{j'}} + \overline{\xith_{i',j'}} \right] \nonumber \\ \label{Cij-w-tot} && + \frac{4}{\Nb^2} \int\dd\chi \, \cD^5 \, \bb^2 \, \nb^2 \, \xicyl , \end{eqnarray}$ (93)and $C_{i,j}^{LS} = δ_{i,j} \frac{2 [1 + ⟨ ŵ i ⟩]}{(ΔΩ) 𝒬_{i}} + \frac{4}{(ΔΩ) N^{3}} \int d χ 𝒟^{5} b^{2} n^{2} ξ_{i^{'}, j^{'}}^{(θ)}$ $Mathematical equation: \begin{equation} C_{i,j} ^{\rm LS} = \delta_{i,j} \, \frac{2 [ 1+\lag\hw_i\rag]}{(\Delta\Omega)\Qw_i} + \frac{4}{(\Delta\Omega)\Nb^3} \int\dd\chi \, \cD^5 \, \bb^2 \, \nb^2 \, \overline{\xith_{i',j'}} \label{Cij-LS-w-tot} \end{equation}$ (94)where, in a fashion similar to Eq. (68), we introduced the average $ξ_{i^{'}, j^{'}}^{(θ)} (z) = \int_{i} \frac{d θ_{i^{'}}}{𝒜_{i}} \int_{j} \frac{d θ_{j^{'}}}{𝒜_{j}} \int \frac{d χ_{j^{'}}}{𝒟} ξ (| x i^{'} - x j^{'} |; z) .$ $Mathematical equation: \begin{equation} \overline{\xith_{i',j'}}(z) = \int_i \frac{\dd\vtheta_{i'}}{\cA_i} \int_j \frac{\dd\vtheta_{j'}}{\cA_j} \int \frac{\dd\chi_{j'}}{\cD} \; \xi(|\vx_{i'}-\vx_{j'}|;z) . \label{I2ij-def-xi} \end{equation}$ (95)The Fourier-space expression of Eq. (95) is given in Eq. (H.1).

Comparison of Peebles & Hauser and Landy & Szalay covariance matrices

Fig. 21

The covariance matrices $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ (solid line) and C_i,j (dashed line) of the estimators $ŵ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hwLS_i$}$ and ŵ_i, for i = 2 associated with the angular bin 2 < θ < 3.2 arcmin, as a function of j. We show the results obtained for halos in the redshift range 0 < z < 0.8, with an angular window of 400 deg², above the mass thresholds M_∗ = 2 × 10¹³ and 10¹⁴ h^-1 M_⊙, from bottom to top. Here we only consider the low-order contributions, given by Eqs. (93) and (94).

We compare in Fig. 21 the covariances matrices (93) and (94) as a function of j at fixed i. As in Fig. 20, we consider halos in the redshift range 0 < z < 0.8 in a survey of area 400 deg². As was the case for the 3D real-space correlation ξ shown in Fig. 13, and in agreement with previous works (Kerscher et al. 2000), the covariance is much smaller and more diagonal for the Landy & Szalay estimator (85) than for the Peebles & Hauser estimator (78). This can also be clearly seen from the comparison of the left and middle panels of Fig. 27, where we show the correlation matrices associated with Eqs. (93) and (94).

Comparison of sample-variance and shot-noise effects

Fig. 22

The contributions C⁽²⁾ and C⁽³⁾ to the covariance of the Landy & Szalay estimator, along the diagonal i = j. As in Fig. 21, we only consider the low-order terms, given by Eq. (94).

Next, we compare in Fig. 22 the contributions C⁽²⁾ (first term in Eq. (94)) and C⁽³⁾ (second term in Eq. (94)), again keeping only these low-order terms of the covariance of the Landy & Szalay estimator. As compared with C⁽³⁾, C⁽²⁾ involves an extra degree of shot noise (one more pair identification). Taking only these low-order terms into account, the covariance is dominated by C⁽²⁾ (whence shot-noise effects are dominant) below 10 arcmin for halos above M_∗ = 2 × 10¹³ h^-1 M_⊙, and below 50 arcmin for halos above M_∗ = 10¹⁴ h^-1 M_⊙. As for the 3D correlation, shot-noise effects dominate up to larger scales for more massive and rare halos, and this also implies that their covariance matrix is more strongly diagonal.

5.2.2. High-order terms

Fig. 23

Fig. 24

The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 23, but along one row. This corresponds to the fixed bin i = 2, associated with the angular bin 2 < θ < 3.2 arcmin, as a function of j.

At small angular separations, the high-order terms in Eqs. (91) and (92), associated with the product ξξ and the three- and four-point correlations ζ and η, are not negligible. As in Sect. 4.2.2 and in Bernstein (1994), to estimate these high-order correlations we use the “hierarchical clustering ansatz” shown in Figs. 1 and 2 and given by Eqs. (4)–(7). As described in Appendix H, we follow the procedure that we have already used in Appendix F to compute the high-order terms associated with the 3D correlation ξ. Then, the contribution associated with the product $ξ_{i; j^{'}}^{h} ξ_{i^{'}; j}^{h}$ $Mathematical equation: \hbox{$\xih_{i;j'} \xih_{i';j}$}$ in Eq. (92) writes as $C_{i,j}^{LS (ξξ)} = \frac{2 π^{2}}{N^{4}} \int_{0}^{2} d y y A^{(2)} (y) B_{i}^{(2)} (y θ_{s}) B_{j}^{(2)} (y θ_{s}),$ $Mathematical equation: \begin{equation} C_{i,j}^{\rm LS (\xi\xi)} = \frac{2\pi^2}{\Nb^4} \int_0^2\dd y \, y \, A^{(2)}(y) B_i^{(2)}(y\theta_{\rm s}) B_j^{(2)}(y\theta_{\rm s}) , \label{CLS-w-xixi-1} \end{equation}$ (96)the term $ζ_{i, i^{'}, j^{'}}^{h}$ $Mathematical equation: \hbox{$\zetah_{i,i',j'}$}$ of Eq. (91) yields $\begin{matrix} C_{i,j}^{LS (ζ)} & = & \frac{4}{(ΔΩ) N^{4}} \int d χ 𝒟^{8} b^{3} n^{3} \frac{S_{3}}{3} [ξ_{i^{'}}^{(θ)} \times ξ_{j^{'}}^{(θ)} \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS (\zeta)} & = & \frac{4}{(\Delta\Omega)\Nb^4} \int \dd\chi \, \cD^8 \, \bb^3 \, \nb^3 \, \frac{S_3}{3} \left[ \overline{\xith_{i'}} \times \overline{\xith_{j'}} \right. \nonumber \\[2mm] && \left. + \overline{\xith_{i',i}\xith_{i',j'}} + \overline{\xith_{j',j}\xith_{j',i'}} \right] , \label{CLS-w-zeta-1} \end{eqnarray}$ (97)and the term $η_{i, i^{'}; j, j^{'}}^{h}$ $Mathematical equation: \hbox{$\etah_{i,i';j,j'}$}$ of Eq. (92) gives $\begin{matrix} C_{i,j}^{LS (η)} & = & \frac{2}{N^{4}} \int d χ 𝒟^{11} b^{4} n^{4} \frac{S_{4}}{16} [ξ_{i^{'}}^{(θ)} \times ξ_{i; j} ξ_{i; j^{'}}^{(θ)} \\ + ξ_{j^{'}}^{(θ)} \times ξ_{i; j} ξ_{j; i^{'}}^{(θ)} + 2 ξ_{i^{'}}^{(θ)} \times ξ_{j^{'}}^{(θ)} \times ξ_{cyl} \\ + 2 ξ_{j^{'},i}^{(θ)} ξ_{i; j} ξ_{j; i^{'}}^{(θ)} + ξ_{j^{'}; i}^{(θ)} ξ_{i, i^{'}}^{(θ)} ξ_{i^{'}; j}^{(θ)} + ξ_{i^{'}; j}^{(θ)} ξ_{j, j^{'}}^{(θ)} ξ_{j^{'}; i}^{(θ)}] \end{matrix}$ $Mathematical equation: \begin{eqnarray} C_{i,j}^{\rm LS (\eta)} & = & \frac{2}{\Nb^4} \int \dd\chi \, \cD^{11} \, \bb^4 \, \nb^4 \, \frac{S_4}{16} \left[ \overline{\xith_{i'}} \times \overline{\xi_{i;j}\xith_{i;j'}} \right. \nonumber \\[2mm] && + \overline{\xith_{j'}} \times \overline{\xi_{i;j}\xith_{j;i'}} + 2 \, \overline{\xith_{i'}} \times \overline{\xith_{j'}} \times \xicyl \nonumber \\[2mm] \label{CLS-w-eta-1} && \left. + 2 \, \overline{\xith_{j',i}\xi_{i;j}\xith_{j;i'}} + \overline{\xith_{j';i}\xith_{i,i'}\xith_{i';j}} + \overline{\xith_{i';j}\xith_{j,j'}\xith_{j';i}} \right] \end{eqnarray}$ (98)where the various factors are given in Appendix H.

We compare in Figs. 23 and 24 these high-order contributions (96)–(98) with the low-order contribution (94), for the covariance matrix of the Landy & Szalay estimator. We recover the qualitative behavior encountered in Figs. 15 and 16 for the estimator of the 3D correlation function. The “ζ” and “η” terms (97) and (98) show a strong dependence on the smoothing scales, while the “ξξ” term (96) shows a very weak dependence. Again, this is because the contribution (96) involves the product of two correlations between two distinct lines of sight, so that each ξ is averaged over the angular window θ_s of the survey, as seen in Eq. (92), whereas the contribution (97) involves the product of two correlations between a central point and two points at angular distances θ_i and θ_j, as seen in Eq. (91).

For the case of massive halos, M > 10¹⁴ h^-1 M_⊙, the high-order terms are negligible along the diagonal, which is dominated by the shot-noise term, and only give a modest contribution to off-diagonal entries. Then, the covariance matrix remains strongly diagonal (for the angular bins studied here).

For the case of low-mass halos, M > 2 × 10¹³ h^-1 M_⊙, the high-order contribution (96) to the diagonal is no longer negligible for θ > 5 arcmin, while the two other contributions (97) and (98), which involve the three- and four-point correlation functions, are always subdominant on these scales. This is a convenient property since the modelization of high-order many-body correlations is increasingly difficult. However, this was not the case for the 3D correlation ξ, as seen in Figs. 15 and 16, except on large scales. For off-diagonal entries, the high-order contribution (96) can become dominant for widely separated angular scales, while on small scales, θ ~ 1 arcmin, all contributions are of the same order of magnitude.

5.2.3. Comparison with numerical simulations

Fig. 25

The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ along its diagonal. We show our analytical results including all contributions (solid lines) or only low-order terms (dotted lines), and results from numerical simulations (dashed lines).

Fig. 26

The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 25, but along one row. This corresponds to the fixed bin i = 4, associated with the angular bin 5 < θ < 8 arcmin, as a function of j.

As for the 3D correlation, we show the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , along its diagonal and along one row, in Figs. 25 and 26. Again we obtain a reasonable agreement with the numerical simulations. For moderate-mass halos, the high-order contributions are again necessary to obtain a good match on large scales for diagonal entries and on most scales for off-diagonal entries. The off-diagonal terms of the covariance matrix obtained from the numerical simulations are rather noisy, and our analytical results are competitive in obtaining reliable estimates.

5.2.4. Correlation matrices

Fig. 27

Contour plots for the correlation matrix ℛ_i,j, defined as in Eq. (46) but for the full covariance matrix C_ij of the halo angular correlation. There are eight angular bins, over 1.25 < r < 50 arcmin, equally spaced in log (θ), as in previous figures. We consider halos in the redshift range 0 < z < 0.8, with an angular window of 400 deg², above the mass thresholds M > 2 × 10¹³ h^-1 M_⊙ in the upper row, and M > 10¹⁴ h^-1 M_⊙ in the lower row. Left panels: low-order contributions (93) for the Peebles & Hauser estimator. Middle panels: low-order contributions (94) for the Landy & Szalay estimator. Right panels: full correlation matrix, including the high-order contributions of Eqs. (96)–(98), for the Landy & Szalay estimator.

We show in Fig. 27 the correlation matrices ℛ_i,j, defined as in Eq. (46), but for the full covariance matrices C_i,j of the estimators of the halo angular correlation. As for the 3D correlation, we can check that, keeping only low-order terms, the correlation matrix of the Landy & Szalay estimator (85) is much more diagonal than for the Peebles & Hauser estimator (78). Taking high-order contributions into account makes the matrix slightly less diagonal, but it still remains significantly diagonal, in agreement with Fig. 24. As in the 3D case, the correlation matrix is much more diagonal for massive halos, where shot-noise effects are more important. The full angular correlation matrix is more diagonal than its 3D counterpart shown in the right hand panels of Fig. 19, and the correlations between small and large angular scales are not as strong as the correlations between the small and large radii found in Fig. 19. In particular, off-diagonal entries and high-order contributions play a less important role (although for low-mass halos it is still useful to take them into account).

6. Applications to real survey cases

In this section, we compare the statistical significance of the number counts and of the 3D correlation function for future large cosmological cluster surveys (we give in Appendix J the selection functions that we use for some of these surveys). Here we must note that, while redshift-space distortions only have a low impact on angular number densities (number counts and angular correlations) for wide redshift bins, they can more strongly affect 3D clustering. In principle, redshift distortions could be corrected to recover a real-space map if the velocity field is known, and when also applying a finger-of-god compression, but this would require a rather complete spectroscopic follow up, so it is not very practical. Therefore, observations instead provide redshift-space 3D correlations. Then, the results discussed in this section for 3D correlations should be seen as a first step toward more accurate computations.

Nevertheless, a simple estimate shows that these redshift-space distortions should not strongly affect our results. Indeed, we find in the numerical simulations that at z = 0.5 for instance clusters have peculiar velocities v on the order of 300 km s^-1 along each axis. The redshift-space coordinate s_∥ along the line of sight is given by s_∥ = x_∥ + v_∥/(aH). This yields a typical error Δx_∥ for the cluster comoving coordinate on the order of 3.6 h^-1 Mpc. This is not much larger than the typical size of the clusters, which ranges from 1 to 2 h^-1 Mpc. Then, for distance bins that are larger than 20 h^-1 Mpc we can expect redshift distortions to affect our results on the covariance matrices by about 20%. The net effect should actually be smaller because the 3D estimators also include information on clustering along the transverse directions, which are not contaminated by the cluster peculiar velocities. We leave an explicit computation of these redshift-space distortions to future works.

6.1. Surveys of limited areas

Fig. 28

The mean angular number densities of X-ray clusters per square degree, within redshift bins of width Δz = 0.1, for the XXL, DES, and SPT surveys. Error bars contain both the shot-noise and sample-variance contributions, from Eqs. (22) and (27). For DES we consider the mass thresholds M > 5 × 10¹³ h^-1 M_⊙ and M > 5 × 10¹⁴ h^-1 M_⊙ (smaller error bars), and for SPT the mass threshold M > 5 × 10¹⁴ h^-1M_⊙ (larger error bars shifted to the right).

We first consider several surveys of clusters of galaxies on limited angular windows.

The XXL survey (Pierre et al. 2011) is an XMM Very Large Programme specifically designed to constrain the equation of state of the dark energy by using clusters of galaxies. It consists of two 5 × 5 deg² areas and probes massive clusters out to a redshift of ~2. The well-characterized cluster selection function relies on the fact that clusters of galaxies are the only extended extragalactic sources, so that the selection operates in a two-dimensional parameter space (equivalent to flux and spatial extent), allowing for different degrees of contamination by misclassified point sources. We show the mass detection probabilities as a function of redshift in the left hand panel of Fig. J.1, for the C1 selection. The space density of this population is ~6 deg^-2. This complex selection function F(M,z), which differs from a simple mass or X-ray flux threshold (see also Pacaud et al. 2006, 2007), is readily included in our formalism through a redefinition of the halo mass function, n(M,z) → F(M,z)n(M,z).
The Dark Energy Survey (DES) is an optical imaging survey to cover 5000 deg² with the Blanco 4-meter telescope at the Cerro Tololo Inter-American Observatory⁶. We consider the expected mass threshold M > 5 × 10¹³ h^-1 M_⊙, as well as the subset of massive clusters M > 5 × 10¹⁴ h^-1 M_⊙, since a binning over mass should help in deriving tighter constraints on cosmology.
The South Pole Telescope (SPT) operates at millimeter wavelengths⁷. It will cover some 2500 deg² at three frequencies, aiming at detecting clusters of galaxies from the Sunyaev-Zel’dovich (S-Z) effect. A preliminary survey of 178 deg² at 150 GHz reveals some 20 clusters down to a depth of 18 μK. Extensive simulations allow the determination of the mass completeness level, above a given significance for these secondary CMB anisotropies (Vanderlinde et al. 2010). This gives a mass threshold on the order of 5 × 10¹⁴ h^-1 M_⊙.

Fig. 29

The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ from Eq. (58), over ten comoving distance bins within 5 < r < 100 h^-1 Mpc, equally spaced in log (r). We integrate over halos within the redshift interval 0 < z < 1, for the XXL, DES, and SPT surveys, as in Fig. 28 (again the error bars for SPT are slightly larger and shifted to the right with respect to those of DES, for M > 5 × 10¹⁴ h^-1 M_⊙). The error bars show the diagonal part of the covariance, $\sqrt{C_{i,i}^{LS}}$ $Mathematical equation: \hbox{$\sqrt{C_{i,i}^{\rm LS}}$}$ , for the Landy & Szalay estimator, from Eqs. (69) and (75)–(77).

Fig. 30

The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , for the clusters detected by DES over the redshift interval 1 < z < 2. Here we consider 20 distance bins within 5 < r < 100 h^-1 Mpc, equally spaced in log (r) (i.e. twice as many as in Fig. 29).

We show in Figs. 28–30 the angular number densities and 3D correlations expected for these various surveys. The error bars include all shot-noise and sample-variance contributions (including high-order terms). For the higher redshift interval, 1 < z < 2, we only show the correlation of clusters above 5 × 10¹³ h^-1 M_⊙ for DES, because in other cases the error bars are too large to allow an accurate measure. On the other hand, to take advantage of the good expected accuracy of this case we consider in Fig. 30 distance bins that are half the size of those of Fig. 29.

As expected, the DES provides the best measures of cluster number counts and correlations, hence the tightest constraints on cosmology, thanks to its wide size, which provides a large number of objects. However, the much smaller XXL survey already provides a meaningful measure of both the abundance and the correlation of clusters, and appears to be a promising tool. The SPT survey allows a useful measure of the number counts as a function of redshift, but its rather high mass threshold leads to a relatively small number of objects, hence large error bars for the 3D correlations, even though a positive signal should still be within reach. Assuming its expected mass threshold of M > 5 × 10¹³ h^-1 M_⊙ remains valid over 1 < z < 2, the DES is the only survey among these three that allows an accurate measure of the cluster correlation at high redshift, which should help to further constrain the cosmology.

6.2. All-sky surveys

Following Planck, space missions will map the entire sky in the X-ray (EROSITA) and optical (EUCLID) wavebands at unprecedented depth and angular resolution. Corresponding selection functions are still at the tentative or predictive level. It is nevertheless instructive to compare estimates of the statistical significance of the all-sky cluster catalogs expected from these forthcoming surveys⁸. In practice, the total angular area of such surveys is not really 4π sterad since we must remove the galactic plane. In the following, for Planck we consider the two-sided cone of angle θ_s = 75 deg (i.e., |b| > 15 deg), which yields a total area ΔΩ ≃ 30576 deg². For Erosita and Euclid we take θ_s = 59 deg (i.e., |b| > 31 deg), which corresponds to a total area that is about one-half of the full sky, ΔΩ ≃ 20 000 deg².

Planck operates at nine frequencies, enabling an efficientdetection of the cluster S-Z signature but has a rather large PSF(5′–10′). Some 1625 massive clusters out to z = 1 are expected over the whole sky. We assume the selection function by Melin et al. (2006), shown in middle panel of Fig. J.1.
For Erosita, a simple flux limit is currently assumed as an average over the whole sky: 4 × 10^-14 erg s^-1 cm^-2 in the [0.5 − 2] keV band (Predehl et al. 2009). The associated selection function is shown in the right hand panel of Fig. J.1. This would yield 71,907 clusters out to z = 1.
For Euclid, we follow the prescription of the Euclid Science Book for the cluster optical selection function and adopt a fixed mass threshold of 5 × 10¹³ h^-1 M_⊙ (Refregier et al. 2010).

We show in Fig. 31 the angular number densities per redshift bin. The error bars contain the shot-noise contribution (22), as well as the sample-variance contribution (41) that holds for any angular window and does not rely on the flat-sky and Limber’s approximations⁹. The 3D correlation functions are shown in Figs. 32 and 33.

Fig. 31

The mean angular number densities of clusters within redshift bins of width Δz = 0.1. From top to bottom, we show a) halos above 5 × 10¹³ h^-1 M_⊙ in Euclid, b) halos detected by Erosita with the selection function of the right panel in Fig. J.1, c), halos above 5 × 10¹⁴ h^-1 M_⊙ in either Erosita or Euclid, and d) halos detected by Planck with the selection function of the middle panel in Fig. J.1.

Fig. 32

The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , integrated over 0 < z < 1, as in Fig. 29. From top to bottom, we show a) halos above 5 × 10¹⁴ h^-1 M_⊙ in either Erosita or Euclid, b) halos detected by Planck with the selection function of the middle panel in Fig. J.1, c) halos above 5 × 10¹³ h^-1 M_⊙ in Euclid, and d) halos detected by Erosita with the selection function of the right panel in Fig. J.1.

Fig. 33

The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , over the redshift interval 1 < z < 2, for the clusters detected by Erosita (upper curve, with ten distance bins) and Euclid (lower curve, with twenty distance bins).

As compared with the smaller surveys of Sect. 6.1, these (almost) full-sky surveys provide much more accurate measures of the evolution with redshift of cluster abundance, and of two-point correlation functions, thanks to the greater number of objects. In particular, thanks to its lower mass threshold, Euclid can probe higher redshifts, both for number counts and correlation functions. Although we have only considered two redshift bins for the two-point correlation function, 0 < z < 1 and 1 < z < 2, Figs. 32 and 33 suggest that for Euclid it should be possible to introduce a smaller redshift binning, such as Δz = 0.5. We leave it to future works to estimate which redshift binning is the most efficient at constraining cosmology.

6.3. Shot noise versus sample variance

Fig. 34

The ratio $σ_{N_{i}}^{(s . n .)} / σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}/\sigma_{N_i}^{(s.v.)}$}$ of the rms shot-noise contribution $σ_{N_{i}}^{(s . n .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}$}$ to the rms sample-variance contribution $σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.v.)}$}$ , of the covariance of the angular number densities N_i obtained for various surveys. (For DES and Euclid we only consider the case M > 5 × 10¹³ h^-1 M_⊙.)

Fig. 35

The ratio $σ_{ξ_{i}}^{(2)} / σ_{ξ_{i}}^{(3 + 4)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}/\sigma_{\xi_i}^{(3+4)}$}$ of the rms contributions $\sqrt{C^{(2)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(2)}}$}$ and $\sqrt{C^{(3)} + C^{(4)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(3)}+C^{(4)}}$}$ of the covariance matrix of the estimator $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ . This is a measure of shot-noise effects. (For DES and Euclid we only consider the case M > 5 × 10¹³ h^-1 M_⊙.)

We show in Fig. 34 the ratio of the shot-noise to sample-variance contributions to the covariance of number counts, where the rms contributions $σ_{N_{i}}^{(s . n .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}$}$ and $σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.v.)}$}$ are defined by Eq. (29). As expected, shot noise becomes increasingly dominant at higher redshift, as the number of clusters decreases, and it is smaller for Euclid which has a wider sky coverage and a lower mass threshold.

We show in Fig. 35 the ratio $σ_{ξ_{i}}^{(2)} / σ_{ξ_{i}}^{(3 + 4)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}/\sigma_{\xi_i}^{(3+4)}$}$ of the contribution (64) to the sum of contributions (65) and (66), to the rms error $σ_{i} = \sqrt{C_{i,i}}$ $Mathematical equation: \hbox{$\sigma_i=\sqrt{C_{i,i}}$}$ . In contrast to Fig. 14 we include the high-order terms of C⁽³⁾ and C⁽⁴⁾, but the ratio $σ_{ξ_{i}}^{(2)} / σ_{ξ_{i}}^{(3 + 4)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}/\sigma_{\xi_i}^{(3+4)}$}$ is again a measure of shot-noise effects. As expected, we can see that the contribution C⁽²⁾ becomes increasingly dominant for smaller radial bins since they contain fewer clusters. We can see that the ordering between the various surveys is not the same as the one obtained in Fig. 34 for the number counts. This is because the mass thresholds are not the same (and couplings between shot-noise and sample-variance effects in the covariance matrix of halo correlations make the analysis less direct). Since a higher mass means both a larger correlation function and larger discreteness effects (because halos are rarer), it is not always obvious a priori how the relative importance of shot-noise effects changes from one configuration to another.

Here we must recall that Fig. 35 only shows the diagonal part of the covariance matrix C_i,j and that off-diagonal terms can be non-negligible, see Sect. 4.

6.4. High-order and low-order contributions to the sample variance of $Mathematical equation: \hbox{$\hxi$}$

Fig. 36

The ratio $σ_{ξ_{i}}^{(ξξ + ζ + η)} / σ_{ξ_{i}}^{(ξ)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(\xi\xi+\zeta+\eta)}/\sigma_{\xi_i}^{(\xi)}$}$ of the rms high-order contribution (75)–(77) to the rms low-order contribution (second term in Eq. (69)) of the sample variance of the correlation ξ_i obtained for various surveys. (For DES and Euclid we only consider the case M > 5 × 10¹³ h^-1 M_⊙.)

We show in Fig. 36 the ratio of the high-order contributions to the low-order contribution of the sample variance of the 3D correlation ξ_i. We consider the Landy & Szalay estimator and we define along the diagonal the rms contributions as $σ_{ξ_{i}}^{(ξξ + ζ + η)} = \sqrt{C_{i,i}^{LS (ξξ)} + C_{i,i}^{LS (ζ)} + C_{i,i}^{LS (η)}}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(\xi\xi+\zeta+\eta)}=\sqrt{C_{i,i}^{\rm LS (\xi\xi)} +C_{i,i}^{\rm LS (\zeta)}+C_{i,i}^{\rm LS (\eta)}}$}$ , from Eqs. (75)–(77) for the high-order term, and $σ_{ξ_{i}}^{(ξ)} = \sqrt{C_{i,i}^{LS (ξ)}}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(\xi)}=\sqrt{C_{i,i}^{\rm LS (\xi)}}$}$ for the low-order term, where $C_{i,i}^{LS (ξ)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\xi)}$}$ is given by the second term in Eq. (69). We do not consider here the shot-noise contribution $σ_{ξ_{i}}^{(2)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}$}$ associated with the first term in Eq. (69), which was studied in Fig. 35; however, while $C_{i,i}^{LS (ξξ)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\xi\xi)}$}$ and $C_{i,i}^{LS (η)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\eta)}$}$ are pure sample-variance contributions, $C_{i,i}^{LS (ζ)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\zeta)}$}$ and $C_{i,i}^{LS (ξ)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\xi)}$}$ are mixed shot-noise and sample-variance contributions. Indeed, they arise from both the discreteness of the halo population (as shown by the power $N^{3}$ $Mathematical equation: \hbox{$\Nb^3$}$ instead of $N^{4}$ $Mathematical equation: \hbox{$\Nb^4$}$ , which comes from the identification of two objects as explained in Eq. (D.2)) and its large-scale correlations (as shown by the bias factors $b^{3}$ $Mathematical equation: \hbox{$\bb^3$}$ and $b^{2}$ $Mathematical equation: \hbox{$\bb^2$}$ ).

As could be expected, we can see in Fig. 36 that the relative importance of high-order terms increases on smaller scales, deeper in the nonlinear regime where correlations are stronger. However, in some cases there is a flattening on larger scales because the relative importance of high-order terms no longer decreases (and could even increase in the case of low-mass halos as seen in Fig. 15). This is because the low-order contribution $C_{i,i}^{LS (ξ)}$ $Mathematical equation: \hbox{$C_{i,i}^{\rm LS (\xi)}$}$ is actually a mixed “shot-noise and sample-variance” contribution, as noticed above, and shot-noise effects decrease on larger radii (because of the greater volume), as seen in Fig. 35. In agreement with this explanation, we can see that this upturn appears earlier and is greater for the surveys where shot-noise effects are less, that is, Erosita, Euclid, and DES.

More generally, Fig. 36 shows that high-order contributions to the sample-variance or mixed terms are not negligible (but on small scales along the diagonal the covariance matrix is often dominated by the pure shot-noise contribution). For the variety of cases studied in Fig. 36 they do not grow above five times the low-order contribution along the diagonal, but as shown in Figs. 16 and 18 their importance can be greater far from the diagonal. Then, these contributions should be taken into account if one requires accurate or safe estimates of signal-to-noise ratios.

6.5. Dependence of the results on cosmology

We investigate in Appendix K the sensitivity of our results to the value of the cosmological parameters, by comparing the curves obtained in the previous sections with those that are obtained when we change either h, Ω_m, or σ₈ by an amount that corresponds to the current “2 − σ” uncertainty (Komatsu et al. 2011). We find that the main features shown in Figs. 34–36 remain valid, with modest quantitative changes (e.g., shot-noise effects become slightly less important, with respect to sample-variance contributions, when σ₈ is slightly increased). Therefore, our results and conclusions are not sensitive to the precise value of the cosmological parameters (within their current range of uncertainty).

7. Conclusion

In this paper we have presented a general formalism for obtaining analytical estimates of the means and covariance matrices of number counts and correlation functions, for distributions of cosmological objects such as clusters of galaxies or galaxies. To do so, we assumed that the two-point correlation function of these objects can be factored in terms of a linear bias model, and this simplifies expressions as spatial and mass (or luminosity, temperature, etc.) integrals factor. To estimate the high-order contributions to the covariance of two-point estimators, we also assumed that the three- and four-point correlations can be described by a hierarchical ansatz, that is, that they can be written as products of the two-point functions. This is the simplest model that agrees reasonably well with realistic distributions (of the dark matter density field, as well as of cosmological objects such as galaxies or clusters that follow the dark matter density on large scales). Although this is only an approximate model and it is known that actual cosmological fields do not exactly obey such a hierarchical clustering, this allows us to derive explicit expressions that provide a reasonably good description of covariance matrices.

The main differences or improvements with respect to previous studies are the following.

Keeping the application to cluster surveys in mind, rather thanthe galaxy surveys that have been the aim of most works, weconsidered two-point estimators that involve integrations overbroad redshift bins. Thus, we do not work with local 3Dcorrelations within an homogeneous and isotropic box at a givenredshift, but with averages over a redshift interval with explicitintegration along part of the observational cone, where the radialdirection plays a specific role.
We took all shot-noise and sample-variance contributions into account, along with high-order contributions, which in the present case of one-point and two-point estimators involve products of two two-point correlation functions and the three- and four-point correlations.
Within the framework of the simple hierarchical model recalled above, we gave explicit expressions for all contributions to these means and covariance matrices. They can be readily used for any population of objects and any set of cosmological parameters, provided one is able to compute the mass function (or the luminosity/temperature function), the two-point correlation and three- and four-point normalization parameters. In practice, assuming a linear scale-independent bias model (or a uniform scale-dependence that can be absorbed into the two-point correlation), it is sufficient to give a bias b(M,z) in addition to the mass function.

We first studied the number counts per redshift bins, comparing the relative importance of shot-noise and sample variance contributions and giving scaling laws obeyed by the signal-to-noise ratios, as a function of the survey area and the number of fields. We have explicitly considered the case of large angular windows, and estimated the angular scale where the flat-sky and Limber’s approximations break down, which occurs at about 10 deg. We also computed the decay of correlations between distant redshift bins. In particular, we checked that a redshift binning of width Δz = 0.1 is broad enough to neglect cross-correlations between different bins.

Next, we studied estimators of the 3D correlation function, averaged over finite redshift intervals. We compared the Peebles & Hauser estimator with the usual Landy & Szalay estimator, and we evaluated the relative importance of shot-noise and sample-variance, low-order and high-order, contributions to the covariance matrix. We also considered the behavior of the off-diagonal terms, and described how high-order contributions make the covariance matrices less diagonal as correlations develop between different scales (especially as one of the scales becomes smaller and more nonlinear). Then, we performed the same analysis for the 2D angular correlation function.

Throughout we compared our analytical expressions with results from numerical simulations and we obtain a reasonably good match. This makes such analytical results more competitive than simulations, because they are much faster to compute and allow one to describe rare objects that would have low-quality statistics in the simulations. Finally, we applied our formalism to several future cluster surveys, and considered both limited-area and full-sky missions.

We hope our results can help for estimating the signal-to-noise ratio of current and future surveys. This is useful for comparing the efficiency of different probes and different survey configurations, such as the choice of redshift binning, survey area, or number of subfields.

Our study should be extended in several directions. First, it would be interesting to consider the noise associated with photometric redshifts. Second, one should include the effect of redshift-space distortions, which are likely to be important on small scales. Third, the computation of the means and covariance matrices studied in this paper is only an intermediate tool for comparing theoretical predictions with observations, and the final goal is to derive constraints on cosmological parameters or astrophysical processes (e.g., scaling laws for cluster mass-luminosity-temperature relationships). Our results may be used to further investigate the cosmological information that can be extracted from cluster surveys and to optimize observing configurations so as to improve those constraints. We leave these tasks to future works.

Online material

Appendix A: Mean and covariance of number counts

To avoid introducing numerous Dirac factors, owing to the discreteness of the observed distribution of objects, we follow the simple approach described in Sect. 36 of Peebles (1980) to compute the statistical properties of counts in cells. We illustrate in this section this method for the computation of the mean and covariance of number counts within redshift bins.

We divide the “volume” of the space (z,Ω,lnM), which enters the expression (17) of the angular number density of observed objects in redshift bin i, over $Mathematical equation: \hbox{$\cN$}$ small (infinitesimal) cells labeled by the index α, so that Eq. (17) reads as $N̂ i = \frac{1}{(ΔΩ)} \sum_{α_{i}} n̂ α_{i},$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{equation} \hN_i = \frac{1}{(\Delta\Omega)} \sum_{\alpha_i} \hn_{\alpha_i} , \label{hNi} \end{equation}$ (A.1)where subscript i refers to the redshift bin i. Then, since the cell α_i is infinitesimally small it contains at most one object, whence (Peebles 1980) $n̂ α_{i} = 0 or 1, and n̂ \begin{matrix} 2 \\ α_{i} \end{matrix} = n̂ α_{i} .$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{equation} \hn_{\alpha_i} =0 \;\; \mbox{or} \;\; 1 , \;\; \mbox{and} \;\; \hn_{\alpha_i}^2 = \hn_{\alpha_i}{\rm .} \label{hn-alpha} \end{equation}$ (A.2)Moreover, by definition its average is given by $⟨ n̂ α_{i} ⟩ = d z \frac{d χ}{d z} 𝒟^{2} d Ω \frac{d M}{M} \frac{d n}{d \ln M} \cdot$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{equation} \left\lag \hn_{\alpha_i} \right\rag = \dd z \frac{\dd\chi}{\dd z} \, \cD^2 \dd\vOm \frac{\dd M}{M} \, \frac{\dd n}{\dd\!\ln M} \cdot \label{hn-mean} \end{equation}$ (A.3)Of course, we recover for the mean number of objects in the redshift bin i the expression (19), which could also be read from Eq. (17) using the average $⟨ \frac{d n̂}{d \ln M} ⟩ = \frac{d n}{d \ln M} \cdot$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{equation} \left\lag \frac{\dd\hn}{\dd\!\ln M} \right\rag = \frac{\dd n}{\dd\!\ln M} \cdot \label{hn-n} \end{equation}$ (A.4)We now consider the covariance of the angular number densities $Mathematical equation: \hbox{$\hN_i$}$ . From Eq. (A.1) we have $\begin{matrix} (ΔΩ)^{2} ⟨ N̂ i N̂ j ⟩ & = & ⟨ (\sum_{α_{i}} n̂ α_{i}) (\sum_{α_{j}} n̂ α_{j}) ⟩ \\ = & δ_{i,j} \sum_{α_{i}} ⟨ n̂ \begin{matrix} 2 \\ α_{i} \end{matrix} ⟩ + \sum_{α_{i} \neq α_{j}} ⟨ n̂ α_{i} n̂ α_{j} ⟩ \\ = & δ_{i,j} \sum_{α_{i}} ⟨ n̂ α_{i} ⟩ + \sum_{α_{i} \neq α_{j}} ⟨ n̂ α_{i} n̂ α_{j} ⟩ . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{eqnarray} (\Delta\Omega)^2\lag \hN_i \hN_j \rag & = & \left\lag \biggl ( \sum_{\alpha_i} \hn_{\alpha_i} \biggl ) \biggl ( \sum_{\alpha_j} \hn_{\alpha_j} \biggl ) \right\rag \\ & = & \delta_{i,j} \sum_{\alpha_i} \lag \hn_{\alpha_i}^2 \rag + \sum_{\alpha_i \neq \alpha_j} \lag \hn_{\alpha_i} \hn_{\alpha_j} \rag \\ \label{Ni-Nj-1} & = & \delta_{i,j} \sum_{\alpha_i} \lag \hn_{\alpha_i} \rag + \sum_{\alpha_i \neq \alpha_j} \lag \hn_{\alpha_i} \hn_{\alpha_j} \rag . \end{eqnarray}$ In the second line we used the fact that the redshift bins do not overlap, so that for two “volumes” α_i and α_j to coincide, bins i and j must be the same (and δ_i,j is the Kronecker symbol), while in the third line we used Eq. (A.2). The first term in Eq. (A.7) corresponds to the shot noise, due to the discreteness of the object distribution. The second term includes the nonzero-distance correlation between objects, and reads as (for α_i ≠ α_J) $⟨ n̂ α_{i} n̂ α_{j} ⟩ = ⟨ n̂ α_{i} ⟩ ⟨ n̂ α_{j} ⟩ [1 + {ξ_{α_{i}, α_{j}}^{h}}^{]},$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{equation} \lag \hn_{\alpha_i} \hn_{\alpha_j} \rag = \lag \hn_{\alpha_i} \rag \lag \hn_{\alpha_j} \rag \left[ 1 + \xih_{\alpha_i,\alpha_j} \right] , \label{xi-ij} \end{equation}$ (A.8)where $ξ_{α_{i}, α_{j}}^{h}$ $Mathematical equation: \hbox{$\xih_{\alpha_i,\alpha_j}$}$ is the “halo” two-point correlation function between “volumes” α_i and α_j, see Peebles (1980). This yields $\begin{matrix} ⟨ N̂ i N̂ j ⟩ & = & δ_{i,j} \frac{⟨ N̂ i ⟩}{(ΔΩ)} + \int_{i} d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \frac{d n}{d \ln M_{i}} \\ \times \int_{j} d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \frac{d n}{d \ln M_{j}} [1 + {ξ_{i,j}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{eqnarray} \lag \hN_i \hN_j \rag & = & \delta_{i,j} \frac{\lag \hN_i \rag}{(\Delta\Omega)} + \int_i\dd \chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \, \frac{\dd n}{\dd\!\ln M_i} \nonumber \\ \label{Ni-Nj-2} && \times \int_j\dd \chi_j \, \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \, \frac{\dd n}{\dd\!\ln M_j} \, \left[ 1 + \xih_{i,j} \right] , \end{eqnarray}$ (A.9)using obvious notations where we label the quantities associated with $Mathematical equation: \hbox{$\hN_i$}$ and $Mathematical equation: \hbox{$\hN_j$}$ by the subscripts i and j and we integrate over the bins i and j. This could also be directly obtained from Eq. (17) by writing $\begin{matrix} ⟨ \frac{d n̂}{d \ln M_{i}} \frac{d n̂}{d \ln M_{j}} ⟩ & = & \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{j}} [1 + {ξ_{i,j}^{h}}^{]} \\ + \frac{M_{j}}{𝒟_{j}^{2}} δ_{D} (χ_{j} - χ_{i}) δ_{D} (Ω_{j} - Ω_{i}) δ_{D} (M_{j} - M_{i}) \frac{d n}{d \ln M_{i}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{1} \begin{eqnarray} \left\lag \frac{\dd \hn}{\dd\!\ln M_i} \frac{\dd \hn}{\dd\!\ln M_j}\right \rag & = & \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_j} \left[ 1 + \xih_{i,j} \right] \nonumber \\ \label{hni-hnj} && \hspace{-2cm} + \frac{M_j}{\cD_j^2} \, \delta_{\rm D}(\chi_j\!-\!\chi_i) \delta_{\rm D}(\vOm_j\!-\!\vOm_i) \delta_{\rm D}(M_j\!-\!M_i) \frac{\dd n}{\dd\!\ln M_i} , \end{eqnarray}$ (A.10)where the second term with the Dirac factors gives the shot-noise contribution.

In this derivation we have assumed in Eq. (A.2) that space can be divided into infinitesimal volumes that contain either zero or one object and that each object only appears in one cell. Even though clusters and dark matter halos are actually extended objects, it is still possible to define a point distribution by associating a single point to each cluster or halo, for instance the halo mass center. Thus, this approach, which follows Peebles (1980), applies to these cases as well and to any distribution of discrete objects, as long as we restrict ourselves to count distributions and do not study the internal structure of these objects.

Appendix B: Finite-size effects

Fig. B.1

Geometrical illustration of finite-size effects. Close to the survey boundary, part of the sphere of radius r extends beyond the observational cone and should not be counted. The left plot is a transverse view, orthogonal to the central line of sight, whereas the right plot is a view from a point far away on the line of sight.

As noticed in Sect. 4, in our computations of the mean and covariance of the estimators $Mathematical equation: \hbox{$\hxi$}$ and $Mathematical equation: \hbox{$\hxiLS$}$ we neglect finite-size effects. Indeed, we do not take the fact into account that when a point i gets close to the survey boundaries the available space for points i′ located in the distance bin [R_i, −,R_i, +] , with respect to point i, is only a fraction of this spherical shell since a part of it extends beyond the observational cone. This means that we overestimate the total number of pairs. This has no impact on the mean, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , since this effect cancels out between the numerator and denominator in (48), but it means that we slightly overestimate the signal-to-noise ratio.

To estimate the magnitude of this error, we compute the geometrical factor illustrated in Fig. B.1. Approximating the observational cone as a cylinder of radius $Mathematical equation: \hbox{$R_{\rm s}=\cD\theta_{\rm s}$}$ , a point i at distance ℓ from the central line of sight is the center of a spherical shell of radius r, onto which we count all neighbors i′ to estimate the correlation ξ at this distance r. We denote F(ℓ) as the fraction of this sphere that is enclosed within the observational cone. In our computations elsewhere we used the approximation F = 1, but for R_s − r < ℓ < R_s we actually have F < 1. As in the transverse view shown in the left hand plot of Fig. B.1, the angle θ_ℓ associated with the farthest point of intersection between the cylinder and the sphere satisfies ℓ + rsinθ_ℓ = R, whence $R_{s} - r < ℓ < R_{s}, 0 < θ_{ℓ} < \frac{π}{2} : \sin θ_{ℓ} = \frac{R_{s} - ℓ}{r} \cdot$ $Mathematical equation: \appendix \setcounter{section}{2} \begin{equation} R_{\rm s}\!-\!r\!<\!\ell\!<\!R_{\rm s} , \;\; 0 \!<\! \theta_{\ell} \!<\! \frac{\pi}{2} : \;\;\; \sin\theta_{\ell} = \frac{R_{\rm s}-\ell}{r} \cdot \label{theta-l} \end{equation}$ (B.1)Next, in the plane of each vertical section (i.e., at fixed θ), shown in the right hand plot of Fig. B.1 that corresponds to a projection along the line of sight, the cylinder appears as a circle of radius R_s, whereas the section of the sphere of center i appears as a circle of radius rsin(θ). Both circles intersect (again for R_s − r < ℓ < R_s) at the symmetric polar angles ϕ_±, with $R_{s}^{2} = ℓ^{2} + r^{2} \sin^{2} θ + 2 ℓr \sin θ \sin ϕ_{\pm} .$ $Mathematical equation: \appendix \setcounter{section}{2} \begin{equation} R_{\rm s}^2 = \ell^2 + r^2 \sin^2\theta + 2 \ell r \sin\theta \sin\varphi_{\pm} . \end{equation}$ (B.2)Then, the surface of the sphere that extends outside of the observational cylinder writes as $S^{out} = 4 r^{2} \int_{θ_{ℓ}}^{π / 2} d θ \sin θ \int_{ϕ_{-}}^{π / 2} d ϕ .$ $Mathematical equation: \appendix \setcounter{section}{2} \begin{equation} S^{\rm out} = 4 r^2 \int_{\theta_{\ell}}^{\pi/2} \dd\theta \, \sin\theta \int_{\varphi_-}^{\pi/2} \dd\varphi . \label{S-out} \end{equation}$ (B.3)Thus, for R_s − r < ℓ < R_s the fraction of the sphere that is enclosed within the observational cylinder reads as $\begin{matrix} F (ℓ) & = & 1 - \frac{\cos θ_{ℓ}}{2} + \int_{0}^{\cos θ_{ℓ}} \frac{d x}{π} Arcsin (\frac{R_{s}^{2} - ℓ^{2} - r^{2} (1 - x^{2})}{2 ℓr \sqrt{1 - x^{2}}}) \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{2} \begin{eqnarray} F(\ell) & \!\! = \! & 1 - \frac{\cos\theta_{\ell}}{2} + \int_0^{\cos\theta_{\ell}} \frac{\dd x}{\pi} \, {\rm Arcsin}\! \left( \! \frac{R_{\rm s}^2\!-\!\ell^2\!-\!r^2(1\!-\!x^2)} {2\ell r\sqrt{1-x^2}} \! \right) \nonumber \\ && \label{Fl-1} \end{eqnarray}$ (B.4)whereas F(ℓ) = 1 for 0 < ℓ < R_s − r. Then, integrating the position of the central point i over the cylinder, the fraction of volume for pairs at distance r, with respect to the approximation F = 1, writes as $\begin{matrix} \frac{N^{'}}{N} & = & \int \begin{matrix} R_{s} \\ 0 \end{matrix} \frac{d ℓ}{R_{s}} \frac{2 ℓ}{R_{s}} F (ℓ) \\ = & {(1 - \frac{r}{R_{s}})}^{2} + \int_{R_{s} - r}^{R_{s}} \frac{d ℓ}{R_{s}} \frac{2 ℓ}{R_{s}} F (ℓ) . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{2} \begin{eqnarray} \frac{N'}{N} & = & \int_0^{R_{\rm s}} \frac{\dd\ell}{R_{\rm s}} \frac{2\ell}{R_{\rm s}} F(\ell) \\ \label{N-N} & = & \left(1-\frac{r}{R_{\rm s}}\right)^2 + \int_{R_{\rm s}-r}^{R_{\rm s}} \frac{\dd\ell}{R_{\rm s}} \frac{2\ell}{R_{\rm s}} F(\ell). \end{eqnarray}$ This gives the ratio of the number of pairs N′, which is measured in the survey, to the number N obtained when we do not take finite-size effects into account. For instance, at z = 1, which corresponds to the angular distance $Mathematical equation: \hbox{$\cD\simeq 2352~h^{-1}$}$ Mpc, and for a survey angular window of area 50 deg², which corresponds to θ_s ≃ 0.0696 rad, we have $Mathematical equation: \hbox{$R_s=\cD\theta_{\rm s}\simeq 164~h^{-1}$}$ Mpc. Then, we obtain N′/N ≃ 0.91 for a shell at radius r = 30 h^-1 Mpc. This means that the approximation F = 1 overestimates the number of pairs by about 10% and the signal-to-noise ratio by 5%.

Appendix C: Computation of the mean of the estimators $Mathematical equation: \hbox{$\hxi$}$ and $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

Defining the 3D Fourier-space top-hat as $\begin{matrix} ˜ \\ W_{3} \end{matrix} (kR) = \int_{0}^{R} \frac{d r}{4 π R^{3} / 3} e^{i k \cdot r} = 3 \frac{\sin (kR) - kR \cos (kR)}{(kR)^{3}},$ $Mathematical equation: \appendix \setcounter{section}{3} \begin{equation} \tW_3(kR) = \! \int_0^R \!\! \frac{\dd \vr}{4\pi R^3/3} \, {\rm e}^{\ii\vk\cdot\vr} = 3 \, \frac{\sin(kR)-kR\cos(kR)}{(kR)^3} , \label{W3-def} \end{equation}$ (C.1)the 3D Fourier-space window of the i-shell reads as $\begin{matrix} {W_{i}^{(3)}}_{˜} (k) & = & \int 𝒱_{i} \frac{d r}{𝒱_{i}} e^{i k \cdot r} \\ = & \frac{R_{i, +}^{3} \begin{matrix} ˜ \\ W_{3} \end{matrix} (k R_{i, +}) - R_{i, -}^{3} \begin{matrix} ˜ \\ W_{3} \end{matrix} (k R_{i, -})}{R_{i, +}^{3} - R_{i, -}^{3}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{3} \begin{eqnarray} \tW^{(3)}_i(k) & = & \int_{\cV_i}\frac{\dd\vr}{\cV_i} \, {\rm e}^{\ii\vk\cdot\vr} \nonumber \\ \label{W3-D-def} & = & \frac{\Rip^3 \tW_3(k\Rip) - \Rim^3 \tW_3(k\Rim)}{\Rip^3- \Rim^3} , \end{eqnarray}$ (C.2)where the superscript (3) recalls that we consider a 3D radial bin. Then, writing the two-point correlation function in terms of the power spectrum, as in Eq. (3), we obtain for its radial average (54) $ξ_{i^{'}}^{(r)} (z) = \int_{0}^{\infty} \frac{d k}{k} Δ^{2} (k,z) {W_{i}^{(3)}}_{˜} (k) .$ $Mathematical equation: \appendix \setcounter{section}{3} \begin{equation} \xir_{i'}(z) = \int_0^{\infty} \frac{\dd k}{k} \, \Delta^2(k,z) \, \tW^{(3)}_i(k) . \label{I3-def} \end{equation}$ (C.3)(Here i and i′ refer to the same radial bin; the prime only recalls that we are integrating over a neighbor i′ within a small radial shell with respect to another point in the observational cone.)

Appendix D: Derivation of the covariance of the Peebles & Hauser estimator $Mathematical equation: \hbox{$\hxi$}$

We compute here the covariance of the estimators $Mathematical equation: \hbox{$\hxi_i$}$ , which is identical to the covariance of the quantities $Mathematical equation: \hbox{$(1+\hxi_i)$}$ . To simplify the expressions we do not consider mass binning here, but it is straightforward to generalize to the case of several mass bins. From the definition (48) we can write with obvious notations the second moment as $\begin{matrix} ⟨ (1 + ξ̂ i) (1 + ξ̂ j) ⟩ & = & \frac{1}{Q_{i}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \frac{1}{Q_{j}} \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \\ \times ⟨ \frac{d n̂}{d \ln M_{i}} \frac{d n̂}{d \ln M_{i^{'}}} \frac{d n̂}{d \ln M_{j}} \frac{d n̂}{d \ln M_{j^{'}}} ⟩ \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{eqnarray} \lag (1+\hxi_i)(1+\hxi_j)\rag & = & \frac{1}{\QQ_i} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \hspace{-1.5cm} \times \frac{1}{\QQ_j} \int \dd\chi_j \, \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \nonumber \\ \label{xii-xij-1} && \hspace{-1.5cm} \times \, \left\lag \frac{\dd\hn}{\dd\!\ln M_i} \frac{\dd\hn}{\dd\!\ln M_{i'}} \frac{\dd\hn}{\dd\!\ln M_j} \frac{\dd\hn}{\dd\!\ln M_{j'}} \right\rag \cdot \end{eqnarray}$ (D.1)The average in Eq. (D.1) can be written as in Eq. (A.10), with many Dirac factors for the shot-noise contributions. However, as in Appendix A, it may be easier to follow Peebles (1980) and to divide “volumes” over small (infinitesimal) cells that contain $Mathematical equation: \hbox{$\hn$}$ objects, with $Mathematical equation: \hbox{$\hn=0$}$ or 1. Then, we can split the average $Mathematical equation: \hbox{$\lag\hn_i\hn_{i'}\hn_j\hn_{j'}\rag$}$ as $\begin{matrix} ⟨ n̂ i n̂ i^{'} n̂ j n̂ j^{'} ⟩ & = & ⟨ n̂ i n̂ i^{'} n̂ j n̂ j^{'} ⟩^{(s . v .)} + δ_{i,j} ⟨ n̂ i n̂ i^{'} n̂ j^{'} ⟩^{(s . v .)} \\ + δ_{i, j^{'}} ⟨ n̂ i n̂ i^{'} n̂ j ⟩^{(s . v .)} + δ_{i^{'},j} ⟨ n̂ i n̂ i^{'} n̂ j^{'} ⟩^{(s . v .)} \\ + δ_{i^{'}, j^{'}} ⟨ n̂ i n̂ i^{'} n̂ j ⟩^{(s . v .)} + δ_{i,j} δ_{i^{'}, j^{'}} ⟨ n̂ i n̂ i^{'} ⟩^{(s . v .)} \\ + δ_{i, j^{'}} δ_{i^{'},j} ⟨ n̂ i n̂ i^{'} ⟩^{(s . v .)}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{eqnarray} \lag\hn_i\hn_{i'}\hn_j\hn_{j'}\rag &= & \lag\hn_i\hn_{i'}\hn_j\hn_{j'}\rag^{\sv} + \delta_{i,j} \lag\hn_i\hn_{i'}\hn_{j'}\rag^{\sv} \nonumber \\ && + \delta_{i,j'} \lag\hn_i\hn_{i'}\hn_{j}\rag^{\sv} + \delta_{i',j} \lag\hn_i\hn_{i'}\hn_{j'}\rag^{\sv} \nonumber \\ && + \delta_{i',j'} \lag\hn_i\hn_{i'}\hn_{j}\rag^{\sv} + \delta_{i,j} \delta_{i',j'} \lag\hn_i\hn_{i'}\rag^{\sv} \nonumber \\ \label{n4-sn-def} && + \delta_{i,j'} \delta_{i',j} \lag\hn_i\hn_{i'}\rag^{\sv} , \end{eqnarray}$ (D.2)where we have explicitly written the first “pure sample-variance” contribution and the last six “shot-noise” contributions associated with the Kronecker symbols. The remaining averages with the superscript “(s.v.)” denote “sample-variance” averages, that is, without further shot-noise terms. Here we used the fact that the objects i and i′ are separated by the finite distance r_i′, with r_i′ ≥ R_i, −, so that the elementary “cells” i and i′ cannot coincide and there is no shot-noise contribution of the form δ_i,i′. For the same reason there is no term δ_j,j′. Next, the “sample-variance” averages of Eq. (D.2) read as (Peebles 1980) $\begin{matrix} ⟨ n̂ i n̂ i^{'} n̂ j n̂ j^{'} ⟩^{(s . v .)} = ⟨ n̂ i ⟩ ⟨ n̂ i^{'} ⟩ ⟨ n̂ j ⟩ ⟨ n̂ j^{'} ⟩ [1 + ξ_{i, i^{'}}^{h} + ξ_{i,j}^{h} + {ξ_{i, j^{'}}^{h}}^{} \\ + ξ_{i^{'},j}^{h} + ξ_{i^{'}, j^{'}}^{h} + ξ_{j, j^{'}}^{h} + ζ_{i^{'},j, j^{'}}^{h} + ζ_{i,j, j^{'}}^{h} + ζ_{i, i^{'}, j^{'}}^{h} + ζ_{i, i^{'},j}^{h} \\ + ξ_{i, i^{'}}^{h} ξ_{j, j^{'}}^{h} + ξ_{i,j}^{h} ξ_{i^{'}, j^{'}}^{h} + ξ_{i, j^{'}}^{h} ξ_{i^{'},j}^{h} + {η_{i, i^{'},j, j^{'}}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{eqnarray} \lefteqn{ \lag\hn_i\hn_{i'}\hn_j\hn_{j'}\rag^{\sv} = \lag\hn_i\rag \lag\hn_{i'}\rag \lag\hn_j\rag \lag\hn_{j'}\rag \left[ 1 + \xih_{i,i'} + \xih_{i,j} + \xih_{i,j'} \right. } \nonumber \\ && + \xih_{i',j} + \xih_{i',j'} + \xih_{j,j'} + \zetah_{i',j,j'} + \zetah_{i,j,j'} + \zetah_{i,i',j'} + \zetah_{i,i',j} \nonumber \\ \label{n4-def} && \left. + \xih_{i,i'} \xih_{j,j'} + \xih_{i,j} \xih_{i',j'} + \xih_{i,j'} \xih_{i',j} + \etah_{i,i',j,j'} \right] , \end{eqnarray}$ (D.3) $\begin{matrix} ⟨ n̂ i n̂ i^{'} n̂ j^{'} ⟩^{(s . v .)} & = & ⟨ n̂ i ⟩ ⟨ n̂ i^{'} ⟩ ⟨ n̂ j^{'} ⟩ [1 + ξ_{i, i^{'}}^{h} + ξ_{i, j^{'}}^{h} + {ξ_{i^{'}, j^{'}}^{h}}^{} \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{eqnarray} \lag\hn_i\hn_{i'}\hn_{j'}\rag^{\sv} & = & \lag\hn_i\rag \lag\hn_{i'}\rag \lag\hn_{j'}\rag \left[ 1 + \xih_{i,i'} +\xih_{i,j'} +\xih_{i',j'} \right. \nonumber \\ && \left. + \zetah_{i,i',j'} \right] , \label{n3-def} \end{eqnarray}$ (D.4) $⟨ n̂ i n̂ i^{'} ⟩^{(s . v .)} = ⟨ n̂ i ⟩ ⟨ n̂ i^{'} ⟩ [1 + {ξ_{i, i^{'}}^{h}}^{]},$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} \lag\hn_i\hn_{i'}\rag^{\sv} = \lag\hn_i\rag \lag\hn_{i'}\rag \left[ 1 + \xih_{i,i'} \right] , \label{n2-def} \end{equation}$ (D.5)where ξ^h, ζ^h, and η^h are the two-point, three-point, and four-point correlation functions of the objects. Since we have $⟨ ξ̂ i ξ̂ j ⟩ - ⟨ ξ̂ i ⟩ ⟨ ξ̂ j ⟩ = ⟨ (1 + ξ̂ i) (1 + ξ̂ j) ⟩ - ⟨ 1 + ξ̂ i ⟩ ⟨ 1 + ξ̂ j ⟩,$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} \lag\hxi_i\hxi_j\rag - \lag\hxi_i\rag \lag\hxi_j\rag = \lag(1\!+\!\hxi_i) (1\!+\!\hxi_j)\rag - \lag 1\!+\!\hxi_i\rag \lag 1\!+\!\hxi_j\rag , \end{equation}$ (D.6)we obtain from Eqs. (D.1)–(D.5) the decomposition (59) of the covariance matrix, with the explicit expressions (60)–(62) of the various “sample-variance” and “shot-noise” contributions. Here we used the symmetries¹⁰ { i ↔ i′ } and { j ↔ j′ } of Eq. (D.1). In Eq. (61) the object “j′” is at the distance r_j′ from the object “i”, since this shot-noise contribution comes from the case where the objects i and j are the same object (or from one of the three remaining cases “i = j′”, “i′ = j”, or “i′ = j′”). The shot-noise contribution (60) comes from the identification “i = j and i′ = j′” (or “i = j′ and i′ = j”). This implies that the distances r_i′ and r_j′ are equal, which gives rise to the Kronecker symbol δ_i,j since we consider the case of nonoverlapping distance bins [R_i, −,R_i, +] .

From Eq. (52) the contribution $C_{i,j}^{(2)}$ $Mathematical equation: \hbox{$C_{i,j}^{(2)}$}$ of Eq. (60) also reads as $C_{i,j}^{(2)} = δ_{i,j} \frac{2}{(ΔΩ) Q_{i}} (1 + ⟨ ξ̂ i ⟩) .$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} C_{i,j}^{(2)} = \delta_{i,j} \frac{2}{(\Delta\Omega)\QQ_i} (1+\lag\hxi_i\rag) . \label{C2-1} \end{equation}$ (D.7)In order to estimate the contributions $C_{ij}^{(3)}$ $Mathematical equation: \hbox{$C_{ij}^{(3)}$}$ and $C_{ij}^{(4)}$ $Mathematical equation: \hbox{$C_{ij}^{(4)}$}$ we assume that the radial bins [R_i, −,R_i, +] are restricted to large enough scales to neglect three- and four-point correlation functions, as well as products such as ξ_i;j′ξ_i′;j. Thus, we only keep in this Appendix the contributions that are constant or linear over the two-point correlation function ξ_i;j of the objects, which we recall with the superscripts “1” and “ξ” below. Moreover, we again assume that the two-point correlation function can be factored in as in Eq. (1).

The first contribution to $C_{i,j}^{(3)}$ $Mathematical equation: \hbox{$C_{i,j}^{(3)}$}$ , associated with the factor 1 in the brackets in Eq. (61), reads as $C_{i,j}^{(3, 1)} = \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ 𝒟^{2} n^{3} 𝒱_{i} 𝒱_{j} .$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} C_{i,j}^{(3,1)} = \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi \, \cD^2 \, \nb^3 \, \cV_i \cV_j . \label{C3-0-1} \end{equation}$ (D.8)The contributions that are linear over ξ sum up as $\begin{matrix} C_{i,j}^{(3,ξ)} & = & \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ 𝒟^{2} b^{2} n^{3} 𝒱_{i} 𝒱_{j} \\ \times [ξ_{i^{'}}^{(r)} + ξ_{j^{'}}^{(r)} + ξ_{i^{'}, j^{'}}^{(r)}], \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{eqnarray} C_{i,j}^{(3,\xi)} & = & \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi \, \cD^2 \, \bb^2 \, \nb^3 \, \cV_i \cV_j \nonumber \\ \label{C3-xi-1} && \times \, \left[ \overline{\xir_{i'}} + \overline{\xir_{j'}} + \overline{\xir_{i',j'}} \right] , \end{eqnarray}$ (D.9)where $ξ_{i^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i'}}$}$ and $ξ_{j^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{j'}}$}$ are defined as in Eqs. (54) and (C.3), whereas $ξ_{i^{'}, j^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i',j'}}$}$ is defined in Eq. (68) and also writes as $ξ_{i^{'}, j^{'}}^{(r)} = \int_{0}^{\infty} \frac{d k}{k} Δ^{2} (k,z) {W_{i}^{(3)}}_{˜} (k) {W_{j}^{(3)}}_{˜} (k) .$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} \overline{\xir_{i',j'}} = \int_0^{\infty} \frac{\dd k}{k} \, \Delta^2(k,z) \, \tW_i^{(3)}(k) \, \tW_j^{(3)}(k) . \label{I3-ij-def} \end{equation}$ (D.10)Next, at this order the contribution (62) to the covariance simplifies as $C_{i,j}^{(4,ξ)} = \frac{4}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{2} n^{4} 𝒱_{i} 𝒱_{j} ξ_{cyl},$ $Mathematical equation: \appendix \setcounter{section}{4} \begin{equation} C_{i,j}^{(4,\xi)} = \frac{4}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^2 \nb^4 \, \cV_i \cV_j \, \xicyl , \label{C4-xi-2} \end{equation}$ (D.11)where $ξ_{cyl}$ $Mathematical equation: \hbox{$\xicyl$}$ is Limber’s approximation (13) to Eq. (8). Then, collecting all terms, we obtain the expression (67) for the covariance.

Appendix E: Derivation of the mean and covariance of the Landy & Szalay estimator $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

We can relate the Landy & Szalay estimator $Mathematical equation: \hbox{$\hxiLS$}$ defined by Eq. (56) to the Peebles & Hauser estimator (48) by $ξ̂ \begin{matrix} LS \\ i \end{matrix} = ξ̂ i - 2 ξ̂ \begin{matrix} c \\ i \end{matrix},$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{equation} \hxiLS_i = \hxi_i - 2 \hxic_i , \label{xi-LS-c} \end{equation}$ (E.1)where we defined the cross-term $ξ̂ \begin{matrix} c \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxic_i$}$ by $\begin{matrix} 1 + ξ̂ \begin{matrix} c \\ i \end{matrix} & = & \frac{1}{Q_{i}} \int d z \frac{d χ}{d z} 𝒟^{2} \frac{d Ω}{(ΔΩ)} \frac{d M}{M} \int_{i} d r' \frac{d M^{'}}{M^{'}} \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} 1+\hxic_i & = & \frac{1}{\QQ_i} \int \dd z \, \frac{\dd\chi}{\dd z} \, \cD^2 \frac{\dd\vOm}{(\Delta\Omega)} \frac{\dd M}{M} \int_i \dd\vr' \frac{\dd M'}{M'} \nonumber \\ && \times \, \frac{\dd\hn}{\dd\!\ln M} \frac{\dd n}{\dd\!\ln M'} \cdot \label{xi-c-1} \end{eqnarray}$ (E.2)We obtain at once, using Eqs. (A.4) and (50), $⟨ ξ̂ \begin{matrix} c \\ i \end{matrix} ⟩ = 0,$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{equation} \lag \hxic_i \rag =0 , \label{hxi-c-2} \end{equation}$ (E.3)which leads to Eq. (57).

From the relation (E.1) we have for the covariance of the estimator $Mathematical equation: \hbox{$\hxiLS$}$ , $C_{i,j}^{LS} = C_{i,j} - 2 ⟨ ξ̂ i ξ̂ \begin{matrix} c \\ j \end{matrix} ⟩ - 2 ⟨ ξ̂ \begin{matrix} c \\ i \end{matrix} ξ̂ j ⟩ + 4 ⟨ ξ̂ \begin{matrix} c \\ i \end{matrix} ξ̂ \begin{matrix} c \\ j \end{matrix} ⟩,$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{equation} C^{\rm LS}_{i,j} = C_{i,j} - 2 \lag\hxi_i\hxic_j\rag - 2 \lag\hxic_i\hxi_j\rag + 4 \lag\hxic_i\hxic_j\rag , \label{Cij-LS-1} \end{equation}$ (E.4)where C_i,j is the covariance of the Peebles & Hauser estimator $Mathematical equation: \hbox{$\hxi$}$ , defined in Eq. (59). To compute the cross-terms in (E.4) we write as in Eq. (D.1), $\begin{matrix} ⟨ (1 + ξ̂ i) (1 + ξ̂ \begin{matrix} c \\ j \end{matrix}) ⟩ & = & \frac{1}{Q_{i}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \frac{1}{Q_{j}} \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \\ \times ⟨ \frac{d n̂}{d \ln M_{i}} \frac{d n̂}{d \ln M_{i^{'}}} \frac{d n̂}{d \ln M_{j}} ⟩ \frac{d n}{d \ln M_{j^{'}}} \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} \lag (1+\hxi_i)(1+\hxic_j)\rag & = & \frac{1}{\QQ_i} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \hspace{-1.5cm} \times \frac{1}{\QQ_j} \int \dd\chi_j \, \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \nonumber \\ \label{xii-xicj-1} && \hspace{-1.5cm} \times \, \left\lag \frac{\dd\hn}{\dd\!\ln M_i} \frac{\dd\hn}{\dd\!\ln M_{i'}} \frac{\dd\hn}{\dd\!\ln M_j} \right\rag \, \frac{\dd n}{\dd\!\ln M_{j'}} \cdot \end{eqnarray}$ (E.5)Proceeding as in Appendix D, this gives $C_{i,j}^{c} = ⟨ ξ̂ i ξ̂ \begin{matrix} c \\ j \end{matrix} ⟩ = C_{i,j}^{c (3)} + C_{i,j}^{c (4)},$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{equation} C^{\rm c}_{i,j} = \lag\hxi_i\hxic_j \rag = C_{i,j}^{\rm c (3)} + C_{i,j}^{\rm c (4)} , \label{xii-xicj-2} \end{equation}$ (E.6)with $\begin{matrix} C_{i,j}^{c (3)} & = & \frac{2}{(ΔΩ) Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}} [1 + {ξ_{i, i^{'}}^{h}}^{]}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} C_{i,j}^{\rm c (3)} & = & \frac{2}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi_i \, \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ \label{C2-c-def} && \hspace{-0.7cm} \times \int_j \! \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} \left[ 1\!+\! \xih_{i,i'} \right] , \end{eqnarray}$ (E.7) $\begin{matrix} C_{i,j}^{c (4)} & = & \frac{1}{Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \frac{d n}{d \ln M_{i}} \\ \times \frac{d n}{d \ln M_{i^{'}}} \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{j}} \\ \times \frac{d n}{d \ln M_{j^{'}}} [2 ξ_{i; j}^{h} + {ζ_{i, i^{'}; j}^{h}}^{]} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} C_{i,j}^{\rm c (4)} & = & \frac{1}{\QQ_i\QQ_j} \int \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \frac{\dd n}{\dd\!\ln M_i} \nonumber \\ && \hspace{-0.6cm} \times \frac{\dd n}{\dd\!\ln M_{i'}} \int \dd\chi_j \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_j} \nonumber \\ \label{C3-c-def} && \hspace{-0.6cm} \times \frac{\dd n}{\dd\!\ln M_{j'}} \left[ 2 \xih_{i;j} + \zetah_{i,i';j} \right] . \end{eqnarray}$ (E.8)Next, to compute the last term in Eq. (E.4) we write $\begin{matrix} ⟨ (1 + ξ̂ \begin{matrix} c \\ i \end{matrix}) (1 + ξ̂ \begin{matrix} c \\ j \end{matrix}) ⟩ & = & \frac{1}{Q_{i}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \frac{1}{Q_{j}} \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \\ \times ⟨ \frac{d n̂}{d \ln M_{i}} \frac{d n̂}{d \ln M_{j}} ⟩ \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} \lag (1+\hxic_i)(1+\hxic_j)\rag & = & \frac{1}{\QQ_i} \int \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ && \hspace{-1.5cm} \times \frac{1}{\QQ_j} \int \dd\chi_j \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \nonumber \\ \label{xici-xicj-1} && \hspace{-1.5cm} \times \, \left\lag \frac{\dd\hn}{\dd\!\ln M_i} \frac{\dd\hn}{\dd\!\ln M_j} \right\rag \, \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} , \end{eqnarray}$ (E.9)whence $C_{i,j}^{cc} = ⟨ ξ̂ \begin{matrix} c \\ i \end{matrix} ξ̂ \begin{matrix} c \\ j \end{matrix} ⟩ = C_{i,j}^{cc (3)} + C_{i,j}^{cc (4)},$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{equation} C^{\rm cc}_{i,j} = \lag\hxic_i\hxic_j \rag = C_{i,j}^{\rm cc (3)} + C_{i,j}^{\rm cc (4)} , \label{xici-xicj-2} \end{equation}$ (E.10)with $\begin{matrix} C_{i,j}^{cc (3)} & = & \frac{1}{(ΔΩ) Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \\ \times \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{i}} \frac{d n}{d \ln M_{i^{'}}} \frac{d n}{d \ln M_{j^{'}}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} C_{i,j}^{\rm cc (3)} & = & \frac{1}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \nonumber \\ \label{C0-cc-def} && \times \int_j \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \; \frac{\dd n}{\dd\!\ln M_i} \frac{\dd n}{\dd\!\ln M_{i'}} \frac{\dd n}{\dd\!\ln M_{j'}} , \end{eqnarray}$ (E.11) $\begin{matrix} C_{i,j}^{cc (4)} & = & \frac{1}{Q_{i} Q_{j}} \int d χ_{i} 𝒟_{i}^{2} \frac{d Ω_{i}}{(ΔΩ)} \frac{d M_{i}}{M_{i}} \int_{i} d r i^{'} \frac{d M_{i^{'}}}{M_{i^{'}}} \frac{d n}{d \ln M_{i}} \\ \times \frac{d n}{d \ln M_{i^{'}}} \int d χ_{j} 𝒟_{j}^{2} \frac{d Ω_{j}}{(ΔΩ)} \frac{d M_{j}}{M_{j}} \int_{j} d r j^{'} \frac{d M_{j^{'}}}{M_{j^{'}}} \frac{d n}{d \ln M_{j}} \\ \times \frac{d n}{d \ln M_{j^{'}}} ξ_{i; j}^{h} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{5} \begin{eqnarray} C_{i,j}^{\rm cc (4)} & = & \frac{1}{\QQ_i\QQ_j} \int \! \dd\chi_i \cD_i^2 \frac{\dd\vOm_i}{(\Delta\Omega)} \frac{\dd M_i}{M_i} \int_i \! \dd\vr_{i'} \frac{\dd M_{i'}}{M_{i'}} \frac{\dd n}{\dd\!\ln M_i} \nonumber \\ && \times \frac{\dd n}{\dd\!\ln M_{i'}} \int \! \dd\chi_j \cD_j^2 \frac{\dd\vOm_j}{(\Delta\Omega)} \frac{\dd M_j}{M_j} \int_j \! \dd\vr_{j'} \frac{\dd M_{j'}}{M_{j'}} \frac{\dd n}{\dd\!\ln M_j} \nonumber \\ \label{C2-cc-def} && \times \frac{\dd n}{\dd\!\ln M_{j'}} \; \xih_{i;j}. \end{eqnarray}$ (E.12)Collecting all terms in Eq. (E.4), which reads as $C_{i,j}^{LS} = C_{i,j} - 2 C_{i,j}^{c} - 2 C_{j,i}^{c} + 4 C_{i,j}^{cc}$ $Mathematical equation: \hbox{$C^{\rm LS}_{i,j} = C_{i,j} - 2 C_{i,j}^{\rm c} - 2 C_{j,i}^{\rm c} + 4 C^{\rm cc}_{i,j}$}$ , we obtain the decomposition (63) with the contributions (64)–(66).

Appendix F: Computation of high-order terms for the covariance of $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

We compute here the high-order terms for the covariance $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ of the Landy-Szalay estimator $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ that we had neglected in Eq. (69).

For numerical computations, it is often more efficient to express the quantities that we encounter in this work in terms of the real-space correlation ξ(x), instead of the Fourier-space power spectra P(k) or Δ²(k), provided ξ(x) is known (e.g., computed in advance on a fine grid¹¹). Indeed, this replaces oscillatory integrals by integrals with slowly-varying factors, which allows faster and more accurate computations. This comes from mostly considering various kinds of volume averages of correlation functions, such as Eq. (8), which are more naturally written in configuration space. This yields integrations over bounded or unbounded domains with typically positive and slowly-varying kernels. In contrast, the transformation to Fourier space yields highly oscillatory kernels as soon as some underlying real-space volumes are finite with a size much larger than some other scales (see for instance the 2D top-hat (12) for a window θ_s that is much broader than the typical angular scale $Mathematical equation: \hbox{$1/(k_{\perp}\cD)$}$ ). On the other hand, intermediate analytical computations are often easier to perform in Fourier space, mostly because of the convolution theorem. Then, a convenient method is to first write expressions in terms of Fourier-space power spectra, perform integrations over angles, and finally go back to the real-space correlation function, using the fact that from Eq. (3), ξ(x) and Δ²(k) are related by $ξ (x) = \int \frac{d k}{k} Δ^{2} (k) \frac{\sin (kx)}{kx},$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \xi(x) = \int \frac{\dd k}{k} \, \Delta^2(k) \, \frac{\sin(k x)}{k x} , \label{xi-Deltak} \end{equation}$ (F.1) $Δ^{2} (k) = \frac{2}{π} \int \frac{d x}{x} ξ (x) (kx)^{2} \sin (kx) .$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \Delta^2(k) = \frac{2}{\pi} \int \frac{\dd x}{x} \, \xi(x) \, (k x)^2 \, \sin(k x) . \label{Deltak-xi} \end{equation}$ (F.2)As shown below, this method also allows partial factorization of most integrals.

A first high-order contribution to the covariance $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ arises from the product ξ_i;j′ξ_i′;j in Eq. (66), which also writes as Eq. (75) where we introduced the quantity $ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i;j'} \xir_{i';j}}$}$ defined by $ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} = \int \frac{d χ_{j}}{𝒟_{i}} \int \frac{d Ω_{i} d Ω_{j}}{(ΔΩ)^{2}} \int \frac{d r i^{'} d r j^{'}}{𝒱_{i} 𝒱_{j}} ξ_{i; j^{'}} ξ_{i^{'}; j} .$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xir_{i;j'} \xir_{i';j}} = \int \frac{\dd\chi_j}{\cD_i} \int\frac{\dd\vOm_i\dd\vOm_j}{(\Delta\Omega)^2} \int\frac{\dd\vr_{i'}\dd\vr_{j'}}{\cV_i\cV_j} \, \xi_{i;j'} \xi_{i';j} . \label{Kij-def} \end{equation}$ (F.3)Expressing the two-point correlation functions in terms of the power spectrum, using the flat-sky (small angle) approximation, as well as Limber’s approximation as we did for Eq. (9), we obtain after integration over angles and over the two radial shells, $\begin{matrix} ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} & = & \frac{2 π}{𝒟} \int d k 1 d k 2 P (k_{1}) P (k_{2}) δ_{D} (k_{1 ∥} + k_{2 ∥}) \\ \times {W_{i}^{(3)}}_{˜} (k_{1}) {W_{j}^{(3)}}_{˜} (k_{2}) \begin{matrix} ˜ \\ W_{2} \end{matrix} (| k 1 ⊥ + k 2 ⊥ | 𝒟 θ_{s})^{2} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{i;j'} \xir_{i';j}} & = & \frac{2\pi}{\cD} \int \dd\vk_1\dd\vk_2 \, P(k_1) P(k_2) \, \delta_{\rm D}(k_{1\parallel}+k_{2\parallel}) \nonumber \\ \label{xi-ij-ij-1} && \hspace{-0.4cm} \times \, \tW_i^{(3)}(k_1) \, \tW_j^{(3)}(k_2) \, \tW_2(|\vk_{1\perp}+\vk_{2\perp}|\cD\theta_{\rm s})^2 . \end{eqnarray}$ (F.4)Again, the factor 2πδ_D(k_1 ∥ + k_2 ∥) comes from the integration over χ_j, which suppresses longitudinal wavelengths. Using the exponential representation of Dirac functions, Eq. (F.4) can be partially factorized as $\begin{matrix} ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} & = & \int \frac{d r}{𝒟 (2 π)^{2}} \int d k 1 d k 2 P (k_{1}) P (k_{2}) {W_{i}^{(3)}}_{˜} (k_{1}) {W_{j}^{(3)}}_{˜} (k_{2}) \\ \times \int d k ⊥ \begin{matrix} ˜ \\ W_{2} \end{matrix} (k_{⊥} 𝒟 θ_{s})^{2} e^{i r_{∥} \cdot (k_{1 ∥} + k_{2 ∥}) + i r ⊥ \cdot (k 1 ⊥ + k 2 ⊥ - k ⊥)}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{i;j'} \xir_{i';j}} & \!=\!\! & \!\int\!\!\frac{\dd\vr}{\cD(2\pi)^2} \!\int\!\!\dd\vk_1\dd\vk_2 \, P(k_1) P(k_2) \, \tW_i^{(3)}(k_1) \, \tW_j^{(3)}(k_2) \, \nonumber \\ \label{Kij-1} && \hspace{-1cm} \times \! \int\!\!\dd\vk_{\perp} \tW_2(k_{\perp}\cD\theta_{\rm s})^2 {\rm e}^{\ii r_{\parallel}\cdot(k_{1\parallel}\!+k_{2\parallel}) +\ii\vr_{\perp}\cdot(\vk_{1\perp}\!+\vk_{2\perp}\!-\vk_{\perp})} , \end{eqnarray}$ (F.5)and the integration over angles yields $\begin{matrix} ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} & = & 2 \int \frac{d r}{r} \int \frac{d k_{1}}{k_{1}} \frac{Δ^{2} (k_{1})}{𝒟 k_{1}} \sin (k_{1} r) {W_{i}^{(3)}}_{˜} (k_{1}) \\ \times \int \frac{d k_{2}}{k_{2}} \frac{Δ^{2} (k_{2})}{𝒟 k_{2}} \sin (k_{2} r) {W_{j}^{(3)}}_{˜} (k_{2}) \\ \times \int \frac{d k}{k} 𝒟 k \sin (kr) \begin{matrix} ˜ \\ W_{2} \end{matrix} (k 𝒟 θ_{s})^{2} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{i;j'} \xir_{i';j}} & = & 2 \int \! \frac{\dd r}{r} \int \frac{\dd k_1}{k_1} \frac{\Delta^2(k_1)}{\cD k_1} \sin(k_1 r) \tW_i^{(3)}(k_1) \nonumber \\ && \times \int \frac{\dd k_2}{k_2} \frac{\Delta^2(k_2)}{\cD k_2} \sin(k_2 r) \tW_j^{(3)}(k_2) \nonumber \\ \label{Kij-2} && \times \int \frac{\dd k}{k} \cD k \sin(k r) \tW_2(k\cD\theta_{\rm s})^2 . \end{eqnarray}$ (F.6)This also reads as $ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} = 2 θ_{s} \int \frac{d rr}{(𝒟 θ_{s})^{2}} ℐ_{i}^{(3)} (r) ℐ_{j}^{(3)} (r) A^{(3)} (\frac{r}{𝒟 θ_{s}}),$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xir_{i;j'} \xir_{i';j}} = 2\theta_{\rm s} \! \int\! \frac{\dd r \; r}{(\cD\theta_{\rm s})^2} \; \cI_i^{(3)}(r) \, \cI_j^{(3)}(r) \, A^{(3)}\!\left(\!\frac{r}{\cD\theta_{\rm s}}\!\right) , \label{Kij-3} \end{equation}$ (F.7)where we introduced $A^{(3)} (y) = \int_{0}^{\infty} d u \sin (yu) \begin{matrix} ˜ \\ W_{2} \end{matrix} (u)^{2},$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} A^{(3)}(y) = \int_0^{\infty} \dd u \, \sin(y u) \, \tW_2(u)^2 , \label{A3_y-def} \end{equation}$ (F.8)and $ℐ_{i}^{(3)} (r) = \int \frac{d k}{k} Δ^{2} (k) \frac{\sin (kr)}{kr} {W_{i}^{(3)}}_{˜} (k) .$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \cI_i^{(3)}(r) = \int \frac{\dd k}{k} \, \Delta^2(k) \, \frac{\sin(kr)}{kr} \, \tW_i^{(3)}(k) . \label{cI-3-def} \end{equation}$ (F.9)The function A⁽³⁾(y) can be written as $\begin{matrix} 0 < y < 2 : A^{(3)} (y) & = & \frac{2}{3 π} [3 πy - 2 (4 + y^{2}) E (y / 2)^{} \\ - 2 (- 4 + y^{2}) K (y / 2)^{]}, \\ y > 2 : A^{(3)} (y) & = & \frac{2 y}{3 π} [3 π - (4 + y^{2}) E (2 / y)^{} \\ + (- 4 + y^{2}) K (2 / y)^{]}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} 0< y< 2\!: A^{(3)}(y) & = & \frac{2}{3\pi} \left[ 3\pi y - 2 (4+y^2) {\bf E}(y/2) \right .\nonumber \\ && \left. - 2 (-4+y^2) {\bf K}(y/2) \right] , \\ y> 2\! : A^{(3)}(y) & = & \frac{2 y}{3\pi} \left[ 3\pi - (4+y^2) {\bf E}(2/y) \right .\nonumber \\ \label{A3_y} && \left. + (-4+y^2) {\bf K}(2/y) \right] , \end{eqnarray}$ where K(k) and E(k) are the complete elliptic integrals of the first and second kinds (Gradshteyn & Ryzhik 1965). One can check that A⁽³⁾(y) is a positive, nonoscillatory, and continuous function (but not analytic at y = 2), with A⁽³⁾(y) ~ 2y for y → 0 and A⁽³⁾(y) ~ 1/y for y → ∞.

Going back to configuration space, by substituting Eq. (F.2), the integral (F.9) can be written as $\begin{matrix} ℐ_{i}^{(3)} (r) & = & \int \frac{d xx ξ (x)}{r (R_{i, +}^{3} - R_{i, -}^{3})} [R_{i, +}^{2} W_{3} (\frac{x}{R_{i, +}}, \frac{r}{R_{i, +}}) \\ - R_{i, -}^{2} W_{3} (\frac{x}{R_{i, -}}, \frac{r}{R_{i, -}})] \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \cI_i^{(3)}(r) & = & \int \frac{\dd x \; x \, \xi(x)}{r(\Rip^3-\Rim^3)} \, \left[ \Rip^2 W_3\!\left(\frac{x}{\Rip},\frac{r}{\Rip}\right) \right. \nonumber \\ \label{cI-3-W3} && \left. - \Rim^2 W_3\!\left(\frac{x}{\Rim},\frac{r}{\Rim}\right) \right] \end{eqnarray}$ (F.12)with $W_{3} (a,b) = \frac{2}{π} \int_{0}^{\infty} d u \sin (au) \sin (bu) \begin{matrix} ˜ \\ W_{3} \end{matrix} (u),$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} W_3(a,b) = \frac{2}{\pi} \int_0^{\infty} \dd u \, \sin(a u) \sin(b u) \tW_3(u) , \label{W3-ab-def} \end{equation}$ (F.13)which for a > 0 and b > 0 is given by $\begin{matrix} | a - b | > 1 : & W_{3} = 0 \\ | a - b | < 1,a + b < 1 : & W_{3} = 3 ab \\ | a - b | < 1,a + b > 1 : & W_{3} = \frac{3}{4} [1 - (a - b)^{2}] . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \begin{array}{rl} |a-b|>1 \!: & W_3= 0 \\ |a-b|<1, \; a+b<1 \!: & W_3= 3 a b \\ |a-b|<1, \; a+b>1 \!: & W_3= \frac{3}{4} [1-(a-b)^2] . \end{array} \end{equation}$ (F.14)Thus, using Eq. (F.12), the quantity $ξ_{i; j^{'}}^{(r)} ξ_{i^{'}; j}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i;j'} \xir_{i';j}}$}$ of Eq. (F.7) involves slowly varying integrals over real-space variables, which partially factor as three factors within the integrand of Eq. (F.7). This makes it more efficient to use Eq. (F.7) than the Fourier-space expressions (F.4) or (F.6).

To evaluate the two remaining contributions, associated with the factors ζ_i,i′,j′ in Eq. (65) and η_{i,i′;j,j′} in Eq. (66), we use the model for the three- and four-point correlation functions described in Sect. 2.1.2. Thus, using Eq. (4) for the three-point correlation function that enters Eq. (65), this contribution to Eq. (65) reads as $\begin{matrix} C_{i,j}^{LS (3,ζ)} & = & \frac{4}{(ΔΩ) Q_{i} Q_{j}} \int d χ 𝒟^{2} b^{3} n^{3} 𝒱_{i} 𝒱_{j} \frac{S_{3}}{3} \\ \times [ξ_{i, i^{'}}^{(r)} ξ_{i, j^{'}}^{(r)} + ξ_{i^{'},i}^{(r)} ξ_{i^{'}, j^{'}}^{(r)} + ξ_{j^{'},i}^{(r)} ξ_{j^{'}, i^{'}}^{(r)}] . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} C_{i,j}^{\rm LS (3,\zeta)} & = & \frac{4}{(\Delta\Omega)\QQ_i\QQ_j} \int \dd\chi \, \cD^2 \, \bb^3 \, \nb^3 \, \cV_i \cV_j \, \frac{S_3}{3} \nonumber \\ \label{C3-zeta-1} && \times \, \left[ \overline{\xir_{i,i'} \xir_{i,j'}} + \overline{\xir_{i',i} \xir_{i',j'}} + \overline{\xir_{j',i} \xir_{j',i'}} \right] . \end{eqnarray}$ (F.15)The first term in the bracket in Eq. (F.15) is given by $ξ_{i, i^{'}}^{(r)} ξ_{i, j^{'}}^{(r)} = ξ_{i^{'}}^{(r)} \times ξ_{j^{'}}^{(r)},$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xir_{i,i'} \xir_{i,j'}} = \overline{\xir_{i'}} \, \times \, \overline{\xir_{j'}} , \end{equation}$ (F.16)where $ξ_{i^{'}}^{(r)}$ $Mathematical equation: \hbox{$ \overline{\xir_{i'}}$}$ was defined in Eq. (54), because the integrations over r_i′ and r_j′ are independent. The second term reads as $\begin{matrix} ξ_{i^{'},i}^{(r)} ξ_{i^{'}, j^{'}}^{(r)} & = & \int d k 1 d k 2 P (k_{1}) P (k_{2}) {W_{i}^{(3)}}_{˜} (| k 1 + k 2 |) \\ \times {W_{j}^{(3)}}_{˜} (k_{2}), \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{i',i} \xir_{i',j'}} & = & \int\!\dd\vk_1\dd\vk_2 \, P(k_1) P(k_2) \, \tW_i^{(3)}(|\vk_1+\vk_2|) \nonumber \\ \label{xi-ii-xi-ij-1} && \times \, \tW_j^{(3)}(k_2) , \end{eqnarray}$ (F.17)which no longer factors. Introducing an auxiliary wavenumber k and the Dirac factor δ_D(k₁ + k₂ − k), which we write under its exponential form as in Eq. (F.5), and using the inverse Fourier transform of the 3D shell (C.2), $\begin{matrix} \frac{Θ (r \in 𝒱_{i})}{𝒱_{i}} & = & \int \frac{d k}{(2 π)^{3}} e^{- i k \cdot r} {W_{i}^{(3)}}_{˜} (k) \\ = & \frac{1}{2 π^{2}} \int \frac{d k}{k} k^{3} \frac{\sin (kr)}{kr} {W_{i}^{(3)}}_{˜} (k), \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \frac{\Theta(\vr\in\cV_i)}{\cV_i} & = & \int \frac{\dd\vk}{(2\pi)^3} \, {\rm e}^{-\ii\vk\cdot\vr} \, \tW_i^{(3)}(k) \\ \label{W3-inverseFourier} & = & \frac{1}{2\pi^2} \int \frac{\dd k}{k} \, k^3 \, \frac{\sin(k r)}{k r} \, \tW_i^{(3)}(k) , \end{eqnarray}$ as well as Eq. (F.9), we obtain $ξ_{i^{'},i}^{(r)} ξ_{i^{'}, j^{'}}^{(r)} = \int_{𝒱_{i}} \frac{d r}{𝒱_{i}} ξ (r) ℐ_{j}^{(3)} (r) .$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xir_{i',i} \xir_{i',j'}} = \int_{\cV_i} \frac{\dd\vr}{\cV_i} \, \xi(r) \, \cI_j^{(3)}(r) . \label{J3ij-1} \end{equation}$ (F.20)The third term in Eq. (F.15) is obtained from Eq. (F.20) by exchanging the labels “i” and “j”.

We now turn to the four-point contribution to Eq. (66), using Eq. (6) for the halo four-point correlation function $η_{i, i^{'}; j, j^{'}}^{h}$ $Mathematical equation: \hbox{$\etah_{i,i';j,j'}$}$ . Thanks to the symmetries { i ↔ i′ } and { j ↔ j′ } we have two different contributions (a) and (b) associated with the topology of the left diagram in Fig. 2, each with a multiplicity factor 2, and four different contributions (c), (d), (e), and (f), associated with the topology of the right diagram, with multiplicity factors 4,4,2, and 2.

The first contribution (a) reads as $C_{i,j}^{LS (4, a)} = \frac{2}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{4} n^{4} 𝒱_{i} 𝒱_{j} \frac{S_{4}}{16} ξ_{i, i^{'}}^{(r)} ξ_{i; j} ξ_{i; j^{'}}^{(r)}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} C_{i,j}^{\rm LS (4,a)} = \frac{2}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^4 \, \nb^4 \, \cV_i \cV_j \, \frac{S_4}{16} \, \overline{\xir_{i,i'}\xi_{i;j}\xir_{i;j'}} \label{C4-a-1} \end{equation}$ (F.21)with $\begin{matrix} ξ_{i, i^{'}}^{(r)} ξ_{i; j} ξ_{i; j^{'}}^{(r)} & = & \int \frac{d χ_{j}}{𝒟_{i}} \int \frac{d Ω_{i} d Ω_{j}}{(ΔΩ)^{2}} \int \frac{d r i^{'} d r j^{'}}{𝒱_{i} 𝒱_{j}} ξ_{i, i^{'}} ξ_{i; j} ξ_{i; j^{'}} \\ = & ξ_{i^{'}}^{(r)} \times ξ_{i; j} ξ_{i; j^{'}}^{(r)} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{i,i'}\xi_{i;j}\xir_{i;j'}} & = & \int \frac{\dd\chi_j}{\cD_i} \int\frac{\dd\vOm_i\dd\vOm_j}{(\Delta\Omega)^2} \int\frac{\dd\vr_{i'}\dd\vr_{j'}}{\cV_i\cV_j} \, \xi_{i,i'} \xi_{i;j} \xi_{i;j'} \nonumber \\ & = & \overline{\xir_{i'}} \times \overline{\xi_{i;j} \xir_{i;j'}} . \end{eqnarray}$ (F.22)Proceeding as for Eq. (F.3), we obtain $ξ_{i; j} ξ_{i; j^{'}}^{(r)} = 2 θ_{s} \int \frac{d rr}{(𝒟 θ_{s})^{2}} ξ (r) ℐ_{j}^{(3)} (r) A^{(3)} (\frac{r}{𝒟 θ_{s}}),$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xi_{i;j} \xir_{i;j'}} = 2\theta_{\rm s} \int \frac{\dd r \; r}{(\cD\theta_{\rm s})^2} \; \xi(r) \, \cI_j^{(3)}(r) \, A^{(3)} \!\left(\frac{r}{\cD\theta_{\rm s}}\right) , \label{Kj-def} \end{equation}$ (F.23)the contribution $C_{i,j}^{LS (4, b)}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS (4,b)}$}$ is the symmetric one with respect to { i ↔ j } of Eq. (F.21); that is, the product $ξ_{i^{'}}^{(r)} \times ξ_{i; j} ξ_{i; j^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{i'}} \times \overline{\xi_{i;j} \xir_{i;j'}}$}$ is replaced by $ξ_{j^{'}}^{(r)} \times ξ_{i; j} ξ_{j; i^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{j'}} \times \overline{\xi_{i;j} \xir_{j;i'}}$}$ .

Next, the contribution $C_{i,j}^{LS (4, c)}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS (4,c)}$}$ reads as $C_{i,j}^{LS (4, c)} = \frac{4}{Q_{i} Q_{j}} \int d χ 𝒟^{5} b^{4} n^{4} 𝒱_{i} 𝒱_{j} \frac{S_{4}}{16} ξ_{i^{'},i}^{(r)} ξ_{i; j} ξ_{j, j^{'}}^{(r)}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} C_{i,j}^{\rm LS (4,c)} = \frac{4}{\QQ_i\QQ_j} \int \dd\chi \, \cD^5 \, \bb^4 \, \nb^4 \, \cV_i \cV_j \, \frac{S_4}{16} \, \overline{\xir_{i',i}\xi_{i;j}\xir_{j,j'}} \label{C4-c-1} \end{equation}$ (F.24)where the geometrical average writes as $ξ_{i^{'},i}^{(r)} ξ_{i; j} ξ_{j, j^{'}}^{(r)} = ξ_{i^{'}}^{(r)} \times ξ_{cyl} \times ξ_{j^{'}}^{(r)},$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{equation} \overline{\xir_{i',i}\xi_{i;j}\xir_{j,j'}} = \overline{\xir_{i'}} \times \xicyl \times \overline{\xir_{j'}} , \end{equation}$ (F.25)since integrals over r_i′ and r_j′ can be factored.

The contribution (d) involves $ξ_{j^{'}; i}^{(r)} ξ_{i; j} ξ_{j; i^{'}}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{j';i}\xi_{i;j}\xir_{j;i'}}$}$ where no factorization is possible. Proceeding as for Eq. (F.3) we obtain $\begin{matrix} ξ_{j^{'}; i}^{(r)} ξ_{i; j} ξ_{j; i^{'}}^{(r)} & = & 2 θ_{s} \int \frac{d rr}{(𝒟 θ_{s})^{2}} ξ (r) ℐ_{i}^{(3)} (r) ℐ_{j}^{(3)} (r) A^{(3)} (\frac{r}{𝒟 θ_{s}}) \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{j';i}\xi_{i;j}\xir_{j;i'}} & = & 2\theta_{\rm s} \int \! \frac{\dd r \; r}{(\cD\theta_{\rm s})^2} \; \xi(r) \, \cI_i^{(3)}(r) \, \cI_j^{(3)}(r) \, A^{(3)} \!\left(\!\frac{r}{\cD\theta_{\rm s}}\!\right) \cdot \nonumber \\ && \label{Lij-def} \end{eqnarray}$ (F.26)The contribution (e) involves $ξ_{j^{'}; i}^{(r)} ξ_{i, i^{'}}^{(r)} ξ_{i^{'}; j}^{(r)}$ $Mathematical equation: \hbox{$\overline{\xir_{j';i}\xir_{i,i'}\xir_{i';j}}$}$ that can be written as $\begin{matrix} ξ_{j^{'}; i}^{(r)} ξ_{i, i^{'}}^{(r)} ξ_{i^{'}; j}^{(r)} & = & 3 θ_{s} \int \frac{d rr}{(𝒟 θ_{s})^{2}} A^{(3)} (\frac{r}{𝒟 θ_{s}}) ℐ_{j}^{(3)} (r) \\ \times \int_{R_{i, -}}^{R_{i, +}} \frac{d r^{'} r^{' 2}}{R_{i, +}^{3} - R_{i, -}^{3}} ξ (r^{'}) \int_{| r - r^{'} |}^{r + r^{'}} \frac{d r^{′′} r^{′′}}{r r^{'}} ξ (r^{′′}), \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{6} \begin{eqnarray} \overline{\xir_{j';i}\xir_{i,i'}\xir_{i';j}} & = & 3\theta_{\rm s} \int \frac{\dd r \; r}{(\cD\theta_{\rm s})^2} \; A^{(3)}\!\left(\!\frac{r}{\cD\theta_{\rm s}}\!\right) \, \cI_j^{(3)}(r) \nonumber \\ \label{Tij-def} && \hspace{-1.2cm} \times \int_{\Rim}^{\Rip} \!\! \frac{\dd r' \, r'^2}{\Rip^3\!-\!\Rim^3} \xi(r') \int_{|r-r'|}^{r+r'} \frac{\dd r'' \, r''}{r \, r'} \, \xi(r'') , \end{eqnarray}$ (F.27)whereas contribution (f) is obtained from (e) by exchanging the labels “i” and “j”.

Collecting all terms, the high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ are given by Eqs. (75)–(77).

Appendix G: Computation of the mean of the estimators ŵ and ŵ^LS

We give here explicit expressions of the average (84) of the correlation function over an angular ring. As in Sect. 2.1.3, using the flat-sky and Limber’s approximations, we obtain $ξ_{i^{'}}^{(θ)} (z) = π \int_{0}^{\infty} \frac{d k}{k} \frac{Δ^{2} (k,z)}{𝒟 k} {W_{i}^{(2)}}_{˜} (k 𝒟),$ $Mathematical equation: \appendix \setcounter{section}{7} \begin{equation} \overline{\xith_{i'}}(z) = \pi \int_0^{\infty} \frac{\dd k}{k} \frac{\Delta^2(k,z)}{\cD k} \tW_i^{(2)}(k\cD) , \label{I2i-def} \end{equation}$ (G.1)where we introduced the 2D Fourier-space window of the i-ring, $\begin{matrix} {W_{i}^{(2)}}_{˜} (k_{⊥} 𝒟) & = & \int 𝒜_{i} \frac{d θ}{𝒜_{i}} e^{i k ⊥ \cdot 𝒟 θ} \\ = \frac{θ_{i, +}^{2} \begin{matrix} ˜ \\ W_{2} \end{matrix} (k_{⊥} 𝒟 θ_{i, +}) - θ_{i, -}^{2} \begin{matrix} ˜ \\ W_{2} \end{matrix} (k_{⊥} 𝒟 θ_{i, -})}{θ_{i, +}^{2} - θ_{i, -}^{2}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{7} \begin{eqnarray} \tW_i^{(2)}(k_{\perp}\cD) & = & \int_{\cA_i} \, \frac{\dd\vtheta}{\cA_i} \, {\rm e}^{\ii \vk_{\perp}\cdot\cD\vtheta} \nonumber \\ \label{W2i-def} && \hspace{-0.6cm} = \frac{\thetaip^2 \tW_2(k_{\perp}\cD\thetaip) - \thetaim^2 \tW_2(k_{\perp}\cD\thetaim)}{\thetaip^2- \thetaim^2} , \end{eqnarray}$ (G.2)and $\begin{matrix} ˜ \\ W_{2} \end{matrix}$ $Mathematical equation: \hbox{$\tW_2$}$ , associated with a full circular window, was defined in Eq. (12). In terms of the two-point correlation function, Eq. (G.1) also writes as $\begin{matrix} ξ_{i^{'}}^{(θ)} (z) & = & \frac{4}{θ_{i, +}^{2} - θ_{i, -}^{2}} {\int_{𝒟 θ_{i, -}}^{𝒟 θ_{i, +}} \frac{d x x^{2}}{𝒟^{3}} ξ (x) \sqrt{1 - 𝒟^{2} θ_{i, -}^{2} / x^{2}} \\ + \int_{𝒟 θ_{i, +}}^{\infty} \frac{d x x^{2}}{𝒟^{3}} ξ (x) [\sqrt{1 - 𝒟^{2} θ_{i, -}^{2} / x^{2}} \\ - \sqrt{1 - 𝒟^{2} θ_{i, +}^{2} / x^{2}}]}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{7} \begin{eqnarray} \overline{\xith_{i'}}(z) & \!\!=\! & \frac{4}{\thetaip^2 \!-\! \thetaim^2} \biggl\lbrace \int_{\cD\thetaim}^{\cD\thetaip} \frac{\dd x \; x^2}{\cD^3} \; \xi(x) \, \sqrt{1 \!-\! \cD^2\thetaim^2/x^2} \nonumber \\ && + \int_{\cD\thetaip}^{\infty} \frac{\dd x \; x^2}{\cD^3} \; \xi(x) \, \left[ \sqrt{1 \!-\! \cD^2\thetaim^2/x^2} \right . \nonumber \\ && \left. - \sqrt{1 \!-\! \cD^2\thetaip^2/x^2} \right] \biggl\rbrace , \end{eqnarray}$ (G.3)which avoids introducing oscillatory kernels.

Appendix H: Computation of the covariance of ŵ^LS

The low-order contribution (94) to the covariance matrix of the estimator ŵ^LS involves the angular average (95). Using Limber’s approximation it also reads as $ξ_{i^{'}, j^{'}}^{(θ)} = π \int_{0}^{\infty} \frac{d k}{k} \frac{Δ^{2} (k,z)}{𝒟 k} {W_{i}^{(2)}}_{˜} (k 𝒟) {W_{j}^{(2)}}_{˜} (k 𝒟) .$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \overline{\xith_{i',j'}} = \pi \int_0^{\infty} \frac{\dd k}{k} \frac{\Delta^2(k,z)}{\cD k} \tW_i^{(2)}(k\cD) \tW_j^{(2)}(k\cD) . \label{I2ij-def} \end{equation}$ (H.1)We now compute the high-order terms of the covariance $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , which are given in Eqs. (96)–(98). A first contribution (96) ari-ses from the product ξ_i;j′ξ_i′;j in Eq. (92). Using Limber’s approximation and integrating over angles yields $\begin{matrix} C_{i,j}^{LS (4,ξξ)} & = & \frac{2 (2 π)^{2}}{N^{4}} \int d χ_{i} 𝒟_{i}^{4} b \begin{matrix} 2 \\ i \end{matrix} n \begin{matrix} 2 \\ i \end{matrix} \int d χ_{j} 𝒟_{j}^{4} b \begin{matrix} 2 \\ j \end{matrix} n \begin{matrix} 2 \\ j \end{matrix} \\ \times \int d k 1 ⊥ d k 2 ⊥ P (k_{1 ⊥}; z_{i}) P (k_{2 ⊥}; z_{j}) {W_{i}^{(2)}}_{˜} (k_{2 ⊥} 𝒟_{j}) \\ \times {W_{j}^{(2)}}_{˜} (k_{1 ⊥} 𝒟_{i}) \begin{matrix} ˜ \\ W_{2} \end{matrix} [(𝒟_{j} k 2 ⊥ - 𝒟_{i} k 1 ⊥) θ_{s}]^{2} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} C_{i,j}^{\rm LS (4,\xi\xi)} & = & \frac{2(2\pi)^2}{\Nb^4} \int \dd\chi_i \, \cD_i^4 \, \bb_i^2 \, \nb_i^2 \int \dd\chi_j \, \cD_j^4 \, \bb_j^2 \, \nb_j^2 \nonumber \\ && \times \int \dd\vk_{1\perp} \dd\vk_{2\perp} \, P(k_{1\perp};z_i) P(k_{2\perp};z_j) \, \tW_i^{(2)}(k_{2\perp}\cD_j) \nonumber \\ \label{Cw-LS-xixi-1} && \times \tW_j^{(2)}(k_{1\perp}\cD_i) \tW_2[(\cD_j\vk_{2\perp}-\cD_i\vk_{1\perp})\theta_{\rm s}]^2 . \end{eqnarray}$ (H.2)Introducing a Dirac factor $Mathematical equation: \hbox{$\delta_{\rm D}(\cD_j\vk_{2\perp}-\cD_i\vk_{1\perp}-\vx_{\perp})$}$ , which we write with the usual exponential representation in a fashion similar to Eq. (F.5), we obtain after integration over angles $\begin{matrix} C_{i,j}^{LS (4,ξξ)} & = & \frac{2 (2 π)^{4}}{N^{4}} \int d χ_{i} 𝒟_{i}^{4} b \begin{matrix} 2 \\ i \end{matrix} n \begin{matrix} 2 \\ i \end{matrix} \int d χ_{j} 𝒟_{j}^{4} b \begin{matrix} 2 \\ j \end{matrix} n \begin{matrix} 2 \\ j \end{matrix} \\ \times \int d k_{1} k_{1} P (k_{1}; z_{i}) {W_{j}^{(2)}}_{˜} (k_{1} 𝒟_{i}) \int d k_{2} k_{2} P (k_{2}; z_{j}) \\ \times {W_{i}^{(2)}}_{˜} (k_{2} 𝒟_{j}) \int d y y J_{0} (y k_{1} 𝒟_{i}) J_{0} (y k_{2} 𝒟_{j}) \\ \times \int d x x J_{0} (xy) \begin{matrix} ˜ \\ W_{2} \end{matrix} (x θ_{s})^{2} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} C_{i,j}^{\rm LS (4,\xi\xi)} & = & \frac{2(2\pi)^4}{\Nb^4} \int \dd\chi_i \, \cD_i^4 \, \bb_i^2 \, \nb_i^2 \int \dd\chi_j \, \cD_j^4 \, \bb_j^2 \, \nb_j^2 \nonumber \\ && \hspace{-1.1cm} \times \int \dd k_1 \, k_1 P(k_1;z_i) \tW_j^{(2)}(k_1\cD_i) \int \dd k_2 \, k_2 P(k_2;z_j) \nonumber \\ && \hspace{-1.1cm} \times \tW_i^{(2)}(k_2\cD_j) \int \dd y \, y J_0(yk_1\cD_i) J_0(yk_2\cD_j) \nonumber \\ && \hspace{-1.1cm} \times \int \dd x \, x J_0(x y) \tW_2(x\theta_{\rm s})^2 . \end{eqnarray}$ (H.3)Then, after a rescaling of variables x and y, and defining the quantities $A^{(2)} (y) = \int_{0}^{\infty} d u u J_{0} (yu) \begin{matrix} ˜ \\ W_{2} \end{matrix} (u)^{2},$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} A^{(2)}(y) = \int_0^{\infty} \dd u \, u \, J_0(y u) \, \tW_2(u)^2 , \label{A2-def} \end{equation}$ (H.4) $B_{i}^{(2)} (θ) = \int d χ 𝒟^{5} b^{2} n^{2} ℐ_{i}^{(2)} (θ),$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} B_i^{(2)}(\theta) = \int \dd\chi \, \cD^5 \, \bb^2 \, \nb^2 \, \cI_i^{(2)}(\theta) , \label{B2-def} \end{equation}$ (H.5) $ℐ_{i}^{(2)} (θ) = \int \frac{d k}{k} \frac{Δ^{2} (k)}{𝒟 k} J_{0} (k 𝒟 θ) {W_{i}^{(2)}}_{˜} (k 𝒟),$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \cI_i^{(2)}(\theta) = \int \frac{\dd k}{k} \, \frac{\Delta^2(k)}{\cD k} \, J_0(k\cD\theta) \, \tW_i^{(2)}(k\cD) , \label{cI2-def} \end{equation}$ (H.6)we obtain the expression (96), using the property A⁽²⁾(y) = 0 for y > 2. As compared with Eq. (H.2), introducing the Dirac factor and the two auxiliary variables x and y has allowed us to partly factor in the integrals, as seen in Eq. (96), which is convenient for numerical computations. Again, it is useful to express Eq. (H.6) in terms of the real-space two-point correlation function, which yields $\begin{matrix} ℐ_{i}^{(2)} (θ) & = & \frac{2}{π} \int \frac{d xξ (x)}{𝒟 (θ_{i, +}^{2} - θ_{i, -}^{2})} [θ_{i, +}^{2} W_{2} (\frac{𝒟 θ}{x}, \frac{𝒟 θ_{i, +}}{x}) \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \cI_i^{(2)}(\theta) & = & \frac{2}{\pi} \int \frac{\dd x \; \xi(x)}{\cD (\thetaip^2-\thetaim^2)} \left[ \thetaip^2 W_2\!\left(\frac{\cD\theta}{x},\frac{\cD\thetaip}{x}\right) \right. \nonumber \\ && \left. - \thetaim^2 W_2\!\left(\frac{\cD\theta}{x},\frac{\cD\thetaim}{x}\right) \right] , \label{cI2-xi} \end{eqnarray}$ (H.7)with $W_{2} (a,b) = \int_{0}^{\infty} d u \sin (u) J_{0} (au) \begin{matrix} ˜ \\ W_{2} \end{matrix} (bu) .$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} W_2(a,b) = \int_0^{\infty} \dd u \, \sin(u) \, J_0(a u) \, \tW_2(b u) . \label{W2-ab-def} \end{equation}$ (H.8)Although there is no explicit expression for the integral (H.8) for arbitrary (a,b), for |a − b| > 1 we can use the properties $\begin{matrix} b < a - 1 : & W_{2} = 0 \\ b > a + 1 : & W_{2} = 2 / b^{2} . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \begin{array}{rl} b<a-1 \!: & W_2= 0 \\ b>a+1\! : & W_2= 2/b^2 . \\ \end{array} \end{equation}$ (H.9)In the band |a − b| < 1 one can check that W₂(a,b) is positive and decays as ~ b^-2 for large b, so that the real-space expression (H.7) is again more convenient than the Fourier-space expression (H.6).

The second contribution (97) arises from the three-point correlation ζ in Eq. (91). Using Eq. (4) it reads as $\begin{matrix} C_{i,j}^{LS (3,ζ)} & = & \frac{4}{(ΔΩ) N^{4}} \int d χ 𝒟^{8} b^{3} n^{3} \frac{S_{3}}{3} \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} C_{i,j}^{\rm LS (3,\zeta)} & = & \frac{4}{(\Delta\Omega)\Nb^4} \int \dd\chi \, \cD^8 \, \bb^3 \, \nb^3 \, \frac{S_3}{3} \nonumber \\ && \times \, \left[ \overline{\xith_{i,i'} \xith_{i,j'}} \label{C3-w-zeta-1} + \overline{\xith_{i',i} \xith_{i',j'}} + \overline{\xith_{j',i} \xith_{j',i'}} \right] , \end{eqnarray}$ (H.10)where the three terms in the brackets, which correspond to the three diagrams in Fig. 1, are again geometrical averages along the lines of sight, which we compute with Limber’s approximation. In particular, the first term factors as $ξ_{i, i^{'}}^{(θ)} ξ_{i, j^{'}}^{(θ)} = ξ_{i^{'}}^{(θ)} \times ξ_{j^{'}}^{(θ)},$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \overline{\xith_{i,i'} \xith_{i,j'}} = \overline{\xith_{i'}} \times \overline{\xith_{j'}} , \end{equation}$ (H.11)where $ξ_{i^{'}}^{(θ)}$ $Mathematical equation: \hbox{$\overline{\xith_{i'}}$}$ was defined in Eq. (84), while the second term reads as $\begin{matrix} ξ_{i^{'},i}^{(θ)} ξ_{i^{'}, j^{'}}^{(θ)} & = & \frac{(2 π)^{2}}{𝒟^{2}} \int d k 1 ⊥ d k 2 ⊥ P (k_{1 ⊥}) P (k_{2 ⊥}) \\ \times {W_{i}^{(2)}}_{˜} (| k 1 ⊥ + k 2 ⊥ | 𝒟) {W_{j}^{(2)}}_{˜} (k_{2 ⊥} 𝒟) . \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \overline{\xith_{i',i} \xith_{i',j'}} & = & \frac{(2\pi)^2}{\cD^2} \int\dd\vk_{1\perp} \dd\vk_{2\perp} \, P(k_{1\perp}) P(k_{2\perp}) \nonumber \\ \label{xi-ii-xi-ij-w-1} && \times \, \tW_i^{(2)}(|\vk_{1\perp}+\vk_{2\perp}|\cD) \, \tW_j^{(2)}(k_{2\perp}\cD) . \end{eqnarray}$ (H.12)With the same factorization method, and using the inverse Fourier transform of the 2D shell (G.2), $\begin{matrix} \frac{Θ (θ \in 𝒜_{i})}{𝒜_{i}} & = & \int \frac{d k ⊥ 𝒟^{2}}{(2 π)^{2}} e^{- i k ⊥ \cdot 𝒟 θ} {W_{i}^{(2)}}_{˜} (k_{⊥} 𝒟) \\ = & \frac{𝒟^{2}}{2 π} \int \frac{d k}{k} k^{2} J_{0} (k 𝒟 θ) {W_{i}^{(2)}}_{˜} (k 𝒟), \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \frac{\Theta(\vtheta\in\cA_i)}{\cA_i} & = & \int \frac{\dd\vk_{\perp}\,\cD^2}{(2\pi)^2} \, {\rm e}^{-\ii\vk_{\perp}\cdot\cD\vtheta} \, \tW_i^{(2)}(k_{\perp}\cD) \\ \label{W2-inverseFourier} & = & \frac{\cD^2}{2\pi} \int \frac{\dd k}{k} \, k^2 \, J_0(k\cD\theta) \, \tW_i^{(2)}(k\cD) , \end{eqnarray}$ we obtain $ξ_{i^{'},i}^{(θ)} ξ_{i^{'}, j^{'}}^{(θ)} = 2 π^{2} \int_{θ_{i, -}}^{θ_{i, +}} \frac{d θθ}{θ_{i, +}^{2} - θ_{i, -}^{2}} ξ_{cyl} (θ) ℐ_{j}^{(2)} (θ),$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \overline{\xith_{i',i} \xith_{i',j'}} = 2\pi^2 \int_{\thetaim}^{\thetaip} \frac{\dd\theta \; \theta}{\thetaip^2-\thetaim^2} \, \xi_{\rm cyl}(\theta) \, \cI_j^{(2)}(\theta) , \label{xi-ii-xi-ij-w-2} \end{equation}$ (H.15)where we introduced $\begin{matrix} ξ_{cyl} (θ) & = & \int \frac{d k}{k} \frac{Δ^{2} (k)}{𝒟 k} J_{0} (k 𝒟 θ) \\ = & \frac{2 θ}{π} \int_{0}^{1} \frac{d u}{u^{2} \sqrt{1 - u^{2}}} ξ (\frac{𝒟 θ}{u}) \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \xi_{\rm cyl}(\theta) & = & \int\frac{\dd k}{k} \, \frac{\Delta^2(k)}{\cD k} \, J_0(k\cD\theta) \\ \label{xi-cyl} & = & \frac{2\theta}{\pi} \int_0^1 \frac{\dd u}{u^2\sqrt{1-u^2}} \; \xi\!\left(\frac{\cD\theta}{u}\right) \cdot \end{eqnarray}$ The third term in Eq. (H.10) is obtained from Eq. (H.15) by exchanging the labels “i” and “j”.

The third contribution (98) arises from the four-point correlation η in Eq. (92). As in Appendix F, we must compute the various terms associated with the diagrams of Fig. 2, with contributions (a) and (b) associated with the left diagram and contributions (c), (d), (e), and (f) associated with the right diagram. The first contribution (a) leads to $C_{i,j}^{LS (4, a)} = \frac{2}{N^{4}} \int d χ 𝒟^{11} b^{4} n^{4} \frac{S_{4}}{16} ξ_{i, i^{'}}^{(θ)} ξ_{i; j} ξ_{i; j^{'}}^{(θ)} .$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} C_{i,j}^{\rm LS (4,a)} = \frac{2}{\Nb^4} \int \dd\chi \, \cD^{11} \, \bb^4 \, \nb^4 \, \frac{S_4}{16} \, \overline{\xith_{i,i'}\xi_{i;j}\xith_{i;j'}} . \end{equation}$ (H.18)As in Appendix F, this geometrical average factors as $ξ_{i, i^{'}}^{(θ)} ξ_{i; j} ξ_{i; j^{'}}^{(θ)} = ξ_{i^{'}}^{(θ)} \times ξ_{i; j} ξ_{i; j^{'}}^{(θ)},$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \overline{\xith_{i,i'}\xi_{i;j}\xith_{i;j'}} = \overline{\xith_{i'}} \times \overline{\xi_{i;j} \xith_{i;j'}} , \end{equation}$ (H.19)with $\begin{matrix} ξ_{i; j} ξ_{i; j^{'}}^{(θ)} & = & \frac{(2 π)^{2}}{𝒟^{2}} \int d k 1 ⊥ d k 2 ⊥ P (k_{1 ⊥}) P (k_{2 ⊥}) {W_{j}^{(2)}}_{˜} (k_{1 ⊥} 𝒟) \\ \times \begin{matrix} ˜ \\ W_{2} \end{matrix} (| k 1 ⊥ + k 2 ⊥ | 𝒟 θ_{s})^{2} \\ = & π^{2} \int_{0}^{2 θ_{s}} \frac{d θθ}{θ_{s}^{2}} ξ_{cyl} (θ) ℐ_{j}^{(2)} (θ) A^{(2)} (\frac{θ}{θ_{s}}) \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \overline{\xi_{i;j} \xith_{i;j'}} & = & \frac{(2\pi)^2}{\cD^2} \int\dd\vk_{1\perp} \dd\vk_{2\perp} P(k_{1\perp}) P(k_{2\perp}) \tW_j^{(2)}(k_{1\perp}\cD) \nonumber \\ && \times \, \tW_2(|\vk_{1\perp}+\vk_{2\perp}|\cD\theta_{\rm s})^2 \\ \label{K2-xi} & = & \pi^2 \int_0^{2\theta_{\rm s}} \frac{\dd\theta \; \theta}{\theta_{\rm s}^2} \, \xi_{\rm cyl}(\theta) \, \cI_j^{(2)}(\theta) \, A^{(2)}\!\left(\frac{\theta}{\theta_{\rm s}}\right) \cdot \end{eqnarray}$ Contribution (b) is the symmetric one of (a) with respect to i ↔ j.

Next, contribution (c) involves the geometrical average $ξ_{i^{'},i}^{(θ)} ξ_{i; j} ξ_{j, j^{'}}^{(θ)}$ $Mathematical equation: \hbox{$\overline{\xith_{i',i}\xi_{i;j}\xith_{j,j'}}$}$ , which again factors as $ξ_{i^{'},i}^{(θ)} ξ_{i; j} ξ_{j, j^{'}}^{(θ)} = ξ_{i^{'}}^{(θ)} \times ξ_{cyl} \times ξ_{j^{'}}^{(θ)} .$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{equation} \overline{\xith_{i',i}\xi_{i;j}\xith_{j,j'}} = \overline{\xith_{i'}} \times \xicyl \times \overline{\xith_{j'}} . \end{equation}$ (H.22)

The contribution (d) involves the average $\begin{matrix} ξ_{j^{'}; i}^{(θ)} ξ_{i; j} ξ_{j; i^{'}}^{(θ)} & = & \int \frac{d χ_{i^{'}} d χ_{j} d χ_{j^{'}}}{𝒟^{3}} \int \frac{d Ω_{i} d Ω_{j}}{(ΔΩ)^{2}} \int \frac{d θ_{i^{'}} d θ_{j^{'}}}{𝒜_{i} 𝒜_{j}} \\ \times ξ_{j^{'}; i} ξ_{i; j} ξ_{j; i^{'}}, \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \overline{\xith_{j';i}\xi_{i;j}\xith_{j;i'}} & = & \int \frac{\dd\chi_{i'}\dd\chi_j\dd\chi_{j'}}{\cD^3} \int \frac{\dd\vOm_i\dd\vOm_j}{(\Delta\Omega)^2} \int\frac{\dd\vtheta_{i'}\dd\vtheta_{j'}}{\cA_i\cA_j} \nonumber \\ && \times \, \xi_{j';i}\xi_{i;j}\xi_{j;i'} , \end{eqnarray}$ (H.23)which also writes as $\begin{matrix} ξ_{j^{'}; i}^{(θ)} ξ_{i; j} ξ_{j; i^{'}}^{(θ)} & = & π^{3} \int_{0}^{2 θ_{s}} \frac{d θθ}{θ_{s}^{2}} ξ_{cyl} (θ) ℐ_{i}^{(2)} (θ) ℐ_{j}^{(2)} (θ) A^{(2)} (\frac{θ}{θ_{s}}) \cdot \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \overline{\xith_{j';i}\xi_{i;j}\xith_{j;i'}} & \! = \! & \pi^3 \! \int_0^{2\theta_{\rm s}} \! \frac{\dd\theta \; \theta}{\theta_{\rm s}^2} \, \xi_{\rm cyl}(\theta) \, \cI_i^{(2)}(\theta) \, \cI_j^{(2)}(\theta) \, A^{(2)}\!\left(\frac{\theta}{\theta_{\rm s}}\right) \cdot \nonumber \\ && \label{L2-xi} \end{eqnarray}$ (H.24)Contribution (e) involves $ξ_{j^{'}; i}^{(θ)} ξ_{i, i^{'}}^{(θ)} ξ_{i^{'}; j}^{(θ)}$ $Mathematical equation: \hbox{$\overline{\xith_{j';i}\xith_{i,i'}\xith_{i';j}}$}$ , which reads as $\begin{matrix} ξ_{j^{'}; i}^{(θ)} ξ_{i, i^{'}}^{(θ)} ξ_{i^{'}; j}^{(θ)} & = & 2 π^{2} \int_{0}^{2 θ_{s}} \frac{d θθ}{θ_{s}^{2}} A^{(2)} (\frac{θ}{θ_{s}}) ℐ_{j}^{(2)} (θ) \int_{θ_{i, -}}^{θ_{i, +}} \frac{d θ^{'} θ^{'}}{θ_{i, +}^{2} - θ_{i, -}^{2}} \\ \times ξ_{cyl} (θ^{'}) \int_{0}^{π} d ϕ ξ_{cyl} (\sqrt{θ^{2} + θ^{' 2} + 2 θ θ^{'} \cos ϕ}), \end{matrix}$ $Mathematical equation: \appendix \setcounter{section}{8} \begin{eqnarray} \overline{\xith_{j';i}\xith_{i,i'}\xith_{i';j}} & \!\! = \! & 2\pi^2 \!\! \int_0^{2\theta_{\rm s}} \! \frac{\dd\theta\;\theta}{\theta_{\rm s}^2} \, A^{(2)} \! \left(\frac{\theta}{\theta_{\rm s}}\right) \cI_j^{(2)}(\theta) \int_{\thetaim}^{\thetaip} \!\!\!\! \frac{\dd\theta' \, \theta'}{\thetaip^2\!-\!\thetaim^2} \nonumber \\ && \hspace{-0.9cm} \times \xi_{\rm cyl}(\theta') \int_0^{\pi}\! \dd\varphi \; \xi_{\rm cyl}(\sqrt{\theta^2\!+\!\theta'^2\!+\!2\theta\theta'\cos\varphi}) , \end{eqnarray}$ (H.25)whereas contribution (f) is obtained from (e) by exchanging the labels “i” and “j”.

Collecting all terms, the high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ are given by Eqs. (96)–(98).

Appendix I: Scaling of the number counts signal-to-noise in simulations

Fig. I.1

Scaling of the number-counts signal-to-noise ratio by $\sqrt{ΔΩ}$ $Mathematical equation: \hbox{$\sqrt{\Delta\Omega}$}$ as computed in the Horizon simulation, see Sect. 2.2. Different configurations are displayed according to the total surveyed area ΔΩ, the number of subfields $Mathematical equation: \hbox{$N_f=\cN$}$ , and the mass limit. In the right caption, ΔΩ is expressed in deg² and the mass unit is h^-1 M_⊙.

Fig. I.2

Same as Fig. I.1 but with a scaling that depends on the number of subfields: $Mathematical equation: \hbox{$\cN^{-n/4} \, (\Delta\Omega)^{(n+2)/4}$}$ , with n = −0.6

Fig. J.1

Left panel: cluster mass associated with a 50%, 80%, or 95% detection probability (from bottom to top), for the XXL selection function C1, as a function of redshift. Middle panel: minimum detectable cluster mass, as a function of redshift, for the Planck space mission. Right panel: cluster mass associated with a 50%, 80%, or 95% detection probability (from bottom to top), for the Erosita selection function as a function of redshift (we consider a flux limit of 4 × 10^-14 erg s^-1 cm^-2 in the [0.5 − 2] keV band).

We present here the result of scaling the number counts with the total surveyed area ΔΩ and the number of subfields $Mathematical equation: \hbox{$\cN$}$ . Figures I.1 and I.2 show the scalings expected from Eq. (35) in the shot-noise and sample-variance dominated regimes. Multiple survey configurations are explored by varying the total surveyed area (ΔΩ = 25,50, and 100 deg²), the number of subfields ( $Mathematical equation: \hbox{$\cN=1, 2$}$ , and 4), and the mass threshold (M > 2 × 10¹³,10¹⁴, and 5 × 10¹⁴ h^-1 M_⊙).

The weak scatter in those plots shows that (35) provides a valid approximation of the signal-to-noise scaling with respect to ΔΩ and $Mathematical equation: \hbox{$\cN$}$ . In agreement with the discussion in Sect. 3.2.1 and Fig. 7, at high redshift and for high mass, the scaling $\sqrt{(ΔΩ)}$ $Mathematical equation: \hbox{$\sqrt{(\Delta\Omega)}$}$ shown in Fig. I.1 is best, as expected for the shot-noise dominated regime, whereas at low redshift and for low mass the scaling $Mathematical equation: \hbox{$\cN^{-n/4} \, (\Delta\Omega)^{(n+2)/4}$}$ shown in Fig. I.2 (with n = −0.6) is best, as expected for the sample-variance dominated regime.

Appendix J: Selection functions used for various surveys

We give in Fig. J.1 the selection functions that we use for several cluster surveys investigated in Sect. 6. For Planck, the curves shown in the middle panel corresponds to a 100% detection probability.

For the other surveys studied in Sect. 6 we consider simple mass thresholds, rather than detailed selection functions. More precisely, we consider halos above the two thresholds 5 × 10¹³ h^-1 M_⊙ and 5 × 10¹⁴ h^-1 M_⊙ for DES and Euclid, and above 5 × 10¹⁴ h^-1 M_⊙ for SPT.

Appendix K: Dependence on cosmology

In this appendix we investigate the dependence of the results obtained in Sect. 6 on the value of the cosmological parameters. Thus, in addition to the WMAP7 cosmology recalled in the first line of Table K.1, which was used in Sect. 6, we also consider the three modified cosmologies where one among the three parameters h, Ω_m, and σ₈ is changed to the values shown in the second line of Table K.1. They correspond to “” deviations from WMAP7 (Komatsu et al. 2011) and describe current uncertainties. (When we vary Ω_m we keep a flat ΛCDM universe and we change Ω_de according to Ω_de = 1−Ω_m.)

Thus, we compare in Figs. K.1–K.3, the three curves obtained for these three alternative cosmologies with the curve that was obtained in Sect. 6 for the fiducial WMAP7 cosmology. To avoid overcrowding the figures we only consider the all-sky

surveys, Planck, Erosita, and Euclid. We can see that the main features of these figures are not modified when we consider these alternative cosmologies, so our results and conclusions are not sensitive to the precise value of the cosmological parameters. As expected, we can also check that shot-noise effects become less important, with respect to sample-variance contributions, when σ₈ is increased.

Table K.1

Three alternative cosmologies.

Fig. K.1

The ratio $σ_{N_{i}}^{(s . n .)} / σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}/\sigma_{N_i}^{(s.v.)}$}$ of the rms shot-noise contribution $σ_{N_{i}}^{(s . n .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}$}$ to the rms sample-variance contribution $σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.v.)}$}$ , of the covariance of the angular number densities N_i, as in Fig. 34. The fiducial curve that was shown in Fig. 34 is the solid line (mean WMAP7 cosmology), whereas the dashed, dot-dashed, and dotted lines correspond to the three cosmologies where either h, Ω_m, or σ₈, is changed to the value given in the second line of Table K.1.

Fig. K.2

The ratio $σ_{ξ_{i}}^{(2)} / σ_{ξ_{i}}^{(3 + 4)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}/\sigma_{\xi_i}^{(3+4)}$}$ of the rms contributions $\sqrt{C^{(2)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(2)}}$}$ and $\sqrt{C^{(3)} + C^{(4)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(3)}+C^{(4)}}$}$ of the covariance matrix of the estimator $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ , as in Fig. 35. The line styles are as in Fig. K.1 and Table K.1.

Fig. K.3

The ratio $σ_{ξ_{i}}^{(ξξ + ζ + η)} / σ_{ξ_{i}}^{(ξ)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(\xi\xi+\zeta+\eta)}/\sigma_{\xi_i}^{(\xi)}$}$ of the rms high-order contribution (75)–(77) to the rms low-order contribution (second term in Eq. (69)) of the sample variance of the correlation ξ_i, as in Fig. 36. The line styles are as in Fig. K.1 and Table K.1.

A weaker hypothesis would be to write $ξ_{i,j}^{h} = b_{i,j}^{2} ξ (x_{ij}; z)$ $Mathematical equation: \hbox{$\xih_{i,j} = b^2_{i,j} \xi(x_{ij};z)$}$ , where the dependence on M_i and M_j does not factor in (i.e., $b_{i,j}^{2} \neq b_{i} b_{j}$ $Mathematical equation: \hbox{$b^2_{i,j} \neq b_i b_j$}$ ), but keeping the factorization with respect to the relative distance x_ij. This only slightly modifies our expressions in a straightforward manner.

An alternative would be to use a halo model (Cooray & Sheth 2002), coupled to perturbation theory predictions on large scales. This could provide more accurate estimates, since such an approach is able to describe low-order correlation functions well from very large to small scales (Scoccimarro et al. 2001; Giocoli et al. 2010; Valageas & Nishimichi 2011a,b). However, this would introduce several contributions associated with “1-halo” up to “4-halo” terms, and would require additional parameters such as halo occupation functions and high-order bias parameters, depending on the objects that one considers. Therefore, we do not investigate this approach in this paper (although it would certainly deserve further attention), and we restrict ourselves to the simpler hierarchical models, as described in Eqs. (4)–(6).

Within a local bias model, one writes the halo density field as δ_h = ∑ _kb_kδ^k/k !, where δ is the matter density contrast smoothed on large scales. Then, the halo many-body correlations also depend on the higher order bias coefficients b_k (Fry & Gaztanaga 1993), but for simplicity we only consider a linear bias model here (i.e. b_k = 0 for k ≥ 2).

⁴

For more complicated angular shapes we can still define a Fourier-space window $\begin{matrix} _{˜} \\ W_{2} \end{matrix} (k_{⊥} 𝒟 θ_{s})$ $Mathematical equation: \hbox{$\tW_2(\vk_{\perp}\cD\theta_{\rm s})$}$ , normalized by $\begin{matrix} _{˜} \\ W_{2} \end{matrix} (0) = 1$ $Mathematical equation: \hbox{$\tW_2(0)=1$}$ , but it will also depend on the direction of k_⊥.

⁵

Although M stands for the mass of the objects, as for the mass function of clusters of galaxies, or of galaxies themselves, it can also represent any other quantity, such as temperature, luminosity, or a vector made of several such quantities.

⁶

https://www.darkenergysurvey.org/index.shtml

⁷

http://pole.uchicago.edu/

⁸

For all the considered surveys, the M_lim(z) curves were estimated using specific assumptions as to the evolution of the X-ray, optical, and S-Z properties of the clusters, hence on their detectability. Moreover, the assumed cosmology was either WMAP5 or WMAP7, to be consistent with published analysis of each survey. Therefore, these hypotheses may not be totally self-consistent with respect to each other, but the main results of the comparison between the expected signals should remain valid.

⁹

Because we consider a symmetric two-sided angular window (i.e., two cones of angle θ_s around the north and south galactic poles), the coefficients ${W_{2}^{(ℓ,m)}}_{˜}$ $Mathematical equation: \hbox{$\tW_2^{(\ell,m)}$}$ vanish for nonzero m and for odd ℓ. For even ℓ, they are still given by Eqs. (44)–(45) where we substitute (ΔΩ) → (ΔΩ)_half, where (ΔΩ)_half = (ΔΩ)/2 is the area associated with a single side (so that the last expression (45) still applies).

¹⁰

The integration over the points i and i′ in Eq. (D.1) should be understood as , which is more clearly symmetric, where Θ is a top-hat window that takes values 0 or 1 with obvious notations. In practice, we actually slightly “break” this symmetry by using the variables (χ_i,Ω_i;r_i′) as in Eq. (D.1) if we do not take boundary effects into account, as in this paper.

¹¹

In practice, we compute in advance ξ(x,z) on a 2D grid, over distance and redshift, using Eq. (F.1) and the nonlinear power spectrum from Smith et al. (2003). To obtain meaningful and accurate results, one needs to make sure that ξ(x) is accurately computed, especially on large scales where one should recover linear theory and a smooth two-point correlation.

Acknowledgments

We thank the anonymous referee for comments that helped to improve the presentation of the paper. F.P. acknowledges support from Grant No. 50 OR 1003 of the Deutsches Zemtrum für Luft- und Raumfahrt (DLR) and from the Transregio Programme TR33 of the Deutsche Forschungsgemeinschaft (DfG).

References

Abazajian, K. N., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2009, APJS, 182, 543 [Google Scholar]
Adami, C., Mazure, A., Pierre, M., et al. 2011, A&A, 526, A18 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Arnaud, M., Pointecouteau, E., & Pratt, G. W. 2005, A&A, 441, 893 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Arnaud, M., Pratt, G. W., Piffaretti, R., et al. 2010, A&A, 517, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Benoist, C., Maurogordato, S., da Costa, L. N., Cappi, A., & Schaeffer, R. 1996, ApJ, 472, 452 [NASA ADS] [CrossRef] [Google Scholar]
Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [NASA ADS] [CrossRef] [EDP Sciences] [MathSciNet] [Google Scholar]
Bernstein, G. M. 1994, ApJ, 424, 569 [NASA ADS] [CrossRef] [Google Scholar]
Cohn, J. D. 2006, New Astron., 11, 226 [NASA ADS] [CrossRef] [Google Scholar]
Cole, S., Percival, W. J., Peacock, J. A., et al. 2005, MNRAS, 362, 505 [NASA ADS] [CrossRef] [Google Scholar]
Colombi, S., Bouchet, F. R., & Hernquist, L. 1996, ApJ, 465, 14 [NASA ADS] [CrossRef] [Google Scholar]
Cooray, A., & Sheth, R. 2002, Phys. Rep., 372, 1 [NASA ADS] [CrossRef] [Google Scholar]
Crocce, M., Cabre, A., & Gaztanaga, E. 2011, MNRAS, 414, 329 [NASA ADS] [CrossRef] [Google Scholar]
Croton, D. J., Gaztanaga, E., Baugh, C. M., et al. 2004, MNRAS, 352, 1232 [NASA ADS] [CrossRef] [Google Scholar]
Desjacques, V., Crocce, M., Scoccimarro, R., & Sheth, R. K. 2010, Phys. Rev. D, 82, 103529 [NASA ADS] [CrossRef] [Google Scholar]
Eisenstein, D. J., & Hut, P. 1998, ApJ, 498, 137 [NASA ADS] [CrossRef] [Google Scholar]
Eisenstein, D. J., & Zaldarriaga, M. 2001, ApJ, 546, 2 [NASA ADS] [CrossRef] [Google Scholar]
Eisenstein, D. J., Hu, W., & Tegmark, M. 1998, ApJ, 504, L57 [NASA ADS] [CrossRef] [Google Scholar]
Eisenstein, D. J., Zehavi, I., Hogg, D. W., et al. 2005, ApJ, 633, 560 [NASA ADS] [CrossRef] [Google Scholar]
Evrard, A. E. 1989, ApJ, 341, L71 [NASA ADS] [CrossRef] [Google Scholar]
Feldman, H. A., Kaiser, N., & Peacock, J. A. 1994, ApJ, 426, 23 [NASA ADS] [CrossRef] [Google Scholar]
Fry, J. N. 1984, ApJ, 279, 499 [NASA ADS] [CrossRef] [Google Scholar]
Fry, J. N., & Gaztanaga, E. 1993, ApJ, 413, 447 [NASA ADS] [CrossRef] [Google Scholar]
Gaztanaga, E., Norberg, P., Baugh, C. M., & Croton, D. J. 2005, MNRAS, 364, 620 [NASA ADS] [CrossRef] [Google Scholar]
Giocoli, C., Bartelmann, M., Sheth, R. K., & Cacciato, M. 2010, MNRAS, 408, 300 [NASA ADS] [CrossRef] [Google Scholar]
Goroff, M. H., Grinstein, B., Rey, S.-J., & Wise, M. B. 1986, ApJ, 311, 6 [NASA ADS] [CrossRef] [Google Scholar]
Gradshteyn, I. S., & Ryzhik, I. M. 1965, Table of integrals, series, and products (New York: Academic Press) [Google Scholar]
Groth, E. J., & Peebles, P. J. E. 1977, ApJ, 217, 385 [NASA ADS] [CrossRef] [Google Scholar]
Harker, G., Cole, S., & Jenkins, A. 2007, MNRAS, 382, 1503 [NASA ADS] [CrossRef] [Google Scholar]
Hu, W. 2000, Phys. Rev. D, 62, 043007 [NASA ADS] [CrossRef] [Google Scholar]
Hu, W., & Kravtsov, A. V. 2003, ApJ, 584, 702 [NASA ADS] [CrossRef] [Google Scholar]
Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]
Kaiser, N. 1992, ApJ, 388, 272 [NASA ADS] [CrossRef] [Google Scholar]
Kazin, E. A., Blanton, M. R., Scoccimarro, R., et al. 2010, ApJ, 710, 1444 [NASA ADS] [CrossRef] [Google Scholar]
Kerscher, M., Szapudi, I., & Szalay, A. S. 2000, ApJ, 535, L13 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
Knebe, A., Knollmann, S. R., Muldrew, S. I., et al. 2011, MNRAS, 415, 2293 [NASA ADS] [CrossRef] [Google Scholar]
Komatsu, E., Smith, K. M., Dunkley, J., et al. 2011, ApJS, 192, 18 [NASA ADS] [CrossRef] [Google Scholar]
Kulkarni, G. V., Nichol, R. C., Sheth, R. K., et al. 2007, MNRAS, 378, 1196 [NASA ADS] [CrossRef] [Google Scholar]
Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [NASA ADS] [CrossRef] [Google Scholar]
Limber, D. N. 1953, ApJ, 117, 134 [NASA ADS] [CrossRef] [Google Scholar]
LoVerde, M., & Afshordi, N. 2008, Phys. Rev. D, 78, 123506 [NASA ADS] [CrossRef] [Google Scholar]
Majumdar, S., & Mohr, J. J. 2004, ApJ, 613, 41 [NASA ADS] [CrossRef] [Google Scholar]
Maller, A. H., McIntosh, D. H., Katz, N., & Weinberg, M. D. 2005, ApJ, 619, 147 [NASA ADS] [CrossRef] [Google Scholar]
Manera, M., & Gaztanaga, E. 2011, MNRAS, 415, 383 [NASA ADS] [CrossRef] [Google Scholar]
Marin, F. A., Wechsler, R. H., Frieman, J. A., & Nichol, R. C. 2007, ApJ, 172, 849 [Google Scholar]
Massey, R., Rhodes, J., Leauthaud, A., et al. 2007, ApJS, 172, 239 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
Meiksin, A., & White, M. 1999, MNRAS, 308, 1179 [NASA ADS] [CrossRef] [Google Scholar]
Melin, J.-B., Bartlett, J. G., & Delabrouille, J. 2006, A&A, 459, 341 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Munshi, D., Valageas, P., van Waerbeke, L., & Heavens, A. 2008, Phys. Rep., 462, 67 [NASA ADS] [CrossRef] [Google Scholar]
Norberg, P., Baugh, C. M., Gaztañaga, E., & Croton, D. J. 2009, MNRAS, 396, 19 [NASA ADS] [CrossRef] [Google Scholar]
Oukbir, J., & Blanchard, A. 1992, A&A, 262, L21 [NASA ADS] [Google Scholar]
Pacaud, F., Pierre, M., Leauthaud, A., et al. 2006, MNRAS, 372, 578 [NASA ADS] [CrossRef] [Google Scholar]
Pacaud, F., Pierre, M., Adami, C., et al. 2007, MNRAS, 382, 1289 [NASA ADS] [CrossRef] [Google Scholar]
Peebles, P. J. E. 1980, The large scale structure of the universe (Princeton: Princeton University Press) [Google Scholar]
Peebles, P. J. E., & Hauser, M. G. 1974, ApJS, 28, 19 [NASA ADS] [CrossRef] [Google Scholar]
Pierre, M., Pacaud, F., Juin, J. B., et al. 2011, MNRAS, 414, 1732 [NASA ADS] [CrossRef] [Google Scholar]
Politzer, H. D., & Wise, M. B. 1984, ApJ, 285, L1 [NASA ADS] [CrossRef] [Google Scholar]
Predehl, P., Boehringer, H., et al. 2009, Proceedings of the conference X-ray Astronomy 2009, Bologna, September 2009 [Google Scholar]
Refregier, A., Amara, A., Kitching, T. D., et al. 2010, Euclid Imaging Consortium Science Book [arXiv:1001.0061] [Google Scholar]
Reid, B. A., Percival, W. J., Eisenstein, D. J., et al. 2010, MNRAS, 404, 60 [NASA ADS] [CrossRef] [Google Scholar]
Ross, A. J., Brunner, R. J., & Myers, A. D. 2006, ApJ, 649, 48 [NASA ADS] [CrossRef] [Google Scholar]
Scoccimarro, R., Zaldarriaga, M., & Hui, L. 1999, ApJ, 527, 1 [NASA ADS] [CrossRef] [Google Scholar]
Scoccimarro, R., Sheth, R. K., Hui, L., & Jain, B. 2001, ApJ, 546, 20 [NASA ADS] [CrossRef] [Google Scholar]
Smith, R. E. 2009, MNRAS, 400, 851 [NASA ADS] [CrossRef] [Google Scholar]
Smith, R. E., Peacock, J. A., Jenkins, A., et al. 2003, MNRAS, 341, 1311 [NASA ADS] [CrossRef] [Google Scholar]
Spergel, D. N., Bean, R., Doré, O., et al. 2007, ApJS, 170, 377 [NASA ADS] [CrossRef] [Google Scholar]
Szapudi, I. 2001, in Annals of the New York Academy of Sciences, The Onset of Nonlinearity in Cosmology, ed. J. N. Fry, J. R. Buchler, & H. Kandrup, 927, 94 [Google Scholar]
Szapudi, I., & Colombi, S. 1996, ApJ, 470, 131 [NASA ADS] [CrossRef] [Google Scholar]
Szapudi, I., & Szalay, A. 1998, ApJ, 494, L41 [NASA ADS] [CrossRef] [Google Scholar]
Szapudi, I., Postman, M., Lauer, T. R., & Oegerle, W. 2001, ApJ, 548, 114 [NASA ADS] [CrossRef] [Google Scholar]
Tegmark, M., Eisenstein, D., Strauss, M. A., et al. 2006, Phys. Rev. D, 74, 123507 [Google Scholar]
Teyssier, R. 2002, A&A, 385, 337 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Teyssier, R., Pires, S., Prunet, S., et al. 2009, A&A, 497, 335 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Tinker, J., Kravtsov, A. V., Klypin, A., et al. 2008, ApJ, 688, 709 [NASA ADS] [CrossRef] [Google Scholar]
Tinker, J., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]
Valageas, P., & Nishimichi, T. 2011a, A&A, 527, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Valageas, P., & Nishimichi, T. 2011b, A&A, 532, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Vanderlinde, K., Crawford, T. M., & de Haan, T., et al. 2010, ApJ, 722, 1180 [NASA ADS] [CrossRef] [Google Scholar]

All Tables

Table K.1

Three alternative cosmologies.

In the text

All Figures

	Fig. 1 The “hierarchical clustering ansatz” for the three-point correlation function $ζ_{1, 2, 3}^{h}$ $Mathematical equation: \hbox{$\zetah_{1,2,3}$}$ of Eq. (4). Each solid line corresponds to a two-point correlation ξ, and ζ^h is written as the sum of these three diagrams, with a multiplicative factor b₁b₂b₃S₃/3.
In the text

	Fig. 2 The two topologies of the four-point diagrams associated with the “hierarchical clustering ansatz” for the four-point correlation, as in Eq. (6). The numbers are the multiplicity factors of each diagram.
In the text

	Fig. 3 The mean number density of dark matter halos per square degree, within redshift bins of width Δz = 0.1. We count all halos above the thresholds M_∗ = 2 × 10¹³,10¹⁴, and 5 × 10¹⁴h^-1 M_⊙, from top down to bottom. We compare our analytical results (solid lines) with numerical simulations (dashed lines).
In the text

	Fig. 4 The variance σ_{N_i} of the halo angular number densities of Fig. 3, for redshift bins Δz = 0.1 and an angular window of 50 deg². We compare our analytical results (solid lines) with numerical simulations (dashed lines).
In the text

	Fig. 5 The shot-noise (dashed lines) and sample-variance (solid lines) errors for the angular number densities shown in Fig. 3, associated with a redshift binning of width Δz = 0.1, but up to z = 2, and an angular window of 50 deg².
In the text

	Fig. 6 The signal-to-noise ratios of number counts for an angular area ΔΩ = 50 deg², as in Figs. 3 and 4. We compare our analytical results (solid lines) with numerical simulations (dashed lines).
In the text

	Fig. 7 The signal-to-noise ratios of number counts for a total angular area ΔΩ = 50 deg², divided over $Mathematical equation: \hbox{$\cN$}$ independent subfields. We show the results obtained for the numbers of subfields $Mathematical equation: \hbox{$\cN=1$}$ (solid lines), 2 (dashed lines), and 4 (dotted lines).
In the text

	Fig. 8 The angular power spectrum of the distribution of halos in the redshift bin 0.95 < z < 1.05. We plot both the exact result (38) (solid line) and Limber’s approximation (39) (dotted line).
In the text

	Fig. 9 The shot-noise (dashed line) and sample-variance errors (29) for the angular number densities in the redshift bin 0.95 < z < 1.05, as a function of the radius θ_s of the angular window. The solid line is the exact sample variance, from Eqs. (38) and (41), while the dotted line is the result (27), which was used in Fig. 5 and involves both the flat-sky and Limber’s approximations.
In the text

	Fig. 10 The ratio of the exact sample-variance error (41) to the approximation (27), which uses both the flat-sky and Limber’s approximations. We show this ratio as a function of the radius θ_s of the angular window, for several redshift bins, for halos above the mass threshold M > 10¹⁴ h^-1 M_⊙. Higher z corresponds to a higher ratio.
In the text

Fig. 11

In the text

	Fig. 12 The mean halo correlation, $⟨ ξ̂ \begin{matrix} LS \\ i \end{matrix} ⟩$ $Mathematical equation: \hbox{$\lag\hxiLS_i\rag$}$ , over ten comoving distance bins within 5 < r < 100 h^-1 Mpc, equally spaced in log (r). We integrate over halos within the redshift interval 0 < z < 0.8 and we compare our analytical results (solid lines) with numerical simulations (dashed lines).
In the text

Fig. 13

In the text

	Fig. 14 The contributions C⁽²⁾ and C⁽³⁾ to the covariance of the Landy & Szalay estimator, along the diagonal i = j. As in Fig. 13, we only consider the low-order terms, given by Eq. (69).
In the text

	Fig. 15 The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ along its diagonal. We again consider halos in the redshift range 0 < z < 0.8, with an angular window of 50 deg², above two mass thresholds.
In the text

	Fig. 16 The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 15, but along one row. This corresponds to the fixed bin i = 4, associated with the distance bin 12.3 < r < 16.6 h^-1 Mpc, as a function of j.
In the text

	Fig. 17 The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ of the Landy & Szalay estimator, along the diagonal i = j. We show our analytical results including all contributions (solid lines) or only low-order terms (dotted lines), and results from numerical simulations (dashed lines).
In the text

	Fig. 18 The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 17, but along one row. This corresponds to the fixed bin i = 4, associated with the distance bin 12.3 < r < 16.6 h^-1 Mpc, as a function of j.
In the text

Fig. 19

In the text

	Fig. 20 The mean angular correlation, $⟨ ŵ \begin{matrix} LS \\ i \end{matrix} ⟩$ $Mathematical equation: \hbox{$\lag\hwLS_i\rag$}$ , over eight angular bins within 1.25 < θ < 50 arcmin, equally spaced in log (θ). We compare our analytical results (solid lines) with numerical simulations (dashed lines).
In the text

Fig. 21

In the text

	Fig. 22 The contributions C⁽²⁾ and C⁽³⁾ to the covariance of the Landy & Szalay estimator, along the diagonal i = j. As in Fig. 21, we only consider the low-order terms, given by Eq. (94).
In the text

	Fig. 23 The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ along its diagonal. We again consider halos in the redshift range 0 < z < 0.8, with an angular window of 400 deg², above two mass thresholds.
In the text

	Fig. 24 The low- and high-order contributions to the covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 23, but along one row. This corresponds to the fixed bin i = 2, associated with the angular bin 2 < θ < 3.2 arcmin, as a function of j.
In the text

	Fig. 25 The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ along its diagonal. We show our analytical results including all contributions (solid lines) or only low-order terms (dotted lines), and results from numerical simulations (dashed lines).
In the text

	Fig. 26 The covariance matrix $C_{i,j}^{LS}$ $Mathematical equation: \hbox{$C_{i,j}^{\rm LS}$}$ , as in Fig. 25, but along one row. This corresponds to the fixed bin i = 4, associated with the angular bin 5 < θ < 8 arcmin, as a function of j.
In the text

	Fig. 30 The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , for the clusters detected by DES over the redshift interval 1 < z < 2. Here we consider 20 distance bins within 5 < r < 100 h^-1 Mpc, equally spaced in log (r) (i.e. twice as many as in Fig. 29).
In the text

	Fig. 33 The mean correlation, $Mathematical equation: \hbox{$\lag\hxi_i\rag$}$ , over the redshift interval 1 < z < 2, for the clusters detected by Erosita (upper curve, with ten distance bins) and Euclid (lower curve, with twenty distance bins).
In the text

	Fig. B.1 Geometrical illustration of finite-size effects. Close to the survey boundary, part of the sphere of radius r extends beyond the observational cone and should not be counted. The left plot is a transverse view, orthogonal to the central line of sight, whereas the right plot is a view from a point far away on the line of sight.
In the text

Fig. I.1

In the text

	Fig. I.2 Same as Fig. I.1 but with a scaling that depends on the number of subfields: $Mathematical equation: \hbox{$\cN^{-n/4} \, (\Delta\Omega)^{(n+2)/4}$}$ , with n = −0.6
In the text

Fig. J.1

In the text

Fig. K.1

The ratio $σ_{N_{i}}^{(s . n .)} / σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}/\sigma_{N_i}^{(s.v.)}$}$ of the rms shot-noise contribution $σ_{N_{i}}^{(s . n .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.n.)}$}$ to the rms sample-variance contribution $σ_{N_{i}}^{(s . v .)}$ $Mathematical equation: \hbox{$\sigma_{N_i}^{(s.v.)}$}$ , of the covariance of the angular number densities N_i, as in Fig. 34. The fiducial curve that was shown in Fig. 34 is the solid line (mean WMAP7 cosmology), whereas the dashed, dot-dashed, and dotted lines correspond to the three cosmologies where either h, Ω_m, or σ₈, is changed to the value given in the second line of Table K.1.

In the text

Fig. K.2

The ratio $σ_{ξ_{i}}^{(2)} / σ_{ξ_{i}}^{(3 + 4)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(2)}/\sigma_{\xi_i}^{(3+4)}$}$ of the rms contributions $\sqrt{C^{(2)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(2)}}$}$ and $\sqrt{C^{(3)} + C^{(4)}}$ $Mathematical equation: \hbox{$\sqrt{C^{(3)}+C^{(4)}}$}$ of the covariance matrix of the estimator $ξ̂ \begin{matrix} LS \\ i \end{matrix}$ $Mathematical equation: \hbox{$\hxiLS_i$}$ , as in Fig. 35. The line styles are as in Fig. K.1 and Table K.1.

In the text

	Fig. K.3 The ratio $σ_{ξ_{i}}^{(ξξ + ζ + η)} / σ_{ξ_{i}}^{(ξ)}$ $Mathematical equation: \hbox{$\sigma_{\xi_i}^{(\xi\xi+\zeta+\eta)}/\sigma_{\xi_i}^{(\xi)}$}$ of the rms high-order contribution (75)–(77) to the rms low-order contribution (second term in Eq. (69)) of the sample variance of the correlation ξ_i, as in Fig. 36. The line styles are as in Fig. K.1 and Table K.1.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[R1] Abazajian, K. N., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2009, APJS, 182, 543 [Google Scholar]

[R2] Adami, C., Mazure, A., Pierre, M., et al. 2011, A&A, 526, A18 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R3] Arnaud, M., Pointecouteau, E., & Pratt, G. W. 2005, A&A, 441, 893 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R4] Arnaud, M., Pratt, G. W., Piffaretti, R., et al. 2010, A&A, 517, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R5] Benoist, C., Maurogordato, S., da Costa, L. N., Cappi, A., & Schaeffer, R. 1996, ApJ, 472, 452 [NASA ADS] [CrossRef] [Google Scholar]

[R6] Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [NASA ADS] [CrossRef] [EDP Sciences] [MathSciNet] [Google Scholar]

[R7] Bernstein, G. M. 1994, ApJ, 424, 569 [NASA ADS] [CrossRef] [Google Scholar]

[R8] Cohn, J. D. 2006, New Astron., 11, 226 [NASA ADS] [CrossRef] [Google Scholar]

[R9] Cole, S., Percival, W. J., Peacock, J. A., et al. 2005, MNRAS, 362, 505 [NASA ADS] [CrossRef] [Google Scholar]

[R10] Colombi, S., Bouchet, F. R., & Hernquist, L. 1996, ApJ, 465, 14 [NASA ADS] [CrossRef] [Google Scholar]

[R11] Cooray, A., & Sheth, R. 2002, Phys. Rep., 372, 1 [NASA ADS] [CrossRef] [Google Scholar]

[R12] Crocce, M., Cabre, A., & Gaztanaga, E. 2011, MNRAS, 414, 329 [NASA ADS] [CrossRef] [Google Scholar]

[R13] Croton, D. J., Gaztanaga, E., Baugh, C. M., et al. 2004, MNRAS, 352, 1232 [NASA ADS] [CrossRef] [Google Scholar]

[R14] Desjacques, V., Crocce, M., Scoccimarro, R., & Sheth, R. K. 2010, Phys. Rev. D, 82, 103529 [NASA ADS] [CrossRef] [Google Scholar]

[R15] Eisenstein, D. J., & Hut, P. 1998, ApJ, 498, 137 [NASA ADS] [CrossRef] [Google Scholar]

[R16] Eisenstein, D. J., & Zaldarriaga, M. 2001, ApJ, 546, 2 [NASA ADS] [CrossRef] [Google Scholar]

[R17] Eisenstein, D. J., Hu, W., & Tegmark, M. 1998, ApJ, 504, L57 [NASA ADS] [CrossRef] [Google Scholar]

[R18] Eisenstein, D. J., Zehavi, I., Hogg, D. W., et al. 2005, ApJ, 633, 560 [NASA ADS] [CrossRef] [Google Scholar]

[R19] Evrard, A. E. 1989, ApJ, 341, L71 [NASA ADS] [CrossRef] [Google Scholar]

[R20] Feldman, H. A., Kaiser, N., & Peacock, J. A. 1994, ApJ, 426, 23 [NASA ADS] [CrossRef] [Google Scholar]

[R21] Fry, J. N. 1984, ApJ, 279, 499 [NASA ADS] [CrossRef] [Google Scholar]

[R22] Fry, J. N., & Gaztanaga, E. 1993, ApJ, 413, 447 [NASA ADS] [CrossRef] [Google Scholar]

[R23] Gaztanaga, E., Norberg, P., Baugh, C. M., & Croton, D. J. 2005, MNRAS, 364, 620 [NASA ADS] [CrossRef] [Google Scholar]

[R24] Giocoli, C., Bartelmann, M., Sheth, R. K., & Cacciato, M. 2010, MNRAS, 408, 300 [NASA ADS] [CrossRef] [Google Scholar]

[R25] Goroff, M. H., Grinstein, B., Rey, S.-J., & Wise, M. B. 1986, ApJ, 311, 6 [NASA ADS] [CrossRef] [Google Scholar]

[R26] Gradshteyn, I. S., & Ryzhik, I. M. 1965, Table of integrals, series, and products (New York: Academic Press) [Google Scholar]

[R27] Groth, E. J., & Peebles, P. J. E. 1977, ApJ, 217, 385 [NASA ADS] [CrossRef] [Google Scholar]

[R28] Harker, G., Cole, S., & Jenkins, A. 2007, MNRAS, 382, 1503 [NASA ADS] [CrossRef] [Google Scholar]

[R29] Hu, W. 2000, Phys. Rev. D, 62, 043007 [NASA ADS] [CrossRef] [Google Scholar]

[R30] Hu, W., & Kravtsov, A. V. 2003, ApJ, 584, 702 [NASA ADS] [CrossRef] [Google Scholar]

[R31] Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]

[R32] Kaiser, N. 1992, ApJ, 388, 272 [NASA ADS] [CrossRef] [Google Scholar]

[R33] Kazin, E. A., Blanton, M. R., Scoccimarro, R., et al. 2010, ApJ, 710, 1444 [NASA ADS] [CrossRef] [Google Scholar]

[R34] Kerscher, M., Szapudi, I., & Szalay, A. S. 2000, ApJ, 535, L13 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]

[R35] Knebe, A., Knollmann, S. R., Muldrew, S. I., et al. 2011, MNRAS, 415, 2293 [NASA ADS] [CrossRef] [Google Scholar]

[R36] Komatsu, E., Smith, K. M., Dunkley, J., et al. 2011, ApJS, 192, 18 [NASA ADS] [CrossRef] [Google Scholar]

[R37] Kulkarni, G. V., Nichol, R. C., Sheth, R. K., et al. 2007, MNRAS, 378, 1196 [NASA ADS] [CrossRef] [Google Scholar]

[R38] Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [NASA ADS] [CrossRef] [Google Scholar]

[R39] Limber, D. N. 1953, ApJ, 117, 134 [NASA ADS] [CrossRef] [Google Scholar]

[R40] LoVerde, M., & Afshordi, N. 2008, Phys. Rev. D, 78, 123506 [NASA ADS] [CrossRef] [Google Scholar]

[R41] Majumdar, S., & Mohr, J. J. 2004, ApJ, 613, 41 [NASA ADS] [CrossRef] [Google Scholar]

[R42] Maller, A. H., McIntosh, D. H., Katz, N., & Weinberg, M. D. 2005, ApJ, 619, 147 [NASA ADS] [CrossRef] [Google Scholar]

[R43] Manera, M., & Gaztanaga, E. 2011, MNRAS, 415, 383 [NASA ADS] [CrossRef] [Google Scholar]

[R44] Marin, F. A., Wechsler, R. H., Frieman, J. A., & Nichol, R. C. 2007, ApJ, 172, 849 [Google Scholar]

[R45] Massey, R., Rhodes, J., Leauthaud, A., et al. 2007, ApJS, 172, 239 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]

[R46] Meiksin, A., & White, M. 1999, MNRAS, 308, 1179 [NASA ADS] [CrossRef] [Google Scholar]

[R47] Melin, J.-B., Bartlett, J. G., & Delabrouille, J. 2006, A&A, 459, 341 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R48] Munshi, D., Valageas, P., van Waerbeke, L., & Heavens, A. 2008, Phys. Rep., 462, 67 [NASA ADS] [CrossRef] [Google Scholar]

[R49] Norberg, P., Baugh, C. M., Gaztañaga, E., & Croton, D. J. 2009, MNRAS, 396, 19 [NASA ADS] [CrossRef] [Google Scholar]

[R50] Oukbir, J., & Blanchard, A. 1992, A&A, 262, L21 [NASA ADS] [Google Scholar]

[R51] Pacaud, F., Pierre, M., Leauthaud, A., et al. 2006, MNRAS, 372, 578 [NASA ADS] [CrossRef] [Google Scholar]

[R52] Pacaud, F., Pierre, M., Adami, C., et al. 2007, MNRAS, 382, 1289 [NASA ADS] [CrossRef] [Google Scholar]

[R53] Peebles, P. J. E. 1980, The large scale structure of the universe (Princeton: Princeton University Press) [Google Scholar]

[R54] Peebles, P. J. E., & Hauser, M. G. 1974, ApJS, 28, 19 [NASA ADS] [CrossRef] [Google Scholar]

[R55] Pierre, M., Pacaud, F., Juin, J. B., et al. 2011, MNRAS, 414, 1732 [NASA ADS] [CrossRef] [Google Scholar]

[R56] Politzer, H. D., & Wise, M. B. 1984, ApJ, 285, L1 [NASA ADS] [CrossRef] [Google Scholar]

[R57] Predehl, P., Boehringer, H., et al. 2009, Proceedings of the conference X-ray Astronomy 2009, Bologna, September 2009 [Google Scholar]

[R58] Refregier, A., Amara, A., Kitching, T. D., et al. 2010, Euclid Imaging Consortium Science Book [arXiv:1001.0061] [Google Scholar]

[R59] Reid, B. A., Percival, W. J., Eisenstein, D. J., et al. 2010, MNRAS, 404, 60 [NASA ADS] [CrossRef] [Google Scholar]

[R60] Ross, A. J., Brunner, R. J., & Myers, A. D. 2006, ApJ, 649, 48 [NASA ADS] [CrossRef] [Google Scholar]

[R61] Scoccimarro, R., Zaldarriaga, M., & Hui, L. 1999, ApJ, 527, 1 [NASA ADS] [CrossRef] [Google Scholar]

[R62] Scoccimarro, R., Sheth, R. K., Hui, L., & Jain, B. 2001, ApJ, 546, 20 [NASA ADS] [CrossRef] [Google Scholar]

[R63] Smith, R. E. 2009, MNRAS, 400, 851 [NASA ADS] [CrossRef] [Google Scholar]

[R64] Smith, R. E., Peacock, J. A., Jenkins, A., et al. 2003, MNRAS, 341, 1311 [NASA ADS] [CrossRef] [Google Scholar]

[R65] Spergel, D. N., Bean, R., Doré, O., et al. 2007, ApJS, 170, 377 [NASA ADS] [CrossRef] [Google Scholar]

[R66] Szapudi, I. 2001, in Annals of the New York Academy of Sciences, The Onset of Nonlinearity in Cosmology, ed. J. N. Fry, J. R. Buchler, & H. Kandrup, 927, 94 [Google Scholar]

[R67] Szapudi, I., & Colombi, S. 1996, ApJ, 470, 131 [NASA ADS] [CrossRef] [Google Scholar]

[R68] Szapudi, I., & Szalay, A. 1998, ApJ, 494, L41 [NASA ADS] [CrossRef] [Google Scholar]

[R69] Szapudi, I., Postman, M., Lauer, T. R., & Oegerle, W. 2001, ApJ, 548, 114 [NASA ADS] [CrossRef] [Google Scholar]

[R70] Tegmark, M., Eisenstein, D., Strauss, M. A., et al. 2006, Phys. Rev. D, 74, 123507 [Google Scholar]

[R71] Teyssier, R. 2002, A&A, 385, 337 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R72] Teyssier, R., Pires, S., Prunet, S., et al. 2009, A&A, 497, 335 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R73] Tinker, J., Kravtsov, A. V., Klypin, A., et al. 2008, ApJ, 688, 709 [NASA ADS] [CrossRef] [Google Scholar]

[R74] Tinker, J., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]

[R75] Valageas, P., & Nishimichi, T. 2011a, A&A, 527, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R76] Valageas, P., & Nishimichi, T. 2011b, A&A, 532, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R77] Vanderlinde, K., Crawford, T. M., & de Haan, T., et al. 2010, ApJ, 722, 1180 [NASA ADS] [CrossRef] [Google Scholar]

Covariance matrices for halo number counts and correlation functions⋆

1. Introduction

2. Halo density fields

2.1. Analytic models

2.1.1. Halo mass function and correlation

2.1.2. Three-point and four-point halo correlations

2.1.3. Flat-sky and Limber’s approximations

2.2. Numerical simulations

3. Number density of halos

3.1. Mean number counts in redshift bins

3.2. Covariance of number counts

3.2.1. Small angular windows

Signal-to-noise ratio

Scalings with survey area and number of subfields

3.2.2. Large angular windows

3.2.3. Accuracy of the “flat-sky + Limber” approximation

3.2.4. Correlation between different redshift bins

4. Real-space two-point correlation function

4.1. Mean correlation

4.1.1. Peebles & Hauser estimator

4.1.2. Landy & Szalay estimator

4.1.3. Comparison with simulations

4.2. Covariance matrices for the halo correlation

4.2.1. Low-order terms

Comparison of Peebles & Hauser and Landy & Szalay covariance matrices

Comparison of sample-variance and shot-noise effects

Scalings with survey area and number of subfields

4.2.2. High-order terms for the covariance of

4.2.3. Comparison with numerical simulations

4.2.4. Correlation matrices

5. Angular correlation function

5.1. Mean correlation

5.1.1. Peebles & Hauser estimator

5.1.2. Landy & Szalay estimator

5.1.3. Comparison with simulations

5.2. Covariance matrices for the halo angular correlation

5.2.1. Low-order terms

Comparison of Peebles & Hauser and Landy & Szalay covariance matrices

Comparison of sample-variance and shot-noise effects

5.2.2. High-order terms

5.2.3. Comparison with numerical simulations

5.2.4. Correlation matrices

6. Applications to real survey cases

6.1. Surveys of limited areas

6.2. All-sky surveys

6.3. Shot noise versus sample variance

6.4. High-order and low-order contributions to the sample variance of

6.5. Dependence of the results on cosmology

7. Conclusion

Online material

Appendix A: Mean and covariance of number counts

Appendix B: Finite-size effects

Appendix C: Computation of the mean of the estimators and

Appendix D: Derivation of the covariance of the Peebles & Hauser estimator

Appendix E: Derivation of the mean and covariance of the Landy & Szalay estimator

Appendix F: Computation of high-order terms for the covariance of

Appendix G: Computation of the mean of the estimators ŵ and ŵLS

Appendix H: Computation of the covariance of ŵLS

Appendix I: Scaling of the number counts signal-to-noise in simulations

Appendix J: Selection functions used for various surveys

Appendix K: Dependence on cosmology

Acknowledgments

References

All Tables

All Figures

Covariance matrices for halo number counts and correlation functions^⋆

4.2.2. High-order terms for the covariance of $Mathematical equation: \hbox{$\hat{\xi}^\mathsf {LS}$}$

6.4. High-order and low-order contributions to the sample variance of $Mathematical equation: \hbox{$\hxi$}$

Appendix C: Computation of the mean of the estimators $Mathematical equation: \hbox{$\hxi$}$ and $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

Appendix D: Derivation of the covariance of the Peebles & Hauser estimator $Mathematical equation: \hbox{$\hxi$}$

Appendix E: Derivation of the mean and covariance of the Landy & Szalay estimator $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

Appendix F: Computation of high-order terms for the covariance of $Mathematical equation: \hbox{$\hat{\xi}^\mathsf{LS}$}$

Appendix G: Computation of the mean of the estimators ŵ and ŵ^LS

Appendix H: Computation of the covariance of ŵ^LS