Covariances of density probability distribution functions. Lessons from hierarchical models

Francis Bernardeau

doi:10.1051/0004-6361/202142526

Home

All issues

Volume 663 (July 2022)

A&A, 663 (2022) A124

Full HTML

Open Access

Issue		A&A Volume 663, July 2022


Article Number		A124
Number of page(s)		22
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/202142526
Published online		19 July 2022

A&A 663, A124 (2022)

Covariances of density probability distribution functions. Lessons from hierarchical models

Francis Bernardeau¹^,2

¹ Université Paris-Saclay, CNRS, CEA, Institut de physique théorique, 91191 Gif-sur-Yvette, France
e-mail: francis.bernardeau@cea.fr
² Institut d’Astrophysique de Paris, CNRS and Sorbonne Université, UMR 7095, 98 bis bd Arago, 75014 Paris, France

Received: 26 October 2021
Accepted: 11 March 2022

Abstract

Context. Statistical properties of the cosmic density fields are to a large extent encoded in the shape of the one-point density probability distribution functions (PDF) as measured in surveys. In order to successfully exploit such observables, a detailed functional form of the covariance matrix of the one-point PDF is needed.

Aims. The objectives are to model the properties of this covariance for general stochastic density fields and for stochastic fields that reproduce the properties expected in cosmology. The accuracy of the proposed forms is evaluated in specific cases.

Methods. The study was conducted in a cosmological context and determined whether the density is defined absolutely or relatively to the sample mean density. Leading and subleading contributions were identified within a large class of models, the so-called hierarchical models. They come from either large or short separation contributions. The validity of the proposed forms for the covariance matrix was assessed with the help of a toy model, the minimum tree model, for which a corpus of exact results could be obtained (forms of the one- and two-point PDF, large-scale density-bias functions, and full covariance matrix of the one-point PDF).

Results. It is first shown that the covariance matrix elements are directly related to the spatial average of the two-point density PDF within the sample. The dominant contribution to this average is explicitly given for hierarchical models (coming from large scale contribution), which leads to the construction of specific density-bias functions. However, this contribution alone cannot be used to construct an operational likelihood function. Subdominant large-scale effects are found to provide corrective terms, but also a priori lead to limited information on the covariance matrix. Short distance effects are found to be more important but more difficult to derive as they depend more on the details of the model. However, a simple and generic form of these contributions is proposed. Detailed comparisons in the context of the Rayleigh-Levy flight model show that the large-scale effects capture the bulk of the supersample effects and that, by adding the short-distance contributions, a qualitatively correct model of the likelihood function can be obtained.

Key words: large-scale structure of Universe / cosmology: theory / methods: statistical

© F. Bernardeau 2022

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

In the context of cosmological studies, the concept of counts-in-cells statistics has been put forward for a long time as a unique way to quantify the statistical properties of the cosmological fields (White 1979; Colombi et al. 1995; Balian & Schaeffer 1989; Bernardeau & Schaeffer 1999). It was then shown in particular that counts-in-cells statistics, which represents a discrete representation of the local density probability distribution function (PDF), could be directly related to the correlation hierarchy of the density field.

Interest in these types of observables was recently renewed for several reasons. The size of the surveys makes accurately measuring these quantities more realistic. This is already the case for surveys such as the Dark Energy Survey (DES collaboration; Abbott et al. 2018), the Kilo-Degree Survey (KIDS; Heymans et al. 2021), and the Hyper Suprime Cam (HSC; Hikage et al. 2019). The future promises even larger and more powerful surveys such as Euclid (Laureijs et al. 2011; Amendola et al. 2018) and the Rubin Observatory (Ivezić et al. 2019). Moreover, the theoretical foundations for these constructions (at least in the cosmological context) has been considerably strengthen with the realization that the large-deviation theory (LDT; for a general review, see Touchette 2011) could successfully be invoked, as shown in Bernardeau & Reimberg (2016). It clarifies the applicability of the theory to the cosmological density field and places previous works on a much more solid foundation (Valageas 2002; Bernardeau et al. 2014). The ability of density PDF to constrain cosmology was emphasized in Codis et al. (2016b) and completed in Friedrich et al. (2020) and in Uhlemann et al. (2020), who showed that these observable could efficiently constrain the neutrino mass or primordial non-Gaussianities. Finally, although the matter PDF is not a direct observable, as is matter density, it can be closely approached with the help of luminous tracer statistics (Repp & Szapudi 2020), more convincingly in weak-lensing fields, as advocated in numerous recent papers (Barthelemy et al. 2021; Bernardeau & Valageas 2000), or with combined approaches such as density-split statistics (Gruen et al. 2018; Friedrich et al. 2018; Brouwer et al. 2018), which proved to be particularly promising.

The construction of a full theory of these observable requires a detailed analysis of its global error budget, however, due to finite-size surveys, imperfect tracers, and so on. Some of these aspects have been explored in early studies such as Szapudi & Colombi (1996) and Szapudi et al. (1999), but a full theory is still lacking. The developments presented in this paper are made in this context. More precisely, the purpose of this study is to explore what determines the expression of the covariance of data vectors whose elements are local quantities, such as the density contrast and density profiles, in cosmological contexts, that is, in classical random fields with long-range correlations. Derivations were made furthermore assuming statistical homogeneity and isotropy. The domain of application encompasses both counts-in-cells statistics, basically 2D or 3D counts of density tracers, or proxies to projected densities such as mass maps for weak-lensing tomographic observations.

In order to gain insights into the different contributions and the effects that might contribute to the covariance, we rely on the use of the hierarchical models to derive results we think rather general. The immense advantage of using such models is that they naturally incorporate many of the features expected in density cosmological fields (e.g., the magnitude of the high-order correlation functions), and there are also models for which many exact results can be obtained in particular for counts-in-cells statistics. The goal of these constructions is to eventually infer precisely what the performance of PDF measurements would be on the determination of cosmological parameters, taking advantage of results such as those found in Boyle et al. (2021), which give the response function of these observable to various cosmological parameters

Section 2 is devoted to the presentation of the general framework. The subsequent section explores different contributions, from large-scale effects with the derivation of several bias functions to short-distance contributions. Results are derived in a framework as general as possible, including discrete noise associated with the use of a finite number of tracers. Section 4 presents the general hierarchical models, and more specifically, the Rayleigh-Levy flight model that we use as a toy model to evaluate the performances of approximate schemes. In Sect. 5, simplified models for the covariance matrix are presented and evaluated with the help of a set of numerical experiments. Section 6 summarizes the results that have be found and specifies their expected range of application.

The text is complemented by appendices that contain a large amount of material. They present the hierarchical models, their mathematical description, and the mean-field approximation that is used throughout for explicit computations. Appendix C is more specifically devoted to the minimal tree model and the construction of the exact mean-field covariance matrix.

2. General framework. Construction of covariance matrices

The purpose of this section is to show how the elements of the covariance matrix are related to the joint density PDFs within a given survey. We first formalize this relation in a general framework before we explore its consequence in the context we are interested in. We assume we are interested in the PDF of some local quantity, μ, that can be evaluated within a survey, thus defining a field μ(x) throughout the survey. The a priori typical example of this quantity is the density (see below for a more precise illustration of what this quantity could be). The value of μ is assumed to lie in some ensemble ℳ (that can be simply the real numbers), and the data vector we are interested in consists of the probabilities p_i that μ lie within the subsets ℳ_i (which are a priori nonzero within ℳ). The one-point PDF of μ is then given by

$\begin{matrix} p_{i} (x) = \int_{M_{i}} d μ P (μ, x), \end{matrix}$ $\begin{aligned} p_{i}(\mathbf{x})=\int _{\mathcal{M}_{i}} \mathrm{d}\mu \ \mathcal{P}(\mu ,\mathbf{x}), \end{aligned}$ (1)

if ℘(μ,x) dμ is the PDF of μ at location x. p_i(x) is then assumed to be independent of x in the context we are interested in, for which statistical homogeneity is assumed. More formally, we can define the characteristic function χ_i(x), which takes the value 1 where μ(x) ∈ ℳ_i and 0 otherwise.

An estimation of p_i would then be given by the volume fraction of the survey where μ(x) ∈ ℳ_i. We note this estimate as P_i ¹,

$\begin{matrix} P_{i} = \frac{1}{V} \int d x χ_{i} (x), \end{matrix}$ $\begin{aligned} P_{i} =\frac{1}{V} \int \mathrm{d}\mathbf{x}\ \chi _{i}(\mathbf{x}), \end{aligned}$ (2)

which is then itself a random variable, the properties of which we are interested in. More precisely, we would like to derive an operational form for the likelihood function of a set of P_i variable. We limit our investigation here to the construction of the likelihood from the covariance matrix, assuming that the likelihood of P_i is close enough to a Gaussian distribution².

The ensemble average of P_i is

$\begin{matrix} ⟨ P_{i} ⟩ = \frac{1}{V} \int d x ⟨ χ_{i} (x) ⟩ = \frac{1}{V} \int d x p_{i} (x) = p_{i .} \end{matrix}$ $\begin{aligned} \langle P_{i} \rangle =\frac{1}{V} \int \mathrm{d}\mathbf{x}\ \langle \chi _{i}(\mathbf{x})\rangle = \frac{1}{V} \int \mathrm{d}\mathbf{x}\ p_{i}(\mathbf{x})=p_{i.} \end{aligned}$ (3)

We can further define a joint PDF of the same field, ℘(μ,x;μ′,x′), which is the joint PDF of μ and μ′ in locations x and x′. Defining p_ij(x, x′) as the joint ensemble average of ℘(μ,x;μ′,x′), we have

$\begin{matrix} p_{ij} (x, x^{'}) = \int_{M_{i}} d μ \int_{M_{j}} d μ^{'} P (μ, x ; μ^{'}, x^{'}) . \end{matrix}$ $\begin{aligned} p_{ij}(\mathbf{x},\mathbf{x}^{\prime })=\int _{\mathcal{M}_{i}} \mathrm{d}\mu \int _{\mathcal{M}_{j}} \mathrm{d}\mu ^{\prime }\ \mathcal{P}(\mu ,\mathbf{x};\mu ^{\prime },\mathbf{x}^{\prime }). \end{aligned}$ (4)

The elements of the covariance matrix of P_i are then formally

$\begin{matrix} ⟨ P_{i} P_{j} ⟩ = \frac{1}{V^{2}} \int d x \int d x^{'} ⟨ χ_{i} (x) χ_{j} (x^{'}) ⟩ \\ = \frac{1}{V^{2}} \int d x \int d x^{'} p_{ij} (x, x^{'}) \equiv {\bar{p}}_{ij} . \end{matrix}$ $\begin{aligned}&\langle P_{i} P_{j} \rangle = \frac{1}{V^{2}} \int \mathrm{d}\mathbf{x}\int \mathrm{d}\mathbf{x}^{\prime }\ \langle \chi _{i}(\mathbf{x})\chi _{j}(\mathbf{x}^{\prime })\rangle \nonumber \\&\quad \ \ \quad = \frac{1}{V^{2}} \int \mathrm{d}\mathbf{x}\int \mathrm{d}\mathbf{x}^{\prime }\ p_{ij}(\mathbf{x},\mathbf{x}^{\prime })\equiv \overline{p}_{ij}. \end{aligned}$ (5)

This gives the relation between the covariances and joint PDF. If p_ij(x, x′) depends only on the relative distance between x and x′, this expression can be recast in terms of the distribution of such distances, P_s(r_d), in the form

$\begin{matrix} ⟨ P_{i} P_{j} ⟩ = \int d r_{d} P_{s} (r_{d}) p_{ij} (r_{d}) . \end{matrix}$ $\begin{aligned} \langle P_{i} P_{j} \rangle =\int \mathrm{d}r_d\,P_{s}(r_d) p_{ij}(r_d). \end{aligned}$ (6)

The precise form of P_s(r_d) depends on the detail of the survey. Explicit forms can be given in case of simple regular surveys such as square surveys³. In the context of statistically homogeneous and isotropic random fields, this latter expression is used. In particular, we wish to determine the configurations that contribute most to p̄_ij. They obviously depend on both the random processes we consider and on the definition of ℳ_i and ℳ_j. In order to be more specific, we assume in the following that μ is a local density assigned to be in bins (i) centered on ρ_i and with width dρ_i (assumed a priori to be arbitrarily small), so that

$\begin{matrix} P_{i} = P (ρ_{i}) d ρ_{i} . \end{matrix}$ $\begin{aligned} P_{i}=P(\rho _{i})\mathrm{d}\rho _{i}. \end{aligned}$ (7)

If necessary, local densities could be obtained after the field μ(x) has been convolved with a window function W_R(x), associated with a scale R that is

$\begin{matrix} ρ (x) = \int d x^{'} μ (x - x^{'}) W_{R} (x^{'}) . \end{matrix}$ $\begin{aligned} \rho (\mathbf{x})=\int \mathrm{d}\mathbf{x}^{\prime }\,\mu (\mathbf{x}-\mathbf{x}^{\prime })W_{R}(\mathbf{x}^{\prime }). \end{aligned}$ (8)

It is then assumed R is small compared to the sample size in order to identify what the leading contributions to the joint PDFs might be. In practice, W_R might also be a simple top-hat window function, but this is not necessarily the case. It could be more elaborated filters, such as compensated filters (of zero average), such as those introduced for cosmic shear analysis (Schneider 1996; Kaiser 1998; Bernardeau & Valageas 2000).

We furthermore allow the estimated densities ρ_i to be defined with respect the overall density of the sample ρ_s,

$\begin{matrix} ρ_{s} = \frac{1}{V} \int d x μ (x) . \end{matrix}$ $\begin{aligned} \rho _{s}=\frac{1}{V}\int \mathrm{d}\mathbf{x}\ \mu (\mathbf{x}). \end{aligned}$ (9)

For instance, we could be interested in ${\hat{ρ}}_{i} \equiv ρ_{i} / ρ_{s}$ ${\hat{\rho}}_{i}\equiv\rho_{i}/\rho_{s}$ or ${\bar{ρ}}_{i} \equiv ρ_{i} - (ρ_{s} - 1),$ ${\overline{\rho}}_{i}\equiv\rho_{i}-(\rho_s-1),$ which are frequently encountered situations in praxis. Then ρ_s is itself a random variable whose correlation with ρ(x) ought to be taken into account. We then need to explore the properties of either P(ρ_i, ρ_j; x, x′) or P(ρ_s, ρ_i, ρ_j; x, x′) from which functions of interest can be built, that is,

$\begin{matrix} P (\hat{ρ}, {\hat{ρ}}^{'}) & = \int d ρ_{s} ρ_{s}^{2} P (ρ_{s}, \hat{ρ} ρ_{s}, {\hat{ρ}}^{'} ρ_{s} ; x, x^{'}) \end{matrix}$ $\begin{aligned} P(\hat{\rho },\hat{\rho }^{\prime })&=\int \mathrm{d}\rho _{s}\,\rho _{s}^{2}\,P(\rho _{s},\hat{\rho }\rho _{s},\hat{\rho }^{\prime }\rho _{s};\mathbf{x},\mathbf{x}^{\prime })\end{aligned}$ (10)

$\begin{matrix} P (\bar{ρ}, {\bar{ρ}}^{'}) & = \int d ρ_{s} P (ρ_{s}, {\bar{ρ}}_{i} - 1 + ρ^{'}, {\bar{ρ}}^{'} - 1 + ρ_{s} ; x, x^{'}), \end{matrix}$ $\begin{aligned} P(\overline{\rho },\overline{\rho }^{\prime })&=\int \mathrm{d}\rho _{s}\,P(\rho _{s},\overline{\rho }_{i}-1+\rho ^{\prime },\overline{\rho }^{\prime }-1+\rho _{s};\mathbf{x},\mathbf{x}^{\prime }), \end{aligned}$ (11)

from which the covariance elements such as

$\begin{matrix} Cov (ρ_{i}, ρ_{j}) d ρ_{i} d ρ_{j} = \int d r_{d} P_{s} (r_{d}) P (ρ_{i}, ρ_{j} ; r_{d}) d ρ_{i} d ρ_{j} \\ - P (ρ_{i}) P (ρ_{j}) d ρ_{i} d ρ_{j} \end{matrix}$ $\begin{aligned}&\mathrm{Cov}(\rho _{i},\rho _{j})\mathrm{d}\rho _{i}\mathrm{d}\rho _{j}= \int \mathrm{d}r_{d}\,P_{s}(r_d) P(\rho _{i},\rho _{j};r_d)\mathrm{d}\rho _{i}\mathrm{d}\rho _{j}\nonumber \\&\qquad \qquad \qquad \qquad -P(\rho _{i})P(\rho _{j})\mathrm{d}\rho _{i}\mathrm{d}\rho _{j} \end{aligned}$ (12)

can be derived and whose properties we wish to explore. We wish in particular to build a model of the likelihood function from such a covariance, requiring full knowledge of its eigenvalues and eigendirections. In this respect, it is implicit that the number of bins (i) to be used is finite. We nonetheless present at least in this first section the results in the continuous limit for ρ_i. It is finally to be noted that as stated before, we restrict our analysis to covariance matrices, but higher-order correllators might also be considered by generalizing the relation (5) to a higher number of variables.

3. PDF covariances in the context of cosmological models

3.1. Modeling the joint PDF

To make progress, we need to make further assumptions about the mathematical structure of the joint PDF. In the following, we assume in particular that joint PDFs can be obtained from their cumulant generating functions (CGF)⁴, φ(λ_i, λ_j; r_d). The latter is defined as

$\begin{matrix} exp (φ (λ_{i}, λ_{j} ; r_{d})) = ⟨ exp (λ_{i} ρ_{i} + λ_{j} ρ_{j}) ⟩ \\ = \int d ρ_{i} d ρ_{j} P (ρ_{i}, ρ_{j} ; r_{d}) exp (λ_{i} ρ_{i} + λ_{j} ρ_{j}), \end{matrix}$ $\begin{aligned}&\exp \left(\varphi (\lambda _{i},\lambda _{j};r_{d})\right)=\langle \exp \left({\lambda _{i}\rho _{i}+\lambda _{j}\rho _{j}}\right)\rangle \nonumber \\&\qquad \qquad \quad \quad =\int \mathrm{d}\rho _{i}\mathrm{d}\rho _{j}\,P(\rho _{i},\rho _{j};r_d)\,\exp \left({\lambda _{i}\rho _{i}+\lambda _{j}\rho _{j}}\right), \end{aligned}$ (13)

and it is assumed that this relation can be inverted to give the joint PDF from Laplace inverse transformations,

$\begin{matrix} P (ρ_{i}, ρ_{j} ; r_{d}) = \int \frac{d λ_{i}}{2 π i} \frac{d λ_{j}}{2 π i} e^{- λ_{i} ρ_{i} - λ_{j} ρ_{j} + φ (λ_{i}, λ_{j} ; r_{d})}, \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j};r_{d})=\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}}\frac{\mathrm{d}\lambda _{j}}{2\pi \mathrm{i}}e^{-\lambda _{i}\rho _{i}-\lambda _{j}\rho _{j}+\varphi (\lambda _{i},\lambda _{j};r_{d})}, \end{aligned}$ (14)

where the integrations are made a priori along the imaginary axis. The CGFs themselves are closely related to the averaged correlation functions of the underlying field. In the following, we develop models for which these correlation functions can be computed precisely.

3.2. Large-distance contributions to the joint density PDF

We start by assuming that covariances are dominated by long-range correlation and not by proximity effects (e.g., densities taken in nearby regions). Whether this assumption is correct obviously depends on the particular model and setting we consider, as we detail below. There are large sets of models for which general expressions can be given in this regime. They are the so-called hierarchical models, originally introduced in Peebles (1980), discussed in more detail in Fry (1984a,b), Balian & Schaeffer (1989), Bernardeau & Schaeffer (1992), and further formalized in Bernardeau & Schaeffer (1999) as described below; it is also true in the quasilinear regime as originally pointed out in Bernardeau (1996) and derived in more detail in Codis et al. (2016a). In these regimes, we obtain the following functional form (see previous references and the detailed derivation in Appendix B):

$\begin{matrix} φ (λ_{s}, λ_{i}, λ_{j}) = λ_{s} + φ_{0} (λ_{i}) + φ_{0} (λ_{j}) \\ + \frac{λ_{s}^{2}}{2} \int d x_{s} d x_{s}^{'} ξ (x_{s}, x_{s}^{'}) + λ_{s} \int d x_{s} ξ (x_{s}, x_{1}) φ_{1} (λ_{i}) \\ + λ_{s} \int d x_{s} ξ (x_{s}, x_{2}) φ_{1} (λ_{j}) + φ_{1} (λ_{i}) ξ (x_{1}, x_{2}) φ_{1} (λ_{j}), \end{matrix}$ $\begin{aligned}&\varphi (\lambda _{s},\lambda _{i},\lambda _{j})= \lambda _{s}+\varphi _{0}(\lambda _{i})+\varphi _{0}(\lambda _{j}) \nonumber \\&\qquad +\frac{\lambda _{s}^{2}}{2}\int \mathrm{d}\mathbf{x}_{s}\mathrm{d}\mathbf{x}^{\prime }_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}^{\prime }_{s}) +\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{1})\,\varphi _{1}(\lambda _{i})\nonumber \\&\qquad +\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{2})\,\varphi _{1}(\lambda _{j}) +\varphi _{1}(\lambda _{i})\,\xi (\mathbf{x}_{1},\mathbf{x}_{2})\,\varphi _{1}(\lambda _{j}), \end{aligned}$ (15)

where ξ(x, x′) is the two-point correlation function of the density field at positions x and x′, and φ₀(λ) and φ₁(λ) are specific functions of λ that depend on the details of the model.

Then, setting λ_s to zero, we can easily obtain the expression of the joint PDF at leading order in ξ(r_d),

$\begin{matrix} P (ρ_{i}, ρ_{j} ; r_{d}) = P (ρ_{i}) P (ρ_{j}) (1 + b (ρ_{i}) ξ (r_{d}) b (ρ_{j})) . \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j};r_{d})=P(\rho _{i})P(\rho _{j})\left(1+b(\rho _{i})\xi (r_d)\,b(\rho _{j})\right)\!. \end{aligned}$ (16)

Here P(ρ_i) is the one-point density PDF (i.e., implicitly at scale R), and b(ρ_i) is the density-bias function (to be distinguished from the standard halo-bias function). It also depends on ρ_i (and on the scale R) so that in the previous expression, the dependence on ρ_i, ρ_j, and r_d can be factorized out.

To be more precise, P(ρ_i) is given by the inverse Laplace transform of the CGF (see, e.g., Balian & Schaeffer 1989 and Bernardeau 2013 for a detailed derivation of this inversion),

$\begin{matrix} P (ρ_{i}) = \int \frac{d λ}{2 π i} exp (- i λ ρ_{i} + φ_{0} (λ)), \end{matrix}$ $\begin{aligned} P(\rho _{i})=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\exp \left(-\mathrm{i}\lambda \rho _{i}+\varphi _{0}(\lambda )\right)\!, \end{aligned}$ (17)

where φ₀(λ) is the CGF of the density taken at scale R (i.e., for the filter W_R). The function b(ρ_i) is defined through a similar relation,

$\begin{matrix} b (ρ_{i}) P (ρ_{i}) = \int \frac{d λ}{2 π i} φ_{1} (λ) exp (- i λ ρ_{i} + φ_{0} (λ)) . \end{matrix}$ $\begin{aligned} b(\rho _{i})P(\rho _{i})=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\,\varphi _{1}(\lambda )\,\exp \left(-\mathrm{i}\lambda \rho _{i}+\varphi _{0}(\lambda )\right)\!. \end{aligned}$ (18)

The function φ₁(λ) can be explicitly computed in the context of perturbation theory calculations (Codis et al. 2016a). This is the case also for models in the so-called hierarchical models (see appendices). In the latter case, these calculations can be extended to higher order, as we describe below, providing ways to better assess the domain of validity of this expansion.

According to the previous relation, this implies that this form translates into the expression of the covariance coefficients for the density PDF. More precisely, we expect

$\begin{matrix} Cov (ρ_{i}, ρ_{j}) = b (ρ_{i}) P (ρ_{i}) {\bar{ξ}}_{s} b (ρ_{j}) P (ρ_{j}), \end{matrix}$ $\begin{aligned} \mathrm{Cov}(\rho _{i},\rho _{j})=b(\rho _{i})P(\rho _{i})\,\overline{\xi }_{s}\,b(\rho _{j})P(\rho _{j}), \end{aligned}$ (19)

where ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ is the average value of the two-point correlation function ξ(r_d) within the sample.

It is to be noted, however, that this is true if

The term in ${\bar{ξ}}_{s}$ ${\overline{\xi}}_{s}$ is indeed the leading contribution of the expansion (15). This is obviously not the case for samples with periodic boundary conditions, for which ${\bar{ξ}}_{s}$ ${\overline{\xi}}_{s}$ vanishes by construction;
The density is defined regardless of the density of the sample. Its expectation value therefore does not coincide with ρ_s for a given sample.

It can also be noted that in the Gaussian limit, we have b(ρ_i)=δ_i/ξ. Applying the relation (16) to the density within one cell and to the sample density ρ_s = 1 + δ_s leads then to the following expression for the conditional expression of density PDF,

$\begin{matrix} P (ρ_{i} | ρ_{s}) = P (ρ_{i}) (1 + δ_{s} b (ρ_{i})) . \end{matrix}$ $\begin{aligned} P(\rho _{i}\vert \rho _{s})=P(\rho _{i})\left(1+\delta _{s}\,b(\rho _{i})\right). \end{aligned}$ (20)

This leads to the interpretation of the function b(ρ_i) as the response function of the density PDF to the sample density. This means that although the density-bias function cannot be derived from the density PDF alone, we should be able to derive it if we are in possession of an operational method to compute the density PDF for arbitrary cosmological parameters (in a way similar to the derivation of halo-bias function as pioneered in Mo & White 1996). Undoubtedly, this result is closely related to the so-called supersample effects (as described for the power spectra covariance in Takada & Hu 2013), that is, the impact of modes of scale comparable to or larger than the sample size. This is not necessarily their only contribution (because subdominant large-scale contributions can also contribute to the covariance), however, but likely to be the most important contribution, as described below.

The density-bias function obeys the following consistency relations:

$\begin{matrix} \int d ρ b (ρ) P (ρ) = 0, \end{matrix}$ $\begin{aligned}&\int \mathrm{d}\rho \, b(\rho )P(\rho )=0,\end{aligned}$ (21)

$\begin{matrix} \int d ρ ρ b (ρ) P (ρ) = 1, \end{matrix}$ $\begin{aligned}&\int \mathrm{d}\rho \, \rho \ b(\rho )P(\rho )=1, \end{aligned}$ (22)

as initially pointed out in Bernardeau & Schaeffer (1992).

3.3. Case of relative densities

The previous formula applies to the local densities, evaluated regardless of the sample density. It does not apply in particular to standard settings (e.g., densities measured out of galaxy counts) where the density is defined with respect to the mean density of the sample. To address this case in particular, we should consider

$\begin{matrix} {\hat{ρ}}_{i} = \frac{ρ_{i}}{ρ_{s}} \end{matrix}$ $\begin{aligned} \hat{\rho }_{i}=\frac{\rho _{i}}{\rho _{\rm s}} \end{aligned}$ (23)

as the observable for which the covariance is to be computed. In this case, the formal derivation of the PDFs is presented in the appendix, and it leads to

$\begin{matrix} P ({\hat{ρ}}_{i}) = \int \frac{d λ_{i}}{2 π i} {[\frac{\partial φ}{\partial λ_{s}}]}_{|_{λ_{s} = - λ_{i} {\hat{ρ}}_{i}}} exp [φ (- λ_{i} {\hat{ρ}}_{i}, λ_{i})] \end{matrix}$ $\begin{aligned}&P(\hat{\rho }_{i})=\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}} \left[ \frac{\partial \varphi }{\partial \lambda _{s}} \right]_{\big \vert _{\lambda _{s}=-\lambda _{i}\hat{\rho }_{i}}} \exp \left[\varphi (-\lambda _{i}\hat{\rho }_{i},\lambda _{i})\right]\end{aligned}$ (24)

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) = \int \frac{d λ_{i}}{2 π i} \frac{d λ_{j}}{2 π i} {[{(\frac{\partial φ}{\partial λ_{s}})}^{2} + \frac{\partial^{2} φ}{\partial λ_{s}^{2}}]}_{|_{λ_{s} = - λ_{i} {\hat{ρ}}_{i} - λ_{j} {\hat{ρ}}_{j}}} \\ \times exp [φ (- λ_{i} {\hat{ρ}}_{i} - λ_{j} {\hat{ρ}}_{j}, λ_{i}, λ_{j})] . \end{matrix}$ $\begin{aligned}&P(\hat{\rho }_{i},\hat{\rho }_{j})=\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{j}}{2\pi \mathrm{i}} \left[ \left( \frac{\partial \varphi }{\partial \lambda _{s}} \right)^{2}+\frac{\partial ^{2}\varphi }{\partial \lambda _{s}^{2}} \right]_{\big \vert _{\lambda _{s}=-\lambda _{i}\hat{\rho }_{i}-\lambda _{j}\hat{\rho }_{j}}}\nonumber \\&\qquad \qquad \times \exp \left[\varphi (-\lambda _{i}\hat{\rho }_{i}-\lambda _{j}\hat{\rho }_{j},\lambda _{i},\lambda _{j})\right]\!. \end{aligned}$ (25)

We then use relation (15) to compute the form of this function. At this stage, it is to be noted that the expressions $\int d x_{0} d x_{0}^{'} ξ (x_{0}, x_{0}^{'})$ $\smallint{\rm d}{\bf x}_{0}{\rm d}{\bf x}^\prime_{0}\,\xi({\bf x}_{0},{\bf x}^\prime_{0})$ , ∫dx₀ ξ(x₀, x₁) and ξ₁₂ all take the same averaged value when integrated over the sample. We note here this common value as ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ . Inserting the resulting expressions of the CGF in both the expressions of $P ({\hat{ρ}}_{i})$ $P({\hat{\rho}}_{i})$ and $P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j})$ $P({\hat{\rho}}_{i}, {\hat{\rho}}_{j})$ and expanding all terms at linear oder in ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ , we obtain

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) - P ({\hat{ρ}}_{i}) P ({\hat{ρ}}_{j}) = {\bar{ξ}}_{s} \int \frac{d λ_{i}}{2 π i} \frac{d λ_{j}}{2 π i} \\ \times (1 + φ_{1} (λ_{i}) - λ_{i} {\hat{ρ}}_{i}) (1 + φ_{1} (λ_{j}) - λ_{j} {\hat{ρ}}_{j}) \\ \times exp [- λ_{i} {\hat{ρ}}_{i} - λ_{j} {\hat{ρ}}_{j} + φ_{0} (λ_{i}) + φ_{0} (λ_{j})] . \end{matrix}$ $\begin{aligned}&P(\hat{\rho }_{i},\hat{\rho }_{j})-P(\hat{\rho }_{i})\,P(\hat{\rho }_{j})= \overline{\xi }_{s}\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}} \,\frac{\mathrm{d}\lambda _{j}}{2\pi \mathrm{i}} \nonumber \\&\qquad \qquad \times \left( 1+\varphi _{1}(\lambda _{i})-\lambda _{i} \hat{\rho }_{i} \right)\left( 1+\varphi _{1}(\lambda _{j})-\lambda _{j}\hat{\rho }_{j} \right)\ \nonumber \\&\qquad \qquad \times \exp \left[-\lambda _{i} \hat{\rho }_{i}-\lambda _{j} \hat{\rho }_{j}+\varphi _{0}(\lambda _{i})+\varphi _{0}(\lambda _{j})\right]\!. \end{aligned}$ (26)

This leads to the definition of the first sample-bias function,

$\begin{matrix} b_{s 1} ({\hat{ρ}}_{i}) = \frac{1}{P ({\hat{ρ}}_{i})} \int \frac{d λ}{2 π i} (1 + φ_{1} (λ) - λ {\hat{ρ}}_{i}) exp [- λ {\hat{ρ}}_{i} + φ_{0} (λ)], \end{matrix}$ $\begin{aligned} b_{\rm s1}(\hat{\rho }_{i})\! =\! \frac{1}{P(\hat{\rho }_{i})}\! \int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\! \left( 1+\varphi _{1}(\lambda )-\lambda \hat{\rho }_{i} \right) \exp \left[-\lambda \hat{\rho }_{i}+\varphi _{0}(\lambda )\right]\!, \end{aligned}$ (27)

which can be re-expressed in terms of the density-bias function defined in Eq. (18) and the derivative of $P ({\hat{ρ}}_{i})$ $P({\hat{\rho}}_{i})$ with respect to ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$

$\begin{matrix} b_{s 1} ({\hat{ρ}}_{i}) = b ({\hat{ρ}}_{i}) + 1 + \frac{\partial log (P ({\hat{ρ}}_{i}))}{\partial log {\hat{ρ}}_{i}} . \end{matrix}$ $\begin{aligned} b_{\rm s1}(\hat{\rho }_{i})=b(\hat{\rho }_{i})+1+\frac{\partial \log (P(\hat{\rho }_{i}))}{\partial \log \hat{\rho }_{i}}. \end{aligned}$ (28)

In this case, the covariance matrix elements are then expected to be given by

$\begin{matrix} Cov ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) = b_{s 1} ({\hat{ρ}}_{i}) P ({\hat{ρ}}_{i}) {\bar{ξ}}_{s} b_{s 1} ({\hat{ρ}}_{j}) P ({\hat{ρ}}_{j}) . \end{matrix}$ $\begin{aligned} \mathrm{Cov}(\hat{\rho }_{i},\hat{\rho }_{j})=b_{\rm s1}(\hat{\rho }_{i})P(\hat{\rho }_{i})\,\overline{\xi }_{s}\,b_{\rm s1}(\hat{\rho }_{j})P(\hat{\rho }_{j}). \end{aligned}$ (29)

Remarkably, b_s1(ρ) can entirely be expressed in terms of b(ρ). For the sake of completeness, we also consider the case of ${\bar{ρ}}_{i} = ρ_{i} - (ρ_{s} - 1)$ ${\overline{\rho}}_{i}=\rho_{i}-(\rho_{s}-1)$ . In this case, it is easy to show that

$\begin{matrix} P ({\bar{ρ}}_{i}) = \int \frac{d λ_{i}}{2 π i} exp [- λ_{i} {\bar{ρ}}_{i} + φ (- λ_{i}, λ_{i})] \end{matrix}$ $\begin{aligned}&P(\overline{\rho }_{i})= \int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}} \exp \left[-\lambda _{i}\overline{\rho }_{i}+\varphi (-\lambda _{i},\lambda _{i})\right]\end{aligned}$ (30)

$\begin{matrix} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) = \int \frac{d λ_{i}}{2 π i} \frac{d λ_{j}}{2 π i} \\ \times exp [- λ_{i} {\bar{ρ}}_{i} - λ_{j} {\bar{ρ}}_{j} + φ (- λ_{i} - λ_{j}, λ_{i}, λ_{j})] . \end{matrix}$ $\begin{aligned}&P(\overline{\rho }_{i},\overline{\rho }_{j})=\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}} \frac{\mathrm{d}\lambda _{j}}{2\pi \mathrm{i}}\ \nonumber \\&\qquad \qquad \times \exp \left[-\lambda _{i}\overline{\rho }_{i}-\lambda _{j}\overline{\rho }_{j}+\varphi (-\lambda _{i}-\lambda _{j},\lambda _{i},\lambda _{j})\right]\!. \end{aligned}$ (31)

Following the same approach as for the previous case, the leading-order expression in ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ of the connected joint PDF is

$\begin{matrix} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) - P ({\bar{ρ}}_{i}) P ({\bar{ρ}}_{j}) = {\bar{ξ}}_{s} \int \frac{d λ_{i}}{2 π i} \frac{d λ_{j}}{2 π i} \\ \times (φ_{1} (λ_{i}) - λ_{i}) (φ_{1} (λ_{j}) - λ_{j}) \\ \times exp [- λ_{i} {\bar{ρ}}_{i} - λ_{j} {\bar{ρ}}_{j} + φ_{0} (λ_{i}) + φ_{0} (λ_{j})] . \end{matrix}$ $\begin{aligned}&P(\overline{\rho }_{i},\overline{\rho }_{j})-P(\overline{\rho }_{i})\,P(\overline{\rho }_{j})= \overline{\xi }_{s}\,\int \frac{\mathrm{d}\lambda _{i}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{j}}{2\pi \mathrm{i}} \nonumber \\&\qquad \qquad \times \left( \varphi _{1}(\lambda _{i})-\lambda _{i} \right)\left( \varphi _{1}(\lambda _{j})-\lambda _{j} \right)\ \nonumber \\&\qquad \qquad \times \exp \left[-\lambda _{i} \overline{\rho }_{i}-\lambda _{j} \overline{\rho }_{j}+\varphi _{0}(\lambda _{i})+\varphi _{0}(\lambda _{j})\right]\!. \end{aligned}$ (32)

It leads to the definition of the second sample-bias function,

$\begin{matrix} b_{s 2} ({\bar{ρ}}_{i}) = b ({\bar{ρ}}_{i}) + \frac{\partial log (P ({\bar{ρ}}_{i}))}{\partial {\bar{ρ}}_{i}}, \end{matrix}$ $\begin{aligned} b_{\rm s2}(\overline{\rho }_{i})=b(\overline{\rho }_{i})+\frac{\partial \log (P(\overline{\rho }_{i}))}{\partial \overline{\rho }_{i}}, \end{aligned}$ (33)

so that

$\begin{matrix} Cov ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) = b_{s 2} ({\bar{ρ}}_{i}) P ({\bar{ρ}}_{i}) {\bar{ξ}}_{s} b_{s 2} ({\bar{ρ}}_{j}) P ({\bar{ρ}}_{j}) . \end{matrix}$ $\begin{aligned} \mathrm{Cov}(\overline{\rho }_{i},\overline{\rho }_{j})=b_{\rm s2}(\overline{\rho }_{i})P(\overline{\rho }_{i})\,\overline{\xi }_{s}\,b_{\rm s2}(\overline{\rho }_{j})P(\overline{\rho }_{j}). \end{aligned}$ (34)

The three bias functions are therefore closely related. Although the density-bias function b(ρ) cannot be derived from the shape of P(ρ) alone, as mentioned before, the relations between b(ρ) and either b_s1(ρ) and b_s2(ρ) depend on the PDF alone. Furthermore, the two relative density bias functions obey the following consistency relations:

$\begin{matrix} \int d ρ_{i} b_{s #} (ρ_{i}) P (ρ_{i}) = 0 \end{matrix}$ $\begin{aligned}&\int \mathrm{d}\rho _{i}\, b_{s\#}(\rho _{i})P(\rho _{i})=0\end{aligned}$ (35)

$\begin{matrix} \int d ρ_{i} ρ_{i} b_{s #} (ρ_{i}) P (ρ_{i}) = 0 . \end{matrix}$ $\begin{aligned}&\int \mathrm{d}\rho _{i}\, \rho _{i}\ b_{s\#}(\rho _{i})P(\rho _{i})=0. \end{aligned}$ (36)

The second relation is at variance with the corresponding relation (22) for the density-bias function. It indicates that for typical values of ρP(ρ), the sample bias functions, b_s#(ρ), are likely to be smaller than the density-bias function b(ρ).

3.4. Structure of the covariance matrix

The consequences of these formulae on the structure of the covariance matrix are illustrated below with the help of the Rayleigh-Levy flight model. Figure 3 compares the results from exact derivations of the covariance matrix with these prescriptions. The diagonal parts of the covariance matrices are well accounted for by these formulae. The root mean square of the measured local density PDF in particular exhibits the expected density dependence, at least for mild values of the density.

In all the formulae (19), (29), and (34), the expression of the covariance exhibits a simple structure, as it is factorizable in the two densities. This implies, for instance, that the reduced covariance matrix

$\begin{matrix} {Cov}_{reduced} (ρ_{i}, ρ_{j}) = \frac{Cov (ρ_{i}, ρ_{j})}{\sqrt{Cov (ρ_{i}, ρ_{i}) Cov (ρ_{j}, ρ_{j})}} \end{matrix}$ $\begin{aligned} \mathrm{Cov}_{\rm reduced}(\rho _{i},\rho _{j})=\frac{\mathrm{Cov}(\rho _{i},\rho _{j})}{\sqrt{\mathrm{Cov}(\rho _{i},\rho _{i}) \mathrm{Cov}(\rho _{j},\rho _{j})}} \end{aligned}$ (37)

has an extremely simple structure: it is given by the sign of the product of the bias functions (i.e., sign[b(ρ_i)b(ρ_j)], sign[b_s1(ρ_i)b_s1(ρ_j)], and sign[b_s2(ρ_i)b_s2(ρ_j)] for the three different measurement strategies). This leads to the butterfly-like structure in the plotted matrices, as illustrated in Fig. 4. This simple form betrays the fact that the density covariance is only poorly known. To be more specific, formulae (19), (29), and (34) give only a single eigendirection of the covariance matrix (namely b(ρ_i)P(ρ_i)) and the amplitude of a single eigenvalue associated with it. The numerical calculations suggest that it is the leading one when ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ does not vanish, as illustrated on Fig. 7. These formulae do not offer any indication of the amplitude of the covariance in orthogonal directions, however. Taken at face value, they imply that the other eigenvalues all vanish, preventing the covariance matrix from being invertible. These formulae therefore cannot be used alone to model the covariance for practical purposes, and complementary contributions have to be derived from other (and a priori subdominant) effects.

3.5. Beyond leading-order effects

In the previous subsection, we identified the long-distance leading contributions. As mentioned before, this leads to only limited information of the covariance structure. This difficulty is even more acute for covariances evaluated in numerical experiments consisting of a collection of independent samples, each of them with periodic boundary conditions (this does not have to be so, but it is often the case in practice). By construction, the mean correlation function within the sample then vanishes, ${\bar{ξ}}_{s} \to 0,$ ${\overline{\xi}_{s}}\to 0,$ making the term we have computed identically zero. All these considerations indicate that further contributions need to be identified. The identification of the next-to-leading order effects in Eq. (16) is difficult to do a priori, however:

One natural next-to-leading contribution is obtained by taking into account second-order terms in ξ(d) in Eq. (16), that is, by considering doubles lines between cells in a diagrammatic representation. This would induce a term of about ξ(d)², whose average never vanishes⁵. As shown in the appendix, these contributions can be formally derived in the context of the hierarchical models. This leads to correction terms that can be organized in a sum of factorized terms. Therefore, although it can indeed provide corrective terms to the covariance matrix, only a limited number of eigendirections can be generated.
Other contributions naturally come from proximity effects due to the fact that cells are finite, and could even overlap, which makes the expansion in ξ(d) ineffective. In a diagrammatic point of view, they are due to the fact that many more diagrams contributed when cells are too close. This has dramatic effects for overlapping cells. For hierarchical models, an approximate form can be used to help model these effects, which we use below.
Finally, effects due to the fact that discrete tracers are used in count-in-cells statistics might also play a role at short distances. They are also tentatively modeled below.

In the following, we propose some modeling of these effects and explore how they depend on the properties of the survey.

3.5.1. Joint PDF at short distances

There are no general forms for the joint PDF at close distance. The hierarchical models suggest the following form (derived from the saddle point approximation, which is valid for moderate values of $\bar{ξ}$ ${\overline{\xi}}$ and of the density contrast), however:

$\begin{matrix} P_{short dist .} (ρ_{i}, ρ_{j}) d ρ_{i} d ρ_{j} = P (ρ_{m}) \\ \times exp [- \frac{δ_{ρ}^{2}}{ρ_{m}^{α} Δ_{ξ} (d)}] \frac{d ρ_{m} d δ_{ρ}}{\sqrt{π ρ_{m}^{α} Δ_{ξ} (d)}}, \end{matrix}$ $\begin{aligned}&P_{\mathrm{short\ dist.}}(\rho _{i},\rho _{j})\mathrm{d}\rho _{i}\mathrm{d}\rho _{j}=P(\rho _{m}) \nonumber \\&\qquad \qquad \times \exp \left[-\frac{\delta _{\rho }^{2}}{\rho _{m}^{\alpha }\Delta _{\xi }(d)}\right]\frac{\mathrm{d}\rho _{m}\mathrm{d}\delta _{\rho }}{\sqrt{\pi \rho _{m}^{\alpha }\Delta _{\xi }(d)}} , \end{aligned}$ (38)

where ρ_m = (ρ_i + ρ_j)/2 and δ_ρ = (ρ_i − ρ_j)/2, and where α is model-dependent parameter. In other words, the PDF of the difference between ρ_i and ρ_j can be described by a simple Gaussian with a known width driven by the expression of $Δ_{ξ} (d) \equiv \bar{ξ} - ξ_{12} (d),$ $\Delta_{\xi}(d)\equiv {\overline{\xi}}-\xi_{12}(d),$ provided it is small compared to $\bar{ξ}$ ${\overline{\xi}}$ . We note that Δ_ξ(d) obviously vanishes at d = 0, it then leads to a Kronecker δ function at zero separation as expected, and generically scales like d² at short distances⁶. Interestingly, for the minimal tree model the form (38) is exact for α = 1 (see appendix). In general, this is also the expected form based on the saddle point approximation (valid when $\bar{ξ}$ ${\overline{\xi}}$ is small) for generic hierarchical tree models. The value of α can be identified from small-order cumulants,

$\begin{matrix} α = \frac{2}{3} S_{3}, \end{matrix}$ $\begin{aligned} \alpha =\frac{2}{3}S_{3}, \end{aligned}$ (39)

where S₃ is the reduced third-order cumulant,

$\begin{matrix} S_{3} = \frac{⟨ δ^{3} ⟩}{{⟨ δ^{2} ⟩}^{2}} . \end{matrix}$ $\begin{aligned} S_{3}=\frac{\langle \delta ^{3}\rangle }{\langle \delta ^{2}\rangle ^{2}}. \end{aligned}$ (40)

This form is probably not very accurate in general. It can be used to model the impact of close distances to the covariance matrix, however, as shown below.

3.5.2. Poisson noise and minimum separation

A further contribution to this joint PDF can come from discrete effects that arise because the density is evaluated from the counting of discrete tracers (as explored in Szapudi & Colombi 1996 or more recently in Repp & Szapudi 2021). In this subsection, we assume that the density corresponds to the density obtained after application of a top-hat filter and that tracers are Poisson realizations of continuous fields (although it is possible to encounter sub- or over-Poissonnian noises; Friedrich et al. 2018). The use of other filters can be explored but would require specific developments that we do not pursue here. Within such hypotheses then, the joint distribution of counts-in-cells N_i is given by the convolution of the joint density PDF, ℘({ρ_i}), in the continuous limit convolved by Poisson counts-in-cells as

$\begin{matrix} P ({N_{i}}) = \int Π_{i} d ρ_{i} P_{Poisson} (N_{i} ; {\bar{N}}_{i} ρ_{i}) P ({ρ_{i}}), \end{matrix}$ $\begin{aligned} \mathcal{P}(\{N_{i}\})=\int \Pi _{i} \mathrm{d}\rho _{i}\,P_{\mathrm{Poisson}}(N_{i};\overline{N}_{i}\rho _{i})\,\mathcal{P}(\left\{ \rho _{i}\right\} ), \end{aligned}$ (41)

where P_Poisson(N;N̄) is more precisely the probability of having N tracers in a cell where the mean density of tracers is N̄. In practice, N̄_i is given by nV_i, where n is the number density of tracers and V_i is the volume of the cell V_i.

Discrete effects would then induce further scatter between the estimated values of ρ_i and ρ_j. The latter are given by Poisson noise induced by the nonoverlapping parts of the cells, as shown in Szapudi & Colombi (1996), further contributing to the scatter. The scatter in the difference in the number of points is

$\begin{matrix} σ_{Poisson}^{2} = \frac{2}{\bar{N}} ρ_{m} f_{e} (d) . \end{matrix}$ $\begin{aligned} \sigma ^{2}_{\mathrm{Poisson}}=\frac{2}{\overline{N}}\rho _{m}\,f_{e}(d). \end{aligned}$ (42)

It can be incorporated as a contribution to the variance of the PDF of δ_ρ of the form

$\begin{matrix} σ_{Poisson}^{2} = \frac{1}{2 \bar{N}} ρ_{m} f_{e} (d), \end{matrix}$ $\begin{aligned} \sigma ^{2}_{\mathrm{Poisson}}=\frac{1}{2 \overline{N}}\rho _{m}\,f_{e}(d), \end{aligned}$ (43)

where f_e(d) is the fraction of the volume of each cell that does not overlap with the other as a function of the cell distance. For short distances (i.e., for about d ≲ R), it is in the 2D case given by

$\begin{matrix} f_{e} (d) = \frac{2 d}{π R} . \end{matrix}$ $\begin{aligned} f_{e}(d)=\frac{2d}{\pi \,R}. \end{aligned}$ (44)

The expression (43) is then a priori to be added to the variance term that appears in Eq. (38) so that the total variance for the density difference reads

$\begin{matrix} σ_{δ_{ρ}}^{2} (d) = \frac{1}{2} ρ_{i}^{α} Δ_{ξ} (d) + \frac{1}{2 \bar{N}} ρ_{i} f_{e} (d) . \end{matrix}$ $\begin{aligned} \sigma _{\delta _{\rho }}^{2}(d)=\frac{1}{2}\rho ^{\alpha }_{i}\Delta _{\xi }(d)+\frac{1}{2\overline{N}}\rho _{i}\,f_{e}(d). \end{aligned}$ (45)

Equation (38) then fully encodes the fact that nearby cells are likely to have similar densities. This encodes, for instance, that nearby cells are within the same haloes. This contribution is expected to enhance the covariance terms. It shows that the amount of information is limited at small scales: there is therefore a minimum separation between cells smaller than which no gain in precision is expected of PDF measurements. The minimum distance depends on the bin size: d_min is the distance such that the densities in two cells separated by less than d_min are almost certainly in the sale density bin. d_min therefore depends on the bin width Δ_i. From Eq. (45), it is possible to infer this value. We desire

$\begin{matrix} σ_{δ_{ρ}}^{2} (d_{min}) ≪ Δ_{i}^{2} . \end{matrix}$ $\begin{aligned} \sigma _{\delta _{\rho }}^{2}(d_{\min })\ll \Delta _{i}^{2}. \end{aligned}$ (46)

This suggests that a minimum distance between cells can be adopted, given by

$\begin{matrix} d_{min Poisson} = \frac{π Δ_{i}^{2}}{\bar{N}} R . \end{matrix}$ $\begin{aligned} d_{\min \mathrm{Poisson}}=\frac{\pi \Delta _{i}^{2}}{{\overline{N}}}\,R. \end{aligned}$ (47)

The other upper limit comes on d from the expression of Δ_ξ as a function of d. The latter depends on both the shape of the power spectrum and on the filter that is used. In general (e.g., for a Gaussian filter), Δ_ξ(d) scales like d²/R², where R is the filtering radius, with a coefficient c_{n_s} that depends on the power spectrum index n_s and is proportional to its amplitude. Top-hat filters have different analytical properties. We give here the formal expression of Δ_ξ(d) at 2D for a power-law spectrum of index n_s,

$\begin{matrix} \frac{Δ_{ξ} (d)}{\bar{ξ}} & = - \frac{2^{n_{s} - 1} Γ (1 - \frac{n_{s}}{2}) Γ (2 - \frac{n_{s}}{2}) Γ (\frac{1}{2} (n_{s} - 1))}{\sqrt{π} Γ (\frac{1}{2} - \frac{n_{s}}{2}) Γ (\frac{3}{2} - \frac{n_{s}}{2}) Γ (\frac{n_{s}}{2} + 1)} {(\frac{d}{R})}^{1 - n_{s}} \\ \approx 0.72 \frac{d^{3 / 2}}{R^{3 / 2}} for n_{s} = 0.5 . \end{matrix}$ $\begin{aligned} \frac{\Delta _{\xi }(d)}{\overline{\xi }}&=-\frac{2^{n_s\!-\!1} \Gamma \left(1\!-\!\frac{n_s}{2}\right) \Gamma \left(2\!-\!\frac{n_s}{2}\right) \Gamma \left(\frac{1}{2} \left(n_s\!-\!1\right)\right) }{\sqrt{\pi } \Gamma \left(\frac{1}{2}\!-\!\frac{n_s}{2}\right) \Gamma \left(\frac{3}{2}\!-\!\frac{n_s}{2}\right) \Gamma \left(\frac{n_s}{2}+1\right)} \left(\frac{d}{R}\right)^{1\!-\!n_s}\nonumber \\&\!\!\approx 0.72\,\frac{d^{3/2}}{R^{3/2}}\ \mathrm{for }\ n_{s}=0.5. \end{aligned}$ (48)

This is the situation we encounter below in the numerical experiments we perform. This leads to the following form:

$\begin{matrix} d_{min halo} = {(\frac{Δ_{i}^{2}}{0.72 \bar{ξ}})}^{2 / 3} R . \end{matrix}$ $\begin{aligned} d_{\min \mathrm{halo}}=\left(\frac{\Delta _{i}^{2}}{0.72\,\overline{\xi }}\right)^{2/3}\,R. \end{aligned}$ (49)

It is to be noted that it can be in practice a rather short distance, shorter than the filtering scale R. For instance, for a bin width of 1/4, a variance of about unity, d_minhalo is about R/5.

Equation (38), together with the expressions of the bias functions described above, is the main results of this paper. We illustrate below how they can be used to compute the covariance matrices.

4. Hierarchical models

In order to illustrate the previous findings, we make use of toy models for which explicit computations can be made.

4.1. General formalism

Hierarchical models are a class of non-Gaussian fields whose correlation functions follow the so-called hierarchical ansatz,

$\begin{matrix} ξ_{p} (r_{1}, \dots, r_{p}) = \sum_{t \in trees} Q_{p} (t) \prod_{lines \in t} ξ (r_{i}, r_{j}), \end{matrix}$ $\begin{aligned} \xi _{p}(\mathbf{r}_{1},\dots ,\mathbf{r}_{p})=\sum _{t\in \mathrm{trees}}Q_{p}(t)\,\prod _{\mathrm{lines}\in t}\xi (\mathbf{r}_{i},\mathbf{r}_{j}), \end{aligned}$ (50)

where the sum is made over all possible trees that join the p points (diagram without loops), and the tree value is obtained by the product of a fixed weight (that depends only on the tree topology) and the product of the two-point correlation functions for all pairs that are connected together in the given tree. This construction ensures that the average p-point function, $\bar{ξ_{p}}$ $\overline{\xi_{p}}$ , scales like the ${\bar{ξ}}^{p - 2}$ ${\overline{\xi}}^{p-2}$ , where $\bar{ξ}$ ${\overline{\xi}}$ is the averaged two-point function. More precisely, there are S_p parameters such that

$\begin{matrix} \bar{ξ_{p}} = S_{p} {\bar{ξ}}^{p - 2} . \end{matrix}$ $\begin{aligned} \overline{\xi _{p}}=S_{p}\,\overline{\xi }^{p-2}. \end{aligned}$ (51)

The precise value of the S_p parameters depend on the Q_p parameters and on the averages of the product of ξ(r_ij) functions. A very good approximation is to assume that the average of the products of this function is given by the product of these averages. Then the S_p coefficients depend solely on Q_p,

$\begin{matrix} S_{p} = \sum_{t} Q_{p} (t) . \end{matrix}$ $\begin{aligned} S_{p}=\sum _{t}Q_{p}(t). \end{aligned}$ (52)

4.2. The (minimal) tree model

The tree models are based on a further assumption on the Q_p parameters. It is basically assumed that tree expressions can be computed locally⁷, that is,

$\begin{matrix} Q (t) = \prod_{vertices \in t} ν_{p}, \end{matrix}$ $\begin{aligned} Q(t)=\prod _{\mathrm{vertices}\in t} \nu _{p}, \end{aligned}$ (53)

where ν_p is a weight attributed to all vertices with p incoming lines (ν₀ = ν₁ = 1 for completion). In this formalism, the vertex generating function is generally introduced,

$\begin{matrix} ζ (τ) = \sum_{p} \frac{ν_{p}}{p!} τ^{p} . \end{matrix}$ $\begin{aligned} \zeta (\tau )=\sum _{p}\frac{\nu _{p}}{p!}\tau ^{p}. \end{aligned}$ (54)

The minimal tree model is a model in which ν₂ alone does not vanish. In the minimal model⁸, its value is fixed and is given by ν₂ = 1/2, so that

$\begin{matrix} ζ_{RL} (τ) = {(1 + τ / 2)}^{2} . \end{matrix}$ $\begin{aligned} \zeta _{\mathrm{RL}}(\tau )=(1+\tau /2)^{2}. \end{aligned}$ (55)

Together with the Gaussian case (which corresponds to ζ(τ)=1 + τ), this is the only case for which we are sure that it can be effectively built (in the sense that other models may be unphysical).

In this model, it is possible to build the cumulant generating function of the local density. For the one-point case, assuming the mean-field approximation, the CGF is given by

$\begin{matrix} φ (λ) = λ [ζ (τ) - \frac{1}{2} τ ζ^{'} (τ)] \end{matrix}$ $\begin{aligned} \varphi (\lambda )=\lambda \left[\zeta (\tau )-\frac{1}{2}\tau \zeta ^{\prime }(\tau )\right] \end{aligned}$ (56)

with

$\begin{matrix} τ = λ \bar{ξ} ζ^{'} (τ) . \end{matrix}$ $\begin{aligned} \tau =\lambda \,\overline{\xi }\,\zeta ^{\prime }(\tau ). \end{aligned}$ (57)

This is not the result of large deviation principle calculations, but of mere combinatorics, although it leads to the same formal transformation between the CGF and the vertex-generating function. In case of the minimal model, Eq. (57) takes a simple form that can be easily solved. We finally have

$\begin{matrix} φ (λ) = \frac{τ (λ)}{\bar{ξ}}, τ (λ) = \frac{λ \bar{ξ}}{1 - λ \bar{ξ} / 2} . \end{matrix}$ $\begin{aligned} \varphi (\lambda )=\frac{\tau (\lambda )}{\overline{\xi }},\ \ \ \tau (\lambda )=\frac{\lambda \overline{\xi }}{1-\lambda \overline{\xi }/2}. \end{aligned}$ (58)

The one-point PDF of the density can then be computed explicitly (see appendix),

$\begin{matrix} P (ρ) = \frac{4}{{\bar{ξ}}^{2}} exp [- \frac{2}{\bar{ξ}} (1 + ρ)]_{0} F_{1} (2, \frac{4 ρ}{\bar{ξ}}), \end{matrix}$ $\begin{aligned} P(\rho )=\frac{4}{\overline{\xi }^{2}}\exp \left[-\frac{2}{\overline{\xi }}(1+\rho )\right] \ _{0}F_{1}\left(2,\frac{4\rho }{\overline{\xi }}\right) , \end{aligned}$ (59)

as can the density-bias function,

$\begin{matrix} b (ρ) = \frac{_{0} F_{1} (1, \frac{4 ρ}{\bar{ξ}})}{_{0} F_{1} (2, \frac{4 ρ}{\bar{ξ}})} - \frac{2}{\bar{ξ}}, \end{matrix}$ $\begin{aligned} b(\rho )=\frac{\ _{0}F_{1}\left(1,\frac{4\rho }{\overline{\xi }}\right)}{\ _{0}F_{1}\left(2,\frac{4\rho }{\overline{\xi }}\right)}-\frac{2}{\overline{\xi }}, \end{aligned}$ (60)

where $\bar{ξ}$ ${\overline{\xi}}$ is the averaged two-point correlation function within the cell.

4.3. Rayleigh-Levy flight model

The minimal tree model can be implemented with Rayleigh-Levy random walks (or rather Rayleigh-Levy flights, as described in Peebles 1980). This is a Markov random walk where the PDF of the step length ℓ follows a power law,

$\begin{matrix} P (ℓ) \sim \frac{1}{ℓ^{α}}, \end{matrix}$ $\begin{aligned} P(\ell )\sim \frac{1}{\ell ^{\alpha }}, \end{aligned}$ (61)

with a regularizing cutoff at small separation, and where α satisfies

$\begin{matrix} 0 < α < 2 . \end{matrix}$ $\begin{aligned} 0 < \alpha < 2. \end{aligned}$ (62)

The sample points are then all the step points reached by the walker.

More precisely, the cumulative distribution function of step of length ℓ is

$\begin{matrix} P (> ℓ_{0}) = 1, \end{matrix}$ $\begin{aligned}&P(>\ell _{0})=1,\end{aligned}$ (63)

$\begin{matrix} P (> ℓ) = {(\frac{ℓ_{0}}{ℓ})}^{α} for ℓ > ℓ_{0}, \end{matrix}$ $\begin{aligned}&P(>\ell )=\left(\frac{\ell _{0}}{\ell }\right)^{\alpha }\ \ \mathrm{for}\ \ \ell >\ell _{0}, \end{aligned}$ (64)

where ℓ₀ is a small-scale parameter. The two- and higher-order correlation functions can then be explicitly computed. Starting with a first point at position r₀, the density of the subsequent point (first descendant) at position r is given by

$\begin{matrix} f_{1} (r) = \frac{α}{2 π} \frac{ℓ_{0}^{α}}{| r - r_{0} |^{2 + α}} in 2 D space ; \end{matrix}$ $\begin{aligned}&f_{1}(\mathbf{r})=\frac{\alpha }{2\pi }\frac{\ell _{0}^{\alpha }}{\vert \mathbf{r}-\mathbf{r}_{0}\vert ^{2+\alpha }}\ \ \mathrm{in\ 2D\ space};\end{aligned}$ (65)

$\begin{matrix} f_{1} (r) = \frac{α}{4 π} \frac{ℓ_{0}^{α}}{| r - r_{0} |^{3 + α}} in 3 D space . \end{matrix}$ $\begin{aligned}&f_{1}(\mathbf{r}) =\frac{\alpha }{4\pi }\frac{\ell _{0}^{\alpha }}{\vert \mathbf{r}-\mathbf{r}_{0}\vert ^{3+\alpha }}\ \ \mathrm{in\ 3D\ space}. \end{aligned}$ (66)

In the following, the dimension of space is denoted D. The density of the descendants, assuming there are an infinity of them, of a point at position r₀ is then given by a series of convolutions,

$\begin{matrix} f (r_{0}, r) = f_{1} (r) + \int d^{D} r_{1} f_{1} (r - r_{1}) f_{1} (r_{1} - r_{0}) + \dots, \end{matrix}$ $\begin{aligned} f(\mathbf{r}_{0},\mathbf{r})=f_{1}(\mathbf{r})+\int \mathrm{d}^{D}\mathbf{r}_{1}\ f_{1}(\mathbf{r}-\mathbf{r}_{1})f_{1}(\mathbf{r}_{1}-\mathbf{r}_{0})+\dots , \end{aligned}$ (67)

with subsequent convolutions and where the integral is done in the whole space. Defining the Fourier transform of f₁(r) as ψ(k),

$\begin{matrix} ψ (k) = \int d^{D} r f_{1} (r) e^{- i k . r}, \end{matrix}$ $\begin{aligned} \psi (k)=\int \mathrm{d}^{D}\mathbf{r}\ f_{1}(\mathbf{r})\,e^{-\mathrm{i}\mathbf{k}.\mathbf{r}}, \end{aligned}$ (68)

which is then a function of k only, it is easy to see that

$\begin{matrix} f (r_{0}, r) = \int \frac{d^{D} k}{{(2 π)}^{D}} e^{i k . (r - r_{0})} [ψ (k) + ψ^{2} (k) + \dots] \\ = \int \frac{d^{D} k}{{(2 π)}^{D}} e^{i k . (r - r_{0})} \frac{1}{1 - ψ (k)}, \end{matrix}$ $\begin{aligned}&f(\mathbf{r}_{0},\mathbf{r})=\int \frac{\mathrm{d}^{D}\mathbf{k}}{(2\pi )^{D}}\ e^{\mathrm{i}\mathbf{k}.(\mathbf{r}-\mathbf{r}_{0})}\left[\psi (k)+\psi ^{2}(k)+\dots \right]\nonumber \\&\qquad \quad =\int \frac{\mathrm{d}^{D}\mathbf{k}}{(2\pi )^{D}}\ e^{\mathrm{i}\mathbf{k}.(\mathbf{r}-\mathbf{r}_{0})}\,\frac{1}{1-\psi (k)}, \end{aligned}$ (69)

where we take advantage of the expression of convolutions in Fourier space and their resummations. The two-point correlation function is then given by two possible configurations: a neighbor can either be an ascendant or a descendant, so that the two-point correlation functions between positions r₁ and r₂ are given by

$\begin{matrix} ξ_{2} (r_{1}, r_{2}) = \frac{1}{n} [f (r_{1}, r_{2}) + f (r_{2}, r_{1})] \\ = \frac{1}{n} \int \frac{d^{D} k}{{(2 π)}^{D}} e^{i k . (r_{2} - r_{1})} \frac{2}{1 - ψ (k)}, \end{matrix}$ $\begin{aligned}&\xi _{2}(\mathbf{r}_{1},\mathbf{r}_{2})=\frac{1}{n}\left[f(\mathbf{r}_{1},\mathbf{r}_{2})+f(\mathbf{r}_{2},\mathbf{r}_{1})\right]\nonumber \\&\qquad \quad \ \ =\frac{1}{n} \int \frac{\mathrm{d}^{D}\mathbf{k}}{(2\pi )^{D}}\ e^{\mathrm{i}\mathbf{k}.(\mathbf{r}_{2}-\mathbf{r}_{1})}\,\frac{2}{1-\psi (k)}, \end{aligned}$ (70)

where n is the number density of points in the sample that can be associated with a typical length ℓ_n,

$\begin{matrix} n = \frac{1}{ℓ_{n}^{D}} . \end{matrix}$ $\begin{aligned} n=\frac{1}{\ell _{n}^{D}}. \end{aligned}$ (71)

At large scale, this expression causes the power spectra to be power laws. They scale like k^−α, and the resulting two-point correlation function then takes the form in the large separation limit,

$\begin{matrix} ξ_{α, 2 D} (r) = \frac{α}{π} r^{α - 2} ℓ_{0}^{- α} ℓ_{n}^{2} \end{matrix}$ $\begin{aligned}&\xi _{\alpha ,\,2D}(r)=\frac{\alpha }{\pi }\ r^{\alpha -2} \ell _{0}^{-\alpha }\ell _{n}^{2}\end{aligned}$ (72)

$\begin{matrix} ξ_{α, 3 D} (r) = \frac{(1 - α^{2}) tan (\frac{π α}{2})}{π^{2}} r^{α - 3} ℓ_{0}^{- α} ℓ_{n}^{3} . \end{matrix}$ $\begin{aligned}&\xi _{\alpha ,\,3D}(r) =\frac{\left(1-\alpha ^2\right) \tan \left(\frac{\pi \alpha }{2}\right)}{\pi ^2}\ r^{\alpha -3} \ell _{0}^{-\alpha }\ell _{n}^{3} . \end{aligned}$ (73)

It is to be noted, however, that this expression does not take into account the boundary conditions, in particular if they are assumed to be periodic. This case is examined in some detail in the next paragraph. It is to be noted, however, that in this case, the function ξ(r) has a more complex form. It is in particular no more isotropic.

Higher-order correlation functions can also be computed in this model: n points are correlated when they are embedded in a chronological sequence that can be run in one direction or the other. Thus the three-point function is simply given by

$\begin{matrix} ξ_{α} (r_{1}, r_{2}, r_{3}) = \frac{1}{n^{2}} [f (r_{1}, r_{2}) f (r_{2}, r_{3}) + \dots], \end{matrix}$ $\begin{aligned} \xi _{\alpha }(\mathbf{r}_{1},\mathbf{r}_{2},\mathbf{r}_{3})=\frac{1}{n^{2}}\left[f(\mathbf{r}_{1},\mathbf{r}_{2})f(\mathbf{r}_{2},\mathbf{r}_{3})+\cdots \right] , \end{aligned}$ (74)

with five other terms obtained by all combinations of the indices. Expressing the result in terms of the two-point function, we have

$\begin{matrix} ξ_{α} (r_{1}, r_{2}, r_{3}) = \frac{1}{2} [ξ_{α} (r_{1}, r_{2}) ξ_{α} (r_{2}, r_{3}) + \\ ξ_{α} (r_{2}, r_{3}) ξ_{α} (r_{3}, r_{1}) + ξ_{α} (r_{3}, r_{1}) ξ_{α} (r_{1}, r_{2})], \end{matrix}$ $\begin{aligned}&\xi _{\alpha }(\mathbf{r}_{1},\mathbf{r}_{2},\mathbf{r}_{3})=\frac{1}{2}\left[\xi _{\alpha }(\mathbf{r}_{1},\mathbf{r}_{2})\xi _{\alpha }(\mathbf{r}_{2},\mathbf{r}_{3})+\right.\nonumber \\&\qquad \qquad \left.\xi _{\alpha }(\mathbf{r}_{2},\mathbf{r}_{3})\xi _{\alpha }(\mathbf{r}_{3},\mathbf{r}_{1})+\xi _{\alpha }(\mathbf{r}_{3},\mathbf{r}_{1})\xi _{\alpha }(\mathbf{r}_{1},\mathbf{r}_{2}) \right] , \end{aligned}$ (75)

corresponding to a tree structure with ν₂ = 1/2.

Higher-order correlation functions can be computed similarly. They follow a tree structure in the sense above, with ν₂ = 1/2 and ν_p = 0 for p ≥ 3.

4.4. Periodic boundary conditions

We briefly explore here the case of periodic boundary conditions. Then the multipoint density field g^PBC(r_i) for periodic boundary conditions can be expressed in terms of the former density field g(r_i) as sums of all copies of the sample, that is,

$\begin{matrix} g^{PBC} ({r_{i}}) = \sum_{n_{i}} g ({r_{i} + n_{i} L}), \end{matrix}$ $\begin{aligned} g^{\mathrm{PBC}}(\{\mathbf{r}_{i}\})=\sum _{\mathbf{n}_{i}}g(\{\mathbf{r}_{i}+\mathbf{n}_{i}L\}), \end{aligned}$ (76)

where n_i are vectors whose components are integers, $n_{i} = (n_{i}^{x}, n_{i}^{y}, n_{i}^{z})$ ${\textbf{n}}_{i}=(n_{i}^{x},n_{i}^{\mathit{y}},n_{i}^{z})$ and the sums run over all integer values for all i; L is the size of the sample (assumed to be the same in all directions).

When it is applied in this context, we can construct the n-point density function out of the density function f computed previously. Thus the two-point density function is given by

$\begin{matrix} g^{PBC} (r_{1}, r_{2}) = n^{PBC} \sum_{n_{12}} f (r_{1}, r_{2} - r_{1} + n_{12} L), \end{matrix}$ $\begin{aligned} g^{\mathrm{PBC}}(\mathbf{r}_{1},\mathbf{r}_{2})=n^{\mathrm{PBC}}\sum _{\mathbf{n}_{12}}f(\mathbf{r}_{1},\mathbf{r}_{2}-\mathbf{r}_{1}+\mathbf{n}_{12}L), \end{aligned}$ (77)

where n₁₂ = n₂ − n₁ and n^PBC is the resulting one-point (and therefore homogeneous) density in the sample. This expression is therefore written in terms of the function

$\begin{matrix} f^{PBC} (r_{0}, r) = \sum_{n} f (r_{0}, r - r_{0} + n L) . \end{matrix}$ $\begin{aligned} f^{\mathrm{PBC}}(\mathbf{r}_{0},\mathbf{r})= \sum _{\mathbf{n}}f(\mathbf{r}_{0},\mathbf{r}-\mathbf{r}_{0}+\mathbf{n}L). \end{aligned}$ (78)

We can now compute its expression in terms of the power spectra, or more specifically, the function ψ(k) defined previously. We have

$\begin{matrix} f^{PBC} (r_{0}, r) = \int \frac{d^{D} k}{{(2 π)}^{D}} e^{i k . (r - r_{0})} \sum_{n} e^{i n . k L} \frac{1}{1 - ψ (k)}, \end{matrix}$ $\begin{aligned} f^{\mathrm{PBC}}(\mathbf{r}_{0},\mathbf{r})= \int \frac{\mathrm{d}^{D}\mathbf{k}}{(2\pi )^{D}}\ e^{\mathrm{i}\mathbf{k}.(\mathbf{r}-\mathbf{r}_{0})}\,\sum _{\mathbf{n}}e^{\mathrm{i}\mathbf{n}.\mathbf{k}\,L}\frac{1}{1-\psi (k)}, \end{aligned}$ (79)

and the latter sum ensures that the contributing wave modes k are only those that are periodic in all three directions, that is, those whose components are multiples of 2π/L so that

$\begin{matrix} f^{PBC} (r_{0}, r) = \sum_{n^{*}} \frac{1}{L^{D}} e^{2 π i n . (r - r_{0}) / L} \frac{1}{1 - ψ (k_{n})}, \end{matrix}$ $\begin{aligned} f^{\mathrm{PBC}}(\mathbf{r}_{0},\mathbf{r})=\sum _{\mathbf{n}^{*}}\frac{1}{L^{D}}\ e^{2\pi \mathrm{i}\ \mathbf{n}.(\mathbf{r}-\mathbf{r}_{0})/L}\,\frac{1}{1-\psi (k_{n})}, \end{aligned}$ (80)

with

$\begin{matrix} k_{n} = {(n . n)}^{1 / 2} \frac{2 π}{L}, \end{matrix}$ $\begin{aligned} k_{n}=(\mathbf{n}.\mathbf{n})^{1/2}\frac{2\pi }{L}, \end{aligned}$ (81)

and where the sum is all over possible integer triplets but n = (0, 0, 0). The two-point correlation function is now given by

$\begin{matrix} ξ_{α}^{PBC} (r_{1}, r_{2}) = \frac{1}{n^{PBC}} [f^{PBC} (r_{1}, r_{2}) + f (r_{2}, r_{1})] . \end{matrix}$ $\begin{aligned} \xi _{\alpha }^{\mathrm{PBC}}(\mathbf{r}_{1},\mathbf{r}_{2})=\frac{1}{n^{\mathrm{PBC}}}\left[f^{\mathrm{PBC}}(\mathbf{r}_{1},\mathbf{r}_{2})+f(\mathbf{r}_{2},\mathbf{r}_{1})\right] . \end{aligned}$ (82)

A similar result can be obtained for the three-point correlation function with

$\begin{matrix} ξ_{α}^{PBC} (r_{1}, r_{2}, r_{3}) = \frac{1}{{(n^{PBC})}^{2}} [f^{PBC} (r_{1}, r_{2}) f^{PBC} (r_{2}, r_{3}) + \dots] . \end{matrix}$ $\begin{aligned} \xi _{\alpha }^{\mathrm{PBC}}(\mathbf{r}_{1},\mathbf{r}_{2},\mathbf{r}_{{3}})\!=\! \frac{1}{(n^{\mathrm{PBC}})^{2}}\left[f^{\mathrm{PBC}}(\mathbf{r}_{1},\mathbf{r}_{2})\ f^{\mathrm{PBC}}(\mathbf{r}_{2},\mathbf{r}_{3})+\cdots \right]. \end{aligned}$ (83)

As a consequence, the functional relation between the three-point correlation function and the two-point correlation function is left unchanged. This is also the case at all orders.

4.5. Covariance matrix of the minimal tree model

Remarkably, in case of the minimal tree model, the derivation of the CGF can also be made for multiple cells, and in particular, for two cells. Its expression is derived in the appendix. It takes the form

$\begin{matrix} φ (λ_{1}, λ_{2}) = \frac{λ_{1} + λ_{2} + (ξ_{12} - \bar{ξ}) λ_{1} λ_{2}}{1 - (λ_{1} + λ_{2}) \bar{ξ} / 2 - λ_{1} λ_{2} (ξ_{12}^{2} - {\bar{ξ}}^{2}) / 4} . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})= \frac{\lambda _{1}+\lambda _{2}+(\xi _{12}-\overline{\xi })\lambda _{1}\lambda _{2}}{1-(\lambda _{1}+\lambda _{2})\,\overline{\xi }/2-\lambda _{1}\lambda _{2}\,(\xi _{12}^{2}-\overline{\xi }^{2})/4}. \end{aligned}$ (84)

In this case, it is then possible to expand its expression in powers of ξ₁₂ for distant cells or in powers of $(\bar{ξ} - ξ_{12})$ $({\overline{\xi}}-\xi_{12})$ for close cells, and in both cases, closed forms can be obtained to any order. It leads to the possibility of computing the joint PDF for any configuration (see the appendix for details) and finally to evaluate the covariance matrix directly. This is even possible for any of the thre sets of variables we consider, {ρ_i}, ${{\hat{ρ}}_{i}}$ $\{{\hat{\rho}}_{i}\}$ , or ${{\bar{ρ}}_{i}}$ $\{{\overline{\rho}}_{i}\}$ .

We performed these computations for the minimal tree model with a power-law behavior ξ(r)∼r^−1.5 (α = 0.5), a 2D survey with a size of 200² pixels, and a top-hat smoothing radius of 4.25 pixels. The amplitude of the correlation function was fixed to give $\bar{ξ} = 1.09$ ${\overline{\xi}}=1.09$ at the smoothing scale. It precisely corresponds to the setting of the numerical simulations of Rayleigh-Levy flights we also performed, as described in the next section. It allows us to compare the two approaches. These analytic results have two limitations: the results are based on the mean-field approximation for the computation of the two-variable GCF, and the covariance elements are computed ignoring the bin sizes (i.e., by evaluating the expression of the covariance for their central values). Although in most cases this should not be an issue, it still might have a non-negligible impact when the PDF varies rapidly with the density.

5. Simplified models of the covariance matrix

The purpose of this section is then to propose two levels of modeling of the covariance matrix based on the previous results and to compare these propositions with results of either the full analytic results presented before or with the results of numerical experiments based on Rayleigh-Levy flights.

5.1. Modeling the covariance matrix

More specifically, we considered two approximate forms for the full covariance. The first approximation is fully analytic. It makes use of the large-scale contributions and those from the short distance expression (38). It reads as the sum of the two contributions

$\begin{matrix} Cov (ρ_{i}, ρ_{j}) = b_{#} (ρ_{i}) P (ρ_{i}) {\bar{ξ}}_{s} b_{#} (ρ_{j}) P (ρ_{j}) \\ + \int_{0}^{r_{max}} d r_{d} P_{s} (r_{d}) P_{short dist .} (ρ_{i}, ρ_{j}, r_{d}) . \end{matrix}$ $\begin{aligned}&\mathrm{Cov}(\rho _{i},\rho _{j})=b_{\#}(\rho _{i})P(\rho _{i})\overline{\xi }_{s}b_{\#}(\rho _{j})P(\rho _{j}) \nonumber \\&\qquad \qquad + \int _{0}^{r_{\max }} \mathrm{d}r_{d}\,P_{s}(r_{d})\, P_{\mathrm{short\ dist.}}(\rho _{i},\rho _{j},r_{d}). \end{aligned}$ (85)

In this expression, the only free parameter is r_max. This is indeed a crucial parameter as it determines to a large extent the amplitude of the short-distance effects. In the following, we take r_max = R, that is, the filtering scale. It is found to give a good result for the 2D case and for n_s = −1/2, but this choice is likely to depend on the shape of the power spectrum. In general, this formula is intended to give a good account of the general properties of the covariance matrix, it cannot provide reliable quantitative results a priori.

The other form we propose is intended to be much more precise quantitatively. Is is given by the following expression:

$\begin{matrix} Cov (ρ_{i}, ρ_{j}) = b_{#} (ρ_{i}) P (ρ_{i}) {\bar{ξ}}_{s} b_{#} (ρ_{j}) P (ρ_{j}) + {Cov}^{PBC} (ρ_{i}, ρ_{j}), \end{matrix}$ $\begin{aligned} \mathrm{Cov}(\rho _{i},\rho _{j})=b_{\#}(\rho _{i})P(\rho _{i})\overline{\xi }_{s}b_{\#}(\rho _{j})P(\rho _{j}) +\mathrm{Cov}^{\mathrm{PBC}}(\rho _{i},\rho _{j}) , \end{aligned}$ (86)

where Cov^PBC(ρ_i, ρ_j) is the expression of the covariant matrix for periodic boundary conditions. It is obtained here simply by replacing $P (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12} (r_{d}))$ $P(\rho_{i},\rho_{j},{\overline{\xi}},\xi_{12}(r_d))$ by $P (ρ_{i}, ρ_{j}, \bar{ξ} - {\bar{ξ}}_{s}, ξ_{12} (r_{d}) - {\bar{ξ}}_{s})$ $P(\rho_{i},\rho_{j},{\overline{\xi}}-{\overline{\xi}_{s}},\xi_{12}(r_d)-{\overline{\xi}_{s}})$ before integrating over r_d so that the averaged joint correlations vanish identically. The rationale for this proposition is that Cov^PBC(ρ_i, ρ_j) could be more easily estimated from specific numerical experiments. In both cases, the short-distance contributions are the same for the three types of observables ρ_i, ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$ , and ${\bar{ρ}}_{i}$ ${\overline{\rho}}_{i}$ . These forms are then compared to numerical results.

5.2. Numerical experiments with the Rayleigh-Levy flight model

A series of experiments of 2D walks with a large number of samples were performed as described below. We restricted our analysis to α = 0.5 with l₀ = 0.003 pixel size (the dependence on l₀ was tested as illustrated on Fig. 2, where l₀ = .006 was also used, but the analyses were made for a fixed value of l₀). Figure 1 illustrates how points are distributed in these samples. The point distribution does not show the filamentary structure of realistic cosmological simulations. It exhibits the presence of concentrated halos surrounded by empty regions, however, which are reminiscent of the structure of the largest matter concentrations of the cosmic web.

Fig. 1.

Example of a realization of a Rayleigh-Levy walk. Points mark the end point of each displacement. They are clearly correlated.

Two different setting were employed to explore different aspects of the results that were found:

Set 𝒜: 1600 samples extracted from a single numerical realization (with periodic boundary conditions) with a size of 8000 × 8000 pixels² containing 64 × 10⁶ points. Each sample then has 200 × 200 pixels² containing an average of 200² points each. For this set of samples, the average and covariance of the PDF were extracted following the three procedures mentioned before: either the density was taken with respect to the mean density of the realization, with respect to the density of each sample, or by subtracting the sample density. It therefore corresponds to an evaluation of the mean and covariance of the PDF of ρ_i, ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$ , and ${\bar{ρ}}_{i}$ ${\overline{\rho}}_{i}$ , respectively.
Set 𝓑: 1600 samples, each with periodic boundary conditions, with a size of 200 × 200 pixels² containing 200² points each. By construction, the average two-point function in the sample, ${\bar{ξ}}_{s}$ ${\overline{\xi}}_{s}$ , vanishes in this case, and covariance is entirely due to proximity effects.

In each case, the local density was obtained after a filtering procedure. The point positions were first pixelized, that is, each point was attributed to a pixel so that the mean number of points per pixel was one. The field was then filtered by a (quasi) circular top-hat functions. In practice, the number of pixels in the window function was 57. This makes the effective smoothing radius about 4.25 in pixel units. The resulting density was then measured at each pixel location. Their histograms were then computed after density binning. To avoid large undue discrete effects, the bin width was chosen to be a multiple of 1/57, and in order to ensure that the requirement (49) was met at the pixel distance, we chose a bin width of about 1/4, more precisely, of 14/57.

Figure 2 shows the resulting PDF as measured in the simulations and how it compares to the theoretical prediction, Eq. (59), for two different choices of l₀. The expected scaling for $\bar{ξ}$ ${\overline{\xi}}$ is recovered. The measured PDF also follows theoretical predictions for a wide range of probabilities remarkably well. It gives us confidence in the whole procedure and in the approaches used to compute PDFs in this model. The detailed comparisons were made for l₀ = 0.003, leading to $\bar{ξ} = 1.09,$ ${\overline{\xi}}=1.09,$ and a sample density variance in sets 𝒜 given by ${\bar{ξ}}_{s} = 0.09$ ${\overline{\xi}}_{s}=0.09$ .

Fig. 2.

One-point density PDF obtained with top-hat filters compared with the theoretical predictions, Eq. (59). The values of $\bar{ξ}$ ${\overline{\xi}}$ are 0.8 and 1.09 for the blue and red curves, respectively, corresponding to two different values of l₀. The bottom panel shows the residuals. Departure from theory might be due to binning and/or to the finite number of samples.

The measured variance of the density PDF is obtained from 1600 samples in each case. The resulting shapes are presented in Figs. 3–5 for the different cases, density in a supersample realization, and in samples with periodic boundary conditions. The results show the comparison between results obtained in the numerical simulations with yellow symbols, and results derived from the analytic prescriptions as blue dots, based on the mean-field approximation. The agreement between the two is very good. The overall shape of the variance and its dependence on the density is well reproduced. Discrepancies can be observed for densities above 4 or 5, however, where the theoretical predictions are seen to underestimate the results. The reasons for these discrepancies are not clear at this stage. A possible explanation might be the finite number of samples that is used to infer the variances⁹. The variance of the density PDF is also compared with the large-scale contributions (19), (29), and (34) for set 𝒜 depending on the cases (at this order, the covariance vanishes for set 𝒝). It shows that this formula captures some features of the variance (especially at low and moderate densities), but does not account for all. This is also illustrated in Fig. 4, which shows the reduced covariance. The fact that the covariance is determined to a large extent by its leading large-scale contribution leads to values of the reduced covariance close to 1 or −1, leading to these butterfly patterns. Proximity effects, not captured in these forms, also contribute to the covariances at a significant level, however. This is already apparent in Fig. 3.

Fig. 3.

Measured variance of the density PDF, i.e., diagonal elements of the covariance matrix, in sets 𝒜 for α = 0.5 and different prescription of the measured density. From left to right, raw density ρ_i, scaled density ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$ , and scaled density ${\bar{ρ}}_{i}$ ${\overline{\rho}}_{i}$ . The blue dots and solid lines are from the mean-field analytical expressions, and the large gold symbols are from the numerical simulations. The dashed black lines are what is expected from the large-scale leading contribution. The variance at cell scale is about 1.09, and the variance at sample scale, ${\bar{ξ}}_{s}$ ${\overline{\xi}}_{s}$ , is about 0.09.

Fig. 4.

Resulting reduced covariance matrix for the three types of observables for set 𝒜. The covariance matrix is dominated by its leading eigenvalue and direction, leading to this typical butterfly shape of the reduced covariance matrix.

Fig. 5.

Measured variance of the density PDF obtained for set 𝒝). Symbols are the same as in Fig. 3.

5.3. Testing models of covariance matrices

Expressions (85) and (86) are precise propositions to show how the large-scale contributions can be completed to account for the full form of the covariance. The comparisons between the predicted form and those obtained from the numerical experiments are explored in detail at different levels and using the following criteria:

Amplitude of the PDF variance,
Density dependence of the first eigenvalue of the covariance matrix,
Amplitude of the eigenvalues of the covariance matrix, and
Resulting χ² distribution of a set of data vectors drawn from the original covariance.

These comparisons are shown in Figs. 6–8. For model (86), the term Cov^PBC(ρ_i, ρ_j) is taken from the measured covariance of set 𝒝. Figures 6 and 7 show that these two prescriptions give a good account of the leading behavior of the covariance matrix. The conclusion is quite sensitive for the choice of r_max for prescription (85). On the other hand, there is no free parameter that can be adjusted for model (86). Interestingly, Fig. 7 shows that the PDF variance also departs significantly from the large-scale term. The first eigenvector reproduces the functional form of the large-scale density-bias functions very faithfully.

Fig. 6.

Measured variance of the density PDF, i.e., diagonal elements of the covariance matrix, in sets 𝒜 and comparisons with proposed approximate forms. The yellow line and symbols are the results obtained in the numerical experiments. The dot-dashed line is the prediction derived from relation (86), and the dashed gray line shows the prediction from Eq. (85). The dot-dashed black lines correspond to the large-scale contributions.

Fig. 7.

Behavior of the first eigenvector with the same color-coding as in Fig. 6. The dashed black lines are the large-scale prediction, b_#(ρ_i)P(ρ_i) appropriately normalized. The size of the data vector is 30.

Fig. 8.

Performances of the approximate forms of the covariance matrix in terms of rigenvalues and χ²-distributions. Top panel: eigenvalues of the covariance matrices (rebinned into six bins) compared to what can be obtained from the proposed approximate forms; same color-coding as for Fig. 6. The χ² distributions are shown in the bottom panel. Model (86) reproduces the very same χ² distributions. Model (85), in gray, is not as accurate and tends to slightly overestimate the χ². This latter behavior is amplified when a larger number of bins is used.

The last two criteria are designed to verify that the reconstructed covariances also capture the subleading behavior of the matrix and can eventually be safely inverted and used as a model of likelihood. To avoid numerical uncertainties and make the comparison tractable, we chose to reduce the binning to six bins (through a rebinning of the histograms and densities ranging from 0.5 to 6.5). The resulting eigenvalues are shown in the top panel of Fig. 8. It shows that the eigenvalues decrease rapidly in amplitude, suggesting that the eigendirections are well sequenced and that the approximate form captures their values rather accurately. Form (86) in particular reproduces all six eigenvalues almost exactly.

Finally, χ² distributions were computed from a set of random values $P_{i}^{e x}$ $P_{i} ^{\ex}$ drawn in each case from a Gaussian likelihood built from the measured covariance (with six bins). The values of $χ^{2} (P_{i}^{e x})$ $\chi^{2}(P_{i} ^{\ex})$ were then computed for each data vector, and their histogram was computed from each of the proposed models (including the original model for reference),

$\begin{matrix} χ_{model}^{2} (P_{i}^{ex}) = \frac{1}{2} \sum_{ij} N_{model}^{ij} P_{i}^{ex} P_{j}^{ex}, \end{matrix}$ $\begin{aligned} \chi _{\mathrm{model}}^{2}(P_{i} ^{\mathrm{ex}})=\frac{1}{2}\sum _{ij}\mathcal{N}_{\mathrm{model}}^{ij}P_{i} ^{\mathrm{ex}}P_{j} ^{\mathrm{ex}}, \end{aligned}$ (87)

where $N_{model}^{ij}$ ${{\cal N}}_{{\text{ model}}}^{ij}$ is the inverse of the covariance matrix, either computed from Eq. (85) or from Eq. (86). For the original model, the expected distribution of the χ² values is then expected to be precisely that of a χ² distribution with six degrees of freedom. This is indeed what is almost exactly obtained for model (86). Results obtained from prescription (85) are not quite as good. This is expected as the short-distance effects are estimated rather crudely in Eq. (85). The performance of this prescription deteriorates when the dimension of the data vector (i.e., the number of bins) increases.

6. Conclusions and lessons

We presented key relations that give the large-scale behavior of the joint PDF, and hence the leading behavior of the covariance matrix of the density PDF. These contributing terms do not give the expression of a covariance matrix that can be used to build a likelihood function, however, as it is not invertible. Further significant contributions are found to be due to small separation effects, and an approximate form is proposed in Eq. (38). The latter is found to encapsulate most of the proximity effects, that is, it informs on the fact that nearby regions are likely to be correlated. They also give an indication on the minimal grid size that can be used the maximum bin size that can be used without information loss for a given bin width.

We then used a toy model for which numerical experiments can easily be performed and for which the exact PDF and large-scale covariance can be derived. It allows us to evaluate the efficiency of approximate schemes precisely. The conclusions of these comparisons are listed below.

The theoretical forms Eqs. (19), (29), and (34) give the leading-order expression of the covariance elements when supersample effects are taken into account. It gives an accurate prediction of the leading eigenvalue and eigendirection of the covariance matrix.
Whether subdominant effects can be accounted for by subsequent terms depends on the behavior of the two-point function: if the rms of the two-point function is dominated by large separations, then next-to-leading-order effect need to be taken into account; otherwise, short-distance effects will be the dominant contributor.
In case short-distance effects dominate, the covariance matrix can be accessed from small simulations provided the relevant dominant large-scale contributions are added.

This suggests that in realistic situations, the supersample effects, that is, the effects due to modes whose wavelength is larger than the size of the survey, have limited impact on the structure of the covariance matrix and that they can be captured by the only leading large-scale contribution. This is supported by a further analysis of the behavior of ξ(r_d) in realistic cosmological settings. For the standard model of cosmology (as derived from cosmic microwave background observations, Planck Collaboration VI 2020), the behavior of the matter correlation function can be derived. This is illustrated in Fig. 9, which illustrates the scales that are the main contributors to the first two moments of the two-point correlation function. Whether in 2D or in 3D, the first moment is dominated by large-scale contributions, whereas the second moment is dominated by small-scale contributions.

Fig. 9.

Scale dependence of the matter correlation functions for a realistic cosmological model (cosmological parameters derived from Plank, Planck Collaboration VI 2020) for the 3D density and the projected density (for a uniformly sampled survey with a depth of about 800 h⁻¹Mpc between z = 0.75 and z = 1.25). The top panel shows $r_{d}^{3} ξ (r_{d})$ $r_{d}^{3}\xi(r_{d})$ (solid blue line) and $r_{d}^{3} ξ^{2} (r_{d})$ $r_{d}^{3}\xi^{2}(r_{d})$ (dashed red line) for the 3D density field, and the bottom panel shows $θ_{d}^{2} ξ (θ_{d})$ $\theta^{2}_{d}\xi(\theta_{d})$ and $θ_{d}^{2} ξ^{2} (θ_{d})$ $\theta^{2}_{d}\xi^{2}(\theta_{d})$ for the projected density. In both cases, the average value of the first moment of the two-point correlation function is dominated by large-distance contributions, whereas short-distance contributions dominate the second moment, assuming survey sizes of about 100 h⁻¹ Mpc or above.

In the context of this study, we assumed that the measured P_i were Gaussian distributed. Although it is difficult to assess the accuracy of this hypothesis, the structure uncovered in Sect. 3 can be used to make such an attempt. In tree models, higher-order expressions of the joint density PDFs are expected to preserve the tree structure; see Bernardeau & Schaeffer (1999). The connected part of the three-point joint density PDF is then expected to take the form

$\begin{matrix} Cov (ρ_{i}, ρ_{j}, ρ_{k}) \\ = b_{2} (ρ_{i}) P (ρ_{i}) {\bar{ξ}}_{s} b (ρ_{j}) P (ρ_{j}) {\bar{ξ}}_{s} b (ρ_{k}) P (ρ_{k}) + sym ., \end{matrix}$ $\begin{aligned}&\mathrm{Cov}(\rho _{i},\rho _{j},\rho _{k})\nonumber \\&\quad \quad = b_{2}(\rho _{i})P(\rho _{i})\,\overline{\xi }_{s}\,b(\rho _{j})P(\rho _{j})\,\overline{\xi }_{s}\,b(\rho _{k})P(\rho _{k})+\mathrm{sym.}, \end{aligned}$ (88)

where b₂(ρ) is the two-line bias function of amplitude similar to b²(ρ). This implies in particular that the third-order cumulant is about $b {(ρ)}^{4} P {(ρ)}^{3} {\bar{ξ}}_{s}^{2}$ $b(\rho)^{4}P(\rho)^{3}{\overline{\xi}_{s}}^{2}$ , much smaller than ${[b {(ρ)}^{2} P {(ρ)}^{2} {\bar{ξ}}_{s}]}^{3 / 2}$ $\left[b(\rho)^{2}P(\rho)^{2}{\overline{\xi}_{s}}\right]^{3/2}$ , making the distribution of the measured values of P(ρ) (quasi-) Gaussian distributed. There might be some combination of ρ_i and values of ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ , however, for which a higher-order term could play a role in the expression of the likelihood function.

For the application of these formulae in practical cases, some limitations have to be noted. We list them below.

In the proposed form, the fact that in practice, PDFs are generally measured on a grid, that is, on a finite set of locations, is not taken into account. For instance, the exclusion of nonoverlapping cells is not considered. this is expected to introduce additional noise in the PDF estimates. The covariance matrix for these constructions cannot then be derived from general formulae (6) even when the integral in r_d is restricted above a given threshold.
Relation (38) has been derived in a specific regime (using saddle point approximations) for tree hierarchical models. They are expected to capture the phenomenon at play for “typical” values of the densities, but they may not perform so well in the rare event tails (the exception being the minimum model, for which it is exact). Further checks of the validity of (38) should therefore certainly be done.
The general formulae (19), (29), and (34) are valid for any type of filtering schemes, even for a compensated filter. This is not the case for relation (38). The proximity effects for compensated filters ought to be considered specifically.
Prescription (86) is found to give a very precise account of the properties of the covariance matrix. It is based on the proposition that large-scale (supersample) effects can be added separately from the proximity effects and that the latter can be evaluated with small-scale mocks in which supersample effects are absent (with periodic boundary conditions). This is not an exact result, however,. It relies in particular on the fact that the r.m.s. of the ξ_s is dominated by scales much smaller than the sample size.
Prescription (85) is less solid. It can be used for a quick assessment of the different contributing terms, or to build fully invertible covariance matrices, but it is unlikely to give reliable predictions at the χ² level.

In all cases, prescriptions (85) and (86) can be the starting point of a more precise evaluation of the covariance from specific numerical experiments that can complement its evaluation following the approach presented in Friedrich & Eifler (2018), for instance. The authors also showed that some strategies could be adopted to limit the number of realizations required to reach a specific precision. This point is not discussed here.

¹

This is an ideal estimate in the sense that μ is evaluated in an infinite number of locations. We therefore neglect here the impact of measuring μ on a finite number of locations when evaluating P_i. Regarding this aspect, a specific derivation that takes a finite number of measurements into account can be found in Codis et al. (2016a).

²

Whether this is a correct assumption is difficult to assess in general. It probably depends on the detailed properties of the setting. The Conclusion section contains further comments on this aspect.

³

For a square survey of unit size (with nonperiodic boundary conditions), the distance distribution function P_s(r_d) is given by

$\begin{matrix} P_{s} (r_{d}) = {\begin{matrix} 2 r_{d} ((r_{d} - 4) r_{d} + π) & 0 < r_{d} < 1 \\ - 2 r_{d} (2_{d} + r_{d}^{2} - 4 \sqrt{r_{d}^{2} - 1} \\ - 2 {sec}^{- 1} (\frac{r_{d}}{\sqrt{r_{d}^{2} - 1}}) + 2 {sec}^{- 1} (r_{d})) & 1 < r_{d} < \sqrt{2}, \end{matrix} \end{matrix}$ $\begin{aligned} P_{s}(r_{d})= {\left\{ \begin{array}{ll} 2 r_{d} ((r_{d}-4) r_{d}+\pi )&0<r_{d}<1 \\ -2 r_{d} \Big (2_{d}+r_{d}^2-4 \sqrt{r_{d}^2-1}&\nonumber \\ -2 \sec ^{-1}\left(\frac{r_{d}}{\sqrt{r_{d}^2-1}}\right)+2 \sec ^{-1}(r_{d})\Big )&1<r_{d}<\sqrt{2}, \end{array}\right.} \end{aligned}$

as can be obtained after integrating over three of four of the position coordinates.

⁴

This is not necessarily so, as exemplified in Carron (2011), Carron & Neyrinck (2012).

⁵

In the minimal tree model, it is possible to compute these terms in the so-called mean-field approximation (see appendix), but they do not lead to a positive definite covariance matrix and therefore cannot be the sole, or dominant, contribution to the covariances.

⁶

This limited form would induce a minimum contribution to Cov(ρ_i, ρ_i) given by Δ_{ρ_i}/℘(ρ_i), where Δ_{ρ_i} is the bin size in density.

⁷

Perturbation theory results do not exactly follow this construction as vertices are then dependent on the geometry of the incoming lines. However, in this case, Q_p values are indeed obtained from a product of vertices.

⁸

it is minimal in the sense that it can be shown that ν₂ cannot be smaller than 1/2 in the strongly nonlinear regime (Peebles 1980).

⁹

Although the number of samples is large, the number of haloes contained in each sample is finite leading to discretization errors in the estimate of the covariance. Estimate of the minimal number of realizations required to make such estimates is beyond the scope of this paper.

¹⁰

The formulae are written here for a top-hat profile, but can be extended to arbitrary profiles.

¹¹

It takes the very same form as the stationary equation in the context of the large deviation principle, although the results here do not correspond to this regime.

¹²

A quicker approach is to view φ₀(λ) as a function of the “leaf weight”, that is, the value of ν₁; φ₁(λ), φ₂(λ) are then obtained by successive derivatives of φ₀(λ) with respect to ν₁.

¹³

it is possible to fully solve the consistency relations in case of two cells; the conclusions remain unchanged.

Acknowledgments

The author of this article is indebted to Cora Uhlemann, Alex Gough, Oliver Friedrich, Sandrine Codis, Aoife Boyle and Alexandre Barthelemy for many comments and careful examination of the preparatory notes of this manuscript.

References

Abbott, T. M. C., Abdalla, F. B., Alarcon, A., et al. 2018, Phys. Rev. D, 98, 043526 [NASA ADS] [CrossRef] [Google Scholar]
Amendola, L., Appleby, S., Avgoustidis, A., et al. 2018, Liv. Rev. Rel., 21, 2 [Google Scholar]
Balian, R., & Schaeffer, R. 1989, A&A, 220, 1 [NASA ADS] [Google Scholar]
Barthelemy, A., Codis, S., & Bernardeau, F. 2021, MNRAS, 503, 5204 [Google Scholar]
Bernardeau, F. 1992, ApJ, 392, 1 [NASA ADS] [CrossRef] [Google Scholar]
Bernardeau, F. 1996, A&A, 312, 11 [NASA ADS] [Google Scholar]
Bernardeau, F. 2013, ArXiv e-prints [arXiv:1311.2724] [Google Scholar]
Bernardeau, F., & Reimberg, P. 2016, Phys. Rev. D, 94, 063520 [NASA ADS] [CrossRef] [Google Scholar]
Bernardeau, F., & Schaeffer, R. 1992, A&A, 255, 1 [NASA ADS] [Google Scholar]
Bernardeau, F., & Schaeffer, R. 1999, A&A, 349, 697 [NASA ADS] [Google Scholar]
Bernardeau, F., & Valageas, P. 2000, A&A, 364, 1 [NASA ADS] [Google Scholar]
Bernardeau, F., Pichon, C., & Codis, S. 2014, Phys. Rev. D, 90, 103519 [NASA ADS] [CrossRef] [Google Scholar]
Boyle, A., Uhlemann, C., Friedrich, O., et al. 2021, MNRAS, 505, 2886 [NASA ADS] [CrossRef] [Google Scholar]
Brouwer, M. M., Demchenko, V., Harnois-Déraps, J., et al. 2018, MNRAS, 481, 5189 [NASA ADS] [CrossRef] [Google Scholar]
Carron, J. 2011, ApJ, 738, 86 [NASA ADS] [CrossRef] [Google Scholar]
Carron, J., & Neyrinck, M. C. 2012, ApJ, 750, 28 [NASA ADS] [CrossRef] [Google Scholar]
Codis, S., Pichon, C., Bernardeau, F., Uhlemann, C., & Prunet, S. 2016a, MNRAS, 460, 1549 [NASA ADS] [CrossRef] [Google Scholar]
Codis, S., Bernardeau, F., & Pichon, C. 2016b, MNRAS, 460, 1598 [NASA ADS] [CrossRef] [Google Scholar]
Colombi, S., Bouchet, F. R., & Schaeffer, R. 1995, ApJS, 96, 401 [NASA ADS] [CrossRef] [Google Scholar]
Friedrich, O., & Eifler, T. 2018, MNRAS, 473, 4150 [NASA ADS] [CrossRef] [Google Scholar]
Friedrich, O., Gruen, D., DeRose, J., et al. 2018, Phys. Rev. D, 98, 023508 [Google Scholar]
Friedrich, O., Uhlemann, C., Villaescusa-Navarro, F., et al. 2020, MNRAS, 498, 464 [NASA ADS] [CrossRef] [Google Scholar]
Fry, J. N. 1984a, ApJ, 277, L5 [NASA ADS] [CrossRef] [Google Scholar]
Fry, J. N. 1984b, ApJ, 279, 499 [NASA ADS] [CrossRef] [Google Scholar]
Gruen, D., Friedrich, O., Krause, E., et al. 2018, Phys. Rev. D, 98 [Google Scholar]
Heymans, C., Tröster, T., Asgari, M., et al. 2021, A&A, 646, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hikage, C., Oguri, M., Hamana, T., et al. 2019, PASJ, 71, 43 [Google Scholar]
Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]
Kaiser, N. 1998, ApJ, 498, 26 [NASA ADS] [CrossRef] [Google Scholar]
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]
Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]
Peebles, P. J. E. 1980, The Large-scale Structure of the Universe (Princeton University Press) [Google Scholar]
Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Repp, A., & Szapudi, I. 2020, MNRAS, 498, L125 [NASA ADS] [CrossRef] [Google Scholar]
Repp, A., & Szapudi, I. 2021, MNRAS, 500, 3631 [Google Scholar]
Schneider, P. 1996, MNRAS, 283, 837 [Google Scholar]
Szapudi, I., & Colombi, S. 1996, ApJ, 470, 131 [NASA ADS] [CrossRef] [Google Scholar]
Szapudi, I., Colombi, S., & Bernardeau, F. 1999, MNRAS, 310, 428 [NASA ADS] [CrossRef] [Google Scholar]
Takada, M., & Hu, W. 2013, Phys. Rev. D, 87 [CrossRef] [Google Scholar]
Touchette, H. 2011, ArXiv e-prints [arXiv:1106.4146] [Google Scholar]
Uhlemann, C., Friedrich, O., Villaescusa-Navarro, F., Banerjee, A., & Codis, S. 2020, MNRAS, 495, 4006 [NASA ADS] [CrossRef] [Google Scholar]
Valageas, P. 2002, A&A, 382, 412 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
White, S. D. M. 1979, MNRAS, 186, 145 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Hierarchical tree models

In hierarchical tree models, the p-point matter correlation functions are assumed to follow tree structures in the sense described in the main text. They are thus entirely defined by the two-point functions ξ(r) and the vertex-generating function ζ(τ). The exact generating function of multiple cell correlation functions can be built through simple transforms. We therefore consider a set of n cells V_i. These cells can overlap. The joint cumulants we consider are those of the average densities in cells V_i that can be expressed in terms of spatial averages¹⁰ of correlation functions,

$\begin{matrix} {⟨ ρ_{1}^{p_{1}} \dots ρ_{n}^{p_{n}} ⟩}_{c} & = \int_{V_{1}} \frac{d x_{1, 1}}{V_{1}} \dots \int_{V_{1}} \frac{d x_{1, p_{1}}}{V_{1}} \dots \\ . . . \int_{V_{n}} \frac{d x_{n, 1}}{V_{n}} \dots \int_{V_{n}} \frac{d x_{n, p_{n}}}{V_{n}} \\ \times ξ_{p} (x_{1, 1}, \dots, x_{1, p_{1}}, \dots x_{n, 1}, \dots, x_{n, p_{n}}) . \end{matrix}$ $\begin{aligned} \langle \rho _{1}^{p_{1}}\dots \rho _{n}^{p_{n}}\rangle _{c}&= \int _{V_{1}}\frac{\mathrm{d}\mathbf{x}_{1,1}}{V_{1}}\dots \int _{V_{1}}\frac{\mathrm{d}\mathbf{x}_{1,p_{1}}}{V_{1}} \dots \nonumber \\& ... \int _{V_{n}}\frac{\mathrm{d}\mathbf{x}_{n,1}}{V_{n}}\dots \int _{V_{n}}\frac{\mathrm{d}\mathbf{x}_{n,p_{n}}}{V_{n}} \nonumber \\&\times \xi _{p}(\mathbf{x}_{1,1},\dots ,\mathbf{x}_{1,p_{1}},\dots \mathbf{x}_{n,1},\dots ,\mathbf{x}_{n,p_{n}}). \end{aligned}$ (A.1)

We then wish to build the cumulant-generating function,

$\begin{matrix} φ (λ_{1}, \dots, λ_{n}) = \sum_{p_{1}, \dots, p_{n}} {⟨ ρ_{1}^{p_{1}} \dots ρ_{n}^{p_{n}} ⟩}_{c} \frac{λ_{1}^{p_{1}}}{p_{1}!} \dots \frac{λ_{n}}{p_{n}!} . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\dots ,\lambda _{n})= \sum _{p_{1},\dots ,p_{n}}\langle \rho _{1}^{p_{1}}\dots \rho _{n}^{p_{n}}\rangle _{c}\,\frac{\lambda _{1}^{p_{1}}}{p_{1}!}\dots \frac{\lambda _{n}}{p_{n}!}. \end{aligned}$ (A.2)

This function represents the generating function of (averaged) tree diagrams where λ_i counts the number of points in each cells. As shown in Bernardeau & Schaeffer (1999), this is obtained with the help of the intermediate function τ(x) solution of the consistency equation¹¹,

$\begin{matrix} τ (x) = \sum_{j} λ_{j} \int_{V_{j}} \frac{d x}{V_{j}} ξ (x, x^{'}) ζ^{'} (τ (x^{'})), \end{matrix}$ $\begin{aligned} \tau (\mathbf{x})=\sum _{j}\lambda _{j}\int _{V_{j}}\frac{\mathrm{d}\mathbf{x}}{V_{j}}\,\xi (\mathbf{x},\mathbf{x}^{\prime })\,\zeta ^{\prime }(\tau (\mathbf{x}^{\prime })), \end{aligned}$ (A.3)

and then

$\begin{matrix} φ (λ_{1}, \dots, λ_{n}) & = \sum_{j} λ_{j} \int_{V_{j}} \frac{d x}{V_{j}} ζ (τ (x)) \\ - \frac{1}{2} \sum_{j} λ_{j} \int_{V_{j}} \frac{d x}{V_{j}} τ (x) ζ^{'} (τ (x)) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\dots ,\lambda _{n})&=\sum _{j}\lambda _{j} \int _{V_{j}}\frac{\mathrm{d}\mathbf{x}}{V_{j}}\zeta (\tau (\mathbf{x})) \nonumber \\& -\frac{1}{2}\sum _{j}\lambda _{j}\int _{V_{j}}\frac{\mathrm{d}\mathbf{x}}{V_{j}}\tau (\mathbf{x})\zeta ^{\prime }(\tau (\mathbf{x})) . \end{aligned}$ (A.4)

This is an exact result based on pure combinatorics.

For cases of interest, it is possible to do a mean-field approximation that consists of assuming that τ(x) is constant within each cell. We then have the system of equations for τ_i,

$\begin{matrix} τ_{i} = \sum_{j} λ_{j} ξ_{ij} ζ^{'} (τ_{j}), \end{matrix}$ $\begin{aligned} \tau _{i}=\sum _{j}\lambda _{j}\xi _{ij}\zeta ^{\prime }(\tau _{j}), \end{aligned}$ (A.5)

where

$\begin{matrix} ξ_{ij} = \int_{V_{i}} \frac{d x_{i}}{V_{i}} \int_{V_{i}} \frac{d x_{i}}{V_{i}} ξ (x_{i}, x_{j}) \end{matrix}$ $\begin{aligned} \xi _{ij}= \int _{V_{i}}\frac{\mathrm{d}\mathbf{x}_{i}}{V_{i}} \int _{V_{i}}\frac{\mathrm{d}\mathbf{x}_{i}}{V_{i}} \xi (\mathbf{x}_{i},\mathbf{x}_{j}) \end{aligned}$ (A.6)

and

$\begin{matrix} φ (λ_{1}, \dots, λ_{n}) = \sum_{j} λ_{j} [ζ (τ_{j}) - \frac{1}{2} τ_{j} ζ^{'} (τ_{j})] . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\dots ,\lambda _{n})= \sum _{j}\lambda _{j}\left[\zeta (\tau _{j})-\frac{1}{2}\tau _{j}\zeta ^{\prime }(\tau _{j})\right]. \end{aligned}$ (A.7)

Bernardeau & Schaeffer (1999) found this to be very accurate, and we extensively use this approximation in the following, in particular for the minimal tree model.

Appendix B: Joint PDF, density-bias function in PT, and hierarchical tree models

Here we consider the joint distribution of densities in two cells whose centers are at distance d. The calculation is based on the inverse Laplace transform of the joint cumulant-generating function φ(λ₁, λ₂),

$\begin{matrix} φ (λ_{1}, λ_{2}) = \sum_{p, q} {⟨ ρ_{1}^{p} ρ_{2}^{q} ⟩}_{c} \frac{λ_{1}^{p}}{p!} \frac{λ_{2}^{q}}{q!}, \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})=\sum _{p,q}\langle \rho _{1}^{p}\rho _{2}^{q}\rangle _{c}\,\frac{\lambda _{1}^{p}}{p!}\,\frac{\lambda _{2}^{q}}{q!}, \end{aligned}$ (B.1)

where ${⟨ ρ_{1}^{p} ρ_{2}^{q} ⟩}_{c}$ $\langle\rho_{1}^{p}\rho_{2}^{q}\rangle_{c}$ are the cumulants of the local density fields. They depend on the size and distance d between the cells. We assume in particular that the density correlation function between cells ξ(d) is small compared to unity and can serve as a small parameter.

B.1. Leading order in the mean-field approximation

Expanding with respect to ξ(d) then leads to the following form:

$\begin{matrix} φ (λ_{1}, λ_{2}) = φ_{0} (λ_{1}) + φ_{0} (λ_{2}) + ξ (d) φ_{1} (λ_{1}) φ_{1} (λ_{2}), \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})=\varphi _{0}(\lambda _{1})+\varphi _{0}(\lambda _{2})+\xi (d)\varphi _{1}(\lambda _{1})\varphi _{1}(\lambda _{2}), \end{aligned}$ (B.2)

that is, a factorization of the linear term in ξ(d). This can explicitly be shown in case of tree models (as described in the main text). This is also the case in perturbation theory.

In case of the tree models, we have

$\begin{matrix} φ_{1} (λ) = \frac{τ (λ)}{\bar{ξ}}, \end{matrix}$ $\begin{aligned} \varphi _{1}(\lambda )=\frac{\tau (\lambda )}{\overline{\xi }}, \end{aligned}$ (B.3)

where $\bar{ξ}$ ${\overline{\xi}}$ is the averaged correlation function within the cells. This is obtained assuming a mean-field approximation. We expect subleading corrections when d becomes comparable to the size of the cells.

B.2. Extending the previous case to the three variable case

In addition to the two variables ρ₁ and ρ₂, we introduce here the variable ρ_s, which is the density within the sample. We assume that the correlation functions are built with the same model. Here the small parameter is the correlation between two cells V₁ and V₂ (at positions x₁ and x₂) and the correlation function at sample size. It is natural in the context we are interested in to assume that these two quantities are on the same order.

We further assume we can use the mean-field approximation for the two cells V₁ and V₂. This is not a priori the case for the density in the whole sample, however. We therefore derive the results without this approximation. The general expression is then

$\begin{matrix} φ (λ_{s}, λ_{1}, λ_{2}) = & λ_{s} \int d x_{s} (ζ (τ (x_{s})) - \frac{1}{2} τ (x_{s}) ζ^{'} (τ (x_{s}))) \\ + λ_{1} [ζ (τ_{1}) - \frac{1}{2} τ_{1} ζ^{'} (τ_{1})] + λ_{2} [ζ (τ_{2}) - \frac{1}{2} τ_{2} ζ^{'} (τ_{2})], \end{matrix}$ $\begin{aligned} \varphi (\lambda _{s},\lambda _{1},\lambda _{2})=&\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\left(\zeta (\tau (\mathbf{x}_{s}))-\frac{1}{2}\tau (\mathbf{x}_{s})\zeta ^{\prime }(\tau (\mathbf{x}_{s}))\right)\nonumber \\&+\lambda _{1}\left[\zeta (\tau _{1})-\frac{1}{2}\tau _{1}\zeta ^{\prime }(\tau _{1})\right] +\lambda _{2}\left[\zeta (\tau _{2})-\frac{1}{2}\tau _{2}\zeta ^{\prime }(\tau _{2})\right], \end{aligned}$ (B.4)

with the consistency relations

$\begin{matrix} τ (x_{s}) & = λ_{s} \int d x_{s}^{'} ξ (x_{s}, x_{s}^{'}) ζ^{'} (τ (x_{s}^{'})) \\ + λ_{1} ξ (x_{s}, x_{1}) ζ^{'} (τ_{1}) + λ_{2} ξ (x_{s}, x_{2}) ζ^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau (\mathbf{x}_{s})&=\lambda _{s}\int \mathrm{d}\mathbf{x}^{\prime }_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}^{\prime }_{s})\,\zeta ^{\prime }(\tau (\mathbf{x}^{\prime }_{s})) \nonumber \\& +\lambda _{1}\xi (\mathbf{x}_{s},\mathbf{x}_{1})\zeta ^{\prime }(\tau _{1}) +\lambda _{2}\xi (\mathbf{x}_{s},\mathbf{x}_{2})\zeta ^{\prime }(\tau _{2})\end{aligned}$ (B.5)

$\begin{matrix} τ_{1} & = λ_{s} \int d x_{s} ξ (x_{1}, x_{s}) ζ^{'} (τ (x_{s})) \\ + λ_{1} \bar{ξ} ζ^{'} (τ_{1}) + λ_{2} ξ_{12} ζ^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau _{1}&=\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{1},\mathbf{x}_{s})\,\zeta ^{\prime }(\tau (\mathbf{x}_{s})) \nonumber \\& +\lambda _{1}\,\overline{\xi }\,\zeta ^{\prime }(\tau _{1}) +\lambda _{2}\,\xi _{12}\,\zeta ^{\prime }(\tau _{2})\end{aligned}$ (B.6)

$\begin{matrix} τ_{2} & = λ_{s} \int d x_{s} ξ (x_{2}, x_{s}) ζ^{'} (τ (x_{s})) \\ + λ_{1} ξ_{12} ζ^{'} (τ_{1}) + λ_{2} \bar{ξ} ζ^{'} (τ_{2}) . \end{matrix}$ $\begin{aligned} \tau _{2}&=\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{2},\mathbf{x}_{s})\,\zeta ^{\prime }(\tau (\mathbf{x}_{s})) \nonumber \\& +\lambda _{1}\,\xi _{12}\,\zeta ^{\prime }(\tau _{1}) +\lambda _{2}\,\overline{\xi }\,\zeta ^{\prime }(\tau _{2}) . \end{aligned}$ (B.7)

We therefore derive the expression of φ(λ_s, λ₁, λ₂) up to linear order jointly in $ξ (x_{s}, x_{s}^{'})$ $\xi({\bf x}_{s},{\bf x}^\prime_{s})$ , ξ(x_s, x₁), ξ(x_s, x₁) and ξ₁₂ ≡ ξ(x₁, x₂). At zeroth order, τ(x_s) vanishes and

$\begin{matrix} τ_{1}^{(0)} = λ_{1} \bar{ξ} ζ^{'} (τ_{1}), \end{matrix}$ $\begin{aligned} \tau _{1}^{(0)}=\lambda _{1}\,\overline{\xi }\zeta ^{\prime }(\tau _{1}), \end{aligned}$ (B.8)

with a similar relation for $τ_{2}^{(0)}$ $\tau_{2}^{(0)}$ . At linear order, we have

$\begin{matrix} τ^{(1)} (x_{s}) & = λ_{s} \int d x_{s}^{'} ξ (x_{s}, x_{s}^{'}) \\ + λ_{1} ξ (x_{s}, x_{1}) ζ^{'} (τ_{1}^{(0)}) + λ_{2} ξ (x_{s}, x_{2}) ζ^{'} (τ_{2}^{(0)}) \end{matrix}$ $\begin{aligned} \tau ^{(1)}(\mathbf{x}_{s})&\!\!=\!\!\lambda _{s}\int \mathrm{d}\mathbf{x}^{\prime }_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}^{\prime }_{s}) \nonumber \\& +\lambda _{1}\xi (\mathbf{x}_{s},\mathbf{x}_{1})\zeta ^{\prime }(\tau _{1}^{(0)}) +\lambda _{2}\xi (\mathbf{x}_{s},\mathbf{x}_{2})\zeta ^{\prime }(\tau _{2}^{(0)})\end{aligned}$ (B.9)

$\begin{matrix} F (τ_{1}^{(0)}) τ_{1}^{(1)} & = λ_{s} \int d x_{s} ξ (x_{1}, x_{s}) + λ_{2} ξ_{12} ζ^{'} (τ_{2}^{(0)}) \end{matrix}$ $\begin{aligned} \mathcal{F}(\tau _{1}^{(0)})\tau _{1}^{(1)}&\!\!=\!\!\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{1},\mathbf{x}_{s})+\lambda _{2}\,\xi _{12}\,\zeta ^{\prime }(\tau _{2}^{(0)})\end{aligned}$ (B.10)

$\begin{matrix} F (τ_{2}^{(0)}) τ_{2}^{(1)} & = λ_{s} \int d x_{s} ξ (x_{2}, x_{s}) + λ_{1} ξ_{12} ζ^{'} (τ_{1}^{(0)}), \end{matrix}$ $\begin{aligned} \mathcal{F}(\tau _{2}^{(0)})\tau _{2}^{(1)}&\!\!=\!\!\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{2},\mathbf{x}_{s})+\lambda _{1}\,\xi _{12}\,\zeta ^{\prime }(\tau _{1}^{(0)}), \end{aligned}$ (B.11)

where

$\begin{matrix} F (τ) \equiv (1 - \frac{τ ζ^{″} (τ)}{ζ^{'} (τ)}) . \end{matrix}$ $\begin{aligned} \mathcal{F}(\tau )\equiv \left(1-\frac{\tau \zeta ^{\prime \prime }(\tau )}{\zeta ^{\prime }(\tau )}\right). \end{aligned}$ (B.12)

The resulting cumulant-generating function reads

$\begin{matrix} φ (λ_{s}, λ_{1}, λ_{2}) = & λ_{s} + λ_{1} [ζ (τ_{1}^{(0)}) - \frac{1}{2} τ_{1}^{(0)} ζ^{'} (τ_{1}^{(0)})] \\ + λ_{2} [ζ (τ_{2}^{(0)}) - \frac{1}{2} τ_{2}^{(0)} ζ^{'} (τ_{2}^{(0)})] + \frac{λ_{s}}{2} \int d x_{s} τ^{(1)} (x_{s}) \\ + \frac{λ_{1}}{2} ζ^{'} (τ_{1}^{(0)}) F (τ_{1}^{(0)}) τ_{1}^{(1)} + \frac{λ_{2}}{2} ζ^{'} (τ_{2}^{(0)}) F (τ_{2}^{(0)}) τ_{2}^{(1)}, \end{matrix}$ $\begin{aligned} \varphi (\lambda _{s},\lambda _{1},\lambda _{2})=&\lambda _{s} +\lambda _{1}\left[\zeta (\tau _{1}^{(0)})-\frac{1}{2}\tau _{1}^{(0)}\zeta ^{\prime }(\tau _{1}^{(0)})\right] \nonumber \\& +\lambda _{2}\left[\zeta (\tau _{2}^{(0)})-\frac{1}{2}\tau _{2}^{(0)}\zeta ^{\prime }(\tau _{2}^{(0)})\right]+\frac{\lambda _{s}}{2}\int \mathrm{d}\mathbf{x}_{s}\,\tau ^{(1)}(\mathbf{x}_{s}) \nonumber \\& +\frac{\lambda _{1}}{2}\zeta ^{\prime }(\tau _{1}^{(0)})\mathcal{F}(\tau _{1}^{(0)})\tau _{1}^{(1)} +\frac{\lambda _{2}}{2}\zeta ^{\prime }(\tau _{2}^{(0)})\mathcal{F}(\tau _{2}^{(0)})\tau _{2}^{(1)}, \end{aligned}$ (B.13)

where the first three terms are at zeroth order and the last three are at linear order. Using the previous expression, we obtain

$\begin{matrix} φ (λ_{s}, λ_{1}, λ_{2}) = & λ_{s} + φ_{0} (λ_{1}) + φ_{0} (λ_{2}) \\ + \frac{λ_{s}^{2}}{2} \int d x_{s} d x_{s}^{'} ξ (x_{s}, x_{s}^{'}) + λ_{s} \int d x_{s} ξ (x_{s}, x_{1}) φ_{1} (λ_{1}) \\ + λ_{s} \int d x_{s} ξ (x_{s}, x_{2}) φ_{1} (λ_{2}) + φ_{1} (λ_{1}) ξ_{12} φ_{1} (λ_{2}) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{s},\lambda _{1},\lambda _{2})=&\lambda _{s}+\varphi _{0}(\lambda _{1})+\varphi _{0}(\lambda _{2}) \nonumber \\&+\frac{\lambda _{s}^{2}}{2}\int \mathrm{d}\mathbf{x}_{s}\mathrm{d}\mathbf{x}^{\prime }_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}^{\prime }_{s}) +\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{1})\,\varphi _{1}(\lambda _{1}) \nonumber \\&+\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{2})\,\varphi _{1}(\lambda _{2}) +\varphi _{1}(\lambda _{1})\,\xi _{12}\,\varphi _{1}(\lambda _{2}). \end{aligned}$ (B.14)

This relation is used to derive the expression of the sample bias functions in the next section.

B.3. Second order in the mean-field approximation

Results of Sect. B.1 can naturally be extended to any order in the cross-cell correlation function in the context of the tree-hierarchical models (as illustrated on Fig B.1). Up to second order, it takes the form

$\begin{matrix} φ (λ_{1}, λ_{2}) & = φ_{0} (λ_{1}) + φ_{0} (λ_{2}) + φ_{1} (λ_{1}) ξ_{12} φ_{1} (λ_{2}) \\ + \frac{1}{2} φ_{1}^{2} (λ_{1}) ξ_{12}^{2} φ_{2} (λ_{2}) + \frac{1}{2} φ_{2} (λ_{1}) ξ_{12}^{2} φ_{1}^{2} (λ_{2}), \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})&= \varphi _{0}(\lambda _{1})+\varphi _{0}(\lambda _{2})+ \varphi _{1}(\lambda _{1})\,\xi _{12}\,\varphi _{1}(\lambda _{2}) \nonumber \\& +\frac{1}{2}\varphi _{1}^{2}(\lambda _{1})\,\xi _{12}^{2}\,\varphi _{2}(\lambda _{2}) +\frac{1}{2}\varphi _{2}(\lambda _{1})\,\xi _{12}^{2}\,\varphi _{1}^{2}(\lambda _{2}), \end{aligned}$ (B.15)

Fig. B.1.

Diagrammatic visualization of the bias functions. The function φ₀(λ) is the generating function of all trees within one cell, φ₁(λ) of all trees within one cell with one external lines, and φ₂(λ) with two external lines. The resulting connected diagrams up to second order in ξ₁₂ are thus those presented here. Two φ₂ generating functions cannot be conntected as that would induce a loop contribution. This reflects the underlying tree structure.

where the function φ₂(λ) takes the form

$\begin{matrix} φ_{2} (λ) = \frac{λ ζ^{″} (τ)}{1 - τ ζ^{″} (τ) / ζ^{'} (τ)} . \end{matrix}$ $\begin{aligned} \varphi _{2}(\lambda )=\frac{\lambda \zeta ^{\prime \prime }(\tau )}{1-\tau \zeta ^{\prime \prime }(\tau )/\zeta ^{\prime }(\tau )}. \end{aligned}$ (B.16)

This last expression can be directly obtained through a perturbative expansion as presented in the previous subsection¹².

In case of the minimal tree model, this perturbative expansion can be directly compared with exact mean-field results¹³. This is shown in Fig. B.2. It shows that for a large regime in λ, the relation (B.15) provides a very accurate description of the joint cumulant-generating function down to a distance corresponding to overlapping cells. For overlapping cells, relation (B.15) continues to be accurate except for high values of λ. In general,

Fig. B.2.

Joint CGF as a function of d and for different values of λ₁, λ₂: from bottom to top, λ₁ = λ₂ = −0.1, λ₁ = λ₂ = 0.2, λ₁ = λ₂ = 0.5. The solid blue lines correspond to the two-cell mean-field expression, (C.13). The predictions given in Eqs. (B.2) and (B.15) are shown as dashed red lines and dotted black lines. The shaded area is the region of overlapping cells.

$\begin{matrix} φ (λ_{1}, λ_{2}) \to φ_{0} (λ_{1} + λ_{2}) \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})\rightarrow \varphi _{0}(\lambda _{1}+\lambda _{2}) \end{aligned}$ (B.17)

when d → 0 (more accurate results in case of the minimal tree model are given in Appendix D.).

Using (B.15), we then derive corrective perturbative terms to the joint density PDF. More specifically, we have

$\begin{matrix} P (ρ_{i}, ρ_{j}) & = P (ρ_{i}) P (ρ_{j}) [1 + b_{1} (ρ_{i}) ξ_{12} b_{1} (ρ_{j}) \\ + \frac{1}{2} b_{2} (ρ_{i}) ξ_{12}^{2} b_{2} (ρ_{j}) + \frac{1}{2} b_{2} (ρ_{i}) ξ_{12}^{2} q_{1} (ρ_{j}) \\ + \frac{1}{2} q_{1} (ρ_{i}) ξ_{12}^{2} b_{2} (ρ_{j})], \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j})&=P(\rho _{i})P(\rho _{j})\left[ 1+b_{1}(\rho _{i})\,\xi _{12}\,b_{1}(\rho _{j}) \right.\nonumber \\&\left. +\frac{1}{2}b_{2}(\rho _{i})\,\xi _{12}^{2}\,b_{2}(\rho _{j})+ \frac{1}{2}b_{2}(\rho _{i})\,\xi _{12}^{2}\,q_{1}(\rho _{j}) \right.\nonumber \\&\left. +\frac{1}{2}q_{1}(\rho _{i})\,\xi _{12}^{2}\,b_{2}(\rho _{j}) \right], \end{aligned}$ (B.18)

where

$\begin{matrix} b_{2} (ρ_{i}) P (ρ_{i}) & = \int \frac{d λ}{2 π i} {[φ_{1} (λ)]}^{2} exp [- λ ρ_{i} + φ (λ)] \end{matrix}$ $\begin{aligned} b_{2}(\rho _{i})P(\rho _{i})&=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\left[\varphi _{1}(\lambda )\right]^{2}\exp \left[-\lambda \rho _{i}+\varphi (\lambda )\right]\end{aligned}$ (B.19)

$\begin{matrix} q_{1} (ρ_{i}) P (ρ_{i}) & = \int \frac{d λ}{2 π i} φ_{2} (λ) exp [- λ ρ_{i} + φ (λ)] . \end{matrix}$ $\begin{aligned} q_{1}(\rho _{i})P(\rho _{i})&=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\,\varphi _{2}(\lambda )\exp \left[-\lambda \rho _{i}+\varphi (\lambda )\right] . \end{aligned}$ (B.20)

For a sample with periodic boundary conditions, the average of ξ₁₂ vanishes, which a priori makes the other terms the leading contributors to the covariance elements. Equation (B.18) can be written as a sum of symmetric factorized terms,

$\begin{matrix} P (ρ_{i}, ρ_{j}) & = P (ρ_{i}) P (ρ_{j}) [1 + b_{1} (ρ_{i}) ξ_{12} b_{1} (ρ_{j}) \\ + \frac{1}{2} (b_{2} (ρ_{i}) + q_{1} (ρ_{i})) ξ_{12}^{2} (b_{2} (ρ_{j}) + q_{1} (ρ_{j})) \\ - \frac{1}{2} q_{1} (ρ_{i}) ξ_{12}^{2} q_{1} (ρ_{j})], \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j})&=P(\rho _{i})P(\rho _{j})\left[ 1+b_{1}(\rho _{i})\,\xi _{12}\,b_{1}(\rho _{j}) \right.\nonumber \\&\left. +\frac{1}{2}\left(b_{2}(\rho _{i})+q_{1}(\rho _{i})\right)\,\xi _{12}^{2}\,\left(b_{2}(\rho _{j}) +q_{1}(\rho _{j})\right) \right.\nonumber \\&\left. -\frac{1}{2}q_{1}(\rho _{i})\,\xi _{12}^{2}\,q_{1}(\rho _{j}) \right], \end{aligned}$ (B.21)

showing the eigenstructure of the resulting matrix and showing that it defines three different eigendirections at most.

B.4. Relative density joint PDFs and bias functions

We wish to compute the joint PDF of the density when expressed in terms of the survey average density ρ_s. In order to do so, we consider the joint density P(ρ_s, ρ_i, ρ_j), where ρ_s is the density in the sample and ρ_i and ρ_j are the densities in two cells at distance d.

We wish to compute the joint probability distribution function of ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$ and ${\hat{ρ}}_{j}$ ${\hat{\rho}}_{j}$ , defined as

$\begin{matrix} {\hat{ρ}}_{i} = \frac{ρ_{i}}{ρ_{s}}, \end{matrix}$ $\begin{aligned} \hat{\rho }_{i}=\frac{\rho _{i}}{\rho _{s}}, \end{aligned}$ (B.22)

and the joint distribution functions of ${\bar{ρ}}_{i}$ ${\overline{\rho}}_{i}$ and ${\bar{ρ}}_{j}$ ${\overline{\rho}}_{j}$ , defined as

$\begin{matrix} {\bar{ρ}}_{i} = ρ_{i} - ρ_{s} + 1 . \end{matrix}$ $\begin{aligned} \overline{\rho }_{i}=\rho _{i}-\rho _{s}+1. \end{aligned}$ (B.23)

From these changes of variables, we have

$\begin{matrix} P ({\hat{ρ}}_{i}) = \int d ρ_{s} ρ_{s} P (ρ_{s}, {\hat{ρ}}_{i} ρ_{s}) \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i})=\int \mathrm{d}\rho _{s}\,\rho _{s}\,P(\rho _{s},\hat{\rho }_{i}\rho _{s}) \end{aligned}$ (B.24)

and

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) = \int d ρ_{s} ρ_{s}^{2} P (ρ_{s}, {\hat{ρ}}_{i} ρ_{s}, {\hat{ρ}}_{j} ρ_{s}) . \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i},\hat{\rho }_{j})=\int \mathrm{d}\rho _{s}\,\rho _{s}^{2}\,P(\rho _{s},\hat{\rho }_{i}\rho _{s},\hat{\rho }_{j}\rho _{s}). \end{aligned}$ (B.25)

Similarly, we also have

$\begin{matrix} P ({\bar{ρ}}_{i}) = \int d ρ_{s} P (ρ_{s}, {\bar{ρ}}_{i} + ρ_{s} - 1) \end{matrix}$ $\begin{aligned} P(\overline{\rho }_{i})=\int \mathrm{d}\rho _{s}\,P(\rho _{s},\overline{\rho }_{i}+\rho _{s}-1) \end{aligned}$ (B.26)

and

$\begin{matrix} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) = \int d ρ_{s} P (ρ_{s}, {\bar{ρ}}_{i} + ρ_{s} - 1, {\bar{ρ}}_{j} + ρ_{s} - 1) . \end{matrix}$ $\begin{aligned} P(\overline{\rho }_{i},\overline{\rho }_{j})=\int \mathrm{d}\rho _{s}\,P(\rho _{s},\overline{\rho }_{i}+\rho _{s}-1,\overline{\rho }_{j}+\rho _{s}-1). \end{aligned}$ (B.27)

We continue the calculations by expressing the joint PDF with the help of inverse Laplace transforms,

$\begin{matrix} P (ρ_{s}, ρ_{i}) & = \int \frac{d λ_{s}}{2 π i} \frac{d λ}{2 π i} \\ \times exp [- λ_{s} ρ_{s} - λ ρ_{i} + φ (λ_{s}, λ)] \end{matrix}$ $\begin{aligned} P(\rho _{s},\rho _{i})&=\int \frac{\mathrm{d}\lambda _{s}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\,\nonumber \\&\times \exp \left[ -\lambda _{s}\rho _{s}-\lambda \rho _{i}+\varphi (\lambda _{s},\lambda )\right]\end{aligned}$ (B.28)

$\begin{matrix} P (ρ_{s}, ρ_{i}, ρ_{j}) & = \int \frac{d λ_{s}}{2 π i} \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times exp [- λ_{s} ρ_{s} - λ_{1} ρ_{i} - λ_{2} ρ_{j} + φ (λ_{s}, λ_{1}, λ_{2})] . \end{matrix}$ $\begin{aligned} P(\rho _{s},\rho _{i},\rho _{j})&=\int \frac{\mathrm{d}\lambda _{s}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}} \, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}}\,\nonumber \\&\times \exp \left[ -\lambda _{s}\rho _{s}-\lambda _{1}\rho _{i}-\lambda _{2}\rho _{j}+\varphi (\lambda _{s},\lambda _{1},\lambda _{2}) \right]. \end{aligned}$ (B.29)

As a result,

$\begin{matrix} P ({\hat{ρ}}_{i}) & = \int \frac{d λ_{s}}{2 π i} \frac{d λ}{2 π i} \frac{1}{{(λ_{s} + λ {\hat{ρ}}_{i})}^{2}} exp [φ (λ_{s}, λ)] \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i})&= \int \frac{\mathrm{d}\lambda _{s}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}} \frac{1}{(\lambda _{s}+\lambda \hat{\rho }_{i})^{2}} \exp \left[\varphi (\lambda _{s},\lambda ) \right]\end{aligned}$ (B.30)

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) & = \int \frac{d λ_{s}}{2 π i} \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \frac{1}{{(λ_{s} + λ_{1} {\hat{ρ}}_{i} + λ_{2} {\hat{ρ}}_{j})}^{3}} \\ \times exp [φ (λ_{s}, λ_{1}, λ_{2})] \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i},\hat{\rho }_{j})&= \int \frac{\mathrm{d}\lambda _{s}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}} \, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \frac{1}{(\lambda _{s}+\lambda _{1}\hat{\rho }_{i}+\lambda _{2}\hat{\rho }_{j})^{3}}\, \nonumber \\&\times \exp \left[\varphi (\lambda _{s},\lambda _{1},\lambda _{2}) \right] \end{aligned}$ (B.31)

after integration over ρ_s. The latter expressions can be expressed as

$\begin{matrix} P ({\hat{ρ}}_{i}) & = \int \frac{d λ}{2 π i} {[\frac{\partial φ}{\partial λ_{s}}]}_{|_{λ_{s} = - λ {\hat{ρ}}_{i}}} exp [φ (- λ {\hat{ρ}}_{i}, λ_{i})] \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i})&=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}} \left[ \frac{\partial \varphi }{\partial \lambda _{s}} \right]_{\big \vert _{\lambda _{s}=-\lambda \hat{\rho }_{i}}} \exp \left[\varphi (-\lambda \hat{\rho }_{i},\lambda _{i})\right]\end{aligned}$ (B.32)

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) & = \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} {[{(\frac{\partial φ}{\partial λ_{s}})}^{2} + \frac{\partial^{2} φ}{\partial λ_{s}^{2}}]}_{|_{λ_{s} = - λ_{1} {\hat{ρ}}_{i} - λ_{2} {\hat{ρ}}_{j}}} \\ \times exp [φ (- λ_{1} {\hat{ρ}}_{i} - λ_{2} {\hat{ρ}}_{j}, λ_{1}, λ_{2})] . \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i},\hat{\rho }_{j})&=\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \left[ \left( \frac{\partial \varphi }{\partial \lambda _{s}} \right)^{2}+\frac{\partial ^{2}\varphi }{\partial \lambda _{s}^{2}} \right]_{\big \vert _{\lambda _{s}=-\lambda _{1}\hat{\rho }_{i}-\lambda _{2}\hat{\rho }_{j}}}\nonumber \\&\times \exp \left[\varphi (-\lambda _{1}\hat{\rho }_{i}-\lambda _{2}\hat{\rho }_{j},\lambda _{1},\lambda _{2})\right]. \end{aligned}$ (B.33)

In a similar manner, we can obtain the form of the joint PDF for ${{\bar{ρ}}_{i}}$ $\{{\overline{\rho}}_{i}\}$ ,

$\begin{matrix} P ({\bar{ρ}}_{i}) = & \int \frac{d λ}{2 π i} exp [- λ {\bar{ρ}}_{i} + φ (- λ, λ)] \end{matrix}$ $\begin{aligned} P(\overline{\rho }_{i})=&\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}} \exp \left[-\lambda \overline{\rho }_{i}+\varphi (-\lambda ,\lambda )\right]\end{aligned}$ (B.34)

$\begin{matrix} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) = & \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times exp [- λ_{1} {\bar{ρ}}_{i} - λ_{2} {\bar{ρ}}_{j} + φ (- λ_{1} - λ_{2}, λ_{1}, λ_{2})] . \end{matrix}$ $\begin{aligned} P(\overline{\rho }_{i},\overline{\rho }_{j})=&\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}}\,\nonumber \\&\times \exp \left[-\lambda _{1}\overline{\rho }_{i}-\lambda _{2}\overline{\rho }_{j}+\varphi (-\lambda _{1}-\lambda _{2},\lambda _{1},\lambda _{2})\right]. \end{aligned}$ (B.35)

We then use the relation (B.14) to compute the form of these functions.

Noting that the expressions $\int d x_{0} d x_{0}^{'} ξ (x_{0}, x_{0}^{'})$ $\smallint{\rm d}{\bf x}_{0}{\rm d}{\bf x}^\prime_{0}\,\xi({\bf x}_{0},{\bf x}^\prime_{0})$ , ∫dx₀ ξ(x₀, x₁) take all the same averaged value when integrated over the sample, which we note ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ , then at linear order in ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ ,

$\begin{matrix} \frac{\partial φ}{\partial λ_{s}} = & 1 + {\bar{ξ}}_{s} [λ_{s} + φ_{1} (λ_{1}) + φ_{1} (λ_{2})] . \end{matrix}$ $\begin{aligned} \frac{\partial \varphi }{\partial \lambda _{s}}=&1+\overline{\xi }_{s}\left[\lambda _{s}+\varphi _{1}(\lambda _{1})+\varphi _{1}(\lambda _{2})\right].\end{aligned}$ (B.36)

$\begin{matrix} \frac{\partial^{2} φ}{\partial λ_{s}^{2}} & = {\bar{ξ}}_{s} . \end{matrix}$ $\begin{aligned} \frac{\partial ^{2}\varphi }{\partial \lambda _{s}^{2}}&=\overline{\xi }_{s}. \end{aligned}$ (B.37)

At the same order, we then have

$\begin{matrix} {(\frac{\partial φ}{\partial λ_{s}})}^{2} + \frac{\partial^{2} φ}{\partial λ_{s}^{2}} = 1 + {\bar{ξ}}_{s} [1 + 2 (λ_{s} + φ_{1} (λ_{1}) + φ_{1} (λ_{2}))] . \end{matrix}$ $\begin{aligned} \left( \frac{\partial \varphi }{\partial \lambda _{s}} \right)^{2}+\frac{\partial ^{2}\varphi }{\partial \lambda _{s}^{2}} = 1+\overline{\xi }_{s}\left[ 1+2\left( \lambda _{s}+\varphi _{1}(\lambda _{1})+\varphi _{1}(\lambda _{2}) \right) \right]. \end{aligned}$ (B.38)

Combining both the expressions of $P ({\hat{ρ}}_{i})$ $P({\hat{\rho}}_{i})$ and $P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j})$ $P({\hat{\rho}}_{i},{\hat{\rho}}_{j})$ and expanding all terms at linear order in ${\bar{ξ}}_{s}$ ${\overline{\xi}_{s}}$ , we obtain

$\begin{matrix} P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) - P ({\hat{ρ}}_{i}) P ({\hat{ρ}}_{j}) & = \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times [1 + {\bar{ξ}}_{s} (1 + φ_{1} (λ_{1}) - λ_{1} {\hat{ρ}}_{i}) (1 + φ_{1} (λ_{2}) - λ_{2} {\hat{ρ}}_{j})] \\ \times exp [- λ_{1} {\hat{ρ}}_{i} - λ_{2} {\hat{ρ}}_{j} + φ_{0} (λ_{1}) + φ_{0} (λ_{2})] . \end{matrix}$ $\begin{aligned} P(\hat{\rho }_{i},\hat{\rho }_{j})-P(\hat{\rho }_{i})P(\hat{\rho }_{j})&= \int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \nonumber \\&\times \left[ 1+\overline{\xi }_{s}\left( 1+\varphi _{1}(\lambda _{1})-\lambda _{1}\hat{\rho }_{i} \right)\left( 1+\varphi _{1}(\lambda _{2})-\lambda _{2}\hat{\rho }_{j} \right) \right]\ \nonumber \\&\times \exp \left[-\lambda _{1}\hat{\rho }_{i}-\lambda _{2}\hat{\rho }_{j}+\varphi _{0}(\lambda _{1})+\varphi _{0}(\lambda _{2})\right]. \end{aligned}$ (B.39)

This leads to the definition of the first sample bias function,

$\begin{matrix} b_{s 1} ({\hat{ρ}}_{i}) = \frac{1}{P ({\hat{ρ}}_{i})} \int \frac{d λ}{2 π i} (1 + φ_{1} (λ) - λ {\hat{ρ}}_{i}) exp [- λ {\hat{ρ}}_{i} + φ_{0} (λ)], \end{matrix}$ $\begin{aligned} b_{\rm s1}(\hat{\rho }_{i})=\frac{1}{P(\hat{\rho }_{i})} \int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}} \left( 1+\varphi _{1}(\lambda )-\lambda \hat{\rho }_{i} \right) \exp \left[-\lambda \hat{\rho }_{i}+\varphi _{0}(\lambda )\right], \end{aligned}$ (B.40)

which can be re-expressed in terms of the density-bias function defined in Eq. (18) and the derivative of $P ({\hat{ρ}}_{i})$ $P({\hat{\rho}}_{i})$ with respect to ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$

$\begin{matrix} b_{s 1} ({\hat{ρ}}_{i}) = b ({\hat{ρ}}_{i}) + 1 + \frac{d log (P ({\hat{ρ}}_{i}))}{d log {\hat{ρ}}_{i}} . \end{matrix}$ $\begin{aligned} b_{\rm s1}(\hat{\rho }_{i})=b(\hat{\rho }_{i})+1+\frac{\mathrm{d}\log (P(\hat{\rho }_{i}))}{\mathrm{d}\log \hat{\rho }_{i}}. \end{aligned}$ (B.41)

The second sample-bias function can be obtained in a similar manner. We indeed have

$\begin{matrix} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) - P ({\bar{ρ}}_{i}) P ({\bar{ρ}}_{j}) = & \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times [1 + {\bar{ξ}}_{s} (φ_{1} (λ_{1}) - λ_{1}) (φ_{1} (λ_{2}) - λ_{2})] \\ \times exp [- λ_{1} {\bar{ρ}}_{i} - λ_{2} {\bar{ρ}}_{j} + φ_{0} (λ_{1}) + φ_{0} (λ_{2})], \end{matrix}$ $\begin{aligned} P(\overline{\rho }_{i},\overline{\rho }_{j})-P(\overline{\rho }_{i})P(\overline{\rho }_{j})=&\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \nonumber \\&\times \left[ 1+\overline{\xi }_{s}\left( \varphi _{1}(\lambda _{1})-\lambda _{1} \right)\left( \varphi _{1}(\lambda _{2})-\lambda _{2} \right) \right]\ \nonumber \\&\times \exp \left[-\lambda _{1}\overline{\rho }_{i}-\lambda _{2}\overline{\rho }_{j}+\varphi _{0}(\lambda _{1})+\varphi _{0}(\lambda _{2})\right], \end{aligned}$ (B.42)

which eventually leads to

$\begin{matrix} b_{s 2} ({\bar{ρ}}_{i}) = b ({\bar{ρ}}_{i}) + \frac{d log (P ({\bar{ρ}}_{i}))}{d {\bar{ρ}}_{i}} . \end{matrix}$ $\begin{aligned} b_{\rm s2}(\overline{\rho }_{i})=b(\overline{\rho }_{i})+\frac{\mathrm{d}\log (P(\overline{\rho }_{i}))}{\mathrm{d}\overline{\rho }_{i}}. \end{aligned}$ (B.43)

B.5. Response to a change in amplitude in ξ

A close notion related to the density-bias function is how the PDF is changed when the parameters of the simulations are changed. In particular for tree models, the statistical properties are entirely determined by the amplitude of the two-point function, for instance, at cell size. This dependence can be made explicit by writing Eq. (17) as

$\begin{matrix} P (ρ_{i}, \bar{ξ}) = \frac{1}{\bar{ξ}} \int \frac{d \hat{λ}}{2 π i} exp (- \hat{λ} \frac{ρ_{i}}{\bar{ξ}} + \frac{1}{\bar{ξ}} ψ (\hat{λ})) \end{matrix}$ $\begin{aligned} P(\rho _{i},\overline{\xi })=\frac{1}{\overline{\xi }}\int \frac{\mathrm{d}\hat{\lambda }}{2 \pi \mathrm{i}}\,\exp \left(-\hat{\lambda }\frac{\rho _{i}}{\overline{\xi }}+\frac{1}{\overline{\xi }}\psi (\hat{\lambda })\right) \end{aligned}$ (B.44)

after the change of variable and function,

$\begin{matrix} \hat{λ} = \bar{ξ} λ, ψ (\hat{λ}) = \bar{ξ} φ (λ), \end{matrix}$ $\begin{aligned} \hat{\lambda }=\overline{\xi }\lambda ,\ \ \psi (\hat{\lambda })=\overline{\xi }\varphi (\lambda ), \end{aligned}$ (B.45)

where then the expression $ψ (\hat{λ})$ $\psi({\hat{\lambda}})$ does not depend on $\bar{ξ}$ ${\overline{\xi}}$ (only on the functional form of ζ). It follows that

$\begin{matrix} - \bar{ξ} \frac{\partial P (ρ_{i}, \bar{ξ})}{\partial \bar{ξ}} & = P (ρ_{i}, \bar{ξ}) + ρ \frac{\partial P (ρ_{i}, \bar{ξ})}{\partial ρ_{i}} \\ + \int \frac{d λ}{2 π i} φ (λ) exp (- \hat{λ} \frac{ρ_{i}}{\bar{ξ}} + φ (λ)) . \end{matrix}$ $\begin{aligned} -\overline{\xi }\frac{\partial P(\rho _{i},\overline{\xi })}{\partial \overline{\xi }}&=P(\rho _{i},\overline{\xi })+\rho \frac{\partial P(\rho _{i},\overline{\xi })}{\partial \rho _{i}} \nonumber \\& +\int \frac{\mathrm{d}\lambda }{2 \pi \mathrm{i}}\,\varphi (\lambda )\exp \left(-\hat{\lambda }\frac{\rho _{i}}{\overline{\xi }}+\varphi (\lambda )\right). \end{aligned}$ (B.46)

This expression can be used to defined the function b_ξ(ρ_i) as

$\begin{matrix} b_{ξ} (ρ_{i}) = - \frac{\partial log P (ρ_{i}, \bar{ξ})}{\partial log \bar{ξ}} . \end{matrix}$ $\begin{aligned} b_{\xi }(\rho _{i})=-\frac{\partial \log P(\rho _{i},\overline{\xi })}{\partial \log \overline{\xi }}. \end{aligned}$ (B.47)

It appears that b_ξ(ρ_i) is very similar to b_s1(ρ_i), but the two are not equal in general.

B.6. Close cell results

B.6.1. CGF for 2 close cells

Saddle point approximation and close cell results. In case of two cells, the general system in the mean-field approximation leads to

$\begin{matrix} τ_{1} & = λ_{1} \bar{ξ} ζ^{'} (τ_{1}) + λ_{2} ξ_{12} ζ^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau _{1}&=\lambda _{1}\overline{\xi }\zeta ^{\prime }(\tau _{1})+\lambda _{2}\xi _{12}\zeta ^{\prime }(\tau _{2})\end{aligned}$ (B.48)

$\begin{matrix} τ_{2} & = λ_{2} {\bar{ξ}}_{12} ζ^{'} (τ_{1}) + λ_{2} \bar{ξ} ζ^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau _{2}&=\lambda _{2}\overline{\xi }_{12}\zeta ^{\prime }(\tau _{1})+\lambda _{2}\overline{\xi }\zeta ^{\prime }(\tau _{2}) \end{aligned}$ (B.49)

and

$\begin{matrix} φ (λ_{1}, λ_{2}) & = λ_{1} (ζ (τ_{1}) - \frac{1}{2} τ_{1} ζ^{'} (τ_{1})) \\ + λ_{2} (ζ (τ_{2}) - \frac{1}{2} τ_{2} ζ^{'} (τ_{2})) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})&=\lambda _{1}\left(\zeta (\tau _{1})-\frac{1}{2}\tau _{1}\zeta ^{\prime }(\tau _{1})\right) \nonumber \\& +\lambda _{2}\left(\zeta (\tau _{2})-\frac{1}{2}\tau _{2}\zeta ^{\prime }(\tau _{2})\right). \end{aligned}$ (B.50)

We are interested here in the behavior of φ(λ₁, λ₂) when the two cells are close, that is, when $ξ_{12} \to \bar{ξ}$ $\xi_{12}\to{\overline{\xi}}$ . When $ξ_{12} = \bar{ξ}$ $\xi_{12}={\overline{\xi}}$ , τ₁ and τ₂ are also equal, making φ(λ₁, λ₂) a sole function of λ₁ + λ₂ and therefore forcing the joint PDF to be proportional to δ_Dirac(ρ_i − ρ_j). To be more precise, in this regime, $ξ_{12} \to \bar{ξ}$ $\xi_{12}\to{\overline{\xi}}$ , δ_ρ ≡ (ρ_i − ρ_j)/2 is expected to be distributed with a width of about $Δ_{ξ}^{1 / 2} = {(\bar{ξ} - ξ_{12})}^{1 / 2}$ $\Delta^{1/2}_{\xi}=({\overline{\xi}}-\xi_{12})^{1/2}$ . This suggests that in this limit, the difference λ₁ − λ₂ should be treated as a large quantity of about ${(\bar{ξ} - ξ_{12})}^{- 1 / 2}$ $({\overline{\xi}}-\xi_{12})^{-1/2}$ . The limit behavior of the joint CGF can then be explicitly computed in terms of

$\begin{matrix} λ & = λ_{1} + λ_{2}, \end{matrix}$ $\begin{aligned} \lambda&=\lambda _{1}+\lambda _{2},\end{aligned}$ (B.51)

$\begin{matrix} μ & = λ_{1} - λ_{2}, \end{matrix}$ $\begin{aligned} \mu&=\lambda _{1}-\lambda _{2},\end{aligned}$ (B.52)

$\begin{matrix} Δ_{ξ} & = \bar{ξ} - ξ_{12} . \end{matrix}$ $\begin{aligned} \Delta _{\xi }&=\overline{\xi }-\xi _{12}. \end{aligned}$ (B.53)

In this limit, we obtain

$\begin{matrix} τ & = λ \bar{ξ} ζ^{'} (τ) + μ \bar{ξ} ζ^{″} (τ) δ τ, \end{matrix}$ $\begin{aligned} \tau&=\lambda \,\overline{\xi }\,\zeta ^{\prime }(\tau )+\mu \,\overline{\xi }\,\zeta ^{\prime \prime }(\tau )\,\delta \tau ,\end{aligned}$ (B.54)

$\begin{matrix} δ τ & = \frac{μ}{2} Δ_{ξ} ζ^{'} (τ), \end{matrix}$ $\begin{aligned} \delta \tau&=\frac{\mu }{2}\,\Delta _{\xi }\,\zeta ^{\prime }(\tau ), \end{aligned}$ (B.55)

leading to

$\begin{matrix} τ = λ \bar{ξ} ζ^{'} (τ) + \frac{μ^{2} \bar{ξ}}{4} Δ_{ξ} {(ζ^{'} {(τ)}^{2})}^{'} \end{matrix}$ $\begin{aligned} \tau =\lambda \,\overline{\xi }\,\zeta ^{\prime }(\tau )+\frac{\mu ^{2}\overline{\xi }}{4}\,\Delta _{\xi }\,\left(\zeta ^{\prime }(\tau )^{2}\right)^{\prime } \end{aligned}$ (B.56)

and

$\begin{matrix} φ (λ_{1}, λ_{2}) & = λ (ζ (τ) - \frac{1}{2} τ ζ^{'} (τ)) \\ + \frac{μ^{2}}{4} Δ_{ξ} (ζ^{'} {(τ)}^{2} - \frac{1}{2} τ {(ζ^{'} {(τ)}^{2})}^{'}) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})&=\lambda \left(\zeta (\tau )-\frac{1}{2}\tau \zeta ^{\prime }(\tau )\right) \nonumber \\& +\frac{\mu ^{2}}{4}\,\Delta _{\xi }\, \left(\zeta ^{\prime }(\tau )^{2}-\frac{1}{2}\tau \left(\zeta ^{\prime }(\tau )^{2}\right)^{\prime } \right). \end{aligned}$ (B.57)

The joint PDF of ρ_m = (ρ_i + ρ_j)/2 and δ_ρ then reads

$\begin{matrix} P (ρ_{m}, δ_{ρ}) = \int \frac{d λ}{2 π i} \frac{d μ}{2 π i} exp [- λ ρ_{m} - μ δ_{ρ} + φ (λ, μ)], \end{matrix}$ $\begin{aligned} P({\rho _{\rm m}},{\delta _{\!\rho }})= \int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\, \frac{\mathrm{d}\mu }{2\pi \mathrm{i}} \exp \left[-\lambda {\rho _{\rm m}}-\mu {\delta _{\!\rho }}+\varphi (\lambda ,\mu )\right] , \end{aligned}$ (B.58)

for which there is in general no closed form. We then need to rely on approximation schemes to complete the calculations.

B.6.2. Saddle point approximation

One of the approximations that can be used to evaluate Eq. (B.58) is to use the saddle point approximation. It has been used in the literature to compute the PDF (see Balian & Schaeffer 1989; Bernardeau 1992; Valageas 2002; Codis et al. 2016a). It is a priori valid when $\bar{ξ}$ ${\overline{\xi}}$ is small (and not for too high values of the density). In this approach, the expression under the exponential is approximated by a quadratic form at its minimum. In practice, the latter is obtained from the resolution of the system

$\begin{matrix} \frac{\partial φ (λ, μ)}{\partial λ} & = ρ_{m} \end{matrix}$ $\begin{aligned} \frac{\partial \varphi (\lambda ,\mu )}{\partial \lambda }&={\rho _{\rm m}}\end{aligned}$ (B.59)

$\begin{matrix} \frac{\partial φ (λ, μ)}{\partial μ} & = δ_{ρ}, \end{matrix}$ $\begin{aligned} \frac{\partial \varphi (\lambda ,\mu )}{\partial \mu }&={\delta _{\!\rho }}, \end{aligned}$ (B.60)

which leads to the implicit or explicit values of λ, μ, and τ at the saddle point position (we hereafter denote this with the subscript s),

$\begin{matrix} ρ_{m} = ζ (τ_{s}), δ_{ρ} = μ_{s} \frac{Δ_{ξ}}{2} {(ζ^{'} (τ_{s}))}^{2} . \end{matrix}$ $\begin{aligned} {\rho _{\rm m}}=\zeta (\tau _{s}),\ \ {\delta _{\!\rho }}=\mu _{s}\,\frac{\Delta _{\xi }}{2}\left(\zeta ^{\prime }(\tau _{s})\right)^{2}. \end{aligned}$ (B.61)

It is to be noted here that the value of τ_s is independent of δ_ρ. At the saddle point position, we then have

$\begin{matrix} - λ_{s} {ρ_{m}}_{s} - μ_{s} δ_{ρ} + φ (λ_{s}, μ_{s}) = - \frac{{δ_{ρ}}^{2}}{Δ_{ξ} {[ζ^{'} (τ_{s})]}^{2}} - \frac{τ_{s}^{2}}{2} . \end{matrix}$ $\begin{aligned} -\lambda _{s}{\rho _{\rm m}}_{s}-\mu _{s}{\delta _{\!\rho }}+\varphi (\lambda _{s},\mu _{s})= -\frac{{\delta _{\!\rho }}^{2}}{\Delta _{\xi }\left[\zeta ^{\prime }(\tau _{s})\right]^{2}}-\frac{\tau _{s}^{2}}{2}. \end{aligned}$ (B.62)

This then suggests that the joint PDF is given by the product of the one-point PDF of ρ_m and a Gaussian distribution of δ_ρ of width $Δ_{ξ}^{1 / 2} ζ' (τ_{s}) / \sqrt{2}$ $\Delta^{1/2}_{\xi}\zeta\prime(\tau_{s})/\sqrt{2}$ . For usual models, as described above, ζ′(τ_s)² scales like a power of ρ_m so that one suggested form for the joint PDF is the following:

$\begin{matrix} P (ρ_{m}, δ_{ρ}) = P (ρ_{m}) exp (- \frac{{δ_{ρ}}^{2}}{Δ_{ξ} ρ_{m}^{α}}) \frac{1}{\sqrt{π Δ_{ξ} {ρ_{m}}^{α}}} . \end{matrix}$ $\begin{aligned} P({\rho _{\rm m}},{\delta _{\!\rho }})=P({\rho _{\rm m}})\exp \left(-\frac{{\delta _{\!\rho }}^{2}}{\Delta _{\xi }\rho _{m}^{\alpha }}\right)\frac{1}{\sqrt{\pi \Delta _{\xi }{\rho _{\rm m}}^{\alpha }}}. \end{aligned}$ (B.63)

Interestingly, the value of α can be related to the reduced skewness of the density field from the computation of ⟨δ_ρ²ρ_m⟩_c, and in the context of tree hierarchical models, it leads to

$\begin{matrix} α = \frac{2}{3} S_{3} . \end{matrix}$ $\begin{aligned} \alpha =\frac{2}{3}S_{3}. \end{aligned}$ (B.64)

The validity of this form clearly ought to be checked. Its simplicity nonetheless offers a good grasp of the contribution of close cells to the covariance matrix.

Appendix C: Minimal tree model

In the previous section, general formulae were given. The aim of this section is to account for more precise results obtained in the case of a specific hierarchical model, namely the minimal tree model, as described below. It can then serve as a toy model for the construction of the approximate form for the covariance matrix. We first recall that this model describes the Rayleigh Levy flights model.

C.1. One-point results in the mean-field approximation

The Rayleigh-Levy flight model makes it possible to build synthetic samples whose statistical properties follow the minimal model, that is, a hierarchical model with the following vertex-generating function:

$\begin{matrix} ζ_{RL} (τ) = 1 + τ + \frac{1}{4} τ^{2} . \end{matrix}$ $\begin{aligned} {\zeta _{\rm RL}}(\tau )=1+\tau +\frac{1}{4}\tau ^{2}. \end{aligned}$ (C.1)

In the one-cell mean-field approximation, we have the equation

$\begin{matrix} τ = λ \bar{ξ} {ζ_{RL}}^{'} (τ), \end{matrix}$ $\begin{aligned} \tau =\lambda \overline{\xi }{\zeta _{\rm RL}}^{\prime }(\tau ), \end{aligned}$ (C.2)

which can be solved in

$\begin{matrix} τ = \frac{λ ξ}{1 - λ ξ / 2}, \end{matrix}$ $\begin{aligned} \tau =\frac{\lambda \xi }{1-\lambda \,\xi /2}, \end{aligned}$ (C.3)

which leads to the following expression for the CGF:

$\begin{matrix} φ (λ) = \frac{λ}{1 - λ \bar{ξ} / 2} . \end{matrix}$ $\begin{aligned} \varphi (\lambda )=\frac{\lambda }{1-\lambda \,\overline{\xi }/2}. \end{aligned}$ (C.4)

The one-point PDF of the density can then be computed explicitly. It takes the form in the continuous limit of

$\begin{matrix} P (ρ) & = \int \frac{d λ}{2 π i} e^{- λ ρ + φ (λ)} \\ = e^{- \frac{2}{\bar{ξ}}} δ_{Dirac} (ρ) + \frac{4}{{\bar{ξ}}^{2}} e^{- \frac{2}{\bar{ξ}} (1 + ρ)}_{0} F_{1} (2, \frac{4 ρ}{\bar{ξ}}) . \end{matrix}$ $\begin{aligned} P(\rho )&=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\,e^{-\lambda \rho +\varphi (\lambda )} \nonumber \\&=e^{-\frac{2}{\overline{\xi }}}\,{\delta _{\rm Dirac}}(\rho )+\frac{4}{\overline{\xi }^{2}}e^{-\frac{2}{\overline{\xi }}(1+\rho )} \ _{0}F_{1}\left(2,\frac{4\rho }{\overline{\xi }}\right) . \end{aligned}$ (C.5)

For this particular model, the void probability distribution (VPF) is nonzero even in the continuous limit. We recall here that the general expression of the VPF is given by exp(φ(−N̄)), which for the minimal model leads to

$\begin{matrix} P_{0} = exp (- 2 / \bar{ξ}) \end{matrix}$ $\begin{aligned} P_{0}=\exp \left(-2/\overline{\xi }\right) \end{aligned}$ (C.6)

when N̄ → ∞.

The density-bias function can also be computed explicitly. For the minimal model, we have φ₁(λ)=φ(λ) so that

$\begin{matrix} b_{1} (ρ) & = \frac{1}{P (ρ)} \int \frac{d λ}{2 π i} φ_{1} (λ) e^{- λ ρ + φ (λ)} \\ = \frac{_{0} F_{1} (1, \frac{4 ρ}{\bar{ξ}})}{_{0} F_{1} (2, \frac{4 ρ}{\bar{ξ}})} - \frac{2}{\bar{ξ}} \end{matrix}$ $\begin{aligned} b_{1}(\rho )&=\frac{1}{P(\rho )}\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\varphi _{1}(\lambda )\, e^{-\lambda \rho +\varphi (\lambda )}\nonumber \\&=\frac{\ _{0}F_{1}\left(1,\frac{4\rho }{\overline{\xi }}\right)}{\ _{0}F_{1}\left(2,\frac{4\rho }{\overline{\xi }}\right)}-\frac{2}{\overline{\xi }} \end{aligned}$ (C.7)

for ρ > 0. For this model, the fact that φ₁(λ)=φ(λ) implies that

$\begin{matrix} b_{s 1} (ρ) = b_{ξ} (ρ) . \end{matrix}$ $\begin{aligned} b_{\rm s1}(\rho )=b_{\xi }(\rho ). \end{aligned}$ (C.8)

This means that in the case of the minimal model, the density-bias function can be extracted from the functional form of the one-point PDF as

$\begin{matrix} b (ρ) P (ρ) = [- 1 - \frac{d}{d log ρ} - \frac{d}{d log \bar{ξ}}] P (ρ, \bar{ξ}) . \end{matrix}$ $\begin{aligned} b(\rho )P(\rho )=\left[-1-\frac{\mathrm{d}}{\mathrm{d}\log \rho }-\frac{\mathrm{d}}{\mathrm{d}\log \overline{\xi }}\right]P(\rho ,\overline{\xi }). \end{aligned}$ (C.9)

This is a somewhat remarkable identity (which can be extended to higher orders, as shown below.)

In this case, the second-order expansion leads to the form φ₂(λ) given by

$\begin{matrix} φ_{2} (λ) = \frac{1}{2} φ_{1} (λ), \end{matrix}$ $\begin{aligned} \varphi _{2}(\lambda )=\frac{1}{2}\varphi _{1}(\lambda ), \end{aligned}$ (C.10)

and we note that φ_p(λ) all vanish for p ≥ 3.

C.2. Two-cell results in the mean-field approximation

These mean-field calculations can be extended to the two-cell case. In this case, we have the system

$\begin{matrix} τ_{1} & = λ_{1} \bar{ξ} {ζ_{RL}}^{'} (τ_{1}) + λ_{2} ξ_{12} {ζ_{RL}}^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau _{1}&=\lambda _{1}\,\overline{\xi }\,{\zeta _{\rm RL}}^{\prime }(\tau _{1})+\lambda _{2}\,\xi _{12}\,{\zeta _{\rm RL}}^{\prime }(\tau _{2})\end{aligned}$ (C.11)

$\begin{matrix} τ_{2} & = λ_{1} ξ_{12} {ζ_{RL}}^{'} (τ_{1}) + λ_{2} \bar{ξ} {ζ_{RL}}^{'} (τ_{2}) \end{matrix}$ $\begin{aligned} \tau _{2}&=\lambda _{1}\,\xi _{12}\,{\zeta _{\rm RL}}^{\prime }(\tau _{1})+\lambda _{2}\,\overline{\xi }\,{\zeta _{\rm RL}}^{\prime }(\tau _{2}) \end{aligned}$ (C.12)

when the two cells are of the same size. This leads to the following expression for the joint CGF:

$\begin{matrix} φ (λ_{1}, λ_{2}) = \frac{λ_{1} + λ_{2} + (ξ_{12} - \bar{ξ}) λ_{1} λ_{2}}{1 - (λ_{1} + λ_{2}) \bar{ξ} / 2 - λ_{1} λ_{2} (ξ_{12}^{2} - {\bar{ξ}}^{2}) / 4} . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})= \frac{\lambda _{1}+\lambda _{2}+(\xi _{12}-\overline{\xi })\lambda _{1}\lambda _{2}}{1-(\lambda _{1}+\lambda _{2})\,\overline{\xi }/2-\lambda _{1}\lambda _{2}\,(\xi _{12}^{2}-\overline{\xi }^{2})/4}. \end{aligned}$ (C.13)

Remarkably, this expression can be written in terms of the one-cell CGF as

$\begin{matrix} φ (λ_{1}, λ_{2}) = \frac{φ (λ_{1}) + φ (λ_{2}) + ξ_{12} φ (λ_{1}) φ (λ_{2})}{1 - ξ_{12}^{2} φ (λ_{1}) φ (λ_{2}) / 4} . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})=\frac{\varphi (\lambda _{1})+\varphi (\lambda _{2})+\xi _{12}\,\varphi (\lambda _{1})\varphi (\lambda _{2})}{1-\xi _{12}^{2}\,\varphi (\lambda _{1})\varphi (\lambda _{2})/4}. \end{aligned}$ (C.14)

This opens the possibility of computing the joint PDF to any order of ξ₁₂. The calculation of this expansion is made simple by the following observations: The corrective terms will make intervene functions of the forms

$\begin{matrix} b_{n} (ρ) P (ρ) \equiv \int \frac{d λ}{2 π i} {[φ (λ)]}^{n} exp (- λ ρ + φ (λ)) . \end{matrix}$ $\begin{aligned} b_{n}(\rho )\,P(\rho )\equiv \int \frac{\mathrm{d}\lambda }{2 \pi \mathrm{i}}\left[\varphi (\lambda )\right]^{n}\ \exp \left(-\lambda \rho +\varphi (\lambda )\right). \end{aligned}$ (C.15)

We further note that

$\begin{matrix} \frac{d}{d λ} φ (λ) = {(\frac{\bar{ξ}}{2})}^{2} {[ψ (λ)]}^{2} \end{matrix}$ $\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}\lambda }\varphi (\lambda )=\left(\frac{\overline{\xi }}{2}\right)^{2}\left[\psi (\lambda )\right]^{2} \end{aligned}$ (C.16)

with

$\begin{matrix} ψ (λ) = φ (λ) + \frac{2}{\bar{ξ}} . \end{matrix}$ $\begin{aligned} \psi (\lambda )=\varphi (\lambda )+\frac{2}{\overline{\xi }}. \end{aligned}$ (C.17)

We the define c_n(ρ) as

$\begin{matrix} c_{n} (ρ) P (ρ) \equiv \int \frac{d λ}{2 π i} {[ψ (λ)]}^{n} exp (- λ ρ + φ (λ)) . \end{matrix}$ $\begin{aligned} c_{n}(\rho )\,P(\rho )\equiv \int \frac{\mathrm{d}\lambda }{2 \pi \mathrm{i}}\left[\psi (\lambda )\right]^{n}\ \exp \left(-\lambda \rho +\varphi (\lambda )\right). \end{aligned}$ (C.18)

We then have on one side

$\begin{matrix} b_{n} (ρ) = \sum_{q = 0}^{n} C_{n}^{q} {(- \frac{2}{\bar{ξ}})}^{q} c_{n - q} (ρ), \end{matrix}$ $\begin{aligned} b_{n}(\rho )=\sum _{q=0}^{n}C_{n}^{q}\left(-\frac{2}{\overline{\xi }}\right)^{q}c_{n-q}(\rho ), \end{aligned}$ (C.19)

and on the other side

$\begin{matrix} ρ c_{n} (ρ) = {(\frac{\bar{ξ}}{2})}^{2} (c_{n + 2} (ρ) + n c_{n + 1} (ρ)), \end{matrix}$ $\begin{aligned} \rho \,c_{n}(\rho )=\left(\frac{\overline{\xi }}{2}\right)^{2}\left(c_{n+2}(\rho )+nc_{n+1}(\rho )\right), \end{aligned}$ (C.20)

which derives from the fact that

$\begin{matrix} ρ c_{n} (ρ) = \int \frac{d λ}{2 π i} exp (- λ ρ) \frac{d}{d λ} {{[ψ (λ)]}^{n} exp (φ (λ))} \end{matrix}$ $\begin{aligned} \rho \, c_{n}(\rho )=\int \frac{\mathrm{d}\lambda }{2 \pi \mathrm{i}}\exp \left(-\lambda \rho \right)\frac{\mathrm{d}}{\mathrm{d}\lambda } \left\{ \left[\psi (\lambda )\right]^{n}\exp \left(\varphi (\lambda )\right)\right\} \end{aligned}$ (C.21)

after integration by parts. As a result, the expression of the join PDF to any order can be written as polynomials making intervene P(ρ₁), b(ρ₁) P(ρ₂) and b(ρ₁) only.

C.3. Perturbative expansion for close cells

Another interesting result is when the cell centers are close (so that cells overlap), as described above. In this case, the limit behavior of the joint CGF is given by

$\begin{matrix} φ (λ_{1}, λ_{2}) = \frac{λ + μ^{2} Δ_{ξ} / 4}{1 - λ \bar{ξ} / 2 - μ^{2} Δ_{ξ} \bar{ξ} / 8}, \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})=\frac{\lambda +\mu ^{2}\,\Delta _{\xi }/4}{1-\lambda \,\overline{\xi }/2-\mu ^{2}\,\Delta _{\xi }\,\overline{\xi }/8}, \end{aligned}$ (C.22)

with

$\begin{matrix} λ & = λ_{1} + λ_{2}, \end{matrix}$ $\begin{aligned} \lambda&=\lambda _{1}+\lambda _{2},\end{aligned}$ (C.23)

$\begin{matrix} μ & = λ_{1} - λ_{2}, \end{matrix}$ $\begin{aligned} \mu&=\lambda _{1}-\lambda _{2},\end{aligned}$ (C.24)

$\begin{matrix} Δ_{ξ} & = \bar{ξ} - ξ_{12} . \end{matrix}$ $\begin{aligned} \Delta _{\xi }&=\overline{\xi }-\xi _{12}. \end{aligned}$ (C.25)

It is then remarkable to see the result can be expressed with the sole one-cell CGF,

$\begin{matrix} φ (λ_{1}, λ_{2}) = φ (λ + μ^{2} Δ_{ξ} / 4) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{1},\lambda _{2})=\varphi (\lambda +\mu ^{2}\,\Delta _{\xi }/4). \end{aligned}$ (C.26)

In other words, the GFC of the variables ρ_m = (ρ₁ + ρ₂)/2 and δ_ρ = (ρ₁ − ρ₂)/2 is given by Eq. (C.26). It is possible to compute the joint PDF,

$\begin{matrix} P (ρ_{m}, δ_{ρ}) & = \int \frac{d λ}{2 π i} \int \frac{d μ}{2 π i} \\ \times exp (- λ ρ_{m} - μ δ_{ρ} + φ (λ + μ^{2} Δ_{ξ} / 4)), \end{matrix}$ $\begin{aligned} P({\rho _{\rm m}},{\delta _{\!\rho }})&=\int \frac{\mathrm{d}\lambda }{2\pi \mathrm{i}}\int \frac{\mathrm{d}\mu }{2\pi \mathrm{i}} \nonumber \\& \times \exp \left( -\lambda \,{\rho _{\rm m}}-\mu \,{\delta _{\!\rho }}+\varphi (\lambda +\mu ^{2}\,\Delta _{\xi }/4)\right), \end{aligned}$ (C.27)

with the change of variable

$\begin{matrix} \tilde{λ} = λ + μ^{2} \frac{Δ_{ξ}}{4} . \end{matrix}$ $\begin{aligned} \tilde{\lambda }=\lambda +\mu ^{2}\,\frac{\Delta _{\xi }}{4}. \end{aligned}$ (C.28)

The integral in $\tilde{λ}$ $\tilde\lambda$ leads to the one-cell PDF of the density ρ_m, whereas the integral in μ can be done explicitly as it is a quadratic form in μ, leading to a Gaussian distribution in δ_ρ. The final PDF is given by

$\begin{matrix} P (ρ_{m}, δ ρ) = P (ρ_{m}) \frac{1}{{[π Δ_{ξ} ρ_{m}]}^{1 / 2}} exp (- \frac{{δ_{ρ}}^{2}}{Δ_{ξ} ρ_{m}}) . \end{matrix}$ $\begin{aligned} P({\rho _{\rm m}},\delta \rho )=P({\rho _{\rm m}})\, \frac{1}{\left[\pi \Delta _{\xi }{\rho _{\rm m}}\right]^{1/2}}\exp \left(-\frac{{\delta _{\!\rho }}^{2}}{\Delta _{\xi }\,{\rho _{\rm m}}}\right). \end{aligned}$ (C.29)

This shows that the joint PDF peaks for ρ₁ ∼ ρ₂ with a width that depends on the distance between the cells through the difference $\bar{ξ} - ξ_{12}$ ${\overline{\xi}}-\xi_{12}$ . Moreover, this form has no overlapping regime with the previous expansions of the joint PDF. It captures different aspects of the covariance calculations as listed below.

The previous expression says that close cells contribute more specifically to the covariance when ρ₁ and ρ₂ are close. This suggests that Eq. (C.29) contributes mostly to the near diagonal terms, whereas off diagonal terms could still be well described by perturbative expansions, as described before.
As noted before, perturbative expansions are closely related to supersample effects. They encode the way in which the local densities are jointly correlated with long-wavelength modes. This is not the case in Eq. (C.29). It rather captures how a rare event, such as a peak, can contribute to the covariance elements: if there is a peak somewhere, nearby cells are likely to have a similar density up to distances for which $\bar{ξ} - ξ_{12}$ ${\overline{\xi}}-\xi_{12}$ remains small enough.

The above development can be pursued to any order in $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ provided the following recipe is applied:

$\begin{matrix} δ_{ρ} \sim Δ_{ξ}^{1 / 2} and μ \sim \frac{1}{Δ_{ξ}^{1 / 2}} . \end{matrix}$ $\begin{aligned} \delta _{\rho }\sim \Delta ^{1/2}_{\xi }\ \ \ \mathrm{and}\ \ \ \mu \sim \frac{1}{\Delta ^{1/2}_{\xi }}. \end{aligned}$ (C.30)

Then the joint density can be computed to any order in $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ , making use of the very same expressions b_n(ρ).

The next-to-leading order in $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ is thus given by

$\begin{matrix} P^{(2)} (ρ_{m}, δ ρ) = P^{(0)} (ρ_{m}, δ ρ) \\ \times {\frac{δ_{ρ}^{2}}{2 \bar{ξ} ρ_{m}} - \frac{ρ_{m} Δ_{ξ}}{{\bar{ξ}}^{2}} - \frac{Δ_{ξ}}{4 \bar{ξ}} - \frac{Δ_{ξ}}{{\bar{ξ}}^{2}} - \frac{δ_{ρ}^{4}}{4 ρ_{m}^{3} Δ_{ξ}} + \frac{3 δ_{ρ}^{2}}{4 ρ_{m}^{2}} - \frac{3 Δ_{ξ}}{16 ρ_{m}} \\ + \frac{(\bar{ξ} ρ_{m} Δ_{ξ} - 2 \bar{ξ} δ_{ρ}^{2} + 8 ρ_{m}^{2} Δ_{ξ})}{8 \bar{ξ} ρ_{m}^{2}} \frac{_{0} F_{1} (1, \frac{4 ρ_{m}}{{\bar{ξ}}^{2}})}{_{0} F_{1} (2, \frac{4 ρ_{m}}{{\bar{ξ}}^{2}})}}, \end{matrix}$ $\begin{aligned}&P^{(2)}({\rho _{\rm m}},\delta \rho )=P^{(0)}({\rho _{\rm m}},\delta \rho ) \nonumber \\&\times \left\{ \frac{\delta _{\rho }^2}{2 \bar{\xi } \rho _m}-\frac{\rho _m \Delta _{\xi }}{\bar{\xi }^2}-\frac{\Delta _{\xi }}{4 \bar{\xi }} -\frac{\Delta _{\xi }}{\bar{\xi }^2}-\frac{\delta _{\rho }^4}{4 \rho _m^3 \Delta _{\xi }}+\frac{3 \delta _{\rho }^2}{4 \rho _m^2}-\frac{3 \Delta _{\xi }}{16 \rho _m} \right. \nonumber \\&\left. +\frac{\left(\bar{\xi } \rho _m \Delta _{\xi }-2 \bar{\xi } \delta _{\rho }^2+8 \rho _m^2 \Delta _{\xi }\right)}{8 \bar{\xi } \rho _m^2} \frac{_0{F}_1\left(1,\frac{4 \rho _m}{\bar{\xi }^2}\right) }{_0{F}_1\left(2,\frac{4 \rho _m}{\bar{\xi }^2}\right)} \right\} , \end{aligned}$ (C.31)

and the expansion can be extended in any (even) order in $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ . Fig. C.1 illustrates the convergence properties of these expansions. Depending on $ξ_{12} / \bar{ξ}$ $\xi_{12}/{\overline{\xi}}$ , either the expansion in ξ₁₂ or that in $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ gives a very accurate estimate of the joint PDF. It opens the way to computing the covariance matrix starting in the two-cell mean-field approximation (C.13).

Fig. C.1.

Performances of the perturbative expansions of the joint PDF $P_{MF} (ρ_{1}, ρ_{2} ; \bar{ξ}, ξ_{12})$ $P_{\mathrm{MF}}(\rho_{1},\rho_{2};{\overline{\xi}},\xi_{12})$ in the mean-field approximation either for the ξ₁₂ expansion (open blue dots) or the $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ expansion (red dots) up to 11th and 10th order, respectively. The comparisons are made for ρ₁ = ρ₂ = 2 and $\bar{ξ} = 1$ ${\overline{\xi}}=1$ (top panel) and for ρ₁ = 0.5, ρ₂ = 3.5 and $\bar{ξ} = 1$ ${\overline{\xi}}=1$ (bottom panel) and for ξ₁₂ equalling 0.1, 0.3, 0.5, 0.7, and 0.9.

C.4. Construction of the theoretical covariance matrix for the minimal tree model

The previous form can be used to compute the covariance matrix for the minimal tree model in simple implementations. It relies on analytic forms for both the two-point cell correlation functions, which can formally be written as

$\begin{matrix} ξ_{12} (r) = \int d^{2} k W_{2 D}^{2} (k R) J_{0} (k r) P (k) \end{matrix}$ $\begin{aligned} \xi _{12}(r)=\int \mathrm{d}^{2}\mathbf{k}\,W_{2D}^{2}(k\,R)\,J_{0}(k\,r)\,P(k) \end{aligned}$ (C.32)

for a given power spectrum. We also make use of the form P_s(r_d) given in footnote 2 to derive the PDF of cell distances. We then have all the required ingredients to compute the elements of the covariance matrix in the mean-field approximation,

$\begin{matrix} {Cov}_{MF} (ρ_{1}, ρ_{2}) = \int d r P_{d} (r) P_{MF}^{(8)} (ρ_{1}, ρ_{2} ; \bar{ξ}, ξ_{12} (r L_{sample})) . \end{matrix}$ $\begin{aligned} \mathrm{Cov}_{\rm MF}(\rho _{1},\rho _{2})=\int \mathrm{d}r\,P_{d}(r)\,P_{\rm MF}^{(8)}(\rho _{1},\rho _{2};\overline{\xi },\xi _{12}(r L_{\rm sample})). \end{aligned}$ (C.33)

In practice, $P_{MF}^{(8)} (ρ_{1}, ρ_{2} ; \bar{ξ}, ξ_{12})$ $P_{\mathrm{MF}}^{(8)}(\rho_{1},\rho_{2};{\overline{\xi}},\xi_{12})$ is computed from the eighth-order expansion either in ξ₁₂ when $ξ_{12} / \bar{ξ} < 0.4$ $\xi_{12}/{\overline{\xi}} < 0.4$ or in Δ_ξ when $ξ_{12} / \bar{ξ} > 0.4$ $\xi_{12}/{\overline{\xi}} > 0.4$ . This is used to explore the detailed properties of the covariance matrix and the validity of approximate schemes.

C.5. Joined PDF for relative densities

The minimal model allows us also to pursue the computation of the joint PDF for the variables ${{\hat{ρ}}_{i}}$ $\{{\hat{\rho}}_{i}\}$ or ${{\bar{ρ}}_{i}}$ $\{{\overline{\rho}}_{i}\}$ in all regimes. The first step is to extend Eq. (B.14) to a regime in which ξ₁₂ is not assumed to be small. We find that

$\begin{matrix} φ (λ_{s}, λ_{1}, λ_{2}) & = λ_{s} + φ (λ_{1}, λ_{2}) + \frac{λ_{s}^{2}}{2} \int d x_{s} d x_{s}^{'} ξ (x_{s}, x_{s}^{'}) \\ + λ_{s} \int d x_{s} ξ (x_{s}, x_{1}) φ_{c} (λ_{1}, λ_{2}) \\ + λ_{s} \int d x_{s} ξ (x_{s}, x_{2}) φ_{c} (λ_{2}, λ_{1}), \end{matrix}$ $\begin{aligned} \varphi (\lambda _{s},\lambda _{1},\lambda _{2})&= \lambda _{s}+\varphi (\lambda _{1},\lambda _{2}) +\frac{\lambda _{s}^{2}}{2}\int \mathrm{d}\mathbf{x}_{s}\mathrm{d}\mathbf{x}^{\prime }_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}^{\prime }_{s}) \nonumber \\&+\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{1})\,\varphi _{c}(\lambda _{1},\lambda _{2}) \nonumber \\&+\lambda _{s}\int \mathrm{d}\mathbf{x}_{s}\,\xi (\mathbf{x}_{s},\mathbf{x}_{2})\,\varphi _{c}(\lambda _{2},\lambda _{1}), \end{aligned}$ (C.34)

where φ_c(λ₁, λ₂) is given by

$\begin{matrix} φ_{c} (λ_{1}, λ_{2}) = \frac{λ_{1} + (ξ_{12} - \bar{ξ}) λ_{1} λ_{2} / 2}{1 - (λ_{1} + λ_{2}) \bar{ξ} / 2 - λ_{1} λ_{2} (ξ_{12}^{2} - {\bar{ξ}}^{2}) / 4}, \end{matrix}$ $\begin{aligned} \varphi _{c}(\lambda _{1},\lambda _{2})= \frac{\lambda _{1}+(\xi _{12}-\overline{\xi })\lambda _{1}\lambda _{2}/2}{1-(\lambda _{1}+\lambda _{2})\,\overline{\xi }/2-\lambda _{1}\lambda _{2}\,(\xi _{12}^{2}-\overline{\xi }^{2})/4} , \end{aligned}$ (C.35)

and we can note that

$\begin{matrix} φ_{c} (λ_{1}, λ_{2}) + φ_{c} (λ_{2}, λ_{1}) = φ (λ_{1}, λ_{2}) . \end{matrix}$ $\begin{aligned} \varphi _{c}(\lambda _{1},\lambda _{2})+\varphi _{c}(\lambda _{2},\lambda _{1})=\varphi (\lambda _{1},\lambda _{2}). \end{aligned}$ (C.36)

At leading order in ξ_s, that is, when we assume that the density fluctuations at sample size are much smaller than at smoothing scale, this expression then reduces to

$\begin{matrix} φ (λ_{s}, λ_{1}, λ_{2}) = λ_{s} + \frac{λ_{s}^{2}}{2} ξ_{s} + φ (λ_{1}, λ_{2}) (1 + λ_{s} ξ_{s}) . \end{matrix}$ $\begin{aligned} \varphi (\lambda _{s},\lambda _{1},\lambda _{2})= \lambda _{s}+\frac{\lambda _{s}^{2}}{2}\xi _{s}+\varphi (\lambda _{1},\lambda _{2})(1+\lambda _{s}\xi _{s}). \end{aligned}$ (C.37)

We can then exploit this relation to compute the $P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j})$ $P({\hat{\rho}}_{i},{\hat{\rho}}_{j})$ and $P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j})$ $P({\overline{\rho}}_{i},{\overline{\rho}}_{j})$ from Eqs. (25) and (31), respectively. We then have at leading order in ξ_s

$\begin{matrix} P_{s 1} ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) & = P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) + ξ_{s} \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times (1 + 2 λ_{s} + \frac{1}{2} λ_{s}^{2} + (2 + λ_{s}) φ (λ_{1}, λ_{2})) \\ \times exp {[λ_{s} + φ (λ_{1}, λ_{2})]}_{|_{λ_{s} = - {\hat{ρ}}_{i} λ_{1} - {\hat{ρ}}_{j} λ_{2}}} \end{matrix}$ $\begin{aligned} P_{s1}(\hat{\rho }_{i},\hat{\rho }_{j})&=P(\hat{\rho }_{i},\hat{\rho }_{j})+\xi _{s}\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \nonumber \\&\times \ \left( 1+2\lambda _{s}+\frac{1}{2}\lambda _{s}^{2}+(2+\lambda _{s})\varphi (\lambda _{1},\lambda _{2}) \right)\nonumber \\&\times \exp \left[\lambda _{s}+\varphi (\lambda _{1},\lambda _{2})\right]_{\big \vert _{\lambda _{s}=-\hat{\rho }_{i}\lambda _{1}-\hat{\rho }_{j}\lambda _{2}}} \end{aligned}$ (C.38)

and

$\begin{matrix} P_{s 2} ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) & = P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) \\ + ξ_{s} \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} {(\frac{1}{2} λ_{s}^{2} + λ_{s} φ (λ_{1}, λ_{2}))}_{|_{λ_{s} = - λ_{1} - λ_{2}}} \\ \times exp [- λ_{1} {\bar{ρ}}_{i} - λ_{2} {\bar{ρ}}_{j} + φ (λ_{1}, λ_{2})] . \end{matrix}$ $\begin{aligned} P_{s2}(\overline{\rho }_{i},\overline{\rho }_{j})&=P(\overline{\rho }_{i},\overline{\rho }_{j})\nonumber \\&+\xi _{s}\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \left( \frac{1}{2}\lambda _{s}^{2}+\lambda _{s}\varphi (\lambda _{1},\lambda _{2}) \right)_{\big \vert _{\lambda _{s}=-\lambda _{1}-\lambda _{2}}} \ \nonumber \\&\times \exp \left[-\lambda _{1}\overline{\rho }_{i}-\lambda _{2}\overline{\rho }_{j}+\varphi (\lambda _{1},\lambda _{2})\right]. \end{aligned}$ (C.39)

To complete the formal calculation of these expressions, we introduce the function

$\begin{matrix} P_{b} (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}) = & \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} φ (λ_{1}, λ_{2}) \\ \times exp [- λ_{1} ρ_{i} - λ_{2} ρ_{j} + φ (λ_{1}, λ_{2})] . \end{matrix}$ $\begin{aligned} P_{b}(\rho _{i},\rho _{j},\overline{\xi },\xi _{12})=&\int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}} \varphi (\lambda _{1},\lambda _{2})\ \nonumber \\&\times \exp \left[-\lambda _{1}\rho _{i}-\lambda _{2}\rho _{j}+\varphi (\lambda _{1},\lambda _{2})\right]. \end{aligned}$ (C.40)

We can first note that Eq. (C.9) can be extended to

$\begin{matrix} P_{b} (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}) = & (- 2 - \frac{d}{d log ρ_{i}} \\ - \frac{d}{d log ρ_{j}} - \frac{d}{d log \bar{ξ}} - \frac{d}{d log ξ_{12}}) P (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}) . \end{matrix}$ $\begin{aligned} P_{b}(\rho _{i},\rho _{j},\overline{\xi },\xi _{12})=&\left(-2-\frac{\mathrm{d}}{\mathrm{d}\log \rho _{i}}\right.\nonumber \\&\left.-\frac{\mathrm{d}}{\mathrm{d}\log \rho _{j}}-\frac{\mathrm{d}}{\mathrm{d}\log \overline{\xi }}-\frac{\mathrm{d}}{\mathrm{d}\log \xi _{12}}\right) P(\rho _{i},\rho _{j},\overline{\xi },\xi _{12}) . \end{aligned}$ (C.41)

This comes from the observation that

$\begin{matrix} P (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}, η) & \equiv \int \frac{d λ_{1}}{2 π i} \frac{d λ_{2}}{2 π i} \\ \times exp [- λ_{1} ρ_{i} - λ_{2} ρ_{j} + η φ (λ_{1}, λ_{2}, \bar{ξ}, ξ_{12})] \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j},\overline{\xi },\xi _{12},\eta )&\equiv \int \frac{\mathrm{d}\lambda _{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\lambda _{2}}{2\pi \mathrm{i}}\ \nonumber \\&\times \exp \left[-\lambda _{1}\rho _{i}-\lambda _{2}\rho _{j}+\eta \varphi (\lambda _{1},\lambda _{2},\overline{\xi },\xi _{12})\right] \nonumber \end{aligned}$

can also be written

$\begin{matrix} P (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}, η) & = \frac{1}{η^{2}} \int \frac{d {\hat{λ}}_{1}}{2 π i} \frac{d {\hat{λ}}_{2}}{2 π i} \\ \times exp [- {\hat{λ}}_{1} \frac{ρ_{i}}{η} - {\hat{λ}}_{2} \frac{ρ_{j}}{η} + φ ({\hat{λ}}_{1}, {\hat{λ}}_{2}, \frac{\bar{ξ}}{η}, \frac{ξ_{12}}{η})] \\ = \frac{1}{η^{2}} P (\frac{ρ_{i}}{η}, \frac{ρ_{j}}{η}, \frac{\bar{ξ}}{η}, \frac{ξ_{12}}{η}, 1) \end{matrix}$ $\begin{aligned} P(\rho _{i},\rho _{j},\overline{\xi },\xi _{12},\eta )&=\frac{1}{\eta ^{2}} \int \frac{\mathrm{d}\hat{\lambda }_{1}}{2\pi \mathrm{i}}\, \frac{\mathrm{d}\hat{\lambda }_{2}}{2\pi \mathrm{i}} \nonumber \\&\times \exp \left[-\hat{\lambda }_{1}\frac{\rho _{i}}{\eta }-\hat{\lambda }_{2}\frac{\rho _{j}}{\eta }+\varphi \left(\hat{\lambda }_{1},\hat{\lambda }_{2},\frac{\overline{\xi }}{\eta },\frac{\xi _{12}}{\eta }\right)\right]\nonumber \\&=\frac{1}{\eta ^{2}}P\left(\frac{\rho _{i}}{\eta },\frac{\rho _{j}}{\eta },\frac{\overline{\xi }}{\eta },\frac{\xi _{12}}{\eta },1\right) \end{aligned}$ (C.42)

and that

$\begin{matrix} P_{b} (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}) = {\frac{\partial}{\partial η}}_{|_{η = 1}} P (ρ_{i}, ρ_{j}, \bar{ξ}, ξ_{12}, η) . \end{matrix}$ $\begin{aligned} P_{b}(\rho _{i},\rho _{j},\overline{\xi },\xi _{12})=\frac{\partial }{\partial \eta }_{\big \vert _{\eta =1}}P(\rho _{i},\rho _{j},\overline{\xi },\xi _{12},\eta ). \end{aligned}$ (C.43)

The final expression of the PDF of the relative densities can then be obtained by noting that applying a multiplicative factor λ_i to the moment-generating function is equivalent to the application of the operator ∂/∂ρ_i to the final expression this finally leads to the following forms:

$\begin{matrix} P_{s 1} ({\hat{ρ}}_{i}) = P ({\hat{ρ}}_{i}) + ξ_{s} [({\hat{ρ}}_{i} \frac{\partial}{\partial {\hat{ρ}}_{i}} + \frac{{\hat{ρ}}_{i}^{2}}{2} \frac{\partial^{2}}{\partial {\hat{ρ}}_{i}^{2}}) P ({\hat{ρ}}_{i}) \\ + (1 + {\hat{ρ}}_{i} \frac{\partial}{\partial {\hat{ρ}}_{i}}) (b ({\hat{ρ}}_{i}) P ({\hat{ρ}}_{i}))] \end{matrix}$ $\begin{aligned}&P_{s1}(\hat{\rho }_{i})=P(\hat{\rho }_{i})+\xi _{s}\,\left[\left(\hat{\rho }_{i}\frac{\partial }{\partial \hat{\rho }_{i}}+\frac{\hat{\rho }_{i}^{2}}{2}\frac{\partial ^{2}}{\partial \hat{\rho }_{i}^{2}}\right)P(\hat{\rho }_{i})\right.\nonumber \\&\qquad \qquad \left.+\left(1+\hat{\rho }_{i}\frac{\partial }{\partial \hat{\rho }_{i}}\right)(b(\hat{\rho }_{i})P(\hat{\rho }_{i}))\right]\end{aligned}$ (C.44)

$\begin{matrix} P_{s 1} ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) = P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) \\ + ξ_{s} [(1 + 2 \sum_{i} {\hat{ρ}}_{i} \frac{\partial}{\partial {\hat{ρ}}_{i}} + \frac{1}{2} \sum_{i} {\hat{ρ}}_{i}^{2} \frac{\partial^{2}}{\partial {\hat{ρ}}_{i}^{2}}) P ({\hat{ρ}}_{i}, {\hat{ρ}}_{j}) \\ + (2 + \sum_{i} {\hat{ρ}}_{i} \frac{\partial}{\partial {\hat{ρ}}_{i}}) P_{b} ({\hat{ρ}}_{i}, {\hat{ρ}}_{j})] \end{matrix}$ $\begin{aligned}&P_{s1}(\hat{\rho }_{i},\hat{\rho }_{j})=P(\hat{\rho }_{i},\hat{\rho }_{j}) \nonumber \\&\qquad \qquad \quad +\xi _{s}\,\left[\left(1+2\sum _{i}\hat{\rho }_{i}\frac{\partial }{\partial \hat{\rho }_{i}}+\frac{1}{2}\sum _{i}\hat{\rho }_{i}^{2} \frac{\partial ^{2}}{\partial \hat{\rho }_{i}^{2}}\right)P(\hat{\rho }_{i},\hat{\rho }_{j}) \right.\nonumber \\&\qquad \qquad \quad \left. +\left(2+\sum _{i}\hat{\rho }_{i}\frac{\partial }{\partial \hat{\rho }_{i}}\right)P_{b}(\hat{\rho }_{i},\hat{\rho }_{j})\right] \end{aligned}$ (C.45)

for the ${\hat{ρ}}_{i} = ρ_{i} / ρ_{s}$ ${\hat{\rho}}_{i}=\rho_{i}/\rho_{s}$ and

$\begin{matrix} P_{s 2} ({\bar{ρ}}_{i}) & = P ({\bar{ρ}}_{i}) + ξ_{s} [\frac{1}{2} \frac{\partial^{2}}{\partial {\bar{ρ}}_{i}^{2}} P ({\bar{ρ}}_{i}) \\ + \frac{\partial}{\partial {\bar{ρ}}_{i}} (b ({\bar{ρ}}_{i}) P ({\bar{ρ}}_{i}))] \end{matrix}$ $\begin{aligned} P_{s2}(\overline{\rho }_{i})&=P(\overline{\rho }_{i})+\xi _{s}\,\left[\frac{1}{2}\frac{\partial ^{2}}{\partial \overline{\rho }_{i}^{2}}P(\overline{\rho }_{i}) \right.\nonumber \\&\quad \left. +\frac{\partial }{\partial \overline{\rho }_{i}}(b(\overline{\rho }_{i})P(\overline{\rho }_{i}))\right]\end{aligned}$ (C.46)

$\begin{matrix} P_{s 2} ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) & = P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) + ξ_{s} [\frac{1}{2} \sum_{i} \frac{\partial^{2}}{\partial {\bar{ρ}}_{i}^{2}} P ({\bar{ρ}}_{i}, {\bar{ρ}}_{j}) \\ + \sum_{i} \frac{\partial}{\partial {\bar{ρ}}_{i}} P_{b} ({\hat{ρ}}_{i}, {\hat{ρ}}_{j})] \end{matrix}$ $\begin{aligned} P_{s2}(\overline{\rho }_{i},\overline{\rho }_{j})&= P(\overline{\rho }_{i},\overline{\rho }_{j})+\xi _{s}\,\left[\frac{1}{2}\sum _{i}\frac{\partial ^{2}}{\partial \overline{\rho }_{i}^{2}} P(\overline{\rho }_{i},\overline{\rho }_{j}) \right.\nonumber \\&\quad \left. +\sum _{i}\frac{\partial }{\partial \overline{\rho }_{i}}P_{b}(\hat{\rho }_{i},\hat{\rho }_{j})\right] \end{aligned}$ (C.47)

for ${\bar{ρ}}_{i} = ρ_{i} - ρ_{s} + 1$ ${\overline{\rho}}_{i}=\rho_{i}-\rho_{s}+1$ . These relations can then be applied to the expressions of the joint density such as $P_{MF}^{(8)} (ρ_{i}, ρ_{j} ; \bar{ξ}, ξ_{12})$ $P_{\mathrm{MF}}^{(8)}(\rho_{i},\rho_{j};{\overline{\xi}},\xi_{12})$ found in the previous subsection.

All Figures

	Fig. 1. Example of a realization of a Rayleigh-Levy walk. Points mark the end point of each displacement. They are clearly correlated.
In the text

	Fig. 2. One-point density PDF obtained with top-hat filters compared with the theoretical predictions, Eq. (59). The values of $\bar{ξ}$ ${\overline{\xi}}$ are 0.8 and 1.09 for the blue and red curves, respectively, corresponding to two different values of l₀. The bottom panel shows the residuals. Departure from theory might be due to binning and/or to the finite number of samples.
In the text

Fig. 3.

Measured variance of the density PDF, i.e., diagonal elements of the covariance matrix, in sets 𝒜 for α = 0.5 and different prescription of the measured density. From left to right, raw density ρ_i, scaled density ${\hat{ρ}}_{i}$ ${\hat{\rho}}_{i}$ , and scaled density ${\bar{ρ}}_{i}$ ${\overline{\rho}}_{i}$ . The blue dots and solid lines are from the mean-field analytical expressions, and the large gold symbols are from the numerical simulations. The dashed black lines are what is expected from the large-scale leading contribution. The variance at cell scale is about 1.09, and the variance at sample scale, ${\bar{ξ}}_{s}$ ${\overline{\xi}}_{s}$ , is about 0.09.

In the text

	Fig. 4. Resulting reduced covariance matrix for the three types of observables for set 𝒜. The covariance matrix is dominated by its leading eigenvalue and direction, leading to this typical butterfly shape of the reduced covariance matrix.
In the text

	Fig. 5. Measured variance of the density PDF obtained for set 𝒝). Symbols are the same as in Fig. 3.
In the text

Fig. 6.

Measured variance of the density PDF, i.e., diagonal elements of the covariance matrix, in sets 𝒜 and comparisons with proposed approximate forms. The yellow line and symbols are the results obtained in the numerical experiments. The dot-dashed line is the prediction derived from relation (86), and the dashed gray line shows the prediction from Eq. (85). The dot-dashed black lines correspond to the large-scale contributions.

In the text

	Fig. 7. Behavior of the first eigenvector with the same color-coding as in Fig. 6. The dashed black lines are the large-scale prediction, b_#(ρ_i)P(ρ_i) appropriately normalized. The size of the data vector is 30.
In the text

Fig. 8.

Performances of the approximate forms of the covariance matrix in terms of rigenvalues and χ²-distributions. Top panel: eigenvalues of the covariance matrices (rebinned into six bins) compared to what can be obtained from the proposed approximate forms; same color-coding as for Fig. 6. The χ² distributions are shown in the bottom panel. Model (86) reproduces the very same χ² distributions. Model (85), in gray, is not as accurate and tends to slightly overestimate the χ². This latter behavior is amplified when a larger number of bins is used.

In the text

Fig. 9.

Scale dependence of the matter correlation functions for a realistic cosmological model (cosmological parameters derived from Plank, Planck Collaboration VI 2020) for the 3D density and the projected density (for a uniformly sampled survey with a depth of about 800 h⁻¹Mpc between z = 0.75 and z = 1.25). The top panel shows $r_{d}^{3} ξ (r_{d})$ $r_{d}^{3}\xi(r_{d})$ (solid blue line) and $r_{d}^{3} ξ^{2} (r_{d})$ $r_{d}^{3}\xi^{2}(r_{d})$ (dashed red line) for the 3D density field, and the bottom panel shows $θ_{d}^{2} ξ (θ_{d})$ $\theta^{2}_{d}\xi(\theta_{d})$ and $θ_{d}^{2} ξ^{2} (θ_{d})$ $\theta^{2}_{d}\xi^{2}(\theta_{d})$ for the projected density. In both cases, the average value of the first moment of the two-point correlation function is dominated by large-distance contributions, whereas short-distance contributions dominate the second moment, assuming survey sizes of about 100 h⁻¹ Mpc or above.

In the text

Fig. B.1.

Diagrammatic visualization of the bias functions. The function φ₀(λ) is the generating function of all trees within one cell, φ₁(λ) of all trees within one cell with one external lines, and φ₂(λ) with two external lines. The resulting connected diagrams up to second order in ξ₁₂ are thus those presented here. Two φ₂ generating functions cannot be conntected as that would induce a loop contribution. This reflects the underlying tree structure.

In the text

	Fig. B.2. Joint CGF as a function of d and for different values of λ₁, λ₂: from bottom to top, λ₁ = λ₂ = −0.1, λ₁ = λ₂ = 0.2, λ₁ = λ₂ = 0.5. The solid blue lines correspond to the two-cell mean-field expression, (C.13). The predictions given in Eqs. (B.2) and (B.15) are shown as dashed red lines and dotted black lines. The shaded area is the region of overlapping cells.
In the text

Fig. C.1.

Performances of the perturbative expansions of the joint PDF $P_{MF} (ρ_{1}, ρ_{2} ; \bar{ξ}, ξ_{12})$ $P_{\mathrm{MF}}(\rho_{1},\rho_{2};{\overline{\xi}},\xi_{12})$ in the mean-field approximation either for the ξ₁₂ expansion (open blue dots) or the $Δ_{ξ}^{1 / 2}$ $\Delta^{1/2}_{\xi}$ expansion (red dots) up to 11th and 10th order, respectively. The comparisons are made for ρ₁ = ρ₂ = 2 and $\bar{ξ} = 1$ ${\overline{\xi}}=1$ (top panel) and for ρ₁ = 0.5, ρ₂ = 3.5 and $\bar{ξ} = 1$ ${\overline{\xi}}=1$ (bottom panel) and for ξ₁₂ equalling 0.1, 0.3, 0.5, 0.7, and 0.9.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Abbott, T. M. C., Abdalla, F. B., Alarcon, A., et al. 2018, Phys. Rev. D, 98, 043526 [NASA ADS] [CrossRef] [Google Scholar]

[2] Amendola, L., Appleby, S., Avgoustidis, A., et al. 2018, Liv. Rev. Rel., 21, 2 [Google Scholar]

[3] Balian, R., & Schaeffer, R. 1989, A&A, 220, 1 [NASA ADS] [Google Scholar]

[4] Barthelemy, A., Codis, S., & Bernardeau, F. 2021, MNRAS, 503, 5204 [Google Scholar]

[5] Bernardeau, F. 1992, ApJ, 392, 1 [NASA ADS] [CrossRef] [Google Scholar]

[6] Bernardeau, F. 1996, A&A, 312, 11 [NASA ADS] [Google Scholar]

[7] Bernardeau, F. 2013, ArXiv e-prints [arXiv:1311.2724] [Google Scholar]

[8] Bernardeau, F., & Reimberg, P. 2016, Phys. Rev. D, 94, 063520 [NASA ADS] [CrossRef] [Google Scholar]

[9] Bernardeau, F., & Schaeffer, R. 1992, A&A, 255, 1 [NASA ADS] [Google Scholar]

[10] Bernardeau, F., & Schaeffer, R. 1999, A&A, 349, 697 [NASA ADS] [Google Scholar]

[11] Bernardeau, F., & Valageas, P. 2000, A&A, 364, 1 [NASA ADS] [Google Scholar]

[12] Bernardeau, F., Pichon, C., & Codis, S. 2014, Phys. Rev. D, 90, 103519 [NASA ADS] [CrossRef] [Google Scholar]

[13] Boyle, A., Uhlemann, C., Friedrich, O., et al. 2021, MNRAS, 505, 2886 [NASA ADS] [CrossRef] [Google Scholar]

[14] Brouwer, M. M., Demchenko, V., Harnois-Déraps, J., et al. 2018, MNRAS, 481, 5189 [NASA ADS] [CrossRef] [Google Scholar]

[15] Carron, J. 2011, ApJ, 738, 86 [NASA ADS] [CrossRef] [Google Scholar]

[16] Carron, J., & Neyrinck, M. C. 2012, ApJ, 750, 28 [NASA ADS] [CrossRef] [Google Scholar]

[17] Codis, S., Pichon, C., Bernardeau, F., Uhlemann, C., & Prunet, S. 2016a, MNRAS, 460, 1549 [NASA ADS] [CrossRef] [Google Scholar]

[18] Codis, S., Bernardeau, F., & Pichon, C. 2016b, MNRAS, 460, 1598 [NASA ADS] [CrossRef] [Google Scholar]

[19] Colombi, S., Bouchet, F. R., & Schaeffer, R. 1995, ApJS, 96, 401 [NASA ADS] [CrossRef] [Google Scholar]

[20] Friedrich, O., & Eifler, T. 2018, MNRAS, 473, 4150 [NASA ADS] [CrossRef] [Google Scholar]

[21] Friedrich, O., Gruen, D., DeRose, J., et al. 2018, Phys. Rev. D, 98, 023508 [Google Scholar]

[22] Friedrich, O., Uhlemann, C., Villaescusa-Navarro, F., et al. 2020, MNRAS, 498, 464 [NASA ADS] [CrossRef] [Google Scholar]

[23] Fry, J. N. 1984a, ApJ, 277, L5 [NASA ADS] [CrossRef] [Google Scholar]

[24] Fry, J. N. 1984b, ApJ, 279, 499 [NASA ADS] [CrossRef] [Google Scholar]

[25] Gruen, D., Friedrich, O., Krause, E., et al. 2018, Phys. Rev. D, 98 [Google Scholar]

[26] Heymans, C., Tröster, T., Asgari, M., et al. 2021, A&A, 646, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[27] Hikage, C., Oguri, M., Hamana, T., et al. 2019, PASJ, 71, 43 [Google Scholar]

[28] Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]

[29] Kaiser, N. 1998, ApJ, 498, 26 [NASA ADS] [CrossRef] [Google Scholar]

[30] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]

[31] Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]

[32] Peebles, P. J. E. 1980, The Large-scale Structure of the Universe (Princeton University Press) [Google Scholar]

[33] Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[34] Repp, A., & Szapudi, I. 2020, MNRAS, 498, L125 [NASA ADS] [CrossRef] [Google Scholar]

[35] Repp, A., & Szapudi, I. 2021, MNRAS, 500, 3631 [Google Scholar]

[36] Schneider, P. 1996, MNRAS, 283, 837 [Google Scholar]

[37] Szapudi, I., & Colombi, S. 1996, ApJ, 470, 131 [NASA ADS] [CrossRef] [Google Scholar]

[38] Szapudi, I., Colombi, S., & Bernardeau, F. 1999, MNRAS, 310, 428 [NASA ADS] [CrossRef] [Google Scholar]

[39] Takada, M., & Hu, W. 2013, Phys. Rev. D, 87 [CrossRef] [Google Scholar]

[40] Touchette, H. 2011, ArXiv e-prints [arXiv:1106.4146] [Google Scholar]

[41] Uhlemann, C., Friedrich, O., Villaescusa-Navarro, F., Banerjee, A., & Codis, S. 2020, MNRAS, 495, 4006 [NASA ADS] [CrossRef] [Google Scholar]

[42] Valageas, P. 2002, A&A, 382, 412 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[43] White, S. D. M. 1979, MNRAS, 186, 145 [NASA ADS] [CrossRef] [Google Scholar]