Estimating the galaxy two-point correlation function using a split random catalog

E. Keihänen; H. Kurki-Suonio; V. Lindholm; A. Viitanen; A.-S. Suur-Uski; V. Allevato; E. Branchini; F. Marulli; P. Norberg; D. Tavagnacco; S. de la Torre; J. Valiviita; M. Viel; J. Bel; M. Frailis; A. G. Sánchez

doi:10.1051/0004-6361/201935828

Home

All issues

Volume 631 (November 2019)

A&A, 631 (2019) A73

Full HTML

Free Access

Issue		A&A Volume 631, November 2019


Article Number		A73
Number of page(s)		11
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/201935828
Published online		22 October 2019

A&A 631, A73 (2019)

Estimating the galaxy two-point correlation function using a split random catalog

E. Keihänen¹, H. Kurki-Suonio¹, V. Lindholm¹, A. Viitanen¹, A.-S. Suur-Uski¹, V. Allevato²^,1, E. Branchini³, F. Marulli⁴^,5^,6, P. Norberg⁷, D. Tavagnacco⁸, S. de la Torre⁹, J. Valiviita¹, M. Viel¹⁰^,11^,8^,14, J. Bel¹², M. Frailis⁸ and A. G. Sánchez¹³

¹ Department of Physics and Helsinki Institute of Physics, University of Helsinki, Gustaf Hällströmin katu 2, 00014 Helsinki, Finland
e-mail: elina.keihanen@helsinki.fi
² Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126 Pisa, Italy
³ Department of Mathematics and Physics, Roma Tre University, Via della Vasca Navale 84, 00146 Rome, Italy
⁴ Dipartimento di Fisica e Astronomia – Alma Mater Studiorum Università di Bologna, Via Piero Gobetti 93/2, 40129 Bologna, Italy
⁵ INAF – Osservatorio di Astrofisica e Scienza dello Spazio di Bologna, Via Piero Gobetti 93/3, 40129 Bologna, Italy
⁶ INFN – Sezione di Bologna, Viale Berti Pichat 6/2, 40127 Bologna, Italy
⁷ ICC & CEA, Department of Physics, Durham University, South Road, Durham DH1 3LE, UK
⁸ INAF, Osservatorio Astronomico di Trieste, Via Tiepolo 11, 34131 Trieste, Italy
⁹ Aix Marseille Univ., CNRS, CNES, LAM, Marseille, France
¹⁰ SISSA, International School for Advanced Studies, Via Bonomea 265, 34136 Trieste, TS, Italy
¹¹ INFN, Sezione di Trieste, Via Valerio 2, 34127 Trieste, TS, Italy
¹² Aix Marseille Univ., Université de Toulon, CNRS, CPT, Marseille, France
¹³ Max-Planck-Institut für Extraterrestrische Physik, Postfach 1312, Giessenbachstr., 85741 Garching, Germany
¹⁴ IFPU – Institute for Fundamental Physics of the Universe, Via Beirut 2, 34014 Trieste, Italy

Received: 3 May 2019
Accepted: 24 July 2019

Abstract

The two-point correlation function of the galaxy distribution is a key cosmological observable that allows us to constrain the dynamical and geometrical state of our Universe. To measure the correlation function we need to know both the galaxy positions and the expected galaxy density field. The expected field is commonly specified using a Monte-Carlo sampling of the volume covered by the survey and, to minimize additional sampling errors, this random catalog has to be much larger than the data catalog. Correlation function estimators compare data–data pair counts to data–random and random–random pair counts, where random–random pairs usually dominate the computational cost. Future redshift surveys will deliver spectroscopic catalogs of tens of millions of galaxies. Given the large number of random objects required to guarantee sub-percent accuracy, it is of paramount importance to improve the efficiency of the algorithm without degrading its precision. We show both analytically and numerically that splitting the random catalog into a number of subcatalogs of the same size as the data catalog when calculating random–random pairs and excluding pairs across different subcatalogs provides the optimal error at fixed computational cost. For a random catalog fifty times larger than the data catalog, this reduces the computation time by a factor of more than ten without affecting estimator variance or bias.

Key words: large-scale structure of Universe / cosmology: observations / methods: statistical / methods: data analysis

© ESO 2019

1. Introduction

The spatial distribution of luminous matter in the Universe is a key diagnostic for studying cosmological models and the physical processes involved in the assembly of structure. In particular, light from galaxies is a robust tracer of the overall matter distribution, whose statistical properties can be predicted by cosmological models. Two-point correlation statistics are very effective tools for compressing the cosmological information encoded in the spatial distribution of the mass in the Universe. In particular, the two-point correlation function in configuration space has emerged as one of the most popular cosmological probes. Its success stems from the presence of characterized features that can be identified, measured, and effectively compared to theoretical models to extract clean cosmological information.

One such feature is baryon acoustic oscillations (BAOs), which imprint a characteristic scale in the two-point correlation function that can be used as a standard ruler. After the first detection in the two-point correlation function of SDSS DR3 and 2dFGRS galaxy catalogs (Eisenstein et al. 2005; Cole et al. 2005), the BAO signal was identified, with different degrees of statistical significance, and has since been used to constrain the expansion history of the Universe in many spectroscopic galaxy samples (see e.g., Percival et al. 2010; Blake et al. 2011; Beutler et al. 2011; Anderson et al. 2012, 2014; Ross et al. 2015, 2017; Alam et al. 2017; Vargas-Magaña et al. 2018; Bautista et al. 2018; Ata et al. 2018). Several of these studies did not focus on the BAO feature only but also analyzed the anisotropies in the two-point correlation function induced by the peculiar velocities (Kaiser 1987), the so-called redshift space distortions (RSD), and by assigning cosmology-dependent distances to the observed redshifts (the Alcock & Paczyński 1979 test). For RSD analyses, see also, for example, Peacock et al. (2001), Guzzo et al. (2008), Beutler et al. (2012), Reid et al. (2012), de la Torre et al. (2017), Pezzotta et al. (2017), Zarrouk et al. (2018), Hou et al. (2018), and Ruggeri et al. (2019).

Methods to estimate the galaxy two-point correlation function (2PCF) ξ(r) from survey data are based on its definition as the excess probability of finding a galaxy pair. One counts from the data (D) catalog the number DD(r) of pairs of galaxies with separation x₂ − x₁ ∈ r, where r is a bin of separation vectors, and compares it to the number of pairs RR(r) in a corresponding randomly generated (R) catalog and to the number of data-random pairs DR(r). The bin may be a 1D ( $r \pm \frac{1}{2} Δ r$ $r\pm\frac{1}{2}\Delta r$ ), 2D, or a 3D bin. In the 1D case, r is the length of the separation vector and Δr is the width of the bin. From here on, “separation r” indicates that the separation falls in this bin.

Several estimators of the 2PCF have been proposed by Hewett (1982), Davis & Peebles (1983), Hamilton (1993), and Landy & Szalay (1993), building on the original Peebles & Hauser (1974) proposal. These correspond to different combinations of the DD, DR, and RR counts to obtain a 2PCF estimate $\hat{ξ} (r)$ $\hat{\xi}({\boldsymbol{r}})$ ; see Kerscher (1999) and Kerscher et al. (2000) for more estimators. The Landy–Szalay (Landy & Szalay 1993) estimator

$\begin{matrix} {\hat{ξ}}_{LS} (r) : = \frac{N_{r}^{'} (N_{r}^{'} - 1)}{N_{d} (N_{d} - 1)} \frac{D D (r)}{R R (r)} - \frac{N_{r}^{'} - 1}{N_{d}} \frac{D R (r)}{R R (r)} + 1, \end{matrix}$ $\begin{aligned} \hat{\xi }_{\rm LS}({\boldsymbol{r}}) := \frac{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}{N_{\rm d}(N_{\rm d}-1)}\frac{DD({\boldsymbol{r}})}{RR({\boldsymbol{r}})} - \frac{N^{\prime }_{\rm r}-1}{N_{\rm d}}\frac{DR({\boldsymbol{r}})}{RR({\boldsymbol{r}})} + 1 , \end{aligned}$ (1)

(we call this method “standard LS” in the following) is the most commonly used, since it provides the minimum variance when |ξ|≪1 and is unbiased in the limit $N'_{r} \to \infty$ $N^\prime_{\mathrm{r}}\rightarrow\infty$ . Here N_d is the size (number of objects) of the data catalog and $N'_{r}$ $N^\prime_{\mathrm{r}}$ is the size of the random catalog. We define $M_{r} : = N_{r}^{'} / N_{d}$ $M_{\rm r} := N^{\prime}_{\rm r}/N_{\rm d}$ . To minimize random error from the random catalog, M_r ≫ 1 should be used (for a different approach, see Demina et al. 2018).

One is usually interested in ξ(r) only up to some r_max ≪ L_max (the maximum separation in the survey), and therefore pairs with larger separations can be skipped. Efficient implementations of the LS estimator involve pre-ordering of the catalogs through kd-tree, chain-mesh, or other algorithms (e.g., Moore et al. 2000; Alonso 2012; Jarvis 2015; Marulli et al. 2016) to facilitate this. The computational cost is then roughly proportional to the actual number of pairs with separation r ≤ r_max.

The correlation function is small for large separations, and in cosmological surveys r_max is large enough so that for most pairs |ξ(r)| ≪ 1. The fraction f of DD pairs with r ≤ r_max is therefore not very different from the fraction of DR or RR pairs with r ≤ r_max. The computational cost is dominated by the part proportional to the total number of pairs needed, $\frac{1}{2} f N_{d} (N_{d} - 1) + f N_{d} N_{r} + \frac{1}{2} f N'_{r} (N'_{r} - 1) \approx \frac{1}{2} f N_{d}^{2} (1 + 2 M_{r} + M_{r}^{2})$ $\frac{1}{2} fN_{\mathrm{d}}(N_{\mathrm{d}}-1) + f\,N_{\mathrm{d}}N_{\mathrm{r}} + \frac{1}{2} fN^\prime_{\mathrm{r}}(N^\prime_{\mathrm{r}}-1) \approx \frac{1}{2} fN_{\mathrm{d}}^2(1 + 2M_{\mathrm{r}} + M_{\mathrm{r}}^2)$ , which in turn is dominated by the RR pairs as M_r ≫ 1. The smaller number of DR pairs contribute much more to the error of the estimate than the large number of RR pairs, whereas the cost is dominated by RR. Thus, a significant saving of computation time with an insignificant loss of accuracy may be achieved by counting only a subset of RR pairs, while still counting the full set (up to r_max) of DR pairs.

A good way to achieve this is to use many small (i.e, low-density) R catalogs instead of one large (high-density) catalog (Landy & Szalay 1993; Wall & Jenkins 2012; Slepian & Eisenstein 2015), or, equivalently, to split an already generated large R catalog into M_s small ones for the calculation of RR pairs while using the full R catalog for the DR counts. This method has been used by some practitioners (e.g., Zehavi et al. 2011¹), but this is usually not documented in the literature. One might also consider obtaining a similar cost saving by diluting (subsampling) the R catalog for RR counts, but, as we show below, this is not a good idea. We refer to these two cost-saving methods as “split” and “dilution”.

In this work we theoretically derive the additional covariance and bias due to the size and treatment of the R catalog; test these predictions numerically with mock catalogs representative of next-generation datasets, such as the spectroscopic galaxy samples that will be obtained by the future Euclid satellite mission (Laureijs et al. 2011); and show that the “split” method, while reducing the computational cost by a large factor, retains the advantages of the LS estimator.

We follow the approach of Landy & Szalay (1993; hereafter, LS93), but generalize it in a number of ways: In particular, since we focus on the effect of the random catalog, we do not work in the limit M_r → ∞. Also, we calculate covariances, not just variances, and make fewer approximations (see Sect. 2.2).

The layout of the paper is as follows. In Sect. 2 we derive theoretical results for bias and covariance. In Sect. 3 we focus on the split LS estimator and its optimization. In Sect. 4 we test the different estimators with mock catalogs. Finally, we discuss the results and present our conclusions in Sect. 5.

2. Theoretical results: bias and covariance

2.1. General derivation

We follow the derivation and notations in LS93 but extend to the case that includes random counts covariance. We consider the survey volume as divided into K microcells (very small subvolumes) and work in the limit K → ∞, which means that no two objects will ever be located within the same microcell.

Here, α, β, and γ represent the relative deviation of the DD(r), DR(r), and RR(r) counts from their expectation values (mean values over an infinite number of independent realizations):

$\begin{matrix} D D (r) = : ⟨ D D (r) ⟩ [1 + α (r)], \\ D R (r) = : ⟨ D R (r) ⟩ [1 + β (r)], \\ R R (r) = : ⟨ R R (r) ⟩ [1 + γ (r)] . \end{matrix}$ $\begin{aligned}&DD({\boldsymbol{r}}) =: \langle DD({\boldsymbol{r}}) \rangle [1+\alpha ({\boldsymbol{r}})],\nonumber \\&DR({\boldsymbol{r}}) =: \langle DR({\boldsymbol{r}}) \rangle [1+\beta ({\boldsymbol{r}})],\nonumber \\&RR({\boldsymbol{r}}) =: \langle RR({\boldsymbol{r}}) \rangle [1+\gamma ({\boldsymbol{r}})]. \end{aligned}$ (2)

By definition ⟨α⟩ = ⟨β⟩ = ⟨γ⟩ = 0. The factors α, β, and γ represent fluctuations in the pair counts, which arise as a result of a Poisson process. As long as the mean pair counts per bin are large (≫1) the relative fluctuations will be small. We calculate up to second order in α, β, and γ, and ignore the higher-order terms (in the limit M_r → ∞, γ → 0, so LS93 set γ = 0 at this point).

The expectation values for the pair counts are:

$\begin{matrix} ⟨ D D (r) ⟩ = \frac{1}{2} N_{d} (N_{d} - 1) G^{p} (r) [1 + ξ (r)], \\ ⟨ D R (r) ⟩ = N_{d} N_{r} G^{p} (r), \\ ⟨ R R (r) ⟩ = \frac{1}{2} N_{r}^{'} (N_{r}^{'} - 1) G^{p} (r), \end{matrix}$ $\begin{aligned}&\langle DD({\boldsymbol{r}})\rangle = \frac{1}{2} N_{\rm d}(N_{\rm d}-1)G^\mathrm{p}({\boldsymbol{r}})[1+\xi ({\boldsymbol{r}})],\nonumber \\&\langle DR({\boldsymbol{r}})\rangle = N_{\rm d}N_{\rm r} G^\mathrm{p}({\boldsymbol{r}}),\nonumber \\&\langle RR({\boldsymbol{r}})\rangle = \frac{1}{2} N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)G^\mathrm{p}({\boldsymbol{r}}) , \end{aligned}$ (3)

where ξ(r) is the correlation function normalized to the actual number density of galaxies in the survey and

$\begin{matrix} G^{p} (r) : = \frac{2}{K^{2}} \sum_{i < j}^{K} Θ^{ij} (r) \end{matrix}$ $\begin{aligned} G^\mathrm{p}({\boldsymbol{r}}) := \frac{2}{K^2} \sum _{i<j}^K \Theta ^{ij}({\boldsymbol{r}}) \end{aligned}$ (4)

is the fraction of microcell pairs with separation r. Here Θ^ij(r): = 1 if x_i − x_j falls in the r-bin, otherwise it is equal to zero.

The expectation value of the LS estimator (1) is

$\begin{matrix} ⟨ {\hat{ξ}}_{LS} ⟩ & = (1 + ξ) 〈 \frac{1 + α}{1 + γ} 〉 - 2 〈 \frac{1 + β}{1 + γ} 〉 + 1 \\ ≂ ξ + (ξ - 1) ⟨ γ^{2} ⟩ + 2 ⟨ β γ ⟩ . \end{matrix}$ $\begin{aligned} \langle \hat{\xi }_{\rm LS}\rangle&= (1+\xi )\left\langle \frac{1+\alpha }{1+\gamma }\right\rangle - 2\left\langle \frac{1+\beta }{1+\gamma }\right\rangle + 1 \nonumber \\&\eqsim \xi + (\xi -1)\langle \gamma ^2\rangle + 2\langle \beta \gamma \rangle . \end{aligned}$ (5)

A finite R catalog thus introduces a (small) bias. (In LS93, γ = 0, so the estimator is unbiased in this limit). This expression is calculated to second order in α, β, and γ (we denote equality to second order by “≂”). Calculation to higher order is beyond the scope of this work. Since data and random catalogs are independent, ⟨αγ⟩ = 0.

We introduce shorthand notations ⟨α₁α₂⟩ for ⟨α(r₁)α(r₂)⟩, ⟨DD₁DD₂⟩ for ⟨DD(r₁)DD(r₂)⟩, and similarly for other terms.

For the covariance we get

$\begin{matrix} Cov [{\hat{ξ}}_{LS} (r_{1}), {\hat{ξ}}_{LS} (r_{2})] & \equiv 〈 {\hat{ξ}}_{LS} (r_{1}) {\hat{ξ}}_{LS} (r_{2}) 〉 - 〈 {\hat{ξ}}_{LS} (r_{1}) 〉 〈 {\hat{ξ}}_{LS} (r_{2}) 〉 \\ ≂ (1 + ξ_{1}) (1 + ξ_{2}) ⟨ α_{1} α_{2} ⟩ + 4 ⟨ β_{1} β_{2} ⟩ \\ + (1 - ξ_{1}) (1 - ξ_{2}) ⟨ γ_{1} γ_{2} ⟩ \\ - 2 (1 + ξ_{1}) ⟨ α_{1} β_{2} ⟩ - 2 (1 + ξ_{2}) ⟨ β_{1} α_{2} ⟩ \\ - 2 (1 - ξ_{1}) ⟨ γ_{1} β_{2} ⟩ - 2 (1 - ξ_{2}) ⟨ β_{1} γ_{2} ⟩ . \end{matrix}$ $\begin{aligned} \mathrm{Cov}\left[\hat{\xi }_{\rm LS}({\boldsymbol{r}}_1),\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right]&\equiv \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_1)\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right\rangle - \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_1)\right\rangle \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right\rangle \nonumber \\&\eqsim (1+\xi _1)(1+\xi _2)\langle \alpha _1\alpha _2\rangle + 4\langle \beta _1\beta _2\rangle \nonumber \\&\quad + (1-\xi _1)(1-\xi _2)\langle \gamma _1\gamma _2\rangle \nonumber \\&\quad - 2(1+\xi _1)\langle \alpha _1\beta _2\rangle - 2(1+\xi _2)\langle \beta _1\alpha _2\rangle \nonumber \\&\quad - 2(1-\xi _1)\langle \gamma _1\beta _2\rangle - 2(1-\xi _2)\langle \beta _1\gamma _2\rangle . \end{aligned}$ (6)

Terms with γ represent additional variance due to finite $N_{r}^{'}$ $N^{\prime}_{\rm r}$ , and are new compared to those of LS93. Also, ⟨β₁β₂⟩ collects an additional contribution, which we denote by Δ⟨β₁β₂⟩, from variations in the random field (see Sect. 2.3). The cross terms α₁β₂ and α₂β₁, instead depend linearly on the random field, and average to the $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ result. The additional contribution due to finite $N_{r}^{'}$ $N^{\prime}_{\rm r}$ is thus

$\begin{matrix} Δ Cov [{\hat{ξ}}_{LS} (r_{1}), {\hat{ξ}}_{LS} (r_{2})] ≂ & 4 Δ ⟨ β_{1} β_{2} ⟩ + (1 - ξ_{1}) (1 - ξ_{2}) ⟨ γ_{1} γ_{2} ⟩ \\ - 2 (1 - ξ_{1}) ⟨ γ_{1} β_{2} ⟩ - 2 (1 - ξ_{2}) ⟨ β_{1} γ_{2} ⟩ . \end{matrix}$ $\begin{aligned} \Delta \mathrm{Cov}\left[\hat{\xi }_{\rm LS}({\boldsymbol{r}}_1),\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right] \eqsim &4\Delta \langle \beta _1\beta _2\rangle + (1-\xi _1)(1-\xi _2)\langle \gamma _1\gamma _2\rangle \nonumber \\& - 2(1-\xi _1)\langle \gamma _1\beta _2\rangle - 2(1-\xi _2)\langle \beta _1\gamma _2\rangle . \end{aligned}$ (7)

From (2),

$\begin{matrix} ⟨ D D_{1} D D_{2} ⟩ = ⟨ D D_{1} ⟩ ⟨ D D_{2} ⟩ (1 + ⟨ α_{1} α_{2} ⟩), \end{matrix}$ $\begin{aligned} \langle DD_1\ DD_2 \rangle = \langle DD_1\rangle \langle DD_2\rangle \left(1+\langle \alpha _1\alpha _2\rangle \right), \end{aligned}$ (8)

and so on, so that the covariances of the deviations are obtained from

$\begin{matrix} ⟨ α_{1} α_{2} ⟩ & = \frac{⟨ D D_{1} D D_{2} ⟩ - ⟨ D D_{1} ⟩ ⟨ D D_{2} ⟩}{⟨ D D_{1} ⟩ ⟨ D D_{2} ⟩}, \\ ⟨ β_{1} β_{2} ⟩ & = \frac{⟨ D R_{1} D R_{2} ⟩ - ⟨ D R_{1} ⟩ ⟨ D R_{2} ⟩}{⟨ D R_{1} ⟩ ⟨ D R_{2} ⟩}, \\ ⟨ γ_{1} γ_{2} ⟩ & = \frac{⟨ R R_{1} R R_{2} ⟩ - ⟨ R R_{1} ⟩ ⟨ R R_{2} ⟩}{⟨ R R_{1} ⟩ ⟨ R R_{2} ⟩}, \\ ⟨ α_{1} β_{2} ⟩ & = \frac{⟨ D D_{1} D R_{2} ⟩ - ⟨ D D_{1} ⟩ ⟨ D R_{2} ⟩}{⟨ D D_{1} ⟩ ⟨ D R_{2} ⟩}, \\ ⟨ β_{1} γ_{2} ⟩ & = \frac{⟨ D R_{1} R R_{2} ⟩ - ⟨ D R_{1} ⟩ ⟨ R R_{2} ⟩}{⟨ D R_{1} ⟩ ⟨ R R_{2} ⟩}, \\ ⟨ α_{1} γ_{2} ⟩ & = \frac{⟨ D D_{1} R R_{2} ⟩ - ⟨ D D_{1} ⟩ ⟨ R R_{1} ⟩}{⟨ D D_{1} ⟩ ⟨ R R_{2} ⟩} = 0 . \end{matrix}$ $\begin{aligned} \langle \alpha _1\alpha _2\rangle&= \frac{\langle DD_1\ DD_2 \rangle - \langle DD_1\rangle \langle DD_2\rangle }{\langle DD_1\rangle \langle DD_2\rangle },\nonumber \\ \langle \beta _1\beta _2\rangle&= \frac{\langle DR_1\ DR_2 \rangle - \langle DR_1\rangle \langle DR_2\rangle }{\langle DR_1\rangle \langle DR_2\rangle },\nonumber \\ \langle \gamma _1\gamma _2\rangle&= \frac{\langle RR_1\ RR_2 \rangle - \langle RR_1\rangle \langle RR_2\rangle }{\langle RR_1\rangle \langle RR_2\rangle } ,\nonumber \\ \langle \alpha _1\beta _2\rangle&= \frac{\langle DD_1\ DR_2 \rangle - \langle DD_1 \rangle \langle DR_2 \rangle }{\langle DD_1 \rangle \langle DR_2 \rangle } ,\nonumber \\ \langle \beta _1\gamma _2\rangle&= \frac{\langle DR_1\ RR_2 \rangle - \langle DR_1 \rangle \langle RR_2 \rangle }{\langle DR_1 \rangle \langle RR_2 \rangle } ,\nonumber \\ \langle \alpha _1\gamma _2\rangle&= \frac{\langle DD_1\ RR_2 \rangle - \langle DD_1 \rangle \langle RR_1 \rangle }{\langle DD_1 \rangle \langle RR_2 \rangle } = 0. \end{aligned}$ (9)

2.2. Quadruplets, triplets, and approximations

We use

$\begin{matrix} G_{12}^{t} : = G^{t} (r_{1}, r_{2}) : = \frac{1}{K^{3}} \sum_{ijk}^{*} Θ_{1}^{ik} Θ_{2}^{jk} \end{matrix}$ $\begin{aligned} G^\mathrm{t}_{12} := G^\mathrm{t}({\boldsymbol{r}}_1,{\boldsymbol{r}}_2) := \frac{1}{K^3}\sum ^*_{ijk}\Theta ^{ik}_1\Theta ^{jk}_2 \end{aligned}$ (10)

to denote the fraction of ordered microcell triplets, where x_i − x_k ∈ r₁ and x_j − x_k ∈ r₂. The notation ∑^* means that only terms where all indices (microcells) are different are included. Here $G_{12}^{t}$ $G^{\mathrm{t}}_{12}$ is of the same magnitude as $G_{1}^{p} G_{2}^{p}$ $G^{\rm p}_1\,G^{\rm p}_2$ but is larger.

Appendix A gives examples of how the ⟨DD₁DD₂⟩ and so on in (9) are calculated. These covariances involve expectation values ⟨n_in_jn_ln_k⟩, where n_i is the number of objects (0 or 1) in microcell i and so on, and only cases where the four microcells are separated pairwise by r₁ and r₂ are included. If all four microcells i, j, k, and l are different, we call this case a quadruplet; it consists of two pairs with separations r₁ and r₂. If two of the indices, that is, microcells, are equal, we have a triplet with a center cell (the equal indices) and two end cells separated from the center by r₁ and r₂.

We make the following three approximations:

For microcell quadruplets, the correlations between unconnected cells are approximated by zero on average.
Three-point correlations vanish.
The part of four-point correlations that does not arise from the two-point correlations vanishes.

With approximations (2) and (3), we have for the expectation value of a galaxy triplet

$\begin{matrix} ⟨ n_{i} n_{j} n_{k} ⟩ \propto 1 + ξ_{ij} + ξ_{jk} + ξ_{ik}, \end{matrix}$ $\begin{aligned} \langle n_in_jn_k\rangle \propto 1+\xi _{ij}+\xi _{jk}+\xi _{ik} , \end{aligned}$ (11)

where ξ_ij := ξ(x_j − x_i), and for a quadruplet

$\begin{matrix} ⟨ n_{i} n_{j} n_{k} n_{l} ⟩ \propto 1 + ξ_{ij} + ξ_{jk} + ξ_{ik} + ξ_{il} + ξ_{jl} + ξ_{kl} + ξ_{ij} ξ_{kl} + ξ_{ik} ξ_{jl} + ξ_{il} ξ_{jk} . \end{matrix}$ $\begin{aligned} \langle n_in_jn_kn_l\rangle \propto 1+\xi _{ij}+\xi _{jk}+\xi _{ik}+\xi _{il}+\xi _{jl}+\xi _{kl}+\xi _{ij}\xi _{kl}+\xi _{ik}\xi _{jl}+\xi _{il}\xi _{jk}. \end{aligned}$ (12)

We use “≃” to denote results based on these three approximations. Approximation (1) is good as long as the survey size is large compared to r_max. It allows us to drop terms other than 1 + ξ_ij + ξ_kl + ξ_ijξ_kl in (12). Approximations (2) and (3) hold for Gaussian density fluctuations, but in the realistic cosmological situation they are not good: the presence of the higher-order correlations makes the estimation of the covariance of ξ(r) estimators a difficult problem. However, this difficulty applies only to the contribution of the data to the covariance, that is, to the part that does not depend on the size and treatment of the random catalog. The key point in this work is that while our theoretical result for the total covariance does not hold in a realistic situation (it is an underestimate), our results for the difference in estimator covariance due to different treatments of the random catalog hold well.

In addition to working in the limit $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ (γ = 0), LS93 considered only 1D bins and the case where r₁ = r₂ ≡ r (i.e., variances, not covariances) and made also a fourth approximation: for triplets (which in this case have legs of equal length) they approximated the correlation between the end cells (whose separation in this case varies between 0 and 2r) by ξ(r). We use ξ₁₂ to denote the mean value of the correlation between triplet end cells (separated from the triplet center by r₁ and r₂). (For our plots in Sect. 4 we make a similar approximation of ξ₁₂ as Landy & Szalay 1993, see Sect. 4.2). Also, LS93 only calculated to first order in ξ, whereas we do not make this approximation. Bernstein (1994) also considered covariances, and included the effect of three-point and four-point correlations, but worked in the limit $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ (γ = 0).

2.3. Poisson, edge, and q terms

After calculating all the ⟨DD₁DD₂⟩ and so on (see Appendix A), (9) becomes

$\begin{matrix} (1 + ξ_{1}) (1 + ξ_{2}) ⟨ α_{1} α_{2} ⟩ \\ ≃ \frac{4}{N_{d}} (1 + ξ_{1}) (1 + ξ_{2}) [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] \\ + \frac{2 (1 + ξ_{1})}{N_{d} (N_{d} - 1)} [\frac{δ_{12}}{G_{1}^{p}} - 2 (1 + ξ_{2}) \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} + (1 + ξ_{2})] \\ + \frac{4 (N_{d} - 2)}{N_{d} (N_{d} - 1)} (ξ_{12} - ξ_{1} ξ_{2}) \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}}, \\ ⟨ β_{1} β_{2} ⟩ ≃ \frac{1}{N_{d} N_{r}} {N_{r}^{'} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] + N_{d} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] + 1 \\ - \frac{2 G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} + \frac{δ_{12}}{G_{1}^{p}}} + \frac{N_{d} - 1}{N_{d} N_{r}} ξ_{12} \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}}, \\ ⟨ γ_{1} γ_{2} ⟩ = \frac{2}{N_{r}^{'} (N_{r}^{'} - 1)} {2 (N_{r}^{'} - 2) [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] + \frac{δ_{12}}{G_{1}^{p}} - 1}, \\ ⟨ α_{1} β_{2} ⟩ ≃ \frac{2}{N_{d}} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1], \\ ⟨ β_{1} γ_{2} ⟩ = \frac{2}{N_{r}^{'}} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1], \\ ⟨ α_{1} γ_{2} ⟩ = 0, \end{matrix}$ $\begin{aligned}&(1+\xi _1)(1+\xi _2)\langle \alpha _1\alpha _2\rangle \nonumber \\&\qquad \qquad \qquad \simeq \frac{4}{N_{\rm d}} (1+\xi _1)(1+\xi _2)\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] \nonumber \\&\qquad \qquad \qquad \quad + \frac{2(1+\xi _1)}{N_{\rm d}(N_{\rm d}-1)}\left[ \frac{\delta _{12}}{G^\mathrm{p}_1}- 2(1+\xi _2)\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} +(1+\xi _2) \right] \nonumber \\&\qquad \qquad \qquad \quad + \frac{4(N_{\rm d}-2)}{N_{\rm d}(N_{\rm d}-1)}(\xi _{12}-\xi _1\xi _2)\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} ,\nonumber \\&\langle \beta _1\beta _2\rangle \simeq \frac{1}{N_{\rm d}N_{\rm r}}\Biggl \{ N^{\prime }_{\rm r}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2}-1\right] + N_{\rm d}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2}-1\right] + 1 \nonumber \\&\qquad \quad \;\; - \frac{2G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} + \frac{\delta _{12}}{G^\mathrm{p}_1} \Biggr \} + \frac{N_{\rm d}-1}{N_{\rm d}N_{\rm r}}\xi _{12}\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} ,\nonumber \\&\langle \gamma _1\gamma _2\rangle = \frac{2}{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}\left\{ 2(N^{\prime }_{\rm r}-2)\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] + \frac{\delta _{12}}{G^\mathrm{p}_1} - 1\right\} ,\nonumber \\&\langle \alpha _1\beta _2\rangle \simeq \frac{2}{N_{\rm d}}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] ,\nonumber \\&\langle \beta _1\gamma _2\rangle = \frac{2}{N^{\prime }_{\rm r}}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] ,\nonumber \\&\langle \alpha _1\gamma _2\rangle = 0, \end{aligned}$ (13)

for the standard LS estimator.

Following the definition of t and p in LS93, we define

$\begin{matrix} t_{12} : = \frac{1}{N_{d}} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1], \\ t_{12}^{r} : = \frac{1}{N_{r}^{'}} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] = \frac{N_{d}}{N_{r}^{'}} t_{12}, \\ p_{12} : = \frac{2}{N_{d} (N_{d} - 1)} [\frac{δ_{12}}{(1 + ξ_{1}) G_{1}^{p}} - 2 \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} + 1], \\ p_{12}^{c} : = \frac{1}{N_{d} N_{r}} [\frac{δ_{12}}{G_{1}^{p}} - 2 \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} + 1], \\ p_{12}^{r} : = \frac{2}{N_{r}^{'} (N_{r}^{'} - 1)} [\frac{δ_{12}}{G_{1}^{p}} - 2 \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} + 1], \\ q_{12} : = \frac{1}{N_{d}} \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} = t_{12} + \frac{1}{N_{d}}, \\ q_{12}^{r} : = \frac{1}{N_{r}^{'}} \frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} = t_{12}^{r} + \frac{1}{N_{r}^{'}} \cdot \end{matrix}$ $\begin{aligned}&t_{12} := \frac{1}{N_{\rm d}}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2}-1\right] ,\nonumber \\&t^\mathrm{r}_{12} := \frac{1}{N^{\prime }_{\rm r}}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2}-1\right] \ = \ \frac{N_{\rm d}}{N^{\prime }_{\rm r}}\,t_{12}, \nonumber \\&p_{12} := \frac{2}{N_{\rm d}(N_{\rm d}-1)}\left[\frac{\delta _{12}}{(1+\xi _1)G^\mathrm{p}_1} - 2\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} + 1\right] ,\nonumber \\&p^\mathrm{c}_{12} := \frac{1}{N_{\rm d}N_{\rm r}}\left[\frac{\delta _{12}}{G^\mathrm{p}_1} - 2\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} + 1\right] ,\nonumber \\&p^\mathrm{r}_{12} := \frac{2}{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}\left[\frac{\delta _{12}}{G^\mathrm{p}_1} - 2\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} + 1\right],\nonumber \\&q_{12} := \frac{1}{N_{\rm d}}\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} = t_{12} + \frac{1}{N_{\rm d}} ,\nonumber \\&q^\mathrm{r}_{12} := \frac{1}{N^{\prime }_{\rm r}}\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} = t^\mathrm{r}_{12} + \frac{1}{N^{\prime }_{\rm r}}\cdot \end{aligned}$ (14)

For their diagonals (r₁ = r₂), we write t, t_r, p, p_c, p_r, q, and q_r. Thus, t ≡ t₁₁ ≡ t₂₂, $t_{r} \equiv t_{11}^{r} \equiv t_{22}^{r}$ $t_{\mathrm{r}}\equiv t^{\mathrm{r}}_{11}\equiv t^{\mathrm{r}}_{22}$ and so on. (We use superscripts for the matrices, e.g., t^r(r₁, r₂), and subscripts for their diagonals, e.g., t_r(r)).

Using these definitions, (13) becomes

$\begin{matrix} (1 + ξ_{1}) (1 + ξ_{2}) ⟨ α_{1} α_{2} ⟩ ≃ (1 + ξ_{1}) (1 + ξ_{2}) (4 t_{12} + p_{12}) \\ + 4 \frac{N_{d} - 2}{(N_{d} - 1)} (ξ_{12} - ξ_{1} ξ_{2}) q_{12}, \\ ⟨ β_{1} β_{2} ⟩ ≃ t_{12} + t_{12}^{r} + p_{12}^{c} + \frac{N_{d} - 1}{N_{d}} ξ_{12} q_{12}^{r}, \\ ⟨ γ_{1} γ_{2} ⟩ = 4 t_{12}^{r} + p_{12}^{r}, \\ ⟨ α_{1} β_{2} ⟩ ≃ 2 t_{12}, ⟨ β_{1} γ_{2} ⟩ = 2 t_{12}^{r}, and ⟨ α_{1} γ_{2} ⟩ = 0 . \end{matrix}$ $\begin{aligned}&(1+\xi _1)(1+\xi _2)\langle \alpha _1\alpha _2\rangle \simeq (1+\xi _1)(1+\xi _2)(4t_{12} + p_{12}) \nonumber \\&\qquad \qquad \qquad \qquad \qquad + 4\frac{N_{\rm d}-2}{(N_{\rm d}-1)}(\xi _{12}-\xi _1\xi _2)q_{12}, \nonumber \\&\langle \beta _1\beta _2\rangle \simeq t_{12} + t^\mathrm{r}_{12} + p^\mathrm{c}_{12} + \frac{N_{\rm d}-1}{N_{\rm d}}\xi _{12}q^\mathrm{r}_{12},\nonumber \\&\langle \gamma _1\gamma _2\rangle = 4\,t^\mathrm{r}_{12} + p^\mathrm{r}_{12},\nonumber \\&\langle \alpha _1\beta _2\rangle \simeq 2t_{12},\,\langle \beta _1\gamma _2\rangle = 2\,t^\mathrm{r}_{12} ,\quad \mathrm{and} \quad \langle \alpha _1\gamma _2\rangle = 0. \end{aligned}$ (15)

The new part in ⟨β₁β₂⟩ due to finite size of the random catalog is

$\begin{matrix} Δ ⟨ β_{1} β_{2} ⟩ ≃ t_{12}^{r} + p_{12}^{c} + \frac{N_{d} - 1}{N_{d}} ξ_{12} q_{12}^{r} . \end{matrix}$ $\begin{aligned} \Delta \langle \beta _1\beta _2\rangle \simeq t^\mathrm{r}_{12} + p^\mathrm{c}_{12} + \frac{N_{\rm d}-1}{N_{\rm d}}\xi _{12}q^\mathrm{r}_{12}. \end{aligned}$ (16)

Thus only ⟨α₁α₂⟩ and ⟨β₁β₂⟩ are affected by ξ(r) (in our approximation its effect cancels in ⟨α₁β₂⟩). The results for ⟨γ₁γ₂⟩, ⟨β₁γ₂⟩, and ⟨α₁γ₂⟩ are exact. The result for ⟨α₁α₂⟩ involves all three approximations mentioned above, ⟨α₁β₂⟩ involves approximations (1) and (2), and ⟨β₁β₂⟩ involves approximation (1).

We refer to p, p^c, and p^r as “Poisson” terms and t and t^r as “edge” terms (the difference between $G_{12}^{t}$ $G^{\mathrm{t}}_{12}$ and $G_{1}^{p} G_{2}^{p}$ $G^{\rm p}_1\,G^{\rm p}_2$ is due to edge effects). While the Poisson terms are strongly diagonal dominated, the edge terms are not. Since $N_{d} t_{12} = N'_{r} t_{12}^{r} ≪ 1$ $N_{\mathrm{d}} t_{12} = N^\prime_{\mathrm{r}} t^{\mathrm{r}}_{12} \ll 1$ , the q terms are much larger than the edge terms, but they get multiplied by ξ₁₂ − ξ₁ξ₂ or ξ₁₂. In the limit $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ : ⟨β₁γ₂⟩→0, ⟨γ₁γ₂⟩→0, ⟨β₁β₂⟩→t₁₂; ⟨α₁α₂⟩, and also ⟨α₁β₂⟩ are unaffected.

We see that DD–DR and DR–RR correlations arise from edge effects. If we increase the density of data or random objects, the Poisson terms decrease as N⁻² but the edge terms decrease only as N⁻¹ so the edge effects are more important for a higher density of objects.

Doubling the bin size (combining neighboring bins) doubles G^p(r) but makes G^t(r₁, r₂) four times as large, since triplets where one leg was in one of the original smaller bins and the other leg was in the other bin are now also included. Thus, the ratio $G_{12}^{t} / (G_{1}^{p} G_{2}^{p})$ $G^{\mathrm{t}}_{12}/(G^{\mathrm{p}}_1\,G^{\mathrm{p}}_2)$ and t are not affected, but the dominant term in p, 1/(1 + ξ)G^p is halved. Edge effects are thus more important for larger bins.

2.4. Results for the standard Landy–Szalay estimator

Inserting the results for ⟨α₁α₂⟩ and so on into Eqs. (5) and (6), we get that the expectation value of the standard LS estimator (1) is

$\begin{matrix} ⟨ {\hat{ξ}}_{LS} ⟩ = ξ + (ξ - 1) (4 t_{r} + p_{r}) + 4 t_{r} . \end{matrix}$ $\begin{aligned} \langle \hat{\xi }_{\rm LS}\rangle = \xi + \left(\xi -1\right)(4\,t_{\rm r}+p_{\rm r}) + 4\,t_{\rm r}. \end{aligned}$ (17)

This holds also for large ξ and in the presence of three-point and four-point correlations. A finite R catalog thus introduces a bias (ξ−1)(4 t_r + p_r)+4 t_r = −p_r + (4 t_r + p_r)ξ; the edge (t_r) part of the bias cancels in the ξ → 0 limit.

For the covariance we get

$\begin{matrix} Cov [{\hat{ξ}}_{LS} (r_{1}), {\hat{ξ}}_{LS} (r_{2})] & \equiv 〈 {\hat{ξ}}_{LS} (r_{1}) {\hat{ξ}}_{LS} (r_{2}) 〉 - 〈 {\hat{ξ}}_{LS} (r_{1}) 〉 〈 {\hat{ξ}}_{LS} (r_{2}) 〉 \\ ≃ (1 + ξ_{1}) (1 + ξ_{2}) p_{12} + 4 p_{12}^{c} \\ + (1 - ξ_{1}) (1 - ξ_{2}) p_{12}^{r} + 4 ξ_{1} ξ_{2} (t_{12} + t_{12}^{r}) \\ + 4 \frac{N_{d} - 2}{N_{d} - 1} (ξ_{12} - ξ_{1} ξ_{2}) q_{12} + 4 \frac{N_{d} - 1}{N_{d}} ξ_{12} q_{12}^{r} . \end{matrix}$ $\begin{aligned} \mathrm{Cov}\left[\hat{\xi }_{\rm LS}({\boldsymbol{r}}_1),\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right]&\equiv \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_1)\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right\rangle - \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_1)\right\rangle \left\langle \hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right\rangle \nonumber \\&\simeq (1+\xi _1)(1+\xi _2)p_{12} + 4p^\mathrm{c}_{12}\nonumber \\&\quad + (1-\xi _1)(1-\xi _2)\,p^\mathrm{r}_{12} + 4\xi _1\xi _2(t_{12}+t^\mathrm{r}_{12})\nonumber \\&\quad + 4\frac{N_{\rm d}-2}{N_{\rm d}-1}(\xi _{12}-\xi _1\xi _2)q_{12} + 4\frac{N_{\rm d}-1}{N_{\rm d}}\xi _{12}q^\mathrm{r}_{12}. \end{aligned}$ (18)

Because of the approximations made, this result for the covariance does not apply to the realistic cosmological case; not even for large separations r, where ξ is small, since large correlations at small r increase the covariance also at large r. However, this concerns only ⟨α₁α₂⟩ and ⟨α₁β₂⟩. Our focus here is on the additional covariance due to the size and handling of the random catalog, which for standard LS is

$\begin{matrix} Δ Cov [{\hat{ξ}}_{LS} (r_{1}), {\hat{ξ}}_{LS} (r_{2})] & ≂ 4 Δ ⟨ β_{1} β_{2} ⟩ + (1 - ξ_{1}) (1 - ξ_{2}) ⟨ γ_{1} γ_{2} ⟩ \\ - 2 (1 - ξ_{1}) ⟨ γ_{1} β_{2} ⟩ - 2 (1 - ξ_{2}) ⟨ β_{1} γ_{2} ⟩ \\ ≃ 4 p_{12}^{c} + (1 - ξ_{1}) (1 - ξ_{2}) p_{12}^{r} + 4 ξ_{1} ξ_{2} t_{12}^{r} \\ + 4 \frac{N_{d} - 1}{N_{d}} ξ_{12} q_{12}^{r} . \end{matrix}$ $\begin{aligned} \Delta \mathrm{Cov}\left[\hat{\xi }_{\rm LS}({\boldsymbol{r}}_1),\hat{\xi }_{\rm LS}({\boldsymbol{r}}_2)\right]&\eqsim 4\Delta \langle \beta _1\beta _2\rangle + (1-\xi _1)(1-\xi _2)\langle \gamma _1\gamma _2\rangle \nonumber \\&\quad - 2(1-\xi _1)\langle \gamma _1\beta _2\rangle - 2(1-\xi _2)\langle \beta _1\gamma _2\rangle \nonumber \\&\simeq 4p^\mathrm{c}_{12} + (1-\xi _1)(1-\xi _2)\,p^\mathrm{r}_{12} +4\xi _1\xi _2\,t^\mathrm{r}_{12}\nonumber \\&\quad + 4\frac{N_{\rm d}-1}{N_{\rm d}}\xi _{12}q^\mathrm{r}_{12}. \end{aligned}$ (19)

To zeroth order in ξ the covariance is given by the Poisson terms and the edge terms cancel to first order in ξ. This is the property for which the standard LS estimator was designed. To first order in ξ, the q terms contribute. This q contribution involves the triplet correlation ξ₁₂, which, depending on the form of ξ(r), may be larger than ξ₁ or ξ₂.

If we try to save cost by using a diluted random catalog with $N'_{r} ≪ N'_{r}$ $N^\prime_{\mathrm{r}} \ll N^\prime_{\mathrm{r}}$ for RR pairs, ⟨γ₁γ₂⟩ is replaced by $⟨ γ'_{1} γ'_{2} ⟩ = 4 t_{12}^{r'} + p_{12}^{r ″}$ $\langle\gamma\prime_1\gamma\prime_2\rangle = 4\, t^{\mathrm{r}\prime}_{12} + p^{\mathrm{r}\prime\prime}_{12}$ with $N'_{r}$ $N^\prime_{\mathrm{r}}$ in place of $N'_{r}$ $N^\prime_{\mathrm{r}}$ , but $〈 β_{1} γ_{2}^{'} 〉 = 〈 β_{1} γ_{2} 〉$ $\langle\beta_1\gamma_2^{\prime}\rangle = \langle\beta_1\gamma_2\rangle$ and ⟨β₁β₂⟩ are unaffected, so that the edge terms involving randoms no longer cancel. In Sect. 4 we see that this is a large effect. Therefore, one should not use dilution.

3. Split random catalog

3.1. Bias and covariance for the split method

In the split method one has M_s independent smaller R^μ catalogs of size $N_{r}^{'}$ $N^{\prime}_{\rm r}$ instead of one large random catalog R. Their union, R, has a size of $N_{r}^{'} = M_{s} N_{r}^{'}$ $N^{\prime}_{\rm r} = M_{\rm s}N^{\prime}_{\rm r}$ . The pair counts DR(r) and RR′(r) are calculated as

$\begin{matrix} D R (r) : = \sum_{μ = 1}^{M_{s}} D R^{μ} (r) and R R^{'} (r) : = \sum_{μ = 1}^{M_{s}} R^{μ} R^{μ} (r), \end{matrix}$ $\begin{aligned} DR({\boldsymbol{r}}) := \sum _{\mu =1}^{M_{\rm s}}DR^\mu ({\boldsymbol{r}}) \quad \mathrm{and} \quad RR^{\prime }({\boldsymbol{r}}) := \sum _{\mu =1}^{M_{\rm s}}R^\mu R^\mu ({\boldsymbol{r}}), \end{aligned}$ (20)

that is, pairs across different R^μ catalogs are not included in RR′. The total number of pairs in RR′ is $\frac{1}{2} M_{s} N'_{r} (N'_{r} - 1) = \frac{1}{2} N'_{r} (N'_{r} - 1)$ $\frac{1}{2} M_{\mathrm{s}}N^{\prime}_{\mathrm{r}}(N^{\prime}_{\mathrm{r}}-1) = \frac{1}{2} N^{\prime}_{\mathrm{r}}(N^{\prime}_{\mathrm{r}}-1)$ . Here, DR is equal to its value in standard LS.

The split Landy–Szalay estimator is

$\begin{matrix} {\hat{ξ}}_{split} (r) : = \frac{N_{r}^{'} (N_{r}^{'} - 1)}{N_{d} (N_{d} - 1)} \frac{D D (r)}{R R^{'} (r)} - \frac{N_{r}^{'} - 1}{N_{d}} \frac{D R (r)}{R R^{'} (r)} + 1 \cdot \end{matrix}$ $\begin{aligned} \hat{\xi }_\mathrm{split} ({\boldsymbol{r}}) := \frac{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}{N_{\rm d}(N_{\rm d}-1)}\frac{DD({\boldsymbol{r}})}{RR^{\prime }({\boldsymbol{r}})} - \frac{N^{\prime }_{\rm r}-1}{N_{\rm d}}\frac{DR({\boldsymbol{r}})}{RR^{\prime }({\boldsymbol{r}})} + 1\cdot \end{aligned}$ (21)

Compared to standard LS, ⟨α₁α₂⟩, ⟨β₁β₂⟩, and ⟨α₁β₂⟩ are unaffected. We construct ⟨RR′⟩, ⟨RR′⋅RR′⟩, and ⟨RR′⋅DR⟩ from the standard LS results, bearing in mind that the random catalog is a union of independent catalogs, arriving at

$\begin{matrix} ⟨ β_{1} γ_{2}^{'} ⟩ & = 2 t_{12}^{r}, \\ ⟨ γ_{1}^{'} γ_{2}^{'} ⟩ & = 4 t_{12}^{r} + p_{12}^{r^{'}}, \end{matrix}$ $\begin{aligned} \langle \beta _1\gamma _2^{\prime }\rangle&= 2\,t^\mathrm{r}_{12} ,\nonumber \\ \langle \gamma ^{\prime }_1\gamma ^{\prime }_2\rangle&= 4\,t^\mathrm{r}_{12} + p^{\mathrm{r}^{\prime }}_{12} , \end{aligned}$ (22)

where

$\begin{matrix} p_{12}^{r^{'}} : = \frac{N_{d} (N_{d} - 1)}{N_{r}^{'} (N_{r}^{'} - 1)} p_{12} \equiv \frac{N_{r}^{'} - 1}{N_{r}^{'} - 1} p_{12}^{r} . \end{matrix}$ $\begin{aligned} p^{\mathrm{r}^{\prime }}_{12} := \frac{N_{\rm d}(N_{\rm d}-1)}{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}\,p_{12} \equiv \frac{N^{\prime }_{\rm r}-1}{N^{\prime }_{\rm r}-1}\,p^\mathrm{r}_{12}. \end{aligned}$ (23)

The first is the same as in standard LS and dilution, but the second differs both from standard LS and from dilution, since it involves both $N_{r}^{'}$ $N^{\prime}_{\rm r}$ and $N_{r}^{'}$ $N^{\prime}_{\rm r}$ .

For the expectation value we get

$\begin{matrix} ⟨ {\hat{ξ}}_{split} ⟩ = ξ + (ξ - 1) (4 t_{r} + p_{r}^{'}) + 4 t_{r}, \end{matrix}$ $\begin{aligned} \langle \hat{\xi }_\mathrm{split} \rangle = \xi + \left(\xi -1\right)(4\,t_{\rm r}+p^{\prime }_{\rm r}) + 4\,t_{\rm r} , \end{aligned}$ (24)

so that the bias is $(ξ - 1) (4 t_{r} + p_{r}^{'}) + 4 t_{r} = - p_{r}^{'} + (4 t_{r} + p_{r}^{'}) ξ$ $(\xi-1)(4\,t_{\rm r}+p^{\prime}_{\rm r}) + 4\,t_{\rm r} = -p^{\prime}_{\rm r} + (4\,t_{\rm r}+p^{\prime}_{\rm r})\xi$ . In the limit ξ → 0 the edge part cancels, leaving only the Poisson term.

The covariance is

$\begin{matrix} Cov [{\hat{ξ}}_{split} (r_{1}), {\hat{ξ}}_{split} (r_{2})] ≃ & (1 + ξ_{1}) (1 + ξ_{2}) p_{12} + 4 p_{12}^{c} \\ + (1 - ξ_{1}) (1 - ξ_{2}) p_{12}^{r^{'}} + 4 ξ_{1} ξ_{2} (t_{12} + t_{12}^{r}) . \end{matrix}$ $\begin{aligned} \mathrm{Cov}\left[\hat{\xi }_\mathrm{split} ({\boldsymbol{r}}_1),\hat{\xi }_\mathrm{split} ({\boldsymbol{r}}_2)\right] \simeq &(1+\xi _1)(1+\xi _2)p_{12} + 4\,p^\mathrm{c}_{12} \nonumber \\& + (1-\xi _1)(1-\xi _2)p^{\mathrm{r}^{\prime }}_{12} + 4\xi _1\xi _2(t_{12}+t^\mathrm{r}_{12}) . \end{aligned}$ (25)

The change in the covariance compared to the standard LS method is

$\begin{matrix} Cov [{\hat{ξ}}_{1}^{split}, {\hat{ξ}}_{2}^{split}] - Cov [{\hat{ξ}}_{1}^{LS}, {\hat{ξ}}_{2}^{LS}] = (1 - ξ_{1}) (1 - ξ_{2}) (p_{12}^{r^{'}} - p_{12}^{r}), \end{matrix}$ $\begin{aligned}&\mathrm{Cov}\left[\hat{\xi }^\mathrm{split} _1,\hat{\xi }^\mathrm{split} _2\right] - \mathrm{Cov}\left[\hat{\xi }^\mathrm{LS} _1,\hat{\xi }^\mathrm{LS} _2\right]= (1-\xi _1)(1-\xi _2)(p^{\mathrm{r}^{\prime }}_{12}-p^\mathrm{r}_{12}), \end{aligned}$ (26)

which again applies in the realistic cosmological situation. Our main result is that in the split method the edge effects cancel and the bias and covariance are the same as for standard LS, except that the Poisson term p^r from RR is replaced with the larger p^r′.

3.2. Optimizing computational cost and variance of the split method

The bias is small compared to variance in our application (see Fig. 1 for the theoretical result and Fig. 5 for an attempted bias measurement), and therefore we focus on variance as the figure of merit. The computational cost should be roughly proportional to

$\begin{matrix} \frac{1}{2} N_{d}^{2} (1 + 2 M_{r} + \frac{M_{r}^{2}}{M_{s}}) = : \frac{1}{2} N_{d}^{2} c, \end{matrix}$ $\begin{aligned} \frac{1}{2} N_{\rm d}^2\left(1+2M_{\rm r}+\frac{M_{\rm r}^2}{M_{\rm s}}\right) =: \frac{1}{2} N_{\rm d}^2c , \end{aligned}$ (27)

Fig. 1.

Mean ξ(r) estimate and the scatter and theoretical bias of the estimates for different estimators. The dash-dotted line, our theoretical result for the scatter of the LS method, underestimates the scatter, since higher-order correlations in the D catalog are ignored. The dotted line is without the contribution of the q terms, and is dominated by the Poisson (p) terms. The bias is multiplied by 100 so the curves can be displayed in a more compact plot. For the measured mean and scatter, and the theoretical bias we plot standard LS in black, dilution with d = 0.14 in red, and split with M_s = 50 in blue. For the mean and scatter the difference between the methods is not visible in this plot. The differences in the mean estimate are shown in Fig. 5. The differences in scatter (or its square, the variance) are shown in Fig 3. For the theoretical bias the difference between split and dilution is not visible at small r (ξ(r) > 1), where the bias is positive.

and the additional variance due to finite R catalog in the ξ → 0 limit becomes

$\begin{matrix} Δ var \approx (\frac{2}{M_{r}} + \frac{M_{s}}{M_{r}^{2}}) p = : v p . \end{matrix}$ $\begin{aligned} \Delta \mathrm{var} \approx \left(\frac{2}{M_{\rm r}}+\frac{M_{\rm s}}{M_{\rm r}^2}\right)p =: vp. \end{aligned}$ (28)

Here, N_d and p are fixed by the survey and the requested r binning, but we can vary M_r and M_s in the search for the optimal computational method. In the above we defined the “cost” and “variance” factors c and v.

We may ask two questions:

For a fixed level of variance v, which combination of M_r and M_s minimizes computational cost c?
For a fixed computational cost c, which combination of M_r and M_s minimizes the variance v?

The answer to both questions is (Slepian & Eisenstein 2015)

$\begin{matrix} M_{s} = M_{r} \Rightarrow c = 1 + 3 M_{r} and v = \frac{3}{M_{r}} \cdot \end{matrix}$ $\begin{aligned} M_{\rm s} = M_{\rm r}\;\;\Rightarrow \;\;c = 1+3M_{\rm r} \quad \mathrm{and}\quad v = \frac{3}{M_{\rm r}}\cdot \end{aligned}$ (29)

Thus, the optimal version of the split method is the natural one where $N_{r}^{'} = N_{d}$ $N^{\prime}_{\rm r} = N_{\rm d}$ . In this case the additional variance in the ξ → 0 limit becomes

$\begin{matrix} Δ var \approx (2 \frac{N_{d}}{N_{r}^{'}} + \frac{N_{d}}{N_{r}^{'}}) p, \end{matrix}$ $\begin{aligned} \Delta \mathrm{var} \approx \left(2\frac{N_{\rm d}}{N^{\prime }_{\rm r}} + \frac{N_{\rm d}}{N^{\prime }_{\rm r}}\right)p , \end{aligned}$ (30)

and the computational cost factor $N_{d}^{2} + 2 N_{d} N_{r} + N_{r}^{'} N_{r}^{'}$ $N_{\rm d}^2+2N_{\rm d}N_{\rm r}+N^{\prime}_{\rm r} N^{\prime}_{\rm r}$ becomes

$\begin{matrix} (1 + 2 \frac{N_{r}^{'}}{N_{d}} + \frac{N_{r}^{'}}{N_{d}}) N_{d}^{2}, \end{matrix}$ $\begin{aligned} \left(1 + 2\frac{N^{\prime }_{\rm r}}{N_{\rm d}} + \frac{N^{\prime }_{\rm r}}{N_{\rm d}}\right)N_{\rm d}^2 , \end{aligned}$ (31)

meaning that DR pairs contribute twice as much as RR pairs to the variance and also twice as much computational cost is invested in them. The memory requirement for the random catalog is then the same as for the data catalog. The cost saving estimate above is optimistic, since the computation involves some overhead not proportional to the number of pairs.

For small scales, where ξ ≫ 1, the situation is different. The greater density of DD pairs due to the correlation requires a greater density of the R catalog so that the additional variance from it is not greater. From Eq. (19) we see that the balance of the DR and the RR contributions is different for large ξ (the p_c term vs. the other terms). We may consider recomputing $\hat{ξ}$ $\hat{\xi}$ for the small scales using a smaller r_max and a larger R catalog. Considering just the Poisson terms (p_c and p_r or $p_{r}^{'}$ $p^{\prime}_{\rm r}$ ) with a “representative” ξ value, (27) and (28) become $c = 1 + ξ + 2 M_{r} + M_{r}^{2} / M_{s}$ $c = 1+\xi+2\,M_{\rm r}+M_{\rm r}^2/M_{\rm s}$ and $v = 2 / M_{r} + {(1 - ξ)}^{2} M_{s} / M_{r}^{2}$ $v = 2/M_{\rm r}+(1-\xi)^2M_{\rm s}/M_{\rm r}^2$ which modifies the above result (Eq. (29)) for the optimal choice of M_s and M_r to

$\begin{matrix} M_{s} = \frac{M_{r}}{| ξ - 1 |}, that is, N_{r}^{'} = | ξ - 1 | N_{d} . \end{matrix}$ $\begin{aligned} M_{\rm s} = \frac{M_{\rm r}}{|\xi -1|}, \quad \mathrm{that\,is},\quad N^{\prime }_{\rm r} = |\xi -1|N_{\rm d}. \end{aligned}$ (32)

This result is only indicative, since it assumes a constant ξ for r < r_max. In particular, it does not apply for ξ ≈ 1, because then the approximation of ignoring the q^r and t^r terms in (19) is not good.

4. Tests on mock catalogs

4.1. Minerva simulations and methodology

The Minerva mocks are a set of 300 cosmological mocks produced with N-body simulations (Grieb et al. 2016; Lippich et al. 2019), stored at five output redshifts z ∈ {2.0, 1.0, 0.57, 0.3, 0}. The cosmology is flat ΛCDM with Ω_m = 0.285, and we use the z = 1 outputs. The mocks have N_d ≈ 4 × 10⁶ objects (“halos” found by a friends-of-friend algorithm) in a box of 1500 h⁻¹ Mpc cubed.

To model the survey geometry of a redshift bin with Δz ≈ 0.1 at z ∼ 1, we placed the observer at comoving distance 2284.63 h⁻¹ Mpc from the center of the cube and selected from the cube a shell 2201.34–2367.92 h⁻¹ Mpc from the observer. The comoving thickness of the shell is 166.58 h⁻¹ Mpc. The resulting mock sub-catalogs have N_d ≈ 4.5 × 10⁵ and are representative of the galaxy number density of the future Euclid spectroscopic galaxy catalog.

We ignore peculiar velocities, that is, we perform our analysis in real space. Therefore, we consider results for the 1D 2PCF ξ(r). We estimated ξ(r) up to r_max = 200 h⁻¹ Mpc using Δr = 1 h⁻¹ Mpc bins.

We chose standard LS with M_r = 50 as the reference method. In the following, LS without further qualification refers to this. The random catalog was generated separately for each shell mock to measure their contribution to the variance. For one of the random catalogs we calculated also triplets to obtain the edge effect quantity $N_{d} t_{12} = G_{12}^{t} / G_{1}^{p} G_{2}^{p} - 1$ $N_{\mathrm{d}} t_{12} = G^{\mathrm{t}}_{12}/G^{\mathrm{p}}_1\,G^{\mathrm{p}}_2 - 1$ .

While dilution can already be discarded on theoretical grounds, we show results obtained using dilution, since these results provide the scale for edge effects demonstrating the importance of eliminating them with a careful choice of method. For the dilution and split methods we also used M_r = 50, and tried out dilution fractions $d : = N_{r}^{'} / N_{r}^{'} = 0.5, 0.25, 0.14$ $d := N^{\prime}_{\rm r}/N^{\prime}_{\rm r} = 0.5, 0.25, 0.14$ and split factors M_s = 4, 16, 50 (chosen to have similar pairwise computational costs). In addition, we considered standard LS with M_r = 25, which has the same number of RR pairs as d = 0.5 and M_s = 4, but only half the number of DR pairs; and standard LS with M_r = 1 to demonstrate the effect of a small $N_{r}^{'}$ $N^{\prime}_{\rm r}$ .

The code used to estimate the 2PCF implements a highly optimized pair-counting method, specifically designed for the search of object pairs in a given range of separations. In particular, the code provides two alternative counting methods, the chain-mesh and the kd-tree. Both methods measure the exact number of object pairs in separation bins, without any approximation. However, since they implement different algorithms to search for pairs, they perform differently at different scales, both in terms of CPU time and memory usage. Overall, the efficiency of the two methods depends on the ratio between the scale range of the searching region and the maximum separation between the objects in the catalog.

The kd-tree method first constructs a space-partitioning data structure that is filled with catalog objects. The minimum and maximum separations probed by the objects are kept in the data structure and are used to prune object pairs with separations outside the range of interest. The tree pair search is performed through the dual-tree method in which cross-pairs between two dual trees are explored. This is an improvement in terms of exploration time over the single-tree method.

On the other hand, in the chain-mesh method the catalog is divided in cubic cells of equal size, and the indexes of the objects in each cell are stored in vectors. To avoid counting object pairs with separations outside the interest range, the counting is performed only on the cells in a selected range of distances from each object. The chain-mesh algorithm has been imported from the CosmoBolognaLib, a large set of free software C++/python libraries for cosmological calculations (Marulli et al. 2016).

For our test runs we used the chain-mesh method.

4.2. Variance and bias

In Fig. 1 we show the mean (over the 300 mock shells) estimated correlation function and the scatter (square root of the variance) of the estimates using the LS, split, and dilution methods; our theoretical approximate result for the scatter for LS; and our theoretical result for bias for the different methods.

The theoretical result for the scatter is shown with and without the q terms, which include the triplet correlation ξ₁₂, for which we used here the approximation ξ₁₂ ≈ ξ(max(r₁, r₂)). This behaves as expected, that is, it underestimates the variance, since we neglected the higher-order correlations in the D catalog. Nevertheless, it (see the dash-dotted line in Fig. 1) has similar features to the measured variance (dashed lines).

In Fig. 2 we plot the diagonals of the p, t, and q quantities. This shows how their relative importance changes with separation scale. It also confirms that our initial assumption on small relative fluctuations is valid in this simulation case.

Fig. 2.

Quantities p, p_c, p_r, q, q_r, t, and t_r for the Minerva shell. The values for the first bin are noisy. The vertical red line marks r = L.

Consider now the variance differences (from standard LS with M_r = 50), for which our theoretical results should be accurate. Figure 3 compares the measured variance difference to the theoretical result. For the diluted estimators and LS with M_r = 1 the measured result agrees with theory, although clearly the measurement with just 300 mocks is rather noisy. For the split estimators and LS with M_r = 25 the difference is too small to be appreciated with 300 mocks, but at least the measurement does not disagree with the theoretical result.

Fig. 3.

Measured difference from LS of the variance of different estimators, multiplied by r². Dashed lines are our theoretical results.

In Fig. 4 we show the relative theoretical increase in scatter compared to the best possible case, which is LS in the limit M_r → ∞. Since we do not have a valid theoretical result for the total scatter, we estimate it by subtracting the theoretical difference from LS with M_r = 50 from the measured variance of the latter.

Fig. 4.

Theoretical estimate of the scatter of the ξ estimates divided by the scatter in the $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ limit. The dotted lines correspond to 0.3%, 0.5%, and 1% increase in scatter. For r < 10 h⁻¹ Mpc there is hardly any difference between split and dilution, the curves lie on top of each other; whereas for larger r split is much better.

At scales r ≲ 10 h⁻¹ Mpc the theoretical prediction is about the same for dilution and split and neither method looks promising for r ≪ 10 h⁻¹ Mpc where ξ ≫ 1. This suggests that for optimizing cost and accuracy, a different method should be used for smaller scales than that used for large scales. The number of RR pairs with small separations is much less. Therefore, for the small-scale computation there is no need to restrict the computation to a subset of RR pairs, or alternatively, one can afford to increase M_r. For the small scales, we may consider the split method with increased M_r as an alternative to standard LS. We have the same number of pairs to compute as in the reference LS case, if we use M_r = 866 and M_s = 866. We added this case to Fig. 4. It seems to perform better than LS at intermediate scales, but for the smallest scales LS has the smaller variance. This is in line with our conclusion in Sect. 3.2, which is that when ξ ≫ 1, it is not optimal to split the R catalog into small subsets.

We also compared the differences in the mean estimate from the different estimators to our theoretical results on the bias differences (see Fig. 5), but the theoretical bias differences are much smaller than the expected error of the mean from 300 mocks; and we simply confirm that the differences we see are consistent with the error of the mean and thus consistent with the true bias being much smaller. We also performed tests with completely random (ξ = 0) mocks, and with a large number (10 000) of mocks confirmed the theoretical bias result for the different estimators in this case. Since the bias is too small to be interesting we do not report these results in more detail here.

Fig. 5.

Differences between the mean ξ(r) estimate and that from the LS, multiplied by r to better display all scales. This measured difference is not the true bias, which is too small to measure with 300 mocks, and is mainly due to random error of the mean. The results for dilution appear to reveal a systematic bias, but this is just due to strong error correlations between nearby bins; for different subsets of the 300 mocks the mean difference is completely different.

However, we note that for the estimation of the 2D 2PCF and its multipoles, the 2D bins will contain a smaller number of objects than the 1D bins of these test runs and therefore the bias is larger. Using the theoretical results (17) or (24) the bias can be removed afterwards with accuracy depending on how well we know the true ξ.

4.3. Computation time and variance

The test runs were made using a single full 24-core node for each run. Table 1 shows the mean computation time and mean estimator variance for different r ranges for the different cases we tested. Of these r ranges, the r = 80 − 120 h⁻¹ Mpc is perhaps the most interesting, since it contains the important BAO scale. Therefore, we plot the mean variance at this range versus mean computation time in Fig. 6, together with our theoretical predictions. The theoretical estimate for the computation time for other dilution fractions and split factors is

$\begin{matrix} (1 + 24.75 d^{2}) 306 s and (1 + 24.75 / M_{s}^{2}) 306 s, \end{matrix}$ $\begin{aligned} (1+24.75d^2)\ 306\,\mathrm{s} \quad \mathrm{and} \quad (1+24.75/M_{\rm s}^2)\ 306\,\mathrm{s} , \end{aligned}$ (33)

Table 1.

Mean computation time over the 300 runs and the mean variance over four different ranges of r bins (given in units of h⁻¹ Mpc) for each method.

Fig. 6.

Measured variance (mean variance over the range r = 80 − 120 h⁻¹ Mpc) vs. computational cost (mean computation time) for the different methods (markers with error bars) and our theoretical prediction (solid lines). The solid lines (blue for the split method, red for dilution, and black for standard LS with M_r ≤ 50) are our theoretical predictions for the increase in variance and computation time ratio when compared to the standard LS, M_r = 50, case, and the dots on the curves correspond to the measured cases (except for LS they are, from right to left, M_r = 25, 12.5, and (50/7); only the first of which was measured). The curve for split ends at M_s = 2500; the optimal case, M_s = M_r, is the circled dot. The error bars for the variance measurement are naive estimates that do not account for error correlations between bins. The theoretical predictions overestimate the cost savings (data points are to the right of the dots on curves; except for the smaller split factors, where the additional speed-up compared to theory is related to some other performance differences between our split and standard LS implementations). This plot would have a different appearance for other r ranges.

assuming M_r = 50. For standard LS with other random catalog sizes, the computation time estimate is

$\begin{matrix} (1 + 2 M_{r} + M_{r}^{2}) 3.03 s . \end{matrix}$ $\begin{aligned} (1+2\,M_{\rm r}+M_{\rm r}^2)\ 3.03\,\mathrm{s}. \end{aligned}$ (34)

5. Conclusions

The computational time of the standard Landy–Szalay estimator is dominated by the RR pairs. However, except at small scales where correlations are large, these make a negligible contribution to the expected error compared to the contribution from the DD and DR pairs. Therefore, a substantial saving of computation time with an insignificant loss of accuracy can be achieved by counting a smaller subset of RR pairs.

We considered two ways to reduce the number of RR pairs: dilution and split. In dilution, only a subset of the R catalog is used for RR pairs. In split, the R catalog is split into a number of smaller subcatalogs, and only pairs within each subcatalog are counted. We derived theoretical results for the additional estimator covariance and bias due to the finite size of the random catalog for these different variants of the LS estimator, extending in many ways the original results by Landy & Szalay (1993), who worked in the limit of an infinite random catalog. We tested our results using 300 mock data catalogs, representative of the z = 0.95–1.05 redshift range of the future Euclid survey. The split method maintains the property the Landy–Szalay estimator was designed for, namely cancelation of edge effects in bias and variance (for ξ = 0), whereas dilution loses this cancellation and therefore should not be used.

For small scales, where correlations are large, one should not reduce RR counts as much. The natural dividing line is the scale r where ξ(r) = 1. Interestingly, the difference in bias and covariance between the different estimators (split, dilution, and LS) vanishes when ξ = 1. We recommend the natural version of the split method, M_s = M_r, for large scales where |ξ|< 1. This leads to a saving in computation time by more than a factor of ten (assuming M_r = 50) with a negligible effect on variance and bias. For small scales, where ξ > 1, one should consider using a larger random catalog and one can use either the standard LS method or the split method with a more modest split factor. Because the number of pairs with these small separations is much smaller, the computation time is not a similar issue as for large separations.

The results of our analysis will have an impact also on the computationally more demanding task of covariance matrix estimation. However, since in that case the exact computational cost is determined by the balance of data catalogs and random catalogs, which does not need to be the same as for the individual two-point correlation estimate, we postpone a quantitative analysis to a future, dedicated study. The same kind of methods can be applied to higher-order statistics (three-point and four-point correlation functions) to speed up their estimation (Slepian & Eisenstein 2015).

¹

I. Zehavi, priv. comm.

Acknowledgments

We thank Will Percival and Cristiano Porciani for useful discussions. The 2PCF computations were done at the Euclid Science Data Center Finland (SDC-FI, urn:nbn:fi:research-infras-2016072529), for whose computational resources we thank CSC – IT Center for Science, the Finnish Grid and Cloud Computing Infrastructure (FGCI, urn:nbn:fi:research-infras-2016072533), and the Academy of Finland grant 292882. This work was supported by the Academy of Finland grant 295113. VL was supported by the Jenny and Antti Wihuri Foundation, AV by the Väisälä Foundation, AS by the Magnus Ehrnrooth Foundation, and JV by the Finnish Cultural Foundation. We also acknowledge travel support from the Jenny and Antti Wihuri Foundation. VA acknowledges funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 749348. FM acknowledges the grants ASI n.I/023/12/0 “Attivit‘a relative alla fase B2/C per la missione Euclid” and PRIN MIUR 2015 “Cosmology and Fundamental Physics: illuminating the Dark Universe with Euclid”.

References

Alam, S., Ata, M., Bailey, S., et al. 2017, MNRAS, 470, 2617 [NASA ADS] [CrossRef] [Google Scholar]
Alcock, C., & Paczyński, B. 1979, Nature, 281, 358 [NASA ADS] [CrossRef] [Google Scholar]
Alonso, D. 2012, ArXiv e-prints [arXiv:1210.1833] [Google Scholar]
Anderson, L., Aubourg, E., Bailey, S., et al. 2012, MNRAS, 427, 3435 [Google Scholar]
Anderson, L., Aubourg, E., Bailey, S., et al. 2014, MNRAS, 441, 24 [NASA ADS] [CrossRef] [Google Scholar]
Ata, M., Baumgarten, F., Bautista, J., et al. 2018, MNRAS, 473, 4773 [NASA ADS] [CrossRef] [Google Scholar]
Bautista, J. E., Vargas-Magaña, M., Dawson, K. S., et al. 2018, ApJ, 863, 110 [NASA ADS] [CrossRef] [Google Scholar]
Bernstein, G. M. 1994, ApJ, 424, 569 [NASA ADS] [CrossRef] [Google Scholar]
Beutler, F., Blake, C., Colless, M., et al. 2011, MNRAS, 416, 3017 [NASA ADS] [CrossRef] [Google Scholar]
Beutler, F., Blake, C., Colless, M., et al. 2012, MNRAS, 423, 3430 [NASA ADS] [CrossRef] [Google Scholar]
Blake, C., Kazin, E. A., Beutler, F., et al. 2011, MNRAS, 418, 1707 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
Cole, S., Percival, W. J., Peacock, J. A., et al. 2005, MNRAS, 362, 505 [NASA ADS] [CrossRef] [Google Scholar]
Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465 [NASA ADS] [CrossRef] [Google Scholar]
de la Torre, S., Jullo, E., Giocoli, C., et al. 2017, A&A, 608, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Demina, R., Cheong, S., BenZvi, S., & Hindrichs, O. 2018, MNRAS, 480, 49 [NASA ADS] [CrossRef] [Google Scholar]
Eisenstein, D. J., Zehavi, I., Hogg, D. W., et al. 2005, ApJ, 633, 560 [NASA ADS] [CrossRef] [Google Scholar]
Grieb, J. N., Sánchez, A. G., Salvador-Albornoz, S., & Dalla Vecchia, C. 2016, MNRAS, 457, 1577 [NASA ADS] [CrossRef] [Google Scholar]
Guzzo, L., Pierleoni, M., Meneux, B., et al. 2008, Nature, 451, 541 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
Hamilton, A. J. S. 1993, ApJ, 417, 19 [NASA ADS] [CrossRef] [Google Scholar]
Hewett, H. C. 1982, MNRAS, 201, 867 [NASA ADS] [CrossRef] [Google Scholar]
Hou, J., Sánchez, A. G., Scoccimarro, R., et al. 2018, MNRAS, 480, 2521 [Google Scholar]
Jarvis, M. 2015, Astrophysics Source Code Library [record ascl:1508.007] [Google Scholar]
Kaiser, N. 1987, MNRAS, 227, 1 [NASA ADS] [CrossRef] [Google Scholar]
Kerscher, M. 1999, A&A, 343, 333 [NASA ADS] [Google Scholar]
Kerscher, M., Szapudi, I., & Szalay, A. S. 2000, ApJ, 535, L13 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [NASA ADS] [CrossRef] [Google Scholar]
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]
Lippich, M., Sánchez, A. G., Colavincenzo, M., et al. 2019, MNRAS, 482, 1786 [NASA ADS] [CrossRef] [Google Scholar]
Marulli, F., Veropalumbo, A., & Moresco, M. 2016, Astron. Comput., 14, 35 [NASA ADS] [CrossRef] [Google Scholar]
Moore, A., Connolly, A., Genovese, C., et al. 2000, ArXiv e-prints [arXiv:astro-ph/0012333] [Google Scholar]
Peacock, J. A., Cole, S., Norberg, P., et al. 2001, Nature, 410, 169 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
Peebles, P. J. E., & Hauser, M. G. 1974, ApJS, 28, 19 [NASA ADS] [CrossRef] [Google Scholar]
Percival, W. J., Reid, B. A., Eisenstein, D. J., et al. 2010, MNRAS, 401, 2148 [NASA ADS] [CrossRef] [Google Scholar]
Pezzotta, A., de la Torre, S., Bel, J., et al. 2017, A&A, 604, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Reid, B. A., Samushia, L., White, M., et al. 2012, MNRAS, 426, 2719 [NASA ADS] [CrossRef] [Google Scholar]
Ross, A. J., Samushia, L., Howlett, C., et al. 2015, MNRAS, 449, 835 [NASA ADS] [CrossRef] [Google Scholar]
Ross, A. J., Beutler, F., Chuang, C. H., et al. 2017, MNRAS, 464, 1168 [NASA ADS] [CrossRef] [Google Scholar]
Ruggeri, R., Percival, W. J., Gil-Marín, H., et al. 2019, MNRAS, 483, 3878 [NASA ADS] [CrossRef] [Google Scholar]
Slepian, Z., & Eisenstein, D. J. 2015, MNRAS, 454, 4142 [NASA ADS] [CrossRef] [Google Scholar]
Vargas-Magaña, M., Ho, S., Cuesta, A. J., et al. 2018, MNRAS, 477, 1153 [NASA ADS] [CrossRef] [Google Scholar]
Wall, J. V., & Jenkins, C. R. 2012, Practical Statistics for Astronomers (Cambridge, UK: Cambridge University Press) [CrossRef] [Google Scholar]
Zarrouk, P., Burtin, E., Gil-Marín, H., et al. 2018, MNRAS, 477, 1639 [NASA ADS] [CrossRef] [Google Scholar]
Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2011, ApJ, 736, 59 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Derivation examples

As examples of how the variances of the different deviations, ⟨αβ⟩ and so on, are derived, following the method presented in LS93, we give three of the cases here: ⟨α₁α₂⟩ (common for all the variants of LS), and $〈 β_{1} γ_{2}^{'} 〉$ $\langle\beta_1\gamma^{\prime}_2\rangle$ and $〈 γ_{1}^{'} γ_{2}^{'} 〉$ $\langle\gamma^{\prime}_1\gamma^{\prime}_2\rangle$ for the split method. The rest are calculated in a similar manner.

To derive the ⟨DD₁DD₂⟩ appearing in the ⟨α₁α₂⟩ of (15) we start from

$\begin{matrix} ⟨ D D_{1} D D_{2} ⟩ = \sum_{i < j}^{K} \sum_{k < l}^{K} ⟨ n_{i} n_{j} n_{k} n_{l} ⟩ Θ^{ij} (r_{1}) Θ^{kl} (r_{2}), \end{matrix}$ $\begin{aligned} \langle DD_1\ DD_2\rangle = \sum _{i<j}^K \sum _{k<l}^K\langle n_in_jn_kn_l\rangle \Theta ^{ij}({\boldsymbol{r}}_1)\Theta ^{kl}({\boldsymbol{r}}_2), \end{aligned}$ (A.1)

where both i, j, and k, l sum over all microcell pairs; n_i = 1 or 0 is the number of galaxies in microcell i. There are three different cases for the terms ⟨n_in_jn_kn_l⟩ depending on how many indices are equal (i ≠ j and k ≠ l for all of them).

The first case (quadruplets, i.e., two pairs of microcells) is when i, j, k, l are all different. We use $\sum_{ijkl}^{*}$ $\sum\nolimits^*_{ijkl}$ to denote this part of the sum. There are $\frac{1}{2} K (K - 1) \times \frac{1}{2} (K - 2) (K - 3) = \frac{1}{4} K^{4}$ $\frac{1}{2} K(K-1)\times\frac{1}{2}(K-2)(K-3) = \frac{1}{4} K^4$ (we work in the limit K → ∞) such terms and they have

$\begin{matrix} ⟨ n_{i} n_{j} n_{k} n_{l} ⟩ \\ = \frac{N_{d}}{K} \frac{N_{d} - 1}{K - 1} \frac{N_{d} - 2}{K - 2} \frac{N_{d} - 3}{K - 3} 〈 (1 + δ_{i}) (1 + δ_{j}) (1 + δ_{k}) (1 + δ_{l}) 〉 \\ = \frac{N_{d} (N_{d} - 1) (N_{d} - 2) (N_{d} - 3)}{K^{4}} [1 + ⟨ δ_{i} δ_{j} ⟩ + ⟨ δ_{k} δ_{l} ⟩ + ⟨ δ_{i} δ_{k} ⟩ \\ + ⟨ δ_{i} δ_{l} ⟩ + ⟨ δ_{j} δ_{k} ⟩ + ⟨ δ_{j} δ_{l} ⟩ \\ + ⟨ δ_{i} δ_{j} δ_{k} ⟩ + ⟨ δ_{i} δ_{j} δ_{l} ⟩ + ⟨ δ_{i} δ_{k} δ_{l} ⟩ + ⟨ δ_{j} δ_{k} δ_{l} ⟩ + ⟨ δ_{i} δ_{j} δ_{k} δ_{l} ⟩] \\ = \frac{N_{d} (N_{d} - 1) (N_{d} - 2) (N_{d} - 3)}{K^{4}} [1 + ξ (r_{ij}) + ξ (r_{kl}) + ξ (r_{ik}) \\ + ξ (r_{il}) + ξ (r_{jk}) + ξ (r_{jl}) \\ + ζ (x_{i}, x_{j}, x_{k}) + ζ (x_{i}, x_{k}, x_{l}) + ζ (x_{i}, x_{j}, x_{l}) + ζ (x_{j}, x_{k}, x_{l}) \\ + ξ (r_{ij}) ξ (r_{kl}) + ξ (r_{ik}) ξ (r_{jl}) + ξ (r_{il}) ξ (r_{jk}) \\ + η (x_{i}, x_{j}, x_{k}, x_{l})], \end{matrix}$ $\begin{aligned}&\langle n_in_jn_kn_l\rangle \nonumber \\&\qquad = \frac{N_{\rm d}}{K}\frac{N_{\rm d} - 1}{K-1}\frac{N_{\rm d}-2}{K-2}\frac{N_{\rm d}-3}{K- 3}\Bigl \langle (1+\delta _i)(1+\delta _j)(1+\delta _k)(1+\delta _l)\Bigr \rangle \nonumber \\&\qquad = \frac{N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)(N_{\rm d}-3)}{K^4}\Bigl [1+\langle \delta _i\delta _j\rangle +\langle \delta _k\delta _l\rangle +\langle \delta _i\delta _k\rangle \nonumber \\&\qquad \quad +\langle \delta _i\delta _l\rangle +\langle \delta _j\delta _k\rangle +\langle \delta _j\delta _l\rangle \nonumber \\&\qquad \quad + \langle \delta _i\delta _j\delta _k\rangle +\langle \delta _i\delta _j\delta _l\rangle +\langle \delta _i\delta _k\delta _l\rangle +\langle \delta _j\delta _k\delta _l\rangle +\langle \delta _i\delta _j\delta _k\delta _l\rangle \Bigr ] \nonumber \\&\qquad = \frac{N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)(N_{\rm d} -3)}{K^4}\Bigl [1+\xi ({\boldsymbol{r}}_{ij})+\xi ({\boldsymbol{r}}_{kl})+\xi ({\boldsymbol{r}}_{ik})\nonumber \\&\qquad \quad + \xi ({\boldsymbol{r}}_{il})+\xi ({\boldsymbol{r}}_{jk})+\xi ({\boldsymbol{r}}_{jl})\nonumber \\&\qquad \quad + \zeta ({\boldsymbol{x}}_i,{\boldsymbol{x}}_j,{\boldsymbol{x}}_k)+\zeta ({\boldsymbol{x}}_i,{\boldsymbol{x}}_k,{\boldsymbol{x}}_l)+\zeta ({\boldsymbol{x}}_i,{\boldsymbol{x}}_j,{\boldsymbol{x}}_l)+\zeta ({\boldsymbol{x}}_j,{\boldsymbol{x}}_k,{\boldsymbol{x}}_l) \nonumber \\&\qquad \quad + \xi ({\boldsymbol{r}}_{ij})\xi ({\boldsymbol{r}}_{kl})+\xi ({\boldsymbol{r}}_{ik})\xi ({\boldsymbol{r}}_{jl})+\xi ({\boldsymbol{r}}_{il})\xi ({\boldsymbol{r}}_{jk}) \nonumber \\&\qquad \quad +\eta ({\boldsymbol{x}}_i,{\boldsymbol{x}}_j,{\boldsymbol{x}}_k,{\boldsymbol{x}}_l)\Bigr ], \end{aligned}$ (A.2)

where δ_i := δ(x_i) is the density perturbation (relative to the actual mean density of galaxies in the survey), ζ is the three-point correlation, and η is the connected (i.e., the part that does not arise from the two-point correlation) four-point correlation. The fraction of microcell quadruplets (pairs of pairs) that satisfy r_ij ≡ x_j − x_i ∈ r₁ and r_kl ∈ r₂ is $G^{p} (r_{1}) G^{p} (r_{2}) = : G_{1}^{p} G_{2}^{p}$ $G^{\rm p}({\boldsymbol{r}}_1)G^{\rm p}({\boldsymbol{r}}_2) =: G^{\rm p}_1\,G^{\rm p}_2$ . In the limit of large K the number of other index quadruplets is negligible compared to those where all indices have different values, so we have

$\begin{matrix} \sum_{ijkl}^{*} Θ_{1}^{ij} Θ_{2}^{kl} = \frac{K (K - 1) (K - 2) (K - 3)}{4} G_{1}^{p} G_{2}^{p} = \frac{K^{4}}{4} G_{1}^{p} G_{2}^{p} . \end{matrix}$ $\begin{aligned} \sum ^*_{ijkl} \Theta ^{ij}_1\Theta ^{kl}_2 = \frac{K(K-1)(K-2)(K-3)}{4}G^\mathrm{p}_1\,G^\mathrm{p}_2 = \frac{K^4}{4}G^\mathrm{p}_1\,G^\mathrm{p}_2. \end{aligned}$ (A.3)

For the connected pairs (i, j) and (k, l) we have ξ(r_ij)=ξ(r₁)≡ξ₁ and ξ(r_kl)=ξ(r₂)≡ξ₂.

The second case (triplets of microcells) is when k or l is equal to i or j. We denote this part of the sum by

$\begin{matrix} \sum_{ijk}^{*} ⟨ n_{i} n_{j} n_{k} ⟩ Θ_{1}^{ik} Θ_{2}^{jk} . \end{matrix}$ $\begin{aligned} \sum ^*_{ijk}\langle n_in_jn_k\rangle \Theta _1^{ik}\Theta _2^{jk}. \end{aligned}$ (A.4)

It turns out that it goes over all ordered combinations of {i, j, k}, where i, j, k are all different, exactly once, so there are K(K − 1)(K − 2)=K³ such terms (triplets). Here

$\begin{matrix} ⟨ n_{i} n_{j} n_{k} ⟩ & = \frac{N_{d}}{K} \frac{N_{d} - 1}{K - 1} \frac{N_{d} - 2}{K - 2} 〈 (1 + δ_{i}) (1 + δ_{j}) (1 + δ_{k}) 〉 \\ = \frac{N_{d} (N_{d} - 1) (N_{d} - 2)}{K^{3}} [1 + ⟨ δ_{i} δ_{k} ⟩ + ⟨ δ_{j} δ_{k} ⟩ + ⟨ δ_{i} δ_{j} ⟩ + ⟨ δ_{i} δ_{j} δ_{k} ⟩] \\ = \frac{N_{d} (N_{d} - 1) (N_{d} - 2)}{K^{3}} [1 + ξ (r_{ik}) + ξ (r_{jk}) + ξ (r_{ij}) + ζ (x_{i}, x_{j}, x_{k})], \end{matrix}$ $\begin{aligned} \langle n_in_jn_k\rangle&= \frac{N_{\rm d}}{K}\frac{N_{\rm d}-1}{K-1}\frac{N_{\rm d}-2}{K-2}\Bigl \langle (1+\delta _i)(1+\delta _j)(1+\delta _k)\Bigr \rangle \nonumber \\&= \frac{N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)}{K^3}\Bigl [1 +\langle \delta _i\delta _k\rangle +\langle \delta _j\delta _k\rangle +\langle \delta _i\delta _j\rangle + \langle \delta _i\delta _j\delta _k\rangle \Bigr ] \nonumber \\&= \frac{N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)}{K^3}\Bigl [1+\xi ({\boldsymbol{r}}_{ik})+\xi ({\boldsymbol{r}}_{jk})+\xi ({\boldsymbol{r}}_{ij})+\zeta ({\boldsymbol{x}}_i,{\boldsymbol{x}}_j,{\boldsymbol{x}}_k)\Bigr ] , \end{aligned}$ (A.5)

and

$\begin{matrix} \sum_{ijk}^{*} Θ_{1}^{ik} Θ_{2}^{jk} = K^{3} G_{12}^{t} . \end{matrix}$ $\begin{aligned} \sum ^*_{ijk}\Theta ^{ik}_1\Theta ^{jk}_2 = K^3G^\mathrm{t}_{12}. \end{aligned}$ (A.6)

The third case (pairs of microcells) is when i = k and j = l. This part of the sum becomes

$\begin{matrix} \sum_{i < j} ⟨ n_{i} n_{j} ⟩ Θ_{1}^{ij} Θ_{2}^{ij}, \end{matrix}$ $\begin{aligned} \sum _{i<j} \langle n_in_j\rangle \Theta ^{ij}_1\Theta ^{ij}_2, \end{aligned}$ (A.7)

where

$\begin{matrix} ⟨ n_{i} n_{j} ⟩ = \frac{N_{d} (N_{d} - 1)}{K^{2}} [1 + ξ (r_{ij})], \end{matrix}$ $\begin{aligned} \langle n_in_j\rangle = \frac{N_{\rm d}(N_{\rm d}-1)}{K^2}\left[1+\xi ({\boldsymbol{r}}_{ij})\right] , \end{aligned}$ (A.8)

and

$\begin{matrix} \sum_{i < j} Θ_{1}^{ij} Θ_{2}^{ij} = δ_{12} \frac{K^{2}}{2} G_{1}^{p}, \end{matrix}$ $\begin{aligned} \sum _{i<j}\Theta ^{ij}_1\Theta ^{ij}_2 = \delta _{12}\frac{K^2}{2}G^\mathrm{p}_1 , \end{aligned}$ (A.9)

that is, the sum vanishes unless the two bins are the same, r₁ = r₂.

We now apply the three approximations listed in Sect. 2.2: (1) ξ(r_ik)=ξ(r_il)=ξ(r_jk)=ξ(r_jl) = 0 in (A.2); (2) ζ = 0 in (A.2) and (A.5); (3) η = 0 in (A.2). We obtain

$\begin{matrix} ⟨ D D_{1} D D_{2} ⟩ ≃ & \frac{1}{4} N_{d} (N_{d} - 1) (N_{d} - 2) (N_{d} - 3) (1 + ξ_{1}) (1 + ξ_{2}) G_{1}^{p} G_{2}^{p} \\ + N_{d} (N_{d} - 1) (N_{d} - 2) (1 + ξ_{1} + ξ_{2} + ξ_{12}) G_{12}^{t} \\ + \frac{1}{2} N_{d} (N_{d} - 1) δ_{12} (1 + ξ_{1}) G_{1}^{p}, \end{matrix}$ $\begin{aligned} \langle DD_1\ DD_2 \rangle \simeq &\frac{1}{4} N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)(N_{\rm d}-3)(1+\xi _1)(1+\xi _2)G^\mathrm{p}_1\,G^\mathrm{p}_2 \nonumber \\& + N_{\rm d}(N_{\rm d}-1)(N_{\rm d}-2)(1+\xi _1+\xi _2+\xi _{12})G^\mathrm{t}_{12} \nonumber \\& + \frac{1}{2} N_{\rm d}(N_{\rm d}-1)\delta _{12}(1+\xi _1)G^\mathrm{p}_1, \end{aligned}$ (A.10)

and using ⟨DD⟩ from (3) we arrive at the ⟨α₁α₂⟩ result given in Eq. (15).

For the split method, we give the calculation for

$\begin{matrix} ⟨ γ_{1}^{'} β_{2}^{'} ⟩ & = \frac{⟨ R R_{1}^{'} D R_{2} ⟩ - ⟨ R R_{1}^{'} ⟩ ⟨ D R_{2} ⟩}{⟨ R R_{1}^{'} ⟩ ⟨ D R_{2} ⟩}, \\ ⟨ γ_{1}^{'} γ_{2}^{'} ⟩ & = \frac{⟨ R R_{1}^{'} R R_{2}^{'} ⟩ - ⟨ R R_{1}^{'} ⟩ ⟨ R R_{2}^{'} ⟩}{⟨ R R_{1}^{'} ⟩ ⟨ R R_{2}^{'} ⟩} \end{matrix}$ $\begin{aligned} \langle \gamma ^{\prime }_1\beta ^{\prime }_2\rangle&= \frac{\langle RR^{\prime }_1 \ DR_2 \rangle - \langle RR^{\prime }_1 \rangle \langle DR_2 \rangle }{\langle RR^{\prime }_1 \rangle \langle DR_2 \rangle },\nonumber \\ \langle \gamma ^{\prime }_1\gamma ^{\prime }_2\rangle&= \frac{\langle RR^{\prime }_1\ RR^{\prime }_2 \rangle - \langle RR^{\prime }_1\rangle \langle RR^{\prime }_2\rangle }{\langle RR^{\prime }_1\rangle \langle RR^{\prime }_2\rangle } \end{aligned}$ (A.11)

below.

First the $〈 γ_{1}^{'} β_{2} 〉$ $\langle\gamma^{\prime}_1\beta_2\rangle$ :

$\begin{matrix} ⟨ R R_{1}^{'} D R_{2} ⟩ = \sum_{μ = 1}^{M_{s}} ⟨ R^{μ} R_{1}^{μ} D R_{2} ⟩ = M_{s} ⟨ R^{μ} R_{1}^{μ} D R_{2} ⟩, \end{matrix}$ $\begin{aligned} \langle RR^{\prime }_1 \ DR_2 \rangle = \sum _{\mu =1}^{M_{\rm s}}\langle R^\mu R^\mu _1\ DR_2\rangle = M_{\rm s}\langle R^\mu R^\mu _1\ DR_2\rangle , \end{aligned}$ (A.12)

where

$\begin{matrix} ⟨ R^{μ} R_{1}^{μ} D R_{2} ⟩ = \sum_{i < j} \sum_{k \neq l} ⟨ s_{i} s_{j} n_{k} r_{l} ⟩ Θ_{1}^{ij} Θ_{2}^{kl}, \end{matrix}$ $\begin{aligned} \langle R^\mu R^\mu _1\ DR_2\rangle = \sum _{i<j}\sum _{k\ne l}\langle s_is_jn_kr_l\rangle \Theta ^{ij}_1\Theta ^{kl}_2 , \end{aligned}$ (A.13)

s_i is the number (0 or 1) of R^μ objects in microcell i, and r_l is the number of R objects in microcell l.

There are $\frac{1}{2} K^{4}$ $\frac{1}{2} K^4$ quadruplet terms, for which (see (A.3)) $\sum^{*} Θ Θ = \frac{1}{2} K^{4} G_{1}^{p} G_{2}^{p}$ $\sum\nolimits^\ast\Theta\Theta = \frac{1}{2} K^4G^{\mathrm{p}}_1\,G^{\mathrm{p}}_2$ , and

$\begin{matrix} ⟨ s_{i} s_{j} n_{k} r_{l} ⟩ = \frac{N_{r}^{'} (N_{r}^{'} - 1) N_{d} (N_{r}^{'} - 2)}{K^{4}} \cdot \end{matrix}$ $\begin{aligned} \langle s_is_jn_kr_l\rangle = \frac{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)N_{\rm d}(N^{\prime }_{\rm r}-2)}{K^4}\cdot \end{aligned}$ (A.14)

Triplets where i or j is equal to k have ⟨s_is_kn_kr_l⟩ = 0, since s_kn_k = 0 always (we cannot have two objects, from R^μ and D, in the same cell). There are K³ triplet terms where i or j is equal to l: for them $\sum^{*} Θ Θ = K^{3} G_{12}^{t}$ $\sum\nolimits^\ast\Theta\Theta = K^3\,G^{\mathrm{t}}_{12}$ , and

$\begin{matrix} ⟨ s_{i} s_{l} n_{k} r_{l} ⟩ = ⟨ s_{i} s_{l} n_{k} ⟩ = \frac{N_{r}^{'} (N_{r}^{'} - 1) N_{d}}{K^{3}}, \end{matrix}$ $\begin{aligned} \langle s_is_ln_kr_l\rangle = \langle s_is_ln_k\rangle = \frac{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)N_{\rm d}}{K^3}, \end{aligned}$ (A.15)

where if s_l = 1 then also r_l = 1 since R^μ ⊂ R.

Pairs where (i, j)=(k, l) or (i, j)=(l, k) have ⟨s_ks_ln_kr_l⟩ = 0, since again we cannot have two different objects in the same cell. Thus

$\begin{matrix} ⟨ R R_{1}^{'} D R_{2} ⟩ = \frac{1}{2} N_{d} N_{r} (N_{r}^{'} - 1) (N_{r}^{'} - 2) G_{1}^{p} G_{2}^{p} + N_{d} N_{r} (N_{r}^{'} - 1) G_{12}^{t}, \end{matrix}$ $\begin{aligned} \langle RR^{\prime }_1 \ DR_2 \rangle = \frac{1}{2} N_{\rm d}N_{\rm r}(N^{\prime }_{\rm r}-1)(N^{\prime }_{\rm r}-2)G^\mathrm{p}_1\,G^\mathrm{p}_2 + N_{\rm d}N_{\rm r}(N^{\prime }_{\rm r}-1)G^\mathrm{t}_{12}, \end{aligned}$ (A.16)

and we obtain that

$\begin{matrix} ⟨ γ_{1}^{'} β_{2} ⟩ = \frac{2}{N_{r}^{'}} [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] = 2 t_{12}^{r}, \end{matrix}$ $\begin{aligned} \langle \gamma ^{\prime }_1\beta _2\rangle = \frac{2}{N^{\prime }_{\rm r}}\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] \ = \ 2\,t^\mathrm{r}_{12} , \end{aligned}$ (A.17)

which is equal to the ⟨γ₁β₂⟩ of standard LS.

Then the $〈 γ_{1}^{'} γ_{2}^{'} 〉$ $\langle\gamma^{\prime}_1\gamma^{\prime}_2\rangle$ :

$\begin{matrix} ⟨ R R_{1}^{'} R R_{2}^{'} ⟩ = \sum_{μ = 1}^{M_{s}} \sum_{ν = 1}^{M_{s}} ⟨ R^{μ} R_{1}^{μ} \cdot R^{ν} R_{2}^{ν} ⟩, \end{matrix}$ $\begin{aligned} \langle RR^{\prime }_1\ RR^{\prime }_2 \rangle = \sum _{\mu =1}^{M_{\rm s}}\sum _{\nu =1}^{M_{\rm s}}\langle R^\mu R^\mu _1\cdot R^\nu R^\nu _2\rangle , \end{aligned}$ (A.18)

where there are M_s(M_s − 1) terms with μ ≠ ν giving

$\begin{matrix} ⟨ R^{μ} R_{1}^{μ} R^{ν} R_{2}^{ν} ⟩ = ⟨ R^{μ} R_{1}^{μ} ⟩ ⟨ R^{ν} R_{2}^{ν} ⟩ = \frac{1}{4} {(N_{r}^{'})}^{2} {(N_{r}^{'} - 1)}^{2} G_{1}^{p} G_{2}^{p}, \end{matrix}$ $\begin{aligned} \langle R^\mu R^\mu _1 \ R^\nu R^\nu _2\rangle = \langle R^\mu R^\mu _1\rangle \langle R^\nu R^\nu _2\rangle = \frac{1}{4} (N^{\prime }_{\rm r})^2(N^{\prime }_{\rm r}-1)^2G^\mathrm{p}_1\,G^\mathrm{p}_2,\qquad \end{aligned}$ (A.19)

and M_s terms with μ = ν giving

$\begin{matrix} ⟨ R^{μ} R_{1}^{μ} R^{μ} R_{2}^{μ} ⟩ = & \frac{1}{4} N_{r}^{'} (N_{r}^{'} - 1) (N_{r}^{'} - 2) (N_{r}^{'} - 3) G_{1}^{p} G_{2}^{p} \\ + N_{r}^{'} (N_{r}^{'} - 1) (N_{r}^{'} - 2) G_{12}^{t} \\ + \frac{1}{2} N_{r}^{'} (N_{r}^{'} - 1) δ_{12} G_{1}^{p} . \end{matrix}$ $\begin{aligned} \langle R^\mu R^\mu _1\ R^\mu R^\mu _2\rangle =&\frac{1}{4} N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)(N^{\prime }_{\rm r}-2)(N^{\prime }_{\rm r}-3)G^\mathrm{p}_1\,G^\mathrm{p}_2\nonumber \\& + N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)(N^{\prime }_{\rm r}-2)G^\mathrm{t}_{12} \nonumber \\& + \frac{1}{2} N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)\delta _{12}G^\mathrm{p}_1. \end{aligned}$ (A.20)

Adding these up gives

$\begin{matrix} ⟨ R R_{1}^{'} R R_{2}^{'} ⟩ = & \frac{1}{4} N_{r}^{'} (N_{r}^{'} - 1) (N_{r}^{'} N_{r}^{'} - N_{r}^{'} - 4 N_{r}^{'} + 6) G_{1}^{p} G_{2}^{p} \\ + N_{r}^{'} (N_{r}^{'} - 1) (N_{r}^{'} - 2) G_{12}^{t} \\ + \frac{1}{2} N_{r}^{'} (N_{r}^{'} - 1) δ_{12} G_{1}^{p} \end{matrix}$ $\begin{aligned} \langle RR^{\prime }_1\ RR^{\prime }_2 \rangle =&\frac{1}{4} N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)(N^{\prime }_{\rm r} N^{\prime }_{\rm r}-N^{\prime }_{\rm r}-4\,N^{\prime }_{\rm r}+6)G^\mathrm{p}_1\,G^\mathrm{p}_2 \nonumber \\& + N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)(N^{\prime }_{\rm r}-2)G^\mathrm{t}_{12} \nonumber \\& + \frac{1}{2} N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)\delta _{12}G^\mathrm{p}_1 \end{aligned}$ (A.21)

and

$\begin{matrix} ⟨ γ_{1}^{'} γ_{2}^{'} ⟩ & = \frac{2}{N_{r}^{'} (N_{r}^{'} - 1)} {2 (N_{r}^{'} - 2) [\frac{G_{12}^{t}}{G_{1}^{p} G_{2}^{p}} - 1] + \frac{δ_{12}}{G_{1}^{p}} - 1} \\ = 4 t_{12}^{r} + p_{12}^{r^{'}} . \end{matrix}$ $\begin{aligned} \langle \gamma ^{\prime }_1\gamma ^{\prime }_2\rangle&= \frac{2}{N^{\prime }_{\rm r}(N^{\prime }_{\rm r}-1)}\left\{ 2(N^{\prime }_{\rm r}-2)\left[\frac{G^\mathrm{t}_{12}}{G^\mathrm{p}_1\,G^\mathrm{p}_2} - 1\right] + \frac{\delta _{12}}{G^\mathrm{p}_1} - 1\right\} \nonumber \\&= 4\,t^\mathrm{r}_{12} + p^{\mathrm{r}^{\prime }}_{12} . \end{aligned}$ (A.22)

This differs both from standard LS and from dilution, since it involves both $N_{r}^{'}$ $N^{\prime}_{\rm r}$ and $N_{r}^{'}$ $N^{\prime}_{\rm r}$ .

All Tables

Table 1.

Mean computation time over the 300 runs and the mean variance over four different ranges of r bins (given in units of h⁻¹ Mpc) for each method.

In the text

All Figures

Fig. 1.

Mean ξ(r) estimate and the scatter and theoretical bias of the estimates for different estimators. The dash-dotted line, our theoretical result for the scatter of the LS method, underestimates the scatter, since higher-order correlations in the D catalog are ignored. The dotted line is without the contribution of the q terms, and is dominated by the Poisson (p) terms. The bias is multiplied by 100 so the curves can be displayed in a more compact plot. For the measured mean and scatter, and the theoretical bias we plot standard LS in black, dilution with d = 0.14 in red, and split with M_s = 50 in blue. For the mean and scatter the difference between the methods is not visible in this plot. The differences in the mean estimate are shown in Fig. 5. The differences in scatter (or its square, the variance) are shown in Fig 3. For the theoretical bias the difference between split and dilution is not visible at small r (ξ(r) > 1), where the bias is positive.

In the text

	Fig. 2. Quantities p, p_c, p_r, q, q_r, t, and t_r for the Minerva shell. The values for the first bin are noisy. The vertical red line marks r = L.
In the text

	Fig. 3. Measured difference from LS of the variance of different estimators, multiplied by r². Dashed lines are our theoretical results.
In the text

	Fig. 4. Theoretical estimate of the scatter of the ξ estimates divided by the scatter in the $N_{r}^{'} \to \infty$ $N^{\prime}_{\rm r}\rightarrow\infty$ limit. The dotted lines correspond to 0.3%, 0.5%, and 1% increase in scatter. For r < 10 h⁻¹ Mpc there is hardly any difference between split and dilution, the curves lie on top of each other; whereas for larger r split is much better.
In the text

Fig. 5.

Differences between the mean ξ(r) estimate and that from the LS, multiplied by r to better display all scales. This measured difference is not the true bias, which is too small to measure with 300 mocks, and is mainly due to random error of the mean. The results for dilution appear to reveal a systematic bias, but this is just due to strong error correlations between nearby bins; for different subsets of the 300 mocks the mean difference is completely different.

In the text

Fig. 6.

Measured variance (mean variance over the range r = 80 − 120 h⁻¹ Mpc) vs. computational cost (mean computation time) for the different methods (markers with error bars) and our theoretical prediction (solid lines). The solid lines (blue for the split method, red for dilution, and black for standard LS with M_r ≤ 50) are our theoretical predictions for the increase in variance and computation time ratio when compared to the standard LS, M_r = 50, case, and the dots on the curves correspond to the measured cases (except for LS they are, from right to left, M_r = 25, 12.5, and (50/7); only the first of which was measured). The curve for split ends at M_s = 2500; the optimal case, M_s = M_r, is the circled dot. The error bars for the variance measurement are naive estimates that do not account for error correlations between bins. The theoretical predictions overestimate the cost savings (data points are to the right of the dots on curves; except for the smaller split factors, where the additional speed-up compared to theory is related to some other performance differences between our split and standard LS implementations). This plot would have a different appearance for other r ranges.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Alam, S., Ata, M., Bailey, S., et al. 2017, MNRAS, 470, 2617 [NASA ADS] [CrossRef] [Google Scholar]

[2] Alcock, C., & Paczyński, B. 1979, Nature, 281, 358 [NASA ADS] [CrossRef] [Google Scholar]

[3] Alonso, D. 2012, ArXiv e-prints [arXiv:1210.1833] [Google Scholar]

[4] Anderson, L., Aubourg, E., Bailey, S., et al. 2012, MNRAS, 427, 3435 [Google Scholar]

[5] Anderson, L., Aubourg, E., Bailey, S., et al. 2014, MNRAS, 441, 24 [NASA ADS] [CrossRef] [Google Scholar]

[6] Ata, M., Baumgarten, F., Bautista, J., et al. 2018, MNRAS, 473, 4773 [NASA ADS] [CrossRef] [Google Scholar]

[7] Bautista, J. E., Vargas-Magaña, M., Dawson, K. S., et al. 2018, ApJ, 863, 110 [NASA ADS] [CrossRef] [Google Scholar]

[8] Bernstein, G. M. 1994, ApJ, 424, 569 [NASA ADS] [CrossRef] [Google Scholar]

[9] Beutler, F., Blake, C., Colless, M., et al. 2011, MNRAS, 416, 3017 [NASA ADS] [CrossRef] [Google Scholar]

[10] Beutler, F., Blake, C., Colless, M., et al. 2012, MNRAS, 423, 3430 [NASA ADS] [CrossRef] [Google Scholar]

[11] Blake, C., Kazin, E. A., Beutler, F., et al. 2011, MNRAS, 418, 1707 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]

[12] Cole, S., Percival, W. J., Peacock, J. A., et al. 2005, MNRAS, 362, 505 [NASA ADS] [CrossRef] [Google Scholar]

[13] Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465 [NASA ADS] [CrossRef] [Google Scholar]

[14] de la Torre, S., Jullo, E., Giocoli, C., et al. 2017, A&A, 608, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[15] Demina, R., Cheong, S., BenZvi, S., & Hindrichs, O. 2018, MNRAS, 480, 49 [NASA ADS] [CrossRef] [Google Scholar]

[16] Eisenstein, D. J., Zehavi, I., Hogg, D. W., et al. 2005, ApJ, 633, 560 [NASA ADS] [CrossRef] [Google Scholar]

[17] Grieb, J. N., Sánchez, A. G., Salvador-Albornoz, S., & Dalla Vecchia, C. 2016, MNRAS, 457, 1577 [NASA ADS] [CrossRef] [Google Scholar]

[18] Guzzo, L., Pierleoni, M., Meneux, B., et al. 2008, Nature, 451, 541 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]

[19] Hamilton, A. J. S. 1993, ApJ, 417, 19 [NASA ADS] [CrossRef] [Google Scholar]

[20] Hewett, H. C. 1982, MNRAS, 201, 867 [NASA ADS] [CrossRef] [Google Scholar]

[21] Hou, J., Sánchez, A. G., Scoccimarro, R., et al. 2018, MNRAS, 480, 2521 [Google Scholar]

[22] Jarvis, M. 2015, Astrophysics Source Code Library [record ascl:1508.007] [Google Scholar]

[23] Kaiser, N. 1987, MNRAS, 227, 1 [NASA ADS] [CrossRef] [Google Scholar]

[24] Kerscher, M. 1999, A&A, 343, 333 [NASA ADS] [Google Scholar]

[25] Kerscher, M., Szapudi, I., & Szalay, A. S. 2000, ApJ, 535, L13 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]

[26] Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [NASA ADS] [CrossRef] [Google Scholar]

[27] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]

[28] Lippich, M., Sánchez, A. G., Colavincenzo, M., et al. 2019, MNRAS, 482, 1786 [NASA ADS] [CrossRef] [Google Scholar]

[29] Marulli, F., Veropalumbo, A., & Moresco, M. 2016, Astron. Comput., 14, 35 [NASA ADS] [CrossRef] [Google Scholar]

[30] Moore, A., Connolly, A., Genovese, C., et al. 2000, ArXiv e-prints [arXiv:astro-ph/0012333] [Google Scholar]

[31] Peacock, J. A., Cole, S., Norberg, P., et al. 2001, Nature, 410, 169 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]

[32] Peebles, P. J. E., & Hauser, M. G. 1974, ApJS, 28, 19 [NASA ADS] [CrossRef] [Google Scholar]

[33] Percival, W. J., Reid, B. A., Eisenstein, D. J., et al. 2010, MNRAS, 401, 2148 [NASA ADS] [CrossRef] [Google Scholar]

[34] Pezzotta, A., de la Torre, S., Bel, J., et al. 2017, A&A, 604, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[35] Reid, B. A., Samushia, L., White, M., et al. 2012, MNRAS, 426, 2719 [NASA ADS] [CrossRef] [Google Scholar]

[36] Ross, A. J., Samushia, L., Howlett, C., et al. 2015, MNRAS, 449, 835 [NASA ADS] [CrossRef] [Google Scholar]

[37] Ross, A. J., Beutler, F., Chuang, C. H., et al. 2017, MNRAS, 464, 1168 [NASA ADS] [CrossRef] [Google Scholar]

[38] Ruggeri, R., Percival, W. J., Gil-Marín, H., et al. 2019, MNRAS, 483, 3878 [NASA ADS] [CrossRef] [Google Scholar]

[39] Slepian, Z., & Eisenstein, D. J. 2015, MNRAS, 454, 4142 [NASA ADS] [CrossRef] [Google Scholar]

[40] Vargas-Magaña, M., Ho, S., Cuesta, A. J., et al. 2018, MNRAS, 477, 1153 [NASA ADS] [CrossRef] [Google Scholar]

[41] Wall, J. V., & Jenkins, C. R. 2012, Practical Statistics for Astronomers (Cambridge, UK: Cambridge University Press) [CrossRef] [Google Scholar]

[42] Zarrouk, P., Burtin, E., Gil-Marín, H., et al. 2018, MNRAS, 477, 1639 [NASA ADS] [CrossRef] [Google Scholar]

[43] Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2011, ApJ, 736, 59 [NASA ADS] [CrossRef] [Google Scholar]