KiDS+VIKING+GAMA: Halo occupation distributions and correlations of satellite numbers with a new halo model of the galaxy-matter bispectrum for galaxy-galaxy-galaxy lensing

Laila Linke; Patrick Simon; Peter Schneider; Daniel J. Farrow; Jens Rödiger; Angus H. Wright

doi:10.1051/0004-6361/202243711

Home

All issues

Volume 665 (September 2022)

A&A, 665 (2022) A38

Full HTML

Open Access

Issue		A&A Volume 665, September 2022


Article Number		A38
Number of page(s)		35
Section		Extragalactic astronomy
DOI		https://doi.org/10.1051/0004-6361/202243711
Published online		08 September 2022

A&A 665, A38 (2022)

KiDS+VIKING+GAMA: Halo occupation distributions and correlations of satellite numbers with a new halo model of the galaxy-matter bispectrum for galaxy-galaxy-galaxy lensing

Laila Linke¹, Patrick Simon¹, Peter Schneider¹, Daniel J. Farrow²^,3, Jens Rödiger¹ and Angus H. Wright⁴

¹ Argelander-Institut für Astronomie, Rheinische Friedrich-Wilhems Universität Bonn, Auf dem Hügel 71, 53121 Bonn, Germany
e-mail: llinke@astro.uni-bonn.de
² Max-Planck-Institut für extraterrestriche Physik, Giessenbachstrasse 1, 85748 Garching, Germany
³ Universitäts-Sternwarte, Fakultät für Physik, Ludwig-Maximilians-Universität München, Scheinerstr. 1, 81679 München, Germany
⁴ Ruhr University Bochum, Faculty of Physics and Astronomy, Astronomical Institute (AIRUB), German Centre for Cosmological Lensing, 44780 Bochum, Germany

Received: 5 April 2022
Accepted: 28 June 2022

Abstract

Context. Halo models and halo occupation distributions (HODs) are important tools to model the distribution of galaxies and matter.

Aims. We present and assess a new method for constraining the parameters of HODs using the mean gravitational lensing shear around galaxy pairs, so-called galaxy-galaxy-galaxy lensing (G3L). In contrast to galaxy-galaxy lensing, G3L is also sensitive to the correlations between the per-halo numbers of galaxies from different populations. We employed our G3L halo model to probe these correlations and test the default hypothesis that they are negligible.

Methods. We derived a halo model for G3L and validated it with realistic mock data from the Millennium Simulation and a semi-analytic galaxy model. Then, we analysed public data from the Kilo-Degree Survey (KiDS), the VISTA Infrared Kilo-Degree Galaxy Survey (VIKING) and data from the Galaxy And Mass Assembly Survey (GAMA) to infer the HODs of galaxies at z < 0.5 in five different stellar mass bins between 10^8.5h⁻² M_⊙ and 10^11.5h⁻² M_⊙ and two colours (red and blue), as well as correlations between satellite numbers.

Results. The analysis accurately recovers the true HODs in the simulated data for all galaxy samples within the 68% credibility range. The model best fits agree with the observed G3L signal on the 95% confidence level. The inferred HODs vary significantly with colour and stellar mass. In particular, red galaxies prefer more massive halos ≳10¹² M_⊙, while blue galaxies are present in halos ≳10¹¹ M_⊙. There is strong evidence (> 3σ) for a high correlation, increasing with halo mass, between the numbers of red and blue satellites and between galaxies with stellar masses below 10¹⁰ M_⊙.

Conclusions. Our G3L halo model accurately constrains galaxy HODs for lensing surveys of up to 10³ deg² and redshift below 0.5 probed here. Analyses of future surveys may need to include non-Poisson variances of satellite numbers or a revised model for central galaxies. Correlations between satellite numbers are ubiquitous between various galaxy samples and are relevant for halos with masses ≳10¹³ M_⊙, that is, of galaxy-group scale and more massive. Possible causes of these correlations are the selection of similar galaxies in different samples, the survey flux limit, or physical mechanisms such as a fixed ratio between the satellite numbers of distinct populations. The decorrelation for halos with smaller masses is probably an effect of shot noise by low-occupancy halos. The inferred HODs can be used to complement galaxy-galaxy lensing or galaxy-clustering HOD studies or as input to cosmological analyses and improved mock galaxy catalogues.

Key words: gravitational lensing: weak / galaxies: halos / cosmology: observations / large-scale structure of Universe

© L. Linke et al. 2022

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1. Introduction

Accurate models of the distribution of galaxies inside the cosmic large-scale structure are crucial to understanding the physics of galaxy evolution and inferring cosmological parameters from galaxy surveys. Popular frameworks for analytically describing the galaxy and matter distribution are halo models (e.g. Cooray & Sheth 2002; Scoccimarro et al. 2001; Kravtsov et al. 2004; Zheng et al. 2005). Their key ingredient is the halo occupation distribution (HOD), which gives the expected number of galaxies inside a halo of a given mass (Berlind & Weinberg 2002). Here we present a new method to accurately infer galaxy HODs with a halo model for galaxy-galaxy-galaxy lensing (G3L) – the mean gravitational lensing shear around galaxy pairs (Schneider & Watts 2005). We demonstrate that with this higher-order statistic, we can obtain the HODs of various galaxy samples selected, for example, by their colour, in current galaxy surveys. Additionally, in contrast to other lensing methods, G3L can probe the correlation of per-halo numbers of satellite galaxies from different populations. In a first application, we infer the HODs and correlations of various galaxy selections in the Kilo-Degree Survey (KiDS; Kuijken et al. 2015), the VISTA Infrared Kilo-degree Galaxy survey (VIKING; Edge et al. 2013; Venemans et al. 2015), and the Galaxy And Mass Assembly survey (GAMA; Driver et al. 2009; Liske et al. 2015).

Halo models are the backbone of many analytical expressions for the statistics of the large-scale structure. They postulate that matter is distributed in virialised halos, and galaxies only exist within these halos (White & Rees 1978). For known halo density profiles and HODs, they can predict all statistics of the galaxy and matter distributions (Cooray & Sheth 2002). Therefore, with halo models, we can infer HODs from the measured galaxy and matter statistics, such as galaxy clustering (Zehavi et al. 2011; Ishikawa et al. 2021) and the galaxy-matter power spectrum (Mandelbaum et al. 2006; Clampitt et al. 2016; Dvornik et al. 2018).

Since halo models are analytic, calculating their predictions is faster and easier than more complex approaches such as semi-analytic galaxy models (SAMs; e.g. Henriques et al. 2015) or hydrodynamical simulations (Vogelsberger et al. 2020). Even though they rely on simple assumptions, they can accurately describe second-order statistics of galaxies and matter. For example, halo model predictions for the galaxy-galaxy two-point correlation function agree well with observations (Zehavi et al. 2011). Mead et al. (2015) shows that halo models can also describe the non-linear matter power spectrum in N-body simulations with 5% accuracy.

Although halo models are prevalent for two-point statistics, it is unclear whether they are sufficiently accurate for modelling higher-order statistics, such as the galaxy- and matter bispectrum. These higher-order statistics contain complementary information to the two-point statistics (Berlind & Weinberg 2002) so it is worthwhile to expand halo models to include them to improve and cross-validate constraints from second-order statistics. Here we extend halo models to measurements of G3L (Simon et al. 2008, 2013; Linke et al. 2020a), a third-order statistic.

The G3L signal is induced on a background galaxy (the ‘source’) by the weak gravitational lensing of matter around a pair of foreground galaxies (the ‘lenses’). It directly depends on the galaxy-matter bispectrum integrated over the spread of lenses along the line-of-sight (Schneider & Watts 2005). Unlike galaxy-galaxy lensing, G3L is sensitive to the mean number of galaxy pairs inside halos and, therefore, the correlation of halo satellite numbers. If the galaxies in a lens pair belong to different samples, the G3L signal is higher for positively correlated satellite numbers and lower if they are anti-correlated. While the default assumption in the literature is that satellite numbers are uncorrelated (Scranton 2001, 2002; Zehavi et al. 2005), some galaxy-clustering studies suggest a correlation between galaxy populations, such as red and blue galaxies (Zehavi et al. 2011; Ross & Brunner 2009; Wang et al. 2007; Simon et al. 2009).

We challenge the default assumption by constructing a halo model for the galaxy-galaxy-matter bispectrum and the G3L signal, inferring the HODs and satellite correlations for various galaxy samples. Our model is based on the approaches by Zheng et al. (2007), Zehavi et al. (2011) and Clampitt et al. (2017), but goes further by including the correlation between the satellite numbers of different galaxy samples. We validate the model and our inference procedure with G3L estimates in a simulated lensing survey based on the Millennium Simulation (MS; Springel et al. 2005) populated with SAM galaxies by Henriques et al. (2015). As a first real-data application, we analyze G3L measurements in the overlap region of KiDS, VIKING, and GAMA, with lenses selected from GAMA and sources from KiDS+VIKING (Linke et al. 2020a). We infer the HODs of galaxies in five different stellar mass bins between 10^8.5 to 10^11.5 h⁻² M_⊙ and two colours (red and blue), as well as the cross-correlations between the per-halo satellite numbers for these galaxy samples.

This paper is structured as follows: In Sect. 2, we give an overview of the basics of G3L. We present our halo model for G3L in Sect. 3. The simulated and observed data sets are described in Sect. 4. Section 5 describes the estimator for the G3L signal and our analysis of the measurements. We present our results in Sect. 6 and discuss them in Sect. 7.

In the analysis of the MS, we use the cosmological parameters of this simulation, namely Ω_m = 0.25, Ω_b = 0.045, H₀ = 73 km s⁻¹ Mpc⁻¹, and σ₈ = 0.9. For the analysis of the observations, we use the parameters from the Planck Collaboration I (2020), namely Ω_m = 0.315, Ω_b = 0.049, H₀ = 67.4 km s⁻¹ Mpc⁻¹, and σ₈ = 0.811. Throughout we assume a flat cosmology with Ω_Λ = 1 − Ω_m.

2. Theory of galaxy-galaxy-galaxy lensing

G3L is a weak gravitational lensing effect (see, e.g. Bartelmann & Schneider 2001 for a review on weak lensing), first described by Schneider & Watts (2005, SW05 hereafter). There are two types of this effect: The lensing of background galaxy pairs by matter around individual foreground galaxies (lens-shear-shear G3L) and the lensing of individual background galaxies by matter around foreground galaxy pairs (lens-lens-shear G3L). We concentrate on the latter.

Figure 1 shows the geometric configuration for a lens-lens-shear G3L system on the sky. A background galaxy (‘source’), located at angular position θ, is gravitationally lensed by matter around two foreground galaxies (‘lenses’), located at θ + ϑ₁ and θ + ϑ₂. Due to the lensing, the source experiences a tangential shear γ_t, which is measured with respect to the bisector of the angle ϕ between the source-lens connections. The main observables for G3L are the three-point correlation function ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ and the closely related aperture statistics ⟨𝒩^a𝒩^bℳ⟩, where a and b denote the considered galaxy populations. We describe the observables in Sects. 2.2 and 2.3. To model ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ and ⟨𝒩^a𝒩^bℳ⟩, we first introduce the galaxy-galaxy-matter bispectrum and its relation to the matter and galaxy number density distributions.

Fig. 1.

Geometry of a G3L configuration with one source and two lens galaxies on the sky; adapted from Schneider & Watts (2005). Lens galaxies are at angular positions θ₁ = θ + ϑ₁ and θ₂ = θ + ϑ₂ on the sky; the source galaxy is at θ. The angle between the source-lens connections is the opening angle ϕ.

2.1. Projected fields and the galaxy-galaxy-matter bispectrum

The distribution of matter and galaxies is defined by the density ρ(x, z) and discrete galaxy number density $n_{g}^{a} (x, z)$ $n_\mathrm{g}^{a}(\boldsymbol{x}, z)$ at comoving position x and redshift z. The subscript a refers to the ‘sample’ of the galaxies, chosen with the same selection function, for example early- or late-type galaxies. Fluctuations in the densities ρ and $n_{g}^{a}$ $n_\mathrm{g}^a$ are the matter- and galaxy number density contrast, δ(x, z) and $δ_{g}^{a} (x, z)$ $\delta^a_\mathrm{g}(\boldsymbol{x}, z)$ , defined as

$\begin{matrix} δ (x, z) = \frac{ρ (x, z) - \bar{ρ} (z)}{\bar{ρ} (z)}, \end{matrix}$ $\begin{aligned} \delta (\boldsymbol{x},z)=\frac{\rho (\boldsymbol{x},z)-\bar{\rho }(z)}{\bar{\rho }(z)}, \end{aligned}$ (1)

and

$\begin{matrix} δ_{g}^{a} (x, z) = \frac{n_{g}^{a} (x, z) - \bar{n_{g}^{a}} (z)}{\bar{n_{g}^{a}} (z)}, \end{matrix}$ $\begin{aligned} \delta ^{a}_\mathrm{g} (\boldsymbol{x},z)=\frac{n_\mathrm{g} ^{a}(\boldsymbol{x},z)-\overline{n_\mathrm{g} ^a}(z)}{\overline{n_\mathrm{g} ^a}(z)}, \end{aligned}$ (2)

where bars denote ensemble averages.

Since G3L involves the correlation of galaxy pairs with the matter field, it is natural to derive its model from the galaxy-galaxy-matter bispectrum $B_{gg δ}^{a b} (k_{1}, k_{2}; k_{3}, z)$ $B_{\text{gg}\delta }^{ab}({{\mathbf{k}}_{1}},{{\mathbf{k}}_{2}};{{\mathbf{k}}_{3}},z)$ at comoving modes k₁, k₂, and k₃ and redshift z. This bispectrum is defined by

$\begin{matrix} ⟨ {\hat{δ}}_{g}^{a} (k_{1}, z) {\hat{δ}}_{g}^{b} (k_{2}, z) \hat{δ} (k_{3}, z) ⟩ \\ = {(2 π)}^{3} δ_{D} (k_{1} + k_{2} + k_{3}) B_{gg δ}^{ab} (k_{1}, k_{2} ; k_{3}, z) \end{matrix}$ $\begin{aligned}&\langle {\hat{\delta }_\mathrm{g} ^{a}(\boldsymbol{k}_1, z)\,\hat{\delta }_\mathrm{g} ^{b}(\boldsymbol{k}_2, z)\, \hat{\delta }(\boldsymbol{k}_3, z)}\rangle \; \\&\nonumber =(2\pi )^3\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\, B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2; \boldsymbol{k}_3, z) \end{aligned}$ (3)

where hats denote Fourier transforms for which we use the convention

$\begin{matrix} \hat{δ} (k) = \int [3] x δ (x) exp (- i k \cdot x) . \end{matrix}$ $\begin{aligned} \hat{\delta }(\boldsymbol{k})=\int [3]{x}\; \delta (\boldsymbol{x})\, \exp (-\mathrm{i}\boldsymbol{k}\cdot \boldsymbol{x}). \end{aligned}$ (4)

The bispectrum is defined only for k₃ = −k₁ − k₂, so we abbreviate

$\begin{matrix} B_{gg δ}^{ab} (k_{1}, k_{2} ; - k_{1} - k_{2}, z) = : B_{gg δ}^{ab} (k_{1}, k_{2}, z) . \end{matrix}$ $\begin{aligned} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2; -\boldsymbol{k}_1-\boldsymbol{k}_2, z) =: B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z). \end{aligned}$ (5)

As a gravitational lensing effect, G3L depends mainly on the projections κ and $κ_{g}^{a}$ $\kappa^a_\mathrm{g}$ of δ and $δ_{g}^{a}$ $\delta^a_\mathrm{g}$ along the line-of-sight. In a flat universe the lensing convergence is

$\begin{matrix} κ (θ) = \frac{3 H_{0}^{2} Ω_{m}}{2 c^{2}} \int_{0}^{\infty} χ q (χ) χ \frac{δ (x (θ, χ), z (χ))}{a (χ)}, \end{matrix}$ $\begin{aligned} \kappa (\boldsymbol{\theta })= \frac{3H_0^2\, \Omega _\mathrm{m} }{2c^2}\, \int _0^\infty {\chi }\; q(\chi )\, \chi \, \frac{\delta \Big (\boldsymbol{x}(\boldsymbol{\theta }, \chi ), z(\chi )\Big )}{a(\chi )}, \end{aligned}$ (6)

where χ is the comoving distance, z(χ) is the redshift at χ, x(θ, χ) = (χ θ, χ), and q(χ) is an integral over the probability distribution function p_s(χ) of source galaxies with comoving distance χ, given as

$\begin{matrix} q (χ) = \int_{χ}^{\infty} χ^{'} p_{s} (χ^{'}) \frac{χ^{'} - χ}{χ^{'}} . \end{matrix}$ $\begin{aligned} q(\chi )=\int _{\chi }^{\infty } {\chi ^\prime }\; p_\mathrm{s} (\chi ^\prime )\, \frac{\chi ^\prime -\chi }{\chi ^\prime }. \end{aligned}$ (7)

The projected galaxy number density contrast is given by

$\begin{matrix} κ_{g}^{a} (θ) = \int_{0}^{\infty} χ p^{a} (χ) δ_{g}^{a} (x (θ, χ), z (χ)), \end{matrix}$ $\begin{aligned} \kappa _\mathrm{g} ^a(\boldsymbol{\theta }) = \int _0^\infty {\chi }\; p^a(\chi )\, \delta _\mathrm{g} ^a\Big (\boldsymbol{x}(\boldsymbol{\theta }, \chi ), z(\chi )\Big ), \end{aligned}$ (8)

where p^a(χ) is the probability distribution function of lens galaxies from sample a in comoving distance. This distribution strongly depends on the selection criteria for sample a. With p^a(χ) we also define the projected galaxy number density $N_{g}^{a}$ $N_\mathrm{g}^a$ as

$\begin{matrix} N_{g}^{a} (θ) = \int_{0}^{\infty} χ p^{a} (χ) n_{g}^{a} (χ (θ, χ), z (χ)) . \end{matrix}$ $\begin{aligned} N_\mathrm{g} ^a(\boldsymbol{\theta }) = \int _0^\infty {\chi }\; p^a(\chi )\, n_\mathrm{g} ^a\Big (\boldsymbol{\chi }(\boldsymbol{\theta }, \chi ), z(\chi )\Big ). \end{aligned}$ (9)

To arrive at a G3L signal, we convert the galaxy-galaxy-matter bispectrum $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ to its projected counterpart $b_{gg κ}^{a b}$ ${{\mathit{b}^{ab}_{\mathrm{gg}\kappa}}}$ , defined by

$\begin{matrix} ⟨ {\hat{κ}}_{g}^{a} (ℓ_{1}) {\hat{κ}}_{g}^{b} (ℓ_{2}) \hat{κ} (ℓ_{3}) ⟩ = {(2 π)}^{2} δ_{D} (ℓ_{1} + ℓ_{2} + ℓ_{3}) b_{gg κ}^{ab} (ℓ_{1}, ℓ_{2}, ℓ_{3}), \end{matrix}$ $\begin{aligned} \langle {\hat{\kappa }^a_\mathrm{g} (\boldsymbol{\ell }_1)\, \hat{\kappa }^b_\mathrm{g} (\boldsymbol{\ell }_2)\, \hat{\kappa }(\boldsymbol{\ell }_3)}\rangle = (2\pi )^2\, \delta _{\rm D}\left(\boldsymbol{\ell }_1 + \boldsymbol{\ell }_2 + \boldsymbol{\ell }_3\right)\, b _{\mathrm{gg} \kappa }^{ab}(\boldsymbol{\ell }_1, \boldsymbol{\ell }_2, \boldsymbol{\ell }_3), \end{aligned}$ (10)

where hats again denote Fourier transforms. Similar as for $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ , we abbreviate

$\begin{matrix} b_{gg κ}^{ab} (ℓ_{1}, ℓ_{2}, - ℓ_{1} - ℓ_{2}) = : b_{gg κ}^{ab} (ℓ_{1}, ℓ_{2}) . \end{matrix}$ $\begin{aligned} b _{\mathrm{gg} \kappa }^{ab}(\boldsymbol{\ell }_1, \boldsymbol{\ell }_2, -\boldsymbol{\ell }_1-\boldsymbol{\ell }_2) =: b _{\mathrm{gg} \kappa }^{ab}(\boldsymbol{\ell }_1, \boldsymbol{\ell }_2). \end{aligned}$ (11)

As shown in Kaiser (1992), under the assumptions of the Limber equation, the projected bispectrum is

$\begin{matrix} b_{gg κ}^{ab} (ℓ_{1}, ℓ_{2}) & = \frac{3 H_{0}^{2} Ω_{m}}{2 c^{2}} \int χ \frac{q (χ) p^{a} (χ) p^{b} (χ)}{χ^{3} a (χ)} \\ \times B_{gg δ}^{ab} (ℓ_{1}, ℓ_{2}, z (χ)), \end{matrix}$ $\begin{aligned} b _{\mathrm{gg} \kappa }^{ab}(\boldsymbol{\ell }_1, \boldsymbol{\ell }_2)&= \frac{3H_0^2\, \Omega _\mathrm{m} }{2c^2} \int {\chi } \frac{q(\chi )\, p^a(\chi )\, p^b(\chi )}{\chi ^3\, a(\chi )}\\&\nonumber \quad \times B _{\mathrm{gg} \delta }^{ab}\Big (\boldsymbol{\ell }_1, \boldsymbol{\ell }_2, z(\chi )\Big ), \end{aligned}$ (12)

which we use to model the observables of G3L: the correlation function ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ and the aperture statistics ⟨𝒩^a𝒩^bℳ⟩. Equation (12) requires that the product q(χ) p^a(χ) p^b(χ) in the integrand does not vary strongly on scales smaller than the typical correlation length (i.e. a few megaparsecs) of the galaxy distribution (Bartelmann & Schneider 2001).

2.2. G3L correlation function

The G3L correlation function ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ correlates the projected lens galaxy number densities $N_{g}^{a}$ $N_\mathrm{g}^a$ and $N_{g}^{b}$ $N_\mathrm{g}^b$ of two lens galaxy samples a and b with the tangential shear γ_t(θ) of the source galaxies,

$\begin{matrix} {\tilde{G}}^{ab} (ϑ_{1}, ϑ_{2}, ϕ) = \frac{1}{\bar{N_{g}^{a}} \bar{N_{g}^{b}}} ⟨ N_{g}^{a} (θ + ϑ_{1}) N_{g}^{b} (θ + ϑ_{2}) γ_{t} (θ) ⟩ . \end{matrix}$ $\begin{aligned} \tilde{\mathcal{G} }^{ab}(\vartheta _1, \vartheta _2, \phi )=\frac{1}{\overline{N_\mathrm{g} ^a}\,\overline{N_\mathrm{g} ^b}}\,\langle { N_\mathrm{g} ^{a}(\boldsymbol{\theta }+\boldsymbol{\vartheta }_1)\, N_\mathrm{g} ^b(\boldsymbol{\theta }+\boldsymbol{\vartheta }_2) \, \gamma _{\rm t}(\boldsymbol{\theta })}{\rangle }. \end{aligned}$ (13)

Due to the statistical homogeneity and isotropy of the matter and galaxy fields, ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ only depends on ϑ₁ = |ϑ₁| and ϑ₂ = |ϑ₂| of the lens-source separations, and the angle ϕ between ϑ₁ and ϑ₂. For a ≠ b, the two lenses in each G3L configuration are from two different samples (mixed lens pairs); otherwise, they are from the same sample (unmixed lens pairs).

As shown in SW05, ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ can be calculated from the projected galaxy-galaxy-matter bispectrum $b_{gg κ}^{a b}$ ${{\mathit{b}^{ab}_{\mathrm{gg}\kappa}}}$ (cf. their Eq. (40)), by integrating $b_{gg κ}^{a b}$ ${{\mathit{b}^{ab}_{\mathrm{gg}\kappa}}}$ (ℓ₁, ℓ₂), multiplied with a complex kernel function containing the second-order Bessel function. Consequently, ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ can be modelled from the bispectrum and directly compared to measurements. However, it is numerically preferable to consider the G3L aperture statistics, a linear transform of $\tilde{G}$ ${\tilde{\mathcal{G}}}$ . We describe them in the following subsection.

2.3. G3L aperture statistics

Aperture statistics are moments of aperture masses ℳ_θ(ϑ) and aperture number counts $N_{θ}^{p} (ϑ)$ ${\mathcal{N}}_\theta^p({\boldsymbol{\vartheta}})$ ,

$\begin{matrix} M_{θ} (ϑ) = \int [2] ϑ^{'} U_{θ} (| ϑ - ϑ^{'} |) κ (ϑ^{'}) \end{matrix}$ $\begin{aligned} {\mathcal{M} }_{\theta }(\boldsymbol{\vartheta }) = \int [2]{\vartheta^\prime } U_\theta (|\boldsymbol{\vartheta }-\boldsymbol{\vartheta }^\prime |)\,\kappa (\boldsymbol{\vartheta }^\prime ) \end{aligned}$ (14)

and

$\begin{matrix} N_{θ}^{a} (ϑ) = \frac{1}{\bar{N_{g}^{a}}} \int [2] ϑ^{'} U_{θ} (| ϑ - ϑ^{'} |) N_{g}^{a} (ϑ^{'}), \end{matrix}$ $\begin{aligned} {\mathcal{N} }_{\theta }^a(\boldsymbol{\vartheta }) = \frac{1}{\overline{N_\mathrm{g} ^a}} \int [2]{\vartheta^\prime } U_\theta (|\boldsymbol{\vartheta }-\boldsymbol{\vartheta }\prime |)\,N_\mathrm{g} ^a(\boldsymbol{\vartheta }^\prime ), \end{aligned}$ (15)

with a filter function U_θ of aperture scale radius θ. The function U_θ has to be a compensated filter, that is ∫ⅆϑ ϑ U_θ(ϑ) = 0. For G3L, the relevant aperture statistics are ⟨𝒩^a𝒩^bℳ⟩, given by

$\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ_{1}, θ_{2}, θ_{3}) = ⟨ N_{θ_{1}}^{a} (ϑ) N_{θ_{2}}^{b} (ϑ) M_{θ_{3}} (ϑ) ⟩ . \end{matrix}$ $\begin{aligned}&\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1, \theta _2, \theta _3) = \langle {{\mathcal{N} }_{\theta _1}^a(\boldsymbol{\vartheta })\,{\mathcal{N} }_{\theta _2}^b(\boldsymbol{\vartheta })\, {\mathcal{M} }_{\theta _3}(\boldsymbol{\vartheta })}{\rangle }. \end{aligned}$ (16)

The aperture statistics are related to the bispectrum $b_{gg κ}^{a b}$ ${{\mathit{b}^{ab}_{\mathrm{gg}\kappa}}}$ by

$\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ_{1}, θ_{2}, θ_{3}) \\ = \int \frac{[2] ℓ_{1}}{{(2 π)}^{2}} \int \frac{[2] ℓ_{2}}{{(2 π)}^{2}} {\hat{U}}_{θ_{1}} (ℓ_{1}) {\hat{U}}_{θ_{2}} (ℓ_{2}) {\hat{U}}_{θ_{3}} (| ℓ_{1} + ℓ_{2} |) b_{gg κ}^{ab} (ℓ_{1}, ℓ_{2}), \end{matrix}$ $\begin{aligned}&\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1, \theta _2, \theta _3) \\&\nonumber = \int \frac{[2]{\ell _1}}{(2\pi )^2} \int \frac{[2]{\ell _2}}{(2\pi )^2} \; \hat{U}_{\theta _1}(\ell _1)\, \hat{U}_{\theta _2}(\ell _2)\, \hat{U}_{\theta _3}(|\boldsymbol{\ell }_1+\boldsymbol{\ell }_2|)\, b _{\mathrm{gg} \kappa }^{ab}(\boldsymbol{\ell }_1, \boldsymbol{\ell }_2), \end{aligned}$ (17)

where ${\hat{U}}_{θ}$ $\hat{U}_\theta$ is the Fourier transform of U_θ. We choose the exponential filter function by Crittenden et al. (2002),

$\begin{matrix} U_{θ} (ϑ) = \frac{1}{2 π θ} (1 - \frac{ϑ^{2}}{2 θ^{2}}) exp (- \frac{ϑ^{2}}{2 θ^{2}}), \end{matrix}$ $\begin{aligned} U_\theta (\vartheta )=\frac{1}{2\pi \theta } \, \left(1-\frac{\vartheta ^2}{2\theta ^2}\right)\, \exp (-\frac{\vartheta ^2}{2\theta ^2}), \end{aligned}$ (18)

which is commonly used for studies of higher-order aperture statistics (e.g. Schneider et al. 2005; Jarvis et al. 2004; Simon et al. 2009; Saghiha et al. 2017) due to its favourable analytical properties. In particular, SW05 show that for this filter function the correlation function ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ can be connected analytically to ⟨𝒩^a𝒩^bℳ⟩ through

$\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ_{1}, θ_{2}, θ_{3}) \\ = \int_{0}^{\infty} ϑ_{1} ϑ_{1} \int_{0}^{\infty} ϑ_{2} ϑ_{2} \int_{0}^{2 π} ϕ {\tilde{G}}^{ab} (ϑ_{1}, ϑ_{2}, ϕ) \\ \times A_{N N M} (ϑ_{1}, ϑ_{2}, ϕ ∣ θ_{1}, θ_{2}, θ_{3}), \end{matrix}$ $\begin{aligned}&\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1, \theta _2, \theta _3) \nonumber \\&= \int _{0}^{\infty } {\vartheta _1} \, \vartheta _1 \int _{0}^{\infty } {\vartheta _2} \, \vartheta _2 \int _{0}^{2\pi } {\phi } \, \tilde{\mathcal{G} }^{ab}(\vartheta _1, \vartheta _2, \phi )\,\\&\qquad \times \mathcal{A} _{{\mathcal{N} }{\mathcal{N} }{\mathcal{M} }}(\vartheta _1, \vartheta _2, \phi \mid \theta _1, \theta _2, \theta _3)\nonumber , \end{aligned}$ (19)

with the kernel function 𝒜_𝒩𝒩ℳ(ϑ₁, ϑ₂, ϕ ∣ θ₁, θ₂, θ₃) given in the appendix of SW05.

As alluded to before, there are practical advantages to modelling ⟨𝒩^a𝒩^bℳ⟩ instead of ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ . First, evaluating Eq. (17) is numerically more stable than calculating $\tilde{G}$ ${\tilde{\mathcal{G}}}$ from the bispectrum because the filter function U_θ is localised and non-oscillating. Second, in contrast to $\tilde{G}$ ${\tilde{\mathcal{G}}}$ , the aperture statistics do not depend on the galaxy-matter two-point correlation function. Therefore, they do not need a model of the galaxy-galaxy lensing signal ⟨γ_t⟩. Third, a ⟨𝒩^a𝒩^bℳ⟩ data vector is a condensed summary statistic, where a few aperture radii (∼tens) contain a similar amount of information as $\tilde{G}$ ${\tilde{\mathcal{G}}}$ over hundreds of bins.

Consequently, we use the aperture statistics ⟨𝒩^a𝒩^bℳ⟩ as the primary observable and model it with Eq. (17), based on a halo-model based bispectrum $b_{gg κ}^{a b}$ ${{\mathit{b}^{ab}_{\mathrm{gg}\kappa}}}$ . To measure the aperture statistics, we estimate ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ (see Sect. 5.1) and convert it to aperture statistics with Eq. (19). We focus on the equilateral statistics, that is, θ₁ = θ₂ = θ₃, to reduce the size of the data vector and use the shorthand

$\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ) : = ⟨ N^{a} N^{b} M ⟩ (θ, θ, θ), \end{matrix}$ $\begin{aligned} \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta ):=\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta , \theta , \theta ), \end{aligned}$ (20)

for convenience. We describe the ‘ingredients’ of our halo model and the derivation of the model bispectrum in the next section.

3. G3L halo model

In this section, we derive the galaxy-galaxy-matter bispectrum based on the halo model assumption, from which we can predict the G3L aperture statistics with Eq. (17). For this derivation, we require several ingredients: the linear matter power spectrum and critical density contrast, the halo density profile, the halo mass function (HMF), the halo bias, the spatial distribution of galaxies within a halo, and the first- and second-order moments of the HOD. We give details on these ingredients next.

3.1. Ingredients

Halo models assume that all matter and galaxies are within self-bound, virialized halos. Halos form in regions at redshift z where the linear density contrast of matter exceeds the critical density contrast δ_c(z). They reach the virial density $ρ_{vir} = Δ (z) \bar{ρ}$ $\rho_{\mathrm{vir}}=\Delta(z)\,\bar{\rho}$ , where $\bar{ρ}$ $\bar{\rho}$ is the cosmic mean density of matter and Δ(z) is the fractional overdensity within the virialized region.

The first ingredients are the linear matter power spectrum P_lin and the critical density contrast δ_c. We use the P_lin by Eisenstein & Hu (1998) that includes baryonic effects, and the fitting formula by Nakamura & Suto (1997) for the critical density contrast,

$\begin{matrix} δ_{c} (z) & = \frac{3}{20} {(12 π)}^{2 / 3} [1 + 0.012299 {log}_{10} (1 + \frac{Ω_{m}^{- 1} - 1}{{(1 + z)}^{3}})], \end{matrix}$ $\begin{aligned} \delta _\mathrm{c} (z)&= \frac{3}{20}\,(12\pi )^{2/3} \left[ 1 + 0.012299 \log _{10}\left(1+ \frac{\Omega _\mathrm{m} ^{-1} -1}{(1+z)^3}\right)\right], \end{aligned}$ (21)

which was derived for flat ΛCDM universes. For Ω_m = 1, it reduces to the value for an Einstein–de Sitter Universe, δ_c ≃ 1.69.

Second, we require the density profile ρ(|r − r₀| | m, z) of a halo with mass m and redshift z, centred at r₀. We assume halos follow (truncated) Navarro-Frenk-White profiles (NFW; Navarro et al. 1996), given as

$\begin{matrix} ρ (r | m, z) & = \frac{200 \bar{ρ} (z)}{3 \frac{r}{r_{200}} {[\frac{1}{c (m, z)} + \frac{r}{r_{200}}]}^{2}} \end{matrix}$ $\begin{aligned} \rho (r \,|\, m, z)&= \frac{200\,\bar{\rho }(z)}{3 \frac{r}{r_{200}}\left[\frac{1}{c(m,z)} + \frac{r}{r_{200}}\right]^2} \end{aligned}$ (22)

$\begin{matrix} \times \frac{1 + c (m, z)}{ln (1 + c (m, z)) [1 + c (m, z)] - c} \\ = m u (r | m, z), \end{matrix}$ $\begin{aligned} &\quad \times \frac{1+c(m,z)}{\ln \left(1+c(m, z)\right)[1+c(m,z)] - c } \nonumber \\&= m\, u(r\,|\,m, z), \end{aligned}$ (23)

where r₂₀₀ is the radius of a sphere around the halo centre, in which the mean density¹ is 200 times the Universes mean density, m is the mass inside this sphere, c is the halo concentration parameter and u is the normalised density profile.

We model the mass and redshift dependence of c with the fitting formula by Bullock et al. (2001),

$\begin{matrix} c (m, z) = \frac{c_{0}}{1 + z} {(\frac{m}{m_{⋆} (z)})}^{- α}, \end{matrix}$ $\begin{aligned} c(m,z) = \frac{c_0}{1+z}\left(\frac{m}{m_\star (z)}\right)^{-\alpha }, \end{aligned}$ (24)

with c₀ = 9 and α = 0.13. The mass m_⋆ is that enclosed by a sphere of radius r_⋆,

$\begin{matrix} m_{⋆} (z) = \frac{4 π}{3} \bar{ρ} (z) r_{⋆}^{3} (z), \end{matrix}$ $\begin{aligned} m_\star (z) = \frac{4\pi }{3}\,\bar{\rho }(z) r_\star ^3(z), \end{aligned}$ (25)

where r_⋆ is the scale at which the standard deviation σ(r, z) of linear density fluctuations is equal to the critical overdensity δ_c, that is σ(r_⋆, z) = δ_c(z). The σ(r, z) is given by the convolution of the linear matter power spectrum P_lin(k, z) with the Fourier transformed tophat filter $\hat{W} (x)$ $\hat{W}(x)$ ,

$\begin{matrix} σ^{2} (r, z) = 2 π \int k k^{2} P_{lin} (k, z) {\hat{W}}^{2} (r k) . \end{matrix}$ $\begin{aligned} \sigma ^2(r, z) = 2\pi \int {k} \; k^2\, P_\mathrm{lin} (k, z)\, \hat{W}^2\Big (r\,k\Big ). \end{aligned}$ (26)

The third group of ingredients are the HMF n(m, z)ⅆm and the halo bias. The HMF describes the comoving number density of dark matter halos with mass between m and m + ⅆm. We use the HMF by Sheth & Tormen (1999),

$\begin{matrix} n (m, z) m & = A \frac{\bar{ρ}}{m^{2}} ln ν ln m m \\ \times [1 + \frac{1}{{(q ν)}^{2 p}}] \sqrt{\frac{{(q ν)}^{2}}{2 π}} exp (- \frac{{(q ν)}^{2}}{2}), \end{matrix}$ $\begin{aligned} n(m, z)\, {m}&= A\, \frac{\bar{\rho }}{m^2} \,{\ln \nu }{\ln m}\, {m}\\&\nonumber \quad \times \left[1 + \frac{1}{(q\,\nu )^{2p}}\right]\sqrt{\frac{(q\,\nu )^2}{2\pi }}\, \exp (-\frac{(q\nu )^2}{2}), \end{aligned}$ (27)

where the parameters A = 0.322, p = 0.3, and q = 0.707 were found in N-body simulations, and ν = δ_c/σ(r_m, z) for $r_{m}^{3} = 3 m {(4 π \bar{ρ})}^{- 1}$ $r_m^3=3m\,(4\pi\bar{\rho})^{-1}$ .

The halo bias quantifies the clustering of halos by the ratio of the halo density contrast, δ_h, and the matter density contrast, δ. We assume a linear halo bias,

$\begin{matrix} δ_{h} (x, z | m) & = b_{1} (m, z) δ (x, z), \end{matrix}$ $\begin{aligned} \delta _\mathrm{h} (\boldsymbol{x}, z\,|\,m)&= b_1(m, z)\, \delta (\boldsymbol{x}, z), \end{aligned}$ (28)

neglecting terms higher than linear in δ. Using the so-called peak-background split formalism (Mo & White 1996; Scoccimarro et al. 2001) the linear bias b₁ is

$\begin{matrix} b_{1} (m, z) = 1 + \frac{q ν^{2} (z) - 2}{δ_{c} (z)} + \frac{2 p}{1 + q^{p} ν^{2 p} (z)} \frac{1}{δ_{c} (z)}, \end{matrix}$ $\begin{aligned} b_1(m, z) = 1 + \frac{q\,\nu ^2(z) -2}{\delta _\mathrm{c} (z)} + \frac{2p}{1+q^p\,\nu ^{2p}(z)} \, \frac{1}{\delta _\mathrm{c} (z)}, \end{aligned}$ (29)

with the q and p as in the Sheth-Tormen HMF. Assuming a deterministic halo bias, the halo power spectrum is then

$\begin{matrix} P_{h} (k, z | m_{1}, m_{2}) = b_{1} (m_{1}, z) b_{1} (m_{2}, z) P_{lin} (k, z), \end{matrix}$ $\begin{aligned} P_\mathrm{h} (k, z\,|\,m_1, m_2) = b_1(m_1, z)\, b_1(m_2, z)\, P_\mathrm{lin} (k, z), \end{aligned}$ (30)

and the halo bispectrum is

$\begin{matrix} B_{h} (k_{1}, k_{2}, z | m_{1}, m_{2}, m_{3}) \\ = b_{1} (m_{1}, z) b_{1} (m_{2}, z) b_{1} (m_{3}, z) B_{lin} (k_{1}, k_{2}, z) \end{matrix}$ $\begin{aligned}&B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k_2}, z \,|\,m_1, m_2, m_3) \\&\nonumber = b_1(m_1, z)\, b_1(m_2, z)\, b_1(m_3, z)\, B_\mathrm{lin} (\boldsymbol{k}_1, \boldsymbol{k}_2, z) \end{aligned}$ (31)

for the linear matter bispectrum by Bernardeau et al. (2002),

$\begin{matrix} B_{lin} (k_{1}, k_{2}, z) & = 2 F (k_{1}, k_{2}) P (k_{1}, z) P (k_{2}, z) \\ + 2 F (k_{1}, k_{3}) P (k_{1}, z) P (k_{3}, z) \\ + 2 F (k_{2}, k_{3}) P (k_{2}, z) P (k_{3}, z), \end{matrix}$ $\begin{aligned} B_\mathrm{lin} (\boldsymbol{k}_1, \boldsymbol{k}_2, z)&= 2\, F(\boldsymbol{k}_1, \boldsymbol{k}_2)\, P(k_1, z)\, P(k_2, z)\\&\quad + 2\, F(\boldsymbol{k}_1, \boldsymbol{k}_3)\, P(k_1, z)\, P(k_3, z)\nonumber \\&\quad + 2\, F(\boldsymbol{k}_2, \boldsymbol{k}_3)\, P(k_2, z)\, P(k_3, z)\nonumber , \end{aligned}$ (32)

where k₃ = −k₁ − k₂ and

$\begin{matrix} F (k_{1}, k_{2}) = \frac{5}{7} + \frac{2}{7} \frac{{(k_{1} \cdot k_{2})}^{2}}{k_{1}^{2} k_{2}^{2}} + \frac{1}{2} \frac{k_{1} \cdot k_{2}}{k_{1} k_{2}} (\frac{k_{1}}{k_{2}} + \frac{k_{2}}{k_{1}}) . \end{matrix}$ $\begin{aligned} F(\boldsymbol{k}_1, \boldsymbol{k}_2) = \frac{5}{7} + \frac{2}{7}\, \frac{(\boldsymbol{k}_1\cdot \boldsymbol{k}_2)^2}{k_1^2\,k_2^2} + \frac{1}{2}\, \frac{\boldsymbol{k}_1\cdot \boldsymbol{k}_2}{k_1\,k_2}\left(\frac{k_1}{k_2} + \frac{k_2}{k_1}\right). \end{aligned}$ (33)

Fourth, for the average number density of satellite galaxies a inside halos of mass m, $〈 N_{sat}^{a} 〉 u_{g}^{a} (x | m)$ $\langle{N_{\mathrm{sat}}^{a}\rangle}u^a_\mathrm{g}(\boldsymbol{x}\,|\,m)$ , we assume an NFW profile with concentration

$\begin{matrix} c_{g}^{a} = f^{a} c . \end{matrix}$ $\begin{aligned} c^a_\mathrm{g} = f^a c. \end{aligned}$ (34)

This concentration may differ from that of the halo matter if f^a ≠ 1. A similar parameter was, for example, introduced by Cacciato et al. (2012) in galaxy-clustering halo models, who show that f^a affects galaxy clustering at scales below 1 h⁻¹ Mpc.

Finally, we express the first– and second–order moments of the HODs by the model of Zheng et al. (2007), where galaxies are split into centrals and satellites, each with their own expected number per halo, such that the expected number ⟨N^a | m⟩ of galaxies from sample a in a halo of mass m is

$\begin{matrix} ⟨ N^{a} | m ⟩ = ⟨ N_{cen}^{a} | m ⟩ + ⟨ N_{sat}^{a} | m ⟩ . \end{matrix}$ $\begin{aligned} \langle {N^a\,|\,m}{\rangle } = \langle {N_{\mathrm{cen} }^{a}\,|\,m}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\, m}{\rangle }. \end{aligned}$ (35)

Each halo hosts at most one central galaxy, $N_{cen}^{a} = 1$ ${N_{\mathrm{cen}}^{{a}}}=1$ , situated at the halo centre but can contain several satellite galaxies, $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ . With this split, the galaxy-galaxy-matter bispectrum requires the specification of the mean numbers ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩ and ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ | m⟩ of central and satellite galaxies, as well as the numbers of central and satellite pairs ⟨ $N_{sat}^{a} N_{sat}^{b} | m$ ${{N_{\mathrm{sat}}^{a}}{N_{\mathrm{sat}}^{b}}\,|\,m}$ ⟩, ⟨ $N_{sat}^{a} N_{sat}^{b} | m$ ${{N_{\mathrm{sat}}^{a}}{N_{\mathrm{sat}}^{b}}\,|\,m}$ ⟩, for a ≠ b, and ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ − 1) | m⟩ for a = b.

The mean number ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩ of central galaxies depends only on halo mass. For small halo masses, no galaxy formation occurs, so ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩ = 0. Halos with masses above a certain threshold, though, will contain central galaxies, at most one per halo. Similar to Zheng et al. (2007), we assume

$\begin{matrix} ⟨ N_{cen}^{a} | m ⟩ = \frac{α^{a}}{2} {1 + [\frac{log (m) - log (M_{th}^{a})}{σ^{a}}]}, \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{cen} }^{a}\,|\,m}{\rangle } = \frac{\alpha ^{a}}{2}\,\left\{ 1+\left[\frac{\log (m)-\log (M_\mathrm{th} ^{a})}{\sigma ^{a}}\right]\right\} , \end{aligned}$ (36)

with the free parameters α^a, $M_{th}^{a}$ $M_\mathrm{th}^{a}$ , and σ^a. The mass M_th is the halo mass below which we do not expect halos to contain galaxies. The parameter σ^a determines the transition of ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩ from 0 to α^a. If σ^a is small, the transition from ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩ = 0 to ⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩=α^a occurs quickly, whereas the transition is slower for larger σ^a. The parameter 0 ≤ α^a ≤ 1, for the maximum of ⟨N^a|m⟩, gives the fraction of massive halos (m ≫ $M_{th}^{a}$ $M_\mathrm{th}^{a}$ ) with a central galaxy from population a. Its inclusion is necessary because we are splitting galaxies into samples: only one sample can contain the central galaxy in a halo at a time. Since no more than one central galaxy per halo is allowed, the sum of all α^a from disjunct samples can never exceed unity.

For the mean number ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ | m⟩ of satellites, we assume, based on Zehavi et al. (2005),

$\begin{matrix} ⟨ N_{sat}^{a} | m ⟩ = \frac{1}{2} {1 + [\frac{log (m) - log (M_{th}^{a})}{σ^{a}}]} {(\frac{m}{M^{' a}})}^{β^{a}}, \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } = \frac{1}{2} \left\{ 1+\left[\frac{\log (m)-\log (M_\mathrm{th} ^{a})}{\sigma ^{a}}\right]\right\} \left(\frac{m}{M^{\prime a}}\right)^{\beta ^{a}}, \end{aligned}$ (37)

with the free parameters M^′a and β^a. The satellite number therefore follows the central galaxy number for small halo masses and becomes a power law at high halo masses. To illustrate the dependence of the HOD terms on halo mass, Fig. 2 shows the expected number of satellite and central galaxies and the parameters in Table 1. The total number of galaxies depends strongly on the central galaxy distribution for low-mass halos. Satellite galaxies predominate in massive halos.

Fig. 2.

Mean per-halo numbers of galaxies for fiducial HOD parameters in Table 1. The solid black line shows the total galaxy number per halo, the dashed red line shows the fraction of halos with central galaxies, and the dotted blue line shows the number of satellite galaxies per halo.

Table 1.

Fiducial values and flat priors of the halo model parameters.

The galaxy-galaxy matter bispectrum depends not only on the expected numbers of centrals and satellites, but also on the expected number ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ − 1) | m⟩ of unmixed satellite pairs, the expected number ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ | m⟩ of mixed satellite pairs, i.e, a ≠ b, and the number ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ | m⟩ of central-satellite pairs per halo of mass m. In principle, it also depends on the number ⟨ $N_{sat}^{a} N_{sat}^{b} | m$ ${{N_{\mathrm{sat}}^{a}}{N_{\mathrm{sat}}^{b}}\,|\,m}$ ⟩ of central pairs, but this number is 0 for all m, as each halo contains only one central galaxy. We assume, as, for example in Cacciato et al. (2012), that the occupation numbers of centrals and satellites are statistically independent,

$\begin{matrix} ⟨ N_{cen}^{a} N_{sat}^{b} | m ⟩ = ⟨ N_{cen}^{a} | m ⟩ ⟨ N_{sat}^{b} | m ⟩ . \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{b}\,|\, m}{\rangle } = \langle {N_{\mathrm{cen} }^{a}\,|\,m}{\rangle }\,\langle {N_{\mathrm{sat} }^{b}\,|\,m}{\rangle }. \end{aligned}$ (38)

This common assumption entails that the mean number of type b satellite galaxies in halos of mass m is independent of whether a central galaxy a is present or not. Furthermore, following Kravtsov et al. (2004), we assume that satellite occupation numbers vary according to a Poisson statistic,

$\begin{matrix} ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m ⟩ = ⟨ N_{sat}^{a} | m ⟩^{2} . \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{sat} }^{a}\left(N_{\mathrm{sat} }^{a}-1\right)\,|\,m}{\rangle }=\langle {N_{\mathrm{sat} }^{a}\,|\,m}{\rangle }^2. \end{aligned}$ (39)

We test this assumption and its impact on the model accuracy with galaxies inserted via SAM in a cosmological simulation in Appendix B.

Finally, we introduce as a new parameter the cross-correlation coefficient r^ab of satellites, defined for mixed galaxy samples a ≠ b by

$\begin{matrix} ⟨ N_{sat}^{a} N_{sat}^{b} | m ⟩ = ⟨ N_{sat}^{a} ⟩ ⟨ N_{sat}^{b} ⟩ + r^{ab} (m) \sqrt{⟨ N_{sat}^{a} | m ⟩ ⟨ N_{sat}^{b} | m ⟩} . \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{sat} }^{a}\,N_{\mathrm{sat} }^{b}\,|\,m}{\rangle } = \langle {N_{\mathrm{sat} }^{a}}{\rangle }\, \langle {N_{\mathrm{sat} }^{b}}{\rangle } + r^{ab}(m)\, \sqrt{\langle {N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } \langle {N_{\mathrm{sat} }^{b}\,|\,m}{\rangle }}. \end{aligned}$ (40)

The coefficient r^ab is negative if the numbers of satellite galaxies a and b are anti-correlated; and positive if they are positively correlated. If the satellite galaxies are distributed Poissonian, that is, σ²( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ | m) = ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ | m⟩, then r^ab is a Pearson correlation coefficient. As we show in Sect. 6, mock galaxies from a SAM imply a mass dependence of r^ab(m), which we model by

$\begin{matrix} r^{ab} (m) = A^{ab} {(\frac{m}{10^{12} M_{⊙}})}^{ϵ^{ab}}, \end{matrix}$ $\begin{aligned} r^{ab}(m)= A^{ab} \left(\frac{m}{10^{12}\,M_\odot }\right)^{\epsilon ^{ab}}, \end{aligned}$ (41)

with the free parameters A^ab and ϵ^ab. A value of A^ab = 0 corresponds to uncorrelated galaxy samples. The cross-correlation between galaxy samples is independent of halo mass if ϵ = 0. Ignoring a mass dependence of r^ab in a more simplistic approach, as, for example in Simon et al. (2009), would result in a weighted average over the true r^ab(m). The comparison of such an average value to the true r^ab(m) in the simulation would not be straightforward, as the weighting per halo mass, given the survey characteristics, needs to be determined. A halo-mass dependent r^ab(m), on the other hand, is easier to interpret, which is why we use the mass-dependent fitting formula above.

A decrease of r^ab(m) towards small halo masses can be understood qualitatively as follows. The mean satellite number of galaxies from sample a inside a halo decreases with halo mass until a regime is reached where we find at most one. Likewise, at small enough halo masses, we find at most one galaxy from a second sample b (other galaxy populations may be present as well but are not of interest here). Then there are basically only four possibilities to populate a halo: (i) one galaxy from sample a, (ii) one galaxy from sample b, (iii) a pair of galaxies from a and b, and (iv) no galaxies from a or b. We shall denote the probabilities of these cases (i) to (iv) by p_a, p_b, p_ab, and 1 − p_a − p_b − p_ab, respectively. For this Poisson process with four outcomes, we find for the mean number of galaxies a in a halo

$\begin{matrix} ⟨ N_{sat}^{a} ⟩ & = ⟨ N_{sat}^{a} | (i) ⟩ p_{a} + ⟨ N_{sat}^{a} | (i i) ⟩ p_{b} \\ + ⟨ N_{sat}^{a} | (i i i) ⟩ p_{ab} + ⟨ N_{sat}^{a} | (i v) ⟩ (1 - p_{a} - p_{b} - p_{ab}) \end{matrix}$ $\begin{aligned} \langle N_{\mathrm{sat} }^{a} \rangle&= \langle N_{\mathrm{sat} }^{a}|(i)\rangle \,p_a+\langle N_{\mathrm{sat} }^{a}|(ii)\rangle \,p_b\\&\quad +\langle N_{\mathrm{sat} }^{a}|(iii)\rangle \,p_{ab}+\langle N_{\mathrm{sat} }^{a}|(iv)\rangle \,(1-p_a-p_b-p_{ab}) \end{aligned}$ (42)

$\begin{matrix} = 1 \times p_{a} + 0 \times p_{b} + 1 \times p_{ab} + 0 \times (1 - p_{a} - p_{b} - p_{ab}) \end{matrix}$ $\begin{aligned} &= 1 \times p_a + 0 \times p_b + 1 \times p_{ab} + 0 \times (1-p_a-p_b-p_{ab})\end{aligned}$ (43)

$\begin{matrix} = p_{a} + p_{ab}, \end{matrix}$ $\begin{aligned}&=p_a+p_{ab}, \end{aligned}$ (44)

and ⟨ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ ⟩=p_b + p_ab for galaxies b. Similarly, the mean number of pairs is ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ ⟩=p_ab, and the variances are σ²( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ) = ⟨( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ )²⟩−⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ⟩² = (p_a + p_ab)(1 − p_a − p_ab) and σ²( $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ ) = (p_b + p_ab)(1 − p_b − p_ab). The Pearson cross-correlation coefficient of variations in the galaxy numbers consequently becomes

$\begin{matrix} r_{pear}^{ab} & = \frac{⟨ N_{a} N_{b} ⟩ - ⟨ N_{a} ⟩ ⟨ N_{b} ⟩}{σ (N_{sat}^{a}) σ (N_{sat}^{a})} \\ = \frac{p_{ab} - (p_{a} + p_{ab}) (p_{b} + p_{ab})}{\sqrt{(p_{a} + p_{ab}) (p_{b} + p_{ab}) (1 - p_{a} - p_{ab}) (1 - p_{b} - p_{ab})}} \\ \leq \frac{{(p_{a} - p_{b})}^{2} - 2 (p_{a} + p_{b}) + 1}{1 - {(p_{a} - p_{b})}^{2}} . \end{matrix}$ $\begin{aligned} r_\mathrm{pear} ^{ab}&= \frac{\langle N_a N_b\rangle -\langle N_a\rangle \,\langle N_b\rangle }{\sigma (N_{\mathrm{sat} }^{a})\,\sigma (N_{\mathrm{sat} }^{a})} \nonumber \\&=\frac{p_{ab}-(p_a+p_{ab})(p_b+p_{ab})}{\sqrt{(p_a+p_{ab})(p_b+p_{ab})(1-p_a-p_{ab})(1-p_b-p_{ab})}} \nonumber \\&\le \frac{(p_a-p_b)^2-2(p_a+p_b)+1}{1-(p_a-p_b)^2}. \end{aligned}$ (45)

Therefore, at the extreme end, for halos too small to host more than one galaxy, that is, p_ab = 0, the value of $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ exactly converges to zero for p_a, p_b → 0 (that is ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ⟩, ⟨ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ ⟩→0). But, already in the intermediate regime, where 0 ≤ p_ab ≤ 1 − p_a − p_b and p_a, p_b ∼ 0.1, the upper limit of $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ in the last line confines the correlation factor to smaller values: for example, r^ab ≤ 0.6 for p_a = p_b = 0.1, or r^ab ≤ 0.81 for p_a = 0 and p_b = 0.1. Therefore, a decreasing $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ for m → 0 is a natural outcome of halos sparsely populated with galaxies a and b. This also applies to r^ab for sub-Poisson variances, expected at low halo masses, since r^ab < $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ in this case.

As stated above, r^ab is a Pearson correlation coefficient only for Poisson variances of the satellite numbers, as assumed in our halo model. We show in Appendix B, however, that this assumption is inaccurate for the SAMs, and possibly real galaxies, at high or low halo masses, with specifics depending on the galaxy selection. In extreme cases, variances may be an order of magnitude larger for ⟨N|m⟩≫1 or smaller for ⟨N|m⟩≪1. Consequently, r^ab is at small halo masses systematically lower than the Pearson correlation coefficient (r^ab < $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ ) and higher for high halo masses (r^ab > $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ ). Nevertheless, true and reconstructed correlation parameters in this paper are comparable because we use the same r^ab definition in both cases (and in the science verification). In addition, our following verification results show that the strict assumption of Poisson satellites still allows an accurate HOD reconstruction within the statistical errors of KV450×GAMA data.

After describing our choices for the halo model parameters, we can model the galaxy-galaxy-matter bispectrum B_ggδ. We derive the bispectrum in the following subsection.

3.2. Galaxy-galaxy-matter bispectrum

Using Eqs. (1)–(3), the galaxy-galaxy-matter bispectrum is given by

$\begin{matrix} \frac{1}{{\bar{n}}_{g}^{a} (z) {\bar{n}}_{g}^{b} (z) \bar{ρ} (z)} ⟨ {\hat{n}}_{g}^{a} (k_{1}, z) {\hat{n}}_{g}^{b} (k_{2}, z) \hat{ρ} (k_{3}, z) ⟩ = \\ {(2 π)}^{3} B_{gg δ}^{ab} (k_{1}, k_{2}, z) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms, \end{matrix}$ $\begin{aligned}&\frac{1}{\bar{n}^a_\mathrm{g} (z)\, \bar{n}^b_\mathrm{g} (z)\, \bar{\rho }(z)}\langle {\hat{n}^a_\mathrm{g} (\boldsymbol{k}_1, z) \, \hat{n}^b_\mathrm{g} (\boldsymbol{k}_2, z) \, \hat{\rho }(\boldsymbol{k}_3, z)}{\rangle } =\\&\nonumber (2\pi )^3 B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z) \, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}, \end{aligned}$ (46)

where a and b denote the galaxy samples, and ‘unconnected terms’ are those proportional to δ_D(k₁), δ_D(k₂), or δ_D(k₃), which do not affect the bispectrum. The bispectrum can be divided into three terms: the 1-halo term ₁ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂, z), the 2-halo term ₂ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂, z), and the 3-halo term ₃ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂, z), or together

$\begin{matrix} B_{gg δ}^{ab} (k_{1}, k_{2}, z) & =_{1} B_{gg δ}^{ab} (k_{1}, k_{2}, z) +_{2} B_{gg δ}^{ab} (k_{1}, k_{2}, z) \\ +_{3} B_{gg δ}^{ab} (k_{1}, k_{2}, z) . \end{matrix}$ $\begin{aligned} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z)&= _{1} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z) + _{2} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z)\\&\nonumber \quad + _{3} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z). \end{aligned}$ (47)

The 1-halo term depends on the correlation between galaxies numbers and matter density in the same halo. One part of the 2-halo term is caused by the correlation of galaxies in one halo with the matter in a different halo. Its second part is due to the correlation of galaxies and matter in one halo with galaxies in another halo. Correlations between matter and galaxies in three distinct halos cause the 3-halo term.

To derive $\hat{ρ}$ $\hat{\rho}$ and $\hat{n}$ $\hat{n}$ , we use that in the halo model the cosmic density field consists of H halos with masses {m₁, …, m_H} at positions {x₁, …, x_H}. With the normalised density profile u the matter density field is

$\begin{matrix} ρ (x, z) = \sum_{i = 1}^{H} m_{i} u (x - x_{i} | m_{i}, z), \end{matrix}$ $\begin{aligned} \rho (\boldsymbol{x}, z) = \sum _{i=1}^H m_i\, u(\boldsymbol{x}-\boldsymbol{x}_i\,|\,m_i, z), \end{aligned}$ (48)

whose Fourier transform is

$\begin{matrix} \hat{ρ} (k, z) = \sum_{i = 1}^{H} m_{i} \hat{u} (k | m_{i}, z) exp (- i k \cdot x_{i}) . \end{matrix}$ $\begin{aligned} \hat{\rho }(\boldsymbol{k}, z) = \sum _{i=1}^H m_i\, \hat{u}(\boldsymbol{k}\,|\,m_i, z)\, \exp (-{i}\boldsymbol{k}\cdot \boldsymbol{x}_i). \end{aligned}$ (49)

Galaxies are treated as discrete point particles. Each satellite galaxy j from sample a belongs to a halo centred at x_i and is at separation $Δ x_{i j}^{a}$ $\Delta \boldsymbol{x}^a_{ij}$ from the halo centre. Centrals are exactly at the halo centre. The number density of galaxies from sample a is therefore

$\begin{matrix} n_{g}^{a} (x, z) = \sum_{i = 1}^{H} [N_{cen, i}^{a} δ_{D} (x - x_{i}) + \sum_{j = 1}^{N_{sat, i}^{a}} δ_{D} (x - x_{i} - Δ x_{ij})], \end{matrix}$ $\begin{aligned} n_\mathrm{g} ^a(\boldsymbol{x}, z) = \sum _{i=1}^H \left[N_{\mathrm{cen} , i}^{a}\,\delta _{\rm D}(\boldsymbol{x}-\boldsymbol{x}_i) + \sum _{j=1}^{N_{\mathrm{sat} , i}^{a}}\delta _{\rm D}(\boldsymbol{x} - \boldsymbol{x}_i - \Delta \boldsymbol{x}_{ij}) \right], \end{aligned}$ (50)

where $N_{sat, i}^{a}$ ${N_{\mathrm{sat}, {i}}^{a}}$ is the number of satellite galaxies and $N_{cen, i}^{a}$ ${N_{\mathrm{cen}, {i}}^{a}}$ the number of central galaxies from sample a inside halo i. The Fourier transform of the number density is

$\begin{matrix} {\hat{n}}_{g}^{a} (k, z) = \sum_{i = 1}^{H} (N_{cen, i}^{a} e^{- i k \cdot x_{i}} + \sum_{j = 1}^{N_{sat, i}^{a}} e^{- i k \cdot (x_{i} + Δ x_{ij}^{a})}) . \end{matrix}$ $\begin{aligned} \hat{n}^a_\mathrm{g} (\boldsymbol{k}, z) = \sum _{i=1}^{H} \left( N_{\mathrm{cen} , i}^{a}\, \mathrm{e} ^{-{i}\boldsymbol{k}\cdot \boldsymbol{x}_i} + \sum _{j=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k}\cdot (\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{ij})}\right). \end{aligned}$ (51)

To derive the bispectrum we insert Eqs. (49) and (51) into Eq. (46), which leads to

$\begin{matrix} _{1} B_{gg δ}^{ab} (k_{1}, k_{2}, z) & = \frac{1}{{\bar{n}}_{g}^{a} (z) {\bar{n}}_{g}^{b} (z) \bar{ρ} (z)} \int m n (m) \\ \times m \hat{u} (k_{1} + k_{2} | m, z) G^{ab} (k_{1}, k_{2} | m, z) ; \end{matrix}$ $\begin{aligned} _{1} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z)&= \frac{1}{\bar{n}^a_\mathrm{g} (z)\, \bar{n}^b_\mathrm{g} (z)\, \bar{\rho }(z)}\, \int {m}\;n(m)\, \\&\nonumber \quad \times m\,\hat{u}(\boldsymbol{k}_1+\boldsymbol{k}_2\,|\,m, z) \, G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\,m, z); \end{aligned}$ (52)

$\begin{matrix} _{2} B_{gg δ}^{ab} (k_{1}, k_{2}, z) & = \frac{1}{{\bar{n}}_{g}^{a} (z) {\bar{n}}_{g}^{b} (z) \bar{ρ} (z)} \int m_{1} \int m_{2} n (m_{1}) n (m_{2}) \\ \times [m_{1} \hat{u} (k_{1} + k_{2} | m_{1}, z) G^{ab} (k_{1}, k_{2} | m_{2}, z) \\ \times P_{h} (| k_{1} + k_{2} | | m_{1}, m_{2}, z) \\ + m_{2} \hat{u} (k_{1} + k_{2} | m_{2}, z) G^{a} (k_{1} | m_{1}, z) \\ \times G^{b} (k_{2} | m_{2}, z) P_{h} (k_{1} | m_{1}, m_{2}, z) \\ + m_{2} \hat{u} (k_{1} + k_{2} | m_{2}, z) G^{b} (k_{2} | m_{1}, z) \\ \times G^{a} (k_{1} | m_{2}, z) P_{h} (k_{2} | m_{1}, m_{2}, z)] ; \end{matrix}$ $\begin{aligned} _{2} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z)&= \frac{1}{\bar{n}^a_\mathrm{g} (z)\, \bar{n}^b_\mathrm{g} (z)\, \bar{\rho }(z)} \int {m_1} \int {m_2}\; n(m_1)\, n(m_2)\,\\&\nonumber \quad \times \Big [ m_1\,\hat{u}(\boldsymbol{k}_1+\boldsymbol{k}_2\,|\,m_1, z)\, G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\,m_2, z)\\&\nonumber \quad \quad \times P_\mathrm{h} (|\boldsymbol{k}_1+\boldsymbol{k}_2|\,|\,m_1, m_2, z)\\&\nonumber \quad \quad + m_2\,\hat{u}(\boldsymbol{k}_1+\boldsymbol{k}_2\,|\,m_2, z)\,G^{a}(\boldsymbol{k}_1\,|\,m_1, z) \\&\nonumber \quad \quad \times G^{b}(\boldsymbol{k}_2\,|\,m_2, z)\, P_\mathrm{h} ({k}_1\,|\,m_1, m_2, z) \\&\nonumber \quad \quad + m_2\,\hat{u}(\boldsymbol{k}_1+\boldsymbol{k}_2\,|\,m_2, z)\,G^{b}(\boldsymbol{k}_2\,|\,m_1, z) \\&\nonumber \quad \quad \times G^{a}(\boldsymbol{k}_1\,|\,m_2, z)\, P_\mathrm{h} ({k}_2\,|\,m_1, m_2, z) \Big ]; \end{aligned}$ (53)

and

$\begin{matrix} _{3} B_{gg δ}^{ab} (k_{1}, k_{2}, z) \\ = \frac{1}{{\bar{n}}_{g}^{a} (z) {\bar{n}}_{g}^{b} (z) \bar{ρ} (z)} \int m_{1} \int m_{2} \int m_{3} n (m_{1}) n (m_{2}) n (m_{3}) \\ \times m_{3} \hat{u} (- k_{1} - k_{2} | m_{3}, z) G^{a} (k_{1} | m_{1}, z) G^{b} (k_{2} | m_{2}, z) \\ \times B_{h} (k_{1}, k_{2} | m_{1}, m_{2}, m_{3}, z), \end{matrix}$ $\begin{aligned}&_{3} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, z) \\&\nonumber =\frac{1}{\bar{n}^a_\mathrm{g} (z)\, \bar{n}^b_\mathrm{g} (z)\, \bar{\rho }(z)} \int {m_1} \int {m_2} \int {m_3}\;n(m_1)\, n(m_2)\, n(m_3)\,\\&\nonumber \quad \times m_3\,\hat{u}(-\boldsymbol{k}_1-\boldsymbol{k}_2\,|\,m_3, z)\, G^{a}(\boldsymbol{k}_1\,|\,m_1, z)\,G^{b}(\boldsymbol{k}_2\,|\,m_2, z)\\&\nonumber \quad \times B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k}_2\,|\,m_1, m_2, m_3, z), \end{aligned}$ (54)

with

$\begin{matrix} G^{a} (k | m, z) : = ⟨ N_{cen}^{a} | m ⟩ + ⟨ N_{sat}^{a} | m ⟩ {\hat{u}}_{g}^{a} (k | m, z), \end{matrix}$ $\begin{aligned} G^{a}(\boldsymbol{k}\,|\,m, z) := \langle {N_{\mathrm{cen} }^{a}\,|\,m}{\rangle }+ \langle {N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } \hat{u}^a_\mathrm{g} (\boldsymbol{k}\,|\,m, z), \end{aligned}$ (55)

and

$\begin{matrix} G^{ab} (k_{1}, k_{2} | m, z) & : = ⟨ N_{cen}^{a} (N_{cen}^{b} - δ_{ab}^{K}) | m ⟩ \\ + ⟨ N_{cen}^{a} N_{sat}^{b} | m ⟩ {\hat{u}}_{g}^{b} (k_{2} | m, z) \\ + ⟨ N_{cen}^{b} N_{sat}^{a} | m ⟩ {\hat{u}}_{g}^{a} (k_{1} | m, z) \\ + ⟨ N_{sat}^{a} (N_{sat}^{b} - δ_{ab}^{K}) | m ⟩ \\ \times {\hat{u}}_{g} (k_{1} | m, z) {\hat{u}}_{g} (k_{2} | m, z), \end{matrix}$ $\begin{aligned} G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\,m, z)&:= \langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{b}-\delta ^\mathrm{K}_{a b})\,|\,m}{\rangle }\\&\nonumber \quad + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{b}\,|\,m}{\rangle }\hat{u}_\mathrm{g} ^b(\boldsymbol{k}_2\,|\,m, z) \\&\nonumber \quad +\langle {N_{\mathrm{cen} }^{b}\, N_{\mathrm{sat} }^{a}\,|\,m}{\rangle }\hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m, z) \\&\nonumber \quad + \langle {N_{\mathrm{sat} }^{a}\, (N_{\mathrm{sat} }^{b}-\delta ^\mathrm{K}_{a b})\,|\,m}{\rangle }\, \\&\nonumber \quad \quad \times \hat{u}_\mathrm{g} (\boldsymbol{k}_1\,|\,m, z)\, \hat{u}_\mathrm{g} (\boldsymbol{k}_2\,|\,m,z), \end{aligned}$ (56)

where the Kronecker symbol $δ_{a b}^{K}$ ${\delta^{\rm K}_{{a} {b}}}$ is 1 for a = b, and 0 otherwise. We derive these equations in full in Appendix A.

With the ingredients of the halo model from Sect. 3.1, the bispectrum is fully specified. By inserting it into Eq. (12) and (17) we obtain the aperture statistics. Before fitting the model to simulated and actual G3L measurements, we discuss the parameter sensitivity of the model in the next section.

3.3. Discussion of model parameters

We qualitatively study the impact of the parameters on the theoretical G3L signal. For this, we vary each parameter inside the prior range, adopted for the likelihood analysis below (Table 1), while keeping the other parameters fixed to ‘fiducial’ values, that is, to the centres of the parameter prior range. There are six free parameters for each galaxy sample and two additional parameters for the correlation of satellite numbers, hence in total 14 parameters for the bispectrum of a combination of galaxy samples. We arrange these parameters inside the vector p,

$\begin{matrix} p^{T} = (α^{a} σ^{a} M_{th}^{a} β^{a} M^{' a} f^{a} α^{b} σ^{b} M_{th}^{b} β^{b} M^{' b} f^{b} A^{ab} ϵ^{ab}), \end{matrix}$ $\begin{aligned} \boldsymbol{p}^\mathrm{T} = \left(\alpha ^{a}\; \sigma ^{a}\; M_\mathrm{th} ^{a}\; \beta ^{a}\; M^{\prime a}\, f^{a}\; \alpha ^{b}\; \sigma ^{b}\; M_\mathrm{th} ^{b}\; \beta ^{b}\; M^{\prime b} f^{b}\; A^{ab}\; \epsilon ^{ab}\right), \end{aligned}$ (57)

and write the modelled aperture statistics for a set p of parameters and scale radius θ as ⟨𝒩𝒩ℳ⟩(θ, p).

The priors are aimed to be uninformative. For σ, log₁₀( $M_{th}^{a}$ $M_\mathrm{th}^{a}$ /M_⊙), β^a, and log₁₀(M^′a/M_⊙), they are based on the galaxy-galaxy lensing halo model by Clampitt et al. (2017), whose prior ranges we doubled for the G3L analysis. For α, the prior range contains all possible values between 0 and 1. For the parameters f, A, and ϵ, which are unique to our G3L halo model, there are no reference values. Therefore, we chose arbitrary broad ranges centred on the values f = 0 (indicating galaxy distributions perfectly following dark matter halos), A = 1, and ϵ = 1 (indicating no correlations between galaxy HODs). To sanity-check the dependence on priors, we performed the analysis on the validation data also with prior ranges two times larger and found no difference in the final parameter constraints.

Figure 3 shows ⟨𝒩^a𝒩^aℳ⟩ for unmixed lens pairs, when varying α^a, σ^a, $M_{th}^{a}$ $M_\mathrm{th}^{a}$ , M^′a, and f^a. All parameters, except for σ, visibly impact the aperture statistics. However, we note that the trends seen in Fig. 3 are for varying each parameter individually and do not show the correlation between the parameters. For example, the threshold mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ and the satellite mass scale M^′a are tightly correlated: Increasing $M_{th}^{a}$ $M_\mathrm{th}^{a}$ while decreasing M^′a could lead to the same ⟨𝒩^a𝒩^aℳ⟩.

Fig. 3.

Impact of halo model parameters on ⟨𝒩^a𝒩^aℳ⟩ for unmixed lens pairs. In each panel, only one parameter is varied at a time. Solid lines indicate the total ⟨𝒩^a𝒩^aℳ⟩, while dashed lines show the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term.

The threshold hal o mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ and the slope β^a of the satellite HOD affect the signal the strongest on the scales we consider here. Increasing the threshold mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ from 3.2 × 10¹² M_⊙ to 7.5 × 10¹² M_⊙ roughly doubles the aperture statistics across the whole range of θ from $0 \overset{'}{.} 1$ $0{{\overset{\prime}{.}}}1$ to 100′ because $M_{th}^{a}$ $M_\mathrm{th}^{a}$ is closely connected to the galaxy bias. A higher $M_{th}^{a}$ $M_\mathrm{th}^{a}$ causes galaxies to reside in more massive halos, so the galaxy bias and the shear amplitude increase, which increases the amplitude of the G3L aperture statistics. The satellite parameter M^′a, which describes the mass scale where ⟨N_sat|m⟩≈1 if $M_{th}^{a}$ $M_\mathrm{th}^{a}$ ≪ M^′a, changes the signal amplitude for θ ≳ 1′. A smaller M^′a produces a higher amplitude by scaling up the number of satellites in a halo of a given mass.

The slope β^a of ⟨ $N_{sat}^{q} | m$ ${N^q_{\rm sat}|m}$ ⟩ has the strongest impact on the statistics at θ ∼ 9′, an up to a 20-fold increase in amplitude. While a larger β^a increases the signal on larger scales, at scales below $0 \overset{'}{.} 3$ $0{{\overset{\prime}{.}}}3$ , it decreases the aperture statistics by up to 50%. These opposite trends have two reasons. First, a larger β^a causes more massive halos to contain more satellite galaxies and pairs, giving the halos more weight in the second term in G^a (Eq. (A.27)) and the last term in G^ab (Eq. (A.18)), compared to low-mass halos with fewer satellite pairs. This increases ⟨𝒩^a𝒩^aℳ⟩. Second, more massive halos are more extended and contribute predominantly at larger scales. Therefore, the signal increases at larger angular scales but decreases at scales where lower-mass halos are more relevant.

The concentration parameter f^a affects the aperture statistics stronger at small scales than at large scales (57% at $0 \overset{'}{.} 1$ $0{{\overset{\prime}{.}}}1$ and 40% at 1′). A more concentrated galaxy profile (f^a > 1) leads to more galaxies in the inner regions of each halo, which increases the galaxy density contrast δ_g at small scales. Consequently, the galaxy-galaxy-matter bispectrum and the aperture statistics increase at small scales. At scales above $11 \overset{'}{.} 5$ $11{{\overset{\prime}{.}}}5$ , though, the change is less than 10%.

The sensitivity of the statistics for r^ab is visible in Fig. 4, where we show ⟨𝒩^a𝒩^bℳ⟩ for mixed lens pairs for correlated satellite numbers (r^ab = 1), uncorrelated numbers (r^ab = 0), or anti-correlated numbers (r^ab = −1). For this example, we keep r^ab constant with halo mass, so ϵ^ab = 0, while A^ab is varied between A^ab ∈ {0, ±1}. The amplitude of ⟨𝒩^a𝒩^bℳ⟩ changes strongest for $θ = 1 \overset{'}{.} 1$ $\theta= 1{{\overset{\prime}{.}}}1$ (72%), where the signal is dominated by the 1-halo term containing ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ |m⟩ and has a r^ab-dependence. The cross-correlation parameter has only a small effect (less than 10% change) at scales above 30′ because these scales are dominated by the 3-halo term, independent of the per-halo number of satellite pairs.

Fig. 4.

Impact of the correlation of satellite numbers on the aperture statistics of mixed lens pairs. Satellite numbers are either fully correlated (r^ab = 1, blue lines), uncorrelated (r^ab = 0, black lines) or anti-correlated (r^ab = −1, red lines). Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term.

In conclusion, the aperture statistics between $0 \overset{'}{.} 1$ $0{{\overset{\prime}{.}}}1$ and 100′are most sensitive to the threshold halo mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ and the slope β^a. In contrast, we do not expect G3L to improve the σ^a constraints beyond the prior range. To probe the cross-correlation parameter r^ab, the statistic ⟨𝒩^a𝒩^bℳ⟩ needs to be measured at scales below 30′and preferably around 1′. Both of these measurements are achieved for our survey data.

4. Data

Before describing the G3L measurement procedure and halo model fitting procedures, we briefly explain the simulated and observed data. These are the same data sets as in Linke et al. (2020a): simulated (verification) data based on the MS and the SAM by Henriques et al. (2015), and one composed of the overlap of KiDS, VIKING, and GAMA. As mentioned in Sect. 1, the cosmology used to analyze the observation differs from the fiducial cosmology of the MS and is based on Planck Collaboration I (2020). This difference in cosmology should not impact our conclusions because we are not interested in a one-to-one comparison between the observation and simulation and merely use the mock data to validate our method.

4.1. KV450 × GAMA

We use observational data from the overlap of KiDS, VIKING, and GAMA, KV450 × GAMA for short, which is an area of approximately 180 deg². KiDS (Kuijken et al. 2015; de Jong et al. 2015) and VIKING (Edge et al. 2013; Venemans et al. 2015) are two photometric surveys covering approximately the same 1350 deg² area, with KiDS observed in the optical with the VLT survey telescope, and VIKING observed in the near-infrared with the VISTA telescope. We use the public combined data release KV450, described in detail in Wright et al. (2019). Galaxies were observed in the u, g, r, i band for KiDS and Z, Y, J, H, K_s bands for VIKING, with shape measurements performed in the r-band. This data set was processed with AstroWISE (de Jong et al. 2015) and THELI (Erben et al. 2005; Schirmer 2013) to give a catalogue of observed galaxies. The shapes of these galaxies were measured with lensfit (Miller et al. 2013; Kannawadi et al. 2019). We use the KV450 galaxies with photometric redshifts between 0.5 and 1.2 (obtained by Hildebrandt et al. 2020) as sources for the G3L measurements. The cosmological analysis of KV450 found that the n(z) for individual tomographic bins of the KV450 galaxies are offset by values between −0.007 and 0.013 (Hildebrandt et al. 2020).

Our lens galaxies are from GAMA (Driver et al. 2009; Liske et al. 2015), a spectroscopic survey conducted at the Anglo Australian Telescope. We use all galaxies listed in the table distanceFramesv14 from the data management unit (DMU) LocalFlowCorrection with a redshift quality parameter N_Q > 2 and with a spectroscopic redshift less than 0.5 to avoid overlap between lenses and sources. Each lens galaxy is assigned absolute magnitudes, restframe photometry, and stellar masses according to the table stellarMassesLambdarv20 from the DMU StellarMasses (Taylor et al. 2011). These were obtained with matched aperture photometry and the LAMBDAR code (Wright et al. 2016), assuming the initial mass function by Chabrier (2003), stellar population synthesis according to Bruzual & Charlot (2003), and dust extinction according to Calzetti et al. (2000). To calculate the angular correlation function ω^ab, we use the randoms in the table randomsv02 from the DMU Randoms, created by Farrow et al. (2015).

Galaxies observed by GAMA are brighter than r = 19.8 mag, rendering our lens galaxy sample flux-limited. We divide the GAMA galaxies into different samples: a ‘red’ and a ‘blue’ sample, using their restframe (g − r)₀ colour and their absolute magnitude M_r in the r-band, and five samples defined by stellar masses. For this we use the same cuts as in Farrow et al. (2015) and Linke et al. (2020a). An overview of these cuts is given in Table 2.

Table 2.

Selection criteria for lens samples.

These different samples have differing redshift distributions n(z), shown in Fig. 5. While the red and blue galaxies are distributed similarly, the n(z) of the stellar-mass selected lenses differ strongly. Galaxies with lower stellar mass are predominantly observed at smaller redshifts; galaxies with higher stellar mass are found up to the limiting redshift 0.5. These differences are caused by the flux limit of the survey; galaxies with lower stellar masses, typically fainter than more massive galaxies, are only visible at smaller redshifts.

Fig. 5.

Normalised redshift distributions n(z) of GAMA galaxies, selected by colour (left) and stellar mass (right).

The differences in the redshift distribution of galaxies from different samples are also visible in Fig. 6, which shows the stellar mass M_* and redshift z of the GAMA galaxies. Observed galaxies at higher redshift have higher stellar masses on average, which is a direct consequence of the flux limit. Red and blue galaxies are distributed similarly with redshift but differ in stellar mass: red galaxies tend to have higher stellar masses than blue galaxies.

Fig. 6.

Stellar-mass and redshift of GAMA galaxies, divided by colour (left) and stellar mass (right).

4.2. Millennium Simulation

The simulation data are constructed from the MS (Springel et al. 2005), a dark-matter only cosmological N-body simulation. Its simulation box has a comoving side length of 500 h⁻¹Mpc and contains 2160³ particles with mass 8.6 × 10⁸ h⁻¹ M_⊙ (h = 0.73).

Maps of the lensing shear, γ, were created with the multiple-lens-plane raytracing algorithm by Hilbert et al. (2009). This algorithm generated shear maps of size 4 × 4 deg² on a regular grid of 4096 × 4096 deg² for 64 lines-of-sights (total area of 1024 deg²) at each redshift slice of the MS². We combine the shear redshift slices in a weighted average according to the observed KV450 galaxy redshift distribution, resulting in a combined shear without shear noise and intrinsic source alignment. The effective source density for the science verification corresponds to the pixel density, that is, 291 arcmin⁻².

Since all 64 lines-of-sight originate from the same simulation, they are not independent at the largest scales. However, the lines-of-sight correspond to different observers placed in the simulation box such that the overlaps between the lines-of-sights are minimal, so we expect correlations predominantly at large scales, close to the simulation box size of 500 h⁻¹ Mpc. In contrast, the G3L signal is dominated by the 1-halo term, originating from small scales of approximately 1 h⁻¹Mpc. Therefore, we treat the lines-of-sight as independent for our purposes.

The lens galaxies for the simulated lenses Henriques et al. (2015) use the same initial mass function (Chabrier 2003) as assumed for the stellar mass estimates from GAMA, but a different stellar population model (Maraston 2005). Nevertheless, as shown in Linke et al. (2020a), the predictions of the MS combined with this SAM for the G3L signal agree with observations in KV450×GAMA.

To mimic the GAMA selection function, we use all lens galaxies with redshift less than 0.5 and brighter than r = 19.8 mag. We also divide the simulated lens galaxies by their colour and stellar masses by the same cuts as for the observed lens galaxies (Table 2).

5. Methods

This section outlines our G3L estimators and the model fit to the data. The measurement procedure largely follows Simon et al. (2008) and Linke et al. (2020b) and is summarised in Sect. 5.1. The statistical analysis for the inference of HOD parameters from the G3L data is detailed in Sect. 5.2. To assess the accuracy of the inferred HOD in a science verification with mock data, later on, we describe in Sect. 5.3 the determination of the true HODs of the simulated galaxies. We make our codes publicly available at github.com³.

5.1. Measuring G3L

We estimate the G3L aperture statistics from a catalogue of galaxy positions and source shapes for bins B of similar triangles (ϑ₁, ϑ₂, ϕ) with the estimator ${\tilde{G}}_{est}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}_{\mathrm{est}}$ in Simon et al. (2008). For N^a lens galaxies from sample a, N^b lens galaxies from sample b, and N_s source galaxies with complex ellipticities ϵ_k, the estimate of ${\tilde{G}}^{ab} (B)$ ${\tilde{\mathcal{G}}}^{ab}(B)$ is the real part of

$\begin{matrix} {\tilde{G}}_{est}^{ab} (B) \\ = \frac{\sum_{i = 1}^{N^{a}} \sum_{j = 1}^{N^{b}} \sum_{k = 1}^{N_{s}} (- 1) w_{k} ϵ_{k} e^{- i (φ_{ik} + φ_{jk})} [1 + ω^{ab} (| θ_{i} - θ_{j} |)] Δ_{ijk} (B)}{\sum_{i, j = 1}^{N_{d}} \sum_{k = 1}^{N_{s}} w_{k} Δ_{ijk} (B)}, \end{matrix}$ $\begin{aligned}&\nonumber \tilde{\mathcal{G} }_{\mathrm{est}}^{ab}(B) \\&= \frac{\sum \limits _{i=1}^{N^a}\sum \limits _{j=1}^{N^b}\sum \limits _{k=1}^{N_{\rm s}}(-1)\,{ w}_k\, \epsilon _k\, \mathrm{e}^{-\mathrm{i}(\varphi _{ik} + \varphi _{jk})}\left[1 + \omega ^{ab}(|\boldsymbol{\theta }_i - \boldsymbol{\theta }_j|)\right]\, \Delta _{ijk}(B)}{\sum \limits _{i,j=1}^{N_{\rm d}}\sum \limits _{k=1}^{N_{\rm s}}\, w_k\, \Delta _{ijk}(B)}, \end{aligned}$ (58)

where ω^ab(θ) is the angular two-point correlation function of lens galaxies from samples a and b with lag θ, and

$\begin{matrix} Δ_{ijk} (B) = {\begin{matrix} 1 & for (| θ_{k} - θ_{i} |, | θ_{k} - θ_{j} |, ϕ_{ijk}) \in B \\ 0 & otherwise \end{matrix} . \end{matrix}$ $\begin{aligned} \Delta _{ijk}(B) = {\left\{ \begin{array}{ll} 1&\mathrm{for}\; (|\boldsymbol{\theta }_k - \boldsymbol{\theta }_i|, |\boldsymbol{\theta }_k - \boldsymbol{\theta }_j|, \phi _{ijk})\in B\\ 0&\mathrm{otherwise} \end{array}\right.}. \end{aligned}$ (59)

The angles φ_ik and φ_jk are the polar angles of the lens-source separation vectors θ_i − θ_k and θ_j − θ_k (corresponding to φ₁ and φ₂ in Fig. 1) and ϕ_ijk = φ_ik − φ_jk is the opening angle between the lens-source separation vectors (corresponding to ϕ in Fig. 1). The weight w_k of the source k (set to w_k ≡ 1 for our simulated data) measures the confidence in the shape measurements for the source, with higher weights indicating more precise ellipticities. The imaginary part of ${\tilde{G}}_{est}^{ab} (B)$ ${\tilde{\mathcal{G}}}_{\mathrm{est}}^{ab}(B)$ is pure noise in the absence of systematic errors in the shear data (Linke et al. 2020a).

We give in the estimator equal weight to all lens pairs, which is in contrast to the recent analysis in Linke et al. (2020a), where we weighted galaxy pairs based on the redshift difference between their members. The lack of weighting dilutes the signal-to-noise ratio of the G3L signal since non-physical pairs (widely separated along the line-of-sight) carry no signal but increase the noise. However, a model with weights requires abandoning the Limber approximation for the projection of the bispectrum in Eq. (12). The weighting introduces an additional factor into the integral, which depends on the line-of-sight distance between the lens galaxies in a pair. This factor, by definition, varies strongly on scales corresponding to the correlation length between galaxies. It is designed to give high weights to galaxy pairs within a correlation length while down-weighting galaxies outside this range. Consequently, the assumption that the integrand in Eq. (12) is only slowly varying is no longer valid. With an alternative formalism for Eq. (12) unclear at this point, we apply equal weights to our lens pairs for the scope of this work.

Our estimator for the two-point correlation function of the lens clustering, ω^ab(θ), is that by Szapudi & Szalay (1998),

$\begin{matrix} ω^{ab} (θ) = \frac{N_{r}^{a} N_{r}^{b}}{N_{d}^{a} N_{d}^{b}} \frac{D_{a} D_{b} (θ)}{R_{a} R_{b} (θ)} - \frac{N_{r}^{a}}{N_{d}^{a}} \frac{D_{a} R_{b} (θ)}{R_{a} R_{b} (θ)} - \frac{N_{r}^{b}}{N_{d}^{b}} \frac{D_{b} R_{a} (θ)}{R_{a} R_{b} (θ)} + 1, \end{matrix}$ $\begin{aligned} \omega ^{ab}(\theta ) = \frac{N_{\mathrm{r}}^{a}\, N_{\mathrm{r}}^{b}}{N_{\mathrm{d}}^{a}\,N_{\mathrm{d}}^{b}}\frac{{D_aD_b}(\theta )}{{R_aR_b}(\theta )} - \frac{N_{\mathrm{r}}^{a}}{N_{\mathrm{d}}^{a}}\frac{{D_aR_b}(\theta )}{{R_aR_b}(\theta )} - \frac{N_{\mathrm{r}}^{b}}{N_{\mathrm{d}}^{b}}\frac{{D_bR_a}(\theta )}{{R_aR_b}(\theta )} +1, \end{aligned}$ (60)

for two lens samples a and b with $N_{d}^{a}$ $N_{\mathrm{d}}^{a}$ and $N_{d}^{b}$ $N_{\mathrm{d}}^{b}$ galaxies, and two ‘random samples’. These random samples contain $N_{r}^{a}$ $N_{\mathrm{r}}^{a}$ and $N_{r}^{b}$ $N_{\mathrm{r}}^{b}$ unclustered galaxies subject to the same selection functions as the lens samples. The D_aD_b, D_aR_b, D_bR_a, and R_aR_b are the pair counts of observed (D) and random galaxies (R).

Having obtained ω^ab, we measure (on an approximate flat sky) ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ with (58) individually for tiles of size 1° ×1°. For this, we divide the observational data into N = 189 and the simulation data into N = 4 × 64 = 256 tiles. The tiles allow us to project the galaxy positions and shear to Cartesian coordinates via an orthographic transformation and to quantify the uncertainty of the ${\tilde{G}}_{est}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}_{\mathrm{est}}$ with jackknife resampling on a tile by tile basis. We estimate ${\tilde{G}}_{i}^{ab}$ ${\tilde{\mathcal{G}}}_i^{ab}$ for each tile i on a regular grid of 128 × 128 × 128 bins. These bins are spaced logarithmically for ϑ₁ and ϑ₂, and linearly for ϕ. We use $ϑ_{1}, ϑ_{2} \in [0 \overset{'}{.} 15, 85^{'}]$ $\vartheta_1, \vartheta_2 \in [0{{\overset{\prime}{.}}}15, 85^{\prime}]$ and ϕ ∈ [0, 2π]. For the total ${\tilde{G}}_{est}^{ab}$ ${\tilde{\mathcal{G}}}_{\mathrm{est}}^{ab}$ , individual tile estimates ${\tilde{G}}_{i}^{ab}$ ${\tilde{\mathcal{G}}}_i^{ab}$ are averaged, weighted by the effective number of triplets per bin.

Due to the finite number of galaxies, some of the bins will remain ‘empty’, meaning that the data contains no lens-lens-source triplet fitting the configuration of the bin. Setting the correlation function in these bins to an arbitrary value biases the measurement, so we use the adaptive binning from Linke et al. (2020b). This scheme uses Voronoi tesselation to redefine the bins, now b_i, such that no empty bins occur, effectively merging empty and ‘filled’ bins. We found in Linke et al. (2020b) that estimating $\tilde{G}$ ${\tilde{\mathcal{G}}}$ in 128 × 128 × 128 bins and then applying the tesselation leads to a measurement accuracy within 5%.

We convert a binned ${\tilde{G}}_{est}^{ab}$ ${\tilde{\mathcal{G}}}_{\mathrm{est}}^{ab}$ to aperture statistics with a numerical approximation to Eq. (19),

$\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ) & = \sum_{i = 1}^{N_{bin}} V (b_{i}) {\tilde{G}}_{est}^{ab} (b_{i}) A_{NNM} (b_{i} | θ, θ, θ), \end{matrix}$ $\begin{aligned} \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta )&= \sum _{i=1}^{N_\mathrm{bin} } V(b_i)\, \tilde{\mathcal{G} }_\mathrm{est} ^{ab}(b_i)\,\mathcal{A} _{\mathcal{NNM} }(b_i\,|\, \theta , \theta , \theta ), \end{aligned}$ (61)

where N_bin is the number of bins, V(b_i) is the tesselation volume of bin b_i, and 𝒜_𝒩𝒩ℳ(b_i | θ₁, θ₂, θ₃) is the kernel function of Eq. (19), evaluated at the centre of bin b_i.

We estimate the covariance matrix of the estimates with jackknife resampling. For this, we combine the ${\tilde{G}}^{ab}$ ${\tilde{\mathcal{G}}}^{ab}$ estimates for all but the k-th tile to the k-th jackknife sample ${\tilde{G}}_{k, jn}^{ab}$ ${\tilde{\mathcal{G}}}_{k, \mathrm{jn}}^{ab}$ , which is then converted to the aperture statistics, Eq. (19), leading to N samples ⟨𝒩^a𝒩^bℳ⟩_k. The estimate $\hat{C}$ $\hat{C}$ of the ⟨𝒩^a𝒩^bℳ⟩ covariance matrix is then

$\begin{matrix} {\hat{C}}_{ij}^{ab} & = \frac{N}{N - 1} \sum_{k = 1}^{N} [{⟨ N^{a} N^{b} M ⟩}_{k} (θ_{i}) - \bar{{⟨ N^{a} N^{b} M ⟩}_{k}} (θ_{i})] \\ \times [{⟨ N^{a} N^{b} M ⟩}_{k} (θ_{j}) - \bar{{⟨ N^{a} N^{b} M ⟩}_{k}} (θ_{j})], \end{matrix}$ $\begin{aligned} \hat{C}^{ab}_{ij}&= \frac{N}{N-1}\, \sum _{k=1}^{N}\left[\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle _k(\theta _i)-\overline{\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle _k}(\theta _i)\right]\\&\nonumber \quad \times \left[\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle _k(\theta _j)-\overline{\langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle _k}(\theta _j)\right], \end{aligned}$ (62)

where $\bar{{⟨ N^{a} N^{b} M ⟩}_{k}} (θ_{i})$ $\overline{{\langle{{\mathcal{N}}^{a}{\mathcal{N}}^{b}{\mathcal{M}}}\rangle}_k}(\theta_i)$ is the average of all aperture statistics jackknife samples. The (mean-square) statistical uncertainty of the aperture statistics ⟨𝒩^a𝒩^bℳ⟩(θ_i) is $σ_{i} = \sqrt{{\hat{C}}_{ii}^{ab}}$ $\sigma_i=\sqrt{\hat{C}^{ab}_{ii}}$ . The inverse of this covariance, needed for the likelihood analysis, is estimated with

$\begin{matrix} {(C^{ab})}_{ij}^{- 1} = \frac{N - N_{θ} - 2}{N - 1} {({\hat{C}}_{ij}^{ab})}^{- 1}, \end{matrix}$ $\begin{aligned} \left(C^{ab}\right)^{-1}_{ij} = \frac{N-N_\theta -2}{N-1} \left(\hat{C}^{ab}_{ij}\right)^{-1}, \end{aligned}$ (63)

where N_θ is the number of aperture radii bins (Hartlap et al. 2007; Anderson 2003). In our case N_θ = 3 × 30 = 90.

Formally, jackknife resampling assumes that all individual estimates of $\tilde{G}$ ${\tilde{\mathcal{G}}}$ are statistically independent. However, as all observed tiles originate from the same observation and are adjacent to each other, this assumption is not strictly valid. It is still probably a good approximation on scales smaller than the tile sizes. A possible bias in the empirical covariance is less pronounced for the simulation, as the tiles originate from 64 (mostly) independent line-of-sights.

5.2. Fitting the halo model

We constrain the parameters of the G3L halo model by fitting it to measurements of both the auto-correlation aperture statistics, ⟨𝒩^a𝒩^aℳ⟩(θ) and ⟨𝒩^b𝒩^bℳ⟩(θ), and cross-correlation statistics, ⟨𝒩^a𝒩^bℳ⟩(θ), of two galaxy samples a and b for N_θ = 30 scale radii θ between $0 \overset{'}{.} 1$ $0{{\overset{\prime}{.}}}1$ and 30′. To this end, we combine the measurements into a data vector of 3N_θ elements,

$\begin{matrix} d^{ab} = (\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ_{1}) \\ ⋮ \\ ⟨ N^{a} N^{b} M ⟩ (θ_{30}) \\ ⟨ N^{a} N^{a} M ⟩ (θ_{1}) \\ ⋮ \\ ⟨ N^{a} N^{a} M ⟩ (θ_{30}) \\ ⟨ N^{b} N^{b} M ⟩ (θ_{1}) \\ ⋮ \\ ⟨ N^{b} N^{b} M ⟩ (θ_{30}) \end{matrix}) . \end{matrix}$ $\begin{aligned} \boldsymbol{d}^{ab}=\begin{pmatrix} \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1)\\ \vdots \\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _{30})\\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{a}{\mathcal{M} }}\rangle (\theta _1)\\ \vdots \\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{a}{\mathcal{M} }}\rangle (\theta _{30})\\ \langle {{\mathcal{N} }^{b}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1)\\ \vdots \\ \langle {{\mathcal{N} }^{b}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _{30}) \end{pmatrix}. \end{aligned}$ (64)

Likewise, we define the halo model vector m^ab(p) for each parameter set p, which is obtained from Eq. (46) and the bispectrum in Sect. 3.2,

$\begin{matrix} m^{ab} (p) = (\begin{matrix} ⟨ N^{a} N^{b} M ⟩ (θ_{1} | p) \\ ⋮ \\ ⟨ N^{a} N^{b} M ⟩ (θ_{30} | p) \\ ⟨ N^{a} N^{a} M ⟩ (θ_{1} | p) \\ ⋮ \\ ⟨ N^{a} N^{a} M ⟩ (θ_{30} | p) \\ ⟨ N^{b} N^{b} M ⟩ (θ_{1} | p) \\ ⋮ \\ ⟨ N^{b} N^{b} M ⟩ (θ_{30} | p) \end{matrix}) . \end{matrix}$ $\begin{aligned} \boldsymbol{m}^{ab}(\boldsymbol{p})=\begin{pmatrix} \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1\,|\,\boldsymbol{p})\\ \vdots \\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _{30}\,|\,\boldsymbol{p})\\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{a}{\mathcal{M} }}\rangle (\theta _1\,|\,\boldsymbol{p})\\ \vdots \\ \langle {{\mathcal{N} }^{a}{\mathcal{N} }^{a}{\mathcal{M} }}\rangle (\theta _{30}\,|\,\boldsymbol{p})\\ \langle {{\mathcal{N} }^{b}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _1\,|\,\boldsymbol{p})\\ \vdots \\ \langle {{\mathcal{N} }^{b}{\mathcal{N} }^{b}{\mathcal{M} }}\rangle (\theta _{30}\,|\,\boldsymbol{p}) \end{pmatrix}. \end{aligned}$ (65)

Optimal parameters p_opt are those that minimise the goodness-of-fit

$\begin{matrix} χ^{2} (p) = {[d^{ab} - m^{ab} (p)]}^{T} {C^{- 1}}^{ab} [d^{ab} - m^{ab} (p)], \end{matrix}$ $\begin{aligned} \chi ^2(\boldsymbol{p})=\left[\boldsymbol{d}^{ab}-\boldsymbol{m}^{ab}(\boldsymbol{p})\right]^\mathrm{T} \, {{C}^{-1}}^{ab}\, \left[\boldsymbol{d}^{ab}-\boldsymbol{m}^{ab}(\boldsymbol{p})\right], \end{aligned}$ (66)

determined by the Nelder-Mead algorithm (Nelder & Mead 1965) as implemented in the GNU Scientific Library (Gough 2009). This algorithm is well-suited to multi-dimensional minimisation problems and requires only few (typically 1 or 2) function evaluations per iteration step. To avoid local minima, the algorithm is restarted multiple times at different, randomly chosen, initial parameter values.

To estimate the uncertainties on the best-fitting parameters p_opt and to quickly sample the posterior distribution of the parameters P(p | d^ab), we approximate the posterior distribution with the importance function q(p | d^ab) following the importance sampling scheme (e.g. Liu 2004). According to the Bayes theorem, the posterior density of p given the data is

$\begin{matrix} P (p | d^{ab}) \propto L (d^{ab} | p) P_{prior} (p), \end{matrix}$ $\begin{aligned} P(\boldsymbol{p}\,|\,\boldsymbol{d}^{ab}) \propto \mathcal{L} (\boldsymbol{d}^{ab}\,|\,\boldsymbol{p})\, {P}_\mathrm{prior} (\boldsymbol{p}), \end{aligned}$ (67)

where P_prior(p) is the prior density of the parameters, as given in Table 1, and ℒ(d^ab|p) is the likelihood of the data given the parameters p. We assume Gaussian statistical errors in the data, that is, the likelihood function is

$\begin{matrix} L (d^{ab} | p) \propto exp [- \frac{1}{2} χ^{2} (p)], \end{matrix}$ $\begin{aligned} \mathcal{L} (\boldsymbol{d}^{ab}\,|\,\boldsymbol{p})\propto \exp \left[-\frac{1}{2}\, \chi ^2(\boldsymbol{p})\right], \end{aligned}$ (68)

with the χ² as defined in Eq. (66); the Bayesian evidence and thus the normalisation of ℒ is not of interest here and will be ignored in what follows. The importance sampling function q should be close to the posterior P to achieve an efficient sampling. To find an appropriate q, we approximate ℒ(d^ab | p) in the proximity of the optimal parameters p_opt by the Gaussian probability density

$\begin{matrix} \tilde{L} (d^{ab} | p) \propto exp [- \frac{1}{2} {(p - p_{opt})}^{T} F (p_{opt}) (p - p_{opt})], \end{matrix}$ $\begin{aligned} \tilde{\mathcal{L} }(\boldsymbol{d}^{ab}\,|\,\boldsymbol{p})\propto \exp [-\frac{1}{2}\, \left(\boldsymbol{p}-\boldsymbol{p}_\mathrm{opt} \right)^\mathrm{T} \, {{F}}(\boldsymbol{p}_\mathrm{opt} )\left(\boldsymbol{p}-\boldsymbol{p}_\mathrm{opt} \right)], \end{aligned}$ (69)

where the matrix F is the Fisher information,

$\begin{matrix} F_{ij} (p) = {(m^{ab} (p) p_{i})}^{T} C^{- 1} m^{ab} (p) p_{j} \end{matrix}$ $\begin{aligned} {F}_{ij}(\boldsymbol{p})=\left({\boldsymbol{m}^{ab}(\boldsymbol{p})}{p_i}\right)^\mathrm{T}\, {C}^{-1}\, {\boldsymbol{m}^{ab}(\boldsymbol{p})}{p_j} \end{aligned}$ (70)

(e.g. Tegmark et al. 1997), and choose q as

$\begin{matrix} q (p | d^{ab}) \propto \tilde{L} (d^{ab} | p) P_{prior} (p) . \end{matrix}$ $\begin{aligned} q(\boldsymbol{p}\,|\,\boldsymbol{d}^{ab}) \propto \tilde{\mathcal{L} }(\boldsymbol{d}^{ab}\,|\,\boldsymbol{p})\, {P}_\mathrm{prior} (\boldsymbol{p}). \end{aligned}$ (71)

We now draw N_p parameter sets p_i from q which are then, by importance sampling, weighted to sample P. The allocated weights are

$\begin{matrix} w (p_{i} | d^{ab}) \propto \frac{P (p_{i} | d^{ab})}{q (p_{i} | d)} \propto \frac{L (d^{ab} | p_{i})}{\tilde{L} (d^{ab} | p_{i})} \\ \propto exp [- \frac{1}{2} χ^{2} (p) + \frac{1}{2} {(p - p_{opt})}^{T} F (p_{opt}) (p - p_{opt})], \end{matrix}$ $\begin{aligned}&{ w}(\boldsymbol{p}_i\,|\,\boldsymbol{d}^{ab}) \propto \frac{P(\boldsymbol{p}_i\,|\,\boldsymbol{d}^{ab})}{q(\boldsymbol{p}_i\,|\,\boldsymbol{d})}\propto \frac{\mathcal{L} (\boldsymbol{d}^{ab}\,|\,\boldsymbol{p}_i)}{\tilde{\mathcal{L} }(\boldsymbol{d}^{ab}\,|\,\boldsymbol{p}_i)} \nonumber \\&\quad \propto \exp [-\frac{1}{2}\,\chi ^2(\boldsymbol{p}) + \frac{1}{2}\, \left(\boldsymbol{p}-\boldsymbol{p}_\mathrm{opt} \right)^\mathrm{T} \, {F}(\boldsymbol{p}_\mathrm{opt} )\left(\boldsymbol{p}-\boldsymbol{p}_\mathrm{opt} \right) ], \end{aligned}$ (72)

normalised such that their sum is unity.

For each parameter pⁱ we give the α credibility interval (CI) on the marginalised posterior P(pⁱ|d^ab), defined by

$\begin{matrix} P (p^{i} | d^{ab}) = \int [14] p^{'} P (p^{'} | d^{ab}) δ_{D} (p^{' i} - p^{i}) . \end{matrix}$ $\begin{aligned} P(p^i | \boldsymbol{d}^{ab}) = \int [14]p^\prime \; P(\boldsymbol{p^{\prime }}\,|\,\boldsymbol{d}^{ab})\; \delta _{\rm D}(p^{\prime i} - p^i). \end{aligned}$ (73)

We take the mode of the posterior as optimal parameter value $p_{opt}^{i}$ $p_\mathrm{opt}^i$ and define the CI I_α of a single parameter as the interval around the optimal parameter value including α of the marginalised posterior, that is,

$\begin{matrix} α = \int_{I_{α}} p^{i} P (p^{i} | d^{ab}) . \end{matrix}$ $\begin{aligned} \alpha = \int _{I_\alpha } {p^i} P(p^i | \boldsymbol{d}^{ab}). \end{aligned}$ (74)

To find this interval, we sort the sampling points by the distance |pⁱ − $p_{opt}^{i}$ $p_\mathrm{opt}^i$ | from the optimal parameter value and take the first N_p points, for which the sum of their weights equals α. The interval defined by these points is our estimate for the α credibility region of the optimal parameter.

5.3. Science verification

We assess the accuracy of the model, estimators, and our statistical setup by comparing the inferred HODs to the true ones in mock data. In these mock data, each galaxy is associated with a dark matter halo, identified by its halo ID and virial halo mass from a friends-of-friends halo finder. To extract the true HOD, we count the number of galaxies of each sample a in each halo and divide the halos into 50 mass bins in the range between 10¹¹ and 10¹⁵ h⁻² M_⊙. For each mass bin, the number of satellite and central galaxies per halo is averaged, yielding the true average ⟨N^a | m⟩=⟨ $N_{cen}^{a} | m$ ${{N_{\mathrm{cen}}^{a}}\,|\,m}$ ⟩+⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ | m⟩ in the data. We also verify the inference estimate of the correlation coefficient r^ab(m), Eq. (40), by computing

$\begin{matrix} r^{ab} (m) = \frac{⟨ N_{sat}^{a} N_{sat}^{b} | m ⟩ - ⟨ N_{sat}^{a} | m ⟩ ⟨ N_{sat}^{b} | m ⟩}{\sqrt{⟨ N_{sat}^{a} | m ⟩ ⟨ N_{sat}^{b} | m ⟩}} \end{matrix}$ $\begin{aligned} r^{ab}(m) = \frac{\langle {N_{\mathrm{sat} }^{a}N_{\mathrm{sat} }^{b}|m}{\rangle }-\langle {N_{\mathrm{sat} }^{a}|m}{\rangle }\langle {N_{\mathrm{sat} }^{b}|m}{\rangle }}{\sqrt{\langle {N_{\mathrm{sat} }^{a}|m}{\rangle }\langle {N_{\mathrm{sat} }^{b}|m}{\rangle }}} \end{aligned}$ (75)

from the halo catalogue and galaxies in the mock data. The uncertainties on ⟨N^a | m⟩ and r^ab(m) are the standard errors on the mean over the 64 lines-of-sights of the simulation.

6. Results

In this section, we give the results of the science verification and the G3L analysis for KV450 × GAMA. We first present the results for lens samples defined by their colour in Sect. 6.1 and then for lens samples defined by their stellar mass in Sect. 6.2.

6.1. Colour-selected lens samples

The G3L aperture statistics of simulated red and blue galaxies are shown in Fig. 7a, along with the best fit of the halo model and its decomposition into the 1-, 2-, and 3-halo terms. The parameter values corresponding to the best fit are given in the first two columns in Table 3. The goodness-of-fit is χ² = 93.63 for 90 − 14 = 76 degrees of freedom (d.o.f.), or χ²/d.o.f. = 1.28. This χ² corresponds to a p-value of 0.083, indicating no significant deviation between the fit and the simulated G3L signal within the 95% confidence level (CL).

Fig. 7.

G3L measurement (points) and best-fitting halo model (lines) for red and blue galaxies in the MS (upper plot) and the KV450 × GAMA (lower plot). Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. The left panels show the result for red-red galaxy pairs, the central panels for blue-blue galaxy pairs, and the right panels for red-blue mixed pairs. Error bars correspond to the standard deviation estimated from the jackknife resampling detailed in Sect. 5.1. (a) MS. (b) KV450 × GAMA.

Table 3.

Best-fitting values of halo model parameters for colour-selected lenses and 68% credibility intervals (d.o.f. = 76).

For our science verification, we compare in Fig. 8a the HODs inferred by the best-fitting G3L halo model to the directly estimated HODs of the simulated galaxies. The model prediction and direct estimate agree within the 68% credibility band (shaded areas) for red and blue galaxies. Likewise, the correlation of numbers of red and blue satellites, A^red-blue and ϵ^red-blue, is detected at 3σ significance in the verification, and r^ab(m) agrees in Fig. 9 with the true r^ab in the galaxy SAM within the 68% credibility band. Therefore, the G3L fit recovers the galaxy HODs and r^ab(m) sufficiently accurate within the statistical errors for a survey similar to our verification data (∼10³ deg², a high source number density without shape noise) and surveys with higher estimator noise, such as KV450 × GAMA. As an additional test, we explore the impact of a systematically wrong source n(z) in the HOD inference by shifting the full n(z)↦n(z − δz) by δz = 0.02, roughly twice the expected bias on the mean redshift of n(z) (see Sect. 4.1). The resulting systematic shift of the best fit HOD parameters stays within the 68% CI of the verification data, for example, δM_th = −1.8 × 10¹¹ M_⊙ (−0.05 × 10¹¹ M_⊙) for the red (blue) galaxies. Systematic errors in the HOD due to bias in n(z) are hence negligible for our analysis of KV450×GAMA.

Fig. 8.

Mean per-halo numbers of simulated galaxies (top) and observed galaxies (bottom) as function of halo mass. Red crosses (blue points) indicate the true HOD of simulated red (blue) galaxies, where the error bars are the standard deviation of the mean over the 64 line-of-sights. The lines indicate the per-halo numbers inferred from the fit to the G3L signal for red (solid red) and blue galaxies (dashed blue). The shaded areas are the 68% credibility areas of the halo model fit. (a) MS. (b) KV450 × GAMA.

We determine the HOD parameters of real red and blue galaxies in KV450×GAMA in Fig. 7b, where we show the aperture statistics and the best fits of our halo model, together with a decomposition into the 1-, 2-, and 3-halo terms. The model fit has χ²/d.o.f. = 0.977, corresponding to a p-value of 0.53 and an agreement with the model within the 95% CL. The best fitting parameters are reported in the second pair of columns in Table 3. They show that red galaxies clearly populate more massive halos than blue galaxies: $M_{th}^{red} = 1 . 5_{- 1.2}^{+ 6.7} \times 10^{12} M_{⊙}$ $M_\text{th}^\text{red}=1.5^{+6.7}_{-1.2}\times10^{12}\,M_\odot$ is roughly ten times larger than $M_{th}^{blue} = 1 . 4_{- 0.9}^{+ 7.2} \times 10^{11} M_{⊙}$ $M_\text{th}^\text{blue}=1.4^{+7.2}_{-0.9}\times10^{11}\,M_\odot$ (68% credibility intervals, CI herafter). The per-halo number of blue satellites increases slower with halo mass than for red galaxies: the mass scale $M^{' blue} = 2 . 0_{- 1.2}^{+ 3.0} \times 10^{14} M_{⊙}$ $M^{\prime \mathrm{blue}}=2.0^{+3.0}_{-1.2}\times10^{14}\,M_\odot$ of blue satellites is more than five times larger than $M^{' red} = 3 . 6_{- 0.7}^{+ 3.5} \times 10^{13} M_{⊙}$ $M^{\prime \mathrm{red}}=3.6^{+3.5}_{-0.7}\times10^{13}\,M_\odot$ (68% CI). For central galaxies, the sum of α^red and α^blue is ${0.47}_{- 0.31}^{+ 0.51}$ $0.47_{-{0.31}}^{+{0.51}}$ (68% CI), which is consistent with unity (68% CI). The concentration of halo satellites is consistent with that of matter (f^a ∼ 1) in the 68% CI. As expected from the qualitative analysis (Sect. 3.3) σ cannot be constrained better than the prior.

The HODs corresponding to the best fitting parameter values are shown in Fig. 8b. The HOD of red (blue) galaxies is non-zero for halo masses above 10¹² M_⊙ (5 × 10¹¹ M_⊙), whereas, at lower halo masses, the constraints become essentially upper limits for ⟨N|m⟩. The inferred HODs for GAMA galaxies match those obtained from the fit to the simulated galaxies in Fig. 8b. This agreement reflects the similar G3L aperture statistics of mock data and observations – the SAM predictions for ⟨𝒩^a𝒩^bℳ⟩ agree with the measurements in KV450 × GAMA within the errors (Fig. 7). However, the uncertainties on the HODs and their parameters are considerably larger for the observation since our simulated data has less noise in the shear signal.

As for r^ab of red and blue satellites in KV450×GAMA, we report a 2σ to 3σ detection of a positive correlation and an amplitude increase towards more massive halos: both A^red-blue and ϵ^red-blue are positive. This trend is similar to that of the simulated galaxies. However, the increase with halo mass is steeper for the observed galaxies (Fig. 9 $, ϵ^{ab} = 0 . 99_{- 0.12}^{+ 0.22}$ $, \epsilon^{ab}=0.99^{+0.22}_{-0.12}$ versus $ϵ^{ab} = 0 . 69_{- 0.04}^{+ 0.08}$ $\epsilon^{ab}=0.69^{+0.08}_{-0.04}$ at 68% CI), while the correlation amplitude at 10¹² M_⊙ is lower ( $A^{ab} = 1 . 62_{- 0.51}^{+ 0.62} \times 10^{- 2}$ $A^{ab}=1.62^{+0.62}_{-0.51}\times10^{-2}$ versus $A^{ab} = 5 . 31_{- 0.92}^{+ 0.87} \times 10^{- 2}$ $A^{ab}=5.31^{+0.87}_{-0.92}\times10^{-2}$ at 68% CI). Consequently, numbers of red and blue satellites are correlated both in the SAMs and for true galaxies, especially beyond the mass scale of galaxy groups m ≳ 10¹³ M_⊙ where $r^{red - blue} ≳ 0 . 16_{- 0.05}^{+ 0.06}$ $r^{\mathrm{red-blue}}\gtrsim0.16^{+0.06}_{-0.05}$ for ϵ^{red − blue} = 1 (68% CI).

Fig. 9.

Correlation parameter r^ab for red and blue galaxies in the simulation and observation as a function of halo mass. Black crosses show the direct estimate for the simulation, where the error bars are the standard deviation of the mean over the 64 line-of-sights. The solid brown line shows the r^ab inferred by the halo model fit to the G3L signal of the MS, and the green dashed line is the result of the fit to the KV450 × GAMA G3L signal. The shaded areas show the 68% credibility bands of the fits.

The correlation matrix of the G3L estimate for the KV450 × GAMA red and blue galaxies is shown in Fig. 10. We see that the signal for similar aperture radii is strongly correlated. The signals for red-red and blue-blue lens pairs are almost independent, while the signal for mixed galaxy pairs has correlation coefficients of up to 0.3 with the signal for unmixed red-red pairs.

Fig. 10.

Correlation matrix for aperture statistics measurement in KV450×GAMA for red and blue galaxy samples. The data vector is ordered as given in Eq. (64), starting with the smallest aperture radius.

6.2. Stellar mass-selected lens samples

We repeat the science verification and G3L analysis for the stellar mass-selected galaxies. We consider five stellar-mass bins m1 to m5, so there are ten distinct combinations of two samples a and b. For each combination, we again use the statistics ⟨𝒩^a𝒩^aℳ⟩, ⟨𝒩^a𝒩^bℳ⟩, and ⟨𝒩^b𝒩^bℳ⟩ to infer HOD parameters. Since each galaxy sample is used in four combinations, we obtain four versions of the same HOD for each sample. For a realistic model and successful fits, these four HODs should be consistent with each other.

Fitting the halo model individually to two samples out of five neglects the correlations between the ten combinations. Consequently, better constraints on the model parameters could be obtained by fitting the model to the signal of all five stellar-mass selected samples simultaneously. However, this would increase the data vector from 90 entries to 450 entries. As the covariance estimated is obtained from only 180 (quasi)-independent data realisations, it cannot be used with such a large data vector, and a simultaneous fit of all five samples is unfeasible.

The HOD parameters, χ², and plots of the best fitting models for the science verification with the simulated data are given in Appendix C. The p-values exceed 0.05 for all fits, indicating no significant deviation between the fits and the measurements within the 95% CL. To evaluate the overall agreement of all fits to the mock data, we consider the cumulative distribution of p-values, shown in Fig. 11 (solid line). This distribution should correspond to a uniform distribution between 0 and 1 (dotted line) if the measurements are unbiased realisations of the model. A Kolmogorov-Smirnov (KS) test on the distribution of p-values for the simulation yields a KS distance of 0.118, which for the 11 samples and d.o.f. = 76 in the distribution indicates no significant deviation from a uniform distribution at 95% CL. Additionally, the four parameter sets for each stellar-mass bin agree within the 68% CI. The distribution of p-values and the consistency of the HOD parameters for the four model fits supports the view that the model coherently and accurately reproduces the G3L signal of the verification data within the statistical errors.

Fig. 11.

Cumulative distribution of p-values of G3L halo model fits for MS (orange, solid) and KV450 × GAMA (green, dashed). For a perfect description of G3L signal and data noise, the distributions would be consistent with a uniform distribution (black, dotted).

To verify the reconstruction of the HODs for stellar-mass samples, we compare the inferred HODs (lines and shaded areas) to the true HODs (data points) in Fig. 12a, which only shows the reconstructions for the combinations m1–m5, m2–m5, m3–m5, and m4–m5; the other reconstructions for the same sample but in a different combination agree with those within the 68% CI. The inferred HODs agree with the true HODs within the 68% credibility band. However, for the lower stellar mass galaxies from samples m1, m2, and m3, there is a ‘dip’ for the true ⟨N|m⟩ with a local minimum near 6 × 10¹¹ M_⊙, 1 × 10¹² M_⊙, and 2 × 10¹² M_⊙, respectively. Our halo model cannot trace this feature because, by construction, ⟨N|m⟩ increases monotonically with halo mass m; the halo model fits a smooth profile across the dip. However, this smoothing only increases the HOD reconstruction uncertainty without introducing a significant bias within the 68% CI. Therefore, at the level of the precision of the verification data and for the noisier observational data, the missing HOD model feature is acceptable for KV450×GAMA.

Fig. 12.

Mean per-halo galaxy numbers in the simulation (top) and observation (bottom) for lens galaxies from each stellar mass bin as function of halo mass. Crosses indicate the directly estimated per-halo numbers of simulated galaxies, while lines show the predictions from the G3L fits. The shaded areas indicate the 68% confidence areas. Left panels: the mean per-halo numbers for galaxies from stellar mass bins m1, m3, and m5 obtained from the fits to the G3L signal for m1–m5, m3–m5, and m4–m5. The right panels show the same for galaxies from stellar mass bins m2 and m5, obtained from the fits to the signal for m2–m5 and m4–m5. The corresponding HOD parameters are listed in Table C.1. (a) MS. (b) KV450 × GAMA.

To verify the inference of r^ab, we compare the model fits (solid lines and 68% credibility regions) to the true correlation of satellite numbers (data points) in the simulation in Fig. 13. The corresponding values of A^ab and ϵ^ab are listed in Table C.1 in the Appendix. For all combinations of stellar-mass bins, r^ab is positive and scales approximately linearly with halo mass (ϵ^ab ≈ 1). The amplitude A^ab, though, drops if one of the samples is m4 or m5 in the combination. For example, A^m1m2 is ${1.20}_{- 0.36}^{+ 0.45} \times 10^{- 2}$ $1.20_{-0.36}^{+0.45}\times10^{-2}$ , whereas A^m1m5 is ${0.029}_{- 0.020}^{+ 0.024} \times 10^{- 2}$ $0.029_{-0.020}^{+0.024}\times10^{-2}$ (68% CI). The r^ab from the halo model fits agree with the true r^ab for all sample combinations at the 68% CI (cf. data points to lines and dark green areas).

Fig. 13.

Correlation parameter r^ab for stellar mass-selected galaxies in the MS and in KV450×GAMA. Black crosses show the true correlation for the MS, where the error bars are the standard deviation of the mean over the 64 simulated line-of-sights. The solid brown line shows r^ab inferred from a simulated G3L analysis of MS, and the green dashed line the inference for KV450×GAMA. The shaded areas show the 68% credibility bands of the inferences. Blue points show the correlation parameter of galaxies in the MS selected without assuming a flux limit.

After the science verification, we fit the model to the observed G3L signal of KV450×GAMA. The best-fitting model parameters, χ²-values and plots are reported in Table C.3. For the observations, all p-values exceed 0.05, indicating no significant deviation between fit and measurement at a 95% CL. Again, we perform a KS-test on the cumulative distribution of p-values to evaluate the overall agreement of the model and measurements (dashed lines in Fig. 11). The KS distance of this distribution to a uniform distribution is now 0.31, larger than for the science verification but still consistent with a uniform distribution at the 95% CL.

The HODs inferred for KV450×GAMA are shown in Fig. 12b by solid lines and 68% credibility regions, again using the combinations m1–m5, m2–m5, m3–m5, and m4–m5; other combinations for the same sample yield consistent results within the 68% CI. The HODs vary with the stellar mass of the lenses. In contrast to massive halos, low mass halos are mainly populated by galaxies of low stellar mass. For example, in halos with masses below 2 × 10¹¹ M_⊙, m1 galaxies are the most numerous, whereas m4 galaxies dominate halos with masses above 4 × 10¹⁴ M_⊙. The most massive m5 galaxies, however, are never the most numerous sample between 10¹¹ to 10¹⁵ M_⊙. Instead, they are outnumbered by satellites of lower stellar mass. Compared to the SAM HOD (data points), the HODs of KV450×GAMA are consistent with the model.

Each sample HOD is an average over the sample redshift distribution, shown in Fig. 5. Therefore, the inferred HODs are affected by the survey flux limit. Most affected are the faint m1-galaxies. Although they are observed up to z ∼ 0.34, for z ≳ 0.15 only the most massive m1 galaxies are seen (M_* > 2 × 10⁹ h⁻² M_⊙). Due to this flux selection, the inferred average HOD is skewed towards the HOD of the more massive galaxies in this stellar-mass bin. Likewise, the HODS of the other samples also give more weight to the most massive galaxies in the samples but with less bias than for the faint m1 sample.

The threshold masses $M_{th}^{a}$ $M_\mathrm{th}^{a}$ increase with stellar mass, indicating that galaxies with higher stellar mass prefer to inhabit more massive halos. We show this trend in Fig. 14, which plots $M_{th}^{a}$ $M_\mathrm{th}^{a}$ against the average stellar mass of the samples. For each stellar mass bin, there are four estimates of $M_{th}^{a}$ $M_\mathrm{th}^{a}$ , from the four possible sample combinations, slightly displaced along the y-axis. As noted above, these four estimates are consistent with each other for all samples.

Fig. 14.

Threshold mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ measured for the GAMA galaxies as a function of the average stellar mass of each stellar mass bin (Green crosses). We show all four estimates for $M_{th}^{a}$ $M_\mathrm{th}^{a}$ for each sample a, slightly displaced along the y-axis for visibility. Horizontal errors correspond to 68% CI of $M_{th}^{a}$ $M_\mathrm{th}^{a}$ ; vertical errors show the standard deviation of the stellar masses of galaxies within a sample.

The central galaxies parameter α^a also increases with stellar mass: The central galaxy of a halo with $m ≳ max {M_{th}^{a}, M_{th}^{b}}$ $m\gtrsim {\rm max}\{M_\mathrm{th}^a,M_\mathrm{th}^b\}$ is more likely to be a than b, if a is a sample with higher stellar mass than b. As the four α^as obtained for each stellar mass bin are consistent, we can average them to a ${\bar{α}}^{a}$ $\bar{\alpha}^a$ for each sample a, yielding ${\bar{α}}^{m 1} = 0 . 11_{- 0.04}^{+ 0.07}$ $\bar{\alpha}^{\mathrm{m1}}=0.11^{+0.07}_{-0.04}$ , ${\bar{α}}^{m 2} = 0 . 14_{- 0.06}^{+ 0.08}$ $\bar{\alpha}^{\mathrm{m2}}=0.14^{+0.08}_{-0.06}$ , ${\bar{α}}^{m 3} = 0 . 22_{- 0.08}^{+ 0.10}$ $\bar{\alpha}^{\mathrm{m3}}=0.22^{+0.10}_{-0.08}$ , ${\bar{α}}^{m 4} = 0 . 47_{- 0.13}^{+ 0.11}$ $\bar{\alpha}^{\mathrm{m4}}=0.47^{+0.11}_{-0.13}$ , ${\bar{α}}^{m 5} = 0 . 54_{- 0.19}^{+ 0.19}$ $\bar{\alpha}^{\mathrm{m5}}=0.54^{+0.19}_{-0.19}$ . The sum of these α^a is $1 . 48_{- 0.25}^{+ 0.26}$ $1.48_{-0.25}^{+0.26}$ , which is consistent with unity within a 2σ CI. To derive the uncertainties, we assumed that each individual estimate of α^a is an independent measurement. This assumption is not true in our case (the data vectors for the ten fits contain multiple times the same measurements), so the derived uncertainties are probably larger in reality. A better estimate of the uncertainty on $\sum_{a} {\bar{α}}^{a}$ $\sum\nolimits_a \bar{\alpha}^a$ requires a simultaneous fit of all 10 combinations of the five mass bins. This is, unfortunately, not feasible in our case because our jackknife-derived covariance becomes singular for such a large data vector.

The satellite parameters σ^a, f^a, and β^a do not change significantly with stellar mass (68% CI), which is not surprising for σ^a because we cannot constrain σ^a better than the prior range. The concentration parameter is consistent with f^a ≈ 1 for all samples (68% CI), so there is generally no detectable deviation of the satellite distribution from the matter distribution inside halos. The slope of the satellite numbers is β^a ≈ 1 for all mass samples.

Lastly, we show the correlation r^ab of GAMA satellite numbers in Fig. 13 (green, dotted lines). There is a similar trend to the simulated galaxies (black data points), with a positive r^ab, approximately scaling linearly with halo mass (ϵ^ab ≈ 1). For the sample combinations m1–m2, m1–m3, m2–m3, and m2–m4, we find A^ab > 0 (95% CI): the satellite numbers of galaxies below stellar mass of 10^10.5 h⁻² M_⊙ are positively correlated. Since r^ab increases with halo mass, the correlations between low stellar masses become relevant beyond galaxy-group size halos, where $r^{m 1 m 2} = 1 . 9_{- 0.6}^{+ 0.4}$ $r^{\mathrm{m1m2}}=1.9^{+0.4}_{-0.6}$ , $r^{m 1 m 3} = 1 . 1_{- 0.2}^{+ 0.2}$ $r^{\mathrm{m1m3}}=1.1^{+0.2}_{-0.2}$ , and $r^{m 2 m 3} = 0 . 6_{- 0.2}^{+ 0.2}$ $r^{\mathrm{m2m3}}=0.6^{+0.2}_{-0.2}$ for m = 10¹³ M_⊙ and ϵ = 1 (68% CI).

We remind here that r^ab > 1 are possible because the r^ab are a Pearson correlation coefficient, $r_{pear}^{a b}$ $r_\mathrm{pear}^{ab}$ hereafter, only for a Poissonian variance of satellite numbers. In particular, we have the relation

$\begin{matrix} r_{pear}^{ab} = r^{ab} \frac{\sqrt{⟨ N_{sat}^{a} | m ⟩ ⟨ N_{sat}^{b} | m ⟩}}{σ (N_{sat}^{a} | m) σ (N_{sat}^{b} | m)}, \end{matrix}$ $\begin{aligned} r^{ab}_\mathrm{pear} = r^{ab}\,\frac{\sqrt{\langle {N_{\mathrm{sat} }^{a}|m}{\rangle }\,\langle {N_{\mathrm{sat} }^{b}|m}{\rangle }}}{\sigma (N_{\mathrm{sat} }^{a}|m)\,\sigma (N_{\mathrm{sat} }^{b}|m)}\;, \end{aligned}$ (76)

so that values of r^ab > 1 indicate a super-Poissonian variance. For the strongly non-Poissonian m1 and m2 SAM galaxies at m = 10¹⁴ M_⊙, r^m1 m2 = 1.2, which corresponds to $r_{pear}^{m1 m2} = 0.18$ $r^{\mathrm{m1\, m2}}_\mathrm{pear}=0.18$ (see Fig. B.1 in Appendix B for the deviation from a Poisson statistic). For the samples m4 and m5, which are closer to Poissonian, the r^m4 m5 = 0.07 at the same halo mass corresponds to $r_{pear}^{m4 m5} = 0.04$ $r^{\mathrm{m4\, m5}}_\mathrm{pear}=0.04$ . For red and blue SAM galaxies, the measured r^red blue = 3.2 at m = 10¹⁴ M_⊙ corresponds to $r_{pear}^{red blue} = 0.44$ $r^{\mathrm{red\, blue}}_\mathrm{pear}=0.44$ .

The positive correlation coefficients are partly caused by the flux limit of the survey. To see this effect, consider as an example the correlation between m1 and m2 galaxies. At low z ≲ 0.1, all galaxies with stellar masses in the m1 range are observed, so the correlation parameter at these redshifts measures the true correlation between all m1 and m2 galaxies. However, at higher redshifts, only m1 galaxies at the high end of the stellar mass bin are observed due to the flux limit. These galaxies have stellar masses closer to m2-galaxies. They are therefore more similar to m2 galaxies and stronger correlated than the overall less massive m1 galaxies. This systematically increases the inferred value of r^ab, which is an average over the correlations at different redshifts, compared to the correlation without flux limit.

To test whether the flux limit is the only cause of th e correlation, we consider samples of galaxies without a flux limit in the MS. For this purpose, we select all galaxies up to z = 0.5, irrespective of their magnitude, and divide them by the same stellar-mass cuts as the flux-limited samples. We then directly estimate the parameter r^ab for these samples, shown in grey in Fig. 13. The r^ab of the unlimited samples are generally lower than for the flux-limited samples, indicating that the flux limit indeed causes a higher correlation. Nevertheless, the r^ab are still clearly positive for the sample combinations m1–m2, m1–m3, m2–m3, m2–m4, and m3–m4 without flux limit. Consequently, the measured positive correlation is not purely an effect of the survey incompleteness.

7. Discussion

We presented a new method to measure galaxy HODs and the correlation of per-halo satellite galaxy numbers using G3L, a weak gravitational lensing effect measuring the projected galaxy-galaxy-matter bispectrum. To this end, we constructed a new halo model for the galaxy-galaxy-matter bispectrum and the G3L aperture statistics. We validated the science analysis for different selections of lens galaxies and demonstrated that it accurately recovers the true HODs within 68% errors for a survey with an area of ≃10³deg² and lens galaxies between 0 ≤ z ≤ 0.5 and brighter than r = 19.8 mag. Therefore, our G3L analysis is accurate enough to infer HODs from KV450×GAMA, which has a smaller footprint (180 deg²) and similar selections for the lens galaxies.

We inferred HODs for GAMA for galaxies in two colour samples (red and blue) and five stellar mass bins between 10^8.5h⁻² M_⊙ and 10^11.5h⁻² M_⊙ (m1 to m5). The best-constrained HOD parameter for the KV450 × GAMA is the threshold halo mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ , which is M_th ≈ 10¹¹ M_⊙ for our blue galaxies and M_th ≈ 10¹² M_⊙ for our red galaxies. These values match the expectation that red galaxies are typically group or cluster galaxies, whereas blue galaxies tend to be field galaxies. At lower halo masses, we find tight upper limits on ⟨N|m⟩, indicating that such halos rarely host galaxies satisfying our selection criteria. The transition region between this regime and halos typically containing galaxies is only poorly constrained. These poor constraints are reflected by the large uncertainty on the transition parameter σ, even for the mock data in our science verification. Therefore, G3L alone cannot constrain σ better than our prior, and future studies may fix σ to a fiducial value or dispense with this parameter altogether by modelling ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ |m⟩ by a step function.

The 1-halo term of the G3L signal of red-red lens pairs stretches to larger scales than for blue-blue lens pairs (Fig. 7). This finding can be explained by the tendency of red galaxies to populate more massive halos than blue galaxies. Since massive halos are larger than less massive ones, pairs of red galaxies inside the same halo can have wider separations than pairs of blue galaxies. Accordingly, the 1-halo term extends to larger scales for red-red lens pairs than for blue-blue ones. However, mixed red-blue pairs exist in intermediate halos, large enough to host red galaxies but small enough to contain a significant fraction of blue galaxies. Consequently, the cross-over between the domination of the 1-halo and the 3-halo term occurs at larger scales than for blue-blue lens pairs.

In contrast to previous HOD studies, we measured the correlation of numbers of satellites inside halos between galaxy samples. We report a > 3σ detection of a positive correlation for red and blue GAMA galaxies ( $r ≳ 0 . 16_{- 0.05}^{+ 0.06}$ $r\gtrsim0.16^{+0.06}_{-0.05}$ for m ≳ 10¹³ M_⊙, rising towards galaxy cluster scales). Similar positive correlations are present between the samples m1 (stellar masses below 10^9.5 M_⊙) and m2 or m3 (stellar masses between 10^9.5 M_⊙ and 10^10.5 M_⊙), as well as between sample m2 (stellar masses between 10^9.5 M_⊙ and 10¹⁰ M_⊙) and samples m3 or m4 (stellar masses between 10¹⁰ M_⊙ and 10¹¹ M_⊙). In particular, galaxies with similar stellar masses (for example from neighbouring samples m1 and m2) show positive correlations. Towards smaller halos, correlations become irrelevant due to the almost linear decrease of r^ab, visible both in the SAMs and in our observational data (Figs. 9 and 13). This trend fits our toy model consideration in Sect. 3.1, where a decreasing correlation is the consequence of Poisson shot noise inside low-occupancy halos. This finding implies that the assumption of uncorrelated satellite distributions is probably still appropriate if only galaxies in low-mass halos, m ≲ 10¹³ M_⊙, are considered.

The obvious presence of correlations, especially for halos of the group- and cluster-mass scale, questions the assumption of no correlation in halo models for the cross-correlation statistics between galaxy samples in galaxy-clustering studies (e.g. Scranton 2001, 2002; Zehavi et al. 2005; Simon et al. 2008). The correlations also raise questions about their origin and impact on galaxy models. To address the first, we note that finding strong correlations for similar galaxy samples is not unexpected. If two galaxy samples are drawn randomly from a pool of galaxies, they are statistically identical and their satellite numbers inside halos differ only by Poisson shot noise. Consequently, the satellite numbers are tightly correlated, which could explain the correlations between satellite galaxies in neighbouring stellar mass bins (Fig. 13). Stellar mass varies continuously and does not uniquely define a category of galaxies. Therefore, the division of galaxies by their stellar mass is ultimately arbitrary and satellite galaxies from neighbouring stellar mass bins have similar statistical properties. In addition, errors in the estimators for stellar mass blur the stellar mass bin on the edges, similar to randomly drawing galaxies from the same pool. This effect, in combination with depth variations between the samples due to the survey flux limit (Sect. 6.2), systematically increases r^ab. Moreover, strong correlations also emerge if two samples have a (close to) fixed ratio of satellite numbers inside a halo. This effect could occur for red and blue galaxies for cluster-sized halos (m ≳ 10¹⁴ M_⊙): Galaxy models commonly assume that star-forming blue galaxies falling into a galaxy cluster are quenched and turn into red galaxies with a certain probability. If this probability is roughly constant for halos of similar mass, satellite numbers of red and blue galaxies would be strongly correlated. Whatever the cause for the correlations for the GAMA galaxies, the consistency with the mock galaxies shows that galaxy SAMs already account for it at z < 0.5. Therefore, our findings do not hint at a need to improve these models at low redshift. However, as the galaxy population in clusters evolves with time, it will be interesting to study the correlations at higher redshift in the future.

It could be interesting to use G3L to investigate the evolution of the cross-correlation parameter r^ab with cosmic time, by selecting similar galaxy samples at different redshifts. Such a selection is difficult with the GAMA sample as the detection limit of r = 19.8 mag restricts our analysis to z ≲ 0.5. An analysis of the redshift evolution of the HOD parameters would consequently require a deeper lens galaxy sample, so lenses could be divided into different redshift bins. It would also require a large number of source galaxies with higher redshifts than the lens samples so that the source number density is sufficient for a significant G3L signal.

Our G3L halo model makes several unrealistic assumptions, although they are sufficient for analysing KV450 × GAMA. Concerning the matter distribution, we ignore halo exclusion (Smith et al. 2007), the dependency of halo properties on environment and assembly history (assembly bias, Gao & White 2007), or galactic sub-halos, known to be relevant for galaxy-galaxy lensing at sub-arcmin scales (e.g. Velander et al. 2014). Studies of simulated galaxies from SAMs (e.g. Zehavi et al. 2018) and hydrodynamical simulations (e.g. Hadzhiyska et al. 2020) have shown that HODs also vary with halo formation time and environment. As we only include a dependence on halo mass in our HOD model, our parameter values can be considered an average over all secondary halo properties. Furthermore, concerning the lens galaxy distribution, there is a known feature of the HOD of the simulated galaxies our model does not include: a local minimum in the HODs of galaxies with stellar masses below 10^10.5 h⁻² M_⊙ (see Fig. 12b). This minimum occurs at the transition region between the domination of the HODs by central and satellite galaxies. There, contrary to our model for ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ |m⟩ the mean number of centrals decreases again with halo mass for galaxies from samples m1, m2, and m3 (stellar masses below 10^10.5 h⁻² M_⊙). A similar observation was reported by Berlind et al. (2003) and Zheng et al. (2005) for galaxies with young stellar populations: Due to quenching, massive halos are unlikely to have central galaxies with young stars suppressing ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ |m⟩ above a certain halo mass. Another case are emission line galaxies (Guo et al. 2019) where both the central and satellite number decreases with mass for massive halos. Therefore, our specific model implementation is not directly applicable to other lens sample selections, and the HOD model needs to be adjusted on a case-by-case basis. Another issue is the assumption of a Poissonian variance for satellite numbers. Contrary to this assumption, the SAM galaxies by Henriques et al. (2015) have a sub-Poissonian variance for low-mass halos and super-Poissonian variance for high-mass halos, as we show in Fig. B.1. We investigated the impact of this effect by comparing the best-fitting G3L model to the prediction by a modified model using the actual variance ⟨ $N_{sat}^{2} | m$ ${N_{\rm sat}^2|m}$ ⟩ of the SAM galaxies. The difference between the two G3L signals is smaller than the statistical errors in the verification data, so we can neglect non-Poissonian variances here (see Appendix B). However, a G3L analysis in future surveys may require a more sophisticated model with additional parameters for the satellite number variance, such as the models in Dvornik et al. (2018). Relaxing the Poisson assumption is a new test for galaxy models with G3L. While some simulations assume Poisson satellites (Kravtsov et al. 2004; Zheng et al. 2005), recent studies suggest otherwise (Dvornik et al. 2018; Gruen et al. 2018). Despite the above model approximations, our inferred HODs agree with the true HODs of the validation data within the 68% CI. Therefore, these approximations are acceptable for surveys at the same level (or worse) of statistical estimator noise as our validation data.

Aside from increasing the survey area or the source density in a new survey, better HOD constraints are achievable by tapping into two more sources of information in the observational data – after a future revision of the model and the statistical analysis. First, the lens redshifts can be exploited in the redshift-weighting scheme of lens pairs by Linke et al. (2020b). In this scheme, lens galaxy pairs that appear close in projection but are physically distant are down-weighted compared to physically close lens pairs in estimating the G3L correlation function. This weighting increases the signal-to-noise by up to 35% for lenses with spectroscopic redshifts. However, incorporating the scheme into the halo model requires abandoning the Limber approximation and revising Eq. (12). Without the Limber approximation, the projected bispectrum includes three line-of-sight integrals over Bessel functions, whose evaluation is computationally challenging. There are several efforts to obtain projected matter power- and bispectra without relying on the Limber approximation (Assassi et al. 2017; Campagne et al. 2017; Deshpande & Kitching 2020), which might be adopted for a weighted G3L. Second, we expect the parameter constraints to improve by combining G3L with other statistics, breaking degeneracies between HOD parameters (Sect. 3.3). For example, second-order galaxy-galaxy lensing or the mean number density of the lens galaxies are easily available from the same observational data. In particular, the mean number density can be measured to high precision, and it depends directly on ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ |m⟩+⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ |m⟩, while G3L alone depends on the second-order moments of the central- and satellite galaxy numbers. The overall benefit of this combination is unclear because it requires a refinement of the statistical analysis in order to account for the (expected) covariance between the different probes. Focusing here purely on constraints with G3L, we also leave this promising prospect to a future analysis.

In conclusion, G3L with our halo model is a viable new method to infer the HODs of galaxies and the correlation between the halo distributions of galaxies from different populations. This information is useful for several other analyses. First, HODs offer a fast method to obtain realistic mock galaxy catalogues from halo catalogues (e.g. Carretero et al. 2015; Avila et al. 2018), which is faster than using SAMs or hydrodynamical simulations. Realistic mock galaxies are vital to validate the redshift calibration (e.g. van den Busch et al. 2020; Hildebrandt et al. 2021) or inference pipelines (e.g. Ferrero et al. 2021; DeRose et al. 2022). Including the cross-correlation of galaxy samples constrained with G3L will increase the realism of the mock galaxy distribution and, therefore, the robustness of the analyses based on these simulated data. Second, HODs can be used to provide a physical interpretation of the galaxy bias, which tells us how galaxies trace the cosmic large-scale structure (Cacciato et al. 2012; Simon & Hilbert 2018; Dvornik et al. 2018). Cosmological studies often use simple linear or quadratic models for galaxy biasing (Joachimi et al. 2021; Krause et al. 2021). They, therefore, need to exclude small scales from their analysis, where the simple models are not sufficiently accurate. HODs may provide a more accurate model on small scales so that cosmological parameters could be inferred from a larger scale range. However, the HODs obtained in this work depend on the assumed fiducial cosmology. Accordingly, using the G3L halo model for cosmological inference requires simultaneously constraining the HOD and cosmological parameters. This inference might be unfeasible in practice due to the high-dimensional parameter space. Third, HODs are important tools to constrain the stellar-to-halo mass ratio, traditionally obtained from second-order statistics, such as galaxy-galaxy lensing (Velander et al. 2014; van Uitert et al. 2018; Dvornik et al. 2020). Combining these measurements with the HOD constraints from G3L could lead to tighter constraints on the relationship between baryonic and dark matter.

¹

In their original derivation, Navarro et al. (1996) defined the mass by a sphere of enclosed mean density 200 ρ_crit. However, the fitting function for the concentration by Bullock et al. (2001), employed here, defines m in terms of the mean cosmic density $\bar{ρ}$ $\bar{\rho}$ .

²

The ray-tracing algorithm introduces a smoothing, which lowers the lensing power spectrum for large ℓ; for ℓ < 2 × 10⁴, however, the bias is below 5% (Hilbert et al. 2009). The smoothing bias beyond ℓ ∼ 10⁴ is almost irrelevant for the G3L aperture statistics, as the filter function $\hat{U}$ $\hat{U}$ in Eq. (17) decreases sharply for high ℓ: For a small aperture of $θ = 0 \overset{'}{.} 5$ $\theta=0{{\overset{\prime}{.}}}5$ , we have ${\hat{U}}_{θ}^{3} (ℓ \geq 5 \times 10^{4}) < 10^{- 3} \max [{\hat{U}}_{θ}^{3} (ℓ)]$ $\hat{U}^3_\theta(\ell\ge5\times 10^4) < 10^{-3}\,\mathrm{max}[\hat{U}^3_\theta(\ell)]$ .

³

https://github.com/llinke1/g3lhalo

⁴

We note that an improper PDF P_m is irrelevant here, as long as its moments are well defined, which is the case for the HMF.

Acknowledgments

We thank Elisa Chisari for her helpful comments and for acting as internal referee for the KiDS Collaboration. This work has been supported by the Deutsche Forschungsgemeinschaft through the project SCHN 342/15-1. LL received financial support for this research from the International Max Planck Research School (IMPRS) for Astronomy and Astrophysics at the Universities of Bonn and Cologne. AHW is supported by a European Research Council Consolidator Grant (No. 770935). Based on data products from observations made with ESO Telescopes at the La Silla Paranal Observatory under programme IDs 177.A-3016, 177.A-3017, 177.A-3018, 179.A-2004, 298.A-5015. We also use products from the GAMA survey. GAMA is a joint European-Australasian project based around a spectroscopic campaign using the Anglo-Australian Telescope. The GAMA input catalogue is based on data taken from the Sloan Digital Sky Survey and the UKIRT Infrared Deep Sky Survey. Complementary imaging of the GAMA regions is being obtained by several independent survey programmes, including GALEX MIS, VST KiDS, VISTA VIKING, WISE, Herschel-ATLAS, GMRT and ASKAP, providing UV to radio coverage. GAMA is funded by the STFC (UK), the ARC (Australia), the AAO, and the participating institutions. The GAMA website is http://www.gama-survey.org/. Author contributions. All authors contributed to the development and writing of this paper. The authorship list is given in two groups: The lead authors (LL, PSi, PS), followed by an alphabetical list of contributors to either the scientific analysis or the data products.

References

Anderson, T. W. 2003, An Introduction to Multivariate Statistical Analysis (Wiley-Interscience) [Google Scholar]
Assassi, V., Simonović, M., & Zaldarriaga, M. 2017, J. Cosmol. Astropart. Phys., 2017, 054 [CrossRef] [Google Scholar]
Avila, S., Crocce, M., Ross, A. J., et al. 2018, MNRAS, 479, 94 [NASA ADS] [CrossRef] [Google Scholar]
Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [Google Scholar]
Berlind, A. A., & Weinberg, D. H. 2002, ApJ, 575, 587 [Google Scholar]
Berlind, A. A., Weinberg, D. H., Benson, A. J., et al. 2003, ApJ, 593, 1 [NASA ADS] [CrossRef] [Google Scholar]
Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [Google Scholar]
Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000 [NASA ADS] [CrossRef] [Google Scholar]
Bullock, J. S., Kolatt, T. S., Sigad, Y., et al. 2001, MNRAS, 321, 559 [Google Scholar]
Cacciato, M., Lahav, O., van den Bosch, F. C., Hoekstra, H., & Dekel, A. 2012, MNRAS, 426, 566 [Google Scholar]
Calzetti, D., Armus, L., Bohlin, R. C., et al. 2000, ApJ, 533, 682 [NASA ADS] [CrossRef] [Google Scholar]
Campagne, J. E., Neveu, J., & Plaszczynski, S. 2017, A&A, 602, A72 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Carretero, J., Castander, F. J., Gaztañaga, E., Crocce, M., & Fosalba, P. 2015, MNRAS, 447, 646 [NASA ADS] [CrossRef] [Google Scholar]
Chabrier, G. 2003, PASP, 115, 763 [Google Scholar]
Clampitt, J., Miyatake, H., Jain, B., & Takada, M. 2016, MNRAS, 457, 2391 [NASA ADS] [CrossRef] [Google Scholar]
Clampitt, J., Sánchez, C., Kwan, J., et al. 2017, MNRAS, 465, 4204 [NASA ADS] [CrossRef] [Google Scholar]
Cooray, A., & Sheth, R. 2002, Phys. Rep., 372, 1 [Google Scholar]
Crittenden, R. G., Natarajan, P., Pen, U.-L., & Theuns, T. 2002, ApJ, 568, 20 [NASA ADS] [CrossRef] [Google Scholar]
de Jong, J. T. A., Verdoes Kleijn, G. A., Boxhoorn, D. R., et al. 2015, A&A, 582, A62 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
DeRose, J., Wechsler, R. H., Becker, M. R., et al. 2022, Phys Rev. D, 105, 123520 [NASA ADS] [CrossRef] [Google Scholar]
Deshpande, A. C., & Kitching, T. D. 2020, Phys Rev. D, 101, 103531 [NASA ADS] [CrossRef] [Google Scholar]
Driver, S. P., Norberg, P., Baldry, I. K., et al. 2009, Astron. Geophys., 50, 12 [Google Scholar]
Dvornik, A., Hoekstra, H., Kuijken, K., et al. 2018, MNRAS, 479, 1240 [Google Scholar]
Dvornik, A., Hoekstra, H., Kuijken, K., et al. 2020, A&A, 642, A83 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Edge, A., Sutherland, W., Kuijken, K., et al. 2013, The Messenger, 154, 32 [NASA ADS] [Google Scholar]
Eisenstein, D. J., & Hu, W. 1998, ApJ, 496, 605 [Google Scholar]
Erben, T., Schirmer, M., Dietrich, J. P., et al. 2005, Astron. Nachr., 326, 432 [NASA ADS] [CrossRef] [Google Scholar]
Farrow, D. J., Cole, S., Norberg, P., et al. 2015, MNRAS, 454, 2120 [Google Scholar]
Ferrero, I., Crocce, M., Tutusaus, I., et al. 2021, A&A, 656, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]
Gough, B. 2009, GNU Scientific Library Reference Manual– Third Edition (Network Theory Ltd.) [Google Scholar]
Gruen, D., Friedrich, O., Krause, E., et al. 2018, Phys Rev. D, 98, 023507 [NASA ADS] [CrossRef] [Google Scholar]
Guo, H., Yang, X., Raichoor, A., et al. 2019, ApJ, 871, 147 [NASA ADS] [CrossRef] [Google Scholar]
Hadzhiyska, B., Bose, S., Eisenstein, D., Hernquist, L., & Spergel, D. N. 2020, MNRAS, 493, 5506 [NASA ADS] [CrossRef] [Google Scholar]
Hartlap, J., Simon, P., & Schneider, P. 2007, A&A, 464, 399 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]
Hilbert, S., Hartlap, J., White, S. D. M., & Schneider, P. 2009, A&A, 499, 31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hildebrandt, H., Köhlinger, F., van den Busch, J. L., et al. 2020, A&A, 633, A69 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]
Ishikawa, S., Okumura, T., Oguri, M., & Lin, S.-C. 2021, ApJ, 922, 23 [NASA ADS] [CrossRef] [Google Scholar]
Jarvis, M., Bernstein, G., & Jain, B. 2004, MNRAS, 352, 338 [Google Scholar]
Joachimi, B., Lin, C. A., Asgari, M., et al. 2021, A&A, 646, A129 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Kaiser, N. 1992, ApJ, 388, 272 [Google Scholar]
Kannawadi, A., Hoekstra, H., Miller, L., et al. 2019, A&A, 624, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Krause, E., Fang, X., Pandey, S., et al. 2021, ArXiv e-prints [arXiv:2105.13548]. [Google Scholar]
Kravtsov, A. V., Berlind, A. A., Wechsler, R. H., et al. 2004, ApJ, 609, 35 [Google Scholar]
Kuijken, K., Heymans, C., Hildebrandt, H., et al. 2015, MNRAS, 454, 3500 [Google Scholar]
Linke, L., Simon, P., Schneider, P., et al. 2020a, A&A, 640, A59 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Linke, L., Simon, P., Schneider, P., & Hilbert, S. 2020b, A&A, 634, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Liske, J., Baldry, I. K., Driver, S. P., et al. 2015, MNRAS, 452, 2087 [Google Scholar]
Liu, J. S. 2004, Monte Carlo Strategies in Scientific Computing, 1st edn. (New York, NY: Springer), 31 [CrossRef] [Google Scholar]
Mandelbaum, R., Hirata, C. M., Broderick, T., Seljak, U., & Brinkmann, J. 2006, MNRAS, 370, 1008 [Google Scholar]
Maraston, C. 2005, MNRAS, 362, 799 [NASA ADS] [CrossRef] [Google Scholar]
Martin, S. M. 2019, Ph.D. Thesis, University of Bonn, Germany [Google Scholar]
Mead, A. J., Peacock, J. A., Heymans, C., Joudaki, S., & Heavens, A. F. 2015, MNRAS, 454, 1958 [NASA ADS] [CrossRef] [Google Scholar]
Miller, L., Heymans, C., Kitching, T. D., et al. 2013, MNRAS, 429, 2858 [Google Scholar]
Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]
Nakamura, T. T., & Suto, Y. 1997, Prog. Theor. Phys., 97, 49 [Google Scholar]
Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 [Google Scholar]
Nelder, J. A., & Mead, R. 1965, Comput. J., 7, 308 [Google Scholar]
Planck Collaboration I. 2020, A&A, 641, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Rödiger, J. 2009, Ph.D. Thesis, University of Bonn, Germany [Google Scholar]
Ross, A. J., & Brunner, R. J. 2009, MNRAS, 399, 878 [NASA ADS] [CrossRef] [Google Scholar]
Saghiha, H., Simon, P., Schneider, P., & Hilbert, S. 2017, A&A, 601, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Schirmer, M. 2013, ApJS, 209, 21 [NASA ADS] [CrossRef] [Google Scholar]
Schneider, P., Kilbinger, M., & Lombardi, M. 2005, A&A, 431, 9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Schneider, P., & Watts, P. 2005, A&A, 432, 783 [CrossRef] [EDP Sciences] [Google Scholar]
Scoccimarro, R., Sheth, R. K., Hui, L., & Jain, B. 2001, ApJ, 546, 20 [NASA ADS] [CrossRef] [Google Scholar]
Scranton, R. 2001, MNRAS, 332, 697 [Google Scholar]
Scranton, R. 2002, MNRAS, 339, 410 [Google Scholar]
Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119 [Google Scholar]
Simon, P., & Hilbert, S. 2018, A&A, 613, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Simon, P., Watts, P., Schneider, P., et al. 2008, A&A, 479, 655 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Simon, P., Hetterscheidt, M., Wolf, C., et al. 2009, MNRAS, 398, 807 [NASA ADS] [CrossRef] [Google Scholar]
Simon, P., Erben, T., Schneider, P., et al. 2013, MNRAS, 430, 2476 [Google Scholar]
Smith, R. E., Scoccimarro, R., & Sheth, R. K. 2007, Phys Rev. D, 75 [CrossRef] [Google Scholar]
Springel, V., White, S. D. M., Jenkins, A., et al. 2005, Nature, 435, 629 [Google Scholar]
Szapudi, I., & Szalay, A. S. 1998, ApJ, 494, L41 [NASA ADS] [CrossRef] [Google Scholar]
Taylor, E. N., Hopkins, A. M., Baldry, I. K., et al. 2011, MNRAS, 418, 1587 [Google Scholar]
Tegmark, M., Taylor, A. N., & Heavens, A. F. 1997, ApJ, 480, 22 [NASA ADS] [CrossRef] [Google Scholar]
van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
van Uitert, E., Joachimi, B., Joudaki, S., et al. 2018, MNRAS, 476, 4662 [NASA ADS] [CrossRef] [Google Scholar]
Velander, M., van Uitert, E., Hoekstra, H., et al. 2014, MNRAS, 437, 2111 [Google Scholar]
Venemans, B. P., Verdoes Kleijn, G. A., Mwebaze, J., et al. 2015, MNRAS, 453, 2259 [Google Scholar]
Vogelsberger, M., Marinacci, F., Torrey, P., & Puchwein, E. 2020, Nat. Rev. Phys., 2, 42 [Google Scholar]
Wang, Y., Yang, X., Mo, H. J., & van den Bosch, F. C. 2007, ApJ, 664, 608 [NASA ADS] [CrossRef] [Google Scholar]
Watts, P., & Schneider, P. 2005, in Gravitational Lensing Impact on Cosmology, eds. Y. Mellier, & G. Meylan, IAU Symp., 225, 243 [NASA ADS] [Google Scholar]
Weisstein, E. W. 2022, Delta Function. From MathWorld–A Wolfram Web Resource, http://mathworld.wolfram.com/DeltaFunction.html, Last visited on 02/3/2022 [Google Scholar]
White, S. D. M., & Rees, M. J. 1978, MNRAS, 183, 341 [Google Scholar]
Wright, A. H., Robotham, A. S. G., Bourne, N., et al. 2016, MNRAS, 460, 765 [Google Scholar]
Wright, A. H., Hildebrandt, H., Kuijken, K., et al. 2019, A&A, 632, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2005, ApJ, 630, 1 [Google Scholar]
Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2011, ApJ, 736, 59 [NASA ADS] [CrossRef] [Google Scholar]
Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]
Zheng, Z., Berlind, A. A., Weinberg, D. H., et al. 2005, ApJ, 633, 791 [NASA ADS] [CrossRef] [Google Scholar]
Zheng, Z., Coil, A. L., & Zehavi, I. 2007, ApJ, 667, 760 [Google Scholar]

Appendix A: Calculation of galaxy-galaxy-matter bispectrum

Here we derive the galaxy-galaxy-matter bispectrum for mixed (a ≠ b) and unmixed (a = b) galaxy pairs in a halo model. While the bispectrum for unmixed pairs has already been derived partially in Watts et al. (2005), Rödiger (2009), and Martin (2019), a model for mixed pairs is, to our knowledge, still pending and therefore shown in detail here.

Our model assumes that all statistics of matter and galaxies inside halos only depend on halo mass, halo positions are clustered according to a deterministic linear bias, matter and galaxy number density profiles are spherically symmetric, satellite galaxies have no sub-halos, and galaxy positions inside halos are statistically independent of each other. These approximations greatly simplify the model formalism while still being sufficiently accurate to describe the G3L aperture statistics in the science verification data. For brevity, we drop all explicit redshift dependencies, and all quantities are understood to be evaluated at the same redshift.

The halo model averages over the probability density functions (PDFs) of the halos masses, positions, galaxy numbers and galaxy positions. The average of an arbitrary quantity f(…) depending on the halo variables is, for H halos distributed in the volume V and containing two distinct galaxy samples a ≠ b,

$\begin{matrix} ⟨ f ⟩ & = \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} \underset{PDF of halo masses m_{1}, \dots, m_{H}}{\underset{⏟}{P_{m} (m_{1}, \dots, m_{H})}} \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} \underset{PDF of halo centres x_{1}, \dots, x_{H}}{\underset{⏟}{P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H})}} \\ \times \prod_{h = 1}^{H} [\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} \sum_{N_{cen, h}^{b} = 0}^{\infty} \sum_{N_{sat, h}^{b} = 0}^{\infty} \underset{\begin{matrix} Probability that halo h has N_{cen, h}^{a} centrals and N_{sat, h}^{a} satellites of sample a, \\ N_{cen, h}^{b} centrals and N_{sat, h}^{b} satellites of sample b \end{matrix}}{\underset{⏟}{P_{N}^{ab} (N_{cen, h}^{a}, N_{sat, h}^{a}, N_{cen, h}^{b}, N_{sat, h}^{b} | m_{h})}} \\ \times \prod_{v = 1}^{N_{sat, h}^{a}} (\int [3] Δ x_{hv}^{a}) \prod_{w = 1}^{N_{sat, h}^{b}} (\int [3] Δ x_{hw}^{b}) \underset{PDF of satellite offsets Δ x relative to halo centre}{\underset{⏟}{P_{g}^{ab} (Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a}, Δ x_{h 1}^{b}, \dots, Δ x_{h N_{sat, h}^{b}}^{b} | m_{h})}}] f (\dots) . \end{matrix}$ $\begin{aligned} \langle {f}{\rangle }&= \int _0^\infty {m_1}\; \dots \int _0^\infty {m_{H}}\;\underbrace{P_{\rm m}(m_1, \dots , m_{H})}_{{\text{PDF} \text{ of} \text{ halo} \text{ masses} m_1, \dots , m_H}}\, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; \underbrace{P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})}_{{\text{PDF} \text{ of} \text{ halo} \text{ centres} \boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}}} \\&\nonumber \quad \times \prod _{h=1}^{H}\Bigg [ \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty \sum _{N_{\mathrm{cen} , h}^{b}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{b}=0}^\infty \underbrace{P^{ab}_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}, N_{\mathrm{cen} , h}^{b}, N_{\mathrm{sat} , h}^{b}\,|\,m_h)}_{\begin{matrix} \text{ Probability} \text{ that} \text{ halo} { h} \text{ has} N_{\mathrm{cen} , h}^{a} \text{ centrals} \text{ and} N_{\mathrm{sat} , h}^{a} \text{ satellites} \text{ of} \text{ sample} a,\\ N_{\mathrm{cen} , h}^{b} \text{ centrals} \text{ and} N_{\mathrm{sat} , h}^{b} \text{ satellites} \text{ of} \text{ sample} b \end{matrix}}\\&\nonumber \quad \quad \quad \times \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}}\left(\int [3]{\Delta {x}^{a}_{hv}}\right)\, \prod _{w=1}^{N_{\mathrm{sat} , h}^{b}}\left(\int [3]{\Delta {x}^{b}_{hw}}\right)\, \underbrace{P^{ab}_\mathrm{g} (\Delta \boldsymbol{x}^{a}_{h1}, \dots , \Delta \boldsymbol{x}^{a}_{hN_{\mathrm{sat} , h}^{a}}, \Delta \boldsymbol{x}^{b}_{h1}, \dots , \Delta \boldsymbol{x}^{b}_{hN_{\mathrm{sat} , h}^{b}}\,|\, m_h)}_{\text{PDF} \text{ of} \text{ satellite} \text{ offsets} \Delta \boldsymbol{x} \text{ relative} \text{ to} \text{ halo} \text{ centre}}\Bigg ]\, f(\dots )\;. \end{aligned}$ (A.1)

For two identical galaxy samples a = b, this expression simplifies to

$\begin{matrix} ⟨ f ⟩ & = \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} P_{m} (m_{1}, \dots, m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} [\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} P_{N}^{a} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} (\int [3] Δ x_{hv}^{a}) P_{g}^{a} (Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a} | m_{h})] f (\dots), \end{matrix}$ $\begin{aligned} \langle {f}{\rangle }&= \int _0^\infty {m_1}\; \dots \int _0^\infty {m_{H}}\;{P_{\rm m}(m_1, \dots , m_{H})}\, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; {P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})} \\&\nonumber \quad \times \prod _{h=1}^{H}\Bigg [ \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty {P^a_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}\,|\,m_h)} \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}}\left(\int [3]{\Delta {x}^{a}_{hv}}\right)\, {P^a_\mathrm{g} (\Delta \boldsymbol{x}^{a}_{h1}, \dots , \Delta \boldsymbol{x}^{a}_{hN_{\mathrm{sat} , h}^{a}}\,|\, m_h)}\Bigg ]\, f(\dots )\;, \end{aligned}$ (A.2)

where $P_{N}^{a} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h})$ $P^a_\mathrm{N}({N_{\mathrm{cen}, {h}}^{{a}}}, {N_{\mathrm{sat}, {h}}^{{a}}}\,|\,m_h)$ is the joint probability for halo h to have $N_{cen, h}^{a}$ ${N_{\mathrm{cen}, h}^{a}}$ central and $N_{sat, h}^{a}$ ${N_{\mathrm{sat}, h}^{a}}$ satellite galaxies from sample a, and $P_{g}^{a} (Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a} | m_{h})$ $P^a_{\mathrm{g}}(\Delta \boldsymbol{x}^{a}_{h1}, \dots, \Delta \boldsymbol{x}^{a}_{h{N_{\text{sat}, h}^{a}}}\,|\, m_h)$ is the PDF for the positions $Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a}$ $\Delta \boldsymbol{x}^{a}_{h1}, \dots, \Delta \boldsymbol{x}^{a}_{h{N_{\text{sat}, h}^{a}}}$ of these satellites relative to the halo centre. The joint PDF P_m of halo masses, assuming that halo masses are independent of each other, is a product of the HMF,

$\begin{matrix} P_{m} (m_{1} \dots m_{H}) = {\bar{n}}_{h}^{- H} n (m_{1}) \dots n (m_{H}), \end{matrix}$ $\begin{aligned} P_\mathrm{m} (m_1 \dots m_H) = \bar{n}_\mathrm{h} ^{-H} \, n(m_1) \dots n(m_H)\;, \end{aligned}$ (A.3)

where ${\bar{n}}_{h} = H / V$ $\bar{n}_{\mathrm{h}}=H/V$ is the mean halo number density⁴. The PDF $P_{g}^{a b}$ $P^{ab}_\mathrm{g}$ is a product of the normalised spatial distributions $u_{g}^{a} (Δ x | m)$ $u_\mathrm{g}^a(\Delta \boldsymbol{x}|m)$ of satellites in each halo,

$\begin{matrix} P_{g}^{ab} (Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a}, Δ x_{h 1}^{b}, \dots, Δ x_{h N_{sat, h}^{b}}^{b} | m_{h}) & = P_{g}^{a} (Δ x_{h 1}^{a}, \dots, Δ x_{h N_{sat, h}^{a}}^{a} | m_{h}) P_{g}^{b} (Δ x_{h 1}^{b}, \dots, Δ x_{h N_{sat, h}^{b}}^{b} | m_{h}) \end{matrix}$ $\begin{aligned} P^{ab}_\mathrm{g} (\Delta \boldsymbol{x}^{a}_{h1}, \dots , \Delta \boldsymbol{x}^{a}_{h N_{\mathrm{sat} , h}^{a}}, \Delta \boldsymbol{x}^{b}_{h1}, \dots , \Delta \boldsymbol{x}^{b}_{ hN_{\mathrm{sat} , h}^{b}}\,|\, m_h)&= P^a_\mathrm{g} (\Delta \boldsymbol{x}^{a}_{h1}, \dots , \Delta \boldsymbol{x}^{a}_{h N_{\mathrm{sat} , h}^{a}}\,|\, m_h)\, P^b_\mathrm{g} (\Delta \boldsymbol{x}^{b}_{h1}, \dots , \Delta \boldsymbol{x}^{b}_{hN_{\mathrm{sat} , h}^{b}}\,|\, m_h)\end{aligned}$ (A.4)

$\begin{matrix} = \prod_{v = 1}^{N_{sat, h}^{a}} [u_{g}^{a} (Δ x_{hv}^{a} | m_{h})] \prod_{w = 1}^{N_{sat, h}^{b}} [u_{g}^{b} (Δ x_{hw}^{b} | m_{h})], \end{matrix}$ $\begin{aligned}&= \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right]\, \prod _{w=1}^{N_{\mathrm{sat} , h}^{b}} \left[ u_\mathrm{g} ^{b}(\Delta \boldsymbol{x}^{b}_{hw}\,|\,m_h) \right]\;, \end{aligned}$ (A.5)

under the assumption of independent satellite positions inside a halo. The joint probability P_N of per-halo galaxy numbers has the moments

$\begin{matrix} ⟨ {(N_{cen}^{a})}^{p} {(N_{sat}^{a})}^{q} {(N_{cen}^{b})}^{r} {(N_{cen}^{b})}^{s} | m ⟩ = \sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} \sum_{N_{cen, h}^{b} = 0}^{\infty} \sum_{N_{sat, h}^{b} = 0}^{\infty} {(N_{cen, h}^{a})}^{p} {(N_{sat, h}^{a})}^{q} {(N_{cen, h}^{b})}^{r} {(N_{sat, h}^{b})}^{s} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a}, N_{cen, h}^{b}, N_{sat, h}^{b} | m_{h}), \end{matrix}$ $\begin{aligned} \langle {(N_{\mathrm{cen} }^{a})^p\,(N_{\mathrm{sat} }^{a})^q\,(N_{\mathrm{cen} }^{b})^r\,(N_{\mathrm{cen} }^{b})^s\,|\, m}{\rangle } = \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty \sum _{N_{\mathrm{cen} , h}^{b}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{b}=0}^\infty (N_{\mathrm{cen} , h}^{a})^p\, (N_{\mathrm{sat} , h}^{a})^q\, (N_{\mathrm{cen} , h}^{b})^r\, (N_{\mathrm{sat} , h}^{b})^s \,{P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}, N_{\mathrm{cen} , h}^{b}, N_{\mathrm{sat} , h}^{b}\,|\,m_h)}\,, \end{aligned}$ (A.6)

where p, q, r, s ∈ ℕ₀.

After inserting Eqs. (49) and (51) into (46) and using Eq. (A.1), the bispectrum $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ for mixed pairs, a ≠ b, is given by

$\begin{matrix} {(2 π)}^{3} B_{gg δ}^{ab} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{H} {\bar{n}}_{g}^{a} {\bar{n}}_{g}^{b} \bar{ρ}} \sum_{i, j, k = 1}^{H} \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} n (m_{1}) \dots n (m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} {\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} \sum_{N_{cen, h}^{b} = 0}^{\infty} \sum_{N_{sat, h}^{b} = 0}^{\infty} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a}, N_{cen, h}^{b}, N_{sat, h}^{b} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} [\int [3] Δ x_{hv}^{a} u_{g}^{a} (Δ x_{hv}^{a} | m_{h})] \prod_{w = 1}^{N_{sat, h}^{b}} [\int [3] Δ x_{hw}^{b} u_{g}^{b} (Δ x_{hw}^{b} | m_{h})]} \\ \times m_{k} \hat{u} (k_{3} | m_{k}) e^{- i k_{3} \cdot x_{k}} [N_{cen, i}^{a} e^{- i k_{1} \cdot x_{i}} + \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} \cdot x_{i} - i k_{1} \cdot Δ x_{il}^{a}}] [N_{cen, j}^{b} e^{- i k_{2} \cdot x_{j}} + \sum_{m = 1}^{N_{sat, j}^{b}} e^{- i k_{2} \cdot x_{j} - i k_{2} \cdot Δ x_{jm}^{b}}], \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^H \bar{n}^a_\mathrm{g} \bar{n}^b_\mathrm{g} \, \bar{\rho }} \sum _{i,j,k=1}^H \int _0^\infty {m_1}\; \dots \int _0^\infty {m_{H}}\; n(m_1) \dots n(m_H)\, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \times \prod _{h=1}^{H}\Biggl \{ \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty \sum _{N_{\mathrm{cen} , h}^{b}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{b}=0}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}, N_{\mathrm{cen} , h}^{b}, N_{\mathrm{sat} , h}^{b}\,|\,m_h)}\, \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ \int [3]{\Delta {x}^{a}_{hv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right]\, \prod _{w=1}^{N_{\mathrm{sat} , h}^{b}} \left[ \int [3]{\Delta {x}^{b}_{hw}}\, u_\mathrm{g} ^{b}(\Delta \boldsymbol{x}^{b}_{hw}\,|\,m_h) \right]\Biggr \}\\&\nonumber \times m_k\, \hat{u}(\boldsymbol{k}_3\,|\,m_k)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k}\, \left[N_{\mathrm{cen} , i}^{a}\,\mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}}\right]\, \left[N_{\mathrm{cen} , j}^{b}\,\mathrm{e} ^{-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j} + \sum _{m=1}^{N_{\mathrm{sat} , j}^{b}} \mathrm{e} ^{-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^b_{jm}}\right]\;, \end{aligned}$ (A.7)

where unconnected terms do not contain δ_D(k₁ + k₂ + k₂), ${\bar{n}}_{g}^{a}$ $\bar{n}^a{_\text{g}}$ is the mean galaxy number density and $\bar{ρ}$ $\bar{\rho}$ is the mean matter density (as in Eqs. 1 and 2). For unmixed pairs, a = b, the bispectrum is given by

$\begin{matrix} {(2 π)}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{H} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i, j, k = 1}^{H} \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} n (m_{1}) \dots n (m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} {\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} [\int [3] Δ x_{hv}^{a} u_{g}^{a} (Δ x_{hv}^{a} | m_{h})]} \\ \times m_{k} \hat{u} (k_{3} | m_{k}) e^{- i k_{3} \cdot x_{k}} [N_{cen, i}^{a} (N_{cen, j}^{a} - δ_{ij}^{K}) e^{- i k_{1} \cdot x_{i} - i k_{2} \cdot x_{j}} + N_{cen, j}^{a} \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{2} \cdot x_{j} - i k_{1} \cdot x_{i} - i k_{1} \cdot Δ x_{il}^{a}} \\ + N_{cen, i}^{a} \sum_{m = 1}^{N_{sat, j}^{a}} e^{- i k_{1} \cdot x_{i} - i k_{2} \cdot x_{j} - i k_{2} \cdot Δ x_{jm}^{a}} + \sum_{l = 1}^{N_{sat, i}^{a}} \sum_{m = 1, m \neq l}^{N_{sat, j}^{a}} e^{- i k_{1} \cdot x_{i} - i k_{1} \cdot Δ x_{il}^{a} - i k_{2} \cdot x_{j} - i k_{2} \cdot Δ x_{jm}^{a}}], \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^H\, (\bar{n}^a_\mathrm{g} )^2 \, \bar{\rho }} \sum _{i,j,k=1}^H \int _0^\infty {m_1}\; \dots \int _0^\infty {m_{H}}\; n(m_1) \dots n(m_H)\, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times \prod _{h=1}^{H}\Biggl \{\sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}\,|\,m_h)}\, \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ \int [3]{\Delta {x}^{a}_{hv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right]\Biggr \}\\&\nonumber \quad \times m_k\, \hat{u}(\boldsymbol{k}_3\,|\,m_k)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k}\,\Bigg [N_{\mathrm{cen} , i}^{a}\,(N_{\mathrm{cen} , j}^{a} - \delta ^\mathrm{K}_{i j})\,\mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j} + N_{\mathrm{cen} , j}^{a}\,\sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k_2}\cdot \boldsymbol{x}_j-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}}\\&\nonumber \quad \quad + N_{\mathrm{cen} , i}^{a}\,\sum _{m=1}^{N_{\mathrm{sat} , j}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k_1}\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{jm}} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \sum _{m=1, m\ne l}^{N_{\mathrm{sat} , j}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i - {i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}_{il}^a-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{jm}}\Bigg ]\;, \end{aligned}$ (A.8)

where $δ_{i j}^{K}$ ${\delta^{\rm K}_{{i} {j}}}$ is the Kronecker symbol.

We divide the sum over i, j and k into the 1-halo term ₁ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ for i = j = k, the 2-halo term ₂ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ for i = j ≠ k, i = k ≠ j, and j = k ≠ i, and the 3-halo term ₃ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ for i ≠ j ≠ k. In the following, we calculate these terms individually.

A.1. 1-halo term

To calculate the 1-halo term ₁ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ with i = j = k, we distinguish between mixed (a ≠ b) and unmixed (a = b) pairs. For a = b, the 1-halo term is given by

$\begin{matrix} {(2 π)}_{1}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) \\ = \frac{1}{{\bar{n}}_{h}^{H} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i = 1}^{H} \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} n (m_{1}) \dots n (m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} {\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} [\int [3] Δ x_{hv}^{a} u_{g}^{a} (Δ x_{hv}^{a} | m_{h})]} m_{i} \hat{u} (k_{3} | m_{i}) e^{- i k_{3} \cdot x_{i}} \\ \times [N_{cen, i}^{a} (N_{cen, i}^{a} - 1) e^{- i (k_{1} + k_{2}) \cdot x_{i}} + N_{cen, i}^{a} \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i (k_{1} + k_{2}) \cdot x_{i} - i k_{1} \cdot Δ x_{il}^{a}} + N_{cen, i}^{a} \sum_{m = 1}^{N_{sat, i}^{a}} e^{- i (k_{1} + k_{2}) \cdot x_{i} - i k_{2} \cdot Δ x_{im}^{a}} + \sum_{l = 1}^{N_{sat, i}^{a}} \sum_{\begin{matrix} m = 1, \\ m \neq l \end{matrix}}^{N_{sat, i}^{a}} e^{- i (k_{1} + k_{2}) \cdot x_{i} - i k_{1} \cdot Δ x_{il}^{a} - i k_{2} \cdot Δ x_{im}^{a}}] . \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{1} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\\&= \frac{1}{\bar{n}_\mathrm{h} ^H\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{i=1}^H \int _0^\infty {m_1}\; \dots \int _0^\infty {m_H}\; n(m_1)\dots n(m_H) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times \prod _{h=1}^{H}\Biggl \{ \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}\,|\,m_h)}\, \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ \int [3]{\Delta {x}^{a}_{hv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right]\Biggr \}\, m_i\, \hat{u}(\boldsymbol{k}_3\,|\,m_i)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_i}\, \\&\nonumber \quad \times \Bigg [N_{\mathrm{cen} , i}^{a}\,(N_{\mathrm{cen} , i}^{a} - 1)\,\mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\cdot \boldsymbol{x}_i} + N_{\mathrm{cen} , i}^{a}\,\sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}} + N_{\mathrm{cen} , i}^{a}\,\sum _{m=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}(\boldsymbol{k_1}+\boldsymbol{k_2})\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{im}} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \sum ^{N_{\mathrm{sat} , i}^{a}}_{\begin{matrix} m=1,\\ m\ne l \end{matrix}} \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\cdot \boldsymbol{x}_i - {i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{im}}\Bigg ]\,. \end{aligned}$ (A.9)

Evaluating all m-integrals aside from the one over m_i (H − 1 integrals in total) leads to

$\begin{matrix} {(2 π)}_{1}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) \\ = \frac{1}{{\bar{n}}_{h}^{H} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i = 1}^{H} {\bar{n}}_{h}^{H - 1} \int_{0}^{\infty} m_{i} n (m_{i}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) e^{- i (k_{1} + k_{2} + k_{3}) \cdot x_{i}} \\ \times \sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} P_{N} (N_{cen, i}^{a}, N_{sat, i}^{a} | m_{i}) \prod_{v = 1}^{N_{sat, i}^{a}} [\int [3] Δ x_{iv}^{a} u_{g}^{a} (Δ x_{iv}^{a} | m_{i})] m_{i} \hat{u} (k_{3} | m_{i}) \\ \times [N_{cen, i}^{a} (N_{cen, i}^{a} - 1) + N_{cen, i}^{a} \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} \cdot Δ x_{il}^{a}} + N_{cen, i}^{a} \sum_{m = 1}^{N_{sat, i}^{a}} e^{- i k_{2} \cdot Δ x_{im}^{a}} + \sum_{l = 1}^{N_{sat, i}^{a}} \sum_{m = 1, m \neq l}^{N_{sat, i}^{a}} e^{- i k_{1} \cdot Δ x_{il}^{a} - i k_{2} \cdot Δ x_{im}^{a}}] . \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{1} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\\&= \frac{1}{\bar{n}_\mathrm{h} ^H\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{i=1}^H \bar{n}_\mathrm{h} ^{H-1} \int _0^\infty {m_i}\; n(m_i) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1 + \boldsymbol{k}_2 +\boldsymbol{k}_3)\cdot \boldsymbol{x}_i}\, \\&\nonumber \quad \times \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , i}^{a}, N_{\mathrm{sat} , i}^{a}\,|\,m_i)}\, \prod _{v=1}^{N_{\mathrm{sat} , i}^{a}} \left[ \int [3]{\Delta {x}^{a}_{iv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{iv}\,|\,m_i) \right]\, m_i\, \hat{u}(\boldsymbol{k}_3\,|\,m_i) \\&\nonumber \quad \times \Bigg [N_{\mathrm{cen} , i}^{a}\,(N_{\mathrm{cen} , i}^{a} - 1) + N_{\mathrm{cen} , i}^{a}\,\sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}} + N_{\mathrm{cen} , i}^{a}\,\sum _{m=1}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{im}} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}} \sum _{m=1, m\ne l}^{N_{\mathrm{sat} , i}^{a}} \mathrm{e} ^{ - {i}\boldsymbol{k}_1\cdot \Delta \boldsymbol{x}^a_{il}-{i}\boldsymbol{k}_2\cdot \Delta \boldsymbol{x}^a_{im}}\Bigg ]\;. \end{aligned}$ (A.10)

Using Eqs. (A.6), ∫ⅆ[3]x ug^a(x | m) = 1, and $\int d^{3} x u g^{a} (x | m) exp (- i k \cdot x) = \hat{u} g^{a} (k | m)$ $\int \mathrm{d}^{3}x\; u\mathrm{g}^a(\mathbf{x}\,|\,m)\, \exp(-{{i}}\boldsymbol{k}\cdot \boldsymbol{x}) = \hat{u}\mathrm{g}^a(\boldsymbol{k}\,|\,m)$ , we find

$\begin{matrix} {(2 π)}_{1}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) \\ = \frac{1}{{\bar{n}}_{h} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i = 1}^{H} \int_{0}^{\infty} m_{i} n (m_{i}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) e^{- i (k_{1} + k_{2} + k_{3}) \cdot x_{i}} m_{i} \hat{u} (k_{3} | m_{i}) \\ \times {⟨ N_{cen}^{a} (N_{cen}^{a} - 1) | m_{i} ⟩ + ⟨ N_{cen}^{a} N_{sat}^{a} | m_{i} ⟩ [{\hat{u}}_{g}^{a} (k_{1} | m_{i}) + {\hat{u}}_{g}^{a} (k_{2} | m_{i})] + ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m_{i} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{i}) {\hat{u}}_{g}^{a} (k_{2} | m_{i})}, \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{1} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\\&= \frac{1}{\bar{n}_\mathrm{h} \,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{i=1}^H \int _0^\infty {m_i}\; n(m_i) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1 + \boldsymbol{k}_2 +\boldsymbol{k}_3)\cdot \boldsymbol{x}_i}\, m_i\, \hat{u}(\boldsymbol{k}_3\,|\,m_i) \\&\nonumber \quad \times \left\{ \langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{a} - 1)\,|\,m_i}{\rangle } + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{a}\,|\,m_i}{\rangle } \left[ \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_i) + \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_i) \right] +\langle {N_{\mathrm{sat} }^{a}\; (N_{\mathrm{sat} }^{a}-1)\,|\, m_i}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_i)\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_i)\right\} \;, \end{aligned}$ (A.11)

explicitly using the second-order moments of P_N in the last line.

We consider large volumes V here, rendering the V-integrals over the halo positions x with exp−ik ⋅ x in the integrand essentially Fourier transforms. This can, for example, be seen for a cubic volume V_s with side length s,

$\begin{matrix} \int_{V_{s}} [3] x exp (- i k \cdot x) = \frac{8}{k_{x} k_{y} k_{z}} sin (\frac{s k_{x}}{2}) sin (\frac{s k_{y}}{2}) sin (\frac{s k_{z}}{2}), \end{matrix}$ $\begin{aligned} \int _{V_s} [3]{x} \, \exp (-{i}\boldsymbol{k}\cdot \boldsymbol{x}) = \frac{8}{k_x k_y k_z} \sin (\frac{s k_x}{2})\,\sin (\frac{s k_y}{2})\,\sin (\frac{s k_z}{2})\;, \end{aligned}$ (A.12)

where the k_i are the components of k. For large V_s (s → ∞), this expression approximates a Dirac-distribution (e.g. Weisstein 2022), as

$\begin{matrix} lim_{s \to \infty} \frac{sin (s k)}{k} = π δ_{D} (k) \end{matrix}$ $\begin{aligned} \lim _{s\rightarrow \infty } \frac{\sin (s\,k)}{k} = \pi \delta _{\rm D}(k) \end{aligned}$ (A.13)

and hence

$\begin{matrix} lim_{s \to \infty} \int_{V_{s}} [3] x exp (- i k \cdot x) = {(2 π)}^{3} δ_{D} (x) . \end{matrix}$ $\begin{aligned} \lim _{s\rightarrow \infty }\int _{V_s} [3]{x} \, \exp (-{i}\boldsymbol{k}\cdot \boldsymbol{x})=(2\pi )^3 \, \delta _{\rm D}(\boldsymbol{x})\;. \end{aligned}$ (A.14)

In this limit (assumed henceforth),

$\begin{matrix} \int_{V} [3] x_{1} \dots [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) e^{- i (k_{1} + k_{2} + k_{3}) \cdot x_{i}} = \int_{V} [3] x_{i} \frac{1}{V} e^{- i (k_{1} + k_{2} + k_{3}) \cdot x_{i}} = \frac{{(2 π)}^{3} {\bar{n}}_{h}}{H} δ_{D} (k_{1} + k_{2} + k_{3}), \end{matrix}$ $\begin{aligned} \int _V [3]{x_1} \, \dots \, [3]{x_H}\; {P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})}\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1 + \boldsymbol{k}_2 +\boldsymbol{k}_3)\cdot \boldsymbol{x}_i} = \int _V [3]{x_i}\; \frac{1}{V} \, \mathrm{e} ^{-{i}(\boldsymbol{k}_1 + \boldsymbol{k}_2 +\boldsymbol{k}_3)\cdot \boldsymbol{x}_i}= \frac{(2\pi )^3 \, \bar{n}_\mathrm{h} }{H}\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\;, \end{aligned}$ (A.15)

which leads to

$\begin{matrix} _{1} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) & = \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m n (m) m \hat{u} (k_{3} | m) {⟨ N_{cen}^{a} (N_{cen}^{a} - 1) | m ⟩ + ⟨ N_{cen}^{a} N_{sat}^{a} | m ⟩ [{\hat{u}}_{g}^{a} (k_{1} | m) + {\hat{u}}_{g}^{a} (k_{2} | m)] \end{matrix}$ $\begin{aligned} _{1} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)&= \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }}\, \int _0^\infty {m}\; n(m)\, m\, \hat{u}(\boldsymbol{k}_3\,|\,m) \, \Big \{\langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{a} - 1)\,|\,m}{\rangle } + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } \left[ \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m) + \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m) \right]\end{aligned}$ (A.16)

$\begin{matrix} + ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m ⟩ {\hat{u}}_{g}^{a} (k_{1} | m) {\hat{u}}_{g}^{a} (k_{2} | m)} \\ = \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m n (m) m \hat{u} (k_{3} | m) G^{aa} (k_{1}, k_{2} | m), \end{matrix}$ $\begin{aligned}&\nonumber \quad \quad +\langle {N_{\mathrm{sat} }^{a}\; (N_{\mathrm{sat} }^{a}-1)\,|\, m}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m)\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m)\Big \} \;\\&= \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }}\, \int _0^\infty {m}\; n(m)\, m\, \hat{u}(\boldsymbol{k}_3\,|\,m)\, G^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\, m) \;, \end{aligned}$ (A.17)

with k₃ = −k₁ − k₂ and

$\begin{matrix} G^{ab} (k_{1}, k_{2} | m) \\ = ⟨ N_{cen}^{a} (N_{cen}^{a} - δ_{ab}^{K}) | m ⟩ + ⟨ N_{cen}^{a} N_{sat}^{b} | m ⟩ {\hat{u}}_{g}^{b} (k_{2} | m) + ⟨ N_{cen}^{b} N_{sat}^{a} | m ⟩ {\hat{u}}_{g}^{a} (k_{1} | m) + ⟨ N_{sat}^{a} (N_{sat}^{b} - δ_{ab}^{K}) ⟩ {\hat{u}}_{g}^{a} (k_{1} | m) {\hat{u}}_{g}^{b} (k_{2} | m) . \end{matrix}$ $\begin{aligned}&\nonumber G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\,m)\\&= \langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{a}-\delta ^\mathrm{K}_{a b})\,|\,m}{\rangle } + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{b}\,|\,m}{\rangle }\, \hat{u}_\mathrm{g} ^{b}(\boldsymbol{k}_2\,|\,m) + \langle {N_{\mathrm{cen} }^{b}\, N_{\mathrm{sat} }^{a}\,|\,m}{\rangle }\, \hat{u}_\mathrm{g} ^{a}(\boldsymbol{k}_1\,|\,m) + \langle {N_{\mathrm{sat} }^{a}\,(N_{\mathrm{sat} }^{b} - \delta ^\mathrm{K}_{a b})}{\rangle }\, \hat{u}_\mathrm{g} ^{a}(\boldsymbol{k}_1\,|\,m)\,\hat{u}_\mathrm{g} ^{b}(\boldsymbol{k}_2\,|\,m)\;. \end{aligned}$ (A.18)

For mixed pairs, a ≠ b, an analogous calculation gives

$\begin{matrix} _{1} B_{gg δ}^{ab} (k_{1}, k_{2}, k_{3}) & = \frac{1}{{\bar{n}}_{g}^{a} {\bar{n}}_{g}^{b} \bar{ρ}} \int_{0}^{\infty} m n (m) m \hat{u} (k_{3} | m) [⟨ N_{cen}^{a} N_{cen}^{b} | m ⟩ + ⟨ N_{cen}^{a} N_{sat}^{b} | m ⟩ {\hat{u}}_{g}^{b} (k_{2} | m) \end{matrix}$ $\begin{aligned} _{1} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)&= \frac{1}{\bar{n}^a_\mathrm{g} \,\bar{n}^b_\mathrm{g} \, \bar{\rho }}\, \int _0^\infty {m}\; n(m)\, m\, \hat{u}(\boldsymbol{k}_3\,|\,m) \Big [\langle {N_{\mathrm{cen} }^{a}\,N_{\mathrm{cen} }^{b}\,|\,m}{\rangle } + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{b}\,|\,m}{\rangle } \, \hat{u}_\mathrm{g} ^b(\boldsymbol{k}_2\,|\,m) \end{aligned}$ (A.19)

$\begin{matrix} + ⟨ N_{cen}^{b} N_{sat}^{a} | m ⟩ {\hat{u}}_{g}^{a} (k_{1} | m) + ⟨ N_{sat}^{a} N_{sat}^{b} | m ⟩ {\hat{u}}_{g}^{a} (k_{1} | m) {\hat{u}}_{g}^{b} (k_{2} | m)] \\ = \frac{1}{{\bar{n}}_{g}^{a} {\bar{n}}_{g}^{b} \bar{ρ}} \int_{0}^{\infty} m n (m) m \hat{u} (k_{3} | m) G^{ab} (k_{1}, k_{2} | m) . \end{matrix}$ $\begin{aligned}&\nonumber \quad \quad + \langle {N_{\mathrm{cen} }^{b}\, N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } \, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m) + \langle {N_{\mathrm{sat} }^{a}\; N_{\mathrm{sat} }^{b}\,|\, m}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m)\, \hat{u}_\mathrm{g} ^b(\boldsymbol{k}_2\,|\,m)\Big ]\;\\&= \frac{1}{\bar{n}^a_\mathrm{g} \,\bar{n}^b_\mathrm{g} \, \bar{\rho }}\, \int _0^\infty {m}\; n(m)\, m\, \hat{u}(\boldsymbol{k}_3\,|\,m)\, G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\, m)\;. \end{aligned}$ (A.20)

In the main text, we denote ₁ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂, k₃) as ₁ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂) for brevity.

A.2. 2-halo term

The 2-halo term ₂ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ in Eq. (A.7) contains the contributions from i = j ≠ k, i = k ≠ j, and k = j ≠ i. For unmixed pairs, a = b, the 2-halo term is given by

$\begin{matrix} {(2 π)}_{2}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{H} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i, j = 1, j \neq i}^{H} \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} n (m_{1}) \dots n (m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} {\sum_{N_{cen, h}^{a} = 0}^{\infty} \sum_{N_{sat, h}^{a} = 0}^{\infty} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} [\int [3] Δ x_{hv}^{a} u_{g}^{a} (Δ x_{hv}^{a} | m_{h})]} \\ \times {m_{j} \hat{u} (k_{3} | m_{j}) e^{- i k_{3} \cdot x_{j}} [N_{cen, i}^{a} (N_{cen, i}^{a} - 1) e^{- i (k_{1} + k_{2}) \cdot x_{i}} + N_{cen, i}^{a} \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} \cdot (x_{i} + Δ x_{il}^{a}) - i k_{2} \cdot x_{i}} + N_{cen, i}^{a} \sum_{m = 1}^{N_{sat, i}^{a}} e^{- i k_{2} \cdot (x_{i} + Δ x_{im}^{a}) - i k_{1} \cdot x_{i}} \\ + \sum_{l = 1}^{N_{sat, i}^{a}} \sum_{l \neq m} e^{- i k_{1} \cdot (x_{i} + Δ x_{il}^{a}) - i k_{2} \cdot (x_{i} + Δ x_{im}^{a})}] \\ + m_{i} \hat{u} (k_{3} | m_{i}) e^{- i k_{3} \cdot x_{i}} (N_{cen, i}^{a} e^{- i k_{1} \cdot x_{i}} + \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} (x_{i} + Δ x_{il}^{a})}) (N_{cen, j}^{a} e^{i k_{2} \cdot x_{j}} + \sum_{m = 1}^{N_{sat, j}^{a}} e^{i k_{2} (x_{j} + Δ x_{jm}^{a})}) \\ + m_{i} \hat{u} (k_{3} | m_{i}) e^{- i k_{3} \cdot x_{i}} (N_{cen, j}^{a} e^{- i k_{1} \cdot x_{j}} + \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} (x_{i} + Δ x_{jl}^{a})}) (N_{cen, i}^{a} e^{i k_{2} \cdot x_{i}} + \sum_{m = 1}^{N_{sat, i}^{a}} e^{i k_{2} (x_{i} + Δ x_{im}^{a})})} . \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{2} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^H\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{i,j=1, j\ne i}^H \int _0^\infty {m_1}\; \dots \int _0^\infty {m_H}\; n(m_1)\dots n(m_H) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times \prod _{h=1}^{H}\Biggl \{ \sum _{N_{\mathrm{cen} , h}^{a}=0}^\infty \sum _{N_{\mathrm{sat} , h}^{a}=0}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}\,|\,m_h)}\, \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ \int [3]{\Delta {x}^{a}_{hv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right] \Biggr \}\\&\nonumber \quad \times \Biggl \{ m_j\, \hat{u}(\boldsymbol{k}_3\; | \; m_j)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_j} \Big [ N_{\mathrm{cen} , i}^{a}\,(N_{\mathrm{cen} , i}^{a}-1)\,\mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\cdot \boldsymbol{x}_i} + N_{\mathrm{cen} , i}^{a}\sum _{l=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot (\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{il})-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_i} + N_{\mathrm{cen} , i}^{a}\sum _{m=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_2\cdot (\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{im})-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i}\\&\nonumber \quad \quad \quad + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}}\sum _{l\ne m} \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot (\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{il})-{i}\boldsymbol{k}_2\cdot (\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{im})}\Big ]\\&\nonumber \quad \quad + m_i\, \hat{u}(\boldsymbol{k}_3\; | \; m_i)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_i} \Big (N_{\mathrm{cen} , i}^{a}\, \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_1(\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{il})}\Big ) \Big (N_{\mathrm{cen} , j}^{a}\, \mathrm{e} ^{{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j} + \sum _{m=1}^{N_{\mathrm{sat} , j}^{a}}\mathrm{e} ^{{i}\boldsymbol{k}_2(\boldsymbol{x}_j + \Delta \boldsymbol{x}^a_{jm})}\Big ) \\&\nonumber \quad \quad + m_i\, \hat{u}(\boldsymbol{k}_3\; | \; m_i)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_i} \Big (N_{\mathrm{cen} , j}^{a}\, \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_j} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_1(\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{jl})}\Big ) \Big (N_{\mathrm{cen} , i}^{a}\, \mathrm{e} ^{{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_i} + \sum _{m=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{{i}\boldsymbol{k}_2(\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{im})}\Big )\Biggr \}\;. \end{aligned}$ (A.21)

We now evaluate all m-integrals independent of m_i and m_j (these are H(H − 1) terms in total), use Eq. (A.6), and evaluate the Δx integrals over the halo profiles $u_{g}^{a}$ $u_\mathrm{g}^a$ . This leads to

$\begin{matrix} {(2 π)}_{2}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{2} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{i = 1, j \neq i}^{H} \int_{0}^{\infty} m_{i} \int_{0}^{\infty} m_{j} n (m_{i}) n (m_{j}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times {m_{j} \hat{u} (k_{3} | m_{j}) e^{- i (k_{1} + k_{2}) \cdot x_{i} - i k_{3} \cdot x_{j}} [⟨ N_{cen}^{a} (N_{cen}^{a} - 1) | m_{i} ⟩ + ⟨ N_{cen}^{a} N_{sat}^{a} | m_{i} ⟩ ({\hat{u}}_{g}^{a} (k_{1} | m_{i}) + {\hat{u}}_{g}^{a} (k_{2} | m_{i})) + ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m_{i} ⟩] \\ + m_{i} \hat{u} (k_{3} | m_{i}) e^{- i (k_{1} + k_{3}) \cdot x_{i} - i k_{2} \cdot x_{j}} [⟨ N_{cen}^{a} | m_{i} ⟩ + ⟨ N_{sat}^{a} | m_{i} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{i})] [⟨ N_{cen}^{a} | m_{j} ⟩ + ⟨ N_{sat}^{a} | m_{j} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{j})] \\ + m_{i} \hat{u} (k_{3} | m_{i}) e^{- i (k_{2} + k_{3}) \cdot x_{i} - i k_{1} \cdot x_{j}} [⟨ N_{cen}^{a} | m_{i} ⟩ + ⟨ N_{sat}^{a} | m_{i} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{i})] [⟨ N_{cen}^{a} | m_{j} ⟩ + ⟨ N_{sat}^{a} | m_{j} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{j})]} . \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{2} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^2\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{i=1, j\ne i}^H \int _0^\infty {m_i}\;\int _0^\infty {m_j}\; n(m_i)\, n(m_j) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times \Biggl \{ m_j\, \hat{u}(\boldsymbol{k}_3\; | \; m_j)\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\cdot \boldsymbol{x}_i-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_j} \Big [ \langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{a}-1)\,|\,m_i}{\rangle } + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{a}\,|\, m_i}{\rangle }\left(\hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_i) + \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_i)\right) + \langle {N_{\mathrm{sat} }^{a}\, (N_{\mathrm{sat} }^{a}-1)\,|\,m_i}{\rangle }\Big ]\\&\nonumber \quad \quad + m_i\, \hat{u}(\boldsymbol{k}_3\; | \; m_i)\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_3)\cdot \boldsymbol{x}_i - {i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j}\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_i}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_i}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_i)\Big ]\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_j}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_j}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_j)\Big ] \\&\nonumber \quad \quad + m_i\, \hat{u}(\boldsymbol{k}_3\; | \; m_i)\, \mathrm{e} ^{-{i}(\boldsymbol{k}_2+\boldsymbol{k}_3)\cdot \boldsymbol{x}_i - {i}\boldsymbol{k}_1\cdot \boldsymbol{x}_j}\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_i}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_i}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_i)\Big ]\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_j}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_j}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_j)\Big ] \Biggr \} \;. \end{aligned}$ (A.22)

In the limit of an infinite volume V, Eq. (A.14), the halo two-point correlation function ξ_h and its Fourier transform P_h, gives

$\begin{matrix} \int_{V} [3] x_{1} \dots [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) e^{- i (k_{1} + k_{2}) x_{i} - i k_{3} \cdot x_{j}} \\ = \int_{V} [3] x_{i} \int_{V} [3] x_{j} \frac{1}{V^{2}} [1 + ξ_{h} (| x_{i} - x_{j} | | m_{1}, m_{2})] e^{- i (k_{1} + k_{2}) x_{i} - i k_{3} \cdot x_{j}} \end{matrix}$ $\begin{aligned}&\nonumber \int _V [3]{x_1} \, \dots \, [3]{x_H}\; {P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})}\, \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\, \boldsymbol{x}_i -{i}\boldsymbol{k}_3 \cdot \boldsymbol{x}_j} \\&= \int _V [3]{x_i} \int _V [3]{x_j}\; \frac{1}{V^2} \left[1 + \xi _\mathrm{h} (|\boldsymbol{x}_i- \boldsymbol{x}_j|\,|\,m_1, m_2)\right] \, \mathrm{e} ^{-{i}(\boldsymbol{k}_1+\boldsymbol{k}_2)\, \boldsymbol{x}_i -{i}\boldsymbol{k}_3 \cdot \boldsymbol{x}_j}\end{aligned}$ (A.23)

$\begin{matrix} = \frac{{\bar{n}}_{h}^{2}}{H^{2}} [{(2 π)}^{6} δ_{D} (k_{1} + k_{2}) δ_{D} (k_{3}) + {(2 π)}^{3} δ_{D} (k_{1} + k_{2} + k_{3}) P_{h} (| k_{1} + k_{2} | | m_{1}, m_{2})], \end{matrix}$ $\begin{aligned}&=\frac{\bar{n}_\mathrm{h} ^2}{H^2}\, \left[(2\pi )^6\,\delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2)\, \delta _{\rm D}(\boldsymbol{k}_3) + (2\pi )^3\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\, P_\mathrm{h} (|\boldsymbol{k}_1+\boldsymbol{k}_2|\,|\,m_1, m_2)\right]\;, \end{aligned}$ (A.24)

With this expression, neglecting unconnected terms proportional to δ_D(k₁) and δ_D(k₂) and the approximation H(H − 1)/H² ≃ 1, we find

$\begin{matrix} _{2} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) \\ ≃ \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} n (m_{1}) n (m_{2}) {m_{2} \hat{u} (k_{3} | m_{2}) P_{h} (k_{3} | m_{1}, m_{2}) [⟨ N_{cen}^{a} (N_{cen}^{a} - 1) | m_{1} ⟩ \end{matrix}$ $\begin{aligned}&\nonumber _{2} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \\&\simeq \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \, \int _0^\infty {m_1}\;\int _0^\infty {m_2}\; n(m_1)\, n(m_2) \, \Biggl \{ m_2\, \hat{u}(\boldsymbol{k}_3\; | \; m_2)\, P_\mathrm{h} (\boldsymbol{k}_3\,|\,m_1, m_2) \Big [ \langle {N_{\mathrm{cen} }^{a}\,(N_{\mathrm{cen} }^{a}-1)\,|\,m_1}{\rangle } \end{aligned}$ (A.25)

$\begin{matrix} + ⟨ N_{cen}^{a} N_{sat}^{a} | m_{1} ⟩ ({\hat{u}}_{g}^{a} (k_{1} | m_{1}) + {\hat{u}}_{g}^{a} (k_{2} | m_{1})) + ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m_{1} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{1}) {\hat{u}}_{g}^{a} (k_{2} | m_{1})] \\ + m_{1} \hat{u} (k_{3} | m_{1}) [⟨ N_{cen}^{a} | m_{1} ⟩ + ⟨ N_{sat}^{a} | m_{1} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{1})] [⟨ N_{cen}^{a} | m_{2} ⟩ + ⟨ N_{sat}^{a} | m_{2} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{2})] \\ + m_{1} \hat{u} (k_{3} | m_{1}) [⟨ N_{cen}^{a} | m_{1} ⟩ + ⟨ N_{sat}^{a} | m_{1} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{1})] [⟨ N_{cen}^{a} | m_{2} ⟩ + ⟨ N_{sat}^{a} | m_{2} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{2})]} \\ = \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} n (m_{1}) n (m_{2}) \\ \times [m_{2} \hat{u} (k_{3} | m_{2}) P_{h} (k_{3} | m_{1}, m_{2}) G^{aa} (k_{1}, k_{2} | m_{1}) + m_{1} \hat{u} (k_{3} | m_{1}) G^{a} (k_{1} | m_{1}) G^{a} (k_{2} | m_{2}) + m_{1} \hat{u} (k_{3} | m_{1}) G^{a} (k_{1} | m_{2}) G^{a} (k_{2} | m_{1})], \end{matrix}$ $\begin{aligned}&\nonumber \quad \quad \quad + \langle {N_{\mathrm{cen} }^{a}\, N_{\mathrm{sat} }^{a}\,|\, m_1}{\rangle }\left(\hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_1) + \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_1)\right) + \langle {N_{\mathrm{sat} }^{a}\, (N_{\mathrm{sat} }^{a}-1)\,|\,m_1}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_1)\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_1)\Big ]\\&\nonumber \quad \quad + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_1}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_1}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_1)\Big ]\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_2}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_2}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_2)\Big ] \\&\nonumber \quad \quad + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_1}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_1}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_1)\Big ]\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_2}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_2}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_2)\Big ] \Biggr \}\\&= \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }}\, \int _0^\infty {m_1}\;\int _0^\infty {m_2}\; n(m_1)\, n(m_2) \\&\nonumber \quad \times \Big [ m_2\, \hat{u}(\boldsymbol{k}_3\; | \; m_2)\, P_\mathrm{h} (\boldsymbol{k}_3\,|\,m_1, m_2)\, G^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\, m_1) + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, G^a(\boldsymbol{k}_1\,|\, m_1)\, G^a(\boldsymbol{k}_2\,|\,m_2) + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, G^a(\boldsymbol{k}_1\,|\, m_2)\, G^a(\boldsymbol{k}_2\,|\,m_1) \Big ]\;, \end{aligned}$ (A.26)

where again k₃ = −k₁ − k₂ and

$\begin{matrix} G^{a} (k | m) = ⟨ N_{cen}^{a} | m ⟩ + ⟨ N_{sat}^{a} | m ⟩ {\hat{u}}_{g}^{a} (k | m) . \end{matrix}$ $\begin{aligned} G^{a}(\boldsymbol{k}\,|\,m) = \langle {N_{\mathrm{cen} }^{a}\,|\,m}{\rangle }+ \langle {N_{\mathrm{sat} }^{a}\,|\,m}{\rangle } \hat{u}^a_\mathrm{g} (\boldsymbol{k}\,|\,m)\;. \end{aligned}$ (A.27)

For mixed pairs, a ≠ b, a similar calculation yields

$\begin{matrix} _{2} B_{gg δ}^{ab} (k_{1}, k_{2}, k_{3}) \\ ≃ \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} n (m_{1}) n (m_{2}) \\ \times [m_{2} \hat{u} (k_{3} | m_{2}) P_{h} (k_{3} | m_{1}, m_{2}) G^{ab} (k_{1}, k_{2} | m_{1}) + m_{1} \hat{u} (k_{3} | m_{1}) G^{a} (k_{1} | m_{1}) G^{b} (k_{2} | m_{2}) + m_{1} \hat{u} (k_{3} | m_{1}) G^{a} (k_{1} | m_{2}) G^{b} (k_{2} | m_{1})] . \end{matrix}$ $\begin{aligned}&\nonumber _{2} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\\&\simeq \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \, \int _0^\infty {m_1}\;\int _0^\infty {m_2}\; n(m_1)\, n(m_2) \\&\nonumber \quad \times \Big [ m_2\, \hat{u}(\boldsymbol{k}_3\; | \; m_2)\, P_\mathrm{h} (\boldsymbol{k}_3\,|\,m_1, m_2)\, G^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2\,|\, m_1) + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, G^a(\boldsymbol{k}_1\,|\, m_1)\, G^b(\boldsymbol{k}_2\,|\,m_2) + m_1\, \hat{u}(\boldsymbol{k}_3\; | \; m_1)\, G^a(\boldsymbol{k}_1\,|\, m_2)\, G^b(\boldsymbol{k}_2\,|\,m_1) \Big ]\;. \end{aligned}$ (A.28)

In the main text we just denote ₂ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂).

A.3. 3-halo term

The 3-halo term for unmixed pairs a = b is given by

$\begin{matrix} {(2 π)}_{3}^{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) δ_{D} (k_{1} + k_{2} + k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{H} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{\begin{matrix} i, j, k = 1, j \neq i, \\ k \neq i, j \end{matrix}}^{H} \int_{0}^{\infty} m_{1} \dots \int_{0}^{\infty} m_{H} n (m_{1}) \dots n (m_{H}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times \prod_{h = 1}^{H} {\sum_{N_{cen, h}^{a}}^{\infty} \sum_{N_{sat, h}^{a}}^{\infty} P_{N} (N_{cen, h}^{a}, N_{sat, h}^{a} | m_{h}) \prod_{v = 1}^{N_{sat, h}^{a}} [\int [3] Δ x_{hv}^{a} u_{g}^{a} (Δ x_{hv}^{a} | m_{h})]} m_{k} \hat{u} (k_{3} | m_{k}) e^{- i k_{3} \cdot x_{k}} \\ \times [N_{cen, i}^{a} e^{- i k_{1} \cdot x_{i}} + \sum_{l = 1}^{N_{sat, i}^{a}} e^{- i k_{1} (x_{i} + Δ x_{il}^{a})}] [N_{cen, j}^{a} e^{- i k_{2} \cdot x_{j}} + \sum_{m = 1}^{N_{sat, j}^{a}} e^{- i k_{2} (x_{j} + Δ x_{jm}^{a})}] . \end{matrix}$ $\begin{aligned}&\nonumber (2\pi )^3 _{3} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^H\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{\begin{matrix} i,j,k=1, j\ne i,\\ k\ne i,j \end{matrix}}^H \int _0^\infty {m_1}\; \dots \int _0^\infty {m_H}\; n(m_1)\dots n(m_H) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times \prod _{h=1}^{H}\Biggl \{ \sum _{N_{\mathrm{cen} , h}^{a}}^\infty \sum _{N_{\mathrm{sat} , h}^{a}}^\infty {P_\mathrm{N} (N_{\mathrm{cen} , h}^{a}, N_{\mathrm{sat} , h}^{a}\,|\,m_h)}\, \prod _{v=1}^{N_{\mathrm{sat} , h}^{a}} \left[ \int [3]{\Delta {x}^{a}_{hv}}\, u_\mathrm{g} ^{a}(\Delta \boldsymbol{x}^{a}_{hv}\,|\,m_h) \right]\Biggr \}\, m_k\, \hat{u}(\boldsymbol{k}_3\,|\,m_k)\, \mathrm{e} ^{-{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k}\, \\&\nonumber \quad \times \Big [N_{\mathrm{cen} , i}^{a}\, \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i} + \sum _{l=1}^{N_{\mathrm{sat} , i}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_1(\boldsymbol{x}_i + \Delta \boldsymbol{x}^a_{il})}\Big ] \Big [N_{\mathrm{cen} , j}^{a}\, \mathrm{e} ^{-{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j} + \sum _{m=1}^{N_{\mathrm{sat} , j}^{a}}\mathrm{e} ^{-{i}\boldsymbol{k}_2(\boldsymbol{x}_j + \Delta \boldsymbol{x}^a_{jm})}\Big ]\;. \end{aligned}$ (A.29)

We evaluate all m-integrals independent of m_i, m_j, and m_k (H(H − 1)(H − 2) terms in total), and evaluate the Δx integrals. This leads to

$\begin{matrix} _{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) + unconnected terms \\ = \frac{1}{{\bar{n}}_{h}^{3} {({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \sum_{\begin{matrix} i = 1, j \neq i, \\ k \neq i, j \end{matrix}}^{H} \int_{0}^{\infty} m_{i} \int_{0}^{\infty} m_{j} \int_{0}^{\infty} m_{k} n (m_{i}) n (m_{j}) n (m_{k}) \int_{V} [3] x_{1} \dots \int_{V} [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) \\ \times m_{k} \hat{u} (k_{3} | m_{k}) [⟨ N_{cen}^{a} | m_{i} ⟩ + ⟨ N_{sat}^{a} | m_{i} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{i})] [⟨ N_{cen}^{a} | m_{j} ⟩ + ⟨ N_{sat}^{a} | m_{j} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{j})] e^{- i k_{1} \cdot x_{i} - i k_{2} \cdot x_{j} - i k_{3} \cdot x_{k}} . \end{matrix}$ $\begin{aligned}&\nonumber _{3} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3) + \text{ unconnected} \text{ terms}\\&= \frac{1}{\bar{n}_\mathrm{h} ^3\,(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \sum _{\begin{matrix} i=1, j\ne i,\\ k\ne i,j \end{matrix}}^H \int _0^\infty {m_i} \int _0^\infty {m_j}\int _0^\infty {m_k}\; n(m_i)\, n(m_j)\, n(m_k) \, \int _V [3]{x_1}\; \dots \int _V [3]{x_{H}}\; P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H}) \\&\nonumber \quad \times m_k\, \hat{u}(\boldsymbol{k}_3\,|\,m_k)\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_i}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_i}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_i)\Big ] \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_j}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_j}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_j)\Big ]\, \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i -{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j -{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k} \;. \end{aligned}$ (A.30)

Now, again in the limit of an infinite volume V, Eq. (A.14), we use the two- and three-point functions ξ_h and ζ_h of halo clustering and their Fourier transforms P_h and B_h to find

$\begin{matrix} \int_{V} [3] x_{1} \dots [3] x_{H} P_{c} (x_{1}, \dots, x_{H} | m_{1}, \dots, m_{H}) e^{- i k_{1} \cdot x_{i} - i k_{2} \cdot x_{j} - i k_{3} \cdot x_{k}} \\ = \int_{V} [3] x_{i} \int_{V} [3] x_{j} \int_{V} [3] x_{k} \frac{e^{- i k_{1} \cdot x_{i} - i k_{2} \cdot x_{j} - i k_{3} \cdot x_{k}}}{V^{3}} [1 + ξ_{h} (| x_{i} - x_{j} | | m_{1}, m_{2}) + ξ_{h} (| x_{i} - x_{k} | | m_{1}, m_{3}) + ξ_{h} (| x_{j} - x_{k} | | m_{2}, m_{3}) \end{matrix}$ $\begin{aligned}&\nonumber \int _V [3]{x_1} \, \dots \, [3]{x_H}\; {P_{\rm c}(\boldsymbol{x}_1, \dots , \boldsymbol{x}_{H}\,|\, m_1, \dots , m_{H})}\, \mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i -{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j -{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k}\\&= \int _V [3]{x_i} \int _V [3]{x_j} \int _V [3]{x_k}\; \frac{\mathrm{e} ^{-{i}\boldsymbol{k}_1\cdot \boldsymbol{x}_i -{i}\boldsymbol{k}_2\cdot \boldsymbol{x}_j -{i}\boldsymbol{k}_3\cdot \boldsymbol{x}_k}}{V^3} \left[1 + \xi _\mathrm{h} (|\boldsymbol{x}_i- \boldsymbol{x}_j| \,|\, m_1, m_2) + \xi _\mathrm{h} (|\boldsymbol{x}_i- \boldsymbol{x}_k| \,|\, m_1, m_3) + \xi _\mathrm{h} (|\boldsymbol{x}_j- \boldsymbol{x}_k| \,|\, m_2, m_3)\right.\end{aligned}$ (A.31)

$\begin{matrix} + ζ_{h} (| x_{i} - x_{j} |, | x_{i} - x_{k} |, | x_{j} - x_{k} | | m_{1}, m_{2}, m_{3})] \\ = \frac{{\bar{n}}_{h}^{3}}{H^{3}} [{(2 π)}^{9} δ_{D} (k_{1}) δ_{D} (k_{2}) δ_{D} (k_{3}) + {(2 π)}^{6} δ_{D} (k_{2}) δ_{D} (k_{1} + k_{3}) P_{h} (| k_{1} + k_{3} | | m_{1}, m_{3}) \\ + {(2 π)}^{6} δ_{D} (k_{3}) δ_{D} (k_{1} + k_{2}) P_{h} (| k_{1} + k_{2} | | m_{1}, m_{2}) + {(2 π)}^{6} δ_{D} (k_{1}) δ_{D} (k_{2} + k_{3}) P_{h} (| k_{2} + k_{3} | | m_{2}, m_{3}) \\ + {(2 π)}^{3} δ_{D} (k_{1} + k_{2} + k_{3}) B_{h} (k_{1}, k_{2}, k_{3} | m_{1}, m_{2}, m_{3})], \end{matrix}$ $\begin{aligned}&\nonumber \quad \quad \left. + \zeta _\mathrm{h} (|\boldsymbol{x}_i-\boldsymbol{x}_j|, |\boldsymbol{x}_i-\boldsymbol{x}_k| , |\boldsymbol{x}_j-\boldsymbol{x}_k| \,|\, m_1, m_2, m_3)\right] \\&= \frac{\bar{n}_\mathrm{h} ^3}{H^3}\, \big [(2\pi )^9\,\delta _{\rm D}(\boldsymbol{k}_1)\, \delta _{\rm D}(\boldsymbol{k}_2)\, \delta _{\rm D}(\boldsymbol{k}_3) + (2\pi )^6\,\delta _{\rm D}(\boldsymbol{k}_2)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_3)\, P_\mathrm{h} (|\,\boldsymbol{k}_1+\boldsymbol{k}_3\,|\, \,|\, m_1, m_3) \\&\quad \quad \nonumber + (2\pi )^6\,\delta _{\rm D}(\boldsymbol{k}_3)\, \delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2)\, P_\mathrm{h} (|\,\boldsymbol{k}_1+\boldsymbol{k}_2\,|\, \,|\, m_1, m_2) + (2\pi )^6\,\delta _{\rm D}(\boldsymbol{k}_1)\, \delta _{\rm D}(\boldsymbol{k}_2+\boldsymbol{k}_3)\, P_\mathrm{h} (|\,\boldsymbol{k}_2+\boldsymbol{k}_3\,|\, \,|\, m_2, m_3)\\&\quad \quad \nonumber + (2\pi )^3\,\delta _{\rm D}(\boldsymbol{k}_1+\boldsymbol{k}_2+\boldsymbol{k}_3)\, B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3 \,|\, m_1, m_2, m_3)\big ]\;, \end{aligned}$ (A.32)

As before, all terms not proportional to δ_D(k₁ + k₂ + k₃) are unconnected and therefore neglected. Using this expression and the approximation H(H − 1)(H − 2)/H³ ≃ 1, we obtain

$\begin{matrix} _{3} B_{gg δ}^{aa} (k_{1}, k_{2}, k_{3}) \\ ≃ \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} \int_{0}^{\infty} m_{3} n (m_{1}) n (m_{2}) n (m_{3}) B_{h} (k_{1}, k_{2}, k_{3} | m_{1}, m_{2}, m_{3}) m_{3} \hat{u} (k_{3} | m_{3}) \end{matrix}$ $\begin{aligned}&\nonumber _{3} B _{\mathrm{gg} \delta }^{aa}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3) \\&\simeq \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \, \int _0^\infty {m_1}\int _0^\infty {m_2}\int _0^\infty {m_3}\; n(m_1)\, n(m_2)\, n(m_3)\, B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3\,|\, m_1, m_2, m_3)\, m_3\, \hat{u}(\boldsymbol{k}_3\; | \; m_3) \end{aligned}$ (A.33)

$\begin{matrix} \times [⟨ N_{cen}^{a} | m_{1} ⟩ + ⟨ N_{sat}^{a} | m_{1} ⟩ {\hat{u}}_{g}^{a} (k_{1} | m_{1})] [⟨ N_{cen}^{a} | m_{2} ⟩ + ⟨ N_{sat}^{a} | m_{2} ⟩ {\hat{u}}_{g}^{a} (k_{2} | m_{2})] \\ = \frac{1}{{({\bar{n}}_{g}^{a})}^{2} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} \int_{0}^{\infty} m_{3} n (m_{1}) n (m_{2}) n (m_{3}) B_{h} (k_{1}, k_{2}, k_{3} | m_{1}, m_{2}, m_{3}) m_{3} \hat{u} (k_{3} | m_{3}) G^{a} (k_{1} | m_{1}) G^{a} (k_{2} | m_{2}) . \end{matrix}$ $\begin{aligned}&\nonumber \quad \times \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_1}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_1}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_1\,|\,m_1)\Big ]\, \Big [\langle {N_{\mathrm{cen} }^{a}\,|\,m_2}{\rangle } + \langle {N_{\mathrm{sat} }^{a}\,|\,m_2}{\rangle }\, \hat{u}_\mathrm{g} ^a(\boldsymbol{k}_2\,|\,m_2)\Big ] \\&= \frac{1}{(\bar{n}^a_\mathrm{g} )^2\, \bar{\rho }} \, \int _0^\infty {m_1}\int _0^\infty {m_2}\int _0^\infty {m_3}\; n(m_1)\, n(m_2)\, n(m_3)\, B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3\,|\, m_1, m_2, m_3)\, m_3\, \hat{u}(\boldsymbol{k}_3\; | \; m_3)\, G^a(\boldsymbol{k}_1\,|\,m_1) \, G^a(\boldsymbol{k}_2\,|\,m_2)\;. \end{aligned}$ (A.34)

For mixed pairs a ≠ b a similar calculation yields

$\begin{matrix} _{3} B_{gg δ}^{ab} (k_{1}, k_{2}, k_{3}) \\ ≃ \frac{1}{{\bar{n}}_{g}^{a} {\bar{n}}_{g}^{b} \bar{ρ}} \int_{0}^{\infty} m_{1} \int_{0}^{\infty} m_{2} \int_{0}^{\infty} m_{3} n (m_{1}) n (m_{2}) n (m_{3}) B_{h} (k_{1}, k_{2}, k_{3} | m_{1}, m_{2}, m_{3}) m_{3} \hat{u} (k_{3} | m_{3}) G^{a} (k_{1} | m_{1}) G^{b} (k_{2} | m_{2}) . \end{matrix}$ $\begin{aligned}&\nonumber _{3} B _{\mathrm{gg} \delta }^{ab}(\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3) \\&\simeq \frac{1}{\bar{n}^a_\mathrm{g} \,\bar{n}^b_\mathrm{g} \, \bar{\rho }} \, \int _0^\infty {m_1}\int _0^\infty {m_2}\int _0^\infty {m_3}\; n(m_1)\, n(m_2)\, n(m_3)\, B_\mathrm{h} (\boldsymbol{k}_1, \boldsymbol{k}_2, \boldsymbol{k}_3\,|\, m_1, m_2, m_3)\, m_3\, \hat{u}(\boldsymbol{k}_3\; | \; m_3)\, G^a(\boldsymbol{k}_1\,|\,m_1) \, G^b(\boldsymbol{k}_2\,|\,m_2)\;. \end{aligned}$ (A.35)

Again, we just denote ₃ $B_{gg δ}^{a b}$ ${{\mathit{B}^{ab}_{\mathrm{gg}\delta}}}$ (k₁, k₂) in the main text.

Appendix B: Poissonianity of satellite galaxies

Our HOD model explicitly assumes Poisson satellites – satellite numbers inside halos that vary according to a Poisson statistic. In this section, we explore the bias due to this assumption in the presence of non-Poisson satellites. To some degree, deviations from Poisson satellites are indeed visible in our SAM galaxies: Figure B.1 shows the ratio of σ( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ |m) to the Poisson variance $\sqrt{⟨ N_{sat}^{a} | m ⟩}$ $\sqrt{\langle{{N_{\text{sat}}^{a}}|m}{\rangle}}$ for different samples of the simulated galaxies. Poisson satellites, where this ratio is unity, are present in the intermediate halo-mass range, such as between 2 × 10¹¹ M_⊙ ≲ m ≲ 10¹² M_⊙ for galaxies from stellar-mass bin m2. On the low-mass end, satellites are distributed sub-Poissonian (ratio is below unity), and on the high-mass end, they are super-Poissonian (ratio exceeds unity). The trend is similar across all galaxy populations, although with a shift of the intermediate range depending on the stellar mass of the sample.

Fig. B.1.

Ratio of the standard deviation of the galaxy satellite number per halo and the Poisson variance. For Poissonian satellites this ratio is unity. Left: Ratio for all simulated galaxies (black crosses), red simulated galaxies (red crosses), and blue simulated galaxies (blue dots). Right: Ratio for stellar mass-selected samples.

We estimate the bias induced by the mismatch between the true satellite variance and the model assumption by computing the fractional change of ⟨𝒩^a𝒩^bℳ⟩_mod when switching from Poisson satellites to the true variance in the SAM. This means: For each best fit to the simulated G3L signal, we calculate new model predictions ⟨𝒩^a𝒩^bℳ⟩_mod, using the HOD parameters from the best fit but updating the number of satellite pairs inside a halo to

$\begin{matrix} ⟨ N_{sat}^{a} (N_{sat}^{a} - 1) | m ⟩ = (1 - \frac{σ^{2} (N_{sat}^{a} | m)}{⟨ N_{sat}^{a} | m ⟩}) ⟨ N_{sat}^{a} | m ⟩ + ⟨ N_{sat}^{a} | m ⟩^{2}, \end{matrix}$ $\begin{aligned} \langle {N_{\mathrm{sat} }^{a}(N_{\mathrm{sat} }^{a}-1)|m}{\rangle } = \left(1-\frac{\sigma ^2(N_{\mathrm{sat} }^{a}|m)}{\langle {N_{\mathrm{sat} }^{a}|m}{\rangle }}\right) \langle {N_{\mathrm{sat} }^{a}|m}{\rangle } + \langle {N_{\mathrm{sat} }^{a}|m}{\rangle }^2\;, \end{aligned}$ (B.1)

where σ²( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ |m) is the true satellite variance in the simulation. Clearly, for $σ (N_{sat}^{a} | m) = \sqrt{⟨ N_{sat}^{a} | m ⟩}$ $\sigma({N_{\text{sat}}^{a}}|m) = \sqrt{\langle{{N_{\text{sat}}^{a}}|m}{\rangle}}$ , that is, Poisson satellites, this reduces to Eq. (39).

Figure B.2 shows the fractional change between the updated model and the original best fit with Poisson satellites for the red and blue galaxies. Also shown is the fractional difference of the measured aperture statistics to the best fit. For red galaxies, the updated model differs from the original strongest at 4′ by roughly 50%. For blue galaxies, the difference is greatest at 2′ and has a similar magnitude. It is to be expected that the aperture statistics at scales between 1′and 10′are most affected because the 1-halo term, containing ⟨ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ ( $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ − 1)|m⟩, dominates here. Smaller scales are dominated by the halo term containing ⟨ $N_{cen}^{a}$ ${N_{\mathrm{cen}}^{a}}$ $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ |m⟩, whereas larger scales are dominated by the 3-halo term. Although a bias is visible by the solid line in Fig. B.2, it is of the order of or smaller than the uncertainties on the aperture statistics measurements (error bars). In particular, for blue galaxies, the measurements cannot discriminate between the two models. Therefore, while the presence of non-Poisson satellites affects the model prediction, a Poisson model is accurate enough for the G3L analysis of measurements in this work.

Fig. B.2.

Fractional difference of aperture statistics measurement ⟨𝒩^a𝒩^bℳ⟩_meas (points) and modified aperture statistics model ⟨𝒩^a𝒩^bℳ⟩_mod(lines) to the original model ⟨𝒩^a𝒩^bℳ⟩ for red-red lens pairs (left) and blue-blue lens pairs (right) in the MS. The original model assumes a Poissonian satellite distribution, while the modified model uses the directly measured distribution of satellite galaxies.

We have repeated this test also for the satellite galaxies from the stellar mass-selected samples. Galaxies from samples m4 and m5 are closer to a Poisson satellite model than blue galaxies and therefore show less bias (below 10%). For the other samples, the bias is larger (up to 61% for m1, 57% for m2, 19% for m3 satellites), but since the measured aperture statistics have larger uncertainties, the bias is of even less relevance than for red and blue galaxies.

Finally, a deviation of the variance in satellite numbers from a Poisson statistic also affects the interpretation of the correlation parameter r^ab, as defined in Eq. (40). For Poisson satellites, r^ab corresponds to a Pearson coefficient, that is, r^ab = 1 for perfectly correlated $N_{sat}^{a}$ ${N_{\mathrm{sat}}^{a}}$ and $N_{sat}^{b}$ ${N_{\mathrm{sat}}^{b}}$ , r^ab = −1 for perfectly anti-correlated satellite numbers, and r^ab = 0 for uncorrelated samples. However, for a super-Poisson variance in the high-mass regime, our r^ab is larger than the Pearson coefficient. Conversely, for low-mass halos with sub-Poisson variance, r^ab is smaller than the Pearson coefficient. The combination of these trends increases the slope ϵ^ab of our r^ab(m) compared to a Pearson definition of correlation. Nevertheless, the sign of r^ab remains unchanged, and if the samples a and b are uncorrelated, both r^ab and the Pearson coefficient vanish.

Appendix C: Results of model fit to stellar mass-selected galaxies

In Fig C.1 we show the model fits to the G3L aperture statistics from stellar mass-selected lenses in the MS and the χ² of the fits. All fits have 76 d.o.f. (the same as the fit to the colour-selected samples in Sect. 6.1), so a χ² < 97.4 signifies an agreement between fit and data at the 95% CL. The HOD parameters of the best-fitting models are given in Table C.1; the parameters A^ab and ϵ^ab that determine the cross-correlation between satellite numbers together with the χ² and p-values of the fits are given in Table C.2. Figure C.4 shows the model fits to the G3L aperture statistics from stellar mass-selected lenses in KV450× GAMA and the χ² of the fits. The parameters of the best-fitting models are given in Tables C.3 and C.4.

Fig. C.1.

G3L measurement in MS (points) and best-fitting halo model (lines) for stellar mass-selected lens samples, as defined in Table 2. Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. Each row was fitted individually, leading to the χ² values in the last panel. The corresponding halo model parameters are given in Table C.1.

Fig. C.2.

Fig. C.1 continued

Fig. C.3.

Fig. C.2 continued

Fig. C.4.

G3L measurement in KV450 × GAMA (points) and best-fitting halo model (lines) for stellar mass-selected lens samples, as defined in Table 2. Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. Each row was fitted individually, leading to the χ² values in the last panel. The corresponding halo model parameters are given in Table C.3

Fig. C.5.

Fig. C.4 continued

Fig. C.6.

Fig. C.5 continued

Table C.1.

best fit values for halo model parameters for stellar-mass-selected lenses in MS for each stellar mass sample a.

Table C.2.

best fit values of HOD parameters describing satellite number cross-correlation, and χ² and p-values for the G3L halo model fit to MS

Table C.3.

best fit values for halo model parameters for stellar-mass-selected lenses in KV450 × GAMA for each stellar mass sample a.

Table C.4.

best fit values of HOD parameters describing satellite number cross-correlation, and χ² and p-values for the G3L halo model fit to KV450 × GAMA

All Tables

Table 1.

Fiducial values and flat priors of the halo model parameters.

In the text

Table 2.

Selection criteria for lens samples.

In the text

Table 3.

Best-fitting values of halo model parameters for colour-selected lenses and 68% credibility intervals (d.o.f. = 76).

In the text

Table C.1.

best fit values for halo model parameters for stellar-mass-selected lenses in MS for each stellar mass sample a.

In the text

Table C.2.

best fit values of HOD parameters describing satellite number cross-correlation, and χ² and p-values for the G3L halo model fit to MS

In the text

Table C.3.

best fit values for halo model parameters for stellar-mass-selected lenses in KV450 × GAMA for each stellar mass sample a.

In the text

Table C.4.

best fit values of HOD parameters describing satellite number cross-correlation, and χ² and p-values for the G3L halo model fit to KV450 × GAMA

In the text

All Figures

	Fig. 1. Geometry of a G3L configuration with one source and two lens galaxies on the sky; adapted from Schneider & Watts (2005). Lens galaxies are at angular positions θ₁ = θ + ϑ₁ and θ₂ = θ + ϑ₂ on the sky; the source galaxy is at θ. The angle between the source-lens connections is the opening angle ϕ.
In the text

	Fig. 2. Mean per-halo numbers of galaxies for fiducial HOD parameters in Table 1. The solid black line shows the total galaxy number per halo, the dashed red line shows the fraction of halos with central galaxies, and the dotted blue line shows the number of satellite galaxies per halo.
In the text

	Fig. 3. Impact of halo model parameters on ⟨𝒩^a𝒩^aℳ⟩ for unmixed lens pairs. In each panel, only one parameter is varied at a time. Solid lines indicate the total ⟨𝒩^a𝒩^aℳ⟩, while dashed lines show the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term.
In the text

	Fig. 4. Impact of the correlation of satellite numbers on the aperture statistics of mixed lens pairs. Satellite numbers are either fully correlated (r^ab = 1, blue lines), uncorrelated (r^ab = 0, black lines) or anti-correlated (r^ab = −1, red lines). Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term.
In the text

	Fig. 5. Normalised redshift distributions n(z) of GAMA galaxies, selected by colour (left) and stellar mass (right).
In the text

	Fig. 6. Stellar-mass and redshift of GAMA galaxies, divided by colour (left) and stellar mass (right).
In the text

Fig. 7.

G3L measurement (points) and best-fitting halo model (lines) for red and blue galaxies in the MS (upper plot) and the KV450 × GAMA (lower plot). Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. The left panels show the result for red-red galaxy pairs, the central panels for blue-blue galaxy pairs, and the right panels for red-blue mixed pairs. Error bars correspond to the standard deviation estimated from the jackknife resampling detailed in Sect. 5.1. (a) MS. (b) KV450 × GAMA.

In the text

Fig. 8.

Mean per-halo numbers of simulated galaxies (top) and observed galaxies (bottom) as function of halo mass. Red crosses (blue points) indicate the true HOD of simulated red (blue) galaxies, where the error bars are the standard deviation of the mean over the 64 line-of-sights. The lines indicate the per-halo numbers inferred from the fit to the G3L signal for red (solid red) and blue galaxies (dashed blue). The shaded areas are the 68% credibility areas of the halo model fit. (a) MS. (b) KV450 × GAMA.

In the text

Fig. 9.

Correlation parameter r^ab for red and blue galaxies in the simulation and observation as a function of halo mass. Black crosses show the direct estimate for the simulation, where the error bars are the standard deviation of the mean over the 64 line-of-sights. The solid brown line shows the r^ab inferred by the halo model fit to the G3L signal of the MS, and the green dashed line is the result of the fit to the KV450 × GAMA G3L signal. The shaded areas show the 68% credibility bands of the fits.

In the text

	Fig. 10. Correlation matrix for aperture statistics measurement in KV450×GAMA for red and blue galaxy samples. The data vector is ordered as given in Eq. (64), starting with the smallest aperture radius.
In the text

	Fig. 11. Cumulative distribution of p-values of G3L halo model fits for MS (orange, solid) and KV450 × GAMA (green, dashed). For a perfect description of G3L signal and data noise, the distributions would be consistent with a uniform distribution (black, dotted).
In the text

Fig. 12.

Mean per-halo galaxy numbers in the simulation (top) and observation (bottom) for lens galaxies from each stellar mass bin as function of halo mass. Crosses indicate the directly estimated per-halo numbers of simulated galaxies, while lines show the predictions from the G3L fits. The shaded areas indicate the 68% confidence areas. Left panels: the mean per-halo numbers for galaxies from stellar mass bins m1, m3, and m5 obtained from the fits to the G3L signal for m1–m5, m3–m5, and m4–m5. The right panels show the same for galaxies from stellar mass bins m2 and m5, obtained from the fits to the signal for m2–m5 and m4–m5. The corresponding HOD parameters are listed in Table C.1. (a) MS. (b) KV450 × GAMA.

In the text

Fig. 13.

Correlation parameter r^ab for stellar mass-selected galaxies in the MS and in KV450×GAMA. Black crosses show the true correlation for the MS, where the error bars are the standard deviation of the mean over the 64 simulated line-of-sights. The solid brown line shows r^ab inferred from a simulated G3L analysis of MS, and the green dashed line the inference for KV450×GAMA. The shaded areas show the 68% credibility bands of the inferences. Blue points show the correlation parameter of galaxies in the MS selected without assuming a flux limit.

In the text

Fig. 14.

Threshold mass $M_{th}^{a}$ $M_\mathrm{th}^{a}$ measured for the GAMA galaxies as a function of the average stellar mass of each stellar mass bin (Green crosses). We show all four estimates for $M_{th}^{a}$ $M_\mathrm{th}^{a}$ for each sample a, slightly displaced along the y-axis for visibility. Horizontal errors correspond to 68% CI of $M_{th}^{a}$ $M_\mathrm{th}^{a}$ ; vertical errors show the standard deviation of the stellar masses of galaxies within a sample.

In the text

	Fig. B.1. Ratio of the standard deviation of the galaxy satellite number per halo and the Poisson variance. For Poissonian satellites this ratio is unity. Left: Ratio for all simulated galaxies (black crosses), red simulated galaxies (red crosses), and blue simulated galaxies (blue dots). Right: Ratio for stellar mass-selected samples.
In the text

Fig. B.2.

Fractional difference of aperture statistics measurement ⟨𝒩^a𝒩^bℳ⟩_meas (points) and modified aperture statistics model ⟨𝒩^a𝒩^bℳ⟩_mod(lines) to the original model ⟨𝒩^a𝒩^bℳ⟩ for red-red lens pairs (left) and blue-blue lens pairs (right) in the MS. The original model assumes a Poissonian satellite distribution, while the modified model uses the directly measured distribution of satellite galaxies.

In the text

Fig. C.1.

G3L measurement in MS (points) and best-fitting halo model (lines) for stellar mass-selected lens samples, as defined in Table 2. Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. Each row was fitted individually, leading to the χ² values in the last panel. The corresponding halo model parameters are given in Table C.1.

In the text

	Fig. C.2. Fig. C.1 continued
In the text

	Fig. C.3. Fig. C.2 continued
In the text

Fig. C.4.

G3L measurement in KV450 × GAMA (points) and best-fitting halo model (lines) for stellar mass-selected lens samples, as defined in Table 2. Solid lines indicate the total aperture statistics, dashed lines the 1-halo, dotted lines the 2-halo, and dash-dotted lines the 3-halo term of the fit. Each row was fitted individually, leading to the χ² values in the last panel. The corresponding halo model parameters are given in Table C.3

In the text

	Fig. C.5. Fig. C.4 continued
In the text

	Fig. C.6. Fig. C.5 continued
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Anderson, T. W. 2003, An Introduction to Multivariate Statistical Analysis (Wiley-Interscience) [Google Scholar]

[2] Assassi, V., Simonović, M., & Zaldarriaga, M. 2017, J. Cosmol. Astropart. Phys., 2017, 054 [CrossRef] [Google Scholar]

[3] Avila, S., Crocce, M., Ross, A. J., et al. 2018, MNRAS, 479, 94 [NASA ADS] [CrossRef] [Google Scholar]

[4] Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [Google Scholar]

[5] Berlind, A. A., & Weinberg, D. H. 2002, ApJ, 575, 587 [Google Scholar]

[6] Berlind, A. A., Weinberg, D. H., Benson, A. J., et al. 2003, ApJ, 593, 1 [NASA ADS] [CrossRef] [Google Scholar]

[7] Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [Google Scholar]

[8] Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000 [NASA ADS] [CrossRef] [Google Scholar]

[9] Bullock, J. S., Kolatt, T. S., Sigad, Y., et al. 2001, MNRAS, 321, 559 [Google Scholar]

[10] Cacciato, M., Lahav, O., van den Bosch, F. C., Hoekstra, H., & Dekel, A. 2012, MNRAS, 426, 566 [Google Scholar]

[11] Calzetti, D., Armus, L., Bohlin, R. C., et al. 2000, ApJ, 533, 682 [NASA ADS] [CrossRef] [Google Scholar]

[12] Campagne, J. E., Neveu, J., & Plaszczynski, S. 2017, A&A, 602, A72 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[13] Carretero, J., Castander, F. J., Gaztañaga, E., Crocce, M., & Fosalba, P. 2015, MNRAS, 447, 646 [NASA ADS] [CrossRef] [Google Scholar]

[14] Chabrier, G. 2003, PASP, 115, 763 [Google Scholar]

[15] Clampitt, J., Miyatake, H., Jain, B., & Takada, M. 2016, MNRAS, 457, 2391 [NASA ADS] [CrossRef] [Google Scholar]

[16] Clampitt, J., Sánchez, C., Kwan, J., et al. 2017, MNRAS, 465, 4204 [NASA ADS] [CrossRef] [Google Scholar]

[17] Cooray, A., & Sheth, R. 2002, Phys. Rep., 372, 1 [Google Scholar]

[18] Crittenden, R. G., Natarajan, P., Pen, U.-L., & Theuns, T. 2002, ApJ, 568, 20 [NASA ADS] [CrossRef] [Google Scholar]

[19] de Jong, J. T. A., Verdoes Kleijn, G. A., Boxhoorn, D. R., et al. 2015, A&A, 582, A62 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[20] DeRose, J., Wechsler, R. H., Becker, M. R., et al. 2022, Phys Rev. D, 105, 123520 [NASA ADS] [CrossRef] [Google Scholar]

[21] Deshpande, A. C., & Kitching, T. D. 2020, Phys Rev. D, 101, 103531 [NASA ADS] [CrossRef] [Google Scholar]

[22] Driver, S. P., Norberg, P., Baldry, I. K., et al. 2009, Astron. Geophys., 50, 12 [Google Scholar]

[23] Dvornik, A., Hoekstra, H., Kuijken, K., et al. 2018, MNRAS, 479, 1240 [Google Scholar]

[24] Dvornik, A., Hoekstra, H., Kuijken, K., et al. 2020, A&A, 642, A83 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[25] Edge, A., Sutherland, W., Kuijken, K., et al. 2013, The Messenger, 154, 32 [NASA ADS] [Google Scholar]

[26] Eisenstein, D. J., & Hu, W. 1998, ApJ, 496, 605 [Google Scholar]

[27] Erben, T., Schirmer, M., Dietrich, J. P., et al. 2005, Astron. Nachr., 326, 432 [NASA ADS] [CrossRef] [Google Scholar]

[28] Farrow, D. J., Cole, S., Norberg, P., et al. 2015, MNRAS, 454, 2120 [Google Scholar]

[29] Ferrero, I., Crocce, M., Tutusaus, I., et al. 2021, A&A, 656, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[30] Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]

[31] Gough, B. 2009, GNU Scientific Library Reference Manual– Third Edition (Network Theory Ltd.) [Google Scholar]

[32] Gruen, D., Friedrich, O., Krause, E., et al. 2018, Phys Rev. D, 98, 023507 [NASA ADS] [CrossRef] [Google Scholar]

[33] Guo, H., Yang, X., Raichoor, A., et al. 2019, ApJ, 871, 147 [NASA ADS] [CrossRef] [Google Scholar]

[34] Hadzhiyska, B., Bose, S., Eisenstein, D., Hernquist, L., & Spergel, D. N. 2020, MNRAS, 493, 5506 [NASA ADS] [CrossRef] [Google Scholar]

[35] Hartlap, J., Simon, P., & Schneider, P. 2007, A&A, 464, 399 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[36] Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]

[37] Hilbert, S., Hartlap, J., White, S. D. M., & Schneider, P. 2009, A&A, 499, 31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[38] Hildebrandt, H., Köhlinger, F., van den Busch, J. L., et al. 2020, A&A, 633, A69 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[39] Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]

[40] Ishikawa, S., Okumura, T., Oguri, M., & Lin, S.-C. 2021, ApJ, 922, 23 [NASA ADS] [CrossRef] [Google Scholar]

[41] Jarvis, M., Bernstein, G., & Jain, B. 2004, MNRAS, 352, 338 [Google Scholar]

[42] Joachimi, B., Lin, C. A., Asgari, M., et al. 2021, A&A, 646, A129 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[43] Kaiser, N. 1992, ApJ, 388, 272 [Google Scholar]

[44] Kannawadi, A., Hoekstra, H., Miller, L., et al. 2019, A&A, 624, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[45] Krause, E., Fang, X., Pandey, S., et al. 2021, ArXiv e-prints [arXiv:2105.13548]. [Google Scholar]

[46] Kravtsov, A. V., Berlind, A. A., Wechsler, R. H., et al. 2004, ApJ, 609, 35 [Google Scholar]

[47] Kuijken, K., Heymans, C., Hildebrandt, H., et al. 2015, MNRAS, 454, 3500 [Google Scholar]

[48] Linke, L., Simon, P., Schneider, P., et al. 2020a, A&A, 640, A59 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[49] Linke, L., Simon, P., Schneider, P., & Hilbert, S. 2020b, A&A, 634, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[50] Liske, J., Baldry, I. K., Driver, S. P., et al. 2015, MNRAS, 452, 2087 [Google Scholar]

[51] Liu, J. S. 2004, Monte Carlo Strategies in Scientific Computing, 1st edn. (New York, NY: Springer), 31 [CrossRef] [Google Scholar]

[52] Mandelbaum, R., Hirata, C. M., Broderick, T., Seljak, U., & Brinkmann, J. 2006, MNRAS, 370, 1008 [Google Scholar]

[53] Maraston, C. 2005, MNRAS, 362, 799 [NASA ADS] [CrossRef] [Google Scholar]

[54] Martin, S. M. 2019, Ph.D. Thesis, University of Bonn, Germany [Google Scholar]

[55] Mead, A. J., Peacock, J. A., Heymans, C., Joudaki, S., & Heavens, A. F. 2015, MNRAS, 454, 1958 [NASA ADS] [CrossRef] [Google Scholar]

[56] Miller, L., Heymans, C., Kitching, T. D., et al. 2013, MNRAS, 429, 2858 [Google Scholar]

[57] Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]

[58] Nakamura, T. T., & Suto, Y. 1997, Prog. Theor. Phys., 97, 49 [Google Scholar]

[59] Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 [Google Scholar]

[60] Nelder, J. A., & Mead, R. 1965, Comput. J., 7, 308 [Google Scholar]

[61] Planck Collaboration I. 2020, A&A, 641, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[62] Rödiger, J. 2009, Ph.D. Thesis, University of Bonn, Germany [Google Scholar]

[63] Ross, A. J., & Brunner, R. J. 2009, MNRAS, 399, 878 [NASA ADS] [CrossRef] [Google Scholar]

[64] Saghiha, H., Simon, P., Schneider, P., & Hilbert, S. 2017, A&A, 601, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[65] Schirmer, M. 2013, ApJS, 209, 21 [NASA ADS] [CrossRef] [Google Scholar]

[66] Schneider, P., Kilbinger, M., & Lombardi, M. 2005, A&A, 431, 9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[67] Schneider, P., & Watts, P. 2005, A&A, 432, 783 [CrossRef] [EDP Sciences] [Google Scholar]

[68] Scoccimarro, R., Sheth, R. K., Hui, L., & Jain, B. 2001, ApJ, 546, 20 [NASA ADS] [CrossRef] [Google Scholar]

[69] Scranton, R. 2001, MNRAS, 332, 697 [Google Scholar]

[70] Scranton, R. 2002, MNRAS, 339, 410 [Google Scholar]

[71] Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119 [Google Scholar]

[72] Simon, P., & Hilbert, S. 2018, A&A, 613, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[73] Simon, P., Watts, P., Schneider, P., et al. 2008, A&A, 479, 655 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[74] Simon, P., Hetterscheidt, M., Wolf, C., et al. 2009, MNRAS, 398, 807 [NASA ADS] [CrossRef] [Google Scholar]

[75] Simon, P., Erben, T., Schneider, P., et al. 2013, MNRAS, 430, 2476 [Google Scholar]

[76] Smith, R. E., Scoccimarro, R., & Sheth, R. K. 2007, Phys Rev. D, 75 [CrossRef] [Google Scholar]

[77] Springel, V., White, S. D. M., Jenkins, A., et al. 2005, Nature, 435, 629 [Google Scholar]

[78] Szapudi, I., & Szalay, A. S. 1998, ApJ, 494, L41 [NASA ADS] [CrossRef] [Google Scholar]

[79] Taylor, E. N., Hopkins, A. M., Baldry, I. K., et al. 2011, MNRAS, 418, 1587 [Google Scholar]

[80] Tegmark, M., Taylor, A. N., & Heavens, A. F. 1997, ApJ, 480, 22 [NASA ADS] [CrossRef] [Google Scholar]

[81] van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[82] van Uitert, E., Joachimi, B., Joudaki, S., et al. 2018, MNRAS, 476, 4662 [NASA ADS] [CrossRef] [Google Scholar]

[83] Velander, M., van Uitert, E., Hoekstra, H., et al. 2014, MNRAS, 437, 2111 [Google Scholar]

[84] Venemans, B. P., Verdoes Kleijn, G. A., Mwebaze, J., et al. 2015, MNRAS, 453, 2259 [Google Scholar]

[85] Vogelsberger, M., Marinacci, F., Torrey, P., & Puchwein, E. 2020, Nat. Rev. Phys., 2, 42 [Google Scholar]

[86] Wang, Y., Yang, X., Mo, H. J., & van den Bosch, F. C. 2007, ApJ, 664, 608 [NASA ADS] [CrossRef] [Google Scholar]

[87] Watts, P., & Schneider, P. 2005, in Gravitational Lensing Impact on Cosmology, eds. Y. Mellier, & G. Meylan, IAU Symp., 225, 243 [NASA ADS] [Google Scholar]

[88] Weisstein, E. W. 2022, Delta Function. From MathWorld–A Wolfram Web Resource, http://mathworld.wolfram.com/DeltaFunction.html, Last visited on 02/3/2022 [Google Scholar]

[89] White, S. D. M., & Rees, M. J. 1978, MNRAS, 183, 341 [Google Scholar]

[90] Wright, A. H., Robotham, A. S. G., Bourne, N., et al. 2016, MNRAS, 460, 765 [Google Scholar]

[91] Wright, A. H., Hildebrandt, H., Kuijken, K., et al. 2019, A&A, 632, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[92] Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2005, ApJ, 630, 1 [Google Scholar]

[93] Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2011, ApJ, 736, 59 [NASA ADS] [CrossRef] [Google Scholar]

[94] Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]

[95] Zheng, Z., Berlind, A. A., Weinberg, D. H., et al. 2005, ApJ, 633, 791 [NASA ADS] [CrossRef] [Google Scholar]

[96] Zheng, Z., Coil, A. L., & Zehavi, I. 2007, ApJ, 667, 760 [Google Scholar]