Probabilistic Lagrangian bias estimators and the cumulant bias expansion

Jens Stücker; Marcos Pellejero-Ibáñez; Raul E. Angulo; Francisco Maion; Rodrigo Voivodic

doi:10.1051/0004-6361/202451176

Home

All issues

Volume 699 (July 2025)

A&A, 699 (2025) A197

Full HTML

Open Access

Issue		A&A Volume 699, July 2025


Article Number		A197
Number of page(s)		27
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/202451176
Published online		09 July 2025

A&A, 699, A197 (2025)

Probabilistic Lagrangian bias estimators and the cumulant bias expansion

Jens Stücker¹^⋆, Marcos Pellejero-Ibáñez²^⋆, Raul E. Angulo¹^,3, Francisco Maion¹ and Rodrigo Voivodic¹

¹ Donostia International Physics Center (DIPC), Paseo Manuel de Lardizabal 4, 20018 Donostia-San Sebastian, Spain
² Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK
³ IKERBASQUE, Basque Foundation for Science, E-48013 Bilbao, Spain

^⋆ Corresponding authors: jens.stuecker@univie.ac.at, mpelleje@ed.ac.uk

Received: 19 June 2024
Accepted: 1 May 2025

Abstract

The spatial distribution of galaxies is a highly complex phenomenon that is impossible to predict deterministically at present. However, by using a statistical ‘bias’ relation, it has become feasible to robustly model the average abundance of galaxies as a function of the underlying matter density field. Understanding the properties and parametric description of the bias relation is key to extracting cosmological information from future galaxy surveys. In this work, we contribute to this topic primarily in two ways. First, we have developed a set of ‘probabilistic’ estimators for bias parameters using the moments of the Lagrangian galaxy environment distribution. These estimators include spatial corrections at different orders to measure the bias parameters independently of the damping scale. We report robust measurements of a variety of bias parameters for halos, including the tidal bias and its dependence on spin at a fixed mass. Second, we have proposed an alternative formulation of the bias expansion in terms of ‘cumulant bias parameters’, which describe the response of the logarithmic galaxy density to large-scale perturbations. We find that cumulant biases of halos are consistent with zero at orders of n > 2. This suggests that: (i) previously reported bias relations at the order of n > 2 are an artefact of the entangled basis of the canonical bias expansion; (ii) the convergence of the bias expansion may be improved by expressing it in terms of cumulants; and (iii) the bias function is very well approximated by a Gaussian. We explore these avenues in greater depth in a companion paper.

Key words: methods: analytical / cosmology: theory / large-scale structure of Universe

© The Authors 2025

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

Observations of the large-scale distribution of galaxies are among the most promising probes for accurately inferring the cosmological parameters of our Universe. Past large-scale structure surveys have helped shape our current model of the Universe by providing precise measurements of the angular diameter distance and growth rate at different epochs (see e.g. Alam et al. 2021). Forthcoming surveys such as the Dark Energy Instrument (DESI, Levi et al. 2013) and Euclid (Amendola et al. 2018) will measure the sky positions and redshifts (both photometric and spectroscopic) of an unprecedented amount of galaxies. To reliably interpret these datasets, it is of crucial importance to accurately model the spatial distributions of galaxies as a function of cosmology.

The evolution of the large-scale distribution of matter is predominantly driven by gravity and can be predicted very reliably through perturbation theory (see Bernardeau et al. 2002, for a review) or by gravity-only N-body simulations (see e.g. Angulo & Hahn 2022, for a review). However, observed galaxies trace the matter distribution only in a ‘biased’ way (Kaiser 1984; Mo & White 1996). Optimally exploiting the information from galaxy surveys therefore requires not only accurately modelling gravity, but also to account for the formation and evolution of galaxies.

There are a variety of methodologies to model the formation of galaxies. The most detailed approach involves employing hydrodynamical simulations to explicitly follow gas dynamics and the formation and evolution of stars, black holes, and galaxies (see e.g. Vogelsberger et al. 2014; Dubois et al. 2014; Schaye et al. 2015; Davé et al. 2019). In principle, such simulations would be the ideal method to account for the formation of galaxies as long as the underlying physics could be modelled reliably and at an affordable cost. However, in practice, these simulations are limited in volume and number due to their substantial computational requirements. Additionally, they have required the assumption of sub-grid physics; for instance, to model the unresolved formation of stars, as well as the growth of active galactic nuclei and their feedback processes. The results of such simulations depend strongly on the associated parameters and are therefore severely limited in their predictive ability (e.g. Genel et al. 2019; Villaescusa-Navarro et al. 2021).

To overcome the limitations of hydrodynamic simulations, a more agnostic approach can be pursued. Semi-analytic models, for example, follow a number of empirical and semi-empirical relations that provide more versatility to the modelling of galaxy formation physics (see e.g. Kauffmann et al. 1999; Henriques et al. 2015; Stevens et al. 2016; Lacey et al. 2016; Croton et al. 2016; Lagos et al. 2018). An even more agnostic strategy involves techniques such as sub-halo abundance matching (see e.g. Conroy et al. 2006; Reddick et al. 2013; Chaves-Montero et al. 2016; Lehmann et al. 2017; Dragomir et al. 2018; Contreras et al. 2021; Ortega-Martinez et al. 2024) or halo occupation distribution (see e.g.Peacock & Smith 2000; Berlind & Weinberg 2002; Zheng et al. 2005; Cacciato et al. 2012; Salcedo et al. 2022a), which rely solely on gravity simulations and fundamental assumptions about populating collapsed structures (halos and sub-halos) based on their mass. These assumptions must be sufficiently flexible to accommodate various galaxy formation scenarios. Nonetheless, these techniques also have computational constraints and the principles they rely on may oversimplify reality, necessitating extensions to account for dependencies that go beyond solely the mass of the collapsed structure (a phenomenon known as assembly bias; Gao et al. 2005; Wechsler et al. 2006; Gao & White 2007; Croton et al. 2007; Dalal et al. 2008; Faltenbacher & White 2010; Montero-Dorta et al. 2017; Zehavi et al. 2018; Ferreras et al. 2019; Sato-Polito et al. 2019; Tucci et al. 2021; Salcedo et al. 2022b; Chaves-Montero et al. 2023, among others).

The most general approach currently known is a perturbative expansion of galaxy bias (see Desjacques et al. 2018 for a review). In this framework, the evolution of matter is assumed to be modelled accurately through purely gravitational effects, whereas the galaxy field selectively populates the matter field. The galaxy field is then written in a perturbative manner as a function of the properties of the underlying matter distribution. In Eulerian bias schemes, the galaxy number density is expanded in terms of the final properties of the density field; whereas in Lagrangian schemes, this is done in terms of the initial properties of the linear density field. This approach offers great flexibility, allowing for a single model to describe biased tracers with vastly different properties. Bias approaches based on effective field theory are aimed at describing the clustering behaviour of galaxies down to k ≈ 0.2h/Mpc (Baumann et al. 2012; Baldauf et al. 2016a; Vlah et al. 2016). They have been used extensively to extract robust cosmological constraints from surveys (Ivanov et al. 2020; d’Amico et al. 2020; Colas et al. 2020; Nishimichi et al. 2020; Chen et al. 2020; Philcox & Ivanov 2022).

There exist different options for defining the biasing scheme. Traditionally, perturbation theory is used to describe the underlying gravitational evolution of the density field; whereas in the recently developed ‘hybrid approaches’, the gravitational evolution is treated exactly through N-body simulations (Modi et al. 2020; Kokron et al. 2021; Zennaro et al. 2023; Pellejero Ibañez et al. 2022, 2023; DeRose et al. 2023). However, in all bias methods the galaxy number density is expanded in a perturbative series with a set of free coefficients known as ‘bias parameters’. These parameters can be interpreted as the response of the galaxy number density to perturbations of the density field. While these parameters offer some degree of physical insights (Lazeyras et al. 2016; Lazeyras & Schmidt 2019; Barreira et al. 2021), they are generally treated as nuisance parameters that can be marginalised over to extract the cosmological information of interest.

A quantitative understanding of the bias parameters is important for analysing future galaxy surveys. On the one hand, this is necessary to determine how well the bias expansion converges. The question of whether the truncation at a given order gives accurate results depends on higher order terms being small enough that they can be neglected. On the other hand, it is of significant interest to limit the prior volume that is used when fitting to cosmological surveys to maximise the extracted cosmological information. In particular, it may significantly benefit an analysis to fix bias parameters to so called ‘co-evolution relations’ that relate the parameters to each other. Therefore, significant efforts have been placed on measuring the bias parameters and to constrain coevolution relations in simulations.

One of the most accurate methods for measuring bias parameters is based on ‘separate-universe’ simulations (Li et al. 2014; Wagner et al. 2015). In these cases, we can directly test how the number of tracers responds to an increase in the large-scale density. This technique has been used to constrain the density bias parameters of halos to very high accuracy (Lazeyras et al. 2016). Furthermore, similar ideas have been used to measure the response to non-Gaussian perturbations (Barreira 2020) and to the changes in the Laplacian of the density field (Lazeyras & Schmidt 2018). Other bias parameters have been constrained through different methods. For example, the tidal bias can be been measured through Fourier space correlations (Modi et al. 2017; Lazeyras & Schmidt 2018); alternatively, all bias parameters can be constrained simultaneously through forward modeling of the power spectrum (e.g. Zennaro et al. 2022) or the field-level galaxy distribution (e.g. Lazeyras et al. 2021).

Another successful avenue of measuring and understanding the behaviour of bias parameters is through peak theory (Bardeen et al. 1986) where the number density of peaks of the initial density field is investigated as a function of a Lagrangian smoothing scale. If galaxies (and halos) correspond to peaks of the initial density field, then bias parameters can be predicted through the response of peaks to large scale perturbations. While the mapping between structures and peaks bears significant uncertainty due to the necessary inclusion of a heuristic smoothing scale to define peaks and significant difficulty with treating the cloud-in-cloud problem (Bardeen et al. 1986), peak theory has proven as a useful tool in various aspects of biasing (Desjacques et al. 2018). For example, it predicts the importance of a Laplacian bias parameter (Bardeen et al. 1986), the scale dependence of the bias in the matter correlation function (Desjacques et al. 2010; Paranjape et al. 2013b,a) and non-zero velocity bias (Baldauf et al. 2015). All of these predictions have been established through measurements in simulations. Furthermore, peak theory and excursion set approaches have motivated the possibility of measuring biases through correlations between density and halos in Lagrangian space (Musso et al. 2012; Paranjape et al. 2013b,a; Biagetti et al. 2014). In particular, Paranjape et al. (2013a) have shown that it is possible to recover precise measurements of (scale-independent) large-scale bias parameters by mapping the scale-dependence of ‘naive’ bias parameters under the assumption of Peak theory to their large scale limit.

In this paper, we propose a novel approach for accurately measuring Lagrangian bias parameters from simulations. Here, we consider the distribution of the linear density field at the initial (Lagrangian) locations of galaxies, which we call the ‘galaxy environment distribution’. We adopted a probabilistic approach to model this galaxy environment distribution and show that the large-scale bias parameters have a simple relationship to the moments of this distribution. We used this to derive estimators of the bias parameters that can take into account spatial corrections at any order to practically eliminate any scale dependence. We derived such estimators for a broad variety of bias parameters and provide corresponding measurements for the case of halos. Operationally, our method is similar to aforementioned peak theory approaches. However, it is (by design) independent of the assumption that galaxies form in smoothed Lagrangian density peaks and it can be used to correct bias measurements at arbitrary high spatial orders.

Furthermore, we propose a new set of ‘cumulant bias parameters’, which are defined as the response of the logarithm of the galaxy number density to perturbations in the linear field. We show that these parameters have significantly improved properties when compared to their canonical counterparts. In the case of halos, we find that cumulant biases at the order of 3 and higher are consistent with zero. Therefore, the canonical co-evolution relations for halos at the order of 3 and beyond primarily appear as an effect of a sub-optimal parameterisation. We suggest that rephrasing the bias expansion in terms of the cumulant bias parameters can significantly enhance its convergence.

The vanishing of cumulants for halos at the order of 3 and beyond implies that the bias function can be well approximated through a Gaussian. We will explore such a Gaussian bias model in our companion paper (Stücker et al. 2025).

This article is organised as follows: In Sect. 2, we explain the probabilistic approach to measure Lagrangian bias parameters of scalar variables such as the density and the Laplacian. We introduce the concept of the cumulant bias expansion. In Sect. 3, we present measurements of the corresponding parameters and demonstrate how cumulant bias parameters exhibit several practical advantages. In Sect. 4, we show how to generalise the concept of the probabilistic estimators for tensorial bias variables, such as the tidal bias, b_K². In Sect. 5, we provide measurements of a few select tensorial quantities. In Sect. 6, we briefly discuss the quantitative importance of different bias terms. Finally, in Sect. 7, we summarise the benefits of our novel estimators and we discuss the circumstances where it is advantageous to phrase the bias expansion in terms of the cumulant bias parameters.

2. Theory

In this section, we introduce the necessary theory and (1) relate the bias parameters to properties of the galaxy environment distribution; (2) express the moment and cumulant generating functions of the galaxy environment distribution in terms of the large-scale bias functio; (3) introduce the concept of a cumulant bias expansion; (4) and show how to derive estimators for both the canonical and cumulant bias parameters.

2.1. Definitions

Our considerations are based on the idea of the peak-background split (PBS) which states that ‘a long-wavelength density perturbation acts as a local modification of the background density for the purposes of the formation of halos and galaxies’ (Kaiser 1984; Bardeen et al. 1986; Desjacques et al. 2018). Bias parameters describe the response of the galaxy number to such long-wavelength perturbations. In this paper, we focus exclusively on Lagrangian bias parameters as the response to perturbations in the linear density field.

An exact implementation of the PBS is given by the separate-universe approach. In this approach, we consider a generic universe with background density, ρ_bg, 0, whereby galaxies form with an average number density, n_g, 0. If we were to increase the initial background density of the universe by a linear amount δ₀ (e.g. in a separate-universe simulation, Frenk et al. 1988; Li et al. 2014; Wagner et al. 2015), in this new universe, galaxies would form with a different average number density, n_g(δ₀)¹. We refer to their ratio as:

$\begin{matrix} F (δ_{0}) & = \frac{n_{g} (δ_{0})}{n_{g, 0}} = 1 + b_{1} δ_{0} + \frac{1}{2} b_{2} δ_{0}^{2} + \dots + \frac{1}{n!} b_{n} δ_{0}^{n} + \dots \end{matrix}$ $\begin{aligned} F(\delta _0)&= \frac{n_g(\delta _0)}{n_{g,0}} = 1 + b_1 \delta _0 + \frac{1}{2} b_2 \delta _0^2 + \cdots + \frac{1}{n!}b_n\delta _0^n + \cdots \end{aligned}$ (1)

Here, we refer to the bias function or the separate-universe bias function, where δ₀ refers to a contrast in linear densities so that the new background density corresponds to

$\begin{matrix} ρ_{bg} \approx ρ_{bg, 0} (1 + δ_{0} D (a)) for a \to 0, \end{matrix}$ $\begin{aligned} \rho _{\mathrm{bg} } \approx \rho _{\mathrm{bg,0} } (1 + \delta _0 D(a)) \quad \text{ for} a \rightarrow 0, \end{aligned}$ (2)

where D is the linear growth factor normalised to D(a = 1) = 1 and F can be directly measured with separate-universe simulations (e.g. Lazeyras et al. 2016; Baldauf et al. 2016b). We refer to the coefficients of the indicated expansion as the ‘canonical bias parameters’ or simply ‘the bias parameters’ as:

$\begin{matrix} b_{n} & = {\frac{\partial^{n} F (δ_{0})}{\partial δ_{0}^{n}} |}_{δ_{0} = 0} . \end{matrix}$ $\begin{aligned} b_n&= \left. \frac{\partial ^{n} F(\delta _0)}{\partial \delta _0^{n}} \right|_{\delta _0 = 0}. \end{aligned}$ (3)

Therefore, the bias parameters physically describe the response of the galaxy density to small perturbations at infinitely large scales. In this paper, we want to investigate galaxy bias from a probabilistic perspective.

We considered an infinitesimally small Lagrangian volume element, about which nothing is known, apart from the linear density contrast, δ, smoothed at some scale (and possibly other features of the linear field like the Laplacian, L, or the tidal field). Neglecting the primordial non-Gaussianity, the density contrast follows a Gaussian distribution:

$\begin{matrix} p (δ) & = \frac{1}{\sqrt{2 π} σ} exp (- \frac{δ^{2}}{2 σ^{2}}) . \end{matrix}$ $\begin{aligned} p(\delta )&= \frac{1}{\sqrt{2 \pi } \sigma } \exp \left( - \frac{\delta ^2}{2 \sigma ^2} \right) . \end{aligned}$ (4)

For simplicity, we assume throughout this article that the smoothed density contrast is defined with a sharp k-space filter. Most considerations can be translated to cases filtered in different ways and in a simple manner, but some additional care must be taken due to the more complicated correlation between large and small scales. This is discussed in more detail in Appendix A.

When a sufficiently small volume element is considered, then it is only possible to have either ‘0’ or ‘1’ galaxy. We may therefore speak of a binary event ‘g’ that a volume contains a galaxy. We call the average probability that a galaxy forms in such a volume element ‘p(g)’ and we call the conditional probability, given the knowledge of the linear density contrast, ‘p(g|δ)’². The excess probability,

$\begin{matrix} f (δ) : = \frac{p (g | δ)}{p (g)}, \end{matrix}$ $\begin{aligned} f(\delta ) := \frac{p(\mathrm{g} | \delta )}{p(\mathrm{g} )}, \end{aligned}$ (5)

is parameterised through a function f(δ), which we refer to as the ‘scale-dependent bias function’ or just the ‘bias function’ throughout this article. The bias function depends in a predictable manner on the variance of δ at the considered scale, as we show later in this paper.

Since densities at different scales add up linearly, a separate-universe style modification of the large-scale density contrast from 0 to δ₀ will immediately translate to a modification of the linear density in our volume element δ → δ + δ₀. Therefore, F and f should be related through

$\begin{matrix} F (δ_{0}) & = ⟨ f (δ + δ_{0}) ⟩, \end{matrix}$ $\begin{aligned} F(\delta _0)&= \langle f(\delta + \delta _0) \rangle , \end{aligned}$ (6)

where the angled brackets indicate an expectation value taken over the Lagrangian volume (see also Desjacques et al. 2018). The relation indicates that in a separate-universe experiment, the number of galaxies would change according to the average change in probability of forming galaxies when changing the linear density contrast everywhere in space. Therefore, the canonical bias parameters are given in terms of the scale dependent bias function as

$\begin{matrix} b_{n} & = 〈 \frac{\partial^{n} f}{\partial δ^{n}} 〉 . \end{matrix}$ $\begin{aligned} b_n = \left\langle { \frac{\partial ^n f}{\partial \delta ^n} } \right\rangle . \end{aligned}$ (7)

Later, we show that Equation (6) holds only approximately, since at any finite smoothing scale, the density contrast, δ, is correlated with other variables (e.g. the Laplacian), so that a change in the small-scale density contrast at the location of all galaxies is not exactly equivalent to the ‘pure’ density change in separate-universe experiments. This introduces scale-dependencies that can be accounted for.

Finally, we introduce one further object which we call the ‘galaxy environment distribution’ of p(δ|g) or halo environment distribution when we are specifically talking about halos. This quantifies the probability of the linear density contrast in an infinitesimal volume element, given that there is a galaxy at the considered location. This function can easily be measured as a histogram of the linear density at the Lagrangian locations of galaxies as is illustrated in Figure 1, where we used a damping scale of k_d = 0.15 h Mpc⁻¹ here³, leading to σ = 0.56.

Fig. 1.

Illustration of the inference of the halo environment distribution p(δ|g). Left: Galaxies traced back to their origin in Lagrangian space (marked as black dots) and with the (smoothed) linear density field, δ, inferred at their Lagrangian locations. Right: Environment distribution (orange histogram) given by the distribution of δ at the galaxy locations which is notably biased relative to the matter distribution, p(δ), (blue histogram and a Gaussian represented as dashed line). The galaxy environment distribution is well approximated through p(δ)f(δ), where here f(δ) is a quadratic polynomial bias function.

Through Bayes’ theorem, the relation between the galaxy environment distribution and the bias function is given by

$\begin{matrix} p (δ | g) & = \frac{p (δ \cap g)}{p (g)} \end{matrix}$ $\begin{aligned} p(\delta | g)&= \frac{p(\delta \cap g)}{p(g)} \end{aligned}$ (8)

$\begin{matrix} = \frac{p (g | δ) p (δ)}{p (g)} \end{matrix}$ $\begin{aligned}&= \frac{p(g | \delta ) p(\delta )}{p(g)} \end{aligned}$ (9)

$\begin{matrix} = f (δ) p (δ) . \end{matrix}$ $\begin{aligned}&= f(\delta ) p(\delta ) . \end{aligned}$ (10)

This means for example, that the bias function, f(δ), can be investigated in a non-parametric way by measuring p(δ|g) and dividing by the Gaussian background distribution, p(δ), as we will investigate in the companion paper (Stücker et al. 2025).

In this section, we explain how we use probability theory to investigate the properties of the galaxy environment distribution. In particular, we show that the bias parameters are simply related to the moments of this distribution and that probability theory motivates the application of better behaved ‘cumulant bias parameters’.

2.2. Bias estimators

Following up on Equation (6), we can write

$\begin{matrix} F (δ_{0}) & = ⟨ f (δ + δ_{0}) ⟩ \\ = \int_{- \infty}^{\infty} p (δ) f (δ + δ_{0}) d δ \\ = \int_{- \infty}^{\infty} p (δ - δ_{0}) f (δ) d δ . \end{matrix}$ $\begin{aligned} F(\delta _0)&= \langle f(\delta + \delta _0) \rangle \nonumber \\&= \int _{-\infty }^\infty p(\delta ) f(\delta + \delta _0) \mathrm{d} \delta \nonumber \\&= \int _{-\infty }^\infty p(\delta - \delta _0) f(\delta ) \mathrm{d} \delta . \end{aligned}$ (11)

We note that in the last line, we made the substitution δ → δ + δ₀; also, δ represents the linearly extrapolated overdensity and, hence, it can assume values that are between negative and positive infinity. The latter Equation can be interpreted as an alternative perspective onto the separate-universe experiment: when increasing the background density, the probability of environments is changed by a factor p(δ − δ₀)/p(δ), whereas the likelihood of forming a galaxy when presupposing a given environment stays constant.

Now, combining Eqs. (3) and (11), we can evaluate the bias parameters as

$\begin{matrix} b_{n} & = {\frac{\partial^{n}}{\partial δ_{0}^{n}} \int_{- \infty}^{\infty} p (δ - δ_{0}) f (δ) d δ |}_{δ_{0} = 0} \\ = {(- 1)}^{n} \int_{- \infty}^{\infty} p^{(n)} (δ) f (δ) d δ \\ = {(- 1)}^{n} \int_{- \infty}^{\infty} \frac{p^{(n)} (δ)}{p (δ)} p (δ | g) d δ \\ = {(- 1)}^{n} {〈 \frac{p^{(n)} (δ)}{p (δ)} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_n&= \left. \frac{\partial ^n}{\partial \delta _0^n} \int _{-\infty }^\infty p(\delta - \delta _0) f(\delta ) \mathrm{d} \delta \nonumber \right|_{\delta _0 = 0} \\&= (-1)^n \int _{-\infty }^\infty p^{(n)}(\delta ) f(\delta ) \mathrm{d} \delta \nonumber \\&= (-1)^n \int _{-\infty }^\infty \frac{p^{(n)}(\delta )}{p(\delta )} p(\delta |g) \mathrm{d} \delta \nonumber \\&= (-1)^n \left\langle { \frac{p^{(n)} (\delta )}{p(\delta )} } \right\rangle _{\mathrm{g} } , \end{aligned}$ (12)

where the angled brackets with a ‘g’ subscript indicate an expectation value evaluated over the locations of galaxies (rather than all of Lagrangian space). Furthermore, we can use the fact that p is a Gaussian distribution, for which the derivatives are given by the (probabilisist’s) Hermite polynomials:

$\begin{matrix} \frac{\partial^{n}}{\partial δ^{n}} exp (- \frac{δ^{2}}{2 σ^{2}}) & = {(- 1)}^{n} σ^{- n} exp (- \frac{δ^{2}}{2 σ^{2}}) H_{n} (δ / σ), \end{matrix}$ $\begin{aligned} \frac{\partial ^n }{\partial \delta ^n} \exp \left(- \frac{\delta ^2}{2 \sigma ^2}\right)&= (-1)^n \sigma ^{-n} \exp \left(- \frac{\delta ^2}{2 \sigma ^2}\right) H_n \left( \delta / \sigma \right), \end{aligned}$ (13)

so that the bias estimators are:

$\begin{matrix} b_{n, so 0} & = {〈 \frac{H_{n} (δ / σ)}{σ^{n}} 〉}_{g} . \end{matrix}$ $\begin{aligned} b_{n, \mathrm{so0} }&= \left\langle { \frac{H_n \left( \delta / \sigma \right)}{\sigma ^n} } \right\rangle _{\mathrm{g} } . \end{aligned}$ (14)

Here, the subscript ‘so0’ indicates that these estimators are at a ‘spatial order of 0’. In other words, they do not include corrections from higher spatial derivatives such as the Laplacian (as we explain in more detail later in this work). This expression was already used by Paranjape et al. (2013a,b) to measure the bias parameters. It was motivated by its emergence in excursion set frameworks (Musso et al. 2012) and peak statistics (Paranjape & Sheth 2012), but it is clearly also valid outside such frameworks, requiring us to only assume the PBS. On a related note, it is worth mentioning that Szalay (1988) previously proposed expanding the field in terms of Hermite polynomials.

The estimators for the first four bias parameters are expressed as:

$\begin{matrix} b_{1, so 0} & = {〈 \frac{δ}{σ^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{1,\mathrm{so0} }&= \left\langle { \frac{\delta }{\sigma ^2} } \right\rangle _{\mathrm{g} } ,\end{aligned}$ (15)

$\begin{matrix} b_{2, so 0} & = {〈 \frac{δ^{2} - σ^{2}}{σ^{4}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{2,\mathrm{so0} }&= \left\langle { \frac{\delta ^2 - \sigma ^2}{\sigma ^4} } \right\rangle _{\mathrm{g} } ,\end{aligned}$ (16)

$\begin{matrix} b_{3, so 0} & = {〈 \frac{δ^{3} - 3 δ σ^{2}}{σ^{6}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{3,\mathrm{so0} }&= \left\langle { \frac{\delta ^3 - 3 \delta \sigma ^2}{\sigma ^6} } \right\rangle _{\mathrm{g} } ,\end{aligned}$ (17)

$\begin{matrix} b_{4, so 0} & = {〈 \frac{δ^{4} - 6 δ^{2} σ^{2} + 3 σ^{4}}{σ^{8}} 〉}_{g} . \end{matrix}$ $\begin{aligned} b_{4,\mathrm{so0} }&= \left\langle { \frac{\delta ^4 - 6 \delta ^2 \sigma ^2 + 3 \sigma ^4}{\sigma ^8} } \right\rangle _{\mathrm{g} } . \end{aligned}$ (18)

2.3. The moment generating function

Equation (14) shows that there exists a simple relation between moments of the galaxy environment distribution and the bias parameters. We can show this in a very general manner. Expanding the Gaussian background distribution, we obtain

$\begin{matrix} p (δ - δ_{0}) & = \frac{1}{\sqrt{2 π} σ} exp (- \frac{δ^{2} - 2 δ δ_{0} + δ_{0}^{2}}{2 σ^{2}}) \\ = p (δ) exp (\frac{δ δ_{0}}{σ^{2}}) exp (- \frac{δ_{0}^{2}}{2 σ^{2}}) . \end{matrix}$ $\begin{aligned} p(\delta - \delta _0)&= \frac{1}{\sqrt{2 \pi } \sigma } \exp \left( - \frac{\delta ^2 - 2\delta \delta _0 + \delta _0^2}{2 \sigma ^2} \right) \nonumber \\&= p(\delta ) \exp \left(\frac{\delta \delta _0}{\sigma ^2} \right) \exp \left( - \frac{\delta _0^2}{2 \sigma ^2} \right). \nonumber \end{aligned}$

Inserting this into Equation (11) yields

$\begin{matrix} F (δ_{0}) & = exp (- \frac{δ_{0}^{2}}{2 σ^{2}}) \int_{- \infty}^{\infty} f (δ) p (δ) exp (\frac{δ δ_{0}}{σ^{2}}) d δ \\ = exp (- \frac{1}{2} t^{2} σ^{2}) \int_{- \infty}^{\infty} p (δ | g) exp (t δ) d δ \\ = exp (- \frac{1}{2} t^{2} σ^{2}) {〈 exp (t δ) 〉}_{g}, \end{matrix}$ $\begin{aligned} F(\delta _0)&= \exp \left( - \frac{\delta _0^2}{2 \sigma ^2} \right) \int _{-\infty }^\infty f(\delta ) p(\delta ) \exp \left(\frac{\delta \delta _0}{\sigma ^2} \right) \nonumber \mathrm{d} \delta \\&= \exp \left( - \frac{1}{2} t^2 \sigma ^2 \right) \int _{-\infty }^\infty p(\delta |g) \exp \left(t \delta \right) \nonumber \mathrm{d} \delta \\&= \exp \left( - \frac{1}{2} t^2 \sigma ^2 \right) \left\langle {\exp \left(t \delta \right)} \right\rangle _{\mathrm{g} } , \end{aligned}$ (19)

where we have labelled t = δ₀/σ². The last term can be identified with the moment generating function,

$\begin{matrix} M_{g} (t) = {〈 exp (t δ) 〉}_{g}, \end{matrix}$ $\begin{aligned} M_g(t) = \left\langle {\exp \left(t \delta \right)} \right\rangle _{\mathrm{g} } , \end{aligned}$ (20)

of the galaxy environment distribution. Therefore, the moment-generating function of the galaxy environment distribution and the separate-universe bias function can be directly converted into each other as follows:

$\begin{matrix} M_{g} (t) & = F (t σ^{2}) exp (\frac{1}{2} t^{2} σ^{2}) . \end{matrix}$ $\begin{aligned} M_g(t)&= F(t \sigma ^2) \exp \left(\frac{1}{2} t^2 \sigma ^2 \right). \end{aligned}$ (21)

Furthermore, the relation between the moments and the bias parameters, as given by Equation (14) can equivalently be found by taking derivatives of the moment generating function:

$\begin{matrix} μ_{n} & : = {〈 δ^{n} 〉}_{g} = {\frac{\partial}{\partial t^{n}} M_{g} (t) |}_{t = 0} . \end{matrix}$ $\begin{aligned} \mu _n&:= \left\langle {\delta ^n} \right\rangle _{\mathrm{g} } = \left. \frac{\partial }{\partial t^n} M_g(t) \right|_{t = 0} . \end{aligned}$ (22)

It is worth noting that this result may be related to the considerations in White (1979), where in the case of discrete tracers, the moment-generating function of the galaxy count frequency distribution is expressed through the void probability function.

2.4. Cumulant bias parameters

We may further consider how the bias parameters are related to the cumulants of the galaxy environment distribution. In probability theory, the value of cumulants are generally thought to characterise a distribution more independently than its moments. For example, if a distribution has a large first moment ⟨x⟩, then we should also expect that it has large second and third moments ⟨x²⟩ and ⟨x³⟩. However, the first cumulant of a distribution ⟨x⟩ says very little about the second and third cumulants, ⟨(x−⟨x⟩)²⟩ and ⟨(x−⟨x⟩)³⟩, respectively. For example, for a Gaussian distribution the mean is independent of its variance and the third and higher order cumulants are actually zero.

The cumulant generating function is defined as

$\begin{matrix} K_{g} (t) & = log M_{g} (t) \\ = \frac{1}{2} t^{2} σ^{2} + log F (t σ^{2}) . \end{matrix}$ $\begin{aligned} K_g(t)&= \log M_g(t) \nonumber \\&= \frac{1}{2} t^2 \sigma ^2 + \log F(t \sigma ^2) . \end{aligned}$ (23)

Cumulants of the galaxy environment distribution are simply given by the derivatives of the cumulant generating function

$\begin{matrix} κ_{n} & = {\frac{\partial K_{g}}{\partial t^{n}} |}_{t = 0} . \end{matrix}$ $\begin{aligned} \kappa _n&= \left. \frac{\partial K_g}{ \partial t^n} \right|_{t = 0} . \end{aligned}$ (24)

Therefore, they can be evaluated as:

$\begin{matrix} κ_{n} & = {\begin{matrix} β_{n} σ^{2 n} & i f n \neq 2, \\ β_{2} σ^{4} + σ^{2} & i f n = 2, \end{matrix} \end{matrix}$ $\begin{aligned} \kappa _n&= {\left\{ \begin{array}{ll} \beta _n \sigma ^{2n}&\mathrm \quad if \quad n \ne 2, \\ \beta _2 \sigma ^4 + \sigma ^2&\mathrm \quad if \quad n = 2, \end{array}\right.} \end{aligned}$ (25)

where we have defined

$\begin{matrix} β_{n} & = {\frac{\partial^{n}}{\partial δ_{0}^{n}} log F (δ_{0}) |}_{δ_{0} = 0} . \end{matrix}$ $\begin{aligned} \beta _n&= \left. \frac{\partial ^n}{\partial \delta _0^n} \log F(\delta _0) \right|_{\delta _0 = 0} . \end{aligned}$ (26)

We notice that the question of whether β₂ is above or below zero is a direct indication of whether the variance of the halo environment distribution is larger or smaller than that of the background. We refer to these parameters as ‘cumulant bias parameters’ and they are directly related to the canonical bias parameters b_n. Comparing their definition to the canonical bias parameters in Equation (3) shows that they relate to each other exactly in the same way that cumulants relate to moments (compare Equations (22) and (24)). For example, at the first four orders we have:

$\begin{matrix} β_{1} & = b_{1}, \end{matrix}$ $\begin{aligned} \beta _{1}&= b_{1} ,\end{aligned}$ (27)

$\begin{matrix} β_{2} & = b_{2} - b_{1}^{2}, \end{matrix}$ $\begin{aligned} \beta _{2}&= b_{2} - b_{1}^{2} ,\end{aligned}$ (28)

$\begin{matrix} β_{3} & = b_{3} - 3 b_{1} b_{2} + 2 b_{1}^{3}, \end{matrix}$ $\begin{aligned} \beta _{3}&= b_{3} - 3 b_{1} b_{2} + 2 b_{1}^{3} ,\end{aligned}$ (29)

$\begin{matrix} β_{4} & = b_{4} - 4 b_{1} b_{3} - 3 b_{2}^{2} + 12 b_{1}^{2} b_{2} - 6 b_{1}^{4} . \end{matrix}$ $\begin{aligned} \beta _{4}&= b_{4} - 4 b_{1} b_{3} - 3 b_{2}^{2} + 12 b_{1}^{2} b_{2} - 6 b_{1}^{4} . \end{aligned}$ (30)

2.5. Interpretation

Given a set of bias parameters, these relations allow us to directly find the cumulants of the galaxy environment distribution, or alternatively, they allow us to infer canonical bias parameters by measuring cumulants of the galaxy environment distribution.

However, it is also possible to phrase the bias expansion directly in terms of the cumulant bias parameters.

$\begin{matrix} log F & = β_{1} δ_{0} + \frac{1}{2} β_{2} δ_{0}^{2} + \frac{1}{6} β_{3} δ_{0}^{3} + \dots \end{matrix}$ $\begin{aligned} \log F&= \beta _1 \delta _0 + \frac{1}{2} \beta _2 \delta _0^2 + \frac{1}{6} \beta _3 \delta _0^3 + \cdots \end{aligned}$ (31)

There are several reasons to believe that they may form a better set of parameters than the canonical bias parameters:

The cumulant bias parameters are the derivatives of the logarithm of the galaxy density. Therefore, they presuppose the positivity of F and may be better behaved, especially in low density regions.
The canonical bias parameters behave similarly to moments of the galaxy environment distribution. For example, we may expect that if b₁ is large, automatically b₄ will also be large. On the other hand, we may expect β₄ to be independent of the value of β₁ – just as cumulants are relatively independent of each other.
If the galaxy environment distribution has the form of a Gaussian then β_n = 0 for all n > 2. On the other hand, all b_n would be non-zero in this case. Therefore, if all β_n beyond a degreeof 2 are small, this motivates the usage of a Gaussian bias model – where both log F and log f are quadratic polynomials. We show here that this is indeed the case for halos.

Furthermore, we note that the cumulant bias parameters and the probabilistic considerations in this section motivate novel approaches to parameterising the bias function at finite smoothing scales. There are several possible well motivated approaches and we only mention them here in brief, leaving a more thorough investigation to future studies.

The first and most straight-forward proposition is to assume an expansion of the logarithm of the bias function, leading to an exponential of a polynomial

$\begin{matrix} log f (δ) & = γ_{0} + γ_{1} δ + γ_{2} δ^{2} + γ_{3} δ^{3} + \dots \end{matrix}$ $\begin{aligned} \log f(\delta )&= \gamma _0 + \gamma _1 \delta + \gamma _2 \delta ^2 + \gamma _3 \delta ^3 + \cdots \end{aligned}$ (32)

$\begin{matrix} f (δ) & = exp (γ_{0} + γ_{1} δ + γ_{2} δ^{2} + γ_{3} δ^{3} + \dots) . \end{matrix}$ $\begin{aligned} f(\delta )&= \exp \left(\gamma _0 + \gamma _1 \delta + \gamma _2 \delta ^2 + \gamma _3 \delta ^3 + \cdots \right) . \end{aligned}$ (33)

Such a bias expansion would have to be truncated at an even order (if n ≥ 2) and the highest order coefficient must be negative to guarantee a fully normalised probability distribution. By definition, this guarantees the positivity of the bias function which is arguably a desirable property – especially if our intention is to create mocks from a bias model. Furthermore, the resulting distribution would be part of the exponential family which guarantees several desirable properties. The main limitation is that at orders of n > 2, it might be difficult to properly re-normalise the expression analytically⁴.

This choice is the only general one we know of that guarantees f(δ) > 0 for all values of δ, even if the expansion is truncated. However, there are a few other noteworthy options that do not fulfill this constraint. One option is motivated by the intriguing simplicity of the relation between cumulant generating the function in Equation (23) and F. This motivates to define the bias function directly through the cumulant generating function,

$\begin{matrix} K_{g} (t) & = K_{m} (t) + \sum_{n} \frac{1}{n!} β_{n} t σ^{2 n,} \end{matrix}$ $\begin{aligned} K_g(t)&= K_{\mathrm{m} }(t) + \sum _n \frac{1}{n!} \beta _n t \sigma ^{2n,} \end{aligned}$ (34)

where K_m(t) is the cumulant generating function of the matter distribution. This general Ansatz might also be feasible in Eulerian space, but for the Lagrangian case, it is simply $K_{m} (t) = \frac{1}{2} t σ^{2}$ $K_{\mathrm{m}}(t) = \frac{1}{2} t \sigma^2$ .

If the expansion is truncated at an order of n = 2 then this corresponds to a well defined Gaussian bias model, but if it is truncated at a higher order, then it is not easy to find an analytical form of f(δ). However, a numerical expression can be obtained through a Fourier transformation. Unfortunately, such a truncation at higher orders does not lead to a well defined probability density, but instead may have negative function values. In fact the cumulant generating function does not yield a positive probability density function (pdf) for any finite order polynomial of degree n > 2 (Lukacs 1970). However, this does not disfavour this approach over the canonical bias expansion which also yields a negative pdf.

Furthermore, we note that it is an option to phrase the bias expansion relative to a Gaussian distribution that has the correct cumulants up to an order of 2, as follows:

$\begin{matrix} p (δ | g) & = \frac{1}{\sqrt{2 π} σ_{g}} exp (- \frac{{(δ - μ_{g})}^{2}}{2 σ_{g}^{2}}) (1 + γ_{3} δ^{3} + γ_{4} δ^{4} + \dots), \end{matrix}$ $\begin{aligned} p(\delta |g)&= \frac{1}{\sqrt{2 \pi } \sigma _g} \exp \left( - \frac{(\delta - \mu _g)^2}{2 \sigma _g^2} \right) (1 + \gamma _3 \delta ^3 + \gamma _4 \delta ^4 + \cdots ), \end{aligned}$ (35)

where μ_g = κ₁ = β₁σ² and σ_g² = κ₂ = β₂σ⁴ + σ². This is also where the γ_n start at an order of 3, since the lowest two orders are already specified through the Gaussian. This type of expansion is similar to a truncated Edgeworth series (which is an expansion to approximate distributions with given cumulants).

Finally, we note that it is possible to use the cumulant bias parameters in a classical polynomial bias expansion as well. For example, a third-order expansion that only uses two parameters, assuming β₃ = 0 would read

$\begin{matrix} f (δ) & = β_{1} δ + (β_{2} + β_{1}^{2}) (δ^{2} - σ^{2}) + (3 β_{1} β_{2} - β_{1}^{3}) (δ^{3} - 3 δ σ^{2}) + \dots \end{matrix}$ $\begin{aligned} f(\delta )&= \beta _1 \delta + (\beta _2 + \beta _1^2) (\delta ^2 - \sigma ^2) + (3 \beta _1 \beta _2 - \beta _1^3) (\delta ^3 - 3 \delta \sigma ^2) + \cdots \end{aligned}$ (36)

This can be understood as the lower order terms predicting the likely behaviour of higher order terms.

2.6. Multivariate estimators

So far, we have only considered the bias function based on the assumption that the only known aspect of the environment is the density, δ, at the considered location. However, other aspects of the environment may be known, such as the Laplacian,

$\begin{matrix} L & = \nabla^{2} δ, \end{matrix}$ $\begin{aligned} L&= \nabla ^2 \delta , \end{aligned}$ (37)

or higher order derivatives of the density field,

$\begin{matrix} P & = \nabla^{4} δ, \end{matrix}$ $\begin{aligned} P&= \nabla ^4 \delta , \end{aligned}$ (38)

or the tidal field. Since the tidal field is inherently a tensorial quantity, its treatment is slightly more complicated and we therefore discuss this in more detail in Sect. 4.

Assuming that we are dealing only with scalar quantities that describe the environment, we may summarise the environment through a single vector x – for example, if we consider the density and Laplacian as variables we have x = (δ, L)^T. The majority of the relations that we have derived in the previous section hold in a similar manner for the multivariate case as well. We do not derive all of them again, since their derivation follows almost completely analogously, however, we do list all the important relations.

Assuming that all the considered environment variables follow from linear operators on the Gaussian random field, their distribution is given by a multivariate Gaussian,

$\begin{matrix} p (x) & = \frac{1}{2 π \sqrt{det (C)}} exp (- \frac{1}{2} x^{T} C^{- 1} x) . \end{matrix}$ $\begin{aligned} p(\boldsymbol{x})&= \frac{1}{2 \pi \sqrt{\det (\mathsf C )}} \exp \left(- \frac{1}{2} \boldsymbol{x}^T \mathsf{C }^{-1} \boldsymbol{x} \right). \,\, \end{aligned}$ (39)

For example, for the case of x = (δ, L)^T, which is particularly well motivated by the peak model (Bardeen et al. 1986), we have the covariance matrix:

$\begin{matrix} C_{δ, L} & = [\begin{matrix} σ_{0}^{2} & - σ_{1}^{2} σ_{1}^{2} & σ_{2}^{2} \end{matrix}], \end{matrix}$ $\begin{aligned} \mathsf{C }_{\delta ,L}= \left[\begin{matrix} {\sigma _{0}^{2}}&{- \sigma _{1}^{2}} & {\sigma _{1}^{2}} &{\sigma _{2}^{2}}\end{matrix}\right], \end{aligned}$ (40)

$\begin{matrix} C_{δ, L}^{- 1} & = [\begin{matrix} \frac{σ_{2}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} & \frac{σ_{1}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} \\ \frac{σ_{1}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} & \frac{σ_{0}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} \end{matrix}], \end{matrix}$ $\begin{aligned} \mathsf{C }_{\delta ,L}^{-1}= \left[\begin{matrix}\frac{\sigma _{2}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}&\frac{\sigma _{1}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}\\ \frac{\sigma _{1}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}&\frac{\sigma _{0}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}\end{matrix}\right], \end{aligned}$ (41)

where σ₀² = ⟨δ²⟩, σ₂² = ⟨L²⟩ and σ₁² = ⟨−δL⟩.

Now, we can conveniently write bias parameters in a matrix form

$\begin{matrix} b_{1} & = {\nabla_{x} F |}_{x = 0}, \end{matrix}$ $\begin{aligned} \boldsymbol{b}_1&= \left. \nabla _{\boldsymbol{x}} F \, \right|_{\boldsymbol{x} = 0} ,\end{aligned}$ (42)

$\begin{matrix} b_{2} & = {(\nabla_{x} \otimes \nabla_{x}) F |}_{x = 0}, \end{matrix}$ $\begin{aligned} \mathsf{b }_2&= \left. (\nabla _{\boldsymbol{x}} \otimes \nabla _{\boldsymbol{x}}) F \, \right|_{\boldsymbol{x} = 0} ,\end{aligned}$ (43)

$\begin{matrix} b_{3} & = {(\nabla_{x} \otimes \nabla_{x} \otimes \nabla_{x}) F |}_{x = 0}, \end{matrix}$ $\begin{aligned} \mathsf{b }_3&= \left. (\nabla _{\boldsymbol{x}} \otimes \nabla _{\boldsymbol{x}} \otimes \nabla _{\boldsymbol{x}} ) F \, \right|_{\boldsymbol{x} = 0} ,\end{aligned}$ (44)

$\begin{matrix} b_{n} & = {\nabla_{x}^{\otimes n} F |}_{x = 0} . \end{matrix}$ $\begin{aligned} \mathsf{b }_n&= \left. \nabla _{\boldsymbol{x}}^{\otimes n} F \, \right|_{\boldsymbol{x} = 0}. \end{aligned}$ (45)

Here, ⊗ designates an outer product, the power notation in the last line designates a repeated outer product, b₁ is a vector, b₂ is a symmetric rank two matrix, b₃ is a symmetric rank three tensor, and so on. Furthermore, ∇_x denotes a gradient with respect to the chosen variables, for example, ∇_x = (∂/∂δ, ∂/∂L)^T.

It is then straightforward to show

$\begin{matrix} b_{n} & = {(- 1)}^{n} {〈 \frac{\nabla_{x}^{\otimes n} p}{p} 〉}_{g}, \end{matrix}$ $\begin{aligned} \mathsf{b }_n&= (-1)^n \left\langle { \frac{\nabla _{\boldsymbol{x}}^{\otimes n} p}{p} } \right\rangle _{\mathrm{g} } , \end{aligned}$ (46)

which at the first two orders, can be expressed as:

$\begin{matrix} b_{1} & = {〈 C^{- 1} x 〉}_{g}, \end{matrix}$ $\begin{aligned} \boldsymbol{b}_1&= \left\langle {\mathsf{C }^{-1} \boldsymbol{x} } \right\rangle _{\mathrm{g} } ,\end{aligned}$ (47)

$\begin{matrix} b_{2} & = {〈 C^{- 1} (x \otimes x - C) C^{- 1} 〉}_{g} . \end{matrix}$ $\begin{aligned} \mathsf{b }_2&= \left\langle {\mathsf{C }^{-1} (\boldsymbol{x} \otimes \boldsymbol{x} - \mathsf{C }) \mathsf{C }^{-1} } \right\rangle _{\mathrm{g} } . \end{aligned}$ (48)

Importantly, these expressions do not lead to the same bias estimators for the density if a second variable is present that is correlated with the density. For example, if we use the density and Laplacian as variables, we find

$\begin{matrix} b_{δ, so 2} & = {〈 \frac{δ σ_{2}^{2} + L σ_{1}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\delta , \mathrm{so2} }&= \left\langle {\frac{\delta \sigma _{2}^{2} + L \sigma _{1}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (49)

$\begin{matrix} b_{L, so 2} & = {〈 \frac{L σ_{0}^{2} + δ σ_{1}^{2}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{L, \mathrm{so2} }&= \left\langle {\frac{L \sigma _{0}^{2} + \delta \sigma _{1}^{2}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (50)

for the components of b₁ and

$\begin{matrix} b_{δ δ, so 2} & = {〈 \frac{{(δ σ_{2}^{2} + L σ_{1}^{2})}^{2} - σ_{2}^{2} (σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4})}{{(σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4})}^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\delta \delta , \mathrm{so2} }&= \left\langle {\frac{\left(\delta \sigma _{2}^{2} + L \sigma _{1}^{2} \right)^{2} - \sigma _{2}^{2} \left(\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}\right)}{\left(\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}\right)^{2}}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (51)

$\begin{matrix} b_{L L, so 2} & = {〈 \frac{σ_{0}^{2} (- σ_{0}^{2} σ_{2}^{2} + σ_{1}^{4}) + {(L σ_{0}^{2} + δ σ_{1}^{2})}^{2}}{{(σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4})}^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{LL, \mathrm{so2} }&= \left\langle {\frac{\sigma _{0}^{2} \left(- \sigma _{0}^{2} \sigma _{2}^{2} + \sigma _{1}^{4}\right) + \left(L \sigma _{0}^{2} + \delta \sigma _{1}^{2}\right)^{2}}{\left(\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}\right)^{2}}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (52)

$\begin{matrix} b_{δ L, so 2} & = {〈 \frac{(L σ_{0}^{2} + δ σ_{1}^{2}) (L σ_{1}^{2} + δ σ_{2}^{2}) - σ_{1}^{2} (σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4})}{{(σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4})}^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\delta L, \mathrm{so2} }&= \left\langle {\frac{\left(L \sigma _{0}^{2} + \delta \sigma _{1}^{2}\right) \left(L \sigma _{1}^{2} + \delta \sigma _{2}^{2}\right) - \sigma _{1}^{2} \left(\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}\right)}{\left(\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}\right)^{2}}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (53)

for the components of b₂, where we have written, for instance, b_δδ as a symbol for what was previously referred to as b₂. Furthermore, we note that a general form for the Nth density bias parameter is given by

$\begin{matrix} b_{δ^{N}, so 2} & = {〈 \frac{H_{N} ((δ + L σ_{1}^{2} / σ_{2}^{2}) / σ_{*})}{σ_{*}^{N}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\delta ^N, \mathrm{so2} }&= \left\langle {\frac{H_N((\delta + L \sigma _{1}^{2} / \sigma _{2}^{2})/\sigma _{*})}{\sigma _{*}^N}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (54)

with $σ_{*}^{2} = σ_{0}^{2} - σ_{1}^{4} / σ_{2}^{2}$ $\sigma_{*}^2 = \sigma_{0}^{2} - \sigma_{1}^{4} / \sigma_{2}^{2}$ . We note that these relations have already been derived through the response of density peaks to large scale perturbations (Bardeen et al. 1986; Mo & White 1996; Desjacques et al. 2010) and were later been shown to apply also to more general tracers (Lazeyras et al. 2016).

The difference between Eqs. (49)–(54) and the estimators from Eqs. (15)–(18) is that they correspond to partial derivatives of the density at fixed values of the Laplacian. The previous estimators are the derivatives of the projected distribution, which is not the same due to the correlation between density and Laplacian. As we see in Sect. 3.3, the new estimators are less scale-dependent, since they are closer to pure partial derivatives with respect to the density (in the separate-universe sense).

We refer to the estimators from Equations (49)–(54) as the estimators with spatial corrections of the second order, whereas the previous estimators are without any spatial corrections (i.e. at the spatial order of 0). It is straightforward to obtain higher order spatial corrections, for instance, by considering the covariance matrix of the distribution of x = (δ, L, P)^T with P as in Equation (38). For example, the estimators of density bias parameters at a spatial order of 4 can be expressed as

$\begin{matrix} b_{δ^{N}} & = {〈 \frac{H_{N} (δ_{* 4} / σ_{* 4})}{σ_{* 4}^{N}} 〉}_{g}, \\ δ_{* 4} & = δ + L \frac{(σ_{1}^{2} σ_{4}^{2} - σ_{2}^{2} σ_{3}^{2})}{σ_{2}^{2} σ_{4}^{2} - σ_{3}^{4}} + P \frac{(σ_{1}^{2} σ_{3}^{2} - σ_{2}^{4})}{σ_{2}^{2} σ_{4}^{2} - σ_{3}^{4}}, \\ σ_{* 4} & = \frac{σ_{0}^{2} σ_{2}^{2} σ_{4}^{2} - σ_{0}^{2} σ_{3}^{4} - σ_{1}^{4} σ_{4}^{2} + 2 σ_{1}^{2} σ_{2}^{2} σ_{3}^{2} - σ_{2}^{6}}{σ_{2}^{2} σ_{4}^{2} - σ_{3}^{4}}, \end{matrix}$ $\begin{aligned} b_{\delta ^N}&= \left\langle {\frac{H_N(\delta _{*4}/\sigma _{*4})}{\sigma _{*4}^N}} \right\rangle _{\mathrm{g} } ,\\ \delta _{*4}&= \delta + L \frac{\left(\sigma _{1}^{2} \sigma _{4}^{2} - \sigma _{2}^{2} \sigma _{3}^{2}\right)}{\sigma _{2}^{2} \sigma _{4}^{2} - \sigma _{3}^{4}} + P \frac{\left(\sigma _{1}^{2} \sigma _{3}^{2} - \sigma _{2}^{4}\right)}{\sigma _{2}^{2} \sigma _{4}^{2} - \sigma _{3}^{4}}, \nonumber \\ \sigma _{*4}&= \frac{\sigma _{0}^{2} \sigma _{2}^{2} \sigma _{4}^{2} - \sigma _{0}^{2} \sigma _{3}^{4} - \sigma _{1}^{4} \sigma _{4}^{2} + 2 \sigma _{1}^{2} \sigma _{2}^{2} \sigma _{3}^{2} - \sigma _{2}^{6}}{\sigma _{2}^{2} \sigma _{4}^{2} - \sigma _{3}^{4}}, \nonumber \end{aligned}$ (55)

where σ₃² = −⟨δ⋅P⟩ and σ₄² = ⟨P²⟩. In practice, it is easier to evaluate such high-order estimators numerically, rather than deriving explicit expressions for them. The covariance matrix may easily be measured from a given linear density field and then used to evaluate the estimators as e.g. in Equation (48), so that in principle it is not very difficult to obtain corrections at any spatial order. Throughout this article, we mainly focus on the estimators of a spatial order of 2, since they are already sufficiently accurate to obtain good measurements of biases, but we have selectively included both lower or higher order estimators throughout this article to demonstrate the convergence.

We note that the estimators with second-order spatial corrections are quite similar in spirit to the considerations by Musso et al. (2012), Paranjape & Sheth (2012), Paranjape et al. (2013b,a) to map scale-dependent measurements of biases as in Equation (14) under the assumption of excursion set (peaks) models to scale-independent large-scale parameters. However, here we can see that such a mapping to the large-scale limit can be done without the assumption of Peak or excursion set models and for spatial corrections of any order. Consider for example that the relation between the zeroth-order spatial estimator and second-order spatial estimator of b₁ is given by

$\begin{matrix} {〈 δ 〉}_{g} = C b_{1} & = σ_{0}^{2} b_{1, so 0} \\ = σ_{0}^{2} b_{1, so 2} - σ_{1}^{2} b_{L, so 2} . \end{matrix}$ $\begin{aligned} \left\langle {\delta } \right\rangle _{\mathrm{g} } = \mathsf{C } \boldsymbol{b}_1&= \sigma _0^2 b_{1, \mathrm{so} 0} \nonumber \\&= \sigma _0^2 b_{1, \mathrm{so} 2} - \sigma _1^2 b_{L, \mathrm{so} 2} . \nonumber \end{aligned}$

Therefore, b_L, so2 predicts the scale dependence of b_1, so0 and b_1, so2 posits a more scale-independent estimate:

$\begin{matrix} b_{1, so 0} & = b_{1, so 2} - \frac{σ_{1}^{2}}{σ_{0}^{2}} b_{L, so 2} . \end{matrix}$ $\begin{aligned} b_{1,\mathrm{so} 0}&= b_{1,\mathrm{so} 2} - \frac{\sigma _1^2}{\sigma _0^2} b_{L,\mathrm{so} 2} \,\, . \end{aligned}$ (56)

We note that this argument can easily be pushed to higher spatial orders:

$\begin{matrix} b_{1, so 0} & = b_{1, so 4} - \frac{σ_{1}^{2}}{σ_{0}^{2}} b_{L, so 4} + \frac{σ_{2}^{2}}{σ_{0}^{2}} b_{P, so 4} . \end{matrix}$ $\begin{aligned} b_{1,\mathrm{so} 0}&= b_{1,\mathrm{so} 4} - \frac{\sigma _1^2}{\sigma _0^2} b_{L,\mathrm{so} 4} + \frac{\sigma _2^2}{\sigma _0^2} b_{P,\mathrm{so} 4} . \end{aligned}$ (57)

The additional cost of our approach (e.g. in comparison to Paranjape et al. 2013a) is that, beneath the density, also the Laplacian has to be evaluated in Lagrangian space, but the benefits are model-independence and a much reduced mathematical complexity.

2.7. Multivariate cumulants

Analogously to the monovariate case, we define multivariate cumulant bias parameters as

$\begin{matrix} β_{N} = {\nabla_{x}^{N} log (F) |}_{x = 0}, \end{matrix}$ $\begin{aligned} \boldsymbol{\beta }_N = \left. \nabla _{\boldsymbol{x}}^N \log (F) \right|_{\boldsymbol{x} = 0}, \end{aligned}$ (58)

which immediately leads to

$\begin{matrix} β_{1} & = b_{1}, \end{matrix}$ $\begin{aligned} \boldsymbol{\beta }_1&= \boldsymbol{b}_1 ,\end{aligned}$ (59)

$\begin{matrix} β_{2} & = b_{2} - b_{1} \otimes b_{1}, \end{matrix}$ $\begin{aligned} \boldsymbol{\beta }_2&= \mathsf{b }_2 - \boldsymbol{b}_1 \otimes \boldsymbol{b}_1 ,\end{aligned}$ (60)

$\begin{matrix} β_{3, i j k} & = b_{3, i j k} - (b_{1, i} b_{2, j k} + b_{1, j} b_{2, k i} + b_{1, k} b_{2, i j}) + 2 b_{1, i} b_{1, j} b_{1, k}, \end{matrix}$ $\begin{aligned} \beta _{3, ijk}&= b_{3,ijk} - (b_{1,i} b_{2,jk} + b_{1,j} b_{2,ki} + b_{1,k} b_{2,ij}) + 2 b_{1,i} b_{1,j} b_{1,k}, \end{aligned}$ (61)

where we gave β₃ in index notation, since the central term is difficult to express in vectorial notation. We note that this leads to the same relations between the density cumulant biases and their canonical biases as in Equations (27)–(30), but it also includes additional relations, such as:

$\begin{matrix} β_{δ δ L} & = b_{δ δ L} - b_{δ δ} b_{L} - 2 b_{δ L} b_{δ} + 2 b_{δ}^{2} b_{L} . \end{matrix}$ $\begin{aligned} \beta _{\delta \delta L}&= b_{\delta \delta L} - b_{\delta \delta } b_{L} - 2 b_{\delta L} b_{\delta } +2 b_{\delta }^2 b_L . \end{aligned}$ (62)

Finally, the relation between the separate-universe bias function and the moment and cumulant generating functions is analagous to the expression in Equations (21) and (23), given by

$\begin{matrix} M (t) & = exp (\frac{1}{2} t^{T} C t) F (x_{0} = C t), \end{matrix}$ $\begin{aligned} M(\boldsymbol{t})&= \exp \left(\frac{1}{2} \boldsymbol{t}^T \mathsf C \boldsymbol{t} \right) F(\boldsymbol{x}_0 = \mathsf C \boldsymbol{t}) ,\end{aligned}$ (63)

$\begin{matrix} K (t) & = \frac{1}{2} t^{T} C t + log (F (C t)), \end{matrix}$ $\begin{aligned} K(\boldsymbol{t})&= \frac{1}{2} \boldsymbol{t}^T \mathsf C \boldsymbol{t} + \log \left( F(\mathsf C \boldsymbol{t}) \right), \end{aligned}$ (64)

which can be differentiated to show that the cumulants are given in an easy form through the cumulant bias parameters, for instance,

$\begin{matrix} κ_{1} & = C β_{1}, \end{matrix}$ $\begin{aligned} \boldsymbol{\kappa }_1&= \mathsf C \boldsymbol{\beta }_1 ,\end{aligned}$ (65)

$\begin{matrix} κ_{2} & = C β_{2} C + C, \end{matrix}$ $\begin{aligned} \boldsymbol{\kappa }_2&= \mathsf C \boldsymbol{\beta }_2 \mathsf C + \mathsf C ,\end{aligned}$ (66)

$\begin{matrix} κ_{3, i j k} & = \sum_{abc} C_{ai} C_{bj} C_{ck} β_{abc} . \end{matrix}$ $\begin{aligned} \kappa _{3,ijk}&= \sum _{abc} C_{ai} C_{bj} C_{ck} \beta _{abc} . \end{aligned}$ (67)

However, instead of measuring directly the cumulants of the galaxy environment distribution and inverting Equations (65)–(67), it is in practice simpler to instead define the variable,

$\begin{matrix} u & = C^{- 1} x . \end{matrix}$ $\begin{aligned} \boldsymbol{u}&= \mathsf{C }^{-1} \boldsymbol{x}. \end{aligned}$ (68)

We then measure its cumulants, κ_u, which relate to the cumulant biases as

$\begin{matrix} β_{i j k \dots} & = {\begin{matrix} κ_{u, i j k \dots} & i f i + j + k + \dots \neq 2, \\ κ_{u, i j k \dots} - C_{ab}^{- 1} & i f i + j + k + \dots = 2, \end{matrix} \end{matrix}$ $\begin{aligned} \boldsymbol{\beta }_{ijk\ldots }&= {\left\{ \begin{array}{ll} \kappa _{u,ijk\ldots }&\mathrm \quad if \quad i + j + k + \cdots \ne 2, \\ \kappa _{u,ijk\ldots } - C^{-1}_{ab}&\mathrm \quad if \quad i + j + k + \cdots = 2, \end{array}\right.} \end{aligned}$ (69)

where a and b indicate the indices of the non-zero variables in the second-order case.

We show in Appendix A that it is also possible to derive similar estimators if the density field was filtered with a function different than a sharp k-space filter. The only additional complication in that case that it is necessary to account for the correlation matrix between the smoothed large scales and unsmoothed small scales. However, for simplicity, we focus only on measurements with the sharp k-space filter in this work.

3. Density bias measurements

In this section, we explain our evaluation of the Lagrangian density bias parameters for different sets of halos. We want to verify the consistency of our estimators by comparing them to the vast literature available on the subject. Furthermore, we also compare the canonical bias parameters, b_n, with the cumulant bias parameters, β_n.

3.1. Simulation

For the analysis through this paper, we used a single cosmological box simulation with high resolution. This simulation is part of the ‘BACCO simulation project’ that was first introduced in Angulo et al. (2021). It has a box size of L = 1440h⁻¹Mpc with 4320³ particles leading to a mass resolution of m_p = 3.2 × 10⁹ h⁻¹ M_⊙. The cosmological parameters are Ω_m = 0.307, Ω_Λ = 0.693, Ω_b = 0.048., n_s = 0.9611, σ₈ = 0.9, h = 0.677 which are similar to the Planck Collaboration VI (2020) cosmology except for the roughly 10% larger value of σ₈. By default we use this simulation at a scale-factor of a = 1.08, corresponding approximately to a = 1.

To identify halos, the simulation code uses a modified version of SUBFIND (Springel et al. 2001), which first identifies halos via a friends of friends (FoF) algorithm and subsequently calculates for each FoF group, the mass, M_200b, in a region that encloses 200 times the mean density of the Universe.

3.2. Bias measurements

We defined sets of biased tracers by considering halos selected by their mass. For this, we considered 21 equally log-spaced halo masses between 10¹² h⁻¹ M_⊙ and 10¹⁶ h⁻¹ M_⊙ and for each halo mass, we considered all halos with masses in the range of

$\begin{matrix} M_{200 b} \in [M_{i} / 1.25, M_{i} \cdot 1.25] . \end{matrix}$ $\begin{aligned} M_{200b} \in [M_i / 1.25, M_i \cdot 1.25 ]. \,\, \end{aligned}$ (70)

To evaluate our Lagrangian bias estimators, we only need to know the linear field evaluated at the Lagrangian locations of the tracers. We approximated the Lagrangian location of each halo through the Lagrangian coordinate of its most bound particle. Since the simulation started from a Lagrangian grid, the Lagrangian origin of the most bound particle can easily be inferred from its id i_mb as

$\begin{matrix} q_{mb} & = \frac{L}{N_{grid}} (\begin{matrix} i_{x} \\ i_{y} \\ i_{z} \end{matrix}), \\ i_{mb} & = i_{x} N_{grid}^{2} + i_{y} N_{grid} + i_{z}, \end{matrix}$ $\begin{aligned} \boldsymbol{q}_{\mathrm{mb} }&= \frac{L}{N_{\mathrm{grid} }} \begin{pmatrix} i_x \\ i_y \\ i_z \end{pmatrix} ,\\ i_{\mathrm{mb} }&= i_x N_{\mathrm{grid} }^2 + i_y N_{\mathrm{grid} } + i_z, \end{aligned}$ (71)

where N_grid = 4320 is the number of particles per dimension.

We know the linear density field of the simulation through the initial condition generator. To save on the computation time, we can create a low resolution grid representation of the linear density field with N_lin³ grid points. For fields different than the linear density field, we additionally multiply by the correct operator (e.g. −k² for the Laplacian) in Fourier space. We created a smoothed version of this field by multiplying with a sharp k−filter in Fourier space

$\begin{matrix} δ_{k} & = δ_{lin, k} \cdot Θ (k_{d} - k), \end{matrix}$ $\begin{aligned} \delta _k&= \delta _{\rm {lin},k} \cdot \Theta (k_{\mathrm{d} } - k), \end{aligned}$ (72)

where Θ is the heavy-side function and we tested, for each measurement, different damping scales k_d ∈ [0.1, 0.15, 0.2, 0.25, 0.3] h⁻¹ Mpc. We then deconvolved this field with a linear interpolation kernel and interpolated it to the Lagrangian locations of our tracer set. We chose an N_lin value that is sufficiently large that the resulting interpolated values are virtually independent of this discretisation; for instance, N_lin = 183 at k_d = 0.1 h⁻¹ Mpc and N_lin = 549 at k_d = 0.3 h⁻¹ Mpc.

With the linear densities at the Lagrangian locations of tracers, it is easy to evaluate the zeroth-order spatial estimators of the biases by averaging as in Equation (14). However, for higher spatial order estimators we also need to know the values of the Laplacian (Eqs. (49)–(53)) or that of the fourth derivative (Equation (55)). The Laplacian is inferred by multiplying the linear density field in Fourier space by −k² and then interpolated to the tracer locations. Other variables such as the fourth derivatives can be evaluated in a similar manner. This allows us to evaluate any of the bias estimators (from Sect. 2) directly as simple expectation values over tracers. We note that this is a major difference compared to the measurement process in Paranjape et al. (2013a), where the zeroth-order spatial estimators, as those in Equation (14), were evaluated, but then mapped to scale-independent parameters through peak theory arguments.

To estimate the covariance of a set of measured bias parameters we use a Jackknife technique. For this we divide the box in Lagrangian space into N_jk³ subboxes with N_jk = 4. We perform 64 measurements of the vector of bias parameters b_i by subsequently leaving out all tracers in one of the subboxes. Then we estimate the covariance through

$\begin{matrix} C_{b} & = \frac{1}{N_{jk}^{3} - 1} \sum_{i} (b_{i} - b_{0}) \otimes (b_{i} - b_{0}), \end{matrix}$ $\begin{aligned} \mathsf{C }_{\boldsymbol{b}}&= \frac{1}{N_{\mathrm{jk} }^3 -1} \sum _i (\boldsymbol{b}_i - \boldsymbol{b}_0) \otimes (\boldsymbol{b}_i - \boldsymbol{b}_0) ,\end{aligned}$ (73)

$\begin{matrix} b_{0} & = \frac{1}{N_{jk}^{3}} \sum_{i} b_{i} . \end{matrix}$ $\begin{aligned} \boldsymbol{b}_0&= \frac{1}{N_{\mathrm{jk} }^3} \sum _i \boldsymbol{b}_i . \end{aligned}$ (74)

It is worth noting that the Jackknife estimator results in a more reliable estimate of the uncertainty of the measurement than, for instance, a simple bootstrap would yield. When comparing these estimators we found that the Jackknife gives larger (more conservative) error estimates, since it also accounts for the uncertainty induced by cosmic variance, which is sampled by leaving out spatially correlated parts of the data sets.

3.3. b₁ and the scale dependence of estimators

In Figure 2, we show the measurements of b₁ for halos as a function of M_200b using the estimator from Equation (15) without spatial corrections (top panel) versus the estimator from Equation (49) with spatial corrections at the second order (bottom panel). The dashed lines indicate the fitting function from Tinker et al. (2010) evaluated for the cosmology of our simulation.

Fig. 2.

b₁ as a function of halo mass using the estimators from Equation (15) at the top and Equation (49) at the bottom, measured at different damping scales (different coloured regions). The shaded regions indicate the 1σ certainty region of the estimators. Using the b₁ estimator that includes the Laplacian correction increases the uncertainty of the b₁ estimates, but reduces the dependence on the damping scale, leading to a good agreement across different scales.

As expected, halos of large masses M ∼ 10¹⁵ h⁻¹ M_⊙ are highly biased b₁ ∼ 3, whereas low-mass halos (M ∼ 10¹² h⁻¹ M_⊙) have an even slightly negative (Lagrangian) bias. The zeroth-order spatial estimators have a very small degree of statistical uncertainty, but exhibit a significant scale dependence and are inconsistent across different damping scales. On the other hand, the estimators with a spatial order of 2 have a larger statistical uncertainty, but seem consistent across different damping scales – except at very large k_d and large halo masses; in this case, some scale dependence becomes visible, probably indicating the effect of higher spatial-order terms.

Our second-order spatial measurements agree at all scales well with the fitting function from Tinker et al. (2010). While they lie systematically about 5% below this fit, this is within the quoted uncertainty of the Tinker et al. (2010) fit and a similar difference can be seen in Lazeyras et al. (2016). In comparison to other studies at z = 0 (e.g. Lazeyras et al. 2016), we have lower bias values at the same masses (e.g. b₁ ∼ 0.5 at 10¹⁴ h⁻¹ M_⊙ instead of b₁ ∼ 1) because of the later time (z = −0.08) and relatively high variance (σ₈ = 0.9) of our simulation.

To further highlight the difference in the scale dependence of the estimators, we show in Figure 3 the error Δb₁ in b₁ measurements as a function of damping scale for a few different halo masses. Here, we phrase the error Δb₁ = b₁ − b_1, ref relative to the second-order spatial estimator at k_d = 0.2 h Mpc⁻¹ as b_1, ref. As expected, the zeroth-order estimator seems to approach the selected reference value on large scales. However, the zeroth-order second-order spatial estimator exhibits a significant scale dependence, whereas the second-order spatial estimators are almost scale-independent, except for the largest one at k_d ≳ 0.25 h Mpc⁻¹. We also show measurements with the fourth-order spatial estimator from Equation (55), which are scale-independent to even smaller scales, showing that the measurements converge well with adding higher order spatial corrections. However, for getting a good estimate of the bias parameters it seems sufficient to use the second-order spatial estimator at scales of k_d ≲ 0.2 h Mpc⁻¹, which bears a similar accuracy in measurement, for instance, akin to evaluating the fourth-order spatial estimator at k_d ∼ 0.3 h Mpc⁻¹.

Fig. 3.

Scale dependence of the error in b₁ estimates for different halo mass selections. The error is expressed relatively to the measurement with spatial order of 2 at $k_{d} = 0.2 h {Mpc}^{- 1}$ $k_\mathrm{{d}} = 0.2\, h\, \mathrm{Mpc}^{-1}$ . On large scales (small k_d) the zero-spatial-order estimators from Equation (15) converge well to this estimate, whereas the spatial order of 2 estimators from Equation (49) agree well up to k_d ≳ 0.25 h Mpc⁻¹ where slight disagreements arise. The spatial order of 4 estimates remain scale-independent even beyond this scale.

We can estimate the scale dependence of the zeroth-order spatial estimator, as shown in Equations (56) and (57). We mark the estimated scale dependence of the zeroth-order spatial estimators using the second-order spatial bias parameters measured at k_d = 0.2 h Mpc⁻¹ as dashed lines in Figure 3 and using the fourth-order spatial parameters from k_d = 0.3 h Mpc⁻¹ as dotted lines. The scale dependence of the zeroth-order spatial estimator is predicted quite well, showing that the consideration of the Laplacian introduces a correction of the order of σ₁²/σ₀², which scales approximately as k_d² and including of the fourth derivatives provides additional corrections that become relevant on even smaller scales.

Our measurements therefore confirm the results from previous studies that including the Laplacian is vital to recover scale-independent bias on large scales (Paranjape et al. 2013a; Lazeyras et al. 2016). Furthermore, if the modelling is pushed to small scales k_d ≳ 0.25h the inclusion of additional higher spatial derivative terms may prove beneficial.

3.4. b₂ versus β₂

The first cumulant bias parameter that is different from the canonical bias parameter of the same order is β₂ = b₂ − b₁² where b₁ = β₁. Since there is a one-to-one relation between (b₁, b₂) and (β₁, β₂) it might seem that there should not be any advantage by using β₂ over b₂ as a parameter when fitting datasets. However, here we show that β₂ seems to be more independent of β₁ than b₂ of b₁, especially when their covariance matrix is considered.

In Figure 4, we show the measured co-evolution relation between b₂ and b₁ (left) versus the co-evolution between β₂ and β₁ (right) for the second-order spatial estimators. In comparison, we also show the co-evolution relations measured by Lazeyras et al. (2016) as a dashed line, which seems to match our measured relation well up to k_d ≲ 0.15 − 0.2 h Mpc⁻¹, showing that our method for measuring the bias parameters is indeed able to recover the correct large-scale limit of the bias parameters.

Fig. 4.

b₂ and β₂ as a function of b₁ = β₁ using the second-order spatial estimators. The black dashed line is the b₂(b₁) relation inferred by Lazeyras et al. (2016) from separate-universe simulations. The second-order spatial estimators seem consistent with the literature co-evolution relation down to damping scales of k_d ∼ 0.15 − 0.2 h Mpc⁻¹. It is noteworthy that β₂ < 0 appears to always hold, which means that the width of the galaxy environment distribution p(δ|g) is always smaller than that of the background p(δ). Therefore, the cumulant bias parameter β₂ − β₁ relation appears slightly simpler than the b₂ − b₁ relation.

We note that for halos we find β₂ < 0 across all masses and with high statistical significance. This means that the width of the halo environment distribution p(δ|g) is smaller than the width of the background distribution p(δ) – showing that halo formation is more selective than a random distribution. Possibly, the assumption β₂ < 0 could be used to limit the considered prior range when fitting certain galaxy surveys. However, it is difficult to anticipate whether this should hold for every possible set of galaxies.

Furthermore, we note that the co-evolution relation between β₂ and β₁ is monotonic and roughly linear. For b₁ ≳ 2, it satisfies |β₂|< |b₂|. Therefore, β₂ versus β₁ seems slightly simpler than the b₂ to b₁ relation.

The benefit of using β₂ as a parameter is even more evident when considering the covariance matrix of the measurements. In Figure 5 we show the uncertainty of the b₂ measurements (solid lines), versus the uncertainty of the β₂ measurements. Comparing different damping scales shows that the inclusion of smaller scales in the measurements decreases the statistical uncertainty (but increases systematic error). In all cases and especially at high values of b₁ the uncertainty of β₂ is significantly smaller than that of b₂. This is so, since b₂ and b₁ are more correlated than β₂ and β₁.

Fig. 5.

Uncertainty of b₂ measurements (solid lines) and β₂ measurements (dashed lines) as a function of b₁ for different damping scales. The uncertainty of β₂ is significantly smaller than of b₂ for high values of b₁.

To highlight this we show in Figure 6 the correlation coefficient

Fig. 6.

Correlation coefficient α₁₂ between the b₁ and b₂ measurement (solid) and the β₁ and β₂ measurement (dashed). Note: β₂ as a parameter is much more independent of the value of b₁ than b₂.

$\begin{matrix} α_{12} & : = \frac{C_{12}}{\sqrt{C_{11} C_{22}}}, \end{matrix}$ $\begin{aligned} \alpha _{12}&:= \frac{C_{12}}{\sqrt{C_{11} C_{22}}}, \end{aligned}$ (75)

where C is the covariance matrix between b₁ and b₂ or β₁ and β₂. For b₁ ≫ 0 the correlation coefficient of (b₁, b₂) is quite large, even close to 1 in some cases. On the other hand, the correlation coefficient of (β₁, β₂) is quite small and seems on average consistent with 0. Therefore, measurements of b₁ are quite entangled with b₂, but not so much with β₂. While we do not show it here, we have found that this is also the case for higher order correlations, for instance, those of (b₁, b₃) versus (β₁, β₃).

This can be understood when considering the difference between moments and cumulants of a probability distribution. For instance, if we knew with respect to a certain distribution p(x) that it has a large first moment ⟨x⟩, then we might also expect that the second moment ⟨x²⟩ is large. On the other hand, knowledge about the mean of a distribution, is very uninformative about its width – that is its second cumulant σ_x² = ⟨(x−⟨x⟩)²⟩ = ⟨x²⟩ − ⟨x⟩². In the same sense, we would expect the cumulant bias parameters to be more independent of each other than canonical bias parameters.

3.5. Higher order biases

While the second-order cumulant bias β₂ has already slight advantages over b₂ as highlighted in the last section, the benefits are even more significant at higher orders. Here, we compare the behaviour of b₃ and b₄ with the behaviour of β₃ and β₄.

In Figure 7, we show b₃, b₄, β₃, and β₄ as a function of b₁ for the second-order spatial estimator. The uncertainty of the measurements of b₃ and b₄ are much larger than for b₂ so that we hardly find any signal at k_d ∼ 0.1 h Mpc⁻¹. However, at k_d ∼ 0.15 h Mpc⁻¹ we can find a meaningful signal and it is consistent with the b₃ measurements of Lazeyras et al. (2016).

Fig. 7.

Co-evolution relations of higher order bias parameters b₃ and β₃ (top) and b₄ and β₄ (bottom) for the second-order spatial bias estimators. For b₄ we indicate as a dashed line a prediction that follows from combining the Lazeyras et al. (2016) measurements of b₁, b₂ and b₃ with Equation (30) when using β₄ = 0. Strikingly, β₃ and β₄ are extremely close to 0 – independently of the value of b₁.

Strikingly, the value of β₃ is extremely small in comparison to b₃ and seems approximately consistent with 0. It is noteworthy that this is not only the case for scales where b₃ is reasonably scale independent (k_d ≲ 0.15 h Mpc⁻¹), but it is also so for much smaller scales. There does not seem to be any noteworthy relation between β₃ and β₁. Furthermore, the co-evolution relation measured by Lazeyras et al. (2016) seems consistent with β₃ = 0. Therefore, we could summarise the third-order co-evolution relation through the relation we get by using β₃ = 0 in Equation (29):

$\begin{matrix} b_{3} & = 3 b_{1} b_{2} - 2 b_{1}^{3} . \end{matrix}$ $\begin{aligned} b_3&= 3 b_1 b_2 - 2 b_1^3 . \end{aligned}$ (76)

Similarly, we find for β₄ an apparent independence of β₁ and consistency with zero across different scales. Assuming β₄ = 0 in Equation (30) leads to the co-evolution relation between b₄, b₃, b₂, and b₁ given by

$\begin{matrix} b_{4} & = 4 b_{1} b_{3} + 3 b_{2}^{2} - 12 b_{1}^{2} b_{2} - 6 b_{1}^{4} . \end{matrix}$ $\begin{aligned} b_4&= 4 b_1 b_3 + 3 b_2^2 - 12 b_1^2 b_2 - 6 b_1^4. \end{aligned}$ (77)

We mark this relation with the measurements of b₁, b₂, and b₃ from Lazeyras et al. (2016) as a dashed line in the bottom left panel of Figure 7. This seems in good agreement with the measured b₄.

We conclude that at an order of n ≥ 3, the cumulant bias parameters are very close to 0. The co-evolution relations at these orders can be summarised simply through β_n = 0. We might therefore suggest that high-order co-evolution relations do not represent any physical insights, but rather highlight that the canonical bias parameters form a suboptimal, deeply entangled basis.

Further, the fact that all the cumulant biases beyond an order of 2 are close to zero indicates that the galaxy environment distribution is very well approximated by a Gaussian distribution. We plan to investigate the possibility of a Gaussian bias model in Stücker et al. (2025), where we also explain how this fact may arise naturally from the Gaussianity of the background distribution.

3.6. Laplacian bias

In Figure 8, we show the Lagrangian Laplacian bias parameter as a function of halo mass, as inferred from the estimator in Equation (50). The Laplacian measurements have a much stronger scale dependence than our previous measurements of density bias parameters. However, it seems that at masses M ≲ 3 × 10¹³ h⁻¹ M_⊙ the scale dependence disappears and a reliable measurement is obtained.

Fig. 8.

Lagrangian Laplacian bias b_L as a function of halo mass. At masses M ≲ 3 × 10¹³ h⁻¹ M_⊙ the scale dependence of the b_L measurements disappears and they agree well with the fits of the Eulerian Laplacian bias from Lazeyras & Schmidt (2019).

We compare our Lagrangian measurements with the Eulerian measurements of Lazeyras & Schmidt (2019). In general, the relation between the Lagrangian and Eulerian Laplacian bias depends on velocity bias (e.g. Desjacques et al. 2018). The velocity bias b_s quantifies the difference between the displacement field of matter s and galaxies s_g and gets as first-order contributions of the form

$\begin{matrix} s_{g} - s \approx b_{s} \nabla δ . \end{matrix}$ $\begin{aligned} \boldsymbol{s}_{\mathrm{g} } - \boldsymbol{s} \approx b_{s} \nabla \delta . \end{aligned}$ (78)

Therefore, the galaxy density can get an extra contribution proportional to the divergence of this field, that scales as L = ∇²δ. However, for our set of tracers s_g = s by definition, so that b_s = 0 and we simply assume that their Lagrangian and Eulerian Laplacian biases are identical or at least very similar.

In the reliable range of M ≲ 3 × 10¹³ h⁻¹ M_⊙, it seems that our measurements agree well with the linear and the quadratic (in halo radius) fit that was inferred by Lazeyras & Schmidt (2019) for the Eulerian Laplacian bias. Therefore, we confirm that the Laplacian bias is negative for halos and that it can plausibly have amplitudes of the order of b_L ∼ −20 Mpc² h⁻². However, we leave more detailed considerations to future studies. In principle, more accurate measurements could be obtained by consideration of higher order corrections and through measurements at higher redshifts.

3.7. The scale-split break-down scale

The core assumption of the bias formalism is the separation of scales: the formation of galaxies and halos is assumed to depend only on the properties of the Lagrangian environment on some small length scale. Larger scale perturbations are only relevant as far as they determine the distribution of small scale environments. We refer to this as the ‘scale-split’ assumption.

This assumption makes it possible to define scale-independent bias parameters. For example, b₁ describes how the likelihood of forming a galaxy responds when changing only the linear density contrast δ of the relevant smaller scale environment, while keeping other aspects (e.g. L) constant. As all larger scale density perturbations affect this aspect in the same way b₁ is independent of scale.

Physically, the scale-split assumption has to become invalid for sufficiently small smoothing scales. For example, halo formation may respond differently to perturbations that are smaller than their Lagrangian radius, than to those which are larger.

Here, we want to show that the scale-split assumption also becomes mathematically inconsistent beyond some scale for a given set of variables: Recall that we have shown in Equation (25), how κ₂, the variance of the galaxy environment distribution, changes with scale. The predicted variance is only well defined, if

$\begin{matrix} σ^{2} \leq σ_{\max, δ}^{2} & = \frac{1}{- β_{2}} . \end{matrix}$ $\begin{aligned} \sigma ^2 \le \sigma _{\mathrm{max} , \delta }^2&= \frac{1}{-\beta _2} . \end{aligned}$ (79)

This becomes zero if σ = σ_max, δ. Mathematically, at this scale, the environment distribution and the bias function have to become Dirac-delta functions. We may understand σ_max as the scale where (formally) all information that is relevant for galaxy formation has been accounted for and the biasing becomes deterministic in density.

Beyond this scale, the PBS predicts a galaxy environment distribution with negative variance – making any density-only bias model mathematically inconsistent. It may seem as if it was possible to set up expansion bias models for arbitrary high σ, but this is only so, since the negativity of the bias function makes it formally possible to have a negative variance. Actual galaxies will strictly obey κ₂ > 0 and therefore, the response of density-only models has to become scale-dependent latest at σ_max, δ – but likely already earlier. We call this the ‘scale-split break-down scale’.

The break-down scale is different if additional variables are considered. For the multivariate distribution of density and Laplacian (δ, L), we have to require the covariance matrix of the galaxy environment distribution to be positive and semi-definite. For this, it is at least necessary to have:

$\begin{matrix} det (κ_{2}) & \geq 0, \\ \Leftrightarrow det (1 + C β_{2}) & \geq 0, \end{matrix}$ $\begin{aligned} \det (\boldsymbol{\kappa }_2)&\ge 0, \nonumber \\ \Leftrightarrow \det (\mathsf 1 + \mathsf C \boldsymbol{\beta }_2)&\ge 0, \end{aligned}$ (80)

which appears slightly more complicated, but it will also be violated at some finite damping scale.

To measure the break-down scale, we infer for each halo mass the bias parameters at three different scales k_d ∈ (0.15, 0.2, 0.25) h Mpc⁻¹. Then, we evaluate the co-variance matrix of the background distribution at 500 different equally log-spaced damping scales between 10⁻³ h Mpc⁻¹ and 10¹ h Mpc⁻¹ and determine the earliest damping scale k_max where the covariance is large enough to violate Equation (80) or Equation (81). We show the corresponding results as shaded contours in Figure 9, where we additionally mark the characteristic wave-number of halos as:

Fig. 9.

Maximum damping scale where the PBS can be valid and bias scale-independent. The three reddish contours show the break-down scale of density-only bias models, estimated with bias parameters measured at different scales, and the three green-blueish contours show it for (δ, L) bias models. The black dashed line shows the wavenumber associated with the Lagrangian radius of halos. The break-down scale is consistent across different measurements and only seems to scale strongly with halo radius for the (δ) case.

$\begin{matrix} k_{halo} & = \frac{π}{R_{halo}}, \end{matrix}$ $\begin{aligned} k_{\mathrm{halo} }&= \frac{\pi }{R_{\mathrm{halo} }}, \end{aligned}$ (81)

where R_halo is the Lagrangian radius that encloses the halo mass, M_200b.

Comparing the measurements at different damping scales, we find that the inferred scale is reasonably converged with the scale that we measured the bias parameters at. The density-only break-down scale is typically a factor two smaller than k_halo and seems roughly proportional to it. We therefore conclude that any Lagrangian local in matter density (LLIMD) bias model with scale-independent bias parameters has to break down at a length scale roughly a factor two larger than Lagrangian radii of the considered halos⁵.

On the other hand, the (δ, L) case shows a notably different break-down scale. It scales only weakly with halo mass and it ranges only between k_max ∼ 0.3 − 0.5 h Mpc⁻¹. Note that this corresponds roughly to the Lagrangian scale of halos with M ∼ 2 × 10¹⁴ h⁻¹ M_⊙, where b₁ ∼ 1. For halos above M ≳ 3 × 10¹³ h⁻¹ M_⊙ including the Laplacian increases k_max relative to the density-only case, but for lower masses, it decreases it.

Comparing with the right panel of Figure 4, we notice that the bias parameters already become scale-dependent at notably smaller damping scales than k_max. For example, for M_200b ∼ 10¹⁵ h⁻¹ M_⊙ with b₁ ∼ 3, the measurements of β₂ are already scale-dependent beyond k_d ≳ 0.15 h Mpc⁻¹, whereas the mathematical break-down scale is k_max ∼ 0.3 h Mpc⁻¹. We therefore suggest that considerable care should be taken when setting up bias models close to the break-down scale. This is particularly relevant for hybrid methods, which might, in principle, allow for galaxy clustering to b described at notably smaller scales than are usually considered in a perturbative schemes (e.g. Modi et al. 2020; Zennaro et al. 2022).

4. Estimators for tensorial bias parameters

In Sect. 2, we explain how we inferred the general estimators of bias parameters associated with scalar variables (e.g. the density and Laplacian) with spatial corrections of any order. In Sect. 3, we show how these can be used to obtain reliable bias measurements from a single simulation.

However, the theory in Sect. 2 does not explain how to obtain estimators for parameters such as the tidal bias, b_K², which is defined as the response to

$\begin{matrix} K^{2} & = tr (K \cdot K), \end{matrix}$ $\begin{aligned} K^2&= \mathrm{tr} \left( \mathsf{K \cdot \mathsf K } \right) ,\end{aligned}$ (82)

$\begin{matrix} K & = (\nabla \otimes \nabla) ϕ - \frac{δ}{3} I_{2}, \end{matrix}$ $\begin{aligned} \mathsf K&= (\boldsymbol{\nabla } \otimes \boldsymbol{\nabla }) \phi - \frac{\delta }{3} \mathsf{I }_2, \end{aligned}$ (83)

where ϕ is the displacement potential, I₂ is the unit matrix, and K is the traceless tidal tensor. In this section, we present a general scheme to measure the related bias parameter b_K² and any other bias parameters that follow from contractions of derivatives of the potential field.

To achieve this, it is not optimal to consider directly the distribution of such scalar contracted quantities, since these distributions may get quite complicated. For example, p(K²) is not a Gaussian distribution, but rather a χ² distribution with five degrees of freedom. Furthermore, it is not immediately obvious how to define partial derivatives with respect to such variables, since partial derivatives may depend on what other terms are kept fixed. For example, it is not clear how a term like K³ = tr(K⋅K⋅K) would be derived with respect to K². It seems therefore difficult to generalise Equation (12) in this manner.

However, a clear and general framework for measuring such ‘tensorial’ bias terms can be developed by instead considering the full (quite high dimensional) distribution of the tidal tensor and its derivatives. Since the potential field of the early Universe is a Gaussian random field, these must follow a multivariate Gaussian distribution so that it is easy to compute derivatives of the distribution function in a general manner. The resulting ‘bias tensors’ can be decomposed into isotropic tensors that each have a one to one correspondence with traditionally used bias parameters. In this section, we introduce the needed mathematical notions step by step and will provide estimators for a few selected bias terms.

4.1. Tensorial bias expansion

We write the bias expansion in tensorial form as

$\begin{matrix} F & = 1 + B_{T} T + \frac{1}{2} T B_{T T} T + \dots + B_{S} S + \dots + B_{R} R + \dots \end{matrix}$ $\begin{aligned} F&= 1 + \mathsf{B }_{\mathsf{T }} \mathsf{T } + \frac{1}{2} \mathsf T \mathsf{B }_{\mathsf{T } \mathsf T } \mathsf T + \cdots + \mathsf{B }_\mathsf{S } \mathsf S + \cdots + \mathsf{B }_\mathsf{R } \mathsf R + \cdots \end{aligned}$ (84)

Here, again, F = n_g/n_g, 0 (in a separate-universe sense) and

$\begin{matrix} T & = (\nabla \otimes \nabla) ϕ, \end{matrix}$ $\begin{aligned} \mathsf T&= (\nabla \otimes \nabla ) \phi ,\end{aligned}$ (85)

$\begin{matrix} S & = (\nabla \otimes \nabla \otimes \nabla) ϕ, \end{matrix}$ $\begin{aligned} \mathsf S&= (\nabla \otimes \nabla \otimes \nabla ) \phi ,\end{aligned}$ (86)

$\begin{matrix} R & = (\nabla \otimes \nabla \otimes \nabla \otimes \nabla) ϕ, \end{matrix}$ $\begin{aligned} \mathsf R&= (\nabla \otimes \nabla \otimes \nabla \otimes \nabla ) \phi , \end{aligned}$ (87)

where B_T is a canonical bias tensor of rank of 2, B_TT of rank of 4, B_S of the rank of 3, and so on. This is also where an omitted product sign indicates a product over the indices of the last fully symmetric part of the first tensor and the first fully symmetric part of the second tensor. For example:

$\begin{matrix} B_{T} T & = B_{T} \overset{(2)}{\cdot} T = \sum_{ij} B_{T, ij} T_{ij}, \\ T B_{T T} T & = T \overset{(2)}{\cdot} B_{T T} \overset{(2)}{\cdot} T = \sum_{ijkl} T_{ij} B_{T T, ijkl} T_{kl}, \\ B_{S} S & = B_{S} \overset{(3)}{\cdot} S = \sum_{ijk} B_{S, ijk} S_{ijk}, \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T } \mathsf T&= \mathsf{B }_\mathsf{T } \overset{(2)}{\cdot } \mathsf T = \sum _{ij} B_\mathsf{T , ij} T_{ij}, \nonumber \\ \mathsf{T } \mathsf{B }_\mathsf{T \mathsf T } \mathsf T&= \mathsf T \overset{(2)}{\cdot } \mathsf{B }_\mathsf{T \mathsf T } \overset{(2)}{\cdot } \mathsf T = \sum _{ijkl} T_{ij} B_\mathsf{T \mathsf T , ijkl} T_{kl}, \nonumber \\ \mathsf{B }_\mathsf{S } \mathsf S&= \mathsf{B }_\mathsf{S } \overset{(3)}{\cdot } \mathsf S = \sum _{ijk} B_\mathsf{S , ijk} S_{ijk}, \nonumber \end{aligned}$ (88)

where we can use $\overset{(n)}{\cdot}$ ${\overset{(n)}{\cdot}}$ to explicitly denote the number of dimensions that are contracted.

We note that just as before, the bias tensors correspond to derivatives of the large-scale bias function, for instance,

$\begin{matrix} B_{T} & = {\frac{\partial}{\partial T} F |}_{T = 0}, \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T }&= \left. \frac{\partial }{\partial \mathsf T } F \, \right|_\mathsf{T = 0} ,\end{aligned}$ (89)

$\begin{matrix} B_{T T} & = {(\frac{\partial}{\partial T} \otimes \frac{\partial}{\partial T}) F |}_{T = 0} . \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T \mathsf T }&= \left. \left( \frac{\partial }{\partial \mathsf T } \otimes \frac{\partial }{\partial \mathsf T } \right) F \, \right|_\mathsf{T = 0}. \end{aligned}$ (90)

4.2. Isotropic tensors

Due to the isotropy of the Universe, each of the bias tensors has to be isotropic. That means that a bias tensor should be identical when measured from a rotated frame of reference. For example, it has to be

$\begin{matrix} U^{T} B_{T} U & = B_{T} \end{matrix}$ $\begin{aligned} \mathsf{U }^T \mathsf{B }_\mathsf{T } \mathsf U&= \mathsf{B }_\mathsf{T } \end{aligned}$ (91)

for any rotation matrix U. From this, it follows immediately that B_T has to be proportional to the unit matrix I₂ (where the proportionality constant is equal to b₁). In general, a rank n tensor A is isotropic if it holds for any rotation matrix, U:

$\begin{matrix} A_{a b c \dots} U_{ai} U_{bj} U_{ck} \dots & = A_{i j k \dots} \end{matrix}$ $\begin{aligned} A_{abc\ldots } U_{ai} U_{bj} U_{ck} \ldots&= A_{ijk\ldots } \end{aligned}$ (92)

Here, we have used Einstein’s sum convention, where the rotation matrix is applied to each index of A individually. To express whether some tensor is isotropic, we define 𝕌_n as the space of all tensors that are isotropic and of a rank, n.

In general, all isotropic tensors can be decomposed in index notation through different combinations of the Kronecker-delta symbol δ_ij and the Levi-Civita symbol, ϵ_ijk. For example

$\begin{matrix} A \in U_{2} & \Rightarrow A_{ij} = a δ_{ij}, \end{matrix}$ $\begin{aligned} \mathsf A \in \mathbb{U} _{2}&\Rightarrow A_{ij} = a \delta _{ij} ,\end{aligned}$ (93)

$\begin{matrix} A \in U_{3} & \Rightarrow A_{ijk} = a ϵ_{ijk}, \end{matrix}$ $\begin{aligned} \mathsf A \in \mathbb{U} _{3}&\Rightarrow A_{ijk} = a \epsilon _{ijk} ,\end{aligned}$ (94)

$\begin{matrix} A \in U_{4} & \Rightarrow A_{ijkl} = a δ_{ij} δ_{kl} + b δ_{ik} δ_{jl} + c δ_{il} δ_{jk}, \end{matrix}$ $\begin{aligned} \mathsf A \in \mathbb{U} _{4}&\Rightarrow A_{ijkl} = a \delta _{ij} \delta _{kl} + b \delta _{ik} \delta _{jl} + c \delta _{il} \delta _{jk}, \end{aligned}$ (95)

where a, b, c ∈ ℝ. A compact way of writing the same type of statement can be:

$\begin{matrix} U_{4} = span ({δ_{ij} δ_{kl}, δ_{il} δ_{jk}, δ_{ik} δ_{jl}}), \end{matrix}$ $\begin{aligned} \mathbb{U} _{4} = \mathrm{span} \left( \{ \delta _{ij}\delta _{kl}, \delta _{il}\delta _{jk}, \delta _{ik}\delta _{jl} \} \right), \end{aligned}$ (96)

which signifies that 𝕌₄ is the space of tensors that can be reached through linear combinations of the indicated tensors and we say that these are basis tensors of 𝕌₄.

It is easy to see that we should be able to decompose each of the bias tensors into a small number of independent scalars that multiply the basis tensors and that correspond to traditional bias parameters. However, before performing such a decomposition we need to further consider the symmetries of the tensors.

4.3. Symmetric isotropic tensors

Since the bias tensors correspond to derivatives of a scalar function with respect to symmetric tensors, they have to obey the same symmetries. For example, the tensor B_TT has to be symmetric in the first two and last two indices⁶:

$\begin{matrix} B_{T T, ijkl} & = B_{T T, jikl} = B_{T T, ijlk} = B_{T T, jilk} . \end{matrix}$ $\begin{aligned} B_\mathsf{T \mathsf T , ijkl}&= B_\mathsf{T \mathsf T , jikl} = B_\mathsf{T \mathsf T , ijlk} = B_\mathsf{T \mathsf T , jilk} . \end{aligned}$ (97)

Therefore, B_TT cannot be every tensor from the tensorspace 𝕌₄, but only from the subspace 𝕍₂₂ ⊆ 𝕌₄ of isotropic rank four tensors that have the 2, 2 symmetry.

To formalise this, we define 𝕍_n ⊂ 𝕌_n as the space of all isotropic tensor of rank n that are additionally fully symmetric under any permutation of indices, specifically:

$\begin{matrix} B_{T} & \in V_{2}, \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T }&\in \mathbb{V} _2,\end{aligned}$ (98)

$\begin{matrix} B_{S} & \in V_{3}, \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{S }&\in \mathbb{V} _3 ,\end{aligned}$ (99)

$\begin{matrix} B_{R} & \in V_{4} . \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{R }&\in \mathbb{V} _4 . \end{aligned}$ (100)

We note that 𝕍₃ = {0}, namely, the only symmetric rank three isotropic tensor is 0. Furthermore, we define 𝕍_nm as the space of all isotropic tensors of rank n + m that are symmetric in the first n indices and the last m indices. For example,

$\begin{matrix} B_{T T} & \in V_{22} . \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T \mathsf T }&\in \mathbb{V} _{22}. \end{aligned}$ (101)

To find a basis for some symmetric isotropic tensor space of rank n, we can consider all basis tensors of 𝕌_n, symmetrise these and then discard any tensors that are duplicate or zero. We define the symmetrisation operator S_n which symmetrises a tensor in n indices. For instance, we have:

$\begin{matrix} S_{2} (M_{ij}) & = \frac{1}{2} (M_{ij} + M_{ji}), \end{matrix}$ $\begin{aligned} S_{2}(M_{ij})&= \frac{1}{2}(M_{ij} + M_{ji}) ,\end{aligned}$ (102)

$\begin{matrix} S_{3} (M_{ijk}) & = \frac{1}{6} (M_{ijk} + M_{ikj} + M_{jik} + M_{jki} + M_{kij} + M_{jki}) . \end{matrix}$ $\begin{aligned} S_{3}(M_{ijk})&= \frac{1}{6}(M_{ijk} + M_{ikj} + M_{jik} + M_{jki} + M_{kij} + M_{jki}). \end{aligned}$ (103)

Furthermore, we defined a double symmetrisation operator, S_nm, that symmetrises the first n indices and the last m indices. For example, the effect of S₂₂ onto a rank four tensor M is given in the index notation by

$\begin{matrix} S_{22} (M_{ijkl}) & = \frac{1}{4} (M_{ijkl} + M_{jikl} + M_{ijlk} + M_{jilk}) . \end{matrix}$ $\begin{aligned} S_{22}(M_{ijkl})&= \frac{1}{4} \left( M_{ijkl} + M_{jikl} + M_{ijlk} + M_{jilk} \right). \end{aligned}$ (104)

It acts on the basis tensors of 𝕌₄ in the following ways:

$\begin{matrix} S_{22} (δ_{ij} δ_{kl}) & = δ_{ij} δ_{kl} = : I_{22, i j}, \end{matrix}$ $\begin{aligned} S_{22}(\delta _{ij} \delta _{kl})&= \delta _{ij} \delta _{kl} =: I_{22, ij},\end{aligned}$ (105)

$\begin{matrix} S_{22} (δ_{ik} δ_{jl}) & = \frac{1}{2} (δ_{ik} δ_{jl} + δ_{il} δ_{jk}) = : I_{2 = 2, i j k l}, \end{matrix}$ $\begin{aligned} S_{22}(\delta _{ik}\delta _{jl})&= \frac{1}{2} ( \delta _{ik}\delta _{jl} + \delta _{il}\delta _{jk}) =: I_{2 = 2,ijkl},\end{aligned}$ (106)

$\begin{matrix} S_{22} (δ_{il} δ_{jk}) & = \frac{1}{2} (δ_{il} δ_{jk} + δ_{ik} δ_{jl}) = I_{2 = 2, i j k l} \end{matrix}$ $\begin{aligned} S_{22}(\delta _{il}\delta _{jk})&= \frac{1}{2} ( \delta _{il}\delta _{jk} + \delta _{ik}\delta _{jl}) = I_{2 = 2, ijkl} \end{aligned}$ (107)

where we have assumed that δ_ij is itself symmetric. The symmetrised second and third basis tensor are identical. Therefore, 𝕍₂₂ has (unlike 𝕌₄) only two basis tensors:

$\begin{matrix} V_{22} & = span ({I_{22}, I_{2 = 2}}) . \end{matrix}$ $\begin{aligned} \mathbb{V} _{22}&= \mathrm{span} ({\{\mathsf{I }_{22}, \mathsf{I }_{2 = 2}\}}). \end{aligned}$ (108)

The main difference between δ_ijδ_kl and the two tensors δ_ikδ_jl and δ_ilδ_jk is that the delta symbols used to define δ_ijδ_kl each connect internally inside the groups of symmetric indices and have no connections between the groups. (Recall that i ↔ j and k ↔ l are to be symmetrised.) However, δ_ikδ_jl and δ_ilδ_jk each have zero group internal connections, but two connections between the symmetry groups. In fact, the symmetrisation operation identifies all terms that have the same number of intra- and inter- group connections and therefore we only need to consider how many independent possibilities exist to connect the symmetry groups to identify the basis tensors of any space, 𝕍_mn. We can therefore represent the basis tensors of each tensor space through a simple diagram as illustrated in Figure 10. We label these tensors through a symbol that shows the number of inter group connections in the index:

Fig. 10.

Graphic representation of the isotropic tensors that form a basis for a few selected isotropic tensor spaces with symmetries. All basis tensors of a space with given symmetry can be constructed by considering the number of different ways that the symmetry groups can be connected. In this figure each circle with number n represents a group of n fully symmetric indices and each connection represents one delta symbol (that can either connect two indices from the same group or from two different groups).

$\begin{matrix} V_{33} & = span ({I_{3 - 3}, I_{3 \equiv 3}}), \end{matrix}$ $\begin{aligned} \mathbb{V} _{33}&= \mathrm{span} \left( \{\mathsf{I }_{3-3}, \mathsf{I }_{3\equiv 3}\} \right) ,\end{aligned}$ (109)

$\begin{matrix} V_{44} & = span ({I_{44}, I_{4 = 4}, I_{4 ≣ 4}}), \end{matrix}$ $\begin{aligned} \mathbb{V} _{44}&= \mathrm{span} \left( \{\mathsf{I }_{44}, \mathsf{I }_{4 = 4}, \mathsf{I }_{4 \superequiv 4}\} \right) ,\end{aligned}$ (110)

$\begin{matrix} V_{24} & = span ({I_{24}, I_{2 = 4}}) . \end{matrix}$ $\begin{aligned} \mathbb{V} _{24}&= \mathrm{span} \left( \{\mathsf{I }_{24}, \mathsf{I }_{2 = 4}\} \right). \end{aligned}$ (111)

We explain in Appendix B.1 how to construct tensors with three or more symmetry groups. For these cases the procedure is mostly analogous, but it additionally needs to be considered whether there is symmetry with respect to permutations of the different symmetry groups.

4.4. Orthogonal basis

The basis tensors highlighted in the last section are sufficient to uniquely decompose the bias tensors. However, it is advantageous to define the basis tensors in such a way that they are orthogonal to each other.

For example, we can decompose an isotropic tensor M ∈ 𝕍₂₂ as

$\begin{matrix} M & = A I_{22} + B I_{2 = 2} . \end{matrix}$ $\begin{aligned} \mathsf M&= A \mathsf{I }_{22} + B \mathsf{I }_{2 = 2} . \end{aligned}$ (112)

Its full contraction with I₂₂ is

$\begin{matrix} M \overset{(4)}{\cdot} I_{22} & = A (I_{22} \overset{(4)}{\cdot} I_{22}) + B (I_{2 = 2} \overset{(4)}{\cdot} I_{22}), \end{matrix}$ $\begin{aligned} \mathsf M \overset{(4)}{\cdot } \mathsf{I }_{22}&= A (\mathsf{I }_{22} \overset{(4)}{\cdot } \mathsf{I }_{22}) + B (\mathsf{I }_{2 = 2} \overset{(4)}{\cdot } \mathsf{I }_{22}) ,\end{aligned}$ (113)

$\begin{matrix} = 9 A + 3 B, \end{matrix}$ $\begin{aligned} &= 9 A + 3 B, \end{aligned}$ (114)

which includes contributions of both A and B. We could infer A and B from M by additionally contracting with I_2 = 2 and then solving the resulting system of equations.

However, it would be desirable to have a simple way of finding the coefficients. Therefore, we define an orthogonal basis such that

$\begin{matrix} J_{a} \overset{(r)}{\cdot} J_{b} & = {\begin{matrix} | | J_{a} | |^{2} if a = b, \\ 0 otherwise, \end{matrix} \end{matrix}$ $\begin{aligned} \mathsf{J }_a \overset{(r)}{\cdot } \mathsf{J }_b&= {\left\{ \begin{array}{ll} \left||{\mathsf{J }_a} \right||^2 \quad \text{ if} a = b, \\ 0 \quad \text{ otherwise}, \end{array}\right.} \end{aligned}$ (115)

where the contraction goes over the full rank r of J_a and J_b (which must have equal rank). Such a basis can be found through Gram-Schmidt orthogonalisation and is for example given for 𝕍₂₂ by:

$\begin{matrix} J_{22} & = I_{22}, \end{matrix}$ $\begin{aligned} \mathsf{J }_{22}&= \mathsf{I }_{22} ,\end{aligned}$ (116)

$\begin{matrix} J_{2 = 2} & = I_{2 = 2} - \frac{1}{3} I_{22} . \end{matrix}$ $\begin{aligned} \mathsf{J }_{2 = 2}&= \mathsf{I }_{2 = 2} - \frac{1}{3} \mathsf{I }_{22} . \end{aligned}$ (117)

Then we can easily find the coefficients as

$\begin{matrix} M & = α J_{22} + β J_{2 = 2}, \end{matrix}$ $\begin{aligned} \mathsf M&= \alpha \mathsf{J }_{22} + \beta \mathsf{J }_{2 = 2} ,\end{aligned}$ (118)

$\begin{matrix} α & = \frac{M \overset{(4)}{\cdot} J_{22}}{| | J_{22} | |^{2}} = \frac{1}{9} M \overset{(4)}{\cdot} J_{22}, \end{matrix}$ $\begin{aligned} \alpha&= \frac{{\mathsf M \overset{(4)}{\cdot } \mathsf{J }_{22}}}{\left||{\mathsf{J }_{22} } \right||^2} = \frac{1}{9} \mathsf M \overset{(4)}{\cdot } \mathsf{J }_{22},\end{aligned}$ (119)

$\begin{matrix} β & = \frac{M \overset{(4)}{\cdot} J_{2 = 2}}{| | J_{2 = 2} | |^{2}} = \frac{1}{5} M \overset{(4)}{\cdot} J_{2 = 2} . \end{matrix}$ $\begin{aligned} \beta&= \frac{\mathsf M \overset{(4)}{\cdot } \mathsf{J }_{2 = 2}}{\left||{\mathsf{J }_{2 = 2}} \right||^2} = \frac{1}{5} \mathsf M \overset{(4)}{\cdot } \mathsf{J }_{2 = 2}. \,\, \end{aligned}$ (120)

We note that the coefficients inferred in this way are different than the A and B and the chosen basis will also affect the inferred bias parameters.

It is worth noting that a decomposition of a bias tensor into different bases,

$\begin{matrix} B_{T T} & = b_{I_{22}} I_{22} + b_{I_{2 = 2}} I_{2 = 2}, \\ = b_{J_{22}} J_{22} + b_{J_{2 = 2}} J_{2 = 2}, \end{matrix}$ $\begin{aligned} \mathsf{B }_\mathsf{T \mathsf T }&= b_{\mathsf{I }_{22}} \mathsf{I }_{22} + b_{\mathsf{I }_{2 = 2}} \mathsf{I }_{2 = 2}, \nonumber \\&= b_{\mathsf{J }_{22}} \mathsf{J }_{22} + b_{\mathsf{J }_{2 = 2}} \mathsf{J }_{2 = 2}, \end{aligned}$ (121)

corresponds to different independent scalar variables appearing in the bias expansion. For example, if we track the corresponding term from Equation (85), we find

$\begin{matrix} T B_{T T} T & = b_{I_{22}} tr {(T)}^{2} + b_{I_{2 = 2}} tr (T^{2}), \\ = b_{I_{22}} δ^{2} + b_{I_{2 = 2}} T^{2}, \end{matrix}$ $\begin{aligned} \mathsf T \mathsf{B }_\mathsf{T \mathsf T } \mathsf T&= b_{\mathsf{I }_{22}} \mathrm{tr} \left( {\mathsf{T }} \right)^2 + b_{\mathsf{I }_{2 = 2}} \mathrm{tr} \left( {\mathsf{T }^2} \right), \nonumber \\&= b_{\mathsf{I }_{22}} \delta ^2 + b_{\mathsf{I }_{2 = 2}} T^2,\end{aligned}$ (122)

$\begin{matrix} T B_{T T} T & = b_{J_{22}} tr {(T)}^{2} + b_{J_{2 = 2}} (tr (T^{2}) - \frac{δ^{2}}{3}), \\ = b_{J_{22}} δ^{2} + b_{J_{2 = 2}} K^{2} . \end{matrix}$ $\begin{aligned} \mathsf T \mathsf{B }_\mathsf{T \mathsf T } \mathsf T&= b_{\mathsf{J }_{22}} \mathrm{tr} \left( {\mathsf{T }} \right)^2 + b_{\mathsf{J }_{2 = 2}} \left(\mathrm{tr} \left( {\mathsf{T }^2} \right) - \frac{\delta ^2}{3} \right), \nonumber \\&= b_{\mathsf{J }_{22}}\delta ^2 + b_{\mathsf{J }_{2 = 2}} K^2. \end{aligned}$ (123)

Thus, the non-orthogonal basis leads to δ and T² as independent terms, whereas the orthogonal basis leads to using δ and K² as independent terms. The corresponding bias parameters can be directly identified as b_J₂₂ = b₂ and $\frac{1}{2} b_{J_{2 = 2}} = b_{K^{2}}$ $\frac{1}{2} b_{{\mathsf{J}}_{2 = 2}} = b_{K^2}$ .

We list an overview of the orthogonal basis tensors that we use here in Table 1. While it is possible to derive the full algebra of products between isotropic tensors from the index representations, this is rather cumbersome. Therefore, we have written a short code that creates explicit numerical representations of these tensors and with which it is easy to evaluate (and decompose) different types of products of these tensors. We use this together with the symbolic computer algebra system SYMPY (Meurer et al. 2017) to systematically compute symbolic representations of expressions in the following sections. This code is openly available⁷.

Table 1.

Orthogonal basis tensors that we consider here.

4.5. Bias estimators

Given the orthogonal basis tensors, we define the tensorial bias parameters through

$\begin{matrix} b_{J_{X}} & = {\frac{\partial^{N} F}{\partial T_{0}^{N}} \overset{(2 N)}{\cdot} \frac{J_{X}}{| | J_{X} | |^{2}} |}_{T_{0} = 0}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{X}}&= \left. \frac{\partial ^N F}{\partial \mathsf{T }_0^N} \overset{(2N)}{\cdot } \frac{\mathsf{J }_{X}}{\left||{ \mathsf{J }_X } \right||^2} \right|_{\mathsf{T }_0 = 0}, \end{aligned}$ (124)

where the rank of J_X is 2N. While for scalar terms, as in Equation (3), the bias parameters are simply given by derivatives of the galaxy number with respect to the corresponding scalar, more generally we define bias parameters for any tensorial terms as derivatives of the galaxy number with respect to the tidal tensor (or higher spatial order tensors) that are contracted with the corresponding isotropic tensor.

In complete analogy to Equation (46) it follows that these parameters can be estimated as

$\begin{matrix} b_{J_{X}} & = {(- 1)}^{N} {〈 \frac{1}{p} \frac{\partial p}{\partial T^{N}} \overset{(2 N)}{\cdot} \frac{J_{X}}{| | J_{X} | |^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{X}}&= (-1)^N \left\langle {\frac{1}{p} \frac{\partial p}{\partial \mathsf{T }^N} \overset{(2N)}{\cdot } \frac{\mathsf{J }_{X}}{\left||{ \mathsf{J }_X } \right||^2}} \right\rangle _g , \end{aligned}$ (125)

where p is the full distribution of the tidal tensor, T.

Bias terms corresponding to higher spatial derivatives like R and S can be defined analogously.

4.6. Tidal bias

We show in Appendices B.2 and B.3 that the distribution of the tidal tensor is given by

$\begin{matrix} p (T) & = N exp (- \frac{1}{2} T^{T} C_{T}^{+} T), \end{matrix}$ $\begin{aligned} p(\mathsf T )&= N \exp \left(- \frac{1}{2} \mathsf{T }^T \mathsf{C }_{\mathsf{T }}^+ \mathsf{T } \right), \end{aligned}$ (126)

where C_T⁺ (the generalised inverse of the covariance matrix, as explained in the Appendix) is given by

$\begin{matrix} C_{T}^{+} & = \frac{1}{σ_{0}^{2}} J_{22} + \frac{15}{2 σ_{0}^{2}} J_{2 = 2} . \end{matrix}$ $\begin{aligned} \mathsf{C }_\mathsf{T }^+&= \frac{1}{\sigma _0^2} \mathsf{J }_{22} + \frac{15}{2 \sigma _0^2} \mathsf{J }_{2 = 2} . \end{aligned}$ (127)

Taking derivatives yields

$\begin{matrix} \frac{1}{p} \frac{\partial p}{\partial T} & = - C_{T}^{+} T, \end{matrix}$ $\begin{aligned} \frac{1}{p} \frac{\partial p}{\partial \mathsf T }&= - \mathsf{C }_\mathsf{T }^+ \mathsf T ,\end{aligned}$ (128)

$\begin{matrix} \frac{1}{p} \frac{\partial^{2} p}{\partial T^{2}} & = (T C_{T}^{+}) \otimes (C_{T}^{+} T) - C_{T}^{+} \end{matrix}$ $\begin{aligned} \frac{1}{p} \frac{\partial ^2 p}{\partial \mathsf{T }^2}&= (\mathsf{T } \mathsf{C }_{\mathsf{T }}^+) \otimes (\mathsf{C }_\mathsf{T }^+ \mathsf T ) - \mathsf{C }_\mathsf{T }^+ \end{aligned}$ (129)

and we find

$\begin{matrix} b_{J_{2}} = b_{1} & = {〈 \frac{J_{2} C^{+} T}{| | J_{2} | |^{2}} 〉}_{g} = {〈 \frac{J_{2} T}{σ_{0}^{2}} 〉}_{g} = {〈 \frac{δ}{σ_{0}^{2}} 〉}_{g} \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_2} = b_1&= \left\langle {\frac{\mathsf{J }_{2} \mathsf{C }^+ \mathsf T }{\left||{\mathsf{J }_{2}} \right||^2}} \right\rangle _{\mathrm{g} } = \left\langle {\frac{\mathsf{J }_{2} \mathsf{T }}{\sigma _{0}^{2}}} \right\rangle _{\mathrm{g} } = \left\langle {\frac{\delta }{\sigma _{0}^{2}}} \right\rangle _{\mathrm{g} } \end{aligned}$ (130)

where we used the inner product between two isotropic tensors: $J_{2} \overset{(2)}{\cdot} J_{22} = 3 J_{2}$ ${\mathsf{J}}_{2} {\overset{(2)}{\cdot}} {\mathsf{J}}_{22} = 3 {\mathsf{J}}_{2}$ . We evaluated the terms of this type systematically with numerical representations as described in Sect. 4.4. Naturally, this estimator of b_J₂ = b₁ is consistent with the one that we have inferred previously in Equation (15).

At the second order, we find the following estimators:

$\begin{matrix} b_{J_{22}} = b_{2} & = {〈 \frac{T^{T} J_{22} T - σ_{0}^{2}}{σ_{0}^{4}} 〉}_{g} \\ = {〈 \frac{δ^{2} - σ_{0}^{2}}{σ_{0}^{4}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{22}} = b_2&= \left\langle {\frac{\mathsf{T }^T \mathsf{J }_{22} \mathsf{T } - \sigma _{0}^{2}}{\sigma _{0}^{4}}} \right\rangle _{\mathrm{g} } \nonumber \\&= \left\langle {\frac{\delta ^2 - \sigma _0^2}{\sigma _0^4}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (131)

$\begin{matrix} b_{J_{2 = 2}} = 2 b_{K^{2}} & = \frac{15}{4 σ_{0}^{4}} {〈 3 T^{T} J_{2 = 2} T - 2 σ_{0}^{2} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{2 = 2}} = 2 b_{K^2}&= \frac{15}{4 \sigma _{0}^{4}} \left\langle {3 \mathsf{T }^T \mathsf{J }_{2 = 2} \mathsf{T } - 2 \sigma _{0}^{2} } \right\rangle _{\mathrm{g} } ,\end{aligned}$ (132)

$\begin{matrix} = \frac{15}{4 σ_{0}^{4}} {〈 3 K^{2} - 2 σ_{0}^{2} 〉}_{g} . \end{matrix}$ $\begin{aligned}&= \frac{15}{4 \sigma _{0}^{4}} \left\langle {3 K^2 - 2 \sigma _{0}^{2}} \right\rangle _{\mathrm{g} } . \end{aligned}$ (133)

We note that conventionally the bias parameter, b_K², is defined so that it appears with a pre-factor K² in the bias expansion, whereas our parameter b_{J_2 = 2} appears with a factor $\frac{1}{2} K^{2}$ $\frac{1}{2} K^2$ in the expansion (after contracting the corresponding tensors), so that it is $b_{K^{2}} = \frac{1}{2} b_{J_{2 = 2}}$ $b_{K^2} = \frac{1}{2} b_{{\mathsf{J}}_{2 = 2}}$ . Although we think that our fore-factor convention makes more sense in principle, since b_{J_2 = 2} corresponds to a second derivative term, we still present our results in terms of the conventional notation (e.g. b_K²) throughout this paper.

We note that just as the large-scale value of δ/σ² at an object’s location is motivated by Equation (15) and sometimes called the ‘bias-per-object’ (e.g. Paranjape et al. 2018; Contreras et al. 2023), we could refer in a similar manner to the value of

$\begin{matrix} b_{K^{2}, 1 h} & = \frac{15}{8 σ_{0}^{4}} (3 K^{2} - 2 σ_{0}^{2}) \end{matrix}$ $\begin{aligned} b_{K^2, 1h}&= \frac{15}{8 \sigma _{0}^{4}} (3 K^2 - 2 \sigma _{0}^{2}) \end{aligned}$ (134)

as the value of the ‘tidal bias-per-object’.

Just as estimators with spatial corrections could be obtained for the density biases by considering the joint distribution of p(δ, L), we can obtain higher spatial order estimators for the tidal bias by considering the joint distribution of second and fourth potential derivatives p(T, R) and evaluating Equation (125) for this distribution. We show how we derived the bias estimator for this case in appendices B.5 and B.6. The resulting estimator is:

$\begin{matrix} b_{J_{2 = 2}} & = \frac{15}{4 σ_{*}^{8}} {〈 3 K^{2} σ_{2}^{4} + 6 ϕ_{2 = 4} σ_{1}^{2} σ_{2}^{2} + 3 ϕ_{4 = 4} σ_{1}^{4} - 2 σ_{2}^{2} σ_{*}^{4} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{2 = 2}}&= \frac{15}{4 \sigma _{*}^{8}} \left\langle {3 K^2 \sigma _{2}^{4} + 6 \phi _{2 = 4} \sigma _{1}^{2} \sigma _{2}^{2} + 3 \phi _{4 = 4} \sigma _{1}^{4} - 2 \sigma _{2}^{2} \sigma _{*}^{4}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (135)

where

$\begin{matrix} ϕ_{2 = 4} & : = T J_{2 = 4} R = \sum_{ij} (\partial_{i} \partial_{j} ϕ) (\partial_{i} \partial_{j} δ) - \frac{1}{3} δ \nabla^{2} δ, \end{matrix}$ $\begin{aligned} \phi _{2 = 4}&:= \mathsf{T } J_{2 = 4} \mathsf{R } = \sum _{ij} (\partial _i \partial _j \phi ) (\partial _i \partial _j \delta ) - \frac{1}{3} \delta \nabla ^2 \delta ,\end{aligned}$ (136)

$\begin{matrix} ϕ_{4 = 4} & : = T J_{4 = 4} R = \sum_{ij} {(\partial_{i} \partial_{j} δ)}^{2} - \frac{1}{3} {(\nabla^{2} δ)}^{2}, \end{matrix}$ $\begin{aligned} \phi _{4 = 4}&:= \mathsf T J_{4 = 4} \mathsf R = \sum _{ij} (\partial _i \partial _j \delta )^2 - \frac{1}{3} (\nabla ^2 \delta )^2, \end{aligned}$ (137)

and $σ_{*}^{2} = σ_{0}^{2} - σ_{1}^{4} / σ_{2}^{2}$ $\sigma_{*}^2 = \sigma_{0}^{2} - \sigma_{1}^{4} / \sigma_{2}^{2}$ .

4.7. Estimators for third derivative terms

The distribution of third derivatives of the potential S is derived in Appendix B.4 and is given by a multivariate Gaussian with the generalised inverse covariance matrix

$\begin{matrix} C_{S}^{+} & = \frac{3}{σ_{1}^{2}} J_{3 - 3} + \frac{35}{2 σ_{1}^{2}} J_{3 \equiv 3} . \end{matrix}$ $\begin{aligned} \mathsf{C }_\mathsf{S }^+&= \frac{3}{\sigma _1^{2}} \mathsf{J }_{3-3} + \frac{35}{2 \sigma _1^{2}} \mathsf{J }_{3\equiv 3}. \end{aligned}$ (138)

We note that it is fine to consider the third spatial derivatives independently of the second derivatives, since the joint distribution factorises as:

$\begin{matrix} p (T, S) & = p (T) p (S) . \end{matrix}$ $\begin{aligned} p(\mathsf T , \mathsf S )&= p(\mathsf T ) p(\mathsf S ). \end{aligned}$ (139)

We find bias estimators by evaluating Equation (125) as:

$\begin{matrix} b_{J_{3 - 3}} & = \frac{3}{σ_{1}^{4}} {〈 S J_{3 - 3} S - σ_{1}^{2} 〉}_{g} \\ = \frac{3}{σ_{1}^{4}} {〈 {(\nabla δ)}^{2} - σ_{1}^{2} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{3-3}}&= \frac{3}{\sigma _1^4} \left\langle \mathsf{S \mathsf{J }_{3-3} \mathsf S - \sigma _{1}^{2}} \right\rangle _{\mathrm{g} } \nonumber \\&= \frac{3}{\sigma _1^4} \left\langle {(\nabla \delta )^2 - \sigma _{1}^{2}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (140)

$\begin{matrix} b_{J_{3 \equiv 3}} & = \frac{35}{σ_{1}^{4}} {〈 5 S J_{3 \equiv 3} S - 2 σ_{1}^{2} 〉}_{g} \\ = \frac{35}{σ_{1}^{4}} {〈 5 ϕ_{3 \equiv 3} - 2 σ_{1}^{2} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_{\mathsf{J }_{3\equiv 3}}&= \frac{35}{\sigma _1^4} \left\langle {5 \mathsf S \mathsf{J }_{3\equiv 3} \mathsf S - 2 \sigma _{1}^{2}} \right\rangle _{\mathrm{g} } \nonumber \\&= \frac{35}{\sigma _1^4} \left\langle {5 \phi _{3\equiv 3} - 2 \sigma _{1}^{2}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (141)

where

$\begin{matrix} ϕ_{3 \equiv 3} & = \sum_{ijk} {(\partial_{ijk} ϕ)}^{2} - \frac{3}{5} \sum_{i} {(\partial_{i} δ)}^{2}, \end{matrix}$ $\begin{aligned} \phi _{3\equiv 3}&= \sum _{ijk} (\partial _{ijk} \phi )^2 - \frac{3}{5} \sum _i (\partial _{i} \delta )^2, \end{aligned}$ (142)

and where we note again that conventions may differ by a factor 2 so that we have, for instance, b_{J_3 − 3} = 2b_(∇δ)² if b_(∇δ)² is the bias parameter that would be multiplied (∇δ²) without any fore-factor in a bias expansion.

4.8. Tensorial cumulant biases

As for the scalar case, it is possible to define tensorial cumulant bias parameters instead of tensorial canonical bias parameters. Unfortunately, interesting differences arise only at orders that are higher than the ones that we have considered here. However, for the benefit of future studies, we still want to briefly line out, how to define tensorial cumulant biases.

Analogously to Equation (85), we considered a tensorial expansion of log F as

$\begin{matrix} log F = 1 + β_{T} T + \frac{1}{2} T β_{T T} T + \dots + β_{S} S + \dots + β_{R} R + \dots \end{matrix}$ $\begin{aligned} \log F = 1 + {\beta }_\mathsf{T } \mathsf T + \frac{1}{2} \mathsf T {\beta }_\mathsf{T \mathsf T } \mathsf T + \cdots + {\beta }_{\mathsf{S }} \mathsf S + \cdots + {\beta }_\mathsf{R } \mathsf R + \cdots \end{aligned}$ (143)

Taking derivatives at zero and identifying coefficients yields the relations

$\begin{matrix} β_{T} & = B_{T}, \end{matrix}$ $\begin{aligned} {\beta }_\mathsf{T }&= \mathsf{B }_\mathsf{T } ,\end{aligned}$ (144)

$\begin{matrix} β_{T T} & = B_{T T} - B_{T} \otimes B_{T} . \end{matrix}$ $\begin{aligned} {\beta }_\mathsf{T \mathsf T }&= \mathsf{B }_\mathsf{T \mathsf T } - \mathsf{B }_\mathsf{T } \otimes \mathsf{B }_\mathsf{T }. \end{aligned}$ (145)

To decompose this into isotropic tensors we may contract these equations with the isotropic tensors to find the relations

$\begin{matrix} β_{1} = β_{J_{2}} & = b_{J_{2}} = b_{1}, \end{matrix}$ $\begin{aligned} \beta _1 = \beta _{\mathsf{J }_2}&= b_{\mathsf{J }_2} = b_1 ,\end{aligned}$ (146)

$\begin{matrix} β_{2} = β_{J_{22}} & = b_{J_{22}} - β_{J_{2}}^{2} = b_{2} - b_{1}^{2}, \end{matrix}$ $\begin{aligned} \beta _{2} = \beta _{\mathsf{J }_{22}}&= b_{\mathsf{J }_{22}} - \beta _{\mathsf{J }_2}^2 = b_2 - b_1^2 ,\end{aligned}$ (147)

$\begin{matrix} 2 β_{K^{2}} = β_{J_{2 = 2}} & = b_{J_{2 = 2}} = 2 b_{K^{2}} . \end{matrix}$ $\begin{aligned} 2 \beta _{K^2} = \beta _{\mathsf{J }_{2 = 2}}&= b_{\mathsf{J }_{2 = 2}} = 2 b_{K^2}. \end{aligned}$ (148)

This is consistent with what we derived earlier. Tensorial cumulant biases become more interesting at third order, where third derivative of the bias Function yields

$\begin{matrix} β_{T T T} & = B_{T T T} - 3 S_{222} (B_{T} \otimes B_{T T}) + 2 S_{222} (B_{T} \otimes B_{T} \otimes B_{T}) . \end{matrix}$ $\begin{aligned} \beta _\mathsf{T \mathsf T \mathsf T }&= \mathsf{B }_\mathsf{T \mathsf T \mathsf T } - 3 S_{222}(\mathsf{B }_\mathsf{T } \otimes \mathsf{B }_\mathsf{T \mathsf T }) + 2 S_{222} (\mathsf{B }_\mathsf{T } \otimes \mathsf{B }_\mathsf{T } \otimes \mathsf{B }_\mathsf{T }). \end{aligned}$ (149)

If we contract this with the isotropic tensor S₂₂₂(J_2 = 2 ⊗ J₂), we have

$\begin{matrix} β_{J_{2 = 22}} & = b_{J_{2 = 22}} - 3 b_{J_{2}} b_{J_{2 = 2}} . \end{matrix}$ $\begin{aligned} \beta _{\mathsf{J }_{2 = 22}}&= b_{\mathsf{J }_{2 = 22}} - 3 b_\mathsf{J _{2}} b_\mathsf{J _{2 = 2}}. \end{aligned}$ (150)

Therefore, if the full multivariate bias function is close to Gaussian, also tensorial bias parameters of this form should be expected to be close to zero. Dealing with such objects goes beyond the framework of this study and we set this aside as an interesting proposition for a future work.

5. Measurements of tensorial bias parameters

In this section, we use the estimators derived in Sect. 4 to perform measurements of tensorial bias parameters. Here, we focus on the tidal bias, b_K², and on the biases associated with the third potential derivatives, $b_{{(\nabla δ)}^{2}} = \frac{1}{2} b_{J_{3 - 3}}$ $b_{(\nabla \delta)^2} = \frac{1}{2} b_{{\mathsf{J}}_{3-3}}$ and b_{J_3 ≡ 3}. For the tidal bias, we have additionally derived the estimator with spatial corrections of the order of 2, so that we can use it to validate that the presented theory indeed behaves as expected. Furthermore, we chose to briefly check the effect of assembly bias on b_K², which has previously been reported to be quite significant. For these measurements, we used the same simulations and jackknife technique (described in Sects. 3.1 and 3.2).

5.1. Tidal bias

We measured the tidal bias, $b_{K^{2}} = \frac{1}{2} b_{J_{2 = 2}}$ $b_{K^2} = \frac{1}{2} b_{{\mathsf{J}}_{2 = 2}}$ , with the estimators from Equation (133) at a zeroth-spatial order and Equation (135) at a spatial order of 2. We show the results as a function of mass in Figure 11. The zeroth-order estimator exhibits a strong scale dependence at scales k_d ≳ 0.15 h Mpc⁻¹. However, overall it seems that the tidal bias is very small in amplitude and that a slightly negative Lagrangian tidal bias is preferred at high halo masses. On the other hand, the second-order spatial estimator has a much lower scale dependence and seems mostly consistent across different damping scales, potentially with a slight inconsistency at very high halo masses and large k_d. This comes at the price of a significantly increased uncertainty in the measured bias values. However, this demonstrates that the estimators we derived in Sect. 4 do indeed behave the way we would expect. More precise measurements of the tidal bias could easily be obtained by employing a larger simulation volume or by averaging over several realisations.

In Figure 12, we show the co-evolution relation between b_K² and b₁. Here, we show for each estimator only the smallest two scales that appear reasonably unbiased in Figure 11. These are k_d = 0.1 h Mpc⁻¹ and k_d = 0.15 h Mpc⁻¹ for the zeroth-order spatial estimator and k_d = 0.25 h Mpc⁻¹ and k_d = 0.3 h Mpc⁻¹ for the second-order spatial estimator. It is noteworthy that the estimators of different orders are in agreement with each other. For the larger k_d values, the estimators have smaller error bars, but do appear slightly biased at large values of b₁ ≳ 2. A rough approximation to the observed functional shape is given by

Fig. 11.

Tidal bias as a function of mass and for different damping scales measured with the zeroth-order spatial estimator (top) versus the second-order spatial estimator (bottom). The second-order spatial estimator exhibits significantly reduced scale dependence at the price of increased error bars.

Fig. 12.

Co-evolution relation b_K² versus b₁. Coloured shaded regions are our measurements and error bars show the measurements from Lazeyras & Schmidt (2018). Black lines include measurements from Zennaro et al. (2022), Modi et al. (2017) and Abidi & Baldauf (2018), an excursion set prediction from Sheth et al. (2013) and an approximation that we propose as b_K² = −0.05b₁².

$\begin{matrix} b_{K^{2}} = - 0.05 b_{1}^{2} . \end{matrix}$ $\begin{aligned} b_{K^2} = -0.05 b_1^2 . \end{aligned}$ (151)

However, the uncertainty is certainly quite high. However, in comparison to some previous literature estimates, we find very small values of b_K². The first measurement that we compare with is from Modi et al. (2017). Their relation is stated for Eulerian bias parameters b_K²^E(b₁^E) and we convert it to a Lagrangian relation via b₁ = b₁^E − 1 and b_K² = b_K²^E + 2/7b₁ (Sheth et al. 2013; Desjacques et al. 2018). Similar to other studies (Lazeyras & Schmidt 2018; Zennaro et al. 2022), we find a strong disagreement between our measurements and the Modi et al. (2017) measurements. Furthermore, we show as a dashed line in Figure 12 the co-evolution relation of the tidal bias that was presented in Zennaro et al. (2022). This relation has been measured at very small scales k ∼ 0.7 h Mpc⁻¹ and is not really expected to accurately represent the large-scale tidal bias. However, it is interesting to see that the large-scale tidal response that we measure here is so much weaker than the response that can be measured at small scales. Next, we made a comparison with the excursion set prediction from Sheth et al. (2013) parameterised as $b_{1}^{E} = 0.524 - 0.547 b_{1}^{E} + 0.046 {(b_{1}^{E})}^{2}$ $b_{1}^{\mathrm{E}} = 0.524 - 0.547 b_{1}^{\mathrm{E}} + 0.046 (b_{1}^{\mathrm{E}})^2$ . This prediction seems to be in better agreement with our measurements, since it generally predicts smaller amplitudes for the tidal bias and a slight negative tendency. However, the predicted upturn in the function at b₁ ≳ 2 does not seem to be present in any of our measurements.

Furthermore, we compared our measurements with the Lagrangian tidal bias inferred by Abidi & Baldauf (2018). For this, we use their fitted b_K²(M) relation and map masses to b₁ values through the Tinker et al. (2010) bias relation evaluated for their simulated cosmology. Our measurements are in good qualitative and quantitative agreement with the results from Abidi & Baldauf (2018) when compared at the same value of b₁.

Finally, we compared our data with the measurements from Lazeyras & Schmidt (2018), again transformed from Eulerian to Lagrangian parameters, indicated via the error bars in Figure 12. Our measurements agree roughly with that study, in that we favor a very small tidal bias with a preferentially slightly negative amplitude. We suggest therefore, that high Lagrangian tidal biases |b_K²|> 0.5 for mass-selected halos are strongly disfavored for lowly biased objects b₁ ≪ 2.

However, we also note that there are some statistically significant differences at b₁ ∼ 0, where we find tidal bias consistent with zero, whereas the data of Lazeyras & Schmidt (2018) clearly favor a negative value of b_K² ∼ −0.2. Given our agreement with Abidi & Baldauf (2018), the fact that past erroneous measurements have rather predicted too large absolute values of the tidal bias rather than too small ones, and that the three lowest b₁ data points of Lazeyras & Schmidt (2018) do not seem really consistent with the overall behaviour of the curve, we suggest that our measurements are more reliable here and that the Lagrangian tidal bias becomes indeed very small |b_K²|≪0.1 for b₁ ≲ 0.5.

We conclude that overall the amplitude of the Lagrangian b_K² is quite low for mass selected halos. Given the challenges of the measurement and the history of errors, it would be desirable to have a clear measurement that does not suffer from the usual scale dependencies and large uncertainties. This could be achieved through anisotropic separate-universe simulations as they have been presented in Stücker et al. (2021), Masaki et al. (2020), Akitsu et al. (2021). We checked whether the simulations that were presented in Stücker et al. (2021) are sufficient to measure the tidal bias at the level that we observe here. The simulation parameter choices were not optimal for measuring it, since those simulations do not have sufficient statistics of high mass objects – which are the objects that would have a significant tidal bias according to Equation (151). However, a reliable measurement could be obtained by running with larger tidal fields that are tuned to measure the tidal bias at higher redshift, where most objects have a larger b₁ and likely a larger b_K². However, that said, we find that the simulations have significant enough statistics to constraint an upper bound |b_K²|< 0.1 (roughly 1σ level) at low masses M_200b ∼ 10¹³ h⁻¹ M_⊙, consistent with our observations here.

5.2. Assembly bias in b_K²

While the tidal bias seems to be quite low for mass selected halos, it is not at all obvious that this should also be the case for galaxies. The formation and selection of galaxies may depend on other properties that are more sensitive to the tidal field, like the spin, the formation time and the halo concentration. For example, in tidal torque theory the spin of an object should depend on the Lagrangian tidal tensor (White 1984). Therefore, it is important to investigate whether there exists assembly bias with respect to such properties. Lazeyras et al. (2021) have shown that the tidal bias b_K²^E and the associated b_K²(b₁) co-evolution relation depend strongly on secondary halo properties. Here, we briefly checked whether we can reproduce these results.

We measure the spin of each halo in our simulation following Bullock et al. (2001) as

$\begin{matrix} λ & = \frac{| L |}{\sqrt{2} v_{200 c} r_{200 c}}, \end{matrix}$ $\begin{aligned} \lambda&= \frac{|\boldsymbol{L}|}{\sqrt{2} v_{200c} r_{200c}}, \end{aligned}$ (152)

where

$\begin{matrix} L & = {〈 r \times v 〉}_{particles} \end{matrix}$ $\begin{aligned} \boldsymbol{L}&= \left\langle {\boldsymbol{r} \times \boldsymbol{v}} \right\rangle _{\mathrm{particles} } \end{aligned}$ (153)

is the specific angular momentum averaged over all particles that are within the bound component of the main subhalo according to SUBFIND and r_200c is the radius within which the density is 200 times the critical density of the Universe and v_200c the circular velocity at that radius.

We then grouped halos in each mass bin by their spin into four quartiles q₁, q₂, q₃ and q₄, q₁ corresponding to the 25% of halos with the lowest spin and q₄ to the 25% with the highest spin. We then measured the bias parameters independently in each quartile, q₁. We show the resulting b_K²(b₁) relation in Figure 13. Here, we present only results for the second-order spatial estimator at a damping scale of k_d = 0.25 h Mpc⁻¹, but we have checked whether other choices lead to the same results.

Fig. 13.

Assembly bias in the Lagrangian b_K²(b₁) relation for halos split into four quartiles by spin (at fixed mass). All lines use the second-order spatial estimator and k_d = 0.25 h Mpc⁻¹ as damping scale. Dashed lines show the variation of b₁ and b_K² at fixed mass. The b_K² − b₁ relation shows a strong degree of assembly bias – with larger spin selections yielding larger b_K² and larger b₁.

The b_K² − b₁ relation exhibits a strong degree of assembly bias with halo spin. For example, at b₁ ∼ 1, the tidal bias of mass selected halos seems very small |b_K²|≪0.1, but after the selection on halo spin, values may range between −1 and 0.5. In individual mass bins, the spin selection increases both the value of b₁ and the value of b_K².

The sign, the b₁ dependence, and the overall amplitude of the spin-assembly bias are all in good agreement with the measurements of Lazeyras et al. (2021). For example, at b₁ = 2, we find assembly bias variations of Δb_K² ∼ 2 (between low and high spins) consistent with their variations at $b_{1}^{E} = 3$ $b_{1}^{E} = 3$ of around Δb_K²^E ∼ 2 and the assembly bias decreases towards lower b₁ and increases towards higher b₁. Therefore, our measurements provide a validation of the results of that study, but it also shows that our method allows us to reliably measure the tidal bias.

While we do not present it here, we have also tested whether we can reproduce the concentration assembly bias that was measured by Lazeyras et al. (2021). We have found that if we also define concentration through v_max/v_200c, then we find an assembly bias of a similar amplitude. However, if concentrations are defined through fits of the halo density profile, the assembly bias is smaller (possibly consistent with 0) and prefers the opposite sign. The concentration assembly bias seems therefore to be a bit more uncertain than that of the spin.

We conclude that even if b_K² seems quite small for mass selected halos, the tidal bias may not so easily be neglected for tracers that follow more general selection criteria. Values of the Lagrangian b_K² of an order of unity (and slightly larger) are a plausible possibility for realistic galaxy catalogues.

5.3. Biases of third derivative terms

In Figure 14, we show the measurements that we obtain by evaluating the bias estimators from Equations (140) and (141) and where we again divide by a factor of 2 to get b_(∇δ)² = b_{J_3 − 3}/2, in accordance with the literature convention.

Fig. 14.

Measurements of biases terms of third derivatives of the potential. The bias of the density gradient b_(∇δ)² (top) has a very strong scale dependence so that we cannot reliably measure it here. Bottom: Fully contracted third derivatives of the potential seem to have a very small associated bias parameter.

The value of b_(∇δ)² appears to depend strongly on the damping scale and we cannot therefore claim a reliable measurement here. However, it seems mostly b_(∇δ)² < 0 which seems consistent with the idea that halos should preferentially form at peaks where density gradients should be small. However, at very large masses, b_(∇δ)² seems to become positive. We are not quite sure how to interpret this behaviour, but it might just be an artefact of the uncorrected scale dependencies. It is worth noting that the negative amplitude of b_(∇δ)² is quite significant b_(∇δ)² ∼ 10 h⁻² Mpc² and even grows towards larger scales. We therefore discuss whether this (usually neglected) bias term is worth including under some circumstances, as described late in this paper. We note that b_(∇δ)² was also measured by Biagetti et al. (2014). However, since we have not properly corrected the scale dependence here, we abstain from a comparison.

On the other hand, b_{J_3 ≡ 3} appears to be quite small, |b_{J_3 ≡ 3}/2|≲5h⁻²Mpc², for everything except the very high mass end. Therefore, the contributions of b_{J_3 ≡ 3} to the bias expansion should be significantly lower than b_(∇δ)².

6. Relevance of bias parameters

We aim to estimate the relevance of different contributions to the bias expansion. If we have a term of the form $\frac{1}{n!} b_{X} \cdot X$ $\frac{1}{n!} b_X \cdot X$ of the re-normalised bias expansion, then we can estimate its relative relevance by comparing the typical difference in the predicted galaxy density field between a bias function, f_X, which includes the term versus an expansion, f₀, which neglects it, namely:

$\begin{matrix} σ_{ϵ, X}^{2} & = 〈 {(f_{X} - f_{0})}^{2} 〉 \\ = {(\frac{b_{X}}{n!})}^{2} 〈 X^{2} 〉, \end{matrix}$ $\begin{aligned} \sigma _{\epsilon ,X}^2&= \left\langle {(f_X - f_0)^2 } \right\rangle \nonumber \\&= \left( \frac{b_X}{n!} \right)^2 \left\langle {X^2} \right\rangle , \end{aligned}$ (154)

where the value of ⟨X²⟩ depends on the adopted damping scale and where σ_ϵ, X is a dimensionless quantity that can be compared across different bias terms. For example, for the contribution of b₃, we have

$\begin{matrix} σ_{ϵ, δ^{3}}^{2} & = {(\frac{b_{3}}{6})}^{2} 〈 {(δ^{3} - 3 δ σ^{2})}^{2} 〉 . \end{matrix}$ $\begin{aligned} \sigma _{\epsilon ,\delta ^3}^2&= \left( \frac{b_3}{6} \right)^2 \left\langle {(\delta ^3 - 3\delta \sigma ^2)^2} \right\rangle . \end{aligned}$ (155)

The value of σ_ϵ, X quantifies the typical error that is made at the field level (in Lagrangian space) when neglecting the corresponding term.

We chose two halo sets M ∼ 10¹⁴ h⁻¹ M_⊙ and M ∼ 10¹⁵ h⁻¹ M_⊙ as examples and measured all their bias parameters with a damping scale of k_d = 0.2 h Mpc⁻¹ and used the second-order spatial estimators for all parameters (except b_(∇δ)² and b_3 ≡ 3). We list the corresponding parameters in Table 2. The first set represents a moderately biased population, whereas the second set represents an example of strongly biased tracers. We then kepy these parameters fixed and evaluate ⟨X²⟩ at many different damping scales to obtain σ_ϵ, X, which we show in Figure 15.

Table 2.

Sets of bias parameters that are shown in Figure 15.

For the moderately biased tracers, the relevance of parameters decreases reasonably between different orders. However, this statement depends strongly on the considered damping scale. For instance, at k_d = 0.4 h Mpc⁻¹ many different terms reach similar amplitudes; whereas at k_d = 0.05 h Mpc⁻¹, there is a strong order of relevance among different terms. If we aim to fit a bias expansion at the field level at k_d = 0.15 h Mpc⁻¹ (marked as a vertical dotted line), then we would find that for the moderately biased set of b₁ > b₂ > b₃ > b₄, all of them lie within a factor of 2 of each other and for the strongly biased set, we even see their order has changed, namely, to: b₂ > b₁ > b₄ > b₃ and all of these lie within a factor of 3 of each other. This is certainly very problematic, since it does not at all seem clear that higher order terms are smaller than lower order terms here, which casts doubts on the convergence of the expansion. Of course this problem can be controlled by simply going to larger smoothing scales, but at the cost of a reduced constraining power. However, another possibility is to make use of the advantages of the cumulant bias parameters. With these parameters, significant parts of higher order terms get already absorbed by lower order parameters. For example, considering the first three terms of the bias expansion in terms of cumulant parameters:

$\begin{matrix} f & = 1 + β_{1} δ + \frac{1}{2} (β_{2} + β_{1}^{2}) (δ^{2} - σ^{2}) \\ + \frac{1}{6} (β_{3} + 3 β_{1} β_{2} + β_{1}^{3}) (δ^{3} - δ σ^{2}) + \dots \end{matrix}$ $\begin{aligned} f&= 1 + \beta _1 \delta + \frac{1}{2} (\beta _2 + \beta _1^2) (\delta ^2 - \sigma ^2) \nonumber \\&\quad \quad + \frac{1}{6} (\beta _3 + 3 \beta _1 \beta _2 + \beta _1^3) (\delta ^3 - \delta \sigma ^2) + \cdots \end{aligned}$ (156)

Here β₁ and β₂ contribute also to the third term and to infinitely many higher order terms. If we write out the polynomial to high orders, but set β₃ = 0, then the leading order error we would make is

$\begin{matrix} σ_{ϵ, β_{3} δ^{3}}^{2} & = {(\frac{β_{3}}{6})}^{2} 〈 {(δ^{3} - 3 δ σ^{2})}^{2} 〉, \end{matrix}$ $\begin{aligned} \sigma _{\epsilon ,\beta _3 \delta ^3}^2&= \left( \frac{\beta _3}{6} \right)^2 \left\langle {(\delta ^3 - 3\delta \sigma ^2)^2} \right\rangle , \end{aligned}$ (157)

which is identical to the error term from the canonical bias terms, but with a different fore-factor. Error estimates of these types are indicated as dashed lines in Figure 15. Strikingly, these are significantly smaller than the canonical terms and it seems now plausibly to reach good convergence at k_d = 0.15 h Mpc⁻¹. For instance, to reach σ_ϵ ≤ 0.1 for the strongly biased set, we would only need to consider the terms (β₁, β₂, β_L).

Fig. 15.

Quantitative contribution of different bias terms to the bias expansion as a function of damping scale and for two different sets of tracers. To avoid cluttering, we omit the bias fore-factors of some terms in the legend. The vertical dotted line indicates an example scale of interest k_d = 0.15 h Mpc⁻¹. The dashed lines show the contributions of cumulant biases at orders of 3 and 4, which can directly be compared to the contributions of b₃ and b₄ (labelled as δ³ and δ⁴).

It is noteworthy that the contribution of b_(∇δ)² is not much smaller than that of b_L so that it might make sense to include this parameter more commonly in the bias expansion at scales k_d ≳ 0.1 h Mpc⁻¹. However, we remind the reader that there are some other possibly important parameters here that we have not considered, so that we cannot present a comprehensive picture of the relevance of bias parameters.

We conclude that it may have significant advantages to phrase the bias expansion in terms of the cumulant parameters, β_n, instead of the canonical bias parameters. This is especially true when considering highly biased tracers or small smoothing scales, where the rate of convergence of the canonical expansion is excruciatingly slow.

7. Conclusions

In this article, we present a method for measuring bias parameters through moments of the galaxy environment distribution, p(δ|g). We have shown that such estimators can be derived both for scalar bias parameters such as b₁ and b_L and for tensorial ones such as b_K² and b_(∇δ)². Furthermore, we demonstrate that they can be made independent of scale if we consider spatial corrections at various orders. We have verified the reliability of these estimators by recovering well established relations between known parameters. Additionally, we used our estimators to measure terms that have been unknown thus far, such as b_3 ≡ 3 and b_(∇δ)², where the latter appears to be particularly large so that it might be worth including it more commonly in the bias expansion.

The main benefits of the method are its simplicity and generality. We are only required to evaluate the linear field of a simulation at the Lagrangian locations of a set of tracers (e.g. given by the most bound id in a halo catalogue) and to take simple expectation values. This measurement uses a minimal number of discretisation steps, making it numerically robust and easy to understand. While a similar approach was previously used by Paranjape et al. (2013a,b), we have shown that it is possible to incorporate spatial corrections at any order to get accurate reconstructions of large-scale bias parameters, even at fairly large damping scales, k_d. In contrast to measurement approaches that depend on forward models, for instance, of power spectra or correlation functions, measured bias parameters are significantly more independent of each other in the method at hand. For example, the measurement of b₁ is completely independent of the assumed order of the expansion in powers of δ. This makes it generally unlikely that the parameters can numerically compensate for the absence of other terms. Finally, we note that all bias parameters can be evaluated at a low computational cost with a single simulation and that it is possible to use arbitrary filtering methods with the slight alterations explained in Appendix A.

However, to avoid confusion, we want to point out a few limitations of our method for estimating the biases. First of all, we note that our method assumes that the background distribution is known analytically. Therefore, it does not immediately translate to situations where this is not the case. For example, at third order in the bias expansion, time derivatives of the tidal tensor appear and these depend on the second Lagrangian perturbation theory potential, ϕ⁽²⁾, which is not a Gaussian random field. Therefore, the formalism can only be translated to the associated bias parameters if the distribution of ϕ⁽²⁾ and its derivatives can be written (or at least adequately approximated) analytically. Furthermore, the formalism cannot directly be applied to the Eulerian galaxy environment distribution p^E(δ|g), since the non-linear Eulerian matter distribution p^E(δ) does also not follow Gaussian statistics.

Beyond the novel bias estimators, we have also newly introduced the cumulant bias parameters

$\begin{matrix} β_{n} & = {\frac{\partial^{n}}{\partial δ_{0}^{n}} log (\frac{n_{g} (δ_{0})}{n_{g, 0}}) |}_{δ_{0} = 0} . \end{matrix}$ $\begin{aligned} \beta _n&= \left. \frac{\partial ^n}{\partial \delta _0^n} \log \left( \frac{n_g (\delta _0)}{n_{g,0}} \right) \right|_{\delta _0 = 0}. \end{aligned}$ (158)

These cumulant bias parameters are proportional to the cumulants of the galaxy environement distribution⁸ and they are related to the canonical bias parameters exactly in the same way as cumulants relate to moments. When measuring these parameters, we found them to be more independent from each other than canonical bias parameters. Furthermore, we have shown that for halos cumulant biases of the order of n ≥ 3 are very close to zero. This has several intriguing consequences:

(1) The galaxy environment distribution and the bias function of halos are very well approximated by a Gaussian. This motivates the use of a Gaussian bias model for the order of n = 2, as well as alternative approaches, for instance, expanding around a Gaussian bias model for orders of n > 2. We will discuss the Gaussian bias model and its desirable properties in detail in a companion paper (Stücker et al. 2025).

(2) The co-evolution relations of halos of the order of n ≥ 3 are equivalent to β_n = 0. Therefore, we may predict such relations between canonical bias parameters at very high orders by combining β_n = 0 with the mapping between canonical and cumulant bias parameters. However, this also suggests that such co-evolution relations may not carry much physical significance, but instead they are the artefacts of a suboptimal basis choice. An expansion around a Gaussian model would naturally incorporate all of these relations at any order.

(3) The bias expansion may have significantly improved convergence when phrased in terms of cumulants instead of canonical bias parameters. At the field level, the convergence of the canonical expansion seems already questionable at scales of k_d ≳ 0.15 h Mpc⁻¹ since the parameters may easily rise as their order rises. However, when phrased in terms of cumulant biases, we expect neglected higher order terms to be significantly less important, as they are already well captured through lower order parameters.

Overall, we expect cumulant biases to be most beneficial whenever the canonical expansion shows poor convergence, for instance, at high masses, at late times, or at small smoothing scales.

¹

Making this measurement in simulations may involve some uncertainty due to finite-size effects and cosmic variance. For simplicity, here we consider the limiting case of an infinite measurement volume so that the relation is completely deterministic.

²

The definitions we have given here should not be confused with those in works that consider the 2d joint distribution function of the smoothed galaxy density, n_g, and the smoothed matter density, p(n_g, δ), and their conditional p(n_g|δ) (Dekel & Lahav 1999). Describing these is significantly more complicated, as they are two-dimensional and they contain stochastic contributions with a complicated dependence on two smoothing operations (for defining the matter contrast and the galaxy density, respectively).

³

Corresponding to a sharp truncation scale in Fourier space, as explained in Sect. 3.2.

⁴

That means to write it in terms of scale independent bias parameters.

⁵

Models that explicitly allow for a scale-dependent density response might still work.

⁶

And additionally symmetric to exchanges between the first two and last two indices. However, this symmetry follows automatically here, so that we do not consider it explicitly.

⁷

https://github.com/jstuecker/probabilistic-bias

⁸

Except for β₂, which has an additional contribution.

Acknowledgments

The authors thank Simon White, Oliver Hahn and Fabian Schmidt for helpful discussions and comments to the draft. JS thanks Oliver Philcox for helpful discussions. We acknowledge funding from the Spanish Ministry of Science and Innovation through grant number PID2021-128338NB-I00. RV acknowledges the support of the Juan de la Cierva fellowship (FJC2021-048002-I). MPI is supported by STFC consolidated grant no. RA5496.

References

Abidi, M. M., & Baldauf, T. 2018, JCAP, 2018, 029 [CrossRef] [Google Scholar]
Akitsu, K., Li, Y., & Okumura, T. 2021, JCAP, 2021, 041 [CrossRef] [Google Scholar]
Alam, S., Aubert, M., Avila, S., et al. 2021, Phys. Rev. D, 103, 083533 [NASA ADS] [CrossRef] [Google Scholar]
Amendola, L., Appleby, S., Avgoustidis, A., et al. 2018, Liv. Rev. Rel., 21, 2 [Google Scholar]
Angulo, R. E., & Hahn, O. 2022, Liv. Rev. Comput. Astrophys., 8, 1 [CrossRef] [Google Scholar]
Angulo, R. E., Zennaro, M., Contreras, S., et al. 2021, MNRAS, 507, 5869 [NASA ADS] [CrossRef] [Google Scholar]
Baldauf, T., Desjacques, V., & Seljak, U. 2015, Phys. Rev. D, 92, 123507 [Google Scholar]
Baldauf, T., Mirbabayi, M., Simonović, M., & Zaldarriaga, M. 2016a, arXiv e-prints [arXiv:1602.00674] [Google Scholar]
Baldauf, T., Seljak, U., Senatore, L., & Zaldarriaga, M. 2016b, JCAP, 2016, 007 [NASA ADS] [CrossRef] [Google Scholar]
Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S. 1986, ApJ, 304, 15 [Google Scholar]
Barreira, A. 2020, JCAP, 2020, 031 [CrossRef] [Google Scholar]
Barreira, A., Lazeyras, T., & Schmidt, F. 2021, JCAP, 2021, 029 [CrossRef] [Google Scholar]
Baumann, D., Nicolis, A., Senatore, L., & Zaldarriaga, M. 2012, JCAP, 2012, 051 [Google Scholar]
Berlind, A. A., & Weinberg, D. H. 2002, ApJ, 575, 587 [Google Scholar]
Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [Google Scholar]
Biagetti, M., Chan, K. C., Desjacques, V., & Paranjape, A. 2014, MNRAS, 441, 1457 [Google Scholar]
Bullock, J. S., Dekel, A., Kolatt, T. S., et al. 2001, ApJ, 555, 240 [NASA ADS] [CrossRef] [Google Scholar]
Cacciato, M., Lahav, O., van den Bosch, F. C., Hoekstra, H., & Dekel, A. 2012, MNRAS, 426, 566 [Google Scholar]
Chaves-Montero, J., Angulo, R. E., Schaye, J., et al. 2016, MNRAS, 460, 3100 [NASA ADS] [CrossRef] [Google Scholar]
Chaves-Montero, J., Angulo, R. E., & Contreras, S. 2023, MNRAS, 521, 937 [Google Scholar]
Chen, S.-F., Vlah, Z., & White, M. 2020, JCAP, 2020, 062 [CrossRef] [Google Scholar]
Colas, T., d’Amico, G., Senatore, L., Zhang, P., & Beutler, F. 2020, JCAP, 2020, 001 [Google Scholar]
Conroy, C., Wechsler, R. H., & Kravtsov, A. V. 2006, ApJ, 647, 201 [Google Scholar]
Contreras, S., Angulo, R. E., & Zennaro, M. 2021, MNRAS, 508, 175 [NASA ADS] [CrossRef] [Google Scholar]
Contreras, S., Chaves-Montero, J., & Angulo, R. E. 2023, MNRAS, 525, 3149 [NASA ADS] [CrossRef] [Google Scholar]
Croton, D. J., Gao, L., & White, S. D. M. 2007, MNRAS, 374, 1303 [Google Scholar]
Croton, D. J., Stevens, A. R. H., Tonini, C., et al. 2016, ApJS, 222, 22 [Google Scholar]
Dalal, N., White, M., Bond, J. R., & Shirokov, A. 2008, ApJ, 687, 12 [NASA ADS] [CrossRef] [Google Scholar]
d’Amico, G., Gleyzes, J., Kokron, N., et al. 2020, JCAP, 2020, 005 [Google Scholar]
Davé, R., Anglés-Alcázar, D., Narayanan, D., et al. 2019, MNRAS, 486, 2827 [Google Scholar]
Dekel, A., & Lahav, O. 1999, ApJ, 520, 24 [Google Scholar]
DeRose, J., Kokron, N., Banerjee, A., et al. 2023, JCAP, 2023, 054 [CrossRef] [Google Scholar]
Desjacques, V., Crocce, M., Scoccimarro, R., & Sheth, R. K. 2010, Phys. Rev. D, 82, 103529 [CrossRef] [Google Scholar]
Desjacques, V., Jeong, D., & Schmidt, F. 2018, Phys. Rep., 733, 1 [Google Scholar]
Dragomir, R., Rodríguez-Puebla, A., Primack, J. R., & Lee, C. T. 2018, MNRAS, 476, 741 [NASA ADS] [CrossRef] [Google Scholar]
Dubois, Y., Pichon, C., Welker, C., et al. 2014, MNRAS, 444, 1453 [Google Scholar]
Faltenbacher, A., & White, S. D. M. 2010, ApJ, 708, 469 [NASA ADS] [CrossRef] [Google Scholar]
Ferreras, I., Hopkins, A. M., Lagos, C., et al. 2019, MNRAS, 487, 435 [Google Scholar]
Frenk, C. S., White, S. D. M., Davis, M., & Efstathiou, G. 1988, ApJ, 327, 507 [NASA ADS] [CrossRef] [Google Scholar]
Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]
Gao, L., Springel, V., & White, S. D. M. 2005, MNRAS, 363, L66 [NASA ADS] [CrossRef] [Google Scholar]
Genel, S., Bryan, G. L., Springel, V., et al. 2019, ApJ, 871, 21 [NASA ADS] [CrossRef] [Google Scholar]
Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]
Ivanov, M. M., Simonović, M., & Zaldarriaga, M. 2020, JCAP, 2020, 042 [Google Scholar]
Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]
Kauffmann, G., Colberg, J. M., Diaferio, A., & White, S. D. M. 1999, MNRAS, 303, 188 [NASA ADS] [CrossRef] [Google Scholar]
Kokron, N., DeRose, J., Chen, S.-F., White, M., & Wechsler, R. H. 2021, MNRAS, 505, 1422 [NASA ADS] [CrossRef] [Google Scholar]
Lacey, C. G., Baugh, C. M., Frenk, C. S., et al. 2016, MNRAS, 462, 3854 [Google Scholar]
Lagos, C. d. P., Tobar, R. J., Robotham, A. S. G., et al. 2018, MNRAS, 481, 3573 [CrossRef] [Google Scholar]
Lazeyras, T., & Schmidt, F. 2018, JCAP, 2018, 008 [CrossRef] [Google Scholar]
Lazeyras, T., & Schmidt, F. 2019, JCAP, 2019, 041 [Google Scholar]
Lazeyras, T., Wagner, C., Baldauf, T., & Schmidt, F. 2016, JCAP, 2016, 018 [CrossRef] [Google Scholar]
Lazeyras, T., Barreira, A., & Schmidt, F. 2021, JCAP, 2021, 063 [CrossRef] [Google Scholar]
Lehmann, B. V., Mao, Y.-Y., Becker, M. R., Skillman, S. W., & Wechsler, R. H. 2017, ApJ, 834, 37 [NASA ADS] [CrossRef] [Google Scholar]
Levi, M., Bebek, C., Beers, T., et al. 2013, arXiv e-prints [arXiv:1308.0847] [Google Scholar]
Li, Y., Hu, W., & Takada, M. 2014, Phys. Rev. D, 89, 083519 [NASA ADS] [CrossRef] [Google Scholar]
Lukacs, E. 1970, Characteristic Functions, 2nd edn. (Griffin) [Google Scholar]
Masaki, S., Nishimichi, T., & Takada, M. 2020, MNRAS, 496, 483 [NASA ADS] [CrossRef] [Google Scholar]
Meurer, A., Smith, C. P., Paprocki, M., et al. 2017, PeerJ Comput. Sci., 3 [Google Scholar]
Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]
Modi, C., Castorina, E., & Seljak, U. 2017, MNRAS, 472, 3959 [Google Scholar]
Modi, C., Chen, S.-F., & White, M. 2020, MNRAS, 492, 5754 [NASA ADS] [CrossRef] [Google Scholar]
Montero-Dorta, A. D., Pérez, E., Prada, F., et al. 2017, ApJ, 848, L2 [NASA ADS] [CrossRef] [Google Scholar]
Musso, M., Paranjape, A., & Sheth, R. K. 2012, MNRAS, 427, 3145 [CrossRef] [Google Scholar]
Nishimichi, T., D’Amico, G., Ivanov, M. M., et al. 2020, Phys. Rev. D, 102, 123541 [NASA ADS] [CrossRef] [Google Scholar]
Ortega-Martinez, S., Contreras, S., & Angulo, R. 2024, A&A, 689, A66 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Paranjape, A., & Sheth, R. K. 2012, MNRAS, 426, 2789 [Google Scholar]
Paranjape, A., Sefusatti, E., Chan, K. C., et al. 2013a, MNRAS, 436, 449 [CrossRef] [Google Scholar]
Paranjape, A., Sheth, R. K., & Desjacques, V. 2013b, MNRAS, 431, 1503 [CrossRef] [Google Scholar]
Paranjape, A., Hahn, O., & Sheth, R. K. 2018, MNRAS, 476, 3631 [NASA ADS] [CrossRef] [Google Scholar]
Peacock, J. A., & Smith, R. E. 2000, MNRAS, 318, 1144 [Google Scholar]
Pellejero Ibañez, M., Stücker, J., Angulo, R. E., et al. 2022, MNRAS, 514, 3993 [Google Scholar]
Pellejero Ibañez, M., Angulo, R. E., Zennaro, M., et al. 2023, MNRAS, 520, 3725 [Google Scholar]
Philcox, O. H. E., & Ivanov, M. M. 2022, Phys. Rev. D, 105, 043517 [CrossRef] [Google Scholar]
Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Reddick, R. M., Wechsler, R. H., Tinker, J. L., & Behroozi, P. S. 2013, ApJ, 771, 30 [Google Scholar]
Salcedo, A. N., Weinberg, D. H., Wu, H.-Y., & Wibking, B. D. 2022a, MNRAS, 510, 5376 [CrossRef] [Google Scholar]
Salcedo, A. N., Zu, Y., Zhang, Y., et al. 2022b, Sci. China Phys. Mech. Astron., 65, 109811 [Google Scholar]
Sato-Polito, G., Montero-Dorta, A. D., Abramo, L. R., Prada, F., & Klypin, A. 2019, MNRAS, 487, 1570 [NASA ADS] [CrossRef] [Google Scholar]
Schaye, J., Crain, R. A., Bower, R. G., et al. 2015, MNRAS, 446, 521 [Google Scholar]
Sheth, R. K., Chan, K. C., & Scoccimarro, R. 2013, Phys. Rev. D, 87, 083002 [NASA ADS] [CrossRef] [Google Scholar]
Springel, V., White, S. D. M., Tormen, G., & Kauffmann, G. 2001, MNRAS, 328, 726 [Google Scholar]
Stevens, A. R. H., Croton, D. J., & Mutch, S. J. 2016, MNRAS, 461, 859 [Google Scholar]
Stücker, J., Schmidt, A. S., White, S. D. M., Schmidt, F., & Hahn, O. 2021, MNRAS, 503, 1473 [CrossRef] [Google Scholar]
Stücker, J., Pellejero-Ibáñez, M., Voivodic, R., & Angulo, R. E. 2025, A&A, 694, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Szalay, A. S. 1988, ApJ, 333, 21 [NASA ADS] [CrossRef] [Google Scholar]
Tinker, J. L., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]
Tucci, B., Montero-Dorta, A. D., Abramo, L. R., Sato-Polito, G., & Artale, M. C. 2021, MNRAS, 500, 2777 [Google Scholar]
Villaescusa-Navarro, F., Anglés-Alcázar, D., Genel, S., et al. 2021, ApJ, 915, 71 [NASA ADS] [CrossRef] [Google Scholar]
Vlah, Z., Castorina, E., & White, M. 2016, JCAP, 2016, 007 [Google Scholar]
Vogelsberger, M., Genel, S., Springel, V., et al. 2014, MNRAS, 444, 1518 [Google Scholar]
Wagner, C., Schmidt, F., Chiang, C. T., & Komatsu, E. 2015, MNRAS, 448, L11 [CrossRef] [Google Scholar]
Wechsler, R. H., Zentner, A. R., Bullock, J. S., Kravtsov, A. V., & Allgood, B. 2006, ApJ, 652, 71 [NASA ADS] [CrossRef] [Google Scholar]
White, S. D. M. 1979, MNRAS, 186, 145 [NASA ADS] [CrossRef] [Google Scholar]
White, S. D. M. 1984, ApJ, 286, 38 [NASA ADS] [CrossRef] [Google Scholar]
Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]
Zennaro, M., Angulo, R. E., Contreras, S., Pellejero-Ibáñez, M., & Maion, F. 2022, MNRAS, 514, 5443 [NASA ADS] [CrossRef] [Google Scholar]
Zennaro, M., Angulo, R. E., Pellejero-Ibáñez, M., et al. 2023, MNRAS, 524, 2407 [Google Scholar]
Zheng, Z., Berlind, A. A., Weinberg, D. H., et al. 2005, ApJ, 633, 791 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Estimators with different filters

In Section 2, we have shown derivations under the assumption of a sharp k-space filter. For this choice all considered bias functions and bias parameters relate trivially to the separate-universe parameters that are commonly referred to in the literature. However, in general other filtering kernels could also be chosen. For example if we define the smoothed density as

$\begin{matrix} δ_{l} (k) & = W (k) δ (k), \end{matrix}$ $\begin{aligned} \delta _l(k)&= W(k) \delta (k), \end{aligned}$ (A.1)

where δ_l is the smoothed density and δ the unsmoothed one. If the filter W is for example chosen to be a Gaussian kernel,

$\begin{matrix} W_{g} (k) = exp (- \frac{k^{2}}{2 k_{d}^{2}}), \end{matrix}$ $\begin{aligned} W_{\mathrm{g} }(k) = \exp \left(-\frac{k^2}{2 k_{\mathrm{d} }^2} \right), \end{aligned}$ (A.2)

instead of a sharp k filter, then some considerations have to be adapted. In principle, all of the formula from Section 2 could still be applied, but they would refer to a different set of bias parameters that describe the response to perturbations of a different shape. Rather than introducing a new set of bias parameters, it is more convenient to directly write the commonly used parameters in terms of the new basis. Therefore, we show here how to directly measure the separate-universe bias parameters from the moments of the galaxy environment distribution that was obtained with a density field that was filtered with any kernel.

First of all, we consider how a bias function on very large scales f_l(δ_l) should relate to the bias function on some much smaller scale f_s(δ_s). If the PBS is still valid on the smaller scale, then the probability of forming a galaxy, given knowledge of δ_s and δ_l depends only on the value of δ_s:

$\begin{matrix} p (g | δ_{s}, δ_{l}) & = p (g | δ_{s}) . \end{matrix}$ $\begin{aligned} p(\mathrm{g} | \delta _s, \delta _l)&= p(\mathrm{g} | \delta _s). \,\, \end{aligned}$ (A.3)

Under this assumption, we can re-express the large scale bias function through the small scale function marginalised over the conditional probability of the small scale densities:

$\begin{matrix} f_{l} (δ_{l}) = \frac{p (g | δ_{l})}{p (g)} & = \int \frac{p (g | δ_{s})}{p (g)} p (δ_{s} | δ_{l}) d δ_{s} \\ = \int f_{s} (δ_{s}) p (δ_{s} | δ_{l}) d δ_{s} . \end{matrix}$ $\begin{aligned} f_l(\delta _l) = \frac{p(\mathrm{g} |\delta _l)}{p(\mathrm{g} )}&= \int \frac{p(\mathrm{g} | \delta _s)}{p(g)} p(\delta _s | \delta _l) d \delta _s \nonumber \\&= \int f_s(\delta _s) p(\delta _s | \delta _l) d \delta _s \,\,. \end{aligned}$ (A.4)

Since, the joint distribution of δ_s and δ_l is a multivariate Gaussian, the conditional distribution corresponds to a multivariate Gaussian that is conditioned on the value of one variable, which itself is a Gaussian with modified mean and covariance:

$\begin{matrix} p (δ_{s} | δ_{l}) & = N (δ_{s}, μ_{*}, σ_{*}), \end{matrix}$ $\begin{aligned} p(\delta _s | \delta _l)&= N(\delta _s, \mu _*, \sigma _*) ,\end{aligned}$ (A.5)

$\begin{matrix} μ_{*} & = α δ_{l}, \end{matrix}$ $\begin{aligned} \mu _*&= \alpha \delta _l ,\end{aligned}$ (A.6)

$\begin{matrix} σ_{*}^{2} & = σ_{ss}^{2} - α^{2} σ_{ll}^{2}, \end{matrix}$ $\begin{aligned} \sigma _*^2&= \sigma _{ss}^2 - \alpha ^2 \sigma _{ll}^2, \end{aligned}$ (A.7)

where α = σ_ls²/σ_ll² corresponds to the correlation between the smoothed large-scale and small-scale density fields, σ_ll² = ⟨δ_lδ_l⟩, σ_ls² = ⟨δ_lδ_s⟩ and σ_ss = ⟨δ_sδ_s⟩.

If we define the re-normalised bias function, F_α, through the bias function f_l as in Section 2, then we find

$\begin{matrix} F_{α} (δ_{0}) & = \int f_{l} (δ_{l}) p (δ_{l} - δ_{0}) d δ_{l} \\ = \int f_{s} (δ_{s}) p_{ss} (δ_{ss} - α δ_{0}) d δ_{s}, \end{matrix}$ $\begin{aligned} F_\alpha (\delta _0)&= \int f_l(\delta _l) p(\delta _l - \delta _0) \mathrm{d} \delta _l \nonumber \\&= \int f_s(\delta _s) p_{ss}(\delta _{ss} - \alpha \delta _0) d \delta _s, \end{aligned}$ (A.8)

where p_ss is simply the background distribution of the small scale densities. This can be shown either by evaluating the integrals (corresponding to convolutions) explicitly or by considering the large scale limit of f_l where σ_* → σ_ss.

We can now relate the re-normalised bias functions obtained with different filtering methods, since we may assume that f_s is independent of the filter employed on large scales. Therefore,

$\begin{matrix} F_{α_{1}} (\frac{δ_{0}}{α_{1}}) = F_{α_{2}} (\frac{δ_{0}}{α_{2}}), \end{matrix}$ $\begin{aligned} F_{\alpha _1} \left(\frac{\delta _0}{\alpha _1} \right) = F_{\alpha _2} \left(\frac{\delta _0}{\alpha _2} \right), \end{aligned}$ (A.9)

where we have indicated the usage of different filters through different α in the index. Therefore, different filters simply lead to a rescaling of the re-normalised bias function. The value of α depends on the filter and the smoothing scale. If we assume that δ_s was obtained with some small scale filter W_s and δ_l with W_l, we have

$\begin{matrix} σ_{ls}^{2} & = 〈 \int δ (k_{1}) W_{s} (k_{1}) d^{3} k_{1} \int δ (k_{2}) W_{l} (k_{2}) d^{3} k_{2} 〉 \\ = \int \int 〈 δ (k_{1}) δ (k_{2}) 〉 W_{s} (k_{1}) W_{l} (k_{2}) d^{3} k_{1} d^{3} k_{2} \\ = \int \int P (k_{1}) δ_{D} (k_{1} - k_{2}) W_{s} (k_{1}) W_{l} (k_{2}) d^{3} k_{1} d^{3} k_{2} \\ = \int P (k) W_{s} (k) W_{l} (k) d^{3} k, \end{matrix}$ $\begin{aligned} \sigma _{ls}^2&= \left\langle {\int \delta (\boldsymbol{k}_1) W_s(k_1) \mathrm{d} ^3\boldsymbol{k}_1 \int \delta (\boldsymbol{k}_2) W_l(k_2) \mathrm{d} ^3\boldsymbol{k}_2 } \right\rangle \nonumber \\&= \int \int \left\langle {\delta (\boldsymbol{k}_1) \delta (\boldsymbol{k}_2)} \right\rangle W_s(k_1) W_l(k_2) \mathrm{d} ^3\boldsymbol{k}_1 \mathrm{d} ^3\boldsymbol{k}_2 \nonumber \\&= \int \int P(k_1) \delta _D(\boldsymbol{k}_1 - \boldsymbol{k}_2) W_s(k_1) W_l(k_2) \mathrm{d} ^3\boldsymbol{k}_1 \mathrm{d} ^3\boldsymbol{k}_2 \nonumber \\&= \int P(k) W_s(k) W_l(k) \mathrm{d} ^3\boldsymbol{k} ,\end{aligned}$ (A.10)

$\begin{matrix} \approx \int P (k) W_{l} (k) d^{3} k, \end{matrix}$ $\begin{aligned}&\approx \int P(k) W_l(k) \mathrm{d} ^3\boldsymbol{k,} \end{aligned}$ (A.11)

where the last approximation assumes that W_s is very close to 1 over the support of W_l, because W_l corresponds to a much larger scale smoothing than W_s. Furthermore, we have

$\begin{matrix} σ_{ll}^{2} & = \int P (k) W_{l}^{2} (k) d^{3} k \end{matrix}$ $\begin{aligned} \sigma _{ll}^2&= \int P(k) W_l^2(k) \mathrm{d} ^3\boldsymbol{k} \end{aligned}$ (A.12)

$\begin{matrix} α & = \frac{\int P (k) W_{l} (k) d^{3} k}{\int P (k) W_{l}^{2} (k) d^{3} k} . \end{matrix}$ $\begin{aligned} \alpha&= \frac{\int P(k) W_l(k) \mathrm{d} ^3\boldsymbol{k}}{\int P(k) W_l^2(k) \mathrm{d} ^3\boldsymbol{k}} \,\,. \end{aligned}$ (A.13)

It is easy to see that α = 1 for any sharp Fourier space filter, independent of the filtering scale. However, for other filters it is important to note that α may be generally larger than 1, for example α ∼ 2 for a Gaussian filter at scales of k_d ∼ 0.1 h Mpc⁻¹. Furthermore, α depends generally on the filtering-scale. The re-normalised bias function does therefore only correspond to a well-defined limit at arbitrary large scales if α is constant (as for the sharp k filter).

Since the case α = 1 seems clearly to be the simplest and leads to a sensibly scale-independent re-normalisation, we consider it as the reference case and simply label its re-normalised bias function F, and we have focused in the main-text on the discussion of this case. However, we can still use the relations derived in the main-text to measure the properties of F directly through measuring aspects of some filter-dependent function F_α.

We directly show this for the general multivariate case here by adapting the considerations from Section 2.7. For the multivariate case the re-normalised bias functions are relate as

$\begin{matrix} F_{A} (x_{0}) & = F (A x_{0}), \end{matrix}$ $\begin{aligned} F_\mathsf{A }(\boldsymbol{x}_0)&= F(\mathsf A \boldsymbol{x}_0), \end{aligned}$ (A.14)

where

$\begin{matrix} A & = C_{ls} C_{ll}^{- 1}, \end{matrix}$ $\begin{aligned} \mathsf A&= \mathsf C _{ls} \mathsf C _{ll}^{-1} ,\end{aligned}$ (A.15)

$\begin{matrix} C_{ls} & = 〈 x_{l} \otimes x_{s} 〉, \end{matrix}$ $\begin{aligned} \mathsf C _{ls}&= \left\langle { \boldsymbol{x}_l \otimes \boldsymbol{x}_s } \right\rangle ,\end{aligned}$ (A.16)

$\begin{matrix} C_{ll} & = 〈 x_{l} \otimes x_{l} 〉, \end{matrix}$ $\begin{aligned} \mathsf C _{ll}&= \left\langle { \boldsymbol{x}_l \otimes \boldsymbol{x}_l } \right\rangle , \end{aligned}$ (A.17)

where C_ls may more conveniently be measured as ⟨x_m⊗x_m⟩, where x_m is evaluated in the linear fields filtered with $\sqrt{W_{l}}$ $\sqrt{W_l}$ .

The cumulant-generating function of the galaxy environment distribution with δ_l is again given through Equation (64), just with F_A instead of F:

$\begin{matrix} K_{A} (t) & = \frac{1}{2} t^{T} C_{ll} t + log (F_{A} (C_{ll} t)) \\ = \frac{1}{2} t^{T} C_{ll} t + log (F (C_{ls} t)), \end{matrix}$ $\begin{aligned} K_\mathsf{A }(\boldsymbol{t})&= \frac{1}{2} \boldsymbol{t}^T \mathsf C _{ll} \boldsymbol{t} + \log \left( F_\mathsf{A }(\mathsf C _{ll} \boldsymbol{t}) \right) \nonumber \\&= \frac{1}{2} \boldsymbol{t}^T \mathsf C _{ll} \boldsymbol{t} + \log \left( F(\mathsf C _{ls} \boldsymbol{t}) \right), \end{aligned}$ (A.18)

For measuring the cumulant bias parameters, we may then define a more convenient variable,

$\begin{matrix} u_{*} & = C_{ls} x_{0}, \end{matrix}$ $\begin{aligned} \boldsymbol{u}_*&= \mathsf C _{ls} \boldsymbol{x}_0, \end{aligned}$ (A.19)

which has the cumulant generating function,

$\begin{matrix} K_{u_{*}} (t) & = \frac{1}{2} t^{T} C_{ls}^{- 1} C_{ll} C_{ls}^{- 1} t + log (F (t)) . \end{matrix}$ $\begin{aligned} K_{\boldsymbol{u}_*}(\boldsymbol{t})&= \frac{1}{2} \boldsymbol{t}^T \mathsf C _{ls}^{-1} \mathsf C _{ll} \mathsf C _{ls}^{-1} \boldsymbol{t} + \log \left( F(\boldsymbol{t}) \right). \end{aligned}$ (A.20)

Thus, the cumulant bias parameters can be inferred by differentiating, leading to:

$\begin{matrix} β_{i j k . . .} & = {\begin{matrix} κ_{u_{*}, i j k . . .} & i f i + j + k + . . . \neq 2, \\ κ_{u_{*}, i j k . . .} - C_{*, a b}^{- 1} & i f i + j + k + . . . = 2, \end{matrix} \end{matrix}$ $\begin{aligned} \boldsymbol{\beta }_{ijk...}&= {\left\{ \begin{array}{ll} \kappa _{u_*,ijk...}&\mathrm \quad if \quad i + j + k + ... \ne 2, \\ \kappa _{u_*,ijk...} - C^{-1}_{*,ab}&\mathrm \quad if \quad i + j + k + ... = 2, \end{array}\right.} \end{aligned}$ (A.21)

where $C_{*}^{- 1} = C_{ls}^{- 1} C_{ll} C_{ls}^{- 1}$ $\mathsf{C}_*^{-1} = \mathsf{C}_{ls}^{-1} \mathsf{C}_{ll} \mathsf{C}_{ls}^{-1}$ . We can therefore measure the separate-universe bias parameters with an arbitrarily filtered distribution in a very similar manner to the one using a sharp k-space filter. It is worth noting that the inferred relations show also an important property: If β_n vanish for n > 2, which automatically implies that the corresponding cumulants vanish for any filtered galaxy environment distribution. Therefore, if the bias function is Gaussian for one filter, then it will also appear Gaussian for any other filter.

Appendix B: Isotropic tensors

B.1. Tensors with three or more symmetry groups

In the main text, we limit our discussion to an explanation of how to obtain basis tensors for tensor spaces with at most two symmetry groups. However, for higher order biases, for instance, the equivalent of b₃ the isotropic tensor decomposition requires consideration of additional symmetries. We define 𝕍₂₂₂ as the space of tensors that are symmetric under permutations of the first two, second two and third two indices. Furthermore, we define 𝕍₍₂₂₂₎ as the space of tensors that are additionally invariant to exchanges of these symmetry groups. These spaces have different number of basis tensors,

$\begin{matrix} V_{222} & = span ({I_{222}, I_{2 - 2 - 2 -}, I_{2 = 22}, I_{22 = 2}, I_{222 =}}), \end{matrix}$ $\begin{aligned} \mathbb{V} _{222}&= \mathrm{span} \left(\{ \mathsf I _{222}, \mathsf I _{2-2-2-}, \mathsf I _{2 = 22}, \mathsf I _{22 = 2}, \mathsf I _{222=}\}\right) , \end{aligned}$ (B.1)

$\begin{matrix} V_{(222)} & = span ({I_{222}, I_{2 - 2 - 2 -}, I_{(2 = 22)}}), \end{matrix}$ $\begin{aligned} \mathbb{V} _{(222)}&= \mathrm{span} \left( \{ \mathsf I _{222}, \mathsf I _{2-2-2-}, \mathsf I _{(2 = 22)}\} \right), \end{aligned}$ (B.2)

where

$\begin{matrix} I_{(2 = 22)} & = \frac{1}{3} (I_{2 = 22} + I_{22 = 2} + I_{222 =}) \end{matrix}$ $\begin{aligned} \mathsf I _{(2 = 22)}&= \frac{1}{3} (\mathsf I _{2 = 22} + \mathsf I _{22 = 2} + \mathsf I _{222=}) \end{aligned}$ (B.3)

is additionally symmetrised between exchanges between the symmetry groups and the other tensors are given by

$\begin{matrix} I_{222, i j k l m n} & = S_{222} (δ_{ij} δ_{kl} δ_{mn}), \end{matrix}$ $\begin{aligned} I_{222,ijklmn}&= S_{222}(\delta _{ij}\delta _{kl}\delta _{mn}) ,\end{aligned}$ (B.4)

$\begin{matrix} I_{2 - 2 - 2 -, i j k l m n} & = S_{222} (δ_{ik} δ_{lm} δ_{nj}), \end{matrix}$ $\begin{aligned} I_{2-2-2-,ijklmn}&= S_{222}(\delta _{ik}\delta _{lm}\delta _{nj}) ,\end{aligned}$ (B.5)

$\begin{matrix} I_{2 = 22, i j k l m n} & = S_{222} (δ_{ik} δ_{jl} δ_{mn}), \end{matrix}$ $\begin{aligned} I_{2 = 22,ijklmn}&= S_{222}(\delta _{ik}\delta _{jl}\delta _{mn}) ,\end{aligned}$ (B.6)

$\begin{matrix} I_{22 = 2, i j k l m n} & = S_{222} (δ_{ij} δ_{km} δ_{\ln}), \end{matrix}$ $\begin{aligned} I_{22 = 2,ijklmn}&= S_{222}(\delta _{ij}\delta _{km}\delta _{ln}) ,\end{aligned}$ (B.7)

$\begin{matrix} I_{222 =, i j k l m n} & = S_{222} (δ_{im} δ_{kl} δ_{jn}) . \end{matrix}$ $\begin{aligned} I_{222=,ijklmn}&= S_{222}(\delta _{im}\delta _{kl}\delta _{jn}). \end{aligned}$ (B.8)

The basis tensors of 𝕍₍₂₂₂₎ are visualised through a diagram in Figure B.1. The third derivative tensor of that would appear in the bias expansion can be decomposed in this space

Fig. B.1.

Illustration of the basis tensors of 𝕍₂₂₂ versus those of 𝕍₍₂₂₂₎. Top: without the symmetry requirement between groups (indicated by usage of different symbols for each group) there are five basis tensors. Bottom: With the additional symmetry requirement between groups (indicated by usage of the same symbol for each group), since the latter three tensors of 𝕍₂₂₂ get all united to a single symmetrised term (bottom).

$\begin{matrix} {\frac{\partial^{3} F}{\partial T \partial T \partial T} |}_{T = 0} \in V_{(222)} . \end{matrix}$ $\begin{aligned} \left. \frac{\partial ^3 F}{\partial \mathsf T \partial \mathsf T \partial \mathsf T } \right|_\mathsf{T = 0} \in \mathbb{V} _{(222)}. \end{aligned}$ (B.9)

B.2. Covariance of potential derivatives

To find an explicit form for the distribution of the derivatives of the tidal tensor and other higher order derivatives of the potential, we need to find their covariance matrix and invert it.

For this we first consider the covariance between any derivatives of the potential. This is given by

$\begin{matrix} C_{i j . . ., a b . . .} & = ⟨ ϕ_{i j . . .} ϕ_{a b . . .} ⟩ \\ = \frac{1}{{(2 π)}^{6}} 〈 \int \int \frac{(i k_{i}) (i k_{j}) . . . (i k_{a}) (i k_{b}) . . .}{k^{4}} \\ e^{- i (k_{1} - k_{2}) x} δ_{k}^{*} (k_{1}) δ_{k} (k_{2}) d k_{1} d k_{2} 〉, \end{matrix}$ $\begin{aligned} C_{ij...,ab...}&= \langle \phi _{ij...} \phi _{ab...} \rangle \nonumber \\&= \frac{1}{(2 \pi )^6} \left\langle \int \int \frac{(ik_i)(ik_j) ... (ik_a) (ik_b) ...}{k^4} \right. \nonumber \\&\quad \quad \quad \quad \quad \left. e^{-i (\boldsymbol{k_1} - \boldsymbol{k_2}) \boldsymbol{x}} \delta _k^*(\boldsymbol{k}_1) \delta _k(\boldsymbol{k}_2) \mathrm{d} \boldsymbol{k}_1 \mathrm{d} \boldsymbol{k}_2 \right\rangle , \end{aligned}$ (B.10)

where δ_k is the Fourier transform of density field δ and we have used that δ is a real field so that δ_k(k) = δ_k^*(−k). Using the definition of the power spectrum,

$\begin{matrix} ⟨ δ_{k}^{*} (k_{1}) δ_{k} (k_{2}) ⟩ & = (2 π^{3}) P (k) δ_{D}^{3} (k_{1} - k_{2}), \end{matrix}$ $\begin{aligned} \langle \delta _k^*(\boldsymbol{k}_1) \delta _k(\boldsymbol{k}_2) \rangle&= (2 \pi ^3) P(\boldsymbol{k}) \delta _\mathrm{D} ^3(\boldsymbol{k}_1 - \boldsymbol{k}_2), \end{aligned}$ (B.11)

where δ_D is the Dirac delta function, we can simplify

$\begin{matrix} C_{i j . . ., a b . . .} & = \frac{1}{{(2 π)}^{3}} \int \frac{(i k_{i}) (i k_{j}) . . . (i k_{a}) (i k_{b}) . . .}{k^{4}} P (k) d^{3} k . \end{matrix}$ $\begin{aligned} C_{ij...,ab...}&= \frac{1}{(2 \pi )^3} \int \frac{(ik_i)(ik_j) ... (ik_a) (ik_b) ...}{k^4} P(k) \mathrm{d} ^3\boldsymbol{k.} \end{aligned}$ (B.12)

We can see that this expression does not depend on the order of the indices, nor whether they stem from the first or the second potential term. Therefore, we just count the total number n₁ of occurrences of k_x, n₂ of k_y and n₃ of k_z. Without loss of generality we can order the coordinates, such that n₁ ≥ n₂ ≥ n₃. Furthermore, we can switch to spherical coordinates and replace the cosine of the azimutal variable:

$\begin{matrix} k & = (\begin{matrix} k μ \\ k {(1 - μ)}^{2} cos ϕ \\ k {(1 - μ)}^{2} {sin ϕ)}^{T} \end{matrix}), \end{matrix}$ $\begin{aligned} \boldsymbol{k}&= \begin{pmatrix} k \mu \\ k (1 - \mu )^2 \cos \phi \\ k (1 - \mu )^2 \sin \phi )^T \end{pmatrix}, \end{aligned}$ (B.13)

so that we have

$\begin{matrix} Σ (i j . . . a b) & = \frac{σ_{M}^{2}}{4 π} \int_{- 1}^{1} μ^{n_{1}} d μ \int_{0}^{2 π} {cos}^{n_{2}} (ϕ) {sin}^{n_{3}} (ϕ) d ϕ, \end{matrix}$ $\begin{aligned} \Sigma (ij...ab)&= \frac{\sigma _M^2}{4 \pi } \int _{-1}^{1} \mu ^{n_1} \mathrm{d} \mu \int _{0}^{2 \pi } \cos ^{n_2}(\phi ) \sin ^{n_3}(\phi ) \mathrm{d} \phi ,\end{aligned}$ (B.14)

$\begin{matrix} σ_{M}^{2} & = \frac{1}{2 π^{2}} \int P (k) k^{2 M} d k, \end{matrix}$ $\begin{aligned} \sigma _M^2&= \frac{1}{2 \pi ^2} \int P(k) k^{2M} \mathrm{d} k ,\end{aligned}$ (B.15)

$\begin{matrix} M & = \frac{n_{1} + n_{2} + n_{3} - 4}{2} . \end{matrix}$ $\begin{aligned} M&= \frac{n_1 + n_2 + n_3 - 4}{2} \,\,. \end{aligned}$ (B.16)

These integrals can easily be evaluated by hand or with a computer algebra system. We list a few examples of the covariances in Table B.1 We note that only terms where each n₁, n₂, and n₃ are even are non-zero. We note that all uneven terms are zero, so that, for instance, the two-second derivatives of the potential have zero covariance with the third derivatives, but a non-zero covariance with the fourth derivatives.

Table B.1.

Covariances of various derivatives of the potential.

B.3. The distribution of the tidal tensor

From Equation (B.12) it can be seen that the covariance tensor of any derivatives of the potential has to be an isotropic tensor with full symmetry in all indices. Therefore, the covariance matrix of the tidal tensor is C_T ∈ 𝕍₄ so that it has to be proportional to I₄ with a coefficient that we can read off from table B.1:

$\begin{matrix} C_{T} & = 〈 T \otimes T 〉 \\ = \frac{σ^{2}}{5} J_{4} . \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T }&= \left\langle \mathsf{T \otimes \mathsf T } \right\rangle \nonumber \\&= \frac{\sigma ^2}{5} \mathsf J _4 \,\,. \end{aligned}$ (B.17)

This tensor is not invertible, since, for instance, the T₁₂ component is perfectly degenerate with the T₂₁ component. However, in such cases where the distribution of T can still be inferred by constructing a generalised inverse $C_{T}^{+}$ $\mathsf{C}_{\mathsf{T}}^{+}$ , which has the following property:

$\begin{matrix} C C_{T}^{+} C & = C_{T} . \end{matrix}$ $\begin{aligned} \mathsf C \mathsf C _\mathsf{T }^{+} \mathsf C&= \mathsf C _\mathsf{T }. \end{aligned}$ (B.18)

It then holds that

$\begin{matrix} p (T) & = N exp (- \frac{1}{2} T^{T} C_{T}^{+} T) . \end{matrix}$ $\begin{aligned} p(\mathsf T )&= N \exp \left( -\frac{1}{2} \mathsf T ^T \mathsf C _\mathsf{T }^+ \mathsf T \right). \end{aligned}$ (B.19)

To find the generalised inverse we may assume that C_T⁺ ∈ 𝕍₂₂, so that we can make the Ansatz

$\begin{matrix} C_{T}^{+} & = A_{22} J_{22} + A_{2 = 2} J_{2 = 2} . \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T }^{+}&= A_{22} \mathsf J _{22} + A_{2 = 2} \mathsf J _{2 = 2}. \end{aligned}$ (B.20)

We can use the decomposition $J_{4} = \frac{5}{9} J_{22} + \frac{2}{3} J_{2 = 2}$ $\mathsf{J}_{4} = \frac{5}{9} \mathsf{J}_{22} + \frac{2}{3} \mathsf{J}_{2 = 2}$ (determined computationally) and evaluate Equation (B.18) term by term:

$\begin{matrix} C_{T} C_{T}^{+} & = A_{22} \frac{σ^{2}}{9} (J_{22} \overset{(2)}{\cdot} J_{22}) + A_{2 = 2} \frac{2 σ^{2}}{15} (J_{2 = 2} \overset{(2)}{\cdot} J_{2 = 2}), \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T } \mathsf C _\mathsf{T }^{+}&= A_{22} \frac{\sigma ^{2}}{9} (\mathsf J _{22} \overset{(2)}{\cdot } \mathsf J _{22}) + A_{2 = 2} \frac{2 \sigma ^{2}}{15} (\mathsf J _{2 = 2} \overset{(2)}{\cdot } \mathsf J _{2 = 2}) ,\end{aligned}$ (B.21)

$\begin{matrix} = 3 A_{22} \frac{σ^{2}}{9} J_{22} + A_{2 = 2} \frac{2 σ^{2}}{15} J_{2 = 2}, \end{matrix}$ $\begin{aligned}&= 3 A_{22} \frac{\sigma ^{2}}{9} \mathsf J _{22} + A_{2 = 2} \frac{2 \sigma ^{2}}{15} \mathsf J _{2 = 2},\end{aligned}$ (B.22)

$\begin{matrix} C_{T} C_{T}^{+} C_{T} & = 9 A_{22} {(\frac{σ^{2}}{9})}^{2} J_{22} + A_{2 = 2} {(\frac{2 σ^{2}}{15})}^{2} J_{2 = 2}, \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T } \mathsf C _\mathsf{T }^{+} \mathsf C _\mathsf{T }&= 9 A_{22} \left( \frac{\sigma ^{2}}{9} \right)^2 \mathsf J _{22} + A_{2 = 2} \left(\frac{2 \sigma ^{2}}{15} \right)^2 \mathsf J _{2 = 2},\end{aligned}$ (B.23)

$\begin{matrix} \equiv C_{T} = \frac{σ^{2}}{9} J_{22} + \frac{2 σ^{2}}{15} J_{2 = 2}, \end{matrix}$ $\begin{aligned}&\equiv \mathsf C _\mathsf{T } = \frac{\sigma ^{2}}{9} \mathsf J _{22} + \frac{2 \sigma ^{2}}{15} \mathsf J _{2 = 2}, \end{aligned}$ (B.24)

where we have evaluated the appearing tensor products computationally. By identifying coefficients, we find $A_{22} = \frac{1}{σ^{2}}$ $A_{22} = \frac{1}{\sigma^2}$ and $A_{2 = 2} = \frac{15}{2 σ^{2}}$ $A_{2 = 2} = \frac{15}{2 \sigma^2}$ and therefore we have the generalised inverse of the covariance matrix:

$\begin{matrix} C_{T}^{+} & = \frac{1}{σ^{2}} J_{22} + \frac{15}{2 σ^{2}} J_{2 = 2} \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T }^+&= \frac{1}{\sigma ^{2}} \mathsf J _{22} + \frac{15}{2 \sigma ^{2}} \mathsf J _{2 = 2} \end{aligned}$ (B.25)

and the distribution of the tidal tensor is given by

$\begin{matrix} p (T) & = N exp (- \frac{2 T^{T} J_{22} T + 15 T^{T} J_{2 = 2} T}{4 σ^{2}}) \end{matrix}$ $\begin{aligned} p(\mathsf T )&= N \exp \left(- \frac{2 \mathsf T ^T \mathsf J _{22} \mathsf T + 15 \mathsf T ^T \mathsf J _{2 = 2} \mathsf T }{4 \sigma ^2} \right) \end{aligned}$ (B.26)

$\begin{matrix} = N exp (- \frac{2 δ^{2} + 15 K^{2}}{4 σ^{2}}) . \end{matrix}$ $\begin{aligned}&= N \exp \left(- \frac{2 \delta ^2 + 15 K^2}{4 \sigma ^2} \right) \,\,. \end{aligned}$ (B.27)

B.4. The distribution of third derivatives

The third derivative tensor of the potential S has the covariance matrix

$\begin{matrix} C_{S} & = 〈 S \otimes S 〉, \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{S }&= \left\langle \mathsf{S \otimes \mathsf S } \right\rangle ,\end{aligned}$ (B.28)

$\begin{matrix} = \frac{σ_{1}^{2}}{7} J_{6}, \end{matrix}$ $\begin{aligned}&= \frac{\sigma _1^2}{7} \mathsf J _6,\end{aligned}$ (B.29)

$\begin{matrix} = \frac{3 σ_{1}^{2}}{25} J_{3 - 3} + \frac{2 σ_{1}^{2}}{35} J_{3 \equiv 3} . \end{matrix}$ $\begin{aligned}&= \frac{3 \sigma _{1}^{2}}{25} \mathsf J _{3-3} + \frac{2 \sigma _{1}^{2}}{35} \mathsf J _{3\equiv 3}. \,\, \end{aligned}$ (B.30)

Analogously to the calculation in Section B.3, we find the generalised inverse

$\begin{matrix} C_{S}^{+} & = \frac{3}{σ_{1}^{2}} J_{3 - 3} + \frac{35}{2 σ_{1}^{2}} J_{3 \equiv 3}, \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{S }^+&= \frac{3}{\sigma _1^{2}} \mathsf J _{3-3} + \frac{35}{2 \sigma _1^{2}} \mathsf J _{3\equiv 3}, \end{aligned}$ (B.31)

where we have again used numerical representations to evaluate products between isotropic tensors when evaluating Equation (B.18). Therefore, the distribution of S is given by

$\begin{matrix} p (S) & = N exp (- \frac{35 S J_{3 \equiv 3} S - 15 S J_{3 - 3} S}{4 σ_{1}^{2}}) \\ = : N exp (- \frac{35 S_{3 \equiv 3} - 15 S_{3 - 3}}{4 σ_{1}^{2}}) . \end{matrix}$ $\begin{aligned} p(\mathsf S )&= N \exp \left(-\frac{35 \mathsf S \mathsf J _{3 \equiv 3} \mathsf S - 15 \mathsf S \mathsf J _{3-3} \mathsf S }{4 \sigma _{1}^{2}}\right) \nonumber \\&=: N \exp \left(-\frac{35 S_{3 \equiv 3} - 15 S_{3-3} }{4 \sigma _{1}^{2}}\right)\,\,. \end{aligned}$ (B.32)

where in the last line we have defined some new scalar variables through the given contractions, where S_3 − 3 = (∇δ)².

B.5. Joint distribution of second and fourth derivatives

We can write the covariance matrix of fourth and second derivatives as a block matrix

$\begin{matrix} C_{T R} & = (\begin{matrix} 〈 T \otimes T 〉 & 〈 T \otimes R 〉 \\ 〈 R \otimes T 〉 & 〈 R \otimes R 〉 \end{matrix}), \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T \mathsf R }&= \begin{pmatrix} \left\langle \mathsf{T \otimes \mathsf T } \right\rangle&\left\langle \mathsf{T \otimes \mathsf R } \right\rangle \\ \left\langle \mathsf{R \otimes \mathsf T } \right\rangle&\left\langle \mathsf{R \otimes \mathsf R } \right\rangle \end{pmatrix} , \end{aligned}$ (B.33)

$\begin{matrix} = (\begin{matrix} \frac{σ^{2}}{5} J_{4} & \frac{σ_{1}^{2}}{7} J_{6} \\ \frac{σ_{1}^{2}}{7} J_{6} & \frac{σ_{2}^{2}}{9} J_{8} \end{matrix}), \end{matrix}$ $\begin{aligned} &= \begin{pmatrix} \frac{\sigma ^2}{5} \mathsf J _4&\frac{\sigma _1^2}{7} \mathsf J _{6} \\ \frac{\sigma _1^2}{7} \mathsf J _{6}&\frac{\sigma _2^2}{9} \mathsf J _8 \end{pmatrix} , \end{aligned}$ (B.34)

$\begin{matrix} = (\begin{matrix} \frac{σ_{0}^{2}}{9} J_{22} + \frac{2 σ_{0}^{2}}{15} J_{2 = 2} & \frac{σ_{1}^{2}}{15} J_{24} + \frac{4 σ_{1}^{2}}{35} J_{2 = 4} \\ \frac{σ_{1}^{2}}{15} J_{42} + \frac{4 σ_{1}^{2}}{35} J_{4 = 2} & \frac{σ_{2}^{2}}{25} J_{44} + \frac{24 σ_{2}^{2}}{245} J_{4 = 4} + \frac{8 σ_{2}^{2}}{315} J_{4 = = 4} \end{matrix}) . \end{matrix}$ $\begin{aligned} &= \begin{pmatrix}\frac{\sigma _{0}^{2}}{9} \mathsf J _{22} + \frac{2 \sigma _{0}^{2}}{15} \mathsf J _{2 = 2}&\frac{\sigma _{1}^{2}}{15} \mathsf J _{24} + \frac{4 \sigma _{1}^{2}}{35} \mathsf J _{2 = 4}\\ \frac{\sigma _{1}^{2}}{15} \mathsf J _{42} + \frac{4 \sigma _{1}^{2} }{35} \mathsf J _{4 = 2}&\frac{\sigma _{2}^{2} }{25} \mathsf J _{44} + \frac{24 \sigma _{2}^{2} }{245} \mathsf J _{4 = 4} + \frac{8 \sigma _{2}^{2} }{315} \mathsf J _{4= = 4}\end{pmatrix}\,\,. \end{aligned}$ (B.35)

For its inverse, we make the Ansatz

$\begin{matrix} C_{T R}^{+} & = (\begin{matrix} A J_{22} + B J_{2 = 2} & C J_{24} + D J_{2 = 4} \\ C J_{42} + D J_{4 = 2} & E J_{44} + F J_{4 = 4} + G J_{4 = = 4} \end{matrix}) \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T \mathsf R }^+&= \begin{pmatrix} A \mathsf J _{22} + B \mathsf J _{2 = 2}&C\mathsf J _{24} + D\mathsf J _{2 = 4} \\ C\mathsf J _{42} + D\mathsf J _{4 = 2}&E \mathsf J _{44} + F \mathsf J _{4 = 4} + G \mathsf J _{4= = 4} \end{pmatrix} \end{aligned}$ (B.36)

and we find the solution to its generalised inverse by solving

$\begin{matrix} C_{T R} = C_{T R} C_{T R}^{+} C_{T R}, \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T \mathsf R } = \mathsf C _\mathsf{T \mathsf R } \mathsf C _\mathsf{T \mathsf R }^+ \mathsf C _\mathsf{T \mathsf R }, \end{aligned}$ (B.37)

with a computer algebra system and find

$\begin{matrix} C_{T R}^{+} = [\begin{matrix} \frac{15 σ_{2}^{2} J_{2 = 2}}{2 σ_{0}^{2} σ_{2}^{2} - 2 σ_{1}^{4}} + \frac{σ_{2}^{2} J_{22}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} & - \frac{15 σ_{1}^{2} J_{2 = 4}}{2 σ_{0}^{2} σ_{2}^{2} - 2 σ_{1}^{4}} - \frac{σ_{1}^{2} J_{24}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} \\ \frac{15 σ_{1}^{2} J_{4 = 2}}{2 σ_{0}^{2} σ_{2}^{2} - 2 σ_{1}^{4}} - \frac{σ_{1}^{2} J_{42}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} & \frac{15 σ_{0}^{2} J_{4 = 4}}{2 σ_{0}^{2} σ_{2}^{2} - 2 σ_{1}^{4}} + \frac{σ_{0}^{2} J_{44}}{σ_{0}^{2} σ_{2}^{2} - σ_{1}^{4}} + \frac{315 J_{4 = = 4}}{8 σ_{2}^{2}} \end{matrix}] . \end{matrix}$ $\begin{aligned} \mathsf C _\mathsf{T \mathsf R }^+= \left[\begin{matrix}\frac{15 \sigma _{2}^{2} \mathsf J _{2 = 2}}{2 \sigma _{0}^{2} \sigma _{2}^{2} - 2 \sigma _{1}^{4}} + \frac{\sigma _{2}^{2} \mathsf J _{22}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}&- \frac{15 \sigma _{1}^{2} \mathsf J _{2 = 4}}{2 \sigma _{0}^{2} \sigma _{2}^{2} - 2 \sigma _{1}^{4}} - \frac{\sigma _{1}^{2} \mathsf J _{24}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}\\ \frac{15 \sigma _{1}^{2} \mathsf J _{4 = 2}}{2 \sigma _{0}^{2} \sigma _{2}^{2} - 2 \sigma _{1}^{4}} - \frac{\sigma _{1}^{2} \mathsf J _{42}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}}&\frac{15 \sigma _{0}^{2} \mathsf J _{4 = 4}}{2 \sigma _{0}^{2} \sigma _{2}^{2} - 2 \sigma _{1}^{4}} + \frac{\sigma _{0}^{2} \mathsf J _{44}}{\sigma _{0}^{2} \sigma _{2}^{2} - \sigma _{1}^{4}} + \frac{315 \mathsf J _{4= = 4}}{8 \sigma _{2}^{2}}\end{matrix}\right]. \end{aligned}$ (B.38)

The joint distribution of second and fourth derivatives of the potential can be written as

$\begin{matrix} p (T, R) & = N exp (- \frac{1}{2} (\begin{matrix} T & R \end{matrix}) C_{T R}^{+} (\begin{matrix} T \\ R \end{matrix})) . \end{matrix}$ $\begin{aligned} p(\mathsf T , \mathsf R )&= N \exp \left( - \frac{1}{2} \begin{pmatrix} \mathsf T&\mathsf R \end{pmatrix} \mathsf C _\mathsf{T \mathsf R }^+ \begin{pmatrix} \mathsf T \\ \mathsf R \end{pmatrix} \right) \,\,. \end{aligned}$ (B.39)

B.6. Tidal estimator with spatial correction

We find the tidal estimator with spatial corrections of the order of 2 as:

$\begin{matrix} b_{J_{2 = 2}} & = {〈 \frac{1}{p (T, R)} \frac{\partial^{2} p (T, R)}{\partial T \partial T} \overset{(4)}{\cdot} \frac{J_{2 = 2}}{| | J_{2 = 2} | |^{2}} 〉}_{g}, \end{matrix}$ $\begin{aligned} b_\mathsf{J _{2 = 2}}&= \left\langle {\frac{1}{p(\mathsf T , \mathsf R )} \frac{\partial ^2 p(\mathsf T , \mathsf R )}{\partial \mathsf T \partial \mathsf T } \overset{(4)}{\cdot } \frac{\mathsf J _{2 = 2}}{\left||{ \mathsf J _{2 = 2} } \right||^2}} \right\rangle _{\mathrm{g} } ,\end{aligned}$ (B.40)

$\begin{matrix} = \frac{15}{4 σ_{*}^{8}} {〈 3 K^{2} σ_{2}^{4} + 6 ϕ_{2 = 4} σ_{1}^{2} σ_{2}^{2} + 3 ϕ_{4 = 4} σ_{1}^{4} - 2 σ_{2}^{2} σ_{*}^{4} 〉}_{g}, \end{matrix}$ $\begin{aligned}&= \frac{15}{4 \sigma _{*}^{8}} \left\langle {3 K^2 \sigma _{2}^{4} + 6 \phi _{2 = 4} \sigma _{1}^{2} \sigma _{2}^{2} + 3 \phi _{4 = 4} \sigma _{1}^{4} - 2 \sigma _{2}^{2} \sigma _{*}^{4}} \right\rangle _{\mathrm{g} } , \end{aligned}$ (B.41)

where

$\begin{matrix} ϕ_{2 = 4} & : = T J_{2 = 4} R = \sum_{ij} \partial_{i} \partial_{j} ϕ \partial_{i} \partial_{j} δ - \frac{1}{3} δ \nabla^{2} δ, \end{matrix}$ $\begin{aligned} \phi _{2 = 4}&:= \mathsf T J_{2 = 4} \mathsf R = \sum _{ij} \partial _i \partial _j \phi \partial _i \partial _j \delta - \frac{1}{3} \delta \nabla ^2 \delta ,\end{aligned}$ (B.42)

$\begin{matrix} ϕ_{4 = 4} & : = T J_{4 = 4} R = \sum_{ij} {(\partial_{i} \partial_{j} δ)}^{2} - \frac{1}{3} {(\nabla^{2} δ)}^{2} . \end{matrix}$ $\begin{aligned} \phi _{4 = 4}&:= \mathsf T J_{4 = 4} \mathsf R = \sum _{ij} (\partial _i \partial _j \delta )^2 - \frac{1}{3} (\nabla ^2 \delta )^2. \end{aligned}$ (B.43)

While we do not list them here, we have verified that if we analogously evaluate the expressions for b_J₂, b_J₄, b_J₂₂ and b_J₂₄ that they are identical to the estimators for b₁, b_L, b₂, and b_δ, L, respectively, from Equations (49) - (53).

All Tables

Table 1.

Orthogonal basis tensors that we consider here.

In the text

Table 2.

Sets of bias parameters that are shown in Figure 15.

In the text

Table B.1.

Covariances of various derivatives of the potential.

In the text

All Figures

Fig. 1.

Illustration of the inference of the halo environment distribution p(δ|g). Left: Galaxies traced back to their origin in Lagrangian space (marked as black dots) and with the (smoothed) linear density field, δ, inferred at their Lagrangian locations. Right: Environment distribution (orange histogram) given by the distribution of δ at the galaxy locations which is notably biased relative to the matter distribution, p(δ), (blue histogram and a Gaussian represented as dashed line). The galaxy environment distribution is well approximated through p(δ)f(δ), where here f(δ) is a quadratic polynomial bias function.

In the text

Fig. 2.

b₁ as a function of halo mass using the estimators from Equation (15) at the top and Equation (49) at the bottom, measured at different damping scales (different coloured regions). The shaded regions indicate the 1σ certainty region of the estimators. Using the b₁ estimator that includes the Laplacian correction increases the uncertainty of the b₁ estimates, but reduces the dependence on the damping scale, leading to a good agreement across different scales.

In the text

Fig. 3.

Scale dependence of the error in b₁ estimates for different halo mass selections. The error is expressed relatively to the measurement with spatial order of 2 at $k_{d} = 0.2 h {Mpc}^{- 1}$ $k_\mathrm{{d}} = 0.2\, h\, \mathrm{Mpc}^{-1}$ . On large scales (small k_d) the zero-spatial-order estimators from Equation (15) converge well to this estimate, whereas the spatial order of 2 estimators from Equation (49) agree well up to k_d ≳ 0.25 h Mpc⁻¹ where slight disagreements arise. The spatial order of 4 estimates remain scale-independent even beyond this scale.

In the text

Fig. 4.

b₂ and β₂ as a function of b₁ = β₁ using the second-order spatial estimators. The black dashed line is the b₂(b₁) relation inferred by Lazeyras et al. (2016) from separate-universe simulations. The second-order spatial estimators seem consistent with the literature co-evolution relation down to damping scales of k_d ∼ 0.15 − 0.2 h Mpc⁻¹. It is noteworthy that β₂ < 0 appears to always hold, which means that the width of the galaxy environment distribution p(δ|g) is always smaller than that of the background p(δ). Therefore, the cumulant bias parameter β₂ − β₁ relation appears slightly simpler than the b₂ − b₁ relation.

In the text

	Fig. 5. Uncertainty of b₂ measurements (solid lines) and β₂ measurements (dashed lines) as a function of b₁ for different damping scales. The uncertainty of β₂ is significantly smaller than of b₂ for high values of b₁.
In the text

	Fig. 6. Correlation coefficient α₁₂ between the b₁ and b₂ measurement (solid) and the β₁ and β₂ measurement (dashed). Note: β₂ as a parameter is much more independent of the value of b₁ than b₂.
In the text

Fig. 7.

Co-evolution relations of higher order bias parameters b₃ and β₃ (top) and b₄ and β₄ (bottom) for the second-order spatial bias estimators. For b₄ we indicate as a dashed line a prediction that follows from combining the Lazeyras et al. (2016) measurements of b₁, b₂ and b₃ with Equation (30) when using β₄ = 0. Strikingly, β₃ and β₄ are extremely close to 0 – independently of the value of b₁.

In the text

	Fig. 8. Lagrangian Laplacian bias b_L as a function of halo mass. At masses M ≲ 3 × 10¹³ h⁻¹ M_⊙ the scale dependence of the b_L measurements disappears and they agree well with the fits of the Eulerian Laplacian bias from Lazeyras & Schmidt (2019).
In the text

Fig. 9.

Maximum damping scale where the PBS can be valid and bias scale-independent. The three reddish contours show the break-down scale of density-only bias models, estimated with bias parameters measured at different scales, and the three green-blueish contours show it for (δ, L) bias models. The black dashed line shows the wavenumber associated with the Lagrangian radius of halos. The break-down scale is consistent across different measurements and only seems to scale strongly with halo radius for the (δ) case.

In the text

Fig. 10.

Graphic representation of the isotropic tensors that form a basis for a few selected isotropic tensor spaces with symmetries. All basis tensors of a space with given symmetry can be constructed by considering the number of different ways that the symmetry groups can be connected. In this figure each circle with number n represents a group of n fully symmetric indices and each connection represents one delta symbol (that can either connect two indices from the same group or from two different groups).

In the text

	Fig. 11. Tidal bias as a function of mass and for different damping scales measured with the zeroth-order spatial estimator (top) versus the second-order spatial estimator (bottom). The second-order spatial estimator exhibits significantly reduced scale dependence at the price of increased error bars.
In the text

	Fig. 12. Co-evolution relation b_K² versus b₁. Coloured shaded regions are our measurements and error bars show the measurements from Lazeyras & Schmidt (2018). Black lines include measurements from Zennaro et al. (2022), Modi et al. (2017) and Abidi & Baldauf (2018), an excursion set prediction from Sheth et al. (2013) and an approximation that we propose as b_K² = −0.05b₁².
In the text

Fig. 13.

Assembly bias in the Lagrangian b_K²(b₁) relation for halos split into four quartiles by spin (at fixed mass). All lines use the second-order spatial estimator and k_d = 0.25 h Mpc⁻¹ as damping scale. Dashed lines show the variation of b₁ and b_K² at fixed mass. The b_K² − b₁ relation shows a strong degree of assembly bias – with larger spin selections yielding larger b_K² and larger b₁.

In the text

	Fig. 14. Measurements of biases terms of third derivatives of the potential. The bias of the density gradient b_(∇δ)² (top) has a very strong scale dependence so that we cannot reliably measure it here. Bottom: Fully contracted third derivatives of the potential seem to have a very small associated bias parameter.
In the text

Fig. 15.

Quantitative contribution of different bias terms to the bias expansion as a function of damping scale and for two different sets of tracers. To avoid cluttering, we omit the bias fore-factors of some terms in the legend. The vertical dotted line indicates an example scale of interest k_d = 0.15 h Mpc⁻¹. The dashed lines show the contributions of cumulant biases at orders of 3 and 4, which can directly be compared to the contributions of b₃ and b₄ (labelled as δ³ and δ⁴).

In the text

Fig. B.1.

Illustration of the basis tensors of 𝕍₂₂₂ versus those of 𝕍₍₂₂₂₎. Top: without the symmetry requirement between groups (indicated by usage of different symbols for each group) there are five basis tensors. Bottom: With the additional symmetry requirement between groups (indicated by usage of the same symbol for each group), since the latter three tensors of 𝕍₂₂₂ get all united to a single symmetrised term (bottom).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Abidi, M. M., & Baldauf, T. 2018, JCAP, 2018, 029 [CrossRef] [Google Scholar]

[2] Akitsu, K., Li, Y., & Okumura, T. 2021, JCAP, 2021, 041 [CrossRef] [Google Scholar]

[3] Alam, S., Aubert, M., Avila, S., et al. 2021, Phys. Rev. D, 103, 083533 [NASA ADS] [CrossRef] [Google Scholar]

[4] Amendola, L., Appleby, S., Avgoustidis, A., et al. 2018, Liv. Rev. Rel., 21, 2 [Google Scholar]

[5] Angulo, R. E., & Hahn, O. 2022, Liv. Rev. Comput. Astrophys., 8, 1 [CrossRef] [Google Scholar]

[6] Angulo, R. E., Zennaro, M., Contreras, S., et al. 2021, MNRAS, 507, 5869 [NASA ADS] [CrossRef] [Google Scholar]

[7] Baldauf, T., Desjacques, V., & Seljak, U. 2015, Phys. Rev. D, 92, 123507 [Google Scholar]

[8] Baldauf, T., Mirbabayi, M., Simonović, M., & Zaldarriaga, M. 2016a, arXiv e-prints [arXiv:1602.00674] [Google Scholar]

[9] Baldauf, T., Seljak, U., Senatore, L., & Zaldarriaga, M. 2016b, JCAP, 2016, 007 [NASA ADS] [CrossRef] [Google Scholar]

[10] Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S. 1986, ApJ, 304, 15 [Google Scholar]

[11] Barreira, A. 2020, JCAP, 2020, 031 [CrossRef] [Google Scholar]

[12] Barreira, A., Lazeyras, T., & Schmidt, F. 2021, JCAP, 2021, 029 [CrossRef] [Google Scholar]

[13] Baumann, D., Nicolis, A., Senatore, L., & Zaldarriaga, M. 2012, JCAP, 2012, 051 [Google Scholar]

[14] Berlind, A. A., & Weinberg, D. H. 2002, ApJ, 575, 587 [Google Scholar]

[15] Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [Google Scholar]

[16] Biagetti, M., Chan, K. C., Desjacques, V., & Paranjape, A. 2014, MNRAS, 441, 1457 [Google Scholar]

[17] Bullock, J. S., Dekel, A., Kolatt, T. S., et al. 2001, ApJ, 555, 240 [NASA ADS] [CrossRef] [Google Scholar]

[18] Cacciato, M., Lahav, O., van den Bosch, F. C., Hoekstra, H., & Dekel, A. 2012, MNRAS, 426, 566 [Google Scholar]

[19] Chaves-Montero, J., Angulo, R. E., Schaye, J., et al. 2016, MNRAS, 460, 3100 [NASA ADS] [CrossRef] [Google Scholar]

[20] Chaves-Montero, J., Angulo, R. E., & Contreras, S. 2023, MNRAS, 521, 937 [Google Scholar]

[21] Chen, S.-F., Vlah, Z., & White, M. 2020, JCAP, 2020, 062 [CrossRef] [Google Scholar]

[22] Colas, T., d’Amico, G., Senatore, L., Zhang, P., & Beutler, F. 2020, JCAP, 2020, 001 [Google Scholar]

[23] Conroy, C., Wechsler, R. H., & Kravtsov, A. V. 2006, ApJ, 647, 201 [Google Scholar]

[24] Contreras, S., Angulo, R. E., & Zennaro, M. 2021, MNRAS, 508, 175 [NASA ADS] [CrossRef] [Google Scholar]

[25] Contreras, S., Chaves-Montero, J., & Angulo, R. E. 2023, MNRAS, 525, 3149 [NASA ADS] [CrossRef] [Google Scholar]

[26] Croton, D. J., Gao, L., & White, S. D. M. 2007, MNRAS, 374, 1303 [Google Scholar]

[27] Croton, D. J., Stevens, A. R. H., Tonini, C., et al. 2016, ApJS, 222, 22 [Google Scholar]

[28] Dalal, N., White, M., Bond, J. R., & Shirokov, A. 2008, ApJ, 687, 12 [NASA ADS] [CrossRef] [Google Scholar]

[29] d’Amico, G., Gleyzes, J., Kokron, N., et al. 2020, JCAP, 2020, 005 [Google Scholar]

[30] Davé, R., Anglés-Alcázar, D., Narayanan, D., et al. 2019, MNRAS, 486, 2827 [Google Scholar]

[31] Dekel, A., & Lahav, O. 1999, ApJ, 520, 24 [Google Scholar]

[32] DeRose, J., Kokron, N., Banerjee, A., et al. 2023, JCAP, 2023, 054 [CrossRef] [Google Scholar]

[33] Desjacques, V., Crocce, M., Scoccimarro, R., & Sheth, R. K. 2010, Phys. Rev. D, 82, 103529 [CrossRef] [Google Scholar]

[34] Desjacques, V., Jeong, D., & Schmidt, F. 2018, Phys. Rep., 733, 1 [Google Scholar]

[35] Dragomir, R., Rodríguez-Puebla, A., Primack, J. R., & Lee, C. T. 2018, MNRAS, 476, 741 [NASA ADS] [CrossRef] [Google Scholar]

[36] Dubois, Y., Pichon, C., Welker, C., et al. 2014, MNRAS, 444, 1453 [Google Scholar]

[37] Faltenbacher, A., & White, S. D. M. 2010, ApJ, 708, 469 [NASA ADS] [CrossRef] [Google Scholar]

[38] Ferreras, I., Hopkins, A. M., Lagos, C., et al. 2019, MNRAS, 487, 435 [Google Scholar]

[39] Frenk, C. S., White, S. D. M., Davis, M., & Efstathiou, G. 1988, ApJ, 327, 507 [NASA ADS] [CrossRef] [Google Scholar]

[40] Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]

[41] Gao, L., Springel, V., & White, S. D. M. 2005, MNRAS, 363, L66 [NASA ADS] [CrossRef] [Google Scholar]

[42] Genel, S., Bryan, G. L., Springel, V., et al. 2019, ApJ, 871, 21 [NASA ADS] [CrossRef] [Google Scholar]

[43] Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]

[44] Ivanov, M. M., Simonović, M., & Zaldarriaga, M. 2020, JCAP, 2020, 042 [Google Scholar]

[45] Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]

[46] Kauffmann, G., Colberg, J. M., Diaferio, A., & White, S. D. M. 1999, MNRAS, 303, 188 [NASA ADS] [CrossRef] [Google Scholar]

[47] Kokron, N., DeRose, J., Chen, S.-F., White, M., & Wechsler, R. H. 2021, MNRAS, 505, 1422 [NASA ADS] [CrossRef] [Google Scholar]

[48] Lacey, C. G., Baugh, C. M., Frenk, C. S., et al. 2016, MNRAS, 462, 3854 [Google Scholar]

[49] Lagos, C. d. P., Tobar, R. J., Robotham, A. S. G., et al. 2018, MNRAS, 481, 3573 [CrossRef] [Google Scholar]

[50] Lazeyras, T., & Schmidt, F. 2018, JCAP, 2018, 008 [CrossRef] [Google Scholar]

[51] Lazeyras, T., & Schmidt, F. 2019, JCAP, 2019, 041 [Google Scholar]

[52] Lazeyras, T., Wagner, C., Baldauf, T., & Schmidt, F. 2016, JCAP, 2016, 018 [CrossRef] [Google Scholar]

[53] Lazeyras, T., Barreira, A., & Schmidt, F. 2021, JCAP, 2021, 063 [CrossRef] [Google Scholar]

[54] Lehmann, B. V., Mao, Y.-Y., Becker, M. R., Skillman, S. W., & Wechsler, R. H. 2017, ApJ, 834, 37 [NASA ADS] [CrossRef] [Google Scholar]

[55] Levi, M., Bebek, C., Beers, T., et al. 2013, arXiv e-prints [arXiv:1308.0847] [Google Scholar]

[56] Li, Y., Hu, W., & Takada, M. 2014, Phys. Rev. D, 89, 083519 [NASA ADS] [CrossRef] [Google Scholar]

[57] Lukacs, E. 1970, Characteristic Functions, 2nd edn. (Griffin) [Google Scholar]

[58] Masaki, S., Nishimichi, T., & Takada, M. 2020, MNRAS, 496, 483 [NASA ADS] [CrossRef] [Google Scholar]

[59] Meurer, A., Smith, C. P., Paprocki, M., et al. 2017, PeerJ Comput. Sci., 3 [Google Scholar]

[60] Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]

[61] Modi, C., Castorina, E., & Seljak, U. 2017, MNRAS, 472, 3959 [Google Scholar]

[62] Modi, C., Chen, S.-F., & White, M. 2020, MNRAS, 492, 5754 [NASA ADS] [CrossRef] [Google Scholar]

[63] Montero-Dorta, A. D., Pérez, E., Prada, F., et al. 2017, ApJ, 848, L2 [NASA ADS] [CrossRef] [Google Scholar]

[64] Musso, M., Paranjape, A., & Sheth, R. K. 2012, MNRAS, 427, 3145 [CrossRef] [Google Scholar]

[65] Nishimichi, T., D’Amico, G., Ivanov, M. M., et al. 2020, Phys. Rev. D, 102, 123541 [NASA ADS] [CrossRef] [Google Scholar]

[66] Ortega-Martinez, S., Contreras, S., & Angulo, R. 2024, A&A, 689, A66 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[67] Paranjape, A., & Sheth, R. K. 2012, MNRAS, 426, 2789 [Google Scholar]

[68] Paranjape, A., Sefusatti, E., Chan, K. C., et al. 2013a, MNRAS, 436, 449 [CrossRef] [Google Scholar]

[69] Paranjape, A., Sheth, R. K., & Desjacques, V. 2013b, MNRAS, 431, 1503 [CrossRef] [Google Scholar]

[70] Paranjape, A., Hahn, O., & Sheth, R. K. 2018, MNRAS, 476, 3631 [NASA ADS] [CrossRef] [Google Scholar]

[71] Peacock, J. A., & Smith, R. E. 2000, MNRAS, 318, 1144 [Google Scholar]

[72] Pellejero Ibañez, M., Stücker, J., Angulo, R. E., et al. 2022, MNRAS, 514, 3993 [Google Scholar]

[73] Pellejero Ibañez, M., Angulo, R. E., Zennaro, M., et al. 2023, MNRAS, 520, 3725 [Google Scholar]

[74] Philcox, O. H. E., & Ivanov, M. M. 2022, Phys. Rev. D, 105, 043517 [CrossRef] [Google Scholar]

[75] Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[76] Reddick, R. M., Wechsler, R. H., Tinker, J. L., & Behroozi, P. S. 2013, ApJ, 771, 30 [Google Scholar]

[77] Salcedo, A. N., Weinberg, D. H., Wu, H.-Y., & Wibking, B. D. 2022a, MNRAS, 510, 5376 [CrossRef] [Google Scholar]

[78] Salcedo, A. N., Zu, Y., Zhang, Y., et al. 2022b, Sci. China Phys. Mech. Astron., 65, 109811 [Google Scholar]

[79] Sato-Polito, G., Montero-Dorta, A. D., Abramo, L. R., Prada, F., & Klypin, A. 2019, MNRAS, 487, 1570 [NASA ADS] [CrossRef] [Google Scholar]

[80] Schaye, J., Crain, R. A., Bower, R. G., et al. 2015, MNRAS, 446, 521 [Google Scholar]

[81] Sheth, R. K., Chan, K. C., & Scoccimarro, R. 2013, Phys. Rev. D, 87, 083002 [NASA ADS] [CrossRef] [Google Scholar]

[82] Springel, V., White, S. D. M., Tormen, G., & Kauffmann, G. 2001, MNRAS, 328, 726 [Google Scholar]

[83] Stevens, A. R. H., Croton, D. J., & Mutch, S. J. 2016, MNRAS, 461, 859 [Google Scholar]

[84] Stücker, J., Schmidt, A. S., White, S. D. M., Schmidt, F., & Hahn, O. 2021, MNRAS, 503, 1473 [CrossRef] [Google Scholar]

[85] Stücker, J., Pellejero-Ibáñez, M., Voivodic, R., & Angulo, R. E. 2025, A&A, 694, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[86] Szalay, A. S. 1988, ApJ, 333, 21 [NASA ADS] [CrossRef] [Google Scholar]

[87] Tinker, J. L., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]

[88] Tucci, B., Montero-Dorta, A. D., Abramo, L. R., Sato-Polito, G., & Artale, M. C. 2021, MNRAS, 500, 2777 [Google Scholar]

[89] Villaescusa-Navarro, F., Anglés-Alcázar, D., Genel, S., et al. 2021, ApJ, 915, 71 [NASA ADS] [CrossRef] [Google Scholar]

[90] Vlah, Z., Castorina, E., & White, M. 2016, JCAP, 2016, 007 [Google Scholar]

[91] Vogelsberger, M., Genel, S., Springel, V., et al. 2014, MNRAS, 444, 1518 [Google Scholar]

[92] Wagner, C., Schmidt, F., Chiang, C. T., & Komatsu, E. 2015, MNRAS, 448, L11 [CrossRef] [Google Scholar]

[93] Wechsler, R. H., Zentner, A. R., Bullock, J. S., Kravtsov, A. V., & Allgood, B. 2006, ApJ, 652, 71 [NASA ADS] [CrossRef] [Google Scholar]

[94] White, S. D. M. 1979, MNRAS, 186, 145 [NASA ADS] [CrossRef] [Google Scholar]

[95] White, S. D. M. 1984, ApJ, 286, 38 [NASA ADS] [CrossRef] [Google Scholar]

[96] Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]

[97] Zennaro, M., Angulo, R. E., Contreras, S., Pellejero-Ibáñez, M., & Maion, F. 2022, MNRAS, 514, 5443 [NASA ADS] [CrossRef] [Google Scholar]

[98] Zennaro, M., Angulo, R. E., Pellejero-Ibáñez, M., et al. 2023, MNRAS, 524, 2407 [Google Scholar]

[99] Zheng, Z., Berlind, A. A., Weinberg, D. H., et al. 2005, ApJ, 633, 791 [NASA ADS] [CrossRef] [Google Scholar]

Probabilistic Lagrangian bias estimators and the cumulant bias expansion

1. Introduction

2. Theory

2.1. Definitions

2.2. Bias estimators

2.3. The moment generating function

2.4. Cumulant bias parameters

2.5. Interpretation

2.6. Multivariate estimators

2.7. Multivariate cumulants

3. Density bias measurements

3.1. Simulation

3.2. Bias measurements

3.3. b1 and the scale dependence of estimators

3.4. b2 versus β2

3.5. Higher order biases

3.6. Laplacian bias

3.7. The scale-split break-down scale

4. Estimators for tensorial bias parameters

4.1. Tensorial bias expansion

4.2. Isotropic tensors

4.3. Symmetric isotropic tensors

4.4. Orthogonal basis

4.5. Bias estimators

4.6. Tidal bias

4.7. Estimators for third derivative terms

4.8. Tensorial cumulant biases

5. Measurements of tensorial bias parameters

5.1. Tidal bias

5.2. Assembly bias in bK2

5.3. Biases of third derivative terms

6. Relevance of bias parameters

7. Conclusions

Acknowledgments

References

Appendix A: Estimators with different filters

Appendix B: Isotropic tensors

B.1. Tensors with three or more symmetry groups

B.2. Covariance of potential derivatives

B.3. The distribution of the tidal tensor

B.4. The distribution of third derivatives

B.5. Joint distribution of second and fourth derivatives

B.6. Tidal estimator with spatial correction

All Tables

All Figures

3.3. b₁ and the scale dependence of estimators

3.4. b₂ versus β₂

5.2. Assembly bias in b_K²