Issue |
A&A
Volume 695, March 2025
|
|
---|---|---|
Article Number | A159 | |
Number of page(s) | 9 | |
Section | Cosmology (including clusters of galaxies) | |
DOI | https://doi.org/10.1051/0004-6361/202452709 | |
Published online | 14 March 2025 |
Exploring the physical origins of halo assembly bias from early times
1
Departamento de Física, Universidad Técnica Federico Santa María, Avenida Vicuña Mackenna 3939, San Joaquín, Santiago, Chile
2
Donostia International Physics Center, Manuel Lardizabal Ibilbidea, 4, 20018 Donostia, Gipuzkoa, Spain
3
Universidad Andres Bello, Facultad de Ciencias Exactas, Departamento de Ciencias Físicas, Instituto de Astrofísica, Av. Fernández Concha 700, Santiago, Chile
4
CONICET, Instituto de Astronomía Teórica y Experimental (IATE), Laprida 854, Córdoba X5000BGR, Argentina
5
Universidad Nacional de Córdoba (UNC), Observatorio Astronómico de Córdoba (OAC), Laprida 854, Córdoba X5000BGR, Argentina
6
Instituto de Astrofísica de Canarias, s/n, E-38205 La Laguna, Tenerife, Spain
7
Departamento de Astrofísica, Universidad de La Laguna, E-38206 La Laguna, Tenerife, Spain
⋆ Corresponding author; antonio.montero@usm.cl
Received:
22
October
2024
Accepted:
17
January
2025
Context. The large-scale linear halo bias encodes the relation between the clustering of dark-matter (DM) halos and that of the underlying matter density field. Although the primary dependence of bias on halo mass is well understood in the context of structure formation, the physical origins of the multiple additional relations at fixed halo mass, commonly known as secondary halo bias, have not been fully elucidated. Of particular relevance is the secondary dependence on the halo assembly history, known as halo assembly bias.
Aims. Our goal is to determine whether the properties of the initial regions from which z = 0 halos originate produce any secondary bias at z = 0. By analyzing these initial dependences in connection with halo assembly bias, we intend to provide insight into the physical origins of the effect.
Methods. We selected halos at z = 0 in the IllustrisTNG DM-only simulation and traced back the positions and velocities of their DM particles to z = 12. The resulting initial regions were characterized according to several shape-related and kinematic properties. The secondary bias signal produced by these properties at z = 0 was measured using an object-by-object bias estimator, which offers significant analytical advantages as compared to the traditional approach.
Results. We show that, when split by the properties of their initial DM clouds, z = 0 halos display significant secondary bias, clearly exceeding the amplitude of the well-known halo assembly bias signal produced by concentration and age. The maximum bias segregation is measured for cloud velocity dispersion and radial velocity, followed by cloud concentration, sphericity, ellipticity, and triaxiality. We further show that both velocity dispersion and radial velocity are also the properties of the initial clouds that most strongly correlate with halo age and concentration at fixed halo mass. Our results highlight the importance of linear effects in shaping halo assembly bias.
Key words: methods: numerical / methods: statistical / cosmology: theory / dark matter / large-scale structure of Universe
© The Authors 2025
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
1. Introduction
The large-scale linear bias of dark matter (DM) halos encodes fundamental information on the process of halo formation from the underlying matter density field. Halo bias depends primarily on the peak height of density fluctuations, ν, as it can be analytically derived from Λ-cold dark matter (Λ-CDM) structure formation formalisms (see, e.g., Kaiser 1984; Bardeen et al. 1986; Mo & White 1996; Sheth & Tormen 1999; Sheth et al. 2001; Tinker et al. 2010). This fundamental dependence naturally affects the halo virial mass: more massive halos are more tightly clustered (have a higher bias) than less massive halos. In the last few decades, this rather simple picture has become more complex, as a number of additional secondary dependences of halo bias (i.e., at fixed halo mass) have been unveiled using cosmological simulations (see, e.g., Sheth & Tormen 2004; Gao et al. 2005; Wechsler et al. 2006; Gao & White 2007; Dalal et al. 2008; Angulo et al. 2008; Li et al. 2008; Faltenbacher & White 2010; Lazeyras et al. 2017; Salcedo et al. 2018; Mao et al. 2018; Han et al. 2019; Sato-Polito et al. 2019; Johnson et al. 2019; Ramakrishnan et al. 2019; Contreras et al. 2019, 2021a; Montero-Dorta et al. 2020, 2021; Tucci et al. 2021; Montero-Dorta & Rodriguez 2024; Balaguera-Antolínez et al. 2024). Among these dependences, which are called secondary halo bias by some authors, the dependence on the assembly history of halos, or halo assembly bias, is the effect that has drawn the most attention: at fixed halo mass, halos that assemble a significant fraction of their mass earlier on are more tightly clustered than those that take more time to acquire the same mass. Throughout this work, halo assembly bias is specifically defined as the dependence on two internal assembly properties: the virial concentration and the half-mass formation scale factor1 (the scale factor at which the halo has acquired half its final z = 0 mass).
The physical origins of the multiple secondary halo bias dependences are still not fully determined, although several key results have been reported. For halo assembly bias studies, following Dalal et al. (2008), it is common to divide samples into two mass regimes, motivated by the conspicuous reversal of the secondary-bias signal measured for concentration at Mvir ≃ 1013h−1 M⊙ (e.g., Gao & White 2007; Sato-Polito et al. 2019). At the high-mass end, Dalal et al. (2008) proposed that the fact that lower-concentration halos have higher bias than higher-concentration halos was tightly connected with the statistics of density peaks in the initial field. At fixed ν, peaks of lower curvature are more clustered and tend to accrete their mass earlier on as compared to high-curvature peaks, which suggests a connection with the concentration dependence of halo bias.
The above argument is clearly not valid at the low-mass end, as pointed out by Dalal et al. (2008) as well (it is also unclear how the peak-curvature mechanism would be adapted to other proxies of assembly history, such as the half-mass formation time, for which no signal of secondary bias is measured at the high-mass end). For these low-mass halos, there are indications that assembly bias may be influenced by nonlinear effects2 that in turn may depend on the location of halos within the cosmic web. These effects can hinder mass accretion early on for a subpopulation of halos (Dalal et al. 2008). The suppression of halo growth may be caused by tidal effects induced individually by a neighboring massive halo (e.g., Hahn et al. 2009) or, more generally, by the whole geometry of tides (Borzyszkowski et al. 2017; Musso et al. 2018). Pinpointing the exact mechanisms is nevertheless complicated given the large intrinsic stochasticity in bias and the complex map of correlations between environmental and internal halo properties. A clearer conceptual picture is to view low-mass assembly bias as a consequence of correlations between bias and the anisotropy of the local tidal field, which in turn correlates with the internal halo properties at a fixed halo mass (Paranjape et al. 2018; Ramakrishnan et al. 2019). We note that this picture can be adapted to include other related environmental properties as mediators of the effect (see Balaguera-Antolínez et al. 2024; Montero-Dorta & Rodriguez 2024).
The results reported in the literature provide valuable clues as to the physical origins of halo assembly bias. However, a comprehensive theory is yet to be established. In particular, it is still unclear how the different physical mechanisms that have been proposed, both in the linear and nonlinear regimes, contribute to the signals across different halo mass ranges. In this paper, we aim to provide new insights by explicitly connecting halos at z = 0 with the initial density field. We used the IllustrisTNG3 DM-only simulation to investigate the properties of the collapsing regions at z = 12 where different types of halos form. These properties were analyzed in connection with the halo assembly bias signal measured at z = 0 in order to shed light on the processes that shape this effect. To facilitate the analysis of bias and its correlations with multiple properties, an object-by-object halo bias estimator was employed (Paranjape et al. 2018; Contreras et al. 2021b; Balaguera-Antolínez et al. 2024; Balaguera-Antolínez & Montero-Dorta 2024).
The paper is organized as follows. Section 2 provides a brief description of the simulation data used in this work. Our methodology for measuring the properties of the z = 12 initial clouds and the z = 0 halo-by-halo bias is described in Sect. 3. Our main results regarding assembly bias are presented in Sect. 4. Finally, Sect. 5 is devoted to discussing the implications of our results and providing a brief summary of the paper. The IllustrisTNG300 simulation adopts the standard ΛCDM cosmology (Planck Collaboration XIII 2016), with parameters Ωm = 0.3089, Ωb = 0.0486, ΩΛ = 0.6911, H0 = 100 h km s−1 Mpc−1 with h = 0.6774, σ8 = 0.8159, and ns = 0.9667.
2. Data
Our analysis is based on publicly available data from the IllustrisTNG suite of cosmological simulations (hereafter TNG for simplicity; Pillepich et al. 2018a,b; Nelson et al. 2018, 2019; Marinacci et al. 2018; Naiman et al. 2018; Springel et al. 2018). The TNG simulations, particularly the magnetohydrodynamic boxes, have been employed in a large number of works over the years, providing insight into a variety of galaxy formation processes4. Since we are only interested in DM halos, we analyzed the TNG300-1 Dark box (hereafter TNG300-DMO5, for DM-only), which follows the dynamical evolution of 25003 DM particles of mass 5.0 × 107h−1 M⊙ within a simulated cosmological volume of 205 h−1 Mpc (with periodic boundary conditions). Choosing the largest box within the TNG suite provides an obvious advantage when it comes to measuring large-scale halo clustering, as we discuss in Sect. 3. DM halos in TNG are identified using a friends-of-friends algorithm with a linking length of 0.2 times the mean inter-particle separation (Davis et al. 1985). The SUBFIND algorithm (Springel et al. 2001; Dolag et al. 2009) is in turn used to identify subhalos.
Several halo properties at z = 0 were employed in this work. We used the virial halo mass, Mvir [h−1 M⊙], defined as the total mass enclosed within a sphere of radius Rvir, where the density equals 200 times the critical density of the Universe. From Mvir and the merger trees provided in the database, we computed the standard proxy for halo age, namely, the half-mass scale factor a1/2, which represents the time at which half the z = 0 halo mass Mvir has formed. We also measured the virial concentration, cvir, defined in the standard way as the ratio between the virial radius and the scale radius (i.e., cvir = Rvir/Rs, where Rs is obtained by fitting the DM density profiles of individual halos to a Navarro-Frenk-White (NFW) profile; Navarro et al. 1997).
As a proxy for the initial field, we used the z = 12 snapshot. This is the earliest (in terms of cosmic time) complete (full-particle) snapshot in TNG300-DMO. In the following section, we describe the set of properties measured from the collapsing regions that are identified in this snapshot as the progenitors of z = 0 halos.
3. Methodology
The main goal of this work is to analyze the secondary bias signal (i.e., the differences in bias at fixed halo mass) produced by the properties of the z = 12 regions from which z = 0 halos form. Our procedure can be divided into the following simple steps:
-
We selected all halos at z = 0 within four discontinuous narrow halo mass bins of width 0.2 dex and centered at log10 Mvir[h−1 M⊙] = [11.5, 12, 12.5 13]. This selection ensures that differences in bias are measured at an approximately fixed halo mass.
-
We traced the DM particles composing each of the selected halos all the way back to z = 12 (snapshot number 2 in TNG300-DMO).
-
With the positions and velocities of the selected particles, several internal properties of the resulting z = 12 clouds are measured. As we describe below, we focused on velocity dispersion, radial velocity, sphericity, ellipticity, triaxiality, and mass concentration.
-
We measured the secondary bias signal produced by the internal properties of the initial regions. The z = 0 halo population in each mass bin was split using these properties, thus explicitly establishing a relation between the high-z regions and the z = 0 halos. The large-scale linear bias was computed at z = 0.
Figure 1 shows an example of a collapsing region at z = 12 and the resulting virialized object (halo) of mass Mvir = 1011.5 h−1 M⊙ at z = 0. In the remainder of this section, we describe in more detail the calculation of the z = 12 properties, and the individual halo-by-halo bias at z = 0.
![]() |
Fig. 1. Collapsing region at z = 12 and the corresponding halo that forms from that patch of the density field, viewed at z = 0. Only 10% of the particles are plotted for the sake of simplicity. The color code indicates the height on the z-axis at z = 12, from low (purple) to high (yellow). Arrows indicate the initial and final location of a few individual DM particles. Note that this particular region is composed of two disjointed subregions that are in the process of collapsing and merging. |
3.1. Characterizing collapsing regions at z = 12
The following properties of the z = 12 regions are measured based on the positions and velocities of traced-back particles:
-
Velocity dispersion (σ12 [km/s]): the square root of the variance of the velocity of particles with respect to the mean velocity of the region.
-
Radial velocity (vr, 12 [km/s]): the velocity dispersion of the region projected onto the radial direction, which is defined with respect to the mean velocity of the region. Negative values of this property correspond to inward bulk motions.
-
Sphericity (s12): computed by diagonalizing the inertia tensor, so that the principal axes of the ideal ellipsoid can be obtained (see, e.g., Allgood et al. 2006; Chua et al. 2019; Rodriguez et al. 2024). The inertia tensor is defined as
, where i, j correspond to the 3 axes of the simulation box (i.e., i, j = 1, 2, 3), m is the mass of the particle, np is the total number of particles, and xi(k), xj(k) represent the positions of the k-th particle in the i- and j-direction, respectively. These particle positions are measured with respect to the position of the particle with the minimum gravitational potential energy. The eigenvalues of the inertia tensor (λa, λb, λc) are used to define the axes of each halo, as
,
,
, with a > b > c. The sphericity parameter is calculated as s = c/a, so larger values correspond to more spherical objects.
-
Ellipticity (e12): also computed from the principal axes of the ellipsoid as e12 = (1 − s122)/(1 + s122 + q122), where s12 is the sphericity and q12 = b/a. Larger values indicate more elliptical objects.
-
Triaxiality (T12): also computed from the principal axes of the ellipsoid as T12 = (1 − q122)/(1 − s122). Values of T ≲ 1/3 are usually associated with oblate shapes, whereas objects with T ≳ 2/3 are considered prolate. The range 1/3 ≲ T ≲ 2/3 corresponds to triaxial objects not clearly fitting into either of these categories. These objects have a more complex shape, with all three axes having different lengths.
-
Mass concentration (c12): measured as the radius that contains 90% of the particles divided by the radius that contains 50% of the particles, following the philosophy of the virial concentration defined for halos. As in that case, larger values of c12 correspond to more concentrated clouds.
The above properties are sometimes measured within our framework for regions of very complex structure. In some cases, these regions are composed of multiple subregions. This is illustrated in Fig. 1, where the initial region is composed of two disjointed subregions that are in the process of collapsing and merging. The above properties provide an average characterization of the shape and kinematics of the whole patch, which is naturally dominated by the most massive components. As explained in Sect. 4, our tests indicate that defining global properties yields robust results, despite the complexity of certain regions. In follow-up work, we will refine our measurement by characterizing the multiple initial components individually.
![]() |
Fig. 5. The average properties of the z = 12 collapsing regions from which z = 0 halos originate. Left: Average properties of the regions in which older (low a1/2) and younger (high a1/2) halos form. Right: Same but for virial concentration (low and high cvir). From top to bottom, the mean values of the sphericity, ellipticity, concentration, triaxiality, velocity dispersion, and radial velocity of the corresponding clouds are presented for each subset separately. Errors were estimated using a bootstrap procedure based on 1000 resampling subsets. |
3.2. Measuring secondary bias
The standard way of measuring linear halo bias is based on a ratio of either correlation functions or power spectra, for example b = ξhm/ξmm (where ξmm is the auto-correlation of the DM density field and ξhm is the cross-correlation between halos and the DM itself). These procedures have the disadvantage that the bias is measured for subsets of halos, which can complicate some analyses that aim at disentangling correlations between internal halo properties, environment and bias. Instead, we used the object-by-object estimate of halo bias developed by Paranjape et al. (2018). In this prescription, the effective large-scale bias is viewed as a sample mean in which each element contributes with an object-by-object bias, bi, of the form
where δDM(k) is the Fourier transform of the DM density field, PDM is the matter power spectrum, and is the number of Fourier modes in the j-th spherical shell. The sum is carried over the range of wavenumbers in which the ratio between the halo and the DM power spectra is constant. We set this limit as 0.04 ≤ k[h Mpc−1]≤0.08. For computational feasibility, 1/256 of the total number of particles of the DM field are employed. Similar applications of the individual bias are described in Ramakrishnan et al. (2019), Contreras et al. (2021b), Balaguera-Antolínez et al. (2024), and Balaguera-Antolínez & Montero-Dorta (2024).
With the object-by-object bias in hand, measuring secondary bias is trivial. In each selected mass bin, the halo set can be split by a particular property (internal at z = 0 or from the original z = 12 region) using equal-size percentile subsets encompassing halos with the 20% lowest and highest values of the corresponding property. Subsequently, the mean individual bias of each subset is computed and an error on the mean is assigned based on a bootstrap method. This is the traditional approach, but in order to harness the power of the halo-by-halo bias, the measurement can be also inserted into a correlation analysis as another individual halo property.
4. Results
Figure 2 displays the secondary halo bias produced at z = 0 by several properties of the collapsing regions (measured at z = 12), as compared to the assembly bias signal. For simplicity, only the cvir trend is plotted for assembly bias, since a1/2 is known to produce a similar signal within the halo mass range considered. In both panels, the mean halo-by-halo bias is presented as a function of the z = 0 halo mass, showing the characteristic enhancement of clustering toward the high-mass end (see, e.g., Tinker et al. 2010 for a detailed discussion and fitting functions). We note that lines connecting data have been added only for the sake of a better visualization of the differences between subsets. As mentioned in the previous section, different percentile subsets are created by ranking and splitting the halo population within each mass bin based on the values of each property (the 20% lowest, the 20% highest, and the entire sample). Errors throughout this work have been estimated by means of a bootstrap analysis, based on 1000 resampling subsets. Secondary bias exists when a significant difference is measured between the mean bias of the low- and the high-percentile subsets. We note that, as long as the bias has a monotonous dependence on a given property, the mean bias of the entire population must lie between the extreme subsets.
![]() |
Fig. 2. Secondary halo bias (measured at z = 0) produced by a set of properties related to the shape (left) and kinematics (right) of the collapsing regions at z = 12 from which z = 0 halos in specific mass bins originate. These results are compared with the assembly bias signal, represented here by the concentration, cvir, secondary dependence. The mean biases in equal-size percentile subsets encompassing halos with the 20% lowest and highest values of the corresponding property in each mass bin are shown as a function of halo mass, along with the mean bias of the entire population in that mass bin. Errors were estimated using a bootstrap technique based on 1000 resampling subsets. The lines connecting data have been added only for the sake of a better visualization of the differences between subsets. |
Figure 2 shows one of the key results of this work: when properties of the initial collapsing regions are assigned to the resulting halos at z = 0, large secondary bias signals, often exceeding in amplitude the assembly bias trend, are obtained. To better quantify this result, in Fig. 3 we show the relative bias produced by the same properties, that is, the ratio between the mean bias of each percentile subset and the mean bias of the entire population in that mass bin. From the analysis of Figs. 2 and 3, the following conclusions are drawn:
![]() |
Fig. 3. Similar to Fig. 2 but showing the mean relative bias as a function of halo mass for the same set of properties, which are related to the shape (left) and kinematics (right) of the initial clouds. The mean relative bias is simply defined as the ratio between the mean bias of the low- or high-percentile subset and the mean bias of the entire mass bin. Errors were estimated using a bootstrap procedure based on 1000 resampling subsets. The lines connecting data points are simply added to facilitate the readability of the plot. |
-
Below log10Mvir[h−1 M⊙]≃13, halos at z = 0 that originate from collapsing regions at z = 12 that have higher concentration, lower sphericity, higher ellipticity, higher velocity dispersion and more negative (i.e., higher, in absolute values) radial velocity are more clustered on large scales than halos in the opposite subsets. Lower triaxiality (more oblate) halos have a higher bias only toward the low-mass end, where the signal seems to be statistically significant, according to our uncertainty estimation.
-
Among the above trends, the largest secondary bias signals within the aforementioned mass range are found for velocity dispersion and radial velocity, with mean relative biases of > 2 (< 0.25) for the higher- (lower-) bias subsets. Also, concentration and sphericity display large signals, with factors of ∼1.75 (∼0.25) in relative bias. Ellipticity also yields a signal that exceeds that of virial concentration (factors of 1.5−0.3). The secondary bias produced by triaxiality, on the other hand, is only significant for low-mass halos (1.3−0.75) and is smaller than the concentration trend.
-
All properties, except for triaxiality, produce signals that, although strong, show very little dependence on mass (even at the high mass end). This is different from the well-known assembly bias trends, which vanish toward the high-mass end (both for concentration and age, within the halo mass range considered). The triaxiality trend appears to be the one that more closely resembles the concentration trend, with vanishing signal toward the high-mass end.
The results presented in Figs. 2 and 3 are not obvious. They imply that, at fixed z = 0 halo mass, (individual) z = 0 halo bias correlates strongly with the properties of the initial regions where halos originate. These figures also highlight two specific properties that display the strongest dependences: the radial velocity and the velocity dispersion. These properties are tightly anticorrelated, since the radial velocity is a projection of the velocity dispersion and is negative in regions that are collapsing.
The fact that selecting halos by different properties produces segregation in bias is a consequence of often small, although non-negligible correlations (or anticorrelations) between bias and these properties at fixed halo mass. To help us investigate the extent of these correlations and to provide a complete picture, Fig. 4 displays the correlation matrices for bias, and the z = 0 and z = 12 properties in the four halo mass bins considered. There are several important conclusions that we can extract from this figure. First of all, it shows that, due to the large scatter in all relations, the amount of correlation needed for a significant secondary bias signal to exist is rather small. The largest correlation (in absolute values) with bias is measured, as expected, for the velocity dispersion and the radial velocity of the collapsing cloud, with values of the correlation coefficient of C ≃ ±0.14, respectively. It is interesting that several properties display almost null correlation coefficients with respect to the individual bias, and still show significant secondary bias.
![]() |
Fig. 4. Correlation matrix for all properties analyzed in this work in each of the four halo mass bins considered. Three different groups of properties are highlighted with thicker lines: clustering (bias), internal z = 0 halo properties, and the properties of the initial z = 12 clouds. |
It is also obvious from Fig. 4 that significant correlation exists within the z = 0 and the z = 12 subset of properties. Virial concentration and age are, as expected, strongly anticorrelated throughout the entire mass range (C ≃ −0.6). Among the high-z properties, it is also not surprising that the pairs that display stronger correlation are e12 − T12 and vr, 12 − σ12: C ≃ 0.8 and C ≃ −0.97, respectively. Importantly, the high-z properties that are more tightly connected with halo concentration and age are also vr, 12 and σ12 in all halo mass bins, which suggests, when analyzed in the context of bias, that these are the properties of the initial regions that more directly shape the assembly bias trend.
To complement the above analysis, Fig. 5 displays the average characteristics of the z = 12 regions from which higher- or lower-concentration and older or younger z = 0 halos originate. We note that this figure reflects the correlations displayed in Fig. 5, but provides additional information on the actual values of each property. On average, higher-cvir halos originate in regions of higher sphericity, and consequently lower ellipticity, as compared to low-cvir halos. Those regions are also more concentrated and characterized by higher velocity dispersions and more negative radial velocities. Similar results are obtained for older halos (as expected from the cvir − a1/2 anticorrelation), although some interesting discrepancies exist. For example, older halos appear to be more spherical, but at the same time slightly more elliptical than younger halos. This contradiction might reflect the structural complexity of the initial regions (recall that these regions can be composed of disjointed patches). Figure 5 also shows that virial concentration is a better tracer of the original ellipticity, whereas age discriminate better between regions of different triaxiality.
Finally, we conducted a simple test to evaluate the robustness of our results, accounting for the complexity of the initial regions, which may consist of multiple subregions. We computed the cumulative mass fraction for each region, centered on the system’s overall center of mass. Our analysis revealed that highly heterogeneous regions tend to produce uneven mass fraction profiles. By fitting a simple error-function model to these profiles, we were able to approximately isolate such regions. Removing the 20% subset of regions with the most uneven profiles (higher χ2) did not qualitatively alter the signals shown in Figs. 2 and 3. This result suggests that complex regions are not preferentially biased toward any particular subset when the sample is divided based on the regions’ global properties, supporting the robustness of our findings. Nevertheless, we plan to explore this aspect in more detail in future work.
5. Discussion and conclusions
In this work we traced the positions and velocities of the DM particles that make up a selected subset of z = 0 halos all the way back to z = 12. In particular, all halos in four specific narrow mass bins, covering the mass range 1011.5 < Mvir[h−1 M⊙]< 1013 at z = 0, were considered in the analysis. The goal of the study was to examine the properties of the initial regions that give rise to these objects (initial in the sense that structure formation is still in its linear regime) and to evaluate the degree of secondary bias that these properties display at z = 0. This analysis is intended to shed light on the buildup of the assembly bias signal (the secondary bias dependence on halo assembly history, i.e., either on concentration or formation time) from the initial density field.
We have shown that halo concentration and formation time retain information on the properties of the initial regions. In terms of statistical correlation, the velocity dispersion and radial velocity of the initial clouds have the highest correlation coefficients with respect to these internal halo properties (C ± 0.25 − 0.35, in absolute values), followed by sphericity (C ± 0.10 − 0.2). The rest of the properties display values of C ≲ 0.1. In general, as we have verified by looking at the mean values of the properties of the initial regions, older, more concentrated halos at fixed halo mass tend to originate (as compared to younger, less concentrated halos) in more spherical and concentrated patches, which are characterized by higher velocity dispersions and radial velocities (in terms of amplitude, meaning more negative radial velocities values). Although these regions are fairly spherical, older, more concentrated halos also appear to emerge from slightly more oblate regions than those where younger, less concentrated halos form.
Importantly, using a convenient halo-by-halo estimate of bias at z = 0, we have shown that several properties of the initial regions display large secondary bias signals when they are used to split the z = 0 halo population at a fixed halo mass. This is a clear indication of the importance of linear effects in shaping assembly bias (i.e., the effects that can be directly connected with properties of the initial density field). The shapes of the trends themselves, however, might potentially reflect the impact of nonlinear effects that become important at later stages of halo evolution. We will come back to this point later in this section.
Halos with the same mass that formed in regions of, in this order, higher (more negative) velocity dispersion (radial velocity), lower sphericity, higher concentration, and higher ellipticity are significantly more clustered than the opposite subpopulations, to extents that considerably exceed the assembly bias trends. Only triaxiality seems to display a weaker secondary dependence than halo concentration and halo age. This hierarchy is reflected in the correlation matrix, where we can include the bias thanks to the individual nature of the estimate. The correlation between these initial properties and bias tends to be, as expected, stronger than for halo properties, for example ∼0.13 for σ12 compared to ∼0.05 for concentration. We note that it is well known that small correlations with bias can produce large secondary bias signals (see, e.g., Montero-Dorta et al. 2020; Paranjape et al. 2018). Importantly, our results highlight a clear link between the velocity dispersion and radial velocity of the initial regions and assembly bias, which can be easily understood from a physical standpoint. The radial velocity, vr, for example, is related to peculiar velocities, which respond to the gravitational potential of the region, thus reflecting the “strength” of the collapse. This picture is consistent with our result that older, more concentrated halos have typically more negative values of vr, indicating that these objects tend to collapse faster. We note that, since the absolute value of the radial velocity is highly correlated with the velocity dispersion, it is also possible to understand this bias trend in terms of virial mass alone. The regions that give rise to early-formed halos will tend to reach a higher virial mass earlier on than late-formed halos, even though they both reach the same virial mass at z = 0. It is reasonable, therefore, that these regions are more biased at early times.
The large amplitudes of secondary bias measured in this paper for the initial velocity dispersion and the radial velocity have only been previously found for environmental properties. They are not internal properties but have been proposed as mediators of the assembly bias effect. Examples of this can be found in Paranjape et al. (2018), where clustering is measured as a function of the anisotropy of the environment (the so-called α parameter) at fixed halo mass, or in Montero-Dorta & Rodriguez (2024), where a similar analysis is performed as a function of the distance to the main cosmic-web structures (characterized by the critical points of the density field). By exploring the correlation with properties of the initial collapsing clouds, we have been able to identify simple “internal” quantities (in the sense that they can be computed from the same set of particles that make up a current halo) that maximize the secondary bias signal at z = 0.
As mentioned above, another interesting aspect to consider is the shape of the secondary bias trends measured. Whereas both assembly bias signals decrease with mass up to Mhalo ≃ 1013 h−1 M⊙, most of the trends related to the initial clouds show almost no dependence with mass, thus maintaining the same signal at the high-mass end (we did not consider the well-known reversal of the signal above this mass scale for concentration, since that is beyond the mass range analyzed here). The cause of the specific mass dependence of assembly bias is unclear. The drop in signal toward Mvir ≃ 1013 h−1 M⊙ could be the imprint of nonlinear effects that act at a later stage of halo evolution. The correlations between different properties of the initial regions might also provide some insight. As the correlations between different properties display a dependence on mass, the vanishing of the assembly bias signal could potentially be related to the cancellation of different dependences. As an example, the correlation between concentration and sphericity increases significantly with halo mass. Sphericity is anticorrelated with bias, whereas velocity dispersion correlates with both concentration and bias. Trends like these could provide hints as to the physical mechanisms that make the assembly bias trend vanish at the high-mass end. Unfortunately, the small volume of TNG300 prevents us from properly investigating this hypothesis. We will address this aspect in a follow-up work, where we will analyze multiple snapshots in a bigger cosmological volume: this will allow us to reach halo masses of around Mvir ≃ 1014 h−1 M⊙ at high statistical significance.
Our analysis can be improved by taking into account that, when DM halos are traced back, the resulting initial regions are sometimes composed of two or potentially more disjointed subregions that are in the process of merging. Although our tests indicate that defining the properties of these regions on a global basis is a robust choice, characterizing the properties of their subregions separately, along with their relationship with the properties of virialized objects at z = 0, will add another layer of complexity to our analysis. In this context, the study of the buildup of the assembly bias signal can be extended by evaluating the connection between the initial regions and halos at different redshifts.
Finally, a potential application of the properties investigated in this work is the inclusion of assembly bias in galaxy population models. Models such as halo occupation distributions (Jing et al. 1998; Benson et al. 2000; Peacock & Smith 2000; Berlind et al. 2003; Zheng et al. 2005, 2007; Contreras et al. 2013, 2017; Guo et al. 2015; Zehavi et al. 2019) and subhalo abundance matching (Vale & Ostriker 2006; Conroy et al. 2006; Reddick et al. 2013; Contreras et al. 2015; Guo et al. 2016; Chaves-Montero et al. 2016; Lehmann et al. 2017; Dragomir et al. 2018; Artale et al. 2018; Hadzhiyska et al. 2021; Favole et al. 2022) contain a limited or even zero level of assembly bias (we refer to galaxy assembly bias in this case6). Some extensions of these models have been able to inject additional assembly bias to improve their accuracy. Initially, these models add assembly bias to their galaxies by correlating the occupation number or galaxy properties (such as stellar mass or luminosity) to a halo property that displays a certain level of halo assembly bias (e.g., Hearin et al. 2015). The amount of galaxy assembly bias this method could add is limited by the amount of halo assembly bias the halo property had, which was usually insufficient (Xu et al. 2021a). We note that here we are specifically referring to properties such as concentration or formation time, thus the use of the term assembly bias. Modern mocks add bias using environmental properties, a type of secondary bias that is not technically assembly bias but has a similar (actually stronger) effect (Paranjape et al. 2018; Contreras et al. 2021b,c; Xu et al. 2021b). Using the properties found in this work could help introduce assembly bias in a more natural way, which could in turn facilitate the use of these mocks in a variety of halo-galaxy connection studies.
Note that multiple definitions of halo age have been analyzed in this context, as discussed by Li et al. (2008).
See https://www.tng-project.org/results/ for a complete list of publications.
Here, we are implicitly adopting the definition of galaxy assembly bias based on so-called occupancy variations, i.e., the dependence of halo occupation on the assembly history of halos and related quantities (Zehavi et al. 2018).
Acknowledgments
First of all, we want to thank the anonymous referee for his/her comments, which have significantly improved the quality of the manuscript. This work is the result of the ideas and analyses developed during the Institute for Fundamental Physics of the Universe (IFPU) Team Research Program “Unveiling the physical origins of assembly bias”, which was held in Trieste in July 2024. We thank IFPU for their financial support and Emiliano Sefusatti and Pierluigi Monaco for their hospitality and valuable discussions during the week. ADMD and FR also acknowledge the Abdus Salam International Centre for Theoretical Physics (ICTP) for their hospitality and financial support through the Senior Associates Programme 2022-2027 and Junior Associates Programme 2023-2028, respectively. ADMD thanks Fondecyt for financial support through the Fondecyt Regular 2021 grant 1210612. FR thanks the support by Agencia Nacional de Promoción Científica y Tecnoólogica, the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET, Argentina) and the Secretaría de Ciencia y Tecnología de la Universidad Nacional de Córdoba (SeCyT-UNC, Argentina). MCA acknowledges support from FONDECYT Iniciación 2024 grant number 11240540 and ANID BASAL project FB210003. SC acknowledges the support of the ‘Juan de la Cierva Incorporacíon’ fellowship (IJC2020-045705-I).
References
- Allgood, B., Flores, R. A., Primack, J. R., et al. 2006, MNRAS, 367, 1781 [NASA ADS] [CrossRef] [Google Scholar]
- Angulo, R. E., Baugh, C. M., & Lacey, C. G. 2008, MNRAS, 387, 921 [NASA ADS] [CrossRef] [Google Scholar]
- Artale, M. C., Zehavi, I., Contreras, S., & Norberg, P. 2018, MNRAS, 480, 3978 [NASA ADS] [CrossRef] [Google Scholar]
- Balaguera-Antolínez, A., & Montero-Dorta, A. D. 2024, A&A, 692, A32 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Balaguera-Antolínez, A., Montero-Dorta, A. D., & Favole, G. 2024, A&A, 685, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S. 1986, ApJ, 304, 15 [Google Scholar]
- Benson, A. J., Cole, S., Frenk, C. S., Baugh, C. M., & Lacey, C. G. 2000, MNRAS, 311, 793 [NASA ADS] [CrossRef] [Google Scholar]
- Berlind, A. A., Weinberg, D. H., Benson, A. J., et al. 2003, ApJ, 593, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Borzyszkowski, M., Porciani, C., Romano-Díaz, E., & Garaldi, E. 2017, MNRAS, 469, 594 [NASA ADS] [CrossRef] [Google Scholar]
- Chaves-Montero, J., Angulo, R. E., Schaye, J., et al. 2016, MNRAS, 460, 3100 [NASA ADS] [CrossRef] [Google Scholar]
- Chua, K. T. E., Pillepich, A., Vogelsberger, M., & Hernquist, L. 2019, MNRAS, 484, 476 [Google Scholar]
- Conroy, C., Wechsler, R. H., & Kravtsov, A. V. 2006, ApJ, 647, 201 [Google Scholar]
- Contreras, S., Baugh, C. M., Norberg, P., & Padilla, N. 2013, MNRAS, 432, 2717 [Google Scholar]
- Contreras, S., Baugh, C. M., Norberg, P., & Padilla, N. 2015, MNRAS, 452, 1861 [CrossRef] [Google Scholar]
- Contreras, S., Zehavi, I., Baugh, C. M., Padilla, N., & Norberg, P. 2017, MNRAS, 465, 2833 [NASA ADS] [CrossRef] [Google Scholar]
- Contreras, S., Zehavi, I., Padilla, N., et al. 2019, MNRAS, 484, 1133 [NASA ADS] [CrossRef] [Google Scholar]
- Contreras, S., Chaves-Montero, J., Zennaro, M., & Angulo, R. E. 2021a, MNRAS, 507, 3412 [NASA ADS] [CrossRef] [Google Scholar]
- Contreras, S., Angulo, R. E., & Zennaro, M. 2021b, MNRAS, 504, 5205 [CrossRef] [Google Scholar]
- Contreras, S., Angulo, R. E., & Zennaro, M. 2021c, MNRAS, 508, 175 [NASA ADS] [CrossRef] [Google Scholar]
- Dalal, N., White, M., Bond, J. R., & Shirokov, A. 2008, ApJ, 687, 12 [NASA ADS] [CrossRef] [Google Scholar]
- Davis, M., Efstathiou, G., Frenk, C. S., & White, S. D. M. 1985, ApJ, 292, 371 [Google Scholar]
- Dolag, K., Borgani, S., Murante, G., & Springel, V. 2009, MNRAS, 399, 497 [Google Scholar]
- Dragomir, R., Rodríguez-Puebla, A., Primack, J. R., & Lee, C. T. 2018, MNRAS, 476, 741 [NASA ADS] [CrossRef] [Google Scholar]
- Faltenbacher, A., & White, S. D. M. 2010, ApJ, 708, 469 [NASA ADS] [CrossRef] [Google Scholar]
- Favole, G., Montero-Dorta, A. D., Artale, M. C., et al. 2022, MNRAS, 509, 1614 [Google Scholar]
- Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]
- Gao, L., Springel, V., & White, S. D. M. 2005, MNRAS, 363, L66 [NASA ADS] [CrossRef] [Google Scholar]
- Guo, H., Zheng, Z., Zehavi, I., et al. 2015, MNRAS, 453, 4368 [Google Scholar]
- Guo, H., Zheng, Z., Behroozi, P. S., et al. 2016, MNRAS, 459, 3040 [NASA ADS] [CrossRef] [Google Scholar]
- Hadzhiyska, B., Bose, S., Eisenstein, D., & Hernquist, L. 2021, MNRAS, 501, 1603 [Google Scholar]
- Hahn, O., Porciani, C., Dekel, A., & Carollo, C. M. 2009, MNRAS, 398, 1742 [CrossRef] [Google Scholar]
- Han, J., Li, Y., Jing, Y., et al. 2019, MNRAS, 482, 1900 [NASA ADS] [CrossRef] [Google Scholar]
- Hearin, A. P., Watson, D. F., & van den Bosch, F. C. 2015, MNRAS, 452, 1958 [CrossRef] [Google Scholar]
- Jing, Y. P., Mo, H. J., & Börner, G. 1998, ApJ, 494, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Johnson, J. W., Maller, A. H., Berlind, A. A., Sinha, M., & Holley-Bockelmann, J. K. 2019, MNRAS, 486, 1156 [CrossRef] [Google Scholar]
- Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]
- Lazeyras, T., Musso, M., & Schmidt, F. 2017, JCAP, 2017, 059 [Google Scholar]
- Lehmann, B. V., Mao, Y.-Y., Becker, M. R., Skillman, S. W., & Wechsler, R. H. 2017, ApJ, 834, 37 [NASA ADS] [CrossRef] [Google Scholar]
- Li, Y., Mo, H. J., & Gao, L. 2008, MNRAS, 389, 1419 [NASA ADS] [CrossRef] [Google Scholar]
- Mao, Y.-Y., Zentner, A. R., & Wechsler, R. H. 2018, MNRAS, 474, 5143 [NASA ADS] [CrossRef] [Google Scholar]
- Marinacci, F., Vogelsberger, M., Pakmor, R., et al. 2018, MNRAS, 480, 5113 [NASA ADS] [Google Scholar]
- Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]
- Montero-Dorta, A. D., & Rodriguez, F. 2024, MNRAS, 531, 290 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Artale, M. C., Abramo, L. R., et al. 2020, MNRAS, 496, 1182 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Chaves-Montero, J., Artale, M. C., & Favole, G. 2021, MNRAS, 508, 940 [NASA ADS] [CrossRef] [Google Scholar]
- Musso, M., Cadiou, C., Pichon, C., et al. 2018, MNRAS, 476, 4877 [Google Scholar]
- Naiman, J. P., Pillepich, A., Springel, V., et al. 2018, MNRAS, 477, 1206 [Google Scholar]
- Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]
- Nelson, D., Pillepich, A., Springel, V., et al. 2018, MNRAS, 475, 624 [Google Scholar]
- Nelson, D., Springel, V., Pillepich, A., et al. 2019, Comput. Astrophys. Cosmol., 6, 2 [Google Scholar]
- Paranjape, A., Hahn, O., & Sheth, R. K. 2018, MNRAS, 476, 3631 [NASA ADS] [CrossRef] [Google Scholar]
- Peacock, J. A., & Smith, R. E. 2000, MNRAS, 318, 1144 [Google Scholar]
- Pillepich, A., Nelson, D., Hernquist, L., et al. 2018a, MNRAS, 475, 648 [Google Scholar]
- Pillepich, A., Springel, V., Nelson, D., et al. 2018b, MNRAS, 473, 4077 [Google Scholar]
- Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ramakrishnan, S., Paranjape, A., Hahn, O., & Sheth, R. K. 2019, MNRAS, 489, 2977 [NASA ADS] [CrossRef] [Google Scholar]
- Reddick, R. M., Wechsler, R. H., Tinker, J. L., & Behroozi, P. S. 2013, ApJ, 771, 30 [Google Scholar]
- Rodriguez, F., Merchán, M., & Artale, M. C. 2024, A&A, 688, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Salcedo, A. N., Maller, A. H., Berlind, A. A., et al. 2018, MNRAS, 475, 4411 [NASA ADS] [CrossRef] [Google Scholar]
- Sato-Polito, G., Montero-Dorta, A. D., Abramo, L. R., Prada, F., & Klypin, A. 2019, MNRAS, 487, 1570 [NASA ADS] [CrossRef] [Google Scholar]
- Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119 [Google Scholar]
- Sheth, R. K., & Tormen, G. 2004, MNRAS, 350, 1385 [NASA ADS] [CrossRef] [Google Scholar]
- Sheth, R. K., Mo, H. J., & Tormen, G. 2001, MNRAS, 323, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Springel, V., White, S. D. M., Tormen, G., & Kauffmann, G. 2001, MNRAS, 328, 726 [Google Scholar]
- Springel, V., Pakmor, R., Pillepich, A., et al. 2018, MNRAS, 475, 676 [Google Scholar]
- Tinker, J. L., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]
- Tucci, B., Montero-Dorta, A. D., Abramo, L. R., Sato-Polito, G., & Artale, M. C. 2021, MNRAS, 500, 2777 [Google Scholar]
- Vale, A., & Ostriker, J. P. 2006, MNRAS, 371, 1173 [NASA ADS] [CrossRef] [Google Scholar]
- Wechsler, R. H., Zentner, A. R., Bullock, J. S., Kravtsov, A. V., & Allgood, B. 2006, ApJ, 652, 71 [NASA ADS] [CrossRef] [Google Scholar]
- Xu, X., Zehavi, I., & Contreras, S. 2021a, MNRAS, 502, 3242 [NASA ADS] [CrossRef] [Google Scholar]
- Xu, X., Kumar, S., Zehavi, I., & Contreras, S. 2021b, MNRAS, 507, 4879 [CrossRef] [Google Scholar]
- Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]
- Zehavi, I., Kerby, S. E., Contreras, S., et al. 2019, ApJ, 887, 17 [NASA ADS] [CrossRef] [Google Scholar]
- Zheng, Z., Berlind, A. A., Weinberg, D. H., et al. 2005, ApJ, 633, 791 [NASA ADS] [CrossRef] [Google Scholar]
- Zheng, Z., Coil, A. L., & Zehavi, I. 2007, ApJ, 667, 760 [Google Scholar]
All Figures
![]() |
Fig. 1. Collapsing region at z = 12 and the corresponding halo that forms from that patch of the density field, viewed at z = 0. Only 10% of the particles are plotted for the sake of simplicity. The color code indicates the height on the z-axis at z = 12, from low (purple) to high (yellow). Arrows indicate the initial and final location of a few individual DM particles. Note that this particular region is composed of two disjointed subregions that are in the process of collapsing and merging. |
In the text |
![]() |
Fig. 5. The average properties of the z = 12 collapsing regions from which z = 0 halos originate. Left: Average properties of the regions in which older (low a1/2) and younger (high a1/2) halos form. Right: Same but for virial concentration (low and high cvir). From top to bottom, the mean values of the sphericity, ellipticity, concentration, triaxiality, velocity dispersion, and radial velocity of the corresponding clouds are presented for each subset separately. Errors were estimated using a bootstrap procedure based on 1000 resampling subsets. |
In the text |
![]() |
Fig. 2. Secondary halo bias (measured at z = 0) produced by a set of properties related to the shape (left) and kinematics (right) of the collapsing regions at z = 12 from which z = 0 halos in specific mass bins originate. These results are compared with the assembly bias signal, represented here by the concentration, cvir, secondary dependence. The mean biases in equal-size percentile subsets encompassing halos with the 20% lowest and highest values of the corresponding property in each mass bin are shown as a function of halo mass, along with the mean bias of the entire population in that mass bin. Errors were estimated using a bootstrap technique based on 1000 resampling subsets. The lines connecting data have been added only for the sake of a better visualization of the differences between subsets. |
In the text |
![]() |
Fig. 3. Similar to Fig. 2 but showing the mean relative bias as a function of halo mass for the same set of properties, which are related to the shape (left) and kinematics (right) of the initial clouds. The mean relative bias is simply defined as the ratio between the mean bias of the low- or high-percentile subset and the mean bias of the entire mass bin. Errors were estimated using a bootstrap procedure based on 1000 resampling subsets. The lines connecting data points are simply added to facilitate the readability of the plot. |
In the text |
![]() |
Fig. 4. Correlation matrix for all properties analyzed in this work in each of the four halo mass bins considered. Three different groups of properties are highlighted with thicker lines: clustering (bias), internal z = 0 halo properties, and the properties of the initial z = 12 clouds. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.