Open Access
Issue
A&A
Volume 693, January 2025
Article Number A59
Number of page(s) 24
Section Cosmology (including clusters of galaxies)
DOI https://doi.org/10.1051/0004-6361/202451683
Published online 03 January 2025

© The Authors 2024

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model.

Open access funding provided by Max Planck Society.

1. Introduction

Interest in galaxy proto-clusters has recently strongly increased thanks to observational capabilities of new survey instruments, which have revealed how some of the most essential processes shaping the present-day galaxy population in clusters has already happened in the very early stages of cluster formation. The desire to obtain direct observational evidence of these processes at high redshifts has motivated many recent observational studies of proto-clusters (e.g. Overzier 2016; Alberts & Noble 2022).

Galaxy proto-clusters have been found in different ways. They have been found as serendipitous detections in systematic surveys: such as (1) in photometric surveys often conducted to find distant galaxies. One of the first of these discoveries is the proto-cluster in the SSA22 field, which was found as an overdensity of Ly-break galaxies at redshift z ∼ 3 (Steidel et al. 1998). Other such detections include overdensities of Lyα-emitters (Shimasaku et al. 2003 – Subaru Deep Field, Ouchi et al. 2005 – COSMOS Survey, Higuchi et al. 2019 – SUBARU HSC Survey), i-band drop outs (Toshikawa et al. 2012 – Subaru Deep Field), and multi-band photometric redshifts (Chiang et al. 2014 – COSMOS Survey). Galaxy proto-clusters have also been found (2) in spectroscopic surveys, for example, in the VIMOS Ultra Deep Survey (Cucciati et al. 2014, 2018; Lemaux et al. 2014; Harikane et al. 2019), and (3) as concentrations of sub-millimeter sources in the Planck Survey (Planck Collaboration XXVII 2015; Flores-Cacho et al. 2016; Calvi et al. 2023), by the South Pole Telescope (Vieira et al. 2010; Miller et al. 2018), by the Herschel Space Observatory (Clements et al. 2014; Greenslade et al. 2018), and in other deep survey fields (Daddi et al. 2009; Dannerbauer et al. 2014; Casey et al. 2015; Oteo et al. 2018; Gómez-Guijarro et al. 2019; Wang et al. 2021). Also the detection of proto-clusters as overdensities of passive galaxies has been reported recently (Strazzullo et al. 2015; McConachie et al. 2022; Ito et al. 2023), where the objects of Strazzullo et al. (2015) are not easily classified as either clusters or proto-clusters. However, interesting proto-cluster systems have been discovered around particular objects marking dense regions of the Universe, so-called signposts for proto-clusters, including (4) radio galaxies, which have been used for quite some time to search for dense environments (Le Fevre et al. 1996; Pentericci et al. 2000; Kurk et al. 2000, 2004; Venemans et al. 2002, 2004, 2007; Miley & De Breuck 2008; Kuiper et al. 2011; Hatch et al. 2011a,b; Galametz et al. 2012; Wylezalek et al. 2013; Koyama et al. 2013; Castignani et al. 2014a,b), (5) AGN (active galactic nuclei) playing a similar role as radio galaxies (Djorgovski et al. 2003; Hennawi et al. 2015; García-Vergara et al. 2017, 2019, 2022), (6) and Ly-α blobs (Calvi et al. 2023), and last but not least (7) by absorption in the light of background objects (Francis et al. 1996; Steidel et al. 1998; Hennawi et al. 2015).

While these observed systems span a range of properties (not all of these objects may end up in one galaxy cluster), one looks for a unifying description. The general idea is to call a proto-cluster a structures that is expected to evolve into a galaxy cluster by redshift z = 0 (e.g. Steidel et al. 1998; Overzier 2016), whereby theoretical modelling or simulations are used to connect the observations to present-day cluster properties. In Steidel et al. (1998), one of the earliest studies of a proto-cluster, the structure evolution model of a spherical top-hat overdensity was used to relate an observed overdensity to the expectation for a galaxy cluster at the present day. Another approach is to use N-body simulations to trace the evolution of z = 0 clusters back to the redshifts of observations and provide, in this way, relations between present-day cluster masses and the properties of their precursors at high redshift. Chiang et al. (2013) provide results from such a study and present correlations between the proto-cluster overdensity and the expected cluster mass at z = 0, which has been applied with some success to several proto-cluster observations (e.g. Cucciati et al. 2014). Also, Contini et al. (2016) studied proto-cluster sizes in simulations. However, they used boxes instead of spheres, which makes a comparison to other work more difficult.

The ESA-Euclid mission (Euclid Collaboration 2022, 2024a,b,c,d) with its deep near-infrared and high-angular-resolution visual band survey, together with the auxiliary ground-based optical survey data, will provide a unique opportunity to search for proto-clusters over a large region of the sky. This will not only increase the number of known proto-clusters and improve the statistics on their properties, but also yield rare and massive systems, that can only be found in large survey volumes. The main part of the Euclid Survey is the Wide Survey of the sky outside the Galactic band, which has an area of about 14 500 deg2 over six years. It will reach estimated limiting AB magnitudes (5σ for point-like sources) of about 26.2 in the visual band, IE1, and 24.5 for the near-infrared bands YE, JE, and HE (Euclid Collaboration 2022). This paper provides studies of proto-cluster properties and their appearance in the Euclid Wide Survey by means of simulations in order to explore the prospects for the search of proto-clusters in the Euclid sky. Euclid will enormously increase the number of known proto-clusters at high redshifts and thus provide the base for precise statistical studies on proto-cluster structure, early galaxy evolution in dense environments, and the origin of present day cluster properties. But it will also provide the large survey volume needed to find the most interesting objects, the precursors of the most massive galaxy clusters.

One of the major interests in the study of proto-clusters is gaining an understanding of the evolution of galaxies in these dense regions compared to the regions outside of them. The important questions are, when and through which processes was the difference of the more passive present-day cluster members compared to field galaxies established, which epoch saw the bulk of the star formation activity in the cluster galaxies and the most intense enrichment of the intracluster medium with heavy elements, and when and how were the central regions of clusters and the giant central galaxies, such as cD galaxies, formed. An interesting example of the latter question is the study of the Spiderweb proto-cluster (e.g. Tozzi et al. 2022). The Euclid survey will provide a large statistical sample of proto-clusters, which will not only allow for the investigation of single interesting cases as has been done so far, but enable a study of these questions in the form of population statistics.

In this paper we define a proto-cluster as a matter and galaxy concentration at a higher redshift that is bound to develop into a galaxy cluster by redshift zero with a mass larger than 1014 M inside r2002. We focus mainly on the redshift range z = 1.5 − 4. At higher redshifts, the galaxy density in the Euclid Survey is too sparse to effectively characterise proto-clusters, while at lower redshifts, galaxy clusters are already abundant. In the following, we explore the observational features of such proto-clusters using analytical models and cosmological simulations. We also estimate their abundance as a function of redshift.

The paper is organised in the following way. In Section 2, we describe the formalism of proto-cluster evolution with a top-hat model, while in Section 3 we give information on the cosmological simulations used to explore proto-cluster properties. The following sections are focused on different proto-cluster properties, such as sizes (Sect. 4), galaxy density profiles (Sect. 5), projected contrast in observations (Sects. 6 and 7), and the mass-richness relation (Sect. 8). Section 9 discusses the expected proto-cluster abundances and their sky-filling factors (how much of the sky is covered by proto-clusters in projection). Discussions of the findings are provided in Section 10 and in Section 11 we present our summary and conclusions. Unless stated otherwise, we use a cosmological model with h100 = 0.7 = H0/100 km s−1 Mpc−1, Ωm = 0.3, and a flat metric, which is referred to as ‘reference model’.

2. Proto-cluster model

As a basic characterisation of proto-clusters, we explore their overdensity evolution in this section. For galaxy clusters and their formation, a simple, general concept that can be expressed with analytic formulas has helped us very much in guiding our thoughts, the so-called Press–Schechter model (Press & Schechter 1974) and its extensions (e.g. Bond et al. 1991; Sheth & Tormen 1999). It is in its original form based on the collapse model of a homogeneous overdense sphere and the first-order statistics of density peaks in the large-scale matter distribution. It provides the most essential information to characterise the galaxy cluster population, such as the mass function, their characteristic sizes, and their number density evolution with time. Its precision for the prediction of number counts is usually better than a factor of two for low redshifts and not extremely high masses ( ≲ 1015M) (Bond et al. 1991). This is not enough for precise cosmological modelling. But this concept has also provided the frame in which more precise analytical models have been devised, which have been calibrated with N-body simulations (e.g. Jenkins et al. 2001; Evrard et al. 2002; Tinker et al. 2010; Despali et al. 2016; Castro et al. 2021). In recent work, we applied this concept also successfully to those superclusters, which are expected to collapse in the future – calling them “superstes-clusters” (Chon et al. 2015). These superclusters have the same relation to future galaxy clusters as high-redshift proto-clusters to galaxy clusters of the present day.

Therefore, it is well justified to apply this concept analogously to our proto-cluster project. The adopted model describes the evolution of a homogeneous top-hat spherical overdensity in a ΛCDM universe. The calculation of the evolution is based on Birkhoff’s theorem, where a homogeneous sphere in a homogeneous background universe evolves as a universe would with the local parameters as cosmological parameters. After finding the proper initial conditions for a collapse at redshift zero, we follow the evolution of the overdensity by integrating the Friedmann equations, including a Λ term starting at high redshift to z = 0. In the calculation, we determine the overdensity with respect to the mean matter density and the evolution of the radius of the overdense region with respect to r200 at z = 0.

We define a matter overdensity ratio as the ratio of the mean proto-cluster density, ρ ¯ pc ( z ) $ \bar\rho_{\mathrm{pc}}(z) $, to the background density, R ov DM ( z ) = ρ ¯ pc ( z ) / ρ m ( z ) $ R_{\mathrm{ov-DM}}(z) = \bar\rho_{\mathrm{pc}}(z) / \rho_{\mathrm{m}}(z) $, where ρm(z) is the mean matter density at the given redshift. We have calculated Rov − DM(z) by means of the spherical collapse model for three different sets of cosmological parameters: for the reference model defined in the introduction, the cosmology resulting from the Planck Survey (Planck Collaboration XIII 2016), and the cosmology used in the Millennium Simulations (Springel et al. 2005). Table 1 gives these three sets of cosmological parameters. We also list the cosmological parameters inferred from the present-day cluster population in the REFLEX cluster survey (Böhringer et al. 2014). The cluster mass function used later in this study (Böhringer et al. 2017) is based on these results.

Table 1.

Cosmological parameters used in the different models assuming a flat metric.

The resulting evolution of Rov − DM(z) is shown in Fig. 1. The curves for the reference model and the Planck cosmology can hardly be distinguished in the figure, but the Millennium result is slightly different. The difference is mostly due to the choice of Ωm and σ8, for which an older preference was used in the Millennium Simulations. The difference in the cosmological models does not significantly change the observables discussed in the following. We also indicate in the figure the turn-around redshift, at which the overdensity stops expanding and starts to collapse.

thumbnail Fig. 1.

Evolution of the overdensity ratio, Rov − DM(z), with redshift for an object that collapses at z = 0. The results for three cosmological models are shown: the reference model, Planck cosmology, and Millennium cosmology. The first two models can hardly be distinguished in the plot. The vertical line indicates the redshift of the turn-around point.

As reference we provide a few numerical values of Rov − DM(z): 3.24, 2.52, 1.91, 1.64 for z = 1.5, 2, 3, 4, respectively. The overdensities are small for the time before the turn-around, which occurs at z ∼ 0.8 in the reference model. For an object that collapses at z = 0, the overdensity ratio at turn-around is about 6.6. After the turn-around, the overdensity ratio increases rapidly.

3. Cosmological simulations

Three sets of simulations in the form of lightcones are used for the following study. The MAMBO, GAEA H, and GAEA F lightcones are derived from the Millennium Simulations (Springel et al. 2005; Boylan-Kolchin et al. 2009). The size of the lightcones and the cosmological parameters on which the simulations were based are given in Table 2. Importantly, these lightcones contain information about the merger trees, such that for each galaxy, one can find out if it is included in a larger dark matter halo, a group or cluster of galaxies, at redshift zero. The mass of this final host halo is known, and this parameter is used throught this paper. These three simulations were the only ones available to us where this information necessary for our work was included. For the two GAEA light cones different semi-analytical models were used to determine the properties of the galaxy population. Some of the following analysis required special information only available for the MAMBO light cone at the time of writing. In this case the results are shown only for this simulation. One of the critical differences in the semi-analytic modelling of the galaxy population that can effect the observables derived from the light cones is the stellar mass function used, as shown by Fu et al. (2024).

Table 2.

Parameters for the lightcones and the underlying cosmological simulations.

The MAMBO simulation based on empirical relations, developed by M. Bolzonella, L. Pozzetti, and G. Girelli, uses the halo and sub-halo positions as well as dark matter masses of one of the 24 lightcones built by Henriques et al. (2015). Under the assumption that each subhalo is hosting one galaxy, the stellar mass was derived from the stellar-to-halo mass relation (Girelli et al. 2020), and the physical and observed properties (e.g. star formation rate, dust attenuation, gas metallicity, morphology, emission lines, broadband rest-frame and observed fluxes) from empirical relations implemented in a modified version of the Empirical Galaxy Generator code (EGG; Schreiber et al. 2017). The predicted galaxy properties also include magnitudes in the HE and IE bands. In Henriques et al. (2015), the Millennium Simulations have been re-scaled to a cosmology consistent with the Planck results (Planck Collaboration XIII 2016; see Table 1).

For the GAEA lightcones, we take advantage of lightcones built from predictions based on the Galaxy Evolution and Assembly (GAEA) theoretical model. This is coupled with dark matter merger trees extracted from the Millennium Simulation (Springel et al. 2005), which makes it possible to track the evolution of each model galaxy both to higher and lower redshifts (in terms of their progenitors and descendants). This approach allowed us to study the later evolution of systems that are identified as proto-cluster regions and characterise the z = 0 descendant mass distribution. GAEA follows the evolution of galaxies across different cosmic epochs and environments by means of a coupled system of differential equations, each of them describing a single physical process responsible for the exchange of mass and energy among the different baryonic components. The individual prescriptions can be of empirical, analytical, or theoretical derivation and can involve the definition of free parameters that are usually calibrated against a selected set of observational constraints.

In this paper, we consider two different GAEA realisations based on the model versions published in Hirschmann et al. (2016) and Fontanot et al. (2020). Hereafter, we will refer to these realisations as GAEA-H and GAEA-F, respectively. Both model runs include a detailed treatment for non-instantaneous chemical enrichment (De Lucia et al. 2014) and a prescription for stellar feedback partly based on hydro-simulations. These prescriptions allow us to reproduce the evolution of the galaxy stellar mass function as well as cosmic star formation rate up to z ≲ 7 (Fontanot et al. 2017), as well the evolution of the mass-metallicity relations and their secondary dependencies (De Lucia et al. 2020; Fontanot et al. 2021). GAEA-F also includes an improved treatment of cold gas accretion onto supermassive black holes, an explicit treatment for AGN-driven winds (Fontanot et al. 2020), and an updated tracing of the angular momentum exchanges between different galactic components, which we use to model galaxy structural properties (Xie et al. 2020). While retaining all successes of the previous model, this update also reproduces the properties of the AGN populations up to z ∼ 4.

For each model galaxy, GAEA predicts the expected broadband photometry in the H-band and in the Euclid visual band, in addition to a number of physical properties and other photometric bands. These model outputs have been used to construct two independent lightcones (each for each model realisation) using the algorithm described in Zoldan et al. (2017). Each lightcone covers an aperture of 5 . ° 27 $ 5{{{\overset{\circ}{.}}}}27 $ diameter and includes all model galaxies from z = 0 to z = 4 down to HE = 25.

In the simulations, galaxies keep their identity through time and can be traced through the merger trees. Therefore, galaxies that are found to belong to a galaxy cluster at redshift zero with a mass M200 ≥ 1014M inside r200 can be identified and labelled as members of a proto-cluster at higher redshift in the lightcone. The ensemble of the member galaxies defines the proto-cluster.

For our studies, we selected galaxies with the following magnitude limits, mHE ≤ 24.25 for the near-infrared H-band and mIE ≤ 25.10 for the visual band of Euclid3. The lightcone databases provide observed and randomly perturbed magnitudes according to the expected measurement errors. Here, we use the unperturbed magnitudes. The limits correspond to an expected S/N = 5 Euclid will reach in the Wide Survey for extended sources. For each galaxy, redshifts calculated from the simulations with and without peculiar motions are available. We use the values without peculiar motion. Also, the photometric redshift was derived for each galaxy with the SED fitting code Phosphoros developed in the Euclid Collaboration (Paltani et al., in prep.), taking into account the photometric noise expected in Euclid bands complemented by the ground-based ones that will be available at the time of the Data Release 3 southern hemisphere4 (Euclid Collaboration 2022). The lightcone databases contain the entire probability distribution of the derived photometric redshift. Here, we use only the median values.

4. Extent of proto-clusters

In this section, we present estimates of the proto-cluster radius, rpc. We first derive a theoretical estimate for the radius evolution of a spherical overdensity in Sect. 4.1 and compare the results to simulations in Sect. 4.2.

4.1. Theoretical estimate

For rpc, we took the radius of the overdensity, which evolves into a galaxy cluster with radius r200 in the spherical collapse model described in Sect. 2. The proto-cluster radius, rpc in comoving units, rcom, can then be determined from the overdensity ratio through the relation:

r com = r 200 ( Ω m R ov DM 200 ) 1 / 3 , $$ \begin{aligned} r_{\rm com} = r_{200}~ \left( \frac{\Omega _{\rm m}~ R_{\rm ov-DM}}{200} \right)^{-1/3}, \end{aligned} $$(1)

and the physical radius of the proto-cluster is given by rphys = rcom/(1 + z).

Figure 2 shows the results of the calculations for the three cosmological models listed in Table 1. The results for the reference and Planck cosmology are practically identical, while those for the Millennium cosmology are different by a few per cent. Chiang et al. (2017) published similar calculations based on simulations, which are in good agreement with our results. Similarly, Muldrew et al. (2015) used simulations to assess the evolution of the proto-cluster radius, defined as enclosing 90% of the stellar mass ending up in the z = 0 cluster and found similar results. In their Fig. 2, we see that the physical radius shows a similar function of time with a value ∼2.5 times higher at z = 1 than at z = 0 and a factor of ∼1.2 higher at z = 5.

thumbnail Fig. 2.

Evolution of the proto-cluster radius in comoving and physical units scaled to r200 at z = 0. Results for three cosmological models are shown by red lines (reference model), blue lines (Millennium cosmology) and dashed black lines (Planck cosmology). The over-plotted black dots show the parameterised approximation of rcom (see Eq. (2)) for the reference cosmology and rphys for the Millennium cosmology.

For further practical work, we derived numerical fits to the results for rcom. The following approximation, valid for the redshift range z = 0.5 − 8, provides an accuracy better than 1%, with parameters listed in Table 3,

r com / r 200 = A z 0.7 + B z 0.95 + C z 1.1 + D . $$ \begin{aligned} r_{\rm com}/r_{200} = A~z^{0.7} + B~z^{0.95} + C~z^{1.1} + D. \end{aligned} $$(2)

Table 3.

Parameters describing the evolution of the proto-cluster radius with redshift (see Eq. (2)).

We use this relation for the reference cosmology in the subsequent work in this paper.

4.2. Comparison to simulations

We can compare the theoretical predictions for the proto-cluster radius with the distribution of the cluster member galaxies in the MAMBO and GAEA lightcones. Here and in the following, we use only proto-clusters that are fully contained in the field of view of the lightcone and in addition reject a small number of objects that are artefacts originating from common problems with the lightcone construction.

Figure 3 shows two proto-clusters from the MAMBO simulation, where galaxies inside and outside the proto-cluster radius (in three dimensions) are marked as members and non-members. As the centre of the proto-clusters, we chose the barycentre. As proto-cluster members, we took those galaxies that are members of the descendent cluster at redshift zero inside r200. All galaxies in a redshift slice with a width of five times the proto-cluster radius are shown in projection on the sky. We note that the majority of the proto-cluster members are located inside the estimated proto-cluster radius, and a few non-members, marked in blue in the figure, are found inside. These non-members are found close to the cluster boundary and are not bound into the cluster during the following collapse of the system. Some of the non-members are seen inside the circle in projection in the plots as black dots, but they are outside the proto-cluster spheres. Members of neighbouring proto-clusters in the same redshift interval are shown as green symbols.

thumbnail Fig. 3.

Examples of two proto-clusters at redshifts 1.666 and 2.8335 with 148 and 22 members, respectively. The proto-cluster members are shown as red-filled circles, while non-members inside the proto-cluster radius are shown as blue circles. Other non-members outside the proto-cluster radius in three dimensions are shown as small black dots, and members of other proto-clusters are shown as full green circles. The large black circle indicates the estimated proto-cluster radius, whose size is indicated in the top left corner of the plot.

Figure 4 shows the three-dimensional radial distribution of the galaxy number and number density of the members of MAMBO, GAEA-H, and GAEA-F proto-clusters compared to the radial distribution of non-members. The differential distribution in spherical shells is shown. For the proto-cluster centres, we used the barycentre, the mean of the mass distribution of all proto-cluster member galaxies. The distance of the galaxies to the centre in all proto-clusters was scaled to the estimated proto-cluster radius, rpc = rphysr200. The radius, rphys, has been defined in the sentence connected to Eq. (1). For the positions, the true locations of the galaxies were used without redshift space distortions due to galaxy peculiar motions.

thumbnail Fig. 4.

Three-dimensional radial distribution of the member galaxies in all MAMBO (top), GAEA-H (middle), and GAEA-F (bottom) proto-clusters (z = 1.5 − 4). The radius is in units of the estimated proto-cluster radius, rpc. The red lines show the numbers and densities of the proto-cluster members, and the blue lines those of the non-member galaxies. The solid lines show the galaxy number in the shells (left Y-axis), while the dotted lines show the galaxy density in the shells (right Y-axis).

The distributions of the galaxies in all three simulation data sets look very similar. The contamination of non-members inside the proto-cluster radius in the GAEA lightcones is higher than that in the MAMBO lightcone. About 80% of the members are located inside rpc with a contamination of ∼5% (MAMBO), ∼10% (GAEA) non-members in three dimensions. About 90% of the members are located inside 1.2 rpc with a contamination of ∼10% (MAMBO) and ∼20% (GAEA). At the moment we have no explanation for this difference of the contamination in the two light cones. Apart from this effect we see no significant difference between the different simulation samples. The theoretically estimated proto-cluster radius thus provides a good orientation for the expected size of proto-clusters. Results for splitting the proto-cluster sample up into three redshift shells are given in Table 4, where the completeness is defined as the fraction of member-galaxies contained inside the aperture radius compared to the total number of member galaxies. We note little variation with redshift.

Table 4.

Completeness and contamination of the members inside the estimated proto-cluster radius, rpc and 1.2 rpc, for different redshift shells for the MAMBO and GAEA samples

5. Radial galaxy density profiles

We used the GAEA and MAMBO simulations to study the typical radial density profiles of proto-clusters. Here and throughout the paper, we use the barycentre of the galaxy population as the proto-cluster centre. In Fig. 5 we show the mean three-dimensional proto-cluster profile for the GAEA-F and MAMBO samples for the redshift range z = 1.5 − 4. In addition we show with data for GAEA-F in the range z = 1.5 − 2, that there is no substantial change with redshift. The profiles were generated involving 6533 and 786 proto-clusters from the two simulations, respectively. Since we used radii scaled to the proto-cluster radius, rpc, we also determined the mean densities as scaled parameters in units of 4/3πrpc3. The mean profile for GAEA-H has the same shape but about 10% higher normalisation since this sample has relatively more proto-cluster members.

thumbnail Fig. 5.

Mean three-dimensional radial density profile of proto-clusters. The black solid line and blue points show the mean profile for all proto-clusters in the GAEA F lightcone in the redshift range z = 1.5 − 4, while the green curve shows that for proto-clusters with z = 1.5 − 2. The blue dashed curve shows the model fit. The red open symbols show the density profile for the clusters in the MAMBO lightcone (z = 1.5 − 4) for comparison. The galaxy density is scaled to the volume of the proto-cluster, volumerPC = (4/3)πrPC3.

We fitted the resulting profile with a model including a power-law profile with a core and an additional inner and outer slope:

n ( r ) = A ( r r c ) α [ 1 + ( r r c ) 2 ] β + α 2 [ 1 + ( r r s ) 2 ] γ + β + α 2 . $$ \begin{aligned} n(r) = A~\left(\frac{r}{r_{\rm c}}\right)^{-\alpha }~ \left[1 + \left(\frac{r}{r_{\rm c}}\right)^2 \right]^{-\beta +\frac{\alpha }{2}} \left[1 + \left(\frac{r}{r_{\rm s}}\right)^2 \right]^{-\gamma +\beta +\frac{\alpha }{2}}. \end{aligned} $$(3)

The resulting fit parameters for the three-dimensional case are shown in Table 5. We also determined the projected two-dimensional profiles, which look similar with a flatter slope. The profiles were fitted with the same relation, and the results are also shown in Table 5. In this case, rc and rs are projected radii. Figure 5 shows that the fit provides reasonable approximations. Actually, a fit with a function including only the first four parameters, with the first two terms on the right hand side of Eq. (3), provides a good approximation up to r ∼ rPC, but leaves a small shallow tail beyond. To remove this particular feature with an additional outer slope requires large values for the core radius, rs, and the slope parameter.

Table 5.

Fit parameters for the proto-cluster profiles in three and two dimensions for the formula in Eq. (3).

One interesting result from the average projected proto-cluster profile to keep in mind is that about half of the member galaxies reside inside 0.5 rpc. That implies that the density inside an aperture of 0.5 rpc is about 3 times higher than the density in the annulus at r = 0.5 to 1 rpc.

Inspection of individual profiles shows that there is a large variety of profile shapes. Figure 3 and in the Appendix Fig. A.3 show a selection of proto-clusters with some emphasis on systems with substructure. To devise a cluster detection method that works with assumptions on the proto-cluster shapes, as, for example, a matched filter algorithm, one needs an overview of the variation of the proto-cluster structures. Any treatment that regards azimuthally symmetric shapes in first order would rely on knowledge of profiles.

To get such an overview, we dissected the three-dimensional profiles out to rpc into three equal radial intervals and classified the proto-cluster profiles into five categories according to the density ratios of the different regions as listed in Table 6. Category 1 and 2 have a central peak, 3 and 4 have the maximum in the middle, and 5 has the maximum near rpc. Table 6 also shows the number of proto-clusters that fall into each category.

Table 6.

Definition of the five different categories of proto-cluster density profiles.

To display the variation of the profiles, we have determined the mean profile for each category with a resolution of 8 bins out to 1.6 rpc. We note that this provides a higher radial resolution of 5 bins inside rpc than the three radial bins used for the categorisation. This allows us to show the profiles in more detail. Not to let those proto-clusters with the largest number of galaxy members completely dominate the results, we use a weighting with a factor of n gal 1 / 2 $ n_{\mathrm{gal}}^{-1/2} $ per proto-cluster in averaging the profiles, where ngal is the total number of members in the simulations. Due to the scaling with proto-cluster radius and this weighing scheme, the resulting densities loose the normal physical units, and we show relative values. These mean profiles for categories 1–5 are shown in Figs. A.1 and A.2 in the Appendix. We note first of all that 73.2% of the proto-clusters have a high central density inside 0.2 rpc (much higher than in the other radial bins). 4.2% of these proto-clusters have a higher density in the third than in the second bin. But as shown in Fig. A.1 they have mostly very compact cores with few galaxies outside. The 24.6% of category 3 and 4 have the highest density in the middle radial region. Two examples of such clusters are shown in the upper two panels of Fig. A.3.

Only 17 proto-clusters have the highest density in the region from 2/3 rpc to 1 rpc. An example is shown in the bottom panel of Fig. A.3. Among these cases with high density in the outer annuli we find binary and multiple systems, which will nevertheless collapse into a single cluster by z = 0. Thus, the majority of the proto-clusters will show a significantly higher contrast of the central region compared to the overall system. It also implies that the central region will collapse earlier than redshift zero for most proto-clusters to form a smaller galaxy group or cluster first.

6. Density contrast in the projected galaxy distribution

6.1. Density contrast with respect to the global sky background

To explore the efficiency with which proto-clusters can be detected in the Euclid Wide Survey, we study the projected galaxy density contrast of proto-clusters against the galaxy background in this section. Here and in the following we designate all galaxies outside the proto-clusters, that is either in front or in the back, as background galaxies. We make use of the photometric redshifts that have been modelled in the simulations. To find the majority of the members of a proto-cluster we have to consider a selection window for the redshifts wide enough to cover the uncertainties of the photometric redshifts. Using the MAMBO lightcone, we illustrate in Fig. 6 the median photometric errors corresponding to including 50–90% of all galaxies at a given redshift, which we define as ‘completeness’. These photometric errors constitute the redshift windows used to detect PC. The half window size is shown as photo-z error parameter as a function of redshift in the Figure. We will designate the redshift interval for a 50% completeness limit as Δz (50%). These calculations have been performed for all galaxies in the lightcone, including non-members, for better statistics in 132 redshift bins. In each bin, the galaxies were sorted by Δz, the deviation of the photometric redshift from the true one. The maximum Δz of the 50% smallest values yields then, for example, Δz (50%). This completeness limit is identical to the median. We used this value in most of the following examples because it provides a detection efficiency close to the maximum, as shown in Sects. 6.2 and 7. Also shown is the official Euclid requirement for the redshift accuracy, Δz = 0.05 (1 + z) (e.g. Sartoris et al. 2016). This requirement is close to the median curve. At redshifts between z = 1.5 and 2.2, it is worse than the requirement. With increasing redshift, it gets better and falls below the requirement at z = 3 − 4. At these higher redshifts, the photometric bands bracket the Ly-break better.

thumbnail Fig. 6.

Photo-z error as a function of redshift for different completeness limits. The parameter photo-z error describes the half-width of the redshift window necessary to contain 50%, 60%, 70%, 80%, and 90% of all galaxies at a certain redshift. The dashed line shows the official requirement for the photometric accuracy.

Galaxy counts and densities in proto-clusters and background in the MAMBO lightcone (limited by the redshift range corresponding to given redshift uncertainties) were calculated for different aperture sizes (with respect to rpc)5. In practice, we counted all known proto-cluster members inside the aperture radius and all other galaxies as background in the cylinders defined by the redshift uncertainty, Δz(X%), and the aperture area. This was done for different redshift regions and different photometric uncertainty limits. Figure 7 shows two examples of the galaxy counts in proto-clusters and background for apertures, rap = 0.5 rpc for z = 1.5 − 2 and rap = rpc for z = 3 − 4 using a redshift window defined by Δz (50%). We note that the background galaxies always outnumber the proto-cluster galaxies. This is due to the fact that background galaxies are sampled over a much larger line of sight than proto-cluster members due to the limited photometric redshift accuracy, as will be explained below. We show the data as a function of the total galaxy number of the proto-cluster, which is a good proxy for the proto-cluster mass (Rykoff et al. 2014). Of course, the number of recovered proto-cluster galaxies increases with the total galaxy number, but also, the background increases somewhat due to the increase of rpc. Statistically, the number of detected galaxies with large aperture radius (r ∼ 2.5 rpc) scatters around 50% of the total number by construction for a Δz (50%) limit.

thumbnail Fig. 7.

Galaxy counts in proto-clusters (red) in comparison to the background (blue) as a function of the total galaxy number in the proto-clusters. The top row was taken in the redshift range z = 1.5 − 2 with an aperture of 0.5 rpc and the bottom row in the range z = 3 − 4 inside an aperture of rpc. For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

Results for the densities of the detected galaxies are shown in Fig. 8. Again, the projected densities of the background are always larger than the proto-cluster densities. As expected the galaxy density in the background is constant, while the average projected galaxy density increases with proto-cluster size and richness, as the proto-cluster volume increases. The latter increase is mainly shown by an increase in the mean density and its lower limit. Figure 9 summarises the results for the density statistics as a function of aperture radius and redshift for a 50% completeness and Euclid magnitude limit. Here we note that the background densities appear constant with changing aperture radius as expected. The densities for the proto-clusters, however, decrease with increasing aperture radius due to the decreasing density profiles.

thumbnail Fig. 8.

Galaxy densities in proto-clusters (red) in comparison to the background (blue) in the redshift range z = 1.5 − 2 with an aperture of 0.5 rpc. For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

thumbnail Fig. 9.

Mean projected galaxy densities in proto-clusters (solid lines) as a function of aperture radius in three redshift intervals. The dashed curves give the galaxy densities in the background. The results were derived for Δz (50%).

The fact that the projected background density is often very much larger than the proto-cluster density is a challenge for the reliable detection of the proto-clusters. Thus, before proceeding further, we study the reason for this situation in more detail. We have shown in Sect. 2 that the proto-cluster overdensities are not very large since we capture the proto-clusters before turn-around. This overdensity has to be compared with the line-of-sight ratio across the proto-cluster and the redshift range defined by the photometric redshift accuracy. In projection, we sample all the galaxies in the line-of-sight of the proto-cluster, which have a redshift inside the uncertainty limits of the photometric redshifts. Since the line of sight distance interval corresponding to the redshift uncertainties is much larger than the diameter of the proto-clusters, more background galaxies are sampled than proto-cluster members by the photometric redshift selection, in spite of the moderate galaxy overdensity in proto-clusters. To illustrate this, we show in Fig. 10 the ratio between the distance interval corresponding to the photometric redshift uncertainty and the proto-cluster diameter as a function of redshift. Here, we have used a redshift interval of Δz (50%); with a smaller completeness, the volume ratio would be smaller, but we would also sample fewer member galaxies. We clearly see that the volume from which the background galaxies are sampled is much larger than the proto-cluster volume (which has been simply approximated here by a cylinder). This large volume ratio is the consequence of using only broadband photometry for the redshift estimates.

thumbnail Fig. 10.

Ratio between the line-of-sight interval given by the photometric redshift uncertainty and the line of sight covered by proto-clusters, rz (50%))/rpc). The ratios for all MAMBO proto-clusters as a function of redshift are shown.

6.2. Significance of the density contrast

An additional problem complicates the detection of proto-clusters. While it is sometimes assumed naively that one detects the proto-clusters in a distinct and smooth background field, which is characterised by Poissonian density fluctuations, we are facing the situation that we have to detect the proto-clusters against a background characterised by the presence of cosmic large-scale structure. Since the aperture sizes for detecting proto-clusters sample the background at relatively small scales, where the cosmic large-scale structure is well in the non-linear regime, we have to cope with background fluctuations larger than the Poisson noise for the relevant galaxy counts. We illustrate this by means of the MAMBO simulation below.

The rms of the background density fluctuations was determined in different aperture radii in five redshift ranges (z = 1.5 − 2, 2 − 2.5, 2.5 − 3, 3 − 3.5, and 3.5 − 4). The background densities were evaluated at the positions of the proto-clusters, while the proto-cluster galaxies were excluded from the background density calculation. In principle, one could have also used random positions, but in our approach, we include the small effect that a tiny part of the line of sight is occupied by the proto-cluster and not by the background. This time the aperture radius is kept fixed for a chosen aperture value and not scaled with the proto-cluster radius. For the line of sight integration we take the redshift interval Δz (50%). We show the results in Fig. 11 for six aperture radii (4, 5, 6, 7, 8, and 9 arcmin) and compare them with Poisson errors. While the Poisson errors decrease steadily with the aperture radius, as expected, we note hardly any decrease in the variance, except for the first aperture radius bin, where shot noise still plays an important role. The variance decreases with aperture radius not because of the improving count statistics but because of the decreasing variance of the large-scale structure with scale. This decrease is comparatively slow and hardly noticeable in the relevant scale range. For this reason, the significance of detection above the background cannot easily be improved by increasing the aperture size, as it would be for Poisson errors.

thumbnail Fig. 11.

Relative rms deviations of the projected background galaxy density fluctuations (solid lines) as a function of aperture radius for five different redshift intervals. The dashed lines show the estimated Poisson errors for comparison.

To characterise the significance of detection of the proto-clusters we take the ratio of the detected counts to the rms of the background fluctuations. We do not include the shot noise of the proto-cluster counts, which would be important in the calculation of the error of the detected signal. We look here just at the detection significance. We determine the detection significance for every proto-cluster by using the actually detectable counts in the simulations, and for the background, we take the rms of the background counts determined from all proto-cluster positions in the same redshift bin and for the same aperture radius. We note that the fluctuations on the relevant scales are actually mildly non-linear and therefore the use of the RMS for the background noise is only a reasonable approximation. This will be accounted for more precisely in the practicle application with the calibration of the proto-cluster detection techniques in simulations. Examples of the results are shown in Fig. 12 for the redshift range z = 1.5 − 2 for aperture radii of rap = 0.5 and 1.0 rpc. We show the detection significance as a function of the total galaxy number belonging to the proto-clusters in the simulation.

thumbnail Fig. 12.

Detection significance of proto-clusters from the MAMBO simulation in the redshift interval z = 1.5 − 2. The panels show results for aperture radii of 0.5 (top) and 1 rpc (bottom). For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

The values for the significance are small. There is a correlation of the significance with the richness of the proto-cluster. This correlation is more pronounced for the better number statistics with the larger aperture. For the smaller aperture the behaviour for the less rich proto-clusters is dominated by Poisson noise, but a correlation becomes apparent at richer proto-clusters: the mean and the lower limit of the significance clearly increases.

The upper panel of Fig. 13 summarises the significance studied for a Δz (50%) limit. In the plot, we show both the mean and maximum values for the significance of each parameter selection. The maximum values for the smallest aperture radius are relatively high due to the large Poisson noise for small counts. We note that for the highest redshift bin, the maximum values of the significance is always highest for the same reason, but for the mean values, the highest redshift bin is not more significant than the lowest one.

thumbnail Fig. 13.

Detection significance of proto-clusters as a function of aperture radius (top) and as a function of completeness limit (bottom) for three different redshift intervals. Top: The solid curves show the mean significance, while the dashed curves show the maxima. A redshift interval of Δz (50%) was used. Bottom: The completeness limit, Δz (X%), covers the range 30–90%. The solid lines show the results for an aperture radius of 0.5 rpc and dashed lines for 1 rpc.

We also explored if we can increase the significance by using a higher or lower completeness limit so that we can sample more galaxies for each proto-cluster or deal with a smaller redshift range for the background. The lower panel of Fig. 13 shows how the significance changes with the completeness limit for two apertures (0.5 and 1 rpc) and three redshift intervals. We note that the 50% or 60% completeness limits provide the maximum in all cases. Otherwise, the curves are relatively flat. Only for the highest redshift bin there is a notable decrease of the significance with increasing completeness because it is closer to the detection limit for the galaxies. The main conclusion is, however, that there is not much room for improvement in the detection efficiency with a different choice of the completeness limit.

Overall, a strategy that uses Δz (50%) and apertures smaller than 1 rpc (for example 0.5 rpc) provides a close-to-optimal solution. While at decreasing radii the signal-to-noise gets better, the statistics gets worse, and therefore a radius of around 0.5 rpc gives a good compromise. Thus for most of the following analysis we use this selection.

An improvement can, of course, be expected from using a probability-based detection algorithm, for example, a matched filter technique. The present study provides a useful guideline for such a method since it shows which proto-cluster region contributes most to the detection signal.

7. Detection significance with local background assessment

In this section we explore whether the detection significance can be improved with a local background assessment. In the previous section, we used the global background variance for the significance calculation. Here, we test the behaviour of the background if the background is taken for each proto-cluster from an annulus around it and its variance determined from the data. Due to the spatial correlation of the galaxy density in the large-scale structure, we could expect that there would be a correlation between the background galaxy density inside the proto-cluster aperture and that of the surrounding annulus. This correlation is shown for the simulated MAMBO proto-clusters in Fig. 14. Since there is a clear correlation, the estimated background variance can be reduced compared to that considered in the previous section if we add information about the local background.

thumbnail Fig. 14.

Correlation of the galaxy background density for proto-clusters in an aperture radius of 5 arcmin and the background taken from an annulus of 9 to 13 arcmin radius from the cluster centre around each proto-cluster. The colours mark different redshift intervals from z = 1.5 to 4.0 with a width of Δz = 0.5.

We estimated the background in an annulus around the detection aperture with radii of 9 and 13 arcmin. The radii of proto-clusters for masses of 1014 and 1015M for different redshifts can be found in Table 7. For z = 1.5 we find rpc ∼ 4.4 − 9.6 arcmin and for z = 4 we get rpc ∼ 3.4 − 7.4 arcmin for this mass range. The background region is thus outside the proto-clusters. The correlation of the background in the aperture and the annulus, shown in Fig. 14, helps us in the following way. A measurement of the background in the annulus can be used to normalise the background in the aperture region. The residual aperture backround will then be smaller than the variance without this normalisation. The relevant residual background variance is then the variance of the ratio of the galaxy density inside the aperture to the galaxy density in the background annulus. It is the scatter of the relation shown in Fig. 14.

Table 7.

Statistics concerning the proto-cluster abundance and sky coverage as a function of redshift.

In Fig. 15 the rms of the density fluctuations is shown as a function of aperture radius and redshift in an equivalent way to the results in Fig. 11. Thus, the results can be directly compared in these figures, and we note, in most cases, an improvement of almost a factor of 2. The improvement is due to the fact that the variations in the projected galaxy density background are caused only to a minor degree by Poisson noise but mostly by large-scale structure, which can, for example, be well described by clustering statistics like the two-point correlation function. Therefore we obtain a better estimate of the local background if we take a measurement in its immediate neighbourhood.

thumbnail Fig. 15.

Relative (rms) variations of the projected background galaxy densities (solid lines) with a local assessment as a function of aperture radius for five different redshift intervals.

This has, of course, an effect on the detection significance, which is illustrated in Fig. 16. We note that now more proto-clusters reach a detection significance of 2σ, and a large fraction of detections has significances above 1σ, which was not the case in the previous section.

thumbnail Fig. 16.

Detection significance for proto-clusters for the redshift range z = 1.5 − 2 (top) and z = 3 − 4 (bottom). For the galaxy selection a redshift interval corresponding to Δz (50%) was used. Top: A detection aperture of 5 arcmin was applied with a local background assessment. Bottom: An aperture of 7 arcmin was used.

Figure 17 (top) summarises the results on the detection significances. This figure can be compared to Fig. 13, with a change in the radius units to arcmin. We note that the local background assessment can provide a significant improvement. For real observations, the method can be tested by studying the spatial correlation of the galaxy density fluctuations in the field outside the proto-clusters and by using these results to determine the rms of the background fluctuations. This can actually be applied in most detection methods, for example, for matched-filter detections. In practice, the filter would, for example, include the background ring with a negative weighting.

thumbnail Fig. 17.

Mean detection significance of proto-clusters as a function of aperture radius (top) and as a function of the completeness limit (bottom). The set-up is similar to Fig. 13, but with a local background and an aperture radius in arcminutes. Top: A redshift interval of Δz (50%) was used. Bottom: Values of the completeness limit, Δz (X%), in the range of 30–90% were used. The solid lines show the results for an aperture radius of 0.5 rpc and the dashed lines show the results for 1 rpc.

The bottom panel of Fig. 17 shows, analogously to Fig. 13 the mean detection significance as a function of the photometric redshift completeness limit for five different redshift intervals. The solid lines are for an aperture radius of 0.5 rpc and the dashed lines for 1 rpc. We note that the results for the redshift intervals Δz (30%) and Δz (40%) have improved in comparison to the other redshift intervals, but Δz (50%) is still a good choice for the detection.

The practical meaning of the significance of a detection algorithm depends also on the rareness of the objects to be detected. If a large sky area has to be inspected to find an object, one needs a high detection significance threshold to keep the detected samples reasonably pure. In our case, we will find below that proto-clusters are quite abundant in projection on the sky. Therefore, we can still obtain a valuable proto-cluster candidate sample with a low significance threshold. The discussion section provides further details on this point.

In summary, we conclude that the best strategy for the detection of proto-clusters is to use an aperture radius smaller than rpc and a local background assessment. Also, a redshift range given by Δz (50%) is quite optimal, but an exploration of higher completeness limits is often not much worse. A more sophisticated detection algorithm will, in this respect, anyway, include a probability distributions of photometric redshifts.

8. Mass-richness relation

For galaxy clusters, the richness, the number of member galaxies inside a given radius and magnitude limit can be used as a proxy for the cluster mass (e.g. Andreon & Hurn 2010; Rykoff et al. 2014; Castignani & Benoist 2016). Therefore, we test in this section how tight the mass-richness relation is for the proto-clusters in the simulations. Here, we inspect the intrinsic relation, including all known proto-cluster members in the simulations (with magnitude limits defined in Sect. 3) and not only the ones that would be detected with a certain prescription.

The mass-richness relation was determined for the MAMBO, GAEA H, and GAEA F samples of proto-clusters. Here, the mass of the system is that of the descendent cluster at z = 0 inside r200 since we attribute all the mass of the descendent cluster to the proto-cluster at any redshift. We binned the proto-clusters into subsamples of redshift bins with a width of Δz = 0.25 starting at z = 1.5 and an extra bin for z = 1.5 − 2. This leaves several hundred proto-clusters per bin for the GAEA samples and an order of a hundred for MAMBO.

The results are shown in Fig. 18 for the GAEA H simulations. The results from the other lightcones look similar. We find a few proto-clusters with very low member numbers well outside the variance of the number counts. They were identified as artefacts due to some common problems in the production of the lightcones. We excluded them from our study by a cut, which removes all cases with a negative 3σ deviation from the mean relation. The relative scatter that we observe in the relations, as shown for some examples in Fig. 18, is typically around 40% and decreases with richness to about 20%. This scatter is distinctly different from Poisson uncertainties und usually significantly larger. We provide some further illustrations of this fact towards the end of this Section.

thumbnail Fig. 18.

Mass-richness relation for GAEA H for the redshift range z = 1.5 − 2 (top) and z = 3 − 3.25 (bottom). The small black dots show all proto-clusters, and the blue dots are the ones above the cut, which is shown as a solid black line. The linear regression fit to the relation is shown by a dashed line. M200 is in units of M.

The distribution of the number counts in Fig. 18 and all other relations studied is highly suggestive of a linear relation in logarithmic space. Therefore, we fitted the distribution by a relation of the form:

N g ( M , z ) = N g 0 ( z ) ( M 10 14 M ) α . $$ \begin{aligned} N_{\rm g}(M,z) = N_{\rm g0}(z) ~ \left({M \over 10^{14}\,M_{\odot }}\right) ^{\alpha }. \end{aligned} $$(4)

The fits are shown in the figures as dashed lines. The fit results for the normalisation as a function of redshift are shown in the top panel of Fig. 19 for the three sets of simulations. The results are encouragingly similar. We fitted this mass-richness relation normalisation as a function of redshift with a third-order polynomial expression. The lines in Fig. 19 show the fitted functions, and the resulting parameters are given in Table 8.

thumbnail Fig. 19.

Top: Normalisation, Ng0, for the mass-richness relation as a function of redshift. The lines show the third-order polynomial approximation. The normalisation is given for log10M = 14, with M in units of solar mass. Bottom: Slope parameter, α, of the mass-richness relation as a function of redshift. The dotted lines show fits of linear relations. The colours have the same meaning as above.

Table 8.

Fit parameters of the polynomial expression for the normalisation, Ng0(z) = a + bz + cz2 + dz3, of the mass-richness relation as a function of redshift for the three proto-cluster samples. We also show the slope, α, averaged over the redshift intervals.

The slope of the fitted mass-richness relation for the different redshift shells is shown in the bottom panel of Fig. 19. The values are about 0.9 or a little higher. Table 8 also shows the mean slopes averaged over the redshift intervals. This is in line with the observational finding that the efficiency of galaxy formation decreases with halo mass in the mass range above 1012M (e.g. Behroozi et al. 2013; Kravtsov et al. 2018). The simulations attempted to reproduce this empirical finding. It is also good to observe that the mass-richness relations are similar in all three approaches of painting galaxy evolution onto the cosmological simulations. The small deviations are also due to the limited statistics.

We also studied the variance of the richness in the mass-richness relations because we wanted to see if we could approximate this quantity with simple Poisson statistics. We used the GAEA sample here for better statistics. We divided the sample into redshift bins with the division described above. Each of the eleven redshift bins was subdivided into several mass bins, with at least 50 members per interval. This left us with 12–25 mass bins per redshift bin. For each subsample, we determined the root mean square (RMS) deviation between the mean number of galaxies in the bin and the actual counts. The results are plotted for the GAEA H sample in Fig. 20. We compare the results for the scatter in the galaxy counts with the expectation for Poisson statistics. The relative scatter for the galaxy counts in the simulations is fairly constant as a function of the mean galaxy number: it decreases from about 0.4 to 0.25 over the range of galaxy counts from 3 to 200. This is distinctly different from the Poisson distribution, where the relative RMS changes by a factor of about 8. This deviation from Poisson statistics is observed in the simulations. At least here, the formation of galaxies in the overdense region of the proto-cluster is not a Poisson point process. It is not immediately clear if this is purely a result of the process of ‘painting galaxies onto the N-body simulations’ or if we should expect a similar behaviour in nature. But this exercise gives at least a warning that we should not use simple Poisson statistics for the uncertainty of the mass-richness relation unless we have shown that it works for observational data.

thumbnail Fig. 20.

Relative rms deviation of the galaxy number counts as a function of the mean number of galaxies for the proto-clusters in each mass-redshift bin (data points). We also show the expectation for the scatter if it would be governed by Poisson statistics, which is shown by the solid line.

To further illustrate this difference between the variance of the number counts for a given mass and a Poisson distribution, we study the distribution of the richness for a given mass in two narrow mass intervals as shown in Fig. 21 for the redshift range z = 1.5 − 2. The mean masses in the intervals are ∼1 × 1014M and ∼4.5 × 1014M. To remove the scatter in the number counts due to the width of the mass interval, we normalise the number of members of each proto-cluster by a correction factor m ¯ / m pc $ \bar m / m_{\mathrm{pc}} $. If for example the mass of a proto-cluster in the first bin is 1.15 × 1014M, we multiply the number counts by a factor of 1/1.15. We note that in both cases, the observed distribution is wider than the Poisson prediction, and this difference is much more pronounced in the higher mass bin.

thumbnail Fig. 21.

Comparison of the distribution of the galaxy membership counts (richness) for given mass and the prediction for Poissionian scatter (black curves). The mean masses in the interval are 1014M (red curve) and 4.5 × 1014M (blue curve), respectively. The curves show cumulative histograms normalised to unity. The data are taken from the GAEA F lightcone in the redshift interval z = 1.5 − 2.

Overall, we find mass-richness relations that are close to linear with logarithmic slopes close to 0.9. Again, we emphasise that these are the relations found in the simulations with known galaxy memberships.

9. Proto-cluster abundance and sky coverage

The abundance of proto-clusters is determined by their definition, which in our study is given by the statement that they evolve into present-day galaxy clusters with masses M200 ≥ 1014M. Practically, we defined proto-clusters above as all galaxies that will end up in the descendent cluster at z = 0 inside r200. Similarly, we associate all matter that will finally be assembled in the zero redshift cluster as belonging to the proto-cluster. This is exactly the mass contained inside rpc in the top-hat overdensity model. Thus, in this context, the proto-cluster has the same mass as the descendent cluster at all times. Therefore, the density of proto-clusters for a certain mass limit in comoving coordinates is that of the present-day clusters above that mass limit. The cumulative present-day cluster mass function, n(> M), can be obtained from observations, for example, from the cluster abundance in the REFLEX survey of X-ray luminous galaxy clusters (Böhringer et al. 2013, 2014), one of the best defined and comprehensive galaxy cluster samples in the nearby Universe. The cumulative mass function can be approximated by the following function:

n ( > M ) = α ( M 14 2 ) β exp ( M 14 δ γ ) , $$ \begin{aligned} n({>}M) = \alpha \left({M_{\rm 14} \over 2 }\right)^{-\beta } ~ \exp \left({-M_{\rm 14}^{\delta }\over \gamma }\right), \end{aligned} $$(5)

where M14 is the proto-cluster mass in units of 1014M for a fiducial radius of r200, α = 1.237 × 10−5 Mpc−3, β = 0.907, γ = 0.961, and δ = 0.625 (Böhringer et al. 2017). For the calculation of the comoving volume as a function of redshift, we used the reference cosmology. Table 7 shows the results for different lower mass limits for volumes for which proto-cluster counts were integrated over redshift intervals of Δz = 0.5. The redshifts listed in the Table, give the upper bound of the redshift interval. The Table also provides values for the proto-cluster radii calculated by means of Eq. (2) in units of arcmin on the sky for proto-clusters with masses of 1014M and 1015M and sky surface areas of proto-clusters with masses ≥1014M.

Based on the proto-cluster abundances described above, we can study how well we can separate proto-clusters in the sky if we have only photometric redshifts. We look at the fraction of the sky covered by proto-clusters within a redshift slice, which corresponds to the uncertainty of photometric redshifts. For the study, we select all the proto-clusters and their member galaxies from the MAMBO lightcone in redshift intervals given by the photometric redshift accuracy for a 50% completeness limit and show how they are projected onto the sky. Figure 22 shows proto-clusters and member galaxies selected from redshifts around z = 2 with a redshift range of Δz, (50%). The core regions (r ≤ 0.5 rpc), which will stick out due to their higher density contrast, are highlighted in the plot.

thumbnail Fig. 22.

Proto-clusters and their galaxies in a redshift slice around z = 2 with a redshift range according to the photometric redshift accuracy limits for 50% completeness and Mpc ≥ 1014M. The dense core regions inside r ≤ 0.5 rpc are shown by black circles and red dots for the galaxies, while the outer regions are shown in grey.

In total, there are 132 proto-clusters; in this exercise, we also included proto-clusters that are not completely contained in the field of view of the MAMBO lightcone.

We note that the sky is indeed densely covered by the proto-clusters. By just integrating the surface area of each proto-cluster (inside rpc), taking into account that some proto-cluster have some of their areas outside the lightcone, we find that the sum of all proto-cluster areas is 91%, while the fraction of covered sky is about 59%, due to overlaps. We have seen in Sects. 6 and 7, however, that proto-clusters are detected with sufficient significance around 2σ only at radii smaller than the proto-cluster radius and in Sect. 5, we showed that typically half the member galaxies are found inside 0.5 rpc where they have a 2.7 times higher projected density than in the annulus around. Therefore, to give an impression of the sky coverage of the recognisable core regions of the proto-clusters, we also quote the results for proto-cluster radii of 0.5 rpc, which is a total fractional area of 23% and a fraction of the sky covered of 21%. Figure 23 illustrates the case for both proto-cluster radii. We note that in some regions, the proto-cluster cores overlap, which is the result of clustering in large-scale structures.

thumbnail Fig. 23.

Number of MAMBO proto-clusters and the sky-filling factor in a Δz (50%) redshift interval as a function of redshift and proto-cluster mass limit. The numbers are only shown for the sample with Mpc ≥ 1014M as a dotted line. Solid lines show the fraction of the sky covered by the proto-clusters, and the dashed lines display the combined sky area of the proto-clusters. The thick lines with larger data points show the results for the core regions, while the thin lines show those for the region out to rpc. For the core regions of the Mpc ≥ 3 × 1014M proto-clusters, the dashed and solid lines overlap.

We can, however, look at this from a more practical point of view. If we relax our ambition to detect all proto-clusters with a final mass of 1014M and look, for example, only at proto-clusters above a mass limit of 3 × 1014M, the situation looks much better, as shown in Fig. 24. If we increase the mass limit even further, for example, to 8 × 1014M we find only none to three proto-clusters per Δz (50%) in the MAMBO lightcone, with filling factors of a few per cent. Only in the highest redshift bin, are there four proto-clusters with a filling factor of 7%. These larger mass objects correspond probably better to the typical proto-clusters that have been identified in optical surveys. If we had included smaller systems (< 1014M), the picture would have looked worse. Thus, the whole problem is that of detecting massive proto-clusters in the presence of large-scale structures in the background, a problem that we have already encountered in section 6, which has already been taken into account in the calculation of the detection significance. It therefore appears that targeting proto-clusters with a mass limit of 1014M may be ambitious. In Fig. 25 we also show the sky coverage for M > 1014M proto-clusters around redhift z ∼ 3.5. The sky coverage is less dramatic than in the low redshift case. But we note in all three pictures the substantial clumping of the proto-clusters. This may sometimes cause difficulties in clearly separate structures, which are identified with separate proto-clusters in the simulations.

thumbnail Fig. 24.

Proto-clusters and their galaxies in a redshift slice around z = 2 as in Fig. 22 but with a higher mass limit of the proto-clusters of Mpc = 3 × 1014M. Symbols and colours have the same meaning as in Fig. 22.

In Fig. 23, we summarise the results for our target redshift range at eight redshift values. We show the sky coverage of the proto-clusters in the Δz (50%) redshift slices and also the sum of their sky area. The sky coverage factors are decreasing with redshift. Since only the centres of the proto-clusters will be detectable and their sky-filling factors never exceed 30% even for the worst case, proto-clusters should, in general, be separable from detection algorithms, except for the regions of proto-superclusters.

10. Discussion

The previous sections have shown that the approximate analytic concept for the prediction of the proto-cluster properties such as size, density contrast, and abundance, developed above, can provide a robust guideline to explore proto-clusters in the Euclid Wide Survey. In particular, the prediction for the proto-cluster radius gives a good approach to defining a fiducial radius for a proto-cluster. In addition, we have obtained further useful descriptions, such as the typical proto-cluster profiles and their variations by means of the simulations. We note that some of the presented properties are intrinsic in the sense that they are described on the basis of the information given by the simulations. This includes the proto-cluster radii, profiles, abundances, and mass-richness relation. In contrast the detection significances are observed properties, and in their case the information from the simulations that is used in the same as would be available to an observer.

As an application of the information given in Table 7, we calculate the expected volumes of proto-clusters and compare them to some observed examples in the literature. From the radii given in arcmin, we calculate the comoving volumes for proto-clusters with a descendent mass of 1015M at redshifts 2, 3, 4, and 5 with results of about 9700, 13 000, 15 000, and 16 000 Mpc3, respectively, and compare them to the volumes assigned to some prominent proto-clusters by Casey (2016). The quoted volumes for COSMOS (z = 2.10) and (z = 2.47) of about 15 000 Mpc3 are larger than what is expected for the most massive proto-clusters, and one might conclude that we are looking at two or more closely neighbouring structures, as we see them in Figs. 22, 24, and 25. The combined volume of three fields in SSA22 (z = 3.09) with a value of 21 000 Mpc3 is consistent with the expectation for massive proto-clusters if we take the three fields as different systems. The value of 20 000 Mpc3 for HDF 850.1 (z = 5.18) is a bit high but could still be explained by one very massive proto-cluster if the boundary was considered slightly too generous. For MRC1138−256 (z = 2.16) and AzTEC-3 (z = 5.18) the observed values are comparably small, and either the volumes assigned concern only part of the proto-cluster, or these proto-clusters have descendants with a mass much lower than 1015M. In a recent CO survey around MRC1138−256 (Jin et al. 2021), a large overdensity of CO emitters was found as part of a filamentary structure with an extent of about 120 comoving Mpc. The extent of 38 Mpc in physical scale is too large for one proto-cluster and a structure of this size should break up to form several virialised units. But within this filament, MRC1138−256 could indeed be a larger proto-cluster than mentioned above.

thumbnail Fig. 25.

Proto-clusters with Mpc ≥ 1014M and their galaxies in a redshift slice around z = 3.5 with a redshift range according to the photometric redshift accuracy limits for 50% completeness. Symbols and colours have the same meaning as in Fig. 22.

With these tools, we can prepare the strategy for the detection of proto-clusters in the Euclid Survey and the assessment of their properties. They could, in particular, provide a good orientation for the parameter selection for more sophisticated detection algorithms than the aperture counts used here.

In Sects. 6 and 7, we reported the moderate detection significance for proto-clusters, which can be achieved with the Euclid Survey with the given photometric redshift uncertainties based on broadband photometry. This will make it difficult, in general, to characterise proto-clusters quantitatively without further follow-up observations. On the other hand, many proto-clusters will be detectable with sufficient significance (∼2σ) to obtain a large number of highly likely candidates. Since the proto-clusters have a large sky-filling factor, as shown in Sect. 10, one can actually afford a low detection significance threshold. The following example may best illustrate this. If we study a contiguous sky region of 200 times the area of a typical proto-cluster and the region contains 100 such systems, corresponding to a filling factor of 50%, a 2σ detection threshold would lead to an average false detection rate of 5 contaminating events (since only the positive deviations of the distribution count), a false detection rate of 5%, which could be tolerable for statistical studies.

What we have described here is a worst-case scenario using a very simple detection aperture. Several sophisticated detection algorithms have been developed for the detection of proto-clusters in the Euclid Survey. Using a particular aperture and redshift interval exploits only part of the information given by the observations. A good detection algorithm will make use of the complete available information, including galaxies at all relevant radii and the complete probability distributions of the photometric redshifts with optimal weights. Therefore, we can easily expect a significant increase in the detection efficiency with the proper use of dedicated algorithms. One has to be careful, however, when proto-cluster shapes are assumed in the algorithms about the possible introduction of selection effects. The examples of unusual proto-cluster profiles found in Sect. 5 are, in this respect, interesting test cases.

We can illustrate the unique opportunity offered by Euclid, also with the following example. We focus on the most massive proto-clusters with M ≥ 8 × 1014M mentioned above with a typical sky-filling factor of ∼5%. A dedicated algorithm is expected to improve the detection significance from ∼2 to at least ∼3σ. This would correspond to false detections with a sky-filling factor of 0.15%. Thus, we would obtain a ratio of about 34 true to one false detection. In total, we can expect of the order of 40 000 proto-clusters with a mass limit of M ≥ 8 × 1014M in the complete Euclid Wide Survey in the redshift range z = 1.5 − 4. This will definitely provide a relatively pure, interesting, and unique proto-cluster candidate sample.

A sweet spot for the detection of proto-clusters in the considered redshift range is z ∼ 3, where the density contrast turns out to be the highest. This is an interesting region where Ly-break galaxy surveys revealed the first proto-clusters. With a further boost of significance by means of dedicated detection algorithms, the data should be sufficient to estimate selection functions and perform population statistics. What will be particularly interesting is the unprecedentedly large survey volume of the Euclid Wide Survey, which will turn up the most extreme and rare objects that have not been observed in the available much smaller survey areas.

The present study also leads to the question: how can the contrast above the background be improved? Narrowband surveys will, of course, lead to more precise photometric redshifts and thus provide an improvement. We can take a look at two examples. The survey of Yamada et al. (2012) in the SA22 field at z ∼ 3 with a narrowband filter Δ λ = 77 Å provides an accuracy of about Δz = 0.063. For the COSMOS survey Chiang et al. (2014) obtained Δz = 0.025 (1 + z). Compared to the numbers for the Euclid data given above, this is an improvement of about a factor of two, which is, of course, very helpful, but it does not improve the situation so much that it allows a precise assessment of the proto-cluster properties. Thus, spectroscopic follow-up observations will be mandatory for an accurate study of the most interesting proto-clusters.

11. Summary and conclusions

We have provided an overview of how galaxy proto-clusters in the redshift range z = 1.5 − 4 are expected to appear in the Euclid Wide Survey in order to assist in the preparation of the survey analysis. The paper reports results on the following studies of proto-cluster properties.

  • A practical estimate of the proto-cluster radius was obtained on the basis of an analytical model for the evolution of a homogeneous top-hat overdensity. A comparison with cosmological simulations showed that about 80% of the members of the descendent cluster (inside r200) are contained within this radius with contamination of about 5–10%. We provided an analytic approximation of the evolution of the proto-cluster radius with redshift (Sect. 4).

  • The mean radial galaxy distribution of proto-clusters can be well described with a cored profile with a specified inner and outer logarithmic slope. We provide best-fit formulae for the mean three-dimensional and projected profiles (Sect. 5). Individual proto-clusters, however, show a large variety. About three-quarters of the proto-clusters have a pronounced dense core. However, proto-clusters with multiple substructure components can have profiles with a density maximum at larger radii.

  • We investigated which density contrast in the galaxy distribution proto-clusters can be observed against the galaxy background. We assumed that photometric data from the Euclid Survey and broadband ground-based auxiliary data (at the time of the third data release) are available for this study to estimate photometric redshifts. We showed that the small overdensity of proto-clusters, the large redshift depth that has to be sampled, and the large-scale structure variance of the background lead to a moderate detection significance for simple detections in circular apertures. With a local background assessment and apertures focused on the inner regions of the proto-clusters, significances of the order of 2σ can be reached. Because proto-clusters are not very rare, useful samples of highly likely proto-cluster candidates can be obtained with such detection thresholds.

  • The cluster richness (i.e. the number of galaxies that are members of the proto-cluster) was found to be tightly correlated with the mass of the descendent cluster. We found a relation with a logarithmic slope around of 0.9, which may reflect the decreasing galaxy formation efficiency with increasing halo mass for the most massive halos. We provided analytical approximations to the mass-richness relation as a function of mass and redshift for the GAEA and MAMBO simulations (Sect. 8).

  • The abundance of proto-clusters is determined by their definition of being the precursors of galaxy clusters at redshift zero with masses, M200 ≥ 1014M. The matter fraction in such clusters was, for example, determined in Böhringer et al. (2017) to be about 4%, and the comoving number density can be obtained from this result. By knowing the abundance and sizes of the objects, one can calculate how densely the proto-clusters fill the sky for a redshift range comparable to the uncertainties of the photometric redshifts. We found that the area that is filled is quite large, and proto-clusters with the quoted mass limit start to overlap significantly.

There are several efforts in the Euclid Consortium to devise optimised detection algorithms for the search of proto-clusters in the Euclid Survey. They will use more sophisticated techniques, which also involve probability distributions for the photometric galaxy redshifts. This will provide the leverage to improve the detection efficiency over what has been described here. With the large sky area explored by Euclid, an enormously large database of likely proto-clusters and their properties will be provided, which can bring proto-cluster studies and follow-up observations to a new level.


1

The visual band of the Euclid VIS instrument covers a wavelength range from 500 to 900 nm.

2

The radius r200 is the radius inside which the mean matter overdensity of the cluster is 200 times the critical density of the Universe at the cluster redshift.

3

We note that these magnitude limits deviate slightly from the limits given for the survey in Sect. 1. The constantly evolving work on the survey definition and in the science working group cannot always be perfectly synchronised in the Euclid preparation phase.

4

The photometric data used for the determination of the photometric redshifts include the Euclid bands: IE, YE, JE, HE and the LSST bands: u, g, r, i, z.

5

We present here only the results from the MAMBO lightcone due to some problems in the cross-identification of individual proto-cluster members in different simulation result tables at the time of writing. This problem is being solved.

Acknowledgments

HB thanks the Munich Excellence Cluster ’Origins’ for support. GC acknowledges support by the Deutsches Luft- und Raumfahrt Zentrum through grant no. 50 OR 2204. HD acknowledges financial support from the Agencia Estatal de Investigación del Ministerio de Ciencia e Innovación (AEI-MCINN) under grant (La evolución de los cúmulos de galaxias desde el amanecer hasta el mediodía cósmico) with reference (PID2019-105776GB-I00/DOI:10.13039/501100011033) and del Ministerio de Ciencia, Innovación y Universidades (MCIU/AEI) under grant (Construcción de cúmulos de galaxias en formación a través de la formación estelar ocurecida por el polvo) and the European Regional Development Fund (ERDF) with reference (PID2022-143243NB-I00/DOI:10.13039/501100011033). LM acknowledges the financial contribution from the grant PRIN-MUR 2022 20227RNLY3 “The concordance cosmological model: stress-tests with galaxy clusters” supported by Next Generation EU and from the grants ASI n.2018-23-HH.0 and n. 2024-10-HH.0 “Attività scientifiche per la missione Euclid – fase E” The Euclid Consortium acknowledges the European Space Agency and a number of agencies and institutes that have supported the development of Euclid, in particular the Agenzia Spaziale Italiana, the Austrian Forschungsförderungsgesellschaft funded through BMK, the Belgian Science Policy, the Canadian Euclid Consortium, the Deutsches Zentrum für Luft- und Raumfahrt, the DTU Space and the Niels Bohr Institute in Denmark, the French Centre National d’Etudes Spatiales, the Fundação para a Ciência e a Tecnologia, the Hungarian Academy of Sciences, the Ministerio de Ciencia, Innovación y Universidades, the National Aeronautics and Space Administration, the National Astronomical Observatory of Japan, the Netherlandse Onderzoekschool Voor Astronomie, the Norwegian Space Agency, the Research Council of Finland, the Romanian Space Agency, the State Secretariat for Education, Research, and Innovation (SERI) at the Swiss Space Office (SSO), and the United Kingdom Space Agency. A complete and detailed list is available on the Euclid web site (https://www.euclid-ec.org).

References

  1. Alberts, S., & Noble, A. 2022, Universe, 8, 554 [Google Scholar]
  2. Andreon, S., & Hurn, M. A. 2010, MNRAS, 404, 1922 [NASA ADS] [Google Scholar]
  3. Behroozi, P. S., Wechsler, R. H., & Conroy, C. 2013, ApJ, 770, 57 [NASA ADS] [CrossRef] [Google Scholar]
  4. Böhringer, H., Chon, G., Collins, C. A., et al. 2013, A&A, 555, A30 [Google Scholar]
  5. Böhringer, H., Chon, G., & Collins, C. A. 2014, A&A, 570, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Böhringer, H., Chon, G., & Fukugita, M. 2017, A&A, 608, A65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  7. Bond, J. R., Cole, S., Efstathiou, G., & Kaiser, N. 1991, ApJ, 379, 440 [NASA ADS] [CrossRef] [Google Scholar]
  8. Boylan-Kolchin, M., Springel, V., White, S. D. M., Jenkins, A., & Lemson, G. 2009, MNRAS, 398, 1150 [Google Scholar]
  9. Calvi, R., Castignani, G., & Dannerbauer, H. 2023, A&A, 678, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  10. Casey, C. M. 2016, ApJ, 824, 36 [CrossRef] [Google Scholar]
  11. Casey, C. M., Cooray, A., Capak, P., et al. 2015, ApJ, 808, L33 [NASA ADS] [CrossRef] [Google Scholar]
  12. Castignani, G., & Benoist, C. 2016, A&A, 595, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Castignani, G., Chiaberge, M., Celotti, A., & Norman, C. 2014a, ApJ, 792, 113 [NASA ADS] [CrossRef] [Google Scholar]
  14. Castignani, G., Chiaberge, M., Celotti, A., Norman, C., & De Zotti, G. 2014b, ApJ, 792, 114 [NASA ADS] [CrossRef] [Google Scholar]
  15. Castro, T., Borgani, S., Dolag, K., et al. 2021, MNRAS, 500, 2316 [Google Scholar]
  16. Chiang, Y.-K., Overzier, R., & Gebhardt, K. 2013, ApJ, 779, 127 [Google Scholar]
  17. Chiang, Y.-K., Overzier, R., & Gebhardt, K. 2014, ApJ, 782, L3 [NASA ADS] [CrossRef] [Google Scholar]
  18. Chiang, Y.-K., Overzier, R. A., Gebhardt, K., & Henriques, B. 2017, ApJ, 844, L23 [Google Scholar]
  19. Chon, G., Böhringer, H., & Zaroubi, S. 2015, A&A, 575, L14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  20. Clements, D. L., Braglia, F. G., Hyde, A. K., et al. 2014, MNRAS, 439, 1193 [NASA ADS] [CrossRef] [Google Scholar]
  21. Contini, E., De Lucia, G., Hatch, N., Borgani, S., & Kang, X. 2016, MNRAS, 456, 1924 [NASA ADS] [CrossRef] [Google Scholar]
  22. Cucciati, O., Zamorani, G., Lemaux, B. C., et al. 2014, A&A, 570, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. Cucciati, O., Lemaux, B. C., Zamorani, G., et al. 2018, A&A, 619, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Daddi, E., Dannerbauer, H., Stern, D., et al. 2009, ApJ, 694, 1517 [Google Scholar]
  25. Dannerbauer, H., Kurk, J. D., De Breuck, C., et al. 2014, A&A, 570, A55 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. De Lucia, G., Tornatore, L., Frenk, C. S., et al. 2014, MNRAS, 445, 970 [Google Scholar]
  27. De Lucia, G., Xie, L., Fontanot, F., & Hirschmann, M. 2020, MNRAS, 498, 3215 [NASA ADS] [CrossRef] [Google Scholar]
  28. Despali, G., Giocoli, C., Angulo, R. E., et al. 2016, MNRAS, 456, 2486 [NASA ADS] [CrossRef] [Google Scholar]
  29. Djorgovski, S. G., Stern, D., Mahabal, A. A., & Brunner, R. 2003, ApJ, 596, 67 [NASA ADS] [CrossRef] [Google Scholar]
  30. Euclid Collaboration (Scaramella, R., et al.) 2022, A&A, 662, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  31. Euclid Collaboration (Cropper, M., et al.) 2024a, A&A, in press, https://doi.org/10.1051/0004-6361/202450996 [Google Scholar]
  32. Euclid Collaboration (Jahnke, K., et al.) 2024b, A&A, in press, https://doi.org/10.1051/0004-6361/202450786 [Google Scholar]
  33. Euclid Collaboration (Hormuth, F., et al.) 2024c, A&A, in press, https://doi.org/10.1051/0004-6361/202450345 [Google Scholar]
  34. Euclid Collaboration (Mellier, Y., et al.) 2024d, A&A, in press, https://doi.org/10.1051/0004-6361/202450810 [Google Scholar]
  35. Evrard, A. E., MacFarland, T. J., Couchman, H. M. P., et al. 2002, ApJ, 573, 7 [NASA ADS] [CrossRef] [Google Scholar]
  36. Flores-Cacho, I., Pierini, D., Soucail, G., et al. 2016, A&A, 585, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  37. Fontanot, F., De Lucia, G., Hirschmann, M., et al. 2017, MNRAS, 464, 3812 [NASA ADS] [CrossRef] [Google Scholar]
  38. Fontanot, F., De Lucia, G., Hirschmann, M., et al. 2020, MNRAS, 496, 3943 [CrossRef] [Google Scholar]
  39. Fontanot, F., Calabrò, A., Talia, M., et al. 2021, MNRAS, 504, 4481 [CrossRef] [Google Scholar]
  40. Francis, P. J., Woodgate, B. E., Warren, S. J., et al. 1996, ApJ, 457, 490 [NASA ADS] [CrossRef] [Google Scholar]
  41. Fu, H., Shankar, F., Ayromlou, M., et al. 2024, MNRAS, 532, 177 [NASA ADS] [CrossRef] [Google Scholar]
  42. Galametz, A., Stern, D., De Breuck, C., et al. 2012, ApJ, 749, 169 [NASA ADS] [CrossRef] [Google Scholar]
  43. García-Vergara, C., Hennawi, J. F., Barrientos, L. F., & Rix, H.-W. 2017, ApJ, 848, 7 [CrossRef] [Google Scholar]
  44. García-Vergara, C., Hennawi, J. F., Barrientos, L. F., & Arrigoni Battaia, F. 2019, ApJ, 886, 79 [CrossRef] [Google Scholar]
  45. García-Vergara, C., Rybak, M., Hodge, J., et al. 2022, ApJ, 927, 65 [CrossRef] [Google Scholar]
  46. Girelli, G., Pozzetti, L., Bolzonella, M., et al. 2020, A&A, 634, A135 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Gómez-Guijarro, C., Riechers, D. A., Pavesi, R., et al. 2019, ApJ, 872, 117 [CrossRef] [Google Scholar]
  48. Greenslade, J., Clements, D. L., Cheng, T., et al. 2018, MNRAS, 476, 3336 [NASA ADS] [CrossRef] [Google Scholar]
  49. Harikane, Y., Ouchi, M., Ono, Y., et al. 2019, ApJ, 883, 142 [Google Scholar]
  50. Hatch, N. A., Kurk, J. D., Pentericci, L., et al. 2011a, MNRAS, 415, 2993 [NASA ADS] [CrossRef] [Google Scholar]
  51. Hatch, N. A., De Breuck, C., Galametz, A., et al. 2011b, MNRAS, 410, 1537 [NASA ADS] [Google Scholar]
  52. Hennawi, J. F., Prochaska, J. X., Cantalupo, S., & Arrigoni-Battaia, F. 2015, Science, 348, 779 [Google Scholar]
  53. Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]
  54. Higuchi, R., Ouchi, M., Ono, Y., et al. 2019, ApJ, 879, 28 [Google Scholar]
  55. Hirschmann, M., De Lucia, G., & Fontanot, F. 2016, MNRAS, 461, 1760 [Google Scholar]
  56. Ito, K., Tanaka, M., Valentino, F., et al. 2023, ApJ, 945, L9 [NASA ADS] [CrossRef] [Google Scholar]
  57. Jenkins, A., Frenk, C. S., White, S. D. M., et al. 2001, MNRAS, 321, 372 [Google Scholar]
  58. Jin, S., Dannerbauer, H., Emonts, B., et al. 2021, A&A, 652, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  59. Koyama, Y., Kodama, T., Tadaki, K.-I., et al. 2013, MNRAS, 428, 1551 [NASA ADS] [CrossRef] [Google Scholar]
  60. Kravtsov, A. V., Vikhlinin, A. A., & Meshcheryakov, A. V. 2018, Astron. Lett., 44, 8 [Google Scholar]
  61. Kuiper, E., Hatch, N. A., Venemans, B. P., et al. 2011, MNRAS, 417, 1088 [NASA ADS] [CrossRef] [Google Scholar]
  62. Kurk, J. D., Röttgering, H. J. A., Pentericci, L., et al. 2000, A&A, 358, L1 [NASA ADS] [Google Scholar]
  63. Kurk, J. D., Pentericci, L., Röttgering, H. J. A., & Miley, G. K. 2004, A&A, 428, 793 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  64. Le Fevre, O., Deltorn, J. M., Crampton, D., & Dickinson, M. 1996, ApJ, 471, L11 [NASA ADS] [CrossRef] [Google Scholar]
  65. Lemaux, B. C., Le Floc’h, E., Le Fèvre, O., et al. 2014, A&A, 572, A90 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  66. McConachie, I., Wilson, G., Forrest, B., et al. 2022, ApJ, 926, 37 [NASA ADS] [CrossRef] [Google Scholar]
  67. Miley, G., & De Breuck, C. 2008, A&ARv, 15, 67 [Google Scholar]
  68. Miller, T. B., Chapman, S. C., Aravena, M., et al. 2018, Nature, 556, 469 [CrossRef] [Google Scholar]
  69. Muldrew, S. I., Hatch, N. A., & Cooke, E. A. 2015, MNRAS, 452, 2528 [NASA ADS] [CrossRef] [Google Scholar]
  70. Oteo, I., Ivison, R. J., Dunne, L., et al. 2018, ApJ, 856, 72 [Google Scholar]
  71. Ouchi, M., Shimasaku, K., Akiyama, M., et al. 2005, ApJ, 620, L1 [NASA ADS] [CrossRef] [Google Scholar]
  72. Overzier, R. A. 2016, A&ARv, 24, 14 [Google Scholar]
  73. Pentericci, L., Kurk, J. D., Röttgering, H. J. A., et al. 2000, A&A, 361, L25 [NASA ADS] [Google Scholar]
  74. Planck Collaboration XXVII. 2015, A&A, 582, A30 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Press, W. H., & Schechter, P. 1974, ApJ, 187, 425 [Google Scholar]
  77. Rykoff, E. S., Rozo, E., Busha, M. T., et al. 2014, ApJ, 785, 104 [Google Scholar]
  78. Sartoris, B., Biviano, A., Fedeli, C., et al. 2016, MNRAS, 459, 1764 [Google Scholar]
  79. Schreiber, C., Elbaz, D., Pannella, M., et al. 2017, A&A, 602, A96 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  80. Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119 [Google Scholar]
  81. Shimasaku, K., Ouchi, M., Okamura, S., et al. 2003, ApJ, 586, L111 [NASA ADS] [CrossRef] [Google Scholar]
  82. Springel, V., White, S. D. M., Jenkins, A., et al. 2005, Nature, 435, 629 [Google Scholar]
  83. Steidel, C. C., Adelberger, K. L., Dickinson, M., et al. 1998, ApJ, 492, 428 [Google Scholar]
  84. Strazzullo, V., Daddi, E., Gobat, R., et al. 2015, A&A, 576, L6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  85. Tinker, J. L., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]
  86. Toshikawa, J., Kashikawa, N., Ota, K., et al. 2012, ApJ, 750, 137 [Google Scholar]
  87. Tozzi, P., Gilli, R., Liu, A., et al. 2022, A&A, 667, A134 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  88. Venemans, B. P., Kurk, J. D., Miley, G. K., et al. 2002, ApJ, 569, L11 [NASA ADS] [CrossRef] [Google Scholar]
  89. Venemans, B. P., Röttgering, H. J. A., Overzier, R. A., et al. 2004, A&A, 424, L17 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  90. Venemans, B. P., McMahon, R. G., Warren, S. J., et al. 2007, MNRAS, 376, L76 [NASA ADS] [CrossRef] [Google Scholar]
  91. Vieira, J. D., Crawford, T. M., Switzer, E. R., et al. 2010, ApJ, 719, 763 [NASA ADS] [CrossRef] [Google Scholar]
  92. Wang, G. C. P., Hill, R., Chapman, S. C., et al. 2021, MNRAS, 508, 3754 [NASA ADS] [CrossRef] [Google Scholar]
  93. Wylezalek, D., Vernet, J., De Breuck, C., et al. 2013, MNRAS, 428, 3206 [CrossRef] [Google Scholar]
  94. Xie, L., De Lucia, G., Hirschmann, M., & Fontanot, F. 2020, MNRAS, 498, 4327 [NASA ADS] [CrossRef] [Google Scholar]
  95. Yamada, T., Nakamura, Y., Matsuda, Y., et al. 2012, AJ, 143, 79 [Google Scholar]
  96. Zoldan, A., De Lucia, G., Xie, L., Fontanot, F., & Hirschmann, M. 2017, MNRAS, 465, 2236 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Morphological classification of the proto-cluster profile

In Section 5, we have explored the variation of proto-cluster profiles in the simulation lightcones. Here, we present figures for the mean profiles for each of the five categories in Figs. A.1 and A.2. The figures also show the standard deviations of the profiles in each category. Fig. A.3 shows examples of proto-clusters in categories 3 to 5. All these proto-clusters show large substructures or even multi-modality.

thumbnail Fig. A.1.

Mean profiles of the proto-clusters for two of the five structural categories defined in the text, where the categories are labelled in the panels. The shaded regions in the histograms indicate the rms variance of the density.

thumbnail Fig. A.2.

Mean profiles of the proto-clusters for the remaining three of the five structural categories defined in the text, where the categories are labelled in the panels.

thumbnail Fig. A.3.

Examples of three proto-clusters for the categories 3 to 5. The category and the redshift of the proto-clusters is given in the legend of the Figure.

All Tables

Table 1.

Cosmological parameters used in the different models assuming a flat metric.

Table 2.

Parameters for the lightcones and the underlying cosmological simulations.

Table 3.

Parameters describing the evolution of the proto-cluster radius with redshift (see Eq. (2)).

Table 4.

Completeness and contamination of the members inside the estimated proto-cluster radius, rpc and 1.2 rpc, for different redshift shells for the MAMBO and GAEA samples

Table 5.

Fit parameters for the proto-cluster profiles in three and two dimensions for the formula in Eq. (3).

Table 6.

Definition of the five different categories of proto-cluster density profiles.

Table 7.

Statistics concerning the proto-cluster abundance and sky coverage as a function of redshift.

Table 8.

Fit parameters of the polynomial expression for the normalisation, Ng0(z) = a + bz + cz2 + dz3, of the mass-richness relation as a function of redshift for the three proto-cluster samples. We also show the slope, α, averaged over the redshift intervals.

All Figures

thumbnail Fig. 1.

Evolution of the overdensity ratio, Rov − DM(z), with redshift for an object that collapses at z = 0. The results for three cosmological models are shown: the reference model, Planck cosmology, and Millennium cosmology. The first two models can hardly be distinguished in the plot. The vertical line indicates the redshift of the turn-around point.

In the text
thumbnail Fig. 2.

Evolution of the proto-cluster radius in comoving and physical units scaled to r200 at z = 0. Results for three cosmological models are shown by red lines (reference model), blue lines (Millennium cosmology) and dashed black lines (Planck cosmology). The over-plotted black dots show the parameterised approximation of rcom (see Eq. (2)) for the reference cosmology and rphys for the Millennium cosmology.

In the text
thumbnail Fig. 3.

Examples of two proto-clusters at redshifts 1.666 and 2.8335 with 148 and 22 members, respectively. The proto-cluster members are shown as red-filled circles, while non-members inside the proto-cluster radius are shown as blue circles. Other non-members outside the proto-cluster radius in three dimensions are shown as small black dots, and members of other proto-clusters are shown as full green circles. The large black circle indicates the estimated proto-cluster radius, whose size is indicated in the top left corner of the plot.

In the text
thumbnail Fig. 4.

Three-dimensional radial distribution of the member galaxies in all MAMBO (top), GAEA-H (middle), and GAEA-F (bottom) proto-clusters (z = 1.5 − 4). The radius is in units of the estimated proto-cluster radius, rpc. The red lines show the numbers and densities of the proto-cluster members, and the blue lines those of the non-member galaxies. The solid lines show the galaxy number in the shells (left Y-axis), while the dotted lines show the galaxy density in the shells (right Y-axis).

In the text
thumbnail Fig. 5.

Mean three-dimensional radial density profile of proto-clusters. The black solid line and blue points show the mean profile for all proto-clusters in the GAEA F lightcone in the redshift range z = 1.5 − 4, while the green curve shows that for proto-clusters with z = 1.5 − 2. The blue dashed curve shows the model fit. The red open symbols show the density profile for the clusters in the MAMBO lightcone (z = 1.5 − 4) for comparison. The galaxy density is scaled to the volume of the proto-cluster, volumerPC = (4/3)πrPC3.

In the text
thumbnail Fig. 6.

Photo-z error as a function of redshift for different completeness limits. The parameter photo-z error describes the half-width of the redshift window necessary to contain 50%, 60%, 70%, 80%, and 90% of all galaxies at a certain redshift. The dashed line shows the official requirement for the photometric accuracy.

In the text
thumbnail Fig. 7.

Galaxy counts in proto-clusters (red) in comparison to the background (blue) as a function of the total galaxy number in the proto-clusters. The top row was taken in the redshift range z = 1.5 − 2 with an aperture of 0.5 rpc and the bottom row in the range z = 3 − 4 inside an aperture of rpc. For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

In the text
thumbnail Fig. 8.

Galaxy densities in proto-clusters (red) in comparison to the background (blue) in the redshift range z = 1.5 − 2 with an aperture of 0.5 rpc. For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

In the text
thumbnail Fig. 9.

Mean projected galaxy densities in proto-clusters (solid lines) as a function of aperture radius in three redshift intervals. The dashed curves give the galaxy densities in the background. The results were derived for Δz (50%).

In the text
thumbnail Fig. 10.

Ratio between the line-of-sight interval given by the photometric redshift uncertainty and the line of sight covered by proto-clusters, rz (50%))/rpc). The ratios for all MAMBO proto-clusters as a function of redshift are shown.

In the text
thumbnail Fig. 11.

Relative rms deviations of the projected background galaxy density fluctuations (solid lines) as a function of aperture radius for five different redshift intervals. The dashed lines show the estimated Poisson errors for comparison.

In the text
thumbnail Fig. 12.

Detection significance of proto-clusters from the MAMBO simulation in the redshift interval z = 1.5 − 2. The panels show results for aperture radii of 0.5 (top) and 1 rpc (bottom). For the galaxy selection a redshift interval corresponding to Δz (50%) was used.

In the text
thumbnail Fig. 13.

Detection significance of proto-clusters as a function of aperture radius (top) and as a function of completeness limit (bottom) for three different redshift intervals. Top: The solid curves show the mean significance, while the dashed curves show the maxima. A redshift interval of Δz (50%) was used. Bottom: The completeness limit, Δz (X%), covers the range 30–90%. The solid lines show the results for an aperture radius of 0.5 rpc and dashed lines for 1 rpc.

In the text
thumbnail Fig. 14.

Correlation of the galaxy background density for proto-clusters in an aperture radius of 5 arcmin and the background taken from an annulus of 9 to 13 arcmin radius from the cluster centre around each proto-cluster. The colours mark different redshift intervals from z = 1.5 to 4.0 with a width of Δz = 0.5.

In the text
thumbnail Fig. 15.

Relative (rms) variations of the projected background galaxy densities (solid lines) with a local assessment as a function of aperture radius for five different redshift intervals.

In the text
thumbnail Fig. 16.

Detection significance for proto-clusters for the redshift range z = 1.5 − 2 (top) and z = 3 − 4 (bottom). For the galaxy selection a redshift interval corresponding to Δz (50%) was used. Top: A detection aperture of 5 arcmin was applied with a local background assessment. Bottom: An aperture of 7 arcmin was used.

In the text
thumbnail Fig. 17.

Mean detection significance of proto-clusters as a function of aperture radius (top) and as a function of the completeness limit (bottom). The set-up is similar to Fig. 13, but with a local background and an aperture radius in arcminutes. Top: A redshift interval of Δz (50%) was used. Bottom: Values of the completeness limit, Δz (X%), in the range of 30–90% were used. The solid lines show the results for an aperture radius of 0.5 rpc and the dashed lines show the results for 1 rpc.

In the text
thumbnail Fig. 18.

Mass-richness relation for GAEA H for the redshift range z = 1.5 − 2 (top) and z = 3 − 3.25 (bottom). The small black dots show all proto-clusters, and the blue dots are the ones above the cut, which is shown as a solid black line. The linear regression fit to the relation is shown by a dashed line. M200 is in units of M.

In the text
thumbnail Fig. 19.

Top: Normalisation, Ng0, for the mass-richness relation as a function of redshift. The lines show the third-order polynomial approximation. The normalisation is given for log10M = 14, with M in units of solar mass. Bottom: Slope parameter, α, of the mass-richness relation as a function of redshift. The dotted lines show fits of linear relations. The colours have the same meaning as above.

In the text
thumbnail Fig. 20.

Relative rms deviation of the galaxy number counts as a function of the mean number of galaxies for the proto-clusters in each mass-redshift bin (data points). We also show the expectation for the scatter if it would be governed by Poisson statistics, which is shown by the solid line.

In the text
thumbnail Fig. 21.

Comparison of the distribution of the galaxy membership counts (richness) for given mass and the prediction for Poissionian scatter (black curves). The mean masses in the interval are 1014M (red curve) and 4.5 × 1014M (blue curve), respectively. The curves show cumulative histograms normalised to unity. The data are taken from the GAEA F lightcone in the redshift interval z = 1.5 − 2.

In the text
thumbnail Fig. 22.

Proto-clusters and their galaxies in a redshift slice around z = 2 with a redshift range according to the photometric redshift accuracy limits for 50% completeness and Mpc ≥ 1014M. The dense core regions inside r ≤ 0.5 rpc are shown by black circles and red dots for the galaxies, while the outer regions are shown in grey.

In the text
thumbnail Fig. 23.

Number of MAMBO proto-clusters and the sky-filling factor in a Δz (50%) redshift interval as a function of redshift and proto-cluster mass limit. The numbers are only shown for the sample with Mpc ≥ 1014M as a dotted line. Solid lines show the fraction of the sky covered by the proto-clusters, and the dashed lines display the combined sky area of the proto-clusters. The thick lines with larger data points show the results for the core regions, while the thin lines show those for the region out to rpc. For the core regions of the Mpc ≥ 3 × 1014M proto-clusters, the dashed and solid lines overlap.

In the text
thumbnail Fig. 24.

Proto-clusters and their galaxies in a redshift slice around z = 2 as in Fig. 22 but with a higher mass limit of the proto-clusters of Mpc = 3 × 1014M. Symbols and colours have the same meaning as in Fig. 22.

In the text
thumbnail Fig. 25.

Proto-clusters with Mpc ≥ 1014M and their galaxies in a redshift slice around z = 3.5 with a redshift range according to the photometric redshift accuracy limits for 50% completeness. Symbols and colours have the same meaning as in Fig. 22.

In the text
thumbnail Fig. A.1.

Mean profiles of the proto-clusters for two of the five structural categories defined in the text, where the categories are labelled in the panels. The shaded regions in the histograms indicate the rms variance of the density.

In the text
thumbnail Fig. A.2.

Mean profiles of the proto-clusters for the remaining three of the five structural categories defined in the text, where the categories are labelled in the panels.

In the text
thumbnail Fig. A.3.

Examples of three proto-clusters for the categories 3 to 5. The category and the redshift of the proto-clusters is given in the legend of the Figure.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.