Open Access
Issue
A&A
Volume 678, October 2023
Article Number A15
Number of page(s) 22
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202346200
Published online 28 September 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

Understanding when and how present-day galaxy clusters formed at high redshifts has been the main science driver for the extensive search for protoclusters of galaxies in the distant universe, especially at optical and near-infrared wavelengths (see Overzier 2016 and Alberts & Noble 2022 for reviews; the latter focuses on the infrared). The most massive galaxies in the present-day universe lie in rich clusters. Their old, coeval, metal-rich populations suggest that they may have formed as spectacular starbursts at high redshifts. Powerful high-redshift radio galaxies (see the review by Miley & De Breuck 2008) are considered to be the most promising signposts of massive clusters in formation.

In the last decade, (sub)millimeter surveys have revolutionized our understanding of the formation and evolution of galaxies, by revealing an unexpected population of high-redshift, dust-obscured galaxies that are forming stars at a tremendous rate. Submillimeter galaxies (SMGs; see the review by Casey et al. 2014), first discovered by Smail et al. (1997), have intense star formation rates (SFRs) of a few hundred to several thousand solar masses per year. These dusty starbursts are massive (e.g., Greve et al. 2005), most probably the precursors of present-day ellipticals (e.g., Ivison et al. 2013), and should be excellent tracers of matter density peaks and thus protoclusters. In the past few years, several groups have discovered overdensities of dusty star-forming galaxies at redshifts beyond z ≃ 2 (e.g., Clements et al. 2014, 2016; Dannerbauer et al. 2014; Casey et al. 2015; Flores-Cacho et al. 2016; Hung et al. 2016; Wang et al. 2016).

Some of these protoclusters, such as the well-known Spiderweb Galaxy protocluster around the powerful radio galaxy MRC1138+262 at z = 2.16, are associated with galaxy overdensities not only of SMGs but also of low- to medium-mass galaxy populations, such as Lyman α emitters (LAEs) or Hα emitters. These studies are complemented by discoveries of a handful of single SMGs physically associated with galaxy overdensities traced at other wavelengths (e.g., Caputi et al. 2021). A notable example is the companion GN20 and GN20.2a SMGs at z = 4.05, which were discovered by Daddi et al. (2009) to lie in a strong galaxy overdensity with a total mass of ∼1014 M.

At similar redshifts, Oteo et al. (2018) later discovered an extreme protocluster whose core is formed by at least ten dusty star-forming galaxies at z = 4.0, located within a projected area with a size of ∼140 kpc. Miller et al. (2018) also reported a massive overdensity of 14 gas-rich galaxies at redshift z = 4.3 within a projected region of ∼130 kpc in diameter. At even earlier epochs, there is the protocluster around the SMG AzTEC-3 at z = 5.3 in the Cosmic Evolution Survey (COSMOS) field (Capak et al. 2007), which contains at least one SMG (Riechers et al. 2010; Capak et al. 2011) as well as several LAEs (Guaita et al. 2022).

While generally similar to radio galaxies in terms of their stellar mass, SMGs may have one advantage for studying the detailed properties of high-redshift clusters: the surface density of bright SMGs is at least a factor of 10 higher than that of powerful radio galaxies (Reuland et al. 2003). Thus, if it can be proven that luminous submillimeter sources do indeed lie in regions of galaxy overdensities, there is the potential that, in the future, they could be used to substantially increase the number of known high-redshift clusters and protoclusters and thereby facilitate a detailed and statistical study of such large-scale structures.

However, we note that there is a controversy on how reliably SMGs can be used as tracers of large-scale structures (Chapman et al. 2009; Miller et al. 2015; Casey 2016; Álvarez Crespo et al. 2021; Gao et al. 2022). By using large-volume semi-analytic simulations from Klypin et al. (2011) and applying abundance matching and analytical prescriptions for galaxies, Miller et al. (2015) found that most of the strongest overdensities do not host SMGs. The recent review by Alberts & Noble (2022) on galaxy protoclusters, focusing on the infrared, discusses the pro and cons and concludes that SMGs could be signposts for overdensities.

To the best of our knowledge, there are only a few systematic studies that have investigated whether SMGs can be used as reliable tracers of galaxy overdensities (e.g., Álvarez Crespo et al. 2021; Gao et al. 2022). In this work we present a pilot statistical study aimed at exploring the potential of SMGs to trace galaxy protoclusters. To this aim, we use the Poisson probability method (PPM; Castignani et al. 2014a) to search for and characterize megaparsec-scale overdensities around a sample of distant z ∼ 1.2 − 5.3 SMGs, which are possibly associated with large-scale protoclusters. The PPM was primarily introduced and applied to search for high-redshift clusters and groups around z ∼ 1 − 3 radio galaxies, in particular within the COSMOS survey (Castignani et al. 2014b, 2019), and in this work we apply the method using distant SMGs instead as positional priors for our search for protoclusters. The major goal of our work is to shed light on the strongly debated question of whether SMGs are good tracers of distant protoclusters (see, e.g., Miller et al. 2015; Casey et al. 2015), via testing the efficiency of the PPM as a validation and characterization of protocluster candidates around well-known and spectroscopically confirmed SMGs.

The paper is organized as follows. In Sect. 2 we describe the data used in this work: the SMG sample and the photometric redshift catalogs. In Sect. 3 we describe the PPM that we use to search for protoclusters around the SMGs. In Sect. 4 we present and discuss our results. In Sect. 5 we draw our conclusions. Throughout this work we adopt a flat Λ cold dark matter cosmology with matter density Ωm = 0.30, dark energy density ΩΛ = 0.70, and Hubble constant h = H0/100 km s−1 Mpc−1 = 0.70.

2. Data

In this section we present the main data from the literature that we use in our study. We first describe our sample of SMGs, and then the different photometric redshift catalogs that we use to search for galaxy overdensities around the SMGs themselves. We emphasize that with the PPM we will be able to search for galaxy overdensities at megaparsec scales, which is the typical size of protocluster cores, whereas the overall extent of galaxy protoclusters could be up to 10–20 Mpc (see, e.g., Muldrew et al. 2015; Casey et al. 2015).

2.1. The submillimeter galaxies

2.1.1. The selection of the SMG sample

We searched in the literature for SMGs in the GOODS-N (Great Observatories Origins Deep Survey) field1, with spectroscopic redshifts obtained via CO lines, which is the most reliable and accurate methods for determining the redshift of these dusty starbursts. This search resulted in 17 SMGs. However, as we want to search for galaxy overdensities with the PPM, which makes uses of photometric redshifts of galaxies in the field of the SMGs, we removed five sources from the sample of 17 as they fall outside the range of RA and Dec covered by the photo-z catalogs that are currently available for the GOODS-N field. Our final SMG sample thus consists of 12 gas-rich, star-forming, and bright far-infrared (FIR) sources, as further discussed in the following. In Table 1 we report the main properties of the 12 SMGs of this work, including their redshift, stellar and molecular gas masses, CO and FIR luminosities, SFR, and depletion time τdep = MH2/SFR. We note that two SMGs, namely, ID.03 and ID.19, have not been selected through submillimeter surveys but they were found directly through a CO blind survey (Decarli et al. 2014).

Table 1.

Properties of the GOODS-N SMGs.

2.1.2. Molecular gas and star formation

Molecular gas masses of the SMGs are high, ranging between (1.3 − 15)×1010 M. They come from flux measurements from the literature, and were estimated as follows. For all 12 SMGs we first searched in the literature for CO(J → J–1) line flux measurements, where J is a positive integer denoting the total angular momentum. When multiple lines were available for a given SMG, we gave preference to the lowest-J transition available. This is because lower-J CO transitions require lower gas densities and temperatures (e.g., Liu et al. 2021) and are thus preferred when estimating the total cold molecular gas masses. We then converted CO(J → J − 1) fluxes into velocity integrated luminosities L CO(JJ1) $ L^\prime_{{\rm CO}(J\rightarrow J-1)} $, via Eq. (3) of Solomon & Vanden Bout (2005), and then into molecular gas masses. In order to have homogeneous gas mass estimates we used a CO-to-H2 conversion factor αCO = 0.8 M (K km s−1 pc2)−1, typical of starbursts, and excitation ratios r J1 = L CO(JJ1) / L CO(10) =0.9 $ r_{\rm J1}=L^\prime_{{\rm CO}({J}\rightarrow {J}-1)}/L^\prime_{{\rm CO}(1\rightarrow 0)}=0.9 $, 0.6, and 0.32 for J = 2, 3, and 4, respectively (Birkin et al. 2021).

It is worth noticing that the SMGs in our sample not only have high molecular gas masses but they also have high FIR luminosities, with a median value of log(LFIR/L)≃12.6, typical of ultra-luminous infrared galaxies (ULIRGs; see Table 1). The sources in our sample are thus bright SMGs, similar to those originally discovered with bolometer cameras mounted on single-dish telescopes (e.g., Smail et al. 1997; Hughes et al. 1998; Barger et al. 1998).

The FIR luminosities were then converted into SFRs using the relation SFR/(M/yr) = 9.09 × 10−11LFIR/L by Kennicutt (1998). Hence, our SMGs are characterized by strong ongoing star formation activity, with a median of SFR ≃ 340 M yr−1. In particular, both SMGs of the GN20–GN20.2a pair have the highest SFRs, exceeding 1000 M yr−1, and FIR luminosities largely exceeding the value of log(LFIR/L) = 13, which is typical of hyper-luminous infrared galaxies (HyLIRGs; see Sect. 2.1.4). These high levels of star formation activity in the SMGs are ultimately sustained by the large molecular gas reservoirs, which are consumed quite efficiently, which is typical of starbursts. Our SMGs have indeed short depletion timescales τdep = MH2/SFR ≃ 0.1 Gyr (median).

2.1.3. Stellar masses

We found stellar masses for only seven SMGs in the literature (Hainline et al. 2011; Tan et al. 2014), in the range log(M/M)≃10.5 − 11.2 (see Table 1). Many SMGs are indeed undetected in the optical wavelengths, as for example it is the case for GN10 and HDF850.1, which are completely dust-obscured (Daddi et al. 2009; Walter et al. 2012; Calvi et al. 2021), so that it is difficult to estimate their stellar masses. Altogether, the SMGs in our sample have high stellar masses and star formation, and are thus likely the progenitors of local ellipticals (Michałowski et al. 2010; Fu et al. 2013).

2.1.4. The SMGs and their environments: General considerations

Previous studies found evidence of protoclusters for six SMGs out of the 12 in our sample, as discussed in the following. SMM J123711, SMM J12618, and SMM J123712 were confirmed by Bothwell et al. (2013) as bright submillimeter emitters, with similar properties in terms of gas masses MH2 ≃ (3 − 5 − 5.5)×1010 M and FIR luminosities, LFIR/L = 12.4 − 12.9, typical of ULIRGs. They also have the same redshifts z = 1.995 − 1.996 and relatively small pairwise projected separations of ∼3.3 Mpc. At the location of these SMGs, Chapman et al. (2009) reported the existence of a z = 1.99 protocluster, discovered as a spectroscopic overdensity of ∼100 SMGs across 800 arcmin2. Therefore, we believe all three SMGs in our sample belong to the same overdense large-scale structure.

GN20 and GN20.2a were discovered as bright distant SMGs by Daddi et al. (2009). They have similarly high gas masses MH2 = (0.9 − 1.5)×1011 M and FIR luminosities, LFIR/L = 13.20 − 13.46, typical of HyLIRGs (e.g., Rowan-Robinson 2000; Neri et al. 2020), which are often associated with infall/outflows, mergers, and mutual interactions. They are thus possibly interacting as indeed they are separated by just 24 arcsec (i.e., 167 kpc) in projection, while being at the same redshift z = 4.05. Furthermore, Daddi et al. (2009) found a strong overdensity of B-band dropouts and IRAC (Infrared Array Camera on the Spitzer Space Telescope) -selected massive z > 3.5 galaxies, appearing to be centered on the two SMGs GN20 and GN20.2a, which suggests the presence of a z = 4.05 protocluster.

Similarly, the z = 5.2 SMG HDF850.1 in our sample is embedded in an overdensity of galaxies with spectroscopic redshifts that agree well with that of HDF850.1 (Walter et al. 2012). Calvi et al. (2021) showed none of their 23 spectroscopically confirmed members are SMGs. They are instead in majority LAEs or Lyman break galaxies (LBGs).

To the best of our knowledge, these are the only known (candidate) protoclusters around SMGs in the GOOD-N field. The remaining six SMGs in our sample have no unambiguous environmental analysis in the literature and have pairwise large separation both in redshift and in projection. In this work we investigate the large-scale environments of all 12 SMGs using the PPM, considering each of them separately. However, given the considerations outlined above, we then discuss the detected overdensities associated with the SMGs GN20 and GN20.2a altogether, and similarly for the SMGs SMM J123711, SMM J12618, and SMM J123712.

In Table 2 we provide a summary of the basic properties of the 12 SMGs, which are grouped in those cases where multiple SMGs are close to each other and likely belong to same overdensity, as discussed above.

Table 2.

Summary of the SMG properties: coordinates and known overdensities.

2.2. Photometric redshift catalogs

In this section we give a brief overview of the three photometric redshift catalogs we used to search for galaxy protoclusters around the SMGs. These catalogs were built and released by Arrabal Haro et al. (2018), Barro et al. (2019), and Liu et al. (2018), hereafter denoted as AH18, B19, and L18, respectively, to which we refer for a complete complete discussion of the sample built-up and detailed properties. As outlined below, the three catalogs contain sources that were selected with different criteria and techniques, as well as deep multiwavelength photometry from the infrared to the ultra-violet. They thus contain complementary information about the high-z galaxy population in the GOOD-N field and are therefore optimal to search for and characterize distant protoclusters limiting the possible biases that could arise by using instead a single multiband photometric catalog.

2.2.1. Barro et al. (2019)

This is the largest photo-z catalog we used. The sources were selected by the authors from the CANDELS (Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey) survey (Grogin et al. 2011; Koekemoer et al. 2011) covering the GOODS-N field.

The selection was done using the WFC3/F160W filters (H-band) down to a magnitude limit of H ∼ 28 mag. The selection also combined data from the deep and wide-field observations of the CANDELS program, from the ultraviolet to the FIR, over a total area of 171 arcmin2. The released catalog contains 35 445 sources, of which 22 670 have photometric redshifts in the range 1.0 < z < 5.5. The highest and nearly uniform accuracy σ ( δ z 1 + z spec ) 0.01 $ \sigma(\frac{\delta z}{1+z_{\mathrm{spec}}})\lesssim0.01 $ of the photometric redshifts is found in the redshift range z ∼ 0 − 2.5. Altogether, the median redshift of the sources in the catalog is z = 1 . 43 0.85 + 1.22 $ z=1.43^{+1.22}_{-0.85} $, where the uncertainties correspond to the 68% confidence interval around the median. The B19 catalog thus allows us to potentially search for overdensities around all SMGs in our sample.

2.2.2. Liu et al. (2018)

This catalog includes photometry for 3306 super-deblended Herschel sources in the GOODS-N field, up to z ∼ 6. The sample was built by the authors using a technique that allowed them to process low-resolution FIR images, thus limiting source confusion. Specifically, they used positional priors based on source positions from higher-resolution infrared and radio observations with Spitzer/MIPS (Multiband Imaging Photometer for Spitzer) at 24 μm and VLA (Karl Guthe Jansky Very Large Array) at 1.4 GHz. These positional priors allowed the authors to properly fit the FIR/submillimeter data, and thus build an unblended source catalog with reliable photometry in both FIR and (sub)millimeter bands.

2.2.3. Arrabal Haro et al. (2018)

This photometric catalog consists of 1558 high-z galaxies selected in the ultraviolet (528 LAEs and 1030 LBGs), which were selected with criteria based on color excesses and spectral energy distribution fits. In more detail, the authors carried out a systematic search for LAEs and LBGs between z ∼ 3.35 − 6.8 by using 25 medium-band filters (full width at half maximum ∼17 nm; from 500 to 941 nm) of the Survey for High-z Absorption Red and Dead Sources (SHARDS; Pérez-González et al. 2013). This survey was conducted with the instrument OSIRIS (Optical System for Imaging and low-Intermediate-Resolution Integrated Spectroscopy; Cepa 2010) at the GranTeCan telescope, covering an area of ∼130 arcmin2 of the GOODS-N field.

Figure 1 displays the redshift distribution of galaxies in the three photometric catalogs. It shows the complementarity in terms of redshift range of the AH18 catalog with respect to the other two. While both B19 and L18 samples have similar redshift distributions peaking around z ∼ 1, AH18 sources populate the highest redshifts.

thumbnail Fig. 1.

Photometric redshift distributions of the B19 (Barro et al. 2019), L18 (Liu et al. 2018), and AH18 (Arrabal Haro et al. 2018) catalogs in the GOODS-N field.

3. The Poisson probability method

The PPM searches for high-z megaparsec-scale overdensities of galaxies around a given target. It is based on a theory defined on the ensemble of the photometric redshift realizations of the galaxies in the field. Through the use of a solid positional prior and an accurate photometric redshift sampling, the PPM partially overcomes the limitations deriving from low number-count statistics and shot-noise fluctuations, which are particularly relevant in the high-z universe such as in the case of protoclusters. More specifically, the PPM method uses photometric redshifts of galaxies to search for overdensities around each target (each SMG, in this work) along the line of sight. To this aim the PPM adopts an accurate sampling of the photometric redshift information to the detriment of a less sophisticated tessellation of the projected space, which is performed in terms of concentric annuli centered around each target. We refer to our previous studies for a detailed description of the method (Castignani et al. 2014a), its wavelet based extension (wPPM; Castignani et al. 2019), and the applications (Castignani et al. 2014b, 2019). We summarize below the basic steps of the method.

First, we tessellated the projected space with a circle centered at the coordinates of the SMG and a number of consecutive adjacent annuli. The annuli and the central circle have an equal area of 3.14 arcmin2. In particular, the circle has a radius of 60 arcsec, which corresponds to physical scales in the range ≃(0.4–0.5) Mpc for the SMGs in our sample.

Second, for each region of the tessellation (the central circle and the consecutive annuli), we counted the number of sources with photometric redshifts within an interval of length Δz and centered at the centroid redshift, zcentroid. We refer to Sect. 4 for a description of the photometric redshift catalog considered. The parameters Δz and zcentroid uniformly span a grid of values that reflect the photometric redshift uncertainties and correspond to the redshift range of our interest, respectively.

Third, for each pair (zcentroid; Δz), we mapped the galaxy number counts into an overdensity measure that relies on the significance, based on Poisson statistics, that the null hypothesis of no clustering is rejected at the location of the SMG. For this, we estimated the mean field density using a rectangular control region concentric with the GOODS-N field and with a subtended area of 50.3 arcmin2. The corresponding PPM plots (Appendix A) show the overdensity patterns at different redshifts, where we further applied a Gaussian filter to eliminate high-frequency noisy patterns. We chose a conservative smoothing scale of 0.02 in redshift, on the order of or smaller than typical photometric redshift uncertainties. Overdensities along the line of sight of the SMG were searched for by fixing the redshift bin Δ z ¯ = 0.3 , $ \overline{\Delta z}=0.3, $ which roughly maximizes the overdensity significance as it corresponds to a ∼(3 − 5)σ interval for high-z sources with typical photometric redshift uncertainties of 0.02(1 + z).

Fourth, at fixed Δ z ¯ $ \overline{\Delta z} $, a peak-finding algorithm was applied to the PPM plot to search for overdensities around the SMGs. This procedure relies on the Morse theory. It allows us to estimate the significance of the overdensity, its redshift zov, and the projected radius ℛPPM, which is the maximum separation from the SMG up to which the overdensity is detected, while Nselected is the number of sources that contribute to the overdensity, within the radius ℛPPM. Any overdensity that is located at a redshift consistent with that of the SMG is associated with the galaxy, as done in Castignani et al. (2014b, 2019). Multiple overdensity associations are not excluded.

Fifth, similarly to Castignani et al. (2019), we then applied a wavelet transform to further characterize the overdensity in the projected space. In particular, we provide the projected miscentering, θov, of the overdensity peak, as found with the wavelet transform, with respect to the SMG position in the sky. We also derived a second estimate for the overdensity size, ℛw, which is the projected radius of the overdensity as found with the wavelet transform.

Hereafter we refer to the PPM cluster finder together with its wavelet extension as wPPM.

4. Results

We ran the wPPM as described in Sect. 3 on the three photometric catalogs presented in Sect. 2.2. As illustrated in the redshift distributions of Fig. 1, both L18 and B19 catalogs span the full redshift range of our SMGs and we are thus potentially able to detect protoclusters around them using these photo-z catalogs. On the other hand, only four SMGs at the highest redshifts z ∼ 4.05 − 5.30 out of 12 fall within the redshift range of the AH18 catalog, namely, GN20, GN20.2a, HDF850.1, and GN10.

Table 3 summarizes the properties of the overdensities (ov) detected with the wPPM along the line of sight of each SMG. In particular, we report the projected coordinates of the overdensity peak, the projected miscentering θov with respect to the SMG coordinates, the estimated redshift of the overdensity (zov), the overdensity detection significance, as well as estimates for the overdensity richness (Nselected) and size. In Appendix A we report the density maps and PPM plots for all the overdensities reported in Table 3.

Table 3.

Properties of the megaparsec-scale overdensities around the SMGs.

4.1. Global properties of the detected overdensities

Figure 2 displays the redshift distribution of the 12 SMGs in our sample, where we highlight with different colors the overdensities detected using the B19, L18, and AH18 catalogs, separately. As the redshift coverage of three photo-z catalogs is not the same, as well as the corresponding sample selections, we do not expect to find the same overdensities in all three catalogs. Comparing the overdensity redshifts for the three different photo-z catalogs we note that for both L18 and B19 catalogs the PPM is effective in detecting overdensities over the full redshift range spanned by the SMGs, while with the AH18 catalog we detect overdensities only around the highest-redshift SMGs.

thumbnail Fig. 2.

Redshift distribution of the SMGs. The associated overdensities found using the B19 (left), L18 (center), and AH18 (right) photo-z catalogs are highlighted with different colors. The shaded region in the right histogram indicates that there are no sources at z < 3.35 in the AH18 catalog.

As shown in Table 3, there are 11 SMGs that are associated with overdensities in at least one photo-z catalog. In particular, in five cases, overdensities are found in B19 and L18 catalogs, which cover the full redshift range of our SMGs. These L18 and B19 overdensities are all at z < 4, with the exception of GN20. Similarly, all our z > 4 SMGs appear to be in overdensities in at least two photo-z catalogs.

Overdensities are also found around the remaining SMG ID.03 with both the B19 and L18 catalogs, but at lower redshifts than that of the SMG, and so the association with the SMG is uncertain. Therefore, unless specified otherwise, in the following we do not consider these overdensities around ID.03.

In three cases we found overdensities around the SMGs only when choosing a slightly lower value, Δ z ¯ = 0.2 $ \overline{\Delta z}=0.2 $, than that used for the other overdensities (i.e., Δ z ¯ = 0.3 $ \overline{\Delta z}=0.3 $; see Sect. 3). These three overdensities are those around ID.19 (in the case of both B19 and L18 catalogs) and GN20.2a (B19), and for our analysis we consider them altogether with the others. However, for the sake of clarity we highlight them in Table 3.

In Fig. 3 we show the 3D distribution of the SMGs and photometric redshift galaxies which are used in this work to search for protoclusters. Fiducial overdensity members as found by the wPPM are highlighted (see Sects. 3 and 4.4 for further details). We grouped overdensities, and their members, around mutually close SMGs, as in Table 2 and discussed in Sect. 2.1.4. The interactive version of Fig. 3 is available electronically. In the following we further discuss our detections when considering separately the three redshift catalogs.

thumbnail Fig. 3.

Distribution of the SMGs and associated protocluster galaxies in the RA, Dec, and redshift space when the B19, L18, and AH18 photometric redshift catalogs are used for the detection of protoclusters. Small gray and colored dots always correspond to sources from the B19 catalog, which is chosen for a better visualization as it is the largest photometric redshift catalog. Red squares show the location of the SMGs in our sample. Colored clouds show the members of the overdensities as identified with the PPM. Protocluster members are color coded in yellow, green, and blue according to their increasing redshift. Interactive 3D plots for all three photometric redshift catalogs are available online.

4.1.1. Barro et al. (2019)

For B19 we have eight detected overdensities around the SMGs, including GN20, GN10, and possibly HDF850.1 at the highest redshifts z ∼ 4 − 5, even though for the last one the association of the SMG with the overdensity is uncertain and not reported in Fig. 2. As highlighted in Table 3, this is because the overdensity is detected at a higher redshift, zov = 5.87 ± 0.07, than the spectroscopic one of the SMG, zspec = 5.183. A similar uncertain association is found for ID.03 at z = 1.7844, discussed above, while the overdensity redshift is zov = 1.29 ± 0.08 and zov = 1.43 ± 0.08 when the B19 and L18 catalogs are used, respectively.

4.1.2. Liu et al. (2018)

When using the L18 catalog nine SMGs are found in overdensities up to z ∼ 4 (GN20–GN20.2a). However, we miss two overdensities in L18 around the most distant SMGs GN10 and HDF850.1, which are however detected with the other two photometric redshift catalogs (B19, AH18). At lower redshifts, we note that the catalog has completeness issues below z = 2 (see Fig. 4 in L18). Despite this, all SMGs at z < 2 appear to be in L18 overdensities, with some caveats, which we discuss below. The source SMM J123711 at z = 1.9951 is associated with a ≃3σ overdensity at zov = 2.29 ± 0.07; therefore, the two redshifts are marginally consistent with each other, |zov − zspec|/(1 + zspec) = 0.10. Similarly, in the case of ID.03, discussed just above, the association in redshift between the SMG and the overdensity is uncertain, with |zov − zspec|/(1 + zspec) = 0.14.

4.1.3. Arrabal Haro et al. (2018)

Out of the four distant SMGs that fall in the z ∼ 3.35 − 6.8 redshift range of AH18, overdensities are detected around HDF850.1 and GN10, separately. No overdensity is detected for the other high-z SMGs GN20 and GN20.2a using AH18. Nonetheless, Arrabal Haro et al. (2018) reported a redshift peak of their ultraviolet-selected galaxies at the redshift of GN20 and GN20.2a. Furthermore, as discussed in Sect. 2.1.4, Daddi et al. (2009) found a spectroscopic overdensity around the two companion SMGs at z = 4.05 and we similarly find high signal-to-noise overdensities when using the L18 and B19 catalogs (see Table 3). We double-checked the AH18 catalog and found that the SMGs GN20 and GN20.2a are located at the edge of the footprint of the photo-z catalog, which prevents us from detecting the overdensity, given the circular PPM tessellation of the projected space. A more accurate treatment of edge effects will be implemented in a forthcoming upgrade of the PPM.

4.2. Multiple associations

As listed in Table 3, for a given photo-z catalog, most SMGs have at most one associated overdensity. However, in two cases, namely the SMGs SMM J123634 and SMM J123707, there are multiple overdensities that are detected when using the L18 and B19 catalogs. These correspond to distinct overdensity peaks in the PPM plots at redshifts that are formally consistent with that of the SMG (see Appendix A, where the PPM plots are reported). Similar multiple associations were discussed in our previous studies, Castignani et al. (2014b, 2019), where we looked for overdensities around distant radio galaxies. Altogether, for a given photometric redshift catalog (B19 or L18), the multiple overdensities reported for SMM J123634 and SMM J123707, separately, have similar properties in terms of significance, size, projected coordinates, and thus miscentering θov (see the density maps in Appendix A). These aspects suggest that these multiple peaks correspond to the same overdensity that is fragmented at different redshifts by the PPM procedure. This behavior is not uncommon for distant (proto)cluster finders, especially in the regime of low number counts.

4.3. Overdensity redshift accuracy

We now compare the SMG spectroscopic redshifts with those of the overdensities detected by the wPPM using photometric redshifts of galaxies. Figure 4 (top) shows the redshifts of the overdensities, estimated with the PPM using the B18, L18, and AH18 photo-z catalogs, which are plotted against the spectroscopic redshifts of the SMGs. In the bottom panel we show the residuals. The SMG redshifts agree well with our estimates obtained with the PPM for the overdensities. We found indeed an accuracy of σ((zov − zspec)/(1 + zspec)) = 0.043 and a negligible bias ⟨zov − zspec⟩ = 0.034, on average, which is competitive with the typical photometric redshift accuracy of distant galaxies (e.g., in COSMOS, Weaver et al. 2022) and distant protoclusters (e.g., Brinch et al. 2023). The best accuracy is remarkably found for the overdensities around the most distant SMGs at z ∼ 4 − 5, for which we have (zov − zspec)/(1 + zspec) between 0.019 and 0.038. These results show that the wPPM is thus effective in recovering well the redshifts of the overdensities, all over the entire broad redshift range covered by the SMGs.

thumbnail Fig. 4.

Top panel: Overdensity redshift measured by the PPM (zov) as a function of the spectroscopic redshift of the SMG (zspec). We refer to the legends at the top left and bottom right for the symbols and the color code used for the different SMGs and photo-z catalogs. Bottom panel: Redshift residuals as a function of zspec.

4.4. Structural properties of the overdensities

We next used the parameters measured by the wPPM to characterize the structural properties of the detected overdensities. As further described in Sect. 3, one key quantity is ℛPPM, which is the projected size of the overdensity measured as an angular separation from the SMG. We use ℛw to denote the projected size of the overdensity as found by the wavelet transform. In this case, a miscentering θov between the SMG and the wavelet-based overdensity peak is determined. Another important quantity is the number of galaxies (Nselected) with projected separations of less than ℛPPM from the SMG and with redshifts within ( z ov Δ z ¯ / 2 ; z ov + Δ z ¯ / 2 ) $ (z_{\mathrm{ov}}-\overline{\Delta z}/2;z_{\mathrm{ov}}+\overline{\Delta z}/2) $, where zov is the estimated redshift of the protocluster and Δ z ¯ = 0.3 $ \overline{\Delta z}=0.3 $ (see Sect. 3). For a given photo-z catalog, Nselected is thus a proxy for the richness of the detected overdensity. In Table 3 we report these quantities for all overdensities associated with the SMGs. In the following we discuss the mutual dependence of these overdensity properties.

4.4.1. Richness

Overdensities detected using the B19 catalog show the highest richness, at fixed SMG, up to values of Nselected ≃ 200, while much lower values Nselected ≃ 10 − 30 are found for the overdensities detected with the L18 or AH18 catalogs. This is not surprising, as the B18 catalog contains ∼(10 − 20) times more sources than both L18 and AH18 samples (see Sect. 2.2). Furthermore, we find that when we use the B19 catalog the overdensities associated with lower-redshift SMGs in our sample (i.e., z < 2.5) are 60–70% richer (∼150–200) in members than those located at higher redshifts. This behavior is expected as indeed the photometric redshift catalog is flux-limited and thus galaxy number densities decrease from lower to higher-redshift protoclusters. To some extent, the behavior can be also explained by the fact that at lower-z we are looking at a significantly more advanced stage of cluster formation, while at higher-z the overdensity cores embedded in the large-scale protocluster structures are less overdense and in the phase of assembly.

4.4.2. Size and redshift

In Figs. 5 and 6 we compare relevant quantities estimated with the wPPM for the different overdensities. For each SMG, in the case of multiple overdensity associations (see Sect. 4.2), we considered only the one closer to the SMG redshift. Figure 5 displays the overdensity size (in comoving and physical unit) as a function of the overdensity redshifts estimated with the PPM, for all all three photo-z catalogs. We overplot as solid lines the projected radius that corresponds to the minimum size allowed by the PPM tessellation of the projected space and equal to 60 arcsec. For the sake of simplicity we report only the ℛPPM values. We verified indeed that our results do not change when ℛw is instead plotted. Ultimately, this is because the relative median difference between ℛPPM and ℛw is somehow limited and equal to R w / R PPM 1 = 0 . 15 0.46 + 0.59 $ \mathcal{R}_{\mathit{w}}/\mathcal{R}_{\mathrm{PPM}}-1 =0.15^{+0.59}_{-0.46} $, where the reported uncertainties correspond to the 68% confidence interval. Altogether, the estimated overdensity sizes range between (1.5–4) Mpc (comoving) as well as between (0.4–1.0) Mpc (physical), quite independently of the redshift. We stress that these values are those associated with protocluster cores, while the total extent of large-scale protoclusters can reach (20–30) Mpc (physical, Muldrew et al. 2015; Casey 2016). These results further highlight the capability of the PPM in detecting megaparsec-scale overdensities around the SMGs, likely associated with the cores of larger-scale protoclusters.

thumbnail Fig. 5.

Overdensity core sizes in comoving units (top panel) and physical units (bottom panel) as a function of overdensity redshifts estimated by the PPM. The solid black lines show the evolution with redshift of the comoving (top) and physical (bottom) core sizes that correspond to an angular aperture of 60 arcsec. Different colors and symbols refer to the different photometric redshift catalogs used to detect the overdensities, as illustrated in the legend.

thumbnail Fig. 6.

Projected separation between the SMG coordinates and the overdensity peak as found by the wavelet transform of wPPM as a function of the estimated richness (top) and redshift (bottom) of the overdensity. The vertical line at zov ≳ 4 in the bottom panel is used to indicate the possible transitioning phase in the protocluster galaxy population. The color coding is the same as in Fig. 5.

4.4.3. Miscentering between the SMG and the overdensity peak.

Figure 6 shows instead the projected miscentering θov between the SMG and the overdensity peak as found with the wavelet transform, as a function of both the estimated overdensity richness Nselected and the overdensity redshift zov. Interestingly, θov has a broad range of values, from negligible offsets up to separations of ≃1 Mpc (physical). Any clear trend of θov is observed neither with Nselected nor with zov.

However, we do find evidence that the lowest θov ≲ 0.4 Mpc are preferentially associated with low richness overdensities (Nselected ≲ 20). The majority of the detected overdensities have instead larger θov ≃ (0.4 − 1.0) Mpc, and all rich overdensities (Nselected ≳ 20) have such large θov values. These results suggest that, in the cases of large θov, either the SMG is located in the outskirts of the overdensity core or that, in the megaparsec-scale surroundings of the SMG, there exists another, more overdense region that is found with the wavelet analysis. Visual inspection of the overdensity density maps in Appendix A indeed suggests that the morphology of the large-scale protoclusters is complex. In some cases (e.g., GN10, GN20, and GN20.2a) they have compact morphologies, similarly to CL J1001+0220 at z = 2.51 in the COSMOS field Wang et al. (2016). In other cases (e.g., HDF850.1, SMM J123634, SMM J123606, SMM J123618, SMM J123707, SMM J123711, and SMM J123712) our detected overdensities have evidence for large-scale filamentary or clumpy structures, as typically found in still-assembling protoclusters such as SSA22 at z = 3.09 (Steidel et al. 1998; Umehata et al. 2019), the Spiderweb at z = 2.16 (Jin et al. 2021), and Hyperion at z = 2.45 (Cucciati et al. 2018). These overdensities with nontrivial morphology represent a piece of indirect evidence for the complex dynamical state of the associated protoclusters, which are indeed non-relaxed structures.

Altogether, the measured miscentering values correspond to only a small fraction of the total extent of protoclusters, as they have overall sizes of 10–20 Mpc (physical; e.g., Jin et al. 2021). Some of our SMGs could still be potentially associated with brightest cluster galaxies as indeed found in previous studies (Emonts et al. 2013; Zhou et al. 2020). However, as outlined in Sect. 2.1.4, different SMGs may be associated with the same structure, so the identification of the proto-brightest cluster galaxies among the SMGs in our sample is not straightforward.

4.5. Protoclusters beyond z = 4

4.5.1. Megaparsec-scale overdensities and morphology

We now discuss in more detail the overdensities around the most distant SMGs, namely the GN20–GN20.2a pair (z = 4.05), HDF850.1 (z = 5.183), and GN10 (z = 5.303). As further outlined above, megaparsec-scale overdensities are detected around them at redshifts consistent with those of the SMGs. These SMGs are thus likely associated with large-scale protoclusters close to the epoch of reionization, which are excellent targets for the James Webb Space Telescope (JWST) and next generation spectrographs.

Figure 7 displays the PPM plots (left) and density maps (right) for the overdensities around the three SMGs, when using the AH18, B19, and L18 catalogs, for HDF850.1, GN10, and GN20, respectively. The PPM plots show clear overdensity patterns at the redshift of the SMGs. Patterns at lower and even higher (z ≃ 6) redshifts than the SMGs are also visible. These correspond to foreground and background overdensities. All three photo-z catalogs used in this work are thus effective in detecting protoclusters at the highest redshifts.

thumbnail Fig. 7.

PPM plots (left) and density maps (right) of protoclusters detected around z > 4 SMGs using the AH18 (top), B19 (center), and L18 (bottom) catalogs. Left: PPM plots for HDF850.1, GN10, and GN20. The vertical solid lines show the SMG spectroscopic redshifts. Colored dots refer to significance levels > 2σ (cyan), 3σ (green), 4σ (blue), 5σ (red), 6σ (brown), and 7σ (black). Right: Gaussian density maps for the overdensities, centered at the projected space coordinates of the SMGs. The solid black and dashed red circles are centered at the projected space coordinates of the SMGs. The former have a (physical) radius of 1 Mpc, estimated at zov, while the latter, with a radius ℛPPM, denote the region within which the PPM detects the overdensity. The dotted-dashed green circle is centered at the peak of the detection as found by the wavelet transform and has a radius ℛw.

The right panels of Fig. 7 instead show the density maps centered around the SMG coordinates. As pointed out also in Sect. 4.4, the morphology of these high-z overdensities appear heterogeneous. Several overdensity peaks are present around the SMGs. However, while GN10 and GN20 appear to be co-spatial with the overdensity with the highest significance in their field, the large-scale structure overdensity around HDF850.1 is more complex, with multiple peaks detected and a substantial miscentering of ∼90 arcsec (∼570 kpc) between the SMG and the most prominent overdensity peak in the southeast.

Interestingly, Calvi et al. (2021) performed a spectroscopic campaign of this overdensity and found a multicomponent system, similarly to what we find with the wPPM analysis of this work. They found that the overall structure is extended over 700 arcsec in projection (i.e., 4.4 Mpc) and presents a density enhancement located 200 arcsec (i.e., 1.3 Mpc) to the northeast of the SMG HDF850.1. This density enhancement is seen also in the density map of Fig. 7, while we additional probe another strong overdensity peak with a separation of 3 arcmin to the west of the SMG.

4.5.2. A transition epoch at z ≳ 4 for the protocluster galaxy population

Interestingly, the two SMGs at the highest redshifts, z ∼ 5 (i.e., HDF850.1 and GN10), are both found in overdensities only when the AH18 catalog is used. The other two photo-z catalogs (L18, B19) are less effective in finding the associated overdensities. Using the L18 catalog, which primarily includes Herschel sources, we do not find any of the two overdensities, while with the B19 photo-z we are able to detect the overdensity associated with GN10, and that around HDF850.1 is more uncertain, as the PPM detects an overdensity at a higher redshift than the SMG. Given these results, we speculate that we possibly witness a transitioning phase in the protocluster galaxy population at z ≳ 4. Protoclusters at z ≲ 4 are well detected with the B19 and L18 catalogs, which were built also using FIR data, including from Herschel. Therefore, the galaxy populations of these protocluster likely contain a number of dusty star-forming galaxies, in addition to the SMGs. On the other hand, the AH18 photo-z catalog, thanks to which we detect the highest-redshift protoclusters, contains only LAEs and LBGs. One interpretation is that protoclusters around SMGs at z ≲ 4, thus around the peak of the cosmic SFR density, are rich in dusty galaxies, while z ≳ 4 protoclusters are likely populated by less dusty galaxies, which appear as LAEs or LBGs (see also Fig. 6, bottom panel, for a schematic classification). Nevertheless, it is necessary to exercise caution in accepting this explanation outright, and further investigation with large samples is needed to firmly confirm the proposed scenario.

These results are in agreement with those by Malavasi et al. (2021). By studying the galaxy population of a protocluster at z = 4 traced by LAEs, they showed that lower-redshift LAEs are on average significantly dustier than their counterparts at higher redshifts, which supports our proposed scenario for z ≳ 4 being a transitioning epoch for the protocluster galaxy population. Similarly, we note that Calvi et al. (2021) recently performed a spectroscopic campaign targeting the protocluster around HDF850.1 and found that, beside HDF850.1, none of the confirmed members are bright in the FIR/(sub)millimeter wavelength range. Likewise, Daddi et al. (2009) detected the GN20–GN20.2a protocluster as an overdensity of B-band dropout LBGs. Overall, these independent results support our proposed scenario that z ≳ 4 corresponds to a transitioning epoch for the protocluster galaxy population.

4.6. SMG molecular gas masses and the overdensities

One of the main goals of this work is to test whether SMGs are tracers of protoclusters. Therefore, in order to investigate the possible physical interplay between the SMGs and their large-scale environments in Fig. 8 we plot the molecular H2 gas mass of the SMGs in our sample as a function of the associated overdensity significance, as found with the PPM. In the case of multiple detections for a given SMG, we averaged the different values obtained with the photo-z catalogs used in this work, listed in Table 3.

thumbnail Fig. 8.

Molecular H2 gas mass for each SMG in our sample versus the average overdensity significance (in units of σ) of the associated overdensity. The points are colored according to the SMG redshift, and different symbols refer to different SMGs. The shaded area highlights the minimum overdensity significance (= 2) that is adopted throughout this work.

Remarkably, we do observe a good correlation between MH2 and the significance of the detected overdensity. The Pearson and Spearman tests tell us that the probability that the null hypothesis (i.e., no correlation) is rejected is at a level of 3.7σ and 2.6σ, respectively (i.e., p-value  =  2.36  ×  10−4 and 9.76  ×  10−3). Furthermore, the highest values of MH2 ≃ 1011 M and highest overdensity significance are found in the case of the highly star-forming companions GN20 and GN20.2a at z ≃ 4.05, which have high FIR luminosities typical of HyLIRGs (see Sect. 2.1). This result was not predictable a priori as indeed high-z overdensities are associated with lower number counts of galaxies than those at lower redshifts and thus in principle more difficult to detect with a high significance.

Altogether, as pointed out in Sect. 2.1.2, our SMGs have, on average, the high SFRs and FIR luminosities typical of ULIRGs, which are often associated with gas-rich mergers (e.g., Sanders & Mirabel 1996; Downes & Solomon 1998; Genzel et al. 2001). Altogether, given the overall stellar and gas properties of the SMGs, we interpret the MH2 versus overdensity significance correlation as a result of the fact that the SMGs that live in the strongest overdensities are also those that experienced most the interaction with the companions, as it may be the case for the GN20–GN20.2a pair. This interaction likely triggered gas infall and high levels of gas content and star formation in the SMGs (see, e.g., Castignani et al. 2018; Noble et al. 2019, for similar examples of distant gas rich galaxies in dense environments). Galaxy mergers are indeed often associated with ULIRGs and high levels of IR luminosities ≳1012 L as for example in local major merging gas-rich pairs such as the Antennae Galaxies (Gao et al. 2001; Ueda et al. 2012; Whitmore et al. 2014).

4.7. Are SMGs good tracers of protoclusters?

In Table 4 we present the outcome of our analysis and summarize the list of overdensities that we detected with the wPPM around the SMGs. Secure overdensity detections associated with the SMGs are indicated as “yes,” non-detections as “no,” and tentative detections as “possible” when the estimated redshift of the overdensity is only marginally consistent with that of the SMG, that is, the absolute value of the difference between the two is ≳0.3. We also denote both GN20 and GN20.2a as “edge” in Col. (6) because they are located at the edge of the AH18 survey footprint.

Table 4.

Overdensities around GOODS-N SMGs.

As outlined in Sect. 2.1.4, some SMGs do belong to the same known overdensities as they are close to each other both in redshift and in projected coordinates. These are the triplet SMM J123711, SMM J123618, and SMM J123712 as well as the GN20–GN20.2a pair. For these sources, we obtain similar results when looking for overdensities in the three photo-z catalogs used in this work. The only exception is SMM J123618, for which we find an overdensity only in the L18 catalog, at variance with SMM J123711 and SMM J123712, for which overdensities are found both in B19 and L18.

In the case of the SMG ID.03, an overdensity is detected both in the B19 and AH18 photometric redshift catalogs but at a lower redshift; thus, we classified these overdensities as possible. Interestingly, the source ID.03 is the only SMG in our sample that is not associated with an overdensity in at least one of the three photometric redshift catalogs. ID.03, as well as ID.19 for which we detect a relatively poor 2σ overdensity, have the lowest gas masses MH2 ≃ (1 − 2)×1010 M among those reported in Table 1 for our SMG sample. These relatively low H2 gas masses imply that the two sources may not be true SMGs, that is, with strong dust-continuum emission and high molecular gas content. Furthermore, the two sources are not clearly detected in several CO(J → J–1) lines, as it is often the case for SMGs. Indeed, ID.19 has been detected in CO(3→2) only (Decarli et al. 2014), while ID.03 has relatively low-J detections by Decarli et al. (2014) and Boogaard et al. (2023) in CO(1→0) (tentative), CO(2→1), and CO(4→3).

Altogether, for 11 out of 12 SMGs we detect an overdensity in at least one of the three photometric catalogs. This corresponds a success rate of 92%±8%, where we report the average value as well as the root mean square uncertainty derived using the binomial distribution (see, e.g., Castignani et al. 2014b). When grouping the above mentioned SMGs that likely belong to the same parent protoclusters we find eight overdensities, while we recover all three previously known protoclusters. We thus more than double the number of known overdensities physically associated with SMGs in the GOODS-N field.

Interestingly, while we find a high fraction (92%) of SMGs in protoclusters, with a similar PPM-based search, Castignani et al. (2014b) found a lower fraction of ∼70%, when considering instead distant radio galaxies at z ∼ 1 − 2 in the COSMOS field, which suggests that bright SMGs may be better tracers of protocluster cores. We stress, however, that this comparison relies on different data sets and should be taken with caution, considering in particular the advent of forthcoming low-frequency radio surveys, for example with the Square Kilometer Array, which will enable the detection of distant radio-loud active galactic nuclei and radio galaxies down to the lowest radio powers.

In Table 4 (Col. 7) we report the overdensity success rate over all three considered photo-z catalogs. We find that the vast majority of the SMGs are found in overdensities with the B19 (8 out of 12 SMGs), L18 (9 out of 12 SMGs), and AH18 (2 out of 3 SMGs) catalogs, separately, where we took under consideration the fact that the AH18 catalog does not have sources at z < 3.35. These results correspond to similar success rates of 67%, 75%, and 67% for the three photo-z catalogs, respectively.

We stress here that PPM overdensities are detected down to 2σ significance, similarly to our previous studies (Castignani et al. 2014a,b, 2019). This is not unusual as indeed distant megaparsec-scale overdensities within larger-scale protoclusters are often characterized by low number counts and thus possibly detected with relatively low significance by the PPM. A notable example is the overdensity around COSMOS-FRI 03, which was detected by the PPM at zov = 2.39 ± 0.09 with a 2.5σ significance (Castignani et al. 2014b), and later spectroscopically confirmed at z = 2.45 (Diener et al. 2015). Another case is the 2.8σ PPM overdensity at zov = 2.65 ± 0.04 that Castignani et al. (2019) found around the spectroscopically confirmed COSMOS-FRI 70 radio galaxy (z = 2.625). Interestingly, if we limit ourselves to a higher significance threshold of ≥2.5σ and ≥3σ, we find only a marginally lower number of overdensities, six and four, respectively, compared to the eight recovered using the 2σ limit of this work; the three previously confirmed protoclusters in our sample (Chapman et al. 2009; Daddi et al. 2009; Walter et al. 2012) are all detected at ≥3σ by the PPM with a least one photo-z catalog. Altogether, we expect the PPM overdensities reported in this work to have total masses greater than ∼1013 M, which is what Castignani et al. (2014b, see their Sect. 8.7.1) found by cross-matching the PPM overdensities with catalogs of distant groups in COSMOS with available mass estimates.

5. Summary and conclusions

In this paper we have employed the PPM along with its wavelet-based extension (wPPM; Castignani et al. 2014a, 2019) to carry out a detailed statistical analysis of the megaparsec-scale environments of distant SMGs. More specifically, we applied the wPPM to search for protoclusters around a sample of 12 spectroscopically confirmed SMGs at z ≃ 1.2 − 5.3 in the GOODS-N field using three photometric redshift catalogs independently. The main results of this analysis can be summarized as follows:

  • Of the 12 SMGs in our sample, 11 (i.e., 92%±8%) are physically associated with megaparsec-scale overdensities. Altogether, these ten SMGs belong to eight overdensities. Three of them are previously known protoclusters, and we detect five new protoclusters between z ∼ 1.2 and 5.3. We have thus doubled the number of known overdensities physically related to SMGs in the GOODS-N field. The detection rates for each of the three photometric catalogs range between 67% and 75%. By comparison with the spectroscopic redshifts (zspec) of the SMGs, we find that the redshifts of the detected overdensities (zov) are well recovered, with an accuracy of σ((zov − zspec)/(1 + zspec)) = 0.043.

  • A wavelet-based analysis of the protocluster fields shows that the majority of the detected overdensities have separations of (0.4 − 1.0) Mpc between the SMG and the overdensity peak. Therefore, the SMGs do not always reside in the most overdense peak of the protocluster. They live in protocluster cores or overdensities that have an average size of ∼(0.4 − 1) Mpc and are characterized by a complex morphology (compact, filamentary, or clumpy).

  • We find a good correlation between the molecular (H2) gas mass of the SMGs and the overdensity detection significance. We interpret this correlation as a result of the fact that the SMGs that live in the strongest overdensities are also those that experienced the most interaction with their companions. This interaction likely triggered gas infall and high levels of gas content and star formation in the SMGs.

  • We speculate that we are possibly witnessing a transitioning phase at z ≃ 4 for the galaxy population of protoclusters. While z ≲ 4 protoclusters appear to be mostly populated by dusty galaxies, those at higher z are mostly detected as overdensities of LAEs or LBGs. However, it is essential to approach this interpretation with caution. Further studies on larger samples are needed to validate and expand upon these findings and would ultimately provide a more solid foundation for drawing definitive conclusions.

The analysis and results presented in this paper support the scenario that SMGs are excellent signposts of protoclusters. We stress that this result was made possible thanks to the use of (i) spectroscopically confirmed SMGs, (ii) good photometric redshifts for the sources in the field of the SMGs themselves, and (iii) a specific method, that is, the wPPM, which is tailored to find overdensities around specific tracers in the regime of low galaxy number counts and shot noise. This is at variance with previous studies, which used mostly photometrically selected SMGs, that did not find a high fraction of SMGs in overdensities (Álvarez Crespo et al. 2021; Gao et al. 2022)

Altogether, the protoclusters presented in this paper are excellent targets for ongoing facilities such as JWST. Furthermore, the advent of infrared facilities such as JWST (Laporte et al. 2022; Morishita et al. 2023) and Euclid (Euclid Collaboration 2022), as well as next-generation wide-field spectrographs such as MOONS (Cirasuolo et al. 2020), will provide an unprecedented leverage to constrain galaxy evolution in assembling protoclusters, which makes the present work a benchmark for future studies with larger samples of SMGs and protoclusters than those considered in this work.

Interactive Figures

Three interactive images associated with Fig. 3 Access here


Acknowledgments

RC acknowledges partial financial support through the grant PRIN-MIUR 2017WSCC3. GC acknowledges the support from the grant ASI n.2018-23-HH.0. RC and HD acknowledge financial support from the Agencia Estatal de Investigación del Ministerio de Ciencia e Innovación (AEI-MCINN) under grant (La evolución de los cíumulos de galaxias desde el amanecer hasta el mediodía cósmico) with reference (PID2019-105776GB-I00/DOI:10.13039/501100011033). HD acknowledges support from the ACIISI, Consejería de Economía, Conocimiento y Empleo del Gobierno de Canarias and the European Regional Development Fund (ERDF) under grant with reference PROID2020010107.

References

  1. Alberts, S., & Noble, A. 2022, Universe, 8, 554 [Google Scholar]
  2. Álvarez Crespo, N., Smolić, V., Finoguenov, A., Barrufet, L., & Aravena, M. 2021, A&A, 646, A174 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Arrabal Haro, P., Rodríguez Espinosa, J. M., Muñoz-Tuñón, C., et al. 2018, MNRAS, 478, 3740 [NASA ADS] [CrossRef] [Google Scholar]
  4. Barger, A. J., Cowie, L. L., Sanders, D. B., et al. 1998, Nature, 394, 248 [Google Scholar]
  5. Barro, G., Pérez-González, P. G., Cava, A., et al. 2019, ApJS, 243, 22 [NASA ADS] [CrossRef] [Google Scholar]
  6. Birkin, J. E., Weiss, A., Wardlow, J. L., et al. 2021, MNRAS, 501, 3926 [Google Scholar]
  7. Boogaard, L. A., Decarli, R., Walter, F., et al. 2023, ApJ, 945, 111 [NASA ADS] [CrossRef] [Google Scholar]
  8. Bothwell, M. S., Smail, I., Chapman, S. C., et al. 2013, MNRAS, 429, 3047 [Google Scholar]
  9. Brinch, M., Greve, T. R., Weaver, J. R., et al. 2023, ApJ, 943, 153 [NASA ADS] [CrossRef] [Google Scholar]
  10. Calvi, R., Dannerbauer, H., Haro, P. A., et al. 2021, MNRAS, 502, 4558 [NASA ADS] [CrossRef] [Google Scholar]
  11. Capak, P., Aussel, H., Ajiki, M., et al. 2007, ApJS, 172, 99 [Google Scholar]
  12. Capak, P. L., Riechers, D., Scoville, N. Z., et al. 2011, Nature, 470, 233 [Google Scholar]
  13. Caputi, K. I., Caminha, G. B., Fujimoto, S., et al. 2021, ApJ, 908, 146 [NASA ADS] [CrossRef] [Google Scholar]
  14. Casey, C. M. 2016, ApJ, 824, 36 [CrossRef] [Google Scholar]
  15. Casey, C. M., Cooray, A., Capak, P., et al. 2015, ApJ, 808, L33 [NASA ADS] [CrossRef] [Google Scholar]
  16. Casey, C. M., Narayanan, D., & Cooray, A. 2014, Phys. Rep., 541, 45 [Google Scholar]
  17. Castignani, G., Chiaberge, M., Celotti, A., & Norman, C. 2014a, ApJ, 792, 113 [NASA ADS] [CrossRef] [Google Scholar]
  18. Castignani, G., Chiaberge, M., Celotti, A., Norman, C., & De Zotti, G. 2014b, ApJ, 792, 114 [NASA ADS] [CrossRef] [Google Scholar]
  19. Castignani, G., Combes, F., Salomé, P., et al. 2018, A&A, 617, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  20. Castignani, G., Combes, F., Salomé, P., et al. 2019, A&A, 623, A48 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Cepa, J. 2010, Astrophys. Space Sci. Proc., 14, 15 [Google Scholar]
  22. Chapman, S. C., Blain, A., Ibata, R., et al. 2009, ApJ, 691, 560 [NASA ADS] [CrossRef] [Google Scholar]
  23. Cirasuolo, M., Fairley, A., Rees, P., et al. 2020, The Messenger, 180, 10 [NASA ADS] [Google Scholar]
  24. Clements, D. L., Braglia, F. G., Hyde, A. K., et al. 2014, MNRAS, 439, 1193 [NASA ADS] [CrossRef] [Google Scholar]
  25. Clements, D. L., Braglia, F., Petitpas, G., et al. 2016, MNRAS, 461, 1719 [NASA ADS] [CrossRef] [Google Scholar]
  26. Cucciati, O., Lemaux, B. C., Zamorani, G., et al. 2018, A&A, 619, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Daddi, E., Dannerbauer, H., Stern, D., et al. 2009, ApJ, 694, 1517 [Google Scholar]
  28. Dannerbauer, H., Kurk, J. D., De Breuck, C., et al. 2014, A&A, 570, A55 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Decarli, R., Walter, F., Carilli, C., et al. 2014, ApJ, 782, 78 [NASA ADS] [CrossRef] [Google Scholar]
  30. Diener, C., Lilly, S. J., Ledoux, C., et al. 2015, ApJ, 802, 31 [NASA ADS] [CrossRef] [Google Scholar]
  31. Downes, D., & Solomon, P. M. 1998, ApJ, 507, 615 [NASA ADS] [CrossRef] [Google Scholar]
  32. Emonts, B. H. C., Feain, I., Röttgering, H. J. A., et al. 2013, MNRAS, 430, 3465 [NASA ADS] [CrossRef] [Google Scholar]
  33. Euclid Collaboration (Scaramella, R., et al.) 2022, A&A, 662, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  34. Flores-Cacho, I., Pierini, D., Soucail, G., et al. 2016, A&A, 585, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. Fu, H., Cooray, A., Feruglio, C., et al. 2013, Nature, 498, 338 [CrossRef] [Google Scholar]
  36. Gao, Y., Lo, K. Y., Lee, S. W., & Lee, T. H. 2001, ApJ, 548, 172 [Google Scholar]
  37. Gao, F., Wang, L., Ramos Padilla, A. F., et al. 2022, A&A, 668, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Genzel, R., Tacconi, L. J., Rigopoulou, D., Lutz, D., & Tecza, M. 2001, ApJ, 563, 527 [NASA ADS] [CrossRef] [Google Scholar]
  39. Greve, T. R., Bertoldi, F., Smail, I., et al. 2005, MNRAS, 359, 1165 [NASA ADS] [CrossRef] [Google Scholar]
  40. Grogin, N. A., Kocevski, D. D., Faber, S. M., et al. 2011, ApJS, 197, 35 [NASA ADS] [CrossRef] [Google Scholar]
  41. Guaita, L., Aravena, M., Gurung-Lopez, S., et al. 2022, A&A, 660, A137 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Hainline, L. J., Blain, A. W., Smail, I., et al. 2011, ApJ, 740, 96 [NASA ADS] [CrossRef] [Google Scholar]
  43. Hughes, D. H., Serjeant, S., Dunlop, J., et al. 1998, Nature, 394, 241 [Google Scholar]
  44. Hung, C.-L., Casey, C. M., Chiang, Y.-K., et al. 2016, ApJ, 826, 130 [NASA ADS] [CrossRef] [Google Scholar]
  45. Ivison, R. J., Swinbank, A. M., Smail, I., et al. 2013, ApJ, 772, 137 [NASA ADS] [CrossRef] [Google Scholar]
  46. Jin, S., Dannerbauer, H., Emonts, B., et al. 2021, A&A, 652, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Kennicutt, R. C., Jr 1998, ApJ, 498, 541 [Google Scholar]
  48. Klypin, A. A., Trujillo-Gomez, S., & Primack, J. 2011, ApJ, 740, 102 [NASA ADS] [CrossRef] [Google Scholar]
  49. Koekemoer, A. M., Faber, S. M., Ferguson, H. C., et al. 2011, ApJS, 197, 36 [NASA ADS] [CrossRef] [Google Scholar]
  50. Laporte, N., Zitrin, A., Dole, H., et al. 2022, A&A, 667, L3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Liu, D., Daddi, E., Dickinson, M., et al. 2018, ApJ, 853, 172 [Google Scholar]
  52. Liu, D., Daddi, E., Schinnerer, E., et al. 2021, ApJ, 909, 56 [NASA ADS] [CrossRef] [Google Scholar]
  53. Malavasi, N., Lee, K.-S., Dey, A., et al. 2021, ApJ, 921, 103 [NASA ADS] [CrossRef] [Google Scholar]
  54. Michałowski, M., Hjorth, J., & Watson, D. 2010, A&A, 514, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  55. Miley, G., & De Breuck, C. 2008, A&ARv, 15, 67 [Google Scholar]
  56. Miller, T. B., Hayward, C. C., Chapman, S. C., & Behroozi, P. S. 2015, MNRAS, 452, 878 [NASA ADS] [CrossRef] [Google Scholar]
  57. Miller, T. B., Chapman, S. C., Aravena, M., et al. 2018, Nature, 556, 469 [CrossRef] [Google Scholar]
  58. Morishita, T., Roberts-Borsani, G., Treu, T., et al. 2023, ApJ, 947, L24 [NASA ADS] [CrossRef] [Google Scholar]
  59. Muldrew, S. I., Hatch, N. A., & Cooke, E. A. 2015, MNRAS, 452, 2528 [NASA ADS] [CrossRef] [Google Scholar]
  60. Neri, R., Cox, P., Omont, A., et al. 2020, A&A, 635, A7 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  61. Noble, A. G., Muzzin, A., McDonald, M., et al. 2019, ApJ, 870, 56 [Google Scholar]
  62. Oteo, I., Ivison, R. J., Dunne, L., et al. 2018, ApJ, 856, 72 [Google Scholar]
  63. Overzier, R. A. 2016, A&ARv, 24, 14 [Google Scholar]
  64. Pérez-González, P. G., Cava, A., Barro, G., et al. 2013, ApJ, 762, 46 [Google Scholar]
  65. Reuland, M., Röttgering, H., & van Breugel, W. 2003, New Astron. Rev., 47, 303 [CrossRef] [Google Scholar]
  66. Riechers, D. A., Capak, P. L., Carilli, C. L., et al. 2010, ApJ, 720, L131 [Google Scholar]
  67. Riechers, D. A., Hodge, J. A., Pavesi, R., et al. 2020, ApJ, 895, 81 [Google Scholar]
  68. Rowan-Robinson, M. 2000, MNRAS, 316, 885 [Google Scholar]
  69. Sanders, D. B., & Mirabel, I. F. 1996, ARA&A, 34, 749 [Google Scholar]
  70. Smail, I., Ivison, R. J., & Blain, A. W. 1997, ApJ, 490, L5 [NASA ADS] [CrossRef] [Google Scholar]
  71. Solomon, P. M., & Vanden Bout, P. A. 2005, ARA&A, 43, 677 [NASA ADS] [CrossRef] [Google Scholar]
  72. Steidel, C. C., Adelberger, K. L., Dickinson, M., et al. 1998, ApJ, 492, 428 [Google Scholar]
  73. Tan, Q., Daddi, E., Magdis, G., et al. 2014, A&A, 569, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  74. Ueda, J., Iono, D., Petitpas, G., et al. 2012, ApJ, 745, 65 [NASA ADS] [CrossRef] [Google Scholar]
  75. Umehata, H., Fumagalli, M., Smail, I., et al. 2019, Science, 366, 97 [Google Scholar]
  76. Walter, F., Decarli, R., Carilli, C., et al. 2012, Nature, 486, 233 [Google Scholar]
  77. Wang, W.-H., Kohno, K., Hatsukade, B., et al. 2016, ApJ, 833, 195 [NASA ADS] [CrossRef] [Google Scholar]
  78. Weaver, J. R., Kauffmann, O. B., Ilbert, O., et al. 2022, ApJS, 258, 11 [NASA ADS] [CrossRef] [Google Scholar]
  79. Whitmore, B. C., Brogan, C., Chandar, R., et al. 2014, ApJ, 795, 156 [NASA ADS] [CrossRef] [Google Scholar]
  80. Zhou, L., Elbaz, D., Franco, M., et al. 2020, A&A, 642, A155 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

Appendix A: PPM results

In Figs. A.1, A.2, and A.3 we report the PPM plots and density maps for all overdensities reported in Table 3 when using the B19, L18, and AH18 catalogs, respectively. We refer to Sect. 3 and our previous studies (Castignani et al. 2014a,b, 2019) for details about these wPPM outputs.

thumbnail Fig. A.1.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the B19 photo-z catalog. See the text for details about the color coding.

thumbnail Fig. A.1.

continued.

thumbnail Fig. A.1.

continued.

thumbnail Fig. A.2.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the L18 photo-z catalog. See the text for details about the color coding.

thumbnail Fig. A.2.

continued.

thumbnail Fig. A.2.

continued.

thumbnail Fig. A.3.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the AH18 photo-z catalog. See the text for details about the color coding.

The left panels show the PPM plots where the overdensity patterns at different redshifts along the line of sight of each SMG are displayed. In each plot the vertical solid line shows the SMG spectroscopic redshift. Colored dots refer to significance levels > 2σ (cyan), 3σ (green), 4σ (blue), 5σ (red), 6σ (brown), and 7σ (black).

The right panels display the Gaussian density maps centered at the projected coordinates of the SMGs. The pixel size is 1/16 Mpc while the Gaussian kernel has σ = 3/16 Mpc. Sources with photometric redshifts between z ov Δ z ¯ / 2 $ z_{ov}-\overline{\Delta z}/2 $ and z ov + Δ z ¯ / 2 $ z_{ov}+\overline{\Delta z}/2 $ were considered to produce the maps. The values of zov are reported in Table 3, while Δ z ¯ = 0.3 $ \overline{\Delta z}=0.3 $ except for ID.19 (for B19 and L18 catalogs) and GN20.2a (B19 catalog), for which we chose Δ z ¯ = 0.2 $ \overline{\Delta z}=0.2 $ (see Table 3). The solid black and dashed red circles are centered at the projected coordinates of the SMG. The former have a (physical) radius of 1 Mpc, estimated at zov, while the latter, with a radius ℛPPM, denote the region within which the PPM detects the overdensity. The dotted-dashed green circle is centered at the peak of the detection as found by the wavelet transform and has a radius ℛw.

All Tables

Table 1.

Properties of the GOODS-N SMGs.

Table 2.

Summary of the SMG properties: coordinates and known overdensities.

Table 3.

Properties of the megaparsec-scale overdensities around the SMGs.

Table 4.

Overdensities around GOODS-N SMGs.

All Figures

thumbnail Fig. 1.

Photometric redshift distributions of the B19 (Barro et al. 2019), L18 (Liu et al. 2018), and AH18 (Arrabal Haro et al. 2018) catalogs in the GOODS-N field.

In the text
thumbnail Fig. 2.

Redshift distribution of the SMGs. The associated overdensities found using the B19 (left), L18 (center), and AH18 (right) photo-z catalogs are highlighted with different colors. The shaded region in the right histogram indicates that there are no sources at z < 3.35 in the AH18 catalog.

In the text
thumbnail Fig. 3.

Distribution of the SMGs and associated protocluster galaxies in the RA, Dec, and redshift space when the B19, L18, and AH18 photometric redshift catalogs are used for the detection of protoclusters. Small gray and colored dots always correspond to sources from the B19 catalog, which is chosen for a better visualization as it is the largest photometric redshift catalog. Red squares show the location of the SMGs in our sample. Colored clouds show the members of the overdensities as identified with the PPM. Protocluster members are color coded in yellow, green, and blue according to their increasing redshift. Interactive 3D plots for all three photometric redshift catalogs are available online.

In the text
thumbnail Fig. 4.

Top panel: Overdensity redshift measured by the PPM (zov) as a function of the spectroscopic redshift of the SMG (zspec). We refer to the legends at the top left and bottom right for the symbols and the color code used for the different SMGs and photo-z catalogs. Bottom panel: Redshift residuals as a function of zspec.

In the text
thumbnail Fig. 5.

Overdensity core sizes in comoving units (top panel) and physical units (bottom panel) as a function of overdensity redshifts estimated by the PPM. The solid black lines show the evolution with redshift of the comoving (top) and physical (bottom) core sizes that correspond to an angular aperture of 60 arcsec. Different colors and symbols refer to the different photometric redshift catalogs used to detect the overdensities, as illustrated in the legend.

In the text
thumbnail Fig. 6.

Projected separation between the SMG coordinates and the overdensity peak as found by the wavelet transform of wPPM as a function of the estimated richness (top) and redshift (bottom) of the overdensity. The vertical line at zov ≳ 4 in the bottom panel is used to indicate the possible transitioning phase in the protocluster galaxy population. The color coding is the same as in Fig. 5.

In the text
thumbnail Fig. 7.

PPM plots (left) and density maps (right) of protoclusters detected around z > 4 SMGs using the AH18 (top), B19 (center), and L18 (bottom) catalogs. Left: PPM plots for HDF850.1, GN10, and GN20. The vertical solid lines show the SMG spectroscopic redshifts. Colored dots refer to significance levels > 2σ (cyan), 3σ (green), 4σ (blue), 5σ (red), 6σ (brown), and 7σ (black). Right: Gaussian density maps for the overdensities, centered at the projected space coordinates of the SMGs. The solid black and dashed red circles are centered at the projected space coordinates of the SMGs. The former have a (physical) radius of 1 Mpc, estimated at zov, while the latter, with a radius ℛPPM, denote the region within which the PPM detects the overdensity. The dotted-dashed green circle is centered at the peak of the detection as found by the wavelet transform and has a radius ℛw.

In the text
thumbnail Fig. 8.

Molecular H2 gas mass for each SMG in our sample versus the average overdensity significance (in units of σ) of the associated overdensity. The points are colored according to the SMG redshift, and different symbols refer to different SMGs. The shaded area highlights the minimum overdensity significance (= 2) that is adopted throughout this work.

In the text
thumbnail Fig. A.1.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the B19 photo-z catalog. See the text for details about the color coding.

In the text
thumbnail Fig. A.2.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the L18 photo-z catalog. See the text for details about the color coding.

In the text
thumbnail Fig. A.3.

PPM plots (left) and density maps (right) for the SMGs and corresponding overdensities found using the AH18 photo-z catalog. See the text for details about the color coding.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.