The VIMOS Ultra Deep Survey: Emerging from the Dark, a Massive Proto-Cluster at z~4.57

Using spectroscopic observations taken for the VIMOS Ultra-Deep Survey (VUDS) we report here on the discovery of PCl J1001+0220, a massive proto-cluster located at $z_{spec}\sim4.57$ in the COSMOS field. The proto-cluster was initially detected as a $\sim12\sigma$ overdensity of typical star-forming galaxies in the blind spectroscopic survey of the early universe ($2<z<6$) performed by VUDS. It was further mapped using a new technique developed that statistically combines spectroscopic and photometric redshifts, the latter derived from a recent compilation of deep multi-band imaging. Through various methods, the descendant halo mass of PCl J1001+0220 is estimated to be $\log(M_{h}/M_{\odot})_{z=0}\sim14.5-15$ with a large amount of mass apparently already in place at $z\sim4.57$. Tentative evidence is found for a fractional excess of older and more massive galaxies within the proto-cluster, an observation which suggests the pervasive early onset of vigorous star formation. No evidence is found for the differences in the star formation rates of member and a matched sample of coeval field galaxies either through rest-frame ultraviolet methods or through stacking extremely deep Very Large Array 3 GHz imaging. Additionally, no evidence for pervasive strong active galactic nuclei (AGN) activity is observed. Analysis of Hubble Space Telescope images provides weak evidence for for an elevated incidence of galaxy-galaxy interaction within the proto-cluster. The spectral properties of the two samples are compared, with a definite suppression of Ly$\alpha$ seen in the average member galaxy relative to the coeval field ($f_{esc,Ly\alpha}=1.8^{+0.3}_{-1.7}$% and $4.0^{+1.0}_{-0.8}$%, respectively). This observation along with other lines of evidence leads us to infer the possible presence of a large, cool diffuse medium within the proto-cluster environment evocative of a nascent intracluster medium.


Introduction
The last decade and a half has seen a revolution in the study of overdensities in the early Universe. While the study and careful characterization of large associations of galaxies in the local Universe has been possible for nearly a century, and in the intermediate redshift Universe for a significant fraction of that time (e.g., Shapley & Ames 1926;Shapley 1930;Zwicky 1937;Abell 1958;Zwicky et al. 1961), the study of their progenitors presented several practical problems which have prevented their study until relatively recently. The primary problem, inherent to the study of nearly all galaxy populations in the early Universe, is the extreme apparent faintness of galaxy populations at these distances. While some phenomena exist in the early Universe, such as quasars or radio galaxies, which are so powerful and intrinsically bright that they have been able to serve as beacons to early searches near the epoch of H I reionization (z ∼ 5.5-10, Becker et al. 2001Becker et al. , 2015Planck Collaboration XIII 2016), the bulk of the galaxy population residing in the early Universe does not contain such phenomena (Miley & De Breuck 2008;Ouchi et al. 2008;Lemaux et al. 2014b;Ueda et al. 2014;Talia et al. 2017). As such, the first searches for these more typical primeval galaxies were largely doomed to failure (Davis & Wilkinson 1974;Partridge 1974;Pritchet & Hartwick 1987;Parkes et al. 1994). It was not until the advent of the 10m-class ground-based telescopes largely used in conjunction with the Hubble Space Telescope (HST) that the prospect of detecting and characterizing moderate samples of such galaxies became even remotely feasible (e.g., Steidel et al. 1999;Shapley et al. 2003;Giavalisco et al. 2004;Stanway et al. 2004;Malhotra & Rhoads 2004;Malhotra et al. 2005;Vanzella et al. 2005, Le Fèvre et al. 2005. With this, the prospect of finding and characterizing analogs of the progenitors of the massive clusters and superclusters of galaxies scattered throughout the local Universe began to come within the realm of possibility. However, other issues remained. At least in the local and intermediate-redshift Universe (z < ∼ 1.5), massive overdensities of galaxies are relatively rare (Piffaretti et al. 2011;Campanelli et al. 2012) requiring searches over large swaths of the sky. While redshifts derived from imaging data alone can, with careful calibration, be used to select relatively pure and complete samples of overdensities at these redshifts, and while en masse smallscale clustering of at least certain types of populations may be reproducible with carefully crafted photometric redshift schemes (e.g., Ménard et al. 2013;Schmidt et al. 2013;Aragon-Calvo et al. 2015), the internal structure and dynamics of such overdensities are impossible to characterize on an individual basis without considerable investment of telescope time to obtain spectroscopic followup observations (see e.g., Oke et al. 1998;Dressler et al. 1999;Halliday et al. 2004;Gal et al. 2008;Dressler et al. 2013 and references therein). As both of these issues are potentially exacerbated at higher redshift (z > ∼ 1.5), searches for overdensities at such redshifts were potentially required to both be larger and to compensate for larger levels of impurity and incompleteness than their lower redshift analogues. The large amounts of telescope time necessary to detect and confirm even the most modest number of individual high-redshift galaxies made such large-scale searches prohibitive. Further, the study of galaxy overdensities, clusters, superclusters, and, to a lesser extent, groups of galaxies, in the local and intermediate-redshift Universe takes a somewhat regularized form. Such overdensities are known to contain certain markers that allow them to be detected readily, though with varying degrees of purity and completeness. To some degree, nearly all massive overdensities in the low-to intermediate-redshift Universe are marked by a sequence of bright, redder galaxies (e.g., Gladders & Yee 2005;Ascaso et al. 2014Ascaso et al. , 2015 and by a hot medium, detectable, at least in principle, in the X-ray through bremsstrahlung emission and at mm/cm wavelengths through the Sunyaev-Zel'dovich effect (e.g., Pierre et al. 2004Pierre et al. , 2016Muchovej et al. 2007;Piffaretti et al. 2011;Hasselfield et al. 2013;Rumbaugh et al. 2013;Clerc et al. 2014;Bleem et al. 2015). Additionally, the main component of the mass of these overdensities typically takes a regular, triaxial shape, which, combined with their immense mass and favorable geometric conditions, allowed for at least some of these overdensities to be detected through weak or strong gravitational lensing (e.g., Paczynski 1987;Soucail et al. 1987;Fort & Mellier 1994;Clowe et al. 1998;Gladders et al. 2003;Jee et al. 2006;Hoekstra et al. 2013;Schrabback et al. 2018). While exceptions exist, it was these signposts that allowed for the success of early systematic searches for such overdensities, even with relatively moderate means. Analogs of the progenitors of these populations do not, however, necessarily share these signposts. The increasingly short time allowed for galaxies to form and evolve, to generate and heat an overarching medium, and for an overdensity to build up even a remote fraction of its eventual total mass, requires that the utility of all the methods mentioned above must necessarily decrease and eventually fail as searches move to higher redshift.
The first searches for high-redshift overdensities, historically termed proto-clusters, attempted to overcome these deficiencies, with considerable success, by utilizing the previously mentioned beacons, quasars, other radio-emitting active galactic nuclei (AGN), and extremely prodigious star-forming galaxies also known as sub-mm galaxies (Ivison et al. 2000;Pentericci et al. 2000;Venemans et al. 2002;Miley et al. 2004;Zheng et al. 2006;Overzier et al. 2008). The initial exploration of the surroundings of such phenomena took the form of large-scale deep narrowband imaging which allowed for the determination of the density of specific types of star-forming galaxies known as Lyman α λ1215.7 Å emitters (LAEs) as searches for such galaxies in this manner were relatively economical in terms of telescope time. Despite the success of these searches, it was not clear that such beacons were a requisite condition for a proto-cluster to form in the early Universe and, if not, whether the proto-clusters found around such phenomena were typical. Further, only preliminary estimates were available of how the clustering of LAEs related to the clustering of galaxies as a whole (e.g., Ouchi et al. 2003), the former being a relatively small subset of the overall population (Schenker et al. 2012;Cassata et al. 2015), which, at least when selected through narrowband imaging, are potentially biased towards specific types of galaxies (e.g., Gawiser et al. 2006;Trainor et al. 2015). While later surveys employed identical techniques to target other recombination lines, such as, for example, the Hα λ6563 Å feature (e.g., Hatch et al. 2011;Matsuda et al. 2011), both these surveys and those of LAEs potentially suffered from lack of spectroscopic confirmation of the presumed features. Though some earlier methods detected and attempted to characterize proto-clusters through the photometric selection of a more representative population of high-redshift galaxies known as Lyman break galaxies (LBGs), the lack of redshift resolution of early LBG selection techniques (∆z ∼ 1) meant that only a few such selected overdensities held up to scrutiny (e.g., Steidel et al. 1998).
In recent years, however, searches have begun to shift to large samples of L * -type galaxies using a variety of different techniques. While systematic spectroscopic characterization is still a luxury, large deep photometric surveys combined with dedicated spectroscopic followup of interesting patches of the sky, as well as archival searches of compilations of observations, are starting to produce results, with regular, well-defined selection criteria for proto-clusters (Capak et al. 2011b;Chiang et al. 2014;Dey et al. 2016;Wang et al. 2016;Franck & McGaugh 2016a,b;Toshikawa et al. 2016). In addition, contemporary searches are persisting over large regions of the sky using radio or sub-mm galaxies (Wylezalek et al. 2013;Casey et al. 2015;Smolčić et al. 2017a) allowing for a comparison sample to those proto-clusters selected blind to such beacons, crucial to understanding whether or not the presence of such phenomena biases the types of protocluster environments found in these searches (see e.g., Hatch et al. 2014). Correspondingly, major advances in the understanding of proto-clusters are coming from the study of both N-body simulations and semi-analytic models (e.g., Chiang et al. 2013;Muldrew et al. 2015;Orsi et al. 2016). Both observations and models are converging on the realization that proto-clusters are extremely large in volume, contain diverse galaxy populations, and are more or less readily detectable depending on the galaxy population observed. Developing selection criteria applicable to all types of proto-clusters, a precise, well-characterized methodology of defining membership, and a consistent method to construct comparable lower-density galaxy samples, all of which were elusive in earlier searches for proto-clusters due to the inhomogeneity in both the conception of what a proto-cluster is and the observations employed for the search, is crucial to advancing the study of environmentally-driven evolution in the early Universe. With new large-scale deep photometric and spectroscopic surveys imminent, such as those planned with the Large Synaptic Survey Telescope (LSST), Euclid, Hobby-Eberly Telescope Dark Energy Experiment (HETDEX), and Subaru Prime Focus Spectrograph (PFS), all of which capable, in principle, of detecting extremely large numbers of proto-clusters, the need to determine an operational definition of these structures becomes imperative.
The wide-scale detection and careful characterization of high-redshift overdensities is not simply an academic exercise. Many open questions remain regarding the formation and early development in the life of present-day massive clusters and the constituent galaxies which seed them. While several studies of z ∼ 1 clusters have placed the formation epoch of the massive red galaxies prevalent in such structures at z f ∼ 3-4 (Rettura et al. 2010;Raichoor et al. 2012;Lemaux et al. 2012;Zeimann et al. 2012), observational evidence of rapidly forming massive galaxies in proto-clusters at these redshifts is sparse. Such activity, if observed, would contain natural pathways not only to form the requisite number of massive quiescent galaxies observed in intermediate-redshift clusters through gas depletion, but also to begin the transfer of baryonic mass away from the constituent galaxies and into the infant intracluster medium (proto-ICM) through merging activity, tidal stripping, stellar feedback, and AGN activity. The inferred presence of a dense medium in the few z∼ 2-3 overdensities where such an investigation has been possible (e.g., Gobat et al. 2011;Cucciati et al. 2014;Lee et al. 2016;Wang et al. 2016;Cai et al. 2017) strongly suggests that the processes by which the earliest (proto-)cluster galaxies are quenched occur coevally with the formation of this medium. The observation of pervasive AGN activity in many high-redshift proto-clusters also suggests a possible mechanism to pre-heat the proto-ICM, a necessary requirement to reproduce observed cluster scaling relations (see e.g., Hilton et al. 2012;Kravtsov & Borgani 2012 and references therein). The large amount of diffuse gas possibly contained within the proto-ICM and in areas surrounding proto-clusters may cause such regions to be the last in the Universe to undergo H I reionization (e.g., Ciardi et al. 2003), though it is also possible that, due to the increased density of star-forming galaxies, they are the first (e.g., Castellano et al. 2016).
In this paper we report on the systematic search for high-redshift overdensities in the Cosmic Evolution Survey (COSMOS; Scoville et al. 2007b) field using new observations from the VIsible Multi-Object Spectrograph (VIMOS; Le Fèvre et al. 2003) taken as part of the VIMOS Ultra-Deep Survey (VUDS; Le . These observations were combined with new and archival deep multi-band imaging and used in conjunction with a new density-mapping technique to discover PCl J1001+0220, a massive proto-cluster at z ∼ 4.57 emerging just ∼250-500 Myr after the canonical end of H I reionization. The structure of the paper is as follows. Section 2 provides an overview of the spectroscopic and imaging data available in the COSMOS field, as well as the derivation of physical parameters of galaxies in our sample, with particular attention paid to observations from the VUDS survey. Section 3 describes the search methodology employed used to discover PCl J1001+0220 and to quantify its statistical overdensity. In Sect. 4 we describe the estimation of the total mass of PCl J1001+0220 and an investigation of the properties of its spectroscopically-confirmed members compared with properties of galaxies inhabiting lower-density environments. Finally, Sect. 5 presents a summary of our results. Throughout this paper all magnitudes, including those in the infrared (IR), are presented in the AB system (Oke & Gunn 1983;Fukugita et al. 1996). All equivalent width (EW) measurements are presented in the rest frame with negative EWs defined as features observed in emission. Unless otherwise noted, distances are given in proper rather than comoving units. We adopt a concordance

Observations
To date, the Cosmic Evolution Survey (COSMOS) field ([α J2000 , δ J2000 ] = [10:00:28.6,+02:12:21.0]) is arguably the most wellstudied patch of the entire sky. It has been observed at wavelengths which span nearly the entirety of the electromagnetic spectrum, a large fraction of which we utilize in this paper. In Appendix A we summarize the various datasets available in the COSMOS field.

Spectroscopic data
The primary impetus for the current study comes from the vast spectroscopic data available in the COSMOS field, with a nearly exclusive reliance on recent VIMOS spectroscopic observations taken as part of VUDS. We therefore begin here with a brief discussion of the spectroscopic survey whose data are used for this study.

The VIMOS Ultra-Deep Survey
The observations from which nearly the entirety of our results are derived were drawn from VIMOS observations taken for VUDS, a massive 640-h (∼80 night) spectroscopic campaign A77, page 3 of 27 A&A 615, A77 (2018) reaching extreme depths (i < ∼ 25) over 1 • in COSMOS, the 02h field of the VIMOS Very Large Telescope Deep Survey (VVDS-02h) 1 , and the Extended Chandra Deep Field South (E-CDF-S). The design, goals, and survey strategy of VUDS are described in detail in Le  and are thus described here only briefly.
The primary goal of the VUDS survey is to measure the spectroscopic redshifts of a large sample of galaxies at redshifts 2 < ∼ z < ∼ 6. To this end, target selection was performed primarily through photometric redshift cuts, occasionally supplemented with a variety of magnitude and color−color criteria. These selections were used primarily to maximize the number of galaxies with redshifts likely in excess of z > ∼ 2 (see discussion in Le  and, in the COSMOS field, to complement the largely color-color selected zCOSMOS-Deep sample. This selection has been used to great effect, as a large fraction (72.4%) of the galaxies with secure spectroscopic redshifts (see below) in VUDS-COSMOS are at z > 2. The main novelties of the VUDS observations are the depth of the spectroscopy, the large wavelength coverage that is afforded by the 50 400 s integration time per pointing and per grating with the low-resolution blue and red gratings on VIMOS (R ∼ 230), and the large sky area covered by the survey. The combination of wavelength coverage and depth along with the high redshift of the sample allows not only for spectroscopic confirmation of the LAE galaxies, galaxies which dominate other high-redshift spectroscopic samples, but also for redshift determination from Lyman series and interstellar medium (ISM) absorption in those galaxies that exhibit no or weak emission line features. Thus, the VUDS data allow for a selection of a spectroscopic volume-limited sample of galaxies at redshifts 2 < ∼ z < ∼ 6, a sample that probes as faint as ∼M * FUV at the redshifts of interest for the study presented in this paper .
The flagging code for VUDS is discussed in Le Fèvre et al. (2015). For some of the analysis presented in the paper, we adopt spectroscopic flags = X2, X3, & X4, where X = 0-3 2 , for which the probability of the redshift being correct is > ∼ 75% (hereafter "secure spectroscopic redshifts"). For other portions of this paper we will take a statistical approach incorporating the likelihood of each spectroscopic redshift rather than relying on binary logic (see Sect. 3.3). In total, spectra of 4303 unique objects were obtained as part of VUDS in the COSMOS field, with 2687 of those resulting in secure spectroscopic redshifts (a ∼60% redshift success rate, a value consistent with that of the full survey). The spectroscopic sampling of VUDS in the COSMOS field is uniform in that no VUDS pointing appreciably overlaps another in the field.
While we could, in principle, use galaxies targeted by VUDS in the other two fields of the survey (VVDS-02h and E-CDF-S), effectively doubling the control sample used in this study, we choose not to for two reasons. The first is that the COSMOS photometry is more varied and, in general, deeper than that of the other two fields. Our lack of need for excess statistical power of the control sample outweighs the potential for adding in any secondary effects due to unseen bias due to 1 Depending on the author, this field can also be referred to as the first field (D1) of the Canada-France-Hawaii Telescope Legacy Survey (CFHTLS-D1). 2 X = 0 is reserved for target galaxies, X = 1 for broadline AGN, X = 2 for non-targeted objects that fell serendipitously on a slit at a spatial location separable from the target, and X = 3 for those non-targeted objects that fell serendipitously on a slit at a spatial location coincident with the target. For more details on the probability of a correct redshift for a given flag, see Le  inputting different photometry in the spectral energy distribution (SED) fitting procedure. The second and more important reason is that the lack of comparable photometric depth in the other two VUDS fields causes the reconstruction of the density field (see Sect. 3.3), at least at the current level of implementation, to be more suspect for both fields at the redshift of this study (z > ∼ 4.5), meaning we cannot robustly discriminate between lowand moderate-density environments at these redshifts. For further discussion of the survey design, observations, reduction, redshift determination, and the properties of the full VUDS sample, see Le . See also (Tasca et al. 2017) for the first VUDS data release, which is available through the Centre de donnéeS Astrophysiques de Marseille (CeSAM) database 3 .

Other spectroscopic data
In order to maximize the effectiveness of our search for overdensities in the region of the COSMOS field covered by VUDS, we additionally drew spectroscopic redshifts from the zCOSMOS-Bright 4 (Lilly et al. 2007(Lilly et al. , 2009) and zCOSMOS-Deep (Lilly et al., in prep., Diener et al. 2013, 2015 surveys. Accounting for duplicate objects where a more secure spectroscopic redshift was available from VUDS, a total of 19485 secure spectroscopic redshifts for unique objects are available from zCOSMOS of which 2034 are at z > 2 and only a small percentage (∼1%) reach z > 3. A small number of additional redshifts were also taken from Casey et al. (2015), Chiang et al. (2015), and Diener et al. (2015) at z ∼ 2.5, all of which were considered secure. While we include mention of the additional redshifts of these surveys here to demonstrate the precision and accuracy of our photometric redshifts (see following section) and to contextualize the main subject of this paper in terms of the full search for overdensities (see Sect. 3.3), it is important to note that none of these galaxies enter into the main portion of the analysis presented in this paper (i.e., 4 ≤ z ≤ 5) and, thus, differential selection effects resulting from different targeting strategies are irrelevant.
The variety of other spectroscopic observations taken across the COSMOS field (see Ilbert et al. 2013 for a review) were not incorporated into this analysis either because they were at redshifts that are too low to be pertinent to this study or the redshifts were not public at the time of publishing. The one exception is the DEep Imaging and Multi-Object Spectrometer (DEIMOS; Faber et al. 2003) campaign undertaken in the COSMOS field by Capak et al. (2011a) targeting luminous galaxies at 4.5 < z < 6.5. However, these observations are offset too far from the main target in this study, the closest slit being located 7.2 (∼3 Mpc) away from the center of our target (see Sect. 3.1). While these observations, which contain a roughly equivalent number of z > 4 galaxies as VUDS-COSMOS, could be, in principle, useful for bolstering the high-redshift (coeval) field sample defined in Sect. 4.2.1 concerns over differential bias between this campaign and that of VUDS far outweigh the gain in sample size. As such, we do not include these observations as part of this study.

Synthetic model fitting
Despite the high density and immense depth of the spectroscopic coverage in the COSMOS field, the majority of the objects in the field that are detectable to the depth of our imaging data were not targeted with spectroscopy. For these objects, information can only be obtained through fitting to their SEDs in the observed-frame optical/near-infrared (NIR) broadband photometry. For this study we adopt four different forms of SED fitting. The four methods are used in complementary fashion throughout the paper and the results from each method are compared internally expiating any relative bias.

Photometric redshifts
To estimate photometric redshifts for objects with redshifts left unconstrained from spectroscopic observations, we draw from the fitting performed on the most recent version of the COSMOS2015 catalog (v1.3, Laigle et al. 2016) which employs the use of LE PHARE 5 (Arnouts et al. 1999;Ilbert et al. 2006Ilbert et al. , 2009) on point-spread function (PSF)-matched photometry from far-ultraviolet (FUV) to [8.0]. Photometric redshifts, used as priors for a second round of fitting and magnitudes, originally estimated from 3 apertures, are corrected to total magnitudes following the method of Moutard et al. (2016) and used to derive associated physical quantities, for example, stellar masses, mean luminosity-weighted stellar ages, extinctions, and SFRs. The parameters used for deriving these quantities are identical to those given in Lemaux et al. (2014a) and to those used for the following two methods in this section. For further details on this process as well as the assumptions and parameterization input for the SED-fitting process, see Laigle et al. (2016).
In Fig. 1 we show a comparison of the photometric redshifts derived from the COSMOS2015 catalog with the cut given above and the associated secure spectroscopic redshifts. The normalized median of the absolute deviations, σ N MAD (Hoaglin et al. 1983), for the full sample (21781 objects) along with the σ N MAD , the median photo-z offset (∆ z /(1 + z s )), and the catastrophic outlier rate (|z p − z s |/(1 + z s ) > 0.15; η) for 4 < z spec < 5 (131 galaxies) are shown in the main panel of Fig. 1. If we instead adopt an alternative approach sometimes used to estimate photometric redshift precision and fit a Gaussian to the (z phot − z spec )/(1 + z spec ) distributions we recover σ z/(1+z) = 0.008 and 0.0145 for the full sample and the subsample at 4 < z < 5, respectively. These numbers are consistent with previous estimates of photometric redshift precision in the COSMOS field at similar redshifts (e.g., Smolčić et al. 2017a).
Throughout the paper we conservatively adopt the σ N MAD estimate as the formal uncertainty on photometric redshifts. This estimate, however, is still likely to be a lower limit to the true spread in the photo-z population selected in this study for the following reasons. As the accuracy and precision of photometric redshifts is a function of magnitude (see e.g., Ilbert et al. 2006Ilbert et al. , 2009, the difference of more than a magnitude between the median [3.6] magnitude of the spectral sample at 4 < z spec < 5 and the [3.6]-limited z phot sample selected in this study at similar redshifts (see below), 23.3 versus 24.4, respectively, is disconcerting. While we attempt to mitigate any effects of underestimating the uncertainties by using 1.5σ N MAD in instances where we use the global uncertainty, impose an additional K s < 24.0 criterion on all z phot objects used to generate maps for this study (which forces the median [3.6] magnitude of z phot objects at these redshift to a value similar to that of the spectral sample (23.2 vs. 23.3, respectively)), and adopt methods that rely on individual z phot errors (see Sect. 3.3), this caveat should be kept in mind throughout the study. An additional important subtlety of this analysis is understood through examining the spectroscopic redshift distribution of galaxies with 4 < z phot < 5 and secure spectroscopic redshifts, which are almost always 5 http://cfht.hawaii.edu/~arnouts/LEPHARE/lephare.html A&A proofs: manuscript no. AA/2017/30870 cluded from the fitting due to relatively large (∼ 1 − 2.5 mag) offsets seen in these bands in ∼30% of all VUDS-COSMOS galaxies with respect to the best-fit model estimated without the use of these bands. We note that this issue appears limited to the Capak et al. (2007) catalog and does not apply to fitting performed with the COSMOS2015 photometry. The process of PSF-homogenizing all UV/optical/ground-based NIR images, source detection, the inclusion of the Spitzer/IRAC data, and the conversion of all magnitudes to "pseudo-total" magnitudes is described in detail in Ilbert et al. (2013). Details of this fitting including the input parameters and the effect of various assumptions made for this fitting are given in Lemaux et al. (2014a).
For some analyses, a similar fitting was performed on VUDS rest-frame near-ultraviolet (NUV) spectra in conjunction with identical photometry as in the Le Phare analysis using GOS-SIP+, a modified version of the Galaxy Observed-Simulated SED Interactive Program (GOSSIP; Franzetti et al. 2008). The details of the modifications made for GOSSIP+ as well as the fitting process as it pertains to VUDS spectra and the advantages of the fitting over traditional photometric SED fitting, particularly in regards to estimating galaxy ages, are given in Thomas et al. (2017a,b). Both methodologies, Le Phare and GOSSIP+, use some form of a χ 2 ν minimization in order to recover the bestfit model from which the physical parameters are estimated. In addition, both methodologies employ a nearly identical set of input parameters including Bruzual & Charlot (2003) (hereafter BC03) stellar templates, models generated from exponentially declining and delayed star formation histories (SFHs), dust extinctions, and a Chabrier (2003) initial mass function (IMF). For a full listing of the parameters adopted for the fitting see Lemaux et al. (2014a) and Thomas et al. (2017a).
The final method employs the three-component SED-fitting code SED3FIT 6 (Berta et al. 2013), which combines the emission from stars, dust heated by star formation, and a possible AGN emission component. The fiducial package of galaxy templates is taken from the Multi-wavelength Analysis of Galaxy PHYSical properties (MAGPHYS; da Cunha et al. 2008) code, which relies on an energy balance between the dust-absorbed stellar continuum and the reprocessed emission in the mid-to far-IR. Stellar templates are taken from the BC03 library, whose output SEDs are modulated by the effects of dust attenuation and the stellar heating of dust (Charlot & Fall 2000;da Cunha et al. 2008). To this fiducial set of models, the SED3FIT code implements a set of libraries which include both an accretion disc component and a warm dust component surrounding the AGN in a smooth toroidal structure (Feltre et al. 2012; see also Fritz et al. 2006) following the method of (Berta et al. 2013). For each galaxy in our sample, we run both the SED3FIT and the MAG-PHYS codes to the full COSMOS2015 photometry (Laigle et al. 2016), from FUV to 500 µm, using a common set of galaxy templates. The results of these two runs are then compared statistically (see §4.2.2) to estimate the relative contribution of various components to the global UV-IR SED (see Delvecchio et al. 2014 for details). Though the Herschel SPIRE/PACS data for the COSMOS field are relatively shallow for individual galaxies, stacked photometry of various galaxy sub-samples in this paper allows for the placing of meaningful constraints on both AGN activity and the amount of obscured star formation (see §4.2.2 is not generally known a priori. A scale bar indicates the density of objects in each region of the phase space. Filled red diamonds denote VUDS z spec members of PCl J1001+0220. The σ N MAD for the whole sample and for 4 < z < 5 along with the median offset (∆ z = (z phot − z spec )/(1 + z spec )) and the percentage of catastrophic outliers (|z phot − z spec |/(1 + z spec ) > 0.15) for the latter redshift range are shown in the main panel. Residuals are shown in the bottom panel. We note that the spectroscopic sample at z > 4 has a median [3.6] magnitude nearly one magnitude brighter (∆m = 0.85) than the z phot sample adopted in this study meaning that these values are likely lower limits. We note also that all known catastrophic outliers at these redshifts result from the SED fitting confusing the Lyman break for the Balmer break and not the reverse, thus resulting in what is likely a pure but incomplete sample at 4 < z < 5. At z < 2 the two zCOSMOS samples dominate over VUDS providing 96.0% of all secure spectroscopic redshifts. At 2 ≤ z ≤ 3 the combined zCOSMOS samples and the VUDS sample are roughly matched in number with ∼2000 spectroscopic redshifts each. At higher redshifts, z > 3 and z > 4, the VUDS sample dominates providing 83.9% and 97.3% of the secure spectroscopic redshifts, respectively.
3. Discovery of a z ∼ 4.57 proto-structure in the

COSMOS field
In this Section we describe the discovery and characterization of the highest redshift overdensity seen by VUDS. We begin by focusing exclusively on the spectroscopic overdensity, as it was in these data that the proto-structure was first detected. However, the observed characteristics and magnitude of the overdensity of VUDS proto-structures are sensitive to a variety of different aspects of the distribution of the VUDS targets spatially, in galaxy type, and in brightness. As such, we additionally classify this overdensity through the use of photometric redshifts and by combining photometric and spectroscopic redshifts in a probabilistic manner. These methods paint a coherent picture of a highly significant overdensity assembling in the early universe. A scale bar indicates the density of objects in each region of the phase space. Filled red diamonds denote VUDS z spec members of PCl J1001+0220. The σ N MAD for the whole sample and for 4 < z < 5 along with the median offset (∆ z = (z phot − z spec )/(1 + z spec )) and the percentage of catastrophic outliers (|z phot − z spec |/(1 + z spec ) > 0.15) for the latter redshift range are shown in the main panel. Residuals are shown in the bottom panel. We note that the spectroscopic sample at z > 4 has a median [3.6] magnitude nearly one magnitude brighter (∆m = 0.85) than the z phot sample adopted in this study meaning that these values are likely lower limits. We note also that all known catastrophic outliers at these redshifts result from the SED fitting confusing the Lyman break for the Balmer break and not the reverse, thus resulting in what is likely a pure but incomplete sample at 4 < z < 5. At z < 2 the two zCOSMOS samples dominate over VUDS providing 96.0% of all secure spectroscopic redshifts. At 2 ≤ z ≤ 3 the combined zCOS-MOS samples and the VUDS sample are roughly matched in number with ∼2000 spectroscopic redshifts each. At higher redshifts, z > 3 and z > 4, the VUDS sample dominates providing 83.9% and 97.3% of the secure spectroscopic redshifts, respectively.
(77.1% of the time) at 4 < z spec < 5. In all (24) cases where a galaxy is at 4 < z spec < 5 and the photometric redshift estimation failed catastrophically, the Lyα/Lyman λ912 Å break was mistaken for the Balmer/4000 Å break placing the galaxy at lower (z ∼ 0.7) redshifts (see also discussion in Capak et al. 2011a andin Sect. 3.5 of Le Fèvre et al. 2015). The directionality of these failures implies that the photometric redshift sample used at these redshifts, while somewhat incomplete, has a high level of purity at least for those galaxies that we are able to test with our spectroscopy. For analyses presented in this paper related to galaxy evolution we use only those objects with a ≥ 3σ detection in [3.6] ([3.6] < 25.4), cuts which apply both to spectroscopic and photometric objects. The median number of filters used in the fitting for the samples presented in this paper for which this cut is applied is 28 and the minimum number is 10. For the study of the properties of galaxies in different environments, such a cut is preferable to a, for example, K s -selected sample for a variety of reasons. The primary redshifts of interest in this study are in the range 4 < z < 5, a redshift range for which the K s filter begins to be sensitive primarily to rest-frame wavelengths blueward A77, page 5 of 27 A&A 615, A77 (2018) of the Balmer/4000 Å break. While the entire COSMOS2015 catalog is selected by a z ++ Y JHK s "chi-squared" image (Szalay et al. 1999), the added requirement of a significant IRAC detection imposes that the sample selected at these redshifts be roughly stellar mass limited and minimally affected by a star-formation driven Malmquist bias. Additionally, it was shown by Caputi et al. (2015) that IRAC bright, K s -faint sources ([4.5] < 23, K s > 24) comprise ≥50% of galaxies with large stellar masses (log(M * /M ) ≥ 10.75) at these redshifts, a phenomenon we have verified to hold within the photometric redshift range adopted in this paper. Such galaxies would be missed if we had instead opted for a K s -selected sample which would have resulted in a ∼35% incompleteness at these stellar masses.
For analyses that involve mapping of the field either in photometric redshifts or through a combination of photometric and spectroscopic redshifts (see Sect. 3.3) we instead opt for a K s < 24 and [3.6] < 25.4 sample. Such a cut allows us to keep a majority of high-stellar-mass galaxies in this redshift in the sample while forcing the average brightness of objects with 4 < z phot < 5 to those similar to the spectroscopic sample such that we can reasonably apply the statistics derived in this section to this sample. Additionally, this cut yields a spectroscopic sampling rate (i.e., the number of objects targeted by all surveys versus the total number of objects) within the area covered by VUDS to a value roughly twice that of a [3.6]-limited sample (16.3 vs. 9.8%) and at ∼10% for objects in the magnitude range which place them as potential members (21 < [3.6] < 25.4). This point will be especially important when we statistically combine the z spec and z phot samples in Sect. 3.3. It should be noted, however, that all analyses presented in this study are relatively unaffected by making cuts which are less well motivated for both portions of the analysis as long as some sort of reasonable K s is applied to the galaxy sample used to make various maps.

Estimation of physical parameters
For galaxies with secure spectroscopic redshifts, we used three different methods to derive associated parameters. The first was to use the package LE PHARE in a method identical to the one described in Lemaux et al. (2014a). For this fitting we drew upon v2.0 of the PSF-matched photometry given in Capak et al. (2007) using all bands blueward and including Spitzer/IRAC [4.5]. The Spitzer/IRAC cryogenic bands ([5.8]/[8.0]) were excluded from the fitting due to relatively large (∼1-2.5 mag) offsets seen in these bands in ∼30% of all VUDS-COSMOS galaxies with respect to the best-fit model estimated without the use of these bands. We note that this issue appears limited to the Capak et al. (2007) catalog and does not apply to fitting performed with the COSMOS2015 photometry. The process of PSF-homogenizing all UV/optical/ground-based NIR images, source detection, the inclusion of the Spitzer/IRAC data, and the conversion of all magnitudes to "pseudo-total" magnitudes is described in detail in Ilbert et al. (2013). Details of this fitting including the input parameters and the effect of various assumptions made for this fitting are given in Lemaux et al. (2014a).
For some analyses, a similar fitting was performed on VUDS rest-frame near-ultraviolet (NUV) spectra in conjunction with identical photometry as in the LE PHARE analysis using GOSSIP+, a modified version of the Galaxy Observed-Simulated SED Interactive Program (GOSSIP; Franzetti et al. 2008). The details of the modifications made for GOSSIP+ as well as the fitting process as it pertains to VUDS spectra and the advantages of the fitting over traditional photometric SED fitting, particularly in regards to estimating galaxy ages, are given in Thomas et al. (2017a,b). Both methodologies, LE PHARE and GOSSIP+, use some form of a χ 2 ν minimization in order to recover the best-fit model from which the physical parameters are estimated. In addition, both methodologies employ a nearly identical set of input parameters including Bruzual & Charlot (2003; hereafter BC03) stellar templates, models generated from exponentially declining and delayed star formation histories (SFHs), dust extinctions, and a Chabrier (2003) initial mass function (IMF). For a full listing of the parameters adopted for the fitting see Lemaux et al. (2014a) and Thomas et al. (2017a).
The final method employs the three-component SED-fitting code SED3FIT 6 (Berta et al. 2013), which combines the emission from stars, dust heated by star formation, and a possible AGN emission component. The fiducial package of galaxy templates is taken from the Multi-wavelength Analysis of Galaxy PHYSical properties (MAGPHYS; da Cunha et al. 2008) code, which relies on an energy balance between the dust-absorbed stellar continuum and the reprocessed emission in the midto far-infrared. Stellar templates are taken from the BC03 library, whose output SEDs are modulated by the effects of dust attenuation and the stellar heating of dust (Charlot & Fall 2000;da Cunha et al. 2008). To this fiducial set of models, the SED3FIT code implements a set of libraries which include both an accretion disc component and a warm dust component surrounding the AGN in a smooth toroidal structure (Feltre et al. 2012; see also Fritz et al. 2006) following the method of (Berta et al. 2013). For each galaxy in our sample, we run both the SED3FIT and the MAGPHYS codes to the full COSMOS2015 photometry (Laigle et al. 2016), from FUV to 500 µm, using a common set of galaxy templates. The results of these two runs are then compared statistically (see Sect. 4.2.2) to estimate the relative contribution of various components to the global UV-IR SED (see Delvecchio et al. 2014 for details). Though the Herschel SPIRE/PACS data for the COSMOS field are relatively shallow for individual galaxies, stacked photometry of various galaxy sub-samples in this paper allows for the placing of meaningful constraints on both AGN activity and the amount of obscured star formation (see Sect. 4.2.2).

Discovery of a z ∼ 4.57 proto-structure in the COSMOS field
In this section we describe the discovery and characterization of the highest redshift overdensity seen by VUDS. We begin by focusing exclusively on the spectroscopic overdensity, as it was in these data that the proto-structure was first detected. However, the observed characteristics and magnitude of the overdensity of VUDS proto-structures are sensitive to a variety of different aspects of the distribution of the VUDS targets spatially, in galaxy type, and in brightness. As such, we additionally classify this overdensity through the use of photometric redshifts and by combining photometric and spectroscopic redshifts in a probabilistic manner. These methods paint a coherent picture of a highly significant overdensity assembling in the early Universe.

Overdensity as seen by spectroscopy
The initial search for overdensities, generically and nonthreateningly termed "proto-structures" here and throughout this  Fig. 2. Individual rest-frame VUDS spectra of the nine z spec members of the z ∼ 4.57 proto-structure detected in COSMOS with secure spectroscopic redshifts (see §2.1.1 for the meaning of this phrase). The spectra were smoothed with a Gaussian filter of σ=1.5 pixels (∼ 1.5Å in the rest-frame). The ID of each member galaxy along with its spectroscopic redshift, redshift quality flag, and observed i + magnitude are shown on the upper left hand corner of each panel. The typical flux density uncertainty as estimated by the NMAD scatter of each spectrum over the wavelengths plotted here is 1.5×10 −19 ergs s −1 cm −2 Å −1 . Colored dotted lines indicate important spectral features. We highlight that in many cases the Lyα emission feature is absent or an identical redshift would have been recovered even in its absence.
fail catastrophically (see Fig. 1) and, even if correct, to adopt the approach of a binary z phot membership criteria given the large extent in redshift space required by the z phot precision (∆ z = 0.42) is perhaps overly coarse. While we will employ a more complex statistical combination of photometric and spectroscopic redshifts in the following Section to quantify the overdensity, we attempt one other method here to determine the genuineness of the z phot overdensity. The full probability distribution functions (PDFs) generated by Le Phare for each z phot member were reconstructed using the effective uncertainties in the z phot values and, when necessary, those of secondary peaks in the z phot PDF which exceeded 5% of the overall probability density. These reconstructed PDFs were combined into a single composite PDF shown in the bottom right panel of Figure 4 as a green filled histogram. Simultaneously, the same process was performed for all objects in the COSMOS2015 catalog, subject to the criteria set in §2.2, with z phot = 4.57 ± 1.5σ N MAD × (1 + 4.57) but outside of the projected spatial cut (coeval z phot field). The combined distribution of these objects is shown in the bottom right panel of Figure  4 as the blue hashed histogram. The combined PDF of the coeval z phot field sample appears roughly flat from 4.35 ≤ z phot ≤ 4.67, a redshift range which contains ∼85% of relatively peaked z phot measurements (σ z phot < 0.3) in this sample and ∼80% of all objects in the coeval z phot field. In stark contrast, the combined PDF of z phot members has its largest peak within the spectroscopic redshift bounds of the proto-structure. Indeed, by this analysis z phot members are nearly twice as likely as coeval z phot field ob-jects to be within the true redshift bounds of the proto-structure, 16% versus 9%, respectively. Such a result strongly suggests that the overdensity observed amongst the z phot objects at the location of the proto-structure is genuine.

Overdensity as measured by monte-carlo voronoi tessellation
We have now determined unequivocally that an overdensity is observed in both spectroscopic and photometric redshifts in the area encompassing PCl J1001+0220 11 However, both of these methods have their limitations. In the case of the former, though accurate high-precision measurements are made, only ∼10% of potential (z phot ) members were targeted for spectroscopy, with the targeting having a complex dependence on galaxy type, the level of current star formation, and intervening structure. For example, because VUDS generally targets objects with z phot > 2.4 − 1σ z phot , a true projected dearth of galaxies along the LOS at lower (z ∼ 2 − 4) redshifts means a higher chance of targeting higher-redshift galaxies and, thus, finding a proto-structure like PCl J1001+0220. Conversely, while the z phot methods presented in the previous Section allow for an essentially stellarmass-limited sample largely independent of galaxy type, though subject to the caveats discussed in §2.2.1, estimates are model Article number, page 8 of 27 Fig. 2. Individual rest-frame VUDS spectra of the nine z spec members of the z ∼ 4.57 proto-structure detected in COSMOS with secure spectroscopic redshifts (see Sect. 2.1.1 for the meaning of this phrase). The spectra were smoothed with a Gaussian filter of σ = 1.5 pixels (∼1.5 Å in the rest-frame). The ID of each member galaxy along with its spectroscopic redshift, redshift quality flag, and observed i + magnitude are shown on the upper left hand corner of each panel. The typical flux density uncertainty as estimated by the NMAD scatter of each spectrum over the wavelengths plotted here is 1.5 × 10 −19 ergs s −1 cm −2 Å −1 . Colored dotted lines indicate important spectral features. We highlight that in many cases the Lyα emission feature is absent or an identical redshift would have been recovered even in its absence. study, in the COSMOS field took an almost identical form to that described in Lemaux et al. (2014a). We briefly describe the process here. The 4895 galaxies in the COSMOS field with secure spectroscopic redshifts at z > 1.5 available to us were combined into a single catalog. In the case of duplicates, we gave preference to the redshifts with the most secure flag, or, in the case of equal flags, to VUDS redshifts. This catalog was used to generate spectral density maps of the entire COS-MOS field from 1.5 ≤ z ≤ 5 in 25 Mpc slices applying the nearest-neighbor method of Gutermuth et al. (2005). These maps were searched by eye for overdensities and all those overdensities which contained seven galaxies with concordant redshifts (i.e., in the same slice) within R pro j ≤ 2 Mpc were considered as proto-structures. For each proto-structure, new maps were generated iteratively decreasing the redshift range until the overdensity was maximized. Following the generation of the final maps, luminosity-weighted and unit-weighted z spec member centers were determined iteratively following the method of Ascaso et al. (2014) using R pro j ≤ 2 Mpc and the K s or [3.6] bands for luminosity weighting. In total, 26 such proto-structures were found in the COSMOS field rank ordered in increasing redshift, of which one of the most significant was reported in Cucciati et al. (2014). The highest redshift of these and also one of the most significant in the entire COSMOS field, a proto-structure at z ∼ 4.57 spanning 7.5 Mpc 7 along the LOS and containing nine z spec member galaxies (see Fig. 2), serves as the subject of this paper. 7 Equivalent to ∆v ∼ 3800 km s −1 at these redshifts.  . Spectroscopic overdensity of the z ∼ 4.57 proto-structure as seen by VUDS. The black histogram shows the incidence of galaxies with secure spectroscopic redshifts falling within a "filter" of size given at the top of the plot for 1000 trial observations of the COSMOS field. For each observation the filter is placed at a random (uniformly generated) spatial location over the VUDS footprint and given a random (uniform) redshift center in the range 4 < z < 5. The filter size is chosen to be identical to that adopted for this proto-structure. The green solid line is a Poissonian fit to the histogram with best-fit parameters given to the right of the line. The number of z spec members of the proto-structure, N PS , is denoted by the vertical dashed line. The spectroscopic overdensity (δ gal = (N PS − µ)/µ) and detection significance (σ PC = (N PS − µ)/σ) of the proto-structure is listed to the right of the line. dependent and have moderate accuracy and coarse precision at these redshifts. A method which combines both samples statistically can, in principle, help to mitigate the shortcomings of each flagging allowed these me ceeded, was repl the z phot able, the from an respond Voro at each r of the ce Mpc or between tween su by rando ization o ple the u at each g the inver that encl Σ V MC , fo puted by Voronoi then com whereΣ the map width to employe of proto spectros level pre field esti resembli Fig. 3. Spectroscopic overdensity of the z ∼ 4.57 proto-structure as seen by VUDS. The black histogram shows the incidence of galaxies with secure spectroscopic redshifts falling within a "filter" of size given at the top of the plot for 1000 trial observations of the COSMOS field. For each observation the filter is placed at a random (uniformly generated) spatial location over the VUDS footprint and given a random (uniform) redshift center in the range 4 < z < 5. The filter size is chosen to be identical to that adopted for this proto-structure. The green solid line is a Poissonian fit to the histogram with best-fit parameters given to the right of the line. The number of z spec members of the proto-structure, N PS , is denoted by the vertical dashed line. The spectroscopic overdensity (δ gal = (N PS − µ)/µ) and detection significance (σ PC = (N PS − µ)/σ) of the proto-structure is listed to the right of the line.  Median redshiftz = 4.568 Spectral overdensity δ gal = 17.0 ± 6.2, σ Proto-Struct . = 12.0 Photo-z overdensity σ z phot ,LS S = 4.7 b (20.7) Voronoi average overdensity δ gal = 3.30 ± 0.32 Galaxy velocity dispersion σ v = 1037.6 ± 177.9 km s −1 Notes. (a) This number refers to all spectroscopically confirmed members with R pro j < 2 Mpc and 4.53 < z spec < 4.60. (b) The number outside the parentheses refers to the formal significance of the detection after accounting for spurious density peaks (see Sect. 3).
The method to determine the significance of the spectral overdensity of each of these proto-structures was identical to that of Lemaux et al. (2014a) and Smolčić et al. (2017a). Briefly, for each proto-structure, a volume equivalent to that used to define that proto-structure, that is, "filter", was randomly placed at 1000 spatial locations securely (≥ 2 Mpc) inside the borders of the VUDS+zCOSMOS spatial coverage 8 , avoiding gaps in coverage, and at a random central redshift. Because of the rapidly varying selection and effectiveness of the zCOSMOS and VUDS surveys as a function of redshift, random central redshifts for the filters were limited to certain redshift ranges depending on the redshift of the proto-structure: 1.5 ≤ z ≤ 2, 2 ≤ z ≤ 3, 3 ≤ z ≤ 4, or 4 ≤ z ≤ 5. The angular spatial constraints listed above ensure that the density of spectroscopic targeting varies by less than a factor of two over the entirety of the area sampled, a negligible variation for this exercise. For each realization, galaxies in the spectroscopic catalog falling within the filter were counted, and the resulting distribution was fit to a Poissonian or Gaussian function, the former being used when the average number of galaxies falling within the filter was small.
This distribution and the resulting fit for the z ∼ 4.57 protostructure is shown in Fig. 3 and the resulting overdensity values given in Table 1. While there exist large formal uncertainties in the magnitude of the calculated spectral overdensity, δ gal = 17.0 ± 6.2, where δ gal ≡ (N PS − µ)/µ, the overdensity is highly significant, representing a 12σ fluctuation of the spectroscopic density field, that is, σ PS ≡ (N PS − µ)/σ, where µ and σ are the mean and standard deviation, respectively, associated to the Poissonian fit performed above and N PS is the number of spectroscopically confirmed proto-structure members. Such an overdensity is even more impressive given that the VUDS spectroscopic coverage does not continue eastward of RA ∼ 150.4, meaning only ∼70% of the area bounded by R pro j ≤ 2 Mpc from the number-weighted z spec member center of the proto-structure was covered by VUDS. Given the relatively small number of confirmed z spec members, the non-uniform VUDS coverage, the sparsity of the coeval field, and the complicated nature of the VUDS selection at these redshifts, we refrain from pushing these values further and wait for further context presented later in this section. 8 At z > 4 the spatial locations were limited to the spatial coverage of VUDS as essentially no galaxies with secure spectroscopic redshifts from zCOSMOS exist at these redshifts.

Overdensity as seen by photometric redshifts
In Fig. 4 we show the sky location of the z spec members of the z ∼ 4.57 proto-structure plotted against the backdrop of a density map of objects with photometric redshifts (hereafter z phot ) from the COSMOS2015 catalog, consistent with that of the proto-structure, that is, z phot ∼ 4.57 ± 1.5σ N MAD × (1 + 4.57), subject to the criteria given in Sect. 2.2 and following the nearest-neighbor method of Gutermuth et al. (2005). In order to determine the level and significance of the overdensity, if any, SExtractor (Bertin & Arnouts 1996) was run on the full COSMOS density map at the redshift of the proto-structure along with those at a variety of other redshifts. All peaks in the z phot density map that formally exceeded 5σ detections were cataloged. Spurious peaks were identified as those that had the requisite spectroscopic coverage but which lacked a corresponding spectroscopic overdensity at ≥3σ PS , where σ PS is defined earlier in this section. As in Lemaux et al. (2014a), a Gaussian was fit to the distribution of SExtractor significances of spurious peaks 9 , with the resulting parameters used to estimate the true (spurious-corrected) significance of measured overdensities. Here, and throughout this section, we expand the projected area considered to be part of the proto-structure to 3 Mpc. While the results presented in this section do not change appreciably if we instead consider the region which bounds the spectroscopic overdensity (R pro j ≤ 2 Mpc), we choose a larger radius here because it is better matched to the spatial size of protostructures both in simulations (e.g., Chiang et al. 2013;Muldrew et al. 2015;Contini et al. 2016;Orsi et al. 2016) and in observations (e.g., Lemaux et al. 2014a;Dey et al. 2016;Toshikawa et al. 2014Toshikawa et al. , 2016 and due to our ignorance of the true center of the proto-structure 10 . Furthermore, the absence of VUDS observations eastward of α J2000 ∼ 150.40 • means that our spectroscopy certainly does not probe the full extent of the proto-structure. Thus, we take here an inclusive approach. There appears both a significant raw (20.7) and spurious-corrected (4.7) overdensity in the region centered on the z ∼ 4.57 VUDS-COSMOS proto-structure. It is important to note this overdensity is not 9 Photometric redshift peaks were only considered spurious if the peak fell within the spatial coverage of full spectroscopic catalog and failed to show a significant spectral overdensity. 10 Though all of these arguments equally apply to the spectral sample, in practice, partially due to lack of VUDS coverage in the eastern portion of the proto-cluster, the spectral member sample remains unchanged if we instead enforce R pro j ≤ 3 Mpc for that sample.   Fig. 4. Sky plot of the highest redshift proto-structure detected from VUDS spectroscopy (PCl J1001+0220). Plotted in the background is the smoothed (2.5 pixels FWHM) density map for all objects with z phot = 4.57 ± 1.5σ N MAD × (1 + 4.57) with the scale in gal/Mpc indicated by the color bar to the right. Additionally, all galaxies from VUDS+zCOSMOS with z > 2 with secure spectroscopic redshifts are plotted in the background as small gray dots. The density of these points drops precipitously to the east of the proto-cluster center owing to a lack of VUDS coverage of this area. Galaxies with secure and less secure (see §2.1) spectroscopic redshifts consistent with the redshift range of the proto-structure (4.53 < z < 4.60) are represented by filled circumscribed circles and Xs, respectively. Blue and red symbols differentiate galaxies at the redshift of the proto-structure by their rest-frame M NUV − M r ′ colors (younger and older than 200 Myr, respectively, see §4.2.1) and are logarithmically scaled (log 4 ) by their stellar mass. The dashed gray and black circles (R pro j = 2 and 3 Mpc from the proto-structure center, respectively) set the boundaries for z spec and z phot members, respectively. Plotted on the bottom right is a histogram of all secure spectroscopic redshifts within R pro j < 2 Mpc of the proto-structure center. Plotted in the background is the combined PDF of z phot members (green filled) and non-members (dark blue hatched) with redshifts consistent with the redshift of the proto-structure. median density of the full slice as measured in all 500 realizations. We adopt this overdensity value and its associated uncertainty as being the most reliable estimate of the galaxy overdensity of PCl J1001+0220. Because this value exceeds the value for equivalent galaxy populations of nearly all simulated protoclusters at these redshifts when viewed over an equivalent volume, (∼21 comoving Mpc 3 , see Table 4 of Chiang et al. 2013), here and for the rest of the paper we abandon the term protostructure when referring to the PCl J1001+0220 proto-cluster. It is also perhaps revelant to note that at this galaxy overdensity, assuming the galaxy bias value given in §4.1 and pressureless spherical collapse, the entire proto-cluster as defined over the volume above collapses well before z ∼ 0. Each overdensity value and central coordinate computed in this Section, as well as a variety of other information on the PCl J1001+0220 protocluster, is given in Table 1.

Properties of the PCl J1001+0220 proto-cluster
With the identity of the PCl J1001+0220 proto-cluster now firmly established, in this section we attempt to contextualize this proto-cluster and to make a cursory exploration of the properties of its member galaxies using all the information at our disposal.

Weighing the PCl J10001+0220 proto-cluster
At lower redshifts (z ≤ 1.5), a clear correlation exists between the total mass of a structure and the evolutionary state of its constituent galaxy population (e.g., Moran et al. 2007;Poggianti et al. 2008Poggianti et al. , 2009Hansen et al. 2009;Lubin et al. 2009;Lemaux et al. 2012;van der Burg et al. 2014;Balogh et al. 2016). While dispersion in this correlation exists, and while the evolutionary state of a galaxy population clearly has a complex relationship with other factors either causally or circumstantially connected with global environment, such as local (over)density, stellar mass, and global dynamics, estimating the total mass for a structure is a necessary step in predicting the fate of that structure and its constituent galaxy population. At such redshifts reasonably precise estimates are at least achievable with current technology through strong or weak lensing, X-ray observations, and dynamics measurements from exhaustive spectroscopic campaigns. While widely employed, these estimates necessarily have (relatively) large systematic uncertainties originating from the large number of assumptions required to even attempt a measurement with these data. Furthermore, at the highest redshifts in this redshift range z ∼ 1.5, only the most exquisite data sets can be reasonably employed to estimate total masses, as sparser sampling, differential bias in sampling, and/or lower signal-to-noise ratio (S/N) data makes it impossible to begin to probe the validity of these assumptions.
Article number, page 10 of 27 . Sky plot of the highest redshift proto-structure detected from VUDS spectroscopy (PCl J1001+0220). Plotted in the background is the smoothed (2.5 pixels FWHM) density map for all objects with z phot = 4.57 ± 1.5σ N MAD × (1 + 4.57) with the scale in gal/Mpc indicated by the color bar to the right. Additionally, all galaxies from VUDS+zCOSMOS with z > 2 with secure spectroscopic redshifts are plotted in the background as small gray dots. The density of these points drops precipitously to the east of the proto-cluster center owing to a lack of VUDS coverage of this area. Galaxies with secure and less secure (see Sect. 2.1) spectroscopic redshifts consistent with the redshift range of the proto-structure (4.53 < z < 4.60) are represented by filled circumscribed circles and Xs, respectively. Blue and red symbols differentiate galaxies at the redshift of the proto-structure by their rest-frame M NUV − M r colors (younger and older than 200 Myr, respectively, see Sect. 4.2.1) and are logarithmically scaled (log 4 ) by their stellar mass. The dashed gray and black circles (R pro j = 2 and 3 Mpc from the proto-structure center, respectively) set the boundaries for z spec and z phot members, respectively. Plotted on the bottom right is a histogram of all secure spectroscopic redshifts within R pro j < 2 Mpc of the proto-structure center. Plotted in the background is the combined PDF of z phot members (green filled) and non-members (dark blue hatched) with redshifts consistent with the redshift of the proto-structure.
necessarily redundant evidence of an overdensity of galaxies as VUDS certainly does not target nor detect most L * galaxies at this redshift and the number of galaxies required to create such an overdensity, assuming the catastrophic outlier rate given in Sect. 2.2, considerably exceeds the number of z spec members.
However, despite the use of arguably the best z phot measurements made to date, such measurements are moderately prone to fail catastrophically (see Fig. 1) and, even if correct, to adopt the approach of a binary z phot membership criteria given the large extent in redshift space required by the z phot precision (∆ z = 0.42) is perhaps overly coarse. While we will employ a more complex statistical combination of photometric and spectroscopic redshifts in the following section to quantify the overdensity, we attempt one other method here to determine the genuineness of the z phot overdensity. The full probability distribution functions (PDFs) generated by Le Phare for each z phot member were re-constructed using the effective uncertainties in the z phot values and, when necessary, those of secondary peaks in the z phot PDF which exceeded 5% of the overall probability density. These re-constructed PDFs were combined into a single composite PDF shown in the bottom right panel of Fig. 4 as a green filled histogram. Simultaneously, the same process was performed for all objects in the COSMOS2015 catalog, subject to the criteria set in Sect. 2.2, with z phot = 4.57 ± 1.5σ N MAD × (1 + 4.57) but outside of the projected spatial cut (coeval z phot field). The combined distribution of these objects is shown in the bottom right panel of Fig. 4 as the blue hashed histogram. The combined PDF of the coeval z phot field sample appears roughly flat from 4.35 ≤ z phot ≤ 4.67, a redshift range which contains ∼85% of relatively peaked z phot measurements (σ z phot < 0.3) in this sample and ∼80% of all objects in the coeval z phot field. In stark contrast, the combined PDF of z phot members has its largest peak within the spectroscopic redshift bounds of the protostructure. Indeed, by this analysis z phot members are nearly twice as likely as coeval z phot field objects to be within the true redshift bounds of the proto-structure, 16% versus 9%, respectively. Such a result strongly suggests that the overdensity observed amongst the z phot objects at the location of the proto-structure is genuine.

Overdensity as measured by Monte-Carlo voronoi tessellation
We have now determined unequivocally that an overdensity is observed in both spectroscopic and photometric redshifts in the area encompassing PCl J1001+0220 11 . However, both of these methods have their limitations. In the case of the former, though accurate high-precision measurements are made, only ∼10% of potential (z phot ) members were targeted for spectroscopy, with the targeting having a complex dependence on galaxy type, the level of current star formation, and intervening structure. For example, because VUDS generally targets objects with z phot > 2.4−1σ z phot , a true projected dearth of galaxies along the LOS at lower (z ∼ 2-4) redshifts means a higher chance of targeting higher-redshift galaxies and, thus, finding a proto-structure like PCl J1001+0220. Conversely, while the z phot methods presented in the previous section allow for an essentially stellar-mass-limited sample largely independent of galaxy type, though subject to the caveats discussed in Sect. 2.2.1, estimates are model dependent and have moderate accuracy and coarse precision at these redshifts. A method which combines both samples statistically can, in principle, help to mitigate the shortcomings of each sample.
To this end, we introduce here a modified version of the Voronoi tessellation measure of the density field employed by a variety of other studies in the COSMOS field which rely almost exclusively on photometric redshifts (e.g., Scoville et al. 2013;Darvish et al. 2015;Smolčić et al. 2017a). The method that we employ here most resembles the "weighted Voronoi tessellation estimator" introduced in Darvish et al. (2015). In that study it was found that using this method to recover the underlying density field matched or exceeded the accuracy and precision of all other methods of density estimation. The one metric with comparable performance to the Voronoi approach, weighted adaptive kernel estimation, is sensitive to both the form and size of the kernel and, generally, employs a spatially symmetric kernel (along the transverse dimensions) which is not ideal for the complex transverse shape of proto-structures.
Our version of this method is as follows. Beginning at z = 2 and reaching up to z = 7 in 7.5 Mpc steps (i.e., half the size of a slice), a suite of ten Monte Carlo realizations of the magnitude-cut nearest-neighbor-matched master spectroscopic and COSMOS2015 z phot catalogs were generated for each step. This number of realizations was a compromise between computational intensity and stability of the resultant density estimates. For each Monte Carlo realization, first we selected the spectroscopic sub-sample by drawing from a uniform distribution ranging from 0 to 100 and retaining those z spec measurements where the drawn number exceeded 75, 95, and 99.5 for those measurements with flags X2/X9, X3, and X4, respectively. These thresholds follow the fiducial reliability estimates of the VUDS/zCOSMOS flagging system (see e.g., Le Fèvre et al. 2015). Such a method allowed for the incorporation of the statistical reliability of these measurements. If the predetermined threshold was not exceeded, z spec would be retained for that realization, otherwise it was replaced with the z phot information. For each object where the z phot information was used or was the only information available, the original z phot for that object was perturbed by sampling from an asymmetric Gaussian distribution with σ values that correspond to the lower and upper effective 1σ z phot uncertainties.
Voronoi tessellation was then performed on each realization at each redshift step on objects with redshifts falling within ±7.5 of the central redshift of each bin, that is, a bin width of ∆χ = 15 Mpc or ∆ z ∼ 0.03-0.3 from z ∼ 2-7. The 7.5 Mpc steps between slices along with the slice thickness ensure overlap between successive slices such that we do not miss overdensities by randomly choosing unlucky redshift bounds. For each realization of each slice, a grid of 75 × 75 kpc was created to sample the underlying local density distribution. The local density at each grid value for each realization and slice was set equal to the inverse of the Voronoi cell area (multiplied by D 2 A ) of the cell that encloses the central point of that grid. Final local densities, Σ V MC , for each grid point in each redshift slice are then computed by median combining the values of ten realizations of the Voronoi maps. The local overdensity value for each grid point is whereΣ V MC is the median Σ V MC for all grid points over which the map was defined, that is, excluding a border region of ∼1 in width to mitigate edge effects. In preliminary tests of the method employed for our study we observed realistic mock catalogs of proto-clusters and proto-groups with a combination of deep spectroscopy and photometric redshifts with COSMOS2015level precision and accuracy. It was found that the overdensity field estimated by our method after an identical magnitude cut resembling that applied to our true sample used for the mapping (i.e., K s < 24) falls within <30% of the true (i.e., real space) overdensity field essentially independent of the value of overdensity, the redshift, or the spectroscopic completeness, as long as the latter is in excess of ∼10%, a value similar to that of this study (see Sect. 2.2.1). While reconstruction is still possible when the spectroscopic completeness is lower, the relative accuracy drops immensely, with the reconstructed values deviating by up to ∼200% away from the true value when the sampling drops to ∼3%. The results of these tests will be presented in depth in a future work.
Once a proto-structure was identified, as in the case of PCl J1001+0220, zoom Monte Carlo Voronoi mappings were made by iteratively shifting the redshift bounds to find the values most appropriate to that particular proto-structure. For PCl J1001+0220, these values were found to be identical to the redshift range of the z spec members, 4.53 ≤ z ≤ 4.60. Once the redshift bin was chosen, 500 Monte Carlo realizations were performed for that redshift slice, with the resultant final density and overdensity maps generated using an identical method to the mapping described earlier. The latter of these two maps for PCl J1001+0220 is shown in Fig. 5. Following the generation of these maps for PCl J1001+0220, SExtractor was run on the overdensity maps for the purpose of quantifying both the significance and the extent of the proto-structure using log(1 + δ gal ) > 0.5 as a detection threshold. The resulting detection of PCl J1001+0220 extends over an area of 7.58 Mpc 2 and has an average log( 1 + δ gal ) = 0.63 ± 0.03. The uncertainty is determined from the dispersion around both the average density contained within the region defining PCl J1001+0220 and the median density of the full slice as measured in all 500 realizations. We adopt this overdensity value and its associated uncertainty as being the most reliable estimate of the galaxy overdensity of PCl J1001+0220. Because this value exceeds the value for equivalent galaxy populations of nearly all simulated proto-clusters at these redshifts when viewed over an equivalent volume, (∼21 comoving Mpc 3 , see Table 4 of Chiang et al. 2013), here and for the rest of the paper we abandon the term proto-structure when referring to the PCl J1001+0220 proto-cluster. It is also perhaps revelant to note that at this galaxy overdensity, assuming the galaxy bias value given in Sect. 4.1 and pressureless spherical collapse, the entire proto-cluster as defined over the volume above collapses well before z ∼ 0. Each overdensity value and central coordinate computed in this section, as well as a variety of other information on the PCl J1001+0220 proto-cluster, is given in Table 1.

Properties of the PCl J1001+0220 proto-cluster
With the identity of the PCl J1001+0220 proto-cluster now firmly established, in this section we attempt to contextualize this proto-cluster and to make a cursory exploration of the properties of its member galaxies using all the information at our disposal.  Fig. 5. Sky plot of the Voronoi Monte-Carlo overdensity map encompassing a large fraction of the VUDS coverage in the COSMOS field. The redshift bound over which the Voronoi tessellation is calculated is set to 4.53 < z < 4.60 . Both spectroscopic and photometric redshifts are treated statistically following the prescription in §3.3 and used generate 500 realizations of the overdensity map in the field. These 500 maps are then median combined and smoothed with a Gaussian of FWHM=4.5 pixel to create the plotted map. A color bar indicates the magnitude of the overdensity in each 75x75 kpc 2 pixel. The dashed line indicates the extent of our z spec member cut (R pro j = 2 Mpc). Small black dots denote galaxies with secure spectroscopic redshifts at z > 3 from VUDS and filled circumscribed blue diamonds denote those within the range of PCl J1001+0220. The PCl J1001+0220 proto-cluster exhibits coherent overdensity (log(1 + δ gal ) > 0.5) over an area of 7.58 Mpc 2 . Notes. (a) This number refers to all spectroscopically-confirmed members with R pro j < 2 Mpc and 4.53 < z spec < 4.60. (b) The number outside the parentheses refers to the formal significance of the detection after accounting for spurious density peaks (see §3).
Article number, page 11 of 27 The redshift bound over which the Voronoi tessellation is calculated is set to 4.53 < z < 4.60 . Both spectroscopic and photometric redshifts are treated statistically following the prescription in Sect. 3.3 and used generate 500 realizations of the overdensity map in the field. These 500 maps are then median combined and smoothed with a Gaussian of FWHM = 4.5 pixel to create the plotted map. A color bar indicates the magnitude of the overdensity in each 75 × 75 kpc 2 pixel. The dashed line indicates the extent of our z spec member cut (R pro j = 2 Mpc). Small black dots denote galaxies with secure spectroscopic redshifts at z > 3 from VUDS and filled circumscribed blue diamonds denote those within the range of PCl J1001+0220. The PCl J1001+0220 proto-cluster exhibits coherent overdensity (log(1 + δ gal ) > 0.5) over an area of 7.58 Mpc 2 .

Weighing the PCl J10001+0220 proto-cluster
At lower redshifts (z ≤ 1.5), a clear correlation exists between the total mass of a structure and the evolutionary state of its constituent galaxy population (e.g., Moran et al. 2007;Poggianti et al. 2008Poggianti et al. , 2009Hansen et al. 2009;Lubin et al. 2009;Lemaux et al. 2012;van der Burg et al. 2014;Balogh et al. 2016). While dispersion in this correlation exists, and while the evolutionary state of a galaxy population clearly has a complex relationship with other factors either causally or circumstantially connected with global environment, such as local (over)density, stellar mass, and global dynamics, estimating the total mass for a structure is a necessary step in predicting the fate of that structure and its constituent galaxy population. At such redshifts reasonably precise estimates are at least achievable with current technology through strong or weak lensing, X-ray observations, and dynamics measurements from exhaustive spectroscopic campaigns. While widely employed, these estimates necessarily have (relatively) large systematic uncertainties originating from the large number of assumptions required to even attempt a measurement with these data. Furthermore, at the highest redshifts in this redshift range z ∼ 1.5, only the most exquisite data sets can be reasonably employed to estimate total masses, as sparser sampling, differential bias in sampling, and/or lower signal-to-noise ratio (S/N) data makes it impossible to begin to probe the validity of these assumptions. At higher redshift, the situation becomes much more dire. Precision estimates of total masses through lensing and X-ray measurements become nearly impossible. Quiescent galaxies, whose global dynamics should, generally, better trace the underlying potential than other galaxy populations, become sparser and more difficult to detect spectroscopically, a requisite condition of any dynamics analysis. Furthermore, assumptions which generally come close to being valid for massive overdensities at lower redshift, for example, virialization, and hydrostatic equilibrium, almost certainly do not hold at these redshifts. Further, proto-clusters appear in simulations to have extremely complex morphologies, and overdensity estimates, estimates which are subsequently translated into total masses, appear to be strongly dependent on viewing angle σ v = 1038 ± 178 km/s Fig. 6. Differential velocity distribution of the z spec members of PCl J1001+0220. The median redshift of z spec member galaxies is shown in the top right corner of the plot. Also shown in the top right corner is the value of the best-fit line of sight (LOS) velocity dispersion (σ v , see §3 for details). The resulting Gaussian function generated by the best-fit σ v is overplotted on the differential velocity histogram (solid black line) along with those functions generated from σ v ± σ σv .
At higher redshift, the situation becomes much more dire. Precision estimates of total masses through lensing and X-ray measurements become nearly impossible. Quiescent galaxies, whose global dynamics should, generally, better trace the underlying potential than other galaxy populations, become sparser and more difficult to detect spectroscopically, a requisite condition of any dynamics analysis. Furthermore, assumptions which generally come close to being valid for massive overdensities at lower redshift, for example, virialization, and hydrostatic equilibrium, almost certainly do not hold at these redshifts. Further, proto-clusters appear in simulations to have extremely complex morphologies, and overdensity estimates, estimates which are subsequently translated into total masses, appear to be strongly dependent on viewing angle (Shattow et al. 2013). Given these difficulties, here we attempt a variety of different methods (as was done in Lemaux et al. (2014a)) to estimate the total mass of PCl J1001+0220. Because each of these methods requires a different set of assumptions, the above uncertainties are, to the best of our ability, minimized, allowing for at least the potential to triangulate the requisite accuracy to properly contextualize the PCl J1001+0220 proto-cluster when the estimates are averaged. Note that, for all proto-cluster estimates, the term "total mass" is meant to refer to the composite mass of all halos which comprise the proto-cluster (or the eventual cluster) rather than the mass of the most massive halo in the (proto-)cluster. As pointed out in, for example, Muldrew et al. (2015), the most massive halo for all high-redshift proto-clusters likely contains a small fraction of proto-cluster galaxies and a small fraction of the overall mass ( < ∼ 10% for both cases).
Under the assumption of virialization and isotropic motion, the LOS dynamics of satellite galaxies provide a readily available proxy for the total mass. Though the redshift of Cl1001+0220 makes it extremely unlikely that the observed members have reached a near-virialized state, and though the time at which a system is observed in its dynamical history dictates whether the dispersion of the observed LOS differential velocities is an overestimate or underestimate relative to it virial equivalent (see, e.g., Cucciati et al. 2014), neither of these possibilities precludes the possibility that such a measurement could provide useful information. Indeed, the member galaxies in both simulations of cluster progenitors (Cucciati et al. 2014) and in large compilations of observed proto-clusters (Franck & McGaugh 2016b) exhibit velocity dispersions, and, by consequence, dynamical masses, which correlate, weakly but signficantly, with masses derived through independent estimates. In Figure 6 we plot the differential LOS velocity dispersion of all nine members of Cl1001+0220 along with the biweight fit to distribution. The estimated LOS velocity dispersion of 1038±178 km s −1 corresponds to a virial mass of: While this is an immense total mass at this redshift, it is possible that galaxy dynamics provide a gross overestimate due the effects described above. The level by which this dynamical mass might exceed the true total mass can be crudely estimated by applying the average offset between dynamical masses and masses estimated by matter overdensities, in those cases where both were definitely measured, for the large ensemble of protoclusters compiled by Franck & McGaugh (2016b). Though it is not clear that overdensity masses necessarily more accurately reflect true total masses, the methods used in that study differ from those used here, and there exists a large scatter in this distribution ( (M dyn /M δ m ) = 6.7 +15.5 −4.6 , median and effective 1σ scatter); such a correction can perhaps provide a reasonable range of values in which the true total mass resides. Applying this median correction and propagating its uncertainties into the resultant corrected value yields log(M vir /M ⊙ ) z∼4.57,dyn,corr = 13.6 +0.4 −1.0 . A different approach is to count the amount of baryonic material within the proto-cluster bounds associated with the member galaxies and attempt to relate that back to the overall mass of the structure, an approach which has been employed successfully at low redshift when the galactic baryonic content of cluster member galaxies is dominated by stars (Andreon 2012). The approach we take here is similar to that of Lemaux et al. (2014a). Briefly, the total amount of stellar matter of the z spec members is counted and a completeness correction is made to this value for galaxies at stellar masses log(M * /M ⊙ ) ≥ 9.5 (see §4.2.1 for the reasoning behind this cut) based on the number of z phot members and non-members without secure spectral redshifts within the bounds of the proto-cluster and the likelihood of their being true members. An additional correction is made to correct for galaxies in the stellar mass range 8.0 < log(M * /M ⊙ ) < 9.5 by integrating the stellar mass function of Davidzon et al. (2017) appropriate for this redshift. Here we additionally make the assumption that stellar mass comprises 50% of the baryonic content of galaxies by mass at these redshifts, a value broadly consistent with the few measurements made at or near these redshifts (e.g., Tacconi et al. 2010;Capak et al. 2011b;Schinnerer et al. 2016;Scoville et al. 2016). We assume for the purposes of this calculation that the proto-cluster is a closed system, with all gas being converted to stars by z = 0 and that the completenesscorrected galaxy population which lies within R pro j ≤ 3 Mpc at z ∼ 4.57 comprises the entirety of the galaxy population which will eventually be contained within the cluster virial radius at Article number, page 12 of 27 Fig. 6. Differential velocity distribution of the z spec members of PCl J1001+0220. The median redshift of z spec member galaxies is shown in the top right corner of the plot. Also shown in the top right corner is the value of the best-fit line of sight (LOS) velocity dispersion (σ v , see Sect. 3 for details). The resulting Gaussian function generated by the best-fit σ v is overplotted on the differential velocity histogram (solid black line) along with those functions generated from σ v ± σ σv . (Shattow et al. 2013). Given these difficulties, here we attempt a variety of different methods (as was done in Lemaux et al. 2014a) to estimate the total mass of PCl J1001+0220. Because each of these methods requires a different set of assumptions, the above uncertainties are, to the best of our ability, minimized, allowing for at least the potential to triangulate the requisite accuracy to properly contextualize the PCl J1001+0220 proto-cluster when the estimates are averaged. Note that, for all proto-cluster estimates, the term "total mass" is meant to refer to the composite mass of all halos which comprise the proto-cluster (or the eventual cluster) rather than the mass of the most massive halo in the (proto-)cluster. As pointed out in, for example, Muldrew et al. (2015), the most massive halo for all high-redshift proto-clusters likely contains a small fraction of proto-cluster galaxies and a small fraction of the overall mass ( < ∼ 10% for both cases).
Under the assumption of virialization and isotropic motion, the LOS dynamics of satellite galaxies provide a readily available proxy for the total mass. Though the redshift of Cl1001+0220 makes it extremely unlikely that the observed members have reached a near-virialized state, and though the time at which a system is observed in its dynamical history dictates whether the dispersion of the observed LOS differential velocities is an overestimate or underestimate relative to it virial equivalent (see e.g., Cucciati et al. 2014), neither of these possibilities precludes the possibility that such a measurement could provide useful information. Indeed, the member galaxies in both simulations of cluster progenitors (Cucciati et al. 2014) and in large compilations of observed proto-clusters (Franck & McGaugh 2016b) exhibit velocity dispersions, and, by consequence, dynamical masses, which correlate, weakly but significantly, with masses derived through independent estimates. In Fig. 6 we plot the differential LOS velocity dispersion of all nine members of Cl1001+0220 along with the biweight fit to distribution. The estimated LOS velocity dispersion of 1038 ± 178 km s −1 corresponds to a virial mass of: log(M vir /M ) z∼4.57,dyn = 14.39 ± 0.22. (1) While this is an immense total mass at this redshift, it is possible that galaxy dynamics provide a gross overestimate due the effects described above. The level by which this dynamical mass might exceed the true total mass can be crudely estimated by applying the average offset between dynamical masses and masses estimated by matter overdensities, in those cases where both were definitely measured, for the large ensemble of proto-clusters compiled by Franck & McGaugh (2016b). Though it is not clear that overdensity masses necessarily more accurately reflect true total masses, the methods used in that study differ from those used here, and there exists a large scatter in this distribution ( (M dyn /M δ m ) = 6.7 +15.5 −4.6 , median and effective 1σ scatter); such a correction can perhaps provide a reasonable range of values in which the true total mass resides. Applying this median correction and propagating its uncertainties into the resultant corrected value yields log(M vir /M ) z∼4.57,dyn,corr = 13.6 +0.4 −1.0 . A different approach is to count the amount of baryonic material within the proto-cluster bounds associated with the member galaxies and attempt to relate that back to the overall mass of the structure, an approach which has been employed successfully at low redshift when the galactic baryonic content of cluster member galaxies is dominated by stars (Andreon 2012). The approach we take here is similar to that of Lemaux et al. (2014a). Briefly, the total amount of stellar matter of the z spec members is counted and a completeness correction is made to this value for galaxies at stellar masses log(M * /M ) ≥ 9.5 (see Sect. 4.2.1 for the reasoning behind this cut) based on the number of z phot members and non-members without secure spectral redshifts within the bounds of the proto-cluster and the likelihood of their being true members. An additional correction is made to correct for galaxies in the stellar mass range 8.0 < log(M * /M ) < 9.5 by integrating the stellar mass function of Davidzon et al. (2017) appropriate for this redshift. Here we additionally make the assumption that stellar mass comprises 50% of the baryonic content of galaxies by mass at these redshifts, a value broadly consistent with the few measurements made at or near these redshifts (e.g., Tacconi et al. 2010;Capak et al. 2011b;Schinnerer et al. 2016;Scoville et al. 2016). We assume for the purposes of this calculation that the proto-cluster is a closed system, with all gas being converted to stars by z = 0 and that the completeness-corrected galaxy population which lies within R pro j ≤ 3 Mpc at z ∼ 4.57 comprises the entirety of the galaxy population which will eventually be contained within the cluster virial radius at z = 0. The latter assumption is broadly consistent with simulations (Muldrew et al. 2015) to ∼10% accuracy, a factor which we account for in the calculation below. Any total to stellar mass conversion then provides a z = 0 total mass, which we de-evolve to z ∼ 4.57 using the correction factors of Muldrew et al. (2015) appropriate for z ∼ 4.57 (0.20 ± 0.03), a correction which is appropriate for descendents of all masses. Into this formalism we input the resulting completeness-corrected bary- We note that this estimate, within the context of this formalism, is almost certainly a lower limit to the total mass of the protocluster given that there exists diffuse gas at z ∼ 4.57, which we do not account for here, of which some fraction will undoubtedly be accreted and used to create stars by z = 0. This lack of precision may be mitigated somewhat by the fact that we also do not account for the loss of stellar and gaseous material in member galaxies due to, for example, tidal or ram pressure stripping or by merging activity as the proto-cluster evolves. Despite all the uncertainties related to this calculation, if the completenesscorrected baryonic content of the proto-cluster is instead used in conjunction with the universal dark-to-baryonic fraction of 6.42 ± 0.19 (Planck Collaboration XIII 2016) to estimate the total mass, the result, log(M/M ) z=4.57, ΣM * ,corr =13.20 +0.05 −0.07 , is nearly identical to the above calculations. Nevertheless, all three of these estimates place the descendant total mass of PCl J1001+0220 at or above M h,z=0 = 10 14 once evolved to z = 0 following the formalism of McBride et al. (2009) andFakhouri et al. (2010) (see Lemaux et al. 2014a for its implementation in this context) or Muldrew et al. (2015). Again, however, these methods relied on a number of assumptions which, while perhaps applicable generally, may or may not be valid for this particular proto-cluster.
The final method we employ here relies on the galaxy overdensity as measured from the Voronoi Monte Carlo technique in an attempt to relate this overdensity to the present day total mass through large-scale simulations (Chiang et al. 2013). As such, though this method is model dependent, neither the galaxy velocities, the lack of complete spectroscopic sampling, nor the bulk properties of the galaxies themselves should affect this estimation. This estimation follows that of Lemaux et al. (2014a) nearly identically and, as such, we do not repeat the formalism here. The one major exception for this study is the replacement of the spectroscopic overdensity, used previously, with the Voronoi Monte Carlo overdensity making the estimate much less uncertain and much less dependent on sampling effects. As we have also changed the volume used to calculate this overdensity, a volume which is now set by the SExtractor detection (segmentation) map (see Sect. 3.3) and the adopted redshift range of the proto-cluster, a correction is made following Chiang et al. (2013) to account for additional mass outside of the volume used (e.g., similar to an aperture correction, see also Muldrew et al. 2015). The correction factor is taken to be 1/0.8, appropriate for the segmentation-estimated proto-cluster volume which has an effective radius (R e ) of 10.9 comoving Mpc (see Chiang et al. 2013 for more details). We have also adopted here a bias parameter of b = 3.60, a reasonable estimate for the redshift and galaxy sample used here (Chiang et al. 2013;Durkalec et al. 2018). The final descendant mass of PCl J1001+0220 is then estimated to be: a descendant mass intermediate to those estimated by the other two methods. It is important to note, though, that this descendant mass is not cause for ambiguity in the overall nature of PCl J1001+0220; the galaxy overdensity value used here along with the area over which that overdensity was measured still places PCl J1001+0220 as a proto-cluster at >80% confidence (Chiang et al. 2013). The M h,z=0,(1+δ gal ) estimated above is broadly similar to the value derived from the formalism of Orsi et al. (2016), a study which employs vastly different models and assumptions. While it is necessary to assume for the latter that the proto-cluster contains a quasar or radio galaxy which has yet to be observed and that, further, the density of faint Lyα emitters scales directly with the density of ∼ L * galaxies at these redshifts, the concordance in M h,z=0,(1+δ gal ) estimates between these two methods is encouraging. Regardless of the method chosen, the large descendant mass estimated for PCl J1001+0220 requires, under the formalism of models predicting typical mass growth (Fakhouri et al. 2010;Muldrew et al. 2015), that considerable amount of mass assembles (log(M h /M ) > ∼ 12) before the end of the epoch of reionization (i.e., z ∼ 6). As such, the progenitors of systems like PCl J1001+0220 can be, under certain circumstances, useful in differentiating between different models of reionization (see D'Aloisio et al. 2015;Castellano et al. 2016;Davies & Furlanetto 2016 and references therein). Because none of our results are dependent on the exact total mass of PCl J1001+0220 either at z ∼ 4.57 or at z = 0, we satisfy ourselves with the understanding that PCl J1001+0220 is a massive proto-cluster that will evolve into a cluster at low redshift with an overarching mass somewhere in between that of the Fornax and Coma clusters (Kent & Gunn 1982;Colless & Dunn 1996;Nasonova et al. 2011).

An investigation of environmentally driven evolution at z ∼ 4.57
From the previous sections we have determined that the member galaxies of PCl J1001+0220 reside in a globally dense (i.e., proto-cluster) environment that exhibits a large-scale coherent elevated local density. Though strong correlations between galaxy properties and both local and global environment are observed at lower redshifts (z < ∼ 2, e.g., Goto et al. 2003

Luminosity, color, stellar mass, and age properties
We begin by comparing quantities derived from our various SED fitting techniques for PCl J1001+0220 member and coeval field samples defined in a variety of ways. We choose to use a variety of techniques as well as a variety of sample definitions to mitigate our ignorance of the true center of the proto-cluster, our ignorance of both the redshift and transverse extent of the member population, the lack of VUDS spectroscopic targeting in much of the eastern quadrants of the proto-cluster, and the various systematic and random uncertainties in SED fitting methods, uncertainties which are almost certainly exacerbated at these redshifts. Throughout these comparisons, the member and coeval field samples will always, by design, have the same median redshift and will be consistently cut at the various limits imposed. In Appendix B we describe in detail the definitions for the various samples used in the analysis in the remainder of this paper as well as the statistical approaches used to compare these samples.
In the left panel of Fig. 7 we plot the rest-frame M NUV − M r + versus M r + color-magnitude diagram (CMD) of the z spec PCl J1001+0220 proto-cluster members against the backdrop of the coeval field for all galaxies with log(M * /M ) ≥ 9.7. The area of this phase space where VUDS begins to become unrepresentative of the underlying photometric sample is shaded out.  Fig. 7. Left: Rest-frame NUV-r + vs. r + color-magnitude diagram for the z spec members of the PCl J1001+0220 proto-cluster (red diamonds) and a matched coeval field sample at 4.23 < z < 4.88 (small navy dots) at stellar masses log(M * /M ⊙ ) ≥ 9.7. The stellar mass cut is imposed here in an attempt to mitigate any induced differential bias between the field and proto-cluster members. Overplotted are the expected locations in this phase space of a progentor of a low-redshift cluster L * galaxy (assuming passive evolution) estimated from synthetic spectra (see §4.2.1) at different stellar-phase metallicities and with different SFHs. The gray dashed track is generated from an instantaneous burst while the three green tracks are generated from an exponentially declining SFH with τ = 100 Myr at three different stellar-phase metallicities. Age ticks along each track indicate the time since the inception of the star formation event. Color and magnitude histograms for each sample are normalized to each other such that they contain the same area. The solid light blue area indicates the area of this phase space for which VUDS is no longer representative of the underlying photometric sample at these redshifts and stellar masses (see §4.2.1). Right: Rest-frame M NUV -M r + vs. M * color-stellar-mass diagram for the identical samples as in the left panel. The meanings of all symbols and lines are unchanged. While some of the brightest, reddest, and most (stellar) massive galaxies in the entire spectroscopic sample at these redshifts fall within PCl J1001+0220, only a moderately significant difference in the average brightness and a significant difference in the stellar mass distribution is detected relative to the coeval field population.  7). All spectroscopic knowledge is ignored when defining these samples. Photo-z membership is defined by the projected radial cut given in the top left of the panel and 4.53 − 1.5σ N MAD (1 + 4.53) < z phot < 4.60 + 1.5σ N MAD (1 + 4.60) with the remaining galaxies in this z phot range comprising the coeval field sample. Histograms in both panels are normalized such that they contain the same area. While the two distributions shown in the left panel are formally consistent, once a color cut is imposed, the two distributions are found to be disparate at a > 3σ level, with redder z phot proto-cluster members exhibiting a preference for significantly higher masses relative to the coeval field.
for more details). Again, a difference between the two populations is observed in the stellar mass distribution in that galaxies in the proto-cluster are, on average, more massive in their stellar content ( log(M * /M ⊙ ) = 10.51 ± 0.07 vs. 10.05 ± 0.04). Here, however, we additionally see a clear difference between the two populations in their age in that galaxies in the proto-cluster appear older ( age f = 904 ± 63 vs. 640 ± 41 Myr) than the coeval field population. These differences are also significant at the ≥ 2.5σ level for both parameters when using a KS test. The observed difference in the ages of the two sub-samples is in apparent contrast to the lack of significant difference between the two spectroscopic populations seen in the rest-frame colors. However, the methods used by Thomas et al. (2017b) are considerably more advanced than that used to generate the models overplotted in Figure 7, with the former method employing a vastly larger array of SFHs, treatment for the effects of extinction, and the virtue of numerous tests performed on its precision and accuracy. A similar comparison of z phot members and coeval field galaxies could not be performed as the method of Thomas et al. (2017b) requires spectroscopy. Regardless, the common element of all three analyses presented here appears to be, at varying degrees of significance, a fractional excess within the proto-cluster of older, and, in some analyses, redder, galaxies which appear more massive in their stellar content. The large differences in stellar masses compared to what are relatively small differences in ages (or colors) necessarily implies that, on average, a protocluster galaxy must have a more violent SFH than their field counterparts, with stellar mass being built up more rapidly after their formation. As we will show in the following section, the two populations have indistinguishable instantaneous SFRs (integrated over ∼ 100 Myr), which, in conjunction with the results presented in this section, further implies that there is something necessarily different about the SFHs of the two populations.
The slope, β, of the continuum flux density in the rest-frame ultraviolet ( f λ ∝ λ β ) has been shown, for a given IMF and a given dust geometry and law, to correlate well with the level of dust in a galaxy (e.g., Meurer et al. 1999;Reddy et al. 2010;Álvarez-Márquez et al. 2016). As the analysis throughout this section relied heavily on the difference in the colors and vari-Article number, page 15 of 27 Fig. 7. Left panel: rest-frame NUV-r + versus r + color-magnitude diagram for the z spec members of the PCl J1001+0220 proto-cluster (red diamonds) and a matched coeval field sample at 4.23 < z < 4.88 (small navy dots) at stellar masses log(M * /M ) ≥ 9.7. The stellar mass cut is imposed here in an attempt to mitigate any induced differential bias between the field and proto-cluster members. Overplotted are the expected locations in this phase space of a progentor of a low-redshift cluster L * galaxy (assuming passive evolution) estimated from synthetic spectra (see Sect. 4.2.1) at different stellar-phase metallicities and with different SFHs. The gray dashed track is generated from an instantaneous burst while the three green tracks are generated from an exponentially declining SFH with τ = 100 Myr at three different stellar-phase metallicities. Age ticks along each track indicate the time since the inception of the star formation event. Color and magnitude histograms for each sample are normalized to each other such that they contain the same area. The solid light blue area indicates the area of this phase space for which VUDS is no longer representative of the underlying photometric sample at these redshifts and stellar masses (see Sect. 4.2.1). Right panel: rest-frame M NUV -M r + versus M * color-stellarmass diagram for the identical samples as in the left panel. The meanings of all symbols and lines are unchanged. While some of the brightest, reddest, and most (stellar) massive galaxies in the entire spectroscopic sample at these redshifts fall within PCl J1001+0220, only a moderately significant difference in the average brightness and a significant difference in the stellar mass distribution is detected relative to the coeval field population.
These bands are chosen as they discriminate galaxies of different ages in the absence of a copious dust content Moutard et al. 2016), which, as we show below, is likely minimal, on average, in both samples. Additionally, by selecting these colors, we guarantee that each galaxy in the selected sample was significantly detected in at least one observed-frame band that has reasonably similar wavelength coverage at z ∼ 4.57 to each of these rest-frame bands 12 . Such a guarantee minimizes k-correction necessary to calculate the rest-frame magnitudes and thus the model dependence of these magnitudes. The overplotted tracks show BC03 model-derived colors, absolute magnitudes, and stellar masses as generated by EZGAL 13 . These models, which employ a variety of different SFHs, stellar-phase metallicities, and epochs of the most recent major star formation event, were designed to match the brightness of a z ∼ 0.6 L * cluster galaxy (De Propris et al. 2013) after passively evolving for ∼7 Gyr (i.e., the difference in lookback time between z ∼ 4.57 and z ∼ 0.6).
Despite having both what is, by far, the reddest galaxy in VUDS at these redshifts and several other redder galaxies fall within the ∼1% of the volume of the VUDS-COSMOS field at these redshifts spanned by the proto-cluster, a statistical difference relative to the field is not observed. Neither a KS test nor a comparison of the average M NUV − M r + colors of the member and coeval field samples ( M NUV − M r + = 1.15 ± 0.15 vs. 1.11 ± 0.08, respectively) reveals any significant difference between the two populations. The proto-cluster also contains the brightest as well as three of the top ten brightest rest-frame optical galaxies in VUDS at these redshifts, and though, again, a KS test fails to find a significant difference (∼1σ), a comparison of the average M r + magnitudes yields marginally significant hints of 12 All VUDS galaxies used in this paper are detected significantly in the i + band and are required to be detected significantly in the [3.6] band. 13 http://www.baryons.org/ezgal/ a difference between the two populations ( M r + = −23.30 ± 0.20 vs. −22.87 ± 0.08, respectively). In the right panel of Fig. 7 we plot a rest-frame color-stellar mass diagram (CSMD) of the same two samples. The trend in magnitude hinted in the CMD appears to be manifest in the CSMD more significantly. Two of the four most (stellar) massive galaxies in the entire spectroscopic sample at these redshifts, both of which appear to contain a suggestively old luminosity-weighted stellar population (>200 Myr), lie within the bounds on the proto-cluster. Galaxies in this region of the phase space, log(M * /M ) > 10.6, M NUV − M r + > 1.5, have, broadly, already formed the requisite stellar material to passively evolve to lower redshift L * red-sequence galaxies. The presence of these two galaxies alone hint that this population is relatively more abundant in the proto-cluster than in the field (2/9 vs. 2/54), an inference which does not depend especially on the specific limits chosen here and one which we attempt to expand on below when comparing z phot members. A KS test (2.5σ) and a comparison of the average stellar masses of the member and coeval field galaxies (log( M * /M ) = 10.34 ± 0.08 and 10.09 ± 0.03, respectively) additionally both suggest that the average member galaxy has built up a more massive stellar content relative to other galaxies in the field. We note that these conclusions and those below comparing stellar mass distributions of z phot members and non-members are unchanged if we instead compare inverse error-weighted means.
The above comparisons were made with only the spectral members and exclusively using one of the SED fitting techniques we have at our disposal. Though we have gone to extreme lengths to ensure that the VUDS sample is largely representative of the underlying population of galaxies at these redshifts within the selected color and stellar mass range and that the SED method chosen is reliable at these redshifts, we now attempt two different reformulations of the analysis in order to understand if the trends hinted at in the above analysis are robust to changing the analysis framework. We begin by considering the z phot A77, page 14 of 27 B. C. Lemaux et al.: VUDS discovery of a high-redshift proto-cluster they contain the same area. The solid light blue area indicates the area of this phase space for which VUDS is no longer representative of the underlying photometric sample at these redshifts and stellar masses (see §4.2.1). Right: Rest-frame M NUV -M r + vs. M * color-stellar-mass diagram for the identical samples as in the left panel. The meanings of all symbols and lines are unchanged. While some of the brightest, reddest, and most (stellar) massive galaxies in the entire spectroscopic sample at these redshifts fall within PCl J1001+0220, only a moderately significant difference in the average brightness and a significant difference in the stellar mass distribution is detected relative to the coeval field population.  7). All spectroscopic knowledge is ignored when defining these samples. Photo-z membership is defined by the projected radial cut given in the top left of the panel and 4.53 − 1.5σ N MAD (1 + 4.53) < z phot < 4.60 + 1.5σ N MAD (1 + 4.60) with the remaining galaxies in this z phot range comprising the coeval field sample. Histograms in both panels are normalized such that they contain the same area. While the two distributions shown in the left panel are formally consistent, once a color cut is imposed, the two distributions are found to be disparate at a > 3σ level, with redder z phot proto-cluster members exhibiting a preference for significantly higher masses relative to the coeval field.
for more details). Again, a difference between the two populations is observed in the stellar mass distribution in that galaxies in the proto-cluster are, on average, more massive in their stellar content ( log(M * /M ⊙ ) = 10.51 ± 0.07 vs. 10.05 ± 0.04). Here, however, we additionally see a clear difference between the two populations in their age in that galaxies in the proto-cluster appear older ( age f = 904 ± 63 vs. 640 ± 41 Myr) than the coeval field population. These differences are also significant at the ≥ 2.5σ level for both parameters when using a KS test. The observed difference in the ages of the two sub-samples is in apparent contrast to the lack of significant difference between the two spectroscopic populations seen in the rest-frame colors. However, the methods used by Thomas et al. (2017b) are considerably more advanced than that used to generate the models overplotted in Figure 7, with the former method employing a vastly larger array of SFHs, treatment for the effects of extinction, and the virtue of numerous tests performed on its precision and accuracy. A similar comparison of z phot members and coeval field galaxies could not be performed as the method of Thomas et al. (2017b) requires spectroscopy. Regardless, the common element of all three analyses presented here appears to be, at varying degrees of significance, a fractional excess within the proto-cluster of older, and, in some analyses, redder, galaxies which appear more massive in their stellar content. The large differences in stellar masses compared to what are relatively small differences in ages (or colors) necessarily implies that, on average, a protocluster galaxy must have a more violent SFH than their field counterparts, with stellar mass being built up more rapidly after their formation. As we will show in the following section, the two populations have indistinguishable instantaneous SFRs (integrated over ∼ 100 Myr), which, in conjunction with the results presented in this section, further implies that there is something necessarily different about the SFHs of the two populations.
The slope, β, of the continuum flux density in the rest-frame ultraviolet ( f λ ∝ λ β ) has been shown, for a given IMF and a given dust geometry and law, to correlate well with the level of dust in a galaxy (e.g., Meurer et al. 1999;Reddy et al. 2010;Álvarez-Márquez et al. 2016). As the analysis throughout this section relied heavily on the difference in the colors and vari-Article number, page 15 of 27 and only those z phot members with redder rest-frame colors (M NUV − M r + > 1.5, right panel) plotted against the corresponding coeval field sample. Such a color cut selects galaxies with ages in excess of 200 Myr (see Fig. 7). All spectroscopic knowledge is ignored when defining these samples. Photo-z membership is defined by the projected radial cut given in the top left of the panel and 4.53 − 1.5σ N MAD (1 + 4.53) < z phot < 4.60 + 1.5σ N MAD (1 + 4.60) with the remaining galaxies in this z phot range comprising the coeval field sample. Histograms in both panels are normalized such that they contain the same area. While the two distributions shown in the left panel are formally consistent, once a color cut is imposed, the two distributions are found to be disparate at a >3σ level, with redder z phot proto-cluster members exhibiting a preference for significantly higher masses relative to the coeval field. member sample and contrasting it with the z phot coeval field sample (see Sect. 3.2 for the definition of each sample). While it is not necessary that differences seen in the previous analysis persist in significance or gain traction, as the photometric redshifts blur the lines between the proto-cluster and the coeval field considerably, any agreement would lend veracity to the claims made with the relatively small spectroscopic sample. In the left panel of Fig. 8 we compare the stellar mass distribution of all z phot members and the coeval field sample to the stellar mass limit of log(M * /M ) > 9.5. A comparison of the median stellar masses (log( M * /M ) ∼ 9.9 in both cases) and a KS test return no significant difference between the two samples. If, however, we instead restrict the sample to those galaxies with M NUV − M r + ≥ 1.5, which, according to the models overplotted in Fig. 7 have ages that are ≥200 Myr, the two populations begin to differentiate themselves. Both a comparison of the average stellar mass of the two populations and a KS test return differences at >3σ, with the redder galaxies in the proto-cluster appearing at much higher stellar masses than their coeval field counterparts; log( M * /M ) = 10.61 ± 0.06 versus 10.22 ± 0.02, respectively. The relative overdensity within the proto-cluster bounds of possible progenitors to low-redshift cluster L * galaxies hinted at in the spectral member sample is also hinted at here ( f red,L * =14.6±4.2 vs. 8.2±0.7%).
As a final check on these results, we shift back to considering only the spectral sample and plot in Fig. 9 the stellar masses and formation age (age f 14 hereafter) of the member and coeval field galaxies derived following the combination spectral and photometric fitting methodology of Thomas et al. (2017b). For all fits we required a "fitting flag" of ≥2 corresponding to good or 14 This age is not to be confused with the previously mentioned age coming from EZGAL, which is essentially a luminosity-weighted age that is limited in resolution to the most recent star formation event.
A&A proofs: manuscript no. A   Fig. 9. Left: Distribution of stellar masses of spec-z members of PCl J1001+0220 (red histogram) and coeval field galaxies (blue histogram) as measured by GOSSIP+. The projected radial cut used to define membership is indicated at the top left of the panel. Right: Distributions of the formation ages of proto-cluster spec-z members and coeval field galaxies, where age f is defined as the time elapsed since the galaxy began forming stars (see Thomas et al. 2017b). Colors are identical to the left panel. The age of universe at the redshift of the proto-cluster (z ∼ 4.57) using our adopted cosmology is denoted by the dashed line. Two coeval field galaxies with log(age f yr −1 )∼7.7 are not shown for clarity. Average effective 1σ lower and upper uncertainties for age estimates of the galaxies in each sample are shown in the top left. There is an obvious excess of more (stellar) massive and older galaxies within the bounds of the proto-cluster; both sets of distributions are inconsistent with being drawn from the same underlying population at a > 2.5σ level and have average quantities which are inconsistent at an even larger significance. Additionally, the majority of proto-cluster members (5/9) appear to have formed within 500 Myr of the Big Bang as compared to only ∼10% of coeval field galaxies.
ous ages of member and coeval field samples, and as the precision and meaning of such parameters can be heavily influenced by the presence of dust, an independent measure of the dust content is important to gain context on the reliability of these results and their interpretation. In Figure 10 we plot the distribution of β slopes measured for the member and coeval field samples. While, in principle such measurements could be made directly on the galaxy spectra, the high redshift of the sample along with the low resolution of the VUDS spectra and the high density of OH lines preclude a brute force measurement in practice. Future effort will be made to comprise a scheme to measure the spectral β slope on VUDS spectra at these redshifts. Here, instead, we measure β on the COSMOS2015 photometry using the I + /Z ++ /Y/J bands for galaxies in the range 4.23 ≤ z ≤ 4.5 and the Z ++ /Y/J/H bands for galaxies in the range 4.5 < z ≤ 4.88 adopting the photometric methods of Hathi et al. (2016). This choice of photometry avoids the Lyα feature for all cases and provides a nearly contiguous coverage of all ten bandpasses defined by Calzetti et al. (1994). In neither the average value of the members versus the coeval field galaxies (β ∼ −1.7 in both cases) nor a KS test do the two populations appear different in their UV continuum slopes. Furthermore, the average value measured in the two populations implies that the member and coeval field galaxies have little dust content (Finkelstein et al. 2012;Castellano et al. 2014;Álvarez-Márquez et al. 2016 9. Left panel: distribution of stellar masses of spec-z members of PCl J1001+0220 (red histogram) and coeval field galaxies (blue histogram) as measured by GOSSIP+. The projected radial cut used to define membership is indicated at the top left of the panel. Right: distributions of the formation ages of proto-cluster spec-z members and coeval field galaxies, where age f is defined as the time elapsed since the galaxy began forming stars (see Thomas et al. 2017b). Colors are identical to the left panel. The age of Universe at the redshift of the proto-cluster (z ∼ 4.57) using our adopted cosmology is denoted by the dashed line. Two coeval field galaxies with log(age f yr −1 ) ∼ 7.7 are not shown for clarity. Average effective 1σ lower and upper uncertainties for age estimates of the galaxies in each sample are shown in the top left. There is an obvious excess of more (stellar) massive and older galaxies within the bounds of the proto-cluster; both sets of distributions are inconsistent with being drawn from the same underlying population at a >2.5σ level and have average quantities which are inconsistent at an even larger significance. Additionally, the majority of proto-cluster members (5/9) appear to have formed within 500 Myr of the Big Bang as compared to only ∼10% of coeval field galaxies. excellent fits to both the spectroscopy and photometry for each galaxy. The fitting method used to derive these parameters is considerably different in scope, in the range of models used, and in input than those used in any of the previous analysis presented in this section, and has been found to accurately and precisely recover the ages of simulated galaxies using VUDSlike observations under the assumption that the SFHs of galaxies at these redshifts can be approximated well by the range of SFHs assumed in our fitting process (see Thomas et al. 2017b for more details). Again, a difference between the two populations is observed in the stellar mass distribution in that galaxies in the proto-cluster are, on average, more massive in their stellar content ( log(M * /M ) = 10.51 ± 0.07 vs. 10.05 ± 0.04). Here, however, we additionally see a clear difference between the two populations in their age in that galaxies in the proto-cluster appear older ( age f = 904 ± 63 vs. 640 ± 41 Myr) than the coeval field population. These differences are also significant at the ≥2.5σ level for both parameters when using a KS test. The observed difference in the ages of the two sub-samples is in apparent contrast to the lack of significant difference between the two spectroscopic populations seen in the rest-frame colors. However, the methods used by Thomas et al. (2017b) are considerably more advanced than that used to generate the models overplotted in Fig. 7, with the former method employing a vastly larger array of SFHs, treatment for the effects of extinction, and the virtue of numerous tests performed on its precision and accuracy. A similar comparison of z phot members and coeval field galaxies could not be performed as the method of Thomas et al. (2017b) requires spectroscopy. Regardless, the common element of all three analyses presented here appears to be, at  10. Distributions of the estimate for the index of the power-law dependence between flux density and wavelength in the rest-frame UV (β-slope) measured from the photometry of individual galaxies for PCl J1001+0220 spec-z members and the coeval field (red and blue histograms, respectively). Measurements are made assuming a single (unbroken) power-law dependence of f λ on λ for each galaxy following the method of Hathi et al. (2016). For galaxies at z < 4.5, β is measured with the i + z + Y J photometry spanning roughly 1300Å < λ rest < 2300Å. At z > 4.5, the i + band is exchanged for the H band (covering roughly 1400Å < λ rest < 3000Å) to prevent Lyα emission from potentially contaminating the measurement. The average errors on individual measurements for the two samples are shown at the top left (σ β ∼ 0.2 in both cases). Galaxies where extremely blue β-slopes are measured (β < −2.5) have average errors roughly twice the global average. The median β-slope for both samples isβ ∼ −1.7 and no significant difference is measured between the two distributions. surement (see, e.g., Castellano et al. 2014) and there are perhaps more reliable ways to calculate this quantity than the method employed here (see, e.g., Finkelstein et al. 2012), we require only the precision here necessary to claim that there is no apparent difference in the dust content of the two populations. Having achieved that aim, we now move to the properties of galaxies as measured by photometry in wavelengths not exclusively in the UV/optical/NIR regime.

Multiwavelength properties and AGN content
One of the most intriguing lines of inquiry of high-redshift environmental studies relates to the question of whether the protocluster environment preferentially enhances, diminishes, or has no effect on the frequency and/or level of star formation and galactic nuclear activity in galaxies relative to matched samples in field environments. An overwhelming variety of results have been obtained for studies of high-redshift (proto-)clusters (e.g., Digby-North et al. 2010;Capak et al. 2011a;Martini et al. 2013;Cucciati et al. 2014;Lemaux et al. 2014a;Casey et al. 2015;Diener et al. 2015;Wang et al. 2016) underscoring that trends are highly dependent on the sample selected, the nature of the overdensity or overdensities studied, the level of care taken in Fig. 10. Distributions of the estimate for the index of the power-law dependence between flux density and wavelength in the rest-frame UV (β-slope) measured from the photometry of individual galaxies for PCl J1001+0220 spec-z members and the coeval field (red and blue histograms, respectively). Measurements are made assuming a single (unbroken) power-law dependence of f λ on λ for each galaxy following the method of Hathi et al. (2016). For galaxies at z < 4.5, β is measured with the i + z + Y J photometry spanning roughly 1300 Å < λ rest < 2300 Å. At z > 4.5, the i + band is exchanged for the H band (covering roughly 1400 Å < λ rest < 3000 Å) to prevent Lyα emission from potentially contaminating the measurement. The average errors on individual measurements for the two samples are shown at the top left (σ β ∼ 0.2 in both cases). Galaxies where extremely blue β-slopes are measured (β < −2.5) have average errors roughly twice the global average. The median β-slope for both samples isβ ∼ −1.7 and no significant difference is measured between the two distributions. varying degrees of significance, a fractional excess within the proto-cluster of older, and, in some analyses, redder, galaxies which appear more massive in their stellar content. The large differences in stellar masses compared to what are relatively small differences in ages (or colors) necessarily implies that, on average, a proto-cluster galaxy must have a more violent SFH than their field counterparts, with stellar mass being built up more rapidly after their formation. As we will show in the following section, the two populations have indistinguishable instantaneous SFRs (integrated over ∼100 Myr), which, in conjunction with the results presented in this section, further implies that there is something necessarily different about the SFHs of the two populations.
The slope, β, of the continuum flux density in the rest-frame ultraviolet ( f λ ∝ λ β ) has been shown, for a given IMF and a given dust geometry and law, to correlate well with the level of dust in a galaxy (e.g., Meurer et al. 1999;Reddy et al. 2010;Álvarez-Márquez et al. 2016). As the analysis throughout this section relied heavily on the difference in the colors and various ages of member and coeval field samples, and as the precision and meaning of such parameters can be heavily influenced by the presence of dust, an independent measure of the dust content is important to gain context on the reliability of these results and their interpretation. In Fig. 10 we plot the distribution of β slopes measured for the member and coeval field samples. While, in principle such measurements could be made directly on the galaxy spectra, the high redshift of the sample along with the low resolution of the VUDS spectra and the high density of OH lines preclude a brute force measurement in practice. Future effort will be made to comprise a scheme to measure the spectral β slope on VUDS spectra at these redshifts. Here, instead, we measure β on the COSMOS2015 photometry using the I + /Z ++ /Y/J bands for galaxies in the range 4.23 ≤ z ≤ 4.5 and the Z ++ /Y/J/H bands for galaxies in the range 4.5 < z ≤ 4.88 adopting the photometric methods of Hathi et al. (2016). This choice of photometry avoids the Lyα feature for all cases and provides a nearly contiguous coverage of all ten bandpasses defined by Calzetti et al. (1994). In neither the average value of the members versus the coeval field galaxies (β ∼ −1.7 in both cases) nor a KS test do the two populations appear different in their UV continuum slopes. Furthermore, the average value measured in the two populations implies that the member and coeval field galaxies have little dust content (Finkelstein et al. 2012;Castellano et al. 2014;Álvarez-Márquez et al. 2016), though a non-negligible fraction appear to have at least moderate levels of dust (β >-1.4) or roughly E s (B − V) > 0.15, see Álvarez-Márquez et al. 2016). While there are many caveats for the interpretation of this measurement (see e.g., Castellano et al. 2014) and there are perhaps more reliable ways to calculate this quantity than the method employed here (see e.g., Finkelstein et al. 2012), we require only the precision here necessary to claim that there is no apparent difference in the dust content of the two populations. Having achieved that aim, we now move to the properties of galaxies as measured by photometry in wavelengths not exclusively in the UV/ optical/NIR regime.

Multiwavelength properties and AGN content
One of the most intriguing lines of inquiry of high-redshift environmental studies relates to the question of whether the proto-cluster environment preferentially enhances, diminishes, or has no effect on the frequency and/or level of star formation and galactic nuclear activity in galaxies relative to matched samples in field environments. An overwhelming variety of results have been obtained for studies of high-redshift (proto-)clusters (e.g., Digby-North et al. 2010;Capak et al. 2011a;Martini et al. 2013;Cucciati et al. 2014;Lemaux et al. 2014a;Casey et al. 2015;Diener et al. 2015;Wang et al. 2016) underscoring that trends are highly dependent on the sample selected, the nature of the overdensity or overdensities studied, the level of care taken in appropriately matching the high-and low-density samples, and the wavelength of light and scheme used to proxy star formation or AGN activity. Though the amount of information available to us at z ∼ 4.57 is somewhat limited, we draw here on nearly the entirety of the wealth of data in the COSMOS field to attempt to investigate the relative level of activity in the PCl J1001+0220 proto-cluster. Shown in Fig. 11 are the LE PHARE SED-fit stellar masses and SFRs of the (z spec ) proto-cluster members plotted against those of the coeval field population subject to the constraints applied in the previous section. By eye, the two SED-fit SFR distributions appear similar, with a KS test confirming this observation, finding the two distributions to be statistically indistinguishable. As in the previous analyses, we do not rely solely on a KS test to attempt to distinguish the two populations. Shown in the top right-hand corner of Fig. 11 are the median SED-fit SFRs of each population along with their bootstrapped uncertainties generated following the method of Lemaux et al. (2014a). While there appears a slight suppression of the level of star formation activity within the proto-cluster bounds, the uncertainties are far too large to claim any level of significance. As all galaxies in VUDS are, by selection, star forming, these galaxy populations cannot be used to comment on the frequency (fraction) of star-forming galaxies in each environment. Similarly ambiguous results are found when we compare the star formation activity of the z phot proto-cluster member sample relative to that of the z phot coeval field sample. Using the definitions from Sect. 3.2 results in median SED-fit SFRs of 46 ± 12 and 32 ± 3 M yr −1 for the z phot proto-cluster member and coeval field populations, respectively. A lack of significant differences in this quantity remains if we instead consider only those z phot objects in the magnitude-color-M * phase space where VUDS is representative (see Sect. 4.2.1).
The previous analysis relies on SED-fit SFRs, a method that has not been thoroughly tested for accuracy at the redshifts explored in this paper, a deficiency which is amplified due to the relatively small size of the galaxy samples considered here. Further, the SED-fit SFRs are extinction corrected using photometry that has only a modest dynamic range in wavelength in the rest-frame, the bulk of which lies in the rest-frame UV at these redshifts. As such, we attempt here to measure the average SFR of the proto-cluster member and coeval field samples using a tracer not subject to such uncertainties. Such a complementary approach is important as it has been suggested that at least some (proto-)clusters are dominated by dusty star-forming galaxies (e.g., Casey et al. 2015;Wang et al. 2016). For this purpose we are required to shift to wavelengths further to the red. There are only four viable possibilities for the COSMOS field. The first are deep Spitzer/MIPS 24 µm observations used to routinely probe galaxy populations at 0 < z < 2 in COSMOS (Le Floc'h et al. 2009). However, at z ∼ 4.57, these observations are too far blue to reliably probe young stellar emission processed and re-radiated by dust. The second are deep Herschel/PACS and SPIRE observations taken on COSMOS, but even at z ∼ 3, the stack of 100s to 1000s of galaxies are needed for a significant detection (Álvarez-Márquez et al. 2016). The third comes from imaging taken with the James Clerk Maxwell Telescope (JCMT) at 850 µm to a 5σ depth of 6.0 mJy/beam observed as part of the SCUBA-2 Cosmology Legacy Survey (S2LCS; Geach et al. 2017). The final possibility comes from recently obtained extremely deep (currently 11.5 µJy/beam, 5σ) 3 GHz imaging from the Karl G. Jansky Very Large Array (JVLA) of the entirety of the COSMOS field Smolčić et al. 2017b). The rest-frame frequency which these observations correspond to at z ∼ 4.57 is a probe of synchrotron emission generated from supernovae and is found to correlate well with star formation activity (see Condon 1992 and references therein) in the absence of a radio AGN. The latter possibility is addressed later in this section. No significant individual detections in either the S2LCS nor the JVLA imaging were found amongst the proto-cluster member or coeval field samples.
In order to attempt to place more stringent constraints on the prevalence of dusty star formation activity we performed stacking on the JVLA imaging, which was preferred over the S2LCS imaging due to its comparable depth in SFR and considerably smaller PSF. Proto-cluster member and coeval field galaxies were stacked separately at 3 GHz by inverse-varianceweighted mean combining postage stamps centered on the optical centroid of each galaxy. While poor relative astrometry, if randomly rather than systematically poor, can render this exercise pointless, the astrometry of the JVLA mosaic was checked against the COSMOS2015 catalog and was found to be accurate to an RMS of 0.1 . As this is half the size of the JVLA pixel scale and considerably smaller than the restored circular  Fig. 11. Distribution of the PCl J1001+0220 spec-z members (red diamonds and histogram) and coeval field galaxies (navy points and histogram) in the SFR-M * plane as estimated by Le Phare. Histograms are normalized as in other figures such that the area is identical for both samples. The upper limit to the radio-derived SFR of spec-z member and coeval field galaxies coming from stacked 3 GHz JVLA data is shown as the green and light blue lines/arrows, respectively. The lack of detection for both proto-cluster members and coeval field galaxies suggests that prodigious dusty star formation (IRX ∼ < 10) is no more common in the proto-cluster than in the overall field at these redshifts and is not particularly common in either environment. Hatch et al. 2017). Such analysis will be followed further as the member and coeval field sample in and around PCl J1001+0220 is increased and for other, lower redshift VUDS proto-structures discovered in the COSMOS field.
To investigate the relative presence of AGN activity amongst the proto-cluster member and coeval field populations we matched both samples to the Chandra-COSMOS point source catalogue (Elvis et al. 2009;Civano et al. 2016) finding no significant detection (> 3σ). No significant detections were found amongst the z phot proto-cluster members either. Such matching was also performed on the JVLA 3 GHz imaging finding no significant detection to a 3σ power density limit of log(P ν W −1 Hz) > ∼ 24.4. Additionally, Chandra images 15 were stacked using the optical position of the sources as a centroid yielding no significant stacked signal to a 90% limit of log(L X, 2−10keV ergs −1 s) < 43.54 and 43.21 for the proto-cluster member and coeval field populations, respectively. An identical stack was performed on z phot proto-cluster members yielding no significant detection to a depth of log(L X, 2−10keV ergs −1 s) < 43.10. Note that the proto-cluster lies at the boundary between the deep and shallower portions of the C-COSMOS survey with the effective exposure times varying by a factor of ∼5 over area enclosed by both the spectral and z phot members. Further, we applied the midinfrared criteria developed by Donley et al. (2012) to select powerful (both obscured and unobscured) AGN, finding that none of the sources analyzed here exhibit an AGN-like trend which could possibly be ascribed to a circumnuclear torus. Note that, in this context, the terms "obscured" and "unobscured" AGN refer to sources with rest-frame optical emission dominated by the host galaxy or by the AGN, respectively. The lack of individual and stacked X-ray and mid-infrared emission along with the complete absence of any broadline AGN activity amongst the VUDS 15 The Chandra web tool for X-ray stacking, CSTACK  11. Distribution of the PCl J1001+0220 spec-z members (red diamonds and histogram) and coeval field galaxies (navy points and histogram) in the SFR-M * plane as estimated by LE PHARE. Histograms are normalized as in other figures such that the area is identical for both samples. The upper limit to the radio-derived SFR of spec-z member and coeval field galaxies coming from stacked 3 GHz JVLA data is shown as the green and light blue lines/arrows, respectively. The lack of detection for both proto-cluster members and coeval field galaxies suggests that prodigious dusty star formation (IRX 10) is no more common in the proto-cluster than in the overall field at these redshifts and is not particularly common in either environment. PSF of 0.75 , it is unlikely that astrometric smearing would lead to degradation of any stacked signal. Despite the extreme depth of the data, the relatively large (combined) sample size, and the relatively high mean optical/UV SFRs estimated for each sample, in neither case was a significant (>3σ) detection measured. The 3σ f ν limit for the proto-cluster member sample was found to be 2.24 µJy. This limit translates to a 3σ S FR 1.4GHz limit of <157 M yr −1 following a k-correction of α = 0.68 (Lemaux et al. 2014b) and the SFR conversion of Bell (2003) converted to a Chabrier (2003) IMF. This limit, overplotted in Fig. 11, is ∼4× the average UV/optical SFR of the proto-cluster member sample. However, since this ratio is dependent on the dust-correction scheme employed by the SED fitting process, we instead calculate the IR excess (IRX ≡ L IR /L FUV ) from the average M FUV and the radio limits assuming the q T IR value from Lemaux et al. (2014b) for star-forming galaxies. This exercise yields an IRX 10 for the proto-cluster member sample, a value which precludes pervasive dusty starburst activity amongst the proto-cluster member galaxies. A similarly stringent limit of IRX 6 is found for the same sample if we replace the stacked radio limit with the L IR limit as derived by MAGPHYS from the average proto-cluster member SED. An additional attempt was made to stack with Herschel/SPIRE but no significant detection was found and the corresponding limits were too shallow.
The 54 galaxies of the coeval field sample were stacked at 3 GHz in a similar manner finding no detection and resulting in a 3σ f ν limit of 0.95 µJy or a S FR 1.4 GHz, 3σ limit of <64 M yr −1 at z ∼ 4.6. Such limits begin to rival the average UV/optical SFRs of the coeval field sample (see Fig. 11) and the resulting IRX limit of ≤4.8 approaches the value expected for galaxies with β slopes similar to those observed in the coeval field sample (e.g., Álvarez-Márquez et al. 2016). As above, a similarly stringent value of IRX ≤ 2 is found if we instead impose the MAGPHYS L IR limit in conjunction with the average M FUV . While not extremely constraining for the populations studied in this paper, this exercise has allowed us to definitively show that the SED-fit SFRs do not vastly underestimate the true SFRs for A77, page 17 of 27 A&A 615, A77 (2018) these populations and that, despite the high redshift and relatively high local densities, the proto-cluster is not dominated by dusty, prodigiously star-forming galaxies, nor, for that matter, by radio AGN activity. However, neither this analysis nor the β-slope analysis presented in the previous section precludes the possibility that some subdominant portion of the member population is undergoing obscured star formation activity (as in, e.g., Hatch et al. 2017). Such analysis will be followed further as the member and coeval field sample in and around PCl J1001+0220 is increased and for other, lower redshift VUDS proto-structures discovered in the COSMOS field.
To investigate the relative presence of AGN activity amongst the proto-cluster member and coeval field populations we matched both samples to the Chandra-COSMOS point source catalogue (Elvis et al. 2009;Civano et al. 2016) finding no significant detection (>3σ). No significant detections were found amongst the z phot proto-cluster members either. Such matching was also performed on the JVLA 3 GHz imaging finding no significant detection to a 3σ power density limit of log(P ν W −1 Hz) > ∼ 24.4. Additionally, Chandra images 15 were stacked using the optical position of the sources as a centroid yielding no significant stacked signal to a 90% limit of log(L X, 2−−10 keV ergs −1 s) < 43.54 and 43.21 for the proto-cluster member and coeval field populations, respectively. An identical stack was performed on z phot proto-cluster members yielding no significant detection to a depth of log(L X, 2−10 keV ergs −1 s) < 43.10. Note that the proto-cluster lies at the boundary between the deep and shallower portions of the C-COSMOS survey with the effective exposure times varying by a factor of ∼5 over area enclosed by both the spectral and z phot members. Further, we applied the mid-infrared criteria developed by Donley et al. (2012) to select powerful (both obscured and unobscured) AGN, finding that none of the sources analyzed here exhibit an AGN-like trend which could possibly be ascribed to a circumnuclear torus. Note that, in this context, the terms "obscured" and "unobscured" AGN refer to sources with rest-frame optical emission dominated by the host galaxy or by the AGN, respectively. The lack of individual and stacked X-ray and mid-infrared emission along with the complete absence of any broadline AGN activity amongst the VUDS sample at these redshifts suggest that no significant radiatively efficient AGN activity is taking place in these sources.
To test this hypothesis in a statistical sense, we constructed the average SED separately for the proto-cluster member, coeval field, and z phot members. These average SEDs were then fit with SED3FIT (Berta et al. 2013) via χ 2 ν minimization. Through this SED-fitting decomposition technique we found that the fraction of the rest-frame IR (8-1000 µm) and UV light contributed by the torus/AGN is, in all three cases, negligible (≤1%). In the mid-IR, defined as an integration over 5-40 µm in the restframe, where the relative contribution from an obscured AGN is maximized, the fraction of light contributed by the torus/AGN remains at only a few percent (≤5%). This line of analysis, while almost certainly imprecise due to only upper limits being available on many of the most discriminating bands, allows us to make definitive claims on the prevalance of AGN activity amongst the proto-cluster galaxies. It appears that, within the limits of our data, there is no evidence for any powerful (i.e., quasar level) unobscured or obscured radiatively efficient or inefficient AGN activity in any of the known or potential member galaxies, a conclusion which is robust to changes of factors of a few in the above estimates. This fact precludes the possibility that such activity is a requisite condition for the presence of a proto-cluster and suggests that the proto-cluster does not necessarily foster such activity (e.g., Hatch et al. 2014), though the validity of the latter suggestion is subject to a variety of timescale concerns which we do not attempt to address in this paper.
As a final thought related to multiwavelength activity in and around PCl J1001+0220, it is interesting to note that a prodigiously star forming submillimeter galaxy (SMG, S FR T IR = 500-2000 M yr −1 ) or galaxy-galaxy merger detected in AzTEC 1.1 mm observations was spectroscopically confirmed by Capak et al. (2008) to be at z = 4.547 ± 0.002, in proximity to PCl J1001+0220. Such a discovery is interesting since potential links between proto-structures and SMGs have been proposed by many authors (e.g., Daddi et al. 2009;Aravena et al. 2010;Smolčić et al. 2017a;Wilkinson et al. 2017). However, this SMG is well outside the bounds of either the z spec or z phot proto-cluster member samples ([α J2000 , δ J2000 ] = [150.22715, 2.5764]) regardless of the choice of centers. As a consequence, PCl J1001+0220 could not have been found solely by looking for an overdensity surrounding this SMG. While the SMG may eventually merge with the proto-cluster core, a process that would take ∼6 Gyr assuming the LOS velocity to be equivalent to the infalling velocity and a purely radial orbit, it is sufficient to say that the proto-cluster and the SMG do not appear causally related at the epoch at which they are observed.

Morphological properties
In the penultimate section of the comparisons between protocluster members and coeval field galaxies we return to rest-frame UV light to inspect the morphology of these galaxies as they appear in the COSMOS HST/F814W imaging. Because the PCl J1001+0220 is not covered by the CANDELS/COSMOS survey, these images are the only images for which we can get reliable morphological information. This band is nearly identical to the rest-frame FUV at these wavelengths 16 and, thus, any morphological information derived from these images will be highly subject to dust effects. While we showed earlier in Sect. 4.2.1 that there does not appear to be any bulk difference between the two populations in this regard in terms of integrated dust content, differing dust geometry may still differentially affect the measurements of the two samples. As such, we limit ourselves here to the most basic of morphological quantities and observations. Plotted in Fig. 12 are the HST/F814W postage stamps of the nine PCl J1001+0220 member galaxies. Given in the bottom lefthand corner of each postage stamp is the effective circularized radius (r e,circ ) obtained from GALFIT for each galaxy following the methods of Ribeiro et al. (2016). Proto-cluster members are observed to span a wide variety of FUV morphologies from irregular diffuse galaxies with multiple clumps to compact, high surface-brightness galaxies. Correspondingly, the observed r e,circ values span nearly an order of magnitude from 0.3 to 2.2 kpc with an average of 1.2 kpc. The (luminosity-)dominant stellar populations probed in these images are those associated with the current star formation activity in member galaxies. Thus, the vast diversity of morphologies and r e,circ values unambiguously demonstrate that while these galaxies appear to share a common overarching environment, their stellar mass assembly W 5×5 ′′ (∼33×33 kpc) postage stamps of the nine secure spec-z members of PCl J1001+0220. At the redshift of the st exactly corresponds to the rest-frame FUV band. The redshift, SFR, circularized effective radius (r e,circ ), and stellar wn in each panel. The nine galaxies exhibit a large diversity of morphologies, from a single, compact (r e,circ < 0.5 ′′ ) o larger (r e,circ ∼ 1 − 2 ′′ ), multi-component diffuse structures devoid of concentrated light. Many of the proto-cluster single or multiple projected companions, a rate which possibly exceeds that of the coeval field (see text). rd in terms of integrated dust content, may still differentially affect the meaples. As such, we limit ourselves here hological quantities and observations. are the HST/F814W postage stamps of 0 member galaxies. Given in the boteach postage stamp is the effective cirbtained from GALFIT for each galaxy of Ribeiro et al. (2016b). Proto-cluster to span a wide variety of FUV mordiffuse galaxies with multiple clumps -brightness galaxies. Correspondingly, es span nearly an order of magnitude an average of 1.2 kpc. The (luminositytions probed in these images are those ent star formation activity in member iversity of morphologies and r e,circ valnstrate that while these galaxies appear arching environment, their stellar mass ng in a common manner. This large vagies and r e,circ values is, however, also UDS field population at these redshifts hile it should be noted that a stellar on the sample in Ribeiro et al. (2016b), made on galaxies in their lower redthe galaxies in redshift bin referenced ighest redshift studied in Ribeiro et al. compared to the coeval field sample. V morphologies in all cases is perhaps build up the average stellar mass of member and coeval field galaxies.
While the same range in visual and (one form of) parametric morphologies is observed in both the coeval field and the proto-cluster member population, the latter has a σ N MAD ∼ 25% higher and in a sample comprised of only nine galaxies. To test the likelihood of the dispersion in the observed size distributions occurring by random sampling of the VUDS field galaxies at this redshift, we randomly drew 1000 samples of nine galaxies from the r e,circ distribution observed in Ribeiro et al. (2016b) for the redshift bin most comparable to our own. In only ∼10% of the trials did the variance of the r e,circ values in the randomly drawn sample exceed that of the proto-cluster member sample. This result is identical if we instead perform this exercise drawing from a Gaussian fit to the distribution of r e,circ values of the coeval field sample. However, because galaxy size is correlated with the mass of their stellar content (e.g., Allen et al. 2015), it is perhaps the potentially disparate stellar mass distribution of the member population with respect to the field that drives the observed scatter (see §4.2.1). Further, the lack of an azimuthally symmetric profile for a large fraction of both the member and coeval field samples, a phenomenon common to galaxies at high redshift (e.g., Mortlock et al. 2013;Guo et al. 2015), likely decreases the utility of both the r e,circ measurement and the scatter on this quantity. To mitigate both issues, an effective non-parametric half-light radius, r 50 T , was measured for each galaxy in both the member and coeval field samples following the methodology of Ribeiro et al. (2016b) and normalized in such a way as to remove the correlation between size and stellar mass (see Fig. 13). None of the results in this section Fig. 12. Hubble/ACS F814W 5 × 5 (∼ 33 × 33 kpc) postage stamps of the nine secure specz members of PCl J1001+0220. At the redshift of the proto-cluster, F814W almost exactly corresponds to the rest-frame FUV band. The redshift, SFR, circularized effective radius (r e,circ ), and stellar mass for each galaxy is shown in each panel. The nine galaxies exhibit a large diversity of morphologies, from a single, compact (r e,circ < 0.5 ) bright star-forming clump to larger (r e,circ ∼ 1− 2 ), multi-component diffuse structures devoid of concentrated light. Many of the proto-cluster members are observed with single or multiple projected companions, a rate which possibly exceeds that of the coeval field (see text).
is not proceeding in a common manner. This large variety of visual morphologies and r e,circ values is, however, also shared by the general VUDS field population at these redshifts (Ribeiro et al. 2016). While it should be noted that a stellar mass limit was imposed on the sample in Ribeiro et al. (2016), this imposition was only made on galaxies in their lower redshift bins meaning that the galaxies in redshift bin referenced here, 3.5 ≤ z ≤ 4.5, the highest redshift studied in Ribeiro et al. (2016), can be validly compared to the coeval field sample. Such a large diversity in UV morphologies in all cases is perhaps not unexpected for galaxies at these redshifts as either continuous moderate-level (∼20 M yr −1 ) or shorter-lived, vigorous star formation over the course of the last ∼1 Gyr is required to build up the average stellar mass of member and coeval field galaxies.
While the same range in visual and (one form of) parametric morphologies is observed in both the coeval field and the proto-cluster member population, the latter has a σ N MAD ∼ 25% higher and in a sample comprised of only nine galaxies. To test the likelihood of the dispersion in the observed size distributions occurring by random sampling of the VUDS field galaxies at this redshift, we randomly drew 1000 samples of nine galaxies from the r e,circ distribution observed in Ribeiro et al. (2016) for the redshift bin most comparable to our own. In only ∼10% of the trials did the variance of the r e,circ values in the randomly drawn sample exceed that of the proto-cluster member sample. This result is identical if we instead perform this exercise drawing from a Gaussian fit to the distribution of r e,circ values of the coeval field sample. However, because galaxy size is correlated with the mass of their stellar content (e.g., Allen et al. 2015), it is perhaps the potentially disparate stellar mass distribution of the member population with respect to the field that drives the observed scatter (see Sect. 4.2.1). Further, the lack of an azimuthally symmetric profile for a large fraction of both the member and coeval field samples, a phenomenon common to galaxies at high redshift (e.g., Mortlock et al. 2013;Guo et al. 2015), likely decreases the utility of both the r e,circ measurement and the scatter on this quantity. To mitigate both issues, an effective non-parametric half-light radius, r 50 T , was measured for each galaxy in both the member and coeval field samples following the methodology of Ribeiro et al. (2016) and normalized in such a way as to remove the correlation between size and stellar mass (see Fig. 13). None of the results in this section change appreciably if we instead adopt r 100 T , the effective non-parametric radius containing 100% of the flux of each galaxy. Performing a similar exercise as was done for the r e,circ distributions again on those of r 50 T /M α results in ∼10% of trials exceeding the observed variance of proto-cluster member population, a result which mildly suggests more stochasticity in the assembly stage of member galaxies relative to the field.
Assuming this increased scatter to be real, we explore here the possibility of increased galaxy-galaxy interactions and potential merging activity within the proto-cluster as possible driving mechanisms for this increased stochasticity. In order to estimate the level and type of galaxy-galaxy interactions experienced by the average galaxy in the member and coeval field samples, for each galaxy in each sample we counted the number of unique objects in the magnitude range 24 ≤ F814W ≤ 26 other than the galaxy itself whose centroids lay within r pro j < 25 kpc from the galaxy center. This radius was adopted as it is commonly used for galaxy pair studies and has been found through simulations and observations to contain galaxies with a high likelihood of eventual coalescence (see e.g., Le Fèvre et al. 2000;de Ravel et al. 2009;Lotz et al. 2011;López-Sanjuan et al. 2013, 2015 andreferences therein). It is worth noting that galaxy-galaxy interactions can have visible effects over much larger separations (r pro j ∼ 50-100 kpc, e.g., Patton et al. 2011) and can still merge given a long enough timescale (3-8 Gyr, Kitzbichler & White 2008). Here, however, we are concerned only with the most severe of interactions and thus limit ourselves to the more restrictive  T , as measured in the Hubble/ACS F814W imaging of spec-z member galaxies of PCl J1001+0220 compared to that of the coeval (red and blue histograms, respectively). As in previous figures, histograms are normalized to have equal areas. Due to the relationship between size and stellar mass, size measurements are modulated by the estimated stellar mass of each galaxy (M α ⋆ ) where α is derived from a fit to the observed size-stellar mass relation in the combined proto-cluster and coeval field sample (α = 0.18±0.03). The average value for both samples is identical within the uncertainties, though a slight increase (∼ 25%) in the spread of the proto-cluster distribution is observed which we argue in §4.2.3 that, if real, may be due to increased galaxy-galaxy interactions. been found to have a complex relationship with environment in the few studies with a reasonable chance to test this possibility. In some cases an enhancement of the Lyα feature has been found. Larger Lyα-emitting regions were found to be associated, on average, with Lyα emitters (LAEs) situated in overdense environments (Matsuda et al. 2012;Momose et al. 2016;Chiang et al. 2015). It has been suggested that the presence of such overdensities are possibly a requisite condition in the higher redshift universe in order to observe multiple strong, physically associated LAEs (e.g., Castellano et al. 2016). Indeed, some proto-cluster or proto-cluster candidates have been found solely through the presence of large numbers of relatively strong LAEs (e.g., Pentericci et al. 2000b;Venemans et al. 2002b;Kuiper et al. 2011;Chiang et al. 2015) with some studies finding an increase of strong narrowband-selected LAEs with increasing local galaxy density (e.g., Zheng et al. 2016). Such an increase in the size of Lyα halos could be related to both the column density and kinematics of the neutral hydrogen in and around the galaxies (see Guaita et al. 2017 and references therein) and, thus, a possible signature of a nascent medium.
However, other results have found a lack of enhancement or suppression of Lyα amongst galaxies in the higher-density environments. Narrowband-selected LAEs are found in general to be less clustered than those galaxies without strong Lyα emission at high redshifts (e.g, Ouchi et al. 2010;Bielby et al. 2016), though this trend is predicted to be a strong function of redshift and Lyα line luminosity (Orsi et al. 2008). In a study of the member galaxies of a z ∼ 3.3 proto-cluster, Lemaux et al. (2014a) found no definitive evidence for differences in individual or coadded Lyα measurements compared to a corresponding coeval field population. In a study of a narrow-band-selected T , as measured in the Hubble/ACS F814W imaging of spec-z member galaxies of PCl J1001+0220 compared to that of the coeval (red and blue histograms, respectively). As in previous figures, histograms are normalized to have equal areas. Due to the relationship between size and stellar mass, size measurements are modulated by the estimated stellar mass of each galaxy (M α ) where α is derived from a fit to the observed size-stellar mass relation in the combined proto-cluster and coeval field sample (α = 0.18 ± 0.03). The average value for both samples is identical within the uncertainties, though a slight increase (∼25%) in the spread of the proto-cluster distribution is observed which we argue in Sect. 4.2.3 that, if real, may be due to increased galaxy-galaxy interactions.
criterion. The bright end of the magnitude limit imposed corresponds to M FUV < ∼ −22.3 at z ∼ 4.5 (∼3L * ), a population extremely unlikely to be observed with any abundance in the small areas searched here (Bouwens et al. 2015). The fainter end of the magnitude range given above was set as a rudimentary attempt to control for surface brightness effects and detection limits as F814W ∼ 26 corresponds to the 50% completeness limit for r h = 0.25 sources ). Finally, because the ACS observations probe the rest-frame FUV, there is some concern that confusion could occur between multiple distinct galaxies and bright star-forming regions contained within a single galaxy. The method for disambiguating these cases in VUDS is discussed in detail in Ribeiro et al. (2017).
Subsequent to these cuts, the simplest possible comparison is made: the average number of companions per galaxy in the two samples corrected by the average density of sources in this magnitude range in the COSMOS2015 catalog. In total, 0.4 ± 0.3 and 0.1 ± 0.1 background-corrected projected companions per galaxy are found for the proto-cluster member and coeval field sample, respectively. A discrepancy of similar directionality and significance is seen when comparing the z phot member and field sample, 0.17 ± 0.08 versus 0.08 ± 0.02, respectively. While these results depend on the choice of the magnitude and r pro j criteria adopted, the cuts chosen here are the most conservative that could be made with respect to projection effects for which this analysis could reasonably be performed. For these choices, the analysis is suggestive of possible elevated levels of strong galaxy-galaxy interactions within the proto-cluster bounds. This line of thought will be followed further at the conclusion of future followup spectroscopic observations of this and other VUDS proto-clusters.

Spectral properties
The final comparison between proto-cluster members and their field counterparts comes in measuring rudimentary individual and stacked (hereafter "coadded") spectral properties. We begin with the former and concentrate on the features most commonly studied for high-redshift galaxies. The Lyα feature has been found to have a complex relationship with environment in the few studies with a reasonable chance to test this possibility. In some cases an enhancement of the Lyα feature has been found. Larger Lyα-emitting regions were found to be associated, on average, with Lyα emitters (LAEs) situated in overdense environments (Matsuda et al. 2012;Momose et al. 2016;Chiang et al. 2015). It has been suggested that the presence of such overdensities are possibly a requisite condition in the higher redshift Universe in order to observe multiple strong, physically associated LAEs (e.g., Castellano et al. 2016). Indeed, some proto-cluster or proto-cluster candidates have been found solely through the presence of large numbers of relatively strong LAEs (e.g., Pentericci et al. 2000;Venemans et al. 2002;Kuiper et al. 2011;Chiang et al. 2015) with some studies finding an increase of strong narrowband-selected LAEs with increasing local galaxy density (e.g., Zheng et al. 2016). Such an increase in the size of Lyα halos could be related to both the column density and kinematics of the neutral hydrogen in and around the galaxies (see Guaita et al. 2017 and references therein) and, thus, a possible signature of a nascent medium.
However, other results have found a lack of enhancement or suppression of Lyα amongst galaxies in the higher-density environments. Narrowband-selected LAEs are found in general to be less clustered than those galaxies without strong Lyα emission at high redshifts (e.g., Ouchi et al. 2010;Bielby et al. 2016), though this trend is predicted to be a strong function of redshift and Lyα line luminosity (Orsi et al. 2008). In a study of the member galaxies of a z ∼ 3.3 proto-cluster, Lemaux et al. (2014a) found no definitive evidence for differences in individual or coadded Lyα measurements compared to a corresponding coeval field population. In a study of a narrow-band-selected z = 3.78 protocluster, Dey et al. (2016) found no environmental dependence of EW(Lyα) between proto-cluster LAE candidates and coeval field candidate LAEs, though a marginal increase in the average Lyα luminosity was seen closer to the core of the proto-cluster. In a spectroscopic study of a proto-cluster LAEs at z ∼ 3.67, Toshikawa et al. (2016) found proto-cluster member galaxies to exhibit less negative EW(Lyα) relative to galaxies in a similarly selected field sample indicating a suppression of the Lyα escape fraction in the proto-cluster.
Here we attempt two basic comparisons. In the first comparison we plot in the left panel of Fig. 14 the coadded spectrum of all nine PCl J1001+0220 members against that of the coeval field sample. These coadded spectra are generated following the methodology of Lemaux et al. (2014a) and show the unit-weighted (normalized) average (mean) spectra of the two populations. The wavelength region in the left panel of Fig. 14 is tailored such that only those wavelengths where all galaxies contribute the coadded spectra for each population are shown. Immediately it can be seen that, while the overall continuum shape is similar between the two coadded spectra, a fact which bolsters the β-slope analysis presented in Sect. 4.2.1, there appears a noticeable difference in the strength of the emission component of the Lyα line. We use a custom bandpass IDL routine to measure the EW(Lyα) of both coadded spectra, which, similarly to Cassata et al. (2015), uses the continuum just redward of Lyα Fig. 14. Left panel: unit-weighted (normalized) mean rest-frame VUDS spectra of the nine spec-z members of the PCl J1001+0220 proto-cluster plotted against that of the 53 coeval field galaxies (one galaxy was removed due to the presence of severe reduction artifacts). The requisite stellar mass limit for each coaddition is given in the figure panel. Each spectrum is smoothed with a Gaussian of FWHM = 1.5 pixels (1.45 Å). Important absorption and emission features frequently observed in VUDS spectra are marked by vertical lines. Clear differences can be observed between the two spectra, with the proto-cluster members exhibiting, on average, reduced Lyα emission and exaggerated absorption in Hydrogen and various metal lines relative to the average coeval field galaxy. In both cases, a significant (∼300 km s −1 ) offset to the red can be seen for the Lyα line potentially indicating that large-scale outflows are pervasive in galaxies at this redshift. Right panel: distribution of rest-frame Lyα equivalent widths (EWs) for the proto-cluster (red histogram) and coeval field sample (blue histogram) measured on individual spectra. Negative or positive EW indicates the line is seen in emission or absorption, respectively. The two distributions are inconsistent at >3σ, the Lyα feature being more likely to be seen in absorption for the proto-cluster members.
to set the continuum level. For individual measurements of EW(Lyα), this routine returns measurements which are broadly consistent with those of Cassata et al. (2015). Using this method we measure EW(Lyα) = −4.7 ± 0.6 and −15.4 ± 0.3 Å for the proto-cluster member and coeval field galaxies, respectively. We note that these cannot be directly compared to the mean values of the individual measurements due to the fact that each spectrum has its average flux density scaled to unity prior to coadding. Uncertainties on mean EWs are derived from a combination of the covariance matrix of the linear continuum fit and the error spectrum of the coadded spectra constructed from the individual VIPGI error spectra of the input galaxies. Though there are some known issues with the latter, neither EWs or their associated uncertainties are found to change significantly if a running measure of the root mean square fluctuations is instead used. The suppression of Lyα relative to the level of the UV continuum amongst the proto-cluster galaxies is clearly significant, a difference which persists in significance and directionality if we instead perform unit-weighted median coadding. In the right panel of Fig. 14 we plot the distributions of the individual measurements of EW(Lyα) as measured by Cassata et al. (2015) for the proto-cluster member and coeval field samples used to construct the coadded spectra. By eye it appears that the protocluster members are skewed to less negative EW(Lyα) values with respect to those of the coeval field sample, an observation which is borne out both by a KS test (rejection of null hypothesis at ∼99%) and by a comparison of the mean values (−5.5 vs. −37.6 Å, respectively).
It is important to note, though, that, as we do not preserve flux density in our coaddition method, this suppression cannot necessarily be interpreted as a suppression of Lyα line luminosity. For this to also be true, the dust-uncorrected FUV luminosity between the two samples must be similar. Both a KS test and a comparison of the average of the M FUV values of the proto-cluster member and coeval field galaxies (M FUV = −21.77 ± 0.18 and −21.60 ± 0.06, respectively) show no significant offset between the two samples. Echoing this result, a KS test performed directly on the Lyα line luminosity measurements, made following the method of Cassata et al. (2015), rejects the hypothesis that the two distributions are drawn from the same underlying parent sample at >99%. It appears that the suppression of Lyα relative to the UV continuum observed in the proto-cluster members relative to the coeval field galaxies extends to an absolute suppression of Lyα luminosity. Given that the two populations appear to have similar dust properties and UV-derived SFRs, and because the absolute brightness of Lyα in the absence of resonant scattering and dust is directly proportional to the amount of star formation, this decrease can be translated directly into the fraction of Lyα photons which escape the average galaxy in the two populations. Adopting the Lyα SFR conversion of Lemaux et al. (2009) where M FUV is the (linear) average M FUV of the sample, EW(Lyα) is measured in the coadded spectra, and the constant of proportionality comes from a combination of that given in Lemaux et al. (2009), 4πd 2 L , where d L is 10 pc in cm, c is cm s −1 , and the λ e f f of the FUV filter (1539 × 10 −8 cm, Boselli et al. 2011). We then define the escape fraction of Lyα as: f esc, Lyα = S FR(L Lyα )/S FR S ED , where the SFR S ED are the average values given in Sect. 4.2.1. For the full VUDS sample in COSMOS at 2 ≤ z ≤ 5 with flags ≥X2 we have confirmed that, as EW(Lyα) becomes more negative, that is, more Lyα flux for a given UV continuum level, f esc, Lyα approaches 100% using the above formalism. The resulting values for the proto-cluster member and coeval field sample are A77, page 21 of 27 A&A 615, A77 (2018) f esc, Lyα = 1.8 +0.3 −1.7 % and 4.0 +1.0 −0.8 %, respectively. The coeval field value is broadly consistent with the global f esc, Lyα at 4 < z < 5 as derived through comparisons between Lyα and UV luminosity functions (Hayes et al. 2011;Zheng et al. 2013) or other measures (e.g., Wardlow et al. 2014;Cassata et al. 2015 though these measurements were limited to LAEs only), especially considering the unknown uncertainties of differing sample selections and measurement schemes. The average f esc,Lyα for proto-cluster members appears, however, at the extreme low end of measured values at these redshifts.
A variety of factors can increase the chance of the escape of Lyα photons or the appearance thereof: geometric effects, observational bias, dust content and geometry, strong outflows, and decreased column densities of neutral hydrogen along the line of sight. Because we are averaging over a reasonably large number of galaxies in each sample, geometrical effects (see e.g., Finkelstein et al. 2008), are likely not the cause of the observed difference. Similarly, because we are comparing galaxy samples with the same mean redshift observed in an identical manner, it is unlikely that differing measurement apertures, a pernicious effect when measuring f esc, Lyα (e.g., Matthee et al. 2016), affect our measurements here. Further, we have shown through a variety of methods that the dust content between the two samples is likely to be the same. Therefore, we are left with only two possibilities: increased prevalence and strength of outflows amongst the coeval sample or decreased H I column density. It has been shown that the presence of outflows and their strength depend, in the absence of strong AGN activity, on the instantaneous SFR (e.g., Heckman 2001;Weiner et al. 2009;Erb 2015), a quantity which, as far as we can tell, is the same between the two samples. In addition, we see no difference in the velocity offset of the Lyα line in the coadded spectra with respect to the ISM lines, 310 versus 270 km s −1 for the coeval field and proto-cluster member coadded spectra, respectively, which suggests similar H I kinematics between the two galaxies. However, this test is far from definitive as the relationship between these components is not necessarily straightforward (e.g., Shapley et al. 2003;Guaita et al. 2017).
What remains, then, is the intriguing possibility that the proto-cluster galaxies see a denser medium in their immediate vicinity or a more isotropic medium resulting in a higher covering fraction of H I. Such conditions would cause a subsequent drop of Lyα photons able to escape (e.g., Reddy et al. 2016) due to their increased possibility of interacting with a dust grain for a given dust content and geometry. The presence of a large and dense neutral medium is a possibility which has been raised previously with the observation of large scale Lyα absorption in another VUDS proto-cluster (Cucciati et al. 2014) and in Lyα tomography studies (e.g., Lee et al. 2016) and is consistent with the idea of the assembly of a nascent proto-ICM. Assuming a one-to-one correspondence between increased H I and an increase in heavy elements, the possibility of denser medium is also broadly consistent with the relative strength ISM absorption features, as the average ratio between EWs measured on the proto-cluster member and coeval field coadded spectra for the six ISM absorption features labeled in Fig. 14 is 1.7 ± 0.2, measured using the bandpass method of Lemaux et al. (2010) with custom bandpasses. From comparing Gaussian fits to each of the features in the two coadditions we find that the increase in the average EW is driven almost entirely by an increased (relative) depth of the absorption feature rather than by an increase in the feature width. Since all ISM features should be unresolved in VUDS data, this lack of appreciable difference in the widths measured in the ISM features of the two coadded spectra strongly indicates that the increased absorption strength in the cluster member coadded spectrum is real. However, the relationship between ionic column density and ISM absorption strength is a complicated one (see Talia et al. 2012 and references therein) and, as such, while suggestive, these measurements do not offer definitive proof that the galaxies inhabiting the proto-cluster are experiencing a denser medium. Such lines of inquiry will continue to be followed for the entire sample of VUDS protostructures. It may be, though, that observations of cold gas from, for example, the Atacama Large Millimeter/submillimeter Array will be needed to definitively confirm or deny this assembly, a prospect which may have already begun to be realized (Ginolfi et al. 2016;Dannerbauer et al. 2017).

Summary and conclusions
In this study we reported on the detection, spectroscopic confirmation, and the mapping of the PCl J1001+0220 proto-cluster at z ∼ 4.57 in the COSMOS field. We undertook a preliminary investigation of various properties of the spectroscopic and photometric proto-cluster member galaxies and contrast these with matched field samples. The following were the main conclusions of these investigations: -PCl J1001+0220 was discovered in a blind spectroscopic survey without the use of traditional beacons of proto-structure or proto-cluster activity such as quasars or other prodigious radio emitters. Its large extent on the plane of the sky is consistent with both predicted and observed sizes of high-redshift proto-clusters. -The galaxy overdensity of the proto-cluster was measured using three methods: spectroscopic overdensity, photometric redshift overdensity, and a new method, Voronoi Monte-Carlo mapping, which statistically combines both the spectroscopic and photometric redshifts. While the level of overdensity was found to be dependent on the method, in all three methods PCl J1001+0220 was significantly detected. The overdensity derived from the Voronoi Monte-Carlo mapping was considered the most secure, with PCl J1001+0220 exhibiting an average δ gal = 3.30 ± 0.32 over a sky area of 7.58 Mpc 2 and a line-of-sight distance of 7.49 Mpc. -Several methods were attempted to measure the total mass associated with PCl J1001+0220. Though subject to a large number of assumptions and uncertainties, this analysis provided a general picture that PCl J1001+0220 will evolve into a cluster of mass ∼3-10 ×10 14 M by the present day, with the mass estimated from the overdensity derived from the Voronoi Monte Carlo approach falling at the low end of these estimates. -When comparing various member populations to similarly selected coeval field samples we found tentative evidence for a fractional excess of older and more massive galaxies within the proto-cluster bounds suggesting that the SFHs of galaxies in the proto-cluster environment differ appreciably, on average, from those in the field. -A variety of methods were used to attempt to quantify the level of AGN activity amongst the various member samples using some of the deepest multiwavelength data ever compiled. In contrast with studies of many other high-redshift proto-clusters, no indication was found for the presence of any type of AGN within the proto-cluster. -No evidence was found for either suppression or enhancement of star formation activity within the proto-cluster bounds through comparison of dust-corrected rest-frame UV star-formation indicators. Through cross-matching and stacking newly obtained deep JVLA 3 GHz imaging of the COSMOS field for both the proto-cluster member and coeval field samples it was demonstrated that neither environment contained a large number of dusty starbursting galaxies. Limits on the stacking analysis and the distribution of measured β slopes precluded the possibility of copious amounts of dust in the average galaxy in both populations. -The rest-frame FUV morphologies from the HST/F814W mosaic of the COSMOS field for the proto-cluster member and coeval field population were compared visually, parametrically, and non-parametrically. The nine z spec members spanned the entirety of all quantifiable morphological measurements of the corresponding coeval field sample which suggested, in tandem with a possible increase in the average number of background-corrected close pairs in both the z spec and z phot member sample, a potential enhancement of galaxy-galaxy interactions within the proto-cluster. -The individual and stacked spectral properties of the member and coeval field galaxies were contrasted. The member population exhibited a decided suppression in the strength of the Lyα emission feature relative to the coeval field as well as a decrease in the estimated average Lyα escape fraction (1.8 +0.3 −1.7 % vs. 4.0 +1.0 −0.8 %, respectively). Additionally, the average absorption of ISM features in the stacked spectra was markedly stronger for the proto-cluster members. These lines of evidence were used to suggest the possible presence of a large and dense medium housed in the proto-cluster.
Despite the relatively small sample assembled here, the observed properties of galaxies in PCl J1001+0220 tentatively demonstrate that local environment may already impact galaxy evolution and that the effects which give rise to the massive quiescent galaxies that dominate local galaxy clusters may begin to be felt as early as just 1.3 Gyr after the Big Bang. Further, the simple presence of PCl J1001+0220 and the inference of the large amount of mass assembled in the proto-cluster by z ∼ 4.57 is a clear indication that such overdensities were able to begin to distinguish themselves from average density regions of the Universe well before the end of the epoch of reionization. While the existence and nature of PCl J1001+0220 was proven definitively in this study, many of the results pertaining to galaxy evolution within PCl J1001+0220 presented in this work do not have the statistical weight to be anything other than suggestive. That we were able to even attempt such lines of analysis in some sort of controlled manner at these redshifts already constitutes a triumph. It will be up to further spectroscopic observations of PCl J1001+0220, followup observations at other wavelengths, and future works using the full power of the VUDS survey or other large-scale spectroscopic surveys, such as the Subaru Prime Focus Spectrograph Galaxy Evolution Survey, to determine whether the tentative signs of early-onset environmentally driven evolution seen here are legitimate hallmarks of the general population of high-redshift proto-clusters. as was, for example, done for the spectral sample is not needed as once we impose a stellar mass cut above which the data are highly complete, the resulting sample will be, by definition, representative of the overall galaxy population at these redshifts and above the stellar mass limit imposed. In order to determine the appropriate stellar mass cut for the z phot sample we choose not to adopt the limits of Laigle et al. (2016), log(M * /M ) ≥ 10.7 and 10.3 in the UltraVISTA Deep and UltraDeep regions, respectively, as these limits are based solely on the K s band magnitude at which 90% of all sources could be recovered. In order to estimate the stellar mass limit of our (broadly) [3.6]-selected z phot sample we adopted approach identical to that employed in Quadri et al. (2012) and Tomczak et al. (2014). This approach is as follows.
We began by selecting all objects in the same z phot range which were ∼2.5× brighter in flux density (i.e., 1 mag) than the 3σ magnitude limit at [3.6] within a narrow window of ±0.1 mags. Because of their brightness relative to the 3σ detection limit, essentially all objects at these magnitudes in this z phot range should be detected in the COSMOS2015 catalog. The stellar mass distribution of the resultant sample was then scaled down by exactly the same factor as the ratio of the average flux density of the sample and the flux density corresponding to the 3σ completeness limit. By scaling in such a way, we artificially reproduce the stellar mass distribution (though not the number) of objects appearing right at the 3σ magnitude limit modulo changes in the average M * /L ratio. Since we are working with a sample almost exclusively comprised of star-forming galaxies by virtue of the redshift considered, a constant M * /L as a function of stellar mass within the range considered here is at least a plausible assumption. The stellar mass completeness limit (hereafter M lim ) was then defined as the stellar mass for which 80% of the objects in the scaled stellar mass distribution were more massive. In other words, since the scaled mass distribution is intended to represent the true stellar mass distribution at the 3σ completeness limit of the image, and since objects with lower stellar masses are more likely to be missed given the selection band and the redshift, removing the bottom 20% of scaled stellar masses returns the 80% M lim . This exercise resulted in a M lim of log(M * /M ) ≥ 9.5, a limit which was not sensitive to the exact magnitude range chosen in this exercise. This limit corresponds to the stellar mass at which the number counts of all z phot objects in the COSMOS2015 catalog in this redshift range subject to the constraints given in Sect. 3.3 begin to turn over. Additionally, this number is similar to the recent M lim estimated by Davidzon et al. (2017) at the same redshift for a [3.6]-selected COSMOS2015 sample using entirely different methodologies.
Photometric redshift members (hereafter z phot members) were defined as objects which satisfy all above criteria and fall within R pro j ≤ 3 Mpc from the spectral-number-weighted center (for the rationale behind this choice see Sect. 3.2). There are 82 such objects. The remaining 1975 objects comprise the z phot coeval field sample. We note that the (z spec ) members and coeval field galaxies are not removed from these samples as to do so would impose unnecessary spatial and color bias on the z phot sample. For all comparisons in this section and throughout the remainder of the paper, except where otherwise noted, we use the term average synonymously with median, preferred over the mean to minimize the effects of outliers in what are admittedly relatively sparsely sampled distributions, and denote the median of any parameter by the notationx. Uncertainties on the median are in all cases given by σ N MAD / √ n − 1 (see Müller 2000 for a discussion on adopting this type of estimate on the uncertainty on the median). We proceed, for most comparisons throughout the remainder of the paper, by diligently testing differences in various subsets of galaxies by both one-dimensional KS tests and by comparing medians and their associated uncertainties. These are broadly conservative choices, as if we instead use the mean as an estimator of the average value or if we substitute onedimensional KS tests with Student's t or a Mann-Whitney tests the significance of differences between subsets are generally seen to increase.