Open Access
Issue
A&A
Volume 677, September 2023
Article Number A184
Number of page(s) 39
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202245581
Published online 25 September 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

The galaxy stellar mass function (SMF) is defined as the number density of galaxies Φ(ℳ, z) in bins of stellar mass Δℳ at each redshift z, and is a fundamental cosmological observable in the study of the statistical properties of galaxies. Understanding its shape and evolution ultimately informs us about the growth of the baryonic content of the Universe, and can help infer the star formation activity and the availability of cold molecular gas across cosmic time. Its integral over ℳ is the galaxy stellar mass density (SMD) ρ*(z), which describes the cumulative mass assembled by a given epoch.

Remarkable progress has been made since the first mass-selected measurements of the local z ≈ 0 SMF by Cole et al. (2001). The intervening years have been marked by order-of-magnitude increases in sample size, photometric precision, and redshift accuracy which have enabled more precise determinations of the local SMF as well as groundbreaking extensions to increasingly higher redshifts (e.g., Fontana et al. 2006; Marchesini et al. 2009; Drory et al. 2009; Peng et al. 2010; Ilbert et al. 2010, 2013; Pozzetti et al. 2010; Mortlock et al. 2011; Vulcani et al. 2013; Muzzin et al. 2013; Grazian et al. 2015; Mortlock et al. 2015; Duncan et al. 2014; Song et al. 2016; Bernardi et al. 2017; Davidzon et al. 2017; Wright et al. 2018; Santini et al. 2021, 2022; Stefanon et al. 2021; Adams et al. 2021; Thorne et al. 2021; McLeod et al. 2021).

A coherent picture has emerged from these studies. For instance, the shape of the SMF for star-forming galaxies at z < 3 is well described by the empirically constructed Schechter function (Schechter 1976), which features an exponential downturn at a characteristic stellar mass ℳ*, with a low-mass end that declines with a slope α; both seem to remain constant out to z ≈ 2 (Marchesini et al. 2009; Ilbert et al. 2013; Tomczak et al. 2014; Davidzon et al. 2017; Adams et al. 2021). This constancy in the shape of the SMF implies that star formation activity has acted consistently to transform baryons from cold gas into stars, and appears to have formed > 75% of the total stellar mass of the Universe in ≈10 Gyr. Only the normalization of the SMF, Φ*(z), of star-forming galaxies has been confidently shown to evolve with cosmic time, and its rapid evolution at earlier times points directly to enhanced rates of mass growth at earlier epochs (Popesso et al. 2023, and references therein).

Which physical processes are responsible for the behavior of the SMF, and the extent of their influence at different cosmic epochs, are not well understood even in the low redshift Universe (z < 1). Examples include feedback from supermassive black holes, supernovae, galaxy-galaxy mergers, stellar and gas dynamics, and gaseous inflows and outflows, all of which act on varying scales both physically and in time. As a result, significant effort has been undertaken to build detailed cosmological simulations to identify and understand the role of physical processes that underpin the observed shape and evolution of the SMF (e.g., Somerville et al. 2015; Furlong et al. 2015; Lagos et al. 2018; Laigle et al. 2019; Pillepich et al. 2018; Davé et al. 2019; Lovell et al. 2021). Hence, precise measurements of the SMF and SMD are key observables utilized by most (but not all, see e.g., Dubois et al. 2014) large-scale cosmological simulations to tune input parameters such that the SMF of the local Universe (z ≈ 0) is recovered. Comparisons of predicted and observed SMFs at earlier cosmic ages can point to the physical processes at play and/or suggest recipes in need of refinement (Torrey et al. 2014; Furlong et al. 2015).

Owing to the challenges associated with large-scale simulations, purely analytical, data-driven models have enjoyed great popularity with the introduction of the first large galaxy samples with photometric redshifts (photo-z). For example, Peng et al. (2010) constructed a phenomenological model to explain the bimodality of galaxy types (star-forming and quiescent) as seen from their mass functions, megaparsec-scale environment, and star formation activity. That is, as a consequence of two mechanisms driving star formation cessation (often referred to as “quenching”): massive galaxies cease forming stars irrespective of environment (“mass quenching”), and galaxies in dense environments cease forming stars irrespective of mass (“environmental quenching”). Several discrete mechanisms have been proposed to explain the latter, such as ram pressure stripping (e.g., Gunn & Gott 1972), gas strangulation (e.g., Larson et al. 1980; Balogh et al. 2000), and dynamical heating of gas within haloes (e.g., Gabor & Davé 2015). Proposed mechanisms for the former must, by definition, involve secular processes. This includes star formation cessation due to structural changes, or heating and/or ejection of gas by central super-massive black holes for the most massive galaxies – radiative feedback from active galactic nuclei (AGN) – or by supernovae for less massive systems as they are more weakly self-gravitating (e.g., Shankar et al. 2006). Peng et al. (2010) hypothesized that while environmental effects reproduce the single Schechter shape observed for star-forming, blue galaxies in the local Universe, it is through a combination of both environmental and mass effects that the two-component Schechter shape observed for quiescent, red galaxies is reproduced. A wide variety of extensions have been applied to this model to directly incorporate other measurements such as the gas fraction (Bouché et al. 2010), and wholly new models continue to be developed (e.g., Peng et al. 2015; Belli et al. 2019; Suess et al. 2021; Varma et al. 2022).

The observed distributions of stellar mass and redshift are not in themselves the intrinsic, underlying SMF. Instead, they are bridged by statistical inferences between carefully selected samples and a vast, practically unobservable parent population from which a sample is taken. Insufficient control of biases and systematics obscure such inferences by creating unrepresentative samples, and hence weaken comparisons with simulation and analytical models (Marchesini et al. 2009; Fontanot et al. 2009). Great effort has been expended over the past 20 years to strip away a number of these biases, and their obscuring effects (Cole et al. 2001; Pozzetti et al. 2010; Marchesini et al. 2009; Bernardi et al. 2017; Davidzon et al. 2017; Leja et al. 2019a; Adams et al. 2021). Examples of these biases include sample incompleteness (including Malmquist biases and mass completeness), the so-called “cosmic” variance relating to sampling over- and under-dense regions of large scale structure, and Eddington bias which acts to overestimate the number density of the most massive galaxies (Malmquist 1920, 1922; Eddington 1913). While many studies incorporate only poisson noise (Fontanot et al. 2009, although this is steadily improving with time), other notable uncertainties exist including sample variance and effective volume size, as well as uncertainties on photo-z and stellar mass; the latter two items can introduce significant bias. Since the ultimate goal of any survey is to generalize the observed properties of a specific sample to that of the entire population, ignoring any of these important biases or uncertainties severely complicate this effort.

Studies of the SMF at increasingly higher redshift have been made possible due to advances in facilities and continued investment in deep, primarily photometric surveys of the distant Universe. Building on work by Songaila et al. (1994) and Glazebrook et al. (1995), Cowie et al. (1996) secured the first mass-complete samples at z ≈ 2 − 3, selected in the near-infrared (K) to directly constrain the Balmer continuum at ∼0.6 μm. Sampled nearer the stellar bulk at ∼1 − 2 μm, their mass-selected samples enabled a higher degree of mass completeness, and in turn provided a more representative view of the high-z Universe compared to optically selected forerunner surveys. More recently, precise mass estimates from similarly selected samples have been obtained by fitting observed spectral energy distributions (Tomczak et al. 2014; Martis et al. 2016; Straatman et al. 2015), from the ground (with VISTA, UKIRT) and space (HST/WFC3, Spitzer/IRAC). Although limited in area, samples recovered by HST have continued to pose new and significant challenges to existing paradigms by highlighting a series of stark changes in the shape of the SMF that indicates earlier galaxy populations were fundamentally different from those in the present day Universe. Additional studies at these early times (z ≳ 2) by degree-scale surveys capable of finding the rarest sources have revealed the existence of surprisingly mature, massive quiescent galaxies whose mass has already been assembled by z ≈ 4 (e.g., Ilbert et al. 2013), challenging the assumed timeline typical for galaxy assembly (Steinhardt et al. 2016; Behroozi & Silk 2018). Limited numbers of them have been confirmed by detailed spectroscopic follow-up (e.g., Glazebrook et al. 2017; Schreiber et al. 2018a; Tanaka et al. 2019; Valentino et al. 2020; Forrest et al. 2020a,b). Likewise, several studies have placed constraints on the massive end of the SMF at the highest-redshifts (z > 6) from samples selected by their rest-frame UV luminosity (e.g., Song et al. 2016; Harikane et al. 2016), who utilized empirically calibrated LUV − ℳ relations to convert directly measured UV luminosity into indirect estimates of stellar mass.

Although tantalizing, deriving mass estimates from UV luminosity involves significant uncertainties; only deep infrared photometry can directly measure the rest-frame stellar bulk (λ > 4000 Å, although ideally ∼1 − 2 μm) required to more directly assess stellar mass at these redshifts. Only recently have deep (Ks ≈ 25) near-infrared photometric studies enabled the first mass-selected samples at z > 2, although with mass uncertainties increasing with redshift (Retzlaff et al. 2010; Fontana et al. 2014; Laigle et al. 2016). Importantly, photometric surveys of galaxies have – and are expected to remain – the primary means by which these measurements will be made; obtaining even elementary parameters for large (N > 100 000) samples with deep spectroscopy, while precise, is simply too expensive even when utilizing highly multiplexed instrumentation. For the time being, spectroscopy of the distant Universe will remain a follow-up exercise to strengthen larger photometrically selected samples.

Of these deep photometric surveys, those with large areas have a key advantage: they probe a wider range of environments (i.e., density) compared to narrower surveys. As such, they provide more representative samples that are significantly less likely to be affected by field-to-field cosmic variances (and, for the same reason, are more suited to find rare, massive galaxies). Although one may resort to combining disparate data sets into a single analysis to combat this (e.g., Moster et al. 2013; Henriques et al. 2015; Volonteri & Reines 2016; McLeod et al. 2021), the unknown systematics between survey selection functions and depths make the interpretation of these joined samples more uncertain.

Stitching together surveys of low- and high-z samples carries similar concerns. A lack of uniform selection owing to different survey areas, detection bands, and photo-z determination strategies (i.e., dropout selection) can likewise complicate interpretations arising from such composite samples (e.g., Leja et al. 2019a; McLeod et al. 2021; Adams et al. 2021; Santini et al. 2022). Worse, these photometric samples may have been processed with different spectral energy distribution (SED) fitting codes that assume different templates, emission line recipes, and dust attenuation laws. Differing choices of cosmology and initial mass function, while reversible, nonetheless add complexity.

Accurate estimates of the galaxy stellar mass function at increasingly higher redshifts requires complementary deep observations from near-infrared selected samples to ensure both mass completeness and data reliability. Currently the widest field with near-infrared coverage of the requisite depth to probe large samples of z ≫ 3 galaxies is the Cosmic Evolution Survey (COSMOS; Scoville et al. 2007). This work takes advantage of the latest NIR-selected catalog of the COSMOS field, COSMOS2020 (Weaver et al. 2022). We adopt the “THE FARMER” catalog whose photometry was consistently measured across all bands by fitting galaxy profiles while taking into account PSF variations, depth, and source crowding. From the resulting photo-z and stellar masses we construct a consistently measured SMF from z = 0.2 − 7.5, identify quiescent and star-forming systems, and study the build-up and assembly of stellar mass over 10 Gyr of cosmic history. This includes a detailed study of key moments in the development of galaxy populations from the Era of Reionization (z > 6) to the peak of star formation activity at Cosmic Noon (z ∼ 2), up to the rich bimodality of star-forming and quiescent galaxies observed at more recent times (z ≲ 1).

This paper is organized as follows. Section 2 introduces the dataset chosen for this analysis from which samples are drawn and possible uncertainties discussed in Sect. 3. Section 4 provides a brief overview of the Schechter function. Results are presented in Sect. 5, including the presentation and fitting of the total, star-forming, and quiescent mass functions. These are compared to literature measurements in Sect. 6, whereupon further discussion is had with regards to galaxy assembly, star formation cessation, and connections to dark matter halos. This work concludes in Sect. 7.

These results are computed adopting a standard ΛCDM cosmology with H0 = 70 km s−1 Mpc−1, Ωm, 0 = 0.3, and ΩΛ, 0 = 0.7 throughout, such that the dimensionless Hubble parameter h70H0/(70 km s−1 Mpc−1) = 1. Galaxy stellar masses, when derived from SED fitting, scale as the square of the luminosity distance (i.e., D L 2 $ D_{\rm L}^2 $); hence a factor of h 70 1 $ h_{70}^{-1} $ is retained implicitly for all relevant measurements, unless explicitly noted otherwise (see Croton 2013 for an overview of h and best practices). Estimates of stellar mass (henceforth ℳ) assume an Chabrier (2003) initial mass function (IMF). All magnitudes are expressed in the AB system (Oke 1974), for which a flux fν in μJy (10−29 erg cm−2 s−1 Hz−1) corresponds to ABν = 23.9 − 2.5 log10(fν/μJy).

2. Data: COSMOS2020

We utilize the most recent release of the COSMOS catalog, COSMOS2020 (Weaver et al. 2022), comprised of ∼1 000 000 galaxies selected from a izYJHKs coadd with near-infrared depths approaching 26 AB ensuring a mass-selected sample complete down to 109 ℳ at z ≈ 3. This sample is complemented by extensive supporting photometry ranging from the UV to 8 μm over a total area of 2 deg2, making it the widest near-infrared-selected multiwavelength catalog to these depths. This is made possible by the latest release of the UltraVISTA survey (DR4; McCracken et al. 2012; Moneti et al. 2023), which is the longest running near-infrared survey to date, and complemented by even deeper optical grizy data provided by Subaru’s Hyper Suprime-Cam instrument (HSC PDR2; Aihara et al. 2019). Flanking this core imaging are u band measurements made by the CLAUDS survey from the Canada-France-Hawaii Telescope (Sawicki et al. 2019) and deep Spitzer/IRAC imaging in all four channels, reprocessed to include nearly every exposure taken in COSMOS for use in this catalog (Euclid Collaboration 2022, and references therein). As in the previous COSMOS catalog by Laigle et al. (2016), several intermediate and narrow bands from both Subaru/Suprime-Cam and VISTA (Milvang-Jensen et al. 2013) are used to provide precise determinations of photometric redshifts.

In addition to the comprehensive set of data, photometry is performed by two methods to produce two separate catalogs. The catalog called CLASSIC contains aperture photometry extracted with Source Extractor (Bertin & Arnouts 1996) for the optical/near-infrared and with IRACLEAN (Hsieh et al. 2012) for the infrared (as in Laigle et al. 2016). The catalog called THE FARMER contains profile-fitting photometry extracted with The Farmer (Weaver et al. 2023), which uses The Tractor (Lang et al. 2016) to construct and apply parametric models to estimate source fluxes and does so consistently across more than 30 bands, including all four IRAC channels. Each photometric catalog is paired with photometric redshift and physical parameter estimates from both LePhare (Arnouts et al. 2002; Ilbert et al. 2006) and EAzY (Brammer et al. 2008) resulting in four measurements of photo-z and ℳ per source. The combination of comprehensive deep images, independent photometry techniques, and independent SED fitting enables an unparalleled control of systematics and uncertainties, and hence further insight when inferring the underlying true nature of galaxy populations. See Weaver et al. (2022) for details.

3. Sample selection, completeness, and uncertainties

3.1. Selection function

Galaxies in COSMOS2020 are selected from a near-infrared izYJHKsCHI-MEAN coadded detection image (Szalay et al. 1999; Bertin 2010). The deepest band is i, with a 3σ sensitivity limit at ∼27 mag, and the shallowest is Ks at ∼25 mag1. The reddest selection band, Ks delivers mass-selected samples to z ≲ 4.5. Furthermore, the comparably deeper Ks imaging (by 0.5 − 0.8 mag relative to COSMOS2015) translates directly into a higher degree of completeness where low-ℳ samples are more mass complete down to lower masses. Taken together with IRAC, these images provide reliable detections of the reddest, oldest, and heavily dust obscured sources not seen in COSMOS2015.

Although the nominal area of the COSMOS survey is 2 deg2, the most secure region comprises 1.279 deg2 after removing contamination due to bright star halos and requiring an intersection of the deep near-infrared UltraVISTA imaging and the Subaru Suprime-Cam intermediate bands, where the photo-z performance is generally best. Compared to Davidzon et al. (2017) who use UltraVISTA DR2 (McCracken et al. 2012) imaging to probe the near-IR, here we rely on UltraVISTA DR4 (Moneti et al. 2023), reaching greater and simultaneously uniform NIR depth across the entire field (especially Ks). These non uniform depths restricted the SMFs of Davidzon et al. (2017) to only the four deepest stripes in UltraVISTA spanning 0.62 deg2. The gains leveraged in this work are hence five-fold. Firstly, the total number of recovered sources at all apparent magnitudes increases due to the additional area, improving statistical margins especially of rare sources. Secondly, the increased optical and near-infrared depths permit the recovery of fainter sources, improving flux completeness. Thirdly, the increased depth reduces photometric uncertainties leading to more precise, less biased photo-z and ℳ. Fourth, the consistent depth and improved imaging performance from Hyper Suprime-Cam provides significantly better photometry relative to Suprime-Cam. Lastly, the wider field makes it less likely to become biased due to specific structures along the line-of-sight (e.g., clusters, voids) and instead probes a greater variety of environments affecting the evolution of galaxies within them.

As presented in Fig. 13 of Weaver et al. (2022), the best photo-z performance at i > 22.5 AB is achieved by the combination of The Farmer photometry and photo-z from LePhare with a δz/(1 + z) precision of < 1% at i ≈ 20 and < 4% at i ≈ 26 AB. For this reason, and to expedite comparisons with the most similar work in the literature (Ilbert et al. 2013; Davidzon et al. 2017), this work adopts the photometry from The Farmer paired with the photo-z, ℳ, and rest-frame magnitudes estimated by LePhare. This combination will be henceforth referred to as COSMOS2020, unless explicitly stated otherwise. Our photo-z estimates correspond to the column LP_zPDF, defined as the median of the redshift likelihood distribution ℒ(z) (often referred to as p(z)). We choose to adopt ℳ estimates from MASS_MED as the median of the mass likelihood distributions are generally less susceptible to template fitting systematics than MASS_BEST which is taken at the minimum χ SED 2 $ \chi^2_{\rm SED} $. We find that the two agree at all redshifts within 0.01 dex with a narrow scatter whose 68% range is below 0.05 dex.

Identification of stellar contaminants proceeds as described in Sect. 5.1 of Weaver et al. (2022). Briefly, LePhare computes the best-fit stellar template to each SED from a range of templates, including white dwarf and brown dwarf stars. Sources that achieve a χ2 from a stellar template fit lower than any galaxy template are removed. An additional criterion on morphology is used, identifying likely stars as point-like sources from the COSMOS HST/ACS mosaics (Koekemoer et al. 2007) and Subaru/HSC (PDR2, Aihara et al. 2019) for the i < 23 and i < 21.5 AB, respectively. Bright, resolved sources with SED shapes similar to stars are not removed.

An initial sample of 636 567 galaxies2 in the range 0.2 < z ≤ 6.5 is selected from the contiguous 1.27 deg2COMBINED region defined as the union of the UltraVISTA and Subaru/SC footprints but removing regions around bright stars3. However, the source density at 6.5 < z ≤ 7.5 becomes noticeably greater in the ultra-deep stripes and so we restrict sources in this particular z-bin to only the 0.716 deg2 of the ultra-deep stripes also covered by Subaru/SC but away from bright stars, finding 1327 galaxies. The construction of a robust stellar mass function requires precise redshifts and accurate stellar masses. Since an infrared detection is required to accurately measure galaxy stellar masses at z ≳ 2 − 3, where this study seeks to cover new ground, 227 305 sources with mch1 > 26 AB (i.e., S/N ≲ 5) are removed; ≈93% of which fall below the mass completeness limit introduced in Sect. 3.3. An additional 13 644 sources with ambiguous redshifts are removed, quantified for simplicity by having > 32% of their redshift probability lying outside zphot ± 0.5; ≈70% of which fall below the mass completeness limit. Finally, we remove a further 1850 sources with unreliable SED fits with reduced χ2 > 10. These three necessary cuts create a final sample of 395 095 galaxies with precise redshifts and accurate stellar masses. We note that many excluded sources fail more than one of these three criteria (e.g., 90% are undetected in the infrared); Fig. A.1 shows their cross-section. Although the rejected objects are potentially interesting, their deeply uncertain nature does not permit us to reliably incorporate them into our measurements without a substantially more complex approach (e.g., to model potential outliers, see Hogg et al. 2010). Overall the excluded sources are predominantly faint (median Ks ∼ 25.7 AB) such that their best-fit masses are generally below our adopted mass completeness limit and their potential for bias minimal within the mass ranges examined in this work.

3.2. Quiescent galaxy selection

Accurate identification of distant and/or unresolved quiescent galaxies has been a longstanding problem (see e.g., Leja et al. 2019b; Shahidi et al. 2020; Steinhardt et al. 2020). While quiescent galaxies can be identified by their low star formation rates, SFR estimates from fitting broad-band photometry are subject to large uncertainties and are dependent on the particular SED template library (Pacifici et al. 2015; Leja et al. 2019a; Carnall et al. 2019). The other most successful approach is that of rest-frame colors, as the lack of blue O- and B-type stars in quiescent populations implies a dearth of UV emission and a red optical contintuum. Similar photometric UV to NIR SEDs are measured for heavily dust-enshrouded but otherwise star-forming systems, making the challenge of color-color selection that of distinguishing truly quiescent galaxies from dusty star-forming ones, in the absence of FIR measurements. Popular rest-frame color-color selections include (U − V) vs. (V − J), (NUV − r) vs. (r − J), (NUV − r) vs. (r − K), and (FUV − r, r − J) (Williams et al. 2009; Ilbert et al. 2010; Arnouts et al. 2013; Leja et al. 2019b, respectively), in addition to observed-frame methods such as (B − z) vs. (z − K) for sources at 1.4 ≲ z ≲ 2.5 (Daddi et al. 2004). However, each implies a different definition of quiescence (see Leja et al. 2019b; Shahidi et al. 2020). These pairs of rest-frame colors may be measured directly from the observed photometry (where available) or derived from the best-fit templates that should largely agree with observed-band photometry, and can also be meaningfully extrapolated in cases where appropriate observed-frame data are not available. While the former may be adversely affected by uncertainties and systematics propagated from the observed photometry, the latter assumes that the SED template library or basis contains a suitable model. It is worth noting that efforts are being made to employ machine learning techniques to lessen the dependence and bias impact from model assumptions (e.g., Steinhardt et al. 2020), although no such method has gained comparable use as yet.

Recent work by Leja et al. (2019b) has clarified the efficacy of these color-color selections, proposing to extend the typically adopted U − V vs. V − J baseline further into the UV on the basis of stronger correlation with intrinsic sSFR as measured in simulated photometry (see also Arnouts et al. 2013). As such, catalogs containing reliable and deep FUV and NUV photometry have the advantage of being able to identify low-z quiescent populations which though extrapolation can be consistently identified up to the highest redshifts even when the observed-frame U band no longer corresponds to rest-frame U. At redder wavelengths the rest-frame J band is crucial for measuring the rest-frame stellar bulk, however, it becomes easily redshifted out of most deep surveys with IRAC photometry by z ≈ 2 − 3 which renders quiescent galaxy selections highly dependent on template libraries even at these intermediate redshifts.

In order to provide broad comparisons to other mass functions, in particular those measured in COSMOS (e.g., Ilbert et al. 2013; Davidzon et al. 2017), this work selects quiescent galaxies following the criteria introduced by Ilbert et al. (2013):

( N U V r ) > 3 ( r J ) + 1 and ( N U V r ) > 3.1 $$ \begin{aligned} (NUV - r) > 3\,(r - J) + 1\,\mathrm{and}\,(NUV-r) > 3.1 \end{aligned} $$(1)

whereby the slant line runs perpendicular to the direction of increasing sSFR and parallel to an increase in dust attenuation to separate truly quiescent systems from otherwise dusty star-forming contaminants. For clarity, we henceforth refer to this (NUV − r) vs. (r − J) selection as NUVrJ. Generally, this selection has been found to approximate a cut in sSFR ≲ 10−11 yr−1 (Davidzon et al. 2018).

It is important to note that LePhare computes rest-frame magnitudes by attempting to find an observed frame band which directly probes the rest-frame band, which for the reasons listed above is less model-dependent than simply adopting the rest-frame colors of the best-fit galaxy template (or sSFR) and hence conveys a view of the true variety of observed galaxies that is less biased by our assumptions. Although the rest-frame photometry are estimated using a K-correction and color-term, they are most strongly dependent on the best-fit template at redshifts where observed photometry are not well matched to the rest-frame band and are then effectively extrapolated from the best-fit template (see Appendix 1 of Ilbert et al. 2005 for details).

Unbiased sample selection becomes increasingly difficult with redshift. By z ≈ 3, rest-frame J-band fluxes are no longer directly measured even by IRAC channel 24 and so become increasingly model-dependent and uncertain. While rest-fame NUV remains constrained even at z > 6, rest-frame r must be extrapolated by z ≈ 5 at which point differentiation between quiescent and dusty star-forming galaxies becomes statistically impossible. Because of this, and the expectation that quiescence is unlikely at such early times, selection of quiescent galaxies in this work is limited to z ≤ 5.5, noting that selection between 3 < z ≤ 5.5 is subject to a significantly higher degree of uncertainty. Advancement in this selection will only be made possible with deeper infrared imaging from facilities such as JWST (Rigby et al. 2023).

Selection of quiescent galaxies is presented in Fig. 1, showing star-forming and quiescent galaxies whose masses are above their respective mass limits (see Sect. 3.3). Error bars on the rest-frame colors are estimated from the quadrature addition of the observed-frame filter nearest to that of the rest-frame. Given that even rest-frame photometric uncertainties are generally inversely proportional to mass at given redshift, these error bars are most representative for median mass systems but are overestimated for bright, massive galaxies. In addition, uncertainties become increasingly underestimated at z > 3 as the extrapolation based on the best-fit template is not propagated. However, it is already obvious that the dividing power of the slant line is significantly diminished at z > 3 where the rest-frame J band is no longer directly observed. Both the quiescent and dusty star-forming regions appear to be devoid of sources by z ≈ 6, such that no intrinsically quiescent (misclassified or not) are found at these early times. Whether this is an intrinsic feature of galaxy populations or merely a selection effect is discussed below in Sect. 3.3. For these reasons this work restricts distinction between quiescent and star-forming galaxies to z ≤ 5.5. Galaxies at z > 5.5 in this work should be considered star-forming.

thumbnail Fig. 1.

Galaxies are classified as either star-forming or quiescent in bins of redshift by selection in rest frame NUV − r and r − J colors, following Ilbert et al. (2013). The rest-frame J band is not directly measured at z ≳ 3.5 and so the sloped part of the selection box is dashed to indicate that the r − J colors are strongly model dependent. Typical uncertainties for the rest-frame colors of the star-forming and quiescent subsamples are estimated as medians of the nearest observed-frame bands, consistent with the derivation of the rest-frame colors. See the text for details.

3.3. Completeness

Accurate estimates of completeness are crucial when inferring general properties about a population from an otherwise incomplete sample. While advancements in near-infrared facilities have enabled breakthroughs in selecting representative sample of galaxies by measuring their stellar bulk, samples are still mass limited and these mass limits evolve with z, owing to the faintness of increasingly distant galaxies. As such, mass limits are highly dependent on accurate estimates of survey depths and their impact on the selection function as it relates to the detection of the lowest mass galaxies.

There are three known populations which are expected to be missed by a near-infrared selection whose deepest images are on the blue-end of the selection function. Firstly, the faintest star-forming blue galaxies in the local Universe (e.g., dwarf irregular starbursts) may have detectable fluxes blueward of i, but they will not be included in existing izYJHKs selections as their near-infrared emission is too faint. However, their contribution is expected to be limited to only the low-mass end of the galaxy stellar mass function, and can henceforth be well characterized in large numbers by deeper small field surveys (e.g., CANDELS: Grogin et al. 2011; Koekemoer et al. 2011).

Secondly, quiescent galaxies are more difficult to detect than star-forming galaxies owing to a lack of rest-frame UV and blue optical emission, meaning their detection relies upon deep near-infrared imaging at both low and high-redshifts. For this reason, an izYJHKs detection will be insufficient to detect the quiescent systems compared to star-forming ones of the same mass, meaning that the quiescent sample will be as complete as the star-forming sample at higher masses. We compute their respective mass completeness limits separately and apply them consistently throughout.

Lastly, and similar to quiescent galaxies, the most heavily dust obscured star-forming galaxies (AV ≫ 5) at high redshift (z ≳ 2) will not present appreciable optical or near-infrared fluxes to be detected in COSMOS2020, but unlike quiescent galaxies are ubiquitously and efficiently detected in far-infrared, submillimeter, and radio surveys (e.g., Schreiber et al. 2018b; Wang et al. 2019; Sun et al. 2021; Jin et al. 2019; Fudamoto et al. 2020, 2021; Casey et al. 2021; Shu et al. 2022). The nature and extent of this recently discovered population remains difficult to quantify, owing to a combination of complex selection functions and serendipitous detections. The majority are likely to be missed by an izYJHKs selection function even at the depths of COSMOS2020 (see discussions in Sect. 6.3).

Although mass completeness estimates are presented in Weaver et al. (2022), they are derived for a comparably less secure sample than used in this work. For this reason the mass completeness limits are rederived in identical fashion following the method of Pozzetti et al. (2010). A critical advantage of this method is that it does not rely on the theoretical mass distribution of galaxies fainter than the magnitude limit, but assumes that those just above the threshold share similar properties with the undetected ones, modulo a rescaling factor as detailed below. This contrasts with studies that estimate mass completeness either through injection-recovery of simulated sources, or extrapolation of the observed distribution itself below the magnitude limit. The latter approach, in particular, may underestimate the stellar mass limit if the galaxy sample is sparse due purely to astrophysical reasons and not genuine incompleteness. In our case, the sample in each z-bin is first cleaned by discounting the 1% worst-fit sources via χ2. The stellar masses ℳ of the 30% faintest galaxies in channel 1 are then rescaled such that their observed channel 1 apparent magnitude mch1 matches the IRAC sensitivity limit mlim = 26:

Log 10 ( M resc ) = Log 10 ( M ) + 0.4 ( m ch 1 26.0 ) . $$ \begin{aligned} \mathrm{Log}_{10}(\mathcal{M} _{\rm resc}) = \mathrm{Log}_{10}(\mathcal{M} ) + 0.4\,(m_{\rm ch1}-26.0). \end{aligned} $$(2)

The limiting mass ℳlim is then taken to be the 95th percentile of the ℳresc distribution. Finally 2nd order expansions in (1 + z) are fitted to each Mlim per z-bin up to z = 7 for the total sample, and z = 5 for star-forming and quiescent samples, to produce a smoothly evolving limiting mass. Limits above which samples are ∼70% mass complete (see justification below) are derived consistently for the total sample from 0.2 < z ≤ 7.5 and for the star-forming, and quiescent samples from 0.2 < z ≤ 5.5:

Total : 3.23 × 10 7 ( 1 + z ) + 7.83 × 10 7 ( 1 + z ) 2 $$ \begin{aligned}&\mathrm{Total\!:} -3.23\times 10^{7}\,(1+z) + 7.83\times 10^{7}\,(1+z)^2 \end{aligned} $$(3)

Star forming : 5.77 × 10 7 ( 1 + z ) + 8.66 × 10 7 ( 1 + z ) 2 $$ \begin{aligned}&\mathrm{Star-forming:} -5.77\times 10^{7}\,(1+z) + 8.66\times 10^{7}\,(1+z)^2 \end{aligned} $$(4)

Quiescent : 3.79 × 10 8 ( 1 + z ) + 2.98 × 10 8 ( 1 + z ) 2 $$ \begin{aligned}&\mathrm{Quiescent:} -3.79\times 10^{8}\,(1+z) + 2.98\times 10^{8}\,(1+z)^2 \end{aligned} $$(5)

and are shown in Fig. 2. Despite the more conservative selection adopted in this work, the derived mass completeness limits are essentially identical to those derived in Weaver et al. (2022), which indicates the robustness of these limits against sample selections. For reference, at z ≈ 1 the quiescent sample is complete at ∼0.25 dex in mass above the star-forming sample, growing to ∼0.45 dex above by z ≈ 3 − 5.

thumbnail Fig. 2.

Stellar mass completeness limits. The three panels show, from left to right, the density of the total, star-forming, and quiescent samples with their respective mass completeness limits. The limits are derived in discrete z-bins following Pozzetti et al. (2010) with magnitude limits adopted from IRAC channel 1 (filled circles), then interpolated with Eqs. (3)–(5), respectively, resulting in the solid lines. For quiescent galaxies, consistent estimates using Ks are also shown (empty circles). To verify these estimates we compute conservative mass limits from simulated spectra following a delta-like burst at z = 15 which are normalized separately to the Ks and channel 1 magnitude limits; they are shown by the dotted and solid red lines, respectively. See the text for details.

There remains an additional incompleteness arising from the fact that mass completeness is derived from IRAC channel 1 photometry, and yet our izYJHKs selection function does not include sources which are only identified in IRAC. As discussed in Davidzon et al. (2017), despite this drawback it is also disadvantageous to estimate the mass completeness from any one of the six izYJHKs detection bands either. For instance, the reddest, Ks, samples the rest-frame stellar bulk only out to z ≲ 2, and will tend to underestimate stellar masses at z ≳ 2 until z ≈ 4.5 when it becomes a UV tracer. Thankfully, it is possible to estimate the impact of this additional incompleteness by examining a sample of galaxies common to this work and those of the comparably deeper ∼200 arcmin2 CANDELS-COSMOS catalog of Nayyeri et al. (2017). This analysis is performed and discussed at length in Weaver et al. (2022). In summary, 75% of CANDELS sources at Mlim are recovered by both THE FARMER and CLASSIC COSMOS2020 catalogs, which combined with the choice of Mlim being the 95th percentile of Mresc implies that Mlim of the total sample corresponds to a mass completeness of ∼70%.

As shown in Fig. 2, the mass limit of the total sample is almost identical with that of the star-forming sample at all redshifts. This is unsurprising as star-forming galaxies generally dominate galaxy demographics. As in our NUVrJ selection from Fig. 1, we see the lack of quiescent systems at z > 5.5 and so we consider the star-forming subsample to be statistically equivalent to the total sample at z > 5.5.

Deriving a consistent mass completeness limit for quiescent galaxies is more challenging. It is well known that the predominantly older, redder stellar populations of quiescent systems imply higher mass-to-light ratios than found in star-forming galaxies which in turn imply a lower degree of mass completeness at the same flux limit. Figure 2 shows that our channel 1 derived mass completeness falls ≳0.2 dex lower than the bulk of the quiescent sources. Taken at face value, this would appear to indicate a lack of quiescent systems at low-intermediate masses at z > 2. Again, we have cause for concern as channel 1 is not included in our izyJHKs detection strategy, and hence our selection function is liable to miss IRAC-bright sources that are not present in bluer bands. Furthermore, due to the predominantly red optical spectral slopes of quiescent galaxies, continuum emission in Ks should be lower than in channel 1, implying that one would need a deeper Ks magnitude limit to detect the same sources from shallower channel 1 imaging. In other words, it is expected that there are red sources visible only in channel 1 that are not included in our izYJHKs detected catalog. Furthermore, by z ≈ 4.5 Ks no longer traces mass but rather UV continuum (channel 1 does so by z ≈ 8). Worse, the UV continuum in these systems is expected to be faint (even given ‘frostings’ of star formation in NUVrJ-selected post-starburst systems). Therefore, while deriving a mass completeness from channel 1 magnitude limits may be suboptimal due to possible selection effects with regards to a izYJHKs-selected sample, we cannot turn to Ks as it is no longer a mass indicator at z ≳ 4.5.

One solution is to modify our selection function by incorporating izYJHKs undetected IRAC-only sources into our sample. While a systematic search for IRAC only sources is ongoing, their identification is made difficult not only because of the significantly lower resolution and consequently higher source crowding in necessarily deep IRAC images, but these sources also lack optical/NIR data which is, by definition, insufficiently deep to identify low-z interlopers. Deep mid-infrared (MIR) data redward of IRAC over sufficient degree-scale areas do not currently exist, making a determination of redshift, mass, and quiescent nature of these IRAC-detected sources problematically uncertain.

For the time being, whether or not this absence of intermediate-low mass quiescent systems is astrophysical cannot be determined. Literature measurements do not yield more mass-complete quiescent samples as the UltraVISTA DR4 NIR depths are now similar to those from even the deepest small-field NIR imaging, H160 ≈ 25.9 (e.g., CANDELS, Tomczak et al. 2014), and even so, comparisons are hampered by field-to-field and photometric systematics. Comparisons with other stellar mass functions measured more consistently in COSMOS (e.g., Davidzon et al. 2017; Ilbert et al. 2013) are still affected by systematics from comparably less certain measurements from previous, shallower data, despite the lessened impact of cosmic variance. Additionally, comparisons with Davidzon et al. (2017) are complicated by the fact that they refit the photometry of Laigle et al. (2016) to produce new z and ℳ estimates. Modifications include expanding the redshift baseline from z ≤ 6 to z ≤ 8 and adding in additional SED templates to probe star-bursting galaxies with rising star formation histories as well as dust-obscured systems. As demonstrated by a recent comparison by Lustig et al. (2023), these modifications produce lower number densities of massive galaxies compared to the original measurements.

We turn to theoretical frameworks to investigate this further. Namely, we use Flexible Stellar Population Synthesis (FSPS; Conroy et al. 2009; Conroy & Gunn 2010) to estimate the stellar masses produced by extraordinarily old stellar population produced by a delta-burst evaluated at z = 15. We assume a Chabrier (2003) IMF with log10(Z/Z) = − 0.3. At each epoch (i.e., z) we normalize the evolved model spectrum to match our channel 1 observed frame magnitude limits (26.0 mag) and estimate the corresponding stellar mass shown by the solid red curve in Fig. 2. This exercise is repeated consistently with the Ks magnitude limit (25.5 mag) shown by the dashed red curve.

We caution however, that the assumption that these models accurately describe real high-z quiescent galaxies is becoming increasingly dubious. Such a system formed in a monolithic delta-burst just following the big bang cannot reach quiescence (as defined by NUVrJ) above z ≈ 5.3, and yet remarkably mature systems at z ≈ 4 − 5 have already been reported in the literature (e.g., Schreiber et al. 2018a; Tanaka et al. 2019; Valentino et al. 2020). Worse, the spread of mass-to-light ratios found in quiescent systems means that a single mass completeness limit for all quiescent galaxies at a given redshift is ill-defined even for consistently selected (e.g., UVJ, NUVrJ, sSFR) samples, resulting in a non negligible selection effects.

Nonetheless, we stress that the difference in completeness between the effectively flux-complete Ks-derived mass limit and the mass limit derived from channel 1 magnitudes is only ∼0.3 dex, which is typically less than a single bin in our analysis. In light of this considerable uncertainty, we adopt the optimistic quiescent galaxy mass completeness limits derived via Pozzetti et al. (2010) from channel 1 magnitudes with the caveat that the lowest mass bins in each measurement at z ≳ 2 are potentially incomplete. We indicate the Ks mass limit for quiescent samples throughout. Also, we note that our quiescent mass limit, although consistently determined in Davidzon et al. (2017), is more conservative despite the deeper NIR data. As will be discussed in Sect. 5.2, we attribute this to an overestimate of the quiescent galaxy mass completeness by Davidzon et al. (2017).

The difference in mass completeness between the star-forming and quiescent samples presents an additional complication. Because the star-forming galaxies can be reliably detected to lower masses than quiescent galaxies from our izYJHKs selection function, the low-mass regime of the total SMF at a given epoch, as defined in this work, does not actually include contributions from the lowest-mass quiescent systems. Therefore the total SMF is simply the star-forming SMF at masses below the quiescent mass limit. The effect from the undetected low-mass quiescent systems is almost certainly negligible: a simple extrapolation from the evolution in the number density of z ≈ 0.2 − 2.5 low-mass quiescent galaxies predict a number density at least 10× lower than that of the star-forming galaxies at z > 3. Furthermore, given that general uncertainties are still above the 10% level at low-masses even at z ≈ 2 (Fig. 3), any bias arising in the total number density from undetected low-mass quiescent systems is lower than other sources of error.

thumbnail Fig. 3.

Adopted estimates for the total uncertainty σΦ (solid) as a function of stellar mass at several redshifts (by color), for mass-complete samples only. Contributions include uncertainties from Poisson noise σN (dashed), Cosmic Variance σCV (dotted), and SED fitting σSED (dash-dot). 10% and 100% levels are indicated for reference.

3.4. Derivation of the 1/Vmax correction

Intrinsically faint galaxies at any given redshift are more likely to be missed by survey selection functions compared to brighter sources. For a NIR-selected sample, this equates to a mass bias by which low-luminosity, low-mass galaxies can only be detected in smaller volumes (i.e. out to lower z) relative to brighter, more massive ones which could be detected if they were at higher z. This is the well known Malmquist bias (Malmquist 1920, 1922).

The most straight-forward approach to correct for such a bias is the 1/Vmax method of Schmidt (1968), which has enjoyed significant popularity owing to its simplicity. Briefly, the 1/Vmax method statistically corrects for selection incompleteness by weighting each detected object by the maximum comoving volume in which it can be observed, given the characteristics of the telescope survey. The Vmax estimate per individual object is computed after finding the maximum redshift zmax by which the best-fit SED would no longer be observable5 because of the survey’s flux limit. On the other hand, the minimum redshift (zmin) should be the one at which the source would become too bright and saturate the camera, although in practice is the lower boundary of the z bin in which the considered galaxy lies (zlow). Therefore, for the ith galaxy inside the bin zlow < z < zhigh, the maximum observable volume is

V max , i = 4 π 3 Ω Ω sky ( D cov ( min ( z max , i , z high ) ) 3 D cov ( max ( z min , i , z low ) ) 3 ) , $$ \begin{aligned} V_{\mathrm{max}, i} = \frac{4\pi }{3} \frac{\Omega }{\Omega ^\mathrm{sky}} (D_{\rm cov}(\min (z_{\mathrm{max}, i},z_{\rm high}))^3 - D_{\rm cov}(\max (z_{\mathrm{min}, i},z_{\rm low}))^3), \end{aligned} $$(6)

where Ω is the solid angle subtended by the sample, Ωsky ≡ 41 253 deg2 is the solid angle of a sphere, and Dcov(z) is the comoving distance at z (Hogg 1999). If zmax exceeds the upper boundary of the redshift bin, the latter is used instead, meaning that the brightest sources are often assigned a weight that corresponds to the full volume of that redshift slice. As such, this correction is expected to be significant for only the faintest, lowest-mass sources in a given z-bin. While it is nonparametric and does not assume a functional form of the SMF, the 1/Vmax technique does assume that samples are drawn from a uniform spatial distribution, which is not accurate in the case of over- or under-dense environments (Efstathiou & Bond 1999). However, the assumption of a uniform spatial distribution is expected to be problematic only at z < 1, where large-scale structures have fully assembled, or in narrower surveys that can be biased by structures at smaller scales. Other methods exist which do not make this assumption such as STY (Sandage et al. 1979) and SWML (Efstathiou et al. 1988), a parametric and nonparametric maximum likelihood method, respectively. Already, Davidzon et al. (2017) found that the constraints provided by COSMOS2015 were sufficiently strong for the 1/Vmax method as well as more complex methods (e.g., STY, SWML) to essentially converge. With even stronger constraints and larger effective area provided by COSMOS2020, we can expect even better agreement, with minimal advantages to the more complex methods. More extensive discussions on strengths and weaknesses of the various approaches can be found in the literature (e.g., Ilbert et al. 2004; Binggeli & Jerjen 1998; Johnston 2011; Takeuchi 2000; Weigel et al. 2016).

3.5. Further considerations of uncertainty

We adopt a statistical error budget on the SMF number density Φ consisting of the quadrature addition of Poisson noise (σN), cosmic variance fluctuations (σcv), and uncertainties on masses induced by SED fitting (σSED) such that σ Φ = ( σ N 2 + σ cv 2 + σ SED 2 ) 1/2 $ \sigma_{\Phi} = (\sigma_{\rm N}^2 + \sigma_{\rm cv}^2 + \sigma_{\rm SED}^2)^{1/2} $. Figure 3 shows the composition of the total error budget from z = 1.1 to 6.5 as a function of stellar mass for mass-complete bins.

3.5.1. Poisson noise

Poisson noise arises from processes wherein the abundance of a discrete quantity (or counts) is measured. Although in the limit of many events a Poisson process becomes indistinguishable with that of a Gaussian, in small number counts it can be the dominant source of uncertainty. Here we compute the Poisson error σN for each mass bin as N $ \sqrt{N} $ where N is the number of objects in that bin. These values are recomputed for the star-forming and quiescent subsamples separately. As shown in Fig. 3, the fractional contribution from σN increases with mass and redshift with the largest contribution at ℳ > 1011.5 ℳ.

The discrete nature of Poisson processes allows us to also provide upper limits on bins containing zero detected galaxies. Following Table 1 of Gehrels (1986), the statistical upper limit on Φ (N = 0) for a given observed volume V is σN, limit = 1.841/V. See Ebeling (2003) and Weigel et al. (2016) for details and further discussions.

3.5.2. Cosmic variance

It is well established that galaxy properties are correlated with environmental density (i.e., clustering). Galaxy clusters, while being an important laboratory for galaxy evolution, are not typical of galaxy environments. Because of their density, they impart a higher overall normalization to the stellar mass function. More noticeably, they tend to inflate the massive end of the mass function as they preferentially contain the most massive systems. This environmental field-to-field bias (so-called “cosmic variance”) is a topic of intense study, and is a key component to accurately assessing sample uncertainties when trying to infer universal or intrinsic properties of galaxies.

There are many published methods to estimate cosmic variance, based on numerical simulations (Bhowmick et al. 2020; Ucci et al. 2021), analytical models calibrated to observations solved either using linear theory (Moster et al. 2011; Trapp & Furlanetto 2020) or on forward simulation corrections to linear theory (Steinhardt et al. 2021), and observationally (e.g., Driver & Robotham 2010). Trenti & Stiavelli (2008) combines results from cosmological simulations with analytical predictions.

Cosmic variance σcv is estimated following Steinhardt et al. (2021), who adapt the methods of Moster et al. (2011) which, importantly, scale with stellar mass (up to 1011.25 ℳ) and are commonly adopted for use in 0.1 ≲ z ≲ 3.5 measurements as that is the redshift range in which the Moster et al. (2011) calculator was devised. However, above z ≈ 3.5 these estimates become increasingly underestimated, so Steinhardt et al. (2021) use linear perturbation theory to extend this work more reasonably to the early Universe, while maintaining agreement at z < 3. Although environmental density has known covariance with star formation (e.g., Davidzon et al. 2016; Bolzonella et al. 2010), we assume cosmic variance is equivalent between star-forming and quiescent subsamples. As shown in Fig. 3, the fractional contribution from σCV is dominates the error budget for low-ℳ systems at all redshifts and becomes progressively more important at high-ℳ with increasing z.

3.5.3. SED fitting uncertainties and bias

Another consideration is the uncertainty on the stellar mass estimate provided by the SED fit. Figure 4 shows the likelihood distributions on stellar mass at fixed redshifts and masses produced by LePhare sampling at Δlog10(ℳ/ℳ) = 0.025 dex. Trends with width of the likelihood distributions indicate that mass is best constrained for low-redshift, massive (i.e., bright) sources. Although there is nonzero skewness and kurtosis in individual cases, the overall median distribution is symmetric. This is expected, as the uncertainty on stellar mass is essentially a measurement of the range of allowable templates and their normalization in the fitting procedure, and thus σSED scales with photometric uncertainties. However, these likelihood distributions on stellar mass should be treated as lower limits as they do not take into account any covariance with redshift.

thumbnail Fig. 4.

Likelihood distributions of galaxy stellar mass are derived from SED fitting with LePhare and assume a fixed redshift. Individual distributions (gray) are summarized by a median stack (blue) grouped by redshift and stellar mass (indicated in ranges of log10(ℳ/ℳ)). Estimates of standard deviation σ are shown. The size of a typical mass bin used in this work is 0.25 dex, indicated by the pair of dotted gray lines in each panel.

While these typical per-bin distributions can be valuable, especially for injecting noise into measurements from simulations, attempting to compute its contribution to the SMF, σSED, using the typical width in a given bin is suboptimal as the wings of neighboring mass-bins contribute asymmetrically. To address this, we use the individual mass likelihood distributions to draw 1000 independent realizations of the galaxy stellar mass function and thereby directly estimate the variance produced by the mass uncertainties, which we take as the 68% range about the median number density per bin of mass. As shown in Fig. 3, the contribution from σSED become dominant only at ℳ > 1011.5 ℳ, in some cases becoming larger than unity. They are comparable to contributions from σN across the entire mass range.

It is important to note that this does not account for systematic biases arising from SED fitting, such as assumptions of the stellar initial mass function6, potential photo-z offsets, and assumptions as to the star formation histories that all propagate into the stellar mass determination. However, concerning the latter case, we show in Weaver et al. (2022) that the combination of The Farmer and LePhare achieves a subpercent photo-z bias even for faint, high-z sources (−0.004 δz/1 + z at 25 < i < 27) improving over other works including COSMOS2015. Systematic errors cannot be combined with random errors, and so additionally complicate measurements of the SMF. Given this indication of relatively low bias arising from the photo-z and significantly better constrained SEDs relative to previous measurements, we omit these considerations in the present work. See Marchesini et al. (2009) and Davidzon et al. (2017) for detailed discussions of various sources of bias and their effect on the SMF.

3.5.4. Eddington bias

The number of low-mass galaxies is orders of magnitude larger that of the highest-mass systems, and so a randomly chosen galaxy is overwhelmingly likely to be lower-mass. If even a small fraction of such truly low-mass systems scatter toward high-mass (owing to a ℳ overestimate) it can significantly change the poisson-dominated high-mass number density estimate. The converse situation, while depleting the high-mass end, would have virtually no effect on the low-mass estimates. This is the well known Eddington bias (Eddington 1913). While generally understood to mean that there is a net bias leading to the overestimation of the density of massive galaxies, a small, but highly asymmetric uncertainty on the mass of low-mass systems can similarly generate such a bias that affects the shape of the low-mass regime. See Grazian et al. (2015) for further discussions.

Effectively correcting for Eddington biases has been a leading point of discussion in recent literature, generally favoring the convolution of the fitting function with a kernel that describes the uncertainty in stellar mass (e.g., Davidzon et al. 2017; Ilbert et al. 2013). Recently, Adams et al. (2021) compared the effect of using three different forms for the convolution kernel finding a significant difference in the inferred intrinsic SMF. Alternative approaches have been proposed (e.g., Leja et al. 2019a), developed a nonparametric formalism for incorporating σΦ into an unbinned Likelihood fitting, whereby multiple realizations of the parent catalog are made, each time sampling stellar mass from the mass likelihood distributions of each galaxy. In this work we primarily adopt the traditional convolution kernel method to estimate Eddington bias. At the same time, we also fit the mass function using the method of Leja et al. (2019a), and so follow their approach where relevant.

4. The Schechter function

Galaxy luminosity and stellar mass functions can be described empirically by the parametric formulation first introduced by Schechter (1976) in the context of the local Universe. Since then, the Schechter function has been adopted ubiquitously in statistical studies of galaxy mass assembly. It is more convenient to express the number density of galaxies per logarithmic mass bin d log ℳ as given by Weigel et al. (2016):

n gal = Φ d log M = ln ( 10 ) Φ e 10 log M log M × ( 10 log M log M ) α + 1 d log M , $$ \begin{aligned} n_{\rm gal}&= \Phi \,d\,\mathrm{log}\,\mathcal{M} \nonumber \\&= \mathrm{ln\,(10)}\,\Phi ^*\,e^{-10^{\mathrm{log}\mathcal{M} - \mathrm{log}\mathcal{M} ^*}} \times (10^{\mathrm{log}\mathcal{M} - \mathrm{log}\mathcal{M} ^*})^{\alpha +1}\,d\,\mathrm{log}\,\mathcal{M} , \end{aligned} $$(7)

which describes a power law of slope α at masses smaller than a characteristic stellar mass ℳ*, whereupon the function is cut off by a high-mass exponential tail. The overall normalization is set by Φ*, which corresponds to the number density at ℳ*. Although Φ (ℳ) is expected to evolve smoothly with redshift such that Φ (ℳ, z) can be mapped from secondary functions ℳ*(z),Φ*(z),α(z), most literature applications fit for each parameter independently at each redshift. A notable exception is Leja et al. (2019a).

Many studies have since found evidence that the total galaxy population at low-z is better described as the coaddition of two Schechter functions (e.g., Peng et al. 2010) whereby the low-mass and high-mass end acquire individual normalizations ( Φ 1 * $ \Phi^*_1 $ and Φ 2 * $ \Phi^*_2 $) and low-mass slopes (α1 and α2) but retaining a single characteristic stellar mass ℳ*. This so-called ‘Double’ Schechter form is as follows:

Φ d log M = ln ( 10 ) e 10 log M log M × [ Φ 1 ( 10 log M log M ) α 1 + 1 + Φ 2 ( 10 log M log M ) α 2 + 1 ] d log M . $$ \begin{aligned} \Phi \,d\,\mathrm{log}\,\mathcal{M} =&\mathrm{ln\,(10)}\,e^{-10^{\mathrm{log}\mathcal{M} - \mathrm{log}\mathcal{M} ^*}} \nonumber \\&\times {\Big [} \,\Phi _1^*\,(10^{\mathrm{log}\mathcal{M} - \mathrm{log}\mathcal{M} ^*})^{\alpha _1+1} \nonumber \\&+ \Phi _2^*\,(10^{\mathrm{log}\mathcal{M} - \mathrm{log}\mathcal{M} ^*})^{\alpha _2+1}{\Big ]}\,d\,\mathrm{log}\,\mathcal{M} . \end{aligned} $$(8)

The Double Schechter function has been found to be as suitable description in most of the studies in the local Universe and up to z ≈ 2. At higher redshifts, it is less clear whether this is a better description of Φ than a single Schechter function. Moreover, deviations have been recently been reported at z > 7 (e.g., Stefanon et al. 2021) in which the observed stellar mass function is better described by a power law than by any Schechter profile.

5. Results

Now having established the selections and methods adopted in this work, we present the resulting measurement of the SMF for the total, star-forming, and quiescent samples. We investigate also the evolution of number densities and quiescent fractions at fixed mass, and then fit the SMF for each sample with several methods to derive the evolution of the best-fit Schechter parameters.

5.1. The total galaxy stellar mass function

We measure the SMF for our total (star-forming and quiescent) sample divided in 12 redshift bins from z = 0.2 to 7.5 in fixed bins of stellar mass Δℳ = 0.25 above the mass limit for that z-range7. Shown in the left panel of Fig. 5, the shape and normalization of the SMF changes considerably over the ∼10 billion years corresponding to this redshift range. At z ≲ 3, the SMF features a smooth, monotonically decreasing number density at low-ℳ that flattens before falling off steeply at ℳ ≈ 1011 ℳ, around the so-called ‘knee’ of the function. Its overall shape and normalization remains roughly constant until z ∼ 1.5 indicating a lack of mass growth at recent times, consistent with the decline of the cosmic star formation rate (Madau & Dickinson 2014). However, by z ∼ 1.5 the normalization decreases dramatically. The number density at the knee is only ∼1% of its z ≈ 0.5 value with the fastest growth occuring on the low mass end, consistent with mass “downsizing in time” (Cowie et al. 1996; Neistein et al. 2006; Fontanot et al. 2009). The slope of the low mass end steeps with time and is expected to be driven by physical processes including outflows, star formation efficiency, slope of the main sequence, and mergers (see discussions in Peng et al. 2014). At the same time, the knee itself becomes difficult to distinguish as the SMF takes the form of a smooth power law, and we become increasingly unable to constrain the low-mass end of the SMF due to worsening mass completeness (mass incomplete points are omitted in the figure). As expected from Fig. 3, the overall uncertainty grows significantly with increasing redshift and mass. We note that at a few epochs (e.g., z ≈ 5), the SMF is not strictly monotonic, likely driven by systematics rather than a real physical phenomenon (see discussions in Sect. 6).

thumbnail Fig. 5.

Evolution of the galaxy stellar mass function over 12 redshift bins (0.2 < z ≤ 7.5) for the total, star-forming, and quiescent samples. Mass incomplete bins based on the channel 1 limiting magnitude are not shown. For the quiescent sample, bins that are fully incomplete based on the Ks limiting magnitude are indicated by cross hatching.

The evolution of the total SMF measured in this work is compared with literature results in Fig. 6. We begin at z ≈ 0.2 by comparing with two SMFs previously measured in the same field: Ilbert et al. (2013) and Davidzon et al. (2017); both use photo-z and ℳ computed with LePhare over COSMOS, out to z ≈ 4.0 and z ≈ 5.5, respectively, with the nearly same redshift binning scheme as we use. We note one exception that (Ilbert et al. 2013) bins sources at 3.0 < z < 4.0 whereas (Davidzon et al. 2017) uses 3.0 < z ≤ 3.5 and 3.5 < z ≤ 4.5, and so we opt to follow the scheme of Davidzon et al. (2017) as it allows a comparison up to higher redshift and omit the comparison with the highest-z measurement of Ilbert et al. (2013). Additionally, Davidzon et al. (2017) only considered sources in the ultra-deep regions of COSMOS2015 (corresponding to UltraVISTA DR2) as the spatial inhomogeniety of the NIR bands implies a significant variation in selection function and mass completeness between the deep and ultra-deep regions. Thankfully, COSMOS2020 (corresponding to UltraVISTA DR4) contains nearly uniform NIR coverage (Δ ≈ 0.4 mag, Moneti et al. 2023) and so can leverage an area almost 2× larger out to at least z ≈ 6.5, beyond which the source density becomes clearly different between the deep and ultra-deep stripes. Thus, for the 6.5 < z ≤ 7.5 bin we use the 0.72 deg2 subset of our primary area covered by the UltraVISTA deep stripes.

thumbnail Fig. 6.

Evolution of the galaxy stellar mass function across 12 redshift bins (0.2 < z ≤ 7.5). The 0.2 < z ≤ 0.5 SMF from the first redshift bin is repeated in each panel for reference shown by the purple dotted curve. Two other COSMOS stellar mass functions from Ilbert et al. (2013) and Davidzon et al. (2017) are shown for comparison, along with Grazian et al. (2015) from UDS/GOODS-S/HUDF and the recent work of Stefanon et al. (2021) from GREATS at z > 6. Mass incomplete measurements are not shown. Upper limits for empty bins are shown by the horizontal gray line with an arrow.

At z < 2.5, in the range that all three can be directly compared, we find excellent agreement with Ilbert et al. (2013) and Davidzon et al. (2017). This is unsurprising, as measurements from Davidzon et al. (2017) at z < 2.5 are adopted directly from COSMOS2015 and computed nearly identically as Ilbert et al. (2013) but with deeper imaging. Our measurements are similar, but with visibly less structure around the knee especially between 0.8 < z ≤ 1.1 where the volume density of sources is slightly higher, with the greatest increase at ℳ ∼ 1010 ℳ. However, at z ≈ 2.5 Davidzon et al. (2017) predicts a lower volume density than either Ilbert et al. (2013) or our measurements, which lie in agreement. This is not surprising, as Davidzon et al. (2017) expanded the template library of LePhare to include both starbursting and dust-obscured systems (see Sect. 3 of Davidzon et al. 2017) and so our work is naturally more similar to that of Ilbert et al. (2013) – a trend made clearer when looking at quiescent systems in the Sect. 5.2. At z > 3.0, we observe a significantly higher volume densities of massive galaxies compared to Davidzon et al. (2017), although the general shape of the low-mass end of the SMF remains similar. However, if we similarly limit our SMF to only the Ultra-Deep region we find it decreases to about the existing lower 1σ limit at ℳ > 1011 ℳ. We speculate that this may be due to the presence of (proto-)clusters preferentially in the Deep region at 3 < z ≤ 4 (see Brinch et al. 2023), one of which has been recently spectroscopically confirmed by McConachie et al. (2022). Constraints from Grazian et al. (2015) can be introduced at 3.5 < z ≤ 4.5, although at a significantly higher degree of caution as Grazian et al. (2015) uses a combination of smaller CANDELS fields (GOODS-South, UDS, HUDF) with an overall area of ∼12× smaller than the present study which implies a higher degree of uncertainty from cosmic variances and Poisson noise, as well as a reduced constraint on rare, high-mass systems found only in larger volumes. They are, however, completely independent as Grazian et al. (2015) do not include the CANDELS-COSMOS field. Where a comparison is possible (3.5 < z ≤ 7.5), our results are consistent with those of Grazian et al. (2015) within the stated uncertainties. Similarly at z > 5.5, we are able to compare with the recent measurements of the low-to-intermediate mass regime from Stefanon et al. (2021), with which the present work also is consistent within the stated uncertainties.

We do not include comparisons to stellar mass functions inferred from LUV-selected samples with stellar masses estimated from their UV luminosities (e.g., Song et al. 2016; Harikane et al. 2016) as UV-bright sources are only a portion of the general galaxy population and conversations between LUV and ℳ are hazardous. Moreover, such studies often rely on color-color selections which are less certain than photo-z, in addition to being susceptible to a number of systematic effects when translating a UV luminosity function to a SMF by means of a LUV − ℳ relation. We refer the reader to Davidzon et al. (2017) for basic comparisons and discussion.

While the smaller area surveys used by Grazian et al. (2015) and Stefanon et al. (2021) are more mass complete at lower masses, COSMOS2020 contains the widest contiguous NIR imaging at these depths and consequently provides a larger volume than any previous study (including Davidzon et al. 2017) and so introduces new constraints on the rarest systems at all redshifts z ≤ 7.5 such as massive galaxies. Yet it appears that we are quickly approaching the statistical limit beyond which a larger survey is needed to find more massive systems, if they exist (e.g., Euclid, see discussions in McPartland et al., in prep.). The nature of these sources, their authenticity, and their potential Eddington bias is further assessed in Sect. 6.3.

5.2. The star-forming and quiescent galaxy stellar mass functions

In Sect. 3.2, we used an NUVrJ color-color selection to distinguish quiescent systems from star-forming ones and then estimated their respective mass completeness limits, which will differ since their ℳ/L ratios are not the same. The corresponding SMFs are shown in Fig. 5 and compared to literature in Fig. 7. Fitting results are discussed in Sect. 5.3 and details can be found in Appendix C; their corresponding Schechter parameters are reported in Tables C.2 and C.3 for the star-forming and quiescent samples, respectively. Fractional uncertainties on mass and cosmic variance are adopted from the total sample, leaving only the impact of poisson noise differentiated between the star-forming and quiescent samples (see Sect. 3.5).

thumbnail Fig. 7.

Evolution of the star-forming (blue) and quiescent (orange) galaxy components of the galaxy stellar mass function in nine bins of redshift (0.2 < z < 4.5). Uncertainty envelopes correspond to 1 and 2σ limits. For comparison, we show other literature studies in COSMOS from Ilbert et al. (2013) and Davidzon et al. (2017) that adopt similar selections and methodologies. For reference, the total SMF is shown in each bin (solid gray) and the 0.2 < z < 0.5 total SMF is repeated in each panel (purple dotted). Mass incomplete measurements as defined by the channel 1 limiting magnitude are not shown, with the mass limits corresponding to the Ks limiting magnitude shown by orange arrows. Upper limits for empty bins are shown by the horizontal gray line and arrow.

We can follow the development of quiescent systems out to z ≈ 5, although with significant uncertainty at z ≳ 4. As evidenced by Fig. 1, only ∼200 of quiescent systems are found at z ≳ 4, dropping precipitously to only one candidate by z ≈ 6. This is partly driven by mass completeness. At z < 3 the difference between the IRAC channel 1 mass limit and the effective mass completeness dictated by our izYJHKs selection function is ≲0.2 dex in mass. This difference grows to ∼0.4 dex at z ∼ 4.5 when Ks falls blueward of the Balmer break causing the mass completeness to shift upwards to higher masses, indicated by the hatched region of the SMF of 4.5 < z ≤ 5.5 quiescent systems in Fig. 5. At this point the identification of quiescent systems is reliant on only a few bands, and is dependent on the particular SED templates (as suggested by the two distinct clusters in the upper right corner of the 4.5 < z ≤ 5.5 NUVrJ diagram), making such a distinction hazardous and susceptible to interlopers.

Our measurements of the star-forming and quiescent SMFs are generally in good agreement with other literature measurements in COSMOS, namely Davidzon et al. (2017) and Ilbert et al. (2013) as they are the only NIR-selected, mass-complete samples from which star-forming and quiescent subsamples are identified by NUVrJ. Other selections may induce additional systematics, and other separation methods (e.g., UVJ, BzK, sSFR) implicitly adopt a different criteria for quiescence (see Davidzon et al. 2018; Leja et al. 2019b). As shown in Fig. 7, our measurements indicate similar high-ℳ quiescent number densities compared to Ilbert et al. (2013) and Davidzon et al. (2017), but find significantly more low-mass lowest-ℳ quiescent galaxies compared to Davidzon et al. (2017; which are not measurable from the data used by Ilbert et al. 2013). Since our sample is derived from comparably deeper NIR data, it is expected to be complete down to lower masses relative to Davidzon et al. (2017). Given the increased quiescent galaxy number densities near the mass completeness limit in our work, we conclude that the 70−80% completeness threshold of Davidzon et al. (2017) is underestimated by a factor of ∼2 − 10 across this z-range, and is more likely only 15 − 35% complete. We note, however, that this in agreement with worst-case scenario discussed by the authors (see Sect. 4.2 of Davidzon et al. 2017).

The SMFs of star-forming and quiescent galaxies have remarkably different shapes and evolutionary histories. As shown by Fig. 5, star-forming galaxies at 0.2 < z ≤ 0.5 follow a double Schechter form (Eq. (7)) with a characteristic stellar mass ℳ* ≈ 1010 − 11 ℳ and yet by z ≈ 3 it appears to flatten into a smooth powerlaw-like form with a lower overall normalization. Meanwhile the SMF of 0.2 < z ≤ 0.5 quiescent galaxies follows the form of a double Schechter (Eq. (8)) with a low-mass upturn and a similarly positioned ℳ* (see Moutard et al. 2016) but the form appears to loose its lower-mass Schechter component around z ∼ 2. Although this may be physical, the fact that the quiescent sample is less mass complete at a given z means that the low-mass end of the total sample reflects only the contribution from star-forming systems. However, the contribution from low-mass quiescent systems, if they exist, can be expected to be < 1%.

This evolutionary picture is perhaps more easily understood by Fig. 8. Here galaxies have been binned by mass instead of redshift, allowing for a more direct comprehension of the growth, or lack thereof, in the number density of galaxies at fixed mass. We first notice a growth rate in the number densities that is strikingly consistent across all masses from z = 7.5 → 1 (i.e., similar slopes in each mass bin), constituting ∼5 Gyr or ∼36% of the history of the Universe, consistent with recent findings by Wright et al. (2018). This consistent growth is made especially clear in the lower left panel where the growth is relative to that of the total sample at z0 ≡ 0.2 < z ≤ 0.5. Although the uncertainties are considerable, there may be a hint that the growth is fastest (i.e., the slope is higher) for systems at ℳ ∼ 1010.5 − 11.0 ℳ or at the ‘knee’ of the mass function where star formation is hypothesized to be the most efficient (Moster et al. 2013; Behroozi et al. 2013; Wechsler & Tinker 2018). However, we note that Eddington bias remains uncorrected and so a growth in the bias strength with redshift may produce this apparently slow evolution (as suggested by Fig. 4). While the least massive systems are always more common than more massive ones, this is simply a consequence of the monotonic shape of the SMF.

thumbnail Fig. 8.

Evolution of the volume number density Φ of the total, star-forming, and quiescent samples at fixed mass, showing only mass complete bins. 1σ uncertainties are indicated by the colored envelopes. Gray shaded regions indicate upper limits corresponding to Φ(N = 0) at each redshift bin. The lower panel shows the evolution of the fractional change in the volume number density at fixed mass, relative to z0 ≡ 0.2 < z ≤ 0.5 of the total sample.

Another interesting feature is that the number density of ℳ = 109.5 − 10.0 and ℳ = 1010.0 − 10.5 ℳ systems appear to decrease mildly at z ∼ 2.5. There are fewer such systems in this work relative to both Davidzon et al. (2017) and Ilbert et al. (2013), and so this may be indicative of an incompleteness in our selection, or alternatively a bias in z and/or ℳ. Determining the origin of this difference is nontrivial, and so we simply caution against over-interpretation of this feature.

The highest-z constraints are difficult to interpret due to the incompleteness of low-mass systems (which are omitted) and the rarity of the most massive sources. The fact that the constraints at z ≳ 5 overlap with the Φ(N = 0) upper limit region in gray indicates that even the comparably large, deep volume of COSMOS is insufficient to provide robust measurements of the number density of massive galaxies at these early times. Future constraints will be derived from even larger volumes, which are expected to be delivered soon by near-infrared wide-area deep surveys such as Roman and Euclid.

This nearly uniform growth in the number density of sources slows down at z ≲ 1 in all but the least massive bin, with almost no growth for ℳ > 1010.5 ℳ systems. To understand this further, we examine the growth of the number densities of the star-forming and quiescent subsamples. As seen in the middle column of Fig. 8, star-forming systems maintain their monotonic mass-ranked order. In addition, their number density grows rapidly until z ≈ 1, when it starts to decrease. As shown in the rightmost column, quiescent galaxies follow a different pattern. Instead of a monotonic, mass ranked growth in number density, quiescent systems around the knee at 1010.5 ℳ are always the most numerous, with a slower rate of growth compared to the least massive and most massive bins. While the evidence for massive quiescent systems at z > 2 is hampered by the limited volume of COSMOS, we can confidently see that they appear to grow quickly in number at z < 2. By z ≈ 1 the number density of the most massive ones (ℳ = 1011.5 − 12.0 ℳ) stabilizes at ≈3 × 10−5 Mpc−3 dex−1. Reasons for this are discussed in Sect. 6.3.

Figure 9 shows the evolution of the fractional contribution of quiescent systems in four 0.5 dex bins at fixed mass and one wider 2 dex bin. The quiescent fraction of low mass systems increases with time monotonically, and at a higher rate of growth than those of the highest mass, at least for z < 2 where comparisons can be made. The quiescent fraction of the most massive systems at any redshift is more affected by both Eddington bias and poisson noise compared to low-mass systems and so should be taken as an upper limit. Nevertheless, it seems that between 20 and 30% of ℳ > 1011 ℳ galaxies are quiescent from z ≈ 5 → 1, growing above 50% and plateauing at z < 1.

thumbnail Fig. 9.

Evolution of the fraction of quiescent galaxies as a function of redshift in four bins spanning 109.5 < ℳ ≤ 1011.5 ℳ. Uncertainty envelopes correspond to 1 and 2σ limits. Points which are mass incomplete are not shown.

5.3. Inferring the intrinsic mass function

So far we have accounted for three key sources of statistical uncertainty in Sect. 3.5. However, Eddington bias remains a source of systematic uncertainty that has not yet been removed.

To do so, we fit the observed total, star-forming, and quiescent SMFs with Schechter functions convolved with a kernel which attempts to describe the mass uncertainty for a given mass and redshift interval. This approach is common in the literature (Ilbert et al. 2013; Davidzon et al. 2017; Adams et al. 2021). We adopt a two component parametric kernel of the form introduced by Ilbert et al. (2013):

D ( M ) = 1 2 π exp ( M 2 σ Edd 2 2 ) × τ Edd 2 π 1 ( τ Edd / 2 ) 2 + M 2 $$ \begin{aligned} \mathcal{D} (\mathcal{M} ) = \frac{1}{2\pi }\exp \left(\frac{-\mathcal{M} }{2\sigma _{\rm Edd}^2}^2\right) \times \frac{\tau _{\rm Edd}}{2\pi }\frac{1}{(\tau _{\rm Edd}/2)^2+\mathcal{M} ^2} \end{aligned} $$(9)

which describes a core Gaussian component with a constant width σEdd in product with a Lorenztian component which provides wide wings that are crucial to capturing catastrophic errors in SED measurements. The other free parameter τEdd is redshift dependent such that τEdd = τc (1 + z), widening the wings with increasing z. Instead of fitting the kernel directly to the ℒ(ℳ | z) distributions (as in Davidzon et al. 2017; Adams et al. 2021), we determine the kernel parameters from directly fitting the total SMF leaving the kernel free to vary independently at each redshift. However, both σEdd and τc are nuisance parameters and leaving them free to vary is may produce over-fitting of any one SMF at a given z. As expected, the resulting parameter distributions with respect to z contain outliers on either side of the median particularly where the kernel shape is most degenerate with that of the intrinsic SMF; we adopt the median values σEdd = 0.2 and τc = 0.1. The resulting kernel is significantly wider than the ℒ(ℳ | z) distributions shown earlier in Fig. 4. This is because the latter assume perfect redshifts and therefore underestimate the full ℳ uncertainty as there is likely to be a covariance with z. In addition, there is a mild evolution of ℒ(ℳ | z) with ℳ, which we have omitted from our kernel form for the sake of simplicity and to avoid overfitting. Interestingly, while σEdd ≈ 0.2 is smaller than that found by Davidzon et al. (2017) (0.35), τc ≈ 0.1 is conspicuously larger (0.04, same as in Ilbert et al. 2013). If we instead fit to the ℒ(ℳ | z) distributions directly, we find σEdd ≈ 0.05 and τc ≈ 0.04. These are much smaller than (Davidzon et al. 2017), which may be explained in part by the slightly different approach they used to estimate ℒ(ℳ) that more directly incorporates uncertainties on photo-z (see Sect. 4.1 of Davidzon et al. 2017). Nevertheless, we opt for the pessimistic case and fix the kernel shape and evolution with z to σEdd ≈ 0.2 and τc ≈ 0.1 for the subsequent analysis of the total, as well as the star-forming and quiescent SMFs; their shapes are shown in the inset panels in Figs. C.1C.3, respectively.

At the present, precisely determining the correct, intrinsic shape of the Eddington bias, and its evolution with z as well as ℳ is problematically difficult. In addition, the results stemming from these kernel-convolved fits suffer a degree of degeneracy with the kernel parameters (fixed or unfixed). We are not alone in issuing this caution; although the SMF measurements of Davidzon et al. (2017) is similar to those of Grazian et al. (2015), their different choice of convolution kernel caused their inferred Schechter parameters to differ. Recently, Adams et al. (2021) explored the impact of the assumed shape of the kernel, as well as various other systematics at z < 2, highlighting the full extent of the problem. Pushing measurements of the SMF to higher redshifts, where fewer constraints are available, naturally increases the influence of the kernel shape, and so the results derived here for α, ℳ*, and Φ* (Tables C.1C.3) should be taken in conjunction with our assumed kernel and its evolution with z.

When fitting each sample (total, star-forming, and quiescent) we assume a double Schechter parametric form (Eq. (8)) and move to a single Schechter form (Eq. (7)) when it achieves a lower χ2 per degree of freedom. The point at which this change occurs, and when various parameters are fixed, are different for each of the three samples as it depends on not only their shape but also ℳlim. The fitted points follow from before: we account for minor incompleteness on the low-mass end using the 1/Vmax correction and include only mass complete ℳ-bins adopting the uncertainty budget from Sect. 3.5. We proceed in two stages, fitting first using a simple and efficient χ2 minimization routine (MINUIT, James & Roos 1975) whose resulting best-fit parameters are used to set the initial positions of walkers in a secondary and longer Markov chain Monte Carlo (MCMC) optimization (EMCEE, Foreman-Mackey et al. 2013). It is generally appropriate to initialize walkers at well defined initial positions, assuming each chain is linearly independent (in this case scattering them by 1% of the initial parameter values) and allowed to converge well past its respective autocorrelation length such that it is not sensitive to those initial conditions (Hogg et al. 2010). Flat priors are used for each parameter, with generous limits that are typically not encountered during a given fit. 500 MCMC walkers are initialized to seek the state of maximum likelihood derived similarly to the minimum χ2, and do so following the “stretch move” (Goodman & Weare 2010) until converged, defined as having every chain iterate for at least 10× their autocorrelation length, and every autocorrelation length having changed by < 1% of their previous value. We do not see any significant differences by using a different move style (e.g., Red-Blue, Metropolis-Hastings), suggesting that the data provide sufficient constraining power. Although we include the original χ2 results throughout, we focus on the MCMC results which provide full posterior sampling, which are taken from the last 90% of each chain to avoid only the initial burn-in when the parameters are only beginning to converge. The MCMC and χ2 methods generally find consistent results. Fitting solutions corresponding to both the median posterior and maximum likelihood for the total, star-forming, and quiescent samples is shown in Fig. 10. An alternative version binned in redshift (as opposed to total, star-forming, or quiescent sample) is included for reference in Fig. C.4 and discussed in Appendix C.

thumbnail Fig. 10.

Summary of inferred galaxy stellar mass function evolution. Best-fit parameters are estimated by a Markov chain Monte Carlo fitting with a Schechter function at fixed redshift convolved with a redshift dependent kernel from which the inferred intrinsic mass function is recovered with the unconvolved Schechter fit (colored solid curves) for the total (leftmost), star-forming (center left), and quiescent (center right) samples. Estimates are shown corresponding to both the median posterior parameter values including 1σ envelopes (upper row) and the single set of maximum likelihood parameters (bottom row). The rightmost panel shows the results of fitting the Double Schechter continuity model of Leja et al. (2019a) to the total SMF between 0.2 < z ≤ 3.0 where the fits are reliable; see later in Sect. 5.3 for details.

The evolution of the Schechter parameters inferred from the total SMF are shown in the leftmost panels Fig. 11, and compared with (Davidzon et al. 2017) in the center panels. Table C.1 records the best-fit parameters, with detailed fits shown in Fig. C.1.

thumbnail Fig. 11.

Schechter parameter evolution with redshift. Left: evolution of best-fit Schechter parameters computed from fits to the total SMF from χ2 regression (red diamonds), as well as likelihood methods using fixed redshifts bins (blue; median posteriors as filled points with 68% error bars, maximum likelihood parameter set as unfilled squares), and our continuity model fit (maroon curve; median posterior). Error bars are not shown when parameters are fixed. Center: same points as on the left panels compared to literature values from Davidzon et al. (2017, gray points with hatching) and Leja et al. (2019a, gray dashed curves). Right: evolution of the best-fit Schechter parameters computed from fits to the star-forming (blue) and quiescent (orange) SMFs from the χ2 regression (unfilled colored diamonds) and likelihood methods using fixed redshifts bins (median posterior as filled points with ±34% envelope, maximum likelihood as unfilled squares).

As evidenced by the low χ2 values, the double Schechter well describes the observed SMF at z ≤ 3. No parameters are fixed other than those of the kernel. While a single Schechter finds a better fit at z > 3, the increasing mass completeness limit weakens the constraints on α1. To avoid overfitting, we fix α1 to the value from the best-fit value at 2.5 < z ≤ 3.0 and do so for the χ2 and MCMC methods independently but to excellent agreement. All of the unfixed search parameters are generously bounded such that the chains are not likely to encounter them. However, for the sake of physicality especially at high-z, we choose to bound the search space for the normalization Φ (z > 0.5) (both Φ1 and Φ2 where applicable) to below the upper 68% on the posterior distribution of max (Φ1, Φ2) of the previous mass bin.

It is important to note that from χ2 minimization we obtain a single set of parameters that have minimized the χ2 as well as their symmetric, Gaussian uncertainties. On the other hand, MCMC provides only posterior distributions that can be used to estimate parameter uncertainties but do not imply a unique definition of “best-fit parameters”. Commonly, the median is the best-fit statistic of choice, bounded by a 68% one-parameter confidence interval (which ignores covariance). However, for highly skewed posteriors the median may lie out in the wings and is therefore not a typical value for that parameter. In this case, the most obvious choice may be the parameters corresponding to the model which has found the maximum likelihood. However, MCMC cannot provide an uncertainty on these maximum likelihood parameters, limiting its use. Worse, the model uncertainty cannot be derived from the posterior widths, as they also are covariant and so the resulting error envelopes will be meaningless. The most obvious choice then is to compute the medians and widths of posterior distributions of the chains after burn-in. However, the curve traced by the median is not guaranteed to reflect the actual model function, and the 68% confidence envelopes may not contain the maximum likelihood nor the median posterior. We therefore strongly advise that models are reconstructed using the maximum likelihood parameters, and that caution should be exercised when interpreting best-fit parameters from median posteriors. Thankfully, the situation is less severe for symmetric posteriors, which for the total SMF are generally symmetric at z ≲ 6. At 6.5 < z ≤ 7.5, the posteriors are highly skewed as the relatively weak constraints produce widespread parameter covariance. The maximum likelihood results in all cases appear reliable. The evolution of the fitted Schechter models for both the median posterior and maximum likelihood is shown in Fig. 10.

In addition to the fits at fixed redshift, we also use the method of Leja et al. (2019a) to fit both the shape and evolution of the SMF simultaneously. This aptly named “continuity model” is fitted on the unbinned distribution of mass-complete sources in z and ℳ8. Double Schechter parameters ℳ*, Φ1, and Φ2 are treated as continuous functions over time described by second order polynomial expansions in z. The slopes α1 and α2 are assumed to be constant. These 11 parameters are joined by a minor parameter σeff which attempts to capture the cosmic variance. The effects of Eddington bias are incorporated by resampling the entire catalog by 10 random draws from their ℒ(ℳ | z). Only the mass-complete points are fitted, which allows sources near the mass limit to scatter in and out of the SMF realizations. Importantly, this method accounts for the significant covariances in the Schechter function between adjacent redshift bins which is neglected when binning in redshift, and exploits it to provide generally tighter parameter constraints (see Leja et al. 2019a for details).

We employ EMCEE to determine the posterior distributions and maximum likelihood fit of the continuity model, noting the aforementioned caveats. Initially we tried to fit the entire SMF out of z = 7.5, but the fits were inaccurate due to the limited range of evolution which can be described by a second order expansion. Already known from Davidzon et al. (2017), Φ rises quickly before slowing down toward z ∼ 0. The second order expansion in z can either fit the quick rise or the slow down, but not both. We therefore only consider galaxies at z ≤ 3 in our continuity model fit, as Leja et al. (2019a) do, and leave modifications to the fitting functions to future work. The expansion shapes are determined not by their coefficients, but rather by the amplitudes of three anchor points at fixed redshifts. The location of these anchor points is largely arbitrary, and so we follow (Leja et al. 2019a) in choosing z = 0.2, 1.6, and 3.0. Given the larger parameter space, we randomly initialize 100 walkers which explore the space until converged as before. We cannot apply the 1/Vmax correction directly to the continuity model, and so to remain conservative we exclude sources in the lowest 0.25 dex from the ℳlim given in Eq. (3) for the total sample.

The evolution of the Schechter parameters inferred from the star-forming and quiescent SMFs are shown in the rightmost panels in Fig. 11. Tables C.2 and C.3 record the best-fit parameters, with fits shown in Figs. C.2 and C.3, respectively. Adapting the continuity model to the star-forming and quiescent SMFs is left to future work.

The treatment for the star-forming SMF fit is similar to that of the total form. We begin with a double Schechter form at 0.2 < z ≤ 0.5, and transition to a single Schechter form at 2.5 < z ≤ 5.5 with α1 fixed. The best-fit model describes the observed SMF reasonably well until z ≈ 3.5 − 5.5 where an excess of high-mass star-forming systems is observed that cannot be described by a single Schechter form. Possible reasons for this are discussed in Sect. 6.3.

The quiescent SMF behaves noticeably differently from that of the star-forming sample and therefore requires a different strategy. We begin with a double Schechter form at 0.2 < z ≤ 1.5 but fix α2 at 1.1 < z ≤ 1.5 as the constraints on the low-mass regime deteriorate. With no significant low-mass constraints, we transition to a single Schechter at 1.5 < z ≤ 5.5. We note that this disregards a possible low-mass contribution and so the extrapolation to low-masses is likely an underestimate at z > 1.5, depending on the growth of low-mass quiescent galaxies beyond our mass limit (see Santini et al. 2022). The normalization of the quiescent SMF continues to steadily decline with z. Given the uncertainty about sample completeness at 4.5 < z ≤ 5.5, we necessarily fix α1 to ensure a reliable fit of the secure measurements at high-ℳ. Consequently, we find a mild rise in the expected ℳ* from z ≈ 3 → 5.

The evolution of the inferred total, star-forming, and quiescent SMFs resulting from the fixed redshift and continuity model fits are shown in Fig. 10. Furthermore, the evolution of the Schechter function parameters inferred from each sample are shown in Fig. 11. Although the evolutionary physics inferred from the fitting will be discussed in Sect. 6.1, we briefly describe the immediate result here. In general, parameters derived at fixed redshift from the χ2, maximum likelihood, and median posterior agree well for the total, star-forming, and quiescent SMFs, with few exceptions.

The characteristic mass is found to be ℳ* ≈ 1010.7 − 11.0 ℳ with very little significant evolution until z ≈ 3 when it begins to decrease with increasing z, with the continuity model suggesting a potentially increasing value with z. This contrasts with results from Davidzon et al. (2017), who find an initial decrease in ℳ* which increases at z > 3. The results of the continuity model of Leja et al. (2019a) agree well with the lack of evolution suggested by our measurements, although in some tension with our continuity model results. ℳ* of the star-forming sample similar in shape but generally larger than that of the quiescent sample. The likeness to the total ℳ* at fixed redshift is proportional to the sub sample that dominates around the knee in that z interval.

The low-mass slope of the low-mass Schechter component α1 is roughly constant with time, although may experience a maximum at z ≈ 1, in agreement with Davidzon et al. (2017). However, whereas Davidzon et al. (2017) finds a steeply decreasing α1 at 2 < z ≤ 5, we find a constant evolution which we then fix at z > 2.5 to avoid mounting degeneracies as the low-ℳ constraints are lost; this is also where a single Schechter becomes the preferred form. The value of α1 in good agreement with that found from our continuity model fit, as well as that of Leja et al. (2019a) who formed a composite SMF with significantly lower-ℳ and hence α1 constraints. While α1 slope of the star-forming sample is similar to that of the total sample, the α1 slope of the quiescent sample is poorly constrained and appears to decrease with redshift (i.e. steepen with time), consistent with the rapid appearance of low-ℳ quiescent systems at late times.

The normalization of the low-mass Schechter component Φ1 has little evolution at 0.2 < z ≤ 1.0, afterwards rapidly decreasing until it appears to approach zero asymptotically. Although Davidzon et al. (2017) finds slightly larger values of Φ1 at low-z, both measurements generally agree on the rapid decline of the low-mass normalization. Φ1 derived from the continuity model finds still lower values and hence a more gradual evolution9. Yet, Φ1 derived by Leja et al. (2019a) agrees well with our fixed redshift measurements. As with α1, the Φ1 normalization of the star-forming sample is similar to that of the total sample and Φ1 of the quiescent sample remains significantly smaller and decreases with redshift.

The low-mass slope of the high-mass Schechter component α2 is statistically consistent with being constant with z, but may rise slightly toward z ≈ 2. This is broadly comparable with Davidzon et al. (2017) within the stated uncertainties. α2 derived from our continuity model agrees well with the fixed redshift measurements, and with that of Leja et al. (2019a). Interestingly, at z ≲ 3 we find that α2 − α1 ≈ 1, which is predicted by Peng et al. (2010) as an indicator of mass quenching. The α2 slope of the star-forming sample is similar to that of the total sample, with α2 of the quiescent sample changing from negative at z ≲ 1 to positive at 1 ≲ z ≲ 2 where it appears to stabilize.

Lastly, the normalization of the high-mass Schechter component Φ2 generally declines over z = 0.2 → 3.0. Interestingly, the continuity model finds an initial increase in Φ2 toward z ≈ 1, declining afterwards in agreement with the fixed redshift estimates. We find lower values compared to those of Davidzon et al. (2017) and Leja et al. (2019a), which are in general agreement in part because they both use COSMOS2015. This suggests that the different evolution in Φ2 between this work and the literature may be driven by subtle differences between the catalogs, sample selection, assumed Eddington kernels, or a combination of the three. The Φ2 normalization of the star-forming sample is subdominant to that of the quiescent sample with both decreasing steadily with time, consistent with the large fraction of quiescence found in massive systems out to z ≈ 2.5.

Overall we find good agreement with the most directly comparable literature measurements from Davidzon et al. (2017) and Leja et al. (2019a). The implications for these evolutionary trends, and their relation to the growth of massive quiescent galaxies, are explored in detail in Sects. 6.3 and 6.4.

6. Discussion

In this section we focus on some of the open issues in galaxy evolution that can be addressed from a statistical perspective, using the results of Sect. 5 as a starting point. Besides the assembly of z ≈ 3 − 5 massive galaxies and their quiescent fractions, we also discuss derived measurements such as the cosmic stellar mass density ρ*, as well as the relation between stellar and dark matter halo mass functions. We include comparisons with simulations to provide physical insight and also to highlight areas for improvement from both sides.

6.1. Cosmic stellar mass density evolution

Galaxy mass assembly and growth is inextricably related to star formation. As such, reconciling direct measurements of galaxy mass growth with the behavior of the star formation main sequence (e.g., Brinchmann et al. 2004; Noeske et al. 2007; Daddi et al. 2007; Salim et al. 2007; Whitaker et al. 2012, 2014) is of great interest. Meaningful assessment of empirical models of galaxy growth that link the two (e.g., Peng et al. 2010; Behroozi et al. 2019) depends on unbiased, accurate measurements of the integrated mass density ρ* and its evolution with z.

We integrate the SMF measurements presented in Sect. 5 to derive an estimate of the stellar mass density (ρ*) for each bin of z. Although definitions vary, ρ* is commonly integrated down to 108 ℳ. Since our ℳlim at all redshifts is larger than 108 ℳ, we integrate into the extrapolated low-ℳ regime of our (unconvolved) Schechter models. The resulting ρ* for the total sample in Fig. 10 is compared with other literature measurements in Fig. 12, converting to a Chabrier IMF where relevant. All SMF-based literature measurements of ρ* have been reintegrated consistently to the same mass limit. We also show ρ* derived from integrating the star formation rate density (SFRD) function of Madau & Dickinson (2014), assuming a 41% return fraction corresponding to Chabrier’s IMF (see Sect. 6.1 of Ilbert et al. 2010).

thumbnail Fig. 12.

0.2 < z ≤ 7.5 cosmic stellar mass density. Upper: evolution of the cosmic stellar mass density of the total sample computed from the best-fit likelihood models (blue) integrated above 108 ℳ. Literature results from observational studies of mass-selected samples (Grazian et al. 2015; Tomczak et al. 2014; Caputi et al. 2015, 2011; Ilbert et al. 2013; Muzzin et al. 2013; Santini et al. 2012; Adams et al. 2021; McLeod et al. 2021; Wright et al. 2018; Thorne et al. 2021) and mass inferred from rest UV measurements (González et al. 2011). By integrating their SFRD functions, we can plot ρ* from Behroozi et al. (2013) and Madau & Dickinson (2014). In both cases we assume a return fraction of 41% (based on Chabrier’s IMF, see Sect. 6.1 of Ilbert et al. 2010). For Madau & Dickinson (2014), we include a shaded area based on return fractions between 25 and 50% (the latter value is similar to the one given by Salpeter’s IMF). Lower: evolution of the cosmic stellar mass density of the total (gray, repeated from above), star-forming (light blue), and quiescent (orange) samples compared to literature measurements (Behroozi et al. 2019; Santini et al. 2021; McLeod et al. 2021).

We find remarkably good agreement with previous observational studies. At z ≲ 3, the agreement seems to be limited by systematics as these are generally well measured, secure samples. However, ρ* at z ≳ 3 is dominated by significantly less certain measurements, due both to the size of the samples and their typically noisy photometry. Our measurements place ρ* near the midpoint of the scatter, in closest agreement with Davidzon et al. (2017, z ≤ 5) and Grazian et al. (2015, z ≤ 7). The relatively large uncertainties on ρ* beyond z ≈ 4 stem from the increasing degeneracy of ℳ* and ϕ with decreasing low-ℳ constraints (α being fixed at z > 1.5; see Fig. C.1).

At z ≳ 5, our estimated median ρ* as well as that of Grazian et al. (2015) is lower than the SFRD-derived predictions of Madau & Dickinson (2014). This discrepancy could suggest that while star formation is high, the mass growth is lagging behind. As intriguing as this is, we stress that the measurements are consistent within 1σ, that the SFRD constructed in Madau & Dickinson (2014) are constrained by only two surveys at z > 5 available at the time (Bouwens et al. 2012a,b; Bowler et al. 2012), and that our measurements assume a fixed low-mass slope α. Further work utilizing larger samples complete to lower masses will be required to confidently evaluate this divergence at z > 7.

In addition to ρ* of the total sample, the lower panel of Fig. 12 includes ρ* for the star-forming and quiescent samples. While stellar mass is overwhelmingly concentrated in star-forming systems from z ≈ 7 → 3, the mass density of quiescent systems grows rapidly until flattening at z ≈ 1, consistent with Fig. 8. From z = 1 → 0, there is no growth in ρ* for either star-forming and quiescent systems where the former remains larger than the latter.

While ρ* of the star-forming sample is well constrained out to lower masses, the same cannot be said about ρ* of the quiescent sample as its SMF is less mass complete in comparison. Since we cannot directly determine its low-ℳ shape, we assume that the low-ℳ Schechter component effectively vanishes at z ≳ 1.5. If this is not the case, then we underestimate ρ* for the quiescent sample. This may help explain the differences observed with respect to UNIVERSE MACHINE from Behroozi et al. (2019) who report generally larger values of ρ* for their quiescent sample, which features a significant quiescent population at low ℳ (see Fig. B.1). Although in agreement at 0.2 < z ≤ 0.5, (Behroozi et al. 2019) report higher quiescent fractions at < 1010.7 with redshift, finding more than 10× as many 109 ℳ by z ≈ 1. UNIVERSE MACHINE primarily defines quiescence as sSFR < 10−11 yr−1, which according to Davidzon et al. (2018) is comparable our NUVrJ-selection, and so this overproduction of quiescent galaxies may indeed be at odds with our observations. Although not readily available, a consistently NUVrJ-selected sample from UNIVERSE MACHINE would clarify this sensitive comparison. Comparisons with the results of Santini et al. (2021) and McLeod et al. (2021), although generally in agreement, are also complicated by differences in selection. Whereas Santini et al. (2021) adopts a novel SFR-driven selection, McLeod et al. (2021) adopts a UVJ selection following Carnall et al. (2018). Therefore, we stress that these estimates of ρ* for our quiescent sample are particular to NUVrJ-selected quiescent systems, and our assumption of a single Schechter description at z > 1.5 may drive our relatively low estimates. Finally, despite differences in selection and constraining power, these observational studies collectively indicate a general agreement as to the evolution of ρ* for quiescent systems.

6.2. Comparison to simulations

Observational constraints on the shape and evolution of the SMF may be useful in their own right, but we can also gain meaningful insight by comparing them with SMFs produced by galaxy formation simulations. Figure 13 shows the SMF constraints from this work and the inferred SMF derived by the binned MCMC fits evaluated at maximum likelihood, with the flagship or reference versions of simulated hydrodynamical SMFs overlaid: TNG100 of the ILLUSTRISTNG project10, EAGLE, SIMBA and FLARES (Pillepich et al. 2018; Furlong et al. 2015; Davé et al. 2019; Lovell et al. 2021, respectively). It is worth noting that FLARES uses the same code and model as EAGLE (specifically they adopted the strong AGN feedback EAGLE model introduced in Crain et al. 2015), but sample regions that span a wider dynamical range, and therefore has an much larger effective volume than EAGLE. We also include the semi-analytical results from SHARK (Lagos et al. 2018) as well as those of SANTA CRUZ (Yung et al. 2019a,b, see also Somerville et al. 2015). To note, we do not apply any artificial normalization (aside from h considerations) to any model or observation such that direct comparisons are possible from the figures alone.

thumbnail Fig. 13.

Comparison of observed and inferred galaxy stellar mass function (gray points, and colored curve with 1 and 2σ envelopes, respectively) to the reference flavors of four simulations: TNG100 of the ILLUSTRISTNG project (Pillepich et al. 2018), EAGLE (Furlong et al. 2015), SHARK (Default; Lagos et al. 2018), SANTA CRUZ (Yung et al. 2019a,b), and SIMBA (Flagship; Davé et al. 2019). To note, we do not apply any artificial normalization to any model or observation. Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

Since our observed SMF measurements are affected by Eddington bias, it is preferable to compare the SMF directly predicted by the simulations assuming no errors to our inferred SMF with considerations as to the assumptions made in our Eddington bias correction. We acknowledge that each simulation has multiple flavors with variations to their physical recipes. Although these variations provide additional insight, care must be taken when making these more complicated comparisons. As the goal of this work is to provide constraints on the observed and inferred SMF, we reserve comparisons to the variations of simulated SMFs for future work.

Overall, FLARES, SHARK, and EAGLE perform best below ℳ*, with SIMBA, FLARES, and the SANTA CRUZ SAM performing best above ℳ*. At high-z (z ≳ 4), we find the best agreement with SIMBA, ILLUSTRIS TNG100, and the SANTA CRUZ SAM. At low-z (z ≲ 1.5), however, there is a slight preference toward SIMBA as ILLUSTRIS TNG100 underestimates the number densities at ℳ* for z ≤ 1.5. The situation is different at 1.5 < z ≤ 3.0 where SIMBA significantly overestimates the high-ℳ end. While this could be explained by a systematic bias in observed masses or a missing high-ℳ population (see Sect. 6.3), it could also be due to SIMBA potentially over-grouping several separate dark matter haloes into one massive halo and/or insufficient AGN feedback at early times. Over this same range, it is apparent that EAGLE suffers from volume limitations and thereby does not contain the most massive galaxies. This is most clear from the comparison between EAGLE and FLARES, which are based on the same physical recipes, but the latter sample much rarer overdensities, and hence captures the high-ℳ population. Both the SHARK and Santar Cruz SAMs fare better as their semi-analytical prescriptions are able to produce high-ℳ systems. In some z-bins, both SHARK and EAGLE assemble low-ℳ systems too early relative to the observed SMF where there are direct constraints (most apparent at 1 ≲ z ≲ 2). We find a lesser degree of agreement at z > 3.5: while both SHARK and EAGLE underproduce the number of galaxies at all ℳ, there is significant scatter between ILLUSTRIS TNG, SIMBA, FLARES, and SANTA CRUZ. Meanwhile the volume limitations of ILLUSTRIS TNG100 become significant at z > 6 (see Pillepich et al. 2018 for comparisons with TNG300).

It is remarkable that all of the simulations11 reproduce the 0.2 < z < 0.5 SMF, and yet at higher redshifts display severe disagreement with our observations and each other with number densities differing by more than a factor of ten. While this should not come as a surprise given that simulations are typically tuned to reach an end state in agreement with the local Universe (despite known disagreements with the observed sSFR history and merger rates, see Popesso et al. 2023; Conselice et al. 2022), it suggests that their initial conditions and early evolutionary behavior are strikingly different such that variously tuned physical recipes and initial conditions can produce the same end state. This highlights the need for continued observations to improve constraints on the SMF at high-z where simulations can be critically tested, and their physical thereby recipes refined. In addition to the treatment of AGN and associated outflows (e.g., Debuhr et al. 2012; Richardson et al. 2016; Mitchell et al. 2020), modeling of dust attenuation at high-z is a particularly relevant concern at z ≳ 4 where the dust content of galaxies has been largely unknown, and in turn makes stellar mass estimates based on rest-frame UV/optical light increasingly uncertain. This, coupled with uncertainties in dust production mechanisms at high-z, can lead to simulations diverging significantly (from each other and observations), and has often been cited as a major contributor to such discrepancies (see discussions in e.g., Kokorev et al. 2021). Although future observations will benefit from improved more comprehensive mass uncertainties, our current constraints indicate that while it appears that simulations are able to reproduce the observed abundance of high-ℳ galaxies and so the production of early low-ℳ systems needs improvement.

6.3. Abundant massive galaxies at z ∼ 3 − 5

One of the most striking results highlighted in Fig. 6 is the high number density of massive ℳ > 1011 ℳ galaxies not only at z > 3.5. Although few in number, their identification in COSMOS2020 is a direct consequence of utilizing the larger 1.27 deg2 now accessible with deep, homogeneous NIR coverage. While no ℳ > 1011 ℳ galaxies are observed at z > 5, their growth since then has been similar to galaxies at other masses, as shown in Fig. 8. The majority of ℳ > 1011 ℳ systems are star-forming at z > 1, as shown by Fig. 7 and quantified in Fig. 9, with only z < 1 systems at ℳ ≈ 1010.5 − 11 ℳ being equally divided between star-forming and quiescent states. We find evidence for a sustained population of massive quiescent systems at z > 2, but their number densities are dwarfed by that of star-forming systems by a factor of ∼10. The existence of these massive quiescent systems seems to defy the timescales expected for mass assembly (Steinhardt et al. 2016; Faisst et al. 2017; Schreiber et al. 2018a; Cecchi et al. 2019), and so their tremendously early formation and growth are a topic of great interest (Caputi et al. 2011; Toft et al. 2017; Hill et al. 2017; Carnall et al. 2018; Tanaka et al. 2019; Valentino et al. 2020; Whitaker et al. 2021; Akhshik et al. 2023; Marsan et al. 2022), including a more focused investigation into the origins of similarly selected massive galaxies from COSMOS2020 by Gould et al. (2023).

Despite nearly identical sample selections, we still find greater number densities of massive galaxies compared to Davidzon et al. (2017) at 3 < z ≤ 5.5. By comparing to COSMOS2015 (matched within 0.6″), we find that 78% of ℳ > 1011 ℳ galaxies in our sample are also found in COSMOS2015 where they are exclusively high-z (91% agree within Δz ± 0.5) and high-mass (78% agree within Δℳ ± 0.5 ℳ). The remaining 23% are new sources found only in COSMOS2020. As shown by Fig. 14, they are predominantly faint, having a median Ks magnitude ≈24.2 AB compared to the median sample brightness Ks ≈ 23.4 AB with a photometric uncertainty 0.05 − 0.1 AB (see Fig. 10 of Weaver et al. 2022). They are generally as massive as the already known sources from COSMOS2015 and follow the same ℳ distribution. They are also remarkably red, having a median H − Ks color of ≈1.2 AB compared to that of the sample overall (≈0.8 AB). It seems unlikely that they are now found because of the better de-blending by The Farmer as visual inspection shows that they are generally isolated sources and are detected by Source Extractor in the CLASSIC version of the catalog. Therefore the most probable explanation for the new faint, red sources is that the deeper UltraVISTA YJHKs (as well as HSC iz) have enabled the detection of these faint red sources which in COSMOS2015 could not be detected. Interestingly, we find that their distribution in stellar mass is consistent with that of the total sample such that number densities at all masses ℳ > 1011 ℳ are proportionately represented.

thumbnail Fig. 14.

Ks magnitude and H − Ks colors of 125 new massive ℳ > 1011 ℳ galaxy candidates 3.0 < z ≤ 5.5 found in COSMOS2020 (orange), relative to the total sample used in this work (gray) and the 444 galaxies found in COSMOS2015 (blue). Measurements are taken from COSMOS2020, although they are similar to those from COSMOS2015, where matched. Number densities are summarised using gaussian kernel density estimators to produce smoothed distributions and contours for each sample. Median Ks and H − Ks values for each sample are shown by colored dotted lines.

Visual inspection of photometry and SED fits indicate that nearly all of the 3 < z ≤ 5.5 ℳ > 1011 ℳ galaxies have red colors. About 75% of them are selected as star-forming by NUVrJ, ≳80% of which are likely attenuated by a thick screen of dust (AV > 1; compared to only ∼10% across entire sample). The remaining ∼25% are classified as quiescent. The red colors appear to be genuine and not driven by blends, as confirmed by visually inspecting the imaging for each galaxy. Their broad-band photometry lacks the strong spectral features that contribute to a secure photo-z: there is no detectable flux contamination by nebular emission lines and both the Lyman and Balmer breaks are weak. However, they are surprisingly well fit by LePhare (typical χ N 2 1.5 $ \chi^{2}_{\mathrm{N}}\approx1.5 $). Their likelihood redshift distributions ℒ(z) are narrow as > 68% of the probability is typically contained within Δ z ≈ 1, with similarly well constrained ℒ(ℳ | z). Recently, Lower et al. (2020) has shown how the relatively simple parametric star formation histories assumed by most template-based SED fitting codes are susceptible to biases on the order of 0.5 dex in mass, which suggests that these uncertainties are likely underestimated (see also Michałowski et al. 2012, 2014).

One possible explanation for smooth SEDs with a power-law slope is contamination by AGN. The COSMOS2020 results computed by LePhare include classifications for sources with strong X-ray detections12 determined from crossmatching to Civano et al. (2016) sources within 0.6″ radius. In our sample we only use those sources identified as galaxies, which excludes these X-ray detections. Their inclusion would only serve to increase these already surprisingly large number densities. We do not attempt to quantify this, as the expected accretion disc light from Type 1 Seyferts and quasars make estimates of photo-z and ℳ unreliable and susceptible to catastrophic failures. See Weaver et al. (2022) and Salvato et al. (2011) for details.

Identification of AGN (including X-ray faint AGN) from broadband colors has been explored in the literature (Stern et al. 2005; Donley et al. 2012; Hviding et al. 2022), and their discussion is a standard component of SMF studies at these redshifts (see Grazian et al. 2015; Davidzon et al. 2017), although a consensus has yet to be reached. In general, AGN selection criteria rely on colors derived from (near-)infrared wavelengths from most notably Spitzer/IRAC. While the Donley et al. (2012) criteria have been used successfully at z < 3, they require constraints in all four IRAC bands, which is not the case for COSMOS2020 where channel 3 and channel 4 lack sufficient depth to detect these sources. Even if deeper IRAC images could be taken, the selection criteria rely on the four bands sampling the continuum shape at restframe 2 − 10 μm but at z > 3 instead sample the rest-frame stellar bulk at ≲2 μm. While the MIPS 24 μm data from S-COSMOS (Sanders et al. 2007) is an attractive solution, it also is too shallow (20.2 AB at 3σ, Jin et al. 2018) to fully constrain the restframe 2 − 10 μm continuum at 3 < z ≤ 5: only galaxies H ≲ 20 with large AGN fractions can be positively identified, and both low-fraction AGN at H ≈ 20 and normal galaxies at H < 20 cannot be classified with MIPS. Full spectral fitting including far-infrared measurements is challenging without the constraints from channel 3, channel 4, and 24 μm, and attempts to gain further insight using STARDUST (Kokorev et al. 2021) were broadly unsuccessful with tentative contamination found to be on the order of 10 − 30%. We note, importantly, that removing potentially contaminated sources provides no significant change to the SMF at these epochs.

Despite these challenges, we are able to leverage the elementary AGN template fitting from LePhare to statistically assess the spectral similarity between the best-fit AGN and galaxy templates for each source, limited to rest-frame UV/optical light. While only 10% of galaxies in the total sample are best-fit with AGN templates, this fraction grows to 30% for all mass complete galaxies 3.5 < z < 5.5 and then to 50% for those with ℳ > 1010.8 ℳ with broad wings, having > 80% of these sources statistically indistinguishable as either galaxy or AGN (|Δχ2|< 0.5). Turning to further infrared data of individual sources, we find 15% of the sample are detected by VLA-COSMOS (Smolčić et al. 2017). While not conclusive, we find 5% of these massive ℳ > 1011 ℳ galaxies at 3 < z ≤ 5 are detected in the ALMA maps of A3COSMOS (Liu et al. 2019), which currently covers ∼5% of the COSMOS field, highlighting the need for further observations. While similar sample statistics for individual sources have been extrapolated in the literature (e.g., Santini et al. 2021), anticipated surveys such as the TolTEC Ultra-Deep Galaxy Survey (Pope et al. 2019) will expand and deepen the far-infrared/(sub)mm coverage in COSMOS so that these samples can be more conclusively investigated.

A recent X-ray and radio stacking analysis of similarly selected z > 1.5, ℳ > 1010 ℳ COSMOS2020 galaxies by Ito et al. (2022; selected from independent SED fitting with MIZUKI, Tanaka 2015) revealed that low-luminosity AGN are likely ubiquitous in massive quiescent galaxies from 1.5 < z ≤ 5, even if individually they are not X-ray or radio detected. They also estimate the AGN contribution to optical/NIR continuum and find that at these redshifts, e.g., the rest-frame B-band luminosities of their quiescent galaxies are a factor of 30× larger than the expected rest-frame AGN luminosity. Ito et al. (2022) also find that the AGN luminosities for quiescent galaxies are significantly larger than those of star-forming galaxies. Together, their findings provide further confidence that our redshifts and stellar mass estimates for our X-ray undetected sample are not strongly contaminated by AGN light.

The lack of obvious systematics or likely only weak AGN contamination increases our confidence that these sources, or at least a part of them, are truly massive at z ≳ 3. In agreement with Ito et al. (2022), we also find that > 60% of ℳ > 1011 ℳ galaxies appear to be star-forming, and it is likely that at least some of them are also dust obscured (DSFGs; Casey et al. 2014; Zavala et al. 2021, and references therein). Since there are a number of sources found in the deeper NIR images of COSMOS2020, it makes sense that we are now more sensitive to fainter red sources than ever before (see Fig. 14). It is precisely this class of galaxy which are efficiently captured by infrared facilities such as Spitzer, ALMA, and JWST until now being optically “dark” (e.g., Schreiber et al. 2018b; Wang et al. 2019; Gruppioni et al. 2020; Sun et al. 2021; Fudamoto et al. 2021; Shu et al. 2022). If genuine, their existence not only points at an incompleteness particularly in the massive end of SMFs reported in previous studies lacking the necessarily deep infrared data over degree scales, but also highlights the sensitive interplay between the shape of their SEDs and the selection function consisting of the bands, their depths, and the detection methodology (see Fig. 3 of Weaver et al. 2022). Although detailed simulations will be explored in future work, qualitatively this could explain why the excess of sources relative to a Schechter at z ≈ 3 − 5 diminishes at z ≈ 6 as similarly red sources become too faint to still be detected. Optically dark galaxies selected from 2 mm ALMA observations in COSMOS from Casey et al. (2021) and Manning et al. (2022) are constrained to similar redshifts, stellar masses, and number densities. Importantly, Manning et al. (2022) studied two systems with star formation rates above 200 ℳ yr−1 but a gas depletion timescale < 1 Gyr, suggesting rapid star formation cessation and a potential transition to massive quiescent galaxies by z ∼ 3 − 4. Casey et al. (2021) find that DSFGs are responsible for ∼30% of the integrated star formation rate density at 3 < z < 6 and that 2 mm selection is an efficient way to identify larger samples in future surveys (see also Cooper et al. 2022). In an empirically based numerical model, Long et al. (2023) uses known DSFG population properties to predict the DSFG stellar mass function out to z ∼ 5. They find a shallow power law extension beyond the knee of traditional star-forming Schechter SMFs, and posit that 60−80% of massive (ℳ > 1011 ℳ) star-forming galaxies at z ≈ 3 − 6 are significantly dust-obscured and not captured in previous SMF studies (see also Martis et al. 2016). This is in line with our aforementioned results on significantly attenuated NUVrJ-selected massive star-forming galaxies at similar redshifts.

As introduced in Sect. 6.2 and shown in Fig. 13, there is generally good agreement between number densities of massive galaxies found in several simulations and in this work. This may be surprising given the range of physical recipes utilized in these simulations, and may suggest that several different tuning of physical recipes (mainly those of radiative and mechanical AGN feedback) can reproduce observations. However, from 2.5 < z ≤ 5.5, we observe galaxies with mass 1.8× larger than the most massive galaxies in any of these simulations. While bias and underestimated mass uncertainties may contribute, the simulations are usually considered to be limited by their volume. Figure 15 compares the observed number densities of massive galaxies (including quiescent) to the upper limits set by volumes of different surveys as well as simulations. The fully hydrodynamical codes (EAGLE, TNG100, and SIMBA) have the smallest volumes comparable to CANDELS and are similar to the observed number densities of ℳ > 1011 ℳ galaxies. However, the fact that they contain a large enough volume to catch the most massive galaxy seen in our observations may suggest that their volumes are adequate, but that their DM halo growth physics are not providing the large halos from which they can grow. Alternatively, recent observations of a highly star-forming galaxy at z = 6.9 (SFR ≈ 2900 ℳ yr−1, Marrone et al. 2018) place close to a maximally massive DM halo not seen in current simulations – populations expected to evolve into the most massive systems at z ≈ 2 − 4 (Lower et al. 2023). This is especially true of SHARK, whose volume is similar to that of COSMOS itself. More massive galaxies, if they exist, are not likely to be found in COSMOS, and so larger volumes (such as that probed by the two Euclid Deep Fields North and Fornax, as well as Roman will be required, see McPartland et al., in prep.).

thumbnail Fig. 15.

Comparison of observed number densities and upper limits on the rarity probed by various observed (gray steps) and simulated (colored lines) volumes. Number densities correspond to 1011.0 < ℳ < 1011.5 ℳ and 1011.5ℳ < 1012.0 ℳ from the total sample (light and dark gray, respectively), and 1011.5ℳ < 1012.0 ℳ from the quiescent sample (orange) from Fig. 8. 1σ upper limits are computed following Gehrels (1986), which for the observed volumes are dependent on widths of redshift bins.

Naturally, massive galaxies in the z > 3 Universe are of great interest as targets for JWST. The widest deep-field ERS program is the 100 arcmin2 Cosmic Evolution Early Release Science Survey (CEERS; Finkelstein et al. 2017), and based on our estimates it stands to find approximately 22, 16, 5 for ℳ > 1010.5 ℳ, and 6, 4, 2 for > 1011.0 ℳ) galaxies at 3 < z ≤ 3.5, 3.5 < z ≤ 4.5, and 4.5 < z ≤ 5.5, respectively, although the small area will equate to a stronger density bias from cosmic variance. The widest galaxy survey field of any GO program is the 0.6 deg2 COSMOS-Web (Kartaltepe et al. 2021), which is expected to find 493 (137), 340, (94), 103 (40) galaxies for the same redshift ranges (and masses). Although JWST will be instrumental in studying these sources in exquisite spatial resolution and with efficient spectroscopy (Barrufet et al. 2021; Carnall et al. 2021; Glazebrook et al. 2021), ground-based NIR observations that can efficiently identify them in wide-area surveys will retain their importance.

6.4. Rise of quiescent galaxies

As shown by Fig. 7, low-z quiescent galaxies are well described by a two component Schechter function whose low-ℳ component diminishes rapidly from z ≈ 0.2 → 1 (see Fig. 11). Simultaneously our sample becomes less mass complete with redshift, doing so more rapidly for quiescent systems due to their red color characteristic of their high mass-to-light ratios. Despite the considerable uncertainties on the completeness of our low-ℳ quiescent sample outlined in Sect. 3.3, it seems likely that the shape of the quiescent stellar mass function in this work and in previous literature in fact does turn down at low-masses, with selection effects playing a comparably minor role (see also Ilbert et al. 2010). However, there are hints of ℳ ≲ 109 ℳ quiescent systems at z ≳ 1.5 (beyond our mass limit) reported by Santini et al. (2022), albeit tentative from a ∼130× smaller effective area from the 33.4 arcmin2 Hubble Frontier Field parallels.

Given their low apparent brightness and rarity, the present work is the first to consistently quantify the number densities of ℳ ≈ 109.5 ℳ low-mass quiescent systems at 1.5 < z ≤ 4.5 at high signal-to-noise, based on the deepest NIR data taken over such a homogeneously measured degree-scale area required to find them in sufficient numbers. As seen in Fig. 8, the rate of growth in the number density of low-mass (ℳ ≲ 1010 ℳ) quiescent galaxies has been seemingly rapid over the past ∼10 billion years. Still, they constitute only a minor fraction of low-mass sources by z ∼ 0.2, and by extrapolating the number densities from z ≈ 1 − 2 one may expect to find none within COSMOS by z ∼ 3 (see Fig. 8). This is typically interpreted by the phenomenological model of Peng et al. (2010) to mean that the processes which act to halt star formation cessation in low-ℳ systems are inefficient at early times. For example the lack of virialized at z > 2 − 3 structure makes influence from environmental effects unlikely.

As shown by Fig. 8, the apparent lack of growth in the abundance of massive systems at z < 1 is the result of a decline in star-forming galaxies simultaneous with an increase in quiescent number densities. As noted by Ilbert et al. (2013), this decrease in the star-forming population is consistent with star formation cessation becoming extremely efficient, to the extent that massive star-forming galaxies are becoming quiescent faster than they can be replaced. Therefore, the mass assembly of massive star-forming systems at z < 1 is slower than the cessation of their star formation. However, there is also a slowing down in the rate of growth in the number density of massive quiescent systems themselves. While this may suggest high incidences of dry mergers or even rejuvination, it must also relate to evolving demographics: from z ≈ 1.5 → 0.3 there are fewer and fewer high-ℳ star-forming galaxies available to become quiescent. While the number density growth of massive systems seems to have stalled out, that of lower-mass systems continue to grow; this is the so-called phenomenon of “downsizing with time” (Cowie et al. 1996; Neistein et al. 2006; Fontanot et al. 2009).

The quiescent mass function at ℳ > 1011 does not change much from z ≈ 4.5 → 2.0, with surprisingly elevated quiescent fractions (Fig. 9) being above 20% at z < 5. Figure 8 shows why: while the number density of star-forming galaxies at z ≈ 5 is lower than at z ≈ 2, quiescent galaxies are roughly constant in density over this same range. However, we caution that their number densities are only marginally above what should be possible to find within the nominal volume of COSMOS at these redshifts, and so it is plausible that they are dominated by noise and/or photo-z bias. The question of whether or not this behavior is genuine can only be explored in future large volume surveys, as demonstrated by Fig. 15. Even though the most massive galaxies at z ≳ 3 − 4 are typically too faint to obtain continuum features, spectroscopic follow-up and supporting SED fitting will continue to provide valuable insights (Gobat et al. 2012; Schreiber et al. 2018a; Valentino et al. 2020; Glazebrook et al. 2017).

Figure 16 shows the fraction of quiescent galaxies in bins of mass for three epochs: z ≈ 0.3, 1.3, and 2.7. A key advantage of examining fractions is that they are less sensitive to the overall normalization of the simulation and biases from observations (e.g., in simulations: overproduction of all galaxy masses; in observations: systematics in effective survey volume), and provide additional insight which is obscured by simply comparing mass functions. Although comparisons to UNIVERSE MACHINE have been already discussed in Sect. 6.2, we introduce two samples of quiescent galaxies which we selected from EAGLE and SHARK with an NUVrJ selection consistent with our methodology. While EAGLE underproduces quiescent systems by 10 − 20% at all masses, SHARK overproduces them in all but the most massive bins at z ≈ 0.3 − 1.3 but underproduces them at z ≈ 2.7. Roughly, the fraction of quiescent galaxies in SHARK at z ≈ 1.3 matches the observed fractions at z ≈ 0.3. This may suggest that SHARK’s physical recipes that halt star formation in lower mass galaxies are too aggressive. These same systems are also seemingly overproduced (see Fig. 13), and so may be assembling and maturing too early. For additional figures and details, see Appendix B.

thumbnail Fig. 16.

Quiescent mass fractions. Upper: fraction of quiescent galaxies as a function of mass for three redshift ranges compared with predictions from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018) as well as measurements from the empirically calibrated UNIVERSE MACHINE (Behroozi et al. 2019). Lower: fractional difference between this work and literature predictions/measurements. See Appendix B for more details.

6.5. Dark matter halo connection

The mass assembly of a galaxy is inherently connected to the dark matter halo in which it formed and grew (see Wechsler & Tinker 2018 for a review). Yet, stellar masses ℳ have been observed to be < 2% of their halo mass ℳh, which point to galaxy formation as a strikingly inefficient process (Mandelbaum et al. 2006; Conroy & Wechsler 2009; Behroozi et al. 2010). This has led to investigations into the role of dark matter halo mass in influencing star formation cessation, including promoting thermal heating and/or gas expulsion by AGN (Shankar et al. 2006; Main et al. 2017), as well as virial shock heating of in-falling cold gas whose Jeans mass inhibits the formation of star-forming molecular clouds (so-called hot-halo mode, Birnboim & Dekel 2003). Hence, the gravitational influence of the halo mass on the cold gas reservoir regulates the ability of a galaxy to form stars, and hence the stellar-to-halo mass relationship (SHMR). Therefore it is no surprise that there is a similar evolution between the specific mass increase rate of the halo by accretion (sMIR ≡ M h 1 M h / t $ \mathcal{M}_{\mathrm{h}}^{-1}\partial\mathcal{M}_{\mathrm{h}}/\partial t $, Neistein & Dekel 2008; Saintonge et al. 2013) and the instantaneous mass growth by star formation (sSFR; see discussions in Lilly et al. 2013). While in this work we restrict ourselves to comparisons with theoretical HMFs, another paper in this series computes a self consistent SHMR based on measuring the halo occupation distribution directly from angular correlation functions and SMFs of COSMOS2020 galaxies (Shuntov et al. 2022). For an investigation into the SHMR split by star-forming and quiescent samples, see Cowley et al. (2019), which is also based on galaxies from the COSMOS field.

As shown in Fig. 17, we compare our observed and inferred SMFs to the halo mass function (HMF) of Tinker et al. (2008)13 from z = 1.5 → 7.5. We choose not to show z < 1.5 as these comparisons have been thoroughly explored by previous investigations (e.g., Davidzon et al. 2017; Legrand et al. 2019) and we do not observe any significant differences. The effects of feedback can be seen in the first panel of Fig. 17 at 1.5 < z ≤ 2.0 that explains the relatively lower number densities of both low- and high-ℳ systems, with those around ℳ* being most similar to HMF. This apparent tension is a well known feature and lies at the foundation of the contemporary galaxy evolution paradigm, involving halting star formation by secular (internal) and/or environmental (external) action on the gas reservoir such as thermal heating, dynamical turbulence, and/or removal. The growth of low-mass ℳ < ℳ* galaxies can be impeded by secular (e.g., supernovae and stellar winds) and environmental processes (e.g., ram-pressure stripping, thermal evaporation). Similarly, secular and environmental processes can also impede the growth of massive ℳ > ℳ* galaxies, albeit from different driving forces such as AGN and major mergers, respectively (see Peng et al. 2010, 2012, 2015; Wechsler & Tinker 2018; Förster Schreiber & Wuyts 2020). In this context, the characteristic knee at ℳ* is the result of a build-up of massive galaxies which can no longer sustain mass growth. At z ≈ 3, the low-mass end is still considerably lower than the scaled HMF but the number density of massive systems comes into agreement. Although the stellar mass function slightly lies above the SHMR-scaled HMF at some points, we caution that this should not be taken as a challenge to theory as it assumes that the SHMR at z = 0 is appropriate at higher-z (which is unlikely; see Legrand et al. 2019) and small modifications can reconcile this difference. We note that Fig. 18 of Davidzon et al. (2017) finds no such offset using the same scaling, but we are unable to reproduce their precise result.

thumbnail Fig. 17.

Comparison of the inferred z > 1.5 galaxy stellar mass function (colored curve with 1 and 2σ envelopes) fitted to observed measurements (gray points). We scale the Tinker et al. (2008) halo mass function (HMF; lower bound of the gray shaded region) by 0.018 corresponding to the stellar-to-halo mass relation at z = 0 and M h = M h * $ \mathcal{M}_{\rm h} = \mathcal{M}_{\rm h}^* $ (Behroozi et al. 2013) to produce an idealized SMF (gray curve). Upper limits on the SMF at each redshift are derived from the HMF assuming a fixed baryon fraction (0.166). Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

We derive an upper limit for the baryonic matter distribution by rescaling the HMF by Ωbm = 0.166, which for our adopted cosmology is redshift independent, and assume a 100% efficient baryon-to-stellar mass conversion. This is the maximum SMF physically allowed under our simple assumptions. This upper limit becomes relevant especially at z > 3.5 where our observed number densities exceed those inferred by the Schechter model. While a large Eddington bias or selection systematic could explain this excess (see Sect. 6.3), we stress that our inferred SMF assumes the applicability of Schechter formalism, which cannot accurately describe the observed number densities at 3.5 < z ≤ 5.5. Nevertheless, the inferred stellar mass function agrees well with the SHMR-scaled HMF up to z ≈ 7. This suggests that the most efficient haloes during these early epochs are not around ℳ*, but rather the most massive ones, and with little observed evolution consistent with the findings from Stefanon et al. (2021). This phenomenon has also been observed in some local massive spiral galaxies (e.g., Di Teodoro et al. 2023), which have seemingly matured without significant star formation cessation. One interpretation is that this high star formation efficiency in massive haloes is the result of diminished feedback from AGN, with stellar mass growing similarly to the host halo at early times. Indeed, this is consistent with findings of inefficient radiative AGN feedback from simulations (Kaviraj et al. 2017; Laigle et al. 2019; Roos et al. 2015; Bieri et al. 2017; Habouzit et al. 2022), as well as FIR/radio observations of AGN activity at z > 3 (Maiolino et al. 2012; Cicone et al. 2014, 2015; Padovani et al. 2015; Vito et al. 2018).

At no point do our mass functions, observed or inferred, exceed this upper limit. Therefore we do not report evidence of “impossibly early galaxies” introduced by Steinhardt et al. (2016) who point out that there appears to be too many massive galaxies at z > 4 compared to the dark matter haloes that should host them. However at z > 6.5, where Steinhardt et al. (2016) predicts that the effect will be most obvious, we report observed number densities approaching this upper limit and in clear excess of the SHMR-scaled HMF. We caution that these sources are the most vulnerable to misclassification and bias, being susceptible to blending in addition to being constrained by only a handful of NIR bands yielding proportionately uncertain Schechter fits. Extrapolation to ℳ ≳ 1011.5 ℳ would place their number density below that which can be probed in a volume contained by the 0.716 deg2 area of the Ultra-Deep region, and so COSMOS2020 is unlikely to find them if they exist. While they are not “impossibly early galaxies”, their surprising abundance hint that explorations of z > 7 with future deep, large-volume surveys may provide the evidence necessary to firmly challenge theoretical frameworks.

7. Summary and conclusions

Following on from COSMOS2020 photo-z catalog (Weaver et al. 2022), we study the shape and evolution of the galaxy stellar mass function. Our measurements span three decades in mass (at z ≈ 0.2) across 10 billion years of cosmic history (z = 7.5 → 0.2), including the most mass complete sample of quiescent galaxies at z > 2 enabled by our unprecedented, homogeneous NIR depths across an effective 1.27 deg2. We probe a volume nearly 2× that of Davidzon et al. (2017) which not only improves sample statistics but also finds new galaxies of still greater mass at all redshifts. Complementary deep IRAC coverage has allowed us to directly measure the stellar bulk, and hence galaxy stellar mass, in a less biased way and to higher redshifts compared to Ks-based measurements. We developed a robust, mass-dependent error budget with contributions from poisson, stellar mass, and cosmic variance, and account for the effects of Eddington bias by fitting a kernel-convolved Schechter function to our observed SMF. We use three fitting techniques, including the continuity model of Leja et al. (2019a), finding good agreement with literature measurements with smaller bin-to-bin variance with z. We stress that our derived parameters are dependent on the assumed Eddington correction, and while the inferred SMF evaluated at maximum likelihood and associated parameters are robust, parameters (and their uncertainties) derived from the median of posterior distributions can be unreliable if constraints are weak. To make these results more transparent and to facilitate comparisons, we have made the object IDs and key measurements presented in this work available to download14.

Although literature comparisons on the shape of the SMF at fixed redshift show good agreement, the novel advantage of COSMOS2020 is the extended historical baseline over which the mass functions (as well as many other properties) can be consistently measured. Not only do we examine the evolution of the integrated mass density ρ* over this time, we also examine the remarkably consistent rate of growth in the number densities of systems of differen masses from z ≈ 7 → 1, whereupon the most massive star-forming galaxies become quiescent faster than they can be replaced. Similarly, we find evidence for the sharp rise in low-mass quiescent systems consistent with the phenomenological model of Peng et al. (2010) probed to z ≈ 2.5 where our sample becomes incomplete. Although tentative, we also find evidence for a sustained > 20% fraction of high-mass quiescent systems from z ≈ 5 → 2.

Furthermore, we highlight three main results:

  • Comparisons with several hydrodynamical simulations and semi-analytical models indicate an exceptional degree of agreement for the most massive galaxies out to high-z. This comes despite the surprisingly high number densities of massive galaxies at z ≈ 3 − 5 in excess of a Schechter function, suggesting that existing physical recipes are assembling massive ℳ ≈ 1010 − 11.5 ℳ systems in sufficient quantity at early times. In order to explore star formation cessation and feedback modes, we identify quiescent galaxies out to z ≈ 5.5 by means of a NUVrJ selection and compare them to consistently selected quiescent samples produced by two simulation codes, finding evidence for delayed assembly of low-mass quiescent systems in EAGLE, and too rapid assembly in the SHARK.

  • A closer examination of these massive systems reveals that a quarter are not found in COSMOS2015. Not only are they Ks-faint, but their extremely red colors challenge SED fitting templates. We find no strong evidence for AGN contamination, although we stress the need for future infrared facilities with deep surveys capable of measuring the rest-frame MIR light at z ≈ 3 − 5. Recent findings of optically dark galaxies from IRAC and ALMA suggest that previous studies have missed contributions from dust-obscured star-forming galaxies. Their brightness, redshifts, mass, and number densities are consistent with our findings, suggesting that the Ks ∼ 26 depth of UltraVISTA DR4 may indeed be sufficient to reach out into the tail end of this population missed by previous optical-NIR selections. Further work is required to conclusively establish the nature of these massive galaxy candidates.

  • Lastly, we investigate the connection to dark matter halos by comparing both our observed and inferred SMFs to constraints provided by the HMF. While we confirm the divegence of the SMF from the HMF at both low- and high-mass regimes which has been historically intepreted as evidence for feedback processes, the massive end of the inferred, Schechter-fit SMF comes into agreement with the HMF at z ≳ 2. While we find no evidence of tension which would challenge theoretical models, our observed number densities at z ≈ 3 − 5 approach the upper limit for fully efficient star formation in the most massive halos. Larger volume surveys containing even more massive systems, if they exist, may be able to challenge these models, especially at z ≳ 6 − 7.

The launch of JWST has opened the door on a new era, and it will soon be flanked by efficient survey facilities from space (Euclid, Roman) and the ground (Rubin). While massive quiescent systems may exist at z ∼ 5 and perhaps even at earlier times (Mawatari et al. 2016, 2020), their identification is beyond the reach of COSMOS2020. They may be identified soon by deep degree-scale JWST surveys i.e., COSMOS-Web (Kartaltepe et al. 2021; Casey et al. 2023) and possibly by narrower ones including the JWST Advanced Deep Extragalactic Survey (JADES, Eisenstein et al. 2017; Ferruit 2017), the Cosmic Evolution Early Release Science Survey (CEERS, Finkelstein et al. 2017), the Next Generation Deep Extragalactic Exploratory Public Survey (NGDEEP, Finkelstein et al. 2021), and Ultra-deep NIRCam and NIRSpec Observations Before the Epoch of Reionization (UNCOVER, Labbé et al. 2021; Bezanson et al. 2022) to name a few.

While low mass quiescent systems at z ≳ 2.5 may be found by JWST (with hints emerging from Marchesini et al. 2023), a truly precise quantification of their number density and demographics is likely to remain a future objective. By extrapolating these observations, < 1% of ℳ ≈ 109.5 − 10.0 ℳ are expected to be quiescent by z ∼ 2.5, become even rarer at earlier times. While identifying even one quiescent, low-mass system at z > 2 in the absence of virialized structure would present a significant challenge to the paradigm of Peng et al. (2010), performing a statistically meaningful survey of them will require incredibly deep, degree-scale NIR surveys, placing them out of reach by current facilities. While the deepest degree-scale surveys from JWST (COSMOS-Web), Roman, and Euclid stand to establish the rarity of these systems at z ≈ 2 − 3, it seems that no currently planned survey will be able to definitively quantify their contribution at z ≳ 3.

While explorations of mass-selected samples at z > 7 are being be made possible by JWST, the most UV-luminous sources at these epochs will likely be missed by small area programs and yet are crucial to a comprehensive study of the ionizing UV budget (Kauffmann et al. 2022; Donnan et al. 2023). Following up known samples with JWST will not only allow us to measure the star formation rate and dust content of the first ultra-luminous galaxies from deep within the epoch of reionization, but also to directly identify the progenitors of z ∼ 3 − 4 massive galaxies while still in their formation stage (e.g., Weaver et al. 2021). Yet despite the incredible promises of highly resolved IR imaging and spectroscopy from JWST, identifying the rarest and potentially most informative populations that can challenge and thereby improve galaxy formation models will remain the domain of wide-field surveys.


1

Computed at 3σ from 3″ apertures randomly placed in regions without detected sources; see Table 1 of Weaver et al. (2022).

2

Galaxies are selected by lp_type = 0, see catalog documentation.

3

The COMBINED region corresponds to FLAG_COMBINED = 0.

4

Channel 3 and 4 are too shallow to provide useful constraints at z ≳ 3.

5

zmax estimates are computed with ALF (Ilbert et al. 2005).

6

For example, ℳ computed assuming Salpeter (1955) is on average 0.24 dex larger than those computed assuming Chabrier (2003).

7

The range of the lowest-ℳ bin at each z is adjusted to include its respective mass limit at its lowest extent. Bins of width < 0.05 dex are discarded.

8

This is not formally an STY method, as the normalization is constrained as a fitted parameter.

9

We note that a second order expansion cannot describe an exponential tail as seen here, which justifies z ∼ 3 as the rightmost domain limit reachable by our 12 parameter continuity model.

10

We choose TNG100 as a compromise between resolution and volume, see Pillepich et al. (2018), Donnari et al. (2021a,b).

11

With the exception FLARES and the SANTA CRUZ SAM which are limited to z > 4.5 and z > 4.0, respectively.

12

Sources with X-ray counterparts (i.e., AGN) are classified in Weaver et al. (2022) by lp_type = 2.

13

The Tinker et al. (2008) HMF is computed according to our cosmology (σ8 = 0.82) at the mid-point in each z-bin using COLUSSUS (Diemer 2018), which explicitly takes into account that these mass functions derived were originally derived from spherical overdensities, which is not universal with redshift (see Eqs. (3)–(8) of Tinker et al. 2008). We adopt the definition for a halo as the DM mass contained in a region 200× the mean matter density, and found no significant differences from using other definitions (e.g., friends-of-friends, spherical overdensity, virial radius).

Acknowledgments

The authors thank Vadim Ruskov, Sidney Lower, Desika Narayanan, Stephen Wilkins, William Roper, Lukas Furtak, and Pratika Dayal for helpful discussions. We are also grateful for the many helpful and constructive comments from the anonymous referee. The Cosmic Dawn Center (DAWN) is funded by the Danish National Research Foundation under grant No. 140. S.T. and J.W. acknowledge support from the European Research Council (ERC) Consolidator Grant funding scheme (project ConTExt, grant No. 648179). O.I. acknowledges the funding of the French Agence Nationale de la Recherche for the project iMAGE (grant ANR-22-CE31-0007). H.J.Mc.C. acknowledges support from the PNCG. I.D. has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 896225. This work used the CANDIDE computer system at the IAP supported by grants from the PNCG, CNES, and the DIM-ACAV and maintained by S. Rouberol. B.M.J. is supported in part by Independent Research Fund Denmark grant DFF – 7014-00017. G.E.M. acknowledges the Villum Fonden research grant 13160 “Gas to stars, stars to dust: tracing star formation across cosmic time”. D.R. acknowledges support from the National Science Foundation under grant numbers AST-1614213 and AST-1910107. Y.P. acknowledges National Science Foundation of China (NSFC) Grant No. 12125301, 12192220, 12192222, and the science research grants from the China Manned Space Project with No. CMS-CSST-2021-A07. The authors wish to recognize and acknowledge the very significant cultural role and reverence that the summit of Mauna Kea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain. This work is based on data products from observations made with ESO Telescopes at the La Silla Paranal Observatory under ESO program ID 179.A-2005 and on data products produced by CALET and the Cambridge Astronomy Survey Unit on behalf of the UltraVISTA consortium. This work is based in part on observations made with the NASA/ESA Hubble Space Telescope, obtained from the Data Archive at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-26555. Some of the data presented herein were obtained at the W.M. Keck Observatory, which is operated as a scientific partnership among the California Institute of Technology, the University of California and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W.M. Keck Foundation. This research is also partly supported by the Centre National d’Etudes Spatiales (CNES). These data were obtained and processed as part of the CFHT Large Area U-band Deep Survey (CLAUDS), which is a collaboration between astronomers from Canada, France, and China described in Sawicki et al. (2019). CLAUDS is based on observations obtained with MegaPrime/MegaCam, a joint project of CFHT and CEA/DAPNIA, at the CFHT which is operated by the National Research Council (NRC) of Canada, the Institut National des Science de l’Univers of the Centre National de la Recherche Scientifique (CNRS) of France, and the University of Hawaii. CLAUDS uses data obtained in part through the Telescope Access Program (TAP), which has been funded by the National Astronomical Observatories, Chinese Academy of Sciences, and the Special Fund for Astronomy from the Ministry of Finance of China. CLAUDS uses data products from TERAPIX and the Canadian Astronomy Data Centre (CADC) and was carried out using resources from Compute Canada and Canadian Advanced Network For Astrophysical Research (CANFAR).

References

  1. Adams, N. J., Bowler, R. A. A., Jarvis, M. J., Häußler, B., & Lagos, C. D. P. 2021, MNRAS, 506, 4933 [CrossRef] [Google Scholar]
  2. Aihara, H., AlSayyad, Y., Ando, M., et al. 2019, PASJ, 71, 114 [Google Scholar]
  3. Akhshik, M., Whitaker, K. E., Leja, J., et al. 2023, ApJ, 943, 179 [NASA ADS] [CrossRef] [Google Scholar]
  4. Arnouts, S., Moscardini, L., Vanzella, E., et al. 2002, MNRAS, 329, 355 [Google Scholar]
  5. Arnouts, S., Le Floc’h, E., Chevallard, J., et al. 2013, A&A, 558, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Balogh, M. L., Navarro, J. F., & Morris, S. L. 2000, ApJ, 540, 113 [Google Scholar]
  7. Barrufet, L., Oesch, P., Fudamoto, Y., et al. 2021, JWST Proposal, Cycle 1, 2198 [Google Scholar]
  8. Behroozi, P., & Silk, J. 2018, MNRAS, 477, 5382 [Google Scholar]
  9. Behroozi, P. S., Conroy, C., & Wechsler, R. H. 2010, ApJ, 717, 379 [Google Scholar]
  10. Behroozi, P. S., Wechsler, R. H., & Conroy, C. 2013, ApJ, 770, 57 [NASA ADS] [CrossRef] [Google Scholar]
  11. Behroozi, P., Wechsler, R. H., Hearin, A. P., & Conroy, C. 2019, MNRAS, 488, 3143 [NASA ADS] [CrossRef] [Google Scholar]
  12. Belli, S., Newman, A. B., & Ellis, R. S. 2019, ApJ, 874, 17 [Google Scholar]
  13. Bernardi, M., Meert, A., Sheth, R. K., et al. 2017, MNRAS, 467, 2217 [Google Scholar]
  14. Bertin, E. 2010, Astrophysics Source Code Library [record ascl:1010.068] [Google Scholar]
  15. Bertin, E., & Arnouts, S. 1996, A&AS, 117, 393 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Bezanson, R., Labbé, I., Whitaker, K. E., et al. 2022, ApJ, submitted [arXiv:2212.04026] [Google Scholar]
  17. Bhowmick, A. K., Somerville, R. S., Di Matteo, T., et al. 2020, MNRAS, 496, 754 [Google Scholar]
  18. Bieri, R., Dubois, Y., Rosdahl, J., et al. 2017, MNRAS, 464, 1854 [NASA ADS] [CrossRef] [Google Scholar]
  19. Binggeli, B., & Jerjen, H. 1998, A&A, 333, 17 [NASA ADS] [Google Scholar]
  20. Birnboim, Y., & Dekel, A. 2003, MNRAS, 345, 349 [Google Scholar]
  21. Bolzonella, M., Kovač, K., Pozzetti, L., et al. 2010, A&A, 524, A76 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  22. Bouché, N., Dekel, A., Genzel, R., et al. 2010, ApJ, 718, 1001 [Google Scholar]
  23. Bouwens, R. J., Illingworth, G. D., Oesch, P. A., et al. 2012a, ApJ, 754, 83 [Google Scholar]
  24. Bouwens, R. J., Illingworth, G. D., Oesch, P. A., et al. 2012b, ApJ, 752, L5 [Google Scholar]
  25. Bowler, R. A. A., Dunlop, J. S., McLure, R. J., et al. 2012, MNRAS, 426, 2772 [NASA ADS] [CrossRef] [Google Scholar]
  26. Brammer, G. B., van Dokkum, P. G., & Coppi, P. 2008, ApJ, 686, 1503 [Google Scholar]
  27. Brinch, M., Greve, T. R., Weaver, J. R., et al. 2023, ApJ, 943, 153 [NASA ADS] [CrossRef] [Google Scholar]
  28. Brinchmann, J., Charlot, S., White, S. D. M., et al. 2004, MNRAS, 351, 1151 [Google Scholar]
  29. Caputi, K. I., Cirasuolo, M., Dunlop, J. S., et al. 2011, MNRAS, 413, 162 [NASA ADS] [CrossRef] [Google Scholar]
  30. Caputi, K. I., Ilbert, O., Laigle, C., et al. 2015, ApJ, 810, 73 [Google Scholar]
  31. Carnall, A. C., McLure, R. J., Dunlop, J. S., & Davé, R. 2018, MNRAS, 480, 4379 [Google Scholar]
  32. Carnall, A. C., Leja, J., Johnson, B. D., et al. 2019, ApJ, 873, 44 [Google Scholar]
  33. Carnall, A., Begley, R. A., Cimatti, A., et al. 2021, JWST Proposal, Cycle 1, 2285 [Google Scholar]
  34. Casey, C. M., Narayanan, D., & Cooray, A. 2014, Phys. Rep., 541, 45 [Google Scholar]
  35. Casey, C. M., Zavala, J. A., Manning, S. M., et al. 2021, ApJ, 923, 215 [NASA ADS] [CrossRef] [Google Scholar]
  36. Casey, C. M., Kartaltepe, J. S., Drakos, N. E., et al. 2023, ApJ, 954, 31 [NASA ADS] [CrossRef] [Google Scholar]
  37. Cecchi, R., Bolzonella, M., Cimatti, A., & Girelli, G. 2019, ApJ, 880, L14 [NASA ADS] [CrossRef] [Google Scholar]
  38. Chabrier, G. 2003, PASP, 115, 763 [Google Scholar]
  39. Cicone, C., Maiolino, R., Sturm, E., et al. 2014, A&A, 562, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Cicone, C., Maiolino, R., Gallerani, S., et al. 2015, A&A, 574, A14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Civano, F., Marchesi, S., Comastri, A., et al. 2016, ApJ, 819, 62 [Google Scholar]
  42. Cole, S., Norberg, P., Baugh, C. M., et al. 2001, MNRAS, 326, 255 [NASA ADS] [CrossRef] [Google Scholar]
  43. Conroy, C., & Gunn, J. E. 2010, ApJ, 712, 833 [Google Scholar]
  44. Conroy, C., & Wechsler, R. H. 2009, ApJ, 696, 620 [NASA ADS] [CrossRef] [Google Scholar]
  45. Conroy, C., Gunn, J. E., & White, M. 2009, ApJ, 699, 486 [Google Scholar]
  46. Conselice, C. J., Mundy, C. J., Ferreira, L., & Duncan, K. 2022, ApJ, 940, 168 [CrossRef] [Google Scholar]
  47. Cooper, O. R., Casey, C. M., Zavala, J. A., et al. 2022, ApJ, 930, 32 [NASA ADS] [CrossRef] [Google Scholar]
  48. Cowie, L. L., Songaila, A., Hu, E. M., & Cohen, J. G. 1996, AJ, 112, 839 [Google Scholar]
  49. Cowley, W. I., Caputi, K. I., Deshmukh, S., et al. 2019, ApJ, 874, 114 [NASA ADS] [CrossRef] [Google Scholar]
  50. Crain, R. A., Schaye, J., Bower, R. G., et al. 2015, MNRAS, 450, 1937 [NASA ADS] [CrossRef] [Google Scholar]
  51. Croton, D. J. 2013, PASA, 30, e052 [NASA ADS] [CrossRef] [Google Scholar]
  52. Daddi, E., Cimatti, A., Renzini, A., et al. 2004, ApJ, 617, 746 [NASA ADS] [CrossRef] [Google Scholar]
  53. Daddi, E., Dickinson, M., Morrison, G., et al. 2007, ApJ, 670, 156 [NASA ADS] [CrossRef] [Google Scholar]
  54. Davé, R., Anglés-Alcázar, D., Narayanan, D., et al. 2019, MNRAS, 486, 2827 [Google Scholar]
  55. Davidzon, I., Cucciati, O., Bolzonella, M., et al. 2016, A&A, 586, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  56. Davidzon, I., Ilbert, O., Laigle, C., et al. 2017, A&A, 605, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  57. Davidzon, I., Ilbert, O., Faisst, A. L., Sparre, M., & Capak, P. L. 2018, ApJ, 852, 107 [NASA ADS] [CrossRef] [Google Scholar]
  58. Debuhr, J., Quataert, E., & Ma, C.-P. 2012, MNRAS, 420, 2221 [NASA ADS] [CrossRef] [Google Scholar]
  59. Di Teodoro, E. M., Posti, L., Fall, S. M., et al. 2023, MNRAS, 518, 6340 [Google Scholar]
  60. Diemer, B. 2018, ApJS, 239, 35 [NASA ADS] [CrossRef] [Google Scholar]
  61. Donley, J. L., Koekemoer, A. M., Brusa, M., et al. 2012, ApJ, 748, 142 [Google Scholar]
  62. Donnan, C. T., McLeod, D. J., Dunlop, J. S., et al. 2023, MNRAS, 518, 6011 [Google Scholar]
  63. Donnari, M., Pillepich, A., Joshi, G. D., et al. 2021a, MNRAS, 500, 4004 [Google Scholar]
  64. Donnari, M., Pillepich, A., Nelson, D., et al. 2021b, MNRAS, 506, 4760 [NASA ADS] [CrossRef] [Google Scholar]
  65. Driver, S. P., & Robotham, A. S. G. 2010, MNRAS, 407, 2131 [Google Scholar]
  66. Drory, N., Bundy, K., Leauthaud, A., et al. 2009, ApJ, 707, 1595 [NASA ADS] [CrossRef] [Google Scholar]
  67. Dubois, Y., Pichon, C., Welker, C., et al. 2014, MNRAS, 444, 1453 [Google Scholar]
  68. Duncan, K., Conselice, C. J., Mortlock, A., et al. 2014, MNRAS, 444, 2960 [Google Scholar]
  69. Ebeling, H. 2003, MNRAS, 340, 1269 [NASA ADS] [CrossRef] [Google Scholar]
  70. Eddington, A. S. 1913, MNRAS, 73, 359 [Google Scholar]
  71. Efstathiou, G., & Bond, J. R. 1999, MNRAS, 304, 75 [NASA ADS] [CrossRef] [Google Scholar]
  72. Efstathiou, G., Ellis, R. S., & Peterson, B. A. 1988, MNRAS, 232, 431 [Google Scholar]
  73. Eisenstein, D. J., Ferruit, P., & Rieke, M. J. 2017, JWST Proposal, Cycle 1, 1181 [Google Scholar]
  74. Euclid Collaboration (Moneti, A., et al.) 2022, A&A, 658, A126 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Faisst, A. L., Carollo, C. M., Capak, P. L., et al. 2017, ApJ, 839, 71 [NASA ADS] [CrossRef] [Google Scholar]
  76. Ferruit, P. 2017, JWST Proposal, Cycle 1, 1210 [Google Scholar]
  77. Finkelstein, S. L., Dickinson, M., Ferguson, H. C., et al. 2017, JWST Proposal, 1345, Cycle 0 Early Release Science [Google Scholar]
  78. Finkelstein, S. L., Papovich, C., Pirzkal, N., et al. 2021, JWST Proposal, Cycle 1, 2079 [Google Scholar]
  79. Fontana, A., Salimbeni, S., Grazian, A., et al. 2006, A&A, 459, 745 [CrossRef] [EDP Sciences] [Google Scholar]
  80. Fontana, A., Dunlop, J. S., Paris, D., et al. 2014, A&A, 570, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  81. Fontanot, F., De Lucia, G., Monaco, P., Somerville, R. S., & Santini, P. 2009, MNRAS, 397, 1776 [NASA ADS] [CrossRef] [Google Scholar]
  82. Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
  83. Forrest, B., Annunziatella, M., Wilson, G., et al. 2020a, ApJ, 890, L1 [NASA ADS] [CrossRef] [Google Scholar]
  84. Forrest, B., Marsan, Z. C., Annunziatella, M., et al. 2020b, ApJ, 903, 47 [NASA ADS] [CrossRef] [Google Scholar]
  85. Förster Schreiber, N. M., & Wuyts, S. 2020, ARA&A, 58, 661 [Google Scholar]
  86. Fudamoto, Y., Oesch, P. A., Faisst, A., et al. 2020, A&A, 643, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  87. Fudamoto, Y., Oesch, P. A., Schouws, S., et al. 2021, Nature, 597, 489 [CrossRef] [Google Scholar]
  88. Furlong, M., Bower, R. G., Theuns, T., et al. 2015, MNRAS, 450, 4486 [Google Scholar]
  89. Gabor, J. M., & Davé, R. 2015, MNRAS, 447, 374 [NASA ADS] [CrossRef] [Google Scholar]
  90. Gehrels, N. 1986, ApJ, 303, 336 [Google Scholar]
  91. Glazebrook, K., Peacock, J. A., Miller, L., & Collins, C. A. 1995, MNRAS, 275, 169 [NASA ADS] [CrossRef] [Google Scholar]
  92. Glazebrook, K., Schreiber, C., Labbé, I., et al. 2017, Nature, 544, 71 [Google Scholar]
  93. Glazebrook, K., Nanayakkara, T., Esdaile, J., et al. 2021, JWST Proposal, Cycle 1, 2565 [Google Scholar]
  94. Gobat, R., Strazzullo, V., Daddi, E., et al. 2012, ApJ, 759, L44 [Google Scholar]
  95. González, V., Labbé, I., Bouwens, R. J., et al. 2011, ApJ, 735, L34 [Google Scholar]
  96. Goodman, J., & Weare, J. 2010, Commun. Appl. Math. Comput. Sci., 5, 65 [Google Scholar]
  97. Gould, K. M. L., Brammer, G., Valentino, F., et al. 2023, AJ, 165, 248 [NASA ADS] [CrossRef] [Google Scholar]
  98. Grazian, A., Fontana, A., Santini, P., et al. 2015, A&A, 575, A96 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Grogin, N. A., Kocevski, D. D., Faber, S. M., et al. 2011, ApJS, 197, 35 [NASA ADS] [CrossRef] [Google Scholar]
  100. Gruppioni, C., Béthermin, M., Loiacono, F., et al. 2020, A&A, 643, A8 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  101. Gunn, J. E., & Gott, J. R., III 1972, ApJ, 176, 1 [Google Scholar]
  102. Habouzit, M., Somerville, R. S., Li, Y., et al. 2022, MNRAS, 509, 3015 [Google Scholar]
  103. Harikane, Y., Ouchi, M., Ono, Y., et al. 2016, ApJ, 821, 123 [NASA ADS] [CrossRef] [Google Scholar]
  104. Henriques, B. M. B., White, S. D. M., Thomas, P. A., et al. 2015, MNRAS, 451, 2663 [Google Scholar]
  105. Hill, A. R., Muzzin, A., Franx, M., et al. 2017, ApJ, 837, 147 [NASA ADS] [CrossRef] [Google Scholar]
  106. Hogg, D. W. 1999, ArXiv e-prints [arXiv:astro-ph/9905116] [Google Scholar]
  107. Hogg, D. W., Bovy, J., & Lang, D. 2010, ArXiv e-prints [arXiv:1008.4686] [Google Scholar]
  108. Hsieh, B.-C., Wang, W.-H., Hsieh, C.-C., et al. 2012, ApJS, 203, 23 [NASA ADS] [CrossRef] [Google Scholar]
  109. Hviding, R. E., Hainline, K. N., Rieke, M., et al. 2022, AJ, 163, 224 [NASA ADS] [CrossRef] [Google Scholar]
  110. Ilbert, O., Tresse, L., Arnouts, S., et al. 2004, MNRAS, 351, 541 [NASA ADS] [CrossRef] [Google Scholar]
  111. Ilbert, O., Tresse, L., Zucca, E., et al. 2005, A&A, 439, 863 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  112. Ilbert, O., Arnouts, S., McCracken, H. J., et al. 2006, A&A, 457, 841 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  113. Ilbert, O., Salvato, M., Le Floc’h, E., et al. 2010, ApJ, 709, 644 [Google Scholar]
  114. Ilbert, O., McCracken, H. J., Le Fèvre, O., et al. 2013, A&A, 556, A55 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  115. Ito, K., Tanaka, M., Miyaji, T., et al. 2022, ApJ, 929, 53 [NASA ADS] [CrossRef] [Google Scholar]
  116. James, F., & Roos, M. 1975, Comput. Phys. Commun., 10, 343 [Google Scholar]
  117. Jin, S., Daddi, E., Liu, D., et al. 2018, ApJ, 864, 56 [Google Scholar]
  118. Jin, S., Daddi, E., Magdis, G. E., et al. 2019, ApJ, 887, 144 [Google Scholar]
  119. Johnston, R. 2011, A&ARv, 19, 41 [NASA ADS] [CrossRef] [Google Scholar]
  120. Kartaltepe, J., Casey, C. M., Bagley, M., et al. 2021, JWST Proposal, Cycle 1, 1727 [Google Scholar]
  121. Kauffmann, O. B., Ilbert, O., Weaver, J. R., et al. 2022, A&A, 667, A65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  122. Kaviraj, S., Laigle, C., Kimm, T., et al. 2017, MNRAS, 467, 4739 [NASA ADS] [Google Scholar]
  123. Koekemoer, A. M., Aussel, H., Calzetti, D., et al. 2007, ApJS, 172, 196 [Google Scholar]
  124. Koekemoer, A. M., Faber, S. M., Ferguson, H. C., et al. 2011, ApJS, 197, 36 [NASA ADS] [CrossRef] [Google Scholar]
  125. Kokorev, V. I., Magdis, G. E., Davidzon, I., et al. 2021, ApJ, 921, 40 [NASA ADS] [CrossRef] [Google Scholar]
  126. Labbé, I., Bezanson, R., Atek, H., et al. 2021, JWST Proposal, Cycle 1, 2561 [Google Scholar]
  127. Lagos, C. D. P., Tobar, R. J., Robotham, A. S. G., et al. 2018, MNRAS, 481, 3573 [CrossRef] [Google Scholar]
  128. Laigle, C., McCracken, H. J., Ilbert, O., et al. 2016, ApJS, 224, 24 [Google Scholar]
  129. Laigle, C., Davidzon, I., Ilbert, O., et al. 2019, MNRAS, 486, 5104 [Google Scholar]
  130. Lang, D., Hogg, D. W., & Mykytyn, D. 2016, Astrophysics Source Code Library [record ascl:1604.008] [Google Scholar]
  131. Larson, R. B., Tinsley, B. M., & Caldwell, C. N. 1980, ApJ, 237, 692 [Google Scholar]
  132. Legrand, L., McCracken, H. J., Davidzon, I., et al. 2019, MNRAS, 486, 5468 [NASA ADS] [CrossRef] [Google Scholar]
  133. Leja, J., Carnall, A. C., Johnson, B. D., Conroy, C., & Speagle, J. S. 2019a, ApJ, 876, 3 [Google Scholar]
  134. Leja, J., Tacchella, S., & Conroy, C. 2019b, ApJ, 880, L9 [NASA ADS] [CrossRef] [Google Scholar]
  135. Lilly, S. J., Carollo, C. M., Pipino, A., Renzini, A., & Peng, Y. 2013, ApJ, 772, 119 [NASA ADS] [CrossRef] [Google Scholar]
  136. Liu, D., Lang, P., Magnelli, B., et al. 2019, ApJS, 244, 40 [Google Scholar]
  137. Long, A. S., Casey, C. M., Lagos, C. D. P., et al. 2023, ApJ, 953, 11 [NASA ADS] [CrossRef] [Google Scholar]
  138. Lovell, C. C., Vijayan, A. P., Thomas, P. A., et al. 2021, MNRAS, 500, 2127 [Google Scholar]
  139. Lower, S., Narayanan, D., Leja, J., et al. 2020, ApJ, 904, 33 [NASA ADS] [CrossRef] [Google Scholar]
  140. Lower, S., Narayanan, D., Li, Q., & Davé, R. 2023, ApJ, 950, 94 [CrossRef] [Google Scholar]
  141. Lustig, P., Strazzullo, V., Remus, R. S., et al. 2023, MNRAS, 518, 5953 [Google Scholar]
  142. Madau, P., & Dickinson, M. 2014, ARA&A, 52, 415 [Google Scholar]
  143. Main, R. A., McNamara, B. R., Nulsen, P. E. J., Russell, H. R., & Vantyghem, A. N. 2017, MNRAS, 464, 4360 [NASA ADS] [CrossRef] [Google Scholar]
  144. Maiolino, R., Gallerani, S., Neri, R., et al. 2012, MNRAS, 425, L66 [Google Scholar]
  145. Malmquist, K. G. 1920, Meddelanden fran Lunds Astronomiska Observatorium Serie I, 96, 1 [NASA ADS] [Google Scholar]
  146. Malmquist, K. G. 1922, Meddelanden fran Lunds Astronomiska Observatorium Serie I, 100, 1 [Google Scholar]
  147. Mandelbaum, R., Seljak, U., Kauffmann, G., Hirata, C. M., & Brinkmann, J. 2006, MNRAS, 368, 715 [NASA ADS] [CrossRef] [Google Scholar]
  148. Manning, S. M., Casey, C. M., Zavala, J. A., et al. 2022, ApJ, 925, 23 [CrossRef] [Google Scholar]
  149. Marchesini, D., van Dokkum, P. G., Förster Schreiber, N. M., et al. 2009, ApJ, 701, 1765 [NASA ADS] [CrossRef] [Google Scholar]
  150. Marchesini, D., Brammer, G., Morishita, T., et al. 2023, ApJ, 942, L25 [NASA ADS] [CrossRef] [Google Scholar]
  151. Marrone, D. P., Spilker, J. S., Hayward, C. C., et al. 2018, Nature, 553, 51 [NASA ADS] [CrossRef] [Google Scholar]
  152. Marsan, Z. C., Muzzin, A., Marchesini, D., et al. 2022, ApJ, 924, 25 [NASA ADS] [CrossRef] [Google Scholar]
  153. Martis, N. S., Marchesini, D., Brammer, G. B., et al. 2016, ApJ, 827, L25 [CrossRef] [Google Scholar]
  154. Mawatari, K., Yamada, T., Fazio, G. G., Huang, J.-S., & Ashby, M. L. N. 2016, PASJ, 68, 46 [NASA ADS] [CrossRef] [Google Scholar]
  155. Mawatari, K., Inoue, A. K., Hashimoto, T., et al. 2020, ApJ, 889, 137 [NASA ADS] [CrossRef] [Google Scholar]
  156. McConachie, I., Wilson, G., Forrest, B., et al. 2022, ApJ, 926, 37 [NASA ADS] [CrossRef] [Google Scholar]
  157. McCracken, H. J., Milvang-Jensen, B., Dunlop, J., et al. 2012, A&A, 544, A156 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  158. McLeod, D. J., McLure, R. J., Dunlop, J. S., et al. 2021, MNRAS, 503, 4413 [NASA ADS] [CrossRef] [Google Scholar]
  159. Michałowski, M. J., Dunlop, J. S., Cirasuolo, M., et al. 2012, A&A, 541, A85 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  160. Michałowski, M. J., Hayward, C. C., Dunlop, J. S., et al. 2014, A&A, 571, A75 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  161. Milvang-Jensen, B., Freudling, W., Zabl, J., et al. 2013, A&A, 560, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  162. Mitchell, P. D., Schaye, J., Bower, R. G., & Crain, R. A. 2020, MNRAS, 494, 3971 [Google Scholar]
  163. Moneti, A., McCracken, H. J., Rouberol, S., et al. 2023, VizieR On-line Data Catalog: II/373 [Google Scholar]
  164. Mortlock, A., Conselice, C. J., Bluck, A. F. L., et al. 2011, MNRAS, 413, 2845 [Google Scholar]
  165. Mortlock, A., Conselice, C. J., Hartley, W. G., et al. 2015, MNRAS, 447, 2 [NASA ADS] [CrossRef] [Google Scholar]
  166. Moster, B. P., Somerville, R. S., Newman, J. A., & Rix, H.-W. 2011, ApJ, 731, 113 [Google Scholar]
  167. Moster, B. P., Naab, T., & White, S. D. M. 2013, MNRAS, 428, 3121 [Google Scholar]
  168. Moutard, T., Arnouts, S., Ilbert, O., et al. 2016, A&A, 590, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  169. Muzzin, A., Marchesini, D., Stefanon, M., et al. 2013, ApJ, 777, 18 [NASA ADS] [CrossRef] [Google Scholar]
  170. Nayyeri, H., Hemmati, S., Mobasher, B., et al. 2017, ApJS, 228, 7 [NASA ADS] [CrossRef] [Google Scholar]
  171. Neistein, E., & Dekel, A. 2008, MNRAS, 388, 1792 [Google Scholar]
  172. Neistein, E., van den Bosch, F. C., & Dekel, A. 2006, MNRAS, 372, 933 [NASA ADS] [CrossRef] [Google Scholar]
  173. Noeske, K. G., Weiner, B. J., Faber, S. M., et al. 2007, ApJ, 660, L43 [CrossRef] [Google Scholar]
  174. Oke, J. B. 1974, ApJS, 27, 21 [Google Scholar]
  175. Pacifici, C., da Cunha, E., Charlot, S., et al. 2015, MNRAS, 447, 786 [NASA ADS] [CrossRef] [Google Scholar]
  176. Padovani, P., Bonzini, M., Kellermann, K. I., et al. 2015, MNRAS, 452, 1263 [Google Scholar]
  177. Peng, Y.-J., Lilly, S. J., Kovač, K., et al. 2010, ApJ, 721, 193 [Google Scholar]
  178. Peng, Y.-J., Lilly, S. J., Renzini, A., & Carollo, M. 2012, ApJ, 757, 4 [NASA ADS] [CrossRef] [Google Scholar]
  179. Peng, Y.-J., Lilly, S. J., Renzini, A., & Carollo, M. 2014, ApJ, 790, 95 [NASA ADS] [CrossRef] [Google Scholar]
  180. Peng, Y., Maiolino, R., & Cochrane, R. 2015, Nature, 521, 192 [Google Scholar]
  181. Pillepich, A., Nelson, D., Hernquist, L., et al. 2018, MNRAS, 475, 648 [Google Scholar]
  182. Pope, A., Aretxaga, I., Hughes, D., Wilson, G., & Yun, M. 2019, Am. Astron. Soc. Meet. Abstr., 233, 363.20 [NASA ADS] [Google Scholar]
  183. Popesso, P., Concas, A., Cresci, G., et al. 2023, MNRAS, 519, 1526 [Google Scholar]
  184. Pozzetti, L., Bolzonella, M., Zucca, E., et al. 2010, A&A, 523, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  185. Retzlaff, J., Rosati, P., Dickinson, M., et al. 2010, A&A, 511, A50 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  186. Richardson, M. L. A., Scannapieco, E., Devriendt, J., et al. 2016, ApJ, 825, 83 [NASA ADS] [CrossRef] [Google Scholar]
  187. Rigby, J., Perrin, M., McElwain, M., et al. 2023, PASP, 135, 048001 [NASA ADS] [CrossRef] [Google Scholar]
  188. Roos, O., Juneau, S., Bournaud, F., & Gabor, J. M. 2015, ApJ, 800, 19 [NASA ADS] [CrossRef] [Google Scholar]
  189. Saintonge, A., Lutz, D., Genzel, R., et al. 2013, ApJ, 778, 2 [Google Scholar]
  190. Salim, S., Rich, R. M., Charlot, S., et al. 2007, ApJS, 173, 267 [NASA ADS] [CrossRef] [Google Scholar]
  191. Salpeter, E. E. 1955, ApJ, 121, 161 [Google Scholar]
  192. Salvato, M., Ilbert, O., Hasinger, G., et al. 2011, ApJ, 742, 61 [Google Scholar]
  193. Sandage, A., Tammann, G. A., & Yahil, A. 1979, ApJ, 232, 352 [NASA ADS] [CrossRef] [Google Scholar]
  194. Sanders, D. B., Salvato, M., Aussel, H., et al. 2007, ApJS, 172, 86 [Google Scholar]
  195. Santini, P., Fontana, A., Grazian, A., et al. 2012, A&A, 538, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  196. Santini, P., Castellano, M., Merlin, E., et al. 2021, A&A, 652, A30 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  197. Santini, P., Castellano, M., Fontana, A., et al. 2022, ApJ, 940, 135 [NASA ADS] [CrossRef] [Google Scholar]
  198. Sawicki, M., Arnouts, S., Huang, J., et al. 2019, MNRAS, 489, 5202 [NASA ADS] [Google Scholar]
  199. Schechter, P. 1976, ApJ, 203, 297 [Google Scholar]
  200. Schmidt, M. 1968, ApJ, 151, 393 [Google Scholar]
  201. Schreiber, C., Glazebrook, K., Nanayakkara, T., et al. 2018a, A&A, 618, A85 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  202. Schreiber, C., Labbé, I., Glazebrook, K., et al. 2018b, A&A, 611, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  203. Scoville, N., Aussel, H., Brusa, M., et al. 2007, ApJS, 172, 1 [Google Scholar]
  204. Shahidi, A., Mobasher, B., Nayyeri, H., et al. 2020, ApJ, 897, 44 [NASA ADS] [CrossRef] [Google Scholar]
  205. Shankar, F., Lapi, A., Salucci, P., De Zotti, G., & Danese, L. 2006, ApJ, 643, 14 [NASA ADS] [CrossRef] [Google Scholar]
  206. Shu, X., Yang, L., Liu, D., et al. 2022, ApJ, 926, 155 [NASA ADS] [CrossRef] [Google Scholar]
  207. Shuntov, M., McCracken, H. J., Gavazzi, R., et al. 2022, A&A, 664, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  208. Smolčić, V., Novak, M., Bondi, M., et al. 2017, A&A, 602, A1 [Google Scholar]
  209. Somerville, R. S., Popping, G., & Trager, S. C. 2015, MNRAS, 453, 4337 [Google Scholar]
  210. Song, M., Finkelstein, S. L., Ashby, M. L. N., et al. 2016, ApJ, 825, 5 [NASA ADS] [CrossRef] [Google Scholar]
  211. Songaila, A., Cowie, L. L., Hu, E. M., & Gardner, J. P. 1994, ApJS, 94, 461 [NASA ADS] [CrossRef] [Google Scholar]
  212. Stefanon, M., Labbé, I., Oesch, P. A., et al. 2021, ApJS, 257, 68 [NASA ADS] [CrossRef] [Google Scholar]
  213. Steinhardt, C. L., Capak, P., Masters, D., & Speagle, J. S. 2016, ApJ, 824, 21 [NASA ADS] [CrossRef] [Google Scholar]
  214. Steinhardt, C. L., Weaver, J. R., Maxfield, J., et al. 2020, ApJ, 891, 136 [NASA ADS] [CrossRef] [Google Scholar]
  215. Steinhardt, C. L., Jespersen, C. K., & Linzer, N. B. 2021, ApJ, 923, 8 [NASA ADS] [CrossRef] [Google Scholar]
  216. Stern, D., Eisenhardt, P., Gorjian, V., et al. 2005, ApJ, 631, 163 [Google Scholar]
  217. Straatman, C. M. S., Labbé, I., Spitler, L. R., et al. 2015, ApJ, 808, L29 [Google Scholar]
  218. Suess, K. A., Kriek, M., Price, S. H., & Barro, G. 2021, ApJ, 915, 87 [NASA ADS] [CrossRef] [Google Scholar]
  219. Sun, F., Egami, E., Pérez-González, P. G., et al. 2021, ApJ, 922, 114 [NASA ADS] [CrossRef] [Google Scholar]
  220. Szalay, A. S., Connolly, A. J., & Szokoly, G. P. 1999, AJ, 117, 68 [Google Scholar]
  221. Takeuchi, T. T. 2000, Ap&SS, 271, 213 [NASA ADS] [CrossRef] [Google Scholar]
  222. Tanaka, M. 2015, ApJ, 801, 20 [Google Scholar]
  223. Tanaka, M., Valentino, F., Toft, S., et al. 2019, ApJ, 885, L34 [Google Scholar]
  224. Thorne, J. E., Robotham, A. S. G., Davies, L. J. M., et al. 2021, MNRAS, 505, 540 [NASA ADS] [CrossRef] [Google Scholar]
  225. Tinker, J., Kravtsov, A. V., Klypin, A., et al. 2008, ApJ, 688, 709 [Google Scholar]
  226. Toft, S., Zabl, J., Richard, J., et al. 2017, Nature, 546, 510 [NASA ADS] [CrossRef] [Google Scholar]
  227. Tomczak, A. R., Quadri, R. F., Tran, K.-V. H., et al. 2014, ApJ, 783, 85 [Google Scholar]
  228. Torrey, P., Vogelsberger, M., Genel, S., et al. 2014, MNRAS, 438, 1985 [CrossRef] [Google Scholar]
  229. Trapp, A. C., & Furlanetto, S. R. 2020, MNRAS, 499, 2401 [CrossRef] [Google Scholar]
  230. Trenti, M., & Stiavelli, M. 2008, ApJ, 676, 767 [Google Scholar]
  231. Ucci, G., Dayal, P., Hutter, A., et al. 2021, MNRAS, 506, 202 [CrossRef] [Google Scholar]
  232. Valentino, F., Tanaka, M., Davidzon, I., et al. 2020, ApJ, 889, 93 [Google Scholar]
  233. Varma, S., Huertas-Company, M., Pillepich, A., et al. 2022, MNRAS, 509, 2654 [NASA ADS] [Google Scholar]
  234. Vito, F., Brandt, W. N., Yang, G., et al. 2018, MNRAS, 473, 2378 [Google Scholar]
  235. Volonteri, M., & Reines, A. E. 2016, ApJ, 820, L6 [NASA ADS] [CrossRef] [Google Scholar]
  236. Vulcani, B., Poggianti, B. M., Oemler, A., et al. 2013, A&A, 550, A58 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  237. Wang, T., Schreiber, C., Elbaz, D., et al. 2019, Nature, 572, 211 [Google Scholar]
  238. Weaver, J. R., Brammer, G., Casey, C. M., et al. 2021, JWST Proposal, Cycle 1, 2659 [Google Scholar]
  239. Weaver, J. R., Kauffmann, O. B., Ilbert, O., et al. 2022, ApJS, 258, 11 [NASA ADS] [CrossRef] [Google Scholar]
  240. Weaver, J. R., Zalesky, L., Kokorev, V., et al. 2023, ApJS, submitted [Google Scholar]
  241. Wechsler, R. H., & Tinker, J. L. 2018, ARA&A, 56, 435 [NASA ADS] [CrossRef] [Google Scholar]
  242. Weigel, A. K., Schawinski, K., & Bruderer, C. 2016, MNRAS, 459, 2150 [NASA ADS] [CrossRef] [Google Scholar]
  243. Whitaker, K. E., van Dokkum, P. G., Brammer, G., & Franx, M. 2012, ApJ, 754, L29 [Google Scholar]
  244. Whitaker, K. E., Franx, M., Leja, J., et al. 2014, ApJ, 795, 104 [NASA ADS] [CrossRef] [Google Scholar]
  245. Whitaker, K. E., Williams, C. C., Mowla, L., et al. 2021, Nature, 597, 485 [NASA ADS] [CrossRef] [Google Scholar]
  246. Williams, R. J., Quadri, R. F., Franx, M., van Dokkum, P., & Labbé, I. 2009, ApJ, 691, 1879 [NASA ADS] [CrossRef] [Google Scholar]
  247. Wright, A. H., Driver, S. P., & Robotham, A. S. G. 2018, MNRAS, 480, 3491 [Google Scholar]
  248. Yung, L. Y. A., Somerville, R. S., Finkelstein, S. L., Popping, G., & Davé, R. 2019a, MNRAS, 483, 2983 [NASA ADS] [CrossRef] [Google Scholar]
  249. Yung, L. Y. A., Somerville, R. S., Popping, G., et al. 2019b, MNRAS, 490, 2855 [NASA ADS] [CrossRef] [Google Scholar]
  250. Zavala, J. A., Casey, C. M., Manning, S. M., et al. 2021, ApJ, 909, 165 [CrossRef] [Google Scholar]

Appendix A: Sample selection

As discussed in Section 3, we select sources from the 1.27 deg2COMBINED region for sources 0.2 < z ≤ 6.5, and impose an additional restriction of the ultra-deep stripe for sources 6.5 < z ≤ 7.5. The three remaining criteria are aimed to ensure secure photo-z and ℳ based on the IRAC ch1 magnitude, SED χ2 fit quality, and ℒ(z). As shown in Fig. A.1, there is considerable overlap between these three criteria, with mch1 > 26 accounting for 94% of the sample removed by combining these three.

thumbnail Fig. A.1.

Fraction of sources removed by a single criterion that are also removed by another, after applying the primary criteria on area and photo-z. E.g., 71% of χ SED 2 >10 $ \chi_{\rm SED}^2>10 $ sources have mch1 > 26.

Appendix B: Further comparisons with simulations

Here we include additional figures comparing the total, star-forming, and quiescent mass functions (Fig. B.1) and quiescent fractions (Fig. B.2) from this work to those of UNIVERSE MACHINE Behroozi et al. (2019, sSFR < 10−11 ℳ yr−1) as well as those of consistently NUVrJ-selected quiescent galaxies from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018).

thumbnail Fig. B.1.

Galaxy stellar mass functions for total (gray), star-forming (blue), and quiescent (orange) samples from this work compared to simulations. We include those from UNIVERSE MACHINE Behroozi et al. (2019, sSFR < 10−11 ℳ yr−1) shown by dotted lines with hatched uncertainty envelopes, as well as consistently NUVrJ-selected samples from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018) shown by the dashed and dot-dashed lines, respectively. Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

thumbnail Fig. B.2.

Fraction of star-forming (blue) and quiescent (orange) galaxy samples as a function of mass at for three redshift ranges compared to simulations. We include measurements from UNIVERSE MACHINE Behroozi et al. (2019, sSFR < 10−11 ℳ yr−1) shown by dotted lines, as well as those from consistently NUVrJ-selected samples from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018).

Appendix C: Fitting Results

Here we include the inferred total, star-forming, and quiescent mass functions (Figs. C.1, C.2, and C.3) optimized by both χ2 and Likelihood maximization; their derived parameters are contained in Tables C.1, C.2, and C.3, respectively. These results are summarized together in Fig. C.4, along with quiescent fractions as a function of mass (defined three ways). Shown in the same figure are the SMFs realized by evaluation of the continuity model (based on Leja et al. 2019a), with related parameters contained in Table C.4. For those interested in parameter covariances, we have made the full Markov chains available to download15.

thumbnail Fig. C.1.

Results of fitting a double (z < 3) and single Schechter (z ≥ 3) functional forms to the observed galaxy stellar mass function. Measurements are binned in redshift and inferred from both χ2 minimization (red) and likelihood methods (evolving colors) taken at the median of each parameter posterior distribution. In both cases, Schechter functions are convolved with parameterized, redshift-dependent kernels in δM (upper right subpanels) to account for Eddington bias (dashed lines) which is then removed in the corrected fit (solid lines). For completeness, the corrected maximum likelihood solutions are also indicated (dash-dot colored curves). In the lower panels, fitting residuals between the uncorrected kernel convolved fits and the data are shown by both the relative fractional difference (dashed curve) and the difference weighted by the uncertainty (square points). Corresponding parameters are shown in Table C.1.

thumbnail Fig. C.2.

Results of fitting a double (z ≤ 3.0) and single Schechter (z > 3.0) functional forms to the star-forming sample. The figure follows the same layout as Fig. C.1. Corresponding parameters are shown in Table C.2.

thumbnail Fig. C.3.

Results of fitting a double (z ≤ 1.5) and single Schechter (z > 1.5) functional forms to the quiescent sample. The figure follows the same layout as Fig. C.1. Corresponding parameters are shown in Table C.3.

thumbnail Fig. C.4.

Best fit Schechter models corresponding to maximum likelihood for the total, star-forming, and quiescent samples (solid colored curves) based on the observed data (colored points). Fits for the median posterior will be similar for symmetric parameter posterior distributions, which are found in all but the last bin shown here. Kernel convolved fits are shown by the dashed curves. Fits measured using the continuity model are shown in gray solid curves. Lower panels indicate the fraction of quiescent galaxies ℱQG ≡ ΦQG/(ΦSF + ΦQG) as a function of log10(ℳ/ℳ) for the data (colored points) and maximum likelihood fits (solid curve). Since the star-forming and quiescent fits are not required to sum to the total fit, changing the definition to ℱQG ≡ ΦQGTotal as reported by Ilbert et al. (2013) and Davidzon et al. (2017) produces the dotted curves with noticeably lower ℱQG at high ℳ.

Table C.1.

Double (z ≤ 3) and single (z > 3) Schechter parameters derived for the total mass complete sample.

Table C.2.

Double (z ≤ 3) and single (z > 3) Schechter parameters derived for the star-forming mass complete subsample.

Table C.3.

Double (z ≤ 1.5) and single (z > 1.5) Schechter parameters derived for the quiescent mass complete subsample.

Table C.4.

Double Schechter function parameters (z ≤ 3) derived for the total sample from the continuity model fitting.

All Tables

Table C.1.

Double (z ≤ 3) and single (z > 3) Schechter parameters derived for the total mass complete sample.

Table C.2.

Double (z ≤ 3) and single (z > 3) Schechter parameters derived for the star-forming mass complete subsample.

Table C.3.

Double (z ≤ 1.5) and single (z > 1.5) Schechter parameters derived for the quiescent mass complete subsample.

Table C.4.

Double Schechter function parameters (z ≤ 3) derived for the total sample from the continuity model fitting.

All Figures

thumbnail Fig. 1.

Galaxies are classified as either star-forming or quiescent in bins of redshift by selection in rest frame NUV − r and r − J colors, following Ilbert et al. (2013). The rest-frame J band is not directly measured at z ≳ 3.5 and so the sloped part of the selection box is dashed to indicate that the r − J colors are strongly model dependent. Typical uncertainties for the rest-frame colors of the star-forming and quiescent subsamples are estimated as medians of the nearest observed-frame bands, consistent with the derivation of the rest-frame colors. See the text for details.

In the text
thumbnail Fig. 2.

Stellar mass completeness limits. The three panels show, from left to right, the density of the total, star-forming, and quiescent samples with their respective mass completeness limits. The limits are derived in discrete z-bins following Pozzetti et al. (2010) with magnitude limits adopted from IRAC channel 1 (filled circles), then interpolated with Eqs. (3)–(5), respectively, resulting in the solid lines. For quiescent galaxies, consistent estimates using Ks are also shown (empty circles). To verify these estimates we compute conservative mass limits from simulated spectra following a delta-like burst at z = 15 which are normalized separately to the Ks and channel 1 magnitude limits; they are shown by the dotted and solid red lines, respectively. See the text for details.

In the text
thumbnail Fig. 3.

Adopted estimates for the total uncertainty σΦ (solid) as a function of stellar mass at several redshifts (by color), for mass-complete samples only. Contributions include uncertainties from Poisson noise σN (dashed), Cosmic Variance σCV (dotted), and SED fitting σSED (dash-dot). 10% and 100% levels are indicated for reference.

In the text
thumbnail Fig. 4.

Likelihood distributions of galaxy stellar mass are derived from SED fitting with LePhare and assume a fixed redshift. Individual distributions (gray) are summarized by a median stack (blue) grouped by redshift and stellar mass (indicated in ranges of log10(ℳ/ℳ)). Estimates of standard deviation σ are shown. The size of a typical mass bin used in this work is 0.25 dex, indicated by the pair of dotted gray lines in each panel.

In the text
thumbnail Fig. 5.

Evolution of the galaxy stellar mass function over 12 redshift bins (0.2 < z ≤ 7.5) for the total, star-forming, and quiescent samples. Mass incomplete bins based on the channel 1 limiting magnitude are not shown. For the quiescent sample, bins that are fully incomplete based on the Ks limiting magnitude are indicated by cross hatching.

In the text
thumbnail Fig. 6.

Evolution of the galaxy stellar mass function across 12 redshift bins (0.2 < z ≤ 7.5). The 0.2 < z ≤ 0.5 SMF from the first redshift bin is repeated in each panel for reference shown by the purple dotted curve. Two other COSMOS stellar mass functions from Ilbert et al. (2013) and Davidzon et al. (2017) are shown for comparison, along with Grazian et al. (2015) from UDS/GOODS-S/HUDF and the recent work of Stefanon et al. (2021) from GREATS at z > 6. Mass incomplete measurements are not shown. Upper limits for empty bins are shown by the horizontal gray line with an arrow.

In the text
thumbnail Fig. 7.

Evolution of the star-forming (blue) and quiescent (orange) galaxy components of the galaxy stellar mass function in nine bins of redshift (0.2 < z < 4.5). Uncertainty envelopes correspond to 1 and 2σ limits. For comparison, we show other literature studies in COSMOS from Ilbert et al. (2013) and Davidzon et al. (2017) that adopt similar selections and methodologies. For reference, the total SMF is shown in each bin (solid gray) and the 0.2 < z < 0.5 total SMF is repeated in each panel (purple dotted). Mass incomplete measurements as defined by the channel 1 limiting magnitude are not shown, with the mass limits corresponding to the Ks limiting magnitude shown by orange arrows. Upper limits for empty bins are shown by the horizontal gray line and arrow.

In the text
thumbnail Fig. 8.

Evolution of the volume number density Φ of the total, star-forming, and quiescent samples at fixed mass, showing only mass complete bins. 1σ uncertainties are indicated by the colored envelopes. Gray shaded regions indicate upper limits corresponding to Φ(N = 0) at each redshift bin. The lower panel shows the evolution of the fractional change in the volume number density at fixed mass, relative to z0 ≡ 0.2 < z ≤ 0.5 of the total sample.

In the text
thumbnail Fig. 9.

Evolution of the fraction of quiescent galaxies as a function of redshift in four bins spanning 109.5 < ℳ ≤ 1011.5 ℳ. Uncertainty envelopes correspond to 1 and 2σ limits. Points which are mass incomplete are not shown.

In the text
thumbnail Fig. 10.

Summary of inferred galaxy stellar mass function evolution. Best-fit parameters are estimated by a Markov chain Monte Carlo fitting with a Schechter function at fixed redshift convolved with a redshift dependent kernel from which the inferred intrinsic mass function is recovered with the unconvolved Schechter fit (colored solid curves) for the total (leftmost), star-forming (center left), and quiescent (center right) samples. Estimates are shown corresponding to both the median posterior parameter values including 1σ envelopes (upper row) and the single set of maximum likelihood parameters (bottom row). The rightmost panel shows the results of fitting the Double Schechter continuity model of Leja et al. (2019a) to the total SMF between 0.2 < z ≤ 3.0 where the fits are reliable; see later in Sect. 5.3 for details.

In the text
thumbnail Fig. 11.

Schechter parameter evolution with redshift. Left: evolution of best-fit Schechter parameters computed from fits to the total SMF from χ2 regression (red diamonds), as well as likelihood methods using fixed redshifts bins (blue; median posteriors as filled points with 68% error bars, maximum likelihood parameter set as unfilled squares), and our continuity model fit (maroon curve; median posterior). Error bars are not shown when parameters are fixed. Center: same points as on the left panels compared to literature values from Davidzon et al. (2017, gray points with hatching) and Leja et al. (2019a, gray dashed curves). Right: evolution of the best-fit Schechter parameters computed from fits to the star-forming (blue) and quiescent (orange) SMFs from the χ2 regression (unfilled colored diamonds) and likelihood methods using fixed redshifts bins (median posterior as filled points with ±34% envelope, maximum likelihood as unfilled squares).

In the text
thumbnail Fig. 12.

0.2 < z ≤ 7.5 cosmic stellar mass density. Upper: evolution of the cosmic stellar mass density of the total sample computed from the best-fit likelihood models (blue) integrated above 108 ℳ. Literature results from observational studies of mass-selected samples (Grazian et al. 2015; Tomczak et al. 2014; Caputi et al. 2015, 2011; Ilbert et al. 2013; Muzzin et al. 2013; Santini et al. 2012; Adams et al. 2021; McLeod et al. 2021; Wright et al. 2018; Thorne et al. 2021) and mass inferred from rest UV measurements (González et al. 2011). By integrating their SFRD functions, we can plot ρ* from Behroozi et al. (2013) and Madau & Dickinson (2014). In both cases we assume a return fraction of 41% (based on Chabrier’s IMF, see Sect. 6.1 of Ilbert et al. 2010). For Madau & Dickinson (2014), we include a shaded area based on return fractions between 25 and 50% (the latter value is similar to the one given by Salpeter’s IMF). Lower: evolution of the cosmic stellar mass density of the total (gray, repeated from above), star-forming (light blue), and quiescent (orange) samples compared to literature measurements (Behroozi et al. 2019; Santini et al. 2021; McLeod et al. 2021).

In the text
thumbnail Fig. 13.

Comparison of observed and inferred galaxy stellar mass function (gray points, and colored curve with 1 and 2σ envelopes, respectively) to the reference flavors of four simulations: TNG100 of the ILLUSTRISTNG project (Pillepich et al. 2018), EAGLE (Furlong et al. 2015), SHARK (Default; Lagos et al. 2018), SANTA CRUZ (Yung et al. 2019a,b), and SIMBA (Flagship; Davé et al. 2019). To note, we do not apply any artificial normalization to any model or observation. Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

In the text
thumbnail Fig. 14.

Ks magnitude and H − Ks colors of 125 new massive ℳ > 1011 ℳ galaxy candidates 3.0 < z ≤ 5.5 found in COSMOS2020 (orange), relative to the total sample used in this work (gray) and the 444 galaxies found in COSMOS2015 (blue). Measurements are taken from COSMOS2020, although they are similar to those from COSMOS2015, where matched. Number densities are summarised using gaussian kernel density estimators to produce smoothed distributions and contours for each sample. Median Ks and H − Ks values for each sample are shown by colored dotted lines.

In the text
thumbnail Fig. 15.

Comparison of observed number densities and upper limits on the rarity probed by various observed (gray steps) and simulated (colored lines) volumes. Number densities correspond to 1011.0 < ℳ < 1011.5 ℳ and 1011.5ℳ < 1012.0 ℳ from the total sample (light and dark gray, respectively), and 1011.5ℳ < 1012.0 ℳ from the quiescent sample (orange) from Fig. 8. 1σ upper limits are computed following Gehrels (1986), which for the observed volumes are dependent on widths of redshift bins.

In the text
thumbnail Fig. 16.

Quiescent mass fractions. Upper: fraction of quiescent galaxies as a function of mass for three redshift ranges compared with predictions from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018) as well as measurements from the empirically calibrated UNIVERSE MACHINE (Behroozi et al. 2019). Lower: fractional difference between this work and literature predictions/measurements. See Appendix B for more details.

In the text
thumbnail Fig. 17.

Comparison of the inferred z > 1.5 galaxy stellar mass function (colored curve with 1 and 2σ envelopes) fitted to observed measurements (gray points). We scale the Tinker et al. (2008) halo mass function (HMF; lower bound of the gray shaded region) by 0.018 corresponding to the stellar-to-halo mass relation at z = 0 and M h = M h * $ \mathcal{M}_{\rm h} = \mathcal{M}_{\rm h}^* $ (Behroozi et al. 2013) to produce an idealized SMF (gray curve). Upper limits on the SMF at each redshift are derived from the HMF assuming a fixed baryon fraction (0.166). Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

In the text
thumbnail Fig. A.1.

Fraction of sources removed by a single criterion that are also removed by another, after applying the primary criteria on area and photo-z. E.g., 71% of χ SED 2 >10 $ \chi_{\rm SED}^2>10 $ sources have mch1 > 26.

In the text
thumbnail Fig. B.1.

Galaxy stellar mass functions for total (gray), star-forming (blue), and quiescent (orange) samples from this work compared to simulations. We include those from UNIVERSE MACHINE Behroozi et al. (2019, sSFR < 10−11 ℳ yr−1) shown by dotted lines with hatched uncertainty envelopes, as well as consistently NUVrJ-selected samples from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018) shown by the dashed and dot-dashed lines, respectively. Upper limits for empty bins are shown by the horizontal gray line with an arrow. Mass incomplete measurements are not shown.

In the text
thumbnail Fig. B.2.

Fraction of star-forming (blue) and quiescent (orange) galaxy samples as a function of mass at for three redshift ranges compared to simulations. We include measurements from UNIVERSE MACHINE Behroozi et al. (2019, sSFR < 10−11 ℳ yr−1) shown by dotted lines, as well as those from consistently NUVrJ-selected samples from EAGLE (Furlong et al. 2015) and SHARK (Lagos et al. 2018).

In the text
thumbnail Fig. C.1.

Results of fitting a double (z < 3) and single Schechter (z ≥ 3) functional forms to the observed galaxy stellar mass function. Measurements are binned in redshift and inferred from both χ2 minimization (red) and likelihood methods (evolving colors) taken at the median of each parameter posterior distribution. In both cases, Schechter functions are convolved with parameterized, redshift-dependent kernels in δM (upper right subpanels) to account for Eddington bias (dashed lines) which is then removed in the corrected fit (solid lines). For completeness, the corrected maximum likelihood solutions are also indicated (dash-dot colored curves). In the lower panels, fitting residuals between the uncorrected kernel convolved fits and the data are shown by both the relative fractional difference (dashed curve) and the difference weighted by the uncertainty (square points). Corresponding parameters are shown in Table C.1.

In the text
thumbnail Fig. C.2.

Results of fitting a double (z ≤ 3.0) and single Schechter (z > 3.0) functional forms to the star-forming sample. The figure follows the same layout as Fig. C.1. Corresponding parameters are shown in Table C.2.

In the text
thumbnail Fig. C.3.

Results of fitting a double (z ≤ 1.5) and single Schechter (z > 1.5) functional forms to the quiescent sample. The figure follows the same layout as Fig. C.1. Corresponding parameters are shown in Table C.3.

In the text
thumbnail Fig. C.4.

Best fit Schechter models corresponding to maximum likelihood for the total, star-forming, and quiescent samples (solid colored curves) based on the observed data (colored points). Fits for the median posterior will be similar for symmetric parameter posterior distributions, which are found in all but the last bin shown here. Kernel convolved fits are shown by the dashed curves. Fits measured using the continuity model are shown in gray solid curves. Lower panels indicate the fraction of quiescent galaxies ℱQG ≡ ΦQG/(ΦSF + ΦQG) as a function of log10(ℳ/ℳ) for the data (colored points) and maximum likelihood fits (solid curve). Since the star-forming and quiescent fits are not required to sum to the total fit, changing the definition to ℱQG ≡ ΦQGTotal as reported by Ilbert et al. (2013) and Davidzon et al. (2017) produces the dotted curves with noticeably lower ℱQG at high ℳ.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.