| Issue |
A&A
Volume 709, May 2026
|
|
|---|---|---|
| Article Number | A70 | |
| Number of page(s) | 15 | |
| Section | Cosmology (including clusters of galaxies) | |
| DOI | https://doi.org/10.1051/0004-6361/202558373 | |
| Published online | 05 May 2026 | |
Forecasting local primordial non-Gaussianities from UNIONS Lyman-break galaxies and Planck CMB lensing
1
Université Paris-Saclay, CEA, IRFU, 91191 Gif-sur-Yvette, France
2
IFAE, The Barcelona Institute of Science and Technology, Campus UAB, 08193 Bellaterra, Barcelona, Spain
3
Ruhr University Bochum, Faculty of Physics and Astronomy, Astronomical Institute, GCCL, 44780 Bochum, Germany
4
Perimeter Institute for Theoretical Physics, 31 Caroline St. North, Waterloo, ON N2L 2Y5, Canada
5
Department of Physics and Astronomy, University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
6
Institute for Astronomy, University of Hawaii, 2680 Woodlawn Drive, Honolulu, HI 96822, USA
7
National Research Council Herzberg Astronomy and Astrophysics, 5071 West Saanich Road, Victoria, BC V8Z 6M7, Canada
8
Department of Computer Science, University of British Columbia, 2366 Main Mall, Vancouver, BC V6T 1Z4, Canada
★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
3
December
2025
Accepted:
26
March
2026
Abstract
Context. Local primordial non-Gaussianities, characterized by the parameter fNLloc, provide a powerful window into the physics of inflation. Cross-correlating high-redshift tracer samples with the Cosmic Microwave Background (CMB) lensing potential offers a particularly robust probe of fNLloc, mitigating imaging systematics that typically affect large-scale measurements from tracer auto-spectra. In this context, the Ultraviolet Near Infrared Optical Northern Survey (UNIONS) enables the selection of u-dropout high-redshift Lyman-break galaxies (LBGs).
Aims. We aim to forecast the expected precision on fNLloc that is achievable from analyzing the cross-correlation power spectrum between the distribution of UNIONS-selected LBGs and the CMB lensing potential measured by the Planck satellite.
Methods. We performed a Markov chain Monte Carlo forecast to estimate the uncertainties on fNLloc and on a galaxy bias parameter b0, which captured our uncertainty in the tracer bias.
Results. We forecast σ(fNLloc) = 34 for an idealized photometric sample of r < 24.3 LBGs selected with a Random Forest classification algorithm from UNIONS-like ugriz imaging, with a resulting surface density of 1100 deg−2 over 3730 deg2. This precision can be improved to σ(fNLloc) = 20 after spectroscopic follow-up with the Dark Energy Spectroscopic Instrument (DESI), during its next phase starting in 2029, DESI-II. We also tested a more realistic selection using early UNIONS data, based on a u-dropout color cut over the ugr imaging, which yields a denser sample of r < 24.2 objects at 1400 deg−2 over 4760 deg2. From this sample–covering a larger footprint and expected to have a higher large-scale galaxy bias–we forecast an improved constraint of σ(fNLloc) = 20, with a similar precision that is achievable after DESI-II follow-up. In addition, we performed a preliminary validation of the redshift distribution using the clustering-redshift method with DESI DR1 data, confirming the calibration from deep, small-area photometric fields. However, accounting for uncertainties in the clustering-redshift distribution significantly degrades the fNLloc) constraining power.
Key words: methods: statistical / galaxies: high-redshift / early Universe / large-scale structure of Universe
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
1. Introduction
Lyman-break galaxies (LBGs; Steidel et al. 1996) are young, actively star-forming galaxies at z > 1.5. Their rest-frame spectra exhibit a sharp flux decrement blueward of the Lyman-α transition at 1216 Å, extending down to the Lyman limit at 912 Å, due to absorption by neutral hydrogen in both the intergalactic medium (IGM) and within the galaxies themselves. These spectral features enable the identification of LBGs at 2.5 < z < 3.5 through the u-dropout technique, which selects objects with a pronounced flux deficit in the u band (3300−4000 Å) relative to the flux measured in the g or r band (Ruhlmann-Kleider et al. 2024; Payerne et al. 2025). At higher redshifts, analogous dropout techniques can be used (e.g., g or r dropouts; Malkan et al. 2017; Ono et al. 2018; Harikane et al. 2022).
Lyman-break galaxies have long been central to studies of galaxy formation and evolution at high redshift (Steidel et al. 1996, 1999; Giavalisco et al. 2004; Reddy et al. 2008; Hildebrandt et al. 2009; Harikane et al. 2023). More recently, dropout-selected galaxies have also emerged as powerful cosmological probes (see review by Wilson & White 2019). They serve as (i) highly biased tracers of large-scale structure in the high-redshift, matter-dominated Universe (Foucaud et al. 2003; Miyatake et al. 2022; Ruhlmann-Kleider et al. 2024; Ye et al. 2025), and (ii) distant background light sources for probing the intergalactic medium through Lyman-α forest absorption in their spectra (Herrera-Alcantar et al. 2025).
Dense samples of LBGs at z > 2.5 are particularly valuable for cosmology, as their clustering measurements allow to constraint on the growth of structures at high-redshift (Wilson & White 2019; Miyatake et al. 2022) and to study the evolution of dark energy via their Baryon Acoustic Oscillations (BAO) features (Sailer et al. 2021) at higher redshift than current spectroscopic surveys. LBGs also enable stringent tests of inflation models via the scale-dependent bias effect arising from local-type primordial non-Gaussianity (PNG, Schmittfull & Seljak 2018; Chaussidon et al. 2025), as well as investigations of the sum of neutrino masses (Yu et al. 2023).
Such samples are expected to be provided by current and forthcoming wide-field, multi band imaging surveys with u-band coverage (Payerne et al. 2025; Crenshaw et al. 2025), including the ongoing Ultraviolet Near Infrared Optical Northern Survey (UNIONS; Chambers et al. 2016; Ibata et al. 2017; Miyazaki et al. 2018; Gwyn et al. 2025), the Vera C. Rubin Observatory’s Legacy Survey of Space and Time (LSST; LSST Science Collaboration 2009), and the Chinese Space Station Telescope (CSST, CSST Collaboration 2026). Next-generation spectroscopic programs such as DESI-II, the next phase of the Dark Energy Spectroscopic Instrument (Schlegel et al. 2022), the Wide-field Spectroscopic Telescope (WST, Mainieri et al. 2024) and the Spectroscopic Stage-5 Experiment (Spec-S5, Besuner et al. 2025) will leverage these dense LBG samples to test the robustness of our standard cosmological model and explore its numerous extensions.
Inflation remains the leading paradigm for the early Universe, and local PNG–quantified by the parameter
–offers a key test of inflationary models. The state-of-the-art single-field inflation model predicts a very small amplitude for local PNG in the primordial gravitational field (
. While the CMB has already provided strong constraints in favor of single-field inflation (with
, Planck Collaboration IX 2020), classes of multi-field inflation models predict a potentially observable level of PNG, typically
. This makes large-scale structure surveys the most promising avenue for future progress, as further improvements from CMB observations are fundamentally limited by cosmic variance.
Local Primordial non-Gaussianities imprint a distinctive scale dependence on the large-scale linear bias of cosmological tracers (Dalal et al. 2008; Slosar et al. 2008), such as galaxies and quasars. This feature has been widely used to constrain
, either through the three-dimensional power spectrum of tracers (Rezaie et al. 2024; Cagliari et al. 2024; Chaussidon et al. 2025) or via angular power spectra and cross correlations with CMB lensing (Krolewski et al. 2024; Fabbian et al. 2026; Chiarenza et al. 2025). The tightest current limit,
, was obtained by Chaussidon et al. (2025) from DESI quasar (0.8 < z < 3.1) and luminous red galaxy (0.6 < z < 1.1) large-scale power spectrum measurements. In this context, high-redshift LBGs are expected to deliver independent and competitive constraints on
thanks to their higher number densities compared to DESI quasars (Payerne et al. 2025; Crenshaw et al. 2025) and their redshift distribution spanning 2.5 < z < 3.5 (d’Assignies et al. 2023).
The paper is organized as follows: In Section 2, we introduce the formalism for the LBG angular power spectrum, the cross-angular power spectrum between the LBG population and the CMB lensing potential, and their link to local PNGs. In Section 3, we present the different datasets that we used throughout this study to conduct PNG forecasts. In Section 4, we detail the different modeling choices we made through this work to characterize the LBG samples. In Section 5, we describe the forecasting methodology employed throughout this work, based on posterior estimation via Markov chain Monte Carlo (MCMC) using fiducial data vectors. In Section 6, we present the different forecasts on the parameter
, exploring modeling choices and propagation of photometric redshift distribution uncertainties, using either an idealized LBG sample obtained by a Random Forest approach on UNIONS-like data, or an LBG sample obtained from early UNIONS data. We conclude in Section 7.
2. Formalism for the angular power spectrum
Under the Limber approximation (Limber 1953), the correlation function of two fields X, Y is dominated by small angular scales only (i.e., high multipoles) and the kernel varies slowly along the line of sight. The angular power spectrum simplifies to
(1)
where qx are the kernels associated to X and Y, χ is the comoving distance, and P is the matter power spectrum. From this, we discuss the auto-correlation angular power spectrum of the LBG population, and its cross-correlation with CMB lensing maps.
2.1. Galaxy density field and clustering
The kernel corresponding to the intrinsic galaxy clustering contribution is
(2)
where b(z) denotes the large-scale linear galaxy bias, and n(z) the (normalized) galaxy redshift distribution. In our analysis, we restrict the fit of
to ℓ < 300, which impacts the range of comoving scales that can be probed by the LBG sample. For sources at z ≃ 1.5 − 3 (corresponding to the typical redshift values used in this work), it is set to k ∼ ℓ/χ(z)≲0.05 − 0.075 h Mpc−1 (see Appendix B). At these scales and associated redshifts, the matter density field lies well within the linear regime: nonlinear clustering only becomes relevant at k ≳ 0.12 − 1.0 h Mpc−1 at these redshifts (Takahashi et al. 2012). Furthermore, galaxy bias is expected to remain scale-independent down to k ∼ 0.1 − 0.2 h Mpc−1 (Desjacques et al. 2018). Finally, as the minimum angular momentum in this analysis is set to be ℓ = 5, such as the LBG angular clustering in this work is typically probing k ∈ [𝒪(10−3)−0.075] h Mpc−1. The redshift distributions used in this work (see Figure 1) exhibit some outliers at z < 0.5, which probe smaller scales where the linear regime breaks down. We still employ the linear matter power spectrum and a linear bias model, as these low-redshift outliers constitute only a small fraction of the sample. A detailed exploration of nonlinear bias parameterizations is left for future work.
![]() |
Fig. 1. Left: Photometric redshift distributions of the photometric LBG samples UNIONSlike_RF in red (the convolution of the UNIONSlike_RF distribution with the DESI spectroscopic efficiency is shown in dashed and dotted lines, they have not been normalized to show the impact at z ∼ 2.5). The distribution of the samples UNIONSlike_RF_u180s, and LSSTY4like_RF_u2x180s are shown in blue and dashed cyan lines. Right: Photometric redshift distributions of the UNIONS LBG samples UNIONS_colorcut and UNIONS_PZcut in the XMM field. The sample selected with the hybrid method à-la SOM is shown for illustration. We also show the quantity nb (the product between the large-scale bias and the normalized redshift distribution) resulting from our clustering-redshift calibration methods. |
Moreover, lensing magnification alters the observed galaxy number density by deflecting light from intervening structures. Depending on the survey flux limit and the slope of the luminosity function, this effect can lead to either an apparent enhancement or suppression of number counts. The associated kernel is given by
(3)
The above equation considers the leading-order magnification term1, where s(mlim, z) = dlog10N(< mlim, z)/dm (see Eq. (38) in Challinor & Lewis 2011) denotes the magnification bias2 (Euclid Collaboration: Lepori et al. 2022; Elvin-Poole et al. 2023). We emphasize that it is a zeroth-order approximation of the magnification bias since color cuts may distort the effective selection at the limit magnitude mlim. For the latter, we will consider the r band to compute the limiting magnitude of the sample. Although s(mlim, z) shows some redshift dependence for the LBG samples used in this work (see Appendix C), we adopt a single value for the magnification bias, s(mlim), defined as the average over redshift. This choice is motivated by the large uncertainties associated with the inferred redshift-dependent estimates for some samples. Moreover, redshift-space distortions (RSD) arise from the peculiar velocities of galaxies along the line of sight, which modify their observed redshifts and induce anisotropies in the observed clustering. The RSD kernel is given by
(4)
where D(a) is the linear growth factor and jℓ(x) is the ℓ-spherical Bessel function. From this, the observed angular power spectrum of the galaxy density field is given by
(5)
where
, and
is the surface density of the galaxy sample in steradians.
2.2. CMB Lensing and Cross-Correlation with LSS
The temperature anisotropies and polarization patterns of the CMB are gravitationally lensed by the intervening large-scale structure between the surface of last scattering and the observer. This lensing potential remaps the primary CMB anisotropies and induces B-mode polarization, as well as characteristic non-Gaussian features in the observed CMB maps (Lewis & Challinor 2006). The associated convergence field
can then be reconstructed from high-resolution CMB temperature and polarization maps using quadratic estimators (Okamoto & Hu 2003). The lensing convergence has a projection kernel given by
(6)
where χ* is the comoving distance to the surface of last scattering, a(χ) is the scale factor, H0 is the Hubble constant, and Ωm is the matter density parameter today. This kernel peaks at redshift z ∼ 2, which makes CMB lensing especially sensitive to the high-redshift Universe.
The reconstructed CMB lensing convergence power spectrum includes both signal and noise contributions, and is given by Cℓκκ, obs = Cℓκκ + Nℓκ where Cℓκκ is the theoretical lensing auto-spectrum, and Nℓκ is the lensing reconstruction noise, which depends on the specifications of the CMB experiment (e.g., angular resolution, instrumental noise, etc.). In this work, we adopt the noise spectrum of the CMB lensing maps derived from the Planck PR4 temperature and polarization data (Carron et al. 2022).
The CMB lensing signal can also be cross-correlated with the distribution of galaxies or other tracers of large-scale structure. Such cross-correlations provide a powerful probe of the large-scale matter distribution and the galaxy bias, and are sensitive to primordial non-Gaussianity,
, via scale-dependent effects. The observed cross-angular power spectrum between the CMB lensing field and a galaxy field is given by the sum of Cℓκqi, where the sum runs over the relevant galaxy kernel contributions qi (i.e., intrinsic clustering, magnification, and redshift-space distortions).
2.3. PNG: impact of
on galaxy field
Multifield inflation models can generate a small level of local PNG, parametrized by
, which modifies the statistics of the initial gravitational potential. By modifying the height of rare density peaks, the parameter
alters the response of halo abundance to long-wavelength background modes, thereby affecting the large-scale halo bias (Dalal et al. 2008; Slosar et al. 2008; Desjacques & Seljak 2010). The tracer bias acquires an additional scale-dependent correction b → b + Δb in Eq. (2), with
(7)
where T(k) is the matter power spectrum transfer function and D(z) is the linear growth factor. The bias bΦ can be related to the linear tracer bias through the relation
(8)
where pΦ = 1 is adopted for the universal mass function. While this relation is by far the most widely adopted in the literature, there is no compelling reason to expect it to hold for realistic tracers of large-scale structure. For instance, pΦ = 1.6 yields a better description of bΦ for objects whose host halos had recently undergone a major merger, such as quasar’s host halos (Slosar et al. 2008). In addition, pΦ ∈ [0.4, 0.7] was found to describe accurately stellar mass-selected galaxies (Barreira et al. 2020). It is therefore generally accepted that pΦ ≠ 1 must be taken into account depending on the tracer under consideration. Even if the lensing-related quantities are in principle subject to corrections from primordial non-Gaussianity (Jeong et al. 2011; Anbajagane et al. 2024), we neglect the impact of
on the CMB lensing potential since the effects of
are strongly suppressed by projection effects. Thus, in this work, we rely on the large-scale dependent bias effect described in Eq. (7) to constrain
.
3. Datasets
In this work, we perform a forecast study for an LBG sample that can be generated from the UNIONS data. The UNIONS is build upon a collaboration between the Hawaiian observatories: the Canada-France-Hawaii Telescope (CFHT, Mauna Kea), the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS, Maui), and the Subaru Telescope (Mauna Kea). It is currently providing ugriz imaging over 5000 deg2 of the northern sky. The CFHT Canada-France Imaging Survey (CFHT/CFIS) targets the u and r bands with the Megacam imager, delivering image quality competitive with all other current large ground-based facilities. CFHT/CFIS will reach a depth of r ≃ 25 over 5000 deg2 and u ≃ 24.6 over 9000 deg2. Meanwhile, Pan-STARRS provides the i-band, and the Wide Imaging with Subaru Hyper Suprime-Cam of the Euclid Sky (WISHES) supplies the z-band. The 5σ depth of the data, measured within a 2-arcsecond diameter aperture (Gwyn et al. 2025), is [u, g, r, i, z]=[24.45, 25.25, 24.95, 24.55, 24.05]. Multiband catalogs over the UNIONS footprint have been obtained using the GAaP (Gaussian Aperture and PSF; Kuijken 2008; Kuijken et al. 2015, 2019) method, which will be made publicly available upon release of the UNIONS data set. The covered surface areas are, for {ugr, ugri, ugriz}={4760, 4630, 3730} deg2 (Gwyn et al. 2025). Besides the main UNIONS footprint, multiband ugr catalogs are available on the two deep fields XMM-LSS and COSMOS. We will consider using these deep fields in the next section.
In the following sections, we present the various LBG selections applied to UNIONS-like and UNIONS imaging. For each case, we derive the corresponding sample characteristics, namely the redshift distribution and the lensing magnification bias.
3.1. UNIONS-like LBG sample from a Random Forest approach by degrading deep catalogs
In this section, we present the selection of high-redshift LBGs on UNIONS-like imaging, obtained on the COSMOS field.
3.1.1. Photometric LBG sample:
We first introduce the LBG sample that was derived in Payerne et al. (2025), labeled as UNIONSlike_RF, obtained by applying a Random Forest selection to UNIONS-like photometry, i.e., by degrading CLAUDS3u and HSC4-PDR3 griz broadband imaging on the COSMOS field. For this dataset, we get deep 5σ point source depth [u, g, r, i, z]=[27.7, 27.4, 27.1, 26.9, 26.3]. The combination of CLAUDS and HSC-SSP data over COSMOS (and other small deep fields such as XMM-LSS) is detailed in Desprez et al. (2023). In Payerne et al. (2025), the COSMOS CLAUDS+HSC catalog has been degraded to UNIONS depth (called UNIONS-like, with respective depth, at the time of this work ugriz = [24.6, 25.5, 25.5, 24.2, 24.4], see Appendix E.2 for more details on this method) to train a Random Forest for selecting r < 24.3 UNIONS-like LBGs in the redshift range z ∈ [2.5, 3.5]. The LePhare (6 bands) photometric redshift distribution of this sample is shown in red full lines in the left panel of Figure 1, corresponding to a surface density of ∼1100 photometric LBGs per deg2. The magnification bias factor for this sample is found to be s(rlim = 24.3) = 0.43. As mentioned in Payerne et al. (2025), the artificial degradation procedure to mimic shallower depths is valid for point-source magnitude rescaling (which may be different for some galaxy populations), simplified noise models (decreasing the photon count only) thus neglects several sources of systematic uncertainty inherent to each photometric pipelines; it should therefore be regarded as an optimistic approximation of shallower imaging conditions. In addition, the Random Forest model is trained on relatively small datasets (the COSMOS and XMM-LSS fields are of a few square degrees), which may limit its ability to capture rare populations and increase sensitivity to statistical fluctuations due to large-scale survey systematics (depth coverage, etc.) and sample variance.
3.1.2. DESI-II LBG sample:
This work is also the occasion to explore DESI-II scenarios, whose high-redshift target selection strategy is still under discussion. Spectroscopic redshift follow-ups will allow us to (1) remove low redshift contaminants and (2) reduce the uncertainty on the sample redshift distribution. Considering all z > 2 objects in the distribution in Figure 1 (left panel), the number density goes from 1100 to 880 LBGs per square degree. Moreover, the current DESI spectroscopic redshift reconstruction method5 has an internal efficiency, which degrades the efficiency of the recovered LBG spectroscopic sample at z ∼ 2.2. Considering the redshift efficiency in Figure 11 of Payerne et al. (2025) or in Fig. 18 of Ruhlmann-Kleider et al. (2024) (resp. for 2-hour and 4-hour exposure) multiplying the corresponding redshift distribution, we get the corrected number densities 500 and 590 LBGs per square degrees. The corresponding 2-hour and 4-hour exposure high-redshift distributions are shown in the left panel of Figure 1.
3.1.3. Alternative LBG selection:
We explore two alternative UNIONS-like LBG selections on the COSMOS field; Individual CLAUDSu-band images of 180-second exposures on COSMOS have been coadded and have undergone forced photometry using HSC g-band. Two u-band catalogs were obtained, corresponding to 180-second and 2 × 180-second exposures (see Appendix E.1). For reference, the default deep CLAUDSu-band catalog uses 600-second exposures. Two other selections, namely UNIONSlike_RF_u180s and LSSTY4like_RF_u2x180s where obtained by training a RF algorithm to select a sample of 1100 deg−2 galaxies with r < 24.5 in the range z ∈ [2.5, 3.5], where, for each galaxy sample, s(rlim = 24.5) = 0.21 and 0.46, respectively. The corresponding photometric redshift distributions are represented in the left panel of Fig. 1. These selections, as for the one in Payerne et al. (2025), were tested on the COSMOS field and show different selection efficiencies.
3.2. LBG sample using UNIONS GAaP catalogs
In this section, we make use of the early GAaP UNIONS data to test different high-redshift LBG selections.
3.2.1. LBG color-color selection
We use the early UNIONS GAaP photometric data available in the UNIONS collaboration, to test a color–color box selection of high-redshift r < 24.2 LBGs referred to as [COSMOS: TMG u dropout] in Ruhlmann-Kleider et al. (2024) (see their Table 1). For 22 < r < 24.2, the u-dropout color selection is defined as
(9)
(10)
(11)
(12)
This sample is referred to as UNIONS_colorcut and yields a photometric LBG angular density of 1400 deg−2 and a magnification bias s(rlim = 24.2) = 0.25. To assess the redshift distribution of these photometrically selected u-dropout LBGs, we match geometrically the selected UNIONS-LBGs to deep photometric redshift catalogs provided by CLAUDS+HSC imaging on XMM (XMM benefits from UNIONS multiband imaging). The corresponding photometric redshift distribution is displayed in the right panel of Figure 1. In our previous Payerne et al. (2025) idealistic dataset, the resulting redshift distribution n(z) is relatively narrow, peaking sharply at z ∼ 3. In comparison to the color-box selection, which peaks at z ∼ 1.5 with a broader shape, the difference reflects (i) the slight optimistic ugrz magnitude depth used in Payerne et al. (2025) for the degradation procedure (ii) the use of a Random Forest approach (iii) the simplified and noise-free nature of the simulated conditions in Payerne et al. (2025), where measurement uncertainties, selection effects, and intrinsic galaxy diversity are minimized. The broadening of the UNIONS distribution can be attributed to observational noise in UNIONS data, all of which introduces scatter and extends the distribution away from the main peak. As a result, the realistic dataset provides a more accurate representation of the complexity encountered in real observations.
As for the idealistic Payerne et al. (2025), an alternative is to use a Random Forest classifier to select LBGs within a specific redshift range using UNIONS imaging. However, the limited overlap between UNIONS and CLAUDS+HSC on XMM restricts the available training data (the overlap is about 2 deg2). Instead, we construct a hybrid selection by splitting the matches into two separate samples. For the first subsample, we precompute the mean LePhare photometric redshift in a multidimensional grid of UNIONS colors (u − g, g − r, r − i) and r magnitude. A UNIONS galaxy in the second subsample is considered as part of the LBG sample if the mean redshift in its associated grid cell satisfies ⟨z⟩> 1.7. This grid-based approach is conceptually similar to Self-Organizing Maps (SOMs, Zhang et al. 2025; Roster et al. 2026), and poorly mimics the cut-free approach of Random Forests. From this, we obtain a representative redshift distribution based on a reference sample. Our method uses a fixed, axis-aligned grid, while SOMs construct an adaptive, data-driven grid that captures complex structures in color space. This hybrid SOM-like selection tests the relevance of the color-box cuts compared to a data-driven cut-free method, which is represented in green in the right panel of Figure 1. We observe that the distribution fairly matches the color-color box selection.
To check the consistency of the u-dropout selection described above, we apply an alternative selection based on the BPZ photometric redshifts (Benítez 2011) of UNIONS detections, requiring Z_ML > 1.7 in the catalog. This selection is referred to as UNIONS_PZcut, and represented in purple in the right panel of Figure 1. This validation on XMM of the color-box selection enables us to apply this selection to a wider UNIONS portion of the sky, and assess the corresponding redshift distribution through clustering-redshift methods.
3.2.2. Measuring the redshift distribution of u-dropout LBGs
Matching LBG targets with deep CLAUDS+HSC photometric catalogs is useful to assess the underlying photometric redshift distribution; however, this approach remains limited by the accuracy of the deep photometric catalogs and by the relatively low statistical significance of the inferred distribution (usually done on fields of a few square degrees). A second method relies on spectroscopic data, as high-precision spectroscopic measurements can help calibrate the less accurate photometric redshifts. For available spectroscopic subsamples of photometric datasets, direct calibration is possible (see, e.g., Lima et al. 2008; Hildebrandt et al. 2021). Such direct calibration is challenging for LBGs, however, since obtaining a representative spectroscopic sample for any specific photometric selection is difficult in practice (Ruhlmann-Kleider et al. 2024; Payerne et al. 2025).
In this work, we use the clustering-redshifts method (Ménard et al. 2013; Schmidt et al. 2013; d’Assignies et al. 2025b) to evaluate the redshift distribution of an arbitrary photometrically-selected dataset based on the spatial cross-correlation with a reference population, the latter with spectroscopic redshift available. The overlap of UNIONS data we use in this work with DESI spectroscopic observations publicly available (DR1, Ross et al. 2025) is approximately 1300 deg2. The region of overlap suffers from a low completeness of DESI data (Dec > 30 deg), which limits the method’s constraining power. DESI data also includes randoms. We evaluate the measurements for the different DESI tracers separately (BGS, LRG, ELG, and QSO), and then recombine them, as the tracers’ masks differ from one to another. The DESI data are then used to measure the product of the redshift distribution with galaxy bias bLBG(z) nLBG(z) with the so-called clustering-redshifts methods (using the pipeline developed in d’Assignies et al. 2025b), over the redshift range 0 < z < 3.4. Contrary to the majority of the previous clustering-redshift calibration methods, we do not aim to break the degeneracy between bias and redshift distribution, as the observables we consider directly depend on their product. The uncertainty σ(bLBG(z) nLBG(z)) can then be used directly for the Bayesian inference, and we marginalize over both bias and distribution during the same step.
For clustering-redshift calibration, we binned the spectroscopic data in redshift bins zj ± Δz/2, and measure the cross correlations between each spectroscopic bin sj and the UNIONS LBG bin, as functions of perpendicular separations rp: wsj LBG(rp). To maximize the signal-to-noise ratio and simplify covariance estimates, we usually reduce the data vectors to scalars given by
(13)
with W(rp)∝rpγ a normalized weighting functions. We use the scale range 1.5 − 5.5 Mpc and the weighting γ = −1, which is an excellent tradeoff between boosting the signal-to-noise ratio coming from small scales, limiting biasing due to nonlinearity (d’Assignies et al. 2025b), and fiber collisions (see Choppin de Janvry et al. 2025; d’Assignies et al. 2025a). These scalars can be used to constrain bLBG(z) nLBG(z) as
(14)
where
is a theoretical function estimated with Halofit (Takahashi et al. 2012). The spectroscopic galaxy biases bs(zj) are measured from the auto-correlations of the spectroscopic bins, for larger scales (to limit the effect of fiber collision). The data vectors are measured with Treecorr (Jarvis 2015) and LS estimators (Landy & Szalay 1993). Moreover, the correlations of
between different redshifts zj can be neglected (d’Assignies et al. 2025a). Thus, we estimate the covariance matrix using a standard Jackknife estimate, and set all the off-diagonal coefficients to 0. We combined the measurements from different tracers localized at the same redshift, using an inverse weighting scheme, neglecting cross-correlations (Choppin de Janvry et al. 2025). The joint product of the LBG bias and the LBG redshift distribution in Eq. (14) is shown in Figure 1 (right panel) with the corresponding error bars. For the latter, we work in a minimal scenario, where the bias and the redshift distribution cannot be disentangled. This is possible by dealing with accurate photometric redshifts. We can compute an effective bias of the sample by integrating bLBGnLBG, we get beff = 1.5 ± 0.2, where the error is estimated from random samples of bLBG nLBG.
We will use clustering-redshift estimates of bLBGnLBG for a subset of the forecasts, propagating the associated uncertainties, arising from imperfect knowledge of the galaxy bias and redshift distribution, to illustrate a data-driven forecast scenario. Owing to the limited sky overlap, these estimates carry relatively large uncertainties.
4. Modeling of LBG properties
We first detail how to parametrize uncertainties associated with our LBG sample, i.e., the large-scale bias, the outlier fraction, and the redshift distribution uncertainty, as well as accounting for clustering-redshift-derived distribution in the
prediction.
4.1. Modeling of the LBG large-scale linear bias
For the galaxy large-scale bias, we follow (Wilson & White 2019)
(15)
with A(m) = − 0.98 (m − 25)+0.11 and B(m) = 0.12 (m − 25)+0.17. Here, m is the apparent magnitude, to be considered to be the limiting magnitude mlim of the LBG sample (d’Assignies et al. 2023). For u-dropout (resp. g-dropout and r-dropout) LBGs, the limiting magnitude corresponds to the r band (resp. i and z band). This prescription successfully reproduces the large-scale bias measurements for various dropout selections and limiting magnitudes from the CARS6 (Hildebrandt et al. 2009) and GOLDRUSH7 (Ono et al. 2018) surveys. For the UNIONSlike_RF, UNIONSlike_RF_u180s, UNIONSlike_RF_u2x180s and UNIONS_colorcut samples, we use mlim = 24.3, 24.5, 24.5 and 24.2, respectively8. We introduce a global rescaling amplitude b0, so that the LBG bias is modeled as
(16)
4.2. Fraction of LBG outlier and their galaxy bias
In this section, we expect our LBG samples to be contaminated by a fraction fout of outliers, from which we can define a proper large-scale linear bias. We use a two-population model for a given sample with given total n(z), with some outliers at redshift z < zmid with given redshift distribution nout(z), outlier fraction fout and bias bout(z), along with a high-redshift sample at z > zmid, with redshift distribution nhighz(z) and bias bhighz(z), such as
(17)
The values taken for fout are discussed later in Section 6.1.4. We adopt the two-population bias model (Mergulhão et al. 2022)
(18)
where
(19)
Moreover, we also use the model
(20)
where this time, b0 is connected only to the high-redshift portion of the sample, contrary to Eq. (18). As a result, only the high-redshift LBG bias is treated as unknown, and the uncertainty on
is expected to be smaller than in the previous case, since the low-redshift outlier bias is fixed.
4.3. LBG redshift distribution uncertainty
Cosmological inference from the angular clustering of photometrically selected u-dropout LBGs relies on an accurate determination of their redshift distribution, n(z) (Choi et al. 2016; Petri et al. 2026). Calibrated n(z) has error bars that translate the uncertainty in calibrating the redshift distribution of LBGs. We consider a very simple toy example where the LBG sample has a n(z) with error bar σ(z). To propagate the uncertainty of the n(z) on the cosmological fits, we consider random samples
, where σ(z) is taken to be αn(z), i.e., a fraction of the total n(z). We consider α = 0.2 (i.e., 20% error on the recovered redshift distribution). We explore another n(z)-sampling technique, inspired from the shift-and-stretch standard method in the literature (Myles et al. 2021; d’Assignies et al. 2025b; Giannini et al. 2025), where we shift the high-redshift part of the Payerne et al. (2025)n(z) (left panel of Figure 1), i.e., z > 1 part around its mean z = 2.8 by a factor Δz ∼ 𝒩(0, 0.1), and stretch it by a factor 1 + α ∼ 𝒩(1, 0.1).
4.4. Accounting for all with Cz estimates
The clustering-redshift method estimates the product ⟨bLBGnLBG⟩(z), directly accounting for outliers. This inherent degeneracy between b(z) and n(z) cannot be disentangled without highly precise UNIONS photometric redshifts, making it practically impossible to separate the two terms n(z) and b(z). For standard galaxy clustering analysis, this is indeed advantageous, as marginalizing over clustering redshift uncertainties accounts for both the redshift evolution of the distribution and bias, as the degeneracy between b and n is also in the intrinsic kernel, cf. Eq. (2). As we are considering an additional term from non-Gaussianity, we slightly modify the kernel in Eq. (7) with
(21)
where beff = 1.5, as computed previously. That way,
is to be the only free parameter of the fit, as redshift uncertainty on bias and distribution, and outlier fraction are accounted for in the nb term. However, we are neglecting the impact of the redshift evolution on the galaxy bias in the second term. Let’s note that the form of the second term is, on its own, not exact, as we are also assuming a specific (and redshift invariant) model for bϕ.
We further explore a more sophisticated bias modeling, adopting the prescription of Wilson & White (2019) used throughout this work. In this case, b0 is no longer a free parameter but is instead fixed by the normalization constraint
(22)
From this, we replace beff in Eq. (21) by
, incorporating redshift dependence in LBG bias.
5. Forecasting methodology
In our forecast analysis, we consider two free parameters: (i)
, with a fiducial value of 0, and (ii) a galaxy bias-related parameter b0 (see below), rescaled to 1 as its fiducial value. To forecast constraints on
, we consider the observed dataset as a theoretical prediction for the clustering amplitudes of the LBG population and the CMB lensing potential, along with the corresponding theoretical covariances. In other words, the data we use in the forecasts are the binned theoretical predictions
where CbXY = Bbℓ CℓXY, Bbℓ is the binning matrix. We adopt a binning scheme inspired by Krolewski et al. (2024), with ℓmin = 5 (this lower limit is also imposed by the UNIONS sky coverage), ℓmax = 300 (safely describing galaxy clustering statistics as linear and redshift-only bias dependent), and Δℓ = 5, resulting in 60 bins. In the above equation, CℓXY is the unbinned full-sky angular power spectrum in multipole ℓ bins. Using the theoretical full-sky prediction is appropriate because it eliminates the variance of the estimator, resulting in a posterior distribution centered on the input values. However, this approach is a simplification, as it does not account for potential systematic biases in the signal estimation pipeline, which must be corrected in the analysis (e.g., via radial integral constraints).
The covariance of the binned de-coupled angular power spectrum is given by
(23)
where Bbℓ is the binning matrix, and the covariance of the unbinned spectra is (Brown et al. 2005)
(24)
where ℳℓℓ′−1 is the mixing-mode matrix which accounts for partial sky coverage (Alonso et al. 2019). Not included in our mock validation pipeline are non-Gaussian contributions to the covariance, such as the super-sample covariance (SSC), which is currently considered to be the dominant non-Gaussian contribution. SSC arises from the non-linear modulation of local observables by long-wavelength density fluctuations.
For our inference, we then adopt the theoretical data vector (i.e., the input prediction for the angular power spectrum), which is satisfactorily recovered by the mocks, along with the theoretical covariance that includes the full mode-coupling matrix. We also include a free galaxy bias-related parameter, b0, with a default value of 1, used to rescale certain LBG bias dependencies (discussed below). For the measured LBG angular power spectrum, the cross-angular power spectrum between the LBG density and the CMB lensing, and for the combination of the two, the likelihoods (respectively, ℒgg, ℒκg, ℒgg + κg) are assumed to follow a multivariate Gaussian distribution with theoretical covariances computed at the fiducial values
, which is accurate for approximating the likelihood of angular power spectra, which follows a Gamma distribution (Carron 2013).
We then draw samples from the parameter posterior distribution using Bayes’ theorem:
(25)
where ℒtot corresponds to ℒgg, ℒκg, or ℒgg + κg. We use the emcee package (Foreman-Mackey et al. 2013) with flat priors on
(between −500 and 500) and b0 (between 0 and 5). To accelerate the likelihood evaluation, we adopt the template method presented in Fabbian et al. (2026); Since our goal is to infer the LBG linear bias and the PNG parameter
, most contributions to the angular power spectra arise from pre-factors
multiplying terms that are independent of bias and PNG. This is always the case for
, and for b0 when it simply rescales the overall bias of the considered population. We precompute the angular power spectra for a fiducial cosmology with
and with b0 = 1, allowing only b0 and
to vary in the MCMC. This approach reduces the computation time of each likelihood evaluation to a few milliseconds and posterior estimations in seconds. The recovered uncertainties on
and b0 are taken to be the standard deviations of the recovered posterior distributions.
When accounting for redshift distributions in the inference, a continuous model that can be readily marginalized over is often unavailable. On the other hand, generating realizations of n(z) is usually straightforward (see Sections 4.3 and 4.4). This is typically the situation described in Bernstein et al. (2025), except that we are dealing with only one bin at a time and our likelihood can be evaluated within milliseconds. Hence, we can directly compute the likelihood for every discrete realization nk and effectively marginalize over them by stacking:
(26)
where Nreal is the number of realizations, all assumed to have equal probability, independent of y.
6. Results on
precision
6.1. From UNIONS-like LBG samples
The constraining power on
from galaxy two-point statistics arises primarily from the largest clustering scales (i.e., the lowest multipoles ℓ). The gain in
precision enabled by the increased LBG density from UNIONS is only realized if the relevant scales of Cℓgg are in the shot-noise-dominated regime. In the cosmic-variance–dominated regime, where
or equivalently
, the improvement from a higher number density saturates. For Cℓκg, the cosmic-variance–dominated regime requires both
and Cℓκκ ≫ Nℓκκ. In the lower plot of Figure 2, we show the quantities
and Cℓκκ/Nℓκκ. To define the cosmic-variance–dominated regime, we adopt the criteria
and Cℓκκ/Nℓκκ > 10. We find that the LBG auto-spectrum is cosmic-variance dominated up to ℓ ≲ 200, implying that increasing the tracer density does not improve the signal-to-noise ratio on these scales. At smaller scales, however, a higher tracer density can still provide additional information on the tracer bias, rather than directly on the
response, thereby helping to break the degeneracy between b0 and
.
![]() |
Fig. 2. Top panel shows the angular power spectra Cℓgg, Cℓκg, and Cℓκκ (solid lines) for the baseline sample. The cosmological-only contribution is shown as dashed lines, while the noise contribution is shown as dotted lines. Bottom panel shows the signal-to-noise ratio, defined as the ratio of the cosmological-only term to the noise, for Cℓgg and Cℓκκ. |
We first consider an idealized framework, with no outliers, no uncertainty, and with the redshift distribution set in Payerne et al. (2025) to evaluate the precision on
and b0 from a combination of the auto- and cross-spectra, Cℓgg and Cℓgκ, respectively. We obtain
from Cℓgg alone. From Cℓκg alone, the constraints are
. The joint constraints (i.e., combining Cℓgg and Cℓκg) are very similar to those from Cℓgg alone, reflecting the relatively larger uncertainties from the Cℓκg-only case. The corresponding posterior distributions for the Cℓκg-only and Cℓgg-only analyses are shown in the left panel of Figure 3. Although we do not adopt a tomographic approach by splitting the LBG sample into different redshift bins (see, e.g., Fabbian et al. 2026; Chiarenza et al. 2025), we mention that this method – when tomographic redshift distribution are sufficient constrained – can help separate potential systematic effects affecting only a fraction of the sample by providing improved constraints on the galaxy bias in each redshift bin. In Appendix D, we find that splitting the LBG redshift distribution into three bins between z = 1.5 and z = 3.5 with roughly equal numbers of objects (around 300 deg−2) still results in a cosmic-variance dominated regime on the scales considered (with respect to the galaxy surface density), while significantly degrading the signal-to-noise of the cross-spectrum in each bin compared to a single-bin approach.
![]() |
Fig. 3. Left: Posterior distribution of the parameters |
While CMB lensing reconstruction from temperature data can be contaminated by extragalactic foregrounds such as the Cosmic Infrared Background and the thermal Sunyaev-Zel’dovich effect, a polarization-only CMB lensing map–despite its higher noise–can also be used to measure
(Chiarenza et al. 2025). On the considered fitting scales, the Planck PR4 Polarization-only CMB lensing noise spectrum is 7 to 12 times larger than the Temperature+Polarization one. With Polarization-only noise, we get
(roughly
times larger than with Temperature+Polarization), with a 20% precision on the bias.
Table A.1 lists the forecasted precision on
and b0 for the various analysis cases we explore in this work. We also indicate to what percentage the
constraints compare to the baseline constraints with the UNIONS-like RF LBG sample, which yields
. We will now move to forecasts considering different modeling choices, treatment of uncertainties and systematics, cf. Section 4.
6.1.1. Impact of the angular momentum fitting range to mitigate imaging systematics
Imaging systematics, such as survey depth inhomogeneities and Galactic dust, can induce spurious density fluctuations in the tracer sample, generating artificial clustering signals. These effects are particularly problematic on the large scales most relevant for
measurements, where they can lead to significant biases. A common mitigation strategy is to model the tracer density variations as linear functions of imaging properties and then reweigh the tracers, thereby suppressing spurious fluctuations while preserving the cosmological signal (Chaussidon et al. 2022; Krolewski et al. 2024). In the context of
inference from the tracer angular power spectrum, however, removing such systematic contributions from the auto-spectrum Cℓgg remains challenging. For this reason, angular power spectrum–based analyses often rely primarily on the cross-spectrum with CMB lensing, Cℓκg, since the large-scale modes of Cℓgg are the most contaminated. It is therefore important to explore the constraining power of the LBG angular distribution when used exclusively in cross-correlation with the CMB lensing potential.
In the right panel of Figure 3, we show the precision on
obtained from Cℓκg only, with fitting ranges ℓ ∈ [ℓmin, 300] and 5 < ℓmin < 200. As expected, the constraints rapidly degrade with increasing ℓmin, since the sensitivity to
resides on the largest clustering scales. In parallel, we show the constraints from a joint analysis of Cℓκg and Cℓgg, where the auto-spectrum is progressively truncated to ℓ ∈ [ℓmin, 300], under the assumption that the clustering signal of Cℓgg can be reliably recovered only for ℓ > ℓmin. We find that as ℓmin increases, the precision on
approaches the Cℓκg-only result. The corresponding posterior distribution for ℓmin = 200 is shown in the left panel of Figure 3. While essentially no constraining power remains on
in this case, the precision on the bias parameter b0 is significantly improved. This indicates that relying on the small-scale clustering of the LBG population primarily constrains b0, rather than
, since these scales are insensitive to primordial non-Gaussianity but probe the large-scale linear bias more efficiently. So far, we have discussed the constraining power on
from the joint analysis of the LBG auto-power spectrum and the cross-correlation between LBGs and the CMB lensing map. In the next section, we instead focus on assessing the precision on
using the cross-correlation alone, adopting a more conservative approach regarding the treatment of imaging systematics.
6.1.2. Impact of the bΦ(b) parametrization
The sensitivity of the angular power spectrum to
is entirely degenerate with the parameter bΦ, through the combination
in Eq. (7). In the previous section, we assumed the universality of the halo mass function, which allows a partial disentanglement of
from the LBG bias via the bΦ(b) relation in Eq. (8). For most real tracers of large-scale structure, however, this universality does not hold, and the relation must account for a parameter pΦ that encodes the tracer’s merger history (Slosar et al. 2008), with pΦ = 1 corresponding to the universal case. The parameter pΦ remains poorly constrained across tracer populations, and broad marginalization over it can degrade
estimates (Barreira 2020). In the absence of reliable priors on pΦ, only the combination
can be constrained, although any nonzero detection of this product would still indicate the presence of local primordial non-Gaussianity. Besides the fiducial value pΦ = 1, we consider several alternative values from 0.1 to 2. Using the bias prescription in Eq. (15) we get
for pΦ = {0.2, 0.5, 1.5, 2}. We see that the accuracy on
can improve if the parameter p is lower than unity–an aspect that can be further investigated using cosmological simulations.
6.1.3. Impact of the LBG linear bias
An essential ingredient in the modeling of
is the tracer bias and its redshift dependence. In our baseline forecast setup, we adopt the prescription of Eq. (15), following Wilson & White (2019). This parametrization, however, is subject to modeling uncertainties, as current observational constraints on the LBG bias remain statistically limited (Ye et al. 2025; Ruhlmann-Kleider et al. 2024) and are highly sensitive to the adopted selection criteria.
As a simple test of modeling uncertainties, we can adopt a single effective bias parameter for the full redshift range, rather than a fully redshift-dependent prescription (see, e.g., Doux et al. 2018; Ye et al. 2025; Ruhlmann-Kleider et al. 2024). However, to account for the presence of interlopers described with other biases, we use a two-population model (Mergulhão et al. 2022), which allows the data to self-consistently capture contamination while keeping the model minimal, with only a few free parameters: the outlier fraction fout, and the outlier bias bout and the high-redshift bias bhighz as presented in Eq. (18). For the baseline UNIONS-like RF sample, we choose the z < 1 (resp. z > 1) part of the n(z) to represent the outliers (resp. high-redshift LBGs) in our sample. The fraction of outliers is taken to be the integral of the LBG photometric sample redshift distribution below z = 1, and yields fout = 0.33.
We fix the low-redshift bias to bout = {0.5, 0.8, 1} and the high-redshift bias to bhighz = {2.5, 3, 3.5}. Additionally, we consider bhighz(z) = bW19(z) from Eq. (15), giving an average high-redshift bias of ⟨bW19(z) | z > 1⟩ = 4.20. The forecasted precision on
for the different {bout, bhighz} choices is shown in full-lines in Figure 4. We see that, at fixed bout, the constraints improve as bhighz increases. Conversely, at fixed bhighz, larger bout values lead to weaker constraints. Adopting the two-population model with bout = ⟨bW19(z)|z < 1⟩ = 1.02 and bhighz = ⟨bW19(z)|z > 1⟩ = 4.20 results in
, slightly larger than the baseline using the Wilson & White (2019) prescription for the full-redshift range bias. We also consider the two-population bias model in Eq. (20), where b0 (the free parameter in the MCMC) is only connected to the high-redshift part of the sample, contrary to Eq. (18). The results are shown in dashed lines in Figure 4, where constraining b0 solely for the high-redshift population leads to slightly tighter
constraints. We observe that the two-population model provides greater flexibility in modeling galaxy bias.
![]() |
Fig. 4. Forecasted error on |
6.1.4. Impact of the outlier fraction
We model the Payerne et al. (2025) redshift distribution as in Eq. (17), depending on the outlier fraction at z < 1, (cf. Section 6.1.4). This modeling allows us to account for the effect of low-redshift outliers and to assess how
constraints are impacted when the sample is more or less contaminated. We vary fout following Mill et al. (2025), considering the values {0, 0.1, 0.2, 0.3, 0.4, 0.5} while keeping the total LBG number density fixed. Maintaining the same number density ensures that the Random Forest required density budget is unchanged, but with varying efficiency in selecting high-redshift LBGs. For fout = 0 (no contamination), we obtain the tightest constraint,
. As fout increases from 0.1 to 0.5, the precision degrades approximately linearly, with
increasing to
.
For the two alternative Random Forest selections based on UNIONS-like and LSSTY4-like photometry yielding 1100 LBG per square degrees, namely UNIONSlike_RF_u180s and LSSTY4like_RF_u2x180s (presented in Section 3), we obtain
and
, respectively. Still in the context of modifying the outlier fraction, we can assess the precision on
for a DESI-II-like sample, i.e., when the outliers can be removed by spectroscopic redshift confirmation. Considering the z > 2 part of the sample only (and modifying the LBG density accordingly, i.e., 800 LBG per square degree) we get that
. When accounting for the DESI spectroscopic redshift efficiency for 2-hour exposure (resp. 4-hour), we get
(resp.
). We observe that a higher precision on
can be achieved by removing outliers, which motivates enhanced sample selection from photometric datasets, and their spectroscopic follow-up, even if the current DESI spectroscopic redshift efficiency removes most of the z < 2 targets.
6.1.5. Impact of the redshift distribution uncertainty
Here we explore the impact of the n(z) uncertainty on the PNG constraints, with two models described in Section 4.3. For the first, we consider random samples
, where σ(z) is taken to be 0.2 n(z). In the second, we generate samples
, with the so-called shift-and-stretch method: we shift the high-redshift part of the n(z) by a random factor Δz ∼ 𝒩(0, 0.1), and stretch it by a random factor 1 + α ∼ 𝒩(1, 0.1). To derive the error on
accounting for the uncertainty on the n(z), we stack the chains once converged and compute the mean and variance from the stacked chains (as explained in Eq. (26)).
For the first model, considering fitting Cℓκg-only, the standard deviation STD of
and b0 best-fits over the 50 MCMCs are STD
and STD(b0) = 0.01, respectively. Marginal error on
and b0 are obtained after stacking the chains, and we get
, that we retrieve if we consider the approximation
, and σ(b0)≈0.08 similar to the fiducial fit, the effect of
precision is really small.
Using shift and stretch, we find similar results, namely that the dispersion of
over the 50 MCMCs is roughly STD
, leading to very few differences on the
precision when stacking chains and taking the marginal dispersion, i.e.,
. From this simple exercise, we tested the robustness of
constraints when accounting for the galaxy sample redshift uncertainty. We find that the effect is rather small for a well-calibrated n(z).
6.2. From early UNIONS LBG samples
In this section, we go beyond our idealistic LBG samples obtained in Payerne et al. (2025) to explore how
forecasts can be conducted using LBG selection on the early UNIONS multiband data. We first discuss the selection we use on the GAaP multiband catalog, and then we discuss the methodology to infer the LBG redshift distribution.
6.2.1. CLAUDS+HSC-calibrated n(z)
From the CLAUDS+HSC-calibrated redshift distribution presented in Figure 1 (right panel) and the lensing magnification bias for this sample, we forecast a precision of
when adopting the Wilson et al. (2017) bias model. This result is slightly better than that of the baseline photometric sample from Payerne et al. (2025), which contains a more significant fraction of low-redshift outliers. In contrast, the UNIONS selection includes a higher number of galaxies in the redshift range 1.5 < z < 2, where the CMB lensing kernel peaks. Additionally, the UNIONS sample benefits from a higher galaxy density (increasing from 1100 to 1400 deg−2), a lower magnification bias, and a higher mean bias, corresponding to a deeper r-band limit magnitude (24.2 vs. 24.3). The usable area for the forecast is also extended to the full UNIONS footprint, since the selection relies solely on ugr magnitudes. After convolving the redshift distribution with the DESI redshift reconstruction efficiency for 2-hour (resp. 4-hour) spectroscopic exposures–and adjusting for the corresponding number densities–we obtain
(resp.
), assuming only the UNIONS North Galactic Cap (NGC) footprint that DESI-II could realistically survey. The significant reduction in LBG number density for the DESI-II spectroscopic follow-up with 2-hours and 4-hours exposure time (down to 140 and 170 deg−2) arises from the higher proportion of UNIONS-LBGs that lie below z < 2 in the UNIONS selection compared to the selection in Payerne et al. (2025) peaking at z ≈ 3 (down to 500 and 590 deg−2 from 1100 deg−2 initially).
6.2.2. From clustering-redshifts with DESI DR1 data
In this section, we evaluate the degradation of the PNG constraints, using the clustering-redshift estimates of the LBG large-scale bias and redshift distribution (see Section 4.4). First, we fix bn to the measured bLBGnLBG with clustering redshifts, with corresponding beff = 1.5, leaving
to be the only free parameter of the fit. For this setup, we obtain a fiducial uncertainty of
(without accounting for uncertainties in the clustering-redshift b(z)n(z) distribution), as the lower effective bias naturally reduces the constraining power. Then, we use the redshift-dependent bias model in Eq. (15), with a fixed rescaling factor
in Eq. (22). From this model, we obtain
, quite similar to the late results obtained neglecting the bias redshift evolution through beff. From these results (still not accounting for the clustering-redshift uncertainties), we see that the previous result
assumes a high-bias, high-purity sample that is not currently supported by the clustering-redshift measurement.
To assess the impact of the uncertainty of the clustering-redshifts distribution, we generate random realizations
, and ensure that
remains positive. For each realization, we compute an effective bias
as the redshift-integrated value of
. The fiducial data vector is obtained using the reference bLBGnLBG(z) and beff in Eq. (21), and 50 MCMC realizations are performed, accounting for the sampled
and
as input for the Cℓκg modeling.
From this, the marginal uncertainty reaches
when stacking all converged chains. This corresponds to roughly a factor-of-two increase compared to the idealized case, reflecting a reduction in the forecasting power of the cross-correlation power spectrum. Importantly, this degradation is driven not by the dispersion in n(z) itself, but by the effective bias beff, which is closely linked to the constraining power on
–higher bias naturally leads to a better signal-to-noise ratio. Repeating the analysis with the error bars of the inferred clustering-redshift distribution reduced by a factor of
–corresponding to the expected improvement in clustering-redshift precision with upcoming DESI DR3 data–reduces the uncertainty to
, highlighting the strong potential for improved clustering-redshift calibration with next-generation spectroscopic data.
Then we use the redshift-dependent bias model in Eq. (15), and we propagate uncertainties on the clustering-redshift distribution via
and
to be the redshift integral of each
. From this model, we obtain
when including DESI DR1 uncertainties, and
when assuming DESI DR3 performance. We can see that the more sophisticated model is not associated with an increase in the uncertainty in the fixed case, and creates a 15% degradation accounting for uncertainties. Thus, reducing the uncertainties associated with the clustering redshift measurements is the main limitation of this method.
This framework offers a promising avenue for combining clustering-redshift information with realistic bias modeling, representing an important step toward fully data-driven constraints on primordial non-Gaussianity. Even though current uncertainties remain significant and are the major contribution to the uncertainty on
, this exploratory analysis demonstrates the feasibility of applying clustering-redshift methods to LBG samples over such a wide redshift range, by combining the analysis of multiple DESI tracers.
Table A.1 lists the forecasted precision on
and b0 for the various analysis cases we explore in Sect. 6.2. To help isolate the gains due to sample properties versus simple volume expansion from the ugriz (Section 6.1) and ugr footprint (Section 6.2), we also rescaled the constraints in the summary table by the square root of the ratio between the footprints (a factor of 1.13).
7. Summary and conclusions
In this work, we investigated how effectively UNIONS-selected LBGs can constrain local primordial non-Gaussianities, quantified by the parameter
. When measured from large-scale structure,
is intrinsically degenerate with the galaxy bias, represented here by b0, and both parameters are jointly fitted throughout this study.
Using an idealized LBG sample from Payerne et al. (2025) identified from broadband ugriz bands using a Random Forest algorithm, peaking at z = 2 and with a surface density of 1100 deg−2, we found that the scales most relevant for
measurement (ℓ = 5 − 200) are dominated by cosmic variance. Consequently, expanding the survey area–and therefore increasing the number of available modes–directly enhances the precision on
derived from the LBG angular power spectrum. In contrast, the cross-correlation between the LBG distribution and the Planck CMB lensing map remains limited by the noise of the latter. For our fiducial model, we obtained
, with a 9% uncertainty in the recovered galaxy bias, assuming the universality relation bϕ = 2δc(b − p) with pΦ = 1. However, the accuracy could be improved if the parameter p is lower than unity–an aspect that can be further investigated using cosmological simulations. By introducing a more complex, two-population bias model, we found that it degraded the precision to
. We also examined the impact of photometric outliers on
constraints. Tests using more realistic UNIONS-like datasets, from coadded U-band exposure on COSMOS, confirmed the strong role of outliers in degrading
constraints. Considering UNIONS as an option for DESI-II target selection naturally removes outliers below z = 2.2 through spectroscopic confirmation, yielding a precision of
, accounting for DESI redshift efficiency. This clearly demonstrated that reducing the fraction of outliers – even at the cost of a lower LBG density – significantly enhances the accuracy of
measurements. We further assessed the robustness of our forecasts to uncertainties in the underlying redshift distribution n(z). Randomizing n(z) at the 20% level or applying a shift-and-stretch modification impacted
by less than 1%, confirming the stability of our results against such variations.
Finally, leveraging early UNIONS data, we tested LBG selection on real observations. The LBG sample obtained on XMM was matched to the deep photometric catalog from CLAUDS+HSC to assess their n(z), from which we found
with the Wilson & White (2019) bias model, leveraging the wider UNIONS footprint (the UNIONS selection relies only on ugr, not ugriz as in Payerne et al. 2025), a larger linear bias (due to the lower magnitude limit) and a broader coverage of the CMB lensing kernel. In the context of DESI-II, spectroscopic follow-up would allow us to reach the high-redshift end of the distribution with no uncertainties on the redshift distribution, providing
. We then used the clustering-redshift technique (Ménard et al. 2013; d’Assignies et al. 2025b) to assess the redshift distribution of the UNIONS photometrically selected LBGs by cross-correlating them with DESI DR1 tracers over the wide range 0 < z < 3.5. We found good agreement with the CLAUDS+HSC calibrated distribution, demonstrating the application of clustering redshifts to such an extensive redshift interval for LBGs. However, when accounting for uncertainties in the clustering-redshift b(z)n(z) distribution, the constraint broadens to
, yet is expected to improve significantly to
with DESI DR3 data. Overall, these exploratory results highlight the promise of the clustering-redshift method as an independent method to validate the recovered photometric redshift distribution up to z > 2, without relying on deep photometric catalog or spectroscopic follow-ups.
Acknowledgments
The authors thank the anonymous reviewer for their insightful comments and suggestions. We thank David Alonso for his help in using the Core Cosmology Library (CCL, Chisari et al. 2019) and NaMaster (Alonso et al. 2019). C. P. and C. Y. acknowledge support from grant ANR-22-CE31-0009 for the HZ-3D-MAP project and from grant ANR-22-CE92-0037 for the DESI-Lya project. W. d’A. acknowledges support from the MICINN projects PID2019-111317GB-C32, PID2022-141079NB-C32, as well as predoctoral program AGAUR-FI ajuts (2024 FI-1 00692) Joan Oró. IFAE is partially funded by the CERCA program of the Generalitat de Catalunya.
References
- Aihara, H., AlSayyad, Y., Ando, M., et al. 2022, PASJ, 74, 247 [NASA ADS] [CrossRef] [Google Scholar]
- Alonso, D., Sanchez, J., Slosar, A., & LSST Dark Energy Science Collaboration 2019, MNRAS, 484, 4127 [NASA ADS] [CrossRef] [Google Scholar]
- Anbajagane, D., Chang, C., Lee, H., & Gatti, M. 2024, JCAP, 2024, 062 [Google Scholar]
- Barreira, A. 2020, JCAP, 2020, 031 [CrossRef] [Google Scholar]
- Barreira, A., Cabass, G., Schmidt, F., Pillepich, A., & Nelson, D. 2020, JCAP, 2020, 013 [NASA ADS] [CrossRef] [Google Scholar]
- Benítez, N. 2011, Astrophysics Source Code Library [record ascl:1108.011] [Google Scholar]
- Bernstein, G., Assignies Doumerg, W., Troxel, M. A., et al. 2025, ArXiv e-prints [arXiv:2506.00758] [Google Scholar]
- Besuner, R., Dey, A., Drlica-Wagner, A., et al. 2025, ArXiv e-prints [arXiv:2503.07923] [Google Scholar]
- Brown, M. L., Castro, P. G., & Taylor, A. N. 2005, MNRAS, 360, 1262 [NASA ADS] [CrossRef] [Google Scholar]
- Busca, N., & Balland, C. 2018, ArXiv e-prints [arXiv:1808.09955] [Google Scholar]
- Cagliari, M. S., Castorina, E., Bonici, M., & Bianchi, D. 2024, JCAP, 2024, 036 [Google Scholar]
- Carron, J. 2013, A&A, 551, A88 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Carron, J., Mirmelstein, M., & Lewis, A. 2022, JCAP, 2022, 039 [CrossRef] [Google Scholar]
- Challinor, A., & Lewis, A. 2011, Phys. Rev. D, 84, 043516 [NASA ADS] [CrossRef] [Google Scholar]
- Chambers, K. C., Magnier, E. A., Metcalfe, N., et al. 2016, ArXiv e-prints [arXiv:1612.05560] [Google Scholar]
- Chaussidon, E., Yèche, C., Palanque-Delabrouille, N., et al. 2022, MNRAS, 509, 3904 [Google Scholar]
- Chaussidon, E., Yèche, C., de Mattia, A., et al. 2025, JCAP, 2025, 029 [Google Scholar]
- Chiarenza, S., Krolewski, A., Bonici, M., et al. 2025, ArXiv e-prints [arXiv:2512.17865] [Google Scholar]
- Chisari, N. E., Alonso, D., Krause, E., et al. 2019, ApJS, 242, 2 [Google Scholar]
- Choi, A., Heymans, C., Blake, C., et al. 2016, MNRAS, 463, 3737 [Google Scholar]
- Choppin de Janvry, J., Gontcho, S. G. A., Seljak, U., et al. 2025, ArXiv e-prints [arXiv:2511.18133] [Google Scholar]
- Crenshaw, J. F., Leistedt, B., Graham, M. L., et al. 2025, ApJS, 281, 54 [Google Scholar]
- CSST Collaboration (Gong, Y., et al.) 2026, Sci. China Phys. Mech. Astron., 69, 239501 [Google Scholar]
- Dalal, N., Doré, O., Huterer, D., & Shirokov, A. 2008, Phys. Rev. D, 77, 123514 [CrossRef] [Google Scholar]
- d’Assignies, D. W., Zhao, C., Yu, J., & Kneib, J. P. 2023, MNRAS, 521, 3648 [Google Scholar]
- d’Assignies, W., Bernstein, G. M., Yin, B., et al. 2025a, ArXiv e-prints [arXiv:2510.23565] [Google Scholar]
- d’Assignies, W., Manera, M., Padilla, C., et al. 2025b, A&A, 702, A155 [Google Scholar]
- Desjacques, V., & Seljak, U. 2010, Class. Quant. Grav., 27, 124011 [Google Scholar]
- Desjacques, V., Jeong, D., & Schmidt, F. 2018, Phys. Rep., 733, 1 [Google Scholar]
- Desprez, G., Picouet, V., Moutard, T., et al. 2023, A&A, 670, A82 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Doux, C., Penna-Lima, M., Vitenti, S. D. P., et al. 2018, MNRAS, 480, 5386 [Google Scholar]
- Elvin-Poole, J., MacCrann, N., Everett, S., et al. 2023, MNRAS, 523, 3649 [NASA ADS] [CrossRef] [Google Scholar]
- Euclid Collaboration (Lepori, F., et al.) 2022, A&A, 662, A93 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Fabbian, G., Alonso, D., Storey-Fisher, K., & Cornish, T. 2026, JCAP, 2026, 056 [Google Scholar]
- Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
- Foucaud, S., McCracken, H. J., Le Fèvre, O., et al. 2003, A&A, 409, 835 [Google Scholar]
- Giannini, G., Alarcon, A., d’Assignies, W., et al. 2025, ArXiv e-prints [arXiv:2509.07964] [Google Scholar]
- Giavalisco, M., Dickinson, M., Ferguson, H. C., et al. 2004, ApJ, 600, L103 [NASA ADS] [CrossRef] [Google Scholar]
- Guy, J., Bailey, S., Kremin, A., et al. 2023, AJ, 165, 144 [NASA ADS] [CrossRef] [Google Scholar]
- Gwyn, S., McConnachie, A. W., Cuillandre, J.-C., et al. 2025, AJ, 170, 324 [Google Scholar]
- Harikane, Y., Ono, Y., Ouchi, M., et al. 2022, ApJS, 259, 20 [NASA ADS] [CrossRef] [Google Scholar]
- Harikane, Y., Ouchi, M., Oguri, M., et al. 2023, ApJS, 265, 5 [NASA ADS] [CrossRef] [Google Scholar]
- Herrera-Alcantar, H. K., Armengaud, E., Yèche, C., et al. 2025, JCAP, 2025, 053 [Google Scholar]
- Hildebrandt, H., Pielorz, J., Erben, T., et al. 2009, A&A, 498, 725 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hildebrandt, H., Arnouts, S., Capak, P., et al. 2010, A&A, 523, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]
- Ibata, R. A., McConnachie, A., Cuillandre, J.-C., et al. 2017, ApJ, 848, 128 [Google Scholar]
- Jarvis, M. 2015, Astrophysics Source Code Library [record ascl:1508.007] [Google Scholar]
- Jeong, D., Schmidt, F., & Sefusatti, E. 2011, Phys. Rev. D, 83, 123005 [Google Scholar]
- Krolewski, A., Percival, W. J., Ferraro, S., et al. 2024, JCAP, 2024, 021 [Google Scholar]
- Kuijken, K. 2008, A&A, 482, 1053 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kuijken, K., Heymans, C., Hildebrandt, H., et al. 2015, MNRAS, 454, 3500 [Google Scholar]
- Kuijken, K., Heymans, C., Dvornik, A., et al. 2019, A&A, 625, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [Google Scholar]
- Lewis, A., & Challinor, A. 2006, Phys. Rep., 429, 1 [Google Scholar]
- Lima, M., Cunha, C. E., Oyaizu, H., et al. 2008, MNRAS, 390, 118 [Google Scholar]
- Limber, D. N. 1953, ApJ, 117, 134 [NASA ADS] [CrossRef] [Google Scholar]
- LSST Science Collaboration (Abell, P. A., et al.) 2009, ArXiv e-prints [arXiv:0912.0201] [Google Scholar]
- Mainieri, V., Anderson, R. I., Brinchmann, J., et al. 2024, ArXiv e-prints [arXiv:2403.05398] [Google Scholar]
- Malkan, M. A., Cohen, D. P., Maruyama, M., et al. 2017, ApJ, 850, 5 [NASA ADS] [CrossRef] [Google Scholar]
- Ménard, B., Scranton, R., Schmidt, S., et al. 2013, ArXiv e-prints [arXiv:1303.4722] [Google Scholar]
- Mergulhão, T., Rubira, H., Voivodic, R., & Abramo, L. R. 2022, JCAP, 2022, 021 [Google Scholar]
- Mill, C. M., Leonard, C. D., Rau, M. M., Uhlemann, C., & Joudaki, S. 2025, JCAP, 2025, 037 [Google Scholar]
- Miyatake, H., Harikane, Y., Ouchi, M., et al. 2022, Phys. Rev. Lett., 129, 061301 [NASA ADS] [CrossRef] [Google Scholar]
- Miyazaki, S., Komiyama, Y., Kawanomoto, S., et al. 2018, PASJ, 70, S1 [NASA ADS] [Google Scholar]
- Myles, J., Alarcon, A., Amon, A., et al. 2021, MNRAS, 505, 4249 [NASA ADS] [CrossRef] [Google Scholar]
- Okamoto, T., & Hu, W. 2003, Phys. Rev. D, 67, 083002 [NASA ADS] [CrossRef] [Google Scholar]
- Ono, Y., Ouchi, M., Harikane, Y., et al. 2018, PASP, 70, S10 [Google Scholar]
- Payerne, C., d’Assignies Doumerg, W., Yèche, C., et al. 2025, JCAP, 2025, 031 [Google Scholar]
- Petri, F., Leistedt, B., Mortlock, D. J., et al. 2026, MNRAS, 545, staf2115 [Google Scholar]
- Planck Collaboration IX. 2020, A&A, 641, A9 [Google Scholar]
- Reddy, N. A., Steidel, C. C., Pettini, M., et al. 2008, ApJS, 175, 48 [CrossRef] [Google Scholar]
- Rezaie, M., Ross, A. J., Seo, H.-J., et al. 2024, MNRAS, 532, 1902 [NASA ADS] [CrossRef] [Google Scholar]
- Ross, A. J., Aguilar, J., Ahlen, S., et al. 2025, JCAP, 2025, 125 [Google Scholar]
- Roster, W., Wright, A. H., Hildebrandt, H., et al. 2026, A&A, 707, A277 [Google Scholar]
- Ruhlmann-Kleider, V., Yèche, C., Magneville, C., et al. 2024, JCAP, 2024, 059 [Google Scholar]
- Sailer, N., Castorina, E., Ferraro, S., & White, M. 2021, JCAP, 2021, 049 [CrossRef] [Google Scholar]
- Sawicki, M., Arnouts, S., Huang, J., et al. 2019, MNRAS, 489, 5202 [NASA ADS] [Google Scholar]
- Schlegel, D. J., Ferraro, S., Aldering, G., et al. 2022, ArXiv e-prints [arXiv:2209.03585] [Google Scholar]
- Schmidt, S. J., Ménard, B., Scranton, R., Morrison, C., & McBride, C. K. 2013, MNRAS, 431, 3307 [Google Scholar]
- Schmittfull, M., & Seljak, U. 2018, Phys. Rev. D, 97, 123540 [NASA ADS] [CrossRef] [Google Scholar]
- Slosar, A., Hirata, C., Seljak, U., Ho, S., & Padmanabhan, N. 2008, JCAP, 2008, 031 [CrossRef] [Google Scholar]
- Steidel, C. C., Giavalisco, M., Pettini, M., Dickinson, M., & Adelberger, K. L. 1996, ApJ, 462, L17 [Google Scholar]
- Steidel, C. C., Adelberger, K. L., Giavalisco, M., Dickinson, M., & Pettini, M. 1999, ApJ, 519, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Takahashi, R., Sato, M., Nishimichi, T., Taruya, A., & Oguri, M. 2012, ApJ, 761, 152 [Google Scholar]
- Wilson, M. J., & White, M. 2019, JCAP, 2019, 015 [Google Scholar]
- Wilson, M. J., Peacock, J. A., Taylor, A. N., & de la Torre, S. 2017, MNRAS, 464, 3121 [Google Scholar]
- Ye, I., Bull, P., Bowler, R. A. A., et al. 2025, MNRAS, 543, 3196 [Google Scholar]
- Yu, B., Knight, R. Z., Sherwin, B. D., et al. 2023, Phys. Rev. D, 107, 123522 [Google Scholar]
- Zhang, Y. H., Zuntz, J., Moskowitz, I., et al. 2025, MNRAS, submitted [arXiv:2508.20903] [Google Scholar]
Such as the fractional perturbation
, where κ is the projected matter density field along the line of sight, and
is the intrinsic clustering fractional perturbation.
i.e., the logarithmic slope of the cumulative number counts at the survey detection limit, where N(< m) is the number of objects brighter than magnitude m, and mlim is the survey flux limit.
Canada-France-Hawaii Telescope Large Area U-band Deep Survey, Sawicki et al. (2019).
Hyper Suprime-Camera, Aihara et al. (2022).
First, a convolutional neural network (CNN) derived from QuasarNET (Busca & Balland 2018) is applied to each DESI spectrum. A more precise LBG redshift is then obtained using the RedRock (RR) software (Guy et al. 2023) that uses the CNN output redshift as a prior and refines its measurement.
CFHTLS-Archive-Research Survey.
Great Optically Luminous Dropout Research Using Subaru HSC.
With typical bias bW19(z = 2.5, mlim = 24.2) = 4, bW19(z = 2.5, mlim = 24.3) = 3.8 and bW19(z = 2.5, mlim = 24.5) = 3.4.
Appendix A: Summary of
forecasts
We present in Table A.1 a summary of the different forecast results on
and b0 (when considered, only the fractional error is reported in this table) that we conducted in this paper. Results are sorted by paper sections.
Summary of the forecasted precision on
and the precision on the bias parameter σ(b0)/b0 for the different analysis cases we explore in this paper.
Appendix B: Scales probed by the LBG redshift distributions
In this appendix, we examine the comoving scales probed by the various LBG samples used in this work. The default angular multipole range is ℓ ∈ [5, 300], which we convert to comoving wavenumbers via
, where χ(z) is the comoving distance at redshift z. Fixing the minimum and maximum values of ℓ restricts the k-range to the region between the two blue lines shown in Figure B.1. We now examine the k − z regions spanned by the UNIONS-like RF selection from Payerne et al. (2025) (left panel of Figure 1). We draw redshift samples according to n(z)2 (in red), corresponding to the LBG redshift distribution kernel, which peaks at z ∼ 3. The high-redshift tail probes scales k ≲ 0.05 h Mpc−1, while the largest accessible scales are limited by the survey size at z = 3 with ℓ = 5, corresponding to k ≳ 10−3 h Mpc−1. The RF UNIONS-like distribution also includes low-redshift outliers (z < 0.5), which primarily probe non-linear scales located above the blue dashed line in Figure B.1. We define the maximum wavenumber kmax(z) for which the linear matter power spectrum remains valid as kmax(z) = max{k | Δ2(k,z) < 0.2}, with
(B.1)
![]() |
Fig. B.1. Wavelength k − z plane. The region between the two green lines corresponds to all possible scales probed by the LBG samples. The area above the dashed blue line indicates non-linear scales. Left: The red (black) distribution shows the 1-2σ weighted k − z map following n(z)2 (n(z)Wκ(z), respectively) of the RF UNIONS-like selection from Payerne et al. (2025). Right: Same as the left plot, but for the UNIONS color-cut LBG selection. |
where P(k, z) is the linear matter power spectrum evaluated at scale factor a = 1/(1 + z). The mode sensitivity when cross-correlating with CMB lensing – by drawing redshift samples according to n(z)Wκ(z) – represented in black in the left panel of Figure B.1, significantly reduces the importance of z < 0.5 objects, such as all scales probed are highly linear. For the UNIONS color-cut distribution, which probes somewhat smaller redshifts (right panel of Figure B.1), the minimum scale accessed by the high-redshift portion of the distribution (LBG redshift kernel only) is smaller than for the UNIONS-like RF selection, with k ≲ 0.075 h Mpc−1. Overall, for the high-redshift portion of the LBG distributions considered in this work, the typical comoving wavenumber range is k ∈ [10−3, 0.075] h Mpc−1. The low-redshift outliers probe smaller scales, but they constitute a much smaller fraction of the sample. When cross-correlating with CMB lensing, the scales probed are roughly the same, with a slightly higher sensitivity to high redshifts.
Appendix C: Magnification bias
In Figure C.1, we show the calculation of the magnification bias
(C.1)
![]() |
Fig. C.1. Magnification bias in several redshift intervals for the different LBG samples we use in this work. |
for different redshift intervals (the redshift intervals are obtained using LePhare photometric redshifts on COSMOS and XMM). For the different samples, we see some redshift variation over the redshift range. We adopt a conservative approach by using the mean value over the redshift range 0 < z < 4.
Appendix D: Investigating tomographic LBG samples
In this appendix, we consider the UNIONS-like RF sample presented in Section 3, with a surface density of 1100 deg−2 over the full redshift range 0 < z < 4. For the tomographic approach, we split the sample into three redshift bins with edges at z = 1.5, 2.6, 2.9, and 3.5, such that the corresponding number density in each bin is approximately 300 deg−2. Figure D.1 (top panel) shows the per-ℓ quantity Cℓgg × ngal for the three tomographic bins, as well as for a single bin that includes all LBGs contained in the three bins. On scales 10 < ℓ < 300, the LBG-only angular power spectrum remains cosmic-variance dominated. The bottom panel shows the signal-to-noise ratio of the Cℓκg signal for the three bins and the single bin approach, demonstrating that it is significantly degraded when adopting a tomographic approach.
![]() |
Fig. D.1. Top: Ratio of the predicted angular power spectrum of the LBG population, split into three tomographic redshift bins (together with the single-bin prediction), to the corresponding galaxy shot noise. Bottom: Corresponding signal-to-noise ratio of the angular cross-correlation power spectrum between the LBG tomographic samples and CMB lensing. |
Appendix E: Degrading deep photometric datasets
E.1. CLAUDS U-band forced photometry using HSC g-band detection position
The resulting U-band PSF depth distribution, obtained by performing forced photometry in the g-band on the U-band CLAUDS images (with exposure times of texp = 180 s and texp = 2 × 180 s, respectively), is shown in Figure E.1, together with two additional runs of 3 × 180 s and 600 s (the default texp of for the released CLAUDS catalog available at https://www.clauds.net), respectively. From the plot, the higher texp, the higher the mean U-band PSF depth we get. For texp = 180 s and texp = 2 × 180 s, we get a mean PSF depth of 24.6 and 25. Let us note that this enables us to have UNIONS-like data in the U-band, not in the other bands (griz). This is explained in the next section.
![]() |
Fig. E.1. 5σ PSF depth for U-band data, obtained by forced photometry at the HSC g-band detection position. The different histograms are obtained by considering different texp for the CLAUDS U-band images. |
E.2. Artificially degrading a deeper photometric dataset to shallower depths
For the other bands (griz), we follow the method presented in Payerne et al. (2025). The shallow magnitude mshallow is obtained from the deep magnitude mdeep (from the HSC catalog) by adding noise in flux space corresponding to a shallower limiting depth. The deep flux is first computed as
(E.1)
The shallow flux is obtained by adding a Gaussian random deviate,
(E.2)
where an additional flux uncertainty σf,add is introduced,
(E.3)
The corresponding shallow magnitude is
(E.4)
The additional magnitude uncertainty σm,add is defined such that the total magnitude error matches that expected at the shallower depth,
(E.5)
with
(E.6)
where Nσ = 5. An illustration of this procedure is shown in Figure E.2 for the degradation of HSC magnitudes measured in the r band. The deep r-band magnitude distribution, shown in blue, corresponds to a 5σ depth of r = 26.6. After applying the artificial degradation, the resulting shallow magnitude distribution is shown in magenta and corresponds to a simulated 5σ depth of r = 25.5. Reaching a shallower depth also implies a reduced number of detected objects. To explore how detection efficiency modifies the histograms, we model this effect by applying a random selection of sources based on their probability of detection in the shallower survey. Assuming Gaussian flux uncertainties and a detection threshold of Nσ = 3, the detection probability for an object of magnitude m is given by
(E.7)
![]() |
Fig. E.2. Top: Histogram of r-band magnitudes before (blue) and after (magenta) applying the artificial degradation procedure. The orange histogram shows the magnitude distribution after applying the detection efficiency of the shallower survey. Bottom: r-band magnitude uncertainty as a function of magnitude before (blue) and after (magenta) applying the artificial degradation procedure. |
where σf = f(mshallow depth)/5. The resulting magnitude distribution after applying this completeness selection is shown in orange in Figure E.2 (upper panel), and exhibits the same turnover at the limiting depth as the deep sample. For the r-band cut we use in this work (of about 24.2 to 24.5), we are not sensitive to this photometric detection efficiency effect. Finally, the bottom plot show the magnitude error σm,deep and σm,shallow given by
(E.8)
with respect to magnitude for the deep and shallow r-band catalog, matching their expected value in Eq. (E.6). We note that a more sophisticated error model exists to describe the turnover of the magnitude error at the magnitude depth (see Figure E.2, lower panel), as explained in Section 3.2.4 of Hildebrandt et al. (2010), used in Wilson & White (2019) for degradation.
All Tables
Summary of the forecasted precision on
and the precision on the bias parameter σ(b0)/b0 for the different analysis cases we explore in this paper.
All Figures
![]() |
Fig. 1. Left: Photometric redshift distributions of the photometric LBG samples UNIONSlike_RF in red (the convolution of the UNIONSlike_RF distribution with the DESI spectroscopic efficiency is shown in dashed and dotted lines, they have not been normalized to show the impact at z ∼ 2.5). The distribution of the samples UNIONSlike_RF_u180s, and LSSTY4like_RF_u2x180s are shown in blue and dashed cyan lines. Right: Photometric redshift distributions of the UNIONS LBG samples UNIONS_colorcut and UNIONS_PZcut in the XMM field. The sample selected with the hybrid method à-la SOM is shown for illustration. We also show the quantity nb (the product between the large-scale bias and the normalized redshift distribution) resulting from our clustering-redshift calibration methods. |
| In the text | |
![]() |
Fig. 2. Top panel shows the angular power spectra Cℓgg, Cℓκg, and Cℓκκ (solid lines) for the baseline sample. The cosmological-only contribution is shown as dashed lines, while the noise contribution is shown as dotted lines. Bottom panel shows the signal-to-noise ratio, defined as the ratio of the cosmological-only term to the noise, for Cℓgg and Cℓκκ. |
| In the text | |
![]() |
Fig. 3. Left: Posterior distribution of the parameters |
| In the text | |
![]() |
Fig. 4. Forecasted error on |
| In the text | |
![]() |
Fig. B.1. Wavelength k − z plane. The region between the two green lines corresponds to all possible scales probed by the LBG samples. The area above the dashed blue line indicates non-linear scales. Left: The red (black) distribution shows the 1-2σ weighted k − z map following n(z)2 (n(z)Wκ(z), respectively) of the RF UNIONS-like selection from Payerne et al. (2025). Right: Same as the left plot, but for the UNIONS color-cut LBG selection. |
| In the text | |
![]() |
Fig. C.1. Magnification bias in several redshift intervals for the different LBG samples we use in this work. |
| In the text | |
![]() |
Fig. D.1. Top: Ratio of the predicted angular power spectrum of the LBG population, split into three tomographic redshift bins (together with the single-bin prediction), to the corresponding galaxy shot noise. Bottom: Corresponding signal-to-noise ratio of the angular cross-correlation power spectrum between the LBG tomographic samples and CMB lensing. |
| In the text | |
![]() |
Fig. E.1. 5σ PSF depth for U-band data, obtained by forced photometry at the HSC g-band detection position. The different histograms are obtained by considering different texp for the CLAUDS U-band images. |
| In the text | |
![]() |
Fig. E.2. Top: Histogram of r-band magnitudes before (blue) and after (magenta) applying the artificial degradation procedure. The orange histogram shows the magnitude distribution after applying the detection efficiency of the shallower survey. Bottom: r-band magnitude uncertainty as a function of magnitude before (blue) and after (magenta) applying the artificial degradation procedure. |
| In the text | |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.















