Optimizing redshift distribution inference through joint self-calibration and clustering-redshift synergy

Weilun Zheng; Kwan Chuen Chan; Haojie Xu; Le Zhang; Ruiyu Song

doi:10.1051/0004-6361/202452334

Home

All issues

Volume 692 (December 2024)

A&A, 692 (2024) A186

Full HTML

Open Access

Issue		A&A Volume 692, December 2024


Article Number		A186
Number of page(s)		17
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/202452334
Published online		12 December 2024

A&A, 692, A186 (2024)

Optimizing redshift distribution inference through joint self-calibration and clustering-redshift synergy

Weilun Zheng¹^,2, Kwan Chuen Chan¹^,2^⋆, Haojie Xu³^,4^,5, Le Zhang¹^,2^,6 and Ruiyu Song¹^,2

¹ School of Physics and Astronomy, Sun Yat-Sen University, 2 Daxue Road, Tangjia, Zhuhai 519082, China
² CSST Science Center for the Guangdong-Hongkong-Macau Greater Bay Area, SYSU, Zhuhai 519082, China
³ Shanghai Astronomical Observatory, Chinese Academy of Sciences, Nandan Road 80, Shanghai 200240, China
⁴ Department of Astronomy, Shanghai Jiao Tong University, Shanghai 200240, China
⁵ Key Laboratory for Particle Astrophysics and Cosmology (MOE)/Shanghai Key Laboratory for Particle Physics and Cosmology, Shanghai 200240, China
⁶ Peng Cheng Laboratory, No. 2, Xingke 1st Street, Shenzhen 518000, China

^⋆ Corresponding author; chankc@mail.sysu.edu.cn

Received: 21 September 2024
Accepted: 12 November 2024

Abstract

Context. Accurately characterizing the true redshift (true-z) distribution of a photometric redshift (photo-z) sample is critical for cosmological analyses in imaging surveys. Clustering-based techniques, which include clustering-redshift (CZ) and self-calibration (SC) methods–depending on whether external spectroscopic data are used–offer powerful tools for this purpose.

Aims. In this study, we explore the joint inference of the true-z distribution by combining SC and CZ (denoted as SC+CZ).

Methods. We derived simple multiplicative update rules to perform the joint inference. By incorporating appropriate error weighting and an additional weighting function, our method shows significant improvement over previous algorithms. We validated our approach using a DES Y3 mock catalog.

Results. The true-z distribution estimated through the combined SC+CZ method is generally more accurate than using SC or CZ alone. To account for the different constraining powers of these methods, we assigned distinct weights to the SC and CZ contributions. The optimal weights, which minimize the distribution error, depend on the relative constraining strength of the SC and CZ data. Specifically, for a spectroscopic redshift sample that amounts to 1% of the photo-z sample, the optimal combination reduces the total error by 20% (40%) compared to using CZ (SC) alone, and it keeps the bias in mean redshift [Δ͞z/(1+z)] at the level of 0.003. Furthermore, when CZ data are only available in the low-z range and the high-z range relies solely on SC data, SC+CZ enables consistent estimation of the true-z distribution across the entire redshift range.

Conclusions. Our findings demonstrate that SC+CZ is an effective tool for constraining the true-z distribution, paving the way for clustering-based methods to be applied at z ≳ 1.

Key words: cosmology: observations / large-scale structure of Universe

© The Authors 2024

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

Wide-area imaging surveys provide powerful cosmological probes to constrain cosmology. Weak lensing is a prime example (Bartelmann & Schneider 2001; Heymans et al. 2013; Hildebrandt et al. 2017; Troxel et al. 2018; Asgari et al. 2021; Hikage et al. 2019; Amon et al. 2022; Secco et al. 2022; Li et al. 2023; Dalal et al. 2023). In particular, cosmic shear is often bundled in 3 × 2 point analysis, which includes cosmic shear, galaxy-galaxy lensing, and galaxy clustering (Abbott et al. 2018, 2022a; Heymans et al. 2021; Miyatake et al. 2023; Sugiyama et al. 2023). These analyses offer a strong constraint on S₈, and the tightest uncertainty level (∼2%) is already comparable to that from the Planck Satellite’s study of the cosmic microwave background (Planck Collaboration VI 2020). Because the lensing results are consistently lower than those from the cosmic microwave background, there is heated discussion regarding the S₈ tension. Moreover, imaging surveys also enable measurements of the transverse baryon acoustic oscillations (BAO) (Padmanabhan et al. 2007; Estrada et al. 2009; Hütsi 2010; Seo et al. 2012; Carnero et al. 2012; de Simoni et al. 2013; Abbott et al. 2019, 2024, 2022b; Chan et al. 2022; Song et al. 2024). The latest transverse BAO measurements, such as Abbott et al. (2024), are yielding constraints competitive with those from spectroscopic surveys (DESI Collaboration 2024). Upcoming stage IV surveys, such as from the Rubin Observatory Legacy Survey of Space and Time (LSST) (Ivezić et al. 2019; Mandelbaum et al. 2018), Euclid (Laureijs et al. 2011), Chinese Space Station Telescope (CSST) (Zhan 2011; Gong et al. 2019), and Roman Space Telescope (WFIRST) (Spergel et al. 2015; Eifler et al. 2021), are expected to deliver even more enormous photometric data and hence more exquisite results.

In imaging surveys, the redshifts (photo-z’s) are derived from the photometry information measured from a few broadband filters. The template fitting and training methods are commonly used to infer the photo-z’s of the galaxies (see Salvato et al. (2019), Newman & Gruen (2022) for a review). The template fitting method (e.g. Arnouts et al. 1999; Bolzonella et al. 2000; Benítez 2000; Ilbert et al. 2006) fits a model derived from known spectral energy distribution SED) templates, which includes the photo-z as a fitting parameter, to the color or magnitude data. The prior information can also be included in the fitting. This method is limited by the accuracy and representativeness of the templates and the accuracy of the prior information. Training methods often make use of tools from machine learning (Collister & Lahav 2004; Sadeh et al. 2016; De Vicente et al. 2016; Zhou et al. 2021; Li et al. 2022). A spectroscopic redshift (spec-z) sample is required to train the machine learning algorithm, so its accuracy depends on the abundance and the completeness or representativity of the spec-z sample.

In cosmological applications, accurate true redshift (true-z) distribution of the photo-z sample must be obtained to avoid biasing the cosmological results. Using a weighted spec-z sample is a viable approach. The effectiveness of this method hinges on the availability of the spec-z sample or sometimes a high-quality photo-z sample with numerous photo-z bands. The weighted spec-z sample can be constructed using a k-nearest neighbor search (Lima et al. 2008; Cunha et al. 2009; Bonnett et al. 2016) or a self-organizing map (Carrasco Kind & Brunner 2014; Masters et al. 2015; Buchs et al. 2019; Wright et al. 2020; Campos et al. 2024).

The clustering-based methods provide an independent way to calibrate the true-z distribution of the photo-z sample. Unlike photometry-based methods, this approach uses the clustering information, which can be traced back to gravity. Depending on the utilization of the external spec-z sample, it can be further categorized into the clustering-z (hereafter CZ) and self-calibration (SC) methods. In CZ, the true-z distribution is determined by cross correlating the photo-z sample with an external spec-z sample (Newman 2008; Matthews & Newman 2010; McQuinn & White 2013; Ménard et al. 2013; Schmidt et al. 2013; Morrison et al. 2017; van den Busch et al. 2020). This requires the spec-z sample to overlap with the photo-z one spatially, but the spec-z sample does not need to be representative. CZ has been routinely used to calibrate the true-z distribution in real data (e.g., Gatti et al. 2018, 2022; Cawthon et al. 2022; Hildebrandt et al. 2021; Rau et al. 2023). On the other hand, the SC method relies solely on the clustering information, both the auto and cross bin correlation function, of the photometric sample itself. (Schneider et al. 2006; Zhang et al. 2010; Benjamin et al. 2010; Zhang et al. 2017; Peng et al. 2022; Xu et al. 2023). Although SC has also been used for weak lensing (Benjamin et al. 2013) and BAO (Song et al. 2024) measurements, it is less frequently used than CZ. This may be because the redshift range explored in current surveys are still relatively low (z ≲ 1), and hence the spec-z sample is still sufficient for the calibration purpose. However, even in present surveys, the high redshift portion of the sample cannot be calibrated using CZ due to the absence of spec-z galaxies at high redshifts (e.g., Rau et al. 2023; Abbott et al. 2024). Consequently, we expect the SC method to play a more prominent role in upcoming surveys.

In this paper we explore combining the information of CZ and SC to simultaneously constrain the true-z distribution of a photo-z sample. As far as we know, this is the first time that these two methods have been jointly applied to constrain the true-z distribution in realistic mock data (see the Fisher forecast in McQuinn & White 2013). We anticipate that this synergy in redshift calibration is particularly fruitful in the high redshift regime, where the spec-z sample is scarce. The rest of the paper is organized as follows. We first review the SC and CZ formalism in Sect. 2.1 and then present the algorithm to jointly solve the SC and CZ equations in Sect. 2.2. We test our method using a DES Y3 mock catalog in Sect. 3. In particular, we contrast the true-z inference results from SC, CZ, and SC+CZ, and we demonstrate that our improved algorithm is superior to the old algorithm. Moreover, we study the scenarios when the number of spec-z bins is equal to and less than the number of photo-z bins respectively, and show that SC+CZ can effectively extend the clustering-based method to a higher redshift. We conclude in Sect. 4. In Appendix A, we present the derivation of the update rules. We investigate the impact of outlying spec-z galaxies in Appendix B and the impact of setting the negative measurements to a tiny positive value in Appendix C.

2. The joint calibration method

2.1. Derivation of the SC and CZ equations

In this subsection, we show how the correlation function of the photo-z sample and the cross correlation function between the photo-z and spec-z sample are related to the true-z distribution. Our convention follows that of Song et al. (2024), and the meaning for the key notations used in Sect. 2.1 are summarized in Table 1.

Table 1.

Summary of the meaning of the key notations used in Sect. 2.1.

Because of photo-z uncertainties, galaxies within a photo-z bin may originate from multiple spec-z bins:

$\begin{matrix} M_{i^{'}} = \sum_{k} Q_{k i^{'}} M_{k}, \end{matrix}$ $\begin{aligned} M_{i^{\prime }} = \sum _k Q_{k i^{\prime }} M_k, \end{aligned}$ (1)

where M is the angular galaxy number density, Q_ki′ represents the probability that a galaxy in spec-z bin k leaks into photo-z bin i′. The index with (without) a prime denotes the photo-z (spec-z) bin index.

The correlation function of the photo-z sample can be written as

$\begin{matrix} ⟨ M_{i^{'}} M_{j^{'}} ⟩ = \sum_{k, l} ⟨ Q_{k i^{'}} M_{k} Q_{l j^{'}} M_{l} ⟩ . \end{matrix}$ $\begin{aligned} \langle M_{i^{\prime }} M_{j^{\prime }} \rangle = \sum _{k,l} \langle Q_{k i^{\prime }} M_k Q_{l j^{\prime }} M_l \rangle . \end{aligned}$ (2)

By expressing $M_{i'} = {\bar{M}}_{i'} (1 + δ_{i'})$ $M_{i\prime} = \bar{M}_{i\prime}(1 + \delta_{i\prime})$ and $M_{i} = {\bar{M}}_{i} (1 + δ_{i})$ $M_{i} = \bar{M}_i(1 + \delta_i)$ in terms of their respective means $\bar{M}$ $\bar{M}$ and fluctuations δ, we can write the angular overdensity correlation function as

$\begin{matrix} C_{i^{^{'}} j^{^{'}}} \equiv ⟨ δ_{i^{'}} δ_{j^{'}} ⟩ & = \sum_{k} P_{k i^{'}} P_{k j^{'}} C_{kk}^{'}, \end{matrix}$ $\begin{aligned} C_{i^{^{\prime }}j{^{^{\prime }}}} \equiv \langle \delta _{i^{\prime }} \delta _{j^{\prime }} \rangle&= \sum _k P_{ki^{\prime }} P_{kj^{\prime }} C^{\prime }_{kk} , \end{aligned}$ (3)

where C′_kk = ⟨δ_kδ_k⟩ is the spec-z angular overdensity correlation function of the photo-z sample in the spec-z bin k. We have defined P_ki′ as

$\begin{matrix} P_{k i^{'}} = \frac{Q_{k i^{'}} {\bar{M}}_{k}}{{\bar{M}}_{i^{'}}} . \end{matrix}$ $\begin{aligned} P_{ki^{\prime }} = \frac{ Q_{ki^{\prime }} \bar{M}_k}{ \bar{M}_{i^{\prime }} }. \end{aligned}$ (4)

From Eq. (4), it is evident that P_ki′ satisfies the normalization condition

$\begin{matrix} \sum_{k} P_{k i^{'}} = 1 . \end{matrix}$ $\begin{aligned} \sum _k P_{ki^{\prime }} = 1 . \end{aligned}$ (5)

Thus P_ki′ represents the fraction of galaxies in photo-z bin i′ coming from spec-z bin k.

By expressing M_i′ and M_i in Eq. (1) in terms of their fluctuations and then invoking Eq. (5), we get the following relation

$\begin{matrix} δ_{i^{'}} = \sum_{j} P_{j i^{'}} δ_{j}, \end{matrix}$ $\begin{aligned} \delta _{i^{\prime }} = \sum _{j} P_{ji^{\prime }} \delta _{j}, \end{aligned}$ (6)

which is the analogous relation to Eq. (1). In fact, P_ji′ is the true-z distribution of the photo-z sample, which is often denoted as n(z). In this paper, we use these two notations interchangeably.

In arriving at Eq. (3), we have assumed that $C_{ij}'$ $C_{ij}\prime$ is diagonal. In other words, the correlation between spec-z bins is non-zero only when they are within the same bin. This is an excellent approximation and it is exact under Limber approximation (e.g. Simon 2007). Eq. (3) is the essence of the SC method, and it enables us to extract P_ki′ using the clustering information of the photo-z data alone (Schneider et al. 2006; Zhang et al. 2010).

Assuming linear galaxy bias, we can further write Eq. (3) in terms of the underlying spec-z matter power spectrum $C_{kk}^{m}$ $C_{kk}^{\mathrm{m}}$ as

$\begin{matrix} C_{i^{^{'}} j^{^{'}}} = \sum_{k} P_{k i^{'}} P_{k j^{'}} b_{k}^{' 2} C_{kk}^{m}, \end{matrix}$ $\begin{aligned} C_{i^{^{\prime }}j{^{^{\prime }}}} = \sum _k P_{ki^{\prime }} P_{kj^{\prime }} b_{k}^{\prime 2} C^\mathrm{m}_{kk}, \end{aligned}$ (7)

where b_k′ is the bias parameter of the photo-z sample in kth spec-z bin.

From Eqs. (1) to (3), we implicitly assume that the leakage due to photo-z is universal irrespective of the galaxy type composition in the spec-z bin. To illustrate this point, let us assume that there are two types of galaxies in a spec-z bin, say blue and red galaxies. The universal leakage assumption asserts that both types of galaxies leak to different photo-z bins in the same proportion. Given photo-z estimation is based on the photometry information, which is closely related to the galaxy types, this assumption can only hold approximately.

For the present work, the violation of the universal leakage assumption manifests as the dependence of the bias parameter on the photo-z bin index i′. Without the universal leakage assumption, Eq. (7) would be generalized to

$\begin{matrix} C_{i^{^{'}} j^{^{'}}} = \sum_{k} P_{k i^{'}} P_{k j^{'}} b_{k i^{'}} b_{k j^{'}} C_{kk}^{m}, \end{matrix}$ $\begin{aligned} C_{i^{^{\prime }}j{^{^{\prime }}}} = \sum _k P_{ki^{\prime }} P_{kj^{\prime }} b_{k i^{\prime }} b_{k j^{\prime }} C^\mathrm{m}_{kk}, \end{aligned}$ (8)

where b_ki′ is the bias parameter of the i′th photo-z sample in the kth spec-z bin. In this work, we assume that the leakage is universal.

Suppose now we have another spec-z sample with number density in spec-z bin i, N_i, which is related to its mean number density ${\bar{N}}_{i}$ $\bar{N}_i$ and density fluctuation ϵ_i by the relation $N_{i} = {\bar{N}}_{i} (1 + ϵ_{i})$ $N_i = \bar{N}_i ( 1 + \epsilon_i )$ . We note that N_i and ϵ_i are in general different from the corresponding quantities for the photo-z sample in spec-z bin, M_i and δ_i. The spec-z sample is often much less abundant than the photo-z sample, and it tends to be brighter, and hence the linear bias of the spec-z sample is likely to be higher.

The cross correlation function between the spec-z and photo-z sample reads

$\begin{matrix} ⟨ N_{i} M_{j^{'}} ⟩ = \sum_{k} Q_{k j^{'}} ⟨ N_{i} M_{k} ⟩ . \end{matrix}$ $\begin{aligned} \langle N_i M_{j^{\prime }} \rangle = \sum _{k} Q_{k j^{\prime }} \langle N_i M_k \rangle . \end{aligned}$ (9)

Similar to the derivation of the SC results, we have the corresponding CZ equation:

$\begin{matrix} C_{i j^{'}} \equiv ⟨ ϵ_{i} δ_{j^{'}} ⟩ & = P_{i j^{'}} C_{ii}^{x} \end{matrix}$ $\begin{aligned} C_{ij^{\prime }} \equiv \langle \epsilon _{i} \delta _{j^{\prime }} \rangle&= P_{ij^{\prime }} C_{ii}^\mathrm{x} \end{aligned}$ (10)

$\begin{matrix} = P_{i j^{'}} b_{i} b_{i}^{'} C_{ii}^{m}, \end{matrix}$ $\begin{aligned}&= P_{ij^{\prime }} b_i b_{i}^\prime C_{ii}^\mathrm{m}, \end{aligned}$ (11)

where $C_{ii}^{x} = ⟨ ϵ_{i} δ_{i} ⟩$ $C_{ii}^{\mathrm{x}} = \langle \epsilon_i \delta_i \rangle$ is the cross angular correlation function between the spec-z sample in bin i with the photo-z sample in spec-z bin i, and b_i is the bias parameter of the spec-z sample in ith spec-z bin. For completeness, without the universal leakage assumption, we would instead have

$\begin{matrix} C_{i j^{'}} = P_{i j^{'}} b_{i} b_{i j^{'}} C_{ii}^{m} . \end{matrix}$ $\begin{aligned} C_{ij^{\prime }} = P_{ij^{\prime }} b_i b_{ij^{\prime }} C_{ii}^\mathrm{m}. \end{aligned}$ (12)

Unlike the SC case, which is quadratic in P, CZ is a linear problem. In the usual CZ method, the bias parameter of the spec-z sample, b_j can be measured easily but it is difficult to get b_j′. Thus CZ only directly constrains P_ij′b_i′ if we assume some theoretical matter correlation function model. Moreover, since P_ij′ is normalized, if b_i′ is a constant, it has no effect on the estimation of P_ij′. However, if b_i′ evolves with redshift (or index i), its evolution is degenerate with P_ij′. Indirect methods have been proposed to mitigate the impact of bias evolution (Schmidt et al. 2013; Davis et al. 2018; van den Busch et al. 2020; Cawthon et al. 2022; Gatti et al. 2022). From Eq. (7), it is clear that the bias evolution issue also affects the SC problem. When we allow the correlation function in the spec-z bin to be a free parameter (called P method below), the degree of freedom due to bias evolution is taken into account.

We shall test different approaches to solve the system of SC and CZ equations. First we regard P_ij′, $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ , and C′_ii as unknowns, denoted as the P method. This is the most general parameterization, but the constraint is slightly weakened. Similar approach is taken in SC, e.g. Zhang et al. (2017). On the other hand, analogous to the approach in CZ, we take R_ij′ ≡ P_ij′b_i′ as the variable and abbreviate it as the R method. We note that there is no normalization constraint on R_ij′. If we assume that b_i is known from measurement and $C_{kk}^{m}$ $C_{kk}^{\mathrm{m }}$ can be computed theoretically, then we only need to solve for R in the system:

$\begin{matrix} C_{i j^{'}} & = R_{i j^{'}} b_{i} C_{ii}^{m}, \end{matrix}$ $\begin{aligned} C_{ij^{\prime }}&= R_{ij^{\prime }} b_i C_{ii}^\mathrm{m} ,\end{aligned}$ (13)

$\begin{matrix} C_{i^{^{'}} j^{^{'}}} & = \sum_{k} R_{k i^{'}} R_{k j^{'}} C_{kk}^{m} . \end{matrix}$ $\begin{aligned} C_{i^{^{\prime }}j{^{^{\prime }}}}&= \sum _k R_{ki^{\prime }} R_{kj^{\prime }} C_{kk}^\mathrm{m} . \end{aligned}$ (14)

As a side note, in terms of the variable R_ij′ ≡ P_ij′b_ij′, it is easy to see that the non-universality issue does not introduce extra complications in the case of CZ.

2.2. Solution to the SC and CZ equations

2.2.1. Cost function and multiplicative update rules

Our goal is to develop an efficient method to solve Eq. (3) and (10) simultaneously for P_ij′, C′_ii, and $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ . Here we use the P method as an example and comment on the R method later on. In particular, because Eq. (3) is a system of coupled quadratic equations, it is challenging to solve. Several studies have been devoted to solving the SC equation (Erben et al. 2009; Benjamin et al. 2010, 2013; Zhang et al. 2017). Unlike prior researches, Zhang et al. (2017) solves Eq. (3) in full generality, without limiting the analysis to a two-bin scenario (Erben et al. 2009) or relying on a linear coupling (in P) approximation as done by Benjamin et al. (2010). In this work, inspired by Zhang et al. (2017), we construct multiplicative update rules to solve Eqs. (3) and (10) jointly.

We aim to derive an iterative update rule to minimize the sum of the SC and CZ cost functions:¹

$\begin{matrix} J = J_{1} + J_{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} =\mathcal{J} _{1}+\mathcal{J} _{2}, \end{aligned}$ (15)

where 𝒥₁ and 𝒥₂ are the contributions from SC and CZ respectively. They are given by

$\begin{matrix} J_{1} & = \frac{1}{2} \sum_{i^{^{'}}, j^{^{'}}, μ} \frac{W_{1} (θ_{μ})}{σ_{i^{'} j^{'}}^{2} (θ_{μ})} [D_{i^{^{'}} j^{^{'}}} (θ_{μ}) - \sum_{k} P_{k i^{^{'}}} P_{k j^{^{'}}} C_{kk}^{'} (θ_{μ})]^{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} _{1}&=\frac{1}{2} \sum _{i^{^{\prime }},j^{^{\prime }}, \mu } \frac{ W_1(\theta _\mu ) }{ \sigma _{i^{\prime }j^{\prime }}^2 (\theta _\mu ) } \Bigg [ D_{i^{^{\prime }}j^{^{\prime }}}(\theta _\mu )-\sum _{k}P_{ki^{^{\prime }}}P_{kj^{^{\prime }}}C^{\prime }_{kk}(\theta _\mu ) \Bigg ]^{2} ,\end{aligned}$ (16)

$\begin{matrix} J_{2} & = \frac{1}{2} \sum_{i, j^{^{'}}, μ} \frac{W_{2} (θ_{μ})}{σ_{i j^{'}}^{2} (θ_{μ})} [D_{i j^{^{'}}} (θ_{μ}) - P_{i j^{^{'}}} C_{ii}^{x} (θ_{μ})]^{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} _{2}&=\frac{1}{2} \sum _{i,j^{^{\prime }}, \mu } \frac{ W_2(\theta _\mu ) }{ \sigma _{ij^{\prime }}^2 (\theta _\mu ) } \Bigg [ D_{ij^{^{\prime }}}(\theta _\mu )-P_{ij^{^{\prime }}}C_{ii}^\mathrm{x}(\theta _\mu ) \Bigg ]^{2} , \end{aligned}$ (17)

where D denotes the data measurement (angular correlation function in our case), σ is the error bar of the measurement, and W₁ and W₂ are the weight functions. We consider weight function of the form θⁿ in this work (see also Ménard et al. 2013). Here we use Latin indices for the redshift bins and Greek indices for the angular bins. The update rule will update P_ij^′, $C_{ii}'$ $C_{ii}\prime$ , and $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ iteratively to look for a minimum of 𝒥. Additionally, we will consider assigning different weights to 𝒥₁ and 𝒥₂ later on.

By minimizing the cost function with respect to P_ab′, a multiplicative update rule (Eq. (A.9)) for P_ab′ can be derived. Upon multiplying the factor in Eq. (A.9) to P_ab′ repeatedly, P_ab′ converges to the solution. This method can be viewed as a variant of the gradient decent (Lee & Seung 2000). Similar update rule can also be established for $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ and C′_ii (Eq. (A.10)). The details of the derivation are relegated to Appendix A. A few comments of the multiplicative rules are in order.

First, Zhang et al. (2017) employed the non-negative matrix factorization (NMF) algorithm, originally proposed by Lee & Seung (2000), to address the SC problem. This approach has been adopted in subsequent studies (Peng et al. 2022; Xu et al. 2023; Song et al. 2024; Peng & Yu 2024). Due to the non-negativity constraint inherent in the SC model, the NMF method offers an elegant and efficient solution. NMF assumes that the model can be factorized into a product form WH, where W and H are distinct matrices. To apply this framework to the SC model, Zhang et al. (2017) reformulated it as WH_θ, with W = P^T and H_θ = C(θ)P. However, this splitting is somewhat artificial for the SC problem and may cause difficulty in including the CZ information due to its rigid structure. To further optimize and generalize the approach for solving the SC model, we bypass the NMF interpretation and instead treat it directly as a minimization problem with respect to P_ab′. By employing a specialized parameter update scheme inspired by the NMF approach, our simplified interpretation allows for an efficient combination of information from both SC and CZ models.

Secondly, the cost function for the multiplicative update rule is often taken to be the mean squared error without the inverse error bar weighting, e.g. Lee & Seung (2000), Zhang et al. (2017). Treating all the data on equal footing is at best sub-optimal. We have to include the error bar to down-weight the contribution of the poor measurements and up-weight the good ones, and to eliminate the impact of missing data measurements. Here we have improved over previous treatments by taking into account the error of measurements and the inclusion of additional weighting functions. The proper treatment should include the full covariance matrix. However, in the derivation of the multiplicative rule, we have to separate the positive part of the gradient from the negative one. Usage of the full covariance will hinder this separation. Thus for simplicity, we opt to use diagonal error bars in Eqs. (16) and (17). Under the diagonal covariance approximation, the data points are taken to be independent. This may seem a poor approximation, but we observed that the overall performance of the algorithm is good. We mention that Xu et al. (2023) used χ² as the stopping criterion of the iteration process, but the update rule is still based on the cost function without the inverse error weighting. Recently, Peng & Yu (2024) independently proposed to include error weighting in the NMF algorithm to solve the SC problem. They adopted the improved NMF algorithm of Zhu (2016), Green & Bailey (2023), which generalizes the Lee & Seung (2000) NMF update rule to account for the data measurement error. We expect its performance to be similar to our SC results.

Thirdly, the multiplicative update rule ensures that the estimated values are non-negative provided that the measurements are non-negative². This is less likely to hold for C_ij′ (see Fig. 1 below) due to its larger covariance. Negative measurements may spoil the update rule and result in negative estimated value. To mitigate this problem, in Xu et al. (2023), negative measurements are set to small positive values by hand, and a more refined process is applied in Peng & Yu (2024). In Appendix C, we test the impact of setting the negative measurements to a tiny positive value.

Fig. 1.

Sample of the photo-z angular correlation function between the photo-z bin i′ and j′, C_i^′j^′ (blue) and the cross angular correlation function between the spec-z bin i and photo-z bin j′, C_ij′ (red) to be used for true-z inference. The line and its associated color band are the median and 16 and 84 percentile among 100 mock runs. The whole redshift range [0.6,1.1] is divided into 10 photo-z bins, each of width Δz_p = 0.05. This results in a true-z distribution with resolution of Δz = 0.05. The label i-j represents i′j′ for photo-z correlation function and ij′ for spec-z-photo-z cross correlation function respectively. We only show the odd bin results for clarity.

Finally, we mention that our update rule is also applicable when 𝒥₁ or 𝒥₂ are missing. In particular it can be used for the SC method. We will contrast ours against the results from Xu et al. (2023) below.

Before closing this section, we comment on the solution of the system Eqs. (13) and (14) in the R method. Basically it is the same as the P method, except that R_ij′ is solved without the normalization constraint, viz. Eq. (A.4). From R_ij′, if we assume that there is no bias evolution, then using Eq. (5), we have³

$\begin{matrix} b_{i}^{'} & \approx b^{'} = \sum_{i} R_{i j^{'}}, \end{matrix}$ $\begin{aligned} b_{i}^{\prime }&\approx b^{\prime } = \sum _i R_{ij^{\prime }}, \end{aligned}$ (18)

and it follows that

$\begin{matrix} P_{i j^{'}} & = \frac{R_{i j^{'}}}{\sum_{k} R_{k j^{'}}} . \end{matrix}$ $\begin{aligned} P_{ij^{\prime }}&= \frac{ R_{ij^{\prime }} }{ \sum _k R_{kj^{\prime }} }. \end{aligned}$ (19)

2.2.2. Numerical implementation

Because our method is rooted in the previous works and we shall contrast the old result with ours, it is helpful to review the algorithm of Zhang et al. (2017) and its improvements in Xu et al. (2023). We shall denote this as the “Old NMF” and use it as a benchmark for comparison.

The algorithm starts by initializing P_ij′ randomly as a diagonally dominant matrix. The initialization procedure assumes that the true-z distribution peaks at the photo-z estimate and decreases monotonically on both sides. The initial C′_ii and $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ are obtained from Eqs. (3) or (10) using the initial P_ij′. Before applying the multiplicative update rule, Zhang et al. (2017) found that it was necessary to get a preliminary solution using a fixed-point method for the NMF step to be successful. Using this preliminary solution as the initial trial solution, the NMF update rule is applied until a minimum of the cost function is attained. Zhang et al. (2017) writes the SC model as WH_θ with W = P^T and H_θ = C(θ)P. In each step, the update rule analogous to that in Lee & Seung (2000) is applied to W with H_θ fixed. The P in H_θ is then replaced with the new W, and with P held fixed C(θ) is subsequently solved by the least square solution. To estimate the error bar on P_ij′, Xu et al. (2023) generates 100 sets of P_ij′ and for each set, the measurement C_i^′j^′ is perturbed by adding a Gaussian perturbation obtained by sampling the covariance of C_i′j′. We call this set random runs to distinguish them from the mock realizations. The best fit and the 1σ error are estimated by the median and the half width between the 16 and 84 percentile, respectively.

We shall contrast three different setups: SC, CZ, and the joint inference by SC and CZ, denoted as SC+CZ. For SC and SC+CZ, the default method uses P_ij′ and $C_{ii}'$ $C_{ii}\prime$ (and $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ ) as unknowns (abbreviated as P method), while for CZ, we use R_ij′ as variables (denoted as R method). The reason for these differences is related to the error estimation and we shall comment on it later on. For the P method, we initialize P_ij′ and C′_ii (and $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ ) as in the Old NMF, but we then feed the initial guess to the multiplicative update rule (Eqs. (A.9) and (A.10)) directly because we find that our algorithm no longer requires the intermediate solution from the fixed point method. We take the weighting function to be θⁿ with n = −1. Similar to Xu et al. (2023), we generate 100 random runs to estimate the best fit and 1σ error bar. The best fit and 1σ error bar are estimated by the median and half width of the 68-percentile about the median. In each run, we also perturb the measurement using a Gaussian perturbation derived from the mock covariance.

For the R method, the procedures are similar. To initialize R_ij′ we generate the initial P_ij′ as above, but we find that the result is sensitive to the trial b_i′. Thus we perform a grid search on b_i′ to find the one that gives rise to the minimum 𝒥 and use it as the final solution. We then get P_ij′ from the best fit R_ij′ using Eq. (19).

3. Mock test results

In this section, we apply our algorithm to the mock catalog to test its performance.

3.1. Description of the mock catalog

We shall test and validate our results using mock catalogs. For this purpose, we employ the ICE-COLA mocks (Ferrero et al. 2021), which is a specialized mock catalog tailored for the DES Y3 BAO analyses (Abbott et al. 2022b). A brief overview is provided here, with further details available in Ferrero et al. (2021).

The ICE-COLA mocks are derived from the COLA simulations, built upon the COLA method (Tassev et al. 2013), and executed through the ICE-COLA code (Izard et al. 2016). The COLA method combines the second-order Lagrangian perturbation theory with the particle-mesh simulation technique, ensuring the preservation of accuracy in large-scale modes despite the utilization of coarse simulation time steps. In each simulation, there are 2048³ particles in a cube measuring side length of 1536 Mpc h⁻¹. The comoving simulation is transformed to a lightcone simulation extending up to z ∼ 1.4. The cosmology adopted by the mock catalog follows that in the MICE simulation (Fosalba et al. 2015; Crocce et al. 2015), which is a flat ΛCDM with Ω_m = 0.25, Ω_Λ = 0.75, h = 0.7, and σ₈ = 0.8. Each mock occupies a DES Y3 footprint, covering 4180 deg² in area. We make use of 100 mock catalogs to estimate the covariance. The same set of mocks are also used to estimate the ensemble mean and its associated error.

The mock galaxies are allocated to the ICE-COLA halos through a hybrid method combining Halo Occupation Distribution and Halo Abundance Matching, mirroring the technique outlined in Avila et al. (2018). These mock galaxies resemble the red galaxy sample in DES Y3 (see Carnero et al. 2012 for more details on this sample). The mock galaxies are equipped with realistic photo-z. To do so, a 2D joint probability distribution in the photo-z and spec-z space is constructed using a sample of actual galaxies possessing both types of redshifts. Through sampling this distribution, the candidate mock galaxies are assigned the appropriate photo-z’s.

Because the galaxies have both photo-z and spec-z labels, we can use them to create photo-z and spec-z samples. As in DES Y3, we select galaxies in the photo-z range [0.6,1.1] and divide them into five tomographic bins, each of width 0.1. Our goal is to determine the true-z distribution of the samples in these tomographic bins using clustering-based methods. To model the fact that the spec-z galaxies are generally few and bright, we approximate it by selecting the most massive galaxies from the mock. In this case we expect the clustering amplitude of the spec-z sample to be higher than the intrinsic clustering of the photo-z sample since the linear bias increases with mass. We consider samples containing the most massive p% of the galaxies with p = 5, 1, and 0.5.

When we restrict the sample to the photo-z range [0.6,1.1], a small fraction of the galaxies will possess spec-z values outside this range. This implies that there are reductions in correlation signal from SC and CZ. This problem can be alleviated by considering the redshift range of data, both photo-z and spec-z data, to be sufficiently wide relative to the redshift range of interest. Because the mock data are only available in the photo-z range [0.6,1.1], this is not possible. Here we clean the sample by removing the galaxies with spec-z values outside of the range [0.6,1.1], and this effectively forces the true-z distribution to vanish outside the range [0.6,1.1]. This cleaning cannot be done in real data. In Appendix B, we study the impact of the outlying galaxies on the true-z estimation.

3.2. Clustering measurements

The data for inference are the angular correlation functions. While the bin width of the target photo-z sample is Δz_p = 0.1, to increase the redshift resolution of the true-z distribution, we divided the sample into ten photo-z bins, each with bin width of Δz_p = 0.05. From the true-z inference code, we ended up with ten true-z distributions with a redshift resolution of Δz = 0.05 for these ten photo-z samples. We then combined every two true-z distributions to get the weighted mean for our target photo-z samples, e.g., the first and second true-z distributions are combined to get the one for the first target photo-z sample.

In Fig. 1, we show a sample of the angular correlation function w(θ) to be used in true-z inference. The plot shows the photo-z angular correlation function C_i^′j^′ and the cross angular correlation between the spec-z and photo-z sample C_ij′. In this plot, the spec-z sample consists of the most massive 1% galaxies. We have plotted the median and the 1σ band estimated by the 16 and 84 percentile among 100 mock runs. We only show the results for i-j with i and j being odd for clarity.

These angular correlation function measurements are performed using CUTE (Alonso 2012) with the grid method. We compute them in the angular range of [0.2, 5]° with linear binning of width 0.2°.

3.3. Inference of the true-z distribution

We show the results on the inference of the true-z distribution in Fig. 2. We compare the results obtained with SC, CZ, and SC+CZ against the true-z distribution measurement from the mock catalog. Here we show the results from a single mock. They are produced using the fiducial setup, and the spec-z sample is the most massive 1%.

Fig. 2.

True-z distribution inferred by the clustering-based estimators [CZ (red), SC (orange), and SC+CZ (blue)] are compared with the direct mock measurement (green bars). The clustering-based estimator data points are offset horizontally for clarity. The results for five tomographic bins are shown (from top to bottom).

In order to quantify the accuracy of the results, we use all 100 mocks and consider the metric $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |{\hat{P}}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the mock measurement P_ij′^true and the estimated result ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . We show this in the upper panel of Fig. 3. The line corresponds to the median and the color band demarcates the 16 and 84 percentile lines among 100 mock runs.

Fig. 3.

Comparison of the accuracy of the true-z distribution inferred using CZ (red), SC (orange), and SC+CZ (blue). Upper panel: $\sum_{i} | {\hat{P}}_{ij}' - P_{i j'}^{true} |$ $\sum_{i} |{\hat{P}}_{ij}{\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ ${\hat{P}}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_{i} |{\hat{P}}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. The line and color band represent the median and the 16 and 84 percentile among 100 mock runs. The results for five tomographic bins are shown.

We find that for the first two bins, SC and CZ results are similar with SC marginally better, and the accuracy of CZ remains good for the high z bins while SC deteriorates. In reality, the number of spec-z galaxies may plummet faster than the constant fraction selection assumed here, and thus the performance of CZ in the high z bins may not be as good as the case shown here. The combination SC+CZ achieves the minimal error, and performs better than SC or CZ alone.

To go on to check the accuracy of the error estimate on P_ij′, σ_{P_ij′}, we plot the absolute difference normalized by the estimated error, $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ in the lower panel of Fig. 3. A rough estimate is that the difference should be of order σ_{P_ij′}, and so the sum is ∼5. We find that the result are generally larger than this simple estimate by a factor of ∼1.5, and it gets larger for the last bin. The large fluctuation of the 1σ band especially for the last bin is caused by the occasionally too small error bar estimation in the tail of true-z distribution.

Table 2.

Bias in the mean redshift e_z obtained with various methods.

To determine the true-z distribution, all the cross bin clustering information is used; thus the information is not localized to a particular bin and it is hard to get a “simple” explanation of the trend seen in Fig. 3. For CZ, all the spec-z sample is used to cross correlate with a photo-z sample. Thus we argue that the photo-z sample property could give a better estimate of the quality of the CZ inference. After cleaning, the number of photo-z galaxies in the photo-z bins are 1228413, 1576759, 1683208, 1266897, and 708319 respectively. The effective bias parameters of the photo-z sample in the tomographic bins are similar (about 1.1), but at both ends the bias is ∼1.3. The rise at both ends is caused by the cleaning process, which cuts off the true-z distribution and hence increases the clustering amplitude (Chan et al. 2024). Nonetheless the biases in the true-z bins, b_i′ are quite constant (see Fig. 6). Hence the number of photo-z galaxies in the tomographic bins can qualitatively explain the trend of CZ in Fig. 3. The trend of the SC is even harder to interpret as it relies on the auto and cross clustering information all the bins.

The bias in the mean of the true-z distribution is commonly used as an indicator for the accuracy of the characterization of the true-z distribution⁴. In Table 2, we show bias in the mean redshift:

$\begin{matrix} e_{z} = \frac{{\bar{z}}_{\hat{n}} - {\bar{z}}_{n_{true}}}{1 + {\bar{z}}_{n_{true}}}, \end{matrix}$ $\begin{aligned} e_z = \frac{ \bar{z}_{ \hat{n} } - \bar{z}_{n_{ \mathrm {true} }} }{ 1 + \bar{z}_{n_{ \mathrm {true} }} }, \end{aligned}$ (20)

where ${\bar{z}}_{\hat{n}}$ $\bar{z}_{ \hat{n} }$ ( ${\bar{z}}_{n_{true}}$ $\bar{z}_{n_{ \rm true }}$ ) denotes the mean redshift computed using the true-z distribution from the estimator (direct measurement).

For SC the biases are about 0.5%, and are positive for all the tomographic bins. The CZ results show larger variation and the bias goes from positive to negative as the redshift increases. The patterns for the three mass samples are similar. Although the first bin is relatively inaccurate, the others are good. In fact, the bias in the third bin is the smallest among all the entries. But this could be a particularity of the mock as the bias remains 0.05% even for the 0.5% sample. In contrast, the SC+CZ results are stable across the tomographic bins with bias about 0.3% for most of the entries. Although SC+CZ bias values are systematically lower than SC in all the tomographic bins, somewhat surprisingly they are larger than CZ for the last three bins. Nonetheless, if we take the bias of all the bins into account by adding up the absolute value of the biases in all five bins, SC+CZ gives a smaller total bias. For example, for the 1% sample, the total bias is reduced by about 31% and 48% relative to the CZ and SC bias, respectively. It seems that adding SC to CZ spoils some of the “accidentally” accurate bin results for CZ, but it makes the overall results more stable. Here we avoid overinterpreting the results, and leave it to future mock tests to settle the some of the subtle trends.

3.4. Comparison of different implementations

In this subsection, we compare the results obtained with different implementations. Using SC as an example, in Fig. 4, we compare the true-z distribution obtained with different algorithms for a single mock. In Fig. 5, we plot the absolute error and the normalized absolute error of these estimators computed with the 100-mock ensemble.

Fig. 4.

Comparison of the true-z distribution obtained by different implementations of the SC method with the direct measurement on the mock (green bars). The results obtained using the R method (red), P method (orange), and the Old NMF (blue) are compared. Because the Old NMF is not stable when it is run with resolution Δz = 0.05, we can only produce the results with Δz = 0.1.

Fig. 5.

Comparison of the accuracy of the estimated true-z distribution obtained with different SC algorithms. Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the estimated one, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. In both panels, the results from R method (red), P method (orange), and Old NMF (blue) are compared. The line and color band represent the median and the 16 and 84 percentiles among 100 mock runs. The results for five tomographic bins are shown.

Recall that for SC, our fiducial method is the P method, for which both P_ij′ and $C_{ii}'$ $C_{ii}\prime$ are the fitting parameters. Alternatively, in the R method, the only unknown is R_ij′. For the best fit value, P and R method give pretty similar results, but R method tends to give smaller error estimates thanks to less unknowns.

It seems that R method is more constraining, but we find that it gives too small error bar too often in the case of SC+CZ. On the other hand, P method also tends to give too small error bars in the case of CZ. Because CZ is a linear problem, the computation of error bars is straightforward. We find that R method yields error bars consistent with the simple error propagation result. These considerations motivate the adoption of P method for SC and SC+CZ, while R method for CZ.

We further contrast our results against the one obtained with the Old NMF method, for which we use the default setting in Xu et al. (2023). In particular the best fit is estimated using 100 random runs, and the best fit and the 1σ error bar are the median and the half width of the 68 percentile about the median. The stopping criterion is based on minimal χ² although its update rule is still rooted in the old 𝒥. For each random run, a Gaussian perturbation is added to the measurement by sampling the covariance. We note that the covariance adopted here is derived from 100 mocks rather than the jackknife covariance as in Xu et al. (2023). The adoption of the mock covariance here is driven by the observation that there is a significant difference between the mock covariance and the jackknife covariance computed using the method in Xu et al. (2023). By definition, the covariance is estimated by the ensemble mean among realizations; thus the mock covariance is the correct one to use. Furthermore, the negative measurements are set to a small positive value. We tried to run the old NMF with Δz = 0.05, but it is not stable in this case and we have to settle with resolution Δz = 0.1. Song et al. (2024) also found that the old NMF fails to produce the true-z distribution with a fine resolution, and used a coarse resolution of Δz = 0.1 although that did not impair the final BAO measurement there.

Fig. 4 and the top panel of Fig. 5 demonstrate that our improved algorithm results in a much more accurate estimation of the true-z distribution than the Old NMF method. The Old NMF tends to give excessively large error bars in the central region of the true-z distribution, while too small error estimate in the tail. We note that when the jackknife covariance is used instead, the error bars would be smaller.

Table 2 also displays the bias from the old NMF method, and we find that our updated algorithm reduces the bias by more than a factor of 2 in almost all the bins.

3.5. Estimation of the clustering amplitudes

Although our primary target is the true-z distribution P_ij′, the accuracy of the best fit correlation function serves an important cross-check and it reflects the consistency of the algorithm.

In Fig. 6, we plot the galaxy bias parameter of the photo-z sample in the spec-z bin i, b_i′ in 10 spec-z bins. Here we illustrate the results from SC+CZ using the massive 1% sample. The bias b_i′ is estimated by the following means:

Fig. 6.

Galaxy bias parameter of the photo-z sample in the spec-z bin i, b_i′ estimated by various methods. The estimate by $C_{ii}'$ $C_{ii}\prime$ (blue circles), C_ii′^x (orange squares), and R_ij′ (black dashed line with 1σ error band in gray) are compared with the direct measurements (green triangles). The results for 10 spec-z bins are shown. The data points are offset slightly horizontally for clarity. See the text for details.

From the best fit $C_{ii}'$ $C_{ii}\prime$ , we have $b_{i}' = \sqrt{\frac{C_{ii}'}{C_{ii}^{m}}}$ $b_i\prime =\sqrt{ \frac{C_{ii}\prime }{ C_{ii}^{\mathrm{m}} } }$ .
Using the best fit $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ , we get $b_{i}' = \frac{C_{ii}^{x}}{b_{i} C_{ii}^{m}}$ $b_i\prime = \frac{C_{ii}^{\mathrm{x}} }{ b_i C_{ii}^{\mathrm{m}}}$ , where b_i is measured using the auto angular correlation function of the spec-z sample.
Under the universal leakage assumption and no bias evolution approximation, b′≈∑_iR_ij′, which predicts that the bias is constant across all the photo-z bins. If we employ the generalized bias form, b_j′ ≈ ∑_ib_ij′P_ij′, the resultant bias is redshift dependent albeit on index j′. Because the true-z distribution peaks about the photo-z estimate, b_j′ ≈ b_j. In Fig. 6, we have plotted b_j′ as black dashed line.
By dividing the whole photo-z sample into ten spec-z bins, we can measure their correlation functions to directly get b_i′.

Different estimates are in agreement with each other within 10% in most of the bins in the range θ ≲ 1°. The estimate from $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ has larger error bar because it also requires the estimation of b_i from the angular correlation function of the spec-z sample. Moreover, b_i′ from R_ij′ is almost constant across all the spec-z bins. It is remarkable that it is in nice agreement with other estimates except for the last bin given there are a few approximations made.

In principle, to check the universal leakage assumption, for data in each photo-z bin j′, we could divide the sample into 10 true-z bins and measure the bias b_ij′. Under the universal leakage assumption, we anticipate b_ij′ to be independent of j′. However, in creating the mocks, photo-z’s are assigned to galaxies based on the spec-z and photo-z probability distribution only, without the photometry information. Thus the universal leakage assumption is built in during mock construction.

3.6. Optimal weighting for SC and CZ

While σ² takes into account the measurement error of the correlation function, the theoretical uncertainties of the method are not included. Simply adding up 𝒥₁ and 𝒥₂ implicitly assumes that they have equal constraining power and this may not lead to optimal results because SC and CS have different level of degeneracy. A simple way to tackle this issue is to assign different weights to their cost functions. We consider two scenarios: (i) the number of spec-z bins with spec-z data is equal to the number of photo-z bins or (ii) the number of spec-z bins with spec-z data is less than the number of photo-z bins. The second setup is particularly interesting because the lack of spec-z data at high z limits the application of the clustering-based method to the high z regime.

3.6.1. Full spec-z data

Here we focus on the scenario when the number of spec-z bin is the same as the number of photo-z bins. For this full spec-z bin data case, we consider the joint cost function:

$\begin{matrix} J = α J_{1} + J_{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} = \alpha \mathcal{J} _1 + \mathcal{J} _2 , \end{aligned}$ (21)

where 𝒥₁ and 𝒥₂ are given by Eqs. (16) and (17) respectively, and the weight α adjusts their relative importance.

In Fig. 7, we show the absolute error of the estimated distribution. We note that we have summed over both indices i and j′. We have shown the results for three spec-z samples, which respectively consist of the most massive 5%, 1%, and 0.5% of the galaxies. The best fit and the 1σ error band are derived from 100 mocks. For α ≳ 100, we see that the curves are convergent because the information is dictated by SC in this limit. On the other hand, in the low α limit, the result is determined by CZ. As expected, if the fraction of galaxies in the spec-z sample is higher, the signal-to-noise of the cross correlation function is higher, and hence the absolute error is smaller.

Fig. 7.

Accuracy of the true-z estimation characterized by $\sum_{i j'} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_{ij\prime} | \hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , is plotted as a function of α, which measures the relative importance between SC and CZ. The line and the associated color band represent the median and the 16 and 84 percentiles among 100 mocks. We have shown the results for three spec-z sample consisting of the most massive 5%, 1%, and 0.5% galaxies in the spec-z sample. The black dashed line indicates the fiducial value α = 1.

In the intermediate range, for the 0.5% sample, there is a dip roughly around α ∼ 0.7, implying that this weight optimally combines the information. As the mass fraction increases, dip becomes much broader and shallower but the dip position remains nearly unchanged with very mild shift towards a smaller α. These indicate that the SC information helps little and the signal is largely dominated by CZ. We note that the from 0.5% to 1%, the optimal α seems to increase, but this is not expected and should be interpreted as statistical fluctuations. For the 1% spec-z sample, the optimal combination of SC+CZ reduces the total error by 20% relative to CZ and 40% relative to SC.

3.6.2. Missing spec-z data

Often the spec-z data are only available in the relatively low redshift range. Thus it is of interest to consider the situation in which the low z part of the true-z distribution is simultaneously constrained by both SC and CZ, while the high z part relies solely on SC.

For notational convenience, let S be the set of all the bins of interest, {1, 2, …, N}, T be the set of the bins with spec-z data available, {1, 2, ,…,N_s} with N_s ≤ N, and S ∖ T be the set of bins without spec-z info, {N_s + 1, …, N}. T⨂T denotes the tuple {(i, j)| i ∈ Tandj ∈ T}, and its complement (T⨂T)^C represents {(i, j)| i ∈ S ∖ Torj ∈ S ∖ T}. With these notations defined, the total cost function with two weight adjustment factors α₁ and α₂ is written as

$\begin{matrix} J = J_{1} + J_{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} = \mathcal{J} _{1}+\mathcal{J} _{2}, \end{aligned}$ (22)

where 𝒥₁ is given by

$\begin{matrix} J_{1} & = \frac{α_{1}}{2} \sum_{(i^{^{'}}, j^{^{'}}) \in T ⨂ T} \sum_{μ} F_{i^{'} j^{'} μ} \\ + \frac{α_{2}}{2} \sum_{(i^{^{'}}, j^{^{'}}) \in {(T ⨂ T)}^{C}} \sum_{μ} F_{i^{'} j^{'} μ}, \end{matrix}$ $\begin{aligned} \mathcal{J} _{1}&= \frac{\alpha _1}{2} \sum _{ (i^{^{\prime }}, j^{^{\prime }}) \in T \bigotimes T } \sum _\mu F_{i^{\prime }j^{\prime }\mu } \nonumber \\&+ \frac{ \alpha _2 }{2} \sum _{ (i^{^{\prime }}, j^{^{\prime }} ) \in (T \bigotimes T)^C } \sum _\mu F_{i^{\prime }j^{\prime }\mu }, \end{aligned}$ (23)

with F_i′j′μ denoting

$\begin{matrix} F_{i^{'} j^{'} μ} = \frac{W_{1} (θ_{μ})}{σ_{i^{'} j^{'}}^{2} (θ_{μ})} [D_{i^{^{'}} j^{^{'}}} (θ_{μ}) - \sum_{k} P_{k i^{^{'}}} P_{k j^{^{'}}} C_{kk}^{'} (θ_{μ})]^{2}, \end{matrix}$ $\begin{aligned} F_{i^{\prime }j^{\prime }\mu } = \frac{ W_1(\theta _\mu ) }{ \sigma ^2_{i^{\prime } j^{\prime }}(\theta _\mu ) } \bigg [ D_{i^{^{\prime }}j^{^{\prime }}}(\theta _\mu ) - \sum _{k}P_{ki^{^{\prime }}}P_{kj^{^{\prime }}}C_{kk}^{\prime }( \theta _\mu ) \bigg ]^{2}, \end{aligned}$ (24)

and 𝒥₂ by

$\begin{matrix} J_{2} = \frac{1}{2} \sum_{(i, j^{^{'}}) \in T ⨂ T} \sum_{μ} \frac{W_{2} (θ_{μ})}{σ_{i j^{'}}^{2} (θ_{μ})} [D_{i j^{^{'}}} (θ_{μ}) - P_{i j^{^{'}}} C_{ii}^{x} (θ_{μ})]^{2} . \end{matrix}$ $\begin{aligned} \mathcal{J} _{2} =\frac{1}{2} \sum _{(i,j^{^{\prime }}) \in T \bigotimes T} \sum _\mu \frac{ W_2(\theta _\mu ) }{ \sigma _{ij^{\prime }}^2 (\theta _\mu ) } \Bigg [ D_{ij^{^{\prime }}}(\theta _\mu )-P_{ij^{^{\prime }}}C_{ii}^\mathrm{x}(\theta _\mu ) \Bigg ]^{2} . \end{aligned}$ (25)

In words, the weight α₁ is to adjust the importance of the SC part with respect to the CZ counterpart, while the additional weight α₂ in 𝒥₁ allows for the possibility to leverage the weight of the SC contribution from the S ∖ T bins to compensate the missing CZ contribution.

In the left panel of Fig. 8, we show the results when the ninth and tenth spec-z bin data are missing. We contrasted the cases with α₁ fixed to be 0.1, 1, and 10, corresponding to the CZ information being dominant, similar to, and subdominant to the matching SC bins. The accuracy of the inference is presented as a function of α₂. In addition, we have compared the results from the spec-z samples consisting of the most massive 5%, 1%, and 0.5% of the galaxies.

Fig. 8.

Accuracy of the true-z inference by SC+CZ in the absence of the high z spec-z data. In the left panel, the spec-z data for the last two bins, bins 9 and 10, are missing; In the right panel, the spec-z data from bins 7 to 10 are missing. We have contrasted the cases with α₁ = 0.1, 1, and 10, which controls the importance of the preceding SC bins with CZ counterpart. The total absolute difference is plotted as a function of α₂, which adjusts the weight of the SC bin without CZ counterpart. Shown are the results from spec-z samples consisting of the most massive 5%, 1%, and 0.5%, respectively.

First, for α₁ = 0.1, CZ information is determinant in the first 8 bins. For large α₂, the importance of SC bin 9 and 10 is inflated and becomes dominant so that the overall accuracy decreases. The error is minimized at a trough varying from α₂ ∼ 0.2 to 0.5, depending on the fraction of the spec-z sample. The optimal α₂ is less than unity reflects that the constraining power of the SC in 9th and 10th bin is bin-wisely weaker than CZ in the bins 1 to 8. The accuracy also deteriorates in the limit of small α₂ because the missing bins become less and less constrained in this case. The optimal α₂ for higher mass sample is slightly higher in order to balance the greater CZ constraint from the lower redshift bins. The differences among the three spec-z samples diminish at α₂ ≲ 0.1, and this reflects that the CZ information is saturated and the missing bin is the bottleneck in enhancing the accuracy of the true-z inference.

The overall trend with α₂ is similar for α₁ = 1 and 10, but the shape generally shifts to a larger α₂ to be in line with α₁. For α₁ = 1, the CZ and SC information has comparable weight for the first 8 bins, we find that the minimal error occurs around α₂ ∼ 1 for 0.5% spec-z sample and ∼2 for 5%. The differences among the spec-z samples are less pronounced than α₁ = 0.1 case because larger α₁ value reduces the weight of CZ. Finally, when SC information is dominant in the first 8 bins (α₁ = 10), the optimal α₂ moves to ∼9 because the constraining power of SC bins are roughly similar.

It is worth summarizing the roles of the weight factors. α₁ adjusts the weight of the SC bins relative to their CZ counterpart. The SC and CZ part compete with each other and they have equal weights when α₁ = 1. α₂ controls the weight of the SC part without CZ information. Through the above exercise, we see that α₂ is in the same order of magnitude as the dominating part of the lower redshift bins.

In addition, we present the missing spec-z data in bin 7-10 case in the right panel of Fig. 8. They are qualitatively similar to the no bin 9 and 10 case. In particular, the optimal α₂ is quite similar to the no bin 9 and 10 case. However, in contrast, the minimal error increases in the CZ dominating case (α₁ = 0.1) by a small amount, while the SC dominated case (α₁ = 10) is the least affected. This can be attributed to the fact that the CZ information is more constraining than SC. For the same reason, in the small α₂ limit, we find that the accuracy is generally lower than the no bin 9 and 10 case.

Although the precise weights depend on the relative constraining power of the SC and CZ data, our test suggests that there is generally a set of optimal α₁ and α₂ minimizing the error.

We show the bias in the mean redshift in Table 2 for missing spec-z bin data 9-10 and 7-10. Even with missing spec-z bin data, SC+CZ is stable and the results are similar to the full bin case. Sometimes, the mean value in missing bin case is even lower than the full bin case although the difference is statistically insignificant. Only when the bin 7-10 are missing, we notice systematic increase in bias in the tomographic bin 4 and 5 (by about half σ).

Finally, for completeness, we plot the true-z distribution estimated by various number of spec-z bin data for a single mock in Fig. 9 and the absolute error estimated from 100 mocks in Fig. 10. In these plots, we assume α₁ = 1 and α₂ = 1, which are close to the optimal weights for the 1% sample used here. We have shown the results obtained with full spec-z bins, no spec-z bin 9 and 10, no spec-z bin 7 to 10, and SC only. We indeed see that as the number of spec-z bins decreases, the constraining power weakens. The true-z distribution for lower tomographic bins 1 and 2 are almost unaffected for the missing spec-z bin cases, while the higher tomographic bin ones are more affected. In summary, we have demonstrated that by incorporating the SC information with CZ, we are able to extend the constraint on the true-z distribution to higher redshift where spec-z data are missing, and the constraint is better than SC alone.

Fig. 9.

Comparison of the true-z distribution obtained by different number of spec-z bins with spec-z data available against the direct measurement on the mock (green bars) for a single mock. We have presented the results from full spec-z bins (violet), no spec-z bin 9 and 10 (red), no spec-z bin 7 to 10 (blue), and SC only (orange).

Fig. 10.

The absolute error of the true-z distribution estimated by different amount of spec-z bin data. Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. In both panels, the results from full spec-z data (blue), no spec-z bin 9 and 10 (red), no spec-z bin 7 to 10 (violet), and SC only (orange) are compared. With the reduction of the number of spec-z bin data, the constraining power is weakened. When only the high-z spec-z bins are missing, the impact is mainly localized in the high-z tomographic bins with the low-z bins little affected.

4. Conclusions

Characterization of the true-z distribution is crucial in the science of the wide-field imaging surveys such as weak lensing and BAO. A valuable avenue to calibrate the true-z distribution of a photo-z sample is to make use of the clustering information. Clustering-z (CZ) has been widely used in the cosmological analyses in imaging surveys. However, a limitation of this method is that the spec-z sample is often only available in relatively low redshift regime. The self-calibration (SC) method relying solely on the clustering information of the photometric sample is gaining popularity. In this work we develop a method to jointly constrain the true-z distribution of a photo-z sample using the information of SC and CZ simultaneously. We use the DES Y3 catalog to test our method and find that it can effectively infer the true-z distribution. The codes performing the SC+CZ inference can be downloaded on GitHub⁵.

Previous SC analyses often rely on the non-negative matrix factorization (NMF) algorithm. Such an interpretation becomes a burden when we want to generalize the method to combine SC with CZ. Inspired by the NMF method, we construct multiplicative update rules to directly solve for the true-z distribution. It is worth stressing that we have avoided the NMF interpretation altogether. Our straightforward analysis enables us to easily combine the information of SC with CZ.

Our formalism has improved upon the previous approach by taking the weighting functions into account. We have included the inverse error weighting and an additional weighting function. The inverse error weighting allows us to downweight the impact of the poor measurements and upweight the good ones. The additional weighting function give the freedom to put more weight on the constraining part of the correlation function, and we take it to be the form θ⁻ⁿ with n = 1. We demonstrate that our improved algorithm results in a much more accurate estimation of the true-z distribution compared to the previous method in the case of SC (Figs. 4 and 5). Moreover, our algorithm gives more stable results and allows us to have a true-z distribution with higher resolution. This bias in the mean redshift is reduced by more than a factor of two.

We employ our algorithm to show that SC+CZ improves the constraint relative to using SC or CZ alone (Figs. 2 and 3). SC+CZ gives a stable bias on the mean redshift, keeping it at a level of ∼0.3%, even though it sometimes fairs worst than CZ. The clustering-based methods also yield estimates of the intrinsic clustering amplitude. We find that various clustering measurements are consistent with each other (Fig. 6). To optimize the constraining power of SC and CZ data, we assign extra weight factors to the SC cost function. We consider the scenario with the number of spec-z bins with spec-z data equal to the number of photo-z bins and the more interesting one with the spec-z bin data only available in the low z bins. We find that there generally exist some weights that minimize the total error (Figs. 7 and 8). The precise weights depend on the constraining power of the SC and CZ data, and detailed mock tests are required to locate the optimal values. To highlight the improvement, we quote the 1% spec-z sample result with full spec-z bins here. In this case we find that the optimal combination reduces the total error by 20% and 40% compared to using respectively CZ and SC only. The test with spec-z data restricted to the low z range demonstrates that by incorporating with the SC information, we can extend the utility of the clustering-based method to higher redshift.

After successfully demonstrating the power of the SC+CZ method via mock catalogs, it is desirable to apply it to real data, such as the DES Y6 BAO data (Abbott et al. 2024; Mena-Fernández et al. 2024). For this dataset, only the portion of data with z < 1 is calibrated with CZ, while the data in the range 1 < z < 1.2 have to be calibrated with a spec-z sample from a small sky area. A few caveats need to keep in mind when applying the method to real data. In this work we have used Δz = 0.05, but this is still larger than the redshift bin width often taken in CZ analysis (e.g. Δz = 0.03). We have also assumed no bias evolution, at least in the CZ case. We leave it to future work to test the impact of these setups, and study possible improvements. We have not considered the magnification bias in this paper because the redshift range of the data is still small and the mock does not include this effect. For larger redshift extent, magnification bias correction must be made. Furthermore, the photometric data are more prone to observational systematics, which must be treated before SC. Compared to the applications to the clustering samples such as the lens sample in weak lensing and the BAO sample mentioned above, the application of the SC method to the source sample in weak lensing is more challenging simply because it is not constructed for clustering analysis. The source sample is often inhomogeneous and the random catalog is likely missing in standard analysis, so it must be built with great care. Moreover, the systematics mitigation is designed for the shear measurements, and so it remains to show that the existing mitigation efforts are sufficient or additional clustering weight needs to be created. Nevertheless, our work paves the way to calibrate the true-z distribution in high redshift using the clustering information.

¹

This is actually the χ² with the regularization by the weight function W, but here we follow the previous convention (Lee & Seung 2000; Zhang et al. 2017) to call it 𝒥.

²

We mention that the suppressed Gaussian process can also be tuned to return non-negative estimated values (Naidoo et al. 2023).

³

In Eq. (18) the indices on both sides do not match. It will look more pleasant if we employ the general bias form b_ij′.

⁴

For example, the target bias level of Euclid is less than 0.002(1 + z) (Laureijs et al. 2011) and Rubin aims for bias less than 0.003(1 + z) (Ivezić et al. 2019).

⁵

https://github.com/kcc274/SCplusCZ

Acknowledgments

We thank the anonymous referee for his/her insightful comments that improve the presentation of the manuscript. WZ, KCC, and RS are supported by the National Science Foundation of China under the grant number 12273121 and the science research grants from the China Manned Space Project with NO.CMS-CSST-2021-B01. HX is supported by the National SKA Program of China (grant No. 2020SKA0110100), the National Natural Science Foundation of China (Nos. 11922305, 11833005) and the science research grants from the China Manned Space Project with NOs. CMS-CSST-2021-A02. LZ is supported by National SKA Program of China (2020SKA0110401, 2020SKA0110402, 2020SKA0110100), the National Key R&D Program of China (2020YFC2201600), the China Manned Space Project with No. CMS-CSST-2021 (A02, A03), and Guangdong Basic and Applied Basic Research Foundation (2024A1515012309).

References

Abbott, T. M. C., Abdalla, F. B., Alarcon, A., et al. 2018, Phys. Rev. D, 98, 043526 [NASA ADS] [CrossRef] [Google Scholar]
Abbott, T., Abdalla, F. B., Alarcon, A., et al. 2019, MNRAS, 483, 4866 [NASA ADS] [CrossRef] [Google Scholar]
Abbott, T., Aguena, M., Alarcon, A., et al. 2022a, Phys. Rev. D, 105, 023520 [NASA ADS] [CrossRef] [Google Scholar]
Abbott, T. M. C., Aguena, M., Allam, S., et al. 2022b, Phys. Rev. D, 105, 043512 [CrossRef] [Google Scholar]
Abbott, T. M. C., Adamow, M., Aguena, M., et al. 2024, Phys. Rev. D, 110, 063515 [NASA ADS] [CrossRef] [Google Scholar]
Alonso, D. 2012, arXiv e-prints [arXiv:1210.1833] [Google Scholar]
Amon, A., Gruen, D., Troxel, M. A., et al. 2022, Phys. Rev. D, 105, 023514 [NASA ADS] [CrossRef] [Google Scholar]
Arnouts, S., Cristiani, S., Moscardini, L., et al. 1999, MNRAS, 310, 540 [Google Scholar]
Asgari, M., Lin, C.-A., Joachimi, B., et al. 2021, A&A, 645, A104 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Avila, S., Crocce, M., Ross, A. J., et al. 2018, MNRAS, 479, 94 [NASA ADS] [CrossRef] [Google Scholar]
Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [Google Scholar]
Benítez, N. 2000, ApJ, 536, 571 [Google Scholar]
Benjamin, J., van Waerbeke, L., Ménard, B., & Kilbinger, M. 2010, MNRAS, 408, 1168 [NASA ADS] [CrossRef] [Google Scholar]
Benjamin, J., Van Waerbeke, L., Heymans, C., et al. 2013, MNRAS, 431, 1547 [Google Scholar]
Bolzonella, M., Miralles, J. M., & Pelló, R. 2000, A&A, 363, 476 [NASA ADS] [Google Scholar]
Bonnett, C., Troxel, M. A., Hartley, W., et al. 2016, Phys. Rev. D, 94, 042005 [Google Scholar]
Buchs, R., Davis, C., Gruen, D., et al. 2019, MNRAS, 489, 820 [Google Scholar]
Campos, A., Yin, B., Dodelson, S., et al. 2024, arXiv e-prints [arXiv:2408.00922] [Google Scholar]
Carnero, A., Sánchez, E., Crocce, M., Cabré, A., & Gaztañaga, E. 2012, MNRAS, 419, 1689 [NASA ADS] [CrossRef] [Google Scholar]
Carrasco Kind, M., & Brunner, R. J. 2014, MNRAS, 438, 3409 [NASA ADS] [CrossRef] [Google Scholar]
Cawthon, R., Elvin-Poole, J., Porredon, A., et al. 2022, MNRAS, 513, 5517 [NASA ADS] [CrossRef] [Google Scholar]
Chan, K. C., Avila, S., Carnero Rosell, A., et al. 2022, Phys. Rev. D, 106, 123502 [NASA ADS] [CrossRef] [Google Scholar]
Chan, K. C., Lu, G., & Wang, X. 2024, MNRAS, 529, 1667 [NASA ADS] [CrossRef] [Google Scholar]
Choi, S. 2008, in 2008 IEEE International Joint Conference on NeuralNetworks (IEEE World Congress on Computational Intelligence), 1828 [Google Scholar]
Collister, A. A., & Lahav, O. 2004, PASP, 116, 345 [NASA ADS] [CrossRef] [Google Scholar]
Crocce, M., Castander, F. J., Gaztañaga, E., Fosalba, P., & Carretero, J. 2015, MNRAS, 453, 1513 [NASA ADS] [CrossRef] [Google Scholar]
Cunha, C. E., Lima, M., Oyaizu, H., Frieman, J., & Lin, H. 2009, MNRAS, 396, 2379 [Google Scholar]
Dalal, R., Li, X., Nicola, A., et al. 2023, Phys. Rev. D, 108, 123519 [CrossRef] [Google Scholar]
Davis, C., Rozo, E., Roodman, A., et al. 2018, MNRAS, 477, 2196 [Google Scholar]
de Simoni, F., Sobreira, F., Carnero, A., et al. 2013, MNRAS, 435, 3017 [NASA ADS] [CrossRef] [Google Scholar]
De Vicente, J., Sánchez, E., & Sevilla-Noarbe, I. 2016, MNRAS, 459, 3078 [NASA ADS] [CrossRef] [Google Scholar]
DESI Collaboration (Adame, A. G., et al.) 2024, arXiv e-prints [arXiv:2404.03000] [Google Scholar]
Eifler, T., Miyatake, H., Krause, E., et al. 2021, MNRAS, 507, 1746 [NASA ADS] [CrossRef] [Google Scholar]
Erben, T., Hildebrandt, H., Lerchster, M., et al. 2009, A&A, 493, 1197 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Estrada, J., Sefusatti, E., & Frieman, J. A. 2009, ApJ, 692, 265 [NASA ADS] [CrossRef] [Google Scholar]
Ferrero, I., Crocce, M., Tutusaus, I., et al. 2021, A&A, 656, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Fosalba, P., Crocce, M., Gaztañaga, E., & Castander, F. J. 2015, MNRAS, 448, 2987 [NASA ADS] [CrossRef] [Google Scholar]
Gatti, M., Vielzeuf, P., Davis, C., et al. 2018, MNRAS, 477, 1664 [Google Scholar]
Gatti, M., Giannini, G., Bernstein, G. M., et al. 2022, MNRAS, 510, 1223 [Google Scholar]
Gong, Y., Liu, X., Cao, Y., et al. 2019, ApJ, 883, 203 [NASA ADS] [CrossRef] [Google Scholar]
Goodfellow, I. J., Bengio, Y., & Courville, A. 2016, Deep Learning (Cambridge, MA, USA: MIT Press) [Google Scholar]
Green, D., & Bailey, S. 2023, IEEE Transactions on Signal Processing, 72, 5187 [Google Scholar]
Heymans, C., Grocutt, E., Heavens, A., et al. 2013, MNRAS, 432, 2433 [Google Scholar]
Heymans, C., Tröster, T., Asgari, M., et al. 2021, A&A, 646, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hikage, C., Oguri, M., Hamana, T., et al. 2019, PASJ, 71, 43 [Google Scholar]
Hildebrandt, H., Viola, M., Heymans, C., et al. 2017, MNRAS, 465, 1454 [Google Scholar]
Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]
Hütsi, G. 2010, MNRAS, 401, 2477 [CrossRef] [Google Scholar]
Ilbert, O., Arnouts, S., McCracken, H. J., et al. 2006, A&A, 457, 841 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]
Izard, A., Crocce, M., & Fosalba, P. 2016, MNRAS, 459, 2327 [NASA ADS] [CrossRef] [Google Scholar]
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, arXiv e-prints [arXiv:1110.3193] [Google Scholar]
Lee, D., & Seung, H. S. 2000, Adv. Neural Inf. Process. Syst., 13 [Google Scholar]
Li, R., Napolitano, N. R., Roy, N., et al. 2022, ApJ, 929, 152 [NASA ADS] [CrossRef] [Google Scholar]
Li, X., Zhang, T., Sugiyama, S., et al. 2023, Phys. Rev. D, 108, 123518 [CrossRef] [Google Scholar]
Lima, M., Cunha, C. E., Oyaizu, H., et al. 2008, MNRAS, 390, 118 [Google Scholar]
Mandelbaum, R., Eifler, T., Hložek, R., et al. 2018, arXiv e-prints [arXiv:1809.01669] [Google Scholar]
Masters, D., Capak, P., Stern, D., et al. 2015, ApJ, 813, 53 [Google Scholar]
Matthews, D. J., & Newman, J. A. 2010, ApJ, 721, 456 [Google Scholar]
McQuinn, M., & White, M. 2013, MNRAS, 433, 2857 [Google Scholar]
Mena-Fernández, J., Rodríguez-Monroy, M., Avila, S., et al. 2024, Phys. Rev. D, 110, 063514 [CrossRef] [Google Scholar]
Ménard, B., Scranton, R., Schmidt, S., et al. 2013, arXiv e-prints [arXiv:1303.4722] [Google Scholar]
Miyatake, H., Sugiyama, S., Takada, M., et al. 2023, Phys. Rev. D, 108, 123517 [NASA ADS] [CrossRef] [Google Scholar]
Morrison, C. B., Hildebrandt, H., Schmidt, S. J., et al. 2017, MNRAS, 467, 3576 [Google Scholar]
Naidoo, K., Johnston, H., Joachimi, B., et al. 2023, A&A, 670, A149 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Newman, J. A. 2008, ApJ, 684, 88 [Google Scholar]
Newman, J. A., & Gruen, D. 2022, ARA&A, 60, 363 [NASA ADS] [CrossRef] [Google Scholar]
Padmanabhan, N., Schlegel, D. J., Seljak, U., et al. 2007, MNRAS, 378, 852 [NASA ADS] [CrossRef] [Google Scholar]
Peng, H., & Yu, Y. 2024, JCAP, 2024, 025 [CrossRef] [Google Scholar]
Peng, H., Xu, H., Zhang, L., Chen, Z., & Yu, Y. 2022, MNRAS, 516, 6210 [Google Scholar]
Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Rau, M. M., Dalal, R., Zhang, T., et al. 2023, MNRAS, 524, 5109 [NASA ADS] [CrossRef] [Google Scholar]
Sadeh, I., Abdalla, F. B., & Lahav, O. 2016, PASP, 128, 104502 [NASA ADS] [CrossRef] [Google Scholar]
Salvato, M., Ilbert, O., & Hoyle, B. 2019, Nat. Astron., 3, 212 [NASA ADS] [CrossRef] [Google Scholar]
Schmidt, S. J., Ménard, B., Scranton, R., Morrison, C., & McBride, C. K. 2013, MNRAS, 431, 3307 [Google Scholar]
Schneider, M., Knox, L., Zhan, H., & Connolly, A. 2006, ApJ, 651, 14 [Google Scholar]
Secco, L. F., Samuroff, S., Krause, E., et al. 2022, Phys. Rev. D, 105, 023515 [NASA ADS] [CrossRef] [Google Scholar]
Seo, H.-J., Ho, S., White, M., et al. 2012, ApJ, 761, 13 [NASA ADS] [CrossRef] [Google Scholar]
Simon, P. 2007, A&A, 473, 711 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Song, R., Chan, K. C., Xu, H., & Zheng, W. 2024, MNRAS, 530, 881 [NASA ADS] [CrossRef] [Google Scholar]
Spergel, D., Gehrels, N., Baltay, C., et al. 2015, arXiv e-prints [arXiv:1503.03757] [Google Scholar]
Sugiyama, S., Miyatake, H., More, S., et al. 2023, Phys. Rev. D, 108, 123521 [NASA ADS] [CrossRef] [Google Scholar]
Tassev, S., Zaldarriaga, M., & Eisenstein, D. J. 2013, JCAP, 2013, 036 [Google Scholar]
Troxel, M. A., MacCrann, N., Zuntz, J., et al. 2018, Phys. Rev. D, 98, 043528 [Google Scholar]
van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wright, A. H., Hildebrandt, H., van den Busch, J. L., & Heymans, C. 2020, A&A, 637, A100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Xu, H., Zhang, P., Peng, H., et al. 2023, MNRAS, 520, 161 [Google Scholar]
Zhan, H. 2011, Sci. Sin. Phys. Mech. Astron., 41, 1441 [NASA ADS] [CrossRef] [Google Scholar]
Zhang, P., Pen, U.-L., & Bernstein, G. 2010, MNRAS, 405, 359 [NASA ADS] [Google Scholar]
Zhang, L., Yu, Y., & Zhang, P. 2017, ApJ, 848, 44 [NASA ADS] [CrossRef] [Google Scholar]
Zhou, R., Newman, J. A., Mao, Y.-Y., et al. 2021, MNRAS, 501, 3309 [NASA ADS] [CrossRef] [Google Scholar]
Zhu, G. 2016, arXiv e-prints [arXiv:1612.06037] [Google Scholar]
Zhu, Z., Yang, Z., & Oja, E. 2013, in 18th conference Scandinavian Conferences on Image Analysis (SCIA 2013) Espoo, Finland, June 17–20, 2013 (Germany: Springer Gabler), 143 [Google Scholar]

Appendix A: Derivation of the multiplicative update rule

In general, a multiplicative update rule is a set of iterative instructions to update a function by multiplying it with some factor to search for the minimum of the cost function. The most influential multiplicative update rule for non-negative matrix factorization (NMF) is presented by Lee & Seung (2000). This multiplicative update rule can be interpreted as a special kind of gradient descent rule with variable step size. A distinguishing feature of the NMF update rule is the non-negativity of the solution. This has been adopted in Zhang et al. (2017) to solve the SC equation. The update rule in Lee & Seung (2000) can be easily derived by splitting the gradient of the cost function into the positive and negative parts (Choi 2008). Inspired by this derivation, we derive multiplicative update rules for the joint inference by SC and CZ.

A.1. P method

In P method, we aim to derive multiplicative rules for P_ij^′, $C_{ii}^{x}$ $C_{ii}^{\mathrm{x}}$ , and C′_ii so that the cost function 𝒥 [Eq. (15)] is minimized. If we take P_ab^′ as the independent variable, we also need to account for the normalization constraint Eq. (5). This can be implemented by introducing the Lagrange multipliers into the cost function as

$\begin{matrix} J = J_{1} + J_{2} - \sum_{j^{^{'}}} λ_{j^{^{'}}} (\sum_{i} P_{i j^{'}} - 1) . \end{matrix}$ $\begin{aligned} \mathcal{J} =\mathcal{J} _{1}+\mathcal{J} _{2}-\sum _{j^{^{\prime }}}\lambda _{j^{^{\prime }}}\big (\sum _{i}P_{ij^{\prime }}-1 \big ). \end{aligned}$ (A.1)

We first computed the derivative with respect to P_ab^′ and then to the Lagrange multipliers.

The derivative of 𝒥₁ with respect to P_ab^′ reads

$\begin{matrix} \frac{\partial J_{1}}{\partial P_{a b^{^{'}}}} = - 2 \sum_{j^{^{'}}, μ} & \frac{W_{1} (θ_{μ})}{σ_{b^{'} j^{'}}^{2} (θ_{μ})} [D_{b^{^{'}} j^{^{'}}} (θ_{μ}) - \sum_{k} P_{k b^{^{'}}} P_{k j^{^{'}}} C_{kk}^{'} (θ_{μ})] \\ \times P_{a j^{^{'}}} C_{aa}^{'} (θ_{μ}), \end{matrix}$ $\begin{aligned} \frac{\partial \mathcal{J} _{1}}{\partial P_{ab^{^{\prime }}}} = -2\sum _{j^{^{\prime }}, \mu }&\frac{ W_1(\theta _\mu ) }{\sigma _{b^{\prime } j^{\prime }}^2( \theta _\mu ) } \Big [ D_{b^{^{\prime }}j^{^{\prime }}}(\theta _\mu )-\sum _{k}P_{kb^{^{\prime }}}P_{kj^{^{\prime }}}C^{\prime }_{kk}(\theta _\mu ) \Big ] \nonumber \\&\times P_{aj^{^{\prime }}}C^{\prime }_{aa} (\theta _\mu ) , \end{aligned}$ (A.2)

and for 𝒥₂, we have

$\begin{matrix} \frac{\partial J_{2}}{\partial P_{a b^{^{'}}}} & = - \sum_{μ} \frac{W_{2} (θ_{μ})}{σ_{a b^{'}}^{2} (θ_{μ})} [D_{a b^{^{'}}} (θ_{μ}) - P_{a b^{^{'}}} C_{aa}^{x} (θ_{μ})] C_{aa}^{x} (θ_{μ}) . \end{matrix}$ $\begin{aligned} \frac{\partial \mathcal{J} _{2}}{\partial P_{ab^{^{\prime }}}}&=- \sum _\mu \frac{ W_2(\theta _\mu ) }{ \sigma _{ab^{\prime }}^2(\theta _\mu ) } [ D_{ab^{^{\prime }}}(\theta _\mu )-P_{ab^{^{\prime }}}C_{aa}^\mathrm{x}(\theta _\mu )]C_{aa}^\mathrm{x}(\theta _\mu ). \end{aligned}$ (A.3)

Without the constraint, following Lee & Seung (2000), the update rule is given by

$\begin{matrix} P_{a b^{^{'}}} {\overset{P}{\leftarrow}}_{a b^{^{'}}} \frac{{[\partial_{P_{a b^{'}}} J]}^{-}}{{[\partial_{P_{a b^{'}}} J]}^{+}}, \end{matrix}$ $\begin{aligned} P_{ab^{^{\prime }}}\xleftarrow P_{ab^{^{\prime }}}\frac{ [\partial _{P_{ab^{\prime }}} \mathcal{J} ]^-}{[\partial _{P_{ab^{\prime }}} \mathcal{J} ]^+}, \end{aligned}$ (A.4)

where [∂_{P_ab′}𝒥]⁻ ([∂_{P_ab′}𝒥]⁺) denotes the unsigned negative (positive) part of the derivative of ∂_{P_ab′}𝒥. We can contrast this with the usual additive gradient descent (Goodfellow et al. 2016). The term [∂_{P_ab′}𝒥]⁻ tends to increase the value of P_ab^′, while [∂_{P_ab′}𝒥]⁺ tends to decrease it, so they play the same role as in the usual gradient descent. When the minimum is reached, ∇𝒥 vanishes and the multiplicative factor becomes unity. Moreover, the factor [∂_{P_ab′}𝒥]⁻/[∂_{P_ab′}𝒥]⁺ is non-negative, and so it does not flip the sign of P_ab^′.

When the normalization constraint is included, we have

$\begin{matrix} \frac{\partial J}{\partial P_{a b^{^{'}}}} & = \frac{\partial J_{1}}{\partial P_{a b^{^{'}}}} + \frac{\partial J_{2}}{\partial P_{a b^{^{'}}}} - λ_{b^{^{'}}} . \end{matrix}$ $\begin{aligned} \frac{\partial \mathcal{J} }{\partial P_{ab^{^{\prime }}}}&=\frac{\partial \mathcal{J} _{1}}{\partial P_{ab^{^{\prime }}}}+\frac{\partial \mathcal{J} _{2}}{\partial P_{ab^{^{\prime }}}}-\lambda _{b^{^{\prime }}}. \end{aligned}$ (A.5)

For the treatment of the Lagrange multiplier part, we follow the procedures in Zhu et al. (2013). We can first establish a preliminary update rule in terms of λ_b^′, and then demand the resultant P_ab^′ to satisfy the normalization constraint. This enabled us to solve for λ_b^′. It is convenient to include a negative sign for λ_b^′ and assume λ_b^′to be positive. However, this assumption does not always hold and so Zhu et al. (2013) consider a “moving term” trick to alleviate this problem.

The preliminary update rule is

$\begin{matrix} P_{a b^{^{'}}} {\overset{P}{\leftarrow}}_{a b^{^{'}}} \frac{{[\partial_{P_{a b^{'}}} J]}^{-} + λ_{b^{^{'}}}}{{[\partial_{P_{a b^{'}}} J]}^{+}} . \end{matrix}$ $\begin{aligned} P_{ab^{^{\prime }}}\xleftarrow P_{ab^{^{\prime }}}\frac{ [\partial _{P_{ab^{\prime }}} \mathcal{J} ]^- +\lambda _{b^{^{\prime }}}}{[\partial _{P_{ab^{\prime }}} \mathcal{J} ]^+}. \end{aligned}$ (A.6)

The new P is also demanded to meet the normalization constraint:

$\begin{matrix} \sum_{a} & P_{a b^{^{'}}} \frac{{[\partial_{P_{a b^{'}}} J]}^{-} + λ_{b^{^{'}}}}{{[\partial_{P_{a b^{'}}} J]}^{+}} = 1 . \end{matrix}$ $\begin{aligned} \sum _{a}&P_{ab^{^{\prime }}}\frac{ [\partial _{P_{ab^{\prime } }} \mathcal{J} ]^- +\lambda _{b^{^{\prime }}}}{[\partial _{P_{ab^{\prime }}} \mathcal{J} ]^+ } = 1 . \end{aligned}$ (A.7)

Solving for λ_b′, the update rule becomes

$\begin{matrix} P_{a b^{^{'}}} {\overset{P}{\leftarrow}}_{a b^{^{'}}} \frac{{[\partial_{P_{a b^{'}}} J]}^{-} \sum_{c} \frac{P_{c b^{^{'}}}}{{[\partial_{P_{c b^{'}}} J]}^{+}} + 1 - \sum_{c} P_{c b^{^{'}}} \frac{{[\partial_{P_{c b^{'}}} J]}^{-}}{{[\partial_{P_{c b^{'}}} J]}^{+}}}{{[\partial_{P_{a b^{'}}} J]}^{+} \sum_{c} \frac{P_{c b^{^{'}}}}{{[\partial_{P_{c b^{'}}} J]}^{+}}} . \end{matrix}$ $\begin{aligned} P_{ab^{^{\prime }}}\xleftarrow P_{ab^{^{\prime }}} \frac{ [\partial _{P_{ab^{\prime }} }\mathcal{J} ]^- \sum _{c}\frac{P_{cb^{^{\prime }}}}{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+ }+1-\sum _{c}P_{cb^{^{\prime }}}\frac{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^- }{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+}}{[\partial _{P_{ab^{\prime }} }\mathcal{J} ]^+\sum _{c}\frac{P_{cb^{^{\prime }}}}{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+ }}. \end{aligned}$ (A.8)

To make the numerator always positive, we applied the “moving trick” to get

$\begin{matrix} P_{a b^{^{'}}} {\overset{P}{\leftarrow}}_{a b^{^{'}}} \frac{{[\partial_{P_{a b^{'}}} J]}^{-} \sum_{c} \frac{P_{c b^{^{'}}}}{{[\partial_{P_{c b^{'}}} J]}^{+}} + 1}{{[\partial_{P_{a b^{'}}} J]}^{+} \sum_{c} \frac{P_{c b^{^{'}}}}{{[\partial_{P_{c b^{'}}} J]}^{+}} + \sum_{c} P_{c b^{^{'}}} \frac{{[\partial_{P_{c b^{'}}} J]}^{-}}{{[\partial_{P_{c b^{'}}} J]}^{+}}} . \end{matrix}$ $\begin{aligned} P_{ab^{^{\prime }}}\xleftarrow P_{ab^{^{\prime }}}\frac{ [\partial _{P_{ab^{\prime }} }\mathcal{J} ]^- \sum _{c}\frac{P_{cb^{^{\prime }}}}{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+ }+1}{[\partial _{P_{ab^{\prime }} }\mathcal{J} ]^+ \sum _{c}\frac{P_{cb^{^{\prime }}}}{ [\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+}+\sum _{c}P_{cb^{^{\prime }}}\frac{[\partial _{P_{cb^{\prime }} }\mathcal{J} ]^-}{[\partial _{P_{cb^{\prime }} }\mathcal{J} ]^+}}. \end{aligned}$ (A.9)

We also update C_aa′ [in Eq. (16)] and $C_{aa}^{x}$ $C_{aa}^{\mathrm{x}}$ [in Eq. (17)] using the multiplicative rule although this is not necessary. Because they are linear parameters, direct analytic minimization is possible as was done in Zhang et al. (2017). The update rule for C_aa′ reads

$\begin{matrix} C_{aa}^{'} (θ) {\overset{C}{\leftarrow}}_{aa}^{'} (θ) \frac{{[\partial_{C_{aa}^{'} (θ)} J]}^{-}}{{[\partial_{C_{aa}^{'} (θ)} J]}^{+}} . \end{matrix}$ $\begin{aligned} C_{aa}^{\prime }(\theta )\xleftarrow C_{aa}^{\prime }(\theta ) \frac{ [\partial _{C^{\prime }_{aa}(\theta ) } \mathcal{J} ]^- }{ [\partial _{C^{\prime }_{aa}(\theta ) } \mathcal{J} ]^+ }. \end{aligned}$ (A.10)

A similar update rule for $C_{aa}^{x}$ $C_{aa}^{\mathrm{x}}$ can be derived.

A.2. R method

In R method, only R_ij′ is the unknown. Explicitly the cost functions read

$\begin{matrix} J_{1} & = \frac{1}{2} \sum_{i^{'}, j^{'}, μ} \frac{W_{1} (θ_{μ})}{σ_{i^{'} j^{'}}^{2} (θ_{μ})} [D_{i^{^{'}} j^{^{'}}} (θ_{μ}) - \sum_{k} R_{k i^{'}} R_{k j^{'}} C_{kk}^{m} (θ_{μ})]^{2}, \end{matrix}$ $\begin{aligned} \mathcal{J} _{1}&=\frac{1}{2} \sum _{i^{\prime },j^{\prime }, \mu } \frac{ W_1(\theta _\mu ) }{ \sigma _{i^{\prime }j^{\prime }}^2 (\theta _\mu ) } \Big [ D_{i^{^{\prime }}j^{^{\prime }}}(\theta _\mu ) - \sum _k R_{ki^{\prime }} R_{kj^{\prime }} C_{kk}^\mathrm{m} (\theta _\mu ) \Big ]^{2} , \end{aligned}$ (A.11)

$\begin{matrix} J_{2} & = \frac{1}{2} \sum_{i, j^{'}, μ} \frac{W_{2} (θ_{μ})}{σ_{i j^{'}}^{2} (θ_{μ})} [D_{i j^{^{'}}} (θ_{μ}) - R_{i j^{'}} b_{i} C_{ii}^{m} (θ_{μ})]^{2} . \end{matrix}$ $\begin{aligned} \mathcal{J} _{2}&=\frac{1}{2} \sum _{i,j^{\prime }, \mu } \frac{ W_2(\theta _\mu ) }{ \sigma _{ij^{\prime }}^2 (\theta _\mu ) } \Big [ D_{ij^{^{\prime }}}(\theta _\mu ) - R_{ij^{\prime }} b_i C_{ii}^\mathrm{m} (\theta _\mu ) \Big ]^{2} . \end{aligned}$ (A.12)

The derivative of the cost functions with respect to R_ab′ are given by

$\begin{matrix} \frac{\partial J_{1}}{\partial R_{a b^{'}}} = - 2 \sum_{j^{'}, μ} & \frac{W_{1} (θ_{μ})}{σ_{b^{'} j^{'}}^{2} (θ_{μ})} [D_{b^{'} j^{'}} (θ_{μ}) - \sum_{k} R_{k b^{'}} R_{k j^{'}} C_{kk}^{m} (θ_{μ})] \\ \times R_{a j^{'}} C_{aa}^{m} (θ_{μ}), \end{matrix}$ $\begin{aligned} \frac{\partial \mathcal{J} _{1}}{\partial R_{ab^{\prime }}} = -2\sum _{j^{\prime }, \mu }&\frac{ W_1(\theta _\mu ) }{\sigma _{b^{\prime } j^{\prime }}^2( \theta _\mu ) } \Big [ D_{b^{\prime }j^{\prime }}(\theta _\mu )- \sum _{k}R_{kb^{\prime }}R_{kj^{\prime }}C^\mathrm{m}_{kk}(\theta _\mu ) \Big ] \nonumber \\&\times R_{aj^{\prime }}C^\mathrm{m}_{aa} (\theta _\mu ) , \end{aligned}$ (A.13)

and

$\begin{matrix} \frac{\partial J_{2}}{\partial R_{a b^{'}}} & = - \sum_{μ} \frac{W_{2} (θ_{μ})}{σ_{a b^{'}}^{2} (θ_{μ})} [D_{a b^{'}} (θ_{μ}) - b_{a} R_{a b^{'}} C_{aa}^{m} (θ_{μ})] b_{a} C_{aa}^{m} (θ_{μ}) . \end{matrix}$ $\begin{aligned} \frac{\partial \mathcal{J} _{2}}{\partial R_{ab^{\prime }}}&=- \sum _\mu \frac{ W_2(\theta _\mu ) }{ \sigma _{ab^{\prime }}^2(\theta _\mu ) } [ D_{ab^{\prime }}(\theta _\mu )- b_a R_{ab^{\prime }}C_{aa}^\mathrm{m}(\theta _\mu )] b_a C_{aa}^\mathrm{m}(\theta _\mu ). \end{aligned}$ (A.14)

The update rule for R_ab′ can then be constructed:

$\begin{matrix} R_{a b^{^{'}}} {\overset{R}{\leftarrow}}_{a b^{'}} \frac{{[\partial_{R_{a b^{'}}} J]}^{-}}{{[\partial_{R_{a b^{'}}} J]}^{+}} . \end{matrix}$ $\begin{aligned} R_{ab^{^{\prime }}}\xleftarrow R_{ab^{\prime }}\frac{ [\partial _{R_{ab^{\prime }}} \mathcal{J} ]^-}{[\partial _{R_{ab^{\prime }}} \mathcal{J} ]^+}. \end{aligned}$ (A.15)

We note that P_ij′ follows from Eq. (19) if we assume that there is no galaxy bias evolution.

Appendix B: Impact of outlying galaxies

When the photo-z galaxies are present only in the redshift range [0.6, 1.1], the true-z distribution of this sample can extend outside it. In this work, we restrict both photo-z and spec-z range of the galaxies to [0.6,1.1], and this effectively assumes that P_ij′ vanishes outside this range. In practice, however, this kind of cleaning is not possible.

To test the impact of the outlying galaxies on the estimation of the true-z distribution, we apply the algorithm to the raw photo-z sample, which contains the galaxies with spec-z values lying outside the range [0.6,1.1]. When the outlying galaxies are not removed, this additional component does not correlate with the ones inside, and this causes a reduction in the correlation signals in general. We compare the resulting distributions from the raw sample against the ones from the cleaned sample in Fig. B.1. Overall, we find that the impact of the outlying galaxies is pretty small for our sample.

We note that, as expected, the distributions for the boundary bins are more affected, i.e. bin 1 and 5. This is because the outlying spec-z galaxies are mainly located in the photo-z bins near the boundaries. We further note that CZ is more strongly affected than SC, and thus SC+CZ is in between. This can be explained by the fact that in CZ, the correlation is localized in the precise redshift of the spec-z bin, while the information in SC is more distributed, and hence CZ is more affected by the reduction in correlation with the boundary bins.

Fig. B.1.

Test of the impact of the galaxies with spec-z values lying outside the redshift range [0.6,1.1]. The true-z distribution from the raw sample, which contains the outlying spec-z galaxies (unfilled markers), are compared with the cleaned sample results (filled markers). The results from SC (red), CZ (orange), and SC+CZ (blue) are displayed. The direct mock measurements (green bars) are from the cleaned sample.

Appendix C: Impact of setting negative measurements to a tiny positive value

In the main text, we mention that the multiplicative rule implicitly assumes that the measurements are positive. A quick fix is to set the negative measurements to a tiny positive value, for which we take it to be 10⁻⁵. Here we test the impact of such a modification on the true-z inference results.

Fig. C.1 showcases the true-z distribution from a mock. While there are only small fluctuations in the best fit value, without the modification, the estimated error bars from SC and SC+CZ appear to be larger, but the CZ error bars are little affected. We further examine the accuracy of the estimated distribution in Fig. C.2. The central value and the 1σ error bound are estimated from 100 mock catalogs. From the top panel, we see that nulling the negative measurements seems to increase the accuracy of SC and SC+CZ in most of the bins albeit by a statistically insignificant amount, but the modification also causes the SC bin 1 result to degrade. The impact on the estimated error bar is more apparent. The modification homogenizes the range of $| {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $|\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ across tomographic bins.

There seem to be benefits, especially on the error bar estimation, in doing such a modification, and thus we may consider adopting it in future.

Fig. C.1.

Comparison of the true-z distribution obtained by setting the negative measurements to a tiny positive value (empty markers) or not (filled markers). We have presented the results from SC (orange), CZ (red), and SC+CZ (blue).

Fig. C.2.

Absolute error of the true-z distribution estimated using samples processed by setting the negative measurements to a tiny positive value (empty markers) or not (filled markers). Shown are the results from SC (orange), CZ (red), and SC+CZ (blue). Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}.

All Tables

Table 1.

Summary of the meaning of the key notations used in Sect. 2.1.

In the text

Table 2.

Bias in the mean redshift e_z obtained with various methods.

In the text

All Figures

Fig. 1.

Sample of the photo-z angular correlation function between the photo-z bin i′ and j′, C_i^′j^′ (blue) and the cross angular correlation function between the spec-z bin i and photo-z bin j′, C_ij′ (red) to be used for true-z inference. The line and its associated color band are the median and 16 and 84 percentile among 100 mock runs. The whole redshift range [0.6,1.1] is divided into 10 photo-z bins, each of width Δz_p = 0.05. This results in a true-z distribution with resolution of Δz = 0.05. The label i-j represents i′j′ for photo-z correlation function and ij′ for spec-z-photo-z cross correlation function respectively. We only show the odd bin results for clarity.

In the text

	Fig. 2. True-z distribution inferred by the clustering-based estimators [CZ (red), SC (orange), and SC+CZ (blue)] are compared with the direct mock measurement (green bars). The clustering-based estimator data points are offset horizontally for clarity. The results for five tomographic bins are shown (from top to bottom).
In the text

Fig. 3.

Comparison of the accuracy of the true-z distribution inferred using CZ (red), SC (orange), and SC+CZ (blue). Upper panel: $\sum_{i} | {\hat{P}}_{ij}' - P_{i j'}^{true} |$ $\sum_{i} |{\hat{P}}_{ij}{\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ ${\hat{P}}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_{i} |{\hat{P}}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. The line and color band represent the median and the 16 and 84 percentile among 100 mock runs. The results for five tomographic bins are shown.

In the text

	Fig. 4. Comparison of the true-z distribution obtained by different implementations of the SC method with the direct measurement on the mock (green bars). The results obtained using the R method (red), P method (orange), and the Old NMF (blue) are compared. Because the Old NMF is not stable when it is run with resolution Δz = 0.05, we can only produce the results with Δz = 0.1.
In the text

Fig. 5.

Comparison of the accuracy of the estimated true-z distribution obtained with different SC algorithms. Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the estimated one, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. In both panels, the results from R method (red), P method (orange), and Old NMF (blue) are compared. The line and color band represent the median and the 16 and 84 percentiles among 100 mock runs. The results for five tomographic bins are shown.

In the text

Fig. 6.

Galaxy bias parameter of the photo-z sample in the spec-z bin i, b_i′ estimated by various methods. The estimate by $C_{ii}'$ $C_{ii}\prime$ (blue circles), C_ii′^x (orange squares), and R_ij′ (black dashed line with 1σ error band in gray) are compared with the direct measurements (green triangles). The results for 10 spec-z bins are shown. The data points are offset slightly horizontally for clarity. See the text for details.

In the text

Fig. 7.

Accuracy of the true-z estimation characterized by $\sum_{i j'} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_{ij\prime} | \hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , is plotted as a function of α, which measures the relative importance between SC and CZ. The line and the associated color band represent the median and the 16 and 84 percentiles among 100 mocks. We have shown the results for three spec-z sample consisting of the most massive 5%, 1%, and 0.5% galaxies in the spec-z sample. The black dashed line indicates the fiducial value α = 1.

In the text

Fig. 8.

Accuracy of the true-z inference by SC+CZ in the absence of the high z spec-z data. In the left panel, the spec-z data for the last two bins, bins 9 and 10, are missing; In the right panel, the spec-z data from bins 7 to 10 are missing. We have contrasted the cases with α₁ = 0.1, 1, and 10, which controls the importance of the preceding SC bins with CZ counterpart. The total absolute difference is plotted as a function of α₂, which adjusts the weight of the SC bin without CZ counterpart. Shown are the results from spec-z samples consisting of the most massive 5%, 1%, and 0.5%, respectively.

In the text

	Fig. 9. Comparison of the true-z distribution obtained by different number of spec-z bins with spec-z data available against the direct measurement on the mock (green bars) for a single mock. We have presented the results from full spec-z bins (violet), no spec-z bin 9 and 10 (red), no spec-z bin 7 to 10 (blue), and SC only (orange).
In the text

Fig. 10.

The absolute error of the true-z distribution estimated by different amount of spec-z bin data. Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}. In both panels, the results from full spec-z data (blue), no spec-z bin 9 and 10 (red), no spec-z bin 7 to 10 (violet), and SC only (orange) are compared. With the reduction of the number of spec-z bin data, the constraining power is weakened. When only the high-z spec-z bins are missing, the impact is mainly localized in the high-z tomographic bins with the low-z bins little affected.

In the text

Fig. B.1.

Test of the impact of the galaxies with spec-z values lying outside the redshift range [0.6,1.1]. The true-z distribution from the raw sample, which contains the outlying spec-z galaxies (unfilled markers), are compared with the cleaned sample results (filled markers). The results from SC (red), CZ (orange), and SC+CZ (blue) are displayed. The direct mock measurements (green bars) are from the cleaned sample.

In the text

	Fig. C.1. Comparison of the true-z distribution obtained by setting the negative measurements to a tiny positive value (empty markers) or not (filled markers). We have presented the results from SC (orange), CZ (red), and SC+CZ (blue).
In the text

Fig. C.2.

Absolute error of the true-z distribution estimated using samples processed by setting the negative measurements to a tiny positive value (empty markers) or not (filled markers). Shown are the results from SC (orange), CZ (red), and SC+CZ (blue). Upper panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} |$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} |$ , the absolute difference between the true-z distribution measured from the mock, P_ij′^true and the one estimated, ${\hat{P}}_{i j'}$ $\hat{P}_{ij\prime}$ . Lower panel: $\sum_{i} | {\hat{P}}_{i j'} - P_{i j'}^{true} | / σ_{P_{i j'}}$ $\sum_i |\hat{P}_{ij\prime} - P^{\mathrm{true}}_{ij\prime} | / \sigma_{P_{ij\prime}}$ the absolute difference normalized by the estimated error σ_{P_ij′}.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Abbott, T. M. C., Abdalla, F. B., Alarcon, A., et al. 2018, Phys. Rev. D, 98, 043526 [NASA ADS] [CrossRef] [Google Scholar]

[2] Abbott, T., Abdalla, F. B., Alarcon, A., et al. 2019, MNRAS, 483, 4866 [NASA ADS] [CrossRef] [Google Scholar]

[3] Abbott, T., Aguena, M., Alarcon, A., et al. 2022a, Phys. Rev. D, 105, 023520 [NASA ADS] [CrossRef] [Google Scholar]

[4] Abbott, T. M. C., Aguena, M., Allam, S., et al. 2022b, Phys. Rev. D, 105, 043512 [CrossRef] [Google Scholar]

[5] Abbott, T. M. C., Adamow, M., Aguena, M., et al. 2024, Phys. Rev. D, 110, 063515 [NASA ADS] [CrossRef] [Google Scholar]

[6] Alonso, D. 2012, arXiv e-prints [arXiv:1210.1833] [Google Scholar]

[7] Amon, A., Gruen, D., Troxel, M. A., et al. 2022, Phys. Rev. D, 105, 023514 [NASA ADS] [CrossRef] [Google Scholar]

[8] Arnouts, S., Cristiani, S., Moscardini, L., et al. 1999, MNRAS, 310, 540 [Google Scholar]

[9] Asgari, M., Lin, C.-A., Joachimi, B., et al. 2021, A&A, 645, A104 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[10] Avila, S., Crocce, M., Ross, A. J., et al. 2018, MNRAS, 479, 94 [NASA ADS] [CrossRef] [Google Scholar]

[11] Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [Google Scholar]

[12] Benítez, N. 2000, ApJ, 536, 571 [Google Scholar]

[13] Benjamin, J., van Waerbeke, L., Ménard, B., & Kilbinger, M. 2010, MNRAS, 408, 1168 [NASA ADS] [CrossRef] [Google Scholar]

[14] Benjamin, J., Van Waerbeke, L., Heymans, C., et al. 2013, MNRAS, 431, 1547 [Google Scholar]

[15] Bolzonella, M., Miralles, J. M., & Pelló, R. 2000, A&A, 363, 476 [NASA ADS] [Google Scholar]

[16] Bonnett, C., Troxel, M. A., Hartley, W., et al. 2016, Phys. Rev. D, 94, 042005 [Google Scholar]

[17] Buchs, R., Davis, C., Gruen, D., et al. 2019, MNRAS, 489, 820 [Google Scholar]

[18] Campos, A., Yin, B., Dodelson, S., et al. 2024, arXiv e-prints [arXiv:2408.00922] [Google Scholar]

[19] Carnero, A., Sánchez, E., Crocce, M., Cabré, A., & Gaztañaga, E. 2012, MNRAS, 419, 1689 [NASA ADS] [CrossRef] [Google Scholar]

[20] Carrasco Kind, M., & Brunner, R. J. 2014, MNRAS, 438, 3409 [NASA ADS] [CrossRef] [Google Scholar]

[21] Cawthon, R., Elvin-Poole, J., Porredon, A., et al. 2022, MNRAS, 513, 5517 [NASA ADS] [CrossRef] [Google Scholar]

[22] Chan, K. C., Avila, S., Carnero Rosell, A., et al. 2022, Phys. Rev. D, 106, 123502 [NASA ADS] [CrossRef] [Google Scholar]

[23] Chan, K. C., Lu, G., & Wang, X. 2024, MNRAS, 529, 1667 [NASA ADS] [CrossRef] [Google Scholar]

[24] Choi, S. 2008, in 2008 IEEE International Joint Conference on NeuralNetworks (IEEE World Congress on Computational Intelligence), 1828 [Google Scholar]

[25] Collister, A. A., & Lahav, O. 2004, PASP, 116, 345 [NASA ADS] [CrossRef] [Google Scholar]

[26] Crocce, M., Castander, F. J., Gaztañaga, E., Fosalba, P., & Carretero, J. 2015, MNRAS, 453, 1513 [NASA ADS] [CrossRef] [Google Scholar]

[27] Cunha, C. E., Lima, M., Oyaizu, H., Frieman, J., & Lin, H. 2009, MNRAS, 396, 2379 [Google Scholar]

[28] Dalal, R., Li, X., Nicola, A., et al. 2023, Phys. Rev. D, 108, 123519 [CrossRef] [Google Scholar]

[29] Davis, C., Rozo, E., Roodman, A., et al. 2018, MNRAS, 477, 2196 [Google Scholar]

[30] de Simoni, F., Sobreira, F., Carnero, A., et al. 2013, MNRAS, 435, 3017 [NASA ADS] [CrossRef] [Google Scholar]

[31] De Vicente, J., Sánchez, E., & Sevilla-Noarbe, I. 2016, MNRAS, 459, 3078 [NASA ADS] [CrossRef] [Google Scholar]

[32] DESI Collaboration (Adame, A. G., et al.) 2024, arXiv e-prints [arXiv:2404.03000] [Google Scholar]

[33] Eifler, T., Miyatake, H., Krause, E., et al. 2021, MNRAS, 507, 1746 [NASA ADS] [CrossRef] [Google Scholar]

[34] Erben, T., Hildebrandt, H., Lerchster, M., et al. 2009, A&A, 493, 1197 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[35] Estrada, J., Sefusatti, E., & Frieman, J. A. 2009, ApJ, 692, 265 [NASA ADS] [CrossRef] [Google Scholar]

[36] Ferrero, I., Crocce, M., Tutusaus, I., et al. 2021, A&A, 656, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[37] Fosalba, P., Crocce, M., Gaztañaga, E., & Castander, F. J. 2015, MNRAS, 448, 2987 [NASA ADS] [CrossRef] [Google Scholar]

[38] Gatti, M., Vielzeuf, P., Davis, C., et al. 2018, MNRAS, 477, 1664 [Google Scholar]

[39] Gatti, M., Giannini, G., Bernstein, G. M., et al. 2022, MNRAS, 510, 1223 [Google Scholar]

[40] Gong, Y., Liu, X., Cao, Y., et al. 2019, ApJ, 883, 203 [NASA ADS] [CrossRef] [Google Scholar]

[41] Goodfellow, I. J., Bengio, Y., & Courville, A. 2016, Deep Learning (Cambridge, MA, USA: MIT Press) [Google Scholar]

[42] Green, D., & Bailey, S. 2023, IEEE Transactions on Signal Processing, 72, 5187 [Google Scholar]

[43] Heymans, C., Grocutt, E., Heavens, A., et al. 2013, MNRAS, 432, 2433 [Google Scholar]

[44] Heymans, C., Tröster, T., Asgari, M., et al. 2021, A&A, 646, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[45] Hikage, C., Oguri, M., Hamana, T., et al. 2019, PASJ, 71, 43 [Google Scholar]

[46] Hildebrandt, H., Viola, M., Heymans, C., et al. 2017, MNRAS, 465, 1454 [Google Scholar]

[47] Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]

[48] Hütsi, G. 2010, MNRAS, 401, 2477 [CrossRef] [Google Scholar]

[49] Ilbert, O., Arnouts, S., McCracken, H. J., et al. 2006, A&A, 457, 841 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[50] Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]

[51] Izard, A., Crocce, M., & Fosalba, P. 2016, MNRAS, 459, 2327 [NASA ADS] [CrossRef] [Google Scholar]

[52] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, arXiv e-prints [arXiv:1110.3193] [Google Scholar]

[53] Lee, D., & Seung, H. S. 2000, Adv. Neural Inf. Process. Syst., 13 [Google Scholar]

[54] Li, R., Napolitano, N. R., Roy, N., et al. 2022, ApJ, 929, 152 [NASA ADS] [CrossRef] [Google Scholar]

[55] Li, X., Zhang, T., Sugiyama, S., et al. 2023, Phys. Rev. D, 108, 123518 [CrossRef] [Google Scholar]

[56] Lima, M., Cunha, C. E., Oyaizu, H., et al. 2008, MNRAS, 390, 118 [Google Scholar]

[57] Mandelbaum, R., Eifler, T., Hložek, R., et al. 2018, arXiv e-prints [arXiv:1809.01669] [Google Scholar]

[58] Masters, D., Capak, P., Stern, D., et al. 2015, ApJ, 813, 53 [Google Scholar]

[59] Matthews, D. J., & Newman, J. A. 2010, ApJ, 721, 456 [Google Scholar]

[60] McQuinn, M., & White, M. 2013, MNRAS, 433, 2857 [Google Scholar]

[61] Mena-Fernández, J., Rodríguez-Monroy, M., Avila, S., et al. 2024, Phys. Rev. D, 110, 063514 [CrossRef] [Google Scholar]

[62] Ménard, B., Scranton, R., Schmidt, S., et al. 2013, arXiv e-prints [arXiv:1303.4722] [Google Scholar]

[63] Miyatake, H., Sugiyama, S., Takada, M., et al. 2023, Phys. Rev. D, 108, 123517 [NASA ADS] [CrossRef] [Google Scholar]

[64] Morrison, C. B., Hildebrandt, H., Schmidt, S. J., et al. 2017, MNRAS, 467, 3576 [Google Scholar]

[65] Naidoo, K., Johnston, H., Joachimi, B., et al. 2023, A&A, 670, A149 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[66] Newman, J. A. 2008, ApJ, 684, 88 [Google Scholar]

[67] Newman, J. A., & Gruen, D. 2022, ARA&A, 60, 363 [NASA ADS] [CrossRef] [Google Scholar]

[68] Padmanabhan, N., Schlegel, D. J., Seljak, U., et al. 2007, MNRAS, 378, 852 [NASA ADS] [CrossRef] [Google Scholar]

[69] Peng, H., & Yu, Y. 2024, JCAP, 2024, 025 [CrossRef] [Google Scholar]

[70] Peng, H., Xu, H., Zhang, L., Chen, Z., & Yu, Y. 2022, MNRAS, 516, 6210 [Google Scholar]

[71] Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[72] Rau, M. M., Dalal, R., Zhang, T., et al. 2023, MNRAS, 524, 5109 [NASA ADS] [CrossRef] [Google Scholar]

[73] Sadeh, I., Abdalla, F. B., & Lahav, O. 2016, PASP, 128, 104502 [NASA ADS] [CrossRef] [Google Scholar]

[74] Salvato, M., Ilbert, O., & Hoyle, B. 2019, Nat. Astron., 3, 212 [NASA ADS] [CrossRef] [Google Scholar]

[75] Schmidt, S. J., Ménard, B., Scranton, R., Morrison, C., & McBride, C. K. 2013, MNRAS, 431, 3307 [Google Scholar]

[76] Schneider, M., Knox, L., Zhan, H., & Connolly, A. 2006, ApJ, 651, 14 [Google Scholar]

[77] Secco, L. F., Samuroff, S., Krause, E., et al. 2022, Phys. Rev. D, 105, 023515 [NASA ADS] [CrossRef] [Google Scholar]

[78] Seo, H.-J., Ho, S., White, M., et al. 2012, ApJ, 761, 13 [NASA ADS] [CrossRef] [Google Scholar]

[79] Simon, P. 2007, A&A, 473, 711 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[80] Song, R., Chan, K. C., Xu, H., & Zheng, W. 2024, MNRAS, 530, 881 [NASA ADS] [CrossRef] [Google Scholar]

[81] Spergel, D., Gehrels, N., Baltay, C., et al. 2015, arXiv e-prints [arXiv:1503.03757] [Google Scholar]

[82] Sugiyama, S., Miyatake, H., More, S., et al. 2023, Phys. Rev. D, 108, 123521 [NASA ADS] [CrossRef] [Google Scholar]

[83] Tassev, S., Zaldarriaga, M., & Eisenstein, D. J. 2013, JCAP, 2013, 036 [Google Scholar]

[84] Troxel, M. A., MacCrann, N., Zuntz, J., et al. 2018, Phys. Rev. D, 98, 043528 [Google Scholar]

[85] van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[86] Wright, A. H., Hildebrandt, H., van den Busch, J. L., & Heymans, C. 2020, A&A, 637, A100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[87] Xu, H., Zhang, P., Peng, H., et al. 2023, MNRAS, 520, 161 [Google Scholar]

[88] Zhan, H. 2011, Sci. Sin. Phys. Mech. Astron., 41, 1441 [NASA ADS] [CrossRef] [Google Scholar]

[89] Zhang, P., Pen, U.-L., & Bernstein, G. 2010, MNRAS, 405, 359 [NASA ADS] [Google Scholar]

[90] Zhang, L., Yu, Y., & Zhang, P. 2017, ApJ, 848, 44 [NASA ADS] [CrossRef] [Google Scholar]

[91] Zhou, R., Newman, J. A., Mao, Y.-Y., et al. 2021, MNRAS, 501, 3309 [NASA ADS] [CrossRef] [Google Scholar]

[92] Zhu, G. 2016, arXiv e-prints [arXiv:1612.06037] [Google Scholar]

[93] Zhu, Z., Yang, Z., & Oja, E. 2013, in 18th conference Scandinavian Conferences on Image Analysis (SCIA 2013) Espoo, Finland, June 17–20, 2013 (Germany: Springer Gabler), 143 [Google Scholar]