Open Access
Issue
A&A
Volume 693, January 2025
Article Number A102
Number of page(s) 18
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202452053
Published online 07 January 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

The estimation of photometric redshifts is a fundamental task for numerous objectives in cosmology, as it enables the mapping of the Universe’s large-scale structure and the study of the physical properties of galaxies (e.g. Jarrett 2004; Tanaka & Kodama 2004). These redshifts also allow for the exploration of the Universe’s evolution throughout its history, contributing to the validation and improvement of cosmological models, including the study of dark matter and dark energy (Weinberg et al. 2013; Dark Energy Survey and Kilo-Degree Survey Collaboration 2023). Improving the precision of photometric redshifts is crucial for creating comprehensive catalogues, which can in turn enhance the quality and validity of scientific studies based on such redshift data. It also aids in decision-making for observational plans and survey strategies. Achieving high precision in redshifts for a large number of observations and over a significant volume of data sampled simultaneously remains a challenge both for spectroscopic and imaging surveys (Hildebrandt et al. 2010; Salvato et al. 2019).

Spectroscopic surveys are a valuable source of high-precision redshifts; however, their approach is limited by the number of observations that can be carried out within a specific volume, the time needed to acquire high-quality spectra, and the challenges associated with studying bright galaxies at low redshifts and faint galaxies at high redshifts. Additionally, selection based on galaxy colour also contributes to these limitations. All of these constraints make the generation of spectroscopic catalogues in extensive and uniform volumes difficult. Nonetheless, the Dark Energy Spectroscopic Instrument (DESI) has created the largest 3D map of the Universe and measured the rate at which the Universe expanded 8–11 billion years ago with a precision of 1%, providing a powerful way to study dark energy (DESI Collaboration 2024).

However, imaging surveys with several filter bands can provide denser galaxy samples over larger volumes by going deeper than spectroscopic surveys, but with lower precision redshifts. For example, the CFHTLenS survey achieved a precision of ∼0.05 from broadbands (BBs) for galaxies with iAB <  22, measured from the distribution of relative differences between the spectroscopic redshift, zs, and the photometric redshift, zp, expressed as Δz ≡ (zp − zs)/(1 + zs) (Hildebrandt et al. 2012), which is similar to what other surveys have achieved (Hoyle et al. 2018). To enhance the precision in estimating zp, it is necessary to have a higher resolution over the wavelength range of the spectral energy distribution (SED). Several surveys have therefore incorporated medium-band and narrow-band (NB) filters to obtain improved photometric redshifts (Martí et al. 2014a; Molino et al. 2020), such as the Advanced Large Homogeneous Area Medium Band Redshift Survey (ALHAMBRA; Molino et al. 2014), the Javalambre Physics of the Accelerating Universe Survey (J-PAS; Benítez et al. 2014), the High-redshift and Dead Sources Redshift Survey (SHARDS; Barro et al. 2019), the Javalambre Photometric Local Universe Survey (J-PLUS; Cenarro et al. 2019), and the Southern Photometric Local Universe Survey (S-PLUS; De Oliveira et al. 2019), as well as the Physics of the Accelerating Universe Survey (PAUS) used in this work (Padilla et al. 2019).

The PAUS data can be used to obtain high-quality photometric redshift measurements through the combination of data from 40 NB photometric filters with existing, deeper BB photometry. This combination improves precision compared to estimates based solely on BBs (Alarcon et al. 2021), which has a direct impact on the studies carried out with PAUS, such as measuring intrinsic alignments, studying galaxy clustering (Johnston et al. 2021), characterising properties of galaxies (Tortorelli et al. 2021; Renard et al. 2022; Csizi et al. 2024), studying galaxy evolution and formation (Manzoni et al. 2024), investigating cosmic shear (Van Den Busch et al. 2022), and identifying close galaxy pairs (Gonzalez et al. 2023).

Different techniques have been implemented to improve and expand photometric redshift catalogues estimated using the PAUS data (Eriksen et al. 2019; Alarcon et al. 2020). Eriksen et al. (2019) introduced the BCNz2 code, which was specifically designed to handle the combination of 40 narrow filters and BB filters from PAUS and Subaru, obtained as part of the COSMOS-20 survey (Taniguchi et al. 2015). The BCNz2 code fits templates to the observed fluxes and provides zp in the COSMOS field with a high level of precision, which is inferred by comparison to available spectroscopic redshifts. The measured precision is ∼0.0037 for 50% of all galaxies adjudged to have the highest quality photometric redshift (in terms of the quality factor, see Eriksen et al. 2019) with magnitudes iAB <  22.5 and measured redshifts zs <  1.2. Alarcon et al. (2020) extended the development of a hierarchical Bayesian model to estimate redshifts. This method has been tested on realistic simulations, showing that the incorporation of galaxy clustering information improves redshift determinations and reduces systematic redshift uncertainties.

Eriksen et al. (2020) implemented DEEPz, a deep learning code that includes simulations in the initial training phase and is then trained on observational data. As a result, DEEPz reduced the σ68 dispersion statistic by 50% at iAB = 22.5 compared to the BCNz2 code in the COSMOS field.

The aim of this work is to determine zp in the wide fields of PAUS with DEEPz a method that has not been implemented in these fields. This study also investigates the observational and instrumental effects that may vary between photometric and spectroscopic surveys and the wide PAUS fields on the precision of zp estimates up to apparent magnitudes of iAB = 23 and to compare our zp measurements with those presented in Navarro-Gironés et al. (2024) using the BCNz2 method, as both methodologies are affected by different issues. In the case of template methods, such as BCNz2, it is expected that zp catalogues will exhibit significant dispersion and low precision at high redshifts, whereas machine learning-based methods, such as DEEPz, are occasionally affected by the selection of the spectra sample. Additionally, we investigate whether these catalogues can be implemented in studies concerning such topics as the identification of close galaxy pairs.

The article is organised as follows. Section 2 provides an overview of the observational and simulated data used in each field. Section 3 introduces the DEEPz model used for determination of the zp catalogues and outlines the metrics used to evaluate the zp performance. In this section, we also present the description of the various tests conducted to generate accurate photometric redshifts in three observed wide fields. In Sect. 4, we present the obtained zp for each field and the combination of them. We analyse the variation in the precision of zp according to instrumental and observational effects and the spectroscopic sample used in the training of DEEPz. Additionally, we assess the performance of zp in identifying galaxy pairs and compare the results obtained with DEEPz and BCNz2, and we describe the generation of the catalogues in each field. In Sect. 5, we implement zp in the identification of close galaxy pairs. Finally, Sect. 6 provides a summary of the conclusions of this work. Throughout this paper, we adopt a Planck 2015 cosmology (Planck Collaboration XIII 2016) with the following parameters: H0 = 67.3 km s−1 Mpc−1, Ωm = 0.315, and ΩΛ = 0.685.

2. Data

Here, we calculate photometric redshift values in three wide fields observed by PAUS, labelled as W1, W3, and G09 (see Fig. 1). To estimate zp in each field, we used a combination of simulated and observational data. The observational data is comprised of individual exposures, coadded fluxes from 40 NBs of PAUS, and spectroscopic redshifts from various surveys. The specific data used depends on the area under consideration, as set out below.

thumbnail Fig. 1.

Aitoff Galactic projection showing the location of the PAUS fields and the Galactic extinction. The level of extinction is shown in terms of the E(B − V) parameter on a linear scale, as shown by the key. The data are from the dust map provided by the Planck Collaboration (Planck Collaboration Int. XLVIII 2016). The W1, G09, W3, and COSMOS wide fields are shown in green, orange, violet, and pink, respectively.

2.1. PAUS data

The PAUS catalogues have been meticulously crafted by the Port d’Informació Cientifica (PIC) data centre. The catalogues are derived from 40 NB optical images, which were acquired with the PAUCam instrument on the William Herschel Telescope at the Observatorio del Roque de los Muchachos in La Palma, Canary Islands (Padilla et al. 2019; Castander et al. 2012). The data spans the wavelength range from 4500 Å to 8500 Å, with a uniform spacing of 100 Å between contiguous bands, giving an average spectral resolution of R ∼ 65. The target fields covered by PAUS are the COSMOS field, the Canada–France–Hawaii Telescope Legacy Survey CFHTLS fields (W1, W3 and W4), and the KiDS/GAMA G09 field (Heymans et al. 2012; Erben et al. 2013; Tortorelli et al. 2021). PAUS has covered 12.04 deg2 in W1, 15.7 deg2 in G09, 22.64 deg2 in W3, and 1 deg2 in the COSMOS field. PAUS has performed only a few observations in the W4 field, so we have not included this here. Figure 1 shows, in the Galactic Aitoff projection, the location of these four fields, along with the dust distribution provided by Planck (Planck Collaboration Int. XLVIII 2016).

The data reduction and galaxy photometry are obtained through two pipelines, the primary pipeline, referred to as the nightly pipeline, which handles all the image processing, with customised algorithms used for photometric calibration (Castander et al. 2024; Serrano et al. 2023). The multi-epoch and multi-band analysis (MEMBA) pipeline performs forced photometry on a reference catalogue to optimise photometric redshift performance.

2.2. W1 field

The CFHTLS-WIDE observations cover four patches of the sky at high Galactic latitudes, and are made up of pointings of 1° × 1° (Erben et al. 2013). The W1 field has 72 pointings centred on RA = 02h18m00s, Dec = −07d00m00s. In this field, the PAUS coadded fluxes of 40 NBs correspond to the MEMBA production 1015 and are produced using inverse variance weighting of the individual measurements. Some of these sources have several individual exposures in a single band reaching a maximum of 11 observations. However, most of them have only one observation. We used the CFHTLenS data set for the BB in five filters u*griz′ for sources with star_flag =0.

The spectroscopic redshift sample used to train DEEPz in this field is made up of data from 13 catalogues. Approximately 60% of the spectroscopic data used is derived from the VIMOS Public Extragalactic Redshift Survey (VIPERS; Guzzo et al. 2014; Garilli et al. 2014). VIPERS covers a total area of approximately 24 deg2 located within the CFHTLS-Wide W1 and W4 fields and is limited to objects with magnitudes iAB <  22.5 and redshifts in the range of 0.5 <  z <  1.5. The other half of the spectroscopic data, which we will refer to as GAMA-SDSS from now on, comes from various sources, with the Galaxy and Mass Assembly GAMA (Driver et al. 2009, 2011; Baldry et al. 2018) catalogue being one of the most significant contributors. GAMA has obtained redshifts for a quarter of a million galaxies, primarily using the 2dF/AAOmega instrument on the Anglo-Australian Telescope. The final data are described in Liske et al. (2015), with GAMA achieving a high completeness (better than 98%) down to rAB <  19.8 over an area of 180 deg2 in the equatorial G09, G12, and G15 fields (60 deg2 each). The W1 field represents 20% of the total sample. The Sloan Digital Sky Survey Data Release 16 SDSS-DR16 (Ahumada et al. 2020) contributes approximately 13% of the dataset, and the following catalogues provide the remaining information: VANDELS (Garilli et al. 2021), KBSS-MOSFIRE (Steidel et al. 2014), VVDS (Le Fèvre et al. 2013), DES_AAOMEGA (Childress et al. 2017), 3DHST (Brammer et al. 2012), DES_IMACS (Dressler et al. 2011), ZFIRE (Nanayakkara et al. 2016), C3R2 (Masters et al. 2019), and CDB (Sullivan et al. 2010).

2.3. W3 field

The W3 field is centred on RA = 14h17m54s, Dec = +01d19m00s, with 49 pointings covering a total area of approximately 30 deg2. Similar to the W1 field, we utilise information from the 40 PAUS NBs (production 1012), along with the five CFHTLenS BB. Part of their observations are part of the multiple observations of the EGS field, which is centered at RA = 14h17m54s, Dec = +01d19m00s. It has been extensively observed by the All-wavelength Extended Groth Strip International Survey (Davis et al. 2007) and is one of the five fields studied by the Cosmic Assembly Near-IR Deep Extragalactic Legacy Survey (Koekemoer et al. 2011, CANDELS).

In this field, we use a total of seven spectroscopic catalogues to provide the training data. Half of the data comes from SDSS-DR16 (Ahumada et al. 2020). The bulk of the remaining data (40%) is from DEEP2-DR4 (Newman et al. 2013), with a magnitude limit of rAB = 24.1, reaching a z = 1.5. The remaining minor contributions of spectroscopic information come from 3DHST (Brammer et al. 2012), C3R2 (Masters et al. 2019), CDB (Sullivan et al. 2010), and SAGA (Geha et al. 2017).

2.4. G09 field

The PAUS team conducted observations in a limited portion of the CFHTLS-W2 patch, focusing on 33 pointings centred on the coordinates RA 08h54m00s and Dec −04d15m00s. This area is located within one of the four equatorial regions mapped by the GAMA survey, namely, 9 h (G09), which has been documented in various publications (Driver et al. 2011, 2016; Hopkins et al. 2013; Liske et al. 2015). This region is 12 × 4 deg2 and is located within the footprint of the public ESO Kilo-Degree Survey and the VISTA Infrared Kilo-Degree Galaxy Survey, commonly known as KiDS and VIKING, respectively (de Jong et al. 2013, 2015; Edge et al. 2013; Kuijken et al. 2015; Venemans et al. 2015; Bellstedt et al. 2020).

KiDS has obtained optical images over 1500 deg2, captured using OmegaCAM on the VLT Survey Telescope (VST) in the u, g, r, and i filters. Additionally, VIKING has near-infrared coverage through the VISTA telescope in the Z, Y, J, H, and Ks bands. Consequently, galaxies in the G09 field have information in nine BB.

The GAMA survey is the primary source of spectroscopic information in this area, contributing 80% of the total. The remaining spectroscopic information comes from SDSS. The objects in this field that form the training sample reach apparent magnitudes of iAB = 22. A complementary sample called KiDz-COSMOS, extracted from the KiDS DR5 (Wright et al. 2024), has been created, containing information in the nine KiDS and VIKING bands, reaching an apparent magnitude of iAB = 22.5. Most of the spectroscopic information in this sample comes from the extensive redshift study known as zCOSMOS-bright (Lilly et al. 2009), which covers 1.7 deg2 down to iAB = 22.5 and 0.1 <  z <  1.2 and is a sub-sample of G10-COSMOS (Davies et al. 2015).

2.5. Sub-samples

To evaluate the precision of the zp determined by DEEPz, which incorporates supervised machine learning models, we have divided the galaxies with spectroscopic information into three random groups. One of them is used to train the models (the training set), another group is used to evaluate the model performance (the validation set), and a third (the test set) is not used in previous steps, but is reserved to estimate how precise the zp will be for galaxies lacking spectroscopic information.

In Table 1, we show the size of the sets (i.e., the total sample and the training, validation, and test sets). The first four columns are the number of galaxies in each field. The last two columns present the data augmentation, specifically the training, validation, and test samples are combined based on their similar photometric characteristics. This augmentation was applied to the W1 and W3 fields from the CFHTLenS survey, which includes (BB) photometry, as well as to a combined dataset from the G09 and COSMOS fields, which integrates BB information from the KiDS and VIKING surveys.

Table 1.

Sample sizes for each wide field.

2.6. Simulations

The estimation of photometric redshifts by DEEPz uses a combination of observational and simuated data, as explained in the next section (Eriksen et al. 2020). Reproducing the methodology of DEEPz, we have included the code for simulating galaxy spectral energy distributions from flexible stellar population synthesis (FSPS; Conroy et al. 2009; Conroy & Gunn 2010) in the neural network training before the input of observational data. This code uses stellar population synthesis (SPS) models to estimate stellar masses, mean ages, metallicities, and star formation histories for different star formation histories. The simulations are generated using the same parameter ranges described in Sect. 3.2.2 of Eriksen et al. (2020). Therefore, the samples in the W1 and W3 fields are identical to those used in that work, consisting of a million galaxies uniformly selected with redshift information, as well as fluxes in the PAUS NBs and in the BB of the CFHTLenS survey. Regarding the simulation of the same number of galaxies in the G09 field, the same parameter selection is used. However, in this case the FSPS code is extended to include the nine KiDS and VIKING filters (de Jong et al. 2013, 2015; Kuijken et al. 2015; Edge et al. 2013; Venemans et al. 2015), and the redshift range is extended from 1.5 to 2.1.

3. Methodology

The precision of studies conducted using a statistical tool like machine learning depends on several factors: the model, the information available, in particular the amount and quality of the data provided to the model. Among the various factors that influence the performance of the model and, therefore, the determination of photometric redshifts, we have studied in each field the impact of refining the data on the photometric redshifts. In Sect. 3.1, we describe the DEEPz code. Then, in Sect. 3.2, we specify the metrics used to study the quality of photometric redshifts. Finally, in Sect. 3.3, we present the tests of data refinement.

3.1. DEEPz

DEEPz was designed for photometric redshift estimation in PAUS, using NB and BB data. Its effectiveness has been tested in the COSMOS field using photometry from the 40 NBs of PAUS, the BBs of Subaru, and the u-band of CFHTLenS (Eriksen et al. 2020).

The methodology of DEEPz involves implementing an architecture composed of three neural networks: the first two make up an autoencoder and the third is a mixture density network (MDN). Figure 2 shows the network flow diagram. The autoencoder is used to extract features without prior knowledge of the redshift and to improve performance for faint sources. The latent variables encoded by the autoencoder, along with the original input fluxes divided by the target band, are fed into the MDN, where the target band is defined independently for each BB survey. In the case of CFHTLenS, the target band is the i-band, while for KiDS, it is the r-band. The third neural network, the MDN gives the probability distribution of the photometric redshift. The mode of this distribution is zp for a specific galaxy. In datail, the network architecture consists of an autoencoder with ten layers and 250 nodes in both the encoder and decoder. Each layer includes linear transformations followed by ReLU non-linearities, batch normalisation, and a 2% dropout, except for the last three layers. The autoencoder is fed galaxy flux ratios. The zp network, which follows the same structure as the autoencoder, takes both the galaxy flux ratios and autoencoder features as input. It includes 1% dropout after all linear layers. This network is a MDN, representing the redshift distribution as a linear combination of 10 normal distributions.

thumbnail Fig. 2.

DEEPz network flow diagram. The data are shown in grey and the networks in violet. The network architecture consists of an autoencoder with ten layers and 250 nodes in both the encoder and decoder. Each layer includes linear transformations followed by ReLU non-linearities, batch normalisation, and a 2% dropout, except for the last three layers. The autoencoder is fed galaxy flux ratios. The zp network, which follows the same structure as the autoencoder, takes both the galaxy flux ratios and autoencoder features as input. It includes 1% dropout after all linear layers. This network is a MDN, representing the redshift distribution as a linear combination of 10 normaldistributions.

We use pre-training to help adjust the weights of neural networks so that they can later be fine-tuned with observational data. The model training process begins with simulations. Performing this pre-training step before training with observational data reduces the scatter in zp by 50% for faint sources in terms of apparent magnitude. The subsequent training was conducted with observational data, including data augmentation by constructing several co-add fluxes from individual NB exposures. These were generated on-the-fly and weighted inversely by variance, including each exposure with a probability α, which we set to α = 0.8. This caused each galaxy to appear differently to the network in each epoch i.e., each time the network parameters were adjusted, thus mimicking repeated measurements over the same sky areas and following systematic patterns to produce combined measurements with reduced noise and ultimately allowing the observation of fainter objects. This technique has multiple issues when applied to observational data, such as variability in the number of exposures and the need to inform the network about which measurements are present. We compared with Eriksen et al. (2020), we increased the number of layers in the autoencoder from ten to 20 while keeping the number of latent variables unchanged. Additionally, we adapted the input dimensions of the networks to fit the number of bands used. Specifically, in the W1 and W3 fields, the inputs comprise a maximum of 40 NBs and five BBs from CFHTlens, unlike the G09 field, where 40 NBs and nine BBs from KiDS and VIKING are used. Additionally, we studied the removal of eight blue NBs and the use of only NBs, as we outline in the Sect. 3.3.

3.2. Metrics

To assess the precision of zp, we have used the σ68 statistic as the primary metric (see Eq. (1)). This statistic is calculated from Δz. Then

σ 68 P [ 84 ] P [ 16 ] 2 , $$ \begin{aligned} \sigma _{68} \equiv \frac{P[84] - P[16]}{2}, \end{aligned} $$(1)

where P[x] represents the value of the Δz distribution at percentile x. σ68 is a centralised measure of the width or scatter in the accuracy of the estimated redshifts that is not affected by outliers.

In addition to σ68, another commonly used metric in these types of studies is σNMAD, which provides a robust measure of scatter in the presence of outliers. In Appendix A, we show the ratio between σ68 and σNMAD to compare the performance of these metrics.

We formally defined outliers as galaxies that meet the condition

| z p z s | 1 + z s > 0.02 . $$ \begin{aligned} \frac{\left|z_{\rm p} - z_{\rm s}\right|}{1 + z_{\rm s}} > 0.02. \end{aligned} $$(2)

We note that this equation is stricter than the outlier criteria typically used in BB papers (Hildebrandt et al. 2012).

3.3. Data refinement tests

During the reduction process and calibration, the collected observations undergo a series of steps and techniques aimed at improving their quality and completeness, eliminating possible errors in the measurements or defects such as noise or Galactic extinction, and extracting the most relevant information. These procedures involve curation and preprocessing techniques, along with the use of statistical tools such as univariate analysis for the selection of relevant features.

Here, seven tests were conducted to examine whether curation and preprocessing procedures, dependent on observational conditions and instruments, affect the precision of zp. These tests ranged from combining measurements made by two instruments to selecting specific bands. Each test generated diverse datasets, which influenced the training of DEEPz and, consequently, the precision of the photometric redshifts.

Before describing the tests, we set out the baseline sample first

Baseline. As a starting point for these studies, we exclude galaxies without a measurement in all of the bands. Consequently, the samples of galaxies, both with and without spectroscopic information, are reduced by 9%, 11%, and 2% in the W1, W3, and G09 fields, respectively. These samples are used as a reference to evaluate and compare the performance of the tests.

Next we describe each of the tests performed.

Cross-validation (CV). The W1 and W3 fields have measurements classified as missing in the i-band of CFHTLenS due to issues with the filter identified as “i.MP9702” during observation. However, new measurements were obtained using the successor filter, “i.MP902”, and part of the sample was completed with these values. Therefore, we evaluated the impact on the precision of the redshift by combining the measurements from these two filters, compared to the performance obtained without considering “i.MP902” measurements, that is, with the baseline sample. This was done to validate that there is no impact on zp when using one of these two filters.

Inferring missing NB values (non-NaN). The catalogue contains missing observations in some NBs. The DEEPz architecture uses a fixed number of inputs, making the code unable to handle a variable number of NBs. In Eriksen et al. (2020), the application of the code was restricted to galaxies with measurements in all 40 NBs. Imposing this condition results in the loss of 55.22% of the galaxies in the W1 field, around 38.52% in the W3 field, and 57.82% in the G09 field. To reduce the impact on completeness, instead of removing all galaxies, we infer the missing NB values for galaxies with at least 30 NB measurements. In this case, the reduction in the sample size is 30% of the total sample of galaxies with at least one observation in the W1 field, approximately 28% in the W3 field, and 42% in the G09 field. The NB estimate was generated through a quadratic fit using the information from the g, r, and i-bands. This type of study allows us to estimate a more complete galaxy catalogue, including galaxies without observations in all filters.

Low signal-to-noise ratio fluxes. We investigate sources in the CFHTLenS catalogue that have at least one BB with a low signal-to-noise ratio, denoted by a value of 99. To generate a catalogue with a greater number of objects, we examine the impact of including sources under these conditions, even though this may not be the case for all bands, and catalogues are usually generated with a high S/N based on the target band.

Bright galaxies (iAB <  22.5). We study if there is a significant difference in precision when faint galaxies are included or discarded in the analysis. Therefore, we cut the sample of galaxies at an apparent magnitudes of 22.5 according to the target band of the BB surveys corresponding to each wide field.

Galactic extinction correction (Ext. correction). Although the wide fields used in this study are not at low Galactic latitudes, they are weakly and differentially affected by the dust, gas, and stellar density of the Milky Way, though the latter does not affect extinction. Therefore, we performed a correction to the NB and BB to undo the effects of Galactic extinction. Before the correction, we removed the Galactic extinction on the BBs based on the dust maps presented in Schlegel et al. (1998), which already included their apparent magnitudes. Subsequently, the correction was carried out for the BBs and NBs using the extinction values, E(B − V), provided in the Planck 2015 thermal dust map (Planck Collaboration Int. XLVIII 2016).

The Galactic extinction value at the source position and the corresponding correction factors were estimated through

ϕ corr = ϕ uncorr · 1 C 0 · E ( B V ) 2 + C 1 · E ( B V ) + C 2 , $$ \begin{aligned} \phi _{\mathrm{corr}} = \phi _{\mathrm{uncorr}} \cdot \frac{1}{{C_0 \cdot E(B-V)^2 + C_1 \cdot E(B-V) + C_2}}, \end{aligned} $$(3)

where the band-dependent coefficients (C0, C1, and C2) were estimated using a second-degree polynomial fit to the median extinction affecting the Pickles stellar templates (Pickles 1998).

We conducted a univariate analysis of the bands and zp using the Pearson correlation, mutual information, and regression methods (see Appendix B). We found that the BBs behave similarly in score to the NBs in terms of their importance for determining zp according to wavelength and these methods, so it is interesting to study the case of only using the NBs to determine zp. Additionally, we found a lower score for wavelengths in the blue, so we investigated their importance for the precision of zp. The following two tests are based on this study.

Only NBs (NBs). We consider eliminating the BBs and using only the NBs. In this case, the NBs are normalised by creating an artificial band from the average of the NBs within the wavelength range of the target selection band, i.e., the i-band in the case of fields W1 and W3 and the r-band in the case of field G09. This allows us to study whether the more general SED characteristics at low wavelength resolution and a high S/N provided by the BBs are important in the precision of zp. Therefore, as input, DEEPz uses the 40 NB fluxes normalised by the artificial band. This change in the number of bands used to determine zp results in a change in the number of neurons in the input layer of the neural networks.

SED in the blue wavelength range (blue-lines). Since the univariate analysis shows a low correlation for wavelengths in the blue range of the galaxy SED, we explore the possibility of saving time by disregarding the detailed information about the SED in the blue wavelength range, contained in eight filters of PAUS, within the wavelength interval of 4550 Å–5250 Å. Through this test, we evaluate the relevance of lines identified in the blue region with respect to the general SED characteristics within this spectral range when considering all wide bands.

In Table 2, we provide a brief description of each test and specify the number of galaxies in the training and validation sets for each case.

Table 2.

Description of the data refinement test.

4. Results and discussion

In Sect. 4.1, we perform a comprehensive analysis of the results obtained in each wide field and their combination. Additionally, we analyse the outcomes corresponding to the data refinement tests described in Sect. 3.3. In Sect. 4.2 we consider the origin of the spectroscopic redshifts and the effect on the zp. Additionally, in Sect. 4.3 we compare our best results with the zp obtained through the BCNz2 method. Finally, we describe the generation of the zp catalogue in the wide fields in Sect. 4.4.

All measurements are performed in the intervals 18 ≤ iAB ≤ 23 and zs ≤ 2 unless stated otherwise.

4.1. Photometric redshift precision

In our study, the precision of the zp is intrinsically linked to the field under consideration and the data available in each field. This is because the latter are used in the training and validation process, which depends on the fluxes of the NBs and BBs, the number of galaxies in the sets, and the distributions of zs.

Therefore, in this section we present the comparison of the results using the baseline samples of each field and their combinations. In Appendix C, the results of each data refinement test in each field and their combination are shown, implementing the training and validation samples differentiated by these conditions.

4.1.1. Performance by field

Initially, we consider each field individually, as well as the combination of the CFHTLenS fields, W1 with W3, and the KiDS/VIKING fields G09 with COSMOS. The analysis is carried out in terms of the σ68 trends, which are analysed in relation to the i-band magnitude limit and zs.

We compare the precision of the baseline samples in the W1 and W3 fields and the combination W1+W3. We performed the same analysis for the baseline samples in the G09 and COSMOS fields and their combination. The behaviour of these two cases is shown in Fig. 3.

thumbnail Fig. 3.

Trends in the measurement of σ68. The distribution is divided into ten bins, each containing the same number of objects, based on the apparent magnitude in the i-band and zs, in the left and right columns, respectively. These trends show the precision, in terms of the centralised scatter σ68, in the W1+W3 and G09+COSMOS validation sets.

As expected, in all fields, it can be seen that the precision in terms of σ68, as a function of iAB and zs, is higher for bright and nearby galaxies (lower σ68), and decreases for faint and distant galaxies (higher σ68). When comparing the values obtained from each test in each individual field, we find a systematic superiority in the precision of W1 over W3 across all magnitude range and of G09 over COSMOS only for bright galaxies. We also find that the combination of fields with the same NBs and BBs, W1+W3 and G09+COSMOS, produces better results than the individual samples W3 and COSMOS, respectively, in terms of the σ68 trends when studied as a function of iAB and zs, compared to using individual samples from each field. Regarding the comparison of results between the W1+W3 and G09+COSMOS samples, differentiated by the photometry and spectroscopic redshift catalogues, we observe that in all studies, W1+W3 outperforms G09+COSMOS. The G09 and COSMOS fields have four more BBs than the CFHTLenS fields but with lower S/N; this does not generate differences in the trends of the metrics measured as a function of i and zs between G09+COSMOS and the CFHTLenS fields. However, we note that DEEPz determines zp with less precision in G09+COSMOS compared to W1+W3 as in the individual comparisons.

4.1.2. Analysis of the data refinement tests

The comparison between refinement tests is shown in Fig. 4, in which we show the trends of σ68 and the fraction of outliers as a function of apparent magnitudes in the i-band, zp, and zs for sets of models trained with different galaxy samples. Two baseline samples are taken, one combining galaxies from the W1 and W3 fields of CFHTLenS and the other galaxies from the G09 and COSMOS fields of KiDS plus VIKING. These two samples are adapted to the specific conditions of each test, and each model is trained with them. The trends are obtained with the baseline samples, i.e., without any restriction.

thumbnail Fig. 4.

The upper row depicts trends and variations in the measurement of σ68, while the lower row shows the fraction of outlier values. The distribution is divided into ten bins, each containing an equal number of objects, based on the magnitude in the i-band, zp, and zs, positioned in the left, centre, and right columns, respectively. These trends represent the findings from seven tests, summarised in Table 2, implemented on the W1+W3 and G09+COSMOS baseline samples.

We now comment on the test in the same orders as they were presented in Sect. 3.3:

CV. The CV test revealed that galaxies in the W1, W3 fields, and their combination, do not show a degradation in the precision of zp when including measurements from different filters with similar sensitivities in the i-band of the CFHTLenS catalogue. Therefore, the differences between the measurements from two filters are not statistically significant. Consequently, we can obtain more comprehensive catalogues, such that the measurements of both instruments are indistinguishable within the precision limits of the zero point.

Infer missing NBs. Inferring missing data in the NBs increases the sample size by approximately 35% in W1 and W3, and around 20% in G09 and COSMOS. The results suggest that inferring missing data in this way is an effective strategy for improving catalogue completeness without significantly affecting the performance of redshift determination. In fact, this is the test that shows the best performance in both W1+W3 and G09+COSMOS.

Low S/N. It can be seen that sample loss due to low signal-to-noise ratio in the BBs is not significant in any field, accounting for around 1% of the sample. Therefore, the performance in these samples is very similar to that obtained with the baseline samples.

i < 22.5. The analysis of galaxies limited to apparent magnitudes brighter than iAB <  22.5 indicates good precision, as in the other tests, but shows a decrease in precision for a sample of galaxies that include magnitudes close to 23 and at high redshifts, compared to the other data refinement tests. This slight decrease in precision is a little more noticable in the G09 and fields.

Ext.Correction. We find that the Galactic latitude of the fields does not have any effect on the precision of zp provided that the extinction correction in Eq. (3) is applied.

NBs. In the analysis focused on using only the NB in the photometric redshift estimation, in the W1+W3 sample, a reduction in precision is observed with increasing i-band magnitude and redshift, and this more pronounced than in other tests. However, the model performance remains similar to the other cases at low redshifts and for bright galaxies. This suggests that for these two fields, ignoring the BB does not significantly affect the precision in zp for galaxies with i <  20, as there is a high S/N in the NBs. For the G09+COSMOS sample, we find poor performance for both faint and bright galaxies, as well as for close and distant galaxies. This finding underlines the importance of the number of examples with which the network is trained, since the training sample in this field is smaller than that for W1+W3, and the NB fluxes have the same S/N.

Blue-lines. The performance of DEEPz does not show any significant changes when the detailed SED features provided by NBs in the blue wavelength range are omitted, so they may not be necessary to estimate zp in the W1, G09, and W3 fields. Since the model trained with the full NB sample has the best performance compared to other tests in W1+W3 and G09+COSMOS, we decided to adopt this methodology as the main one for determining the photometric redshift. That is, the model will receive as input galaxies with information from 40 NBs + BBs, with the number of the latter depending on the field. In the event that a galaxy has at least ten missing measurements in NBs, a quadratic fit on BBs will be made to infer these measurements.

To improve the completeness of the photometric redshift catalogues, we include the methodology of models trained with galaxies with imputation in the i-band of the CFHTLenS catalogue, i.e., the CV samples and the models trained with galaxies with a low S/N. Since these two models show a similar precision to that obtained with the baseline model in the trends of σ68 and the outlier fraction for weak and distant galaxies in each field or their combination.

4.2. Effect of spectroscopic surveys

When spectroscopic catalogues present non-uniform distributions in zs, contain sources of variable brightness, data affected by contamination by absorption lines or exposure time limitations, the recovered zp will reflect these conditions. Therefore, in our analysis, we also consider the origin of the redshifts in the study, since in each field at least the combination of two spectroscopic catalogues is used, and the ranges and statistics of the distributions of zs of these differ, as illustrated in the upper panel of Fig. 5. We compare the dominant redshift distributions of the baseline sample in the W1 field, VIPERS and GAMA_SDSS. The upper panels show the trends of σ68 with respect to i and zs, respectively.

thumbnail Fig. 5.

The upper panels show trends in the measurement of σ68 with respect to i (left panel) and zs (right panel). There are separate curves for the results using the VIPERS and GAMA_SDSS redshifts. The lower panel shows the spectroscopic redshift distribution from the VIPERS and GAMA_SDSS surveys in the W1 field.

The performance comparison between the VIPERS and GAMA_SDSS samples was carried out with the same number of sources from the baseline sample in the W1 field, taking the largest possible number of examples from each survey, resulting in samples of 1760 galaxies.

The σ68 trends in the W1 area, separated by the VIPERS and GAMA_SDSS samples, show slight differences for faint and distant galaxies. VIPERS has higher precision than GAMA_SDSS. Although both surveys include measurements for this type of galaxy and were used in the model training, the number of examples from each survey is different in these ranges.

Finding this dependency in the spectroscopic survey data indicates that there will be better precision in the photometric redshifts for galaxies with the characteristics of the survey with the majority of examples.

4.3. Comparison between DEEPz and BCNz2

We present the zp of the W1 and G09 baseline test samples obtained with models trained with W1+W3 and G09+COSMOS samples, respectively, along with zp obtained in Navarro-Gironés et al. (2024), which use the method BCNz2.

In Fig. 6, we show the trends of σ68 and the fraction of outliers with respect to the apparent magnitude in the i-band and the spectroscopic redshift obtained with BCNz2 and DEEPz for the W1 and G09 baseline samples.

thumbnail Fig. 6.

Comparison of the σ68 values (top row) and outlier fractions (bottom row) for the W1 and G09 samples obtained using the DEEPz and BCNz2 methods. Columns (1) and (2) correspond to the implementation of the model trained with the W1 sample with missing NB values inferred. Columns (3) and (4) correspond to the implementation of the model trained with the G09+COSMOS sample with missing NB flues inferred. Upper panels (a) and (b): σ68 for DEEPz and BCNz2 plotted against i-band magnitude for (a) and against redshift for (b). Lower panels (c) and (d): Fraction of outliers with Δz >  0.02 plotted as a function of i-band magnitude for (c) and redshift for (d).

The measurements of zp in the W1 and W3 fields indicate that the σ68 metric, for both DEEPz and BCNz2, shows the expected behaviour: a decrease in precision and an increase in the fraction of outliers moving to fainter galaxies and higher spectroscopic redshifts. The performance of the two methods is similar for bright galaxies up to an apparent i-band magnitude of approximately 20 and for nearby galaxies up to a redshift of around 0.3. However, beyond these values, DEEPz is more accurate than the template method BCNz2, reaching a difference in σ68 of 0.02(1 + zs) between the two methods for faint galaxies with an apparent magnitude in the i-band of 23.

This observed trend between DEEPz and BCNz2 on the W1 baseline sample using the model resulting from the combination of the W1 and W3 fields is consistent with that found in the individual studies of the W1 and W3 fields, for any data refinement test. It is also consistent with the measurements in the COSMOS field, where the methods are applied to samples with a range of apparent magnitude limits in the i-band in [20–22.5]. However, this behaviour is completely different in the case of bright galaxies in the i-band, i.e., for the G09 baseline sample, the BCNz2 template method is more accurate than DEEPz in all cases. However, when a model is implemented with the combination of galaxies from the G09 and COSMOS samples on the G09 baseline sample, DEEPz shows an improvement in precision for bright galaxies but not larger than that for BCNz2. These results show that the precision of DEEPz increases as the number of examples in the training sample increases, as well as when the number of BB filters increases, as demonstrated in the COSMOS field (Eriksen et al. 2020).

Since we have an estimate of the error behaviour of each method according to i and zs, we wonder if there is any correlation in the errors of each method, i.e., if the two methods tend to underestimate or overestimate the redshift in the same way. Figure 7 shows the comparison of errors Δz from each method, categorised by apparent magnitude in the i-band and zs using a colour code. For the baseline sample, we find that the two methods tend towards the perfect case, i.e., if both methods determined zp identical to zs, all points would be concentrated at a single point in the centre of the plot. In the scatter plots, we see that the errors of DEEPz and BCNz2 correlate linearly, meaning that there is a set of galaxies for which both methods determine zp with the same level of precision. This is to be expected as both methods use the same photometry. Additionally, on the scatter plots, we observe two patterns, one horizontal and one vertical, located at zero on each axis. These patterns indicate that there are cases where one method is more accurate than the other. However, we find less dispersion overall for the DEEPz method. Additionally, we observe that both methods decrease in precision when the apparent magnitude in the i-band and the redshift zs increase.

thumbnail Fig. 7.

Scatter plots of errors on the x-axis for the DEEPz method and on the y-axis for the errors obtained when determining zp using the BCNz2 method. Top panel: scatter plot of errors for each method along with a histogram showing the distribution of points along each axis. Bottom-left panel: scatter plot of errors for each method colour-coded according to the apparent magnitude in the i-band. Bottom-right panel: scatter plot of errors for each method colour-coded according to zs.

4.4. Catalogues

Based on the results obtained in each field and the study regarding observational effects, as well as the performance comparison between DEEPz and the BCNz2 template method presented in Navarro-Gironés et al. (2024), we have generated catalogues of photometric redshifts in the W1, W3, and G09 fields. In all three fields, we have chosen to utilise the non-NaN model, which uses a training set with inferred values for misssing NBs obtained using a fit to the measurements in the r, g and i-bands.

We applied the CV and low S/N models as additional strategies to enhance the completeness of the catalogues. In the first case, we combined the galaxies belonging to W1 and W3 that have measurements with the second filter in the i-band and galaxies with a low S/N in both the W1 and W3 fields. The resulting zp catalogue in the W1 field has 388 375 galaxies, of which 86 were assigned a zp = 0, which means the model could not determine a value. After excluding these, a total of 388 289 galaxies had a photometric redshift estimated with a mean zp = 0.59. In the W3 field, 779 935 photometric redshifts were obtained with an average zp = 0.58, excluding 172 galaxies with zp = 0. Lastly, in the G09 field, the photometric redshifts of 490 617 galaxies were estimated, with an average zp = 0.60, after discarding 31 galaxies with zp = 0.

5. Close galaxy pairs

To study the performance of zp obtained here in a specific application, the identification of close galaxy pairs are analysed. From the study of the interaction between the members of these systems, it can be inferred how their physical properties are affected (Toomre & Toomre 1972; Mesa et al. 2014). The identification of pairs of close galaxies in the W1 and W3 fields and the estimation of their properties are carried out following the methodology described in Gonzalez et al. (2023). We compare our results to those obtained in the same study, where the photometric redshifts were determined using the BCNz2 method.

Gonzalez et al. (2023) analyse two samples, one considering the entire galaxy catalogue in each field (total sample) and the other after applying a quality cut using the photometric redshift values obtained with BCNz2 (gold sample). Here, the comparison of the performance of zp in the identification of pairs of galaxies is performed on the total sample. This sample matches 99.65% of our catalogue with the BCNz2 sample in the W1 field and 97.05% of the BCNz2 sample in the W3 field.

The identification of pairs of galaxies is carried out using the algorithm described in Rodriguez et al. (2020), which applies criteria regarding the projected distance between galaxies (rp <  50 kpc), the difference in projected velocity (ΔV <  3500 km s−1), and isolation. Despite the difference in the number of galaxies used to generate pairs, the number of pairs of galaxies identified is similar in both fields. In the W1 field, 656 pairs were detected through zp determined by DEEPz (zDEEPz) and 637 pairs with zp from BCNz2 (zBCNz2). In the W3 field, 1521 pairs were found with zDEEPz and 1627 pairs with zBCNz2. Additionally, as shown in Table 3, we also find that the number of galaxy pairs obtained using both catalogues agree in the different samples defined according to the absolute magnitudes in the r-band of the pairs, defined as Mrp = −2.5log(L1 + L2), where L2 and L1 represent the luminosities in the r-band of the fainter and brighter galaxies in the pair system, respectively. We also found similarity in the luminosity ratio between the members of the pairs (L2/L1), the redshift of the pair, and the classification of the pairs according to the colour-magnitude diagram into red and blue categories, as set out in Gonzalez et al. (2023).

Table 3.

Close galaxy pair counts of sub-samples identified with DEEPz and BCNz2.

Among the estimated properties, the mass was determined using weak gravitational lensing stacking techniques in the subsets already defined (see Table 3). Through stacking techniques, the signal-to-noise measurement of the lensing effect is improved by artificially increasing the source galaxy density, from which the lens parameters are derived. This allows the calculation of the posterior probability distribution, which in this case refers to the probability distribution of the mass of the galaxy pair. It is obtained by considering Bayesian techniques, where prior information about the parameters (the prior distribution) is combined with the information provided by the data (the likelihood) to obtain the posterior distribution. The posterior density distributions were analysed, considering the physical properties described in Table 3. Additionally, the analysis includes the results obtained in Gonzalez et al. (2023) through the BCNz2 method on the total sample.

As seen in Fig. 8, the posterior density distributions for the DEEPz and BCNz2 methods are shown for the samples based on magnitude, colour, redshift, and luminosity ratio criteria. For samples selected under magnitude and colour criteria, we found similar trends and values. The probability profiles for pairs with z <  0.4 and red pairs show differences, such that the pairs identified with DEEPz that are close and red are less massive compared to BCNz2. For pairs with z ≥ 0.4 and satellite galaxies that meet L2/L1 ≥ 0.5, we found that DEEPz pairs are more massive compared to BCNz2. Nevertheless, the error intervals of the medians of all distributions overlap. Consequently, the mass estimates for both samples are statistically equivalent.

thumbnail Fig. 8.

Posterior density distributions of the fitted log M200. Each row shows the distributions for the pairs selected from the BCNz2 and DEEPz methods in purple and orange, respectively. The selection cuts according to pair properties are shown using thicker and narrower lines as referred to in the legends. Vertical lines indicate the median values and the shaded regions enclose 68% of the distributions corresponding to the errors. In each panel, the stacked pair sub-sample described in Table 3 is specified.

While the samples of DEEPz and BCNz2 pairs are compatible in terms of number and mass, even when selecting sub-samples, we find that only 249 are common pairs, meaning the central and satellite galaxies of the pair coincide. This amount represents approximately 40% of the total number of pairs obtained with zDEEPz.

Regarding complementary galaxy pairs, which are pairs that do not match in the identifications made through zDEEPz and zBCNz2, we find that 250 galaxy members are present in both catalogues. Of these 250 galaxies, it was observed that central galaxies in DEEPz are identified as satellites in BCNz2 in three cases, and vice versa, satellite galaxies in DEEPz are classified as central in BCNz2 in three instances. Despite having the same number, they are not related as pairs; in other words, the central galaxy in one catalogue does not become a satellite in the other. Therefore, the similarity in number and properties results from the identification algorithm, which links close galaxies that are likely in the same environment, implying that they have similar physical properties and also inherit the global properties of the sample used for their identification.

When comparing the distributions of photometric redshifts of the pairs, represented in Fig. 9 through histograms and box plots1, we observe that the common pairs identified with zDEEPz and zBCNz2 ( z DEEPz = $ z_{\mathrm{DEEPz}}^{=} $ and z BCNz 2 = $ z_{\mathrm{BCNz2}}^{=} $) tend to have lower values compared to their complementary sets ( z DEEPz $ z_{\mathrm{DEEPz}}^{\neq} $ and z BCNz 2 $ z_{\mathrm{BCNz2}}^{\neq} $). This difference is significant, as the confidence intervals of the box plots do not overlap, indicating that the medians of zp of z DEEPz = $ z_{\mathrm{DEEPz}}^{=} $ and z BCNz 2 = $ z_{\mathrm{BCNz2}}^{=} $ are statistically different from their complements. However, the statistical difference in the medians of zp between common pairs and their complements is likely because the redshifts of the latter are higher due to the increased volume. This is evident as their zp distribution extends across the entire range.

thumbnail Fig. 9.

Histograms and box plots showing a comparison of the distributions of the photometric redshifts of close galaxy pairs. Orange represents the sub-samples of pairs determined using zDEEPz, and black represents sub-samples of pairs determined with zBCNz2, respectively. Upper panel: distributions corresponding to the samples DEEPz= and BCNz2=. Lower panel: distributions corresponding to the samples DEEPz and BCNz2.

This behaviour is in agreement with what was observed in the comparison between σ68 and the fraction of outliers with respect to the apparent magnitude in the i-band and zs bands for both methods. The redshifts determined by each method for bright, close galaxies are similar in their trends and precision. Regarding galaxies with apparent magnitude in the i-band fainter than 20 and zpair greater than 0.3, the σ68 measurements between these two methods show differences of up to ∼0.02 in the estimates of zp for the faintest galaxies with respect to the apparent magnitude in the i-band, with DEEPz showing higher precision compared to BCNz2. Additionally, we find from galaxies with spectroscopic information that the difference between the pairs of each method is due to when one of these methods deviates significantly from the actual redshift value of the member galaxy of the pair, as shown in Fig. 10. Where we observe that the larger the errors in one of the member galaxies of the pair, the more likely the difference between pairs identified with the zp of DEEPz and BCNz2.

thumbnail Fig. 10.

Scatter plot between the errors of DEEPz and BCNz2 along with density distributions of galaxies in common pairs (pairDEEPz = pairBCNz2) and galaxies in non-common pairs (pairDEEPz ≠ pairBCNz2) between the two methods.

Considering the behaviour between the two methods and the fact that the only difference in identifying close galaxy pairs is the determination of the photometric redshift, it is estimated that DEEPz agrees with the purity and completeness ranges of close galaxy pair catalogues, as described in Table 1 of (Rodriguez et al. 2020), under the criteria of magnitude i <  22.5 and spectroscopic redshift 0.2 <  z <  0.6.

Therefore, we can initially conclude that the photometric redshift catalogues generated with DEEPz are effective for the identification of galaxy pairs. Furthermore, through the identification of common pairs and their complements, along with the Δz values of the member galaxies obtained with each method, we observe that combining both methodologies can increase the purity of pair catalogues when the same pairs are identified, and also test their performance. However, a more detailed study is needed to correlate the photometric errors of each method with the identification of galaxy pairs and their properties.

6. Conclusions

The PAU Survey is an optical survey made using the PAUCam instrument on the William Herschel Telescope in La Palma, Canary Islands. The survey employs 40 NB optical filters evenly spaced at 100 Å intervals covering a range from 4500 Å to 8500 Å, and it achieves an average spectral resolution of R ∼ 65. The survey focuses on fields such as COSMOS; CFHTLS (W1, W3, and W4); and KiDS/GAMA G09. We have presented an estimation of the photometric redshift zp of galaxies in three of the fields observed by PAUS: W1, W3, and G09. We focused on fields such as COSMOS; CFHTLS (W1, W3, and W4); and KiDS/GAMA G09, and we have presented the estimation of the photometric redshift zp of galaxies in three of the fields observed by PAUS: W1, W3, and G09. For the estimation of zp in each field, a combined approach using simulated and observational data was employed. The observational data included individual exposure information, combined fluxes from 40 NBs of PAUS, BB data from CFHTLenS in the W1 and W3 fields, and data from KiDS + VIKING in the G09 field along with spectroscopic redshifts from various studies.

We utilised DEEPz (Eriksen et al. 2020) as the main tool, a code tested in the COSMOS field that includes one autoencoder and a mixture density network. The primary metric used to evaluate the precision of photometric redshifts was the centralised scatter, σ68 (see Eq. (1)), derived from the distribution of the normalised difference between spectroscopic and photometric redshifts. Additionally, the fraction of outliers with |Δz|> 0.02 (see Eq. (2)) was used as a secondary metric. Below, we present the main results from the studies conducted to evaluate the precision of the obtained photometric redshift estimates:

Performance by field. The performance of DEEPz was evaluated by building models using samples from the individual fields W1, W3, G09, and COSMOS as well as combinations of W1+W3 and G09+COSMOS. The precision of the photometric redshifts with DEEPz for all samples is σ68 <  0.06, which is within the magnitude range for galaxies with i <  23. The combinations of the W1+W3 and G09+COSMOS fields achieved better results in terms of σ68 and a lower outlier fraction compared to the individual fields, due to the larger training sample. When comparing between combinations, W1+W3 showed a slightly better performance than G09+COSMOS.

Data refinement. We conducted seven tests (see Table 2) to assess the impact of various data refinement strategies on the precision of photometric redshifts (zp), considering such instrumental factors as completeness, and galaxy properties. In particular, the imputation of missing NB data (non-NaN test) increased the sample size by approximately 35% in W1 and W3 and by about 20% in G09 and COSMOS. This approach proved to be the most effective among the data refinement tests, as it remained outside the error bars of other test trends and achieved a lower maximum value of σ68 compared to the other tests, reaching 0.5 for i = 23. This indicates that filling in the missing NB values is a viable strategy for generating high-precision photometric redshift catalogs. In the case of G09+COSMOS, this refinement also showed better performance trends compared to other tests, suggesting that this method is optimal for all fields.

On the other hand, the analysis that includes only NBs for obtaining the redshift estimates (NBs test) showed a decline in precision with increasing i-band magnitudes and redshifts compared to the analysis including BBs. However, for low-redshift and bright galaxies, the performance was similar, suggesting that excluding BBs does not significantly affect zp accuracy in these cases. This underscores the importance of the high S/N in BB data for capturing general SED features critical for accurate redshift estimation. Furthermore, excluding NBs in the blue wavelength region yielded results comparable to those obtained with low S/N data, indicating that detailed SED features provided by blue NBs may not be essential for estimating zp in W1, W3, and G09 fields.

Spectral biases. Additionally, we studied if there is any dependence on the photometric redshift precision based on the spectroscopic samples used in the training of DEEPz. We found a dependence of zp precision on the spectroscopic survey used in the training. This gave us an implicit measure that there will be better photometric redshift precision for galaxies with the characteristics of the survey predominant in the number of examples, which in our case is VIPERS.

DEEPz versus BCNz2. We compared our photometric redshift measurements with DEEPz to those presented in Navarro-Gironés et al. (2024), which used the BCNz2 template method. We found similar trends in accuracy for both apparent magnitude in the i-band and zs. However, DEEPz demonstrates a 20–50% reduction in σ68 and an outlier fraction for the faintest galaxies (i = 21 − 23) compared to BCNz2 (see Fig. 6). For a few galaxies, we also observed a linear correlation between their errors, with both methods slightly tending to underestimate errors within certain redshift ranges.

Close galaxy pairs. Finally, as an example test case, following the methodology of Gonzalez et al. (2023), where galaxy pairs are identified using zp obtained with BCNz2, we implemented the zp galaxy catalogues determined with DEEPz in the W1+W3 fields with a precision of σ68 = 0.01. Based on the results of Gonzalez et al. (2023) and our own results, we found a good agreement in the number of identified pairs and their physical properties. However, despite this agreement in number and physical properties, DEEPz and BCNz2 only match 40% of the total sample of galaxy pairs concerning the DEEPz sample, which is characterised by including photometric redshifts of nearby galaxies. Regarding the 60% of pairs identified by DEEPz that are not found in the identification by BCNz2, we found that this difference is due to the low precision for distant galaxies by both methods, with BCNz2 having a greater error in such cases. Nevertheless, since all the galaxy pairs identified with redshifts provided by DEEPz and BCNz2 meet the purity and completeness criteria detailed in Rodriguez et al. (2020), we consider that the catalogues generated here are capable of identifying close galaxy pairs and that the combination of both methods improves their purity.

We note that the precision of DEEPz increases with more training examples. DEEPz shows biases from the spectroscopic surveys used in its training, and its precision is comparable to or better than BCNz2 in certain fields. These aspects and biases can be improved by including more spectroscopic information, which is time-consuming. Implementing data augmentation and sampling techniques could potentially solve this issue. This step will be crucial to continue with transfer learning in order to improve DEEPz and make the models more general. In turn, more general models would enable broader use of the catalogues and contribute to a more precise understanding of the Universe.

7. Data availability

The DEEPz photometric redshift catalogues in the W1, W3, and G09 fields will become publicly available on the CosmoHub platform (Tallada et al. 2020); 2017ehep.confE.488C in an upcoming data release at https://cosmohub.pic.es/catalogs/319. The data is currently available on a reasonable request to the author.


1

In the boxplot diagrams, the top and bottom lines of the boxes represent the 25th and 75th percentiles of the distributions, while the wrists of the boxes represent the medians. Notches display the confidence interval (95% confidence level) symmetrically around the medians. When comparing distributions, if the notches of two boxes do not overlap, there is a statistically significant difference between the medians (McGill et al. 1978; Krzywinski & Altman 2014). For skewed distributions or small sized samples it might be the case that the CI is wider than the 25th or 75th percentile, therefore the plot will display some “inside out” shape. The lines extending from the boxes are called whiskers. The boundary of the whiskers is based on the 1.5 interquartile range (IQR) value. The whiskers extend from the bottom/top of the boxes up to the lowest/largest data point that falls within 1.5 times the IQR. The whisker lengths might not be symmetrical since they must end at an observed data point.

Acknowledgments

We would like to thank the Hanyue Guo for the useful comments and suggestions which has helped to improve this paper. CosmoHub has been developed by the Port d’Informació Científica (PIC), maintained through a collaboration of the Institut de Física d’Altes Energies (IFAE) and the Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT) and the Institute of Space Sciences (CSIC & IEEC). CosmoHub was partially funded by the “Plan Estatal de Investigación Científica y Técnica y de Innovación" program of the Spanish government, has been supported by the call for grants for Scientific and Technical Equipment 2021 of the State Program for Knowledge Generation and Scientific and Technological Strengthening of the R+D+i System, financed by MCIN/AEI/10.13039/501100011033 and the EU NextGeneration/PRTR (Hadoop Cluster for the comprehensive management of massive scientific data, reference EQC2021-007479-P) and by MICIIN with funding from European Union NextGenerationEU(PRTR-C17.I1) and by Generalitat de Catalunya. The PAU data centre is hosted by the Port d’Informació Científica (PIC), maintained through a collaboration of CIEMAT and IFAE, with additional support from Universitat Autónoma de Barcelona and ERDF. We acknowledge the PIC services department team for their support and fruitful discussions. This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under the Marie Sklodowska-Curie grant agreement No 734374 and HORIZON-MSCA-2021-SE-01 Research and Innovation programme under the Marie Sklodowska-Curie grant agreement number 101086388. This work was also partially supported by the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET, Argentina), Agencia Nacional de Promoción Científica y Tecnológica and the Secretaría de Ciencia y Tecnologóa de la Universidad Nacional de Córdoba (SeCyT-UNC, Argentina). This work has made use of CosmoHub. CosmoHub has been developed by the Port d’Informació Científica (PIC), maintained through a collaboration of the Institut de Física d’Altes Energies (IFAE) and the Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT), and was partially funded by the “Plan Estatal de Investigación Científica y Técnica y de Innovación" program of the Spanish government. M. Eriksen acknowledges funding by MCIN with funding from European Union NextGenerationEU (PRTR-C17.I1) and by Generalitat de Catalunya. This work has been also partially supported by the Polish National Agency for Academic Exchange (Bekker grant BPN/BEK/2021/1/00298/DEC/1), the European Union’s Horizon 2020 Research and Innovation programme under the Maria Sklodowska-Curie grant agreement (No. 754510). F. Rodriguez would like to acknowledge support from the ICTP through the Junior Associates Programme 2023–2028. C.M Baugh acknowledges support from the Science Technology Facilities Council through grant number ST/X001075/1. A. Wittje is supported by the DFG (SFB 1491). CIEMAT participation is supported by the grant PID2021-123012NB-C42P funded by MCIN/AEI/10.13039/501100011033. C. Padilla acknowledges support from the Spanish Plan Nacional project PID2019-111317GB-C32 and PID2022-141079NB-C32. P. Renard acknowledges the support by the Tsinghua Shui Mu Scholarship, the funding of the National Key R&D Program of China (grant No. 2023YFA1605600), the science research grants from the China Manned Space Project with No. CMS-CSST2021-A05, the Tsinghua University Initiative Scientific Research Program (No. 20223080023) and the National Science Foundation of China (grant No. 12350410365). J. Carretero acknowledges support from the grant PID2021-123012NA-C44 funded by MCIN/AEI/10.13039/501100011033 and by “ERDF A way of making Europe". H. Hildebrandt is supported by a DFG Heisenberg grant (Hi 1495/5-1), the DFG Collaborative Research Center SFB1491, as well as an ERC Consolidator Grant (No. 770935). M. Siudek acknowledges support by the Polish National Agency for Academic Exchange (Bekker grant BPN/BEK/2021/1/00298/DEC/1), the State Research Agency of the Spanish Ministry of Science and Innovation under the grants “Galaxy Evolution with Artificial Intelligence” (PGC2018-100852-A-I00) and “BASALT” (PID2021-126838NB-I00). This work was partially supported by the European Union’s Horizon 2020 Research and Innovation program under the Maria Sklodowska-Curie grant agreement (No. 754510). E.J. Gonzalez acknowledges support from the grant PID2021-123012NA-C44 funded by MCIN/AEI/10.13039/501100011033 and by “ERDF A way of making Europe", and also from the European Union NextGenerationEU (PRTR-C17.I1) and by Generalitat de Catalunya. J. Carretero acknowledge support from the grant PID2021-123012NA-C44 funded by MCIN/AEI/10.13039/501100011033 and by “ERDF A way of making Europe".

References

  1. Ahumada, R., Prieto, C. A., Almeida, A., et al. 2020, ApJS, 249, 3 [Google Scholar]
  2. Alarcon, A., Sánchez, C., Bernstein, G. M., & Gaztanãga, E. 2020, MNRAS, 498, 2614 [NASA ADS] [CrossRef] [Google Scholar]
  3. Alarcon, A., Gaztanaga, E., Eriksen, M., et al. 2021, MNRAS, 501, 6103 [NASA ADS] [CrossRef] [Google Scholar]
  4. Baldry, I. K., Liske, J., Brown, M. J., et al. 2018, MNRAS, 474, 3875 [NASA ADS] [CrossRef] [Google Scholar]
  5. Barro, G., Pérez-González, P. G., Cava, A., et al. 2019, ApJS, 243, 22 [NASA ADS] [CrossRef] [Google Scholar]
  6. Bellstedt, S., Driver, S. P., Robotham, A. S., et al. 2020, MNRAS, 496, 3235 [NASA ADS] [CrossRef] [Google Scholar]
  7. Benítez, N., Dupke, R., Moles, M., et al. 2014, in Proceedings of the 11th Scientific Meeting of the Spanish Astronomical Society– Highlights of Spanish Astrophysics VIII, SEA 2014 [Google Scholar]
  8. Brammer, G. B., van Dokkum, P. G., & Coppi, P. 2008, ApJ, 686, 1503 [Google Scholar]
  9. Brammer, G. B., Van Dokkum, P. G., Franx, M., et al. 2012, ApJS, 200, 13 [Google Scholar]
  10. Carretero, J., Tallada, P., Casals, J., et al. 2017, in Proceedings of the European Physical Society Conference on High Energy Physics, 5–12 July, 488 [CrossRef] [Google Scholar]
  11. Castander, F. J., Ballester, O., Bauer, A., et al. 2012, in Ground-based and Airborne Instrumentation for Astronomy IV (SPIE), Proc. of SPIE, 8446, 1962 [Google Scholar]
  12. Castander, F. J., Serrano, S., Eriksen, M., et al. 2024, MNRAS, 531, 5067 [NASA ADS] [CrossRef] [Google Scholar]
  13. Cenarro, A. J., Moles, M., Cristóbal-Hornillos, D., et al. 2019, A&A, 622, A176 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Childress, M. J., Lidman, C., Davis, T. M., et al. 2017, MNRAS, 472, 273 [Google Scholar]
  15. Conroy, C., & Gunn, J. E. 2010, ApJ, 712, 833 [Google Scholar]
  16. Conroy, C., Gunn, J. E., & White, M. 2009, ApJ, 699, 486 [Google Scholar]
  17. Csizi, B., Tortorelli, L., Siudek, M., et al. 2024, A&A, 689, A37 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Dark Energy Survey and Kilo-Degree Survey Collaboration (Abbott, T. M. C., et al.) 2023, Open J. Astrophys., 6, 36 [NASA ADS] [Google Scholar]
  19. Davies, L. J., Driver, S. P., Robotham, A. S., et al. 2015, MNRAS, 447, 1014 [Google Scholar]
  20. Davis, M., Guhathakurta, P., Konidaris, N. P., et al. 2007, ApJ, 660, L1 [NASA ADS] [CrossRef] [Google Scholar]
  21. de Jong, J. T., Verdoes Kleijn, G. A., Kuijken, K. H., & Valentijn, E. A. 2013, Exp. Astron., 35, 25 [NASA ADS] [CrossRef] [Google Scholar]
  22. de Jong, J. T., Verdoes Kleijn, G. A., Boxhoorn, D. R., et al. 2015, A&A, 582, A62 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. De Oliveira, C. M., Ribeiro, T., Schoenell, W., et al. 2019, MNRAS, 489, 241 [NASA ADS] [CrossRef] [Google Scholar]
  24. DESI Collaboration (Adame, A.G., et al.) 2024, ArXiv e-prints [arXiv:2404.03002] [Google Scholar]
  25. Dressler, A., Bigelow, B., Hare, T., et al. 2011, PASP, 123, 288 [NASA ADS] [CrossRef] [Google Scholar]
  26. Driver, S. P., Norberg, P., Baldry, I. K., et al. 2009, Astron. Geophys., 50, 5 [Google Scholar]
  27. Driver, S. P., Hill, D. T., Kelvin, L. S., et al. 2011, MNRAS, 413, 971 [Google Scholar]
  28. Driver, S. P., Davies, L. J., Meyer, M., et al. 2016, in The Universe of Digital Sky Surveys, eds. N. R. Napolitano, G. Longo, M. Marconi, M. Paolillo, & E. Iodice, Astrophysics and Space Science Proceedings, 42, 205 [NASA ADS] [CrossRef] [Google Scholar]
  29. Edge, A., Sutherland, W., Kuijken, K., et al. 2013, The Messenger, 154, 32 [NASA ADS] [Google Scholar]
  30. Erben, T., Hildebrandt, H., Miller, L., et al. 2013, MNRAS, 433, 2545 [Google Scholar]
  31. Eriksen, M., Alarcon, A., Gaztanaga, E., et al. 2019, MNRAS, 484, 4200 [NASA ADS] [CrossRef] [Google Scholar]
  32. Eriksen, M., Alarcon, A., Cabayol, L., et al. 2020, MNRAS, 497, 4565 [CrossRef] [Google Scholar]
  33. Garilli, B., Guzzo, L., Scodeggio, M., et al. 2014, A&A, 562, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  34. Garilli, B., McLure, R., Pentericci, L., et al. 2021, A&A, 647, A150 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. Geha, M., Wechsler, R. H., Mao, Y.-Y., et al. 2017, ApJ, 847, 4 [NASA ADS] [CrossRef] [Google Scholar]
  36. Gonzalez, E. J., Rodriguez, F., Navarro-Gironés, D., et al. 2023, MNRAS, 522, 5655 [NASA ADS] [CrossRef] [Google Scholar]
  37. Guzzo, L., Scodeggio, M., Garilli, B., et al. 2014, A&A, 566, A108 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Heymans, C., Van Waerbeke, L., Miller, L., et al. 2012, MNRAS, 427, 146 [Google Scholar]
  39. Hildebrandt, H., Arnouts, S., Capak, P., et al. 2010, A&A, 523, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Hildebrandt, H., Erben, T., Kuijken, K., et al. 2012, MNRAS, 421, 2355 [Google Scholar]
  41. Hopkins, A. M., Driver, S. P., Brough, S., et al. 2013, MNRAS, 430, 2047 [NASA ADS] [CrossRef] [Google Scholar]
  42. Hoyle, B., Gruen, D., Bernstein, G. M., et al. 2018, MNRAS, 478, 592 [Google Scholar]
  43. Jarrett, T. 2004, PASA, 21, 396 [Google Scholar]
  44. Johnston, H., Joachimi, B., Norberg, P., et al. 2021, A&A, 646, A147 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Koekemoer, A. M., Faber, S. M., Ferguson, H. C., et al. 2011, ApJS, 197, 36 [NASA ADS] [CrossRef] [Google Scholar]
  46. Krzywinski, M., & Altman, N. 2014, Nature Methods, 11, 119 [CrossRef] [Google Scholar]
  47. Kuijken, K., Heymans, C., Hildebrandt, H., et al. 2015, MNRAS, 454, 3500 [Google Scholar]
  48. Le Fèvre, O., Cassata, P., Cucciati, O., et al. 2013, A&A, 559, A14 [Google Scholar]
  49. Lilly, S. J., Le Brun, V., Maier, C., et al. 2009, ApJS, 184, 218 [Google Scholar]
  50. Liske, J., Baldry, I. K., Driver, S. P., et al. 2015, MNRAS, 452, 2087 [Google Scholar]
  51. Manzoni, G., Baugh, C. M., Norberg, P., et al. 2024, MNRAS, 530, 1394 [NASA ADS] [CrossRef] [Google Scholar]
  52. Martí, P., Miquel, R., Bauer, A., & Gaztañaga, E. 2014a, MNRAS, 437, 3490 [CrossRef] [Google Scholar]
  53. Martí, P., Miquel, R., Castander, F. J., et al. 2014b, MNRAS, 442, 92 [Google Scholar]
  54. Masters, D. C., Stern, D. K., Cohen, J. G., et al. 2019, ApJ, 877, 81 [Google Scholar]
  55. McGill, R., Tukey, J. W., & Larsen, W. A. 1978, Am. Statistician, 32, 12 [Google Scholar]
  56. Mesa, V., Duplancic, F., Alonso, S., Coldwell, G., & Lambas, D. G. 2014, MNRAS, 438, 1784 [CrossRef] [Google Scholar]
  57. Molino, A., Benítez, N., Moles, M., et al. 2014, MNRAS, 441, 2891 [Google Scholar]
  58. Molino, A., Costa-Duarte, M. V., Sampedro, L., et al. 2020, MNRAS, 499, 3884 [CrossRef] [Google Scholar]
  59. Nanayakkara, T., Glazebrook, K., Kacprzak, G. G., et al. 2016, ApJ, 828, 21 [NASA ADS] [CrossRef] [Google Scholar]
  60. Navarro-Gironés, D., Gaztañaga, E., Crocce, M., et al. 2024, MNRAS, 534, 1504 [CrossRef] [Google Scholar]
  61. Newman, J. A., Cooper, M. C., Davis, M., et al. 2013, ApJS, 208, 5 [Google Scholar]
  62. Padilla, C., Castander, F. J., Alarcón, A., et al. 2019, AJ, 157, 246 [NASA ADS] [CrossRef] [Google Scholar]
  63. Pickles, A. J. 1998, PASP, 110, 683 [Google Scholar]
  64. Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  65. Planck Collaboration Int. XLVIII. 2016, A&A, 596, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  66. Renard, P., Siudek, M., Eriksen, M. B., et al. 2022, MNRAS, 515, 146 [NASA ADS] [CrossRef] [Google Scholar]
  67. Rodriguez, F., Gonzalez, E. J., O’Mill, A. L., et al. 2020, A&A, 634, A123 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  68. Salvato, M., Ilbert, O., & Hoyle, B. 2019, Nat. Astron., 3, 212 [NASA ADS] [CrossRef] [Google Scholar]
  69. Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
  70. Serrano, S., Gaztañaga, E., Castander, F. J., et al. 2023, MNRAS, 523, 3287 [NASA ADS] [CrossRef] [Google Scholar]
  71. Soo, J. Y., Joachimi, B., Eriksen, M., et al. 2021, MNRAS, 503, 4118 [NASA ADS] [CrossRef] [Google Scholar]
  72. Steidel, C. C., Rudie, G. C., Strom, A. L., et al. 2014, ApJ, 795, 165 [Google Scholar]
  73. Sullivan, M., Conley, A., Howell, D. A., et al. 2010, MNRAS, 406, 782 [NASA ADS] [Google Scholar]
  74. Tallada, P., Carretero, J., Casals, J., et al. 2020, Astron. Comput., 32, 100391 [Google Scholar]
  75. Tanaka, M., & Kodama, T. 2004, in Proceedings of Science, 2004 [Google Scholar]
  76. Taniguchi, Y., Kajisawa, M., Kobayashi, M. A., et al. 2015, PASJ, 67, 104 [NASA ADS] [CrossRef] [Google Scholar]
  77. Toomre, A., & Toomre, J. 1972, ApJ, 178, 623 [Google Scholar]
  78. Tortorelli, L., Siudek, M., Moser, B., et al. 2021, JCAP, 2021, 013 [CrossRef] [Google Scholar]
  79. Van Den Busch, J. L., Wright, A. H., Hildebrandt, H., et al. 2022, A&A, 664, A170 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  80. Venemans, B. P., Verdoes Kleijn, G. A., Mwebaze, J., et al. 2015, MNRAS, 453, 2259 [Google Scholar]
  81. Weinberg, D. H., Mortonson, M. J., Eisenstein, D. J., et al. 2013, Phys. Rep., 530, 87 [Google Scholar]
  82. Wright, A. H., Kuijken, K., Hildebrandt, H., et al. 2024, A&A, 686, A170 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

Appendix A: Comparison of σ68 and σNMAD

We provide a comparison between two commonly used metrics in the analysis of redshift precision: σ68 and σNMAD. As defined in Sect. 3.2, σ68 is a measure of the scatter in the accuracy of zp (see Eq. 1). In general, we use σ68 because this approach was adopted in Martí et al. (2014b), and from this, several PAUS papers related to photometric redshift use this metric, for example, (Eriksen et al. 2019; Alarcon et al. 2020; Eriksen et al. 2020; Soo et al. 2021; Navarro-Gironés et al. 2024).

The normalised median absolute deviation (σNMAD) is also a robust estimator of scatter, but it is particularly useful in the presence of outliers, providing a complementary perspective on the performance of zp estimates.

In Brammer et al. (2008), σNMAD is defined as

σ NMAD 1.48 × median ( | z p z s | 1 + z s ) , $$ \begin{aligned} \sigma _{\rm NMAD} \equiv 1.48 \times \text{ median}\left(\frac{\left|z_{\rm p} - z_{\rm s}\right|}{1 + z_{\rm s}}\right), \end{aligned} $$(A.1)

where zp is the photometric redshift and zs is the spectroscopic redshift.

To quantitatively compare these metrics, we present the ratio of σ68 to σNMAD on the W1 validation set. Figure A.1 shows the comparison of these metrics in bins of iAB and zs.

thumbnail Fig. A.1.

Comparison of σ68 and σNMAD. The plot shows the σ68/σNMAD ratio for the redshift estimates in the W1 validation set. Panel (a) is a function of the i-band magnitude, while panel (b) shows the trends in terms of spectroscopic redshift.

In general terms, both metrics exhibit similar behaviour, particularly for galaxies with i-band magnitude less than 21. However, σ68 tends to impose a stronger penalty compared to σNMAD, which is more noticeable for faint or high-redshift galaxies. This is because σ68 is more affected by outliers.

Appendix B: Feature selection

One of the most important steps in classifier determination is generating and selecting the features with the highest entropy. To do this, we implemented different statistical methods, particularly unsupervised learning and univariate analysis. In Fig. B.1, we show in the left panel the heat map of the different Pearson correlations between the bands, and in the central and right panels, the scores obtained applying the mutual information and regression methods, respectively. This type of analysis allows us to estimate the importance of the features, i.e., the bands for the analysis or modelling, and to discard irrelevant or redundant ones. Reducing the number of features while preserving the most important information improves the efficiency and interpretation of the models. We observe that the trend of the BBs is similar to the trend of the NBs in terms of importance when studying their relationship with the spectroscopic redshift intervals, so it is interesting to study the necessity of the BBs. Additionally, we note that the regression method assigns a lower score to the bands of blue wavelengths compared to the rest of the bands, so it is interesting to study their importance in determining the zp.

thumbnail Fig. B.1.

Visualisation of univariate study of bands used in W1 and W3 fields. Left panel: correlation among the bands through Pearson correlation. Central panel and right panel: bar chart of the scores of the correlated bands with the spectroscopic redshift bins calculated through MI and regression methods.

Appendix C: Performance for field and refinement data test

Each row corresponds to a different refinement test for the five samples. In the left column, the results are presented according to 10 i-band bins, while in the right column, the results are shown according to 10 zs bins. Each result is linked to the initial training and validation samples within each field and its combination; these samples may increase, decrease, or remain constant in the number of examples, depending on the refinement data test being conducted. Therefore, the intervals in each curve of each figure vary. The different data refinement tests are showed in Fig. C.1 and are summarised in the heatmap of Fig. C.2, this plots are showing the value of the σ68 metric for the validation sample.

thumbnail Fig. C.1.

Trend of σ68 as a function of the i-band magnitude and spectroscopic redshift for each field and the combination W1+W3 and G09+COSMOS. Each row corresponds to a data refinement study, hence the validation sets are different.

thumbnail Fig. C.2.

Heatmaps of σ68 from the validation sample of each study. Top panel: galaxy samples in W1, W3, and W1+W3 combined. Bottom panel: galaxy samples in G09, COSMOS, and G09+COSMOS combined.

In general, we find that σ68 as a function of i and zs varies in the same way as seen in the Sect. 4 for the baseline samples. However, this behaviour is not observed for the G09 field. This is because the samples from the G09 field have spectroscopic information restricted up to i = 20.5 and zs = 0.6, i.e., it considers only bright galaxies. Therefore, in this field, lower values are generally observed in the metrics compared to other samples.

Appendix D: Extrapolation to other regions of the sky

In this appendix we study the impact of applying DEEPZ on sky regions not included in the training set. Specifically, we test how the performance changes if being is trained in W1 and validated in W3 or vice versa. In Fig.D.1, you can see σ68 as a function of the i-band magnitude, photo-z, and spectroscopic redshift.

The trends shown in Fig. D.1, when the training and validation samples match, are consistent with those seen in Figs. 3 and C.2. A higher precision is observed in the W1 field compared to the W3 field, with a slight deterioration when the validation samples are swapped between these two fields. However, this decrease is more pronounced when DEEPz is trained exclusively with galaxies from the W3 field and applied to galaxies from the W1 field. Based on these trends, it is likely that performance in new fields will vary slightly in new regions.

thumbnail Fig. D.1.

Trends in the measurement of σ68. The distribution is divided into ten bins, each containing the same number of objects, based on the apparent magnitude in the i-band, zp and zs, in the left and right columns, respectively. These trends show the precision, in terms of the centralised scatter σ68, in the W1 and W3 validation sets. The solid line represents cases where the training and validation samples use the same field, while the dashed line indicates training with W1 and validation with W3, or vice versa.

All Tables

Table 1.

Sample sizes for each wide field.

Table 2.

Description of the data refinement test.

Table 3.

Close galaxy pair counts of sub-samples identified with DEEPz and BCNz2.

All Figures

thumbnail Fig. 1.

Aitoff Galactic projection showing the location of the PAUS fields and the Galactic extinction. The level of extinction is shown in terms of the E(B − V) parameter on a linear scale, as shown by the key. The data are from the dust map provided by the Planck Collaboration (Planck Collaboration Int. XLVIII 2016). The W1, G09, W3, and COSMOS wide fields are shown in green, orange, violet, and pink, respectively.

In the text
thumbnail Fig. 2.

DEEPz network flow diagram. The data are shown in grey and the networks in violet. The network architecture consists of an autoencoder with ten layers and 250 nodes in both the encoder and decoder. Each layer includes linear transformations followed by ReLU non-linearities, batch normalisation, and a 2% dropout, except for the last three layers. The autoencoder is fed galaxy flux ratios. The zp network, which follows the same structure as the autoencoder, takes both the galaxy flux ratios and autoencoder features as input. It includes 1% dropout after all linear layers. This network is a MDN, representing the redshift distribution as a linear combination of 10 normaldistributions.

In the text
thumbnail Fig. 3.

Trends in the measurement of σ68. The distribution is divided into ten bins, each containing the same number of objects, based on the apparent magnitude in the i-band and zs, in the left and right columns, respectively. These trends show the precision, in terms of the centralised scatter σ68, in the W1+W3 and G09+COSMOS validation sets.

In the text
thumbnail Fig. 4.

The upper row depicts trends and variations in the measurement of σ68, while the lower row shows the fraction of outlier values. The distribution is divided into ten bins, each containing an equal number of objects, based on the magnitude in the i-band, zp, and zs, positioned in the left, centre, and right columns, respectively. These trends represent the findings from seven tests, summarised in Table 2, implemented on the W1+W3 and G09+COSMOS baseline samples.

In the text
thumbnail Fig. 5.

The upper panels show trends in the measurement of σ68 with respect to i (left panel) and zs (right panel). There are separate curves for the results using the VIPERS and GAMA_SDSS redshifts. The lower panel shows the spectroscopic redshift distribution from the VIPERS and GAMA_SDSS surveys in the W1 field.

In the text
thumbnail Fig. 6.

Comparison of the σ68 values (top row) and outlier fractions (bottom row) for the W1 and G09 samples obtained using the DEEPz and BCNz2 methods. Columns (1) and (2) correspond to the implementation of the model trained with the W1 sample with missing NB values inferred. Columns (3) and (4) correspond to the implementation of the model trained with the G09+COSMOS sample with missing NB flues inferred. Upper panels (a) and (b): σ68 for DEEPz and BCNz2 plotted against i-band magnitude for (a) and against redshift for (b). Lower panels (c) and (d): Fraction of outliers with Δz >  0.02 plotted as a function of i-band magnitude for (c) and redshift for (d).

In the text
thumbnail Fig. 7.

Scatter plots of errors on the x-axis for the DEEPz method and on the y-axis for the errors obtained when determining zp using the BCNz2 method. Top panel: scatter plot of errors for each method along with a histogram showing the distribution of points along each axis. Bottom-left panel: scatter plot of errors for each method colour-coded according to the apparent magnitude in the i-band. Bottom-right panel: scatter plot of errors for each method colour-coded according to zs.

In the text
thumbnail Fig. 8.

Posterior density distributions of the fitted log M200. Each row shows the distributions for the pairs selected from the BCNz2 and DEEPz methods in purple and orange, respectively. The selection cuts according to pair properties are shown using thicker and narrower lines as referred to in the legends. Vertical lines indicate the median values and the shaded regions enclose 68% of the distributions corresponding to the errors. In each panel, the stacked pair sub-sample described in Table 3 is specified.

In the text
thumbnail Fig. 9.

Histograms and box plots showing a comparison of the distributions of the photometric redshifts of close galaxy pairs. Orange represents the sub-samples of pairs determined using zDEEPz, and black represents sub-samples of pairs determined with zBCNz2, respectively. Upper panel: distributions corresponding to the samples DEEPz= and BCNz2=. Lower panel: distributions corresponding to the samples DEEPz and BCNz2.

In the text
thumbnail Fig. 10.

Scatter plot between the errors of DEEPz and BCNz2 along with density distributions of galaxies in common pairs (pairDEEPz = pairBCNz2) and galaxies in non-common pairs (pairDEEPz ≠ pairBCNz2) between the two methods.

In the text
thumbnail Fig. A.1.

Comparison of σ68 and σNMAD. The plot shows the σ68/σNMAD ratio for the redshift estimates in the W1 validation set. Panel (a) is a function of the i-band magnitude, while panel (b) shows the trends in terms of spectroscopic redshift.

In the text
thumbnail Fig. B.1.

Visualisation of univariate study of bands used in W1 and W3 fields. Left panel: correlation among the bands through Pearson correlation. Central panel and right panel: bar chart of the scores of the correlated bands with the spectroscopic redshift bins calculated through MI and regression methods.

In the text
thumbnail Fig. C.1.

Trend of σ68 as a function of the i-band magnitude and spectroscopic redshift for each field and the combination W1+W3 and G09+COSMOS. Each row corresponds to a data refinement study, hence the validation sets are different.

In the text
thumbnail Fig. C.2.

Heatmaps of σ68 from the validation sample of each study. Top panel: galaxy samples in W1, W3, and W1+W3 combined. Bottom panel: galaxy samples in G09, COSMOS, and G09+COSMOS combined.

In the text
thumbnail Fig. D.1.

Trends in the measurement of σ68. The distribution is divided into ten bins, each containing the same number of objects, based on the apparent magnitude in the i-band, zp and zs, in the left and right columns, respectively. These trends show the precision, in terms of the centralised scatter σ68, in the W1 and W3 validation sets. The solid line represents cases where the training and validation samples use the same field, while the dashed line indicates training with W1 and validation with W3, or vice versa.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.