Mapping the Milky Way with Gaia Bp/Rp spectra

Xianhao Ye; Wenbo Wu; Carlos Allende Prieto; David S. Aguado; Jingkun Zhao; Jonay I. González Hernández; Rafael Rebolo; Gang Zhao; Zhuohan Li; Carlos del Burgo; Yuqin Chen

doi:10.1051/0004-6361/202452871

Home

All issues

Volume 695 (March 2025)

A&A, 695 (2025) A75

Full HTML

Open Access

Issue		A&A Volume 695, March 2025


Article Number		A75
Number of page(s)		17
Section		Catalogs and data
DOI		https://doi.org/10.1051/0004-6361/202452871
Published online		11 March 2025

A&A, 695, A75 (2025)

I. Systematic flux corrections and atmospheric parameters for 68 million stars

Xianhao Ye¹^,2^,3, Wenbo Wu¹^,2^,3^★, Carlos Allende Prieto²^,3^★★, David S. Aguado²^,3, Jingkun Zhao¹, Jonay I. González Hernández²^,3, Rafael Rebolo²^,3, Gang Zhao¹^,4^★★, Zhuohan Li¹^,4, Carlos del Burgo² and Yuqin Chen¹

¹ National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100101, PR China
² Instituto de Astrofísica de Canarias, Vía Láctea, 38205 La Laguna, Tenerife, Spain
³ Universidad de La Laguna, Departamento de Astrofísica, 38206 La Laguna, Tenerife, Spain
⁴ School of Astronomy and Space Science, University of Chinese Academy of Sciences, Beijing 100049, PR China

^★★ Corresponding authors; carlos.allende.prieto@iac.es; gzhao@nao.cas.cn

Received: 4 November 2024
Accepted: 13 January 2025

Abstract

Context. Gaia Bp/Rp spectrophotometry for over two hundred million stars has been publicly released as part of Gaia Data Release 3 (DR3). These data have great potential for mapping metallicity across the Milky Way. Several recent studies have analyzed this data set to derive atmospheric parameters and identify new metal-poor stars. In addition, systematics in the fluxes of the Bp/Rp spectra have also been identified and characterized.

Aims. We aim to construct an alternative catalog of atmospheric parameters from Gaia Bp/Rp spectra by fitting them with synthetic spectra based on model atmospheres, and provide corrections to the Bp/Rp fluxes according to stellar colors, magnitudes, and interstellar extinction.

Methods. We use GaiaXPy to obtain calibrated spectra and apply FERЯ to match the corrected Bp/Rp spectra with models and infer atmospheric parameters. We train a neural network (NN) using stars in the Apache Point Observatory Galactic Evolution Experiment (APOGEE) to predict flux corrections as a function of wavelength for each target.

Results. Based on the comparison with APOGEE parameters, we conclude that our estimated parameters have systematic errors and uncertainties in T_eff, log g, and [M/H] about −38 ± 167 K, 0.05 ± 0.40 dex, and −0.12 ± 0.19 dex, respectively, for stars in the range 4000 ≤ T_eff ≤ 7000 K. The corrected Bp/Rp spectra show improved agreement with both models and Hubble Space Telescope (HST) CALSPEC data. Our correction increases the precision of the relative spectrophotometry of the Bp/Rp data from 3.2–3.7% to 1.2–2.4%. We also compare our results with other similar catalogs from the literature and validate them using star clusters. Finally, we have built a catalog of atmospheric parameters for stars within 4000 ≤ T_eff ≤ 7000 K, comprising 68 394 431 sources, along with a subset of 124 188 stars with [M/H] ≤ −2.5. Our catalogs and flux correction code are publicly available.

Conclusions. Our results confirm that the Gaia Bp/Rp flux calibrated spectra show systematic patterns as a function of wavelength that are tightly related to colors, magnitudes, and extinction. Our optimization algorithm can give us accurate atmospheric parameters of stars with a clear and direct link to models of stellar atmospheres, and can be used to efficiently search for extremely metal-poor (EMP) stars.

Key words: catalogs / stars: abundances / stars: fundamental parameters / Galaxy: stellar content

^★

The second author also made substantial contributions to the paper.

© The Authors 2025

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

The stellar metallicities [M/H] keep track of the formation and evolution of the Milky Way. Due to the chemical enrichment of the interstellar medium by successive stellar generations, [M/H] can be used as a proxy of age and reflects the birth environment of a star. Assuming that metal-rich disk stars are born on nearly circular orbits in the Galactic plane from chemically well-mixed cold gas, mono-abundance stellar populations with the same metal content ([M/H], [α/Fe]) shall be formed at the same birth radius R_b and look-back time τ (Schönrich & Binney 2009). This assumption has been used to establish a comprehensive chemodynamical model of our Galaxy (Sharma et al. 2021; Lian et al. 2022; Imig et al. 2023; Chen et al. 2023; Binney & Vasiliev 2024). Combined with kinematics, the metallicities of metal-poor halo stars serve as a powerful tool to separate in-situ populations from the remnants of ancient accretion events (Naidu et al. 2020; Belokurov & Kravtsov 2022; Conroy et al. 2022; Deason & Belokurov 2024). These newly found accreted substructures are useful to check the ACDM cosmological paradigm in which Milky Way–sized halos are built from the mergers of smaller satellite galaxies. Extremely metalpoor (EMP) stars may have formed from gas enriched only by the very first stars, also called Pop III (Klessen & Glover 2023). Therefore, their abundance patterns hold the key to constrain nucleosynthesis in the earliest supernova events (Nomoto et al. 2006; Heger & Woosley 2010; Aoki et al. 2014; Nomoto et al. 2013; Koutsouridou et al. 2023). The low-metallicity tail of the metallicity distribution function (MDF) provides essential constrains on the chemical enrichment in the early phases of the formation of the Milky Way (Salvadori et al. 2007; Komiya et al. 2010; Yamada et al. 2013; Sarmento et al. 2019; Tarumi et al. 2020; Youakim et al. 2020). However, at present only a few hundred stars with [M/H] < −3 and significantly fewer with [M/H] < −4 have been identified, and these are insufficient to give a complete picture.

We are witnessing an era in which stellar atmospheric parameters are available for massive samples thanks to large spectroscopic surveys, such as the Sloan Digital Sky Survey (SDSS) I-IV (York et al. 2000; Abazajian et al. 2004; Aihara et al. 2011; Abdurro’uf et al. 2022), the Large Sky Area MultiObject Fiber Spectroscopic Telescope (LAMOST) survey (Zhao et al. 2006, 2012; Cui et al. 2012; Luo et al. 2012), the GALactic Archaeology with HERMES¹ (GALAH) survey (De Silva et al. 2015; Buder et al. 2021), the SDSS V (Kollmeier et al. 2017), the Dark Energy Spectroscopic Instrument (DESI) survey (Cooper et al. 2023), the 4-metre Multi-Object Spectroscopic Telescope (4MOST) survey (de Jong et al. 2019), or the WHT² Enhanced Area Velocity Explorer (WEAVE) survey (Jin et al. 2024). These surveys have already accumulated tens of millions of stellar spectra, and continue to gather more. Combining the spectroscopic data with accurate measurements of proper motions and parallaxes provided by the European Space Agency (ESA) Gaia mission (Gaia Collaboration 2016, 2023b), enables us to carry out a detailed investigation of the chemodynamic properties of different stellar populations of the Milky Way.

In addition to the astrometric catalog, Gaia Data Release 3 (DR3; Gaia Collaboration 2023b) includes a catalog of 32.2 million high-resolution spectra (R ∼ 8000) centered on the near infrared Ca II triplet, obtained by the Radial Velocity Spectrometer (RVS) instrument (Cropper et al. 2018; Sartoretti et al. 2018; Andrae et al. 2023a). It also includes a catalog of 220 million low-resolution spectra, referred to as Bp/Rp spectra (hereafter XP spectra), obtained with the blue (wavelength range of 330– 680 nm) and red (wavelength range of 640–1050 nm) Gaia slitless spectrophotometers, BP and RP, respectively (Carrasco et al. 2021; Montegriffo et al. 2023; De Angeli et al. 2023). The XP spectra have 110 pixels and a variable resolving power, ranging from 20 to 90, as a function of wavelength. The XP spectra in DR3 are represented by Hermite basis functions (Montegriffo et al. 2023).

The Gaia Data Processing and Analysis Consortium (DPAC) has developed the Python library GaiaXPy³ to facilitate handling XP spectra. This software allows the transformation of coefficients into calibrated spectra and photometry. Recent studies have shown that the synthetic photometry from XP spectra can be used to perform a highly precise calibration of other photometry surveys, such as Pristine (Martin et al. 2024), the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) (Xiao et al. 2023a), or the Javalambre- Photometric Local Universe Survey (J-PLUS) (Xiao et al. 2023b; Chandra et al. 2024). However, other studies have revealed that XP calibrated spectra suffer from systematic errors that show up as “wiggles” (Huang et al. 2024a), introduced by the combination of noise and the choice of data representation (Weiler et al. 2023). The pattern of wiggles largely depends on the stellar color G_BP – G_RP and the apparent magnitude G (Montegriffo et al. 2023), and may cause a modest bias in the synthetic photometry Huang et al. (2024a) and the inferred stellar parameters. Due to its low-resolution and the aforementioned systematics, it is hard to determine chemical abundances from XP spectra. Nonetheless, we are still able to obtain a reliable estimation of the overall metallicity ([M/H]) down to the EMP domain, as shown using mock XP spectra in Witten et al. (2022). Considering the large data volume and the full sky coverage, XP spectra provide us with a unique opportunity to build a complete metallicity map of the Milky Way.

Previous studies have adopted different methodologies to extract information from the low-resolution XP spectra. Traditional model-driven methods predict the atmospheric parameters by comparing the observed spectra with stellar spectral libraries (e.g., Castelli et al. 1997; Lejeune et al. 1998; Coelho et al. 2005; Gustafsson et al. 2008; Husser et al. 2013; Allende Prieto et al. 2018). As part of the astrophysical parameters inference system (Apsis), the Gaia General Stellar Parameterizer from Photometry (GSP-phot) adopts a Bayesian forward-modelling approach to fit XP spectra with isochrone models and four different stellar libraries (Andrae et al. 2023a). It proviides an estimate of effective temperature T_eff, surface gravity log_eff, metallicity [M/H], absolute magnitude M_G, radius, distance, and extinction for each star. However, the authors of GSP-Phot advise against using these metallicity estimates since they are dominated by large systematic errors.

An et al. (2024) constructed an all-sky 3D extinction map by comparing the low-resolution XP spectra with their empirically calibrated synthetic spectra and gave a reliable estimate of metallicity for stars with [M/H] > −1. Martin et al. (2024) built a model linking the metallicity-sensitive synthetic CaHK magnitudes to the photometric metallicity [M/H]_phot from XP spectra. Even though only [M/H] measurements are provided, their results show good agreement with the literature in the full metallicity range, even for stars under [M/H] = −3. Bellazzini et al. (2023) and Xylakis-Dornbusch et al. (2024) also tried to extract metallicity information from the synthetic photometry, but they only determined [M/H]phot for a small fraction of the XP spectra available in DR3. The intrinsic nature of the estimation of [M/H]_phot also involves a comparison between synthetic spectra and observations, albeit indirectly, through the photometry generated from them. The accuracy of model-driven methods highly relies on the consistency between the XP spectra and the synthetic spectra. Unfortunately, there are several points that may induce discrepancies between them.

First, the theoretical spectra are limited by our knowledge of stellar atmospheres, atomic and molecular physics. Second, XP spectra are not perfectly flux-calibrated. Third, the information in the low-resolution spectra is highly sensitive to surface temperature T_eff and gravity log ɡ. The metallicity determined from XP spectra with low (T_eff < 4000 K) and high (T_eff > 7000 K) temperature are likely unreliable, due to the limited strength of metal lines in stars of such low metallicity and high effective temperature.

In the last decade, data-driven and machine learning methods have been widely adopted to overcome the gap between synthetic spectra and observations (e.g., Ness et al. 2015; Ting et al. 2017, 2019; Xiang et al. 2019; Leung & Bovy 2019). Data-driven methods use a large training sample with known labels to construct a relationship between the stellar parameters and the spectra. Previous studies find that data-driven methods are effective to derive precise atmospheric parameters and elemental abundances for low-resolution spectra by applying them to the LAMOST survey (Wilson et al. 2019; Li et al. 2022b; Li &Lin 2023). Therefore, it is natural to turn to data-driven methods to infer stellar properties from XP spectra (e.g., Rix et al. 2022; Andrae et al. 2023b; Zhang et al. 2023; Sanders & Matsunaga 2023; Li et al. 2024; Xylakis-Dornbusch et al. 2024; Laroche & Speagle 2024; Leung & Bovy 2024; Yao et al. 2024; Avdeeva et al. 2024; Fallows & Sanders 2024).

Rix et al. (2022) derived the stellar metallicity for a sample of 2 million giant stars within 30° of the Galactic center using the XGBoost algorithm. They achieved a remarkably median precision of δ[M/H] < 0.1 by adopting the SDSS DR17 (the Apache Point Observatory Galactic Evolution Experiment, shortened as APOGEE; Abdurro’uf et al. 2022) as the training sample. Andrae et al. (2023b) applied the XGBoost algorithm to the whole catalog and estimated the stellar atmospheric parameters (T_eff, log ɡ, [M/H]) for over 175 million stars. They have a mean stellar parameter precision of 0.1 dex in [M/H], 50 K in T_eff, and 0.08 dex in log ɡ. Besides APOGEE DR17, they also included the metal-poor stars from Li et al. (2022a) in the training sample to break the low-metallicity boundary.

Zhang et al. (2023) developed an empirical forward model using a neural network (NN) to estimate the stellar atmospheric parameters, distance, and extinction for 220 million stars from XP spectra. Given the stellar parameters, they could provide a prediction of the XP spectra. Laroche & Speagle (2024) developed a novel implementation of a variational auto-encoder which achieves competitive XP spectra reconstructions without relying on stellar labels. Their results suggest that meaningful information related to [α/Fe] is hidden in the XP spectra. Recently, Li et al. (2024) and Hattori (2024) successfully extracted the information on [α/Fe] for partial XP spectra by using machine learning models trained on APOGEE DR17. In addition, Leung & Bovy (2024) built a transformer-based model and trained it in a self-supervised manner on a compiled data set. Differently from previous studies, they could not only derive stellar parameters from XP spectra, but also predict the XP spectra given the stellar labels. In general, data-driven methods are efficient and accurate in analyzing the XP spectra, and have allowed the deviation of several parameters, like [α/Fe], that are hard to derive using traditional model-driven methods at such low spectral resolution.

Although data-driven methods have the potential to achieve great success in exploring the low-resolution XP spectra, their output is highly constrained by the quality of the training data. Most studies adopted APOGEE DR17 as a training sample, and several selection criteria were applied to ensure the quality of the stellar labels from APOGEE. Due to those cuts, the T_eff of the training sample is mainly in the range of 3500 K to 7000 K. Therefore, their constructed models are not applicable to stars outside this temperature range. To cover more spectral types in the training sample, Zhang et al. (2023) combined the standard AFGK catalog from LAMOST DR8 and the hot stars (high T_eff) catalog from the Hot Payne (Xiang et al. 2022). However, there are systematic differences between these two catalogs, and their model prediction exhibits a large scatter from the literature for stars of T_eff > 7500 K. Besides the limitation in T_eff, both the APOGEE and LAMOST catalogs have a low boundary of metallicity at [M/H] = −2.5, which prevents the application of the method to more metal-poor stars. Despite the inclusion of more metal-poor stars from other surveys can remove the low-metallicity boundary, the very small size of the training sample prevents an appropriate coverage of the parameter space, and the metallicity estimations only show a very small improvement at [M/H] < −2.5. As a comparison, the high-resolution spectroscopic follow-up of metal-poor candidates selected from the catalogs of [M/H]_phot demonstrated the potential of identifying EMP stars through traditional model-driven methods Xylakis-Dornbusch et al. (2024); Mardini et al. (2024).

In this study, we combine data-driven and model-driven methods to construct a catalog (T_eff, log ɡ, [M/H]) for about 68 million stars with XP spectra. To reduce the influence of systematic errors (wiggles), we develop a NN model which corrects the XP spectra according to their stellar labels in Section 4. This correction helps in overcoming the gap between synthetic spectra and observations, and the stellar atmospheric parameters predicted from the corrected spectra are more reliable, as shown in Section 5. The paper is summarized in Section 6.

2 Data

We determine and study systematic errors in the Gaia XP fluxes using a sample of XP spectra cross-matched with the final release of APOGEE, included in the DR17 of SDSS. We correct the XP spectra using the systematic pattern detected in the APOGEE sample, and check the impact on the inferred stellar atmospheric parameters (comparing to APOGEE) and on the accuracy of the fluxes (comparing to CALSPEC data; Bohlin et al. 2014, 2019; Bohlin & Lockwood 2022). In this section, we describe each reference catalog in detail.

2.1 Gaia XP spectra

Since XP spectra are described by a set of polynomial coefficients, we use the Python library GaiaXPy to transform these coefficients into flux-calibrated spectra and synthetic photometry. GaiaXPy provides an option to represent the spectrum with a smaller set of basis functions, avoiding the noise associated to higher-order terms. In addition to this truncation, we apply a cut in the wavelength range, eliminating regions where the transmission is less than 10%. To remove the influence of the dust extinction, we use the Python package extinction⁴ adopting the ccm89 model (Cardelli et al. 1989). The reddening for each individual spectrum is calculated from the two-dimensional SFD dust map (Schlegel et al. 1998; Schlafly & Finkbeiner 2011) using the Python package dustmap (Green 2018).

The data analysis code FERЯ (Allende Prieto et al. 2006)⁵, which matches numerical models to the observations by minimizing the χ², is used to search for the optimal stellar atmospheric parameters (T_eff, [M/H], and log ɡ). Several algorithms are available in FERЯ, and the Powell’s UOBYQA (unconstrained optimization by quadratic approximation) algorithm (Powell 2002) is adopted in our work, initializing the search at the grid center. This is a local algorithm, which may lead to incorrect solutions in some cases, but it is chosen due to its high speed, critical for very large samples. We tested applying a Markov chain Monte Carlo algorithm (MCMC, algor= 5 in FERЯ), which performs a global search, to find out that the results are very similar to those obtained with the algorithm adopted in this work. We therefore conclude that Powell’s UOBYQA algorithm are both efficient and accurate in this case. We computed a fresh library of model spectra with constant resolution using Synple⁶, and Kurucz model atmospheres as in the nsc library of Allende Prieto et al. (2018), spanning −5 < [M/H] ≤ +0.5, 3500 K ≤ T_eff < 8000 K, and 1 ≤ log ɡ < 5. This library is publicly released together with this paper. For the model with variable resolution, the upper limit of T_eff is 12 000 K.

Gaussian convolution is performed to reduce the resolution of the finely sampled spectral synthesis calculations and match it to the XP data. In this study, we adopt two different resolutions for the model spectra: one is a constant resolution of R ~ 104, and the other is a variable resolution that changes with the wavelength as the XP spectra (refer to Table 1 in Montegriffo et al. 2023). For the constant resolution model, the corresponding sampled XP spectrum (S_const) ranges from 360 to 990 nm, and has 330 points evenly spaced on a logarithmic scale. For the variable resolution, the corresponding XP spectrum (S_var) ranges from 360 to 992.1 nm, and has 270 points in increments of the half of Full width at half maximum ( $FWHM = \frac{λ}{R}$ ${\rm{FWHM}} = {\lambda \over R}$ , where λ is the wavelength), based on information from Montegriffo et al. (2023, see also Carrasco et al. 2021).

FERЯ returns the best fitting spectra that most closely resemble the XP observations. The spectra are normalized by their mean values as in Equation (1), and we define the residual N∆ Flux as the difference between the normalized fitting (NFlux_fitting) and XP (NFlux_XP) spectra.

$\begin{matrix} N Flux = Flux / Mean(Flux) \\ N Δ Flux = N {Flux}_{fitting} - N {Flux}_{XP} \end{matrix}$ $\matrix{ {N{\rm{Flux}} = {\rm{Flux}}/{\rm{Mean(Flux}})} \cr {N{\rm{\Delta Flux}} = N{\rm{Flux}}{{\rm{}}_{{\rm{fitting}}}} - N{\rm{Flux}}{{\rm{}}_{{\rm{XP}}}}} \cr }$ (1)

In previous studies of Montegriffo et al. (2023) and Huang et al. (2024a), they also gave a similar definition of the systematic error as (Flux_XP − Flux_ref)/Flux_ref or Flux_ref /Flux_XP, where Flux_ref is the corresponding reference spectra from external data. All the fluxes we discuss hereafter are normalized, and the symbol N will be dropped for clarity. The general shapes of the XP spectra are consistent with the best fitting models in most cases. The residuals are presented in a complex function that oscillates with the wavelength as shown in Figure 1, and its amplitude is usually larger in the blue band than the red band. In this study, we refer to these residuals as wiggles.

Visual inspection indicates that stars with similar parameters exhibit similar wiggles. To fully understand what are the main parameters that control the pattern, we randomly select 800 000 stars at different Galactic latitude: 200 000 at low-latitude (|b| < 10°), 300 000 at medium-latitude (10° ≤ |b| < 30°), and 300 000 at high-latitude (30° < |b| < 90°). The corresponding Flux_fitting and ∆Flux of these stars are obtained using FERЯ. The randomly selected XP sources have an extended distribution in the color-magnitude diagram (CMD), with −0.5 < G_BP − G_RP < 3.5 and 4 < G < 17.5. In Section 3 we explore the impact of stellar colors, apparent magnitude, latitude, and metallicity on the wiggles using these randomly selected XP sources.

Fig. 1

Density distribution of residuals as a function of wavelength for stars with different parameters. Subfigures (a), (b), and (c) are results of stars in bins of the same magnitude range 13 < G₀ < 15 but different stellar colors. Subfigures (b), (d), and (e) show the density distribution of residuals for stars within the same range of color 1.0 < (G_BP − G_RP)₀ < 1.2 but different G₀. Subfigure (f) shows the residuals of stars of the same color-magnitude as subfigures (b) but in different Galactic latitude (|b| < 10°). The black dash lines represent the P₅₀ percentiles distributions, which can be viewed as a robust representation of the patterns of wiggles. From the P₅₀ lines we can see that the wiggles change with (G_BP − G_RP)₀ and G₀ , but the stellar color has a larger impact than the magnitude. Stars of low latitude have a much more diffuse distribution of ∆Flux. The large uncertainties of the extinction map towards the Galactic disc may cause a bad extinction correction of Flux_XP for some stars. Therefore, Flux_XP is more likely to deviate from the fitting result Flux_fitting for stars at lower latitude.

2.2 Metal-poor stars sample

The search for metal-poor stars is one of the most interesting applications of XP data, and therefore we carry out specific evaluations for that type of stars. We select metal-poor stars from three libraries: JINAbase (Abohalima & Frebel 2018), the Pristine survey (Aguado et al. 2019), and the LAMOST-Subaru metal-poor survey (Li et al. 2022a; Aoki et al. 2022). JINAbase is a collection of chemical abundances and stellar parameters for 1659 metal-poor stars (60% of which have [M/H] < −2.5) from the literature, published in the period between 1991 and 2016. The Pristine survey includes a medium-resolution spectroscopic follow-up of 1007 metal-poor candidates identified from the narrow-band photometry, and more than 900 stars which have been confirmed to have [M/H] < −2. The LAMOST-Subaru survey presents measurements for over 20 elements in 385 stars covering a wide metallicity range from −1.7 to −4.3. After crossmatching with the XP spectra and applying a cut of [M/H] < −2, our final metal-poor sample consists of 1813 stars. Most of them are at high-latitude, with E(B − V) < 0.1. The colors and magnitudes of these stars are roughly in the range of 0.5 < G_BP − G_RP < 1.5 and 6 < G < 17.5.

2.3 APOGEE DR17

APOGEE (Majewski et al. 2017; Wilson et al. 2019) is a large-scale spectroscopic survey of stars in the Milky Way. APOGEE provides high-resolution (R = λ/Δλ ~ 22 500) and high signal-to-noise ratio (S/N > 70 typically) spectra throughout the near infrared wavelength range of 1.51−1.70 µm.

The stellar atmospheric parameters and elemental abundances are determined by the APOGEE Stellar Parameters and Chemical Abundances Pipeline (ASPCAP; García Pérez et al. 2016) based on FERЯ. APOGEE provides high-precision measurements of the kinematics and chemistry of the Milky Way structures (bulge, disk, and halo). The Seventeenth Data Release of the APOGEE survey contains spectra and abundances for 733 901 stars (Abdurro’uf et al. 2022).

2.4 CALSPEC libraries

Considering the wide wavelength coverage of the XP spectra, we choose the CALSPEC library (Bohlin et al. 2014, 2019; Bohlin & Lockwood 2022) as the reference spectra. CALSPEC is a library of flux standards on the Hubble Space Telescope (HST) system. Most of them have a complete HST’s Space Telescope Imaging Spectrograph (STIS) coverage of wavelength from the ultraviolet to near-infrared band with a resolving power R ≈ 560–700. We find 109 CALSPEC stars with XP spectra, and their colors and magnitudes are mainly in the range of −1.0 < G_BP − G_RP < 1.5 and 4 < G < 18. The CALSPEC collection of spectra are likely among the most accurate flux calibrated spectra available, with a 2–3% accuracy in absolute flux. The uncertainty of the monochromatic flux at 555.75 nm (555.6 nm in air) is 0.5% or 0.005 mag see Bohlin et al. 2014, and the CALSPEC Calibration Database).

2.5 LAMOST

LAMOST is a ground-based innovative telescope designed with a large aperture and a wide field-of-view (Zhao et al. 2006, 2012; Cui et al. 2012; Luo et al. 2012). Beginning with a pilot survey in 2011, LAMOST has observed over ten million spectra in the Northern sky with a limiting magnitude of r = 17.8 (Yan et al. 2022).

LAMOST Low-Resolution Spectroscopic (LRS) survey provides low-resolution (R ∼ 1800) spectra with a wavelength coverage of 0.37-0.90 µm. In this study, we use the LRS Stellar Parameter Catalog of A, F, G and K stars from the LAMOST DR11, which contains the atmospheric parameters of 7 774 147 stars with a typical error of ~43 K for T_eff, 0.06 dex for log ɡ, and 0.04 dex for [M/H].

3 Pattern of wiggles

Comparison between XP spectra and model predicted flux, Montegriffo et al. (2023) shows that the residuals are correlated with G_BP − G_RP color and G magnitude. The stellar color is closely associated with surface temperature, which plays a critical role in shaping the spectra energy distribution (SED) through photoionization (Allende Prieto 2023), mainly through hydrogen atoms and the H⁻ ion. The dependence of the wiggles on magnitude may likely be caused by the complex internal calibration of XP spectra. As illustrated by Andrae et al. (2023a), the spectra of stars brighter than G = 11.5 are recorded in 2D windows, which have different read-out configuration (gates) depending on magnitude, while fainter stars are mainly observed as 1D windows and under a more uniform readout strategy. Montegriffo et al. (2023) noted that a built-in assumption in the external calibration (which involves calibrating the internally calibrated, continuously represented mean spectra to an absolute system) is that the instrument model is independent of brightness (or magnitude). Therefore, it cannot easily account for inconsistencies caused by the different observing modes, which may finally result in a correlation between residuals and G magnitude.

Inspired by previous studies, we randomly distribute our selected XP sources, limited to E(B − V) < 2, into different bins in de-reddened (G_BP − G_RP)₀ color and G₀ magnitude. Figure 1 shows the density distribution of residuals as a function of wavelength for stars with different values of (G_BP − G_RP)₀, G₀, and Galactic latitude b. We find that the wiggles change dramatically with color. For stars in the same color bins, we can see some scatter, but most of them have a very similar pattern, especially for stars in the range 1.0 < (G_BP − G_RP)₀ < 1.2.

Panels b, d, and e of Figure 1 show the distributions of residuals for stars with the same color but different magnitude G₀. The residuals’ pattern clearly changes with G₀ as well, although to a lesser extent than with (G_BP − G_RP)₀. The stellar color is a major determining factor of the SED, while the apparent magnitude G₀ is connected to the SED through the instrument model. Since the wiggles in the residuals largely depend on the SED, it is only natural that the stellar color plays a much more important role. For stars in the same color-magnitude bin but located at lower Galactic latitudes, the general pattern is similar, but the residuals have a much more diffuse distribution. We conclude that the stellar color G_BP − G_RP, apparent magnitude G, and dust extinction E(B − V) should be considered when modeling the observed systematic patterns.

Huang et al. (2024a) also considered the intrinsic color and extinction in their proposed corrections, but disregarded the weaker effects of [M/H] and log ɡ. This choice simplifies the correcting process, focusing on the accuracy of the correction for most stars. However, the libraries they used mainly consist of stars with [M/H] > −1, and the SEDs of very metal-poor (VMP) stars are quite different from those of metal-rich ones. To fully understand the influence of [M/H], we adopt the sample from Section 2.2, and obtain its density distribution of residuals in Figure 2. We can see that the largest differences are concentrated on the blue band, where the residuals of VMP stars have a smaller amplitude, with the maximum absolute value Max(|P₅₀|) < 0.08. In general, the pattern of fitting residuals for VMP stars is far more featureless than for metal-rich stars. Since one of our aims is to make an accurate estimation of metallicity from the wiggle-corrected XP spectra, we should take extra care with the influence of [M/H].

Fig. 2

Density distribution of residuals as a function of wavelength for VMP stars. Top panel shows the density distribution of the residuals of VMP stars with 1.0 < G_BP − G_RP < 1.2 and 13 < G < 15. Compared to the relatively more metal-rich sample in Figure 1, their wiggles have a smaller amplitude with Max (|P₅₀|) < 0.08. The bottom panel shows the difference of P₅₀ between these two samples. The main difference is in the blue band of λ < 6500 Å, while the wiggles in the red band is little affected by the change of metallicity.

4 Method

Our aim is to correct the wiggles observed in the residuals for a given spectrum and calculate atmospheric parameters by fitting the observations with model spectra using FERЯ. Therefore, we first need to characterize the systematic patterns as a function of color, magnitude, reddening and metallicity. This is what machine learning is good at. Therefore, we use the main relevant parameters as input into a neural-network model with multiple hidden layers, where the residuals from the fits are the output. We may not have the information of metallicity for a given star. Therefore, we strongly prefer to avoid using metallicity as input, and instead adopt various metallicity-sensitive photometric indices.

It is important to stress that there is a limited number of spectra with metallicity below −2.5 to train the NN model, so we do not trust and do not apply the derived correction to stars with metallicity under −2.5. In this section, we first describe how we predict the patterns via the NN, including the training dataset and the architecture of our NN model. Then we provide additional details on how we correct and fit all XP spectra in Gaia DR3.

4.1 Training database

To ensure the patterns in the residuals we characterize really reflect the differences between the XP and the theoretical spectra, we only use spectra with reliable estimations of atmospheric parameters, judging from the differences with the APOGEE parameters. We refer to the resulting sample as the training APOGEE sample (TAS) hereafter. The APOGEE sample is firstly cleaned by applying the cuts below, where the symbol A indicates APOGEE parameters: $\begin{array}{l} - & S / N > 70; \\ - & σ_{T_{eff}^{A}} \leq {250, σ}_{log g^{A}} \leq 0 . 5; \\ - & 3500 \leq T_{eff}^{A} \leq 8000 K or 12000 K, 1 \leq log g^{A} \leq 5 . \end{array}$ $- \hfill & {{{\rm{S}} \mathord{\left/ {\vphantom {{\rm{S}} {{\rm{N > 70;}}}}} \right. \kern-\nulldelimiterspace} {{\rm{N > 70;}}}}} \hfill \cr - \hfill & {{{\rm{\sigma }}{{\rm{T}}_{{\rm{eff}}}^{\rm{A}}}} \le {\rm{250,}}{{\rm{\sigma }}{{\rm{log}}\,{{\rm{g}}^{\rm{A}}}}} \le {\rm{0}}{\rm{.5;}}} \hfill \cr - \hfill & {{\rm{3500}} \le {\rm{T}}_{{\rm{eff}}}^{\rm{A}} \le {\rm{8000}}\,{\rm{K}}\,{\rm{or}}\,{\rm{12000}}\,{\rm{K,}}\,{\rm{1}} \le {\rm{log}}\,{{\rm{g}}^{\rm{A}}} \le {\rm{5}}{\rm{.}}} \hfill \cr$

We adopted two different values as the upper limits of $T_{eff}^{A}$ $T_{{\rm{eff}}}^{\rm{A}}$ for the two models with different resolutions. The upper limit is 8000 K for S_const and 12000 K for S_var. In the APOGEE DR17 catalog, there are some sources with Gaia source id = 0, which are dropped in our APOGEE sample. In addition, for sources with the same Gaia source id, we only keep the first one (~10% sources are duplicated). Then we obtain the initial parameters for the APOGEE sample (IAS). Next, we calculate the atmospheric parameters for XP spectra using FERЯ. After deriving T_eff, log ɡ, and [M/H], we trim down the IAS applying the following constraints: $\begin{array}{l} - & | T_{eff}^{XP} - T_{eff}^{A} | < 200 \\ - & | \log g^{XP} - \log g^{A} | < 0.5 \\ - & | {[M / H]}^{XP} - {[M / H]}^{A} | < 0.5, \end{array}$ $- \hfill & {\left| {T_{{\rm{eff}}}^{{\rm{XP}}} - T_{{\rm{eff}}}^{\rm{A}}} \right| < 200} \hfill \cr - \hfill & {\left| {\log \,{g^{{\rm{XP}}}} - \log \,{g^{\rm{A}}}} \right| < 0.5} \hfill \cr - \hfill & {\left| {{{\left[ {{{\rm{M}} \mathord{\left/ {\vphantom {{\rm{M}} {\rm{H}}}} \right. \kern-\nulldelimiterspace} {\rm{H}}}} \right]}^{{\rm{XP}}}} - {{\left[ {{{\rm{M}} \mathord{\left/ {\vphantom {{\rm{M}} {\rm{H}}}} \right. \kern-\nulldelimiterspace} {\rm{H}}}} \right]}^{\rm{A}}}} \right| < 0.5,} \hfill \cr$

where $T_{eff}^{XP}, \log g^{XP}, {[M / H]}^{XP}$ $T_{{\rm{eff}}}^{{\rm{XP}}},\log \,\,{g^{{\rm{XP}}}},{[{\rm{M}}/{\rm{H}}]^{{\rm{XP}}}}$ , are parameters estimated from the Gaia XP spectra by FERЯ. Finally, our TAS includes 157 478 stars for S_const, and 131 173 stars for S_var.

4.2 Input and output

As mentioned before, the shape of the residuals is highly related to the color, magnitude, and reddening of a star, and to a lesser extent, to its metallicity. Therefore, we use various of colors and magnitudes, including metallicity-sensitive colors, generated from XP spectra, as input for our NN model. We employ GaiaXPy and the built-in photometric system (Gaia Collaboration 2023a) to obtain the SkyMapper magnitudes u, υ, ɡ, i, and the metallicity-sensitive colors (Chiti et al. 2021) ɡ − i, υ − ɡ − 0.9 × (ɡ − i), and u − υ − 0.9 × (ɡ − i). In addition, the reddening E (B − V) calculated from dustmaps is included as one of the parameters.

To summarize, the 14 parameters involved are:

metallicity-sensitive colors: ɡ − i, υ − ɡ − 0.9 × (ɡ − i), u − υ − 0.9 × (ɡ − i).
SkyMapper photometric passbands: u, υ, ɡ, i.
Gaia photometric passbands and colors: phot ɡ mean mag, phot bp mean mag, phot rp mean mag, bp rp, bp ɡ, rp ɡ
reddening: E (B − V).

The output is the predicted flux corrections as a function of wavelength.

4.3 Neural network architecture

A simple NN model based on Pytorch (Paszke et al. 2017, 2019)⁷ is built for training with the TAS. The basic diagram illustrating our NN model is summarized in Figure 3. Briefly, we have 7 hidden layers in our model, and the number of neurons per hidden layer is shown in the figure. We have 14 input elements, as described above. The length of output for each target is 330 (for S_const, and 270 for S_var). The loss function and optimizer adopted in our model are MSELoss and Adam. The initial learning rate for training is set to 0.001, and it is reduced to half that value after 15 epochs, with the lower limit set to 10⁻⁶ . In order to prevent overfitting, we use EarlyStopping in our model.

The TAS data set is divided into a training sample (60%), a validation sample (20%), and a testing sample (20%). When the loss curve for the validation sample is not declining after 25 epochs, the train process is stopped. Our finally adopted model is trained on an Nvidia RTX 4090 in about 5 minutes for 234 epochs.

Our NN model performs very well in predicting the pattern in the residual for a given star. In Figure 4, we demonstrate the predicted wiggles for 20% sources in the TAS. As we can see from the figure, the predicted wiggles show significantly less scatter than the actual wiggles, but the mean pattern in the actual wiggles is clearly shown in the predicted ones. The bottom panel of this figure also indicates the patterns are largely removed after applying the correction.

Fig. 3

Diagram of our NN model based on Pytorch.

4.4 Fit all XP spectra

To determine atmospheric parameters for all the stars with Gaia XP spectra, we use FERЯ. This process is performed in three steps:

(1)
We fit all de-reddened XP spectra with FERЯ and arrive at an approximate estimation of T_eff and [M/H]:
- Stars with extinction A_V > 15 are removed; (This value is set quite high, as we do not intend to exclude any spectrum for its high reddening. The limit is primarily in place to prevent calculation errors. If we lower this value to 1.0 or 2.0, it will not exclude many more stars, as more than 99.2% of the stars in our final catalog have A_V < 2.0.)
(2)
We use the NN model to correct the systematics in the spectral energy distribution (wiggles):
- Spectra for which we are not able to generate the full set of synthetic photometry are not corrected (The reason is explained later);
- Spectra with T_eff in the first estimation larger than 8000 K will not be corrected (only activated during the process of fitting spectra with variable resolution models);
- Spectra with [M/H] < −2.5 in the first estimation will not be corrected.
(3)
We fit the XP spectra with FERЯ again.

Some spectra cannot produce all the passbands we require, typically the u and υ bands of SkyMapper in our test, with the u band being the most affected. According to Table 1 in Onken et al. (2024), the central wavelength and filter FWHM for the u band are 350 nm and 43 nm, respectively, while for the v band, these values are 384 nm and 31 nm. The starting wavelength of XP spectra is 330 nm, covering most wavelength range of the u band and the full wavelength range of the υ band. However, GaiaXPy fails to generate these two bands for some spectra, primarily for faint stars, though not all faint stars are affected by this issue.

Fig. 4

Comparison between the real patterns from TAS (20% of the whole sample, top panel) and the NN model predicted patterns (middle panel). The bottom panel shows the difference between the real and predicted patterns, from which we can see that the wiggles disappear.

Fig. 5

Testing on atmospheric parameters using sampling S_const. Top panels: comparison of T_eff, logg and [M/H] estimated from original XP spectra (Y -axis) and APOGEE survey (X-axis). Middle panels: similar to the top panels, but with Y-axis replaced by the results from corrected XP spectra. The color bar in each subfigure displays the number density. Bottom panels: histograms present the differences of T_eff, log ɡ and [M/H] between XP and APOGEE before and after correcting the systematic patterns. The mean values and standard deviations are shown in the labels.

5 Results

In this section, we discuss the estimation of parameters for the IAS after correcting the systematic pattern predicted by our NN model trained on the TAS data set. As described below, this correction makes the XP spectra fit the model spectra better, and in particular, we find a significantly better agreement with the HST CALSPEC observations.

5.1 Stellar atmospheric parameters

Besides determining the absolute flux, we find that the correction of the systematic pattern can improve the estimation of stellar atmospheric parameters from XP spectra. We first calculate parameters from XP spectra with sampling S_const. In Figure 5 we show the distributions of the stellar atmospheric parameters (T_eff, log ɡ, [M/H]) inferred from the original and wiggle-corrected spectra, comparing with parameters from the APOGEE survey. The top and middle panels compare parameters from XP and APOGEE before and after the correction. We keep stars with 4000 ≤ T_eff ≤ 7000 K to compare with APOGEE, because our analysis for stars with lower and higher temperatures performs worse. In addition, the stars in this figure have passed the selection based on the quality flag dflux_per, which describes the percentage of data points from ∆Flux ≡ Flux_XP − Flux_model exceeding ±0.05. We retain stars with dflux per below 20% before the correction and 8% after the correction. It is worth noting that more stars remain after correction, as the correction improves the fit between the XP spectra and the models. As a result, even with more stringent selection criteria, we still obtain more stars than before the correction.

Figure 5 shows that our estimations are highly consistent with APOGEE’s parameters both before and after corrections within the range 4000 ≤ T_eff ≤ 7000 K. But comparing the middle panels with the top panels, one can easily find that estimations for T_eff , log ɡ and [M/H] become somewhat better after correction, especially when examining the differences in log ɡ and [M/H]. Correcting the systematic pattern not only reduces the biases in T_eff and log ɡ, but it also makes our results more precise, as suggested from the smaller dispersion relative to APOGEE results.

We provide below a brief summary of the changes in the mean value and standard deviation of ∆T_eff, ∆ log ɡ, and ∆[M/H] for the nominal S const analysis:

∆T_eff: from −58.17 ± 170.61 to −38.40 ± 167.21;
∆ log ɡ: from 0.11 ± 0.53 to 0.05 ± 0.40;
∆[M/H]: from −0.09 ± 0.22 to −0.12 ± 0.19.

The most significant improvement is in the estimation of log ɡ, where the dispersion σ_{∆ log ɡ} is reduced from 0.53 to 0.40.

We also calculate parameters for S_var for the same sample of stars (IAS) with the same selection rules. We find that ∆T_eff , ∆ log ɡ, and ∆[M/H] are very similar with previous results using sampling S_const, summarized as follows.

∆Teff: from −62.11 ± 177.21 to −50.28 ± 163.46;
∆ log g: from −0.09 ± 0.73 to −0.05 ± 0.47;
∆[M/H]: from −0.13 ± 0.23 to −0.14 ± 0.19.

Despite the overall results are quite similar for the two different samplings, those for S_const before correcting patterns are slightly better. After correcting the patterns, we find that ∆T_eff , ∆ log ɡ, and ∆[M/H] from the two samplings are similar. However, the constant sampling still seems to provide better results, especially for T_eff and log ɡ. Therefore, we adopt constant sampling in the following sections and our series of papers.

5.2 Flux corrections

Correcting the wiggles has a significant impact on the fitting of XP spectra. We examine the log₁₀ (χ²) and RMS (root mean square) distributions and both parameters are significantly reduced, as shown in Figure 6 for the stars in the IAS. The RMS is defined: $RMS = \sqrt{\frac{1}{n}} \sum_{i} {(F_{i}^{XP} - F_{i}^{model})}^{2}$ $\mathrm{RMS} = \sqrt{\frac{1}{n} \sum_{i} \left( F^{\mathrm{XP}}_{i} - F^{\mathrm{model}}_{i} \right)^{2}}$ (2)

where F^XP, F^model are the fluxes of XP spectrum and model spectrum, i indicates the points of sampling, and n is the number of data points. Our correction makes log₁₀ (χ²) and RMS becoming way much smaller. As shown in the bottom panel of Figure 6, the peak of the RMS histogram decreases from 3.7% to 1.2%. In the following we check whether the correction helps in bringing the XP spectra closer to the CALSPEC data, which have a superb flux calibration.

Fig. 6

Histograms present log₁₀ (χ²) and RMS between the XP spectra and model spectra for the IAS data set before and after correcting the pattern, indicating by blue and orange, respectively.

5.3 The CALSPEC library

Given the excellent absolute flux calibration of the CALSPEC spectra, we use this library to perform an independent check of our proposed corrections. Since they have significantly larger resolving power, we apply Gaussian convolution to match their resolution to the XP spectra (and our model spectra). Then, we perform an interpolation of the smoothed spectra to match the sampling of XP data, and correct extinction in the same way we do for the XP spectra. Only spectra of stars estimated to have T_eff < 10⁴ K are used in this comparison, based on the parameter teff_gspphot in Gaia DR3.

We first check the RMS between the XP spectra and the spectra from CALSPEC, comparing the results before and after the correction. All spectra from XP and CALSPEC are dereddened with the same method (see in Section 2). The results for stars without quality cuts are shown in the left-hand panels of Figure 7, and the right panels display the results for stars that have passed the quality cuts. We find that the RMS with the correction of the pattern is overall reduced, with only one exception for stars that have passed the quality cuts. From the median RMS values, indicated by the red dashed lines in the figure, our correction reduces the RMS from 3.9% to 3.2%, or from 3.2% to 2.4%, depending on whether quality cuts are applied. Overall, the XP spectra are fit better with CALSPEC after correcting the systematic pattern. We also use the package provided by Huang et al. (2024a) to correct XP spectra as a comparison, shown in the two subfigures in the bottom panels. We de-redden and interpolate the spectra corrected by the package to match the sampling used in this paper, because the default sampling in our work differs from that of Huang et al. (2024a). From the orange and green histograms, shown in the middle and bottom panels in Figure 7, the overall improvements from our correction and the correction by Huang et al. (2024a) are comparable, regardless of whether quality cuts are applied. We should also mention that the resolution of our model spectrum is not exactly the same as that of the XP spectrum, as we use a constant resolution of around 100. However, the typical resolution of XP spectra is below ∼100. Consequently, the XP spectrum may be over-corrected, with some wiggles potentially arising from the higher resolution of models adopted in this paper. However, we have checked the results using variable resolution and reach the same conclusions.

We also present plots for a few individual XP spectra comparing them to CALSPEC in Figure 8. After correcting the pattern, the XP spectra match better the CALSPEC data smoothed to the resolution of the Gaia spectrophotometry. Nonetheless, there are some wavelengths at which the corrected version gives poorer agreement with CALSPEC. It is worth noting that the resolution adopted in Huang et al. (2024a) is lower than that used in this study, which accounts for some of the observed differences between the corrected spectra by Huang et al. (2024a) and CAL-SPEC at the blue end of the spectra. We have also repeated the check with the Next Generation Spectral Library (NGSL; Gregg et al. 2006; Pal et al. 2023), but the improvement is not so clear, which we associate to the lower quality of the flux calibration of the library compared to the exquisite CALSPEC data.

Fig. 7

Distribution of RMS between CALSPEC and XP spectra. Left panels: histograms present RMS between the XP spectra and the spectra from CALSPEC libraries before and after correcting patterns, indicating by blue and orange, respectively. The green histogram shows the corresponding results obtained using the Python package from Huang et al. (2024a) to correct the XP spectra. Right panels: similar to those panels in the left, but with stars that have passed the quality cuts applied in Section 5.1. The red dashed lines represent the median values, which are also indicated by the number displayed in each panel.

5.4 Atmospheric parameters catalog of Gaia XP spectra

In Section 4.4 we describe our analysis of the whole sample of Gaia XP spectra. Using the same restrictions in Section 5.1 (4000 ≤ T_eff ≤ 7000 K, and a maximum of 8% or 20% of the data points with ΔFlux larger than 0.05), we build our catalogs of stellar parameters for stars with Gaia XP spectra. We have a global catalog containing 68 394 431 stars, and a metal-poor catalog containing 124 188 stars that have [M/H] < −2.5 and A_V ≤ 1.5.

Fig. 8

Examples of how pattern correction can help improve the fitting between the XP spectra and the CALSPEC. Three spectra with different temperatures are presented in the sub-panels, from top to bottom: BD+54 1216, HD 115169, and KF06T2 in CALSPEC. In each panel, the smoothed CALSPEC spectrum is depicted as a bold gray line, while the XP spectra, with and without correcting patterns, are shown in blue and orange, respectively. The green lines show the corrected spectra using the package provided by Huang et al. (2024a). To enhance readability, we include a few zoomed-in diagrams in each panel to highlight the improvements on spectrophotometry. The ∆Flux between XP and CAL- SPEC are also presented at the bottom of each panel.

5.4.1 Comparison with LAMOST

The global catalog has also been cross-matched with LAMOST DR11 low-resolution catalog using the Gaia source id, finding 3 006 606 common sources. We compare the atmospheric parameters T_eff, log ɡ, and [M/H] between our results and LAMOST in Figure 9.

5.4.2 Comparisons with XP catalogs from the literature

We also compare our results with recent catalogs from literature. The details of the comparisons with Andrae et al. (2023b) and Zhang et al. (2023) are provided in Appendix A. The bottom line is that there is fair agreement between the three catalogs, suggesting that the uncertainties in our effective temperatures and metallicities are similar to those in the other catalogs, typically about 150 K and 0.2 dex, respectively, while our gravities are more uncertain than those in the other catalogs, about 0.4 dex for us but nearly 0.2 dex for the others. The analysis by Andrae et al. (2023b) and Zhang et al. (2023) are different in nature from ours. Not only they employ data-driven methods, while our results are obtained from synthetic spectra based on model atmospheres, but they make use of the trigonometric parallaxes from Gaia, while we do not. We plan to adapt our algorithm to make use of that valuable information, which can directly and dramatically improve the retrieved surface gravities, as well as constraining better the effective temperatures and metallicities. Nonetheless, our present results are independent from models of stellar structure and evolution.

The most used training datasets for machine learning, such as the catalogs from APOGEE and LAMOST, have a fairly restrictive lower boundary in metallicity, which limits their application to detect metal-poor star candidates. Additional comparisons with Andrae et al. (2023b) and Zhang et al. (2023) regarding VMP stars are also presented in Appendix A. In summary, our catalog offers some advantages in the parameter range beyond those covered by previous work, and has more clean and direct connection to physical models of stellar atmospheres.

Fig. 9

Comparison of atmospheric parameters T_eff, log ɡ, and [M/H] between our catalog and LAMOST DR11 low-resolution catalog.

5.4.3 Star clusters

We use open clusters (OCs) and globular clusters (GCs) to validate the metallicities in our catalog. Details of the comparison are outlined in Appendix B. Although there are some outliers in the metallicity distribution, as seen in GC NGC 3201, the overall results are in excellent agreement with the literature.

5.4.4 The final catalog

The first column in our global catalog is name, representing the Gaia source id of each source. Following this, we provide the atmospheric parameters T_eff, log ɡ, and [M/H] as Teff, logg, FeH. The log₁₀ χ² value for each spectrum is also included in the column log10_chi2. Another parameter reflecting the quality of the atmospheres parameters in our catalog is dflux per, as discussed earlier in Section 5.1. This quality flag has already been utilized to select reliable results. Stricter criteria, such as dflux_per< 0.10 or < 0.12, can be applied to identify even more reliable sources, particularly for metal-poor stars. For the metal-poor catalog, we also include the extinction A_V used in this paper, as we should exercise caution with the low-latitude high extinction stars. Additionally, the catalog has been crossmatched with Gaia to obtain various parameters from Gaia DR3, such as proper motion and magnitude. The global and metal-poor catalogs, along with the code for correcting the systematic patterns in the spectra, are made publicly available.

6 Summary

In this paper, we characterize the patterns of systematic errors present in the absolute-calibrated Gaia XP spectra, using a very large number of XP spectra and their best-fitting synthetic spectra based on model atmospheres. We find that those patterns depend on stellar colors, brightness, extinction, and metallicity. We present a simple NN that relates the systematic flux patterns with stellar color, magnitude, and extinction. The predicted patterns match those in the data very well. After correcting the wiggles, FERЯ is applied to derive atmospheric parameters from corrected Gaia XP spectra. Our methodology is validated from the comparison with APOGEE DR17 parameters and stars with observations in the HST CALSPEC collection.

Compared to APOGEE, our estimation of atmospheric parameters is accurate in the temperature range 4000 ≤ T_eff ≤ 7000 K, with slight systematic errors and standard deviations around −38 ± 167 K, 0.05 ± 0.40 dex, and −0.12 ± 0.19 dex in T_eff, log ɡ, and [M/H], respectively. The estimation of atmospheric parameters and spectra flux are both improved by correcting the systematic patterns. Our corrections improve the quality of the relative spectrophotometry of the Gaia XP data from 3.2–3.7% to 1.2–2.4%, as verified against our models and the high-quality CALSPEC standards. Our results are also compared with other catalogs generated from XP data in the literature, and the metallicity is validated through the use of star cluster members. Finally, we publish our atmospheric parameters catalog of 68 394 431 sources, with a metal-poor ([M/H] ≤ −2.5) subset including 124 188 stars. Our catalogs and flux-correction code are publicly available.

Data availability

Tables and codes are made publicly available at https://doi.org/10.5281/zenodo.14028588. The tables are available in electronic form at the CDS via anonymous ftp to cdsarc.cds.unistra.fr (130.79.128.5) or via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/695/A75

Acknowledgements

This study is supported by the National Key R&D Program of China under grant Nos. 2023YFE0107800, 2024YFA1611900, and National Natural Science Foundation of China under grant Nos. 11988101, 12273055, 11927804. This study is also supported by International Partnership Program of Chinese Academy of Sciences Grant No.178GJZ2022040GC. XY and WW acknowledge the support from the China Scholarship Council. CAP acknowledges financial support from the Spanish Ministry MICIU projects PID2020-117493GB-I00 and PID2023-149982NB-I00. This research made use of computing time available on the high-performance computing systems at the Instituto de Astrofisica de Canarias. The authors are thankful for the technical expertise and assistance provided by the Spanish Supercomputing Network (Red Espanola de Supercomputacion), and the staff at the Instituto de Astrofisica de Canarias. CdB acknowledges support from a Beatriz Galindo senior fellowship (BG22/00166) from the Spanish Ministry of Science, Innovation and Universities. This work presents results from the European Space Agency (ESA) space mission Gaia. Gaia data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). The Gaia mission website is https://www.cosmos.esa.int/gaia. The Gaia archive website is https://archives.esac.esa.int/gaia. This job has made use of the Python package GaiaXPy, developed and maintained by members of the Gaia Data Processing and Analysis Consortium (DPAC), and in particular, Coordination Unit 5 (CU5), and the Data Processing Centre located at the Institute of Astronomy, Cambridge, UK (DPCI). Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss4.org. SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics | Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional / MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University. This work made extensive use of TOPCAT (Taylor 2005).

Appendix A Comparisons with Andrae et al. (2023b) and Zhang et al. (2023)

The direct comparison between our results and those from Andrae et al. (2023b) and Zhang et al. (2023) are presented in the first and third rows of Figure A.1. The mean and standard deviation values of ΔT_eff, Δ log ɡ, and Δ[M/H] indicate that our results are more similar to those of Zhang et al. (2023) in T_eff and log ɡ, but align more closely with Andrae et al. (2023b) for [M/H]. Since machine learning results can exhibit artificially high performance on the training data, as the data is used to optimize model parameters, we use parameters from different surveys as reference points in our comparison. Andrae et al. (2023b) provides atmospheric parameters using XGBoost training on APOGEE. To make a fair comparison, we cross-match our catalog and theirs with LAMOST DR11 low-resolution catalog. With no additional cuts, there are almost 3 million stars in common. We calculate ΔT_eff, Δ log ɡ, Δ[M/H] between our results and those from LAMOST, which are presented in the second row of Figure A.1. ΔT_eff, Δ log ɡ, Δ[M/H] between Andrae et al. (2023b) and LAMOST are also presented in those panels with different colors. We can see that we have a better estimation for T_eff when using the reference T_eff from LAMOST. However, our gravity estimation is significantly worse than theirs, and the metallicity from Andrae et al. (2023b) is also better than ours. In the last row of Figure A.1, we use APOGEE DR17 as a reference to calculate ΔT_eff, Δ log ɡ, Δ[M/H] between our catalog (or the catalog from Zhang et al. (2023)) with the catalog from APOGEE. From these histograms, using APOGEE as a reference, all three atmospheric parameters from Zhang et al. (2023) are better than ours. In the middle panel of the last row, a spike appears at log ɡ − log ɡ_APOGEE ~ 0.35 − 0.40, which is caused by a problematic estimation of log ɡ for low T_eff stars. A deeper analysis of the T_eff vs. log ɡ − log ɡ_APOGEE diagram reveals an overdensity at log ɡ − log ɡ_APOGEE ~ 0.35 − 0.40 for low temperature stars (T_eff < 4500).

In addtion, we cross-match our catalog and the catalogs from the literature with VMP stars described in Section 2.2. It should be pointed out that Andrae et al. (2023b) uses metal-poor stars to replenish their training sample, which contains many sources in common with our VMP selection. Therefore, comparison between our results and theirs for VMP stars is not fair. Not surprisingly, Andrae et al. (2023b) performs better in such comparison, but our method performs better compared to Zhang et al. (2023), as shown in Figure A.2.

Appendix B Validating our catalog using star clusters

The star clusters are selected primarily at random, ensuring a sufficient number of member candidates to construct the metallicity distribution, while also ensuring that each cluster has a single, well-defined metallicity. Member candidates are from Hunt & Reffert (2024) and Vasiliev & Baumgardt (2021) for OCs and GCs⁸, respectively. To confirm the membership of stars, we use the membership probability Prob > 0.6 and memberprob = 1 for OCs and GCs, respectively. Similar to Huang et al. (2024b), we select only giants (with our estimated log ɡ < 3.5) in the case of GCs, and fit the metallicity distribution for each cluster, using a Gaussian function after applying 3 − σ clipping. The metallicity distributions for the three OCs and three GCs are shown in Figure B.1. In this figure, we include all the candidates, but highlight the distribution after applying 3 − σ clipping. The number of member stars for each star cluster (after 3 − σ clipping) and Gaussian fit to the clipped distribution are shown in each panel as well.

From each panel in Figure B.1, stars within the star cluster present strong compatibility in [M/H]. The metallicity of OCs is close to 0 but still shows variation among different OCs, while the metallicity of GCs falls in the range −2.3 ~ −1.6. We also compare the metallicity distribution with values from literature in Figure B.1, indicated by the black vertical lines. For OCs, metallicities are adopted from Dias et al. (2021), while for GCs, we use the values from Gonzalez & Wallerstein (1998) for NGC 3201, Castilho et al. (2000) for NGC 6397, and Lee et al. (2005) for NGC 4590.

Fig. A.1

Comparison of our catalog with other catalogs from literatures using LAMOST or APOGEE as references. The first row: comparison between our catalog and a catalog from Andrae et al. (2023b), annotated with the means and standard deviations of the differences in T_eff, log ɡ, and [M/H]. The second row: histograms in red showing the differences in atmospheric parameters between our catalog and the catalog from LAMOST, while histograms in black showing the differences between the catalog from Andrae et al. (2023b) and the one from LAMOST, with means and standard deviations annotated. The last two rows: similar to the first and second rows, we compare our catalog with the catalog from Zhang et al. (2023), using APOGEE as a reference in those panels in the last row.

Fig. A.2

Similar to Figure A.1, Comparison of our catalog with other catalogs from literatures using VMP stars as references.

Fig. B.1

The metallicity distribution from our catalog is shown for member stars of three open clusters (NGC 2632, NGC 2516, NGC 752) and three globular clusters (NGC 3201, NGC 6397, NGC 4590). In each panel, we display the metallicity distribution of member stars both before and after applying 3 − σ clipping, depicted as step and filled histograms, respectively, with the number of remaining members annotated in the labels. Gaussian fits are overlaid as blue curves in all panels. The mean and standard deviation of [M/H] for each cluster are also presented in the respective panel. The black vertical lines indicate the metallicities of these star clusters as reported in the literature.

References

Abazajian, K., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2004, AJ, 128, 502 [NASA ADS] [CrossRef] [Google Scholar]
Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [NASA ADS] [CrossRef] [Google Scholar]
Abohalima, A., & Frebel, A. 2018, ApJS, 238, 36 [NASA ADS] [CrossRef] [Google Scholar]
Aguado, D. S., Youakim, K., González Hernández, J. I., et al. 2019, MNRAS, 490, 2241 [NASA ADS] [CrossRef] [Google Scholar]
Aihara, H., Allende Prieto, C., An, D., et al. 2011, ApJS, 193, 29 [NASA ADS] [CrossRef] [Google Scholar]
Allende Prieto, C. 2023, Atoms, 11, 61 [Google Scholar]
Allende Prieto, C., Beers, T. C., Wilhelm, R., et al. 2006, ApJ, 636, 804 [NASA ADS] [CrossRef] [Google Scholar]
Allende Prieto, C., Koesterke, L., Hubeny, I., et al. 2018, A&A, 618, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
An, D., Beers, T. C., & Chiti, A. 2024, ApJS, 272, 20 [NASA ADS] [CrossRef] [Google Scholar]
Andrae, R., Fouesneau, M., Sordo, R., et al. 2023a, A&A, 674, A27 [CrossRef] [EDP Sciences] [Google Scholar]
Andrae, R., Rix, H.-W., & Chandra, V. 2023b, ApJS, 267, 8 [NASA ADS] [CrossRef] [Google Scholar]
Aoki, W., Tominaga, N., Beers, T. C., Honda, S., & Lee, Y. S. 2014, Science, 345, 912 [NASA ADS] [CrossRef] [Google Scholar]
Aoki, W., Li, H., Matsuno, T., et al. 2022, ApJ, 931, 146 [NASA ADS] [CrossRef] [Google Scholar]
Avdeeva, A. S., Kovaleva, D. A., Malkov, O. Y., & Zhao, G. 2024, MNRAS, 527, 7382 [Google Scholar]
Bellazzini, M., Massari, D., De Angeli, F., et al. 2023, A&A, 674, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Belokurov, V., & Kravtsov, A. 2022, MNRAS, 514, 689 [NASA ADS] [CrossRef] [Google Scholar]
Binney, J., & Vasiliev, E. 2024, MNRAS, 527, 1915 [Google Scholar]
Bohlin, R. C., & Lockwood, S. 2022, Update of the STIS CTE Correction Formula for Stellar Spectra, Instrument Science Report STIS 2022-7, 11 [Google Scholar]
Bohlin, R. C., Gordon, K. D., & Tremblay, P. E. 2014, PASP, 126, 711 [NASA ADS] [Google Scholar]
Bohlin, R. C., Deustua, S. E., & de Rosa, G. 2019, AJ, 158, 211 [NASA ADS] [CrossRef] [Google Scholar]
Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506, 150 [NASA ADS] [CrossRef] [Google Scholar]
Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 [Google Scholar]
Carrasco, J. M., Weiler, M., Jordi, C., et al. 2021, A&A, 652, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Castelli, F., Gratton, R. G., & Kurucz, R. L. 1997, A&A, 318, 841 [NASA ADS] [Google Scholar]
Castilho, B. V., Pasquini, L., Allen, D. M., Barbuy, B., & Molaro, P. 2000, A&A, 361, 92 [NASA ADS] [Google Scholar]
Chandra, V., Semenov, V. A., Rix, H.-W., et al. 2024, ApJ, 972, 112 [Google Scholar]
Chen, B., Hayden, M. R., Sharma, S., et al. 2023, MNRAS, 523, 3791 [NASA ADS] [CrossRef] [Google Scholar]
Chiti, A., Frebel, A., Mardini, M. K., et al. 2021, ApJS, 254, 31 [NASA ADS] [CrossRef] [Google Scholar]
Coelho, P., Barbuy, B., Meléndez, J., Schiavon, R. P., & Castilho, B. V. 2005, A&A, 443, 735 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Conroy, C., Weinberg, D. H., Naidu, R. P., et al. 2022, OJAp, submitted [arXiv:2204.02989] [Google Scholar]
Cooper, A. P., Koposov, S. E., Allende Prieto, C., et al. 2023, ApJ, 947, 37 [NASA ADS] [CrossRef] [Google Scholar]
Cropper, M., Katz, D., Sartoretti, P., et al. 2018, A&A, 616, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]
De Angeli, F., Weiler, M., Montegriffo, P., et al. 2023, A&A, 674, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
de Jong, R. S., Agertz, O., Berbel, A. A., et al. 2019, The Messenger, 175, 3 [NASA ADS] [Google Scholar]
De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [Google Scholar]
Deason, A. J., & Belokurov, V. 2024, New A Rev., 99, 101706 [NASA ADS] [CrossRef] [Google Scholar]
Dias, W. S., Monteiro, H., Moitinho, A., et al. 2021, MNRAS, 504, 356 [NASA ADS] [CrossRef] [Google Scholar]
Fallows, C. P., & Sanders, J. L. 2024, MNRAS, 531, 2126 [CrossRef] [Google Scholar]
Gaia Collaboration (Brown, A. G. A., et al.) 2016, A&A, 595, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gaia Collaboration (Montegriffo, P., et al.) 2023a, A&A, 674, A33 [CrossRef] [EDP Sciences] [Google Scholar]
Gaia Collaboration (Vallenari, A., et al.) 2023b, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [Google Scholar]
Gonzalez, G., & Wallerstein, G. 1998, AJ, 116, 765 [NASA ADS] [CrossRef] [Google Scholar]
Green, G. M. 2018, J. Open Source Softw., 3, 695 [Google Scholar]
Gregg, M. D., Silva, D., Rayner, J., et al. 2006, in The 2005 HST Calibration Workshop: Hubble After the Transition to Two-Gyro Mode, eds. A. M. Koekemoer, P. Goudfrooij, & L. L. Dressel, 209 [Google Scholar]
Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hattori, K. 2024, arXiv e-prints, [arXiv:2404.01269] [Google Scholar]
Heger, A., & Woosley, S. E. 2010, ApJ, 724, 341 [Google Scholar]
Huang, B., Yuan, H., Xiang, M., et al. 2024a, ApJS, 271, 13 [NASA ADS] [CrossRef] [Google Scholar]
Huang, B., Yuan, H., Xu, S., et al. 2024b, ApJS, submitted [arXiv:2410.19895] [Google Scholar]
Hunt, E. L., & Reffert, S. 2024, A&A, 686, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Husser, T. O., Wende-von Berg, S., Dreizler, S., et al. 2013, A&A, 553, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Imig, J., Price, C., Holtzman, J. A., et al. 2023, ApJ, 954, 124 [CrossRef] [Google Scholar]
Jin, S., Trager, S. C., Dalton, G. B., et al. 2024, MNRAS, 530, 2688 [NASA ADS] [CrossRef] [Google Scholar]
Klessen, R. S., & Glover, S. C. O. 2023, ARA&A, 61, 65 [NASA ADS] [CrossRef] [Google Scholar]
Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, arXiv e-prints [arXiv:1711.03234] [Google Scholar]
Komiya, Y., Habe, A., Suda, T., & Fujimoto, M. Y. 2010, ApJ, 717, 542 [NASA ADS] [CrossRef] [Google Scholar]
Koutsouridou, I., Salvadori, S., Skúladóttir, Á., et al. 2023, MNRAS, 525, 190 [NASA ADS] [CrossRef] [Google Scholar]
Laroche, A., & Speagle, J. S. 2024, ApJ, submitted [arXiv:2404.07316] [Google Scholar]
Lee, J.-W., Carney, B. W., & Habgood, M. J. 2005, AJ, 129, 251 [NASA ADS] [CrossRef] [Google Scholar]
Lejeune, T., Cuisinier, F., & Buser, R. 1998, A&AS, 130, 65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Leung, H. W., & Bovy, J. 2019, MNRAS, 483, 3255 [NASA ADS] [Google Scholar]
Leung, H. W., & Bovy, J. 2024, MNRAS, 527, 1494 [Google Scholar]
Li, X., & Lin, B. 2023, MNRAS, 521, 6354 [NASA ADS] [CrossRef] [Google Scholar]
Li, H., Aoki, W., Matsuno, T., et al. 2022a, ApJ, 931, 147 [NASA ADS] [CrossRef] [Google Scholar]
Li, Z., Zhao, G., Chen, Y., Liang, X., & Zhao, J. 2022b, MNRAS, 517, 4875 [NASA ADS] [CrossRef] [Google Scholar]
Li, J., Wong, K. W. K., Hogg, D. W., Rix, H.-W., & Chandra, V. 2024, ApJS, 272, 2 [NASA ADS] [CrossRef] [Google Scholar]
Lian, J., Zasowski, G., Mackereth, T., et al. 2022, MNRAS, 513, 4130 [NASA ADS] [CrossRef] [Google Scholar]
Luo, A. L., Zhang, H.-T., Zhao, Y.-H., et al. 2012, Res. Astron. Astrophys., 12, 1243 [CrossRef] [Google Scholar]
Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [NASA ADS] [CrossRef] [Google Scholar]
Mardini, M. K., Frebel, A., & Chiti, A. 2024, MNRAS, 529, L60 [Google Scholar]
Martin, N. F., Starkenburg, E., Yuan, Z., et al. 2024, A&A, 692, A115 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Montegriffo, P., De Angeli, F., Andrae, R., et al. 2023, A&A, 674, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48 [Google Scholar]
Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16 [NASA ADS] [CrossRef] [Google Scholar]
Nomoto, K., Tominaga, N., Umeda, H., Kobayashi, C., & Maeda, K. 2006, Nucl. Phys. A, 777, 424 [CrossRef] [Google Scholar]
Nomoto, K., Kobayashi, C., & Tominaga, N. 2013, ARA&A, 51, 457 [CrossRef] [Google Scholar]
Onken, C. A., Wolf, C., Bessell, M. S., et al. 2024, PASA, 41, e061 [NASA ADS] [CrossRef] [Google Scholar]
Pal, T., Khan, I., Worthey, G., Gregg, M. D., & Silva, D. R. 2023, ApJS, 266, 41 [NASA ADS] [CrossRef] [Google Scholar]
Paszke, A., Gross, S., Chintala, S., et al. 2017, in NIPS 2017 Workshop on Autodiff [Google Scholar]
Paszke, A., Gross, S., Massa, F., et al. 2019, in Advances in Neural Information Processing Systems, 32, eds. H. Wallach, H. Larochelle, A. Beygelzimer, F. d’Alché-Buc, E. Fox, & R. Garnett (Curran Associates, Inc.) [Google Scholar]
Powell, M. J. D. 2002, Math. Program., 92, 555 [CrossRef] [Google Scholar]
Rix, H.-W., Chandra, V., Andrae, R., et al. 2022, ApJ, 941, 45 [NASA ADS] [CrossRef] [Google Scholar]
Salvadori, S., Schneider, R., & Ferrara, A. 2007, MNRAS, 381, 647 [NASA ADS] [CrossRef] [Google Scholar]
Sanders, J. L., & Matsunaga, N. 2023, MNRAS, 521, 2745 [NASA ADS] [CrossRef] [Google Scholar]
Sarmento, R., Scannapieco, E., & Côté, B. 2019, ApJ, 871, 206 [CrossRef] [Google Scholar]
Sartoretti, P., Katz, D., Cropper, M., et al. 2018, A&A, 616, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Schlafly, E. F., & Finkbeiner, D. P. 2011, ApJ, 737, 103 [Google Scholar]
Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
Schönrich, R., & Binney, J. 2009, MNRAS, 399, 1145 [Google Scholar]
Sharma, S., Hayden, M. R., & Bland-Hawthorn, J. 2021, MNRAS, 507, 5882 [NASA ADS] [CrossRef] [Google Scholar]
Tarumi, Y., Hartwig, T., & Magg, M. 2020, ApJ, 897, 58 [NASA ADS] [CrossRef] [Google Scholar]
Taylor, M. B. 2005, in Astronomical Society of the Pacific Conference Series, 347, Astronomical Data Analysis Software and Systems XIV, eds. P. Shopbell, M. Britton, & R. Ebert, 29 [NASA ADS] [Google Scholar]
Ting, Y.-S., Rix, H.-W., Conroy, C., Ho, A. Y. Q., & Lin, J. 2017, ApJ, 849, L9 [NASA ADS] [CrossRef] [Google Scholar]
Ting, Y.-S., Conroy, C., Rix, H.-W., & Cargile, P. 2019, ApJ, 879, 69 [Google Scholar]
Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978 [NASA ADS] [CrossRef] [Google Scholar]
Weiler, M., Carrasco, J. M., Fabricius, C., & Jordi, C. 2023, A&A, 671, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001 [NASA ADS] [CrossRef] [Google Scholar]
Witten, C. E. C., Aguado, D. S., Sanders, J. L., et al. 2022, MNRAS, 516, 3254 [NASA ADS] [CrossRef] [Google Scholar]
Xiang, M., Ting, Y.-S., Rix, H.-W., et al. 2019, ApJS, 245, 34 [Google Scholar]
Xiang, M., Rix, H.-W., Ting, Y.-S., et al. 2022, A&A, 662, A66 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Xiao, K., Yuan, H., Huang, B., et al. 2023a, ApJS, 268, 53 [NASA ADS] [CrossRef] [Google Scholar]
Xiao, K., Yuan, H., López-Sanjuan, C., et al. 2023b, ApJS, 269, 58 [NASA ADS] [CrossRef] [Google Scholar]
Xylakis-Dornbusch, T., Christlieb, N., Hansen, T. T., et al. 2024, A&A, 687, A177 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Yamada, S., Suda, T., Komiya, Y., Aoki, W., & Fujimoto, M. Y. 2013, MNRAS, 436, 1362 [Google Scholar]
Yan, H., Li, H., Wang, S., et al. 2022, The Innovation, 3, 100224 [NASA ADS] [CrossRef] [Google Scholar]
Yao, Y., Ji, A. P., Koposov, S. E., & Limberg, G. 2024, MNRAS, 527, 10937 [Google Scholar]
York, D. G., Adelman, J., Anderson, John E. J., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]
Youakim, K., Starkenburg, E., Martin, N. F., et al. 2020, MNRAS, 492, 4986 [CrossRef] [Google Scholar]
Zhang, X., Green, G. M., & Rix, H.-W. 2023, MNRAS, 524, 1855 [NASA ADS] [CrossRef] [Google Scholar]
Zhao, G., Chen, Y.-Q., Shi, J.-R., et al. 2006, Chinese J. Astron. Astrophys., 6, 265 [NASA ADS] [CrossRef] [Google Scholar]
Zhao, G., Zhao, Y.-H., Chu, Y.-Q., Jing, Y.-P., & Deng, L.-C. 2012, Res. Astron. Astrophys., 12, 723 [NASA ADS] [CrossRef] [Google Scholar]

¹

High Efficiency and Resolution Multi Element Spectrograph.

²

William Herschell Telescope.

³

https://gaiaxpy.readthedocs.io/en/latest/cite.html, DOI v2.1.0: 10.5281/zenodo.8239995.

⁴

https://extinction.readthedocs.io/en/latest/index.html

⁵

https://github.com/callendeprieto/ferre

⁶

https://github.com/callendeprieto/synple

⁷

https://github.com/pytorch/pytorch/tree/main

All Figures

Fig. 1

Density distribution of residuals as a function of wavelength for stars with different parameters. Subfigures (a), (b), and (c) are results of stars in bins of the same magnitude range 13 < G₀ < 15 but different stellar colors. Subfigures (b), (d), and (e) show the density distribution of residuals for stars within the same range of color 1.0 < (G_BP − G_RP)₀ < 1.2 but different G₀. Subfigure (f) shows the residuals of stars of the same color-magnitude as subfigures (b) but in different Galactic latitude (|b| < 10°). The black dash lines represent the P₅₀ percentiles distributions, which can be viewed as a robust representation of the patterns of wiggles. From the P₅₀ lines we can see that the wiggles change with (G_BP − G_RP)₀ and G₀ , but the stellar color has a larger impact than the magnitude. Stars of low latitude have a much more diffuse distribution of ∆Flux. The large uncertainties of the extinction map towards the Galactic disc may cause a bad extinction correction of Flux_XP for some stars. Therefore, Flux_XP is more likely to deviate from the fitting result Flux_fitting for stars at lower latitude.

In the text

Fig. 2

Density distribution of residuals as a function of wavelength for VMP stars. Top panel shows the density distribution of the residuals of VMP stars with 1.0 < G_BP − G_RP < 1.2 and 13 < G < 15. Compared to the relatively more metal-rich sample in Figure 1, their wiggles have a smaller amplitude with Max (|P₅₀|) < 0.08. The bottom panel shows the difference of P₅₀ between these two samples. The main difference is in the blue band of λ < 6500 Å, while the wiggles in the red band is little affected by the change of metallicity.

In the text

	Fig. 3 Diagram of our NN model based on `Pytorch`.
In the text

	Fig. 4 Comparison between the real patterns from TAS (20% of the whole sample, top panel) and the NN model predicted patterns (middle panel). The bottom panel shows the difference between the real and predicted patterns, from which we can see that the wiggles disappear.
In the text

Fig. 5

Testing on atmospheric parameters using sampling S_const. Top panels: comparison of T_eff, logg and [M/H] estimated from original XP spectra (Y -axis) and APOGEE survey (X-axis). Middle panels: similar to the top panels, but with Y-axis replaced by the results from corrected XP spectra. The color bar in each subfigure displays the number density. Bottom panels: histograms present the differences of T_eff, log ɡ and [M/H] between XP and APOGEE before and after correcting the systematic patterns. The mean values and standard deviations are shown in the labels.

In the text

	Fig. 6 Histograms present log₁₀ (χ²) and RMS between the XP spectra and model spectra for the IAS data set before and after correcting the pattern, indicating by blue and orange, respectively.
In the text

Fig. 7

Distribution of RMS between CALSPEC and XP spectra. Left panels: histograms present RMS between the XP spectra and the spectra from CALSPEC libraries before and after correcting patterns, indicating by blue and orange, respectively. The green histogram shows the corresponding results obtained using the Python package from Huang et al. (2024a) to correct the XP spectra. Right panels: similar to those panels in the left, but with stars that have passed the quality cuts applied in Section 5.1. The red dashed lines represent the median values, which are also indicated by the number displayed in each panel.

In the text

Fig. 8

Examples of how pattern correction can help improve the fitting between the XP spectra and the CALSPEC. Three spectra with different temperatures are presented in the sub-panels, from top to bottom: BD+54 1216, HD 115169, and KF06T2 in CALSPEC. In each panel, the smoothed CALSPEC spectrum is depicted as a bold gray line, while the XP spectra, with and without correcting patterns, are shown in blue and orange, respectively. The green lines show the corrected spectra using the package provided by Huang et al. (2024a). To enhance readability, we include a few zoomed-in diagrams in each panel to highlight the improvements on spectrophotometry. The ∆Flux between XP and CAL- SPEC are also presented at the bottom of each panel.

In the text

	Fig. 9 Comparison of atmospheric parameters T_eff, log ɡ, and [M/H] between our catalog and LAMOST DR11 low-resolution catalog.
In the text

Fig. A.1

Comparison of our catalog with other catalogs from literatures using LAMOST or APOGEE as references. The first row: comparison between our catalog and a catalog from Andrae et al. (2023b), annotated with the means and standard deviations of the differences in T_eff, log ɡ, and [M/H]. The second row: histograms in red showing the differences in atmospheric parameters between our catalog and the catalog from LAMOST, while histograms in black showing the differences between the catalog from Andrae et al. (2023b) and the one from LAMOST, with means and standard deviations annotated. The last two rows: similar to the first and second rows, we compare our catalog with the catalog from Zhang et al. (2023), using APOGEE as a reference in those panels in the last row.

In the text

	Fig. A.2 Similar to Figure A.1, Comparison of our catalog with other catalogs from literatures using VMP stars as references.
In the text

Fig. B.1

The metallicity distribution from our catalog is shown for member stars of three open clusters (NGC 2632, NGC 2516, NGC 752) and three globular clusters (NGC 3201, NGC 6397, NGC 4590). In each panel, we display the metallicity distribution of member stars both before and after applying 3 − σ clipping, depicted as step and filled histograms, respectively, with the number of remaining members annotated in the labels. Gaussian fits are overlaid as blue curves in all panels. The mean and standard deviation of [M/H] for each cluster are also presented in the respective panel. The black vertical lines indicate the metallicities of these star clusters as reported in the literature.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Abazajian, K., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2004, AJ, 128, 502 [NASA ADS] [CrossRef] [Google Scholar]

[2] Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [NASA ADS] [CrossRef] [Google Scholar]

[3] Abohalima, A., & Frebel, A. 2018, ApJS, 238, 36 [NASA ADS] [CrossRef] [Google Scholar]

[4] Aguado, D. S., Youakim, K., González Hernández, J. I., et al. 2019, MNRAS, 490, 2241 [NASA ADS] [CrossRef] [Google Scholar]

[5] Aihara, H., Allende Prieto, C., An, D., et al. 2011, ApJS, 193, 29 [NASA ADS] [CrossRef] [Google Scholar]

[6] Allende Prieto, C. 2023, Atoms, 11, 61 [Google Scholar]

[7] Allende Prieto, C., Beers, T. C., Wilhelm, R., et al. 2006, ApJ, 636, 804 [NASA ADS] [CrossRef] [Google Scholar]

[8] Allende Prieto, C., Koesterke, L., Hubeny, I., et al. 2018, A&A, 618, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[9] An, D., Beers, T. C., & Chiti, A. 2024, ApJS, 272, 20 [NASA ADS] [CrossRef] [Google Scholar]

[10] Andrae, R., Fouesneau, M., Sordo, R., et al. 2023a, A&A, 674, A27 [CrossRef] [EDP Sciences] [Google Scholar]

[11] Andrae, R., Rix, H.-W., & Chandra, V. 2023b, ApJS, 267, 8 [NASA ADS] [CrossRef] [Google Scholar]

[12] Aoki, W., Tominaga, N., Beers, T. C., Honda, S., & Lee, Y. S. 2014, Science, 345, 912 [NASA ADS] [CrossRef] [Google Scholar]

[13] Aoki, W., Li, H., Matsuno, T., et al. 2022, ApJ, 931, 146 [NASA ADS] [CrossRef] [Google Scholar]

[14] Avdeeva, A. S., Kovaleva, D. A., Malkov, O. Y., & Zhao, G. 2024, MNRAS, 527, 7382 [Google Scholar]

[15] Bellazzini, M., Massari, D., De Angeli, F., et al. 2023, A&A, 674, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[16] Belokurov, V., & Kravtsov, A. 2022, MNRAS, 514, 689 [NASA ADS] [CrossRef] [Google Scholar]

[17] Binney, J., & Vasiliev, E. 2024, MNRAS, 527, 1915 [Google Scholar]

[18] Bohlin, R. C., & Lockwood, S. 2022, Update of the STIS CTE Correction Formula for Stellar Spectra, Instrument Science Report STIS 2022-7, 11 [Google Scholar]

[19] Bohlin, R. C., Gordon, K. D., & Tremblay, P. E. 2014, PASP, 126, 711 [NASA ADS] [Google Scholar]

[20] Bohlin, R. C., Deustua, S. E., & de Rosa, G. 2019, AJ, 158, 211 [NASA ADS] [CrossRef] [Google Scholar]

[21] Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506, 150 [NASA ADS] [CrossRef] [Google Scholar]

[22] Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 [Google Scholar]

[23] Carrasco, J. M., Weiler, M., Jordi, C., et al. 2021, A&A, 652, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[24] Castelli, F., Gratton, R. G., & Kurucz, R. L. 1997, A&A, 318, 841 [NASA ADS] [Google Scholar]

[25] Castilho, B. V., Pasquini, L., Allen, D. M., Barbuy, B., & Molaro, P. 2000, A&A, 361, 92 [NASA ADS] [Google Scholar]

[26] Chandra, V., Semenov, V. A., Rix, H.-W., et al. 2024, ApJ, 972, 112 [Google Scholar]

[27] Chen, B., Hayden, M. R., Sharma, S., et al. 2023, MNRAS, 523, 3791 [NASA ADS] [CrossRef] [Google Scholar]

[28] Chiti, A., Frebel, A., Mardini, M. K., et al. 2021, ApJS, 254, 31 [NASA ADS] [CrossRef] [Google Scholar]

[29] Coelho, P., Barbuy, B., Meléndez, J., Schiavon, R. P., & Castilho, B. V. 2005, A&A, 443, 735 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[30] Conroy, C., Weinberg, D. H., Naidu, R. P., et al. 2022, OJAp, submitted [arXiv:2204.02989] [Google Scholar]

[31] Cooper, A. P., Koposov, S. E., Allende Prieto, C., et al. 2023, ApJ, 947, 37 [NASA ADS] [CrossRef] [Google Scholar]

[32] Cropper, M., Katz, D., Sartoretti, P., et al. 2018, A&A, 616, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[33] Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]

[34] De Angeli, F., Weiler, M., Montegriffo, P., et al. 2023, A&A, 674, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[35] de Jong, R. S., Agertz, O., Berbel, A. A., et al. 2019, The Messenger, 175, 3 [NASA ADS] [Google Scholar]

[36] De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [Google Scholar]

[37] Deason, A. J., & Belokurov, V. 2024, New A Rev., 99, 101706 [NASA ADS] [CrossRef] [Google Scholar]

[38] Dias, W. S., Monteiro, H., Moitinho, A., et al. 2021, MNRAS, 504, 356 [NASA ADS] [CrossRef] [Google Scholar]

[39] Fallows, C. P., & Sanders, J. L. 2024, MNRAS, 531, 2126 [CrossRef] [Google Scholar]

[40] Gaia Collaboration (Brown, A. G. A., et al.) 2016, A&A, 595, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[41] Gaia Collaboration (Montegriffo, P., et al.) 2023a, A&A, 674, A33 [CrossRef] [EDP Sciences] [Google Scholar]

[42] Gaia Collaboration (Vallenari, A., et al.) 2023b, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[43] García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [Google Scholar]

[44] Gonzalez, G., & Wallerstein, G. 1998, AJ, 116, 765 [NASA ADS] [CrossRef] [Google Scholar]

[45] Green, G. M. 2018, J. Open Source Softw., 3, 695 [Google Scholar]

[46] Gregg, M. D., Silva, D., Rayner, J., et al. 2006, in The 2005 HST Calibration Workshop: Hubble After the Transition to Two-Gyro Mode, eds. A. M. Koekemoer, P. Goudfrooij, & L. L. Dressel, 209 [Google Scholar]

[47] Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[48] Hattori, K. 2024, arXiv e-prints, [arXiv:2404.01269] [Google Scholar]

[49] Heger, A., & Woosley, S. E. 2010, ApJ, 724, 341 [Google Scholar]

[50] Huang, B., Yuan, H., Xiang, M., et al. 2024a, ApJS, 271, 13 [NASA ADS] [CrossRef] [Google Scholar]

[51] Huang, B., Yuan, H., Xu, S., et al. 2024b, ApJS, submitted [arXiv:2410.19895] [Google Scholar]

[52] Hunt, E. L., & Reffert, S. 2024, A&A, 686, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[53] Husser, T. O., Wende-von Berg, S., Dreizler, S., et al. 2013, A&A, 553, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[54] Imig, J., Price, C., Holtzman, J. A., et al. 2023, ApJ, 954, 124 [CrossRef] [Google Scholar]

[55] Jin, S., Trager, S. C., Dalton, G. B., et al. 2024, MNRAS, 530, 2688 [NASA ADS] [CrossRef] [Google Scholar]

[56] Klessen, R. S., & Glover, S. C. O. 2023, ARA&A, 61, 65 [NASA ADS] [CrossRef] [Google Scholar]

[57] Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, arXiv e-prints [arXiv:1711.03234] [Google Scholar]

[58] Komiya, Y., Habe, A., Suda, T., & Fujimoto, M. Y. 2010, ApJ, 717, 542 [NASA ADS] [CrossRef] [Google Scholar]

[59] Koutsouridou, I., Salvadori, S., Skúladóttir, Á., et al. 2023, MNRAS, 525, 190 [NASA ADS] [CrossRef] [Google Scholar]

[60] Laroche, A., & Speagle, J. S. 2024, ApJ, submitted [arXiv:2404.07316] [Google Scholar]

[61] Lee, J.-W., Carney, B. W., & Habgood, M. J. 2005, AJ, 129, 251 [NASA ADS] [CrossRef] [Google Scholar]

[62] Lejeune, T., Cuisinier, F., & Buser, R. 1998, A&AS, 130, 65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[63] Leung, H. W., & Bovy, J. 2019, MNRAS, 483, 3255 [NASA ADS] [Google Scholar]

[64] Leung, H. W., & Bovy, J. 2024, MNRAS, 527, 1494 [Google Scholar]

[65] Li, X., & Lin, B. 2023, MNRAS, 521, 6354 [NASA ADS] [CrossRef] [Google Scholar]

[66] Li, H., Aoki, W., Matsuno, T., et al. 2022a, ApJ, 931, 147 [NASA ADS] [CrossRef] [Google Scholar]

[67] Li, Z., Zhao, G., Chen, Y., Liang, X., & Zhao, J. 2022b, MNRAS, 517, 4875 [NASA ADS] [CrossRef] [Google Scholar]

[68] Li, J., Wong, K. W. K., Hogg, D. W., Rix, H.-W., & Chandra, V. 2024, ApJS, 272, 2 [NASA ADS] [CrossRef] [Google Scholar]

[69] Lian, J., Zasowski, G., Mackereth, T., et al. 2022, MNRAS, 513, 4130 [NASA ADS] [CrossRef] [Google Scholar]

[70] Luo, A. L., Zhang, H.-T., Zhao, Y.-H., et al. 2012, Res. Astron. Astrophys., 12, 1243 [CrossRef] [Google Scholar]

[71] Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [NASA ADS] [CrossRef] [Google Scholar]

[72] Mardini, M. K., Frebel, A., & Chiti, A. 2024, MNRAS, 529, L60 [Google Scholar]

[73] Martin, N. F., Starkenburg, E., Yuan, Z., et al. 2024, A&A, 692, A115 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[74] Montegriffo, P., De Angeli, F., Andrae, R., et al. 2023, A&A, 674, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[75] Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48 [Google Scholar]

[76] Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16 [NASA ADS] [CrossRef] [Google Scholar]

[77] Nomoto, K., Tominaga, N., Umeda, H., Kobayashi, C., & Maeda, K. 2006, Nucl. Phys. A, 777, 424 [CrossRef] [Google Scholar]

[78] Nomoto, K., Kobayashi, C., & Tominaga, N. 2013, ARA&A, 51, 457 [CrossRef] [Google Scholar]

[79] Onken, C. A., Wolf, C., Bessell, M. S., et al. 2024, PASA, 41, e061 [NASA ADS] [CrossRef] [Google Scholar]

[80] Pal, T., Khan, I., Worthey, G., Gregg, M. D., & Silva, D. R. 2023, ApJS, 266, 41 [NASA ADS] [CrossRef] [Google Scholar]

[81] Paszke, A., Gross, S., Chintala, S., et al. 2017, in NIPS 2017 Workshop on Autodiff [Google Scholar]

[82] Paszke, A., Gross, S., Massa, F., et al. 2019, in Advances in Neural Information Processing Systems, 32, eds. H. Wallach, H. Larochelle, A. Beygelzimer, F. d’Alché-Buc, E. Fox, & R. Garnett (Curran Associates, Inc.) [Google Scholar]

[83] Powell, M. J. D. 2002, Math. Program., 92, 555 [CrossRef] [Google Scholar]

[84] Rix, H.-W., Chandra, V., Andrae, R., et al. 2022, ApJ, 941, 45 [NASA ADS] [CrossRef] [Google Scholar]

[85] Salvadori, S., Schneider, R., & Ferrara, A. 2007, MNRAS, 381, 647 [NASA ADS] [CrossRef] [Google Scholar]

[86] Sanders, J. L., & Matsunaga, N. 2023, MNRAS, 521, 2745 [NASA ADS] [CrossRef] [Google Scholar]

[87] Sarmento, R., Scannapieco, E., & Côté, B. 2019, ApJ, 871, 206 [CrossRef] [Google Scholar]

[88] Sartoretti, P., Katz, D., Cropper, M., et al. 2018, A&A, 616, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[89] Schlafly, E. F., & Finkbeiner, D. P. 2011, ApJ, 737, 103 [Google Scholar]

[90] Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]

[91] Schönrich, R., & Binney, J. 2009, MNRAS, 399, 1145 [Google Scholar]

[92] Sharma, S., Hayden, M. R., & Bland-Hawthorn, J. 2021, MNRAS, 507, 5882 [NASA ADS] [CrossRef] [Google Scholar]

[93] Tarumi, Y., Hartwig, T., & Magg, M. 2020, ApJ, 897, 58 [NASA ADS] [CrossRef] [Google Scholar]

[94] Taylor, M. B. 2005, in Astronomical Society of the Pacific Conference Series, 347, Astronomical Data Analysis Software and Systems XIV, eds. P. Shopbell, M. Britton, & R. Ebert, 29 [NASA ADS] [Google Scholar]

[95] Ting, Y.-S., Rix, H.-W., Conroy, C., Ho, A. Y. Q., & Lin, J. 2017, ApJ, 849, L9 [NASA ADS] [CrossRef] [Google Scholar]

[96] Ting, Y.-S., Conroy, C., Rix, H.-W., & Cargile, P. 2019, ApJ, 879, 69 [Google Scholar]

[97] Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978 [NASA ADS] [CrossRef] [Google Scholar]

[98] Weiler, M., Carrasco, J. M., Fabricius, C., & Jordi, C. 2023, A&A, 671, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[99] Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001 [NASA ADS] [CrossRef] [Google Scholar]

[100] Witten, C. E. C., Aguado, D. S., Sanders, J. L., et al. 2022, MNRAS, 516, 3254 [NASA ADS] [CrossRef] [Google Scholar]

[101] Xiang, M., Ting, Y.-S., Rix, H.-W., et al. 2019, ApJS, 245, 34 [Google Scholar]

[102] Xiang, M., Rix, H.-W., Ting, Y.-S., et al. 2022, A&A, 662, A66 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[103] Xiao, K., Yuan, H., Huang, B., et al. 2023a, ApJS, 268, 53 [NASA ADS] [CrossRef] [Google Scholar]

[104] Xiao, K., Yuan, H., López-Sanjuan, C., et al. 2023b, ApJS, 269, 58 [NASA ADS] [CrossRef] [Google Scholar]

[105] Xylakis-Dornbusch, T., Christlieb, N., Hansen, T. T., et al. 2024, A&A, 687, A177 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[106] Yamada, S., Suda, T., Komiya, Y., Aoki, W., & Fujimoto, M. Y. 2013, MNRAS, 436, 1362 [Google Scholar]

[107] Yan, H., Li, H., Wang, S., et al. 2022, The Innovation, 3, 100224 [NASA ADS] [CrossRef] [Google Scholar]

[108] Yao, Y., Ji, A. P., Koposov, S. E., & Limberg, G. 2024, MNRAS, 527, 10937 [Google Scholar]

[109] York, D. G., Adelman, J., Anderson, John E. J., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]

[110] Youakim, K., Starkenburg, E., Martin, N. F., et al. 2020, MNRAS, 492, 4986 [CrossRef] [Google Scholar]

[111] Zhang, X., Green, G. M., & Rix, H.-W. 2023, MNRAS, 524, 1855 [NASA ADS] [CrossRef] [Google Scholar]

[112] Zhao, G., Chen, Y.-Q., Shi, J.-R., et al. 2006, Chinese J. Astron. Astrophys., 6, 265 [NASA ADS] [CrossRef] [Google Scholar]

[113] Zhao, G., Zhao, Y.-H., Chu, Y.-Q., Jing, Y.-P., & Deng, L.-C. 2012, Res. Astron. Astrophys., 12, 723 [NASA ADS] [CrossRef] [Google Scholar]