Open Access
Issue
A&A
Volume 695, March 2025
Article Number A75
Number of page(s) 17
Section Catalogs and data
DOI https://doi.org/10.1051/0004-6361/202452871
Published online 11 March 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

The stellar metallicities [M/H] keep track of the formation and evolution of the Milky Way. Due to the chemical enrichment of the interstellar medium by successive stellar generations, [M/H] can be used as a proxy of age and reflects the birth environment of a star. Assuming that metal-rich disk stars are born on nearly circular orbits in the Galactic plane from chemically well-mixed cold gas, mono-abundance stellar populations with the same metal content ([M/H], [α/Fe]) shall be formed at the same birth radius Rb and look-back time τ (Schönrich & Binney 2009). This assumption has been used to establish a comprehensive chemodynamical model of our Galaxy (Sharma et al. 2021; Lian et al. 2022; Imig et al. 2023; Chen et al. 2023; Binney & Vasiliev 2024). Combined with kinematics, the metallicities of metal-poor halo stars serve as a powerful tool to separate in-situ populations from the remnants of ancient accretion events (Naidu et al. 2020; Belokurov & Kravtsov 2022; Conroy et al. 2022; Deason & Belokurov 2024). These newly found accreted substructures are useful to check the ACDM cosmological paradigm in which Milky Way–sized halos are built from the mergers of smaller satellite galaxies. Extremely metalpoor (EMP) stars may have formed from gas enriched only by the very first stars, also called Pop III (Klessen & Glover 2023). Therefore, their abundance patterns hold the key to constrain nucleosynthesis in the earliest supernova events (Nomoto et al. 2006; Heger & Woosley 2010; Aoki et al. 2014; Nomoto et al. 2013; Koutsouridou et al. 2023). The low-metallicity tail of the metallicity distribution function (MDF) provides essential constrains on the chemical enrichment in the early phases of the formation of the Milky Way (Salvadori et al. 2007; Komiya et al. 2010; Yamada et al. 2013; Sarmento et al. 2019; Tarumi et al. 2020; Youakim et al. 2020). However, at present only a few hundred stars with [M/H] < −3 and significantly fewer with [M/H] < −4 have been identified, and these are insufficient to give a complete picture.

We are witnessing an era in which stellar atmospheric parameters are available for massive samples thanks to large spectroscopic surveys, such as the Sloan Digital Sky Survey (SDSS) I-IV (York et al. 2000; Abazajian et al. 2004; Aihara et al. 2011; Abdurro’uf et al. 2022), the Large Sky Area MultiObject Fiber Spectroscopic Telescope (LAMOST) survey (Zhao et al. 2006, 2012; Cui et al. 2012; Luo et al. 2012), the GALactic Archaeology with HERMES1 (GALAH) survey (De Silva et al. 2015; Buder et al. 2021), the SDSS V (Kollmeier et al. 2017), the Dark Energy Spectroscopic Instrument (DESI) survey (Cooper et al. 2023), the 4-metre Multi-Object Spectroscopic Telescope (4MOST) survey (de Jong et al. 2019), or the WHT2 Enhanced Area Velocity Explorer (WEAVE) survey (Jin et al. 2024). These surveys have already accumulated tens of millions of stellar spectra, and continue to gather more. Combining the spectroscopic data with accurate measurements of proper motions and parallaxes provided by the European Space Agency (ESA) Gaia mission (Gaia Collaboration 2016, 2023b), enables us to carry out a detailed investigation of the chemodynamic properties of different stellar populations of the Milky Way.

In addition to the astrometric catalog, Gaia Data Release 3 (DR3; Gaia Collaboration 2023b) includes a catalog of 32.2 million high-resolution spectra (R ∼ 8000) centered on the near infrared Ca II triplet, obtained by the Radial Velocity Spectrometer (RVS) instrument (Cropper et al. 2018; Sartoretti et al. 2018; Andrae et al. 2023a). It also includes a catalog of 220 million low-resolution spectra, referred to as Bp/Rp spectra (hereafter XP spectra), obtained with the blue (wavelength range of 330– 680 nm) and red (wavelength range of 640–1050 nm) Gaia slitless spectrophotometers, BP and RP, respectively (Carrasco et al. 2021; Montegriffo et al. 2023; De Angeli et al. 2023). The XP spectra have 110 pixels and a variable resolving power, ranging from 20 to 90, as a function of wavelength. The XP spectra in DR3 are represented by Hermite basis functions (Montegriffo et al. 2023).

The Gaia Data Processing and Analysis Consortium (DPAC) has developed the Python library GaiaXPy3 to facilitate handling XP spectra. This software allows the transformation of coefficients into calibrated spectra and photometry. Recent studies have shown that the synthetic photometry from XP spectra can be used to perform a highly precise calibration of other photometry surveys, such as Pristine (Martin et al. 2024), the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) (Xiao et al. 2023a), or the Javalambre- Photometric Local Universe Survey (J-PLUS) (Xiao et al. 2023b; Chandra et al. 2024). However, other studies have revealed that XP calibrated spectra suffer from systematic errors that show up as “wiggles” (Huang et al. 2024a), introduced by the combination of noise and the choice of data representation (Weiler et al. 2023). The pattern of wiggles largely depends on the stellar color GBPGRP and the apparent magnitude G (Montegriffo et al. 2023), and may cause a modest bias in the synthetic photometry Huang et al. (2024a) and the inferred stellar parameters. Due to its low-resolution and the aforementioned systematics, it is hard to determine chemical abundances from XP spectra. Nonetheless, we are still able to obtain a reliable estimation of the overall metallicity ([M/H]) down to the EMP domain, as shown using mock XP spectra in Witten et al. (2022). Considering the large data volume and the full sky coverage, XP spectra provide us with a unique opportunity to build a complete metallicity map of the Milky Way.

Previous studies have adopted different methodologies to extract information from the low-resolution XP spectra. Traditional model-driven methods predict the atmospheric parameters by comparing the observed spectra with stellar spectral libraries (e.g., Castelli et al. 1997; Lejeune et al. 1998; Coelho et al. 2005; Gustafsson et al. 2008; Husser et al. 2013; Allende Prieto et al. 2018). As part of the astrophysical parameters inference system (Apsis), the Gaia General Stellar Parameterizer from Photometry (GSP-phot) adopts a Bayesian forward-modelling approach to fit XP spectra with isochrone models and four different stellar libraries (Andrae et al. 2023a). It proviides an estimate of effective temperature Teff, surface gravity logeff, metallicity [M/H], absolute magnitude MG, radius, distance, and extinction for each star. However, the authors of GSP-Phot advise against using these metallicity estimates since they are dominated by large systematic errors.

An et al. (2024) constructed an all-sky 3D extinction map by comparing the low-resolution XP spectra with their empirically calibrated synthetic spectra and gave a reliable estimate of metallicity for stars with [M/H] > −1. Martin et al. (2024) built a model linking the metallicity-sensitive synthetic CaHK magnitudes to the photometric metallicity [M/H]phot from XP spectra. Even though only [M/H] measurements are provided, their results show good agreement with the literature in the full metallicity range, even for stars under [M/H] = −3. Bellazzini et al. (2023) and Xylakis-Dornbusch et al. (2024) also tried to extract metallicity information from the synthetic photometry, but they only determined [M/H]phot for a small fraction of the XP spectra available in DR3. The intrinsic nature of the estimation of [M/H]phot also involves a comparison between synthetic spectra and observations, albeit indirectly, through the photometry generated from them. The accuracy of model-driven methods highly relies on the consistency between the XP spectra and the synthetic spectra. Unfortunately, there are several points that may induce discrepancies between them.

First, the theoretical spectra are limited by our knowledge of stellar atmospheres, atomic and molecular physics. Second, XP spectra are not perfectly flux-calibrated. Third, the information in the low-resolution spectra is highly sensitive to surface temperature Teff and gravity log ɡ. The metallicity determined from XP spectra with low (Teff < 4000 K) and high (Teff > 7000 K) temperature are likely unreliable, due to the limited strength of metal lines in stars of such low metallicity and high effective temperature.

In the last decade, data-driven and machine learning methods have been widely adopted to overcome the gap between synthetic spectra and observations (e.g., Ness et al. 2015; Ting et al. 2017, 2019; Xiang et al. 2019; Leung & Bovy 2019). Data-driven methods use a large training sample with known labels to construct a relationship between the stellar parameters and the spectra. Previous studies find that data-driven methods are effective to derive precise atmospheric parameters and elemental abundances for low-resolution spectra by applying them to the LAMOST survey (Wilson et al. 2019; Li et al. 2022b; Li &Lin 2023). Therefore, it is natural to turn to data-driven methods to infer stellar properties from XP spectra (e.g., Rix et al. 2022; Andrae et al. 2023b; Zhang et al. 2023; Sanders & Matsunaga 2023; Li et al. 2024; Xylakis-Dornbusch et al. 2024; Laroche & Speagle 2024; Leung & Bovy 2024; Yao et al. 2024; Avdeeva et al. 2024; Fallows & Sanders 2024).

Rix et al. (2022) derived the stellar metallicity for a sample of 2 million giant stars within 30° of the Galactic center using the XGBoost algorithm. They achieved a remarkably median precision of δ[M/H] < 0.1 by adopting the SDSS DR17 (the Apache Point Observatory Galactic Evolution Experiment, shortened as APOGEE; Abdurro’uf et al. 2022) as the training sample. Andrae et al. (2023b) applied the XGBoost algorithm to the whole catalog and estimated the stellar atmospheric parameters (Teff, log ɡ, [M/H]) for over 175 million stars. They have a mean stellar parameter precision of 0.1 dex in [M/H], 50 K in Teff, and 0.08 dex in log ɡ. Besides APOGEE DR17, they also included the metal-poor stars from Li et al. (2022a) in the training sample to break the low-metallicity boundary.

Zhang et al. (2023) developed an empirical forward model using a neural network (NN) to estimate the stellar atmospheric parameters, distance, and extinction for 220 million stars from XP spectra. Given the stellar parameters, they could provide a prediction of the XP spectra. Laroche & Speagle (2024) developed a novel implementation of a variational auto-encoder which achieves competitive XP spectra reconstructions without relying on stellar labels. Their results suggest that meaningful information related to [α/Fe] is hidden in the XP spectra. Recently, Li et al. (2024) and Hattori (2024) successfully extracted the information on [α/Fe] for partial XP spectra by using machine learning models trained on APOGEE DR17. In addition, Leung & Bovy (2024) built a transformer-based model and trained it in a self-supervised manner on a compiled data set. Differently from previous studies, they could not only derive stellar parameters from XP spectra, but also predict the XP spectra given the stellar labels. In general, data-driven methods are efficient and accurate in analyzing the XP spectra, and have allowed the deviation of several parameters, like [α/Fe], that are hard to derive using traditional model-driven methods at such low spectral resolution.

Although data-driven methods have the potential to achieve great success in exploring the low-resolution XP spectra, their output is highly constrained by the quality of the training data. Most studies adopted APOGEE DR17 as a training sample, and several selection criteria were applied to ensure the quality of the stellar labels from APOGEE. Due to those cuts, the Teff of the training sample is mainly in the range of 3500 K to 7000 K. Therefore, their constructed models are not applicable to stars outside this temperature range. To cover more spectral types in the training sample, Zhang et al. (2023) combined the standard AFGK catalog from LAMOST DR8 and the hot stars (high Teff) catalog from the Hot Payne (Xiang et al. 2022). However, there are systematic differences between these two catalogs, and their model prediction exhibits a large scatter from the literature for stars of Teff > 7500 K. Besides the limitation in Teff, both the APOGEE and LAMOST catalogs have a low boundary of metallicity at [M/H] = −2.5, which prevents the application of the method to more metal-poor stars. Despite the inclusion of more metal-poor stars from other surveys can remove the low-metallicity boundary, the very small size of the training sample prevents an appropriate coverage of the parameter space, and the metallicity estimations only show a very small improvement at [M/H] < −2.5. As a comparison, the high-resolution spectroscopic follow-up of metal-poor candidates selected from the catalogs of [M/H]phot demonstrated the potential of identifying EMP stars through traditional model-driven methods Xylakis-Dornbusch et al. (2024); Mardini et al. (2024).

In this study, we combine data-driven and model-driven methods to construct a catalog (Teff, log ɡ, [M/H]) for about 68 million stars with XP spectra. To reduce the influence of systematic errors (wiggles), we develop a NN model which corrects the XP spectra according to their stellar labels in Section 4. This correction helps in overcoming the gap between synthetic spectra and observations, and the stellar atmospheric parameters predicted from the corrected spectra are more reliable, as shown in Section 5. The paper is summarized in Section 6.

2 Data

We determine and study systematic errors in the Gaia XP fluxes using a sample of XP spectra cross-matched with the final release of APOGEE, included in the DR17 of SDSS. We correct the XP spectra using the systematic pattern detected in the APOGEE sample, and check the impact on the inferred stellar atmospheric parameters (comparing to APOGEE) and on the accuracy of the fluxes (comparing to CALSPEC data; Bohlin et al. 2014, 2019; Bohlin & Lockwood 2022). In this section, we describe each reference catalog in detail.

2.1 Gaia XP spectra

Since XP spectra are described by a set of polynomial coefficients, we use the Python library GaiaXPy to transform these coefficients into flux-calibrated spectra and synthetic photometry. GaiaXPy provides an option to represent the spectrum with a smaller set of basis functions, avoiding the noise associated to higher-order terms. In addition to this truncation, we apply a cut in the wavelength range, eliminating regions where the transmission is less than 10%. To remove the influence of the dust extinction, we use the Python package extinction4 adopting the ccm89 model (Cardelli et al. 1989). The reddening for each individual spectrum is calculated from the two-dimensional SFD dust map (Schlegel et al. 1998; Schlafly & Finkbeiner 2011) using the Python package dustmap (Green 2018).

The data analysis code FERЯ (Allende Prieto et al. 2006)5, which matches numerical models to the observations by minimizing the χ2, is used to search for the optimal stellar atmospheric parameters (Teff, [M/H], and log ɡ). Several algorithms are available in FERЯ, and the Powell’s UOBYQA (unconstrained optimization by quadratic approximation) algorithm (Powell 2002) is adopted in our work, initializing the search at the grid center. This is a local algorithm, which may lead to incorrect solutions in some cases, but it is chosen due to its high speed, critical for very large samples. We tested applying a Markov chain Monte Carlo algorithm (MCMC, algor= 5 in FERЯ), which performs a global search, to find out that the results are very similar to those obtained with the algorithm adopted in this work. We therefore conclude that Powell’s UOBYQA algorithm are both efficient and accurate in this case. We computed a fresh library of model spectra with constant resolution using Synple6, and Kurucz model atmospheres as in the nsc library of Allende Prieto et al. (2018), spanning −5 < [M/H] ≤ +0.5, 3500 K ≤ Teff < 8000 K, and 1 ≤ log ɡ < 5. This library is publicly released together with this paper. For the model with variable resolution, the upper limit of Teff is 12 000 K.

Gaussian convolution is performed to reduce the resolution of the finely sampled spectral synthesis calculations and match it to the XP data. In this study, we adopt two different resolutions for the model spectra: one is a constant resolution of R ~ 104, and the other is a variable resolution that changes with the wavelength as the XP spectra (refer to Table 1 in Montegriffo et al. 2023). For the constant resolution model, the corresponding sampled XP spectrum (Sconst) ranges from 360 to 990 nm, and has 330 points evenly spaced on a logarithmic scale. For the variable resolution, the corresponding XP spectrum (Svar) ranges from 360 to 992.1 nm, and has 270 points in increments of the half of Full width at half maximum (FWHM=λR$${\rm{FWHM}} = {\lambda \over R}$$, where λ is the wavelength), based on information from Montegriffo et al. (2023, see also Carrasco et al. 2021).

FERЯ returns the best fitting spectra that most closely resemble the XP observations. The spectra are normalized by their mean values as in Equation (1), and we define the residual N∆ Flux as the difference between the normalized fitting (NFluxfitting) and XP (NFluxXP) spectra.

NFlux = Flux / Mean(Flux )NΔ Flux =NFlux fitting NFlux XP$\matrix{ {N{\rm{Flux}} = {\rm{Flux}}/{\rm{Mean(Flux}})} \cr {N{\rm{\Delta Flux}} = N{\rm{Flux}}{{\rm{}}_{{\rm{fitting}}}} - N{\rm{Flux}}{{\rm{}}_{{\rm{XP}}}}} \cr } $(1)

In previous studies of Montegriffo et al. (2023) and Huang et al. (2024a), they also gave a similar definition of the systematic error as (FluxXP − Fluxref)/Fluxref or Fluxref /FluxXP, where Fluxref is the corresponding reference spectra from external data. All the fluxes we discuss hereafter are normalized, and the symbol N will be dropped for clarity. The general shapes of the XP spectra are consistent with the best fitting models in most cases. The residuals are presented in a complex function that oscillates with the wavelength as shown in Figure 1, and its amplitude is usually larger in the blue band than the red band. In this study, we refer to these residuals as wiggles.

Visual inspection indicates that stars with similar parameters exhibit similar wiggles. To fully understand what are the main parameters that control the pattern, we randomly select 800 000 stars at different Galactic latitude: 200 000 at low-latitude (|b| < 10°), 300 000 at medium-latitude (10° ≤ |b| < 30°), and 300 000 at high-latitude (30° < |b| < 90°). The corresponding Fluxfitting and ∆Flux of these stars are obtained using FERЯ. The randomly selected XP sources have an extended distribution in the color-magnitude diagram (CMD), with −0.5 < GBPGRP < 3.5 and 4 < G < 17.5. In Section 3 we explore the impact of stellar colors, apparent magnitude, latitude, and metallicity on the wiggles using these randomly selected XP sources.

thumbnail Fig. 1

Density distribution of residuals as a function of wavelength for stars with different parameters. Subfigures (a), (b), and (c) are results of stars in bins of the same magnitude range 13 < G0 < 15 but different stellar colors. Subfigures (b), (d), and (e) show the density distribution of residuals for stars within the same range of color 1.0 < (GBPGRP)0 < 1.2 but different G0. Subfigure (f) shows the residuals of stars of the same color-magnitude as subfigures (b) but in different Galactic latitude (|b| < 10°). The black dash lines represent the P50 percentiles distributions, which can be viewed as a robust representation of the patterns of wiggles. From the P50 lines we can see that the wiggles change with (GBPGRP)0 and G0 , but the stellar color has a larger impact than the magnitude. Stars of low latitude have a much more diffuse distribution of ∆Flux. The large uncertainties of the extinction map towards the Galactic disc may cause a bad extinction correction of FluxXP for some stars. Therefore, FluxXP is more likely to deviate from the fitting result Fluxfitting for stars at lower latitude.

2.2 Metal-poor stars sample

The search for metal-poor stars is one of the most interesting applications of XP data, and therefore we carry out specific evaluations for that type of stars. We select metal-poor stars from three libraries: JINAbase (Abohalima & Frebel 2018), the Pristine survey (Aguado et al. 2019), and the LAMOST-Subaru metal-poor survey (Li et al. 2022a; Aoki et al. 2022). JINAbase is a collection of chemical abundances and stellar parameters for 1659 metal-poor stars (60% of which have [M/H] < −2.5) from the literature, published in the period between 1991 and 2016. The Pristine survey includes a medium-resolution spectroscopic follow-up of 1007 metal-poor candidates identified from the narrow-band photometry, and more than 900 stars which have been confirmed to have [M/H] < −2. The LAMOST-Subaru survey presents measurements for over 20 elements in 385 stars covering a wide metallicity range from −1.7 to −4.3. After crossmatching with the XP spectra and applying a cut of [M/H] < −2, our final metal-poor sample consists of 1813 stars. Most of them are at high-latitude, with E(BV) < 0.1. The colors and magnitudes of these stars are roughly in the range of 0.5 < GBPGRP < 1.5 and 6 < G < 17.5.

2.3 APOGEE DR17

APOGEE (Majewski et al. 2017; Wilson et al. 2019) is a large-scale spectroscopic survey of stars in the Milky Way. APOGEE provides high-resolution (R = λλ ~ 22 500) and high signal-to-noise ratio (S/N > 70 typically) spectra throughout the near infrared wavelength range of 1.51−1.70 µm.

The stellar atmospheric parameters and elemental abundances are determined by the APOGEE Stellar Parameters and Chemical Abundances Pipeline (ASPCAP; García Pérez et al. 2016) based on FERЯ. APOGEE provides high-precision measurements of the kinematics and chemistry of the Milky Way structures (bulge, disk, and halo). The Seventeenth Data Release of the APOGEE survey contains spectra and abundances for 733 901 stars (Abdurro’uf et al. 2022).

2.4 CALSPEC libraries

Considering the wide wavelength coverage of the XP spectra, we choose the CALSPEC library (Bohlin et al. 2014, 2019; Bohlin & Lockwood 2022) as the reference spectra. CALSPEC is a library of flux standards on the Hubble Space Telescope (HST) system. Most of them have a complete HST’s Space Telescope Imaging Spectrograph (STIS) coverage of wavelength from the ultraviolet to near-infrared band with a resolving power R ≈ 560–700. We find 109 CALSPEC stars with XP spectra, and their colors and magnitudes are mainly in the range of −1.0 < GBPGRP < 1.5 and 4 < G < 18. The CALSPEC collection of spectra are likely among the most accurate flux calibrated spectra available, with a 2–3% accuracy in absolute flux. The uncertainty of the monochromatic flux at 555.75 nm (555.6 nm in air) is 0.5% or 0.005 mag see Bohlin et al. 2014, and the CALSPEC Calibration Database).

2.5 LAMOST

LAMOST is a ground-based innovative telescope designed with a large aperture and a wide field-of-view (Zhao et al. 2006, 2012; Cui et al. 2012; Luo et al. 2012). Beginning with a pilot survey in 2011, LAMOST has observed over ten million spectra in the Northern sky with a limiting magnitude of r = 17.8 (Yan et al. 2022).

LAMOST Low-Resolution Spectroscopic (LRS) survey provides low-resolution (R ∼ 1800) spectra with a wavelength coverage of 0.37-0.90 µm. In this study, we use the LRS Stellar Parameter Catalog of A, F, G and K stars from the LAMOST DR11, which contains the atmospheric parameters of 7 774 147 stars with a typical error of ~43 K for Teff, 0.06 dex for log ɡ, and 0.04 dex for [M/H].

3 Pattern of wiggles

Comparison between XP spectra and model predicted flux, Montegriffo et al. (2023) shows that the residuals are correlated with GBPGRP color and G magnitude. The stellar color is closely associated with surface temperature, which plays a critical role in shaping the spectra energy distribution (SED) through photoionization (Allende Prieto 2023), mainly through hydrogen atoms and the H ion. The dependence of the wiggles on magnitude may likely be caused by the complex internal calibration of XP spectra. As illustrated by Andrae et al. (2023a), the spectra of stars brighter than G = 11.5 are recorded in 2D windows, which have different read-out configuration (gates) depending on magnitude, while fainter stars are mainly observed as 1D windows and under a more uniform readout strategy. Montegriffo et al. (2023) noted that a built-in assumption in the external calibration (which involves calibrating the internally calibrated, continuously represented mean spectra to an absolute system) is that the instrument model is independent of brightness (or magnitude). Therefore, it cannot easily account for inconsistencies caused by the different observing modes, which may finally result in a correlation between residuals and G magnitude.

Inspired by previous studies, we randomly distribute our selected XP sources, limited to E(BV) < 2, into different bins in de-reddened (GBPGRP)0 color and G0 magnitude. Figure 1 shows the density distribution of residuals as a function of wavelength for stars with different values of (GBPGRP)0, G0, and Galactic latitude b. We find that the wiggles change dramatically with color. For stars in the same color bins, we can see some scatter, but most of them have a very similar pattern, especially for stars in the range 1.0 < (GBPGRP)0 < 1.2.

Panels b, d, and e of Figure 1 show the distributions of residuals for stars with the same color but different magnitude G0. The residuals’ pattern clearly changes with G0 as well, although to a lesser extent than with (GBPGRP)0. The stellar color is a major determining factor of the SED, while the apparent magnitude G0 is connected to the SED through the instrument model. Since the wiggles in the residuals largely depend on the SED, it is only natural that the stellar color plays a much more important role. For stars in the same color-magnitude bin but located at lower Galactic latitudes, the general pattern is similar, but the residuals have a much more diffuse distribution. We conclude that the stellar color GBPGRP, apparent magnitude G, and dust extinction E(BV) should be considered when modeling the observed systematic patterns.

Huang et al. (2024a) also considered the intrinsic color and extinction in their proposed corrections, but disregarded the weaker effects of [M/H] and log ɡ. This choice simplifies the correcting process, focusing on the accuracy of the correction for most stars. However, the libraries they used mainly consist of stars with [M/H] > −1, and the SEDs of very metal-poor (VMP) stars are quite different from those of metal-rich ones. To fully understand the influence of [M/H], we adopt the sample from Section 2.2, and obtain its density distribution of residuals in Figure 2. We can see that the largest differences are concentrated on the blue band, where the residuals of VMP stars have a smaller amplitude, with the maximum absolute value Max(|P50|) < 0.08. In general, the pattern of fitting residuals for VMP stars is far more featureless than for metal-rich stars. Since one of our aims is to make an accurate estimation of metallicity from the wiggle-corrected XP spectra, we should take extra care with the influence of [M/H].

thumbnail Fig. 2

Density distribution of residuals as a function of wavelength for VMP stars. Top panel shows the density distribution of the residuals of VMP stars with 1.0 < GBPGRP < 1.2 and 13 < G < 15. Compared to the relatively more metal-rich sample in Figure 1, their wiggles have a smaller amplitude with Max (|P50|) < 0.08. The bottom panel shows the difference of P50 between these two samples. The main difference is in the blue band of λ < 6500 Å, while the wiggles in the red band is little affected by the change of metallicity.

4 Method

Our aim is to correct the wiggles observed in the residuals for a given spectrum and calculate atmospheric parameters by fitting the observations with model spectra using FERЯ. Therefore, we first need to characterize the systematic patterns as a function of color, magnitude, reddening and metallicity. This is what machine learning is good at. Therefore, we use the main relevant parameters as input into a neural-network model with multiple hidden layers, where the residuals from the fits are the output. We may not have the information of metallicity for a given star. Therefore, we strongly prefer to avoid using metallicity as input, and instead adopt various metallicity-sensitive photometric indices.

It is important to stress that there is a limited number of spectra with metallicity below −2.5 to train the NN model, so we do not trust and do not apply the derived correction to stars with metallicity under −2.5. In this section, we first describe how we predict the patterns via the NN, including the training dataset and the architecture of our NN model. Then we provide additional details on how we correct and fit all XP spectra in Gaia DR3.

4.1 Training database

To ensure the patterns in the residuals we characterize really reflect the differences between the XP and the theoretical spectra, we only use spectra with reliable estimations of atmospheric parameters, judging from the differences with the APOGEE parameters. We refer to the resulting sample as the training APOGEE sample (TAS) hereafter. The APOGEE sample is firstly cleaned by applying the cuts below, where the symbol A indicates APOGEE parameters: S/N >70;σTeffA250,σlog gA0.5;3500TeffA8000 K or 12000 K, 1log gA5.$ - \hfill & {{{\rm{S}} \mathord{\left/ {\vphantom {{\rm{S}} {{\rm{N > 70;}}}}} \right. \kern-\nulldelimiterspace} {{\rm{N > 70;}}}}} \hfill \cr - \hfill & {{{\rm{\sigma }}{{\rm{T}}_{{\rm{eff}}}^{\rm{A}}}} \le {\rm{250,}}{{\rm{\sigma }}{{\rm{log}}\,{{\rm{g}}^{\rm{A}}}}} \le {\rm{0}}{\rm{.5;}}} \hfill \cr - \hfill & {{\rm{3500}} \le {\rm{T}}_{{\rm{eff}}}^{\rm{A}} \le {\rm{8000}}\,{\rm{K}}\,{\rm{or}}\,{\rm{12000}}\,{\rm{K,}}\,{\rm{1}} \le {\rm{log}}\,{{\rm{g}}^{\rm{A}}} \le {\rm{5}}{\rm{.}}} \hfill \cr $

We adopted two different values as the upper limits of TeffA$$T_{{\rm{eff}}}^{\rm{A}}$$ for the two models with different resolutions. The upper limit is 8000 K for Sconst and 12000 K for Svar. In the APOGEE DR17 catalog, there are some sources with Gaia source id = 0, which are dropped in our APOGEE sample. In addition, for sources with the same Gaia source id, we only keep the first one (~10% sources are duplicated). Then we obtain the initial parameters for the APOGEE sample (IAS). Next, we calculate the atmospheric parameters for XP spectra using FERЯ. After deriving Teff, log ɡ, and [M/H], we trim down the IAS applying the following constraints: | TeffXPTeffA |<200| loggXPloggA |<0.5| [ M/H ]XP[ M/H ]A |<0.5,$ - \hfill & {\left| {T_{{\rm{eff}}}^{{\rm{XP}}} - T_{{\rm{eff}}}^{\rm{A}}} \right| < 200} \hfill \cr - \hfill & {\left| {\log \,{g^{{\rm{XP}}}} - \log \,{g^{\rm{A}}}} \right| < 0.5} \hfill \cr - \hfill & {\left| {{{\left[ {{{\rm{M}} \mathord{\left/ {\vphantom {{\rm{M}} {\rm{H}}}} \right. \kern-\nulldelimiterspace} {\rm{H}}}} \right]}^{{\rm{XP}}}} - {{\left[ {{{\rm{M}} \mathord{\left/ {\vphantom {{\rm{M}} {\rm{H}}}} \right. \kern-\nulldelimiterspace} {\rm{H}}}} \right]}^{\rm{A}}}} \right| < 0.5,} \hfill \cr $

where TeffXP,log  gXP,[M/H]XP$T_{{\rm{eff}}}^{{\rm{XP}}},\log \,\,{g^{{\rm{XP}}}},{[{\rm{M}}/{\rm{H}}]^{{\rm{XP}}}}$, are parameters estimated from the Gaia XP spectra by FERЯ. Finally, our TAS includes 157 478 stars for Sconst, and 131 173 stars for Svar.

4.2 Input and output

As mentioned before, the shape of the residuals is highly related to the color, magnitude, and reddening of a star, and to a lesser extent, to its metallicity. Therefore, we use various of colors and magnitudes, including metallicity-sensitive colors, generated from XP spectra, as input for our NN model. We employ GaiaXPy and the built-in photometric system (Gaia Collaboration 2023a) to obtain the SkyMapper magnitudes u, υ, ɡ, i, and the metallicity-sensitive colors (Chiti et al. 2021) ɡi, υɡ − 0.9 × (ɡi), and uυ − 0.9 × (ɡi). In addition, the reddening E (BV) calculated from dustmaps is included as one of the parameters.

To summarize, the 14 parameters involved are:

  • metallicity-sensitive colors: ɡi, υɡ − 0.9 × (ɡi), uυ − 0.9 × (ɡi).

  • SkyMapper photometric passbands: u, υ, ɡ, i.

  • Gaia photometric passbands and colors: phot ɡ mean mag, phot bp mean mag, phot rp mean mag, bp rp, bp ɡ, rp ɡ

  • reddening: E (BV).

The output is the predicted flux corrections as a function of wavelength.

4.3 Neural network architecture

A simple NN model based on Pytorch (Paszke et al. 2017, 2019)7 is built for training with the TAS. The basic diagram illustrating our NN model is summarized in Figure 3. Briefly, we have 7 hidden layers in our model, and the number of neurons per hidden layer is shown in the figure. We have 14 input elements, as described above. The length of output for each target is 330 (for Sconst, and 270 for Svar). The loss function and optimizer adopted in our model are MSELoss and Adam. The initial learning rate for training is set to 0.001, and it is reduced to half that value after 15 epochs, with the lower limit set to 10−6 . In order to prevent overfitting, we use EarlyStopping in our model.

The TAS data set is divided into a training sample (60%), a validation sample (20%), and a testing sample (20%). When the loss curve for the validation sample is not declining after 25 epochs, the train process is stopped. Our finally adopted model is trained on an Nvidia RTX 4090 in about 5 minutes for 234 epochs.

Our NN model performs very well in predicting the pattern in the residual for a given star. In Figure 4, we demonstrate the predicted wiggles for 20% sources in the TAS. As we can see from the figure, the predicted wiggles show significantly less scatter than the actual wiggles, but the mean pattern in the actual wiggles is clearly shown in the predicted ones. The bottom panel of this figure also indicates the patterns are largely removed after applying the correction.

thumbnail Fig. 3

Diagram of our NN model based on Pytorch.

4.4 Fit all XP spectra

To determine atmospheric parameters for all the stars with Gaia XP spectra, we use FERЯ. This process is performed in three steps:

  • (1)

    We fit all de-reddened XP spectra with FERЯ and arrive at an approximate estimation of Teff and [M/H]:

    • Stars with extinction AV > 15 are removed; (This value is set quite high, as we do not intend to exclude any spectrum for its high reddening. The limit is primarily in place to prevent calculation errors. If we lower this value to 1.0 or 2.0, it will not exclude many more stars, as more than 99.2% of the stars in our final catalog have AV < 2.0.)

  • (2)

    We use the NN model to correct the systematics in the spectral energy distribution (wiggles):

    • Spectra for which we are not able to generate the full set of synthetic photometry are not corrected (The reason is explained later);

    • Spectra with Teff in the first estimation larger than 8000 K will not be corrected (only activated during the process of fitting spectra with variable resolution models);

    • Spectra with [M/H] < −2.5 in the first estimation will not be corrected.

  • (3)

    We fit the XP spectra with FERЯ again.

Some spectra cannot produce all the passbands we require, typically the u and υ bands of SkyMapper in our test, with the u band being the most affected. According to Table 1 in Onken et al. (2024), the central wavelength and filter FWHM for the u band are 350 nm and 43 nm, respectively, while for the v band, these values are 384 nm and 31 nm. The starting wavelength of XP spectra is 330 nm, covering most wavelength range of the u band and the full wavelength range of the υ band. However, GaiaXPy fails to generate these two bands for some spectra, primarily for faint stars, though not all faint stars are affected by this issue.

thumbnail Fig. 4

Comparison between the real patterns from TAS (20% of the whole sample, top panel) and the NN model predicted patterns (middle panel). The bottom panel shows the difference between the real and predicted patterns, from which we can see that the wiggles disappear.

thumbnail Fig. 5

Testing on atmospheric parameters using sampling Sconst. Top panels: comparison of Teff, logg and [M/H] estimated from original XP spectra (Y -axis) and APOGEE survey (X-axis). Middle panels: similar to the top panels, but with Y-axis replaced by the results from corrected XP spectra. The color bar in each subfigure displays the number density. Bottom panels: histograms present the differences of Teff, log ɡ and [M/H] between XP and APOGEE before and after correcting the systematic patterns. The mean values and standard deviations are shown in the labels.

5 Results

In this section, we discuss the estimation of parameters for the IAS after correcting the systematic pattern predicted by our NN model trained on the TAS data set. As described below, this correction makes the XP spectra fit the model spectra better, and in particular, we find a significantly better agreement with the HST CALSPEC observations.

5.1 Stellar atmospheric parameters

Besides determining the absolute flux, we find that the correction of the systematic pattern can improve the estimation of stellar atmospheric parameters from XP spectra. We first calculate parameters from XP spectra with sampling Sconst. In Figure 5 we show the distributions of the stellar atmospheric parameters (Teff, log ɡ, [M/H]) inferred from the original and wiggle-corrected spectra, comparing with parameters from the APOGEE survey. The top and middle panels compare parameters from XP and APOGEE before and after the correction. We keep stars with 4000 ≤ Teff ≤ 7000 K to compare with APOGEE, because our analysis for stars with lower and higher temperatures performs worse. In addition, the stars in this figure have passed the selection based on the quality flag dflux_per, which describes the percentage of data points from ∆Flux ≡ FluxXP − Fluxmodel exceeding ±0.05. We retain stars with dflux per below 20% before the correction and 8% after the correction. It is worth noting that more stars remain after correction, as the correction improves the fit between the XP spectra and the models. As a result, even with more stringent selection criteria, we still obtain more stars than before the correction.

Figure 5 shows that our estimations are highly consistent with APOGEE’s parameters both before and after corrections within the range 4000 ≤ Teff ≤ 7000 K. But comparing the middle panels with the top panels, one can easily find that estimations for Teff , log ɡ and [M/H] become somewhat better after correction, especially when examining the differences in log ɡ and [M/H]. Correcting the systematic pattern not only reduces the biases in Teff and log ɡ, but it also makes our results more precise, as suggested from the smaller dispersion relative to APOGEE results.

We provide below a brief summary of the changes in the mean value and standard deviation of ∆Teff, ∆ log ɡ, and ∆[M/H] for the nominal S const analysis:

  • Teff: from −58.17 ± 170.61 to −38.40 ± 167.21;

  • ∆ log ɡ: from 0.11 ± 0.53 to 0.05 ± 0.40;

  • ∆[M/H]: from −0.09 ± 0.22 to −0.12 ± 0.19.

The most significant improvement is in the estimation of log ɡ, where the dispersion σ∆ log ɡ is reduced from 0.53 to 0.40.

We also calculate parameters for Svar for the same sample of stars (IAS) with the same selection rules. We find that ∆Teff , ∆ log ɡ, and ∆[M/H] are very similar with previous results using sampling Sconst, summarized as follows.

  • Teff: from −62.11 ± 177.21 to −50.28 ± 163.46;

  • ∆ log g: from −0.09 ± 0.73 to −0.05 ± 0.47;

  • ∆[M/H]: from −0.13 ± 0.23 to −0.14 ± 0.19.

Despite the overall results are quite similar for the two different samplings, those for Sconst before correcting patterns are slightly better. After correcting the patterns, we find that ∆Teff , ∆ log ɡ, and ∆[M/H] from the two samplings are similar. However, the constant sampling still seems to provide better results, especially for Teff and log ɡ. Therefore, we adopt constant sampling in the following sections and our series of papers.

5.2 Flux corrections

Correcting the wiggles has a significant impact on the fitting of XP spectra. We examine the log102) and RMS (root mean square) distributions and both parameters are significantly reduced, as shown in Figure 6 for the stars in the IAS. The RMS is defined: RMS=1ni(FiXP-Fimodel)2$\mathrm{RMS} = \sqrt{\frac{1}{n} \sum_{i} \left( F^{\mathrm{XP}}_{i} - F^{\mathrm{model}}_{i} \right)^{2}}$(2)

where FXP, Fmodel are the fluxes of XP spectrum and model spectrum, i indicates the points of sampling, and n is the number of data points. Our correction makes log102) and RMS becoming way much smaller. As shown in the bottom panel of Figure 6, the peak of the RMS histogram decreases from 3.7% to 1.2%. In the following we check whether the correction helps in bringing the XP spectra closer to the CALSPEC data, which have a superb flux calibration.

thumbnail Fig. 6

Histograms present log102) and RMS between the XP spectra and model spectra for the IAS data set before and after correcting the pattern, indicating by blue and orange, respectively.

5.3 The CALSPEC library

Given the excellent absolute flux calibration of the CALSPEC spectra, we use this library to perform an independent check of our proposed corrections. Since they have significantly larger resolving power, we apply Gaussian convolution to match their resolution to the XP spectra (and our model spectra). Then, we perform an interpolation of the smoothed spectra to match the sampling of XP data, and correct extinction in the same way we do for the XP spectra. Only spectra of stars estimated to have Teff < 104 K are used in this comparison, based on the parameter teff_gspphot in Gaia DR3.

We first check the RMS between the XP spectra and the spectra from CALSPEC, comparing the results before and after the correction. All spectra from XP and CALSPEC are dereddened with the same method (see in Section 2). The results for stars without quality cuts are shown in the left-hand panels of Figure 7, and the right panels display the results for stars that have passed the quality cuts. We find that the RMS with the correction of the pattern is overall reduced, with only one exception for stars that have passed the quality cuts. From the median RMS values, indicated by the red dashed lines in the figure, our correction reduces the RMS from 3.9% to 3.2%, or from 3.2% to 2.4%, depending on whether quality cuts are applied. Overall, the XP spectra are fit better with CALSPEC after correcting the systematic pattern. We also use the package provided by Huang et al. (2024a) to correct XP spectra as a comparison, shown in the two subfigures in the bottom panels. We de-redden and interpolate the spectra corrected by the package to match the sampling used in this paper, because the default sampling in our work differs from that of Huang et al. (2024a). From the orange and green histograms, shown in the middle and bottom panels in Figure 7, the overall improvements from our correction and the correction by Huang et al. (2024a) are comparable, regardless of whether quality cuts are applied. We should also mention that the resolution of our model spectrum is not exactly the same as that of the XP spectrum, as we use a constant resolution of around 100. However, the typical resolution of XP spectra is below ∼100. Consequently, the XP spectrum may be over-corrected, with some wiggles potentially arising from the higher resolution of models adopted in this paper. However, we have checked the results using variable resolution and reach the same conclusions.

We also present plots for a few individual XP spectra comparing them to CALSPEC in Figure 8. After correcting the pattern, the XP spectra match better the CALSPEC data smoothed to the resolution of the Gaia spectrophotometry. Nonetheless, there are some wavelengths at which the corrected version gives poorer agreement with CALSPEC. It is worth noting that the resolution adopted in Huang et al. (2024a) is lower than that used in this study, which accounts for some of the observed differences between the corrected spectra by Huang et al. (2024a) and CAL-SPEC at the blue end of the spectra. We have also repeated the check with the Next Generation Spectral Library (NGSL; Gregg et al. 2006; Pal et al. 2023), but the improvement is not so clear, which we associate to the lower quality of the flux calibration of the library compared to the exquisite CALSPEC data.

thumbnail Fig. 7

Distribution of RMS between CALSPEC and XP spectra. Left panels: histograms present RMS between the XP spectra and the spectra from CALSPEC libraries before and after correcting patterns, indicating by blue and orange, respectively. The green histogram shows the corresponding results obtained using the Python package from Huang et al. (2024a) to correct the XP spectra. Right panels: similar to those panels in the left, but with stars that have passed the quality cuts applied in Section 5.1. The red dashed lines represent the median values, which are also indicated by the number displayed in each panel.

5.4 Atmospheric parameters catalog of Gaia XP spectra

In Section 4.4 we describe our analysis of the whole sample of Gaia XP spectra. Using the same restrictions in Section 5.1 (4000 ≤ Teff ≤ 7000 K, and a maximum of 8% or 20% of the data points with ΔFlux larger than 0.05), we build our catalogs of stellar parameters for stars with Gaia XP spectra. We have a global catalog containing 68 394 431 stars, and a metal-poor catalog containing 124 188 stars that have [M/H] < −2.5 and AV ≤ 1.5.

thumbnail Fig. 8

Examples of how pattern correction can help improve the fitting between the XP spectra and the CALSPEC. Three spectra with different temperatures are presented in the sub-panels, from top to bottom: BD+54 1216, HD 115169, and KF06T2 in CALSPEC. In each panel, the smoothed CALSPEC spectrum is depicted as a bold gray line, while the XP spectra, with and without correcting patterns, are shown in blue and orange, respectively. The green lines show the corrected spectra using the package provided by Huang et al. (2024a). To enhance readability, we include a few zoomed-in diagrams in each panel to highlight the improvements on spectrophotometry. The ∆Flux between XP and CAL- SPEC are also presented at the bottom of each panel.

5.4.1 Comparison with LAMOST

The global catalog has also been cross-matched with LAMOST DR11 low-resolution catalog using the Gaia source id, finding 3 006 606 common sources. We compare the atmospheric parameters Teff, log ɡ, and [M/H] between our results and LAMOST in Figure 9.

5.4.2 Comparisons with XP catalogs from the literature

We also compare our results with recent catalogs from literature. The details of the comparisons with Andrae et al. (2023b) and Zhang et al. (2023) are provided in Appendix A. The bottom line is that there is fair agreement between the three catalogs, suggesting that the uncertainties in our effective temperatures and metallicities are similar to those in the other catalogs, typically about 150 K and 0.2 dex, respectively, while our gravities are more uncertain than those in the other catalogs, about 0.4 dex for us but nearly 0.2 dex for the others. The analysis by Andrae et al. (2023b) and Zhang et al. (2023) are different in nature from ours. Not only they employ data-driven methods, while our results are obtained from synthetic spectra based on model atmospheres, but they make use of the trigonometric parallaxes from Gaia, while we do not. We plan to adapt our algorithm to make use of that valuable information, which can directly and dramatically improve the retrieved surface gravities, as well as constraining better the effective temperatures and metallicities. Nonetheless, our present results are independent from models of stellar structure and evolution.

The most used training datasets for machine learning, such as the catalogs from APOGEE and LAMOST, have a fairly restrictive lower boundary in metallicity, which limits their application to detect metal-poor star candidates. Additional comparisons with Andrae et al. (2023b) and Zhang et al. (2023) regarding VMP stars are also presented in Appendix A. In summary, our catalog offers some advantages in the parameter range beyond those covered by previous work, and has more clean and direct connection to physical models of stellar atmospheres.

thumbnail Fig. 9

Comparison of atmospheric parameters Teff, log ɡ, and [M/H] between our catalog and LAMOST DR11 low-resolution catalog.

5.4.3 Star clusters

We use open clusters (OCs) and globular clusters (GCs) to validate the metallicities in our catalog. Details of the comparison are outlined in Appendix B. Although there are some outliers in the metallicity distribution, as seen in GC NGC 3201, the overall results are in excellent agreement with the literature.

5.4.4 The final catalog

The first column in our global catalog is name, representing the Gaia source id of each source. Following this, we provide the atmospheric parameters Teff, log ɡ, and [M/H] as Teff, logg, FeH. The log10 χ2 value for each spectrum is also included in the column log10_chi2. Another parameter reflecting the quality of the atmospheres parameters in our catalog is dflux per, as discussed earlier in Section 5.1. This quality flag has already been utilized to select reliable results. Stricter criteria, such as dflux_per< 0.10 or < 0.12, can be applied to identify even more reliable sources, particularly for metal-poor stars. For the metal-poor catalog, we also include the extinction AV used in this paper, as we should exercise caution with the low-latitude high extinction stars. Additionally, the catalog has been crossmatched with Gaia to obtain various parameters from Gaia DR3, such as proper motion and magnitude. The global and metal-poor catalogs, along with the code for correcting the systematic patterns in the spectra, are made publicly available.

6 Summary

In this paper, we characterize the patterns of systematic errors present in the absolute-calibrated Gaia XP spectra, using a very large number of XP spectra and their best-fitting synthetic spectra based on model atmospheres. We find that those patterns depend on stellar colors, brightness, extinction, and metallicity. We present a simple NN that relates the systematic flux patterns with stellar color, magnitude, and extinction. The predicted patterns match those in the data very well. After correcting the wiggles, FERЯ is applied to derive atmospheric parameters from corrected Gaia XP spectra. Our methodology is validated from the comparison with APOGEE DR17 parameters and stars with observations in the HST CALSPEC collection.

Compared to APOGEE, our estimation of atmospheric parameters is accurate in the temperature range 4000 ≤ Teff ≤ 7000 K, with slight systematic errors and standard deviations around −38 ± 167 K, 0.05 ± 0.40 dex, and −0.12 ± 0.19 dex in Teff, log ɡ, and [M/H], respectively. The estimation of atmospheric parameters and spectra flux are both improved by correcting the systematic patterns. Our corrections improve the quality of the relative spectrophotometry of the Gaia XP data from 3.2–3.7% to 1.2–2.4%, as verified against our models and the high-quality CALSPEC standards. Our results are also compared with other catalogs generated from XP data in the literature, and the metallicity is validated through the use of star cluster members. Finally, we publish our atmospheric parameters catalog of 68 394 431 sources, with a metal-poor ([M/H] ≤ −2.5) subset including 124 188 stars. Our catalogs and flux-correction code are publicly available.

Data availability

Tables and codes are made publicly available at https://doi.org/10.5281/zenodo.14028588. The tables are available in electronic form at the CDS via anonymous ftp to cdsarc.cds.unistra.fr (130.79.128.5) or via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/695/A75

Acknowledgements

This study is supported by the National Key R&D Program of China under grant Nos. 2023YFE0107800, 2024YFA1611900, and National Natural Science Foundation of China under grant Nos. 11988101, 12273055, 11927804. This study is also supported by International Partnership Program of Chinese Academy of Sciences Grant No.178GJZ2022040GC. XY and WW acknowledge the support from the China Scholarship Council. CAP acknowledges financial support from the Spanish Ministry MICIU projects PID2020-117493GB-I00 and PID2023-149982NB-I00. This research made use of computing time available on the high-performance computing systems at the Instituto de Astrofisica de Canarias. The authors are thankful for the technical expertise and assistance provided by the Spanish Supercomputing Network (Red Espanola de Supercomputacion), and the staff at the Instituto de Astrofisica de Canarias. CdB acknowledges support from a Beatriz Galindo senior fellowship (BG22/00166) from the Spanish Ministry of Science, Innovation and Universities. This work presents results from the European Space Agency (ESA) space mission Gaia. Gaia data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). The Gaia mission website is https://www.cosmos.esa.int/gaia. The Gaia archive website is https://archives.esac.esa.int/gaia. This job has made use of the Python package GaiaXPy, developed and maintained by members of the Gaia Data Processing and Analysis Consortium (DPAC), and in particular, Coordination Unit 5 (CU5), and the Data Processing Centre located at the Institute of Astronomy, Cambridge, UK (DPCI). Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss4.org. SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics | Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional / MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University. This work made extensive use of TOPCAT (Taylor 2005).

Appendix A Comparisons with Andrae et al. (2023b) and Zhang et al. (2023)

The direct comparison between our results and those from Andrae et al. (2023b) and Zhang et al. (2023) are presented in the first and third rows of Figure A.1. The mean and standard deviation values of ΔTeff, Δ log ɡ, and Δ[M/H] indicate that our results are more similar to those of Zhang et al. (2023) in Teff and log ɡ, but align more closely with Andrae et al. (2023b) for [M/H]. Since machine learning results can exhibit artificially high performance on the training data, as the data is used to optimize model parameters, we use parameters from different surveys as reference points in our comparison. Andrae et al. (2023b) provides atmospheric parameters using XGBoost training on APOGEE. To make a fair comparison, we cross-match our catalog and theirs with LAMOST DR11 low-resolution catalog. With no additional cuts, there are almost 3 million stars in common. We calculate ΔTeff, Δ log ɡ, Δ[M/H] between our results and those from LAMOST, which are presented in the second row of Figure A.1. ΔTeff, Δ log ɡ, Δ[M/H] between Andrae et al. (2023b) and LAMOST are also presented in those panels with different colors. We can see that we have a better estimation for Teff when using the reference Teff from LAMOST. However, our gravity estimation is significantly worse than theirs, and the metallicity from Andrae et al. (2023b) is also better than ours. In the last row of Figure A.1, we use APOGEE DR17 as a reference to calculate ΔTeff, Δ log ɡ, Δ[M/H] between our catalog (or the catalog from Zhang et al. (2023)) with the catalog from APOGEE. From these histograms, using APOGEE as a reference, all three atmospheric parameters from Zhang et al. (2023) are better than ours. In the middle panel of the last row, a spike appears at log ɡ − log ɡAPOGEE ~ 0.35 − 0.40, which is caused by a problematic estimation of log ɡ for low Teff stars. A deeper analysis of the Teff vs. log ɡ − log ɡAPOGEE diagram reveals an overdensity at log ɡ − log ɡAPOGEE ~ 0.35 − 0.40 for low temperature stars (Teff < 4500).

In addtion, we cross-match our catalog and the catalogs from the literature with VMP stars described in Section 2.2. It should be pointed out that Andrae et al. (2023b) uses metal-poor stars to replenish their training sample, which contains many sources in common with our VMP selection. Therefore, comparison between our results and theirs for VMP stars is not fair. Not surprisingly, Andrae et al. (2023b) performs better in such comparison, but our method performs better compared to Zhang et al. (2023), as shown in Figure A.2.

Appendix B Validating our catalog using star clusters

The star clusters are selected primarily at random, ensuring a sufficient number of member candidates to construct the metallicity distribution, while also ensuring that each cluster has a single, well-defined metallicity. Member candidates are from Hunt & Reffert (2024) and Vasiliev & Baumgardt (2021) for OCs and GCs8, respectively. To confirm the membership of stars, we use the membership probability Prob > 0.6 and memberprob = 1 for OCs and GCs, respectively. Similar to Huang et al. (2024b), we select only giants (with our estimated log ɡ < 3.5) in the case of GCs, and fit the metallicity distribution for each cluster, using a Gaussian function after applying 3 − σ clipping. The metallicity distributions for the three OCs and three GCs are shown in Figure B.1. In this figure, we include all the candidates, but highlight the distribution after applying 3 − σ clipping. The number of member stars for each star cluster (after 3 − σ clipping) and Gaussian fit to the clipped distribution are shown in each panel as well.

From each panel in Figure B.1, stars within the star cluster present strong compatibility in [M/H]. The metallicity of OCs is close to 0 but still shows variation among different OCs, while the metallicity of GCs falls in the range −2.3 ~ −1.6. We also compare the metallicity distribution with values from literature in Figure B.1, indicated by the black vertical lines. For OCs, metallicities are adopted from Dias et al. (2021), while for GCs, we use the values from Gonzalez & Wallerstein (1998) for NGC 3201, Castilho et al. (2000) for NGC 6397, and Lee et al. (2005) for NGC 4590.

thumbnail Fig. A.1

Comparison of our catalog with other catalogs from literatures using LAMOST or APOGEE as references. The first row: comparison between our catalog and a catalog from Andrae et al. (2023b), annotated with the means and standard deviations of the differences in Teff, log ɡ, and [M/H]. The second row: histograms in red showing the differences in atmospheric parameters between our catalog and the catalog from LAMOST, while histograms in black showing the differences between the catalog from Andrae et al. (2023b) and the one from LAMOST, with means and standard deviations annotated. The last two rows: similar to the first and second rows, we compare our catalog with the catalog from Zhang et al. (2023), using APOGEE as a reference in those panels in the last row.

thumbnail Fig. A.2

Similar to Figure A.1, Comparison of our catalog with other catalogs from literatures using VMP stars as references.

thumbnail Fig. B.1

The metallicity distribution from our catalog is shown for member stars of three open clusters (NGC 2632, NGC 2516, NGC 752) and three globular clusters (NGC 3201, NGC 6397, NGC 4590). In each panel, we display the metallicity distribution of member stars both before and after applying 3 − σ clipping, depicted as step and filled histograms, respectively, with the number of remaining members annotated in the labels. Gaussian fits are overlaid as blue curves in all panels. The mean and standard deviation of [M/H] for each cluster are also presented in the respective panel. The black vertical lines indicate the metallicities of these star clusters as reported in the literature.

References

  1. Abazajian, K., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2004, AJ, 128, 502 [NASA ADS] [CrossRef] [Google Scholar]
  2. Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [NASA ADS] [CrossRef] [Google Scholar]
  3. Abohalima, A., & Frebel, A. 2018, ApJS, 238, 36 [NASA ADS] [CrossRef] [Google Scholar]
  4. Aguado, D. S., Youakim, K., González Hernández, J. I., et al. 2019, MNRAS, 490, 2241 [NASA ADS] [CrossRef] [Google Scholar]
  5. Aihara, H., Allende Prieto, C., An, D., et al. 2011, ApJS, 193, 29 [NASA ADS] [CrossRef] [Google Scholar]
  6. Allende Prieto, C. 2023, Atoms, 11, 61 [Google Scholar]
  7. Allende Prieto, C., Beers, T. C., Wilhelm, R., et al. 2006, ApJ, 636, 804 [NASA ADS] [CrossRef] [Google Scholar]
  8. Allende Prieto, C., Koesterke, L., Hubeny, I., et al. 2018, A&A, 618, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. An, D., Beers, T. C., & Chiti, A. 2024, ApJS, 272, 20 [NASA ADS] [CrossRef] [Google Scholar]
  10. Andrae, R., Fouesneau, M., Sordo, R., et al. 2023a, A&A, 674, A27 [CrossRef] [EDP Sciences] [Google Scholar]
  11. Andrae, R., Rix, H.-W., & Chandra, V. 2023b, ApJS, 267, 8 [NASA ADS] [CrossRef] [Google Scholar]
  12. Aoki, W., Tominaga, N., Beers, T. C., Honda, S., & Lee, Y. S. 2014, Science, 345, 912 [NASA ADS] [CrossRef] [Google Scholar]
  13. Aoki, W., Li, H., Matsuno, T., et al. 2022, ApJ, 931, 146 [NASA ADS] [CrossRef] [Google Scholar]
  14. Avdeeva, A. S., Kovaleva, D. A., Malkov, O. Y., & Zhao, G. 2024, MNRAS, 527, 7382 [Google Scholar]
  15. Bellazzini, M., Massari, D., De Angeli, F., et al. 2023, A&A, 674, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Belokurov, V., & Kravtsov, A. 2022, MNRAS, 514, 689 [NASA ADS] [CrossRef] [Google Scholar]
  17. Binney, J., & Vasiliev, E. 2024, MNRAS, 527, 1915 [Google Scholar]
  18. Bohlin, R. C., & Lockwood, S. 2022, Update of the STIS CTE Correction Formula for Stellar Spectra, Instrument Science Report STIS 2022-7, 11 [Google Scholar]
  19. Bohlin, R. C., Gordon, K. D., & Tremblay, P. E. 2014, PASP, 126, 711 [NASA ADS] [Google Scholar]
  20. Bohlin, R. C., Deustua, S. E., & de Rosa, G. 2019, AJ, 158, 211 [NASA ADS] [CrossRef] [Google Scholar]
  21. Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506, 150 [NASA ADS] [CrossRef] [Google Scholar]
  22. Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 [Google Scholar]
  23. Carrasco, J. M., Weiler, M., Jordi, C., et al. 2021, A&A, 652, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Castelli, F., Gratton, R. G., & Kurucz, R. L. 1997, A&A, 318, 841 [NASA ADS] [Google Scholar]
  25. Castilho, B. V., Pasquini, L., Allen, D. M., Barbuy, B., & Molaro, P. 2000, A&A, 361, 92 [NASA ADS] [Google Scholar]
  26. Chandra, V., Semenov, V. A., Rix, H.-W., et al. 2024, ApJ, 972, 112 [Google Scholar]
  27. Chen, B., Hayden, M. R., Sharma, S., et al. 2023, MNRAS, 523, 3791 [NASA ADS] [CrossRef] [Google Scholar]
  28. Chiti, A., Frebel, A., Mardini, M. K., et al. 2021, ApJS, 254, 31 [NASA ADS] [CrossRef] [Google Scholar]
  29. Coelho, P., Barbuy, B., Meléndez, J., Schiavon, R. P., & Castilho, B. V. 2005, A&A, 443, 735 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Conroy, C., Weinberg, D. H., Naidu, R. P., et al. 2022, OJAp, submitted [arXiv:2204.02989] [Google Scholar]
  31. Cooper, A. P., Koposov, S. E., Allende Prieto, C., et al. 2023, ApJ, 947, 37 [NASA ADS] [CrossRef] [Google Scholar]
  32. Cropper, M., Katz, D., Sartoretti, P., et al. 2018, A&A, 616, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]
  34. De Angeli, F., Weiler, M., Montegriffo, P., et al. 2023, A&A, 674, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. de Jong, R. S., Agertz, O., Berbel, A. A., et al. 2019, The Messenger, 175, 3 [NASA ADS] [Google Scholar]
  36. De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [Google Scholar]
  37. Deason, A. J., & Belokurov, V. 2024, New A Rev., 99, 101706 [NASA ADS] [CrossRef] [Google Scholar]
  38. Dias, W. S., Monteiro, H., Moitinho, A., et al. 2021, MNRAS, 504, 356 [NASA ADS] [CrossRef] [Google Scholar]
  39. Fallows, C. P., & Sanders, J. L. 2024, MNRAS, 531, 2126 [CrossRef] [Google Scholar]
  40. Gaia Collaboration (Brown, A. G. A., et al.) 2016, A&A, 595, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Gaia Collaboration (Montegriffo, P., et al.) 2023a, A&A, 674, A33 [CrossRef] [EDP Sciences] [Google Scholar]
  42. Gaia Collaboration (Vallenari, A., et al.) 2023b, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [Google Scholar]
  44. Gonzalez, G., & Wallerstein, G. 1998, AJ, 116, 765 [NASA ADS] [CrossRef] [Google Scholar]
  45. Green, G. M. 2018, J. Open Source Softw., 3, 695 [Google Scholar]
  46. Gregg, M. D., Silva, D., Rayner, J., et al. 2006, in The 2005 HST Calibration Workshop: Hubble After the Transition to Two-Gyro Mode, eds. A. M. Koekemoer, P. Goudfrooij, & L. L. Dressel, 209 [Google Scholar]
  47. Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Hattori, K. 2024, arXiv e-prints, [arXiv:2404.01269] [Google Scholar]
  49. Heger, A., & Woosley, S. E. 2010, ApJ, 724, 341 [Google Scholar]
  50. Huang, B., Yuan, H., Xiang, M., et al. 2024a, ApJS, 271, 13 [NASA ADS] [CrossRef] [Google Scholar]
  51. Huang, B., Yuan, H., Xu, S., et al. 2024b, ApJS, submitted [arXiv:2410.19895] [Google Scholar]
  52. Hunt, E. L., & Reffert, S. 2024, A&A, 686, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Husser, T. O., Wende-von Berg, S., Dreizler, S., et al. 2013, A&A, 553, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  54. Imig, J., Price, C., Holtzman, J. A., et al. 2023, ApJ, 954, 124 [CrossRef] [Google Scholar]
  55. Jin, S., Trager, S. C., Dalton, G. B., et al. 2024, MNRAS, 530, 2688 [NASA ADS] [CrossRef] [Google Scholar]
  56. Klessen, R. S., & Glover, S. C. O. 2023, ARA&A, 61, 65 [NASA ADS] [CrossRef] [Google Scholar]
  57. Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, arXiv e-prints [arXiv:1711.03234] [Google Scholar]
  58. Komiya, Y., Habe, A., Suda, T., & Fujimoto, M. Y. 2010, ApJ, 717, 542 [NASA ADS] [CrossRef] [Google Scholar]
  59. Koutsouridou, I., Salvadori, S., Skúladóttir, Á., et al. 2023, MNRAS, 525, 190 [NASA ADS] [CrossRef] [Google Scholar]
  60. Laroche, A., & Speagle, J. S. 2024, ApJ, submitted [arXiv:2404.07316] [Google Scholar]
  61. Lee, J.-W., Carney, B. W., & Habgood, M. J. 2005, AJ, 129, 251 [NASA ADS] [CrossRef] [Google Scholar]
  62. Lejeune, T., Cuisinier, F., & Buser, R. 1998, A&AS, 130, 65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  63. Leung, H. W., & Bovy, J. 2019, MNRAS, 483, 3255 [NASA ADS] [Google Scholar]
  64. Leung, H. W., & Bovy, J. 2024, MNRAS, 527, 1494 [Google Scholar]
  65. Li, X., & Lin, B. 2023, MNRAS, 521, 6354 [NASA ADS] [CrossRef] [Google Scholar]
  66. Li, H., Aoki, W., Matsuno, T., et al. 2022a, ApJ, 931, 147 [NASA ADS] [CrossRef] [Google Scholar]
  67. Li, Z., Zhao, G., Chen, Y., Liang, X., & Zhao, J. 2022b, MNRAS, 517, 4875 [NASA ADS] [CrossRef] [Google Scholar]
  68. Li, J., Wong, K. W. K., Hogg, D. W., Rix, H.-W., & Chandra, V. 2024, ApJS, 272, 2 [NASA ADS] [CrossRef] [Google Scholar]
  69. Lian, J., Zasowski, G., Mackereth, T., et al. 2022, MNRAS, 513, 4130 [NASA ADS] [CrossRef] [Google Scholar]
  70. Luo, A. L., Zhang, H.-T., Zhao, Y.-H., et al. 2012, Res. Astron. Astrophys., 12, 1243 [CrossRef] [Google Scholar]
  71. Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [NASA ADS] [CrossRef] [Google Scholar]
  72. Mardini, M. K., Frebel, A., & Chiti, A. 2024, MNRAS, 529, L60 [Google Scholar]
  73. Martin, N. F., Starkenburg, E., Yuan, Z., et al. 2024, A&A, 692, A115 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  74. Montegriffo, P., De Angeli, F., Andrae, R., et al. 2023, A&A, 674, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48 [Google Scholar]
  76. Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16 [NASA ADS] [CrossRef] [Google Scholar]
  77. Nomoto, K., Tominaga, N., Umeda, H., Kobayashi, C., & Maeda, K. 2006, Nucl. Phys. A, 777, 424 [CrossRef] [Google Scholar]
  78. Nomoto, K., Kobayashi, C., & Tominaga, N. 2013, ARA&A, 51, 457 [CrossRef] [Google Scholar]
  79. Onken, C. A., Wolf, C., Bessell, M. S., et al. 2024, PASA, 41, e061 [NASA ADS] [CrossRef] [Google Scholar]
  80. Pal, T., Khan, I., Worthey, G., Gregg, M. D., & Silva, D. R. 2023, ApJS, 266, 41 [NASA ADS] [CrossRef] [Google Scholar]
  81. Paszke, A., Gross, S., Chintala, S., et al. 2017, in NIPS 2017 Workshop on Autodiff [Google Scholar]
  82. Paszke, A., Gross, S., Massa, F., et al. 2019, in Advances in Neural Information Processing Systems, 32, eds. H. Wallach, H. Larochelle, A. Beygelzimer, F. d’Alché-Buc, E. Fox, & R. Garnett (Curran Associates, Inc.) [Google Scholar]
  83. Powell, M. J. D. 2002, Math. Program., 92, 555 [CrossRef] [Google Scholar]
  84. Rix, H.-W., Chandra, V., Andrae, R., et al. 2022, ApJ, 941, 45 [NASA ADS] [CrossRef] [Google Scholar]
  85. Salvadori, S., Schneider, R., & Ferrara, A. 2007, MNRAS, 381, 647 [NASA ADS] [CrossRef] [Google Scholar]
  86. Sanders, J. L., & Matsunaga, N. 2023, MNRAS, 521, 2745 [NASA ADS] [CrossRef] [Google Scholar]
  87. Sarmento, R., Scannapieco, E., & Côté, B. 2019, ApJ, 871, 206 [CrossRef] [Google Scholar]
  88. Sartoretti, P., Katz, D., Cropper, M., et al. 2018, A&A, 616, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Schlafly, E. F., & Finkbeiner, D. P. 2011, ApJ, 737, 103 [Google Scholar]
  90. Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
  91. Schönrich, R., & Binney, J. 2009, MNRAS, 399, 1145 [Google Scholar]
  92. Sharma, S., Hayden, M. R., & Bland-Hawthorn, J. 2021, MNRAS, 507, 5882 [NASA ADS] [CrossRef] [Google Scholar]
  93. Tarumi, Y., Hartwig, T., & Magg, M. 2020, ApJ, 897, 58 [NASA ADS] [CrossRef] [Google Scholar]
  94. Taylor, M. B. 2005, in Astronomical Society of the Pacific Conference Series, 347, Astronomical Data Analysis Software and Systems XIV, eds. P. Shopbell, M. Britton, & R. Ebert, 29 [NASA ADS] [Google Scholar]
  95. Ting, Y.-S., Rix, H.-W., Conroy, C., Ho, A. Y. Q., & Lin, J. 2017, ApJ, 849, L9 [NASA ADS] [CrossRef] [Google Scholar]
  96. Ting, Y.-S., Conroy, C., Rix, H.-W., & Cargile, P. 2019, ApJ, 879, 69 [Google Scholar]
  97. Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978 [NASA ADS] [CrossRef] [Google Scholar]
  98. Weiler, M., Carrasco, J. M., Fabricius, C., & Jordi, C. 2023, A&A, 671, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001 [NASA ADS] [CrossRef] [Google Scholar]
  100. Witten, C. E. C., Aguado, D. S., Sanders, J. L., et al. 2022, MNRAS, 516, 3254 [NASA ADS] [CrossRef] [Google Scholar]
  101. Xiang, M., Ting, Y.-S., Rix, H.-W., et al. 2019, ApJS, 245, 34 [Google Scholar]
  102. Xiang, M., Rix, H.-W., Ting, Y.-S., et al. 2022, A&A, 662, A66 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  103. Xiao, K., Yuan, H., Huang, B., et al. 2023a, ApJS, 268, 53 [NASA ADS] [CrossRef] [Google Scholar]
  104. Xiao, K., Yuan, H., López-Sanjuan, C., et al. 2023b, ApJS, 269, 58 [NASA ADS] [CrossRef] [Google Scholar]
  105. Xylakis-Dornbusch, T., Christlieb, N., Hansen, T. T., et al. 2024, A&A, 687, A177 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  106. Yamada, S., Suda, T., Komiya, Y., Aoki, W., & Fujimoto, M. Y. 2013, MNRAS, 436, 1362 [Google Scholar]
  107. Yan, H., Li, H., Wang, S., et al. 2022, The Innovation, 3, 100224 [NASA ADS] [CrossRef] [Google Scholar]
  108. Yao, Y., Ji, A. P., Koposov, S. E., & Limberg, G. 2024, MNRAS, 527, 10937 [Google Scholar]
  109. York, D. G., Adelman, J., Anderson, John E. J., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]
  110. Youakim, K., Starkenburg, E., Martin, N. F., et al. 2020, MNRAS, 492, 4986 [CrossRef] [Google Scholar]
  111. Zhang, X., Green, G. M., & Rix, H.-W. 2023, MNRAS, 524, 1855 [NASA ADS] [CrossRef] [Google Scholar]
  112. Zhao, G., Chen, Y.-Q., Shi, J.-R., et al. 2006, Chinese J. Astron. Astrophys., 6, 265 [NASA ADS] [CrossRef] [Google Scholar]
  113. Zhao, G., Zhao, Y.-H., Chu, Y.-Q., Jing, Y.-P., & Deng, L.-C. 2012, Res. Astron. Astrophys., 12, 723 [CrossRef] [Google Scholar]

1

High Efficiency and Resolution Multi Element Spectrograph.

2

William Herschell Telescope.

3

https://gaiaxpy.readthedocs.io/en/latest/cite.html, DOI v2.1.0: 10.5281/zenodo.8239995.

All Figures

thumbnail Fig. 1

Density distribution of residuals as a function of wavelength for stars with different parameters. Subfigures (a), (b), and (c) are results of stars in bins of the same magnitude range 13 < G0 < 15 but different stellar colors. Subfigures (b), (d), and (e) show the density distribution of residuals for stars within the same range of color 1.0 < (GBPGRP)0 < 1.2 but different G0. Subfigure (f) shows the residuals of stars of the same color-magnitude as subfigures (b) but in different Galactic latitude (|b| < 10°). The black dash lines represent the P50 percentiles distributions, which can be viewed as a robust representation of the patterns of wiggles. From the P50 lines we can see that the wiggles change with (GBPGRP)0 and G0 , but the stellar color has a larger impact than the magnitude. Stars of low latitude have a much more diffuse distribution of ∆Flux. The large uncertainties of the extinction map towards the Galactic disc may cause a bad extinction correction of FluxXP for some stars. Therefore, FluxXP is more likely to deviate from the fitting result Fluxfitting for stars at lower latitude.

In the text
thumbnail Fig. 2

Density distribution of residuals as a function of wavelength for VMP stars. Top panel shows the density distribution of the residuals of VMP stars with 1.0 < GBPGRP < 1.2 and 13 < G < 15. Compared to the relatively more metal-rich sample in Figure 1, their wiggles have a smaller amplitude with Max (|P50|) < 0.08. The bottom panel shows the difference of P50 between these two samples. The main difference is in the blue band of λ < 6500 Å, while the wiggles in the red band is little affected by the change of metallicity.

In the text
thumbnail Fig. 3

Diagram of our NN model based on Pytorch.

In the text
thumbnail Fig. 4

Comparison between the real patterns from TAS (20% of the whole sample, top panel) and the NN model predicted patterns (middle panel). The bottom panel shows the difference between the real and predicted patterns, from which we can see that the wiggles disappear.

In the text
thumbnail Fig. 5

Testing on atmospheric parameters using sampling Sconst. Top panels: comparison of Teff, logg and [M/H] estimated from original XP spectra (Y -axis) and APOGEE survey (X-axis). Middle panels: similar to the top panels, but with Y-axis replaced by the results from corrected XP spectra. The color bar in each subfigure displays the number density. Bottom panels: histograms present the differences of Teff, log ɡ and [M/H] between XP and APOGEE before and after correcting the systematic patterns. The mean values and standard deviations are shown in the labels.

In the text
thumbnail Fig. 6

Histograms present log102) and RMS between the XP spectra and model spectra for the IAS data set before and after correcting the pattern, indicating by blue and orange, respectively.

In the text
thumbnail Fig. 7

Distribution of RMS between CALSPEC and XP spectra. Left panels: histograms present RMS between the XP spectra and the spectra from CALSPEC libraries before and after correcting patterns, indicating by blue and orange, respectively. The green histogram shows the corresponding results obtained using the Python package from Huang et al. (2024a) to correct the XP spectra. Right panels: similar to those panels in the left, but with stars that have passed the quality cuts applied in Section 5.1. The red dashed lines represent the median values, which are also indicated by the number displayed in each panel.

In the text
thumbnail Fig. 8

Examples of how pattern correction can help improve the fitting between the XP spectra and the CALSPEC. Three spectra with different temperatures are presented in the sub-panels, from top to bottom: BD+54 1216, HD 115169, and KF06T2 in CALSPEC. In each panel, the smoothed CALSPEC spectrum is depicted as a bold gray line, while the XP spectra, with and without correcting patterns, are shown in blue and orange, respectively. The green lines show the corrected spectra using the package provided by Huang et al. (2024a). To enhance readability, we include a few zoomed-in diagrams in each panel to highlight the improvements on spectrophotometry. The ∆Flux between XP and CAL- SPEC are also presented at the bottom of each panel.

In the text
thumbnail Fig. 9

Comparison of atmospheric parameters Teff, log ɡ, and [M/H] between our catalog and LAMOST DR11 low-resolution catalog.

In the text
thumbnail Fig. A.1

Comparison of our catalog with other catalogs from literatures using LAMOST or APOGEE as references. The first row: comparison between our catalog and a catalog from Andrae et al. (2023b), annotated with the means and standard deviations of the differences in Teff, log ɡ, and [M/H]. The second row: histograms in red showing the differences in atmospheric parameters between our catalog and the catalog from LAMOST, while histograms in black showing the differences between the catalog from Andrae et al. (2023b) and the one from LAMOST, with means and standard deviations annotated. The last two rows: similar to the first and second rows, we compare our catalog with the catalog from Zhang et al. (2023), using APOGEE as a reference in those panels in the last row.

In the text
thumbnail Fig. A.2

Similar to Figure A.1, Comparison of our catalog with other catalogs from literatures using VMP stars as references.

In the text
thumbnail Fig. B.1

The metallicity distribution from our catalog is shown for member stars of three open clusters (NGC 2632, NGC 2516, NGC 752) and three globular clusters (NGC 3201, NGC 6397, NGC 4590). In each panel, we display the metallicity distribution of member stars both before and after applying 3 − σ clipping, depicted as step and filled histograms, respectively, with the number of remaining members annotated in the labels. Gaussian fits are overlaid as blue curves in all panels. The mean and standard deviation of [M/H] for each cluster are also presented in the respective panel. The black vertical lines indicate the metallicities of these star clusters as reported in the literature.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.