Open Access
Issue
A&A
Volume 693, January 2025
Article Number A306
Number of page(s) 23
Section Catalogs and data
DOI https://doi.org/10.1051/0004-6361/202451491
Published online 28 January 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

Over the last century, a worldwide campaign involving groundbased observatories has allowed the accumulation of a vast and heterogeneous dataset, comprising chemical abundance measurements for approximately two million stars. However, this dataset exhibits significant spatial heterogeneity due to fragmented sky coverage (Recio-Blanco et al. 2023). This scenario was revolutionized by the third Gaia data release (Gaia DR3 - Gaia Collaboration 2021.), which contains the parametrization of radial velocity spectrometer (RVS) data performed by the general stellar parametrizer-spectroscopy module, GSP-Spec. This module estimates the chemo-physical parameters from combined RVS spectra of single stars, without additional inputs from astrometric, photometric, or spectro-photometric BP/RP data (e.g., Gaia Collaboration 2018, 2021). Individual chemical abundances of N, Mg, Si, S, Ca, Ti, Cr, Fe I, Fe II, Ni, Zr, Ce, and Nd have recently been derived from Gaia data (Recio-Blanco et al. 2023). With about 5.6 million stars, the Gaia DR3 GSP-Spec all-sky catalog is the most extensive compilation of stellar chemo-physical parameters ever published and the first drawn from space-based data. The inclusion of Gaia’s extensive spectral capabilities highlights the importance of these methods as a complementary approach, improving the accuracy and broadening the scope of stellar chemical abundance studies. The homogeneity and quality of the estimated parameters enable chemo-dynamical studies of Galactic stellar populations, interstellar extinction studies from individual spectra, and explicit constraints on stellar evolution models.

Estimating stellar parameters and abundances using photometry represents a fundamental frontier in contemporary astrophysics. The technological advances, unprecedented data availability, and the growing need for comprehensive knowledge of stellar populations drive this scenario. While spectroscopic surveys, exemplified by instruments such as the Large Sky Area Multi-Object Fiber Spectroscopy Telescope (LAMOST; Cui et al. 2012), are capable of gathering around 4000 spectra in a single exposure, photometric surveys provide a comprehensive view of the universe, capturing the luminous signatures of millions of astrophysical objects in the sky. Photometric surveys maintain a significantly higher observational efficiency, yielding more data in less time. This abundance of data offers the potential to estimate stellar parameters for a vast number of stars, as highlighted by recent surveys of critical astronomical phenomena. For instance, Gore et al. (2024) emphasized the importance of targeting M and K stars known to host exoplanets. Other examples of investigations highlighting the synergy between spectroscopic observations and photometric analyses include Borucki et al. (2010), who precisely calculated stellar metallicities and parameters, including systems with planets identified by missions such as Kepler, COnvection, internal ROtation and transiting planets (CoRoT Baglin et al. 2007; Auvergne et al. 2009) and Transiting Exoplanet Survey Satellite (TESS) (e.g., Fischer & Valenti 2005; Huber et al. 2013; Buchhave et al. 2014; Buchhave & Latham 2015). These efforts to combine spectroscopic and photometric observations allow us to confirm the presence of exoplanets and accurately determine stellar and exoplanet parameters.

Schonhut-Stasik et al. (2024) highlighted the importance of estimating stellar parameters in large samples. By combining spectroscopic observations from the Apache Point Observatory Galactic Evolution Experiment (APOGEE; Wilson et al. 2019) with asteroseismology from Kepler 2 (K2– Howell et al. 2014), the fundamental stellar properties for approximately 7500 evolved stars were obtained. This comprehensive catalog furnishes temperatures, metallicities, and global asteroseismic parameters, shedding light on stellar evolution and Galactic dynamics. It includes the kinematic properties of low-metallicity stars, highlighting their potential role in Galactic mergers as well as their critical role in the investigation of past Galactic mergers that gave rise to the present-day Galaxy (Yates et al. 2012). Characterizing the selection function for the APO-K2 sample allows robust statistical inferences, highlighting the significance of large-scale surveys in advancing our understanding of the universe (Fallows & Sanders 2022).

On the other hand, large-scale ground-based photometric surveys, including Sloan Digital Sky Survey (SDSS, York et al. 2000), SkyMapper Southern Survey (SMSS, Keller et al. 2007), UKIRT Infrared Deep Sky Survey (UKIDSS Lawrence et al. 2007), All-Sky Automated Survey for Supernovae (ASAS-SN, Kochanek et al. 2017), Stellar Abundance and Galactic Evolution (SAGE, Gordon et al. 2011), Javalambre-Physics of the Accelerated Universe Astrophysical Survey (J-PAS, Benitez et al. 2014; Bonoli et al. 2021), Panoramic Survey Telescope and Rapid Response System 1 survey (Pan-STARRS1, Chambers et al. 2016), and Dark Energy Survey (DES, DESI Collaboration 2016). As the first all-sky survey, the SDSS has set a benchmark in astrostatistics and the astro-photosurvey era, offering a comprehensive database that has been crucial for numerous studies. Its contributions range from mapping the large-scale structure of the universe to identifying and classifying a myriad of celestial objects (e.g., Eisenstein et al. 2011). The Javalambre Photometric Local Universe Survey (J-PLUS, Cenarro et al. 2019) and the Southern Photometric Local Universe Survey (S-PLUS, Mendes de Oliveira et al. 2019) now provide vast amounts of valuable photometric data for millions of astronomical objects. Characterized by the use of medium- and narrowband filters optimized for precise measurements of key stellar properties, these surveys herald a new era in the determination of stellar types, surface gravity (log g), effective temperatures (Teff), and metallicity indicators such as [M/H] and [Fe/H] (e.g., Árnadóttir et al. 2010; Bailer-Jones et al. 2013; Placco et al. 2022; Bellazzini et al. 2023).

Several empirical and theoretical approaches have emerged to extract stellar labels from photometric data, fueled by the wealth of high-quality spectroscopic datasets from surveys like the Medium-Resolution LAMOST, APOGEE, and the Galactic Archaeology with HERMES (GALAH, Sheinis et al. 2015). These methods, ranging from empirical correlations of metallicity-dependent stellar parameter to machine learning algorithms, now provide stellar labels that can be accurately determined and combined with the precision of spectroscopy for high-quality photometry. Examples include using stellar loci in color-magnitude diagrams and applying machine learning to large photometric datasets (e.g., Fallows & Sanders 2022; Bailer-Jones et al. 2019).

For instance, the estimated elemental abundances in the current work can be utilized to undertake a more nuanced exploration of the impact of narrowband filters in identifying metal-poor stars (e.g., Almeida-Fernandes et al. 2023; Placco et al. 2021, 2022, 2023). The utilization of these filters has significantly enhanced the precision in locating these rare stellar specimens, representing a notable advancement in astronomical research. For instance, the groundbreaking discovery reported by Placco et al. (2021) exemplifies the efficacy of narrowband photometry in detecting ultra-metal-poor star candidates. Their findings, initially made possible through the analysis of data from the S-PLUS Data Release 1, underscore the potential of such surveys in identifying targets with low metallicity. This achievement not only validates the effectiveness of narrowband photometric techniques but also charts new pathways for exploration in astrophysics. Building on this, we now delve into specific catalogs and studies that illustrate the impact of narrowband filters in identifying metal-poor stars. For example, the research conducted by Galarza et al. (2022), Placco et al. (2022), and Placco et al. (2023) offers valuable insights into how the application of narrowband filters has contributed to the identification and characterization of these stellar populations. In a parallel study, Whitten et al. (2021) provided photometric estimates of key stellar parameters for more than 700 000 stars within S-PLUS Data Release 2, identifying 364 candidate carbon-enhanced and metal-poor stars. This further emphasizes the significance of narrowband photometry in delineating specific stellar populations within large datasets.

The S-PLUS narrowband filters are strategically centered around important spectral features: [O II] for J0378; Ca H + K for J0395; Hδ for J0410; CH G-band for J0430; Mgb triplet for J0515; Hα for J0660; and Ca triplet for J0861. For instance, the Ca line, measured by the J0395 filter, is particularly sensitive to metallicity. In this wavelength, narrowband photometry around this feature is typically employed by surveys searching for metal-poor stars (Keller et al. 2007; Starkenburg et al. 2017). Similarly, the Mgb triplet, measured by the J0515 filter, is well known for its surface gravity sensitivity Geisler (1984); Majewski et al. (2000), and narrowband photometry has been used by surveys such as APOGEE to search for giant stars Zasowski et al. (2013); Majewski et al. (2016). Through a combination of broad- and narrowband filters that serve to identify the main stellar spectral features (absorption lines and continuum), this photometric system was designed for the optimal classification of stars (Gruel et al. 2012; Marín-Franch et al. 2012).

Recently, Almeida-Fernandes et al. (2023) explored the chemodynamical properties and ages of 522 metal-poor candidates, initially selected by Placco et al. (2022) from the S-PLUS DR3. As noted in Placco et al. (2022), approximately 92% of these stars were confirmed to be metal-poor based on mediumresolution spectroscopic analysis. Together, these studies highlight the crucial role of spectroscopic estimates in identifying precise targets for more detailed investigations.

Gutiérrez-Soto et al. (2020) leveraged the strong Hα emission lines characteristic of Planetary Nebulae (PNe) to explore the S-PLUS survey within the context of PNe. Similarly, Fabiano de Souza et al. (2024) examined age and metallicity determinations of the Small Magellanic Cloud, demonstrating a preference for younger clusters in the center and older ones predominantly located in outer regions. Additionally, Quispe-Huaynasi et al. (2024) achieved highly accurate estimates of magnesium abundances for approximately 64 high-velocity stars, exemplifying the vast potential of spectral parameter determination from multi-wavelength data. Yang et al. (2022) used J-PLUS data to determine metallicity ([Fe/H]), [α/Fe], and the abundances of four elements ([C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]). Similar investigations using S-PLUS data are currently absent.

In this research, we propose a methodology to extract atmospheric parameters directly from S-PLUS photometric data. By utilizing all possible S-PLUS color combinations (66), we conducted estimations based on three distinct training samples obtained through the LAMOST, APOGEE, and GALAH surveys. The analysis employed the cost-sensitive neural network (NN, Whitten et al. 2019; Ksoll et al. 2020; Whitten et al. 2021; Yang et al. 2022) and Random Forest (e.g., Breiman 2001) methods. Figure 1 exhibits the regions in the sky where these surveys and their respective data releases overlap. We use this overlapping data to build predictive models using deep and machine learning algorithms.

The following sections of this paper will delve into database construction (see Sect. 2), detailing the acquisition of training data. Section 3 will outline the methodology and constraints employed to enhance outputs. Next, Sect. 4 will present the primary findings and provide a catalog description of the sources studied in this paper. In Sect. 5, we present our final remarks.

thumbnail Fig. 1

Sky footprints of S-PLUS DR4, APOGEE, GALAH DR3, and LAMOST MRS DR8 surveys across the celestial sphere. Each survey’s distinctive coverage and observed regions are depicted, along with their overlapping regions.

2 Data construction

S-PLUS observes the sky in 12 filters covering the visible spectrum from 3000 Å to 10 000 Å, comprising five wide bands (ugriz), akin to those used in the Sloan Digital Sky Survey (SDSS – York et al. 2000), and seven narrowbands (J0378, J0395, J0410, J0430, J0515, J0660, and J0861) unique to the Javalambre system (Marín-Franch et al. 2012). Figure 2 illustrates spectral energy distributions (SEDs) constructed from S-PLUS photometry, alongside SDSS spectra, for both a dwarf star and a giant star. The S-PLUS survey, conducted with the T80-South (T80S) telescope, a Ritchey-Chrétien Cassegrain telescope with an 86 cm primary mirror, located at the InterAmerican Observatory of Cerro Tololo, Chile. It uses the T80SCam camera, which features a detector with dimensions of 9200 × 9200 pixels, a plate scale of 0.55 arcsec/pixel, and covering a field of view of 1.4 × 1.4 degrees (Mendes de Oliveira et al. 2019).

The starting point for our analysis was the S-PLUS DR4 (Herpich et al. 2024), which provides both aperture and point spread function (PSF) photometries for objects in a 3022.7 sq deg patch of sky visible from the Southern Hemisphere in 12 different optical bands. We utilized aperture photometry measured in a 3-arcsec aperture and corrected for the amount of missing flux (referred to as PStotal magnitudes in the S-PLUS survey, Almeida-Fernandes et al. 2022) for all cases when this aperture is available. In more crowded regions, such as those at low Galactic latitudes or in the central areas of the Magellanic Clouds, PSF photometry was preferred to account for blending and contamination issues.

We used PSF photometry, which is less affected by the contamination of neighboring sources. Furthermore, we imposed a sample selection criterion, restricting our dataset to sources having measurements in all 12 available filters, with a signal- to-noise ratio (S/N) greater than 5 and error bars smaller than 0.1. These selection criteria serve two purposes: they enable us to retain more spectral information and establish the 66 S- PLUS colors, while also ensuring the reliability of photometric measurements. Subsequently, we performed a crossmatch with Gaia Data Release 3 (Gaia DR3 - Gaia Collaboration 2021) to obtain preliminary estimates of parameters such as log 𝑔 and Teff. This resulted in an initial sample of approximately five million sources.

We corrected the magnitudes for interstellar medium extinctions using the maps from Schlegel et al. (1998, commonly referred to as SFD) for the source of the EBVSFD$E_{B - V}^{{\rm{SFD}}}$ excess-color parameter, while also adopting the correction proposed by Schlafly & Finkbeiner (2011) for this extinction map: EBVadopted =0.86×EBVSFD$E_{B - V}^{{\rm{adopted }}} = 0.86 \times E_{B - V}^{{\rm{SFD}}}$. We utilized the extinction coefficients Aλ/AV provided by the Spanish Virtual Observatory Filter Profile Ser- vice1 for the S-PLUS filters and adopted AV/EB−V = 3.1.

Yang et al. (2022) employed a similar methodology to estimate a comprehensive set of stellar parameters, including Teff, surface gravity (log 𝑔), and metallicity ([Fe/H]), as well as [α/Fe] and four elemental abundances ([C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]). They utilized data from the first data release (DR1) of J-PLUS, integrating it with Gaia DR2 information and spectroscopic labels from LAMOST to train a series of cost-sensitive neural networks. Furthermore, the authors conducted a thorough comparison of their parameter and abundance estimates with those derived from other spectroscopic catalogs such as APOGEE and GALAH. The results demonstrated an overall agreement between their estimates and those from the aforementioned catalogs. We similarly adhere to the prescriptions outlined by Yang et al. (2022).

In this study, we adopted a cost-sensitive NN approach proposed by Yang et al. (2022), which consists of a combination of a traditional seven-layer feed-forward NN architecture and a 2D cost-sensitive learning algorithm to estimate stellar parameters and chemical abundances. The input (first) layer imports the stellar colors, the following five hidden layers extract nonlinear features from these colors, and finally, the output layer generates a stellar label through a weighted sum of the features learned in the last hidden layer. The cost-sensitive learning algorithm used to train the NN achieves better measurement precision for less frequent target values in an imbalanced training set. By adjusting the cost function to assign higher weights to less frequent cases, this approach mitigates the bias. This adjustment enables the neural network to learn more effectively, thereby enhancing its ability to accurately predict these minority cases.

The separation between giant and dwarf stars in our training sample was based on the log 𝑔 and Teff values obtained from the Gaia Collaboration (2021, Gaia DR3). We followed the selection criteria proposed by Thomas et al. (2019) for the log 𝑔 vs Teff distribution (Kiel diagram) to differentiate between dwarfs and giants. For the selection of giants, Thomas et al. (2019) proposed a range of 5000 ≤ Teff ≤ 6000 and 3.2 ≤ log 𝑔 ≤ 3.9, based on a machine learning algorithm trained on multiband photometric and spectroscopic data, applicable to southern hemisphere targets. Their work did not consider the subgiant class, and the specified limits exclude the majority of asymptotic giant branch (AGB) stars. To include more AGB stars, red giant branch stars (RGB), and subgiants in our giant selection and to further distinguish a supergiant class, we adjusted the limits as: for giants: 3000 ≤ Teff ≤ 7000 and 1.8 ≤ log 𝑔 ≤ 3.9; for supergiants: 3000 ≤ Teff ≤ 7000 and log 𝑔 ≤ 1.8 (see Fig. 4).

Defining precise boundaries between dwarfs, giants, and supergiants on the Kiel diagram presents significant challenges, particularly when accounting for uncertainties in log 𝑔, which tend to obscure the delineations between these stellar classifications. The inclusion of a supergiant category further complicates the classification algorithm and does not resolve the inherent ambiguity among these stellar classes, a challenge also noted by Thomas et al. (2019). Therefore, for model training and subsequent analysis, we have simplified the classification into two primary categories: dwarfs and giants. Within this framework, the giants category encompasses supergiants, RGB stars, AGB stars, and subgiants, if present. This classification criterion has been applied consistently across all three training sample sets. Table 1 displays the number of sources and the limits for each parameter used in our training sets, categorized by survey type for both dwarfs and giants. The GALAH data offer the widest range of parameters for both dwarf and giant class, while the APOGEE dataset includes a highest number of giant stars among all the three surveys. On the other hand, the LAMOST survey has the smallest number of metal-poor stars, providing a limited sample for estimating stellar parameters and elemental abundances for these stars (see Table 2).

The initial sample was subjected to cross-matching with LAMOST at medium-resolution, APOGEE, and GALAH. LAMOST uses fibers with a diameter of 3.3 arcseconds, achieving a spectral resolving power of approximately R ≃ 7500. APOGEE, on the other hand, utilizes fibers with a sky diameter of 2.0 arcseconds and offers a higher spectral resolving power, approximately R ≃ 22 500. Meanwhile, GALAH utilizes fibers with a diameter of 2.1 arcseconds, providing a spectral resolution range from approximately R ≃ 20 000 to R ≃ 50 000. As a note of caution, we emphasize that the LAMOST data used here is limited to the medium-resolution spectra.

It is important to note that machine learning techniques may not successfully recover all elemental abundances from the LAMOST, GALAH, and APOGEE surveys if those abundances do not significantly impact the photometric data obtained from the 12 S-PLUS filters. The S-PLUS filters provide 66 photometric colors, capturing a broad range of elemental abundance information that may not be immediately visible. Our initial hypothesis is that the S-PLUS colors retain significant information about these abundances, particularly those detected by the narrowband filters. Nevertheless, these techniques have the potential to uncover intricate correlations that may not be immediately apparent. It is crucial to exercise caution when interpreting elemental abundances that lack corresponding features in S-PLUS filters. However, retaining parameters that cannot be determined directly from the input data enables us to thoroughly assess potential biases and explore unexpected correlations identified by our analytical approach. A detailed discussion of the parameters used in this work is provided in Sect. 4.

To reduce contamination due to fiber size, we performed cross-matching individually between the S-PLUS data and each survey’s catalog, LAMOST, APOGEE, and GALAH, using a search radius of 2 arcseconds. For each survey, we accounted for the specific fiber size by removing crossmatched sources that had multiple counterparts within the fiber aperture. This approach allowed us to handle potential overlaps and ensure that only the most accurate matches were considered. The accuracy of these surveys varies significantly due to differences in resolution and fiber size. Therefore, we chose not to combine all the data acquired from these crossmatches, allowing for the consideration of any divergences among them in further studies. According to Hegedűs et al. (2023), the parameters from APOGEE and GALAH surveys are generally in agreement within their respective uncertainties. However, offsets larger than 0.1 dex in the absolute abundance scales were observed in certain regimes. Therefore, when combining the data, it is crucial to consider the peculiarities of each survey. Moreover, by independently estimating the parameters using three distinct datasets, we can effectively compare the results, which, in turn, aids in validating and enhancing our analytical methodologies.

The cross-matching resulted in the identification of numerous sources, designated as ~2877, ~5916, and ~573, in the APOGEE, GALAH, and LAMOST datasets, respectively. It is important to note that the number of valid values for each spectroscopy parameter may exhibit variations due to data availability and completeness. To ensure data quality, a minimum S/N of 20 was adopted in the GALAH, LAMOST, and APOGEE datasets. Our parameter estimation approach was carried out separately for each survey, enabling a comparative assessment of the goodness-of-fit of our estimations. This separation led to the creation of dedicated matrices for each parameter, thereby maximizing the number of elements for our analysis. This approach acknowledges the complexities arising from the unique characteristics of each survey, ensuring a more accurate and robust analysis tailored to the specific attributes of the data.

In Figure 3, we present an overview of the parameter space, comprising Teff, log 𝑔, metallicity ([Fe/H]), and [α/M] abundances for the three distinct trainsets we are using: APOGEE (blue dots), LAMOST (red dots), and GALAH (green dots). Notably, these distributions cover an extensive range of Teff , log 𝑔, [Fe/H], and [α/M], with values ranging from approximately 3000 K to 7000 K for Teff, 0.5 to 4.5 for log 𝑔, −3 to 1 for [Fe/H], and −0.4 to 0.6 for [α/M]. In summary, the three data samples consist of dwarf stars and giants stars, exhibiting similar distributions (see the diagram of log 𝑔 as a function of Teff in the second panel line of Fig. 3). Despite the clear separation of super-giant and giant stars in the plot, they are considered as a single class to maximize the number of training data (see top left panel in Fig. 3). Indeed, the GALAH sample is the largest, varying from about 19 to 25 thousand sources, with a primary focus on stars with temperatures exceeding 4000K. Conversely, the LAMOST and GALAH datasets primarily include stars with higher temperatures than APOGEE; on the other hand, LAM- OST has a wider range of log 𝑔 than does GALAH (for more details, see Table 1).

We did not impose a typical cutoff in temperatures, so values for Teff < 4000 K or Teff > 6500 K should be interpreted with caution. The GALAH, LAMOST, and APOGEE datasets provided a wide array of elemental abundances, encompassing parameters such as [α/Fe], [Fe/H], [S/Fe], [Cr/Fe], [Mn/Fe], [Co/Fe], [O/Fe], [Na/Fe], [K/Fe], [Ti/Fe], [C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]. This paper explores the possible linear or non-linear relationships between these parameters, using machine learning techniques, employing S-PLUS colors within our initial S-PLUS sample.

Figure 4 shows the array of parameter planes within the APOGEE training sets. These planes include various parameters plotted against [Fe/H]. Incorporating these diverse parameters expands our ability to estimate stellar parameters across a wider range, broadening the scope of our analysis. Similar figures showing the distribution of available parameters for GALAH and LAMOST data can be found in Appendix A.

thumbnail Fig. 2

S-PLUS photometry and SDSS spectra of a dwarf star (top panel) and a giant star (bottom panel). The colored symbols represent S-PLUS photometry in flux units for the following filters: u, J0378, J0395, J0410, J0430, g, J0515, r, J0660, i, J0861, and z (from left to right). Square symbols denote broad-band filters, while circular symbols indicate narrowband filters. The filter responses are shown accordingly.

Table 1

Summary of training datasets used for neural network (NN) and random forest (RF) models.

Table 2

Summary of the training datasets categorized by [Fe/H] intervals.

3 Machine learning approaches and data normalization

Neural networks (NNs) and random forests (RFs) were employed for a dual verification of the results, aiming to mitigate the dependency of the estimation on a single method. NNs, with their capacity to model intricate, nonlinear data relationships, are highly adept at accommodating a wide variety of data types and capturing complex patterns. On the other hand, RF offers robustness against overfitting, demonstrates competence in managing high-dimensional data, and furnishes valuable insights through the importance of the input parameters. Moreover, the decision tree structure of RF enhances interpretability, a feature that can be more challenging to achieve with a NN. A pragmatic approach combines NN and RF predictions for more robust and accurate parameter estimation (Cutler et al. 2007; LeCun et al. 2015). This approach aligns with best practices in machine learning, which emphasize the importance of feature selection and engineering to enhance model performance.

We harnessed the widely-used Keras library (Chollet & others 2018) and constructed a NN with multiple layers for processing. In this context, we explored various configurations, varying the number of layers from 1 to 10 and the number of neurons from 32 to 2000, namely, typical parameters used to define the network’s capacity to comprehend complex data structures. The optimal configuration was identified as using six layers with 1664 neurons distributed among them. Indeed, this approach closely follows the methodology applied to J-PLUS observations (Cenarro et al. 2018; Yang et al. 2022).

In the evaluation of RF, we conducted an extensive parameter-tuning exercise to identify the optimal configuration for our model. We systematically tested hyperparameters such as the number of trees, maximum depth of trees, and maximum features for each stellar parameter or abundance, exploring their impact on the model’s predictive capability. By analyzing the mean square error associated with different hyperparameter configurations, we selected the most effective model for each stellar atmospheric parameter or abundance. This iterative process, which involved refining model parameters through repeated testing, enabled us to determine the optimal combination that yielded the best predictive results and assess the importance of each color in predicting stellar parameters or abundances, facilitating an astrophysical interpretation.

The available stellar atmospheric parameters from GALAH, LAMOST, and APOGEE were accessed using standardized input vectors, pre-processed data that ensure consistent scaling and units, along with the optimal NN and RF configurations. We partitioned the data into two segments: 80% for training and 20% for testing, adhering to standard cross-validation practices (e.g., Belgiu & Drăguţ 2016). Each data sample was analyzed independently and this partitioning strategy was employed to evaluate the model’s training efficacy. The results of this estimation are detailed in Section 4.

The input variables, standardized to ensure uniform scales, form the foundation for training our robust NN and RF models. Initially, our analysis focuses on stars with all 12 photometric measurements. In future work, utilizing S-PLUS DR 5, we plan to extend our approach to include sources with incomplete photometric data. The 12 S-PLUS magnitudes result in a total of 66 colors, and we will employ three distinct approaches to leverage this data effectively: (A) using a 66-dimensional input vector composed of S-PLUS colors; (B) estimating the effective temperature (Teff) using the initial 66 S-PLUS colors. Subsequently, a matrix with 67 columns, comprising both the colors and Teff, is employed to estimate additional parameters; and (C) estimating both the effective temperature (Teff) and surface gravity (log 𝑔) using the 66 S-PLUS colors. Following this, a matrix with 68 columns, including the colors, Teff , and log 𝑔, is utilized to estimate the remaining parameters. The normalization is outlined as follows: y=yμσ,$y' = {{y - \mu } \over \sigma },$(1)

where y and y′ are the original and standardized input vectors, respectively. Additionally, µ and σ represent the mean and standard deviation of all the original input vectors, respectively.

thumbnail Fig. 3

Corner plot for the APOGEE, LAMOST, and GALAH training sets, that coincide with S-PLUS data, indicated in green, red, and blue, respectively, showing the distribution of Teff, log g, [Fe/H], and [α/X], where X denotes [M] for LAMOST and APOGEE data, and [Fe] for GALAH data. The histograms along the leading diagonal illustrate the parameter distributions for Teff, log g, [Fe/H], and [α/X] across the three trainning datasets.

thumbnail Fig. 4

Density plots illustrating the distribution of log g, Teff , and different elemental abundances of the APOGEE sample used in the training. In the top-right corner of each panel the number of sources (N) in each distribution is shown. A randomly selected sample of 100 sources for dwarfs, giants, and supergiants is scatter plotted to avoid crowding.

4 Results

This section presents the results obtained from two distinct methods, RF and NN, implemented using three different approaches (ABC, see Sect. 3) and applied to three separate datasets. This results in a total of 18 unique strategies for parameter estimation. To determine the most effective method, we utilized the r2_score function from scikit-learn (Pedregosa et al. 2011), which we labeled as the goodness-of-fit. This function provides a relative measure of goodness-of-fit, indicating the extent to which the model aligns with the observed data in comparison to a basic mean baseline. We chose to use the r2_score over root mean square error (RMSE) because r2_score offers a normalized measure that can indicate the proportion of variance explained by the model. The best possible score is 1.0, while a score of 0.0 indicates that the model performs no better than the mean baseline. Negative values indicate that the model performs worse than the mean baseline. This makes r2_score particularly useful for comparing the performance of different models across varying scales. On the other hand, RMSE provides an absolute measure of error, which can be more challenging to compare across different parameters or datasets. We only considered parameters with a goodness-of-fit greater than 50% (see Appendix B) across all approaches used. Such a limit constrains our parameters to Teff, log 𝑔, [Fe/H], [α/Fe], [Al/Fe], [C/Fe], [Cu/Fe], [Li/Fe], [Mg/Fe], [O/Fe], and [Si/Fe]. Figure 5 shows the goodness-of- fit for the parameters considered, all of which have scores greater than 50%. However, not all parameters are available for both giants and dwarfs. Furthermore, the LAMOST sample for giant stars includes several parameters with a training set smaller than 100, and these were excluded from our analysis.

At this stage, two critical aspects require careful examination: 1) the optimization of machine learning by adjusting input parameters to ensure alignment with S-PLUS filters and 2) the assessment of parameter reliability. These criteria culminate in the establishment of flags, which are described below.

The NN approach consistently outperforms the RF approach, showcasing goodness-of-fit improvements of up to ∼6% (see Appendix B). These results established it as the standard prediction method. However, according to our results, the RF approach performs better for [Si/Fe] and [C/Fe].

The RF approach offers better goodness-of-fit than NN for APOGEE [C/Fe] in giant stars, as well as for [Si/Fe] and log 𝑔 in dwarfs. Additionally, LAMOST log 𝑔 shows slightly better results with the RF approach.

The accuracies of APOGEE, GALAH, and LAMOST for the same parameters vary. For example, the goodness-of-fit of Teff estimation for APOGEE is approximately 9% better than that found for GALAH in the case of giant stars. In general, S-PLUS estimations based on GALAH input data exhibit smaller accuracies compared to those derived from APOGEE and LAMOST data. The near-infrared spectroscopy provided by APOGEE, along with the wide spectral coverage offered by LAMOST, appears to confer advantages for the parameters studied. However, a straightforward comparison of the performance of the different training sets should be approached carefully, as they do not equally cover the full range of values or sample all intervals uniformly (see Fig. 3 and Table 1). For example, the GALAH data tends to assume lower values of Teff and [Fe/H], which may result in reduced in-sample accuracy for GALAH measurements.

Incorporating Teff and surface gravity (log 𝑔) as input parameters enhances the goodness-of-fit of parameter estimations by approximately ∼1% across several parameters. Specifically, we observe an increase in accuracy of about 2% and 5% for determining [Mg/Fe] and [Fe/H] ratios, respectively. This suggests that these particular parameters exhibit heightened sensitivity to variations in Teff and log 𝑔, compared to other parameters within the tested sample.

As Teff increases, the thermal ionization and excitation rates of atoms change, leading Teff as an important feature to estimate such parameters. Indeed, the dependence of spectral lines on Teff and log 𝑔 are a fundamental aspect of stellar spectroscopy (e.g., Payne 1925; Kurucz 1979; Cayrel de Strobel et al. 2001; Gray 2005). The influence of log g on the studied chemical abundances is particularly notable in elements related to surface gravity-sensitive features. On the other hand, Teff plays a crucial role in shaping the spectral features related to abundance ratios like [C/Fe], [O/Fe], and [Ti/Fe] which are known to be strongly influenced by changes in temperature. Indeed, Beeson et al. (2024) show that the use of photometric priors can improve the goodness-of-fit of abundances and log 𝑔. [α/Fe], [Al/Fe], and [Mg/Fe] are known to exhibit significant variations with changes in log 𝑔; accordingly, our results also present log 𝑔 as the main feature to estimate such parameters (see Fig. 6). The [C/Fe], [O/Fe], and [Cu/Fe] lines originate from atomic transitions that are sensitive to the thermal conditions in the stellar atmosphere.

Moreover, we have also observed that the narrowband filters J0378, J0395, J0410, and J0430, along with the u-band, play a crucial role in determining stellar parameters and abundances. These filters contribute significantly to the key features used in our analysis, as the feature importance for many of the outputs is strongly linked to these filters (see Fig. 6). Intriguingly, the importance of these filters varies across different surveys, suggesting that their efficacy may be influenced by the unique characteristics of each dataset (refer to Appendix C). Moreover, despite reddening correction, high extinction regions, such as those in the disk and bulge (observed with APOGEE-S), will lead to large uncertainties in colors, particularly when using bluer bands, potentially resulting in poorer outcomes. This should be considered when selecting reliable samples.

In summary, the results obtained using RF and NN with approaches ABC were compared and found to agree within the uncertainties. However, the NN implementation using approach C tends to yield superior results in terms of estimated accuracies (see Appendix D). Figure 7 displays the predicted versus estimated values for Teff, log 𝑔, [α/Fe], and [Fe/H] across the APOGEE, LAMOST, and GALAH datasets. The results are largely consistent. However, there exists a distinct cluster of stars, primarily dwarf stars, in the GALAH dataset that falls outside the boundaries defined by the dashed line. We isolated this group of stars and found that their GALAH effective temperatures fall within the range of approximately ∼3200 to ∼4500K. However, the predicted temperatures using NN indicate a wider range, spanning from ∼4300 to ∼6300K. To investigate this discrepancy, we crossmatched these stars with the Gaia DR3 and the Transiting Exoplanet Survey Satellite (TESS - Ricker et al. 2015) catalogs, which present temperature intervals of approximately ∼4300 to ∼6600 K and ∼4300 to ∼6300 K, respectively. These Teff values align more closely with our predictions, suggesting a possible underestimation in GALAH temperature measurements for this group.

Previous studies have also addressed similar aspects of GALAH temperature calibrations. For example, Giribaldi et al. (2023) note differences in GALAH DR2 parameters and suggest that additional standard stars could enhance calibration. Hegedűs et al. (2023) discuss variations across high-resolution surveys and the need for further investigation into these discrepancies. Additionally, Wheeler et al. (2020) highlights variations in GALAH’s temperature and metallicity measurements, suggesting that further validation could be useful. These references underscore the importance of a thorough reassessment and incorporation of complementary parameters, such as photometric colors, variability, evolutionary stage indicators, and reddening characteristics to address potential differences in GALAH’s temperature estimations more effectively.

For each parameter and training set, we obtained one NN and one RF model. Users can generally apply these models to estimate stellar parameters and abundances for new data obtained by S-PLUS that will be available at the S-PLUS cloud2. Input requirements include the 12 magnitudes with their respective errors for the determination of Teff and log 𝑔. For the remaining parameters, users can employ the ABC approach. For the B approach, 66 colors plus Teff values are required, while for the C approach, log 𝑔 values are also necessary.

We also investigated the correlation of the abundance ratios determined in this study with temperature and log g, but found no strong correlation between them. The ability of the NN and RF methods to uncover non-linear relationships among these parameters, coupled with the abundance ratios falling within the S-PLUS filters, enhances the reliability of the estimated elemental abundances. However, to provide a cautionary note regarding the parameter estimates, the latter were flagged as follows: Flag 00: Stellar parameters or elemental abundances with a goodness-of-fit higher than 60% in all approaches and methods used in this work. Flag 01: Elemental abundances of [Mg/Fe] and [α/M] within S-PLUS filters that have a goodness-of-fit below 60%. These abundance ratios were also flagged as 00 using another sample. Flag 02: Elemental abundances of [Cu/Fe], [O/Fe], and [Si/Fe] with a goodness-of-fit above 50% and below 60% that were not flagged as 00 or 01. Such abundances do not match with higher accuracy in any other tests using different samples. Given the lower accuracy, they should be used with extra caution.

thumbnail Fig. 5

Comparative analysis of RF and NN goodness-of-fit across diverse parameters in the APOGEE, GALAH, and LAMOST MRS Surveys. These results are based on models constructed using only S-PLUS color information. Only parameters with goodness-of-fit exceeding 50% are included.

thumbnail Fig. 6

Ten primary features utilized for prediction are arranged in decreasing order of their importance in the RF model. At the top of each diagram, the star type (giant or dwarf), data sample (LAMOST, APOGEE, or GALAH), and the spectroscopic parameter is specified.

4.1 S-PLUS parameter estimation

The initial sample comprises approximately 5 million stars observed through the 12 filters. Subsequently, this sample was crossmatched with Gaia DR3 data (using a search radius of two arcseconds) to obtain Gaia log 𝑔 and Teff . This procedure facilitates the classification of stars as giants or dwarfs. Stars near the classification thresholds or parameter limits should be interpreted with caution. However, in addition to this step, we also can also determine log 𝑔 and Teff solely based on the S-PLUS data.

To handle data that fall outside the range of our training set, a normalization method is applied, where values below the minimum threshold are set to zero, and values above the maximum threshold are set to one. This method can introduce biases for sources that lie outside these thresholds, so it should be used with caution. To assess the extent of this issue, we introduce a flag feature (FF) that indicates the proportion of features for each source that fall outside the training set limits. The FF value ranges from 0% to 100%, representing the percentage of parameters that exceed the established training set limits.

Table 3 displays the number of sources processed using our methodology, comprising ∼140 thousand giant stars and ∼4.9 million dwarf stars. Remarkably, the abundance ratios and stellar parameters of about ∼90% for giant stars and ∼75% of dwarf stars fall within the constraints of the training sets. While the optimal outcome is achieved when all features are within the training set limits, cases where missing features are deemed unimportant for determination may still yield acceptable results, particularly for sources with lower flag numbers.

4.2 Catalog description

Appendix F describes the stellar parameters and abundances ratios provided in this work and instructions on how to access them. The catalog primarily consists of the 12 S-PLUS magnitudes along with their respective errors, in addition to the abundance ratios estimated, each accompanied by a flag indicating the number of features inside the training set limits (see Section 4.1). Figure 8 displays the distribution of the predicted parameters for APOGEE data pertaining to giant stars. Teff and log 𝑔 do not exhibit any correlation with metallicity. Since these parameters are mainly determined by stellar characteristics unrelated to metallicity, such as stellar evolution and surface properties, the lack of correlation between Teff and metallicity is not expected. The apparent lack of correlation between Teff, log(𝑔), and metallicity in the giant star sample may be explained by the inclusion of both red clump and red giant branch stars. Red giant branch stars exhibit a metallicity dependence, particularly from one side of the branch to the other (Salaris & Cassisi 2002), but this trend can be obscured by the more stable properties of red clump stars (Pietrinferni et al. 2004), resulting in a diminished overall correlation.

The plot of [Mg/Fe] as a function of [Fe/H] displays a bimodality. The bimodal distribution observed in the [Mg/Fe] plot may indicate distinct stellar populations or formation mechanisms. One possible explanation is the presence of two populations of stars with different enrichment histories: one formed from pristine gas with low [Fe/H] (stars at the MW thick disk) and another formed from gas enriched by previous generations of stars (stars at the MW tin disk). The weaker bimodality observed, particularly for the APOGEE sample, can be attributed to the fact that the S-PLUS survey primarily observed stars from the thin disk, with a lower representation of thick disk stars. Furthermore, only stars with observations in all 12 filters were included, which further reduces the sample size, particularly for thick disk populations. Such bimodality was recently revealed using APOGEE data (Rojas-Arriagada et al. 2019; Horta et al. 2023), indicating the consistency of our results.

Overall, [α/M], [Si/Fe], and [O/Fe] exhibit a weak correlation with [Fe/H]. The weak correlation between these elements and [Fe/H] may be attributed to their diverse formation processes and sensitivity to environmental conditions. For instance, Hα emission is influenced by factors such as stellar activity and gas dynamics in the interstellar medium, which may not directly correlate with [Fe/H]. This lack of correlation, as noted, could be related to different formation processes. Similarly, [Si/Fe] and [O/Fe] are produced through different nucleosynthesis pathways leading to varying degrees of sensitivity to [Fe/H] across different stellar populations (e.g., Snaith et al. 2014; Amarsi et al. 2019; Rossi et al. 2024).

[C/Fe] displays a very weak or no correlation with [Fe/H]. Similarly to the previous topic, the lack of correlation between these elements and [Fe/H] can be attributed to their complex nucleosynthetic origins and dependence on multiple astrophysical processes. [C/Fe], for example, can be produced through both stellar nucleosynthesis and nucleosynthesis in supernova explosions, leading to a wide range of carbon abundances across different stellar populations.

A thorough analysis of abundance ratio correlations requires addressing the literature where these trends have been extensively discussed. Such a discussion is beyond the scope of this work that aims to release the parameters. A forthcoming paper will address a chemo-dynamical description of the Milky Way. We also utilize Gaia DR3 distances to obtain an overview of the distribution of the parameters across the sky. Figure 9 displays the sky distribution of our results where the cylindrical galactocentric distance is denoted as R, and the distance to the Galactic plane is Z:

  • [Fe/H]: The iron abundance reveals a clear gradient in the Milky Way. The [Fe/H] values range from approximately 0.5 to values below −2.5, indicating a decrease in metallicity from the disk to the outer regions of the Galaxy. This metallicity gradient is consistent with previous studies that have shown the prevalence of metal-rich stars in the inner regions of the Galaxy and metal-poor stars in the outer regions (e.g., Hayden et al. 2015; Gaia Collaboration 2023);

  • [Mg/Fe]: Trends for this abundance range change with galactocentric distance. In the disk region, it remains relatively constant at around 0.1, indicating a balance between Type Ia and Type II supernova enrichment. However, as we move towards the outer regions (higher Z values), [Mg/Fe] shows a gradual increase, reaching approximately 0.3. This trend suggests a higher contribution from Type II supernovae or a lower contribution from Type Ia supernovae in the outer regions, resulting in enhanced magnesium enrichment (e.g., Nidever et al. 2014; Hayden et al. 2015);

  • [Al/Fe]: The distribution of aluminum abundance exhibits complex behavior across different galactocentric distances. At the Galactic center, [Al/Fe] is approximately 0.1, indicating a relatively low abundance compared to iron. However, higher Z values around 3 kpc, [Al/Fe] increases to around 0.2. Interestingly, for even higher Z values, drops below −0.1, suggesting a decrease in aluminum abundance in the outer regions of the Galaxy. These variations may reflect different enrichment processes and stellar populations in different regions of the Milky Way (e.g., Magrini et al. 2021). Indeed, the current assumptions are based on a broad overview but the current sample primarily represents stars from the thin and thick disks, with other regions being less represented. A more detailed analysis, particularly focusing on halo stars and streams, will be addressed in future work;

  • [Si/Fe]: Close to the Galactic center, the silicon abundance compared to iron is around 0.05. For Z around 1 kpc, it increases to approximately 0.15,; this trend is accentuated for even higher Z values, where [Si/Fe] exceeds 0.25. This behavior suggests a higher contribution from Type II supernovae or silicon-rich stellar populations in the outer regions (e.g., Hogg et al. 2016);

  • [C/Fe]: This abundance ratio displays significant variations with galactocentric distance, ranging from −0.1 at the center to around 0.05 at a Z-height of about 1 kpc. Remarkably, at even higher Z values, [C/Fe] drops below −0.15, indicating a decrease in carbon abundance in the outer regions of the Galaxy. These variations may be attributed to different enrichment processes and stellar populations (e.g., Rossi et al. 2024);

  • [α/M]: The abundance ratio of α elements to iron displays consistent trends across different galactocentric distances. Initially, close to the Galactic center, [α/M] is approximately 0.05. However, as we move towards outer regions, [α/M] exhibits a linear increase, reaching values around 0.2. Beyond Z values of 5 kpc, [α/M] shows scatter. These trends reflect the varying contributions of different nucleosynthesis processes and stellar populations in different regions of the Milky Way (e.g., Anders et al. 2014);

  • [O/Fe]: As expected, the distribution of oxygen abundance exhibits similar trends to [α/Fe], albeit with a different range of values. Like [α/Fe], [O/Fe] shows an increase from the Galactic center towards outer regions, reflecting the enrichment history of the Galaxy. However, [O/Fe] values may differ due to the specific nucleosynthesis processes involved in oxygen production and the different chemical evolution pathways of oxygen compared to other α elements (e.g., Nissen et al. 2014).

In summary, the observed distributions of elemental abundances align with the chemical evolution and enrichment history of the Milky Way Galaxy. However, further investigations are essential, including targeted spectroscopic follow-up on selected objects to refine parameter determinations and assure a higher level of accuracy. Potential spectroscopic follow-ups could greatly enhance our understanding of specific stellar populations, including halo stars, globular clusters, and metal-poor stars. This would provide more detailed insights into their chemical compositions and evolutionary histories. The current study’s framework also allows for expansion into these regions using photometric data, with broader studies including the halo being feasible without immediate spectroscopy. The parameters estimated in this study reproduce the global trends for the MW, in good agreement with the existing knowledge. This also holds the potential of expanding the method (without spectroscopy) to other studies. However, the integrating our with proper motion measurements from surveys such as Gaia can enhance target selection for further studies of star clusters and other stellar populations.

thumbnail Fig. 7

Results of the testing set for various astrophysical parameters (Teff, log g, [Fe/H], α/M, [Mg/Fe]) obtained in this study using the NN model. Each plot features a one-to-one correlation between the parameters obtained from APOGEE, GALAH, and LAMOST surveys and those predicted by the NN, with identity line included for reference. Blue and red dots correspond to dwarfs and giants, respectively. For both dwarfs and giants, the colorbar indicates the Teff. The size of the symbols represents the value of log ɡ.

Table 3

Summary of the number of giant and dwarf stars with computed stellar parameters or abundances.

thumbnail Fig. 8

Density plot depicting the pairwise distribution of different atmospheric parameters and chemical abundances for a total of ~ 140 000 giant stars predicted in this study, based on the APOGEE based trainning set. The title of each plot indicates the exact number of stars plotted, while the color bar represents the density distribution.

thumbnail Fig. 9

Spatial distribution of Milky Way giant stars and their various chemical abundances calculated in this study. In each plot, radial distance (R) in kpc from the center of the Milky Way is plotted along the abscissa, and the Galactic vertical height (z) is plotted along the ordinate. The colorbar represents the range of various calculated chemical abundances obtained in this work, illustrating how these abundances vary across different Galactic positions. The title of each plot indicates the training data set used to calculate the respective chemical abundance and the number of giant stars plotted.

4.3 Star clusters

We performed a crossmatch with star clusters to evaluate the accuracy of our derived parameters. The catalogs compiled by Vasiliev & Baumgardt (2021) and Hunt & Reffert (2023) include 170 globular clusters and 7167 open clusters, providing membership probabilities that indicate the likelihood of a star being a member of a specific cluster. These studies generate updated lists based on refined models and parameters, meaning that the membership probabilities may vary with new data or methodologies (e.g. Galleti et al. 2006; Alonso-García et al. 2021; Dias et al. 2021). For our analysis, we initially considered only stars with a membership probability greater than 0.95, using a search radius of one arcsecond to identify cluster members.

A more detailed analysis was conducted, taking into account that star cluster members have been observed by GALAH, APOGEE, or LAMOST and should exhibit similar metallicity. For example, we identified two stars, 6816573828484824192 and 6817326448488737152, with metallicities of [Fe/H] ≃ −0.04 and [Fe/H] ≃ −0.48, respectively, as computed from GALAH spectra. These stars were previously considered members of NGC 7099, which has an estimated metallicity of [Fe/H] ≃ −2.29. Our own estimations yielded similar metallicities for these stars, i.e., [Fe/H] ≃ −0.05 and [Fe/H] ≃ −0.41. This discrepancy raises several issues, such as the possibility that our data may identify stars that do not belong to the cluster or highlight photometric problems that result in inaccurate metallicity estimations.

To analyze how closely the photometric metallicities align with the spectroscopic values, we focus only on stars whose metallicities are close to those reported in the literature, applying an error tolerance of 0.2 × (1 + |[Fe/H]Literature|). This method imposes a minimum error of 0.2 dex for stars with metallicity equal to zero. Such an assumption helps ensure the best membership assignment for the clusters. However, stars not included in this analysis may either not belong to the cluster, or have been accreted. Alternatively, our measurements may be under- or overestimated due to issues such as crowding. Appendix G presents the literature values, along with the total number of giants and dwarfs with membership probabilities greater than 0.95 (Nall). It also shows the number of giants (NG) and dwarfs (ND) used to compute our metallicity estimates, based on the criteria outlined above. The key findings are summarized below:

Figure 10 presents a comparison between the literature metallicities and those derived from our data. Nevertheless, the sigma values obtained from our estimates are smaller than those assumed in our criteria for selecting cluster members. The largest discrepancies are observed for metal-poor stars; however, the limited number of sources in this region prevents a deeper analysis. Our results indicate that our estimates can reliably be used to identify member stars in star cluster analyses based on their metallicity values.

The proportion of stars with membership probabilities consistent with our estimated metallicity ranges varies from approximately 15% to 100%. This variation may arise from several factors, including the possibility that many stars do not belong to the cluster, as well as issues related to our estimations, such as photometric contamination, reddening, and stellar brightness. A more detailed analysis, focusing specifically on star clusters, will be presented in a forthcoming paper.

NGC 7099 (M30) is the star cluster with the lowest metal-licity among the tested samples (~−2.29 dex), and 81 out of 339 stars (~24%) classified as members have metallicity values consistent with our estimations (O’Malley & Chaboyer 2018). Indeed, our estimated value is ~0.3 dex higher than that found in the literature, suggesting that our method may overestimate the metallicity of metal-poor stars. Further investigation is required to understand the factors contributing to this discrepancy and to improve the accuracy of our estimations.

NGC 7492 and NGC 7089 have literature metallicity values of about −1.65 dex. While the results for giant stars from APOGEE align with these values, the metallicities for dwarf stars appear to be underestimated. On the other hand, this trend is also observed in other star clusters in our sample, suggesting that metallicity estimates for giant stars are generally more reliable than those for dwarf stars. Empirical adjustments between spectroscopic and photometric metallicities (e.g., Calamida et al. 2009; Lianou et al. 2011; Bellazzini et al. 2023) can be made to bring the results closer. However, applying such corrections could bias the estimates in cases where the original results are already in agreement (see the discussion on NGC 104 and NGC 3201).

NGC 3201 and NGC 288 star clusters have literature metallicities of −1.46, and −1.30, respectively. Our results can be used to assess membership probability based on the metallicity criterion. The values derived using the GALAH giant and APOGEE dwarf training sets are in agreement with previous estimations inside the error bars. However, our estimations indicate slightly higher metallicities for both clusters. The values derived using the APOGEE giant and GALAH dwarf training sets are more reliable compared to other methods. Based on these estimations, we find metallicities of −1.35 to −1.15 for NGC 3201 and NGC 288 respectively, both with similar sigma values of approximately 0.15 dex. Despite the consistency in these estimations, further research and analysis are necessary to fully understand the underlying mechanisms driving the observed differences in metallicity.

The star clusters NGC 1261, NGC 362, and Pal 12 show metallicity values consistent with those computed by us, primarily using GALAH and APOGEE as training sets. The results from LAMOST are in agreement but tend to fall at the lower limit of the literature values. Similar patterns are observed in other globular clusters when comparing with literature metallicities.

In NGC 104 (47 Tucanae), the literature value for [Fe/H] (−0.67 ± 0.04) aligns well with the values derived from the three data samples. NGC 104 has member stars with probabilities and metallicities close to the expected values for star clusters, ranging from 360 to 5582. This means approximately 80% of the stars are identified based only on the probability values. We observed a standard deviation of [Fe/H] smaller than 0.18 dex when compared to the literature. Remarkably, this cluster boasts the highest number of members among those studied, likely due to its proximity and one of the most massive GCs in the Milky Way, which facilitated a more comprehensive analysis in forthcoming work.

For star clusters with metallicities higher than −0.5 dex, we found good agreement between our estimates and values from the literature, with sigma values smaller than −0.5 dex. However, Ruprecht 4 stands out with a larger discrepancy: the literature reports a metallicity of about −0.09 dex, while our estimate is −0.27 dex. Carraro et al. (2007) analyzed five stars in Ruprecht 4, finding that three of the brighter stars had metallicities around 0.00 dex, while two fainter stars had values near −0.35 dex. The brighter stars were better aligned with the isochrones used by the author, leading to the adoption of the higher metallicity for the cluster. Since our study includes different stars than those analyzed in the literature, a re-evaluation of this cluster is necessary to better understand its true metallicity.

In general, our approach demonstrates that one can select member stars based on our outputs (see Fig. 10), although the LAMOST models exhibit the least accuracy, with sigma values around 0.2 dex. As discussed earlier, this was necessary because some stars flagged as cluster members showed metallicities significantly different from the expected values for their clusters. To be cautious, we limited our analysis to stars with metallicities not too divergent from the cluster’s average, which introduced some bias into the results, even though the sigma values are smaller than those found in other studies. A more thorough investigation focusing exclusively on cluster stars will be conducted to clarify this issue. We also observed a notable discrepancy for metal-poor stars. Unfortunately, we were unable to find a counterpart for any of the star clusters analyzed in the LAMOST data, which prevents us from verifying whether these discrepancies are due to the signal-to-noise ratio limit we applied (S/N > 20) or another factor. Indeed, the training set with the largest sample of metal-poor stars belongs to the GALAH data, in contrast to the LAMOST data (see Table 2). Consequently, the lower accuracy of [Fe/H] for LAMOST may be attributed to its limited sample of metal-poor stars, as well as the fiber size of 3 arcseconds, which makes it ill-suited for globular cluster analysis. Based on these findings, we recommend that users rely on the metallicity estimates derived from GALAH data, as they are likely to be more accurate.

The results for giant and dwarf stars derived from the GALAH and APOGEE training sets show better agreement compared to those from the LAMOST estimates. Typically, globular clusters are monometallic, meaning they exhibit a well-defined metallicity with narrow distributions of [Fe/H] (Harris 1996). However, a particularly notable globular cluster is Omega Centauri (ω Cen), which is unique due to its significant spread in metallicity, ranging from approximately −2.2 to −0.6 dex (Gratton et al. 2011). Research into ω Cen has revealed multiple stellar populations, apart from typical globular clusters, making it a key focus in understanding the formation and evolution of globular clusters (Kraft 1994; Bedin et al. 2004). The metal-licity estimates for entire clusters highlight intriguing targets for further investigation, particularly regarding potential metal-licity variations and the structural properties of these clusters. For instance, in Fig. 11 (left panel), the sky plot illustrates the distribution of [Fe/H] for NGC 104. Notably, the metallicity distribution of NGC 104 is quite homogeneous for all member sources. The no matched metallicity values suggests that some stars within these clusters may not belong to the cluster, or may have been accreted, or may have poor spectroscopic and/or photometric estimates.

On the other hand, the metallicity distribution of NGC 3201 near the center of each cluster appears to be lower compared to the outer regions. Conversely, NGC 3201 exhibits a distinct metallicity gradient, with stars near the center being notably metal-poor, while those towards the outer regions show a scattering of more metal-rich stars. The observed [Fe/H] spread may be influenced by several systematic factors. Aperture photometry, affected by crowding, likely introduces inaccuracies in flux measurements, which can be mitigated by reprocessing the data with PSF photometry. To address potential field star contamination, a crossmatch with Gaia DR3 is recommended, leveraging its precise astrometry to isolate true cluster members. Additionally, ensuring accurate photometric calibration and robust SED fitting procedures is crucial to avoid biases. By addressing these factors, we aim to determine if the observed abundance spread is genuine or an artifact of the analysis process. For the true multiple population star cluster, such results serves as a mere example of how available data can be leveraged to explore various scenarios and identify potential targets. However, comprehensive studies are imperative to substantiate these observations further (Bonatto et al. 2019).

thumbnail Fig. 10

Comparison of metallicity ([Fe/H]) values from the literature (x-axis) with the estimations obtained in this work (y-axis) for the selected star clusters.

thumbnail Fig. 11

Sky plots for globular clusters NGC 104 (left panel) and NGC 3201 (right panel). The color bar in each plot indicates the [Fe/H] values determined in this study for each member star of the cluster, utilizing the GALAH data as training set.

4.4 TESS input catalog

The TESS Input Catalog (TIC) is a comprehensive catalog of stellar parameters and photometric data for stars observed by the TESS mission (Stassun et al. 2019). The creation of the TIC involves several steps where an initial catalog is cross-matched and integrated to create a unified catalog with consistent stellar parameters and photometry. Rigorous quality control measures are applied to ensure the reliability and accuracy of the data in the TIC. This includes filtering out spurious detection’s, eliminating duplicate entries, and flagging uncertain measurements. Figure 12 illustrates the comparison between the [Fe/H] values derived from our analysis and the [M/H] values provided by the TIC. While [Fe/H] specifically measures the abundance of iron relative to hydrogen, [M/H] encompasses all elements heavier than helium in a star’s atmosphere. Therefore, comparing the two measures requires careful consideration. The fact that [α/M] increases for metal-poor stars is expected, as it reflects the higher relative abundances of α elements (like O, Ne, Mg, Si, S, Ar, Ca, and Ti) in older, metal-poor stars due to their formation in core-collapse supernovae. A complete relation between [Fe/H] and [M/H] requires knowledge of both [α/M] and [α/Fe], which is not available in our current analysis.

In Figure 12, we can observe a systematic discrepancy between [M/H] from the TIC catalog and [Fe/H] derived from the machine learning model based on S-PLUS colors, particularly when using the APOGEE and LAMOST datasets. This discrepancy can be attributed to several factors. APOGEE, which operates in the near-infrared, typically measures higher metallicity values compared to optical surveys like GALAH, especially at lower Teff values (Hegedűs et al. 2023). Moreover, APOGEE tends to report lower α-element abundances [α/Fe] than optical measurements, as noted by Jönsson et al. (2020). Similar issues also affect LAMOST, which has lower resolution compared to GALAH. The reduced sensitivity of machine learning models using APOGEE and LAMOST data, relative to GALAH, can be explained by the role of α elements in shaping the spectral energy distribution. α elements are more prominent in optical spectra and less sensitive in the near-infrared. Studies have shown that optical surveys like GALAH, with higher resolution, provide more accurate measurements of α-element abundances, especially for metal-poor stars and halo populations (Buder et al. 2018; Ness et al. 2015). The S-PLUS colors, based on optical narrowband filters, are more adept at capturing these variations, leading to better agreement when trained on optical data from GALAH. In contrast, the near-infrared range of APOGEE and the low resolution of LAMOST not capture the subtle differences in α-element abundances as effectively, resulting in discrepancies when used for machine learning predictions.

The relation between [M/H] and [Fe/H] can vary depending on the correlation with stellar parameters and chemical abundances (e.g., Salaris et al. 1993). In particular, Zwitter et al. (2008) proposed a straightforward relation between [M/H] and [Fe/H], given by [M/H]=a×[Fe/H]+b×(1±(1exp(3.6|[Fe/H]+0.55|))),$[{\rm{M}}/{\rm{H}}] = a \times [{\rm{Fe}}/{\rm{H}}] + b \times (1 \pm (1 - \exp ( - 3.6|[{\rm{Fe}}/{\rm{H}}] + 0.55|))),$(2)

where the plus sign applies for [Fe/H] < −0.55 and the minus sign otherwise. The parameters a and b were estimated to be 1 and 0.11, respectively, based on the dataset analyzed by the authors. The change in the [M/H] to [Fe/H] relation around [Fe/H] < −0.55 results from differences in element synthesis and enrichment in stellar populations (McWilliam 1997). Stars with [Fe/H] < −0.55 are typically older and formed in a less enriched interstellar medium, leading to a higher relative abundance of α-elements (like oxygen, magnesium, and silicon) compared to iron (Thomas et al. 2005; Bensby et al. 2014). The value of −0.55 dex marks a transition where [α/Fe] increases, due to the dominance of Type II supernovae (which produce α-elements) in early galaxy enrichment, while Type Ia supernovae (which produce iron) contribute later (Tinsley 1979; Woosley & Weaver 1995). Consequently, below [Fe/H] ≈ −0.55, the [M/H] relation becomes non-linear, reflecting the enhanced contribution of α-elements relative to iron (Gratton et al. 2000; Schuster et al. 2004).

The key points observed in Figure 12 are as follows. For giant and dwarf slightly metal-rich stars (SMRS − [Fe/H] > 0), the value of [M/H] typically exceeds [Fe/H]. This is because [M/H] incorporates additional contributions from α-elements and other metals, resulting in a higher overall metallicity. Therefore, for stars in the SMRS, [M/H] is expected to be greater than [Fe/H], reflecting the increased metallicity of these stars (e.g., Casagrande et al. 2011; Mikolaitis et al. 2017).

For stars with near-solar metallicities ([Fe/H] = 0.0 ± 0.5), the contributions from α elements are relatively minor compared to iron. In these stars, iron lines dominate the spectra, leading to [Fe/H] values that closely reflect the iron abundance. As a result, [M/H] and [Fe/H] are approximately equal, since the overall metallicity measured by [M/H] is heavily influenced by iron, and the additional contributions from α elements do not significantly alter this balance (e.g., Beers & Christlieb 2005; Nomoto et al. 2013).

The distribution of [Fe/H] as a function of [M/H] for dwarfs and giants, determined using GALAH data as a training set, are nearly equal. They also follow the relation [Fe/H] ≃ [M/H], which suggests that the determination of metallicity for MPS could be biased by the presence of α elements. This indicates that further analysis is needed to accurately disentangle the contributions of various elements to the overall metallicity.

As discussed above, APOGEE, which operates in the near-infrared, generally reports higher metallicity values compared to optical surveys, such as GALAH. On the other hand, LAMOST, with its lower spectral resolution, faces similar challenges when compared to higher-resolution optical data like GALAH. The systematic difference observed in both surveys in comparisson with those found for GALAH data can be related to the underestimation of α-element abundances that are more important for stars having smaller metallicity values.

The model provided by Eq. (2) offers a way to improve the agreement between [Fe/H] and [M/H] values. The inconsistency found for the LAMOST and APOGEE training sets, where a notable break occurs at metallicities around ~−0.5, suggests that the [Fe/H] and [M/H] relationship is influenced by the different spectral ranges and resolutions of these surveys with their correlations with the S-PLUS colors. Since the model lacks a physical basis when [M/H] is lower than [Fe/H], the [Fe/H] values from APOGEE and LAMOST, particularly for [Fe/H] < −0.5, should be interpreted with caution. An additional flag (FlagTIC) was introduced to identify stars that do not align with the analysis based on the TIC dataset, with a value of 1 assigned to such cases and 0 otherwise.

For the GALAH training sets, the coefficients in both parts of Eq. (2) are nearly equal, indicating a consistent correlation between [Fe/H] and [M/H] for both giant and dwarf stars (see Table 4). This suggests that high-resolution optical spectroscopic surveys are better suited for developing models based on photometric colors for the elements analyzed in the current study.

A note of caution, TIC is compiled from various catalogs, some of which may include sources that overlap with our training set. However, the total number of entries in the TIC is at least three times larger than that of our training set. Additionally, [M/H] is not directly measured from [Fe/H]. The variation in coefficients governing the relationship between [Fe/H] and [M/H] for dwarfs and giants across different surveys underscores the complexity of determining stellar metallicities. While some results align with theoretical expectations, caution is warranted when interpreting the parameters provided. Notably, APOGEE and GALAH for giant stars, and GALAH for dwarf stars, exhibit good agreement between [M/H] and [Fe/H]. However, disparities arise between APOGEE and LAMOST for dwarf stars. A potential explanation for these differences lies in the varying spectral resolutions among surveys, with GALAH having a resolution up to seven times higher than APOGEE and LAMOST, as well as differences in the spectral range in which these surveys operate.

thumbnail Fig. 12

Comparison of [M/H] vs. [Fe/H] for dwarf stars (top panels) and giant stars (bottom panels). The color gradient represents [α/M], while the dashed and solid grey lines depict the linear fits and one-to-one relationship, respectively.

thumbnail Fig. 13

Empirical cumulative distribution functions (ECDFs) comparing predicted [Fe/H], Teff, and log g values from the S-PLUS (this work) with the predicted values of J-PLUS, and spectroscopic values from GALAH for 186 dwarf stars. The dotted lines represent the absolute differences between the ECDFs of the survey pairs, and the dashed horizontal lines indicate the Kolmogorov-Smirnov (KS) statistic for each pair of surveys, with lower KS values meaning closer agreement between the respective surveys.

Table 4

Values of the fitted coefficients from Eq. (2).

4.5 Comparison with J-PLUS and GALAH

We crossmatched our predicted S-PLUS catalog with the predicted catalog of J-PLUS and the spectroscopic catalog of GALAH DR3; and found a total of 186 dwarfs and 8 giants common in all three catalogs. To assess the similarity and differences between our predicted values along with the predicted values of J-PLUS and GALAH, in terms of [Fe/H], Teff, and log g of the 186 dwarf stars, we performed the Kolmogorov-Smirnov test (KS: Kolmogorov 1933; Massey 1951). From this test, we compare the empirical cumulative distribution functions (ECDFs) of the datasets, providing a statistic that quantifies the maximum difference between them. A lower KS statistic indicates greater similarity between the distributions.

Fig. 13 illustrates the ECDFs and KS statistics of the 186 dwarfs stars in three surveys. For [Fe/H], the KS statistic between S-PLUS and J-PLUS is 0.1613, between S-PLUS and GALAH is 0.2462, and between J-PLUS and GALAH is 0.1393, yielding the similarity between the three datasets. For the Teff, the KS statistics are notably lower, with S-PLUS vs J-PLUS yielding 0.1022, S-PLUS vs GALAH yielding 0.0600, and J-PLUS vs GALAH yielding 0.0878. These small values indicate a higher degree of similarity in the temperature distributions across the three surveys. For log g the KS statistics show larger variability, with S-PLUS vs J-PLUS at 0.2312, S-PLUS vs GALAH at 0.0762, and J-PLUS vs GALAH at 0.2580. This indicates that the log ɡ values differ more significantly between J-PLUS and the other two surveys, particularly in comparison with GALAH, where S-PLUS versus GALAH shows a much closer agreement.

It is important to note that the crossmatching for giant stars yielded only eight common stars, which was insufficient for a meaningful KS test comparison and, thus, the giant sample was not included in the KS analysis. Figure 14 displays the distribution of pairwise difference of the predicted values among three surveys, for 186 dwarf stars. For the Teff and log ɡ values of dwarf stars, S-PLUS surpasses JPLUS, when comparing with GALAH, with 1σ values of 162 and 0.23 for Teff and log ɡ, respectively. For [Fe/H] values of dwarf stars, the J-PLUS-GALAH has lower standard deviation, namely, 0.23. Table 5 presents the standard deviation values of [Fe/H], Teff, and log ɡ for comparisons between S-PLUS, J-PLUS, and GALAH, based on a sample of eight giant stars and 186 dwarf stars. The sigma values for giant stars are typically smaller than those for dwarfs; however, due to the lower statistical significance, it is challenging to draw a definitive conclusion on this topic.

5 Conclusions

We estimated the stellar abundances [Al/Fe], [C/Fe], [Li/Fe], [Mg/Fe], [O/Fe], [Si/Fe], and [Cu/Fe], along with the fundamental stellar parameters ([Fe/H], log ɡ, Teff, α/M) for approximately ~5 million sources selected from the 12-band multi-filter photometry provided by the S-PLUS survey. To achieve this, we utilized three training sets (APOGEE, GALAH, and LAMOST) employing RF and NN methods. The NN including Teff and log ɡ provides a better result than the other approaches tested. We included only those parameters that achieved a goodness-of-fit greater than 50% across all approaches (see Appendix B) to enhance the reliability of the estimated parameters. Additionally, we introduced a flag to inform users which stars have all input parameters within the training set limits (see Table 3). This combination of information serves as a strong indicator of the reliability of the computed abundances. Our findings highlight the importance of narrowband filters, such as J0378, J0395, J0410, and J0430, along with the u-band, for the determination of photometric abundances. Importantly, the frameworks developed in this study serve as a cost-effective alternative to spectroscopy and are designed for application in the S-PLUS Internal Data Release 5 (IDR5) as well, which is already available for internal use of the collaboration, as well as in the S-PLUS Ultra-Short Survey (USS- Perottoni et al. 2024). IDR5 encompasses approximately twice the data volume of DR4, providing expanded opportunities for research and analysis. Meanwhile, the USS covers the observation of bright stars across approximately 9300 square degrees of the Southern sky. We expect that the present work, which provides a means to determine detailed abundance ratios and physical parameters for both dwarf and giant stars from S-PLUS multiband photometry, will enable a variety of astrophysical applications in the near future.

thumbnail Fig. 14

Comparison of the pairwise differences between the predicted values from the S-PLUS (this work), J-PLUS and spectroscopic values from GALAH for [Fe/H], Teff, and log g, for 186 dwarf stars. The dashed-dotted lines represent mean value of each distribution. The 1σ value of the difference of each pair of surveys are displayed in the legend.

Table 5

Standard deviation values for the comparison of cross-matched S-PLUS, J-PLUS, and GALAH samples for giant and dwarf stars.

Data availability

The catalog is available in electronic form at the CDS via anonymous ftp to cdsarc.cds.unistra.fr (200.54.220.98) or via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/693/A306. The supplementary materials are also available in the Zenodo repository.

Acknowledgements

C.M. de O. thanks the S-PLUS team for their efforts in data reduction, precise calibration, and photometry. The authors thank Dr. Jessica Schonhut-Stasik for her insightful comments and contributions, which have greatly enhanced the quality and clarity of this paper. C.E.F.L. is supported by ANID’s Millennium Science Initiative through grant ICN12_12009, awarded to the Millennium Institute of Astrophysics (MAS); by ANID/FONDECYT Regular grant 1231637; by DIUDA 88231R11; LSST Discovery Alliance grant; by GEMINI grant 32240028. Support for N.M is provided by doctorado becas Chile/2021 - 21211323. The work of V.M.P. is supported by NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the U.S. National Science Foundation. G.L. acknowledges FAPESP proc. 2021/10429-0. Support for M.C. is provided by ANID’s FONDECYT Regular grant #1171273; ANID’s Millennium Science Initiative through grants ICN12_009 and AIM23-0001, awarded to the Millennium Institute of Astrophysics (MAS); and ANID’s Basal project FB210003. D.H. acknowledges the support provided by ANID through doctoral fellowship grant 21232262 for pursuing Ph.D. and ANID/FONDECYT Regular grant 1231637. LAGS and AVSC acknowledges financial support from Con-sejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Agencia I+D+i (PICT 2019-03299) and Universidad Nacional de La Plata (Argentina). J.A.-G. acknowledges support from Fondecyt Regular 1201490 and by ANID – Millennium Science Initiative Program — ICN12_009 awarded to the Millennium Institute of Astrophysics MAS. V.O. acknowledges Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) – Finance Code 001. M.B.F. acknowledges financial support from the National Council for Scientific and Technological Development (CNPq) Brazil (grant number: 307711/2022-6). AAC acknowledges financial support from the Severo Ochoa grant CEX2021-001131-S funded by MCIN/AEI/10.13039/501100011033. MEDR acknowledges support from Agencia Nacional de Promoción de la Investigación, el Desarrollo Tecnológico y la Innovación (Agencia I+D+i, PICT-2021-GRF-TI-00290, Argentina). The work of E.M.-P. is supported by CAPES (proc. 88887.605761/2021-00 and 88881.846754/2023-01) and by NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the U.S. National Science Foundation. S.D. acknowledges CNPq/MCTI for grant 306859/2022-0. PKH gratefully acknowledges the Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) for the support grant 2023/14272-4. F.A.-F. acknowledges funding for this work from FAPESP (procs 2018/20977-2). The work of E.M.-P. is supported by CAPES (proc. 88887.605761/2021-00 and 88881.846754/2023-01) and by NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the U.S. National Science Foundation. D.R.G. acknowledges grants from FAPERJ (E-26/211.527/2023) and CNPq (315307/2023-4). B.D. acknowledges support by ANID-FONDECYT iniciación grant No. 11221366 and from the ANID Basal project FB210003.

Appendix A Training sets (see link)

This section presents the plot containing the distribution of parameters identified in the GALAH and APOGEE datasets, the same as presented in Fig. 4

Appendix B Goodness-of-fit values (see link)

Goodness-of-fit values for the three methodologies (ABC, see Section 3) employed in this study.

Appendix C Importance plots (see link)

This section presents the importance feature plots for some of the parameters analyzed in this work. The remaining plots can be requested from the authors.

Appendix D Plots of RF as function of NNs (see link)

This section presents the plots illustrating RF as a function of NN estimations. These visualizations help to understand the relationship and dependencies between RF and NN in our analysis.

Appendix E Test set plots (see link)

This section features plots illustrating the testing set results for stellar parameters and elemental abundances using the Neural Network (NN) approach C.

Appendix F Catalog description (see link)

Description of the released table containing the estimated parameters obtained in this study. The full catalog is available at the CDS.

Appendix G Comparison of [Fe/H] Values for star clusters (see link)

The section contains the table summarizing the comparison between [Fe/H] values from various star clusters in our predicted sample and corresponding values from the literature.

References

  1. Almeida-Fernandes, F., SamPedro, L., Herpich, F. R., et al. 2022, MNRAS, 511, 4590 [NASA ADS] [CrossRef] [Google Scholar]
  2. Almeida-Fernandes, F., Placco, V. M., Rocha-Pinto, H. J., et al. 2023, MNRAS, 523, 2934 [CrossRef] [Google Scholar]
  3. Alonso-García, J., Smith, L. C., Catelan, M., et al. 2021, A&A, 651, A47 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Amarsi, A. M., Nissen, P. E., & Skúladóttir, Á. 2019, A&A, 630, A104 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  5. Anders, F., Chiappini, C., Santiago, B. X., et al. 2014, A&A, 564, A115 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Árnadóttir, A. S., Feltzing, S., & Lundström, I. 2010, A&A, 521, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  7. Auvergne, M., Bodin, P., Boisnard, L., et al. 2009, A&A, 506, 411 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Baglin, A., Auvergne, M., Barge, P., et al. 2007, in American Institute of Physics Conference Series, 895, Fifty Years of Romanian Astrophysics, eds. C. Dumitrache, N. A. Popescu, M. D. Suran, & V. Mioc, 201 [NASA ADS] [CrossRef] [Google Scholar]
  9. Bailer-Jones, C. A. L., Andrae, R., Arcay, B., et al. 2013, A&A, 559, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  10. Bailer-Jones, C. A. L., Fouesneau, M., & Andrae, R. 2019, MNRAS, 490, 5615 [CrossRef] [Google Scholar]
  11. Bedin, L. R., Piotto, G., Anderson, J., et al. 2004, ApJ, 605, L125 [Google Scholar]
  12. Beers, T. C., & Christlieb, N. 2005, ARA&A, 43, 531 [NASA ADS] [CrossRef] [Google Scholar]
  13. Beeson, K. L., Kos, J., de Grijs, R., et al. 2024, MNRAS, 529, 2483 [NASA ADS] [CrossRef] [Google Scholar]
  14. Belgiu, M., & Dragut, L. 2016, ISPRS J. Photogram. Rem. Sens., 114, 24 [NASA ADS] [CrossRef] [Google Scholar]
  15. Bellazzini, M., Massari, D., De Angeli, F., et al. 2023, A&A, 674, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Benitez, N., Dupke, R., Moles, M., et al. 2014, arXiv e-prints [arXiv:1403.5237] [Google Scholar]
  17. Bensby, T., Feltzing, S., & Lundström, I. 2014, A&A, 562, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Bonatto, C., Chies-Santos, A. L., Coelho, P. R. T., et al. 2019, A&A, 622, A179 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  19. Bonoli, S., Marín-Franch, A., Varela, J., et al. 2021, A&A, 653, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  20. Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977 [Google Scholar]
  21. Breiman, L. 2001, Mach. Learn., 45, 5 [Google Scholar]
  22. Buchhave, L. A., & Latham, D. W. 2015, ApJ, 808, 187 [Google Scholar]
  23. Buchhave, L. A., Bizzarro, M., Latham, D. W., et al. 2014, Nature, 509, 593 [Google Scholar]
  24. Buder, S., Gilmore, G., Harris, N.W. C., et al. 2018, MNRAS, 478, 151 [Google Scholar]
  25. Calamida, A., Bono, G., Stetson, P. B., et al. 2009, ApJ, 706, 1277 [NASA ADS] [CrossRef] [Google Scholar]
  26. Carraro, G., Geisler, D., Villanova, S., Frinchaboy, P. M., & Majewski, S. R. 2007, A&A, 476, 217 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Casagrande, L., Schönrich, R., Asplund, M., et al. 2011, A&A, 530, A138 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Cayrel de Strobel, G., Soubiran, C., & Ralite, N. 2001, A&A, 373, 159 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Cenarro, A. J., Ederoclite, A., Íñiguez, C., et al. 2018, SPIE Conf. Ser„ 10700, 107000D [NASA ADS] [Google Scholar]
  30. Cenarro, A. J., Moles, M., Cristóbal-Hornillos, D., et al. 2019, A&A, 622, A176 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  31. Chambers, K. C., Magnier, E. A., Metcalfe, N., et al. 2016, arXiv e-prints [arXiv:1612.05560] [Google Scholar]
  32. Chollet, F., & others. 2018, Keras: The Python Deep Learning library, Astrophysics Source Code Library [record ascl:1806.022] [Google Scholar]
  33. Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]
  34. Cutler, D. R., Edwards Jr., T. C., Beard, K. H., et al. 2007, Ecology, 88, 2783 [CrossRef] [Google Scholar]
  35. DESI Collaboration (Aghamousa, A., et al.) 2016, arXiv e-prints [arXiv:1611.00036] [Google Scholar]
  36. Dias, W. S., Monteiro, H., Moitinho, A., et al. 2021, MNRAS, 504, 356 [NASA ADS] [CrossRef] [Google Scholar]
  37. Eisenstein, D. J., Weinberg, D. H., Agol, E., et al. 2011, AJ, 142, 72 [Google Scholar]
  38. Fabiano de Souza, G., Westera, P., Almeida-Fernandes, F., et al. 2024, MNRAS, 527, 1733 [Google Scholar]
  39. Fallows, C. P., & Sanders, J. L. 2022, MNRAS, 516, 5521 [NASA ADS] [CrossRef] [Google Scholar]
  40. Fischer, D. A., & Valenti, J. 2005, ApJ, 622, 1102 [NASA ADS] [CrossRef] [Google Scholar]
  41. Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Gaia Collaboration (Brown, A. G. A., et al.) 2021, A&A, 649, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Gaia Collaboration (Recio-Blanco, A., et al.) 2023, A&A, 674, A38 [CrossRef] [EDP Sciences] [Google Scholar]
  44. Galarza, C. A., Daflon, S., Placco, V. M., et al. 2022, A&A, 657, A35 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Galleti, S., Federici, L., Bellazzini, M., Buzzoni, A., & Fusi Pecci, F. 2006, A&A, 456, 985 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  46. Geisler, D. 1984, PASP, 96, 723 [NASA ADS] [CrossRef] [Google Scholar]
  47. Giribaldi, R. E., Van Eck, S., Merle, T., et al. 2023, A&A, 679, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Gordon, K. D., Meixner, M., Meade, M. R., et al. 2011, AJ, 142, 102 [NASA ADS] [CrossRef] [Google Scholar]
  49. Gore, R., Giacalone, S., Dressing, C. D., et al. 2024, ApJS, 271, 48 [NASA ADS] [CrossRef] [Google Scholar]
  50. Gratton, R. G., Carretta, E., & Desidera, S. 2000, A&A, 354, 169 [NASA ADS] [Google Scholar]
  51. Gratton, R. G., Johnson, C. I., Lucatello, S., D’Orazi, V., & Pilachowski, C. 2011, A&A, 534, A72 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Gray, D. F. 2005, The Observation and Analysis of Stellar Photospheres [Google Scholar]
  53. Gruel, N., Moles, M., Varela, J., et al. 2012, SPIE Conf. Ser., 8448, 84481V [NASA ADS] [Google Scholar]
  54. Gutiérrez-Soto, L. A., Gonçalves, D. R., Akras, S., et al. 2020, A&A, 633, A123 [Google Scholar]
  55. Harris, W. E. 1996, AJ, 112, 1487 [Google Scholar]
  56. Hayden, M. R., Bovy, J., Holtzman, J. A., et al. 2015, ApJ, 808, 132 [Google Scholar]
  57. Hegedűs, V., Mészáros, S., Jofré, P., et al. 2023, A&A, 670, A107 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  58. Herpich, F. R., Almeida-Fernandes, F., Oliveira Schwarz, G. B., et al. 2024, A&A, 689, A249 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  59. Hogg, D. W., Casey, A. R., Ness, M., et al. 2016, ApJ, 833, 262 [NASA ADS] [CrossRef] [Google Scholar]
  60. Horta, D., Schiavon, R. P., Mackereth, J. T., et al. 2023, MNRAS, 520, 5671 [NASA ADS] [CrossRef] [Google Scholar]
  61. Howell, S. B., Sobeck, C., Haas, M., et al. 2014, PASP, 126, 398 [Google Scholar]
  62. Huber, D., Chaplin, W. J., Christensen-Dalsgaard, J., et al. 2013, ApJ, 767, 127 [Google Scholar]
  63. Hunt, E. L., & Reffert, S. 2023, A&A, 673, A114 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  64. Jönsson, H., Holtzman, J. A., Allende Prieto, C., et al. 2020, AJ, 160, 120 [Google Scholar]
  65. Keller, S. C., Schmidt, B. P., Bessell, M. S., et al. 2007, PASA, 24, 1 [NASA ADS] [CrossRef] [Google Scholar]
  66. Kochanek, C. S., Shappee, B. J., Stanek, K. Z., et al. 2017, PASP, 129, 104502 [Google Scholar]
  67. Kolmogorov, A. 1933, Sulla determinazione empirica di una legge di Distribuzione, 4 (Giornale dell’Istituto Italiano degli Attuari), 83 [Google Scholar]
  68. Kraft, R. P. 1994, PASP, 106, 553 [Google Scholar]
  69. Ksoll, V. F., Ardizzone, L., Klessen, R., et al. 2020, MNRAS, 499, 5447 [NASA ADS] [CrossRef] [Google Scholar]
  70. Kurucz, R. L. 1979, ApJS, 40, 1 [NASA ADS] [CrossRef] [Google Scholar]
  71. Lawrence, A., Warren, S. J., Almaini, O., et al. 2007, MNRAS, 379, 1599 [Google Scholar]
  72. LeCun, Y., Bengio, Y., & Hinton, G. 2015, Nature, 521, 436 [Google Scholar]
  73. Lianou, S., Grebel, E. K., & Koch, A. 2011, A&A, 531, A152 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  74. Magrini, L., Smiljanic, R., Franciosini, E., et al. 2021, A&A, 655, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Majewski, S. R., Ostheimer, J. C., Kunkel, W. E., & Patterson, R. J. 2000, AJ, 120, 2550 [NASA ADS] [CrossRef] [Google Scholar]
  76. Majewski, S. R., APOGEE Team & APOGEE-2 Team. 2016, Astron. Nachr., 337, 863 [NASA ADS] [CrossRef] [Google Scholar]
  77. Marín-Franch, A., Chueca, S., Moles, M., et al. 2012, SPIE Conf. Ser., 8450, 84503S [Google Scholar]
  78. Massey, F. J. 1951, J. Am. Statist. Assoc., 46, 68 [CrossRef] [Google Scholar]
  79. McWilliam, A. 1997, ARA&A, 35, 503 [NASA ADS] [CrossRef] [Google Scholar]
  80. Mendes de Oliveira, C., Ribeiro, T., Schoenell, W., et al. 2019, MNRAS, 489, 241 [NASA ADS] [CrossRef] [Google Scholar]
  81. Mikolaitis, Š., de Laverny, P., Recio-Blanco, A., et al. 2017, A&A, 600, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  82. Ness, E. K., Gilmore, G., G., T. J. M. M. R. A., et al. 2015, MNRAS, 449, 494 [NASA ADS] [CrossRef] [Google Scholar]
  83. Nidever, D. L., Bovy, J., Bird, J. C., et al. 2014, ApJ, 796, 38 [Google Scholar]
  84. Nissen, P. E., Chen, Y. Q., Carigi, L., Schuster, W. J., & Zhao, G. 2014, A&A, 568, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  85. Nomoto, K., Kobayashi, C., & Tominaga, N. 2013, ARA&A, 51, 457 [CrossRef] [Google Scholar]
  86. O’Malley, E. M., & Chaboyer, B. 2018, ApJ, 856, 130 [CrossRef] [Google Scholar]
  87. Payne, C. H. 1925, PhD thesis, Radcliffe College, USA [Google Scholar]
  88. Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, J. Mach. Learn. Res., 12, 2825 [Google Scholar]
  89. Perottoni, H. D., Placco, V. M., Almeida-Fernandes, F., et al. 2024, A&A, 691, A138 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  90. Pietrinferni, A., Cassisi, S., Salaris, M., & Castelli, F. 2004, ApJ, 612, 168 [Google Scholar]
  91. Placco, V. M., Roederer, I. U., Lee, Y. S., et al. 2021, ApJ, 912, L32 [NASA ADS] [CrossRef] [Google Scholar]
  92. Placco, V. M., Almeida-Fernandes, F., Arentsen, A., et al. 2022, ApJS, 262, 8 [CrossRef] [Google Scholar]
  93. Placco, V. M., Almeida-Fernandes, F., Holmbeck, E. M., et al. 2023, ApJ, 959, 60 [NASA ADS] [CrossRef] [Google Scholar]
  94. Quispe-Huaynasi, F., Roig, F., Placco, V. M., et al. 2024, MNRAS, 527, 6173 [Google Scholar]
  95. Recio-Blanco, A., de Laverny, P., Palicio, P. A., et al. 2023, A&A, 674, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  96. Ricker, G. R., Winn, J. N., Vanderspek, R., et al. 2015, J. Astron. Telesc. Instrum. Syst., 1, 014003 [Google Scholar]
  97. Rojas-Arriagada, A., Zoccali, M., Schultheis, M., et al. 2019, A&A, 626, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  98. Rossi, M., Romano, D., Mucciarelli, A., et al. 2024, A&A, 691, A284 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Salaris, M., & Cassisi, S. 2002, Evolution of Stars and Stellar Populations (Wiley) [Google Scholar]
  100. Salaris, M., Chieffi, A., & Straniero, O. 1993, ApJ, 414, 580 [NASA ADS] [CrossRef] [Google Scholar]
  101. Schlafly, E. F., & Finkbeiner, D. P. 2011, ApJ, 737, 103 [Google Scholar]
  102. Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
  103. Schonhut-Stasik, J., Zinn, J. C., Stassun, K. G., et al. 2024, AJ, 167, 50 [NASA ADS] [CrossRef] [Google Scholar]
  104. Schuster, W. J., Moitinho, A., & Santos, J. F. C. 2004, AJ, 128, 2858 [NASA ADS] [Google Scholar]
  105. Sheinis, A., Anguiano, B., Asplund, M., et al. 2015, J. Astron. Telesc. Instrum. Syst., 1, 035002 [NASA ADS] [CrossRef] [Google Scholar]
  106. Snaith, O. N., Haywood, M., Di Matteo, P., et al. 2014, ApJ, 781, L31 [CrossRef] [Google Scholar]
  107. Starkenburg, E., Martin, N., Youakim, K., et al. 2017, MNRAS, 471, 2587 [NASA ADS] [CrossRef] [Google Scholar]
  108. Stassun, K. G., Oelkers, R. J., Paegert, M., et al. 2019, AJ, 158, 138 [Google Scholar]
  109. Thomas, D., Maraston, C., & Bender, R. 2005, ApJ, 621, 673 [NASA ADS] [CrossRef] [Google Scholar]
  110. Thomas, G. F., Annau, N., McConnachie, A., et al. 2019, ApJ, 886, 10 [NASA ADS] [CrossRef] [Google Scholar]
  111. Tinsley, B. M. 1979, ApJ, 229, 1046 [Google Scholar]
  112. Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978 [NASA ADS] [CrossRef] [Google Scholar]
  113. Wheeler, A., Ness, M., Buder, S., et al. 2020, ApJ, 898, 58 [Google Scholar]
  114. Whitten, D. D., Placco, V. M., Beers, T. C., et al. 2019, A&A, 622, A182 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  115. Whitten, D. D., Placco, V. M., Beers, T. C., et al. 2021, ApJ, 912, 147 [NASA ADS] [CrossRef] [Google Scholar]
  116. Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001 [NASA ADS] [CrossRef] [Google Scholar]
  117. Woosley, S. E., & Weaver, T. A. 1995, ApJS, 101, 181 [Google Scholar]
  118. Yang, L., Yuan, H., Xiang, M., et al. 2022, A&A, 659, A181 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  119. Yates, R. M., Kauffmann, G., & Guo, Q. 2012, MNRAS, 422, 215 [NASA ADS] [CrossRef] [Google Scholar]
  120. York, D. G., Adelman, J., Anderson, John E., J., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]
  121. Zasowski, G., Johnson, J. A., Frinchaboy, P. M., et al. 2013, AJ, 146, 81 [NASA ADS] [CrossRef] [Google Scholar]
  122. Zwitter, T., Siebert, A., Munari, U., et al. 2008, ApJ, 136, 421 [Google Scholar]

All Tables

Table 1

Summary of training datasets used for neural network (NN) and random forest (RF) models.

Table 2

Summary of the training datasets categorized by [Fe/H] intervals.

Table 3

Summary of the number of giant and dwarf stars with computed stellar parameters or abundances.

Table 4

Values of the fitted coefficients from Eq. (2).

Table 5

Standard deviation values for the comparison of cross-matched S-PLUS, J-PLUS, and GALAH samples for giant and dwarf stars.

All Figures

thumbnail Fig. 1

Sky footprints of S-PLUS DR4, APOGEE, GALAH DR3, and LAMOST MRS DR8 surveys across the celestial sphere. Each survey’s distinctive coverage and observed regions are depicted, along with their overlapping regions.

In the text
thumbnail Fig. 2

S-PLUS photometry and SDSS spectra of a dwarf star (top panel) and a giant star (bottom panel). The colored symbols represent S-PLUS photometry in flux units for the following filters: u, J0378, J0395, J0410, J0430, g, J0515, r, J0660, i, J0861, and z (from left to right). Square symbols denote broad-band filters, while circular symbols indicate narrowband filters. The filter responses are shown accordingly.

In the text
thumbnail Fig. 3

Corner plot for the APOGEE, LAMOST, and GALAH training sets, that coincide with S-PLUS data, indicated in green, red, and blue, respectively, showing the distribution of Teff, log g, [Fe/H], and [α/X], where X denotes [M] for LAMOST and APOGEE data, and [Fe] for GALAH data. The histograms along the leading diagonal illustrate the parameter distributions for Teff, log g, [Fe/H], and [α/X] across the three trainning datasets.

In the text
thumbnail Fig. 4

Density plots illustrating the distribution of log g, Teff , and different elemental abundances of the APOGEE sample used in the training. In the top-right corner of each panel the number of sources (N) in each distribution is shown. A randomly selected sample of 100 sources for dwarfs, giants, and supergiants is scatter plotted to avoid crowding.

In the text
thumbnail Fig. 5

Comparative analysis of RF and NN goodness-of-fit across diverse parameters in the APOGEE, GALAH, and LAMOST MRS Surveys. These results are based on models constructed using only S-PLUS color information. Only parameters with goodness-of-fit exceeding 50% are included.

In the text
thumbnail Fig. 6

Ten primary features utilized for prediction are arranged in decreasing order of their importance in the RF model. At the top of each diagram, the star type (giant or dwarf), data sample (LAMOST, APOGEE, or GALAH), and the spectroscopic parameter is specified.

In the text
thumbnail Fig. 7

Results of the testing set for various astrophysical parameters (Teff, log g, [Fe/H], α/M, [Mg/Fe]) obtained in this study using the NN model. Each plot features a one-to-one correlation between the parameters obtained from APOGEE, GALAH, and LAMOST surveys and those predicted by the NN, with identity line included for reference. Blue and red dots correspond to dwarfs and giants, respectively. For both dwarfs and giants, the colorbar indicates the Teff. The size of the symbols represents the value of log ɡ.

In the text
thumbnail Fig. 8

Density plot depicting the pairwise distribution of different atmospheric parameters and chemical abundances for a total of ~ 140 000 giant stars predicted in this study, based on the APOGEE based trainning set. The title of each plot indicates the exact number of stars plotted, while the color bar represents the density distribution.

In the text
thumbnail Fig. 9

Spatial distribution of Milky Way giant stars and their various chemical abundances calculated in this study. In each plot, radial distance (R) in kpc from the center of the Milky Way is plotted along the abscissa, and the Galactic vertical height (z) is plotted along the ordinate. The colorbar represents the range of various calculated chemical abundances obtained in this work, illustrating how these abundances vary across different Galactic positions. The title of each plot indicates the training data set used to calculate the respective chemical abundance and the number of giant stars plotted.

In the text
thumbnail Fig. 10

Comparison of metallicity ([Fe/H]) values from the literature (x-axis) with the estimations obtained in this work (y-axis) for the selected star clusters.

In the text
thumbnail Fig. 11

Sky plots for globular clusters NGC 104 (left panel) and NGC 3201 (right panel). The color bar in each plot indicates the [Fe/H] values determined in this study for each member star of the cluster, utilizing the GALAH data as training set.

In the text
thumbnail Fig. 12

Comparison of [M/H] vs. [Fe/H] for dwarf stars (top panels) and giant stars (bottom panels). The color gradient represents [α/M], while the dashed and solid grey lines depict the linear fits and one-to-one relationship, respectively.

In the text
thumbnail Fig. 13

Empirical cumulative distribution functions (ECDFs) comparing predicted [Fe/H], Teff, and log g values from the S-PLUS (this work) with the predicted values of J-PLUS, and spectroscopic values from GALAH for 186 dwarf stars. The dotted lines represent the absolute differences between the ECDFs of the survey pairs, and the dashed horizontal lines indicate the Kolmogorov-Smirnov (KS) statistic for each pair of surveys, with lower KS values meaning closer agreement between the respective surveys.

In the text
thumbnail Fig. 14

Comparison of the pairwise differences between the predicted values from the S-PLUS (this work), J-PLUS and spectroscopic values from GALAH for [Fe/H], Teff, and log g, for 186 dwarf stars. The dashed-dotted lines represent mean value of each distribution. The 1σ value of the difference of each pair of surveys are displayed in the legend.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.