Free Access
Issue
A&A
Volume 659, March 2022
Article Number A181
Number of page(s) 27
Section Catalogs and data
DOI https://doi.org/10.1051/0004-6361/202142724
Published online 25 March 2022

© ESO 2022

1. Introduction

Precise determinations of basic stellar parameters and elemental abundances (hereafter referred to as stellar labels) play a fundamental role in a number of fields, including stellar physics, Galactic structure, the formation and chemical evolution of the Galaxy, and the distribution and properties of dust in the Galaxy. Stellar labels can be determined both spectroscopically and photometrically. These approaches are broadly complementary, each having advantages and disadvantages.

Currently, a number of large-scale photometric surveys, for example, the SkyMapper Southern Survey (SMSS): DR1.1 – Wolf et al. (2018), DR2 – Onken et al. (2019), the Stellar Abundance and Galactic Evolution (SAGE): Zheng et al. (2018), the Javalambre Physics of the accelerating universe Astrophysical Survey (J-PAS): Benitez et al. (2014), J-PLUS: Cenarro et al. (2019), and the Southern Photometric Local Universe Survey (S-PLUS): Mendes de Oliveira et al. (2019) are producing huge amounts of valuable photometric data for tens of millions of astronomical objects. The medium- and narrow-band filters of these photometric surveys are designed and optimized for precision measurements of key stellar features, opening up a new era of precise and accurate stellar label determinations (see, e.g., Bailer-Jones 2002; Árnadóttir et al. 2010).

A number of different empirical and theoretical approaches have been developed to determine stellar labels from the use of photometric data. Thanks to the modern large-scale spectroscopic surveys, such as the Sloan Extension for Galactic Understanding and Exploration (SEGUE) (Yanny et al. 2009), Large sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST) (Deng et al. 2012; Liu et al. 2014), and Apache Point Observatory for Galactic Evolution Experiment (APOGEE) (Majewski et al. 2017), and their precise estimates of stellar parameters, advanced (high-quality, large sample size, and good coverage in parameter space) training and calibration datasets are available for inferring stellar labels from photometry. These approaches are now capable of delivering photometric stellar labels with comparable precision to spectroscopy for high-quality photometry. Using a tool based on empirical metallicity-dependent stellar loci, Yuan et al. (2015b,c) estimated photometric metallicites for a half million FGK dwarf stars in Stripe 82, with a typical error of δ [Fe/H] ∼ 0.1 to 0.2 dex. Later, Zhang et al. (2021) obtained metallicity-dependent stellar loci for red-giant stars, which were then used to derive metallicities of giants to a precision of δ [Fe/H] ∼ 0.20 to 0.25 dex and discriminate metal-poor red giants from main-sequence stars based on SDSS photometry.

From corrected broad-band Gaia Early Data Release (EDR3) colours alone (Niu et al. 2021b; Yang et al. 2021), Xu et al. (2022) determined reliable metallicity estimates for a magnitude-limited sample of about 27 million stars down to [Fe/H] = −2.5. Considering that the specially designed SkyMapper filters uvgriz of the SMSS are more sensitive to stellar atmospheric parameters than the SDSS filters, Huang et al. (2019) used different polynomials to build empirical relations between atmospheric parameters and photometric colours for red-giant stars, and derived accurate atmospheric parameters (e.g., effective temperature, Teff, surface gravity, log g, and metallicity, [Fe/H]) for about one million red-giant stars from SMSS DR1.1. Chiti et al. (2020) described a grid-based synthetic photometry approach, which was employed by Chiti et al. (2021) to obtain photometric metallicities for over a quarter of a million giants from SMSS DR2. With the recalibrated SMSS DR2 and Gaia EDR3, Huang et al. (2022) further determined metallicities for over 24 million stars with a technique similar to the metallicity-dependent stellar loci. Thanks to the strong metallicity sensitivity of the SMSS uv filters, the correction of systematic calibration errors in SMSS DR2 and the use of well-selected training datasets, the achieved precision is comparable to or slightly better than that derived from spectroscopy for stars with a metallicity as low as [Fe/H] ∼ −3.5.

In addition to empirical metallicity-dependent stellar loci, machine learning methods such as the random forest algorithm (Miller et al. 2015; Bai et al. 2019; Galarza et al. 2022), Bayesian inference (Bailer-Jones 2011), and artificial neural networks (ANN; Whitten et al. 2019; Ksoll et al. 2020; Whitten et al. 2021), have been suggested as effective ways to derive precise atmospheric parameters from photometric colours. In particular, ANNs that build an explicit function to map photometric colours to stellar parameters have become a popular tool for estimating atmospheric parameters and elemental abundances. With the J-PLUS photometry, Whitten et al. (2019) proposed the Stellar Photometric Index Network Explorer (SPHINX), a network of ANNs, to derive Teff and [Fe/H] over the range of 4500 K < Teff < 6200 K, and obtain [Fe/H] down to about −3.0 in J-PLUS DR11, with a typical scatter of δ [Fe/H] = 0.25 dex. Later, Whitten et al. (2021) extended their ANN approach to estimate [C/Fe] to a precision of ∼0.35 dex for SDSS Stripe 82 stars contained in S-PLUS DR2 (Almeida-Fernandes et al. 2022).

The J-PLUS narrow-band (∼100 Å) filters are centered on key stellar absorption features; J0378 for the CN band, J0395 for Ca II H+K, J0410 for Hδ, J0430 for the CH G-band, J0515 for the Mg b triplet, J0660 for Hα, and J0861 for the Ca triplet. Such specially designed filters make it possible to not only determine the basic stellar atmospheric parameters (Teff, log g, and [Fe/H]), but also to constrain [α/Fe] and elemental abundances such as [C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]. Motivated by the above possibilities and their high scientific impact, we developed a cost-sensitive set of neural networks, CSNet, to derive precise and robust stellar labels (Teff, log g, [Fe/H], [C/Fe], [N/Fe], [Mg/Fe], [Ca/Fe], and [α/Fe]) for stars in J-PLUS DR1, adopting J-PLUS stars in common with LAMOST and Gaia as training sets. Our models improve the prediction accuracy on the whole by increasing the error penalty for the relatively rare samples of extreme stars.

The paper is organized as follows: Sect. 2 describes the data used in this work. Section 3 introduces the framework of the proposed method in detail. Section 4 reports the results for the training and the testing samples, as well as comparisons with various validation samples. The resulting catalog of stellar parameters and chemical abundances for stars in J-PLUS DR1 are also presented in Sect. 4. Section 5 discusses some challenges associated with this work, followed by a summary in Sect. 6.

2. Data

The dataset used for training and testing CSNet is constructed by stars in common between J-PLUS DR1 (Cenarro et al. 2019), Gaia DR2 (Gaia Collaboration 2018), and LAMOST DR52 (Luo et al. 2015; Xiang et al. 2019), where the first two datasets provide input stellar colours and the last one provides stellar labels (Teff, log g, [Fe/H], [C/Fe], [N/Fe], [Mg/Fe], [Ca/Fe] and [α/Fe]).

2.1. Stellar colours

J-PLUS3 is being conducted from the Observatorio Astrofísico de Javalambre (OAJ, Teruel, Spain; Cenarro et al. 2014) using the 83 cm Javalambre Auxiliary Survey Telescope (JAST80) and T80Cam, a panoramic camera of 9.2k × 9.2k pixels that provides a 2 deg2 field of view (FoV) with a pixel scale of 0.55 arsec pix−1 (Marin-Franch et al. 2015). Its unique combination of five broad-band (ugriz), and seven medium- and narrow-band filters (J0378, J0395, J0410, J0430, J0515, J0660, J0861), optimally designed to extract the rest-frame spectral features (including the Balmer jump region, which covers the molecular CN feature, Ca II H+K, Hδ, the CH G-band, the Mg b triplet, Hα, and the Ca triplet) plays a key role in characterizing stellar types and elemental abundances in this work. The J-PLUS observational strategy, image reduction, and main scientific goals are presented in Cenarro et al. (2019).

J-PLUS DR1 covers 1022 deg2 with 511 pointings in its footprint observed from November 2015 to January 2018 (Cenarro et al. 2019). The 5σ limiting magnitudes (3″ aperture) in the filters reach a limit of about 21. Using a stellar locus method, López-Sanjuan et al. (2019) have reached a calibration accuracy of 1–2 per cent, larger for bluer filters. Using the stellar colour regression method (SCR; Yuan et al. 2015a, Huang et al. 2021, Niu et al. 2021a,b, Huang & Yuan 2022) and by combining the LAMOST DR5 spectroscopy and Gaia DR2 photometry, we recalibrated the J-PLUS DR1 data and achieved an accuracy of about 0.5 per cent or better for all the J-PLUS filters (Yuan et al., in prep.). Strongly correlated calibration errors are found in the previous calibration due to the strong metallicity-dependence of stellar loci for blue filters (e.g., Yuan et al. 2015b, López-Sanjuan et al. 2021) and errors in the 3D dust map and reddening coefficients used (Yuan et al., in prep.). In order to make the best use of the full power of J-PLUS filters, the recalibrated J-PLUS DR1 data were used in this work. The same data were also used in star, galaxy, or quasar classifications (Wang et al. 2022), as well as for estimates of basic stellar parameters (Wang et al., in prep.), with a support vector machine technique.

Gaia DR2 (Gaia Collaboration 2018) delivers five-parameter astrometric solutions as well as integrated photometry in three very broad bands: G, BP (330–680 nm), and RP (630–1050 nm), for 1.4 billion sources with G < 21. It provides not only the best astrometric data ever obtained, but also the most precise photometric data. The typical uncertainties in Gaia DR2 measurements at G = 17 are ∼2 mmag in the G-band photometry, and ∼10 mmag in BP and RP magnitudes (Gaia Collaboration 2018).

With the additional BP and RP magnitudes from Gaia DR2, a total of 13 stellar colours are computed and used in this work, as listed in Table 1. These combinations of colours are believed to contain all the pertinent stellar parameter and elemental-abundance information hidden in the J-PLUS photometry. Reddening corrections have been applied to these colours with empirical reddening coefficients determined using the star-pair technique (Yuan et al. 2013, and in prep.) and E(B − V) reddening values from the SFD98 map (Schlegel et al. 1998). The coefficients are also listed in Table 1.

Table 1.

Empirical reddening coefficients for stellar colours.

The input colours for CSNet are constructed by cross-matching J-PLUS DR1 with Gaia DR2, adopting a matching radius of 1.0 arcsec. Considering that the 6 arcsec aperture measurements of J-PLUS DR1 were used, we required stars to satisfy the photo_bp_rp_excess_factor ≤ 1.25 + 0.06(BP − RP)2, which is slightly stricter than that suggested by Evans et al. (2018), to avoid possible contaminations from nearby sources. To ensure the quality of the photometry, we required stars to satisfy FLAGS = 0. For the training and the testing samples, we further required photometric uncertainties of the 12 J-PLUS filters be lower than 0.01 mag for the griz filters, 0.02 mag for the J0410, J0430 and J0515 filters, and 0.03 mag for the J0378, J0395, and J0410 filters, respectively. With the above constraints, the G magnitude ranges over 11.6–16.7 for the training and the testing samples.

2.2. LAMOST stellar labels

LAMOST (Cui et al. 2012) collects low-resolution (R ∼ 1800) and medium-resolution (R ∼ 7500) spectra in a FoV of 20 deg2. In its fifth data release, LAMOST DR5, this survey has delivered more than 8 million stellar spectra with a spectral resolution R = 1800 and limiting magnitude of r∼ 17.8 (Deng et al. 2012; Liu et al. 2014). Stellar effective temperatures, Teff, surface gravities, log g, and metallicities, [Fe/H], are derived by the LAMOST Stellar Parameter Pipeline (LASP; Wu et al. 2011).

For the elemental abundances [C/Fe], [N/Fe], [Mg/Fe], [Ca/Fe], and [α/Fe], the data-driven Payne (the DD-Payne) results of Xiang et al. (2019) are used. The DD-Payne inherits essential ingredients from both the Payne (Ting et al. 2019) and the Cannon (Ness et al. 2015), and incorporates constraints from theoretical model spectra to ensure physically meaningful abundance estimates. Stars in common between LAMOST DR5 and either Galactic Archaeology with High Eciency and Resolution Multi-Element Spectrograph (GALAH) DR2 (Buder et al. 2018) or APOGEE DR14 (Holtzman et al. 2018) were used as the training dataset to provide abundances4 for 16 elements (C, N, O, Na, Mg, Al, Si, Ca, Ti, Cr, Mn, Fe, Co, Ni, Cu, and Ba). For stars with spectral signal to noise ratios S/Ng higher than 50, the typical internal uncertainties of the estimated abundances are about 0.05 dex for [Fe/H], [Mg/Fe], and [Ca/Fe], 0.1dex for [C/Fe] and [N/Fe]. The [α/Fe] of this catalog was defined as a weighted mean of [Mg/Fe], [Ca/Fe], [Ti/Fe], and [Si/Fe]. We note that for the elemental abundances used in this work, [Mg/Fe] was trained using GALAH DR2, while the others used APOGEE DR14.

2.3. Experimental data construction

The experimental dataset for training and testing CSNet consists of 67 709 stars within certain constraints for the photo_bp_rp_excess_factor and the photometric uncertainties (see Sect. 2.1 for more details). In particular, eight CSNets with the same structure and hyper-parameters are trained: three for basic stellar atmospheric parameters and five for elemental abundances. Considering the density distribution of the experimental dataset in the label space, we selected the experimental stars with extra criteria shown in Table 2. These stars were divided into the training set and the testing set in the ratio of 3:1. Figure 1 shows the distributions of the training and the testing sets in the planes of Teff–log g, Teff–[Fe/H], [BPRP]–G, [α/Fe]–[Fe/H], [C/Fe]–[Fe/H], [N/Fe]–[Fe/H], [Mg/Fe]–[Fe/H], [Ca/Fe]–[Fe/H], [C/N]–[Fe/H], [C/α]–[Fe/H], [Mg/α]–[Fe/H], and [Ca/α]–[Fe/H]. We can see that both the training set and the testing set have wide coverages in temperature and surface gravity: 4000 K < Teff < 7500 K and 0.0 < log g < 5.0. Panel g of Fig. 1 shows that stars with reference [Mg/Fe] values have metallicities in the −1.0< [Fe/H] <  + 0.5 range. Since the training and the testing sets were randomly selected from the original (trimmed) dataset, both have similar distributions. We note that panel a of Fig. 1 shows no stars with log g < 2.0 in the testing set. This is simply due to the 10 per cent random selection procedure applied to the data.

thumbnail Fig. 1.

Distributions of the CSNet training and the testing samples in the planes of Teff–log g, Teff–[Fe/H], [BPRP]–G, [α/Fe]–[Fe/H], [C/Fe]–[Fe/H], [N/Fe]–[Fe/H], [Mg/Fe]–[Fe/H], [Ca/Fe]–[Fe/H], [C/N]–[Fe/H], [C/α]–[Fe/H], [Mg/α]–[Fe/H], and [Ca/α]–[Fe/H], from top to bottom and left to right. The black and red dots represent the training set and testing set stars, respectively. To avoid crowding, only 10 per cent of the selected stars are plotted.

Table 2.

Adopted constraints on the datasets for training and testing CSNet.

Table 3.

Description for the J-PLUS CSNet stellar parameters and chemical abundances catalog (1).

3. Method: the cost-sensitive ANN

Considering the nature of the photometric data from J-PLUS DR1, we developed CSNet, a combination of a traditional ANN architecture and a novel 2D, cost-sensitive learning algorithm to estimate stellar parameters and chemical abundances. Training the neural network was performed by the cost-sensitive learning algorithm in order to achieve better measurement precision.

3.1. Data normalization

Input variables with the same scale are the basis for training the robust CSNet. Thus, stellar colours are standardized before entering the network by the following z-score normalization:

(1)

where x and x′ are the original and standardized input vectors, respectively, with the 13 stellar colours, respectively, while μ and σ are the mean and standard deviation of all the original input vectors, respectively.

3.2. The ANN

To balance the under-fitting and over-fitting during the training process, after several experiments, the appropriate architecture for the multilayered feed-forward network in this work is 13-300-200-100-1 for estimating each stellar label, consisting of three hidden layers to extract deep features from the photometry with the 13 stellar colours. The model structure is illustrated in Fig. 2.

thumbnail Fig. 2.

ANN structure of CSNet. The structure consists of a scalable, five-layered feed-forward network. The input layer imports the 13 stellar colours transformed by J-PLUS DR1 and Gaia DR2 after extinction correction as the basic features. The following three hidden layers with activation functions extract the deep nonlinear features of the stellar colours. The output layer provides a stellar label with a weighted sum of the learned features in the last hidden layer.

Each node calculates a weighted sum of all its input values and produces an output value with a nonlinear activation function. This process can be described as follows:

(2)

(3)

where Xl and Hl represent the input vector and output vector of the lth layer, respectively, w is the weight vector, b represents the bias terms, and g(•) represents the LeakyReLU activation function with negative slope coefficient α = 0.01 in the hidden layer to solve “Dead Neuron” problems.

Then, adjustable parameters (including w and b) of the ANN can be calculated through minimizing the following cost:

(4)

where denotes the corresponding known stellar labels.

However, the over-fitting problem is more likely to occur when a large number of neurons are employed to extract deep features from the input data, which means that the accuracy of the prediction in the testing set decreases as the performance of the model in the training set improves. L2 regularization, dropout layers, and batch normalization layers are all effective to mitigate this problem. In this work, an L2 regularized term is added to the cost function above as follows:

(5)

where λ ∈ (0, 1) is the trade-off coefficient between the residual error and the regularized term.

The back-propagation learning algorithm with adaptive moment estimation (ADAM) is utilized to minimize the cost function Eq. (5) effectively. Let θ = (w, b) be parameters of the model. Then, the change of parameter θ becomes:

(6)

where η is the step size, t is the number of current iterations, and α ∈ [0, 1] is the decay rate of the previous weight change.

The process of obtaining the resulting parameters θ can be described as follows (see Kingma & Ba 2014):

Initialize: as the step size, β1, β2 ∈ (0, 1) as the decay rates for the moment estimate, ϵ > 0, C(θ(t)) as a convex differentiable cost function, θ0 as an initial parameter vector, m0 and v0 as an initial first- and second-moment moment vector, respectively, and t = 0 as the initial time step.

while: θ(t) not converged do:

t = t + 1

gt = ∇θC(θ(t − 1))

mt = β1mt − 1 + (1 − β1)gt

end

return θ(t)

Next, a mapping from the input stellar colours to the output stellar labels is established with definite parameters θ = (w, b).

3.3. Modifications of the ANN

When the training set exhibits an unbalanced distribution in the population, the most frequent cases dominate the predicted values. Namely, the predictions will have a systematic trend toward the coverage of the greatest number of target values. Oommen et al. (2011) showed that techniques reducing the sampling bias in the target value space could improve the prediction precision for regression tasks.

In this paper, we modify the cost function in the ANN that takes the rare target values into account. Given an input dataset X with continuous numeric label pairs (Y1, Y2), a frequency distribution histogram was generated with M * N bins in Y1Y2 space, where Y1 represents one of the given stellar labels and Y2 represents the [Fe/H] value. Next, different costs were computed for samples in different bins of the histogram with the following rule:

(7)

where xi ∈ X is a sample, fn(•) is a function to calculate the frequency of samples in a bin which includes in the histogram, and γ > 0 controls the difference degree of the cost among different bins.

Then, the cost function of ANN in Eq. (5) is changed to

(8)

This favors labels of the minority cases with higher expected error costs.

4. Results

To demonstrate the accuracy and reliability of the results from CSNet, we performed extensive experiments examining the estimated stellar labels from different aspects. After introducing the experimental setting, we trained and tested CSNet with the constructed dataset (see Sect. 2.3) and compared the measurements on the stars in common with other precision survey catalogs. Then, we applied this model to 4 387 568 stars (MAGABDUALOBJ_CLASS_STAR ≥ 0.6) selected by cross-matching J-PLUS DR1 with Gaia DR2.

4.1. Experimental setting

The experiments were performed with Keras 2.1.4, Tensorflow 1.5.0, CUDA 9, and cuDNN 7. ADAM was used as the optimizer because of its good robustness regarding the initial learning rate. Before training CSNet, there are several hyper-parameters that were set manually: s is the batch size of the training samples, epoch defines the training iterations, η is the learning rate, M * N is the number of bins in the training set frequency distribution histogram, γ is the exponent of the weight assignment function Eq. (7), λ is the penalty coefficient of the cost function Eq. (8), and the parameters are β1, β2, ϵ for ADAM. Large s occupying the GPU memory speeds the convergence, small η consuming more time helps find the optimal value, proper γ ensures similar distribution between the training and testing set, λ is positively associated with the typical value of the loss function, and other hyper-parameters (β1, β2, ϵ) in ADAM can use its default values. Through experimentation, we found that reasonable default settings for the stellar label determination are s = 2000, epoch = 5000, η = 10−4, γ = 0.5, λ = 5 * 10−5, β1 = 0.9, β2 = 0.999, and ϵ = 10−8. According to the range of the expected stellar labels, recommended bins (M * N) are (20 * 1), (20 * 1), (20 * 1), (20 * 60), (28 * 60), (35 * 50), (20 * 40), and (25 * 60) for Teff, log g, [Fe/H], [α/Fe], [C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe], respectively.

4.2. Performance

We restricted the application of CSNet to stars within the same [BPRP] coverage as the training set. Furthermore, only stellar label estimates located in the same range as the training set (see Table 2) are considered to be reliable, as we are cautious to extrapolate, because of a well-known limitation of ANN-based approaches. We evaluated the performance of these labels on the training, testing, and validation samples.

4.2.1. Parameter determination on the training and the testing sets

To assess the accuracy of stellar labels derived from our model, we first compare the predictions from the model and their corresponding LAMOST labels in both the training and the testing sets.

For the stellar parameters (Teff, log g, and [Fe/H]), Fig. 3 plots distributions of the CSNet results in the plane of Teff–log g (panel a) and residuals for dwarfs and giants as a function of effective temperature (panels b, c, and e), surface gravity (panel d), and metallicity (panel f). Giant stars (BP − RP > 0.95 and MG < 3.9) and dwarf stars (BP − RP ≤ 0.95 or MG ≥ 3.9) are distinguished in the colour-magnitude diagram. The top two panels show that giants and dwarfs are also clearly distinguished in the plane of Teff–log g. Errors of the predicted stellar labels are evaluated by the mean deviation (“bias”) and 1σ uncertainties, which are estimated using Gaussian fits. There is no significant bias in the training and the testing sets between the CSNet results and the LAMOST values, even in regions with small numbers of training stars, demonstrating that the trained model is robust to the small sample field and avoids over-fitting. The uncertainties of the residuals are δTeff ∼ 55 K, δlog g ∼ 0.15 dex, and δ [Fe/H] ∼ 0.07 dex, respectively.

thumbnail Fig. 3.

Training set (left) and testing set (right) for Teff, log g, and [Fe/H] for the CSNet results. The black and red dots represent dwarfs and giants, respectively. Error bars, coloured by blue and green, for dwarfs and giants, respectively, denote the mean value “bias” and 1σ uncertainty of the residuals estimated using Gaussian fits.

Similarly, for the elemental abundances, Fig. 4 indicates that the results are commensurate with the reference values in both the training and the testing sets. The uncertainties of the residuals are δ [α/Fe] = 0.03 dex, δ [C/Fe] = 0.04 dex, δ [N/Fe] = 0.08 dex, δ [Mg/Fe] = 0.05 dex, and δ [Ca/Fe] = 0.05 dex, respectively. The low levels of bias and uncertainty between the CSNet predictions and LAMOST DD-Payne values suggest that the trained model performs well on abundance determinations over the validity range.

thumbnail Fig. 4.

Similar to Fig. 3, but here we show residuals for the CSNet elemental abundances as a function of effective temperature.

4.2.2. Comparisons with other surveys

To examine the reliability of the CSNet results, we select stars in common between the J-PLUS DR1 and other three reference catalogs including precise stellar parameters and elemental abundances derived from their available spectra. The above three cross-matched catalogs will be introduced one by one.

The first one is the APOGEE–Payne catalog. APOGEE (Majewski et al. 2017) is a high-resolution (R ∼ 22 500), high signal-to-noise ratio (S/N > 100), near-infrared (1.51–1.70 μm) spectroscopic survey. Ting et al. (2019) provided accurate stellar parameters and abundances derived from the APOGEE DR14 spectra with the neural-network-based method, the Payne. We cross-matched the J-PLUS DR1 stars with the APOGEE–Payne catalog, finding 7703 stars in common.

The second one is the APOGEE–ASPCAP catalog. The APOGEE Stellar Parameters and Chemical Abundances Pipeline (ASPCAP; García Pérez et al. 2016) produces a catalog of stellar labels for APOGEE’s fifteenth data release (DR15) with all quantities for each combined spectrum. We cross-matched the J-PLUS DR1 with the APOGEE–ASPCAP catalog for stars with S/N higher than 100 and obtain 10 688 stars in common.

The third one is the GALAH–Cannon catalog. GALAH (De Silva et al. 2015) is a high-resolution (R ∼ 28 000) spectroscopic survey using the Anglo-Australian Telescope over a two-degree field of view. The second data release (DR2) of GALAH contains the catalog of stellar parameters and abundances determined by the data-driven algorithm, the Cannon (Ness et al. 2015). We cross-matched the J-PLUS DR1 with the GALAH–Cannon catalog and obtain 252 stars in common.

Figure 5 shows the comparisons of Teff, log g, and [Fe/H] between our results and the above three reference catalogs. Overall, our results are in good agreement with the values from these catalogs. We do not expect a perfect match, considering that the reference LAMOST DR5 catalog used for training has differences with respect to these three catalogs. The APOGEE–Payne Teff values are consistent with ours for both giant and dwarf stars, except for dwarfs with Teff lower than 4800 K, where there is an obvious systematic trend – the APOGEE–Payne values are higher than ours, and the bias reaches about 100–220 K. Another noticeable difference is for the giant stars with Teff lower than 4300 K, as the APOGEE–Payne log g values are lower than ours, and the bias reaches 0.2–0.3 dex. The difference between [Fe/H] from our result and that from the APOGEE–Payne shows weak systemic trends for dwarf stars with Teff lower than 4800 K, and the bias reaches 0.10–0.22 dex. Similarly to the comparisons between the APOGEE–Payne values and ours, APOGEE–ASPCAP and GALAH–Cannon exhibit good consistency with the results from our method, except that both sets of results also show systematic deviations in Teff and [Fe/H] for dwarf stars (Teff < 4800 K), and log g for giant stars (Teff < 4300 K). These differences in Teff, log g, and [Fe/H] mainly reflect the systematic differences in our training set, LAMOST DR5, and the reference catalogs. Direct comparisons of the LAMOST DR5, APOGEE–Payne, APOGEE–ASPCAP, and GALAH–Cannon stellar parameters for stars in common are presented in the Appendix (Fig. B.1). It shows consistent patterns with those presented in Fig. 5, demonstrating that CSNet has merely inherited the systematic errors from the training sets.

thumbnail Fig. 5.

Similar to Fig. 3, but for stars in common between the CSNet results and the reference catalogs.

Figure 6 shows the comparisons of elemental abundances between our estimates and that of APOGEE–Payne, APOGEE–ASPCAP, and GALAH–Cannon. A close inspection suggests that there are systematic offsets and trends for [Mg/Fe] and [Ca/Fe] compared to the results from of APOGEE–Payne and APOGEE–ASPCAP. However, our [Mg/Fe] estimates are more consistent with that of GALAH–Cannon. This is because the [Mg/Fe] in the LAMOST–DD-Payne catalog, which is our training set, is derived using the GALAH–Cannon values as a reference (see Sect. 2.2). Systematic deviations of the other elemental abundances (e.g., [α/Fe], [C/Fe], [N/Fe], and [Ca/Fe]) are inherited from the differences among the reference catalogs, as consistent difference patterns exist in the training sets shown in the appendix (Fig. B.2).

thumbnail Fig. 6.

Similar to Fig. 4, but for stars in common between the CSNet results and the reference catalogs.

4.3. The J-PLUS DR1 catalog of stellar parameters and chemical abundances

We selected 4 387 568 stars (MAGABDUALOBJ_CLASS_STAR ≥ 0.6) by cross-matching J-PLUS DR1 with Gaia DR2, and determine their stellar parameters and chemical abundances using CSNet. The catalog is publicly available5. A description of the J-PLUS CSNet stellar parameters and chemical abundances catalog is provided in Table 2.

Considering the photometric quality of J-PLUS DR1 and the limitations of CSNet, we recommend stellar labels for 2 343 597 stars with FLAGS = 0 in all 12 J-PLUS filters and 0.063 < BP − RP < 1.786. As discussed in Sect. 4.2.2, Teff for dwarf stars (Teff < 4800 K) and log g for giant stars (Teff < 4500 K) in our results should be used with caution because they show non-negligible systematic errors related to the Teff values. To avoid large label uncertainties caused by photometric errors, particularly for elemental abundances, we further selected 0.61 million stars with G < 18 and magnitude errors in the 12 J-PLUS filters less than 0.1 mag, including 0.57 million dwarfs and 44 686 giants. We note that the giants and dwarfs are distinguished in the colour-magnitude diagram (BP − RP > 0.95 and MG < 3.9 for giants; BP − RP ≤ 0.95 or MG ≥ 3.9 for dwarfs). Figures 7 and 8 show the stellar density distributions in the planes of Teff–log g, Teff–[Fe/H], (BP − RP)–G, and different CSNet abundances with respect to [Fe/H] for the 0.57 million dwarfs and 44 686 giants, respectively. Their distributions in the Teff–log g diagrams are consistent with those in the colour-magnitude diagrams. The abundance trends are encouraging and are consistent with literature results from LAMOST DD–Payne (Figs. C.1 and C.2). Figures 9 and 10 show distributions of the number density and different CSNet abundances in the R–Z plane for the same sets of dwarfs and giants, respectively. Similarly, the abundance trends perform as expected. For example, values of [Fe/H] decrease and values of [Mg/Fe] and [C/N] increase with increasing distance from the Galactic plane. Detailed scientific investigations of the catalog, such as stellar populations, Galactic components, and gradients based on these abundance results will be presented in the future.

thumbnail Fig. 7.

Density distributions of selected J-PLUS DR1 dwarf stars in the planes of Teff–log g, Teff–[Fe/H], (BP − RP)–G, and different CSNet abundances with respect to [Fe/H], all colour-coded by stellar number density. Only stars with reliable labels by the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP − RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

thumbnail Fig. 8.

Similar to Fig. 7, but for selected J-PLUS DR1 giant stars.

thumbnail Fig. 9.

Distributions of number density and different CSNet abundances in the plane of R–Z for selected J-PLUS DR1 dwarf stars. Only stars with reliable labels by the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP − RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

thumbnail Fig. 10.

Similar to Fig. 9, but for selected J-PLUS DR1 giant stars.

5. Discussion

We combined the recalibrated J-PLUS DR1 and Gaia DR2 to construct 13 stellar colours. Then, the cost-sensitive neural-network-based CSNet algorithm is designed and trained to map from the 13 colours to precise stellar labels. Thanks to the specially designed J-PLUS filters and GaiaBP and RP passbands, CSNet can not only determine the basic stellar atmospheric parameters (effective temperature, Teff, surface gravity, log g, and metallicity, [Fe/H]), but also deliver [α/Fe] and elemental abundances including [C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]. This method performs well even if the training set has a suboptimal distribution of stellar sample properties by increasing the error penalty for the rare subsets of stars. Our results show a high level of agreement with those from the testing samples. Comparisons with the APOGEE–Payne, APOGEE–ASPCAP, and GALAH–Cannon stars also show a good agreement, although some systematic discrepancies do exist, mainly caused by the systematic errors between the different surveys (see more details in Appendix B).

We also investigated the accuracy of the CSNet approach to derive the reddening, E(B − V). Additional experiments show that CSNet is capable of estimating the E(B − V) from stellar colours with a 1σ uncertainty smaller than 0.02 mag. When CSNet is trained using the stellar colours without reddening correction, it achieves slightly lower accuracy for most stellar labels, suggesting that the J-PLUS filters should work well in regions of high extinction.

Deep learning networks, including CSNet, require sufficient data with known labels to train and test a reliable model, which limits the coverage of the stellar labels that can be predicted. In the future, we plan to improve our method to estimate [Fe/H] for very and extremely metal-poor stars. In addition, there are chemically peculiar stars (e.g., the carbon-enhanced metal-poor stars) with abundance ratios that lie outside the ranges we presently explore. We plan to remedy this limitation with the addition of such stars to the training and testing samples in the near future.

We point out that the J-PLUS and S-PLUS surveys are ongoing, and rapidly growing in their coverage of the Northern and Southern Hemispheres, respectively. Application of CSNet, and its planned refinements, to both of these datasets will eventually be able to provide similar results to those presented in this paper for hundreds of millions of stars over much of the sky.

6. Summary

By combining photometric data from the recalibrated J-PLUS DR1, Gaia DR2, and spectroscopic labels from LAMOST, we designed and trained a cost-sensitive neural network, CSNet, to learn the nonlinear mapping from stellar colours to their labels. Special attention is paid to the minority populations in the label space by assigning different weights according to their density distributions. Thanks to the specially designed J-PLUS narrow-band filters, CSNet can not only determine the basic stellar atmospheric parameters (effective temperature, Teff, surface gravity, log g, and metallicity, [Fe/H]), but also deliver [α/Fe] and elemental abundances including [C/Fe], [N/Fe], [Mg/Fe], and [Ca/Fe]. We have achieved precisions of δTeff ∼ 55 K, δlog g ∼ 0.15 dex, and δ [Fe/H] ∼ 0.07 dex, respectively. The uncertainties of the abundance estimates for [α/Fe] and the four individual elements are on the order of δ ∼ 0.04–0.08 dex. We compared our parameter and abundance estimates with those from other spectroscopic catalogs such as APOGEE and GALAH, finding an overall good agreement. Applying our method to J-PLUS DR1, we obtained the aforementioned parameters for over two million stars, providing a powerful dataset for chemo-dynamical analyses of the Milky Way. The catalog of the estimated parameters is publicly accessible. Our results also demonstrate the potential of well-designed and high-quality photometric data in determining stellar parameters and individual elemental abundances.


Acknowledgments

The authors thank Marwan Gebran for his detailed reading and suggestions that improved the clarity of our presentation. We acknowledge David Sobral for a careful reading of the manuscript. This work is supported by the National Natural Science Foundation of China through the projects NSFC 12173007, 11603002, National Key Basic R & D Program of China via 2019YFA0405500, and Beijing Normal University grant No. 310232102. T. C. B. acknowledges partial support from grant PHY 14-30152, Physics Frontier Center/JINA Center for the Evolution of the Elements (JINA-CEE), awarded by the US National Science Foundation. His participation in this work was initiated by conversations that took place during a visit to China in 2019, supported by a PIFI Distinguished Scientist award from the Chinese Academy of Science. C. A. G. acknowledges financial support from the CAPES Brazilian Funding Agency. We acknowledge the science research grants from the China Manned Space Project with NO. CMS-CSST-2021-A08 and CMS-CSST-2021-A09. J. V. acknowledges the technical members of the UPAD for their invaluable work: Juan Castillo, Tamara Civera, Javier Hernández, Ángel López, Alberto Moreno, and David Muniesa. J. A. F. O. acknowledges the financial support from the Spanish Ministry of Science and Innovation and the European Union – NextGenerationEU through the Recovery and Resilience Facility project ICTS-MRR-2021-03-CEFCA. Based on observations made with the JAST/T80 telescope at the Observatorio Astrofísico de Javalambre (OAJ), in Teruel, owned, managed and operated by the Centro de Estudios de Física del Cosmos de Aragón. We acknowledge the OAJ Data Processing and Archiving Unit (UPAD) for reducing and calibrating the OAJ data used in this work. Funding for the J-PLUS Project has been provided by the Governments of Spain and Aragón through the Fondo de Inversiones de Teruel; the Aragón Government through the Reseach Groups E96, E103, and E16_17R; the Spanish Ministry of Science, Innovation and Universities (MCIU/AEI/FEDER, UE) with grants PGC2018-097585-B-C21 and PGC2018-097585-B-C22, the Spanish Ministry of Economy and Competitiveness (MINECO) under AYA2015- 66211-C2-1-P, AYA2015-66211-C2-2, AYA2012-30789, and ICTS-2009-14; and European FEDER funding (FCDD10-4E-867, FCDD13-4E-2685). The Brazilian agencies FINEP, FAPESP, and the National Observatory of Brazil have also contributed to this project. Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.

References

  1. Almeida-Fernandes, F., Sampedro, L., Herpich, F. R., et al. 2022, MNRAS, 511, 4590 [NASA ADS] [CrossRef] [Google Scholar]
  2. Árnadóttir, A. S., Feltzing, S., & Lundström, I. 2010, A&A, 521, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Bai, Y., Liu, J., Bai, Z., et al. 2019, AJ, 158, 93 [NASA ADS] [CrossRef] [Google Scholar]
  4. Bailer-Jones, C. A. L. 2002, Ap&SS, 280, 21 [NASA ADS] [CrossRef] [Google Scholar]
  5. Bailer-Jones, C. A. L. 2011, MNRAS, 411, 435 [Google Scholar]
  6. Benitez, N., Dupke, R., Moles, M., et al. 2014, ArXiv e-prints [arXiv:1403.5237] [Google Scholar]
  7. Buder, S., Asplund, M., Duong, L., et al. 2018, MNRAS, 478, 4513 [Google Scholar]
  8. Cenarro, A. J., Moles, M., Marín-Franch, A., et al. 2014, Proc. SPIE, 9149, 91491I [Google Scholar]
  9. Cenarro, A. J., Moles, M., Cristóbal-Hornillos, D., et al. 2019, A&A, 622, A176 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  10. Chiti, A., Frebel, A., Jerjen, H., et al. 2020, ApJ, 891, 8 [NASA ADS] [CrossRef] [Google Scholar]
  11. Chiti, A., Frebel, A., Mardini, M. K., et al. 2021, ApJS, 254, 31 [NASA ADS] [CrossRef] [Google Scholar]
  12. Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]
  13. Deng, L.-C., Newberg, H. J., Liu, C., et al. 2012, Res. Astron. Astrophys., 12, 735 [Google Scholar]
  14. De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [Google Scholar]
  15. Evans, D. W., Riello, M., De Angeli, F., et al. 2018, A&A, 616, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Galarza, C. A., Daflon, S., Placco, V. M., et al. 2022, A&A, 657, A35 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [Google Scholar]
  19. Holtzman, J. A., Hasselquist, S., Shetrone, M., et al. 2018, AJ, 156, 125 [Google Scholar]
  20. Huang, B., & Yuan, H. 2022, ApJS, 259, 26 [NASA ADS] [CrossRef] [Google Scholar]
  21. Huang, Y., Chen, B.-Q., Yuan, H.-B., et al. 2019, ApJS, 243, 7 [CrossRef] [Google Scholar]
  22. Huang, Y., Yuan, H., Li, C., et al. 2021, ApJ, 907, 68 [NASA ADS] [CrossRef] [Google Scholar]
  23. Huang, Y., Beers, T. C., Wolf, C., et al. 2022, ApJ, 925, 164 [NASA ADS] [CrossRef] [Google Scholar]
  24. Kingma, D. P., & Ba, J. 2014, ArXiv e-prints [arXiv:1412.6980] [Google Scholar]
  25. Ksoll, V. F., Ardizzone, L., Klessen, R., et al. 2020, MNRAS, 499, 5447 [NASA ADS] [CrossRef] [Google Scholar]
  26. López-Sanjuan, C., Varela, J., Cristóbal-Hornillos, D., et al. 2019, A&A, 631, A119 [Google Scholar]
  27. López-Sanjuan, C., Yuan, H., Vázquez Ramió, H., et al. 2021, A&A, 654, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Luo, A.-L., Zhao, Y.-H., Zhao, G., et al. 2015, Res. Astron. Astrophys., 15, 1095 [Google Scholar]
  29. Liu, X.-W., Yuan, H.-B., Huo, Z.-Y., et al. 2014, IAU Symp., 298, 310 [Google Scholar]
  30. Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [Google Scholar]
  31. Marin-Franch, A., Taylor, K., Cenarro, J., Cristobal-Hornillos, D., & Moles, M. 2015, IAU General Assembly, 29, 2257381 [Google Scholar]
  32. Mendes de Oliveira, C., Ribeiro, T., Schoenell, W., et al. 2019, MNRAS, 489, 241 [NASA ADS] [CrossRef] [Google Scholar]
  33. Miller, A. A., Bloom, J. S., Richards, J. W., et al. 2015, ApJ, 798, 122 [NASA ADS] [CrossRef] [Google Scholar]
  34. Ness, M., Hogg, D. W., Rix, H.-W., et al. 2015, ApJ, 808, 16 [NASA ADS] [CrossRef] [Google Scholar]
  35. Niu, Z., Yuan, H., & Liu, J. 2021a, ApJ, 908, L14 [NASA ADS] [CrossRef] [Google Scholar]
  36. Niu, Z., Yuan, H., & Liu, J. 2021b, ApJ, 909, 48 [NASA ADS] [CrossRef] [Google Scholar]
  37. Onken, C. A., Wolf, C., Bessell, M. S., et al. 2019, PASA, 36, e033 [Google Scholar]
  38. Oommen, T., Baise, L. G., & Vogel, R. M. 2011, Math. Geosci., 43, 99 [CrossRef] [Google Scholar]
  39. Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
  40. Ting, Y.-S., Conroy, C., Rix, H.-W., et al. 2019, ApJ, 879, 69 [NASA ADS] [CrossRef] [Google Scholar]
  41. Wang, C., Bai, Y., López-Sanjuan, C., et al. 2022, A&A, 659, A144 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Whitten, D. D., Placco, V. M., Beers, T. C., et al. 2019, A&A, 622, A182 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Whitten, D. D., Placco, V. M., Beers, T. C., et al. 2021, ApJ, 912, 147 [NASA ADS] [CrossRef] [Google Scholar]
  44. Wolf, C., Onken, C. A., Luvaul, L. C., et al. 2018, PASA, 35, e010 [Google Scholar]
  45. Wu, Y., Luo, A.-L., Li, H.-N., et al. 2011, Res. Astron. Astrophys., 11, 924 [Google Scholar]
  46. Xiang, M., Ting, Y.-S., Rix, H.-W., et al. 2019, ApJS, 245, 34 [Google Scholar]
  47. Xu, S., Yuan, H. B., Niu, Z. X., et al. 2022, ApJS, 258, 44 [NASA ADS] [CrossRef] [Google Scholar]
  48. Yang, L., Yuan, H., Zhang, R., et al. 2021, ApJ, 908, L24 [NASA ADS] [CrossRef] [Google Scholar]
  49. Yanny, B., Rockosi, C., Newberg, H. J., et al. 2009, AJ, 137, 4377 [Google Scholar]
  50. Yuan, H. B., Liu, X. W., & Xiang, M. S. 2013, MNRAS, 430, 2188 [NASA ADS] [CrossRef] [Google Scholar]
  51. Yuan, H., Liu, X., Xiang, M., et al. 2015a, ApJ, 799, 134 [NASA ADS] [CrossRef] [Google Scholar]
  52. Yuan, H., Liu, X., Xiang, M., et al. 2015b, ApJ, 803, 13 [CrossRef] [Google Scholar]
  53. Yuan, H., Liu, X., Xiang, M., et al. 2015c, ApJ, 799, 133 [NASA ADS] [CrossRef] [Google Scholar]
  54. Zhang, R.-Y., Yuan, H.-B., Liu, X.-W., et al. 2021, Res. Astron. Astrophys., 21, 319 [CrossRef] [Google Scholar]
  55. Zheng, J., Zhao, G., Wang, W., et al. 2018, Res. Astron. Astrophys., 18, 147 [CrossRef] [Google Scholar]

Appendix A: Correlation analyses between J-PLUS colours and stellar labels

The core idea of CSNet is to construct a mapping from J-PLUS colours to stellar labels. This leaves the question as to whether our results are from specific sensitive J-PLUS colours or just from correlations among the stellar labels themselves.

Figures A.1A.6 show comparisons for [Mg/Fe], [C/Fe], and [N/Fe] between our results and LAMOST DD–Payne in the plane of [X-Fe]–[Fe/H]. The differences between the CSNet results and the LAMOST DD–Payne are very small. Figs. A.7A.12 show comparisons in the colour-colour diagrams (BP − J0515 – BP − RP for [Mg/Fe], BP − J0430 – BP − RP for [C/Fe], and BP − J0378 – BP − RP for [N/Fe]). We can see that abundances of our results are all consistent with those of LAMOST DD–Payne, indicating that the training process of CSNet works as expected. At a given narrow [Fe/H] range and BP − RP colour, correlations between BP − J0515 and [Mg/Fe], BP − J0430 and [C/Fe], BP − J0378 and [N/Fe] are significant, demonstrating that CSNet measures these elemental abundances from abinitio features, rather than drawing on astrophysical correlations among the stellar labels.

thumbnail Fig. A.1.

Stellar number density distributions in the plane of [Mg/Fe] – [Fe/H] from the LAMOST catalog (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.05], [1.05, 1.15], and [1.15, 1.40], respectively.

thumbnail Fig. A.2.

Similar to Fig. A.1, but for the dwarf stars. The BP − RP ranges are [0.25, 0.70] (left column), [0.70, 1.00] (middle column), and [1.00, 1.50] (right column), respectively.

thumbnail Fig. A.3.

Stellar number density distributions in the plane of [C/Fe] – [Fe/H] from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.05], [1.05, 1.20], and [1.20, 1.60], respectively.

thumbnail Fig. A.4.

Similar to Fig. A.3, but for the dwarf stars. The BP − RP ranges are [0.25, 0.70] (left column), [0.70, 1.00] (middle column) and [1.00, 1.50] (right column), respectively.

thumbnail Fig. A.5.

Stellar number density distributions in the plane of [N/Fe] – [Fe/H] from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.10], [1.10, 1.20], and [1.20, 1.60], respectively.

thumbnail Fig. A.6.

Similar to Fig. A.5, but for the dwarf stars. The BP − RP ranges are [0.50, 0.80] (left column), [0.80, 1.10] (middle column) and [1.10, 1.50] (right column), respectively.

thumbnail Fig. A.7.

Distributions of [Mg/Fe] in the [BP − RP] – [BP − J0515] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by [Mg/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−0.5, −0.3], [−0.3, −0.1], and [−0.1, 0.1], respectively.

thumbnail Fig. A.8.

Similar to Fig. A.7, but for the dwarf stars.

thumbnail Fig. A.9.

Distributions of [C/Fe] in the [BP − RP] – [BP − J0430] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by [C/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−1.2, −0.9], [−0.7, −0.5], and [−0.1, 0.1], respectively.

thumbnail Fig. A.10.

Similar to Fig. A.9, but for the dwarf stars.

thumbnail Fig. A.11.

Distributions of [N/Fe] in the [BP − RP] – [BP − J0378] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing giant stars, all colour-coded by [N/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−0.7, −0.5], [−0.5, −0.3], and [−0.3, −0.1], respectively.

thumbnail Fig. A.12.

Similar to Fig. A.11, but for the dwarf stars.

Appendix B: Comparisons of LAMOST with APOGEE–Payne, APOGEE–ASPCAP, and GALAH–Cannon stellar labels

We show in this study that systematic errors between CSNet and the validation samples are inherited from the training sets, rather than from the models themselves. Figs. B.1 and Figs. B.2 show comparisons of LAMOST catalogs and other surveys (APOGEE–Payne, APOGEE–ASPCAP, and GALAH–Cannon) for basic stellar atmospheric parameters and elemental abundances, respectively. Systematic discrepancies and trends for stellar labels between the LAMOST catalogs and above three surveys are found. More importantly, these patterns are consistent with those shown in the CSNet results (Figs. 5 and Figs. 6), suggesting that the systematic offsets shown in the main text are inherited from the training data.

thumbnail Fig. B.1.

Comparisons for stellar atmospheric parameters including Teff, log g, and [Fe/H] between the LAMOST DR5 and the APOGEE–Payne (left column), APOGEE–ASPCAP (middle column), and GALAH–Cannon (right column). Only stars in commn with the LAMOST training and testing samples are used. The black and red dots represent dwarfs and giants, respectively. Error bars are coloured in blue and green for dwarfs and giants, respectively, and indicate the mean value “bias” and 1-σ uncertainty of the residuals estimated using a Gaussian fit.

thumbnail Fig. B.2.

Similar to Fig. B.1, but for the elemental abundances as a function of Teff. We note that here the LAMOST elemental abundances are from the DD–Payne.

Appendix C: Stellar label distributions of LAMOST

Here, we selected stars in common between the J-PLUS DR1, Gaia DR2, and LAMOST DR5. For these stars, we further applied the same criteria as those in Fig. 7: (1) FLAGS = 0; (2) 0.063 < BP − RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag. Fig. C.1 and Fig. C.2 show the density distributions of the selected dwarfs and giants in the planes of Teff – log g, Teff – [Fe/H], and (BP − RP) – G, and different elemental abundances with respect to [Fe/H], with the stellar labels from LAMOST, respectively. The results are consistent with those shown in Fig. 7 and Fig. 8.

thumbnail Fig. C.1.

Density distributions of selected J-PLUS-Gaia-LAMOST dwarf stars in the planes of Teff–log g, Teff–[Fe/H], (BP − RP)–G, and different elemental abundances with respect to [Fe/H], all colour-coded by stellar number density. The stellar labels are from LAMOST. Only stars that satisfy the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP−RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

thumbnail Fig. C.2.

Similar to Fig. C.1, but for giant stars.

All Tables

Table 1.

Empirical reddening coefficients for stellar colours.

Table 2.

Adopted constraints on the datasets for training and testing CSNet.

Table 3.

Description for the J-PLUS CSNet stellar parameters and chemical abundances catalog (1).

All Figures

thumbnail Fig. 1.

Distributions of the CSNet training and the testing samples in the planes of Teff–log g, Teff–[Fe/H], [BPRP]–G, [α/Fe]–[Fe/H], [C/Fe]–[Fe/H], [N/Fe]–[Fe/H], [Mg/Fe]–[Fe/H], [Ca/Fe]–[Fe/H], [C/N]–[Fe/H], [C/α]–[Fe/H], [Mg/α]–[Fe/H], and [Ca/α]–[Fe/H], from top to bottom and left to right. The black and red dots represent the training set and testing set stars, respectively. To avoid crowding, only 10 per cent of the selected stars are plotted.

In the text
thumbnail Fig. 2.

ANN structure of CSNet. The structure consists of a scalable, five-layered feed-forward network. The input layer imports the 13 stellar colours transformed by J-PLUS DR1 and Gaia DR2 after extinction correction as the basic features. The following three hidden layers with activation functions extract the deep nonlinear features of the stellar colours. The output layer provides a stellar label with a weighted sum of the learned features in the last hidden layer.

In the text
thumbnail Fig. 3.

Training set (left) and testing set (right) for Teff, log g, and [Fe/H] for the CSNet results. The black and red dots represent dwarfs and giants, respectively. Error bars, coloured by blue and green, for dwarfs and giants, respectively, denote the mean value “bias” and 1σ uncertainty of the residuals estimated using Gaussian fits.

In the text
thumbnail Fig. 4.

Similar to Fig. 3, but here we show residuals for the CSNet elemental abundances as a function of effective temperature.

In the text
thumbnail Fig. 5.

Similar to Fig. 3, but for stars in common between the CSNet results and the reference catalogs.

In the text
thumbnail Fig. 6.

Similar to Fig. 4, but for stars in common between the CSNet results and the reference catalogs.

In the text
thumbnail Fig. 7.

Density distributions of selected J-PLUS DR1 dwarf stars in the planes of Teff–log g, Teff–[Fe/H], (BP − RP)–G, and different CSNet abundances with respect to [Fe/H], all colour-coded by stellar number density. Only stars with reliable labels by the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP − RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

In the text
thumbnail Fig. 8.

Similar to Fig. 7, but for selected J-PLUS DR1 giant stars.

In the text
thumbnail Fig. 9.

Distributions of number density and different CSNet abundances in the plane of R–Z for selected J-PLUS DR1 dwarf stars. Only stars with reliable labels by the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP − RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

In the text
thumbnail Fig. 10.

Similar to Fig. 9, but for selected J-PLUS DR1 giant stars.

In the text
thumbnail Fig. A.1.

Stellar number density distributions in the plane of [Mg/Fe] – [Fe/H] from the LAMOST catalog (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.05], [1.05, 1.15], and [1.15, 1.40], respectively.

In the text
thumbnail Fig. A.2.

Similar to Fig. A.1, but for the dwarf stars. The BP − RP ranges are [0.25, 0.70] (left column), [0.70, 1.00] (middle column), and [1.00, 1.50] (right column), respectively.

In the text
thumbnail Fig. A.3.

Stellar number density distributions in the plane of [C/Fe] – [Fe/H] from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.05], [1.05, 1.20], and [1.20, 1.60], respectively.

In the text
thumbnail Fig. A.4.

Similar to Fig. A.3, but for the dwarf stars. The BP − RP ranges are [0.25, 0.70] (left column), [0.70, 1.00] (middle column) and [1.00, 1.50] (right column), respectively.

In the text
thumbnail Fig. A.5.

Stellar number density distributions in the plane of [N/Fe] – [Fe/H] from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by stellar number density. Different columns are for stars of different BP − RP ranges. From left to right these are [0.95, 1.10], [1.10, 1.20], and [1.20, 1.60], respectively.

In the text
thumbnail Fig. A.6.

Similar to Fig. A.5, but for the dwarf stars. The BP − RP ranges are [0.50, 0.80] (left column), [0.80, 1.10] (middle column) and [1.10, 1.50] (right column), respectively.

In the text
thumbnail Fig. A.7.

Distributions of [Mg/Fe] in the [BP − RP] – [BP − J0515] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by [Mg/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−0.5, −0.3], [−0.3, −0.1], and [−0.1, 0.1], respectively.

In the text
thumbnail Fig. A.8.

Similar to Fig. A.7, but for the dwarf stars.

In the text
thumbnail Fig. A.9.

Distributions of [C/Fe] in the [BP − RP] – [BP − J0430] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing sample giant stars, all colour-coded by [C/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−1.2, −0.9], [−0.7, −0.5], and [−0.1, 0.1], respectively.

In the text
thumbnail Fig. A.10.

Similar to Fig. A.9, but for the dwarf stars.

In the text
thumbnail Fig. A.11.

Distributions of [N/Fe] in the [BP − RP] – [BP − J0378] colour-colour diagram from the LAMOST DD–Payne (top panels) and the CSNet results (bottom panels) for the training and testing giant stars, all colour-coded by [N/Fe]. Different columns are for stars of different [Fe/H] ranges. From left to right these are [−0.7, −0.5], [−0.5, −0.3], and [−0.3, −0.1], respectively.

In the text
thumbnail Fig. A.12.

Similar to Fig. A.11, but for the dwarf stars.

In the text
thumbnail Fig. B.1.

Comparisons for stellar atmospheric parameters including Teff, log g, and [Fe/H] between the LAMOST DR5 and the APOGEE–Payne (left column), APOGEE–ASPCAP (middle column), and GALAH–Cannon (right column). Only stars in commn with the LAMOST training and testing samples are used. The black and red dots represent dwarfs and giants, respectively. Error bars are coloured in blue and green for dwarfs and giants, respectively, and indicate the mean value “bias” and 1-σ uncertainty of the residuals estimated using a Gaussian fit.

In the text
thumbnail Fig. B.2.

Similar to Fig. B.1, but for the elemental abundances as a function of Teff. We note that here the LAMOST elemental abundances are from the DD–Payne.

In the text
thumbnail Fig. C.1.

Density distributions of selected J-PLUS-Gaia-LAMOST dwarf stars in the planes of Teff–log g, Teff–[Fe/H], (BP − RP)–G, and different elemental abundances with respect to [Fe/H], all colour-coded by stellar number density. The stellar labels are from LAMOST. Only stars that satisfy the following criteria are used: (1) FLAGS = 0; (2) 0.063 < BP−RP < 1.786; (3) G < 18; (4) err (all J-PLUS filters) < 0.1 mag.

In the text
thumbnail Fig. C.2.

Similar to Fig. C.1, but for giant stars.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.