Issue |
A&A
Volume 619, November 2018
|
|
---|---|---|
Article Number | A134 | |
Number of page(s) | 27 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/201833351 | |
Published online | 16 November 2018 |
ATHOS: On-the-fly stellar parameter determination of FGK stars based on flux ratios from optical spectra⋆,⋆⋆
1
Astronomisches Rechen-Institut, Zentrum für Astronomie der Universität Heidelberg, Mönchhofstr. 12-14, 69120
Heidelberg, Germany
e-mail: mhanke@ari.uni-heidelberg.de
2
Max Planck Institute for Astronomy, Königstuhl 17, 69117 Heidelberg, Germany
3
Dark Cosmology Centre, The Niels Bohr Institute, Juliane Maries Vej 30, 2100 Copenhagen, Denmark
Received:
3
May
2018
Accepted:
30
August
2018
The rapidly increasing number of stellar spectra obtained by existing and future large-scale spectroscopic surveys feeds a demand for fast and efficient tools for the spectroscopic determination of fundamental stellar parameters. Such tools should not only comprise customized solutions for one particular survey or instrument, but, in order to enable cross-survey comparability, they should also be capable of dealing with spectra from a variety of spectrographs, resolutions, and wavelength coverages. To meet these ambitious specifications, we developed ATHOS (A Tool for HOmogenizing Stellar parameters), a fundamentally new analysis tool that adopts easy-to-use, computationally inexpensive analytical relations tying flux ratios (FRs) of designated wavelength regions in optical spectra to the stellar parameters effective temperature (T eff), iron abundance ([Fe/H]), and surface gravity (log g). Our T eff estimator is based on FRs from nine pairs of wavelength ranges around the Balmer lines Hβ and Hα, while for [Fe/H] and log g we provide 31 and 11 FRs, respectively, which are spread between ∼4800 Å and ∼6500 Å; a region covered by most optical surveys. The analytical relations employing these FRs were trained on N = 124 real spectra of a stellar benchmark sample that covers a large parameter space of T eff ≈ 4000–6500 K (spectral types F to K), [Fe/H] ≈ −4.5 to 0.3 dex, and log g ≈ 1–5 dex, which at the same time reflects ATHOS’ range of applicability. We find accuracies of 97 K for T eff, 0.16 dex for [Fe/H], and 0.26 dex for log g, which are merely bounded by finite uncertainties in the training sample parameters. ATHOS’ internal precisions can be better by up to 70%. We tested ATHOS on six independent large surveys spanning a wide range of resolutions (R = λ/Δ λ ≈ 2000–52 000), amongst which are the Gaia-ESO and the SDSS/SEGUE surveys. The exceptionally low execution time (< 30 ms per spectrum per CPU core) together with a comparison to the literature parameters showed that ATHOS can successfully achieve its main objectives, in other words fast stellar parametrization with cross-survey validity, high accuracy, and high precision. These are key to homogenize the output from future surveys, such as 4MOST or WEAVE.
Key words: methods: data analysis / surveys / techniques: spectroscopic / stars: abundances / stars: fundamental parameters
Full Tables 1 and 2 are only available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/619/A134
© ESO 2018
1. Introduction
While spectroscopic campaigns aimed at dissecting the formation history of the Milky Way have a long-standing history (e.g., Beers et al. 1985; Christlieb et al. 2001; Yanny et al. 2009), the astronomical landscape of the next decades will be governed by ever-larger spectroscopic surveys that aim at painting a complete chemo-dynamic map of our Galaxy. Amongst these are the surveys RAVE (Steinmetz et al. 2006), SEGUE (Yanny et al. 2009), APOGEE (Majewski et al. 2017), GALAH (De Silva et al. 2015), Gaia-RVS (Cropper et al. 2018), LAMOST (Zhao et al. 2012), Gaia-ESO (Gilmore et al. 2012), 4MOST (de Jong et al. 2012), and WEAVE (Dalton et al. 2012). All these build on the multiplexing capacities of present and future spectrographs and have the goal of expanding the six-dimensional phase space into a multidimensional information space by adding chemical abundance measurements of a large number of tracers of chemical evolution for several hundred thousands of stars. Inevitably, this requires high spectral resolution (R = λ/F W H M ≳ 20 000; Caffau et al. 2013), but also a precise and accurate knowledge of the stellar parameters1 of the target stars.
Various methods for parameter determination are in use, ranging from photometric calibrations of a temperature scale (e.g., Alonso et al. 1996, 1999), excitation equilibrium using large numbers of Fe lines, Balmer-line scrutiny, to least-squares fitting of spectral templates or line indices over a broad parameter grid (Lee et al. 2008). Systematic effects can, to first order, be decreased by using analysis techniques differentially to a standard star of known parameters (Fulbright et al. 2006; Koch & McWilliam 2008). To ensure success, all these methods, in turn, require accurate atomic data and stellar model atmospheres (Barklem et al. 2002), and yet, degeneracies and covariances, in particular between T eff and log g, are often inevitable (McWilliam et al. 1995; Hansen et al. 2011). Further problems arise with large data sets, where the homogenization of parameter scales (Venn et al. 2004; Smiljanic et al. 2014) and the sheer computational time for spectral analysis become an issue.
Here, we introduce a new, fast, and efficient algorithm for stellar parameter determination, named ATHOS (A Tool for HOmogenizing Stellar parameters). ATHOS relies on the measurement of flux ratios (FRs) between well-tested spectral regions that are sensitive to specific parameter combinations and that we optimize to reproduce a compilation of training spectra and their parameters, amongst which are the accurate and precise parameters of the Gaia-ESO benchmark stars (Jofré et al. 2014).
Line-depth ratios (LDRs) of metal lines have long been used as T eff indicators, taking advantage of variations of the lines’ temperature sensitivity over broad ranges, while at the same time being largely pressure-independent as long as the lines are not saturated (Gray & Johanson 1991; Kovtyukh et al. 2003). An extension of these methods rather employs the ratios of flux points that do not necessarily coincide with the line cores, but with other parts of the lines that show empirical, strong sensitivities to the parameter of choice. Kovtyukh et al. (2003) provided a very precise calibration of LDRs to the temperatures of F to K dwarfs. However, their stellar sample restricted its applicability to a narrow metallicity window of −0.5 < [Fe/H] < +0.5 dex and the way of measuring line depths of the lines – that is profile fits to the line – made this method prone to the uncertainties of continuum normalization, which is circumvented by using FRs with rather narrow wavelength spacing. By not relying on pairs of low- and high-excitation lines, ATHOS further allows for measurements of parameters down to much lower metallicities.
Notable features of ATHOS are its fast performance (∼25 ms/< 10 ms for a high-/low-resolution spectrum), applicability over a wide range of resolutions (R ≳ 10 000 for all parameters; R ≳ 2000 for the T eff scale), and validity over a broad range of stellar parameters (T eff ≈ 4000–6500 K, [Fe/H] ≈ −4.5 to 0.3 dex, log g ≈ 1–5 dex). This makes it an ideal tool to provide precise and accurate stellar parameters for large samples within seconds – an important asset in the era of future spectroscopic missions.
This tool is meant to work for all optical spectra, not just for stars originating from one survey as most tailored pipelines do, but it offers a way to homogenize large samples from different surveys. It is by no means an attempt to supersede various survey pipelines, but a simple way to put the millions of stars to be observed on the same scale, so that these are homogeneously treated and not biased by the choices or methods adopted within individual surveys.
This paper is organized as follows: In Sect. 2 we present the set of training spectra used in this study together with a brief discussion of the stellar parameter derivation. Furthermore, the homogenization procedure and treatment of telluric contamination is described. Section 3 presents the concept of FRs and how we used correlation coefficients to find the FRs carrying information about stellar parameters, while in Sect. 4 the computational implementation of the deduced analytical relations is outlined. ATHOS’ stability against resolution and signal-to-noise ratio (S/N) is discussed in Sect. 5, where we also compare our parameter scales to various spectroscopic surveys. Finally, we conclude with a summary of our results in Sect. 6.
2. Training set
This project was originally meant to be model-driven. Therefore, we initially synthesized spectra of a homogeneous and dense coverage of the parameter space and conducted the analysis outlined in Sect. 3. Unfortunately, it turned out that the optimal FRs deduced from theory alone cannot be reproduced in real spectra, and vice versa. A possible explanation is the oversimplification made throughout the modeling by preferring the assumptions of local thermodynamic equilibrium (LTE) and plane-parallel, static atmospheres over a fully three-dimensional, hydrodynamic treatment under non-LTE conditions. This, in turn, would come with enormous computational costs. Further, we identify inaccurate line data as an additional caveat of the theoretical treatment. Due to these limitations, the decision was made to base this work entirely on observed spectra with accurately determined stellar parameters.
To this end, we have compiled a grid of in total 195 high-resolution, high S/N spectra of 124 stars covering the visual wavelength range including the regions around the strong features of the Hβ profile at 4861.3 Å, the Mg I b triplet at ∼5175 Å, the Na-D doublet at ∼5890 Å, and Hα at 6562.8 Å. A valuable part of the sample is the Gaia FGK benchmark star library of Blanco-Cuaresma (2014; GBS) with attributed stellar parameters from Jofré et al. (2014) and Heiter et al. (2015) for the more metal-rich stars, and Hawkins et al. (2016) for the metal-poor targets. Their data were obtained with four different spectrometers and resolutions. These are the HARPS spectrograph at the ESO, La Silla 3.6 m telescope; UVES on the VLT/UT2 at Cerro Paranal, Chile; ESPaDOnS on the Canada-France-Hawaii Telescope at the Mauna Kea observatory, Hawaii, and its twin NARVAL mounted on the 2 m Telescope Bernard Lyot on Pic du Midi, France. Most of the stars in the library have data available from at least two of these instruments and the S/N is mostly well above 100 pixel−1. Moreover, the GBS library includes high-quality spectra of the Sun and Arcturus observed by Hinkle et al. (2000) with the Echelle spectrograph at the Coudé Feed telescope on Kitt Peak, Arizona. We added UVES dwarf and giant spectra from Hansen et al. (2012; hereafter CJH12) to the grid. The metal-poor (−1 dex to −4.5 dex) end was additionally populated by stars characterized in Roederer et al. (2014; henceforth R14). Unfortunately, their spectra are not publicly available. Consequently, we used the ESO Advanced Data Products (ADP) query to cross-check for publicly available data, yielding matches for 48 stars with mainly UVES and some HARPS observations. In order to fill the otherwise sparsely populated horizontal branch (HB), we once again employed the ADP to retrieve spectra for the cooler targets in the spectroscopic HB studies of For & Sneden (2010) and Afşar et al. (2012) (red HB). A list of all the data including the respective stellar parameters and sources thereof can be found in Table 2.
Training set information.
2.1. Stellar parameters
For the GBS sample, Heiter et al. (2015) and Hawkins et al. (2016) inferred T eff and log g from angular diameters and bolometric fluxes, as well as from fitting stellar evolutionary tracks. The deduced errors for these procedures range from ∼20 K–100 K and 0.01 dex–0.15 dex for stars other than the Sun, which exhibits comparatively negligible errors. In light of the new interferometric temperature measurements for HD 140283, HD 122563, and HD 103095 by Karovicova et al. (2018), who found earlier measurements to have suffered from systematic effects much larger than the provided errors, we decided to use their more recent values and errors for these stars. For the metallicity, Jofré et al. (2014) averaged line-by-line abundances that stem from up to seven different analysis codes. These have been each corrected for departures from LTE. Uncertainties achieved in this way span 0.03 dex up to 0.40 dex.
CJH12 derived effective temperatures for their sample using photometric color-T eff relations. The same study provides log g based on either parallaxes in conjunction with stellar structure equations, or by enforcing ionization balance, that is requiring deduced abundances from Fe I and Fe II to agree with each other. The provided [Fe/H] values originate from an LTE analysis of Fe I lines.
R14 employed a strictly spectroscopic determination of T eff by balancing abundances of Fe I transitions at low and high excitation potential. For stars cooler than the main sequence turnoff (MSTO), log g was based on fits to theoretical isochrones, while for the hotter stars it was inferred from ionization balance. [Fe/H] values for the R14 targets were calculated using LTE Fe II abundances, which ought to experience smaller corrections in an NLTE treatment (e.g., Bergemann et al. 2012; Lind et al. 2012).
Due to the former two studies pursuing different approaches to obtain stellar parameters – notably T eff from photometry or from excitation balance and [Fe/H] from Fe I or from Fe II abundances – we decided to re-analyze both samples in a homogeneous investigation2. This was done by employing a common, carefully selected and inspected Fe line list, in conjunction with equivalent widths (EWs). The Fe line list was compiled for CJH12 and used therein. The lines were chosen such that there was no trend with wavelength, and so that excitation and ionization trends were tight (little scatter). The Fe I lines are from the Vienna Atomic Line Database (VALD, Piskunov et al. 1995, 2008 version), the Oxford group Blackwell et al. (1979a,b, 1982a,b,c), Blackwell & Shallis (1979), O’Brian et al. (1991), and Nissen et al. (2007). For Fe II the list was based on VALD, Blackwell et al. (1980), and Nissen et al. (2007). The line list is presented in Table 1.
In case of the CJH12 spectra, EWs were computed from our library spectra (see next section) using our own, semi-automated EW tool EWCODE (Hanke et al. 2017). For spectra in the R14 sample, we relied on published EWs after cross-matching our line list with Roederer et al. (2014). Systematic differences in the methods to determine EWs and/or the spectrographs (UVES compared to MIKE) could be excluded by checking the EW results of Fe lines for the five stars in common between R14 and CJH12. At a mean deviation of 0.67 ± 2.07 mÅ (root mean square deviation, rms) no significant discrepancy was found (see Fig. 1). We note that not all of these EWs entered our subsequent analysis, because the stars in question were present in the GBS sample, which supersedes our parameters. We used EWs to constrain T eff by enforcing excitation equilibrium of Fe I lines adopting plane-parallel ATLAS9 model atmospheres interpolated from the grid by Castelli & Kurucz (2004). Line-by-line abundances were computed using the LTE analysis code MOOG (Sneden 1973, July 2014 release). Typical temperature uncertainties are of order 100 K. In parallel to excitation equilibrium, the empirical atmospheric microturbulence parameter, v t, was tuned to satisfy agreement between weak and strong lines. For the majority of stars, where CJH12 and R14 used parallaxes and photometry to deduce log g, we did not allow the model gravity to vary and used the literature values instead. For the other stars, ionization equilibrium was required to derive log g (labeled in Table 2). Finally, our estimate for [Fe/H] is based on Fe II abundances derived from the optimal set of model atmosphere parameters. Following, for example, Roederer et al. (2014), here we preferred the ionized species over the neutral one at the expense of number statistics, because it is less prone to NLTE effects. For the iron abundance, we adopted errors of 0.10 dex.
![]() |
Fig. 1. Upper panel: comparison of our EWCODE runs on UVES spectra with the literature values determined from MIKE spectra by R14. The one-to-one trend is shown by the black dashed line. Lower panel: residual distribution. The solid and dashed red lines indicate the mean deviation of 0.67 mÅ and the rms scatter of 2.07 mÅ, respectively. |
For the red HB sample, For & Sneden (2010) and Afşar et al. (2012) used only spectroscopic indicators, that is excitation equilibrium for T eff and LTE ionization balance for log g and consequently [Fe/H] from Fe lines. The provided uncertainties are 150 K, 0.16 dex, and ∼0.1 dex, respectively. Here, we adopted the literature parameters.
Since a few stars have been covered by more than one of the individual subsets discussed above, we had to homogenize the deduced stellar parameters from the different studies. For those stars that occur in the GBS, we chose the GBS parameters as reference. If this was not the case we averaged over the parameters we have derived from different EW sources and used the deviations and respective uncertainties for a new uncertainty estimate. In the present study we relied on the solar chemical composition by Asplund et al. (2009) stating log ϵ(Fe)⊙ = 7.50 dex. Hence, GBS metallicities, which are based on log ϵ(Fe)⊙ = 7.45 dex by Grevesse et al. (2007), had to be adjusted accordingly. The final training parameters of the spectra entering our analysis can be found in Table 2.
The selected spectra cover stars in the most relevant parts of the Hertzsprung–Russel diagram (left panel of Fig. 2), viz. on the main sequence (MS), the MSTO, the subgiant branch (SGB), the red-giant branch (RGB), and the HB. In terms of stellar parameters, the training set spans a parameter space from T eff ≈ 4000 to 6500 K, log g ≈ 1–5 dex, and [Fe/H] ≈ −4.5 to 0.30 dex. Figure 2 illustrates the distribution of our training sample in stellar parameter space (T eff, log g, [Fe/H]).
![]() |
Fig. 2. Distribution of the training set in the parameters T eff, [Fe/H], and log g. The color coding in each plot indicates the respective missing dimension. |
2.2. Grid homogenization
For the purpose of spectral homogeneity and computationally efficient access, spectra from different sources and spectrographs were first shifted to rest wavelengths. This was achieved by using radial velocities determined from a cross-correlation, either with a template spectrum of the Sun or Arcturus (α Boo), depending on which of those is closer to the target in stellar parameter space. Imprecisions introduced by using these metal-rich templates for metal-poor targets are unproblematic for this investigation (effect of less than 1 km s−1), which we validated by cross-correlating some of the RV-shifted, metal-poor targets against each other. Next, the spectra were degraded to match a resolution of R = 45 000 by convolution with a Gaussian kernel of appropriate width. Some of the spectra in the R14 sample are originally at resolutions slightly below the desired one, but still well above 40 000. We kept those at their original value and point out that this has negligible effects on this study (see Sect. 5.1). Finally, the data were re-binned to a common, linear wavelength scale with equidistant spacing of δ λ = 0.017 Å pixel−1, this configuration being representative for a typical UVES580 setup (Pasquini et al. 2000). We did not normalize the training spectra since the method introduced here (Sect. 3) considers relative fluxes and is consequently scale-free.
2.3. Telluric contamination
In the Hα region, telluric absorption plays a non-negligible role in the line shape of this feature. As briefly noted in, for example, Eaton (1995) and Cayrel et al. (2011), there is a wealth of absorption features caused by H2O vapor in the Earth’s lower atmosphere falling right in the spectral region around Hα. None of the archival spectra in our set has been corrected for telluric contamination. We address this issue in Fig. 3, where we plotted a portion around Hα of four out of the six available spectra of the metal-poor SGB star HD 140283 (T eff = 5522 K, log g = 3.58 dex, [Fe/H] = −2.41 dex; Jofré et al. 2014; Heiter et al. 2015) acquired at different epochs. Being fairly metal-poor and hot, HD 140283 is not expected to show substantial stellar absorption in the presented region except for Hα itself. However, there is a clear indication of contamination from lines moving with the topocentric – that is the telescope’s – rest frame. We retrieved a telluric absorption model using SkyCalc (Noll et al. 2012; Jones et al. 2013), a tool dedicated to compute sky models for the VLT observatory on Cerro Paranal at 2640 m above sea level. We did not attempt to match the ambient conditions of the observations of HD 140283, but used a global model for a zenith pointing and a seasonal averaged precipitable water vapor of 2.5 mm. A comparison of the models to the observed spectra reveals that the vast majority of the small-scale features originate from telluric absorbers. We note the variations of the depths of the real tellurics between observations and attribute them to varying observing conditions such as airmass and/or water vapor content in the lower atmosphere.
![]() |
Fig. 3. Telluric contamination due to H2O vapor in Earth’s lower atmosphere. Shown as solid lines are rest-frame spectra of HD 140283 taken at different epochs and line-of-sight radial velocities. The apparent motion with respect to the heliocentric velocity v helio = −169.00 km s−1 is indicated in the legend. All dashed lines represent the same topocentric telluric absorption model (gray, see text for details) but being shifted in wavelength space to match the radial velocity of their observed counterpart (light-colored). For better visibility the models are shifted in flux direction as well. |
As we are aiming for stellar parameter determinations irrespective of the targets’ relative motions, we have to take into account the fact that – depending on radial velocity – contamination can in principle prevail at any wavelength in the vicinity of Hα. In the training set with precisely known velocities, we achieve this by masking out tellurics in the individual spectra based on their velocity shifts. This was done by looking for flux minima with line depths above 3% in the telluric model described above and masking the neighboring wavelength ranges of one FHWM on either side. We favor this masking procedure over a detailed modeling and removal of telluric features because of the lack of knowledge of ambient observing conditions for all the spectra and the possible caveats coming from model uncertainties. Moreover, since many of the stars in the grid are represented by more than one spectrum with varying topocentric velocity, a range masked in the one may well be accessible in the other (as can be seen in Fig. 3). Keeping the tellurics in would increase the rms scatter in our Hα-based temperature scales to 280 K compared to the 122 K we find below.
3. Method
For this study, we investigated how FRs are affected by stellar parameters and, vice versa, can be used to constrain them. We define a specific FR as the ratio of the two mean flux levels of a spectrum F in the open intervals of width w around the central wavelengths λ 1 and λ 2, that is,
with
Using FRs bears the main benefit that they are scale-free, meaning they circumvent the caveats of normalization procedures. These are heavily affected by, among others, S/N and resolution of the spectrum, as well as by intrinsic physical quantities such as metallicity and temperature. Provided that the two dispersion points from which an FR is computed are closely spaced in wavelength, the local continuum can be approximated to be constant. This holds true even for merged echelle spectra with clearly extrinsic large-scale continuum variation. Another advantage of measuring FRs over employing iterative minimization approaches, such as profile fits or full spectrum fits – on which to our knowledge any other approach of determining stellar parameters relies – is the comparatively reduced computation time. Therefore, per spectrum, the demand for computational resources can be tremendously lowered.
In order to quantify the information content of an FR of a set of two wavelength regions in the grid with respect to a stellar parameter x, we chose the Pearson product-moment correlation coefficient
where N corresponds to the number of grid points that do not contain pixels masked as tellurics in either of the two wavelength bins, or which are not accessible for other reasons. Here, ⟨FR⟩ represents the mean of the measured FR i and ⟨x⟩ the mean of the investigated parameter values x i . According to Eq. (3), both strong anticorrelations and correlations, that is absolute values |r FR, x | close to unity, indicate a tight linear relation between the tested FR and the quantity x. The demand for monotonic and linear – i.e. with constant sensitivity – analytical functions describing the behavior of a parameter with an FR justifies the use of Pearson’s correlation coefficient as test measure.
Looking for the strongest correlations for each of the parameters T eff, [Fe/H], and log g, we tested all possible wavelength combinations in a window of width 17 Å – or 1000 pixels in the grid – around each dispersion point in the training set and ranked them by decreasing absolute value of the correlation coefficient. The width of 1000 pixels was chosen so that the maximum spacing between two ranges was 8.5 Å, thus ensuring the aforementioned condition of a close-to-constant continuum. In a subsequent step we excluded those FRs containing a wavelength interval that is overlapping with one of the other FRs of higher |r FR, x |. In doing so we made sure that individually measured ratios are independent of each other, thus enabling linear combinations of observables. In the following, we comment on the individual stellar parameters and the relations derived from them.
3.1. Effective temperature
According to the method outlined above, the best spectral regions to derive T eff irrespective of the other stellar classifiers appear to be the Balmer lines of neutral hydrogen, Hα and Hβ. This does not come by surprise, as in FGK stars – that is at temperatures below 8000 K – the wings of the Balmer lines are rather pressure insensitive (see Amarsi et al. 2018; and references therein) and have previously been fit to accurately constrain T eff (e.g., Barklem et al. 2002). Indeed, only in the wings of Hβ and Hα, |r FR, T eff | reaches values > 0.97. Because these wings are potentially very wide, the general method was altered to allow for a maximum dispersion spacing of 2000 pixels instead of 500 (corresponding to 34 Å instead of 8.5 Å). Hence, the FRs are in principle more prone to continuum variations, which seem to play a subordinate role, because we could not identify significant differences in the FRs among training spectra of the same targets from different spectrographs and therefore blaze functions.
In order to describe the tightest relations analytically, we first deduced the individual linear trends. For this task we fitted the function
by performing an orthogonal distance regression (ODR), that is minimizing the sum of the squared normalized orthogonal distances
of the points (FR i , T eff, i ) to the function. Here a and b denote the slope and intercept to be fit, N is the number of FRs measured for the particular relation, whereas σ T eff, ref, i and σ FR i represent the standard errors of the grid temperatures T eff, ref, i and measured flux ratios FR i , respectively. In a subsequent step the T eff fit residuals were checked in a visual inspection for trends with metallicity and/or surface gravity. Fits showing (anti-) correlations in either of the two were rejected. In addition, we omitted sets of FRs where the minimum FR does not deviate by more than 15% from the maximum FR. In doing so we reduced the influence of S/N, because larger spreads of possibly measurable FRs across the temperature range for one relation imply less sensitivity to S/N for that particular relation.
The aforementioned cleaning procedures left us with nine FR-T eff relations. The strongest for either of the two profiles Hβ and Hα is presented in Fig. 4. There, we show how T eff behaves with the FRs measured using a bin width w = 0.357 Å at (λ1, λ2) = (4860.279 Å, 4854.482 Å) and (λ1, λ2) = (6564.478 Å, 6585.898 Å), respectively. These wavelengths correspond to the blue wing of Hβ and the red wing of Hα. We explored various realizations of w and found a bin width of 0.357 Å, that is 21 pixels in the training grid’s dispersion direction, to be the best trade-off value between noise and resolution dependency on the one hand and information content on the other hand. A visual inspection of Fig. 4 confirms the strong temperature trend with FR as already indicated by r FR, T eff . The fit results along with their respective temperature residuals are indicated in the same figure. For the two extreme values in terms of T eff, for each of the two Balmer line regions, we show how the profile shapes and consequently the flux ratios of the wavelength regions of interest differ. We point out that N is not the same in both panels and does not resemble the total number of stars in the training set but the number of spectra free of telluric absorption in the regions of interest. If a star has more than one available spectrum satisfying this condition, we averaged the deduced FRs to a single FR for that star. This ensures that intrinsically identical spectra are not over-represented in the fit.
We found that optimal solutions converge toward sets of two ranges obeying the following necessary conditions: At low metallicities and/or high temperatures, the ratio should incorporate one region with low and one region with high flux variance with temperature. In this case the first acts as pseudo-continuum while the latter carries the temperature information. At high metallicities and/or low temperatures, the line depths at both wavelengths, λ 1 and λ 2, should be equally sensitive to metallicity (and, less importantly, gravity), hence assuring a constant FR at a given T eff. This behavior is similar to the one employed by the classical LDR approach of Kovtyukh et al. (2003).
The above statements are bolstered by Fig. 5, where we demonstrate how the profile shape changes with T eff at low (<−2.0 dex) and high (>−0.1 dex) metallicities. We picked four representatives for each of the two [Fe/H] bins from the training set. While being clearly identifiable in the low-metallicity regime, the FR-T eff trend is less obvious at high metallicities in combination with low temperatures. This is due to the features of species other than H dominating the wings of the Balmer lines, meaning that both components of the FR vary and have to be taken into account simultaneously. The latter observation is more pronounced in the Hβ feature, because it lies in the blue part of the optical spectrum, where metal absorption is much more frequent.
![]() |
Fig. 4. Upper panels: exemplary scatter plots for the FR-T eff relations of the two strongest correlations |r FR, T eff | in our test grid around Hβ (left) and Hα (right). The color coding is the same as in Fig. 2. Black-dashed lines resemble the linear ODR fits to the data. The small inlays at the bottom show cut-out spectra normalized by their 99th percentile for the highest (black) and lowest (gray) corresponding T eff, the former being offset by −0.1 in flux direction. Gaps in the spectra mark the expected positions of strong telluric contamination for the observed radial velocity (Sect. 2.3). The blue and green shaded regions in the inlays indicate the ranges from which the respective FRs were computed using Eqs. (1) and (2) with the values provided in the upper corner of each panel. Lower panels: residual distribution. |
![]() |
Fig. 5. Comparison of the shape changes with T eff of the Hβ (left) and Hα (right) profiles at low (top) and high (bottom) metallicities. As in Fig. 4, the spectra were normalized by the 99th percentile of the flux in the shown wavelength intervals. Gaps in the spectra indicate telluric lines. Blue and green colored ranges denote the same FRs as in the inlays of Fig. 4, while the additional colored regions show the remaining seven relations ranked by the number at the top of each panel. |
Typically, the rms scatter of T eff around an individual relation is of order 110 K–130 K. This is close to the median uncertainty of the training values of 100 K.
It is not straight forward to linearly combine results from the nine individual measurements for each star to reduce T eff uncertainties. As it turned out, the fit residuals are correlated as we show in Fig. 6. Ideally, this plot should consist of ellipses that are aligned with the coordinate axes, implying uncorrelated errors. Yet, the diagonal orientation of the ellipses indicates that the residuals are not purely noise-induced, but of systematic origin.
![]() |
Fig. 6. Residuals of the inferred and the literature temperatures for the nine strongest FR-T eff relations versus the mean residuals. The color coding is the same as in Fig. 2. Spectra with telluric contamination in one of the regions were omitted. Errors on the abscissa were computed via the standard deviation of the ΔT eff, i (see Eq. (8)), while the ordinate errors only denote the claimed uncertainty in the literature data. |
We discuss two possible reasons for this behavior, the first being the existence of hidden parameters. One or more additional parameters could affect the profile shape of Balmer lines and consequently lead to a correlation of the individual FRs and thus of the residuals of the linear FR-T eff trends. Such parameters can either arise in the observations themselves or be of stellar origin. An observational bias could be introduced by the continuum shape (i.e. the blaze function of the spectrograph) in the spectral order where the profiles appear. We tested this possibility on a normalized version of the training grid and found no significant improvement as compared to the unnormalized case. Moreover, due to their large wavelength spacing, Hα and Hβ are commonly not located in the same spectral order and hence not subject to the same part of the blaze function. Yet, there is a non-negligible correlation of fit results from Hα with the ones from Hβ.
As far as stellar parameters are concerned, [Fe/H] and log g can be ruled out to be responsible since correlations of the temperature residuals with these two were explicitly omitted. Moreover, we could not find any trends with the microturbulent velocity. For the GBS sample, there are v sini measurements available (ranging from 0 to 13 km s−1), which enable us to eliminate rotation as driving mechanism for the deviations, too. The last parameter is chemical peculiarity. Depending on the chemical enrichment history of the star, elements other than iron do not necessarily have to scale with metallicity (here [Fe/H]). Most importantly, α-elements such as Mg are strong electron donors, so that their over- or under abundance can have effects on the electron pressure in the stellar atmosphere and accordingly the line formation. Using tabulated abundances for the GBS sample (Jofré et al. 2015) we could not find any trends of the T eff residuals with abundances of any of the available chemical species.
The second plausible origin for the described systematics is the influence of inaccurate training values. So far, we have assumed that the training T eff are of utmost accuracy, in other words the true individual temperatures should not deviate significantly from the training values when taking into account their uncertainties. If we dropped this hypothesis, correlations of the fit residuals to assumed uncorrelated FRs would indicate that either the error estimates in the sample temperatures are underestimated or that the procedures adopted to derive temperatures produce inaccurate temperatures. Considering the established accuracy of bolometric flux calibrations, which were used to determine T eff for the GBS sample, this option seems unlikely. The spectroscopic determinations of T eff for the remainder of the training set, however, might be subject to, for example, NLTE-, 3D-effects, or inaccurate atomic data. These can cause the true temperature of a star to expose non-zero slopes with excitation potential in an LTE treatment, which results in offset temperatures when enforcing excitation balance (e.g., Hanke et al. 2017).
For the above reasons we split the error budget on mean temperatures derived from our relations into a statistical and a systematic component. The latter is estimated by
with
and
Here, for the jth star in the training sample, T eff(FR i ) j is the measured temperature for the ith relation and T eff, ref, j is the corresponding literature value. We find a value of σ T eff, sys ≈ 97 K. Our fit results are summarized in Table A.1.
3.2. Metallicity
Our method revealed that, using FRs, a star’s metallicity can be estimated best by investigating transitions of Fe I with low-energy, lower level. The line strength (or depth) of these features is governed by the temperature and the abundance of the respective atoms in the atmospheric layers where the lines form, while log g and v t, in comparison, play a rather subordinate role. Assuming LTE, higher temperatures shift the excitation-de-excitation equilibrium toward favoring higher occupation numbers at high-energy levels and consequently lead to lower occupation in the lower levels. This results in a weakening of lines that are excited from these levels with respect to the ones at lower temperatures. The same effect would be observed at constant T eff but at a lower [Fe/H], that is a lower number of atoms in the atmosphere column and thus less strong spectral features. We note that the former is a vastly simplified picture, which gets especially complicated by NLTE considerations, such as interactions between energy levels and over-ionization due to an enhanced UV-background (see, e.g., Lind et al. 2012, for a detailed discussion on NLTE effects on Fe lines). However, since this study concentrates on observables of real spectra and not the theoretical modeling thereof, these effects enter only to the extent that they affected the original determinations of the training values. Given that we can infer the effective temperature independently from other stellar quantities (see previous section), we can break the degeneracy between line strength (by means of FRs), [Fe/H], and T eff and hence determine [Fe/H].
To this end, we first identified transitions that are readily described by FRs and detectable in all training sample spectra. We chose to pursue an empirically driven approach and only later consult predictions from literature atomic data in order to be unbiased and not to miss features that are susceptible to stellar parameters in our FR approach. Due to the expected degeneracy between [Fe/H] and T eff, the test statistic had to be modified to the multiple correlation coefficient
where the various r denote the correlation coefficients among the respective quantities as defined in Eq. (3). Here, once again, values of r′ close to unity indicate that [Fe/H] is strongly correlated with the two independent variables T eff and FR. If the latter is satisfied, points in FR-T eff-[Fe/H] space generate a two-dimensional hypersurface. As opposed to the FR-T eff relations, it is not sufficient to describe these using only first-order terms of the independent variables. A better description can be achieved by allowing a second-order interaction term. We use the modified algebraic hypersurface
to describe the behavior. The exponential cut-off term with coefficients β and γ was introduced since for some relations [Fe/H] asymptotically drops for FRs above ∼0.9. This can be intuitively understood, because the line depth of a profile will inevitably go to zero as the metallicity decreases. As a consequence, any FR will approach unity. Therefore, depending on T eff, for some relations the metallicity sensitivity sharply decreases for [Fe/H] ≤ −2.5 dex. Since those cases are still very sensitive indicators at higher metallicities, we decided to keep them and introduce the cut-off term. Neglecting the latter for those relations would lead to systematic overestimates of [Fe/H] by up 0.5 dex in the regime of very low metallicities ([Fe/H] ≤ −2.5 dex). Figure 7 presents the closest resemblance (r′ = 0.986) to a surface described by Eq. (10) we could find. In contrast to the derivation of effective temperatures, here we are dealing with small-scale flux variations. Hence, the windows from which the mean flux levels were computed had to be decreased to 11 pixels, or 0.187 Å, at the expense of stability against S/N. The information carrier in this best case is a Fe I line at 5429.643 Å with its pseudo-continuum slightly further in the blue in a region devoid of substantial absorption. The distribution in the FR-[Fe/H] plane is shown next to its three-dimensional counterpart.
![]() |
Fig. 7. [Fe/H] dependency on FR and T eff. The left panel illustrates the distribution of points of the tightest relation in FR-T eff-[Fe/H] space and the surface they span therein. To guide the eye, the best-fit surface according to Eq. (10) is overlaid as a light-colored, distorted grid, where the grid lines indicate the isothermal and iso-FR lines. The same distribution of points is shown in the top right panel but in a coordinate frame that is rotated such that it is aligned with the FR and [Fe/H] axes. Dashed lines indicate the track of the isothermal lines on the surface. In analogy to Fig. 4, the inlay shows the most extreme spectra in terms of metallicity, as well as the two wavelength regions the FR was computed from (blue). The fit residuals with respect to FR are presented in the lower right frame. All three scatter-plots follow the same T eff color coding as indicated by the bar on the right. |
With in total 340 of these planes obeying r′ ≥ 0.95 we found surprisingly many tight relations. Following the same approach as in the previous section, we lowered this number to 41 by demanding non-significant correlations of the fit residuals with log g and v t. Hence, while the total amount of the absorbed flux in the line surely depends on pressure and turbulence, we empirically deduced ratios of pairs of smaller ranges in stellar spectra that are not significantly influenced by these two quantities.
Variations in the α-element abundances are amongst the most common departures from the solar-scaled abundance distribution. Therefore, in order to not bias our method against α-enhanced or α-depleted stars, we had to ensure that no strong feature of the species O, Mg, Si, Ca, or Ti was present in the wavelength ranges considered for the computation of the FRs. VALD offers the extraction mode “extract stellar” to retrieve atomic data and estimates of the strength of transitions in a given wavelength interval with a particular set of stellar parameters. We employed this tool to cross-match the literature wavelengths from a line list for a sun-like atmosphere (T eff = 5750 K, log g = 4.50 dex, [Fe/H] = 0.00 dex) and a solar-metallicity giant atmosphere (T eff = 4500 K, log g = 1.00 dex, [Fe/H] = 0.00 dex) with our 41 values for λ 1 and λ 2. We excluded another ten pairs of FR regions with α-element features in the vicinity (±0.5 Å). To this end, the rejection threshold for unbroadened – that is intrinsic, spectrograph-independent – line depths in both hypothetical atmospheres was set to 0.2. Consequently, the final number of clean, metallicity-sensitive FRs is 31.
The intra-relation rms metallicity scatter ranges from 0.16 up to 0.20 dex, while the inter-relation rms scatter for individual stars ranges from 0.01 to 0.31 dex. As fit residuals of individual hypersurfaces are correlated, in analogy to Eq. (6), we split the error into a statistical and a systematic part, that is σ [Fe/H],sys = 0.16 dex. The systematic error budget of 0.16 dex can be explained by the uncertainties in the training metallicities, only. We tabulate the wavelengths together with the ODR fit results for the 31 most promising FRs in Table A.2.
The species of the closest and anticipated strongest (in terms of line depth provided by VALD) theoretical transition in the Sun are provided in separate columns in Table A.2. This information, however, has to be treated with caution, because first of all – apart from lines contaminated by α-elements – we did not restrict our analysis to blend-free lines, but aimed for wavelength regions with a close to constant sensitivity to stellar parameters over a wide range of stellar parameters. Secondly, and more importantly, even if a line was isolated in the solar spectrum, this does not necessarily mean that it will be isolated in a different star with significantly different parameters. Fortunately, the interplay between these factors is intrinsically accounted for in our approach. In fact, our strongest correlation (see inlay of Fig. 7) incorporates not only the one Fe I transition listed, but several other possible Fe I blends on either side of the profile. We emphasize at this point that our method is prone to biases introduced by chemical peculiarity to the extent that any non-regular chemical behavior of a target star could affect the relative strength of a blend in our wavelength regions and therefore cause the corresponding FRs to deviate from the ones expected at the star’s metallicity. This effect can, for example, be expected to be encountered in spectra of carbon-enhanced metal-poor (CEMP) stars, where absorption bands of carbonaceous molecules dominate the appearance of wide spectral bands.
3.3. Surface gravity
The derivation of stellar surface gravity using only dedicated FRs that are valid for the whole range of stellar parameters of our sample is less well defined. For dwarfs at moderately low to high metallicities, the wings of lines in the Mg I b triplet have proven to be good log g indicators (e.g., Ramírez et al. 2006). However, for certain combinations of stellar parameters like low gravities and metallicities pronounced wings do not form and cannot be used as universal gravity indicators. In fact, the largest correlation for surface gravity with FRs in the grid spectra is found to be r FR,log g = 0.89. Unfortunately, strong r FR,log g seem to be mainly explained by the spurious correlation of T eff and log g in the grid (r T eff, log g = 0.66). Hence, it was necessary to once again increase the dimensionality of the product-moment correlation coefficient to include T eff and [Fe/H] as independent variables to end up with
where
33 combinations of wavelength ranges of width 0.187 Å in the grid satisfy R′ ≥ 0.95. Our analytical description for these 33 FRs is the hyperplane
After carefully checking the residuals for small-scale structure and contamination by α-element transitions (see previous section), we ended up having 11 reliable relations. In Fig. 8 we visualize the 4-dimensional plane in analogy to Fig. 7 for the strongest association at R′ = 0.977. In the particular case of the shown relation, the blue wavelength region includes a strong Fe II line at 5316.508 Å, while the red range falls in a continuum window. Naturally, line strengths and therefore FRs involving Fe II lines have a strong [Fe/H] sensitivity. In fact, given gravity as prior, most FR combinations discussed here would offer a good metallicity indicator. Here, we use the fact that on top of their strong metallicity- and weak temperature-dependence profiles of ionized species expose a sensitivity to stellar surface gravity. Our 11 log g relations with the involved wavelength ranges and strongest contributing features in the Sun are listed in Table A.3.
![]() |
Fig. 8. log g dependency on FR, T eff, and [Fe/H]. The left panel illustrates the distribution of points of the tightest relation in FR-T eff-log g space, where the colors introduce metallicity as a fourth dimension. The fit solutions to Eq. (13) at fixed [Fe/H] values of −3.5 dex, −1.75 dex, and 0.00 dex are shown as three parallel hyperplanes. The top right panel resembles the distribution in FR-log g space, only. There, dashed lines indicate the track of the iso-metallicity lines on the surface at a fixed temperature of 5000 K. The inlay shows the most extreme spectra in terms of gravity, as well as the two wavelength regions the FR was computed from (blue). The fit residuals with respect to FR are presented in the lower right frame. All three scatter-plots follow the same [Fe/H] color coding as indicated by the bar on the right. |
Figure 8 reveals the main weakness of our empirical approach to derive log g. Ideally, an unbiased training sample would span a regularly spaced grid in parameter space. The sample used in this study, however, lacks coverage at the extreme values of almost all parameters (with the exception of high log g dwarfs, cf. Fig. 2). Even in an idealized homogeneously distributed case, our sample size of 124 ≈ 53 stars would only allow limited explanatory power in three dimensions of independent variables (FR, T eff, and [Fe/H]). As a consequence, the low-number statistics prevents us from detecting subtle non-linearities causing systematic biases such as the ones seen in the high-FR regime in Sect. 3.2 at lower dimensionality (only two independent variables instead of three). A good example for potential non-linear behavior is the spectrum of the star β Ara. Its gravity is underestimated by ∼1 dex in almost all relations, suggesting a breakdown of the linear relation towards β Ara’s corner in the parameter space (T eff = 4197 K, [Fe/H] = −0.05 dex, log g = 1.05 dex). For the discussed reasons, one has to exercise caution when using the provided log g relations, especially for extreme cases corresponding to the edges of the parameter space studied here.
4. ATHOS
We implemented all the information provided in Tables A.1–A.3, as well as the fit covariance matrices and systematic errors of Eqs. (4), (10), and (13) in one stellar parameter tool, called ATHOS. ATHOS is a Python-based software capable of dealing with hundred thousands to millions of spectra within a short amount of time. In part, this is due to the incorporation of parallelization capabilities making use of modern multi-core CPU architectures. Using only FRs computed from input spectra and the dedicated analytical relations, the stellar parameters can be estimated in well under 30 ms. ATHOS’ workflow for each spectrum is divided into four main phases:
The first step is the read phase. ATHOS operates on one-dimensional, RV-corrected input spectra of various file structures. Among these are the standard FITS spectrum formats and binary tables, as well as NumPy arrays, comma-, tab-, and whitespace-separated ASCII files. A minimal input consists of spectral fluxes and the corresponding dispersion information. At least one of the Balmer lines Hβ and Hα should fall in the covered wavelength. Otherwise, an external estimate of T eff has to be given. For a proper treatment of noise, the error spectrum should be provided. Alternatively, a global S/N value for the entire spectrum will be set (not recommended). On a state-of-the-art machine with solid-state drive, the required time to read a spectrum with 2 × 105 dispersion points into memory ranges from 1 ms through 3 ms to 117 ms for NumPy arrays, FITS spectra, and ASCII files, respectively.
The second step masks out regions of potential telluric contamination. The correction is of utmost importance for the temperature determination from the heavily contaminated Hα profile. For this purpose ATHOS internally stores the strongest topocentric wavelengths of our telluric model (see Sect. 2.3). In order to exclude these from consideration, the velocity of the stellar rest frame with respect to the topocenter has to be provided. If Hα is not included, or if the spectrum has been corrected for tellurics, this step can be omitted. The involved operations take about 0.5 ms per spectrum.
Next, the FRs are computed. To this end, the input spectrum is linearly interpolated between dispersion points in order to compute the mean fluxes in the interval (λ − w/2, λ + w/2). The typical execution time required here for a spectrum with 2 × 105 dispersion points is ∼13 ms. In case the resolution of the input spectrum is less than 45 000, the FRs are corrected using the relations discussed in Sect. 5.1.
The final phase is the parameter cascade. Starting from the FRs measured in the previous step, this routine computes the stellar parameters according to Eqs. (4), (10), and (13) in the order T eff, [Fe/H], and finally log g. As any subsequent relation depends on the result of the former, this structure is mandatory. By default, each final parameter and its (internal) error are estimated as the median and the median absolute deviation (mad) of the distribution of results from the different relations. These two measures were chosen, because they are more robust against outliers possibly arising from spectral artifacts compared to, for example, means or weighted means. In some situations, such as highly reddened stars, the S/N may strongly vary on larger scales from the blue to the red. To take this into account, we implemented the possibility of introducing wavelength-dependent weights for the individual FR-relations in the form of polynomials with user-defined degrees and coefficients. In these cases, the weighted median is used to compute the final set of parameters. Systematic errors are propagated throughout the whole cascade. The time demand of this computation step is ∼0.6 ms.
5. Performance tests
5.1. Resolution dependencies
All the FR-parameter relations implemented in ATHOS have been trained on a spectral grid at a fixed resolution of R = 45 000. In order to be applicable to a variety of spectroscopic surveys conducted with different instruments and resolutions, these relations have to be either insensitive to R, or exhibit predictable deviations with R. For this reason, we investigated R dependencies by degrading our spectral grid to resolutions of 40 000, 30 000, 20 000, 10 000, 7500, 5000, and 2500 followed by a re-computation of all FRs which were established at R = 45 000.
Figure 9 shows how the newly computed FRs for our nine T eff indicators deviate from the originally derived ones. Since there is a simple linear relation between FR and T eff, the resulting temperature offset can be deduced by scaling the ΔFR i with the respective slope a i . We found that down to R ≈ 20 000 there is generally no significant deviation in FR, meaning the temperature deviations remain well below 150 K. For even lower values of R the situation becomes non-trivial. Then, the absolute value and sign of the offset appears to not only depend on R, but on T eff and [Fe/H], as well. For metallicities of −1 dex and below, we found significant deviations only as “late” as R ≲ 10 000. At these low metallicities, the profile shapes of the Balmer lines are dominated by H I itself and consequently expose larger scale lengths than just the line spread function of the spectrograph. Hence, convolving with Gaussians of FWHMs shorter than the range width of w = 0.357 Å – the mathematical equivalent of employing a spectrograph of lower resolution – does not disperse large amounts of flux out of the wavelength ranges from which the FRs are measured. Accordingly, the FRs remain largely unaffected. For higher metallicities than −1 dex, a non-linear offset trend of the FRs with decreasing resolution and temperature manifests itself already for R ≲ 20 000. We suspect the driving mechanism for this behavior to be metal absorption from neighboring wavelengths being dispersed into the ranges of interest, or, vice versa, being dispersed out of the ranges. The strength of this additional (or lack of) absorption is mainly determined by T eff, log g, and – more importantly – [Fe/H]. Consequently, the T eff relations are not clean anymore, but get susceptible to pressure and metallicity. Depending on the degree of isolation of the FR ranges from neighboring metal lines, this effect is more or less severe, which explains the large variety of absolute temperature deviations between relations in Fig. 9.
![]() |
Fig. 9. T eff deviations with respect to resolution and [Fe/H] ordered from the strongest to the weakest relation (see Table A.1 and Fig. 5) from top left to bottom right. The deviations were computed by scaling ΔFR with the slope of the respective relation. Points measured for the same star but at different Rs are connected by solid lines. The color coding is the same T eff scale as in Fig. 7. |
The same reasoning holds for the R dependence of the [Fe/H] relations, with the only difference being that optimal solutions usually consist of only one information carrier and a normalization component. In the vast majority of cases, resolution deviations from the training value only affect the information carrier, since the regions of the pseudo-continuum tend to be free of absorption, or at least have a negligible absorption component with respect to the information carrier. This behavior can be seen in the upper right panel of Fig. 10, where we show how resolutions between 45 000 and 2500 affect the profiles in the solar spectrum in our strongest metallicity relation. Moreover, the FR deviations for all benchmark stars at their different metallicities and temperatures are presented in the left panel. It is obvious that higher metallicities, that is lower FRs, correspond to higher deviations in FR once the resolution is decreased. In general, low metallicities cause the line depths of the information carrier to be low and thus the FRs to reside close to unity. If the flux gets dispersed out of these profiles due to broader line spread functions, the FR is only marginally influenced. At higher metallicities, in turn, the FRs tend to be lower than unity and therefore the R-induced deviation can be much larger. Empirically, we found that all corrections can be well described by the relation
![]() |
Fig. 10. Left panel: deviations of the FRs measured in all training spectra for the strongest metallicity relation (see Table A.2) at different resolutions with respect to the FRs computed at the original resolution of 45 000. In contrast to Fig. 9, we note that there is a non-linear relation between FR and [Fe/H], which is why we illustrate only the FR- and not the [Fe/H] deviations. The colored surface resembles the best-fit model to the data according to Eq. (14), where the color indicates the magnitude of the correction to be applied (see color bar on the right). Red points and their connecting line denote the track of the corrections to the FRs measured in the solar spectrum at different resolutions. Upper right panel: portion of the solar spectrum at resolutions of 45 000, 40 000, 30 000, 20 000, 10 000, 7500, 5000, and 2500 (dark- to light-colored). The color-coding refers to the same FR corrections as in the left panel. Blue vertical bars indicate the wavelength ranges from which the FR is computed. Lower right panel: derived [Fe/H] for the Sun from uncorrected (gray) and corrected (red) FRs using Eq. (10) and only the strongest metallicity relation shown in the upper right panel. |
within an acceptable margin of error. Here, FR R is the FR measured at the resolution R and p i j is a coefficient matrix with six entries. The best-fit surface that is spanned by Eq. (14) for our strongest metallicity relation is illustrated in the left panel of Fig. 10. In the lower right panel of the same figure. we show the growing importance of the FR corrections with decreasing resolution by comparing the metallicity from the uncorrected FR with the one derived using the corrected FR. Similar to [Fe/H], our log g surfaces are based on small-scale flux variations. Hence, we see comparable trends of the corresponding FR residuals with R to the one shown in Fig. 10. The coefficient matrices p i j in Eq. (14) for all analytical relations outlined in this work are tabulated in Table A.4.
Despite the fact that Eq. (14) seems to hold down to the lowest resolutions, there are additional circumstances to be considered. First of all, a decrease in resolution shrinks the range of possible manifestations of FRs, which increases the noise sensitivity of Eqs. (4), (10), and (13). Secondly, and more importantly, once the dispersion sampling of a real spectrum reaches the order of the width of the wavelength range we aim to measure – e.g., w = 0.187 Å for the [Fe/H] and log g relations – the mean of the fluxes within that width might correspond to only a fraction of the flux in one pixel. If we considered, for example, the 0.7′′ slit configuration of the X-shooter spectrograph at the VLT with its moderate resolution of ≈10 000 and a sampling of ∼5 pixels FWHM−1, we would end up with only 1.87 pixels w −1 at a wavelength of λ = 5000 Å.
In testing ATHOS’ sensitivity to spectral resolution under the aforementioned conditions, we retrieved and analyzed real spectra from the X-shooter spectral library (Chen et al. 2014) of the seven stars in common with our training sample. The results are presented in Fig. 11. There, we show the departures of the parameter values at moderate resolution (10 000) from the parameters obtained at high-resolution (45 000) for corrected and uncorrected FRs, respectively. From the diagnostic plot for T eff and the mean deviation and scatter of 29 ± 32 K in the R-corrected case we conclude that our temperature relations are very stable against R – at least in the parameter space covered by the seven stars discussed here. This holds true even for the computed temperatures without any applied correction, where we found a marginally significant deviation of 45 ± 26 K (rms). The importance of introducing resolution corrections becomes apparent when looking at the [Fe/H] findings. By not accounting for resolution effects, ATHOS would underestimate the metallicity from the X-shooter spectra by on average −0.78 ± 0.23 dex, while the deviation vanishes for corrected FRs (0.00 ± 0.09 dex). Similarly, an uncorrected mean deviation of −0.31 ± 0.22 dex in the gravity results can be alleviated to −0.13 ± 0.17 dex.
![]() |
Fig. 11. Parameter deviations between ATHOS runs on X-shooter spectra and on high-resolution spectra of the same stars from our training sample. Here, gray circles indicate the values without any correction applied to the involved FRs, while red circles incorporate the corrections described by Eq. (14). The mean deviation and scatter for each stellar parameter are depicted by solid and dashed red lines, respectively. |
5.2. Spectrum noise and systematics
In order to test the stability of ATHOS against S/N we ran a series of Monte Carlo (MC) simulations by adding Poisson noise to high-S/N spectra of the Sun and α Boo as representatives of well studied and accurately parametrized dwarf and giant stars. The resulting spectra were fed to ATHOS to compute stellar parameters. This procedure was repeated 103 times for S/N values of 100, 50, and 20. Figures 12 and 13 show the distributions of the output. It is evident, for S/N = 100 and above, that ATHOS errors are not governed by random-noise effects in the input spectra, which can easily reach down to σ T eff = 50 K, σ [Fe/H] = 0.05 dex, and σ log g = 0.05 dex. This has been discussed already in Sects. 3.1–3.3. The systematic errors provided there (σ T eff, sys = 97 K, σ [Fe/H],sys = 0.16 dex, σ log g, sys = 0.26 dex), readily account for the deviations of non-stochastic origin found in the MC simulations.
![]() |
Fig. 12. ATHOS outputs from an MC-simulation on the solar spectrum. Noise was artificially induced such that S/N values of 100 (gray), 50 (dark blue), and 20 (light blue) with 103 random initializations each were realized. The literature parameters (5771 K, 0.00 dex, 4.44 dex; Heiter et al. 2015) are indicated by black dashed lines. |
Another implication of Figs. 12 and 13 is the strong inter-dependency among the stellar parameters visualized by the inclination of the distribution ellipses. This does not come by surprise as the parameter cascade (phase 4) within ATHOS requires the output of all previous steps as input. The only truly independent quantity is T eff, because we made sure that the respective relations are free of significant [Fe/H] or log g trends. Nevertheless, any deviation from the true value of T eff enters the [Fe/H] relation Eq. (10) as a constant and a (mild) additional slope with FR. Likewise, T eff and [Fe/H] offsets propagate linearly onto the log g surfaces described by Eq. (13). We emphasize that this kind of behavior is also immanent to the established methods of using EWs or spectrum fitting to determine stellar parameters (see, e.g., the detailed discussion in the appendix of McWilliam et al. 1995). The inter-dependencies of the stellar parameters, however, tend to be neglected in most published studies; a practice that only started to change more recently (see, e.g., García Pérez et al. 2016). As our approach is capable of predicting temperature irrespective of the other two quantities, we can at least break the degeneracy with T eff similar to studies employing spectrum fitting of Balmer lines (see, e.g., Barklem et al. 2002).
5.3. Comparison to spectroscopic surveys: ELODIE 3.1
The ELODIE library (Prugniel & Soubiran 2001) in its current version 3.1 (Prugniel et al. 2007) consists of 1959 spectra of 1389 stars obtained with the ELODIE spectrograph at the Observatoire de Haute-Provence. The spectra have a continuous coverage from 3900 to 6800 Å at a nominal resolution of 42 000 and were released together with a catalog of stellar parameters compiled from literature data and quality flags. A restriction to the two best out of the four quality flags – i.e. maximum uncertainties in T eff and [Fe/H] of 115 K and 0.09 dex – left us with 288 spectra of 201 stars. The median S/N of these spectra is 123 at 5550 Å, while the minimum and maximum is 38 and 411, respectively. The library is provided such that the spectra are already shifted to the stellar rest-frame and tellurics were already masked, enabling an immediate analysis with ATHOS. The results of an ATHOS run on the 288 spectra and a comparison to the literature values are shown in Fig. 14.
![]() |
Fig. 14. Upper panels: comparison of the ATHOS output for T eff, [Fe/H], and log g with literature results for the ELODIE spectral library (version 3.1, Prugniel et al. 2007, see text for quality cuts made here). In case a star occurs with multiple spectra in the library, it is reflected by more than one point here. Dashed lines represent the one-to-one relation. The colors reflect the internal statistical uncertainties computed with ATHOS. Lower panels: residual distribution. All residuals are determined via Δx = x lit − x ATHOS. |
We find an excellent agreement between ATHOS values for T eff and the Elodie compilation. The mean deviation and scatter over the entire temperature range is 64 ± 76 K (rms). For 4750 K < T eff < 5500 K, there seems to be a systematic offset of 105 K from unity. Cross-matching our training sample with the ELODIE library resulted in five overlaps in the temperature range in question (Table 3). A possible source for the deviation could be the fact that our training T eff are on average 71 K cooler. Two points in Fig. 14 around 5750 K are clearly off from the overall trend by more than 200 K. Both correspond to the star HD 245, which ATHOS consistently finds to be ∼240 K warmer than the literature value of 5433 K. Comparing ELODIE spectra of the Sun with the two spectra of HD 245 shows a remarkable similarity of the Balmer profiles Hα and Hβ between the two stars. We conclude that both should have an almost identical temperature. Hence, the ATHOS result for HD 245 should be more reliable, because it is closer to the solar value (5771 K). A visual inspection of the Balmer profiles in some of the ELODIE spectra revealed another possible explanation for T eff differences to be bad pixel artifacts. We found several unphysical spikes with heights of a few 10% of the continuum level neighboring Hα and Hβ. These are neither masked nor flagged in the ELODIE library and can possibly falsify parameter measurements.
T eff for overlapping stars between our training sample and the ELODIE library in the range 4750 K–5500 K.
The mean deviation in [Fe/H] is 0.21 ± 0.18 dex (rms). We investigated this behavior in the Sun. There are in total six solar spectra available through the ELODIE library, four of which satisfy S/N > 100. From these we derive [Fe/H] = −0.19 dex with a negligibly small scatter. In Fig. 15, we compare a small wavelength portion to one of the solar spectra from our training grid. Here, the line cores in all ELODIE spectra do not reach as low as the ones in the reference spectrum. As a consequence, the overall line strength inferred from ELODIE is weaker, which translates into higher FRs and hence lower metallicities from ATHOS. Despite the fact that, compared to our training grid, ELODIE’s resolution is slightly lower (42 000 vs. 45 000), this finding cannot be attributed to resolution differences. If resolution were the sole reason, the missing line depth in the profile cores would be fully recovered in the wings, such that the total area of the profile stays constant and the integrated residuals evaluate to zero. This is not observed. We conclude that there must be an additive flux offset to ELODIE library spectra that leads to an unphysical weakening of absorption lines.
![]() |
Fig. 15. Upper panel: comparison of the four ELODIE spectra for the Sun with S/N > 100 (gray) to the atlas spectrum by Hinkle et al. (2000) (red dashed). The latter was degraded to match the resolution of our training grid and re-sampled to the ELODIE dispersion scale. The blue regions show the fluxes used in one of our [Fe/H] relations. All spectra were normalized to the mean flux in the rightmost blue band. Lower panel: flux residuals. |
Looking at log g, we computed a mean difference and scatter of 0.29 ± 0.30 dex (rms). The residual distribution for giants (log g ≤ 2 dex) is well described by random scatter, while for higher gravities there is a positive offset, which correlates strongly with Δ[Fe/H] and ΔT eff. This is an expected behavior, because, as discussed in Sect. 5.2, our log g relations are very sensitive to prior metallicity estimates.
5.4. Comparison with the S4N library
S4N is a high-resolution spectral library of 119 stars by Allende Prieto et al. (2004). The library encompasses a complete census of stars in the local volume (R ⊙ ≤ 15 pc) down to an absolute magnitude of M V = 6.5 mag. The spectra were obtained with either the 2dcoudé spectrograph (R ≈ 52 000) at McDonald Observatory, or the FEROS spectrograph (R ≈ 45 000) at the ESO La Silla observatory. The S/N of the sample is very high (several 100s pixel−1), so that noise-induced effects are uncritical. The spectra cover all wavelengths of all of our parameter relations (3620 Å to at least 9210 Å). The provided T eff was based on photometric colors assuming negligible reddening, while the metallicities were determined via full spectrum fitting of a 150 Å range around Hβ (see Allende Prieto 2003). S4N gravities were inferred from fitting theoretical isochrones using Hipparcos parallaxes. We note that the sample may not be as homogeneous as wished, for example, HD 82328 is a spectroscopic binary and HD 10780 is a BY Dra variable.
We computed stellar parameters for the S4N library using ATHOS. To this end, we masked the Hα profiles of the FEROS spectra, because Allende Prieto et al. (2004) caution that those features have unreliable shapes due to fiber reflections within the spectrograph. Moreover, since the spectra are velocity-shifted but not corrected for tellurics, we had to cross-correlate our line list of tellurics with the library spectra in order to compute the respective velocities of the topocenter. This step was necessary to be able to include the Hα relations of the 2dcoudé spectra without being biased by telluric contamination. Three binaries (HD 110379, HD 188088, and HD 223778) were excluded from consideration.
For T eff, there seems to be a constant offset between the library parameters and ATHOS. Our temperatures are on average 105 ± 92 K (rms) warmer. We suspect that this is due to the literature values originating from photometric calibrations. The reader is referred, for example, to Lind et al. (2008) for a comparison of the Alonso et al. (1996, 1999) temperature scale with the one obtained from fitting Hα. Allende Prieto et al. (2004) cross-validated their photometric T eff using the method of fitting Balmer lines by Barklem et al. (2002). Even though they have found almost negligibly warmer temperatures by on average 35 ± 84 K (rms), it seems that the small-scale trends in their Fig. 5 would – taken as correction for the literature values – improve our systematic discrepancies. Table 4 shows a comparison of the temperatures of the stars in common between our sample and S4N. All of them are also part of the GBS and therefore have highly reliable training temperatures, which are on average 88 K warmer. Another striking evidence showing that the photometric calibrations are systematically cooler than spectroscopic results can be seen when we compare our findings to those for the 32 stars in common with Kovtyukh et al. (2003). Their method for determining T eff is based on spectroscopic line-depth ratios and has proven to have vanishingly small internal errors. In fact, as can be seen in Fig. 17, employing their published temperatures reduces the mean deviation and scatter of ATHOS results to 14 ± 81 K (rms), in other words the scatter approaches the order of the systematic uncertainty expected for our relations, σ T eff, sys = 97 K.
T eff for overlapping stars between our training sample and the S4N library.
In the middle panel of Fig. 16, there are several clear outliers (marked by red star symbols), which are predicted by ATHOS to have much lower metallicities than their tabulated reference values. Upon closer investigation, we found that the spectra expose significant rotational broadening. In fact, using v sin i values measured by Allende Prieto et al. (2004), there is a well defined behavior between Δ[Fe/H] and v sin i (see Fig. 18). To first order, rotational broadening looks similar to broadening by Gaussian line-spread functions. Thus the same reasoning as for the resolution dependencies in Sect. 5.1 holds: Rotation disperses flux out of the small-scale profiles used to determine [Fe/H]. Despite its implemented correction for R, ATHOS is currently not capable of characterizing stars at rotational speeds ≳5 km s−1.
![]() |
Fig. 16. Same as Fig. 14, but for the S4N library (Allende Prieto et al. 2004). Red star symbols resemble stars with rotational velocities v sin i ≥ 5 km s−1. For better visibility, we did not show these stars in the residual distributions of the middle and right panel, because they are far off (see discussion in the main text). |
![]() |
Fig. 17. Comparison of the residual temperature distribution of the S4N literature temperatures (see bottom left panel of Fig. 16) with the ones in common with Kovtyukh et al. (2003, red points). Vertical red lines connect points sharing the same spectrum and therefore ATHOS output. The red dashed line corresponds to the mean difference of 14 K when using the Kovtyukh et al. (2003) reference and the solid red lines indicate the ±1σ scatter of 81 K. |
Overall, the [Fe/H] differences between S4N literature values and ATHOS results are small, at an average of −0.06 ± 0.13 dex (rms). In the high metallicity regime, we find higher metallicities by 0.14 ± 0.10 dex (rms). Assuming that the model temperatures for the full spectrum fitting of Allende Prieto et al. (2004) were systematically too cool, the majority of the synthesized (neutral) profiles would have been stronger at the true metallicity. As a consequence, [Fe/H] would have been underestimated to match the observed spectrum. This probably explains part of the ATHOS metallicities being slightly higher. Given that the quoted errors on S4N gravities are very small, we can conclude from the linear Δlog g trend with log g ATHOS (see right panels of Fig. 16) that our method’s internal gravity error is relatively large. On average, however, the deviations do not exceed the σ log g, sys = 0.26 dex provided for ATHOS (Sect. 3.3).
5.5. Comparison with the Gaia-ESO survey
The Gaia-ESO survey (henceforth GES, Gilmore et al. 2012) is a large-scale spectroscopic survey of ∼105 stars in the Milky Way. Here, we studied how ATHOS performs on the high-resolution (∼47 000) UVES spectra (4800–5800 Å) of field and cluster stars that were released in data release 3.1 (DR3.1). GES published a catalog of recommended astrophysical parameters (Smiljanic et al. 2014). The spectra were analyzed by 13 different automated pipelines (“nodes”). Seven nodes measure EWs of Fe I and Fe II (see Sect. 2.1) via fully automated codes. The remaining six nodes fit either observed or synthetic spectra. All subgroups employed the same GES line list. The recommended, publicly available set of parameters are the weighted median results from all 13 nodes. In order to construct the respective weights, the nodes were applied to the GBS and the results split in three groups: metal-rich dwarfs, metal-rich giants, and metal-poor stars (here grouping dwarfs and giants). The average difference for each group was then used to estimate the weights for the respective nodes. We point out that, while vaguely accounting for changing systematic errors with gravity (at least for high-metallicity stars) and metallicity, this treatment does not consider varying systematics of individual nodes over the considerably large temperature range of the survey. If any such differential systematic deviation existed, it would bias the recommended temperature value. In order to be consistent with our reference solar Fe-abundance logϵ(Fe)⊙ = 7.50 dex by Asplund et al. (2009), we subtracted 0.05 dex from GES metallicities, which are based on Grevesse et al. (2007).
The spectra were shifted to their rest-wavelengths using RVs provided in the GES parameter catalog. Spectra with S/N < 40 were omitted, as well as stars/spectra with GES peculiarity or technical flags including possible binarity, Balmer emission, strong molecular bands, strong rotation, radial velocity problems, or oversubtraction issues. The lower temperature limit was set to be 4000 K3. Moreover, in terms of parameter quality cuts, we set upper thresholds for the GES internal errors at 150 K and 0.2 dex for T eff and [Fe/H], respectively. The final test sample comprised 1009 spectra of 912 stars. The median S/N is 67, which is considerably lower compared to the ELODIE and S4N libraries. The results are illustrated in Fig. 19.
![]() |
Fig. 19. Same as Fig. 14, but for the high-resolution spectra and recommended parameters from the Gaia-ESO survey DR3. Red stars, golden triangles, orange diamonds, and blue pluses indicate the peculiar spectra we found to show high rotational broadening, strong molecular blending, binary components, or erroneous wavelength calibrations, respectively. In the residual distributions we excluded those, because they resemble extreme outliers (see text and Fig. 20). |
The most striking feature is an apparent bifurcation in the [Fe/H] comparison (upper plot in the middle panel) very similar to the one in Fig. 16 which is induced by strong rotational broadening of the underlying spectra. We visually inspected the spectra of the stars with the strongest disagreement between GES and ATHOS metallicities and identified four distinct reasons (see Fig. 20). The most frequent reason for deviations is rotational broadening. This is surprising, because the GES spectra that were flagged as exposing fast rotation were explicitly omitted. The same applies to stars flagged as showing binary signatures. Yet, we identified several spectra showing either pronounced double-lined or asymmetrically distorted single-lined profiles over the entire spectral range. The third intrinsic effect is strong blending by molecular features and was identified in eight spectra of two stars. These molecular features prevail in cool, metal-rich dwarfs and affect most of the otherwise blend-free components of the FRs used in the [Fe/H] relations (Eq. (10)). ATHOS wrongly identifies the affected stars as being more metal-poor, because the portions of metal lines and their normalizing component are closer in flux than they would usually be at a given metallicity. The effect is 0.5 dex for the two stars studied here. For the remaining outliers, we found that there are severe inconsistencies in the wavelength calibration of the spectra. This can be seen in the lower right panel of Fig. 20: with increasing wavelength the blue spectrum appears compressed in comparison. We suspect that this is due to the wavelength solution diverging toward the edges of individual echelle orders. For some spectra the effect is so extreme that in the regions of overlapping orders, the order merging resulted in the spectrum appearing twice, spaced by up to 1.5 Å apart. We conclude that the observed disagreements are not caused by the method implemented in ATHOS, but by issues in the reduction pipelines and quality control of GES.
![]() |
Fig. 20. Comparison of representative peculiar spectra (colored in the same way as in Fig. 19) to the solar spectrum (gray). An approximate normalization was performed by manually setting the continuum. The wavelength ranges were chosen such that the respective reason for peculiarity is illustrated best. The GES identifiers of the stars are (from upper left to lower right) 08100380-4901071, 18184436-4500066, 23001517-2231268, and 08090542-4740261. |
Over the entire T eff range, ATHOS temperatures are marginally warmer by 39 ± 101 K (rms) compared to the recommended GES parameters. A closer look at the lower left panel of Fig. 19 showing the residual distribution reveals two subgroups. The dividing T eff is at around 5500 K. Below this temperature, there is an excellent agreement at a mean difference and scatter of 14 ± 97 K. Above 5500 K, on the other hand, the mean deviation is −83 ± 82 K (rms), showing that GES is slightly cooler than our findings. Possible reasons are the potential systematics in the averaging process within GES on the one hand, or subtle non-linearities of T eff with the FRs measured in the Balmer profiles, which were not detected. Nonetheless, given the rather low quality of the GES spectra in terms of S/N, it is remarkable that the scatter after removing the discussed biases of the two temperature groups is of order ∼90 K.
ATHOS recovers the GES [Fe/H] with a mean deviation and scatter of −0.04 ± 0.15 dex. Again, considering the low S/N of the GES products and the fact that metallicities are computed from only a few pixels in the spectra, this is an extraordinary result. At super-solar metallicities there is a slight disagreement in that ATHOS metallicities are higher by ∼0.15 dex. We argue that the trend arises due to the comparison sample (GES), which in Smiljanic et al. (2014), Fig. 18, shows the same trend. We point out that our method returns an rms of 0.15 dex for the entire sample, while they only show the GES stars for which the dispersion is less than 0.20 dex (so the trend may be even more pronounced). ATHOS accurately reproduces GES gravities at mean residuals of −0.18 ± 0.35 dex (rms).
5.6. Comparison with globular cluster studies
While the previous sections focused on an assessment of ATHOS’ performance on surveys targeting heterogeneous stellar populations, we also aimed at testing results from the very homogeneous populations of RGB stars in globular clusters (GCs).
5.6.1. The Carretta et al. (2009) sample
We compare our method to the UVES study by Carretta et al. (2009, henceforth C09) of 202 stars of 17 GCs spanning a wide range of metallicities and masses. Out of the 17 GCs, four – NGC 104 (47 Tuc), NGC 2808, NGC 6752, and NGC 7078 (M 15) – were re-reduced from the C09 archival data by GES and made publicly available through DR34. Reduced spectra for the remaining 13 clusters were kindly provided by E. Carretta (priv. commun.). For T eff, C09 relied on photometric calibrations and the assumption that the RGB of a GC is intrinsically narrow, which was used to infer T eff from V magnitudes alone. Despite the lower star-to-star errors, we emphasize that the approach might still be subject to biases introduced by the photometric calibrations, E(B − V), or differential reddening. C09 determined surface gravities from isochrone fitting and Fe abundances using EWs of both ionization stages5.
We deduced a median S/N of 56 for the bluer spectral regions, where most of our FRs reside. Lacking information about the topocentric velocities by the time of the observations, Hα profiles were masked out entirely, so that tellurics would not hamper the analysis. Figure 21 presents our results. On top of the C09 literature values, we show the recommended parameters by GES, which were based on a re-reduction and analysis of C09 raw data and make up a considerable fraction of the metal-poor stars discussed in the previous section.
![]() |
Fig. 21. Same as Fig. 14, but for the high-resolution UVES sample of GC stars from the study of C09. Points connected by a red line represent the same spectrum but have two different literature values. The additional red open circles indicate the recommended parameters for DR3 of GES. |
ATHOS temperatures deviate on average by −30 ± 132 K (rms) from C09, hence there is an excellent agreement between the entirely photometry-based C09 T eff-scale and our purely spectroscopically constrained one. We note that the intra-cluster temperature deviations are less scattered than the global scatter of 132 K. There seems to be a systematic offset between individual clusters on the two scales. In Table 5, we present the average temperature and [Fe/H] deviations between C09/GES and ATHOS. For example, the five stars of NGC 6171 constitute five of the six strongest deviating temperatures (lower left panel of Fig. 21). The mean deviation for this GC is −300 K with a scatter of only 80 K. Likewise, ATHOS predicts NGC 6397 stars to be cooler than the C09 findings by on average 204 ± 42 K (rms). Generally, the very low intra-cluster scatters confirm the high precision of the Balmer-relations implemented in ATHOS. Systematic inter-cluster biases could either be founded in inaccuracies of the FR-relations for Balmer profiles in ATHOS, that is linked to unresolved systematics connected to additional parameters, or in the already mentioned possible caveats of the photometric relations of C09. For NGC 6171, we suspect dereddening to be the main source causing the offset (cf. O’Connell et al. 2011) and for NGC 6397, assuming a too warm estimate of T eff for all cluster members in C09 could explain the discrepant metallicities of C09 and Koch & McWilliam (2011).
Mean T eff and [Fe/H] residuals of C09 and GES with respect to ATHOS for individual GCs, as well as the mean [Fe/H] and scatter determined in the three studies.
Globally, our metallicity scale agrees within ⟨Δ[Fe/H]⟩=0.03 ± 0.19 dex (rms) with the one by C09. For the fairly large scatter, random uncertainties are likely to play a rather minor role. Figure 21 implies a trend of Δ[Fe/H] with metallicity, which is also reflected on the individual cluster scale (see Table 5). Deviations of clusters are most likely caused by a drift in the T eff scale. This is confirmed by Fig. 22, where we present a strong correlation between the mean temperature deviations and the mean metallicity residuals between C09 and ATHOS for individual GCs. In case of the strongest outlier in positive direction, NGC 6397, Koch & McWilliam (2011) found a metallicity of −2.10 dex compared to −2.03 dex. Adopting these values would reduce ATHOS’ offset to 0.20 dex. At this point it is noteworthy that Lind et al. (2008) found much lower metallicities for NGC 6397 in better agreement with our findings for a stellar sample between the TO and the blue RGB. However, they showed that there is a significant trend of [Fe/H] with evolutionary stage (−2.41 dex to −2.28 dex).
![]() |
Fig. 22. [Fe/H] residuals averaged per GC versus mean temperature offset. The color coding resembles the cluster metallicity (see color bar on the right). Light colored error bars indicate the respective σ ΔT eff and σ [Fe/H] from Table 5. Red lines connect the same clusters corresponding to the same stars and spectra, but with stellar parameters either from C09, or GES DR3 (red circles). |
We found a marginal overall deviation of 0.17 ± 0.55 dex (rms) in the residual distribution of log g between C09 and ATHOS. Given that all stars considered here are giants clustering around log g ≈ 1.3 dex, the apparent trend in the residual distribution can be attributed to larger errors on individual ATHOS gravities leading to an increased star-to-star scatter with respect to C09. The difference between the two distributions produces the observed anticorrelation of log g ATHOS and Δlog g. Given that most of the information carried by the FRs is related to [Fe/H] (see Sect. 3.3), here the S/N of the giant spectra probably plays an important role for the large scatter in log g.
5.6.2. The MIKE sample
During the last ten years, we have published a series of detailed spectroscopic studies of GC member stars. In total, 46 targets of six GCs (see Table 6 for details and references) have been observed using the high-resolution (R ≈ 40 000) MIKE spectrograph mounted at the 6.5 m Clay telescope at Las Campanas Observatory. The stellar sample of NGC 6397 contains two MSTO stars, for which Koch & McWilliam (2011) used profile fits to Hα to deduce T eff. With the exception of stars belonging to NGC 5897 and NGC 6864, all temperatures of RGB stars were derived in a differential approach, that is excitation balance was enforced differentially on abundances of Fe I lines with respect to the same lines in well understood standard stars with reliable reference parameters (see Koch & McWilliam 2008). These should be as close as possible to the stellar parameters of the stars to be analyzed. Then, the differential treatment cannot only forgo relying on uncertain oscillator strengths, but reduce the influence from NLTE-induced excitation imbalances as well (Hanke et al. 2017; and references therein). In all studies, log g was computed from basic stellar structure equations. Here, we refer to abundances of Fe II, which were computed from EWs as [Fe/H].
Comparison of the literature stellar parameters of the MIKE GC sample to ATHOS.
Figure 23 together with Table 6 show the comparison between the original works on the GCs and an ATHOS run. The mean T eff difference is −12 ± 161 K (rms), with a scatter dominated by differences between individual clusters. In terms of temperature, we found one cluster to systematically deviate from the one-to-one trend.
![]() |
Fig. 23. Same as Fig. 14, but for a sample of MIKE spectra of GC stars. The colored open circles indicate the membership in either of the GCs 47 Tuc (blue, Koch & McWilliam 2008), M 5 (pink, Koch & McWilliam 2010), NGC 6397 (yellow, Koch & McWilliam 2011), M 75 (orange, Kacharov et al. 2013), NGC 5897 (black, Koch & McWilliam 2014), or NGC 6426 (red, Hanke et al. 2017). |
While exposing a rather small star-to-star dispersion in the residuals, all four stars of NGC 6426 are predicted to be cooler by ∼203 K compared to their respective reference values. One possible explanation for this observation is that none of the stars in Hanke et al. (2017) satisfy excitation equilibrium of Fe I in an absolute abundance treatment. In the differential approach pursued in that work the imbalances were compensated by the strong excitation imbalance (on the absolute scale) of the very similar, metal-poor benchmark star HD 122563. The imbalance in this star is likely to originate from NLTE effects (Mashonkina et al. 2011). We showed that in order to achieve excitation equilibrium T eff had to be lowered by 200 K compared to the differential case. As the majority of the temperatures of the metal-poor training stars in this study were determined using absolute abundances, the offset for NGC 6426 can be explained by ATHOS’ temperature scale being tied to absolute rather than to differential methods. We point out that the other two metal-poor clusters, NGC 5897 and NGC 6397, do not show the same deviation. NGC 5897 was studied with an absolute treatment, hence the findings are accurately reproduced by ATHOS. For NGC 6397, the employed benchmark star was Arcturus, which does not show any excitation imbalances in an absolute LTE analysis (Koch & McWilliam 2008). Consequently, the differential temperature scale is essentially the same as the absolute one.
Two GCs, namely NGC 6397 and NGC 6864, exhibit a large scatter (285 K and 179 K, respectively) with respect to what we would expect given the high S/N of the spectra. This was partly due to Hβ not being covered by most of the MIKE spectra and additional loss of Hα relations due to telluric contamination. An important source for the strongest offset (∼400 K), which is shown by the RGB star #13414 in NGC 6397, is its strong mass-loss that manifests in wind-induced emission spikes in the profile wings of Hα. For the two MSTO stars in NGC 6397, we identify the treatment of different broadening mechanisms in the LTE Balmer synthesis of the original work as possible origin of deviations in the temperature scales. The large intra-cluster scatter of Δ T eff for NGC 6864 (156 K) remains inexplicable.
In terms of metallicity, the mean residual deviation and scatter is −0.06 ± 0.20 dex. Similar to before, neither mean nor scatter are representative for all the subsamples and differences can be linked to deviating temperatures of individual stars.
The comparison of the literature gravities with our results for log g confirms that especially for gravities well below 1 dex ATHOS can only provide a rough estimate. Partly, this is also related to rather low S/N values of the spectra, which increase the internal log g errors of ATHOS (see right panels of Fig. 23).
5.7. Comparison with SDSS
The Sloan Digital Sky Survey (SDSS, Abolfathi et al. 2018) released several hundred thousand optical spectra at low resolution (R ≈ 1500–2500) with an accompanying catalog of stellar parameters for a subset thereof. These parameters were determined using the SEGUE Stellar Parameter Pipeline (SSPP, Lee et al. 2008), which obtains stellar parameters from several photometric and spectroscopic estimators. The final adopted SSPP parameters are averaged from the individual measurements using a decision tree that excludes certain results based on photometric quality cuts and S/N values of the analyzed spectra (see Lee et al. 2008, for a detailed discussion).
We used the DR14 SSPP catalog – which has not changed since DR12 – to retrieve and analyze spectra obeying our own quality cuts. These were set such that only spectra without peculiar flags, maximum SSPP uncertainties on T eff, [Fe/H], and log g of 50 K, 0.05 dex, and 0.1 dex, as well as with S/N > 80 were considered. The fact that we analyzed the resulting 3966 spectra within 16 s, that is 4 ms per spectrum, impressively shows that ATHOS’ execution time is merely limited by the size of the input spectra (in this case ∼3800 dispersion points). We illustrate the comparison between ATHOS and SSPP results in Fig. 24.
![]() |
Fig. 24. Same as Fig. 14, but for 3966 SDSS low-resolution (R ≈ 1500–2500) spectra. As opposed to earlier figures, here the color does not represent ATHOS errors but the local point density (arbitrary units) obtained via a Gaussian kernel-density estimate. |
The mean offset and rms scatter in T eff is −37 ± 115 K, which proves our new temperature indicator to be accurate (on the scale of SSPP) and precise even at resolutions as low as ∼2000. Further, despite not being covered by our benchmark sample, the temperature range between 6500 K and 7000 K appears to be described well by extrapolating our linear FR-trends.
In terms of metallicity, we found a bias of 0.20 ± 0.24 dex (rms) and additional substructure in the residual distribution. This is not further surprising, because the pixel spacing in SDSS spectra resides mostly well above 1 Å. Hence, ATHOS determines metallicities (and gravities) for the individual relations from wavelength ranges (w = 0.187 Å) that span less than a fifth of a pixel’s width. Due to this, any inaccuracy in the wavelength solution or in the radial velocity determination of the input spectra adds a systematic contribution that potentially has a large influence on the deduced parameters. Our log g estimate is accurate on the SSPP scale on a 0.01 dex level with a scatter of 0.50 dex. While its T eff method is highly accurate, ATHOS’ metallicity and gravity estimates at these low resolutions are rather vague. Nevertheless, they can still be used, for example, as means to distinguish dwarfs from giants and metal-poor from metal-rich stars.
5.8. Comparison across surveys
A key goal of this project was to enable cross-validation and parameter homogenization across various survey pipelines. This main strength of ATHOS is illustrated in Fig. 25, where we show the combined results of all six data sets (ELODIE, S4N, GES, C09, our MIKE GC studies, and SDSS/SEGUE) that have been described in detail before. It is noteworthy that all 1579 high-resolution spectra of different resolutions originating from six different instruments were analyzed within 43 s, corresponding to an average execution time of 27 ms per spectrum. For the 3966 low-resolution spectra the average time demand was considerably lower at 4 ms. We emphasize that this was achieved using only one CPU core on a regular laptop.
![]() |
Fig. 25. Comparison of the results from all surveys discussed in Sects. 5.3–5.7. The color coding is indicated by the legend in the right panel. |
From Fig. 25 we conclude that using ATHOS’ homogeneous parameter scale together with its wide-ranging applicability enables the detection of discrepancies between the methods employed by the different surveys. Our in-depth investigations of the individual samples (see earlier Sects.) revealed that departures from the ATHOS scale – especially for T eff – are likely to be founded either in erroneous spectroscopic data, or possible shortcomings of the various survey pipelines.
6. Conclusion
In this paper we presented a novel approach of deriving the fundamental stellar parameters T eff, log g, and [Fe/H] by employing parameter-sensitive FRs of close-by wavelength windows in optical stellar spectra. Using a set of simple analytical relations, the FRs described here were tied to the well-calibrated parameters of a training set of 124 stars. The training sample comprised stars spanning a wide domain of stellar parameters, that is dwarfs and giants of spectral type F to K that range from very metal-poor ([Fe/H] ≈ −4.5 dex) to super-solar metallicities. The introduced technique is non-iterative – that is computationally inexpensive – and depends on models only to the degree that the training parameters have been determined via model-dependent approaches.
Our method was implemented into a Python-based code we call ATHOS, which is made publicly available6. Being fairly simple, it is straightforward to implement the relations established here in any other piece of spectrum analysis code. We have shown that our method accurately recovers the atmospheric parameters of the large samples of the ELODIE and S4N libraries, the high-resolution part of the Gaia-ESO survey, the GC studies by C09 and our own group comprising in total 20 clusters, and – at least in terms of T eff – the SDSS/SEGUE survey. Thus, ATHOS temperatures have proven to be insensitive to spectral resolution over more than a magnitude of resolutions (R ≈ 2000–52 000), while [Fe/H] and log g are reproduced accurately down to R ≈ 10 000.
Our machine-learned, optimal FRs converged toward determining a star’s effective temperature from wavelength regions around the Balmer lines Hβ and Hα. As opposed to many other parameter-estimation methods, this can be done truly independent from any other stellar parameter, such as metallicity. We could show that our new temperature estimator agrees very well with the LDR-based scale of Kovtyukh et al. (2003) (Δ T eff = 14 ± 81 K, based on spectra of 32 stars). Our method, however, has a much more extended range of applicability, because it is also capable of analyzing giants and – due to relying on Balmer- instead of metal lines – metal-poor stars down to metallicities of ∼ − 4.5 dex.
Based on the determined temperature, we can use FRs involving spectral features of atomic and ionized species to deduce [Fe/H] and log g, respectively. We have been more lenient with the gravity estimates, since we assume that Gaia parallaxes (Gaia Collaboration et al. 2018) will – at least for the brighter and/or near-by stellar populations – oust any spectroscopic method in the future.
Provided the optical spectra include parameter sensitive wavelength regions like Hα, ATHOS can efficiently and fast (a few milliseconds per spectrum) determine the three key stellar parameters, T eff, [Fe/H], and log g. Unlike previously established automated methods of determining stellar parameters such as EW-matching algorithms (e.g., Smiljanic et al. 2014; and references therein) or full spectrum fitting techniques (e.g., The Cannon, Ness et al. 2015), the approach introduced here is largely insensitive to continuum normalization and resolution of the spectra. Hence the strength of ATHOS is that it is not confined to a specific survey or spectrograph and that it does not require a separate training for each individual survey. Furthermore, it can deal with any spectral type (within the training sample’s limitations) irrespective of a priori assumptions or priors. These aspects constitute ATHOS’ main strength, which is the determination of homogeneous stellar parameters across surveys.
Our new tool is capable of parameterizing one million stars in just about eight hours on a single CPU core. Hence, in its current version, ATHOS can homogeneously and robustly analyze all high-resolution spectra ever to be observed with 4MOST and WEAVE in less than one day, without any manual interference or having to use extensive computational resources. In addition, it is a very useful tool for flagging outliers (binaries, fast rotators, chemically peculiar stars) which previous surveys like GES may have overlooked.
Future add-ons in ATHOS will focus on the investigation of peculiar stars like CEMP stars, fast rotators, and low-gravity (log g < 1 dex) stars. Another class of peculiar stars we desire to explore is the collection of variable stars, such as RR-Lyrae or Cepheid variables, which can exhibit atmospheric shocks that might distort spectral profiles like Hα. Finally, we will examine the possibility to extend our FR-based method to near-infrared spectra in order to enlarge ATHOS’ wavelength applicability.
Here taken as effective temperature, Teff, surface gravity log g, microturbulence, vt, and the overall metallicity [M/H], which we will use synonymously with [Fe/H] in the following, though we are aware that the latter nomenclature is at odds with the formally correct definition. We chose, however, to follow the common usage in the literature. Higher order parameters such as stellar rotation will only be briefly discussed.
For further information on the discrepancy of photometric and spectroscopic parameter scales, see Sect. 5.4.
Acknowledgments
A. K. and E. K. G. gratefully acknowledge support from the Collaborative Research Cluster “The Milky Way System” (SFB 881) of the German Research Foundation (DFG), particularly through subprojects A5, A8, and A9. We thank the referee for helpful and constructive inputs. The authors are grateful to E. Carretta et al. for providing ready-to-use spectra of their large-scale GC program. Based in part on data products from observations made with ESO Telescopes at the Paranal Observatory under program ID 188.B-3002 (GES). Based in part on data obtained from the ESO Science Archive Facility. Furthermore, this paper includes data gathered with the 6.5 m Magellan telescopes located at Las Campanas Observatory, Chile. This work has made use of the VALD database, operated at Uppsala University, the Institute of Astronomy RAS in Moscow, and the University of Vienna.
References
- Abolfathi, B., Aguado, D. S., Aguilar, G., et al. 2018, ApJS, 235, 42 [NASA ADS] [CrossRef] [Google Scholar]
- Afşar, M., Sneden, C., & For, B.-Q. 2012, AJ, 144, 20 [NASA ADS] [CrossRef] [Google Scholar]
- Allende Prieto, C. 2003, MNRAS, 339, 1111 [NASA ADS] [CrossRef] [Google Scholar]
- Allende Prieto, C., Barklem, P. S., Lambert, D. L., & Cunha, K. 2004, A&A, 420, 183 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alonso, A., Arribas, S., & Martinez-Roger, C. 1996, A&A, 313, 873 [NASA ADS] [Google Scholar]
- Alonso, A., Arribas, S., & Martínez-Roger, C. 1999, A&AS, 140, 261 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Amarsi, A. M., Nordlander, T., Barklem, P. S., et al. 2018, A&A, 615, A139 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Asplund, M., Grevesse, N., Sauval, A. J., & Scott, P. 2009, ARA&A, 47, 481 [NASA ADS] [CrossRef] [Google Scholar]
- Barklem, P. S., Stempels, H. C., Allende Prieto, C., et al. 2002, A&A, 385, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Beers, T. C., Preston, G. W., & Shectman, S. A. 1985, AJ, 90, 2089 [NASA ADS] [CrossRef] [Google Scholar]
- Bergemann, M., Lind, K., Collet, R., Magic, Z., & Asplund, M. 2012, MNRAS, 427, 27 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., & Shallis, M. J. 1979, MNRAS, 186, 673 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., Ibbetson, P. A., Petford, A. D., & Shallis, M. J. 1979a, MNRAS, 186, 633 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., Petford, A. D., & Shallis, M. J. 1979b, MNRAS, 186, 657 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., Shallis, M. J., & Simmons, G. J. 1980, A&A, 81, 340 [NASA ADS] [Google Scholar]
- Blackwell, D. E., Petford, A. D., Shallis, M. J., & Simmons, G. J. 1982a, MNRAS, 199, 43 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., Petford, A. D., & Simmons, G. J. 1982b, MNRAS, 201, 595 [NASA ADS] [CrossRef] [Google Scholar]
- Blackwell, D. E., Shallis, M. J., & Simmons, G. J. 1982c, MNRAS, 199, 33 [NASA ADS] [CrossRef] [Google Scholar]
- Blanco-Cuaresma, S., Soubiran, C., Jofré, P., & Heiter, U. 2014, A&A, 566, A98 (GBS) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Caffau, E., Koch, A., Sbordone, L., et al. 2013, Astron. Nachr., 334, 197 [NASA ADS] [Google Scholar]
- Carretta, E., Bragaglia, A., & Gratton, R. 2009, A&A, 505, 139 (C09) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Castelli, F., & Kurucz, R. L. 2004, ArXiv e-prints [arxiv:astro-ph/0405087] [Google Scholar]
- Cayrel, R., van’t Veer-Menneret, C., Allard, N. F., & Stehlé, C. 2011, A&A, 531, A83 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Chen, Y.-P., Trager, S. C., Peletier, R. F., et al. 2014, A&A, 565, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Christlieb, N., Wisotzki, L., Reimers, D., et al. 2001, A&A, 366, 898 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Cropper, M., Katz, D., Sartoretti, P., et al. 2018, A&A, 616, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Dalton, G., Trager, S. C., Abrams, D. C., et al. 2012, in Ground-based and Airborne Instrumentation for Astronomy IV, Proc. SPIE, 8446, 84460P [CrossRef] [Google Scholar]
- de Jong, R. S., Bellido-Tirado, O., Chiappini, C., et al. 2012, Ground-based and Airborne Instrumentation for Astronomy IV, Proc. SPIE, 8446, 84460T [Google Scholar]
- De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
- Eaton, J. A. 1995, AJ, 109, 1797 [NASA ADS] [CrossRef] [Google Scholar]
- For, B.-Q., & Sneden, C. 2010, AJ, 140, 1694 [NASA ADS] [CrossRef] [Google Scholar]
- Fulbright, J. P., McWilliam, A., & Rich, R. M. 2006, ApJ, 636, 821 [NASA ADS] [CrossRef] [Google Scholar]
- Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [NASA ADS] [CrossRef] [Google Scholar]
- Gilmore, G., Randich, S., Asplund, M., et al. 2012, The Messenger, 147, 25 (GES) [NASA ADS] [Google Scholar]
- Gray, D. F., & Johanson, H. L. 1991, PASP, 103, 439 [NASA ADS] [CrossRef] [Google Scholar]
- Grevesse, N., Asplund, M., & Sauval, A. J. 2007, Space Sci. Rev., 130, 105 [Google Scholar]
- Hanke, M., Koch, A., Hansen, C. J., & McWilliam, A. 2017, A&A, 599, A97 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hansen, C. J., Nordström, B., Bonifacio, P., et al. 2011, A&A, 527, A65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hansen, C. J., Primas, F., Hartman, H., et al. 2012, A&A, 545, A31 (CJH12) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hawkins, K., Jofré, P., Heiter, U., et al. 2016, A&A, 592, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Heiter, U., Jofré, P., Gustafsson, B., et al. 2015, A&A, 582, A49 [NASA ADS] [CrossRef] [EDP Sciences] [PubMed] [Google Scholar]
- Hinkle, K., Wallace, L., Valenti, J., & Harmer, D. 2000, Visible and Near Infrared Atlas of the Arcturus Spectrum 3727–9300 A [Google Scholar]
- Jofré, P., Heiter, U., Soubiran, C., et al. 2014, A&A, 564, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Jofré, P., Heiter, U., Soubiran, C., et al. 2015, A&A, 582, A81 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Jones, A., Noll, S., Kausch, W., Szyszka, C., & Kimeswenger, S. 2013, A&A, 560, A91 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kacharov, N., Koch, A., & McWilliam, A. 2013, A&A, 554, A81 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Karovicova, I., White, T. R., Nordlander, T., et al. 2018, MNRAS, 475, L81 [NASA ADS] [CrossRef] [Google Scholar]
- Koch, A., & McWilliam, A. 2008, AJ, 135, 1551 [NASA ADS] [CrossRef] [Google Scholar]
- Koch, A., & McWilliam, A. 2010, AJ, 139, 2289 [NASA ADS] [CrossRef] [Google Scholar]
- Koch, A., & McWilliam, A. 2011, AJ, 142, 63 [NASA ADS] [CrossRef] [Google Scholar]
- Koch, A., & McWilliam, A. 2014, A&A, 565, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kovtyukh, V. V., Soubiran, C., Belik, S. I., & Gorlova, N. I. 2003, A&A, 411, 559 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Lee, Y. S., Beers, T. C., Sivarani, T., et al. 2008, AJ, 136, 2022 [NASA ADS] [CrossRef] [Google Scholar]
- Lind, K., Korn, A. J., Barklem, P. S., & Grundahl, F. 2008, A&A, 490, 777 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Lind, K., Bergemann, M., & Asplund, M. 2012, MNRAS, 427, 50 [NASA ADS] [CrossRef] [Google Scholar]
- Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [NASA ADS] [CrossRef] [Google Scholar]
- Mashonkina, L., Gehren, T., Shi, J.-R., Korn, A. J., & Grupp, F. 2011, A&A, 528, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- McWilliam, A., Preston, G. W., Sneden, C., & Searle, L. 1995, AJ, 109, 2757 [NASA ADS] [CrossRef] [Google Scholar]
- Ness, M., Hogg, D. W., Rix, H.-W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16 [NASA ADS] [CrossRef] [Google Scholar]
- Nissen, P. E., Akerman, C., Asplund, M., et al. 2007, A&A, 469, 319 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Noll, S., Kausch, W., Barden, M., et al. 2012, A&A, 543, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- O’Brian, T. R., Wickliffe, M. E., Lawler, J. E., Whaling, W., & Brault, J. W. 1991, Physics, 8, 1185 [Google Scholar]
- O’Connell, J. E., Johnson, C. I., Pilachowski, C. A., & Burks, G. 2011, PASP, 123, 1139 [NASA ADS] [CrossRef] [Google Scholar]
- Pasquini, L., Avila, G., Allaert, E., et al. 2000, in Optical and IR Telescope Instrumentation and Detectors, eds. M. Iye, & A. F. Moorwood, Proc. SPIE, 4008, 129 [NASA ADS] [CrossRef] [Google Scholar]
- Piskunov, N. E., Kupka, F., Ryabchikova, T. A., Weiss, W. W., & Jeffery, C. S. 1995, A&AS, 112, 525 [NASA ADS] [Google Scholar]
- Prugniel, P., & Soubiran, C. 2001, A&A, 369, 1048 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Prugniel, P., Soubiran, C., Koleva, M., & Le Borgne, D. 2007, ArXiv e-prints [arxiv:astro-ph/0703658] [Google Scholar]
- Ramírez, I., Allende Prieto, C., Redfield, S., & Lambert, D. L. 2006, A&A, 459, 613 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Roederer, I. U., Preston, G. W., Thompson, I. B., et al. 2014, AJ, 147, 136 (R14) [NASA ADS] [CrossRef] [Google Scholar]
- Smiljanic, R., Korn, A. J., Bergemann, M., et al. 2014, A&A, 570, A122 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Sneden, C. 1973, PhD Thesis, Univ. of Texas at Austin [Google Scholar]
- Steinmetz, M., Zwitter, T., Siebert, A., et al. 2006, AJ, 132, 1645 [NASA ADS] [CrossRef] [Google Scholar]
- Venn, K. A., Irwin, M., Shetrone, M. D., et al. 2004, AJ, 128, 1177 [NASA ADS] [CrossRef] [Google Scholar]
- Yanny, B., Rockosi, C., Newberg, H. J., et al. 2009, AJ, 137, 4377 [NASA ADS] [CrossRef] [Google Scholar]
- Zhao, G., Zhao, Y.-H., Chu, Y.-Q., Jing, Y.-P., & Deng, L.-C. 2012, Res. Astron. Astrophys., 12, 723 [CrossRef] [Google Scholar]
Appendix A: Additional tables
Fit information on FR-T eff relations (Eq. (4)) sorted by decreasing |r FR,T eff |.
Information on the strongest FR-T eff-[Fe/H] relations (Eq. (10)) sorted by decreasing r′.
Information on the strongest FR-T eff-[Fe/H]-log g relations (Eq. (13)) sorted by decreasing R′.
Coefficients for the resolution-dependent correction to measured FRs (Eq. (14)).
All Tables
T eff for overlapping stars between our training sample and the ELODIE library in the range 4750 K–5500 K.
Mean T eff and [Fe/H] residuals of C09 and GES with respect to ATHOS for individual GCs, as well as the mean [Fe/H] and scatter determined in the three studies.
Fit information on FR-T eff relations (Eq. (4)) sorted by decreasing |r FR,T eff |.
Information on the strongest FR-T eff-[Fe/H] relations (Eq. (10)) sorted by decreasing r′.
Information on the strongest FR-T eff-[Fe/H]-log g relations (Eq. (13)) sorted by decreasing R′.
Coefficients for the resolution-dependent correction to measured FRs (Eq. (14)).
All Figures
![]() |
Fig. 1. Upper panel: comparison of our EWCODE runs on UVES spectra with the literature values determined from MIKE spectra by R14. The one-to-one trend is shown by the black dashed line. Lower panel: residual distribution. The solid and dashed red lines indicate the mean deviation of 0.67 mÅ and the rms scatter of 2.07 mÅ, respectively. |
In the text |
![]() |
Fig. 2. Distribution of the training set in the parameters T eff, [Fe/H], and log g. The color coding in each plot indicates the respective missing dimension. |
In the text |
![]() |
Fig. 3. Telluric contamination due to H2O vapor in Earth’s lower atmosphere. Shown as solid lines are rest-frame spectra of HD 140283 taken at different epochs and line-of-sight radial velocities. The apparent motion with respect to the heliocentric velocity v helio = −169.00 km s−1 is indicated in the legend. All dashed lines represent the same topocentric telluric absorption model (gray, see text for details) but being shifted in wavelength space to match the radial velocity of their observed counterpart (light-colored). For better visibility the models are shifted in flux direction as well. |
In the text |
![]() |
Fig. 4. Upper panels: exemplary scatter plots for the FR-T eff relations of the two strongest correlations |r FR, T eff | in our test grid around Hβ (left) and Hα (right). The color coding is the same as in Fig. 2. Black-dashed lines resemble the linear ODR fits to the data. The small inlays at the bottom show cut-out spectra normalized by their 99th percentile for the highest (black) and lowest (gray) corresponding T eff, the former being offset by −0.1 in flux direction. Gaps in the spectra mark the expected positions of strong telluric contamination for the observed radial velocity (Sect. 2.3). The blue and green shaded regions in the inlays indicate the ranges from which the respective FRs were computed using Eqs. (1) and (2) with the values provided in the upper corner of each panel. Lower panels: residual distribution. |
In the text |
![]() |
Fig. 5. Comparison of the shape changes with T eff of the Hβ (left) and Hα (right) profiles at low (top) and high (bottom) metallicities. As in Fig. 4, the spectra were normalized by the 99th percentile of the flux in the shown wavelength intervals. Gaps in the spectra indicate telluric lines. Blue and green colored ranges denote the same FRs as in the inlays of Fig. 4, while the additional colored regions show the remaining seven relations ranked by the number at the top of each panel. |
In the text |
![]() |
Fig. 6. Residuals of the inferred and the literature temperatures for the nine strongest FR-T eff relations versus the mean residuals. The color coding is the same as in Fig. 2. Spectra with telluric contamination in one of the regions were omitted. Errors on the abscissa were computed via the standard deviation of the ΔT eff, i (see Eq. (8)), while the ordinate errors only denote the claimed uncertainty in the literature data. |
In the text |
![]() |
Fig. 7. [Fe/H] dependency on FR and T eff. The left panel illustrates the distribution of points of the tightest relation in FR-T eff-[Fe/H] space and the surface they span therein. To guide the eye, the best-fit surface according to Eq. (10) is overlaid as a light-colored, distorted grid, where the grid lines indicate the isothermal and iso-FR lines. The same distribution of points is shown in the top right panel but in a coordinate frame that is rotated such that it is aligned with the FR and [Fe/H] axes. Dashed lines indicate the track of the isothermal lines on the surface. In analogy to Fig. 4, the inlay shows the most extreme spectra in terms of metallicity, as well as the two wavelength regions the FR was computed from (blue). The fit residuals with respect to FR are presented in the lower right frame. All three scatter-plots follow the same T eff color coding as indicated by the bar on the right. |
In the text |
![]() |
Fig. 8. log g dependency on FR, T eff, and [Fe/H]. The left panel illustrates the distribution of points of the tightest relation in FR-T eff-log g space, where the colors introduce metallicity as a fourth dimension. The fit solutions to Eq. (13) at fixed [Fe/H] values of −3.5 dex, −1.75 dex, and 0.00 dex are shown as three parallel hyperplanes. The top right panel resembles the distribution in FR-log g space, only. There, dashed lines indicate the track of the iso-metallicity lines on the surface at a fixed temperature of 5000 K. The inlay shows the most extreme spectra in terms of gravity, as well as the two wavelength regions the FR was computed from (blue). The fit residuals with respect to FR are presented in the lower right frame. All three scatter-plots follow the same [Fe/H] color coding as indicated by the bar on the right. |
In the text |
![]() |
Fig. 9. T eff deviations with respect to resolution and [Fe/H] ordered from the strongest to the weakest relation (see Table A.1 and Fig. 5) from top left to bottom right. The deviations were computed by scaling ΔFR with the slope of the respective relation. Points measured for the same star but at different Rs are connected by solid lines. The color coding is the same T eff scale as in Fig. 7. |
In the text |
![]() |
Fig. 10. Left panel: deviations of the FRs measured in all training spectra for the strongest metallicity relation (see Table A.2) at different resolutions with respect to the FRs computed at the original resolution of 45 000. In contrast to Fig. 9, we note that there is a non-linear relation between FR and [Fe/H], which is why we illustrate only the FR- and not the [Fe/H] deviations. The colored surface resembles the best-fit model to the data according to Eq. (14), where the color indicates the magnitude of the correction to be applied (see color bar on the right). Red points and their connecting line denote the track of the corrections to the FRs measured in the solar spectrum at different resolutions. Upper right panel: portion of the solar spectrum at resolutions of 45 000, 40 000, 30 000, 20 000, 10 000, 7500, 5000, and 2500 (dark- to light-colored). The color-coding refers to the same FR corrections as in the left panel. Blue vertical bars indicate the wavelength ranges from which the FR is computed. Lower right panel: derived [Fe/H] for the Sun from uncorrected (gray) and corrected (red) FRs using Eq. (10) and only the strongest metallicity relation shown in the upper right panel. |
In the text |
![]() |
Fig. 11. Parameter deviations between ATHOS runs on X-shooter spectra and on high-resolution spectra of the same stars from our training sample. Here, gray circles indicate the values without any correction applied to the involved FRs, while red circles incorporate the corrections described by Eq. (14). The mean deviation and scatter for each stellar parameter are depicted by solid and dashed red lines, respectively. |
In the text |
![]() |
Fig. 12. ATHOS outputs from an MC-simulation on the solar spectrum. Noise was artificially induced such that S/N values of 100 (gray), 50 (dark blue), and 20 (light blue) with 103 random initializations each were realized. The literature parameters (5771 K, 0.00 dex, 4.44 dex; Heiter et al. 2015) are indicated by black dashed lines. |
In the text |
![]() |
Fig. 13. Same as Fig. 12, but for α Boo (4286 K, −0.57 dex, 1.64 dex). |
In the text |
![]() |
Fig. 14. Upper panels: comparison of the ATHOS output for T eff, [Fe/H], and log g with literature results for the ELODIE spectral library (version 3.1, Prugniel et al. 2007, see text for quality cuts made here). In case a star occurs with multiple spectra in the library, it is reflected by more than one point here. Dashed lines represent the one-to-one relation. The colors reflect the internal statistical uncertainties computed with ATHOS. Lower panels: residual distribution. All residuals are determined via Δx = x lit − x ATHOS. |
In the text |
![]() |
Fig. 15. Upper panel: comparison of the four ELODIE spectra for the Sun with S/N > 100 (gray) to the atlas spectrum by Hinkle et al. (2000) (red dashed). The latter was degraded to match the resolution of our training grid and re-sampled to the ELODIE dispersion scale. The blue regions show the fluxes used in one of our [Fe/H] relations. All spectra were normalized to the mean flux in the rightmost blue band. Lower panel: flux residuals. |
In the text |
![]() |
Fig. 16. Same as Fig. 14, but for the S4N library (Allende Prieto et al. 2004). Red star symbols resemble stars with rotational velocities v sin i ≥ 5 km s−1. For better visibility, we did not show these stars in the residual distributions of the middle and right panel, because they are far off (see discussion in the main text). |
In the text |
![]() |
Fig. 17. Comparison of the residual temperature distribution of the S4N literature temperatures (see bottom left panel of Fig. 16) with the ones in common with Kovtyukh et al. (2003, red points). Vertical red lines connect points sharing the same spectrum and therefore ATHOS output. The red dashed line corresponds to the mean difference of 14 K when using the Kovtyukh et al. (2003) reference and the solid red lines indicate the ±1σ scatter of 81 K. |
In the text |
![]() |
Fig. 18. Trend of Δ[Fe/H] with v sin i for the red stars in Fig. 16. |
In the text |
![]() |
Fig. 19. Same as Fig. 14, but for the high-resolution spectra and recommended parameters from the Gaia-ESO survey DR3. Red stars, golden triangles, orange diamonds, and blue pluses indicate the peculiar spectra we found to show high rotational broadening, strong molecular blending, binary components, or erroneous wavelength calibrations, respectively. In the residual distributions we excluded those, because they resemble extreme outliers (see text and Fig. 20). |
In the text |
![]() |
Fig. 20. Comparison of representative peculiar spectra (colored in the same way as in Fig. 19) to the solar spectrum (gray). An approximate normalization was performed by manually setting the continuum. The wavelength ranges were chosen such that the respective reason for peculiarity is illustrated best. The GES identifiers of the stars are (from upper left to lower right) 08100380-4901071, 18184436-4500066, 23001517-2231268, and 08090542-4740261. |
In the text |
![]() |
Fig. 21. Same as Fig. 14, but for the high-resolution UVES sample of GC stars from the study of C09. Points connected by a red line represent the same spectrum but have two different literature values. The additional red open circles indicate the recommended parameters for DR3 of GES. |
In the text |
![]() |
Fig. 22. [Fe/H] residuals averaged per GC versus mean temperature offset. The color coding resembles the cluster metallicity (see color bar on the right). Light colored error bars indicate the respective σ ΔT eff and σ [Fe/H] from Table 5. Red lines connect the same clusters corresponding to the same stars and spectra, but with stellar parameters either from C09, or GES DR3 (red circles). |
In the text |
![]() |
Fig. 23. Same as Fig. 14, but for a sample of MIKE spectra of GC stars. The colored open circles indicate the membership in either of the GCs 47 Tuc (blue, Koch & McWilliam 2008), M 5 (pink, Koch & McWilliam 2010), NGC 6397 (yellow, Koch & McWilliam 2011), M 75 (orange, Kacharov et al. 2013), NGC 5897 (black, Koch & McWilliam 2014), or NGC 6426 (red, Hanke et al. 2017). |
In the text |
![]() |
Fig. 24. Same as Fig. 14, but for 3966 SDSS low-resolution (R ≈ 1500–2500) spectra. As opposed to earlier figures, here the color does not represent ATHOS errors but the local point density (arbitrary units) obtained via a Gaussian kernel-density estimate. |
In the text |
![]() |
Fig. 25. Comparison of the results from all surveys discussed in Sects. 5.3–5.7. The color coding is indicated by the legend in the right panel. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.