Hierarchical Bayesian model to infer PL(Z) relations using Gaia parallaxes

H. E. Delgado; L. M. Sarro; G. Clementini; T. Muraveva; A. Garofalo

doi:10.1051/0004-6361/201832945

Home

All issues

Volume 623 (March 2019)

A&A, 623 (2019) A156

Full HTML

Free Access

Issue		A&A Volume 623, March 2019


Article Number		A156
Number of page(s)		13
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201832945
Published online		26 March 2019

A&A 623, A156 (2019)

Hierarchical Bayesian model to infer PL(Z) relations using Gaia parallaxes

H. E. Delgado¹, L. M. Sarro¹, G. Clementini², T. Muraveva² and A. Garofalo²^,3

¹ Dpto. de Inteligencia Artificial, UNED, c/ Juan del Rosal, 16, 28040 Madrid, Spain
e-mail: hed_up@iasystems.org
² INAF, Osservatorio di Astrofisica e Scienza dello Spazio di Bologna, Via Piero Gobetti 93/3, 40129 Bologna, Italy
³ Dipartimento di Fisica e Astronomia, Università di Bologna, Via Piero Gobetti 93/2, 40129 Bologna, Italy

Received: 2 March 2018
Accepted: 4 February 2019

Abstract

In a recent study we analysed period–luminosity–metallicity (PLZ) relations for RR Lyrae stars using the Gaia Data Release 2 (DR2) parallaxes. It built on a previous work that was based on the first Gaia Data Release (DR1), and also included period–luminosity (PL) relations for Cepheids and RR Lyrae stars. The method used to infer the relations from Gaia DR2 data and one of the methods used for Gaia DR1 data was based on a Bayesian model, the full description of which was deferred to a subsequent publication. This paper presents the Bayesian method for the inference of the parameters of PL(Z) relations used in those studies, the main feature of which is to manage the uncertainties on observables in a rigorous and well-founded way. The method encodes the probability relationships between the variables of the problem in a hierarchical Bayesian model and infers the posterior probability distributions of the PL(Z) relationship coefficients using Markov chain Monte Carlo simulation techniques. We evaluate the method with several semi-synthetic data sets and apply it to a sample of 200 fundamental and first-overtone RR Lyrae stars for which Gaia DR1 parallaxes and literature K_s-band mean magnitudes are available. We define and test several hyperprior probabilities to verify their adequacy and check the sensitivity of the solution with respect to the prior choice. The main conclusion of this work, based on the test with semi-synthetic Gaia DR1 parallaxes, is the absolute necessity of incorporating the existing correlations between the period, metallicity, and parallax measurements in the form of model priors in order to avoid systematically biased results, especially in the case of non-negligible uncertainties in the parallaxes. The relation coefficients obtained here have been superseded by those presented in our recent paper that incorporates the findings of this work and the more recent Gaia DR2 measurements.

Key words: methods: statistical / methods: data analysis / stars: variables: RR Lyrae / parallaxes

© ESO 2019

1. Introduction

Cepheids and RR Lyrae stars are primary standard candles of the cosmological distance ladder because they follow canonical relations linking their intrinsic luminosity to the pulsation period and/or the metal abundance. Specifically, for Cepheids the intrinsic luminosity (L) at any passband depends on the period (P) of light variation. This is traditionally referred to as the Cepheid period–luminosity relation or Leavitt law, after its discoverer Mrs Henrietta Swan Leavitt (Leavitt & Pickering 1912). Modern realisations of the Cepheid PL relations from optical to infrared passbands include, among others, the ground-based studies of Madore & Freedman (1991), Ripepi et al. (2012), and Gieren et al. (2013), works based on Hubble Space Telescope (HST) data such as those by Freedman et al. (2001), Saha et al. (2006), and Riess et al. (2011), and theoretical investigations such as those by Marconi et al. (2005). Among the most recent studies of the Cepheid period–luminosity relations are those based on Gaia trigonometric parallaxes of Galactic Cepheids (e.g. Clementini 2017; hereafter Paper I and references therein; Riess et al. 2016, 2018). For RR Lyrae stars the intrinsic luminosity (L) in the infrared passbands depends on P and possibly stellar metallicity (Z; PL – metallicity relation – PL(Z)), as first shown by Longmore et al. (1986) and later confirmed by (i) empirical studies of field and cluster RR Lyrae stars (e.g. Sollima et al. 2006, 2008; Borissova et al. 2009), (ii) theoretical models by Marconi et al. (2015) and Neeley et al. (2017), and (iii) the Gaia parallax-calibrated relations of Sesar et al. (2017), Paper I and references therein, and Muraveva et al. (2018a,b). In the visual passband, the luminosity L depends on Z in the form of the so-called RR Lyrae luminosity–metallicity relation (see e.g. Cacciari & Clementini 2003; Clementini et al. 2003; the pulsation models by Bono et al. 2003; the theoretical calibration by Catelan et al. 2004; or the Gaia-based relations in Paper I; Muraveva et al. 2018a and references therein). The predicted precision of the Gaia end-of-mission parallaxes for local Cepheids and RR Lyrae stars¹ will allow us to determine the slope and zero-point of these fundamental relations with unprecedented accuracy, thus setting the basis for a global reassessment of the whole cosmic distance ladder. As a first anticipation of the Gaia potential in this field of the cosmic distance ladder and a first assessment of improved precision with respect to previous astrometric missions such as HIPPARCOS, and the dramatic increase in statistics compared to what is achievable, for instance, through measuring parallaxes with the HST, Gaia DR1 published parallaxes for more than 700 Galactic Cepheids and RR Lyrae stars, computed as part of the Tycho-Gaia Astrometric Solution (TGAS; Lindegren et al. 2016). A number of papers after Gaia intermediate data releases in 2016 and 2018 (Gaia Data Release 1 – DR1 and DR2, respectively) have discussed Gaia Cepheids and RR Lyrae stars, specifically presenting the released samples (Clementini et al. 2016, 2019), their parallaxes (e.g. Lindegren et al. 2016) and possible offsets affecting them (Arenou et al. 2017, 2018); and addressing in particular their use as standard candles (Casertano et al. 2017 and Paper I for Gaia DR1 and Riess et al. 2018; Muraveva et al. 2018a for Gaia DR2). In Paper I we have used TGAS parallaxes, along with literature photometry and spectroscopy, to calibrate the zero-point of the PL relations of classical and type II Cepheids, and the near-infrared PL and PL(Z) relations of RR Lyrae stars by fitting these relations through adopting different techniques that operate either in parallax or absolute magnitude space. In that paper, different sources of biases affecting the TGAS samples of Cepheids and RR Lyrae stars were discussed at some length, and the possible systematic errors caused in the inferred luminosity calibrations were analysed in detail.

Section 3.2 of Paper I in particular discussed the problem of fitting general luminosity relations between the absolute magnitude M_True, the decadic logarithm of the period P_True, and possibly also the metallicity Fe/H_True of the form

$\begin{matrix} M_{True} = b + c \cdot log (P_{True}) + k \cdot {[Fe / H]}_{True} \end{matrix}$ $\begin{aligned} M_{\rm True} = b + c\cdot \log (P_{\rm True}) + k\cdot \left[\mathrm {Fe/H}\right]_{\rm {True}} \end{aligned}$ (1)

with a sample that is truncated in parallax (by removing the non-positive values) and for which the assumption of normality of uncertainties in the absolute magnitude is not valid. A more detailed description of the intricacies involved in using astrometric measurements for the inference of quantities of astrophysical interest in general and PLZ relation coefficients in particular can be found in Luri et al. (2018). Our proposal in Paper I was to construct a two-level statistical model that distinguishes between true and measured parallaxes. This model can then be used to infer the true parallaxes and absolute magnitudes from the measurements. One of the rigorous ways to construct such a model is to apply the Bayesian method, where a prior probability distribution is assigned to the true parallax population. In doing so, a suitable selection of this prior will improve the estimation of individual true parallaxes in the sense that their posterior credible intervals are “shrunken” with respect to the measurement uncertainties. Setting a specific prior is always controversial, but in principle, it is possible to define only a functional form that depends on a set of unknown parameters. The specific prior is then inferred from the data as part of the global inference process. This prior functional form should be flexible enough to properly model the true distribution of parallaxes but should also be sufficiently restrictive to enforce a plausible distribution for the true parallaxes on the basis of the knowledge present in the astronomical literature.

The solution described in the previous paragraph can be represented as a graph model that incorporates the PL(Z) relation, the definition of absolute magnitudes in terms of the apparent magnitude m and the parallax ϖ, and the corresponding distribution of the measurements given the true values. In this way, we guarantee that the observational uncertainties are simultaneously propagated through the graph and that the uncertainties of the parameters of the PLZ relationship are estimated in a way that is consistent with the measurement uncertainties. Moreover, the effect of including the relationship

$\begin{matrix} b + c \cdot log P + k \cdot [Fe / H] = m + 5 log ϖ - 10 \end{matrix}$ $\begin{aligned} b+c\cdot \log P+k\cdot \left[\mathrm {Fe/H}\right]=m + 5\log \varpi -10\, \end{aligned}$ (2)

in the model is to constrain the parameter space in such a way that the PLZ relationship coefficients and the individual true parallaxes have to be consistent.

The objective of this paper is to infer estimates of the parameters of the PLZ relationship. We apply the hierarchical Bayesian method, which consists of dividing the variability of the statistical inference problem into several levels. In this way, we partition the parameter space associated with inferring the PLZ relation into population-level parameters and observations. We represent the hierarchical Bayesian model with a directed acyclic graph and perform the inference using Markov chain Monte Carlo (MCMC) simulation techniques (Robert & Casella 2013). A minimal description of the methodology and preliminary results was already presented in Paper I which we intend to extend and clarify here. For reasons of clarity and scope, we focus on the inference of the PLZ relationship in the K band for 200 fundamental and first-overtone RR Lyrae stars, the main properties of which are provided in Table A.3 of Paper I. The model is applicable with minimal modifications to other variability types such as Cepheids or long-period variables and different photometric bands. In this work we present the results of the full model including the slopes of the relation, expanding the results presented in Paper I where only the zero-points were inferred, while the slopes were fixed to literature values.

A similar method has been applied by Sesar et al. (2017) to constrain PLZ relations of fundamental-mode (ab type) RR Lyrae stars in the mid-infrared W1 and W2 bands of the Wide-field Infrared Survey Explore (WISE; Wright et al. 2010), using TGAS parallaxes, but modelling true distances with an exponentially decreasing volume density (hereafter EDVD) prior proposed by Bailer-Jones (2015).

The Bayesian hierarchical method presented in Paper I used a log-normal prior to model the distribution of true parallaxes independently of the other model parameters. With this prior, the log (P) slope were severely underestimated when compared to the literature values, although this result was not specifically discussed therein. In the present work we extend the Bayesian analysis performed in Paper I in three directions. First, we validate the model with semi-synthetic data and analyse the causes of the slope underestimation. Second, we extend the Bayesian analysis by testing alternative prior distributions for parallaxes and demonstrate that one of them mitigates to some degree the problem of the underestimation of the PLZ log (P) slope. Third, we study the sensitivity of the Bayesian analysis results under different prior choices for some critical hyperparameters of our hierarchical model (HM).

The structure of the paper is as follows. In Sect. 2 we summarize the theoretical foundations of the hierarchical Bayesian method and describe extensively the HM used for inferring the PLZ relationship in Paper I and Muraveva et al. (2018a) (in the latter case with minor adaptations). In Sect. 3 we study the data set in detail and explore its properties by means of semi-synthetic samples constructed assuming a known PLZ relation; in Sect. 4 we present the full results of the MCMC samples of the posterior distribution for the Gaia DR1 data used in Paper I; in Sect. 5 we study the sensitivity of the results to the choice of hyper-parameters, and in Sect. 6 we summarise the findings of the paper.

2. Hierarchical Bayesian model

A full introduction to Bayesian inference and hierarchical Bayes is beyond the scope of this manuscript. We refer to Gelman et al. (2004) and Gelman & Hill (2007) for very instructive introductions, and to Luri et al. (2018) for a more astronomy-oriented introduction. In what follows, we summarise the main concepts of the method. Bayesian inference is based on Bayes’ rule:

$\begin{matrix} p (Θ ∣ D) \propto p (D ∣ Θ) \times p (Θ), \end{matrix}$ $\begin{aligned} p\left(\Theta \mid \mathcal{D} \right)\propto p\left(\mathcal{D} \mid \Theta \right)\times p\left(\Theta \right), \end{aligned}$ (3)

where 𝒟 are the observations (data), Θ are the parameters of a model proposed to explain the data, and p represents a probability distribution. The right side of Eq. (3) represents the model itself, specified by the joint probability distribution p(𝒟,Θ) of the data and the parameters. This distribution factorizes into

the conditional distribution p(𝒟∣Θ) of the data given the parameters (the so-called likelihood), and
the prior distribution of the parameters p(Θ), which represents our knowledge about plausible parameter values before observing the data.

The basic model of Eq. (3) divides the variability of the statistical problem into two levels: observations and parameters. The hierarchical Bayesian method consists of distinguishing further levels of variability. In our case, we introduce a new dependence of the prior distribution p (Θ) on a new set of parameters Φ (the so called hyperparameters) and assign hyperprior distributions p(Φ) to them. We explain its nature in the following sections. In order to have a better understanding of the dependency structure dictated by the model, it is customary to represent the factorization of the joint probability distribution p (D,Θ,Φ) using the Bayesian network formalism (Pearl 1988; Lauritzen 1996), which consists of drawing a directed acyclic graph (DAG) in which nodes encode model parameters, measurements, or constants, and directed links represent conditional probability dependence relationships.

The inference in a hierarchical Bayesian model proceeds by calculating the marginal joint posterior distribution of a set of parameters of interest given the data. In complex problems with many parameters, the posterior distribution usually is not available in an analytically tractable closed form, but can be approximately evaluated using MCMC simulation techniques.

2.1. Conditional dependencies

In this and the next section we describe our sample and the hierarchical model that encodes the conditional probability relations between the observations and the parameters of the linear PL(Z) relations. We include Fig. 19 of Paper I here as Fig. 1 to facilitate reading, but include some additional clarifications that could not be described there due to space and scope limitations.

Fig. 1.

Directed acyclic graph that represents the forward model used to infer the PLZ relation coefficients when the prior of true metallicities, logarithm of true periods, and (natural) logarithm of true parallaxes is assumed to be a 3D Gaussian mixture distribution.

In the following, we change the notation to avoid cluttering of subscripts. We denote measured quantities with a circumflex accent ( ̂ ) and remove the subscript True from the true values. The DAG in Fig. 1 encodes the probabilistic relationships between the variables of our model and shows the measurements at the bottom level: decadic logarithm of periods log P̂_i, apparent magnitudes m̂_i, metallicities ${\hat{[Fe / H]}}_{i}$ $\widehat{\left[\mathrm{Fe/H}\right]}_{i}$ , parallaxes ${\hat{ϖ}}_{i}$ $\hat{\varpi}_i$ , and extinctions Â_{m_i} The subindex i runs from 1 to the total number of stars N in each sample. Our model assumes that the measurements

$\begin{matrix} d_{i} = ({\hat{m}}_{i}, log {\hat{P}}_{i}, {\hat{ϖ}}_{i}, {\hat{[Fe / H]}}_{i}, {\hat{A}}_{m_{i}}) \end{matrix}$ $\begin{aligned} \mathbf d _{i}=\left(\hat{m}_{i},\log \hat{P}_{i},\hat{\varpi }_{i}, \widehat{\left[\mathrm{Fe/H} \right]}_{i}, \hat{A}_{m_{i}}\right) \end{aligned}$ (4)

are realisations from normal distributions centred at the true (unknown) values and with standard deviations given by the measurement uncertainties

$\begin{matrix} σ_{d_{i}} = (σ_{m_{i}}, σ_{log P_{i}}, σ_{ϖ_{i}}, σ_{{[Fe / H]}_{i}}, σ A_{m_{i}}) . \end{matrix}$ $\begin{aligned} \boldsymbol{\sigma }_\mathbf{d _{i}}=\left(\sigma _{m_{i}},\sigma _{\log P_{i}},\sigma _{\varpi _{i}},\sigma _{\left[\mathrm{Fe/H} \right]_{i}},\sigma A_{m_{i}}\right). \end{aligned}$ (5)

Our test sample $D = {d_{i}}_{i = 1}^{N}$ $\mathcal{D}=\left\{\mathbf{d}_{i}\right\}_{i=1}^{N}$ consists of N = 200 fundamental-mode and first-overtone RR Lyrae (RRL) stars with near-infrared (NIR) photometry (m̂_i = m̂_{K_si}) selected amongst the stars of the Dambis et al. (2013) compilation for which TGAS parallaxes (Lindegren et al. 2016) and associated uncertainties were available. This is essentially the same K_s-band sample as in Paper I for which the periods of first-overtone stars were “fundamentalised” by adding 0.127 to the decadic logarithm of the period and uncertainties on log (P) were estimated as σ_{log (P)} = 0.01 ⋅ log (P), which is equivalent to an uncertainty of 2% in the period. Unlike in Paper I (Sect. 6.1), where metal abundances were transformed from the Zinn & West (1984) to the Gratton et al. (2004) metallicity scale to be consistent with a period term slope of the PM_{K_s}Z relationship fixed to the value of −2.73 mag dex⁻¹ reported by Muraveva et al. (2015), in this paper we use the original metal abundances provided by Dambis et al. (2013) because we aim to infer the period term slope. Because Dambis et al. (2013) did not provide metallicity uncertainties, in Paper I we assigned a constant uncertainty of 0.2 dex to all metallicities in the sample. In this paper we distinguish among techniques used to estimate metal abundances and respectively adopt uncertainties of 0.1, 0.2 and 0.3 dex for metallicities estimated from high-resolution spectroscopy, measured by the ΔS method of Preston (1959) and determined from photometry or other non-spectroscopic methods. The Dambis et al. (2013) catalogue does not include uncertainties on absorption. We estimate them as σ_{A_K} = 0.114 ⋅ σ_{A_V}, where σ_{A_V} = 3.1 ⋅ σ_E(B−V) (Cardelli et al. 1989) and σ_E(B−V) = 0.16 ⋅ E(B−V) (Schlegel et al. 1998). All measured quantities are represented as blue nodes in the DAG where we do not include the nodes corresponding to the uncertainties of Eq. (5) in order to facilitate interpretation.

We represent the likelihood of the model with the nodes corresponding to the true values m_i, ϖ_i, log P_i, [Fe/H]_i, and A_m_i and the arcs going from true values to measurements. True values and observations are all enclosed in a black rectangle that represents replication for the N stars in the sample (plate notation). Equation (1) can be written for every star i in the sample as

$\begin{matrix} M_{i} = b + c \cdot log P_{i} + k \cdot {[Fe / H]}_{i}, \end{matrix}$ $\begin{aligned} M_{i}=b+c\cdot \log P_{i}+k\cdot {\left[\mathrm{Fe/H} \right]}_{i}, \end{aligned}$ (6)

where M_i represents the true absolute magnitude for star i. This is a linear model in the parameters: the intercept b, the slope c for the period term, and the slope k for the metallicity term. The last term can be dropped if metallicities are thought to play a negligible role in the relationship. We keep it in the following for the sake of completeness, but the particularisation to PL relations is straightforward. In Fig. 1 we shadow the left-hand rectangle that includes the metallicity terms to highlight this choice.

In Fig. 1 the PL(Z) relationship of Eq. (6) is denoted by the grey node M_i and all incoming arrows from c, k, b, P_i, and [Fe/H]_i (i.e. the three parameters and two predictive variables). An additional arrow links w and M_i. w represents an intrinsic dispersion in the PL(Z) relationship that may be due to evolutionary effects, for example. This dependence on additional predictive variables that are not accounted for in the model is incorporated as an additional Gaussian spread of the standard deviation w. This spread is analysed as part of the inference results. Including the additional Gaussian spread that represents unaccounted predictive variables, we have that

$\begin{matrix} M_{i} \sim N (b + c \cdot log P_{i} + k \cdot {[Fe / H]}_{i}, w), \end{matrix}$ $\begin{aligned} M_{i}\sim \mathsf{N }\left(b+c\cdot \log P_{i}+k\cdot \left[\mathrm{Fe/H} \right]_{i},{ w}\right), \end{aligned}$ (7)

where ∼ should be read as “is distributed as”, N represents the normal (Gaussian) distribution, and the comma separates values inside the parenthesis that represent the mean and standard deviation of the normal distribution, respectively.

Of course, we do not observe absolute magnitudes, and our model has to account for the transformation between absolute magnitudes and the observations, that are (potentially affected by interstellar absorption) apparent magnitudes. This is shown in the lower part of Fig. 1, where the parallaxes (right-hand block) are handled as we explain next.

The transformation from absolute to unabsorbed apparent magnitudes is a well-known deterministic one:

$\begin{matrix} {m_{0}}_{i} = M_{i} - 5 log (ϖ_{i}) + 10, \end{matrix}$ $\begin{aligned} {m_0}_i = M_{i} -5\log (\varpi _{i})+10, \end{aligned}$ (8)

where the parallax ϖ_i is measured in mas. This is not a probabilistic relation, and we use dashed lines in the arrows going into m₀_i to distinguish them from the arcs denoting conditional probability links. The absorbed apparent magnitudes are computed as m_i = m₀_i + A_{m_i}, where the grey node A_m_i represents the true absorption. The model also contemplates the possibility of a TGAS global parallax offset ϖ₀. The offset can be inferred by the model or fixed to a predefined value. We shadow the top right rectangle of the graph that includes the offset node to denote this choice.

2.2. Priors, hyperparameters, and hyperpriors

Prior distributions allow us to pose probabilistic statements about plausible values of the model parameters based on knowledge available prior to and independent of the observations. Most importantly, however, they allow us by means of Bayes’ theorem to make statements about the distribution of the parameters we aim to infer (the posterior distribution of the model parameters in the left side of Eq. (3)). In the astrophysical context of this paper, we aim at formulating probabilistic statements about the values of the hyperparameters: the most probable value of the PL(Z) slopes or intercepts or their credible intervals. We use green rectangular nodes at the top of the graph to denote fixed prior hyperparameters.

Often the prior definitions used in the literature are conservative choices in the sense that they aim to be as non-informative as possible. For both slopes c and k of the PL(Z) relationship of Eq. (6), we specify a standard Cauchy prior (centred at 0 with scale parameter equal to 1), which is equivalent to a uniform prior supported on the interval [−π/2,+π/2] assigned to the angles θ₁ = arctan (c) and θ₂ = arctan (k). The prior probability distribution of the intercept is a Cauchy distribution centred at its mode μ_b = 0 with scale parameter σ_b = 10. The intrinsic dispersion of the PLZ relation is given by an exponential prior with inverse scale λ_w = 1.

The only block of the graph that remains to be clarified is the left-hand top block describing the distribution of periods, metallicities, and parallaxes. Our model assigns a joint 3D prior to the true distribution of metallicities, periods, and parallaxes. Furthermore, we distinguish two components with different chemical composition and a different relationship between pulsation period and parallax (distance). The prior is defined as a mixture of Gaussian distributions (Gaussian mixture, GM) given by

$\begin{matrix} ({[Fe / H]}_{i}, log P_{i}, ln ϖ_{i}) \sim \sum_{k = 1}^{2} ϕ_{k} MN (μ^{k}, T^{k} Ω^{k} T^{k}), \end{matrix}$ $\begin{aligned} \left(\left[\mathrm{Fe/H} \right]_{i},\log P_{i},\ln \varpi _{i}\right)\sim \mathop \sum \limits _{k=1}^{2}\phi _{k} \mathsf{MN }\left(\boldsymbol{\mu }^{k},\mathrm{T} ^{k}\Omega ^{k}\mathrm{T} ^{k}\right), \end{aligned}$ (9)

where ϕ_k represents the mixing proportion of the kth component of the mixture, and MN is a 3D Gaussian probability density with mean vector $μ^{k} = (μ_{Z}^{k}, μ_{P}^{k}, μ_{ϖ}^{k})$ $\boldsymbol{\mu}^{k}=(\mu_{Z}^{k},\mu_{P}^{k},\mu_{\varpi}^{k})$ , diagonal matrix of standard deviations $T^{k} = diag (σ_{Z}^{k}, σ_{P}^{k}, σ_{ϖ}^{k})$ $\mathrm{T}^{k}=\mathrm{diag}(\sigma_{Z}^{k},\sigma_{P}^{k},\sigma_{\varpi}^{k})$ and correlation matrix

$\begin{matrix} Ω^{k} = (\begin{matrix} 1 & \begin{matrix} 0 & 0 \end{matrix} \\ \begin{matrix} 0 \\ 0 \end{matrix} & Ω_{P, ϖ}^{k} \end{matrix}) . \end{matrix}$ $\begin{aligned} \Omega ^{k}=\left(\begin{array}{cc} 1&\begin{array}{cc} 0&0\end{array}\\ \begin{array}{c} 0\\ 0 \end{array}&\Omega _{P,\varpi }^{k} \end{array}\right). \end{aligned}$ (10)

The parameters of the prior defined by Eq. (9) are themselves model parameters and subject to the Bayesian inference as well. As such, they have their own hyperpriors. We assign a Gaussian prior centred at the first (Q₁) and third (Q₃) quartiles of the distribution of the measurements $([\hat{Fe / H}], log \hat{P}, ln \hat{ϖ})$ $\left(\left[\widehat{\mathrm{Fe/H}}\right],\log\hat{P},\ln\hat{\varpi}\right)$ and a covariance matrix equal to diag (0.2²,0.1²,0.05²) for the mean vector μ^k of each mixture component. This prior is chosen to prevent the non-identifiability of mixture components (the posterior multimodality arising from the fact that swapping the label of the two 3D Gaussian components results in exactly the same solution). We assign a weakly informative exponential prior with inverse scale λ_T^k = 1 to the standard deviations in each T^k. For each correlation submatrix $Ω_{P, ϖ}^{k}$ $\Omega_{P,\varpi}^{k}$ , we specify a LKJ prior (Lewandowski et al. 2009) with ν = 1 degrees of freedom. The choice of prior given by Eq. (9) will prove to be critical for the correct inference of the PLZ relation coefficients for reasons that will become apparent in Sect. 3.

Figure 1 translates into the following likelihood:

$\begin{matrix} \begin{matrix} p (D ∣ Θ) = \prod_{i = 1}^{N} p (d_{i} ∣ Θ) = & \prod_{i = 1}^{N} p ({\hat{[Fe / H]}}_{i} ∣ {[Fe / H]}_{i}, σ_{Z_{i}}) \\ \cdot p (log {\hat{P}}_{i} ∣ log P_{i}, σ_{P_{i}}) \\ \cdot p ({\hat{m}}_{i} ∣ M_{i}, A_{m_{i}}, ϖ_{i}, σ_{m_{i}}) \\ \cdot p ({\hat{ϖ}}_{i} ∣ ϖ_{i}, σ_{ϖ_{i}}, ϖ_{0}) \\ \cdot p ({\hat{A}}_{m_{i}} ∣ A_{m_{i}}, σ_{A_{m_{i}}}) \end{matrix} \end{matrix}$ $\begin{aligned} \begin{split} p\left(\mathcal{D} \mid \mathbf \Theta \right) = \prod _{i=1}^{N}p\left(\mathbf d _{i}\mid \mathbf \Theta \right) =& \prod _{i=1}^{N}p\left(\widehat{[\mathrm {Fe/H}]}_{i}\mid {[\mathrm {Fe/H}]}_{i},\sigma _{Z_{i}}\right) \\& \cdot p\left(\log \hat{P}_{i} \mid \log P_{i},\sigma _{P_{i}}\right) \\& \cdot p\left(\hat{m}_{i}\mid M_{i} ,A_{m_{i}},\varpi _{i},\sigma _{m_{i}}\right) \\& \cdot p\left(\hat{\varpi }_{i}\mid \varpi _{i}, \sigma _{\varpi _{i}},\varpi _0\right) \\& \cdot p\left(\hat{A}_{m_{i}}\mid A_{m_{i}},\sigma _{A_{m_i}}\right) \end{split} \end{aligned}$ (11)

and priors:

$\begin{matrix} \begin{matrix} π (Θ) = & π (ϕ^{{k}}, μ^{{k}}, T^{{k}}, Ω_{P, ϖ}^{{k}}) \cdot π (b, c, k, w) \cdot π (ϖ_{0}) \\ \cdot \prod_{i = 1}^{N} π ({[Fe / H]}_{i}, log P_{i}, ln ϖ_{i} ∣ ϕ^{{k}}, μ^{{k}}, T^{{k}}, Ω^{{k}}) \\ \cdot π (M_{i} ∣ b, c, k, w, log P_{i}, {[Fe / H]}_{i}), \end{matrix} \end{matrix}$ $\begin{aligned} \begin{split} \pi \left(\mathbf \Theta \right) =& \pi \left(\phi ^{\left\{ k\right\} },\boldsymbol{\mu }^{\left\{ k\right\} },\mathrm{T} ^{\left\{ k\right\} },\Omega _{P,\varpi }^{\left\{ k\right\} }\right) \cdot \pi \left(b,c,k,{ w}\right)\cdot \pi \left(\varpi _0\right) \\& \cdot \prod _{i=1}^{N} \pi \left(\left[\mathrm{Fe/H} \right]_{i},\log P_{i},\ln \varpi _{i}\mid \phi ^{\left\{ k\right\} },\boldsymbol{\mu }^{\left\{ k\right\} },\mathrm{T} ^{\left\{ k\right\} },\Omega ^{\left\{ k\right\} }\right)\\& \cdot \pi \left(M_{i}\mid b, c,k,{ w},\log P_{i},{[\mathrm {Fe/H}]}_{i}\right), \end{split} \end{aligned}$ (12)

where each prior probability is defined in Table 1.

Table 1.

Prior (π) definitions for the hierarchical Bayesian model of the PL(Z) relations.

We have encoded our HM using the Stan probabilistic modelling language (Carpenter et al. 2017) and used the No-U-Turn sampler (NUTS) of Hoffman & Gelman (2014) to compute the MCMC samples corresponding to the parameters of interests.

3. Model validation with semi-synthetic data

In this section we aim at validating the improved HM described in Sect. 2 on synthetic data as close as possible to true data sets but exactly following an RR Lyrae PM_{K_s}Z relation from the literature. We simulate three sets A, B, and C of semi-synthetic true absolute magnitudes and parallaxes using that PM_{K_s}Z relation and the apparent magnitudes of the sample described in Sect. 2. The only difference between the synthetic data sets A and B lies in the assumed parallax uncertainties. In data set A we generate parallax uncertainties from an hypothesized distribution, and in data set B we use the TGAS uncertainties. No parallax offset is introduced in the simulations. Data set C is identical to data set A except for the uncertainty on metallicity measurements, which is reduced by half. Our objective is to analyse the impact of the hyperprior choice and the influence of the parallax and metallicity uncertainties on the inferred coefficients under these three scenarios, and detect potential biases in the sample.

3.1. Validation data sets

In what follows, we describe the construction of the three data sets A, B and C that reproduce the generative process of the observations of Eq. (4) from the model hyper-parameters for the three simulated scenarios. In the generation of the semi-synthetic samples, we first used a Bayesian model to draw individual true metallicities and logarithms of the true periods from Gaussian posterior distributions inferred from their measurements and associated uncertainties in the real data set described in Sect. 2.1 with vague Gaussian priors assigned to both sets of parameters. Then the observed values were drawn from a Gaussian distribution centred at the true value and with standard deviation given by the measurement uncertainties. For the particular case of data set C, the metallicity measurements have been generated dividing by 2 the uncertainties in the real sample. We generated true absolute magnitudes from the PM_KZ theoretical calibration of Catelan et al. (2004) adopting [α/Fe] = 0.3 and converting from [Fe/H] to log Z by means of its Eqs. (9) and (10):

$\begin{matrix} M_{K_{s}} = - 2.353 log P + 0.175 [Fe / H] - 0.869 . \end{matrix}$ $\begin{aligned} M_{K_{s}}=-2.353\log P+0.175\left[\mathrm{Fe/H} \right]-0.869. \end{aligned}$ (13)

We generated true parallaxes from

$\begin{matrix} ϖ_{i} = 10^{2 + 0.2 (M_{i} - {m_{0}}_{i})}, \end{matrix}$ $\begin{aligned} \varpi _{i}=10^{2+0.2\left(M_{i}-{m_0}_{i}\right)}, \end{aligned}$ (14)

where m₀_i is the unabsorbed apparent magnitude of the ith star in the real sample. For data set A, we generated measured parallaxes from a Gaussian distribution centred at the true parallaxes given by Eq. (14) with standard deviations drawn from an exponential distribution with inverse-scale parameter equal to 10 plus a zero-point of 0.01 mas. We note that the uncertainties on TGAS parallaxes are higher than these by approximately one order of magnitude, but our objective here is to evaluate the performance of our HM under the small uncertainties typical of the Gaia DR2. The measured parallaxes of data set B were generated using the TGAS parallax uncertainties.

Figure 2 represents in the top panel the values of the absolute magnitudes and logarithms of the true periods generated for simulations A and B. The red line shows the PL relation for the median value of [Fe/H] in the simulated sample. Because this plot is a 2D projection of the 3D PM_{K_s}Z relation and the absolute magnitudes were sampled from it, any deviations from the red line can only be explained by metallicities differing from the median and the intrinsic dispersion of the relation (which is symmetric). A correlation between periods and metallicities is evident, which results in correlated residuals (lower panel) with respect to the assumed PL for the median true metallicity. We also observe in the figure a correlation with the distance: brighter magnitudes correspond (on average) to longer periods and lower metal abundances at larger distances (lower parallaxes) and vice versa. Figure 3 demonstrates that the correlation between periods, metallicities, and parallaxes is not an added effect in the simulations and is present in the measurements both for the simulated and the real data set. The figure represents the measured metallicities versus the logarithm of the period for simulations A and B and for the real sample. The colour code reflects the natural logarithm of the parallaxes: the same true value for simulations A and B in the top panel, the observed value for simulation B in the middle panel, and the measured value of the TGAS catalogue in the bottom panel. The black crosses denote the first and third quartiles of the marginal distributions along each axis. In principle, we expect the period distribution to be independent of distance. However, Fig. 3 shows that the left half of the plot (stars with short periods) is predominantly populated by stars with larger parallaxes and higher metallicities, while the right half is, again on average, predominantly populated by distant stars (smaller parallaxes) with lower metallicities. We observe that the correlation between the simulated true parallaxes, measured periods, and metallicities shown in the top panel of the figure persists for the measured parallaxes depicted in the middle panel, although with larger dispersion due to the higher parallax uncertainties of simulation B. We also note that the correlation is also present in the real sample of TGAS parallaxes (bottom panel). The interpretation is as follows: we expect nearby stars in our sample to be characterised on average by the higher metallicities of the disc, while the opposite is true for those farther away in the halo. This dependence of distance on metallicity is visible in the colour code of Fig. 3 and is crudely characterised by the two black crosses in each panel. Each cross represents the projection of the point Q_k ([Fe/H],log P,lnω) (with Q_k denoting the kth quartile for k = 1, 2) onto the period–metallicity plane. The crosses in the top panel of the figure correspond to distances of 1.87 and 1.07 kpc for a lower (−1.72 dex) and higher (−1.11 dex) metallicity component, respectively.

Fig. 2.

Observed absolute magnitudes as a function of the true decadic logarithm of the period for simulations A and B. The red solid line represents the projection of the PM_{K_s}Z relation of Eq. (13) for a value of the true metallicity equal to the median of the values generated according to the text. Colours encode the simulated true parallaxes according to the logarithmic scale on the right.

Fig. 3.

Scatter plot of observed metallicities and log(P) for validation sets A and B and the real sample used in the paper. The black crosses represent the first (Q₁) and third (Q₃) quartile of the distribution of measured log(P) and [Fe/H] for simulations A and B (top and middle panel) and the real sample (bottom panel). The colour encodes the natural logarithm of true parallaxes of simulations A and B (top panel), measured parallaxes of simulation B (middle panel), and TGAS parallax estimates (bottom panel).

These correlations show themselves in the period luminosity diagram of Fig. 2 in the following way. Because higher metallicities correspond to shorter periods (as illustrated in Fig. 3), we then expect the nearby stars (which we recall are on average more metal-rich) to be characterised by shorter periods (the left half of the PL diagram). The opposite is also true: the distant (small parallax) halo stars have on average lower metallicities and hence longer periods (the right half of the PL diagram). This scenario is then prone to systematic biases in the log(P) slope inference results because it is precisely at the right edge of the PL diagram that there is a concentration of the most distant sources that will inevitably be characterised by larger fractional parallax uncertainties. We know that in general, the prior plays a minor role when the uncertainties are small because a narrow likelihood dominates the posterior. The opposite is true for stars with large fractional parallax uncertainties: the likelihood is barely informative, and it is the prior that dominates the posterior. Bailer-Jones (2015) showed very instructive illustrations of this for the problem of inferring distances for parallaxes under several prior specifications. In our case, the stars for which the prior has a larger impact on the inference of the parallax are predominantly placed at the rightmost range of periods.

These relatively hidden correlations have important consequences for the inference as we show below. In particular, they have an impact on the choice of prior. In general, the parallax prior has to have support (non-vanishing values) in the entire range of true parallaxes. This is even more important given the correlation between periods and true parallaxes, however, because in the case of our simulations, long periods have the smallest parallaxes on average. If the prior has zero probability density for the small true parallaxes, and given the relatively large parallax uncertainties in our sample, the model will systematically assign parallaxes larger than the true ones (will overestimate them). Because in both simulated data sets the full range of absolute magnitudes is reduced to a brighter magnitude range for lower parallaxes and because of the strong deterministic relationship between absolute magnitudes and true parallaxes established by Eq. (8) of the HM, the model will infer absolute magnitudes fainter than the true (brighter) ones. Finally, if the stars with long periods are assigned fainter absolute magnitudes, the model will systematically underestimate the absolute value of the period slope coefficient of the PLZ relation. Hence, if our interpretation is correct, the distance prior has to be made dependent on metallicity. The model described in Sect. 2 addresses this problem using a 3D prior that distinguishes between two probabilistic classes of metal abundance in the data and constrains the true parallaxes of each class by means of their relationship with periods. Figure 4 shows an example of our 3D GM prior fitted to the distribution of true ([Fe/H], log P, lnϖ) in simulation B using a Bayesian HM. The figure represents the projection of the fitted probability density function (PDF) onto the plane period-parallax (with the parallax represented in linear scale) and depicts the measured pairs log(P)-parallax in the simulated data set. We note that without a proper modelling of the selection effect that gives rise to the correlation between periods and true parallaxes, the inference will return a severely underestimated log(P) slope, as we demonstrate towards the end of this section, where our model is compared with a model that uses 1D independent prior distributions for period, metallicity, and parallax. This comparison illustrates the shrinking power of hierarchical models.

Fig. 4.

Projection of a two-component 3D GM PDF fitted to the true values of ([Fe/H], log P, lnϖ) in data set B onto the plane period-parallax. The grey contours represent iso-probability lines. The coloured and grey points represent true and measured parallaxes, respectively. The black crosses indicate the median of the mean posterior distribution of each GM component. The colour encodes the metallicity.

3.2. Validation results

Figure 5 compares the parallaxes inferred by our HM with the true parallaxes simulated for data sets A (top panel) and B (bottom panel). The colours in both panels represent the log(P) according to the colour scale to the right. For simulation B we observe that lower parallaxes (typical of the longer periods) are slightly overestimated. We also observe that the smallest (and overestimated) parallaxes correlate with longer periods, as expected. The largest parallax overestimations do not correspond systematically to the longest periods, however, which is guaranteed by the 3D prior used in our HM. The overestimation of the smallest parallaxes is interpreted as a deficiency of our Gaussian 3D prior that fails to adequately represent the more complex spatial distribution of RR Lyrae stars in the Galaxy.

Fig. 5.

Comparison between inferred and true parallaxes for validation sets A and B. The solid lines represent the bisectors and the colour encodes the value of the decadic logarithm of the period in days according to the colour scale on the right.

Figure 6 compares for simulations A and B the inferred absolute magnitudes with the true periods and represents the PM_KZ relation inferred in each case. Table 2 summarises the coefficients of the resulting PM_KZ relations providing 68% credible intervals around the median of the posterior distributions of them. For simulation B (bottom panel of Fig. 6) we observe that brighter magnitudes are slightly underestimated for long-period stars. This mild bias is not present for data set A (upper panel) because the negligible parallax uncertainties tightly constrain the model parameters (the true parallaxes and hence, the absolute magnitudes and the slopes of the relation). To better appreciate this result, we suggest that the inferred absolute magnitudes of the two scenarios in Fig. 6 are also compared with their simulated values in Fig. 2. The mild underestimation of brighter magnitudes for simulation B translated into a mild underestimation of the inferred log(P) slope (second row of Table 2) with a credible interval of ${- 2.13}_{- 0.78}^{+ 0.75}$ ${-2.13}_{-0.78}^{+0.75}$ mag dex⁻¹, which in any case is in good agreement with the value c = −2.35 used for the simulation taken into account the large parallax uncertainties in this case. For simulation A, the credible interval obtained for the log(P) slope was ${- 2.48}_{- 0.14}^{+ 0.15}$ ${-2.48}_{-0.14}^{+0.15}$ mag dex⁻¹, which slightly overestimate c = −2.35. We hypothesize that this mild overestimation of the log(P) slope is a consequence of the correlation between period and metallicity and the relatively large uncertainties of measured metallicities. In order to evaluate this hypothesis, we used the third semi-synthetic data set (labelled C), whose uncertainties on simulated metallicity measurements are of the order of magnitude that is typical for high-resolution spectroscopic techniques. The results for simulation C, presented in the bottom row of Table 2, confirm our intuition.

Fig. 6.

Comparison between inferred absolute magnitudes and true periods for validation sets A and B. The black solid and red dashed lines represent the projections of the PM_KZ relation of Eq. (13) and the relation inferred by our HM (adopting the median value of the posterior distribution of each coefficient), respectively, for a value of the true metallicity equal to the median of the values generated according to the text. The colour encodes the metallicity according to the scale on the right.

Table 2.

Coefficients of the PM_KZ relations inferred from the measurements of simulations A, B, and C: slopes (c and k), zero-point (b), and intrinsic dispersion (w).

Table 3 compares the results obtained with the 3D GM prior discussed above with those of alternative 1D priors based on the log-normal prior used in Paper I and the EDVD prior of Bailer-Jones (2015) given by Eq. (15).

$\begin{matrix} p (ϖ) = \frac{1}{2 L^{3} ϖ^{4}} exp (- \frac{1}{ϖ L}) \cdot \end{matrix}$ $\begin{aligned} p\left(\varpi \right)=\frac{1}{2L^{3}\varpi ^{4}}\exp \left(-\frac{1}{\varpi L}\right)\cdot \end{aligned}$ (15)

Table 3.

Coefficients of the PM_KZ relations inferred for the simulated scenarios A and B by an HM using the 3D GM prior and two 1D parallax priors (the log-normal prior used in Paper I and the EDVD prior of Bailer-Jones 2015): slopes (c and k), zero-point (b), and intrinsic dispersion (w).

It shows that the existing correlations amongst periods, parallaxes, and metallicities have a small impact in the context of the small parallax uncertainties that characterise simulation A, but significantly affect (worsen) the inference outcome for the typical TGAS uncertainties. Figure 7 illustrates these facts by comparing the parallaxes inferred by an HM with an EDVD prior with the true parallaxes of both semi-synthetic data sets A and B. We observe that in simulation B (bottom panel) the overestimation and underestimation of the inferred parallaxes is severer than in Fig. 5 of Sect. 3. It is important to bear in mind that although succesive Gaia data releases will tend to decrease the measurement uncertainties in general, the community will often work with samples of stars (not necessarily classical pulsators like in the case of Paper I; Muraveva et al. 2018a) with Gaia uncertainties in the range exemplified by our TGAS sample.

Fig. 7.

Comparison between inferred and true parallaxes for validation sets A and B when an HM with an EDVD prior is used. The colour encodes the value of the decadic logarithm of the period in days according to the colour scale on the right.

4. Application to the RR Lyrae Gaia DR1 data

Table 4 presents summary statistics associated with the PM_KZ relationships obtained by our HM trained with the RR Lyrae sample described in Sect. 2 for the cases of a potential TGAS global parallax offset inferred by the model or fixed to literature values. Figure 8 shows the MCMC posterior samples (in 2D projections) of the relationship parameters for the model with inferred offset. The black contours in the figure represent iso-probability lines. We see clear correlations between the three strong parameters (two slopes and the intercept). The posterior medians of the log(P) slope, the metallicity slope and the intercept are −2.1 mag dex⁻¹, +0.25 mag dex⁻¹ and −0.79 mag, respectively (top portion of Table 4). The inferred period slope is consistent with the values reported in the literature for empirical and theoretical studies (see Table 3 of Muraveva et al. 2015). The metallicity slope is systematically higher than the values reported from empirical studies, but is in good agreement with the theoretical calibrations of Bono et al. (2003) and Catelan et al. (2004). The posterior median of the intrinsic width is 0.15 mag. The credible interval of the parallax offset ϖ₀ = +0.014 ± 0.032 disagrees with the negative estimate ϖ₀ = −0.036 ± 0.002 of Arenou et al. (2017), but is consistent with the MAP estimate ϖ₀ = +0.02 inferred by the probabilistic approach of Sesar et al. (2017) using W2 band RR Lyrae data. However, we explain a hypothetical overtestimation (towards positive values) of the global parallax offset inferred by our HM model as follows. As we showed in Sect. 3.2, the model with 3D GM prior mitigates the systematic overstimation of smaller parallaxes for the longest periods in a scenario of large parallax uncertainties. As a consequence of this, the lowest parallaxes inferred by the model are on average smaller that their measurements (with the obvious exception of parallaxes with negative measurements), which in turn leads to a positive offset estimate. We stress that the parallax offset reported in this paper should no to be used as a reliable estimate of the potential TGAS offset. On the contrary, the global parallax offset ϖ₀ = −0.057 reported in Muraveva et al. (2018a) for Gaia DR2 was inferred from a scenario of sufficiently precise parallax uncertainties in which the parallax prior played a relatively minor role and is therefore in principle more reliable.

Table 4.

Coefficients of the PM_KZ relation inferred from the real RRL sample described in Sect. 2: slopes (c and k), zero-point (b), intrinsic dispersion (w), global parallax offset (ϖ₀), and LMC distance modulus (μ_LMC).

Fig. 8.

Marginal posterior distributions from the MCMC samples of the PM_KZ relationship parameters for our HM with the potential TGAS parallax offset included as a model parameter. The diagonal shows the uni-dimensional marginal distributions for the slope of the log(P) term in the linear relation, the slope of the metallicity term, the intercept, and the intrinsic dispersion of the relationship. Green and red lines point towards the median and the MAP estimate, respectively, of each 1D posterior marginal distribution.

The bottom portion of Table 4 presents the PLZ relationship parameters inferred by our HM with a global parallax offset fixed to the value ϖ₀ = −0.036 mas estimated by Arenou et al. (2017). We do not observe major differences in the slopes with regard to those obtained by the HM based on the inferred offset of +0.014 mas (see top portion of the table). Nevertheless, the difference between the intercepts is equal to 0.11 mag, which translates into a Large Magellanic Cloud (LMC) distance modulus 0.09 mag longer than inferred by the model with offset fixed to ϖ₀ = −0.036 mas. The distance moduli were estimated from a sample of 70 RRLs located close to the LMC bar, with photometry in the K_s band and spectroscopically measured metallicities (described and used in Paper I; Muraveva et al. 2018a).

Figure 9 shows a comparison between the parallaxes catalogued in TGAS and the posterior estimates of our hierarchical model. The horizontal error bars represent TGAS uncertainties and the vertical ones are given by 68% credible intervals calculated around the median of the marginal posterior distributions. Our hierarchical model is capable of reducing (“shrinking”) the uncertainties using the constraint that the absolute magnitudes must follow a linear relationship with (the logarithm of the) periods and metallicities with a slope in agreement with previous estimates. The median of the standard deviations of the posterior samples is 0.07 mas with a maximum value of 0.2, which is the minimum value of the TGAS parallax uncertainties. As shown in Fig. 9, the maximum uncertainties of the MCMC parallax samples correspond to the same stars with minimum TGAS uncertainties (those with maximum TGAS parallax measurements). This means that the hierarchical model is not capable of significantly improving the parallax uncertainties of the stars near the Sun. We also see that there are stars with TGAS and HM parallaxes that disagree beyond the error bars. We plot in red stars that are two to three standard deviations away from the diagonal (as measured in the 2D plane of Fig. 9 by the Mahalanobis distance $\sqrt{{(x - μ)}^{T} Σ^{- 1} (x - μ)}$ $\sqrt{(\mathbf{x}-\boldsymbol{\mu})^T\mathbf{\Sigma}^{-1}(\mathbf{x}-\boldsymbol{\mu})}$ , where x is the vector (ϖ_TGAS,ϖ_HM) and μ is the perpendicular projection of x onto the diagonal).

Fig. 9.

Comparison between the TGAS parallaxes and the maximum a posteriori estimates from the HM. The error bars correspond to the TGAS parallax uncertainties (horizontal) and credible intervals calculated as the median plus minus the difference between the median and the 84th and 16th percentile (vertical). The red circles correspond to stars with parallax difference beyond twice the combined uncertainties.

Figure 10 shows the PL relations derived from the HM. Each grey line corresponds to one sample in the Markov chain. All PM_KZ relations have been particularised to a value of the metallicity [Fe/H] = −1.46 dex, which is the median of the distribution of inferred values. In the left-hand panel we show the values of the absolute magnitude in the K band derived from the MCMC samples as

$\begin{matrix} M_{K}^{i, n} = c^{n} \cdot log (P^{i, n}) + k^{n} \cdot {[Fe / H]}^{i, n} + b^{n}, \end{matrix}$ $\begin{aligned} M_{\rm K}^{i,n} = c^{n}\cdot \log (P^{i,n})+k^{n}\cdot {[\mathrm {Fe/H}]}^{i,n}+b^{n}, \end{aligned}$ (16)

where the superindex i tags stars (from 1 to N) and the superindex n tags the sample in the MCMC set of samples. In the right-hand panel we show the same diagram, but computing the absolute magnitude from the measured apparent magnitude and the MCMC parallax:

$\begin{matrix} M_{K}^{i, n} = {\hat{m}}_{0}^{i} + 5 \cdot log (ϖ^{i, n}) - 10, \end{matrix}$ $\begin{aligned} M_{\rm K}^{i,n} = \hat{m}_0^i+5\cdot \log (\varpi ^{i,n})-10, \end{aligned}$ (17)

where ${\hat{m}}_{0}^{i}$ $\hat{m}_0^i$ represents the measured value of the apparent magnitude corrected for the measured absorption (i.e. we only use the parallaxes from the model, but the absorptions and apparent magnitudes used in Eq. (17) are the measured values in the sample described in Sect. 2). The black circles correspond to the discrepant sources marked by red circles in Fig. 9. The outlier at log(P) ≈ −0.26 corresponds to V363 Cas. This star was classified as a double-mode pulsator by Hajdu et al. (2009). A detailed analysis of its nature is beyond the scope of this paper, but we note that it would be consistent with its discrepant position in the diagrams. The two panels of Fig. 10 can be compared to the period-absolute magnitude diagram of Fig. 11, in which the absolute magnitudes were predicted from a PLZ relation whose coefficients were estimated through fitting by weighted non-linear least-squares the following equation:

$\begin{matrix} \hat{ϖ} 10^{0.2 {\hat{m}}_{0} - 2} = 10^{0.2 (c log \hat{P} + k [\hat{Fe / H}] + b)}, \end{matrix}$ $\begin{aligned} \hat{\varpi }10^{0.2\hat{m}_{0}-2}=10^{0.2\left(c\log \hat{P}+ k\left[\widehat{\mathrm{Fe/H} }\right]+b\right)}, \end{aligned}$ (18)

where the dependent variable $α = \hat{ϖ} 10^{0.2 {\hat{m}}_{0} - 2}$ $\alpha=\hat{\varpi}10^{0.2\hat{m}_{0}-2}$ in the left side of Eq. (18) is the astrometry-based luminosity (ABL) defined by Arenou & Luri (1999), where $\hat{ϖ}$ $\hat{\varpi}$ denotes the TGAS parallax. The red line in Fig. 11 represents the projection of the PLZ relation derived by this method for a value of the metallicity equal to its median in the sample. The log(P) slope, metallicity slope, and intercept estimates were c = −1.34 ± 0.95 mag dex⁻¹, k = 0.20 ± 0.13 mag dex⁻¹, and b = −0.62 ± 0.40 mag, respectively.

Fig. 10.

Left panel: samples of the PLZ relations derived from the MCMC samples for [Fe/H] = −1.46 (the median of the distribution of inferred metallicities; grey lines) and period-M_K values inferred by the HM and computed according to Eq. (16). Right panel: as in the left panel, but with M_K computed according to Eq. (17). The colour encodes the inferred metallicity according to the scale on the right.

Fig. 11.

PLZ relations defined by the MCMC samples for the value of [Fe/H] = −1.46 (dark grey lines) and the measured periods, and absolute magnitudes predicted by using the ABL method described in the text. The red line represents the projection of the PLZ relation used for predictions for a value of the metallicity equal to its median in the sample.

One of the advantages of addressing the problem of calibrating a PL(Z) relationship by means of a Bayesian HM is that the posterior distribution of any parameter of interest can be inferred. In particular, we aimed to derive individual heliocentric distances to the RRL stars in our sample and locate their positions in the Galaxy. For this, the measured coordinates (α, δ) of our RRL stars were first transformed to Galactic coordinates (l, b). We then derived for each RR Lyrae samples of the posterior distribution of its rectangular coordinates (x_i, y_i, z_i) from samples of its posterior parallax (ϖ_i) in the Cartesian Galactocentric coordinate system of Jurić et al. (2008) by

$\begin{matrix} \begin{matrix} x_{i}^{n} & = R_{⊙} - d_{i}^{n} cos (l_{i}) cos (b_{i}) \\ y_{i}^{n} & = - d_{i}^{n} sin (l_{i}) cos (b_{i}) \\ z_{i}^{n} & = d_{i}^{n} sin (b_{i}), \end{matrix} \end{matrix}$ $\begin{aligned} \begin{split} x_{i}^{n}&=R_\odot -d_{i}^{n}\cos \left(l_{i}\right)\cos \left(b_{i}\right)\\ { y}_{i}^{n}&=-d_{i}^{n}\sin \left(l_{i}\right)\cos \left(b_{i}\right)\\ z_{i}^{n}&=d_{i}^{n}\sin \left(b_{i}\right),\\ \end{split} \end{aligned}$ (19)

where $d_{i}^{n} = 1 / ϖ_{i}^{n}$ $d_{i}^{n}=1/\varpi_{i}^{n}$ denotes the posterior distance calculated as the reciprocal of the posterior parallax and R_⊙ = 8.3 kpc is the adopted distance to the Galactic centre (GC; Gillessen et al. 2009). The samples of the posterior distribution of the radial distance of each star to the GC were calculated as

$\begin{matrix} {r_{GC}}_{i}^{n} = \sqrt{{(x_{i}^{n})}^{2} + {(y_{i}^{n})}^{2}} . \end{matrix}$ $\begin{aligned} {r_{\mathrm{GC} }}_{i}^{n}=\sqrt{\left(x_{i}^{n}\right)^{2}+ \left({ y}_{i}^{n}\right)^{2}}. \end{aligned}$ (20)

Figure 12 represents the spatial distribution and metal abundance of our RR Lyrae sample in the plane z − r_GC associated with the Galactocentric reference frame. Each Sun-centred circumference in the figure corresponds to Galactocentric coordinates calculated from the Galactic longitudes l = 0° and l = 180° (at any Galactic latitude) and the distances corresponding to the median of the mean log-parallax posterior distribution associated with each Gaussian mixture component of our HM.

Fig. 12.

Spatial distribution and metallicity of the RR Lyrae sample represented in a Galactocentric reference frame. r_GC and z denote the radial distance to the GC and the vertical distance with regard to the Galactic midplane x − y, respectively. The Cartesian Galactocentric coordinates and the radial distances to the GC have been summarised by the median plus minus the difference in absolute value between the median and the 84th and 16th percentile of the posterior samples of Eqs. (19) and 20.

5. Sensitivity analysis

In this section we analyse the sensitivity of the inference results obtained by our HM with a joint 3D GM prior for metallicity, period, and parallax to variations of some critical parameters assigned to the different prior distributions. In the following, we consider the hyperparameters of Table 1 (for which the results presented in Sect. 4 were obtained) as reference values and compare these results with those obtained by varying the values of the hyperparameters. Table 5 compares the posterior medians and credible intervals of the PLZ relationship parameters for the different values of the prior hyperparameters. Its third row lists the reference results introduced in Sect. 4.

Table 5.

Summary statistics corresponding to the sensitivity analysis performed to the HM with joint 3D GM prior presented in this paper: slopes (c and k), zero-point (b), intrinsic dispersion (w), and global parallax offset (ϖ₀).

In Sect. 2 we chose the prior distributions of the PLZ relationship coefficients for obvious reasons to be as non-informative as possible. For the sample of RRL stars we expect significant variations of the posterior distributions for other choices of their prior hyperparameters because the prior plays a major role in this range of uncertainties. In particular the assignment of the slightly more informative popular Cauchy prior with scale parameter σ = 2.5 of Gelman et al. (2008) to the slopes c and k gave rise to a higher absolute value of the log(P) slope posterior median and a slightly narrower credible interval (top portion of Table 5). The closeness of the posterior median and MAP estimates indicates that the MCMC algorithm successfully explored the complex parameter space of the problem in this case.

We have also tried different values for the inverse scale hyperparameter λ_w of the intrinsic width prior distribution from 0.1 to 10 kpc. The results (middle part of the table) indicate a slight decrease of the intrinsic dispersion as λ_w increases.

The most critical HM hyperparameters are those that correspond to the GM prior of Eq. (9), which models the true distribution of metallicities, periods, and parallaxes. We have assigned informative hyperparameters to the standard deviations $(σ_{μ_{Z}}^{k}, σ_{μ_{P}}^{k}, σ_{μ_{ϖ}}^{k})$ $\left(\sigma_{\mu_{Z}}^{k},\sigma_{\mu_{P}}^{k},\sigma_{\mu_{\varpi}}^{k}\right)$ of the 3D Gaussian prior chosen for the mean vector μ^k of each GM component. In particular, we chose for the logarithm of parallax $σ_{μ_{ϖ}}^{k} = 0.05$ $\sigma_{\mu_{\varpi}}^{k}=0.05$ . We also tested the values 0.1 and 0.5 with the results listed in the bottom part of Table 5, where we observe that the log(P) slope is severely underestimated for $σ_{μ_{ϖ}}^{k} = 0.5$ $\sigma_{\mu_{\varpi}}^{k}=0.5$ . For this latter case, the inferred posterior medians of μ_ϖ were equal to −0.50 and −0.04 (equivalent to 0.60 and 0.95 mas or 1.65 and 1 kpc). These are the only cases in which the inferred parallax offset turns out to be negative and reflect the fact that the 3D GM prior does not adequately constrict the range of smaller parallaxes.

6. Summary and conclusions

We have applied the hierarchical Bayesian method to infer estimates for the parameters of the PLZ relationship in the K band for fundamental and first-overtone RR Lyrae stars. We extended the analysis performed in Paper I by testing new prior distributions and analysing correlations in the data, their influence on the inference, and the consequences of the prior choice.

In Sect. 3 we have demonstrated through the use of semi-synthetic data that the RR Lyrae sample used in Paper I presents strong correlations that result in different spatial distributions for the different metallicites and periods. As a result, the larger parallax uncertainties are not spread uniformly in period, but are concentrated in the region of long periods, thus making the inference results strongly dependent on the prior. This is the main result of this work. We proved that in the context of significant parallax uncertainties (this amounts to a median fractional uncertainty σ_ϖ/ϖ of 0.43 in the TGAS samples), simple independent priors will result in systematically biased estimates of the PLZ slopes and intercept. For small parallax uncertainties (typically one order of magnitude smaller than the TGAS uncertainties) the effect of such correlations on the parameters of the PLZ relation inferred by our HM for a wide variety of priors is small. In this simulated scenario, our HM is able to successfully recover the PLZ relation of Catelan et al. (2004) given by Eq. (13) independently of the prior choice. Conversely, for the TGAS parallax uncertainties used in Paper I, we propose a mixture of two Gaussian 3D components with correlations between periods and parallaxes. We proved that this prior is much less affected by the correlations in the data set and that it recovers the correct parameters for semi-synthetic data with metallicity uncertainties that are half of those available in the literature. The inadequacy of 1D priors for the inference of PL(Z) relations and the necessity of modelling the correlations in the data set is the second main result of our study.

In Sect. 4 we applied our HM to the sample of 200 fundamental and first-overtone RR Lyrae stars and Gaia DR1 parallaxes used in Paper I. The value of the PM_KZ coefficients thus derived ( $c = {- 2.10}_{- 0.75}^{+ 0.87}$ $c\,{=}\,{-2.10}_{-0.75}^{+0.87}$ , $k = {0.25}_{- 0.11}^{+ 0.12}$ $k\,{=}\,{0.25}_{-0.11}^{+0.12}$ , $b = {- 0.79}_{- 0.32}^{+ 0.36}$ $b\,{=}\,{-0.79}_{-0.32}^{+0.36}$ , and $w = {0.15}_{- 0.05}^{+ 0.06}$ $\mathit{w}\,{=}\,{0.15}_{-0.05}^{+0.06}$ ) can be compared with the estimate derived from the much more precise measurements of Gaia DR2 ( $c = {- 2.58}_{- 0.20}^{+ 0.20}$ $c\,{=}\,{-2.58}_{-0.20}^{+0.20}$ , $k = {0.17}_{- 0.03}^{+ 0.03}$ $k\,{=}\,{0.17}_{-0.03}^{+0.03}$ , $b = {- 0.84}_{- 0.09}^{+ 0.09}$ $b\,{=}\,{-0.84}_{-0.09}^{+0.09}$ , and $w = {0.16}_{- 0.01}^{+ 0.01}$ $\mathit{w}\,{=}\,{0.16}_{-0.01}^{+0.01}$ ; Muraveva et al. 2018a). We see that the 68% credible intervals have large overlap regions that make the two estimates fully consistent. We note that the results presented in Muraveva et al. (2018a) already incorporate the findings of the study presented here, with the only exception that the 3D prior for parallaxes, periods, and metallicities is not a mixture of Gaussians, but a single Gaussian distribution: in the typical parallax uncertainty regime of Gaia DR2, the data are sufficiently precise to distinguish the two metallicity populations without enforcing this separation in the prior.

¹

See https://www.cosmos.esa.int/web/gaia/science-performance

Acknowledgments

This study has made use of data gathered by the European Space Agency (ESA) Gaia mission (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. This work was supported in part by the Spanish Ministerio de Ciencia, Innovación y Universidades through the grant AyA2018-84089, by PRIN-INAF2014, “EXCALIBUR’S” (P.I. G. Clementini), by the Agenzia Spaziale Italiana (ASI) through grants ASI I/058/10/0 and ASI 2014-025-R.1.2015, and by Premiale 2015, “MITiC” (P.I. B. Garilli). The statistical analysis carried out in this work has extensively used the R environment for statistical computing (R Core Team 2017) and its package Rstan (Stan Development Team 2018).

References

Arenou, F., & Luri, X. 1999, in Harmonizing Cosmic Distance Scales in a Post-Hipparcos Era, eds. D. Egret, & A. Heck, ASP Conf. Ser., 167, 13 [NASA ADS] [Google Scholar]
Arenou, F., Luri, X., Babusiaux, C., et al. 2017, A&A, 599, A50 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Arenou, F., Luri, X., Babusiaux, C., et al. 2018, A&A, 616, A17 [Google Scholar]
Bailer-Jones, C. A. L. 2015, PASP, 127, 994 [NASA ADS] [CrossRef] [Google Scholar]
Bono, G., Caputo, F., Castellani, V., et al. 2003, MNRAS, 344, 1097 [NASA ADS] [CrossRef] [Google Scholar]
Borissova, J., Rejkuba, M., Minniti, D., Catelan, M., & Ivanov, V. D. 2009, A&A, 502, 505 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Cacciari, C., & Clementini, G. 2003, in Stellar Candles for the Extragalactic Distance Scale, eds. D. Alloin, & W. Gieren (Berlin: Springer Verlag), Lect. Notes Phys., 635, 105 [NASA ADS] [CrossRef] [Google Scholar]
Cardelli, J., Clayton, G., & Mathis, J. 1989, ApJ, 345, 245 [NASA ADS] [CrossRef] [Google Scholar]
Carpenter, B., Gelman, A., Hoffman, M., et al. 2017, J. Stat. Soft., 76, 1 [Google Scholar]
Casertano, S., Riess, A. G., Bucciarelli, B., & Lattanzi, M. G. 2017, A&A, 599, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Catelan, M., Pritzl, B., & Smith, H. 2004, ApJS, 154, 633 [Google Scholar]
Clementini, G., Gratton, R., Bragaglia, A., et al. 2003, AJ, 125, 1309 [NASA ADS] [CrossRef] [Google Scholar]
Clementini, G., Ripepi, V., Leccia, S., et al. 2016, A&A, 595, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Clementini, G., Ripepi, V., Molinaro, R., et al. 2019, A&A, 622, A60 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Dambis, A. K., Berdnikov, L. N., Kniazev, A. Y., et al. 2013, MNRAS, 435, 3206 [NASA ADS] [CrossRef] [Google Scholar]
Freedman, W. L., Madore, B. F., Gibson, B. K., et al. 2001, ApJ, 553, 47 [NASA ADS] [CrossRef] [Google Scholar]
Gaia Collaboration (Clementini, G., et al.) 2017, A&A, 605, A79 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. 2004, Bayesian Data Analysis (Boca Raton: Chapman& Hall/CRC) [Google Scholar]
Gelman, A., & Hill, J. 2007, Data Analysis Using Regression and Multilevel/Hierarchical Models, Analytical Methods for Social Research (Cambridge: Cambridge University Press) [Google Scholar]
Gelman, A., Jakulin, A., Pittau, M. G., & Su, Y.-S. 2008, Ann. Appl. Stat., 2, 1360 [CrossRef] [Google Scholar]
Gieren, W., Górski, M., Pietrzyński, G., et al. 2013, ApJ, 773, 69 [NASA ADS] [CrossRef] [Google Scholar]
Gillessen, S., Eisenhauer, F., Trippe, S., et al. 2009, ApJ, 692, 1075 [NASA ADS] [CrossRef] [Google Scholar]
Gratton, R. G., Bragaglia, A., Clementini, G., et al. 2004, A&A, 421, 937 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hajdu, G., Jurcsik, J., & Sodor, A. 2009, Inf Bull. Variab. Stars, 5882 [Google Scholar]
Hoffman, M. D., & Gelman, A. 2014, J. Mach. Learn. Res., 15, 1593 [Google Scholar]
Jurić, M., Ivezić, Ž., Brooks, A., et al. 2008, ApJ, 673, 864 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
Lauritzen, S. 1996, Graphical Models (Oxford: Oxford University Press) [Google Scholar]
Leavitt, H. S., & Pickering, E. C. 1912, Harv. Coll. Obs. Circ., 173, 1 [Google Scholar]
Lewandowski, D., Kurowicka, D., & Joe, H. 2009, J. Multivar. Anal., 100, 1989 [CrossRef] [Google Scholar]
Lindegren, L., Lammers, U., Bastian, U., et al. 2016, A&A, 595, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Longmore, A. J., Fernley, J. A., & Jameson, R. F. 1986, MNRAS, 220, 279 [NASA ADS] [Google Scholar]
Luri, X., Brown, A. G. A., Sarro, L. M., et al. 2018, A&A, 616, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Madore, B. F., & Freedman, W. L. 1991, PASP, 103, 933 [NASA ADS] [CrossRef] [Google Scholar]
Marconi, M., Musella, I., & Fiorentino, G. 2005, ApJ, 632, 590 [NASA ADS] [CrossRef] [Google Scholar]
Marconi, M., Coppola, G., Bono, G., et al. 2015, ApJ, 808, 50 [NASA ADS] [CrossRef] [Google Scholar]
Muraveva, T., Palmer, M., Clementini, G., et al. 2015, ApJ, 807, 127 [NASA ADS] [CrossRef] [Google Scholar]
Muraveva, T., Delgado, H. E., Clementini, G., Sarro, L. M., & Garofalo, A. 2018a, MNRAS, 481, 1195 [NASA ADS] [CrossRef] [Google Scholar]
Muraveva, T., Garofalo, A., Scowcroft, V., et al. 2018b, MNRAS, 480, 4138 [NASA ADS] [CrossRef] [Google Scholar]
Neeley, J. R., Marengo, M., Bono, G., et al. 2017, ApJ, 841, 84 [NASA ADS] [CrossRef] [Google Scholar]
Pearl, J. 1988, Probabilistic Reasoning in Intelligent Systems: Networks of Plausble Inference (Burlington: Morgan Kaufmann Pub) [Google Scholar]
Preston, G. 1959, ApJ, 130, 507 [NASA ADS] [CrossRef] [Google Scholar]
R Core Team. 2017, R: A Language and Environment for Statistical Computing (Vienna, Austria: R Foundation for Statistical Computing) [Google Scholar]
Riess, A. G., Macri, L., Casertano, S., et al. 2011, ApJ, 730, 119 [NASA ADS] [CrossRef] [Google Scholar]
Riess, A. G., Macri, L. M., Hoffmann, S. L., et al. 2016, ApJ, 826, 56 [NASA ADS] [CrossRef] [Google Scholar]
Riess, A. G., Casertano, S., Yuan, W., et al. 2018, ApJ, 861, 126 [Google Scholar]
Ripepi, V., Moretti, M. I., Marconi, M., et al. 2012, MNRAS, 424, 1807 [NASA ADS] [CrossRef] [Google Scholar]
Robert, C., & Casella, G. 2013, Monte Carlo Statistical Methods, Springer Texts in Statistics (New York: Springer) [Google Scholar]
Saha, A., Thim, F., Tammann, G. A., Reindl, B., & Sandage, A. 2006, ApJS, 165, 108 [NASA ADS] [CrossRef] [Google Scholar]
Schlegel, D., Finkbeiner, D., & Davis, M. 1998, ApJ, 500, 525 [NASA ADS] [CrossRef] [Google Scholar]
Sesar, B., Fouesneau, M., Price-Whelan, A. M., et al. 2017, ApJ, 838, 107 [NASA ADS] [CrossRef] [Google Scholar]
Sollima, A., Cacciari, C., & Valenti, E. 2006, MNRAS, 372, 1675 [NASA ADS] [CrossRef] [Google Scholar]
Sollima, A., Cacciari, C., Arkharov, A. A. H., et al. 2008, MNRAS, 384, 1583 [NASA ADS] [CrossRef] [Google Scholar]
Stan Development Team 2018, RStan: the R interface to Stan [Google Scholar]
Wright, E. L., Eisenhardt, P. R. M., Mainzer, A. K., et al. 2010, AJ, 140, 1868 [NASA ADS] [CrossRef] [Google Scholar]
Zinn, R., & West, M. 1984, ApJS, 55, 45 [NASA ADS] [CrossRef] [Google Scholar]

All Tables

Table 1.

Prior (π) definitions for the hierarchical Bayesian model of the PL(Z) relations.

In the text

Table 2.

Coefficients of the PM_KZ relations inferred from the measurements of simulations A, B, and C: slopes (c and k), zero-point (b), and intrinsic dispersion (w).

In the text

Table 3.

Coefficients of the PM_KZ relations inferred for the simulated scenarios A and B by an HM using the 3D GM prior and two 1D parallax priors (the log-normal prior used in Paper I and the EDVD prior of Bailer-Jones 2015): slopes (c and k), zero-point (b), and intrinsic dispersion (w).

In the text

Table 4.

Coefficients of the PM_KZ relation inferred from the real RRL sample described in Sect. 2: slopes (c and k), zero-point (b), intrinsic dispersion (w), global parallax offset (ϖ₀), and LMC distance modulus (μ_LMC).

In the text

Table 5.

Summary statistics corresponding to the sensitivity analysis performed to the HM with joint 3D GM prior presented in this paper: slopes (c and k), zero-point (b), intrinsic dispersion (w), and global parallax offset (ϖ₀).

In the text

All Figures

	Fig. 1. Directed acyclic graph that represents the forward model used to infer the PLZ relation coefficients when the prior of true metallicities, logarithm of true periods, and (natural) logarithm of true parallaxes is assumed to be a 3D Gaussian mixture distribution.
In the text

	Fig. 2. Observed absolute magnitudes as a function of the true decadic logarithm of the period for simulations A and B. The red solid line represents the projection of the PM_{K_s}Z relation of Eq. (13) for a value of the true metallicity equal to the median of the values generated according to the text. Colours encode the simulated true parallaxes according to the logarithmic scale on the right.
In the text

Fig. 3.

Scatter plot of observed metallicities and log(P) for validation sets A and B and the real sample used in the paper. The black crosses represent the first (Q₁) and third (Q₃) quartile of the distribution of measured log(P) and [Fe/H] for simulations A and B (top and middle panel) and the real sample (bottom panel). The colour encodes the natural logarithm of true parallaxes of simulations A and B (top panel), measured parallaxes of simulation B (middle panel), and TGAS parallax estimates (bottom panel).

In the text

Fig. 4.

Projection of a two-component 3D GM PDF fitted to the true values of ([Fe/H], log P, lnϖ) in data set B onto the plane period-parallax. The grey contours represent iso-probability lines. The coloured and grey points represent true and measured parallaxes, respectively. The black crosses indicate the median of the mean posterior distribution of each GM component. The colour encodes the metallicity.

In the text

	Fig. 5. Comparison between inferred and true parallaxes for validation sets A and B. The solid lines represent the bisectors and the colour encodes the value of the decadic logarithm of the period in days according to the colour scale on the right.
In the text

Fig. 6.

Comparison between inferred absolute magnitudes and true periods for validation sets A and B. The black solid and red dashed lines represent the projections of the PM_KZ relation of Eq. (13) and the relation inferred by our HM (adopting the median value of the posterior distribution of each coefficient), respectively, for a value of the true metallicity equal to the median of the values generated according to the text. The colour encodes the metallicity according to the scale on the right.

In the text

	Fig. 7. Comparison between inferred and true parallaxes for validation sets A and B when an HM with an EDVD prior is used. The colour encodes the value of the decadic logarithm of the period in days according to the colour scale on the right.
In the text

Fig. 8.

Marginal posterior distributions from the MCMC samples of the PM_KZ relationship parameters for our HM with the potential TGAS parallax offset included as a model parameter. The diagonal shows the uni-dimensional marginal distributions for the slope of the log(P) term in the linear relation, the slope of the metallicity term, the intercept, and the intrinsic dispersion of the relationship. Green and red lines point towards the median and the MAP estimate, respectively, of each 1D posterior marginal distribution.

In the text

Fig. 9.

Comparison between the TGAS parallaxes and the maximum a posteriori estimates from the HM. The error bars correspond to the TGAS parallax uncertainties (horizontal) and credible intervals calculated as the median plus minus the difference between the median and the 84th and 16th percentile (vertical). The red circles correspond to stars with parallax difference beyond twice the combined uncertainties.

In the text

Fig. 10.

Left panel: samples of the PLZ relations derived from the MCMC samples for [Fe/H] = −1.46 (the median of the distribution of inferred metallicities; grey lines) and period-M_K values inferred by the HM and computed according to Eq. (16). Right panel: as in the left panel, but with M_K computed according to Eq. (17). The colour encodes the inferred metallicity according to the scale on the right.

In the text

	Fig. 11. PLZ relations defined by the MCMC samples for the value of [Fe/H] = −1.46 (dark grey lines) and the measured periods, and absolute magnitudes predicted by using the ABL method described in the text. The red line represents the projection of the PLZ relation used for predictions for a value of the metallicity equal to its median in the sample.
In the text

Fig. 12.

Spatial distribution and metallicity of the RR Lyrae sample represented in a Galactocentric reference frame. r_GC and z denote the radial distance to the GC and the vertical distance with regard to the Galactic midplane x − y, respectively. The Cartesian Galactocentric coordinates and the radial distances to the GC have been summarised by the median plus minus the difference in absolute value between the median and the 84th and 16th percentile of the posterior samples of Eqs. (19) and 20.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Arenou, F., & Luri, X. 1999, in Harmonizing Cosmic Distance Scales in a Post-Hipparcos Era, eds. D. Egret, & A. Heck, ASP Conf. Ser., 167, 13 [NASA ADS] [Google Scholar]

[2] Arenou, F., Luri, X., Babusiaux, C., et al. 2017, A&A, 599, A50 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[3] Arenou, F., Luri, X., Babusiaux, C., et al. 2018, A&A, 616, A17 [Google Scholar]

[4] Bailer-Jones, C. A. L. 2015, PASP, 127, 994 [NASA ADS] [CrossRef] [Google Scholar]

[5] Bono, G., Caputo, F., Castellani, V., et al. 2003, MNRAS, 344, 1097 [NASA ADS] [CrossRef] [Google Scholar]

[6] Borissova, J., Rejkuba, M., Minniti, D., Catelan, M., & Ivanov, V. D. 2009, A&A, 502, 505 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[7] Cacciari, C., & Clementini, G. 2003, in Stellar Candles for the Extragalactic Distance Scale, eds. D. Alloin, & W. Gieren (Berlin: Springer Verlag), Lect. Notes Phys., 635, 105 [NASA ADS] [CrossRef] [Google Scholar]

[8] Cardelli, J., Clayton, G., & Mathis, J. 1989, ApJ, 345, 245 [NASA ADS] [CrossRef] [Google Scholar]

[9] Carpenter, B., Gelman, A., Hoffman, M., et al. 2017, J. Stat. Soft., 76, 1 [Google Scholar]

[10] Casertano, S., Riess, A. G., Bucciarelli, B., & Lattanzi, M. G. 2017, A&A, 599, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[11] Catelan, M., Pritzl, B., & Smith, H. 2004, ApJS, 154, 633 [Google Scholar]

[12] Clementini, G., Gratton, R., Bragaglia, A., et al. 2003, AJ, 125, 1309 [NASA ADS] [CrossRef] [Google Scholar]

[13] Clementini, G., Ripepi, V., Leccia, S., et al. 2016, A&A, 595, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[14] Clementini, G., Ripepi, V., Molinaro, R., et al. 2019, A&A, 622, A60 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[15] Dambis, A. K., Berdnikov, L. N., Kniazev, A. Y., et al. 2013, MNRAS, 435, 3206 [NASA ADS] [CrossRef] [Google Scholar]

[16] Freedman, W. L., Madore, B. F., Gibson, B. K., et al. 2001, ApJ, 553, 47 [NASA ADS] [CrossRef] [Google Scholar]

[17] Gaia Collaboration (Clementini, G., et al.) 2017, A&A, 605, A79 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[18] Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. 2004, Bayesian Data Analysis (Boca Raton: Chapman& Hall/CRC) [Google Scholar]

[19] Gelman, A., & Hill, J. 2007, Data Analysis Using Regression and Multilevel/Hierarchical Models, Analytical Methods for Social Research (Cambridge: Cambridge University Press) [Google Scholar]

[20] Gelman, A., Jakulin, A., Pittau, M. G., & Su, Y.-S. 2008, Ann. Appl. Stat., 2, 1360 [CrossRef] [Google Scholar]

[21] Gieren, W., Górski, M., Pietrzyński, G., et al. 2013, ApJ, 773, 69 [NASA ADS] [CrossRef] [Google Scholar]

[22] Gillessen, S., Eisenhauer, F., Trippe, S., et al. 2009, ApJ, 692, 1075 [NASA ADS] [CrossRef] [Google Scholar]

[23] Gratton, R. G., Bragaglia, A., Clementini, G., et al. 2004, A&A, 421, 937 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[24] Hajdu, G., Jurcsik, J., & Sodor, A. 2009, Inf Bull. Variab. Stars, 5882 [Google Scholar]

[25] Hoffman, M. D., & Gelman, A. 2014, J. Mach. Learn. Res., 15, 1593 [Google Scholar]

[26] Jurić, M., Ivezić, Ž., Brooks, A., et al. 2008, ApJ, 673, 864 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]

[27] Lauritzen, S. 1996, Graphical Models (Oxford: Oxford University Press) [Google Scholar]

[28] Leavitt, H. S., & Pickering, E. C. 1912, Harv. Coll. Obs. Circ., 173, 1 [Google Scholar]

[29] Lewandowski, D., Kurowicka, D., & Joe, H. 2009, J. Multivar. Anal., 100, 1989 [CrossRef] [Google Scholar]

[30] Lindegren, L., Lammers, U., Bastian, U., et al. 2016, A&A, 595, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[31] Longmore, A. J., Fernley, J. A., & Jameson, R. F. 1986, MNRAS, 220, 279 [NASA ADS] [Google Scholar]

[32] Luri, X., Brown, A. G. A., Sarro, L. M., et al. 2018, A&A, 616, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[33] Madore, B. F., & Freedman, W. L. 1991, PASP, 103, 933 [NASA ADS] [CrossRef] [Google Scholar]

[34] Marconi, M., Musella, I., & Fiorentino, G. 2005, ApJ, 632, 590 [NASA ADS] [CrossRef] [Google Scholar]

[35] Marconi, M., Coppola, G., Bono, G., et al. 2015, ApJ, 808, 50 [NASA ADS] [CrossRef] [Google Scholar]

[36] Muraveva, T., Palmer, M., Clementini, G., et al. 2015, ApJ, 807, 127 [NASA ADS] [CrossRef] [Google Scholar]

[37] Muraveva, T., Delgado, H. E., Clementini, G., Sarro, L. M., & Garofalo, A. 2018a, MNRAS, 481, 1195 [NASA ADS] [CrossRef] [Google Scholar]

[38] Muraveva, T., Garofalo, A., Scowcroft, V., et al. 2018b, MNRAS, 480, 4138 [NASA ADS] [CrossRef] [Google Scholar]

[39] Neeley, J. R., Marengo, M., Bono, G., et al. 2017, ApJ, 841, 84 [NASA ADS] [CrossRef] [Google Scholar]

[40] Pearl, J. 1988, Probabilistic Reasoning in Intelligent Systems: Networks of Plausble Inference (Burlington: Morgan Kaufmann Pub) [Google Scholar]

[41] Preston, G. 1959, ApJ, 130, 507 [NASA ADS] [CrossRef] [Google Scholar]

[42] R Core Team. 2017, R: A Language and Environment for Statistical Computing (Vienna, Austria: R Foundation for Statistical Computing) [Google Scholar]

[43] Riess, A. G., Macri, L., Casertano, S., et al. 2011, ApJ, 730, 119 [NASA ADS] [CrossRef] [Google Scholar]

[44] Riess, A. G., Macri, L. M., Hoffmann, S. L., et al. 2016, ApJ, 826, 56 [NASA ADS] [CrossRef] [Google Scholar]

[45] Riess, A. G., Casertano, S., Yuan, W., et al. 2018, ApJ, 861, 126 [Google Scholar]

[46] Ripepi, V., Moretti, M. I., Marconi, M., et al. 2012, MNRAS, 424, 1807 [NASA ADS] [CrossRef] [Google Scholar]

[47] Robert, C., & Casella, G. 2013, Monte Carlo Statistical Methods, Springer Texts in Statistics (New York: Springer) [Google Scholar]

[48] Saha, A., Thim, F., Tammann, G. A., Reindl, B., & Sandage, A. 2006, ApJS, 165, 108 [NASA ADS] [CrossRef] [Google Scholar]

[49] Schlegel, D., Finkbeiner, D., & Davis, M. 1998, ApJ, 500, 525 [NASA ADS] [CrossRef] [Google Scholar]

[50] Sesar, B., Fouesneau, M., Price-Whelan, A. M., et al. 2017, ApJ, 838, 107 [NASA ADS] [CrossRef] [Google Scholar]

[51] Sollima, A., Cacciari, C., & Valenti, E. 2006, MNRAS, 372, 1675 [NASA ADS] [CrossRef] [Google Scholar]

[52] Sollima, A., Cacciari, C., Arkharov, A. A. H., et al. 2008, MNRAS, 384, 1583 [NASA ADS] [CrossRef] [Google Scholar]

[53] Stan Development Team 2018, RStan: the R interface to Stan [Google Scholar]

[54] Wright, E. L., Eisenhardt, P. R. M., Mainzer, A. K., et al. 2010, AJ, 140, 1868 [NASA ADS] [CrossRef] [Google Scholar]

[55] Zinn, R., & West, M. 1984, ApJS, 55, 45 [NASA ADS] [CrossRef] [Google Scholar]