A method to deconvolve stellar rotational velocities

R. Orellana; P. Escárate; M. Curé; A. Christen; R. Carvajal; J. C. Agüero

doi:10.1051/0004-6361/201833455

Home

All issues

Volume 623 (March 2019)

A&A, 623 (2019) A138

Full HTML

Free Access

Issue		A&A Volume 623, March 2019


Article Number		A138
Number of page(s)		11
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201833455
Published online		20 March 2019

A&A 623, A138 (2019)

III. The probability distribution function via maximum likelihood utilizing finite distribution mixtures

R. Orellana¹^,2, P. Escárate³^,4^,5, M. Curé⁶, A. Christen⁷, R. Carvajal¹ and J. C. Agüero¹

¹ Electronics Engineering Department, Universidad Técnica Federico Santa María, Valparaíso, Chile
² Universidad de Los Andes, Mérida, Venezuela
³ Instituto de Electricidad y Electrónica, Facultad de Ciencias de la Ingeniería, Universidad Austral, Valdivia, Chile
e-mail: pedro.escarate@uach.cl
⁴ Large Binocular Telescope Observatory, Steward Observatory, Tucson, AZ 85546, USA
⁵ Núcleo Milenio de Formación Planetaria – NPF, Chile
⁶ Instituto de Física y Astronomía, Universidad de Valparaíso, Chile
⁷ Instituto de Estadística, Pontificia Universidad Católica de Valparaíso, Chile

Received: 18 May 2018
Accepted: 16 February 2019

Abstract

Aims. The study of accurate methods to estimate the distribution of stellar rotational velocities is important for understanding many aspects of stellar evolution. From such observations we obtain the projected rotational speed (v sin i) in order to recover the true distribution of the rotational velocity. To that end, we need to solve a difficult inverse problem that can be posed as a Fredholm integral of the first kind.

Methods. In this work we have used a novel approach based on maximum likelihood (ML) estimation to obtain an approximation of the true rotational velocity probability density function (PDF) expressed as a sum of known distribution families. In our proposal, the measurements have been treated as random variables drawn from the projected rotational velocity PDF. We analyzed the case of Maxwellian sum approximation, where we estimated the parameters that define the sum of distributions.

Results. The performance of the proposed method is analyzed using Monte Carlo simulations considering two theoretical cases for the PDF of the true rotational stellar velocities: (i) an unimodal Maxwellian probability density distribution and (ii) a bimodal Maxwellian probability density distribution. The results show that the proposed method yielded more accurate estimates in comparison with the Tikhonov regularization method, especially for small sample length (N = 50). Our proposal was evaluated using real data from three sets of measurements, and our findings were validated using three statistical tests.

Conclusions. The ML approach with Maxwellian sum approximation is a accurate method to deconvolve the rotational velocity PDF, even when the sample length is small (N = 50).

Key words: stars: rotation / stars: fundamental parameters / methods: numerical / methods: data analysis / methods: statistical / methods: analytical

© ESO 2019

1. Introduction

The estimation of the probability distribution of rotational velocities of stars is essential to describe and model many aspects of stellar evolution. From observations, it is only possible to obtain the projected velocity, (v sin i), where i is the inclination angle with respect the line of sight and v is the true (non-projected) rotational velocity. These measurements are assumed to be realizations of a random variable drawn from a probability density function (PDF), f_Y(y|β), that satisfies the following (see e.g., Curé et al. 2014):

$\begin{matrix} f_{Y} (y | β) = \int p (y | x) f_{X} (x | β) d x, \end{matrix}$ $\begin{aligned} f_{Y}({ y}|\beta ) = \int p({ y}|x)f_{X}(x|\beta )\,\mathrm{d}x, \end{aligned}$ (1)

where the unknown function, f_X(x|β), appears under the integral sign. In our problem of interest, f_Y(y|β) is the PDF of the available measurements, f_X(x|β) is the unknown distribution, p(y|x) is the conditional distribution of the projected angles and β is a parameter vector that defines the marginal distributions.

Typically, f_X(x|β) is solved from the integral equation Eq. (1) utilizing data y = (y₁, …, y_N) to estimate f_Y(y|β). This corresponds to a standard solution of the Fredholm equation Eq. (1) (see Chandrasekhar & Münch 1950; Lucy 1974). There are several methods that deal with the Fredholm integral, see for example (Yalçinbaş & Aynigül 2011; Shirin & Islam 2013; Alipanah & Esmaeili 2011). These methods solve the integral equation by approximating the unknown function in the integral via basis functions or polynomials. However, the unknown function in our problem in Eq. (1) is a PDF. Hence, the solution of the Fredholm equation must satisfy

$\begin{matrix} f_{X} (x | β) > 0, \forall x \land \int_{- \infty}^{\infty} f_{X} (x | β) d x = 1, \end{matrix}$ $\begin{aligned} f_{X}(x|\beta )>0, \, \forall x \;\;\wedge \;\; \int _{-\infty }^\infty f_{X}(x|\beta )\,\mathrm{d}x = 1, \end{aligned}$ (2)

which is not always the case for basis functions and polynomials. Therefore, it is necessary to develop mathematical methods to actually deconvolve the measured projected velocity in order to determine the true PDF of the rotational velocity. In addition, it is necessary to directly estimate the PDF for easy handling and for the analysis of important properties of the distribution (e.g., mean, mode, kurtosis, etc.).

A standard assumption used for p(y|x) in the deconvolution problem in Eq. (1) is that the stellar rotational axes are uniformly distributed over the unit sphere. Using this assumption, Chandrasekhar & Münch (1950) studied the Fredholm integral (Eq. (1)) that describes the probability distribution of true (v) and apparent (v sin i) rotational velocities, obtaining a formal solution to it that is proportional to the derivative of an Abel’s integral. Nevertheless, this method is not usually applied, because the differentiation of the formal solution can yield misleading results due to numerical problems related to the derivative of the Abel’s integral. To circumvent this problem, in Curé et al. (2014) the cumulative distribution function (CDF) from a set of samples of projected rotational velocities, (v sin i), was obtained using a novel single step method. Although the CDF provides information of the distribution of the speed of rotation, it is necessary to obtain the PDF for easy handling and for the direct estimation of certain properties of the distribution (e.g., the maximum, its symmetry, etc.).

On the other hand, in Christen et al. (2016) a method to estimate the PDF from the Fredholm integral (Eq. (1)) by means of the Tikhonov regularization method (TRM) was obtained. Even though regularization methods are techniques widely used to deconvolve inverse problems such as image processing, geophysics and machine learning (see Bouhamidi & Jbilou 2007; Fomel 2007; Deng et al. 2013), they do not guarantee that the solution is a PDF.

In this paper we propose a novel method to obtain the maximum likelihood (ML) estimate of the parameters that define the PDF of the rotational velocity written as a Maxwellian Sum Approximation (MSA). The main idea is based on Carvajal et al. (2018), where a general estimation algorithm is developed using data augmentation. In this work we specialize the proposal in Carvajal et al. (2018) for a MSA to estimate f_X(x|β) from Eq. (1).

The main departure from Christen et al. (2016) and the standard solutions for the integral equation is that in our approach, the measurements are treated as realizations of the PDF that defines the projected rotational velocity instead of evaluations of the PDF. Thus, the observed samples are used as realizations of a known parametric random variable (y), belonging to a known parametric family that is a mixture (see Eq. (1)). We use the ML method (estimators with good statistical properties) to estimate the parameters, which are calculated without directly estimating f_Y(y). Other methods need to perform an approximation in order to use a Kernel density estimator (KDE) to estimate the projected rotational speed density (see e.g., Curé et al. 2014 or Christen et al. 2016). This implies that, since we obtain the ML estimates of the parameters that define the unknown PDF, our solution is, in general, well-conditioned (more data points than unknown parameters). This is due to the fact that, in general, the length of samples will be greater than the number of unknown parameters (e.g., four parameters for a Maxwellian sum approximation with two terms).

This article is structured as follows: in Sect. 2 the problem of interest is described. In Sect. 3 we provide the mathematical description of the method for the attainment of the ML estimate of the stellar rotational velocities using MSA. In Sect. 4, Monte Carlo simulations are presented to show the benefits of this method, comparing the results with TRM algorithm proposed by Christen et al. (2016). In Sect. 5, real samples of a stellar cluster are deconvolved using the ML estimation algorithm proposed in this work. A comparison between the estimations from our method and TRM is presented. In the final section, our conclusions and future work are shown. Finally, the proofs of our results and the details of the expressions that explain the development of this work are shown in the appendix.

2. Maximum likelihood approach

Many inverse problems in physics and astronomy are given in terms of the Fredholm integral (Eq. (1)) of the first kind (Lucy 1974; Hansen 2010). In Curé et al. (2014) a method to deconvolve the inverse problem given by the Fredholm integral, Eq. (1) is developed, obtaining the CDF for stellar rotational velocities extending the work of Chandrasekhar & Münch (1950). Assuming an uniform distribution of stellar axes over the unit sphere, this integral equation reads as follows (see Curé et al. 2014 for more details):

$\begin{matrix} f_{Y} (y | β) = \int_{y}^{\infty} \underset{p (y | x)}{\underset{⏟}{\frac{y}{x \sqrt{x^{2} - y^{2}}}}} f_{X} (x | β) d x, \end{matrix}$ $\begin{aligned} f_{Y}({ y}|\beta ) = \int _{{ y}}^{\infty }\underbrace{\frac{{ y}}{x\sqrt{x^2-{ y}^2}}}_{p({ y}|x)}f_{X}(x|\beta )\,\mathrm{d}x, \end{aligned}$ (3)

where x = v, is the true rotational speed, y = xsini, is the projected rotational speed, and i, is the (unknown) inclination angle. Furthermore, f_Y(y|β) represents the PDF of projected rotational velocities and f_X(x|β) is the density of true rotational velocities. The function p(y|x) in this integral is related to the distribution of projected angles (Curé et al. 2014). We note that the conditional distribution p(y|x) in Eq. (3) is valid for an isotropic stellar rotational axes distribution.

The distribution of rotational velocities has been studied in the literature, providing strong evidence for the occurrence of Gaussian and Maxwellian distributions in astrophysical systems. Deutsch (1970) and Gaigé (1993) proved (analytically) that a Maxwellian distribution corresponds to the rotational speed distribution when the distribution of stellar axes is uniform distributed over the unit sphere (random axes orientations). The work in Chandrasekhar & Münch (1950) revived the suggestion of van Diem that proposed a double Gaussian distribution to describe the rotational speed of stars, presenting an analysis for B to G0 stars in the Pleiades cluster. For non-Gaussian statistics, Carvalho et al. (2009) prove that the Tsallis distribution (with the k parameter) corresponds to the distribution of rotational velocities, and in the limit case when k → 0, the Maxwellian distribution is recovered. In addition, it is known that for a similar problem, the mass distribution of exoplanets is described by a mixture of two Gaussian distributions in the logarithm of the planet mass (see e.g., Malhotra 2015). However, the distribution f_X(x|β) is usually unknown (not just the parameters, but also the parametric family). In such cases, an adequate alternative is to approximate the PDF using a distribution sum approximation (DSA) with a known and suitable PDF as a basis. This concept arises from the Wiener approximation theorem Achieser (1956), which, in simple terms, states that any function 𝔣(⋅) can be approximated by a linear combination of translations of another given function, 𝔤(⋅), if the Fourier transform of 𝔤(⋅) is not equal to zero in the domain of interest. A common choice for a basis to form a DSA is the Gaussian distribution Alspach & Sorenson (1972), which satisfies the Wiener approximation theorem. In our problem, we consider a Maxwellian sum approximation, since it has been suggested that the distribution of stellar rotational velocities can be modelled by a Maxwellian distribution (Chandrasekhar & Münch 1950; Deutsch 1970; Gaigé 1993).

We note that an MSA does not approximate every PDF, unless a translation term is included in the Maxwellian distribution. Nevertheless, as shown in the following sections, a MSA, even without translation, may provide an adequate fit to the PDF of the stellar rotational velocities.

The unknown true rotational velocities distribution function can be expressed as the following DSA:

$\begin{matrix} f_{X} (x | β) \approx \sum_{j = 1}^{K} λ_{j} g (x | θ_{j}), \end{matrix}$ $\begin{aligned} f_{X}(x|\beta )\approx \sum _{j=1}^{K}\lambda _j { g}(x|\theta _j), \end{aligned}$ (4)

where g(x|θ_j) represents a PDF characterized by the parameter θ_j, where λ_j is the weight of the corresponding jth distribution and K represents the number of distributions used to approximate the PDF subject to $\sum_{j = 1}^{K} λ_{j} = 1$ $\sum_{j=1}^{K}\lambda_j=1$ . Thus, the parameter vector to estimate is given by

$\begin{matrix} β = [\underset{β_{1}}{\underset{⏟}{λ_{1}, θ_{1}}}, \dots, \underset{β_{K}}{\underset{⏟}{λ_{K}, θ_{K}}}] . \end{matrix}$ $\begin{aligned} \beta = [\underbrace{\lambda _1, \theta _1}_{\beta _1},\dots , \underbrace{\lambda _K, \theta _K}_{\beta _K}]. \end{aligned}$ (5)

If we assume that the available data y_t (t = 1, …, N) are independent and identically distributed random variables, we obtain for y = (y₁, …, y_N):

$\begin{matrix} f_{Y} (y | β) = \prod_{t = 1}^{N} f_{Y} (y_{t} | β), \end{matrix}$ $\begin{aligned} f_{Y}({ y}|\beta ) = \prod _{t=1}^N f_{Y}({ y}_t|\beta ), \end{aligned}$ (6)

where

$\begin{matrix} f_{Y} (y_{t} | β) & = \sum_{j = 1}^{K} λ_{j} f_{j} (y_{t} | θ_{j}), \end{matrix}$ $\begin{aligned} f_{Y}({ y}_t|\beta )&=\sum _{j=1}^{K}\lambda _j f_j({ y}_t|\theta _j), \end{aligned}$ (7)

$\begin{matrix} f_{j} (y_{t} | θ_{j}) & = \int_{y_{t}}^{\infty} p (y_{t} | x_{t}) g (x_{t} | θ_{j}) d x_{t} . \end{matrix}$ $\begin{aligned} f_j({ y}_t|\theta _j)&=\int _{{ y}_t}^{\infty } p({ y}_t|x_t){ g}(x_t|\theta _j)\,\mathrm{d}x_t. \end{aligned}$ (8)

Then, the likelihood function can be expressed as

$\begin{matrix} L_{N} (β) = f_{Y} (y_{1 : N} | β) = \prod_{t = 1}^{N} \sum_{j = 1}^{K} λ_{j} f_{j} (y_{t} | θ_{j}) . \end{matrix}$ $\begin{aligned} \mathcal{L} _N(\beta )=f_{Y}({ y}_{1:N}|\beta ) =\prod _{t=1}^{N}\sum _{j=1}^{K}\lambda _jf_j({ y}_t|\theta _j). \end{aligned}$ (9)

The log-likelihood function ℓ_N(β)=log(L_N(β)) reads

$\begin{matrix} ℓ_{N} (β) = \sum_{t = 1}^{N} log [\sum_{j = 1}^{K} λ_{j} f_{j} (y_{t} | θ_{j})] . \end{matrix}$ $\begin{aligned} \ell _N(\beta )=\sum _{t=1}^{N}\log \left[\sum _{j=1}^{K}\lambda _j f_j({ y}_t|\theta _j)\right]. \end{aligned}$ (10)

We note that the expression in Eq. (8) can be understood as a generalized PDF (see e.g., Degroot 2004). We develop our estimation algorithm in the following section based on this interpretation.

3. Maximum likelihood estimation using Maxwellian sum approximation

In this section we develop the ML estimation algorithm when modelling f_X(x|β) as a Maxwellian sum approximation (MSA). This work is an extension of the estimation procedure proposed in Carvajal et al. (2018) and in Orellana et al. (2018). In particular, in this work we solve the problem of interest (Eq. (3)) considering a discrete Maxwellian mixture.

In Carvajal et al. (2018) an estimation algorithm was shown for a general class of problems with data augmentation, based on the Expectation–Maximization (EM) algorithm (Dempster et al. 1977). They consider a general optimization problem that can be tailored to solve the problem in Eq. (3) to estimate the parameters that define the approximation of the true rotational velocities (e.g: MSA). Inspired by the EM algorithm, an optimization problem using an auxiliary function is defined, and then an iterative algorithm is obtained.

It is worth noting that the estimation method we are proposing is a parametric one, based on the ML estimation principle, giving a set of parameters that define the PDF of true rotational velocities using a sum of K Maxwellian distributions, to describe a sample of data. This is an important difference with respect to the TRM proposed by Christen et al. (2016) where they present a non-parametric estimation approach; from the projected rotational velocities applying a KDE used by Silverman (1986) discretizing the Fredholm integral Eq. (3) and obtained the Tikhonov regularization solution.

For the special case of the distribution of a sample of stellar rotational speeds, (Deutsch 1970; Gaigé 1993) demonstrated that the density f_X(x|β) is given by a Maxwellian distribution with dispersion σ. In Appendix A we show the details of our MSA formulation. The optimization of the auxiliary function with respect to vector of parameters β is shown as follows.

Considering g(x|θ_j) as a Maxwellian PDF (Eq. (4)). The Maxwellian PDF is defined by

$\begin{matrix} ϕ_{M} (x ; σ_{j}) = \sqrt{\frac{2}{π}} \frac{x^{2}}{σ_{j}^{3}} exp {- \frac{x^{2}}{2 σ_{j}^{2}}}, \end{matrix}$ $\begin{aligned} \phi _{M}(x;\sigma _j)=\sqrt{\frac{2}{\pi }}\frac{x^2}{\sigma _j^3}\exp \left\{ -\frac{x^2}{2\sigma _j^2}\right\} , \end{aligned}$ (11)

where x > 0 and σ_j is the dispersion parameter that defines de PDF. Then, we can define the following function:

$\begin{matrix} K_{M} (x_{t}, β_{j}) = λ_{j} ϕ_{M} (x_{t} ; σ_{j}), \end{matrix}$ $\begin{aligned} K_{M}(x_t,\beta _j)=\lambda _j\phi _{M}(x_t;\sigma _j), \end{aligned}$ (12)

where β_j = {λ_j,σ_j}. The corresponding optimization problem is solved iteratively. Thus, assuming that the estimates of the parameters at the mth iteration are available, the vector of parameters ${\hat{β}}^{(m + 1)}$ $\hat{\beta}^{(m+1)}$ that optimize the auxiliary function (Eq. (A.11)) is now given by

$\begin{matrix} {\hat{λ}}_{j}^{(m + 1)} = \frac{P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{\sum_{l = 1}^{K} P_{M} (x_{t}, {\hat{β}}_{l}^{(m)})}, \end{matrix}$ $\begin{aligned}&\hat{\lambda }_j^{(m+1)}=\frac{\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}{\sum _{l=1}^{K}\mathcal{P} _M(x_t,\hat{\beta }_l^{(m)})}, \end{aligned}$ (13)

$\begin{matrix} {\hat{σ}}_{j}^{(m + 1)} = {[\frac{S_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{3 P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}]}^{\frac{1}{2}}, \end{matrix}$ $\begin{aligned}&\hat{\sigma }_j^{(m+1)}=\left[\frac{\mathcal{S} _M(x_t,\hat{\beta }_j^{(m)})}{3\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}\right]^{\frac{1}{2}}, \end{aligned}$ (14)

where

$\begin{matrix} P_{M} (x_{t}, {\hat{β}}_{j}^{(m)}) = \sum_{t = 1}^{N} \int_{y_{t}}^{\infty} \frac{K_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{M_{t}} ({\hat{β}}^{(m)})} d μ (x_{t}), \end{matrix}$ $\begin{aligned}&\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})=\sum _{t=1}^{N}\int _{{ y}_t}^{\infty }\frac{K_M\left(x_t,\hat{\beta }_j^{(m)}\right)}{\mathcal{V} _{M_t}\left(\hat{\beta }^{(m)}\right)}\,\mathrm{d}\mu (x_t), \end{aligned}$ (15)

$\begin{matrix} S_{M} (x_{t}, {\hat{β}}_{j}^{(m)}) = \sum_{t = 1}^{N} \int_{y_{t}}^{\infty} x_{t}^{2} \frac{K_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{M_{t}} ({\hat{β}}^{(m)})} d μ (x_{t}), \end{matrix}$ $\begin{aligned}&\mathcal{S} _M(x_t,\hat{\beta }_j^{(m)})=\sum _{t=1}^{N}\int _{{ y}_t}^{\infty }x_t^2\frac{K_M\left(x_t,\hat{\beta }_j^{(m)}\right)}{\mathcal{V} _{M_t}\left(\hat{\beta }^{(m)}\right)}\,\mathrm{d}\mu (x_t), \end{aligned}$ (16)

$\begin{matrix} V_{M_{t}} ({\hat{β}}^{(m)}) = \sum_{j = 1}^{K} \int_{y_{t}}^{\infty} K_{M} (x_{t}, {\hat{β}}_{j}^{(m)}) d μ (x_{t}) . \end{matrix}$ $\begin{aligned}&\mathcal{V} _{M_t}(\hat{\beta }^{(m)})=\sum _{j=1}^{K}\int _{{ y}_t}^{\infty }K_{M}(x_t,\hat{\beta }_j^{(m)})\,\mathrm{d}\mu (x_t). \end{aligned}$ (17)

In Appendix A we demonstrate the optimization procedure to obtain the MSA. We summarise the proposed iterative algorithm with MSA as follows:

(i)
Set a number of K distributions for MSA.
(ii)
Obtain an initial guess ${\hat{β}}_{j}^{(0)}$ $\hat{\beta}_j^{(0)}$ for j = 1, …, K.
(iii)
Set m = 0.
(iv)
Compute the integrals in Eqs. (15)–(17).
(v)
Compute ${\hat{β}}^{(m + 1)}$ $\hat{\beta}^{(m+1)}$ from Eqs. (13) and (14).
(vi)
Set m = m + 1 and go back to step iv) until convergence is achieved.

In iterative optimization algorithms, it is customary to define one or more stopping criteria, such as a maximum number of interactions and/or when the error reaches a certain value. In this work the stopping criterion is defined by the normalized relative error $‖ {\hat{β}}^{(m)} - {\hat{β}}^{(m - 1)} ‖ / ‖ {\hat{β}}^{(m)} ‖$ ${\Vert\hat{\beta}^{(m)}-\hat{\beta}^{(m-1)}\Vert}/{\Vert\hat{\beta}^{(m)}\Vert}$ reaching the arbitrary tolerance of 10⁻⁶. That is,

$\begin{matrix} \frac{‖ {\hat{β}}^{(m)} - {\hat{β}}^{(m - 1)} ‖}{‖ {\hat{β}}^{(m)} ‖} < 10^{- 6} . \end{matrix}$ $\begin{aligned} \frac{\Vert \hat{\beta }^{(m)}-\hat{\beta }^{(m-1)}\Vert }{\Vert \hat{\beta }^{(m)}\Vert } < 10^{-6}. \end{aligned}$ (18)

We note that it is possible to utilize a Gaussian sum approximation instead of a MSA. In fact, it was shown in Lo (1972) that any PDF can be approximated as closely as desired by a density of the form of Gaussian sum approximation for some finite components of the mixture. However, in our experience, for the problem of interest (true rotational velocities), we need a large number of Gaussian components to obtain an adequate agreement with the true distributions, avoiding the truncation of the estimated PDF for lower velocities. Hence, we focus on utilizing a MSA, comparing the performance of our proposal against TRM method proposed by Christen et al. (2016).

4. Monte Carlo simulations

In this section we present the results of Monte Carlo (MC) numerical simulations to assess the performance of ML estimation using MSA. A comparison is made using the TRM proposed by Christen et al. (2016). We simulated the following cases:

(a) Unimodal distribution: we chose the density of true rotational velocities f_M(x|σ)^(True) as

$\begin{matrix} f_{M} {(x | σ)}^{(True)} = \sqrt{\frac{2}{π}} \frac{x^{2}}{σ^{3}} exp {- \frac{x^{2}}{2 σ^{2}}}, x > 0 \end{matrix}$ $\begin{aligned} f_{\text{ M}}(x|\sigma )^{\text{(True)}}=\sqrt{\frac{2}{\pi }}\frac{x^2}{\sigma ^3}\exp \left\{ -\frac{x^2}{2\sigma ^2}\right\} ,\;\;\;\;\;\;\;\;x>0 \end{aligned}$ (19)

where dispersion parameter σ = 8, which is the same distribution used in Curé et al. (2014) and Christen et al. (2016).

(b) Bimodal distribution: for a mixture of two Maxwellian distributions, the PDF reads

$\begin{matrix} f_{2M} {(x | β)}^{(True)} & = \sqrt{\frac{2}{π}} x^{2} (\frac{λ_{1}}{σ_{1}^{3}} exp {- \frac{x^{2}}{2 σ_{1}^{2}}} \\ + \frac{λ_{2}}{σ_{2}^{3}} exp {- \frac{x^{2}}{2 σ_{2}^{2}}}), x > 0 \end{matrix}$ $\begin{aligned} f_{\text{2M}}(x|\beta )^{\text{(True)}}&=\sqrt{\frac{2}{\pi }}x^2\left( \frac{\lambda _1}{\sigma _1^3}\exp \left\{ -\frac{x^2}{2\sigma _1^2}\right\} \right. \nonumber \\&\quad \left.+\frac{\lambda _2}{\sigma _2^3}\exp \left\{ -\frac{x^2}{2\sigma _2^2}\right\} \right) ,\qquad x>0 \end{aligned}$ (20)

where β = [λ₁, σ₁, λ₂, σ₂], the dispersion parameters are: σ₁ = 5 and σ₂ = 15, and the mixing weights are λ₁ = 0.7 and λ₂ = 0.3, which corresponds to the same distribution used in Christen et al. (2016).

In Curé et al. (2014) it is shown an explicit analytic solution from the Fredholm integral Eq. (3) for these cases, obtaining

$\begin{matrix} f_{M} {(y | σ)}^{(True)} & = \frac{y}{σ^{2}} exp {- \frac{y^{2}}{2 σ^{2}}}, \end{matrix}$ $\begin{aligned} f_{\text{ M}}({ y}|\sigma )^{\text{(True)}}&=\frac{{ y}}{\sigma ^2}\exp \left\{ -\frac{{ y}^2}{2\sigma ^2}\right\} , \end{aligned}$ (21)

$\begin{matrix} f_{2M} {(y | β)}^{(True)} & = λ_{1} \frac{y}{σ_{1}^{2}} exp {- \frac{y^{2}}{2 σ_{1}^{2}}} + λ_{2} \frac{y}{σ_{2}^{2}} exp {- \frac{y^{2}}{2 σ_{2}^{2}}} . \end{matrix}$ $\begin{aligned} f_{\text{2M}}({ y}|\beta )^{\text{(True)}}&=\lambda _1\frac{{ y}}{\sigma _1^2}\exp \left\{ -\frac{{ y}^2}{2\sigma _1^2}\right\} +\lambda _2\frac{{ y}}{\sigma _2^2}\exp \left\{ -\frac{{ y}^2}{2\sigma _2^2}\right\} . \end{aligned}$ (22)

The synthetic data y₁, y₂, …, y_N, for both cases, is generated using the Slice Sampler (see e.g., Neal 2003). The simulation setup is as follows:

(1)
Three sample lengths N = 50, 500, 2000 are considered.
(2)
The MSA algorithms with three values for the number of distributions K = 1, 2, 3 is considered. The stopping criterion is given by Eq. (18).
(3)
The TRM algorithm is set following the procedure in Christen et al. (2016).
(4)
The number of MC simulations is n_MC = 100.

4.1. Estimation using the Tikhonov regularization method

In order to estimate the performance of TRM algorithm with simulated data, we follow the procedure described in Christen et al. (2016) for each independent MC simulation considering the three sample lengths cases. The TRM algorithm performance is closely related with the PDF estimation of the projected rotational velocities using KDE Silverman (1986). They use a kernel distribution as a non-parametric representation of the PDF of projected velocities, whose behavior is sensitive to the smoothing of the function and the bandwidth (BW) value. This features control the smoothness of the resulting density distribution. Figure 1a shows the mean estimated PDFs of all MC simulations for unimodal distribution using the TRM algorithm. Similarly, Fig. 1b (bimodal distribution) exhibits the mean estimated PDFs. The gray-shaded regions (Figs. 1a and b) represent the surrounding area in which independent MC simulations lie. Also, the region between the red dashed-dotted lines corresponds to one standard deviation level of the all estimated PDFs. We note a variability in the PDF estimation for all cases, especially for small sample data lengths of order N = 50. In this sense, a different setting for KDE estimation (e.g., different smoothing of the function and different BW value) would be needed for each MC simulation to improve the PDF estimation, due to the fact that the bandwidth parameter controls the smoothness of the resulting density curve (see e.g., Silverman 1986).

Fig. 1.

Monte Carlo simulation results to true rotational velocities PDF estimation: panel a: using TRM algorithm for unimodal Maxwellian distribution. Panel b: using TRM algorithm for bimodal Maxwellian distribution. The black line represents the true rotational velocities PDF. The blue line represents the average of the estimated PDF using TRM. The region between the red dashed-dotted lines corresponds to one standard deviation level of the all estimated PDFs.

We note that for small sample lengths some MC simulations might not yield a PDF, since negative values could be obtained. In addition, the true rotational velocities that is more likely to occur is wrongly estimated by 2 − 3 km s⁻¹ for unimodal distribution and by 5 km s⁻¹ for the bimodal distribution. Despite of the above, the mean estimated PDFs show a good agreement with the true distributions (unimodal and bimodal distributions) for larger sample lengths (N = 500 and N = 2000). We also observe that the shaded regions decrease in area, implying that the estimates for the MC simulations exhibit a smaller variance.

4.2. Estimation using Maxwellian sum approximation

In order to evaluate the performance of the method that we propose, we considered a MSA with K = 1 for the unimodal and a MSA with K = 2 for bimodal cases. From our simulations, the weights of the MSA are λ_j ≈ 0, j ≥ 2, for the unimodal case, whilst for the bimodal PDF the same is true when j ≥ 3.

Figure 2a shows the true unimodal distribution (Eq. (19)) together with the mean estimated PDF of all MC simulations. The gray-shaded regions represent the surrounding area to which all independent MC simulations using the MSA algorithm lie in (Figs. 2a and b). Also, the region between the red dashed-dotted lines corresponds to one standard deviation level of the all estimated PDFs using our algorithm. For larger sample length (N = 2000), the MSA based estimation exhibits an excellent agreement between the true unimodal distribution and the mean estimated PDFs. Figure 2b shows the results for the bimodal Maxwellian distribution (Eq. (20)). On average, the agreement between the estimated PDF using MSA algorithm and the true distribution is very good, even for sample lengths of the order N = 50, since the mean estimated PDF is very similar to the true PDF. We note that the shaded regions show less variability of the estimated PDFs than TRM (Figs. 1a and b). For MSA, the variance of the estimates is higher for small sample lengths (N = 50) than large sample length (N = 500, 2000; see Figs. 2a and b). The initialization of the MSA estimation algorithm was set to a random initial guess. In our experience, this was good enough to provide good estimates, exhibiting good convergence properties. In Appendix B we show tables with the mean values and standard deviations of the estimated parameters using MSA. We observe that, in general, the estimated parameters are close to the true values for all sample lengths when the MSA algorithm is used, including sample lengths of order N = 50.

Fig. 2.

Monte Carlo simulation results to true rotational velocities PDF estimation: panel a: using K = 1 for MSA algorithm. Panel b: using K = 2 for MSA algorithm. The black line represents the true rotational velocities PDF. The blue line represents the average of the estimated PDF using MSA. The region between the red dashed-dotted lines corresponds to one standard deviation of the all estimated PDFs.

Following Curé et al. (2014) method to compare TRM and MSA algorithms, we calculate the mean integrated square error (MISE) to quantify the error of the estimated PDF, respect the true PDF, then we have:

$\begin{matrix} MISE = \frac{1}{n_{MC}} \sum_{j = 1}^{n_{MC}} (\frac{1}{N} \sum_{i = 1}^{N} {({\hat{f}}_{j} (x_{i} | \hat{β}) - f_{j} (x_{i} | β))}^{2}), \end{matrix}$ $\begin{aligned} \mathrm{MISE}=\frac{1}{n_{\rm {MC}}}\sum _{j=1}^{n_{\rm {MC}}}\left(\frac{1}{N}\sum _{i=1}^{N}\left(\hat{f}_j(x_i|\hat{\beta })-f_j(x_i|\beta )\right)^2\right), \end{aligned}$ (23)

where f_j(x_i|β) represents the true PDF of rotational speeds evaluated at the ith sample of the jth MC simulation, ${\hat{f}}_{j} (x_{i} | \hat{β})$ $\hat{f}_j(x_i|\hat \beta)$ is the estimated PDF using the MSA algorithm for the jth MC simulation evaluated at the ith sample, n_MC is the number of MC simulations and N is the sample length. In Appendix C we summarize the MISE values for the MC simulations for the MSA and TRM estimation. Table C.1 shows the MISE values for unimodal Maxwellian estimation (K = 1), where the MISE for MSA algorithm is in the order of MISE ≈ 10⁻⁶. In Table C.2 we show the MISE values for different sample lengths N for bimodal Maxwellian distribution estimation (K = 2). The MISE values that were obtained for MSA algorithm are all very similar, in the order of MISE ≈ 10⁻⁶. The MC simulation show that the TRM method and the MSA method have a similar average behavior for larger samples lengths N = 2000 (MISE ≈ 10⁻⁶), although for small sample length N = 50 the MSA method (MISE ≈ 10⁻⁶) exhibits a better estimation performance than the TRM method (MISE ≈ 10⁻⁵) in terms of average and variance. Details of the results are presented in Appendix C.

5. Deconvolving real samples

In this section, we perform the following: (i) apply the MSA algorithm to a sample of measured (v sin i) data of stars in order to estimate the PDF of the true rotational velocities; and (ii) compare the performance of this proposed algorithm with TRM proposed by Christen et al. (2016).

5.1. Coma Berenice sample

From the catalog Mermilliod et al. (2009), we select the Coma Berenice cluster (Melotte 111) data which has N = 60 values of v sin i > 0 from 0 km s⁻¹ up to 50 km s⁻¹ for F-K dwarf stars.

The estimated parameter using MSA algorithm with one Maxwellian distribution is σ₁ = 7.5675. In addition, the estimated parameters using two Maxwellian distributions (K = 2) are σ₁ = 2.7903, σ₂ = 11.9939, λ₁ = 0.6364 and λ₂ = 0.3636. Similarly, we obtain the results considering a three Maxwellian mixture distribution (K = 3), where the estimated parameters are σ₁ = 2.7902, σ₂ = 11.9934, σ₃ = 11.9954, λ₁ = 0.6364, λ₂ = 0.2810 and λ₃ = 0.0827. We can observe that the mixing weight of the third component (λ₃) of Maxwellian mixture distribution is close to zero. In this sense, for simplicity and clarity on the presentation, analyzed the case for which K = 2. In Fig. 3 (left panel) we show the rotational velocity distribution using MSA algorithm with two Maxwellian mixture distribution (K = 2; blue solid line), and using TRM (black dotted line). The main differences in the estimated PDFs are due to the non-parametric nature of TRM. In particular, important differences can be observed between 15 and 25 km s⁻¹ and around 30 km s⁻¹ since the KDE utilized in TRM approximates f_Y(y|β) as closely as possible to the histogram of the data. In addition, we note that when estimated PDF by TRM is used, has its maximum toward larger values of rotational speed. This could be due to a TRM error in detecting the most probable speed, as we have already observed in the numerical simulation (see Sect. 4) mainly for small number of samples.

Fig. 3.

Estimated PDF of true rotational velocities for real samples cases using K = 2 (Coma Berenice and Tarantula samples) and K = 3 (Geneva sample) for MSA algorithm. The TRM algorithm is setting following Christen et al. (2016). The blue line represents the MSA estimated PDF. The black dashed-dotted line represent the TRM estimated PDF.

5.2. Tarantula sample

We selected the Tarantula sample for single O-type stars form the VLT Flames Tarantula Survey presented in Ramírez-Agudelo et al. (2013), where the authors deconvolved the rotational velocity distribution using the method in Lucy (1974) and TRM. This sample contains 216 stars with v sin i data from 40 km s⁻¹ up to 610 km s⁻¹.

The estimated parameter using the MSA algorithm with one Maxwellian distribution (K = 1) is σ₁ = 131.39. The parameters estimated for the two Maxwellian mixture distribution (K = 2) are σ₁ = 64.54, σ₂ = 185.67, λ₁ = 0.568 and λ₂ = 0.432. The estimation results using three Maxwellian mixture distribution (K = 3) are σ₁ = 64.54, σ₂ = 185.67, σ₃ = 64.54, λ₁ = 0.568, λ₂ = 0.432 and λ₃ = 8.03 × 10⁻¹⁷. It is evident that the mixing weight of the third component (λ₃) is not relevant in the Maxwellian mixture distribution model. In this sense, we focus on the results we obtained with K = 2. Figure 3, middle panel, shows the rotational velocity distribution using MSA algorithm with K = 2 (blue solid line) and the estimation using TRM (black dotted line). Similarly to the results from Coma Berenice sample, the estimated PDFs are similar, but exhibit some differences, particularly around 380–480 km s⁻¹, due to the use of KDE in TRM.

5.3. Geneva sample

We selected a large sample data of measured v sin i data of the Geneva-Copenhagen survey of the solar neighborhood (Nordström et al. 2004; Holmberg et al. 2007), which contains information about 16.500 F and G main-sequence field stars. We observed that this data sample presents important uncertainties showing velocities of 0 km s⁻¹, in consequence, we only selected stars with 0 < v sin i ≤ 30 km s⁻¹, obtaining a sample of 11 685 stars.

The estimated parameter using MSA algorithm with one Maxwellian distribution is σ₁ = 7.13. Additionally, the estimated parameters for K = 2 are σ₁ = 3.32, σ₂ = 10.26, λ₁ = 0.578 and λ₂ = 0.422. The estimation results using three Maxwellian mixture distribution are σ₁ = 4.21, σ₂ = 10.66, σ₃ = 2.41, λ₁ = 0.40, λ₂ = 0.37 and λ₃ = 0.23. In this case, we focus on the estimation obtained with three mixture components. Figure 3, right panel, shows the rotational velocity distribution using MSA algorithm with three Maxwellian mixture distribution (blue solid line) and TRM (black dotted line). In this case, we obtain an important agreement between the MSA estimation with K = 3 and the TRM algorithm estimation, except for the interval between 25 and 35 km s⁻¹.

5.4. Comparison of the estimation results

In this section we describe our analysis of the estimation of the true rotational velocities for three data sets: Coma Berenice, Tarantula and Geneva, see Fig. 3. In addition, we considered the following: from the estimated true PDF of the rotational velocity, using MSA and TRM, we obtained an estimation of the projected rotational velocity PDF by solving the integral in Eq. (3), in other words,

$\begin{matrix} {\hat{f}}_{Y} (y) = \int_{y}^{\infty} \frac{y}{x \sqrt{x^{2} - y^{2}}} {\hat{f}}_{X} (x) d x, \end{matrix}$ $\begin{aligned} \hat{f}_{Y}({ y}) = \int _{{ y}}^{\infty }\frac{{ y}}{x\sqrt{x^2-{ y}^2}}\hat{f}_{X}(x)\,\mathrm{d}x, \end{aligned}$ (24)

where ${\hat{f}}_{X} (x) = f_{X} (x | \hat{β})$ $\hat{f}_X(x)=f_X(x|\hat{\beta})$ for MSA and ${\hat{f}}_{X} (x)$ $\hat{f}_X(x)$ is the estimated density using TRM, shown in Fig. 4.

Fig. 4.

Contrasting projected and observed rotational velocities for the real samples using K = 2 (Coma Berenice and Tarantula samples) and K = 3 (Geneva sample) for MSA algorithm. Gray bars correspond to Histograms of the projected rotational velocities for the real samples. The blue line represents the corresponding estimated f_Y(y|β) using MSA. The black dashed-dotted line represents the corresponding estimated f_Y(y|β) using TRM.

From Fig. 3 from a large number of samples (Tarantula and Geneva samples) both methods show similar results, except for at very low rotational speeds, where TRM wrongly estimates a non-zero probability for nought rotational velocities, that is, ${\hat{f}}_{X} (x = 0) > 0$ $\hat{f}_X(x=0) > 0$ . On the contrary, our method correctly provides a zero probability for nought rotational velocities, that is, ${\hat{f}}_{X} (x = 0) = 0$ $\hat{f}_X(x=0)=0$ . In addition, the estimated projected rotational velocities are expected to provide a zero probability to nought projected rotational velocities. Our method provides the correct estimation (see Eqs. (21) and (22)), as is shown in Fig. 4. On the other hand, similarly to the true rotational velocities estimation, the result provided by TRM is incorrect for zero projected rotational velocities. For small number of samples (i.e., Coma Berenice sample), the true rotational velocity (Fig. 3) and the projected rotational velocity (Fig. 4) PDFs differ. However, the estimated projected rotational velocities we obtained with our method closely resembles the histogram from collected data (see Fig. 4). This results suggest that our method provides better estimate of the rotational velocity.

To validate our conclusion, we performed three statistical tests, a Kolmogorov–Smirnov (KS) test, an Anderson–Darling (AD) test and a q–q plot utilizing the measurements and the estimated projected rotational velocities. All three show that our method provides more accurate estimates of the true rotational velocities from the collected data for the three sets of samples. In particular, as is expected, for a large number of samples (i.e., Tarantula and Geneva samples), the results from the test are similar. For a small number of samples (i.e., Coma Berenice sample), our method has resulted in less discrepancy between the PDFs of the estimated projected rotational velocities and the one obtained form the measurements.

We used the KS test to support this analysis, where the not-rejection significance level is given as a percentage-the lower it is, the more reliable is the estimated model. We consider the empirical PDF from Coma Berenice sample and its corresponding CDF, obtained via numerical integration (trapezoidal method) using the estimated PDF from TRM and MSA algorithms. We tested performance between the real samples and the estimation results for the null hypothesis H₀ “the projected velocities sample comes from the projected estimated distribution (from MSA or TRM)”, and the complementary alternative hypothesis. Considering the MSA algorithm, we observed that the null hypothesis is not rejected at a significance level of 7.4%, and from TRM algorithm, the null hypothesis is not rejected at a significance level of 39.6%. Then, we also obtained the q–q plot (Fig. 5) of the densities estimated using TRM and MSA algorithms, confirming that MSA estimation closely fall on the reference line with respect to TRM results.

Fig. 5.

q–q plot from Coma Berenice sample. The blue-mark line represents the quantiles of the corresponding estimated f_Y(y|β) using MSA. The black-square line represents the quantiles of the corresponding estimated f_Y(y|β) using TRM. The red-dashed line is the reference line.

In order to corroborate our findings, we considered the AD test (see e.g., Anderson & Darling 1952), considering a significance level of 5%. From the Coma Berenice sample, the null hypothesis is not rejected when the MSA algorithm is used, and the null hypothesis is rejected for TRM results. This might imply that the estimated PDF from TRM does not represent the actual data. From the Tarantula and Geneva samples, the null hypothesis is not rejected when we use the two algorithms (TRM and MSA) to estimate the corresponding f_Y(y|β).

6. Conclusions and final remarks

In this work we proposed a novel PDF estimation algorithm using ML approach in terms of finite distribution mixtures, particularly, Maxwellian distributions. This algorithm is utilized to obtain the estimated probability distribution of true rotational stellar velocities. The advantage of this algorithm is that we were able to use the sample data from the experiments to obtain an estimated PDF of the projected rotational velocities f_Y(y). This algorithm allowed us to use the sample data directly, without intermediate steps or approximations, unlike the estimation using TRM that requires the utilization of KDE, a non-parametric estimation technique.

We analyzed the performance of the proposed algorithm utilizing synthetic data and several Monte Carlo simulations, when the rotational velocity is described by: (i) a Maxwell distribution and (ii) a mixture of two or three Maxwellian distributions, considering different sample lengths (N = 50, N = 500 and N = 2000). In general, we observed satisfactory results for each situation under different scenarios using MSA algorithm to estimate the true rotational velocity, exhibiting a better overall performance when compared to TRM. Moreover, we use the MISE as a measure of the performance of the PDF estimation. We obtained excellent results with MSA algorithm for all sample sizes, while the performance of TRM is satisfactory only for samples lengths of sizes N = 500 and N = 2000.

The PDF estimation algorithm was also tested in a set of real observed data from the Coma Berenice, Tarantula clusters and Geneva sample. We note that TRM has a lack of smoothing, likely due to the use of KDE. In addition, when utilizing TRM, the mode of the estimated function is slightly shifted to the right, compared to the estimated PDF from the proposed MSA. On the other hand, when using the proposed MSA algorithm with the Tarantula samples, the estimated PDF (for K = 2, 3) is very similar to the estimated function when using TRM method. This agreement in the estimates is due to the larger length of samples (N ≈ 500). Finally, from the Geneva sample we obtained an important agreement between estimations from MSA with K = 3 and TRM algorithms due to large length of samples used N > 2000. The proposed methodology can also be utilized for the estimation of other distributions associated with the projected rotational velocity. We have tested this idea with simulated data for the projected rotational velocity drawn from Tsallis distribution Carvalho et al. (2009), Kanadiakis distribution Carvalho et al. (2009), and Gamma-Normal distribution Ramírez-Agudelo et al. (2013). The results yielded an important agreement in the statistical description of the projected rotational velocities, particularly with K = 4 Maxwellian terms in the MSA. If more complex PDFs are to be estimated, other distributions can be used in the finite mixture distribution, for example Gaussian, provided they satisfy the Wiener approximation theorem Achieser (1956) in the desired domain of the PDF.

To summarize, we proposed a PDF estimation algorithm for rotational velocities based on ML, utilizing a finite mixture distribution that can be used even for small length of samples. The TRM proposed by Christen et al. (2016) exhibits a similar performance, but for a larger length of samples.

Acknowledgments

This work was partially supported by FONDECYT trough grant No 1181158, the Advanced Center for Electrical and Electronic Engineering (AC3E, Proyecto Basal FB0008), Núcleo Milenio de Formación Planetaria – NPF and Programa de Iniciación a la Investigación Científica (PIIC) de la Dirección de Postgrado UTFSM convenio No 015/2018. Michel Curé thanks the support from the Centro de Astrofísica de Valparaíso and the Centro Interdisciplinario de Estudios Atmosféricos y Astroestadística.

References

Achieser, N. 1956, Theory of Approximation (New York: Ungar Publishing Co.) [Google Scholar]
Alipanah, A., & Esmaeili, S. 2011, J. Comput. Appl. Math., 235, 5342 [CrossRef] [Google Scholar]
Alspach, D. L., & Sorenson, H. W. 1972, IEEE Trans. Auto. Control, AC-17, 439 [CrossRef] [Google Scholar]
Anderson, T., & Darling, D. 1952, Ann. Math. Stat., 23, 193 [CrossRef] [Google Scholar]
Bouhamidi, A., & Jbilou, K. 2007, J. Comput. Appl. Math., 206, 86 [NASA ADS] [CrossRef] [Google Scholar]
Carvajal, R., Orellana, R., Katselis, D., Escárate, P., & Agüero, J. C. 2018, PLOS ONE, 13, 1 [CrossRef] [Google Scholar]
Carvalho, J. C., do Nascimento, J. D., Silva, R., & DeMedeiros, J. R. 2009, ApJ, 696, L48 [NASA ADS] [CrossRef] [Google Scholar]
Chandrasekhar, S., & Münch, G. 1950, ApJ, 111, 142 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
Christen, A., Escárate, P., Curé, M., Rial, D. F., & Cassetti, J. 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Curé, M., Rial, D. F., Christen, A., & Cassetti, J. 2014, A&A, 85, A13 [Google Scholar]
Degroot, M. H. 2004, Optimal Statistical Decisions (New York: Jhon Wiley and Sons, Inc) [CrossRef] [Google Scholar]
Dempster, A. P., Laird, N. M., & Rubin, D. 1977, R. Stat. Soc., 39, 1 [Google Scholar]
Deng, L., Huang, T., Zhao, L., & Wang, S. 2013, J. Opt. Soc. Am., 30, 5 [Google Scholar]
Deutsch, A. J. 1970, Stellar Rotation, Proceedings of the IAU Colloquium, 207 [CrossRef] [Google Scholar]
Fomel, S. 2007, Geophysics, 72, A29 [NASA ADS] [CrossRef] [Google Scholar]
Gaigé, Y. 1993, A&A, 269, 267 [Google Scholar]
Hansen, P. 2010, Discrete Inverse Problems (Philadelphia: Society for Industrial and Applied Mathematics) [Google Scholar]
Holmberg, J., Nordström, B., & Andersen, J. 2007, A&A, 475, 519 [NASA ADS] [CrossRef] [EDP Sciences] [MathSciNet] [Google Scholar]
Lo, J. 1972, IEEE Trans. Inf. Theory, 18, 583 [CrossRef] [Google Scholar]
Lucy, L. B. 1974, AJ, 79, 745 [NASA ADS] [CrossRef] [Google Scholar]
Malhotra, R. 2015, ApJ, 808, 71 [NASA ADS] [CrossRef] [Google Scholar]
Mermilliod, J., Mayor, M., & Udry, S. 2009, A&A, 498, 949 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Neal, R. M. 2003, Ann. Stat., 31, 705 [Google Scholar]
Nordström, B., Mayor, M., Andersen, J., et al. 2004, A&A, 418, 989 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Orellana, R., Carvajal, R., & Agüero, J. C. 2018, 18th IFAC Symposium on System Identification, SYSID [Google Scholar]
Ramírez-Agudelo, O., Simón-Díaz, S., Sana, H., et al. 2013, A&A, 560, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Shirin, A., & Islam, M. S. 2013, ArXiv e-prints [arXiv:1309.6311] [Google Scholar]
Silverman, B. W. 1986, Density Estimation for Statistics and Data Analysis (London: Chapman & Hall) [Google Scholar]
Yalçinbaş, S., & Aynigül, M. 2011, Math. Comput. Appl., 16, 497 [Google Scholar]

Appendix A: Computing the parameters of the MSA

We define the following

$\begin{matrix} K (x_{t}, β_{j}) = λ_{j} g (x_{t} ; θ_{j}), \end{matrix}$ $\begin{aligned}&K(x_t,\beta _j)=\lambda _j{ g}(x_t;\theta _j), \end{aligned}$ (A.1)

$\begin{matrix} d μ (x_{t}) = p (y_{t} | x_{t}) d x_{t}, \end{matrix}$ $\begin{aligned}&d\mu (x_t)=p({ y}_t|x_t)\,\mathrm{d}x_t, \end{aligned}$ (A.2)

then, the log-likelihood function in Eq. (10) can be expressed as

$\begin{matrix} ℓ_{N} (β) = \sum_{t = 1}^{N} log [V_{t} (β)], \end{matrix}$ $\begin{aligned} \ell _N(\beta )=\sum _{t=1}^{N}\log \left[\mathcal{V} _t(\beta )\right], \end{aligned}$ (A.3)

with

$\begin{matrix} V_{t} (β) = \sum_{j = 1}^{K} \int_{y_{t}}^{\infty} K (x_{t}, β_{j}) d μ (x_{t}), \end{matrix}$ $\begin{aligned} \mathcal{V} _t(\beta )=\sum _{j=1}^{K}\int _{{ y}_t}^{\infty }K(x_t,\beta _j)\,\mathrm{d}\mu (x_t), \end{aligned}$ (A.4)

thus, the ML estimator is obtained from

$\begin{matrix} {\hat{β}}_{} ML = arg max_{β} \sum_{t = 1}^{N} log [V_{t} (β)] . \end{matrix}$ $\begin{aligned} \hat{\beta }_\text{ ML}=\arg \max _{\beta }\; \sum _{t=1}^{N}\log \left[\mathcal{V} _t(\beta )\right]. \end{aligned}$ (A.5)

By defining ℬ_t(β)=log[𝒱_t(β)], we can follow a similar analysis as in Carvajal et al. (2018), obtaining

$\begin{matrix} B_{t} (β) = Q_{t} (β, {\hat{β}}^{(m)}) - H_{t} (β, {\hat{β}}^{(m)}), \end{matrix}$ $\begin{aligned} \mathcal{B} _t(\beta )=\mathcal{Q} _t(\beta ,\hat{\beta }^{(m)})-\mathcal{H} _t(\beta ,\hat{\beta }^{(m)}), \end{aligned}$ (A.6)

where

$\begin{matrix} Q_{t} (β, {\hat{β}}^{(m)}) = \sum_{j = 1}^{K} \int_{y_{t}}^{\infty} log [K (x_{t}, β_{j})] \frac{K (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{t} ({\hat{β}}^{(m)})} d μ (x_{t}), \end{matrix}$ $\begin{aligned}&\mathcal{Q} _t(\beta ,\hat{\beta }^{(m)})=\sum _{j=1}^{K}\int _{{ y}_t}^{\infty }\log \left[K(x_t,\beta _j)\right]\frac{K(x_t,\hat{\beta }^{(m)}_j)}{\mathcal{V} _t(\hat{\beta }^{(m)})}\,\mathrm{d}\mu (x_t), \end{aligned}$ (A.7)

$\begin{matrix} H_{t} (β, {\hat{β}}^{(m)}) = \sum_{j = 1}^{K} \int_{y_{t}}^{\infty} log [\frac{K (x_{t}, β_{j})}{V_{t} (β)}] \frac{K (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{t} ({\hat{β}}^{(m)})} d μ (x_{t}) . \end{matrix}$ $\begin{aligned}&\mathcal{H} _t(\beta ,\hat{\beta }^{(m)})=\sum _{j=1}^{K}\int _{{ y}_t}^{\infty } \! \log \left[\frac{K(x_t,\beta _j)}{\mathcal{V} _t(\beta )}\right]\frac{K(x_t,\hat{\beta }^{(m)}_j)}{\mathcal{V} _t(\hat{\beta }^{(m)})}\,\mathrm{d}\mu (x_t). \end{aligned}$ (A.8)

Thus, from Eqs. (A.5) and (A.6), the ML estimator can be locally obtained from

$\begin{matrix} {\hat{β}}_{} ML = arg max_{β} \sum_{t = 1}^{N} B_{t} (β) . \end{matrix}$ $\begin{aligned} \hat{\beta }_\text{ ML}=\arg \max _{\beta }\; \sum _{t=1}^{N}\mathcal{B} _t(\beta ). \end{aligned}$ (A.9)

Then, the function $H_{t} (β, {\hat{β}}^{(m)})$ $\mathcal{H}_t(\beta,\hat{\beta}^{(m)})$ is a decreasing function for any value of β and satisfies the following:

$\begin{matrix} H_{t} (β, {\hat{β}}^{(m)}) - H_{t} ({\hat{β}}^{(m)}, {\hat{β}}^{(m)}) \leq 0 . \end{matrix}$ $\begin{aligned} \mathcal{H} _t(\beta ,\hat{\beta }^{(m)})-\mathcal{H} _t(\hat{\beta }^{(m)},\hat{\beta }^{(m)})\le 0. \end{aligned}$ (A.10)

In Orellana et al. (2018) is detailed the proof of this inequality. From this inequality inspired by the EM algorithm, we can formulate the following iterative algorithm:

$\begin{matrix} \bar{Q} (β, {\hat{β}}^{(m)}) = \sum_{t = 1}^{N} Q_{t} (β, {\hat{β}}^{(m)}), \end{matrix}$ $\begin{aligned}&\bar{\mathcal{Q} }(\beta ,\hat{\beta }^{(m)})=\sum _{t=1}^{N}\mathcal{Q} _t(\beta ,\hat{\beta }^{(m)}),\end{aligned}$ (A.11)

$\begin{matrix} {\hat{β}}^{(m + 1)} = arg max_{β} \bar{Q} (β, {\hat{β}}^{(m)}) . \end{matrix}$ $\begin{aligned}&\hat{\beta }^{(m+1)}=\arg \max _{\beta }\; \bar{\mathcal{Q} }(\beta ,\hat{\beta }^{(m)}). \end{aligned}$ (A.12)

We note that Eqs. (A.11) and (A.12) correspond to the E-step and M-step of the EM algorithm, respectively. Taking derivative of $\bar{Q} (β, {\hat{β}}^{(m)})$ $\bar{\mathcal{Q}}(\beta,\hat{\beta}^{(m)})$ with respect to $α = 1 / σ_{j}^{2}$ $\alpha = 1/\sigma _j^2$ and equating to zero we obtain

$\begin{matrix} \frac{\partial \bar{Q} (β, {\hat{β}}^{(m)})}{\partial α} & = \frac{3}{2 {\hat{α}}^{(m + 1)}} \sum_{t = 1}^{N} \int_{y_{t}}^{\infty} \frac{K_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{M_{t}} ({\hat{β}}^{(m)})} d μ (x_{t}) \\ - \frac{1}{2} \sum_{t = 1}^{N} \int_{y_{t}}^{\infty} x_{t}^{2} \frac{K_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{M_{t}} ({\hat{β}}^{(m)})} d μ (x_{t}) = 0 . \end{matrix}$ $\begin{aligned} \frac{\partial \bar{\mathcal{Q} }(\beta ,\hat{\beta }^{(m)})}{\partial \alpha }&=\frac{3}{2\hat{\alpha }^{(m+1)}}\sum _{t=1}^{N}\int _{{ y}_t}^{\infty }\frac{K_M\left(x_t,\hat{\beta }_j^{(m)}\right)}{\mathcal{V} _{M_t}\left(\hat{\beta }^{(m)}\right)}\,\mathrm{d}\mu (x_t)\nonumber \\&\quad -\frac{1}{2}\sum _{t=1}^{N}\int _{{ y}_t}^{\infty }x_t^2\frac{K_M\left(x_t,\hat{\beta }_j^{(m)}\right)}{\mathcal{V} _{M_t}\left(\hat{\beta }^{(m)}\right)}\,\mathrm{d}\mu (x_t)=0. \end{aligned}$ (A.13)

Using Eqs. (15) and (16) we have

$\begin{matrix} {[{\hat{σ}}_{j}^{2}]}^{(m + 1)} & = \frac{1}{3 P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})} \sum_{t = 1}^{N} \int_{y_{t}}^{\infty} x_{t}^{2} \frac{K_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{V_{M_{t}} ({\hat{β}}^{(m)})} d μ (x_{t}), \\ {[{\hat{σ}}_{j}]}^{(m + 1)} & = {[\frac{S_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{3 P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}]}^{\frac{1}{2}} . \end{matrix}$ $\begin{aligned} \left[{{\hat{\sigma }}^2_j}\right]^{\,\,(m+1)}&=\frac{1}{3\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}\sum _{t=1}^{N}\int _{{ y}_t}^{\infty }\!\!\!x_t^2\frac{K_M\left(x_t,\hat{\beta }_j^{(m)}\right)}{\mathcal{V} _{M_t}\left(\hat{\beta }^{(m)}\right)}\,\mathrm{d}\mu (x_t),\nonumber \\ \left[{\hat{\sigma }_j}\right]^{\,\,(m+1)}&=\left[\frac{\mathcal{S} _M(x_t,\hat{\beta }_j^{(m)})}{3\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}\right]^{\frac{1}{2}}. \end{aligned}$ (A.14)

For the parameter λ_j we define R_M(λ_j) as follows:

$\begin{matrix} R_{M} (λ_{j}) = \sum_{j = 1}^{K} log [λ_{j}] {P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}, \end{matrix}$ $\begin{aligned} R_M(\lambda _j)=\sum _{j=1}^{K}\log \left[\lambda _j\right]\left\{ \mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})\right\} , \end{aligned}$ (A.15)

subject to

$\begin{matrix} \sum_{j = 1}^{K} λ_{j} = 1 . \end{matrix}$ $\begin{aligned} \sum _{j=1}^{K}\lambda _j=1. \end{aligned}$ (A.16)

Then, using Lagrange multipliers and optimizing with respect λ_j:

$\begin{matrix} L_{M} (λ_{j}, γ) = \sum_{j = 1}^{K} log [λ_{j}] {P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})} - γ_{M} (\sum_{j = 1}^{K} λ_{j} - 1) . \end{matrix}$ $\begin{aligned} \mathcal{L} _M(\lambda _j,\gamma )=\sum _{j=1}^{K}\log \left[\lambda _j\right]\left\{ \mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})\right\} -\gamma _M\left(\sum _{j=1}^{K}\lambda _j-1\right). \end{aligned}$ (A.17)

Taking the derivative of ℒ_M(λ_j, μ) with respect to λ_j and γ_M, then equating to zero we obtain

$\frac{\partial L_{M} (λ_{j}, γ_{M})}{\partial λ_{j}} = \frac{P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{{\hat{λ}}_{j}^{(m + 1)}} - γ_{M} = 0,$ $\frac{{\partial {{\cal L}_M}({\lambda _j},{\gamma _M})}}{{\partial {\lambda _j}}} = \frac{{{{\cal P}_M}({x_t},\hat \beta _j^{(m)})}}{{\hat \lambda _j^{(m + 1)}}} - {\gamma _M} = 0,$ (A.18)

$\frac{\partial L_{M} (λ_{j}, γ_{M})}{\partial γ_{M}} = \sum_{j = 1}^{K} λ_{j} - 1 = 0.$ $\frac{{\partial {{\cal L}_M}({\lambda _j},{\gamma _M})}}{{\partial {\gamma _M}}} = \sum\limits_{j = 1}^K {{\lambda _j}} - 1 = 0.$ (A.19)

Then,

$\begin{matrix} {\hat{λ}}_{j}^{(m + 1)} = \frac{P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{γ_{M}} \cdot \end{matrix}$ $\begin{aligned} \hat{\lambda }_j^{(m+1)}=\frac{\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}{\gamma _M}\cdot \end{aligned}$ (A.20)

If we sum over K in (A.20) and use (A.19) we have

$\sum_{j = 1}^{K} {\hat{λ}}_{j}^{(m + 1)} = \sum_{j = 1}^{K} \frac{P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{γ_{M}} = 1,$ $\sum\limits_{j = 1}^K {\hat \lambda _j^{(m + 1)}} = \sum\limits_{j = 1}^K {\frac{{{{\cal P}_M}({x_t},\hat \beta _j^{(m)})}}{{{\gamma _M}}}} = 1,$ (A.21)

$γ_{M} = \sum_{j = 1}^{K} P_{M} (x_{t}, {\hat{β}}_{j}^{(m)}) .$ ${\gamma _M} = \sum\limits_{j = 1}^K {{{\cal P}_M}} ({x_t},\hat \beta _j^{(m)}).$ (A.22)

Finally, we obtain

$\begin{matrix} {\hat{λ}}_{j}^{(m + 1)} = \frac{P_{M} (x_{t}, {\hat{β}}_{j}^{(m)})}{\sum_{l = 1}^{K} P_{M} (x_{t}, {\hat{β}}_{l}^{(m)})} \cdot \end{matrix}$ $\begin{aligned} \hat{\lambda }_j^{(m+1)}=\frac{\mathcal{P} _M(x_t,\hat{\beta }_j^{(m)})}{\sum _{l=1}^{K}\mathcal{P} _M(x_t,\hat{\beta }_l^{(m)})}\cdot \end{aligned}$ (A.23)

Appendix B: Estimation parameters from Monte Carlo simulations

Table B.1 shows the mean values and standard deviation of the estimation parameter of MSA algorithm using one Maxwellian mixture distribution. The case of study represents the unimodal distribution for true rotational velocities. Table B.2 shows the mean values and standard deviation of the estimation parameter of MSA algorithm using two Maxwellian mixture distribution. The case of study represents the bimodal distribution for true rotational velocities.

Table B.1.

Parameters estimated using MSA algorithm for one Maxwellian mixture distribution.

Table B.2.

Parameters estimated using MSA algorithm for two Maxwellian mixture distribution.

Appendix C: Mean integrated square error (MISE) for Monte Carlo simulations

Table C.1 shows the MISE (Eq. (23)) values for unimodal Maxwellian estimation considering one component (K = 1) for MSA algorithm and the TRM algorithm for different sample data length. Similar, Table C.2 shows the MISE values for bimodal Maxwellian estimation considering one component (K = 2) for MSA algorithm and the TRM algorithm different sample data length.

Table C.1.

MISE values of PDF estimation for unimodal Maxwellian distribution example.

Table C.2.

MISE values of PDF estimation for bimodal Maxwellian distribution example.

All Tables

Table B.1.

Parameters estimated using MSA algorithm for one Maxwellian mixture distribution.

In the text

Table B.2.

Parameters estimated using MSA algorithm for two Maxwellian mixture distribution.

In the text

Table C.1.

MISE values of PDF estimation for unimodal Maxwellian distribution example.

In the text

Table C.2.

MISE values of PDF estimation for bimodal Maxwellian distribution example.

In the text

All Figures

Fig. 1.

Monte Carlo simulation results to true rotational velocities PDF estimation: panel a: using TRM algorithm for unimodal Maxwellian distribution. Panel b: using TRM algorithm for bimodal Maxwellian distribution. The black line represents the true rotational velocities PDF. The blue line represents the average of the estimated PDF using TRM. The region between the red dashed-dotted lines corresponds to one standard deviation level of the all estimated PDFs.

In the text

Fig. 2.

Monte Carlo simulation results to true rotational velocities PDF estimation: panel a: using K = 1 for MSA algorithm. Panel b: using K = 2 for MSA algorithm. The black line represents the true rotational velocities PDF. The blue line represents the average of the estimated PDF using MSA. The region between the red dashed-dotted lines corresponds to one standard deviation of the all estimated PDFs.

In the text

	Fig. 3. Estimated PDF of true rotational velocities for real samples cases using K = 2 (Coma Berenice and Tarantula samples) and K = 3 (Geneva sample) for MSA algorithm. The TRM algorithm is setting following Christen et al. (2016). The blue line represents the MSA estimated PDF. The black dashed-dotted line represent the TRM estimated PDF.
In the text

Fig. 4.

Contrasting projected and observed rotational velocities for the real samples using K = 2 (Coma Berenice and Tarantula samples) and K = 3 (Geneva sample) for MSA algorithm. Gray bars correspond to Histograms of the projected rotational velocities for the real samples. The blue line represents the corresponding estimated f_Y(y|β) using MSA. The black dashed-dotted line represents the corresponding estimated f_Y(y|β) using TRM.

In the text

	Fig. 5. q–q plot from Coma Berenice sample. The blue-mark line represents the quantiles of the corresponding estimated f_Y(y\|β) using MSA. The black-square line represents the quantiles of the corresponding estimated f_Y(y\|β) using TRM. The red-dashed line is the reference line.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Achieser, N. 1956, Theory of Approximation (New York: Ungar Publishing Co.) [Google Scholar]

[2] Alipanah, A., & Esmaeili, S. 2011, J. Comput. Appl. Math., 235, 5342 [CrossRef] [Google Scholar]

[3] Alspach, D. L., & Sorenson, H. W. 1972, IEEE Trans. Auto. Control, AC-17, 439 [CrossRef] [Google Scholar]

[4] Anderson, T., & Darling, D. 1952, Ann. Math. Stat., 23, 193 [CrossRef] [Google Scholar]

[5] Bouhamidi, A., & Jbilou, K. 2007, J. Comput. Appl. Math., 206, 86 [NASA ADS] [CrossRef] [Google Scholar]

[6] Carvajal, R., Orellana, R., Katselis, D., Escárate, P., & Agüero, J. C. 2018, PLOS ONE, 13, 1 [CrossRef] [Google Scholar]

[7] Carvalho, J. C., do Nascimento, J. D., Silva, R., & DeMedeiros, J. R. 2009, ApJ, 696, L48 [NASA ADS] [CrossRef] [Google Scholar]

[8] Chandrasekhar, S., & Münch, G. 1950, ApJ, 111, 142 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]

[9] Christen, A., Escárate, P., Curé, M., Rial, D. F., & Cassetti, J. 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[10] Curé, M., Rial, D. F., Christen, A., & Cassetti, J. 2014, A&A, 85, A13 [Google Scholar]

[11] Degroot, M. H. 2004, Optimal Statistical Decisions (New York: Jhon Wiley and Sons, Inc) [CrossRef] [Google Scholar]

[12] Dempster, A. P., Laird, N. M., & Rubin, D. 1977, R. Stat. Soc., 39, 1 [Google Scholar]

[13] Deng, L., Huang, T., Zhao, L., & Wang, S. 2013, J. Opt. Soc. Am., 30, 5 [Google Scholar]

[14] Deutsch, A. J. 1970, Stellar Rotation, Proceedings of the IAU Colloquium, 207 [CrossRef] [Google Scholar]

[15] Fomel, S. 2007, Geophysics, 72, A29 [NASA ADS] [CrossRef] [Google Scholar]

[16] Gaigé, Y. 1993, A&A, 269, 267 [Google Scholar]

[17] Hansen, P. 2010, Discrete Inverse Problems (Philadelphia: Society for Industrial and Applied Mathematics) [Google Scholar]

[18] Holmberg, J., Nordström, B., & Andersen, J. 2007, A&A, 475, 519 [NASA ADS] [CrossRef] [EDP Sciences] [MathSciNet] [Google Scholar]

[19] Lo, J. 1972, IEEE Trans. Inf. Theory, 18, 583 [CrossRef] [Google Scholar]

[20] Lucy, L. B. 1974, AJ, 79, 745 [NASA ADS] [CrossRef] [Google Scholar]

[21] Malhotra, R. 2015, ApJ, 808, 71 [NASA ADS] [CrossRef] [Google Scholar]

[22] Mermilliod, J., Mayor, M., & Udry, S. 2009, A&A, 498, 949 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[23] Neal, R. M. 2003, Ann. Stat., 31, 705 [Google Scholar]

[24] Nordström, B., Mayor, M., Andersen, J., et al. 2004, A&A, 418, 989 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[25] Orellana, R., Carvajal, R., & Agüero, J. C. 2018, 18th IFAC Symposium on System Identification, SYSID [Google Scholar]

[26] Ramírez-Agudelo, O., Simón-Díaz, S., Sana, H., et al. 2013, A&A, 560, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[27] Shirin, A., & Islam, M. S. 2013, ArXiv e-prints [arXiv:1309.6311] [Google Scholar]

[28] Silverman, B. W. 1986, Density Estimation for Statistics and Data Analysis (London: Chapman & Hall) [Google Scholar]

[29] Yalçinbaş, S., & Aynigül, M. 2011, Math. Comput. Appl., 16, 497 [Google Scholar]