R. L. Branham, Jr.
Instituto Argentino de Nivología y Glaciología (IANIGLA), C.C. 330, 5500 Mendoza, Argentina
Received 19 February 2004 / Accepted 23 March 2004
Abstract
The velocity ellipsoid has traditionally been calculated by use of moments
of the velocity components after the distribution of parallaxes has been
corrected for parallax error. An alternative, used in this study, fits a
quadric surface to the velocity components and reduces the data by use of a
new mathematical technique, semidefinite programming, and a criterion,
difference of squares, that calculates a unique ellipsoid that represents a
global minimum of the reduction criterion. Use is made of 246 stars of
spectral class O-B5 and luminosity class V to calculate the velocity
ellipsoid. The results may be considered superior to those found from the
method of moments.
Key words: Galaxy: kinematics and dynamics - methods: data analysis
The velocity ellipsoid remains essential for the study of stellar kinematics. Until recently the paucity of parallaxes assured that the ellipsoid was calculated mainly by use of radial velocities alone or proper motions alone. The publication of the Hipparcos catalog (ESA 1997), with over 100 000 stellar parallaxes, remedied this situation. One may now calculate the velocity ellipsoid from total space motions for a reasonable number of stars.
Having an adequate number of stars also implies that one should look for the best method to calculate the ellipsoid. Trumpler & Weaver's classic text, Statistical Astronomy (1962), outlines a method based on computing the moments of the space velocities. Ogorodnokov (1965) presents basically the same prescription. Most of the traditional values for the parameters of the velocity ellipsoid, such as those Delhaye (1965) tabulates, are based on the method of moments. Oblak (1983), however, prefers an approach based on maximum likelihood estimation. A relatively recent study (Dehnen & Binney 1998), and one of the first to make extensive use of the Hipparcos data, returns to the use of statistical moments. This paper presents yet another idea, but one with attractive features, in particular that the ellipsoid calculated is unique.
Because stellar parallaxes incorporate significant errors, the median parallax for the Hipparcos catalog, 4.790 milli-arcsec (mas), having a median error of 1.090 mas, the velocities also contain significant errors. Trumpler & Weaver recommend correcting the parallaxes for error by use of an assumed distribution for the error: the corrected parallaxes may then be used to calculate the moments.
To use a reduction technique that allows for errors such as parallax error and implements a more rigorous reduction model than statistical moments seems preferable, however. Such a reduction technique, known as total least squares (TLS) exists. See Branham (2001) for a discussion of the method. To fit a quadric surface to the calculated space velocities becomes the more rigorous reduction model. I have applied the method, and it works well, but unfortunately suffers from a severe drawback, so severe that it cannot be recommended for general use. To understand the more complicated technique actually adopted, called semidefinite programming and which, unfortunately, does not correct for parallax error, it is instructive to review the TLS approach.
Let
be the space velocities of a star. The quadric
surface to fit to these velocities becomes
The procedure outlined so far can work well in practice; see Branham (2003).
But a possible pitfall must be addressed. Granted, Eq. (2) will calculate
the coefficients of a quadric surface, but how do we know that the surface
will be an ellipsoid? Suppose the coefficients correspond to a paraboloid,
hyperboloid, or some other surface? Given the observational error in the
data, can the possibility of calculating a surface other than an ellipsoid
be ruled out a priori? The answer is no, the possibility cannot be
ruled out. But at the cost of substituting a nonlinear reduction procedure
for the linear one in Branham (2001) one can enforce the condition that the
surface be an ellipsoid. The condition that a quadric surface be an
ellipsoid is, if we use the coefficients of Eq. (2), that the matrix
It therefore seems as it a TLS reduction can be retained by use of nonlinear programming. Unfortunately, such a procedure suffers, as I found out from experience, as have others, from such severe problems of divergence, slow convergence, or convergence to a local rather than global minimum that it becomes unsuitable for general application.
What can be done, then? Use the coefficients of Eq. (1) and let
Semidefinite programming can be summarized by (see Vandenberghe & Boyd
(1996) for a complete discussion):
Calafiore (2002) shows how to convert the problem of fitting an ellipsoid to
data points to an SDP problem. Let u be a vector of 1's of size m and
an m-vector of positive scalars. Define
Because the DOS criterion is not a least squares criterion, much less a TLS criterion, parallax errors will not be corrected, as was done automatically
with the algorithm presented in Branham (2003). One must, therefore, correct
the parallax errors. I performed this operation using Smith & Eichhorn's
(1996) procedure, which seems superior to the method that Trumpler & Weaver
use: if
is the observed distribution of parallax error, the
real distribution can be calculated from
After the coefficients of Eq. (4) have been found, the components of the
3-vector c give the solar velocity. Alternatively, Eq. (4) can be
converted into Eq. (1) and the solar velocity calculated after the linear
terms g,h,k are, by a translation of the axes, removed:
The preceding analysis first translates the axes to remove the linear terms and then rotates the new axes by an orthogonal matrix to diagonalize the matrix of second order terms. One could just as easily first rotate and then translate, after the coefficients g,h,k have also been rotated. The results will be the same except that the components of the solar velocity will be referred to the new axes rather than the original axes. I prefer to translate and then rotate.
Mean errors remain an essential part of data adjustment. Their calculation
depends on an n
n covariance matrix, COV, derived from Eq. (5),
Research has shown that the velocity ellipsoid depends on the spectrum-luminosity class of the stars. I decided to investigate the ellipsoid for dwarf stars (luminosity class V) of spectral classes O-B5 after the stars belonging to the Gould belt had been eliminated. Why use these particular stars rather than some others? Because I had already completed a study of the kinematics of the non-Gould belt O-B5 stars (Branham 2002), and to calculate a velocity ellipsoid represents a useful extension of the work. The data, moreover, were available and, because these stars exhibit a noticeable vertex deviation, it would be important to see if the SDP approach finds a deviation. Parallaxes and proper motions were taken from the Hipparcos catalog and radial velocities from the Wilson (Nagy 1991) and Barbier-Brossat et al. (1994) catalogs. This yielded 246 stars. Because Galactic longitude l and latitude b are more natural coordinates for studies of Galactic kinematics than right ascension and declination, the proper motion form the Hipparcos catalog was converted to proper motion in the Galactic system.
The treatment of the data involved the following steps. Smith & Eichhorn's
Eq. (21) corrected the parallaxes for parallax error on a star by star basis
using the parallax with its associated mean error taken from the Hipparcos catalog. Proper motions and radial velocities were corrected for
Galactic rotation. Let
be the proper motion in Galactic
longitude,
the proper motion in Galactic latitude,
the
radial velocity, and A and B the Oort constants. Then the corrected
proper motions,
and
and radial
velocity,
are:
Table 1: Solution for coefficients of velocity ellipsoid from SDP.
Because the DOS criterion is sensitive to outliers, it is important to
eliminate discordant data from the sample before proceeding. To select
outliers I used a filter that excluded space velocities that exceeded five
times the mean absolute deviation of the velocities. I have used this
criterion before. It seems to work well and, because few OB stars are high
velocity, there is little chance that many genuine high velocity stars will
erroneously be considered outliers and rejected. The criterion rejected no
stars. A different criterion, such as Pierce's (Branham 1990) would have
rejected thirteen stars, and changed significantly the dispersions and mean
errors in Table 1, but Pierce's criterion is based on a normal distribution
for residuals. As we shall soon see a normal distribution of error hardly
characterizes the stars used in this study. Figure 1 graphs the velocities of
these stars. The elongated shape of the velocities, suggestive of an
ellipsoidal distribution, is clearly visible. Figure 2 shows a projection onto
the
plane, where a vertex deviation becomes manifest.
![]() |
Figure 1: Velocity distribution of 246 O-B5 stars. |
| Open with DEXTER | |
![]() |
Figure 2:
Velocity distribution of 246 O-B5 stars in
|
| Open with DEXTER | |
Table 2: Velocity ellipsoid from marginal moments.
From these data I calculated from SDP the solution shown in Table 1. For
comparison I also calculated a solution using the procedure, based on
marginal moments, outlined in Trumpler & Weaver (1962). This solution,
given in Table 2, differs significantly from the SDP solution. The
significance of the difference will be discussed in a moment. (No mean
errors accompany this solution because Trumpler & Weaver provide no
prescription for their calculation.) As a check on the criterion used to
eliminate outliers, even though in fact no outliers were eliminated, I
performed a solution using the robust L1 criterion. Replace Eq. (5) by
![]() |
Figure 3: Velocity ellipsoid. |
| Open with DEXTER | |
![]() |
Figure 4:
Velocity ellipsoid in
|
| Open with DEXTER | |
![]() |
Figure 5:
Velocity ellipsoid in
|
| Open with DEXTER | |
![]() |
Figure 6:
Velocity ellipsoid in
|
| Open with DEXTER | |
We have calculated a solution by use of SDP. Why take this more complicated
route when Trumpler & Weaver's procedure with statistical moments also
calculates the coefficients of an ellipsoid, albeit without mean errors,
with less labor? Trumpler & Weaver justify their procedure by reminding us
that statistical moments coincide with what maximum likelihood estimation
gives when the residuals are randomly selected from a normal distribution.
But do the residuals follow a normal distribution and are they random? First
we must specify what we mean by a residual. For the SDP solution the
definition of a residual comes from the DOS criterion,
Table 3: Statistics of the residuals.
where the residual now has dimensions of km s-1. Table 3 and Figs. 7 and 8 give a negative answer to whether the distributions are normal. Figure 7 comes from the SDP solution and Fig. 8 from the Trumpler & Weaver solution. A simple glance shows that the residuals are far from normal. Table 3 quantifies this first impression. The runs test measures the randomness of the residuals, essential if one asserts that a least squares solution is optimum. The difference between the expected 123 runs versus the observed 82 runs for SDP and 106 for Trumpler-Weaver demonstrates that the residuals are in fact not random. The nonzero coefficients of skewness confirm the asymmetry of the residuals. The kurtosis indicates that the SDP solution is leptokurtic, the Trumpler-Weaver solution platykurtic. And the Q factor, measuring the weight of the tails of a distribution, shows that the tails of the actual distributions of residuals are far lighter than those for a normal distribution. (Because different definitions are used for a residual, cols. two and three of Table 3 are not directly comparable. Suffice it to say that if we use the DOS definition of a residual with the Trumpler-Weaver solution, the dispersion of the residuals becomes 1871 km2 s-2, there are 132 runs with a skewness of 1.49, kurtosis of 5.52, and a Q factor of 0.37.)
![]() |
Figure 7: Histogram of residuals from SDP solution. |
| Open with DEXTER | |
![]() |
Figure 8: Histogram of residuals from Trumpler-Weaver solution. |
| Open with DEXTER | |
This behavior of the residuals, however, hardly surprises. The ellipsoidal hypothesis of the distribution of velocities, although explaining the distribution's main features, still only approximates the real, and complicated, stellar velocity structure in the solar vicinity. But the behavior also undercuts the theoretical justification for using statistical moments. We have no reason to assert that statistical moments calculate a solution superior to that given by SDP. When one remembers that the SDP solution with the DOS criterion also calculates a unique ellipsoid that represents a global minimum of Eq. (5), then one can maintain that the SDP solution becomes superior.
One may dispute such an assertion by claiming that some of the parameters
presented in Table 1 differ considerably from values tabulated elsewhere, in
Delhaye (1965), for example. The solar motion and vertex deviation more or
less agree, but the velocity dispersions are far higher. But the traditional
determinations rely for the most part on statistical moments, and
concordance is hardly to be expected. They also differ from Dehnen &
Binney's (1998) determinations. Dehnen & Binney do not explicitly find the
velocity ellipsoid for O-B5 stars, but divide their data into nine color
bins and calculate the ellipsoid, but not the solar motion, for each bin.
The bin with
(B-V)=-0.238 comes closest to my data. The velocity
dispersions,
km s-1,
km s-1,
km s-1, do not differ too much from those of
Table 1, particularly if the mean errors are taken into account; their
for example, varies from 33.72 to 51.22 at the 84.3% confidence level. But surprisingly they find a positive rather than a negative vertex deviation. Once again, however, because their analysis
relies on statistical moments, concordance with my results becomes unlikely.
Semidefinite programming works well when used to calculate the velocity ellipsoid and by calculating a unique ellipsoid that represents a global minimum of the reduction criterion seems superior to the method of moments. This becomes manifest when the theoretical justification for our using statistical moments, maximum likelihood estimation applied to a normal distribution, is untenable. The flexibility of semidefinite programming allows one, should one so desire, to incorporate futher criteria, such as calculating the most spherical ellipsoid that satisfies the data.
Acknowledgements
I would like to thank G. Calafiore for sending me Matlab routines for ellipsoid fitting to data points.