In this appendix, we describe in detail the statistical tests used thorough this paper.
Pearson's correlation coefficient r evaluates the association
between two continuous variables x and y (such as the orbit semi-major axis
a and the V-R color). r is given by
The tests described in this section aim at comparing two continuous,
1D distributions (such as the V-R colors of two MBOSS families).
These three tests estimate the validity of the null hypothesis "the
two samples are extracted from the same population.'' This is
performed by computing an estimator (f, t and d resp., defined
below), whose direct interest is limited. From the estimator, a much
more interesting value is derived: Prob, the probability that the
statistical estimator is as large as measured by chance. Probis the probability to get a statistical estimator as large as or
larger than the value measured while the two samples compared being
actually random sub-samples of a same distribution. Large values of
Prob indicate that it is very probable to get the measured estimator
by chance, or in other words, that we have no reason to claim (on statistical bases) that the two samples come from different
distributions. Remember, however, that this does not allow us to say
that the samples are identical, only that they are not statistically
incompatible. On the other hand, small values of Prob indicate that
the chances of getting the observed estimator by chance while extracting
the two samples from the same distributions are small, or in other
words, that the two samples are not statistically compatible. The size
of the sub-samples is taken into account in the computation of
Prob. While it is definitely safer to work on "large'' samples, the
advantage of these methods is that they start to give fairly reliable
results with fairly small samples; in this study, we set the threshold
as 7. The probability at which one can conclude that samples
are different depends on the certainty level required. Traditional
values are 0.05 and 0.003, corresponding to the usual 2 and
levels. For this study, we will start raising the warning
flags at
.
Of course, if we raise 10 such flags, we can
expect that one of them will be a random effect.
The statistic tests are described in more detail, together with their original references and with the algorithms we used in Press et al. (1992).
This test checks whether the means of two distributions are
significantly different. The basic implementation of this test implies
that the variance of both distributions are equal. For the MBOSSes
colors, this cannot be guaranteed (we deal with that question with the
next section). We therefore used a modified version of the t test
that deals with unequal variances:
The f-test evaluates whether two distributions have significantly
different variances. The statistic f is simply the ratio of the
largest variance to the smaller one:
Obviously, the whole information from a distribution is not contained
in its two first moments (mean and variance). A more complete
comparison of the color distributions is therefore interesting. The
ideal statistics tool for this purpose is the Kolmogorov-Smirnov (KS)
test. The distributions are compared through their Cumulative
Probability Function (CPF) S(x), which is defined as the fraction of
the sample whose value is smaller or equal to x. f starts at 0 and
increases till it reaches 1 for the x corresponding to largest
element of the distribution. d, the KS test, is the maximum
(vertical) distance between the CPFs S1 and S2 of the samples to be
compared, i.e.
Copyright ESO 2002