A&A 449, 151-159 (2006)
DOI: 10.1051/0004-6361:20054464
S. Schmeja - R. S. Klessen
Astrophysikalisches Institut Potsdam, An der Sternwarte 16, 14482 Potsdam, Germany
Received 2 November 2005 / Accepted 23 November 2005
Abstract
Context. Understanding the formation and evolution of young star clusters requires quantitative statistical measures of their structure.
Aims. We investigate the structures of observed and modelled star-forming clusters. By considering the different evolutionary classes in the observations and the temporal evolution in models of gravoturbulent fragmentation, we study the temporal evolution of the cluster structures.
Methods. We apply different statistical methods, in particular the normalised mean correlation length and the minimum spanning tree technique. We refine the normalisation of the clustering parameters by defining the area using the normalised convex hull of the objects and investigate the effect of two-dimensional projection of three-dimensional clusters. We introduce a new measure
for the elongation of a cluster. It is defined as the ratio of the cluster radius determined by an enclosing circle to the cluster radius derived from the normalised convex hull.
Results. The mean separation of young stars increases with the evolutionary class, reflecting the expansion of the cluster. The clustering parameters of the model clusters correspond in many cases well to those from observed ones, especially when the
values are similar. No correlation of the clustering parameters with the turbulent environment of the molecular cloud is found, indicating that possible influences of the environment on the clustering behaviour are quickly smoothed out by the stellar velocity dispersion. The temporal evolution of the clustering parameters shows that the star cluster builds up from several subclusters and evolves to a more centrally concentrated cluster, while the cluster expands slower than new stars are formed.
Key words: stars: formation - stars: pre-main sequence - ISM: clouds - Galaxy: open clusters and associations: general - methods: statistical
Almost all stars form in clusters. Embedded clusters contain various types of young stars, making them ideally suited to study the early stages of star formation as they provide a large and genetically homogeneous sample (see Lada & Lada 2003, for a review). Understanding the formation and evolution of young stellar clusters requires quantitative statistical measures of their structure, which may give important clues to the formation process. While some clusters are centrally concentrated with a smooth radial density gradient, others show filaments and signs of fractal subclustering. If and how different structures are connected to the environmental conditions of the molecular clouds and how they depend on the evolutionary stage of the cluster is not yet clear.
Different methods have been used to describe the clustering properties of star clusters, e.g., the mean surface density of companions (Larson 1995) or spanning trees. Cartwright & Whitworth (2004, hereafter CW04) presented a review of various statistical methods for analysing the structures of star clusters and a detailed investigation of both observed and artificially created clusters. We apply the methods discussed there to clusters created by numerical simulations of gravoturbulent star formation and investigate the clustering behaviour with time. We extend the analysis of observed clusters by considering different evolutionary classes.
Four classes of young stellar objects (YSOs) are distinguished
according to the properties of their spectral energy distributions
(SEDs) (e.g. André et al. 2000):
Class 0 sources are deeply embedded protostars with a large sub-mm to
bolometric luminosity ratio (
/
).
The Class 0 stage is the main accretion phase and lasts only a few
104 yr. Class 1 objects are relatively evolved protostars, which
are surrounded by an accretion disc and a circumstellar envelope.
Pre-main-sequence stars in Class 2 and 3 correspond to classical
and weak line T Tauri stars, respectively. They are characterised by a
circumstellar disc (optically thick in Class 2, optically thin in
Class 3) and the lack of a dense circumstellar envelope.
These four classes are usually interpreted as an evolutionary sequence
from Class 0 to 3.
The progenitors of these forming stars are prestellar cores (starless
cores, prestellar condensations). These are gravitationally bound,
dense molecular cloud cores with typical stellar masses that may
already be in a state of collapse, but have not formed a central
protostellar object yet.
Section 2 explains the statistical methods used. In Sects. 3 and 4 we describe the observations and the models, respectively, and in Sect. 5 the application of the methods to the data. The results are analysed and discussed in Sect. 6, while we present our conclusions in Sect. 7. The Appendices give the details about the normalisation of the clustering parameters.
A wide range of statistical methods has been developed to analyse the structure of star clusters
(see CW04 for a review).
A simple approach is to study the distribution of source separations, as it has been
done e.g. by Kaas et al. (2004) for the Serpens cloud core.
Larson (1995), extending the analysis by Gomez et al. (1993),
introduced the mean surface density of companions
,
a
tool that since then has often been used to study star-forming clusters (e.g. Bate et al. 1998; Gladwin et al. 1999; Klessen & Kroupa 2001).
The mean surface density of companions (MSDC) specifies the average number of neighbours per square degree
on the sky at an angular separation
for each cluster star. Knowing the distance to the cluster,
can be converted to an absolute distance r to determine
.
The physical interpretation of the MSDC can be difficult, and CW04 have shown
that the normalised correlation length is a better indicator for the clustering behaviour.
The normalised correlation length
is the mean separation s between stars in the cluster,
normalised by dividing by the radius of the cluster,
.
This radius is defined via the normalised convex hull of the objects (see Appendix A).
The
values are independent of the number of stars in the cluster (CW04).
The minimum spanning tree (MST), a construct from graph theory, is the unique set of straight
lines ("edges'') connecting a given set of points ("vortices'') without closed loops,
such that the sum of the edge
lengths is a minimum (Kruskal 1956; Prim 1957; Gower & Ross 1969).
In astrophysics, minimum spanning trees have so far mainly been used to analyse the structure of
galaxy clusters (e.g. Barrow et al. 1985; Adami & Mazure 1999;
Doroshkevich et al. 2004).
From the MST the normalised mean edge length
is derived. Unlike the mean
separation length s, the mean edge length m depends on the number of stars in the cluster,
therefore it has to be normalised by the factor
(A/n)1/2(Marcelpoil 1993),
where n is the total number of stars and A the two-dimensional area of
the cluster.
In the three-dimensional case the normalisation factor is
(V/n)1/3,
where V is the volume of the cluster.
Area and volume are defined by the normalised convex hull of the objects
(see Appendix A).
The normalisation factors are discussed in detail in Appendix B.
An additional reducing operation, called separating, can be used to isolate subclusters (Barrow et al. 1985). Separating means removing all edges of the MST whose lengths exceed a certain limit. When removing edges from a MST, each remaining subgraph is again a MST of its vortices (Robins et al. 2000).
The values
and
,
on their own, can quantify, but cannot distinguish between, a smooth
large-scale radial density gradient and multiscale fractal subclustering.
Dussert et al. (1986) combine the mean edge length m of a MST and its standard deviation
and use the (m,
)-plane to separate different
degrees of order in various systems. However, CW04 show that this is not sufficient to
differentiate between a smooth large-scale radial density gradient and fractal subclustering.
Therefore, CW04 introduced the parameter
,
which can provide this distinction.
Large Q values (
)
indicate centrally concentrated clusters having a volume
density
,
where Q increases with increasing
(i.e. with
increasing degree of central concentration).
Small Q values (
)
describe clusters with fractal substructure, where Q decreases with
increasing degree of subclustering.
Table 1: Clustering measures and numbers for the observed star-forming regions for prestellar cores (p), Class 1, 2, 3 sources, and all YSOs. (Note that the total number of YSOs is larger than the sum of objects in the individual classes, because it contains Class 0 sources and objects with unclear classification not considered in the analysis of the individual classes.) For details see the discussion in Sect. 6.1.
We also investigate the elongation of the clusters.
We define the elongation
of a cluster as the ratio of the cluster radius
defined by an enclosing circle to the cluster radius derived from the normalised convex
hull of the objects.
A value of
describes a spherical cluster, while a value of
corresponds to an elongated elliptical cluster with an axis ratio of
.
See Appendix C for details.
Our observational data are based on the sample of YSOs in embedded clusters compiled from various
sources as discussed in detail by Schmeja et al. (2005, hereafter SKF05).
However, in the current analysis only the regions
Ophiuchi, Taurus and Serpens will be studied in detail. These are clusters,
where sufficient information on the evolutionary classes as well as on the positions
is given in the literature.
IC 348 and Chamaeleon I are used for determining additional clustering parameters.
The data of
Ophiuchi are taken from Bontemps et al. (2001)
(YSOs) and Stanke et al. (2005) (prestellar cores), the Serpens
data are from Kaas et al. (2004) (Class 1/2),
Hurt & Barsony (1996), and Froebrich (2005) (Class 0), the Taurus data are
taken from Hartmann (2002) (YSOs) and Lee & Myers (1999) (prestellar cores).
The data of IC 348 are taken from Luhman et al. (2003) and those of Chamaeleon I
from Cámbresy et al. (1998).
For further details on the compilation of the original sample see SKF05.
The numbers of objects are given in Table 1, the positions of all
YSOs are plotted in the upper panel of Fig. 1.
The adopted distances used to determine the absolute values of s and m are 140 pc
for
Oph (Bontemps et al. 2001), 260 pc for Serpens
(Kaas et al. 2004), 140 pc for Taurus (Hartmann 2002),
315 pc for IC 348 (Luhman et al. 2003), and 150 pc for Cha I
(Haikala et al. 2005).
We perform numerical simulations of the fragmentation and collapse of turbulent, self-gravitating gas clouds and the resulting formation and evolution of protostars as described in Schmeja & Klessen (2004, hereafter SK04). We use a code based on smoothed particle hydrodynamics (SPH; Monaghan 1992) in order to resolve large density contrasts and to follow the evolution over a long timescale. The code includes periodic boundary conditions (Klessen 1997) and sink particles (Bate et al. 1995) that replace high-density cores while keeping track of mass and linear and angular momentum. We determine the resolution limit of our SPH calculations using the Bate & Burkert (1997) criterion. This is sufficient for the highly nonlinear density fluctuations created by supersonic turbulence as confirmed by convergence studies with up to 107 SPH particles (Jappsen et al. 2005; Li et al. 2005).
Our simulations consist of two globally unstable models that contract
from Gaussian initial conditions without turbulence and of 22 models
where turbulence is maintained with constant rms Mach numbers ,
in the range
.
We distinguish between
turbulence that carries its energy mostly on large scales, at
wavenumbers
,
on intermediate scales, i.e.
,
and on small scales with
.
The naming of the
models, G1 and G2 for the Gaussian runs, and M
kk (with rms
Mach number
and wavenumber k) for the turbulent models,
follows SK04. Details of the individual
models are given in their Table 1.
The dynamical behaviour of isothermal self-gravitating gas is scale free
and depends only on the ratio
between internal energy and potential energy:
.
Since we are only interested in the positions of the young stars, and since
all the clustering parameters are normalised by the cluster radius, the physical
scaling is irrelevant for the present study.
The YSO classes are determined as follows (see SKF05 and Froebrich et al. 2005
for a detailed discussion): The beginning of Class 0 is
identified with the formation of the first hydrostatic core, when
the central object has a mass of about
(Larson 2003). The transition from Class 0 to Class 1 is
reached when the envelope mass is equal to the mass of the central
protostar (André et al. 2000).
We determine the transition from Class 1 to Class 2 when the optical
depth of the remaining envelope becomes unity at
(K-band).
The end of Class 0 and Class 1 corresponds to a mass of about 0.43
and 0.85 times the final mass, respectively (SKF05).
Lacking a feasible criterion to distinguish Class 2 from Class 3 objects,
we consider both classes combined.
Prestellar cores are identified by a clump-finding algorithm described in
Klessen & Burkert (2000; see also SKF05).
We only consider models with a numerical resolution of at least 200 000 particles.
Furthermore, in order to get reasonable numbers of protostars for the
statistics, we select only those models where more than 37 protostars with
(roughly corresponds to the detection limits of the observations)
are formed.
This reduces our set of models to 16. Again, see Table 1 of SK04
for further details.
We construct the MST following Prim's (1957) algorithm, as also described by
Gower & Ross (1969).
For both the observations and the models, the parameters ,
,
Q, and
are computed for
the different evolutionary classes independently (provided, there is a sufficiently large number of
YSOs of that class) as well as for the entire cluster.
In no region is the number of Class 0 sources large enough to be included.
In addition, these parameters are determined at
frequent timesteps of the simulations to obtain the temporal evolution of the parameters.
In the case of
,
this requires to construct the MST anew at every chosen timestep.
As an example, Fig. 2 shows the MST of one model at different evolutionary stages.
While the parameters
,
and Q are calculated using the normalisation factor given above,
,
and Q* are determined with the normalisation factor and cluster
radius of CW04 (see also the discussion in the Appendix).
As shown by CW04, the effect of binary stars on the clustering parameters
is not negligible.
Since binaries create very short edge lengths, a large fraction of binaries will
significantly reduce the mean edge length
(and as a consequence also change Q).
As the binaries are not part of the clustering regime, their influence
on the clustering parameters has to be minimized.
While for most of the clusters it is not relevant, Taurus is known to
have a large binary population.
Thus we removed the known binary companions from our sample of YSOs in Taurus.
![]() |
Figure 1:
Upper panel:
observational data for the star-forming clusters ![]() |
Open with DEXTER |
![]() |
Figure 2: The 2D minimum spanning tree for the YSOs of model M6k4a projected into the xy-plane at a star formation efficiency of 10%, 20%, 30% and 40% ( from left to right). |
Open with DEXTER |
We calculate the clustering parameters ,
,
Q, and
for the
model cluster in three dimensions, and, in order to compare them with the
apparent two-dimensional observational data, projected into the
xy-, xz-, and yz-plane.
The positions of the stars are corrected for the periodic boundary conditions.
Only objects inside a box of ten times the
side length of the original computational box are considered,
if there are any objects outside this volume, we assume that they
have left the cluster and are not relevant for the clustering process.
In the beginning stages of the cluster as a whole, and of the individual evolutionary classes, the number of objects is rather low. In such cases the analysis may not be statistically significant and has to be taken with a grain of salt.
![]() |
Figure 3:
The clustering parameters ![]() ![]() ![]() ![]() |
Open with DEXTER |
![]() |
Figure 4:
The temporal evolution of the 3D clustering parameters ![]() ![]() ![]() |
Open with DEXTER |
Table 1 lists the clustering measures introduced in Sect. 2 for the observations.
Columns 4 to 7 list the (non-normalised) mean separation of objects and mean MST edge lengths
in arcminutes and parsec. Columns 8 to 10 give the parameters ,
and Q, while
Cols. 11 to 13 list the same values calculated using the normalisation of CW04
(
,
,
Q*).
The last column gives the elongation
.
The values of
,
and Q* agree with the values given by CW04, except for
and
of Taurus
(interestingly, however, our Q* is the same as theirs).
We attribute small differences to the slightly different underlying samples, and
the discrepancy for Taurus to the different treatment of binaries.
Due to the different definition of the radius/area of the cluster,
and
differ significantly from
and
,
while Q and Q* are roughly
the same, in particular for large samples (see the discussion in Appendix B).
Taurus and Chamaeleon I have substructure, while
Oph and
IC 348 are centrally concentrated clusters (see the discussion in CW04).
Serpens (not discussed by CW04) has Q = 0.90, corresponding to a central
concentration with a radial density exponent
,
similar
to IC 348.
The linear distances s and m are in the same range for Oph,
Serpens, and IC 348, significantly larger for Chamaeleon I, and about an order
of magnitude larger in the case of Taurus, confirming the notion that Taurus
represents a somewhat less clustered mode of star formation.
However, when considering only the central part of the Taurus region, the
values decrease significantly to s = 3.05 pc and m = 0.49 pc.
The latter is comparable to the 0.3 pc estimated as the average distance
to the nearest stellar neighbour in the central region of Taurus by Gomez
et al. (1993) and Hartmann (2002).
The mean separation s increases with the evolutionary class for all three clusters investigated in this regard, reflecting the expansion of the cluster, which can also be seen in Fig. 1. While Class 0 protostars are formed in the high-density central regions, more evolved YSOs already had time to move to more remote regions (see also Kaas et al. 2004). Note that the prestellar cores do not fit into this sequence, as they are distributed over an area roughly as large as the entire cluster. Thus we speculate that not all objects identified as prestellar cores will eventually form stars. Only in the central parts of the cluster may the density be high enough to make the cores collapse. This is consistent with the findings of Vázquez-Semadeni et al. (2005) that a significant number of "failed cores'' should exist, which may redisperse and which may correspond to the observed starless cores.
The elongation of the clusters ranges from
for the almost perfectly
spherical cluster IC 348 to
for Serpens. The
value can
differ significantly for subclusters, e.g. the filamentary central part of the
Taurus region has an elongation of
.
We compute the clustering parameters ,
,
Q, and
for all models and compare
them with each other as well as with those from the observations.
Figure 3 shows the clustering parameters for all models, sorted by
the Mach number at a star formation efficiency (SFE) of 10% and 40%.
The clustering parameters of the observed clusters (from Table 1)
are shown as horizontal lines at their Mach number (taken from
SKF05).
The
and
values of the models are in general significantly larger than
those from the observations.
A particularly large discrepancy is noted when the model cluster is strongly elongated.
A large
value reduces the normalisation factor and increases
and
.
While the observed clusters show rather moderate elongations in the range
,
in some models the stars form in a single filament with an elongation
.
Model clusters with elongations in the range of the observations show
also a good agreement in the other parameters. For example, the fairly spherical
cluster of model M6k4a (shown in Fig. 2) has (at 40% SFE and
projected into the xy-plane) an elongation of
and the clustering
parameters
,
,
and Q = 0.89. These values
are almost identical to those of
Ophiuchi (
,
,
,
Q = 0.85).
For the other projections the values differ by less than 8%.
Most of the Q values, which are independent of the area, lie in the
same range as those from the observations.
We find no correlation of the clustering parameters with the properties of the turbulent
driving (Mach number or wave number k) of the models.
Neither do the values from the observations show any correlation with the Mach number. Thus we conclude
that if there is any systematic influence of the turbulent environment on the clustering behaviour,
it is only existent in the earliest phase of cluster formation, before it is smoothed out
by the motions of the individual protostars (see also Bate et al. 1998).
![]() |
Figure 5:
The temporal evolution of the clustering parameters ![]() ![]() ![]() |
Open with DEXTER |
![]() |
Figure 6:
The MST of the Taurus cluster as a whole ( left panel) and separated at
![]() ![]() ![]() |
Open with DEXTER |
Figure 2 shows the MST of the model cluster M6k4a at different
evolutionary stages and reveals the expansion of the cluster.
We analyse the temporal evolution of the clustering parameters: (t),
(t), Q(t),
in all models. As an example, Fig. 4 shows this sequence for the said model M6k4a.
The general behaviour is the same for all models:
and
decline slightly with time, while Q increases slowly or stays at a roughly
constant value. This evolution is shown by the entire cluster as well as by the individual
classes, although in later stages the latter values might fall to zero as the number of
objects in a particular class becomes zero.
Decreasing
and
values indicate that star formation sets in in different,
rather dispersed regions of the cloud. The cluster becomes denser as more and more gas
is turned into protostars.
New stars are formed faster than the cluster expands.
According to the Q values, the cluster evolves slowly from fractal subclustering
to a more centrally concentrated cluster
although in no model does
rise significantly above the "divding value'' of 0.8.
This again shows that the cluster builds up from separate groups which will grow into
a single, more centrally concentrated cluster, as also found by Bonnell et al. (2003) and Clark et al. (2005).
![]() |
Figure 7:
The clustering parameters ![]() ![]() ![]() |
Open with DEXTER |
Looking at the two-dimensional projections of the 3D model clusters does not significantly
change the picture as a whole. The individual ,
and Q values can
indeed differ for the projection into the xy, xz, and yz plane, but the
qualitative behaviour of the evolution is more or less the same, independent of
the projection (Fig. 5).
While the
and
values usually are higher than the values of the
projections,
tends to be lower than the values of the projections.
The investigation of several hundred randomly created clusters shows that
is always expected to be larger than the value for the projections
,
while
the
values can be in the range
with a mean
value of 1.1.
In the extreme case,
can differ by up to
from the 2D value.
Note that the physical interpretation of Q as given by CW04 is based
on the two-dimensional analysis. Therefore, for an interpretation of the numerical
values (and not only the trend) of Q, the projected values have to be used.
The elongation measure ,
on the other hand, depends strongly on the projection.
Obviously, an elongated, filament-like structure seen from the side
will look spherical when observed along its major axis.
In the case of the models the three-dimensional value of
can be used to
describe the true shape of the cluster.
A large scatter in the
values means a high
value
and vice versa.
We separate the observed clusters by succesively removing all MST edges with lengths
l larger than
,
,
,
and
,
where
denotes the standard deviation
of the mean edge length of the MST.
This process is demonstrated for the Taurus star-forming region in
Fig. 6.
When we compare all three regions with sufficient data, we see that for
Serpens and Taurus only solitary stars in the outskirts of the
cluster are removed, while the central part of the cluster stays connected.
However, the more homogeneous
Oph cluster
breaks down in two roughly equally large clusters at the last step.
In Fig. 7 we show the clustering parameters of the largest
remaining subcluster (i.e., the one containing the largest number of stars after
each level of separation).
The parameter Q is less affected by the separating procedure than
and
.
Significant changes are only seen when large parts of the cluster are excluded
(as in the step from 1 to
for
Oph, when the cluster breaks up
into two large subclusters).
The elongation measure
varies significantly with the level of separation.
For example, the remaining subcluster in Taurus shows a filamentary structure
and thus a larger
(
)
than the cluster as a whole.
We show that the normalised mean correlation length
and the mean edge length
of the minimum spanning tree
,
and in particular the combination of both
parameters, Q, as proposed by CW04,
are very useful tools to study the structures of star clusters, both observed
ones and those from numerical simulations.
We refine the definiton of the cluster area by using the normalised convex hull
rather than a circular or rectangular area around the objects.
Unlike
and
the parameter Q is independent of the definiton of the area
of the cluster. In addition, it is less affected when removing stars from the cluster
by separating the MST.
We introduce a new measure
for the elongation of a cluster. It is
defined as the ratio of the cluster radius determined by an enclosing
circle to the cluster radius derived from the normalised convex
hull. This is a stable statistical measure not influenced by fractal
substructre, which could also be applied to the filamentary structure of
molecular clouds.
The mean separation s increases with the evolutionary class, reflecting
the expansion of the cluster. The prestellar cores do not follow that sequence,
leading us to the speculation that not all objects classified as prestellar
cores will eventually form stars.
The clustering values of the models lie roughly in the same range as those
from observed clusters.
A particularly good agreement is reached when the clusters have similar elongation
values .
No correlation with the Mach number or the wave number of the turbulence
is found. We conclude that possible influences of the turbulent environment
on the clustering behaviour are quickly smoothed out by the velocity dispersion
of the young stars.
The temporal evolution of the clustering parameters shows that the star cluster builds up from several subclusters and evolves to a more centrally concentrated state. New stars are formed faster than the cluster expands. The projection of the 3D models into a 2D plane changes the clustering parameters, but not the general behaviour with time.
Acknowledgements
This work is funded by the Emmy Noether Programme of the Deutsche Forschungsgemeinschaft (grant No. KL1358/1). We are very grateful to Roland Gredel, Thomas Stanke, Michael Smith and Tigran Khanzadyan for providing us with theirOph data prior to publication and to Lee Hartmann for sending us his data of the Taurus cloud. We wish to thank Dan Kushnir and Spyridon Kitsionas for valuable discussions and the referee, Anthony Whitworth, for his prompt report. S.S. acknowledges the hospitality of the Institute for Pure and Applied Mathematics, University of California, Los Angeles, during part of this work.
For the normalisation of the parameters
and
the radius and the
area (or volume) of the cluster are needed (see Appendix B).
Since a star cluster has no well-defined natural boundary, different
approaches to determine the cluster area have been used.
CW04 define the cluster radius as the distance
between the mean position of all cluster members and the most distant star and
the area as a circle with the cluster radius:
.
Adami & Mazure (1999) define the area used for the normalisation of the MST
edge lengths as the maximum rectangle of the point set.
However, both methods tend to significantly overestimate the area of the cluster, in particular
if the cluster is elongated or irregularly shaped rather than spherical.
Therefore, we estimate the area A of the cluster using the convex hull
of the data points, normalised by an additional geometrical factor taking
into account the ratio of the number of objects inside and on the convex hull:
![]() |
Figure A.1: Randomly distributed data points and the area they span according to different definitions (circle, rectangle, convex hull). |
Figure A.1 demonstrates that the definition of the cluster area
is crucial, since it can differ by a factor of two or more. In the given example
of randomly distributed data points,
the area of the circle is 46.33, that of the rectangle is 30.40, the
area of the convex hull is 24.19, and the normalised area (Eq. (A.1))
is 24.83.
For the calculation of the parameter
the size
of the radius/area is irrelevant (as long as the area used for the normalisation of
depends on the radius used for the normalisation of
or vice versa).
The radius is cancelled since it is used to normalise both,
and
.
The edge lengths of a minimum spanning tree depend on the number of points and on the area,
therefore the mean edge length m has to be normalised, in order to compare the results
from samples with different numbers of objects and/or different areas.
Beardwood et al. (1959) show that the length of the shortest closed path through n points
l(Pn) in a plane region of area A is asymptotically proportional to
for large n.
Since the number of edges in a MST is (n-1), Hoffman & Jain (1983) and CW04
claim that the expected length of a randomly selected edge of a MST is asymptotically proportional
to
In the three-dimensional case the general result for k dimensions of Beardwood et al. (1959) (their Eq. (2)) can be written as
,
leading to the factor
![]() |
(B.4) |
![]() |
(B.5) |
![]() |
Figure B.1:
Mean edge lengths ![]() ![]() |
Note that we use different definitions of the radius and the area from CW04
(Appendix A), resulting in an additional difference between our
and their parameters
and
.
However, in the calculation of Q the radius contained in the normalisation factor
is cancelled out, so the difference in the Q parameters boils down to
Eq. (B.3).
Unlike
and
,
the more relevant parameter Qis the same for a large number of objects, independent of the normalisation method.
To allow comparison, we list the parameters computed using the normalisation factor
and the cluster radius according to CW04 in Table 1 as well.
![]() |
Figure C.1:
The relation between the axis ratio a/b of an elliptical
area and the elongation ![]() ![]() |
We notice that some of the model clusters are strongly elongated, causing a large difference
in the cluster area depending on whether it is defined by the enclosing circle or by
the normalised convex hull.
We use this fact to propose a new, statistically stable characterisation of the
elongation of a cluster.
We define the elongation measure
as the ratio
![]() |
(C.1) |
In order to test if
is indeed a good measure for the elongation we place 350
points randomly on an elliptical area with increasing axis ratio (
).
To minimise the statistical scatter we perform 500 different realisations for each
a/b and determine the mean
and its standard deviation for each step.
Figure C.1 shows that
increases with the axis ratio a/b.
For a/b = 1 (i.e., a circle),
,
as expected.