Free Access

This article has an erratum: [https://doi.org/10.1051/0004-6361/200811212e]


Issue
A&A
Volume 494, Number 2, February I 2009
Page(s) 539 - 551
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361:200811212
Published online 04 December 2008

The mass function of young star clusters in spiral galaxies

S. S. Larsen

Astronomical Institute, University of Utrecht, Princetonplein 5, 3584 CC Utrecht, The Netherlands

Received 22 October 2008 / Accepted 20 November 2008

Abstract
Aims. The initial cluster mass function (ICMF) in spiral discs is constrained and compared with data for old globular clusters and young clusters in starbursts.
Methods. For a given absolute magnitude, the cluster age distribution depends on the ICMF. Here, the behaviour of the median age-magnitude relation is analysed in detail for Schechter ICMFs with various cut-off masses, $M_{\rm c}$. The calculated relations are compared with observations of the brightest clusters in spiral galaxies. Schechter functions are also fitted directly to observed mass functions (MFs).
Results. A single Schechter ICMF with an $M_{\rm c}$ of a few times 10 $^5~M_{\odot}$ can reproduce the observed ages and luminosities of the brightest (and 5th brightest) clusters in the spirals if disruption of optically visible clusters is dominated by relatively slow secular evolution. A Schechter function fit to the combined cluster MF for all spirals in the sample yields $M_{\rm c} = (2.1\pm0.4)\times10^5~M_{\odot}$. The MFs in cluster-poor and cluster-rich spirals are statistically indistinguishable. An $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter function also fits the MF of young clusters in the Large Magellanic Cloud. If the same ICMF applies in the Milky Way, a bound cluster with $M>10^5~M_{\odot}$ will form about once every 107 years, while an $M>10^6~M_{\odot}$ cluster will form only once every 50 Gyr. Luminosity functions (LFs) of model cluster populations drawn from an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter ICMF generally agree with LFs observed in spiral galaxies.
Conclusions. The ICMF in present-day spiral discs can be modelled as a Schechter function with $M_{\rm c}\approx2\times10^5~M_{\odot}$. However, the presence of significant numbers of $M>10^6~M_{\odot}$ (and even $M>10^7~M_{\odot}$) clusters in some starburst galaxies makes it unlikely that the $M_{\rm c}$ value derived for spirals is universal. In high-pressure environments, such as those created by complex gas kinematics and feedback in mergers, $M_{\rm c}$ can shift to higher masses than in quiescent discs.

Key words: Galaxy: open clusters and associations: general - galaxies: star clusters - galaxies: spiral

1 Introduction

Star clusters connect many branches of astrophysics. Classically, they have been important test benches for models of stellar evolution (Maeder & Mermilliod 1981). A large fraction of all star formation occurs in clusters (Lada & Lada 2003), and massive, long-lived globular clusters (GCs) are potentially useful tracers of stellar populations in their parent galaxies (Brodie & Strader 2006). It is therefore of considerable interest to understand how the properties of star clusters are related to those of their parent galaxy, and in particular whether the formation (and survival) of GC-like objects requires special conditions. Three important properties of cluster populations which bear on this issue can, to some extent, be studied separately: 1) the initial cluster mass function (ICMF); 2) the formation efficiency of (bound) star clusters (Parmentier & Fritze 2009; Larsen & Richtler 2000; Bastian 2008); and 3) dynamical evolution and cluster disruption, addressed both theoretically and observationally in many studies (McLaughlin & Fall 2008; Whitmore et al. 2007; Gieles & Baumgardt 2008; Baumgardt & Makino 2003; Lamers et al. 2005; Vesperini & Zepf 2003). All three may, in principle, depend on time and location within a galaxy. This paper is concerned with the mass function, primarily its high-mass end and the possible existence of an upper cut-off mass.

Highly luminous, compact star clusters are observed to form in on-going starbursts such as those associated with major gas-rich mergers and, in smaller numbers, in many other external galaxies with elevated star formation rates (e.g., Larsen 2006). However, conversion of observed cluster luminosities to masses requires knowledge of the mass-to-light ratios which, in turn, are strongly age-dependent. Dynamical mass estimates via integrated-light measurements of velocity dispersions have been obtained for only a limited number of clusters, but have confirmed that young clusters with masses above $10^5~M_\odot$ (and, in some cases, $10^6~M_{\odot}$) are quite common in external star-forming galaxies (de Grijs & Parmentier 2007; McCrady & Graham 2007; Larsen et al. 2004; Ho & Filippenko 1996; Smith & Gallagher 2001; Larsen & Richtler 2004; Larsen et al. 2001). On the other hand, most young star clusters known in the Milky Way are relatively sparse with masses of $10^2{-}10^4~M_\odot$. This apparent contrast between the properties of Milky Way open clusters and the luminous star clusters encountered in some external galaxies raises important questions about the universality of the cluster formation process. Do the most massive clusters form preferentially in starburst environments for some physical reason, or do they just happen to be statistically more likely to form there because the total numbers of stars and clusters formed are very large?

Most studies of the statistical properties of star cluster populations have concentrated on luminosity functions (LFs), which are more readily derived for large samples of clusters than mass functions (MFs). There is a roughly linear relation between the luminosity of the brightest cluster in a galaxy and the total number of clusters, as expected if luminosities are drawn at random from a power-law distribution with slope $\approx$-2 (Weidner et al. 2004; Whitmore et al. 2007; Billett et al. 2002; Whitmore 2003; Larsen 2002). This implies that if the LF has a physical upper limit, no cluster system studied to date is sufficiently rich to sample the LF up to that limit, and the (young) cluster populations in most galaxies might in fact be drawn from very similar underlying LFs.

Clearly, the MF would be a physically more interesting property to study than the LF. The LF generally depends on both the mass- and age distribution of the cluster sample in question (e.g. Fall 2006), and since the latter is often unknown, the observed LFs offer only limited insight into the underlying MFs. If the goal is to obtain information about the initial MF, disruption effects must also be taken into account. In the small number of cases where MFs have been determined, they are generally found to be consistent with power-laws ${\rm d}N/{\rm d}M \propto M^\alpha$ with a slope $\alpha \approx -2$ (McCrady & Graham 2007; Dowell et al. 2008; Bik et al. 2003; Zhang & Fall 1999; de Grijs et al. 2003). However, the mass ranges differ from one study to another (e.g. $2.5\times10^3 < M/M_{\odot} < 5\times10^4$ for the Bik et al. study of NGC 5194, and $10^4 < M/M_{\odot} < 10^6$ for the Antennae clusters studied by Zhang & Fall), so it is not clear that the similarity of the slopes derived for different galaxies actually implies a universal ICMF.

Even if the MF can typically be fit by a power-law, this might only apply over a limited mass range. One potential difficulty is the large number of very massive clusters predicted in the Milky Way for a uniform power-law MF. For a star formation rate in bound star clusters of $5.2\times10^{-10}~M_{\odot}$ yr-1 pc-2 in the Solar neighbourhood (Lamers et al. 2005, hereafter LAM05), the Milky Way disc would have formed about 100 clusters with $M>10^6~M_{\odot}$ over a 10 Gyr lifetime, assuming that the cluster formation rate was constant in the past (probably a conservative assumption; e.g. Fuchs et al. 2009). These are not observed. Since the completeness of existing surveys of young Milky Way clusters remains extremely poorly quantified it cannot be excluded that some remain hidden behind massive extinction, and others may have dissolved (Sect. 2.2). Another possibility is that they simply never formed, or at least in smaller numbers than predicted by extrapolation of a pure power-law MF.

An upper limit to the ICMF need not be a strict cut-off, but could also take the form of an exponential decline above some characteristic mass $M_{\rm c}$, i.e. a Schechter (1976) function:

\begin{displaymath}\frac{{\rm d}N}{{\rm d}M} \propto (M/M_{\rm c})^\alpha \exp \left(-M/M_{\rm c}\right).
\end{displaymath} (1)

According to Bastian (2008), the relation between the luminosity of the brightest cluster in a galaxy and the star formation rate (SFR) can be reproduced if clusters in all galaxies are drawn from a Schechter function with $M_{\rm c} = (1{-}5)\times10^6~M_{\odot}$. More indirect evidence of a truncation or steepening of the MF in young cluster systems comes from the detection of a bend in the LF of clusters in NGC 5194 (M 51), NGC 6946 and the Antennae galaxies (NGC 4038/39), which is consistent with a power-law MF truncated at a mass of $5\times10^5{-}10^6~M_{\odot}$ (Gieles et al. 2006a; Zhang & Fall 1999).

The aim of this paper is to examine, for a larger sample of ``normal'' spiral galaxies, the behaviour of the cluster MF at its high-mass end and constrain the upper mass limit, $M_{\rm c}$. The most direct way to do this would perhaps be to simply obtain luminosities and ages for a large sample of clusters in each galaxy and converting these to masses, ideally using Hubble Space Telescope (HST) imaging to obtain clean cluster samples. Although a large amount of data for nearby galaxies have by now accumulated in the HST archive, many datasets suffer from two basic shortcomings: 1) they lack (deep) imaging in a (roughly) U-band equivalent filter, which is essential for age-dating of the clusters; and 2) they cover only a small fraction of the galaxy. The latter limitation is particularly problematic when studying the most massive clusters, which tend to be rare. The only spiral which is sufficiently nearby to take advantage of HST's imaging resolution for cluster identification and is fully covered by multi-passband HST imaging including the ultraviolet is NGC 5194 (Scheepmaker et al. 2007; Hwang & Lee 2008), which is arguably not a ``normal'' spiral due to its on-going interaction with NGC 5195.

Instead of studying the complete MF in detail for each individual galaxy, we begin by taking an approach similar to that adopted in several previous studies of the LF: from the correlation between the magnitude of the brightest cluster and some estimate of the total cluster population, one can test whether the data are consistent with random sampling from a single, universal LF. Here, we go one step further and also attempt to draw conclusions about the MF from observations of the brightest (and 5th brightest) clusters. As will become clear below, the crucial extra information that makes this possible are the cluster ages, but because these are needed only for the brightest clusters, ground-based data are generally adequate.

The general outline of the paper is as follows: in Sect. 2 we determine the relation between the median age and luminosity of the brightest and 5th brightest clusters (or, in fact, any sample with magnitudes in a fixed, narrow range) for a given sample size, cut-off mass $M_{\rm c}$, and age distribution. This statistical age-luminosity relation should not be confused with the general fading of an individual star cluster that is the result of stellar evolution and cluster dissolution. The (log(age), $\mathcal{M}_V$) predictions are compared (in Sect. 2.5) with previously published ground-based data for clusters in a sample of nearby spiral galaxies in order to put constraints on $M_{\rm c}$. In Sect. 3, a direct Schechter function fit to observed MFs is performed and yields $M_{\rm c} = (2.1\pm0.4)\times10^5~M_{\odot}$. In Sect. 4 the properties of the LF for a cluster sample drawn from an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter MF are compared with observed LFs. In Sect. 5 the implications of an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter MF are discussed in the context of other cluster systems, and it is concluded that while such a MF is consistent with existing data for Milky Way and Large Magellanic Cloud clusters, it is unlikely to be universal. Finally, in Sect. 6 the results are discussed in the context of host galaxy properties and current views on star/cluster formation.

2 Constraining the upper limit of the MF via ages and luminosities of the brightest clusters

2.1 Uniform age distribution and a truncated power-law mass function

 \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f1.eps}
\end{figure} Figure 1:

Simulated absolute magnitude $\mathcal{M}_V$ versus log(age) plots for cluster systems of increasing richness. A power-law mass function truncated at $10^6~M_{\odot}$ is assumed.

Open with DEXTER

As a first illustration of the statistical behaviour of the upper mass- and luminosity boundaries of a cluster population, clusters with masses between $10^4~M_{\odot}$ and $10^6~M_{\odot}$ were drawn at random from a power-law mass distribution, ${\rm d}N/{\rm d}M \propto M^{-2}$. Each cluster was assigned a random age between 5 Myr and 1 Gyr. Luminosities corresponding to the age and mass of each cluster were calculated using solar-metallicity simple stellar population (SSP) models from Bruzual & Charlot (2003, hereafter BC03). These models will be used throughout this paper and assume a Salpeter-like ( ${\rm d}N/{\rm d}M \propto M^{-2.35}$) stellar initial mass function (IMF) for $0.1 < M/M_{\odot} < 100$, which probably overestimates the number of low-mass stars. For more realistic Kroupa (2002) or Chabrier (2003) IMFs, the mass-to-light ratios (and corresponding masses quoted in this paper) would be smaller by a factor of about 1.5.

Figure 1 shows the resulting (log(age), $\mathcal{M}_V$) plots for samples of N=102, 103, 104 and 105 clusters (note the use of calligraphic letters $\mathcal{M}_V$ in this paper to distinguish absolute magnitudes from masses). The dashed line in each panel is the fading line of a $10^6~M_{\odot}$ cluster, the maximum possible brightness at a given age for the truncated power-law MF assumed here. For a Schechter function, there would be no strict upper limit. While the effects of cluster disruption are ignored in this plot, it serves to demonstrate a couple of important points. First, although the mass function is sampled up to near the cut-off mass in all four cases, this is not the case for the luminosity function. The brightest $\mathcal{M}_V$ magnitude possible for a 106 $M_\odot$ cluster is $\mathcal{M}_V^{\rm max} = -14.5$, but only the two most populous samples get close to this value. In other words, the maximum cluster mass in each sample is close to the physical limit set by the MF, but the maximum brightness is determined by random sampling. This holds true over several orders of magnitude in N. Second, the brightest cluster tends to be younger in the more populous cluster systems. For a poor cluster system, rapid fading reduces the probability that a very massive cluster happens to be caught in the short time interval after its birth when it is most luminous. As the number of clusters is increased, it becomes increasingly likely to encounter a young cluster with a mass near the physical upper limit and a luminosity that cannot be matched by an older object. Thus, the mean or median age of the brightest cluster shifts towards younger values.

The specific age-luminosity relation for the brightest cluster will depend on the actual upper mass limit, whether this is a strict cut-off as in Fig. 1 or the $M_{\rm c}$ of a Schechter function. It therefore potentially offers a diagnostic of the upper limit. This will be discussed in further detail below (Sect. 2.3).

2.2 Cluster disruption

The simple experiment described in the previous paragraph suffers from a number of shortcomings. In particular, the relation between the age and magnitude of the brightest cluster will depend on the age distribution of the cluster population. This will, in general, be affected by variations in the star (cluster) formation rate as well as by disruption.

A typical open cluster in the Milky Way will gradually evaporate and dissolve completely within a few times 108 years (Wielen 1971), due to the combined effects of stellar evolution, internal two-body relaxation and external perturbations (Terlevich 1987; Spitzer 1987; Lamers & Gieles 2008; Spitzer 1958). This ``secular evolution'' (Parmentier & Fritze 2009) causes a cluster to lose mass at a (roughly) constant rate, so that lifetimes are in general expected to scale with cluster mass. The exact relation between lifetime and mass will also depend on the structural parameters of the clusters, the relation between half-mass radius and mass, and the environment in which they find themselves. According to LAM05, a useful fitting formula is

\begin{displaymath}t_{\rm dis} = t_4 \left(\frac{M_{\rm i}}{10^4~M_{\odot}}\right)^\gamma
\end{displaymath} (2)

where the parameter $\gamma$ has a value of about 0.62 and $t_{\rm dis}$ is the disruption time scale of a cluster with initial mass $M_{\rm i}$. For the Milky Way, LAM05 find a disruption time scale for a $10^4~M_{\odot}$ cluster of $t_4 = 1.3\times10^9$ years, so that extrapolation of Eq. (2) leads to a lifetime of $\sim$5 Gyr for a $10^5~M_\odot$ cluster and $\sim$23 Gyr for a $10^6~M_{\odot}$ cluster. Thus, the most massive clusters are expected to survive for a significant fraction of the age of the Universe in the Galactic disc. It should be noted that a $\gamma$-value closer to unity is required to reproduce the MF of old GCs as a result of dynamical evolution from an initial power-law or Schechter function (e.g. Jordán et al. 2007). It therefore remains uncertain whether a universal $\gamma$ value applies everywhere, but for our purpose the exact numerical value is not important.

Many clusters are likely born out of virial equilibrium, or are forced into a state of non-equilibrium once residual gas from the star formation process is expelled. During the ensuing violent relaxation, clusters may disrupt completely or partially and undergo a phase of ``infant mortality'' or ``infant weight loss'' which lasts up to a few tens of Myr (Whitmore et al. 2007; Goodwin & Bastian 2006; Kroupa & Boily 2002; Lada & Lada 2003). Unlike the secular dissolution discussed above, this process is believed to affect clusters of all masses more or less equally. In the Milky Way, the Magellanic Clouds, and possibly several other galaxies, mass-independent disruption seems to be relatively unimportant for non-embedded clusters older than $\sim$10 Myr (Gieles & Bastian 2008; Lamers et al. 2005; Gieles et al. 2007; de Grijs & Goodwin 2008, but see Chandar et al. 2006 for an alternative view regarding the SMC). For the Antennae galaxies, mass-independent disruption instead seems to lead to the loss of 80-90% of the cluster population per logarithmic age bin for the first $\sim$ 108-109 years (Fall et al. 2005; Whitmore et al. 2007).

For secular dissolution the conclusions drawn from Fig. 1 would not change dramatically unless the disruption time scale is much shorter than in the Milky Way. Even for a disruption time as short as t4=200 Myr, as derived for the inner disc of NGC 5194 (Gieles et al. 2005), a $10^5~M_\odot$ cluster would still survive for about 800 Myr (although its mass would be steadily decreasing over this time span). Mass-independent disruption, on the other hand, can have a strong impact on the age distribution of any subsample. In the following, both mass-dependent and mass-independent disruption will be considered.

2.3 Including cluster disruption and Schechter mass functions: the set-up

 \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f2.eps}
\end{figure} Figure 2:

Absolute $\mathcal{M}_V$ magnitude versus age for the brightest cluster when sampling N clusters at random from a Schechter mass function with $M_{\rm c} = 10^6~M_\odot$. For each N, a number of realisations are shown. The curve is the $\mathcal{M}_V$ magnitude vs. age for a $10^6~M_{\odot}$ cluster. No disruption is included.

Open with DEXTER

Figure 2 shows $\mathcal{M}_V$ vs. log(age) for the brightest cluster in random samples of N=103, 104, 105 and 106 clusters with $M>5000~M_{\odot}$ drawn from a Schechter mass function with $M_{\rm c} = 10^6~M_\odot$ and $\alpha =-2$. In this case, no cluster disruption was included and the age distribution was assumed to be uniform over 1010 years. As expected, $\mathcal{M}_V$ generally becomes brighter, and log(age) younger, when sampling more clusters. It is perhaps not quite intuitive that the $\mathcal{M}_V$ and age of the brightest cluster should be uncorrelated for fixed $M_{\rm c}$ and N. This is due to the fact that clusters are selected based on their luminosities, so that the scatter in log(age) in Fig. 2 simply represents the range of possible ages for a given $\mathcal{M}_V$. For small N the age distribution is very extended and the brightest cluster can be either young (and relatively low mass) or old and relatively massive. For large N the brightest cluster will generally be fairly young and massive. Also included in the figure is the BC03 model prediction for the $\mathcal{M}_V$ magnitude of a $10^6~M_{\odot}$ cluster as a function of age. Since the Schechter function does not cut off abruptly at $M_{\rm c}$, the brightest cluster will be more massive than $M_{\rm c}$ in some fraction of the realisations. This occurs more frequently in rich cluster systems where the high-mass tail of the Schechter function is more fully sampled over the entire age range, but can happen occasionally even in a cluster-poor galaxy. For the Monte-Carlo simulations shown in Fig. 2, the brightest cluster is more massive than $M_{\rm c}$ in 39% of the realisations for N=106, and in 7% of the cases for N=103. This again illustrates that while the luminosity of the brightest cluster is strongly dependent on N, this does not translate trivially to a similar scaling of the maximum cluster mass with N.

To quantify the effects of cluster disruption, three cases are considered:

  • Case A: no disruption;
  • Case B: no infant mortality, and secular evolution with t4 = 108 years and exponent $\gamma = 0.62$. The disruption time scale assumed here is deliberately chosen to be much shorter (a factor of 10) than derived for the Solar neighbourhood, so as to illustrate the effect of secular disruption in an extreme case. In this scenario, the mass M(t) at time t of a cluster with initial mass $M_{\rm i}$ can be approximated by

    \begin{displaymath}M(t) \simeq M_{\rm i} \left[ (\mu_{\rm ev}(t))^\gamma - \frac...
...frac{10^4~M_\odot}{M_{\rm i}}\right)^\gamma\right]^{1/\gamma}
\end{displaymath} (3)

    where $\mu_{\rm ev}(t)$ indicates the fraction of mass left in the cluster after accounting for mass loss due to stellar evolution (LAM05). We use the parameterisation of $\mu_{\rm ev}(t)$ given by LAM05 for Solar metallicity. Eq. (3) can be inverted to

    \begin{displaymath}M_{\rm i} = 10^4~M_{\odot} \left[\left(\frac{M(t)}{10^4~M_{\o...
... \frac{\gamma t}{t_4}\right]^{1/\gamma} \mu_{\rm ev}(t)^{-1};
\end{displaymath} (4)

  • Case C: infant mortality leading to the loss of $\rm IMR=90$% of clusters per log(age) interval until $\tau_{\rm im,max} = 10^8$ years and no secular evolution. For a mass-limited sample, the surviving fraction of clusters $f_{\rm surv}$ will be

    \begin{displaymath}f_{\rm surv} = \left\{ \begin{array}{ll}
1 & \mbox{for $\tau...
...mbox{for $\tau \ge \tau_{\rm im,max}$ }
\end{array} \right.
\end{displaymath} (5)

    where $\tau_0$ marks the onset of infant mortality. We assume $\tau_0 = 5\times10^6$ yr so $f_{\rm surv} = \left[\tau/(5\times10^6 ~ {\rm years})\right]^{-1}$ for the first 108 years and $f_{\rm surv} = 0.05$ afterwards (Whitmore et al. 2007).
Monte-Carlo simulations similar to those shown in Fig. 2 could, in principle, be used to determine how $\mathcal{M}_V$ and log(age) for the brightest cluster depend on $M_{\rm c}$ and N for each disruption scenario. However, this can also be established more generally by considering the distribution of cluster ages corresponding to a particular absolute magnitude, $\mathcal{M}_V$, or luminosity LV. For a sample of clusters with fixed age $\tau$, the luminosity function will be

\begin{displaymath}\left. \frac{{\rm d}N}{{\rm d}L_V}\right\vert _\tau = \frac{{...
...{\rm d}M} \left. \frac{{\rm d}M}{{\rm d}L_V} \right\vert _\tau
\end{displaymath} (6)

where here $M(L_V,\tau)$ is the mass of a cluster with age $\tau$ and (V-band) luminosity LV. The first factor on the right-hand side is just the mass function of clusters with age $\tau$ and the second factor is the mass-to-light ratio, $\Upsilon(\tau) \equiv M(L_V,\tau)/L_V$. If there is no disruption, ${\rm d}N/{\rm d}M$ is the ICMF, while secular dissolution can be included by replacing the current mass $M(L_V,\tau)$ with the initial mass $M_{\rm i}(L_V,\tau)$ (Eq. (4)) in the ICMF. For a Schechter ICMF, the distribution of cluster numbers vs. age $\tau$ and luminosity LV is then

\begin{displaymath}\frac{{\rm d}^2N}{{\rm d}\tau~{\rm d}L_V} \!=\! \left(\frac{M...
...right)\! \Upsilon(\tau) ~ {\rm CFR}(\tau) ~ f_{\rm surv}(\tau)
\end{displaymath} (7)

where CFR is the cluster formation rate (of bound clusters) and  $f_{\rm surv}(\tau)$ is the survival fraction after applying mass-independent disruption. It is assumed that $\Upsilon$ only depends on $\tau$ so that it can be computed from standard SSP models, but a mass dependency (e.g. Kruijssen 2008) could in principle also be included.
 \begin{figure}
\par\includegraphics[width=7.8cm,clip]{fig3em} %\end{figure} Figure 3:

Age distribution of the brightest cluster for $M_{\rm c} = 10^6~M_\odot$ and N=104. The histogram is for the N=104 Monte-Carlo simulation in Fig. 2 while the solid (blue) curve shows the exact relation according to Eq. (7) for $\mathcal{M}_V=-10.3$ mag.

Open with DEXTER

Figure 3 compares the age distribution for the Monte-Carlo simulation for $M_{\rm c} = 10^6~M_\odot$ and N=104 (Fig. 2) with the exact expression (7), scaled to match the Monte-Carlo data. We have used $\mathcal{M}_V=-10.3$, which is the median absolute magnitude of the brightest cluster for this $M_{\rm c}$ and N. The Monte-Carlo simulations and exact expression show excellent agreement within the Poisson errors. From Eq. (7) it is straight forward to calculate statistics such as the mean or median age of clusters with a given luminosity, as well as the standard deviation, mean absolute deviation, etc. These statistics will apply to any cluster sample selected by luminosity, i.e. not only the brightest cluster, but also the 5th brightest, or clusters with a fixed magnitude. In addition, the integral over all $\tau$ gives the luminosity function,  ${\rm d}N/{\rm d}L_V$.

2.4 Predicted median (log(age), $\mathcal{M}_V$) relations for the different disruption scenarios

The median (log(age), $\mathcal{M}_V$) relations for the three disruption scenarios are shown in Fig. 4. The relations have been calculated for Schechter functions with $\alpha =-2$ and $M_{\rm c}=10^4~M_{\odot}$, $10^5~M_\odot$, $10^6~M_{\odot}$ and $10^7~M_{\odot}$ (solid lines) as well as for $M_{\rm c}=3\times10^5~M_{\odot}$ and $M_{\rm c}=10^{15}~M_{\odot}$ (dotted lines). The latter is essentially an untruncated power-law with $\alpha =-2$. An age range of 5 Myr-10 Gyr and a constant CFR are assumed in all cases. In addition to the constant-$M_{\rm c}$ curves, lines for constant number of clusters (n6) brighter than $\mathcal{M}_V=-6$ are drawn with (red) dashed lines. Note the use of lower-case n to distinguish this from the number of clusters above a particular mass limit, N, which is more difficult to relate directly to observations. The constant-n6 curves were determined by first finding a normalisation constant n0 for the luminosity functions so that

\begin{displaymath}n_6 = n_0 ~ \int^{\mathcal{M}_V=-6}_{\mathcal{M}_V=-\infty} \frac{{\rm d}N}{{\rm d}\mathcal{M}_V} ~ {\rm d}\mathcal{M}_V
\end{displaymath} (8)

and then solving

\begin{displaymath}n_{\mathcal{M}_V < \rm br} = n_0 \int_{\mathcal{M}_V=-\infty}...
...} \frac{{\rm d}N}{{\rm d}\mathcal{M}_V} ~ {\rm d}\mathcal{M}_V
\end{displaymath} (9)

for the magnitude of the brightest cluster, $\mathcal{M}_{V,n_6}^{\rm br}$ and

\begin{displaymath}n_{\mathcal{M}_V < \rm 5th} = n_0 \int_{\mathcal{M}_V=-\infty...
...} \frac{{\rm d}N}{{\rm d}\mathcal{M}_V} ~ {\rm d}\mathcal{M}_V
\end{displaymath} (10)

for $\mathcal{M}_{V,n_6}^{\rm 5th}$, the magnitude of the 5th brightest cluster. The constants on the left-hand sides of Eq. (9) and Eq. (10) were found by comparison with Monte-Carlo experiments where n6 clusters were sampled at random from the LFs. Based on 1000 such samples for each of a variety of disruption scenarios, $M_{\rm c}$ and n6 values, $n_{\mathcal{M}_V<\rm br}=0.75$ and $n_{\mathcal{M}_V<\rm 5th}=5$ were found to be suitable values. With these constants, Eqs. (9) and (10) reproduced the median magnitudes of the brightest and 5th brightest clusters within better than 0.1 mag. The Monte-Carlo experiments were also used to estimate the vertical error bars in Fig. 4, which indicate the median absolute deviation of the brightest and 5th brightest cluster magnitudes for fixed n6 and $M_{\rm c}$. Note that the scatter in $\mathcal{M}_V$ is significantly smaller for the 5th brightest cluster. The horizontal error bars, indicating the median absolute deviation of $\tau$, were calculated directly by integrating Eq. (7) over $\tau$ to find the positive and negative age intervals with respect to the median that include half the clusters for a given $\mathcal{M}_V$. The sizes of the horizontal error bars are given simply by the shape of the age distribution, and are therefore identical for the brightest and 5th brightest clusters. Because the constant-$M_{\rm c}$ curves are fixed by the age distribution of clusters for a given magnitude, the model relations in the upper and lower panels differ only in the constant-n6 (dashed) curves. These are shifted to fainter magnitudes in the lower panels, in accordance with the median magnitude difference between the brightest and 5th brightest cluster for each ($M_{\rm c}$, n6) combination.

In the special case where there is no disruption and the MF is a uniform power-law, the age distribution is luminosity-independent. In this case, the median age of the brightest cluster (or any other luminosity-selected sample) will also be independent of luminosity and n6, as indicated by the vertical dashed line for the Case A scenario in Fig. 4. This is true regardlessly of the actual value of the MF slope $\alpha$, although the specific median age will depend on $\alpha$. For Schechter MFs with a finite $M_{\rm c}$, the median age will instead decrease as n6 and the luminosity of the brightest cluster increase, as discussed above and illustrated by the solid lines in the Case A panels. The introduction of secular dissolution does not strongly affect the age distribution of the brightest clusters in cluster-rich systems, where these clusters will tend to be massive and long-lived. Hence, the Case A and Case B scenarios resemble each other for large n6. However, for very cluster-poor systems secular dissolution starts to have a noticeable effect on the age distribution of even the brightest clusters, and the median age shifts back towards younger ages as n6 becomes small. For the Case C scenario the age distribution is, per definition, deficient in older clusters of all masses. In this case the median age shifts to younger values for all n6, albeit with a tail towards older ages.

For Case A and Case B, Fig. 4 shows a clear separation between the curves for different $M_{\rm c}$, suggesting that this type of diagram may indeed be used as a diagnostic tool to constrain $M_{\rm c}$. However, it is equally clear that much of this diagnostic power is lost for cluster samples that are dominated by a high degree of mass independent disruption, as in Case C. Due to the large scatter, one should not assign much significance to the location of any particular galaxy in Fig. 4, but for large galaxy samples it is still possible to draw conclusions about $M_{\rm c}$ in a statistical sense. The inconvenience of having to include a large number of galaxies is counterbalanced by the fact that only the brightest (or few brightest) clusters have to be identified and age-dated. Furthermore, it is not a strict requirement that the entire galaxy is covered; any sub-sample of clusters can equally well be compared with the grids, as long as the sub-sample can be assumed to be representative of the global cluster population in a galaxy. Of course, this assumption may not always be justified.

 \begin{figure}
\par\includegraphics[width=16cm,clip]{1212f4.eps}
\end{figure} Figure 4:

The median absolute magnitude of the brightest ( top) and 5th brightest ( bottom) cluster versus median age. Results are shown for Schechter mass functions with $M_{\rm c} = 10^4$, 105, 106 and $10^7~M_{\odot}$ and $\alpha =-2$ (solid curves), as well as for $M_{\rm c}=3\times10^5~M_{\odot}$ and $M_{\rm c}=10^{15}~M_{\odot}$ (dotted curves). The (red) dashed lines are for constant total number of clusters (n6=101, 102, ..., 105) brighter than $\mathcal{M}_V=-6$. The error bars mark the median absolute deviation of log(age) and $\mathcal{M}_V$ for fixed $M_{\rm c}$ and n6. The triangles mark clusters in irregular galaxies while circles are for spirals (see Table 1).

Open with DEXTER

It is important to stress that the grids in Fig. 4 are only suitable for comparison with galaxies for which it is plausible that the cluster formation rate has been approximately constant over the past few Gyrs. More specifically, the cluster population should not be dominated by a single strong burst so galaxies which are currently undergoing strong bursts of star formation, or have done so in the past (e.g. merger remnants) would require a custom-made grid tailored for their specific star formation histories. However, variations in the cluster formation rate of a factor of a few are still acceptable. Among normal disc galaxies, which are our primary interest here, star- and cluster formation histories are more likely to have been roughly uniform, at least on average. For the Milky Way, several studies suggest that variations of a factor of 2-3 in the star formation rate on time scales of 108-109 years (Rocha-Pinto et al. 2000; Hernandez et al. 2000) are applicable.

2.5 Comparison with observations

Table 1:   Data for the brightest and 5th brightest clusters in a sample of galaxies.

Table 1 lists data for the brightest and 5th brightest cluster in a sample of galaxies. These are mostly spirals, taken from our previous work, with the addition of a few irregular galaxies. For the galaxies with data from Larsen (1999,2002), which are ground-based with more or less homogeneous detection limits, the last column lists the total number of clusters detected in each galaxy. Formally, the detection limit is at $\mathcal{M}_V=-8.5$ ( $\mathcal{M}_V=-9.5$ for clusters bluer than U - B = -0.4), but significant incompleteness likely sets in well above these limits. In selecting the cluster samples in the spirals, objects with H$\alpha$ emission were excluded so that the samples mostly consist of clusters older than about 10 Myr.

Overall properties of the cluster systems and the host galaxies have been discussed extensively by Larsen & Richtler (2000). The spiral galaxies span more than an order of magnitude in absolute and area-normalised star formation rate, and are mostly type Sbc or later. With the exception of NGC 5194, none is involved in any strong on-going interaction. The star formation rates show a strong correlation with the richness of the cluster systems (Larsen 2002), so that the last column in Table 1 may also be taken as an indication of the relative star formation rates in the galaxies. The irregular galaxies constitute a more heterogeneous sample, with at least some showing clear evidence of bursty star formation histories.

A requirement in collecting data for Table 1 was that age estimates and V-magnitudes were available for each individual cluster. In a few cases where less than five clusters were detected, the columns for the 5th brightest cluster are left empty. Ages were obtained from UBV photometry, employing the S-sequence method (Girardi et al. 1995). The S-sequence is essentially a curve drawn through the mean colours of Large and Small Magellanic Cloud clusters in the (U - B, B - V) plane. Age increases along the curve from blue to red colours, and clusters which do not fall exactly on the curve are projected onto it along straight lines that take into account the effects of stochastic colour fluctuations as well as interstellar extinction. Alternatively, ages could have been estimated by comparing the observed colours with SSP models. This was also tried (using the BC03 models), and differences with respect to the S-sequence were found to be minor. The latter was preferred here due to its empirical foundation, although the S-sequence is strictly valid only for LMC-like metallicity. For further details concerning the properties of individual galaxies and the data reduction and analysis, please consult the references given in Table 1.

The spirals are overplotted with filled circles in Fig. 4 while the irregulars are shown with triangles. Comparing with the Case A and B panels, most data points fall between the  $M_{\rm c}=10^5~M_{\odot}$ and $M_{\rm c} = 10^6~M_\odot$ curves. The two most outlying data points are for NGC 1569 and NGC 1705, both of which have experienced recent bursts of star formation (Angeretti et al. 2005; Annibali et al. 2003) and are dominated by one or two very bright, young clusters. These two systems therefore provide a good illustration of the limitations of this method. For the rest of the data points, a Schechter function with an $M_{\rm c}$ of a few times $10^5~M_\odot$ fits the mean trend quite well for both the brightest and 5th-brightest cluster. An exception to this statement occurs at low n6 in the Case B scenario, which may well be an indication that the disruption time scale assumed here is too short to match these galaxies. For the Case C scenario the picture is less clear. No single $M_{\rm c}$ value can match the data for the disruption parameters assumed here. A lower IMR would alleviate this problem to some extent by making the (log(age), $\mathcal{M}_V$) relations more similar to those for the no-disruption scenario. For example, an IMR of 50% per dex over 109 years would provide an acceptable fit. Alternatively, the trend towards higher median ages in low-n6 galaxies might be explained by invoking a lower infant mortality rate in these galaxies specifically, causing a shallower decline in $f_{\rm surv}$ with age.

Of the galaxies in Table 1, cluster disruption has only been studied in a few. In NGC 5194, the age-mass distribution of clusters older than about 10 Myr can be well fitted by a secular evolution model with a rather short disruption time scale of $t_4\approx 200$ Myr (Gieles et al. 2005; Bastian et al. 2005). Five additional galaxies (NGC 45, NGC 1313, NGC 4395 NGC 5236, NGC 7793) have been studied by Mora et al. (2008), but their data did not allow a clear distinction between secular evolution and infant mortality disruption scenarios. Without detailed knowledge about the cluster disruption laws in the galaxies, the interpretation of Fig. 4 in terms of $M_{\rm c}$ remains somewhat ambiguous. However, it does seem clear that if the MF is a Schechter function, then $M_{\rm c}$ cannot be much lower than a few times $10^5~M_\odot$ in any of these galaxies, regardless of the cluster disruption law. More generally, any significant steepening (or truncation) of the MF below a few times $10^5~M_\odot$, compared to a power-law with a slope around -2, is ruled out.

2.6 Radial trends

 \begin{figure}
\par\includegraphics[width=7.3cm,clip]{1212f5.eps}
\end{figure} Figure 5:

$\mathcal{M}_V$ versus log age for the brightest and 5th brightest clusters in radial sub-samples in NGC 5236 and NGC 6946. In each galaxy, each radial bin contains an equal number of clusters. The model lines are the same as for the Case A scenario in Fig. 4.

Open with DEXTER

Many properties of galaxies correlate with galactocentric distance: gas density, star formation rate, relative velocities of spiral arms and stellar orbits, metallicity, etc. It is not unreasonable to suspect that this might also lead to radial variations in the cluster MF. So far, radial trends in the properties of (young) cluster populations remain poorly studied. In NGC 5194, a radially dependent bend in the LF suggests that the upper limit of the MF might decrease outwards (Haas et al. 2008).

Figure 5 shows the same Case A model grids as the left-hand column in Fig. 4, together with the age and magnitude of the brightest and 5th brightest cluster in three radial subsamples for NGC 5236 and NGC 6946. These two galaxies have the largest number of detected clusters among those in Table 1. The radial bins were defined such that each bin contains an equal number of clusters (49 in NGC 5236, 35 in NGC 6946). The corresponding radial ranges are indicated in the figure legend. As in Fig. 4, the scatter of the data points is well within the error bars, i.e. the data are consistent with a radially independent $M_{\rm c}$ in both NGC 5236 and NGC 6946.

3 Comparing MFs directly

The ground-based data used in Sect. 2 are, unfortunately, not ideal for direct studies of the MF. Visual inspection played a central role in the identification of cluster candidates, so completeness effects are hard to quantify. Nevertheless, it is tempting to attempt a comparison of MFs derived from these data with a Schechter function.

First, a purely differential comparison was carried out by dividing the spiral galaxies in Table 1 into a cluster-rich and a cluster-poor sample. The cluster-rich sample consists of those galaxies with more than 100 clusters each (NGC 5236, NGC 6946) and the cluster-poor sample of galaxies with less than 40 clusters each. NGC 2997 formally falls in the cluster-poor category, but has been excluded because it is more distant than most of the other galaxies in the sample. Its high specific U-band cluster luminosity, $T_{\rm L}(U)=1.45$ (Larsen & Richtler 2000) suggests that it would probably have fallen in the cluster-rich sample if located at the distance of NGC 5236 or NGC 6946.

Having an estimate of the age and $\mathcal{M}_V$ for each cluster, masses were derived using mass-to-light ratios from BC03. In principle, both the S-sequence method and a comparison with SSP models allow the extinction to be estimated for each individual cluster, by determining the shift along the extinction vector that is required to obtain the best match between observed and model/S-sequence colours. Both methods were tried, and yielded small mean AV values for both the cluster-poor and cluster-rich samples ( $\langle A_V \rangle \approx -0.1{-}+0.2$ mag). However, the scatter was large (0.4-0.6 mag for the S-sequence method and 0.7-1.0 mag when using the SSP models) with a significant tail towards negative extinctions. This suggests that a correction for extinction for each individual cluster would introduce significant spurious scatter, possibly because differences between observed and model colours can have other causes (e.g., stochastic colour fluctuations due to the finite number of stars in a cluster, or simply photometric errors). A choice was therefore made to omit extinction corrections for individual clusters. However, the following analysis was, in fact, also carried out (though not described in detail here) for extinction-corrected cluster samples, with no change to the conclusions.

The MFs for clusters younger than 200 Myr in the two spiral samples are compared in Fig. 6. This age range was chosen such that secular evolution can be expected to play a relatively minor role in most galaxies. At 200 Myr, the detection limit corresponds to a limiting mass of about $7\times10^4~M_{\odot}$, but as in any cluster sample limited by a magnitude cut, this mass limit is strongly age-dependent and younger clusters with masses down to $\sim$ $10^4~M_{\odot}$ are present in the samples. Also shown is the MF for young clusters in the Antennae in the age range 25-160 Myr (Zhang & Fall 1999), derived assuming a distance of 19.2 Mpc. Recently, Saviane et al. (2008) claimed a smaller distance of 13.3 Mpc based on the red giant branch tip, but Schweizer et al. (2008) find a distance of 22 Mpc based on observations of the type-Ia supernova 2007sr and point out a number of problems with a smaller distance. Hence, we have kept the original mass scale of Fall & Zhang. The apparent peaks in the MFs for the two spiral samples are undoubtedly due to incompleteness. The relatively higher fraction of clusters with masses below $10^5~M_\odot$ in the cluster-poor spirals is most likely due the easier identification of low-mass clusters in these galaxies, which tend to have lower star formation rates, lower surface brightness, and less crowding. Above $10^5~M_\odot$, however, the two spiral MFs appear remarkably similar to each other, and a Kolmogorov-Smirnov test confirms this impression by returning a P-value of 0.75 when comparing the two samples. In other words, the MFs in the cluster-rich and cluster-poor spirals are statistically indistinguishable. Also worth noting is that the spiral MFs extend to nearly as massive clusters as those in the Antennae, although with a steeper slope.

 \begin{figure}
\par\includegraphics[width=7cm,clip]{1212f6.eps}
\end{figure} Figure 6:

Combined cluster mass functions for the two galaxies with more than 100 clusters each (NGC 5236, NGC 6946) and those with less than 40 clusters each. Clusters with ages < $2\times 10^8$ years are included. Also shown is the mass function for young clusters in the Antennae (Zhang & Fall 1999) and a Schechter function with $M_{\rm c}=2.1\times10^5~M_{\odot}$ and $\alpha =-2$.

Open with DEXTER

A maximum-likelihood fit of a Schechter function with fixed faint-end slope $\alpha =-2$ to the combined spiral data returns a best-fit $M_{\rm c} = (2.1\pm0.4)\times10^5~M_{\odot}$, also indicated in Fig. 6 (arbitrarily scaled). Comparing this fit with the data, a K-S test returns a P-value of 0.22 for the combined spiral sample and P=0.17 and P=0.86 for the cluster-rich and cluster-poor samples separately. In all cases the mass distributions are consistent with being drawn from a single Schechter function. Moreover, the fitted $M_{\rm c}$ value appears consistent when comparing with the Case A and B scenarios in Figs. 4 and 5. The fit is not unique: for example, a Schechter function with a steeper faint-end slope ( $\alpha=-2.2$) and a higher $M_{\rm c}=2.5\times10^5~M_{\odot}$ provides an equally good fit. However, a Schechter function with $\alpha =-2$ and $M_{\rm c} = 10^6~M_\odot$ is ruled out at high confidence level; in this case a K-S test returns a P value of only $2\times10^{-6}$ and even higher $M_{\rm c}$ are ruled out at still higher confidence level. Similarly, $M_{\rm c}$ values below $10^5~M_\odot$ are also ruled out.

The MF derived here differs from that obtained by Dowell et al. (2008) who find power-law slopes of $\alpha\sim-1.8$ for $10^5 < M/M_\odot < 10^7$ for clusters in a sample of nearby spiral and irregular galaxies. The authors of this study point out that their cluster sample may be contaminated by blending and more diffuse associations because of the relatively poor resolution of their Sloan Digital Sky Survey data, which might lead to an overestimate of the number of very massive objects. They also note that completeness effects may have been underestimated at the low-mass end, which would lead to a too shallow overall slope. Interestingly, however, they find similar MFs in spirals and irregulars.

4 Implications for the luminosity function

Luminosity functions require no conversion from observed luminosities to masses via age-dependent mass-to-light ratios, so it is not surprising that more information is available about LFs than MFs. Larsen (2002) fitted power-laws ${\rm d}N/{\rm d}L \propto L^{\alpha_{\rm L}}$ to the LFs of young clusters in 6 spiral galaxies with HST/WFPC2 imaging, and found slopes in the range $-2 > \alpha_{\rm L} > -2.4$ for typical $\mathcal{M}_V$ intervals $-6 > \mathcal{M}_V > -9$. A trend for the slopes to become steeper in brighter magnitude intervals was noted. Similar numbers were found for a sample of 5 spirals by Mora et al. (2008), using HST/ACS data. The most detailed analysis of the LF in a single spiral galaxy is for NGC 5194, which has ACS imaging covering the entire disc. From these data, slopes between $\alpha_{\rm L}=-2.5$ and $\alpha_{\rm L}=-2.6$ have been obtained for clusters brighter than $\mathcal{M}_V=-8$ (Haas et al. 2008; Hwang & Lee 2008). This is similar to the LF slope derived by Dolphin & Kennicutt (2002) for clusters in NGC 3627: $\alpha_{\rm L} = -2.53\pm0.15$ for $-8 > \mathcal{M}_V > -11$. Gieles et al. (2006b) have shown that the LF of clusters in NGC 5194 is consistent with clusters having been drawn from a Schechter mass function with $M_{\rm c} \approx 10^5~M_{\odot}$.

The top panel in Fig. 7 shows the luminosity functions corresponding to various cluster disruption scenarios for a Schechter ICMF with $M_{\rm c}=2.1\times10^5~M_{\odot}$ and $\alpha =-2$, plotted as number of clusters per magnitude bin. LFs are shown for no disruption, for secular dissolution with t4=200 Myr and t4=1 Gyr, and for mass-independent disruption with $\rm IMR=90$% for 108 years (i.e. the same as the Case C scenario in Sect. 2). These LFs were obtained by integrating Eq. (7) over all $\tau$, assuming a constant cluster formation rate. It is evident that the LFs all steepen towards the bright end and hence cannot be well approximated by a single power-law. Local values of the logarithmic slope, i.e., the equivalent of the exponent of a local power-law fit ${\rm d}N/{\rm d}L \propto L^{\alpha_{\rm L}}$, are plotted in the bottom panel of the figure. In the absence of secular, mass-dependent evolution the faint-end slope of the LF approaches that of the ICMF. At the bright end, the LF steepens, while at the faint end secular dissolution of low-mass clusters can lead to a shallower slope.

According to Fig. 7, LF slopes steeper than $\alpha_{\rm L}=-2$ are generally expected for the magnitude ranges typically covered in studies of extragalactic star cluster populations. Comparing with the NGC 5194 observations specifically, the no-disruption model predicts a mean slope of $\langle \alpha_{\rm L} \rangle = -2.52$ for $-8 > \mathcal{M}_V > -10$, in excellent agreement with the studies cited above. Inclusion of disruption effects leads to a somewhat shallower slope, e.g. $\langle \alpha_{\rm L} \rangle = -2.19$ over the same $\mathcal{M}_V$ range for $t_4=2\times10^8$ years. However, age-dependent extinction can steepen the LF slope (Larsen 2002) and there are indeed indications that younger clusters in M 51 are more heavily extincted (Bastian et al. 2005). Furthermore, the ACS mosaic for which the LF data were derived cover the whole disc where a longer disruption time scale than 200 Myr may be applicable, and the assumption of a constant cluster formation rate may well be invalid for M 51. Lastly, the disruption time estimates for M 51 were based on the assumption of an $\alpha =-2$ power-law ICMF. The smaller number of high-mass clusters predicted for an $M_{\rm c}=2\times10^5~M_{\odot}$ Schechter function would most strongly affect the detectable number of clusters at old ages, thus requiring less destruction and possibly lead to a longer estimate of the disruption time scale. Interestingly, the faint end of the Milky Way open cluster LF ( $-2 > \mathcal{M}_V > -9$) has a relatively shallow slope of $\alpha=-1.5$ (van den Bergh & Lafontaine 1984). Due to secular dissolution, this is not necessarily inconsistent with an ICMF with low-mass slope $\alpha =-2$. In summary, a range of LF slopes is clearly expected when fitting cluster samples in different galaxies over different magnitude intervals. Generally, we conclude that an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter MF can lead to LFs that are consistent with observations of real cluster samples, but once again note that disruption and extinction effects make it difficult to draw strong conclusions about MFs based on observations of LFs alone.

 \begin{figure}
\par\includegraphics[width=7.8cm,clip]{fig7em} %\end{figure} Figure 7:

Top: luminosity functions for a cluster sample with a Schechter mass function with $M_{\rm c}=2.1\times10^5~M_{\odot}$. Bottom: local logarithmic slope. In all cases, a constant cluster formation history has been assumed and various disruption scenarios are applied, as indicated in the legend.

Open with DEXTER

5 The cluster mass function in other environments - universal or not?

While the analysis in the previous sections indicates than an $M_{\rm c}\approx2\times10^5~M_{\odot}$ Schechter function fits the MFs of clusters in both cluster-poor and cluster-rich spirals (the latter also in different radial bins), the data are hardly sufficient to claim that this fit is universally valid. In the following, we examine to what extent this MF is consistent with observations of star clusters in other environments.

5.1 Open clusters in the Milky Way

When discussing the Milky Way open clusters, the poorly quantified completeness of current catalogues is a severe obstacle and the following comparison therefore remains necessarily sketchy. We first estimate, for a given cluster formation rate and MF, how many clusters should have formed in the Galactic disc in various mass intervals. We assume a star formation rate in bound clusters of $5.2\times10^{-10}~M_{\odot}$ yr-1 pc-2 in the Solar neighbourhood (LAM05) and further approximate the Galactic disc as having a uniform cluster formation rate within a radius of R=10 kpc. The total cluster formation rate is then $\sim$0.16 $M_\odot$ yr-1, which we assume to have been constant in the past. If this mass formed in clusters with masses sampled from a Schechter function with a lower mass limit of 100 $M_\odot$ and $M_{\rm c}=2\times10^5~M_{\odot}$, the Milky Way has then formed about 19 000 and 700 clusters more massive than $10^4~M_{\odot}$ and $10^5~M_\odot$ over a 10 Gyr period, respectively. This corresponds to one (bound) cluster more massive than $10^5~M_\odot$ forming about every 13 Myr somewhere in the disc, or one more massive than $10^4~M_{\odot}$ about every $\sim$500 000 years.

About a dozen star clusters with ages $\la$10 Myr and masses in the range $10^4~M_{\odot}{-}10^5~M_{\odot}$ have now been found within several kpc from the Sun (Figer 2008). Based on the above, only 5 clusters with $M>10^4~M_{\odot}$ and younger than 10 Myr are expected within 5 kpc, but the calculation can hardly be considered more than an order-of-magnitude estimate and it is also possible that some of the young Milky Way clusters identified in current surveys will not remain bound. Hence, the observed numbers of young massive clusters does not seem to be in significant disagreement with what is expected based on the estimated formation rate of bound clusters and the assumption of an $M_{\rm c}=2\times10^5~M_{\odot}$ Schechter MF.

Comparison with masses $10^5{-}10^6~M_{\odot}$ is more difficult. No young disc clusters in this range have been identified, but only $\sim$1 is expected to have formed in the last 107 years. About half of the total $\sim$700 clusters in this mass range would already have disrupted or have lost a large fraction of their initial mass, assuming that the LAM05 disruption parameters were valid also in the past. If the remaining objects are evenly scattered across the Galactic disc, their average surface density is about 1 kpc-2, so a few should be found within $\sim$1 kpc. This is in fair agreement with the estimate by LAM05 (based on the catalogue of Kharchenko et al. 2005) that the most massive cluster formed within 600 pc over the past few Gyr had an initial mass of $\sim$ $10^5~M_\odot$.

A 1-Gyr old cluster with an initial mass of $10^5~M_\odot$ should still have a present-day mass of $\sim$60 000 $M_\odot$. The surface density of clusters younger than 1 Gyr and initial masses above $10^5~M_\odot$ should be about 0.25 kpc-2 so a more complete survey of clusters in the Galactic disc out to several kpc might reveal a significant population of intermediate-age, massive clusters. It is of interest to note that the Milky Way does host a number of several Gyr-old open clusters which must initially have been quite massive (Janes & Phelps 1994; Friel 1995). Identification of these objects is helped by the fact that many of them tend to be located quite far above the Galactic disc.

Extrapolating further to above $10^6~M_{\odot}$, the Milky Way disc would only form one such cluster about every 40 Gyr for an $M_{\rm c}=2\times10^5~M_{\odot}$ Schechter MF, hence it is not surprising that none has been found. This latter number is in stark contrast to the $\sim$100 clusters predicted by extrapolation of an $\alpha =-2$ power-law.

5.2 The large Magellanic Cloud

 \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f8.eps}
\end{figure} Figure 8:

Mass function for star clusters in the LMC younger than 1 Gyr, based on photometry from Bica et al. (1996). The dotted-dashed line is the same Schechter function as in Fig. 6, arbitrarily scaled.

Open with DEXTER

The LMC has long been known to host a rich population of young massive star clusters (e.g. Hodge 1961), and studies of the LMC clusters benefit from the advantage of having a clear view of the entire galaxy. For the brighter clusters, current databases are unlikely to suffer from significant incompleteness. Excluding the old GCs, the most massive clusters in the LMC have masses of $\approx$ $2\times10^5~M_{\odot}$ (Anders & de Grijs 2006; Hunter et al. 2003), and the MF can be well fitted by a power-law with slope $\alpha =-2$. Since the LMC cluster MF barely overlaps with the spiral data in Fig. 6, a direct comparison is not possible, but one can ask whether the LMC data are consistent with the Schechter MF derived for the spirals. To this end, the UBV photometry tabulated by Bica et al. (1996) was used to derive ages and masses for 504 LMC clusters in the same way as was done for the spirals in Sect. 3 (except that Z=0.008 SSP models were used, as appropriate for the LMC). A distance modulus of $m-\mathcal{M}=18.5$ and foreground extinction AV=0.25 mag were assumed. The resulting MF for clusters younger than 1 Gyr is shown in Fig. 8 together with an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter function. The Bica et al. (1996) data have an estimated completeness limit of $V\approx13$, which corresponds to a limiting mass of $\sim$ $10^4~M_{\odot}$ for an age of 1 Gyr. The more recent datasets go deeper, but for comparison with the spiral data the Bica et al. photometry suffices. This analysis yields 90 clusters more massive than $10^4~M_{\odot}$, of which two exceed $10^5~M_\odot$. A K-S test comparing the LMC cluster MF for $M>10^4~M_{\odot}$ with the Schechter function returns P=0.94, indicating excellent agreement (although the data are also consistent with an untruncated power-law; P=0.56). It can therefore be concluded that the LMC cluster MF can be fitted by the same MF derived for spirals, although there is little actual overlap in mass between the two samples. Thus, the impression that the LMC is particularly rich in ``massive'' clusters is most likely an observational selection effect.

5.3 Starbursts and mergers

Turning to more extreme star forming environments, there are indications that the MF derived for spirals may not be adequate. We have already noted from the comparison of spiral and Antennae cluster MFs in Fig. 6 that there are relatively more high-mass clusters in the Antennae sample. Indeed, a Schechter function fit to the Zhang & Fall data yields $\log M_{\rm c} = 5.9^{+0.45}_{-0.25}$ (Jordán et al. 2007), while an $M_{\rm c}$ that high is ruled out at high confidence level for the spirals. The conditions in the Antennae seem to be such that $M_{\rm c}$ has shifted to a higher value, so that a relatively larger number of high-mass clusters form. The same appears to be the case in M 82, although analysis of the cluster MF there is complicated by the large amount of extinction internally in the galaxy (Smith et al. 2007). From dynamical mass measurements for 15 clusters with masses between $2\times10^5~M_{\odot}$ and $4\times10^6~M_{\odot}$, McCrady & Graham (2007) fit a power-law with a slope of $\alpha=-1.91\pm0.06$. This is again inconsistent with an $M_{\rm c}=2\times10^5~M_{\odot}$ Schechter MF although selection effects remain difficult to quantify.

The mere presence of > $10^7~M_{\odot}$ clusters in NGC 1316, NGC 7252 and Arp 220 (Wilson et al. 2006; Bastian et al. 2006) is also inconsistent with a Schechter MF with an $M_{\rm c}$ in the range $10^5{-}10^6~M_{\odot}$. If a peak star formation rate of $\sim$ $1000~M_{\odot}$ yr-1 (Bastian 2008) were turned completely into clusters, the number of clusters with $M>10^7~M_{\odot}$ formed would be less than $5\times10^{-5}$ Myr-1 for $M_{\rm c} = 10^6~M_\odot$. Even if such a high SFR could be sustained for hundreds of Myrs, a negligible number of $10^7~M_{\odot}$ clusters would form. Hence, it seems that the cluster MF in these more extreme environments must be different than that observed in spiral discs.

5.4 Old globular clusters

Ultimately, it would be desirable to understand the properties of old GC populations in the context of their younger counterparts in present-day star forming galaxies. However, classical GCs have been subject to dynamical evolution over a Hubble time, which complicates the comparison. Few clusters with initial masses less than $\sim$ $10^5~M_\odot$ would have survived for a Hubble time (Fall & Zhang 2001; Vesperini & Heggie 1997; Baumgardt & Makino 2003) and the shape of the low-mass end of the globular cluster MF is therefore strongly affected by dynamical evolution. At higher masses, the MF is more likely to resemble its original shape and there is substantial evidence of a steepening of the MF at the high-mass end in the rich GC systems of giant elliptical galaxies (McLaughlin & Pudritz 1996; Burkert & Smith 2000; Jordán et al. 2007). The ``evolved Schechter function'' fits to mass functions for GC systems in Virgo cluster galaxies by Jordán et al. (2007) indicate an $M_{\rm c}$ increasing steadily from $(3{-}4)\times10^5~M_{\odot}$ in the faintest galaxies ( $\mathcal{M}_V \sim -16$) to $(2{-}3)\times10^6~M_{\odot}$ in the giant ellipticals. Given such evidence of variations in $M_{\rm c}$ even among old GC systems, it is not surprising that the present-day ICMF may also be environmentally dependent. Ultimately, understanding these variations may be a powerful tool for constraining the formation conditions for GC systems at cosmological epochs.

6 Discussion

The suggestion that the ICMF has a characteristic upper mass scale naturally raises the question as to what physical mechanism is responsible. It is tempting to assume that the ICMF is related to the mass function of the giant molecular clouds (GMCs) in which cluster formation takes place. The MF of GMCs in the Milky Way and other Local Group galaxies can be fitted by a power-law with slope $\approx$-1.7, slightly shallower than for the cluster MF, below a fairly well defined upper limit of $M \approx 6\times10^6~M_{\odot}$ (Rosolowsky 2007; Blitz et al. 2007; Williams & McKee 1997). Much of the mass in a (Galactic) GMC is at low density and global star formation efficiencies (SFEs) are typically only a few percent (Myers et al. 1986; Lada & Lada 2003). Although it is not necessarily true that all star formation in a GMC results in a single cluster, the upper GMC mass limit and low SFEs seem roughly compatible with the $M_{\rm c}$ derived in this paper for the ICMF in spiral galaxies. However, the local SFE within cluster-forming clumps in GMCs has to be high to produce a bound star cluster (30%-50%; Elmegreen 1983; Hills 1980; Baumgardt & Kroupa 2007) and ultimately the mass spectrum of these clumps may be more relevant.

The formation of molecular clouds and the origin of their mass function remain poorly understood. The low-mass slope may follow from fractal structure in a turbulence-dominated interstellar medium (ISM) (Elmegreen & Falgarone 1996). It has often been suggested that the upper mass limit is related to the maximum unstable length scale in a rotating disc, due to shear and/or the coriolis force (Dowell et al. 2008; Hunter et al. 1998; Toomre 1964). Analytic arguments and detailed numerical simulations lead to maximum cloud masses of $\sim$ $10^7~M_{\odot}$, in rough agreement with observations (McKee & Ostriker 2007). The maximum cloud mass may be expected to scale with gas/total mass fraction and total gas mass in discs, so that more massive clouds should be able to form e.g. in gas-rich proto-galactic discs (Escala & Larson 2008). However, the picture is complicated by the highly dynamic character of the interstellar medium. GMCs are probably rather transient structures (Elmegreen 2007; Mac Low & Klessen 2004) so it is unclear to what extent an overall stability criterion is applicable in general.

What might be responsible for shifting the maximum cluster mass upwards in starburst environments? If the most massive clusters form at similarly low global efficiencies, massive ``Super-GMCs'' (SGMCs) would be required (Harris & Pudritz 1994; McLaughlin & Pudritz 1996). In the context of old GC systems, these SGMCs may be identified with Searle & Zinn (1978)-style fragments and cosmological simulations suggest that the first GCs may indeed have formed in such clouds, embedded within dark matter halos (Kravtsov & Gnedin 2005). Some evidence that elements of this scenario may be relevant in present-day star forming environments is provided by observations of cloud complexes with masses up to nearly $10^9~M_{\odot}$ in the Antennae (Wilson et al. 2003). However, the Antennae is too far away to study these structures in much detail with current radio interferometers. Giant molecular associations with masses of $10^7{-}10^8~M_{\odot}$, potentially suitable for forming $10^5{-}10^6~M_{\odot}$ clusters at low overall efficiency, have been found in nearby cluster-rich spirals like NGC 5194 and NGC 5236 but tend to break up into smaller GMC-like structures when observed at high resolution (Rand et al. 1999; Vogel et al. 1988; Lundgren et al. 2004; Kuno et al. 1995). In general, the ISM in galaxies is hierarchically structured also on scales above the scale of individual GMCs (Elmegreen 2007; Elmegreen & Falgarone 1996) and SGMC-like structures may appear conspicuous in the Antennae simply because a larger fraction of the gas is in molecular form there (Gao et al. 2001). In M 82, the nearest starburst, observations at high spatial resolution have failed to reveal distinct SGMCs (Keto et al. 2005).

Alternatively, a shift towards higher maximum cluster mass could result if the global SFE within more modest-sized GMCs is increased. In mergers, GMCs may be compressed to high densities by an increased ambient ISM pressure as the two galaxies collide, leading to a strong increase in the overall star formation rate and formation of massive, bound clusters (Ashman & Zepf 2001; Jog & Solomon 1992). Indeed, observations show that a large fraction of the interstellar medium in starburst galaxies (many of which are also mergers) is in a dense phase (Gao & Solomon 2004). There is some evidence of shock-triggered cluster formation in the Antennae, but large-scale shocks are probably not directly responsible for most cluster formation there (Zhang et al. 2001). GMCs in M 82 also seem to be shock-compressed, in this case not due to a major merger but due to the pressure from surrounding ionized gas (Keto et al. 2005), a mechanism which may also be at work in the Antennae.

The distinction between formation of massive clusters in dense clumps within SGMCs and in normal-sized but compressed GMCs may be somewhat artificial in most present-day environments, since any dense concentration of molecular gas will always be surrounded by lower-density material. A more relevant issue may be what delivers the pressure necessary to confine a large amount of gas in dense clumps while star formation is taking place (Elmegreen & Efremov 1997). Here, differences in local conditions of the interstellar medium, such as gas kinematics, may be of relevance. The most actively star-forming spiral galaxies in our sample, like NGC 6946, have overall star formation rates (SFRs) which are no more than a factor of a few smaller than in the Antennae (Sauty et al. 1998; Larsen 2002; Stanford et al. 1990; Zhang et al. 2001). In spite of this high SFR, the kinematic situation in NGC 6946 (as in most spirals) is relatively quiescent. The gas resides in a large, orderly rotating disc (Boomsma et al. 2008) with no major distortions apart from a number of (probably extra-planar) high velocity clouds. The galaxy is among the most isolated in the nearby Universe (Pisano & Wilcots 2000), so its high SFR is unlikely to be interaction-induced. The gas kinematics in the Antennae are instead complex with a velocity spread of several hundred km s-1, notably in the overlap region of the two galaxies where star formation is currently most active (Gao et al. 2001; Gordon et al. 2001; Gilbert & Graham 2007). The starburst in M 82 is driving a superwind with outflow velocities in excess of 100 km s-1, carrying a significant fraction of the molecular gas in the system (Walter et al. 2002), again showing that a large amount of energy is being injected into the interstellar gas. Similar complex gas kinematics are observed in other starbursts (Martin 2006; Arribas et al. 2001). In these environments, formation of massive clusters may be favoured in regions of high gas pressure (Keto et al. 2005; Elmegreen & Efremov 1997). Theoretically, this is supported by simulations of major mergers (Li et al. 2004; Bournaud et al. 2008; Mihos & Hernquist 1996) which show that a larger fraction of the gas in the mergers is found in regions of high pressure and high velocity dispersion (>30 km s-1) compared to a spiral disc.

It is difficult to tell whether star formation in spirals and mergers/starbursts represent truly different modes or are part of a continuum. While a relatively larger fraction of the gas in violent starbursts is in high-pressure regions, such conditions may also be encountered more locally in spiral discs (Elmegreen et al. 2000; Efremov et al. 2007). Indeed, the comparison in Fig. 6 shows that the maximum cluster mass encountered in spirals is not very different from that in the Antennae, although the high-mass slope of the ICMF appears steeper in the spirals. The Antennae is still in an early stage of the merging process and the conditions are likely to get more extreme over the next several 108 years (Mihos et al. 1993) with a possible shift of $M_{\rm c}$ to even higher values. There may well be a range of possible $M_{\rm c}$ values in present-day star forming environments and not just two distinct (``quiescent'' and ``starburst'') modes.

Some of the most massive clusters might form by a different mechanism altogether, such as merging of less massive clusters (Fellhauer & Kroupa 2005; Elmegreen et al. 2000). This would bypass the need for a mechanism to collect > $10^7~M_{\odot}$ of gas in a single dense clump, but would still require many lower-mass clusters to form sufficiently near each other to merge. The merger products are predicted to be relatively diffuse so this mechanism is unlikely to account for more compact clusters such as those found in NGC 1316 and NGC 7252 (Bastian et al. 2006). However, extremely massive objects, like the $8\times10^7~M_{\odot}$ object W3 in NGC 7252 (Maraston et al. 2004), could have formed this way.

7 Summary and conclusions

This paper provides three main lines of evidence suggesting that the mass function of young star clusters in spiral galaxies can be approximated by a Schechter function with $M_{\rm c}=2.1\times10^5~M_{\odot}$ and low-mass slope $\alpha =-2$:

  • The observed age-luminosity relations for the brightest and 5th brightest cluster in a sample of mostly late-type spirals are consistent with those expected for cluster populations drawn from a Schechter MF with an $M_{\rm c}$ of a few times $10^5~M_\odot$. This result is not significantly affected by secular cluster disruption.
  • A maximum-likelihood fit of a Schechter function to the combined MF for clusters more massive than $10^5~M_\odot$ in these spirals yields $M_{\rm c} = (2.1\pm0.4)\times10^5~M_{\odot}$ for fixed low-mass slope $\alpha =-2$. The cluster MFs for cluster-poor and cluster-rich subsamples of the galaxies are statistically indistinguishable.
  • The luminosity function for a model cluster sample with masses drawn from an $M_{\rm c}=2.1\times10^5~M_{\odot}$ Schechter function is consistent with existing data for the LFs of clusters in spiral galaxies, including the well studied M 51 cluster system.
An $M_{\rm c}=2.1\times10^5~M_{\odot}$ (or $M_{\rm c}\approx1.4\times10^5~M_{\odot}$ for a Kroupa- or Chabrier IMF) Schechter function also fits the MF of clusters in the Large Magellanic Cloud, and it is argued that it is probably consistent with current surveys of young clusters in the Milky Way. It is, however, unlikely to be universal. In merger galaxies and other starbursts, a Schechter function fit requires a significantly higher $M_{\rm c}$ than derived for the spirals. Since some spirals are not very different from mergers like the Antennae in terms of overall star formation rate or gas content, differences in the upper end of the cluster MFs are more likely related to the different gas kinematics in these systems. The fact that $M_{\rm c}$ values are found to be greater for old GC systems than for young cluster populations in discs suggests that star (cluster) formation in present-day spiral discs generally proceeds in a relatively more orderly way than when most old GCs formed. In the future, observations with ALMA should provide crucial insight into the properties of molecular gas on scales that are directly relevant to formation of individual clusters in galaxies well beyond the Local Group.

Acknowledgements
I thank Mark Gieles for encouragement to get this work published and Brad Whitmore for an interesting discussion during the Science with the new Hubble Space Telescope meeting in Bologna, January 2008. The manuscript has benefited from helpful comments from Mark Gieles, Henny Lamers and the referee, Nate Bastian.

References

All Tables

Table 1:   Data for the brightest and 5th brightest clusters in a sample of galaxies.

All Figures

  \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f1.eps}
\end{figure} Figure 1:

Simulated absolute magnitude $\mathcal{M}_V$ versus log(age) plots for cluster systems of increasing richness. A power-law mass function truncated at $10^6~M_{\odot}$ is assumed.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f2.eps}
\end{figure} Figure 2:

Absolute $\mathcal{M}_V$ magnitude versus age for the brightest cluster when sampling N clusters at random from a Schechter mass function with $M_{\rm c} = 10^6~M_\odot$. For each N, a number of realisations are shown. The curve is the $\mathcal{M}_V$ magnitude vs. age for a $10^6~M_{\odot}$ cluster. No disruption is included.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7.8cm,clip]{fig3em} %\end{figure} Figure 3:

Age distribution of the brightest cluster for $M_{\rm c} = 10^6~M_\odot$ and N=104. The histogram is for the N=104 Monte-Carlo simulation in Fig. 2 while the solid (blue) curve shows the exact relation according to Eq. (7) for $\mathcal{M}_V=-10.3$ mag.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=16cm,clip]{1212f4.eps}
\end{figure} Figure 4:

The median absolute magnitude of the brightest ( top) and 5th brightest ( bottom) cluster versus median age. Results are shown for Schechter mass functions with $M_{\rm c} = 10^4$, 105, 106 and $10^7~M_{\odot}$ and $\alpha =-2$ (solid curves), as well as for $M_{\rm c}=3\times10^5~M_{\odot}$ and $M_{\rm c}=10^{15}~M_{\odot}$ (dotted curves). The (red) dashed lines are for constant total number of clusters (n6=101, 102, ..., 105) brighter than $\mathcal{M}_V=-6$. The error bars mark the median absolute deviation of log(age) and $\mathcal{M}_V$ for fixed $M_{\rm c}$ and n6. The triangles mark clusters in irregular galaxies while circles are for spirals (see Table 1).

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7.3cm,clip]{1212f5.eps}
\end{figure} Figure 5:

$\mathcal{M}_V$ versus log age for the brightest and 5th brightest clusters in radial sub-samples in NGC 5236 and NGC 6946. In each galaxy, each radial bin contains an equal number of clusters. The model lines are the same as for the Case A scenario in Fig. 4.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7cm,clip]{1212f6.eps}
\end{figure} Figure 6:

Combined cluster mass functions for the two galaxies with more than 100 clusters each (NGC 5236, NGC 6946) and those with less than 40 clusters each. Clusters with ages < $2\times 10^8$ years are included. Also shown is the mass function for young clusters in the Antennae (Zhang & Fall 1999) and a Schechter function with $M_{\rm c}=2.1\times10^5~M_{\odot}$ and $\alpha =-2$.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7.8cm,clip]{fig7em} %\end{figure} Figure 7:

Top: luminosity functions for a cluster sample with a Schechter mass function with $M_{\rm c}=2.1\times10^5~M_{\odot}$. Bottom: local logarithmic slope. In all cases, a constant cluster formation history has been assumed and various disruption scenarios are applied, as indicated in the legend.

Open with DEXTER
In the text

  \begin{figure}
\par\includegraphics[width=7.8cm,clip]{1212f8.eps}
\end{figure} Figure 8:

Mass function for star clusters in the LMC younger than 1 Gyr, based on photometry from Bica et al. (1996). The dotted-dashed line is the same Schechter function as in Fig. 6, arbitrarily scaled.

Open with DEXTER
In the text


Copyright ESO 2009

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.