A&A 493, 735-745 (2009)
DOI: 10.1051/0004-6361:200810519
M. Lombardi1,2
1 - European Southern Observatory, Karl-Schwarzschild-Straße 2,
85748 Garching bei München, Germany
2 - University of Milan,
Department of Physics, via Celoria 16, 20133 Milan, Italy
Received 4 July 2008 / Accepted 16 September 2008
Abstract
Observational data and theoretical calculations show that
significant small-scale substructures are present in dark molecular
clouds. These inhomogeneities can provide useful clues to the
physical conditions inside the clouds, but can also severely bias
extinction measurements. We present NICEST, a novel method
to account and correct for inhomogeneities in molecular cloud
extinction studies. The method, tested against numerical
simulations, removes almost completely the biases introduced by
sub-pixel structures and by the contamination of foreground stars.
We applied NICEST to 2MASS data of the Pipe molecular
complex. The map thereby obtained shows significantly higher (up to
in AK) extinction peaks than the standard
NICER (Lombardi & Schneider 2001, A&A, 373, 359) map. This first application
confirms that substructures in nearby molecular clouds, if not
accounted for, can significantly bias extinction measurements in
regions with
;
the effect, moreover, is
expected to increase in more distant molecular clouds, because of the
poorer physical resolution achievable.
Key words: dust, extinction - methods: statistical - ISM: clouds - infrared: ISM - ISM: structure - ISM: individual objects: Pipe molecular complex
Although the details of star and planet formation are little known, there is a general consensus that these objects are created inside dark molecular clouds from the contraction and fragmentation of dense, cold cores. As a result, in the last decades molecular clouds have been studied in detail using many different techniques, from optical number counts (Wolf 1923; Bok 1937), to radio observations of carbon monoxide (CO) molecules (Frerking et al. 1982; Wilson et al. 1970), to near-infrared (NIR) extinction measurements (Lada et al. 1994).
Molecular hydrogen and helium represent approximately
of the
mass of a cloud, but the absence of a dipole moment in these molecules
makes them virtually undetectable at the low temperatures (
)
of these objects. Hence, the (projected) mass
distribution of dark clouds is usually inferred from the distribution
of relatively rare tracers (such as CO or dust grains) under the
assumption that these are uniformly distributed in the cloud.
As pointed out by Lada et al. (1994), the reddening of background stars in NIR bands is a simple and reliable method to study the dust distribution and thus the hydrogen column density. Compared to optical bands, NIR bands are less affected by extinction and are less sensitive to the physical properties of the dust grains (Mathis 1990), and thus their color excess can be directly translated into a hydrogen column density. In a series of papers we reconsidered and improved the NIR color excess (NICE) technique, by generalizing it to use three or more bands (NICER, see Lombardi & Alves 2001) and by taking a maximum-likelihood approach (Lombardi 2005).
Although the NICE and NICER techniques have been very successful in studying molecular clouds (see, e.g. Alves et al. 2001), they are plagued by two complications: small-scale inhomogeneities and contamination by foreground stars. NIR studies typically assume a uniform extinction over (small) patches of the sky; however, in the presence of substructures such as steep gradients, unresolved filaments, or turbulence-induced inhomogeneities (Heyer & Brunt 2004; Larson 1981; Lada et al. 1994), the background stars used to estimate the cloud extinction would not be uniformly distributed in the patch, but would be preferentially observed in low density regions. Similarly, in the presence of foreground stars, their relative fraction increases with the dust column density. Unfortunately, both effects will bias all color excess estimators toward low column densities; moreover, the bias will be especially important in the dense regions of molecular clouds, which is where the star formation takes place.
In this paper we describe in detail the repercussions of small scale inhomogeneities and foreground stars in extinction measurements, and propose a simple method to obtain estimates of the column density in the presence of both effects that, under simple working hypotheses, is asymptotically unbiased. Our method does not rely on any (usually poorly verified) assumption regarding the small scale structure of the cloud; moreover, it can be applied to a general class of NIR extinction measurements (marked spatial point processes).
The paper is organized as follows. In Sect. 2 we introduce a general smoothing technique used in all cases to create smooth, continuous maps from the discrete pencil-beam extinction measurements carried out for each background star; we also show that in two typical applications (moving average and nearest neighbour smoothing) a bias is expected. In Sect. 3 we propose a new method that can correct this bias without making any assumption on the underlying form of the cloud substructure. This new method has been tested with numerical simulations, as described in Sect. 4, and with a real-case application, presented in Sect. 5. We discuss and comment on the results obtained in Sect. 6, and finally we briefly present our conclusions in Sect. 7.
Measurements of the reddening of stars observed through a molecular
cloud provide estimates of the cloud column densities in pencil beams
characterized by a diameter of the order of a fraction of milliarcsec.
However, these high resolution measurements are strongly undersampled
and are affected by large uncertainties due to the photometric errors
and to the intrinsic scatter of star colors in the NIR. Smooth and
continuous extinction maps are normally obtained by interpolating and
binning the various pencil beams. Different authors use different
recipes to smooth the data (see, e.g. Lombardi 2002,
for a description of several interpolation techniques), but in most
cases the interpolation follows a standard scheme. Consider NK-band extinction
measurements
obtained, for example, from the color excess of the N background
(see Lada et al. 1994 or Lombardi & Alves 2001).
The extinction
at any location in the sky is evaluated using
a weighted average
The binning in Eq. (1) washes out the cloud substructure on
scales smaller than the typical size where the
are
significantly different from zero, but this is needed in order to have
smooth maps and to increase the signal-to-noise ratio. However,
Eq. (1) also introduces a significant bias on the estimated
column density. Suppose that, in the region of the sky that we are
investigating (i.e., in the area covered by the N stars used to
estimate A) the column density has significant variations. Because
of this, the local density
of stars is not homogeneous
throughout the cloud, but rather follows the scheme
The main issue considered in this paper is the bias introduced by
unresolved substructures in Eq. (1). In general, the bias
should be defined here as the difference between the expected, mean
value of
and the true column density A at
the same position. However, clearly any smoothing technique
introduces a bias because of the smoothing itself, and this bias is
normally acceptable if it does not introduces a systematic effect on
the total column density. In other words, a required
property is
In the following, we will calculate explicitly this bias in two common smoothing schemes.
In this section we will consider a simple weighting scheme in
Eq. (1) where each weight
is a
simple function of the location
of the nth star
(typically,
will depend only on the modulus
).
As discussed above, our aim is to compare the expected column density
estimate with the one that we would obtain in the absence of any
selection effect in the number of background stars. We need thus to
evaluate two averages: (i)
,
where
is calculated according to Eq. (1) and the
ensemble average is taken over all positions of stars with the density
given by Eq. (2) and over all individual column density
measurements
;
and (ii) the average
,
which
is basically the same quantity evaluated with a constant density of
background stars
.
In principle, both averages can be calculated analytically using the
method described by Lombardi & Schneider
(2001; see below
Sect. 3); in practice, for the purposes of
this section a simple approximation can be used provided that in the
weighted average of Eq. (1) a relatively large number of
column densities are used with weights significantly different from 0.
In this case we find for the first average
It is also interesting to evaluate the bias
in the presence of small variations
of A within the region considered. In this case, the probability
distribution
is peaked around the mean value of
Eq. (7), and we can expand to second order the two
exponential functions present in the numerator and denominator of
Eq. (6). After a few manipulations we obtain then
Cambrésy et al. (2002) suggest using a different prescription to make smooth extinction maps from the individual column density measurements. For each point in the map, they take the average extinction of the N angularly closest stars (nearest neighbours interpolation). As argued by Cambrésy et al. (2006,2005), this method can potentially alleviate the bias introduced by the varying background density of stars described by Eq. (2), because measurements from low density regions will mostly appear isolated and will thus be used for relatively large patches of the sky.
Interestingly, using the technique described by
Lombardi (2002) it is possible to obtain an analytical
estimate for the average of
in the smoothing scheme proposed:
![]() |
Figure 1:
The bias of the nearest neighbors interpolation on a model
where the true extinction is
|
| Open with DEXTER | |
In order to better understand the points mentioned in the previous
paragraph, it is useful to look at a simple example. Consider a cloud
characterized by a Heaviside column density:
One might wonder whether the bias decreases by using a median
estimator instead of a simple mean, as done by
Cambrésy et al. (2006,2005). These authors still
identify, for each sky position
,
the N nearest neighbour
stars, and then evaluate a simple median of the relative extinction
measurements. Luckily, it is possible to evaluate the exact
statistical properties of the median estimator (see
Lombardi 2002, Appendix A). For this purpose, it is
useful to evaluate the probability distribution
that
one of the N neighbours around the position
has a column
density A:
![]() |
Figure 2:
Similarly to Fig. 1, but for the median-nearest
neighbors interpolation. Again, the solid lines, representing the
expected mean measurements, are always below the dashed lines,
representing the same measurements in the ideal case where the
density of background stars is uniform within the whole field.
Note that because of the properties of the median, for large
values of N the ``transition'' from
|
| Open with DEXTER | |
In summary, a careful analysis of the nearest neighbors shows that
this interpolation technique, for the specific case of extinction
measurements, is still affected by a significant bias. At least for the case of a simple mean estimator, the origin of the problem can be traced to the different behaviour of the kernel
with respect to the density
in Eq. (9): in particular, the kernel K does not respond directly to variations of
,
and instead depends only
,
i.e. on the total ``mass'' within a disk of radius
.
As shown by Eq. (8), the bias discussed here strongly depends on the small-scale structure of the molecular cloud through the probability distribution pA(A). Although on large scales many models predict a log-normal distribution for pA, clearly we should expect significant deviations from the theoretical expectations on the small scales often investigated in NIR studies. In addition, even at large scales the superposition of different cloud complexes, or a significant ``thickness'' of a cloud along the line of sight, can produce significant deviations from a log-normal distribution (Vázquez-Semadeni & García 2001; see also Lombardi et al. 2006, for a case where the log-normal distribution is not a good approximation). Hence, we need to be able to correct for the substructure bias in a model independent way.
![]() |
Figure 3:
Similarly to Fig. 1, but for the moving average
smoothing with a Gaussian window function with different dispersion parameters |
| Open with DEXTER | |
In order to better understand, and then fix, the bias present in
Eq. (1) in the case of the moving average smoothing, we use
the theory developed in Lombardi & Schneider (2002,2003,2001). The key equations to
obtain the ensemble average of
are summarized below:
The bias of Eq. (1) is essentially due to a change in the
density of extinction measurements as a function of A(Eq. (2)). In the framework of the moving average smoothing,
it is thus reasonable to try to ``compensate'' for this effect by
including in the weights of the estimator (1) a factor
:
In order to better understand the properties of the estimator (1) with the new weight, we evaluate the average
.
For simplicity, initially we will ignore measurement errors and focus on the effects introduced by a
non-uniform density. We first note that Eq. (28) is equivalent to the use of a weight
,
where
In Fig. 4 we show the effects of the proposed smoothing
technique on the model discussed in Sect. 2.2.
A comparison with Fig. 3 shows that the NICER
interpolation is very effective in reducing the bias of the simple
moving average interpolation, especially for relatively large values
of the smoothing factor
(and thus of the weight number of
objects
).
![]() |
Figure 4:
Similar to Fig. 3, but for the NICEST
moving average smoothing (see Sect. 3). In this case a significant bias is observed only for
|
| Open with DEXTER | |
We now consider the effects of measurement errors on the modified estimator. For simplicity, we consider the limit of a large number of stars (
), so that
(in any case, as shown above, the technique proposed is inefficient when
is of the order of unity). In this case, a simple calculation shows that measurement errors introduce a bias
of the order of
In summary, we propose to replace the estimator (1) with
If one is ready to accept a small bias on
for extinction
measurement uncertainties, it is also possible to use
Eq. (34) without the last term. In this respect, we also
note that the correcting factor present in Eq. (34) is
independent of the local extinction, and only depends on the
individual errors on the measurements (these typically are a
combination of the photometric uncertainties and of the intrinsic
scatter of colors of the stellar population considered). As a result,
it is not unrealistic to expect that this factor is approximately
constant within the field of observation, and that it is equally
present in the control field used to fix the zero of the extinction.
Finally, we note that in NICEST a central role is played by
the slope of the number counts
,
operationally defined through
Eq. (2). In reality, however, it clear that Eq. (2)
cannot strictly hold, because there is an upper limit on the
(un-reddened) magnitude of stars. Hence, depending on the limiting
magnitude of the observations, there is an upper limit to the
measurable extinction of stars. As a result, in regions where the
extinction is greater than this limit, the density of background stars
vanishes, with the result that in these cases even NICEST
cannot be unbiased. The exact value of the upper measurable
extinction depends on many factors, including the waveband
and limiting magnitude
of the observations, and the
distance d of the molecular cloud. An approximate relation for the
maximum extinction
is
![]() |
Figure 5:
Top left: the original 2MASS/NICER map of
the Pipe nebula used in the simulations. Top right: the
mean reconstructed map that one would obtain if the density of
background stars were uniform. This is basically a smoothed
version of the original map on the left. Levels are spaced at
|
| Open with DEXTER | |
Foreground stars do not contribute to the extinction signal, but do
contribute to the noise of the estimators (1) and (34), and thus whenever possible they should be excluded from
the analysis. Usually, foreground stars are easily identified in high
(
)
column density regions, but are almost
impossible to distinguish in low and mid extinction regions. So far
we assumed that all stars are background to the cloud, but clearly in
real observations we should expect a fraction f of foreground
objects. In principle, if this fraction is known, we could correct
the column density estimate
into
:
in this
way, we would obtain an unbiased estimator of the column density at
the price of increased noise (due to foreground objects). In
practice, the fraction of foreground stars is itself an increasing
function of the local column density, i.e. f = f(A), because of the
decreased density of background stars in highly extincted regions.
Hence, the simple scheme proposed above is not easily implemented.
![]() |
Figure 6:
Left: average reconstructed map using a 1-star (N
= 1) Voronoi method, assuming a uniform distribution of stars. Levels are spaced at
|
| Open with DEXTER | |
Interestingly, NICEST perfectly adapts to the presence of
foreground stars. Suppose that in regions with negligible
extinction a fraction
of stars is
foreground to the cloud: then, in our notation, we can rephrase this
by assigning a non-vanishing probability to
pA(A = 0), i.e. by
treating foreground stars as a special case of substructure (much like
``holes'' in the cloud). As we consider higher density regions, the
fraction f(A) of foreground stars increases but, because of the
correcting factor in wn, we still expect to measure
(1 - f0) A:
in other words, the estimator
,
with
given by Eq. (34) is expected to be unbiased both for
small-scale inhomogeneities and for foreground contamination. Note
that the correcting factor
(1 - f0)-1 is usually very close to
1 for nearby molecular clouds, and can be easily evaluated by
comparing the density of foreground stars (easily measured in highly
extinguished regions) with the total density of stars in regions free
from extinction.
![]() |
Figure 7:
Same as Fig. 5 ( bottom), but with
a fraction of f = 0.05 of foreground stars. The increases of
relative number of foreground stars in the denser regions of the
cloud makes the bias more pronounced. Still, NICEST
performs very well. Contours levels are spaced at
|
| Open with DEXTER | |
In order to test the effects of substructures and the reliability of
the method proposed here in a real case scenario we performed a series
of simulations. We started from a 2MASS/NICER map of the
Pipe nebula (Lombardi et al. 2006), a nearby complex of molecular
clouds seen in projection to the Galactic bulge, an ideal case for
extinction studies (see also Alves et al. 2008; Onishi et al. 1999).
We took this map as the true dust column density of a cloud
complex, and simulated a set of background stars. In order to show
the effects of substructures, we used a very low density of background
stars (
instead of the original
used to build the 2MASS/NICER map). This, effectively, corresponds to a linear downsizing of the structures of the Pipe nebula by a factor of
60.
The simulations were carried out using the following simple technique.
We generated a set of background stars uniformly distributed on the
field of view. The stars were characterized by exponential number
counts with exponent
in the K band and by intrinsic
color and color scatters similar to the ones observed in the 2MASS
catalog (see Table 2 of Lombardi 2005, for a complete
list of the parameters used). We added to each star magnitude the
local value of the 2MASS/NICER extinction and also some
random measurement errors:
We repeated the whole procedure 1 000 times, using each time a
different set of random background stars, and took the average maps
obtained with the three techniques. We then compared
these average maps with similar maps obtained from uniformly
distributed stars (in order to produce such maps, we applied the
completeness limit to un-extinguished magnitudes).
Figures 5 and 6 show the results obtained, which
support NICEST. In particular, both the simple moving
average and the nearest neighbor suffer from significant biases, up to
or above, particularly in the most dense
regions of the Pipe nebula. As shown in various papers
(e.g. Lombardi et al. 2006; Lada et al. 1994; Lombardi et al. 2008),
the amount of small scale structure present in dark molecular clouds
increases with the column density, and thus it is not surprising that
larger biases are observed there. In contrast, the method presented
in this paper reduces the bias by
10, a factor comparable to
the weight number of stars
(which in our simulations is
in the higher column density regions).
The simulation performed here allows us also to evaluate the average
quadratic difference between the extinction map obtained for the
various methods
and the true (smoothed) map
,
i.e. the quantity
Figure 7 shows the result of simulation carried out with the presence of foreground stars. In particular, we generated stars following the same prescriptions described in the previous paragraph, but allowing for a fraction f0 = 0.05 of foreground objects. Because of the effects of extinction, the effective fraction of foreground objects increases in the most dense regions of the cloud, which usually are also the ones with the larger substructures: the two biases thus add up and can be particularly severe. As shown by Fig. 7, NICEST is effectively able to cope with relatively large fractions of foreground stars while still providing a virtually unbiased column density estimate.
![]() |
Figure 8: The NICEST extinction map of the Pipe nebula, using the modified estimator (28). |
| Open with DEXTER | |
The method presented in this paper was finally applied to the whole
2MASS point source catalog for the Pipe nebula region. We used the
4.5 million stars of the 2MASS catalog located in the window
The final, effective density of stars of about 8 stars
per pixel guarantees that the approximation used to derive the
unbiased estimator (34) is valid, and that a significant
improvement over the standard NICER method can be expected.
The highest extinction was measured close to Barnard 59, where
(a value that is
higher
than what was obtained in Lombardi et al. 2006).
Figures 9 and 10 show a comparison between dense
regions mapped using NICER and NICEST. We note
that, as expected, the two methods are equivalent in low-density
regions, while the new one consistently estimates higher column
densities as A increases. The same effect can be appreciated more
quantitatively from Fig. 11, where we plot the relationship
between the average estimates
obtained in
Lombardi et al. (2006) and here.
Finally, we argue that the plot of Fig. 11 is strongly
related to Fig. 9 of Lombardi et al. (2006) where we show the
increase in the scatter of
for the different stars used at
each point of the extinction map as a function of the average
at the point. This A-
relation is most likely due to
unresolved substructures in the cloud that are expected to be more
prominent in high-density regions. Recently, we reconsidered the
A-
relation and defined a quantity called
(Lombardi et al. 2008). This quantity is simply related to the local
scatter of measured extinctions, but also properly takes into account
the contribution of measurement errors to the observed scatter.
Interestingly, this quantity can be defined in terms of simple
observables, and can be shown to be directly related to the local
scatter of column densities:
![]() |
Figure 9:
The difference between the modified extinction estimates
(Eq. (28)) and the standard ones [Eq. (1)] around
Barnard 59. The contour levels are at
|
| Open with DEXTER | |
![]() |
Figure 10:
The difference between the modified extinction estimates
(Eq. (28)) and the standard ones (Eq. (1))
around the peak ID 3 of Lombardi et al. (2006). The
contour levels are at
|
| Open with DEXTER | |
![]() |
Figure 11:
The average difference |
| Open with DEXTER | |
The numerical simulations and a first sample application of NICEST have shown that the method presented in this paper can significantly alleviate the bias introduced by small-scale structures and by foreground stars in extinction studies. Of course, NICEST too has some limitations, which however are largely unavoidable (and thus inherent to any extinction-based method).
First, we note that in order for NICEST to work effectively,
the weight number of background stars
must be
significantly greater than unity: as shown in
Sect. 3.2,
is directly related to
the reduction in the bias provided by NICEST with respect to
NICER, and thus having
does not give any
benefit. This point is also important when correcting for foreground
star contamination. For example, if
and the
local fraction f of foreground stars is large (e.g., because we are
in a particularly dense core), on average we do not expect any
background stars, and we will thus consistently measure a vanishing
extinction. Hence, NICEST, like any other extinction-based
method, only works if there are a sizable number of background stars
that can be used for reliable extinction measurements.
The above point might be interpreted as an exceedingly stringent
requirement for the smoothing window used in NICEST. In
fact, if a weight function
invariant upon translation is used, this function should be taken
broad enough to guarantee that the weight number of background stars
is significantly larger than unity everywhere.
This, in turn, would imply that
in the
intermediate extinction regions, i.e. that the extinction map in
these regions has a poor resolution, well below the limits imposed by
the density of background stars. In reality, the whole derivation of
the statistical properties of the NICEST technique is still
valid for weight functions which are not spatially invariant. Hence,
one does not need to use a fixed window size for
,
but rather this function could be taken to change shape for different
locations
.
For example, a simple scheme could be the use of
a Gaussian shape for
with the typical scale
chosen according to a local estimate of the density of background
stars, in a way such that
.
This choice would guarantee an optimal resolution everywhere,
and would still allow one to make use of the NICER technique
to reduce the inhomogeneity bias.
Another point to keep in mind is that NICEST gives a large
weight to red sources. This has two potential unwanted consequences.
First, it introduces a small bias, that has been corrected to first
order in Eq. (34). Second, as shown also by the numerical
simulations described in Sect. 5, it
slightly increases the noise in the final map. Again, since the noise
in final map
is proportional to
,
the solution is to make sure that
is sufficiently large.
We believe that a small increase in the noise is a fair price to pay
for the significant reduction in the bias provided by NICEST
over NICER (see also Eadie et al. 1971).
A potentially more severe problem can be young stellar objects (YSOs) with infrared excess. These sources can be present in the most dense star-forming cores of molecular clouds, where they can severely bias our method. We note, however, that since these sources have peculiar colors and are not usually present in the control field used for the calibration, they represent a problem for all extinction based methods, including NICER and the nearest neighbour ones. A median estimator is in principle safer to use in these cases, as long as the YSOs present in the core are a minority of the total number of background stars observed in the region; unfortunately, the tendency of YSOs to appear in clusters does not help here (note also that a median is usually more noisy than the simple average). The obvious solution is thus to exclude as much as possible YSOs from the list of sources used in the extinction maps.
Finally, let us briefly mention alternative possibilities to reduce
the bias considered in this paper. Since the substructure bias is due
to a relationship between the local density
and the
extinction map
(Eq. (2)), one could try to use a
tracer of the local density to perform the correction. We already
discussed a method based on this concept, the nearest neighbours
interpolation. However, as shown in Sect. 4, this
method fails (see also Sect. 2.2). In
general, a problem with methods based on the local estimate of the
density of background stars is that one needs to be able to obtain
on scales smaller than the ones that characterize the
weight
,
or otherwise it is not possible to give
different weights to the different stars that contribute significantly
to the weighted average of Eq. (1). However, if the density
of background stars is large enough to allow accurate measurements of
the local density
,
one can disregard substructures,
because it is always possible to make extinction maps at the same
resolution as the density map. Hence, the only effective way to deal
with substructures involves the use of the local extinction
measurements, as done in NICEST. Note also that any
density-based correction will be completely ineffective with
foreground stars, in contrast to the method presented here.
The main results obtained in this paper can be summarized in the following points:
Acknowledgements
We thank J. Alves, C. J. Lada, and G. Bertin for stimulating discussions and suggestions, and the referee, L. Cambresy, for helping us improve the paper. This research has made use of the 2MASS archive, provided by NASA/IPAC Infrared Science Archive, which is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration.