Issue | A&A Volume 515, June 2010
Article Number | A51
Number of page(s) | 9
Section | Catalogs and data
DOI | https://doi.org/10.1051/0004-6361/200913000
Published online | 08 June 2010
Searching for dark clouds in the outer galactic plane
I. A statistical approach for identifying extended red(dened) regions in 2MASS
W. W. F. Frieswijk1 - R. F. Shipman2,1
1 - Kapteyn Astronomical Institute, University of Groningen, PO Box 800, Landleven 12, 9700 AV Groningen, The Netherlands
2 - SRON, National Institute for Space Research, PO Box 800, Landleven 12, 9700 AV Groningen, The Netherlands
Received 28 July 2009 / Accepted 29 January 2010
Abstract
Context. Most of what is known about clustered star
formation to date comes from well studied star forming regions located
relatively nearby, such as Rho-Ophiuchus, Serpens and Perseus. However,
the recent discovery of infrared dark clouds may give new insights into
our understanding of this dominant mode of star formation in the
Galaxy. Though the exact role of infrared dark clouds in the formation
process is still somewhat unclear, they seem to provide useful
laboratories to study the very early stages of clustered star
formation. Infrared dark clouds have been identified predominantly
toward the bright inner parts of the galactic plane. The low background
emission makes it more difficult to identify similar objects in
mid-infrared absorption in the outer parts. This is unfortunate,
because the outer Galaxy represents the only nearby region where we can
study effects of different (external) conditions on the star formation
process.
Aims. The aim of this paper is to identify extended red regions
in the outer galactic plane based on reddening of stars in the
near-infrared. We argue that these regions appear reddened mainly due
to extinction caused by molecular clouds and young stellar objects. The
work presented here is used as a basis for identifying star forming
regions and in particular the very early stages. An accompanying paper
describes the cross-identification of the identified regions with
existing data, uncovering more on the nature of the reddening.
Methods. We use the Mann-Whitney U-test, in combination with a
friends-of-friends algorithm, to identify extended reddened regions in
the 2MASS all-sky JHK survey. We process the data on a regular grid using two different resolutions, 60″ and 90″.
The two resolutions have been chosen because the stellar surface
density varies between the crowded spiral arm regions and the sparsely
populated galactic anti-center region.
Results. We identify 1320 extended red regions at the higher
resolution and 1589 in the lower resolution run. The linear extent of
the identified regions ranges from a few arc-minutes to about a degree.
Conclusions. The majority of extended red regions are associated
with major molecular cloud complexes, supporting our hypothesis that
the reddening is mostly due to foreground clouds and embedded objects.
The reliability of the identified regions is >99.9%. Because we
choose to identify objects with a high reliability, we cannot quantify
the completeness of the list of regions.
Key words: methods: statistical - catalogs - stars: formation - ISM: clouds - dust, extinction
1 Introduction
Dark clouds represent the earliest stages of star formation in the Galaxy. Nearby dark clouds have been studied in great detail and provide a wealth of information on the low-mass isolated star formation process (e.g., Shu et al. 1987). However, most stars do not form in isolation but in groups or clusters (Lada & Lada 2003), and this clustered mode of star formation is not well understood. Some key questions of current star formation research are the origin of the stellar mass distribution, the relation between high-mass star formation and stellar clusters and the effect of environment on the star formation process. See Zinnecker & Yorke (2007) for a recent review of this topic.
In the mid-1990s, infrared surveys with the Infrared Space Observatory (ISO)
and the Midcourse Space Experiment (MSX) revealed a class of dark clouds,
referred to as infrared dark clouds (IRDCs; Egan et al. 1998; Pérault et al. 1996).
These clouds are observed in silhouette against the bright infrared background
emission of the galactic plane.
IRDCs are typically cold (<25 K) and dense ($10^5$ cm$^{-3}$), with masses ranging
from 100 to $10^5\,M_\odot$
(Carey et al. 2000; Egan et al. 1998; Carey et al. 1998), and they are
located at larger distances (>1 kpc) compared to well-studied low-mass star forming
regions.
A recently published catalog of dark clouds extracted from the GLIMPSE Survey by
Peretto & Fuller (2009) shows
that most IRDCs have column densities below N(H
cm-2,
with some extreme objects reaching N(H$_2$) $> 10^{23}$ cm$^{-2}$.
This new class of dark clouds is believed to represent the very
early stages of clustered star formation, and its members are hence commonly referred to as cluster-forming
clumps (e.g., Rathborne et al. 2006).
While most of the star-forming activity in the Galaxy takes place in the inner
spiral arms and the molecular ring, significant activity also occurs beyond the solar
circle (
kpc). Prominent examples include well studied nearby (<500 pc)
regions such as the Orion and Taurus-Perseus-Auriga complexes as well as more distant
(>2 kpc) active star forming regions such as W3 and NGC 7538.
However, the outer Galaxy is generally somewhat neglected in studies of star formation
compared with the inner Galaxy, which is unfortunate because it is a useful laboratory
to study the effect of different external conditions on the process that initiates stellar
birth. In particular, the metallicity, pressure, and radiation field in the outer Galaxy are
considerably different from the inner Galaxy (Brand & Wouterloot 1995; Rudolph et al. 2006), which makes
it by far the most nearby region to study the effect of such variations on star
formation.
The aim of this work is to locate star-forming regions in the outer galactic plane. In order to find concentrations of molecular gas in the outer galactic plane, where the mid-infrared background is insufficient to measure absorption, we use the extinction to background stars. The traditional method to do this is via star counts (e.g., Wolf 1923), which is limited by extinction to nearby clouds, especially when using optical data. This problem can be avoided by using near-infrared data, but pick-up of emission from embedded objects makes this method unreliable.
An alternative to star counts is to use the near-infrared color excess (NICE, Lada et al. 1994), which is widely used to map the extinction toward dark clouds (e.g., Cambrésy et al. 2002; Lada et al. 1999; Alves et al. 1998). However, this method relies on knowledge of the distance to the object, so that a correction for foreground stars can be made. The presence of foreground stars can reduce the measured color excess and hence may significantly influence an identification based on it (Lombardi 2005). Furthermore, identifying objects in an automated, uniform way toward large regions in the sky is difficult using NICE because the extinction law is not uniform in every direction. Also, an average reference color of background stars is required, which is usually adopted from a nearby, well selected, extinction-free region.
The identification of dark clouds by the extinction they impose upon background stars requires a method that:
1. determines the extinction from the colors of stars,
2. can be applied to small numbers of stars (for high resolution),
3. can be treated uniformly in all directions,
4. puts a confidence level to measurements,
5. is independent of foreground stars.
In two papers, we describe the identification process of candidate regions in the outer galactic plane with the goal to build up a sample of objects that can be used for future studies of (clustered) star formation throughout the Galaxy.
In this first paper we describe the Mann-Whitney U-test. This distribution-free statistical test can be applied uniformly to large data-sets in order to identify parts of the data with deviating properties. The purpose of this paper is to identify extended red regions in the sky using data from the Two Micron All Sky Survey (2MASS). In an accompanying paper (Frieswijk et al. 2010, hereafter Paper II) we perform a cross-correlation between the red regions presented here and existing data (optical dark clouds, MSX, IRAS and CO where available). The purpose of the second paper is to provide an indication of the actual nature of the reddening toward the identified regions.
In Sect. 2 we summarize the 2MASS data and discuss why we use the $(H-K_{\rm s})$ colors for the identification.
In Sect. 3 we explain the Mann-Whitney U-test and describe the procedure that we
follow in order to determine the statistics.
In Sect. 4 we describe the friends-of-friends approach that is used to extract extended
red regions from the reliability images produced by the U-test.
In Sect. 5 we present the results. A catalog of the extended red regions is available
online
and at the CDS. The reliability
images of the entire outer galactic plane are made available in FITS format
from the same website.
The final section summarizes our main conclusions.
2 Data: the two micron all sky survey
The Two Micron All Sky Survey (2MASS) scanned the entire sky uniformly in three
near-IR bands: J (1.25 μm), H (1.65 μm) and $K_{\rm s}$ (2.17 μm).
The 2MASS Point Source Catalog (2MASS PSC, Skrutskie et al. 2006)
consists of accurate positions (astrometric accuracy rms <200 mas) and brightness
information for over 400 million stars.
The point source catalog is >99% complete up to J < 15.8, H < 15.1
and $K_{\rm s}$ < 14.3 mag. The photometric signal-to-noise ratio is >10
(i.e., photometric uncertainties below roughly 0.1 mag) for sources brighter than the above
stated completeness limits. The so-called faint extension of the point source
catalog includes sources that reach 0.5 to 1.0 mag beyond the above limits.
The completeness, reliability and uniformity of the faint extension sources are not as
good as the high-reliability PSC and photometric errors increase up to
mag.
We make use of all available sources in 2MASS, including the faint extension.
This increases the total number of objects by almost a factor 2 compared to star count
studies, where the faint extension cannot be used because the data are incomplete.
The $(H-K_{\rm s})$ color is an appropriate choice to perform the identification
of red regions because the intrinsic $(H-K_{\rm s})$ color of stars spans only a narrow range.
For spectral types A0V to M6III the range is about 0.00 to 0.30 mag
(Bessell & Brett 1988; Wainscoat et al. 1992). In the Solar neighbourhood the average
$(H-K_{\rm s})$ color is about 0.13 mag with a standard deviation of 0.1 mag. This assumes no reddening
due to extinction and takes into account the sensitivity limit of 2MASS
(14.3 mag for the $K_{\rm s}$-band). By including the faint extension, the average
$(H-K_{\rm s})$ color increases slightly (to 0.15 mag) due to faint, intrinsically
redder M-dwarfs. If a region appears significantly redder, this suggests that either
foreground extinction is present or the intrinsic color distribution of stars in that
direction is different. The latter can be important if for example a group of
young stellar objects, which are intrinsically redder, is located in the observed
field. Though such regions will be identified as being intrinsically red rather than
reddened, they still represent star forming regions. A discrimination between
the two, which is not obvious from the work in this paper, can be done by
looking at other available data and will be discussed in Paper II.
We analyse the 2MASS data in the outer Galaxy from l=90° to 270° and b=-3.5° to +3.5°.
In order to process the data on a desktop computer,
we divided the area up in regions of
,
with a sampling separation
between different fields of one degree.
3 Method
3.1 The Mann-Whitney U-test
The goal of this project is to identify, based on near-IR
colors, those regions which
are redder compared to the background. One potential approach is to calculate an
average color of a few stars and test whether that average is different than a
background sample of stars.
This approach has some significant drawbacks: how many sample stars should be used and
what is the stellar color of the background given that this value changes over the
Galactic Plane? Is the average the correct statistic to calculate? Why not use the median
or some other statistic?
Furthermore, the distribution of the colors of stars is not a standard parametric
distribution. It is definitely not a Normal distribution which is needed if we want to
use the average colors of stars and calculate a confidence based on a standard deviation.
What is needed is a statistic, which does not depend on the distribution of stellar colors
and which works when comparing small samples against a background of many stars in a
robust manner.
The Mann-Whitney U-test (also referred to as the Wilcoxon rank-sum test, hereafter
referred to as U-test) is a well-known non-parametric significance test first proposed by
Wilcoxon (1945) and extended to arbitrary sample sizes by Mann & Whitney (1947).
The statistic is based on the rank of the observation and not the observation value itself.
This makes it a non-parametric statistic. The test is used to assess whether two samples
of observations, say sample A and B, come from the same distribution.
Samples A and B are required to be independent and the observations should be ordinal or
continuous measurements.
The U-test is a hypothesis test, where the hypothesis is that the two samples come
from the same distribution. This is called the Null hypothesis. The statistic that is
calculated (U) is the sum of the ranks that the observations of one sample, in our case the
colors, receive in the joint ranking of both samples.
An example
The distribution of the sum of ranks of A and B is readily calculated for samples
drawn from the same parent distribution. Take the case of 3 observations against 3 other
observations. If the observations are truly randomly drawn from the same parent
distribution, then there is only a small chance that sample A contains the three highest values
(ranks 4, 5, 6), leaving sample B with the three lowest ranks
1, 2, 3. One would rather expect a more random mix, e.g., 1, 4, 6 and 2, 3, 5.
In the example above the sum of the ranks in the first case, UA=15, UB=6, is less
likely than the second sum of ranks, UA=11, UB=10. In this way the distribution
of rank sums is built up.
If the rank sum statistic is UA=15, the probability of this occurring randomly is small (5%), which means the Null hypothesis can be rejected on a statistical basis to a particular level of confidence (95%), in favor of the alternative or test hypothesis that the values in sample A are larger than those in sample B. The alternative hypothesis must be created in such a way that it is the only alternative to the Null hypothesis. Rejecting the Null hypothesis means that the alternative hypothesis must be accepted at that level of confidence.
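The 3-against-3 example can be checked by brute force. The following is a minimal sketch, assuming no tied values; the function name `rank_sum_distribution` is introduced here for illustration only. It enumerates every way of assigning 3 of the 6 joint ranks to sample A and tabulates the resulting rank sums. Of the 20 possible splits, only one gives UA=15 (the 5% quoted in the text), while three give UA=11.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations


def rank_sum_distribution(m, n):
    """Exact Null distribution of the rank sum U_A for sample sizes m and n.

    Assumes no ties, so the joint ranks are simply 1..m+n and every choice
    of m ranks for sample A is equally likely under the Null hypothesis.
    """
    ranks = range(1, m + n + 1)
    counts = Counter(sum(chosen) for chosen in combinations(ranks, m))
    total = sum(counts.values())
    return {u: Fraction(k, total) for u, k in sorted(counts.items())}


dist = rank_sum_distribution(3, 3)
for u, p in dist.items():
    print(f"U_A = {u:2d}   P = {float(p):.2f}")
```

For the larger samples used in practice this enumeration quickly becomes impractical, which is why tables or the Normal approximation are used instead.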
The U-test can be applied to large scale surveys in order to identify regions where source properties deviate significantly with respect to a reference field. We use the U-test statistic to test whether a set of colors of stars is different than the set of colors of stars from another sample. We test against the Null hypothesis that the set of colors is actually the same. When the test rejects the Null hypothesis (to a certain confidence level), we must accept the test hypothesis that the colors are actually different. As can be seen from the example above, there is a fundamental difference if UA is greater or less than UB. For near-IR colors this means we could test for stars redder or bluer than the background stars.
For this work, we want to compare as few stars as possible against a background of tens of thousands. The benefit of using a non-parametric test is that whatever size the two samples have, the distribution of the U-test statistic is known from pre-calculated tables (e.g., Wall & Jenkins 2003), unlike for a parametric test. However, tables for finding the probabilities of U usually do not contain values for samples with size in excess of 20. For large sample sizes though, the distribution will approach a Normal distribution thanks to the Central Limit Theorem. The significance of a measured U then follows directly from the mean and standard deviation of that Normal distribution. Moses (1964) stipulates that a minimum sample size of 5 is required to still be able to use the Normal approximation for the U-test. For a parametric test such as the average, a Normal approximation is only valid for higher sample sizes. This is evaluated in Sect. 3.5.
3.2 Applying the U-test procedure
We perform our calculations on a regular grid. For each grid-cell the distribution of properties, i.e., star colors, is referred to as sample A. A reference field represents sample B and is determined locally. The choice of the reference field is explained in Sect. 3.3. The size of the grid-cells can be chosen at will, but a minimum number of sources (5; see Sect. 3.5) is required to retrieve a reliability based on the Normal approximation. The choice of the resolution is explained in more detail in Sect. 3.4.
The following steps are involved in the calculation of the probability P for any given grid-cell, where P is the probability that a cell contains a redder color distribution compared to the reference field. The whole procedure is schematically given also in Fig. 1.
1. Let A be the distribution of colors in the cell (with m members) and B the local reference distribution (with n members, where n is very large in this study, i.e., $n \sim 10^{4}$-$10^{5}$). The Null hypothesis (H0) is that A and B have the same parent population, or in perspective of our work, that the parent population of A is given by B. Note that when m is less than 5, the probability P is set to 50% and the next cell is processed.
2. Rank the colors in ascending order for the combined members of A and B. Preserve the A or B identity for each member.
3. Sum the ranks of the A members to get the statistical value UA.
Figure 1: Schematic representation of the U-test procedure.
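For large samples, the sampling distribution tends to a Normal distribution
(see also Sect. 3.5) with a mean $\mu_U$ and variance $\sigma_U^2$ given by
$$\mu_U = \frac{m(N+1)}{2}, \qquad \sigma_U^2 = \frac{mn(N+1)}{12}, \tag{1}$$
respectively, where N=m+n. The significance, z, can then be assessed from the Normal distribution by calculating
$$z = \frac{U_A - \mu_U \pm 0.5}{\sigma_U}. \tag{2}$$
The term $\pm 0.5$ is a continuity correction, which accounts for the rank sum taking only discrete values while the Normal distribution is continuous. The probability P then follows from the cumulative Normal distribution,
$$P = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{z} e^{-x^2/2}\,{\rm d}x. \tag{3}$$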
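The three steps and the Normal approximation can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' code: it uses the standard large-sample mean m(N+1)/2 and variance mn(N+1)/12 of a rank sum (taken here as an assumption about Eqs. (1)-(3)), gives tied values their average rank, and omits both the continuity correction and any tie correction to the variance. The names `rank_sum` and `probability_redder` are hypothetical.

```python
import math


def rank_sum(sample_a, sample_b):
    """Sum of the joint ranks of sample_a; tied values share their average rank."""
    tagged = sorted([(v, True) for v in sample_a] + [(v, False) for v in sample_b],
                    key=lambda t: t[0])
    u_a = 0.0
    i = 0
    while i < len(tagged):
        j = i
        while j < len(tagged) and tagged[j][0] == tagged[i][0]:
            j += 1
        mid_rank = (i + 1 + j) / 2.0  # average of the ranks i+1 .. j
        u_a += mid_rank * sum(1 for k in range(i, j) if tagged[k][1])
        i = j
    return u_a


def probability_redder(cell_colors, reference_colors, min_sources=5):
    """Probability that the cell is redder than the reference (Normal approximation)."""
    m, n = len(cell_colors), len(reference_colors)
    if m < min_sources:
        return 0.5  # too few stars: no decision, as in step 1 of the procedure
    big_n = m + n
    mu = m * (big_n + 1) / 2.0
    sigma = math.sqrt(m * n * (big_n + 1) / 12.0)
    z = (rank_sum(cell_colors, reference_colors) - mu) / sigma
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))  # cumulative Normal


# Toy data (not 2MASS): a reference field and one clearly redder cell of 8 stars.
reference = [0.001 * i for i in range(1000)]
red_cell = [2.0 + 0.1 * i for i in range(8)]
print(probability_redder(red_cell, reference))
```

A cell would then be flagged as red when the returned probability exceeds the chosen rejection level.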
3.3 Reference distribution
A single reference distribution is almost certainly not representative for every field along the galactic plane. First, because the stellar population, and thus the color distribution, may vary with galactic coordinates. Second, because foreground extinction will be present everywhere. The reddening caused by the extinction depends on the amount of foreground material and on the dust properties toward different lines of sight. Although the overall extinction features are interesting, the target for this work is to find extreme color changes relative to the local environment. Therefore we make no attempt to extract large scale extinction features along the galactic plane.
Instead of using a single reference distribution everywhere, we define the distribution of
colors in every
field as a reference for the grid cells in the inner
square degree of the respective field. This means that we select a local reference
distribution as best representation for the colors in each field, thus avoiding the effect
of color variations on galactic scale.
Note that the colors of stars in the cells that are being compared to the local reference
are omitted from the reference distribution itself.
3.4 Resolution limitation
The limiting factor for the spatial resolution that can be achieved in performing the
U-test is determined by the minimum of 5 sources required for the Normal approximation.
As a requirement for the grid-cell size (resolution) we state that in every field at
least 75% of the cells
should contain 5 or more sources. Because the stellar surface density decreases toward
the galactic anti-centre (l=180°, b=0°), we process the data at two different resolutions.
The highest possible resolution, mostly applicable towards the spiral arm regions at
l=90°-140° and l=240°-270°, is 60″.
For the region between l=140°-240° we require a grid with cells of 90″.
Note that we process the entire outer galactic plane at both resolutions
and all data are made available.
The average number of stars is about 8 per cell for the high resolution grid toward the
spiral arm regions as well as for the low-resolution grid toward the anti-center region.
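The 75% requirement can be illustrated with a small occupancy check. The sketch below is an assumption-laden toy: the field size, star count and uniformly random positions are invented for illustration, and `usable_cell_fraction` is a hypothetical helper. It bins star positions into square cells and reports the fraction of cells holding at least 5 sources for both cell sizes.

```python
import random
from collections import Counter


def usable_cell_fraction(positions_deg, field_size_deg, cell_arcsec, min_sources=5):
    """Fraction of grid cells in a square field that hold at least min_sources stars."""
    n_side = int(round(field_size_deg * 3600.0 / cell_arcsec))
    counts = Counter()
    for x, y in positions_deg:
        # Clamp so that a star exactly on the upper edge falls in the last cell.
        ix = min(int(x * 3600.0 / cell_arcsec), n_side - 1)
        iy = min(int(y * 3600.0 / cell_arcsec), n_side - 1)
        counts[(ix, iy)] += 1
    usable = sum(1 for c in counts.values() if c >= min_sources)
    return usable / (n_side * n_side)


# Hypothetical sparse field: 0.5 deg on a side, about 3.5 stars per 60" cell.
rng = random.Random(42)
field = 0.5
stars = [(rng.uniform(0, field), rng.uniform(0, field)) for _ in range(3150)]

for cell in (60.0, 90.0):
    frac = usable_cell_fraction(stars, field, cell)
    print(f'{cell:.0f}" cells: {frac:.0%} contain >= 5 sources')
```

In a sparse toy field like this one, the finer grid falls well short of the 75% criterion while the coarser grid meets it, which is the trade-off that motivates processing at two resolutions.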
3.5 Minimum requirements: a Monte Carlo approach
In the U-test we assume that even for small samples drawn from a large set
of sources, the distribution of the significance values (z) tends to Normal.
With a Monte Carlo simulation we show that the significance distribution of randomly
drawn samples from a large distribution is indeed close to a Gaussian probability density
distribution. We specifically simulate samples of 8 sources because this is the average
number of sources that are present in a grid cell.
The large distribution is extracted from an observed field centered on l=122°, b=1°.
We have performed the same simulation using various other fields to test
whether the results below change when less, or more reddening is present in a field.
We find that this is not the case.
The resulting distribution of z is given in the lower left panel in Fig. 2.
In the upper left panel we display the distribution of z for samples of 5 sources, the
minimum sample size for a proper U-test result according to Moses (1964).
The peak as well as the wings are slightly overestimated by the
Gaussian profile in the simulation for samples of 5 sources, but
increasing to samples of 8 shows that the simulation
converges fast to a Normal distribution. The difference between the
areas under the curves for the wing region (z>2) is less
than 10% (<5% for samples of 8).
By assuming that
the profile is given by a Gaussian distribution, we underestimate the actual probability
of having a red pixel in the regime considered here, i.e.,
corresponding to a probability
% when evaluating the integral over Eq. (3).
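A Monte Carlo experiment of this kind can be sketched as follows. This is a toy under explicit assumptions: the parent distribution is a synthetic, skewed set of colors (an exponential tail, not real 2MASS data), ties are ignored, and the names `simulate` and `wing_fraction` are hypothetical. Each draw takes 8 stars from the parent, treats the remaining stars as the reference, and computes z from the rank sum and, for comparison, from the sample average.

```python
import math
import random


def simulate(parent, m=8, n_draws=6000, seed=1):
    """Return z-values from the rank sum and from the sample mean for random draws."""
    rng = random.Random(seed)
    parent = sorted(parent)
    big_n = len(parent)
    n = big_n - m  # the stars not drawn act as the reference sample
    mu_u = m * (big_n + 1) / 2.0
    sigma_u = math.sqrt(m * n * (big_n + 1) / 12.0)
    mu_c = sum(parent) / big_n
    sigma_c = math.sqrt(sum((c - mu_c) ** 2 for c in parent) / big_n)
    z_rank, z_avg = [], []
    for _ in range(n_draws):
        idx = rng.sample(range(big_n), m)
        u_a = sum(i + 1 for i in idx)  # joint rank = position in the sorted parent
        z_rank.append((u_a - mu_u) / sigma_u)
        mean_c = sum(parent[i] for i in idx) / m
        z_avg.append((mean_c - mu_c) / (sigma_c / math.sqrt(m)))
    return z_rank, z_avg


def wing_fraction(z_values, threshold=2.0):
    """Fraction of simulated z-values in the upper wing."""
    return sum(1 for z in z_values if z > threshold) / len(z_values)


# Synthetic, skewed stand-in for an observed color distribution (assumption).
parent_rng = random.Random(0)
parent = [0.05 + parent_rng.expovariate(10.0) for _ in range(20000)]

z_u, z_mean = simulate(parent)
normal_wing = 0.5 * (1.0 - math.erf(2.0 / math.sqrt(2.0)))  # about 2.3%
print(f"Normal expectation : {normal_wing:.4f}")
print(f"U-test wing (z>2)  : {wing_fraction(z_u):.4f}")
print(f"average wing (z>2) : {wing_fraction(z_mean):.4f}")
```

Because the rank sum only depends on the positions of the drawn stars in the sorted parent, its z-distribution does not depend on the shape of the parent, whereas the z-distribution of the average inherits the parent's skewness.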
We compared the simulated results of the U-test with two parametric tests: the average
and median. The right panels of Fig. 2 show that the distribution of
average values deviates much more from a Gaussian profile compared to the U-test.
The distribution is clearly skewed and cannot be represented by a simple profile.
The area in the wing (
mag) is underestimated by
50% (
100% for 8 samples). The shape of the distribution of average
values approaches a Normal distribution only for sample sizes
20, but the area
in the wing remains underestimated.
Almost identical results are found for the distribution of median colors, with a
comparably large discrepancy in the wing area.
The average and median colors are often used, e.g., for near-infrared color excess (NICE). Even though the median value is less sensitive to (extreme) outliers in the distribution than the average, the main issue is, that both are not well represented by simple profiles. This makes it difficult, if not impossible, to put a significance to the measured values. This justifies the use of the non-parametric U-test statistics for the automated identification process presented in this paper.
Figure 2: Visualization of the result of a Monte Carlo simulation, where we randomly
draw 6000 samples of 5 (upper histograms) and 8 (lower histograms) 2MASS
colors from a test field at l=122°, b=1°.
3.6 Reliability and completeness
There are several considerations that need to be kept in mind when interpreting the resulting images created by the U-test method. These are related to the reliability and completeness of the cells that are identified as being red. We discuss them here.
Type I error
The first error is related to the reliability, or the significance level of the selected cells. If, as in our case, rejection is defined at 99% this suggests that about 1% of the cells will be selected on statistical arguments, rather than being red. However, these cells are expected to be randomly distributed over the fields. To exemplify that this is indeed the case, we consider the following values extracted from the low-resolution




Type II error
This error is related to the cells that are not identified, i.e., where the Null hypothesis is not rejected but should have been. It is very difficult to put a realistic value to this error. In a broad sense it is inversely related to the Type I error. That is, the more reliable the sample is the less complete it is and vice versa. Furthermore, the Type II error usually occurs when sample sizes are too small. Because we have chosen to produce a high reliability catalog, we sacrifice some completeness due to this error.
Sample statistics
According to Moses (1964), when one of the distributions that go into the U-test contains fewer than 5 sources the outcome of the U-test is unreliable. There is not sufficient information available on the color distribution. Cells containing fewer than 5 sources are omitted in the identification process and, thus, introduce another form of incompleteness which depends on the resolution. Processing at higher resolution means being less complete.
Considering the limitations and reliability of the method, we have decided to put the effort in producing a reliable target list at relatively high resolution (60


Figure 3: The left panel displays the probability image returned by the U-test. The
grey-scales represent the probability that the color distributions in the cells are
redder with respect to the reference distribution. The friends-of-friends method
identified the extended red regions, displayed in arbitrary colors. The ellipses
give the spatial extent of the regions. Black cells are identified as being red, but
are considered isolated. Objects 463 and 470 correspond to NGC 7538 and S 159,
respectively. Object 474 is the first outer Galaxy ``cluster-forming clump''-candidate,
identified as such by the work presented here.
The right image shows the same region in MSX 8 μm emission.
4 Extracting candidate sources
Friends-of-friends approach
To increase the reliability of red regions and to avoid the selection of random cells, we add an additional step to the process. By means of a friends-of-friends algorithm, we select only clusters of 4 or more red cells for the final list of candidate objects. The reason for using 4 cells here is based on the probability of identifying clusters of randomly distributed red cells, which becomes negligible (

Linking length
A crucial parameter required for the friends-of-friends method is the linking length, L, used to determine whether selected cells belong to the same group or not. If L is taken too large, all cells will be assigned to the same group. If L is too small all cells will be considered isolated. Because we do not know the cause of the red color distribution in different cells (either extinction due to foreground material at unknown distance, or a concentration of intrinsically red objects at unknown distance) it is impossible without further information to put a physical size to L. We therefore choose a linking length in terms of the cell size,

Table 1: Excerpt from the catalog of extended red objects.
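An upper limit for L is set by the average distance, $\bar{r}$, between nearest neighbors in a random distribution of points with surface density $\rho$,
$$\bar{r} = \frac{1}{2\sqrt{\rho}} \tag{4}$$
(Clark & Evans 1954; Hertz 1909).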
A linking length much larger than this average distance will group cells together even if the distribution is random. This obviously contradicts the goal of finding clustered regions. A natural lower limit is the cell size itself. We have tested the method by varying L between 2 and 5. The higher values (>3) identify mostly large scale structures (>0.5°). For a value of 2, most cells are isolated or grouped with fewer than 3 other cells. We decide to use a linking length half that of the average distance between random points, i.e., L=2.5 times the cell size, for both resolutions. The probability for finding ``accidental groups'' of 4 cells using L=2.5 and considering that 1% of the cells are randomly selected by the U-test was evaluated in the Monte Carlo simulations described above and is <0.1%.
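A short worked check of the factor 2.5 follows. It assumes that Eq. (4) is the Clark & Evans expectation for the mean nearest-neighbor distance in a random two-dimensional distribution, and that the surface density of randomly selected cells is the 1% quoted above; both are assumptions made here for illustration.

```latex
\rho = 0.01~\text{cell}^{-2}
\quad\Rightarrow\quad
\bar{r} = \frac{1}{2\sqrt{\rho}} = \frac{1}{2 \times 0.1} = 5~\text{cell sizes},
\qquad
L = \tfrac{1}{2}\,\bar{r} = 2.5~\text{cell sizes}.
```

Under these assumptions the chosen linking length is indeed half the expected spacing of randomly flagged cells.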
A consequence of setting the linking length to 2.5 is that groups are identified with cells in between the red ones that may not be identified as red. This can either be due to a type II error (Sect. 3.6), or insufficient stars present in that cell (the sample statistics, Sect. 3.6). In both cases this illustrates that not all reddened regions can be detected and thus implies that we are limited in the completeness of the catalog.
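The grouping step can be sketched as a simple friends-of-friends search on grid indices. This is a minimal illustration, not the authors' implementation: cells are linked when their centre-to-centre separation is at most L cell sizes (whether the boundary case counts as linked is an assumption), and `friends_of_friends` is a hypothetical name.

```python
import math


def friends_of_friends(red_cells, linking_length=2.5, min_members=4):
    """Group (ix, iy) grid cells linked by chains of separations <= linking_length.

    Separations are measured in units of the cell size. Groups with fewer
    than min_members cells are treated as isolated and dropped.
    """
    unvisited = set(red_cells)
    groups = []
    while unvisited:
        seed = unvisited.pop()
        group, stack = [seed], [seed]
        while stack:
            cx, cy = stack.pop()
            friends = [c for c in unvisited
                       if math.hypot(c[0] - cx, c[1] - cy) <= linking_length]
            for friend in friends:
                unvisited.remove(friend)
            group.extend(friends)
            stack.extend(friends)
        if len(group) >= min_members:
            groups.append(sorted(group))
    return sorted(groups)


# Toy example: one chain of 4 red cells, one pair, and one isolated cell.
red = [(0, 0), (1, 1), (2, 0), (3, 2), (20, 20), (21, 21), (40, 40)]
print(friends_of_friends(red))
```

In the toy input, cells (0, 0) and (2, 0) end up in the same group although cell (1, 0) between them is not red, which is the behaviour described in the paragraph above.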
5 Results
Figure 3 displays an example of the output of the U-test in the form of a probability
image for the region centered on l=112°, b=1°.
For comparison we display the 8 μm emission as observed by MSX. We identify 24 extended red regions
in this field. The red cells of these regions are presented in different colors
in Fig. 3, with the spatial extent of the objects indicated by the ellipses.
Some objects clearly correlate with extended
bright 8 μm emission coming from well-known star forming regions such as NGC 7538 or
S 159. This field also contains object H474, which has been selected as a ``massive
dark cloud''-candidate for follow-up observations. These observations have revealed the
first infrared dark cloud identified as such in the outer galactic plane
(Frieswijk et al. 2007, 2008).
The resulting catalog of extended red regions and the probability images in
FITS-format are available
online. The low-resolution
part consists of 1589 objects and the high resolution part of 1320. The catalog is available
in electronic form at the CDS with sources designated FrSh LNNNN and FrSh HNNNN for low-
and high resolution objects, respectively.
Table 1 presents an excerpt from the high resolution catalog.
The objects in the table correspond to those that are identified in the field displayed in
Fig. 3 and include
NGC 7538 (FrSh H463) and S 159 (FrSh H470). The different
columns present the following information:
Column (1): Object identification,
Columns (2)-(3): Galactic coordinates of the object centre,
Column (4): Total number of red pixels,
Columns (5)-(6): Extent in l and b in pixel units (1 pix = 1′),
Column (7): Number of 2MASS sources.
Note that the number of 2MASS sources in Col. (7) contains information
on the stellar surface density. In principle this can be used to distinguish between
objects identified by extinction and objects representing a clustering of
young stars, because extinction may lower the stellar density whereas a
clustering may increase it. However, because we are using the 2MASS catalog
including the faint extension the variations in stellar surface density may
also arise from the incompleteness of the catalog and we do not discuss it
further in this paper.
Figure 4: Histogram displaying the distribution of the number of cells per extended red region for the high-resolution (solid) and low-resolution (dashed) catalog.
5.1 Size- and spatial distribution
Figure 5: Spatial distribution of extended red regions identified in the outer galactic
plane at 60″
The linear extent of the extended red regions ranges from a few arc-minutes to about a degree. Figure 4 gives an overview of the cell-number distribution for the regions in the high- and low resolution catalog. This shows that about half of the regions are identified as groups of 4 or 5 cells. Regions consisting of more than 10 cells account for 18% of the objects in the high resolution catalog and for 28% of the objects in the low resolution one.
Figure 5 shows the sky distribution of the extended red regions in the
outer galactic plane.
The solid histogram in the lower panel represents the number of high-resolution regions
in bins of 5° in galactic longitude. The dashed histogram displays
the distribution of low-resolution regions.
The galactic latitude distribution is given in the horizontal histogram
in the right panel. Several authors have found that the distribution of (molecular) clouds
toward the inner galactic plane peaks at negative galactic latitude
(Rosolowsky et al. 2010; Schuller et al. 2009; Peretto & Fuller 2009), which has been suggested to result from the Sun being
located slightly above the Galactic plane.
We do not find a similar distribution toward the outer Galaxy. In fact, about 60% of the
high-resolution objects are at positive latitudes (about 50% for low resolution).
However, this can be attributed to the local, well-populated Vela and Cygnus/Cepheus star
forming regions, which are located mainly above the plane.
The 2-dimensional spatial distribution is
displayed in the upper panel. The red and blue ellipses display the location and linear
extent of the high and low resolution regions, respectively.
Some well-known areas associated with molecular clouds and star forming activity in the
outer Galaxy are indicated. Most of the regions we find are well confined within the extent
of these large complexes, and by far most regions are identified in
the crowded spiral arm regions near Vela and Cygnus/Cepheus.
Toward lower stellar densities there are more
regions identified with the low resolution than with the high resolution grid, because at
high resolution there are often insufficient stars in a cell to use for the U-test.
Many of the large-scale regions identified at low resolution, in particular visible toward
the Vela and Cygnus/Cepheus regions, are identified as groups of smaller regions with the
high resolution grid.
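To illustrate why the number of stars in a cell limits the test, the following is a minimal sketch of a one-sided Mann-Whitney U-test using the normal approximation, without tie or continuity corrections. It is not the implementation used for the catalog; the function names and the color values are hypothetical, and the normal approximation is itself crude for very small samples.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def u_test_redder(cell, reference):
    """One-sided test that `cell` colors are larger (redder) than `reference`.

    Returns (U, p), where p comes from the normal approximation.
    """
    n1, n2 = len(cell), len(reference)
    ranks = average_ranks(list(cell) + list(reference))
    u = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    sigma_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mean_u) / sigma_u
    return u, 0.5 * math.erfc(z / math.sqrt(2.0))


# Hypothetical reference colors and two cells that are redder than every reference star.
reference = [0.5 + 0.01 * i for i in range(100)]
for cell in ([2.0, 2.1, 2.2], [2.0 + 0.1 * i for i in range(8)]):
    u, p = u_test_redder(cell, reference)
    print(f"n = {len(cell)}: U = {u:.0f}, p = {p:.2e}")
```

Even when every star in a cell is redder than the whole reference sample, a cell with fewer stars yields a weaker significance, which is consistent with fewer detections on the high resolution grid at low stellar density.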
5.2 Column density sensitivity
The objects in the catalog are identified without using a parameter that relates to a
physical quantity. However, we do have information on the average
colors of cells in
all objects and we use it to derive a rough estimate of the column density sensitivity.
Note, however, that we cannot correct for contamination by unreddened
foreground stars, so the values below represent only lower limits.
The average color in the cells of identified regions is
mag
with a standard deviation of
0.18 mag. Some cells have an average color
up to
2 mag.
These values can be converted to a visual extinction
using Eq. (5), taken from Rieke & Lebofsky (1985), and Eq. (6).
6 Conclusions
We have used a statistical approach in combination with a friends-of-friends algorithm
to measure deviations in the spatial color distribution of stars in the outer
galactic plane, with the goal of identifying extended reddened regions.
We processed the galactic plane at 60″ and 90″ resolution, resulting
in the identification of 1320 high-resolution and 1589 low-resolution extended red
regions. The reliability of the individual red cells that make up each object is 99%, and
by using a friends-of-friends approach, the resulting catalog of objects is 99.9%
reliable. Because we have focused on producing a highly reliable source list,
we cannot quantify the completeness of the catalog.
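The grouping step can be pictured with the following minimal sketch of a friends-of-friends search on a regular grid. It assumes that red cells are "friends" when they touch, including diagonally, and that groups smaller than a chosen minimum size are treated as isolated. Both choices, as well as the function name and the example cells, are assumptions made for illustration rather than the exact settings used for the catalog.

```python
from collections import deque


def friends_of_friends(red_cells, min_cells=4):
    """Group adjacent red cells (8-connectivity) and keep groups of >= min_cells.

    red_cells: iterable of (ix, iy) integer grid indices flagged as red.
    min_cells: assumed minimum group size; smaller groups count as isolated.
    """
    remaining = set(red_cells)
    groups = []
    while remaining:
        seed = remaining.pop()
        group, queue = [seed], deque([seed])
        while queue:
            x, y = queue.popleft()
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    neighbour = (x + dx, y + dy)
                    if neighbour in remaining:
                        remaining.remove(neighbour)
                        group.append(neighbour)
                        queue.append(neighbour)
        if len(group) >= min_cells:
            groups.append(sorted(group))
    return sorted(groups)


# Hypothetical red cells: one chain of four touching cells and one isolated pair.
cells = [(0, 0), (0, 1), (1, 1), (2, 2), (10, 10), (10, 11)]
print(friends_of_friends(cells))
# prints [[(0, 0), (0, 1), (1, 1), (2, 2)]]
```

Requiring several neighbouring red cells is what raises the reliability of an object above that of a single cell, since chance detections are unlikely to cluster.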
The majority of the objects are located toward well-known molecular cloud complexes and some correspond to specific, well-known objects such as NGC 7538 and S 159. This correlation strengthens the argument that the red nature of the regions is caused by extinction and/or embedded young stellar objects rather than intrinsic star color variations. The main goal of our analysis is to find previously undetected dark clouds in the outer Galaxy. However, many other types of objects are included in the catalog and the results may be compared to other outer Galaxy studies in the future.
Based on the parameters derived from the statistical test it is impossible to determine the nature of the reddening. The next step is to cross-correlate the objects with existing optical, infrared and CO data. This process is described in an accompanying paper (Frieswijk et al. 2010).
Acknowledgements
We thank the anonymous referee for his/her careful reading of the manuscript and his/her constructive remarks. We would also like to thank Marco Spaans and Floris van der Tak for their helpful discussions and suggestions, which have improved this manuscript. This publication makes use of data products from the Two Micron All Sky Survey, which is a joint project of the University of Massachusetts and the Infrared Processing and Analysis Center/California Institute of Technology, funded by the National Aeronautics and Space Administration and the National Science Foundation.
References
- Alves, J., Lada, C. J., Lada, E. A., Kenyon, S. J., & Phelps, R. 1998, ApJ, 506, 292
- Bessell, M. S., & Brett, J. M. 1988, PASP, 100, 1134
- Bohlin, R. C., Savage, B. D., & Drake, J. F. 1978, ApJ, 224, 132
- Brand, J., & Wouterloot, J. G. A. 1995, A&A, 303, 851
- Cambrésy, L., Beichman, C. A., Jarrett, T. H., & Cutri, R. M. 2002, AJ, 123, 2559
- Carey, S. J., Clark, F. O., Egan, M. P., et al. 1998, ApJ, 508, 721
- Carey, S. J., Feldman, P. A., Redman, R. O., et al. 2000, ApJ, 543, L157
- Clark, P. J., & Evans, F. C. 1954, Ecology, 35, 445
- Egan, M. P., Shipman, R. F., Price, S. D., et al. 1998, ApJ, 494, L199
- Frieswijk, W. F., Spaans, M., Shipman, R. F., et al. 2008, ApJ, 685, L51
- Frieswijk, W. W. F., Spaans, M., Shipman, R. F., Teyssier, D., & Hily-Blant, P. 2007, A&A, 475, 263
- Hertz, P. 1909, Mathematische Annalen, 67, 387
- Huchra, J. P., & Geller, M. J. 1982, ApJ, 257, 423
- Lada, C. J., Alves, J., & Lada, E. A. 1999, ApJ, 512, 250
- Lada, C. J., & Lada, E. A. 2003, ARA&A, 41, 57
- Lada, C. J., Lada, E. A., Clemens, D. P., & Bally, J. 1994, ApJ, 429, 694
- Lombardi, M. 2005, A&A, 438, 169
- Mann, H. B., & Whitney, D. R. 1947, Annals of Mathematical Statistics, 18, 50
- Moses, E. L. 1964, Journal of the American Statistical Association, 59, 645
- Pérault, M., Omont, A., Simon, G., et al. 1996, A&A, 315, L165
- Peretto, N., & Fuller, G. A. 2009, A&A, 505, 405
- Rathborne, J. M., Jackson, J. M., & Simon, R. 2006, ApJ, 641, 389
- Rieke, G. H., & Lebofsky, M. J. 1985, ApJ, 288, 618
- Rosolowsky, E., Dunham, M. K., Ginsburg, A., et al. 2010, ApJS, 188, 123
- Rudolph, A. L., Fich, M., Bell, G. R., et al. 2006, ApJS, 162, 346
- Schuller, F., Menten, K. M., Contreras, Y., et al. 2009, A&A, 504, 415
- Shu, F. H., Adams, F. C., & Lizano, S. 1987, ARA&A, 25, 23
- Skrutskie, M. F., Cutri, R. M., Stiening, R., et al. 2006, AJ, 131, 1163
- Wainscoat, R. J., Cohen, M., Volk, K., Walker, H. J., & Schwartz, D. E. 1992, ApJS, 83, 111
- Wall, J. V., & Jenkins, C. R. 2003, in Practical Statistics for Astronomers (Cambridge University Press)
- Wilcoxon, F. 1945, Biometrics Bulletin, 1, 80
- Wolf, M. 1923, Astron. Nachr., 219, 109
- Zinnecker, H., & Yorke, H. W. 2007, ARA&A, 45, 481
Footnotes
- ... plane
- Full Table 1 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/515/A51
- ...
online
- http://www.astro.rug.nl/ ismgroup/OuterGalaxy/
Copyright ESO 2010