Open Access
Issue
A&A
Volume 646, February 2021
Article Number A74
Number of page(s) 15
Section Catalogs and data
DOI https://doi.org/10.1051/0004-6361/202038956
Published online 17 February 2021

© P. Mège et al. 2021

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

In the Galaxy, studying the early phases of (high-mass) star formation requires distance information but often faces problems of high extinction and large distances. The study of star formation laws, from small to large scales, is based on distances and this is at the heart of the Galactic plane large-scale surveys and high-resolution pointed observation studies. Among these surveys, the Herschel Galactic plane survey Hi-GAL (Molinari et al. 2010) delivered unprecedented multi-wavelength (70 μm, 160 μm, 250 μm, 350 μm, and 500 μm) observations of star-formation sites at different spatial scales, namely from core to clump scales (Elia et al. 2017). However, the derivation of general conclusions regarding Galactic star-formation laws requires the determination of distances for a large number of sources.

Many studies have presented results for compact sources in the Galaxy using available molecular data to determine the radial velocity of sources with respect to the local standard of rest (LSR), and then to derive their kinematic distance using a model of the Galactic rotation curve (Russeil et al. 2011; Ellsworth-Bowers et al. 2015; Wienen et al. 2015; Whitaker et al. 2017; Zetterlund et al. 2017). Recently, Gaia (Gaia Collaboration 2000), measuring stellar parallaxes, revolutionized our view of the distribution and distances of sources in our Galaxy (Cantat-Gaudin et al. 2018; Zucker et al. 2019; Yan et al. 2019; Lallement et al. 2019) but this holds for a limited volume around the Sun (heliocentric distance up to 3 kpc in the Galactic plane) and for main sequence stars which do not suffer from high extinction (up to AV ≃ 3.5). Indeed, several methods could be used to determine the distance of Hi-GAL sources with Gaia data. One can determine the distance to the parental molecular cloud from the line-of-sight extinction, but as underlined by Yan et al. (2019) and Lallement et al. (2019) this is limited to extended clouds and to distances up to 3 kpc. One could determine the distance of any associated cluster, but looking at Cantat-Gaudin & Anders (2020) the mean Bayesian Gaia distance of the open clusters (with −2° < b <  2°) is 2.84 kpc (maximal distance reached being 9.75 kpc). Finally, one could determine the distance of young stellar objects (YSOs) embedded in Hi-GAL sources; this method would be the best. To evaluate this possibility we estimate the distance that can be reached with this method. From a sample of 20 massive Galactic star-forming complexes (d = 0.4–3.6 kpc) studied by Povich et al. (2013), we estimate a median value for YSO extinction of AV = 7.1 (and a mean of AV = 12) then AG = 6 (with AG/AV = 0.859). A 2 Myr PARSEC isochrone (Marigo et al. 2017) gives us a G band absolute magnitude of 5 and −1.64 for YSOs of 0.5 M and 8 M, respectively. Knowing that the Gaia-DR2 catalog is complete between G = 12 and G = 17, this means that YSOs of 0.5 M and 8 M can be detected up to 158 pc and 3.4 kpc, respectively (for AV = 7.1). Therefore, Gaia results are insufficient for the study of distant and/or highly embedded sources.

In the framework of the European FP7 SPACE project VIALACTEA1 dedicated to the scientific exploitation of the Hi-GAL survey, we embarked on the determination of distances for the 150223 compact sources from velocity assignment and kinematic distance determination. Given the large number of sources, this represents an unprecedented effort to deliver reliable distances to the scientific community, allowing the statistical study of clump physical properties.

This paper is organized as follows: Sect. 2 presents the sample of Hi-GAL sources and the molecular data used to determine the sources velocity. Section 3 describes the method used to determine velocity and distance and illustrates the different steps of the process. Results obtained from this work together with comparisons with published results available in the literature are presented in Sect. 4. Conclusions are given in Sect. 5.

2. Sample and data used

The Herschel infrared survey of the Galactic plane, Hi-GAL (Molinari et al. 2010), is a complete survey of the Galactic plane performed in five infrared photometric bands centered at 70, 160, 250, 350 and 500 μm. After band merging, 150 223 compact sources have a spectral energy distribution (SED) eligible for gray-body fit (Elia et al., in prep.) but lack the distance information needed to evaluate their physical properties. A first version of the catalog is presented in Elia et al. (2017) for 100 922 sources located in the inner Galaxy (−71° < l <  −67°). Table 1 list the spectroscopic surveys used to assign radial velocities to the Hi-GAL compact sources. Some surveys cover a large area of the Galactic plane, such as for example the SEDIGISM survey (−60° <  l <  +18°, |b| < 0.5°) while some others have a more limited coverage, such as CHIMPS (28° <  l <  46° and |b| < 0.5°). Also, there are some gaps in the coverage of the Galactic plane, such as for sources in the longitude range 195°–205°, for sources with |b| > 0.5° and 1° <  l <  17°, and for sources with b > 1.7° and 66° < l <  101°.

Table 1.

Surveys used to assign radial velocities to the Hi-GAL compact sources.

3. Distance determination method

The first step in determining the distance is to measure the radial velocity of the Hi-GAL sources. Here, because most of the Hi-GAL sources are expected to be associated with dense molecular material, we measure their radial velocities from molecular line observations. Many spectroscopic surveys of the Galactic plane are available and have been used for this work; these are listed in Table 1.

To derive the distance of sources in the Milky Way we have to assume that the measured radial velocity (VLSR) with respect to the local standard of rest of the source arises from its differential Galactic rotation. Subsequently, using a model for the rotation of the Galaxy, one can obtain the kinematic distance to the source from its radial velocity, VLSR. To this end, we adopt the revised Galactic rotation curve presented in Russeil et al. (2017). The rotation curve models axially symmetric circular orbits, but the true rotation pattern can be more complicated. Indeed velocities can depart from circular rotation, mainly due to streaming motions (Burton 1971; Stark & Brand 1989; Russeil 2003), and then can cause large uncertainty in the derived distances. Typical velocity departures are of about 10 km s−1 (e.g., Roman-Duval et al. 2009; Anderson et al. 2012; Wienen et al. 2015) but can reach up to 40 km s−1 (Brand & Blitz 1993). In addition, while the procedure of transforming velocities into distances is straightforward in the outer Galaxy, there is the well-known kinematic distance ambiguity (KDA) for sources in the inner Galaxy. Each velocity measurement leads to two possible distances, the near and far distances, which correspond to the two equidistant points from the tangent point (the closest position on the line of sight to the Galactic center). Choosing between the near and far distances is only possible using additional information, such as for example extinction and H I data cubes along the same line of sight (e.g., Russeil et al. 2011).

3.1. Algorithm outline

We developed the Python program starbird to determine the velocity and the distance of Hi-GAL sources. starbird assigns heliocentric radial velocities and distances to sources of the Hi-GAL survey by means of the VIALACTEA Knowledge Base (VLKB, Molinaro et al. 2016). The starbird program has five main steps, as shown on Fig. 1.

thumbnail Fig. 1.

Flowchart for velocity determination.

starbird accepts an input table of sources in a specific format. This table contains mandatory information on the sources including a source running number, a source name which here is the Hi-GAL name (as given in the catalog of Elia et al. 2017), the Galactic and equatorial coordinates (in degrees), and the source elliptical footprint information (the major and minor axes full width at half maximum (FWHM) in arc-seconds and position angle (PA) in degrees) measured at 250 μm. starbird executes a series of actions (labeled QUERY, DOWNLOAD, RADIAL, FILTER, MORPHO, SELECT-VLSR, BROWSE and COMPUTE) which are described below.

3.2. Velocity extraction

The first step to finding the source velocity is to download the molecular data cubes and fit the spectra extracted at the position of the source. For a given source, the tasks QUERY and DOWNLOAD list, query, and download the portion of the observational data cubes available in the VLKB. Depending on the Galactic longitude of the source there can be up to 11 different molecular data cubes available (see Table 1).

For all downloaded data cubes, the RADIAL task extracts a single profile at the central position of the source and performs a multi-Gaussian fitting. The developed fitting method is described in Appendix A. The FILTER task is then applied on the fitted lines, and is applied in every region of the l − VLSR plane except in kinematic avoidance zones (±20 degrees range around the Galactic center and anti-center direction), because in these regions anomalous streaming motions cause VLSR to strongly depart from the ideal circular motion. It first removes all individual velocity components that have a post-fit peak intensity with a signal-to-noise ratio (S/N) < 5 for the 5σ catalog and < 3 for the 3σ catalog. It then removes spiky features by removing overly narrow lines. This rejection is based on a minimum fitted FWHM threshold that depends on the survey (see last column in Table 1 and Appendix A). FILTER also removes the velocities that would lead to a negative rotation frequency and those that are inconsistent with the angular rotation model; the suspected extragalactic velocities (i.e., those that give a galactocentric distance larger than 17 kpc); the kinematically forbidden VLSR (i.e., those that go beyond the tangent point velocity by more than 20 km s−1); and finally it applies a selection process favoring velocities falling in the l − VLSR map from Dame et al. (2001).

Figure 2 shows an example of fitted spectra for the Hi-GAL source HIGALBM340.2690-0.0542 for which a velocity of −125 km s−1 is adopted. On this example we note that the dense-gas tracer lines (Figs. 2e and f) allow a reliable and unique velocity determination. Because they better probe the clumps velocity and show simpler spectra, dense-gas tracers better determine the velocity of a source. Indeed, for BGPS (Bolocam Galactic Plane Survey) objects, Ellsworth-Bowers et al. (2015) show that for objects with a HCO+(3–2) velocity, 95% have 13CO(1–0) velocities in agreement. In addition, we need that the dense-gas tracer line often corresponds to the most intense line in the lower-density-gas tracer. However, in about 40% of the cases only the low-density-gas tracer, or a single molecular transition spectrum with multiple lines, is available. However, instead of adopting the most intense line velocity directly, we perform a morphological analysis as described in the following section.

thumbnail Fig. 2.

Example of fitted molecular spectra for VLSR detection for the Hi-GAL source HIGALBM340.2690-0.0542. The spectra are (a) NANTEN 12CO(1–0), (b) MOPRA 12CO(1–0), (c) TRUMMS 12CO(1–0), (d) SEDIGISM 13CO(2–1), (e) SEDIGISM C18O(2–1), (f) MALT90 HCO+. In all panels the black, red, and green curves are the extracted profile, the fitted lines, and the residual, respectively. The background gray area is the ±σnoise value where σnoise is the root mean squared value of the noise amplitude (see Appendix A). The horizontal blue line shows the 5σ level.

3.3. Morphological analysis and velocity assignment

Most of the time the observed spectra show several peaks and then the velocity extraction process returns several velocities. Along the line of sight, the molecular emission can come from a diffuse layer or from a more structured medium such as clumps and filaments. One way to choose the best velocity is to suppose that the Hi-GAL sources belong to such a structured molecular medium. The MORPHO task has been developed to lead such a morphological analysis. For every fitted velocity, the script automatically extracts the plane from the data cube at this velocity and performs a basic source extraction using SExtractor software (Bertin & Arnouts 1996) to reveal the presence of some elliptical emission structures. If the molecular emission is diffuse then no source will be found, while if the emission is structured, molecular sources will be detected and some of them may intersect the Hi-GAL footprint. For the SExtractor detected sources, their roundness, their center angular distance to the Hi-GAL source center, and their overlapping area and filling factor with the Hi-GAL source are evaluated. A score is then attributed (coded by an integer between 0 and 4) spotting whether there is no overlapping source, a partial overlap, the source is included in the Hi-GAL one (and vice versa), or a complete overlap, respectively. The task SELECT-VSLR then scans all the available information on the fitted lines in order to choose the best line. Regarding the existence of positive scores for the molecular lines, different criteria are used. For each of these lines, an additional condition is applied: the line has to be confirmed by the presence of another line in a different molecular tracer among those available (with a velocity difference of less than 7 km s−1). If no line is confirmed, we choose by default the velocity of the most intense line from the densest available tracer. For confirmed lines the choice of the velocity is as follows. If lines have a morphological score attached to their velocities (by the previous action MORPHO), the velocity of the highest scoring line is selected. If two or more lines share the best score, the best angular separation (the smallest) is the next criterion. If they also have the same angular separation, the roundness (the highest) is the next criterion. If none of the confirmed lines have a score, we choose the velocity of the most intense line from the confirmed densest molecular tracer lines.

Figure 3 shows an example of the morphological analysis and the velocity assignment performed to the source HIGALBM340.1288-0.1837. For this source two molecular data cubes are available and both extracted spectra are in agreement, showing three lines at −3.4 km s−1, −52.5 km s−1, and −125.1 km s−1 but with different relative intensity. In MOPRA 12CO(1–0), the highest peak is for the −52 km s−1 while in SEDIGISM 13CO(2–1) this is the −125.1 km s−1. This could be due to an optical thickness effect knowing that the 12CO(1–0) emission usually comes from the cloud surface and/or a more diffuse medium. Thanks to the morphology analysis we can see that a structured source is found towards the Hi-GAL source (clear overlapping of the footprints) only at −125.1 km s−1 (Fig. 3e) which leads us to assign this velocity to the source. This also confirms that the −52.5 km s−1 velocity component corresponds to more-diffuse emission as illustrated in Fig. 3d.

thumbnail Fig. 3.

Example of the morphological analysis of the molecular emission for the Hi-GAL source HIGALBM340.1288-0.1837. The upper panels show the MOPRA 12CO(1–0) (a) and SEDIGISM 13CO(2–1) (b) fitted spectra. The labels in these panels are the same as in Fig. 2. The lower panels show the channel maps at −3.4 km s−1 (c), −52.5 km s−1 (d), and −125.1 km s−1 (e). The red ellipse is the footprint of the Hi-GAL source while the yellow ellipses are the footprints of the structures extracted by SExtractor.

3.4. Distance determination: solving the kinematic distance ambiguity

The distance determination is performed by the task COMPUTE. The distance of a source can be derived either from kinematics or in a more direct way (stellar distance of the associated H II region or maser parallax distance). For the kinematic distance calculation, the Reid et al. (2009) fortran code was used but with the revised rotation curve adopted for the VIALACTEA project (Russeil et al. 2017).

When the VLSR of a source is assigned, the kinematic distance (dkin) can be calculated. At this step, three scenarios must be considered: (1) the source is outside the solar circle and therefore dkin is unique; (2) the source lies inside the Solar circle but its velocity is forbidden, hence the tangent point distance is adopted; (3) the source lies inside the Solar circle with a valid velocity, and therefore two distances are possible and the kinematic distance ambiguity (KDA) must be solved to choose between the far distance, dFAR, and the near distance, dNEAR.

To solve the KDA, or to directly adopt a stellar or maser parallax distance, the task first cross-correlates (both spatially and in velocity, when possible) the source with available published catalogs (listed in the note of Table 2) in which the KDA is already solved, or giving us sufficient information to solve it or to assign a stellar or maser parallax distance.

Table 2.

Methods used for distance disambiguation with their associated priority codes.

Because Hi-GAL sources located in quadrants Q2 and Q3 are not affected by any KDA, their distance is directly assigned from their known radial velocity. For the Hi-GAL sources located in Q1 and Q4, the near/far distance ambiguity is resolved going down through the list of methods arranged in order of priority (see priority order in Table 2). The methods used to decide whether the source concerned by KDA is located at the near or far distance (dNEAR or dFAR) are summarized in Table 2.

For example, because infrared-dark clouds (IRDCs) are cold and dense molecular clouds seen as extinction features against the bright Galactic background, if a source can be associated to an IRDC the near distance will be favored. If a source can be associated to an optical H II region or to a region with a maser parallax then the stellar distance of the H II region (or the near distance) or the parallax distance will be adopted. Following the H I absorption/emission method (e.g., Anderson & Bania 2009), when a source can be assigned to a radio H II region for which H2CO or H I line of sight absorption lines are observed with absolute velocity larger than the source VLSR then the far distance will be adopted. However, as many sources have no such information, we develop automatic scripts to perform H I self-absorption (HISA) and extinction curve analysis.

By default, the source is placed at the far distance but if a HISA feature is found at the source VLSR or if an extinction feature is found close to the near distance then dNEAR is adopted. The HISA is illustrated here for the source HIGALBM329.2039+0.7101 (Fig. 4). The script automatically identifies all the significant (S/N larger than 3) and regular dips in the HI spectrum (see Appendix B). If a dip is found at the same velocity (within a 3 km s−1 tolerance) as the source VLSR, as is the case here, then dNEAR is adopted.

thumbnail Fig. 4.

Example of HISA identification for the source HIGALBM329.2039+0.7101 with assigned VLSR = −63.562 km s−1. The H I profile is plotted in blue, and the identified dips are shown as red dots. The horizontal gray thick line shows the ±σnoise and the dashed red line shows the 3σnoise level. The black vertical dashed line and the gray shadow are the velocity of the source and the 3 km s−1 tolerance.

The extinction cubes are from Marshall et al. (2006)2. They give AKs value in the form of “l, b, distance” data cubes. The extinction profile (AKs vs. distance along the line of sight) at the central position of the source is extracted, and then the extinction layers along the line of sight are identified from the jump they induce in the extinction profile. As these jumps are easier to identify on the three-order derivative, the script automatically identifies them as minima deeper than 3σ from the baseline level. The extinction analysis is illustrated here for the source HIGALBM1.7647-0.4806 (Fig. 5) for which dFAR = 13.04 kpc and dNEAR = 3.63 kpc, respectively. In our example, one extinction layer at 3.7 kpc is identified, favoring the near distance. We therefore adopt dNEAR = 3.63 kpc for this source.

thumbnail Fig. 5.

Example of extinction layer identification for the source HIGALBM1.7647-0.4806. The detection used the third derivative of the extinction curve. Here, one extinction layer is identified (red dot and dashed line) at a distance of 3.7 kpc.

A last option used to distinguish between dFAR and dNEAR is to follow the method used by Solomon et al. (1987). This method considers the physical distance of the source to the Galactic plane: if by taking the far distance the cloud is too far from the Galactic plane (i.e., > 140 pc, which is the scale height of the gaseous disk of the Galaxy) then the near distance will be favored. We note that because the sources with KDA are located within the Solar circle, this criterion is not affected by the Galactic warp which starts outside the Solar circle.

The distance determination script tests the different methods in a hierarchical way following the priority list given in Table 2. At the beginning, the source is placed at the far distance, and then this distance is changed if one of the listed conditions is fulfilled. Usually the choice of the distance is validated by only one method (which is indicated as the “STATUS” in the final table), but because all the methods are tested, the number of validated methods is coded in the final table as the “PROBA” number; the higher this number, the more reliable the distance decision.

4. Results

Results are presented in Table 3 (the complete table is available at CDS and the 3σ version of the table is also delivered as described in Appendix E). The columns are as follows: Col. 1: Hi-GAL source name; Cols. 2–5: equatorial and Galactic coordinates; Cols. 6 and 7: adopted velocity and its uncertainty; Col. 8: molecular tracer used for the velocity determination; Cols. 9 and 10: assigned distance and its uncertainty; Col. 11: method used for the distance determination as listed in Table 2; Col. 12: probability that the source has the assigned distance (this is a simple evaluation of the number of different methods assigning the chosen distance, e.g., PROBA = 0.22 means that two methods over the nine agree with the adopted distance assignment); Cols. 13–18: far and near kinematic distances and their upper and lower uncertainties (without any correction applied); Col. 19: the galactocentric distance; and Col. 20: detected extinction features listed as distance, AKs pairs.

Table 3.

Velocity and distance for the Hi-GAL compact sources

One problem that arises when determining the kinematic distances in the Galaxy is that the arm perturbation can introduce radial and azimuthal gas streaming motions (e.g., Roberts 1969; Englmaier 2000; Ramón-Fox & Bonnell 2018). Such streaming motions are taken into account in the determination of the distance uncertainty by assuming a typical velocity dispersion of 7 km s−1, but when the streaming motion induces a systematic and larger velocity shift, as is the case in the Perseus arm between l = 90° and l = 160° (e.g., Roberts 1972; Georgelin & Georgelin 1976; Brand & Blitz 1993) we must add a velocity correction. We then first delineate the Perseus arm by determining the arm velocity range (only between l = 90° to l = 160°) from the 12CO longitude-velocity plot of Dame et al. (2001). We then select HiGAL sources in these velocity and longitude ranges and apply to these sources (4405 sources with a kinematic distance) a correction of 14.9 km s−1 (Russeil et al. 2007) to the measured VLSR before re-calculating the kinematic distance. This correction is illustrated in Fig. 6.

thumbnail Fig. 6.

Longitude–distance plot in the range of the Perseus arm streaming motion correction (l = 90° and l = 160°). Blue and red symbols are the source distance before and after the correction. Sources that draw horizontal alignment are sources with the same stellar distance assignation.

Figure 7 shows the distribution of sources with determined radial velocities. We are able to assign a velocity for 124 069 sources (82.6%). We can split the sample of sources with no velocity assignment into sources that are not covered by any molecular surveys (57.4%) and sources for which the extracted spectrum is too noisy to perform any line fitting or to obtain a fitted line above a 5σ threshold (42.6%). For the first group of sources, we clearly identify the two gaps in the longitude coverage (around l = 190° and l = 270°) and latitude coverage limitation of the survey for some part of Q1 and Q2. For the second group of sources we note that 19% of them have a velocity if the fitted line intensity threshold is lowered to 3σ (instead of 5σ).

thumbnail Fig. 7.

Spatial distribution of Hi-GAL sources with determined radial velocities (a). Panel b: Hi-GAL sources superimposed with no assigned velocity. Sources not covered by any molecular surveys are shown in red and sources with no line fitted from an overly noisy extracted molecular spectrum are shown in blue. The vertical lines separate the different Galactic plane quadrants (Q1, Q2, Q3 and Q4).

Figure 8 shows a zoom (−60° < l <  60°) on the longitude–velocity distribution of all Hi-GAL sources for which we obtained a velocity (124069 sources). The gray-scale image shows the distribution of the molecular gas traced by the integrated 12CO emission from Dame et al. (2001). We note that the sources closely follow the large-scale prominent molecular features.

thumbnail Fig. 8.

Longitude–velocity distribution of all Hi-GAL sources in the range −60° < l <  60° for which we have obtained a velocity. The gray-scale image shows the distribution of the molecular gas traced by the integrated 12CO emission from Dame et al. (2001).

Urquhart et al. (2018) derived the physical properties of approximately 8000 ATLASGAL clumps (located in quadrants 4 and 1). To do so, they first assigned radial velocities to the clumps and used a series of criteria (see their Fig. 5 with the flow chart) to determine distances for the clumps. In order to compare our results with their velocity and distance determinations, we first cross-correlated the ATLASGAL and Hi-GAL compact-source catalogs with a radius of 14″ for the cone search. ATLASGAL and Hi-GAL sources are paired on the basis of their physical proximity. Due to the maximum radius of 14″ and differences in ATLASGAL and Hi-GAL survey coverage, the number of paired sources is 6268. Among them, 5976 sources have velocity in both surveys.

Figure 9 shows the comparison of velocity determination for paired ATLASGAL and Hi-GAL sources. The colors code the separation between the sources. We note that the source scattering around the one-to-one line is not due to mismatching. We find that 89.4% and 91.8% of the sources have a velocity difference smaller than 5 km s−1 and 10 km s−1, respectively.

thumbnail Fig. 9.

Comparison between paired ATLASGAL and Hi-GAL source velocity. The separation of the sources (in arcsec) is coded in color.

To make a comparison of distance determinations for paired sources, and in particular to compare the KDA solution, we must select sources with similar velocity. We then select sources with velocity difference smaller than 5 km s−1. In addition, we keep sources with longitude outside ±12° around the center and anticenter directions (because their distance determination is highly uncertain which is due to noncircular motion in the bar or the perpendicularity of the circular motion to the line of sight), and we exclude ATLASGAL sources labeled as “ambiguous” and Hi-GAL sources labeled “NO_KDA”. Figure 10 shows the comparison of distance determinations for this sample (3528 sources). In Fig. 10a, apart from the sources distributed around the equality line, we observe the classical behavior of anti-correlated distances, which is due to the differences in the distance disambiguation, that is, difference in assignation for a given source between the near and far distance (see Sect. 3.4). This is observed for any sources catalog, as illustrated in Fig. C.1 comparing ATLASGAL and BGPS (Ellsworth-Bowers et al. 2015) catalogs. Selecting the ATLASGAL sources for which the KDA distance has been specifically solved (Flags (iv) to (viii)) we find that 29.5% of the sources disagree for the KDA solution adopted. Nevertheless, we find that 2248 sources (64%) have a distance difference of less than 0.7 kpc (0.7 kpc being the typical distance uncertainty value for the paired Hi-GAL sources) with a mean value of 0.21 kpc. To better understand the distance difference between ATLASGAL and Hi-GAL sources we identify in Fig. 10b the sources with the tangent distance choice and sources with velocity lower than 10 km s−1, which are nearby sources (for which the velocity is more representative of any local motions) or sources close to the Solar circle (for which any particular motions produce inaccurate kinematic distances).

thumbnail Fig. 10.

Distance comparison between paired ATLASGAL and Hi-GAL sources (3528 sources). (a) Sources with velocity difference less than 5 km s−1 and longitude outside ±12° around the center and anti center directions. (b) Same figure but identifying the sources placed at the tangent distance in ATLASGAL (in red), and the sources with velocity lower than 10 km s−1 (in green). The red dash-dotted line is the y = x line.

We performed the same selection and the same comparison (see Fig. 11) with the BGPS catalog (Ellsworth-Bowers et al. 2015) which probes mainly quadrants 1 and 2. The number of paired sources with velocity is 2242, among which 89.2% (2000 sources) have velocity in agreement (Δv ≤ 5 km s−1). From this subsample of sources, we discard those with an unconstrained KDA solution in both surveys. This leaves us with 799 sources among which 64% (510 sources) have a distance difference of less than 0.5 kpc (which is the typical distance uncertainty value for the paired Hi-GAL sources) with a mean value of 0.16 kpc. In addition, selecting the BGPS sources for which the KDA distance has been specifically solved (Flag N or F) we find that 29.6% of the sources disagree for the KDA solution adopted; a proportion similar to that found for the ATLASGAL comparison.

thumbnail Fig. 11.

(a) Velocity (2285 sources) and (b) distance (2050 sources) comparison between paired BGPS and Hi-GAL sources. The red dash-dotted line is the y = x line. Same color coding as in Fig. 10.

Zucker et al. (2020) present a method that combines stellar photometric data with Gaia DR2 parallax measurements in a Bayesian framework to infer the distances of nearby dust clouds to a typical accuracy of ∼5%. These latter authors derived a catalog of distances to molecular clouds presented in the Handbook of Star Formation, Volumes I (Reipurth 2008a) and II (Reipurth 2008b). To compare our distance results with those of Zucker et al. (2020) we select the regions that are covered by Hi-GAL in their list. There are 13 regions common to both our list and that of Zucker et al. (2020), namely CMaOB1, Carina, Cygnus X, GemOB1, IC 2944, L1293, M20, M17, North America, RCW38, W3, W4, and W5. We then select the Hi-GAL sources located at the positions of each region as listed by Zucker et al. (2020) and in a similar area (0.2°) but within ±5 km s−1 around the systemic velocity of the region. For most of the region the adopted velocity is the H II region VLSR, but for CMaOB1, GemOB1, and IC 2944 the adopted velocity is the VLSR calculated from the mean radial velocity of the stars (Mel’Nik & Dambis 2009; Sepúlveda et al. 2011) and for L1293 it is the molecular velocity (Conrad et al. 2017). We then split the sample into regions located in quadrants 2 and 3 (then with no KDA) and in quadrants 1 and 4 (with KDA) and display them in Fig. 12, a velocity–distance plot (see also Fig. D.1). From Fig. 12 we find a good agreement for regions W3, W4, W5, RCW38, Carina, and CMaOB1, an agreement within the error bars for GemOB1, Cygnus X, North America, IC 2944, and L1293, a shift for M17, and a strong disagreement for M20. For M20 and GemOB1, their longitude placing them close to the Galactic center and anti-center directions, respectively, leads to an unreliable kinematic distance. Similarly, for North-America, although a few sources agree, others are at the adopted stellar distance while several are too far but can be explained by their longitude being near l = 90°, a location where the kinematic distance is also uncertain mainly because of the small value of the velocity in this region. In W5, many sources have been placed at the stellar distance which is slightly farther than the Zucker et al. (2020) distance but in agreement with the 2.3 kpc distance found for W3 and W5 by Chen et al. (2020). The most surprising is the disagreement found for M20 and M17. For these two regions, 10 among 13 and 10 among 19 Hi-GAL sources, respectively, have been placed at the far distance (not plotted in Fig. 12). However, for M17 the OB star distance is 2.1 kpc (Hoffmeister et al. 2008), in agreement with our data. This could suggest that the cloud detected by Zucker et al. (2020) is in front of M17 or its distance is underestimated. We note that Chen et al. (2020) find distances between 1.6 and 1.8 kpc for clouds in the neighborhood of M17. We can invoke the same explanation for M20, as Marshall et al. (2009) and Schultheis et al. (2014) estimated a distance greater than 4 kpc for dark clouds and extinction features in the direction and in the neighborhood of the H II region while Tapia et al. (2018) re-evaluate the M20 distance to 2 kpc.

thumbnail Fig. 12.

Velocity–distance plot of Hi-GAL sources in regions from Zucker et al. (2020) for sources in quadrants 1 and 2 (a) and in quadrants 2 and 4 (b), respectively. The velocity (±5 km s−1) and the distance (and its uncertainty) of the sight lines as listed in Zucker et al. (2020) are plotted in black. In panel a the Hi-GAL sources are plotted in red (for W3 and GemOB1), blue (for W5 and L1293), and green (for W5, CMaOB1 and RCW38). In panel b the Hi-GAL sources are plotted in red (for Carina, North America, and M20), blue (for Cygnus X), and green (for IC 2944 and M17).

Comparison with results presented in Zucker et al. (2020) allows us to illustrate the fact that when studying a particular region it is better to associate the region and the Hi-GAL sources from their velocities and only then to adopt the same distance calculated from the mean or representative velocity of the region. Indeed a small difference in velocity (due e.g., to internal motion) could lead to a large difference in distance.

We must keep in mind that the comparison presented here is done on small areas and in lines of sight that are not always pointing directly toward the considered H II regions. To avoid this, we illustrate the Hi-GAL sources association with a typical star forming region, the W3–W4 H II regions complex, for a larger view. We select all the Hi-GAL sources located in the (l, b) coverage of the region and plot these sources in (l, b, v) and (l, b, D) diagrams, v and D representing the velocity and distance obtained from this work. Results are presented for the W3–W4 regions. Figure 13 shows the Herschel PACS 160 μm image of the W3–W4 Galactic star forming regions together with the 1544 Hi-GAL sources overlaid as points, color-coded according to the velocity-range to which they belong. The associated sources (858 sources, shown in green) have velocities in the range [−55, −40] km s−1 which corresponds to velocities observed for the ionized and molecular gas in these regions. Unfitted sources are coded in blue (422 sources) and sources with velocities out of the [−55, −40] km s−1 range are coded in red (264 sources). Of the 1122 sources with velocity information, 858 (76.5%) are seen in the range −55 to −40 km s−1.

thumbnail Fig. 13.

Herschel PACS 160 μm image of the W3–W4 Galactic star forming regions. The Hi-GAL sources are overlaid as points, color-coded by their velocity: green for sources in the −55 to −40 km s−1 range, blue for unfitted sources, and red for the others with velocities outside the [−55, −40] velocity range.

From Fig. 13 it is clear that knowledge of velocity (and distance) is key to studying star formation. We note that the blue unfitted sources represent an important number of sources. Dedicated studies can be envisioned to obtain molecular data for these sources and ascertain their velocity and distance.

Figure 14 shows the (l, b, D) 3D diagram for the regions. Over the 1544 sources (with 422 unfitted, therefore a total of 1122 sources), 478 (43%) are observed in the 1.8–2.3 kpc range, the range obtained by Zucker et al. (2020) using Gaia data for the W3–W4 regions (see also Navarete et al. 2019 for W3). The remaining 644 sources (57%) are observed out of this distance range.

thumbnail Fig. 14.

Three-dimensional (l, b, D) diagram for the W3–W4 regions. 478 (43%) sources are observed in the range 1.8–2.3 kpc, the range of distances obtained by Zucker et al. (2020) for these two star forming regions. The color coding is the same as in Fig. 13, that is, sources in the range 1.8–2.3 kpc are coded in green, sources outside this range are coded in red, and unfitted sources (with no distance information) are coded in blue; the distance layer for these latter sources is put arbitrarily at 3 kpc for clarity in the plot.

Figure 15 shows the velocity versus distance plot for the W3–W4 regions. The 1544 Hi-GAL sources are represented together with the selected velocity range (−55 to −40 km s−1) in green and the selected distance range (1.8–2.3 kpc) in red. We see that, within the velocity range [−35, 0] and [−90, −60], sources follow the relation between velocity and distance given by the kinematic distance derivation. A similar relation is observed within −35 to −60 km s−1 where the velocity correction has been applied to take into account streaming motions observed in this region of the Galaxy. We clearly see the correspondence between the velocity range and the distance range, visually explaining how a lower number of sources are found in the corresponding selected distance range (1.8–2.3 kpc) when compared to the number of sources found in the selected velocity range.

thumbnail Fig. 15.

Velocity vs. distance for the W3–W4 regions. The 1544 Hi-GAL sources are represented together with the selected velocity range (−55 to −40 km s−1) in green and the selected distance range (1.8–2.3 kpc) in red.

In all cases studied, we note that about 50% of the Hi-GAL sources detected in a given region are in the D layer associated with a given molecular star forming cloud. We also note the possibility that for some unfitted sources (hence with no velocity or distance information), when a clear spatial association of the source can be ascertained with complementary information (e.g., near-infrared images), that this spatial association could be used to assign the source to a given molecular star forming region. We recommend users take a careful look at an infrared 2D image of the region, the (l, b, v) and (l, b, D) diagrams, when working on specific regions.

5. Conclusions

In an unprecedented effort to deliver reliable velocity and distance information for a large number of Hi-GAL sources, we developed the Python program starbird, which enables the determination and the assignation of velocity and distance to the Hi-GAL compact sources. We obtained a distance for 124 069 compact sources (among 150 223 sources) of the Hi-GAL catalog for the 5σ level (and 128 351 sources for the 3σ level). This is the first time that this information has been determined for such a large number of compact sources in a homogeneous way. This Hi-GAL compact source catalog with distance determination constitutes an invaluable resource for studies related to star formation in the Galaxy and Galaxy structure.

A comparison of our results with published velocities and distances coming from ATLASGAL and BGPS surveys shows that the velocity assignment is effective (with typically 89.3% of the sources with a velocity difference of less than 5 km s−1), and the resolution of the KDA disagrees in only 29.6% of the cases. The comparison with nearby regions from Zucker et al. (2020) illustrates that the cataloged distances can be used statistically to lead studies at the scale of the Galaxy. When studying a particular region, we recommend comparing the velocity and spatial distribution of the Hi-GAL sources with the systemic velocity of the region and its morphology, respectively. Finally, a path to improvement will be to update the distance (up to 4 kpc) of the nearby optical H II regions and star forming regions (e.g., Cantat-Gaudin & Anders 2020; Zucker et al. 2020; Maíz Apellániz et al. 2020) used in the process and the inclusion of recent molecular cloud distance catalogs (such as Chen et al. 2020), both based on the recent Gaia-DR2 data.


1

FP7-SPACE-2013-1 call with Grant Agreement No. 607380.

2

In practice an updated version of the cubes is used (Marshall et al., in prep.).

Acknowledgments

This work is part of and relied on the support of the European FP7 VIALACTEA project and the developments lead within the project, including the ViaLactea Knowledge Base (VLKB). A.Z. thanks the support of the Institut Universitaire de France. This paper made use of the SEDIGISM survey data. SEDIGISM data were acquired with the Atacama Pathfinder Experiment (APEX) under programmes 092.F-9315 and 193.C-0584. APEX is a collaboration among the Max-Planck-Institut fur Radioastronomie, the European Southern Observatory, and the Onsala Space Observatory. The SEDIGISM survey database can be found at https://sedigism.mpifr-bonn.mpg.de, which was constructed by James Urquhart and hosted by the Max Planck Institute for Radio Astronomy.

References

  1. Anderson, L. D., & Bania, T. M. 2009, ApJ, 690, 706 [Google Scholar]
  2. Anderson, L. D., Bania, T. M., Balser, D. S., & Rood, R. T. 2012, ApJ, 754, 62 [Google Scholar]
  3. Anderson, L. D., Armentrout, W. P., Johnstone, B. M., et al. 2015, ApJS, 221, 26 [Google Scholar]
  4. Araya, E., Hofner, P., Churchwell, E., & Kurtz, S. 2002, ApJS, 138, 63 [Google Scholar]
  5. Barnes, P., Wong, T., Fukui, Y., et al. 2006, CHaMP - A Galactic Census of High- and Medium-mass Protostars (ATNF Proposal) [Google Scholar]
  6. Barnes, P. J., Muller, E., Indermuehle, B., et al. 2015, ApJ, 812, 6 [Google Scholar]
  7. Bertin, E., & Arnouts, S. 1996, A&AS, 117, 393 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Bobylev, V. V., & Bajkova, A. T. 2013, Astron. Lett., 39, 809 [Google Scholar]
  9. Braiding, C., Wong, G. F., Maxted, N. I., et al. 2018, PASA, 35, e029 [Google Scholar]
  10. Branch, M., Coleman, T., & Li, Y. 1999, SIAM J. Sci. Comput., 21, 1 [Google Scholar]
  11. Brand, J., & Blitz, L. 1993, A&A, 275, 67 [NASA ADS] [Google Scholar]
  12. Burton, W. B. 1971, A&A, 10, 76 [NASA ADS] [Google Scholar]
  13. Busfield, A. L., Purcell, C. R., Hoare, M. G., et al. 2006, MNRAS, 366, 1096 [Google Scholar]
  14. Cantat-Gaudin, T., & Anders, F. 2020, A&A, 633, A99 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  15. Cantat-Gaudin, T., Jordi, C., Vallenari, A., et al. 2018, A&A, 618, A93 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Chen, B. Q., Li, G. X., Yuan, H. B., et al. 2020, MNRAS, 493, 351 [Google Scholar]
  17. Chira, R. A., Beuther, H., Linz, H., et al. 2013, A&A, 552, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Conrad, C., Scholz, R. D., Kharchenko, N. V., et al. 2017, A&A, 600, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  19. Dame, T. M., Hartmann, D., & Thaddeus, P. 2001, ApJ, 547, 792 [NASA ADS] [CrossRef] [Google Scholar]
  20. Dempsey, J. T., Thomas, H. S., & Currie, M. J. 2013, ApJS, 209, 8 [Google Scholar]
  21. Du, F., & Yang, J. 2008, ApJ, 686, 384 [Google Scholar]
  22. Elia, D., Molinari, S., Schisano, E., et al. 2017, MNRAS, 471, 100 [Google Scholar]
  23. Ellsworth-Bowers, T. P., Rosolowsky, E., Glenn, J., et al. 2015, ApJ, 799, 29 [Google Scholar]
  24. Englmaier, P. 2000, Rev. Mod. Astron., 13, 97 [Google Scholar]
  25. Gaia Collaboration (Brown, A. G. A., et al.) A&A, 616, A1 [Google Scholar]
  26. Georgelin, Y. M., & Georgelin, Y. P. 1976, A&A, 49, 57 [NASA ADS] [Google Scholar]
  27. Golub, G., & Peyrera, V. 2007, SIAM J. Numer. Anal., 10, 413 [Google Scholar]
  28. Heyer, M. H., Brunt, C., Snell, R. L., et al. 1998, ApJS, 115, 241 [Google Scholar]
  29. Hoffmeister, V. H., Chini, R., Scheyda, C. M., et al. 2008, ApJ, 686, 310 [Google Scholar]
  30. Honma, M., Nagayama, T., Ando, K., et al. 2012, PASJ, 64, 136 [Google Scholar]
  31. Jackson, J. M., Rathborne, J. M., Shah, R. Y., et al. 2006, ApJS, 163, 145 [Google Scholar]
  32. Jackson, J. M., Finn, S. C., Rathborne, J. M., Chambers, E. T., & Simon, R. 2008, ApJ, 680, 349 [Google Scholar]
  33. Jackson, J. M., Rathborne, J. M., Foster, J. B., et al. 2013, PASA, 30, e057 [NASA ADS] [CrossRef] [Google Scholar]
  34. Kolpak, M. A., Jackson, J. M., Bania, T. M., Clemens, D. P., & Dickey, J. M. 2003, ApJ, 582, 756 [Google Scholar]
  35. Lallement, R., Babusiaux, C., Vergely, J. L., et al. 2019, A&A, 625, A135 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. Liao, W., & Fannjiang, A. 2016, Appl. Comput. Harmonic Anal., 40, 33 [Google Scholar]
  37. Liu, X.-L., Wang, J.-J., & Xu, J.-L. 2013, MNRAS, 431, 27 [Google Scholar]
  38. Maíz Apellániz, J., Crespo Bellido, P., Barbá, R. H., Fernández Aranda, R., & Sota, A. 2020, A&A, 643, A138 [CrossRef] [EDP Sciences] [Google Scholar]
  39. Marigo, P., Girardi, L., Bressan, A., et al. 2017, ApJ, 835, 77 [NASA ADS] [CrossRef] [Google Scholar]
  40. Marshall, D. J., Robin, A. C., Reylé, C., Schultheis, M., & Picaud, S. 2006, A&A, 453, 635 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Marshall, D. J., Joncas, G., & Jones, A. P. 2009, ApJ, 706, 727 [Google Scholar]
  42. McClure-Griffiths, N. M., Dickey, J. M., Gaensler, B. M., et al. 2005, ApJS, 158, 178 [Google Scholar]
  43. Mel’Nik, A. M., & Dambis, A. K. 2009, MNRAS, 400, 518 [NASA ADS] [CrossRef] [Google Scholar]
  44. Mizuno, A., & Fukui, Y. 2004, ASP Conf. Ser., 317, 59 [Google Scholar]
  45. Molinari, S., Swinyard, B., Bally, J., et al. 2010, A&A, 518, L100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  46. Molinaro, M., Butora, R., Bandieramonte, M., et al. 2016, Proc. SPIE, 9913, 185 [Google Scholar]
  47. Mottram, J. C., & Brunt, C. M. 2010, ASP Conf. Ser., 438, 98 [NASA ADS] [Google Scholar]
  48. Navarete, F., Galli, P. A. B., & Damineli, A. 2019, MNRAS, 487, 2771 [Google Scholar]
  49. Osborn, M. 1975, SIAM J. Numer. Anal., 12, 571 [Google Scholar]
  50. Osborn, M. 2007, Electron. Transac. Numer. Anal. ETNA 28, 28, 1 [Google Scholar]
  51. Otrupcek, R. E., Hartley, M., & Wang, J. S. 2000, PASA, 17, 92 [Google Scholar]
  52. Pandian, J. D., Momjian, E., & Goldsmith, P. F. 2008, A&A, 486, 191 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Peretto, N., & Fuller, G. A. 2009, A&A, 505, 405 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  54. Peter, T., Plonka, G., & Schaback, R. 2015, Proc. Appl. Math. Mech., 15, 665 [Google Scholar]
  55. Plonka, G., & Tasche, M. 2014, Gesellschaft fur Angewandte Mathematik und Mechanik, 37, 152 [Google Scholar]
  56. Potts, D., & Tasche, M. 2010, Signal Process., 90, 1631 [Google Scholar]
  57. Potts, D., & Tasche, M. 2013, Linear Algebra Appl., 439, 1024 [Google Scholar]
  58. Povich, M. S., Kuhn, M. A., Getman, K. V., et al. 2013, ApJS, 209, 31 [Google Scholar]
  59. Ramón-Fox, F. G., & Bonnell, I. A. 2018, MNRAS, 474, 2028 [NASA ADS] [CrossRef] [Google Scholar]
  60. Reid, M. J., Menten, K. M., Brunthaler, A., et al. 2014, ApJ, 783, 130 [Google Scholar]
  61. Reid, M. J., Menten, K. M., Zheng, X. W., et al. 2009, ApJ, 700, 137 [Google Scholar]
  62. Reipurth, B. 2008a, Handbook of Star Forming Regions, Volume I: The Northern Sky, 4 [Google Scholar]
  63. Reipurth, B. 2008b, Handbook of Star Forming Regions, Volume II: The Southern Sky, 5 [Google Scholar]
  64. Rigby, A. J., Moore, T. J. T., Plume, R., et al. 2016, MNRAS, 456, 2885 [Google Scholar]
  65. Roberts, W. W. 1969, ApJ, 158, 123 [Google Scholar]
  66. Roberts, W. W., Jr 1972, ApJ, 173, 259 [Google Scholar]
  67. Roman-Duval, J., Jackson, J. M., Heyer, M., et al. 2009, ApJ, 699, 1153 [Google Scholar]
  68. Russeil, D. 1998, PhD Thesis, University of Provence [Google Scholar]
  69. Russeil, D. 2003, A&A, 397, 133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  70. Russeil, D., Adami, C., & Georgelin, Y. M. 2007, A&A, 470, 161 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  71. Russeil, D., Pestalozzi, M., Mottram, J. C., et al. 2011, A&A, 526, A151 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  72. Russeil, D., Zavagno, A., Mège, P., et al. 2017, A&A, 601, L5 [Google Scholar]
  73. Schuller, F., Urquhart, J. S., Csengeri, T., et al. 2021, MNRAS, 500, 3064 [Google Scholar]
  74. Schultheis, M., Chen, B. Q., Jiang, B. W., et al. 2014, A&A, 566, A120 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Sepúlveda, I., Anglada, G., Estalella, R., et al. 2011, A&A, 527, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Sewilo, M., Watson, C., Araya, E., et al. 2004, ApJS, 154, 553 [Google Scholar]
  77. Simon, R., Jackson, J. M., Rathborne, J. M., & Chambers, E. T. 2006a, ApJ, 639, 227 [Google Scholar]
  78. Simon, R., Rathborne, J. M., Shah, R. Y., Jackson, J. M., & Chambers, E. T. 2006b, ApJ, 653, 1325 [Google Scholar]
  79. Solomon, P. M., Rivolo, A. R., Barrett, J., & Yahil, A. 1987, ApJ, 319, 730 [Google Scholar]
  80. Stark, A. A., & Brand, J. 1989, ApJ, 339, 763 [Google Scholar]
  81. Stil, J. M., Taylor, A. R., Dickey, J. M., et al. 2006, AJ, 132, 1158 [Google Scholar]
  82. Tapia, M., Persi, P., Román-Zúñiga, C., et al. 2018, MNRAS, 475, 3029 [Google Scholar]
  83. Taylor, A. R., Gibson, S. J., Peracaula, M., et al. 2003, AJ, 125, 3145 [Google Scholar]
  84. Urquhart, J. S., Hoare, M. G., Lumsden, S. L., et al. 2012, MNRAS, 420, 1656 [Google Scholar]
  85. Urquhart, J. S., König, C., Giannetti, A., et al. 2018, MNRAS, 473, 1059 [NASA ADS] [CrossRef] [Google Scholar]
  86. Watson, C., Araya, E., Sewilo, M., et al. 2003, ApJ, 587, 714 [Google Scholar]
  87. Whitaker, J. S., Jackson, J. M., Rathborne, J. M., et al. 2017, AJ, 154, 140 [Google Scholar]
  88. Wienen, M., Wyrowski, F., Menten, K. M., et al. 2015, A&A, 579, A91 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Wu, Y. W., Sato, M., Reid, M. J., et al. 2014, A&A, 566, A17 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  90. Yan, Q.-Z., Yang, J., Sun, Y., Su, Y., & Xu, Y. 2019, ApJ, 885, 19 [Google Scholar]
  91. Zetterlund, E., Glenn, J., & Rosolowsky, E. 2017, ApJ, 835, 203 [Google Scholar]
  92. Zucker, C., Speagle, J. S., Schlafly, E. F., et al. 2019, ApJ, 879, 125 [Google Scholar]
  93. Zucker, C., Speagle, J. S., Schlafly, E. F., et al. 2020, A&A, 633, A51 [CrossRef] [EDP Sciences] [Google Scholar]

Appendix A: Fitting method

The method developed to fit the spectra is based on the Prony/MUSIC method (Potts & Tasche 2010, 2013; Plonka & Tasche 2014; Peter et al. 2015; Liao & Fannjiang 2016). This method is used in signal processing to recover signal parameters from a sample of data with noise. The fitting method is designed to reproduce the observed spectra and extract the characteristic of the lines observed (FWHM, velocity peak, and peak intensity). This method works in three steps, as described below.

  1. Line spectrum extraction, noise estimate.

  2. Detection of line peaks and associated VLSR.

  3. Full multi-Gaussian modelling of the lines.

1. Spectrum extraction and noise estimate.

The first step consists in extracting the spectrum observed at the position of the Hi-GAL source and estimating its noise. The spectral resolution and the Galactic coverage of each survey–molecule pair are given in Table 1. The spectral channel corresponding to the position (l, b) of a given Hi-GAL source is extracted and its outliers are masked: NaNs, temperatures outside the [−10K, 100K] range. Then we estimate the noise level (σ in K) in the spectrum. Two estimates are made: a local estimate of the noise is carried out in a 10 km s−1 range located at the high end of the spectral channel (where lines are assumed to be absent). A global estimate of the noise is also carried out on the signal, resulting from the subtraction [channel – channel averaged over 2 pixels (sliding window)]. Then we select the noise value. By default, the noise value determined with the local estimate is chosen except for cases where the values are outliers (therefore masked). In this case the noise value obtained from the global estimate is chosen.

2. Line detection and associated velocity peak in the spectrum.

The spectrum is deconvolved from a Gaussian of fixed size adapted to each survey–molecule pair. The FWHM of this Gaussian is fixed a priori and is set by the median FWHM obtained on fiducial fits made for each line–survey pair (4000 spectra per pair). The values are given in Table A.1. The deconvolved signal is in first approximation an inhomogeneous train of diracs. As a first approximation, its Fourier transform is a sum of complex exponentials whose phase is modulated by the VLSR centroid of each peak. Prony’s method is suitable for compressing the representation of this signal into an exponential sum. The approximation takes place here in Fourier space. The order compression of the model is based on a singular value decomposition of the Hankel matrix associated with the discrete Fourier transform of the auxiliary signal which makes it possible to separate and threshold the significant components of the exponential sum by thresholding above the noise level ≥3 or 5σ. This method is described in Potts & Tasche (2010), Peter et al. (2015), Liao & Fannjiang (2016).

Table A.1.

Initial FWHM values given for the spectral lines fit.

3. Multi-Gaussian fit of the spectrum.

A separable nonlinear least-squares fit is performed taking the median FWHM for each line and the VLSR obtained from the Prony method (Golub & Peyrera 2007; Osborn 1975, 2007). The fit is constrained in FWHM and is based on the Trust Region Reflective algorithm (TRF, Branch et al. 1999). The algorithm runs up to give a minimum difference between the fitted and the observed sprectrum. For each source, the final output of the fit shows the original spectrum and the lines fitted with the multi-Gaussian fit process described above, together with their associated VLSR peak, as shown in Fig. 2.

Appendix B: Dips detection in the H I profiles

In order to solve the KDA, we developed a code to analyze the dips in H I spectra (see Sect. 3.4). The code first identifies all the extrema (Fig. B.1 upper panel) and then selects only the ones above three times the noise level (σnoise). A dip is then defined as the intersection between a rising edge and a falling edge (displayed by the arrows in Fig. B.1 – middle panel) linking a minimum to its two nearest maxima. In addition, the amplitude of the two edges must be larger than 3σnoise to be considered as significant. In this frame, the smaller and/or the overly irregular (partially refilled, asymmetric, one edge below 3σnoise) dips are not kept by the code (Fig. B.1 lower panel).

thumbnail Fig. B.1.

Example of HISA identification for the source HIGALBM329.2039+0.7101 with assigned VLSR = −63.562 km s−1. The H I profile is plotted in black. The horizontal thick gray line shows the rms noise (σnoise) and the red dashed line shows the 3σnoise threshold. The black vertical dashed line and the gray shadow are the velocity of the source and the 3 km s−1 tolerance. The upper panel shows the identification of minima and maxima. The middle panel shows the identification of the falling and rising sides of the dip. The lower panel shows the final dips selection.

Appendix C: BGPS and ATLASGAL comparison

To better evaluate the reliability of our distance determination we use the comparison of ATLASGAL and BGPS catalogs as a reference. There are 697 sources with distance in common in these two samples. Figure C.1(a) shows that they have velocity in good agreement with 92% (634 sources) of them having a velocity difference ≤5 km s−1. Figure C.1(b) shows the typical shape due to the differences in the distance disambiguation (as for Figures 10 and 11 sources in the ±12° toward the Galactic center direction were discarded). We find that ∼50% (324) of sources have a distance difference ≤0.5 kpc (the typical uncertainty of the BGPS distances).

thumbnail Fig. C.1.

Velocity (a) and distance (b) comparison between paired BGPS and ATLASGAL sources. The red dash-dotted line is the y = x line.

Appendix D: Additional figures

thumbnail Fig. D.1.

Same as Fig. 12(a) and (b) but without the error bars.

Appendix E: The 3σ catalog

In addition to the 5σ catalog, which is constructed from velocity extracted from lines with post-fit peak intensity with S/N > 5 we also produce (and deliver it as Table E.1) a 3σ catalog (selecting lines with post-fit peak intensity with S/N > 3). With this lower selection level we can assign velocity to 4282 additional sources. The comparison of these catalogs (see Fig. E.1) shows that among the sources with velocity in both catalogs, 89.5% have similar velocity (VLSR ≤ 5 km s−1) and 90.8% have a distance difference ≤1 kpc.

thumbnail Fig. E.1.

Comparison of the 5σ and the 3σ catalogs in terms of velocity (upper panel) and distance (lower panel). The red lines show the ±5 km s−1 and ±1 kpc limits respectively.

All Tables

Table 1.

Surveys used to assign radial velocities to the Hi-GAL compact sources.

Table 2.

Methods used for distance disambiguation with their associated priority codes.

Table 3.

Velocity and distance for the Hi-GAL compact sources

Table A.1.

Initial FWHM values given for the spectral lines fit.

All Figures

thumbnail Fig. 1.

Flowchart for velocity determination.

In the text
thumbnail Fig. 2.

Example of fitted molecular spectra for VLSR detection for the Hi-GAL source HIGALBM340.2690-0.0542. The spectra are (a) NANTEN 12CO(1–0), (b) MOPRA 12CO(1–0), (c) TRUMMS 12CO(1–0), (d) SEDIGISM 13CO(2–1), (e) SEDIGISM C18O(2–1), (f) MALT90 HCO+. In all panels the black, red, and green curves are the extracted profile, the fitted lines, and the residual, respectively. The background gray area is the ±σnoise value where σnoise is the root mean squared value of the noise amplitude (see Appendix A). The horizontal blue line shows the 5σ level.

In the text
thumbnail Fig. 3.

Example of the morphological analysis of the molecular emission for the Hi-GAL source HIGALBM340.1288-0.1837. The upper panels show the MOPRA 12CO(1–0) (a) and SEDIGISM 13CO(2–1) (b) fitted spectra. The labels in these panels are the same as in Fig. 2. The lower panels show the channel maps at −3.4 km s−1 (c), −52.5 km s−1 (d), and −125.1 km s−1 (e). The red ellipse is the footprint of the Hi-GAL source while the yellow ellipses are the footprints of the structures extracted by SExtractor.

In the text
thumbnail Fig. 4.

Example of HISA identification for the source HIGALBM329.2039+0.7101 with assigned VLSR = −63.562 km s−1. The H I profile is plotted in blue, and the identified dips are shown as red dots. The horizontal gray thick line shows the ±σnoise and the dashed red line shows the 3σnoise level. The black vertical dashed line and the gray shadow are the velocity of the source and the 3 km s−1 tolerance.

In the text
thumbnail Fig. 5.

Example of extinction layer identification for the source HIGALBM1.7647-0.4806. The detection used the third derivative of the extinction curve. Here, one extinction layer is identified (red dot and dashed line) at a distance of 3.7 kpc.

In the text
thumbnail Fig. 6.

Longitude–distance plot in the range of the Perseus arm streaming motion correction (l = 90° and l = 160°). Blue and red symbols are the source distance before and after the correction. Sources that draw horizontal alignment are sources with the same stellar distance assignation.

In the text
thumbnail Fig. 7.

Spatial distribution of Hi-GAL sources with determined radial velocities (a). Panel b: Hi-GAL sources superimposed with no assigned velocity. Sources not covered by any molecular surveys are shown in red and sources with no line fitted from an overly noisy extracted molecular spectrum are shown in blue. The vertical lines separate the different Galactic plane quadrants (Q1, Q2, Q3 and Q4).

In the text
thumbnail Fig. 8.

Longitude–velocity distribution of all Hi-GAL sources in the range −60° < l <  60° for which we have obtained a velocity. The gray-scale image shows the distribution of the molecular gas traced by the integrated 12CO emission from Dame et al. (2001).

In the text
thumbnail Fig. 9.

Comparison between paired ATLASGAL and Hi-GAL source velocity. The separation of the sources (in arcsec) is coded in color.

In the text
thumbnail Fig. 10.

Distance comparison between paired ATLASGAL and Hi-GAL sources (3528 sources). (a) Sources with velocity difference less than 5 km s−1 and longitude outside ±12° around the center and anti center directions. (b) Same figure but identifying the sources placed at the tangent distance in ATLASGAL (in red), and the sources with velocity lower than 10 km s−1 (in green). The red dash-dotted line is the y = x line.

In the text
thumbnail Fig. 11.

(a) Velocity (2285 sources) and (b) distance (2050 sources) comparison between paired BGPS and Hi-GAL sources. The red dash-dotted line is the y = x line. Same color coding as in Fig. 10.

In the text
thumbnail Fig. 12.

Velocity–distance plot of Hi-GAL sources in regions from Zucker et al. (2020) for sources in quadrants 1 and 2 (a) and in quadrants 2 and 4 (b), respectively. The velocity (±5 km s−1) and the distance (and its uncertainty) of the sight lines as listed in Zucker et al. (2020) are plotted in black. In panel a the Hi-GAL sources are plotted in red (for W3 and GemOB1), blue (for W5 and L1293), and green (for W5, CMaOB1 and RCW38). In panel b the Hi-GAL sources are plotted in red (for Carina, North America, and M20), blue (for Cygnus X), and green (for IC 2944 and M17).

In the text
thumbnail Fig. 13.

Herschel PACS 160 μm image of the W3–W4 Galactic star forming regions. The Hi-GAL sources are overlaid as points, color-coded by their velocity: green for sources in the −55 to −40 km s−1 range, blue for unfitted sources, and red for the others with velocities outside the [−55, −40] velocity range.

In the text
thumbnail Fig. 14.

Three-dimensional (l, b, D) diagram for the W3–W4 regions. 478 (43%) sources are observed in the range 1.8–2.3 kpc, the range of distances obtained by Zucker et al. (2020) for these two star forming regions. The color coding is the same as in Fig. 13, that is, sources in the range 1.8–2.3 kpc are coded in green, sources outside this range are coded in red, and unfitted sources (with no distance information) are coded in blue; the distance layer for these latter sources is put arbitrarily at 3 kpc for clarity in the plot.

In the text
thumbnail Fig. 15.

Velocity vs. distance for the W3–W4 regions. The 1544 Hi-GAL sources are represented together with the selected velocity range (−55 to −40 km s−1) in green and the selected distance range (1.8–2.3 kpc) in red.

In the text
thumbnail Fig. B.1.

Example of HISA identification for the source HIGALBM329.2039+0.7101 with assigned VLSR = −63.562 km s−1. The H I profile is plotted in black. The horizontal thick gray line shows the rms noise (σnoise) and the red dashed line shows the 3σnoise threshold. The black vertical dashed line and the gray shadow are the velocity of the source and the 3 km s−1 tolerance. The upper panel shows the identification of minima and maxima. The middle panel shows the identification of the falling and rising sides of the dip. The lower panel shows the final dips selection.

In the text
thumbnail Fig. C.1.

Velocity (a) and distance (b) comparison between paired BGPS and ATLASGAL sources. The red dash-dotted line is the y = x line.

In the text
thumbnail Fig. D.1.

Same as Fig. 12(a) and (b) but without the error bars.

In the text
thumbnail Fig. E.1.

Comparison of the 5σ and the 3σ catalogs in terms of velocity (upper panel) and distance (lower panel). The red lines show the ±5 km s−1 and ±1 kpc limits respectively.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.