GLOSTAR -- Radio Source Catalog I: $28^{\circ}<\textit{l}<36^{\circ}$ and $|b|<1^{\circ}$

The GLOSTAR survey will study the star formation in the Galactic plane between $-2^{\circ}<\textit{l}<85^{\circ}$ and $|b|<1^{\circ}$ with unprecedented sensitivity in both, flux density ($\sim$ 40 $\mu Jy$ beam$^{-1}$) and range of angular scales ($\sim$ 1."5 to the largest radio structures in the Galaxy). In this paper we present the first results obtained from a radio continuum map of a 16 square degree sized region of the Galactic plane centered on $\textit{l} = 32^{\circ}$ and $b = 0^{\circ}$ ($28^{\circ}<\textit{l}<36^{\circ}$ and $|b|<1^{\circ}$). This map has a resolution of 18"and sensitivity of $\sim$ 60-150 $\mu Jy$ beam$^{-1}$. We present data acquired with the VLA in D-configuration. Two 1 GHz wide sub-bands were observed simultaneously and centred at 4.7 and 6.9 GHz. These data were calibrated and imaged using the $\textit{Obit}$ software package. The source extraction has been performed using the BLOBCAT software package and verified through a combination of visual inspection and cross-matching with other radio and mid-infrared surveys. The final catalog consists of 1575 discrete radio sources and 27 large scale structures (including W43 and W44). By cross-matching with other catalogs and calculating the spectral indices ($S(\nu) \propto \nu^\alpha$), we have classified 231 continuum sources as HII regions, 37 as ionization fronts, and 46 as planetary nebulae. The longitude and latitude distribution and negative spectral indices are all consistent with the vast majority of the unclassified sources being extragalactic background sources. We present a catalog of 1575 radio continuum sources and discuss their physical properties, emission nature and relation with previously reported. These first GLOSTAR results have increased the number of reliable HII regions in this part of the Galaxy by a factor of four.


Observation Strategy
The Karl G. Jansky Very Large Array (VLA) was used in Dconfiguration to observe the C-band (4−8 GHz) continuum emission.The correlator setup consisted of two 1 GHz wide subbands, centered at 4.7 and 6.9 GHz.The primary beam FWHMs are ∼ 9 and ∼ 6 at 4.7 and 6.9 GHz, respectively.Each sub-band was divided in eight spectral windows of 128 MHz, and each spectral window observed 64 channels of 2 MHz.Higher spectral resolution windows were used to cover the most prominent methanol maser emission line at 6.7 GHz, formaldehyde absorption at 4.8 GHz, and seven radio recombination lines.The results of these line observations will be reported in forthcoming papers.The chosen setup avoids strong and persistent Radio Frequency Interference (RFI) seen at 4.1 and 6.3 GHz and allows a rough estimation of spectral indices.
The region of the GLOSTAR-VLA survey presented in this paper was observed over a total of eight epochs and a total of 40 hours of telescope time (project ID 13A-334).The observations were taken during the spring of 2013 (see Table 1 for detailes).Each epoch consisted of observations of five hours.The  ×1 • was covered by about 630 pointings.The pointings were on a hexagonal grid with a spacing of θ hex = 3. 25.This corresponds to θ B /2, where θ B = 6. 5 is the FWHM width of the primary beam at the central frequency of the higher frequency continuum band (6.9 GHz).Each grid position was observed twice for 11 seconds which, after considering the slewing time, resulted in a total integration time of 15 seconds per position.The theoretical noise level from these observations is ∼90 µJy beam −1 per position and per subband.This noise is improved by a factor of two by combining the two observations of each position and both 1 GHz sub-bands (theoretical noise ∼45 µJy beam −1 ).

Calibration, Data Reduction and Imaging of Continuum data
The data were calibrated using the Obit software packages (Cotton 2008), which was designed for handling radio astronomy data.Obit applications can be accessed through a Python interface (ObitTalk).Obit also inter-operates with Classic AIPS (Greisen 2003) and has access to its tasks.Observations made with the VLA are well standardized, as is the data calibration, and as such calibration pipelines are now available.These implement the data reduction and calibration using amplitude, phase, and bandpass calibrators (for an example consult the CASA pipeline webpage: https://science.nrao.edu/facilities/vla/dataprocessing/pipeline).Following the methodology of these pipelines, we have developed a pipeline in Obit to calibrate the GLOSTAR-VLA data with some differences.The details of these differences are described in what follows: 1.One of the first steps that most pipelines apply is to flag 3the first record of every scan to consider the slew of the antennas.As our target scans last only a few seconds, ∼11 s, this flagging covers an important fraction of our science data.Furthermore, the distance between the target pointings is only a few arcminutes so that the slewing time is minimised.Thus, our pipeline only performs the flagging of the first record on scans that observe calibrators.
2. The GLOSTAR observations use a mixed setup of continuum and spectral lines, which are not handled well by pipelines.Thus, the next step of our pipeline is to distinguish between the different observational sets; i.e., for continuum, radio recombination line, formaldehyde and methanol data, and then load solely the continuum data.
3. A standard VLA calibration was applied in the Obit package which alternates editing of data affected by instrumental problems or interfering signals with various calibration steps.External calibration includes VLA system temperature (T sys ) calibration, group delay, bandpass, amplitude and phase, and polarization calibration.After a Hanning smoothing of the data, an automated search for outliers in the data is performed by a comparison against running medians in time and frequency for each data product.Calibrator data is checked for adequate signal to noise ratio by an RMS/mean test.After the initial pass at data editing, the various parallel hand calibration steps are performed applying further editing based on outliers in the calibration solutions.Following the first pass at calibration, a second set of automated editing steps is done on the calibrated data.The calibration tables are then reset, the flagging table kept and the calibration is repeated.This assures that the calibration is based on fully edited data.After flagging cross hand outliers, the polarization calibration is performed.Per pointing flagging is also included as part of the imaging.An initial, shallow CLEAN is done to determine the approximate flux density in the field.Data for that pointing is then clipped at 5 times the estimated pointing integrated flux density.
4. The Obit task MFImage was used for the continuum imaging (see Cotton et al. 2018 for details).First, we imaged the data after a shallow cleaning to identify strong sources that needed to be imaged in outlier fields in the final imaging.This program uses wide-band multifrequency synthesis by dividing the observed bandwidth into frequency bins, i.e. sections of the sub-band, with a fractional bandwidth of less than 5%.The number of frequency bins for our observation is eight.These are narrow enough such that effects of variable spectral index and antenna pattern variations are minor performing a joint spectral deconvolution.
5. The CLEANing is driven by a weighted average of the residual images from these frequency bins.Each pointing is imaged separately with a 0.28 × 0.28 square degrees sized area, centered on the phase center, using a Briggs' weighting with robustness parameter 0 and using its native restoring beam and with a pixel size of 2. 5. Outlier fields are used if a source with a integrated flux density of more than 10 mJy is known from the NVSS or a previous imaging run from our own data out to a distance of 0.4 degrees.Multi-scale clean was used with three different sizes: a delta function, and tapers of 20 and 40 pixels (50 and 100 ), respectively.Cleaning stopped when 5000 iterations were reached or the maximum in the residual image was below a level 0.25 mJy.
Phase only, and amplitude plus phase self-calibration were applied for images with sufficiently bright peaks (> 10 mJy for phase only and > 100 mJy for amplitude plus phase self-calibrations).The shortest baselines of 30 meter allowed imaging of scales up to 4 .
6.After imaging the data from each pointing, the images were merged together in Obit.Since all pointings had a slightly different native beam size and orientation due do different uv-coverage, all pointings were smoothed in a first step to a circular synthesized beam of 18 in diameter.This beam size corresponds roughly to the major axis of the largest beam size of all the pointings.Primary beam corrections were applied during the merging of the images.After combining all the pointings at each frequency bin, a combined image at the reference frequency was generated.
7. The maximum sensitivity can be obtained by a weighted combination of the overlapping observed pointing images onto a set of images covering the surveyed region.All else being equal, weighting by the square of the antenna power pattern in a given direction gives the optimum sensitivity.
The mosaic formation process for each image plane is given by the summation over overlapping pointing images: where A i (x, y) is the antenna gain of pointing i in the direction (x, y) and I i (x, y) is the pointing i pixel value interpolated to the direction (x, y) and M is the mosaic image.After combination, the spectrum in each pixel is redetermined.Note, the pointing images have already been corrected by the antenna gain pattern.A more detailed discussion of these image considerations is given by Cotton et al. (2018).

Radio continuum map
A mosaic of the 16 square degrees that results from the 8 sub images integrated and discussed in this paper is presented in the upper panel of Fig. 1.The effective frequency of the integrated image is 5.8 GHz.Although the observed region extends slightly beyond |b| = 1, the noise increases significantly in these regions due to poorer uv-coverage and differences in the beam sizes of the different sub-maps and therefore these edges have been removed.The final map has a spatial resolution of 18 .In the lower panel of this figure, we present a map of the noise distribution created using SExtractor (Bertin & Arnouts 1996;Holwerda 2005).A full description of the maps can be found in Sect.3.2.In Fig. 2 we present a histogram of the pixel values from which we estimate the sensitivity of the survey by fitting a Gaussian profile to the pixels clustered around zero.The standard deviation of the distribution of the pixel valuess is σ rms = 150 µJy beam −1 .However, as can be seen from the lower panel of Fig. 1 there are strongly localized variations in the noise across the map particularly towards prominent complexes (e.g., W43, W44, G34.26+0.15),most of which are concentrated towards the midplane.This is a consequence of the poor uv-coverage of the observations themselves.These large variations of the noise across of the GLOSTAR map produce the non-Gaussian profile of the pixels values in Fig. 2. In regions free of extended emission the noise level is around 60 µJy beam −1 .In Fig. 3 we show the noise as a function of Galactic latitude, which reveals a steep increase in noise closer to the Galactic mid-plane from ∼100 µJy beam −1 to ∼450 µJy beam −1 .
To measure the accuracy of the astrometry of the GLOSTAR map, we have matched the GLOSTAR unresolved sources (Yfactor < 1.2; this factor is the ratio of the integrated and peak flux densities, will be fully described in Sect.5) with more compact radio sources detected by the 5 GHz CORNISH radio continuum survey (Purcell et al. 2013) and measured the positional offsets between them.The offset in and b are plotted in Fig. 4, which ). Lower panel: Full resolution noise background map determined by SExtractor using a mesh size of 80 × 80 pixels and threshold of 5σ.The yellow boxes correspond to the complexes that have been identified (see Sect. 3.2.3 and    are clearly clustered around the centre; the mean values are 0. 2 ± 0. 9 and -0. 4 ± 0. 9, respectively, where the uncertainties are the standard deviations in the measurements.We therefore conclude that the GLOSTAR astrometry is reliable.We estimate the combined uncertainty on position, by adding the standard deviations in quadrature, to be 1. 2, which is an order of magnitude lower than our angular resolution (18 ).

Source extraction
The source extraction has been performed using the BLOBCAT software package developed by Hales et al. (2012).BLOBCAT is a Python script that utilizes a flood fill algorithm to detect and identify blobs, or islands of pixels representing sources, in two- dimensional astronomical radio-wavelength images.The package has been successfully used to create the source catalog for The HI/OH/Recombination line survey of the inner Milky Way (THOR; Beuther et al. 2016) survey, and has been adopted by us to facilitate the comparison with this complementary VLA continuum and line survey.We have followed the same method used by the THOR team at 1.4 GHz, which is outlined in Bihr et al. (2016).
The GLOSTAR radio continuum image has positiondependent noise and so we need to produce an independent noise map of the region to perform the automatic source extraction.We use the rms estimation algorithm within the SExtractor package that has been proven to be trustworthy in creating noise maps from radio data (Bondi et al. 2003;Huynh et al. 2005).This algorithm defines the rms value for each pixel in an image by determining the distribution of pixel values within a local mesh until all values are around a chosen σ value.Most real emission is removed from the noise image and the determined noise map contains the correct noise level.We use a detection and analysis threshold of 5σ, a minimum size of 5 pixels, and a mesh size of 80 × 80 pixels 2 (following the calculation of Hales et al. 2012).The resulting noise map is shown in the lower panel of Fig. 1, its large scale emission is consistent with the large scale emission previous reported by Bihr et al. (2016) at 1.4 GHz.
To perform the automatic source extraction, we use the noise map as an input into BLOBCAT along with the FITS image of the region under study.We applied a detection threshold of (dSNR) of 4 and a minimum source size of 5 pixels in diameter (∼13 ).This resulted in the extraction of all sources that have flux above 4 times the local noise (i.e., the ratio between the input image and the rms map) and have a size comparable or larger than the beam; these criteria reduce the number of false detections, particularly in complex regions.
These input parameters resulted in the detection of 1975 sources within the 16 square degrees map.This sample is, however, likely to contain a significant number of artifacts and emission from large scale structures that have been over-resolved, resulting in the emission being separated into multiple compo-nents (e.g., the W44 supernova remnant at = 34.6 • , b = 0.5 • ), and from the blending of discrete sources towards star formation regions found towards the Galactic mid-plane.In the following subsections, we will describe how we have dealt with some of these issues while in Sect. 4 we will describe the steps taken to identify and remove artifacts, and check the consistency of our catalog.

Superposition of distinct radio sources
In the visual inspection process, we flagged any sources that appeared to consist of two or more distinct emission peaks.We have grouped these into three categories: 1) elongated 'double lobed' sources; these look like extended emission regions powered by twin jets from active galactic nuclei (AGN; Treichel et al. 2001;Malarecki et al. 2015;Neff et al. 2015); 2) superposition of a compact bright radio source and a larger more diffuse radio source; 3) superposition of spherical radio sources; these are likely to be H ii regions in the same star forming complexes.An example of each of these three types is shown in the upper, middle and lower panels of Fig. 5.
We have made no attempt to separate the emission for the double lobed sources as it is likely to be associated with the same object.We have also made no attempt to separate the emission in cases in which a bright compact source is coincident with larger scale structures as there is not enough information to split them into reliable components as any measurement will be highly uncertain.There are a total of 51 of those cases.We have, however, separated the emission into two or more sources where the overlap is small and the integrated flux density can be reliably determined (see lower panel of Fig. 5 as an example).
Inspection of the GLIMPSE 4.5, 8.0, and 24 µm mid-infrared images of these blended sources identified 20 cases for which the emission peaks appear to be associated with distinct sources and could be separated.We split them into two or three sources according to the number of infrared counterparts.We use the CASA software to select and split the regions in which emission is above 4σ and then calculate their parameters such as flux and size from their emission area.For some sources, e.g.uniform and compact radio emission, we make a Gaussian fit using the task imfit also from the CASA software suite.An example of these type of sources is shown in the lower panel of the Fig. 5.This process resulted in 44 sources being identified from the 20 blended sources examined.These split sources are treated in the source catalog (see Table 7) as individual sources but we also include a flag to the entry so these can be easily identified.The peak and centroid position values are considered the same for these cases.

Compact sources in noisy regions
Visual inspection of the extracted sources and comparison with other catalogs (some of which are discussed in Sect.4.1) revealed a small number of compact (15), low surface brightness sources (with peak flux ∼1 mJy beam −1 ) that are located near regions of bright extended sources.An example is the source in Fig. 6.BLOBCAT does not extract these objects due to the higher localised noise surrounding them and poor dynamic range of snapshot interferometric observations.We have calculated their physical properties again using the CASA task imfit.This performs an elliptical Gaussian fit to the region identified around the source.The source parameters for these objects are included in the final catalog presented in Table 7 but, as with the split Fig. 5. Examples of various types of radio sources (see text).The contour levels start at 3σ and increase in steps determined using the dynamic range power-law fitting scheme where D = N i + 3 where D is the dynamic range, S peak /σ, N is the number of contour, in this case 6, and i is the power-law that determines the separation between consecutive contours; this is a modified version of a scheme developed by Thompson et al. 2006.The red hatched circle shown in the lower left corner of the map indicates the GLOSTAR beam size.The cyan boxes indicate the two sources identified and the region from which the flux has been estimated.Fig. 6.In the upper panel we show an example of an instance where a bright extended source has increased the local noise such that a nearby compact 7σ source located to the south-west has been missed.The position of this source is indicated by the cyan arrow.In the lower panel we show a smaller region centred on the missed compact source.sources, we also include a flag to the entry so these can be easily identified.

Large scale structures
We have identified 27 regions with large-scale emission by comparing the distribution of radio emission to that of the 8 µm maps extracted from the GLIMPSE archive.We have drawn a box around the coherent infrared structures that are morphologically correlated with the radio emission, but for which the radio emission has been broken up into 2 or more components.These 27 regions are indicated by squares in the upper and lower panels of Fig. 1.It is clear that these regions are also correlated with the noisiest regions in the map where much of the large angular scale emission is poorly recovered by the observations.In Table 2 we give the name of each structure, the extent of the boxed region in and b and the integrated flux of the total emission in the box minus the emission from any compact sources that are considered distinct sources.The name of each structure has been constructed from the central coordinate of the enclosing box.
In Fig. 7 we show some examples of these large scale structures.All extended radio sources in these boxed regions are considered to be part of the large-scale structure and have been excluded from the final catalog of radio sources.However, compact radio sources that are considered likely to be associated with discrete objects and not part of the larger-scale structure have been retained in the final catalog as distinct sources.
In total, 195 sources are associated with the 27 large-scale structures identified.Of the 27 large scale regions identified, two stand out as being particularly significant; these are the W43 star forming complex (29 • ≤ ≤ 32 • and |b| ≤ 1 • ; Blum et al. 1999) and the well known supernova remnant W44 (Clark & Parkinson 1975).Between them they contain 92 of the 195 individual fragments that have been associated with large scale emission regions.We note that radio recombination lines have been detected towards a radio source (G034.793−00.711)seen towards the south-eastern edge of W44 (Lockman 1989;G34.758-0.681).In the lower right panel of Fig. 7 we present a map of the radio emission seen towards W44 and indicate the position of this nearby H ii region with an arrow.The association of RRL emission with this radio source provides strong evidence that this source is a H ii region that is coincident with the edge of this supernova remnant (G034.8−00.7;see Ortega et al. 2010 for detailed discussion of this H ii region) rather than part of W44 itself.Therefore, we treat this as a separate source, which we classify as a H ii region.
The W43 region has two prominent components: W43-main and W43-south.W43-main hosts a giant H ii region (Blum et al. Fig. 7. Example of radio emission associated with large-scale structures.The background images are GLIMPSE 8 µm while the yellow contours are the GLOSTAR 5.8 GHz radio continuum emission.These regions often show coherent infrared structures that are morphologically correlated with most of the radio emission (see examples presented in the upper panels), however, the correlation is not always present but the radio emission is clearly correlated (see examples presented in the lower panels).These large-scale structures have been excluded from the final catalog of radio sources.The contours are determined as described in Fig. 5 and the red filled circle shown in the lower left corners shows the GLOSTAR beam.The magenta circles indicate the peak positions of radio sources associated with these regions.In the lower-right panel the W44 SNR is shown; the cyan arrow points to the H ii region (G034.793−00.711)located close to the edge of this complex (see text for details).
1999) and has been classified as a mini star-burst region (Motte et al. 2003).Due to the complexity of the region and our limited uv-coverage, the central parts of both regions are not well reconstructed in the radio emission map.That could also be a reason for the different integrated flux density measurement of the central part of W43 main between the THOR team, 55.55 Jy at 2 GHz (Bihr et al. 2016), and our value, 13.43 Jy at 6.0 GHz.However, our observations have been able to recover very well much of the emission from the shell structure of W44 (see lowerright panel of Fig. 7).This extended object is ∼30 pc across with thermal x-ray emission seen towards its center (Clark & Parkinson 1975).We measure an integrated flux density of 2.96 Jy, which is significantly larger than the value of 1.2 Jy determined by Wynn-Williams et al. ( 1981) from 5 GHz VLA observations; this new integrated flux density estimation is due to our higher sensitivity to more extended emission.
Our sample of large-scale emission structures also includes three other well known regions.The first of these is the very bright ultra-compact H ii region G34.3+0.2 (G034.260+00.125 in Table 2).Garay et al. (1986) estimated the integrated flux density at 5 and 8 GHz, to be ∼1.5 Jy and ∼2 Jy respectively.We calculate the flux to be significantly higher at ∼10 Jy but we are unable to fully resolve the structure of this object with the resolution of the current GLOSTAR data.Our higher integrated flux density determination is likely to be caused by our sensitivity to larger scale emission that its resolved out in the higher angu-lar resolution data of Garay et al. (1986).The other two regions worth mentioning are G035.506−00.026 and G030.462+00.449,both of which have counterparts in the catalog of star-forming regions reported by Rahman & Murray (2010) (complexes 17 and 23 respectively).The W44 and W43 regions have also counterparts in this catalog.Therefore, we have recovered four of the most luminous free-free sources in the Galaxy.
Searching the literature we find that 10 of these large scale structures, including W44, have been previously identified as SNR (Green 2014); this corresponds to over a third of this sample.Where a match has been made we have adopted the catalog name from Green (2014) in Table 2.

Catalog completeness and reliability
The BLOBCAT source extraction method was verified by the THOR team (Bihr et al. 2015;Wang et al. 2018).They performed a detailed set of completeness tests.They choose a region of 0.5 × 0.5 degrees with a constant noise level and added artificial 2D Gaussian sources with the size of the beam and different peak intensities.They measured the fraction of sources recovered as a function of intensity and found that at a threshold of 7σ the extraction algorithm was 95% complete.Given that our data are similar to the THOR data (both are VLA continuum surveys and have similar resolutions) we have simply adopted their results.
The CORNISH team used a similar flux threshold (7.2 σ; Purcell et al. 2013) as the minimum required for inclusion in their catalog.This was determined by calculating the number of spurious detections expected as a function of σ and set at a level above which no spurious detections would be expected (see discussion by Medina et al. 2018).The CORNISH survey consisted of 5.6 × 10 8 beams and so a high threshold was required to avoid any false detections.The region presented here consists of approximately 1000 times fewer beams (∼ 5.65 × 10 5 beams) and so we can set our threshold for reliable sources at a lower level.Based on Gaussian statistics, the number of spurious sources we would expect in our catalog using 5σ flux threshold is < 1 and so this is the level above which sources in the catalog are considered to be highly reliable.We find another ∼250 radio sources between 4 and 5σ of which we estimate 7% are likely to be spurious (∼17 sources); we include these in the final catalog as the vast majority are likely to be real sources and have counterparts in other surveys, but these are considered to be less reliable.

Verification of the Source Catalog
As discussed in the previous section, only a small number of spurious sources are expected in our final catalog.However, due to the nature of these interferometric snapshot observations, the uv-plane is relatively sparsely sampled, which can result in significant number of imaging artifacts in the restored maps.As a consequence the number of spurious sources expected from Gaussian statistics is, therefore, a lower limit to the actual number of spurious sources detected.
To improve the reliability of the catalog produced by BLOBCAT we need to identify any spurious noise peaks and imaging artifacts (sidelobes and emission from over-resolved structures).We have already taken emission from over-resolved structures into account by identifying the large-scale structures as described in Sect.3.2.3;this reduces the number of sources in the catalog to 1819.Next we assume that any radio source that has a counterpart in a published radio or mid-infrared catalog is real; this is because it is extremely unlikely that a false detection would randomly appear in at same position in other independent surveys.

Complementary surveys
A number of complementary multi-wavelength surveys have produced catalogs of sources located in the Galactic plane that we have used to verify if the sources recovered from the GLOSTAR map are real and to help determine their nature.Below we provide a brief summary of the surveys used in this paper.In all cases we have used a 10 radius to identify matches with the exception of the ATLASGAL dust emission; this is because the dust emission tends to be more extended.The surveys selected are either well matched in resolution (e.g.WISE, AT-LASGAL, MAGPIS and THOR, with angular resolutions of 6-19 ) or in wavelength (e.g.CORNISH at 5 GHz).

CORNISH
The Co-Ordinated Radio 'N' Infrared Survey for High-mass star formation (CORNISH; Hoare et al. 2012) is a sensitive and highresolution 5 GHz radio continuum survey that focuses on the northern GLIMPSE region (10 • < < 65 • and |b| < 1 • ).The survey was conducted using the Very Large Array before its upgrade in B-configuration, which provides an angular resolution of 1. 5 and has a 7.2σ sensitivity of ∼2.7 mJy beam −1 .
Table 4. Statistics of the matches between GLOSTAR and other published surveys.A match radius of 10 has been used centred on the peak of the GLOSTAR emission except for the ATLASGAL dust emission which is more extended than the radio and mid-infrared catalogs.

Survey
Wavelength Resolution rms level Num. of sources Num. of ( ) mJy beam  The CORNISH catalog consists of ∼2600 sources above 7σ (Purcell et al. 2013), 353 of which are located in the GLOSTAR region presented in this paper.We find a GLOSTAR counterpart for 327 of these (∼80%).Inspection of the emission maps for the brighter CORNISH sources reveals that 20 more are associated with extended or multi-peak emission sources found in GLOSTAR where the positions are less well constrained and consequently these sources were not matched.In Fig. 8 we show the peak flux distribution of all 353 CORNISH sources and the 26 sources where we have failed to find a GLOSTAR counterpart using either 10 search radius or through inspection of the emission maps.It is clear from this plot that all of the radio sources that are not picked up by GLOSTAR correspond to the weakest CORNISH sources.We have estimated the noise values from the GLOSTAR maps towards these 26 CORNISH position and find a mean noise value of 166 µJy beam −1 and a standard deviation of 154 µJy beam −1 (minimum and maximum noise values are 47 and 768 µJy beam −1 , respectively), which is significantly lower than the typical CORNISH 1σ (0.33 mJy beam −1 ) noise values (see Fig. 9).
There are two possible explanations for these missing sources: 1) the flux density is variable and has decreased between the time the CORNISH observations were completed and the GLOSTAR observations were made; and 2) these are spuri- ).In Table 5 we provide the source name and integrated and peak flux densities of the unmatched CORNISH sources (taken from Purcell et al. 2013) along the local rms noise determined from the GLOSTAR maps.
In the case of variable sources, we should expect to find approximately as many sources with significantly larger fluxes in CORNISH than measured in GLOSTAR and comparing the fluxes we find 7 matches for which this is the case (following Kalcheva et al. 2018 we set this criterion to be > 50% more flux).The radio names of these sources and the flux ratio are given in Table 6.The discrepancy between this number and that of the number of CORNISH sources not detected in GLOSTAR (Table 5) suggests that variability alone cannot fully account for the 26 CORNISH source not detected in GLOSTAR.The highest flux ratio is 3 and so the differences are relatively modest compared to those reported by Kalcheva et al. (2018) (range between 2.4 and 73.9).All of the sources are either marginally detected or unresolved, have integrated fluxes below 10 mJy, and all but one have negative spectral indices, and so the variability appears to be restricted to weak and unresolved extragalactic sources.We note that these radio properties are similar to those reported for T Tauri stars (e.g., Dzib et al. 2013), however, given these radio sources appear to be mid-infrared dark this possibility is considered less likely.Kalcheva et al. (2018) considered only H ii regions in their analysis, whereas none of the variable candidates we have identified have been classified as being H ii regions, PNe or radio stars.However, Kalcheva et al. (2018) only found one variable H ii region in the region being considered here and given the shorter temporal baseline (∼7 yrs) the lack of variable H ii regions and the lower levels of variation are not surprising.

MAGPIS
One of the first VLA surveys of a large fraction of the Galactic plane at 5 GHz was conducted in the early 1990s (350 • < < 40 • and |b| < 0.4 • ; Becker et al. 1994).This survey is now part of a larger collection of re-reduced radio surveys known as the Multi-Array Galactic Plane Imaging Survey (MAGPIS; Helfand et al. 2006). 4This part of the MAGPIS survey has a resolution of 6-9 and a 5σ sensitivity of ∼2.5 mJy beam −1 .This reprocessing has resulted in the identification of 1272 discrete sources including 218 that are located within the GLOSTAR region presented in this paper.Of these, we have found that 128 are coincident with 125 GLOSTAR sources (the lower resolution of GLOSTAR results in three cases where a GLOSTAR source is associated with 2 MAGPIS sources).A further 22 MAGPIS sources are coincident with sources associated with large scale structures discussed in Sect.3.2.3.In total, 150 MAGPIS sources are matched in GLOSTAR, which corresponds to approximately 70% of the MAGPIS 5 GHz sample.We have investigated a few of the MAGPIS sources not matched in GLOSTAR and find many of these to be associated with sources that are overresolved in MAGPIS.The offsets between the partially recov- ered fragments and the GLOSTAR source are larger than the 10 used to identify counterparts.

THOR
The H i, OH, Recombination line survey (THOR; Beuther et al. 2016;Bihr et al. 2015;Wang et al. 2018) is a 1.4 GHz radio continuum and line survey of ∼100 deg 2 of the Galactic plane in the first quadrant of the Milky Way.This survey covers the 21 cm H i line, 4 OH transitions, 19 radio recombination lines, and continuum emission between 1 to 2 GHz.The survey was conducted using the Karl G. Jansky Very Large Array (VLA) in C-array and has a spatial resolution of 10-25 .The data reduction strategy and source catalog for the first half of the survey ( = 14 • -37.9 • , = 47.1 • -51.2 • , and |b| ≤ 1.1 • ) are described in Bihr et al. (2016).The catalog contains a total of ∼4,400 sources, of which ∼1200 are spatially resolved and ∼1,000 are possible artifacts. 5 total of 1036 THOR sources are located in the region of the GLOSTAR survey described here; we are able to find a GLOSTAR counterpart for 711 of these.We have matched a further 61 THOR sources with GLOSTAR sources that have been identified as being part of large scale structures.In Fig. 10 we show the 1.4 GHz peak flux distribution of all THOR sources located in the GLOSTAR region and the flux distribution of those without a counterpart in GLOSTAR (grey and red-hatched histograms, respectively).It is clear from this plot that a higher proportion of the weaker THOR sources have not been matched.Spectral index measurements are only available in the THOR catalog for the brightest of these unmatched sources (203 of the 264 unmatched) and the majority of these have negative values (71% have α < 0, with a mean value = −0.85),which means they are weaker at 5 GHz than at 1.4 GHz and are hence less likely to be detected in the GLOSTAR map.The smaller proportion of unmatched sources that do have a positive spectral index (29% have α > 0, with a mean value = 0.28) are a little harder to ex-plain.However, we note that the THOR team acknowledge that ∼23% of their catalog are possibly artifacts, which corresponds well with the fraction of unmatched sources we have found here.

ATLASGAL
The APEX Telescope Large Area Survey of the Galaxy (AT-LASGAL; Schuller et al. 2009;Beuther et al. 2012) is the first high-resolution (≈ 20 FWHM) ground-based submillimetre (870 µm) survey of the thermal dust emission in the inner Galactic plane.It has produced an unbiased catalog of ∼10000 massive pre-and proto-stellar clumps (ATLASGAL Compact Source Catalog (CSC); Contreras et al. 2013;Urquhart et al. 2014a).Correlating the GLOSTAR and ATLASGAL catalogs is an excellent way to identify embedded or dust enshrouded objects such as UC H ii regions and planetary nebula (e.g., Irabor et al. 2018, Urquhart et al. 2007, 2009;Purcell et al. 2013;Urquhart et al. 2013).
As previously mentioned, the dust emission is more extended than the radio sources, the majority of which are unresolved and so rather than using a 10 matching radius we identify matches where the radio source falls within the 4σ boundary of the AT-LASGAL dust emission.There are 747 ATLASGAL sources located in the GLOSTAR region of which 132 are positionally coincident with a radio source.Of these, 92 have been previously identified as H ii regions (Kalcheva et al. 2018;Anderson et al. 2014;Giveon et al. 2005a) and one has been identified as a PN and one as a radio star by the CORNISH team (Purcell et al. 2013).The relatively low fraction of ATLASGAL matches is to be expected as only clumps hosting H ii regions will have a radio counterpart and H ii regions are only present in the most massive and evolved clumps.

WISE
The Wide-field Infrared Survey Explorer (WISE) is a NASA infrared-wavelength astronomical space telescope mission that mapped the entire sky in four infrared bands W1, W2, W3, and W4 centered at 3.4, 4.6, 12, and 22 µm using a 40 cm telescope feeding arrays with a total of 4 million pixels; these wavelengths correspond to angular resolutions of 6.1, 6.4, 6.5 and 12 .For a full description of the mission see Wright et al. (2010).
The WISE ALL-sky Release Source Catalog contains astrometry and photometry for over half billion objects.We have searched this data repository to identify and extract the midinfrared fluxes towards 988 GLOSTAR sources; where two or more WISE sources are found within the search radius, we have picked the infrared counterpart to be the source with the smallest angular offset to the peak radio emission.
The WISE catalog is optimised for compact sources and so may overlook more extended mid-infrared emission associated with more evolved H ii regions.We have therefore also crossmatched the GLOSTAR sample with a catalog of H ii region candidates identified from a combination of visual and automatic search of WISE data based on their MIR emission morphology (Anderson et al. 2014).This catalog contains ∼8000 H ii region candidates, of which 667 are located in the GLOSTAR field.We find 130 of these mid-infrared sources to be coincident with a radio source, including 26 not found in the search made of the WISE catalog.Although we have only found a radio counterpart for ∼20% of the H ii region candidates Anderson et al. (2014) estimates that they are sensitive to H ii regions with radio fluxes greater than 0.2 mJy, which is lower than our 4σ sensitiv-ity, which can be as poor as 1.6 mJy towards the Galactic midplane (i.e., see Fig. 3) where most of the H ii regions are likely to be located.Also, objects being resolved out (> 4 ) could only explain a small proportion (10%) of unmatched WISE H ii regions and so the rest are weak H ii regions.In total, we have matched 988 radio sources with a WISE counterpart.

Correlation with other surveys
We have searched for counterparts in the THOR, CORNISH and MAGPIS radio continuum surveys and the ATLASGAL dust and WISE mid-infrared surveys.The correlation process with other catalogs allow us to find any large scale structures and all compact sources that have a counterpart.This analysis has identified counterparts for 1315 radio sources and these are therefore considered to be reliable detections.
The remaining 465 radio sources are a mix of compact, moderately extended sources, imaging artefacts and spurious detections.The extended sources are also likely to be associated with extended H ii regions or supernova remnant.This can be confirmed by comparing the morphology of the radio emission with that at mid-infrared wavelengths.This can also identify weak mid-infrared counterparts to the compact radio sources that were not picked up in the mid-infrared catalogs.The next step is, therefore, to compare the radio and mid-infrared emission maps for these 465 sources; sources for which mid-infrared counterparts can be identified can be considered real.This visual inspection is also useful in identifying and removing imaging artifacts.
We have compared the radio emission with mid-infrared maps extracted from the GLIMPSE 4.5 and 8.0 µm bands, and MIPSGAL 24 µm band archives.We have discarded any compact radio sources that appear to be localised peaks in high-noise regions, sources that are smaller than the beam (these tended to be clustered towards the detection threshold) and extended emission that appears to be associated with large-scale structures that, due to poor uv-coverage, have not been reliably imaged (these have a wider range of fluxes).This process has resulted in 237 sources being excluded from the 465 without a counterpart in one of the other surveys.
Combined with the split sources and recovery (by hand) of compact sources in noisy regions, previously discussed, the final radio catalog therefore consists of 1575 sources.In Table 3 we present a summary of the various steps taken in the source verification process and in Table 4 the summary of the search for counterparts with continuum, dust and mid-infrared surveys.

Comparison of fluxes with other 5 GHz surveys
As mention earlier, we have identified matched counterparts of GLOSTAR sources in the CORNISH and MAGPIS catalogs.Although these surveys were conducted at a higher resolution than the part of the GLOSTAR survey reported here, they were conducted at a similar frequency and can hence provide a useful consistency check on the measured fluxes.Due to the different resolutions we have compared the GLOSTAR peak fluxes with the integrated fluxes from CORNISH and MAGPIS, except when the sources in those surveys are larger than the GLOSTAR beam in which case we compare the integrated fluxes from both surveys; CORNISH filters out any structures larger than ∼15 and so this is only a consideration for some of the MAGPIS sources.
In Fig. 11 we show the results of this flux comparison.It is clear that the fluxes from all three surveys are in very good agreement, however, it is noticeable that there is bias towards higher  (1994) source size is less than 20 and the integrated flux from both surveys if the source is larger than 20 .The gray line is the line of equality.Lower panel: Flux ratio distribution (as described above), which has a mean of 0.9 and standard deviation of 0.34.
fluxes measured by GLOSTAR.This is to be expected given the lower resolution resulting in more flux being present in the beam and a result of some of the flux being filtered out by the highresolution surveys due to their poorer sensitivity to larger angular scales.We also note that a handful of CORNISH sources have a significantly higher flux than measured in the GLOSTAR data; these appear as a small cluster of sources above the line of equality towards the lower left corner of the plot and have been previously discussed in Sect.4.1.1.

Catalog properties
The final catalog consists of 1575 distinct sources with flux densities above 4σ, where σ corresponds to the local noise.
We give the names of the sources and the observed parameters in Table 7.The source names are constructed from the peak flux density positions.The position given in Cols. 2 and 3 are the barycentric coordinates determined from the first order moments of the longitude and latitude profiles.The barycentric position generally defines the centroid position of a source.However, if the emission profile of a source is extended and consists of multiple peaks, then this can be offset from the peak position.The mean uncertainties on the peak flux density positions is 1. 22 with a standard deviation of 0. 81; the positional uncertainties associated with individual sources are provided in the electronic version of Table 7. Combined with the uncertainty in the map astrometry the mean uncertainty in the catalog positions is ∼ 2 .The signal to noise values are given in Col. 4; these are estimated by dividing the peak flux by the rms at the same position.The flux density values given in Cols.5-8 have been corrected for clean bias and bandwidth smearing (see Hales et al. (2012) for details) and in Col. 9 we give the Y-factor, which is the ratio of the integrated and peak flux densities (i.e., S int /S peak ); this gives an indication of how compact a source is with unresolved sources having a value of one and very extended sources having values of several 10s.The radius in Col. 10 is determined from the number of pixels associated with the source and is therefore a rather crude estimate of the size of a source particularly for extended structures and should be used with caution.In Col. 11 and 12 we give the spectral index for compact sources (i.e.Y-factor < 2) that have a 20 cm counterpart detected by the THOR survey while in Col. 13 and 14 are the values calculated from the 8 sub images discussed in Sect.3.1; this will be described in Sect.5.2.In Cols.15 and 16 we give the classification assigned to the source and a reference when available.

Fluxes and angular sizes
In Fig. 12 we show the peak and integrated flux densities distribution of the detected sources.Together with the peak flux density distribution we also show the integrated flux density distribution of all GLOSTAR sources that are matched to sources previously identified by CORNISH and/or MAGPIS (red histogram overlaid on the upper panel); this shows clearly that the vast majority of 5 GHz sources detected in the GLOSTAR map with a peak flux density above a few mJy beam −1 have been detected before.The sources not picked up by the previous surveys are all extended and would have been filtered out by CORNISH or in regions not covered by MAGPIS.What is also clear from this plot is that the improved sensitivity of GLOSTAR has resulted in the detections of many previously unknown integrated low-flux density sources.
In Fig. 13 we show the distribution of Y-factor for the sample; this parameter is the ratio between integrated and peak flux density (i.e., S int /S peak ) and gives an indicate of the source size.While for unresolved sources the Y-factor should be equal to 1, allowing for measurement uncertainties we classify all sources with a Y-factor < 1.2 as unresolved (cf.Bihr et al. 2016).We make a further distinction between compact (which includes all of the unresolved sources) and extended sources using a Y-factor = 2 as dividing line.We note that a significant number of sources have Y-factors smaller than 1 and this has been discussed in detail by Bihr et al. (2016) who identified three possible explanations for this unexpected result: 1) BLOBCAT does not fit enough pixels for weak sources, which leads to lower integrated flux densities; 2) unresolved sources are located in slightly negative 28.173 −0.606 4.9 1.13 0.24 14.55 0.76 5.5 32 28.178 0.072 4.3 1.49 0.35 9.56 0.59 2.5 21  Notes: Only a small portion of the data is provided here, the full table is available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.125.5)or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A&A/.side lobes from nearby strong sources; 3) weak sources are not cleaned properly.Bihr et al. suggest using the peak flux densities for unresolved sources in any analysis and we have also adopted this recommendation.
BLOBCAT gives source size in terms of the number of pixels (parameter npix) that are flood filled in the source extraction process.We convert this into a radial size using: where R source is the effective radius, A is the area covered by the source calculated by npix times the pixel size squared, which in this case is 2. 5 × 2. 5. Fig. 14 shows the distribution of the angular radius of the sources.

Spectral index
Astrophysical sources mainly emit two types of radio emission, thermal and non-thermal.The first refers to bremsstrahlung radi-ation and the latter is also known as synchrotron radiation (see, e.g., Wilson et al. 2012).Thermal bremsstrahlung radiation is produced in H ii regions and planetary nebulae and usually has spectral indices between −0.1 and 2 depending on whether the emission region is optically thin or thick, respectively.Active Galactic Nuclei (AGN) galaxies and supernova remnants give rise to synchrotron radiation, which has negative spectral indices, typically between −1 and −0.5 (Condon 1984;Dzib et al. 2013;Rodríguez et al. 2012 and references therein).The spectral index can, therefore, help to broadly distinguish between different types of radio sources.
As mentioned in Sect.2.2, the calibration process produced 8 sub images in the frequency range between 4 and 8 GHz which we used to estimate the spectral index only of the compact sources (i.e.Y-factor < 2).We used the peak flux density from each sub image, derived with the CASA task imfit, and a χ 2 linear fit to calculated the spectral index value.
We also estimate the spectral index of the compact sources by combining their peak flux densities with the peak lowerfrequency flux densities determined by THOR.As previously mentioned, we have matched 711 sources with a THOR source, of which 640 have Y-factors smaller than 2 and so can be assumed to be relatively compact.The mid-frequency for THOR and GLOSTAR are 1.5 GHz and 5.8 GHz, respectively.Following Kalcheva et al. (2018) we estimate the spectral index using the following: α = ln (S GLOSTAR /S THOR ) ln(5.8/1.5), where S THOR and S GLOSTAR are the peak flux densities at 6 cm and 20 cm, respectively.The uncertainty is calculated using: ∆α = (σ THOR /S THOR ) 2 + (σ GLOSTAR /S GLOSTAR ) 2 ln (5.8/1.5) where σ GLOSTAR and σ THOR are the uncertainties in the peak flux densities.
For a comparison of both methods discussed above, we selected sources with reliable GLOSTAR internal spectral indices, σ α < 0.2, and compare them with their GLOSTAR-THOR spectral indices (see Fig. 15).A linear fit to these values yields that the slope is m = 0.94 ± 0.03, close to the slope of the equality line.In conclusion, the spectral index values derived with the two methods are statistically similar.
The spectral index derived from GLOSTAR and THOR data combination and their associated uncertainty are given in Cols.11 and 12 of Table 7.In the upper panel of Fig. 16 we show the distribution of spectral indices; these cover a range of values between −1.5 and 1 and have a bimodal distribution with peaks at approximately −0.7 and 0.1.The dip between these two peaks occurs at approximately −0.2, which we consider the transition from thermal and non-thermal emission.We also show the spectral indices of radio sources that are associated with midinfrared emission (i.e., have a WISE counterpart) and associated with dust emission (i.e., have an ATLASGAL counterpart); these 2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 Glostar-Thor Spec.Index are shown in red and yellow hatched histograms, respectively.It is clear from a comparison of these distributions that the sources with mid-infrared counterparts span the whole range of spectral indices, while the sources associated with dust are almost exclusively thermal sources.So while sources associated with dust are likely to be a mix of H ii regions and PNe, the association of a particular source with mid-infrared emission does not provide any constraints on its nature.We will explore this issue of source types in the next section.In the lower panel of Fig. 16 we show the distribution of the uncertainties associated with the spectral index measurements; these have typical values of ±0.1 but can be as large as ±0.2.
We have calculated the spectral indices for 671 GLOSTAR sources without a THOR counterpart.The values and their associated uncertainties are given in Cols.13 and 14 of Table 7.We note that the limited frequency range or poor signal to noise results in large uncertainties for the majority (74% have ∆α > 0.2).The spectral indices range between −1 and 1 for the sources for which they are considered to be reliable.Given that the sample of GLOSTAR-internal spectral index is relatively modest we decided not to perform a detailed statistical analysis of the spectral index.

Literature search
A number of studies have investigated the nature of radio and mid-infrared sources.By correlating our catalog with these we can adopt their classifications.Cross-matching our radio sources with H ii region catalogs compiled by Purcell et al. (2013) and Kalcheva et al. (2018) using CORNISH data, Giveon et al. (2005b) using MAGPIS data and Ortega et al. (2010) we have identified a total of 46 previously known H ii regions.As described in Sect.4.1.5we also cross-matched our radio sources with a catalog of H ii region candidates identified by Anderson et al. (2014) from their analysis of WISE colors.This revealed a match for 130 radio sources including 90 not previously matched to a CORNISH or MAGPIS source.Of these, 43 have spectral indexes determined from the THOR and GLOSTAR data; 42 of these have spectral indexes between −0.17 to 0.88, which supports their classification as H ii regions (Anderson et al. 2014).Combined with the 46 sources identified from the literature brings the sample of reliable H ii regions in the GLOSTAR region to 136.
A further 28 sources have been identified as PNe by the CORNISH team (Purcell et al. 2013 andIrabor et al. 2018), and another 15 as radio stars (12 identified by Purcell et al. 2013 and 3 found in SIMBAD).We found, with a cross match radius of 10 , that 3 of these candidates have a counterpart in the HASH (Hong Kong/AAO/Strasbourg Hα) planetary nebula database (Parker et al. 2016).Our literature search has also identified another three interesting compact radio sources that are consistent in position with pulsars: G032.763+00.092 with PSR J1850-0006 (Keith et al. 2009), G028.880−00.938 with PSR B1844-04, and G028.193−00.785 with PSR B1842-04 (Hobbs et al. 2004).Radio continuum emission from energetic pulsars is usually associated to the integrated emission of their pulses, but other mechanisms such as a pulsar wind and magnetic activity have been recently discussed by Dzib et al. (2018).For the GLOSTAR data discussed here, the angular resolution is insufficient to decide whether or not the radio emission comes from the pulsar, from the surrounding medium, or from an unassociated object.This will be better constrained with the GLOSTAR VLA B-configuration data when it becomes available.
The importance of pulsars with radio continuum emission is that they may targeted by Very Long Baseline Interferometry (VLBI) observations to determine their astrometric parameters, such as proper motions and trigonometric parallaxes (Brisken et al. 2000).This information may be used to associate them with supernova remnant (SNR) or to better constrain their luminosity, and Galactic electron distribution models (Brisken 2005;Deller 2009).
We also identified 2 supernova remnants from a crossmatching with a radius of 10 between the GLOSTAR sources and the THOR ∼20 supernova remnants candidates identified by Anderson et al. (2017) located in the GLOSTAR field; these are G028.642+00.187 and G028.218−00.080.The low number of matches may be due the negative spectral index of the supernova remnants, which are better recover at lower frequencies, or may be due to the fact that SNRs tend to be extended and so many are likely to be either filtered out by the interferometer entirely or only partially detected and classified as extended emission.To investigate this further we also searched for SNR using the Green (2014) catalog.This identifies 12 SNR within the GLOSTAR region, 10 of which are matched with the largescale emission regions identified in Table 2 and mentioned in Sect.3.2.3.Of the remaining 2 SNRs in the field 1 is not detected at all (G032.1−00.9),and one is detected as a compact radio source (G029.7−00.3);see Fig. 17 for an image of this compact SNR.The one SNR not recovered in our map happens to be the largest identified by Green (2014) in the GLOSTAR field with a size of 40 × 40 and so its surface brightness may have fallen below the GLOSTAR detection threshold.

Analysis of mid-infrared images
The correlation of compact radio emission with dust and mid-infrared emission is an indicator of the presence of H ii regions and there have been a number of studies that have used this correlation to identify samples of H ii regions (e.g., Urquhart et al. 2013;Urquhart et al. 2009;Urquhart et al. 2007;Thompson et al. 2006).In addition to identifying and removing spurious sources (as discussed in Sect.4.2) we have, therefore, also used the mid-infrared images to distinguish between the nature of different kinds of radio emission.In the following two subsections we will give a brief description of the rationale used to define these different kinds of sources.To do this, we split the remaining unclassified radio source into extended sources (Y-factor > 2) and compact sources (Y-factors < 2) because the size and structure of the radio emission itself can provide some useful clues to their nature.
We indicate the emission type attributed to each source and a reference if the identification has come from the literature in the last column of Table 7.In Table 8 we provide a summary of the number of each type identified.There are 225 radio sources in the catalog that have Y-factors greater than 2 that have not been previously classified (see Sect. 6.1 for details).We can assume that any extended radio source is also likely to have thermal bremsstrahlung emission associated with evolved H ii regions.In some cases, although the extended radio emission is positionally coincident with the midinfrared emission, the morphological correlation is poor, and in these cases, we classify the emission more loosely as being associated with larger scale extended structures, the nature of which is unclear.In other cases, the radio emission appears to be associated with a section of an ionized rim of an evolved H ii region and these are classified as ionization fronts.Although in these cases the radio emission is likely to be associated with an H ii region, we are only capturing part of the emission and want to make the distinction between compact H ii regions where we can be confident we are capturing all of the emission and more extended sources where we are only capturing a less well determined fraction of the total integrated flux density.We have classified 57 radio sources as being associated with large-scale emission and a further 31 sources have been associated with ionization fronts.We have classified 71 of the extended radio sources as H ii regions.In Fig. 18 we present three-color composite images of two examples of each of these three morphological types of structures.We classify another two sources as potential PNe due to the compact nature of the mid-infrared emission and relative isolation and lack of any nebulosity typically observed towards star forming regions.The nature of the remaining 68 extended radio sources that are not classified as either extended emission, ionization front or H ii region is still unclear.

Classification of compact radio sources (Y-factor < 2)
There are 1284 compact radio source in the catalog, of which 133 have been previously classified.In the previous section, we have seen that the spectral index is useful for separating the compact radio sources into thermal and non-thermal bremsstrahlung emission.However, PNe and some extragalactic radio sources, known as High Frequency Peakers (Dallacasa et al. 2000), can have positive spectral indices and so this alone does not definitively identify H ii regions.From an inspection of the midinfrared and submillimetre dust emission towards all of the remaining compact sources (1151) we have identified another 20 H ii region candidates.The mid-infrared emission towards these sources generally reveals a bubble-like or complex morphology and is associated with compact dust emission.
We have identified a further 16 sources as PNe candidates due to the correlation between the compact radio and midinfrared emission and the lack of any evidence of dust emission from the ATLASGAL maps.This will be further explored via the information provided by the GLOSTAR RRL data.We have also classified 6 more sources as being associated with an ionization front and 4 others as being extended emission; although in these cases the radio emission is compact, it is coincident with a strong and/or extended 8 µm emission.The nature of the remaining 1105 not classified is uncertain.The mean value of the spectral index, for those 523 with this parameter, is ∼ −0.5 with a standard deviation of 0.3.Suggesting that the vast majority of these unclassified sources are likely to be extragalactic background objects.
To calculate the number of extragalactic background objects expected in the surveyed area (57600 arcmin 2 ), we have followed Fomalont et al. (1991).These authors observed an area free of stars, and showed that the number of background objects at 5 GHz above a flux S , is described by . Even though our observed frequency is slightly different, this study is the nearest in frequency of this kind to ours.Thus we may use their results for an approximation.Using S = 450 µJy, i. e., three times the mean noise level, the number of expected background sources in our map is 990 ± 640.These numbers also suggest that most of the 1151 unclassified sources are extragalactic radio sources.

Mid-infrared colors of radio sources
We have identified WISE counterparts for 988 radio sources from the WISE all sky point source catalog and will use these to investigate their mid-infrared color.Of these only 470 have reliable fluxes in all four bands.This increases to 475 when considering the three longest wavelength bands.We will use these to examine the colors of previously identified H ii regions in an effort to confirm the classifications made and to look for possible trends that can be used to define a set of criteria to identify new H ii region candidates.In Fig. 19 we present color-color and color-magnitude diagrams showing the distribution of all H ii regions and PNe identified in the literature (filled blue and red circles, respectively) and from our visual inspection of the midinfrared images (open blue and red circles, respectively).We also show the distributions of the radio stars (filled magenta circles).
Inspection of these plots shows that the locations in colorcolor and color-magnitude diagrams of H ii regions and PNe identified in our work agree very well with those published in the literature, providing confidence that our classifications are robust.The unclassified radio sources (shown as grey filled circles) and radio stars almost exclusively occupy the same region of the parameter space as field stars and background objects (cf. figures B2 and B3 in Csengeri et al. 2014) while the embedded and dust enshrouded objects (H ii regions and PNe) have redder colors and are clearly separated from the field stars.This is particularly clear in the color-magnitude plot shown in the lower panel of Fig. 19 where all of the H ii regions and the vast majority of the PNe are located in an area of parameter space defined by: [12] Mag.< 1.74 × ([12] − [22]) + 2. (1) This is also indicated on the lower panel of Fig. 19 with a cyan line.Only two PNe (G029.493+00.931and G033.120−00.894) and 2 H ii regions (G030.246-00.91 and G033.704−00.414) are found to the left of this color criterion.One of the PNe was identified by the CORNISH team, while the other PN and both H ii regions have been classified by us.We note the latter of the two H ii regions is very evolved with a number of WISE sources being located within its boundary.For the other three sources it is possible that these are the result of chance alignments between radio and mid-infrared sources, but it is also possible that the colors of these objects cover a larger range or that the classifications themselves are wrong.The sample is too small to test any of these possibilities statistically but we note that the one of the PNe identified by the CORNISH team that falls on the wrong side of this criterion (G033.120−00.894) is positionally coincident with the WISE source J185457.96-001424.3.The positional offset is ∼3 and the photometric flux qualities are classified as AABB and so the correlation and data are reliable. 6

New compact H ii regions
Our analysis has identified a total of 231 H ii regions, of which 46 were previously known (Kalcheva et al. 2018;Ortega et al. 2010;Giveon et al. 2005b) and 90 were proposed as candidates from analysis of the WISE data by Anderson et al. (2014).We have therefore identified 96 new H ii region and confirmed the nature of the those proposed by Anderson et al. (2014).The majority of the new H ii regions (84) have Y-factors > 1.2 and are therefore resolved in GLOSTAR and are unlikely to have been picked up in earlier high-resolution surveys where the extended emission is filtered out (CORNISH and MAGPIS).The remaining 12 H ii region candidates are unresolved in GLOSTAR and so could have been picked up in earlier surveys.However, their peak fluxes are lower than the sensitivity limit of CORNISH (7σ ∼ 2.7 mJy beam −1 ) except in the case of two sources.We checked the CORNISH maps of these two sources but did not find any emission present coincident with the GLOSTAR position (the noise values are 0.34 mJy and 0.32 mJy for G030.371+00.483 and G030.678+00.767,respectively).Both of these sources have mid-infrared counterparts and are hence real.These sources are therefore either both extended and completely filtered out in the CORNISH data or are variable radio sources.

6
The quality of the profile-fit photometry measurement is indicated by a four letter code (one for each band [W1/W2/W3/W4]; see Sect.4.1.5).The letters 'A' and 'B' correspond to the sources detected in a particular band with a flux signal to noise ratio > 10 and between 3 and 10, respectively.Detections with a quality flag of 'A' and 'B' are considered to be reliable.
In Table 9 we give the peak flux densities and spectral indices of the new unresolved H ii regions; only three sources have a detection in THOR and consequently we have only a measurement for the spectral index for three of these H ii regions, all of which are consistent with what is expected for optically thin H ii regions (i.e.α = −0.1).This suggests that these are more evolved than the UC H ii region stage, which are usually characterised by optically thick emission (α ∼ 1-2; Kurtz & Hofner 2005).We note that no dust emission is detected towards two of these H ii regions (G030.241−00.592and G030.678+00.767),which is also consistent with them being more evolved having already dissipated their natal clump.We further note that the latter of these is also one of the bright sources we would have expected to have been detected by CORNISH and so the spectral index and absence of dense envelope are consistent with this source being quite evolved and perhaps explains its non-detection in CORNISH.
In Table 9 we also list the new H ii regions association with mid-infrared and submillimetre emission.As can be seen from this table, all of these sources are associated with mid-infrared emission and the majority are associated with dust emission.The mid-infrared emission looks to be extended in two cases indicating these are likely to be quite evolved H ii regions (> 12 in diameter). 7We have searched SIMBAD towards all of these sources.This revealed that G035.344+00.348had been previously identified as an H ii region (Urquhart et al. 2011;MSX6C G035.3449+00.3474).In the last column of Table 9 we identify the H ii regions that are likely to be good UC H ii region candidates; this is done by eliminating sources that are not associated with dust, have spectral indices indicting optically thin emission, or are extended in the mid-infrared, all of which point to sources that are more evolved than the UC H ii phase.

Properties and Galactic distribution of the H ii regions
From a combination of visual inspection of mid-infrared images and cross-matching with other studies we have identified a total of 231 H ii regions in our catalog.This is the most complete sample of H ii regions identified in this part of the sky to date incorporating H ii regions with sizes of a few arcseconds to a few arcminutes.In this section we will provide an analysis of their properties and distributions.

Observed and measured properties
In Fig. 20 we present plots of the peak and integrated flux density for the whole GLOSTAR catalog and of the H ii regions identified.By comparing these two samples we see that nearly all of the very bright radio sources are associated with H ii regions but that the fraction of the population decreases as the flux decreases and turns over between 0.5 and 2 mJy beam −1 and between 1-2 mJy for the peak and integrated flux densities respectively.This indicates that there are relatively fewer weak H ii regions in comparison to the total number of radio sources detected at low surface brightnesses.The fact that we find no H ii region with fluxes below the turnover in the peak flux density may indicate a minimum flux density threshold for H ii regions.However,  this corresponds to a flux density of ∼ 0.4 mJy beam −1 , which is close to the 3-4σ noise found towards the Galactic mid-plane where the vast majority of H ii regions are to be found.Radio sources detected below this threshold are primarily extragalactic background sources located away from the mid-plane where the noise level drops off significantly (see lower panel of Fig. 3).
In Fig. 21 we compare the source sizes and spectral indices for the whole GLOSTAR sample and those identified as H ii regions.It is clear from the upper panel in this figure that the vast majority of the radio sources are unresolved with the numbers dropping off steeply for sizes larger than ∼15 .The distribution of the H ii region sizes appears to be roughly uniform over the whole range of sizes, which is consistent with other studies (e.g., Kurtz et al. 1999;Masqué et al. 2017), and is to be expected given that the sample is likely to cover a larger range of distances and ages.We also note that the H ii regions contribute a significant fraction of the radio sources with sizes above ∼30 with the others being classified as ionization fronts or extended emission sources.We find very few radio sources with sizes larger than ∼100 , but this is a consequence of the interferometric snapshot observations themselves as the poor uv-coverage puts limits on the angular size of structures that can be reliably imaged.
The spectral index distribution of the whole sample is bimodal and peaks at α = −0.6 and 0.1; the former is likely to be associated with synchrotron emission from the extragalactic background population while the latter is likely to be associated with thermal emission from Galactic sources such as H ii regions and PNe.It is therefore reassuring to see the distribution of H ii regions correlating with the thermal emission peak.This is comparable with the spectral index calculation of the THOR team (Wang et al. 2018), but our sources show a higher density of positive index.

Galactic distribution
The majority of the detected radio sources are likely to be extragalactic which is expected to be smoothly distributed across the whole sky.However, the H ii region population should show significant variations in Galactic longitude and latitude with peaks that correlate with the most active star forming complexes and the distribution of molecular material, both of which are concentrated in the Galactic mid-plane.In Fig. 22 we show the Galactic latitude distribution of all the radio sources and the H ii region subsample.It is clear from this plot that the H ii regions are indeed tightly concentrated around the Galactic mid-plane with a peak at b ∼ 0 • , which is consistent with previous studies of H ii regions (Urquhart et al. 2013;Kalcheva et al. 2018) and dense massive star forming clumps (Urquhart et al. 2014b).The distribution of the full sample is relatively flat for the inner part of the mid-plane but rises towards at the edges of the surveyed region (|b| ∼ 1 • ), which is probably related to the reduction in the local noise value as one move away from the Galactic mid-plane where much of the extended emission from extended H ii regions and supernova remnants are located (see Figs. 1 and 3).
In Fig. 23 we show the distribution of all GLOSTAR sources and the H ii regions as a function of the Galactic longitude.Before we discuss the longitude distribution we should bear in mind that since we have excluded emission associated with the 27 large scale structures, many of which are associated with active H ii regions, the peaks seen in the H ii region distribution are likely to be somewhat suppressed.All of these large scale structures are Galactic in origin and are unlikely to affect the distribution of the extragalactic population, which are rarely resolved.
The distribution of the whole catalog can be described as being broadly flat.As expected, the distribution of H ii regions reveals some significant peaks.We note the strong peak near ∼30 • which is related with emission from the massive star forming regions W43 main and W43 south.This region is located where the Scutum-Centaurus arm is interacting with the end of the Galactic long bar (Rodriguez-Fernandez & Combes 2008;Nguyen Luong et al. 2011), which is expected to lead to a rise in the star formation activity in this area.
The latitude and longitude distributions are consistent with the distribution of the high-reliability (σ ≥ 7) CORNISH catalog (Purcell et al. 2013) and the lower frequency THOR catalog (Bihr et al. 2016;Wang et al. 2018), which provides confidence that the catalog we have produced is reliable.

Summary and conclusions
The GLOSTAR survey is a VLA survey of the Galactic midplane 1st quadrant (−10 • < < 80 • and |b| < 1 • ).In this paper we present the radio continuum data for a 16 square degree part of the survey covering 8 degrees in longitude and 2 degrees in latitude and centered on = 32 • , b = 0 • (i.e., 28 • < < 36 • and |b| < 1 • ).We use the BLOBCAT source extraction algorithm to produce a catalog of radio sources and identify a number of different type of morphologies, from unresolved to compact and extended.
We have also identified 27 large scale structures that are over-resolved at the GLOSTAR resolution (∼18 ) and where the emission has been broken up into multiple radio sources.In these cases we have regrouped the various fragments and give the parameters for these objects separately.The individual fragments are removed from the final catalog, which consists of 1575 distinct radio sources.The initial catalog has been cross-matched with other radio and mid-infrared catalogs to verify that the extracted sources are real.We have visually examined mid-infrared and submillimetre images for radio sources without a counter- part to identify and remove artifacts.We discuss the observed properties of this new catalog of sources and compare them to previous studies.
We have used the multi-wavelength analysis and searched the literature to identify the nature of the radio sources.This has resulted in the identification of 231 H ii regions.Of these, only 46 had been previously identified with high resolution radio surveys, and a further 90 H ii region candidates identified in the midinfrared by Anderson et al. (2014) have now been confirmed.We have identified a further 96 H ii regions not previously known.The majority of the new H ii regions are well-resolved (71 with Y-factor > 2) and have been missed by previous high-resolution interferometric surveys due to their lack of short baseline data and consequent poor sensitivity to large angular scales.Twelve of the new H ii regions identified are considered to be unresolved (i.e., Y-factor < 1.2), of these we identify 8 UC H ii region candidates.
We have investigated the properties of the full sample of H ii regions and compared them to the previously identified samples.The improved flux sensitivity (∼100-400 µJy beam −1 ) and sensitivity to larger angular scales (∼20-200 ) of GLOSTAR compare with other surveys, has increased the number of H ii regions in this part of the Galaxy by a factor of 4.
In addition to the H ii regions we have also identified 15 radio stars, 46 PNe, 3 pulsars, 3 compact SNR-candidates and 99 radio sources associated with more extended structures such as the ionization fronts of much more extended H ii regions.We are unable to classify 1186 radio sources; these can be broadly categorized as being weak and unresolved, the majority of which are likely to be extragalactic background sources.
Here we present a catalog of 1575 radio sources and have investigated their nature and physical properties.The data presented here is only a small fraction of the total GLOSTAR coverage, which also includes spectral line and higher-resolution data obtained with the VLA in B-array.This will be complemented with observations that are being performed with the 100-m Effelsberg telescope, which will provide sensitivity to scales larger than seen in the D-configuration observations.

Fig. 2 .
Fig. 2. Noise distribution of the map presented in Fig. 1.We have restricted the range of pixel values to between −2 and 2 mJy in order to estimate the noise in the map and determine its sensitivity.The red line shows the results of a Gaussian fit to the distribution, which gives the standard deviation of the noise as 150 µJy.The bin size is 50 µJy.

Fig. 3 .
Fig. 3. Noise distribution of the map presented in Fig. 1 as a function of Galactic latitude.This has been produced in the same way as Fig. 2 but for each increment in latitude.

Fig. 4 .
Fig. 4. Position offsets between GLOSTAR compact sources (Y-factor < 1.2) with their CORNISH counterparts.The blue cross indicates the mean value of the offsets and the dashed lines indicate where the longitude and latitude offsets are equal to zero.

Fig. 8 .
Fig. 8. Peak flux distribution of CORNISH sources located in the GLOSTAR field.The grey and red hatched histograms show the distribution of all CORNISH sources and those without a GLOSTAR counterpart, respectively.The bin width is 0.15 dex.

Fig. 9 .
Fig. 9. Example of a CORNISH source not detected in GLOSTAR.The greyscale is the GLOSTAR map of a 2 × 2 region centered on the position of the CORNISH source G031.4235−00.9138(as indicated by the yellow cross).The noise towards this source is 63 µJy beam −1 and given that the integrated CORNISH flux is 3.47 mJy it should have been easily detected in the GLOSTAR map.The red hatched circle shown in the lower left corner indicated the size of the GLOSTAR beam.

Fig. 10 .
Fig. 10.Peak flux distribution of THOR sources located in the GLOSTAR field.The grey and red-hatched histograms show the distribution of all THOR sources and those without a GLOSTAR counterpart, respectively.The bin width is 0.25 dex.

Fig. 11 .
Fig. 11.Upper panel: Comparison of the GLOSTAR flux measurements with the Becker et al. (1994) survey (MAGPIS) and CORNISH; data from these are represented as red (filled) and blue (open) circles, respectively.Due to the difference in resolution between the surveys we compare the integrated CORNISH flux densities with the peak GLOSTAR flux densities, while for the Becker et al. (1994) survey we compare the integrated flux densities with the peak GLOSTAR flux densities, when theBecker et al. (1994) source size is less than 20 and the integrated flux from both surveys if the source is larger than 20 .The gray line is the line of equality.Lower panel: Flux ratio distribution (as described above), which has a mean of 0.9 and standard deviation of 0.34.

Fig. 12 .
Fig. 12. Peak and integrated flux distribution of GLOSTAR sources are shown in the upper and lower panels, respectively.The thick red histogram on the upper panel shows the flux distribution of sources previously identified by CORNISH and/or MAGPIS.The bin width is 0.15 dex.

Fig. 13 .
Fig. 13.Distribution of the Y-factor (S int /S peak ).The red dash-dot (Yfactor = 1.2) and cyan dash-dotted lines (Y-factor = 2) indicate the criterion used to distinguish between unresolved, compact and extended sources, respectively.The bin size used is 0.1 dex.

Fig. 15 .
Fig. 15.Comparison of spectral index values between only GLOSTAR data with uncertainties less than 0.2 and GLOSTAR combined with THOR data.The m and b are the slope and intersection of the linear fit, respectively, and is represented with the blue line.The red line is the line of equality.

Fig. 16 .
Fig. 16.In the upper panel we show the spectral index distribution of all compact GLOSTAR sources with Y-factor < 2 and a counterpart detected at 1.5 GHz in the THOR survey.The red and yellow hatched histograms show the distribution of compact radio sources that are associated with mid-infrared sources and dust emission, respectively.In the lower panel we show the distribution of the associated uncertainties to the spectral index measurements.The bin size used in the upper and lower panels are 0.1 and 0.02, respectively.

Fig. 17 .
Fig. 17.The compact SNR G029.689−00.242identified from the Green (2014) catalog (G029.7−00.3 in their catalog).The background image is a false color composite image produced by combining the GLIMPSE 5.8 and 8.0 µm bands and the MIPSGAL 24 µm band; these are shown in blue, green and red, respectively.The yellow contours show the distribution of the 5.8 GHz radio emission associated with the GLOSTAR source.

Fig. 18 .
Fig. 18.In the upper, middle and lower panels we show examples of sources classified as extended radio sources, ionization fronts and evolved H ii regions, respectively.In all cases the background image is a false color composite image produced by combining the GLIMPSE 5.8 and 8.0 µm bands and the MIPSGAL 24 µm band; these are shown in blue, green, and red, respectively.The yellow contours show the distribution of the GLOSTAR 5.8 GHz radio emission.

Fig. 19 .
Fig. 19.Mid-infrared colors of compact GLOSTAR sources associated with a WISE counterpart.Sources classified as H ii regions from the literature are shown as filled blue circles while the H ii classified in this study are shown as open blue circles.Similarly, PNe identified by Purcell et al. (2013) are shown as red filled circles while PNe identified in this paper are shown as open circles.Radio stars identified by Purcell et al. (2013) are indicated by filled magenta circles.The smaller grey circles show the distribution of compact sources with a WISE counterpart we have been unable to classify.

Fig. 20 .
Fig. 20.Distribution of H ii region properties.In the upper and lower panels we show the peak and integrated flux density for the whole GLOSTAR catalog (grey) and those classified as H ii regions (red).The bin sizes in both plots is 0.25 dex.

Fig. 21 .
Fig. 21.Distribution of H ii region properties.In the upper and lower panels we show the angular radius and spectral index for the whole GLOSTAR catalog (grey) and those classified as H ii regions (red).The dashed-dotted vertical line drawn on the upper panel shows the radius of the beam.The bin sizes used in the upper and lower panels are 0.1 dex and 0.1, respectively.

Fig. 22 .
Fig. 22. Distribution of GLOSTAR radio sources as a function of Galactic latitude.The whole catalog is shown as the grey histogram while the sources identified as H ii regions are shown as the hatched red histogram.The bin size used for both distributions is 0.1 • .

Fig. 23 .
Fig. 23.Distribution of GLOSTAR radio sources as a function of Galactic longitude.The whole catalog is shown as the grey histogram while the sources identified as H ii regions are shown as the hatched red histogram.The bin size used for both distributions is 0.1 • .
Table 2 for more detailed and positions, respectively).

Table 2 .
Green (2014)structures.The source name is constructed from the central position of the box used to encapsulate the regions.The sources with * are associate with supernova remnants from the catalog ofGreen (2014).

Table 3 .
Summary of detection categories.

Table 5 .
Catalog names and fluxes for CORNISH sources not detected in GLOSTAR.The last column gives the measured rms towards the position of the CORNISH source in the GLOSTAR map.The flux errors are of the order of the CORNISH noise level, 0.33 mJy beam −1 .

Table 6 .
Variable source candidates where the integrated CORNISH flux is more than 50% higher than the peak GLOSTAR flux.

Table 7 .
GLOSTAR source catalog.The source names are appended with a ‡ to indicate if a source has been split (as described in Sect.3.2.1)and a † if the source has been recovered (as described in Sect.3.2.2).

Table 8 .
Summary of emission types.The number in parenthesis in the SNR column indicates the number of SNR identified as large-scale structures in Sect.3.2.3.

Table 9 .
Catalog of new compact H ii region candidates.Sources that are bright enough to have been detected by CORNISH are indicated by appending a † to the source name.The fluxes and spectral indices are drawn from the GLOSTAR catalog (i.e., Table7).In the last two column we indicate the presence of compact (), extended () or absence () of any corresponding ATLASGAL or WISE emission.In the last column we indicate which are still UC H ii region candidates after analysis (see text for details).