ALMA-IMF IV – A comparative study of the main hot cores in W43-MM1: detection, temperature and molecular composition

Context. Hot cores are signposts of the protostellar activity of dense cores in star-forming regions. W43-MM1 is a young region, very rich in terms of high-mass star formation, highlighted by the presence of a large number of high-mass cores and outﬂows. Aims. We aim to systematically identify the massive cores which contain a hot core and compare their molecular composition. Methods. We use ALMA high-spatial resolution ( ∼ 2500 au) data of W43-MM1 to identify line-rich protostellar cores and make a comparative study of their temperature and molecular composition. The identiﬁcation of hot cores is based on both the spatial distribution of the complex organic molecules and the contribution of molecular lines relative to the continuum intensity. We rely on the analysis of CH 3 CN and CH 3 CCH to estimate the temperatures of the selected cores. Finally, we rescale the spectra of the di ﬀ erent hot cores based on their CH 3 OCHO line intensities to directly compare the detections and line intensities of the other species. Results. W43-MM1 turns out to be a region rich in massive hot cores. It contains at least 1 less massive (core #11, 2 M (cid:12) ) and 7 massive (16 to 100 M (cid:12) ) hot cores. The excitation temperature of CH 3 CN, whose emission is centred on the cores, is of the same order for all of them (120–160 K). There is a factor of up to 30 di ﬀ erence in the intensity of the complex organic molecules (COMs) lines. However the molecular emission of the hot cores appears to be the same within a factor 2–3. This points towards both a similar chemical composition and excitation of most of the COMs over these massive cores, which span about an order of magnitude in core mass. In contrast, CH 3 CCH emission is found to preferentially trace more the envelope, with a temperature ranging from 50 K to 90 K. Lines in core #11 are less optically thick, which makes them proportionally more intense compared to the continuum than lines observed in the more massive hot cores. Core #1, the most massive hot core of W43-MM1, shows a richer line spectrum than the other cores in our sample, in particular in N-bearing molecules and ethylene glycol lines. In core #2, the emission of O-bearing molecules, like OCS, CH 3 OCHO and CH 3 OH, does not peak at the dust continuum core center; the blue and red shifted emission correspond to the outﬂow lobes, suggesting a formation via the sublimation of the ice mantles through shocks or UV irradiation on the walls of the cavity. These data establish a benchmark for the study of other massive star formation regions and hot cores.


Introduction
W43-MM1 is a massive star-formation region 5.5 kpc away (Zhang et al. 2014) and located at the tip of the Galactic bar (Nguyen Luong et al. 2011).Among the 131 cores with typical sizes of 2000 au identified by Motte et al. (2018;hereafter M18) using Atacama Large Millimeter/sub-millimeter Array (ALMA) data, 18 have masses >10 M .The large number of massive cores detected in W43-MM1 make it an ideal laboratory for exploring the physical processes and chemical evolution involved in the formation of massive stars.
The identification of massive cores in W43-MM1 and the characterisation of their environment results from many years of sustained effort involving observations and state-of-the-art simulations and models.Motte et al. (2003) found such a high star-formation rate and efficiency in W43 that they deemed this region a 'mini-starburst', reminiscent of the galaxies referred to as such.Herschel and ground-based observations revealed a complex structure of molecular filaments hosting dense cores exposed to the radiation from neighbouring massive stars (Bally et al. 2010;Cortes et al. 2010;Cortes 2011;Nguyen Luong et al. 2011;Nguyen-Luong et al. 2013, 2017;Carlhoff et al. 2013).Herpin et al. (2009Herpin et al. ( , 2012) )  Notes.The wide Band 6 spw7 spectral window is also referred to as the 'continuum' band.
a cloud-cloud collision, and revealed the presence of numerous bipolar outflows.In the meantime, Sridharan et al. (2014) confirmed the presence of dense cores with the Submillimeter Array (SMA) and revealed local variations of the magnetic field.Cortes et al. (2016) studied the magnetic field structure at ∼0.5 resolution.
ALMA observations revolutionised our understanding of this region: M18 identified and characterised 131 pre-and protostellar cores in the region at ∼0.5 resolution, measuring their mass, temperature, size, and density, and obtaining an unexpected core mass function (CMF) with an excess of massive cores.Nony et al. (2018) studied the physical structure of the remarkably massive (∼55 M in 1300 au radius) pre-stellar core candidate (core #6) found in the region, while its chemistry was studied by Molet et al. (2019).Nony et al. (2020) investigated the ejection-accretion link by studying the characteristics of 46 molecular outflow lobes identified with ALMA and found evidence for time-variable ejection processes with a timescale of ∼ 500 yr.
The large program 'ALMA-IMF: ALMA transforms our view of the origin of stellar masses' (project # 2017.1.01355.L; Ginsburg et al. 2022;Motte et al. 2022;Pouteau et al. 2022) extends the work by M18 that found the first 'top-heavy' CMF in the W43-MM1 protocluster.ALMA-IMF consists of the observation of 15 massive protoclusters to investigate the distribution of the 0.5-200 M cores at a ∼2000 au scale and thus characterise the CMF evolution.Another aim of the program is to determine the pre-stellar, protostellar, or UCHII region nature of the cores.
In this paper we focus on the identification and characterisation of the hot cores in W43-MM1.By analogy with the Orion hot core (e.g.Morris et al. 1980), a hot core is usually defined as a hot (T ≥ 100 K), dense (density ≥10 6 cm −3 ), and compact (diameter <0.1 pc) region where a large number of molecular lines from complex organic molecules (COMs) are detected (e.g.Cesaroni et al. 1994;Herbst & van Dishoeck 2009;Charnley 2011).The study of the chemistry of star-forming regions can provide us with precious information on the physical evolution of protostars (e.g.Jørgensen et al. 2020).
The present article is organised as follows.We present our data in Sect. 2. In Sect.3, we identify eight hot cores with two methods: one using the spatial distribution of molecules and the other one using the line densities compared to the continuum level in the continuum cores, and we confirm the nature of these cores using methyl formate and methyl cyanide maps.In Sect.4, we determine the temperature of the hot cores from the CH 3 CN and CH 3 CCH emission.In Sect.5, we compare the molecular composition of the hot cores from their spectra normalised to the intensities of the methyl formate lines.We discuss the molecular similarity of the hot cores using scaled spectra and correlation plots in Sect.6.Our conclusions are presented in Sect.7.

Observations
We use band 3 and band 6 ALMA observations of W43-MM1 carried out between 2014 and 2018.The 1.3 mm observations (216 to 234 GHz) are from ALMA Cycle 2 (project #2013.1.01365.S) and Cycle 3 (#2015.1.01273.S) and were previously presented in Molet et al. (2019), together with our continuum-subtraction method.The 1.3 mm dataset is composed of nine bands of bandwidths between 0.1 and 1.9 GHz, with a spatial resolution of ∼0.45 , a spectral resolution ranging from 0.2 to 1.3 km s −1 and an rms between 0.1 and 0.5 K (see Table 1).The observations are 2.1 pc × 1.4 pc mosaics taken with the ALMA 12 m and ACA 7 m arrays.The gridding was performed with Briggs' weighting using a robustness parameter of 0.5, and the cleaning used the multiscale option excluding the borders of the mosaic to avoid divergence problems.Hence, between 120 and 129 of the 131 cores of M18 are in the cleaned field, depending on the band.
The 3 mm observations (91.7-105.4GHz) are from ALMA Cycle 5 and are part of the large program ALMA-IMF.This program covered the W43-MM1 region at 3 mm at a comparable resolution to the previous 1 mm data.In this paper, we present a preliminary data reduction and analysis of these 3 mm data.We use the same cleaning and continuum subtraction method (based on the distribution of channel intensities) as presented in A140, page 2 of 18 Molet et al. (2019), using CASA1 .Cunningham et al. (in prep.)present the standardised data reduction methods applied by the ALMA-IMF consortium to homogenise the analysis of all the regions observed.On average, the observational parameters for the four selected bands have a spatial resolution of 0.46 (2500 au), a spectral resolution of 1.5 km s −1 (0.5 MHz), and an rms of 0.6 K per channel; the detailed parameters for each band are given in Table 1.These bands include several lines of CH 3 CN (spw1), CH 3 CCH (spw2), and CH 3 OH (spw3).
We also made 1.3 mm maps with a higher spatial resolution -applying a uniform weighting to visibilities in the griddingin order to study the distribution of the molecules in Sect.5.2.We reached a resolution that is 1.5 times better (0.3 or 1600 au) at the expense of a lower sensitivity.
The conversion from flux density to brightness temperature was made using the formula2 : where T is the brightness temperature, I the flux in mJy beam −1 , ν the frequency in GHz, and θ maj and θ min are the half-power beam widths along the major and minor axes, respectively.

Identification of hot cores
To identify hot cores, we used two different approaches, one that uses molecules known to trace hot cores (see Sect. 3.5) and another one that does not need any line identification and relies on the richness of the hot core spectra in COM lines.Two methods based on this latter approach are presented in Sect.3.3 and Sect.3.4.The first one is based on the sum of the brightness temperature of all lines over the Band 6 spw7 band; this sum is computed for each pixel of a 2500 au spatial resolution map, and requires no prior knowledge of the region.The selected band is particularly rich in COM lines.The second one is based on the sum over a band of the line contribution averaged over the area of a continuum core and compared to the continuum emission.We use the different bands from Table 1 (bandwidth between 0.1 and 2 GHz).We study first the brightness temperature of the line emission to estimate the 'contamination' of the continuum emission by line emission, and then simply the number of lines detected to get the 'density in lines', which is the fraction of the band showing detected lines.This method requires identification of the continuum cores as a first step.We note that, as indicated in Sect.2, up to 11 of the 131 cores from M18 are out of the bounds of our data cubes.Specifically, this is the case for cores #15, #39, #44, #59, and #67 located in W43-MM1 SW, all of which are associated with molecular outflows (Nony et al. 2020).Among these cores, M18 indicated that only core #15 has detectable molecular lines.None of these are included in the analysis that follows.

Continuum level
We separated the continuum and the molecular emission in each pixel of the image using the method presented in Molet et al. (2019).The continuum level is estimated from the spectrum intensity channel distribution, after fitting it with an exponentially modified Gaussian to adjust both the Gaussian distribution The conversion from flux density to brightness temperature was made using the formula 2 : where T is the brightness temperature, I the flux in mJy beam −1 , ν the frequency in GHz, and θ maj and θ min are the half-power beam widths along the major and minor axes, respectively.

Identification of hot cores
To identify hot cores, we used two different approaches, one that uses molecules known to trace hot cores (see Sect. 3.5) and another one that does not need any line identification and relies on the richness of the hot core spectra in COM lines.Two methods based on this latter approach are presented in Sect.3.3 and Sect.3.4.The first one is based on the sum of the brightness temperature of all lines over the Band 6 spw7 band; this sum is computed for each pixel of a 2500 au spatial resolution map, and requires no prior knowledge of the region.The selected band is particularly rich in COM lines.The second one is based on the sum over a band of the line contribution averaged over the area of a continuum core and compared to the continuum emission.We use the different bands from Table 1 (bandwidth between 0.1 and 2 GHz).We study first the brightness temperature of the line emission to estimate the 'contamination' of the continuum emission by line emission, and then simply the number of lines detected to get the 'density in lines', which is the fraction of the band showing detected lines.This method requires identification of the continuum cores as a first step.We note that, as indicated in Sect.2, up to 11 of the 131 cores from M18 are out of the bounds of our data cubes.Specifically, this is the case for cores #15, #39, #44, #59, and #67 located in W43-MM1 SW, all of which are associated with molecular outflows (Nony et al. 2020).Among these cores, M18 indicated that only core #15 has detectable molecular lines.None of these are included in the analysis that follows.

Continuum level
We separated the continuum and the molecular emission in each pixel of the image using the method presented in Molet et al. (2019).The continuum level is estimated from the spectrum intensity channel distribution, after fitting it with an exponentially 2 https://science.nrao.edu/facilities/vla/proposing/TBconvmodifi noise a Figure

In
To qua ature a sum it refer t tively.
<> R in the spa the cha

H C
To  of the noise and the asymmetric distribution of the lines intensities (see Fig. 1).

Indicators of richness in lines
To quantify the line richness, we take the line brightness temperature at a pixel or average it over a given spatial region R, and sum it on individual channels over a given frequency range, and refer to the integrated line emission as I pix Lines and I R Lines , respectively.
where T Lines, i is the brightness temperature due to the lines in channel i; it is computed using the total brightness temperature T Total, i and the continuum value T Cont determined previously: <> R indicates the average of the line brightness temperature over the spatial region R, n chan is the number of channels, and ∆ν is the channel width.

Hot core identification from the spatial distribution of COMs
To highlight the presence of hot cores, we focus on the analysis of the 'continuum' band (spw7) at 233 GHz, because this band offers a large band width (∼2 GHz) that is not contaminated by strong emission lines coming from the simplest molecules (like H 2 CO, CO or SiO) but mainly by COMs.Moreover, M18 and Molet et al. (2019) have already studied the 233 GHz band towards W43-MM1.
For each pixel we compute the integrated line emission I pix Lines and divide by the number of channels (T pix Lines = I pix Lines /n chan ) to produce the mean line brightness temperature map in Fig. 2. Similarly, using the total channel intensity and the continuum contribution, we obtain maps of T pix Total and T pix Cont which are also shown in the same figure.We assume that the continuum intensity does not vary significantly over the spectral band (∆ν = 1.9 GHz), the expected difference being only 2% in this range of frequency.
Because the continuum emission is bright at 1.3 mm, the structures in the total integrated map are very similar to the ones found in the continuum map.To reveal the presence of hot cores, we look on the map for places where line emission T pix Lines is greater than the noise (first 3 sigma contour in Fig. 2).With A140, page 3 of 18 A&A 665, A140 (2022) A&A proofs: manuscript no.Brouillet-aa43669 Fig. 2: Lines, and total and continuum emission maps obtained from the spw7 band at 233 GHz.Contours represent 3,5,7,10,20,30,50,70, and 100 σ, with σ = 0.22 K, the rms in a channel of resolution ∆ν= 0.122 MHz.The hot cores are marked by a star symbol.The total emission is averaged over the 1.9 GHz band.For the line map, the first two contours are added in red to reveal the fainter hot cores.The spectra are spatially averaged over the source size.The grey horizontal line across the observed spectra (in black) is the continuum level obtained by the method of Molet et al. (2019).Below this line, the grey area represents the continuum integrated flux.Above this line, the blue area is the continuum brightness of the lines, which is estimated in each channel depending on the noise, whose 1σ and 2σ values are represented by horizontal lines.for the comparative line methods discussed in Sects.4 and 5.The detected lines are individually not significant enough for a clear constraint of line parameters (shape, centre, and width) and therefore the uncertainties on physical parameters such as temperature and column densities are too high.Nonetheless, a detailed previous study showed that we can analyse the same lines in the sources of W43-MM1 with the faintest 1.3 mm line, such as core #6, which is a high-mass prestellar or very young protostellar core.A complete analysis of the spectra from cores #3 and #6 can be found in Molet et al. (2019).A direct analysis for cores #1 and #4 is difficult because they appear to be interacting and affected by significant spectral confusion.

Hot core identification from the line densities in continuum cores
As M18 have already identified the continuum cores in the W43-MM1 region and defined their sizes and locations, we propose another method to highlight the cores that contain a hot core.Unlike the pixel-based analysis in Sect.3.3, we look for the relative contributions of continuum and molecular line emission in the spectra of the continuum cores, spatially integrated over the source size given by M18.
For each spectrum, we distinguish channels with molecular emission from line-free (or continuum) channels by verifying whether the core total intensity T core Total, i of the channel i -once the continuum is subtracted-is above or below 1σ (the rms measured in a line free part of the spectra), respectively (see Figure 3).After tests on synthetic spectra, we find that the contribution of the lines is better taken into account if we consider all the line signal for a channel with an intensity greater than 2σ, and only signal above 1σ for a channel with an intensity of between 1σ and 2σ.
Therefore, for a core spectrum, we define the line contribution C core Lines to the total brightness as Article number, page 4 of 19 this method, seven structures are highlighted, among which the largest one previously identified as N1a, detected at 5 × 3 resolution with NOEMA (Louvet et al. 2014), and separated into two substructures at 2400 au resolution with ALMA (Motte et al. 2018;Nony et al. 2020).The centre of these eight structures A&A proofs: manuscript no.Brouillet-aa43669 Fig. 2: Lines, and total and continuum emission maps obtained from the spw7 band at 233 GHz.Contours represent 3, 5, 7, 10, 20, 30, 50, 70, and 100 σ, with σ = 0.22 K, the rms in a channel of resolution ∆ν= 0.122 MHz.The hot cores are marked by a star symbol.The total emission is averaged over the 1.9 GHz band.
For the line map, the first two contours are added in red to reveal the fainter hot cores.The spectra are spatially averaged over the source size.The grey horizontal line across the observed spectra (in black) is the continuum level obtained by the method of Molet et al. (2019).Below this line, the grey area represents the continuum integrated flux.Above this line, the blue area is the continuum brightness of the lines, which is estimated in each channel depending on the noise, whose 1σ and 2σ values are represented by horizontal lines.for the comparative line methods discussed in Sects.4 and 5.The detected lines are individually not significant enough for a clear constraint of line parameters (shape, centre, and width) and therefore the uncertainties on physical parameters such as temperature and column densities are too high.Nonetheless, a detailed previous study showed that we can analyse the same lines in the sources of W43-MM1 with the faintest 1.3 mm line, such as core #6, which is a high-mass prestellar or very young protostellar core.A complete analysis of the spectra from cores #3 and #6 can be found in Molet et al. (2019).A direct analysis for cores #1 and #4 is difficult because they appear to be interacting and affected by significant spectral confusion.

Hot core identification from the line densities in continuum cores
As M18 have already identified the continuum cores in the W43-MM1 region and defined their sizes and locations, we propose another method to highlight the cores that contain a hot core.Unlike the pixel-based analysis in Sect.3.3, we look for the relative contributions of continuum and molecular line emission in the spectra of the continuum cores, spatially integrated over the source size given by M18.
For each spectrum, we distinguish channels with molecular emission from line-free (or continuum) channels by verifying whether the core total intensity T core Total, i of the channel i -once the continuum is subtracted-is above or below 1σ (the rms measured in a line free part of the spectra), respectively (see Figure 3).After tests on synthetic spectra, we find that the contribution of the lines is better taken into account if we consider all the line signal for a channel with an intensity greater than 2σ, and only signal above 1σ for a channel with an intensity of between 1σ and 2σ.
Therefore, for a core spectrum, we define the line contribution C core Lines to the total brightness as The spectra of the other cores identified by M18 do not show any lines in this band or the lines are not bright enough for the comparative line methods discussed in Sects.4 and 5.The detected lines are individually not significant enough for a clear constraint of line parameters (shape, centre, and width) and therefore the uncertainties on physical parameters such as temperature and column densities are too high.Nonetheless, a detailed previous study showed that we can analyse the same lines in the sources of W43-MM1 with the faintest 1.3 mm line, such as core #6, which is a high-mass prestellar or very young protostellar core.A complete analysis of the spectra from cores #3 and #6 can be found in Molet et al. (2019).A direct analysis for cores #1 and #4 is difficult because they appear to be interacting and affected by significant spectral confusion.

Hot core identification from the line densities in continuum cores
As M18 have already identified the continuum cores in the W43-MM1 region and defined their sizes and locations, we propose another method to highlight the cores that contain a hot core.Unlike the pixel-based analysis in Sect.3.3, we look for the relative contributions of continuum and molecular line emission in the spectra of the continuum cores, spatially integrated over the source size given by M18.
For each spectrum, we distinguish channels with molecular emission from line-free (or continuum) channels by verifying whether the core total intensity T core Total, i of the channel i -once the continuum is subtracted -is above or below 1σ (the rms measured in a line free part of the spectra), respectively (see Fig. 3).After tests on synthetic spectra, we find that the contribution of the lines is better taken into account if we consider all the line signal for a channel with an intensity greater than 2σ, and only signal above 1σ for a channel with an intensity of between 1σ and 2σ.The blue areas represent the results for spectra without many lines above the 1σ, 2σ, and 3σ levels, where σ is the rms noise level in one channel of the corresponding band.Cores identified as containing a hot core following the criterion described in Sect. 1 are marked by their core number.and using Equation 2: . (5) The left panel of Figure 4 presents, for the nine 1.3 mm ALMA bands, the relative contribution C core Lines of lines to the total flux versus T core Cont for all cores.We note that the rapid increase in the relative line contribution at low continuum values corresponds to the cores fainter in lines, for which the method is biased by the noise (blue areas).However, as in Sect.3.3, cores #1, #2, #3, #4, #5, #9, #10, and #11 clearly stand out.The results are consistent between the bands but the difference between the cores is more obvious for the bands with no strong line (e.g., at 216 GHz), as the strong lines come from molecules which are widespread; the difference is also better in bands with a large frequency range (e.g. the three bands from 231 to 234 GHz).
To avoid the confusion observed for the weak continuum values, another approach using the same method is to focus on the fraction F core Lines of channels considered as containing lines detected at a 2σ level, defined as: The results are presented in the right panel of Figure 4.The relative comparison highlights the same eight cores and we also observe a relation between the number of channels with molecular emission and the continuum level; this correlation is clearer than in the left panel for all bands, even for the 'CO' 230.30−230.76GHz band.Methyl formate (CH 3 OCHO) and methyl cyanide (CH 3 CN) are two abundant COMs and are therefore often used to trace hot cores (e.g.Blake et al. 1987;Wink et al. 1994, see also Sect.5.2).We used Equation 2 to integrate the line emission at each pixel and Fig. 5 presents maps of the two molecules using the CH 3 OCHO doublet at 216.21 GHz (see Sect. 5) and the CH 3 CN transitions from 91.958 to 91.987 GHz (see Sect. 4).
The methyl formate map highlights the eight hot cores identified on Fig. 2.These hot cores also stand out in the methyl cyanide map but they are surrounded by more widespread extended emission.Such extended CH 3 CN emission was also observed in DR21(OH) by Csengeri et al. (2011) who proposed that it traces warm gas associated with the low-velocity shocks due to converging flows coinciding with velocity shears.In W43-MM1, the extended CH 3 CN emission follows the spatial distribution of the narrow line width component of the SiO emission which originates from low-velocity shocks (Louvet et al. 2016, see their left panel of Fig. 4).These shocks are also likely associated with the ridge formation through colliding flows or cloud-cloud collision.

Comparison of the methods used here to identify hot cores
The two methods described in Sect.3.3 and 3.4 succeed in identifying the same hot cores.The interest of using only the spatial distribution of COMs is that it allows the user to identify potential hot cores independently of the identification of the continuum cores.However, a spectral band is required that has lines mainly coming from COMs, as in the Band 6 spw7 band, and Article number, page 5 of 19 Therefore, for a core spectrum, we define the line contribution C core Lines to the total brightness as C core Lines = I core Lines /I core Total , (4) and using Eq.2: . (5) The left panel of Fig. 4 presents, for the nine 1.3 mm ALMA bands, the relative contribution C core Lines of lines to the total flux versus T core Cont for all cores.We note that the rapid increase in the relative line contribution at low continuum values corresponds to the cores fainter in lines, for which the method is biased by the noise (blue areas).However, as in Sect.3.3, cores #1, #2, #3, #4, #5, #9, #10, and #11 clearly stand out.The results are consistent between the bands but the difference between the cores is more obvious for the bands with no strong line (e.g., at 216 GHz), as the strong lines come from molecules which are widespread; the difference is also better in bands with a large frequency range (e.g. the three bands from 231 to 234 GHz).
To avoid the confusion observed for the weak continuum values, another approach using the same method is to focus on the fraction F core Lines of channels considered as containing lines detected at a 2σ level, defined as: The results are presented in the right panel of Fig. 4. The relative comparison highlights the same eight cores and we also observe a relation between the number of channels with molecular emission and the continuum level; this correlation is clearer than in the left panel for all bands, even for the 'CO' 230.30−230.76GHz band.

Identification from the spatial distribution of CH 3 OCHO and CH 3 CN
Methyl formate (CH 3 OCHO) and methyl cyanide (CH 3 CN) are two abundant COMs and are therefore often used to trace hot cores (e.g.Blake et al. 1987;Wink et al. 1994, see also Sect.5.2).We used Eq. 2 to integrate the line emission at each pixel and Fig. 5 presents maps of the two molecules using the CH 3 OCHO doublet at 216.21 GHz (see Sect. 5) and the CH 3 CN transitions from 91.958 to 91.987 GHz (see Sect. 4).The methyl formate map highlights the eight hot cores identified on Fig. 2.These hot cores also stand out in the methyl cyanide map but they are surrounded by more widespread extended emission.Such extended CH 3 CN emission was also observed in DR21(OH) by Csengeri et al. (2011) who proposed that it traces warm gas associated with the low-velocity shocks due to converging flows coinciding with velocity shears.In W43-MM1, the extended CH 3 CN emission follows the spatial distribution of the narrow line width component of the SiO emission which originates from lowvelocity shocks (Louvet et al. 2016, see their left panel of Fig. 4).These shocks are also likely associated with the ridge formation through colliding flows or cloud-cloud collision.

Comparison of the methods used here to identify hot cores
The two methods described in Sects.3.3 and 3.4 succeed in identifying the same hot cores.The interest of using only the spatial distribution of COMs is that it allows the user to identify A140, page 5 of 18 the sensitivity will be limited by the width of the spectral band and the number of strong COM lines therein.
The second set of methods based on the relative contribution of lines C core Lines with respect to the continuum emission or the fraction F core Lines of channels with detected line emission needs first to identify the continuum cores, and the catalog of continuum cores will depend on the software package used for extraction (see e.g.Pouteau et al. 2022).M18 also identified potential hot cores from the line contamination in the emission of the continuum cores.These authors compared the fluxes measured in a 1.9 GHz 'continuum' band and in a selection of line-free channels summing up to 65 MHz and found the same eight cores as in Sect.3.4.However, they detected two more cores: core #15 which is not included in our analysis and core #30 which does not display any COM lines when looking at the spectra.
The method used in Sect.3.5 directly uses hot core tracers.However, it requires first to identify the lines and to be sure that these lines are not blended with other species, which is often the case in hot cores.Furthermore, one needs to determine the velocity of the cores to centre the map on the emission line.When the field of view is large, there is a velocity gradient that makes it more difficult to make a map: one can make a 'composite' map, adapting the velocity throughout the field, or one can take a large velocity window to integrate the emission but the sensibility will be less and the risk of blending lines will be higher.In the case of methyl cyanide, we have also noted that extended emission is also present, which makes it more difficult to identify the faintest hot cores.

Core temperatures
Methyl cyanide (CH 3 CN) and methyl acetylene (CH 3 CCH) are considered as two good thermometers, as long as the lines are optically thin, because their emission K-ladder lines are close in frequency and cover a large enough range of upper level energies E u (see e.g.Giannetti et al. 2017).

CH 3 CN
There are five lines of CH 3 CN (J = 5 − 4) between 91.95 and 91.99 GHz, with upper level energies E u ranging from 13 to 128 K. CH 3 CN (5 0 -4 0 ) and CH 3 CN (5 1 -4 1 ) are only separated by 2.3 MHz, and because of the average line width of 5 km s −1 , these two lines are blended.In the same ALMA spectral window, there are also five lines of the isotopolog CH 13  3 CN (J = 5 − 4) with the same E u ranging from 13 to 128 K, as well as ten CH 3 CN (J = 5 − 4) 8 =1 lines with E u ranging from 532 to 706 K.The spectroscopic parameters of the lines are given in Table 3.
The CH 3 CN and CH 13 3 CN spectra averaged over the beam for the eight cores are plotted in Figure 6.The pattern of the CH 3 CN lines is similar for all the cores, except for cores #1, #2, #3, and #4, where the five lines are almost equally intense because of line opacity.Nonetheless, the relative intensities of the optically thin lines of the isotopologue CH 13  3 CN of these four cores are the same as the optically thin CH 3 CN pattern of the other cores.The similarity of the pattern with lines of different E u suggests an equivalent temperature for all the cores.The CH 3 CN 8 =1 lines are only detected towards cores #1, #2, #3, #4, and #11 and are marginally detected towards core #5 (see Figure 7).
For each core in Fig. 6, we overlay a synthetic spectrum considering the temperatures, column densities, line widths, and a source size indicated in Table 2. Due to the high average H 2 density of these cores (3 -76 × 10 8 cm −3 , see M18), we consider that all lines are thermalised.The values are obtained with the Monte-Carlo Markov Chain algorithm and the LTE model of the CASSIS software 3 (Vastel et al. 2015).For cores #1, #2, #3, #4, #5, and #11, the temperatures, column densities, and line widths are first derived from a fit to the optically thin CH 13  3 CN and CH 3 CN 8 =1 lines assuming a source size equal to the beam (0.49 ).The CH 3 CN lines are then taken into account to derive the source size.For cores #9 and #10, the parameters are derived from a fit to the optically thin CH 3 CN and CH 13  3 CN lines assuming a source size equal to the beam.We find an isotopic ratio of about 42, consistent with the 40-50 value at 5.5 kpc (Milam et al. 2005).We assume here a simple model with a uniform source and the emission of CH 3 CN, CH 13  3 CN, and CH 3 CN 8 =1 coming from the same region.A more realistic source model will be used in a forthcoming paper.
The temperatures are similar for all the cores, ranging from 120 K to 160 K with uncertainties of ± 20 K.The broader line potential hot cores independently of the identification of the continuum cores.However, a spectral band is required that has lines mainly coming from COMs, as in the Band 6 spw7 band, and the sensitivity will be limited by the width of the spectral band and the number of strong COM lines therein.
The second set of methods based on the relative contribution of lines C core Lines with respect to the continuum emission or the fraction F core Lines of channels with detected line emission needs first to identify the continuum cores, and the catalog of continuum cores will depend on the software package used for extraction (see e.g.Pouteau et al. 2022).M18 also identified potential hot cores from the line contamination in the emission of the continuum cores.These authors compared the fluxes measured in a 1.9 GHz 'continuum' band and in a selection of line-free channels summing up to 65 MHz and found the same eight cores as in Sect.3.4.However, they detected two more cores: core #15 which is not included in our analysis and core #30 which does not display any COM lines when looking at the spectra.
The method used in Sect.3.5 directly uses hot core tracers.However, it requires first to identify the lines and to be sure that these lines are not blended with other species, which is often the case in hot cores.Furthermore, one needs to determine the velocity of the cores to centre the map on the emission line.When the field of view is large, there is a velocity gradient that makes it more difficult to make a map: one can make a 'composite' map, adapting the velocity throughout the field, or one can take a large velocity window to integrate the emission but the sensibility will be less and the risk of blending lines will be higher.In the case of methyl cyanide, we have also noted that extended emission is also present, which makes it more difficult to identify the faintest hot cores.

Core temperatures
Methyl cyanide (CH 3 CN) and methyl acetylene (CH 3 CCH) are considered as two good thermometers, as long as the lines are optically thin, because their emission K-ladder lines are close in frequency and cover a large enough range of upper level energies E u (see e.g.Giannetti et al. 2017).

CH 3 CN
There are five lines of CH 3 CN (J = 5-4) between 91.95 and 91.99 GHz, with upper level energies E u ranging from 13 to 128 K. CH 3 CN (5 0 -4 0 ) and CH 3 CN (5 1 -4 1 ) are only separated by 2.3 MHz, and because of the average line width of 5 km s −1 , these two lines are blended.In the same ALMA spectral window, there are also five lines of the isotopolog CH 13  3 CN (J = 5-4) with the same E u ranging from 13 to 128 K, as well as ten CH 3 CN (J = 5-4) 8 = 1 lines with E u ranging from 532 to 706 K.The spectroscopic parameters of the lines are given in Table 3.
The CH 3 CN and CH 13 3 CN spectra averaged over the beam for the eight cores are plotted in Fig. 6.The pattern of the CH 3 CN lines is similar for all the cores, except for cores #1, #2, #3, and #4, where the five lines are almost equally intense because of line opacity.Nonetheless, the relative intensities of the optically thin lines of the isotopologue CH 13  3 CN of these four cores are the same as the optically thin CH 3 CN pattern of the other cores.The similarity of the pattern with lines of different E u suggests an equivalent temperature for all the cores.The CH 3 CN 8 = 1 lines are only detected towards cores #1, #2, #3, #4, and #11 and are marginally detected towards core #5 (see Fig. 7).
For each core in Fig. 6, we overlay a synthetic spectrum considering the temperatures, column densities, line widths, and a source size indicated in Table 2. Due to the high average H 2 density of these cores (3-76 × 10 8 cm −3 , see M18), we consider that all lines are thermalised.The values are obtained with the Monte-Carlo Markov Chain algorithm and the LTE model of the CASSIS software 3 (Vastel et al. 2015).For cores #1, #2, #3, #4, #5, and #11, the temperatures, column densities, and line widths are first derived from a fit to the optically thin CH 13  3 CN and CH 3 CN 8 = 1 lines assuming a source size equal to the beam (0.49 ).The CH 3 CN lines are then taken into account to derive the source size.For cores #9 and #10, the parameters are derived from a fit to the optically thin CH 3 CN and CH 13  3 CN lines assuming a source size equal to the beam.We find an isotopic ratio of about 42, consistent with the 40-50 value at 5.5 kpc (Milam et al. 2005).We assume here a simple model with a uniform source and the emission of CH 3 CN, CH 13  3 CN, and CH 3 CN 8 = 1 coming from the same region.A more realistic source model will be used in a forthcoming paper.
The temperatures are similar for all the cores, ranging from 120 K to 160 K with uncertainties of ±20 K.The broader line widths for cores #2 and #5 can be due to multiple velocity components as seen in other COM lines (see Sect. 5.2).    2.
widths for cores #2 and #5 can be due to multiple velocity components as seen in other COM lines (see Sect. 5.2 and Figure 11).
The CH 3 CN transitions are also detected towards the possibly younger core #6 studied by Molet et al. (2019) and the derived temperature is 60 ± 20 K which is in agreement with the determinations in that paper.This temperature is notably different from the temperatures T ex ∼150 K we find here towards the hot cores.

CH 3 CCH
We selected five CH 3 CCH (J = 6 − 5) lines between 102.51 and 102.55 GHz, with upper level energies E u ranging from 17 to 132 K. CH 3 CCH (6 5 -5 5 ) is not studied here, because the line is too weak towards all the cores.As for CH 3 CN, the CH 3 CCH (6 0 -5 0 ) and CH 3 CCH (6 1 -5 1 ) lines are blended, and are also contaminated by acetone (CH 3 COCH 3 ) lines at 102.547 GHz in cores #1, #2, and #4.Furthermore the CH 3 CCH (6 2 -5 2 ) line at 102.540 GHz is contaminated by the ethylene glycol, Fig. 7: CH 3 CN 8 =1 synthetic spectra (in red for core #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table 2.The spectra for cores #3, #5, and #11 are smoothed to a velocity resolution of 6.35 km s −1 .Fig. 6.CH 13  3 CN (from 91.91 to 91.94 GHz) and CH 3 CN (91.95-91.99GHz) synthetic spectra (in red for cores #10 and #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table 2.
The CH 3 CN transitions are also detected towards the possibly younger core #6 studied by Molet et al. (2019) and the derived temperature is 60 ± 20 K which is in agreement with the determinations in that paper.This temperature is notably different from the temperatures T ex ∼ 150 K we find here towards the hot cores.
The CH 3 CN transitions are also detected towards the possibly younger core #6 studied by Molet et al. (2019) and the derived temperature is 60 ± 20 K which is in agreement with the determinations in that paper.This temperature is notably different from the temperatures T ex ∼150 K we find here towards the hot cores.
The CH 3 CCH spectra averaged over the beam are presented in Fig. 8.The emission from this molecule appears to be opti-Fig.7: CH 3 CN 8 =1 synthetic spectra (in red for core #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table 2.The spectra for cores #3, #5, and #11 are smoothed to a velocity resolution of 6.35 km s −1 .Fig. 8: CH 3 CCH synthetic spectra (in red for cores #10 and 102.540GHz is contaminated by the ethylene glycol, (CH 2 OH) 2 , (9 2,7 -8 2,6 ) line at 102.539 GHz mainly in core #1.
The CH 3 CCH spectra averaged over the beam are presented in Fig. 8.The emission from this molecule appears to be optically thin in all the cores.We overlay synthetic spectra whose parameters are indicated in Table 2.In the figure, we note the detection of an ethanol (C 2 H 5 OH) line at 102.534 GHz in cores #1, #2, #3, and #4, exhibiting a varying intensity from one core to the next.
Twelve CH 3 CCH 10 = 1 transitions with E u ranging from 487 to 741 K are included in the frequency range of the observations (between 102.74 and 102.94 GHz).The intensities estimated from the parameters in Table 2 are significantly below the noise level for a detection in any of the cores.
The difference in temperature between CH 3 CN (120-160 K) and CH 3 CCH (50-90 K) suggests that CH 3 CCH traces the outer envelope whereas CH 3 CN traces the inner part.Furthermore, the line width of the CH 3 CCH lines compared to that of the CH 3 CN lines is also smaller for each core.The observations and gasgrain chemical modelling suggest that the CH 3 CN emission in IRAS 16293-2422 also arises from a warmer and more interior region of the envelope than the CH 3 CCH emission (Andron et al. 2018).

Similarity of the normalised spectra
To compare the molecular composition of the selected cores, we first superposed their spectra.Because some molecular cores have much more intense lines than the others, we normalised the spectra using three bright methyl formate (CH 3 OCHO, hereafter MF) doublets for this purpose; their spectroscopic parameters are listed in Table 3.The MF doublet transitions have similar E u levels (99-109 K) and are therefore most probably tracing the same volume of gas.This strategy has the following advantages: (i) MF lines are common in all hot core spectra, (ii) easy identification of lines, (iii) lower optically thickness than for CH 3 OH lines, (iv) transitions of the two torsional A-and E-species close in frequency, (v) very low contamination level of these doublets.The spectra of each core have been aligned in velocity and multiplied by a factor so that the MF intensity of these three doublets is the same, taking core #4 as a reference.The individual velocity of the cores and the derived intensity ratio are displayed in Table 4.By applying these corrections, we obtain the superpositions shown in Fig. 9 for the six selected MF lines.The first doublet is slightly contaminated by the DCO + (3-2) line at 216.1126 GHz.We mapped the spatial distribution of DCO + emission; it is located in the high-density regions but avoids the hot cores (Molet 2019).Nonetheless, the line stands out on spectra for cores #5, #9, and #10 because the MF lines are much fainter than in the other hot cores.   CH 3 OH -Core #2 Fig. 11: Two velocity components are visible in COM lines for cores #2 and #5.For CH 3 OCHO and OC 33 S, the red component is at 99.2 km s −1 and the blue component at 94.8 km s −1 , with a half-power width of 4.2 km s −1 .For 13 CH 3 OH, the red component is at 99.2 km s −1 and the blue component at 94.7 km s −1 , with a half-power width of 6.0 km s −1 .
In the continuum spw7 band at 233 GHz, some lines are clearly more intense in the three cores #1, #4, and #11.These are all associated to NH 2 CHO transitions.As E u for these lines ranges from 94 K to 258 K, this is not an effect of temperature but can come from a larger relative abundance or a difference in the NH 2 CHO emission size.
We note also that core #1 is the richest core in molecules, for example with transitions of (CH 2 OH) 2 , H 13 CONH 2 , and NH 2 CN which are not present in the spectra of the other cores.is the inner part where the temperature is higher than ∼100 K and it would be interesting to know whether or not methyl formate is a good tracer of the heating due to the luminosity of protostellar objects.All COMs do not originate from gas with the same physical conditions.Chemical differentiation (CN-vs.O-bearing molecules) has been widely observed in a great many sources (e.g.Csengeri et al. 2019, and references therein).Towards G328.2551-0.5321,Csengeri et al. (2019) find that several O-bearing COMs peak at the proposed accretion shocks rather than at the radiatively heated core whereas CN-bearing The superposition result for the Band 6 spw7 is shown in Fig. 10 and the entire spectral band superposition results are shown in Appendix A. After normalisation to the MF lines, the spectra of the eight hot cores are relatively similar.The upper level energies of the transitions are different, with a large E u coverage for some molecules (e.g.CH 3 OH, CH 3 OCHO, C 2 H 5 CN).The fact that the intensity factor between the cores is the same for low-E u and high-E u transitions implies that the excitation temperatures of the cores are of the same order, which is in agreement with the results from the CH 3 CN analysis (see Sect. 4).
The strongest lines come from the simplest molecules (shown as vertical dotted lines in Fig. A.1).They are associated to the following molecules: CO (and C 18 O), SO, SiO, DCN, H 2 CO (and H 13  2 CO), HC 3 N, OCS (and 33 S and 13 C isotopologues), 13 CS, and H 2 C 34 S. The study of the distribution of these molecules for core #3 showed that they are mainly not peaking at the core, except for H 2 C 34 S, 13 CS, OCS, and its isotopologues (see Molet et al. 2019;Molet 2019).
For the molecules not centred on cores, the lines are generally wider and we can see line wings associated to the outflows.A detailed study of the CO(2-1) and SiO(5-4) outflows in W43-MM1 can be found in Nony et al. (2020).A broad and bright high-velocity component for core #9, which is especially visible on the HC 3 N, CO, and SO lines, is due to the presence in the projection plane of an outflow knot close to the core centre (see Fig. 3d of Nony et al. 2020).Lines are relatively intense for cores #5, #9, #10, and #11, which is probably because they are less optically thick and avoid self-absorption.A high-velocity component is visible on OCS and 13 CS lines for core #9.
The effect of optical thickness is visible in the line profiles of the OCS line at 231.061 GHz.If we consider that the OCS/MF ratio is the same in all the cores, the relative thickness of the OCS line for each core compared to the others is directly observable in Fig. A.1.Core #11 is the least optically thick in OCS, while cores #1 and #4 are the thickest.Furthermore, the dip at the centre of the line for these two cores confirms their strong opacity.Likewise, lines of CH 3 OH and its isotopologue 13 CH 3 OH in core #11 are more intense than in the other cores as they are optically thin in this core.
In the continuum spw7 band at 233 GHz, some lines are clearly more intense in the three cores #1, #4, and #11.These are all associated to NH 2 CHO transitions.As E u for these lines ranges from 94 K to 258 K, this is not an effect of temperature but can come from a larger relative abundance or a difference in the NH 2 CHO emission size.
We note also that core #1 is the richest core in molecules, for example with transitions of (CH 2 OH) 2 , H 13 CONH 2 , and NH 2 CN which are not present in the spectra of the other cores.
A c-C 3 H 2 transition is detected in the core #10 spectrum at 216.278 GHz, but the spatial distribution map seems to indicate that it is mainly associated to the outflows of core #2 (see Molet 2019).

Methyl formate as a tracer of hot cores
The current understanding is that COMs, like methyl formate, mainly form through ice chemistry on grains and are then released when dust temperatures become high enough for ices to sublimate (e.g.Öberg 2016; Van Dishoeck 2017).The hot core is the inner part where the temperature is higher than ∼100 K and it would be interesting to know whether or not methyl formate is a good tracer of the heating due to the luminosity of protostellar objects.All COMs do not originate from gas with the same physical conditions.Chemical differentiation (CNvs.O-bearing molecules) has been widely observed in a great A140, page 9 of 18 13 CH 3 OH -Core #2 Fig. 11: Two velocity components are visible in COM lines for cores #2 and #5.For CH 3 OCHO and OC 33 S, the red component is at 99.2 km s −1 and the blue component at 94.8 km s −1 , with a half-power width of 4.2 km s −1 .For 13 CH 3 OH, the red component is at 99.2 km s −1 and the blue component at 94.7 km s −1 , with a half-power width of 6.0 km s −1 .
In the continuum spw7 band at 233 GHz, some lines are clearly more intense in the three cores #1, #4, and #11.These are all associated to NH 2 CHO transitions.As E u for these lines ranges from 94 K to 258 K, this is not an effect of temperature but can come from a larger relative abundance or a difference in the NH 2 CHO emission size.
We note also that core #1 is the richest core in molecules, for example with transitions of (CH 2 OH) 2 , H 13 CONH 2 , and NH 2 CN which are not present in the spectra of the other cores.
A c-C 3 H 2 transition is detected in the core #10 spectrum at 216.278 GHz, but the spatial distribution map seems to indicate that it is mainly associated to the outflows of core #2 (see Molet 2019).

Methyl formate as a tracer of hot cores
The current understanding is that COMs, like methyl formate, mainly form through ice chemistry on grains and are then released when dust temperatures become high enough for ices to sublimate (e.g.Öberg 2016; Van Dishoeck 2017).The hot core Fig. 12: Emission inside the dotted circle is the 218 GHz methyl formate doublet integrated in velocity (blue: 90-97 km s −1 , red: 97-104 km s −1 ) towards core #2.The emission outside is the SiO (blue: 82-88 km s −1 , red: 108-119 km s −1 ).Contours are 50% and 80% of maximum.The black ellipse is the methyl formate 0.28 × 0.20 beam.The black cross marks the centre of the continuum core and the red and blue crosses the maximum of the methyl formate emission.
is the inner part where the temperature is higher than ∼100 K and it would be interesting to know whether or not methyl formate is a good tracer of the heating due to the luminosity of protostellar objects.All COMs do not originate from gas with the same physical conditions.Chemical differentiation (CN-vs.O-bearing molecules) has been widely observed in a great many sources (e.g.Csengeri et al. 2019, and references therein).Towards G328.2551-0.5321,Csengeri et al. (2019) find that several O-bearing COMs peak at the proposed accretion shocks rather than at the radiatively heated core whereas CN-bearing molecules peak towards the central protostar.
In W43-MM1, at a spatial resolution of ∼2500 au, the emission of the COMs is spatially centred on the hot cores.However a clear second component of methyl formate (shifted by ∼ 4 km s −1 ) is visible for cores #2 and #5 (Figure 11).This component is also spatially centred on the continuum core and does not come from a nearby source.A second faint component may be present as well for cores #9 and #10 and a faint third component for core #5 (shifted by ∼ 7 km s −1 ).These components are visible in other O-bearing molecules such as CH 18  3 OH and OCS and its isotopologues and for optically thin lines.
Article number, page 9 of 19 Fig. 11.Two velocity components are visible in COM lines for cores #2 and #5.For CH 3 OCHO and OC 33 S, the red component is at 99.2 km s −1 and the blue component at 94.8 km s −1 , with a half-power width of 4.2 km s −1 .For 13 CH 3 OH, the red component is at 99.2 km s −1 and the blue component at 94.7 km s −1 , with a half-power width of 6.0 km s −1 .y of the main hot cores in W43-MM1: detection, temperature, and molecular composition he spectra are aligned in velocity and multiplied by a factor in order to es of the 216200 and 218200 MHz bands. is the inner part where the temperature is higher than ∼100 K and it would be interesting to know whether or not methyl formate is a good tracer of the heating due to the luminosity of protostellar objects.All COMs do not originate from gas with the same physical conditions.Chemical differentiation (CN-vs.O-bearing molecules) has been widely observed in a great many sources (e.g.Csengeri et al. 2019, and references therein).Towards G328.2551-0.5321,Csengeri et al. (2019) find that several O-bearing COMs peak at the proposed accretion shocks rather than at the radiatively heated core whereas CN-bearing molecules peak towards the central protostar.
In W43-MM1, at a spatial resolution of ∼2500 au, the emission of the COMs is spatially centred on the hot cores.However a clear second component of methyl formate (shifted by ∼ 4 km s −1 ) is visible for cores #2 and #5 (Figure 11).This component is also spatially centred on the continuum core and does not come from a nearby source.A second faint component may be present as well for cores #9 and #10 and a faint third component for core #5 (shifted by ∼ 7 km s −1 ).These components are visible in other O-bearing molecules such as CH 18  3 OH and OCS and its isotopologues and for optically thin lines.many sources (e.g.Csengeri et al. 2019, and references therein).Towards G328.2551-0.5321,Csengeri et al. (2019) find that several O-bearing COMs peak at the proposed accretion shocks rather than at the radiatively heated core whereas CN-bearing molecules peak towards the central protostar.
In W43-MM1, at a spatial resolution of ∼2500 au, the emission of the COMs is spatially centred on the hot cores.However a clear second component of methyl formate (shifted by ∼4 km s −1 ) is visible for cores #2 and #5 (Fig. 11).This component is also spatially centred on the continuum core and does not come from a nearby source.A second faint component may be present as well for cores #9 and #10 and a faint third component for core #5 (shifted by ∼7 km s −1 ).These components are visible in other O-bearing molecules such as CH 18  3 OH and OCS and its isotopologues and for optically thin lines.
Figure 12 presents the spatial distribution of the blue and red components of the methyl formate lines for core #2 but at a higher resolution (0.24 or ∼1300 au).These components do not peak at the continuum centre and their positions coincide with the blue and red parts of the outflows as traced by the CO and SiO emission (see also Fig. 3c of Nony et al. 2020).This suggests that the methyl formate emission, as well as that from the methanol and OCS, is related to the outflows and that they could have been released from the ice mantles via sublimation through shocks or UV irradiation by the protostar on the walls of the outflow cavity.The enhancement of COMs in regions of outflows is commonly observed towards high-mass (e.g.Favre et al. 2011;Palau et al. 2017) and low-mass (e.g.Drozdovskaya et al. 2015;Lefloch et al. 2017;Belloche et al. 2020;De Simone et al. 2020) protostars.If the protostellar luminosity increases, the phenomena of accretion, ejection, and shocks will be enhanced and the methyl formate emission will increase as well and can be used to trace the hot cores and the thermal heating from their embedded protostellar objects (Bonfand et al., in prep.).A full modelling of each source in terms of physical structure and chemical composition would be the ideal approach but this is a lengthy process.We propose here a simpler, preliminary approach to start quantifying the similarity of the cores based on correlation plots of each source spectrum with that of a reference source.

Discussion
The plots are presented in Fig. 13, taking core #3 as a reference.The intensity of each channel in the spectra of the different cores is plotted with respect to the intensity of the same velocity channel of core #3, after shifting the spectra with respect to the core velocities.Here we selected core #3 as a reference source, as the lines are intense but less optically thick than in core #4, which avoids biases (see Sect. 6.2).Larger circles indicate peaks in the core #3 spectrum, defined as channels above their two closest neighbours and stronger than 1 K (to remain well above the noise).The plots show a general correlation but with some dispersion, and a tendency in some cases towards a curved rather than a linear relation.The slopes of the linear fits (renormalised to 1 for core #4 for comparison) are given in Table 4 and they are plotted versus the methyl formate scaling factors in Fig. 14.It appears that the two methods for comparing the line spectra give coherent results.
Hereafter, we first discuss various causes of possible dissimilarity in the spectra and how they affect the spectrum versus spectrum plots.We then discuss the observed similarity in a more quantitative way and find an agreement within a factor 2-3.

Reasons for dissimilarity
We consider that all lines are thermalised (see Sect. 4.1).If the molecular emission of two cores is identical, the spectrumspectrum plot should be linear, and only affected by the observational noise (5 sigma ∼1 K).Possible reasons for dissimilarity are listed below and illustrated using simulated spectra in Appendix B: -Velocities: If the emission lines of the molecules are not centred at the same velocity in two cores, each line will appear as a loop in the diagram due to the Doppler shift.To remove this effect, we realigned each core spectrum with respect to the   A full modelling of each source in terms of physical structure and chemical composition would be the ideal approach but this is a lengthy process.We propose here a simpler, preliminary approach to start quantifying the sim-  Notes.The coordinates, masses, and temperatures are from M18. is the core velocity in the local standard of rest.The CH 3 OCHO intensity factors are estimated from the six lines (three doublets) of CH 3 OCHO so that the intensities match those of core #4.The slope refers to the fit of Figure 13 and is renormalised to 1 for core #4 for comparison with the CH 3 OCHO intensity factor.
Fig. 14: Slope of the linear fit of Figure 13 versus the methyl formate scaling factor of Figure 10.
ilarity of the cores based on correlation plots of each source spectrum with that of a reference source.
spectrum plot should be linear, and only affected by the observational noise (5 sigma ∼1 K).Possible reasons for dissimilarity are listed below and illustrated using simulated spectra in Appendix B: -Velocities: If the emission lines of the molecules are not centred at the same velocity in two cores, each line will appear as a loop in the diagram due to the Doppler shift.To remove this effect, we realigned each core spectrum with respect to the reference core spectrum by varying the relative velocity shift to minimise the dispersion in the spectrum-spectrum plot.
-Linewidths: a difference in line width would increase the dispersion, but here all sources have similar line widths, except core #5 (see Table 2).-Masses: If the cores have a similar structure, but one is more massive, the relation between optically thin lines will be linear, but with a slope different from 1.The stronger lines will not be affected by optical thickness in the same way, and the line profiles will differ.-Temperatures: if the temperature is different in the two cores, each individual optically thin line will still lead to linear re- i .For all sources, R i remains close to 1 within a factor of 3, and in many cases within a factor of 2, for most channels i where the core #3 spectrum is above 2 K (core #3 maximum intensity being about 12.9 K); these limits are indicated respectively by the green and red horizontal lines.One notes a slight decrease in the ratio R i with T S 3 i which is due to opacity effects in the strongest lines.Core #11, which is the least massive, appears the least well correlated to core #3.The rather limited dispersion in the plots indicates a limited role of the potential dissimilarity factors listed above.reference core spectrum by varying the relative velocity shift to minimise the dispersion in the spectrum-spectrum plot.
-Linewidths: a difference in line width would increase the dispersion, but here all sources have similar line widths, except core #5 (see Table 2).
-Masses: if the cores have a similar structure, but one is more massive, the relation between optically thin lines will be linear, but with a slope different from 1.The stronger lines will not be affected by optical thickness in the same way, and the line profiles will differ.
-Temperatures: if the temperature is different in the two cores, each individual optically thin line will still lead to linear relations but with different slopes for different energy levels.
-Abundances: if the relative abundance of molecules is not exactly the same, thin lines will present a linear relation but with different slopes for each species.From an astrophysical point of view, sources in our sample differ by a factor 6 in mass for the massive cores (even up to ∼50 if core #11 is included), leading to much higher column densities and opacities in some of them.Moreover, some cores might include unresolved multiple sources and some might have a noticeable proportion of molecules released by shocks (e.g.linked to bipolar flows) in addition to thermal desorption and ice sublimation.In those cases, the kinematics and the composition of the gas could be affected to some extent.However, observationally, as shown below, the molecular emission spectra of the cores are quite similar.

Similarity from ratio plots
To get a more quantitave indication of the similarity of the spectra, we plotted in Fig. 15 the logarithm of the ratio of one spectrum to the linear fit (see Fig. 13).More precisely, we define the quantity R i for each frequency channel i as R i = T S i /(a 1 × T S ref i ) where T S i and T S ref i are the channel intensities, respectively, for core S and a core S ref taken as reference, and a 1 is the slope of the linear fit (with no constant term) of T S i versus T S ref i .As mentioned above, the spectrum of S was first realigned in velocity with respect to the spectrum of S ref .Figure 15 presents plots of log (R i ) versus T S ref i .For all sources, R i remains close to 1 within a factor of 3, and in many cases within a factor of 2, for most channels i where the core #3 spectrum is above 2 K (core #3 maximum intensity being about 12.9 K); these limits are indicated respectively by the green and red horizontal lines.One notes a slight decrease in the ratio R i with T S 3 i which is due to opacity effects in the strongest lines.Core #11, which is the least massive, appears the least well correlated to core #3.The rather limited dispersion in the plots indicates a limited role of the potential dissimilarity factors listed above.
The agreement within a factor 2-3 between the spectra suggests a similar molecular composition of the cores, which could be due to the formation of the molecules.The lines in the spw7 band are principally those of COMs which are mainly formed on similar ices in the filament and desorbed by shocks and/or by the high temperature.

Conclusion
Here, we studied the molecular composition of the rich highmass star forming region W43-MM1 with ALMA at high spatial resolution (0.5 ).This study proposes analysis tools and lays the groundwork for future comparisons to similar systems.
-We developed different methods to identify the molecular hot cores.The first one relies on the continuum versus line emission separation method developed by Molet et al. (2019), which we applied to a 2 GHz band around 233 GHz that is rich in COM lines and is not contaminated by strong lines of simpler species.Hot cores are then identified in the map of the continuum-subtracted brightness temperature averaged over the band, the peaks of which highlight intense COM emission.
-A second hot core identification method uses the relative contribution of lines and continuum, but in spectra spatially averaged on previously identified continuum cores.In this case, all the nine bands we observed are used, narrow as well as broad.The criterion relies either on the summed line intensities or on the number of line channels.The results are in general agreement with the first method but some bands with strong lines of simple species are definitely less sensitive.
-We made methyl formate (CH 3 OCHO) and methyl cyanide (CH 3 CN) maps which highlight the same hot cores as previously determined and confirmed their nature.We also note extended methyl cyanide emission which may trace the warm gas associated with the low-velocity shocks also observed in SiO.
-Seven hot cores with 16 to 100 M in mass and one less massive 2 M core were identified.
-For each identified core, we determined mean temperatures using the classical 'thermometer molecules' methyl cyanide (CH 3 CN) and methyl acetylene (CH 3 CCH) lines at 3 mm.The CH 13  3 CN isotopologue lines allowed us to circumvent the optical thickness of the lines in the strongest sources (#1-4).CH 3 CN temperatures are all consistently around 150 K, whereas CH 3 CCH leads to a lower value in the 50-90 K range.This is interpreted as being due to a distribution extending further into the envelope beyond the hot core region where ice mantles have been sublimated.The previous studied core #6 is confirmed as atypical with a lower CH 3 CN temperature of ∼60 K.
-We compared the chemical composition of the cores using two methods.First, we directly superposed the COM-rich ∼2 GHz wide spectra around 233 GHz after a scaling in intensity based on methyl formate lines.We then plotted correlation diagrams of the brightness temperature in each channel.No line identification is required.We find general good agreement, that is, to within a factor of 2-3, between the mean in the chemical composition of the various hot cores, which cover an order of magnitude in mass.
-Simpler species, such as SiO, DCN, H 2 CO, CO, and SO do not have emission concentrated in the cores; but H 2 C 34 S, 13 CS, and OCS (and isotopologues) do show such concentrations.
-In core #2, we find a spatial association between the blue and red velocity components of methyl formate and the outflow lobes.We plan to develop these studies in the frame of forthcoming analyses of the sources in the W43-MM2 and W43-MM3 ALMA-IMF regions.

NFig. 1 :
Fig.1: Left: Spectrum with molecular line emission in the Band 6 spw7 band.Right: Distribution of the intensity channels (in black).An exponentially modified Gaussian (in red) is adjusted to fit the Gaussian part due to the noise, whose peak is taken as the continuum value, and the tail associated with the molecular emission.

Fig. 1 .
Fig.1.Left: spectrum with molecular line emission in the Band 6 spw7 band.Right: distribution of the intensity channels (in black).An exponentially modified Gaussian (in red) is adjusted to fit the Gaussian part due to the noise, whose peak is taken as the continuum value, and the tail associated with the molecular emission.

Fig. 3 :
Fig.3: Separation of continuum and line contributions for cores #5 and 11 as examples.The spectra are spatially averaged over the source size.The grey horizontal line across the observed spectra (in black) is the continuum level obtained by the method ofMolet et al. (2019).Below this line, the grey area represents the continuum integrated flux.Above this line, the blue area is the continuum brightness of the lines, which is estimated in each channel depending on the noise, whose 1σ and 2σ values are represented by horizontal lines.

Fig. 2 .
Fig.2.Lines, and total and continuum emission maps obtained from the spw7 band at 233 GHz.Contours represent 3, 5, 7, 10, 20, 30, 50,  70, and 100 σ, with σ = 0.22 K, the rms in a channel of resolution ∆ν = 0.122 MHz.The hot cores are marked by a star symbol.The total emission is averaged over the 1.9 GHz band.For the line map, the first two contours are added in red to reveal the fainter hot cores.

Fig. 3 :
Fig.3: Separation of continuum and line contributions for cores #5 and 11 as examples.The spectra are spatially averaged over the source size.The grey horizontal line across the observed spectra (in black) is the continuum level obtained by the method ofMolet et al. (2019).Below this line, the grey area represents the continuum integrated flux.Above this line, the blue area is the continuum brightness of the lines, which is estimated in each channel depending on the noise, whose 1σ and 2σ values are represented by horizontal lines.

Fig. 4 :
Fig.4: Relative line contribution to the total flux (left) and fraction of channels that contain molecular emission (right) as a function of the continuum level obtained for the 1.3 mm bands.Red dots represent high-mass cores with M > 10 M , grey dots represent the other cores.The blue areas represent the results for spectra without many lines above the 1σ, 2σ, and 3σ levels, where σ is the rms noise level in one channel of the corresponding band.Cores identified as containing a hot core following the criterion described in Sect. 1 are marked by their core number.

3. 5 .
Identification from the spatial distribution of CH 3 OCHO and CH 3 CN

Fig. 4 .
Fig. 4. Relative line contribution to the total flux (left) and fraction of channels that contain molecular emission (right) as a function of the continuum level obtained for the 1.3 mm bands.Red dots represent high-mass cores with M > 10 M , grey dots represent the other cores.The blue areas represent the results for spectra without many lines above the 1σ, 2σ, and 3σ levels, where σ is the rms noise level in one channel of the corresponding band.Cores identified as containing a hot core following the criterion described in Sect. 1 are marked by their core number.
Fig.6: CH13  3 CN (from 91.91 to 91.94 GHz) and CH 3 CN (91.95 to 91.99 GHz) synthetic spectra (in red for cores #10 and #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table2.
Fig.6: CH13  3 CN (from 91.91 to 91.94 GHz) and CH 3 CN (91.95 to 91.99 GHz) synthetic spectra (in red for cores #10 and #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table2.

Fig. 7 .
Fig. 7. CH 3 CN 8 = 1 synthetic spectra (in red for core #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table2.The spectra for cores #3, #5, and #11 are smoothed to a velocity resolution of 6.35 km s −1 .

Fig. 8 :
Fig.7: CH 3 CN 8 =1 synthetic spectra (in red for core #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table2.The spectra for cores #3, #5, and #11 are smoothed to a velocity resolution of 6.35 km s −1 .

Fig. 8 .
Fig. 8. CH 3 CCH synthetic spectra (in red for cores #10 and #11, in black for the others) overlaid on the observed spectra.The parameters used for the synthetic spectra are listed in Table 2.The CH 3 CCH lines are contaminated by an ethanol line at 102.534 GHz in cores #1, #2, #3, and #4, an ethylene glycol line at 102.539 GHz mainly in core #1, and acetone lines at 102.547 GHz in cores #1, #2, and #4.

Fig. 9 .
Fig. 9. Superposition of the methyl formate doublet spectra of the hot cores.The spectra are normalised with respect to the six strongest and lightly contaminated methyl formate lines.The DCO + (3-2) emission at 216.1126 GHz is weak with respect to the methyl formate emission towards the hot cores.

Notes.
Fig. 10: Comparison of the spectra of the hot cores.The spectra are aligned in velocity and multiplied by a factor in order to normalise to a peak value of 1 for the methyl formate lines of the 216200 and 218200 MHz bands.

Fig. 12 :
Fig.12: Emission inside the dotted circle is the 218 GHz methyl formate doublet integrated in velocity (blue: 90-97 km s −1 , red: 97-104 km s −1 ) towards core #2.The emission outside is the SiO (blue: 82-88 km s −1 , red: 108-119 km s −1 ).Contours are 50% and 80% of maximum.The black ellipse is the methyl formate 0.28 × 0.20 beam.The black cross marks the centre of the continuum core and the red and blue crosses the maximum of the methyl formate emission.

Fig. 10 .
Fig. 10.Comparison of the spectra of the hot cores.The spectra are aligned in velocity and multiplied by a factor in order to normalise to a peak value of 1 for the methyl formate lines of the 216 200 and 218 200 MHz bands.
Fig. 10: Comparison of the spectra of the hot cores.The spectra are aligned in velocity and multiplied by a factor in order to normalise to a peak value of 1 for the methyl formate lines of the 216200 and 218200 MHz bands.
Fig.12: Emission inside the dotted circle is the 218 GHz methyl formate doublet integrated in velocity (blue: 90-97 km s −1 , red: 97-104 km s −1 ) towards core #2.The emission outside is the SiO (blue: 82-88 km s −1 , red: 108-119 km s −1 ).Contours are 50% and 80% of maximum.The black ellipse is the methyl formate 0.28 × 0.20 beam.The black cross marks the centre of the continuum core and the red and blue crosses the maximum of the methyl formate emission.

Fig. 12 .
Fig.12.Emission inside the dotted circle is the 218 GHz methyl formate doublet integrated in velocity (blue: 90-97 km s −1 , red: 97-104 km s −1 ) towards core #2.The emission outside is the SiO (blue: 82-88 km s −1 , red: 108-119 km s −1 ).Contours are 50% and 80% of maximum.The black ellipse is the methyl formate 0.28 × 0.20 beam.The black cross marks the centre of the continuum core and the red and blue crosses the maximum of the methyl formate emission.
: similarity of the molecular composition of the hot cores 6.1.General similarity of the spectra The simple superposition of the spectra of the different cores (Fig. A.1) suggests a general similarity.If confirmed, this would point towards both a similar chemical composition and excitation of most of the COMs.

Fig. 13 :
Fig.13: Intensity of frequency channels for cores #1, #2, #4, #5, #9, #10, and #11 versus the intensity of the same frequency channel for core #3 for the spectra of the spw7 band after correction of the velocity.The circles mark the peaks of the spectra.The line is a linear fit to the point distribution.
General similarity of the spectra The simple superposition of the spectra of the different cores (Figure A.1) suggests a general similarity.If confirmed, this would point towards both a similar chemical composition and excitation of most of the COMs.

Fig. 13 .
Fig.13.Intensity of frequency channels for cores #1, #2, #4, #5, #9, #10, and #11 versus the intensity of the same frequency channel for core #3 for the spectra of the spw7 band after correction of the velocity.The circles mark the peaks of the spectra.The line is a linear fit to the point distribution.

Fig. 14 .
Fig. 14.Slope of the linear fit of Fig. 13 versus the methyl formate scaling factor of Fig. 10.

Fig. 15 .
Fig. 15.Logarithm of the normalised ratio R i = T S i /(a 1 × T S 3 i ) for cores #1, #2, #4, #5, #9, #10, and #11 versus T S 3 i , the channel intensity of the reference source core #3 (see Sect. 6.3).Horizontal red and green lines indicate a departure from R i = 1 by a factor 2 and 3, respectively.Circles indicate peaks in the core #3 spectrum.Most points right of T S 3 i = 2 K remain within a factor of 2 of the linear fit shown in Fig.13(slope a 1 ), which indicates a good general similarity in the spectra.

NFig
Fig. A.1: Comparison of the spectra of the eight hot cores.The spectra are aligned in velocity and multiplied by a factor in order to normalise the methyl formate lines of the 216200 and 218200 MHz bands.The hatched rectangles indicate the regions of the spectra with strong noise.Article number, page 15 of 19 Fig. A.1: Comparison of the spectra of the eight hot cores.The spectra are aligned in velocity and multiplied by a factor in order to normalise the methyl formate lines of the 216200 and 218200 MHz bands.The hatched rectangles indicate the regions of the spectra with strong noise.

Table 1 .
Parameters of the 3 mm and 1.3 mm ALMA spectral windows.

Table 2 .
CH 3 CN and CH 3 CCH column densities and temperatures towards the hot cores.

Table 3 .
Spectroscopic parameters of the CH 3 CN, CH 3 CCH, and CH 3 OCHO lines studied.Notes.ν is the frequency, E u the upper state energy, S µ 2 the line strength, and A i j the Einstein coefficient for spontaneous emission.

Table 2 :
CH 3 CN and CH 3 CCH column densities and Fig.9: Superposition of the methyl formate doublet spectra of the hot cores.The spectra are normalised with respect to the six strongest and lightly contaminated methyl formate lines.The DCO + (3-2) emission at 216.1126 GHz is weak with respect to the methyl formate emission towards the hot cores.matedfrom the parameters in Table2are significantly below the noise level for a detection in any of the cores.

Table 4 .
Mass, dust temperature, velocity, and intensity scale factors of the hot cores.

Table 4 :
Mass, dust temperature, velocity, and intensity scale factors of the hot cores.