Free Access
Issue
A&A
Volume 654, October 2021
Article Number A158
Number of page(s) 24
Section Numerical methods and codes
DOI https://doi.org/10.1051/0004-6361/202039360
Published online 27 October 2021

© ESO 2021

1. Introduction

Characterising the composition of astrophysical ice mantles in the interstellar medium (ISM) is crucial for understanding the chemical evolution of young stellar objects (YSOs). With this goal, systematic studies in the infrared (IR) with ground- and space-based telescopes (e.g. Chiar et al. 1995; Schutte et al. 1996; Gibb et al. 2000; Pontoppidan et al. 2003; Boogert et al. 2008; Zasowski et al. 2009; Thi et al. 2011; Öberg et al. 2011) aided by laboratory experiments simulating diverse chemical environments and irradiation fields (e.g. Gerakines et al. 1995; Schutte et al. 1993; Palumbo et al. 1998; Watanabe & Kouchi 2002; Fuchs et al. 2009; Pilling et al. 2010; Linnartz et al. 2015; Terwisscha van Scheltinga et al. 2018; Rachid et al. 2020; Ioppolo et al. 2021) have advanced our knowledge of the composition of interstellar ices.

The Infrared Space Observatory (ISO) allowed the first systematic studies of ices in star-forming regions (e.g. Schutte et al. 1996; van Dishoeck et al. 1998). These observations revealed the presence of H2O, NH3, and CH4, and allowed the synthesis of new chemical species to be inferred, such as CH3OH and , which are both byproducts of heating and energetic processing by photons and cosmic rays (Gibb et al. 2000, 2004). Tentative assignments of CH3CHO (acetaldehyde) and HCOOH (formic acid) relying on the absorption features at 7.24 μm and 7.41 μm were made by Schutte et al. (1999). However, the strong bands at 5.80 μm associated with these two molecules have not yet been unequivocally detected, and therefore the presence of these molecules is still a matter of debate.

The Spitzer Space Telescope, which has improved sensitivity compared to ISO, allowed ice surveys toward low-mass star-forming regions, background stars, and high-UV environments (e.g. Bergin et al. 2005; Whittet et al. 2009; Reach et al. 2009; Boogert et al. 2013). Systematic analyses of the 5 − 8 μm absorption complex, the CH4 band at 7.7 μm, the CO2 bending mode at 15.2 μm, and NH3 and CH3OH absorption features at 9.01 μm and 9.74 μm were performed by Boogert et al. (2008), Öberg et al. (2008), Pontoppidan et al. (2008), and Bottinelli et al. (2010), respectively. All of the above authors used a phenomenological approach and/or analytical function decomposition to estimate environment-dependent variations in order to avoid degenerated solutions in characterising the ice morphology. A statistical analysis of the ice column density and abundance with respect to H2O in these four studies was done by Öberg et al. (2011), who concluded that, with the exception of CO, CO2, and CH4 (the most volatile species), there is no significant statistical abundance variation between low- and high-mass YSOs in the case of OCN, CH3OH, and the broad residual component C5 described in Boogert et al. (2008). However, in the case of background sources, the abundances of CO and CO2 are similar to those seen in low-mass YSOs and are higher than those seen in the high-mass YSOs by a factor of around 3.4 (Boogert et al. 2013, 2015).

Considering pathways and morphology, the analysis of Spitzer observations demonstrated that CO2, NH3, and CH4 ice abundances do not vary significantly with respect to H2O ice, which indicates co-formation (Öberg et al. 2011). On the contrary, the same work shows that CH3OH and CO ices vary by an order of magnitude relative to water ice, indicating different formation pathways. is anti-correlated with H2O, likely because of the high desorption temperature compared to other volatiles (Greenberg & D’Hendecourt 1985). In the case of CO and CO2, there is evidence in support of cosmic-ray processing –given the abundance ratio CO:CO2– and of thermal evolution, which is mainly based on the CO2 bending mode splitting at 15.2 μm (Pontoppidan et al. 2008). Attempts at assigning more complex molecules such as HCOOH, CH3CHO, and CH3CH2OH were also made by Öberg et al. (2011), although they are still relying on the absorption features between 7 and 8 μm.

The main disadvantage of the phenomenological approach is the limited information about ice morphology and the a priori assumption of carriers associated with the absorption bands. In addition, these methods are usually limited to a short wavelength range to address a small set of spectral components (e.g. 3−5; Pontoppidan et al. 2003; Thi et al. 2006; Boogert et al. 2008). Other approaches combining laboratory ice spectra to fit the observed spectra have been adopted in previous works (Merrill et al. 1976; Gibb et al. 2004; Thi et al. 2006; Zasowski et al. 2009; Suutarinen 2015). Although the degeneracies are high in these methods, some issues in the direct comparison of the observations with the experimental data were revealed, as is the case for the absorption peak at 6 μm attributed to H2O ice –which is found to be deeper than the features at 3 μm and 13 μm (Schutte et al. 1996; Keane et al. 2001)– and the nature of the peaks at 3.55 μm and 6.85 μm, which cannot be solely attributed to CH3OH ice (Dartois & d’Hendecourt 2001; Schutte & Khanna 2003; Bottinelli et al. 2010).

In the interest of exploiting large data sets of laboratory ice spectra to identify and quantify molecules in ice mantles observed toward YSOs, we present a new computational code for spectral analysis. The code uses a genetic modelling algorithm (Holland 1975) to decompose IR spectra of protostars using a linear combination of laboratory data themselves. Evolutionary algorithms have successfully been used in the literature to derive the dust composition of asymptotic giant branch (AGB) stars (Baier et al. 2010), and to derive the physical properties of protostars (Woitke et al. 2016). This paper is laid out as follows: Sect. 2 lists the public databases containing ice spectra used in this work and Sect. 3 details the code methodology. The accuracy and performance tests of this tool are discussed in Sect. 4, and an application to a real source, the Class I protostar Elias 29, is shown in Sect. 5. A summary of the capabilities of the ENIIGMA fitting tool and the results of the Elias 29 spectral analysis are given in Sect. 6.

2. The ENIIGMA fitting tool

The ENIIGMA (dEcompositioN of Infrared Ice features using Genetic Modelling Algorithm) fitting tool is designed for the analysis of observational spectra containing ice absorption features. As a general overview, this tool is split into three major modules; the first provides Python functions to calculate the continuum spectral energy distribution (SED) of YSO spectra and to remove silicate features. The second focuses on the unbiased spectral decomposition using a linear combination of laboratory ice spectra. The third offers a statistical analysis of the fits such as confidence intervals and degeneracy quantification. The ENIIGMA fitting tool documentation and download instructions are publicly available1.

In the first module, the ENIIGMA makes use of previous techniques available in the literature for continuum determination (e.g. Boogert et al. 2000) and silicate removal from a template (e.g. Bottinelli et al. 2010). The second and third modules represent the novelty of the code: genetic modelling algorithms are used to search for the optimal solution in a large problem space followed by a robust statistical analysis of the results. Global optimisation is a mathematical method that aims to search for the best overall solution providing the maximum likelihood among the parameters. This method is unbiased in the sense that a large sample of ice spectra are tested without targeting any particular species, and the final statistics-based decision of the best fit is made. In the context of ice observations in space, ENIIGMA allows us to explore a significant amount of IR laboratory data and to decipher the best linear combination of those that fit the observational spectrum. ice components. A flowchart of the entire procedure is shown in Fig. 1, and the different steps are detailed in the following sections.

thumbnail Fig. 1.

Flowchart of the ENIIGMA fitting tool. The light grey dashed box highlights the steps during the spectral decomposition. The light yellow region highlights the Genetic Algorithm (GA) module, whereas the light green region shows the steps performed during the post-processing statistical analysis. The small grey boxes indicate the input data requested by the tool. Small blue boxes are the output data. White boxes indicate the processes performed in each step.

2.1. Input data, continuum determination, and optical depth

The ENIIGMA fitting tool works with three types of input data. The first possibility is to work with observed spectra representing the flux in units of Jy. The second possibility is to use spectra in an optical depth scale, including the silicate features at 9.8 μm and 18 μm. Finally, it is possible to provide the tool with spectra where the silicate features have been removed. The file format and structure must be ASCII (American Standard Code for Information Interchange), with three columns containing the wavelength in micrometres, flux or optical depth, and the errors, respectively.

The accurate analysis of the chemical composition and column densities of ices in star-forming regions depends on the subtraction of continuum SEDs. However, determining an accurate continuum SED for YSOs is challenging because of the large width of the ice and silicate features. Wavelengths shortwards of 5 μm are better constrained because of the lack of ice bands at the K-band and the narrow features of CO2 and CO ices at 4.27 and 4.67 μm, respectively. Some methods for the continuum determination shortwards of 4 μm have been adopted in the literature such as the near-IR excess removal using the Kurucz stellar atmosphere models (e.g. Pontoppidan et al. 2004) and a linear combination of black bodies to fit near- and mid-IR photometric data (e.g. Ishii et al. 2002; Moultaka et al. 2004; Perotti et al. 2020, 2021). Both methods not only provide the flux continuum, but also physical properties such as near-IR extinction or effective temperature.

The continuum above 5 μm on the other hand is much less constrained, even though short-wavelength data are available. The entire range between 5 and 30 μm is dominated by multiple broad and narrow absorption and emission features associated to ices and polycyclic aromatic hydrocarbons (PAHs), respectively (see reviews by Tielens 2008; Boogert et al. 2015). Hydrogenated amorphous carbon (HAC), or more generally a−C(:H), is another class of material observed toward sources at the Galactic centre with multiple features in this spectral range (Chiar et al. 2000; Jones 2012). Toward YSOs, a−C(:H) has been attributed to the absorption bands at 3.47 μm, 6.85 μm, and 7.25 μm (e.g. Gibb et al. 2004; Alata et al. 2014). Nevertheless, the profile at 3.47 μm shows a good correlation with the water ice band at 3 μm, suggesting that this band is associated with volatile chemical species instead (e.g. Brooke et al. 1999; Dartois et al. 2002; Boogert et al. 2015). The strong feature at 6.85 μm observed towards YSOs has also been excluded as a potential carrier of a−C(:H) –as it is observed in a similar wavelength in the diffuse ISM– because of the absence of the strong feature at 3.40 μm (Boogert et al. 2015). The 6.85 μm band has also been attributed to the ion in minerals, but the lack of absorption bands at short or long wavelengths leads us to discard this possibility (e.g. Tielens & Allamandola 1987; Schutte et al. 1996; Boogert et al. 2008, 2015). Finally, the feature at 7.25 μm can be associated with HCOOH (Schutte et al. 1999), ethanol (CH3CH2OH; Öberg et al. 2011; Terwisscha van Scheltinga et al. 2018), and aminomethanol (NH2CH2OH; Bossa et al. 2009). However, Jones (2016) discuss that the other compounds containing C=O bonds (e.g. ketone, organic carbonate) could be present in the grains before or at the onset of ice mantle formation, and therefore the bands at 3.47 μm, 6.85 μm, and 7.25 μm could be consistent with carbonyl functional groups. Because of all of these uncertainties, a low-order polynomial fit has often been used to trace the continuum between the 5 and 32 μm spectral range (e.g. Gibb et al. 2000, 2004; Pontoppidan et al. 2005; Boogert et al. 2008, 2011; Zasowski et al. 2009; Öberg et al. 2011; Noble et al. 2013, 2017).

Given that context, the low-order polynomial and blackbody combination methods are incorporated in the ENIIGMA tool for the continuum determination of the observational flux. In the polynomial approach, the function below is used:

(1)

where ak are the constants, λk is the wavelength, and k is the polynomial order. The second method adopts a sum of blackbodies, given by

(2)

where fi is a scale factor, h is Planck’s constant, c is the light velocity, kB the Boltzmann’s constant, λ the wavelength, and Ti is the temperature of each blackbody.

Once the continuum SED is determined from Eqs. (1) or (2), the observed spectrum () is converted to an optical depth scale using the following equation:

(3)

2.2. Silicate removal

The spectral range between 8 and 25 μm is dominated by strong and weaker silicate absorption features at 9.7 μm and 18 μm, respectively. However, other functional groups associated with simple or complex molecules are also blended in the silicate band at 9.7 μm. Some examples are ammonia (9.01 μm; ν2 umbrella vibrational mode) and methanol (9.74 μm; ν4 stretching vibrational mode of C−O bond) as discussed in Boogert et al. (2008) and Bottinelli et al. (2010). Additionally, ethanol (CH3CH2OH) also shares the C−O stretching mode at 9.51 μm with the silicate broad profile.

Proper removal of the silicate feature is therefore critical, although not straightforward. Different methodologies have been used in the literature, such as the local continuum and silicate spectrum template of sources toward the Galactic centre (e.g. the GCS 3 silicate absorption profile, taken from Kemper et al. 2004). In Boogert et al. (2008), this absorption feature is narrower compared to the observed silicate band of HH 46 IRS. As a consequence, when the GCS 3 absorption band is used for silicate removal, a significant residual is left between 8 and 10 μm. A systematic study by van Breemen et al. (2011) shows that such an IR excess is absent in the sightline of the diffuse ISM, while it is present toward molecular clouds. Although this difference can be associated with ice growth in some cases (e.g. H2O, NH3, and CH3OH; Ossenkopf & Henning 1994; Boogert & Ehrenfreund 2004; Boogert et al. 2008; McClure 2009; Bottinelli et al. 2010), it is also observed toward sources with weak or absent absorption of these molecules at other wavelengths (e.g. SSTc2d_J182835.8+002616; van Breemen et al. 2011). In this regard, such an IR excess is likely due to the chemical composition of the dust, and most specifically the combination of magnesium-rich pyroxene-type (MgSiO3) and amorphous olivine-type (Mg2SiO4) silicates. Further evidence that the silicate band is not associated with only one composition has been found by Poteet et al. (2015), who fitted the silicate profile with a linear combination of amorphous silicates, in particular Mg2SiO4, MgFeSiO4, and MgSiO3. This latter study indicates that although small, variations in the silicate band shape cannot be ruled out. In the same framework, the THEMIS (The Heterogeneous dust Evolution Model for Interstellar Solids) code also uses amorphous olivine-type and pyroxene-type silicates with inclusions of iron compounds and a−C(:H) to interpret the observed absorption bands in the diffuse ISM (Jones et al. 2013). Additionally, the effects of dust evolution from the diffuse ISM to denser regions can be addressed with the THEMIS code (e.g. Köhler et al. 2015; Jones et al. 2017).

The debate about the nature (geometry, size, and composition) of the silicate feature toward Galactic centre sources and protostars is still to be clarified by high-quality data provided by the upcoming James Webb Space Telescope observations. To avoid the uncertainties underlying the silicate feature, the ENIIGMA fitting tool assumes a synthetic silicate profile. This method is similar to the silicate template scaling approach used by Bottinelli et al. (2010). The steps adopted here are the following: (I) the observational GCS 3 silicate profile is scaled to match the bands at 9.7 μm and 18 μm (same as Bottinelli et al. 2010; II) these two silicate bands are decomposed into six Gaussian components (three for each band); and (III) a small variation of the width and height of the six Gaussian components are performed to match the YSO silicate profile. The synthetic silicate is therefore given by the following set of equations:

(4a)

(4b)

(4c)

where G(λ; A, λ0, σ) is the Gaussian function, is the synthetic silicate (ss) optical depth, λ0 are the fixed peak positions of each component, A is the amplitude and σ is the width of the components. We note that in this approach only A and σ are allowed to vary. This method assumes that the silicate width of protostars is substantially caused by the dust chemical composition as shown by van Breemen et al. (2011) and the ice features are not affected.

Figure 2 shows an example of how the synthetic silicate method is applied to an observational spectrum. As shown in the top panel, the GCS 3 silicate profile is scaled to the peak at 9.7 μm of the B1-b spectrum taken from Boogert et al. (2008) (Step I). After steps (II) and (III), the synthetic silicate feature is calculated to match the observational spectrum. However, the band at 18 μm is less deep than the observed profile. In the bottom panel, the residual spectrum of B1-b after silicate removal is shown and compared with the residual spectra taken from Bottinelli et al. (2010). To extract the 9.7 μm silicate feature, these latter authors used the local and template continuum methods. Briefly, the local continuum mimics the silicate feature by a short-range polynomial fit, and in particular the intervals of 8.25−8.75, 9.23−9.37, and 9.98−10.4 μm. Conversely, the template continuum uses empirical profiles separated into three general categories, namely (1) sources with a straight 8 μm wing, (2) sources with a curved 8 μm wing, and (3) sources with a rising 8 μm wing (‘emission’ sources). In the case of the source B1-b, the curved 8 μm wing has been used. As mentioned by Bottinelli et al. (2010), the template method is not suitable to fit the 9.7 μm silicate feature toward all YSOs in their sample, and the local continuum approach was adopted instead. However, the latter method provides narrower bands for the feature at 9 μm – often associated with the NH3 ice – than in the method adopting the silicate template as pointed out by Öberg et al. (2011). From a simple comparison with the naked eye, the synthetic silicate method introduced in this paper can be seen to agree very well with the methods adopted by Bottinelli et al. (2010). In this way, it can be an alternative to the local continuum method when the template profile is not available. It is worth noting than an advantage of the synthetic silicate method is that it provides the silicate extraction of both 9.7 μm and 18 μm bands simultaneously, which allows a better analysis of the H2O libration band at around 13 μm. Despite the challenges, new laboratory data combining ice and dust could provide a better understanding of the link between ice and dust in star-forming regions. To this end, Potapov et al. (2018, 2021) measured spectrum and optical constants of silicate (MgSiO3) and H2O ice, and showed that this experimental data can partially fit the 3 μm of some astronomical objects.

thumbnail Fig. 2.

Illustration of the silicate removal with the synthetic silicate method. Top: B1-b optical depth is shown by the black solid line, whereas the dashed and solid lines are the scaled GCS 3 silicate profile and the synthetic silicate feature, respectively. Bottom: residual optical depth after synthetic silicate removal (black line). The blue and red colours show the residual optical depth after local and template continuum methods, respectively (see text for details; Bottinelli et al. 2010).

2.3. Laboratory ice spectra and databases

In order to build an internal database of experimental data for the ENIIGMA to decompose the spectra of YSOs, a data set of around 100 laboratory ice spectra was compiled from the public online databases listed below:

  • NASA Ames: The Astrophysics & Astrochemistry Lab at NASA Ames Research Center2;

  • Leiden DB: Leiden Ice Database3;

  • UNIVAP: Laboratorio de Astroquimica & Astrobiologia da UNIVAP4.

The ice samples compiled in this paper were recorded in transmittance (T) or absorbance (A) mode using the Fourier Transform Infrared Spectrometer (FTIR). All spectra in transmittance scale were converted to absorbance using the equation A = −log10(T). In the absorbance scale, the ice spectra can be converted to optical depth using the following equation (D’Hendecourt & Allamandola 1986):

(5)

In addition to the compiled data, two IR spectra containing organic residue material (henceforth: residue) at high temperature (> 100 K) were taken from the literature. Residue is produced with or without UV irradiation of the ice mixture samples and subsequent warming up to room temperature (300 K). In particular, the spectra taken from Schutte & Khanna (2003) are HNCO:NH3 heated to 120 K without energetic processing and H2O:CO2:NH3:O2 irradiated with UV and heated to 200 K. Both of these experiments identified the ammonium ion () formation, a likely carrier of the absorption band at 6.85 μm in YSO spectra (Schutte et al. 1996).

Accurate baseline correction of the laboratory data is required to avoid misinterpretations of the carriers. As an example, the band at 5.85 μm that was attributed to HCOOH by Schutte et al. (1996, 1999) was put into question by Boogert et al. (2008) and Öberg et al. (2011) because of unreliable laboratory baselines. Instead, ethanol (CH3CH2OH) was proposed as the carrier by those authors. To avoid this issue, the baselines of the data compiled in this paper were checked and corrected when necessary. Appendix A lists the data sets used in this paper and the reference from which the data were taken. Briefly, these ice data are grouped according to common characteristics, such as (i) pure, (ii) pure and heated, (iii) heated mixtures, (iv) irradiated mixtures, and (v) residue.

2.4. Genetic algorithm module

Inspired by the theory of natural selection and biological evolution, the genetic algorithms (GA) aim to provide a global optimisation process for a proposed problem. For clarification, the GA nomenclature is briefly explained: (i) gene corresponds to the value of a parameter (e.g. scaling factor of the ice spectrum to match the observation), (ii) chromosome is a set of values proposed as a solution of the parameters, (iii) fitness function is the criteria to evaluate the fit (e.g. squared residual, reduced χ2), selection is the election of the best parameters from the entire set of potential solutions (chromosome) used to generate improved values, (iv) crossover is the mixing of values from two distinct solutions in order to create a new solution, (v) mutation is a small variation of one or more parameters (gene), (vi) global minimum is the optimal solution of the parameters, and (vii) generation corresponds to an interaction where potential solutions are tested.

The GA module implemented in the ENIIGMA fitting tool is built on Pyevolve5 (Perone 2009), an open-source and extendible library dedicated to performing evolutionary computation in Python programming language. The GA computation is composed of three main characteristics, namely generation of a random population of probable solutions, fitness-oriented to evaluate the population, and variation-driven to improve the next population (Holland 1975; Koza 1992). The population follows the chromosome-like structure, in which the vector wijRm×n is given by:

(6)

where each variable w is called gene, and the rows are the combination of genes that provide a solution. The quality of the population is evaluated by a fitness function (F) – the squared residual as shown below:

(7)

where the optical depth of the observational spectrum () was defined in Eq. (3), is the optical depth of the laboratory data, wj is the scale factor (the genes in the chromosome; Eq. (6)), and is the error of optical depth propagated from the error in the flux.

After checking the random population with the fitness function in the first iteration, the genetic operators are used to improve the population in the subsequent generations. Selection is the first operator and aims to guarantee the survival of the best genes (coefficients) from one generation to another. One of the selection methods adopted in this paper is called the ‘roulette wheel’ and elects the best individuals based on their probability of providing good solutions, which is defined as:

(8)

where Fi is the fitness of individual i in the population, and N is the number of individuals in the population. The second selection method is called ‘tournament’, and randomly selects a fixed number of individuals from the entire population. The ‘winner’ of the tournament, that is, the coefficient with the highest fitness, is passed to the next generations. In comparison with the roulette wheel method, the tournament selection has a lower computational cost. Moreover, the roulette wheel method suffers from the problem of premature convergence if there are individuals with a high probability to be selected, whereas the selection pressure in the tournament selection is constant and maintains the diversity of the population (Blickle & Thiele 1996).

The best individuals passed from one generation to another are elected to the crossover operation. Formally, if Eq. (6) is given for only two chromosomes of two genes, that is, W1 = [w11, w12] and W2 = [w21, w22], the single-point crossover operator is given by HW1, W2, which will allow the elected genes to exchange positions with one another. Depending on the complexity of the problem, for example, a linear combination of more than four components, two-point or multi-point crossovers can also be used in the ENIIGMA. The crossover rate is defined as 80% in Pyevolve and provides good solutions for many problems in the literature (e.g. Peng et al. 2003; Baier et al. 2010). However, a crossover rate of 90% provided better results in the present study when the tournament selection method was adopted. The mutation operator is used next, and consists of applying a small perturbation ζ in one or more genes (scale factor), formally expressed as . In particular, a Gaussian mutation is adopted with a mutation rate of 10% in this paper. Compared with other mutation methods, a Gaussian distribution of the genes was found to be superior to other approaches (Hinterding 1995). Once the genetic operators are applied, the fitness function evaluates how close the new genes are to the solution. This process is repeated a number of times until the convergence criteria are reached, which in this paper is the lowest values of the fitness function after all generations.

Although genetic algorithms are one of the simplest random-based evolutionary algorithms (EAs), it is not less robust than other EAs or even other conventional optimisation methods. As such, it has been used to solve complex problems in astrophysics such as the huge degeneracy behind the physical parameters of protoplanetary discs and the silicate composition in the mid-IR (e.g. Charbonneau 1995; Hetem & Gregorio-Hetem 2007; Baier et al. 2010; Woitke et al. 2019).

2.5. Searching for the best solution

The ENIIGMA fitting tool requires an initial guess (G) composed of four laboratory spectra to start searching for the best combination that minimises the residual in Eq. (7). At this initial stage, the GA module is applied to fit G to the observational optical depth , and the residual F1 is calculated. Next, each laboratory data sample is added to the initial guess one at a time in a numerical loop, and another combination is used to fit , which leads to another residual calculation, F2. All the laboratory data providing F2 <  F1 are stored, and after this stage, the GA module is applied to combine them into groups of eight or more genes, which results in the global minimum. It is worth noting that during the search for the best solution, the information from the initial guess might be lost, because other ice spectra can provide better solutions. In this sense, the ENIIGMA fitting tool does not target a specific molecule or ice sample, but rather it searches for the best linear combination from a large database performing an unbiased search for the global minimum solution.

The population (W) and the generation numbers are two of the most important parameters in GAs. If these two numbers are too small, the GA module cannot find a proper solution, because the variability of the genes is essential in the context of EAs. High numbers on the other hand will provide good solutions, but the computation limit must also be taken into account. Following previous works employing GAs (e.g. Szkody et al. 2010; Harrison & Hamilton 2015), the ENIIGMA fitting tool uses the population-to-generation ratio of 90/100 or 150/200, which allows it to search the global minimum parameters from a large population whilst avoiding premature convergence.

2.6. Ice column density

The optical depths obtained from the GA optimisation are used to calculate the ice column densities following the equation:

(9)

where 𝒜 is the band strength of a specific vibrational mode. If the ice is composed of only one molecule (pure ice), the absorption features themselves are used to calculate the column density. However, in the opposite case, blended features are avoided, and the isolated vibrational modes are used instead. If blended profiles cannot be avoided, a Gaussian or Lorentzian decomposition is applied to separate the components. Table 1 lists the band strengths of molecules present in the ENIIGMA database used to calculate the ice column densities.

Table 1.

Ice features and band strengths of the identified molecules during the spectral decomposition.

2.7. Statistical analysis

The heuristic nature of GA optimisation makes the final result unpredictable, although one expects that the global minimum is always found. In addition, the decomposition of the IR ice features is a source of huge degeneracy itself on top of the fitting procedure. Given these two aspects, a statistical analysis becomes necessary to evaluate how good the solutions are after an enormous number of combinations of laboratory ice data. For example, if the GA module is applied to combine 13 ice spectra in groups of 8 and without repetition, there are 1287 different solutions for the same problem. The aim of the post-processing statistical analysis is to check how close the other combinations of the optimal solution are. In this way, both the heuristic aspect and degeneracy itself can be evaluated as a whole.

The statistical analysis is based on the minimum Δχ2 method (Avni & Bahcall 1980), given by the following equations:

(10a)

(10b)

where F is the squared residual in Eq. (7) and σ the standard deviation taken from the observational optical depth; ν and α are the number of free parameters and the statistical significance, respectively. corresponds to the goodness-of-fit in the global minimum solution. One can observe, that confidence regions do not depend on the accuracy of the fit, but the number of free parameters. Equations (10a)a,b are calculated from a normal variation around the optimal scale factors, w, by using a probability distribution p(x) available in the numpy.random.normal numpy (Harris et al. 2020) routine:

(11)

where the mean, μ, is the optimal scale factor itself, and σ is the standard deviation taken from the observational optical depth. From this analysis, the confidence intervals of the linear combination coefficients are calculated with a significance of 3σ.

Column density error bars. The uncertainty in the column densities is calculated from the confidence intervals derived with the minimum χ2 method. In this regard, the lower and upper scale parameters are used to calculate the minimum and maximum optical depths. The ice column density calculated from these limits provides the bounds for the error bars of each molecule found during the spectral decomposition. It must be stressed that in this method the error bar depends on the number of parameters in the fit instead of the covariance matrix in non-heuristic methods (e.g. non-linear least squares problems). For example, for a fit with two components, the 1−3σ confidence intervals are obtained for Δχ2 equal to 2.41, 4.61, and 9.21, respectively. On the other hand, if eight laboratory data samples are used, the confidence intervals become Δχ2 equal to 9.52, 13.36, and 20.09, respectively.

Recurrence plots. The degeneracy of the spectral decomposition is quantified with a recurrence plot analysis. All solutions found by the ENIIGMA are sorted into increasing order by the χ2 value of the fit. Next, Eq. (10a)b is used to calculate Δχ2 and define the bounds for the 3σ confidence interval. All solutions outside this level of significance are excluded from the analysis. Finally, the frequency of each ice sample is calculated, and the recurrence is derived using the following equation:

(12)

where fi is the absolute frequency of sample i and S is the sum of all solutions. The recurrence of the ice data is shown as a percentage, and 100% means that a given sample was found in all solutions, indicating that such a component is required for the spectral decomposition. Lower percentage values mean that the corresponding ice sample can be replaced by another data sample and still provide a solution with high statistical significance.

Histograms. Because of the degeneracy involved in the fit of observational YSO spectra, ice column densities derived from local minimum solutions might deviate from values calculated from the optimal combination. In this regard, the column density variation of the ice components shown in the Recurrence plot is estimated from a histograms analysis. The bin sizes are proportional to the column density variances and were calculated by the Freedman Diaconis Estimator taken from the numpy.histogram_bin_edges Python package. Additionally, the histograms are log-transformed and normalised by the median ice column density. With this analysis, both the mean and 3σ confidence interval for the column densities are calculated (see also Appendix B).

3. Accuracy tests

In this section, we address the capability of the ENIIGMA fitting tool to provide accurate solutions. For these tests, known ice samples present in the database (see Appendix A) are used as input data to check the accuracy of the tool to search for the correct sample and fractionation.

Depending on the complexity of the problem, the method to find the global minimum solution with genetic algorithms may require more robust genetic operators. In this section, three methods were adopted to tackle the proposed problems, and their parameters are shown in Table 2. The population size and generation number are important input values. If lower values are adopted for these two parameters, the solution space might not be entirely covered. In addition, two selection methods and crossover rates were adopted depending on the population size.

Table 2.

Laboratory data of pure and thermally processed ices used in this paper.

3.1. Identification of known samples

The ice spectra used to test the ENIIGMA tool with known ice samples were taken from the public databases shown in Sect. 2.3, and are listed in Tables A.1 and A.4.

Three tests were performed in this section, namely (i) non-blind: the solution is indicated in the initial guess and is also present in the database; (ii) fully blind: the solution is not included in the initial guess but it is available in the database, and (iii) solution not available: the correct sample is deliberately removed from the database and not chosen as initial guess. In all cases, the three searching methods in Table 2 provided the expected global minimum solution.

Pure ice. The test with the pure ice sample is performed with H2O (Hudgins et al. 1993) and ethanol (CH3CH2OH; Terwisscha van Scheltinga et al. 2018) ice at 15 K. Water ice is selected because it is the most abundant molecule in the solid phase observed towards YSOs. On the other hand, ethanol was selected because of the multiple narrow features in the IR spectrum, and because it has some common absorption features with methanol (CH3OH) ice, which is also observed in star-forming regions. Figures 3 and 4 show the results of the three tests. The non-blind and blind tests in the left and middle panels indicate that the ENIIGMA tool can identify the correct ice spectra associated with pure water and ethanol. The residuals shown in the bottom boxes are below 5% and are the result of the intrinsic randomness behind the genetic optimisation. It is worth noting that the difference in the residuals in tests 1 and 2 are not associated to the presence or absence of the correct ice sample in the initial guess, but rather are related to the randomness of the method itself. The right panel shows the fit when the correct solution is not available in the database. In that case, the ENIIGMA tool fits the input spectra with the most similar data sample available in the database, that is, H2O at 40 K and CH3CH2OH at 30 K, respectively. As a result, the residuals in some wavelengths are higher than in tests 1 and 2.

thumbnail Fig. 3.

ENIIGMA fit (dashed green line) of the pure H2O ice sample at 15 K (black line). Left and middle panels: fit in non-blind and blind tests, respectively. Right panel: fit when the correct solution does not exist. The residuals of the fit are shown in the boxes below the fits.

thumbnail Fig. 4.

Same as Fig. 3, but for pure CH3CH2OH at 15 K.

Binary ice mixture. A mixture dominated by H2O (95%) ice containing a fraction of ethanol (5%) at 15 K is used in the test with the binary ice sample. Despite the small fraction, ethanol clearly shows features at around 9 μm and 11 μm. The fitting results are shown in Fig. 5. The left and middle panels showing the fully sighted and fully blinded tests indicate that the correct ice spectra were assigned with residuals lower than 5%. In the case where the correct solution is not available (right panel), the H2O:CH3CH2OH at 30 K was selected by the fitting routine. As in the pure ice case, this fit indicates that the ENIIGMA tool is able to select the most similar ice sample to fit the input spectrum. Because of the small differences of this ice mixture at 15 K and 30 K, the residual is around 5%.

thumbnail Fig. 5.

Same as Fig. 3, but for H2O:CH3CH2OH (20:1) at 15 K.

Ternary ice mixture. The third test is performed with a tertiary ice mixture composed by CO:CH3OH:CH3CH2OH (20:20:1). As seen in Fig. 6, the fits were also accurate for the fully sighted and blinded tests (left and right panels). The residual in the middle panel is higher than in the left panel, clearly showing the random nature of the method. The fit in the right panel was performed by combining two ice samples, namely CO:CH3CH2OH at 30 K and CH3CH2OH at 30 K. Because of the prominent CO feature in the input spectrum not present in the ethanol ice, the code combined the two samples in order to provide a good fit. The pure CO ice was not used in this fit because its full width at half maximum is 30% narrower than in the mixture with ethanol.

thumbnail Fig. 6.

Same as Fig. 3, but for CO:CH3OH:CH3CH2OH (20:20:1) at 15 K.

Quaternary ice mixture. The final test for the identification of known samples is performed with H2O:CH3OH:CO:NH3 (100:50:1:1) at 10 K, where both water and methanol features are prominent. Left and middle panels of Fig. 7 show the results from the non-blind and fully blind tests, respectively. Again, the fits result in residuals of lower than 5%. Slightly higher residuals are seen in the right panel, where the solution is not available in the database. In that case, the ice sample composed by the sample molecules at 40 K was provided with the fit. As in the case of the tertiary mixture, this result indicates that a combination of the pure molecules does not result in a good fit. For example, Dawes et al. (2016) show that the peak position of the O−H vibrational at around 3.0 μm is redshifted in a mixture of H2O:CH3OH compared to the pure H2O. Nevertheless, the temperature difference of this mixture at 10 and 40 K is evident in the residuals. At this temperature, most of the CO ice has been thermally desorbed (Collings et al. 2004; Acharyya et al. 2007), whereas a small fraction trapped in the porous ice has enough energy to migrate over active sites.

thumbnail Fig. 7.

Same as Fig. 3, but for H2O:CO:NH3:CH3OH at 10 K.

3.2. Fractionation

As water represents the major component of astrophysical ices observed toward YSOs, in this section we aim to check the capability of the ENIIGMA tool in finding the correct coefficients in a linear combination of H2O ice and another component. In the tests shown in Sects. 3.2.13.2.3, two IR ice spectra were summed with different coefficients, and Gaussian noise of standard deviation equal to 0.01 has been added to the data to resemble the noise present in the astronomical observations. Although the tests in Sects. 3.2.1 and 3.2.2 use water ice spectra at two different temperatures, the aim is to discuss the fractionation of constructed mixtures with similar spectral bands, rather than thermal processing. The searching Methods 1−3 in Table 2 provided the expected global minimum solutions. In the test shown in Sect. 3.2.5, eight IR ice spectra were considered to mimic the ice absorption features observed toward protostars. In this case, only Methods 2 and 3 provided the best fits.

3.2.1. H2O (15 K) and H2O (40 K)

Infrared spectra of pure water at 15 K and 40 K were summed with proportions of 50:50 and 90:10, respectively. These two ice data samples were selected because they have small differences in the band shapes and thus represent a challenging task for fitting methods. Figure 8 shows the test results for these two H2O ice spectra. The top left panel shows the good match between the ENIIGMA fit and combined H2O ice data with proportions of 50:50. The residual plot in the bottom part of this figure shows that only the Gaussian residual noise is left after subtracting the model from the data. The top right panel shows a Δχ2 map, where the confidence intervals (1−3σ) are indicated by the green, yellow, and red contours, respectively. As we note from this degeneracy analysis, the increasing H2O proportion at 15 K with simultaneous decreasing H2O proportion at (40 K) also provides statistically significant results inside the confidence intervals.

thumbnail Fig. 8.

Spectral decomposition and confidence interval analysis of the ENIIGMA results. Left panels: spectral decomposition of the combined water ice spectra (black solid line) with proportions of 50:50 and 90:10. The components of the decomposition are given by the red and blue dotted lines, whereas the ENIIGMA fit is given by the green dashed line. The residual of the fit is shown in the bottom parts of these graphs. Right panels: confidence interval analysis of the coefficients in the linear combination. The grey-scale colours are the difference between chi-square and minimum chi-square values; the olive, yellow, and red contours are 1σ, 2σ, and 3σ confidence intervals, respectively.

The two bottom panels of Fig. 8 show the same test for the proportion of 90:10 at 15 K and 40 K, respectively. The spectral decomposition shown in the left panel also shows a good match between the ENIIGMA fit and the combined H2O ice data, with residuals of lower than 5%. In this test, the Δχ2 map shown in the right panel indicates that a solution considering only H2O (15 K) is also statistically possible when the fraction of H2O (40 K) is lower than 10%. The same solution is still found by the ENIIGMA tool with the σnoise equal to 0.02. Above that limit, the ENIIGMA is unable to find the H2O (40 K) component.

For the case where the H2O ice proportions are 90:10, a population evolution analysis is shown in Fig. 9. At the initial generations, most of the population values result in a high fitness score (white region) and therefore the optimal coefficients have not yet been found. As the optimisation evolves, the population is improved and better coefficients are generated because of the survival of the best solution. Close to generation 85, most of the population results in a low fitness score, which increases the probability of finding the global minimum solution of the problem. The fitness score evolution over the generations is shown in the bottom panel of Fig. 9. As noted in the top panel, the fitness score at the initial generations is high for any value in the population. With the improvement of the population, the fit is improved until the optimal solution is found between generations 80 and 85, with . As the population has not converged in this problem, the fitness score starts increasing again until generation 100, which is the stopping criteria in this problem.

thumbnail Fig. 9.

Top: heat map of the fitness score for each population over the generations. Bottom: fitness score over the generations. The blue and red colours indicate the best and worst fitness based on the population of each generation. The grey shaded area between the two lines shows the difference between the worst and best fitness.

3.2.2. H2O (15 K) and H2O (75 K)

A second check with water ice samples at (15 K) and (75 K) was also made. At high temperatures the water band shapes are significantly different from those in the cases of 15 K and 40 K because of the transition from amorphous to cubic crystalline ice (Hudgins et al. 1993). In particular, the O−H band at 3 μm is gradually sharpened until the full crystalline ice is formed between 100 and 140 K (Sack & Baragiola 1993). As shown in the left panels of Fig. 10, the ENIIGMA tool is able to identify the correct proportions of the two samples in the cases of 50:50 and 10:90, respectively. This is also indicated by the low fit residuals of around 5% in both situations.

thumbnail Fig. 10.

Same as Fig. 8, but for H2O at 15 K and 75 K.

The confidence intervals of the optimal solution are shown in the right panels of Fig. 10. In these plots, the reduced area of the contours compared to Fig. 8 indicates that the solution is less degenerated. The reason is the differences of the band shapes of the cold and warm H2O ice components. In the case of 50:50 proportion shown in the top right panel, the 3σ confidence interval of both coefficients is around 20%. On the other hand, in the bottom right panel, the degeneracy of the 10:90 proportion is 50% and 10%, respectively, which indicates that the H2O (75 K) component is better constrained than the H2O (15 K) component.

After the global minimum solution is found, the degeneracy analysis of the ice composition is performed for the case with proportions of 10:90. Figures 11a–c show the IR spectra of H2O ice at 15, 40, and 75 K. We note here that the spectral shape of the bands is similar between the samples at 15 and 40 K, and significantly different at 75 K because of the crystallisation of the ice matrix via warm-up. An initial test has been done with the sum of the two H2O ice samples at 15 and 75 K without Gaussian noise and the recurrence plot was created. The recurrence plot shown in panel d indicates that no degeneracy exists inside the 3σ confidence interval when the spectrum is free of noise. On the other hand, if Gaussian noise is added to the summed spectrum, the degeneracy becomes evident. The dominant ice component, H2O at 75 K, is still present in all solutions inside the same confidence interval, whereas the recurrence of the water component at 15 K is reduced to 50% because of its lower contribution. The presence of other samples in this recurrence analysis indicates that the H2O (15 K) can be replaced by one of those data samples and a good solution inside 3σ confidence interval is still provided.

thumbnail Fig. 11.

Degeneracy analysis for the combination of H2O (15 K) + H2O (75 K): 10:90. Panels a–c: stretching, bending, and librational modes of H2O ice, respectively, at 15, 40, and 75 K. These panels highlight the temperature effect on the band shapes in the water ice IR spectra. Panel d: recurrence pie chart for the fit of the combined data H2O(15 K):H2O(75 K) with proportions 10:90 and without adding Gaussian noise to the spectrum. Panel e: same as panel d, but after adding Gaussian noise to the spectrum (σ = 0.01).

3.2.3. H2O (15 K) and CO:CH3OH:CH3CH2OH (15 K)

In this test, the combination of H2O and the mixture CO:CH3OH:CH3CH2OH (20:20:1) at 15 K is used. In general, these two ice data samples have different band peaks and shapes in the IR spectrum, although the O−H stretching modes of H2O and CH3OH overlaps around 3 μm. Another difference is that the samples do not have the same ice thickness. Figure 12 shows the fits of the proportion 50:50. Because of the thickness difference, the water ice has a major contribution to the band at 3 μm compared to methanol, whereas at 4.67 μm and 8.86 μm the presence of the mixture sample becomes evident. The residuals of the fit are lower than 5% and indicate a good match between fit and combined data. The confidence interval analysis of the coefficients shown in the top right panel indicates that the water and mixture components vary by about 20% and 40%, respectively, inside 3σ.

thumbnail Fig. 12.

Same as Fig. 8, but for H2O and CO:CH3CH2OH at 15 K.

The bottom left panel shows the fit of the proportions 90:10 for the water and mixture samples, respectively. In this case, the contribution of methanol at 3 μm is minimal, although it is still visible around 9 μm. As seen from the residuals of the fit, the decomposition process was also able to identify the correct coefficients of the two samples. However, because of the small contribution of the mixture, the confidence interval analysis (bottom right panel) indicates that solutions considering only pure water are statistically significant as well.

3.2.4. Spurious features

To test the effect of spurious bands that could be present in a real observational spectrum, we added emission and absorption features of non-ice compounds to the constructed mixture shown in Sect. 3.2.3. The emission features mimic the hydrogen lines at 3.04 μm (HI Pfϵ 5−10), 3.30 μm (HI Pfδ 5−9), and 3.74 μm (HI Pfγ 5−8). These lines are seen in spectra observed with ground-based telescopes when the standard star is not properly modelled (e.g. Pontoppidan et al. 2004; Thi et al. 2006). These emission profiles were created with the Python package pysynphot6 (STScI Development Team 2013) by adopting a full width at half maximum of 50 Å. For the absorption feature, we use the naphthalene (C10H8) spectrum taken from Hudgins & Sandford (1998), which is the simplest polycyclic aromatic hydrocarbon containing two rings. Naphthalene has multiple bands between 3.20 μm and 3.44 μm, where features of methanol and ethanol are also present.

Figure 13 displays the spectral decomposition of a constructed mixture containing ice bands (H2O and CO:CH3OH:CH3CH2OH) and spurious features. Once these emission and absorption features are located shortwards of 4 μm, the ENIIGMA fit is performed between 2.5 μm and 4.0 μm. This test shows that the ENIIGMA fitting tool is able to correctly identify the ice bands and can also fit the narrow features associated with spurious absorption and emission lines. In this example, the spectral profile between 3.2 μm and 3.5 μm is the most affected with spurious features, but those features do not change the shape of the ice bands. We highlight that in real observations, other spurious features might occur in the spectrum due to poor continuum subtraction, as well as the effect of spectral glitches and baseline jumps. In these cases, a broad band fit of the spectrum covering other vibrational modes of the same molecule could minimise the effect of spurious features in the spectral decomposition with the ENIIGMA package.

thumbnail Fig. 13.

Spectral decomposition of the H2O and CO:CH3OH:CH3CH2OH constructed mixture containing spurious features mimicked with hydrogen lines and naphthalene bands.

3.2.5. Synthetic ice spectrum

Multiple ice features have been observed in the IR spectra of protostars (e.g. Gibb et al. 2004; Zasowski et al. 2009). For example, a linear combination of five empirical components associated to more than one carrier have been adopted to decompose the short spectral range between 5 and 8 μm (Boogert et al. 2008). In this test, a synthetic ice spectrum is created based on the median ice composition derived by Öberg et al. (2011) towards low-mass protostars, namely H2O:CO:CO2:CH3OH:NH3:CH4:XCN (1:0.29:0.29:0.03:0.05:0.05:0.003). However, the chemical species XCN is replaced by 2% CH3CHO and 1% CH3CH2OH with respect to the water ice to check the feasibility of identifying these molecules with the ENIIGMA tool. The synthetic spectrum shown in Fig. 14a is created with the linear combination of IR spectra of eight pure ices (temperatures: 10−15 K) listed in Table A.1 and the silicate spectrum from GCS 3 taken from Kemper et al. (2004). Also, a Gaussian noise of σ equal to 0.01 is added to the spectrum. This highlights that a real spectrum might have such strong silicate features, and that they must be subtracted before spectral decomposition with the ENIIGMA tool. The procedure of silicate removal in an observed spectrum is shown in Sects. 2.2 and 4.1.

thumbnail Fig. 14.

Spectral decomposition of the synthetic ice spectrum composed of eight components of pure ice samples (black line) and Gaussian noise (σ = 0.01). Panel a: synthetic spectrum along with the silicate band from 2.5 − 20 μm. Panel b: synthetic spectrum without the silicate feature and residual plot. The ENIIGMA fit is shown by the dashed green line and the ice components by the other colours. Panels c–e: small spectral ranges highlighting the contribution of shallow bands.

The initial guess adopted in this test is made of H2O (40 K), NH3 (10 K), and CH3OH (75 K). Figures 14b–e show the result of the spectral decomposition. Although the expected global minimum solution was found by the three methods in Table 2, Method 1 provided less accurate coefficients by about 4−7%. In the top panel, the fit is shown for the range between 2.5 and 20 μm. At this scale, only the major features are seen, such as H2O (3 μm, 6 μm, 13 μm), CO2 (4.27 μm, 15.53 μm), CO (4.67 μm), and CH4 (7.673 μm). The residual plot shows that only the Gaussian noise is left after subtracting the data from the model, which indicates that both the samples and the coefficients were found by the ENIIGMA fitting tool. The bottom panels show a zoom onto the regions where the weak features are detected. The left panel highlights the C−H stretching mode at 3.53 μm. The middle panel shows the contribution of CH3CHO at 5.8 μm and 7.4 μm, the contribution of NH3 at 6.2 μm, and the contribution of CH3OH at 6.8 μm. The right panel shows the dominant absorption features of NH3 at 9.35 μm and CH3OH at 9.8 μm, with small contributions of CH3CHO at 8.9 μm and the CH3CH2OH double peaks at 9.2 μm and 9.5 μm.

In summary, these tests show that ENIIGMA can find the correct components and linear combination coefficients in the combined spectral data. However, Methods 2 and 3 provide better results than Method 1 when the number of components is large.

4. Application to infrared spectra of the Class I protostar Elias 29

To test the ENIIGMA fitting tool with a real source, a spectral analysis of the well-known Class I YSO Elias 29 was carried out. This object is located in the Ophiuchus cloud at an estimated distance of 135 pc (Ortiz-León et al. 2018). Elias 29 is the brightest source in the ρ Oph E core and was observed in broad IR spectral range by the Infrared Space Observatory. Multiple absorption bands were securely and tentatively detected in the IR spectrum of this protostar (Gibb et al. 2004; Boogert et al. 2000; Rocha & Pilling 2015). Millimeter observations toward Elias 29 show high abundances of simple molecules (e.g. CO, H2CO, HCO+, SO, SO2; Boogert et al. 2002; Oya et al. 2019) and a deficiency in complex molecules (e.g. CH3OH, HCOOCH3, CH3OCH3; Artur de la Villarmois et al. 2019; Oya et al. 2019). In particular, Oya et al. (2019) attribute the lack of organic molecules and richness in SO and SO2 to the evolved nature of the source or the enhancement of the dust temperature above the CO freeze-out regime (> 20 K) as suggested by Rocha & Pilling (2018) based on radiative transfer models. Artur de la Villarmois et al. (2019) suggest that the non-detection of warm CH3OH toward Elias 29 and other sources in the Ophiuchus molecular cloud is due to the absence of a hot-core-like region in the inner envelope close to the protostar or the due to higher temperatures in the precursor environments, as also discussed in the case of Corona Australis (Lindberg et al. 2014).

4.1. Continuum and optical depth calculation, and silicate feature extraction

In the present paper, the continuum SED of Elias 29 was determined with a blackbody function shortwards of 4.2 μm and a low-order polynomial function between 4.1 μm and 30 μm. In the former case, the spectrum between the ranges 2.5−2.8 μm and 3.9−4.2 μm is free of ice features and was therefore used to constrain the fitting. In this case, a temperature of 717 K provided the best match with the observations. Longwards of 4.1 μm, a third-order polynomial fitting passing through spectral ranges without ice features (4.1−4.25 μm, 4.3−4.5 μm, 5.3−5.5 μm and 25 − 30 μm) has been adopted. Pontoppidan et al. (2005) mention that although this method has some uncertainty involved, the shapes of the bands are not affected, even though the total optical depth is uncertain by up to 20%. The top panel of Fig. 15 shows the result of this fit and other calculations in the literature. The fits from Boogert et al. (2000) and Rocha & Pilling (2015) agree better with the continuum SED calculated in the present paper, whereas the fit from Boogert et al. (2002) does not match the observations.

thumbnail Fig. 15.

Continuum determination and optical depth calculation of the Elias 29 spectrum. Top: blue and red dashed lines show the blackbody and polynomial continuum calculation used in this paper, respectively. The grey dotted line shows the continuum determination from Boogert et al. (2000), which is similar to the continuum in the present paper. The dotted magenta line is the continuum calculation with radiative transfer modelling from Boogert et al. (2002). The green solid line is the continuum calculation with radiative transfer modelling from Rocha & Pilling (2015). In both cases, the silicate feature is redshifted in the models compared with the observations. Middle: grey line shows the optical depth scale of Elias 29 spectrum adopting the continuum from Boogert et al. (2000), whereas the blue and red lines show the same by respectively adopting blackbody and polynomial continuum derived in this paper. Bottom: difference between optical depth estimates from this paper and those in Boogert et al. (2000).

Once the continuum SED is determined, the observational optical depth is calculated with Eq. (3), and the result is shown in the middle panel of Fig. 15. The continuum methods used in this paper and Boogert et al. (2000) result in similar optical depths. The results from the two methods are shown in the bottom panel of Fig. 15 and differ by less than 10% in the range 2.5 − 30 μm. A similar variation is reported by van Breemen et al. (2011) for the band at 9.8 μm, when different continuum models are assumed for the objects StRs 164 and SSTc2d_J163346.2−242753. Most importantly, these latter authors concluded that the shape of the 9.8 μm bands remains virtually unaffected, which is in agreement with the result shown in this work. On the other hand, the effects of alterations in the shape of the silicate band at 18 μm are much less constrained.

The silicate absorption in the spectrum of Elias 29 was removed following the procedure described in Sect. 2.2. From the top panel of Fig. 16, we see that the GCS 3 silicate feature closely resembles the absorption band seen toward Elias 29, suggesting they could have similar mineralogy. In this particular case, the modification of the 9.7 μm band shape is minimal as can be noted in the result of the silicate extraction in the bottom panel of Fig. 16. The major difference between the two methods is only the absorption excess at 8.8 μm, which is discussed in Sect. 2.2. With the synthetic silicate band, this leftover residual is subtracted. Figure 16a also shows an absorption excess in the range 11 − 18 μm between the broad silicate bands at 9.8 μm and 18 μm that is clearly seen in the spectrum where the silicate feature is subtracted (Fig. 16b). Because of the uncertainties in the continuum and in the silicate band profile at 18 μm, some effect could be added to this band with silicate subtraction. However, Boogert et al. (2008) and Zasowski et al. (2009) convincingly show that this entire feature is associated with the H2O ice libration mode.

thumbnail Fig. 16.

Silicate extraction procedure in the Elias 29 spectrum. Top: black line shows the optical depth scale of the Elias 29 spectrum calculated in this paper. Red and blue lines show, respectively, the scaled CGS 3 and synthetic silicate bands at 9.7 μm and 18 μm. Bottom: red and blues lines show the Elias 29 mid-IR optical depth after silicate band extractions with scaled CGS 3 and synthetic silicate profiles, respectively.

4.2. Spectral decomposition

The Methods 1−3 in Table 2 were used to decompose the IR spectrum of Elias 29. The initial guess adopted in this study is composed of the pure ice samples H2O, CO, CO2, and NH3. After selecting the best candidates for the solution, the entire sample is combined into groups of seven components. The best global minimum was found in Method 3, and the result is shown in Figs. 17a–f. In panel a, the spectral decomposition in the range 2.5 − 20 μm highlights the fitting of the most prominent bands, whereas the panels b–f show a zoom into small spectral ranges and the contributions of shallow features. The spectral features of a−C(:H) taken from Alata et al. (2014) are also shown in these panels to illustrate potential contributions of carbonaceous grains as discussed by Jones (2012, 2016).

thumbnail Fig. 17.

Spectral decomposition of the mid-IR spectrum of Elias 29 protostar. a: optical depth scale of the observed spectrum between 2.5 and 20 μm, and the seven ice components used to fit the observed spectrum. The best fit is shown by the dashed green line. The residual spectrum is shown below this panel. b–f: zoom onto short spectral ranges highlighting the contribution of small features. In these plots, the spectral features of amorphous carbon, a−C(:H), that were not used in the spectral decomposition are shown by the dashed grey lines above the Elias 29 spectrum and the ice data components. This illustrates a potential contribution of features other than vibrational modes of ice species. Panel b: L-band red wing absorption excess that is not entirely reproduced by the ice components (see Sect. 4.2). Panels c and d: CO2 and CO features. Panel e: important contributions of UV and cosmic-ray processed samples. Finally, panel f highlights the contribution of the broad NH3 ice feature at 9 μm, CH3CH2OH at 9.6 μm, and CH3OH at 9.8 μm.

The global minimum solution in the Elias 29 spectral analysis is given by the combination of pure and mixed ice samples. The dominant component in this broadband analysis is H2O:NH3:CO2:CH4 (10:1:1:1) processed by heavy ions at 35 K, that simulates the effect of cosmic rays in the ice mantle. This sample has significant contributions at 3 μm, 6 μm, 11 − 18 μm, and 15 μm. The second prominent component is CO2 with vibrational modes at 4.27 μm and 15 μm. Pure and mixed H2O components also contribute about 20% of the remaining absorption at 3 μm. The shallow features observed in this spectral decomposition are CO, NH3, CH4, and CH3CH2OH.

An old-standing problem in the analysis of mid-IR spectra of YSOs and background sources is that the band at 6 μm is deeper than what is expected from the 3 μm and 13 μm H2O ice bands (e.g. Cox 1989; Gibb et al. 2000; Boogert et al. 2008, 2011; Bottinelli et al. 2010). A potential explanation for the absorption excess at 6 μm is the presence of organic refractory material (Gibb & Whittet 2002), although this hypothesis lacks independent spectroscopic confirmation (Boogert et al. 2015). Another open question in this context is the correct carrier of the band at 6.85 μm. Although the most likely candidate is (Schutte & Khanna 2003), a convincing profile fit is lacking, as pointed out by Boogert et al. (2015).

In the present paper, the global optimisation with the ENIIGMA tool provided a good fit of the absorption bands observed in the Elias 29 spectrum in the range 2.5 − 20 μm. The feature at 3 μm associated with the O−H vibrational mode is fitted by multiple ice samples containing water. However, there is an absorption excess in the Elias 29 spectrum that is not fitted with the ice samples in this case (Fig. 17b). One of the explanations for this excess is the absorption caused by ammonia hydrates (e.g. Knacke et al. 1982; Dartois & d’Hendecourt 2001). Another cause is attributed to the light scattering due to large grains (Smith et al. 1989). Jones (2016) also suggest that compounds containing the carbonyl functional group (C=O) present in the dust before or during the formation of the ice mantle could be a potential explanation for this absorption excess. Figure 17b shows that a−C(:H) could contribute to the large absorption excess around 3.5 μm, although the absorption features at 6.85 μm and 7.24 μm lack a visual match with the observations (Fig. 17e). Because of the low spectral resolution, this does not rule out the presence of a small fraction of a−C(:H) in the dust grains toward Elias 29. However, Boogert et al. (2000) argue that for the case of Elias 29, the scattered light due to large grains is the most likely explanation because of the close match of this red wing obtained with large icy grains.

The CO2 stretching component at 4.27 μm (Fig. 17c) is found to be a combination of pure and mixed CO2. However, the peak optical depth in the fit does not match with the observations because it is also constrained by the CO2 bending mode at 15.1 μm. Adding more CO2 to the feature at 4.27 μm would lead to absorption excess at 15.1 μm. On the other hand, the CO ice band at 4.67 μm (Fig. 17d) is reproduced in this study by three components, namely pure CO ice, CO:CH3OH, and CO formed with the cosmic-ray processing of H2O:NH3:CO2:CH4 (10:1:1:1) at 35 K, which is in agreement with the literature. Previous studies of the structure of the CO ice band (e.g. Pontoppidan et al. 2003; Thi et al. 2006; Perotti et al. 2020) showed that this band is well fitted by three Gaussian components or a combination of laboratory data and analytical functions. Such components are associated with pure CO, and CO mixed in polar ice matrices (e.g. H2O and CH3OH; Cuppen et al. 2011).

The broadband in the range 5.5−8.0 μm (Fig. 17e) is well fitted by combining multiple components. In particular, two ice samples processed by UV (H2O:NH3:CH3OH:CO:CO2; Muñoz Caro & Schutte 2003) and cosmic rays (H2O:NH3:CO2:CH4Pilling et al. 2010) significantly contribute to fit the Elias 29 spectrum. The ammonium ion () is a common byproduct of the ice processing in these two cases, and is one of the proposed carriers of the band at 6.85 μm. The significant contribution of these two ice samples to the spectrum of Elias 29 might indicate that the chemical evolution of this YSO is mostly induced by energetic processing of the ice mantles.

In the presented spectral analysis, CH3CH2OH mixed in H2O ice is a likely carrier of absorption bands at 7.5 μm and 9.6 μm (Figs. 17e,f). Although the ethanol bands cannot be resolved at 7.5 μm with ISO observations, the band at 9.6 μm could indicate a tentative detection of this complex organic molecule toward Elias 29. This result also shows the contribution of methanol ice at 9.75 μm. Most importantly, this result shows that the band between 9.5 and 10 μm might not only be associated with methanol as suggested previously (e.g. Boogert et al. 2008; Bottinelli et al. 2010), but might be due to the presence of other alcohols in ices as discussed by Jones (2016). Consequently, the reduced amount of methanol in ices would contribute to partially solve the conundrum of the low gas-to-ice ratio observed toward star-forming regions (∼10−4; Öberg et al. 2009; Perotti et al. 2020, 2021), as also conjectured by Jones (2016). However, we stress that high-resolution spectral data are required to unambiguously detect complex molecules in ices beyond methanol.

4.3. Recurrence analysis of the spectral decomposition

Because the ice spectral features vary with the physical and chemical environment, that is parameters such as composition, temperature, and radiation field, the fits of observational IR spectra of YSOs are often degenerate. The recurrence pie chart (Sect. 2.7) is used to verify the uniqueness of the solution found with the ENIIGMA and the result is displayed in Fig. 18.

thumbnail Fig. 18.

Recurrence pie chart of the IR ice data contributing to solutions inside 3σ confidence interval (Δχ2 ≤ 18.5). The size of each piece is proportional to the contribution of each sample.

This analysis shows that samples 1−7 have recurrences above 75%, and are also the components providing the global minimum solution as seen in Fig. 17. Samples 1−3 have a recurrence of 100%, and thus are solutions that cannot be replaced; additionally, they provide a significant contribution to the strong absorption features in the Elias 29 spectrum. The CO sample has a recurrence of 93.8% because the C−O stretching mode at 4.67 μm can be combined with other solutions containing CO, such as sample 1 (CO formed after processing; see Sect. 4.2) and samples 7 and 9. The recurrence of pure H2O (87.5%) indicates that this component is necessary for the global minimum solution, but is not crucial. Other samples containing H2O (e.g. 6, 9 and 10) can also be adopted to provide a good fit of the water absorption bands. Sample 6 is the only component containing CH3CH2OH. However, because this sample is dominated by water ice, which can be replaced by another component, its recurrence is 78.1%. Sample 7 has a recurrence of 75% and is important to fit the red wing of the absorption band at 4.67 μm.

Samples 8−11 are less recurrent (37.7−68.8%), although still inside the 3σ confidence interval. In particular, sample 8 contains a strong feature that could replace component 2 in the fits. However, it is not part of the global minimum solution because the bandwidth at 6.85 μm of component 8 is lower than in component 2, which provided a good match with the observational spectrum. Samples 9 and 10 provide a less accurate fit than components 1, 5, and 6. The less recurrent component, HCOOH, has been tentatively detected towards Elias 29. Despite formic acid not being part of the global minimum solution, it is not excluded as a potential carrier to fit the Elias 29 spectrum.

4.4. Ice column density

Ice bands can be blended in the IR spectrum, and therefore the selection of clear absorption features is essential to providing accurate quantification of the ice column densities. With the spectral decomposition used in this paper, the bands associated with different molecules were isolated, and their ice column densities calculated using Eq. (9) and the band strengths listed in Table 1. The derived values are shown in Table 3 and are compared with previous calculations by Boogert et al. (2000). The lower and upper bounds are calculated from the 3σ confidence interval analysis shown in Fig. 19. Table 3 also shows the median ice column densities calculated from the histograms shown in Appendix B. These values take into account all solutions found by the ENIIGMA tool inside 3σ confidence interval.

Table 3.

Ice column densities towards Elias 29.

thumbnail Fig. 19.

Two-dimensional Δχ2 maps in grey scale showing the correlation among the linear combination coefficients of seven ice laboratory data used to decompose the Elias 29 spectrum. The green, yellow, and red contours indicate 1−3σ confidence intervals, respectively. For the seven components, they are defined for Δχ2 equal to 8.38, 12.02, and 18.48.

Water ice has multiple bands detected in the Elias 29 spectrum. In this paper, the water libration mode at 14 μm is used in the calculation as it is easily separated from the CO2 vibrational mode at 15 μm. The bands at 3 μm and 6 μm are blended with strong features of some molecules such as CH3OH, HCOOH and processed ice, which makes the correct extraction of the water band difficult. In the cases of CO2 and CO, the respective bands at 4.27 μm and 4.67 μm were used to derive the column densities. However, the pure and mixed CO2 components found in the decomposition were not sufficient to reproduce the observed absorption band, and the NCO2 ice column density in this paper is 28% lower than that found by Boogert et al. (2000). On the other hand, the NCO values match well within the uncertainties. The ice column densities of calculated in this paper for H2CO (5.83 μm), CH3OH (6.83 μm), CH4 (7.67 μm), and (6.85 μm) are also in agreement with the findings of Boogert et al. (2000).

From the ENIIGMA fits, CH3CH2OH contributes to the absorption band at 9.6 μm, and has abundances with respect to H2O and CH3OH ices of 0.25% and 10%, respectively. Other molecules such as HCOOH, OCS and OCN were not found in this work.

5. Limitations

The methodology adopted in the ENIIGMA fitting tool seems to work well to decompose ice bands in the spectra of YSOs. However, we mention in this section the limitations of this tool, and list opportunities for further development.

First. A poorly subtracted continuum or silicate band would add spurious features to the spectrum that could bias the ENIIGMA spectral decomposition. For example, due to inherent uncertainties in the broad 18 μm amorphous silicate band subtraction, the shape of the H2O libration and CO2 bands around 13 μm could be affected. We therefore recommend fitting other water and carbon dioxide absorption bands when possible to increase the reliability of the spectral decomposition method. We highlight that the approaches included in the ENIIGMA fitting tool are simplified techniques that have been successful while dealing with IR observations containing ice bands (e.g. Boogert et al. 2008; Bottinelli et al. 2010). Nevertheless, more accurate approaches than the ENIIGMA package could be adopted, instead. For example, a separate subtraction of silicate and other dust features could be performed with other computational tools (e.g. THEMIS code; Jones et al. 2017), and the silicate-subtracted spectrum decomposed with the ENIIGMA fitting tool.

Second. The solutions found with the ENIIGMA fitting tool rely on the amount and diversity of data stored in the ENIIGMA database, which is flexible and can be enlarged or reduced. While adding data to the database, we recommend caution, and encourage the addition of properly baselined laboratory data, as inaccurate baselines would bias the results as well.

Third. Experimental works show that both the temperature and chemical environment change the shape and peak position of ice bands (e.g. Schutte & Khanna 2003; Dawes et al. 2016). This imposes an intrinsic caveat to the methodology adopted in the ENIIGMA fitting tool, namely the effects of intermolecular interactions within the ice are limited to the mixtures present in the ENIIGMA database. It is worth noting, however, that to the best of our knowledge, there is no analytical method that systematically explains such a change in the ice bands with the chemical environment. Works by Bonfim & Pilling (2018) started addressing this issue using quantum chemistry calculations, where an average dielectric constant simulates a generic mixture. One way to overcome this limitation is to add reliable experimental data of icy mixtures to the database.

Fourth. The current version of the ENIIGMA fitting tool does not account for changes in the shapes of the spectral bands caused by the geometry and properties of the grain where the ice is formed in interstellar conditions.

6. Summary

In this paper, the ENIIGMA fitting tool is introduced as a new approach to perform spectral decomposition of ice features in the IR using a genetic modelling algorithm. Python functions to perform continuum calculation and silicate removal in YSO spectra before the decomposition are also available in the tool. We also added a post-processing module dedicated to statistical analysis of the solutions to the capabilities of the ENIIGMA tool. Compiling 103 laboratory ice spectra split into pure and mixed, as well as thermally or energetically processed ices, we created an internal database for use in the spectral analysis.

In order to test all capabilities of the ENIIGMA fitting tool, fully blind and non-blind tests were performed on known species. We applied the statistical module to derive the confidence intervals of coefficients in the global minimum solution. Moreover, the code was able to successfully decompose a synthetic ice spectrum containing different fractions of chemical species with respect to water, as well as a constructed mixture containing spurious absorption and emission features.

A final test on a real source showed that the ENIIGMA successfully decomposed the Elias 29 spectrum with seven components, which correspond to pure and mixed ice samples. From this analysis, multiple molecules were identified, including a tentative detection of CH3CH2OH at 9.6 μm. Moreover, the ice column densities and their lower and upper limits derived with the ENIIGMA tool from the global minimum solution and histogram analysis are in agreement with the previous estimations in the literature.

The tests performed in the present study with synthetic and observational ice spectra show that the ENIIGMA fitting tool can identify both strong and weak absorption features associated with different molecules. In this regard, it will be a useful toolbox for the analysis of data provided by future near- and mid-IR observations of YSOs, such as the data that will be provided by the James Webb Space Telescope.


Acknowledgments

We thank the referee, Anthony Jones, for his constructive comments and suggestions that significantly improved this paper. This work benefited from support by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program through ERC Consolidator Grant “S4F” (grant agreement No 646908) to JKJ. The research of LEK is supported by a research grant (No 19127) from VILLUM FONDEN. WRMR also thanks Dr. Ahmed Gad for the fruitful discussions about the genetic algorithm principles.

References

  1. Acharyya, K., Fuchs, G. W., Fraser, H. J., van Dishoeck, E. F., & Linnartz, H. 2007, A&A, 466, 1005 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Alata, I., Cruz-Diaz, G. A., Muñoz Caro, G. M., & Dartois, E. 2014, A&A, 569, A119 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Artur de la Villarmois, E., Jørgensen, J. K., Kristensen, L. E., et al. 2019, A&A, 626, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Avni, Y., & Bahcall, J. N. 1980, ApJ, 235, 694 [NASA ADS] [CrossRef] [Google Scholar]
  5. Baier, A., Kerschbaum, F., & Lebzelter, T. 2010, A&A, 516, A45 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Bergin, E. A., Melnick, G. J., Gerakines, P. A., Neufeld, D. A., & Whittet, D. C. B. 2005, ApJ, 627, L33 [NASA ADS] [CrossRef] [Google Scholar]
  7. Bisschop, S. E., Fuchs, G. W., Boogert, A. C. A., van Dishoeck, E. F., & Linnartz, H. 2007, A&A, 470, 749 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Blickle, T., & Thiele, L. 1996, Evol. Comput., 4, 361 [Google Scholar]
  9. Bonfim, V. S., & Pilling, S. 2018, in IAU Symposium, eds. M. Cunningham, T. Millar, & Y. Aikawa, 332, 346 [Google Scholar]
  10. Boogert, A. C. A., & Ehrenfreund, P. 2004, in Astrophysics of Dust, eds. A. N. Witt, G. C. Clayton, & B. T. Draine, ASP Conf. Ser., 309, 547 [NASA ADS] [Google Scholar]
  11. Boogert, A. C. A., Tielens, A. G. G. M., Ceccarelli, C., et al. 2000, A&A, 360, 683 [NASA ADS] [Google Scholar]
  12. Boogert, A. C. A., Hogerheijde, M. R., Ceccarelli, C., et al. 2002, ApJ, 570, 708 [NASA ADS] [CrossRef] [Google Scholar]
  13. Boogert, A. C. A., Pontoppidan, K. M., Knez, C., et al. 2008, ApJ, 678, 985 [Google Scholar]
  14. Boogert, A. C. A., Huard, T. L., Cook, A. M., et al. 2011, ApJ, 729, 92 [NASA ADS] [CrossRef] [Google Scholar]
  15. Boogert, A. C. A., Chiar, J. E., Knez, C., et al. 2013, ApJ, 777, 73 [NASA ADS] [CrossRef] [Google Scholar]
  16. Boogert, A. C. A., Gerakines, P. A., & Whittet, D. C. B. 2015, ARA&A, 53, 541 [Google Scholar]
  17. Bossa, J. B., Theule, P., Duvernay, F., & Chiavassa, T. 2009, ApJ, 707, 1524 [Google Scholar]
  18. Bottinelli, S., Boogert, A. C. A., Bouwman, J., et al. 2010, ApJ, 718, 1100 [NASA ADS] [CrossRef] [Google Scholar]
  19. Bouilloud, M., Fray, N., Bénilan, Y., et al. 2015, MNRAS, 451, 2145 [Google Scholar]
  20. Brooke, T. Y., Sellgren, K., & Geballe, T. R. 1999, ApJ, 517, 883 [NASA ADS] [CrossRef] [Google Scholar]
  21. Charbonneau, P. 1995, ApJS, 101, 309 [Google Scholar]
  22. Chiar, J. E., Adamson, A. J., Kerr, T. H., & Whittet, D. C. B. 1995, ApJ, 455, 234 [NASA ADS] [CrossRef] [Google Scholar]
  23. Chiar, J. E., Tielens, A. G. G. M., Whittet, D. C. B., et al. 2000, ApJ, 537, 749 [NASA ADS] [CrossRef] [Google Scholar]
  24. Collings, M. P., Anderson, M. A., Chen, R., et al. 2004, MNRAS, 354, 1133 [NASA ADS] [CrossRef] [Google Scholar]
  25. Cox, P. 1989, A&A, 225, L1 [NASA ADS] [Google Scholar]
  26. Cuppen, H. M., Penteado, E. M., Isokoski, K., van der Marel, N., & Linnartz, H. 2011, MNRAS, 417, 2809 [Google Scholar]
  27. Dartois, E., & d’Hendecourt, L. 2001, A&A, 365, 144 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Dartois, E., d’Hendecourt, L., Thi, W., Pontoppidan, K. M., & van Dishoeck, E. F. 2002, A&A, 394, 1057 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Dawes, A., Mason, N. J., & Fraser, H. J. 2016, Phys. Chem. Chem. Phys. (Inc. Faraday Trans.), 18, 1245 [NASA ADS] [CrossRef] [Google Scholar]
  30. D’Hendecourt, L. B., & Allamandola, L. J. 1986, A&AS, 64, 453 [NASA ADS] [Google Scholar]
  31. Fuchs, G. W., Cuppen, H. M., Ioppolo, S., et al. 2009, A&A, 505, 629 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  32. Gerakines, P. A., Schutte, W. A., Greenberg, J. M., & van Dishoeck, E. F. 1995, A&A, 296, 810 [NASA ADS] [Google Scholar]
  33. Gerakines, P. A., Schutte, W. A., & Ehrenfreund, P. 1996, A&A, 312, 289 [NASA ADS] [Google Scholar]
  34. Gibb, E. L., & Whittet, D. C. B. 2002, ApJ, 566, L113 [NASA ADS] [CrossRef] [Google Scholar]
  35. Gibb, E. L., Whittet, D. C. B., Schutte, W. A., et al. 2000, ApJ, 536, 347 [Google Scholar]
  36. Gibb, E. L., Whittet, D. C. B., Boogert, A. C. A., & Tielens, A. G. G. M. 2004, ApJS, 151, 35 [NASA ADS] [CrossRef] [Google Scholar]
  37. Greenberg, J. M., & D’Hendecourt, L. B. 1985, in NATO Advanced Science Institutes (ASI) Series C, eds. J. Klinger, D. Benest, A. Dollfus, & R. Smoluchowski, 156, 185 [Google Scholar]
  38. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [Google Scholar]
  39. Harrison, T. E., & Hamilton, R. T. 2015, AJ, 150, 142 [NASA ADS] [CrossRef] [Google Scholar]
  40. Hetem, A., & Gregorio-Hetem, J. 2007, MNRAS, 382, 1707 [NASA ADS] [Google Scholar]
  41. Hinterding, R. 1995, Proceedings of 1995 IEEE International Conference on Evolutionary Computation, 1, 384 [CrossRef] [Google Scholar]
  42. Holland, J. H. 1975, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence (Ann Arbor: University of Michigan Press) [Google Scholar]
  43. Hudgins, D. M., & Sandford, S. A. 1998, J. Phys. Chem. A, 102, 329 [NASA ADS] [CrossRef] [Google Scholar]
  44. Hudgins, D. M., Sandford, S. A., Allamandola, L. J., & Tielens, A. G. G. M. 1993, ApJS, 86, 713 [NASA ADS] [CrossRef] [Google Scholar]
  45. Hudson, R. L., & Moore, M. H. 2004, Icarus, 172, 466 [Google Scholar]
  46. Ioppolo, S., Fedoseev, G., Chuang, K. J., et al. 2021, Nat. Astron., 5, 197 [NASA ADS] [CrossRef] [Google Scholar]
  47. Ishii, M., Nagata, T., Chrysostomou, A., & Hough, J. H. 2002, AJ, 124, 2790 [NASA ADS] [CrossRef] [Google Scholar]
  48. Jones, A. P. 2012, A&A, 540, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  49. Jones, A. P. 2016, R. Soc. Open Sci., 3, 160224 [NASA ADS] [CrossRef] [Google Scholar]
  50. Jones, A. P., Fanciullo, L., Köhler, M., et al. 2013, A&A, 558, A62 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Jones, A. P., Köhler, M., Ysard, N., Bocchio, M., & Verstraete, L. 2017, A&A, 602, A46 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Keane, J. V., Tielens, A. G. G. M., Boogert, A. C. A., Schutte, W. A., & Whittet, D. C. B. 2001, A&A, 376, 254 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Kemper, F., Vriend, W. J., & Tielens, A. G. G. M. 2004, ApJ, 609, 826 [Google Scholar]
  54. Knacke, R. F., McCorkle, S., Puetter, R. C., Erickson, E. F., & Kraetschmer, W. 1982, ApJ, 260, 141 [NASA ADS] [CrossRef] [Google Scholar]
  55. Köhler, M., Ysard, N., & Jones, A. P. 2015, A&A, 579, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  56. Koza, J. R. 1992, Genetic programming: On the Programming of Computers by Means of Natural Selection (Cambridge: The MIT Press) [Google Scholar]
  57. Lindberg, J. E., Jørgensen, J. K., Brinch, C., et al. 2014, A&A, 566, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  58. Linnartz, H., Ioppolo, S., & Fedoseev, G. 2015, Int. Rev. Phys. Chem., 34, 205 [Google Scholar]
  59. McClure, M. 2009, ApJ, 693, L81 [NASA ADS] [CrossRef] [Google Scholar]
  60. Merrill, K. M., Russell, R. W., & Soifer, B. T. 1976, ApJ, 207, 763 [NASA ADS] [CrossRef] [Google Scholar]
  61. Moultaka, J., Eckart, A., Viehmann, T., et al. 2004, in The Dense Interstellar Medium in Galaxies, eds. S. Pfalzner, C. Kramer, C. Staubmeier, & A. Heithausen, 91, 295 [NASA ADS] [CrossRef] [Google Scholar]
  62. Muñoz Caro, G. M., & Schutte, W. A. 2003, A&A, 412, 121 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  63. Noble, J. A., Fraser, H. J., Aikawa, Y., Pontoppidan, K. M., & Sakon, I. 2013, ApJ, 775, 85 [Google Scholar]
  64. Noble, J. A., Fraser, H. J., Pontoppidan, K. M., & Craigon, A. M. 2017, MNRAS, 467, 4753 [Google Scholar]
  65. Öberg, K. I., Boogert, A. C. A., Pontoppidan, K. M., et al. 2008, ApJ, 678, 1032 [Google Scholar]
  66. Öberg, K. I., Bottinelli, S., & van Dishoeck, E. F. 2009, A&A, 494, L13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  67. Öberg, K. I., Boogert, A. C. A., Pontoppidan, K. M., et al. 2011, ApJ, 740, 109 [Google Scholar]
  68. Ortiz-León, G. N., Loinard, L., Dzib, S. A., et al. 2018, ApJ, 869, L33 [Google Scholar]
  69. Ossenkopf, V., & Henning, T. 1994, A&A, 291, 943 [NASA ADS] [Google Scholar]
  70. Oya, Y., López-Sepulcre, A., Sakai, N., et al. 2019, ApJ, 881, 112 [Google Scholar]
  71. Palumbo, M. E., Baratta, G. A., Brucato, J. R., et al. 1998, A&A, 334, 247 [Google Scholar]
  72. Peng, S., Xu, Q., Ling, X. B., et al. 2003, FEBS Lett., 555, 358 [Google Scholar]
  73. Perone, C. S. 2009, Pyevolve: A Python Open-Source Framework for Genetic Algorithms (New York: Association for Computing Machinery) [Google Scholar]
  74. Perotti, G., Rocha, W. R. M., Jørgensen, J. K., et al. 2020, A&A, 643, A48 [EDP Sciences] [Google Scholar]
  75. Perotti, G., Jørgensen, J. K., Fraser, H. J., et al. 2021, A&A, 650, A168 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Pilling, S., Seperuelo Duarte, E., da Silveira, E. F., et al. 2010, A&A, 509, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  77. Pontoppidan, K. M., Fraser, H. J., Dartois, E., et al. 2003, A&A, 408, 981 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  78. Pontoppidan, K. M., van Dishoeck, E. F., & Dartois, E. 2004, A&A, 426, 925 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  79. Pontoppidan, K. M., Dullemond, C. P., van Dishoeck, E. F., et al. 2005, ApJ, 622, 463 [NASA ADS] [CrossRef] [Google Scholar]
  80. Pontoppidan, K. M., Boogert, A. C. A., Fraser, H. J., et al. 2008, ApJ, 678, 1005 [NASA ADS] [CrossRef] [Google Scholar]
  81. Potapov, A., Mutschke, H., Seeber, P., Henning, T., & Jäger, C. 2018, ApJ, 861, 84 [NASA ADS] [CrossRef] [Google Scholar]
  82. Potapov, A., Bouwman, J., Jäger, C., & Henning, T. 2021, Nat. Astron., 5, 78 [NASA ADS] [CrossRef] [Google Scholar]
  83. Poteet, C. A., Whittet, D. C. B., & Draine, B. T. 2015, ApJ, 801, 110 [NASA ADS] [CrossRef] [Google Scholar]
  84. Rachid, M. G., Pilling, S., Rocha, W. R. M., et al. 2020, MNRAS, 494, 2396 [NASA ADS] [CrossRef] [Google Scholar]
  85. Reach, W. T., Faied, D., Rho, J., et al. 2009, ApJ, 690, 683 [NASA ADS] [CrossRef] [Google Scholar]
  86. Rocha, W., & Pilling, S. 2014, Spectrochim. Acta Part A: Mol. Biomol. Spectrosc., 123, 436 [CrossRef] [Google Scholar]
  87. Rocha, W. R. M., & Pilling, S. 2015, ApJ, 803, 18 [NASA ADS] [CrossRef] [Google Scholar]
  88. Rocha, W. R. M., & Pilling, S. 2018, MNRAS, 478, 5190 [NASA ADS] [CrossRef] [Google Scholar]
  89. Rocha, W. R. M., Pilling, S., de Barros, A. L. F., et al. 2017, MNRAS, 464, 754 [NASA ADS] [CrossRef] [Google Scholar]
  90. Rocha, W., Pilling, S., Domaracka, A., Rothard, H., & Boduch, P. 2019, Spectrochim. Acta Part A: Mol. Biomol. Spectrosc., 228, 117826 [Google Scholar]
  91. Sack, N. J., & Baragiola, R. A. 1993, Phys. Rev. B, 48, 9973 [NASA ADS] [CrossRef] [Google Scholar]
  92. Schutte, W. A., & Khanna, R. K. 2003, A&A, 398, 1049 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  93. Schutte, W. A., Allamandola, L. J., & Sandford, S. A. 1993, Icarus, 104, 118 [NASA ADS] [CrossRef] [Google Scholar]
  94. Schutte, W. A., Tielens, A. G. G. M., Whittet, D. C. B., et al. 1996, A&A, 315, L333 [NASA ADS] [Google Scholar]
  95. Schutte, W. A., Boogert, A. C. A., Tielens, A. G. G. M., et al. 1999, A&A, 343, 966 [NASA ADS] [Google Scholar]
  96. Smith, R. G., Sellgren, K., & Tokunaga, A. T. 1989, ApJ, 344, 413 [CrossRef] [Google Scholar]
  97. STScI Development Team 2013, Astrophysics Source Code Library [record ascl:1303.023] [Google Scholar]
  98. Suutarinen, A. 2015, PhD Thesis, The Open University, UK [Google Scholar]
  99. Szkody, P., Mukadam, A., Gänsicke, B. T., et al. 2010, ApJ, 716, 1531 [NASA ADS] [CrossRef] [Google Scholar]
  100. Terwisscha van Scheltinga, J., Ligterink, N. F. W., Boogert, A. C. A., van Dishoeck, E. F., & Linnartz, H. 2018, A&A, 611, A35 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  101. Thi, W.-F., van Dishoeck, E. F., Dartois, E., et al. 2006, A&A, 449, 251 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  102. Thi, W.-F., Woitke, P., & Kamp, I. 2011, MNRAS, 412, 711 [NASA ADS] [Google Scholar]
  103. Tielens, A. G. G. M. 2008, ARA&A, 46, 289 [NASA ADS] [CrossRef] [Google Scholar]
  104. Tielens, A. G. G. M., & Allamandola, L. J. 1987, in Physical Processes in Interstellar Clouds, eds. G. E. Morfill, & M. Scholer, NATO Adv. Study Inst. (ASI) Ser. C, 210, 333 [NASA ADS] [CrossRef] [Google Scholar]
  105. van Breemen, J. M., Min, M., Chiar, J. E., et al. 2011, A&A, 526, A152 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  106. van Dishoeck, E. F., Wright, C. M., Cernicharo, J., et al. 1998, ApJ, 502, L173 [NASA ADS] [CrossRef] [Google Scholar]
  107. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Meth., 17, 261 [Google Scholar]
  108. Watanabe, N., & Kouchi, A. 2002, ApJ, 571, L173 [Google Scholar]
  109. Whittet, D. C. B., Cook, A. M., Chiar, J. E., et al. 2009, ApJ, 695, 94 [NASA ADS] [CrossRef] [Google Scholar]
  110. Woitke, P., Min, M., Pinte, C., et al. 2016, A&A, 586, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  111. Woitke, P., Kamp, I., Antonellini, S., et al. 2019, PASP, 131, 064301 [NASA ADS] [CrossRef] [Google Scholar]
  112. Zasowski, G., Kemper, F., Watson, D. M., et al. 2009, ApJ, 694, 459 [CrossRef] [Google Scholar]

Appendix A: List of ice laboratory data

We compiled a data set containing 102 laboratory data for astrophysical ices from different public online databases and divided them into four categories, namely (i) pure and non-processed ices – Table A.1, (ii) pure and thermally processed ices – Table A.2, (iii) organic residue samples – Table A.3, and (iv) mixed processed ices – Table A.4. In the last category, the mixtures are grouped into three subcategories, namely non-processed, thermally processed, and energetically processed. These ice samples were formed inside a high-vacuum chamber (P ∼ 10−7 − 10−10 mbar) by the condensation of liquids or gases onto an IR transparent substrate (e.g. ZnSe, CsI) cooled to temperatures of around 10 K. In the cases where the ice samples were irradiated by UV, a hydrogen discharge lamp dominated by Ly-α emission with a flux of 1014 photons cm−2 s−1 was used (Schutte & Khanna 2003; Muñoz Caro & Schutte 2003). Some samples were irradiated by heavy ions simulating the effect of cosmic rays in the ISM. In this case, different ions were used as a projectile with energies in the range 0.2−632 MeV.

Table A.1.

Laboratory data of pure and non-processed ices used in this paper.

Table A.2.

Laboratory data of pure and thermally processed ices used in this paper.

Table A.3.

Laboratory data of organic residue.

Table A.4.

Laboratory data of mixed and processed ices (thermal or energetic) used in this paper.

Appendix B: Histograms

All solutions inside a 3σ confidence interval out of 1100 combinations (11 components) are analysed with histograms (Figure B.1). The mean, lower, and upper limits of the column densities are derived in this analysis (see Table 3). The bin size is proportional to the column density variation and is calculated by the Freedman Diaconis Estimator, which is given by:

(B.1)

where IQR is the interquartile range, and is robust to the outliers. In order to reduce the skewness in the distribution, the histograms are log-transformed, although some data still show tails in the distribution. To take into account that skewness factor, the mean and upper limits are calculated by scipy.stats.skewnorm available in the SciPy library (Virtanen et al. 2020). The skewness parameter is calculated, and the statistical quantities derived. The histograms

thumbnail Fig. B.1.

Log-transformed and median-centred histograms of the ice column densities derived from all solutions inside a 3σ confidence interval in the case of Elias 29.

All Tables

Table 1.

Ice features and band strengths of the identified molecules during the spectral decomposition.

Table 2.

Laboratory data of pure and thermally processed ices used in this paper.

Table 3.

Ice column densities towards Elias 29.

Table A.1.

Laboratory data of pure and non-processed ices used in this paper.

Table A.2.

Laboratory data of pure and thermally processed ices used in this paper.

Table A.3.

Laboratory data of organic residue.

Table A.4.

Laboratory data of mixed and processed ices (thermal or energetic) used in this paper.

All Figures

thumbnail Fig. 1.

Flowchart of the ENIIGMA fitting tool. The light grey dashed box highlights the steps during the spectral decomposition. The light yellow region highlights the Genetic Algorithm (GA) module, whereas the light green region shows the steps performed during the post-processing statistical analysis. The small grey boxes indicate the input data requested by the tool. Small blue boxes are the output data. White boxes indicate the processes performed in each step.

In the text
thumbnail Fig. 2.

Illustration of the silicate removal with the synthetic silicate method. Top: B1-b optical depth is shown by the black solid line, whereas the dashed and solid lines are the scaled GCS 3 silicate profile and the synthetic silicate feature, respectively. Bottom: residual optical depth after synthetic silicate removal (black line). The blue and red colours show the residual optical depth after local and template continuum methods, respectively (see text for details; Bottinelli et al. 2010).

In the text
thumbnail Fig. 3.

ENIIGMA fit (dashed green line) of the pure H2O ice sample at 15 K (black line). Left and middle panels: fit in non-blind and blind tests, respectively. Right panel: fit when the correct solution does not exist. The residuals of the fit are shown in the boxes below the fits.

In the text
thumbnail Fig. 4.

Same as Fig. 3, but for pure CH3CH2OH at 15 K.

In the text
thumbnail Fig. 5.

Same as Fig. 3, but for H2O:CH3CH2OH (20:1) at 15 K.

In the text
thumbnail Fig. 6.

Same as Fig. 3, but for CO:CH3OH:CH3CH2OH (20:20:1) at 15 K.

In the text
thumbnail Fig. 7.

Same as Fig. 3, but for H2O:CO:NH3:CH3OH at 10 K.

In the text
thumbnail Fig. 8.

Spectral decomposition and confidence interval analysis of the ENIIGMA results. Left panels: spectral decomposition of the combined water ice spectra (black solid line) with proportions of 50:50 and 90:10. The components of the decomposition are given by the red and blue dotted lines, whereas the ENIIGMA fit is given by the green dashed line. The residual of the fit is shown in the bottom parts of these graphs. Right panels: confidence interval analysis of the coefficients in the linear combination. The grey-scale colours are the difference between chi-square and minimum chi-square values; the olive, yellow, and red contours are 1σ, 2σ, and 3σ confidence intervals, respectively.

In the text
thumbnail Fig. 9.

Top: heat map of the fitness score for each population over the generations. Bottom: fitness score over the generations. The blue and red colours indicate the best and worst fitness based on the population of each generation. The grey shaded area between the two lines shows the difference between the worst and best fitness.

In the text
thumbnail Fig. 10.

Same as Fig. 8, but for H2O at 15 K and 75 K.

In the text
thumbnail Fig. 11.

Degeneracy analysis for the combination of H2O (15 K) + H2O (75 K): 10:90. Panels a–c: stretching, bending, and librational modes of H2O ice, respectively, at 15, 40, and 75 K. These panels highlight the temperature effect on the band shapes in the water ice IR spectra. Panel d: recurrence pie chart for the fit of the combined data H2O(15 K):H2O(75 K) with proportions 10:90 and without adding Gaussian noise to the spectrum. Panel e: same as panel d, but after adding Gaussian noise to the spectrum (σ = 0.01).

In the text
thumbnail Fig. 12.

Same as Fig. 8, but for H2O and CO:CH3CH2OH at 15 K.

In the text
thumbnail Fig. 13.

Spectral decomposition of the H2O and CO:CH3OH:CH3CH2OH constructed mixture containing spurious features mimicked with hydrogen lines and naphthalene bands.

In the text
thumbnail Fig. 14.

Spectral decomposition of the synthetic ice spectrum composed of eight components of pure ice samples (black line) and Gaussian noise (σ = 0.01). Panel a: synthetic spectrum along with the silicate band from 2.5 − 20 μm. Panel b: synthetic spectrum without the silicate feature and residual plot. The ENIIGMA fit is shown by the dashed green line and the ice components by the other colours. Panels c–e: small spectral ranges highlighting the contribution of shallow bands.

In the text
thumbnail Fig. 15.

Continuum determination and optical depth calculation of the Elias 29 spectrum. Top: blue and red dashed lines show the blackbody and polynomial continuum calculation used in this paper, respectively. The grey dotted line shows the continuum determination from Boogert et al. (2000), which is similar to the continuum in the present paper. The dotted magenta line is the continuum calculation with radiative transfer modelling from Boogert et al. (2002). The green solid line is the continuum calculation with radiative transfer modelling from Rocha & Pilling (2015). In both cases, the silicate feature is redshifted in the models compared with the observations. Middle: grey line shows the optical depth scale of Elias 29 spectrum adopting the continuum from Boogert et al. (2000), whereas the blue and red lines show the same by respectively adopting blackbody and polynomial continuum derived in this paper. Bottom: difference between optical depth estimates from this paper and those in Boogert et al. (2000).

In the text
thumbnail Fig. 16.

Silicate extraction procedure in the Elias 29 spectrum. Top: black line shows the optical depth scale of the Elias 29 spectrum calculated in this paper. Red and blue lines show, respectively, the scaled CGS 3 and synthetic silicate bands at 9.7 μm and 18 μm. Bottom: red and blues lines show the Elias 29 mid-IR optical depth after silicate band extractions with scaled CGS 3 and synthetic silicate profiles, respectively.

In the text
thumbnail Fig. 17.

Spectral decomposition of the mid-IR spectrum of Elias 29 protostar. a: optical depth scale of the observed spectrum between 2.5 and 20 μm, and the seven ice components used to fit the observed spectrum. The best fit is shown by the dashed green line. The residual spectrum is shown below this panel. b–f: zoom onto short spectral ranges highlighting the contribution of small features. In these plots, the spectral features of amorphous carbon, a−C(:H), that were not used in the spectral decomposition are shown by the dashed grey lines above the Elias 29 spectrum and the ice data components. This illustrates a potential contribution of features other than vibrational modes of ice species. Panel b: L-band red wing absorption excess that is not entirely reproduced by the ice components (see Sect. 4.2). Panels c and d: CO2 and CO features. Panel e: important contributions of UV and cosmic-ray processed samples. Finally, panel f highlights the contribution of the broad NH3 ice feature at 9 μm, CH3CH2OH at 9.6 μm, and CH3OH at 9.8 μm.

In the text
thumbnail Fig. 18.

Recurrence pie chart of the IR ice data contributing to solutions inside 3σ confidence interval (Δχ2 ≤ 18.5). The size of each piece is proportional to the contribution of each sample.

In the text
thumbnail Fig. 19.

Two-dimensional Δχ2 maps in grey scale showing the correlation among the linear combination coefficients of seven ice laboratory data used to decompose the Elias 29 spectrum. The green, yellow, and red contours indicate 1−3σ confidence intervals, respectively. For the seven components, they are defined for Δχ2 equal to 8.38, 12.02, and 18.48.

In the text
thumbnail Fig. B.1.

Log-transformed and median-centred histograms of the ice column densities derived from all solutions inside a 3σ confidence interval in the case of Elias 29.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.