Open Access
Issue
A&A
Volume 699, July 2025
Article Number A135
Number of page(s) 15
Section Planets, planetary systems, and small bodies
DOI https://doi.org/10.1051/0004-6361/202554563
Published online 04 July 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

Heteroelements in organic molecules, i.e. oxygen (O), nitrogen (N), sulfur (S), and phosphorus (P), play crucial roles in the biochemistry of carbon-based life as they define the chemical functionality of a molecule or a molecular moiety. Thus, their study is of great interest already at the earliest stages of molecule formation in the interstellar medium (ISM; e.g. Tielens 2021). However, while smaller complex organic molecules (COMs) such as methanol are common targets of remote spectroscopic observations of the ISM, the low volatility and relative abun-dances of larger COMs impose major obstacles towards their gas phase detection. As of mid-2021, more than 300 molecules have been detected (McGuire 2022) and current frontiers are being pushed further in the frameworks of large surveys of organic-rich environments such as the Taurus molecular cloud (TMC-1) (McGuire et al. 2020; Cernicharo et al. 2021). Nevertheless, we still lack a comprehensive view on the inventory of molecular complexity in pristine interstellar ices. Retrieved extraterrestrial samples have allowed us to work around the limitations of remote observations and provided insights into the chemical composition of Solar System organics. Carbonaceous chondrites and their parent bodies, the C-group asteroids that are thought to have formed in the outer Solar System before migrating inwards, harbour an extensive cache of extraterrestrial organics that are accessible to state-of-the-art analytical tools (Botta & Bada 2002; Schmitt-Kopplin et al. 2010; Alexander et al. 2017). Samples returned from the various Solar System bodies have revealed valuable information: dust from comet 81P/Wild 2 returned by the National Aeronautics and Space Administration’s (NASA’s) Stardust mission (Sandford 2008), material from asteroid Ryugu returned by the Japan Aerospace Exploration Agency’s (JAXA’s) Hayabusa2 mission (Oba et al. 2023b; Potiszil et al. 2023; Yabuta et al. 2023; Schmitt-Kopplin et al. 2023), or material from asteroid Bennu returned by NASA’s OSIRIS-REx mission (Glavin et al. 2025). However, such analyses of extraterres-trial organics are affected by various types of processes causing chemical changes to the original material. Depending on the type of sample, these range from heating and shocks during entry into the Earth’s atmosphere and impact during sampling procedures in space to terrestrial weathering prior to sample collection and analytical protocols (e.g. involving solvent extractions). In addition, asteroids, or rather their parent bodies, underwent different degrees of post-accretional processing, for instance by hydrother-mal alteration, thermal metamorphism, or shocks. Effects of these alteration processes could be elucidated and constrained by comparative compositional studies of pristine Solar System matter like that in comets and asteroidal matter that has been processed to varying degrees.

It is therefore crucial to study pristine cometary organics, and the in situ data collected in the framework of the European Space Agency (ESA) Rosetta mission offer a unique opportunity to do so. The Rosetta spacecraft rendezvoused with its target, comet 67P/Churyumov-Gerasimenko (67P hereafter), in mid-2014 and accompanied it for 2 years on its orbit around the Sun. The Rosetta Orbiter Spectrometer for Ion and Neutral Analysis (ROSINA; Balsiger et al. 2007) instrument suite and especially its high-resolution Double Focusing Mass Spectrometer (DFMS) revealed the chemical composition of cometary volatiles including COMs (Altwegg et al. 2016; Rubin et al. 2019b; Hänni et al. 2022, 2023, 2024) as they outgassed from the cometary nucleus and ejected particles. Before Rosetta, only a few N- and NO-bearing COMs had been identified from remote-sensing observations at millimetre wavelengths, as listed in Table 1, compiled from Biver & Bockelée-Morvan (2019). These molecules have also been identified in DFMS data as reviewed by Altwegg et al. (2019).

An extraordinarily rich dataset was collected by the DFMS on 3 August 2015, while comet 67P was at 1.25 au and close to its perihelion at 1.24 au. Hänni et al. (2022) reduced and investigated this dataset (for an overview, see their Fig. 1). They suggested that enhanced desorption from the ejected dust led to the detection of a plethora of volatile organics with molecular weights of up to at least 140 Da and an average sum formula of C1H1.56O0.134N0.046S0.017. This average sum formula is similar to soluble organic matter from meteorites (Botta & Bada 2002; Alexander et al. 2017) and even to organics found in Saturn’s ring rain (Miller et al. 2020) by Cassini’s mass spectrometer. It could be indicative of a shared origin of that matter in the early Solar System. An Occam’s razor-based deconvolution approach was used to attempt identification (or exclusion) of individual COMs as candidates present in the gas mixture of the cometary coma. After having discussed comet 67P’s pure hydrocarbon budget in Hänni et al. (2022) and the budget of O-bearing COMs in Hänni et al. (2023), the current work uses, also for the sake of comparability, the same approach to investigate the chemical nature and diversity of the N- and NO-bearing species of the general sum formula CnHmN and CnHmNO, respectively, where n and m are the stoichiometric coefficients of carbon and hydrogen. Signals of species with multiple N and O atoms are sparse and will be the topic of future work. Section 2 revisits the key points of the instrument as well as the data reduction, and highlights the scientific value of the rationale behind the data interpretation approach. In Sect. 3 we present a list of observed CnHmN and CnHmNO parent sum formulas, and discuss potential candidate molecular structures that could reproduce the observations. Section 4 discusses the limitations of our interpretative approach, and compares our findings to the available literature on other well-studied reservoirs of Solar System organics, also highlighting the (pre)biotic relevance of the chemical function-alities featured in our work. We conclude with a short summary and outlook in Sect. 5.

Table 1

N-bearing molecules detected in comets by means of remote-sensing at millimetre wavelengths.

thumbnail Fig. 1

High-resolution mass spectrum collected on 3 August 2015 by the DFMS in 67P’s coma around m/z = 41 (black square markers). The peaks, associated with C3H5, C2H3N, CHN2, and C2HO at the exact positions indicated by the dashed grey lines, are normally fitted with double-Gaussian peak profile functions (dashed blue lines with the sum shown as a solid red line).

2 Data evaluation

A detailed instrument description can be found in Balsiger et al. (2007) The data acquisition, reduction, and evaluation procedures including error estimations for this specific dataset are explained in Hänni et al. (2022, 2023). In the following, we only give a brief summary thereof, focusing on the specificities of the fragmentation behaviour of N- and NO-bearing molecules.

2.1 Method and instrumentation

A mass spectrometer allows the characterization of an analyte, here the complex gas mixture of the cometary coma, in terms of the exact masses of the individual components. In the case of the DFMS, the gaseous species are ionized by electron ionization (EI) and the resulting cations are separated according to their mass-to-charge ratio (m/z) by an electrostatic analyser and a subsequent curved permanent magnet. Eventually, the transmitted ions impinge on a microchannel plate-linear electron detector array assembly (De Keyser et al. 2017) and the intensities of the signals are retrieved in arbitrary units (a.u.). As the detector yield and the ionization cross-sections are usually not known for COMs, the derivation of absolute local abundances (molecules/ccm) is not possible and we indicate the fragment sum relative to the common COM methanol. As water, commonly used to normalize cometary species, does not correlate well with species sublimating from dust, we have chosen to use methanol, frequently observed in the ISM, instead (for a more detailed explanation, see Hänni et al. 2023).

A high-resolution DFMS mass spectrum covers just a narrow region of the mass range around the commanded integer mass. In order to cover a wider mass range, from m/z = 13 to 140 (the upper end of the DFMS mass range at that time), as in the dataset investigated for this work, the DFMS scanned integer masses sequentially. Figure 1 demonstrates that the DFMS mass resolution of m/Δm = 3000 (at 1% of the peak height at m/z = 28; Balsiger et al. 2007) is sufficient to separate pure hydrocarbon species from heteroatom-bearing ones. In this work, we make use of DFMS’ resolving power and attempt a full and untargeted analysis of the subset of CnHmN and CnHmNO signals collected on 3 August 2015 and shown in Fig. 1 in Hänni et al. (2022), panels c and e. We have improved the drift correction, which results in slightly different intensities than those shown in Fig. 1 in Hänni et al. (2022). As detailed in Hänni et al. (2022), the drift in the observed signal intensities (due to changing local densities resulting from factors such as spacecraft or comet motion while stepping through the mass range) is corrected based on the interpolation of frequent water measurements. For masses below about m/z = 40, the improved drift correction yielded slightly smaller intensities. Moreover, we do not consider signals in this evaluation that are only upper limits due to overlap with intense peaks usually from the pure hydrocarbon or O-bearing hydrocarbon on the same integer mass. For some masses, we also have improved our fits. General details about the DFMS data reduction routines have been reported previously (Le Roy et al. 2015; Rubin et al. 2019a; De Keyser et al. 2019, 2024).

The major difficulty in interpreting high-resolution mass spectra of a complex gas mixture lies in EI-induced fragmentation. Parent species usually fragment into daughter species and the charge-retaining fragments are registered alongside (or instead of) the molecular ion (M) signal and together yield the so-called fragmentation pattern of the parent. For a gas mixture such as 67P’s coma, the individual components as well as their EI-induced fragments are simultaneously present in the instrument’s ionization chamber and as a result, a sum fragmentation pattern, i.e. a stack of individual fragmentation patterns, is measured. Information on how such a sum fragmentation pattern may be interpreted will be given in the following Sect. 2.2.

2.2 Sum fragmentation pattern deconvolution

–To determine which chemical species are present at the time of data acquisition, the sum fragmentation pattern must be deconvolved. For the spectral deconvolution, reference mass spectra are needed, such as those collected in the extensive mass spectrometry database of the National Institute of Standards and Technology (NIST; Steins 2018). The ionization energy used to record the reference mass spectra archived in the NIST database normally is 70 or 75 eV while DFMS uses 45 eV. As it is impossible to calibrate every compound on the DFMS laboratory twin model, we have to rely on these standard mass spectra. It has been demonstrated for pure (Schuhmann et al. 2019a) and O-bearing hydrocarbons (Schuhmann et al. 2019b), that the calibrated spectra deviate slightly from the standards in that relatively less of the fragments is observed. How these differences might systematically affect our evaluation will be discussed in Sect. 4. Moreover, the NIST reference mass spectra provide unit-resolution information only, which results in ambiguities for certain signals as to whether the heteroatom is retained in the fragment species or not. For example, a given molecule contains a N atom but also an ethyl group and makes a fragment on m/z =29. Now this fragment could either be due to the N-related functionality (i.e. CH3N) or due to the ethyl group (i.e. C2H5) or even due to a mixture of both and high-resolution data are needed to clarify. Alternatively, sometimes deuterated or 13C isotopologues are measured to clarify the fragmentation mechanisms. For statistical reasons, if two heteroatoms are present in the parent, the instances of ambiguous fragments increase. Where no literature with high-resolution spectra of candidate molecules is available (and such literature on COMs is scarce), it is necessary to interpret those unit-resolution mass spectra. Therefore, we considered the rules and fundamentals of EI-MS-based structure determination as collected for instance in Mikaia (2022, 2023) and summarized briefly below. In the following, ‘–’ indicates chemical bonds (all valencies), whereas ‘–’ indicates that a fragment with a certain mass was lost from the molecular ion M. For the sake of clarity, we omit indicating the positive charges resulting from the EI process.

–Stevenson’s rule (Stevenson 1951): in the case of competing fragments, the fragment with the lowest ionization energy is usually the most abundant one and forms the base peak (i.e. the most intense peak in the fragmentation pattern of the molecule).

–McLafferty rearrangement in carbonyl-containing ions (McLafferty 1959): the hydrogen atom on the γ-carbon (i.e. four atoms away from the carbonyl oxygen) is transferred in a 1,5-shift, which creates a distonic radical cation (i.e. charge and radical site are separated) while the 6-membered transition state is sterically favourable.

–Fragmentation of aliphatic amines (R–NH2, where R is an aliphatic residue): Generally, the charge site is on the N atom, meaning that the N atom is retained while CH2 groups (14 Da) are lost. Rarely, NH3 (17 Da) is lost, leading to an M-NH3 signal. The largest aliphatic chain residue is lost preferentially because the resulting radical is more stable. In primary amines with an unbranched α-carbon, the base peak is CH4N on m/z = 30, which corresponds to the resonance stabilized iminium ion. For methy-lated secondary amines, C2H6N on m/z = 44 is the base peak as a result of β-cleavage. This peak increases in the homologous series of primary aliphatic amines with chain length. For dimethylated tertiary amines C3H8N on m/z = 58, resulting from β-cleavage, is the base peak. If the amine group is located on an aromatic ring, usually M and M-H are strong. If the amine on the aromatic ring is a secondary amine (Ar–NH–R, where Ar is an aromatic ring), then Ar–NH–CH2 on m/z = 106, which results from β-cleavage at the R site, is resonance-stabilized and hence strong. If HCN (27 Da) is lost, then C5H5 on m/z = 65 and C5H6 on m/z = 66 are high.

–Fragmentation of N-heterocycles (Rc–N(H)–Rc, where Rc is a cyclic structure): these species normally yield strong M and M-H signals because α-cleavage, which leads to a ring-opening, can also entail loss of a hydrogen radical. Often, either M or M-H is the base peak.

–Fragmentation of nitriles (R–CN): Nitriles tend to produce M-H after losing an α-hydrogen atom but no M. Hence, they are difficult to identify. A prominent and frequent base peak at resonance-stabilized NH-C-CH2 on m/z = 41 is the result of the McLafferty rearrangement in compounds whose α-carbon atom is not branched. Straight-chain nitriles containing seven or more carbon atoms lead to a characteristic signal on m/z = 97, which corresponds to a 6-membered ring of carbon atoms with an NH-substituent (C6H11N).

–Fragmentation of carboxamides (R–C(O)–NRR′′, where R and R′′ are two more residues): CH2NO on m/z = 44 is a common base peak of primary amides if the abstractable hydrogen for the McLafferty rearrangement is missing. Primary amides also produce a 5-membered cyclic amide fragment on m/z = 86. If there is an abstractable hydrogen, then C2H5NO on m/z = 59 is the base peak for primary, secondary, and tertiary amides. CNH4N on m/z = 30 (as for amines) is obtained, if the alkyl groups on the nitrogen are longer than two carbon atoms. Aromatic amides with alkyl substituents lead to Ar-CO (i.e. M-NR2). Subsequent loss of CO leads to C6H5 on m/z = 77.

The key points summarized here capture the typical fragmentation behavior of the most common groups of N-bearing molecules and are a guide to interpreting the unit resolution reference spectra available from NIST (Steins 2018).

Generally, a specific molecule in a mass spectrum of a complex mixture like that of the cometary coma can be identified unambiguously if two conditions are met: First, if it has a detectable M signal plus fragment peaks that are compatible with the observed pattern. Note that fragment signals alone are ambiguous under such conditions as many species could yield the same fragments. Second, if no alternative structural isomer exists or all but one can be ruled out. The latter is only the case for small molecules as the number of structural isomers grows with the number of atoms in the molecule, resulting in a combinatorial explosion. Mass spectrometry, by default, cannot distinguish between structural isomers as they have the same exact mass, unless their fragmentation patterns differ significantly. One example, where the distinction of two structural isomers (dimethylsulfide and ethanethiol) is possible, has been presented by Hänni et al. (2024). On the other hand, if a given molecule shows a signal (parent or fragment) in its fragmentation pattern that is not present in the observed data, then this is an exclusion criterion. In that way, certain molecules can be clearly excluded while it may still not be possible to pin down one specific candidate molecule for an observed parent sum formula.

In summary, to derive the true composition of the cometary neutral coma based on the sum fragmentation pattern observed by the DFMS on 3 August 2015 and presented in Hänni et al. (2022), a spectral deconvolution into individual components is necessary. In the following paragraph, we argue that strictly mathematical deconvolution methods are too limited and explain which approach we pursued instead.

2.3 Data interpretation strategy

Mass spectra deconvolution often employs mathematical approaches such as residual minimization. As a consequence of the uncertainties mentioned above as well as in Hänni et al. (2022) and Hänni et al. (2023), the mathematical problem is ill-conditioned and minimizing residuals is equivalent to finding a global minimum in a flat subspace. We instead adopt a discussion-driven approach to explore the complex data, incorporating additional astrochemical constraints. On the one hand, we take into account observational reports from studies of relevant astrophysical environments such as other comets or the ISM. On the other hand, we assume a bottom-up chemistry, where (organic) molecules are built up from atomic building blocks in the first place, which should lead to decreasing abundance with increasing size of the molecule, a trend reflecting in the DFMS data overviewed in Fig. 1 in Hänni et al. (2022). Bottom-up chemical reaction networks are preferentially used to explain COMs in the ISM but are debated for larger carbon-based species such as polycyclic aromatic hydrocarbons (Lee et al. 2019). The credo of a bottom-up logic is that high abundances of a complex species do not make sense if simpler species in the bottom-up formational pathway have much lower abundances. We also favour bottom-up chemistry intermediates where compatible with our data. Guided by Occam’s razor (Clauberg 1654) and in combination with the above constraints, we arrive at a non-unique set of cometary COM candidates that can explain the DFMS data. This set is not only non-unique but it almost certainly underestimates the true chemical complexity for COMs with many isomers. Nevertheless, it sets a lower limit to the chemical diversity and complexity. More important though, is a thorough discussion of every observed parent sum formula and potential candidate molecular structures as far as the available reference data allows. Therefore, we do not compile a list of the selected candidates but rather refer to our Sect. 3. We assign each selected candidate molecule with a level-of-confidence indicator (LCI) that could also be viewed as a degeneracy indicator: 1 = maybe present, 2 = likely present, 3 = with great certainty present. Notably, our LCIs are determined with respect to the available reference data. This means that a certain candidate is assigned with LCI = 3, if it is the only isomer with reference data in NIST and is in reasonable agreement with our data. It still may be possible, though, that other isomers, for which no reference fragmentation data exists, would be an even better match. Specific caveats and biases resulting from our approach and the application of Occam’s razor to deconvolve DFMS mass spectra will be discussed thoroughly in Sect. 4. We also address some limitations of the method of EI-MS itself to capture organic complexity.

3 Results

Figs. 2 and 3 show our non-unique Occam’s razor-based deconvolution results for the CmHnN and CmHnNO sum spectra observed by Rosetta’s DFMS on 3 August 2015, accounting for all relevant signals. Our analysis assumes these two subsets of data to be largely independent, which seems justified for relative abundance reasons discussed in detail previously (Hänni et al. 2022, 2023). However, certain interdependences are expected as many CmHnN and CmHnNO parents have fragments of other classes than CmHnN and CmHnNO, namely when heteroatom-bearing fragments are lost. For instance, CmHnNO parents often yield CmHnN, CmHnO, or CmHn type fragments (and not just CmHnNO type fragments) and we take such inter-dependences into consideration and discuss them where needed. We applied Occam’s razor starting from the smallest molecules with the least number of isomers and hence the least ambiguity and this is also how we present the results here. For every parent sum formula, we discuss potential candidate molecular structures with available reference spectra and assign the most suitable one with an LCI as described above. Notably, fully saturated species are always parents as such fragments are rare and if produced then normally lost as neutral molecules. Generally, neutral parent molecules of the CmHnN and CmHnNO type must have an odd nominal mass, even mass representatives are always fragments. From the candidate selected to represent a parent, we use the sum over all observed ions from the reference spectrum to derive the fragment sum relative to methanol (FSRM; in%). We have chosen methanol because the investigated data captures outgassing predominantly from grains rather than from the nominal, water-dominated gas coma and because this simple COM is a common reference species in observations of various astrophysical environments including the ISM (Hänni et al. 2023).

thumbnail Fig. 2

Occam’s razor-based deconvolution result of the subset of signals associated with CnHmN parent and fragment species detected by the DFMS on 3 August 2015. The measured signals (measured) are given in arbitrary units (a.u.) with 30% error margins. The different contributions of the chosen candidate molecules are colour-coded according to their molecular structure – in shades of red and purple for chain-based molecules and in shades of green and blue for cyclic molecules.

thumbnail Fig. 3

Occam’s razor-based deconvolution result of the subset of signals associated with CnHmNO parent and fragment species detected by the DFMS on 3 August 2015. The measured signals (measured) are given in arbitrary units (a.u.) with 30% error margins. The different contributions of the chosen candidate molecules are colour-coded according to their molecular structure – in shades of red and purple for chain-based molecules and in shades of green and blue for cyclic molecules.

3.1 CnHmN species

–HCN (m/z = 27): hydrogen cyanide (LCI = 3; FSRM = 81%) is identified unambiguously via M and M-H. However, a contribution of its isomer HNC cannot be excluded as the isomer’s fragmentation pattern is unknown. Both isomers have been reported to be present in cometary outgasing from ground-based observations of various comets. For comet Hyakutake, an HNC/HCN ratio of ≈0.06, which is similar to that observed in quiescent molecular clouds, was presented as evidence for ISM inheritance of cometary ices (Irvine et al. 1996). However, the ratio of isomers seems to vary with cometocentric and heliocentric distances, suggesting multiple sources or chemical processes at play (Biver et al. 1997; Rodgers & Charnley 1998; Agúndez et al. 2014). The debate is ongoing.

–CH3N (m/z = 29): methanimine (LCI = 2; FSRM = 5.1%) is identified via M and M-H under assumption of a fragmentation analog to formaldehyde. The fragmentation pattern of metha-nimine has not been measured to the best of our knowledge.

–CH5N (m/z = 31): methylamine (LCI = 3; FSRM = 17%) is identified unambiguously via M, M-H, M-2H, and M-3H. Notably, methylamine does not have any structural isomers which is why we can exactly determine its contribution to the iminium ion signal on m/z = 30.

–C2H3N (m/z = 41): acetonitrile (LCI = 3; FSRM = 14%) is identified via M and M-H. But the EI-MS fragmentation pattern of acetonitrile is indistinguishable from that of its structural isomer, methylisocyanide. We assign all intensity to acetonitrile but it could be assigned as well to methylisocyanide or any mixture of the two.

–C2H5N (m/z = 43): aziridine (LCI = 3; FSRM = 8.3%) is identified based on the M-H/M ratio and the M-CH3 fragment. Ethanimine might be a reasonable alternative if it fragments similarly to methanimine/formaldehyde, which produces quite strong M and M-H signals. Unfortunately, no EI-MS reference data are available. According to McGuire (2022), aziridine was detected tentatively in the ISM while ethanimine was conclusively identified (also see discussion below).

–C2H7N (m/z = 45): ethylamine (LCI = 2; FSRM = 13%) is identified via M, M-H, and M-CH3. Dimethylamine, the only isomer of ethylamine, is a possible alternative candidate. Dimethylamine would almost fully account for the C2H6N fragment on m/z = 44 via its M-H fragment, just as ethylamine fully accounts within error margins for the residual CH4N intensity on m/z = 30, i.e. the intensity that is not explained already by methylamine. Since both these fragments are characteristic for the whole class of amines, the abundance of amines with 3 or more C atoms is strongly limited, no matter which isomer is chosen for C2H7N. Ethylamine but not dimethylamine has been detected in acid-hydrolyzed samples returned by NASA’s Stardust mission from comet Wild2 (Glavin et al. 2008) and in hot water extracts of Ryugu samples (Naraoka et al. 2023) (for more details, see Sect. 4.2) and Altwegg et al. (2016) have reported the presence of ethylamine in comet 67P. In the ISM, a tentative detection of ethylamine towards TMC-1 (Zeng et al. 2021) and a non-detection of dimethylamine towards the same source (Müller et al. 2023) unfortunately do not constrain the relative abundance ratio of the two isomers much. However, in experimental work on irradiated ices, ethylamine shows a better compatibility with the data than its isomer (Foerstel et al. 2017). Therefore, we choose ethylamine for our solution while acknowledging that this choice influences the selection especially of higher-mass amines.

–C3HN (m/z = 51): cyanoacetylene (LCI = 2; FSRM = 1.6%) is identified via M and M-H. However, the M-H/M ratio does not fit perfectly and it cannot be excluded that the structural isomer isocyanoacetylene would produce a better fit. Unfortunately, no fragmentation data of isocyanoacetylene are available. Both these isomers, as well as the isomer of the structure HNCCC, have been reported in the ISM (McGuire 2022; Cernicharo et al. 2020). An alternative explanation for the underpopulated M-H signal on m/z = 50 could be the presence of the C3N radical (cyanoethinyl; see further discussions in Sect. 3.3).

–C3H3N (m/z = 53): 2-propenenitrile (LCI = 3; FSRM = 4.3%) is identified via M, M-H, and M-2H. However, the M-H/M ratio does not fit well, which might indicate the (additional) contribution of other isomers. But no fragmentation data are available for any of them.

–C3H5N (m/z = 55): ethylisocyanide (LCI = 2; FSRM = 15%) is identified via M and M-H as the most likely candidate. However, it cannot be distiguished well from N-methyleneethenamine, which has a very similar fragmentation pattern. Propanenitrile or propargylamine yield substantially more M-H and a contribution is needed to explain the observed M-H signal (LCI = 2; FSRM = 9.2%; considering the fragmentation pattern of propanenitrile). The dehydrogenated four-membered ring, 1-azetine, as well as vinylimine do not have reference fragmentation data on NIST.

–C3H7N (m/z = 57): a combination of 1-methylaziridine (LCI = 3; FSRM = 4.4%) and 2-methylaziridine (LCI = 3; FSRM = 1.1%) best explains the observed intensity via M, M-H, and M-CH3. Cyclopropylamine and 2-propen-1-amine do have compatible M/M-H ratios, but they fail to explain the residual intensity on m/z = 42, which is assigned to a fragment after loss of a methyl-group (especially strong in the fragmentation pattern of 1-methylaziridine). Azetidine neither explains the intensity on m/z = 42 nor shows a compatible M/M-H ratio which strongly constrains its presence.

–C3H9N (m/z = 59): a combination of N,N-dimethylmethylamine (LCI = 2; FSRM = 3.4%) and 2-propanamine (LCI = 2; FSRM = 1.1%) suitably explains the observations. The former mainly yields M and M-H, while the latter yields M-H and M-CH3. One alone cannot reproduce the M/M-H ratio. Propylamine is not favourable as the observed signal on CH4N, the typical primary amine EI-fragment on m/z = 30, is already well explained and a substantial abundance of larger primary amines is hence unlikely.

–C4H5N (m/z = 67): cyclopropanecarbonitrile (LCI = 1; FSRM = 5.9%) is the most likely candidate, identified via M, M-H, M-2H, and M-3H, where M-H is stronger than M, although several other isomeric nitriles produce a similar M/M-H ratio. This group of signals could also contain some pyrrol, which, however, has a strong M but no M-H and is hence less favourable according to Occam’s razor. The unexplained intensity at M-2H (C4H3N) and especially at M-3H (C4H2N) might indicate a missing parent with the sum formula C4H2N. Further details will be discussed in Sect. 3.3 later on.

–C4H7N (m/z = 69): 1-pyrroline (LCI = 2; FSRM = 1.3%) is the best suitable candidate. In this case, the reference mass spectrum has been taken from Connections (2024) as it is not available from NIST. 1-Ethenylaziridine, reproduces the M/M-H ratio and additional fragments reasonably well. Therefore, it is a viable alternative.

–C4H9N (m/z = 71): pyrrolidine yields relevant M and M-H signals and a base peak on m/z = 43, which corresponds to C2H5N and forms through cleavage of one C–C bond adjacent to the N atom and subsequent ethylene elimination (Spiteller 1967). However, C4H8N (M-H) cannot be sufficiently explained, calling for additional contributions. Instead of a combination from multiple parents, Occam’s razor suggests consideration of 1-acetylpyridine (a CnHmNO species, which will be discussed below). 1-Acetylpyridine sufficiently covers C4H8N via its M-Ac (Ac = acetyl group) fragment. But a small contribution of pyrrolidine or other C4H9N isomers that yield a relevant M signal is likely.

–C4H11N (m/z = 73): N-methyl-2-propanamine (LCI = 2; FSRM = 3.8%) is a good candidate. N-Ethyl- or N,N-dimethylethanamine are less likely because the respective ratios of M, M-H and M-CH3 are less compatible with the observations. Minor contributions cannot be ruled out, though.

–C5H5N (m/z = 79): pyridine (LCI = 3; FSRM = 3.5%) is identified via M and absence of M-H. 2,4-Pentadienenitrile, the only isomer with an available reference mass spectrum, has a considerably less prominent M signal and is hence less likely. The M-H signal should be about 12% with respect to the M signal for pyridine and even more for 2,4-pentadienenitrile. However, as C5H4N is hidden under the strong C6H6 peak on m/z = 78, only an upper limit of roughly 15% (with respect to M) can be estimated, compatible with pyridine. Both species yield a pronounced fragment on m/z = 52, which likely corresponds to M-HCN and hence to C4H4. Indeed, C4H4 is still underpopulated as can be seen in supplementary Fig. 1 in Hänni et al. (2022).

–C5H7N (m/z = 81): 1-methyl-1H-pyrrole (LCI = 1; FSRM = 1.4%) is selected as it is compatible with the observations in terms of M and M-H, given an additional contribution of 2-ethyl-1H-pyrrole to C5H6N via its M-CH3 fragment (see below). Alternatively, there are molecules that fit the observed M/M-H ratio better, for example pyrroles with the methyl group on positions 2 and 3 instead of 1 (N). Isomers with nitrile functionality expose M/M-H ratios similar to that of the N-substituted pyrrol but, in addition, a clear signal on m/z = 54. Probably HCN is lost, resulting in a C4H6 fragment on m/z = 54. However, in supplementary Fig. 1 in Hänni et al. (2022) C4H6 is already slightly overpopulated be beyond the error margin, making nitriles less likely candidates. For 1,4-dihydropyridine no fragmentation data are available but it may be a good candidate, given the fact that tetrahydropyridine is a good candidate as well and the presence of molecules with different degrees of hydrogenation has been previously reported (Hänni et al. 2022, 2023).

–C5H9N (m/z = 83): 1,2,3,6-tetrahydropyridine (LCI = 2; FSRM = 9.2%) is identified via M and M-H. The double bond might not be on the N atom as the resulting fragmentation pattern, with an absent M-H fragment, is less compatible. Most isomers, such as isocyano, nitrile, or amine species, do not yield relevant M. Only the ternary amine N,N-dimethyl-2-propyn-1-amine would be a possible alternative.

–C5H11N (m/z = 85): piperidine (LCI = 2; FSRM = 6.0%) explains the observed M and M-H signals well, while its other fragments are reasonably compatible with the observations. 1-Methylpyrrolidine would show strong signals on m/z = 57 (C4H9 from N–C-bond alpha-cleavage) and m/z = 42 (C3H6 from ring-cleavage). Both these signals are already overpopulated in supplementary Fig. 1 in Hänni et al. (2022), which renders the N-substituted pyrrolidine incompatible. 2-Methylpyrrolidine as well as N-ethyl-2-propen-1-amine yield strong M-CH3 signals on m/z = 70 that cannot be accommodated in the observed data. While secondary and tertiary amines such as N-allyl-N,N-dimethylamine show favourable M/M-H ratios, they overpopulate the observed C3H8N signal on m/z = 57. A contribution of 3-methylpyrrolidine cannot be ruled out.

–C6H7N (m/z = 93): 2-methylpyridine (LCI = 1; FSRM = 5.9%) is selected as it is compatible with the data in terms of M, M-H, and M-CH3, considering that the M-CH3 fragment of the selected C7H9N parent (4-ethyl-pyridine) is coincident with M-H of 2-methylpyridine. However, other molecules such as N-2-propynyl-2-propyn-1-amine (also known as dipropargy-lamine), yielding larger M-H signals, would lead to a better fit if for C7H9N (see below) a parent was selected that does not yield M-CH3. Furthermore, the position of the methyl group is badly constrained from the fragmentation pattern alone. Aniline, also known as phenylamine, does not fit as well due to its relatively small M-H signal. A contribution cannot be ruled out, though.

–C6H9N (m/z = 95): a combination of 2-ethyl-1H-pyrrole (LCI = 2; FSRM = 0.7%) and 2,4-dimethyl-1H-pyrrole (LCI= 2; FSRM = 3.5%) explains well the observations in terms of the M, M-H, and M-CH3 signals, the latter being produced only from the ethyl-substituted pyrrole. The ethyl-substituted pyrrole yields more M-H while the dimethyl-substituted one yields more M. From the isomers with reference mass spectra on NIST, only the 1H-pyrrole derivatives produce reasonably strong M signals and hence might contribute while all other isomers do not have strong M and, hence, are not suitable candidates.

–C7H5N (m/z = 103): benzonitrile (LCI = 2; FSRM = 6.0%) or 2-ethynylpyridine are both compatible with the observations and cannot be distinguished based on their fragmentation patterns. Hänni et al. (2022) have previously suggested the presence of benzonitrile and written that other isomers are less likely. However, within error margins, the two said isomers are indistinguishable. No reference data are available for other structural isomers. Benzonitrile was detected towards TMC-1 (McGuire et al. 2018).

–C7H9N (m/z = 107): 4-ethylpyridine (LCI = 2; FSRM = 8.6%) is identified via M, M-H, and M-CH3. The M/M-H ratio fits perfectly. Alternatively, a secondary amine on an aromatic ring (Ar) could lead to Ar–NH–CH2 on m/z = 106. Therefore, Hänni et al. (2022) have suggested the presence of benzylamine. However, the M/M-H ratio of benzylamine is less suitable than that of 4-ethylpyridine and other alkylated pyridine derivatives. Only the isomers with more M-H than M, or no M and M-H at all, can be deemed incompatible with Occam’s razor. Benzy-lamine is a possible minor contributor but it is more likely that the N atom is in the ring of a 6-membered heterocycle. C7H9N is the largest N-bearing species observed in this dataset.

3.2 CnHmNO species

–CHNO (m/z = 43): isocyanic acid (LCI = 3; FSRM = 14%) is identified via M and M-H. The reference mass spectrum, which is not available from NIST, was taken from Hand & Bogan (1971). For its structural isomer, cyanic acid, the Pub-Chem database (For Biotechnology Information 2024) reports the liquid chromatography-mass spectrometry (LC-MS) spectrum, which was recorded using electro-spray ionization (ESI). Hence, this spectrum contains an M+H signal on m/z = 44. Nevertheless, the LC-MS spectrum indicates that the M/M-H fragment ratio of cyanic acid might be less compatible with our observations. For other structural isomers no reference data are available.

–CH3NO (m/z = 45): formamide (LCI = 3; FSRM = 12%) is unambiguously identified via M and M-H. The CH4N fragment (iminium ion) on m/z = 30 in the reference spectrum of the structural isomer nitrosomethane cannot be accommodated. For the third isomer, formadehyde oxime, no reference data are available. While formamide is a common molecule in the ISM, none of its isomers has been reported to date (McGuire 2022).

–C2H3NO (m/z = 57): isocyanatomethane (LCI = 3; FSRM= 2.9%) is identified via M and M-H. It is likely that the fragments of isocyanatopropane contribute to the m/z = 57 and 56 signals (see further discussion below) as loss of the carbonyl function is probably negligible compared to loss of the terminal ethyl group. This makes the presence of hydroxyacetonitrile unnecessary according to Occam’s razor. For other isomers, no reference mass spectra are available.

–C2H5NO (m/z = 59): acetamide (LCI = 2; FSRM = 4.6%) is identified via a strong M, an absent M-H, and a very strong M-CH3 signal. M-NH2 and M-NH3 fragments on m/z = 43 and 42, respectively, do not appear in Fig. 3 as the N is lost. The other isomers, for which reference data are available, are less likely, except N-methylformamide, which potentially contributes to the observed signals and also has been reported to be present in the ISM (Belloche et al. 2019).

–C3H3NO (m/z = 69): oxazole (LCI = 2; FSRM = 1.3%) is identified via strong M and negligible M-H. An alternative is propiolamide, which also yields a strong M but no M-H signal. However, this molecule’s base peak on m/z = 53 is ambiguous and no high-resolution data are available as far as we know. Thus, we cannot make clear statements in this case. Isoxazol, in contrast, can be ruled out based on its strong M-H fragment. However, C3H2NO, corresponding to M-H, is observed, but it is already explained by the M-CH3 fragment of 2-methyloxazole. We further discuss such entanglements below.

–C3H7NO (m/z = 73): propanamide (LCI = 2; FSRM = 9.9%) is chosen because its M, M-H, and M-CH2CH3 signals fit well. N,N-Dimethylformamide also fits well based on its M and M-CH3 signals. According to Li et al. (2016), its strong fragment on m/z = 44 corresponds to M-CHO (C2H6N) and the one on m/z = 42 to M-CH3O (C2H4N). However, especially the observed C2H6N signal in Fig. 2 is already overpopulated, which limits the contribution of N-dimethylformamide. The fragments of the other isomers with reference mass spectra do not fit well and they are thus unlikely candidates.

–C4H5NO (m/z = 83): 2-methyloxazole (LCI = 2; FSRM = 21%) is identified via M, M-H, and M-CH3. The reference spectra for several oxazoles can be retrieved from spectrabase (Connections 2024) as on NIST only isoxazole derivatives are available and are less compatible with Occam’s razor as they only yield M signals. The position of the methyl group, however, is not well defined as the reference spectra are similar. The high FSRM might indicate that several different oxazole species are simultaneously present as the effective abundance of specific COMs should decrease with increasing number of atoms. Nevertheless, we abstain from introducing more isomers with the methyl group in a different position, for example, and instead just note that if the FSRM was distributed on three parent molecules, it would better align with the expected abundance decrease.

–C4H7NO (m/z = 85): 2-pyrrolidinone (LCI = 2; FSRM = 6.6%) yields M and M-H signals. High-resolution data are available from Duffield et al. (1964). Further isomers with reference spectra can be ruled out based on their fragmentation patterns mismatching the observations. However, most of the observed M-H intensity is attributed to the M-CH3 fragment of the C5H9NO parent 5-methyl-2-pyrrolidinone (see below). This creates an entanglement and, hence, we do not assign LCI = 3 in this case.

–C4H9NO (m/z = 87): N,N-dimethylacetamide (LCI = 1; FSRM = 12%) is identified via its strong M and M-CH3 signals. Moreover, the strong signal on m/z = 44 corresponds to M-Ac and hence to C2H6N. This fragment appears in Fig. 2. N-Methylpropanamide or O-methyloxime-2-propanone also yield M but no M-H and hence could be viable alternative candidates. N-Ethyl-acetamide is not likely present due to a strong iminium ion signal which cannot be accommodated. Morpholine and 2-butanoneoxime are possible candidates (they make M and 30–40% M-H), if N,N-diethylformamide (the parent selected for C5H11NO) is replaced with a molecule that does not yield M-CH3. This interference of M-CH3 fragments of heavier species with M-H fragments is observed in various places and makes the interpretation of the mass spectrum more ambiguous (see further discussion below). Other isomers with available reference spectra do not yield relevant M signals and hence are not likely candidates.

–C5H7NO (m/z = 97): 3,5-dimethylisoxazole (LCI = 2; FSRM = 11%) yields compatible M and M-CH3 signals and no M-H signal. All other isomers with reference data are less compatible as they either produce too much M-H or too little M-CH3. Notably, the observed intensity on C4H4NO, i.e. the M-CH3 fragment of the selected candidate 3,5-dimethylisoxazole, is still underexplained but we did not find a more suitable combination of molecules. Contributions of the oxazole derivatives among the isomers cannot be ruled out.

–C5H9NO (m/z = 99): 2-piperidinone (LCI = 2; FSRM = 8.3%) is compatible with its strong M and negligible M-H signals but its FSRM would be too high. We therefore introduce an additional parent: 5-Methyl-2-pyrrolidinone (LCI = 2; FSRM = 5.4%). This molecule can appropriately explain the residual M and M-CH3 signals. All isomers with reference spectra that do not yield a substantial M signal or have a relevant M-H fragment are not compatible candidates.

–C5H11NO (m/z = 101): N,N-diethylformamide (LCI = 2; FSRM = 15%) has suitable M and M-CH3 signals. Its base peak is the resonance-stabilized CH4N fragment ion on m/z = 30, which is accommodated in Fig. 2. N,2-Dimethylpropanamide, which could explain M and M-CH3, too, yields too much C2H4NO on m/z = 58. N,N-Dimethylpropanamide is an appropriate alternative candiate. Given the high FSRM of the for-mamide derivative, the presence of an additional parent species seems rather likely. Based on the non-observation of the M-H signal and the already fully explained CH4N fragment ion, all other isomers with a relevant M signal have been ruled out.

–C6H9NO (m/z = 111): trimethyloxazole (LCI = 2; FSRM = 7.0%) is a suitable candidate because of its M, absent M-H, and M-CH3 signal. The isoxazol isomer shows a strong M-CH3 signal, which makes it less favourable. A good alternative is 1-ethenyl-2-pyrrolidinone, which only yields one strong signal beside the M signal, namely, a CHO-fragment (C3H4O) on m/z= 56 from the ring-opening reaction. Notably, there are many isomers that do not have reference data.

–C6H11NO (m/z = 113): 1-acetylpyrrolidine (LCI = 3; FSRM = 8.4%) yields M and M-Ac but no M-H and no M-CH3. The M-Ac fragment ion shows in Fig. 2 on m/z = 70. Unlike with the iminium fragment, the acetyl fragment ion can be accommodated within the signal’s error margins as the signal has a very high intensity (see Fig. 2 in Hänni et al. 2023). All other isomers with reference mass spectra and relevant M signals can be ruled out based on the M-H, the M-CH3, and/or the iminium ion fragment that cannot be accommodated.

–C7H11NO (m/z = 125): 2-acetyl-1,4,5,6-tetrahydropyridine (LCI = 1; FSRM = 7.0%) yields M, negligible M-H, and no relevant M-CH3. M-Ac on m/z = 82 shows up in the Fig. 2. On m/z = 97 (C5H8NO), no signal has been measured, which disfavours 2-acetyl-3,4,5,6-tetrahydropyridine. 1-Acetyl-1,2,3,4-tetrahydropyridine, on the other hand, has a similar spectrum to 2-acetyl-1,4,5,6-tetrahydropyridine. Hence it could contribute to the observed intensity as its additional signal on m/z = 83, likely the C4H5NO fragment after ring opening and subsequent loss of C3H6, can be accommodated within the observed signal’s error margins. All other isomers with reference mass spectra and relevant M signals can be ruled out based on the M-H, the M-CH3, the M-C2H6, and/or the iminium ion fragment that cannot be accommodated.

3.3 Unexplained intensity

The measured intensities that remain unexplained with our Occam’s razor-based spectral deconvolution procedure, i.e. blanks in Figs. 2 and 3, are mostly attributed to species with a small number of hydrogen atoms. For the CnHmN species, these are: C3N (m/z = 50) or C4H2N (m/z = 64). The former signal might be associated with the cyanoethinyl radical, first detected tentatively towards CW Leonis (Infrared Catalogue +10216; Guelin & Thaddeus 1977) and later confirmed towards TMC-1 (McGuire 2022). There is no signal registered in the analysed DFMS dataset at the position of the next species in this carbon-chain series, the cyanobutadiynyl radical (C5N), which might be explained by a decreasing abundance with an increasing carbon-number. In the ISM, this molecule has been detected towards TMC-1 by Guelin et al. (1998). For a thorough review of the carbon-chain chemistry in the ISM (see e.g. Taniguchi et al. 2024). The underexplained C4H2N signal might be associated with M-H of cyanopropyne or cyanoallene or another structural isomer of the sum formula C4H3N. According to McGuire (2022) and literature therein, cyanopropyne (also known as methylcyanoacetylene) and cyanoallene have both been detected towards TMC-1. Recently, also the 1-cyano (Cabezas et al. 2025) and 3-cyano propargyl radicals (Cabezas et al. 2021) were reported towards TMC-1.

Of the CmHnNO species, C2HNO (m/z = 55) is clearly underpopulated. This likely indicates an additional parent that yields an M but not an M-H signal. Possible structures with some chemical relevance are cyanoformaldehyde, iminoethenone, nitrosoacetylene, ketoaziridine, or oxoacetonitrile. However, no EI-MS reference data for those species are available. Cyanoformaldehyde (IUPAC name: formyl cyanide) has been detected towards the star-forming region Sagittarius B2(N) with the Green Bank Telescope (Remijan et al. 2008). A very minor but unaccounted for signal of C3H8NO on m/z = 74 might be an M-H fragment of a C3H9NO parent that does not yield a relevant M signal. However, this signal’s intensity is close to the instrument’s detection limit and no reliable statement can be made.

These observations for unaccounted CmHnN and CmHnNO species are consistent with previous reports for 67P’s pure hydro-carbons (Hänni et al. 2022) and O-bearing hydrocarbons (Hänni et al. 2023), for which it has been suggested already that residual intensity is likely associated with radicals and other less stable and more reactive species without reference spectra. However, future systematic investigations of cometary data are needed to obtain a more comprehensive picture of hydrogen-deficient species.

In addition, some characteristic EI-fragments seem to be slightly underpopulated. One example is the typical, resonance-stabilized nitrile fragment NH–C–CH2 on m/z = 41, which results from McLafferty rearrangements in nitriles whose α-carbon atom is not branched. However, also the two neighbouring signals show the same slight underpopulation which might rather indicate missing parent species. Considering the fact that the DFMS uses a 45 eV ionization voltage as compared to 70 or 75 eV used for the standard reference mass spectra collected in the NIST database, we expect a tendency that our instrument yields relatively less fragments. Consequently, stacking of NIST spectra might result in a systematic overpopulation of typical fragment signals. Indeed, we observe an overpopulation for certain characteristic fragments, for example the iminium ion on m/z = 30 or the C2H6N fragment on m/z = 44. A thorough calibration campaign may shine more light on the extent of this effect. Previous calibration work for a series of pure and O-bearing hydrocarbons, however, has shown that deviations of the DFMS spectra from the NIST ones are usually below or around 10% of the relative signal intensity (Schuhmann et al. 2019b, a).

4 Discussion

Here, we comment on our results and discuss methodological caveats and limitations of EI-MS. Furthermore, we review the different chemical functional groups observed and contex-tualize our findings by comparing them to other reservoirs of extraterrestrial N- and NO-bearing organics.

4.1 General interpretation and caveats

Generally, the local abundance of molecules is expected to decrease with increasing molecular weight for combinatorial reasons (under assumption of a bottom-up chemistry) and for reasons of decreasing volatility. This means that more complex species are not only expected to form less abundantly due to longer formation pathways, but also to be less detectable via gas phase as they do not easily sublimate anymore. This is well consistent with our observations of decreasing signal intensities and increasing number of gaps in homologous peak series in the investigated dataset overviewed in Fig. 1 in Hänni et al. (2022). However, the CmHnN and CmHnNO species appear to be rather exceptional. While the CmHnN species have an intensity dip when m = 4, the CmHnNO species rather have an intensity peak when m = 4. In the following, we attempt a discussion of this observation:

For the CmHnN signals, our Occam’s razor-based solution, presented in Sect. 3, suggests that signals of species with m ≤ 4 are mostly associated with chain-based molecules with preferably amine or nitrile functionality (Fig. 2 shades of red) while species with m > 4 are favourably associated with heterocycles (Fig. 2 shades of green and blue). Obviously, a minimum number of three atoms is needed to form a ring. The stability of cyclic species is influenced by factors including ring strain and aromaticity. Ring strain is commonly understood as a combination of angle strain and torsional strain. The angle strain is minimal when the molecular orbitals of a molecule overlap in a favourable way, which is for instance the case in 6-membered rings. Aromaticity generally increases the stability of a molecule as it lowers the energy of the molecular orbitals. Notably, the N atom can contribute to aromaticity with its electron lone pair. This property seems to be more pronounced in 5-membered rings as compared to 6-membered rings (Zong et al. 2020). Our solution comprises several pyrrole and pyridine derivatives, which are aromatic 5- and 6-membered N-heterocycles with contribution of the N lone pair electrons. Generally, our solution suggests that the N atom is favourably located within the ring to form a heterocycle rather than an N-functionalized carbocycle which could be due to said stability gain. While our solution features 3-, 5-, and 6-membered rings, it does not contain any 4-membered rings. It must be said, though, that for the 4-membered N-heterocycles with a double bond, 1- and 2-azetine, reference spectra are not available. Azetidine, without a double bond, is not a favourable candidate based on its fragmentation pattern. The methylated 3-membered heterocycles yield a better fit to the data. Why 3-membered but not 4-membered hetero-cycles should be observed is chemically not obvious. However, Chuang et al. (2024) recently reported the tentative detection of a 3-membered ring, cyclopropenylidene, in several comets. This supports the evidence for 3-membered rings in this dataset, both in the form of 3-membered carbocycles (Hänni et al. 2022) and of 3-membered heterocycles (this work). In addition, we note an analogy between the methylated 3-membered and the methy-lated 5-membered heterocycle, which, according to our solution, both seem more likely than the respective 4-membered and 6-membered symmetrical heterocycles. This could imply preferred formation of the chiral isomer.

The picture we find for the CmHnNO species is similar when it comes to the position of the heteroatoms N and O. Both, the N and the O atom, seem to favourably participate in ring-formation, which yields molecules such as oxazole and isoxazole derivatives. Furthermore, our solution also suggests the presence of O-functionalized N-heterocycles, while the inverse is less compatible with the observations. The reason for the peak in signal intensity for CmHnNO species where m = 4, however, is unclear. The best candidates for this group of signals are methylated heterocycles or functionalized carbocycles. Alkylation on both chains and cycles seems to be abundantly present. Especially methylated chains and rings are frequently the most suitable candidates. We note that the solution with a methylated 5-membered heterocycle (where both N and O are in the ring) is more favourable than the solution with a 6-membered heterocycle (where both N and O are in the ring). Ethylation appears to be less common and propylation is not represented at all according to our solution. This result is intuitively compatible with a bottom-up formation of the observed ensemble of organics.

While our Occam’s razor-based approach provides fundamental insights into cometary molecular complexity, the existence of isomers leads to a degeneracy in the parameter space, and the following methodological caveats require thorough consideration: first, the obtained solution is non-unique as there is some entanglement. The most striking example is the M/M-H ratio, which we often used to determine the most suitable candidate to explain the observed M and M-H parent signals. However, this ratio may not always be a good indicator as fragments of larger species, most frequently M-CH3 fragments, could interfere. We indicated and discussed these cases in Sect. 3 above. On the other hand, there are also helpful cases where, for instance, the non-observation of M-H or M-CH3 fragments can be used to rule out certain isomers. Especially for larger COMs, certain isomers can be clearly ruled out, while still more than one isomers remain compatible with the observations.

Second, some groups of molecules are not favourably identified via EI-MS of complex gas mixtures as they do not yield strong M signals or no M signals at all. Our analysis, however, requires the presence of an M signal for an unambiguous identification as we must explain the signals of the observed parent species. Moreover, fragments alone do not provide enough constraints as they can always be associated with various parents. Species with missing or weak M are for example amines, especially primary, linear ones with three or more carbon atoms, irrespective of where the amine function is located. But also carboxamides usually have fragment signals as base peak. For most radical species no reference mass spectra are available and we do not know whether or not these species yield strong M signals. Despite the fact that molecules showing weak or absent M signals (i.e. only fragment signals) might evade identification, their total contribution cannot exceed the fraction of the fragment signals that remains unexplained in our solution, plus the error on the observed signals. Most chemical functional groups yield characteristic fragments, such as m/z = 30, 44, and 58 for chain-based amines, which should be underpopulated if parent molecules (and their fragments) are missing. Above, in Sect. 3.3, we have discussed that such characteristic fragments rather tend to be overpopulated and not underpopulated (with the exception of the nitrile fragment on m/z = 41). On the other hand, cyclic and especially aromatic species tend to produce strong M and M-H signals and hence are easily identifiable with EI-MS.

Third, Occam’s razor might not capture the true diversity of cometary organic matter, especially at higher masses as it aims to minimize the number of candidates used for the solution. For example, one ring molecule (explaining M and M-H simultaneously) is favoured over a combination of two chain-based molecules (explaining M and M-H separately). This might explain the observed underabundance of nitriles (which often do not yield M but just M-H) as inferred from the slight underpopulation of the typical nitrile fragment (NH-C-CH2) at m/z = 41. However, the chosen approach is the only way to identify specific molecules in the cometary coma through spectral deconvolution. Alternatively, statistical approaches, as pursued for instance in Ruf & Danger (2022) or Varmuza et al. (2020), only describe the properties of the whole ensemble of organics. While the application of such methods allows the extraction of unbiased information on the organic complexity, we are convinced that our Occam’s razor-based method has its own merit as it allows us to place constraints on many individual candidate species. Moreover, we complemented Occam’s razor with naive bottom-up chemistry considerations. In a few cases, we thus favoured a solution with two candidates over a solution with just one candidate if that one candidate would have an unreasonably high FSRM. Where we found that multiple isomers cannot be distinguished based on their fragmentation patterns and the selected candidate shows an unreasonably high FSRM, we stated that its FSRM probably should be understood rather as a total FSRM of multiple isomers.

4.2 Contextualization

In the following, we strive to embed our findings into the bigger picture of the evolution of organic material in space – from its cradle in the ISM towards the biochemistry of carbon-based life on Earth. The role of comets in this picture is two-fold:

(1) Comets contain some of the most pristine Solar System matter and it has been shown that they share many low-mass volatile species with astrophysical environments in the ISM, including N-bearing ones such as hydrogen cyanide, acetonitrile, cyanoacetylene, 2-propenenitrile, isocyanic acid, or methyl iso-cyanide (Bockelée-Morvan et al. 2000; Drozdovskaya et al. 2019; López-Gallifa et al. 2024), even in similar abundances relative to methanol. As of mid-2021, McGuire (2022) listed more than 50 N-bearing neutrals in the census of interstellar molecules. These are mostly radicals, nitriles, imines, and a few amides. A detailed understanding of the cometary ices is a crucial point of reference for ISM studies and vice versa. This also holds for more altered Solar System matter: The inter-comparison especially to C-type asteroids, containing original organic matter that underwent significant post-processing including liquid water chemistry, allows us to constrain chemical evolution. Until today, very limited evidence exists for the occurrence of aqueous alteration in comets (Lisse et al. 2006; Berger et al. 2011).

(2) Alongside interplanetary dust, meteorites, and asteroids, comets may have delivered substantial amounts of prebiotic species to the early Earth through impacts and hence may have helped to kick-start life (see e.g. Rubin et al. (2019b) and references therein). As N-bearing species are key players in prebiotic chemistry and eventually are incorporated in many biotically relevant molecules such as amino acids or nucleobases, they are a major target of the astro- and cosmochemistry communities. It has to be stated that molecular, i.e. soluble, organics from meteorites (Schmitt-Kopplin et al. 2010) as well as from samples returned from asteroids (Schmitt-Kopplin et al. 2023; Glavin et al. 2025) expose an extremely high molecular diversity, which is missed in targeted searches. Untargeted characterizations of the organic chemical complexity were also done for analog materials prepared in the laboratory, for example in ice irradiation experiments (Ruf et al. 2019; Ruf & Danger 2022; Kipfer et al. 2024) or in nebulotron-synthesized N-containing samples (Lévêque et al. 2024).

In Table 2, we summarize our findings by indicating the total FSRM of the various functional groups as determined from our Occam’s razor-based deconvolution approach. The total FSRM can be viewed as an estimate of the order of the magnitude of the abundances ratios. We group the molecules represented in our solution into different categories (cyclic versus acyclic; chemical functionality; organic versus inorganic). Before comparing our findings with both asteroids and meteorites as well as the ISM, we comment on the absolute relevance of the CnHmN and CnHmNO species. Therefore, we compare organic molecules with N to inorganic molecules with N observed with the DFMS. The most abundant N-bearing compound is ammonia (NH3), followed by hydrogen cyanide (HCN) or its isomer (HNC), isocyanic acid (HNCO) or its isomer (HOCN), molecular nitrogen (N2), and nitric oxide (NO). Note that because of its chemical properties, which are similar to those of inorganic cyanides, we follow the convention to assign HCN and HNC to the inorganic compounds, despite the fact they do contain a CH-bond, unlike HCNO or HOCN, among others. Nitrogen dioxide (NO2), another inorganic candidate species, would be hidden under the CO18O isotopologue peak of CO2, but its abundance is probably negligible. For an investigation of the bulk abundances of these common cometary volatiles in 67P, we refer to Rubin et al. (2019a). In the dataset investigated for this work, NH3 is significantly more prominent than methanol (FSRM = 172%). Rubin et al. (2019a) reported NH3 to be roughly three times more abundant relative to water than methanol around the comet’s inbound equinox when they come conditions were relatively stable. Especially in Jupiter family comets, to which also comet 67P belongs, the NH3 abundance relative to water seems to show a high variability (Dello Russo et al. 2016). Altwegg et al. (2020) have presented evidence for several ammonium salts sublimating from 67P’s dust and have argued that semi-volatile ammonium salts may be a reason for the heliocentric distance dependence of the NH3/H2O ratio. In data presented here, the inorganic N-bearing compounds, including NH3 as the most relevant volatile N carrier, make up for a total FSRM of 301% while organic N-bearing ones amount to a total FSRM of 292%. This suggests that organic CnHmN and CnHmNO species roughly equal the inorganic ones, at least when the cometary outgassing is including enhanced sublimation from ejected dust particles as is the case on 3 August 2015. Notably, a few organic species with multiple N or O atoms shown in Fig. 1 in Hänni et al. (2022) are not included in this estimation as they will be topic of future investigations. In terms of total FSRM, these species do not substantially change the balance between organic and inorganic N carrieres as it presents itself here, though.

Our findings impact the apparent cometary N deficiency, which, for comet 1P/Halley, was reported to be about a factor 5 compared to the Solar System N abundance value (see Fig. 1 in Geiss 1987). Geiss used an N abundance values for the Solar System from Anders & Ebihara (1982), but recent updates are available (Lodders et al. 2025). The total N abundance estimation for Halley included an amount of N from cometary volatiles observed at that time by the ion mass spectrometer aboard ESA’s Giotto spacecraft (Balsiger et al. 1986), mainly associated with HCN and NH3, and an amount of N from cometary dust measured by a dust impact-ionization mass spectrometer on the Soviet Vega-1 space probe (Jessberger et al. 1988). If now the gas abundance of N roughly doubles due to the here reported reservoir of organic N-bearing volatiles, this raises the total N abundance by a factor of about 1.5 and brings it closer to the Solar System value. If an additional reservoir of semi-volatile ammonium salts is considered, with evidence for 67P reported by Altwegg et al. (2020) and Poch et al. (2020), the total cometary N abundance might be similar to the Solar System one. However, this estimation is based on FSRM and hence affected by differences in the ionization cross-sections. Most likely, it overestimates the heavy COMs as compared to the lighter molecules as the former tend to have larger cross-sections (see e.g. Beran & Kevan (1969) and Kumar et al. 2019). Consequently, the importance of heavy COMs as a cometary N reservoir might be less pronounced as suggested in the above estimation. The N abundance, as well as its distribution between volatiles and refractories phases, including salts, is also topic of investigation for asteroids (Schmitt-Kopplin et al. 2023; Pilorget et al. 2024; Glavin et al. 2025) and may be a key to understanding post-accretional processing (see below).

Table 2

Total fragment sum relative to methanol (FSRM) of the different groups of volatiles from this work.

4.2.1 Comparison to meteorites and asteroids

Comparisons of our findings from cometary in situ data to those from analysis of collected meteorite samples or returned asteroid samples in laboratories on the ground must be made carefully. Chondrite analysis, for instance, involves solvent extraction and especially the use of acids leads to the hydrolysis of certain functional groups such as amides or lactams (Burton et al. 2012). Consequently, the studies without acid extraction step in the sample work-up protocol should yield results that are more comparable to ours. Nevertheless, chemical reactions in the liquid phase during extraction, even if the solvent is simply water, are difficult to rule out or constrain. For the Murchison meteorite, a well-studied CM-type chondrite, Jungclaus et al. (1976) report 8 µg aliphatic amines per gram meteorite while Aponte et al. (2014) reported 106 nmol/g free and 157 nmol/g acid hydrolyzed aliphatic amine content. Acid extraction increases the amine content as it hydrolyzes functional groups (e.g. amides) into amines (e.g., Sephton 2002). The reported numbers indicate an increase by a factor of about 1.5. The biggest contribution is methylamine, which is roughly five times more abundant than ethylamine, the second most abundant amine. In comet 67P, we identified the following aliphatic amines with reasonable to high certainty: methylamine (FSRM = 17%), ethylamine (FSRM = 13%), 2-propanamine (isopropylamine; FSRM 1.1%), N,N-dimethyl-methylamine (FSRM = 3.4%), and N-methyl-2-propanamine (FSRM = 3.8%). The FSRM decrease from methylamine to ethylamine seems less pronounced. While it is likely that the primary amine isomer is more abundant than the secondary one, as has been discussed in Sect. 3, their true ratio is difficult to constrain with mass spectrometry alone. Glavin et al. (2008) reported the presence of methylamine and ethylamine for acid-hydrolyzed hot water extracts of the cometary matter returned from comet Wild 2 by the Stardust mission (Brownlee et al. 2006). However, there was no significant detection of the free form, indicating, that these compounds were rather bound in some hdyrolyzable precursor. In the Murchison meteorite, the primary amine ethylamine was reported to be almost a factor of 5 more abundant than its secondary amine counterpart dimethylamine (Aponte et al. 2014). Generally, secondary amines seem to be significantly less abundant than primary ones in this meteorite (Aponte et al. 2014). For comet 67P, our solution suggests a similar conclusion: As we have argued above, the iminium ion on m/z = 30 (proxy for primary amines), the C2H6N fragment on m/z = 44 (proxy for secondary amines), and the C3H8N fragment on m/z = 58 (proxy for tertiary amines) allow us to place constraints on the total abundance of the primary, secondary, and tertiary amines, respectively. We observe, that the iminium ion is more abundant than the C2H6N fragment by approximately a factor of 3 but more abundant than the C3H8N fragment by only a factor of about 2. Notably, a direct comparison of the intensities of these characteristic amine fragments is not subject to biases of the chosen deconvolution strategy and applicable to any mixture of COMs analysed by EI-MS. A total of 25 aliphatic amines up to C6 have been found in the Murchison meteorite (Aponte et al. 2014) while in the Orgueil meteorite, a CI-type and hence more aqueously altered chondrite, only 12 aliphatic amines up to C4 could be detected (Aponte et al. 2015). Antarctic carbonaceous chondrites seem to possess the same suite of aliphatic amines as the Murchison meteorite (Aponte et al. 2016).

In fact, also N-heterocycles classify as secondary or tertiary (if the N is bonding to C only) amines, respectively. While we are planning to investigate and discuss nucleobases in a future study, we focus on simple N-bearing heterocycles with one N here: Over many decades, basic N-heterocycles have been searched for and studied extensively in meteorites, especially since the fall of the Murchison meteorite in 1969 (Folsome et al. 1971). Murchison meteoritic matter contains a series of purines, pyrimidines, quinolines, and pyridines, including alkylated derivatives (Sephton 2002). A recent review of the extensive meteoritic inventory of N-heterocycles by Martins (2018) shows a diversity ranging from pyrrolidine derivatives (5-membered rings), to purine (annulated rings), and lactam representatives (up to 8-membered rings). In our deconvolution solution, N-heterocycles make up for a total FSRM of 89% (36% are additionally O-functionalized), while NO-heterocycles make up for a total FSRM of 41%. Including another 12% attributed to carbocycles, a bit over a third (i.e. 142 out of 387%) of the total observed CnHmN and CnHmNO signal intensity is attributed to cyclic species, the rest to acyclic ones. Aponte et al. (2014) also reported the presence of the cyclic secondary amine pyrrolidine for the first time in meteoritic matter (from Murchison). While the evidence for the presence of pyrrolidine is insufficient, our data contains evidence for its dehydrogenated forms, i.e. pyrrolines and pyrroles, and potentially for its acetylated form. Our work indicates the presence of 3-, 5-, and 6-membered N-heterocycles, often alkylated or dehydrogenated or even aromatic. For 4-membered rings we find less compelling evidence, but they may be present at the expense of some methylated aziridine derivatives. These findings from this work are consistent with reports for 67P’s pure carbocycles, where the 4-membered carbocyles seem to be less abundant than the 3- and 5-membered ones (Hänni et al. 2022). The reason, however, is unclear as the ring strain is largest for the 3-membered rings. Possible synthetic pathways towards N-heterocycles in meteorites originate in an ammonium cyanide chemistry (Botta & Bada 2002; Callahan et al. 2011), but also hexamethylenetetramine (HMT) or urea are high-ranking precursors (see e.g. Oba et al. (2023a) and references cited therein). Notably, ammonium cyanide (NH4CN) has been reported to sublimate from 67P’s dust (Altwegg et al. 2020). Extensive review work on the study of carbonaceous chondrites is available (Cronin & Chang 1993; Botta & Bada 2002; Sephton 2002; Glavin et al. 2018).

An additional unknown in meteorite studies is terrestrial weathering that may occur prior to collection and curation to variable extents. Therefore, samples collected from near-Earth asteroids, such as asteroid (101955) Bennu and asteroid (162173) Ryugu, returned to Earth and kept in curation facilities under nitrogen fluxes provide valuable complementary insights into organics. According to its mineralogy, Ryugu seems to be associated with CI chondrites (with much less water content, though), while for Bennu the case is less clear cut. From remote sensing, Bennu appeared to be a CM-like asteroid while from laboratory analyses it rather seemed similar to Ryugu and CI chondrites. Ryugu samples analysed for small soluble organic molecules showed the aliphatic amines methylamine, ethylamine, n-propylamine, and isopropylamine (Naraoka et al. 2023). No dimethylamine was reported. The individual abundance was decreasing with increasing chain length while the total aliphatic amine abundance was an order of magnitude less than in the Orgueil chondrite (Parker et al. 2023). New results from similar analyses of samples returned from asteroid Bennu showed that the C1-C6 amine distribution follows the trend of decreasing concentrations with increasing carbon number as well (Glavin et al. 2025). However, the reported abundances were higher overall and a larger diversity of amines was observed as compared to Ryugu which was suggested to be a result of less extensive aqueous alteration processes on Bennu’s parent body. Unlike Bennu, where isoproylamine and n-propylamine are present in similar abundances, the branched isomer of C3H9N seems to be more abundant than the straight-chain isomer in Ryugu by almost an order of magnitude. Parker et al. (2023) suggested the dominance of the branched isomer of C3H9N in Ryugu to be the result of either favoured formation in a radical reaction or enhanced thermal stability, as branched molecules are less prone to thermal degradation. We find that the straight-chain isomer is less compatible with the analysed DFMS data as its strong iminium ion fragment cannot be accommodated in our solution, or only a minor amount of it, if some of the ethylamine is replaced with dimethylamine (see Sect. 3). We thus cannot place stringent constraints on the branched-versus-straight ratio in C3H9N. Aponte et al. (2014) reported the branched C3H9N isomer to be more abundant in the Murchison chondrite than the straight-chain one by about a factor 1.5 in the free form. The difference was found to be less pronounced in the acid-hydrolyzed extract.

Similar to meteorites, also the samples returned from asteroids seem to possess a diversity of N-heterocycles: For Ryugu, alkylated derivatives of pyridine, piperidine, pyrimidine, imidazole, and pyrrole were identified in a methanol extract (Naraoka et al. 2023). It was suggested that differences in the number of carbon atoms in the alkylpyridine homologs identified in Ryugu and CM chondrites, including Murchison, may be related to hydrothermal history of the parent bodies or irradiation conditions. In Bennu matter, an extensive set of N-heterocycles, including all five nucleobases of terrestrial life, have been identified (Glavin et al. 2025). The total abundance of N-heterocycles in Bennu extracts amounts to approximately 5 nmol/g. This is up to an order of magnitude more than what was reported for Ryugu and was suggested to be the result of less pronounced aqueous alteration on Bennu’s parent body. The degree and effects of aqueous alteration and the related chemical implications for the organics budget of both these C-type asteroids, Bennu and Ryugu, have not been fully understood until today. However, finding the same main N-bearing species among comet 67P’s volatile organics and in Ryugu, Bennu, and carbonaceous chondrites might be an indication that actually many of those species could have formed prior to planetesimal formation. Substantial work will be needed, though, to systematically investigate the abundances of individual cometary COMs and especially of different isomers, which could be realized for instance in the framework of a cometary sample return mission featuring a gas-chromatography-coupled mass spectrometer. This would pave the way towards an enhanced understanding of the post-accretional processing of organics and its effects on the chemical diversity and complexity.

4.2.2 Comparison to the ISM

While organic amines and N-heterocycles are good candidates to explain the mass spectra analysed for this work and are abundantly present in meteorite and asteroid matter, this does not seem to be the case for the ISM. Until today, only methylamine has been repeatedly identified in the ISM (McGuire 2022) and more recently a detection of vinylamine was reported towards the Galactic center cloud G+0.693-0.027 (Zeng et al. 2021). The latter authors also reported a tentative identification of ethylamine. The detection of N-heterocycles has remained elusive until today. McGuire (2022) mentions the tentative detection of aziridine. Aziridine, pyrrolidine, and piperidine (including dehydrogenated and alkylated forms of those molecules) provide an Occam’s razor-compatible explanation for the DFMS data analysed in this work.

Notably, molecules must possess a dipole moment in order to be detectable via rotational spectroscopy, the technique commonly used to probe the gas phase in the ISM. Consequently, some groups of molecules are more easily identified than others and some, for example unfunctionalized symmetric hydrocarbons, are hardly detectable, for example unfunctionalized polyynes (carbon chains) or many aromatic ring systems. A work-around is the probing of such symmetric hydrocarbons via CN-functionalized analogs. It has been proven practical to use the molecules benzonitrile (McGuire et al. 2018) and cyanonaphthalene (McGuire et al. 2021), among others, as proxies for their unfunctionalized counterparts. The presence of a dipole moment is not required for the analytical method of EI-MS applied in this work. As discussed above, also EI-MS has its limitations, and hence, methodological differences are adding to the effective differences between cometary organics and organics in the ISM. For these reasons, common ISM targets are the CN-functionalized carbocycles. But from our study, CN-functionalized carbocycles are clearly less common in comets than N-heterocycles.

Low-mass nitriles and N-bearing acids are easily detectable by both EI-MS and rotational spectroscopy. A subset of such molecules, including multiple previously reported cometary species such as hydrogen cyanide, acetonitrile, cyanoacetylene, 2-propenenitrile, isocyanic acid, or methylisocyanate, has been previously shown to be present in both comet 67P and the ISM (Drozdovskaya et al. 2019; López-Gallifa et al. 2024). For imines, unfortunately, no reference mass spectra exist and we only infer the presence of methanimine under the assumption of similarity to the fragmentation behavior of formaldehyde. If ethanimine yields substantial M and M-H as well, then this molecule might be a viable (partial) replacement of aziridine.

Some ISM environments, however, such as the molecular cloud G+0.693-0.027, appear to be particularly rich in imines. In the same cloud, methanimine, ethanimine, cyanomethanimine, and propargylimine have been detected (Bizzocchi et al. 2020). The presence of imines in comet 67P’s coma is therefore possible if not likely, but probably also with decreasing abundances as observed for the amines. The acquisition of reference EI mass spectra has the potential to place further constrains on the presence of cometary imines. Also, the hydrogen-deficient molecules, including radicals abundant in the ISM, have to be further investigated and constrained. As we have discussed above in Sect. 3.3, there is yet unattributed intensity in the DFMS sum spectrum analysed for this work that is compatible with common ISM species such as C3N (potentially the cyanoethinyl radical), C4H2N (potentially cyanopropyne or cyanoallene or an isomer thereof), or C2HNO (potentially cyanoformaldehyde or an isomer thereof). Future systematic work will allow us to place more stringent constraints on such species.

5 Summary and conclusions

In this work, we presented and discussed a non-unique Occam’s razor-based spectral deconvolution solution of an organic-rich EI-MS sum spectrum acquired by the DFMS in comet 67P’s coma on 3 August 2015, focusing on the CnHmN and CnHmNO species. Occam’s razor was combined with abundance constraints compatible with a naive bottom-up chemistry as well as information from studies of other relevant astrophysical environments. This approach led us to the following conclusions:

  • As already reported for comet 67P’s pure hydrocarbons (Hänni et al. 2022) and for its O-COMs (Hänni et al. 2023), the observed signals of CnHmN and CnHmNO species are best reproduced by a chemically diverse ensemble of cometary COMs similar to meteoritic and asteroidal soluble organic matter. We have found evidence for various chemical functionalities including amines, nitriles, and heterocycles, the ratio of acyclic to cyclic species being roughly 2 to 1. We summed up the FSRM values in Table 2, which suggests that inorganic and organic nitrogen carriers may be similarly abundant in total;

  • Cyclic species, which often produce strong M and M-H signals, are more easily identified than chain-based species, which often produce either strong M or M-H. Occam’s razor suggests that the solution with the smaller number of molecules is the preferable one, which appears debatable in face of the growth of combinatorial possibilities of structural isomers with a growing number of atoms in the molecule. However, characteristic fragment signals, such as the typical amine fragment on m/z = 30, can be used to constrain the contribution of such hard-to-detect molecules. These characteristic fragment signals tend to be overpopulated in our solution. This means that while additional contributions of hard-to-detect species are highly likely, their abundance must be limited by the unaccounted for intensity plus the error margins on the observations;

  • The typical primary amine fragment (CH4N on m/z = 30) is approximately three times more abundant than the typical secondary amine fragment (C2H6N on m/z = 44), but is only two times more abundant than the typical tertiary amine fragment (C3H8N on m/z = 58). These fragments reliably constrain the total abundance of primary, secondary, and tertiary amines in an ensemble of COMs without going through full deconvolution of the dataset;

  • N-heterocycles seem to be more compatible with the observations than N-functionalized carbocycles, and they are good candidates for the signals observed, especially in the higher-mass part of the DFMS spectrum. In NO-bearing species, both NO-heterocycles (e.g. oxazoles and isoxazoles) and O-functionalized (e.g. acylated) N-heterocycles are observed. From our study, there is no evidence for N-functionalized O-heterocycles;

  • Alkylation seems to be very common in both chain-based and cyclic species in the form of short side-chains. For example, and in agreement with reports from meteorite and asteroid Ryugu studies, isopropylamine more closely matches the observed signals than its unbranched counter-part, which would lead to further overpopulation of the iminium ion fragment on m/z = 30;

  • The presence of amino acids and nucleobases falling into the class of CnHmNxOy species, where x + y > 2, will be investigated separately in the future. It will be challenging to clearly identify such molecules in the complex mixture of organics present in 67P’s coma. But it may be possible to derive meaningful upper limits from the absence of signals that should be clearly detectable according to EI-MS reference data.

The set of molecules used in our approach, together with their isomers where applicable, may inform future searches in related astrophysical environments. We achieved the clear identification of many smaller N- and NO-COMs, but the true chemical diversity for larger COMs, where the parameter space becomes increasingly degenerate, is probably underestimated. While we consider residual minimization as being too limited and the unambiguous identification of large COMs from DFMS sum fragmentation patterns to be impossible, we see more potential for instance in multivariate statistical methods or chemical networks to characterize the ensemble of cometary organic matter in unbiased ways. Particularly for larger COMs, top-down synthetic pathways may also play a role. This renders collaborative efforts among modelling, laboratory, and observational communities essential to deepen our understanding of the origin and evolution of organic complexity, from the ISM towards emerging and evolved planetary systems.

Acknowledgements

We gratefully acknowledge the work of the many engineers, technicians and scientists involved in the Rosetta mission and in the planning, construction, and operation of the ROSINA instrument in particular. Without their contributions, ROSINA would not have produced such outstanding data and scientific results. Rosetta is an ESA mission with contributions from its member states and NASA. Work at the University of Bern was funded by the Canton of Bern and the Swiss National Science Foundation (200020_207312). S.F.W. acknowledges financial support of the SNSF Eccellenza Professorial Fellowship PCEFP2_181150. M.R.C. acknowledges support from NASA grants 80NSSC18K1280 and 80NSSC20K0651. J.D.K. acknowledges support from the Belgian Science Policy Office. Especially, we thank Prof. Dr. B. McGuire for sharing with us his ISM perspective on the work presented here.

References

  1. Agúndez, M., Biver, N., Santos-Sanz, P., Bockelée-Morvan, D., & Moreno, R. 2014, A&A, 564, L2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Alexander, C. M. O. D., Cody, G. D., De Gregorio, B. T., Nittler, L. R., & Stroud, R. M. 2017, Chem. Erde/Geochemistry, 77, 227 [NASA ADS] [Google Scholar]
  3. Altwegg, K., Balsiger, H., Bar-Nun, A., et al. 2016, Sci. Adv., 2, e1600285 [NASA ADS] [CrossRef] [Google Scholar]
  4. Altwegg, K., Balsiger, H., & Fuselier, S. A. 2019, ARA&A, 57, 113 [NASA ADS] [CrossRef] [Google Scholar]
  5. Altwegg, K., Balsiger, H., Hänni, N., et al. 2020, Nat. Astron., 4, 533 [NASA ADS] [CrossRef] [Google Scholar]
  6. Anders, E., & Ebihara, M. 1982, Geochim. Cosmochim. Acta, 46, 2363 [NASA ADS] [CrossRef] [Google Scholar]
  7. Aponte, J. C., Dworkin, J. P., & Elsila, J. E. 2014, Geochim. Cosmochim. Acta, 141, 331 [Google Scholar]
  8. Aponte, J. C., Dworkin, J. P., & Elsila, J. E. 2015, Meteor. Planet. Sci., 50, 1733 [Google Scholar]
  9. Aponte, J. C., McLain, H. L., Dworkin, J. P., & Elsila, J. E. 2016, Geochim. Cos-mochim. Acta, 189, 296 [Google Scholar]
  10. Balsiger, H., Altwegg, K., Buhler, F., et al. 1986, Nature, 321, 330 [Google Scholar]
  11. Balsiger, H., Altwegg, K., Bochsler, P., et al. 2007, Space Sci. Rev., 128, 745 [NASA ADS] [CrossRef] [Google Scholar]
  12. Belloche, A., Garrod, R. T., Müller, H. S. P., et al. 2019, A&A, 628, A10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Beran, J. A., & Kevan, L. 1969, J. Phys. Chem., 73, 3866 [Google Scholar]
  14. Berger, E. L., Zega, T. J., Keller, L. P., & Lauretta, D. S. 2011, Geochim. Cos-mochim. Acta, 75, 3501 [Google Scholar]
  15. Biver, N., & Bockelée-Morvan, D. 2019, ACS Earth Space Chem., 3, 1550 [Google Scholar]
  16. Biver, N., Bockelée-Morvan, D., Colom, P., et al. 1997, Earth Moon Planets, 78, 5 [NASA ADS] [CrossRef] [Google Scholar]
  17. Bizzocchi, L., Prudenzano, D., Rivilla, V. M., et al. 2020, A&A, 640, A98 [EDP Sciences] [Google Scholar]
  18. Bockelée-Morvan, D., Lis, D. C., Wink, J. E., et al. 2000, A&A, 353, 1101 [Google Scholar]
  19. Botta, O., & Bada, J. L. 2002, Surv. Geophys., 23, 411 [NASA ADS] [CrossRef] [Google Scholar]
  20. Brownlee, D., Tsou, P., Aléon, J., et al. 2006, Science, 314, 1711 [CrossRef] [PubMed] [Google Scholar]
  21. Burton, A. S., Stern, J. C., Elsila, J. E., Glavin, D. P., & Dworkin, J. P. 2012, Chem. Soc. Rev., 41, 5459 [Google Scholar]
  22. Cabezas, C., Agúndez, M., Marcelino, N., et al. 2021, A&A, 654, L9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. Cabezas, C., Agúndez, M., Marcelino, N., et al. 2025, A&A, 693, L14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Callahan, M. P., Smith, K. E., Cleaves, H. J., et al. 2011, PNAS, 108, 13995 [Google Scholar]
  25. Cernicharo, J., Marcelino, N., Agúndez, M., et al. 2020, A&A, 642, L8 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Cernicharo, J., Agúndez, M., Kaiser, R. I., et al. 2021, A&A, 652, L9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Chuang, Y.-L., Kuan, Y.-J., & Charnley, S. B. 2024, MNRAS, 534, 3439 [Google Scholar]
  28. Clauberg, J. 1654, Logica Vetus Et Nova (Whitefish, MT, United States: Kessinger Publishing) [Google Scholar]
  29. Connections, M. 2024, Spectra Base (John Wiley & Sons, Inc.) [Google Scholar]
  30. Cronin, J. R., & Chang, S. 1993, in NATO Advanced Study Institute (ASI) Series C, 416, The Chemistry of Life’s Origins, eds. J. M. Greenberg, C. X. Mendoza-Gómez, & V. Pirronello, 209 [Google Scholar]
  31. De Keyser, J., Dhooghe, F., Altwegg, K., et al. 2017, MNRAS, 469, S695 [Google Scholar]
  32. De Keyser, J., Altwegg, K., Gibbons, A., et al. 2019, Int. J. Mass Spectrom., 446, 116232 [Google Scholar]
  33. De Keyser, J., Dhooghe, F., Altwegg, K., et al. 2024, Int. J. Mass Spectrom., 504, 117304 [Google Scholar]
  34. Dello Russo, N., Kawakita, H., Vervack, R. J., & Weaver, H. A. 2016, Icarus, 278, 301 [NASA ADS] [CrossRef] [Google Scholar]
  35. Drozdovskaya, M. N., van Dishoeck, E. F., Rubin, M., Jørgensen, J. K., & Altwegg, K. 2019, MNRAS, 490, 50 [Google Scholar]
  36. Duffield, A. M., Budzikiewicz, H., & Djerassi, C. 1964, J. Am. Chem. Soc., 86, 5536 [Google Scholar]
  37. Foerstel, M., Bergantini, A., Maksyutenko, P., Gobi, S., & Kaiser, R. I. 2017, ApJ, 845, 83 [Google Scholar]
  38. Folsome, C. E., Lawless, J., Romiez, M., & Ponnamperuma, C. 1971, Nature, 232, 108 [Google Scholar]
  39. For Biotechnology Information, N. C. 2024, PubChem Database [Google Scholar]
  40. Geiss, J. 1987, A&A, 187, 859 [NASA ADS] [Google Scholar]
  41. Glavin, D., Dworkin, J., & Sandford, S. 2008, Meteor. Planet. Sci., 43, 399 [Google Scholar]
  42. Glavin, D. P., Alexander, C. M., Aponte, J. C., et al. 2018, in Primitive Meteorites and Asteroids, ed. N. Abreu (Elsevier), 205 [Google Scholar]
  43. Glavin, D. P., Dworkin, J. P., Alexander, C. M. O., et al. 2025, Nat. Astron., 9, 199 [Google Scholar]
  44. Guelin, M., & Thaddeus, P. 1977, ApJ, 212, L81 [Google Scholar]
  45. Guelin, M., Neininger, N., & Cernicharo, J. 1998, A&A, 335, L1 [Google Scholar]
  46. Hand, C. W., & Bogan, D. J. 1971, J. Phys. Chem., 75, 1532 [Google Scholar]
  47. Hänni, N., Altwegg, K., Combi, M., et al. 2022, Nat. Commun., 13, 3639 [CrossRef] [Google Scholar]
  48. Hänni, N., Altwegg, K., Baklouti, D., et al. 2023, A&A, 678, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  49. Hänni, N., Altwegg, K., Combi, M., et al. 2024, ApJ, 976, 74 [Google Scholar]
  50. Irvine, W. M., Bockelee-Morvan, D., Lis, D. C., et al. 1996, Nature, 383, 418 [Google Scholar]
  51. Jessberger, E. K., Christoforidis, A., & Kissel, J. 1988, Nature, 332, 691 [Google Scholar]
  52. Jungclaus, G., Cronin, J. R., Moore, C. B., & Yuen, G. U. 1976, Nature, 261, 126 [Google Scholar]
  53. Kipfer, K. A., Galli, A., Riedo, A., et al. 2024, Icarus, 410, 115742 [NASA ADS] [CrossRef] [Google Scholar]
  54. Kumar, Y., Kumar, M., Kumar, S., & Kumar, R. 2019, Atoms, 7, 60 [Google Scholar]
  55. Le Roy, L., Altwegg, K., Balsiger, H., et al. 2015, A&A, 583, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  56. Lee, K. L. K., McGuire, B. A., & McCarthy, M. C. 2019, Phys. Chem. Chem. Phys. (Incorp. Faraday Trans.), 21, 2946 [Google Scholar]
  57. Li, Z., Dawley, M. M., Carmichael, I., & Ptasińska, S. 2016, Int. J. Mass Spectrom., 410, 36 [Google Scholar]
  58. Lisse, C. M., VanCleve, J., Adams, A. C., et al. 2006, Science, 313, 635 [NASA ADS] [CrossRef] [Google Scholar]
  59. Lodders, K., Bergemann, M., & Palme, H. 2025, Space Sci. Rev., 221, 23 [Google Scholar]
  60. López-Gallifa, Á., Rivilla, V. M., Beltrán, M. T., et al. 2024, MNRAS, 529, 3244 [CrossRef] [Google Scholar]
  61. Lévêque, P., Queffelec, C., Sotin, C., et al. 2024, ACS Earth Space Chem., 8, 1281 [Google Scholar]
  62. Martins, Z. 2018, Life, 8, 28 [Google Scholar]
  63. McGuire, B. A. 2022, ApJS, 259, 30 [NASA ADS] [CrossRef] [Google Scholar]
  64. McGuire, B. A., Burkhardt, A. M., Kalenskii, S., et al. 2018, Science, 359, 202 [Google Scholar]
  65. McGuire, B. A., Burkhardt, A. M., Loomis, R. A., et al. 2020, ApJ, 900, L10 [Google Scholar]
  66. McGuire, B. A., Loomis, R. A., Burkhardt, A. M., et al. 2021, Science, 371, 1265 [Google Scholar]
  67. McLafferty, F. W. 1959, Analyt. Chem., 31, 82 [Google Scholar]
  68. Mikaia, A. 2022, J. Phys. Chem. Ref. Data, 51, 031501 [Google Scholar]
  69. Mikaia, A. 2023, J. Phys. Chem. Ref. Data, 52, 021501 [Google Scholar]
  70. Miller, K. E., Waite, J. H., Perryman, R. S., et al. 2020, Icarus, 339, 113595 [Google Scholar]
  71. Müller, H. S. P., Garrod, R. T., Belloche, A., et al. 2023, MNRAS, 523, 2887 [CrossRef] [Google Scholar]
  72. Naraoka, H., Takano, Y., Dworkin, J. P., et al. 2023, Science, 379, eabn9033 [CrossRef] [Google Scholar]
  73. Oba, Y., Koga, T., Takano, Y., et al. 2023a, Nat. Commun., 14, 1292 [Google Scholar]
  74. Oba, Y., Takano, Y., Dworkin, J. P., & Naraoka, H. 2023b, Nat. Commun., 14, 3107 [Google Scholar]
  75. Parker, E. T., McLain, H. L., Glavin, D. P., et al. 2023, Geochim. Cos-mochim. Acta, 347, 42 [Google Scholar]
  76. Pilorget, C., Baklouti, D., Bibring, J. P., et al. 2024, Nat. Astron., 8, 1529 [Google Scholar]
  77. Poch, O., Istiqomah, I., Quirico, E., et al. 2020, Science, 367, aaw7462 [NASA ADS] [CrossRef] [Google Scholar]
  78. Potiszil, C., Yamanaka, M., Sakaguchi, C., et al. 2023, Life, 13, 1448 [Google Scholar]
  79. Remijan, A. J., Hollis, J. M., Lovas, F. J., et al. 2008, ApJ, 675, L85 [Google Scholar]
  80. Rodgers, S. D., & Charnley, S. B. 1998, ApJ, 501, L227 [Google Scholar]
  81. Rubin, M., Altwegg, K., Balsiger, H., et al. 2019a, MNRAS, 489, 594 [Google Scholar]
  82. Rubin, M., Bekaert, D. V., Broadley, M. W., Drozdovskaya, M. N., & Wampfler, S. F. 2019b, ACS Earth Space Chem., 3, 1792 [Google Scholar]
  83. Ruf, A., & Danger, G. 2022, Analyt. Chem., 94, 14135 [Google Scholar]
  84. Ruf, A., Poinot, P., Geffroy, C., Le Sergeant d’Hendecourt, L., & Danger, G. 2019, Life, 9, 35 [Google Scholar]
  85. Sandford, S. A. 2008, Annu. Rev. Analyt. Chem., 1, 549 [Google Scholar]
  86. Schmitt-Kopplin, P., Gabelica, Z., Gougeon, R. D., et al. 2010, PNAS, 107, 2763 [CrossRef] [Google Scholar]
  87. Schmitt-Kopplin, P., Hertkorn, N., Harir, M., et al. 2023, Nat. Commun., 14, 6525 [Google Scholar]
  88. Schuhmann, M., Altwegg, K., Balsiger, H., et al. 2019a, A&A, 630, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Schuhmann, M., Altwegg, K., Balsiger, H., et al. 2019b, ACS Earth Space Chem., 3, 1854 [Google Scholar]
  90. Sephton, M. A. 2002, Natural Product Rep., 19, 292 [NASA ADS] [CrossRef] [Google Scholar]
  91. Spiteller, G. 1967, Adv. Heterocyclic Chem., 7, 301 [Google Scholar]
  92. Steins, S. E. 2018, Chemistry WebBook, NIST Standard Reference Database Number 69, National Institute of Standards and Technology [Google Scholar]
  93. Stevenson, D. P. 1951, Discuss. Faraday Soc., 10, 35 [Google Scholar]
  94. Taniguchi, K., Gorai, P., & Tan, J. C. 2024, Ap&SS, 369, 34 [NASA ADS] [CrossRef] [Google Scholar]
  95. Tielens, A. 2021, Molecular Astrophysics (Cambridge, UK: Cambridge University Press) [Google Scholar]
  96. Varmuza, K., Filzmoser, P., Fray, N., et al. 2020, J. Chemometrics, 34, e3218 [Google Scholar]
  97. Yabuta, H., Cody, G. D., Engrand, C., et al. 2023, Science, 379, eabn9057 [CrossRef] [Google Scholar]
  98. Zeng, S., Jiménez-Serra, I., Rivilla, V. M., et al. 2021, ApJ, 920, L27 [NASA ADS] [CrossRef] [Google Scholar]
  99. Zong, H.-H., Yao, C., Sun, C. Q., Zhang, J.-G., & Zhang, L. 2020, Molecules, 25, 3232 [Google Scholar]

All Tables

Table 1

N-bearing molecules detected in comets by means of remote-sensing at millimetre wavelengths.

Table 2

Total fragment sum relative to methanol (FSRM) of the different groups of volatiles from this work.

All Figures

thumbnail Fig. 1

High-resolution mass spectrum collected on 3 August 2015 by the DFMS in 67P’s coma around m/z = 41 (black square markers). The peaks, associated with C3H5, C2H3N, CHN2, and C2HO at the exact positions indicated by the dashed grey lines, are normally fitted with double-Gaussian peak profile functions (dashed blue lines with the sum shown as a solid red line).

In the text
thumbnail Fig. 2

Occam’s razor-based deconvolution result of the subset of signals associated with CnHmN parent and fragment species detected by the DFMS on 3 August 2015. The measured signals (measured) are given in arbitrary units (a.u.) with 30% error margins. The different contributions of the chosen candidate molecules are colour-coded according to their molecular structure – in shades of red and purple for chain-based molecules and in shades of green and blue for cyclic molecules.

In the text
thumbnail Fig. 3

Occam’s razor-based deconvolution result of the subset of signals associated with CnHmNO parent and fragment species detected by the DFMS on 3 August 2015. The measured signals (measured) are given in arbitrary units (a.u.) with 30% error margins. The different contributions of the chosen candidate molecules are colour-coded according to their molecular structure – in shades of red and purple for chain-based molecules and in shades of green and blue for cyclic molecules.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.