Issue |
A&A
Volume 687, July 2024
|
|
---|---|---|
Article Number | A110 | |
Number of page(s) | 25 | |
Section | Planets and planetary systems | |
DOI | https://doi.org/10.1051/0004-6361/202348802 | |
Published online | 03 July 2024 |
Information content of JWST spectra of WASP-39b
1
Ludwig Maximilian University, Faculty of Physics, University Observatory,
Scheinerstr. 1,
Munich
81679,
Germany
e-mail: anna.lueber@physik.lmu.de
2
Valongo Observatory, Federal University of Rio de Janeiro,
Ladeira do Pedro Antonio, 43,
20080-090,
Rio de Janeiro,
Brazil
e-mail: aline12@ov.ufrj.br
3
Astrophysics, University of Oxford,
Denys Wilkinson Building, Keble Road,
Oxford,
OX1 3RH,
UK
4
ARTORG Center for Biomedical Engineering Research, University of Bern,
Murtenstrasse 50,
3008
Bern,
Switzerland
5
University College London, Department of Physics & Astronomy,
Gower St,
London,
WC1E 6BT,
UK
6
University of Warwick, Department of Physics, Astronomy & Astrophysics Group,
Coventry
CV4 7AL,
UK
7
Center for Space and Habitability, University of Bern,
Gesellschaftsstrasse 6,
CH-3012,
Bern,
Switzerland
Received:
30
November
2023
Accepted:
2
May
2024
Context. The era of James Webb Space Telescope (JWST) transmission spectroscopy of exoplanetary atmospheres commenced with the study of the Saturn-mass gas giant WASP-39b as part of the Early Release Science (ERS) program. WASP-39b was observed using several different JWST instrument modes (NIRCam,, NIRISS, NIRSpec G395H and NIRSpec PRISM) and the spectra were published in a series of papers by the ERS team.
Aims. The current study examines the information content of these spectra measured using the different instrument modes, focusing on the complexity of the temperature-pressure profiles and number of chemical species warranted by the data. We examine if the molecules H2O, CO, CO2, K, H2S, CH4, and SO2 are detected in each of the instrument modes.
Methods. Two Bayesian inference methods are used to perform atmospheric retrievals: the standard nested sampling method, as well as the supervised machine learning method of the random forest (trained on a model grid). For nested sampling, Bayesian model comparison is used as a guide to identify the set of models with the required complexity to explain the data.
Results. Generally, non-isothermal transit chords are needed to fit the transmission spectra of WASP-39b, although the complexity of the temperature-pressure profile required is mode-dependent. The minimal set of chemical species needed to fit a spectrum is mode-dependent as well, and also depends on whether grey or non-grey clouds are assumed. When a non-grey cloud model is used to fit the NIRSpec G395H spectrum, it generates a spectral continuum that compensates for the water opacity. The same compensation is absent when fitting the non-grey cloud model to the NIRSpec PRISM spectrum (which has broader wavelength coverage), suggesting that it is spurious. The interplay between the cloud spectral continuum and the water opacity determines if sulphur dioxide is needed to fit either spectrum.
Conclusions. The inferred elemental abundances of carbon and oxygen and the carbon-to-oxygen (C/O) ratios are all mode- and model-dependent, and should be interpreted with caution. Bayesian model comparison does not always offer a clear path forward for favouring specific retrieval models (e.g. grey versus non-grey clouds) and thus for enabling unambiguous interpretations of exoplanet spectra.
Key words: techniques: spectroscopic / planets and satellites: atmospheres / planets and satellites: composition / planets and satellites: individual: WASP-39b
© The Authors 2024
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
1 Introduction
The hot Jupiter WASP-39b's optimal combination of low surface gravity (log g ≈ 2.6 in cgs units; Faedi et al. 2011) and somewhat high equilibrium temperature (about 1100 K; Faedi et al. 2011), which collectively yields a large atmospheric pressure scale height, makes it an optimal target for transmission spectroscopy. For these reasons, it was chosen as one of the targets of the James Webb Space Telescope (JWST) Transiting Exoplanet ERS program (Stevenson et al. 2016; Bean et al. 2018; Ahrer et al. 2023a). The ERS team has measured transmission spectra of WASP-39b using the NIRCam (Ahrer et al. 2023b), NIRISS (Feinstein et al. 2023), NIRSpec G395H (Alderson et al. 2023), and NIRSpec PRISM (Ahrer et al. 2023a; Rustamkulov et al. 2023) instrument modes of JWST.
WASP-39b was previously observed using the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope, where the detection of water was reported (Wakeford et al. 2018). Due to the limited wavelength range covered by WFC3 (0.8-1.7 μm), it was not possible to make definitive statements on other molecular species. Sodium was also detected using the Space Telescope Imaging Spectrograph (STIS) onboard Hubble, which covered the wavelength range of 0.29 to 1.025 μm (Fischer et al. 2016). The JWST spectra of WASP-39b collectively cover a wavelength range of 0.5-5.5 microns, which contains the spectral features of all of the simple carbon, hydrogen, and oxygen carriers (H2O, CO, CO2, CH4), as well as that of potassium. The tentative detection of sulphur dioxide (SO2) was also reported (Rustamkulov et al. 2023), which is believed to be a photochemical byproduct of hydrogen sulphide (H2S) and evidence for the presence of photochemistry (Tsai et al. 2023).
Collectively, the ERS team has reported the detection of carbon dioxide (Ahrer et al. 2023a; Alderson et al. 2023; Rustamkulov et al. 2023), water (Alderson et al. 2023; Ahrer et al. 2023b; Feinstein et al. 2023; Rustamkulov et al. 2023), carbon monoxide (Rustamkulov et al. 2023), sulphur dioxide (Alderson et al. 2023; Rustamkulov et al. 2023), sodium (Rustamkulov et al. 2023), and potassium (Feinstein et al. 2023), as well as a non-detection of methane (Ahrer et al. 2023a,b; Rustamkulov et al. 2023). Carbon dioxide was also detected by applying a cross correlation method to the NIRSpec G395H spectrum (Esparza-Borges et al. 2023). Fitting forward models to the spectra of the different instrument modes yields supersolar metallicities to varying degrees (Alderson et al. 2023; Ahrer et al. 2023b; Feinstein et al. 2023; Rustamkulov et al. 2023). However, whether the C/O ratio is sub-solar (Alderson et al. 2023; Ahrer et al. 2023b; Feinstein et al. 2023), solar (Alderson et al. 2023) or super-solar (Rustamkulov et al. 2023) depends on the instrument mode being considered. The mode and model dependence of the inferred atmospheric metallicity and C/O ratio is confirmed by the present study.
The richness of information encoded in these JWST spectra of WASP-39b, as well as the differences in the inferred metallicities and C/O ratios motivate a thorough investigation using a Bayesian framework of inference. In the field of exoplanetary atmospheres, this is known as atmospheric retrieval (Madhusudhan & Seager 2009), which solves the inverse problem of inferring chemical abundances and atmospheric properties from fitting a measured spectrum (for a review, see Barstow & Heng 2020). In the current paper, we study the JWST spectra of WASP-39b using a suite of atmospheric retrievals. We implement these retrievals in two ways: first, using the standard approach of nested sampling (Skilling 2006); second, using the supervised machine learning method of the random forest (Márquez-Neila et al. 2018). Questions we wish to address include:
- 1.
Are isothermal transit chords sufficient to fit JWST transmission spectra? If not, what is the minimum level of complexity (non-isothermal behaviour) needed to produce a best fit?
- 2.
Which molecules may we declare to be detected?
- 3.
Are grey or non-grey clouds required to fit the spectrum? And how do they interact with other components of the model?
- 4.
Are any of our inferences dependent on the instrument mode being considered?
We anticipate that a family of models, rather than a single model, will be able to fit each spectrum. To implement Occam's Razor and penalise models that are too complex (given the data quality), we implement Bayesian model comparison (Trotta 2008), which is a natural outcome of the nested sampling algorithm (Skilling 2006). As we report, there are instances where Bayesian model comparison does not allow us to clearly select a single retrieval model for interpreting the data.
In Sect. 2, we describe our methodology including the data curated and the retrieval methods implemented. In Sec. 3, we present the outcomes of our suites of retrievals. In Sec. 4, we summarise our findings and discuss their implications.
2 Methodology
2.1 Spectral sample
WASP-39b is a 0.28 MJ and 1.27 RJ Saturn-like exoplanet with an equilibrium temperature of about 1100 K, orbiting a G7 V star with a mass of 0.93 M⊙ and radius of 0.895 R⊙ (Faedi et al. 2011). No new data are presented in the current study. We analyse the four spectra previously published by Ahrer et al. (2023b) (NIRCam, 2.4-4.0 μm), Feinstein et al. (2023) (NIRISS, 0.6-2.8 μm), Alderson et al. (2023) (NIRSpec G395H, 2.7-5.2 μm), and Rustamkulov et al. (2023) (NIRSpec PRISM, 0.5-5.5 μm). We make no attempt at re-reducing or combining these spectra, or correcting for detector offsets between them. The four spectra used in the current study are displayed in Fig. 1.
2.2 Atmospheric retrieval techniques
2.2.1 Nested sampling
For standard Bayesian retrievals, we use the open-source, GPU-accelerated Helios-r2 code (Kitzmann et al. 2020)1. It is based on the nested sampling technique (Skilling 2006) and implements the MULTINEST algorithm (Feroz & Hobson 2008; Feroz et al. 2009). Helios-r2 is capable of fitting both transmission and emission spectra, and has the option to include grey and non-grey clouds (see Kitzmann et al. 2020 and Lueber et al. 2022 for details). The one-dimensional, plane-parallel model atmosphere consists of 99 layers (100 levels) spanning 10 bar to 1 μbar. The temperature-pressure profile is described by a finite element approach that ensures smoothness and continuity. In the current study, it is parameterised by between one (isothermal) and four parameters, where the required complexity is data-driven and guided by Bayesian model comparison (Trotta 2008).
The marginalised likelihood, which is also termed the Bayesian evidence, is computed for each retrieval model. The ratio of Bayesian evidences between a pair of retrieval models is termed the “Bayes factor” . It may, in principle, allow one to identify the model that best explains the data with the appropriate level of complexity (Trotta 2008). For the convenience of the reader, we summarise some key points from Table 2 of Trotta (2008): when ln
< 1, both models explain the data equally well. When ln
≥ 5, the model with the higher Bayesian evidence is strongly favoured. When ln
≥ 2.5, it is only moderately favoured.
The open-source HELIOS-K opacity calculator (Grimm & Heng 2015; Grimm et al. 2021) was used to convert spectroscopic line lists into absorption cross sections or opacities (cross sections per unit mass) of atoms and molecules. We utilise the line lists for H2O (Polyansky et al. 2018), CO2 (Tashkun & Perevalov 2011), CO (Li et al. 2015), SO2 (Underwood et al. 2016), H2S (Azzam et al. 2016), and CH4 (Yurchenko & Tennyson 2014). The resonant line wing shapes of the alkali metals (K and Na) are taken from Allard et al. (2016, 2019). Collision-induced absorption (CIA) by H2−H2 (Abel et al. 2011) and H2−He (Abel et al. 2012) pairs are included. For a review of spectroscopic databases, please see Tennyson & Yurchenko (2017). The opacities are publicly available on the DACE opacity database (Grimm et al. 2021)2.
The grey cloud model simply assumes a single, constant opacity. Our non-grey cloud model is taken from Eq. (32) of Kitzmann & Heng (2018) and is calibrated on refractive indices measured in the laboratory of a library of different aerosol compositions. Based on an empirical description of Mie theory, it smoothly connects extinction by small and large particles, which correspond to Rayleigh scattering and grey extinction, respectively (Pierrehumbert 2010). Spherical particles of a single size are assumed. The parameters of our non-grey cloud model include the cloud-top pressure Pcloudtop, the optical depth referenced to 1 μm, a composition parameter Q0 (which determines the value of 2πrcloud/λ where the transition from small-to large-particle extinction occurs), the particle radius rcloud, and the index of the slope corresponding to small-particle scattering a0 (which is exactly 4 for Rayleigh scattering). For both the grey and non-grey cloud models, the cloud is assumed to be semi-infinite, meaning the cloud-bottom pressure extends to the bottom of the model domain3.
The parameter space to be explored is vast, since we have to consider multiple atomic and molecular species (in various combinations), non-isothermal temperature-pressure profiles and grey versus non-grey clouds. Figure 2 describes how we conduct our parameter explorations by examining the treatment of the temperature-pressure profile and chemistry (number of species included).
Our baseline model is cloudfree, isothermal, and contains vertically uniform abundances for H2O, CO2, CO, SO2, H2S, CH4, and K. For simplicity, the Na abundance is not treated as a fitting parameter and is instead assumed to be given by the solar elemental abundance ratio between it and K. However, NIRSpec PRISM retrievals present an exception where the line centre is accessible, allowing Na to be treated as an independent fitting parameter. The cloudfree model is then compared to models with grey versus non-grey clouds. For all four spectra, cloudfree models are disfavoured, to different degrees, via Bayesian model comparison (Trotta 2008), yielding values for the logarithm of the Bayes factor from 1.66 to 33.01, as documented in Fig. 3. Since one of our motivations is to understand how the cloud model interacts with other components of the retrieval, we run two parallel sets of retrievals for grey and non-grey clouds (Fig. 2).
For each set of retrievals (with either grey or non-grey clouds), we next explore the effects of varying the complexity of the temperature-profile. We use Bayesian model comparison to guide our choice of temperature-pressure profile, which has between 1 and 4 fitting parameters. If a subset of these models (with different temperature-pressure profiles) are associated with ln < 1, then we apply Occam's Razor and select the model with the least number of parameters describing the temperature-pressure profile. We term this the “reference model”.
After the temperature-pressure profile has been chosen, we remove the chemical species one at a time and record the Bayesian evidence. If the Bayesian evidence is higher when a species is removed, then we exclude it from the model regardless of whether the logarithm of the Bayes factor is greater or less than unity. If the Bayesian evidence is lower when a species is removed and the logarithm of the Bayes factor exceeds unity, then we include this species in the model. By considering each of the seven species in turn (eight species for the PRISM retrieval), we are able to identify the model with the least number of chemical species required to fit the data. We term this model the “simplified model”.
The Bayesian framework used within Helios-r2 requires us to define prior distributions for all fitting parameters of the forward model. All parameters and their priors may be found in Table 1. In particular, the assumed prior distributions of the stellar radius Rs, planetary white-light radius Rp, and planetary surface gravity g are based on the measurements of Mancini et al. (2018).
![]() |
Fig. 1 Curated JWST ERS transmission data of WASP-39b from the following four instruments: NIRCam (blue), NIRISS (red), NIRSpec G395H (green), and NIRSpec PRISM (yellow). The data were originally published by Ahrer et al. (2023b), Feinstein et al. (2023), Rustamkulov et al. (2023), and Alderson et al. (2023), respectively. |
![]() |
Fig. 2 Schematic describing our suite of nested sampling retrievals, where models with grey and non-grey clouds are tested separately for variations in the temperature-pressure profile, and number of atoms and molecules considered. |
![]() |
Fig. 3 Bayesian model comparison between cloudfree and cloudy retrievals, corresponding to the first step of the schematic shown in Fig. 2. The Bayesian evidence relative to the cloudfree model is computed and its natural logarithm is displayed. For all four JWST instruments, there are significant increases in the Bayesian evidence when cloudy models are used to fit the spectra. |
Summary of retrieval parameters and the assumptions for their prior distributions.
2.2.2 Random forest
We also perform atmospheric retrieval using the supervised machine learning method of the random forest (Ho 1998; Breiman 2001) implemented via the code HELA (Márquez-Neila et al. 2018)1. It uses the pre-computed grid of model spectra published by Crossfield (2023) as a training set for performing inference in the framework of Approximate Bayesian Computation (Sisson et al. 2018). This model grid was computed using the VULCAN photochemical kinetics code (Tsai et al. 2017, 2021) and the petitRadTrans radiative transfer code for generating transmission spectra (Mollière et al. 2019). It considers metallicities from solar to 100× solar (by varying the elemental abundances of C, O, and S), which amounts to 1331 transmission spectra in total. It implements a spectral resolution of ~1000 between 1 to 25 μm. This wavelength range implies that part of the NIRISS and NIRSpec PRISM spectra are not analysed.
To use the model grid as a training set, we bin the model spectrum (in terms of transit depth D) according to the wavelength bins of each measured spectrum. The specific details of the binning differ between the spectra from the four instrument modes. Within each bin, there is a measured uncertainty δ. We use δ/D as the standard deviation of a Gaussian from which we randomly draw uncertainties for the model spectrum. This uncertainty is then added or subtracted from the binned model transit depth (Oreshenko et al. 2020; Lueber et al. 2023). In this manner, even the same model spectrum may generate multiple realisations of model spectra that include noise. Using this approach, we generate a training set (with noise) of 2662 model spectra using the Crossfield (2023) grid. A total of 3000 regression trees with no restriction on the maximum tree depth were used. Instead, each tree grows by splitting the space of the models until the decrease in variance in the associated parameters of further splits is smaller than a fractional value of 0.01.
![]() |
Fig. 4 Comparing the median values of posterior distributions of selected parameters from retrievals performed on the four different spectra. Arrows indicate upper or lower limits. For each spectrum, a suite of four retrievals with different complexity of the temperature-pressure profile is performed. Isothermal profiles are denoted by “isoT”, while “FP” means that the temperature-pressure profile is parameterised by F fitting/free parameters. Separate suites of retrievals for grey versus non-grey cloud models are performed. Within each suite, filled circles are models where the logarithm of the Bayes factor is less than unity, implying that all of these models explain the data equally well. Within this subset of models indicated by filled circles, we choose the temperature-pressure profile that uses the least number of parameters in our subsequent explorations. |
3 Results
3.1 Nested sampling retrievals
3.1.1 Complexity of temperature-pressure profile
Figure 4 shows the outcomes of a suite of retrievals that explore the complexity of the temperature-pressure profile used, while including all seven chemical species (eight for PRISM retrievals) previously described. To explore the sensitivity of outcomes to the choice of cloud model, separate sub-suites of retrievals are performed using grey versus non-grey cloud models. Let the Bayes factor be and its natural logarithm be ln
. In Fig. 4, the filled circles indicate the subset of models where ln
< 1 (relative to the model with the highest Bayesian evidence). We term this subset the “favoured models”.
The water and carbon dioxide abundances retrieved from NIRCam spectra depend on the complexity of the temperature-pressure profile adopted and not on the choice of grey versus non-grey model. When isothermal transit chords are assumed, cloudy models are weakly preferred for fitting NIRCam spectra with ln = 2.06 and ln
= 1.66 for grey and non-grey clouds, respectively (Fig. 3). For the isothermal retrieval, the optical depth for the grey cloud is log
. When non-isothermal temperature-pressure profiles are used the clouds remain optically thin (Fig. 4). Figure 5 displays examples of best-fit reference models (which include all seven or eight chemical species) to NIRCam spectra. When the contribution of clouds is removed from the model spectrum via post-processing, one sees that it is essentially identical to the reference model curve, implying that the clouds have no effect on the spectrum. The effects of clouds on the NIRCam spectrum is negligible regardless of whether grey or non-grey clouds are assumed.
For the NIRISS spectra, carbon dioxide is not detected regardless of the complexity of the temperature-pressure profile (Fig. 4). The non-detection of carbon dioxide is consistent with the finding of Feinstein et al. (2023). Retrievals with grey clouds have upper limits that are optically thick, whereas those with non-grey clouds have optical depths of about unity (at 1 μm). The retrieved water abundances are somewhat consistent between the eight different retrievals (four types of temperature-pressure profiles and grey versus non-grey cloud models).
For the NIRSpec G395H spectra, a major outcome of the current study is that the retrieved water abundances depend on whether grey or non-grey cloud models are used. Figure 4 shows that the grey clouds are optically thin regardless of the temperature-pressure profile used, while the non-grey clouds are optically thick. Figure 6 corroborates this finding, where the model spectrum with grey clouds removed (via post-processing) is identical to the reference model curve (with all chemical species and grey clouds included). By contrast, the non-grey clouds have a non-negligible effect. The spectral continuum associated with the non-grey cloud model compensates for the spectral continuum associated with the water line wings, altering the retrieved abundance of water.
Water abundances derived from the NIRSpec PRISM spectrum are somewhat robust to the choice of cloud model, at least to within an order of magnitude (Fig. 4). Figure 8 shows that the effects of clouds (whether grey or non-grey) are limited to ≲2.5 μm when fitting model spectra to the NIRSpec PRISM spectrum. This is consistent with our finding that the NIRCam spectrum is well fitted by models with optically thin clouds. The compensation of the spectral continuum by the non-grey-cloud model at ~3 μm, which is seen for the NIRSpec G395H spectrum (Fig. 6), is not present for the NIRSpec PRISM spectrum, suggesting that this compensation is spurious. It further suggests that having a broader wavelength coverage is more important than higher spectral resolution if the goal is to better understand the effects of clouds on the transmission spectrum. The effects of clouds on the NIRSpec PRISM spectrum has major implications for the detection or non-detection of chemical species, as we will explore later.
![]() |
Fig. 5 Fitting the model containing all chemical species (termed the “reference model”) to the NIRCam spectrum (grey data points) previously published by Ahrer et al. (2023b). Corresponding to our findings in Fig. 4, the 3-parameter temperature-pressure profile is used and grey (left panel) versus non-grey (right panel) are considered in separate retrievals. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, clouds, and collision-induced absorption (CIA) due to hydrogen and helium. The natural logarithm of the Bayes factor between the retrievals with grey and non-grey clouds is 0.7, implying that both grey and non-grey cloud models explain the data equally well because both clouds are optically thin. |
3.1.2 Detection or non-detection of chemical species
To determine if a specific chemical species is detected, we ran a second suite of retrievals using the reference model (which includes all chemical species) as a starting point. Each of the seven (or eight for PRISM retrievals) chemical species is then excluded, in turn, in separate retrievals. The logarithm of the Bayes factor ln , relative to the reference model, is recorded. Figure 7 shows the outcome of this suite of retrievals for all four JWST instrument modes. If the exclusion of an atom or molecule results in a decrease in the Bayesian evidence, relative to the reference model, and yields ln
> 1, then we deem it part of the minimal set of chemical species required to explain the data. By contrast, if the exclusion of an atom or molecule results in an increase in the Bayesian evidence (regardless of the Bayes factor value), then we exclude it from this “simplified model”. We consider the chemical species that are part of the simplified model to be detected in the data.
Using this approach, Fig. 7 shows that only H2O and CO2 are required to fit the NIRCam spectrum. Our derived CO2 abundances from the grey-cloud and non-grey-cloud retrievals are consistent with each other: log and
, respectively. For completeness, Figs. B.1 and B.5 display the full set of posterior distributions of parameters. We checked that the optical depth of the grey cloud is much less than unity (Fig. B.1). We also checked that the range of optical depths for the non-grey cloud, across the NIRCam range of wavelengths, is much less than unity (median values ~10−5−10−4; not shown).
For the NIRISS spectrum, only H2O and K are required to explain the data. However, the derived logarithm of the potassium abundances are inconsistent, because of the effect of clouds: (grey clouds) versus
(non-grey clouds). Again for completeness, Figs. B.2 and B.6 display the full set of posterior distributions of parameters. The optical depth of the grey cloud is well above unity (Fig. B.6). Across the NIRISS range of wavelengths, the optical depth of the non-grey cloud ranges from ~10−2−10 (not shown).
For the NIRSpec G395H spectrum, the simplified model includes H2O and CO2. However, when a non-grey cloud model is used, one has to additionally include SO2. Furthermore, the derived logarithm of the H2O abundances are noticeably different depending on whether the grey or non-grey cloud model is used: (grey cloud) versus
(non-grey cloud). By contrast, we have for the logarithm of the CO2 abundances:
(grey cloud) versus
(non-grey cloud). The full set of posterior distributions of parameters are displayed in Figs. B.3 and B.7. The grey cloud is optically thin (Fig. B.7), whereas the non-grey cloud is optically thick across part of the NIRSpec G395H range of wavelengths.
For NIRSpec PRISM, methane is not clearly detected (based on Bayesian model comparison) regardless of whether grey or non-grey clouds are used, consistent with the findings of Rustamkulov et al. (2023). CO and SO2 are only clearly detected when grey clouds are assumed. When non-grey clouds are assumed, the best-fit water opacity is altered, which in turn compensates for the spectral features of CO and SO2 (Fig. 8). In other words, the compensation of the spectral features by the non-grey cloud model is indirect and occurs via the water spectral features. Figures B.4 and B.8 display the full set of posterior distributions of parameters. The grey cloud is optically thin, while the non-grey cloud has a range of optical depths ~10−1−103 across the PRISM range of wavelengths.
![]() |
Fig. 6 Fitting the model containing all chemical species (termed the “reference model”) to the NIRSpec G395H spectrum (grey data points) previously published by Alderson et al. (2023). Corresponding to our findings in Fig. 4, the retrieval assuming grey clouds (left panel) uses an isothermal profile while the retrieval assuming non-grey clouds (right panel) uses a two-parameter temperature-pressure profile. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, sulphur dioxide, clouds, and collision-induced absorption (CIA) due to hydrogen and helium. The natural logarithm of the Bayes factor between the retrievals with grey and non-grey clouds is 33.8, with the non-grey ones being strongly preferred. |
3.1.3 Instrument-specific best fit spectra
The best-fit spectra from our suite of nested-sampling retrievals are shown in Fig. 9. For each spectrum, we choose between the grey versus non-grey cloud model using the logarithm of the Bayes factor. For the NIRCam spectrum, the Bayesian evidence is higher for the grey-cloud model, but the logarithm of the Bayes factor is only ln = 0.41, implying that the non-grey cloud model fits the data equally well. As previously explained, this is because both the grey and non-grey clouds are optically thin and the model is effectively cloud-free. For the NIRISS spectrum, Bayesian model comparison strongly favours the non-grey cloud model with ln
= 11.
That the non-grey cloud model is preferred for the NIRSpec G395H spectrum (ln = 34), despite its spectral continuum spuriously compensating for the retrieved water abundances, is a cautionary tale for the uncritical acceptance of Bayesian model comparison implemented via nested sampling. In particular, this study has shown that parametric cloud models of capable of mimicking gas absorption features if they have multiple degrees of freedom.
For the NIRSpec PRISM spectrum, Bayesian model comparison is agnostic between the grey and non-grey cloud models (ln = 0.61). But as Figs. 7 and 8 already demonstrate, CO and SO2 are detected only when the grey-cloud model is assumed.
The full sets of posterior distributions are provided in the appendix (Figs. B.1–B.8), while the retrieved values of the parameters are stated in Table A.1. Generally, the retrieved values of log g, Rp, and Rs are consistent with the assumed prior distributions to within one standard deviation. Since these assumed prior distributions are based on the measurements of Mancini et al. (2018), it implies that the planetary radius, mass and gravity are self-consistent with one another. An exception is the inferred planetary gravity from the retrieval performed on the PRISM spectrum assuming non-grey clouds: log g = 2.77 ± 0.02, which is discrepant with the assumed prior value of 2.63 ± 0.05. We speculate that the enhanced gravity comes from the retrieval seeking a diminished pressure scale height to fit the spectral features. However, it is unclear why this occurs only for this particular case, and is possibly a combination of effects from multiple parameters.
![]() |
Fig. 7 Bayesian model comparison between models that include different subsets of chemical species. Starting with the reference model (which includes all chemical species), a selected atom or molecule is excluded. If the Bayesian evidence decreases significantly (such that the logarithm of the Bayes factor is much greater than unity), then this chemical species is considered part of the “simplified model” (which contains the minimal set of species required to fit the data). All other chemical species that produce increases in the Bayesian evidence when they are excluded from the retrieval are thus excluded from the simplified model. In each of the panels, the natural logarithm of the Bayes factor (relative to the reference model) is displayed. Retrievals assuming grey (left column) versus non-grey (right panel) clouds are considered separately. The sophistication of the temperature-pressure profile used is based on the findings of Fig. 4. |
![]() |
Fig. 8 Fitting the model containing all chemical species (termed the “reference model”) to the NIRSpec PRISM spectrum (grey data points) previously published by Rustamkulov et al. (2023). Corresponding to our findings in Fig. 4, the retrievals assuming grey clouds (left panel) and non-grey clouds (right panel) both use isothermal temperature-pressure profiles. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, carbon monoxide, sulphur dioxide, and clouds. The natural logarithm of the Bayes factor between this pair of retrievals is 0.61, implying that Bayesian model comparison has no preference for grey versus non-grey clouds. |
![]() |
Fig. 9 Best-fit models to the various measured spectra (grey data points) from our nested sampling retrievals. The shaded regions associated with the model curves show fit uncertainties of 2 and 5 standard deviations. Spectral features corresponding to various chemical species are labelled. The best-fit models to NIRCam and NIRSpec PRISM spectra assume a grey-cloud model, while those to NIRISS and NIRSpec G395H assume a non-grey-cloud model. Full posterior distributions of parameters corresponding to these fits are provided in Figs. B.1–B.4. |
3.1.4 Elemental abundances
Figure 10 shows the retrieved elemental abundances of carbon (C/H) and oxygen (O/H), which are sometimes referred to as the “metallicity”. The retrieved C/H and O/H values from NIRCam are consistent with each other between the grey and non-grey cloud models, because the clouds are optically thin. C/H is not retrieved from NIRISS, because no carbon-bearing molecule is detected. For NIRSpec G395H, the retrieved C/H values are consistent with each other between the grey and non-grey cloud models, while the O/H values are discrepant by about an order of magnitude. The retrieved C/H values from NIRSpec PRISM are somewhat discrepant between the grey- and non-grey-cloud retrievals by a factor ~100. Overall, the inferred C/H and O/H values vary by factors ~103 and ~105, respectively, between the four different instruments, implying that deriving metallicities from a single instrument mode should be done with caution.
Except for O/H derived from NIRISS, the retrieved C/H and O/H values are sub-solar to solar (relative to the Asplund et al. 2009 values). However, these values cannot be directly compared to those reported by the ERS papers (Alderson et al. 2023; Ahrer et al. 2023b; Feinstein et al. 2023; Rustamkulov et al. 2023), because these were derived using model grids that assume chemical equilibrium and reported as the sum of elemental abundances.
![]() |
Fig. 10 Retrieved elemental abundances (relative to their solar values) from the simplified models (minimal set of chemical species needed to fit data). The solar elemental abundances assumed are C/H = 2.95 × 10−4 and O/H = 5.37 × 10−4 (Asplund et al. 2009). Separate retrievals are performed assuming grey versus non-grey clouds. Displayed are the median values of the posterior distributions and their 1-σ uncertainties. There is no C/H entry for NIRISS, because no carbon-bearing species are detected from that spectrum. |
3.2 Random forest retrievals
In addition to the nested-sampling retrievals, we performed another suite of random-forest retrievals using the Crossfield (2023) model grid as the training set. The best-fit spectra and inferred C, O, and S elemental abundances are displayed in Figs. 11 and 12. The retrieved values of the parameters are stated in Table A.2, while the full sets of posterior distributions are provided in the appendix (Figs. C.1–C.4).
We note that we exclude all data points below 1.0 μm for the NIRISS and NIRSpec PRISM spectra, as the model grid only includes wavelengths above that value. All four retrievals consistently yield super-solar metallicities, albeit with generous uncertainties, consistent with the findings of the ERS team (Alderson et al. 2023; Ahrer et al. 2023b; Feinstein et al. 2023; Rustamkulov et al. 2023).
The random forest retrievals are able to report C/H values from NIRISS spectra, probably because carbon-carrying molecules are assumed to be present. The Crossfield (2023) model grid extends only to elemental abundances of 100× solar. Only the O/H value associated with NIRISS approaches that boundary.
Previous studies have shown that the random forest method tends to be conservative with uncertainty estimation compared to the nested sampling method (Márquez-Neila et al. 2018; Oreshenko et al. 2020; Fisher et al. 2020; Lueber et al. 2023). These larger uncertainties are also reflected in the best-fit spectra (Fig. 11). Another reason for the large model uncertainties is the sparseness of the training set of models.
3.3 Model dependency of C/O ratios
Given the retrieved molecular abundances, the C/O ratio is constructed using
(1)
where Xi is the volume mixing ratio (relative abundance by number) of species i.
Earlier, it was demonstrated that the retrieved water abundances from the NIRSpec G395H spectrum are model-dependent. Therefore, we refrain from constructing C/O ratios using retrieved molecular abundances from nested-sampling retrievals performed on the NIRSpec G395H spectrum. Since the nested-sampling retrievals do not require carbon-bearing species to fit the NIRISS spectrum, we do not construct C/O ratios from it as well.
By contrast, water abundances retrieved from the NIRSpec PRISM spectrum appears to be robust to the choice of cloud model. For the nested sampling retrievals, five of these models are associated with the logarithm of the Bayes factor being less than unity. Therefore, we construct C/O ratios from the retrieved chemical abundances. For the NIRCam spectrum, we construct C/O ratios only from the pair of models with 3-parameter temperature-pressure profiles (based on Bayesian model comparison).
Figure 13 shows that the C/O ratios derived from the NIRCam spectrum are consistent between the retrievals with grey versus non-grey clouds, because both of these clouds are optically thin, i.e. the retrievals are effectively cloudfree. The derived values are consistent with the stellar C/O value of 0.46 ± 0.09 (Polanski et al. 2022), but inconsistent with substellar (and sub-solar) values derived by the ERS team (Ahrer et al. 2023b). Figure 14 demonstrates that the derived water and carbon dioxide abundances are consistent between the grey-cloud and non-grey-cloud retrievals. The C/O ratio derived from the random forest retrievals, using the Crossfield (2023) model grid, has larger uncertainties but is also consistent with the stellar C/O.
However, the C/O ratios derived from the NIRSpec PRISM spectrum depend on whether grey or non-grey clouds are assumed (Fig. 13). The ones inferred from grey-cloud retrievals are super-stellar (and super-solar), whereas the ones inferred from non-grey cloud models are sub-stellar (and sub-solar). At face value, this finding contradicts our earlier claim that the retrieved PRISM water and carbon dioxide abundances are robust. However, Fig. 14 demonstrates that, while the model dependency is diminished (compared to those associated with the NIRSpec G395H spectrum) these retrieved abundances still differ by about half an order of magnitude between grey-cloud and non-grey-cloud retrievals. This variation is sufficient to render the retrieval C/O ratios model-dependent.
Unfortunately, Bayesian model comparison does not offer a way forward, because the logarithm of the Bayes factor is less than unity (In = 0.61) when comparing the grey-cloud and non-grey-cloud retrievals associated with NIRSpec PRISM. Generally, accurate and precise C/O ratios are challenging to obtain, because variations in the models produce small changes in the logarithm of the chemical abundances, which lead to large changes in the C/O ratio.
![]() |
Fig. 11 Best-fit models to the various measured spectra (grey data points) from our random forest retrievals trained on the model grid of Crossfield (2023). The shaded regions associated with the model curves show fit uncertainties of 1 standard deviation. Spectral features corresponding to various chemical species are labelled. |
![]() |
Fig. 12 Median values (and their 1σ uncertainties) of elemental abundances inferred from the median values of the posterior distributions associated with our random forest retrievals. Separate entries for each JWST instrument mode are given. The elemental abundances are given in terms of their solar values. |
4 Summary and discussion
4.1 Comparison to previous work
Since WASP-39b is a benchmark object for JWST exoplanet science, due to its selection for being part of the ERS program, a detailed comparison with previous studies by the ERS team is indispensable.
![]() |
Fig. 13 Median values of the carbon-to-oxygen ratio (C/O) inferred from both our nested sampling and random forest retrievals. Only retrievals performed on NIRCam (top panel) and NIRSpec PRISM (bottom panel) spectra are considered (see text for more details). Only models with the logarithm of the Bayes factor being less than unity (relative to the model with the highest Bayesian evidence) are shown. The three horizontal blue lines are C/O ratios taken from Ahrer et al. (2023b) for NIRCam, while the four horizontal yellow lines are those taken from Rustamkulov et al. (2023) for NIRSpec PRISM. The dashed grey line indicates the solar value of 0.55 (Asplund et al. 2009), while the solid grey line represents the stellar value of 0.46 ± 0.09 (Polanski et al. 2022). |
4.1.1 NIRCam spectrum
Ahrer et al. (2023b) previously reported the detection of water and an upper limit on the abundance of methane from the NIRCam spectrum. Our Bayesian model comparison analysis (Fig. 7) is consistent with the detection of water and non-detection of methane. Ahrer et al. (2023b) reported how the “prominent carbon dioxide feature at 2.8 micrometres is largely masked by water”. Based on the Bayesian evidence, our nested sampling retrievals require H2O and CO2 to fit the NIRCam spectrum.
Ahrer et al. (2023b) inferred “atmospheric metallicities of 1-100 times solar and a substellar C/O ratio”. Our inferred C/O ratios, from both nested-sampling and random forest retrievals, are consistent with the stellar value (Fig. 13). The source of this discrepancy remains unclear.
These metallicities are consistent with the elemental abundances inferred using our random forest retrievals (Fig. 12). Our nested-sampling retrievals infer C/H and O/H values that are consistent with being solar (Fig. 10).
A significant difference between our analysis and that of Ahrer et al. (2023b) is whether clouds are needed to fit the NIRCam spectrum. Figure 3 of that study demonstrates clearly that the cloud model implemented by Ahrer et al. (2023b) provides almost the entire spectral continuum both blueward and redward of the peak of the 2.8 μm CO2 feature.
In Fig. 5, we demonstrate that water and CIA (associated with hydrogen and helium) provide the opacity sources for the spectral continuum. When the contributions of CO2 and H2O are removed via post-processing, a featureless continuum remains that is due to CIA. These properties explain why the clouds in our NIRCam retrievals are optically thin (transparent), regardless of whether they are grey or non-grey. It also explains why our nested-sampling retrievals are able to robustly extract CO2 abundances, despite the water opacity partially obscuring the carbon dioxide opacity in the same wavelength range.
![]() |
Fig. 14 Retrieved chemical abundances (volume mixing ratios) corresponding to the models shown in Fig. 13. Displayed are the median values of the posterior distributions and their 1σ uncertainties. |
4.1.2 NIRISS spectrum
Feinstein et al. (2023) previously reported the detection of water and potassium from the NIRISS spectrum. This is confirmed by our Bayesian model comparison analysis (Fig. 7), which requires only these two species to fit the NIRISS spectrum. We are unable to verify the sub-solar C/O ratio reported by Feinstein et al. (2023) as our nested-sampling retrievals do not require carbon-bearing species to fit the NIRISS spectrum, and therefore we are unable to compute C/O ratios. Our random forest retrievals, which are trained on the Crossfield (2023) model grid, return a C/O ratio of 0.63 ± 0.15.
4.1.3 NIRSpec G395H spectrum
Guzmán-Mesa et al. (2020) previously suggested that the NIRSpec G395H mode is optimal for detecting multiple species of simple carbon-, oxygen-, and hydrogen-carrying molecules. Alderson et al. (2023) confirmed this prediction by reporting the detection of carbon dioxide, water, and sulphur dioxide from the NIRSpec G395H spectrum. Our Bayesian model comparison (Fig. 7) confirms the detection of water and carbon dioxide. However, whether sulphur dioxide is detected depends on whether a grey or non-grey cloud model is used, an issue we will investigate in the next subsection. CIA associated with hydrogen and helium again provides much of the spectral continuum (Fig. 6).
4.1.4 NIRSpec PRISM spectrum
Rustamkulov et al. (2023) previously reported the detection of water, carbon monoxide, carbon dioxide, sodium, and sulphur dioxide from the NIRSpec PRISM spectrum, as well as the nondetection of methane. Our Bayesian model comparison (Fig. 7) confirms these findings and also reports the detection of hydrogen sulphide. However, whether sulphur dioxide is detected depends again on whether a grey or non-grey cloud model is used.
Figure 13 demonstrates that the C/O ratios from our nested-sampling retrievals are consistent with the super-solar values reported by Rustamkulov et al. (2023) when grey-cloud models are assumed, but inconsistent with them when non-grey clouds are assumed. Our random forest retrievals return a C/O ratio that is consistent with the stellar value. Bayesian model comparison does not offer us a way out of these discrepancies, because the logarithm of the Bayes factor is less than unity between the grey-cloud and non-grey-cloud nested sampling retrievals.
4.2 Is sulphur dioxide detected?
Earlier, we demonstrated that the use of a non-grey cloud model produces a spectral continuum at ~3 μm that compensates for the spectral continuum associated with water in the NIRSpec G395H spectrum (Fig. 6). The same figure shows that the presence of the non-grey cloud diminishes the H2O abundance enough that it allows for the spectral feature at 4.0 μm to be fitted by SO2.
When the same non-grey cloud model is fitted to the NIRSpec PRISM spectrum, it does not compensate for the spectral continuum over the same wavelength range (Fig. 8). Given the broader wavelength coverage of PRISM, it suggests that this compensation by the non-grey cloud model, for the fit to the G395H spectrum, is spurious.
The non-grey cloud model influences the required water abundance needed to fit the NIRSpec PRISM spectrum, which in turn leaves no room for the spectral feature at 4.0 μm to be fitted by SO2. Sulphur dioxide is needed for the model fit when grey clouds are assumed (which is also paired with an isothermal profile). Physically, grey clouds are possible when the cloud particle radius exceeds the longest wavelength covered by the spectrum (divided by 2π) i.e. 5.5/2π ~ 1 μm, which is a plausible particle size. There is no preference for either the grey or non-grey cloud model (ln = 0.61) when all chemical species are included in the retrieval (the so-called “reference model”), as well as when only the minimal set of molecules are included (the so-called “simplified model”).
While the spectral feature at 4.0 μm is real, its interpretation (and the detection of SO2) appears to be model-dependent. Bayesian model comparison does not offer us a way out of this conundrum. Generally, we advocate for the robustness of detections to be checked using spectra measured by multiple instruments subjected to different data reduction techniques and interpreted using several retrieval codes.
4.3 The normalization degeneracy of transmission spectra
Benneke & Seager (2012) and Griffith (2014) previously identified a degeneracy between the absolute value of a transmission spectrum and its encoded molecular abundances. Heng & Kitzmann (2017) showed that this “normalization degeneracy” may be explained analytically and is a three-way degeneracy between an arbitrary reference transit radius, its corresponding reference pressure and the total cross section or opacity of the atoms and molecules in the atmosphere. Fisher & Heng (2018) suggested that the shape of spectral features and the continuum could encode enough pressure-dependent information to break this degeneracy.
Figures B.1–B.8 demonstrate that the joint posterior distributions between Rp (which is essentially the normalization of each transmission spectrum as it is the transit radius at 10 bar) and the abundances of atoms and molecules are not degenerate, meaning that these abundances do not depend on the normalization. The width of the posterior distribution of Rp is determined by the degeneracy with the stellar radius. A different choice of the stellar radius would simply produce a different fit for Rp without influencing the retrieved chemical abundances. While the normalization of transmission spectra should remain a fitting parameter in retrievals, we conclude that it should not be degenerate with the retrieved chemical abundances for high-quality JWST spectra.
The degeneracies are between the different molecular abundances themselves. For example, the degeneracy between CO2 and H2O abundances in the interpretation of the NIRCam spectrum corresponds to the obscuration of the opacity of the former by that of the later.
4.4 Summary
In the current study, we have investigated the information content of the JWST spectra of WASP-39b measured by four different instruments. Our main findings are:
The complexity of the temperature-pressure profile required to fit the data depends on the instrument mode used (and the wavelength coverage).
The minimum set of atoms and molecules required to fit a spectrum depends on the instrument mode used.
Using a non-grey cloud model to fit the NIRSpec G395H spectrum results in a spectral continuum that spuriously compensates for the water opacity.
Generally, the elemental abundances retrieved using “free chemistry” models are sub-solar to solar. (The only exception are the oxygen abundances derived from NIRISS, which are super-solar.) However, their exact values are model- and mode-dependent. The elemental abundances inferred using the random forest method trained on the Crossfield (2023) model grid are super-solar (and more consistent with the findings of the ERS papers).
The retrieved C/O values range from sub- to super-solar. C/O ratios retrieved from the NIRCam spectrum are consistent with the stellar value, because these retrievals are essentially cloud-free. By contrast, the C/O ratios retrieved from the NIRSpec PRISM spectrum are super- and sub-solar when grey and non-grey clouds are assumed, respectively.
The detection of sulphur dioxide from the NIRSpec G395H and PRISM spectra depends on whether grey or non-grey clouds are assumed, because the cloud spectral continuum interacts with the water opacity to influence whether the SO2 feature is needed to fit the data.
Acknowledgements
A.L. acknowledges partial financial support from the Swiss National Science Foundation and the European Research Council (via a Consolidator Grant to KH; grant number 771620), as well as administrative support from the Centre for Space and Habitability (CSH). A.N. acknowledges financial support from the Coordination of Improvement of Higher Education Personnel (CAPES) and LMU-Munich, and Luan Ghezzi for support and discussion. We thank Daniel Kitzmann and Jo Barstow for their valuable input. This study would not have been possible without the open-source codes Helios-r2 (Kitzmann et al. 2020) and HELA (Márquez-Neila et al. 2018). Both of them can be found on the Exoclimes Simulation Platform: https://github.com/exoclime. Additionally, we gratefully acknowledge the open-source libraries in the Python programming language that made this work possible: scikit.learn (Pedregosa et al. 2011), numpy (Harris et al. 2020), matplotlib (Hunter 2007), and astropy (Astropy Collaboration 2013, 2018, 2022). This publication makes use of The Data & Analysis Center for Exoplanets (DACE), which is a facility based at the University of Geneva (CH) dedicated to extrasolar planets data visualisation, exchange, and analysis. DACE is a platform of the Swiss National Centre of Competence in Research (NCCR) PlanetS, federating the Swiss expertise in exoplanet research. The DACE platform is available at https://dace.unige.ch.
Appendix A Helios-r2 and HELA outcomes
As explained in the main body of the paper, the “simplified models” are nested-sampling retrievals performed with the minimal set of chemical species required to fit the data. Table A.1 states the median values of the posterior distributions of parameters, as well as their 1-σ uncertainties. Elemental abundances (in terms of their solar values) retrieved using random forest retrievals are stated in Table A.2. Again, these are the median values of the posterior distributions and their 1-σ uncertainties.
Summary of our best-fit Helios-r2 retrieval outcomes for each of the four JWST spectra: NIRCam, NIRISS, NIRSpec G395H, and NIRSpec PRISM.
Inferred elemental abundances (in terms of their solar values) from our best-fit HELA retrieval outcomes for each of the four JWST spectra: NIRCam, NIRISS, NIRSpec G395H, and NIRSpec PRISM.
Appendix B Helios-r2 posteriors and spectra
For completeness, the full sets of posterior distributions of the parameters and spectra from the simplified Helios-r2 (nested sampling) retrieval models are shown in Figs. B.1 to B.8.
![]() |
Fig. B.1 Full set of posterior distributions from our nested sampling retrieval performed on the NIRCam spectrum. A simplified model assuming grey clouds was used. Relative to the model assuming non-grey clouds (see Fig. B.5), the logarithm of the Bayes factor is 0.41. For each marginalised posterior distribution, the vertical dashed lines indicate the median value of the distribution and the associated 1-σ uncertainties. The temperature-pressure profile also displays median values and the associated 1-σ uncertainties. The best-fit spectrum is associated with 1-σ uncertainties as well. |
![]() |
Fig. B.2 Same as Fig. B.1 but for the NIRISS spectrum and assuming the non-grey cloud model. Relative to the model assuming grey clouds (see Fig. B.6), the logarithm of the Bayes factor is 11. |
![]() |
Fig. B.3 Same as Fig. B.1 but for the NIRSpec G395H spectrum and assuming the non-grey cloud model. Relative to the model assuming grey clouds (see Fig. B.7), the logarithm of the Bayes factor is 34. |
![]() |
Fig. B.4 Same as Fig. B.1 but for the NIRSpec PRISM spectrum and assuming the grey cloud model. Relative to the model assuming non-grey clouds (see Fig. B.8), the logarithm of the Bayes factor is 0.61. |
Appendix C HELA posteriors
For completeness, the full sets of posterior distributions of the parameters from the HELA (random forest) retrieval models are shown in Figs. C.1 to C.4. The random forest algorithm was trained on the model grid of Crossfield (2023).
![]() |
Fig. C.1 Full set of posterior distributions from our random forest retrieval (trained on the model grid of Crossfield 2023) performed on the NIRCam spectrum. Shown are the logarithm (base 10) of the elemental abundances of C, O, and S, in terms of their solar value. In each scatter plot, every point represents an individual prediction generated by a single regression tree within the random forest ensemble. The solid vertical lines denote the median values of the posterior distributions. |
References
- Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2011, J. Phys. Chem. A, 115, 6805 [NASA ADS] [CrossRef] [Google Scholar]
- Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2012, J. Chem. Phys., 136, 044319 [NASA ADS] [CrossRef] [Google Scholar]
- Ahrer, E.-M., JWST Transiting Exoplanet Community Early Release Science Team, Alderson, L., et al. 2023a, Nature, 614, 649 [NASA ADS] [CrossRef] [Google Scholar]
- Ahrer, E.-M., Stevenson, K. B., Mansfield, M., et al. 2023b, Nature, 614, 653 [NASA ADS] [CrossRef] [Google Scholar]
- Alderson, L., Wakeford, H. R., Alarm, M. K., et al. 2023, Nature, 614, 664 [NASA ADS] [CrossRef] [Google Scholar]
- Allard, N. F., Spiegelman, F., & Kielkopf, J. F. 2016, A&A, 589, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Allard, N. F., Spiegelman, F., Leininger, T., & Molliere, P. 2019, A&A, 628, A120 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Asplund, M., Grevesse, N., Sauval, A. J., & Scott, P. 2009, ARA&A, 47, 481 [NASA ADS] [CrossRef] [Google Scholar]
- Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]
- Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
- Azzam, A. A. A., Tennyson, J., Yurchenko, S. N., & Naumenko, O. V. 2016, MNRAS, 460, 4063 [NASA ADS] [CrossRef] [Google Scholar]
- Barstow, J. K., & Heng, K. 2020, Space Sci. Rev., 216, 82 [NASA ADS] [CrossRef] [Google Scholar]
- Bean, J. L., Stevenson, K. B., Batalha, N. M., et al. 2018, PASP, 130, 114402 [NASA ADS] [CrossRef] [Google Scholar]
- Benneke, B., & Seager, S. 2012, ApJ, 753, 100 [NASA ADS] [CrossRef] [Google Scholar]
- Breiman, L. 2001, Mach. Learn., 45, 5 [Google Scholar]
- Crossfield, I. J. M. 2023, ApJ, 952, L18 [NASA ADS] [CrossRef] [Google Scholar]
- Esparza-Borges, E., López-Morales, M., Adams Redai, J. I., et al. 2023, ApJ, 955, L19 [NASA ADS] [CrossRef] [Google Scholar]
- Faedi, F., Barros, S. C. C., Anderson, D. R., et al. 2011, A&A, 531, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Feinstein, A. D., Radica, M., Welbanks, L., et al. 2023, Nature, 614, 670 [NASA ADS] [CrossRef] [Google Scholar]
- Feroz, F., & Hobson, M. P. 2008, MNRAS, 384, 449 [NASA ADS] [CrossRef] [Google Scholar]
- Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601 [NASA ADS] [CrossRef] [Google Scholar]
- Fischer, P. D., Knutson, H. A., Sing, D. K., et al. 2016, ApJ, 827, 19 [CrossRef] [Google Scholar]
- Fisher, C., & Heng, K. 2018, MNRAS, 481, 4698 [Google Scholar]
- Fisher, C., Hoeijmakers, H. J., Kitzmann, D., et al. 2020, AJ, 159, 192 [NASA ADS] [CrossRef] [Google Scholar]
- Griffith, C. A. 2014, Philos. Trans. R. Soc. London Ser. A, 372, 20130086 [Google Scholar]
- Grimm, S. L., & Heng, K. 2015, ApJ, 808, 182 [NASA ADS] [CrossRef] [Google Scholar]
- Grimm, S. L., Malik, M., Kitzmann, D., et al. 2021, ApJS, 253, 30 [NASA ADS] [CrossRef] [Google Scholar]
- Guzmán-Mesa, A., Kitzmann, D., Fisher, C., et al. 2020, AJ, 160, 15 [Google Scholar]
- Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
- Heng, K., & Kitzmann, D. 2017, MNRAS, 470, 2972 [Google Scholar]
- Ho, T. K. 1998, IEEE Trans. Pattern Anal. Mach. Intell., 20, 832 [Google Scholar]
- Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]
- Kitzmann, D., & Heng, K. 2018, MNRAS, 475, 94 [NASA ADS] [CrossRef] [Google Scholar]
- Kitzmann, D., Heng, K., Oreshenko, M., et al. 2020, ApJ, 890, 174 [Google Scholar]
- Li, G., Gordon, I. E., Rothman, L. S., et al. 2015, ApJS, 216, 15 [NASA ADS] [CrossRef] [Google Scholar]
- Lueber, A., Kitzmann, D., Bowler, B. P., Burgasser, A. J., & Heng, K. 2022, ApJ, 930, 136 [NASA ADS] [CrossRef] [Google Scholar]
- Lueber, A., Kitzmann, D., Fisher, C. E., et al. 2023, ApJ, 954, 22 [NASA ADS] [CrossRef] [Google Scholar]
- Madhusudhan, N., & Seager, S. 2009, ApJ, 707, 24 [NASA ADS] [CrossRef] [Google Scholar]
- Mancini, L., Esposito, M., Covino, E., et al. 2018, A&A, 613, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Márquez-Neila, P., Fisher, C., Sznitman, R., & Heng, K. 2018, Nat. Astron., 2, 719 [CrossRef] [Google Scholar]
- Mollière, P., Wardenier, J. P., van Boekel, R., et al. 2019, A&A, 627, A67 [Google Scholar]
- Oreshenko, M., Kitzmann, D., Márquez-Neila, P., et al. 2020, AJ, 159, 6 [Google Scholar]
- Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, J. Mach. Learn. Res., 12, 2825 [Google Scholar]
- Pierrehumbert, R. T. 2010, Principles of Planetary Climate (Cambridge: Cambridge University Press) [CrossRef] [Google Scholar]
- Polanski, A. S., Crossfield, I. J. M., Howard, A. W., Isaacson, H., & Rice, M. 2022, Res. Notes Am. Astron. Soc., 6, 155 [Google Scholar]
- Polyansky, O. L., Kyuberis, A. A., Zobov, N. F., et al. 2018, MNRAS, 480, 2597 [NASA ADS] [CrossRef] [Google Scholar]
- Rustamkulov, Z., Sing, D. K., Mukherjee, S., et al. 2023, Nature, 614, 659 [NASA ADS] [CrossRef] [Google Scholar]
- Sisson, S. A., Fan, Y., & Beaumont, M. 2018, Handbook of Approximate Bayesian Computation (Boca Raton: CRC Press) [Google Scholar]
- Skilling, J. 2006, AIP Conf. Ser., 872, 321 [NASA ADS] [CrossRef] [Google Scholar]
- Stevenson, K. B., Lewis, N. K., Bean, J. L., et al. 2016, PASP, 128, 094401 [NASA ADS] [CrossRef] [Google Scholar]
- Tashkun, S. A., & Perevalov, V. I. 2011, J. Quant. Spectrosc. Radiat. Transf., 112, 1403 [NASA ADS] [CrossRef] [Google Scholar]
- Tennyson, J., & Yurchenko, S. N. 2017, Mol. Astrophys., 8, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Trotta, R. 2008, Contemp. Phys., 49, 71 [Google Scholar]
- Tsai, S.-M., Lyons, J. R., Grosheintz, L., et al. 2017, ApJS, 228, 20 [NASA ADS] [CrossRef] [Google Scholar]
- Tsai, S.-M., Malik, M., Kitzmann, D., et al. 2021, ApJ, 923, 264 [NASA ADS] [CrossRef] [Google Scholar]
- Tsai, S.-M., Lee, E. K. H., Powell, D., et al. 2023, Nature, 617, 483 [CrossRef] [Google Scholar]
- Underwood, D. S., Tennyson, J., Yurchenko, S. N., et al. 2016, MNRAS, 459, 3890 [NASA ADS] [CrossRef] [Google Scholar]
- Wakeford, H. R., Sing, D. K., Deming, D., et al. 2018, AJ, 155, 29 [Google Scholar]
- Yurchenko, S. N., & Tennyson, J. 2014, MNRAS, 440, 1649 [Google Scholar]
All Tables
Summary of retrieval parameters and the assumptions for their prior distributions.
Summary of our best-fit Helios-r2 retrieval outcomes for each of the four JWST spectra: NIRCam, NIRISS, NIRSpec G395H, and NIRSpec PRISM.
Inferred elemental abundances (in terms of their solar values) from our best-fit HELA retrieval outcomes for each of the four JWST spectra: NIRCam, NIRISS, NIRSpec G395H, and NIRSpec PRISM.
All Figures
![]() |
Fig. 1 Curated JWST ERS transmission data of WASP-39b from the following four instruments: NIRCam (blue), NIRISS (red), NIRSpec G395H (green), and NIRSpec PRISM (yellow). The data were originally published by Ahrer et al. (2023b), Feinstein et al. (2023), Rustamkulov et al. (2023), and Alderson et al. (2023), respectively. |
In the text |
![]() |
Fig. 2 Schematic describing our suite of nested sampling retrievals, where models with grey and non-grey clouds are tested separately for variations in the temperature-pressure profile, and number of atoms and molecules considered. |
In the text |
![]() |
Fig. 3 Bayesian model comparison between cloudfree and cloudy retrievals, corresponding to the first step of the schematic shown in Fig. 2. The Bayesian evidence relative to the cloudfree model is computed and its natural logarithm is displayed. For all four JWST instruments, there are significant increases in the Bayesian evidence when cloudy models are used to fit the spectra. |
In the text |
![]() |
Fig. 4 Comparing the median values of posterior distributions of selected parameters from retrievals performed on the four different spectra. Arrows indicate upper or lower limits. For each spectrum, a suite of four retrievals with different complexity of the temperature-pressure profile is performed. Isothermal profiles are denoted by “isoT”, while “FP” means that the temperature-pressure profile is parameterised by F fitting/free parameters. Separate suites of retrievals for grey versus non-grey cloud models are performed. Within each suite, filled circles are models where the logarithm of the Bayes factor is less than unity, implying that all of these models explain the data equally well. Within this subset of models indicated by filled circles, we choose the temperature-pressure profile that uses the least number of parameters in our subsequent explorations. |
In the text |
![]() |
Fig. 5 Fitting the model containing all chemical species (termed the “reference model”) to the NIRCam spectrum (grey data points) previously published by Ahrer et al. (2023b). Corresponding to our findings in Fig. 4, the 3-parameter temperature-pressure profile is used and grey (left panel) versus non-grey (right panel) are considered in separate retrievals. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, clouds, and collision-induced absorption (CIA) due to hydrogen and helium. The natural logarithm of the Bayes factor between the retrievals with grey and non-grey clouds is 0.7, implying that both grey and non-grey cloud models explain the data equally well because both clouds are optically thin. |
In the text |
![]() |
Fig. 6 Fitting the model containing all chemical species (termed the “reference model”) to the NIRSpec G395H spectrum (grey data points) previously published by Alderson et al. (2023). Corresponding to our findings in Fig. 4, the retrieval assuming grey clouds (left panel) uses an isothermal profile while the retrieval assuming non-grey clouds (right panel) uses a two-parameter temperature-pressure profile. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, sulphur dioxide, clouds, and collision-induced absorption (CIA) due to hydrogen and helium. The natural logarithm of the Bayes factor between the retrievals with grey and non-grey clouds is 33.8, with the non-grey ones being strongly preferred. |
In the text |
![]() |
Fig. 7 Bayesian model comparison between models that include different subsets of chemical species. Starting with the reference model (which includes all chemical species), a selected atom or molecule is excluded. If the Bayesian evidence decreases significantly (such that the logarithm of the Bayes factor is much greater than unity), then this chemical species is considered part of the “simplified model” (which contains the minimal set of species required to fit the data). All other chemical species that produce increases in the Bayesian evidence when they are excluded from the retrieval are thus excluded from the simplified model. In each of the panels, the natural logarithm of the Bayes factor (relative to the reference model) is displayed. Retrievals assuming grey (left column) versus non-grey (right panel) clouds are considered separately. The sophistication of the temperature-pressure profile used is based on the findings of Fig. 4. |
In the text |
![]() |
Fig. 8 Fitting the model containing all chemical species (termed the “reference model”) to the NIRSpec PRISM spectrum (grey data points) previously published by Rustamkulov et al. (2023). Corresponding to our findings in Fig. 4, the retrievals assuming grey clouds (left panel) and non-grey clouds (right panel) both use isothermal temperature-pressure profiles. In each panel, post-processing is performed to isolate the contribution of water, carbon dioxide, carbon monoxide, sulphur dioxide, and clouds. The natural logarithm of the Bayes factor between this pair of retrievals is 0.61, implying that Bayesian model comparison has no preference for grey versus non-grey clouds. |
In the text |
![]() |
Fig. 9 Best-fit models to the various measured spectra (grey data points) from our nested sampling retrievals. The shaded regions associated with the model curves show fit uncertainties of 2 and 5 standard deviations. Spectral features corresponding to various chemical species are labelled. The best-fit models to NIRCam and NIRSpec PRISM spectra assume a grey-cloud model, while those to NIRISS and NIRSpec G395H assume a non-grey-cloud model. Full posterior distributions of parameters corresponding to these fits are provided in Figs. B.1–B.4. |
In the text |
![]() |
Fig. 10 Retrieved elemental abundances (relative to their solar values) from the simplified models (minimal set of chemical species needed to fit data). The solar elemental abundances assumed are C/H = 2.95 × 10−4 and O/H = 5.37 × 10−4 (Asplund et al. 2009). Separate retrievals are performed assuming grey versus non-grey clouds. Displayed are the median values of the posterior distributions and their 1-σ uncertainties. There is no C/H entry for NIRISS, because no carbon-bearing species are detected from that spectrum. |
In the text |
![]() |
Fig. 11 Best-fit models to the various measured spectra (grey data points) from our random forest retrievals trained on the model grid of Crossfield (2023). The shaded regions associated with the model curves show fit uncertainties of 1 standard deviation. Spectral features corresponding to various chemical species are labelled. |
In the text |
![]() |
Fig. 12 Median values (and their 1σ uncertainties) of elemental abundances inferred from the median values of the posterior distributions associated with our random forest retrievals. Separate entries for each JWST instrument mode are given. The elemental abundances are given in terms of their solar values. |
In the text |
![]() |
Fig. 13 Median values of the carbon-to-oxygen ratio (C/O) inferred from both our nested sampling and random forest retrievals. Only retrievals performed on NIRCam (top panel) and NIRSpec PRISM (bottom panel) spectra are considered (see text for more details). Only models with the logarithm of the Bayes factor being less than unity (relative to the model with the highest Bayesian evidence) are shown. The three horizontal blue lines are C/O ratios taken from Ahrer et al. (2023b) for NIRCam, while the four horizontal yellow lines are those taken from Rustamkulov et al. (2023) for NIRSpec PRISM. The dashed grey line indicates the solar value of 0.55 (Asplund et al. 2009), while the solid grey line represents the stellar value of 0.46 ± 0.09 (Polanski et al. 2022). |
In the text |
![]() |
Fig. 14 Retrieved chemical abundances (volume mixing ratios) corresponding to the models shown in Fig. 13. Displayed are the median values of the posterior distributions and their 1σ uncertainties. |
In the text |
![]() |
Fig. B.1 Full set of posterior distributions from our nested sampling retrieval performed on the NIRCam spectrum. A simplified model assuming grey clouds was used. Relative to the model assuming non-grey clouds (see Fig. B.5), the logarithm of the Bayes factor is 0.41. For each marginalised posterior distribution, the vertical dashed lines indicate the median value of the distribution and the associated 1-σ uncertainties. The temperature-pressure profile also displays median values and the associated 1-σ uncertainties. The best-fit spectrum is associated with 1-σ uncertainties as well. |
In the text |
![]() |
Fig. B.2 Same as Fig. B.1 but for the NIRISS spectrum and assuming the non-grey cloud model. Relative to the model assuming grey clouds (see Fig. B.6), the logarithm of the Bayes factor is 11. |
In the text |
![]() |
Fig. B.3 Same as Fig. B.1 but for the NIRSpec G395H spectrum and assuming the non-grey cloud model. Relative to the model assuming grey clouds (see Fig. B.7), the logarithm of the Bayes factor is 34. |
In the text |
![]() |
Fig. B.4 Same as Fig. B.1 but for the NIRSpec PRISM spectrum and assuming the grey cloud model. Relative to the model assuming non-grey clouds (see Fig. B.8), the logarithm of the Bayes factor is 0.61. |
In the text |
![]() |
Fig. B.5 Counterpart to Fig. B.1 assuming non-grey clouds. |
In the text |
![]() |
Fig. B.6 Counterpart to Fig. B.2 assuming grey clouds. |
In the text |
![]() |
Fig. B.7 Counterpart to Fig. B.3 assuming grey clouds. |
In the text |
![]() |
Fig. B.8 Counterpart to Fig. B.4 assuming non-grey clouds. |
In the text |
![]() |
Fig. C.1 Full set of posterior distributions from our random forest retrieval (trained on the model grid of Crossfield 2023) performed on the NIRCam spectrum. Shown are the logarithm (base 10) of the elemental abundances of C, O, and S, in terms of their solar value. In each scatter plot, every point represents an individual prediction generated by a single regression tree within the random forest ensemble. The solid vertical lines denote the median values of the posterior distributions. |
In the text |
![]() |
Fig. C.2 Same as Fig. C.1 but for the NIRISS spectrum. |
In the text |
![]() |
Fig. C.3 Same as Fig. C.1 but for the NIRSpec G395H spectrum. |
In the text |
![]() |
Fig. C.4 Same as Fig. C.1 but for the NIRSpec PRISM spectrum. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.