Issue |
A&A
Volume 698, May 2025
|
|
---|---|---|
Article Number | A35 | |
Number of page(s) | 11 | |
Section | Cosmology (including clusters of galaxies) | |
DOI | https://doi.org/10.1051/0004-6361/202453115 | |
Published online | 26 May 2025 |
Combining summary statistics with simulation-based inference for the 21 cm signal from the Epoch of Reionisation
Sorbonne Université, Observatoire de Paris, PSL University, CNRS, LUX, F-75014 Paris, France
⋆ Corresponding author: benoit.semelin@obspm.fr
Received:
21
November
2024
Accepted:
16
April
2025
The 21 cm signal from the Epoch of Reionisation will be observed with the upcoming Square Kilometer Array (SKA). We expect it to yield a full tomography of the signal, which opens up the possibility to explore its non-Gaussian properties. This raises the question of how can we extract the maximum information from tomography and derive the tightest constraint on the signal. In this work, instead of looking for the most informative summary statistics, we investigate how to combine the information from two sets of summary statistics using simulation-based inference. To this end, we trained neural density estimators (NDE) to fit the implicit likelihood of our model, the LICORICE code, using the Loreli II database. We trained three different NDEs: one to perform Bayesian inference on the power spectrum, one to perform it on the linear moments of the pixel distribution function (PDF), and one to work with the combination of the two. We performed ∼900 inferences at different points in our parameter space and used them to assess both the validity of our posteriors with a simulation-based calibration (SBC) and the typical gain obtained by combining summary statistics. We find that our posteriors are biased by no more than ∼20% of their standard deviation and under-confident by no more than ∼15%. Then, we established that combining summary statistics produces a contraction of the 4D volume of the posterior (derived from the generalised variance) in 91.5% of our cases, and in 70–80% of the cases for the marginalised 1D posteriors. The median volume variation is a contraction of a factor of a few for the 4D posteriors and a contraction of 20–30% in the case of the marginalised 1D posteriors. This shows that our approach is a possible alternative to looking for so-called sufficient statistics in the theoretical sense.
Key words: methods: numerical / methods: statistical / dark ages / reionization / first stars
© The Authors 2025
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.