Issue |
A&A
Volume 699, July 2025
|
|
---|---|---|
Article Number | A213 | |
Number of page(s) | 19 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/202453339 | |
Published online | 08 July 2025 |
Simulation-based inference has its own Dodelson–Schneider effect (but it knows that it does)
1
University Observatory, Faculty of Physics, Ludwig-Maximilians-Universität,
Scheinerstr. 1,
81677
Munich,
Germany
2
Munich Center for Machine Learning (MCML),
Germany
3
Excellence Cluster ORIGINS,
Boltzmannstr. 2,
85748
Garching,
Germany
★ Corresponding author: jed.homer@physik.lmu.de
Received:
6
December
2024
Accepted:
7
May
2025
Context. Making inferences about physical properties of the Universe requires knowledge of the data likelihood. A Gaussian distribution is commonly assumed for the uncertainties with a covariance matrix estimated from a set of simulations. The noise in such covariance estimates causes two problems: it distorts the width of the parameter contours, and it adds scatter to the location of those contours that is not captured by the widths themselves. For non-Gaussian likelihoods, an approximation may be derived via simulation-based inference (SBI). It is often implicitly assumed that parameter constraints from SBI analyses, which do not use covariance matrices, are not affected by the same problems as parameter estimation with a covariance matrix estimated from simulations.
Aims. We aim to measure the coverage and marginal variances of the posteriors derived using density-estimation SBI over many identical experiments to investigate whether SBI suffers from effects similar to those of covariance estimation in Gaussian likelihoods.
Methods. We used a neural-posterior and likelihood estimation with continuous and masked autoregressive normalising flows for density estimation. We fitted our approximate posterior models to simulations drawn from a Gaussian linear model, so the SBI result can be compared to the true posterior, and effects related to noise in the covariance estimate are known analytically. We tested linear and neural-network-based compression, demonstrating that neither method circumvents the issues of covariance estimation.
Results. SBI suffers an inflation of posterior variance that is equal to or greater than the analytical result in covariance estimation for Gaussian likelihoods for the same number of simulations. This inflation of variance is captured conservatively by the reported confidence intervals, leading to an acceptable coverage regardless of the number of simulations. The assumption that SBI requires a smaller number of simulations than covariance estimation for a Gaussian likelihood analysis is inaccurate. The limitations of traditional likelihood analysis with simulation-based covariance remain for SBI with finite simulation budget. Despite these issues, we show that SBI correctly draws the true posterior contour when there are enough simulations.
Key words: methods: data analysis / methods: statistical / cosmological parameters / large-scale structure of Universe
© The Authors 2025
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.