Probabilistic cosmological inference on HI tomographic data
We explore the possibility of retrieving cosmological information along with its inherent uncertainty from 21-cm tomographic data at intermediate redshift. The first step in our approach consists of training an encoder, composed of several three dimensional convolutional layers, to cast the neutral...
Saved in:
Published in | Astrophysics and space science Vol. 370; no. 8; p. 76 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
Dordrecht
Springer Netherlands
01.08.2025
Springer Nature B.V |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | We explore the possibility of retrieving cosmological information along with its inherent uncertainty from 21-cm tomographic data at intermediate redshift. The first step in our approach consists of training an encoder, composed of several three dimensional convolutional layers, to cast the neutral hydrogen 3D data into a lower dimension latent space. Once pre-trained, the featurizer is able to generate 3D grid representations which, in turn, will be mapped onto cosmology (
Ω
m
,
σ
8
) via likelihood-free inference. For the latter, which is framed as a density estimation problem, we consider a Bayesian approximation method which exploits the capacity of Masked Autoregressive Flow to estimate the posterior. It is found that the representations learned by the deep encoder are separable in latent space. Results show that the neural density estimator, trained on the latent codes, is able to constrain cosmology with a precision of
R
2
≥
0.91
on all parameters and that most of the ground truth of the instances in the test set fall within
1
σ
uncertainty. It is established that the posterior uncertainty from the density estimator is reasonably calibrated. We also investigate the robustness of the feature extractor by using it to compress out-of-distribution dataset, that is either from a different simulation or from the same simulation but at different redshift. We find that, while trained on the latent codes corresponding to different types of out-of-distribution dataset, the probabilistic model is still reasonably capable of constraining cosmology, with
R
2
≥
0.80
in general. This highlights both the predictive power of the density estimator considered in this work and the meaningfulness of the latent codes retrieved by the encoder. We believe that the approach prescribed in this proof of concept will be of great use when analyzing 21-cm data from various surveys in the near future. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 0004-640X 1572-946X |
DOI: | 10.1007/s10509-025-04470-3 |