Issue |
A&A
Volume 689, September 2024
|
|
---|---|---|
Article Number | A33 | |
Number of page(s) | 17 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/202449961 | |
Published online | 29 August 2024 |
SubDLe: Identification of substructures in cosmological simulations with deep learning
An image segmentation approach to substructure finding
1
Department of Physics, Astronomy Section, University of Trieste,
via G. Tiepolo 11,
34131
Trieste, Italy
e-mail: michela.esposito@phd.units.it
2
INAF – Osservatorio Astronomico di Trieste,
via G. Tiepolo 11,
34143
Trieste, Italy
e-mail: michela.esposito@inaf.it
3
IFPU – Institute for Fundamental Physics of the Universe,
via Beirut 2,
34014
Trieste, Italy
4
INFN – Instituto Nazionale di Fisica Nucleare,
via Valerio 2,
34127
Trieste, Italy
5
ICSC – Italian Research Center on High Performance Computing, Big Data and Quantum Computing,
via Magnanelli 2,
40033
Casalecchio di Reno, Italy
Received:
13
March
2024
Accepted:
19
June
2024
Context. The identification of substructures within halos in cosmological hydrodynamical simulations is a fundamental step to identify the simulated counterparts of real objects, namely galaxies. For this reason, substructure finders play a crucial role in extracting relevant information from the simulation outputs. In general, they are based on physically motivated definitions of substructures, performing multiple steps of particle-by-particle operations, and for this reason they are computationally expensive.
Aims. The purpose of this work is to develop a fast algorithm to identify substructures, especially galaxies, in simulations. The final aim, besides a faster production of subhalo catalogs, is to provide an algorithm fast enough to be applied with a fine time cadence during the evolution of the simulations. Having access to galaxy catalogs while the simulation is evolving is indeed necessary for sub-resolution models based on the global properties of galaxies.
Methods. In this context, machine learning methods offer a wide range of automated tools for fast analysis of large data sets. So, we chose to apply the architecture of a well-known fully convolutional network, U-Net, for the identification of substructures within the mass density field of the simulation. We have developed SubDLe (Substructure identification with Deep Learning), an algorithm that combines a 3D generalization of U-Net and a Friends-of-Friends algorithm, and trained it to reproduce the identification of substructures performed by the SubFind algorithm in a set of zoom-in cosmological hydrodynamical simulations of galaxy clusters. For the feasibility study presented in this work, we have trained and tested SubDLe on galaxy clusters at z = 0, using a NVIDIA P100 GPU. We focused our tests on the version of the algorithm working on the identification of purely stellar substructures, stellar SubDLe.
Results. Our stellar SubDLe proved very efficient in identifying most of the galaxies, 82% on average, in a set of 12 clusters at z = 0. In order to prove the robustness of the method, we also performed some tests at z = 1 and increased the resolution of the input density grids. The average time taken by our SubDLe to analyze one cluster is about 70 s, around a factor 30 less than the typical time taken by SubFind in a single computing node.
Conclusions. Our stellar SubDLe is capable of identifying the majority of galaxies in the challenging high-density environment of galaxy clusters in short computing times. This result has interesting implications in view of the possibility of integrating fast subhalo finders within simulation codes, which can take advantage of accelerators available in state-of-the-art computing nodes.
Key words: hydrodynamics / methods: data analysis / methods: numerical / galaxies: clusters: general
© The Authors 2024
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.