Issue |
A&A
Volume 692, December 2024
|
|
---|---|---|
Article Number | A208 | |
Number of page(s) | 17 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/202450370 | |
Published online | 13 December 2024 |
Transient classifiers for Fink
Benchmarks for LSST
1
Centro Brasileiro de Pesquisas Físicas,
Rua Dr, Xavier Sigaud 150,
Rio de Janeiro,
Brazil
2
Centro Federal de Educação Tecnológica Celso Suckow da Fonseca,
Rodovia Márcio Covas, lote J2,
Itaguaí,
Brazil
3
Université Clermont-Auvergne, CNRS, LPCA,
63000
Clermont-Ferrand,
France
4
Université Paris-Saclay, CNRS/IN2P3, IJCLab,
15 rue Georges Clemenceau,
91405
Orsay,
France
5
Centre for Astrophysics and Supercomputing, Swinburne University of Technology,
Mail Number H29,
PO Box 218,
31122
Hawthorn,
VIC,
Australia
6
ARC Centre of Excellence for Gravitational Wave Discovery (OzGrav),
John St,
Hawthorn,
VIC
3122,
Australia
7
Aix Marseille Univ, CNRS, CNES, LAM,
Marseille,
France
8
European Southern Observatory,
Karl-Schwarzschild-Straße 2,
Garching
85748,
Germany
★ Corresponding author; bernardo@cbpf.br
Received:
15
April
2024
Accepted:
10
November
2024
Context. The upcoming Legacy Survey of Space and Time (LSST) at the Vera C. Rubin Observatory is expected to detect a few million transients per night, which will generate a live alert stream during the entire ten years of the survey. This stream will be distributed via community brokers whose task is to select subsets of the stream and direct them to scientific communities. Given the volume and complexity of the anticipated data, machine learning (ML) algorithms will be paramount for this task.
Aims. We present the infrastructure tests and classification methods developed within the FINK broker in preparation for LSST. This work aims to provide detailed information regarding the underlying assumptions and methods behind each classifier and enable users to make informed follow-up decisions from FINK photometric classifications.
Methods. Using simulated data from the Extended LSST Astronomical Time-series Classification Challenge (ELAsTiCC), we showcase the performance of binary and multi-class ML classifiers available in FINK. These include tree-based classifiers coupled with tailored feature extraction strategies as well as deep learning algorithms. Moreover, we introduce the CBPF (Centro Brasileiro de Pesquisas Físicas) Alert Transient Search (CATS), a deep learning architecture specifically designed for this task.
Results. Our results show that FINK classifiers are able to handle the extra complexity that is expected from LSST data. CATS achieved ≥93% precision for all classes except ‘long’ (for which it achieved ∼83%), while our best performing binary classifier achieves ≥98% precision and ≥99% completeness when classifying the periodic class.
Conclusions. ELAsTiCC was an important milestone in preparing the FINK infrastructure to deal with LSST-like data. Our results demonstrate that FINK classifiers are well prepared for the arrival of the new stream, but this work also highlights that transitioning from the current infrastructures to Rubin will require significant adaptation of the currently available tools. This work was the first step in the right direction.
Key words: methods: data analysis / surveys / supernovae: general
© The Authors 2024
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.