Issue |
A&A
Volume 697, May 2025
|
|
---|---|---|
Article Number | A204 | |
Number of page(s) | 15 | |
Section | Extragalactic astronomy | |
DOI | https://doi.org/10.1051/0004-6361/202453630 | |
Published online | 19 May 2025 |
Selection of optically variable active galactic nuclei via a random forest algorithm⋆
1
Department of Physics, University of Napoli “Federico II”, via Cinthia 9, 80126 Napoli, Italy
2
Millennium Institute of Astrophysics (MAS), Nuncio Monseñor Sotero Sanz 100, Providencia, Santiago, Chile
3
INAF – Osservatorio Astronomico di Capodimonte, via Moiariello 16, 80131 Napoli, Italy
4
CIRA – Centro Italiano di Ricerche Aerospaziali, via Maiorise s.n.c., 81043 Capua, Italy
5
INFN – Sezione di Napoli, via Cinthia 9, 80126 Napoli, Italy
6
Dipartimento di Matematica e Fisica, Università Roma Tre, Via della Vasca Navale 84, 00146 Roma, Italy
7
INAF – Osservatorio astronomico di Roma, Via Frascati 33, I-00040 Monte Porzio Catone, Italy
8
European Southern Observatory, Karl-Schwarzschild-Strasse 2, 85748 Garching bei München, Germany
⋆⋆ Corresponding author: demetra.decicco@unina.it
Received:
30
December
2024
Accepted:
21
March
2025
Context. A defining characteristic of active galactic nuclei (AGN) that distinguishes them from other astronomical sources is their stochastic variability, which is observable across the entire electromagnetic spectrum. Upcoming optical wide-field surveys, such as the Vera C. Rubin Observatory’s Legacy Survey of Space and Time, are set to transform astronomy by delivering unprecedented volumes of data for time domain studies. This data influx will require the development of the expertise and methodologies necessary to manage and analyze it effectively.
Aims. This project focuses on optimizing AGN selection through optical variability in wide-field surveys and aims to reduce the bias against obscured AGN. We tested a random forest (RF) algorithm trained on various feature sets to select AGN. The initial dataset consisted of 54 observations in the r-band and 25 in the g-band of the COSMOS field, captured with the VLT Survey Telescope over a 3.3-year baseline.
Methods. Our analysis relies on feature sets derived separately from either band plus a set of features combining data from both bands, mostly characterizing AGN on the basis of their variability properties and obtained from their light curves. We trained multiple RF classifiers using different subsets of selected features and assessed their performance via targeted metrics.
Results. Our tests provide valuable insights into the use of multiband and multivisit data for AGN identification. We compared our findings with previous studies and dedicated part of the analysis to potential enhancements in selecting obscured AGN. The expertise gained and the methodologies developed here are readily applicable to datasets from other ground- and space-based missions.
Key words: methods: statistical / surveys / galaxies: active
© The Authors 2025
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.