Issue |
A&A
Volume 687, July 2024
|
|
---|---|---|
Article Number | A269 | |
Number of page(s) | 20 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/202346426 | |
Published online | 23 July 2024 |
Identifying type II quasars at intermediate redshift with few-shot learning photometric classification
1
Departamento de Física e Astronomia, Faculdade de Ciências, Universidade do Porto,
Rua do Campo Alegre 687,
4169-007
Porto,
Portugal
e-mail: pedro.cunha@astro.up.pt
2
Instituto de Astrofísica e Ciências do Espaço, Universidade do Porto, CAUP,
Rua das Estrelas,
4150-762
Porto,
Portugal
3
DTx – Digital Transformation CoLab, Building 1, Azurém Campus, University of Minho,
4800-058
Guimarães,
Portugal
4
Departamento de Física, Faculdade de Ciências, Universidade de Lisboa,
Edifício C8, Campo Grande,
1749-016
Lisboa,
Portugal
5
Instituto de Astrofísica e Ciências do Espaço, Universidade de Lisboa,
OAL, Tapada da Ajuda,
1349-018
Lisboa,
Portugal
Received:
15
March
2023
Accepted:
14
May
2024
Context. A sub-population of AGNs where the central engine is obscured are known as type II quasars (QSO2s). These luminous AGNs have a thick and dusty torus that obscures the accretion disc from our line of sight. Thus, their special orientation allows for detailed studies of the AGN-host co-evolution. Increasing the sample size of QSO2 sources in critical redshift ranges is crucial for understanding the interplay of AGN feedback, the AGN-host relationship, and the evolution of active galaxies.
Aims. We aim to identify QSO2 candidates in the ‘redshift desert’ using optical and infrared photometry. At this intermediate redshift range (i.e. 1 ≤ ɀ ≤ 2), most of the prominent optical emission lines in QSO2 sources (e.g. CIVλl549; [OIII]λλ4959, 5008) fall either outside the wavelength range of the SDSS optical spectra or in particularly noisy wavelength ranges, making QSO2 identification challenging. Therefore, we adopted a semi-supervised machine learning approach to select candidates in the SDSS galaxy sample.
Methods. Recent applications of machine learning in astronomy focus on problems involving large data sets, with small data sets often being overlooked. We developed a ‘few-shot’ learning approach for the identification and classification of rare-object classes using limited training data (200 sources). The new AMELIA pipeline uses a transfer-learning based approach with decision trees, distance-based, and deep learning methods to build a classifier capable of identifying rare objects on the basis of an observational training data set.
Results. We validated the performance of AMELIA by addressing the problem of identifying QSO2s at 1 ≤ ɀ ≤ 2 using SDSS and WISE photometry, obtaining an F1-score above 0.8 in a supervised approach. We then used AMELIA to select new QSO2 candidates in the ‘redshift desert’ and examined the nature of the candidates using SDSS spectra, when available. In particular, we identified a sub-population of [NeV]λ3426 emitters at ɀ ~ 1.1, which are highly likely to contain obscured AGNs. We used X-ray and radio crossmatching to validate our classification and investigated the performance of photometric criteria from the literature showing that our candidates have an inherent dusty nature. Finally, we derived physical properties for our QSO2 sample using photoionisation models and verified the AGN classification using an SED fitting.
Conclusions. Our results demonstrate the potential of few-shot learning applied to small data sets of rare objects, in particular QSO2s, and confirms that optical-IR information can be further explored to search for obscured AGNs. We present a new sample of candidates to be further studied and validated using multi-wavelength observations.
Key words: methods: data analysis / methods: statistical / techniques: photometric / techniques: spectroscopic / galaxies: active / quasars: general
© The Authors 2024
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.