Issue |
A&A
Volume 691, November 2024
|
|
---|---|---|
Article Number | A223 | |
Number of page(s) | 14 | |
Section | Stellar structure and evolution | |
DOI | https://doi.org/10.1051/0004-6361/202451247 | |
Published online | 15 November 2024 |
Advanced classification of hot subdwarf binaries using artificial intelligence techniques and Gaia DR3 data
1
Institute of Theoretical Physics and Astronomy, Vilnius University, Sauletekio av. 3, 10257 Vilnius, Lithuania
2
Centro de Astrobiología (CSIC-INTA), Camino Bajo del Castillo s/n, E-28692 Villanueva de la Cañada, Madrid, Spain
3
Applied Physics Department, Universidade de Vigo, Campus Lagoas-Marcosende, s/n, E-36310 Vigo, Spain
4
Centro de Investigación Mariña, Universidade de Vigo, GEOMA, Edificio Olimpia Valencia, Campus Lagoas-Marcosende, E-36310 Vigo, Spain
5
CIGUS CITIC – Department of Computer Science and Information Technologies, University of A Coruña, s/n, E-15071 A Coruña, Spain
6
CIGUS CITIC – Department of Nautical Sciences and Marine Engineering, University of A Coruña, Paseo de Ronda 51, E-15011 A Coruña, Spain
7
INAF – Osservatorio Astrofisico di Arcetri, Largo E. Fermi 5, 50125 Firenze, Italy
8
IES de Beade, Consellería de Educación e Ordenación Universitaria, Camino do Outeiro 10, E-36312 Vigo, Spain
9
Observatorio Astronómico Nacional (OAN), Alfonso XII 3, 28014 Madrid, Spain
⋆ Corresponding author; carlos.viscasillas@ff.vu.lt
Received:
25
June
2024
Accepted:
24
September
2024
Context. Hot subdwarf stars are compact blue evolved objects, burning helium in their cores surrounded by a tiny hydrogen envelope. In the Hertzsprung-Russell Diagram they are located by the blue end of the Horizontal Branch. Most models agree on a quite probable common envelope binary evolution scenario in the Red Giant phase. However, the current binarity rate for these objects is yet unsolved, but key, question in this field.
Aims. This study aims to develop a novel classification method for identifying hot subdwarf binaries within large datasets using Artificial Intelligence techniques and data from the third Gaia data release (GDR3). The results will be compared with those obtained previously using Virtual Observatory techniques on coincident samples.
Methods. The methods used for hot subdwarf binary classification include supervised and unsupervised machine learning techniques. Specifically, we have used Support Vector Machines (SVM) to classify 3084 hot subdwarf stars based on their colour-magnitude properties. Among these, 2815 objects have Gaia DR3 BP/RP spectra, which were classified using Self-Organizing Maps (SOM) and Convolutional Neural Networks (CNN). In order to ensure spectral quality, previously to SOM and CNN classification, our 2815 BP/RP set were pre-analysed with two different approaches: the cosine similarity technique and the Uniform Manifold Approximation and Projection (UMAP) technique. Additional analysis onto a golden sample of 88 well-defined objects, is also presented.
Results. The findings demonstrate a high agreement level (∼70–90%) with the classifications from the Virtual Observatory Sed Analyzer (VOSA) tool. This shows that the SVM, SOM, and CNN methods effectively classify sources with an accuracy comparable to human inspection or non-AI techniques. Notably, SVM in a radial basis function achieves 70.97% reproducibility for binary targets using photometry, and CNN reaches 84.94% for binary detection using spectroscopy. We also found that the single–binary differences are especially observable on the infrared flux in our Gaia DR3 BP/BR spectra, at wavelengths larger than ∼700 nm.
Conclusions. We find that all the methods used are in fairly good agreement and are particularly effective to discern between single and binary systems. The agreement is also consistent with the results previously obtained with VOSA. In global terms, considering all quality metrics, CNN is the method that provides the best accuracy. The methods also appear effective for detecting peculiarities in the spectra. While promising, challenges in dealing with uncertain compositions highlight the need for caution, suggesting further research is needed to refine techniques and enhance automated classification reliability, particularly for large-scale surveys.
Key words: methods: data analysis / techniques: spectroscopic / binaries: general / subdwarfs
© The Authors 2024
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.