Radio source-component association for the LOFAR Two-metre Sky Survey with region-based convolutional neural networks

Rafaël I. J. Mostert; Kenneth J. Duncan; Lara Alegre; Huub J. A. Röttgering; Wendy L. Williams; Philip N. Best; Martin J. Hardcastle; Raffaella Morganti

doi:10.1051/0004-6361/202243478

Home

All issues

Volume 668 (December 2022)

A&A, 668 (2022) A28

Abstract

Open Access

Issue		A&A Volume 668, December 2022


Article Number		A28
Number of page(s)		21
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/202243478
Published online		01 December 2022

A&A 668, A28 (2022)

Radio source-component association for the LOFAR Two-metre Sky Survey with region-based convolutional neural networks

Rafaël I. J. Mostert¹^,2^,3, Kenneth J. Duncan¹^,4, Lara Alegre⁴, Huub J. A. Röttgering¹, Wendy L. Williams¹, Philip N. Best⁴, Martin J. Hardcastle⁵ and Raffaella Morganti²^,6

¹ Leiden Observatory, Leiden University, PO Box 9513, 2300 RA Leiden, The Netherlands
e-mail: mostert@strw.leidenuniv.nl
² ASTRON, the Netherlands Institute for Radio Astronomy, Oude Hoogeveensedijk 4, 7991 PD Dwingeloo, The Netherlands
³ Leiden Institute of Advanced Computer Science, Niels Bohrweg 1, 2300 RA Leiden, The Netherlands
⁴ SUPA, Institute for Astronomy, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK
⁵ Centre for Astrophysics Research, Department of Physics, Astronomy and Mathematics, University of Hertfordshire, College Lane, Hatfield AL10 9AB, UK
⁶ Kapteyn Astronomical Institute, University of Groningen, PO Box 800, 9700 AV Groningen, The Netherlands

Received: 4 March 2022
Accepted: 9 September 2022

Abstract

Context. Radio loud active galactic nuclei (RLAGNs) are often morphologically complex objects that can consist of multiple, spatially separated, components. Only when the spatially separated radio components are correctly grouped together can we start to look for the corresponding optical host galaxy and infer physical parameters such as the size and luminosity of the radio object. Existing radio detection software to group these spatially separated components together is either experimental or based on assumptions that do not hold for current generation surveys, such that, in practice, astronomers often rely on visual inspection to resolve radio component association. However, applying visual inspection to all the hundreds of thousands of well-resolved RLAGNs that appear in the images from the Low Frequency Array (LOFAR) Two-metre Sky Survey (LoTSS) at 144 MHz, is a daunting, time-consuming process, even with extensive manpower.

Aims. Using a machine learning approach, we aim to automate the radio component association of large (>15 arcsec) radio components.

Methods. We turned the association problem into a classification problem and trained an adapted Fast region-based convolutional neural network to mimic the expert annotations from the first LoTSS data release. We implemented a rotation data augmentation to reduce overfitting and simplify the component association by removing unresolved radio sources that are likely unrelated to the large and bright radio components that we consider using predictions from an existing gradient boosting classifier.

Results. For large (>15 arcsec) and bright (>10 mJy) radio components in the LoTSS first data release, our model provides the same associations for 85.3% ± 0.6 of the cases as those derived when astronomers perform the association manually. When the association is done through public crowd-sourced efforts, a result similar to that of our model is attained.

Conclusions. Our method is able to efficiently carry out manual radio-component association for huge radio surveys and can serve as a basis for either automated radio morphology classification or automated optical host identification. This opens up an avenue to study the completeness and reliability of samples of radio sources with extended, complex morphologies.

Key words: methods: data analysis / catalogs / surveys / galaxies: active

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.