Volume 624, April 2019
|Number of page(s)||15|
|Section||Catalogs and data|
|Published online||01 April 2019|
National Centre for Nuclear Research, Astrophysics Division, ul. Hoża 69, 00-681 Warsaw, Poland
2 Leiden Observatory, Leiden University, PO Box 9513, 2300 RA Leiden, The Netherlands
3 Center for Theoretical Physics, Polish Academy of Sciences, al. Lotników 32/46, 02-668 Warsaw, Poland
4 Astronomical Observatory of the Jagiellonian University, 31-007 Kraków, Poland
5 Faculty of Physics, Ludwig-Maximilians-Universität, Scheinerstr. 1, 81679 Munich, Germany
6 INAF – Osservatorio Astronomico di Capodimonte, Salita Moiariello 16, 80131 Napoli, Italy
7 School of Physics and Astronomy, Sun Yat-sen University, Zhuhai Campus, Guangzhou 519082, PR China
Accepted: 4 February 2019
We present a catalog of quasars selected from broad-band photometric ugri data of the Kilo-Degree Survey Data Release 3 (KiDS DR3). The QSOs are identified by the random forest (RF) supervised machine learning model, trained on Sloan Digital Sky Survey (SDSS) DR14 spectroscopic data. We first cleaned the input KiDS data of entries with excessively noisy, missing or otherwise problematic measurements. Applying a feature importance analysis, we then tune the algorithm and identify in the KiDS multiband catalog the 17 most useful features for the classification, namely magnitudes, colors, magnitude ratios, and the stellarity index. We used the t-SNE algorithm to map the multidimensional photometric data onto 2D planes and compare the coverage of the training and inference sets. We limited the inference set to r < 22 to avoid extrapolation beyond the feature space covered by training, as the SDSS spectroscopic sample is considerably shallower than KiDS. This gives 3.4 million objects in the final inference sample, from which the random forest identified 190 000 quasar candidates. Accuracy of 97% (percentage of correctly classified objects), purity of 91% (percentage of true quasars within the objects classified as such), and completeness of 87% (detection ratio of all true quasars), as derived from a test set extracted from SDSS and not used in the training, are confirmed by comparison with external spectroscopic and photometric QSO catalogs overlapping with the KiDS footprint. The robustness of our results is strengthened by number counts of the quasar candidates in the r band, as well as by their mid-infrared colors available from the Wide-field Infrared Survey Explorer (WISE). An analysis of parallaxes and proper motions of our QSO candidates found also in Gaia DR2 suggests that a probability cut of pQSO > 0.8 is optimal for purity, whereas pQSO > 0.7 is preferable for better completeness. Our study presents the first comprehensive quasar selection from deep high-quality KiDS data and will serve as the basis for versatile studies of the QSO population detected by this survey.
Key words: catalogs / surveys / quasars: general / large-scale structure of Universe / methods: data analysis / methods: observational
A copy of the catalog is available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (188.8.131.52) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/624/A13
We publicly release the resulting catalog at http://kids.strw.leidenuniv.nl/DR3/quasarcatalog.php, and the code at https://github.com/snakoneczny/kids-quasars.
© ESO 2019
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.