Automated reliability assessment for spectroscopic redshift measurements★
Aix Marseille Univ. CNRS LAM, Laboratoire d’Astrophysique de Marseille,
e-mail: firstname.lastname@example.org; email@example.com
2 Université Lyon, Université Lyon 1, CNRS/IN2P3, Institut de Physique Nucléaire de Lyon, 69622 Villeurbanne cedex, France
3 INAF – Istituto di Astrofisica Spaziale e Fisica Cosmica Milano, via Bassini 15, 20133 Milano, Italy
4 Dipartimento di Fisica e Astronomia, Università di Bologna, via Gobetti 93/2, 40129 Bologna, Italy
5 INAF–Osservatorio Astronomico di Bologna, via Gobetti 93/3, 40129 Bologna, Italy
Accepted: 9 September 2017
Context. Future large-scale surveys, such as the ESA Euclid mission, will produce a large set of galaxy redshifts (≥106) that will require fully automated data-processing pipelines to analyze the data, extract crucial information and ensure that all requirements are met. A fundamental element in these pipelines is to associate to each galaxy redshift measurement a quality, or reliability, estimate.
Aim. In this work, we introduce a new approach to automate the spectroscopic redshift reliability assessment based on machine learning (ML) and characteristics of the redshift probability density function.
Methods. We propose to rephrase the spectroscopic redshift estimation into a Bayesian framework, in order to incorporate all sources of information and uncertainties related to the redshift estimation process and produce a redshift posterior probability density function (PDF). To automate the assessment of a reliability flag, we exploit key features in the redshift posterior PDF and machine learning algorithms.
Results. As a working example, public data from the VIMOS VLT Deep Survey is exploited to present and test this new methodology. We first tried to reproduce the existing reliability flags using supervised classification in order to describe different types of redshift PDFs, but due to the subjective definition of these flags (classification accuracy ~58%), we soon opted for a new homogeneous partitioning of the data into distinct clusters via unsupervised classification. After assessing the accuracy of the new clusters via resubstitution and test predictions (classification accuracy ~98%), we projected unlabeled data from preliminary mock simulations for the Euclid space mission into this mapping to predict their redshift reliability labels.
Conclusions. Through the development of a methodology in which a system can build its own experience to assess the quality of a parameter, we are able to set a preliminary basis of an automated reliability assessment for spectroscopic redshift measurements. This newly-defined method is very promising for next-generation large spectroscopic surveys from the ground and in space, such as Euclid and WFIRST.
Key words: methods: data analysis / methods: statistical / techniques: spectroscopic / galaxies: distances and redshifts / surveys
A table of the reclassified VVDS redshifts and reliability is only available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (188.8.131.52) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/611/A53
© ESO 2018