Automated reliability assessment for spectroscopic redshift measurements

S. Jamal; V. Le Brun; O. Le Fèvre; D. Vibert; A. Schmitt; C. Surace; Y. Copin; B. Garilli; M. Moresco; L. Pozzetti

doi:10.1051/0004-6361/201731305

Home

All issues

Volume 611 (March 2018)

A&A, 611 (2018) A53

Abstract

Free Access

Issue		A&A Volume 611, March 2018


Article Number		A53
Number of page(s)		26
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201731305
Published online		29 March 2018

A&A 611, A53 (2018)

Automated reliability assessment for spectroscopic redshift measurements^★

S. Jamal¹, V. Le Brun¹, O. Le Fèvre¹, D. Vibert¹, A. Schmitt¹, C. Surace¹, Y. Copin², B. Garilli³, M. Moresco⁴^,5 and L. Pozzetti⁵

¹ Aix Marseille Univ. CNRS LAM, Laboratoire d’Astrophysique de Marseille, 13013 Marseille, France
e-mail: sara.jamal@lam.fr; vincent.lebrun@lam.fr
² Université Lyon, Université Lyon 1, CNRS/IN2P3, Institut de Physique Nucléaire de Lyon, 69622 Villeurbanne cedex, France
³ INAF – Istituto di Astrofisica Spaziale e Fisica Cosmica Milano, via Bassini 15, 20133 Milano, Italy
⁴ Dipartimento di Fisica e Astronomia, Università di Bologna, via Gobetti 93/2, 40129 Bologna, Italy
⁵ INAF–Osservatorio Astronomico di Bologna, via Gobetti 93/3, 40129 Bologna, Italy

Received: 2 June 2017
Accepted: 9 September 2017

Abstract

Context. Future large-scale surveys, such as the ESA Euclid mission, will produce a large set of galaxy redshifts (≥10⁶) that will require fully automated data-processing pipelines to analyze the data, extract crucial information and ensure that all requirements are met. A fundamental element in these pipelines is to associate to each galaxy redshift measurement a quality, or reliability, estimate.

Aim. In this work, we introduce a new approach to automate the spectroscopic redshift reliability assessment based on machine learning (ML) and characteristics of the redshift probability density function.

Methods. We propose to rephrase the spectroscopic redshift estimation into a Bayesian framework, in order to incorporate all sources of information and uncertainties related to the redshift estimation process and produce a redshift posterior probability density function (PDF). To automate the assessment of a reliability flag, we exploit key features in the redshift posterior PDF and machine learning algorithms.

Results. As a working example, public data from the VIMOS VLT Deep Survey is exploited to present and test this new methodology. We first tried to reproduce the existing reliability flags using supervised classification in order to describe different types of redshift PDFs, but due to the subjective definition of these flags (classification accuracy ~58%), we soon opted for a new homogeneous partitioning of the data into distinct clusters via unsupervised classification. After assessing the accuracy of the new clusters via resubstitution and test predictions (classification accuracy ~98%), we projected unlabeled data from preliminary mock simulations for the Euclid space mission into this mapping to predict their redshift reliability labels.

Conclusions. Through the development of a methodology in which a system can build its own experience to assess the quality of a parameter, we are able to set a preliminary basis of an automated reliability assessment for spectroscopic redshift measurements. This newly-defined method is very promising for next-generation large spectroscopic surveys from the ground and in space, such as Euclid and WFIRST.

Key words: methods: data analysis / methods: statistical / techniques: spectroscopic / galaxies: distances and redshifts / surveys

^★

A table of the reclassified VVDS redshifts and reliability is only available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/611/A53

© ESO 2018

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

Automated reliability assessment for spectroscopic redshift measurements★

Automated reliability assessment for spectroscopic redshift measurements^★