Automated supervised classification of variable stars - I. Methodology

J. Debosscher; L. M. Sarro; C. Aerts; J. Cuypers; B. Vandenbussche; R. Garrido; E. Solano

doi:10.1051/0004-6361:20077638

Home

All issues

Volume 475 / No 3 (December I 2007)

A&A, 475 3 (2007) 1159-1183

Abstract

Free Access

Issue		A&A Volume 475, Number 3, December I 2007


Page(s)		1159 - 1183
Section		Astronomical instrumentation
DOI		https://doi.org/10.1051/0004-6361:20077638
Published online		24 September 2007

A&A 475, 1159-1183 (2007)

Automated supervised classification of variable stars^*

I. Methodology

J. Debosscher¹, L. M. Sarro²^,3, C. Aerts¹^,4, J. Cuypers⁵, B. Vandenbussche¹, R. Garrido⁶ and E. Solano⁷^,3

¹ Instituut voor Sterrenkunde, KU Leuven, Celestijnenlaan 200B, 3001 Leuven, Belgium
² Dpt. de Inteligencia Artificial , UNED, Juan del Rosal, 16, 28040 Madrid, Spain
³ Spanish Virtual Observatory, INTA, Apartado de Correos 50727, 28080 Madrid, Spain
⁴ Department of Astrophysics, Radbout University Nijmegen, PO Box 9010, 6500 GL Nijmegen, The Netherlands
⁵ Royal Observatory of Belgium, Ringlaan 3, 1180 Brussel, Belgium
⁶ Instituto de Astrofísica de Andalucía-CSIC, Apdo 3004, 18080 Granada, Spain
⁷ Laboratorio de Astrofísica Espacial y Física Fundamental, INSA, Apartado de Correos 50727, 28080 Madrid, Spain

Received: 13 April 2007
Accepted: 7 August 2007

Abstract

Context.The fast classification of new variable stars is an important step in making them available for further research. Selection of science targets from large databases is much more efficient if they have been classified first. Defining the classes in terms of physical parameters is also important to get an unbiased statistical view on the variability mechanisms and the borders of instability strips.

Aims.Our goal is twofold: provide an overview of the stellar variability classes that are presently known, in terms of some relevant stellar parameters; use the class descriptions obtained as the basis for an automated “supervised classification” of large databases. Such automated classification will compare and assign new objects to a set of pre-defined variability training classes.

Methods.For every variability class, a literature search was performed to find as many well-known member stars as possible, or a considerable subset if too many were present. Next, we searched on-line and private databases for their light curves in the visible band and performed period analysis and harmonic fitting. The derived light curve parameters are used to describe the classes and define the training classifiers.

Results.We compared the performance of different classifiers in terms of percentage of correct identification, of confusion among classes and of computation time. We describe how well the classes can be separated using the proposed set of parameters and how future improvements can be made, based on new large databases such as the light curves to be assembled by the CoRoT and Kepler space missions.

Conclusions.The derived classifiers' performances are so good in terms of success rate and computational speed that we will evaluate them in practice from the application of our methodology to a large subset of variable stars in the OGLE database and from comparison of the results with published OGLE variable star classifications based on human intervention. These results will be published in a subsequent paper.

Key words: stars: variables: general / stars: binaries: general / techniques: photometric / methods: statistical / methods: data analysis

^*

The documented classification software codes as well as the light curves and the set of classification parameters for the definition stars, are only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/475/1159

© ESO, 2007

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

Automated supervised classification of variable stars*

I. Methodology

Automated supervised classification of variable stars^*