A package for the automated classification of periodic variable stars

Dae-Won Kim; Coryn A. L. Bailer-Jones

doi:10.1051/0004-6361/201527188

Home

All issues

Volume 587 (March 2016)

A&A, 587 (2016) A18

Abstract

Free Access

Issue		A&A Volume 587, March 2016


Article Number		A18
Number of page(s)		15
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201527188
Published online		12 February 2016

A&A 587, A18 (2016)

A package for the automated classification of periodic variable stars^⋆

Dae-Won Kim^⋆⋆ and Coryn A. L. Bailer-Jones

Max-Planck Institute for Astronomy, Königstuhl 17, 69117 Heidelberg, Germany

Received: 13 August 2015
Accepted: 4 December 2015

Abstract

We present a machine learning package for the classification of periodic variable stars. Our package is intended to be general: it can classify any single band optical light curve comprising at least a few tens of observations covering durations from weeks to years with arbitrary time sampling. We use light curves of periodic variable stars taken from OGLE and EROS-2 to train the model. To make our classifier relatively survey-independent, it is trained on 16 features extracted from the light curves (e.g., period, skewness, Fourier amplitude ratio). The model classifies light curves into one of seven superclasses – δ Scuti, RR Lyrae, Cepheid, Type II Cepheid, eclipsing binary, long-period variable, non-variable – as well as subclasses of these, such as ab, c, d, and e types for RR Lyraes. When trained to give only superclasses, our model achieves 0.98 for both recall and precision as measured on an independent validation dataset (on a scale of 0 to 1). When trained to give subclasses, it achieves 0.81 for both recall and precision. The majority of misclassifications of the subclass model is caused by confusion within a superclass rather than between superclasses. To assess classification performance of the subclass model, we applied it to the MACHO, LINEAR, and ASAS periodic variables, which gave recall/precision of 0.92/0.98, 0.89/0.96, and 0.84/0.88, respectively. We also applied the subclass model to Hipparcos periodic variable stars of many other variability types that do not exist in our training set, in order to examine how much those types degrade the classification performance of our target classes. In addition, we investigate how the performance varies with the number of data points and duration of observations. We find that recall and precision do not vary significantly if there are more than 80 data points and the duration is more than a few weeks.

Key words: methods: data analysis / methods: statistical / stars: variables: general / techniques: miscellaneous

^⋆

The classifier software of the subclass model is available (in Python) from the GitHub repository (https://goo.gl/xmFO6Q).

^⋆⋆

Current address: Institute of Astronomy and Astrophysics, Academia Sinica, PO Box 23-141, Taipei 10617, Taiwan.

© ESO, 2016

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

A package for the automated classification of periodic variable stars⋆

A package for the automated classification of periodic variable stars^⋆