A&A 470, 1201-1214 (2007)
DOI: 10.1051/0004-6361:20077571
T. H. Dall1 - C. Foellmi2 - J. Pritchard3 - G. Lo Curto4 - C. Allende Prieto5 - H. Bruntt6 - P. J. Amado7 - T. Arentoft8 - M. Baes9 - E. Depagne4,10 - M. Fernandez11 - V. Ivanov4 - L. Koesterke5 - L. Monaco4 - K. O'Brien4 - L. M. Sarro12 - I. Saviane4 - J. Scharwächter4 - L. Schmidtobreick4 - O. Schütz4 - A. Seifahrt13 - F. Selman4 - M. Stefanon4 - M. Sterzik4
1 - Gemini Observatory, 670 N. A'ohoku Pl., Hilo, HI 96720, USA
2 - Laboratoire d'AstrOphysique de Grenoble, 414 rue de la Piscine, 38400 Saint-Martin d'Hères, France
3 - European Southern Observatory, Karl Schwarzschild-Str. 2, 85748 Garching bei München, Germany
4 - European Southern Observatory, Alonso de Cordova 3107, Casilla 19001, Vitacura, Santiago, Chile
5 - McDonald Observatory and Department of Astronomy, The University of Texas, Austin, TX 78712-1083, USA
6 - School of Physics A28, University of Sydney, 2006 NSW, Australia
7 - Universidad de Granada-IAA(CSIC), PO Box 3004, 18080 Granada, Spain
8 - Department of Physics and Astronomy, University of Aarhus, 8000 Aarhus C, Denmark
9 - Sterrenkundig Observatorium, Universiteit Gent, Krijgslaan 281 S9, 9000 Gent, Belgium
10 - Pontificia Universidad Catolica de Chile, Vicuna Mackenna 4860, Santiago de Chile, Chile
11 - Instituto de Astrofísica de Andalucía, Camino Bajo de Huétor 50, 18008 Granada, Spain
12 - Department of Artificial Intelligence, ETSI Informática, Juan del Rosal 16, 28040 Madrid, Spain
13 - Astrophysikalisches Institut und Universitäts-Sternwarte Jena, Schillergässchen 2, 07745 Jena, Germany
Received 29 March 2007 / Accepted 28 May 2007
Abstract
Context. About 500 new variable stars enter the General Catalogue of Variable Stars (GCVS) every year. Most of them, however, lack spectroscopic observations, which remain critical for a correct assignment of the variability type and for the understanding of the object.
Aims. The Variable Star One-shot Project (VSOP) is aimed at (1) providing the variability type and spectral type of all unstudied variable stars, (2) processing, publishing, and making the data available as automatically as possible, and (3) generating serendipitous discoveries. This first paper describes the project itself, the acquisition of the data, the dataflow, the spectroscopic analysis and the on-line availability of the fully calibrated and reduced data. We also present the results on the 221 stars observed during the first semester of the project.
Methods. We used the high-resolution echelle spectrographs HARPS and FEROS at the ESO La Silla Observatory (Chile) to survey known variable stars. Once the data are reduced by the dedicated pipelines, the radial velocities are determined from cross correlation with synthetic template spectra, and the spectral types are determined by an automatic minimum distance matching to synthetic spectra, with traditional manual spectral typing cross-checks. The variability types are determined by manually evaluating the available light curves and the spectroscopy. In the future, a new automatic classifier, currently being developed by members of the VSOP team and based on these spectroscopic data and on the photometric classifier developed for the COROT and Gaia space missions, will be used.
Results. We confirm or revise spectral types of 221 variable stars from the GCVS. We identify 26 previously unknown multiple systems, among them several visual binaries with spectroscopic binary individual components. We present new individual results for the multiple systems V349 Vel and BC Gru, for the composite spectrum star V4385 Sgr, for the T Tauri star V1045 Sco, and for DM Boo which we re-classify as a BY Draconis variable. The complete data release can be accessed via the VSOP web site.
Key words: stars: variables: general - stars: fundamental parameters - methods: observational - astronomical data bases: miscellaneous
Recent examples of the misidentification of variables, where the designation was based solely on photometric light curve appearance, and subsequently corrected by taking one single snapshot spectrum, include:
FH Leo, which was long thought to be the only known cataclysmic variable (CV) to form part of a binary system, being designated as a nova-like variable by Kazarovets et al. (2003) based on an outburst observed by the Hipparcos satellite. High-resolution FEROS spectroscopy allowed us to refute the classification and show that the stars are normal F8 and G0 dwarfs (Dall et al. 2005), and that the outburst cannot possibly be due to an accretion disk, but rather to a superflare, to erroneous Hipparcos measurements, or to a CV "hidden" in the light of the two normal stars (Vogt 2006).
XY Pic was included in a study of statistical properties of W UMa type variables (active,
very fast rotating contact binaries) by Selam (2004), who concluded that it was among the
most active stars of the sample, based on a fit of its Hipparcos light curve, using
synthetic light curves based on physical parameters. FEROS spectra of XY Pic allowed us to
show that the star is a rather slowly rotating F3 giant, with no measurable
chromospheric activity (Dall 2005), and is likely a δ Scuti pulsator.
This example shows that even a high-quality
light curve analysis can result in wrong conclusions about the nature of an object without
spectroscopic confirmation.
TV Ret was long thought to be a CV due to an outburst observed
photometrically in 1977. A single low-resolution spectrum revealed the object to be a compact
emission line galaxy at
,
possibly hosting an extremely bright supernova as the cause of the
outburst (Schmidtobreick et al. 2007).
The above examples illustrate the need for snapshot spectra, and show that the vast collection of poorly studied variable stars contains many errors in terms of variability type designation, which may in many cases "cover up" some potentially interesting physical phenomena under a wrong and seemingly dull label. In this paper we describe a new large project, the Variable Stars One-shot Project (VSOP), undertaken to provide the required "snapshots". We present the motivation and scope of VSOP in Sect. 2, the instrumentation and data handling in Sect. 3, and present results from the first observing semester from the European Southern Observatory's La Silla site in Sect. 4, listing the revised spectral and variability types for 221 stars. The results and the reduced data are freely accessible from our website. We conclude the paper with plans for the future of VSOP in Sect. 5, where we also address the problem of automatic variability classification.
Motivated by the situation outlined above, the goals of VSOP are:
Another aspect that contributes to the science efficiency is serendipity. The VSOP observations
target poorly studied variable stars, many of which exhibit poorly studied phenomena. We
thus expect to obtain by chance
data that either merit in-depth follow-up studies or shed light on some hitherto obscured phenomenon.
Much of this work may naturally be done by groups not affiliated with VSOP.
We cross-correlated the GCVS with the SDSS spectroscopic list to estimate the possible contribution of the SDSS towards accurate spectroscopic classification of the variable stars. The overlap consists of 80 objects (less than 0.3% of the GCVS) and given the degree of SDSS completeness we expect some additional objects, on the order of 10, leading us to conclude that less than 0.5% of the GCVS has been covered by SDSS. The low number of stars in common is a consequence both of the faint brightness limit of the SDSS (g>14), and of the science goals of the SDSS, which dictated selection for spectroscopic follow-up primarily of extragalactic targets, obtaining spectra of stars only if some fibers remained unused.
However, spectra of a significant number of stars will be obtained under the Sloan Extension for Galactic Understanding and Exploration (SEGUE; Newberg & Sloan Digital Sky Survey Collaboration 2003; Rockosi 2006), which plans to obtain spectra of 240 000 Milky Way stars over 3500 square degrees with the same spectrograph. The goal of this survey is to provide radial velocities and metallicities with typical accuracies of 10 km s-1 and 0.3 dex respectively. SEGUE would thus complement VSOP. However, VSOP provides superior spectral resolution and S/N at any given magnitude, since we use 2-4 m-class telescopes. Moreover, VSOP is already producing and releasing data.
Another large survey is the RAdial Velocity Experiment (RAVE; Steinmetz et al. 2006), aimed at kinematic studies of the local Milky Way environment. While this survey targets a large number of stars (24 748 in the first data release), the spectral coverage is limited to the IR Ca triplet region, and only at a moderate resolution of 7500. Thus, RAVE is likely not very useful as a general classification and discovery study.
Of existing surveys, the GAUDI (Solano et al. 2005) is the one most similar to VSOP. GAUDI is a
photometric and spectroscopic database of objects that may be observed by the COROT mission, covering
all targets down to V=9.5 inside the COROT accessibility zone - an area on the sky of
radius.
While GAUDI covers a small area of the sky, looking for stellar variability in all available objects, VSOP
is targeting known variables all over the sky. Thus, while complementary, our scope is different.
Furthermore, a number of spectroscopic surveys have been performed in recent years, targeting the variability of
individual types or classes of stars. Examples include surveys for
β Cep stars (Telting et al. 2006),
Hipparcos-selected O-B supergiants (Lefever et al. 2007),
γ Dor stars (de Cat et al. 2006), and
studies of Ap star oscillations (Kurtz et al. 2006). While these are all high-resolution spectroscopic studies,
they target a limited subset of stellar types, while the scope of VSOP is all of stellar variability, spanning
the complete HR diagram, including all phenomena. In this respect, VSOP is a unique project, and it is our hope that VSOP
will also turn out to be a unique resource for researchers of any stellar variability phenomena.
We present here the results of the first semester of VSOP observations, collected between April and October 2006 with the two high-resolution Echelle spectrographs FEROS and HARPS, of the ESO La Silla Observatory in Chile.
The 221 stars reported here belong to the bright end of the unstudied variables of the
GCVS, which is now 40% complete (i.e., having reliable, wide wavelength coverage spectroscopy)
down to mv = 10. The rate of discovery of new variables has been
relatively constant in recent years, and is very low at the bright end of the distribution. Assuming
a similar number of observations for the coming semesters, we can expect to complete the bright end of the
unstudied variables of the Southern hemisphere in 1-2 additional observing seasons.
Going to fainter magnitudes, the completeness decreases rapidly, reaching a plateau of around 20-25% fainter than mv = 13,
not including as yet unrecognized variables.
The spectrograph is fed by two fibres providing simultaneous spectra
of object plus, in the case of VSOP observations, an empty sky region
for background subtraction. The fibres are illuminated via apertures
of 2.0 arcsec on the sky, separated by 2.9 arcmin.
A dedicated
pipeline implemented as a MIDAS context provides, in almost real-time,
extracted 1-dimensional, wavelength calibrated spectra.
FEROS Period-77 VSOP observations have been obtained with exposure
times ranging from 180 s to 1200 s. Given the relaxed observing
constraints, signal-to-noise (S/N) ranges from 10 to
370 at V.
The standard calibration plan, which provides bias, flat-field,
wavelength calibration and spectrophotometric standard star
observations, has been used for this programme.
The standard calibration set executed prior to each observing night included bias, flat-fields and wavelength calibration. The HARPS data are automatically processed upon acquisition by a dedicated pipeline developed by the HARPS consortium, which provides bias subtraction, order localization, flat fielding, cosmic-ray filtering, order extraction (using the Horne technique, Horne 1986, which assigns lower weights to the pixels away from the peak in the spatial profile at any given wavelength) and radial velocity determination through cross correlation of each spectral order with a predefined stellar mask (synthetic spectrum).
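The cross-correlation step can be illustrated with a small toy example. The sketch below is not the HARPS pipeline: the line list, the Gaussian line shape, the velocity grid and all function names are assumptions made for illustration only. It samples a spectrum at the Doppler-shifted positions of a line mask for a range of trial velocities, and takes the minimum of the resulting cross-correlation function (CCF), refined with a three-point parabola, as the radial velocity.

```python
import math

C_KMS = 299792.458  # speed of light in km/s


def toy_spectrum(wave, line_centers, rv_kms, depth=0.5, sigma=0.05):
    """Unit continuum with Gaussian absorption lines, Doppler-shifted by rv_kms."""
    doppler = 1.0 + rv_kms / C_KMS
    return [
        1.0 - sum(depth * math.exp(-0.5 * ((w - c * doppler) / sigma) ** 2)
                  for c in line_centers)
        for w in wave
    ]


def interpolate(wave, flux, w):
    """Linear interpolation on a uniformly sampled wavelength grid."""
    step = wave[1] - wave[0]
    i = max(0, min(int((w - wave[0]) / step), len(wave) - 2))
    t = (w - wave[i]) / step
    return flux[i] * (1.0 - t) + flux[i + 1] * t


def cross_correlate(wave, flux, mask_lines, velocities):
    """Mean flux sampled at the shifted mask line positions, per trial velocity."""
    return [
        sum(interpolate(wave, flux, c * (1.0 + v / C_KMS)) for c in mask_lines)
        / len(mask_lines)
        for v in velocities
    ]


def rv_from_ccf(velocities, ccf):
    """Locate the CCF minimum and refine it with a three-point parabola."""
    k = min(range(1, len(ccf) - 1), key=ccf.__getitem__)
    y0, y1, y2 = ccf[k - 1], ccf[k], ccf[k + 1]
    dv = velocities[1] - velocities[0]
    return velocities[k] + 0.5 * (y0 - y2) / (y0 - 2.0 * y1 + y2) * dv


# Toy example: three hypothetical lines (in Angstrom), shifted by 14.5 km/s.
wave = [5000.0 + 0.01 * i for i in range(2001)]
mask = [5005.0, 5010.0, 5015.0]
flux = toy_spectrum(wave, mask, rv_kms=14.5)
velocities = [-50.0 + 0.5 * i for i in range(201)]
ccf = cross_correlate(wave, flux, mask, velocities)
print(f"recovered RV = {rv_from_ccf(velocities, ccf):.2f} km/s")
```

A real mask contains many more lines with individual weights, and the CCF is usually fitted with a Gaussian rather than a parabola; the parabola is used here only to keep the sketch short.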
HARPS Period-77 VSOP observations have been obtained with exposure times ranging from 90 s
to 1200 s, resulting in S/N between 30 and 150, averaging to 100 at 550 nm.
All the data and basic information about the stars are stored in a wiki website located at http://vsop.sc.eso.org, from where the reduced data of this First Data Release can be freely accessed. We expect to make incoming data freely available through subsequent data releases, with only a few months' delay to allow for our initial data analysis. Research work benefiting from VSOP data should reference this paper, and include the following acknowledgement:
Based on data provided by the VSOP collaboration, through the VSOP wiki database operated at ESO Chile and ESO Garching.
For the organization of information, we have chosen the MediaWiki software, developed for the open and free on-line encyclopedia Wikipedia. This ensures a reliable and extendable website to which all VSOP members can contribute easily from their own daily workplace. This is of growing importance given the distribution of VSOP members around the world, as evidenced by the list of affiliations for this paper.
The MediaWiki software is based on the article/discussion wiki philosophy, which means that each article page has an associated discussion page. For VSOP we have extended the software to restrict the discussion pages to VSOP members only, while the article pages are reserved for already published results, freely accessible to anybody. Thus, each star has a dedicated article page, where basic information (coordinates, magnitude, link to SIMBAD, finding charts, old variability and spectral types, when available) is provided. The page also describes the observation details and the analysis with its results, and gives a list of references, catalogues and download links to plots of the spectra as well as to all the reduced data products: Cross-Correlation Function (CCF) and wavelength calibrated one-dimensional spectrum, all of which are publicly available.
Finally, the MediaWiki software allows the wiki website to be scriptable. We have thus developed a VSOP-dedicated software module written in Python which makes the development of scripts dedicated to VSOP pages much easier. These robot scripts can then update a large amount of repetitive information, or collect the results of given subcategories of stars. Table 2 of this paper, for instance, is automatically produced by one of these scripts.
Table 2 lists the 221 stars observed during ESO Period 77: 90 of these were observed with HARPS, 131 with FEROS.
Table 1:
A sample of results from the automatic analyses.
We list the names, HD numbers and spectral types from the manual spectral typing.
We give the Teff, log g, and [Fe/H] and estimated uncertainties determined by VASP and VWA.
v sin i values are calculated with VWA and have uncertainties of 10-20%.
The first four targets are not VSOP targets, but high S/N HARPS spectra taken from Dall et al. (2006), which we have used
to calibrate our tools. Full detailed results for all VSOP stars can be found online (see text).
Table 2: The VSOP stars of the first observing season.
We designate SB2 and SB3 binarity status from the presence of multiple peaks in the CCF, or via a careful analysis of the spectrum, identifying spectral features belonging to stars of different spectral types. The latter approach is needed when the stars have widely different spectral types, and the CCF mask only "sees" one of the components (see e.g. V4385 Sgr, Sect. 4.3.3). Since we have only one epoch, the SB1 designation is not used.
Out of the 221 observed targets, we identify 22 new SB2 binaries, several of these as components of wide visual binaries. In addition we find four new SB3 binaries. However, the binarity of most of our targets remains undetermined due to our single-epoch approach. For many of the stars we could not compute a CCF, due to the difficulty of building reliable templates for such peculiar objects. We postpone accurate determination of the binary status of such stars to a later work.
The search for the optimal solution is carried out using the Nelder-Mead algorithm and
third order interpolation in a grid of synthetic spectra based on Kurucz (2006) model
atmospheres and modern line and continuous opacities (Allende Prieto et al. 2003a,b).
The grid currently in use covers
K.
To overcome problems with fast rotation, the grid has been constructed
with a resolution of 38 km s-1, corresponding to a spectral resolving power of 7700.
For faster rotational velocities, the accuracy
of the fits decreases significantly as rotation increases.
The solar reference abundances are from the photospheric values compiled by Asplund et al. (2005). Known spectroscopic binaries (cf. Sect. 4.1), as well as stars clearly outside the grid boundaries, are not run through VASP.
Future upgrades to VASP will include wider temperature range grids,
as currently only about one third of our targets fall within the limits of the grid. Other upgrades will be
parameter estimation using other
spectral intervals besides H,
and the ability to handle
rotational broadening.
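The combination of a Nelder-Mead search with third-order interpolation in a grid of synthetic spectra can be sketched as follows. This is a toy illustration under stated assumptions, not the VASP code: the "synthetic spectra" are a hypothetical single Gaussian line whose depth depends on Teff and whose width depends on log g, the grid ranges are invented, and the simplex routine is a simplified Nelder-Mead variant (reflection, expansion, inside contraction, shrink).

```python
import math


def cubic_weights(nodes, x):
    """Start index and Lagrange weights of the 4-node stencil around x."""
    h = nodes[1] - nodes[0]
    i0 = max(0, min(int((x - nodes[0]) / h) - 1, len(nodes) - 4))
    xs = nodes[i0:i0 + 4]
    weights = []
    for i in range(4):
        w = 1.0
        for j in range(4):
            if j != i:
                w *= (x - xs[j]) / (xs[i] - xs[j])
        weights.append(w)
    return i0, weights


def interpolate_grid(grid, teff_nodes, logg_nodes, teff, logg):
    """Third-order interpolation of grid[i][j] (a list of fluxes) in Teff and log g."""
    i0, wt = cubic_weights(teff_nodes, teff)
    j0, wg = cubic_weights(logg_nodes, logg)
    return [
        sum(wt[a] * wg[b] * grid[i0 + a][j0 + b][p]
            for a in range(4) for b in range(4))
        for p in range(len(grid[0][0]))
    ]


def nelder_mead(f, x0, step, tol=1e-14, max_iter=1000):
    """Simplified Nelder-Mead simplex search; returns (best point, best value)."""
    n = len(x0)
    simplex = [list(x0)] + [
        [x + (step[i] if i == j else 0.0) for j, x in enumerate(x0)]
        for i in range(n)
    ]
    values = [f(p) for p in simplex]
    for _ in range(max_iter):
        order = sorted(range(n + 1), key=values.__getitem__)
        simplex = [simplex[i] for i in order]
        values = [values[i] for i in order]
        if values[-1] - values[0] < tol:
            break
        worst = simplex[-1]
        centroid = [sum(p[j] for p in simplex[:-1]) / n for j in range(n)]

        def along(t):
            # Point on the line from the worst vertex through the centroid.
            return [c + t * (c - w) for c, w in zip(centroid, worst)]

        reflected = along(1.0)
        f_ref = f(reflected)
        if f_ref < values[0]:
            expanded = along(2.0)
            f_exp = f(expanded)
            if f_exp < f_ref:
                simplex[-1], values[-1] = expanded, f_exp
            else:
                simplex[-1], values[-1] = reflected, f_ref
        elif f_ref < values[-2]:
            simplex[-1], values[-1] = reflected, f_ref
        else:
            contracted = along(-0.5)
            f_con = f(contracted)
            if f_con < values[-1]:
                simplex[-1], values[-1] = contracted, f_con
            else:  # shrink every vertex towards the best one
                best = simplex[0]
                simplex = [best] + [[b + 0.5 * (x - b) for b, x in zip(best, p)]
                                    for p in simplex[1:]]
                values = [values[0]] + [f(p) for p in simplex[1:]]
    k = min(range(n + 1), key=values.__getitem__)
    return simplex[k], values[k]


def toy_model(teff, logg, npix=41):
    """Hypothetical stand-in for a synthetic spectrum: one Gaussian line."""
    depth = 0.9 - 2.0e-4 * (teff - 5000.0)
    width = 1.0 + 0.5 * (logg - 3.0)
    return [1.0 - depth * math.exp(-0.5 * ((p - npix // 2) / width) ** 2)
            for p in range(npix)]


# Invented grid ranges, for illustration only.
teff_nodes = [5000.0 + 250.0 * i for i in range(9)]
logg_nodes = [3.0 + 0.5 * j for j in range(5)]
grid = [[toy_model(t, g) for g in logg_nodes] for t in teff_nodes]


def fit(observed, start=(5500.0, 3.6)):
    """Minimise the summed squared residuals between model and observation."""
    def chi2(params):
        model = interpolate_grid(grid, teff_nodes, logg_nodes, *params)
        return sum((m - o) ** 2 for m, o in zip(model, observed))
    return nelder_mead(chi2, start, step=(100.0, 0.2))
```

Because the simplex method needs only function values, it tolerates the small kinks in the derivative that a piecewise interpolant introduces at the grid nodes, which is one plausible reason to pair the two techniques.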
For comparison, we have included results from a classical abundance analysis
obtained with the VWA package (Bruntt et al. 2004).
VWA works with the original (full resolution) spectra and determines the abundance of each individual line.
It relies on the lines of Fe, Cr and Ti to automatically adjust the microturbulence,
Teff and log g of the
applied atmospheric models (Heiter et al. 2002).
VWA is a semi-automatic procedure and to obtain optimal results the user needs to
(1) carefully correct the continuum and
(2) inspect the fit of individual lines.
The abundances found with VWA are based on corrected
log gf values,
which are derived from the HARPS spectrum of the Sun.
We did not analyse PP Hya due to its high v sin i,
which is known to
cause problems for VWA's automatic procedures.
While VWA seems to produce more robust results, the process involves a lot of manual intervention and is
not at this point suited for an automated analysis.
We have in addition performed manual spectral
classifications, by comparison with standard
stars of the MK spectral classification as defined by Morgan et al. (1978) and Keenan & McNeil (1976).
The practical comparison has been done with the help of the Digital Spectral Classification Atlas by Gray,
using high resolution spectra of spectral standards obtained with HARPS, FEROS and UVES.
Whenever the emission cores of the Ca II H&K lines have been present, we have determined the
luminosity type from the Wilson-Bappu effect (Wilson & Vainu Bappu 1957), using the calibration by Pace et al. (2003).
Since many older classifications are done in this way, it is instructive to investigate the differences between this traditional
human skill driven task, and a modern automatic classification.
We have identified several causes for problems in the spectral classifications, the most common one associated with binarity.
Spectroscopic and very close visual binaries (separations <1.5 arcsec)
often show multiple peaks in the CCF, and are thus easy
to filter out of the VASP analysis. One such example is V349 Vel (Sect. 4.3.1).
We also found that early-type and very metal-weak stars limit the precision of the VASP fit, and of course also influence the
manual classification.
In many cases, apparent low metallicity may be due to a binary companion contributing light to the spectrum, causing
the metallic lines to appear weaker. One clear example of this can be seen in V4385 Sgr (Sect. 4.3.3).
In a few cases we have found disagreement between the VASP-computed log g
and the absolute magnitude derived from the
Wilson-Bappu relation, in most cases caused by being near the lower temperature
limit of the grid.
Figure 1: The CCF of V349 Vel. Capitals denote the CCDM designation, while lower case letters label the individual components of each visual component.
The GCVS variability type is α2 Canum Venaticorum (ACV),
which is reserved for magnetic B-A stars with peculiar spectra.
We suspect that the SB4 appearance of V349 Vel may have been misinterpreted as spectral peculiarities. We do not see
evidence of magnetic activity in the spectrum, hence we also consider BY Dra
type variability unlikely.
From our spectral analysis (Sect. 4.2) we
find that the parameters of the primary component are
consistent with an early F type star near the main sequence.
Thus, the primary component of V349 Vel could
well belong to the δ Scuti or γ Dor variables. Alternatively,
one or both of the SB2 components could show eclipses.
Otero (2003) listed V349 Vel as an EA-type eclipsing binary, with P = 3.02 d, and notes that additional
shorter periods may be present as well.
More detailed photometric or spectroscopic observations are needed to
understand the components of the system better.
Our FEROS spectrum of BC Gru not only confirms the contact binary nature, but also reveals a third component,
as evident in Fig. 2. All three stars have approximately the same spectral type.
We have performed a simple spectral fitting using archive HARPS spectra of the K2V star HD 22049 (ε Eri).
We have artificially broadened and Doppler-shifted two copies of this template, then combined with an unbroadened copy of
the same template to emulate the spectrum of BC Gru. The fitting has been done using STARMOD (Barden 1985; Montes et al. 1995,2000).
A better fit could likely be obtained using
different templates for each component, but in order to do a more accurate analysis, one would need spectra at several different
orbital phases. Our analysis yields rotational velocities of
km s-1 and
km s-1,
while the sharp-lined component is a very slow rotator, with measured
km s-1. In Fig. 2 the a component is the red-most one.
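The idea of emulating a triple-lined spectrum from one template can be sketched in a few lines. The following is not STARMOD: it is a minimal illustration that assumes a spectrum sampled on a uniform velocity grid, a rotational profile with linear limb darkening (coefficient 0.6, an assumption), Doppler shifts rounded to whole pixels, and invented weights, rotational velocities and radial velocities.

```python
import math


def rotation_kernel(vsini, dv, limb_darkening=0.6):
    """Discretised rotational broadening profile, normalised to unit sum."""
    half = int(vsini / dv)
    raw = []
    for k in range(-half, half + 1):
        x2 = max(0.0, 1.0 - (k * dv / vsini) ** 2) if vsini > 0 else 1.0
        raw.append(2.0 * (1.0 - limb_darkening) * math.sqrt(x2)
                   + 0.5 * math.pi * limb_darkening * x2)
    total = sum(raw)
    return [r / total for r in raw]


def broaden_and_shift(template, vsini, rv, dv):
    """Convolve with the rotation kernel, then shift by rv (whole pixels)."""
    kernel = rotation_kernel(vsini, dv)
    half = len(kernel) // 2
    shift = round(rv / dv)
    n = len(template)
    out = []
    for i in range(n):
        acc = 0.0
        for k, weight in enumerate(kernel):
            j = i - shift - (k - half)
            # Outside the sampled range the spectrum is padded with continuum.
            acc += weight * (template[j] if 0 <= j < n else 1.0)
        out.append(acc)
    return out


def composite(template, components, dv):
    """Flux-weighted sum of copies; components = (weight, vsini, rv) tuples."""
    total_weight = sum(w for w, _, _ in components)
    spectra = [(w / total_weight, broaden_and_shift(template, vs, rv, dv))
               for w, vs, rv in components]
    return [sum(w * s[i] for w, s in spectra) for i in range(len(template))]


# Toy template: one Gaussian line on a 1 km/s per pixel velocity grid.
dv = 1.0
template = [1.0 - 0.6 * math.exp(-0.5 * ((i - 300) / 5.0) ** 2) for i in range(601)]

# Two broad, shifted copies plus one sharp copy (all values invented).
blend = composite(template, [(0.4, 80.0, -100.0), (0.4, 80.0, 100.0), (0.2, 0.0, 0.0)], dv)
```

Since the kernel is normalised, broadening preserves the equivalent width of each line, so the relative weights alone set how much each component contributes to the blend.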
Figure 2: Spectrum of BC Gru (see text).
Our high quality VSOP spectrum of V4385 Sgr
(S/N = 130 at 550 nm) was obtained on 2006-05-04 using HARPS, and is the
first full optical range high resolution spectrum of the source.
The HARPS CCF is single-peaked at high contrast (11%) but asymmetric, as expected due to geometric distortion.
The CCF is constructed using a G2 mask, and hence represents the average line profile of
the F-component only at a radial velocity of 14.5 km s-1.
The FWHM of the CCF is 13 km s-1, which is a safe upper limit for v sin i.
Thus the rotation is significantly slower than
for the average early F-star, which is surprising given the short variability period, and the fact
that binary evolution tends to increase rotation rates as the orbit
shrinks.
The spectrum (Fig. 3) shows the typical He I lines of the B component superimposed on the metallic-line spectrum. Note that the RVs of the two components seem compatible within the uncertainties.
The VSOP spectrum shows Balmer emission lines, strongest at Hα
(FWHM = 132 km s-1), and
core fill-in at Hβ and Hγ
(see also Merrill & Burwell 1950).
Weak P Cyg profiles
can be seen in several lines, however not in the Balmer lines.
Note that the lines of both components, including the strong Hα
emission, are compatible with
km s-1.
The interstellar Na doublet is saturated and presents also a P Cyg profile from the underlying stellar spectrum.
Based on the observed emission properties, and on the indisputable two-component nature, we can safely
rule out that the 2.62 d period is due to γ Dor-like pulsations in a single F star. Rather,
due to the slow rotation of the F-star and the apparent same RV of the two components, we propose
that V4385 Sgr could be a near pole-on viewed close binary system, showing only slight eclipses. We propose that
the orbital period is equal to the photometric period, caused either by a partial eclipse of the smaller component, or by
variations in the wind structure over the orbit.
Unfortunately, only scattered photometric data exist, as summarized by Reed (1998), making it difficult at
this point to test the hypothesis. Given its brightness,
the star would be an easy target for small telescopes.
Figure 3:
Plot of selected spectral regions of V4385 Sgr. Fluxes are normalized to the continuum.
Top: over-plotted is the VSOP spectrum of the B5 star AI Pyx. The composite B+F nature of the V4385 Sgr
spectrum is evident.
We have secured two FEROS spectra of this object, on
2006-08-11 and
2006-09-16. Our spectral
typing procedure (Sect. 4.2) yields consistently a somewhat later spectral type, K7,
compatible with a giant (or sub-giant). A radial velocity of 25.77 km s-1 is determined.
A striking feature is the Hα profile (Fig. 4).
The line is in absorption, but exhibits slightly asymmetric double-peaked emission features in both wings,
possibly indicative of a rotating disk.
The star exhibits a strong lithium (6708 Å) absorption, with a measured equivalent width of 0.48 Å. Taken together with its late spectral type, this is a strong youth indicator. Note that the star
is compatible with being a (weak-lined, as the Hα
emission is inverted) T Tauri star according
to the commonly accepted spectroscopic criteria by Martin (1997). Comparing its lithium
strength to recent measurements in a sample of nearby, young objects (Torres et al. 2006), we conclude that it is
at least as young as the
10 Myr old
β Pictoris moving group members. Its apparent
youth and its location in the northern outskirts of the Upper Scorpius OB association indicate
that V1045 Sco might be related to, or in the foreground of, this 5 Myr young association (Preibisch & Zinnecker 1999).
This is supported by the apparent presence of remnant circumstellar material, inferred from the
Hα
profile and mid-IR excess.
Its measured heliocentric
RV = 25.77 km s-1 is, however, inconsistent with the bulk motion of
Upper Scorpius members, which peaks around -10 km s-1
(see, e.g., Sartori et al. 2003; Jilinski et al. 2006).
The star may therefore well be located in the foreground of the association, possibly related to the Gould Belt
(Guillout et al. 1998).
Figure 4:
Selected spectral regions of V1045 Sco.
There are also uvby observations (one set, from 1993) and the V and B values from Tycho.
The Tycho-2 Spectral Type Catalog (Wright et al. 2003) converts the G5 spectral class
into a temperature of
K. The GCVS lists the star as an IB type, i.e., a
poorly studied irregular variable of intermediate to late spectral type.
The VSOP FEROS spectrum was taken on 2006-08-12 with an integration time of 600 s. From the CCF
we derive a v sin i of
65 km s-1 and a radial velocity of -42 km s-1.
The most prominent features are strong Ca II H&K emission cores, which, together with the
fast rotation, point to a young active star. Analysis of the lithium line region reveals a higher
than solar photospheric lithium abundance, confirming the notion of a relatively young, active star.
We thus feel confident classifying DM Boo as a BY Draconis star, noting the low
photometric amplitude typical of such stars.
To compute some statistics of the VSOP observations, we must rely on "old" values as they were before VSOP observations. This concerns mostly the spectral type and the binary status. The old spectral type is determined automatically by querying SIMBAD. To determine the binary status of a target, we query VizieR catalogues with the Multiple and Double stars, Spectroscopic, Cataclysmic and Eclipsing binary keywords to see if the star belongs to one of these classes. If the star belongs to a catalogue of spectroscopic binaries, or one providing orbits of stars, the star is said to be a binary. If the star belongs to a catalogue of visual binaries or of multiple and double stars, the star is suspected to be a visual binary. Finally, if it belongs only to the Tycho and Hipparcos catalogues, the suspected binary status is inferred from the MultFlag parameter of the catalogue. The procedure is quite complex, and is described on the wiki website (Binary Status page). We have tested various combinations of criteria, which seem to converge to relatively stable results. Our statistics are based on these tests.
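The decision cascade described above can be written down compactly. The sketch below is only a schematic reading of the text, not the procedure documented on the wiki: the class labels, the function name and the treatment of any non-empty MultFlag as "suspected binary" are assumptions, and the Cataclysmic and Eclipsing keywords are left unmapped because the text does not say how they enter the decision.

```python
def binary_status(catalogue_classes, mult_flag=None):
    """Schematic 'old' binary status from catalogue membership.

    catalogue_classes: labels (hypothetical names) of the kinds of catalogue
    in which the star was found. mult_flag: the Hipparcos/Tycho MultFlag
    value, or None/empty when no multiplicity is flagged (assumed convention).
    """
    classes = set(catalogue_classes)
    # Spectroscopic-binary or orbit catalogues: the star is said to be a binary.
    if classes & {"spectroscopic", "orbits"}:
        return "binary"
    # Visual-binary or multiple/double-star catalogues: suspected visual binary.
    if classes & {"visual", "multiple_double"}:
        return "suspected visual binary"
    # Otherwise fall back on the multiplicity flag alone.
    if mult_flag:
        return "suspected binary"
    return "no indication"


print(binary_status(["visual", "spectroscopic"]))
print(binary_status([], mult_flag="C"))
```

The order of the tests encodes the priority given in the text: firm spectroscopic or orbital evidence overrides visual-binary membership, which in turn overrides the catalogue flag.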
The main points of the VSOP observations can be summarized as follows:
This paper has presented the first data release from VSOP, covering ESO Period 77. Observations are ongoing in Periods 78 and 79, and data from these periods will be released as soon as the periods end.
Future space missions like COROT and GAIA will provide a wealth of data for variable stars that will be observed with unprecedented precision and sampling of their light curves. Nevertheless, situations of ambiguity in the determination of the variability type can arise. Also, as in the case of Hipparcos, it is expected that these data will lead to the discovery of a number of new classes of variables. For all of them, it will be necessary to refine their variability type and position in the HR diagram with spectroscopic data. This is what VSOP has been doing with objects of the GCVS, but the sheer number of stars provided by these missions will make classification by hand impossible.
For the next steps, the main aim of VSOP is to acquire and develop the necessary tools to provide good and reliable spectral and variability classification of stars automatically from the available data, either the spectrum, the light curve, color information, or combinations of these.
There have been several attempts in the past to mine large archives of data on variable objects, mainly based on photometric time series and, sometimes, on photometric colours as well. Good examples in the field of classification are the series of papers on MACHO, OGLE and ASAS data (for RR Lyrae stars, for example, see Alcock et al. (2003), Soszynski et al. (2003) and Wils et al. (2006), for MACHO, OGLE and the NSVS respectively), where catalogues of distinct variability classes are compiled according to several selection criteria (rules) and, on some occasions, with human intervention. Basically, the problem of supervised classification of variable objects can be described as that of defining general boundaries (hard or fuzzy) in the hyperspace of the features that describe the classes, based on a set of examples of each class.
The field of Machine Learning and Pattern Recognition offers a wealth of alternatives for defining more complex, flexible, and general boundaries than the hyperboxes used in the compilation of catalogues, minimizing at the same time the human intervention in the classification process. In this sense, the VSOP automatic variability classifier will build upon a previous effort (Debosscher et al. 2006; Sarro et al. 2006; Debosscher et al. 2007) carried out during the past few years to i) create a well defined training set of bona fide variable objects belonging to the most important and numerous classes, ii) analyse the most relevant and informative features that describe these classes and iii) study and compare different approaches to the task of classification, from Bayesian Networks (Pearl 1988) to Bayesian averages of artificial neural networks (Neal 1996) or SVMs (Support Vector Machines, Vapnik 1995). The development of this classifier was motivated by the wealth of data expected from the COROT space mission (Baglin et al. 2000) and was thus designed to facilitate the in-depth analysis of representative samples of these classes as observed by COROT. It produces probabilistic class assignments based on photometric time series parameters (harmonic amplitudes of component frequencies, phase differences, amplitude ratios, etc.). The objective is twofold: i) to generate class specific object lists for further analysis by COROT's Additional Program scientists and ii) to detect objects lying outside the known density distribution of objects/classes in the parameter space; these objects can possibly represent new astrophysical scenarios for variability. The classifier presented in Debosscher et al. (2007) has now been extended by the same authors to incorporate the photometric colours B-V, V-I, J-H, H-K and, for a reduced number of classes, also Strömgren indices (Sarro et al. 2007).
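To make the photometric time series parameters concrete, the sketch below derives harmonic amplitudes, an amplitude ratio and a phase difference from a light curve by a linear least-squares fit. It is an illustration under stated assumptions, not the classifier's actual feature extraction: the frequency is taken as already known (for example from a period search), only two harmonics are fitted, and the phase-difference convention phi_2 - 2 phi_1 is one common choice.

```python
import math


def solve(a, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def harmonic_features(times, mags, freq, n_harmonics=2):
    """Fit c + sum_k A_k sin(2 pi k f t + phi_k) and return derived features.

    Requires n_harmonics >= 2, since the ratio and phase difference use the
    first two harmonics.
    """
    def basis(t):
        row = [1.0]
        for k in range(1, n_harmonics + 1):
            arg = 2.0 * math.pi * k * freq * t
            row += [math.sin(arg), math.cos(arg)]
        return row

    rows = [basis(t) for t in times]
    size = 2 * n_harmonics + 1
    # Normal equations of the linear least-squares problem.
    normal = [[sum(r[i] * r[j] for r in rows) for j in range(size)]
              for i in range(size)]
    rhs = [sum(r[i] * y for r, y in zip(rows, mags)) for i in range(size)]
    coeff = solve(normal, rhs)
    # A sin(x + phi) = (A cos phi) sin x + (A sin phi) cos x.
    amps = [math.hypot(coeff[2 * k - 1], coeff[2 * k])
            for k in range(1, n_harmonics + 1)]
    phases = [math.atan2(coeff[2 * k], coeff[2 * k - 1])
              for k in range(1, n_harmonics + 1)]
    return {
        "amplitudes": amps,
        "amplitude_ratio_21": amps[1] / amps[0],
        "phase_difference_21": (phases[1] - 2.0 * phases[0]) % (2.0 * math.pi),
    }
```

The phase difference is built so that it does not depend on the arbitrary zero point of the time axis, which is what makes it usable as a classification feature across different stars.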
The effort described in the previous paragraphs is now being continued and adapted as part of the GAIA Data Processing and Analysis Consortium (Gilmore et al. 2000) to incorporate attributes that will be provided by GAIA instruments such as Blue and Red spectrophotometry or spectroscopy near the Calcium infrared triplet (Eyer 2006).
The VSOP automatic classifier is designed as a Virtual Observatory compliant service capable of producing probabilistic class assignments for objects with a wide variety of attributes available (from time series photometry to multi-wavelength spectra or photometric colours) and will thus represent the culmination of all the efforts on which it is based.
The development plan necessarily includes a first stage in which spectra of at least a representative sample of the COROT training set objects are obtained, so that spectral information can be incorporated into the classifier. This is where VSOP will initially play a major part, by collecting this dataset. Subsequently, a study will be conducted to determine an optimal subset of features providing the best classification performance. Obvious candidate features are line or band fluxes, equivalent widths, ratios, and/or combinations thereof, line asymmetry measures, and derived physical parameters ($T_{\rm eff}$, $\log g$, [Fe/H]). All of these will be subject to the statistical feature analysis that is standard in Machine Learning applications.
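One of the candidate features, the equivalent width, can be sketched in a few lines of Python. The sketch assumes that the spectrum and a continuum estimate are available on the same wavelength grid; the function name and the integration window arguments are hypothetical and do not describe the VSOP pipeline.

```python
def equivalent_width(wave, flux, continuum, lo, hi):
    """Equivalent width W = integral of (1 - F / F_c) d(lambda) over [lo, hi].

    Trapezoidal integration on the given wavelength grid; the result has
    the units of `wave` and is positive for an absorption line.
    """
    pts = [(w, 1.0 - f / c)
           for w, f, c in zip(wave, flux, continuum) if lo <= w <= hi]
    return sum(0.5 * (d0 + d1) * (w1 - w0)
               for (w0, d0), (w1, d1) in zip(pts, pts[1:]))
```

For a Gaussian absorption line of central depth d and width sigma on a flat continuum, the function should return a value close to d sigma sqrt(2 pi), which provides a simple check. Ratios of such widths or of band fluxes can be formed from the returned values in the same way.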
One obvious requirement of the classifier will be its capability for prediction based on incomplete data. This can happen if the spectral information covers only a fraction of the wavelength range used for training or if the resolution is too poor to separate several lines. Based on these specifications, several state-of-the-art machine learning algorithms will be applied to the training set and their performance assessed according to standard figures of merit such as the overall misclassification rate or the area under the receiver operating characteristic (ROC) curve (Fawcett 2003).
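The two figures of merit named above can be computed as in the following Python sketch. The function names and the toy inputs are hypothetical; the area under the ROC curve for one class against all others is obtained here through its equivalent rank formulation, i.e. the fraction of (member, non-member) pairs in which the member receives the higher score, with ties counted as one half.

```python
def misclassification_rate(true_labels, predicted_labels):
    """Fraction of objects whose predicted class differs from the true class."""
    wrong = sum(t != p for t, p in zip(true_labels, predicted_labels))
    return wrong / len(true_labels)

def roc_auc(is_member, scores):
    """Area under the ROC curve for one class versus the rest.

    `is_member` flags the true members of the class, `scores` are the
    class probabilities (or any monotonic score) assigned by the classifier.
    """
    pos = [s for m, s in zip(is_member, scores) if m]
    neg = [s for m, s in zip(is_member, scores) if not m]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy example with made-up labels and scores.
print(misclassification_rate(["RR", "DSCT", "RR"], ["RR", "RR", "RR"]))
print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

The pairwise loop scales with the product of the two sample sizes, which is acceptable for a sketch; a sort-based rank sum would be preferable for large training sets.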
While these efforts are already under way in the context of the space missions, the VSOP team is actively taking part in these new developments towards fully automatic classification.
Although it is growing rapidly, VSOP is flexible enough to allow us to envisage a long-term future. VSOP is a project implemented only by astronomers, using the MediaWiki software. With a simple, though large, collection of scripts developed over the year of operation, in addition to the existing spectrograph pipelines, VSOP is a quasi-automatic spectrum production machine whose results are automatically available through a wiki website. As emphasized before, the spectral and variability analysis will also become automatic in the future. The wiki provides a clean, scriptable interface that requires minimal human intervention, while centralizing all the work done by the team. In this sense, it combines the advantages of a website with automatically generated content (homogeneity, reliability, cleanness) with total flexibility for the contributors to customize a specific point, and for the public to have access to the data, the information and the history. From that perspective, we could imagine a fully automatic wiki-database accepting pipeline-reduced spectra from any observatory in the world, not necessarily dedicated to variable stars.
This larger vision of complete automation while retaining absolute flexibility is at the core of the VSOP future. As a first step in this direction, we have included the released data in Wikimbad (http://wikimbad.org) as well as directly on our own VSOP server.
We believe in open sharing of information on all levels, and we believe this can be accomplished without sacrificing individual scientific ambitions by basing our collaboration on team-work and the drive for fast scientific turnover. We would like to conclude with an open invitation to participate in VSOP and its mission, either as part of the VSOP team, or independently through the freely accessible data releases.
Acknowledgements
This research has made extensive use of the Simbad and VizieR databases, and their XML interface, operated at CDS, Strasbourg, France. We are very grateful to O. Hainaut for suggesting the project name, and to the referee for constructive comments that improved the paper. M.F. was supported by the Spanish grants AYA2004-05395 and AYA2004-21521-E. Supported by the Gemini Observatory, which is operated by the Association of Universities for Research in Astronomy, Inc., on behalf of the international Gemini partnership of Argentina, Australia, Brazil, Canada, Chile, the UK, and the United States of America.