Issue |
A&A
Volume 620, December 2018
Gaia Data Release 2
|
|
---|---|---|
Article Number | A197 | |
Number of page(s) | 16 | |
Section | Catalogs and data | |
DOI | https://doi.org/10.1051/0004-6361/201833357 | |
Published online | 17 December 2018 |
Gaia Data Release 2
Short-timescale variability processing and analysis
1
Département d’Astronomie, Université de Genève,
Chemin des Maillettes 51,
1290
Versoix,
Switzerland
e-mail: maroussia.roelens@unige.ch
2
Département d’Astronomie, Université de Genève,
Chemin d’Ecogia 16,
1290
Versoix,
Switzerland
3
SixSq,
Rue du Bois-du-Lan 8,
1217
Meyrin,
Switzerland
4
GÉPI, Observatoire de Paris, Université PSL, CNRS,
Place Jules Janssen 5,
92190
Meudon,
France
5
Institute of Astronomy, University of Cambridge,
Madingley Road,
Cambridge CB3 0HA,
UK
6
Harvard-Smithsonian Center for Astrophysics,
60 Garden Street,
Cambridge,
MA
02138,
USA
7
Department of Astrophysics / IMAPP, Radboud University,
PO Box 9010,
6500GL,
Nijmegen,
The Netherlands
Received:
4
May
2018
Accepted:
14
August
2018
Aims. We describe the methods used and the analysis performed in the frame of the Gaia data processing activities to produce the Gaia Data Release 2 (DR2) sample candidates with short-timescale variability together with associated parameters.
Methods. The Gaia DR2 sample of candidates with short-timescale variability results from the investigation of the first 22 months of Gaia G per-CCD, GBP, and GRP photometry for a subsample of sources at the Gaia faint end (G ~ 16.5−20 mag). For this first short-timescale variability search exploiting Gaia data, we limited ourselves to the case of suspected rapid periodic variability. Our study combines fast-variability detection through variogram analysis, a high-frequency search by means of least-squares periodograms, and an empirical selection based on the investigation of specific sources seen through the Gaia eyes (e.g., known variables or visually identified objects with peculiar features in their light curves). The progressive definition, improvement, and validation of this selection criterion also benefited from supplementary ground-based photometric monitoring of a few tens of preliminary candidates with short-timescale variability, performed at the Flemish Mercator telescope in La Palma (Canary Islands, Spain) between August and November 2017.
Results. As part of Gaia DR2, we publish a list of 3018 candidates with short-timescale variability, spread throughout the sky, with a false-positive rate of up to 10–20% in the Magellanic Clouds, and a more significant but justifiable contamination from longer-period variables between 19% and 50%, depending on the area of the sky. Although its completeness is limited to about 0.05%, this first sample of Gaia short-timescale variables recovers some very interesting known short-period variables, such as post-common envelope binaries or cataclysmic variables, and brings to light some fascinating, newly discovered variable sources. In the perspective of future Gaia data releases, several improvements of the short-timescale variability processing are considered, by enhancing the existing variogram and period-search algorithms or by classifying the identified variability candidates. Nonetheless, the encouraging outcome of our Gaia DR2 analysis demonstrates the power of this mission for such fast-variability studies, and opens great perspectives for this domain of astrophysics.
Key words: stars: variables: general / astronomical databases: miscellaneous / methods: data analysis / techniques: photometric / surveys
© ESO 2018
1 Introduction
Gaia is a cornerstone mission in the science program of the European Space Agency (ESA). Launched in December 2013 for a five-year nominal mission duration, the main aim of this astrometric successor of the HIPPARCOS ESA mission is to determine highly accurate positions, parallaxes, and proper motions for more than one billion stars in the Milky Way. Gaia is expected to observe objects in our Galaxy and beyond, spread throughout the sky, providing precise astrometry at the 10–20 μ arcsec level, photometry and spectrophotometry down to G ≈ 20.7 mag (where G is the Gaia broad-band white-light magnitude) with standard errors down to a few millimagnitudes (mmag) for bright sources, and medium-resolution spectroscopy down to G ≈ 17 mag (Gaia Collaboration 2016a, 2016b).
The second Gaia Data Release (hereafter Gaia DR2)1, published on April 25, 2018, is based on the data collected during the first 22 months of the Gaia nominal mission (Gaia Collaboration 2018). Among all the analyses performed for this data release, the Gaia DR2 variability processing, described in Holl et al. (2018), resulted in the publication of more than 500 000 variable sources, with associated light curves and additional variability parameters when appropriate, belonging to diverse variability classes from BY Draconis candidates to long-period variables. The publication also includes more than 3000 candidates with short-timescale variability, that is to say, sources showing photometric variabilitywith characteristic timescales from a few tens of seconds to a dozen hours. Various astronomical sources are known to exhibit such fast variability, including periodic and non-periodic phenomena, and they are spread throughout the Hertzsprung–Russell (HR; Russell 1914) diagram. The amplitudes for types with variabiltiy on such short timescales can rank from a few millimagnitudes to a few magnitudes. The diverse phenomena at the origin of the variability reach from stellar pulsations to binarity and eruptions. Hence, an improved knowledge and understanding of short-timescale variables can bring invaluable clues into several fields of astrophysics. Until now, the discovered number of such short-timescale variables remains quite modest compared to other types of longer-term variability (such as Mira or Cepheid stars). This is directly linked to the observational constraints when dealing with fast variability, both in terms of time sampling and photometric precision. Nonetheless, since the early 2000s, high-cadence photometric monitoring surveys in space (e.g., Kepler; Borucki et al. 2010) and from ground (e.g., the Optical Gravitational Lensing Experiment; Udalski et al. 1992) allowed a significant advance in the rather unexplored domain of short-timescale variability. Moreover, during the past decade, some projects specifically dedicated to the detection and characterisation of short-timescale variables arose, like the Rapid Temporal Survey (RATS; Barclay et al. 2011) or the OmegaWhite survey (Macfarlane et al. 2015; Toma et al. 2016).
In this context, Gaia offers a unique opportunity for comprehensive, fast-variability studies over the whole sky. Its peculiar time sampling in G band, involving fast cadences from a few seconds to a few hours (de Bruijne 2012), and its high photometric precision (Evans et al. 2018)enable us to probe the short-timescale variability domain, down to timescales of a few tens of seconds, including low-amplitude phenomena.
In this paper, we present the Gaia DR2 catalogue of 3018 candidates with short-timescale variability. This is published as part of the Gaia DR2 archive2.
In Sect. 2 we detail the algorithms and specific metrics we used to detect short-timescale variability and select bona fide candidates. Section 3.1 summarises the various filtering steps and selection criteria we applied to retrieve the final 3018 candidates with short-timescale variability that is suspected to be periodic, published in Gaia DR2. In Sect. 4 we present some statistical and astrophysical properties of our set of candidates with short timescale variability, together with a few specific interesting examples. Finally, Sect. 5 sums up the Gaia DR2 short-timescale variability processing and results.
2 Detection and characterisation of candidates with short-timescale variability
In this section, we present the Gaia DR2 short-timescale variability processing and analysis as part of the Gaia Data Processing and Analysis Consortium (DPAC) activities, which resulted in the identification of 3018 candidates with short-timescale variability. For this first publication of Gaia short-timescale variables, we limited ourselves to a selection of sources with suspected periodic variability as a test case for probing the efficiency of the foreseen Gaia short-timescale variability processing (Eyer et al. 2017). We restricted the study to variability phenomena that can be confirmed with a sufficient level of confidence. An analysis of transient variability will be included in the Gaia short-timescale variability treatment for future data releases. Nevertheless, parallel to this work, Wevers et al. (2018) presented specific methods and preliminary results for an investigation of transients with Gaia.
2.1 Input data
The Gaia short-timescale variability analysis is mostly based on Gaia per-CCD photometric time series in G band after cleaning spurious values and outlier points (Eyer et al. 2017; Holl et al. 2018). We recall that on-board G measurements occur in groups of (most often) nine CCD observations (one every 4.85 s), one such group being referred to as a field-of-view (FoV) transit. Often, two or more FoV transits are repeated, with time intervals of 1h 46min or 4h 14min between successive FoVs, following the Gaia scanning law (de Bruijne 2012).
It is important to mention that the use of G per-CCD photometry is meant as a preliminary exercise, and that the analysed G per-CCD light curves are not made available to the scientific community as part of Gaia DR2 because the publication of these data is planned for the final release of the nominal mission3.
At thisstage of the Gaia processing, only a set of selected G per-CCD light curves is investigated for short-timescale variability. For each source observed by Gaia, a statistical test is performed to determine whether the scatter in the G per-CCD points of each FoV transit is greater than a specific significance level. If more than half of the FoV transits for that source show “variability” according to this criterion, it is further analysed.
Finally, in addition to these G per-CCD time series, the Gaia short-timescale analysis also made use of the filtered magnitude time series in G FoV (i.e., averaging the G CCD measurements within one FoV transit), GBP, and GRP as describedin Eyer et al. (2017) and in Holl et al. (2018), benefiting from all the time-series cleaning operators developed for the global variability processing.
We emphasise that for Gaia DR2, we did not aim for completeness with the published sample of sources with short timescale variability, first because of the selection of analysed per-CCD time series, but also because of the intermediate status of this photometry both in terms of time span and calibration. Our initial goal was rather to provide a significant sample of at least a few thousand bona fide candidates. We are aware that we probably missed a significant fraction of the true short-timescale variables observed by Gaia, and we expect to reach completeness at the end of nominal mission with the whole 5-yr time span per-CCD photometry for all the scanned sources.
2.2 Variogram analysis
2.2.1 Principle
Roelens et al. (2017) predicted the potential of the variogram approach in the Gaia context for short-timescale variability analysis by means of end-of-mission light-curve simulation, and by adopting a specifically tailored variogram formulation and detection threshold definition. This preliminary work was the cornerstone of the Gaia DR2 search for short-timescale candidates, where the variogram method applied to Gaia G per-CCD time series was used to preselect candidates with fast variability prior to further characterisation.
We recall that the idea of the variogram method is to quantify the magnitude variations between photometric measurements as a function of the time lag between them. The variogram value for a time lag h is noted γ(h). For a light curve with magnitudes (mi) observed at times (ti) for i = 1...n, γ(h) is calculated as a function of the magnitude differences mj − mi on all the pairs (i, j) such that ∣tj − ti ∣ = h ± ϵh, with ϵh the tolerance accepted for grouping the pairs by time lag. By exploring different time-lag values, defined by the time sampling, it is then possible to build a variogram plot (hereafter referred to as a variogram, see e.g., Fig. 1), which provides information on how variable the considered source is, and it informs on the variability characteristics if appropriate.
After variograms have been obtained for all the investigated sources, candidates with short-timescale variability are identified as those with a variability that is significant enough (i.e., whose variogram values are high enough), at time lags shorter than 12 hr, in other words, verifying (1)
where the detection threshold γdet defines the lowest variability level required to consider that variations are not only due to noise. τdet is the shortest lag explored for which γ ≥ γdet (if any), and it helps focusing on identified events with the fastest variability. This principle of the variogram detection is illustrated in Fig. 1.
In the Gaia context, we explored time lags involved in G per-CCD time series, that is, the time intervals between the different CCDs within a single FoV (4.85, 9.7 s, etc., up to 38.8 s), and the time intervals between the different FoVs (1 h 46 min, 4 h 14 min, 6 h, 7 h 46 min, etc.), up to h ≈ 1.5 d.
Various variogram definitions can be found in the literature, the one adopted in Roelens et al. (2017) being an empirical weighted formulation using uncertainties on G per-CCD measurements. However, for the Gaia DR2 exercise, the estimation of the uncertainty for per-CCD photometry was not accurate enough to use this variogram definition. Consequently, we implemented an alternative formulation based on the inter-quartile range (IQR) estimation (see e.g., MacLeod et al. 2012), (2)
which is more robust than the classical unweighted variogram (defined e.g., in Hughes et al. 1992) to possibly remaining outliers, but is less time-consuming than the robust variogram described in Eyer & Genton (1999).
Finally, because the data we analysed to produce the Gaia DR2 sample of candidates with short-timescale variability covered a time span of only 22 months instead of 5 yr, we limited our analysis to sources with more than 20 available G FoV transits, which ensured that we had enough data points for the variogram analysis to be reliable.
Fig. 1 Principle of identifying candidates with short-timescale variability from a variogram analysis, illustrated with the variogram of a simulated transient event with an amplitude of 0.9 mag and a duration of 2 h (derived from Roelens et al. 2017). |
2.2.2 Defining the detection threshold
As in Roelens et al. (2017), we used a magnitude-dependent detection threshold to take into account the magnitude dependency of photometric errors in Gaia G band (Evans et al. 2017): γdet = γdet(ḠCCD), where ḠCCD is the mean of G per-CCD magnitudes of the considered source.
To adopt the appropriate γdet definition, we performed the IQR-based variogram analysis on a subsample of known OGLE sources, crossmatched with objects observed by Gaia, for which per-CCD photometry involving more than 20 FoV transits was available. This working sample contains 7419 and 380 periodic variables from the OGLE III (Udalski et al. 2008) and OGLE IV (Udalski et al. 2015) catalogues, respectively, as well as 459 “constant” stars from OGLE IV, that is to say, sources with the smallest variations in both V and I bands (Eyer et al. 2017). The OGLE periods (POGLE) of the OGLE periodic variables in this data set rank from 28 min to 10 000 d.
Figure 2 shows max(γ) as a function of ḠCCD for each of these crossmatched OGLE sources. The grey line in Fig. 2 corresponds to γdet,simu, the magnitude-dependent detection threshold definition for the IQR-based variogram formulation, derived from the simulated Gaia-like 5-yr time-span light curves of Roelens et al. (2017). Clearly, γdet,simu is not adapted to real Gaia data, first because the time span is 22 months instead of the 5 yr in the simulations (i.e., it has fewer data points per source), and then because of the intermediate photometric calibration. However, by simply scaling γdet,simu by a factor of 10 (the brown line in Fig. 2), it is possible to efficiently separate constant OGLE sources from sources that are periodically variable. Additionally, with γdet = 10γdet,simu, the periodic variables that are not detected are either long-period variables (Fig. 2, panel a) or low-amplitude sources (Fig. 2, panel b).
When we applied the short-timescale detection criterion of Eq. (1) with γdet = 10γdet,simu to our OGLE working sample, we recovered about 48% of the short-period sources (i.e., with POGLE ≤ 0.5 d). The contamination level from false positives (namely constant sources flagged as short-timescale variables) was 2%, and contamination was at about 20% from variable sources with periods longer than 1 d. Sources with periods between 0.5 and 1 d are at the limit of our definition of short-timescale variability, but are still of interest and can be accepted as “extended” short-timescale variables.
The relatively low recovery rate of short-period variables results from the fact that most of such variables in the OGLE test sample have amplitudes below 0.1 − 0.2 mag, which means that they are at the limit of what can be detected according to Fig. 2 (panel b). Moreover, because fewer data points are available in each per-CCD light curve than what is expected at the end-of-nominal mission, it is not always possible to form pairs of measurements and calculate variogram values for all the lags that correspond to the inter-FoV time intervals, where detection should be triggered for the short periods involved in our test sample (typically POGLE between half an hour and 1 d). Although contamination from longer-period sources is quite high, it is not a great problem. The vast majority of longer-period variables that are flagged as short-timescale candidates with our variogram criterion have periods no longer than a few days, amplitudes greater than a few tenths of magnitudes, and can exhibit steep variations. Consequently, their global variation rate is sufficient to justify their detection at short timescales, even though they are not short-period variables per se.
Since the OGLE crossmatched sample we used to build the relevant variogram detection criterion contains only objects with a G magnitude between about 16.5 and 20 mag, we limited our analysis to this magnitude range, where the γdet definition has been validated and tested.
Fig. 2 Maximum variogram value as a function of the mean G CCD magnitude for known constant and variable sources from the OGLE survey as observed by Gaia. The grey line shows the detection threshold derived from simulated Gaia-like light curves (see Roelens et al. 2017). The brown line corresponds to the same threshold definition scaled by a factor of 10. Panel a: colour-coded by period POGLE from the OGLE catalogue, and panel b: colour-coded by IQR from Gaia per-CCD G photometry. It appears that with this detection threshold γdet = 10γdet,simu, panel a: majority of constant stars and a significant fraction of longer-period variables are not detected as candidates with short-timescale variability when the variogram is used, and panel b: variable sources that are not detected with the variogram approach are low-amplitude sources. |
2.2.3 Application to real Gaia data
For our global search for short-timescale variability, we applied the short-timescale detection criterion detailed above to all the Gaia sources for which per-CCD photometry was available, with more than 20 FoV transits, and with a mean G magnitude of between 16.5 and 20 mag. This represents a working sample of about 5.6 million sources that are to be investigated for short-timescale variability. From these 5.6 million sources, a more complete crossmatch with various variable star catalogues from the literature, not only OGLE, but also Catalina (Drake et al. 2014a,b), LINEAR (Palaversa et al. 2013), Kepler (Debosscher et al. 2011), EROS2 (Kim et al. 2014), or the Half-Million quasar catalogue (HMQ; Flesch 2015), resulted in a set of 4747 known variables, including 439 with PLit ≤ 0.5 d (with PLit the literature period), 382 with 0.5 > PLit ≤ 1 d, 1574 with PLit > 1 d, and 1658 non-periodic variable sources. The remaining sources are variables of periodic type, but no information on their period is available from the considered catalogue.
Of the 5.6 million sources processed with the variogram analysis, 3.9 million sources were flagged as short-timescale candidates, which is a huge fraction of the analysed sample and may question the reliability of our variogram approach in this context. However, this unexpectedly high fraction of candidates with short-timescale variability is a direct consequenceof the pre-selection of the objects for which per-CCD data have been analysed. Since these sources are considered as likely to show fast variability according to the criterion described in Sect. 2.1, having about 70% of the investigated sources flagged as candidates with short-timescale variability simply means that the variogram detection criterion is coherent with that selection. From this list of 3.9 million candidates, we recovered 2892 of the crossmatched sources from the catalogues mentioned above, including 356 with PLit ≤ 0.5 d, 280 with 0.5 < PLit ≤ 1 d, 738 with PLit > 1 d, and 1051 non-periodic variable sources. We used these 2892 sources as a reference set to assess and improve the efficiency of our approach for finding short-timescale variables with suspected periods (see Sects. 2.3 and 3). From now on, this is referred to as the reference crossmatched sample.
2.3 High-frequency search
For each candidate whose short-timescale variability was identified by the variogram analysis (Sect. 2.2), we additionally performed a high-frequency search to further characterise the candidates with suspected variability and to help distinguishing periodic variations from transient events. We decided to explore frequencies between 1 and 144 d−1 (i.e., periods between 10 min and 1 d), which roughly corresponds to the period range represented in our reference crossmatched sample. It thus enables us to assess the quality of the retrieved period search results from Gaia. Several periodogram-based methods have been considered for this purpose: the Deeming periodogram (Deeming 1975), the Lomb–Scargle periodogram (Scargle 1982), the least-squares periodogram described by Zechmeister & Kürster (2009), the string length method (Lafler & Kinman 1965), and the phase-dispersion minimization (PDM) approach (Schwarzenberg-Czerny 1989). They were all tested on a set of 112 well-characterised short-period sources (i.e., with PLit ≤ 1 d) from the reference crossmatched sample. We applied the considered period search algorithms on both G per-CCD and G FoV light curves to determine which gave the best period-recovery rate. The Gaia period from the short-timescale analysis, hereafter PGaia, for a given source, is defined as the inverse of the frequency fGaia of the highest peak in the corresponding frequencygram. For each method, we checked the fraction of sources for which we had PGaia ≈ PLit ± 10% or PGaia ≈ PLit∕2 ± 10% (as it is often the case for eclipsing binary systems). The resulting numbers are summarised in Table 1. The least-squares periodogram appeared to be the best alternative, whether applied to G per-CCD or to G FoV light curves. Since the final aim of the Gaia short-timescale variability module is to search for variability at timescales down to a few tens of seconds, which can be probed only using per-CCD photometry, and even if such very short periods were not really explored in that exercise, we decided to use G per-CCD time series.
Figure 3 shows PGaia for all the periodic sources in the reference crossmatch sample described at the end of Sect. 2.2 as function of their literature period PLit. The period recovery with the least-squares method applied to Gaia G CCD photometry is not exceptional for this set of objects: of 1374 crossmatched variables that are flagged as short-timescale variables with external period information, 636 have PLit ≤ 1 d, and for only 45 of them does the Gaia short-timescale period recover PLit by 10%. For 104 of them, mostly eclipsing binaries, the Gaia short-timescale period recovers PLit ∕2 by 10%.
The horizontal trends in PGaia, visible at the shortest periods retrieved from Gaia data, correspond to aliases due to the 6 h rotation period of the Gaia satellite, that is, to frequencies of 4 d−1 and its multiples, a phenomenon that was expected, as described in Eyer et al. (2017).
For each source flagged as a candidate for short-timescale variability and analysed for high-frequency search, we retrieved the false-alarm probability (FAP) of the most pro-eminent peak in the least-squares periodogram (Zechmeister & Kürster 2009), that is, the FAP associated to fGaia. Figure 4 shows the distribution of this FAP for the reference crossmatched sample, distinguishing non-periodic variables, short-period variables, and longer-period variables. Although sources in the first category cannot be strictly separated from the others on the basis of the FAP alone, non-periodic variables tend to have higher FAP values, by definition of the FAP. Hence, rejecting candidates with FAP values greater than 10−30, for instance, should help eliminating a significant fraction of the known non-periodic variables of the sample without loosing too many short-period sources. The overall approach adopted to focus on the suspected periodic variability is described in more detail in Sect. 3.1.
From this analysis, it was clear that the published PGaia values should be taken with caution, and that they are more indicative than really accurate. As described previously, at this point, period-search results are strongly affected by aliasing issues, which represents a major axis of improvement for the future Gaia data releases. Moreover, we targeted many different variable types (e.g., ZZ Ceti stars, AM CVn stars, δ Scuti stars etc.) that showed various light-curve shapes, and we kept in mind the possibility of observing unknown short-timescale variable types yet to be discovered. In contrast to other variability processing modules, such as those dedicated to analyses of RR Lyrae and Cepheids (Clementini et al. 2019), we consequently did not perform any further light-curve modelling to improve the accuracy of the period we found, which partly explains the moderate short-period recovery rate.
Summary of the period recovery results for various period-searchmethods, tested on a set of selected well-characterised short-period variables from the literature.
Fig. 3 Gaia short-timescale period as a function of the literature period for the crossmatched periodic sources flagged as short-timescale candidates from the variogram analysis. The red and grey dashed lines correspond to PGaia = PLit and PGaia = PLit∕2, respectively.In total, the literature period or half of the literature period is recovered only for 23% of the crossmatched sources with PLit ≤ 1 d. |
Fig. 4 False-alarm probability distribution from the least-squares period search for the 2892 crossmatched known variable sources flagged as short-timescale candidates from the variogram analysis. |
2.4 Other statistics and parameters
In addition to the variogram (Sect. 2.2) and frequency-search (Sect. 2.3) analysis, we calculated a series of statistics to characterise and identify short-timescale candidates with a suspected periodic variability (see Sect. 3).
For each short-timescale candidate, the amplitude estimate AG-CCD was defined as the difference between the 95th and the 5th quantiles of G per-CCD time series.
We also calculates the mean Abbe value per-transit, which is defined as follows:
are the per-CCD magnitudes for one transit, n is the number of per-CCD measurements in the transit, σ2 is the unbiased variance on the magnitudes of the transit, and Ntransits is the number of transits for the considered source. The idea of mean Abbe value per transit is to spot sources in which several transits exhibit a smooth and significant variability at the level of one transit, that is, over a timescale of about 40 s. Such transits are expected to have lower values than purely noisy transits, and thus have lower . However, this statistics was not really exploited in Gaia DR2.
For each analysed source, we defined the median variogram ratio as the median of its variogram values for lags shorter than 40 s divided by the median of its variogram values for lags longer than 40 s. This parameter quantifies how flat or how “step-shaped” the variogram is when its values at shorter lags are compared to values at longer lags. As detailed in Roelens et al. (2017), the typical variogram plot for periodic sources is expected to first show a plateau at the shortest lags, then an increase, and oscillations for lags increasing towards and beyond the variation period. For periodic variables with periods between 10 min and 1 d, the variogram plot is therefore expected to be relatively flat for lags at CCD level (i.e., from 4.85 to 40 s), and to show oscillations at higher values for lags at the FoV level, typically between 1 h 46 min and 1.5 d, because depending on the variability period, the oscillations starts at different lags. For these sources, the median variogram ratio is therefore expeted to be relatively low. Conversely, variograms derived for fast transient events (e.g. flares) should be quite flat except a local increase resulting from the flare. Variograms for longer-period variables (e.g., periods longer than 5 or 10 d) are likely to be flat as well, and possibly show an increase that starts at the longest explored lags. For both transient and longer-period variables, the median variogram ratio is then expected to be higher than for short-period variables. This means that the median variogram ratio is a useful metric to distinguish short-period variability from other variability features (see Sect. 3.1).
Finally, our short-timescale analysis process made use of some of the classical statistical parameters produced by the statistics module of Gaia variability processing (Eyer et al. 2017), typically the IQR and the Abbe value on the times series in the three photometric bands (G, GBP, and GRP), as well as the Spearman correlation between the G and GBP bands and betweenthe G and GRP bands, noted rG,BP and rG,RP, respectively (Sects. 3.1 and 3.4).
3 Selection of bona fide candidates with short-timescale variability
For this first release of short-timescale variables in Gaia, we focused on candidates with short-timescale variability, whose variations are suspected to be periodic as a test case. In the following sections, we describe theprocess for selecting the sources that had strong indications in favour of periodicity of the 3.9 million candidates flagged from the variogram analysis (see Sect. 2.2), and we also report on the validation of the criteria we used. These two aspects involved investigating the photometry for known variable sources in the candidate sample (presented in Sect. 2.2 and used in Sect. 2.3), as well as visual inspection of some candidates’ light curves. In parallel, photometric monitoring of a few preliminary candidates crucially helped building the final criterion we used to select the published candidates of Gaia DR2. The whole selection process is summarised in Table 2.
Summary of the step-by-step selection criteria for candidates with a suspected periodic short-timescale variability.
3.1 Selection of candidates with suspected periods
The main idea for distinguishing suspected periodic variables of all the short-timescale candidates identified with the variogram was to define different cuts in the various statistics described in Sect. 2.4, so as to exclude phenomena that we were not interested in for this first exercise. The choice of such validity intervals for short-period variability relied essentially on the analysis of the reference crossmatch sample of 2892 known variables, the idea being to limit contamination from non-periodic and longer-period variables as much as possible within the final list of candidates. The definition of the cuts to adopt was based on simple histograms of the different statistical parameters available, similar to Figure 4, comparing the distributions for non-periodic variables, short-period variables, and longer-period variables. One very important point of our selection strategy is that we required the variability seen in G band to be confirmed by consistent features in GBP and/or GRP bands for considering it as relevant.
After several iterations and tests, we decided to keep in our list of candidates with suspected periodic and short-timescale variability only those sources that met the following statements:
More than 18 observations in GBP and GRP. Even though we required the G time series to contain at least 20 FoV, the GBP or GRP time series can sometimes contain fewer points after cleaning. We therefore imposed a lower limit on the number of observations in these two bands to ensure that we had enough information to confirm the variability.
An Abbe value on the G FoV time series . As explained in Mowlavi (2014), low Abbe values are expected to correspond to transient variability, not to periodic variations.
-
FAP ≤ 10−30. By definition, and as shown in Sect. 2.3, the FAP should be higher for non-periodic variables.
-
Median variogram ratio ≤ 0.26. As detailed in Sect. 2.4, this value is expected to be lower for short-period sources than for longer-period or transient variables.
-
0.32 ≤IQRXP∕IQRG ≤ 3.2, where XP stands for BP or RP.
-
Spearman correlation coefficients rG,BP ≥ 0 and rG,RP ≥ 0.
The IQR ratio criterion is a consistency check, ensuring the coherence between behaviours in G, GBP, and GRP, which was confirmed to be necessary in the study of the IQR ratio distributions for the reference crossmatch sample (Fig. 5). Even though the investigation of known variables showed that the amplitudes in three bands can be different, we chose to focus on the cases without a strong discrepancy between the G, GBP, and GRP IQRs. It appearedthat for this data set, the statistical behaviour of the GBP and GRP bands was different. However, we did not wish to bias our analysis towards a certain type of known variables (e.g., RR Lyrae stars or eclipsing binaries, which represent the majority of the sources in the crossmatch sample). In particular, we aimed to avoid missing any less well-known or unknown short-period variable types that could be observed by Gaia. Hence, we kept a symmetric criterion for both GBP and GRP photometry.
Similarly, the last cut on the correlation values between the G and GBP/GRP bands is a first (and not very strict) constraint to ensure that the variability phenomena observed in G are compatible with the GBP and/or GRP time series. This criterion has been tightened afterwards (see Sect. 3.4).
When this selection criterion was applied to the reference crossmatched sample, we kept 303 sources out of 2892 as suspected periodic short-timescale variables, including 127 with PLit ≤ 0.5 d, 74 with 0.5 < PLit ≤ 1 d, 88 with PLit > 1 d, and 3 non-periodic sources. The remaining 11 objects are sources whose type is compatible with periodic variability, for which no period information is available, however. For periodic variables with PLit ≤ 1 d, the recovery rate was at about 32% with respect to the known short-period variables that are flagged as short-timescale candidates from the variogram (36% for PLit ≤ 0.5 d, 26% for 0.5 < PLit ≤ 1 d), and at about 18% when compared to the processed known short-period variables (19% for PLit ≤ 0.5 d, 16% for 0.5 < PLit ≤ 1 d). The contamination rate was about 1% from non-periodic sources and 30% from longer-period variables. However, the latter corresponds mostly to variables with periods between 1 and 5 d and with amplitudes of a few tenths of magnitudes, whose detection at the short-timescale level is justified, as discussed previously. Finally, when we applied our selection criteria for candidates with suspected periodic variability to the 3.9 million short-timescale candidates from the variogram analysis, we obtained a preliminary sample of 16 703 short-timescale candidates with suspected periodic variability.
Fig. 5 Distribution of the IQR ratios between G and GBP band (panel a) and between G and GRP bands (panel b) for the 2892 crossmatched known variable sources flagged as short-timescale candidates from the variogram analysis. |
3.2 Ground-based follow-up of some preliminary candidates
At that point of our analyis, we also benefited from some supplementary ground-based observations of a few tens of candidates, chosen from the listof preliminary short-timescale variables described in Sect. 3.1, so as to confirm or invalidate their variability at the expected timescale. This photometric monitoring was performed between August and November 2017 at the 1.2 m Mercator Flemish Telescope at the Spanish Observatory Roque de Los Muchachos (La Palma, Canary Islands, Spain) using the Mercator Advanced Imager for Asteroseismology (MAIA; Raskin et al. 2013). The data were reduced using the ePipe photometric reduction pipeline developed by Sergi Blanco-Cuaresma at the Geneva Observatory (see Roelens et al. 2016). We emphasize that the goal of this follow-up campaign was not to better characterise the identified candidates with short-timescale variability, for example, by refining their period. By doing so, we would have taken advantage of our privileged access to Gaia data to do early science, which would be in total disagreement with the Gaia DPAC rules. Our aim was first of all to verify for each of the monitored sources whether the light curve as observed from the ground exhibited some variability that might be compatible with the features observed by Gaia, particularly in terms of amplitude, detection timescale, and period involved to ensure that the detection at short-timescale level was justified. The idea here was to evaluate the overall quality of the preliminary sample of candidates with short-timescale variability to verify whether further filtering was required prior to publication as part of Gaia DR2.
Taking into account the MAIA instrument capacities, we chose the sources that were to be monitored at the bright side of the preliminary sample of short-timescale candidates (G ~ 16.5–17 mag), with Gaia amplitudes of a few tenths of magnitudes and suspected periods from a few tens of minutes to a few hours. Accordingly, and after several tests, it appeared that the observing scheme that was best suited for validation given these characteristics involves a follow-up of about 1 h–1 h30 min, continuously in one single night, with exposure times of 120 s. This limits the follow-up to variability that is observed at the level of tens of minutes. Unfortunately, with shorter exposure times, which would have enabled probing faster variability phenomena, the quality of the retrieved photometry was not high enough to determine the presence or absence of variations.
In the end, a total of 25 preliminary candidates with short-timescale variability, either chosen to probe the reliability of specific phenomena visible in Gaia photometry or randomly chosen, were photometrically monitored from the ground. For one of them, the variability inferred from Gaia data was confirmed at the expected timescale, that is to say, what was observed from ground was compatible with both τdet and PGaia. Nine were confirmed at timescales longer than expected, that is, with a variability compatible with τdet, but likely at a period longer than PGaia, typically a few hours instead of a few tens of minutes. Six of them could neither be confirmed nor rejected based on their ground photometry because of the relatively poor data quality resulting from bad weather. The ground-based photometry of three of the followed candidates was contaminated by bright and/or close neighbours. For six of them, the variability seen in Gaia was proved to be spurious, as no corresponding feature was visible in MAIA measurements. The outcome of our follow-up campaign therefore was to show the necessity for further filtering and refinement of our selection criteria, as described in Sects. 3.3 and 3.4.
3.3 Environment filtering
Further investigation of some of the preliminary variable candidates we selected (Sect. 3.1) showed that the Gaia short-timescale variabiliy analysis can be significantly affected by contamination of photometry that is due to the environment of the considered sources across the sky. We therefore decided to perform some filtering of our candidate list at that stage, based on the projected vicinity of selected sources on the celestial sphere, similarly to what was done in Wevers et al. (2018).
First, we removed from our preliminary sample of 16 703 candidates all the sources that were not dominant in their immediate neighbourhood, that is, objects that were not the brightest in G band, by at least 1 mag, within a radius of 1 arcsecond (arcsec). Non-dominant sources can be confused with their neighbour(s) during the reconstruction of Gaia photometry, leading to an artificial magnitude change caused by bad source identification. Figure 6 shows an example candidate of a short-timescale variablity with a suspected period as it remained after the selection of Sect. 3.1, with a contaminated light curve as described. The measured magnitudes seem to alternate between two discrete levels. In this case, the considered source of mean magnitude G = 18.42 mag has a neighbour with mean G = 18.38 mag at a distance of 0.38 arcsec. Based on the corresponding Gaia light curves, they are likely two sources whose real G magnitudes are rather around 17.9 and 18.6 mag. The mistake in the mean G that is effectively measured is induced by the mixing in their photometry.
Additionally, we excluded candidates with a neighbouring object within a radius of 30 arcsec that had G ≤ 12 mag because of the a high probability that it might be contaminated by the brighter object. Here we were slightly more restrictive than Wevers et al. (2018), since we used a limiting radius of 30 arcsec instead of 10 arcsec. This decision was motivated by some of the Mercator supplementary photometric observations performed during the Gaia DR2 variability processing, which showed that this radius needed to be extended in the frame of our analysis. Figure 7 shows an example of a probably spurious short-period candidate, for which ground-based follow-up revealed no significant variation at the expected timescale. When we inspected the corresponding MAIA images (Fig. 7, panel b), the considered object has two neighbours of magnitude G ~ 10 mag at distances of 20 and 26 arcsec, respectively. Although a slight variation of about 0.1 mag is visible in the MAIA R differential-magnitude light curve (Fig. 7, panel c), it remains within the uncertainties on the measurements, and it is much smaller than the expected amplitude from the Gaia G light-curve (Fig. 7, panel a) at the expected timescale (τdet = 1 h 46 min, PGaia = 36 min). This favours the hypothesis of spurious variability that is due to contamination.
Fig. 6 Example of preliminary candidates for short-timescale, suspected periodic variability whose G light curve is contaminated by a nearby star. The difference of magnitude between them is smaller than 1 mag. “Time” is expressed in BJD in TBC − 2455197.5 d. |
Fig. 7 Example of preliminary candidates for short-timescale and suspected periodic variability whose G light curve iscontaminated by two nearby bright stars. Panel a: Gaia G per-CCD phase-folded light curve. Panel b: excerpt of animage in the R band from the MAIA photometer (Raskin et al. 2013); the targeted candidate is encircled in magenta. Panel c: MAIA differential photometry light-curve in R band obtained with ePipe. |
3.4 Removing other spurious variability
By visually inspecting randomly selected light curves of the 16 703 preliminary candidates with short-timescale variability and suspected periodicity, we found several sources whose photometry globally brightens in the G band, but fades in the GBP and GRP bands (see Fig. 8). Such unlikely behaviour may result from photometric contamination or calibration issues. In the frame ofour analysis, we removed these types of spurious candidates by mean of cuts on the skewness value on light curves in the three Gaia photometric bands. In this way, all sources with were excluded to eliminate relatively flat G light curves with a few significant flares. We also removed candidates that were strongly skewed in one direction in G and were strongly skewed in the other direction in both GBP and GRP, that is, those that met the removal condition (3)
In the meantime, this phenomenon has been investigated by the Gaia photometry team. The sources found with these features apparently have a nearby star (at a distance of about 1–2 arcsec) that is not necessarily bright nor of similar magnitude. This star is sometimes inside of or at the edge of the window assigned to the target for photometry integration (the Gaia windowing scheme is described; e.g., in Gaia Collaboration 2016b), and sometimes it is not. The presence or absence of the neighbouring source in the photometric window causes the brighter and fainter measurements, respectively. The dimensions of the window used for GBP/GRP spectrophotometry are different from those of the window used for G band, which explains that all three bands do not necessarily brighten or fade at the same time. More appropriate and specific treatment of such contaminated transits will be implemented in the future, enabling a transit cleaning at the time-series level rather than a rejection of the candidates, as has been done for this data release.
Figure 9 represents the Spearman correlation between the G and GRP time series versus the Spearman correlation between the G and GBP time series for the 16 703 preliminary candidates with short-timescale and suspected periodic variability, in particular highlighting the 303 crossmatched known variables within this sample. The preliminary cuts on the Spearman correlation values (Sect. 3.1) cause the absence of candidates with negative correlations. For a significant fraction of sources in the preliminary sample, the correlations between G and both GBP and GRP were quite low. Whereas for 80% of the known crossmatched variables with PLit ≤ 1 d of the set of considered candidates one of these two correlation values was higher than 0.45 and the other was higher than 0.35, only 27% of the whole preliminary sample met this condition. This discrepancy caused us to question the cases of candidates with relatively low correlations: are they reliable, or should we eliminate them? When we visually inspected the Gaia light curves of a handful of these specific sources, we realised that although the time-series filtering is performed upstream, a few important outliers remained in the light curves. This induced spuriously low (or high) correlation values. Consequently, we applied an additional time-series cleaning operator to the GXP time series, based on the expected amplitude of variation (if really present) in G as defined in Sect. 2.4. This operator, specific to short-timescale variability processing, removes all the points in the GXP time series that lie farther away from the GXP median magnitude than , where is a proxy for the expected variation amplitude in GXP according to what is seen in G band. Figure 10 shows the Spearman correlation distribution for the preliminary sample, after recalculating correlation values with the additionally cleaned XP time series. Since some preliminary candidates had negative correlation values after this data filtering step, it seemed clear that the cut on correlations made in Sect. 3.1 was not sufficient. Again, building the appropriate correlation selection criterion relied on visual inspection of some low-correlation sources (with the new correlation calculation), ground-based photometric follow-up, and investigation of the crossmatched known variable sources in the sample. In the end, we decided to keep only candidates for which one of their Spearman correlation values (rG-BP or rG-RP) was higher than 0.45 and the other value higher than 0.35.
Fig. 8 Example of a preliminary candidate with a short-timescale variability and a suspected period that shows anti-correlated global behaviours in G and in GBP and GRP. Gaia light curves in G CCD (panel a), G FoV (panel b), GBP (panel c), and GRP (panel d) bands. “Time” is expressed in BJD in TCB − 2455197.5 d. |
Fig. 9 Comparison of Spearman correlations of G vs. GBP and G vs. GRP for the sample of 16 703 preliminary candidates with short-timescale and suspected periodic variability. The filled circles represent the 303 crossmatched known variables in this sample and are colour-coded according to the period listed for them in the literature. |
Fig. 10 Same as Fig. 9, but with recalculated correlations with the additionally cleaned GBP and GRP time series. |
3.5 Removing eclipsing binaries
As part of the Gaia variability processing, a specific analysis dedicated to the identification and characterisation of eclipsing binary stars has been performed as a test case, based on the variable classification described in Rimoldini et al. (in prep.). However, it has been decided to postpone the publication of the resulting candidate eclipsing binaries from all the Gaia DR2 variability products, since reporting of new eclipsing binaries discovered from Gaia data is planned only for Data Release 3 and onwards4. Hence, the candidate eclipsing binaries identified have been removed from most of the other Gaia DR2 variability candidate lists.
In the case of the short-timescale variability analysis, 623 candidate eclipsing binaries were found to overlap our preliminary sample of 16703 candidates. They were therefore excluded prior to Gaia DR2 publication.
4 Published sample of short-timescale candidates in Gaia Data Release 2
After applying all the selection criteria described in Sect. 3, we obtained a final list of 3018 sources with short-timescale and suspected periodic variability that were published in Gaia DR2. This final sample includes 138 known variables from the reference crossmatched source list presented in Sect 2.2: 71 with PLit ≤ 0.5 d, 32 with 0.5 < PLit ≤ 1 d, 27 with PLit > 1 d, and 8 with no PLit information from the literature, whose type is compatible with short-timescale variability, however. None of the constant sources and non-periodic variables from the reference crossmatched sample remains in the final list of short-timescale candidates. The completeness of our sample is about 12% of the 439 + 382 short-period variables in the input sample of 5.6 million sources (see Sect. 2.2). Based on this crossmatch, the contamination would be assessed to lie at about 19%, but consisting of longer-period variability alone.
To proceed in the completeness and contamination analysis, we decided to focus on areas covered by both Gaia and OGLE, thus restricted essentially to the region of the Magellanic Clouds, and to compare the variability results for these two surveys. According to the OGLE III and IV catalogues of variable stars, 45 966 sources are either in the Large Magellanic Cloud (LMC) or in the Small Magellanic Cloud (SMC), and have periods PLit ≤ 1 d. Only 24 of them are part of the published short-timescale sample, which means that the global completeness in this area is as low as 0.05%. We recall that completeness was not the main goal of the Gaia DR2 short-timescale analysis; it is expected to be significantly improved in further data releases.
In total, 48 OGLE variables in the Magellanic Clouds were identified as short-timescale candidates with suspected periods (24 with POGLE ≤ 1 d, 24 with POGLE > 1 d). This results in a contamination from longer-period variables of 50%, although these longer-period variables have periods shorter than 7 d (and mostly shorter than 2 d) and amplitudes of between 0.13 and 0.98 mag, which means that they belong in the category of justified and acceptable longer-period contamination.
To determine whether longer-period variables were the only source of contamination remaining in the Gaia DR2 sample of short-timescale variables, we crossmatched it with the OGLE II photometric database, which covers parts of the LMC, and compared the I-band OGLE light curves to the Gaia G light curves for a few tens of sources in common. If the features seen by Gaia are reproduced and compatible with the OGLE II photometry, then the variability is confirmed. Otherwise, the short-timescale candidate is considered as spurious. Based on this analysis, we obtained a contamination level from spurious variables of between 10% and 20% in the LMC. This region is quite dense in sources, and is therefore more likely to be affected by contamination from neighbouring stars than the Galactic halo, for example. This contamination is therefore likely an upper limit for the whole sample contamination.
Returning to the 138 crossmatched sources, it appears that with our short-timescale analysis, we recovered very interesting known variables, such as some post-common envelope binaries (PCEB) or cataclysmic variables (CV). The first striking example of a known PCEB within the short-timescale candidate list we wish to highlight is NN Ser, an eclipsing system whose orbital period is 3.12 h (Haefner 1989). This system is also known to be orbited by two candidate exoplanets (Beuermann et al. 2010; the presence of the exoplanets is inferred from variations in the eclipse timings). The Gaia DR2 source_id of NN Ser is 1191504471436192512. We investigated it because we were curious to see if we could detect evidence of the eclipse of the white dwarf by the secondary star in this well-known binary. The short period found in Gaia short-timescale analysis exactly recovers the period from the literature. Figure 11 shows that NN Ser has one strongly fading FoV transit in its G CCD light curve, losing more than 1 mag over 40 s. According to the ephemeris from Beuermann et al. (2010), this transit rather corresponds to an eclipse of the binary system than to a transit of the known planets. However, we emphasise that this tremendous fading transit is removed from the cleaned G FoV time series because of its relatively high G magnitude uncertainty. This demonstrates the high relevance and necessity of an available G per-CCD time series at the end of Gaia mission search for short-timescale variability.
Figures 12 and 13 represent the phase-folded Gaia G FoV light curve and variogram, as well as the corresponding phase-folded Catalina V light curve from the literature5, for a known PCEB (CSS J210017.4−141125, hereafter CSS J210017) and a known AM Her variable star (CSS J231330.8+165416, hereafter CSS J231330), respectively. The corresponding Gaia source identifiers are 6888269309535155456 and 2818311909906928384. The Gaia and literature periods are quite similar for both sources (for CSS J210017 PLit = 0.14503 s and PGaia = 0.14503 d, for CSS J231330 PLit = 0.05670 d and PGaia = 0.05659 d), and their phase-folded Gaia light curves are convincing and coherent in the G, GBP, and GRP bands. The Gaia variogram exhibits variations that are compatible with the periods we found. It is very interesting to see that in the case of CSS J210017, the eclipse is already sampled and visible despite the sparse scanning law and limited time span of the analysed data.
The fields of the vari_short_timescale table published in the Gaia DR2 archive6 are the following:
solution_id: a numeric field that unequivocally identifies the version of all subsystems and input data used to produce the table content,
source_id: a unique numeric field identifying the source within all Gaia products,
amplitude_estimate: estimate of the amplitude of the variation from per-CCD time series, AG-CCD, as defined in Sect. 2.4,
number_of_fov_transits: the number of transits for the considered sources with more than 7 points in it (it can then be smaller than 20, although we work on sources with more than 20 FoVs),
-
mean_of_fov_abbe_values: the mean Abbe value per transit, , as defined in Sect. 2.4,
-
variogram_num_points: the number of points constituting the variogram, that is, the number of explored lags (here it is always 26),
-
variogram_char_timescales: the variogram characteristic timescale(s) extracted for the source; by now, the only characteristic timescale retrieved is the detection timescale τdet as defined in Sect. 2.2,
-
variogram_values: the variogram value(s) associated with the characteristic timescale(s); here it is simply the variogram value corresponding to the detection timescale,
-
frequency: the frequency fGaia resultingfrom a high-frequency search as defined in Sect. 2.3.
As detailed in Roelens et al. (2017), by quantifying the averaged variation rate of the considered light curve, the variogram detection timescale and associated variogram value give clues for future ground-based follow-up of the short-timescale candidates published in Gaia DR2. For example, the CV CSS J231330 has τdet = 19.4 s and γ(τdet) = 0.00337 mag2, which means that if the photometric instrument used for follow-up has an accuracy of about 55 mmag, then the observing cadence for detecting the variability should be as short as 20 s.
Figure 14 represents the sky density map of the 3,018 published short-timescale candidates of Gaia DR2. The majority of the candidates are close to the Galactic plane, with the expected lack of objects with more than 20 FoV transits around the Galactic centre. The sources found in the halo globally follow the Gaia scanning law (Holl et al. 2018). We also see slight overdensities in the southern hemisphere, corresponding to the LMC and SMC.
The frequency – amplitude diagram (Fig. 15) shows that our final sample of candidates with short-timescale variability includes high-amplitude as well as low-amplitude variables, down to about 0.1 mag in G band. We note that similarly to Fig. 3, aliasing features are clearly visible in this diagram, particularly at higher frequencies/shorter periods. However, as detailed in Sect. 2.3, we are confident that although they may not be short-period variables per se, these candidates with aliased periods are reliable periodic variable sources and have an averaged magnitude variation rate that is sufficient to justify their detection at the short-timescale level. Figure 16 shows an example of a candidate with short-timescale variability from our list (source_id 6234022782497834624), with a relatively low amplitude (around 0.12 mag) and a period of 19 min. Although the period may be spurious, the variogram clearly suggests a periodicity at timescales of a few hours, the variations in all three Gaia bands are coherent, and even the phase-folded light curves look quite convincing.
Figure 17 shows the HR diagram of 59 of the 3,018 published candidates whose astrometry, photometry, and parallax estimates are good enough, according to the selection criterion of Gaia Collaboration (2019), to be safely positioned in this picture. We note that no correction for extinction is applied in this plot. Some known variables among those 59 sources are also indicated. About 8 of these candidates fall on the main sequence, a few lie on the white dwarf sequence, but the majority are in between, in a region that is normally sparsely populated. However, we see that several known variables end in this area of the HR diagram, typically CV, white dwarf – main-sequence binaries, and novae. This indicates that our candidates are probably some sort of extreme binary systems, involving main-sequence stars and degenerate or semi-degenerate companions.
Among the bluer and brighter short-timescale candidates in the same HR diagram region as the PCEB CSS J210017 (purple diamond in Fig. 17), we find a very good example of a possibly unknown PCEB (source_id 5646693014160460416) with a period of 2.7 h (Fig. 18).
By visually inspecting the light curves of all the 3,018 short-timescale candidates, we also spotted some peculiar and interesting candidates with short-timescale variability that are new discoveries, as far as we know. In particular, we wish to highlight the case of source 5637827617537477504, represented in Fig. 19. It exhibits very strong eclipses of more than 1− 1.5 mag in the G, GBP, and GRP bands and shows a significant out-of-eclipse variability, with an overall shape similar to what is expected for AM CVn stars, for instance. However, the period from the short-timescale analysis is 3.4 h, which is longer than the orbital periods of known AM CVn stars, which are between 5 and 65 min (see e.g. Levitan et al. 2015). Further investigation and modelling is required to better understand and characterise this curious system.
Fig. 11 Phase-folded Gaia G CCD light curve of the PCEB NN Ser (panel a), and zoom onthe fading transit that is visible at phase around 0.3 (panel b). “Times” are expressed in BJD in TCB − 2455197.5 d. The reference time used to fold the Gaia light curve is 1738.448 d (in BJD in TCB − 2455197.5 d). |
Fig. 12 Phase-folded light curves and variogram of the PCEB CSS J210017. Catalina V light curve phase-folded with the period from Drake et al. (2014b; panel a), Gaia G FoV (panel b), GBP (panel c), and GRP (panel d) light curves phase-folded with the period from the short-timescale analysis; and Gaia variogram from the short-timescale analysis (panel e). The orange dashed line indicates the Gaia short-timescale period, and the purple star shows the variogram point that triggered the detection. Different reference times have been used to phase-fold the Gaia and Catalina light curves. The reference time used to fold the Gaia light curves is 1757.338 d (in BJD in TCB − 2455197.5 d). |
Fig. 13 Same as Fig. 12 for the CV of type AM Her CSS J231330, but this time with PLit from PTF Collaboration et al. (2014). The reference time used to fold the Gaia light curves is 1675.660 d (in BJD in TCB − 2455197.5 d). |
Fig. 14 Sky density map of the 3018 published short-timescale candidates, in galactic coordinates. The imprint of the Gaia scanning law and the effect of selecting sources with more than 20 FoV transits is clearly visible (see Holl et al. 2018) |
Fig. 15 Frequency – amplitude diagram for the 3018 published short-timescale candidates. Known variables from the reference crossmatch catalogues are indicated by colour-coded filled circles. |
Fig. 16 Phase-folded light curves in the G FoV (panel a), GBP (panel b), GRP (panel c), and variogram (panel d) of one of the low-amplitude and short-period short-timescale candidates. Phase-folding is done using the Gaia short-timescale period PGaia. The reference time used to fold the Gaia light curves is 1708.972 d (in BJD in TCB − 2455197.5 d). |
Fig. 17 Observational HR diagram, without correcting for extinction, of 59 of the 3018 published short-timescale candidates, which are those with the most reliable astrometry and photometry information (blue filled circles). The filled circles of other colours represent the sources of the 59 candidates with short-timescale variability whose type is known in the literature. The candidate selection of those with a “good” parallax follows the criteria described in Gaia Collaboration (2019), with a relative parallax precision better than 20%. The grey background shows a subset of sources that lie closer than 200 pc in the HR diagram of Gaia Collaboration (2019). |
Fig. 18 Same as Fig. 16 for a blue and bright source, a possible PCEB, of the short-timescale candidates. The reference time used to fold the Gaia light curves is 1764.737 d (in BJD in TCB − 2455197.5 d). |
Fig. 19 Same as Fig. 18 for a curious eclipsing binary among the short-timescale candidates. The reference time used to fold the Gaia light curves is 1765.980 d (in BJD in TCB − 2455197.5 d). |
5 Conclusion
By combining the variogram analysis, the least-squares high-frequency search, and selection criteria based on various metrics, involving Gaia G and GBP/GRP photometry, we identified a first Gaia set of 3,018 candidates with short-timescale and suspected periodic variability. The completeness of this sample is assessed to be about 0.05% of all the known short-period variables scanned by Gaia during its first 22 months of science operations (regardless of any selection based on per-CCD data or on variogram analysis), and to be about 12% of the input sample (i.e. compared to the known short-period variables that have been processed by the short-timescale module). The contamination from false positives and non-periodic variable sources can be as high as 10-20% in denser regions. Contamination from longer-period variables is more significant (about 20-50%), but this can be justified as this contamination comes from variable objects with periods of a few days and amplitudes that are significant enough for a detection to be triggered at short timescales.
Owing to limited period-recovery capabilities when compared to the literature, the period information provided as part of the Gaia DR2 analysis results of short-timescale variables must be used with caution, and is rather communicated for indicative purposes. For the upcoming Gaia data releases, we plan to improve the high-frequency search method, so as to better handle the aliasing problem and obtain a significantly higher period recovery rate. In this perspective, implementing the estimation of typical timescale(s) from the variogram analysis, as described in Roelens et al. (2017), would be a real asset that would add information complementary to frequency-search results.
This Gaia DR2 short-timescale sample is one of the first lists of such variable candidates resulting from a global, comprehensive search for any fast periodic variability over a large fraction of the sky. Even by analysing only the first 22 months of intermediate Gaia per-CCD photometry in G band, we obtained promising results with the recovery and discovery of very interesting candidates with short-timescale variability, which shows the great potential of the Gaia mission for fast-variability studies.
As explained throughout this paper, the aim of this analysis was not to reach a high level of completeness nor to describe theretrieved candidates with a very high level of detail, but more to open a new door on the rather unexplored domain of fast astronomical variability, encouraging further follow-up and characterisation of the identified sources of interest. For the next Gaia data releases, our goal is not only to extend the list of published candidates, widening the explored magnitude range, and benefitting from the improved photometric calibration and longer time-span of the processed data (hence with more sources having a sufficient number of transits for variogram investigation), but also to proceed beyond in the analysis by classifying the detected candidates based on their magnitude, colour, astrometry, and any relevant information from Gaia products (e.g., astrophysical parameter estimations).
Acknowledgements
This work was supported by the Swiss National Science Foundation (fns 200020_166230/OB5276). It has made use of data from the ESA space mission Gaia, processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC has been provided by national institutions, some of which participate in the Gaia Multilateral Agreement, which include, for Switzerland, the Swiss State Secretariat for Education, Research and Innovation through the ESA PRODEX program, the “Mesures d’accompagnement”, the “Activités Nationales Complémentaires”, the Swiss National Science Foundation, and the Early Postdoc. Mobility fellowship; Belgium, the BELgian federal Science Policy Office (BELSPO) through PRODEX grants; for Italy, Istituto Nazionale di Astrofisica (INAF) and the Agenzia Spaziale Italiana (ASI) through grants I/037/08/0, I/058/10/0, 2014-025-R.0, and 2014-025-R.1.2015 to INAF (PI M.G. Lattanzi); for France, the Centre National d’Etudes Spatiales (CNES). This paper is partly based on observations made with the Mercator Telescope, operated on the island of La Palma by the Flemish Community, at the Spanish Observatorio del Roque de los Muchachos of the Instituto de Astrofísica de Canarias. In particular, it makes use of observations obtained with the MAIA camera, which was built by the Institute of Astronomy of KU Leuven, Belgium, thanks to funding from the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013)/ERC grant agreement no 227224 (PROSPERITY, PI: Conny Aerts) and from the Research Foundation - Flanders (FWO) grant agreement G.0410.09. The CCDs of MAIA were developed by e2v in the framework of the ESA Eddington space mission project; they were offered by ESA on permanent loan to KU Leuven. We gratefully acknowledge Mark Taylor for creating and implementing new requested features in the astronomy-oriented data handling and visualisation software TOPCAT (Taylor 2005).
Appendix A Catalogue retrieval
We summarise below the ADQL queries to be used in the web interface to the Gaia DR2 archive7 for retrieving the Gaia DR2 candidates with short-timescale variability, their attributes, and their light curves.
Retrieving the attributes of the candidates with short-timescale variability:
SELECT sts.*
FROM gaiadr2.vari_short_timescale AS sts
Retrieving the light curves of the short-timescale candidates. The following query retrieves the URL at which the photometry of an individual Gaia source, in the example with source_id 5637827617537477504, can be downloaded:
SELECT gaia.epoch_photometry_url
FROM gaiadr2.gaia_source AS gaia
INNER JOIN gaiadr2.vari_short_timescale AS sts
ON gaia.source_id = sts.source_id
AND sts.source_id = 5637827617537477504
Retrieving the statistics on the Gaia photometric light curves (from the vari_time_series_statistics table), for example, the mean, median, or the IQR of G FoV, GBP, and GRP magnitudes, for all the DR2 candiates with short-timescale variability, together with their short-timescale attributes:
SELECT stat.mean_mag_g_fov, stat.median_mag_g_fov,
stat.iqr_mag_g_fov, stat.mean_mag_bp,
stat.median_mag_bp, stat.iqr_mag_bp,
stat.mean_mag_rp, stat.median_mag_rp,
stat.iqr_mag_rp, sts.*
FROM gaiadr2.vari_time_series_statistics AS stat
INNER JOIN gaiadr2.vari_short_timescale AS sts
ON stat.source_id = sts.source_id
Retrieving the coordinates, parallaxes, and attributes for the candidates with short-timescale variability:
SELECT gaia.source_id, ra, dec, parallax,
parallax_error, sts.*
FROM gaiadr2.gaia_source AS gaia
INNER JOIN gaiadr2.vari_short_timescale AS sts
ON gaia.source_id = sts.source_id
References
- Barclay, T., Ramsay, G., Hakala, P., et al. 2011, MNRAS, 413, 2696 [NASA ADS] [CrossRef] [Google Scholar]
- Beuermann, K., Hessman, F. V., Dreizler, S., et al. 2010, A&A, 521, L60 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
- Clementini, G., Ripepi, V., Molinaro, R., et al. 2019, A&A, in press, DOI: 10.1051/0004-6361/201833374 (Gaia 2 SI) [Google Scholar]
- Debosscher, J., Blomme, J., Aerts, C., & De Ridder, J. 2011, A&A, 529, A89 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- de Bruijne, J. H. J. 2012, Ap&SS, 341, 31 [NASA ADS] [CrossRef] [Google Scholar]
- Deeming, T. J. 1975, Ap&SS, 36, 137 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
- Drake, A. J., Gänsicke, B. T., Djorgovski, S. G., et al. 2014a, MNRAS, 441, 1186 [NASA ADS] [CrossRef] [Google Scholar]
- Drake, A. J., Graham, M. J., Djorgovski, S. G., et al. 2014b, ApJS, 213, 9 [NASA ADS] [CrossRef] [Google Scholar]
- Evans, D. W., Riello, M., De Angeli, F., et al. 2017, A&A, 600, A51 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Evans, D. W., Riello, M., De Angeli, F., et al. 2018, A&A, 616, A4 (Gaia 2 SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Eyer, L., & Genton, M. G. 1999, A&AS, 136, 421 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Eyer, L., Mowlavi, N., Evans, D. W., et al. 2017, A&A, submitted [arXiv:1702.03295] [Google Scholar]
- Flesch, E. W. 2015, PASA, 32, e010 [NASA ADS] [CrossRef] [Google Scholar]
- Gaia Collaboration (Brown, A. G. A., et al.) 2016a, A&A, 595, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Prusti, T., et al.) 2016b, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 (Gaia 2 SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Eyer, L., et al.) 2019, A&A, in press, DOI: 10.1051/0004-6361/201833304 (Gaia 2 SI) [Google Scholar]
- Haefner, R. 1989, A&A, 213, L15 [NASA ADS] [Google Scholar]
- Holl, B., Audard, M., Nienartowicz, K., et al. 2018, A&A, 618, A30 (Gaia 2 SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Hughes, P. A., Aller, H. D., & Aller, M. F. 1992, ApJ, 396, 469 [NASA ADS] [CrossRef] [Google Scholar]
- Kim, D.-W., Protopapas, P., Bailer-Jones, C. A. L., et al. 2014, A&A, 566, A43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Lafler, J., & Kinman, T. D. 1965, ApJS, 11, 216 [NASA ADS] [CrossRef] [Google Scholar]
- Levitan, D., Groot, P. J., Prince, T. A., et al. 2015, MNRAS, 446, 391 [NASA ADS] [CrossRef] [Google Scholar]
- Macfarlane, S. A., Toma, R., Ramsay, G., et al. 2015, MNRAS, 454, 507 [NASA ADS] [CrossRef] [Google Scholar]
- MacLeod, C. L., Ivezić, Ž., Sesar, B., et al. 2012, ApJ, 753, 106 [NASA ADS] [CrossRef] [Google Scholar]
- Mowlavi, N. 2014, A&A, 568, A78 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Palaversa, L., Ivezić, Ž., Eyer, L., et al. 2013, AJ, 146, 101 [CrossRef] [Google Scholar]
- PTF Collaboration (Margon, B., et al.) 2014, in Stellar Novae: Past and Future Decades, eds. P. A. Woudt & V. A. R. M. Ribeiro, ASP Conf. Ser., 490, 389 [NASA ADS] [Google Scholar]
- Raskin, G., Bloemen, S., Morren, J., et al. 2013, A&A, 559, A26 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Roelens, M., Blanco-Cuaresma, S., Eyer, L., et al. 2016, in 19th Cambridge Workshop on Cool Stars, Stellar Systems, and the Sun (CS19), 121 [Google Scholar]
- Roelens, M., Eyer, L., Mowlavi, N., et al. 2017, MNRAS, 472, 3230 [NASA ADS] [CrossRef] [Google Scholar]
- Russell, H. N. 1914, Pop. Astron., 22, 275 [NASA ADS] [Google Scholar]
- Scargle, J. D. 1982, ApJ, 263, 835 [NASA ADS] [CrossRef] [Google Scholar]
- Schwarzenberg-Czerny, A. 1989, MNRAS, 241, 153 [NASA ADS] [CrossRef] [Google Scholar]
- Taylor, M. B. 2005, in Astronomical Data Analysis Software and Systems XIV, eds. P. Shopbell, M. Britton, & R. Ebert, ASP Conf. Ser., 347, 29 [Google Scholar]
- Toma, R., Ramsay, G., Macfarlane, S., et al. 2016, MNRAS, 463, 1099 [NASA ADS] [CrossRef] [Google Scholar]
- Udalski, A., Szymanski, M., Kaluzny, J., Kubiak, M., & Mateo, M. 1992, Acta Astron., 42, 253 [NASA ADS] [Google Scholar]
- Udalski, A., Szymanski, M. K., Soszynski, I., & Poleski, R. 2008, Acta Astron., 58, 69 [NASA ADS] [Google Scholar]
- Udalski, A., Szymański, M. K., & Szymański, G. 2015, Acta Astron., 65, 1 [NASA ADS] [Google Scholar]
- Wevers, T., Jonker, P. G., Hodgkin, S. T., et al. 2018, MNRAS, 473, 3854 [NASA ADS] [CrossRef] [Google Scholar]
- Zechmeister, M., & Kürster, M. 2009, A&A, 496, 577 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
For more details on the Gaia DR2 contents, see https://www.cosmos.esa.int/web/Gaia/dr2.
For more information on the data release scenario, see https://www.cosmos.esa.int/web/Gaia/release.
For more information on the Gaia Data Release scenario, see https://www.cosmos.esa.int/web/Gaia/release.
All Tables
Summary of the period recovery results for various period-searchmethods, tested on a set of selected well-characterised short-period variables from the literature.
Summary of the step-by-step selection criteria for candidates with a suspected periodic short-timescale variability.
All Figures
Fig. 1 Principle of identifying candidates with short-timescale variability from a variogram analysis, illustrated with the variogram of a simulated transient event with an amplitude of 0.9 mag and a duration of 2 h (derived from Roelens et al. 2017). |
|
In the text |
Fig. 2 Maximum variogram value as a function of the mean G CCD magnitude for known constant and variable sources from the OGLE survey as observed by Gaia. The grey line shows the detection threshold derived from simulated Gaia-like light curves (see Roelens et al. 2017). The brown line corresponds to the same threshold definition scaled by a factor of 10. Panel a: colour-coded by period POGLE from the OGLE catalogue, and panel b: colour-coded by IQR from Gaia per-CCD G photometry. It appears that with this detection threshold γdet = 10γdet,simu, panel a: majority of constant stars and a significant fraction of longer-period variables are not detected as candidates with short-timescale variability when the variogram is used, and panel b: variable sources that are not detected with the variogram approach are low-amplitude sources. |
|
In the text |
Fig. 3 Gaia short-timescale period as a function of the literature period for the crossmatched periodic sources flagged as short-timescale candidates from the variogram analysis. The red and grey dashed lines correspond to PGaia = PLit and PGaia = PLit∕2, respectively.In total, the literature period or half of the literature period is recovered only for 23% of the crossmatched sources with PLit ≤ 1 d. |
|
In the text |
Fig. 4 False-alarm probability distribution from the least-squares period search for the 2892 crossmatched known variable sources flagged as short-timescale candidates from the variogram analysis. |
|
In the text |
Fig. 5 Distribution of the IQR ratios between G and GBP band (panel a) and between G and GRP bands (panel b) for the 2892 crossmatched known variable sources flagged as short-timescale candidates from the variogram analysis. |
|
In the text |
Fig. 6 Example of preliminary candidates for short-timescale, suspected periodic variability whose G light curve is contaminated by a nearby star. The difference of magnitude between them is smaller than 1 mag. “Time” is expressed in BJD in TBC − 2455197.5 d. |
|
In the text |
Fig. 7 Example of preliminary candidates for short-timescale and suspected periodic variability whose G light curve iscontaminated by two nearby bright stars. Panel a: Gaia G per-CCD phase-folded light curve. Panel b: excerpt of animage in the R band from the MAIA photometer (Raskin et al. 2013); the targeted candidate is encircled in magenta. Panel c: MAIA differential photometry light-curve in R band obtained with ePipe. |
|
In the text |
Fig. 8 Example of a preliminary candidate with a short-timescale variability and a suspected period that shows anti-correlated global behaviours in G and in GBP and GRP. Gaia light curves in G CCD (panel a), G FoV (panel b), GBP (panel c), and GRP (panel d) bands. “Time” is expressed in BJD in TCB − 2455197.5 d. |
|
In the text |
Fig. 9 Comparison of Spearman correlations of G vs. GBP and G vs. GRP for the sample of 16 703 preliminary candidates with short-timescale and suspected periodic variability. The filled circles represent the 303 crossmatched known variables in this sample and are colour-coded according to the period listed for them in the literature. |
|
In the text |
Fig. 10 Same as Fig. 9, but with recalculated correlations with the additionally cleaned GBP and GRP time series. |
|
In the text |
Fig. 11 Phase-folded Gaia G CCD light curve of the PCEB NN Ser (panel a), and zoom onthe fading transit that is visible at phase around 0.3 (panel b). “Times” are expressed in BJD in TCB − 2455197.5 d. The reference time used to fold the Gaia light curve is 1738.448 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Fig. 12 Phase-folded light curves and variogram of the PCEB CSS J210017. Catalina V light curve phase-folded with the period from Drake et al. (2014b; panel a), Gaia G FoV (panel b), GBP (panel c), and GRP (panel d) light curves phase-folded with the period from the short-timescale analysis; and Gaia variogram from the short-timescale analysis (panel e). The orange dashed line indicates the Gaia short-timescale period, and the purple star shows the variogram point that triggered the detection. Different reference times have been used to phase-fold the Gaia and Catalina light curves. The reference time used to fold the Gaia light curves is 1757.338 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Fig. 13 Same as Fig. 12 for the CV of type AM Her CSS J231330, but this time with PLit from PTF Collaboration et al. (2014). The reference time used to fold the Gaia light curves is 1675.660 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Fig. 14 Sky density map of the 3018 published short-timescale candidates, in galactic coordinates. The imprint of the Gaia scanning law and the effect of selecting sources with more than 20 FoV transits is clearly visible (see Holl et al. 2018) |
|
In the text |
Fig. 15 Frequency – amplitude diagram for the 3018 published short-timescale candidates. Known variables from the reference crossmatch catalogues are indicated by colour-coded filled circles. |
|
In the text |
Fig. 16 Phase-folded light curves in the G FoV (panel a), GBP (panel b), GRP (panel c), and variogram (panel d) of one of the low-amplitude and short-period short-timescale candidates. Phase-folding is done using the Gaia short-timescale period PGaia. The reference time used to fold the Gaia light curves is 1708.972 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Fig. 17 Observational HR diagram, without correcting for extinction, of 59 of the 3018 published short-timescale candidates, which are those with the most reliable astrometry and photometry information (blue filled circles). The filled circles of other colours represent the sources of the 59 candidates with short-timescale variability whose type is known in the literature. The candidate selection of those with a “good” parallax follows the criteria described in Gaia Collaboration (2019), with a relative parallax precision better than 20%. The grey background shows a subset of sources that lie closer than 200 pc in the HR diagram of Gaia Collaboration (2019). |
|
In the text |
Fig. 18 Same as Fig. 16 for a blue and bright source, a possible PCEB, of the short-timescale candidates. The reference time used to fold the Gaia light curves is 1764.737 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Fig. 19 Same as Fig. 18 for a curious eclipsing binary among the short-timescale candidates. The reference time used to fold the Gaia light curves is 1765.980 d (in BJD in TCB − 2455197.5 d). |
|
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.