A&A 437, 355-368 (2005)
DOI: 10.1051/0004-6361:20042334
C. Moutou1 - F. Pont1,5 - P. Barge1 - S. Aigrain2 - M. Auvergne3 -
D. Blouin1 - R. Cautain1 - A. R. Erikson6 - V. Guis1 -
P. Guterman1,7 - M. Irwin2 - A. F. Lanza4 - D. Queloz5 - H. Rauer6 -
H. Voss6 - S. Zucker5,![]()
1 - LAM, Traverse du Siphon, BP 8, Les Trois Lucs, 13376 Marseille Cedex 12, France
2 -
Institute of Astronomy (IoA), University of Cambridge,
Madingley Road, Cambridge CB3 0HA, UK
3 -
OPM, Place J. Janssen, 92195 Meudon Cedex, France
4 -
INAF - Osservatorio Astrofisico di Catania, via S. Sofia, 78, 95123
Catania, Italy
5 -
Observatoire de Genève, 51 Chemin des Maillettes, 1290 Sauverny,
Switzerland
6 -
DLR Institute of Planetary Research, Rutherfordstr. 2, 12489 Berlin, Germany
7 -
Gemplus Card International, La Ciotat, France
Received 8 November 2004 / Accepted 16 February 2005
Abstract
Because photometric surveys of exoplanet transits are very promising sources of future
discoveries, many algorithms are being developed to detect transit signals in stellar light curves.
This paper compares such algorithms for the next generation
of space-based transit detection surveys like CoRoT, Kepler, and Eddington.
Five independent analyses of a thousand synthetic light curves are presented.
The light curves were produced with an end-to-end instrument simulator and
include stellar micro-variability and a varied sample of stellar and
planetary transits diluted within a much larger set of light curves.
The results show that different algorithms perform quite differently, with varying degrees
of success in detecting real transits and avoiding false positives. We also find that
the detection algorithm alone does not make all the
difference, as the way the light curves are filtered and detrended beforehand also has
a strong impact on the detection limit and on the false alarm rate. The
microvariability of sun-like stars is a limiting factor only in extreme cases,
when the fluctuation amplitudes are large and the star is faint.
In the majority of cases it does not prevent
detection of planetary transits. The most sensitive analysis is performed
with periodic box-shaped detection filters.
False positives are method-dependent, which should allow reduction of their
detection rate in real surveys.
Background eclipsing binaries are wrongly identified as planetary
transits in most cases, a result which confirms that contamination by background
stars is the main limiting factor.
With parameters simulating the CoRoT mission, our detection test indicates
that the smallest detectable planet radius is on the order of 2 Earth radii for a 10-day orbital period planet
around a K0 dwarf.
Key words: planetary systems - methods: data analysis - techniques: photometric - methods: observational
Several transit detection algorithms were proposed in the recent literature: Bayesian algorithms (Defaÿ et al. 2001; Aigrain & Favata 2002; Doyle et al. 2000), matched filters (Jenkins et al. 1996), box-shaped transit finder (Aigrain & Irwin 2004), and the Box-fitting Least Squares (BLS) method (Kovács et al. 2002). A theoretical comparison of these methods was proposed (Tingley 2003), which concluded that "no detector is clearly superior for all transit signal energies'', but an optimized BLS algorithm still performs slightly better for shallower transits. Here, we adopt a more empirical approach to make the comparison by using as a testbench a set of synthetic light curves with detailed simulations of the instrumental noise and astrophysical sources of variability. The test of these five different transit detection techniques was blind, as the five different detection teams had no prior knowledge of their content.
This comparison of detection algorithms is likely to be relevant for all transit-search programmes, both from the ground and from space, although it was focussed here on CoRoT, to be launched in 2006, as the first space mission largely dedicated to transit searches. The CoRoT characteristics are given in Boisnard & Auvergne (2004), and its planet detection capability is estimated in Bordé et al. (2003). This ability is empirically addressed in this paper.
The goals of this blind detection simulation are the following:
Section 2 presents the light curve building procedure; Sect. 3 then describes the five light curve analysis methods, and Sect. 4 discusses the results and reaches conclusions.
The synthetic light curves were built by combining several components: the instrumental model, stellar micro-variability, and in some cases, an additional event, such as a planetary transit, an eclipsing binary or a variable star.
An instrument model (Auvergne et al. 2003) has been designed for CoRoT in order to evaluate the instrument detection capabilities and test the onboard and ground-based software. We use the output of this model as the basis of our synthetic light curve construction. Let us recall that the CoRoT onboard software will perform photometry on a pre-determined list of stars (12 000 per pointing) every 8 min during 150 days, by summing all the signal within pre-defined aperture covering between 100 and 60 pixels depending on the magnitude. Environmental perturbations, such as light scattered by the Earth, radiation flux, Attitude Control System jitter and temperature variations, are computed by specialised models. The outputs are light curves at the focal plane level, proton fluxes with a 10 mm CCD shielding, satellite angular depointing and temperature curves for the most sensitive sub-systems. Monochromatic PSFs are then provided using an optical model of the telescope, and used to compute white PSFs, taking into account the optical transmission, CCD quantum efficiency, and target flux for main sequence stars in the effective temperature range 3500 to 9000 K. The appropriate photometric aperture is computed, depending on the star position, magnitude and colour (Llebaria et al. 2003).
We build 25 basic light curves based on stars scanning 5 mag from 12 to 16 and 5 temperatures from 4500 K to 6750 K, all located at the same CCD position. They contain the following realistic noise contributions:
![]() |
Figure 1: Example of an instrumental light curve before ( top) and after the partial correction of scattered light (once underestimated ( middle), and once overestimated ( bottom)). The sharp peaks in the upper plot are due to the SAA crossing; they become gaps in the output light curves. |
| Open with DEXTER | |
In order to simulate optical light curves for main-sequence stars
with faster rotation than the Sun and with a higher activity level, the rotation period and the
areas of the three model active regions are varied:
the areas of the three active regions, as well as the uniform background term, are
multiplied by a
factor
where
is the average amplitude of the optical light curves
of a star of rotation period P and spectral class Sp derived from Messina et al. (2003),
and where
mag is the maximum amplitude of the
solar optical variability.
For stars with a rotation period longer than 12 days, there is no
information on the amplitude of the rotational modulation in the optical
passband (except for the Sun), so that fis assumed to be in the range 1.5 to 6 for a spectral type varying from F5V to K5V.
The coordinates of the three active regions are those of the
solar model active regions, and the inclination of the stellar
rotation axis with respect to the line of sight is fixed at
.
To reduce
the impact of the small discontinuities occurring every 7.0 days at the passage
from a fit to the next, the model parameters are linearly interpolated in
time between successive best fits. The brightness contrast coefficients and their
center-to-limb variations are the solar ones.
The ratio of the area
of the faculae to that of the sunspots in an active region is estimated by
extrapolating the relationship given by Chapman et al. (1997) to larger sunspot
areas. The resulting facular contribution is found to be negligible for stars
with a rotation period shorter than 20 days and spectral type later than G8.
The variability on time scales significantly shorter than the rotation
period is modelled by scaling the residuals of the best fits to the solar TSI
variations, which are due to the evolution of the solar active regions on time scales
shorter than 4-5 days (Lanza et al. 2003,2004). In order to increase the
amplitude of the short-term stellar variability to make the planetary transit search
more challenging, the residual solar variability is multiplied by a factor 3f and
linearly interpolated to get an even time sampling of 8 minutes. Finally, Poisson
random fluctuations with a relative standard deviation of
are
added to simulate short-term variations due to microflaring
or convection on time scales of several minutes.
In addition to the original TSI light curve, 9 light curves were produced with this method, with spectral types F5, G0, and G8 and rotation periods 3, 10, and 20 days. The amplitude of micro-variability ranges from 0.1 to 4%. The stellar optical time series so obtained are dominated by the rotational modulation except for rotation periods longer than 15-20 days for which the active region evolution prevails on the rotational modulation signal. A few small discontinuities are present, due to the passage from a 14-d fit to the successive one, but they never exceed 5% of the amplitude of the rotational modulation, even in the case of the most active stars.
SIMLC is a tool to simulate stellar micro-variability for stars with spectral types F5 to K5 and ages later than 625 Myr. It works by computing an artificial power spectrum, starting from a fit to solar data and scaling it using empirical scaling laws. The power spectrum is then sampled as appropriate, given the time sampling and light curve duration required, coupled with a random phase array and reverse Fourier-transformed to the time domain. More details can be found in Aigrain et al. (2004), so only a brief summary is given here.
Following Andersen et al. (1994), the power spectrum of the Sun's total
irradiance variations up to
600
Hz (as observed with the
PMO6 radiometer, which is part of the VIRGO experiment on SOHO) is
modelled as a sum of three broken power laws, each characterised by an
amplitude, characteristic timescale, and slope.
There are 3 components with timescales of 10 days, 4 days, and 10 min. The powerlaw slopes are 3.8, 1.8, and 2.0. All these values are
those measured for the Sun. Note that because the slope of
the first powerlaw is quite steep it falls of quickly for timescales larger than
10 days, while the second powerlaw, which is quite shallow, is still
the dominant component at 100
Hz (timescales of a few hours,
typical of transits). The amplitude of the
lowest frequency, or "active regions'', component is correlated with
simultaneous measurements of the Ca II K-line index indicator of
chromospheric activity. Higher frequency components, which have much
smaller amplitude, are thought to be related, respectively, to super-
or meso- granulation and to a superposition of granulation,
oscillations, and photon noise.
Empirically derived scaling laws can be used to scale the amplitude and timescale of each power law to what might be expected for other stars. Currently this can be done only for the dominant low-frequency component, using chromospheric activity as a proxy. Observational constraints are currently insufficient to derive scaling laws for the other components, including the second component that corresponds to the timescales characteristic of planetary transits, so those are thus left as they have been measured in the Sun. Upcoming data, in particular from the MOST (Micro-variability and Oscillations of STars) satellite (Walker et al. 2003), are expected to provide constraints on this component in the near future.
A set of 45 light curves lasting 150 days, with 8 min sampling, were generated for the present exercise. They correspond to a grid of stars of spectral type F5, F8, G0, G2, G5, G8, K0, K2, and K5, and ages 0.625, 1, 2, 3, and 4.5 Gyr. The amplitude of the dominant, "active regions'' component of the variation scales with convection zone thickness (which is larger in later spectral types) and the inverse of the rotation period (which is larger in older stars), while the characteristic timescale scales roughly with the rotation period. As a result, at 0.625 Gyr the most variable stars are F-stars, while at 4.5 Gyr they are K-stars. The amplitude of micro-variability ranges from 0.01 to 0.1%, a level much lower than those obtained with the method described in Sect. 2.2.1. This is thought to be due to the more coherent nature of micro-variability in active stars, which SIMLC currently cannot reproduce.
Twenty planet transits were
simulated. For a thousand light curves, this represents about an order of magnitude
more transit events than expected in real samples
(Bordé et al. 2003). It is important that light curves without transit vastly outnumber
those with transits in the simulation, so that the detection thresholds have to be set realistically high.
The characteristics of the inserted transits are not chosen with the
goal of reproducing planet statistics, because those are mostly unknown in the
range where CoRoT will discover planets; the idea is instead to test
limitations and to explore the borders of detectability.
The objectives are then (1) to sample a variety of system cases; and (2)
to investigate the detection limit by including a large number of small
planets in light curves with a varying noise level. The characteristics of
the transits are summarized in Table 1.
The planet size spans the range from 1.6 Earth
radius (
)
to 1.3 Jupiter radius (
). One system with two
planets is inserted. The period domain is 4 to 90 days.
Target stars with
the planetary transits are chosen at "directed random'', with the aim
of exploring the regions near the limit of detectability.
For instance, the largest planets are
inserted in the light curve of faint and/or active stars. The largest
planets are also the ones with the lower number of transits (the hot
Jupiter configurations, as easy cases for space transit searches, are
not emphasized here).
Table 1: The characteristics of the transits that were inserted in the light curves: the star radius R (in solar radius units), the stellar limb darkening coefficient (LD), the planet radius r, the orbital period in days, the system inclination in degrees, the semi-major axis a, the star magnitude, the final standard deviation of the light curve in percents, and some comments. The detection flag shows a series of + and - signs, corresponding to each team, respectively, from 1 to 5; + means a positive detection (for Team 1 in position 1, etc.), - means that the event is missed.
The transit light curves are simulated with the aid of the Universal Transit Modeler (Deeg 1999). Limb darkening of stars are estimated from recent calculations from ATLAS9 models and the CoRoT bandpasses (Barban priv. comm., see method in Barban et al. 2003), considerig both a linear limb-darkening law and a classical mixing-length theory.
The characteristics of these light curves are summarized in Table 2. Again, the characteristics of the systems are chosen to cover most possible combinations rather than to reproduce the expected characteristics of real samples. Our eclipsing binary transits include curves with anti-transit signals, with sine and double-sine modulations outside the transits due to the ellipsoidal deformation of the primary under the gravitational influence of the secondary, V-shaped eclipses (grazing) and U-shaped eclipses (central eclipse in a background contaminant system). For grazing eclipsing binaries, the algorithms of Mandel & Agol (2002) and Wichmann (1998) are used. The Universal Transit Modeler (Deeg 1999) is used for background eclipsing binaries and the triple star. The variable star light curves are taken from the literature and from the archives of the AAVSO (American Association of Variable Star Observers).
Table 2: Table of contaminating events introduced into the light curves: magnitude, event type ("BEB'' stands for background eclipsing binaries, "GrB'' stands for grazing binaries), period and relative flux (contribution of the background star to the total flux), and the standard deviation of the final light curve. Detection flag: detection and correct identification (+), wrong identification (i), no detection (-), for each team from 1 to 5. References: UTM (Deeg, 1999, UTM), Nightfall (Wichmann, 1998, W98), (Mandel & Agol, 2002, MA), AAVSO (American Association of Variable Star Observers), Andreasen (1988) (A88).
The temporal sampling of the final light curve is 8 min, with a duration of 150 days, as for CoRoT long observing runs. A complete light curve contains 25 056 data points.
The package of 999 light curves (identified with ID 1 to 999 in the following) were supplied to the detection teams with information neither on their content nor on the way they were calculated, wether the number of hidden planets or the nature of injected noise sources. In the real case with CoRoT light curves, some data will be known beforehand, such as the star magnitude, spectral type, luminosity class, contamination by neighbours, and pipeline processing parameters. This knowledge is not fundamental for transit detection but will obviously help in the identification of the detected events.
In this section, we describe the five methods used for detrending the light curves and detecting the transits. Their elements span a wide range of complexity from fairly basic to very evolved. They also differ by their previous use: one team started from scratch with no experience in transit detection, two teams use algorithms that they developed for ground-based transit surveys (BEST and OGLE), and two teams are working on algorithms for space-based transit searches.
The first algorithm is based on correlation of the light curve with a single sliding template but without prior detrending. Systematic noise on short timescales is removed from the correlation function, then candidates with a high signal in the correlation function are examined individually by eye to pick up the final detections.
The light curves are correlated with a sliding template to compute a
correlation function C(t). The template is a transit shape based on the
algorithm of Mandel & Agol (2002). The use of a unique transit template is
sufficient and makes the method much simpler; the optimum template has
a transit duration of
8 h and is bordered by
two flat segments of
14 h. Previous filtering of the long-term variations
is not crucial in this case, because the template covers only a small part of
the light curve at a time. Figure 2 shows the resulting
correlation functions for a few cases. In this method, no periodicity is assumed in the transit signal, and the period is estimated a posteriori.
One advantage of the correlation method is that it is not affected by gaps in time coverage of the data. Missing epochs simply make no contribution to the correlation function, which avoids those problems caused by any interpolation of the data in the gaps.
![]() |
Figure 2: A) and B) are two correlation functions ("detection curves'' DC) showing systematic noise. Artefacts are sometimes obvious (synchroneous spikes and similar envelope) or can be hidden, with a known or unknown origin. C) and D) show DC613 before then after detrending (note the very different y-axis scales). |
| Open with DEXTER | |
As explained above, no detrending was done on the long-term
variations. Correlation curves show a common pattern of perturbation on short time scales,
associated with instrumental effects like temperature changes ("breathing''),
scattered light or pointing jitter. We assume that this instrumental noise introduces a common noise
in all correlation functions, except for a scale factor.
We model this by
where
is the temporal correlation curve,
,
the
unknown noise-free correlation curve, and p (with
by convention) the
unknown instrumental perturbation common to all objects, weighted by the
unknown
.
It appears that the average of
is close to zero, so that p cannot be simply estimated by averaging the
curves. To retrieve p we apply the following sequence:
Detrended correlation functions exhibiting a strong signal (i.e. about 5% of the light curve sample) are then examined by eye, selecting the candidates with strictly periodic signals and folding accordingly each light curve to point out autosimilarity of the shape.
It turns out that the "families'' of objects used to remove the noise in the correlation function often correspond to sets of light curves based on the same parent noise curve. Therefore, with this method, the removal of the systematic noise is probably more efficient on simulated data than it would be in reality.
Correlation with a sliding transit template is among the simplest possible methods for transit detection, short of direct examination of all light curves by eye, and the results of this algorithm on our synthetic sample can be used as a reference point of comparison for the performances of the other algorithms.
![]() |
Figure 3: The different steps of the light curve analysis of Team 2 for light curve ID34. First the data gaps (not visible at this scale) are interpolated and the light curve is normalized ( top), and a lowpass-filter is applied to remove high frequency signals ( middle). Finally the stellar variability is modeled and a search for period signals performed ( bottom). The periodic signal found is marked in the figure. |
| Open with DEXTER | |
Finally all light curves with detected events are classified as either possible transit-like or other events.
In this method, the light curves are detrended by fitting 200+5 harmonics, then transits are detected with a box-fitting on the phase-folded signal.
![]() |
Figure 4: Top: The adjusted SR function for light curve 34, which shows the typical peaks of a transit signal. Bottom: The distribution of the normalized SR of the 999 light curves. The arrow points to the adopted detection threshold of 7.0 (Team 3). |
| Open with DEXTER | |
Removing harmonics with periods as short as 1.5 days may modify the shape of the transit signal a little, but it does not affect the detection capability. Final characterization of the transit signal is done by fitting a simplified transit model. We used linear ingress and egress, and a 'flat-bottom' transit. Fitting it together with the harmonics proved quite easy, using the SVD pseudoinverse method, and the derived transit signal is not modulated by the harmonics.
The GF detrending procedure is the following: (i) the light curve is successively under-sampled and expanded with a linear recursive interpolation method over the data gaps in order to keep the total size of the light curve unchanged; (ii) the resulting light curve is smoothed out with a 4-width smoothing filter (widths are 2n with n = 6, 7, 8, 9), producing smoothed light curves with different low frequency ranges; (iii) the final light curve is chosen as the optimum of the four filtered light curves. The final choice is made in Fourier space looking at the local minima of the energy contained in the four light curves and selecting the one within the lowest frequency range (i.e. the furthest from the transit frequencies).
For a given light curve, the best fit of the low frequency modulations obtained with our detrending method is denoted CLF. In most light curves, the low frequency modulations are quite weak and using under-sampled light curves with loose smoothing is sufficient.
With the above method a total of 25 light curves were found to contain transit-like features (Tables 1, 2): 19 are identified thanks to Fourier correlation; 13 (resp. 2) correspond to single (resp. bi-) periodic features present all along the light curve, and 4 have the characteristics of an eclipsing binary. Bi-periodic events are characterized by two non-commensurable periods. Selection by peak sorting allowed the identification of the 6 other detections, with a lower confidence level but also some secondary features.
Long term (stellar) variations are then removed using an iterative
clipped non-linear filter (Aigrain & Irwin 2004). First the light
curve is pre-filtered with a combined median/boxcar filter (duration
7, 3 samples) to remove short duration glitches and to
minimise the removal of signal from transit-like
features. A "continuum'' is then computed
from this pre-filtered curve by iteratively applying a similar
median/boxcar filter (duration 2d,d samples, where d is the trial
transit duration), flagging points where the difference between
continuum and original is >3
,
and recomputing the continuum
without the flagged points up to 5 times. The
is robustly
re-computed at each iteration from the median of the absolute deviations
of the difference signal. The final clipped continuum is
subtracted from the original signal and the median level restored to
give the filtered (white-noise-like) light curve (see example in
Fig. 5).
![]() |
Figure 5: Light curve 34 before ( top) and after ( bottom) iterative non-linear filtering with a trial duration of 3.3 h (Team 5). The Y-axis represents a relative flux. |
| Open with DEXTER | |
Light curves without events form a clump at low
and
,
while
those containing significant residual stellar variations form a tail
at high
,
with
.
A threshold of the form
was therefore used to pick out periodic events, with
a=1 (a makes the threshold more stringent at low
's) and
b=1.3. All events below a similar line with b=1.4 are marked as
low-confidence events. All light curves with
are also
included in the candidate lists as potentially containing single deep
transits.
![]() |
Figure 6:
Candidate selection in the multiple ( |
| Open with DEXTER | |
The long-term variation filtering and transit search are run for
trial transit durations of 3.3, 6.7 and 13.3 h, yielding 3 initial
lists of 30, 74, and 167 candidates respectively. After examining the
corresponding light curves by eye to remove obviously spurious
candidates, the final (merged) list contains 31 candidates, of which
6 are low-confidence detections (
), 5
are identified as eclipsing binaries due to visible secondary
eclipses, 1 as a triple star system and 1 as showing only sinusoidal
variations (no transits).
The actual duration of transit candidates is estimated as the
full-width at half-minimum of the transits in the phase-folded,
filtered light curve. If that differed from the trial duration, the
filtering is re-run using the measured duration to obtain a better
transit depth measurement; period and epoch were deduced from the
transit search itself. Along with the transit search, a search
for periodic variations with
days is run by
sine-fitting, providing improved period estimates for the stellar
variables identified by the transit search, and one additional
detection of sinusoidal variation.
Tables 1 and 2 give details of the detection ability of each team for each transit and other contaminating events. From direct comparison of the individual results we observe that:
The results show that the simple correlation method proposed by Team 1 is already an effective detection tool (22 detected events over 38 inserted). It also appears that Teams 3 and 5 have detected significantly more transit events than the other three teams (26 detected events). Team 3, moreover, had no false positive, compared to five false positives for Team 5. Team 5 could have included less false positives with a higher threshold (see Fig. 6), but the method of Team 3 has the additional advantage of a very natural way of setting the threshold (Fig. 4, bottom). This points towards a greater robustness of the method used by Team 3. It confirms that the BLS algorithm is more sensitive to faint transits, a result which also shows up in the theoretical comparison performed by Tingley (2003) or in the recent re-analysis of the OGLE data (Udalski et al. 2003). The better results of Team 3 could also be due to a more efficient detrending technique.
Figure 7 shows the three types of results (5 detections, 1 to 4 detections, 0 detection) against the main parameters that affect detection
sensitivity: transit depth d and number of transits n in the light curve. The
non-detected events are all situated below the empirical detection curve
,
except one which corresponds to
a difficult case described earlier (ID 715).
The detection capability of CoRoT derived from this blind test analysis (where
r is the planet radius and R the star radius) are:
Table 3 gives the corresponding values of the minimal detected planet size for four types of parent stars, F0V, G0V, K0V, and M0V.
![]() |
Figure 7: Depth of the transits versus number of transits. Plus signs show the non-detected events, diamonds show the events detected by five groups independently, and filled circles correspond to 1 to 4 detection occurences. The dashed line thus shows the border of the simulated CoRoT detection limit (proportional to n-1/2). The only plus sign above the detection line is a grazing planet on a faint fluctuating star. |
| Open with DEXTER | |
Table 3: Minimum planet radius for F0V, G0V, K0V, and M0V stars in unit of Earth radius, corresponding to the empirical detection curve estimated by the blind test, which possibly overestimates the minimal radius of the detected planets at the longest periods. The star radii are from Allen (2000), i.e. 1.5, 1.1, 0.85, and 0.6 solar radius, respectively.
False detections:
None, due to the low sensitivity limit of the method
and to visual elimination steps.
Prospects for further improvements: The periodicity of the transit signal could be used in the detection. The removal of the instrumental noise could be improved with a tool such as Principal-Component Analysis. Filtering the long-period variations would also be useful.
Non-detections:
For most of the non-detections some individual events were detected on
a very low confidence level, but most signals were below the detection
limit of our routine. To detect these events a search in folded light
curves is necessary to improve the S/N ratio of the signals.
For ID 168, several transit-shaped events were detected with medium
confidence level, but many resulted from the variability of the stars,
thus confusing the detection algorithm searching for a periodicity. Consequently
the treatment of the variability of the stars and the robustness of the
periodicity search has to be improved.
False detections:
Only one false detection was made by the team. In light curve ID 213,
simulating a faint star, a false transit event was found. This
detection had the lowest confidence level of all our detections (3
).
Prospects for further improvements: A first step would be to search for transits in folded light curves to be able to detect fainter transits in noisier environments. We also plan to test a Fourier analysis and remove frequencies that can be identified as instrumental noise. The deformation of the transit events can be prevented that way. Additionally, the light curves of variable stars have to be analysed more carefully to reduce periodic variations that can confuse the detection algorithm.
Prospects for further improvements: The detrending process may benefit significantly from new procedures recently developed for systematic-effect removal (Kovács et al. 2005; Tamuz et al. 2005). This procedure may remove a significant part of the stellar variability, but also some systematic effects that were not modelled in this exercise.
The detection stage may benefit from the correction to the BLS algorithm proposed by Tingley (2003). In theory, the corrected BLS should be somewhat more powerful in distinguishing between a transit signal and random noise, thus improving detection ability. Another improvement in application of the BLS may be related to a better sampling of the frequency space, fine tuning of the algorithm parameters (maximum transit width, bin width, etc.), or better adjustment of the SR function. Finally, one could also make a 2-D search that looks at both the "SDE'' and "DDE'' parameters of Kovács et al. (2002) to check whether this allows some gain in the detection capability.
Non-Detections:
A posteriori analyses show that the algorithm cannot detect a planet with
radius less than
or when the noise (likely stellar noise) is so strong that the denoising algorithm starts
modifying the transit itself.
False detections:
Among the detected signals, three of them turned out to be false detection (IDs 701, 703, 983).
The case of ID 983 corresponds to a discontinuity of the light curve produced
by the stellar variability simulation (Sect. 2.2.1).
In the other cases (IDs 701, 703), transit features were erroneously identified with the peak
sorting method due to a random and unlucky location of the peaks in the convolution curve M.
This kind of false detection should, however, not be specific to our
algorithm.
Our best results were obtained when the matched filter was associated with a peak
selection by Fourier correlation. No false alarm is found in this case, while
selection by peak sorting can lead to a number of false alarms due to
ambiguities with noise artefacts.
Finally, the number of false positives does not change with the
detection threshold, which is automatically optimized from an estimate of the noise in the
input signal. The threshold thus strongly depends on the quality of the detrending process.
Prospects for further improvement: The method developed in the present exercise can certainly be improved for higher noise level. A new filter based on image processing is presently being tested to improve the detection capacity. It is developed on the same grounds as the detrending tool presented in Sect. 3.4.
Another issue is the actual robustness of the algorithms to periodicity changes, due for example to binarity, secondary planets, or residual instrumental drift. This question has not been addressed in the present exercise since transit signals were assumed strictly periodic.
Prospects for further improvements: Future improvements will include refinement of the detrending stages, of the choice of threshold through Monte Carlo simulations, and of the post-detection transit characterisation.
The results also show that false detections may not be a major difficulty when various detection methods are applied, since no false event was ever detected twice independently in the simulation. Also, one method (harmonic-fitting filtering plus BLS detection) does not suffer from any false detections within the synthetic sample. We note that stellar micro-variability limits the transit detection only when its standard deviation is larger than 0.5% and its main frequency is around 0.1 day-1. In most cases, stellar micro-variability such as simulated here (Sect. 2.2) is not the main limitation, mostly because the fluctuation frequencies are not in the domain of the transit duration, and the amplitude is usually low. This result compares well with the conclusions of Jenkins (2003) and are important in the context of space transit detection missions. Of course, this is true only as long as activity models based on the solar case correctly describe other stars. In the next few years, space astero-seismology missions may provide better constraints on stellar micro-variability on timescales of a few hours.
The present study shows that the detrending method is almost as important for detecting faint transits such as the detection algorithm itself. Precise detrending processes can cancel almost all the variability and reflected light contamination. On the other hand, artefacts of the detrending can cause spurious transit detections. The relative importance of detrending and detection could be quantified by coupling the detrending and detection phases between the five algorithms, but was not attempted in the present study.
Processing of real data from space will suffer more systematic effects than those introduced in this exercise, due to temperature cycles, pointing jitter, or scattered light gradients along the detector. In ground-based transit surveys, systematics are mainly due to fluctuations caused by Earth's atmosphere. Future work will include a comparative study of gains from the correction of systematics using comparison stars, as recently proposed by Tamuz et al. (2005) and Kovács et al. (2005).
The characterisation of transits (shape, radius ratio, orbital inclination, etc.) requires an entirely different set of analysis tools and no particular insight was obtained about it from the detection simulation - apart from confirming that eclipsing binaries can easily be confused with planetary transits.
Some of the algorithms used here focus on the detection of individual transits, as well as on strictly periodic signals. Detection of not strictly periodic transit signals is an issue that was not considered here. In most realistic cases (two planets, circumbinary planet), the transits will be very nearly periodic and the algorithms for periodic signals will probably be able to detect them. But among the algorithms that are studied here, at least two have reached "maturity'' for monochromatic light curves without a priori information. Continuation of this study could consider including more information: e.g. chromatic light curves (CoRoT), colour or spectroscopic information about the target star. It could also include other instrumental contents (Kepler, Eddington) and a refinement of stellar micro-variability in the frequency-amplitude parameter zone where it may mimic transit features.
The 999 light curves produced and a table with the parameters used are available to the community by request to the authors for testing and improving other detection algorithms.
Another by-product of our blind comparison of detecting transits
in light curves simulated as CoRoT data is a refined estimate of the detection limitation on this
instrument under development: a 3-day 1.1
planet around an M0 dwarf star would probably be detected.
CoRoT would also detect the transits of a planet like
,
the 14.5-Earth mass planet
with 9.55-day period recently discovered in radial-velocity surveys
(Santos et al. 2004), if it is larger than 2.7
,
i.e. with a density up to that of
terrestrial planets.
Acknowledgements
We are grateful to the CoRoT PI Annie Baglin and to the whole CoRoT/Exoplanet Working Group for their support and fruitful discussions during this exercise. S.Z. wishes to acknowledge support by the European RTN "The Origin of Planetary Systems'' (PLANETS, contract number HPRN-CT-2002-00308) in the form of a fellowship. Finally, we express our acknowledgements to the anonymous referee for his/her detailed reading and many interesting suggestions.