Free Access
Issue
A&A
Volume 636, April 2020
Article Number A70
Number of page(s) 18
Section Planets and planetary systems
DOI https://doi.org/10.1051/0004-6361/201937412
Published online 20 April 2020

© ESO 2020

1 Introduction

The following decade will see the outcome of several missions in the field of extrasolar planets. With the new space missions like the Transiting Exoplanet Survey Satellite (TESS; Ricker et al. 2015), the Characterizing Exoplanet Satellite (CHEOPS; Fortier et al. 2014), and the Planetary Transits and Oscillations of stars mission (PLATO; Rauer et al. 2014), we expect to be able to detect and precisely characterize several thousands of new transiting Neptune- to Earth-like planets. However, as it was already the case with bright stars observed with Kepler (Gilliland et al. 2011, 2015), the high-precision photometry of these instruments will not be limited by photon noise but by stellar variability. Indeed, at the stellar surfaces, several phenomena (e.g., spots, plages, flares, convection, oscillations) evolve on different timescales and generate variability that degrades the detection and the shape determination of planetary transits (see e.g., Oshagh 2018 for a recent review).

In this paper, we focus on the stellar variability taking place on timescales similar to the duration of a single planetary transit (i.e., <1 day). On these timescales, the dominant stellar activity contribution of quiet stars of around solar mass comes from the surface convective motions and the pressure-mode oscillations. In the case of exoplanet transits, star spot crossings can also punctually contribute to the short-timescale noise. In this study however, we disregard these punctual noise sources as they are not regularly present in the observations, and their contribution can be averaged out by analyzing hundreds of short individual time series.

Oscillations are studied extensively through asteroseismology. Pressure-mode oscillations are generated in the stellar convective envelope of stars of solar mass and allow us to probe the stellar interiors. Their characteristics (amplitudes and periods) directly inform on the evolutionary stage of the star. Solar-type stars oscillate with a period of several minutes and generate a photometric background signal with an amplitude of several tens of parts per million (ppm; Harvey 1988). The amplitudes and frequencies of these p-modes are known to change with the stellar magnetic cycle (Chaplin et al. 2011; Salabert et al. 2011; García et al. 2013).

Convection results from turbulent plasma motion at the stellar surface. When resolved, ascending hot plasma surfacing at the granules appears bright, and cool plasma descending in the intergranular lanes appears darker, leading to variable contrasts in brightness over time. The individual granulation cells can only be resolved for the solar surface, where the individual cells have average sizes of 1000 km (Title et al. 1989) and a median “turnover” timescale between 7 and 10 min (Nesis et al. 2002). However, granulation is an evolving process: the size of the granules grows and shrinks with time, the cells merge and split with the surrounding granules, and the resulting photometric variability appears correlated over timescales larger than the turnover period (Seleznyov et al. 2011). At large scales (~ 3 × 107 m), we can also observe a conglomerate of convection cells when mapping the magnetic flow on the solar surface.

Thanks to the Solar and Heliospheric Observatory (SoHO) measurements, the photometric signature of solar granulation is known to be approximately 100 ppm (Dravins 1988; Fröhlich et al. 1997; Aigrain et al. 2004). The typical amplitudes and turnover timescales of surface granulation depend on the stellar parameters. Amplitudes increase with effective temperature (and possibly thestellar metallicity; see Corsaro et al. 2017) and decrease with decreasing stellar mass and/or increasing surface gravity (Svensson & Ludwig 2005). Both decrease with increasing mean oscillation frequency νmax (Dravins 1988; Kallinger et al. 2014). Recently, high-precision Kepler measurements have made it possible to measure the photometricamplitude of the granulation variability in increasing detail, which has given rise to a new technique for deriving the stellar surface gravity (Mathur et al. 2011; Bastien et al. 2016; Cranmer et al. 2014; Pande et al. 2018) and density (Kipping et al. 2014).

While granulation contains extensive information for stellar physics, because of its stochastic nature it is typically considered as noise from the point of view of detecting and/or characterizing planets and their atmospheres. In an exoplanetary context, granulation is often referred to flicker noise as its power spectrum follows the form of a power-law function in a specific frequency range related to its turnover timescales. In practice, because the granulation turnover timescale is small (≪ 1 day) for solar-type stars, this stellar variability can be considered uncorrelated from one planetary transit to another. Therefore, the influence of this noise can be reduced when several transit events are observed and the overall signal-to-noise ratio (S/N) can be improved byphase-folding the planetary transits with the planet orbital period. However, this technique is not efficient when only a small number of transits are observed (e.g., for long period planets) or when the transits show transit timing variations (multiplanetary systems). Hence, the statistical properties of the flicker noise need to be known in order to design dedicated modeling strategies correctly accounting for (and possibly reducing) its influence when estimating the parameters of transiting exoplanets.

Granulation noise is often modeled using Harvey law profiles (Harvey 1985) with parameters (amplitude and timescales) estimated based on the observed stellar power spectral density (PSD; Pallé et al. 1999; Kallinger et al. 2014; Cranmer et al. 2014). One success of this technique is the relation between the parameters of the Harvey law profiles and the stellar fundamental parameters, which appear to be correlated (Kallinger et al. 2014). Recently, Pereira et al. (2019) proposed to couple Gaussian Processes (GP) noise modeling (Rasmussen & Williams 2005) with PSDs based on Harvey-like profiles and showed that taking into account the flicker correlations within the transit analysis tends to improve the accuracy of the inferred transit parameters. Barclay et al. (2015) came to a similar conclusion using other GP noise models to fit for granulation and derive the parameters of the transiting hot Jupiter Kepler-91b. A more realistic way to model stellar granulation is to turn to modern three-dimensional radiative hydrodynamical simulations of stellar convection (Nordlund et al. 2009). While computationally demanding, these codes allow the generation of realistic photometric time series of flicker noise and could provide valuable diagnostics to extract the flicker properties affecting transits (Chiavassa et al. 2017). This has been shown by Chiavassa et al. (2015) with the transit of Venus in 2004.

In this work, we aim to understand the effect of stellar flicker on determining accurate and precise properties of exoplanets through transit photometry and describe how this noise behaves as a function of the stellar parameters. This paper is organized as follows. In Sect. 2, we summarize the statistical properties of solar flicker using 21 yr of continuous solar observations provided by SoHO. In Sect. 3, we analyze the impact of solar flicker on the determination of the transit parameters. In Sect. 4, we use short-cadence Kepler observations to extract the flicker properties and show their dependence with the stellar parameters. In Sect. 5, we discuss the potential for the photometric characterization of this noise source with the future high-precision observations of CHEOPS and PLATO. We conclude in Sect. 6.

2 Statistical characterization of short-timescale solar variability

This section summarizes the statistical properties of solar granulation in photometric observations. We aim to list the properties that are needed by signal processing routines to analyze the influence of this noise source and derive appropriate noise models.

2.1 VIRGO observations

The Sun has been monitored since 1996 by the ESA/NASA SoHO spacecraft. Onboard, the Variability of solar IRradiance and Gravity Oscillations (VIRGO) experiment measures the spectral irradiance with a three-channel sun photometer (SPM) of 5 nm bandwidth at wavelengths of 402 (blue), 500 (green), and862 (red) nm. Observations at these wavelengths span different heights in the solar photosphere (from − 20 km compared to the base of the solar photosphere for the green and blue channels to + 10 km for the red, Jiménez et al. 2005). The data are integrated over 60 s and centered around the full minute. The duty cycle of the 21 yr of VIRGO time series (from April 11, 1996, to March 30, 2017) is around 96%, making it the best data set available today to derive the statistical properties of the solar variability. The photon noise in each of the three channels is below 10 ppm (Salabert et al. 2017). A full description of the instrument’s characteristics and technical calibration procedures can be found in Fröhlich et al. (1995, 1997), and Jiménez et al. (2002).

The available data (level 1) have been converted to physical units (W m−2 nm−1), corrected bytemperature variations, and calibrated to a constant distance between the spacecraft and the Sun. We carefully corrected for the instrumental degradation1 evolving over time following the procedure below:

  • 1.

    For computational reasons, we split the whole data set into subseries, each spanning 365 days.

  • 2.

    We smoothed each subseries using a running average of 3 days in length and localized the 3σ outliers. These outliers were disregarded in step 3.

  • 3.

    We fitted a high-degree polynomial function to the smoothed time series and used it to normalize the initial time series (containing the outliers).

  • 4.

    We removed the 5σ outliers from each corrected time series,

  • 5.

    We finally compared the resulting three SPM datasets and kept the common data points between these three series.

As the emergent flux decreases from optical to infrared, we observe a strong color dependence between the datasets, with higher variability at short (blue) wavelengths than at long (red) wavelengths (see also Aigrain et al. 2003). This is illustrated in Fig. 1 through the time series root-mean-square (RMS) measurement, where the signature of the approximately eleven-year solar cycle is evident. To study the short-timescale solar variability, we finally divided the detrended datasets into one-day subseries, and removed those subseries that had missing data and strong instrumental features. We obtained a total of 5912 regularly sampled one-day subseries (i.e., ≈275 subseries yr−1). We use this dataset to characterize the properties of the granulation activity in the following subsections.

thumbnail Fig. 1

Yearly RMS of the VIRGO time series for the blue, green, and red SPM channels (see colors). Solar minima at the beginning ofcycles 23 (from 1996 to 2008) and 24 (since 2009) are clearly visible.

2.2 A stationary stochastic colored noise

The solar short-timescale variability (periods < 1 day) is dominated by instrumental noise, oscillation modes, and convection. The distinct signatures of these three noise sources can be well discerned in the PSD of the VIRGO observations. Figure 2 shows an estimate of this PSD through the averaged periodogram defined as (Bartlett 1950)2: PL(νk):=1Ll=1L1N|j=1N Xl(tj) ei2πνktj|2,\begin{equation*} P_{\textrm{L}}(\nu_k): = \frac{1}{L} \sum_{\ell=1}^{L} \frac{1}{N} \Bigg| \sum_{j=1}^{N} ~X_{\ell}(t_j)~\mathrm{e}^{-{\textrm{i}}2\pi\nu_k t_j} \Bigg| ^2,\end{equation*}(1)

with L being the number of available one-day subseries, {Xl}l=0,,L$\{X_{\ell}\}_{\ell=0,\hdots,L}$, per year, N the number of data points of these subseries, and tj = j ×dt the time of the measurements with j = 0, …, N. Also, dt is the temporal sampling and νk:=kN$\nu_k{:}\,{=}\,\frac{k}{N}$ are the Fourier frequencies3 computed for k = 0, …, N − 1. We observe a strong frequency dependence of the PSD due to correlations over several timescales. This frequency dependence is characteristic of a noise that is defined as a colored noise, in opposition to a white Gaussian noise (WGN) that shows a constant PSD over the whole frequency range4. The periodic signatures of solar oscillations are clearly visible around 3000 μHz. Granulation dominates the frequency range corresponding to a typical transit duration (gray area): between activity and oscillations. As long as the planet transit does not cross a spot or plage, this may be the dominant stellar noise source in high-precision photometric observations. By comparing the averaged periodograms computed using subseries taken during a minimum and a maximum of the solar cycle (see dashed and solid lines, respectively), we only note significant differences at low frequencies (ν < f = 44.9 μHz), in agreement with Seleznyov et al. (2011). Although not visible in this comparison, we also note the presence of small changes of the p-modes signatures (both in amplitude and frequency) over the magnetic cycle, in line with findings by Chaplin et al. (2011), Salabert et al. (2011), and García et al. (2013). Interestingly, the frequency region that is dominated by the granulation variability, ν ∈ [f,  2300] μHz, is not significantly affected by the solar magnetic cycle (see Seleznyov et al. 2011; Muller et al. 2018). Therefore, we can state that granulation variability is stationary with respect to the solar cycle (i.e., its statistical properties remain constant over the solar cycle).

thumbnail Fig. 2

Averaged periodograms computed with Eq. (1) using all one-day subseries available during a minimum (2008, solid dashed lines) and a maximum (2003, solid solid lines) of the solar cycle. Each color indicates the observation of one SPM channel. Differences with the solar cycle are observed at frequencies ν < fl (i.e., at periods >6.2 h). The peak at ν = 5570 μHz affecting the PSD of the blue and green SPM data is an electronic artifact related to the calibration period used by the acquisition system of VIRGO. The gray shaded area indicates the frequency range corresponding to the typical duration of planetary transit (from ~ 25 min to several hours).

thumbnail Fig. 3

Example of a one-day VIRGO time series (top) detrended (bottom) using a SG filter with a window size of 15 h (colored solid lines). From left to right: dataset from the blue, green, and red SPM channels.

2.3 Time domain analysis: amplitude distributions

By nature, the influence of a stochastic process cannot be known exactly. Consequently, it is only possible to make a probabilistic statement about its behavior. Here, to characterize the short-timescale variability in the time domain, we analyze how its amplitudes are distributed. By “amplitude”, here we mean the flux offset from the mean value of a given one-day subseries.

To do so, we first eliminated low-frequency noise by applying a Savitzky-Golay (SG) filter (Savitzky & Golay 1964), parametrized by a window length of 15 h and a polynomial degree of 3. This allowed us to consider only the frequency range ν < 2 × f. We note that, to avoid an inappropriate effect of the SG filter at the edges of the one-day subseries, we applied this low-frequency noise removal to the whole one-year time series (i.e., before dividing it into daily subseries as described in Sect. 2.1). This step is illustrated in Fig. 3, where we observe the wavelength dependence of these variations, which can reach several tens (red channel) to hundreds (blue channel) of parts per million (Dravins 1988). The amplitudes of the flicker noise therefore increase when observing deeper layers of the solar photosphere (Jiménez et al. 2005). Intuitively, we can expect this variability to follow a Gaussian distribution. Indeed, if we assume the dominant source of variability comes from convective motions, then this variability results from a multitude (~ 106) of independentstochastic granules, each generating photometric variability of a similar amplitude. This is a physical manifestation of the central limit theorem. To validate this hypothesis, we performed a Shapiro-Wilk normality test (Shapiro & Wilk 1965) on each solar one-day subseries. This results in the nonrejection of the Gaussian hypothesis for more than 95% of our dataset. Without establishing a formal proof of Gaussianity, this test does not argue against this hypothesis. In this approximation, we can therefore state that flicker noise is Gaussian (i.e., the amplitudes of the noise follow a Gaussian distribution). We note that Gaussianity here refers to the probability distribution of the amplitude of the flicker but does not inform on the way the power of this noise is distributed in the frequency domain (correlations). The latter remains, a priori, unknown (see Sect. 2.4). Gaussianity is important as it implies that noises correlated over timescales of minutes to hours (mostly convection processes) can be completely defined by the two first moments of their distribution: the mean and power spectral density (related to the variance, Simon 2006).

Other common measurements to characterize a stochastic noise are the RMS and the amplitude range. For granulation, the “eight-hour flicker” measure (or “F8”), corresponding roughly to the RMS evaluated over a timescale between 0.5 and 8 h, is also often computed. To compute the F8 measurement as described in Bastien et al. (2016), we binned the solar one-day subseries into intervals of 30 min (to mimic the Kepler long-cadence observations) and used a 16-point boxcar filter5 to remove the long-term activity over periods >8 h. Figure 4 shows the distribution of these three quantities evaluated on the one-day VIRGO subseries taken over a year. In each panel, the distributions obtained using subseries taken during a solar cycle minimum (resp. solar cycle maximum) are shown by the colored (resp. black) histograms. Typical RMS values are 30− 40 ppm (red), 50− 60 ppm (green), and 70− 90 ppm (blue). Extreme values can reach several hundred ppm (200, 350, and 500 ppm for the red, green and blue channels, respectively). Although comparable to the global RMS, the F8 values are slightly smaller because they are based on data binned into intervals of 30 min and thus encapsulate less of the signal due to granulation. By comparing these values with the transit depth expected for an Earth analogue (see vertical lines), we gather that this variability can easily impede the transit detection or bias the inferred parameters.

In practice, binning the light curve of a single transit to decrease the impact of granulation noise is not optimal because of the long timescale of the stellar convection (Meunier et al. 2015). This is shown in Fig. 5, which illustrates the slower decrease of the RMS of the solar flicker noise compared to purely white noise. For high-precision photometric data, the short-timescale variability cannot be approximated as a WGN. Instead, convection processes produce a colored stochastic noise. We note that acoustic modes are better described as deterministic noise sources (in the sense that they only affect specific frequencies depending on the stellar properties).

2.4 Frequency domain analysis: the solar power spectra

The total dispersion of the time series is a scaled sum of the power at each frequency component (Parseval’s identity, Li 2014, see Eq. (6.1.7), p. 169). As discussed in Sect. 2.2, the solar PSD exhibits correlations over a multitude of timescales. For a given characteristic timescale, one noise process may dominate the others.

Acoustic mode timescales

To localize the typical upper and lower frequency bounds of the acoustic modes, we use the autocorrelation function (ACF) of the observations, similarly to Kallinger et al. (2016). The first zero-crossing of the ACF is the autocorrelation time (tc), that is, the time interval over which the noise de-correlates. For the solar time series, tc corresponds to the corner frequency (fc) and indicates the lower frequency of the p-modes (the separation between the regime dominated by the deterministic p-modes and those dominated by the stochastic convection). The upper limit of the p-mode signature (fh) can be identifiedby the first dip in the ACF. To precisely evaluate these frequencies, we computed the ACFs of each one-day subseries taken during one year and averaged them. The averaged ACF is shown in the main panel of Fig. 6 (black), showing fh = 4000 μHz (~ 4 min) and fc = 2000 μHz (~ 8 min).

thumbnail Fig. 4

From left to right: empirical distribution of the RMS, amplitude range, and F8 measurements evaluated on a set of one-day VIRGO subseries taken during a solar-cycle minimum (2008, colors are related to the three SPM channels). The empirical distributions obtained during a solar-cycle maximum (2003) are shown by the black contours. Distributions derived using the HMI data described in Sect. 3 are shown by the yellow histograms.In each panel, the vertical dashed line indicates the transit depth expected for an Earth analogue (84 ppm).

thumbnail Fig. 5

Effect of data binning on the RMS of one-day solar subseries (1996, black), on the synthetic times series generated with Eq. (3) (yellow), and on synthetic subseries of WGNs (blue). The RMS of the WGN series decreases as Nbin/N$\sqrt{N_{\textrm{bin}}/N}$ (light blue). Solid lines indicate the median values of the observed dispersion (shaded areas). The RMS of the time series at large bin sizes shows a significant dispersion as the number of data points decreases. Horizontal dotted lines indicate the fraction of a typical Earth-like transit depth (i.e., 84 ppm).

Convection timescales

The PSD of the VIRGO observations shows a “knee” shape in the frequency region ν < fc (see inset panel of Fig. 6). The physical origin of this kink is debated and is not observed in radial velocity measurements. Works based on Harvey-profile functions (Harvey 1985) that fit for the stellar background generally attribute this kink to the regime where supergranulation or mesogranulation6 noise dominate. However, based on synthetic simulations of solar granulation alone, Seleznyov et al. (2011) attributed this frequency to the large dispersion of the granulation turnover timescales (the distribution of the granule lifetimes varies from 0 to 30 min). We estimate this frequency (that we called the flicker frequency in this work) at fg = 555 μHz (30.3 min). We finally set the limit between the convection and the activity regimes (constrained here by the low-frequency SG filter with a width of 15 h) at f = 44.9 μHz (6.2 h).

According to Seleznyov et al. (2011), the physical properties of the stellar granule cells, such as the number of cells over the visible stellar surface, their photometric contrast ratio, their lifetime, and average size, can be extracted from the study of the observed stellar PSD. For example, we expect the frequency at which the PSD starts to become “flat” (i.e., fg) to decrease with the increase of the granulation median turnover timescale. This would make this frequency a potential tracker for deriving the lifetime distribution of the granule cells over the stellar surface. Furthermore, the amplitude of the stellar PSD can be related to the average size of the granule cells. We should expect a decrease of the PSD amplitudes with the decrease of the size of the granules, as the ratio between bright granules and intergranular lanes decreases (Seleznyov et al. 2011).

In practice, the stellar PSDs are often modeled with Harvey functions (Harvey 1985). As discussed in Karoff (2012), the choice of this model is empiric and various slightly modified Harvey profiles are found in the literature (see e.g., Harvey 1985; Pallé et al. 1999; Aigrain et al. 2004; Kallinger et al. 2014; Cranmer et al. 2014; Pereira et al. 2019). The difference between these models mainly comes from the choice of the power-law exponent, chosen to fit the observed stellar PSD (Mathur et al. 2011). Consequently, we chose here to represent the noise correlations by using a simpler model based on 1∕να power laws (Pallé et al. 1999). We refer to this model as the flicker model in the following. Written in logarithmic scale, the model is: logPL(ν)=αlog(ν)+β,\begin{equation*} \log P_{\textrm{L}}(\nu) = \alpha \log (\nu) &#x002B; \beta,\end{equation*}(2)

with PL being the yearly averaged periodogram of the one-day solar subseries defined as in Eq. (1), α the power index, β a constant, and νRf$\nu\in \mathcal{R}_{\textrm{f}}$ the frequencies in the regions Rf:={A,B,C,D}$\mathcal{R}_{\textrm{f}}{:}\,{=} \{ \textrm{A, B, C, D}\}$ as labeled in Fig. 5, corresponding to the different regimes outlined above. The power index α and the constant offset β have to be determined for each frequency region Rf$\mathcal{R}_{\textrm{f}}$.

We then estimated the parameter set {αi,βi}i=A,B,C,D$\{\alpha_i,\beta_i\}_{i\,{=}\,\textrm{A,B,C,D}}$ given in Eq. (2) through a least-square minimization of the solar PSDs. The average values are listed in Table 1 and the estimated power indices for each of the 21 yr of VIRGO observations are shown in Fig. 7.

As visible in the PSD, the highest values of the power index α are found for the highest frequency region (A). For all regions considered, the power indices observed in the red channel appear relatively constant with time. For the green and blue channels however, which are more strongly influenced by the detector ageing, the power index drops significantly in recent years (> 2008) in the high-frequency regions A and B. We note that this effect is likely purely instrumental due to increasing white noise. For the timescales inferior to several hours (regions A, B and C), the index values do not vary along the solar cycle but they do for region D. Once again, these results illustrate both the global high quality of VIRGO observations and the stationarity of the flicker variability. Moreover, for ν < fc (regions C and D), the similarities between the time series observed in different colors indicate similar correlations independently of wavelength. Consequently, alternatively to Harvey models, a flicker-based model can be designed based on simple power laws as in Eq. (2). Synthetic time series of this stationary stochastic process can then be generated as: I(t)=k=02 S(νk) δν cos(2πνkt+ϕk),\begin{equation*} I(t) \,{=}\, \sum_{k\,{=}\,0}^{\infty} \sqrt{2~S(\nu_k)~\delta\nu} ~ \cos{(2\pi \nu_k t &#x002B; \phi_k)},\end{equation*}(3)

with S(ν) = exp(βνα) being the parametric noise PSD based on Eq. (2), t the time of the observations, δν the sampling frequency, and ϕ the random phase ∈ [0, 2π]. Following Eq. (3), we generated synthetic time series based on parameters listed in Table 1. We compared these synthetic series with solar observations in Fig. 5. As soon as the number of data points is sufficiently large (i.e., at bin sizes <3 h), we observe a similar behavior for both series demonstrating that our simple flicker-noise model is realistic enough to provide a first-order approximation of short-timescale solar variability.

thumbnail Fig. 6

Estimation of the splitting frequencies. Main panel: L = 364 ACFs corresponding to the available solar one-day subseries observed in 1996 with the blue SPM channel (gray lines) and their mean ACF (black). The mean ACF allows the automatic derivation of the upper and lower frequencies surrounding the acoustic mode regime {fh, fc }. Inset panel: logarithm representation of the associated PSD estimates computed using Eq. (1). The PSD allows us to localize the flicker frequency {fg }. The lower frequency {f} is set by the window length of the SG filter applied to each one-day subseries to remove the long-term variability. In both panels,the splitting frequencies are indicated by the dotted vertical red lines. The blue dotted line represents the slope of the PSD (see Eq. (2)) measured in the frequency region associated with the granulation regime ν ∈ [fg, fc]. From high to low frequency, the distinct regions are denoted by letters A to D.

Table 1

Average values of parameters {α, β} corresponding to Eq. (2) for regions A, B, C, and D and for observations in the three SPM channels.

thumbnail Fig. 7

Evolution of the inferred power index estimated on averaged periodograms of one-day solar subseries. From left to right: index evaluated for (a) the high-frequency region, (b) the p-modes region, (c) the granulation region, and (d) the lowest frequency region. Each color corresponds to an SPM channel. In region (c), the index obtained on the HMI solar observations discussed in Sect. 3.1 is shown by the horizontal black line (with the 1σ uncertainties in gray). This index results from the averaged periodogram computed using the 91 one-day subseries that have been selected during the period of a solar minimum (2017–2019).

3 Impactof solar short-timescale variability on inferred transit parameters

Before extracting the statistical properties of flicker noise from stars other than the Sun, we aim to evaluate its impact on the parameters inferred from exoplanet transit light curves. To this end, we generated artificial light curves of planetary transits based on resolved images of the solar disk during a solar-cycle minimum. In this section, we describe the analysis of these light curves assuming purely white Gaussian noise and quantify the errors made on the inferred parameters when the correlation properties of flicker noise are not correctly taken into account.

3.1 Artificial transit light curves in HMI observations

In order to generate realistic planetary transit light curves, we selected observations7 taken by the Helioseismic and Magnetic Imager (HMI) instrument onboard the Solar Dynamics Observatory (SDO). Since 2010, HMI has been observing the photospheric Fe I absorption line at 617.3 nm almost continuously, producing one image of the solar disk every 45 s. The resolution of HMI is 0.505 arcsec pixel−1 and the optical resolution is 0.91 arcsec, corresponding roughly to 366 km on the solar surface at disk center (Schou et al. 2012).

We selected 91 different dates during a minimum of the solar activity cycle where no significant signatures of active structures (spots, plages) were observed on the visible part of the solar surface. For each date, we downloaded images spanning one day and extracted the solar flux from each of them by integrating the intensity over all pixels.

To create artificial transit light curves of exoplanets in each one-day solar dataset, we superimposed a black sphere on the images, and moved it across the disk, mimicking a transiting planet. We assume exoplanet sizes of Rp = 1, 3, 5, 7, and 10 Earth radii (R). For each planet size, we simulated transits with impact parameters b = 0, 0.2, 0.4, 0.6 and 0.8. This led to 25 artificial transit light curves for each of the 91 solar time series. One of these, corresponding to Rp = 5 R and b = 0, is shown in the top panel of Fig. 8 (see gray line).

This way of generating artificial transit light curves has three effects that are extraneous to real exoplanet transits. First, since the SDO spacecraft is on an inclined geosynchronous orbit around Earth, the apparent size of the Sun on the CCD images changes with time. Second, as the size of the planet is scaled in terms of pixels, the planet-to-sun radius ratio (p = RpR) slightly changes over time. To increase the accuracy of p, which is needed to assume that the radius ratio is constant over time, we oversampled the artificial exoplanet by a factor of five. This means that on one pixel of the HMI image there are 5 × 5 pixels in the exoplanet mask8. The exoplanet mask was then derived in the high-resolution oversampled regime, and interpolated back to the original image resolution under conservation of flux within each pixel. This leads to partial pixel eclipses at the boundary of the exoplanet, and consequently to a greatly enhanced accuracy. In Appendix A, we show the intrinsic variation of the transit depth (p2) over time, which is <0.02% of the true value, leading to relative errors between the transit models and the transits resulting from our experiment below 2 ppm. Finally, with this experiment, we assume discrete exoplanet positions for each HMI image, neglecting the movement of the planet during HMI’s 45s exposures. While the effect of exposure time is known to affect the transit parameters (Kipping 2010), we assume our temporal cadence to be sufficiently small to do not impact our resulting light curves.

The effect induced by the orbit of the recording satellite (period ~ one day) is visible in Fig. 8 as a long-term quasi-sinusoidal variation. From the analysis based on VIRGO data (see Sect. 2), we know that granulation noise dominates periods of <30 min. To correct the light curves from the effect of the satellite motion, we chose to filter each raw (i.e., without transits) solar time series with a smooth SG filter that has a passband larger than ten times the characteristic period of granulation (i.e., >5 h). An example of the final residuals resulting from this data filtering is shown in the bottom panel of Fig. 8. The corrected transit light curves are shown in Fig. 9 (see also Fig. 10 for a better display of a central transit of an Earth-sized planet). We note that we also tried to perform this correction using GP, but to avoid any influence of the GP on the flicker noise, we chose to use simple smooth functions.

Finally, the HMI observations are given without error bars, which are necessary to derive the transit parameters (see Sect. 3.2) together with their uncertainties. We estimated the errors on the individual data points from the residual scatter of the transit-free light curves after correction of the variation due to the satellite motion. On average, we obtained σ = 20−30 ppm (see Appendix. B). The whole set of artificial transit light curves is publicly available online9.

thumbnail Fig. 8

Top: example of a raw solar light curve extracted from HMI images (black) and the corresponding smooth detrending function based on SG filters with a window size of 5 h (red). An artificial transit of a 5 R planet crossing the center of the solar disk (b = 0) is shown in gray. Bottom: residuals of the raw solar light curve after correcting by the SDO satellite motion.

3.2 Impact of flicker on the inferred transit parameters

To retrieve the transit parameters, we used state-of-the-art transit modeling based on the Mandel & Agol (2002) algorithm. For each artificial transit light curve associated to a given set of solar observations, we performed a Markov chain Monte Carlo (MCMC) analysis following the scheme described in Lendl et al. (2017, 2020) and using the differential-evolution MCMC engine developed by Cubillos et al. (2017).

We assumed the planet orbital period (1 yr) was known and fitted for the planet-to-star radius ratio (RpRs), the epoch of mid-transit (T0), the impact parameter (b), the transit duration (td), and the quadratic limb darkening coefficients (u1, u2). We applied uniform priors to each of these parameters. The results are discussed in the following.

We found relative errors on RpRs increasing with decreasing planet size, which is expected as larger planets create deeper transits while the noise level remains similar. Absolute percentage errors (difference between the peak of the MCMC posterior and the true value normalized by the true value) on RpRs appear small (<1–2% for planets with sizes above 3 R), but can be large (~10%) for Earth-sized planets (see Fig. 11). For comparison, we generated synthetic light curves containing only WGN and derived the errors on RpRs using the same MCMC approach. To generate these synthetic WGN time series, we isolated the high-frequency noise present in the data by (i) removing the true transit model from our time series, (ii) applying a SG filter to filter out correlated noise at timescales above 15 min, and (iii) generating atransit light curve with the Mandel & Agol (2002) analytical model. For all involved transit parameters, we found errors on RpRs ≪ 1% (see red histograms in Fig. 11).

We now focus on the impact of the solar flicker noise on the inferred planet-to-star radius ratios. Figure 12 (left panel) shows that for Earth-sized planets, the true values are not within the 1σ uncertainties for 49% of the cases when b = 0 and for 81% when b = 0.8. Moreover, the true value is not even included within the 3σ uncertainties for 11% of the realizations when b = 0 and for 4.5% when b = 0.8. For the largest exoplanets (10 R, see right panel), while the percentage error is very small (<1%), none of our inferred values contain the true radius ratio within their 1 and 3σ uncertainties when b = 0. This offset is not observed when the planet crosses the solar limb (b = 0.8). This effect is likely due to uncertainties on the limb darkening parameters that can bias the retrieved transit parameters (Espinoza & Jordán 2016). In line with this result, Cubillos et al. (2017) showed that MCMC analyses that ignore time-correlated noise produce inaccurate transit-depth estimates and can largely underestimate their uncertainties. Both effects increase as the variance of the correlated noise increases.

For the transit duration, we found a distribution of the relative errors centered around 0%, and with a dispersion up to 2% for the Earth-sized planet (see first row of Fig. 13). For this planet, we measure a dispersion around the true value (td = 13.09 hr) of ± 31.3 min for b = 0 (i.e., error up to 4%) and of ± 55.4 min (i.e., error up to 10%) for b = 0.8 (with the true value td = 7.97 hr).

For the time of mid-transit, T0, we found nooffset but a dispersion of the inferred parameters around the true value that is also quite large for the Earth-like planet: ± 19.2 min for b = 0 and ± 15.4 min for b = 0.8 (see second row of Fig. 13). Moreover (not shown here), the true T0 was not contained inside the 1σ uncertainties for more than 76% of the cases when b = 0 (28% fell outside the 3σ uncertainties) and 77% for b = 0.8 (25% fell outside the 3σ uncertainties). These errors can directly affect the measurement of transit timing variations in multi-planet systems.

Finally, for the impact parameter, the difference between the true and inferred value decreases with the size of the planet (see last row of Fig. 13). For the Earth-sized planet, the distribution is almost uniform making the inferred impact parameter essentially unconstrained by the observations.

These results suggest that previous analyses of single transit events performed on light curves dominated by stellar noise (e.g., for bright solar-like Kepler targets, see Gilliland et al. 2011, 2015) may be miscalculated by a few percent due to a misunderstanding of the correlation structure of the short-timescale stellar noise. All these errors can directly impact the characterization of the exoplanet. As a consequence, they can impact models of the interior structure of planets (see e.g., Dorn et al. 2015)and atmosphere. The present experiment demonstrates the need for new statistical tools dedicated to correctly accounting for the effect of flicker noise, and strategies to mitigate its effect on the derived transit parameters. These new tools will need to encapsulate the statistical properties of flicker that have been derived in this paper (see Sect. 2 for the Sun and the Sect. 4 for other main sequence stars).

Some studies have already focused on identifying the component of flicker noise in photometric data and modeling it. Recently, Morris et al. (2020) performed a similar study, also using HMI data to reproduce exoplanet transits. This study conceptually differs from ours: they used a single HMI image to estimate the flicker variability amplitudes. As a consequence, they underestimate the flicker noise within the resulting light curves. Then, to probe the effect of short-timescale stellar variability on transits, they injected model transit light curves into transit free data. The strength of our study lies in the fact that we extract the effect of this variability on planetary transits without any prior assumptions (e.g., limb darkening) of the transit light curve shape, and at the same time we also account for variability introduced by the changing photospheric flux obscured by the planet.

Using 3D radiative hydrodynamical simulations of solar convection, Chiavassa et al. (2015) succeeded to evaluate the contributionof flicker on the transit light curve of Venus (which appeared in 2004). Moreover, these 3D simulations allow us to derive very accurate limb-darkening laws (Chiavassa et al. 2015), which is another critical aspect we must consider for deriving unbiased transit parameters as described above.

Other studies proposed to use GP to model the short-timescale variability noise sources (including flicker, oscillations p-modes, and high-frequency noise; see e.g., Barclay et al. 2015; Pereira et al. 2019). Compared to analyses based on WGN models (as in this section), modeling the flicker noise with GP models increases the parameter uncertainties but could allow us to improve the accuracy on the transit parameters. However, this modeling approach alone does not improve the precision of the transit parameters. A noteworthy advantage of the GP approach lies in its capacity to constrain the stellar noise properties. For example, Pereira et al. (2019) found that their GP regression is able to derive accurate values of the p-mode mean oscillation frequency νmax. Following this same idea of linking the properties of the flicker noise with the stellar parameters, we now investigate the correlations associated to flicker using Kepler observations for a range of very bright stars on (or near) the main sequence.

thumbnail Fig. 9

Example of artificial exoplanet transit light curves generated using solar HMI observations (quiet Sun, 2018-12-10). Each panel shows five transits for planets with sizes ranging from 1 to 10 R (see legend). Panels from left to right: different orbit configurations (see the panels’ header).

thumbnail Fig. 10

Top: example of artificial transit of an Earth-sized planet crossing the disk center of the Sun (b = 0, black). The transit model with the true input parameters is shown in green and the model computed using the inferred parameters in red. The error on RpRs is around 2% in this example. Bottom: residuals based on the inferred transit model.

thumbnail Fig. 11

Distribution of the absolute percentage error on the planet-to-star radius ratio inferred from our simulated transits for planets with sizes Rp = [1, 3, 5] R (left to right) and impact parameters b = 0 (top) and b = 0.8 (bottom). Inferred errors on radius ratios derived from light curves containing only WGN are shown in red.

thumbnail Fig. 12

Planet-to-star radius ratio and 1σ uncertainties inferred from the MCMC analyses performed on the artificial light curves of exoplanets of size Rp = 1 R (left) and Rp = 10 R (right). The case of b = 0 is shown in black and b = 0.8 in red. The true radius ratios are indicated by the horizontal dotted lines.

thumbnail Fig. 13

From top to bottom: distribution of the absolute percentage errors on td, T0, and b for the whole set of artificial transit light curves.

4 Short-timescale stellar variability on Kepler stars

In this section, we aim to extract the granulation properties using Kepler observations of bright stars. The flicker amplitude is already known to be related to the stellar parameters (see e.g., Bastien et al. 2016, based on F8 measurements). Therefore, we focus here on the relation between the flicker characteristics timescales and the stellar fundamental parameters.

4.1 Kepler short-cadence observations

The Kepler prime mission was operating from 2009 to 2013 (Borucki et al. 2010). It operated in the optical wavelength range λ ∈ [400, 865] nm: a much broader passband than that of the solar observatories VIRGO and HMI (see Sects. 2 and 3). To compare Kepler images with solar observations from VIRGO, previous studies often used a sum of the red and green channels as these are the closest to the Kepler passband (see e.g., Basri et al. 2013; Salabert et al. 2017). Kepler long-cadence observations (29.4 min) have been intensively studied to derive the long-timescale stellar variability for stars of different stellar types (Basri et al. 2013; McQuillan et al. 2012) as well as the short-timescale variability evolving periods of less than one day (see e.g., Mathur et al. 2011; Cranmer et al. 2014; Bastien et al. 2016; Pande et al. 2018). For the latter, it has been shown that the granulation amplitude (F8 measurements) can be linked to the stellar surface gravity. The same can be applied for the turnover granulation timescales (i.e., the period corresponding to fc, Kallinger et al. 2016).

In this section, we focus on the short-cadence (SC, 58.8 s) Kepler observations that have been performed on a small number of stars, as long-cadence observations do not carry information about noise correlations at timescales below 30 min, where most of the granulation signal is located. We focus here on the determination of the correlation properties that we defined in Sect. 2.4 through a flicker power index αg (measured as the slope of the PSD in the frequency range between the corner and flicker frequencies, fc and fg, respectively).

For this purpose, we selected the brightest stars observed by Kepler in SC mode (apparent magnitude mv < 11.5), for which no planet has been detected (a total of 3970 objects). From this sample, we removed binary stars, rotationally variable stars, stars in clusters, peculiar stars, and red giants (ending up with 1401 objects). At this point, our sample contained mainly G and F stars as late-type stars (K to M) were rejected by the magnitude cutoff. For each star from our sample, we downloaded the whole set of SC observations and detrended the light curves following a similar procedure as for the VIRGO time series (see Sect. 2). As the Kepler spacecraft rotated by 90 degrees every 90 days (to keep the solar panels in the direction of the Sun), the observations of a given star are divided into four subseries (called “quarters”) per year. For each target and quarter, we carry out the following procedure: (1) we remove the data points affected by spacecraft safe-mode events and corrected for the background; (2) we smooth the time series using a running average of 3 days length and localize the 3σ outliers; (3)excluding these outliers, we bin the resulting time series into intervals of 24 h, apply a spline function to this binned series, and use this function to normalize the initial time series (containing the outliers of step 2.); and (4) finally we remove the 5σ outliers from the time series.

Following this procedure we combined all observations of the same target from all quarters and split the final detrended time series into one-daysubseries. For each one-day subseries, we filtered out the low-frequency noise associated with magnetic activity using a SG filter with a 15 h passband.

For all targets, the photometric contribution of granulation to the high frequencies (HF) cannot be observed (contrary to VIRGO observations; see region ν < 5000 μHz in Fig. 2). Indeed, the HF noise in these Kepler data is non-negligible with an amplitude comparable to granulation noise.

thumbnail Fig. 14

Example of an averaged periodogram of the Kepler target KIC 7940546 (Ms = 1.152 M, Rs = 1.807 R, Teff = 6244 K, mv = 7.397). The number of available one-day subseries for this target is L = 75. Cut-off frequencies fc and fg resulting from the MCMC analysis are indicated by the vertical dashed lines.

4.2 Frequency domain analyses

For each target, we selected the whole number of available one-day subseries ensuring a minimum number of data points per series. The required condition for a subseries to be considered is that it must have no more than 10% missing data, with a maximum length for a single gap of five consecutive data points. Using these subseries, we then generated the corresponding averaged periodogram using Eq. (1) for each target. An example of one of these periodograms for an F6IV star isshown in Fig. 14.

As in Sect. 2, we aim to describe these stellar PSDs with power law functions defined as in Eq. (2). For this purpose, the target PSDs have to be split into four regions that are: the high-frequency region (A) delimited by fh, the p-mode region (B) delimited by fh and fc, the flicker region (C) delimited by fc and fg, and the low-frequency region (D) delimited by fg and the frequency cut-off of the SG filter, f. These regions are labelled in Figs. 5 and 14. To improve the automatic identification of these cut-off frequencies, and because the Kepler observations are noisier than solar VIRGO observations, we made use of a priori knowledge on the mean oscillation frequency (νmax) to help the localisation of the oscillation p-modes (which are not clearly visible in all Kepler periodograms). Following Brown et al. (1991), Kjeldsen & Bedding (1995), and Belkacem et al. (2011), we estimated νmax based on the stellar parameters as: νmax=νmax,×(MsM)×(RsR)2×(TeffTeff,)0.5,\begin{equation*} \nu_{\textrm{max}} = \nu_{\textrm{max}, \odot} \times \left(\frac{M_{\textrm{s}}}{M_{\odot}}\right) \times \left(\frac{R_{\textrm{s}}}{R_{\odot}}\right)^{-2} \times \left(\frac{T_{\textrm{eff}}}{T_{\textrm{eff},\odot}}\right)^{-0.5},\end{equation*}(4)

with νmax,⊙ = 3150 μHz being the solar value. Moreover, we fixed frequency fh to the solar VIRGO value (i.e., 4000.15 μHz, see Sect. 2) as we found this value to be a good estimate for all considered stars with detectable acoustic modes signatures (mostly G and F stars). For each star, we then inferred the parameters of Eq. (2) and the cut-off frequencies using MCMC analyses. For each MCMC, the fitted (“jump”) parameters are the two cut-off frequencies fc and fg, the three associated power indices α:=[αh,αc,αg]T$\mathbf{\alpha}{:}\,{=} [\alpha_{\textrm{h}}, \alpha_{\textrm{c}}, \alpha_{\textrm{g}}]^T$, and constants β:=[βh,βc,βg]T$\mathbf{\beta}{:}= [\beta_{\textrm{h}}, \beta_{\textrm{c}}, \beta_{\textrm{g}}]^T$ as defined in Eq. (2). The starting values were set to the solar values obtained in Sect. 2 and no other prior than νmax was used to avoid influencing the results.

For approximately two-thirds of the targets in our sample, we found a flicker index αg < 0.1 with large uncertainties, which means that the PSD slope in the frequency range of granulation is not clearly detected. As granulation signals are undetectable for these targets, we do not consider them in the following analysis. Our final set of targets contains 335 stars, among which we selected the 82 “best” targets, as these are bright (mv < 10, see Figs. 15), which makes the measurement of the granulation parameters (fc, fg and αg) more precise. The values for fg and αg resulting from the MCMC analyses of our 335 targets are shown as a function of the stars’ fundamental parameters10 in Figs. 16 and 17 (top panel).

In both figures, the subsample of best targets is highlighted. Moreover, we quantified the strength of any potential correlation between the fitted parameters (fg and αg) and the stellar parameters using the Pearson (ρP) and Spearman’s (ρS) coefficients (see numerical values in each panel).

As shown in Fig. 16, we observe strong correlations (i.e., |ρP |, |ρS| > 0.2) between the flicker frequency fg and the stellar mass, radius, and surface gravity. These correlations are particularly noteworthy when placing the solar fg value derived from VIRGO data on these plots (shown as star symbols). As expected, the granulation timescales decrease, that is, the flicker frequencies increase, for decreasing stellar mass and radius. This is in agreement with the work of Mathur et al. (2011), who extracted the characteristic timescales of granulation based on different Harvey law fits on Kepler red giant stars. We note that no significant correlation is observed with the stellar magnitude (|ρP|, |ρS|~ 0.1), which means that the flicker frequency can be derived independently of the HF noise level for all targets for which granulation as a whole is detectable (i.e., αg > 0.1).

In Fig. 18, we also show the characteristic frequencies fc and fg as a function of the oscillation frequency νmax that has been derived using Eq. (4). We observe clear linear correlations between these three stellar characteristic frequencies. This indicates that the typical granulation timescales, combined with the mean frequency of the acoustic modes, may be able to track stellar characteristics, such as the stellar surface gravity (Kallinger et al. 2016).

In Fig. 17 (top panel), we display the inferred power index αg resulting from the MCMC analyses as a function of the stellar parameters. As for parameter fg, we observe strong correlations between this parameter and stellar mass, radius, surface gravity, and acoustic oscillation frequency νmax (see last column). However, we also observe a significant correlation with stellar apparent magnitude. This indicates that the inferred flicker indices are influenced by the high level of HF noise, which biases the correlations seen with the stellar parameters. This is particularly evident when comparing the inferred αg with the flicker index extracted from VIRGO solar observations (which have very low HF noise σW~ 5 ppm, shownwith stars symbols). To compare the solar flicker index with Kepler observations, and to coherently interpret Kepler data, we need to find a way to combine flicker indices computed from PSDs with different white noise levels. For this purpose, we added different levels of synthetic WGN ~N(0,σW2)$\sim {\cal{N}}(0,\sigma^2_{\textrm{W}})$ with variance σW2$\sigma^2_{\textrm{W}}$ to the solar data and computed the flicker indices of these data as done for the Kepler observations. Figure 19 shows the fast decrease of the solar flicker index with increasing HF noise (σW). The black dots represent the flicker index measured on the averaged periodogram of several Sun-like stars observed by Kepler as a function of the level of HF noise (see list in Table C.1). These values illustrate that correlated noise due to granulation becomes more difficult to detect with increasing HF noise. Uncertainties on αg are directly related to the number of one-day subseries available to compute the averaged periodogram. When correcting the measured solar flicker index in Fig. 17 (star symbols) for the effect of additional WGN corresponding to that present in Kepler observations, we found a solar flicker index in better agreement with the correlations observed between the Kepler starsand the stellar parameters (see square symbols).

However, to derive the correct relation between the flicker index and the stellar parameters, we have to account for different levels of HF noise when comparing the derived flicker indices. This is the objective of the following section.

thumbnail Fig. 15

Selected Kepler targets represented as a function of their mass and radius. The color code indicates the inferred index parameters in the frequency region of granulation (αg) derived aftercorrection by the HF noise. The best targets with R2 > 0.9 are shown in color (see Sect. 4.3).

thumbnail Fig. 16

Estimated values of the cut-off flicker frequency fg as a function of stellar parameters resulting from the MCMC analyses performed on periodograms of the selected SC Kepler targets. From left to right: stellar mass, radius, effective temperature, surface gravity, and apparent magnitude. Black dots represent targets with magnitude mv ≤ 10 and gray dots targets with magnitude mv > 10. The star symbol shows the solar cut-off frequency evaluated from VIRGO data (see Sect. 2.4). Pearson (ρP) and Spearman’s (ρS) coefficients, evaluated for stars with mv ≤ 10, are indicated in each panel.

thumbnail Fig. 17

Estimated flicker index associated to granulation (αg) as a function of stellar parameters. From left to right: stellar mass, radius, effective temperature, surface gravity, apparent magnitude, and normalized νmax resulting from (4). Top: values obtained from the MCMC analyses. The color code indicates the apparent magnitude of the target: mv < 10 (black) and mv > 10 (gray). The star symbol represents the index derived for the Sun based on VIRGO green channel observations: αg = 1.26 with σW = 5 ppm. The square symbol represents the index obtained after adding a HF noise level (corresponding to that seen in Kepler observations of the Sun-like star KIC 3427720) to the VIRGO subseries. Bottom: values obtained after interpolation that corresponds to a HF level of σW = 30 ppm (see Sect. 4.3). The color code indicates here the data with R2 > 0.8 (i.e., the best fits, black) and R2 > 0.5 (blue). The triangle symbol represents the value of the flicker index derived by adding a WGN of σW = 30 ppm in solar VIRGO observations, for which we find: αg = 0.9464. This is the raw level of HF noise we expect for Sun-like stars observed with CHEOPS. Pearson (ρP) and Spearman’s (ρS) coefficients associated with these plots for the best targets (R2 > 0.9) are indicated on each panel.

thumbnail Fig. 18

Correlations between νmax (determined using Eq. (4)), the corner frequency, and the flicker frequency. The corner and flicker frequencies have been derived for each Kepler target using the MCMC analysis described in Sect. 4.2. The color code indicates the apparent magnitude of the target: mv < 10 (black) and mv > 10 (gray). Solar values derived from VIRGO observations are shown by the yellow star in each panel. Pearson (ρP) and Spearman’s (ρS) coefficients associated with these plots are indicated on each panel.

thumbnail Fig. 19

Estimated values of the flicker index as a function of the HF noise level added in the VIRGO time series (red, blue and green SPM channels). Symbols show the power index measured on the PSD of Kepler Sun-like stars listed in Table C.1.

4.3 Flicker index derived at a constant HF noise level

To correct for the influence of the HF noise level on the flicker index and derive unbiased correlations between αg and the stellar parameters, we choose to rely on interpolation techniques.

For each Kepler target, we first empirically measured the decrease of the power index as a function of added HF noise (σW). We proceed in the same way as for VIRGO solar observations (see Sect. 4.2 and Fig. 19). For each considered value σW, we added a synthetic WGN ~N(0,σW2)$\sim {\cal{N}}(0,\sigma^2_{\textrm{W}})$ to the available one-day subseries, computed the averaged periodogram, and derived the flicker index associated to the frequency region ν ∈ [fg, fc], with fg and fc the flicker cut-off frequencies of the star in question (see Sect. 4.2). An example is shown in Fig. 20 for a bright F-star (black dots). For this particular target, we measured an initial HF noise level of σW = 69 ppm that corresponds to αg = 1.23 (red dot). We then used the measured decrease to extrapolate the flicker index towards smaller values, using an exponentially decreasing function of the form: αg(σW)=a eb σW+c,\begin{equation*} \alpha_{\textrm{g}}(\sigma_{\textrm{W}}) = a ~ e^{-b~\sigma_{\textrm{W}}} &#x002B; c,\end{equation*}(5)

with parameters {a,b,c}$\{a,b,c\}\in\mathbb{R}$ to be fitted. We performed a least-square regression of our empirical curve αg (σW) to derive the coefficients {a, b, c} (gray line in Fig. 20). For this example target, we found a corrected power index of αg = 2.1 at the HF noise level of the solar VIRGO observations (i.e., σW = 5 ppm).

We performed similar least-square regressions for all targets in our sample. As expected, the quality of our interpolations depends on the initial level of HF noise that is present within the data: the higher the HF noise, the more inaccurate the interpolated flicker index at low σW. To measure the goodness-of-fit, we used the coefficient of determination (also known as r-squared coefficient R2), that gives an idea of the distance between the best fit and the observed data points (Heinisch 1962).

We disregarded all targets with a corrected power index with R2 < 0.5, leaving us with 245 Kepler targets with R2 > 0.5, among which 118 have R2 > 0.8. The corresponding parameters {a, b, c} associated with model Eq. (5) for these remaining targets are shown as a function of the stellar parameters in Fig. 21. We observe strong correlations between these parameters and the stellar parameters. The combination of these correlations with Eq. (5) allows us to estimate the flicker index we will observe for a target star observed with a given level of HF noise and make some predictions for future high-precision observations of CHEOPS and PLATO (see Sect. 5).

We then chose a reference level of σW = 30 ppm, because this is close to the HF noise level expected to be reached with CHEOPS for Sun-like stars with a magnitude mv < 8 (see following Sect. 5). The corrected indices (interpolated at the level of σW = 30 ppm) are shown as a function of the stellar parameters in the bottom panel of Fig. 17. Comparing with the raw flicker indices of the top panels, we observe more significant correlations with the stellar parameters, as expected. The Pearson and Spearman’s coefficients reveal strong positive correlations with the stellar mass and radius and negative correlation with the surface gravity, in particular when considering only the best targets (R2 > 0.9, black dots). If we include the whole Kepler sample (i.e., 0.5 < R2 < 1, gray dots), the correlations are slightly less pronounced as the flicker indices show a larger dispersion. We expect these relations to become increasingly precise with future high-precision observations of CHEOPS and PLATO.

thumbnail Fig. 20

Flicker index αg as a function of the level of HF noise (black) for the F-star KIC 7940546. The first value, computedfrom raw Kepler observations, is indicated by the red dot and corresponds to αg = 1.23 and σW = 69 ppm. The gray line shows the interpolated function (see Eq. (5)), for which we obtained a quality factor of R2 = 0.96. The interpolated index at σW = 30 ppm is αg = 1.69 (blue triangle).

5 Predictions for CHEOPS and PLATO

CHEOPS isthe first ESA S-class mission. Its objective is to characterize transiting extrasolar planets with high-precision photometric observations (Fortier et al. 2014). The passband (λ ∈ [400, 1100] nm) and high cadence (1 min) of CHEOPS will be similar to the Kepler SC observations. However, CHEOPS will mainly focus on bright stars making this instrument a very promising tool to characterize the stellar variability affecting high-precision observations. For example, Moya et al. (2018) recently analyzed the detectability of the oscillation frequency νmax on main sequence bright stars that will be observed with CHEOPS. These latter authors found that νmax will be detectable on most main sequence stars, which can help to precisely constrain the age, mass, radius, and density of the host stars, also aiding the characterization of the observed transiting planets.

In this section, we explore to what extent the upcoming missions CHEOPS and PLATO will allow us to characterize stellar granulation through the power index defined in Sect. 2. We consider flicker noise to be detectable in light curves with inferred power indices of αg > 0.2, while HF noise dominates otherwise. Our objective is to derive the limiting magnitudes (mv,lim) for which our measurements possess the necessary precision to measure at least this limiting flicker index αg,lim.

In Sect. 4, we showed the dependence of this index on the level of HF noise, which is related to the stellar apparent magnitude of the target star. We also derived the relation between the flicker indexand the level of HF noise through Eq. (5), which involves a set of parameters {a, b, c}. These parameters are correlated with the stellar mass (Ms) and radius (Rs, see Fig. 21).

In the following, we consider main sequence G- and F-type stars (as well as slightly evolved F-stars) that are known to host a convective envelop. Our set of stellar parameters {Ms, Rs} encapsulates:

  • stars with 0.8 MMs < 1.4 M and 0.8 RRs ≤ 2.5 R,

  • stars with 1.4 MMs ≤ 1.5 M and 1.7 RRs ≤ 2.5 R,

  • stars with Ms = 1.6 M that have a shallow convective envelope, though still present, at Rs = 2.1, 2.2 and 2.3 R.

We do not include stars of M and K spectral types with Ms < 0.8 M because the flicker indices of such stars were not constrained by our sample of Kepler observations (see Sect. 4). Moreover, according to stellar models based on the Code Liégeois d’Evolution Stellaire (CLES) stellar evolution code (Scuflaire et al. 2008; Fernandes et al. 2019), no sufficiently thick convective envelopes are expected for stars with masses above 1.6 M.

For each set of parameters θs := {Ms, Rs}, we first predicted the values of parameters {a, b, c} as defined in Eq. (5) using linear and quadratic functions of the stellar mass and radius. We found the following relations (see red lines in see Fig. 21): a(θs) &=0.631Ms+0.162Rs0.379,b(θs) &=0.010Ms20.025Ms+0.00062Rs20.0057Rs+0.030,c(θs) &=0.002Ms+0.029Rs0.019.\begin{equation*} \left\{ \begin{aligned} a(\theta_s) &= 0.631 M_{\textrm{s}} &#x002B; 0.162 R_{\textrm{s}} -0.379,\\ b(\theta_s) &= 0.010 M_{\textrm{s}}^2 - 0.025 M_{\textrm{s}} &#x002B; 0.00062 R_{\textrm{s}}^2 - 0.0057 R_{\textrm{s}} &#x002B; 0.030,\\ c(\theta_s) &= 0.002 M_{\textrm{s}} &#x002B; 0.029 R_{\textrm{s}} -0.019.\\ \end{aligned} \right.\end{equation*}(6)

We then derived the HF noise level (σW,up) corresponding to the limiting flicker index αg,lim using Eq. (5) with parameters {a, b, c} given by Eq. (6). For each target in our defined {Ms, Rs} grid, we came up with the highest HF noise level, σW,up, that is acceptable to observe a flicker index αgαg,lim. We then derived the limiting magnitudes mv,lim corresponding to σW,up. To do so, we computed the expected precision for CHEOPS under the assumption that the targets will be observed during times when stray light from the Earth is no more than 0.62 phot s-1 pix-1, a medium value found from simulation, accounting for all other noise sources as done in the mission’s Exposure Time Calculator11. We used a 5920 K black-body SED (similar to that of a G0 star, typical for our sample), and a time window of one hour.

However, it is important to note that the number of one-day subseries (L) also plays arole as the variance of the averaged periodogram at a given frequency ν decreases with L (and so do the uncertainties on αg).

The limiting magnitudes (i.e., those corresponding to σW,up) derived for the grid of {Ms, Rs} are shown in the left panel of Fig. 22. For solar-like stars, flicker noise will become relevant for magnitudes brighter than mv ≤ 10. This encapsulates a large fraction of the expected targets observed with CHEOPS. However, for slightly evolved F-stars, the limiting magnitude will be around 13 and therefore flicker noise is expected to be relevant in most if not all CHEOPS light curves of bright F-type stars. For these stars, precise flicker index values should be measurable.

To make predictions for the PLATO mission (Rauer et al. 2014), we use the precision estimate by Marcos-Arenal et al. (2014), who (when using 24 cameras) quote an expected noise level of 27, 34, and 80 ppm per hour for stars with mv = 10.8, 11.3 and 13, respectively. We show the limiting stellar magnitudes expected from future PLATO observations in the right panel of Fig. 22. We predict a higher impact of flicker noise in PLATO light curves, and expect that the characterization of the flicker properties (amplitudes and timescales) should be well feasible for most F and G stars.

We see through the analysis of solar observations (see Sect. 3) that flicker variability can lead to significant errors on the inferred transit parameters of the smallest planets in the case of a single (or a small number of) observed transit(s). The development of accurate noise modeling procedures will not only allow us to decrease the errors on the transit parameters, but also to strengthen our understanding of the underlying link between the noise properties and stellar physics (see Sect. 4).

thumbnail Fig. 21

Coefficients {a, b, c} involved in Eq. (5) as a function of the Kepler stellar parameters. Red curves represent the linear and quadratic functions described in Eq. (6).

thumbnail Fig. 22

Illustration of the limiting stellar apparent magnitude, depending on the stellar parameters, (radius and mass) that is needed to measure a flicker index with αg > 0.2 with the future CHEOPS (left) and PLATO (right) high-precision observations.

6 Conclusions

We present a statistical characterization of the short-timescale stellar variability associated (mainly) with granulation noise. Based on solar observations, we find this noise source to be: (i) stochastic, (ii) colored, (iii) stationary with respect to the solar cycle, and (iv) wavelength dependent. It can generate variability of several hundred parts per million in amplitude. In the relevant frequency region of the PSD, the introduced correlations can be modeled by a simple power law. We chose to use the power index resulting from fits to the PSD as an indicator of the noise correlations. We used HMI images of the Sun to create artificial transit light curves of hypothetical planets transiting the Sun, and analyzed the impact of this flicker noise on the inferred transit parameters. We showed that flicker noise is critical for the smallest planets, for which we find that the inferred parameters can be substantially offset from their true values. This however is likely due to inaccurate limb-darkening parameters which have been shown to introduce biases of the same magnitude by Espinoza & Jordán 2016.

We then turned to Kepler short-cadence observations to extract the dependence of the flicker power index on the stellar parameters. We found the inferred power index values to be heavily affected by the level of high-frequency noise (which is related to the stars’ apparent magnitude). Correcting for this influence, we observe a strong correlation between the corrected indices and the stellar radius, mass, and surface gravity. No clear correlation is observed with stellar effective temperature. These correlations confirm the already known relation between this stellar variability and the stellar properties (e.g., see Bastien et al. 2016).

Using this interpolated power index and the observed dependence with the stellar parameters, we predicted the limiting stellar apparent magnitude for which flicker noise will be characterizable with future high-precision observations of CHEOPS and PLATO. We find that the signature of this noise will be observable for most of the CHEOPS and PLATO light curves of Solar-like targets. This study highlights the need to design robust signal processing routines adapted to the characteristics of flicker noise in order to reduce errors on the inferred parameters of small exoplanets, which will be the objective of forthcoming studies.

Acknowledgements

The authors would like to thank René Heller for his peer review containing very useful suggestions, as well as D. Mary for his constructive comments. S.S., M.L., L.F. and P.C. acknowledge support from the Austrian Research Promotion Agency (FFG) under project 859 724 “GRAPPA”. V. Van Grootel is a F.R.S.-FNRS Research Associate. The VIRGO instrument onboard SoHO is a cooperative effort of scientists, engineers, and technicians, to whom we are indebted. SoHO is a project of international collaboration between ESA and NASA. This paper includes data collected by the Kepler mission. Funding for the Kepler mission is provided by the NASA Science Mission directorate.

Appendix A Validity of the artificial transit light-curve modeling

The artificial transit light curve experiment is based on solar observations and is subject to unresolved phenomena compared to true extrasolar planet transit events: the change of the solar radius over time and the variability of the number of the covered pixels by the black body sphere mimicking the planetary transit. This leads to temporal variations of the amount of masked solar surface that can introduce errors when applying traditional light curve models, such as those by Mandel & Agol (2002).

To quantify the error on the transit depth, we computed the ratio of the sum of the number of pixels covered by the exoplanet to the sum of the number of pixels of the solar disk. Both sums evolve as a function of time. We then measured this ratio in-transit (δin) and compared it to the true transit depth (δ), taken as the square of the planet radius over the solar radius. The distribution of the percentage error for each set (Rp, b) is shown in Fig. A.1.

We observe a percentage error that is <0.01% of δ for each set of parameters. This value is far smaller than the global error found for the inferred transit parameters due to the flicker noise (see Sect. 3.2). It leads to a difference between the transit model and the artificial transit without noise that is <1 ppm in-transit and <2 ppm in the ingress and egress regions of the transits. We note slightly larger differences in the ingress and egress regions due to variation of the exact number of pixels covered by the planet as a function of time.

We conclude that our experiment generating artificial transit light curves in solar observations is reliable as the temporal variability of the size of the surface area covered by the planet is not significant. This approximation of a constant planet-to-sun radius ratio is valid when oversampling the raw HMI observations by a factor of two (not shown) or more. In our experiment, we oversampled each pixel of the HMI images by a factor of five as we found this value to be a good compromise between computational cost and constant number of covered pixels over time.

thumbnail Fig. A.1

Distributions of the percentage error on the transit depth (δ) shown for the artificial transits generated using one solar time series (2018-12-10). Each panel represents the errors for a different planet size (Rp = 1, 3, 5, 7, 10 R from top to bottom, resp.) and impact parameter (see legend). The temporal evolution of the ratio of covered to uncovered numbers of pixels has been measured in transit (δin).

Appendix B Synthetic error bars added to solar HMI observation

Errors are not given for HMI observations but are necessary information to run the MCMC analyses and derive the uncertainties on the inferred transit parameters. To add synthetic error bars on our artificial light curve dataset we turned to GP (Rasmussen & Williams 2005). The GP modeling aims to roughly correct for the correlated components of the flicker noise to extract the remaining whitened scatter noise. This is a flexible noise modeling commonly used in the exoplanet community to take into accountthe correlated stochastic noise within the observations. For that purpose, we use the George package developed by Ambikasaran et al. (2015).

We chose to parametrize the noise covariance matrix in the solar observations (without artificial transit) with a product of two kernels that roughly describe the noise correlation: a constant (γ) and the Matèrn 3∕2 kernel, the latter being known to be flexible regarding unexpected local behavior of the observations and have already been used for modeling the short-timescale granulation noise (Giles et al. 2018). Explicitly, this kernel writes: k(x)=γ (1+3xl)exp(3xl),\begin{equation*} k(\mathbf{x}) = \gamma ~ \Bigg(1&#x002B;\frac{\sqrt{3}\mathbf{x}}{\ell}\Bigg) \exp{\Bigg(-\frac{\sqrt{3}\mathbf{x}}{\ell}\Bigg)}, \end{equation*}

with being the kernel’s metric and x = |titj| the data inputs associated to the ith and jth data points, respectively. The covariance matrix, K, is then: Kij=σi2 δij+k(x),\begin{equation*} \mathbf{K_{ij}} = \sigma_i^2~\delta_{ij}&#x002B;k(\mathbf{x}), \end{equation*}

with σi being the uncertainties of the observations at time i and δij being the Kronecker delta.

We performed a GP regression following the method described in Gibson et al. (2012) with the fit quality determined by minimizing the negative log-likelihood function corresponding to our GP model: log L(r)=12r K112log |K|N2log (2π),\begin{equation*} \textrm{log}~{\cal L}(\mathbf{r}) = -\frac{1}{2}\mathbf{r}^{\top}~\mathbf{K}^{-1}-\frac{1}{2}\textrm{log}~|\mathbf{K}|-\frac{N}{2} \textrm{log}~(2\pi), \end{equation*}

with r being the data residuals and N the number of data points.

An example of a GP fit is shown in Fig. B.1. For each dataset, we measured the standard deviation (σ) of the data residuals. We obtained σ = 20−30 ppm depending on the considered solar time series. We used these error bars in the MCMC simulations described in Sect. 3.2.

thumbnail Fig. B.1

Top: example of a raw solar dataset (black) and mean of the predictive distribution of the GP model (red). Bottom: residuals of the solar dataset corrected by the GP model. The standard deviation of these residuals is used as input synthetic errorbars in our artificial transit light curves.

Appendix C List of Kepler Sun-like stars

Table C.1

Studied Kepler Sun-like stars and their inferred flicker index.

References

  1. Aigrain, S., Gilmore, G., Favata, F., & Carpano, S. 2003, ASP Conf. Ser., 294, 441 [NASA ADS] [Google Scholar]
  2. Aigrain, S., Favata, F., & Gilmore, G. 2004, A&A, 414, 1139 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Ambikasaran, S., Foreman-Mackey, D., Greengard, L., et al. 2015, IEEE Trans. Pattern Anal. Mach. Intell., 38, 252 [NASA ADS] [CrossRef] [Google Scholar]
  4. Barclay, T., Endl, M., Huber, D., et al. 2015, ApJ, 800, 46 [NASA ADS] [CrossRef] [Google Scholar]
  5. Bartlett, M. S. 1950, Biometrika, 37, 1 [CrossRef] [MathSciNet] [Google Scholar]
  6. Basri, G., Walkowicz, L. M., & Reiners, A. 2013, ApJ, 769, 37 [NASA ADS] [CrossRef] [Google Scholar]
  7. Bastien, F. A., Stassun, K. G., Basri, G., & Pepper, J. 2016, ApJ, 818, 43 [NASA ADS] [CrossRef] [Google Scholar]
  8. Belkacem, K., Goupil, M. J., Dupret, M. A., et al. 2011, A&A, 530, A142 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  10. Brown, T. M., Gilliland, R. L., Noyes, R. W., & Ramsey, L. W. 1991, ApJ, 368, 599 [NASA ADS] [CrossRef] [Google Scholar]
  11. Chaplin, W. J., Bedding, T. R., Bonanno, A., et al. 2011, ApJ, 732, L5 [Google Scholar]
  12. Chiavassa, A., Pere, C., Faurobert, M., et al. 2015, A&A, 576, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Chiavassa, A., Caldas, A., Selsis, F., et al. 2017, A&A, 597, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Corsaro, E., Mathur, S., García, R. A., et al. 2017, A&A, 605, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  15. Cranmer, S. R., Bastien, F. A., Stassun, K. G., & Saar, S. H. 2014, ApJ, 781, 124 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Cubillos, P., Harrington, J., Loredo, T. J., et al. 2017, AJ, 153, 3 [NASA ADS] [CrossRef] [Google Scholar]
  17. Dorn, C., Khan, A., Heng, K., et al. 2015, A&A, 577, A83 [EDP Sciences] [Google Scholar]
  18. Dravins, D. 1988, IAU Symp., 132, 239 [NASA ADS] [Google Scholar]
  19. Espinoza, N., & Jordán, A. 2016, MNRAS, 457, 3573 [NASA ADS] [CrossRef] [Google Scholar]
  20. Fernandes, C. S., Van Grootel, V., Salmon, S. J. A. J., et al. 2019, ApJ, 879, 94 [NASA ADS] [CrossRef] [Google Scholar]
  21. Fortier, A., Beck, T., Benz, W., et al. 2014, Proc. SPIE, 9143, 91432J [Google Scholar]
  22. Fröhlich, C., Romero, J., Roth, H., et al. 1995, Sol. Phys., 162, 101 [NASA ADS] [CrossRef] [Google Scholar]
  23. Fröhlich, C., Andersen, B. N., Appourchaux, T., et al. 1997, Sol. Phys., 170, 1 [NASA ADS] [CrossRef] [Google Scholar]
  24. García, R. A., Salabert, D., Mathur, S., et al. 2013, J. Phys. Conf. Ser., 440, 012020 [NASA ADS] [CrossRef] [Google Scholar]
  25. Gibson, N. P., Aigrain, S., Roberts, S., et al. 2012, MNRAS, 419, 2683 [NASA ADS] [CrossRef] [Google Scholar]
  26. Giles, H. A. C., Osborn, H. P., Blanco-Cuaresma, S., et al. 2018, A&A, 615, L13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Gilliland, R. L., Chaplin, W. J., Dunham, E. W., et al. 2011, ApJS, 197, 6 [NASA ADS] [CrossRef] [Google Scholar]
  28. Gilliland, R. L., Chaplin, W. J., Jenkins, J. M., Ramsey, L. W., & Smith, J. C. 2015, AJ, 150, 133 [NASA ADS] [CrossRef] [Google Scholar]
  29. Harvey, J. 1985, ESA SP, 235, 256 [Google Scholar]
  30. Harvey, J. W. 1988, IAU Symp., 123, 497 [NASA ADS] [Google Scholar]
  31. Heinisch, O. 1962, Biometrische Zeitschrift, 4, 207 [CrossRef] [Google Scholar]
  32. Jiménez, A., Roca Cortés, T., & Jiménez-Reyes, S. J. 2002, Sol. Phys., 209, 247 [NASA ADS] [CrossRef] [Google Scholar]
  33. Jiménez, A., Jiménez-Reyes, S. J., & García, R. A. 2005, ApJ, 623, 1215 [NASA ADS] [CrossRef] [Google Scholar]
  34. Kallinger, T., De Ridder, J., Hekker, S., et al. 2014, A&A, 570, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. Kallinger, T., Hekker, S., García, R. A., Huber, D., & Matthews, J. M. 2016, Sci. Adv., 2, e1500654 [NASA ADS] [CrossRef] [Google Scholar]
  36. Karoff, C. 2012, MNRAS, 421, 3170 [NASA ADS] [CrossRef] [Google Scholar]
  37. Kipping, D. M. 2010, MNRAS, 408, 1758 [NASA ADS] [CrossRef] [Google Scholar]
  38. Kipping, D. M., Bastien, F. A., Stassun, K. G., et al. 2014, ApJ, 785, L32 [NASA ADS] [CrossRef] [Google Scholar]
  39. Kjeldsen, H., & Bedding, T. R. 1995, A&A, 293, 87 [NASA ADS] [Google Scholar]
  40. Lendl, M., Cubillos, P. E., Hagelberg, J., et al. 2017, A&A, 606, A18 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Lendl, M., Bouchy, F., Gill, S., et al. 2020, MNRAS, 492, 1761 [NASA ADS] [CrossRef] [Google Scholar]
  42. Li, T. 2014, Time Series with Mixed Spectra (Boca Raton, Florida: CRC Press) [Google Scholar]
  43. Mandel, K., & Agol, E. 2002, ApJ, 580, L171 [NASA ADS] [CrossRef] [Google Scholar]
  44. Marcos-Arenal, P., Zima, W., De Ridder, J., et al. 2014, A&A, 566, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Mathur, S., Hekker, S., Trampedach, R., et al. 2011, ApJ, 741, 119 [NASA ADS] [CrossRef] [Google Scholar]
  46. McQuillan, A., Aigrain, S., & Roberts, S. 2012, A&A, 539, A137 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Meunier, N., Lagrange, A. M., Borgniet, S., & Rieutord, M. 2015, A&A, 583, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Morris, B. M., Bobra, M. G., Agol, E., Lee, Y. J., & Hawley, S. L. 2020, MNRAS, 493, 5489 [NASA ADS] [CrossRef] [Google Scholar]
  49. Moya, A., Barceló Forteza, S., Bonfanti, A., et al. 2018, A&A, 620, A203 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  50. Muller, R., Hanslmeier, A., Utz, D., & Ichimoto, K. 2018, A&A, 616, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Nesis, A., Hammer, R., Roth, M., & Schleicher, H. 2002, A&A, 396, 1003 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Nordlund, Å., Stein, R. F., & Asplund, M. 2009, Liv. Rev. Sol. Phys., 6, 2 [Google Scholar]
  53. Oshagh, M. 2018, Asteroseismology and Exoplanets: Listening to the Stars and Searching for New Worlds (Berlin: Springer), 49, 239 [NASA ADS] [CrossRef] [Google Scholar]
  54. Pallé, P. L., et al. 1999, ASP Conf. Ser., 173, 297 [NASA ADS] [Google Scholar]
  55. Pande, D., Bedding, T. R., Huber, D., & Kjeldsen, H. 2018, MNRAS, 480, 467 [NASA ADS] [CrossRef] [Google Scholar]
  56. Pereira, F., Campante, T. L., Cunha, M. S., et al. 2019, MNRAS, 489, 5764 [NASA ADS] [CrossRef] [Google Scholar]
  57. Ploner, S. R. O., Solanki, S. K., & Gadun, A. S. 2000, A&A, 356, 1050 [NASA ADS] [Google Scholar]
  58. Rasmussen, C. E., & Williams, C. K. I. 2005, Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) (Cambridge, MA: The MIT Press) [Google Scholar]
  59. Rauer, H., Catala, C., Aerts, C., et al. 2014, Exp. Astron., 38, 249 [NASA ADS] [CrossRef] [Google Scholar]
  60. Ricker, G. R., Winn, J. N., Vanderspek, R., et al. 2015, J. Astron. Teles. Instrum. Syst., 1, 014003 [Google Scholar]
  61. Salabert, D., Garcia, R. A., Palle, P. L., & Jimenez, A. 2011, J. Phys. Conf. Ser., 271, 012030 [CrossRef] [Google Scholar]
  62. Salabert, D., García, R. A., Jiménez, A., et al. 2017, A&A, 608, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  63. Savitzky, A., & Golay, M. J. E. 1964, Anal. Chem., 36, 1627 [NASA ADS] [CrossRef] [Google Scholar]
  64. Schou, J., Scherrer, P. H., Bush, R. I., et al. 2012, Sol. Phys., 275, 229 [Google Scholar]
  65. Scuflaire, R., Théado, S., Montalbán, J., et al. 2008, Ap&SS, 316, 83 [NASA ADS] [CrossRef] [Google Scholar]
  66. Seleznyov, A. D., Solanki, S. K., & Krivova, N. A. 2011, A&A, 532, A108 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  67. Shapiro, S. S., & Wilk, M. B. 1965, Biometrika, 52, 591 [Google Scholar]
  68. Simon, M. K. 2006, Probability Distributions Involving Gaussian Random Variables: A Handbook for Engineers, Scientists and Mathematicians (Berlin, Heidelberg: Springer-Verlag) [Google Scholar]
  69. Stein, R. F., & Nordlund, A. 1989, ApJ, 342, L95 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  70. Svensson, F., & Ludwig, H.-G. 2005, ESA SP, 560, 979 [NASA ADS] [Google Scholar]
  71. Title, A. M., Tarbell, T. D., Topka, K. P., et al. 1989, ApJ, 336, 475 [NASA ADS] [CrossRef] [Google Scholar]

1

Higher instrumental ageing affects the blue channel detector.

2

The definition based on Eq. (1) differs slightly from Bartlett (1950), who proposed to split a given time series X(t) into L subseries X(t).

3

In the following, we use the Greek letter “ν” for the Fourier frequencies in general and the Latin letter “f” for pointing to a characteristic frequency of νk (except for the p-modes frequency, νmax).

4

Useful notes: “Gaussian” and “colored” are unrelated properties. A random process is Gaussian if it is normally distributed (i.e., Gaussian probability distribution function). If the process is white, the covariance matrix is the Identity matrix (and the PSD is constant); if it is colored, the covariance matrix is not Identity (and the PSD is not constant in frequency). (Second-order) Stationarity means that second-order statistics (i.e., the correlation structure) do not change in time.

5

We made use of the function “convolution.Box1DKernel” available from the Python package http://www.astropy.org

6

We note that the existence of intermediate scales of convective motions (<3 × 107 m), known as mesogranulation, is controversial even for the Sun (Stein & Nordlund 1989; Ploner et al. 2000).

8

We note that with this oversampling, a synthetic Earth-like planet disk corresponds to 23 200 (oversampled) HMI pixels (compared to 928 without oversampling). For comparison, a typical convection cell size (1 Mm) is roughly 13 times smaller than the Earth diameter (12.7 Mm).

10

Taken in the Kepler_stellar17.csv.gz catalog, https://archive.stsci.edu/kepler/catalogs.html

All Tables

Table 1

Average values of parameters {α, β} corresponding to Eq. (2) for regions A, B, C, and D and for observations in the three SPM channels.

Table C.1

Studied Kepler Sun-like stars and their inferred flicker index.

All Figures

thumbnail Fig. 1

Yearly RMS of the VIRGO time series for the blue, green, and red SPM channels (see colors). Solar minima at the beginning ofcycles 23 (from 1996 to 2008) and 24 (since 2009) are clearly visible.

In the text
thumbnail Fig. 2

Averaged periodograms computed with Eq. (1) using all one-day subseries available during a minimum (2008, solid dashed lines) and a maximum (2003, solid solid lines) of the solar cycle. Each color indicates the observation of one SPM channel. Differences with the solar cycle are observed at frequencies ν < fl (i.e., at periods >6.2 h). The peak at ν = 5570 μHz affecting the PSD of the blue and green SPM data is an electronic artifact related to the calibration period used by the acquisition system of VIRGO. The gray shaded area indicates the frequency range corresponding to the typical duration of planetary transit (from ~ 25 min to several hours).

In the text
thumbnail Fig. 3

Example of a one-day VIRGO time series (top) detrended (bottom) using a SG filter with a window size of 15 h (colored solid lines). From left to right: dataset from the blue, green, and red SPM channels.

In the text
thumbnail Fig. 4

From left to right: empirical distribution of the RMS, amplitude range, and F8 measurements evaluated on a set of one-day VIRGO subseries taken during a solar-cycle minimum (2008, colors are related to the three SPM channels). The empirical distributions obtained during a solar-cycle maximum (2003) are shown by the black contours. Distributions derived using the HMI data described in Sect. 3 are shown by the yellow histograms.In each panel, the vertical dashed line indicates the transit depth expected for an Earth analogue (84 ppm).

In the text
thumbnail Fig. 5

Effect of data binning on the RMS of one-day solar subseries (1996, black), on the synthetic times series generated with Eq. (3) (yellow), and on synthetic subseries of WGNs (blue). The RMS of the WGN series decreases as Nbin/N$\sqrt{N_{\textrm{bin}}/N}$ (light blue). Solid lines indicate the median values of the observed dispersion (shaded areas). The RMS of the time series at large bin sizes shows a significant dispersion as the number of data points decreases. Horizontal dotted lines indicate the fraction of a typical Earth-like transit depth (i.e., 84 ppm).

In the text
thumbnail Fig. 6

Estimation of the splitting frequencies. Main panel: L = 364 ACFs corresponding to the available solar one-day subseries observed in 1996 with the blue SPM channel (gray lines) and their mean ACF (black). The mean ACF allows the automatic derivation of the upper and lower frequencies surrounding the acoustic mode regime {fh, fc }. Inset panel: logarithm representation of the associated PSD estimates computed using Eq. (1). The PSD allows us to localize the flicker frequency {fg }. The lower frequency {f} is set by the window length of the SG filter applied to each one-day subseries to remove the long-term variability. In both panels,the splitting frequencies are indicated by the dotted vertical red lines. The blue dotted line represents the slope of the PSD (see Eq. (2)) measured in the frequency region associated with the granulation regime ν ∈ [fg, fc]. From high to low frequency, the distinct regions are denoted by letters A to D.

In the text
thumbnail Fig. 7

Evolution of the inferred power index estimated on averaged periodograms of one-day solar subseries. From left to right: index evaluated for (a) the high-frequency region, (b) the p-modes region, (c) the granulation region, and (d) the lowest frequency region. Each color corresponds to an SPM channel. In region (c), the index obtained on the HMI solar observations discussed in Sect. 3.1 is shown by the horizontal black line (with the 1σ uncertainties in gray). This index results from the averaged periodogram computed using the 91 one-day subseries that have been selected during the period of a solar minimum (2017–2019).

In the text
thumbnail Fig. 8

Top: example of a raw solar light curve extracted from HMI images (black) and the corresponding smooth detrending function based on SG filters with a window size of 5 h (red). An artificial transit of a 5 R planet crossing the center of the solar disk (b = 0) is shown in gray. Bottom: residuals of the raw solar light curve after correcting by the SDO satellite motion.

In the text
thumbnail Fig. 9

Example of artificial exoplanet transit light curves generated using solar HMI observations (quiet Sun, 2018-12-10). Each panel shows five transits for planets with sizes ranging from 1 to 10 R (see legend). Panels from left to right: different orbit configurations (see the panels’ header).

In the text
thumbnail Fig. 10

Top: example of artificial transit of an Earth-sized planet crossing the disk center of the Sun (b = 0, black). The transit model with the true input parameters is shown in green and the model computed using the inferred parameters in red. The error on RpRs is around 2% in this example. Bottom: residuals based on the inferred transit model.

In the text
thumbnail Fig. 11

Distribution of the absolute percentage error on the planet-to-star radius ratio inferred from our simulated transits for planets with sizes Rp = [1, 3, 5] R (left to right) and impact parameters b = 0 (top) and b = 0.8 (bottom). Inferred errors on radius ratios derived from light curves containing only WGN are shown in red.

In the text
thumbnail Fig. 12

Planet-to-star radius ratio and 1σ uncertainties inferred from the MCMC analyses performed on the artificial light curves of exoplanets of size Rp = 1 R (left) and Rp = 10 R (right). The case of b = 0 is shown in black and b = 0.8 in red. The true radius ratios are indicated by the horizontal dotted lines.

In the text
thumbnail Fig. 13

From top to bottom: distribution of the absolute percentage errors on td, T0, and b for the whole set of artificial transit light curves.

In the text
thumbnail Fig. 14

Example of an averaged periodogram of the Kepler target KIC 7940546 (Ms = 1.152 M, Rs = 1.807 R, Teff = 6244 K, mv = 7.397). The number of available one-day subseries for this target is L = 75. Cut-off frequencies fc and fg resulting from the MCMC analysis are indicated by the vertical dashed lines.

In the text
thumbnail Fig. 15

Selected Kepler targets represented as a function of their mass and radius. The color code indicates the inferred index parameters in the frequency region of granulation (αg) derived aftercorrection by the HF noise. The best targets with R2 > 0.9 are shown in color (see Sect. 4.3).

In the text
thumbnail Fig. 16

Estimated values of the cut-off flicker frequency fg as a function of stellar parameters resulting from the MCMC analyses performed on periodograms of the selected SC Kepler targets. From left to right: stellar mass, radius, effective temperature, surface gravity, and apparent magnitude. Black dots represent targets with magnitude mv ≤ 10 and gray dots targets with magnitude mv > 10. The star symbol shows the solar cut-off frequency evaluated from VIRGO data (see Sect. 2.4). Pearson (ρP) and Spearman’s (ρS) coefficients, evaluated for stars with mv ≤ 10, are indicated in each panel.

In the text
thumbnail Fig. 17

Estimated flicker index associated to granulation (αg) as a function of stellar parameters. From left to right: stellar mass, radius, effective temperature, surface gravity, apparent magnitude, and normalized νmax resulting from (4). Top: values obtained from the MCMC analyses. The color code indicates the apparent magnitude of the target: mv < 10 (black) and mv > 10 (gray). The star symbol represents the index derived for the Sun based on VIRGO green channel observations: αg = 1.26 with σW = 5 ppm. The square symbol represents the index obtained after adding a HF noise level (corresponding to that seen in Kepler observations of the Sun-like star KIC 3427720) to the VIRGO subseries. Bottom: values obtained after interpolation that corresponds to a HF level of σW = 30 ppm (see Sect. 4.3). The color code indicates here the data with R2 > 0.8 (i.e., the best fits, black) and R2 > 0.5 (blue). The triangle symbol represents the value of the flicker index derived by adding a WGN of σW = 30 ppm in solar VIRGO observations, for which we find: αg = 0.9464. This is the raw level of HF noise we expect for Sun-like stars observed with CHEOPS. Pearson (ρP) and Spearman’s (ρS) coefficients associated with these plots for the best targets (R2 > 0.9) are indicated on each panel.

In the text
thumbnail Fig. 18

Correlations between νmax (determined using Eq. (4)), the corner frequency, and the flicker frequency. The corner and flicker frequencies have been derived for each Kepler target using the MCMC analysis described in Sect. 4.2. The color code indicates the apparent magnitude of the target: mv < 10 (black) and mv > 10 (gray). Solar values derived from VIRGO observations are shown by the yellow star in each panel. Pearson (ρP) and Spearman’s (ρS) coefficients associated with these plots are indicated on each panel.

In the text
thumbnail Fig. 19

Estimated values of the flicker index as a function of the HF noise level added in the VIRGO time series (red, blue and green SPM channels). Symbols show the power index measured on the PSD of Kepler Sun-like stars listed in Table C.1.

In the text
thumbnail Fig. 20

Flicker index αg as a function of the level of HF noise (black) for the F-star KIC 7940546. The first value, computedfrom raw Kepler observations, is indicated by the red dot and corresponds to αg = 1.23 and σW = 69 ppm. The gray line shows the interpolated function (see Eq. (5)), for which we obtained a quality factor of R2 = 0.96. The interpolated index at σW = 30 ppm is αg = 1.69 (blue triangle).

In the text
thumbnail Fig. 21

Coefficients {a, b, c} involved in Eq. (5) as a function of the Kepler stellar parameters. Red curves represent the linear and quadratic functions described in Eq. (6).

In the text
thumbnail Fig. 22

Illustration of the limiting stellar apparent magnitude, depending on the stellar parameters, (radius and mass) that is needed to measure a flicker index with αg > 0.2 with the future CHEOPS (left) and PLATO (right) high-precision observations.

In the text
thumbnail Fig. A.1

Distributions of the percentage error on the transit depth (δ) shown for the artificial transits generated using one solar time series (2018-12-10). Each panel represents the errors for a different planet size (Rp = 1, 3, 5, 7, 10 R from top to bottom, resp.) and impact parameter (see legend). The temporal evolution of the ratio of covered to uncovered numbers of pixels has been measured in transit (δin).

In the text
thumbnail Fig. B.1

Top: example of a raw solar dataset (black) and mean of the predictive distribution of the GP model (red). Bottom: residuals of the solar dataset corrected by the GP model. The standard deviation of these residuals is used as input synthetic errorbars in our artificial transit light curves.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.