Open Access
Issue
A&A
Volume 644, December 2020
Article Number A77
Number of page(s) 24
Section Stellar atmospheres
DOI https://doi.org/10.1051/0004-6361/202038710
Published online 02 December 2020

© N. Meunier et al. 2020

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1 Introduction

Stellar activity affects all indirect techniques (except for those that are based on microlensing) that are used to detect exoplanets, that is, radial velocity (RV), photometric transits, and astrometry. RV is most strongly affected because the convective blueshift is inhibited in plages (Meunier et al. 2010; Haywood et al. 2016) and because several velocity fields are present at various scales, such as granulation, supergranulation, and meridional flows (Meunier et al. 2015; Meunier & Lagrange 2019a, 2020a,b). In addition, the contrast of spots and plages also affects transits and astrometry (Saar & Donahue 1997; Hatzes 2002; Saar et al. 2003; Wright 2005; Desort et al. 2007; Lagrange et al. 2010; Meunier et al. 2010, 2019a; Boisse et al. 2012; Dumusque et al. 2014; Borgniet et al. 2015; Dumusque 2016). Because of the nature of the photometric transits and their typical timescales, stellar activity mostly affects the transit characterisation of the radius and atmosphere (e.g. Silva 2003; Pont et al. 2008; Chiavassa et al. 2017) and not detectability. It is difficult, however, to reach long orbital periods, which would allow detecting Earth-like planet in the habitable zone (HZ) of solar-type stars, for example, with transits: this is one of the main goals of the PLAnetary Transits and Oscillations of stars (PLATO) mission (Rauer et al. 2014), which will be launched in 2026. Only transiting planets will be detected by PLATO, however, that is, a very small fraction of existing planetary systems. Furthermore, their mass will have to be estimated from RV follow-ups, which is expected to be difficult given the stellar activity impact. On the other hand, astrometry is much less affected by stellar activity than RV (Makarov et al. 2010; Lagrange et al. 2011). Gaia is sensitive only to very massive planets (typically sub-Neptunes), which produce a signal far above the stellar activity for main-sequence stars. The main technique used so far to search for planets in the habitable zone around solar-type stars is therefore the RV technique (intermediate between photometric transits and direct imaging in terms of orbital periods) and it allows follow-ups (microlensing techniques are not suitable for this purpose), but so far, no Earth-like planet in the habitable zone around solar-type stars has been detected because of stellar activity. The low impact of stellar activity on the astrometric signal compared to radial velocity is therefore one of the reasons why high-precision astrometric space missions have been proposed to detect low-mass planets around stars in our neighbourhood (Léger et al. 2015; Janson et al. 2018), such as the Space Interferometry Mission (SIM; e.g. Shao et al. 1995; Svensson & Ludwig 2005; Catanzarite et al. 2008; Makarov et al. 2009), the Nearby Earth Astrometric Telescope (NEAT; Malbet et al. 2012; Crouzier et al. 2016), or Telescope for Habitable Exoplanets and Interstellar/Intergalactic Astronomy (THEIA; Theia Collaboration 2017), although they present a huge technological challenge (e.g. Malbet et al. 2019).

The effect of stellar activity on the astrometric signal has first been considered with a single-spot model (Bastian & Hefele 2005; Reffert et al. 2005) and simple spot configurations (Eriksson & Lindegren 2007). A statistical model of the solar signal has later been developed by Catanzarite et al. (2008) in the context of the SIM mission, followed by a realistic reconstruction of the solar signal over a cycle by Makarov et al. (2009), with spots only, and by Makarov et al. (2010) and Lagrange et al. (2011), including both spots and plages. These two studies, performed for the Sun seen edge-on, agree with each other, and the latter, for example, predicts a root-mean-square (rms) of the signal over solar cycle 23 of 0.07 μas in the X direction (i.e. along the equatorial plane) and 0.05 μas in the Y direction (i.e. along the rotation axis), for a Sun at 10 pc. This is much smaller than the Earth signal (0.3 μas), and it was concluded that stellar activity was not an obstacle for detecting low-mass planets in the habitable zone around solar-type stars. It is indeed compatible with the fact that the relative contribution of the planetary signal with respect to the stellar signal is stronger in astrometry than in radial velocities. Theia Collaboration (2017) therefore used the solar value of 0.07 μas from Lagrange et al. (2011), although they did not appear to scale it with the distance. No variability with spectral type was considered either. Finally, the effect of granulation is very small (Svensson & Ludwig 2005) compared to the contribution from magnetic activity.

It is therefore necessary to study the effect of stellar activity on astrometry in more detail. First, only the Sun seen edge-on was modelled in a realistic way. The large number of simulations now available for F6-K4 stars and all stellar inclinations in Meunier et al. (2019a), hereafter Paper I, also allows estimating this effect more realistically. The RV and photometric time series obtained from these simulations were analysed in Meunier & Lagrange (2019b,c) and in Meunier et al. (2019b). We focus here on stellar activity, even though more effects also need to be considered for a complete model of the astrometric signal (Sozzetti et al. 2002; Sozzetti 2005; Eisner & Kulkarni 2001; Traub et al. 2010), such as parallaxes, the proper motion of the star, or the presence of additional more massive planets and interaction between them, which are neglected here. We take the instrumental noise into account.

Our main objective in this paper is therefore to quantify with a systematic approach the magnetic activity signal for a wide range of old main-sequence F-G-K stars and determine how this signal compares to a signal from an Earth-like planet (i.e. in terms of mass) in the habitable zone around such stars. We also characterise in more detail the expected activity signal as a function of spectral type and inclination, as a reference for other applications and to determine whether astrometry may be used to characterise stellar activity. The outline of the paper is as follows. In Sect. 2 we describe how we model stellar activity and the planetary signal, and then explain the methods we implemented to assess exoplanet detectability. We describe the properties of stellar activity seen in astrometry in Sect. 3. The effect of stellar activity on exoplanet detectability is then analysed in detail in Sect. 4, first with a simple method based on the signal to noise ratio (S/N), then based on a theoretical false-positive level due to stellar activity, and finally based on standard tools that are used to analyse observations, in particular, blind tests. Additional blind tests and comparisons of different techniques are presented in the appendix. We conclude in Sect. 5.

2 Methods

In this section, we present the stellar activity simulations, how we model the planets, and the considered instrumental configuration. The exoplanet detectability criteria used in the analysis are then discussed.

2.1 Modelling stellar activity

We have developed a model for the Sun, described in detail in Borgniet et al. (2015), and extended it to solar-type stars in Paper I. The model produces spots, plages, and the magnetic network in a consistent way. Several observables, such as radial velocity, photometry, chromospheric emission (converted into logRHK$\log R'_{\textrm{HK}}$ to characterise the activity level), and astrometry are produced, which provides long time series that cover one to three cycles, depending on the stars. Some input parameters depend on spectral type, such as the activity dependence, the rotation rate, or the contrasts, while others (sizes and latitude coverage) are fixed to the solar values. The latitudinal extent is chosen as in Paper I to be similar to that of the Sun or with a maximum latitude θmax of the activity pattern higher than the solar one by 10° or 20°. We refer to Paper I for more details.

As in Paper I, the activity levels are restricted to stars with an average logRHK$\log R'_{\textrm{HK}}$ below −4.5 for the most massive (F6) and below −4.85 for the less massive stars (K4), which corresponds to the plage-dominated regime of Lockwood et al. (2007). The rotation rates were then deduced from activity-rotation relationships, increasing from a few days (F6 stars) to 30–70 days (K4). When the activity-rotation-age relationship of Mamajek & Hillenbrand (2008) is assumed, they correspond to ages in the range of 0.5–3 Gyr for the most massive and 4–10 Gyr for the less massive stars. The plage contrasts are from Norris (2018), while two laws were considered for spots: a lower bound, ΔTspot1, equal to thesolar contrast used in Borgniet et al. (2015), that is, 605 K, and an upper bound law, ΔTspot2, depending on Teff (Berdyugina 2005), that is, 0.75 × Teff − 2250 K. We assumedthat actual star spots have contrasts within this range.

A total of 22 842 (11 421 for each spot contrast) time series were generated, corresponding to different sets of parameters. Each of these time series was produced for ten inclinations between edge-on and pole-on configurations. The astrometry time series correspond naturally to two time series, one for the X direction (i.e. along the equatorial plane) and the other for the Y direction (i.e. along the rotation axis).

2.2 Modelling the planet

In this section we describe how we modelled the planetary signal and the relevant parameters. For simplicity, we considered only circular orbits, with an amplitude (in μas) α equal to 3 Mpla apla Mstar1$M_{\textrm{star}}^{-1}$ Dstar1$D_{\textrm{star}}^{-1}$ as in Theia Collaboration (2017), where the stellar mass is given in solar mass and the planet mass in Earth mass. The planetary mass is often chosen to be 1 MEarth, although other values are considered in some computations.

The semi-major axis apla (in astronomical units) corresponds to the habitable zone, which we chose as in Meunier & Lagrange (2019b), depending on the spectral type (Kasting et al. 1993; Jones et al. 2006; Zaninetti 2008). We chose a simple classical definition following Kasting et al. (1993) to estimate the range of distances where liquid water could be present on the surface of the planet, and taking only luminosity effects into account: the inner side corresponds to a runaway greenhouse effect, which would imply the evaporation of all the surface water, and the outer side is the maximum distance corresponding to a temperature of 273 K in a cloud-free CO2 atmosphere. This is a conservative range because additional effects could widen it (see the discussion in Jones et al. 2006). We focused on three orbital periods for each spectral type: the inner side of the habitable zone (PHZin), the middle of the habitable zone (PHZmed), and the outer side of the habitable zone (PHZout). This corresponds to periods between 409 and 1174 days for F6 stars and between 179 and 501 days for K4 stars.

The stellar mass (in solar mass) follows the laws adopted in Paper I and Meunier & Lagrange (2019b). The distance Dstar is usually 10 pc, unless mentioned otherwise.

A few computations were made for a star seen edge-on only, with the orbital plane of the planet assumed to be equal to the equatorial plane of the star. This is described in Appendix A. However, for most results presented in this paper, various stellar inclinations were considered, and the planetary orbit inclination followed two different assumptions that are detailed in Appendix A.

2.3 Observing parameters

The stellar activity signal (Sect. 3) was characterised based on the full time series described in the previous section. The time series cover 3327 to 5378 days (typically 9–15 yr) depending on the simulation, with one point per night and no gap. All other computations that are studying the effect on exoplanet detectability were performed using the configuration proposed in the THEIA mission proposal (Theia Collaboration 2017): they cover 3.5 yr with 50 visits. The sampling was chosen randomly from a list of 100 random samplings, and they were also phased randomly within the cycle. For each activity simulation, one of those samplings was therefore chosen randomly, shifted randomly, and applied to this particular time series. We also added 0.199 μas noise level on each data point following the THEIA proposal. Most computations were made for a star at 10 pc: this distance affects both the activity signal and the amplitudes of the planetary signal.

No proper motion of the star was included here because we focus on stellar activity. The time series we considered are therefore (1) the activity time series alone over the full sample to characterise stellar activity in astrometry (Sect. 3), and (2)the sum of the activity time series, the planetary signal (whenever pertinent), and the Gaussian instrumental noise for the THEIA configuration to characterise exoplanet detectability (Sect. 4).

2.4 Principle of the analysis

In this section, we describe how we quantify planet detectability. We use different approaches (theoretical and observational false positive levels) and criteria (frequential and temporal analysis).

2.4.1 Signal-to-noise ratio approach

Traditionally, χ2 probabilities have been used to assess the presence of a planetary signal in an astrometric time series (Sozzetti et al. 2002; Sozzetti 2005; Ford 2004; Marcy et al. 2005). This is not applicable here, however, because stellar activity contributes, which could be as significant as the planetary signal. The probabilities that indicate that a significant signal is present therefore cannot be used to indicate the likely presence of a planet. A first simple approach we considered was to define a criterion based on the signal to noise (S/N) ratio (Casertano & Sozzetti 1999; Sozzetti et al. 2002; Sozzetti 2005; Eriksson & Lindegren 2007; Traub et al. 2010), where the signal S is equal to α, the amplitude of the planetary signal, and the noise N is the of the signal due to activity and instrumental noise, which can easily be computed for each simulation (in both directions and then combining the two). A threshold on S/N defines a detection limit (i.e. a value of α that can be converted into a planetary mass) corresponding to that threshold. This method is very easy to apply, but it has some drawbacks: the S/N is computed globally, while in reality, the planetary and stellar activity signals have different frequential signatures, and the S/N of a peak in a periodogram, for example, may be more relevant than the global one. We discuss this point in Sect. 4.2.5. Furthermore, the method does not allow us to estimate the level of false positives to which this detection limit corresponds. We apply this technique in Appendix D for comparison purposes with previous works using this approach and to determine how it compares to more sophisticated approaches.

2.4.2 Detectability criteria from a theoretical point of view

In a second step, we took the temporal variability of the time series and their frequency distribution into account. We also wish to be able to estimate a level of false positives to which the signal caused by a planet can be compared. For this purpose, we considered two points of view, theoretical (detailed here), and observational (next Sect. 2.4.3). Here, we first directly estimate the level of false positives from our activity synthetic time series (without a planet). For a given criterion (either from a frequential or temporal analysis), and assuming that our simulations correspond to a perfect knowledge of the stellar contribution, these simulations allow us to estimate the level corresponding to 1% of the false positives, for example.

After adding a planet, we can compare the values based on the same criterion and estimate the percentage of cases for which the presenceof the planet (of a certain mass and orbital period) leads to a signal that is stronger than this false-positive level: this gives the detection rate for this planetary mass. This approach was used, for example, by Eisner & Kulkarni (2001). The two criteria and details of the computation are described in Appendix B. In this approach, we use the term “false-positive level” or “fp”, which is not to be confused with the “false-alarm probability” (see next section), which we also use in this paper. In the following, the fp level always refers to a realistic level corresponding to the stellar signal alone, based on our simulations.

2.4.3 Detectability criteria from an observational point of view

Our second point of view is observational. When a given star is observed, our whole set of simulations is not necessarily representative of that particular star: as a consequence, the theoretical fp level may not be correct for this particular star, and the false positive level might be desired to be estimated directly from the time series, which is in fact what is usually done. A usual way to do this is to compute a false alarm probability (FAP): we performed N realisations of a bootstrap of the time series, and computed the periodogram for each of these N realisations:the maximum value in the periodogram was computed, leading to N values. The 1% (e.g.) highest values provide the 1% FAP level. The amplitude of the periodogram of the original time series is compared to the FAP: if it is higher than the FAP, then a significant signal is considered to be detected, with a FAP of 1%. In the following, we use the term “FAP” for this definition only. This approach makes the strong assumption that the signal, except for the planet, follows a white-noise distribution with the same rms as that of the original time series. This is of course not a good assumption when the activity signal of the star is also present because it is not white noise (and it also includes a contribution of the planetary signal we wish to detect if present), and this will be discussed. We chose, however, to use this method because it is widely used, and we show some of its limitations in the following. To estimate the detection rates when we used this approach, we also performed ablind test, where the detections based on an FAP criterion were automatically compared with what was injected (planet and its properties, or no planet). The corresponding results are presented in Sect. 4.2.

To test this approach, we computed the FAP for a subset of simulations and compared it with the true fp level determined above in Appendix E. Another approach to compute detection limits from an observed time series has been proposed in Meunier et al. (2012) for the radial velocity, but it can be applied to astrometric time series as well. This method, called the local power analysis (LPA), is based on the comparison between the power that would be due to a planet at a given orbital period and the maximum power in a restricted range around that orbital period. We characterise this approach and the corresponding exclusion rates in Appendix E.

3 Properties of the activity time series

We study the statistical properties of the activity time series as a function of spectral type, average activity level, and inclination. Synthetic time series are also obtained for other observables such as photometry and RV (Paper I) and are compared to the astrometric signal.

3.1 Dependence on spectral type and activity level

Figure 1 shows the rms of the astrometric time series separately in the X and Y directions versus spectral type (first column) and versus the average logRHK$\log R'_{\textrm{HK}}$ (second column) for the different inclinations and ΔTspot values. There is a trend for an increase of the rms toward higher stellar masses and naturally toward more active stars, although there is a very large dispersion because for a given spectral type and average activity level, stars with a wide range of long-term variability are observed. The rms is higher on average for K4 than for adjacent spectral types because quiet stars are lacking in this domain, as shown in the grid of parameters chosen in BV and logRHK$\log R'_{\textrm{HK}}$ in Paper I for these simulations (see their Fig. 3). The rms tends to be slightly higher for the edge-on configuration than for thepole-on configuration for the X direction, while this is the opposite for the Y direction. For ΔTspot1, values can be up to twice as high as the values derived for the Sun seen edge-on by Lagrange et al. (2011), with a maximum of about 0.15 μas in the X direction and 0.22 μas in the Y direction. For G2 stars, the solar value is close to the average in the X direction, which is expected because in our range of parameters the Sun is also close to the average. For the Y direction, it is closer to the lower bound: the reason may be thattwo-thirds of the simulations are made with a higher latitudinal extent of the activity pattern (see θmax in Sect. 2.1) because the rms in the Y direction is sensitive to this parameter, with higher rms for high θmax values. For the upper boundary of the spot contrast, however, values are higher, up to ~0.3 μas, that is, comparable with the Earth signal. A more detailed comparison between the behaviour as a function of inclination and between the X and Y directions can be found in Appendix C, as well as the difference between the spot and plage contributions. We conclude that high inclinations are associated with a stronger variability in the X direction. The rms due to plages is of the same order of magnitude as the spot contribution when computed with ΔTspot1.

thumbnail Fig. 1

Rms of the activity time series in both directions for (from upper to lower panel) the X direction and ΔTspot1, the Y direction and ΔTspot1, the X direction and ΔTspot2, and the Y direction and ΔTspot2. Left column:rms of the astrometric signal vs. spectral type for the ten inclinations (from yellow for pole-on configurations to blue for edge-on configurations), and one simulation out of four for clarity (small dots) and binned in spectral type (circles). The horizontal dashed line is the solar value from Lagrange et al. (2011). Right column: same vs. logRHK$\log R'_{\textrm{HK}}$.

3.2 Relation between astrometry and other variables

The average rms astrometric signal (separately in the X and Y directions) versus other variables (photometry, radial velocity, and plage filling factor) is shown in Fig. 2. The dispersion around these averages is significant (this is due to the different spectral types and activity range covered by the simulations). A more detailed comparison between astrometry and photometry is shown in Appendix C. We conclude that for the given conditions (inclinations and spot contrast) the relation with the variability deduced from other variables is linear, with a trend of saturation at high activity levels.

thumbnail Fig. 2

Upper panels: average rms of the astrometric signal in the X direction vs. rms of photometric signal (left), rms of RV signal (middle), and average filling factor (right), for the ten inclinations (colour code as in Fig. 1) and all spectral types. Solid curves represent ΔTspot1 simulations, and dashed lines represent ΔTspot2 simulations. Lower panels: same for the Y direction.

4 Effect of activity on exoplanet detectability

In this section, we study the effect of the stellar signal characterised in Sect. 3 on exoplanet detectability. The detectability based on a simple estimation of the S/N is presented in Appendix D. We analyse here the results obtained with the different approaches described in Sect. 2.4, first based on theoretical fp estimated from our knowledge of stellar activity, and then using blind tests representative of an observational approach.

4.1 Detectability based on true false positive levels

In this section, we consider the planetary orbit described by Eqs. (A.4)–(A.9) and a distribution of the angle Ψ between the orbital plane and the equatorial plane of the star described in Sect. 2.2 and Appendix A. The detections rates are first considered for a 1 MEarth, and then aniteration on the mass allows us to estimate detection limits. The orbital period takes one of the three values (inner side, middle, and outer side of the habitable zone for each spectral type). The true fp levels derived from the whole set of synthetic time series are used, which are either constant for a given spectral type or depend on stellar variability or inclination, as described in Sect. 2.4.2 and Appendix B.

4.1.1 Periods and masses

Figure 3 shows the distribution of the departure from the true value for the orbital period (period of the peak in the periodogram) and for the planet mass for the different configurations. All spectral types are first combined together (upper panels), and the dependence of the distribution width and of the average on spectral type is shown in the middle and lower panels. The uncertainties on the periods are larger for the outer side of the habitable zone than at the inner side, and lower for low-mass stars, with values of a few days and typically up to 50 days. The uncertainties on the masses are typically between 10 and 20% (at the 1σ level), and are higher for the inner side of the habitable zone. There is no significant bias on the period, but the mass is systematically overestimated by about 2–12% depending on the orbital period and spot contrast (which directly controls the rms of the stellar activity signal, as shown in Sect. 3).

4.1.2 Detection rates

The same protocol allows us to compute the detection rates based on the false positive levels described in Sect. 2.4.2. We first assumed a constant level for all simulations corresponding to a given spectral type and spot contrast (i.e. all activity levels and inclinations). The results are shown in Fig. 4 for a detection criterion based on the planet mass and for a detection criterion on the power in the periodogram (i.e. temporal and frequential approach, see Sect. 2.4.2). The detection rates are excellent for the frequential approach: the detection rates are almost always at the 100% level, except for the inner side of the habitable zone, especially with the high spot contrast (with detection rates down to 60%). As already illustrated in Sect. 2.4.2, the detection rates are lower for the temporal approach (for a given mass), with very low detection rates for the inner side of the habitable zone. They lie between 50 and 100% for the middle of the habitable zone. Eisner & Kulkarni (2001, 2002) argued that the temporal approach (i.e. a direct fit) may be more robust, but we observe here that this temporal approach leads to higher false positive levels compared to the planet signal than with the frequential approach, which is therefore more suitable. The effect of the assumptions made to compute the false positive on the results is presented in Appendix B.3.

thumbnail Fig. 3

Left panels: distribution of the difference between fitted period and true period for ΔTspot1 (solid) and ΔTspot2 (dashed), and for the lower HZ (black), medium HZ (red), and upper HZ (green) for all simulations (and therefore all spectral types), the distribution width for all simulations vs. spectral types, and the median of the difference between fitted and true period vs. spectral type. Right panels: same for the mass.

thumbnail Fig. 4

Detection rate vs. spectral type for 1 MEarth planet for ΔTspot1 (solid) and ΔTspot2 (dashed) forthe temporal analysis (upper panel) and frequential analysis (lower panel), and for different orbital periods: lower HZ (black), medium HZ (red), and upper HZ (green).

4.1.3 S/N distributions

The detection rates obtained using the frequential analysis in Fig. 4 for a true level of fp of 1% are very close to those obtained in Appendix D using the SN > 1 criterion. In this section, we confirm the S/N distribution corresponding to those computations. The S/N was computed as in Appendix D(amplitude α of the signal due to the 1 MEarth planet, divided by the rms of the signal due to stellar activity and instrumental noise). We show the distributions for the simulations corresponding to a planetary signal above the false positive level and the frequential analysis (Fig. 5). For the low spot contrast, the S/N is in the range 0.6–1.6 for the inner side of the habitable zone (peak of the distribution around 1.2),and in the range 1.5–3.5 for the outer side (peak around 2.4). The S/N is slightly lower for the high spot contrast. This shows that a relatively low global S/N does not prevent us from obtaining good detection rates (we recall that they correspond to a 1% fp level). We discuss S/N issues further in Sect. 4.2.5.

thumbnail Fig. 5

Distribution of the S/N for simulations above the 1% fp level and frequential analysis for different configurations: ΔTspot1 (solid line) and ΔTspot2 (dashed line), and lower HZ (black), medium (red), and upper HZ (green).

4.1.4 Detection limits

Finally, we considered a more simple configuration: we focused on stars seen edge-on, with no inclination between the orbital plane and the equatorial plane, because the computations are much more time consuming and our objective is to analyse a very large set of simulations. We first computed detection rates as in Sect. 4.1.2, but for different masses, from which we deduced a detection limit as explained in Sect. 2.4.2: this detection limit corresponds to a given detection rate (e.g. 95%) and a given fp level (here 1%). The results are shown in Fig. 6 for the temporal analysis (left panels) and for the frequential analysis (right panels) and three different levels of detection rates (50, 9, and 99%). Again, the detection limits are higher for the temporal analysis. For a very good detection rate of 99%, the detection limits lie around 1 MEarth or below, except for the inner side of the habitable zone, for which they are slightly higher. The S/N distributions peak in the range 0.5–2.2 for the frequential analysis and 0.6–6 for the temporal analysis, with values depending on the requested detection rate. We expect detection limits to be slightly lower for pole-on configurations.

4.2 Detectability based on observational approach: blind tests

In the previous section, we have computed detection rates based on a theoretical knowledge of the false positive levels for a given set of simulations (e.g. averaged over a given spectral type): the fp level was computed over all simulations, and then applied to each of these simulations separately. In practice, the level of false positives is often determined from the observed time series itself, without any theoretical knowledge, and in particular using the FAP defined in Sect. 2.4.3 based on a bootstrap computation. We therefore performed blind tests based on FAP estimation to evaluate the detection rates and false positive levels corresponding to this classical approach. Comparisons between the FAP and our fp levels, as well as characterisation of LPA detection limits (Meunier et al. 2012) are performed in Appendix E.

4.2.1 Protocol

We performed blind tests similar to those implemented in Meunier & Lagrange (2020b). For each spectral type, we randomly selected 400 realisations of the activity simulations and randomly selected the inclination assuming uniform distribution in cos(i). The sampling was chosen randomly as in previous sections. In statistically half the cases, no planet was injected, and a planet was injected in the remaining simulations. The planetary parameters were chosen as follows for our reference blind test (A): 1 MEarth, period chosen randomly in the habitable zone, random phase, and angle between orbital plane and stellar equatorial plane Ψ following the distribution used previously. The configuration was similar to previous tests (star at 10 pc, 0.199 μas per point, 50 points randomly chosen over 3.5 yr). This was done for both spot contrasts separately.

These time series were then automatically analysed in a simple way given the number of realisations: the FAP at the 1% level was computed, and the highest peak in the 2-2000 day range was identified. If it is higher than the FAP, we considered it to be a detection, otherwise we considered that there is no detection for this time series (other possibilities are discussed below). If it was a detection, the fit was made as before, and the stellar inclination was assumed to be known. The results can automatically be compared to the true parameters, which allows us to define different categories in terms of detections and false positives as described below. We note that in all our simulations, the second highest peak is almost always below the FAP, except for only 0.06% of the cases (see Sect. 4.2.5), so that the analysis based on the highest peak above the FAP is enough. The effect of the window function is not taken into account, but is expected to be small because of the random sampling: there are indeed no dominant peaks because in more than 80% of the cases the second highest peak of the window function is within 20% in amplitude of the highest peak (see Sect. 4.2.3 for a more detailed discussion). In addition to this work, additional blind tests are performed to evaluate the effect of various parameters: they are listed in Table F.1, and the averaged rates are shown and discussed in Appendix F. We conclude that the detection rates depend on distance, mass, and period, as expected, and they remain very good, except for a higher noise level: this is therefore a critical instrumental constraint.

thumbnail Fig. 6

First column: detection limits based on a true fp level (based on power and frequential analysis) vs. spectral type for a 50% detection rate (first panel), for a 95% detection rate (second panel), and a for 99% detection rate (third panel). Curves are for ΔTspot1 (solid) and ΔTspot2 (dashed), and for the lower HZ (black), medium HZ (red), and upper HZ (green). Second column: same for fp based on a temporal analysis (based on mass).

Table 1

Categories of results from blind tests.

4.2.2 Categories

Depending on how the results compare to the input parameters (planet injected or not injected and its period), we identify different categories to estimate the detection and false positive rates as in Meunier & Lagrange (2020b), adapted from Dumusque et al. (2017). They are defined as follows: (1) no retrieved planet for no injection (good retrieval), (2) retrieved planet for no injection (false positive), (3) no retrieved planet although a planet was injected (characterised as a rejected planet if the period is close to the true period, and missed planet if the period is far away from the true planet), (4) retrieved planet with a good period when a planet was injected (allowing us to compute the detection rate by comparison with the number of injected planets), (5) retrieved planet with a poor period (i.e. an incorrect planet, which is a second category of false positive). They are summarised in Table 1. The criterion for determining whether the period is close was derived from the difference distribution of the fitted period and the true period (see next paragraph) and was chosen to be 100 days.

4.2.3 Parameter distributions

We analysed the difference between the planet parameters obtained in the analysis (period and mass) with the true parameters of the injected planet. The results are summarised in Fig. 7. The typical width at half-maximum of the period difference distribution is of the order of 20 days. We note that in a few cases, the periods diverge during the fit, that is, the highest peak in the periodogram is close to the true planet period, but after the fit, it has converged at a completely different peak. This divergence is extremely rare (the two estimate periods differ by more than 100 d in less than 0.2% of the cases), however, and the rms of the difference between the two period estimates is only 10 days. This does not seem to be due to the window function because the fitted period is not closer to the highest peak in the periodogram of that function.

The full width at half maximum for the mass distribution is only of the order of 0.2 MEarth. The dispersion does not vary significantly with spectral type and lies in the range 10–14%, which is a very good performance. The larger uncertainty is observed for ΔTspot2. The bias on the mass of the order of 5–8% on average is significant, however, as we showed in Sect. 4.1. These properties are close to what we obtained in Sect. 4.1 with the theoretical false positive level of 1%.

thumbnail Fig. 7

Upper panels: distribution of period (left) and mass (right) difference (between found and true period) for ΔTspot1 (solid line) and ΔTspot2 (dashed line) from blind tests A. The period plots are for the period corresponding to the highest peak in the periodogram (black) and fitted on the temporal series (red). Middle panels: median period and mass vs. spectral type. The true value is shown in green. The dotted lines in the period plot indicate the boundaries of the habitable zone we consider. Lower panels: distribution width (period difference and mass difference) vs. spectral type.

Table 2

Average detection rates and poor recovery rates (%).

4.2.4 Detection and false positive rates

The average rates are shown in Table 2. For an FAP level of 1% and ΔTspot1, most planets are recovered, while the false positive rate when no planet is injected is very close to zero. For ΔTspot2, the results are still excellent when no planet is injected and the detection rates are slightly lower when a planet is injected (99.6 and 80% on average, respectively). The losses are mostly due to rejected planets (second row of the figure), that is, due to the highest peak at the proper period but below the FAP, and to a lesser extent, the losses are due to missed planets (incorrect period and below the FAP.) We note in particular the very low false positive rates when either no planet is injected (0.4% for both ΔTspot1 and ΔTspot2) and when a planet is injected (0.5 and 0.7%): they are below the 1% rate expected from the FAP and therefore correspond to a better performance than expected. There is no significant trend with spectral type (Fig. F.1). For an FAP level of 0.1%, the false positives are very rare (0% when no planet is injected and 0.4–0.5% when a planet is injected), but the rejected planet rates strongly increase when a planet is injected (22.4 and 36.1% for ΔTspot1 and ΔTspot2, respectively). This FAP level is therefore not very interesting because the effective false positive level with the FAP at 1% is already verylow and not improved here.

Figure 8 shows the distribution of the FAP levels when no planet is injected compared to when a planet is injected. There is little difference between the two spot contrasts (the FAP levels are higher for ΔTspot2 by a few percent,as expected). However, the presence of a planet strongly affects the FAP (the ratio of the median with planet divided by the median without planet is about 2.2), although the objective of the FAP is to provide an estimation of the false positive level due to all processes excluding the planet. This strong overestimation of the FAP when a planet is present probably explains the strong rejected planet rate in Fig. F.1. The FAP is therefore a poor estimation of the false positive level for this type of time series, although it is conservative. We note that the sum of the good and rejected planets (i.e. highest peak at the proper period) represents 99.1% of the realisations for ΔTspot1 and 96.1% for ΔTspot2, so that if the false positive level could be better estimated, the detection rates would be excellent.

thumbnail Fig. 8

Distribution of FAP values for ΔTspot1(upper panel) and ΔTspot2 (lower panel) when a planet is injected (solid line) and when no planet is injected (dashed line).

4.2.5 Relationship between detection and simulation parameters

We detect most planets even with global S/N as low as 1.4–1.6, and all injected planets are retrieved for S/N above 1.6–1.8, depending on the spot contrast. This means that it may not be necessary to impose a very high S/N such as 5 or 6 to obtain excellent performance (we recall that our false positive rates are very low in these conditions, of the order of 0.5%, and even lower in the habitable zone, as we show below). The reason is that the global S/N as computed above does not take thefrequency behaviour of our signal into account: stellar activity mostly affects short periods (around Prot), while the planet is responsible for a peak at much longer periods. We therefore computed S/N estimates corresponding to the peak in the periodogram as a more representative indicator: this peak S/N is defined as the ratio between the amplitude of the peak and the amplitude of the highest of the remaining peaks in the whole period range considered (2–2000 days) and for periods longer than 100 days. The distribution of these values is shown is Fig. 9 for ΔTspot1. This peak S/N reaches about 5 for all periods (median of 2.5) and about 16 for periods longer than 100 d (median of 4.4), so that the effective S/N is much higher than 1. We find that all planets are retrieved for a peak S/N (period above 100d) higher than 5.

Most false positive either have very short or very long periods (above 1000 days). The latter case is probably due to the limited length of the time series (3.5 yr). There may be a trend for a higher number of false positives formore active stars (evaluated with different criteria), but it is not very significant given their low number. Their periods are compared to the rotation period in Fig. 10: they are never close to Prot (except for four points in the ΔTspot2 case), and otherwise lie at short periods (below 10 days) or close to 1000 days as noted before. An important conclusion is that they do not correspond to planets in the habitable zone, with only nine points there out of our 15 200 simulations: the false positive rate in the habitable zone is therefore extremely low (0.06%) and much below the 1% FAP rate we used to perform the analysis. This confirms that the FAP is strongly overestimated. Furthermore, when no planet are injected, only one false positive out of 15 200 simulations is detected, that is 0.007%, so that false positives are most of the time due to an injected planet at a different period (even if the planet peak is not significant).

Finally, we also examined the properties of the second highest peak. In most cases, it is below the FAP (except for only nine simulations). Most points lie either at a short period but are not associated with Prot or around 1000days, as shown in the lower panels of Fig. 10.

thumbnail Fig. 9

Distribution of S/N for ΔTspot1 and detected planets: global S/N (black), peak S/N (over the whole range, red), and peak S/N (over period higher than 100 days, green).

thumbnail Fig. 10

Period of the maximum peak in the periodogram vs. rotation rate for ΔTspot1 (left panels) and ΔTspot2 (right panels) and in two cases. First row: false positive level when no planet is injected (black) and when a planet is injected (red). Stars correspond to peaks outside the habitable zone and circles to peaks inside the habitable zone. The brown line indicates the position of the rotation rate. The two black curves represent the extent of the position of the inner side of the habitable zone (for different spectral types), and the green curves show the position of the outer side of the habitable zone. Second row: second highest peak in the periodogram, those in green indicate the peaks above the FAP. The brown line is similar to the first row.

5 Conclusion

We have used a large set of synthetic time series of complex (solar-like) activity patterns for F6-K4 old main-sequence stars to evaluate their effect on exoplanet detectability in astrometry. We focused on Earth-mass planets orbiting in the habitable zone (and therefore on high-precision astrometry) around such stars and implemented different complementary approaches to assess detectability. The comparison between the different approaches is provided in Appendix G.

The rms of the astrometric activity time series in the X direction can be up to two to four times the solar (edge-on) value, depending on spot contrast assumption and inclination, and up to four to five times in the Y direction. The time series exhibit a strong skewness in the Y direction that strongly depends on inclination, which means that the statistical properties of the time series contain information on stellar inclination. However, the presence of noise prevents us from determining this useful stellar parameter. We compared the rms of the astrometric signal to other observables and in particular with photometry and radial velocities. Radial velocities are affected by other processes that we did not include in this comparison, but the knowledge of the stellar variability should allow us to derive a range of astrometric variability (within typically a factor two), and possibly adapt the observational strategy (number of visits for example).

We found that detection rates for 1 MEarth planets in the habitable zone are very good because a large fraction of such planets are expected to be found, with rates above 50%, which is much better than the rates expected for the RV, especially in the middle of the habitable zone and beyond, for conditions similar to those proposed in the THEIA mission (only 50 visits over 3.5 yr) for stars at 10 pc, if technological challenges can be overcome to reach high-precision astrometry. This technique is therefore much more suitable than RV to detect such planets in the range of spectral type and age we considered. By comparison, the detection rates obtained using a simple method based on the same set of simulations for RV are far lower and very close to 0% for G stars even with a very large (several thousands) number of observations (Meunier & Lagrange 2019b). The presence of granulation and supergranulation in RV also leads to poor detection rates based on blind tests similar to those performed in the present paper, usually below 50% (Meunier & Lagrange 2020b). We expect RV performance to be slightly better for K stars than for F-G stars (e.g. Meunier & Lagrange 2019b), so that astrometry could be very interesting for the latter category of stars, where many stellar processes significantly affect the RVs. Furthermore, the uncertainty on the fitted periods, and more importantly, on the mass are very good. The uncertainties on the mass are indeed below 20%, which would be very interesting for PLATO follow-ups.

The application of the classical bootstrap method to estimating the FAP in our blind tests led to important conclusions. First, the FAP is significantly overestimated, especially when a planet is present (by more than a factor 2), so that the false positive level is very low, especially in the habitable zone. However, for the same reason, some planets remain undetected even though the highest peak most of the time is at the true planetary period. Better indicators of the false positive should then be used, for example, using our theoretical knowledge of the stars as shown by the theoretical false positive levels obtained in this paper. A possible strategy to obtain better results would also be to make more visits to certain stars: doubling the number of points provides rates close to 100%. We also observed that it is possible to obtain very good rates with a low level of false positives (<0.5%) for a global S/N between only 1 and 2 because this indicator does not represent the S/N of the peaks in the periodogram well, for which the S/N is much higher.

In a future work, we will use this large set of synthetic time series and the approaches described in this paper to re-evaluate the expected detection rates and detection limits of the stars in our neighbourhood. They constitute the main targets of future high-precision astrometry missions.

Acknowledgements

This work has been funded by the ANR GIPSE ANR-14-CE33-0018. We are very grateful to Charlotte Norris who has provided us the plage contrasts used in this work prior the publication of her thesis. We thank Pascal Rubini for his work to convert the simulation code in C++. This work was supported by the “Programme National de Physique Stellaire” (PNPS) of CNRS/INSU co-funded by CEA and CNES. This work was supported by the Programme National de Planétologie (PNP) of CNRS/INSU, co-funded by CNES.

Appendix A Planetary orbits

The planetary orbit for the first category of computations (planet seen edge-on, aligned with the X-axis of the star) is described as (Eisner & Kulkarni 2001) xpla=Accos(ωt)+Assin(ωt),\begin{equation*}x_{\textrm{pla}}=A_c \cos(\omega t) &#x002B; A_s \sin (\omega t) ,\end{equation*}(A.1)

where Ac = α sin ϕ et As = α cos ϕ (ϕ related to the phase of the planet on its orbit), and ω is 2π/Ppla.

For the more complete approach, we considered two assumptions because the angle Ψ between the planetary orbital plane and the stellar equatorial plane is poorly constrained:

  • A. Distribution of the angle Ψ. These choices result from the observation that there are known exoplanets with values of Ψ significantly different from 0: we expect Ψ to affect the relationship between the astrometric signal in the X and Y directions. We used the results obtained using the Rossiter-McLaughlin effects (obtained for more massive transiting planets). Very few studies provide the true angle, and in general, only the projected (on the sky) angle is provided. We therefore used the list of projected obliquities provided in the TEPCat catalogue Southworth (2011)1, and then performed a Monte Carlo simulation to derive a realistic distribution of true obliquities corresponding to these projected obliquities, following the procedure described in Fabrycky & Winn (2009): we assumed a planetary orbit seen edge-on (because they are seen to be transiting in this catalogue2) and a uniform distribution in cos i. The resulting distribution in Ψ is shown in Fig. A.1. Such an approach has been used in previous works (e.g. Triaud et al. 2010; Brothwell et al. 2014).

  • B. Ψ = 0 (i.e. the orbital plane and the equatorial plane are the same). This simple assumption has been used in several works (e.g. Meunier & Lagrange 2020b), and low values of Ψ correspond to telluric planets in the Solar System.

thumbnail Fig. A.1

Upper panel: distribution of the angle between the stellar equatorial plane and the planet orbital plane Ψ assuming that transiting planets are seen edge-on seen (solid line) and assuming that transiting planets are seen with a small dispersion around the edge-on configuration (dashed line). Lower panel: same for the cumulated distribution.

The planetary orbits were then modelled as follows. The orbit was first described in the orbital plane of the planet as (Eisner & Kulkarni 2001) xpla=Accos(ωt)+Assin(ωt)\begin{equation*}x_{\textrm{pla}}=A_c \cos(\omega t)&#x002B;A_s \sin (\omega t) \end{equation*}(A.2) ypla=Ascos(ωt)+Acsin(ωt).\begin{equation*}y_{\textrm{pla}}=-A_s \cos(\omega t)&#x002B;A_c \sin (\omega t) .\end{equation*}(A.3)

We then perform several successive projections: following the true obliquity Ψ (either 0 or following the distribution described previously), then following the angle between the line of sight and the node line Φ0 (chosen randomly), and then following the stellar inclination i. This leads to the coordinates of the planet in the plane of the sky, xpla=Acos(ωt)+Bsin(ωt)\begin{equation*}x_{\textrm{pla}}&#x0027;=A&#x0027; \cos(\omega t)&#x002B;B&#x0027; \sin (\omega t) \end{equation*}(A.4) ypla=Acos(ωt)+Bsin(ωt),\begin{equation*}y_{\textrm{pla}}&#x0027;=A&#x0027;&#x0027; \cos(\omega t)&#x002B;B&#x0027;&#x0027; \sin (\omega t) ,\end{equation*}(A.5)

where xpla$x_{\textrm{pla}}&#x0027;$ corresponds to the X direction of the star, and where A=Ascos(Ψ)cosΦ0+AcsinΦ0\begin{equation*}A&#x0027;= -A_s \cos(\Psi) \cos \Phi_0 &#x002B; A_c \sin \Phi_0 \end{equation*}(A.6) B=Accos(Ψ)cosΦ0+AssinΦ0\begin{equation*}B&#x0027;= A_c \cos(\Psi) \cos \Phi_0 &#x002B; A_s \sin \Phi_0 \end{equation*}(A.7) A=Assin(Ψ)sin(i)+(Ascos(Ψ)sinΦ0+AccosΦ0)cos(i)\begin{equation*}A&#x0027;&#x0027; = -A_s \sin(\Psi) \sin (i) &#x002B;(A_s \cos(\Psi) \sin \Phi_0 &#x002B; A_c \cos \Phi_0) \cos (i) \end{equation*}(A.8) B=Acsin(Ψ)sin(i)(Accos(Ψ)sinΦ0AscosΦ0)cos(i).\begin{equation*}B&#x0027;&#x0027; = A_c \sin(\Psi) \sin (i) -(A_c \cos(\Psi) \sin \Phi_0 -A_s \cos \Phi_0) \cos (i) .\end{equation*}(A.9)

Appendix B False positive levels

We first detail the criteria we used to compare the signal with an injected planet with a false positive level, and then the compute the false fp levels from the simulations for each spectral type. Then we show effect of the dependence of the fp level on activity and inclination on the results.

B.1 Criteria

Two types of criteria were used in this paper. First, we performed a frequential analysis (Marcy et al. 2005; Catanzarite et al. 2008; Makarov et al. 2009, 2010; Traub et al. 2010): we computed the maximum power in the periodogram at periods close to the orbital period we wished to characterise, first without a planet to compute the false positive level, and then with a planet to compute the detection rate by comparison with the false positive level computed over a large sample of simulations. Because we are interested in the false positive level in a certain domain (habitable zone), we considered (PHZin, PHZmed, and PHZout) for each of the orbital periods, the maximum of the periodogram was computed in the range [0.5P,2P], where P takes one of these three values (the results are not very sensitive to the exact range).

Second, a direct fit of the signal, using a χ2 minimisation, can be made, assuming a planetary signal, as in Eisner & Kulkarni (2001, 2002), who claimed that this should constitute a more robust approach. The model used to perform the fit is either the model described in Eq. (A.1) (for the simple edge-on configuration) or using Eqs. (A.4) and (A.5) (and then using Eqs. (A.6)–(A.9) to retrieve the original parameters, and then α and the planet mass). An offset was also added to the model. We note that the main objective here was not to refine the minimisation technique but to analyse a very large number of synthetic time series using standard tools. Unless indicated otherwise, the stellar inclination i was assumed to be known: this should not affect the fitted amplitude and period significantly3 because its effect should be on the angles Ψ and Φ0, and there is a strong degeneracy between these three angles in any case (see discussion in Eisner & Kulkarni 2001). We estimate the false positive level separately for each spectral type and spot contrast. This corresponds to 6480 simulations in most cases (except for K stars for which the number of simulations is slightly lower).

thumbnail Fig. B.1

Detection rate (above 1% false positive level) vs. mass for G2 stars, PHZin, and ΔTspot2 for the frequential (black) and temporal analysis (red). The vertical lines indicate the position of the 50% detection rate levels.

We used a similar approach using the RV technique to estimate the effect of granulation and supergranulation on exoplanet detectability (Meunier & Lagrange 2020b). The situation is more complex here, however. In Meunier & Lagrange (2020b), we considered anumber of realisations, for example, of granulation, for a given spectral type, but all simulations corresponded to the same amplitude of the signal. In the present case, simulations of a given spectral type correspond to different activity levels, both on average and temporal variability. They also have different inclinations, which could correspond to different stellar signal amplitudes. The details of the computation are presented in the following section, as well as the effect of a variable fp determination corresponding to a given spectral type on the results. In most of the paper, we used a constant fp level for a given spectral type. We show that the effect is minor because the detection rates are very close to 100% in many configurations.

Finally, using a similar approach but considering different masses, it is possible to compute the mass (i.e. the detection limit) corresponding to a given detection rate (e.g. 50 or 95%), for the same 1% false positive rate. This is illustrated in Fig. B.1. We note that the frequential approach leads to lower detection rates than the temporal approach (for a given mass). The reason is not yet clear, but it may be due to a difference in sensitivity to the noise because we see, for example, in Sect. 4.1.1 that the noise leads to a bias in the mass estimation so that the temporal method is sensitive to it.

thumbnail Fig. B.2

Illustration of fp computation for K2 stars, all inclinations, ΔTspot2,PHZin, representing power (frequential analysis, upper panel) and fitted mass (temporal analysis, lower panel) vs. rms of logRHK$\log R&#x0027;_{\textrm{HK}}$ for no planet (black) and with an injected planet (orange, 1 MEarth). The green line corresponds to a constant fp level of 1%, and the red line shows a fp level depending on the rms of logRHK$\log R&#x0027;_{\textrm{HK}}$. The green and red dots show the upper and lower bounds corresponding to the criterion (see text).

B.2 Computation of false positive levels

In these simulations, the stellar activity level should affect the fp level:

  • If we consider a single value of the fp level for a given spectral type: this means that we consider all stars of this spectral type as equivalent because we do not have any information on the activity level of the star, for example. We are then interested in the probability that a planet can be detected around a star of that spectral type globally, that is, knowing only its spectral type.

  • However, we expect that if a star is quiet, then it should be easier to detect a given planet around it, but if the star is active, it should be more difficult, that is, the fp level might in fact be slightly different. If the variability of the star is well characterised, then we can estimate the fp level by separating stars that are strongly variable from those that are not.

The same argument might be made by separating simulations according to their inclination.

thumbnail Fig. B.3

False positive fp averaged over all simulations vs. inclination for different PHZ (inner side in black, middle in red, and outer side in green) and for ΔTspot1 (solid line) and ΔTspot2 (dashed line) for the frequential analysis (upper panel) and temporal analysis (lower panel).

Figure B.2 illustrates the computation of the fp level for a K2 star, ΔTspot2, and the inner side of the habitable zone using the frequential approach (upper panel) and the temporal approach (lower panel). The black dots correspond to the amplitude (either the power or the mass) when no planet is injected, which we used to compute the fp level: the green horizontal line is the fp level of 1% when it is considered constant for that spectral type, that is, 1% of the black dots are above the green line. The orange dots, corresponding to the same simulations with a 1 MEarth mass planet injected, can then be used to compute the detection rate corresponding to this 1% fp level.

The red line, on the other hand, provides the fp level when a dependence on the rms of logRHK$\log R&#x0027;_{\textrm{HK}}$, hereafter r, is considered, that is, based on the amplitude of the stellar variability. It is computed as follows: we defined bins in r and assumed that fp follows a linear trend in r. We computed 1000 simulations of the slope, adjusting the straight line so that the percentage of black dots above the line was as close as possible to 1% in each bin. We kept the best solutions (i.e. that met the condition for the maximum number of bins, usually 4). The median of those solutions is shown as the red line in Fig. B.2. With this computation, the detection rates should be better for the less variable stars than for the most variable stars. However, we note that this is observed mostly for the configuration shown in this figure (ΔTspot2, inner side of the habitable zone): for the other configurations, there is not much difference between the two choices (constant or variable fp level) because the detection rates are much better and in fact at the 100% level (most orange dots are well above the black dots). The effects of this choice is investigated in Appendix B.3.

thumbnail Fig. B.4

Detection rates vs. spectral type for 1 MEarth planet based for the frequential analysis (upper panel) and temporal analysis (lower panels), separately for ten inclinations (colour code as in Fig. 3). The detection rates are shown for ΔTspot2 and the inner side of the habitable zone. The fp levels depend on inclination for the right panels, but they do not for the left panels.

We also studied how the false positive levels depend on inclination. For each orbital period (three values in the habitable zone) and ΔTspot, we averaged the fp levels over all simulations of a given spectral type and inclination. The results are shown in Fig. B.3. A few trends can be seen. There is a bump around 20–30° for the frequential approach (upper panel), mostly for the middle and outer habitable zone, and then a small trend for ΔTspot2. This is explained in Sect. 3.1. The curves are mostly flat for the temporal approach (lower panel), except in a few cases that are close to edge-on.

B.3 Effect of the assumptions made during the false positive level computation

We showed in Sect. 4.2.2 the detection rates corresponding to the theoretical fp level determined for each spectral type. Here, we consider the effect of the activity level and of inclination on this determination and on the subsequent detection rates.

When a variable level of fp is used within a spectral type bin (i.e. versus inclination or activity variability), these detection rates are very similar. However, when we consider stars of various activity levels or different inclinations separately, for example, we expect different detection rates. Because they are often very close to 100%, however, the effect is in fact very small. We illustrate the effect of these variable false positives levels in the case of the inner side of the habitable zone and the high spot contrast, forwhich the effect is the largest because the detection rates are below 100%. Figure B.4 shows the effect of inclination in this configuration. The left panels correspond to the constant fp level, and on the right side, both fp levels and detection rates are computed separately for each inclination. When a constant fp level (i.e. averaged over all inclinations) is considered, the detection rates are higher when the star is seen pole-on. This is expected because more information is available in this case (from the X and Y directions). The difference between edge-on and pole-on configurations is small, however. When the inclination-dependent false positive level (obtained in Sect. 2.4.2) is considered, the trend is slightly reinforced, so that pole-on configurations are clearly easier targets.

thumbnail Fig. B.5

Same as Fig. B.4 for fp levels depending on activity variability. The colour code corresponds to different activity variability levels (based on the rms of logRHK$\log R&#x0027;_{\textrm{HK}}$): quiet (orange), intermediate (red), and active (green).

Figure B.5 shows a similar analysis for the dependence on the activity variability. When the same false positives for all simulations within a spectral type bin (left panels) are considered, there is a difference in detection rate depending on the activity level, but it is very small and not significant. When the false positive rates are computed separately for the different simulation categories (criterion based on the rms in logRHK$\log R&#x0027;_{\textrm{HK}}$), however, the detection rates are better for the quiet stars than for the most active stars, as expected (e.g. between 80% and 40% for K stars).This gives an estimate of the range in detection rates that is expected for stars of different variability levels. We recall, however, that this is seen mostly for the inner side of the habitable zone and high spot contrasts: for all other configurations, the rates are always very close to 100% and show little departure from it even for the most active stars.

Appendix C Properties of the stellar astrometric signal

In this appendix, we show more detailed results concerning the characterisation of the astrometric signal. We first show the effect of inclination on the properties of the astrometric signal, and discuss the properties for spots and plages separately. The relationship between the properties in the X and Y directions is studied in more detail. Finally, we show in more detail the relationship between the astrometric and the photometric signals.

C.1 Effect of inclination

Figure C.1 shows the distributions of rms and skewness computed for each time series for the different inclinations and all spectral types and activity levels we considered. We observe a trend with inclination, as well as a positive skewness forthe Y direction when it is far from pole-on. Figure C.2 shows the average rms versus inclination separately for the different spectral types. The rms in the X direction is decreasing for increasing inclination (from pole-on to edge-on). In the Y direction and when the two directions are combined, there is a bump around 20°, followed by an increase in ΔTspot2, which may explain how fp depends on inclination (Sect. 2.4.2).

C.2 Contribution of spots and plages

We illustrate separately the contribution of spots and plages to the astrometric signal. The distributions of the rms in the X and Y directions and of the skewness in the Y direction are shown for spots alone (ΔTspot1 in the upper panels and ΔTspot2 in the middle panels) and plages alone (lower panels) in Fig. C.3. For spots, the inclination effect is more clearly seen for the rms in the X direction and the skewness. For plages, the effect is more visible for the rms in the Y direction and the skewness. The sign of the skewness is reversed compared to spots. The shapes of the rms distributions are different as well. The rms due to plages is of the same order of magnitude as that for spots alone (ΔTspot1 assumption).

C.3 Relation between the stellar astrometric signal in the X and Y directions

As already shown in Sect. 3.1 (Fig. 1), the stellar astrometric signal in the X and Y directions does not show the same behaviour: the amplitude can be different, with a ratio that depends on inclination, and the skewness properties are also different. Figure C.4 illustrates these properties in more detail. All spectral types and activity levels are considered. The first panel shows the fraction of simulations where the rms in the Y direction is higher than in the X direction versus stellar inclination: This fraction is larger for low inclinations (above 50%) and smaller (below 50%) for high inclinations. It is sensitive to the latitudinal extent of the activity pattern. The second panel shows the fraction of simulations in which the skewness in the Y direction (which showed an asymmetry in Fig. 1) is positive: it is overall larger than 50%, but only at low inclinations (below 40°) for ΔTspot1, and it is present for most inclinations for ΔTspot2. The two other panels illustrate this from another point of view. The frame of reference is rotated, so that a θ angle of 0° correspondsto the time series projected on the original X direction, and 90° corresponds to the time series projected on the original Y direction. For each time series projected using various values of θ, we identify the θ value for which the skewness is highest. A peak in the distribution at 90°, for example, means that the skewness is maximal in the Y direction. We find that there are indeed peaks at 90° and 270°, especially for ΔTspot2. For ΔTspot1, these peaks are present, but for certain inclinations (intermediate inclinations, curves in red), the skewness is highest for different projections (intermediate between X and Y).

thumbnail Fig. C.1

rms (left) and skewness (right) distributions of the activity time series in both directions for (from upper to lower panel) the X direction and ΔTspot1, the Y direction and ΔTspot1, the X direction and ΔTspot2, and the Y direction and ΔTspot2. Different curves show the ten inclinations (same colour code as in Fig. 1).

thumbnail Fig. C.2

rms of time series in the X direction (upper panels) and the Y direction (middle panels), and combining both directions (low panels) for ΔTspot1 (left) and ΔTspot2 (right). The colour code represents the spectral type from F6 (yellow) to K4 (blue).

We examined whether such properties might be used to determine the stellar inclination as well as the X and Y directions from a given observation. We first considered the full time series without noise. The protocol was the following. The distribution of θ values at which the skewness in Y was highest was plotted. We also separated the simulations into two groups, those in which the rms in Y is higher than the rms in X (called SEL LOW), and those in which the rms in Y is lower (called SEL HIGH). We then compared the inclination distribution for these two categories: if the distributions are well separated, then we expect to be able to use the criterion to identify stars with low or high inclinations. The results are shown as solid lines in Fig. C.5. The inclination distributions for the two selections peak in different domains, although there is an overlap. When the sampling is degraded, however (dashed lines), the distributions completely overlap, and even more when noise is added (dotted ones). We conclude that it is therefore unlikely that such properties could be used to characterise the geometry of the system.

thumbnail Fig. C.3

Distribution of rms in the X direction (left), of rms in the Y direction (middle) and of skewness in the Y direction (right) for ten inclinations (the colour code is similar to Fig. 1) for different contributions: spots with ΔTspot1 (upper panels), spots alone ΔTspot2 (middle panels), and plages alone (lower panels).

C.4 Relationship between astrometry and photometry

Figure C.6 separately illustrates in more detail the relationship between astrometry and photometry for three different spectral types and two inclinations. This illustrates the typical expected amplitude of the astrometric signal when the photometricvariability is known. The relations are not strictly linear because there is saturation at high activity level. The inclination effect in some of these relations is strong, mostly for the astrometric signal in the Y direction.

thumbnail Fig. C.4

First panel: fraction of simulations in which the rms is higher in Y than in X direction for ΔTspot1 (solid) and ΔTspot2 (dashed). The colour code represents different θmax values: solar (red), solar+10° (green), solar+20° (blue), and all (black). Second panel: same for the fraction of simulations with a positive skewness in the Y direction. Third panel: distribution of the angles at which the skewness in Y is highest (ΔTspot1 only) for the ten inclinations (same colour code as Fig. 3). Fourth panel: same for ΔTspot2.

thumbnail Fig. C.5

Left column, from top to bottom: distribution of different variables characterising activity for ΔTspot1 and for different time series: complete without noise (solid lines), degraded sampling without noise (dashed lines), and degraded sampling with noise (dotted lines). Variables are the angle at which the skewness is highest (90° and 270° represent the Y direction), inclination for simulations in which the rms in this direction is higher than in the orthogonal direction, and inclination for simulations in which the rms in this direction is lower than in the orthogonal direction. Right column: same for ΔTspot2.

thumbnail Fig. C.6

Rms of astrometric signal vs. rms photometry for various configurations (spectral type and inclination) for ΔTspot1 (black) and ΔTspot2 (red).

Appendix D Detectability based on S/N

In this appendix, we compute detection rates based on a global S/N, as described in Sect. 2.4.1. We considered S/N thresholds between 1 and 4. For a given S/N threshold, we computed the percentage of simulations with S/N above that threshold (for the 1 MEarth planet), which provides the detection rate shown in the upper panel in Fig. D.1 for the three habitable zone orbital periods and the two spot contrasts. The detection rates are excellent when the threshold is equal to 1, except for the inner side of the habitable zone, where they can be as low as 20% for K stars. They are much lower than 100% for a threshold of 2, with very poor performancefor the inner habitable zone. For higher thresholds (S/N of 3 and 4), the detection rate is equal to 0. This does not imply that adetection with S/N of 1 or 2 is a good detection, but we point out that with this standard definition of the S/N, a 1 MEarth planet should not be detectable for stars like the Sun. We discuss this S/N definition in Sect. 4.2.5. A much higher threshold (such as 5 or 6 as used in some previous works, e.g. Theia Collaboration 2017) would lead to a detection rate of 0% as well for such low-mass planets, as α is never higher than five or six times thenoise for a 1 MEarth planet (at 10 pc). We compare these rates with more sophisticated approaches taking the temporal and frequential properties of the signal into account in Appendix G.

thumbnail Fig. D.1

First row: detection rate vs. spectral type deduced from different S/N thresholds (from left to right: 1, 2, 3, and 4) for ΔTspot1 (solid) and ΔTspot2 (dashed) for a 1 MEarth planet at different orbital periods: lower HZ (black), medium HZ (red), and upper HZ (green). Curves that are not visible are at the 100% level S/N thresholds 1 and 2, and at the 0% level for thresholds of 3 and 4. Second row: mass detection limit with a 50% confidence level for the same criteria (same colour and line code). Third row: same with a 95% confidence level.

The same approach can then be used to determine the planet mass corresponding to a certain detection rate (i.e. a certain percentage of simulations with an S/N above the considered threshold). The results are shown for 50 and 95% detection rates in Fig. D.1 (second and third row, respectively). Detection limits can be as low as 0.4 MEarth for certain configurations (massive stars and ΔTspot1) and are always below 3 MEarth for the low thresholds, but they are between 2 and 4 MEarth for thresholds of 3–4 on S/N.

Appendix E Comparison of FAP, fp, and LPA detection limits

In this appendix, we discuss some observational tools in more detail. We first compare the FAP with the theoretical false positive levels and then characterise the properties of the LPA detection limits.

E.1 Comparison of the FAP with the true false positive levels

We compared the FAP and the theoretical fp in a simplified configuration: stars seen edge-on only, and no inclination between the orbital plane and the equatorial plane (Ψ = 0). We focused on the middle of the habitable zone. For each of these simulations (no planet was added), we computed the false positive level (as in Sect. 4.1) and the FAP level (using a bootstrap analysis with 1000 bootstrap iterations), both at the 1% level. The FAP corresponds to the assumption that the signal is due to Gaussian noise. Our objective in this section is to compare these two estimates of the false positive levels, and the effect on the detection rates was studied in the blind tests in Sect. 4.2. The ratio distributions of FAP/fp are shown in Fig. E.1 (upper panel). There is an overlap witha ratio of 1, but the distributions are clearly shifted towards values higher than 1, that is, on average, the FAP overestimates the false positive level: this should lead to a conservative approach when a detection is to be made. The ratio depends only slightly on spectral type, and the weak trend is not significant. The percentage of simulations for which the FAP is higher than fp lies between 60 and 95% depending on the spot contrast we considered.

E.2 LPA detection limits

We characterised the detection limits provided by the LPA approach (Meunier et al. 2012) in two cases, for G2 and K2 stars. The two spectraltypes lead to similar conclusions. Again, we focused on stars seen edge-on, without an inclination between the orbital plane and the equatorial plane. The principle of the LPA estimation, which is a fast method for computing detection limits, is as follows. The maximum of the periodogram around the period of interest was computed, and we searched for the planet mass that would produce a peak 1.3 times the observed peak amplitude (Meunier et al. 2012; Lannier et al. 2017): we assumed that a more massive planet is excluded by the data because it would have produced a higher peak than observed. The obtained mass is therefore an exclusion rate, that is, we aim at excludingthe presence of planets above a certain mass (the LPA detection limit) at the considered orbital period.

thumbnail Fig. E.1

Distribution of the FAP/fp ratio (upper panel), and percentage of simulations for which the FAPis higher than the fp (lower panel) vs. spectral type. The solid lines correspond to ΔTspot1 and the dashed lines to ΔTspot2. The thin lines are for a constant fp and the thick lines for an fp dependent on stellar variability.

thumbnail Fig. E.2

LPA caracterisation for G2 stars (left) and K2 stars (right). First row: distribution of the LPA detection limits for ΔTspot1 (solid lines) and ΔTspot2 (dashed lines) for the different HZ periods: lower HZ (black), medium HZ (red), and upper HZ (green). Third row: same for the distribution of the detection rates corresponding to the LPA masses. Fourth row: same for the distribution of exclusion rates corresponding to the LPA masses.

The distribution of detections limits found for the three values of orbital periods in the habitable zone and two spot contrasts are shown in Fig. E.2 (upper panel): they cover a range between 0.1 and 1.5 MEarth, with distribution peaks at about 0.2 (for ΔTspot1) and about 0.4–0.5 (for ΔTspot2), that is, theyare lower than the detection limits estimated in Sect. 4.1.4. This is explained by the fact that they correspond to a very low detection rate, as illustrated in the middle panels: for each simulation, we superposed 100 simulations of a planetary signal corresponding to the LPA mass (but different phases) and computed the periodogram and maximum power at the planet period. This allowed us to compute an exclusion rate for each simulation, as well as a detection rate by comparison with the theoretical fp. The distribution of exclusion rates is also shown in the lower panels. They are always higher than 50%, with a strong peak at 100%. For example, for G2 stars, the median exclusion rate is 78%, and 17% of the simulations have a 100% exclusion rate.

Appendix F Additional blind test results

In the appendix, we first show complementary results for the reference blind test A. Additional blind tests are then performed, in which individual parameters are modified compared to our reference blind test A (Table F.1). This allows us to show the effect of various conditions (stellar properties with C-F, observational strategy with N, instrumental noise with H, and planet properties with B, G, and I-M) on the detection rates and false positive rates. These blind tests were only made for one spectral type out of two, which is sufficient for the comparison. The average rates are shown in Table F.2. The false positive levels without a planet lie between 0.2 and 0.6%, that is, they are relatively stable in the different tests, and lower than the 1% FAP level used to perform the analysis. The false positive levels with a planet are usually also low, but depend on the planet parameter (mass and distance to its host star).

Table F.1

Blind tests.

F.1 Dependence on spectral type for reference blind test A

Figure F.1 shows the detection rates and the various rates of poor recovery versus spectral type for the reference blind test A. They are discussed in Sect. 4.2.4. The average rates are shown for all blind tests in Table 2.

F.2 Effect of orbit inclination with respect to the line of sight (blind test A)

We also compared the detection rates for various inclinations of the orbit with respect to the line of sight in the case of blind test A. For this purpose, we computed the ratio r between theapparent semi-major axis and the apparent semi-minor axis (true parameters) for all injected planets. The simulations were then separated into two subsets depending on the value of r: values of r close to one correspond to close to circular orbit (orbit seen pole-on), and high values correspond to orbits close to edge-on. Table F.3 shows the resulting detection rates for various selections. We conclude that the planets with a pole-on configuration are better detected than the other configurations, as already seen for more simple noise contribution by Eisner & Kulkarni (2001): this is also true here despite the complex behaviour of the activity signal in the two directions.

F.3 Results for blind test B: Ψ = 0

The average rates for blind test B, that is, Ψ = 0, to compare with reference blind test A, are shown in Table F.2. The rates from blind tests A and B are extremely similar, which means that the exact distribution of the angle between orbital plane and equatorial plane Ψ is not critical and does not affect our results. For this reason, all other blind tests were made with the distribution of Ψ used in blind testA.

thumbnail Fig. F.1

Percentage of good and poor recovery rates vs. spectral type for 1% FAP level (two first rows) and 0.1% FAP level (two last rows) from blind tests A. The good recovery rates correspond to the case without a planet (solid black line) and with a planet (dashed green line). The poor recovery rates correspond to the case without a planet (solid black line, false positive) and with a planet: incorrect planets (red dashed line, false positive), rejected planets (blue dashed line), and missed planets (brown dashed line). In the third row all curves except one are at the 100% level, and most poor recovery curves are close to 0%.

F.4 Results for blind tests C and D: low and high average activity level

The average rates for blind tests C and D, that is, for low logRHK$\langle \log R&#x0027;_{\textrm{HK}}\rangle$ (quiet stars) and high logRHK$\langle \log R&#x0027;_{\textrm{HK}}\rangle$ (active stars), respectively, to compare with reference blind test A, are shown in Table F.2. For ΔTspot1, the effect is very weak, with very similar (and high) detection rates for quiet and active stars. On the other hand, for ΔTspot2, the detection rate with planet is slightly better for quiet stars (83% instead of 76%), although the difference is not great.

F.5 Results for blind tests E and F: low and high Acyc (quiet)

The average rates for blind tests E and F, that is, for low cycle amplitude and high cycle amplitude, respectively, to compare with reference blind test A, are shown in Table F.2. The effect is similar to the selection based on the average activity level (blind tests C and D), with a better detection rate for quiet stars.

Table F.2

Average rates for all blind tests.

Table F.3

Detection rates (in percent) depending on planet orbit inclination with respect to the line of sight.

F.6 Results for blind test G: 2 MEarth

The average rates for blind test G, that is, a 2 MEarth planet, to compare with reference blind test A, are shown in Table F.2. The performance is excellent in this case, with almost no false positive and no missed planets.

F.7 Results for blind test H: instrumental noise x 2

The average rates for blind test H, that is, an instrumental noise twice higher, to compare with reference blind test A, are shown in Table F.2. The effect is not significant when no planet is injected, but the detection rates are very sensitive to this parameter.

F.8 Results for blind tests I, J, and K: star at 5, 15, and 20 pc

The average rates for blind tests I, J, and K, that is, different stellar distances, to compare with reference blind test A, are shown in Table F.2. Stellar distance significantly affects the detectability, as expected. At 15 pc, the rates decrease to 54–36% depending on the spot contrast that is assumed, with many rejected planets. They are below 23% for a distance of 20 pc.

F.9 Results for blind tests L and M: 0.5 MEarth, star at 5 pc and 10 pc

The average rates for blind tests L and M, that is, a 0.5 MEarth planet, to compare with reference blind test A, shown in Table F.2. The detection rates for a 0.5 MEarth planet at 5 pc is very similar to the rates for a 1 MEarth at 10 pc (blind test A). If the 0.5 MEarth planet is at 10 pc, the detection rates are naturally much lower, but they show that some of these planets could be detected with astrometry.

F.10 Results for blind tests N: PHZin, PHZmed, and PHZout

The averagerates for the three blind tests N, that is, with planet orbital periods fixed specifically at PHZin, PHZmed, and PHZout in order to compare with the computations made in Sect. 4.2 with the theoretical false positive levels, are shown in Table F.2. The results are discussed in Appendix G.

F.11 Results for blind tests O: 100 observations

We performed two blind tests O, that is, with a 100 sampling (instead of 50), covering either 3.5 yr as blind test A, or seven years. Our main objective was to determine whether a higher number of points can improve the significance of the planet peaks that were below the FAP (but at the proper period) in blind test A, as well as the missed planets for the most active stars. We considered two strategies. In the first (blind test O1), we observed 100 points over the same duration as before, which could correspond to a situation where the star has been identified as active (e.g. from previous photometric data), and which may require more points to reach a good performance. The second strategy (blind tests O2), with 100 points covering seven years, could correspond to stars with no detection during the first 3.5 yr but the presence of a peak above the noise level (with a peak S/N defined as in Sect. 4.2.5) although below the FAP, high enough to suggest the possible presence of a planet (candidate), requiring additional data. With 100 points, the detection rates of injected planets are very close to 100%, which is better than the 50-point configuration. The average rates are shown in Table F.2. There is not strong difference between the 3.5 y and 7 y coverages. We note that the false positive level without a planet is slightly higher than for 50 points given a similar criterion on the FAP level, so that we caution about this side effect. It may be mitigated, for example, by also considering subsets of data.

Appendix G Comparison of detection rates using the different approaches

In this section, we compare the detection rates obtained in previous sections using different approaches for a 1 MEarth planet. The detection rates versus spectral types are shown in Fig. G.1. We first compare the rates obtained with the S/N threshold and the theoretical fp. The two correspond reasonably well for the SN > 1 computation, while SN > 2 leads to very poor rates compared to what could be theoretically be achieved. The agreement is best for fp based on power (frequential analysis). For PHZin, ΔTspot2, the rates from fp are slightly better than those obtained with SN > 1.

Figure G.1 also allows us to compare the detection rates obtained with the true fp (frequential analysis) with the blind tests. Because blind test A was implemented for the whole habitable zone (dashed lines), that is, the periods were randomly chosen in the whole habitable zone, we also performed blind tests dedicated to the three orbital periods (blind tests N1, N2, andN3, see Appendix F.10) for a proper comparison. The trends are similar, but for PHZin and PHZmed, for which the detection rates are not as close to 100% as for PHZout, the detection rates are much lower from the blind test compared to the use of the true fp level (frequential analysis). The reason is that the FAP evaluation in the blind test is overestimated compared to fp (and particularly when a planet is present), meaning that the effective false positive level in the blind test is much lower than 1% and some planets are undetected even thoughthe highest peak is at the true planet period. This leads to a significant difference between the two estimates.

thumbnail Fig. G.1

Comparison of detection rates vs. spectral type for different ΔTspot and orbital periods in the habitable zone for different conditions: based on SN > 1 (solid black lines), on SN > 2 (dashed black lines), true 1% fp from frequential analysis (brown lines), true 1% fp from temporal analysis (red lines), and blind test A (dashed green line, covering the whole habitable zone), and blind tests N (green lines).

Finally, we underline that the performance is very sensitive to the position inside the habitable zone. The performance is excellent in the outer part of the habitable zone, but significantly poorer (for the distance and planet mass considered here) in the inner part.

References

  1. Bastian, U., & Hefele, H. 2005, ESA SP, 576, 215 [Google Scholar]
  2. Berdyugina, S. V. 2005, Liv. Rev. Sol. Phys., 2, 8 [Google Scholar]
  3. Boisse, I., Bonfils, X., & Santos, N. C. 2012, A&A, 545, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Borgniet, S., Meunier, N., & Lagrange, A.-M. 2015, A&A, 581, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  5. Brothwell, R. D., Watson, C. A., Hébrard, G., et al. 2014, MNRAS, 440, 3392 [NASA ADS] [CrossRef] [Google Scholar]
  6. Casertano, S., & Sozzetti, A. 1999, ASP Conf. Ser., 194, 171 [Google Scholar]
  7. Catanzarite, J., Law, N., & Shao, M. 2008, Proc. SPIE, 7013, 70132K [CrossRef] [Google Scholar]
  8. Chiavassa, A., Caldas, A., Selsis, F., et al. 2017, A&A, 597, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Crouzier, A., Malbet, F., Henault, F., et al. 2016, A&A, 595, A108 [CrossRef] [EDP Sciences] [Google Scholar]
  10. Desort, M., Lagrange, A.-M., Galland, F., Udry, S., & Mayor, M. 2007, A&A, 473, 983 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Dumusque, X. 2016, A&A, 593, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. Dumusque, X., Boisse, I., & Santos, N. C. 2014, ApJ, 796, 132 [NASA ADS] [CrossRef] [Google Scholar]
  13. Dumusque, X., Borsa, F., Damasso, M., et al. 2017, A&A, 598, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Eisner, J. A., & Kulkarni, S. R. 2001, ApJ, 561, 1107 [NASA ADS] [CrossRef] [Google Scholar]
  15. Eisner, J. A., & Kulkarni, S. R. 2002, ApJ, 574, 426 [NASA ADS] [CrossRef] [Google Scholar]
  16. Eriksson, U., & Lindegren, L. 2007, A&A, 476, 1389 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Fabrycky, D. C., & Winn, J. N. 2009, ApJ, 696, 1230 [NASA ADS] [CrossRef] [Google Scholar]
  18. Ford, E. B. 2004, PASP, 116, 1083 [CrossRef] [Google Scholar]
  19. Hatzes, A. P. 2002, Astron. Nachr., 323, 392 [NASA ADS] [CrossRef] [Google Scholar]
  20. Haywood, R. D., Collier Cameron, A., Unruh, Y. C., et al. 2016, MNRAS, 457, 3637 [NASA ADS] [CrossRef] [Google Scholar]
  21. Janson, M., Brandeker, A., Boehm, C., & Martins, A. K. 2018, Handbook of Exoplanets (Cham: Springer), 87 [Google Scholar]
  22. Jones, B. W., Sleep, P. N., & Underwood, D. R. 2006, ApJ, 649, 1010 [NASA ADS] [CrossRef] [Google Scholar]
  23. Kasting, J. F., Whitmire, D. P., & Reynolds, R. T. 1993, Icarus, 101, 108 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  24. Lagrange, A.-M., Desort, M., & Meunier, N. 2010, A&A, 512, A38 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  25. Lagrange, A.-M., Meunier, N., Desort, M., & Malbet, F. 2011, A&A, 528, L9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Lannier, J., Lagrange, A. M., Bonavita, M., et al. 2017, A&A, 603, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Léger, A., Defrère, D., Malbet, F., Labadie, L., & Absil, O. 2015, ApJ, 808, 194 [NASA ADS] [CrossRef] [Google Scholar]
  28. Lockwood, G. W., Skiff, B. A., Henry, G. W., et al. 2007, ApJS, 171, 260 [NASA ADS] [CrossRef] [Google Scholar]
  29. Makarov, V. V., Beichman, C. A., Catanzarite, J. H., et al. 2009, ApJ, 707, L73 [NASA ADS] [CrossRef] [Google Scholar]
  30. Makarov, V. V., Parker, D., & Ulrich, R. K. 2010, ApJ, 717, 1202 [NASA ADS] [CrossRef] [Google Scholar]
  31. Malbet, F., Léger, A., Shao, M., et al. 2012, Exp. Astron., 34, 385 [NASA ADS] [CrossRef] [Google Scholar]
  32. Malbet, F., Abbas, U., Alves, J., et al. 2019, ArXiv e-prints [arXiv:1910.08028] [Google Scholar]
  33. Mamajek, E. E., & Hillenbrand, L. A. 2008, ApJ, 687, 1264 [NASA ADS] [CrossRef] [Google Scholar]
  34. Marcy, G. W., Fischer, D. A., McCarthy, C., & Ford, E. B. 2005, ASP Conf. Ser., 338, 191 [Google Scholar]
  35. Meunier, N., & Lagrange, A. M. 2019a, A&A, 625, L6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. Meunier, N., & Lagrange, A. M. 2019b, A&A, 628, A125 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  37. Meunier, N., & Lagrange, A. M. 2019c, A&A, 629, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Meunier, N., & Lagrange, A. M. 2020a, A&A, 638, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  39. Meunier, N., & Lagrange, A.-M. 2020b, A&A, 642, A157 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Meunier, N., Desort, M., & Lagrange, A.-M. 2010, A&A, 512, A39 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Meunier, N., Lagrange, A.-M., & De Bondt, K. 2012, A&A, 545, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Meunier, N., Lagrange, A.-M., Borgniet, S., & Rieutord, M. 2015, A&A, 583, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Meunier, N., Lagrange, A. M., Boulet, T., & Borgniet, S. 2019a, A&A, 627, A56 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Meunier, N., Lagrange, A.-M., & Cuzacq, S. 2019b, A&A, 632, A81 [CrossRef] [EDP Sciences] [Google Scholar]
  45. Norris, C. 2018, PhD thesis, Imperial College London, UK [Google Scholar]
  46. Pont, F., Knutson, H., Gilliland, R. L., Moutou, C., & Charbonneau, D. 2008, MNRAS, 385, 109 [Google Scholar]
  47. Rauer, H., Catala, C., Aerts, C., et al. 2014, Exp. Astron., 38, 249 [NASA ADS] [CrossRef] [Google Scholar]
  48. Reffert, S., Launhardt, R., Hekker, S., et al. 2005, ASP Conf. Ser., 338, 81 [Google Scholar]
  49. Saar, S. H., & Donahue, R. A. 1997, ApJ, 485, 319 [NASA ADS] [CrossRef] [Google Scholar]
  50. Saar, S. H., Hatzes, A., Cochran, W., & Paulson, D. 2003, The Future of Cool-Star Astrophysics: 12th Cambridge Workshop on Cool Stars, Stellar Systems, and the Sun, eds. A. Brown, G. M. Harper, & T. R. Ayres, 694 [Google Scholar]
  51. Shao, M., Livermore, T. R., Wolff, D. M., Yu, J. W., & Colavita, M. M. 1995, AAS Meeting Abstracts, 187, 71.03 [Google Scholar]
  52. Silva, A. V. R. 2003, ApJ, 585, L147 [NASA ADS] [CrossRef] [Google Scholar]
  53. Southworth, J. 2011, MNRAS, 417, 2166 [NASA ADS] [CrossRef] [Google Scholar]
  54. Sozzetti, A. 2005, PASP, 117, 1021 [NASA ADS] [CrossRef] [Google Scholar]
  55. Sozzetti, A., Casertano, S., Brown, R. A., & Lattanzi, M. G. 2002, PASP, 114, 1173 [NASA ADS] [CrossRef] [Google Scholar]
  56. Svensson, F., & Ludwig, H.-G. 2005, ESA SP, 560, 979 [Google Scholar]
  57. Theia Collaboration (Boehm, C., et al.) 2017, ArXiv e-prints [arXiv:1707.01348] [Google Scholar]
  58. Traub, W. A., Beichman, C., Boden, A. F., et al. 2010, EAS Publ. Ser., 42, 191 [CrossRef] [Google Scholar]
  59. Triaud, A. H. M. J., Collier Cameron, A., Queloz, D., et al. 2010, A&A, 524, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  60. Wright, J. T. 2005, PASP, 117, 657 [NASA ADS] [CrossRef] [Google Scholar]
  61. Zaninetti, L. 2008, Serb. Astron. J., 177, 73 [NASA ADS] [CrossRef] [Google Scholar]

1

Orbital obliquity observations for transiting planetary systems, http://www.astro.keele.ac.uk/jkt/tepcat/obliquity.html

2

Assuming a distribution close to the edge-on configuration does not significantly change the results.

3

A dedicated blind test similar to those performed in Sect. 4 shows that the results are not significantly impacted.

All Tables

Table 1

Categories of results from blind tests.

Table 2

Average detection rates and poor recovery rates (%).

Table F.1

Blind tests.

Table F.2

Average rates for all blind tests.

Table F.3

Detection rates (in percent) depending on planet orbit inclination with respect to the line of sight.

All Figures

thumbnail Fig. 1

Rms of the activity time series in both directions for (from upper to lower panel) the X direction and ΔTspot1, the Y direction and ΔTspot1, the X direction and ΔTspot2, and the Y direction and ΔTspot2. Left column:rms of the astrometric signal vs. spectral type for the ten inclinations (from yellow for pole-on configurations to blue for edge-on configurations), and one simulation out of four for clarity (small dots) and binned in spectral type (circles). The horizontal dashed line is the solar value from Lagrange et al. (2011). Right column: same vs. logRHK$\log R&#x0027;_{\textrm{HK}}$.

In the text
thumbnail Fig. 2

Upper panels: average rms of the astrometric signal in the X direction vs. rms of photometric signal (left), rms of RV signal (middle), and average filling factor (right), for the ten inclinations (colour code as in Fig. 1) and all spectral types. Solid curves represent ΔTspot1 simulations, and dashed lines represent ΔTspot2 simulations. Lower panels: same for the Y direction.

In the text
thumbnail Fig. 3

Left panels: distribution of the difference between fitted period and true period for ΔTspot1 (solid) and ΔTspot2 (dashed), and for the lower HZ (black), medium HZ (red), and upper HZ (green) for all simulations (and therefore all spectral types), the distribution width for all simulations vs. spectral types, and the median of the difference between fitted and true period vs. spectral type. Right panels: same for the mass.

In the text
thumbnail Fig. 4

Detection rate vs. spectral type for 1 MEarth planet for ΔTspot1 (solid) and ΔTspot2 (dashed) forthe temporal analysis (upper panel) and frequential analysis (lower panel), and for different orbital periods: lower HZ (black), medium HZ (red), and upper HZ (green).

In the text
thumbnail Fig. 5

Distribution of the S/N for simulations above the 1% fp level and frequential analysis for different configurations: ΔTspot1 (solid line) and ΔTspot2 (dashed line), and lower HZ (black), medium (red), and upper HZ (green).

In the text
thumbnail Fig. 6

First column: detection limits based on a true fp level (based on power and frequential analysis) vs. spectral type for a 50% detection rate (first panel), for a 95% detection rate (second panel), and a for 99% detection rate (third panel). Curves are for ΔTspot1 (solid) and ΔTspot2 (dashed), and for the lower HZ (black), medium HZ (red), and upper HZ (green). Second column: same for fp based on a temporal analysis (based on mass).

In the text
thumbnail Fig. 7

Upper panels: distribution of period (left) and mass (right) difference (between found and true period) for ΔTspot1 (solid line) and ΔTspot2 (dashed line) from blind tests A. The period plots are for the period corresponding to the highest peak in the periodogram (black) and fitted on the temporal series (red). Middle panels: median period and mass vs. spectral type. The true value is shown in green. The dotted lines in the period plot indicate the boundaries of the habitable zone we consider. Lower panels: distribution width (period difference and mass difference) vs. spectral type.

In the text
thumbnail Fig. 8

Distribution of FAP values for ΔTspot1(upper panel) and ΔTspot2 (lower panel) when a planet is injected (solid line) and when no planet is injected (dashed line).

In the text
thumbnail Fig. 9

Distribution of S/N for ΔTspot1 and detected planets: global S/N (black), peak S/N (over the whole range, red), and peak S/N (over period higher than 100 days, green).

In the text
thumbnail Fig. 10

Period of the maximum peak in the periodogram vs. rotation rate for ΔTspot1 (left panels) and ΔTspot2 (right panels) and in two cases. First row: false positive level when no planet is injected (black) and when a planet is injected (red). Stars correspond to peaks outside the habitable zone and circles to peaks inside the habitable zone. The brown line indicates the position of the rotation rate. The two black curves represent the extent of the position of the inner side of the habitable zone (for different spectral types), and the green curves show the position of the outer side of the habitable zone. Second row: second highest peak in the periodogram, those in green indicate the peaks above the FAP. The brown line is similar to the first row.

In the text
thumbnail Fig. A.1

Upper panel: distribution of the angle between the stellar equatorial plane and the planet orbital plane Ψ assuming that transiting planets are seen edge-on seen (solid line) and assuming that transiting planets are seen with a small dispersion around the edge-on configuration (dashed line). Lower panel: same for the cumulated distribution.

In the text
thumbnail Fig. B.1

Detection rate (above 1% false positive level) vs. mass for G2 stars, PHZin, and ΔTspot2 for the frequential (black) and temporal analysis (red). The vertical lines indicate the position of the 50% detection rate levels.

In the text
thumbnail Fig. B.2

Illustration of fp computation for K2 stars, all inclinations, ΔTspot2,PHZin, representing power (frequential analysis, upper panel) and fitted mass (temporal analysis, lower panel) vs. rms of logRHK$\log R&#x0027;_{\textrm{HK}}$ for no planet (black) and with an injected planet (orange, 1 MEarth). The green line corresponds to a constant fp level of 1%, and the red line shows a fp level depending on the rms of logRHK$\log R&#x0027;_{\textrm{HK}}$. The green and red dots show the upper and lower bounds corresponding to the criterion (see text).

In the text
thumbnail Fig. B.3

False positive fp averaged over all simulations vs. inclination for different PHZ (inner side in black, middle in red, and outer side in green) and for ΔTspot1 (solid line) and ΔTspot2 (dashed line) for the frequential analysis (upper panel) and temporal analysis (lower panel).

In the text
thumbnail Fig. B.4

Detection rates vs. spectral type for 1 MEarth planet based for the frequential analysis (upper panel) and temporal analysis (lower panels), separately for ten inclinations (colour code as in Fig. 3). The detection rates are shown for ΔTspot2 and the inner side of the habitable zone. The fp levels depend on inclination for the right panels, but they do not for the left panels.

In the text
thumbnail Fig. B.5

Same as Fig. B.4 for fp levels depending on activity variability. The colour code corresponds to different activity variability levels (based on the rms of logRHK$\log R&#x0027;_{\textrm{HK}}$): quiet (orange), intermediate (red), and active (green).

In the text
thumbnail Fig. C.1

rms (left) and skewness (right) distributions of the activity time series in both directions for (from upper to lower panel) the X direction and ΔTspot1, the Y direction and ΔTspot1, the X direction and ΔTspot2, and the Y direction and ΔTspot2. Different curves show the ten inclinations (same colour code as in Fig. 1).

In the text
thumbnail Fig. C.2

rms of time series in the X direction (upper panels) and the Y direction (middle panels), and combining both directions (low panels) for ΔTspot1 (left) and ΔTspot2 (right). The colour code represents the spectral type from F6 (yellow) to K4 (blue).

In the text
thumbnail Fig. C.3

Distribution of rms in the X direction (left), of rms in the Y direction (middle) and of skewness in the Y direction (right) for ten inclinations (the colour code is similar to Fig. 1) for different contributions: spots with ΔTspot1 (upper panels), spots alone ΔTspot2 (middle panels), and plages alone (lower panels).

In the text
thumbnail Fig. C.4

First panel: fraction of simulations in which the rms is higher in Y than in X direction for ΔTspot1 (solid) and ΔTspot2 (dashed). The colour code represents different θmax values: solar (red), solar+10° (green), solar+20° (blue), and all (black). Second panel: same for the fraction of simulations with a positive skewness in the Y direction. Third panel: distribution of the angles at which the skewness in Y is highest (ΔTspot1 only) for the ten inclinations (same colour code as Fig. 3). Fourth panel: same for ΔTspot2.

In the text
thumbnail Fig. C.5

Left column, from top to bottom: distribution of different variables characterising activity for ΔTspot1 and for different time series: complete without noise (solid lines), degraded sampling without noise (dashed lines), and degraded sampling with noise (dotted lines). Variables are the angle at which the skewness is highest (90° and 270° represent the Y direction), inclination for simulations in which the rms in this direction is higher than in the orthogonal direction, and inclination for simulations in which the rms in this direction is lower than in the orthogonal direction. Right column: same for ΔTspot2.

In the text
thumbnail Fig. C.6

Rms of astrometric signal vs. rms photometry for various configurations (spectral type and inclination) for ΔTspot1 (black) and ΔTspot2 (red).

In the text
thumbnail Fig. D.1

First row: detection rate vs. spectral type deduced from different S/N thresholds (from left to right: 1, 2, 3, and 4) for ΔTspot1 (solid) and ΔTspot2 (dashed) for a 1 MEarth planet at different orbital periods: lower HZ (black), medium HZ (red), and upper HZ (green). Curves that are not visible are at the 100% level S/N thresholds 1 and 2, and at the 0% level for thresholds of 3 and 4. Second row: mass detection limit with a 50% confidence level for the same criteria (same colour and line code). Third row: same with a 95% confidence level.

In the text
thumbnail Fig. E.1

Distribution of the FAP/fp ratio (upper panel), and percentage of simulations for which the FAPis higher than the fp (lower panel) vs. spectral type. The solid lines correspond to ΔTspot1 and the dashed lines to ΔTspot2. The thin lines are for a constant fp and the thick lines for an fp dependent on stellar variability.

In the text
thumbnail Fig. E.2

LPA caracterisation for G2 stars (left) and K2 stars (right). First row: distribution of the LPA detection limits for ΔTspot1 (solid lines) and ΔTspot2 (dashed lines) for the different HZ periods: lower HZ (black), medium HZ (red), and upper HZ (green). Third row: same for the distribution of the detection rates corresponding to the LPA masses. Fourth row: same for the distribution of exclusion rates corresponding to the LPA masses.

In the text
thumbnail Fig. F.1

Percentage of good and poor recovery rates vs. spectral type for 1% FAP level (two first rows) and 0.1% FAP level (two last rows) from blind tests A. The good recovery rates correspond to the case without a planet (solid black line) and with a planet (dashed green line). The poor recovery rates correspond to the case without a planet (solid black line, false positive) and with a planet: incorrect planets (red dashed line, false positive), rejected planets (blue dashed line), and missed planets (brown dashed line). In the third row all curves except one are at the 100% level, and most poor recovery curves are close to 0%.

In the text
thumbnail Fig. G.1

Comparison of detection rates vs. spectral type for different ΔTspot and orbital periods in the habitable zone for different conditions: based on SN > 1 (solid black lines), on SN > 2 (dashed black lines), true 1% fp from frequential analysis (brown lines), true 1% fp from temporal analysis (red lines), and blind test A (dashed green line, covering the whole habitable zone), and blind tests N (green lines).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.