Open Access
Issue
A&A
Volume 689, September 2024
Article Number A229
Number of page(s) 19
Section Stellar structure and evolution
DOI https://doi.org/10.1051/0004-6361/202449893
Published online 17 September 2024

© The Authors 2024

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

An important aspect of the upcoming Planetary Transits and Oscillations of Stars mission (PLATO, Rauer et al. 2014) will be its ability to investigate stellar surface rotation and photometric activity modulations of a large sample of stars with convective outer layers, both on the main sequence and at the subgiant stage of evolution. Indeed, as active regions travel over the observed stellar disc, the apparent brightness of the star experiences modulations with periodicities directly related to photospheric surface rotation. Over a longer time frame, the amplitude of these modulations varies due to stellar activity cycles, as observed in the case of the 11 year solar Schwabe cycle (e.g. Salabert et al. 2017). Surface rotation is a crucial proxy with which to estimate stellar ages through gyrochronology (e.g. Skumanich 1972; Barnes 2003, 2007), while rotation–activity relations are directly linked to dynamo processes in the convective envelope (see e.g. Brun & Browning 2017, and references therein).

Space missions dedicated to long-baseline photometry or including such aspects, such as the Convection, Rotation and planetary Transits satellite (CoRoT, Baglin et al. 2006), Kepler (Borucki et al. 2010), K2 (Howell et al. 2014), the Transiting Exoplanet Survey Satellite (TESS, Ricker et al. 2015), or the Gaia mission (Gaia Collaboration 2016), have allowed measurements of stellar surface rotation for tens of thousands of low-mass stars, from the main sequence to evolved stages (Nielsen et al. 2013; Reinhold et al. 2013; García et al. 2014; McQuillan et al. 2014; Lanzafame et al. 2018; Santos et al. 2019, 2021a; Reinhold & Hekker 2020; Gordon et al. 2021; Holcomb et al. 2022; Distefano et al. 2023; Reinhold et al. 2023; Claytor et al. 2024). The availability of such a large amount of data allowed studies of the connection between stellar rotation, age, and activity (e.g. Masuda 2022a,b; Santos et al. 2023; Mathur et al. 2023), between stellar rotation and metallicity (e.g. Amard et al. 2020; See et al. 2021, 2023), dynamo processes (e.g. Cao et al. 2023), the behaviour of magnetic braking with stellar age (e.g. van Saders et al. 2016, 2019; Hall et al. 2021), the transition between different fast-rotating regimes (e.g. Lanzafame et al. 2019), and the relation between rotation and stellar flares (e.g. Yang & Liu 2019; Notsu et al. 2019; Raetz et al. 2020). Additional connections with structural and evolutionary aspects have also been explored by characterising the convection–rotation regimes in the envelope via the estimation of the Rossby number (e.g. Corsaro et al. 2021; Noraz et al. 2022) and the role of core–envelope decoupling in rotational evolution (e.g. Spada & Lanzafame 2020; Angus et al. 2020; Lu et al. 2022). Studies related to exoplanetary science could also be performed by analysing the influence of rotation over the orbital architecture of exoplanets (e.g. McQuillan et al. 2013b; Messias et al. 2022; García et al. 2023) and the potential connection between the presence of short-periodic planets and fast stellar rotation with respect to stellar age (Ahuir et al. 2021; Deeg et al. 2023). Such measurements are also useful for asteroseismology in order to investigate the connection between rotation, activity, and pulsations (e.g. Mathur et al. 2019). Finally, they can be used to perform physically motivated starspot modelling and characterisation of active features (e.g. Lanza et al. 2009, 2019; Özavcı et al. 2018; de Freitas et al. 2021; Breton et al. 2024).

In this paper, we describe the algorithms that will be implemented in the PLATO pipeline module dedicated to analysing the surface rotation and activity of the main sequence and subgiant stars observed by the mission. To demonstrate the performances of the algorithms, we applied them to a realistic set of simulated light curves and assessed how the inclusion of noise and PLATO systematics impacts the recovery of observables. The layout of the paper is as follows. In Sect. 2, we describe the algorithms that will be implemented in the module and we provide the corresponding physical motivations. In Sect. 3, we present a set of noise-free and noisy PLATO simulated light curves that were generated for this work. In particular, in the case of noisy light curves, we describe how we handle the correction of the PLATO systematics included in the simulated data. In Sect. 4, we show the results we obtain when applying the algorithms to the simulated light curves. Conclusions and perspectives for future developments are provided in Sect. 5.

2. Descriptions of the algorithms

The flow diagram of the module is shown in Fig. 1 and the corresponding elements are described in the following subsections. The diagram is simplified in the sense that, for sake of clarity and concision, we indicate only the physical meaning of the module inputs and outputs, without describing the corresponding data structures in detail. The aim of this work is to show the physical motivations of the implemented algorithms. In what follows, we therefore describe the abilities of the module thematically, referring each time to the corresponding submodule of the flow diagram.

thumbnail Fig. 1.

Flow diagram for the module. The main inputs for the module, the light curves, and the periodograms are shown on the left. The different module outputs are encircled in red.

All analyses in this work are performed using the unofficial open-source prototype module star-privateer1, which will be used as a basis for the algorithm implementation in the PLATO pipeline.

2.1. Inputs

For each target star, two light curves obtained from the same raw data –but with distinct calibrations– are provided to the module. The first light curve, LC1, preserves low-frequency variability (periods beyond 60 days) and uses a two-hour sampling. This sampling is well suited to looking for rotational modulations while reducing the computing time of some of the algorithms compared to analysing light curves with the 25 second sampling that PLATO will also be able to provide. The second light curve, LC2, follows the same calibration procedure as LC1 but an additional high-pass filter with a cutoff of 60 days is applied. The effect of the filtering on the variability included in the light curve is illustrated in Appendix B.3. The generalised Lomb-Scargle periodograms (GLS, Lomb 1976; Scargle 1982; Zechmeister & Kürster 2009) of the two light curves are also provided to the module. These periodograms have a frequency binning chosen to ensure independence from one frequency bin to another (therefore with noise distribution following a χ2 with two degrees of freedom), which corresponds to the periodogram resolution obtained when applying a discrete Fourier transform algorithm to the light curve: consecutive frequency bins are separated by an interval of 1/tobs, where obs is the length of the light curve. It should be noted that the periodogram of the 25s light curve is also provided to the module but is only used for the background fitting step of Submodule 1 (see Sect. 2.4).

2.2. Stellar rotation

2.2.1. Average rotation period

Firstly, rotational modulations are searched for in the periodogram of LC2 (Submodule 1). In order to obtain a proper estimation of the false-alarm probability of the rotation peaks, a quick non-parametric estimation of the mean background power level in the periodogram is computed with the following methodology: a logarithmically spaced frequency vector of n = 10 bins is constructed, and the corresponding power values b are computed as

b ( ν k ) = ( 9 / 8 ) 3 × median ( { x i } ) , $$ \begin{aligned} b (\nu _k) = (9/8)^3 \times \mathrm{median} \, (\{x_i\}) , \end{aligned} $$(1)

where {xi} is the set of power values for which the frequency νk in the resampled vector is the closest to νi, the frequency corresponding to xi. The non-parametric background power level is then computed by interpolating b for each frequency bin of the input periodogram. The use of a corrected median as the metric is motivated by the fact that we aim to estimate the mean power of the periodogram without including the power excess of the rotation peaks. Dividing the periodogram by this background estimation, we obtain a signal-to-noise-ratio (S/N) periodogram and we can now compute false-alarm probabilities assuming that the likelihood follows a χ2 with two degrees of freedom (Woodard 1984) with a mean of unity. The false-alarm probability corresponding to the probability that a given peak of height z is due to a noise fluctuation is computed for each peak as

p FA = e z , $$ \begin{aligned} p_{\rm FA} = \mathrm{e} ^{-z} , \end{aligned} $$(2)

and only peaks with a false-alarm probability of smaller than 10−6 are considered.

The largest peak in the periodogram is then fitted using a Gaussian profile. The fitted profile is subtracted from the periodogram and the procedure is repeated until there remains no peak above the false-alarm probability threshold.

As we look for signal in the 60 day filtered LC2 periodogram for stellar rotation, we consider only peaks with period below 60 days. Given the position of the photometric detection edge at slow rotation encountered in the Kepler mission (see e.g. Santos et al. 2021a; Masuda 2022b), this cutoff should allow us to recover the rotation period for the vast majority of stars where rotational modulations are detectable. The period of the largest fitted peak is kept as a candidate for the average rotation period, PGLS, while the other fitted peaks are retained to be analysed as possible differential rotation signatures. The corresponding uncertainties for each peak are taken as half width at half maximum (HWHM) of the fitted Gaussian profile. The spread of power (both in terms of peak width and harmonic pattern) from rotational modulation in the periodogram is due to the combined effect of active region lifetime, differential rotation, and stellar inclination. The method described above allows us to capture the fine structure of this power distribution, especially for slower rotators where the ratio between spot lifetime and rotation period is smaller. Nevertheless, this means that the uncertainty on the average rotation period might be underestimated. To overcome this issue, we smooth the periodogram with a triangular window with a width of one-tenth of the period of the peak of largest amplitude. We then fit a Gaussian profile centred on this period, and we use its HWHM as the uncertainty over PGLS.

Secondly, rotational modulations are searched for in LC2 by computing its auto-correlation function (ACF, McQuillan et al. 2013a) in Submodule 2. Selecting the period corresponding to the highest peak in the Lomb-Scargle of the ACF, a Gaussian window with width equal to one-tenth of this period is applied in order to smooth the ACF. The location of the first local maximum of the ACF is then selected as a candidate value for the average rotation period, PACF. The ACF value at this period is referred to as GACF (it should be noted that this is a slightly different definition from what is used in Ceillier et al. 2017). In addition, the peak height, HACF, defined as the mean difference between GACF and the two surrounding local minima, is also extracted. Given the difficulty in estimating a meaningful uncertainty on the periods obtained with the ACF, in its current state the module does not provide an uncertainty value for the ACF measurements.

In order to increase the amplitude of signals exhibited both by the periodogram and the ACF while decreasing those of signals appearing in only one method, the composite spectrum (CS; Ceillier et al. 2016, 2017) is computed in Submodule 3 by multiplying the ACF by the interpolated LC2 periodogram, normalised with its maximal value. Assuming a normal distribution, the largest peak of the CS is fitted with a Gaussian profile and is selected as a candidate value for the average rotation period, PCS.

Considering these three candidate values and their uncertainties, the stellar rotation period Prot is selected in Submodule 3 using an updated version of the Random fOrest Over STEllar Rotation methodology (ROOSTER, Breton et al. 2021). The ROOSTER implementation in the PLATO pipeline uses two random forest classifiers (Breiman et al. 1984; Breiman 2001), which correspond to the RotClass and PeriodSel classifiers described in Breton et al. (2021). We remind the reader that the methodology from Breton et al. (2021) included a third classifier, PollFlag, in order to flag possible contaminants in the rotating sample. Nevertheless such an identification role is not devoted to the PLATO rotation and activity module, and this third classifier is left out of the analysis presented here. RotClass is used to assess the robustness of the detection of rotational modulation in the light curve; it yields a rotation score ranging from 0 to 1, corresponding to the fraction of decision trees in the forest that provided a detection for rotation. This means that the classifier favours a signal detection related to rotation when this score is larger than 0.5, and that it favours an instrumental or a noise fluctuation origin otherwise. Considering the three candidate values provided by the periodogram fit, the ACF analysis, and the CS computation, PeriodSel then selects the stellar rotation period and its corresponding uncertainties. It should therefore be noted that the uncertainty on Prot will be missing if PACF is selected. The two classifiers are trained with the same input parameters and hyperparameters. These input parameters include the candidate period values, the corresponding activity indexes (these are described in more detail in Sect. 2.3.1), and other control parameters such as HACF and GACF. It should be noted that, for completeness, the module will provide a rotation period for every target, along with the corresponding rotation score.

We briefly discuss some of the differences between the updated ROOSTER methodology used in this work and the original version presented in Breton et al. (2021). We note that, in the original version, ROOSTER had to select between nine candidate periods. Indeed three calibrated light curves were considered for each target, differing by the choice of cutoff period that was made for the high-pass filter applied in order to mitigate Kepler instrumental effects: 20, 55, and 80 days. From each of these light curves, three candidate periods were extracted, and, in the end, the nine possible values were considered by ROOSTER. For PLATO, considering the fact that the Level-0/Level-1 light-curve calibration performed upstream from the stellar analysis system should allow significant mitigation of the effects of PLATO instrumental systematic errors, the choice was made to consider a unique light curve filtered at 60 days, hence reducing the number of candidate periods from nine to three. We also point out that the number of training parameters is significantly reduced with respect to the methodology presented in Breton et al. (2021). A large number of training parameters were included by these authors in order to study which of them had the greatest influence on the classification results. Building on the results of this previous analysis, we decided here to retain only those parameters that were found by Breton et al. (2021) to have the most significant affect. In what follows, we show how the analysis of the simulated light curves presented in Sect. 3 allows us to validate this choice.

Finally, the stellar Rossby number, Ro, which is connected to the differential rotation regime in the envelope (see e.g. Brun et al. 2017), is estimated from Prot and the effective temperature, Teff, using the prescription from Noraz et al. (2022). In order to avoid dealing with the uncertainties related to the exact value of the solar Rossby number, R o, the Ro value provided by the module is normalised by R o

Ro R o = P rot P rot , ( T eff T eff , ) 3.29 . $$ \begin{aligned} \frac{Ro}{{Ro_\odot }} = \frac{P_\mathrm{rot} }{P_{\rm rot,\odot }} \left( \frac{T_\mathrm{eff} }{T_{\rm eff,\odot }} \right)^{3.29} . \end{aligned} $$(3)

2.2.2. Differential rotation

One of the goals of the PLATO rotation and activity analysis module is to directly detect and analyse signatures related to latitudinal differential rotation. These measurements are crucial in order to validate magnetohydrodynamic simulation assumptions about the connection between Ro and the differential rotation regime, and provide observational constraints for stellar dynamo models (e.g. Jouve et al. 2008). However, hare-and-hounds exercises performed in the past (see e.g. Aigrain et al. 2015) demonstrated that, because of important degeneracies between differential rotation, stellar inclination, spot latitude, and lifetime (Santos et al. 2017; Basri & Shah 2020), techniques developed to measure differential rotation directly from the properties of peaks in the periodogram lack robustness and fail in a large proportion of cases. As the PLATO tool to measure differential rotation is still under development, we decided to defer this topic to future work.

2.3. Long-term modulations

2.3.1. Activity index

The average photometric activity index, ⟨Sph⟩ (García et al. 2014; Mathur et al. 2014b), can be computed once Prot is known (Submodule 3). First, the Sph time series is computed as the standard deviation of light-curve segments of length 5 × Prot. Here, ⟨Sph⟩ is taken as the mean of this time series, corrected by subtracting the photon noise. This method provides a reliable proxy for unveiling signatures of stellar activity (Salabert et al. 2016, 2017). We remind the reader that, due to the combined impact that stellar inclination and latitude of active regions have on observed variability, this proxy has to be considered as a lower limit of the real level of photospheric activity. For the same level of starspot coverage, it should also be remembered that the observed stellar variability will be impacted by the spot-to-faculae coverage ratio and the spot distribution on the photosphere (see e.g. Basri 2018; Luger et al. 2021).

2.3.2. Cyclic modulations

Submodule 5 is designed to detect cyclic modulations in the light curves of stars observed by PLATO. These modulations can be related to year-long Schwabe-like magnetic activity cycles or to shorter periodicities analogous to the Rieger modulations (e.g. Rieger et al. 1984; Gurgenashvili et al. 2022) or the quasi-biennial oscillations (QBO, e.g. Mehta et al. 2022), both observed in the Sun. In principle, as shown by the solar cases, signatures from these shorter cycles are expected to be low-amplitude perturbations superimposed on the stronger Schwabe-like cycle.

In principle, amplitude modulations of the envelope of the light curve might be related to beating phenomena arising from differential rotation and close periodicities in the observable rotational signal (see Mathur et al. 2014a). Nevertheless, these beating periodicities affect the envelope and are not visible in the Fourier decomposition of the signal, and therefore cross-validation of the detection by comparing the Fourier analysis and the ACF analysis should allow us to minimise the occurrence of such signatures in the reported measurements.

Similarly to the search for rotational modulations, significant peaks are fitted in the LC1 periodogram beyond 100 days (Submodule 1) and the false-alarm probability is computed, keeping only signals where this probability is below 10−6. As the frequency region explored in this step has a very small number of frequency bins, the method described in Sect. 2.2.1 to estimate the background will most likely filter out the relevant signal. The S/N of the peaks, z, required to compute the false-alarm probability through Eq. (2) is therefore estimated by comparing the height of the peaks with the mean power level in the spectrum. The ACF of LC1 is then computed (Submodule 2). Before this step, in order to remove fluctuations that shorter-period rotational modulation will create, we smooth the light curve with a 60 day Gaussian window. Comparing periods extracted with the ACF to the ones fitted in the periodogram, we keep only signals agreeing within 30%; this somewhat high threshold is chosen to ensure that there is an excess of power in the LC1 periodogram in the vicinity of periodicities detected by the ACF. To minimise the number of false positives, we also consider only signals above a certain threshold in terms of GACF and HACF, depending on the length of the light curve. Our main aim here is to minimise the number of false positives in the validated periods. To limit the rate of false positives for the shortest temporal baselines, we put a stringent threshold on the length of the accepted light curves, retaining only those shorter than four years, and imposing the criteria GACF > 0.5 and HACF > 0.7 in order to validate a signal. For light curves with a length equal to four years, we relax this constraint to GACF > 0.2 and HACF > 0.5. For longer light curves, we impose GACF > 0.2 and HACF > 0.35. Comparing the results for the different temporal baselines we use in what follows, we show that this choice allows us to efficiently discard false positives while validating a significant amount of cyclic modulation recovery. We underline that these thresholds are optimised using the simulated light curves we analyse in this work. They will therefore be reassessed when dealing with real data in order to optimise the yield while limiting the number of false positives. In addition to this, complementary indicators from spectroscopic ground observations will be available for some PLATO stars. The existence of such a sample will be useful to refine the design of the algorithm dedicated to validating the existence of cyclic modulations in targets for which only space photometry is available to constrain long-term activity.

Additionally, the ACF of the Sph time series is computed and its periodogram is searched for significant peaks (Submodule 3). If their values are compatible with the periodicities measured with the periodogram and the light curve ACF, they shall be provided as additional outputs of the module.

2.4. Surface gravity and background

In addition to the algorithms for surface rotation and stellar activity presented in detail in this paper, the PLATO pipeline module will be used to measure the level of convective granulation by fitting the 25 second LC1 periodogram background profile (Submodule 1) with the high-DImensional And multi-MOdal NesteD Sampling legacy code (DIAMONDS, Corsaro & De Ridder 2014; Corsaro et al. 2015). Finally, using the Flicker in Power (FliPer, Bastien et al. 2016; Bugnet et al. 2018) random forest methodology, the module will be able to provide a model-independent estimate of the logarithm of the stellar surface gravity parameter (log g, Submodule 4).

3. A dataset of simulated light curves

3.1. Light-curve simulation procedure

The rotational modulations are simulated with the pyspot code following the procedure described in Aigrain et al. (2015). The pyspot code allows simulation of the brightness variations induced by stellar dark spots while omitting the facular contribution and neglecting limb-darkening effects. Simulated spots emerge close to active latitudes and decay with time, with a decay rate following a log-normal distribution (Martinez Pillet et al. 1993). Each simulated time series has an activity cycle period, Pcycle. During the course of the cycle, active latitudes, spot number, and area vary. As in the solar case, active centres migrate from high to low latitudes during the course of the activity cycle, allowing the reconstruction of a butterfly diagram for each simulation. Additionally, the overlap between cycles is included (e.g. Wilson et al. 1988). Differential rotation is also taken into account as the rotation period of the simulated spots is a function of their latitude of emergence. The differential rotation law used in this work is solar-like for all simulations –that is the rotation period increases with latitude. No active nesting is included in the simulations –in other words, there is no preferred active longitude for the emergence of the spots. The contribution of convective granulation is included in the simulations as Harvey models (Harvey 1985), considering the scaling laws inferred by Kallinger et al. (2014). More details are provided on this aspect in Appendix B.2.

Given that they encompass only spot rotational modulation (no faculae), spot evolution, granulation, and cyclic activity, the set of simulations we generate here is ideal. When using them to test the algorithms described in Sect. 2, we assume that we are dealing with single stars with light curves where planetary transits and flares have been completely removed.

In practice, using the PLATO Input Catalog (PIC) from Montalto et al. (2021), we select 2000 stars with parameters meeting the apparent magnitude V and effective temperature Teff pre-requisites necessary to be included in the PLATO P1 or P2 sample. The P1 sample will consist of F5 to K7 dwarf and subgiant stars with magnitudes of 8.5 ≤ V ≤ 11, while the P2 sample will consist of F5 to K7 dwarf and subgiant stars with magnitudes of V ≤ 8.5. The main parameters of each simulation are the stellar spin inclination angle i, the average rotation period Prot (used as the reference value for our period recovery exercise), the minimum and maximum rotation periods, Pmin and Pmax, the maximum latitude of activity λmax, the activity cycle length Pcycle, and the overlap between consecutive cycles δcycle. These are generated drawing from a distribution that depends on the effective temperature Teff, using the prescriptions from Meunier et al. (2019). The method used to draw these parameters is described in detail in Appendix A. In particular, it should be noted that Prot, Pmin, and Pmax are chosen such that Prot is the average between Pmin and Pmax, as can be seen from Eq. (A.10). We also account for stellar inclination in the simulations: each star has an inclination i drawn considering a uniform sin i distribution to ensure that the orientation of the stellar rotation axis is isotropic.

The rotational and activity modulations in the light curves were generated with the pyspot code. We used the PLATO Solar-Like Simulator2 (PSLS, Samadi et al. 2019) in order to add PLATO instrumental systematics, camera random noise, and convective granulation. We generated 2000 light curves with rotational modulations. With the same input parameters, we generated 2000 other light curves that include only the granulation signal. Here, these light curves without rotational modulation were used to train the ROOSTER algorithms to distinguish light curves with a rotation signature from light curves without.

The Prot versus Pcycle distribution of the simulated sample is shown in Fig. 2. The Prot values range from a few days to a few tens of days, while Pcycle are comprised within 2 and 14 years. For this reason, in this work, only light curves longer than 2 years encompass complete activity cycles for some of the simulated targets. Although, in our simulations, stars with short rotation periods are more likely to have shorter activity cycles (see e.g. Fig. 6 from Meunier et al. 2019), there is no strong correlation between Prot and Pcycle in the simulated sample (see Eq. (A.12) for the relation between Prot and Pcycle). The actual connection between Prot and Pcycle is probably more complex, and both the form of the correlation and its origin are debated (see e.g. Bonanno & Corsaro 2022).

thumbnail Fig. 2.

Rotation period Prot and cyclic periodicity Pcycle distribution of the simulated sample.

In order to compare the variability amplitude of our simulations to an observational sample, we computed the Sph distribution of the simulated light curves. We represent this distribution in Fig. 3, and we compare the Prot versus Sph of our simulated sample with the Kepler targets from Santos et al. (2021a). We also show the maximum and minimum level of photometric solar variability as measured by Salabert et al. (2016). It appears that, in terms of ⟨Sph⟩, our simulated light curves coincide with the moderate-to-low-activity Kepler targets from Santos et al. (2021a). The vast majority of them have ⟨Sph⟩ below the maximal solar level, and about half of them are below the minimal solar level of activity. We underline that it is important to focus on these types of targets, as these are the ones for which measurement of surface rotation is likely to prove challenging because of their low amplitude of variability. Moreover, these stars are also the ones for which the detection of acoustic oscillations (p modes) is expected, as high levels of magnetic activity inhibit stochastic pulsations in solar-type stars (see Chaplin et al. 2011; Mathur et al. 2019).

thumbnail Fig. 3.

Rotation period Prot and activity index Sph distribution of the simulated sample (blue) compared with the sample from Santos et al. (2021a, grey). The right panel shows the projected Sph density distribution for the two populations. The projected density distribution for the Santos et al. (2021a) targets is computed considering only stars with Sph < 400 ppm. The dashed horizontal orange lines correspond to the solar minimal and maximal values measured by Salabert et al. (2016).

Each light curve is generated on a 8 year baseline, which is the maximum time frame over which a field may be observed by the PLATO mission3. It should be noted that, in order to save computing time and storage space, after having validated that this did not introduce any bias in the photon noise level of the noisy simulations (see Appendix B.1), we directly generated the light curves with the two-hour sampling that will be used by the rotation and activity module.

In order to provide a picture of the ability of the module at the different stages of the mission, from the first data release to legacy catalogues, we performed this analysis on simulations for several different baselines: 6 months, and 1, 2, 4, 6, and 8 years. We show example simulated light curves in Fig. 4, with Prot = 21 days, Pcycle = 5.4 years, δcycle = 0.45 years, and i = 83°. The vertical coloured lines highlight the extent of the segments corresponding to each temporal baseline. At the beginning of the light curve, the simulated star is in the descending phase of its activity cycle, and the latitude of spot formation gradually comes closer to the equator. The appearance of spots at a higher latitude marks the beginning of a new cycle. It is interesting to note that, considering the different baseline we are using in this work, we are going to analyse the light curves at different stages of the activity cycle. In the example of Fig. 4, for light curves shorter than two years, only the minimum of activity of the star can be observed, and thus the ⟨Sph⟩ we would be inferring shall be biased towards small values. A four-year baseline enables us to additionally monitor the rising phase of the activity cycle (e.g. in the case of the Kepler light curve of the bright solar analogue HD 173701; see e.g. Karoff et al. 2018). A 6 year baseline is longer than the stellar activity cycle but the descending phase of the current cycle is missing. Finally, the 8 year baseline allows observation of the entire cycle and also includes the first year of a new cycle.

thumbnail Fig. 4.

Example of spot activity included in the simulations (top) and the corresponding 8 year light curve, including convective granulation (bottom). The simulation has Prot = 21 days, Pcycle = 5.4 years, δcycle = 0.45 years, and i = 83°. The vertical coloured lines highlight the extent of the light-curve segments considered in this work in addition to the full 8 year light curves: 6 months (yellow), 1 year (blue), 2 years (orange), 4 years (red), and 6 years (cyan).

3.2. PLATO systematics and random noise

The PLATO systematics included in the light curves were simulated using the PSLS abilities described in Samadi et al. (2019). For each simulation, we consider the case where the star is observed together by the four groups of six cameras of PLATO (see e.g. Nascimbeni et al. 2022). The random noise in each simulation is computed with the PLATO_SIMU option, considering the target input V magnitude and the reference PLATO simulations included in PSLS, with a medium drift level. Our set of simulations therefore represents what we can expect from good quality targets located at the centre of the PLATO field of view.

3.3. Correction of systematics

In order to analyse the noisy light curves with the algorithms described in Sect. 2, we implement a correction strategy, with the aim being to produce corrected light curves that are similar –in terms of their properties– to those that will be provided to the rotation and activity PLATO module. The first step is to apply a low-pass filter with a 400 day cutoff in order to isolate long-term variability that could be related to stellar cyclic activity. The extracted signal is subtracted from the light curves, which are then folded on a quarter-period of 90 days. At this step, the quarter modulation trend is extracted as the median value at each phase of the folding. The trend is then subtracted quarter-wise from the light curve and the filtered low trend is added again.

The two upper panels of Fig. 5 show how inclusion of the PLATO systematics affects the light curve presented in Fig. 4. The effect of the drift during each 90 day quarter is apparent. In order to illustrate how a given segment of the corrected light curve can vary depending on the total observation length considered for the correction, the two lower panels show the light curve obtained by applying our correction procedure on the first half of the light curve and on its full extent, respectively. We note that a remnant of the quarter modulation is still clearly visible at the beginning of the 4 year corrected light curve.

thumbnail Fig. 5.

Comparison between the 8 year noise-free light curve (black), the uncorrected 8 year noisy light curve (grey), the corrected 4 year light curve (blue), and the corrected 8 year light curve (orange).

Figure 6 shows a comparison of the power spectral density (PSD) of the noise-free light curve, of the uncorrected noisy light curve, and of the corrected noisy light curve. We clearly see that the PSD of the uncorrected noisy light curves is dominated by the harmonics of the 90 day quarter modulation. For the vast majority of the simulated noisy light curves, our correction procedure allows removal of the power contribution of this modulation almost entirely, as illustrated here. The impact of the random noise in the corrected light curves is visible in the PSD beyond a few μHz as a higher flat noise level in the corrected PSD than in the noise-free PSD.

thumbnail Fig. 6.

Comparison between the PSD obtained from the noise-free light curve (black), the uncorrected noisy light curve (grey), and the corrected noisy light curve (orange).

4. Performances on simulated light curves

4.1. Recovery of the rotation period

The 8000 simulated light curves (2000 with rotation and 2000 without rotation for both the noise-free and the corrected noisy cases) are analysed homogeneously, considering the periodogram, the ACF, and the CS (see Sect. 2.2.1). In Fig. 7, we show an example of this analysis. The light curve used for the example is the noise free light curve shown in Fig. 4, and in this case, the analysis is performed on a full time series after applying a high pass filter with 60 day cutoff to obtain the LC2 input as explained in Sect. 2.1 (see also Appendix B.3). The main rotation peak around 21 days is fitted both in the periodogram and in the CS. As seen in the periodogram, additional peaks are also fitted, namely the second and third harmonics of the average rotation period, close to 10 and 5 days, respectively. The ACF exhibits a very clear periodicity related to the rotational modulation. Because of the decreased signal coherence, the envelope amplitude of the ACF decays as the shift in period increases (Santos et al. 2021b).

thumbnail Fig. 7.

Example of GLS periodogram (top), ACF (middle), and CS (bottom) analyses performed to recover the rotational modulation of the light curve shown in Fig. 4, using the full time series. The GLS periodogram was normalised to account for the background contribution according to the method described in Sect. 2.2.1. The set of Gaussian profiles fitted on the GLS periodogram and the CS are shown in orange. The smoothed periodogram is shown in cyan, while the Gaussian profile fitted on it and centred on PLS is shown in blue.

The ROOSTER methodology is applied after this first step. The random forest classifiers have to be trained beforehand. To this purpose, we split the light curves between a training set and a test set. We consider noise-free and corrected noisy light curves separately. Initially, each set includes 1000 light curves with rotational modulations and 1000 stars without rotation. However, for light curves with rotational modulations, we compare the results of the previous analysis with the true rotation period. If no candidate value coincides with the true rotation period with a 10% tolerance, the light curve is removed from the set. This is done in the training set because we flag the candidate period closest to the true rotation period as the one that should be selected by the random forest classifier. If this happens in the test set, we consider that the methodology was not able to correctly measure the rotation period and, taking this into account, we correct the global accuracy score of the classifier dedicated to select the rotation periods. This corrected score corresponds to the recovery rate of our method, which, among 1000 light curves, is the fraction for which PeriodSel provides a rotation period that is consistent with the reference value to within 10%. This 10% threshold is obviously arbitrary, but chosen to be reasonably commensurable to the typical uncertainties measured on Prot (see e.g. Figs. C.1 and C.2). For all considered temporal baselines, we use the same split between training and test set. We note that the analysis performed by Breton et al. (2021) demonstrated that the performances of the methodology were not dependent on the exact composition of the training set. We also stress that the period recovery exercise performed here is designed in such a way that the training set and the test set both incorporate the same physical and instrumental ingredients, with ground truth known in both cases. This will obviously not be the case with actual PLATO observations, which could in turn affect the algorithm performance. The composition of the training set that will be used in the PLATO pipeline has yet to be determined and describing what will be its exact composition is out of the scope of this paper. To ensure that the best performance is achieved, the light curves included in the training set will be carefully selected, with the possibility to incorporate relevant light curves from previous space missions (Kepler/K2, TESS) together with PLATO simulations. It is planned that the module performance will be evaluated within the PLATO consortium after launch, once the first observations are available, and the possibility to include a subset of real PLATO targets with rotation measurements validated independently from ROOSTER is under consideration.

In what follows, we report the rotation period recovery obtained for the test set. First, using the RotClass classifier, we evaluate the accuracy with which ROOSTER is able to distinguish between light curves with rotational modulations and light curves without a rotational signal. We note that the main issue that Breton et al. (2021) encountered at this step was related to training ROOSTER to properly distinguish genuine stellar signals from instrumental modulations from Kepler.

We define the classifier accuracy as the fraction of light curves correctly classified. As summarised in Table 1, for the noise-free light curves, the accuracy of RotClass is therefore 99.2%, 99.4%, and 99.9% for the 6 month, 1 year, and 2 year light curves, respectively, and is 100% for all other time-series lengths. This accuracy slightly decreases for the corrected noisy light curves because of the occasional remnant presence of periodic quarter modulations that can be confused with rotational modulation. In this case, we obtain an accuracy of 98.5%, 98.7%, 98.9%, 99.0%, 99.5%, and 99.5% for the 6 month, 1 year, 2 year, 4 year, 6 year, and 8 year light curves, respectively. When inspecting the misclassified targets, it appears that they correspond to light curves that include rotational modulations but with an apparent rotational signal of very low amplitude comparable to or below the level of remnant instrumental noise, generally because the simulated star is in a minimum of activity or because the stellar inclination is low (almost pole-on view). Nevertheless, the overall accuracy of this step remains very high even for the noisy data.

Table 1.

Summary of the results obtained for the ROOSTER RotClass classifier dedicated to assess the presence of a rotational modulation.

We then assess the quality of the period retrieval by ROOSTER PeriodSel. The summary of this analysis is shown in Fig. 8 for the noise-free light curves, and in Fig. 9 for the corrected noisy light curves. These two figures show comparisons of the rotation period recovery for the different temporal baselines considered for the exercise. We show the relative error δProt = (Prot, recovered − Prot, true)/Prot, true against Prot, true. For light curves shorter than 2 years, it should be noted that we clearly see resolution effects from the periodogram binning on the recovered distribution. We also note that the spread in relative error is larger for longer periods. For every range of periods, the spread in δProt decreases for longer temporal baselines and the accuracy of the ROOSTER significantly improves.

thumbnail Fig. 8.

Rotation period recovery for the different temporal baselines considered in this work, in the case of noise-free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). The histogram to the right of each panels shows the δProt distribution for the complete tested sample. Rotation period recovery for the different temporal baselines considered in this work, in the case of noise-free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). The histogram to the right of each panels shows the δProt distribution for the complete tested sample.

For short temporal baselines, we note that some of the recovered periods are located close to ridges related to the frequency binning of the GLS periodogram (this is especially visible in the two top panels of Fig. 9).

thumbnail Fig. 9.

Same as Fig. 8 but for the case of corrected noisy light curves.

In the case of noise-free light curves, it goes from 86.3% for the 6 month temporal baseline to 97.5% for the 8 year temporal baseline. Nevertheless, the accuracy of the rotation period recovery is already satisfying after only six months of observations, as the majority of the recovered periods are within 10% of the true period. It should be noted that, as we increase the temporal baseline, the number of light curves removed from the test set –because we did not manage to properly measure the rotation period– substantially decreases, from 114 for the 6 month baseline to only 3 for the 8 year baseline. These results are summarised in Table 2. Considering the corrected noisy light curves, the accuracy scores are significantly affected with respect to the noise-free case. In particular, in the case of 6 month light curves, the accuracy score goes down to 72.3%. Nevertheless, we see again that considering longer time series allows recovery of the rotation periods with a significantly larger accuracy, obtaining a 86.0% accuracy for 1 year light curves, 91.7% for 2 year light curves, 94.8% for 4 year light curves, 95.5% for 6 year light curves, and 96.2% for 8 year light curves. For stars for which a valid period candidate value was obtained from either GLS, ACF, or CS, we note that it can occur that the first overtone (second harmonic) of the rotation period is selected rather than the actual Prot, similarly to what was discussed in Breton et al. (2021). In order to illustrate this last aspect, we provide additional recovery diagnostics in Appendix C.

Table 2.

Summary of the results obtained for the ROOSTER PeriodSel classifier dedicated to selecting the rotation period.

The performance we obtain when analysing both the noise-free and noisy simulated samples validates that, as discussed in Sect. 2.2.1, the modifications made to the ROOSTER methodology with respect to the Breton et al. (2021) version are justified, even if some aspects are still susceptible to evolution as the performance of the algorithms are assessed with real PLATO observations.

4.2. Long-term modulation recovery

Most of the Pcycle included in the simulated light curves are comparable to or larger than the observing time (see Fig. 2). This means that the corresponding excess of power will not be resolved in the periodogram and that the corresponding period measurement will be biased by the frequency bin distribution of the periodogram (in other words, the values of the frequency bins will act as attractors in the distribution of Pcycle measured in the GLS periodogram). This effect can already be seen in the case of rotation periods, where, for observing times of shorter than two years, the recovered values at long periods are distributed along ridges related to the frequency binning of the periodogram. Checking whether or not these excesses of power exist is nevertheless useful in order to validate the modulations detected in the ACF. Indeed, the rotation frequency beating signature will be measured by the ACF but will not produce peaks in the periodogram as they are not fundamental frequencies in the Fourier decomposition of the time series.

In Fig. 10, we show the periods that we recover when computing the ACF of the LC1 noise-free light curves compared to the actual Pcycle values of the simulated light curves. We remind the reader that periods for which the HACF and GACF do not meet the validation thresholds defined in Sect. 2.3.2 are discarded. As stated in Sect. 2.3.2, to be validated, periods measured in the ACF must also be consistent within 30% with one of the periods measured in the GLS. In what follows, for each light curve, in cases where we validate several periods, we consider the recovered Pcycle to be the validated period with the largest HACF. We then define measurements that are not consistent within 30% with the reference Pcycle as false positives. Given the low-frequency of the modulation we are trying to recover in this exercise, this criterion is deliberately chosen to be more permissive than the acceptance range we set (10%) for rotation period recovery. We show the results of the same analysis applied to the LC1 noisy light curves in Fig. 11. As expected, the activity cycle modulations are too long to be properly recovered with light curves of shorter than 4 years. The GACF and HACF thresholds are tuned to discard all false positives in this case, and therefore we do not show diagrams corresponding to the 6 month, 1 year, and 2 year light curves. For longer temporal baselines, it is important to note that, even if we are able to discard most of the modulations that are not actually related to a cycle, the selection methodology still yields a small number of false positives. Nevertheless, with the 4 year light curves, we start recovering periods in the ACF that are related to the activity cycle of some stars with Pcycle ≤ 4 year. However, we note that the longest periods measured with the ACF are underestimated compared to the true value. We also have a significant fraction of false positives: in the noise-free sample, we validate activity modulations for 1.9% of the simulated light curves (39 stars) but 38.5% of these validations are actually false positives. Things improve significantly when considering 6 year and 8 year baselines, although a significant portion of the recovered periods are still underestimated. Nevertheless, with a 6 year baseline, we validate a cyclic modulation for 21.2% of the sample (424 stars), with only 7.8% false positives, while with an 8 year baseline, we improve upon this score to obtain a cycle detection for 38.3% of the sample (766 stars), with only 2.0% false positives. We note that we obtain a greater number of false positives in the case of corrected noisy light curves. For the 4 year temporal baseline, we validate 3.1% of the sample (63 stars), but 68.3% of them are actually false positives. For the 6 year temporal baseline, we validate a cyclic modulation in 21.7% of the sample (434 stars), with 13.8% being false positives. For the 8 year temporal baseline, we validate a cyclic modulation in 38.5% of the sample (770 stars), with 5.8% false positives. Obtaining a similar yield in actual PLATO observations would provide vast and robust samples of solar-type stars with strong constraints on their magnetic activity cycle length, a catalogue which is still lacking from previous space missions dedicated to photometry.

thumbnail Fig. 10.

Measured long-term modulations (blue) compared to the cycle length included in the simulations in the case of noise-free light curves for the temporal baselines considered in this work, where we retrieve modulations: 4 years (top), 6 years (middle), and 8 years (bottom). The dashed orange line corresponds to the 1:1 line, while the upper dotted orange line and the lower dotted orange line are the 1:1.1 and 1:0.9 lines, respectively. The upper dotted yellow line and the lower dotted yellow line are the 1:1.3 and 1:0.7 lines, respectively.

thumbnail Fig. 11.

Same as Fig. 10 but in the case of corrected noisy light curves.

We finally reiterate that, in real PLATO observations, variability modulations with both smaller amplitude and shorter period than the simulated activity cycles, such as Rieger- or QBO-like cycles, will also be searched for in the light curves. Due to the uncertainties related to the physical behaviour and the manifestation of these types of cycles in the light curve, they were not included in the simulations used in the present work. If detectable, properly constraining these modulations will require shorter temporal baselines.

5. Conclusion

This paper is the first in a series of publications dedicated to discussing the possibilities offered by PLATO to monitor the surface rotation and magnetic activity of solar-type stars. Here, we present the algorithms that will be implemented for this purpose in the standard analysis pipeline of the mission.

The algorithms were applied to a set of simulated light curves specifically computed for this work, encompassing convective granulation, spot-induced flux rotational modulation, spot emergence and decay, magnetic activity cycle, latitudinal differential rotation, and migration of active latitudes with time. To produce these simulations, we interfaced the PSLS with the pyspot code in order to include PLATO instrumental systematics and camera random noise in the light curves. To assess how instrumental effects could affect the analysis of stellar signal, we compared the performances of the rotation and activity algorithms when considering noise-free and noisy light curves.

Regarding the algorithmic performances, we demonstrate that the time of observation that PLATO is able to dedicate to each field will significantly affect the rotation period recovery rate of the observed targets, especially with slow rotators. Nevertheless, we show that an acceptable recovery rate (above 72% for the noisy set of light curves) for the recovery of average rotation periods should already be reached with only 6 months of observations. This rate will be significantly improved as more data are collected, reaching a 86.0% score for 1 year time series, and 91.7% for 2 year time series. Longer observations will allow improvement of the quality of the rotation measurements, especially for slow rotators. In the noisy set of light curves, we are indeed able to obtain a recovery rate of almost 95% for the recovery of average rotation period for a 4 year temporal baseline (which is the length of the nominal PLATO mission). PLATO instrumental systematics as predicted by PSLS do not significantly limit rotation measurements, either by sensibly reducing the fraction of stars for which we detect rotational modulation or by biasing the recovered period. In addition to the core mission objectives of detecting exoplanets and characterising their host stars through asteroseismology, photometric measurements of stellar surface rotation will therefore be an important added value of the PLATO mission.

Under the assumptions made to include activity cycles in the simulated light curves, we are also able to show that recovering Schwabe-like magnetic activity cycles will require observation of the same field for at least four years, and that staying in the same field for six or eight years will significantly improve the mission yield in regard to this observable. Indeed, considering the case of noisy four-year light curves, we are able to detect an activity cycle for 3.1% of the simulated sample, with 68.3% false positives, while extending the observing time to eight years allows us to constrain the activity cycle for 38.5% of the sample and reduce the false positives to 5.8% of the validated periods. Such measurements of variability due to long-term activity will be of primary importance for improving our understanding of the dependencies of the dynamo mechanisms of solar-type stars on age, structure, and dynamics (e.g. mean rotation rate, convective flow velocity, and density gradient).

The data products yielded by the module are designed to be of direct use to the community. Until the launch and during flying operations, the algorithms presented in this work will undergo continuous improvement within the PLATO consortium in order to provide the community with the most accurate measurements of rotation and magnetic activity upon each data release.


1

The code source for the demonstrator is accessible at https://gitlab.com/sybreton/star_privateer and the corresponding documentation is hosted at https://star-privateer.readthedocs.io/en/latest. The analysis presented in this paper was performed with v1.1.2 of the module, which can be downloaded and installed from PyPi: https://pypi.org/project/star-privateer

2

The PSLS documentation is accessible at https://sites.lesia.obspm.fr/psls. We used v1.5 of the PSLS in this work.

3

In this scenario, PLATO would observe only one of the two selected long-pointing fields (Nascimbeni et al. 2022). While the first PLATO long-pointing field was announced by the PLATO Science Working Team on 11 June 2023, the total amount of time that PLATO will remain in this field is conditioned on a mission performance assessment after one year of flight. See the PLATO first field announcement: https://platomission.com/2023/07/11/first-plato-long-duration-observation-phase-lop-field-selected/

Acknowledgments

This work presents results from the European Space Agency (ESA) space mission PLATO. The PLATO payload, the PLATO Ground Segment and PLATO data processing are joint developments of ESA and the PLATO Mission Consortium (PMC). Funding for the PMC is provided at national levels, in particular by countries participating in the PLATO Multilateral Agreement (Austria, Belgium, Czech Republic, Denmark, France, Germany, Italy, Netherlands, Portugal, Spain, Sweden, Switzerland, Norway, and United Kingdom) and institutions from Brazil. Members of the PLATO Consortium can be found at https://platomission.com. The ESA PLATO mission website is https://www.cosmos.esa.int/plato. The authors thank the teams working for PLATO for all their work. They acknowledge the critical reading and the constructive comments from the anonymous referee that significantly allowed improving the original version of this paper. They finally thank R. Samadi for helpful advice and suggestions concerning the PSLS abilities. S.N.B, A.F.L, S.Me, I.P and E.C acknowledge support from PLATO ASI-INAF agreement no. 2022-28-HH.0 “PLATO Fase D”. S.N.B, L.A, A.S.B, Q.N, and A.S acknowledge financial support by ERC Whole Sun Synergy grant #810218. S.N.B, R.A.G, L.A, A.S.B, Q.N., D.B.P, E.P, and A.S acknowledge the support from PLATO CNES grant. R.A.G, D.B.P, and E.P acknowledge the support from SoHO/GOLF CNES grant. A.S.B, Q.N, and A.S acknowledge the support from INSU/PNST grant and Solar Orbiter CNES grant. A.S acknowledges funding from from the European Union’s Horizon-2020 research and innovation program (grant agreement no. 776403 ExoplANETS-A) and the Programme National de Planétologie (PNP). A.R.G.S acknowledges the support from the FCT through national funds and FEDER through COMPETE2020 (UIDB/04434/2020, UIDP/04434/2020, 2022.03993.PTDC) and the support from the FCT through the work contract No. 2020.02480.CEECIND/CP1631/CT0001. S.Ma acknowledges support from the Spanish Ministry of Science and Innovation (MICINN) with the Ramón y Cajal fellowship no. RYC-2015-17697 and through AEI under the Severo Ochoa Centres of Excellence Programme 2020–2023 (CEX2019-000920-S). S.Ma acknowledges support from the Spanish Ministry of Science and Innovation (MICINN) with the grant no. PID2019-107187GB-I00. M.J.G., K.B., R.M.O, J.P, O.R., C.R. acknowledge support from CNES. The computations were performed with the IRFU/CEA Saclay server facilities, funded by ERC Synergy grant WholeSun No.810218, the P2IO Labex emergence project FlarePredict, and CNES PLATO funds. Software:star-privateer (this work), pyspot (Aigrain et al. 2015), PSLS (Samadi et al. 2019), numpy (Harris et al. 2020), matplotlib (Hunter 2007), scipy (Virtanen et al. 2020), astropy (Astropy Collaboration 2022), pandas (Wes McKinney 2010; The pandas development team 2020), scikit-learn (Pedregosa et al. 2011).

References

  1. Ahuir, J., Strugarek, A., Brun, A. S., & Mathis, S. 2021, A&A, 650, A126 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Aigrain, S., Llama, J., Ceillier, T., et al. 2015, MNRAS, 450, 3211 [Google Scholar]
  3. Amard, L., Roquette, J., & Matt, S. P. 2020, MNRAS, 499, 3481 [NASA ADS] [CrossRef] [Google Scholar]
  4. Angus, R., Beane, A., Price-Whelan, A. M., et al. 2020, AJ, 160, 90 [Google Scholar]
  5. Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
  6. Baglin, A., Auvergne, M., Boisnard, L., et al. 2006, 36th COSPAR Sci. Assembly, 36, 3749 [NASA ADS] [Google Scholar]
  7. Barnes, S. A. 2003, ApJ, 586, 464 [Google Scholar]
  8. Barnes, S. A. 2007, ApJ, 669, 1167 [Google Scholar]
  9. Basri, G. 2018, ApJ, 865, 142 [Google Scholar]
  10. Basri, G., & Shah, R. 2020, ApJ, 901, 14 [Google Scholar]
  11. Bastien, F. A., Stassun, K. G., Basri, G., & Pepper, J. 2016, ApJ, 818, 43 [NASA ADS] [CrossRef] [Google Scholar]
  12. Bonanno, A., & Corsaro, E. 2022, ApJ, 939, L26 [NASA ADS] [CrossRef] [Google Scholar]
  13. Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977 [Google Scholar]
  14. Breiman, L. 2001, Mach. Learn., 45, 5 [Google Scholar]
  15. Breiman, L., Friedman, J., Stone, C., & Olshen, R. 1984, Classification and Regression Trees, The Wadsworth and Brooks-Cole Statistics-probability Series (Taylor& Francis) [Google Scholar]
  16. Breton, S. N., Santos, A. R. G., Bugnet, L., et al. 2021, A&A, 647, A125 [EDP Sciences] [Google Scholar]
  17. Breton, S. N., Lanza, A. F., & Messina, S. 2024, A&A, 682, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Brun, A. S., & Browning, M. K. 2017, Liv. Rev. Sol. Phys., 14, 4 [Google Scholar]
  19. Brun, A. S., Strugarek, A., Varela, J., et al. 2017, ApJ, 836, 192 [Google Scholar]
  20. Bugnet, L., García, R. A., Davies, G. R., et al. 2018, A&A, 620, A38 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Cao, L., Pinsonneault, M. H., & van Saders, J. L. 2023, ApJ, 951, L49 [NASA ADS] [CrossRef] [Google Scholar]
  22. Ceillier, T., van Saders, J., García, R. A., et al. 2016, MNRAS, 456, 119 [Google Scholar]
  23. Ceillier, T., Tayar, J., Mathur, S., et al. 2017, A&A, 605, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Chaplin, W. J., Bedding, T. R., Bonanno, A., et al. 2011, ApJ, 732, L5 [Google Scholar]
  25. Claytor, Z. R., van Saders, J. L., Cao, L., et al. 2024, ApJ, 962, 47 [NASA ADS] [CrossRef] [Google Scholar]
  26. Corsaro, E., & De Ridder, J. 2014, A&A, 571, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Corsaro, E., De Ridder, J., & García, R. A. 2015, A&A, 579, A83 [CrossRef] [EDP Sciences] [Google Scholar]
  28. Corsaro, E., Bonanno, A., Mathur, S., et al. 2021, A&A, 652, L2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. de Freitas, D. B., Lanza, A. F., da Silva Gomes, F. O., & Das Chagas, M. L. 2021, A&A, 650, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Deeg, H. J., Georgieva, I. Y., Nowak, G., et al. 2023, A&A, 677, A12 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  31. Distefano, E., Lanzafame, A. C., Brugaletta, E., et al. 2023, A&A, 674, A20 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  32. Gaia Collaboration (Prusti, T., et al.) 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. García, R. A., & Ballot, J. 2019, Liv. Rev. Sol. Phys., 16, 4 [Google Scholar]
  34. García, R. A., Ceillier, T., Salabert, D., et al. 2014, A&A, 572, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. García, R. A., Gourvès, C., Santos, A. R. G., et al. 2023, A&A, 679, L12 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. Gordon, T. A., Davenport, J. R. A., Angus, R., et al. 2021, ApJ, 913, 70 [Google Scholar]
  37. Gurgenashvili, E., Zaqarashvili, T. V., Kukhianidze, V., et al. 2022, A&A, 660, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Hall, O. J., Davies, G. R., van Saders, J., et al. 2021, Nat. Astron., 5, 707 [NASA ADS] [CrossRef] [Google Scholar]
  39. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
  40. Harvey, J. 1985, ESA Spec. Publ., 235, 199 [Google Scholar]
  41. Holcomb, R. J., Robertson, P., Hartigan, P., Oelkers, R. J., & Robinson, C. 2022, ApJ, 936, 138 [NASA ADS] [CrossRef] [Google Scholar]
  42. Howell, S. B., Sobeck, C., Haas, M., et al. 2014, PASP, 126, 398 [Google Scholar]
  43. Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]
  44. Jouve, L., Brun, A. S., Arlt, R., et al. 2008, A&A, 483, 949 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Kallinger, T., De Ridder, J., Hekker, S., et al. 2014, A&A, 570, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  46. Kallinger, T., Hekker, S., Garcia, R. A., Huber, D., & Matthews, J. M. 2016, Sci. Adv., 2, 1500654 [Google Scholar]
  47. Karoff, C., Metcalfe, T. S., Santos, Â. R. G., et al. 2018, ApJ, 852, 46 [Google Scholar]
  48. Kjeldsen, H., & Bedding, T. R. 1995, A&A, 293, 87 [NASA ADS] [Google Scholar]
  49. Lanza, A. F., Pagano, I., Leto, G., et al. 2009, A&A, 493, 193 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  50. Lanza, A. F., Netto, Y., Bonomo, A. S., et al. 2019, A&A, 626, A38 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Lanzafame, A. C., Distefano, E., Messina, S., et al. 2018, A&A, 616, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Lanzafame, A. C., Distefano, E., Barnes, S. A., & Spada, F. 2019, ApJ, 877, 157 [NASA ADS] [CrossRef] [Google Scholar]
  53. Lomb, N. R. 1976, Ap&SS, 39, 447 [Google Scholar]
  54. Lu, Y. L., Curtis, J. L., Angus, R., David, T. J., & Hattori, S. 2022, AJ, 164, 251 [CrossRef] [Google Scholar]
  55. Luger, R., Foreman-Mackey, D., Hedges, C., & Hogg, D. W. 2021, AJ, 162, 123 [NASA ADS] [CrossRef] [Google Scholar]
  56. Martinez Pillet, V., Moreno-Insertis, F., & Vazquez, M. 1993, A&A, 274, 521 [Google Scholar]
  57. Masuda, K. 2022a, ApJ, 937, 94 [NASA ADS] [CrossRef] [Google Scholar]
  58. Masuda, K. 2022b, ApJ, 933, 195 [NASA ADS] [CrossRef] [Google Scholar]
  59. Mathur, S., García, R. A., Ballot, J., et al. 2014a, A&A, 562, A124 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  60. Mathur, S., Salabert, D., García, R. A., & Ceillier, T. 2014b, J. Space Weather Space Clim., 4, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  61. Mathur, S., García, R. A., Bugnet, L., et al. 2019, Front. Astron. Space Sci., 6, 46 [Google Scholar]
  62. Mathur, S., Claytor, Z. R., Santos, Â. R. G., et al. 2023, ApJ, 952, 131 [NASA ADS] [CrossRef] [Google Scholar]
  63. McQuillan, A., Aigrain, S., & Mazeh, T. 2013a, MNRAS, 432, 1203 [Google Scholar]
  64. McQuillan, A., Mazeh, T., & Aigrain, S. 2013b, ApJ, 775, L11 [Google Scholar]
  65. McQuillan, A., Mazeh, T., & Aigrain, S. 2014, ApJS, 211, 24 [Google Scholar]
  66. Mehta, T., Jain, K., Tripathy, S. C., et al. 2022, MNRAS, 515, 2415 [NASA ADS] [CrossRef] [Google Scholar]
  67. Messias, Y. S., de Oliveira, L. L. A., Gomes, R. L., et al. 2022, ApJ, 930, L23 [NASA ADS] [CrossRef] [Google Scholar]
  68. Meunier, N., Lagrange, A. M., Boulet, T., & Borgniet, S. 2019, A&A, 627, A56 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  69. Montalto, M., Piotto, G., Marrese, P. M., et al. 2021, A&A, 653, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  70. Nascimbeni, V., Piotto, G., Börner, A., et al. 2022, A&A, 658, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  71. Nielsen, M. B., Gizon, L., Schunker, H., & Karoff, C. 2013, A&A, 557, L10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  72. Noraz, Q., Breton, S. N., Brun, A. S., et al. 2022, A&A, 667, A50 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  73. Notsu, Y., Maehara, H., Honda, S., et al. 2019, ApJ, 876, 58 [Google Scholar]
  74. Noyes, R. W., Hartmann, L. W., Baliunas, S. L., Duncan, D. K., & Vaughan, A. H. 1984, ApJ, 279, 763 [Google Scholar]
  75. Özavcı, I., Şenavcı, H. V., Işık, E., et al. 2018, MNRAS, 474, 5534 [Google Scholar]
  76. Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, J. Mach. Learn. Res., 12, 2825 [Google Scholar]
  77. Raetz, S., Stelzer, B., Damasso, M., & Scholz, A. 2020, A&A, 637, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  78. Rauer, H., Catala, C., Aerts, C., et al. 2014, Exp. Astron., 38, 249 [Google Scholar]
  79. Reinhold, T., & Hekker, S. 2020, A&A, 635, A43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  80. Reinhold, T., Reiners, A., & Basri, G. 2013, A&A, 560, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  81. Reinhold, T., Shapiro, A. I., Solanki, S. K., & Basri, G. 2023, A&A, 678, A24 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  82. Ricker, G. R., Winn, J. N., Vanderspek, R., et al. 2015, J. Astron. Telesc. Instrum. Syst., 1, 014003 [Google Scholar]
  83. Rieger, E., Share, G. H., Forrest, D. J., et al. 1984, Nature, 312, 623 [Google Scholar]
  84. Salabert, D., García, R. A., Beck, P. G., et al. 2016, A&A, 596, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  85. Salabert, D., García, R. A., Jiménez, A., et al. 2017, A&A, 608, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  86. Samadi, R., Deru, A., Reese, D., et al. 2019, A&A, 624, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  87. Santos, A. R. G., Cunha, M. S., Avelino, P. P., García, R. A., & Mathur, S. 2017, A&A, 599, A1 [CrossRef] [EDP Sciences] [Google Scholar]
  88. Santos, A. R. G., García, R. A., Mathur, S., et al. 2019, ApJS, 244, 21 [Google Scholar]
  89. Santos, A. R. G., Breton, S. N., Mathur, S., & García, R. A. 2021a, ApJS, 255, 17 [NASA ADS] [CrossRef] [Google Scholar]
  90. Santos, A. R. G., Mathur, S., García, R. A., Cunha, M. S., & Avelino, P. P. 2021b, MNRAS, 508, 267 [NASA ADS] [CrossRef] [Google Scholar]
  91. Santos, A. R. G., Mathur, S., García, R. A., et al. 2023, A&A, 672, A56 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  92. Scargle, J. D. 1982, ApJ, 263, 835 [Google Scholar]
  93. See, V., Roquette, J., Amard, L., & Matt, S. P. 2021, ApJ, 912, 127 [Google Scholar]
  94. See, V., Roquette, J., Amard, L., & Matt, S. 2023, MNRAS, 524, 5781 [NASA ADS] [CrossRef] [Google Scholar]
  95. Skumanich, A. 1972, ApJ, 171, 565 [Google Scholar]
  96. Spada, F., & Lanzafame, A. C. 2020, A&A, 636, A76 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  97. The pandas development team 2020, https://doi.org/10.5281/zenodo.3509134 [Google Scholar]
  98. van Saders, J. L., Ceillier, T., Metcalfe, T. S., et al. 2016, Nature, 529, 181 [Google Scholar]
  99. van Saders, J. L., Pinsonneault, M. H., & Barbieri, M. 2019, ApJ, 872, 128 [NASA ADS] [CrossRef] [Google Scholar]
  100. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261 [Google Scholar]
  101. Wes McKinney 2010, in Proceedings of the 9th Python in Science Conference, eds. S. van der Walt, & J. Millman, 56 [CrossRef] [Google Scholar]
  102. Wilson, P. R., Altrocki, R. C., Harvey, K. L., Martin, S. F., & Snodgrass, H. B. 1988, Nature, 333, 748 [Google Scholar]
  103. Woodard, M. F. 1984, Ph.D. Thesis, University of California, San Diego, USA [Google Scholar]
  104. Yang, H., & Liu, J. 2019, ApJS, 241, 29 [NASA ADS] [CrossRef] [Google Scholar]
  105. Zechmeister, M., & Kürster, M. 2009, A&A, 496, 577 [CrossRef] [EDP Sciences] [Google Scholar]

Appendix A: Generation of rotational modulation parameters

The prescriptions from Meunier et al. (2019) are used to generate rotational modulation parameters from the input Teff selected from the PIC. In what follows, we denote as 𝕏 a random variable with uniform distribution between 0 and 1. First, B − V is interpolated from Teff using Table 1 from Meunier et al. (2019). Using B − V, an activity proxy log R HK $ R^\prime_{\rm HK} $ is randomly drawn with the following procedure.

log R HK = log R HK , min + ( log R HK , max log R HK , min ) X , $$ \begin{aligned} \log {R\prime }_{\rm HK} = \log {R\prime }_{\rm HK, min} + (\log R^{\prime }_{\rm HK, max} - \log R^{\prime }_{\rm HK, min}) \mathbb{X} \; , \end{aligned} $$(A.1)

where

log R HK , max = 0.375 ( B V ) 4.4 , $$ \begin{aligned} \log R^{\prime }_{\rm HK, max} = - 0.375 (B-V) - 4.4 \; , \end{aligned} $$(A.2)

and log R HK , min $ \log {R^{\prime}}_{\mathrm{\rm HK, min}} $ has been obtained through a minimum value of the activity index S, Smin

S min = { 0.144 if B V < 0.94 0.0269231 ( B V ) + 0.118892 otherwise , $$ \begin{aligned} S_\mathrm{min} = \left\{ \begin{aligned}&0.144 \text{ if} B-V < 0.94 \\&0.0269231 (B-V) + 0.118892 \text{ otherwise} \end{aligned} \right. \; , \end{aligned} $$(A.3)

and then

log R H K , min = 4 + log [ 1.34 ( B V ) ] + log C cf , $$ \begin{aligned} \log R^{\prime }_{\rm \mathrm {HK, min}} = - 4 + \log [1.34 (B-V)] + \log C_{cf} \; , \end{aligned} $$(A.4)

where, writing μ = 0.63 − (B − V)

log C cf = { log C c f , 0 if B V 0.63 log C c f , 0 + 0.135 μ 0.814 μ 2 + 6.03 μ 3 otherwise , $$ \begin{aligned} \log C_{cf} = \left\{ \begin{aligned}&\log C_{cf,0} \text{ if} B-V \ge 0.63 \\&\log C_{cf,0} + 0.135 \mu - 0.814 \mu ^2 + 6.03 \mu ^3 \text{ otherwise} \end{aligned} \right. \; , \end{aligned} $$(A.5)

with

log C c f , 0 = 1.13 ( B V ) 3 3.91 ( B V ) 2 + 2.84 ( B V ) 0.47 . $$ \begin{aligned} \log C_{cf,0} = 1.13 (B-V)^3 - 3.91 (B-V)^2 + 2.84 (B-V) - 0.47 \; . \end{aligned} $$(A.6)

At this stage, B − V and log R HK $ R^\prime_{\rm HK} $ are combined to draw the average rotation period Prot, using the Rossby number, RoNoyes, and the convective turnover time, τc, estimates from Noyes et al. (1984). We have, writing x = 1 − (B − V)

log τ c = { 1.362 0.166 x + 0.025 x 2 5.323 x 3 if x > 0 1.362 0.14 x if x 0 , $$ \begin{aligned} \log \tau _c = \left\{ \begin{aligned}&1.362 - 0.166x + 0.025x^2 - 5.323x^3 \text{ if} x > 0 \\&1.362 - 0.14x \text{ if} x \le 0 \end{aligned} \right. \; , \end{aligned} $$(A.7)

and

R o Noyes = 0.808 2.966 ( log R HK + 4.52 ) . $$ \begin{aligned} Ro_{\rm Noyes} = 0.808 - 2.966 (\log R^{\prime }_{\rm HK} + 4.52) \; . \end{aligned} $$(A.8)

Subsequently we draw Prot as

P rot = ( R o Noyes + 0.4 X 0.2 ) τ c . $$ \begin{aligned} P_\mathrm{rot} = (Ro_{\rm Noyes} + 0.4 \mathbb{X} - 0.2) \tau _c \; . \end{aligned} $$(A.9)

Once Prot has been drawn, the amplitude of latitudinal differential rotation, given by the parameters Pmin and Pmax, is deterministic and depends on Teff and Prot

{ p 0 = 3.485 + 2.4781 T eff 10 4 K , p 1 = 1.597 1.351 T eff 10 4 K , log α = p 0 + p 1 log ( P rot 1 day ) , P max = 2 P rot / ( 2 α ) , P min = ( 1 α ) P max , $$ \begin{aligned} \left\{ \begin{aligned}&p_0 = - 3.485 + 2.4781 \, \frac{T_\mathrm{eff} }{10^4 K} \; , \\&p_1 = 1.597 - 1.351 \, \frac{T_\mathrm{eff} }{10^4 K} \; , \\&\log \alpha = p_0 + p_1 \log \left(\frac{P_\mathrm{rot} }{1 \text{ day}} \right) \; , \\&P_\mathrm{max} = 2 P_\mathrm{rot} / (2 - \alpha ) \; , \\&P_\mathrm{min} = (1 - \alpha ) P_\mathrm{max} \; , \end{aligned} \right. \end{aligned} $$(A.10)

where p0, p1 and α are dimensionless parameters. The latitudinal extent of the active region formation range between λ = 0 and λmax drawn as

λ max = 32 o + 20 o × X . $$ \begin{aligned} \lambda _\mathrm{max} = 32^\mathrm{o} + 20^\mathrm{o} \times \mathbb{X} \; . \end{aligned} $$(A.11)

The activity cycle length, Pcycle is drawn as

log ( P cycle 1 year ) = 0.16 log ( P rot 1 day ) + 3.14 + 0.6 X 0.3 , $$ \begin{aligned} \log \left( \frac{P_\mathrm{cycle} }{1 \, \mathrm{year} } \right) = 0.16 \log \left(\frac{P_\mathrm{rot} }{1 \text{ day}} \right) + 3.14 + 0.6\mathbb{X} - 0.3 \; , \end{aligned} $$(A.12)

and the overlap between consecutive cycles, δcycle, as

δ cycle = 0.1 X P cycle . $$ \begin{aligned} \delta _{\rm cycle} = 0.1 \mathbb{X} P_{\rm cycle} \; . \end{aligned} $$(A.13)

Appendix B: Light curves properties and preprocessing

In this appendix, we present and discuss some additional aspects related to the PSLS light curves and their preprocessing.

B.1. Photon noise level

In order to isolate the contribution of photon noise level in the simulations presented in this work, we generated a set of 90-day (one quarter) light curves with only instrumental signal, using as input the same magnitude V and PSLS random seed as for the 8-year simulations with rotational modulations. In order to validate that using the PSLS to directly generate two-hour light curves does not affect the photon noise, we generate these 90 day light curves with both a two-hour and a 25 second sampling. We rebin the 25 second sampled time series at one hour and two hours. After filtering out the low-frequency drift, we compute the photon noise level as the standard deviation of the time series. We show the magnitude V versus photon noise diagram in Fig. B.1.

thumbnail Fig. B.1.

Photon noise level as a function of magnitude V. The orange distribution corresponds to photon noise level computed for pure noise time series generated directly with a 2-hour sampling. The yellow distribution corresponds to photon noise level computed for pure noise time series generated with a 25s sampling, rebinned at 2-hour. The grey distribution corresponds to photon noise level computed for pure noise time series generated with a 25s sampling, rebinned at 1-hour.

The orange and yellow distributions correspond to the photon noise of the simulations analysed in this work, and to what is currently expected for two-hour rebinned PLATO light curves in this range of magnitude. Comparing these two distributions also allows us to validate that using the PSLS to directly generate light curves with two-hour sampling does not introduce a bias in the resulting photon noise level. We show for comparison (in grey) the noise level obtained for the one-hour rebinning of the 25 second time series. This timescale is the reference used to define PLATO stellar sample in terms of noise: P1 and P2 sample will have, by construction, a photon noise below 50 ppm in one hour (e.g. Montalto et al. 2021). We remind that, for all of our simulations, we considered the case of stars observed by the 4 groups of 6 cameras together. For a given magnitude V, the noise level of a star observed by a smaller number of cameras is therefore expected to be slightly larger (but still under 50 ppm in one hour for P1 and P2).

B.2. Granulation level

As mentioned in Sect. 3.1, granulation is included in the simulated light curves using the scaling laws derived by Kallinger et al. (2014). The granulation is modelled by two pseudo-Lorentzian functions (also referred as Harvey models) with distinct characteristic timescales. The amplitudes a1 and a2 of theses functions, in ppm, is given by

a 1 = 3382 × ν max 0.609 , $$ \begin{aligned} a_1&= 3382 \times \nu _{\rm max}^{-0.609} \; , \end{aligned} $$(B.1)

a 2 = 3710 × ν max 0.613 M 0.26 . $$ \begin{aligned} a_2&= 3710 \times \nu _{\rm max}^{-0.613} M^{-0.26} \; . \end{aligned} $$(B.2)

where M is the stellar mass and νmax is the frequency of maximal power of the stellar oscillations (see e.g. García & Ballot 2019). The total intensity fluctuation related to granulation Agran in the background will therefore be

A gran = C bol a 1 2 + a 2 2 , $$ \begin{aligned} A_{\rm gran} = C_{\rm bol} \sqrt{a_1^2 + a_2^2} \; , \end{aligned} $$(B.3)

where the bolometric correction Cbol is given by Cbol = (Teff/T0)α with T0 = 5934 K and α = 0.8. In Fig B.2, we represent how Agran scales with the Teff and log g parameters of the star. The strong correlation between Agran and log g is explained by the fact that the νmax values strongly depends on log g (e.g. Kjeldsen & Bedding 1995; Kallinger et al. 2016)

thumbnail Fig. B.2.

Granulation power level as a function of Teff (top) and log g (bottom).

B.3. Light curve filtering

We illustrate here the effect that the 60 day high pass filtering applied on LC1 to obtain LC2 has on stellar variability. We show the comparison between the observed signal for a noise-free light curve before and after filtering in Fig. B.3. We also compare the periodograms of the two light curves, where the impact of the 60 day cutoff (∼0.19 μHz) on the LC2 periodogram is clearly visible.

thumbnail Fig. B.3.

Top: comparison between LC1 (black) and LC2 (grey) for a noise-free light curve. The light curves have been shifted to improve readibility. Bottom: corresponding LC1 and LC2 periodograms.

Appendix C: Additional rotation recovery diagnostics

We provide in this appendix diagnostics complementary to those presented in Sect. 4.1. We show in Fig. C.1 and Fig. C.2 the comparison between the true rotation periods Prot, true, and the recovered rotation periods Prot, recovered, for both noise-free light curves and noisy light curves. Uncertainties on Prot, recovered are represented when available (we remind that the case where uncertainties are not estimated means that ROOSTER selected PACF as Prot, see Sect. 2.2.1). The uncertainties increases with Prot, recovered. For the considered range (8 < V < 11), we do not not find a dependence of the uncertainties on the magnitude. We note that for short temporal baselines, long periods are more generally selected by ROOSTER from PACF, while, for longer baselines, ROOSTER favours PGLS or PCS. We finally note that, for a very small fraction of the sample, our methodology selects the second harmonic of Prot rather than Prot itself, an issue that was already discussed in Breton et al. (2021). Nevertheless, the vast majority of the recovered periods are located in the ±10% interval around Prot, true: considering that the second harmonic was chosen if 0.49Prot, true < Prot, recovered < 0.51Prot, true, we never have more than 1.2 % of the recovered periods laying in this area.

thumbnail Fig. C.1.

Comparison between the true rotation periods Prot, true, and the recovered rotation periods Prot, recovered for the different temporal baselines considered in this work, in the case of noise free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). Uncertainties on Prot, recovered are shown when available. The dashed grey lines correspond to the 1:1 ratio and the ±10% interval. The percentage of recovered periods laying in the interval 0.49Prot, true < Prot, recovered < 0.51Prot, true is indicated.

thumbnail Fig. C.2.

Same as Fig. C.1 in the case of corrected noisy light curves.

Appendix D: Nomenclature

For readability considerations, we made the choice to reduce as much as possible the use of PLATO technical acronyms and denominations. We provide in this appendix the correspondance between the simple nomenclature used in this work to describe the module structure and the current PLATO denominations. The rotation and activity module is a component of the PLATO Stellar Analysis System (SAS), under the name of Module for Stellar Astrophysics number 4 (MSAP4). Each submodule presented in this work is referred to using an additional index, as listed in Table D.1.

Table D.1.

Correspondance of denominations between this work and PLATO nomenclature.

All Tables

Table 1.

Summary of the results obtained for the ROOSTER RotClass classifier dedicated to assess the presence of a rotational modulation.

Table 2.

Summary of the results obtained for the ROOSTER PeriodSel classifier dedicated to selecting the rotation period.

Table D.1.

Correspondance of denominations between this work and PLATO nomenclature.

All Figures

thumbnail Fig. 1.

Flow diagram for the module. The main inputs for the module, the light curves, and the periodograms are shown on the left. The different module outputs are encircled in red.

In the text
thumbnail Fig. 2.

Rotation period Prot and cyclic periodicity Pcycle distribution of the simulated sample.

In the text
thumbnail Fig. 3.

Rotation period Prot and activity index Sph distribution of the simulated sample (blue) compared with the sample from Santos et al. (2021a, grey). The right panel shows the projected Sph density distribution for the two populations. The projected density distribution for the Santos et al. (2021a) targets is computed considering only stars with Sph < 400 ppm. The dashed horizontal orange lines correspond to the solar minimal and maximal values measured by Salabert et al. (2016).

In the text
thumbnail Fig. 4.

Example of spot activity included in the simulations (top) and the corresponding 8 year light curve, including convective granulation (bottom). The simulation has Prot = 21 days, Pcycle = 5.4 years, δcycle = 0.45 years, and i = 83°. The vertical coloured lines highlight the extent of the light-curve segments considered in this work in addition to the full 8 year light curves: 6 months (yellow), 1 year (blue), 2 years (orange), 4 years (red), and 6 years (cyan).

In the text
thumbnail Fig. 5.

Comparison between the 8 year noise-free light curve (black), the uncorrected 8 year noisy light curve (grey), the corrected 4 year light curve (blue), and the corrected 8 year light curve (orange).

In the text
thumbnail Fig. 6.

Comparison between the PSD obtained from the noise-free light curve (black), the uncorrected noisy light curve (grey), and the corrected noisy light curve (orange).

In the text
thumbnail Fig. 7.

Example of GLS periodogram (top), ACF (middle), and CS (bottom) analyses performed to recover the rotational modulation of the light curve shown in Fig. 4, using the full time series. The GLS periodogram was normalised to account for the background contribution according to the method described in Sect. 2.2.1. The set of Gaussian profiles fitted on the GLS periodogram and the CS are shown in orange. The smoothed periodogram is shown in cyan, while the Gaussian profile fitted on it and centred on PLS is shown in blue.

In the text
thumbnail Fig. 8.

Rotation period recovery for the different temporal baselines considered in this work, in the case of noise-free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). The histogram to the right of each panels shows the δProt distribution for the complete tested sample. Rotation period recovery for the different temporal baselines considered in this work, in the case of noise-free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). The histogram to the right of each panels shows the δProt distribution for the complete tested sample.

In the text
thumbnail Fig. 9.

Same as Fig. 8 but for the case of corrected noisy light curves.

In the text
thumbnail Fig. 10.

Measured long-term modulations (blue) compared to the cycle length included in the simulations in the case of noise-free light curves for the temporal baselines considered in this work, where we retrieve modulations: 4 years (top), 6 years (middle), and 8 years (bottom). The dashed orange line corresponds to the 1:1 line, while the upper dotted orange line and the lower dotted orange line are the 1:1.1 and 1:0.9 lines, respectively. The upper dotted yellow line and the lower dotted yellow line are the 1:1.3 and 1:0.7 lines, respectively.

In the text
thumbnail Fig. 11.

Same as Fig. 10 but in the case of corrected noisy light curves.

In the text
thumbnail Fig. B.1.

Photon noise level as a function of magnitude V. The orange distribution corresponds to photon noise level computed for pure noise time series generated directly with a 2-hour sampling. The yellow distribution corresponds to photon noise level computed for pure noise time series generated with a 25s sampling, rebinned at 2-hour. The grey distribution corresponds to photon noise level computed for pure noise time series generated with a 25s sampling, rebinned at 1-hour.

In the text
thumbnail Fig. B.2.

Granulation power level as a function of Teff (top) and log g (bottom).

In the text
thumbnail Fig. B.3.

Top: comparison between LC1 (black) and LC2 (grey) for a noise-free light curve. The light curves have been shifted to improve readibility. Bottom: corresponding LC1 and LC2 periodograms.

In the text
thumbnail Fig. C.1.

Comparison between the true rotation periods Prot, true, and the recovered rotation periods Prot, recovered for the different temporal baselines considered in this work, in the case of noise free light curves: 6 months (top left), 1 year (top right), 2 years (middle left), 4 years (middle right), 6 years (bottom left), and 8 years (bottom right). Uncertainties on Prot, recovered are shown when available. The dashed grey lines correspond to the 1:1 ratio and the ±10% interval. The percentage of recovered periods laying in the interval 0.49Prot, true < Prot, recovered < 0.51Prot, true is indicated.

In the text
thumbnail Fig. C.2.

Same as Fig. C.1 in the case of corrected noisy light curves.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.