Free Access
Issue
A&A
Volume 658, February 2022
Article Number A177
Number of page(s) 16
Section Planets and planetary systems
DOI https://doi.org/10.1051/0004-6361/202141197
Published online 18 February 2022

© ESO 2022

1 Introduction

In many astronomical fields, one searches for periodicity in a time series of measurements. Due to observational constraints, the samples often are unevenly spaced, possibly with large gaps in the sampling. The search for periodicity is then typically done with periodograms, which can be defined in several ways. The different definitions can be viewed as a metric (most often the χ2 difference) that compares two models: a base model and a model that includes this base model plus a periodic signal at a given frequency, for a grid of frequencies. Typically, periodic components are sinusoids, and the base model includes white, Gaussian noise (Lomb 1976; Scargle 1982). It can also include a constant (Ferraz-Mello 1981; Cumming et al. 1999; Reegen 2007; Zechmeister & Kürster 2009), general linear models Baluev (2008), non-sinusoidal functions Baluev (2013b, 2015), several periodic components Baluev (2013,a), or correlated noise (Delisle et al. 2020a). Once the periodogram is computed, detections are claimed if the false alarm probability (FAP) of the highest peak is below a certain threshold. The search for periodic signals can also be performed in a Bayesian framework. Periodograms can be defined as comparisons of marginal likelihoods of the two competing models, with or without a given frequency (Mortier et al. 2015; Feng et al. 2017). The analysis can also be done directly from the posterior distributions of orbital elements. In that case, to determine how many periodic components can be confidently detected, one computes the ratio of Bayesian evidences – or Bayes factors (Kass & Raftery 1995) – of models with n and n + 1 components. If the Bayes factor is above a certain threshold (usually 150), the addition of a periodic component is validated (e.g. Gregory 2007b,a; Tuomi 2012; Faria et al. 2016). Alternatively, one can use a detection criterion based on Bayesian model averaging (Hara et al. 2022).

A low FAP or high Bayes factor gives a measure of the confidence in a detection of a periodic component within a certain model of the data. If the model is inappropriate, there might be significant signals that do not correspond to strictly periodic signals. Certain noises are particularly likely to yield significant peaks: the ones that present a periodicity that is localised in time, similarly to wavelets. This situation is encountered in particular in the search for planets in radial velocity (RV) data. This type data consist of a time series of velocity projected along the line of sight (the RV) of a given star, measured thanks to the Doppler effect. If a planet orbits the star, then one expects periodic variation in the velocity. However, the surface of a star is not uniform: Spots and faculae break the flux balance between the approaching and receding limb of the star; furthermore, they inhibit convective blueshift. As a consequence, during their lifetime, stellar features introduce RV signals that might mimic planetary ones, in particular at the stellar rotation period or its harmonics (e.g. Saar & Donahue 1997; Boisse et al. 2011; Meunier et al. 2010; Dumusque et al. 2014), but also possibly at periods apparently unrelated to the stellar rotation (Nava et al. 2020).

One approach to this problem consists in trying to improve the data model and the detection metrics so that a significant detection has a meaning as close as possible to the detection of the signal of interest (here, a planet). In RV data analysis, stellar activity is typically modelled with a Gaussian process. This approach is taken in Haywood et al. (2014), Rajpaul et al. (2015), Jones et al. (2017), Gilbertson et al. (2020), and Hara et al. (2022). The stellar activity models are, however, imperfect, and another approach consists in being more agnostic to the form of stellar activity. For instance, Gregory (2016) suggests computing the Bayesian evidence of an apodised model, that is, a periodic planetary signal model multiplied by a function of the form e(tt0)2τ2$\textrm{e}^{-\frac{(t-t_0)^2}{\tau^2}}$, where τ and t0 are free parameters. A signal is claimed to be of planetary origin if it is significant and if the posterior probability of the event τTobs, where Tobs is the total time span of observation, is greater than a given threshold. The rationale is that a planetary signal is a purely periodic signal and should therefore be identical from the beginning to the end of the dataset. In a periodogram setting, Schuster (1898); Mortier & Collier Cameron (2017) adopt a similar approach. They suggest computing the periodogram by adding one point at a time and determining whether the significance of a peak at a certain period grows steadily with the number of points.

One canenvision other diagnostics for determining if an apparently periodic signal presents signs of variability. In the present work we present tests to assess the timescale of a signal and to determine if its period, phase, and amplitude are constant. We suggest tests based on the periodogram and Bayesian formalisms.

The article is organised as follows. In Sect. 2, we define the diagnostics mentioned above. In Sect. 3, we show examples of applications to the solar HARPS-N data (Dumusque et al. 2021), HD 215152 (Delisle et al. 2018), HD 69830 (Lovis et al. 2006), and HD 13808 (Ahrer et al. 2021). We finally present our conclusions in Sect. 4.

2 Methods

2.1 Signal timescales

2.1.1 Apodised sinusoids periodograms

In this section, we define the apodised sinusoids periodogram (ASP) of a time series y=(y(tk))k=1N$\bm y = (y(t_k))_{k=1\ldots N}$. Following the method of Baluev (2008) to define periodograms, we consider two alternative models of the time series y: a linear base model μH$\mu_{\mathcal{H}}$ of p parameters, and a model μK$\mu_{\mathcal{K}}$ that includes H$\mathcal{H}$ plus an apodised sinusoidal component. The base model, as a vector with N components, is defined as μH(θH)=φHθH,\begin{align*} \bm \mu_{\mathcal{H}}(\bm \theta_{\mathcal{H}}) = \mathbf{\varphi}_{\mathcal{H}} \bm \theta_{\mathcal{H}},\end{align*}(1)

where φH$\mathbf{\varphi}_{\mathcal{H}}$ is an N × p matrix and θH$\bm \theta_{\mathcal{H}}$ is the vector of p parameters to be fitted. For instance, the matrix ϕH$\mathbf{\phi}_{\mathcal{H}}$ can be simply taken as a column vector with all N entries equal to one, thus modelling an offset. If one assumes that the sinusoidal signal of frequency ω is in the data by default, ϕH$\mathbf{\phi}_{\mathcal{H}}$ can be defined as an N × 3 matrix with line i equal to [1, cos ωti, sin ωti]. The alternative model is defined as μK(θH,θ)=φHθH+μ(t,θ),\begin{align*} \bm \mu_{\mathcal{K}}(\bm \theta_{\mathcal{H}}, \bm \theta) = \varphi_{\mathcal{H}} \bm \theta_{\mathcal{H}} + \bm \mu(t, \bm \theta),\end{align*}(2)

where μ(t, θ) is an apodised periodic function. We compared the hypotheses H$\mathcal{H}$: y=μH(θH)+ϵ$\bm y = \bm \mu_{\mathcal{H}}(\bm \theta_{\mathcal{H}}) + \bm \epsilon$ for some θH$\bm \theta_{\mathcal{H}}$ and K$\mathcal{K}$: y=μK(θK)+ϵ$\bm y = \bm \mu_{\mathcal{K}}(\bm \theta_{\mathcal{K}}) + \bm \epsilon$ for some θK$\bm \theta_{\mathcal{K}}$, where ϵ is a Gaussian noise model (or random variable) of covariance matrix V.

In the present work, μ is specified as μ(t,ω,τ,t0,A,B)=w(τ,t0)(Acosωt+Bsinωt),\begin{align*} \mu(t, \omega, \tau,t_0, A,B) = w(\tau, t_0) (A \cos \omega t + B \sin \omega t),\end{align*}(3)

where w(τ, t0) is the apodisation function. Here we use Gaussian and box-shaped functions, that is, wG(τ,t0):=e(tt0)22τ2wB(τ,t0):=1[t0τ2,t0+τ2](τ,t0), \begin{align} w_G(\tau, t_0) &:= \textrm{e}^{-\frac{(t-t_0)^2}{2\tau^2}}\\ w_B(\tau, t_0) &:= \mathbbm{1}_{\left[t_0 - \frac{\tau}{2}, t_0 + \frac{\tau}{2}\right]} (\tau, t_0),\end{align}

where 1X(x)$ \mathbbm{1}_X(x)$ is a function that is zero everywhere except when xX, but other choices are possible. One can envision further generalisations. Indeed, the periodic signal need not besinusoidal. It can be, for instance, Keplerian (Baluev 2015). Also, the base model H$\mathcal{H}$ can be generalised to non-linear models.

Considering a model μi, i=H$i= \mathcal{H}$ or K$\mathcal{K}$ with parameters θi, and column vector residuals r = yμi(θi), the χ2 is defined as rT V−1r, where the suffix T denotes the matrix transpose. In the following equation, χH2$\chi^2_{\mathcal{H}}$ is the χ2 of the residuals after fitting the parameters θH$\bm \theta_{\mathcal{H}}$ of model (1), and, for a set of fixed ω, t0, and τ, the χK2(ω,t0,τ)$\chi^2_{\mathcal{K}}(\omega, t_0,\tau)$ is the χ2 of the residuals after fitting the parameters θH,A,$\bm\theta_{\mathcal{H}}, A,$ and B of model (2), on which the model depends linearly. We now define a periodogram-type counterpart to that of Gregory (2016): z(ω,t0,τ)=χH2χK2(ω,t0,τ), \begin{align*} z(\omega, t_0,\tau) &= \chi^2_{\mathcal{H}} - \chi^2_{\mathcal{K}}(\omega, t_0,\tau),\end{align*}(6)

which represents the improvement of the fit gained by adding the μ component compared to the base model H$\mathcal{H}$. In the following sections we use the following representation. For a given τ we represent the quantity z(ω,τ)=maxt0z(ω,t0,τ). \begin{align*} z\prime(\omega,\tau) &= \max\limits_{t_0} z(\omega, t_0,\tau).\end{align*}(7)

The values of z′(ω, τ) are overplotted for different values of τ. In Fig. 1 we represent the values of z′ for four values of τ for the solarHARPS-N data (see Sect. 3.1 for more details). We call such a figure an apodised sinusoids periodogram (ASP).

Our definition of Eqs. (3) and (4) is similar to a continuous wavelet transform (Grossmann & Morlet 1984) applied to irregularly sampled time series (Foster 1996). Our choice of Eq. (4) is inspired by the Gabor wavelet, which minimises the product of time and Fourier domain variance (Daugman 1998). However, in the wavelet transform context the timescale, τ, and frequency, ω, are dependent on each other. Indeed, a family of wavelet (ψa,b)a,b$(\psi_{a,b})_{a,b}$ is defined as the translation and dilation of a mother wavelet ψ, ψa,b(x)=ψ((xb)/a)/a$\psi_{a,b}(x) = \psi((x-b)/a)/\sqrt{a}$. In our case, the lifetime τ of a quasi-periodic feature is not unequivocally linked to its frequency, ω. As a consequence, even though having three free parameters instead of two introduces an extra computational load, we leave ω, τ, and t0 as independent parameters.

2.1.2 Grid

Regular periodograms are usually computed on a grid of equispaced frequencies, ωk = kΔω. The choice of Δω depends on the typical width of the periodogram peaks, which is approximately equal to 2πTobs, where Tobs is the total time span of the observations. Then, Δω is chosen as 2πTobs * 1∕Nω, where Nω is called the oversampling factor.

For the ASPs we need to construct a grid in three dimensions: ω, t0, and T. The naive approach, consisting in having a fixed grid for each parameter and trying all the possible combinations, is unnecessarily computationally expensive. Indeed, for a given timescale, T, such that T < Tobs, the typical width of the peaks in ω is ≈ 2πT > 2πTobs. To set the grid, the approach adopted here is to define the following quantities: (i) a maximum frequency, ωmax ; (ii) two oversampling factors, Nω and Nt ; and (iii) a fixed number α < 1 and a number of timescales Nτ such that the grid of τ is defined as τk = αk−1Tobs, k = 1…Nτ. We define two reference grids for ω and t0 as ωref=(ωk)k=1nω$\omega^{\mathrm{ref}} = (\omega_k)_{k=1\ldots n_{\omega}}$ and t0ref=(t0k)k=1n$t_0^{\mathrm{ref}}= ({t_0}_k)_{k=1\ldots n}$, where ωk = kΔω, with Δω = 2πTmaxNω and t0k=kΔt0${t_0}_k = k\Delta t_0$, with Δt0 = τminNt.

For eachτk, the waveletogram is computed on a sub-grid of ωref and t0ref$t_0^{\mathrm{ref}}$, denoted by ωτ and t0τ$t_0^{\tau}$. These are defined as ωkτ=ωksωτref$\omega^{\tau}_k = \omega^{\mathrm{ref}}_{k s_{\omega}^{\tau}} $ and t0τk=t0refkstτ${t_0^{\tau}}_k = {t_0^{\mathrm{ref}}}_{k s_t^{\tau}} $, where sωτ=τmax/τ$ s_{\omega}^{\tau} = \lfloor \tau_{\mathrm{max}}/\tau \rfloor$ and stτ=τ/τmin$ s_t^{\tau} = \lfloor \tau/\tau_{\mathrm{min}} \rfloor$. The rationale behind these numbers is that, for a given τ, the grid spacing in ω is approximately 2πτNω and the grid spacing in t0 is τNt. Secondly, since the omega grid is not recomputed for every timescale, τ, one only has to compute the cos ωt and sin ωt terms once. Once this method is specified, one has to choose specific values for Nω, Nt, ωmax, α, and nT. The list of symbols and, when relevant, the default values we chose are reported in Table 1.

Table 1

Symbols.

2.1.3 Statistics

In Gregory (2016), one computes the Bayesian evidence (or marginal likelihoods) of models containing k periodic components multiplied by an apodisation factor e(tt0)22τ2$ \textrm{e}^{-\frac{(t-t_0)^2}{2\tau^2}} $. A planet detection is claimed if a signal is statistically significant (i.e. if the ratio of evidence of the model with it to the evidence without it is greater than a certain threshold) and if the posterior distribution of τ favours values greater than the total time span of the observations.

We here propose an alternative way to estimate the timescale on which the signal remains coherent, which can be computed very quickly from the periodogram. Denoting by tτ,ω the value of t0 and maximising the value of periodogram (6) for a given ω and τ, we computed the distribution of Dz=z(ω,tτ,ω,τ)z(ω,tτ,ω,τ)$D_z = z(\omega, t_{\tau,\omega}, \tau) - z(\omega, t_{\tau\prime,\omega}, \tau\prime)$ with the hypothesis that model K(ω,tτ,ω,τ,A,B)$\mathcal{K}(\omega, t_{\tau,\omega}, \tau, A^{\star}, B^{\star})$ is correct, where the fitted cosine and sine amplitude A, B are obtained by fitting model K$\mathcal{K}$ to the data. The distribution of Dz can easily be expressedas a generalised χ2 distribution, and its mean and variance is given by an analytical expression, which is given in Eqs. (A.9) and (A.10). This method does not take into account that ω is chosen as a maximum of the periodogram and could be refined in future work.

Secondly, one can define a FAP associated with Eq. (6), the same ways as done for a regular periodogram. In the remainder of the article, we do not use the FAP for ASPs; however, we use a FAP for regular periodograms (Baluev 2008; Delisle et al. 2020a). For the sake of conciseness, the method is described in Appendix B.

In general, assuming that the signal is in the form of the model K$\mathcal{K}$ (Eq. (2)) will be an incorrect assumption. In particular, if aliasing cannot be neglected and the data contain several signal sources, not accounting for them might result in an inaccurate uncertainty on the timescale, τ, of the signal of interest. We suggest including all the periodic signals found to be significant in the base model (we give an example of this procedure in Sect. 3). Alternatively, at the price of a higher computational cost, one can search for several signals simultaneously, as shown in the next section.

2.1.4 Bayesian approach

In Gregory (2016), a detection is claimed if the posterior probability of the event τ > Tobs is greater than a certain threshold. However, such a criterion can be misleading. For instance, there might be datasets in which there are a few isolated measurements that are separated from the bulk of the measurements by a large time gap. In that case, most of the samples could correspond to τ < Tobs, though there is a possibility that the signal is consistent with a fully periodic signal. In addition to the posterior distribution of τ, we suggest considering a quantity f that quantifies the fraction of information captured by a certain window. We computed the posterior distribution of f=w(τ,t0)TV1w(τ,t0)1TV11,\begin{align*} f = \frac{\bm w(\tau, t_0)^T \mathbf{V}^{-1} \bm w(\tau, t_0)}{\bm 1^T\mathbf{V}^{-1} \bm 1},\end{align*}(8)

conditioned on the frequency of the planet being within ± Δω of a given frequency of interest. In Eq. (8), w=(w(ti))i=1N$\bm w = (w(t_i))_{i=1\ldots N}$ is the apodisation window (see Eq. (3)), V is the covariance matrix, and 1 is a vector of size N. The values of the vector w are between 0 and 1. If f is close to 1, then the apodised signal is close to a strictly periodic signal. On the contrary, a value of f close to 0 implies that the w has concentrated information in time. The definition of f is valid for any apodised signal – in particular, the planet model can be circular or Keplerian – and for all the signals in the data.

In Gregory (2016), the number of planets is determined by checking that the Bayes factor comparing the n + 1 and n planets model is greater than a certain threshold. The significance of the planet can also be established with the false inclusion probability (FIP; Hara et al. 2022), defined as follows. Considering a certain parameter space S (for instance, a period and eccentricity interval), the true inclusion probability (TIP) is the probability of having a planet with orbital elements in S marginalised over the number of planets in the system. The FIP is defined as 1−TIP. Instead of claiming a detection if the Bayes factor is above a certain threshold, we claim a detection if the probability of not having a signal with a period in [ω − Δω, ω + Δω] – marginalised on all parameters, including t0 and τ – is below a certain threshold.

thumbnail Fig. 1

ASPs computed iteratively on the solar data. Left column: ASPs corresponding to Eq. (7) for different values of τ. Middle column: zoomed-in view of the maximum peak of the ASP. Right: statistical test on the timescale described in Sect. 2.1.3. On the x axis we represent the ratio of τ to the total observation time assumed to be true. For each τi, i = 1…4 in abscissa, the position of the markers corresponding to τj, j = 1…4 is represented as the value of the periodogram peak for τi minus the expectancy of Dz=z(ω,tτ,ω,τi)z(ω,tτj,ω,τj)$D_z = z(\omega, t_{\tau,\omega}, \tau_i) - z(\omega, t_{\tau_j,\omega}, \tau_j)$, and the error bar is the square root of the variance of Dz. Informally, the markers represent what the periodogram values at τj should be if τi in abscissa were the true timescale.

2.2 Amplitude and phase consistency

The second diagnostic we present in this work relies on computing the phase and amplitude of a signal at a given period with a moving time window. We consider, as in Sect. 2.1.1, the linear models H$\mathcal{H}$ Eq. (1) and K$\mathcal{K}$ Eq. (2). For the function μ, we adopt the definition of Eq. (3), where the window function w is box shaped. For each value of ω, t0, and τ, one has estimates of parameters A and B (Eq. (3)). The uncertainties on A and B are obtained from the diagonal elements of the covariance matrix of the fit. From the estimates of A and B, we compute the local semi-amplitude K and phase ϕ K(ω,t0,τ)=A2+B2ϕ(ω,t0,τ)=atan2(B,A), \begin{align} K(\omega, t_0, \tau) &= \sqrt{A^2 + B^2}\\ \phi(\omega, t_0, \tau) &= \mathrm{atan2}(-B,A),\end{align}

as well as the uncertainties on K and ϕ propagated from those on A and B with a simple Monte Carlo simulation. Alternatively, one can estimate the uncertainties on A, B, K, and ϕ with their posterior distributions, computed with a Monte Carlo Markov chain algorithm, which is more computationally expensive but propagates the uncertainties on the orbital elements more accurately.

For a given frequency ω and timescale τ, the quantities A,B, ϕ, and K and their uncertainties (computed from the least square fit or posterior distributions) can be used to check the consistency of the phase and amplitude of a signal at a given period. Assuming that the signal contains a pure sine at ω, the values of A and B or K and ϕ as a function of t0 should be consistent with the hypothesis that A and B or K and ϕ are constant.

Besides a visual inspection of A and B or K and ϕ as a function of t0, the hypothesis that the signal has a constant phase or amplitude can be checked with statistical tests. In particular, one can isolate values of A and B estimated in different windows. Here we denote estimates of A and B in disjoint windows with (Ai)i=1M$(A_i)_{i=1\ldots M}$ and (Bi)i=1M$(B_i)_{i=1\ldots M}$. If one neglects correlations that might happen between those estimates due to correlated noise, under the hypothesis that phase and amplitude are constant, then the statistics χA2=i=1M(AiμA)2σAi2χB2=i=1M(BiμB)2σBi2, \begin{align*} \chi^2_A &= \sum\limits_{i=1}^M \frac{(A_i - \mu_A)^2}{\sigma_{A_i}^2} \\ \chi^2_B &= \sum\limits_{i=1}^M \frac{(B_i - \mu_B)^2}{\sigma_{B_i}^2},\end{align*}

where μX=Xiσi2/1σi2,\begin{align*} \mu_X = \sum \frac{X_i}{\sigma_i^2} \left/ \sum \frac{1}{\sigma_i^2}, \right.\end{align*}(13)

with X = A or X = B, should follow a χ2 distribution with M − 1 degrees of freedom. We can measure the quantiles of χA2$\chi^2_A$ and χB2$\chi^2_B$: If they are below a certain threshold, then the hypothesis that they are constant through time can be rejected. A χ2 distribution with M − 1 degrees of freedom has mean M − 1 and standard deviation 2(M1)$\sqrt{2(M-1)}$. For a quick diagnostic, we suggest computing the quantity Nσχi2=χi2(M1)2(M1),i=A,B,\begin{align*} N\sigma_{\chi^2_i} = \frac{\chi^2_i - (M-1)}{\sqrt{2(M-1)}}, \;\;\; i = A, B,\end{align*}(14)

which is the difference between the χi2$\chi^2_i $ and its expected value divided by the standard deviation.

In the test suggested in this section, the time window w in Eq. (3) is assumed to be a box-shaped one. One could envision a more general one. However, for a consistent analysis, since the noise is also multiplied by the window, the covariance matrix of the noise would have to be modified according to the chosen window. If it is non-zero everywhere, then this comes down to doing the analysis of the data without any window, unless some numerical errors lead the covariance to be non-invertible. In that case, using a window comes down to a rectangular window. As a consequence, we here only consider box-shaped time windows (see Eq. (5)).

We stress that to evaluate the phase and amplitude consistency of a signal, one must have an accurate estimate of the period of the signal to analyse. If it is poorly estimated, then the phase will spuriously appear as a linear phase drift. As a consequence, we advise selecting the value of ω to be studied in depth from the regular periodogram containing all the data.

As mentioned in Sect. 2.1.3, in general, assuming that the signal is in the form of the model K$\mathcal{K}$ (Eq. (2)) will be an incorrect assumption, and aliasing can be a problem. For instance, if the signal contains several sources that are strictly periodic, due to aliasing, the phase and amplitude of the signal under consideration might appear to vary significantly even though the signal is periodic. As a consequence, the fact that the quantities defined in Eq. (12) deviate from the mean is not always interpretable as a test of the hypothesis that the signal at frequency ω is constant in phase and amplitude. One of the assumptions in K$\mathcal{K}$ is the noise model; however, it might not be known and needs to be adjusted to the data. We suggest including in the base model all the periodic signals found to be significant; this is elaborated on in Sect. 3.

In addition to testing for the consistency of the phase and amplitude of a signal, it might be interesting to study its relative phase and amplitude with another signal. For instance, one might want to see the evolution of the phase difference between the RV and logRHK$\log R\prime_{\textrm{HK}}$ as a function of time to study the stellar activity. An example of such an analysis is given in Sect. 3.1.

2.3 Period consistency

Assuming a sinusoidal shape of the signal, the last parameter of an apparently periodic signal whose consistency in time can be evaluated is the period. A convenient representation is, for a given timescale τ, a colour map z(ω, t0, τ) as defined in Eq. (6). For a given quasi-periodic signal, a local change in the χ2 value mightbe due to a clustering or spacing of the observation time. A convenient representation for jointly evaluating period and amplitude consistency is the fitted semi-amplitude K:=A2+B2$K :=\sqrt{A^2 + B^2}$ (see Eq. (3)) as a function of ω and t0. To give meaningful diagnostics, it might be suitable to mask the values of K with an uncertainty above a certain threshold.

We suggest another test. Considering a frequency range [ωl, ωr] containing the frequency of a candidate periodic signal ω0, for a given timescale τ, we plot ω(t0):=argmaxω[ωl,ωr]z(ω,t0,τ)\begin{align*} \omega^{\star} (t_0) := \arg \max\limits_{ \omega \in [\omega_l,\omega_r]} z(\omega, t_0, \tau)\end{align*}(15)

as a function of t0, where wB is defined by Eqs. (3) and (5).

As in Sect. 2.2, one can test the hypothesis that approximately independent measurements of ω stem from the same distribution. One can envision a further generalisation where one looks for shape variations in periodic signals. However, in the cases considered here, the signal-to-noise ratio would not allow conclusions to be drawn on this aspect. As a consequence, we do not consider it here.

We note that estimating the local phase, amplitude, and frequency of a signal is a problem also encountered in signal demodulation in the context of telecommunications (e.g. Madhow 2008). Phase consistency tests have been suggested in the context of discrete events, more specifically γ ray pulsars, to detect phase shifts relative to an expected position (de Jager 1994). Assessing whether period and amplitudes are constant with time has been done for Cepheids (Süveges & Anderson 2018b,a)

3 Applications

3.1 Solar data

3.1.1 Data analysis

In this section, we apply our methods to the HARPS-N solar data (Dumusque et al. 2021). This dataset contains the RV of the Sun measured with the HARPS-N solar telescope (Dumusque et al. 2015). This dataset spans three years, from BJD 2457222.1788 to 2458315.9982, and contains 34 550 measurements with a median interval between two samples of 5 min 38 s and a median nominal error of 0.22 m s−1. The data are publicly available on the Data Analysis Center for Exoplanets (DACE) platform1. We binned the data by half days, which means that data acquired between midnight and noon and between noon and midnight (local time at La Palmaobservatory) were averaged, weighted by their nominal uncertainties. We considered a Gaussian, uncorrelated noise model, with a 1 m s−1 jitter added in quadrature to the nominal uncertainties.

We first performed an iterative search of signals using the ASPs defined in Sect. 2.1.1. We considered four timescales and computed Eq. (6) for τTobs = 10, 1/3, 1/9, and 1/27, where Tobs is the total time span of the observations. The grid in t0 and ω is defined asin Sect. 2.1.2. We computed the ASP and then added to the base model the model corresponding to the maximum peak.

The left panel of Fig. 1a shows the first ASP, which presents a maximum at period ω(0)= 9000 days for τ(0) = Tobs∕9 and t0(0)$t_0^{(0)}$. We then added w(τ(0),t0(0))cosω(0)t$w(\tau^{(0)}, t_0^{(0)})\cos \omega^{(0)} t$ and w(τ(0),t0(0))sinω(0)t$w(\tau^{(0)}, t_0^{(0)})\sin \omega^{(0)} t$ to the base model and computed the ASP, obtaining Fig. 1b. The same operation was repeated, yielding Figs. 1c and d. In Figs. 2a–d, we represent the models that correspond to the highest peaks of Figs. 1a–d, respectively.

The middle panel of Fig. 1 presents a zoomed-in view of the maximum peak of the successive ASP. Horizontal dashed lines correspond to the values of z at the frequency where the maximum is attained. For instance, in (b) the maximum value is attained at period = 13.39 days. The blue, orange, red, and green horizontal lines represent the level of the periodogram peaks with τTobs = 10, 1/3, 1/9, and 1/27 at P(1) = 13.39 days. We found empirically that a downsizing factor of 1/3 for τ allows a reasonable compromise between resolution and grid size. Finally, the right panel represents the statistical test presented in Sect. 2.1.3. The four timescales corresponds to a value of τ, reported in abscissa. We now turn to Fig. 1b. For each timescale τ, we assumed that the data contain a signal at frequency P(1) and timescale τ. The points with error bars correspond to the expected value of the periodogram peak and its standard deviation, assuming τ is the correct timescale. For instance, in Fig. 1b, assuming τ = Tobs∕3 = 365 days would lead us to expect a z value with τ = Tobs over four sigmas away from the observed one. It then appears that the 13.39 day signal is consistently detected on the whole dataset. In comparison, signals at 9000 days, 158 days, and 26.9 days (Figs. 1a, c, and d) seem to be localised in time. The peaks at 26.9 and 13.39 days are compatible with the first and second harmonic of the solar rotation period. Low frequency structures are expected to come from correlated patterns stemming either from the star or the instrument.

The 13.39 day signal seems the most coherent of the four signals found. To further estimate if it is purely periodic, weapplied the methods of Sects. 2.2 and 2.3. As suggested in Sect. 2.3, we studied the consistency of the signal frequency by representing the values of z(ω, t0, τ) (see Eq. (6)) and the amplitude K(ω, t0, τ) for a given τ as a colour map. In Fig. 3 we represent z ((a) and (b)) and K ((c) and (d)) for τ = Tobs∕3 = 365 days ((a) and (c)) and τ = Tobs∕9 = 121 days ((b) and (d)). The base model is identical to the one used to compute Fig. 1b. It appears that the 13 day signal is present throughout the dataset. We notice that its apparent waning towards the edge of the dataset is due to the fact that when the t0 reaches the beginning or end of the time series, the window includes half as many points compared to the middle of the time series. Signals at ~ 160 and ~ 25 days also seem tobe present but with more variability, which is consistent with the analysis of Fig. 1. In Fig. 3, we represent the quantity defined in Eq. (15), that is, the local maximum of z between certain frequencies as a function of t0 for fixed τ (τ = Tobs∕3 = 365 days for (c) and τ = Tobs∕9 = 121 days for (d)). We chose a collection of frequencies, ωk = kΔω, which are represented in grey in Figs. 3c and d. The maxima of z between 10 and 20 days consistently occurs at 13.39 days.

The 13 day signal seems to be consistently present in the data and to have a steady period. We further examined whether its phase, ϕ, and semi-amplitude, K, as defined in Eqs. (10) and (9) are constants of time. To avoid being polluted by other signals, we included a 26.9-day signal as well as a ninth-order polynomial in the base model, filtering out low frequency signals. In Figs. 4a and b, we represent the evolution of ϕ and K in red and green, respectively, as a function of t0 for fixed τ (τ = Tobs∕3 = 365 days for (a) and τ = Tobs∕9 = 121 days for (b)); the uncertainties are represented with colour-shaded areas. Dashed red and green lines represent the mean values of ϕ and K. Dots corresponds to measurements done on disjoint box-shaped windows, which are thus approximately statistically independent, as shown in Appendix A.2. In Figs. 4c and d, we represent the values of A and B as defined in Eq. (3), with their uncertainties in blue (τ = Tobs∕3 = 365 days for (c) and τ = Tobs∕9 = 121 days for (d)). Points in orange correspond to the measurement times marked with dots in Figs. 4a and b, which yield approximately independent estimates of A and B. At the shortest timescale, it appears that Nσχ2$N\sigma_{\chi}^2$ as defined in Eq. (14) is greater than 5 (see Fig. 4d). In Fig. 4b the variation in amplitude does seem to significantly vary over time, with two peaks around BJD 2457400 and BJD 2458000, which is consistent with the behaviour seen in Fig. 3b. The statistical tests depend on the assumed noise model. To obtain more robust statistics, we performed the same analysis but adjusting an extra jitter term at each trial t0 and propagated the uncertainty on K, ϕ, A, and B. The results are very similar and qualitatively unchanged. As a conclusion, it appears that the 13.39 day signal has a long timescale and a consistent period and phase, but its amplitude varies significantly with time.

thumbnail Fig. 2

Models corresponding to the maximum of the ASPs in Fig. 1.

3.1.2 Discussion

From a phenomenological point of view, the presence of the 13.39-day signal, as well as the one at ~25 days, can be fully understood by the presence of active regions rotating with the solar surface, as those two periods corresponds to the second and Prot term. By locally modifying the flux intensity of the solar surface (e.g. Saar & Donahue 1997) and by changing convection (e.g. Meunier & Lagrange 2013), the spot and the faculae on the surface of a solar-like star will impact RV data, with a semi-periodic signal that can be decomposed as a Fourier series, thus as a sum of periodic signals at the rotation period of the Sun and its first harmonics (Boisse et al. 2011).

It is often observed on solar-like stars that activity injects more power at Prot∕2 than its rotation period Prot (e.g. Boisse et al. 2011). This can be explained by the fact that not just one but several active regions perturb the RV measurement at the same time, but also because the Sun is seen equator-on; thus, active regions are seen only during half the rotation. On a star such as the Sun, the main contribution of the RV stellar activity signal comes from faculae (e.g. Meunier et al. 2010; Dumusque et al. 2014; Collier Cameron et al. 2019; Milbourne et al. 2019), which induce a strictly positive RV offset whatever their position on the solar disc, and the effect of an active region, in first approximation, can be viewed as a sinusoid truncated atzero (e.g. Meunier et al. 2010; Dumusque et al. 2014). This truncation alone cannot explain the predominance of the Prot∕2 term, as the Fourier expansion of such a truncated sinusoid has a stronger Prot term than Prot∕2, where Prot denotes the stellar rotation period. If spots are at different longitudes, the RV signal would resemble a sum of truncated sine signals with different phases, and random phase differences do not preferentially affect a given harmonic. However, a phase difference of two signals of 180° cancels out the first harmonic. As noted in Borgniet et al. (2015), there are generally two persistent active longitudes per hemisphere, shifted by 180° (Berdyugina & Usoskin 2003), and this might explain the second harmonic. We add that even if there are several such configurations of two diametrically opposed active longitudes at different phases, the sum of their contributions has a vanishing component at Prot. The presence of active regions on the surface, not in opposite phase but randomly distributed, may introduce a signal at Prot. We finally note that the RV effect of active regions is not only due to convective blueshift inhibition, but also to their difference in brightness relative to the surroundings, introducing an asymmetry in approaching and receding limbs. This signal of photometric origin resembles a sine function with period Prot∕2 on [kProt∕2, (k + 1)Prot∕2] for k even and equal to 0 for k odd (e.g. Dumusque et al. 2014). This truncated signal has a dominant power at Prot∕2, which might partly explain the strong Prot∕2 signal in Fig. 1b.

The change in amplitude and stability in phase can be explained by the presence of a few very large active regions that last for several months on the solar surface. The induced RV effect will be large and the signal consistent over a long period of time. The fact that the amplitude is large around BJD 2457400 can be explained by the Sun being active in 2015, with a lot of large active regions on its surface. The lower χ2 closer to the beginning of the time series around BJD 2457217 is due to an edge effect. The sudden increase in amplitude around BJD 2458000 can be explained by the last large active region that appeared on the solar surface before the Sun reached the minimum of magnetic cycle 24 (clearly seen in the Log(RHK$R_{\textrm{HK}}^{\prime}$) time series in Fig. 6 of Dumusque et al. 2021).

Overall, our tentative explanation of the strength of the 13.39 day signal, its phase stability, and its amplitude variation is that stable magnetic regions diametrically opposed might have caused it. This speculative explanation requires further work to be properly accepted or rejected. We note that, thankfully, the amplitude of the solar 13.39 day harmonic is more variable than its phase, making it less likely that such a signal in stellar data could be mistaken for a planet.

thumbnail Fig. 3

Consistency of the period of signals in the solar data. (a and b): value of the χ2 defined in Eq. (6) for the solar HARPS-N RV data as a function of t0 for τ = 365 (=Tobs ∕3) days (a) and τ = 121 (=Tobs ∕9) days (b). (c and d): value of the semi-amplitude as a function of t0 for τ = 365 (=Tobs ∕3) days (c) and τ = 121 (=Tobs ∕9) days (b). Grey areas correspond to 1 σ uncertainties greater than 30 cm s−1. (e and f): in blue, position of the mode of the ASP between frequencies represented in grey as a function of t0, as defined in Sect. 2.3. Computed for τ = 365 (=Tobs ∕3) days (c) and τ = 121 (=Tobs ∕9) days (d).

3.1.3 Radial velocity and log R'_HK$\prime_{\textrm{HK}}$

We now compare the RV and logRHK$\log R\prime_{\textrm{HK}}$ time series. We note that there is no apparent signal at 13-14 days in the logRHK$\log R\prime_{\textrm{HK}}$ time series. In Fig. 5 we show the χ2 map of the solar logRHK$\log R\prime_{\textrm{HK}}$, binned with the same pattern as the RV. The base model is a second-order polynomial, and we added a jitter to the noise model equal to the standard deviation of the logRHK$\log R\prime_{\textrm{HK}}$ time series divided by 2. In Fig. 5, there are low frequency signals at 200–400 days as well as signals close to the stellar rotation period, but none close to its first harmonic. As for the RV, we then performed an iterative search for signals (six iterations) and did not find signals at 13 days.

Furthermore, as suggested in Sect. 2.2, we studied the phase difference of RV and logRHK$\log R\prime_{\textrm{HK}}$. In both time series, there is a signal at the solar rotation period. We fixed P = 26.25 days and computed the difference Δϕ RV phase - logRHK$\log R\prime_{\textrm{HK}}$ as a function of t0 for τ = 364 (=Tobs∕3) days, wherethe phases are defined in Eq. (10). The uncertainty on Δϕ is defined as the quadratic addition of the uncertainties on the RV and logRHK$\log R\prime_{\textrm{HK}}$ phases. These were computed by fitting a free jitter error term for each choice of t0. In Fig. 6, we represent the phase difference as a function of t0 in purple, the shaded areas represent ± 1σ uncertainties, and the dots represent statistically independent phase measurements. The average difference of phase is 20 ± 6°, and this difference seems approximately constant throughout the observations. The RV 26 day signal is consistently in advance on the logRHK$\log R\prime_{\textrm{HK}}$ by 1.5 ± 0.5 days.

thumbnail Fig. 4

Phase and amplitude of the 13.39 day signal as a function of the time centre of the window t0 for the solar HARPS-N data, as described in Sect. 2.2 (solid red and green lines, respectively). Markers correspond to statistically independent estimates. Dashed red and green lines represent the average phase and amplitudes. The figures are all computed for a timescale of τ = 539 days, corresponding to τTobs = 1/9. Plots (a), (b), (c), and (d) are computed by adding iteratively the signals corresponding to the highest peaks to the base model.

thumbnail Fig. 5

Value of the χ2 defined in Eq. (6) for the solar HARPS-N logRHK$\log R\prime_{\textrm{HK}}$ as a functionof t0 for τ121 (=Tobs ∕9) days.

3.2 HD 215152

The star HD215152 was observed with the HARPS spectrograph over a 13 year period, from BJD 2452808 to 2457659. The data consist of 373 data points with mean error 0.73 m s−1. The data are shown in Fig. 7. HARPS experienced an update of optical fibres in May 2015, which introduced a velocity offset. Data taken before and after the fibre update, labelled HARPS03 and HARPS15, are represented in blue and red, respectively.This system has been studied in Delisle et al. (2018), who claim the discovery of four super-Earths with orbital periods of 5.76, 7.28, 10.86, and 25.20 days. The analysis of Delisle et al. (2018) proceeds as follows. The data were modelled as a sum of Keplerian signals and two offsets: one corresponding to HARPS03 data and one to HARPS15 data. The logRHK$\log R\prime_{\textrm{HK}}$ (Noyes 1984) was smoothed with a low-pass filter, and the resulting time series as well as the residuals (high-passed filter logRHK$\log R\prime_{\textrm{HK}}$) were used as linear predictors. The data were analysed with a Gaussian noise model whose kernel is defined as the sum in quadrature of the nominal uncertainties, a free jitter, and a correlated component with a Gaussian auto-covariance (or kernel). The auto-covariance is σR2eΔt2/(2τR2)$\sigma_R^2 \textrm{e}^{- \Delta t^2/(2\tau_R^2)}$, where Δt is the time difference between measurements and σR2$\sigma_R^2 $ and τR are free parameters. The significance of the signals was established by computing periodograms where σJ, σR, and τR are fitted at all trial frequencies. The periodogram was computed as the difference in the Bayesian information criterion (Schwarz 1978) of models H$\mathcal{H}$ and K$\mathcal{K}$. The rotation period of the star was estimated to be 43 days.

In this section, we present a re-analysis of the same data to assess the stability in amplitude, phase, and period of the detected signals. We have excluded the two points represented in black in Fig. 8 from the datasets on the ground that they deviate from the median of the HARPS15 data (blue points) by over four median absolute deviations.

To simplify the discussion, we assumed the same model as Delisle et al. (2018) and fixed the values of the noise parameters to their posterior medians, σJ = 0.6 m s−1 and σG = 1.2 m s−1, and the timescale of the noise is τR = 2 days (we note that this is different from the apodisation timescale τ). In the base model (1), we also included the low-pass filtered logRHK$\log R\prime_{\textrm{HK}}$ as well as the two offsets. We then computed the ASP (Eq. (6)) in four cases. In each one, we added to the base model six linear predictors, cos ω1t, sin ω1t, cos ω2t, sin ω2t, cos ω3t, and sin ω3t, where ω1, ω2, and ω3 are the frequencies of three planets out of four. This comes down to assuming that three planets are securely detected and the consistency of the fourth is tested. The frequency grid goes from 0 to 0.8 cycles per day to avoid showing the aliases of the peaks.

When leaving out the 5.76, 7.28, and 10.86 day planets from the base model, we obtain respectively Figs. 8 a, b, and c. We see clearly that the maximum occurs for the longest coherence time (in blue) for planets at 5.76 days and 10.86 days. In the case of the 7.28 day signal, the maximum occurs at τTobs = 1∕3. However, as shown in the right panel, the τTobs = 10 hypothesis is not excluded. We note that the sampling of the HARPS data is very irregular, with a few samples between BJD 2453000 and 2454500 (see Fig. 8). When leaving out the 25 day planet, we obtain Fig. 8d. In that case, the highest peak corresponds to 46 days. If the τ = 10Tobs were true, the periodogram peak corresponding to τ = Tobs∕3 would be ~2 σ away from the highest peak (see the right panel of Fig. 8d). This is not surprising, as 46 days is close to the estimated rotation period of the star. When subtracting the 46 day signal with τ = Tobs∕3, we obtain Fig. 8e. The maximum peak is attained at 25 days and appears to be consistent with τ = 10Tobs, and thus a planet. We note, however, that the FAP obtained with a regular periodogram of the 25 day planet is 1%, so it might be suitable to acquire more points for a more significant detection.

We performed the test described in Sect. 2.2 for each of the four cases. Figure 9 shows the resulting plot. It appears that the phase and amplitude of the signals are compatible with the hypothesis that they are constant in all cases.

thumbnail Fig. 6

Difference between the RV phase and logRHK$\log R\prime_{\textrm{HK}}$ phase as a function of time for τ = Tobs ∕3.

thumbnail Fig. 7

HD 215152 HARPS radial velocities. Data taken before and after the fibre update are represented in blue and red, respectively.

3.3 HD 69830

In this section, we illustrate the Bayesian method proposed in Sect. 2.1.4 with the HARPS03 data of HD 69830. This dataset contains 254 data points and spans 11.5 yr. The HD 69830 system is known to harbour three planets with minimum masses close to Neptune’s (Lovis et al. 2006) and periods of 8.667, 31.56, and 197 days.

As suggested in Sect. 2.1.4, we computed the joint posterior distribution of the orbital elements and the number of planets. The signal is represented as a sum of k Keplerian functions multiplied by an apodisation factor and a noise model (a random variable) y=i=1ke(tt0)22τ2Kepi(P,e,K,ω,M0)+ϵ,\begin{align*} \bm y = \sum\limits_{i=1}^k \textrm{e}^{-\frac{(\bm t-t_0)^2}{2\tau^2}} \text{Kep}_i(P,e,K,\omega, M_0) + \bm \epsilon,\end{align*}(16)

where Kep(P, e, K, ω, M0) is a Keplerian function (e.g. Perryman 2011) of period P, eccentricity e, semi-amplitude K, argument of periastron ω, and initial mean anomaly M0. We used a Gaussian noise model for ϵ with a jitter term and an exponential decay, implemented with the spleaf software (Delisle et al. 2020b). The covariance matrix V has the form Vij=δij(σi2+σW2)+σR2e(titj)22T2,\begin{align*} \mathbf{V}_{ij} = \delta_{ij} (\sigma_i^2 + \sigma_W^2) + \sigma_R^2\textrm{e}^{-\frac{(t_i-t_j)^2}{2T^2},}\end{align*}(17)

where δij is the Kronecker symbol and σi is the nominal uncertainty on measurement i. The priors on the parameters are presented in Table 2. As in Hara et al. (2022), the posterior distributions were computed with POLYCHORD (Handley et al. 2015b,a).

In Fig. 10, we represent the posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| < 1∕Tobs for i = 1, 2, 3 and P1 = 8.66 days, P2 = 31.56 days, and P3 = 197 days. Here, Tobs is the total time span of the observations. The τ distribution for the P3 = 197 day planets peaks below Tobs. In the f distribution (right) it is clear that all distributions peak at 1. In conclusion, all the three known planets are indeed consistently detected.

thumbnail Fig. 8

ASPs computed on the HD 215152 HARPS data. All are computed with a base model that includes a linear activity model and offsets, as described in Sect. 3.2. They also include three of the four planets. Rows (a), (b), (c), and (d) correspond respectively to leaving out planets at 5.75, 7.28, 10.86, and 25.20 days. Left column: ASPs corresponding to Eq. (7) for different values of τ (blue, orange, green, and red correspond to τTobs = 10, 1/3, 1/9, and 1/27). Middle column: zoom-in on the maximum peak of the ASP. Right: statistical test on the timescale. When subtracting the 46 day signal with τ = Tobs∕3 (period at which the maximum of the ASP is attained in (d)), we obtain (e) (on the following page), where a coherent 25.2 d signal appears.

thumbnail Fig. 9

Phase and amplitude as a function of the time centre of the window t0, as described in Sect. 2.2 (solid red and green lines, respectively). Markers correspond to statistically independent estimates. Dashed red and green lines represent the average phase and amplitudes. The figures are all computed for a timescale of τ = 539 days, corresponding to τTobs = 1∕9. Plots (a), (b), (c), and (d) correspond to base models that include offsets, linear activity indicators, and all the claimed planets except the 5, 7, 10, and 25 day planets, respectively.

thumbnail Fig. 10

Diagnostics of the consistency of prensence of signals in the HD 69830 HARPS dataset. Left and right: posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| = 1∕Tobs for i = 1, 2, 3 and P1 = 8.66 days, P2 = 31.56 days, and P3 = 197 days. The i = 1, 2, 3 correspondrespectively to the blue, orange, and green histograms. The black dotted line corresponds to τ = Tobs.

Table 2

Priors used for the computation of the FIP periodogram of HD 69830 and HD 13808.

3.4 HD 13808

The HD 13808 system was observed with HARPS (Mayor et al. 2003) over a 10 year time span. The dataset contains 246 RV measurements and presents three candidate signals, at 14.1, 19, and 53.7 days. In Ahrer et al. (2021), the HARPS dataset was analysed with many different noise models based on the bisector span (Queloz et al. 2001) and the logRHK$\log R\prime_{\textrm{HK}}$ (Noyes 1984). The stellar activity models were built upon the framework of Gaussian processes (Rajpaul et al. 2015) and the FF′ method (Aigrain et al. 2012). The models with different numbers of planets and different stellar activity models were compared through their Bayesian evidence. The authors conclude that the 14.1 and 53.7 day planets are securely detected and that the 19 day planet cannot be confidently claimed.

We next performed the same analysis as in Sect. 3.3. Following the method of Sect. 2.1.4, we computed the joint posterior distribution of ω, t0, τ, and the number of planets. We used the same noise model (see Eq. (17)) and the same priors (see Table 2) as for HD 69830 as well as the same numerical method, POLYCHORD (Handley et al. 2015b,a).

First, to check that the signals are significant, as in Hara et al. (2022), we computed the FIP periodogram. It is definedas follows. We considered a grid of frequency intervals with a fixed length. The element k of the grid, Ik, is defined as [ωk − Δω∕2, ωk + Δω∕2], where Δω = 2πTobs, Tobs is the total observation time span, and ωk = kΔωNoversampling. We took Noversampling = 5. We then computed the probability of having at least one planet in Ik (the TIP) and the FIP = 1 - FIP. We then represented − log10 FIP as a function of ωk.

As suggested in Hara et al. (2022), we computed the FIP periodogram for different maximum numbers of planets. We found that the FIP periodograms obtained for three and four planets are almost identical and chose the three-planet models. To assess the computational uncertainties, we computed the FIP periodogram with three different runs of POLYCHORD. The resulting FIP periodogram is shown in Fig. 11; each run corresponds to an FIP periodogram of a different colour. It appears that the probability of not having a planet at 14.1 and 53.7 days is below 10−10 and 10−2, respectively. The significance of the 18.9 day signal is much lower, with an FIP ~ 70%. We also did the same calculation with a white noise model, for which the 18.9 day signal is much more significant (FIP of 1%).

We then computed the posterior distribution of the apodisation timescale τ and the fraction of information captured by a window of given width f, as in Sect. 3.3. In Fig. 12 we represent the posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| < 1∕Tobs for i = 1, 2, 3 and P1 = 14.18 days, P2 = 19 days, and P3 = 53.8 days. We find that all signals have an f distribution peaking at 1.

In conclusion, in accordance with Ahrer et al. (2021), we find that the 14.1 and 53.7 day signals are statistically significant. The significance of the 19 day signal is too low to claim a detection. However, it appears that the signal is consistent in time, so a planetary origin cannot be excluded. However, this is not likely since, as shown in Ahrer et al. (2021), a 19 day signal is present in the ancillary indicators. We have seen in the case of the Sun (see Sect. 3.1) that the first harmonic of the rotation period exhibits a constant period and phase, but not a constant amplitude. Since the stellar rotation period is ~38 days, the 19 day signal could be another case of a first, stable harmonic of the rotation period.

thumbnail Fig. 11

FIP periodogram of HD 13808: − log10 FIP (in blue) and log10 TIP (in yellow) of the presence of a planet in a centred frequency interval [ω − Δω, ω + Δω] as a functionof ω. Here, Δω = 2πTobs, where Tobs is the total observation time span. Different colours correspond to independent computations with POLYCHORD. The plot inside the black box represents a zoom-in on the region close to 18.9 days.

thumbnail Fig. 12

HD 13808 dataset. Left and right: posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| < 1∕Tobs for i = 1, 2, 3 and P1 = 14.18 days, P2 = 19 days, and P3 = 53.8 days. The i = 1, 2, 3 correspondrespectively to the blue, orange, and green histograms. The black dotted line corresponds to τ = Tobs.

4 Discussion and conclusions

In the present work, our goal has been to assess whether an apparently periodic signal is strictly so, with a focus on the detection of planets in RV data. This can be done with models intended to capture all variations due to the planets and stellar activity (Haywood et al. 2014; Rajpaul et al. 2015; Jones et al. 2017; Hara et al. 2022). However, these models are not guaranteed to be trustworthy, which motivates the approach of the present work, whose methods offer a complementary outlook on the data.

Following Gregory (2016), we considered wavelet-like functions: periodic functions multiplied by an apodisation term. We suggested different representations and statistical tests to explore whether a signal is truly periodic, by constraining its timescale (see Sect. 2.1), its variation of phase and amplitude (see Sect. 2.2), and its frequency variation (see Sect. 2.3). We considered tests based on periodograms (see Sects. 2.1.1, 2.1.2, and 2.1.3) and on the Bayesian formalism (see Sect. 2.1.4). Periodograms provide interpretable figures and scan all the periods robustly and quickly, while their exploration might be difficult with random searches used for Bayesian inference. On the contrary, the latter method has the advantage of providing easily interpretable quantities. Depending on the objective and the size of the dataset, one of these two approaches might be more adapted.

The variability in the signal of interest can be assessed at different timescales with a typical Heisenberg uncertainty principle: the shorter the timescale of variation probed, the higher the uncertainty on the local estimates of phase, amplitude, and period. We recommend studying this variability on a grid of timescales chosen with an exponential decrease, as in wavelet decomposition (Mallat 1989).

Our tests were defined to assess the consistency with a certain data model. If other signals are present (trends, other periodic or quasi-periodic signals) or the shape of the signal examined is not sinusoidal – for instance, it corresponds to an eccentric planet – our indicators might present significant variations despite the signal being truly periodic. Asa consequence, for the results to be reliable it is crucial to model the signal of interest and other signals present simultaneously. The approach we suggest is either to model the data as a sum of apodised periodic components, as in Gregory (2016), or to assume that all periodic signals are truly periodic, except the one under consideration.

When assessing the phase and amplitude variability of a signal at frequency ω0, it is crucial to use all the data to estimate ω0 since the longest baseline gives the most precise estimate of ω0. An inaccurate estimate of ω0 might result in an apparent spurious linear phase shift with time.

We analysed the solar RV as measured by HARPS-N (Dumusque et al. 2021) as well as the HARPS data of HD 215152 (Delisle et al. 2018), HD 69830 (Lovis et al. 2006), and HD 13808 (Ahrer et al. 2021). The solar RV contains quasi-periodic signals at the first and second harmonic of the rotation period, the second harmonic having the larger amplitude of the two. The ~25–27 day region shows a varying period, phase, and amplitude. The second harmonic, appearing at 13.39 days, shows a period and phase constant within the uncertainties of statistical tests, showing that stellar activity signals can exhibit traits of purely periodic signals. Fortunately, the amplitude of the solar 13.39 day harmonic shows significant variations at the timescale of 121 days (total observation time span divided by nine). Such variability makes it less likely that such a stellar signal could be mistaken for a planet. These amplitude variations can be mapped to variations in the level of activity. This case shows that variations in amplitude, phase, and frequencies due to stellar activity can be very subtle and require a high signal-to-noise ratio to be statistically significant. These observations can speculatively be explained by the persistence of large active regions that are diametrically opposed, as suggested by Borgniet et al. (2015). A more rigorous testing of this hypothesis is left for future work.

In HD 13808, there is a potential low amplitude signal at 19 days, which also presents consistency in time. We therefore agree with the conclusion of Ahrer et al. (2021) that two planets can be confidently claimed. A planetary origin of the signal cannot, however, be excluded, which might motivate further observations.

The detection of four and three planets in HD 215152 and HD 69830 was claimed in Delisle et al. (2018) and Lovis et al. (2011), respectively. We find in both cases that all the planets exhibit constant phases and amplitudes, which strengthens these detection claims.

In the present work, we have suggested several methods to assess whether a signal is strictly periodic or not, but many others can be considered. The periodicity criteria and the techniques for simultaneous modelling of quasi-periodic signals, periodic signals, and noise and could both be explored in further detail. We leave this exploration and the comparison of the different criteria for future work.

Acknowledgements

The authors thank the anonymous referee for their very insightful comments, which helped to improve the paper. N.C.H thanks Michaël Cretigner for his input. N.C.H. and J.-B.D. acknowledge the financial support of the National Centre for Competence in Research PlanetS of the Swiss National Science Foundation (SNSF).

Appendix A Computations

A.1 Distribution of the difference of two χ2 variables

We consider two models, K(ω,t0,τ)$\mathcal{K}(\omega, t_0, \tau)$ and K(ω,t0 ,τ)$\mathcal{K}(\omega\prime, t_0\prime, \tau\prime)$, which we denote by K(ω,t0,τ):y=Ax+ϵ,ϵ~G(0,V)K(ω,t0 ,τ):y=Bx+ϵ,ϵ~G(0,V), \\begin{align*} \mathcal{K}(\omega, t_0, \tau): y = Ax + \epsilon, \; \epsilon \sim G(0,V) \\ \mathcal{K}(\omega\prime, t_0\prime, \tau\prime): y = Bx + \epsilon, \; \epsilon \sim G(0,V), \ \end{align*}

where G(0, V) is a multivariate Gaussian distribution of mean 0 and covariance V. Our aim is to compute the distribution of z(ω,tτ,ω,τ)z(ω,tτ,ω,τ)$z(\omega, t_{\tau,\omega}, \tau) - z(\omega, t_{\tau\prime,\omega}, \tau\prime) $ under the hypothesis that K(ω,t0,τ)$\mathcal{K}(\omega, t_0, \tau)$ is correct. The data are supposed to follow the model vA + ϵ, where vA = Axt.

We further denote PM=M(MTV1M)1MTV1$P_M = M (M^T V^{-1} M)^{-1} M^T V^{-1}$ and HM = V−1PM for a given matrix M. The residuals of the fit of models MA$\mathcal{M}_A$ and MB$\mathcal{M}_B$ are rA=(IPA)ϵrB=(IPB)(ϵ+vA). \begin{align*} r_A = & (I- P_A)\epsilon \\ r_B = &(I- P_B)(\epsilon +v_A). \end{align*}

We use the notations uA := (IPB)vA and V = LLT, where L is the Cholesky decomposition of V, η := L−1ϵ, and χA2=ηTLT(V1HA)LηχB2=ηTLT(V1HB)Lη+2uAV1(IPB)Lη+uATV1uA. \begin{align*} \chi^2_A &= \eta^T L^T(V^{-1} - H_A) L \eta \\ \chi^2_B & = \eta^T L^T(V^{-1} - H_B) L \eta + 2 u_AV^{-1}(I- P_B)L\eta +u_A^T V^{-1}u_A. \end{align*}

Then, the difference of periodogram values is Dz=z(ω,tτ,ω,τ)z(ω,tτ,ω,τ)=χA2χB2=(ηTDη+βTη+α), \begin{align*} D_z &= z(\omega, t_{\tau,\omega}, \tau) - z(\omega, t_{\tau\prime,\omega}, \tau\prime) =\chi^2_A \ - \chi^2_B \\ & = - (\eta^T D \eta + \beta ^T \eta + \alpha), \end{align*}

where D = HAHB, β = 2uAV−1(IPB)L, and α=uATV1uA$\alpha = u_A^T V^{-1}u_A$. Then, the expectancy and variance of Dz are nn\mathbbmE{Dz}=iDiiαnn\mathbbmV{Dz}=i,jDij2+iβi2.\begin{align} \mathbbm{E} \{D_z\} = - \sum_i D_{ii} - \alpha\\ \mathbbm{V} \{D_z\} = \sum_{i,j} D_{ij}^2 + \sum_i \beta_i^2. \end{align}

A.2 Independence of semi-amplitude and phase estimate

We supposed the data are y=y0+ϵ,\begin{align*} y = y_0 + \epsilon, \end{align*}(A.11)

where y0 is the model and ϵ is a random variable following a Gaussian distribution with null mean and covariance V. The two alternative linear models considered are: y = Mx + ϵ and y = Mx + ϵ. Thus, their corresponding least square estimates are xM:=(MTWM)1MTWy$x_M :=(M^TWM)^{-1} M^TW y$ and xM :=(MTWM)1MTWy,$x_{M\prime} :=({M\prime}^TW{M\prime})^{-1} {M\prime}^TW y,$ where W := V−1. Then, the covariance of xM and xM $x_{M\prime}$ is Cov(xM,xM )=(MTWM)1MTWM(MTWM)1.\begin{align*} Cov(x_M, x_{M\prime}) =(M^TWM)^{-1} M^T W {M\prime} ({M\prime}^TW{M\prime})^{-1}.\end{align*}(A.12)

We then address the case where M = [m, M0] and M′ = [m′, M0], where the brackets designate the concatenation of two matrices and m = [w(tt0)cos ωt, w(tt0)sin ωt], and m=[w(tt0 )cosωt,w(tt0 )sinωt],$m\prime = [ w(t-t_0\prime) \cos \omega t, w(t-t_0\prime) \sin \omega t],$ where w is a box-shaped time window, zero everywhere except between t0τ∕2 and t0 + τ∕2, and |t0t0 |>τ$|t_0 - t_0\prime|> \tau$. That is, fitting M and M′ corresponds to fitting models of type K$\mathcal{K}$ (see Eq. (2)) with different time centres t0.

If the noise is uncorrelated, V is diagonal and so is W. Since w(tt0)w(tt0 )=0,$ w(t-t_0) w(t-t_0\prime)=0,$ the coefficients of columns m and m′ from Eq. (A.12) are independent. If the noise is correlated, W is non-diagonal and w(tt0)TWw(tt0 ),$w(t-t_0)^T W w(t-t_0\prime),$ where w(t) is the column vector (w(ti))i=1..N.$(w(t_i))_{i=1..N}. $

Appendix B FAP calculation

Here, we denote by the generic symbol Z(Y, θ) a function of the data Y that is either z, as defined in Eq. (6), or a fixed T, ω, t0z(ω, t0, T). The FAP is defined as FAP=Pr{maxθZ(Y,θ)Zmax|Y~H},\begin{align*} \mathrm{FAP} = \mathrm{Pr} \{ \max _{\theta} Z(Y,\theta) \geqslant Z_{\mathrm{max}} | Y \sim \mathcal{H}\}, \end{align*}(B.1)

where Zmax is the observed maximum of the periodogram on the data to be analysed and H$\mathcal{H}$ is defined in Eq. (1). The FAP can be computed using simulations that generate data following model ~H$\sim \mathcal{H}$ and compute the empirical distribution of maxθZ(Y,θ)$\max_{\theta} Z(Y,\theta)$ or simulations that generate Y with random permutations of the input data. The value of θH$\theta_{\mathcal{H}}$ can be fixed to zero without loss of generality. This procedure can be very computationally heavy to reach low FAPs. The ASP is the special case of non-linear ‘Von Mises’ periodograms defined in Baluev (2013b), for which analytical approximations of the FAP can be derived. In that case, the specific form of the apodisation function must be specified. We here adopted the approach of Süveges (2014), which consists in generating data as in the simulation approach, but then fitting a generalised extreme distribution (GEV) to the generated samples. The analytical form of a GEV distribution is p(z;μ,σ,ξ)=1σt(z)ξ+1et(z),\begin{align*} p(z; \mu, \sigma, \xi) = \frac{1}{\sigma} t(z)^{\xi +1} \textrm{e}^{-t(z)}, \end{align*}(B.2)

where t(z)={(1+ξzμσ)1ξ if ξ0ezμσ if ξ=0 . \begin{align*} t(z) = \left\{ \begin{array}{ll} \left(1 + \xi \frac{z - \mu}{\sigma} \right)^{-\frac{1}{\xi}} \text{ if } \xi \neq 0\\ \textrm{e}^{-\frac{z - \mu}{\sigma} } \text{ if } \xi = 0 \end{array}. \right. \end{align*}(B.3)

The parameters μ, σ, and ξ can be estimated with a maximum likelihood estimate, and the uncertainties obtained from the inverse of the Fisher information matrix can be propagated to p(z;μ, σ, ξ) for a certain value of z to ensure that the computed FAP is sufficiently accurate.

This approach allows the number of simulations needed to estimate the low FAP levels to be drastically reduced. To illustrate this, we show in Fig. B.1 the histogram of 12,000 maxima of periodograms as well as the distribution (blue) and the fit of the 200 first simulations made with a Nelder-Mead algorithm (in orange). The simulation is made with the 70 first measurements of HD 85512 (Pepe et al. 2011).

thumbnail Fig. B.1

Empirical distribution of the maximum of periodograms generated.

References

  1. Ahrer, E., Queloz, D., Rajpaul, V. M., et al. 2021, MNRAS, 503, 1248 [NASA ADS] [CrossRef] [Google Scholar]
  2. Aigrain, S., Pont, F., & Zucker, S. 2012, MNRAS, 419, 3147 [NASA ADS] [CrossRef] [Google Scholar]
  3. Baluev, R. V. 2008, MNRAS, 385, 1279 [Google Scholar]
  4. Baluev, R. V. 2013, MNRAS, 436, 807 [NASA ADS] [CrossRef] [Google Scholar]
  5. Baluev, R. V. 2013a, Astron. Comput., 3, 50 [Google Scholar]
  6. Baluev, R. V. 2013b, MNRAS, 431, 1167 [NASA ADS] [CrossRef] [Google Scholar]
  7. Baluev, R. V. 2015, MNRAS, 446, 1478 [NASA ADS] [CrossRef] [Google Scholar]
  8. Berdyugina, S. V., & Usoskin, I. G. 2003, A&A, 405, 1121 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Boisse, I., Bouchy, F., Hébrard, G., et al. 2011, A&A, 528, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  10. Borgniet, S., Meunier, N., & Lagrange, A. M. 2015, A&A, 581, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Collier Cameron, A., Mortier, A., Phillips, D., et al. 2019, MNRAS, 487, 1082 [Google Scholar]
  12. Cumming, A., Marcy, G. W., & Butler, R. P. 1999, ApJ, 526, 890 [Google Scholar]
  13. Daugman, J. G. 1998, Information Theory and Coding [Google Scholar]
  14. de Jager O. C., 1994, ApJ, 436, 239 [NASA ADS] [CrossRef] [Google Scholar]
  15. Delisle, J.-B., Ségransan, D., Dumusque, X., et al. 2018, A&A, 614, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Delisle, J. B., Hara, N., & Ségransan, D. 2020a, A&A, 635, A83 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Delisle, J. B., Hara, N., & Ségransan, D. 2020b, A&A, 638, A95 [EDP Sciences] [Google Scholar]
  18. Dumusque, X., Boisse, I., & Santos, N. C. 2014, ApJ, 796, 132 [NASA ADS] [CrossRef] [Google Scholar]
  19. Dumusque, X., Pepe, F., Lovis, C., & Latham, D. W. 2015, ApJ, 808, 171 [NASA ADS] [CrossRef] [Google Scholar]
  20. Dumusque, X., Cretignier, M., Sosnowska, D., et al. 2021, A&A, 648, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Faria, J. P., Haywood, R. D., Brewer, B. J., et al. 2016, A&A, 588, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  22. Feng, F., Tuomi, M., & Jones, H. R. A. 2017, MNRAS, 470, 4794 [NASA ADS] [CrossRef] [Google Scholar]
  23. Ferraz-Mello, S. 1981, AJ, 86, 619 [NASA ADS] [CrossRef] [Google Scholar]
  24. Foster, G. 1996, AJ, 112, 1709 [NASA ADS] [CrossRef] [Google Scholar]
  25. Gilbertson, C., Ford, E. B., Jones, D. E., & Stenning, D. C. 2020, ApJ, 905, 155 [Google Scholar]
  26. Gregory, P. C. 2007a, MNRAS, 374, 1321 [NASA ADS] [CrossRef] [Google Scholar]
  27. Gregory, P. C. 2007b, MNRAS, 381, 1607 [NASA ADS] [CrossRef] [Google Scholar]
  28. Gregory, P. C. 2016, MNRAS, 458, 2604 [NASA ADS] [CrossRef] [Google Scholar]
  29. Grossmann, A., & Morlet, J. 1984, SIAM J. Math. Anal., 15, 723 [Google Scholar]
  30. Handley, W. J., Hobson, M. P., & Lasenby, A. N. 2015a, MNRAS, 450, L61 [NASA ADS] [CrossRef] [Google Scholar]
  31. Handley, W. J., Hobson, M. P., & Lasenby, A. N. 2015b, MNRAS, 453, 4384 [Google Scholar]
  32. Hara, N. C., Unger, N., Delisle, J.-B., Díaz, R., & Ségransan, D. 2022, A&A, in press, https://doi.org/10.1051/0004-6361/202140543 [Google Scholar]
  33. Haywood, R. D., Collier Cameron, A., Queloz, D., et al. 2014, MNRAS, 443, 2517 [Google Scholar]
  34. Jones, D. E., Stenning, D. C., Ford, E. B., et al. 2017, ArXiv e-prints [arXiv:1711.01318] [Google Scholar]
  35. Kass, R. E., & Raftery, A. E. 1995, J. Am. Stat. Assoc., 90, 773 [Google Scholar]
  36. Kipping, D. M. 2014, MNRAS, 444, 2263 [NASA ADS] [CrossRef] [Google Scholar]
  37. Lomb, N. R. 1976, Ap&SS, 39, 447 [Google Scholar]
  38. Lovis, C., Mayor, M., Pepe, F., et al. 2006, Nature, 441, 305 [NASA ADS] [CrossRef] [Google Scholar]
  39. Lovis, C., Ségransan, D., Mayor, M., et al. 2011, A&A, 528, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Madhow, U. 2008, Demodulation (Cambridge University Press), 74 [Google Scholar]
  41. Mallat, S. G. 1989, IEEE Trans. Pattern Anal. Mach. Intell., 11, 674 [NASA ADS] [CrossRef] [Google Scholar]
  42. Mayor, M., Pepe, F., Queloz, D., et al. 2003, The Messenger, 114, 20 [NASA ADS] [Google Scholar]
  43. Meunier, N., & Lagrange, A. M. 2013, A&A, 551, A101 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Meunier, N., Desort, M., & Lagrange, A.-M. 2010, A&A, 512, A39 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Milbourne, T. W., Haywood, R. D., Phillips, D. F., et al. 2019, ApJ, 874, 107 [Google Scholar]
  46. Mortier, A., & Collier Cameron, A. 2017, A&A, 601, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Mortier, A., Faria, J. P., Correia, C. M., Santerne, A., & Santos, N. C. 2015, A&A, 573, A101 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Nava, C., López-Morales, M., Haywood, R. D., & Giles, H. A. C. 2020, AJ, 159, 23 [Google Scholar]
  49. Noyes, R. W. 1984, in Space Research in Stellar Activity and Variability, eds. A. Mangeney, & F. Praderie, 113 [Google Scholar]
  50. Pepe, F., Lovis, C., Ségransan, D., et al. 2011, A&A, 534, A58 [CrossRef] [EDP Sciences] [Google Scholar]
  51. Perryman, M. 2011, The Exoplanet Handbook (Cambridge University Press) [Google Scholar]
  52. Queloz, D., Henry, G. W., Sivan, J. P., et al. 2001, A&A, 379, 279 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Rajpaul, V., Aigrain, S., Osborne, M. A., Reece, S., & Roberts, S. J. 2015, MNRAS, 452, 2269 [NASA ADS] [CrossRef] [Google Scholar]
  54. Reegen, P. 2007, A&A, 467, 1353 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  55. Saar, S. H., & Donahue, R. A. 1997, ApJ, 485, 319 [Google Scholar]
  56. Scargle, J. D. 1982, ApJ, 263, 835 [Google Scholar]
  57. Schuster, A. 1898, Terrestrial Magnet., 3, 13 [NASA ADS] [CrossRef] [Google Scholar]
  58. Schwarz, G. 1978, Ann. Statist., 6, 461 [Google Scholar]
  59. Süveges, M. 2014, MNRAS, 440, 2099 [CrossRef] [Google Scholar]
  60. Süveges, M., & Anderson, R. I. 2018a, MNRAS, 478, 1425 [CrossRef] [Google Scholar]
  61. Süveges, M., & Anderson, R. I. 2018b, A&A, 610, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  62. Tuomi, M. 2012, A&A, 543, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  63. Zechmeister, M., & Kürster, M. 2009, A&A, 496, 577 [CrossRef] [EDP Sciences] [Google Scholar]

1

Data can be downloaded from https://dace.unige.ch/sun/?

All Tables

Table 2

Priors used for the computation of the FIP periodogram of HD 69830 and HD 13808.

All Figures

thumbnail Fig. 1

ASPs computed iteratively on the solar data. Left column: ASPs corresponding to Eq. (7) for different values of τ. Middle column: zoomed-in view of the maximum peak of the ASP. Right: statistical test on the timescale described in Sect. 2.1.3. On the x axis we represent the ratio of τ to the total observation time assumed to be true. For each τi, i = 1…4 in abscissa, the position of the markers corresponding to τj, j = 1…4 is represented as the value of the periodogram peak for τi minus the expectancy of Dz=z(ω,tτ,ω,τi)z(ω,tτj,ω,τj)$D_z = z(\omega, t_{\tau,\omega}, \tau_i) - z(\omega, t_{\tau_j,\omega}, \tau_j)$, and the error bar is the square root of the variance of Dz. Informally, the markers represent what the periodogram values at τj should be if τi in abscissa were the true timescale.

In the text
thumbnail Fig. 2

Models corresponding to the maximum of the ASPs in Fig. 1.

In the text
thumbnail Fig. 3

Consistency of the period of signals in the solar data. (a and b): value of the χ2 defined in Eq. (6) for the solar HARPS-N RV data as a function of t0 for τ = 365 (=Tobs ∕3) days (a) and τ = 121 (=Tobs ∕9) days (b). (c and d): value of the semi-amplitude as a function of t0 for τ = 365 (=Tobs ∕3) days (c) and τ = 121 (=Tobs ∕9) days (b). Grey areas correspond to 1 σ uncertainties greater than 30 cm s−1. (e and f): in blue, position of the mode of the ASP between frequencies represented in grey as a function of t0, as defined in Sect. 2.3. Computed for τ = 365 (=Tobs ∕3) days (c) and τ = 121 (=Tobs ∕9) days (d).

In the text
thumbnail Fig. 4

Phase and amplitude of the 13.39 day signal as a function of the time centre of the window t0 for the solar HARPS-N data, as described in Sect. 2.2 (solid red and green lines, respectively). Markers correspond to statistically independent estimates. Dashed red and green lines represent the average phase and amplitudes. The figures are all computed for a timescale of τ = 539 days, corresponding to τTobs = 1/9. Plots (a), (b), (c), and (d) are computed by adding iteratively the signals corresponding to the highest peaks to the base model.

In the text
thumbnail Fig. 5

Value of the χ2 defined in Eq. (6) for the solar HARPS-N logRHK$\log R\prime_{\textrm{HK}}$ as a functionof t0 for τ121 (=Tobs ∕9) days.

In the text
thumbnail Fig. 6

Difference between the RV phase and logRHK$\log R\prime_{\textrm{HK}}$ phase as a function of time for τ = Tobs ∕3.

In the text
thumbnail Fig. 7

HD 215152 HARPS radial velocities. Data taken before and after the fibre update are represented in blue and red, respectively.

In the text
thumbnail Fig. 8

ASPs computed on the HD 215152 HARPS data. All are computed with a base model that includes a linear activity model and offsets, as described in Sect. 3.2. They also include three of the four planets. Rows (a), (b), (c), and (d) correspond respectively to leaving out planets at 5.75, 7.28, 10.86, and 25.20 days. Left column: ASPs corresponding to Eq. (7) for different values of τ (blue, orange, green, and red correspond to τTobs = 10, 1/3, 1/9, and 1/27). Middle column: zoom-in on the maximum peak of the ASP. Right: statistical test on the timescale. When subtracting the 46 day signal with τ = Tobs∕3 (period at which the maximum of the ASP is attained in (d)), we obtain (e) (on the following page), where a coherent 25.2 d signal appears.

In the text
thumbnail Fig. 9

Phase and amplitude as a function of the time centre of the window t0, as described in Sect. 2.2 (solid red and green lines, respectively). Markers correspond to statistically independent estimates. Dashed red and green lines represent the average phase and amplitudes. The figures are all computed for a timescale of τ = 539 days, corresponding to τTobs = 1∕9. Plots (a), (b), (c), and (d) correspond to base models that include offsets, linear activity indicators, and all the claimed planets except the 5, 7, 10, and 25 day planets, respectively.

In the text
thumbnail Fig. 10

Diagnostics of the consistency of prensence of signals in the HD 69830 HARPS dataset. Left and right: posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| = 1∕Tobs for i = 1, 2, 3 and P1 = 8.66 days, P2 = 31.56 days, and P3 = 197 days. The i = 1, 2, 3 correspondrespectively to the blue, orange, and green histograms. The black dotted line corresponds to τ = Tobs.

In the text
thumbnail Fig. 11

FIP periodogram of HD 13808: − log10 FIP (in blue) and log10 TIP (in yellow) of the presence of a planet in a centred frequency interval [ω − Δω, ω + Δω] as a functionof ω. Here, Δω = 2πTobs, where Tobs is the total observation time span. Different colours correspond to independent computations with POLYCHORD. The plot inside the black box represents a zoom-in on the region close to 18.9 days.

In the text
thumbnail Fig. 12

HD 13808 dataset. Left and right: posterior distribution of τ (defined in Eq. (16)) and f (defined in Eq. (8)) conditioned on the period P being such that |1∕P − 1∕Pi| < 1∕Tobs for i = 1, 2, 3 and P1 = 14.18 days, P2 = 19 days, and P3 = 53.8 days. The i = 1, 2, 3 correspondrespectively to the blue, orange, and green histograms. The black dotted line corresponds to τ = Tobs.

In the text
thumbnail Fig. B.1

Empirical distribution of the maximum of periodograms generated.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.