Free Access
Issue
A&A
Volume 527, March 2011
Article Number A136
Number of page(s) 14
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/201015454
Published online 11 February 2011

© ESO, 2011

1. Introduction

One of the most common methods to search for periodicity in an astronomical time series is the Lomb-Scargle periodogram (Lomb 1976; Scargle 1982). The method is, however, not suitable for analysing data where the periodic variations do not fit to a simple single harmonic model. A more general period analysis method, the three stage period analysis (TSPA), was formulated by Jetsu & Pelt (1999, hereafter Paper I). This method utilises a higher order harmonic model: a Kth order Fourier series. It can thus model more accurately periodic data with a complex form. The main motivation in developing the TSPA was to formulate a suitable method for analysing photometry of active stars.

The TSPA method can still be improved. In this paper we discuss and implement three such improvements. First of all, TSPA is not flexible enough for studying fast evolution of the light curve, as it divides the photometry into separate datasets that do not overlap. A better time resolution can be achieved by choosing the datasets with a sliding window and allowing the adjacent datasets to overlap. This produces a sequence of analysis results akin to video view in contrast to the set of snapshots provided by the TSPA.

The second improvement is to test several models of different order K, instead of just one, as possible descriptions of the data. The best model can then be selected with a suitable criterion. By including a constant brightness model into the set of tested possible models, we can investigate whether a periodic function is an appropriate description of the data at all. This is of particular interest when analysing low amplitude stellar light curves.

As the third improvement to the TSPA, we define the parameter TC as the approximate time scale in which the shape of the analysed light curve undergoes significant changes. It characterises the time interval after which any particular light curve model does not adequately describe the subsequent light curves any more. This parameter is, however, only an approximation, as it depends also on the quality and quantity of the photometric data. Nevertheless, when analysing photometry of active stars, the parameter may tell us how fast the spot distribution on the visible stellar surface is changing.

We formulate the new continuous period search (CPS) method in Sect. 2 by incorporating the above three improvements to the TSPA, and use simulated data to test the performance of the CPS in Sect. 3. Finally, we apply the CPS to real photometric observations of the spot activity on the surface of the nearby young solar analogue HD 116956 (NQ UMa) in Sect. 4.

HD 116956 has a spectral type of G9V, an effective temperature of Teff = 5170 K, a projected rotation velocity of vsini = 5.6 km s-1\hbox{${}^{-1}$} and a lithium abundance of log N(Li) = 1.42    ±    0.12 (Gaidos 1998). From its space motion, Gaidos et al. (2000) identified the star as a local association member. This would suggest an age between 20 and 150 Myr (Montes et al. 2001). On the other hand, Gaidos et al. (2000) noted that none of the Local Association stars in their sample display characteristics of such a young age and rather suggest that they have similar ages to the Ursa Major group stars (300 Myr). Nevertheless, HD 116956 must be considerably younger than the Sun, as also indicated by its lithium content. The radial velocity of HD 116956 is constant  − 12.3 km s-1\hbox{${}^{-1}$}, i.e. it seems to be a single star (Gaidos et al. 2000).

HD 116956 displays numerous signs of magnetic activity, such as rotational modulation of its brightness caused by large spots on its surface. The amplitude of this variability is typically 0.05 mag. Gaidos et al. (2000) reported a preliminary rotation period estimate of P = 7.80    ±    0.02 days. Another activity indicator is the logarithmic ratio of X-ray and bolometric luminosities RX =  −4.48, which indicates that the star has an active corona (Gaidos 1998). Finally, the logarithmic ratio of Ca II emission and bolometric luminosity, RHK=4.447\hbox{$R'_{\rm HK}=-4.447$}, means that HD 116956 is chromospherically active (Gray et al. 2003).

2. The CPS method

The light curves of active stars can undergo rapid changes, but may as well remain stable over several rotations. These light curves have usually been modelled with one simple sinusoid for the whole data or for fixed parts of it. In reality, the time span of observations required for reliable light curve modelling cannot be determined uniquely, because it is certainly not constant. Nor does the best model for the light curve remain the same, e.g. periodicity can vanish and another periodicity can reappear at some later epoch. This means that the selection of the slices of data, hereafter called datasets, must be flexible and the complexity of the model must adapt to the light curve changes. The CPS method utilises overlapping datasets, identifies the best model for each of these and gives a quantitative estimate for the temporal stability of these models. Although the CPS is applied here only to simulated and real ground based photometry of active stars, this method can be easily adjusted to search for periodicity in any type of unevenly spaced data.

2.1. Datasets and segments

The input data for the CPS method are observations yi = y(ti) at time points ti with observational errors σi. Before any modelling is done, the data must be divided into short datasets (SET), each of which is modelled individually. The length of these datasets is ΔT = tn − t1, which we limit to have the maximum length of ΔTmax={\begin{eqnarray} \Delta T_{\rm max} \!=\! \left\{ {\begin{array}{rl} \Delta T_1 \ , & P_0 < \frac{1}{2}\Delta T_1 \\ 2P_0 \ , & \frac{1}{2}\Delta T_1 \leq P_0 \leq \frac{1}{2}\Delta T_2 \\ \Delta T_2 \ , & P_0 > \frac{1}{2}\Delta T_2, \end{array}} \right. \label{deltatmax} \end{eqnarray}(1)where P0 is the a priori estimate for the photometric rotation period. The lower limit ΔT1 ensures that most of the datasets contain enough observations for modelling. Datasets with n < 10 are not analysed with the CPS. The upper limit ΔT2 is necessary to prevent significant changes in the light curve during a single dataset for stars with longer rotation periods. In other words, the parameter ΔTmax is applied to limit the length ΔT = tn − t1 of the analysed datasets so that they yield a consistent light curve model but still have enough observations for reliable modelling. In practice, suitable values for ΔT1 and ΔT2 are 25 d and 200 d respectively, when analysing ground based photometry of active stars. For different kinds of data, the values of ΔT1 and ΔT2 must be determined separately.

Unlike in the TSPA, the datasets are allowed to overlap in the CPS. This gives a better time resolution. In typical ground based observations, we begin a new dataset at the first time point t1 of each night and include all data points yi = y(ti) within t1 ≤ ti ≤ t1 + ΔTmax. A candidate for the next consecutive dataset is chosen by moving the window one night forward. We require that the observations of two consecutive datasets fulfill the condition SETkSETk+1andSETk+1SETk.\begin{equation} {\rm SET}_k \not\subset {\rm SET}_{k+1} \ \mathrm{and} \ {\rm SET}_{k+1} \not\subset {\rm SET}_k. \end{equation}(2)In other words, both datasets must contain at least one observation that does not belong to the other one. If this condition is violated, the candidate dataset SETk + 1 is rejected. In this case, the dataset window is moved one night further. If there is much data during each night and P0 is short, a denser selection of datasets might be desirable. In this case, one may even start a new dataset at each new observation. This kind of a much denser dataset selection rule would be ideal for satellite photometry.

Observations that would be included only in datasets having n < 10 are of little use for modelling with the CPS, as for them n approaches the number of free parameters of the model. Such observations are rejected as temporally isolated data points.

The data are further divided into longer segments (SEG) whenever there is a gap longer than ΔTmax in the observations. Such gaps may have contained rejected isolated data points. These segments are defined solely for the purpose of identifying datasets belonging to different observing seasons.

The benefit of letting the subsets overlap is a better time resolution compared to the TSPA, where the datasets were separated from each other. As a drawback, the modelling results from two overlapping datasets are correlated with each other. One can easily eliminate this correlation by comparing only the non-overlapping datasets. The alternatives in selecting these non-overlapping (hereafter called the independent) datasets are numerous. The most simple alternative is to select the independent datasets by beginning from the first dataset of each segment and then selecting the next independent dataset with the criterion that it does not have any common data points with the preceding independent dataset.

2.2. Modelling of the observations

The TSPA model (ti)=(ti,β̅)=M+k=1K[Bkcos(k2πfti)+Cksin(k2πfti)],\begin{equation} \hat{y}(t_i) = \hat{y}(t_i,\bar{\beta}) = M + \sum_{k=1}^K{[B_k\cos{(k2\pi ft_i)} + C_k\sin{(k2\pi ft_i)}]}, \label{model} \end{equation}(3)i.e. a Kth order Fourier series, is also used in the CPS. This model has an order K and 2K + 2 free parameters, \hbox{$\bar{\beta}=[M,B_1,\ldots,B_K,C_1,\ldots,C_K,f]$}. In other words, the free parameters are the mean M, the individual cosine and sine amplitudes Bk and Ck and the frequency f.

In the TSPA, the modelling proceeded through three stages: the pilot search, the grid search and the refined search. In the CPS, the pilot search, where the initial period estimates were identified within a broad frequency interval (Paper I, Sect. 3.1), is not used. Rather, the grid search (Paper I, Sect. 3.2) is performed straight away. In the grid search a dense grid of frequencies is tested one at the time by fitting the model (Eq. (3)) to the observations. When the frequency f is fixed, the model becomes linear and the remaining free parameters  [M,B1,...,BK,C1,...,CK]  of the model have unique solutions. The grid search is performed within the period range of (1q)P0P(1+q)P0,\begin{equation} (1-q)P_0 \leq P \leq (1+q)P_0, \label{gridpp} \end{equation}(4)where P0 is the a priori rotation period estimate of Eq. (1) and q regulates the interval of the period search. We use a value of q = 0.15. This is usually sufficient, since for many active stars there already exist period determinations in the literature and the real period changes in photometry can be expected to fall inside the  ± 15% range of P0. If necessary, this tested period range can be expanded. If no P0 is available, one can be obtained, for example, with the TSPA.

The final model parameters \hbox{$\bar{\beta}$} are obtained from the refined search (Paper I, Sect. 3.3), which consists of a standard nonlinear Marquardt iteration that minimises χ2 with weights, χ2(,β̅)=i=1nwiϵi2,\begin{equation} \chi^2(\bar{y},\bar{\beta}) = \sum_{i=1}^n{w_i\epsilon_i^2}, \label{chi2} \end{equation}(5)where wi=σi-2\hbox{$w_i=\sigma_i^{-2}$} are the weights and \hbox{$\epsilon_i=y_i-\hat{y}(t_i,\bar{\beta})$} are the residuals. The refined search can find the correct period even if it is outside but close to the grid search range (Eq. (4)).

Before modelling, the data is preprocessed by removing outliers. These outliers are determined using a preliminary model of order K′. We set this preliminary modelling order equal to the highest model order used in the actual modelling of the data, K′ = Klim. The preliminary model ŷ′(t) gives the residuals ϵi=yi(ti)\hbox{$\epsilon_i' = y_i-\hat{y}'(t_i)$}. For each dataset, the observations having residuals larger than three times the standard deviation of all residuals \hbox{$\bar{\epsilon}'$} are removed as outliers. For our data, we found that there are typically only a few such outliers among several hundreds of observations.

The problem of determining the order K of the Fourier model (Eq. (3)) is crucial to the light curve modelling procedure. The value of K must be high enough to allow a good fit to the observed light curve but not too high to result in overfitting to the data. In the TSPA, the order K was selected beforehand and the same value was used for all datasets. In contrast, the CPS uses a Bayesian information criterion to select the best K value separately for each dataset. This criterion consists of minimising a type of χ2 with an additional penalty term for the extra degrees of freedom introduced by a higher order K.

Before testing for the optimal K for each dataset, the upper limit Klim for the highest accepted modelling order must be selected. Since the smooth light curves of spotted stars usually display only one or two minima, not very high values of Klim are necessary. For more complicated light curve shapes, such as those of eclipsing binaries, higher Klim values should be used to achieve a satisfactory fit. In practice, the value of Klim = 2 is sufficient for analysing ground based photometry of most spotted stars.

The problem of determining K is equivalent to finding the number of sinusoids embedded in the data. This number is just the order K of the model. A solution for this particular problem was presented by Stoica & Selén (2004), who studied the applicability of different model order selection rules. They argued that the best performance is achieved with the Bayesian information criterion. It has the property that it always finds the correct K as the number of data points n increases to infinity.

The Bayesian information criterion is given by RBIC=2nlnλ(,β̅)+(5K+1)lnn,\begin{equation} R_{\rm BIC} = 2n\ln{\lambda(\bar{y},\bar{\beta})} + (5K+1)\ln{n}, \label{bic} \end{equation}(6)where the first term gives the logarithmic likelihood with λ(,β̅)=χ2(,β̅)i=1n[wi]-1\hbox{$\lambda(\bar{y},\bar{\beta}) = \chi^2(\bar{y},\bar{\beta})\left[\sum_{i=1}^n{w_i}\right]^{-1}$}. The criterion is used by first modelling the dataset separately with all models between K = 0 and K = Klim and then calculating RBIC for each of these models. The model with K = 0 (i.e. constant) is obtained simply from the weighted mean my of all data points yi in the dataset, i.e. ŷ(ti) = M = my. The model for the dataset that minimises RBIC has the optimal order K and is chosen as the best model for the dataset.

The function of the two terms in Eq. (6) are the following. The first term describes the goodness of the fit to the data. It is naturally smaller for higher order models, as they allow a more detailed fit. The second term, (5K + 1)lnn, is a penalty term which grows linearly as a function of K. It balances the smaller values of of the first term \hbox{$2n\ln{\lambda(\bar{y},\bar{\beta})}$} at a high K and thus prevents overfitting. Because of this second term there will be an optimal K where RBIC reaches its minimum value and then starts to grow again as K increases.

Most of the free parameters of the model, \hbox{$\bar{\beta} = [M,B_1,\ldots,B_K,C_1,\ldots,C_K,f]$}, are not very useful as such. The physically meaningful parameters are the mean magnitude M and the total amplitude A of the light curve, the period P = f-1, as well as the epochs tmin,   1 and tmin,   2 of the primary and secondary minima of the light curve. The mean M and the period P = f-1 are obtained directly from the free parameters \hbox{$\bar{\beta}$} of the model, but the values of A, tmin,   1 and tmin,   2 must be determined numerically from the light curve model. Note that only the models with K ≥ 2 can provide all of these parameters. For K = 1, the secondary minimum tmin,   2 does not exist and the only parameter that can be obtained for K = 0 is the mean M = my.

The error estimates and reliabilities of these parameters are determined as in the TSPA (Paper I, Sects. 4 and 6.3). This process consists of running a bootstrap with 200 rounds for the residuals \hbox{$\bar{\epsilon}$} and all model parameters. The errors of the parameters are the standard deviations of the 200 bootstrap estimates. All these bootstrap distributions are tested against the Gaussian hypothesis

HG: \hbox{$\bar{x}$}represents a random sample drawn from a Gaussian distribution,

where \hbox{$\bar{x}$} denotes the residuals or the bootstrap distributions of any of the model parameters. If any of these distributions fails to satisfy HG, all of the model parameters are considered unreliable. The Gaussian hypothesis HG is tested using the Kolmogorov-Smirnov test with preassigned significance level γ = 0.01 for rejection. Also, if the secondary minimum tmin,   2 is present in less than 95% of the bootstrap samples, it is considered unreliable.

2.3. Time scale of change

The modelling gives the mean M(τ), total amplitude A(τ), period P(τ) and the minimum epochs tmin,   1(τ) and tmin,   2(τ) of the light curve, where τ is the mean of all observing times ti in the current dataset. As the shape of the light curve usually evolves with time, the model must also evolve. It is of interest to investigate for how long an unchanged model for a specific dataset still reasonably well fits with the observations of the subsequent datasets. We estimate the time scale of change TC for each dataset ι by determining for how many of the following datasets ι + κ the model of dataset ι still fits the data.

The dataset ι contains the observations \hbox{$\bar{y}_{\iota}$} which yield a model \hbox{$\hat{y}_{\iota}(\bar{t}_{\iota})$}. Another dataset ι + κ at a later epoch contains the data \hbox{$\bar{y}_{\iota+\kappa}$}. Assuming that the model \hbox{$\hat{y}_{\iota}(\bar{t}_{\iota})$} is valid for both datasets, we define the residuals ϵ̅ι=ιι(ι)\begin{equation} \bar{\epsilon}_{\iota} = \bar{y}_{\iota} - \hat{y}_{\iota}(\bar{t}_{\iota}) \end{equation}(7)and ϵ̅ι,κ=ι+κι(ι+κ).\begin{equation} \bar{\epsilon}_{\iota,\kappa} = \bar{y}_{\iota+\kappa} - \hat{y}_{\iota}(\bar{t}_{\iota+\kappa}). \end{equation}(8)If the light curve has not changed between datasets ι and ι + κ, the residuals \hbox{$\bar{\epsilon}_{\iota}$} and \hbox{$\bar{\epsilon}_{\iota,\kappa}$} should have similar distributions, i.e. they should satisfy the hypothesis

HK2: \hbox{$\bar{\epsilon}_{\iota,\kappa}$}and \hbox{$\bar{\epsilon}_\iota$} represent random samples drawn from the same distribution.

HK2 is tested using the two-sided Kolmogorov-Smirnov test. The preassigned significance level for rejecting HK2 is γ = 0.01. The time scale of change is not determined for datasets where the model parameters have been found unreliable. This means that we already know that the residuals \hbox{$\bar{\epsilon}_\iota$} follow a Gaussian distribution. Thus the distribution of \hbox{$\bar{\epsilon}_{\iota,\kappa}$} should also resemble a Gaussian distribution, if HK2 is not rejected.

The computation of the time scale of change for the model of dataset ι starts from the dataset ι + κ, where κ = 1. If the sets of residuals \hbox{$\bar{\epsilon}_{\iota}$} and \hbox{$\bar{\epsilon}_{\iota,\kappa=1}$} for the two datasets pass the test for HK2, the comparison of residuals proceeds to the next dataset, i.e. κ → κ + 1, and the same test is applied again. Finally, when this test fails, the comparison process is terminated. In this case, the time difference between the dataset ι and the last dataset passing the test, ι − κ − 1, is taken as the time scale of change for the subset ι, i.e. TC(τι)=τι+κ1τι.\begin{equation} T_{\rm C}(\tau_{\iota})=\tau_{\iota+\kappa-1}-\tau_{\iota}. \label{tc} \end{equation}(9)This computation of TC only proceeds until the end of each segment. No datasets from other segments are compared, because there is no data confirming or refuting the model during the gap between the two segments and this might introduce a bias into the TC estimates. If the computation process hits the end of the segment, a special value TC =  −2 is given for the correlation time scale. This denotes the case that this particular model describes adequately all the datasets following it within the segment.

3. Testing the method

Before analysing real stellar photometry, we used simulated data to determine the performance of the CPS method in certain critical situations. This allowed us to get more insight in understanding some of the results of the analysis. Failing to do this can lead to a biased interpretation of the results.

Since the distribution of the observing times can have a profound effect to the results of the analysis, we generated our simulated test data using real observing times of the star HD 116956. Thus, any spurious effects introduced by the observing times alone should also be present in the analysis of both the simulated data and the real data of HD 116956 (Sect. 4). Because other active stars are observed in a similar manner, the results of this section are applicable also when analysing them.

The observing times ti used for generating the simulated test data were taken from the first 152 observations of the time series of HD 116956. These form a complete unbroken observing season and are temporally fairly evenly distributed. To get a longer time series for the simulations, we appended a duplicated set of ti + t152 − t1 to the original set of ti and removed the double occurrence of t152. This resulted in a set of 303 observation times with a complete time span of 377.43 d.

When simulating the periodic test data, we used a period of P0 = 7.8 d. This imitated the a priori period estimate for HD 116956. With the chosen observing times and this particular period, our test analyses each yielded a total of 184 datasets with ΔTmax = 25 d, where the number of simulated observations in each dataset was in the range 12 ≤ n ≤ 28.

3.1. Sensitivity to changes in the model order

As already stated in Sect. 2.2, the Bayesian information criterion (Eq. (6)) should determine the correct modelling order with a very high probability when the number of observations is large. However, when analysing photometry of active stars, this number of observations is often quite limited. Typically we have 10 ≤ n ≤ 20 in a dataset. This scarcity of data can potentially affect the performance of the model order selection.

To test the performance of the model order selection, we analysed three different simulated time series. These consisted of one set of constant data and two sets of sinusoidal data each with random Gaussian noise yK=0(ti)=ϵN,iyK=1(ti)=0.01sin(2πfti)+ϵN,iyK=2(ti)=\begin{eqnarray} y_{K=0}(t_i) &=& \epsilon_{{\rm N},i} \nonumber\\ y_{K=1}(t_i) &=& 0.01\sin{(2\pi ft_i)} + \epsilon_{{\rm N},i} \nonumber\\ y_{K=2}(t_i) &=& 0.01\sin{(4\pi ft_i)} + \epsilon_{{\rm N},i}. \label{genmod} \end{eqnarray}(10)Here f-1 = P0 = 7.8 d and ϵN,i denotes Gaussian noise distribution having a zero mean and a variance σN2\hbox{$\sigma_{\rm N}^2$}, i.e. ϵN,i~N(0,σN2)\hbox{$\epsilon_{{\rm N},i} \sim N(0,\sigma_{\rm N}^2)$}. These three simulation models were chosen so that the correct modelling order should be K = 0, K = 1 and K = 2, respectively. Although both yK = 1(ti) and yK = 2(ti) are simple sinusoids, the CPS should naturally select K = 2 to model the data generated with yK = 2(ti) when all tested period candidates are close to P0. Then the CPS has to use a higher order model to correctly describe the two minima of yK = 2(ti) occurring with the period P0. All the analyses were performed with Klim = 3. This allowed us to examine the possibility that the criterion of Eq. (6) may yield a too high K even when analysing simulated data generated with yK = 2(ti).

In the cases of yK = 1(ti) and yK = 2(ti), the test data were simulated using four different noise levels where σN was 0.005, 0.002, 0.001 or 0.0005. Given the light curve half amplitude A/2 = 0.01, these correspond to amplitude to noise ratios A/N = 2, A/N = 5, A/N = 10 and A/N = 20, using the definition A/N = A/2σN. According to this definition, the data simulated with yK = 0(ti) and any σN has A/N = 0, and thus the results for K = 0 are independent of the absolute level of noise.

After analysing the simulated test data, we checked how often the CPS arrived at a K value different from that used in the simulation model. Fractions of false K in 15 simulated time series (a total of 2760 analysed datasets) are presented in Table 1. They are reported separately for those datasets that had n ≤ 20 and those that had n > 20. In the case of yK = 1(ti) and yK = 2(ti), the probability for a false model selection seems to be quite insensitive to A/N. There is no clear structure visible in the failure probabilities at different A/N, nor between K = 1 and K = 2. This seems to indicate that the sensitivity of the selection criterion does not strongly depend on A/N or the complexity of the periodic model. On the other hand, the number of observations in the datasets has a substantial influence to its success rate so that with more observations the correct model order is detected with a much higher probability. Nevertheless, the success rate stays between 80% and 95% in both the cases n ≤ 20 and n > 20, which can be considered as a satisfactory performance.

Unfortunately, in the case of yK = 0(ti), i.e. pure noise, the CPS still detected spurious periodicity in 21% of the datasets having n > 20 and 39% of those having n ≤ 20. This can cause problems if the analysis results are not interpreted critically. Luckily, these spurious periodic models can often be identified from their very low amplitudes. As a rule, the amplitudes of these models have values comparable to the preset noise level σN and the observed standard deviation of the residuals σϵ so that they fulfill A ≈ 2σN ≈ 2σϵ. This is expected, as the amplitudes are artifacts resulting solely from the modelling of noise.

Table 1

Fractions of falsely determined model order K in the analysis stable simulated data.

thumbnail Fig. 1

Model order K from analysis of test data simulated with a) model yK = 0(ti) in Eq. (10) and b) model yK = 1(ti) in Eq. (10) having amplitude to noise ratio A/N = 10. The number of data points in each dataset is presented in panel c).

thumbnail Fig. 2

Examples of single 25 d long datasets simulated with a) model yK = 0(ti) in Eq. (10) and b) model yK = 1(ti) in Eq. (10) having amplitude to noise ratio A/N = 10. The phases of these simulated observations are calculated using the period of the simulation model, P0 = 7.8 d.

Examples of the behaviour of K during a test analysis for typical simulated data are presented graphically in Fig. 1. The analysis of pure noise displays a large scatter of false results, although the correct value K = 0 is still clearly the most commonly obtained result (Fig. 1a). The scatter is large especially near the mid section of the time series, where the number of simulated observations is small, n < 20. The analysis of the data simulated with yK = 1(ti) and A/N = 10 behaves much better with only some overfitting and no false detections of K = 0 models (Fig. 1b).

Clues to why the Bayesian information criterion has trouble in detecting the correct K for the data simulated using the yK = 0(ti) model can be seen in Fig. 2. This figure presents simulated observations during two arbitrary simulated datasets and folded into phase representation using the model period P0 = 7.8 d. Because of the relatively small number of available data points (n = 19), the case of pure noise, i.e. underlying K = 0, still vaguely resembles some periodic form (Fig. 2a). The probability of chance resemblance of periodic data decreases only for a larger number of data. In the case of an underlying K = 1 simulation model (Fig. 2b) it is quite clear that a single sinusoid is the correct model for the data.

We conclude that the CPS detects real periodicity even from low amplitude data of a high noise level. If there is no real periodicity, spurious periodicity is still detected in about a quarter of the analysed datasets. These spurious period detections can, however, be identified because their observed amplitude to noise ratio is near unity, (A/N)obs=A2σϵ1.\begin{eqnarray*} ({A/N})_{\rm obs} = \frac{A}{2\sigma_{\epsilon}} \approx 1. \end{eqnarray*}In conclusion, the results need always to be interpreted with special care when the dataset contains few observations and the model amplitude A is low.

3.2. Sensitivity to two close minima

When analysing the distribution of starspots on the surface of a star, one often simply identifies the light curve minima as direct markers of the spot longitudes. In the case of a single spot or two clearly separated ones, this assumption, of course, holds and the light curve minimum phases give good approximations of the spot longitudes.

However, if two spots are longitudinally close enough to each other, the observable light curve minima can be significantly shifted. As the two minima produced by the two spots move closer to each other, their tails start to merge. At first, the minima stay separated but are shifted somewhat towards each other. At phase separations below some critical value Δφcrit, the minima finally merge and produce one single broad minimum. The observed phase of this single minimum lies between the phases of the two underlying spots.

The tendency of the light curve minima, produced by two individual spots, to merge depends on their width. Note, that no minimum can be very narrow, even in stars with an inclination close to 90°. For example, a spot located at the stellar equator is typically visible over half of the stellar rotation. During this time it modulates the visible brightness of the star. Thus the total width of the minimum spans about one half of the total rotation period. If the spot lies closer to the visible pole or the stellar inclination is small, it stays at the visible stellar disk for a longer period and causes an even broader minimum. In the extreme case, a spot can stay at the visible hemisphere at all times. In this case it causes a very broad minimum induced only by its variable projected area and limb darkening. Only a spot below the equator, closer to the unseen pole, may cause a narrow minimum, as it stays visible for less than half of the rotation period. However, all minima significantly narrower than half of the rotation can only be produced by spots that appear on the visible hemisphere briefly and never come very far from the limb of the star. Because of limb darkening and a small projected spot area, these minima are very shallow and are therefore strongly affected by noise.

To get an estimate of the critical phase separation Δφcrit, consider a model light curve of the form yspot,1(φ)={\begin{eqnarray} y_{\rm spot,\,\,1}(\phi) \!=\! \left\{ {\begin{array}{ll} 0 \ , & \phi < -0.25 \\ \frac{1}{2}[1 + \cos{(4\pi\phi)}] \ , & -0.25 \leq \phi \leq 0.25 \\ 0 \ , & \phi > 0.25 \end{array}} \right. \label{spot1} \end{eqnarray}(11)in phase space within the interval  − 0.5 < φ ≤ 0.5. This model approximates reasonably well the shape of a light curve produced by a single spot at the stellar equator at phase φ = 0. Putting two equally strong spots at phases φ0 and  − φ0 results in a merged light curve. The merged light curve within the interval φ0 − 0.25 ≤ φ ≤ 0.25 − φ0 has the form yspot,2(φ)=12[2+cos(4π(φ+φ0))+cos(4π(φφ0))].\begin{eqnarray*} y_{\rm spot,\,2}(\phi) = \frac{1}{2}[2 + \cos{(4\pi(\phi+\phi_0))} + \cos{(4\pi(\phi-\phi_0))}]. \end{eqnarray*}Note that this is only a linear approximation, as actual photometry is measured in magnitudes which can not be added in this manner themselves. In the case of starspot induced light variability where the amplitude of the light curve is in the range of 0.01 − 0.1 mag, this linear approximation can, however, be done. The combination of two yspot,   1(φ) curves at the phases φ0 = 0.15 and φ0 =  −0.15 is visualised in Fig. 4. The two underlying curves are drawn with dashed lines and the merged curve with a solid line. The phase separation of the two components is indicated with the two headed arrow.

The two spots can be identified from the light curve if they cause two separate light curve minima. In other words, there should be a secondary maximum at phase φ = 0 between the two minima. This corresponds to the condition yspot,2′′(0)=8π2[cos(4π(φ0))+cos(4π(φ0))]<0cos(4πφ0)>0.\begin{eqnarray*} y_{\rm spot,\,2}''(0) = -8\pi^2[\cos{(4\pi(\phi_0))} + \cos{(4\pi(-\phi_0))}] &<& 0 \\ \Rightarrow \cos{(4\pi\phi_0)} &>& 0. \end{eqnarray*}For 0 ≤ φ0 ≤ 0.25, this corresponds to the phase separation of the two spots being Δφ=2φ0>0.25,\begin{equation} \Delta\phi = 2\phi_0 > 0.25, \label{criticallimit} \end{equation}(12)

thumbnail Fig. 3

Primary (squares) and secondary (triangles) minimum phases from analysis of test data simulated with yspot(ti/P0) of Eq. (13). Filled symbols denote reliable and open symbols unreliable phase estimates. The two straight lines denote the correct phases where the simulated spots should have been detected.

i.e. no spots closer to each other than Δφcrit = 0.25 should cause two separate light minima. Less than optimal performance of the analysis algorithm may yield an even larger value of Δφcrit.

To determine the phase resolution of the CPS, we tested a two spot simulation model based on two adaptions of Eq. (11), yspot(tiP0)=yspot,1(tiP0)+yspot,1(tiP0+ΔP+0.1)+ϵN,i.\begin{equation} y_{\rm spot}\left(\frac{t_i}{P_0}\right) = y_{\rm spot,\,1}\left(\frac{t_i}{P_0}\right) + y_{\rm spot,\,1}\left(\frac{t_i}{P_0+\Delta P}+0.1\right) + \epsilon_{{\rm N},i}. \label{spotspot} \end{equation}(13)The model describes two spots rotating with periods P0 and P0 + ΔP and having an initial phase separation of Δφ = 0.1. The difference in period, ΔP =  −0.001327P0, was chosen so that at the end of the 377.43 d long time series the phase difference between the two spots would have increased from Δφ = 0.1 to 0.6. The random noise used in simulating this time series was ϵN,i ~ N(0,0.0012), i.e. A/N = 10.

The resulting minimum phases from the CPS analysis of this simulated data are shown in Fig. 3. Superimposed on the minimum phases detected with the CPS are two straight lines indicating the correct simulated phases where the two spots should have been found. At the beginning of the time series, the two underlying minima are inseparable and produce one common merged minimum with a phase between the two correct phases, as expected. Towards the end of the time series, the two minima both become detectable and have estimated phases very close to their correct values. However, near the time when the two minima first become separable their estimated phase differences are systematically smaller than the correct simulated values.

The two minima are first detected separately at time t = 167.99 d at which time their phase separation given by the CPS is Δφ = 0.27. The simulated correct phase separation at that moment is Δφ = 0.32. Both of these two values exceed the limit Δφcrit = 0.25 derived in Eq. (12). This indicates that the ability of the CPS to separate individual minima is not perfect. We conclude that spots with a phase separation smaller than about one third of a rotation are not expected to be separable with the CPS.

The failure to detect two close by spots separately from a light curve means that any time when two spots are detected from a light curve, they already have a large longitudinal separation. In other words, our impression of the distribution of the spots is biased towards spots lying nearly at opposite sides of the star. This can lead to detections of spurious active longitudes even in the case when the underlying physical spot distribution has a completely different geometry.

3.3. Stability of the method

The results of the analysis should be reliable regardless of random effects caused by noise and uneven distribution of the observing times. Analysis of test data simulated with an unchanging underlying model should thus give stable parameter estimates throughout the whole time series.

To test the stability of the CPS, we analysed sinusoidal test data simulated with yK = 1(ti) (Eq. (10)). The standard deviation of the noise was selected from σN ∈  [0.0005,  0.0007,  0.001,  0.002,  0.003,  0.005]  corresponding to amplitude to noise ratios A/N = 20, A/N = 14.3, A/N = 10, A/N = 5, A/N = 3.3 and A/N = 2, respectively. The analysis yielded generally rather stable values for the simulated light curve mean M and amplitude A and the single minimum phase φmin,   1. However, the period estimate P showed quite significant variations, as can be seen in Fig. 5. This type of spurious variations must be taken into account, since the period changes of spotted stars are commonly used to measure the strength of their surface differential rotation (Jetsu 1993).

thumbnail Fig. 4

Two single spot model light curves according to yspot,   1(φ) plotted around phases φ0 = 0.15 and φ0 =  −0.15 (dashed lines) and the combined light curve (solid line). The phase separation Δφ is indicated with the two headed arrow.

thumbnail Fig. 5

Period estimates from the analysis of test data simulated with yK = 1(ti) having A/N = 10 (see Eq. (10)).

To quantify the effect of the spurious period variations, we calculate the parameter Z=6ΔPwPw\begin{equation} Z = \frac{6\Delta P_{\rm w}}{P_{\rm w}} \label{z} \end{equation}(14)from the period estimates of the simulated data. This parameter measures the variability of P within its weighted  ± 3σ limits. The definitions of the parameters are wi=σP,i-2\hbox{$w_i=\sigma_{P,i}^{-2}$}, Pw =  ∑ wiPi[∑ wi]-1 and ΔPw=wi(PiPw)2/wi\hbox{$\Delta P_{\rm w} = \sqrt{\sum{w_i(P_i-P_{\rm w})^2}}/\sqrt{\sum{w_i}}$} (Jetsu 1993). The Z values calculated from the independent datasets for all simulated models are given in Table 2. When using the same parameter Z to characterise the differential rotation of a star, we may call these values spurious differential rotation Zspu.

What is striking in the results of Table 2, are the large values the spurious differential rotation Zspu, especially with lower A/N. This severely affects the detectability of weak differential rotation. For example, real differential rotation causing variability of only 2% in P would be hard to distinguish from the spurious differential rotation even from data of good quality with A/N = 20. The values of Zspu are roughly inversily proportional to A/N. Even very large 15% variations of P would be indistinguishable from the spurious differential rotation with A/N = 2.

By considering the period variations caused by physical and spurious differential rotation, i.e. Zphys and Zspu, to be independent effects, we can estimate the contribution of the real physical differential rotation. In this case, the observed value of Z2 is the sum of the squares of the two components, Z2=Zphys2+Zspu2\hbox{$Z^2=Z_{\rm phys}^2+Z_{\rm spu}^2$}, yielding Zphys2=Z2Zspu2.\begin{equation} Z_{\rm phys}^2 = Z^2 - Z_{\rm spu}^2. \label{zphys} \end{equation}(15)The value of Zspu in this equation can be interpolated for any A/N based on the values in Table 2. In theory, Eq. (15) gives an exact instantaneous value for Zphys under the assumption that Zphys and Zspu are uncorrelated. Note, however, that both the amplitude A of the light curve and the observational accuracy σϵ are likely to change during a time series. Thus it is impossible to get a unique estimate for Zspu. Therefore, the relation of Eq. (15) only gives a rough guideline for estimating the effect of the spurious period variations.

4. Analysis of HD 116956

Table 2

The spurious changes of the period P in units of Z (Eq. (14)) for test data simulated with yK = 1(ti) (Eq. (10)) and different values of A/N.

In this section we present results of the CPS analysis of long-term photometry of the young solar analogue HD 116956. The analysis was done using the a priori period estimate P0 = 7.80 d based on the preliminary results of Gaidos et al. (2000). The upper limit for the modelling order was set to be Klim = 2.

To further justify the choice of P0, we can estimate the lower limit for the stellar radius, Rsini = Pvsini/2π = 0.86   R, by using P = P0 and vsini = 5.6 km s-1\hbox{${}^{-1}$} as given in Sect. 1. This limit is slightly smaller than the radius of the Sun, as expected for a star with a somewhat later spectral type. Another estimate for the stellar radius can be obtained using the Barnes-Evans relation (Lacy 1977) log(R/R)=7.47240.2V02FV+logd,\begin{eqnarray*} \log{(R/R_{\odot})} = 7.4724 - 0.2V_0 - 2F_V + \log{d}, \end{eqnarray*}where FV = 3.977 − 0.429(V − R)0 and  [d]  =  pc. Using V0 = 7.286, (V − R)0 = 0.5 and d = 21.9 pc from the Hipparcos (Perryman et al. 1997) and USNO-B (Monet et al. 2003) catalogues and neglecting the interstellar reddening because of the small distance to the star, we get R = (0.66 ± 0.06)   R. This value is slightly smaller than the lower limit calculated using the rotation period and velocity. Nevertheless, the two results for the radius are of the same order and P0 is a reasonable estimate for the rotation period.

Our photometry1 of HD 116956 was obtained over 12 consecutive observing seasons between HJD = 2   451   172 (24 December 1998) and HJD = 2   455   342 (25 May 2010) with the T3 0.4 m automatic photoelectric telescope (APT) at Fairborn Observatory in Arizona. The APT performs differential photometry in the Johnson V and B passbands. A brief description of the operation of the APT and the reduction of the data can be found in Fekel & Henry (2005) and references therein. A total of 1408 differential V band observations were acquired with HD 114446 as the comparison star. This data is shown in Fig. 6, which also shows the season numbers that coincide with the segment division of the CPS analysis. The seasonal means are denoted with filled triangles which have been connected with continuous lines.

The precision of the observations taken with the T3 telescope is 0.004 − 0.005 mag, as shown by simultaneous measurements of the comparison star HD 114446 against a check star HD 119992 (see Fig. 6b). The difference between the comparison and check star showed no seasonal changes during the whole observing period, i.e. there are no prominent systematic errors in the data. This is also supported by the χ2-test for the n = 1273 comparison star minus check star differential magnitudes. Using ϵi = yi − my and σi = 0.005 gives χ2 = 1166.6 (Eq. (5)). This corresponds to the critical level Q = 0.9829, where Q states the probability of χ2 reaching the observed value under the null hypothesis that these differences remain constant. There is no need to reject this constant brightness hypothesis. In contrast, the n = 1408 HD 116956 minus comparison star differential magnitudes give χ2 = 12002.9 which corresponds to Q ≪ 10-20, i.e. these observations certainly exhibit variability. For a more detailed description of the data acquisition see Henry (1999).

4.1. Graphical presentation of the results

thumbnail Fig. 6

The differential photometry of a) HD 116956 and b) the comparison star HD 114446. Each observing season is labelled with its segment number SEG = 1, 2 , ..., 12. The filled triangles connected with continuous lines denote seasonal mean differential magnitudes.

thumbnail Fig. 7

The CPS analysis of SEG = 1 of the photometry of HD 116956. Descriptions of the subplots are given in Sect. 4.1.

The CPS divided the data into twelve segments, each constituting of a complete observing season. The observing seasons typically lasted from the beginning of December to the beginning of July. The first season of our data has also been analysed by Gaidos et al. (2000). The analysis of this first segment (SEG = 1) is presented graphically in Fig. 7, where the units are V magnitudes for σϵ, M and A and days for TC and P. All the rest of the parameters are dimensionless. This figure contains the subplots, where the parameters are plotted as functions of τ:

  • (a)

    standard deviation of residualsσϵ(τ);

  • (b)

    modelling order K(τ) (squares, units on the left y-axis); and number of observations per dataset n (crosses, units on the right y-axis);

  • (c)

    mean differential V-magnitude M(τ) (HD 116956 minus HD 114446;

  • (d)

    time scale of change TC(τ);

  • (e)

    amplitude A(τ);

  • (f)

    period P(τ);

  • (g)

    primary (squares) and secondary (triangles) minimum phases φmin,   1(τ) and φmin,   2(τ);

  • (h)

    M(τ) versus P(τ);

  • (i)

    A(τ) versus P(τ);

  • (j)

    M(τ) versus A(τ).

In the subplots (a), (c) and (e)–(g), the reliable parameter estimates are indicated by filled symbols and unreliable ones by open symbols. The reliability of the parameter estimates is determined as described in Sect. 2.2. In the case of P, also the level of the a priori period estimate P0 is shown (horizontal line). This fits reasonably well to the detected P values since the same data of SEG = 1 was used by Gaidos et al. (2000) to determine P0. The minimum phases are calculated using the median period Pmed of the segment. In subplot (d), the upward pointing arrows indicate that the computation of the time scale of change has reached the end of the segment and they correspond to the values TC =  −2 (see Appendix A). In the correlation plots (h)–(j), the error bars have been drawn only for the parameter estimates of independent datasets. The linear Pearson correlation coefficients r0 for the independent datasets, as well as the probabilities P(|r|  > r0), are given. Finally, the values of the a priori period estimate P0, the median period Pmed, the limiting modelling order Klim and the maximum length the a dataset ΔTmax are given above the plot.

thumbnail Fig. 8

Light curves of independent datasets: each dataset was first modelled with the phases φ1 = FRAC [(t − tmin,   1(τ))/P(τ)] . The phases φal,   1 of the primary minimum epochs tmin,   1(τ) were then computed with the constant period the ephemeris HJDal = 2   451   176.94 + 7.8416E (see Fig. 10). Finally, the data and the light curves were plotted as a function of the phase φ = φ1 + φal,   1.

The light curve fits for the independent datasets are presented in Fig. 8. Because the CPS models each of the datasets with a different period P(τ), it is in principle impossible to represent these light curves with an ephemeris based on one constant period. Each dataset was therefore first modelled with the phase φ1 = FRAC [(t − tmin,   1(τ))/P(τ)] , i.e. a different period value was used for each dataset. Then the phases φal,   1 of the primary minimum epochs tmin,   1 were computed with the constant period ephemeris HJDal = 2   451   176.94 + 7.8416E (see Sect. 4.3). Finally, the data and the light curve models were displayed as a function of the phase φ = φ1 + φal,   1. We chose this particular definition for the phase, because it allows an easier comparison between Figs. 8 and 10.

4.2. Long term variations

Our Fig. 9 displays the long term variation of the light curve parameters M, A and P. With the exception of the rotation period P, these seem to display some regular behaviour. This may be a sign of activity cycles in the star. The variations of P seem to be more or less random.

The long term variations are most striking in the mean brightness M. During 2004–2005 it underwent a dip of  ~ 0.02 mag. A similar dip occurred later in 2008. Also at the beginning of the observations in 1999, the values of M were dimmer than the average suggesting another similar dip. The same variability can be seen in the raw V data (Fig. 6). In addition to the minima before 1999 and in 2005 and 2008, there seems to have been one during 2001–2002. This one cannot be seen in the M variations. It is possible that signs of this particular dip are lost in the gap between the observing seasons of 2001 and 2002. Put together, these timings might suggest an activity cycle of roughly 3 years in length.

The variations of the mean brightness of the star have the most straight forward interpretation as an activity proxy. As the spottedness of the star increases, its mean effective temperature decreases and a lower brightness is observed. Another proxy is provided by the light curve amplitude A. It measures more or less the nonaxisymmetry of the spot distribution. There may or may not be a correlation between this and the total spottedness and likewise between the variations of A and M. As it happens, the connection between the long term variations of M and A is not immediately clear. There seem to be variations in A with about the same time scale as in M. However, these variations are more erratic than those of M.

To investigate somewhat further, whether the star has any activity cycles, we performed the CPS analysis for the independent M and A estimates, using σM-2\hbox{$\sigma_{M}^{-2}$} and σA-2\hbox{$\sigma_{A}^{-2}$} respectively as weights. Our a priori cycle period estimate was P0 = 3.0 yrs. The resulting cycle periods were PM = 3.26    ±    0.04 yrs and PA = 3.09    ±    0.13 yrs. However, these models had χM2=1206.2nind=72\hbox{$\chi^2_{M}=1206.2\gg n_{\rm ind}=72$} and χA2=847.1nind=72\hbox{$\chi^2_{A}=847.1\gg n_{\rm ind}=72$}, where nind is the total number of the independent datasets. Any reasonable model should satisfy χ2 ≈ nind, i.e. these two cycles are simply not statistically significant. Thus, the data does not seem to support the interpretation of the seasonal M and A variations as signs of activity cycles.

There may be short time scale correlations between M, A and P. For SEG = 1, the linear Pearson correlation coefficients are r0(P,M) =  −0.649, r0(P,A) = 0.324 and r0(A,M) =  −0.179. Because of the small number of independent datasets (nind = 7) none of these are significant. The same applies for all other segments as well. Other than simply linear correlations are also possible, though even more difficult to detect. The presence of such correlations is suggested by paths of parameter estimate pairs in Figs. 7h–j.

4.3. Active longitudes

The long term variation of the primary and secondary light curve minimum phases is displayed in Fig. 10 with the same notation as in Fig. 9. The minimum phases φmin,   1 and φmin,   2 are calculated from the minimum epochs tmin,   1 and tmin,   2 using the period Pal = 7.8416    ±    0.0011 d. This period was detected when the Kuiper test was applied to the n = 90 reliable primary and secondary minimum epoch estimates of all independent datasets.

The formulation of this nonweighted K-method was the same as in Jetsu & Pelt (1996). It tests the null hypothesis (H0) that the phases of the epochs of the primary and secondary minima are evenly distributed between 0 and 1. Under H0, one can determine the probability that the K-method test statistic reaches any particular value within the chosen tested period interval. This probability is called the critical level QK and if it is very small, the null hypothesis must be rejected. The best detected period has the smallest QK.

The period interval that was tested with the K-method was between Pmin = 0.85P0 = 6.63 d and Pmax = 1.15P0 = 8.97 d. The critical level of the best 7.8416 d periodicity was extremely significant, QK = 8.7 × 10-11. It exceeds all critical level estimates given in Table 2 by Jetsu (1996), where active longitudes were detected in λ And, σ Gem, II Peg and V711 Tau. Such an extreme value of QK confirms the presence of two long-lived active longitudes in HD 116956.

A striking feature of the minimum phases is that they are confined to two active longitudes with a phase separation of Δφ ≈ 0.5. Comparing Figs. 10 and 7g shows that this phenomenon is present in both long and short term evolution of the light curve. Considering the results of the analysis in Sect. 3.2, the existence of two confined areas of light curve minimum phases could be attributed to a selection effect. However, both the long term stability and the fact that there is very little short term random changes in the minimum phases show that the observed active longitudes are indeed likely to be real. If they were created by a selection effect, they would not remain stable on time scales longer than the length of single datasets ΔTmax.

The existence of a stable period with which the active longitudes rotate indicates that there is some coherent magnetic structure inside the star rotating with the period Pal. Such a structure could be generated by a nonaxisymmetric dynamo mode (Krause & Rädler 1980, p. 271).

In addition to the long term stability, the active longitudes seem to have experienced periods of migration. This is evident especially from the seasonal primary minimum phases. Between the observing seasons of 2004 and 2005, there was a phase jump of Δφ ≈ 0.3. During the next years, the primary minimum migrated to the same phase where it had resided before the phase jump. Quite remarkably, this transient occurred at the same time as the minimum of M at 2005. It is not clear whether these two phenomena are related to each other. Another short phase jump seems to have occurred at 2009 and possibly one also at 2000. In both cases, the primary and secondary minima seemed to experience the same phase shift with their mutual phase separation staying more or less constant.

thumbnail Fig. 9

Evolution of M, A and P during the complete time series. Parameter estimates from independent datasets are denoted as squares with error bars and from all other datasets as small crosses. M and A are given in V magnitudes and P in days.

thumbnail Fig. 10

Minimum phases φmin,   1 and φmin,   2 with the constant period ephemeris HJDal = 2   451   176.94 + PalE, where Pal = 7.4816 days. Estimates from independent datasets are denoted as squares (primary minima) and triangles (secondary minima) with error bars. Estimates from the rest of the datasets are denoted with small crosses.

Another remarkable property in the distribution of the minimum phases is that for most of the time the primary minima are confined to one active longitude and the secondary minima to the opposite one. This is evident in Fig. 10 where the primary minima present in the independent datasets are denoted as squares and the secondary minima as triangles. Only at the beginning of the time series are there exceptions to this. Two of the independent datasets had their primary minima near the phases where the following independent datasets had their secondary minima and one had its secondary minimum near the phases where the other independent datasets had their primary minima. The course of events can more closely be examined in Fig. 7g. At the beginning of the first segment, the primary light curve minima were located near φ = 0.7. As the segment continued, there was a migration of observed primary minimum phases to φ = 0.2, where a new active longitude formed. Roughly 70 d after the start of the segment, there was a shift in the order of the minima so that the secondary minimum grew deeper and became the new primary minimum. Towards the end of the segment, the old primary minimum gradually faded away. This change from a primary minimum to a secondary and vice versa is a direct proof of the occurrence of a flip-flop event in HD 116956 (Jetsu et al. 1993).

thumbnail Fig. 11

The light curve of HD 116956 during SEG = 1 folded with median period Pmed = 7.8896 d and divided into three parts. Panel a) includes observations from the first 29 nights of the segment, panel b) observations from the subsequent 56 nights and panel c) observations from the last 104 nights. The scale of the 1σ accuracy of the data (0.005 mag) is indicated with a separate error bar at the right upper corner of the panels.

Another closer look at the flip-flop in segment 1, that is observations between HJD = 2   451   172 and HJD = 2   451   361, is provided by Fig. 11. It displays the observations folded with the median period Pmed = 7.8896 d of the segment. Because of period variations during the segment, the resulting light curves are blurred. Nevertheless, the general variations of the light curve shape remain clearly visible. The observations in Fig. 11 are divided into three parts, panel (a) includes observations from the first 29 nights of the segment, panel (b) observations from the subsequent 56 nights and panel (c) observations from the last 104 nights. During the first and second parts of the segment, the light curve displayed two minima near φ = 0.2 and φ = 0.8 and its total amplitude remained relatively low. A distinguishing feature between the two parts is the relation between the depths of the two minima. During the first part, the minimum near φ = 0.2 is distinctly the deeper of the two. During the second part, no clear differences between the depths are visible. Quite remarkably, there was an abrupt change in the light curve shape between the second and third parts. The minimum near φ = 0.8 was slightly shifted and the minimum near φ = 0.2 disappeared. Also the light curve amplitude increased at the same time. As the observations continued unbroken during the changes, we must conclude that the changes were relatively fast. At the same time with the changes in the light curve shape there was a jump in M (Fig. 7c), which may have been connected to this flip-flop event.

The phase migrations at 2005 and 2009 may not be interpreted as flip-flops in the sense of Jetsu et al. (1993), as there were no switches of activity between the two active longitudes. Rather, both the primary and secondary minima remained at their own active longitudes, as well as preserved their phase separation.

4.4. Differential rotation

If the stellar photosphere rotates differentially, we can expect to observe different rotation periods for spots at different latitudes. At any particular time, only one value of P can be observed. But as the spot distribution evolves, so should the observed rotation period. The observed period values correspond to the total contribution of all the spots on the star weighted by their contrast to the unspotted surface and their visibility.

As already described in Sect. 3.3, we can measure the strength of the surface differential rotation using the  ± 3σ limits of P variations (Eq. (14)). Using only period estimates from the independent datasets, we get the weighted mean period Pw ± ΔPw = 7.8288    ±    0.1423 d and the strength of the variations Z = 0.11 ≡ 11%.

Combining the observational accuracy of 0.005 mag and a typical light curve amplitude A = 0.05 mag gives a signal to amplitude ratio A/N = 5. Using the results of the analysis in Sect. 3.3, this A/N value would introduce spurious period variations of Zspu = 0.06 ≡ 6%. We can investigate the contribution of this effect by calculating a rough estimate for the physical component of the period variations using Eq. (15). This gives Zphys ≈ 0.09 ≡ 9%, which is still larger than Zspu, although being of the same order of magnitude. Thus there remains a possibility that even these period variations are not physical and no differential rotation is observed. The possibility of a constant period in the observational data can, however, be rejected by computing the χ2 value (Eq. (5)) of the nind = 72 independent period estimates against the hypothesis P = Pw ≡ const. The residuals ϵi = Pi − PW and the errors σPi give χ2 = 282, which corresponds to a critical level Q ≪ 10-20, i.e. the hypothesis of a constant period must be rejected.

Assuming that the period variations are indeed caused by differential rotation, we can try to interpret the Z value in somewhat more depth. If we make the typical assumption of solar like differential rotation profile P(b) = Peq/(1 − ksin2b), where b is latitude and Peq the equatorial rotation period, we can relate the calculated Z value to the differential rotation coefficient k with the scaling law  | k |  ≈ Z/h, where h = sin2bmax − sin2bmin and bmin and bmax are the minimum and maximum latitudes of spot formation (Jetsu et al. 2000). In order to do this we must, however, know the latitudinal extent  [bmin,bmax]  of the spot activity. If the spots are restricted to a narrow latitude interval, we get observational data only from the rotation periods within that region. The actual value of k is thus larger than what Z suggests. If the spot activity spans all the way from the equator to the poles, the scaling coefficient h approaches unity and Z ≈  | k | . However, spots near the poles contribute less, or not at all, to the rotational modulation of brightness. This means that the suitable values of h are always somewhat less than unity.

In the case of HD 116956 we have no knowledge of the latitude extent of the spot activity. We can, however, apply different scaling factors. In the case of total range of spot activity from bmin = 0° to bmax = 90°, we have h = 1 and  |k|  = 0.11. This is roughly half of the solar value of k = 0.20. But for a solar like distribution of spot activity from bmin = 0° to bmax = 30°, we get h ≈ 0.25 and  |k|  = 0.44, over twice the solar value. Such a large value renders the assumption of solar like spot distribution highly dubious. It would be more likely that the spot activity on HD 116956 is distributed over a larger latitudinal range than on the Sun and the value of k be closer to k = 0.11. This would correspond to differential rotation rate ΔΩ = kΩ = 2πk/P = 0.088 rad d-1\hbox{${}^{-1}$} between the equator and the poles.

Henry et al. (1995) investigated the relation between the observed values of k and the stellar rotation periods Prot and arrived at the relation logk=2.12+0.76logProt0.57F,\begin{equation} \log{k} = -2.12 + 0.76\log{P_{\rm rot}} - 0.57F, \label{kp} \end{equation}(16)where Prot is in days. The Roche lobe filling factor F = Rstar/RRoche reduces to F = 0 for single stars. Using the weighted mean period Pw = 7.8288 d we get a prediction k = 0.036 for HD 116956. This is a low value, about one third of the observed lower limit k ≥ 0.11. Thus it seems that, both the lower and upper limits k = 0.11 and k = 0.44 reside in or near the scattered region of differential rotation estimates in Fig. 28 of Henry et al. (1995), and therefore HD 116956 would appear to undergo much stronger differential rotation than expected.

Another comparison was done using the results presented by Collier Cameron (2007). Their relation between differential rotation rate (ΔΩ) and the effective temperature of the star (Teff = 5170 K) was ΔΩ=0.053(Teff5130)8.6,\begin{equation} \Delta\Omega = 0.053\left(\frac{T_{\rm eff}}{5130}\right)^{8.6}, \end{equation}(17)where Teff is in kelvins and ΔΩ in radians per day. This predicts ΔΩ = 0.057 rad d-1\hbox{${}^{-1}$} for HD 116956. This is again smaller than the observed value, although closer to it than what was predicted earlier by Eq. (16).

4.5. Time scale of change

The parameter TC is an estimate of the typical time scale in which the spot distribution changes. It is an important parameter when investigating the short term dynamics of the spot activity. Unfortunately, interpreting the values of TC is not simple. As can be seen in Fig. 7d, the TC values evolve quite rapidly. Moreover, there are a large number of datasets where the model has been applicable right to the end of the segment (denoted with arrows), i.e. no TC estimate has been obtained.

The seemingly random evolution of TC stems from many causes. First of all, the complexity of the model affects the time that this same model stays applicable to future observations. Simple models with low K tend to have longer TC than more complex ones. This is apparent from the very first few datasets of SEG = 1, as they have K = 1 (Fig. 7b) and TC > 160 d (Fig. 7d) reaching the end of the segment.

An incomplete phase coverage of the light curve can also cause a too low value of TC. If there is a phase gap in the light curve with no observations, there may be a situation where the step to the next dataset introduces new observations to the phase gap thus completing the phase coverage. In this case, the contribution of new observations can significantly alter the light curve so that the model of the previous dataset is no longer applicable and TC gets a very low value.

Even if there are no significant phase gaps in the observations, an increasing amount of observational data can affect the value of TC. This is because with more data the light curve can be determined more accurately and there is a higher probability of choosing a more complex model. This leads to smaller values for TC. Through a concrete example, it can be seen how a larger number of observations and a better observational accuracy decrease the values of TC. These effects can be nicely seen in the observations of ϵ Eri by Croll et al. (2006). They analysed continuous photometry of the star taken with the MOST satellite during three rotation periods. Their light curve is both well sampled and accurate and it turns out to have a different shape during each of the three rotations, as can be seen in their Fig. 2.

Because of the complex behaviour of TC, it is best to look at the long-term mean time scale of change rather than studying the short term changes. This long-term mean for HD 116956 is TC=44.1\hbox{$\overline{T}_{\rm C}=44.1$} d. This value is nearly two times as large as the length of the datasets ΔTmax = 25 d. Thus the light curve usually remains unchanged during one typical dataset and the chosen value of ΔTmax is reasonable.

The timescale of light curve change can be compared to the convective turnover time τc, as mixing of the convection zone may alter the spot distribution which is then observed as a change in the light curve. Kim & Demarque (1996) performed theoretical modelling to find the values of τc in late type stars. Ossendrijver (1997) published an interpolation formula for their results by fitting a cubic polynomial to the theoretically calculated values, τc=68.3+224.8x177.2x2+57.0x3,\begin{equation} \tau_{\rm c} = -68.3 + 224.8x - 177.2x^2 + 57.0x^3, \end{equation}(18)where x = B − V. This formula gives τc in days. By using the Hipparcos value of B − V = 0.823 for HD 116956 (Perryman et al. 1997), we get τc = 28.5 d. Even if the two values are of the same order of magnitude, it is not clear whether there is a meaningful link between τc and TC\hbox{$\overline{T}_{\rm C}$}.

5. Conclusions

We have formulated a new continuous period search (CPS) method by improving the earlier TSPA method described in Paper I. Our goal was to develop better tools for the analysis of photometry of active stars. The three new features are:

  • (1)

    A sliding window for selecting overlapping datasets.

  • (2)

    A criterion for selecting the correct order for the model.

  • (3)

    A time scale for the change TC of the model.

Using the sliding window significantly improves the time resolution of the analysis. The CPS determines the light curve parameters with a time resolution much shorter than the length of individual datasets. In contrast, the TSPA could only achieve a time resolution equal to the dataset length. A similar sliding window was applied for a first order harmonic model in Berdyugina et al. (2002, their Eq. (3)). Our model order selection criterion allows the selection of a model with suitable complexity for describing the data in each individual dataset. This even allows the alternative of a constant light curve with no periodicity. Since the CPS can use Fourier series of any arbitrary order as a model, it can in principle be applied to any type of periodic data. Finally, the computed TC parameter measures the time scale of change of the light curve.

In order to characterise the performance of the CPS method, we tested it with simulated data. We found imperfect performance in three critical situations.

Firstly, the model order selection criterion does not give the correct order K in all of the analysed datasets. This is caused by the limited number of observations. Typically, when analysing photometry of active stars, each dataset contains 10 ≤ n ≤ 30 observations. The success rate in finding the correct modelling order depends on n. In the case of analysing periodic data, the success rate is between 80% and 90% for datasets with n ≤ 20 and larger than 90% for datasets with n > 20. In the case of constant noisy data with no periodicity, this success rate is much smaller, and spurious periodicity is still detected in 40% and 20% of the datasets having n ≤ 20 and n > 20, respectively. This reduces our ability to distinguish between time intervals of periodic variation and constant brightness in the light curve. However, this type of spurious periodicity can often be identified from the low observed amplitude to noise ratio of the models.

Secondly, it is not always possible to uniquely detect two separate minima from the light curves of spotted stars. This is a general problem and common to all time series analysis methods. Below some critical phase separation Δφcrit, the two minima merge into one common minimum. We calculated a theoretical limit of Δφcrit = 0.25 and then obtained Δφcrit ≈ 0.33 from simulated data analysed with the CPS. The existence of this type of a phase limit is a concern when analysing photometry of active stars, since it means that two individual spots must have a considerable longitudinal separation to be detected separately. If one is not careful, this may lead to detections of spurious active longitudes.

Thirdly, there is an intrinsic instability in the period estimation. This is a concern because period variations are often used to measure the differential rotation of active stars. For good quality data the effect is limited, but grows substantially for more noisy data. This period instability is caused by random effects due to noise, which is a general problem for all time series analysis methods, especially when the analysed datasets are short.

We applied the CPS to 12 years of V band photometry of the young solar analogue HD 116956. The analysis revealed variations in the mean magnitude M and the light curve amplitude A with an apparent period around 3 years. However, we found no conclusive evidence supporting the interpretation that these variations are signs of an activity cycle, as the CPS analysis for the M and A estimates gave very high χ2 values for a periodic fit.

The star also displays two active longitudes that have remained stable over the whole 12 year observing period. The standard Kuiper test of the primary and secondary minimum epochs gave an extreme critical level QK = 8.7 × 10-11 for the rotation period Pal = 7.8416    ±    0.0011 d of these active longitudes. There have been only few transient excursions to other longitudes and the separation of the primary and secondary minima has stayed nearly unchanged. During the first observing season at 1999, the star underwent a flip-flop. In this event, the major activity switched from one active longitude to the other, while the old primary minimum disappeared completely. This flip-flop event happened very fast.

We estimated the differential rotation of the star, assuming that it is the cause of the observed photometric period variations. These  ± 3σ variations of the period gave the value Z = 11%, which is much larger than what could be caused only by spurious period variations. Assuming that the spot distribution covered the whole latitude range from equator to pole and that the solar law of differential rotation were valid also for HD 116956, these variations would correspond to a differential rotation coefficient  | k |  = 0.11, or equivalently a differential rotation rate ΔΩ = 0.057 rad d-1\hbox{${}^{-1}$}. The observed solar value is k = 0.20. If the latitudinal extent of the spot activity in HD 116956 were the same as in the Sun, surface differential rotation would be much stronger, i.e.  | k |  = 0.44.

The mean time scale of change of the light curve, i.e. the spot distribution of the star, was TC=44.1\hbox{$\overline{T}_{\rm C}=44.1$} d. This exceeds the length of the datasets, ΔTmax = 25.0 d. Hence the spot distribution remained unchanged during a typical dataset. There may be a link between the above TC\hbox{$\overline{T}_{\rm C}$} value and the convective turnover time τc = 28.5 d, having the same order of magnitude. This could be expected, if convective mixing causes the observed changes in the distribution of starspots.


1

The V band photometry used in this paper is published electronically at the CDS.

2

The full results of the analysis of HD 116956 are published electronically in this format at the CDS.

Acknowledgments

The work by P. Kajatkari was supported by the Vilho, Yrjö and Kalle Väisälä Foundation. The automated astronomy program at Tennessee State University has been supported by NASA, NSF, TSU and the State of Tennessee through the Centers of Excellence program.

References

  1. Berdyugina, S., Pelt, J., & Tuominen, I. 2002, A&A, 394, 505 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Collier Cameron, A. 2007, Astron. Nachr., 328, 1030 [NASA ADS] [CrossRef] [Google Scholar]
  3. Croll, B., Walker, G., Kusching, R., et al. 2006, ApJ, 648, 607 [NASA ADS] [CrossRef] [Google Scholar]
  4. Fekel, F., & Henry, G. 2005, AJ, 129, 1669 [NASA ADS] [CrossRef] [Google Scholar]
  5. Gaidos, E. 1998, PASP, 110, 1259 [NASA ADS] [CrossRef] [Google Scholar]
  6. Gaidos, E., Henry, G., & Henry, S. 2000, AJ, 120, 1006 [NASA ADS] [CrossRef] [Google Scholar]
  7. Gray, R., Corbally, C., Garrison, R., McFadden, M., & Robinson, P. 2003, AJ, 126, 2048 [NASA ADS] [CrossRef] [Google Scholar]
  8. Henry, G. 1999, PASP, 111, 845 [NASA ADS] [CrossRef] [Google Scholar]
  9. Henry, G., Eaton, J., Hamer, J., & Hall, D. 1995, ApJS, 97, 513 [NASA ADS] [CrossRef] [Google Scholar]
  10. Jetsu, L. 1993, A&A, 276, 345 [NASA ADS] [Google Scholar]
  11. Jetsu, L. 1996, A&A, 314, 153 [NASA ADS] [Google Scholar]
  12. Jetsu, L., & Pelt, J. 1996, A&AS, 118, 587 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Jetsu, L., & Pelt, J. 1999, A&AS, 139, 629 (Paper I) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Jetsu, L., Pelt, J., & Tuominen, I. 1993, A&A, 278, 449 [NASA ADS] [Google Scholar]
  15. Jetsu, L., Hackman, T., Hall, D., et al. 2000, A&A, 362, 223 [NASA ADS] [Google Scholar]
  16. Kim, Y.-C., & Demarque, P. 1996, ApJ, 457, 340 [NASA ADS] [CrossRef] [Google Scholar]
  17. Krause, F., & Rädler, K.-H. 1980, Mean-field magnetohydrodynamics and dynamo theory (Oxford: Pergamon Press) [Google Scholar]
  18. Lacy, C. 1977, ApJ, 213, 458 [NASA ADS] [CrossRef] [Google Scholar]
  19. Lomb, N. 1976, Ap&SS, 39, 447 [NASA ADS] [CrossRef] [Google Scholar]
  20. Monet, D., Levine, S., Canzian, B., et al. 2003, AJ, 125, 984 [NASA ADS] [CrossRef] [Google Scholar]
  21. Montes, D., López-Santiago, J., Gálvez, M., et al. 2001, MNRAS, 328, 45 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  22. Ossendrijver, A. 1997, A&A, 323, 151 [NASA ADS] [Google Scholar]
  23. Perryman, M., Lindegren, L., Kovalevsky, J., et al. 1997, A&A, 323, L49 [NASA ADS] [Google Scholar]
  24. Scargle, J. 1982, ApJ, 263, 835 [NASA ADS] [CrossRef] [Google Scholar]
  25. Stoica, P., & Selén, Y. 2004, IEEE Signal Processing Magazine, 21, 36 [Google Scholar]

Appendix A: Format of the results

Through the CPS method we obtain a lot of information useful for studying stellar activity. These results are published electronically and it is therefore necessary to describe the notation format in detail. The results of each individual dataset have been compressed to nine lines of an ASCII file. The parameter estimates are written in the format given in Table A.1. This table also specifies the units of the parameters.

The first row in Table A.1 gives the segment number (SEG), the epoch of the first observation in the segment (t0) and the dataset number (SET). The second row gives the epochs of the first (t1) and the last (tn) observation in the subset, the mean epoch of the dataset (τ) and specifies whether the dataset is considered an independent dataset or not (IND). For independent datasets the value is IND = 1, otherwise IND = 0. The third row gives the number of observations (n), their mean (my) and their standard deviation (sy). The fourth row gives the order of the model (K), the standard deviation of the residuals (σϵ) and the time scale of change (TC). As already mentioned in Sect. 2.3, the value TC =  −2 indicates that the model describes well all the remaining datasets in the segment. The rest of the rows give values for the light curve parameters (M, P, A, tmin,   1, tmin,2) and their errors (σM, σP, σA, σtmin,1, σtmin,   2). It is also specified whether these parameter estimates are considered reliable or not. Reliable estimates have R=0, while unreliable ones have R = 1. All the heliocentric Julian dates are given in the truncated form HJD − 2  400  000. If for some reason no value has been obtained for some parameter, the “dummy” value  − 1 is given.

A typical example of an entry into the output file is:

                  1     51171.9835                      17 
 51200.9434    51225.8738     51214.2168        1
                22              0.2658              0.0086
                  2              0.0055             22.2344
        0.2654              0.0012                         0
        7.1382              0.1267                         0
        0.0274              0.0045                         0
51207.7204             0.1689                         0
51203.6602             0.2811                         1

This is the 17th dataset in the first segment (SEG = 1, SET = 17) in the analysis2 of the star HD 116956 presented in Sect. 4. It is a dataset of n = 22 observations obtained during a period of about ΔT = tn − t1 = 25 d centered around HJD = 2   451   214.22. The dataset has been labelled as an independent one (IND = 1). The modelling has been done with K = 2 and the values for all model parameters M, P, A, tmin,   1 and tmin,   2 have been obtained. Only the tmin,   2 estimate is found to be unreliable (R = 1).

Another example is the 11th dataset in the second segment (SEG = 2, SET = 11):

                   2    51515.0410                    11
51560.9421    51585.8988    51574.4281        0
                 13             0.2561             0.0084
                    1            0.0063            -1.0000 
          0.2568            0.0019                        1
          8.3355            0.5855                        1
          0.0154            0.0046                        1
 51567.3488            0.7673                        1
        -1.0000          -1.0000                        1

This dataset is not independent (IND = 0). It contains only n = 13 observations and has been modelled with a simpler K = 1 model. For this model, there does not exist a secondary minimum and correspondingly tmin,   2 has a dummy value  − 1. All light curve parameters have been found unreliable (R = 1). Also, because the model parameters have been found unreliable, no value has been computed for the time scale of change TC.

Table A.1

The format of the CPS output file for each individual datset.

All Tables

Table 1

Fractions of falsely determined model order K in the analysis stable simulated data.

Table 2

The spurious changes of the period P in units of Z (Eq. (14)) for test data simulated with yK = 1(ti) (Eq. (10)) and different values of A/N.

Table A.1

The format of the CPS output file for each individual datset.

All Figures

thumbnail Fig. 1

Model order K from analysis of test data simulated with a) model yK = 0(ti) in Eq. (10) and b) model yK = 1(ti) in Eq. (10) having amplitude to noise ratio A/N = 10. The number of data points in each dataset is presented in panel c).

In the text
thumbnail Fig. 2

Examples of single 25 d long datasets simulated with a) model yK = 0(ti) in Eq. (10) and b) model yK = 1(ti) in Eq. (10) having amplitude to noise ratio A/N = 10. The phases of these simulated observations are calculated using the period of the simulation model, P0 = 7.8 d.

In the text
thumbnail Fig. 3

Primary (squares) and secondary (triangles) minimum phases from analysis of test data simulated with yspot(ti/P0) of Eq. (13). Filled symbols denote reliable and open symbols unreliable phase estimates. The two straight lines denote the correct phases where the simulated spots should have been detected.

In the text
thumbnail Fig. 4

Two single spot model light curves according to yspot,   1(φ) plotted around phases φ0 = 0.15 and φ0 =  −0.15 (dashed lines) and the combined light curve (solid line). The phase separation Δφ is indicated with the two headed arrow.

In the text
thumbnail Fig. 5

Period estimates from the analysis of test data simulated with yK = 1(ti) having A/N = 10 (see Eq. (10)).

In the text
thumbnail Fig. 6

The differential photometry of a) HD 116956 and b) the comparison star HD 114446. Each observing season is labelled with its segment number SEG = 1, 2 , ..., 12. The filled triangles connected with continuous lines denote seasonal mean differential magnitudes.

In the text
thumbnail Fig. 7

The CPS analysis of SEG = 1 of the photometry of HD 116956. Descriptions of the subplots are given in Sect. 4.1.

In the text
thumbnail Fig. 8

Light curves of independent datasets: each dataset was first modelled with the phases φ1 = FRAC [(t − tmin,   1(τ))/P(τ)] . The phases φal,   1 of the primary minimum epochs tmin,   1(τ) were then computed with the constant period the ephemeris HJDal = 2   451   176.94 + 7.8416E (see Fig. 10). Finally, the data and the light curves were plotted as a function of the phase φ = φ1 + φal,   1.

In the text
thumbnail Fig. 9

Evolution of M, A and P during the complete time series. Parameter estimates from independent datasets are denoted as squares with error bars and from all other datasets as small crosses. M and A are given in V magnitudes and P in days.

In the text
thumbnail Fig. 10

Minimum phases φmin,   1 and φmin,   2 with the constant period ephemeris HJDal = 2   451   176.94 + PalE, where Pal = 7.4816 days. Estimates from independent datasets are denoted as squares (primary minima) and triangles (secondary minima) with error bars. Estimates from the rest of the datasets are denoted with small crosses.

In the text
thumbnail Fig. 11

The light curve of HD 116956 during SEG = 1 folded with median period Pmed = 7.8896 d and divided into three parts. Panel a) includes observations from the first 29 nights of the segment, panel b) observations from the subsequent 56 nights and panel c) observations from the last 104 nights. The scale of the 1σ accuracy of the data (0.005 mag) is indicated with a separate error bar at the right upper corner of the panels.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.