Free Access
Issue
A&A
Volume 541, May 2012
Article Number A46
Number of page(s) 11
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/201117335
Published online 26 April 2012

© ESO, 2012

1. Introduction

Astronomical interferometers combine the light from multiple telescopes to form interference fringes. The amplitude and phase of the fringes corresponding to interference between a given pair of telescopes can be combined into a single complex number, the fringe complex amplitude, which in an ideal interferometer is proportional to a single Fourier component of the object brightness distribution.

The complex amplitude forms the fundamental observable in phase-stable interferometric experiments, for example at radio wavelengths, but at optical wavelengths the phase of the fringes is unstable on timescales of order milliseconds, because of optical pathlength perturbations introduced by the Earth’s atmosphere. As a result, observers need to combine the information from multiple short-exposure interferograms to recover the desired science data. Since the phase of the fringes is randomly fluctuating, simply averaging the complex amplitude over multiple interferograms would lead to a vanishingly small result. Instead two observables, the power spectrum and bispectrum, are calculated from each interferogram and these can be averaged over frames as both are immune (to first order) to the atmospheric and instrumental phase perturbations. The power spectrum serves to capture information about the modulus of the complex amplitude, whilst the phase of the bispectrum (also known as the “closure phase”) captures information about the phase of the complex amplitude on closed triplets of baselines.

At the low light levels typically encountered in astronomical interferometry, the effects of the noise arising in the detection process become important. This noise can usually be modelled as a mixture of Poisson noise, for example photon noise, and signal-independent Gaussian noise, for example readout noise in the detector electronics. We use the term “detection noise” to refer to the combination of all the noise processes in the detection process including the photon noise.

Detection noise produces random measurement errors which can be reduced by averaging over many interferograms, but in the case of the power spectrum and bispectrum it also produces systematic biases in the averaged estimates. There has been considerable study in the literature on these biases (see for example Goodman & Belsher 1976a,b; Sibille et al. 1979; Wirnitzer 1985; Beletic 1989; Perrin 2003). Some authors have derived formulae for the power spectrum bias which allow for both Poisson and Gaussian noise sources, but existing bispectrum bias estimates have been derived under the assumption of Poisson noise only. As a result, it is difficult to derive an unbiased estimate of the closure phase from observations where both Poisson and Gaussian noise are significant. Such conditions are increasingly present in interferometric observations made at near-infrared wavelengths so the absence of such an estimator is becoming a critical limitation to the acquisition of accurate interferometric data as we will demonstrate in Sect. 8.

Much of the existing power spectrum and bispectrum analysis relies on the direct Fourier transform of the interferogram to recover the fringe complex amplitudes. Even under noise-free conditions the use of a Fourier-based estimator will return inaccurate estimates of the fringe complex amplitude if the interferogram is, for example, non-uniformly sampled. To account for this, Tatulli & Le Bouquin (2006) introduced a pixel-to-visibility transformation matrix (P2VM) for deriving fringe amplitude estimates in circumstances where the Fourier-based estimators are inaccurate, which includes the Fourier estimators as a special case.

There has been considerable study of the variance of the power spectrum and bispectrum (Goodman & Belsher 1976a,b; Dainty & Greenaway 1979; Sibille et al. 1979; Wirnitzer 1985; Roddier 1987; Ayers et al. 1988), but only the work of Tatulli & Le Bouquin (2006) applies the more general P2VM approach, and in this case the variance is calculated under assumptions valid only for purely Gaussian noise processes. Knowing the variance, and hence the signal-to-noise ratio is important for predicting the integration times and detection limits prior to an observation run.

This paper builds on the introduction by Gordon et al. (2010) and previous work by Thorsteinsson & Buscher (2004) to present a general formalism for computing biases and variances in the presence of detection noise. Our formalism has two advantages over current treatments: (a) it can be applied when the fringe complex amplitudes are derived using any linear estimator, including the P2VM and Fourier transform as special cases and (b) it yields accurate answers for any additive mixture of Poisson noise and Gaussian noise. We derive formulae for the biases affecting both the power spectrum and bispectrum under these conditions and show how our results allow the construction of bias-free estimators. We present simulations to show that using our new bispectrum estimator can significantly enhance the accuracy of measuring the closure phase under realistic observation conditions. We also use our formalism to derive expressions for the variance of the power spectrum, and explain the method by which the reader can calculate the bispectrum variance if required.

2. The interferometric equations

2.1. The interferogram

We will start from a specific idealised example and use this to introduce our more general model which will be applicable to a wide range of realistic scenarios. In our example, the light from Ntel telescopes is combined to produce a 1-dimensional interference pattern, which consists of the superposition of a set of sinusoidal fringes at different fringe frequencies fij corresponding to the interference between telescopes i and j. The interference pattern is sampled to produce a set of Npix intensity measurements  {ip} . We shall call this discrete set of intensity measurements the “interferogram”. In our example the interferogram is given by: i p = C 00 N pix + 1 N pix i < j N tel Re [ C ij e 2 π i α p f ij ] + noise , \begin{equation} \label{eq:simplesample} i_p=\frac{C^{00}}{N_{\rm pix}}+ \frac{1}{N_{\rm pix}}\sum_{i<j}^{N_{\rm tel}}\mathrm{Re}\left[ C^{ij}{\rm e}^{2\pi{\rm i}\alpha_{p}f^{ij}} \right] +{\rm noise}, \end{equation}(1)where Cij denotes the complex amplitude of the fringes corresponding to interference between light from telescopes i and j, C00 denotes an offset due to the light from all the telescopes (we shall call this the “zero-spacing flux”), and αp is the coordinate of pixel p.

Recognising that Eq. (1) represents a sum over a set of Fourier components, we can derive an estimate cij of the complex amplitudes from the sampled intensity data using a discrete Fourier transform (DFT): c ij = p = 1 N pix i p e 2 π i α p f ij , \begin{equation} \label{eq:simpledft} c^{ij}= \sum_{p=1}^{N_{\rm pix}} i_p{\rm e}^{2\pi{\rm i}\alpha_{p}f^{ij}}, \end{equation}(2)where we note that this equation can also be used to estimate the zero-spacing flux by setting f00 = 0. In noise-free conditions, the estimator cij will be equal to the true amplitudes Cij provided that the samples are evenly spaced in the coordinate α and the frequencies fij are integer multiples of the fundamental frequency given by ffundamental = 1/ [Npix(α2 − α1)] . We will call this set of conditions the “DFT conditions”.

We can rewrite the above equations in the form: i = W C + noise , \begin{equation} \label{eq:generalforward} \vec i=\boldsymbol{W}\vec{C}+{\rm noise}, \end{equation}(3)and: c = H i , \begin{equation} \label{eq:generalinverse} \vec c=\boldsymbol{H}\vec{i}, \end{equation}(4)respectively where i =  [i1,i2,...]  is a vector containing the pixel intensities, C =  [C00, Re {C12}, Im {C12}, Re {C13}, Im {C13}, ...] is a vector describing the true complex amplitudes, c =  [c00,Re {c12}, Im {c12}, Re {c13}, Im {c13}, ...] is a vector describing the estimated complex amplitudes, W is a matrix known as the “visibility-to-pixel matrix” (V2PM) and H is the pixel-to-visibility matrix (P2VM) mentioned before. When the DFT conditions hold, the matrices W and H will consist of sine and cosine terms.

If we now assume that the DFT conditions do not hold, for example if the samples are not evenly spaced or there are “dead” pixels, then it is often still possible to set up a linear set of equations in the form of Eq. (3) by modifying the matrix W. For the case of any “working” beam combiner, we can always find the corresponding matrix H such that: H W = I , \begin{equation} \label{eq:identity} \boldsymbol{H}\boldsymbol{W}=\boldsymbol{I}, \end{equation}(5)where I is the identity matrix, i.e. Eq. (5) is the condition that H is a pseudo-inverse of W. It is then clear that Eq. (4) can be used to derive a value for c such that c = C under noise-free conditions. The pseudo-inverse generalises the Fourier inverse to the cases in which the DFT conditions may not hold; in general the pseudo-inverse matrix will not consist of simple Fourier terms.

Equation (4) serves to define the basis of our more general model for an interferometric measurement. We assume only a discrete set of intensity measurements  {ip} related to a set of complex fringe amplitudes  {Cij} . We do not require a simple Fourier relationship between the amplitudes and the pixel intensities; we only require that there exists a set of estimators for  {Cij} which are linear in  {ip} . In other words we require only that there exists at least one set of complex weights { H p ij } \hbox{$\{H^{ij}_p\}$} such that the complex amplitude estimators: c ij = p i p H p ij , \begin{equation} c^{ij}= \sum_{p}i_{p}H^{ij}_{p}, \label{eqn:c_and_H} \end{equation}(6)have the property that cij = Cij in the limit where detection noise can be neglected. This generalised model can be applied to a wide variety of beam combination scenarios: the interferograms can be sampled temporally or spatially or a combination of the two, and multiple telescopes can be combined “all-in-one” into a single multiplexed fringe pattern or pairwise onto separate detectors.

Note that pseudo-inverse techniques are only presented as one possible route to deriving { H p ij } \hbox{$\{H^{ij}_{p}\}$}; under circumstances where the forward matrix W is not known or not fixed (for example if the fringe pattern has not been spatially filtered) alternative techniques to derive { H p ij } \hbox{$\{H^{ij}_{p}\}$}, perhaps based on knowing the statistical properties of W, may be needed. Nevertheless, the results presented here will be valid independent of the technique used to derive values for H p ij \hbox{$H^{ij}_{p}$}.

2.2. Interferometric observables

Under phase-unstable conditions, it is common to choose the power spectrum, Sij and bispectrum, Bijk as the interferometric observables: S ij = | C ij | 2 , \begin{equation} S^{ij} = \left|C^{ij}\right|^{2},\label{eqn:S_first} \end{equation}(7)and: B ijk = C ij C jk C ki . \begin{equation} B^{ijk} = C^{ij}C^{jk}C^{ki}.\label{eqn:B_first} \end{equation}(8)Typically the values of Cij will change randomly from interferogram to interferogram due to atmospheric disturbances, but the values of Sij and Bijk will be more stable. However, unless otherwise specified, we do not assume any particular distribution for the values of Cij; for example the results will apply equally well under incoherent conditions where the average value of Cij is zero or conditions where a fringe tracker is being used and the value of Cij is stable from frame to frame. Our aim is to determine suitable estimators, S 0 ij \hbox{$S_{0}^{ij}$}, B 0 ijk \hbox{$B_{0}^{ijk}$}, for the power spectrum and bispectrum respectively that do not suffer from bias in the presence of noise. For this analysis these observables are calculated on an interferogram-by-interferogram basis and then averaged. Derived quantities, such as the closure phase, are calculated using the averaged observables, rather than on an interferogram-by-interferogram basis themselves.

3. Noise model

The signal recorded by a real detector system will always have some noise component, which becomes especially important at low light levels. We model two distinct sources of noise, photon shot noise and detector read noise.

3.1. Photon noise

Photon noise is always present in a real interferogram as a result of the quantum nature of the detection process which converts the incoming photons to an electrical signal. For a given pixel p, the probability of Np photoevents occurring is given by the Poisson distribution: P Poisson ( N p | Λ p ) = n δ ( N p n ) Λ p N p N p ! exp [ Λ p ] , \begin{equation} P_{\textrm{Poisson}}\left(N_{p}|\Lambda_{p}\right) = \sum_{n}^{\infty}\delta\left(N_{p}-n\right)\frac{\Lambda_{p}^{N_{p}}}{N_{p}!}\exp\left[-\Lambda_{p}\right],\label{eqn:poisson} \end{equation}(9)where Λp is the ideal noise-free intensity, also called the “classical intensity” or “high-light-level intensity”. Note that this equation implicitly defines the normalisation of Λp in units of photoevents, and we require that ip = Λp under noise-free conditions, thereby defining the normalisation of ip.

3.2. Read noise

Read noise is present in all optical and IR detectors and is often significant at low light levels. Assuming the read noise is independent of the number of photoevents, the probability of a given measurement error ϵp = ip − Np is given by the Gaussian distribution: P Gaussian ( ϵ p | σ p , N p ) = 1 2 π σ p 2 exp [ ( ϵ p ) 2 2 σ p 2 ] \begin{equation} P_{\textrm{Gaussian}}\left(\epsilon_{p}|\sigma_{p}, N_{p}\right) = \frac{1}{\sqrt{2\pi\sigma_{p}^{2}}}\exp\left[-\frac{\left(\epsilon_{p}\right)^{2}}{2\sigma_{p}^{2}}\right] \end{equation}(10)where σ p 2 \hbox{$\sigma_{p}^2$} is the variance of the noise on pixel p; note that the variance is not assumed to be the same on all pixels.

3.3. Other sources of noise

There are additional noise sources that follow the same statistics as photon or read noise. Detector dark current, Dp, for example is also described by the Poisson distribution and could be included in our formalism by replacing Λp with Λp + Dp in Eq. (9). For the sake of readability dark current will not be explicitly included in this paper.

3.4. Combined noise model

If the photon noise and read noise are statistically independent then the probability of obtaining the noisy data, ip, from the underlying ideal data, Λp, is given by: P ( i p | Λ p , σ p ) = 0 i p P Poisson ( N p | Λ p ) P Gaussian ( ϵ p | σ p , N p ) d N p . \begin{equation} P\left(i_{p}|\Lambda_{p},\sigma_{p}\right) \!=\! \int_{0}^{i_{p}}\!P_{\textrm{Poisson}}\!\left(N_{p}|\Lambda_{p}\right)\!P_{\textrm{Gaussian}}\!\left(\epsilon_{p}|\sigma_{p}, N_{p}\right)\!{\rm d}N_{p}. \end{equation}(11)The combined noise distribution is a convolution of the Poisson noise and Gaussian noise distributions, so it is convenient to derive the moments of our noise model via the moment-generating function of the probability distribution. The moment-generating function for the combined noise model probability density P i p ( | Λ p , σ p ) \hbox{$P\left(i_{p}|\Lambda_{p},\sigma_{p}\right)$} is proportional to the product of the moment-generating functions MPoisson and MGaussian of the individual distributions: M ( ν ) = M Poisson ( ν ) M Gaussian ( ν ) = exp [ σ p 2 ν 2 2 ] exp [ Λ p + Λ p e ν ] . \begin{equation} M\left(\nu\right) \!=\! M_{\textrm{Poisson}}\left(\nu\right)M_{\textrm{Gaussian}}\left(\nu\right) \!=\! \exp\left[\frac{\sigma_{p}^{2}\nu^{2}}{2}\right]\exp\left[-\Lambda_{p}+\Lambda_{p} \mathrm{e}^{\nu}\right]. \end{equation}(12)The first four moments of the noise model as required to derive the results given in this paper are: i p = d d ν M ( ν ) | ν = 0 = Λ p , i p 2 = d 2 d ν 2 M ( ν ) | ν = 0 = Λ p 2 + Λ p + σ p 2 , i p 3 = d 3 d ν 3 M ( ν ) | ν = 0 = Λ p 3 + 3 Λ p 2 + Λ p + 3 Λ p σ p 2 , i p 4 = d 4 d ν 4 M ( ν ) | ν = 0 = Λ p 4 + 6 Λ p 3 + 7 Λ p 2 + Λ p + 6 Λ p 2 σ p 2 + 6 Λ p σ p 2 + 3 σ p 4 , \begin{eqnarray} \label{eqn:noise_model_1}\langle i_{p} \rangle = \frac{{\rm d}}{{\rm d}\nu}M\left(\nu \right)\Big\lvert_{\nu=0} &=& \Lambda_{p}, \\ \label{eqn:noise_model_2} \langle i_{p}^{2} \rangle =\frac{{\rm d}^2}{{\rm d}\nu^2}M\left(\nu \right)\Big\lvert_{\nu=0} &=& \Lambda_{p}^{2} + \Lambda_{p} +\sigma_{p}^{2},\\ \label{eqn:noise_model_3} \langle i_{p}^{3} \rangle = \frac{{\rm d}^3}{{\rm d}\nu^3}M\left(\nu\right)\Big\lvert_{\nu=0} &=& \Lambda_{p}^{3} +3\Lambda_{p}^{2} + \Lambda_{p} +3\Lambda_{p}\sigma_{p}^{2},\\ \langle i_{p}^{4} \rangle = \frac{{\rm d}^4}{{\rm d}\nu^4}M\left(\nu \right)\Big\lvert_{\nu=0} &=& \Lambda_{p}^{4} + 6\Lambda_{p}^{3} + 7\Lambda_{p}^{2} + \Lambda_{p} + 6\Lambda_{p}^{2}\sigma_{p}^{2} \nonumber\\ \label{eqn:noise_model_4} & +&6\Lambda_{p}\sigma_{p}^{2} + 3\sigma_{p}^{4}, \end{eqnarray}where  ⟨ ... ⟩ denotes taking the expectation over the probability distribution of the detection noise. For simplicity, only the statistics of the measured data for fixed  {Λp} is considered at first; averaging over multiple interferograms will be considered later. As such the angle brackets do not denote taking an average over the distribution of possible values of  {Λp}, due to, for example, atmospheric distortions to the interferograms.

An important assumption of the following analysis is that the detection noise on different pixels is statistically independent so that for example  ⟨ ip1ip2 ⟩  =  ⟨ ip1 ⟩  ⟨ ip2 ⟩  for p1 ≠ p2. It should be noted that this assumption will not be true in all cases of practical interest. For example when temporally-sampling an interferogram with a near-IR detector, it is common not to reset the charge accumulation capacitors at each pixel between successive time samples, so that a flux measurement ip is derived from the difference between the voltage readings on two successive samples; in this case the final reading for one sample represents the initial reading for the next sample and so the noise on the two samples is correlated. Treating such readout schemes is beyond the scope of the current paper.

4. Bias-free power spectrum estimator

We seek an estimator S 0 ij \hbox{$S_0^{ij}$} for the noise-free power spectrum Sij that does not suffer from statistical bias introduced by the noise on the interferogram. A bias-free estimator was calculated by Tatulli & Duvert (2007) in the presence of photon noise and read noise, but we present an alternative derivation here to demonstrate our analysis method. For the purposes of evaluating the power spectrum, we drop all telescope-pair-dependent superscripts, for example writing H p ij \hbox{$H^{ij}_p$} as Hp.

We start by evaluating the expectation value of the biased estimator |c|2. Combining Eqs. (6) and (7) we get: | c | 2 = p 1 p 2 i p 1 i p 2 H p 1 H p 2 . \begin{equation} \left\langle \left|c\right|^{2}\right\rangle=\left\langle\sum_{p1}\sum_{p2}i_{p1}i_{p2}H_{p1}H^{*}_{p2}\right\rangle.\label{eqn:powerspec} \end{equation}(17)We split the sum in Eq. (17) for the cases where p1 = p2(≡ p) and for p1 ≠ p2 which gives: | c | 2 = p i p 2 | H p | 2 + p 1 p 2 i p 1 i p 2 H p 1 H p 2 , \begin{equation} \left\langle \left|c\right|^{2}\right\rangle = \sum_{p}\left\langle i_{p}^{2}\right\rangle\left|H_{p}\right|^{2} + \underset{\ p1\ \neq\ p2}{\sum\sum}\left\langle i_{p1}\right\rangle\left\langle i_{p2}\right\rangle H_{p1}H^{*}_{p2}, \end{equation}(18)noting that  ⟨ ip1ip2 ⟩  =  ⟨ ip1 ⟩  ⟨ ip2 ⟩  for p1 ≠ p2. Substitution of the moments of the noise model given in Eqs. (13) and (14) leads to: | c | 2 = p ( Λ p 2 + Λ p + σ p 2 ) | H p | 2 + p 1 p 2 Λ p 1 Λ p 2 H p 1 H p 2 . \begin{equation} \left\langle \left|c\right|^{2}\right\rangle= \sum_{p} \left( \Lambda_{p}^{2} + \Lambda_{p} + \sigma_{p}^{2}\right) \left|H_{p}\right|^{2} + \underset{\ p1\ \neq\ p2}{\sum\sum}\Lambda_{p1}\Lambda_{p2} H_{p1}H^{*}_{p2}.\label{eqn:powerspec_split} \end{equation}(19)The noise-free power spectrum, S, can be expressed in terms of the noise-free interferogram, Λp, as: S = | C | 2 = p 1 p 2 Λ p 1 Λ p 2 H p 1 H p 2 , \begin{equation} S = \left|C\right|^{2} =\sum_{p1}\sum_{p2} \Lambda_{p1}\Lambda_{p2}H_{p1}H^{*}_{p2},\label{eqn:ideal_powerspec} \end{equation}(20)this can be split into terms equivalent to those in Eq. (19): S = p Λ p 2 | H p | 2 + p 1 p 2 Λ p 1 Λ p 2 H p 1 H p 2 , \begin{equation} S = \sum_{p} \Lambda_{p}^{2} \left|H_{p}\right|^{2}+\underset{\ p1\ \neq\ p2}{\sum\sum}\Lambda_{p1}\Lambda_{p2} H_{p1}H^{*}_{p2},\label{eqn:ideal_powerspec_split} \end{equation}(21)which can be substituted into Eq. (19) to give: | c | 2 = S + p Λ p | H p | 2 , \begin{equation} \left\langle \left|c\right|^{2}\right\rangle= S + \sum_{p} \Lambda_{p}\left|H_{p}\right|^{2}, \end{equation}(22)and written in terms of the noisy interferograms we see: S = | c | 2 p ( i p + σ p 2 ) | H p | 2 . \begin{equation} S = \left\langle \left|c\right|^{2} \right\rangle - \left\langle\sum_{p} \left( i_{p} + \sigma_{p}^{2}\right) \left|H_{p}\right|^{2}\right\rangle. \end{equation}(23)The bias-free power spectrum estimator, S0, is then: S 0 = | c | 2 p ( i p + σ p 2 ) | H p | 2 . \begin{equation} S_{0} = \left|c\right|^{2} -\sum_{p} \left( i_{p} + \sigma_{p}^{2}\right) \left|H_{p}\right|^{2}.\label{eqn:power_spec_est} \end{equation}(24)It needs to be remembered that S0 is intended to be evaluated on a per-interferogram, i.e. frame-by-frame basis as atmospheric turbulence changes the shape of the interferogram between frames. It is straightforward to show that when this estimator is averaged over a number of frames that the average is itself unbiased. This is because taking the average over frames commutes with taking the expectation over the detection statistics, so that: 1 N f f S 0 = 1 N f f S 0 = 1 N f f S \begin{equation} \label{eq:1} \left\langle \frac1{N_{f}}\sum_{f}{S_0}\right\rangle= \frac1{N_{f}}\sum_{f}\left\langle{S_0}\right\rangle= \frac1{N_{f}}\sum_{f}S \end{equation}(25)where Nf denotes the number of frames over which the average is taken. This result depends on the assumption that the detection noise is dependent only on the value of the flux Λp in each pixel and is statistically independent of the particular shape of the interferogram and hence of the atmospheric distortions affecting a particular interferogram.

For the case where read noise is negligible (σp = 0e) and H is the DFT (|Hp|2 = |e2πiαpf|2 = 1), Eq. (24) reduces to the estimator reported by Goodman & Belsher (1976a): S 1 = | c | 2 N, \begin{equation} S_{1} = \left|c\right|^{2} - N, \end{equation}(26)where N =  ∑ pip, is the total number of photons in an interferogram. For high light levels (negligible photon noise and read noise), the |c|2 term dominates (as this goes as N2), and we reduce our estimator further to the uncorrected estimator (correct only under zero noise conditions): S 2 = | c | 2 . \begin{equation} S_{2} = \left|c\right|^{2}. \end{equation}(27)

5. Bias-free power spectrum variance

The derivation for the power spectrum variance requires the first four moments of the noise model, Eqs. (13) to (16), and stretches to multiple pages due to the |cij|4 term. The derivation follows the methodology presented above and in the derivation of the bias-free bispectrum estimator given in Appendix A but is not presented in this paper. For readability, the formulation of the power spectrum variance, var S 0 ( ) \hbox{$\mathrm{var}\left(S_{0}\right)$}, is split into terms that dominate in the case where the noise is dominated by photon noise, read noise or where both noise sources are significant: var ( S 0 ) = ( | c | 2 p ( i p + σ p 2 ) | H p | 2 ) 2 ( S ) 2 \begin{eqnarray} \mathrm{var}\left(S_{0}\right) = \left\langle\left( \left|c\right|^{2} -\sum_{p} \left( i_{p} + \sigma_{p}^{2}\right) \left|H_{p}\right|^{2}\right)^{2} \right\rangle - \left( S \right)^{2}\notag \end{eqnarray}Photon noise terms: = 2 | C | 2 p Λ p | H p | 2 + CC p Λ p H p H p + C C p Λ p H p H p + [ p Λ p | H p | 2 ] 2 + p Λ p H p H p p Λ p H p H p \begin{eqnarray} &=&2\left|C\right|^{2}\sum_{p}\Lambda_{p}\left|H_{p}\right|^{2}+ CC\sum_{p}\Lambda_{p} H_{p}^{*}H_{p}^{*} + C^{*}C^{*}\sum_{p}\Lambda_{p} H_{p}H_{p}\nonumber\\ &+& \left[\sum_{p}\Lambda_{p}\left|H_{p}\right|^{2}\right]^{2} + \sum_{p}\Lambda_{p} H_{p}H_{p}\sum_{p}\Lambda_{p} H_{p}^{*}H_{p}^{*}\notag \end{eqnarray}Read noise terms: + p σ p 2 | H p | 4 + [ p σ p 2 | H p | 2 ] 2 + p σ p 2 H p H p p σ p 2 H p H p \begin{eqnarray} + \sum_{p}\sigma_{p}^{2}\left|H_{p}\right|^{4}+ \left[ \sum_{p}\sigma_{p}^{2} \left|H_{p}\right|^{2} \right]^{2}+ \sum_{p}\sigma_{p}^{2}H_{p}H_{p}\sum_{p}\sigma_{p}^{2}H_{p}^{*}H_{p}^{*} \notag \end{eqnarray}Coupled terms: + 2 p Λ p | H p | 2 p σ p 2 | H p | 2 + p Λ p H p H p p σ p 2 H p H p + p Λ p H p H p p σ p 2 H p H p + 2 | C | 2 p σ p 2 | H p | 2 + CC p σ p 2 H p H p + C C p σ p 2 H p H p 2 C p σ p 2 | H p | 2 H p 2 C p σ p 2 | H p | 2 H p . \begin{eqnarray} &+& 2 \sum_{p}\Lambda_{p} \left|H_{p}\right|^{2}\sum_{p}\sigma_{p}^{2}\left|H_{p}\right|^{2} +\sum_{p}\Lambda_{p} H_{p}H_{p}\sum_{p}\sigma_{p}^{2}H_{p}^{*}H_{p}^{*}\nonumber\\ &+&\sum_{p}\Lambda_{p}H_{p}^{*}H_{p}^{*}\sum_{p}\sigma_{p}^{2}H_{p}H_{p} + 2\left|C\right|^{2}\sum_{p}\sigma_{p}^{2}\left|H_{p}\right|^{2}\nonumber\\ &+&CC\sum_{p}\sigma_{p}^{2}H_{p}^{*}H_{p}^{*}+ C^{*}C^{*}\sum_{p}\sigma_{p}^{2}H_{p}H_{p}- 2 C\sum_{p}\sigma_{p}^{2}\left|H_{p}\right|^{2}H_{p}^{*}\nonumber\\ \label{eqn:powspecvar} &-& 2C^{*}\sum_{p}\sigma_{p}^{2}\left|H_{p}\right|^{2}H_{p}.\nonumber \notag \end{eqnarray}

For comparison with previous work we rewrite Eq. (28) assuming DFT conditions, that read noise is constant across all pixels ( p σ p 2 = N pix σ 2 \hbox{$\sum_{p}\sigma^{2}_{p}=N_{\mathrm{pix}}\sigma^{2}$}), and that atmospheric phase fluctuations are random and as such the cross-product of independent complex quantities average to zero: var ( S DFT ) = Photon noise terms 2 N | C | 2 + N 2 + p Λ p e 4 π i α p f ij p Λ p e 4 π i α p f ij 􏽺 0 􏽽􏽼 0 􏽻 + N pix σ 2 + N pix 2 σ 4 􏽺 0 􏽽􏽼 0 􏽻 Read noise terms + Coupled terms 2 | C | 2 N pix σ 2 + 2 N N pix σ 2 􏽺 0 􏽽􏽼 0 􏽻 . \begin{eqnarray} \mathrm{var}\left(S_{\textrm{DFT}}\right)&=&\overbrace{2N\left|C\right|^{2} + N^{2} + \sum_{p}\Lambda_{p} \mathrm{e}^{4\pi{\rm i}\alpha_{p}f^{ij}}\sum_{p}\Lambda_{p} \mathrm{e}^{-4\pi{\rm i}\alpha_{p}f^{ij}}}^{\mathrm{Photon\ noise\ terms}}\nonumber\\ &+& \overbrace{ N_{\mathrm{pix}}\sigma^{2}+ N_{\mathrm{pix}}^{2}\sigma^{4}}^{\mathrm{Read\ noise\ terms}}\nonumber\\ \label{eqn:powspecvar2} &+& \overbrace{2\left|C\right|^{2}N_{\mathrm{pix}}\sigma^{2} + 2NN_{\mathrm{pix}}\sigma^{2}}^{\mathrm{Coupled\ terms}}. \end{eqnarray}(28)Separate analysis of the photon and detector noise terms, as is often done (see e.g. Perrin & Ridgway 2005; Le Bouquin 2005), means that the Gaussian-Poisson coupled terms are missed. Inspection of Eq. (28) and Fig. 1 show the importance of including these coupled terms for a wide range of photon fluxes, and therefore of incorporating both the photon noise and read noise into a single noise model. In the above example, for  ~300 source photons on the detector, the variance on the powerspectrum is underestimated by a factor of over 6 if cross-terms are excluded from the variance calculation.

thumbnail Fig. 1

The relative importance of the photon, read (σp = 3e) and coupled noise terms under DFT conditions for the power spectrum variance with respect to the number of source photons detected. The area highlighted in grey is the region where the coupled terms dominate the power spectrum variance.

6. Bias-free bispectrum estimator

The bias-free bispectrum estimator is calculated in the same manner as the bias-free power spectrum estimator. The definition of the bispectrum presented in Eq. (8) expands to: B ijk = C ij C jk C ki = p 1 p 2 p 3 Λ p 1 Λ p 2 Λ p 3 H p 1 ij H p 2 jk H p 3 ki . \begin{equation} B^{ijk} = C^{ij}C^{jk}C^{ki} = \sum_{p1}\sum_{p2}\sum_{p3} \Lambda_{p1}\Lambda_{p2}\Lambda_{p3} H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}.\label{eqn:bispec_expanded} \end{equation}(29)Splitting the sum in Eq. (29) gives 5 terms rather than 2 as for the power spectrum estimator. As such the derivation is quite lengthy and is given in Appendix A. We present here the bias-free bispectrum estimator, B 0 ijk \hbox{$B^{ijk}_{0}$}: B 0 ijk = c ij c jk c ki c ij p ( i p + σ p 2 ) H p jk H p ki c jk p ( i p + σ p 2 ) H p ij H p ki c ki p ( i p + σ p 2 ) H p ij H p jk + p ( 2 i p + 3 σ p 2 ) H p ij H p jk H p ki . \begin{eqnarray} \label{eqn:bias_free_bispec} B^{ijk}_{0}&=& c^{ij}c^{jk}c^{ki}\nonumber\\ &-& c^{ij}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{jk}_{p}H^{ki}_{p} \nonumber\\ &-&\ c^{jk}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{ki}_{p} \nonumber\\ &-& c^{ki}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{jk}_{p} \nonumber\\ &+& \sum_{p}\left(2 i_{p} +3\sigma^{2}_{p}\right)H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}. \end{eqnarray}(30)This equation has been verified though computational simulations to the 0.1% level. In Sects. 7 and 8 we present the results of these simulations regarding the closure phase, derived from the bias-free bispectrum estimator in Eq. (30).

If we assume DFT conditions so that H p ij = e 2 π i α p f ij \hbox{$H_p^{ij}=\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ij}}$} and an all-in-one interference pattern such that fij + fjk + fki = 0, then we have H p jk H p ki = H p ji \hbox{$H^{jk}_{p}H^{ki}_{p}=H_{p}^{ji}$} where we note H p ji = H p ij ) ( \hbox{$H_{p}^{ji} = \left(H_{p}^{ij}\right)^{*}$} and H p ij H p jk H p ki = 1 \hbox{$H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}=1$}.

If we also assume the read noise σp = σ is constant across all pixels rather than varying on a pixel by pixel basis then: p σ 2 e 2 π i α p f ji = 0. \begin{equation} \sum_{p}\sigma^{2}\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ji}}=0. \label{eqn:const_sigma_dft} \end{equation}(31)The fact that the third order moment of the Gaussian distribution is zero would suggest that there should be no read noise terms under DFT conditions for constant σ, however it must be remembered that there are second order moments in the bias correction terms which will contribute to the bias under these conditions.

If we further assume that read noise is negligible (σp = 0), then B 0 ijk \hbox{$B^{ijk}_{0}$} reduces to the estimator presented by Wirnitzer (1985) accounting for the bias introduced by photon noise alone: B 1 ijk = c ij c jk c ki | c ij | 2 | c jk | 2 | c ki | 2 + 2 N, \begin{eqnarray} \label{eqn:hist_biaspec} B^{ijk}_{1} &=& c^{ij}c^{jk}c^{ki} \nonumber\\ &-&\left|c^{ij}\right|^{2} -\left|c^{jk}\right|^{2} -\left|c^{ki}\right|^{2} +2 N, \end{eqnarray}(32)where N =  ∑ pip is the mean number of photons in the interferogram.

For high light levels (negligible photon noise and read noise), the cijcjkcki term dominates (as this goes as N3), and we reduce our estimator further to the uncorrected estimator (correct only under zero noise conditions): B 2 ijk = c ij c jk c ki . \begin{equation} B^{ijk}_{2} = c^{ij}c^{jk}c^{ki}. \end{equation}(33)The bispectrum variance is not presented in this paper as the |cijcjkcki|2 term leads to a long and tedious derivation. The zealous reader is invited to calculate the expression for the bispectrum variance if required using the method presented for calculating the bispectrum in Appendix A.

thumbnail Fig. 2

The transform matrix, W under non-DFT conditions. The DC term shows the beam profile adopted, and the cosine (solid lines) and sine (dotted lines) components of the fringe pattern for each baseline show the deviation from integer number of fringe periods across the detector (with frequencies in the ratio 1.1 ×  {1:2:3}). This is similar (but with the addition of a DC term) to the visibility to pixel matrix introduced by Le Bouquin & Tatulli (2006).

The remainder of this paper will be concerned with demonstrating the magnitude of the bias introduced when the wrong estimator is used in the presence of photon and read noise at low light levels.

7. Closure phase bias

To illustrate the magnitude of the systematic bias in a realistic observation we present the closure phase data from a set of simple simulations. We created ideal interferograms resulting from a three telescope interferometer observing a target with a known and controllable closure phase. Photon noise and read noise were added, and the bispectrum (and hence closure phase) extracted using each of the three estimators presented above. The interferograms were created under either DFT or non-DFT conditions. The fringe frequency on the detector for each baseline was set at a ratio  {1:2:3} for the DFT case, and 1.1 ×  {1:2:3} for the non-DFT conditions. To simulate non-DFT conditions we rewrite Eq. (1) as:

i p = b p C 00 + b p i < j N tel Re [ C ij e i 2 π α p f ij ] + noise , \begin{equation} i_{p} = b_{p}C^{00} + b_{p}\sum_{i < j}^{N_{\rm tel}} \mathrm{Re}\left[ C^{ij}{\rm e}^{{\rm i}2\pi \alpha_{p} f^{ij}}\right]+{\rm noise},\label{eq:simple_more_complex} \end{equation}(34)where the beam profile, bp is either flat: bp = 1/Npix for DFT conditions (as in Eq. (1)), or a cropped Gaussian of the form: b p = exp [ ( p μ ) 2 2 ς 2 ] p N pix exp [ ( p μ ) 2 2 ς 2 ] , \begin{equation} b_{p}=\frac{\exp\left[-\frac{\left(p-\mu\right)^{2}}{2\varsigma^{2}}\right]} {\sum_{p'}^{N_{\rm pix}}\exp\left[-\frac{\left(p'-\mu\right)^{2}}{2\varsigma^{2}}\right]}, \end{equation}(35)where ς = 22 pixels and μ is centred on the 64 pixel detector. From Eq. (34) we can construct a V2PM, W, which is correct under DFT and non-DFT conditions. The form of W is shown in Fig. 2 for the non-DFT conditions described. We use the Singular Value Decomposition method to find the optimum (in terms of least squares) pseudo-inverse of W, which gives us H. An instrumental visibility loss was introduced such that the maximum visibility on a baseline is |Vi| = 1/3. The results presented in Fig. 3 are calculated using the ideal interferograms and therefore show only the closure phase bias and suffer no variance contribution. These are given by: B 2 ijk = B ijk + p Λ p e 2 π i α p f ij p ( Λ + σ 2 ) e 2 π i α p f ji + p Λ p e 2 π i α p f jk p ( Λ + σ 2 ) e 2 π i α p f kj + p Λ p e 2 π i α p f ki p ( Λ + σ 2 ) e 2 π i α p f ik + N, \begin{eqnarray} \label{eq:ideal_cp_2} B^{ijk}_{2} &=& B^{ijk} + \sum_{p}\Lambda_{p}\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ij}}\sum_{p}\left(\Lambda + \sigma^{2}\right)\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ji}}\nonumber\\ &+& \sum_{p}\Lambda_{p}\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{jk}}\sum_{p}\left(\Lambda + \sigma^{2}\right)\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{kj}}\nonumber\\ &+& \sum_{p}\Lambda_{p}\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ki}}\sum_{p}\left(\Lambda + \sigma^{2}\right)\mathrm{e}^{2\pi{\rm i}\alpha_{p}f^{ik}} + N, \end{eqnarray}(36)and: B 1 ijk = B 2 ijk S ij S jk S ki N 3 N pix σ 2 , \begin{equation} B^{ijk}_{1} = B^{ijk}_{2} - S^{ij} - S^{jk} - S^{ki} - N - 3N_{\rm pix}\sigma^{2},\label{eq:ideal_cp_1} \end{equation}(37)whilst the bias free estimator is simply: B 0 ijk = B ijk . \begin{equation} B^{ijk}_{0} = B^{ijk}. \end{equation}(38)We note the order of the indices in the DFT terms in Eqs. (36) and (37) and that the ideal source bispectrum and power spectrum are independent of the measurement system (weather or not if meets the DFT conditions), whereas the bias terms implicitly assume DFT conditions for the B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} estimators. Figure 3 shows a simulation of the bias present on the closure phase for the three estimators for simulations of 120 and 300 photons per interferogram reaching the detector.

thumbnail Fig. 3

The bias in closure phase for B 0 ijk \hbox{$B^{ijk}_{0}$} (solid line), B 1 ijk \hbox{$B^{ijk}_{1}$} (dashed line), and B 2 ijk \hbox{$B^{ijk}_{2}$} (dotted line) estimators under a number of measurement conditions. Plots a) and b) show DFT conditions, whilst plots c) and d) show non-DFT conditions (described in the text). Read noise of σp = 3e is present in plots b) and d). The thin and think lines show 120 and 300 photons reaching the detector respectively. The shaded regions in plots c) and d) show the range of possible biases resulting from different possible combinations of phases on the three baselines contributing to the source closure phase.

7.1. Zero read noise

In the situation where photon noise dominates the noise model (σp = 0e) and the instrument is setup such that the interferograms are formed under DFT conditions, B 0 ijk \hbox{$B^{ijk}_{0}$} and B 1 ijk \hbox{$B^{ijk}_{1}$} both return the correct closure phase, however B 2 ijk \hbox{$B^{ijk}_{2}$} is affected by a bias term resulting from photon noise. At these light levels, the bias in B 2 ijk \hbox{$B^{ijk}_{2}$} becomes significant for the majority of source closure phases.

7.2. Non-zero read noise

At low light levels the read noise on the detector becomes important and comparable to the photon noise. Figure 3b shows simulations under DFT conditions, as in Fig. 3a, but with σp = 3e read noise included. This level of read noise is quite low compared to many of the optical detectors in use today. We see that failing to account for read noise introduces a significant bias in closure phase estimates for both B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$}. It is clear that the magnitude of the bias term is strongly dependent on the source brightness, and that B 2 ijk \hbox{$B^{ijk}_{2}$} is preferable to B 1 ijk \hbox{$B^{ijk}_{1}$} under certain combinations of noise and photon flux.

7.3. Non-DFT conditions

Deviation from DFT conditions changes the way statistical bias affects the bispectrum. Inspection of Eq. (30) shows that the statistical bias can affect both the real and imaginary parts of the bispectrum when H is not the DFT. If the measurement system conforms to DFT conditions, and the read noise is negligible or constant across all pixels, the bias only affects the real part of the bispectrum estimators as shown when applying Eq. (31) to (30). It is also apparent that the magnitude of the bias is no longer dependent only on the value of the closure phase, but on the value of the individual phase of each baseline. The non-uniform beam profile causes an overlap of the fringe components in frequency space, and the non-integral fringe frequency on the detector invalidates the assumption that H p jk H p ki = H p ji \hbox{$H^{jk}_{p}H^{ki}_{p}=H_{p}^{ji}$} and H p ij H p jk H p ki = 1 \hbox{$H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}=1$} which led to the formulation of B 1 ijk \hbox{$B^{ijk}_{1}$}. Figure 3c is provided as an example of the impact of non-DFT conditions on the closure phase bias for a photon noise dominated instrument (σp = 0). Note that the non-DFT conditions change the bias terms, meaning B 1 ijk \hbox{$B^{ijk}_{1}$} is now biased when it was not under DFT conditions. Figure 3d shows the effect of non-DFT conditions on a system with photon and read noise and again there is an alteration of the bias terms which B 0 ijk \hbox{$B^{ijk}_{0}$} corrects but the other estimators do not. The grey regions on the non-DFT plots show the range of biases associated with each estimator depending on the individual phases of the triplet producing a given closure phase. The lines within the shaded regions represent the expectation of the closure phase bias over all possible individual phases.

We see that the bias on the closure phase does not cancel when accumulating frames; specifically, that there is no advantage to observing in phase-unstable conditions over phase-stable conditions as a means of reducing bias. Therefore, bias-wise, one should always favor fringe-tracker assisted observations whereby the longer discrete integration times result in higher SNR, even in the photon-rich regime where fringe tracking is not strictly necessary.

7.4. Closure phase bias and statistical uncertainty

The bias is only significant at low light levels, and tends to zero as the light level increases. At low light levels the variance is also high, and we must determine the relative importance of these two processes. We present Fig. 4 which shows the relative importance of bias, Φbias, and closure phase standard deviation, ςΦ, using different estimators.

We show the number of frames, Nf, such that: Φ bias = ς Φ N f , \begin{equation} \Phi^{\mathrm{bias}} = \frac{\varsigma_{\Phi}}{\sqrt{N_{f}}}, \end{equation}(39)which is the number of interferograms required to reduce the magnitude of the statistical uncertainty to that of the bias. The closure phase standard deviation per frame, ςΦ, is given by: ς Φ = 1 | B ijk | × var [ Im ( B f ijk × B ijk | B ijk | ) ] , \begin{equation} \label{eq:cp_std_dev} \varsigma_{\Phi} = \frac{1}{\left|B^{ijk}\right|}\times\sqrt{\mathrm{var}\left[\mathrm{Im}\left(B^{ijk}_{f}\times\frac{B^{ijk*}}{\left|B^{ijk}\right|}\right)\right]}, \end{equation}(40)where B f ijk \hbox{$B^{ijk}_{f}$} is the bispectrum calculated per frame using the chosen estimator and Im ( B f ijk × B ijk | B ijk | ) \hbox{$\mathrm{Im}\left(B^{ijk}_{f}\times\frac{B^{ijk*}}{\left|B^{ijk}\right|}\right)$} gives the projection of B f ijk \hbox{$B^{ijk}_{f}$} on a unit vector perpendicular to Bijk, and. This method is used because it is insensitive to wrapping errors in the complex plane when the variance is large. In this example, the magnitude of the bias and variance are measured when the source closure phase is at 90°. The values presented in Fig. 4 were derived using numerical simulations to the 0.1% level, and the bias contribution was verified analytically using Eqs. (36) and (37).

It can be seen in Fig. 4 that the bias is important for realistic numbers of frames (e.g. 103 frames or 25 s integration for 25 ms exposures) compared to the uncertainty for a wide range of typical light levels.

thumbnail Fig. 4

The number of integrated frames required to reduce the statistical uncertainty to the level of the bias for the B 1 ijk \hbox{$B^{ijk}_{1}$} (dashed) and B 2 ijk \hbox{$B^{ijk}_{2}$} (dotted) estimators.

8. Simulation: detection of faint companions

Table 1

Simulation parameters.

Closure phase nulling is an observational techniques that allows the detection of faint companions (Chelli et al. 2009). The technique has been demonstrated by Duvert et al. (2010), detecting a 5 mag fainter companion using AMBER/VLTI around HD 59717.

Closure phase nulling uses the fact that the closure phase jumps from 0 to 180 degrees as the visibility function of the primary in the system passes through a null. The presence of a companion becomes evident as a deviation from an instantaneous phase jump. The profile of the phase jump gives information about the projected separation and flux ratio of the system components. We show that statistical bias in the closure phase leads directly to incorrect estimation of the system properties.

We follow Chelli et al. (2009) and simulate a binary system where the primary is described by a uniform disc of radius, R, with a point source companion at a separation, s, with flux ratio, r. For simplicity we simulate a three telescope interferometer such that the position angle of the binary system is parallel to the frequency axis of the interferometer. The interferometer samples three frequencies  {u12,u23,u13}  such that u13 = u12 + u23. The instrumental parameters are given in Table 1 and we assume the instrument is functioning under DFT conditions.

The complex visibility, Cij(u), at a given frequency, u, is described by: C ij ( u ) = V ( u ) + r e i 2 πus 1 + r , \begin{equation} C^{ij}(u) = \frac{V_{\star}(u) + r \mathrm{e}^{{\rm i}2\pi us}}{1+r},\label{eqn:comp_vis_binary} \end{equation}(41)which is the visibility function, V(u), of the primary: V ( u ) = 2 J 1 ( 2 πu R ) 2 πu R , \begin{equation} V_{\star}(u) = 2\frac{J_{1}(2\pi u R_{\star})}{2\pi u R_{\star}}, \end{equation}(42)plus the contribution of the secondary with a phase modulation related to the projected separation, s, and flux ratio, r. J1 is the first order Bessel function and R is the radius of the primary.

thumbnail Fig. 5

Demonstration of the effect of bias on a closure phase measurement. Plot a): the closure phase trace resulting from a binary pair consisting of a resolved primary of radius R and an unresolved secondary with flux ratio r = 0.01 for separations (s = 10 R,100 R,1000 R). Plot b): the closure phase trace for the 100 R system recovered by the three estimators. B 0 ijk \hbox{$B^{ijk}_{0}$} correctly returns the ideal closure phase, whilst B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} do not. Both plots show the single source closure phase signal (thin solid line) for reference. Plots c) and d) show the closure phase calculated using B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} respectively as in plot b) along with the statistical uncertainty on the measurement after 103, 104 and 105 frames.

The ideal bispectrum is extracted from Eq. (41) using Eq. (8) and the closure phase is taken as the argument of the averaged bispectrum. The closure phase signal around the first null in the visibility profile is shown in Fig. 2 of Chelli et al. (2009) and reproduced using our simulation model in Fig. 5a in this paper for a number of companion separations (s = 10R,100 R,1000 R). The closure phase is wrapped modulo 2π and the frequencies  {u12,u23,u13}  are set to the ratio  {1:2:3} on the detector and scaled by the radius of the primary, R. For simplicity, the baseline to fringe-frequency mapping is chosen such that it preserves the ratios of the two. The specifics of the mapping is not expected to have a significant impact on the bias. The closure phase of the primary alone (single source) is shown for comparison.

We present in Fig. 5b a simulation of the closure phase from a 100 R separation binary with flux ratio 0.01. B 0 ijk \hbox{$B^{ijk}_{0}$} returns the correct closure phase signal in agreement with the ideal case presented by Chelli et al. (2009). B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} are strongly affected by bias terms, meaning the closure phase is significantly different. Depending which regions of the closure phase trace are sampled by the interferometer, B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} would return an incorrect value for the primary radius and companion flux ratio. Figures 5c and d show the closure phase calculated using B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} respectively and the statistical uncertainty on the measurement after 103, 104 and 105 frames calculated using Eq. (40). We see that even for very short total integration times (103 frames is under a minute using typical discrete integration times) the bias still produces a significant contribution.

9. Conclusions

Interferometry offers exceptional angular resolution at the expense of sensitivity, as the collecting area is, by necessity, smaller than that of a single dish with the same resolution. In ground-based optical interferometry these sensitivity limitations are accentuated by the need for short observations to reduce the phase perturbations introduced by the atmosphere. The result is that noise levels on individual observation frames are often high, exacerbated by the need for fast read-out of the detector. Without understanding the statistical properties of the noise, bias terms are introduced when extracting information from the interferogram using the non-linear power spectrum and bispectrum constructs. We have shown in this paper that:

  • 1.

    Neglecting the cross terms that arise in the presence of both readnoise and photon noise leads to bias on the extraction ofinterferometric observables resulting from the statistics of thenoise.

  • 2.

    The statistical bias on the observables results in incorrect phisical parameters being extracted from observation data, for example the flux ratio of binary pairs or the diameter of a target as shown for the case of closure phase nulling.

To remedy this we have presented bias-free estimators for both the power spectrum and bispectrum (and hence the closure phase) which can be used in the presence of any combination of photon and read noise under any fringe encoding scheme in which the measurement equation is linear. Our bias-free estimators are presented in a format allowing easy implementation into current data reduction pipelines.

Acknowledgments

We thank the anonymous referee for constructive comments and insiteful suggestions that have undoubtedly improved this paper.

References

  1. Ayers, G. R., Northcott, M. J., & Dainty, J. C. 1988, J. Opt. Soc. Am. A, 5, 963 [NASA ADS] [CrossRef] [Google Scholar]
  2. Beletic, J. W. 1989, Opt. Comm., 71, 337 [NASA ADS] [CrossRef] [Google Scholar]
  3. Chelli, A., Duvert, G., Malbet, F., & Kern, P. 2009, A&A, 498, 321 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Dainty, J. C., & Greenaway, A. H. 1979, J. Opt. Soc. Am., 69, 786 [NASA ADS] [CrossRef] [Google Scholar]
  5. Duvert, G., Chelli, A., Malbet, F., & Kern, P. 2010, A&A, 509, A7 [Google Scholar]
  6. Goodman, J. W., & Belsher, J. F. 1976a, Stanford Univ. CA Information Systems Lab [Google Scholar]
  7. Goodman, J. W., & Belsher, J. F. 1976b, Stanford Univ. CA Information Systems Lab [Google Scholar]
  8. Gordon, J., Buscher, D., & Thorsteinsson, H. 2010, in Optical and Infrared Interferometry II, ed. W. C. Danchi, F. Delplancke, & J. K. Rajagopal, Proc. SPIE, 7734, 77344A–7 [Google Scholar]
  9. Le Bouquin, J.-B. 2005, Imagerie par synthèse d’ouverture optique, application aux étoiles chimiquement particulières, PhD Thesis, Univ. Grenoble I [Google Scholar]
  10. Le Bouquin, J.-B., & Tatulli, E. 2006, MNRAS, 372, 639 [NASA ADS] [CrossRef] [Google Scholar]
  11. Perrin, G. 2003, A&A, 398, 385 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. Perrin, G., & Ridgway, S. T. 2005, ApJ, 626, 1138 [NASA ADS] [CrossRef] [Google Scholar]
  13. Roddier, F. 1987, J. Opt. Soc. Am. A, 4, 1396 [NASA ADS] [CrossRef] [Google Scholar]
  14. Sibille, F., Chelli, A., & Lena, P. 1979, A&A, 79, 315 [NASA ADS] [Google Scholar]
  15. Tatulli, E., & Duvert, G. 2007, New Astron. Rev., 51, 682 [NASA ADS] [CrossRef] [Google Scholar]
  16. Tatulli, E., & Le Bouquin, J.-B. 2006, MNRAS, 368, 1159 [NASA ADS] [CrossRef] [Google Scholar]
  17. Thorsteinsson, H., & Buscher, D. F. 2004, in New Frontiers in Stellar Interferometry, Proc. SPIE, 5491, 1498 [NASA ADS] [CrossRef] [Google Scholar]
  18. Wirnitzer, B. 1985, J. Opt. Soc. Am. A, 2, 14 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Derivation of the bias-free bispectrum estimator

The ideal, noise-free bispectrum is given by: B ijk = C ij C jk C ki = p 1 p 2 p 3 Λ p 1 Λ p 2 Λ p 3 H p 1 ij H p 2 jk H p 3 ki , \appendix \setcounter{section}{1} \begin{equation} B^{ijk} = C^{ij}C^{jk}C^{ki} = \sum_{p1}\sum_{p2}\sum_{p3} \Lambda_{p1}\Lambda_{p2}\Lambda_{p3} H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}, \end{equation}(A.1)which can be split into: B ijk = p Λ p 3 H p ij H p jk H p ki + p p 1 Λ p 2 Λ p 1 H p 1 ij H p jk H p ki + p p 2 Λ p 2 Λ p 2 H p ij H p 2 jk H p ki + p p 3 Λ p 2 Λ p H p ij H p jk H p 3 ki + p 1 p 2 p 3 Λ p 1 Λ p 2 Λ p 3 H p 1 ij H p 2 jk H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} \label{eqn:bispec_split} B^{ijk} &=& \sum_{p} \Lambda^{3}_{p}H^{ij}_{p}H^{jk}_{p}H^{ki}_{p} \nonumber\\ &+& \underset{\ p\ \neq\ p1}{\sum\sum}\Lambda_{p}^{2}\Lambda_{p1}H^{ij}_{p1}H^{jk}_{p}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p2}{\sum\sum}\Lambda_{p}^{2}\Lambda_{p2}H^{ij}_{p}H^{jk}_{p2}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p3}{\sum\sum}\Lambda_{p}^{2}\Lambda_{p}H^{ij}_{p}H^{jk}_{p}H^{ki}_{p3}\nonumber\\ &+&\underset{\ p1\ \neq\ p2\ \neq\ p3}{\sum\sum\sum}\Lambda_{p1} \Lambda_{p2}\Lambda_{p3} H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}. \end{eqnarray}(A.2)The equivalent expression for real data embedded with noise when average over frames is: c ij c jk c ki = p 1 p 2 p 3 i p 1 i p 2 i p 3 H p 1 ij H p 2 jk H p 3 ki , \appendix \setcounter{section}{1} \begin{equation} \left\langle c^{ij}c^{jk}c^{ki}\right\rangle= \sum_{p1}\sum_{p2}\sum_{p3} \left\langle i_{p1}i_{p2}i_{p3}\right\rangle H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}, \end{equation}(A.3)which splits into: c ij c jk c ki = p i p 3 H p ij H p jk H p ki + p p 1 i p 2 i p 1 H p 1 ij H p jk H p ki + p p 2 i p 2 i p 2 H p ij H p 2 jk H p ki + p p 3 i p 2 i p 3 H p ij H p jk H p 3 ki + p 1 p 2 p 3 i p 1 i p 2 i p 3 H p 1 ij H p 2 jk H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} \left\langle c^{ij}c^{jk}c^{ki}\right\rangle &=& \sum_{p}\left\langle i^{3}_{p}\right\rangle H^{ij}_{p}H^{jk}_{p}H^{ki}_{p} \nonumber\\ &+& \underset{\ p\ \neq\ p1}{\sum\sum} \left\langle i^{2}_{p}\right\rangle \left\langle i_{p1}\right\rangle H^{ij}_{p1}H^{jk}_{p}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p2}{\sum\sum} \left\langle i^{2}_{p}\right\rangle \left\langle i_{p2}\right\rangle H^{ij}_{p}H^{jk}_{p2}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p3}{\sum\sum} \left\langle i^{2}_{p}\right\rangle \left\langle i_{p3}\right\rangle H^{ij}_{p}H^{jk}_{p}H^{ki}_{p3}\nonumber\\ &+&\underset{\ p1\ \neq\ p2\ \neq\ p3}{\sum\sum\sum} \left\langle i_{p1}\right\rangle\left\langle i_{p2}\right\rangle \left\langle i_{p3}\right\rangle H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}. \end{eqnarray}(A.4)Substitution of the noise model in Eqs. (13) to (15) gives the noisy bispectrum in terms of the ideal interferogram and associated noise: c ij c jk c ki = p ( Λ p 3 + 3 Λ p 2 + Λ p + 3 Λ p σ p 2 ) H p ij H p jk H p ki + p p 1 ( Λ p 2 + Λ p + σ p 2 ) Λ p 1 H p 1 ij H p jk H p ki + p p 2 ( Λ p 2 + Λ p + σ p 2 ) Λ p 2 H p ij H p 2 jk H p ki + p p 3 ( Λ p 2 + Λ p + σ p 2 ) Λ p 3 H p ij H p jk H p 3 ki + p 1 p 2 p 3 Λ p 1 Λ p 2 Λ p 3 H p 1 ij H p 2 jk H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} \left\langle c^{ij}c^{jk}c^{ki}\right\rangle &=& \sum_{p}\left( \Lambda^{3}_{p} + 3\Lambda^{2}_{p} + \Lambda_{p} + 3\Lambda_{p}\sigma^{2}_{p} \right) H^{ij}_{p}H^{jk}_{p}H^{ki}_{p} \nonumber\\ &+& \underset{\ p\ \neq\ p1}{\sum\sum}\left( \Lambda^{2}_{p} + \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p1} H^{ij}_{p1}H^{jk}_{p}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p2}{\sum\sum}\left( \Lambda^{2}_{p} + \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p2} H^{ij}_{p}H^{jk}_{p2}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p3}{\sum\sum}\left( \Lambda^{2}_{p} + \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p3} H^{ij}_{p}H^{jk}_{p}H^{ki}_{p3}\nonumber\\ &+&\underset{\ p1\ \neq\ p2\ \neq\ p3}{\sum\sum\sum}\Lambda_{p1} \Lambda_{p2}\Lambda_{p3} H^{ij}_{p1}H^{jk}_{p2}H^{ki}_{p3}. \end{eqnarray}(A.5)Substituting the ideal bispectrum split sum from Eq. (A.2) gives: c ij c jk c ki = B ijk + p ( 3 Λ p 2 + Λ p + 3 Λ p σ p 2 ) H p ij H p jk H p ki + p p 1 ( Λ p + σ p 2 ) Λ p 1 H p 1 ij H p jk H p ki + p p 2 ( Λ p + σ p 2 ) Λ p 2 H p ij H p 2 jk H p ki + p p 3 ( Λ p + σ p 2 ) Λ p 3 H p ij H p jk H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} \left\langle c^{ij}c^{jk}c^{ki}\right\rangle &=& B^{ijk} + \sum_{p}\left(3\Lambda^{2}_{p} + \Lambda_{p} + 3\Lambda_{p}\sigma^{2}_{p} \right) H^{ij}_{p}H^{jk}_{p}H^{ki}_{p} \nonumber\\ &+& \underset{\ p\ \neq\ p1}{\sum\sum}\left( \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p1} H^{ij}_{p1}H^{jk}_{p}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p2}{\sum\sum}\left( \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p2} H^{ij}_{p}H^{jk}_{p2}H^{ki}_{p}\nonumber\\ &+& \underset{\ p\ \neq\ p3}{\sum\sum}\left( \Lambda_{p} + \sigma^{2}_{p}\right)\Lambda_{p3} H^{ij}_{p}H^{jk}_{p}H^{ki}_{p3}. \end{eqnarray}(A.6)By rearranging and substituting in for the ensemble average over the real interferogram we find the ideal bispectrum in terms of the noisy interferogram: B ijk = c ij c jk c ki p ( 3 i p 2 2 i p + 3 i p σ p 2 3 σ p 2 ) H p ij H p jk H p ki p p 1 i p + σ p 2 i p 1 H p 1 ij H p jk H p ki p p 2 i p + σ p 2 i p 2 H p ij H p 2 jk H p ki p p 3 i p + σ p 2 i p 3 H p ij H p jk H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} B^{ijk} &=& \left\langle c^{ij}c^{jk}c^{ki}\right\rangle\nonumber\\ &-& \left\langle\sum_{p}\left(3 i^{2}_{p} -2 i_{p} + 3 i_{p}\sigma^{2}_{p} - 3 \sigma^{2}_{p} \right) H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}\right\rangle \nonumber\\ &-& \underset{\ p\ \neq\ p1}{\sum\sum} \left\langle i_{p} + \sigma^{2}_{p}\right\rangle\left\langle i_{p1}\right\rangle H^{ij}_{p1}H^{jk}_{p}H^{ki}_{p}\nonumber\\ &-& \underset{\ p\ \neq\ p2}{\sum\sum} \left\langle i_{p} + \sigma^{2}_{p}\right\rangle\left\langle i_{p2}\right\rangle H^{ij}_{p}H^{jk}_{p2}H^{ki}_{p}\nonumber\\ &-& \underset{\ p\ \neq\ p3}{\sum\sum} \left\langle i_{p} + \sigma^{2}_{p}\right\rangle\left\langle i_{p3}\right\rangle H^{ij}_{p}H^{jk}_{p}H^{ki}_{p3}. \end{eqnarray}(A.7)Expanding out the double summations using the fact that  ∑  ∑ ab =  ∑  ∑ a b −  ∑ a = b gives:

B ijk = c ij c jk c ki p ( 3 i p 2 2 i p + 3 i p σ p 2 3 σ p 2 ) H p ij H p jk H p ki + 3 p ( i p 2 + i p σ p 2 ) H p ij H p jk H p ki p ( i p + σ p 2 ) H p jk H p ki p 1 i p 1 H p 1 ij p ( i p + σ p 2 ) H p ij H p ki p 2 i p 2 H p 2 jk p ( i p + σ p 2 ) H p ij H p jk p 3 i p 3 H p 3 ki . \appendix \setcounter{section}{1} \begin{eqnarray} B^{ijk}&=& \left\langle c^{ij}c^{jk}c^{ki}\right\rangle\nonumber\\ &-& \left\langle\sum_{p}\left(3i^{2}_{p} -2 i_{p} + 3 i_{p}\sigma^{2}_{p} - 3 \sigma^{2}_{p} \right) H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}\right\rangle \nonumber\\ &+& 3\left\langle\sum_{p}\left(i^{2}_{p} + i_{p}\sigma^{2}_{p}\right) H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}\right\rangle \nonumber\\ &-& \left\langle\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{jk}_{p}H^{ki}_{p}\sum_{p1} i_{p1} H^{ij}_{p1}\right\rangle\nonumber\\ &-&\left\langle\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{ki}_{p}\sum_{p2} i_{p2} H^{jk}_{p2}\right\rangle\nonumber\\ &-&\left\langle\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{jk}_{p}\sum_{p3} i_{p3} H^{ki}_{p3}\right\rangle. \end{eqnarray}(A.8)Cancelling terms and noting that, p i p H p ij = c ij \hbox{$\sum_{p}i_{p}H^{ij}_{p} = c^{ij}$}, we find the bias free bispectrum estimator:

B bias free ijk = c ij c jk c ki c ij p ( i p + σ p 2 ) H p jk H p ki c jk p ( i p + σ p 2 ) H p ij H p ki c ki p ( i p + σ p 2 ) H p ij H p jk + p ( 2 i p + 3 σ p 2 ) H p ij H p jk H p ki . \appendix \setcounter{section}{1} \begin{eqnarray} B^{ijk}_{\mathrm{bias-free}}&=& c^{ij}c^{jk}c^{ki}\nonumber\\ &-& c^{ij}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{jk}_{p}H^{ki}_{p} \nonumber\\ &-&c^{jk}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{ki}_{p} \nonumber\\ &-& c^{ki}\sum_{p}\left(i_{p} + \sigma^{2}_{p}\right) H^{ij}_{p}H^{jk}_{p} \nonumber\\ &+& \sum_{p}\left(2 i_{p} +3\sigma^{2}_{p}\right)H^{ij}_{p}H^{jk}_{p}H^{ki}_{p}. \end{eqnarray}(A.9)

All Tables

Table 1

Simulation parameters.

All Figures

thumbnail Fig. 1

The relative importance of the photon, read (σp = 3e) and coupled noise terms under DFT conditions for the power spectrum variance with respect to the number of source photons detected. The area highlighted in grey is the region where the coupled terms dominate the power spectrum variance.

In the text
thumbnail Fig. 2

The transform matrix, W under non-DFT conditions. The DC term shows the beam profile adopted, and the cosine (solid lines) and sine (dotted lines) components of the fringe pattern for each baseline show the deviation from integer number of fringe periods across the detector (with frequencies in the ratio 1.1 ×  {1:2:3}). This is similar (but with the addition of a DC term) to the visibility to pixel matrix introduced by Le Bouquin & Tatulli (2006).

In the text
thumbnail Fig. 3

The bias in closure phase for B 0 ijk \hbox{$B^{ijk}_{0}$} (solid line), B 1 ijk \hbox{$B^{ijk}_{1}$} (dashed line), and B 2 ijk \hbox{$B^{ijk}_{2}$} (dotted line) estimators under a number of measurement conditions. Plots a) and b) show DFT conditions, whilst plots c) and d) show non-DFT conditions (described in the text). Read noise of σp = 3e is present in plots b) and d). The thin and think lines show 120 and 300 photons reaching the detector respectively. The shaded regions in plots c) and d) show the range of possible biases resulting from different possible combinations of phases on the three baselines contributing to the source closure phase.

In the text
thumbnail Fig. 4

The number of integrated frames required to reduce the statistical uncertainty to the level of the bias for the B 1 ijk \hbox{$B^{ijk}_{1}$} (dashed) and B 2 ijk \hbox{$B^{ijk}_{2}$} (dotted) estimators.

In the text
thumbnail Fig. 5

Demonstration of the effect of bias on a closure phase measurement. Plot a): the closure phase trace resulting from a binary pair consisting of a resolved primary of radius R and an unresolved secondary with flux ratio r = 0.01 for separations (s = 10 R,100 R,1000 R). Plot b): the closure phase trace for the 100 R system recovered by the three estimators. B 0 ijk \hbox{$B^{ijk}_{0}$} correctly returns the ideal closure phase, whilst B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} do not. Both plots show the single source closure phase signal (thin solid line) for reference. Plots c) and d) show the closure phase calculated using B 1 ijk \hbox{$B^{ijk}_{1}$} and B 2 ijk \hbox{$B^{ijk}_{2}$} respectively as in plot b) along with the statistical uncertainty on the measurement after 103, 104 and 105 frames.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.