Coronagraphic phase diversity: performance study and laboratory demonstration

B. Paul; J.-F. Sauvage; L. M. Mugnier

doi:10.1051/0004-6361/201220940

Volume 552 (April 2013)

A&A, 552 (2013) A48

Issue: A&A Volume 552, April 2013
Article Number: A48
Number of page(s): 11
Section: Astronomical instrumentation
DOI: https://doi.org/10.1051/0004-6361/201220940
Published online: 25 March 2013

B. Paul¹,²,³, J.-F. Sauvage¹,³ and L. M. Mugnier¹,³

¹ Onera – The French Aerospace Lab, 92322 Chatillon, France
e-mail: baptiste.paul@onera.fr
² Aix Marseille Université, CNRS, LAM (Laboratoire d’Astrophysique de Marseille) UMR 7326, 13388 Marseille, France
³ Groupement d’intérêt scientifique PHASE (Partenariat Haute résolution Angulaire Sol et Espace) between Onera, Observatoire de Paris, CNRS, Université Diderot, Laboratoire d’Astrophysique de Marseille and Institut de Planétologie et d’Astrophysique de Grenoble, France

Received: 17 December 2012
Accepted: 31 January 2013

Abstract

Context. The final performance of current and future instruments dedicated to exoplanet detection and characterization (such as SPHERE on the European Very Large Telescope, GPI on Gemini South, or future instruments on Extremely Large Telescopes) is limited by uncorrected quasi-static aberrations. These aberrations create long-lived speckles in the scientific image plane, which can easily be mistaken for planets.

Aims. Common adaptive optics systems require dedicated components to perform wave-front analysis. The ultimate wave-front measurement performance is thus limited by the unavoidable differential aberrations between the wave-front sensor and the scientific camera. To reach the level of detectivity required by high-contrast imaging, these differential aberrations must be estimated and compensated for. In this paper, we characterize and experimentally validate a wave-front sensing method that relies on focal-plane data.

Methods. Our method, called COFFEE (for COronagraphic Focal-plane wave-Front Estimation for Exoplanet detection), is based on a Bayesian approach and extends phase diversity to high-contrast imaging. It estimates the differential aberrations using only two focal-plane coronagraphic images recorded from the scientific camera itself.

Results. We first present a thorough characterization of COFFEE’s performance by means of numerical simulations. This characterization is then compared with an experimental validation of COFFEE using an in-house adaptive optics bench and an apodized Roddier & Roddier phase mask coronagraph. An excellent match between experimental results and the theoretical study is found. Lastly, we present a preliminary validation of COFFEE’s ability to compensate for the aberrations upstream of a coronagraph.

Key words: instrumentation: adaptive optics / instrumentation: high angular resolution / techniques: image processing / methods: numerical / methods: laboratory / telescopes

© ESO, 2013

1. Introduction

Exoplanet imaging is one of the main challenges in today’s astronomy. A direct observation of these planets can provide information on both the chemical composition of their atmospheres and their temperatures. Such observations have recently been made possible (Kalas et al. 2008; Marois et al. 2008; Lagrange et al. 2009), but only for planets with a high mass or a wide apparent separation from their host star.

Being able to image an object as faint as an extra-solar planet very close to its parent star requires the use of extreme AO (XAO) systems coupled to a high-contrast imaging technique, such as coronagraphy. Instruments dedicated to exoplanet imaging using these two techniques (SPHERE on the VLT, Beuzit et al. 2007; GPI on Gemini South, Macintosh et al. 2008) are currently being integrated. The performance of such systems is limited by residual speckles on the detector. These speckles, which originate in quasi-static non-common-path aberrations (NCPA), strongly decrease the extinction provided by the coronagraph and can be difficult to distinguish from an exoplanet. To achieve the ultimate system performance, these aberrations must be measured and compensated for. The current-generation instruments, SPHERE and GPI, rely on phase diversity (Gonsalves 1982) and an interferometric approach (Wallace et al. 2010), respectively, to compensate for these NCPA.

Several techniques dedicated to high-contrast imaging system optimization have been proposed for future systems. Some of them rely on dedicated wave-front sensing hardware (Guyon et al. 2009), whereas others use scientific focal-plane data and assume small aberrations. Iterative speckle nulling techniques (Bordé & Traub 2006; Give’on et al. 2007) estimate the electric field in the detector plane using at least three images. The technique proposed by Baudoz et al. (2006) relies on a modification of the imaging system, but requires only one image. These techniques aim at minimizing the energy in a chosen area (“dark hole”), leading to a contrast optimization on the detector (Trauger et al. 2010; Baudoz et al. 2012) in a closed-loop process.

We have recently proposed a focal-plane wave-front sensor, COFFEE (Sauvage et al. 2012), which is an extension of conventional phase diversity (Mugnier et al. 2006) to a coronagraphic system. Since COFFEE uses focal-plane images, it is possible to characterize the whole bench without any differential aberration. This method requires only two focal-plane images to estimate the aberrations upstream of the coronagraph, without modifying the coronagraphic imaging system and without assuming small aberrations. COFFEE’s principle and its application to the apodized Roddier & Roddier phase mask (ARPM) are described in Sect. 2. In Sect. 3, we evaluate the quality of NCPA estimation by realistic simulations. In Sect. 4, we present the experimental results from the laboratory demonstration of COFFEE on an in-house adaptive optics bench (BOA) with an ARPM. Section 5 concludes the paper.

2. COFFEE: principle

2.1. Extension of phase diversity to coronagraphic images

Figure 1 describes the coronagraphic imaging scheme considered in this paper. We consider four successive planes denoted A (circular entrance pupil of diameter D_u), B (coronagraphic focal plane), C (Lyot stop), and D (detector plane). The optical aberrations are considered static and are introduced in the pupil planes A and C. The coronagraphic device is composed of a focal-plane mask located in plane B and a Lyot stop situated in plane C. No particular assumption is made on the pupil shape or intensity. Thus, the description of COFFEE is compatible with several coronagraphic devices. COFFEE uses two images, $\boldsymbol{i}_{\rm c}^{\text{foc}}$ and $\boldsymbol{i}_{\rm c}^{\text{div}}$, recorded on the detector (plane D in Fig. 1) that, as in phase diversity, differ by a known aberration, φ_div, to estimate aberrations both upstream (φ_u) and downstream (φ_d) of the coronagraph.

Fig. 1

Coronagraphic imaging instrument: principle.

Considering the calibration of the instrument with an unresolved object, we use the following imaging model:

$$\begin{aligned} \boldsymbol{i}_{\rm c}^{\text{foc}} &= \alpha\,\boldsymbol{h}_{\text{det}} \star \boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm u}, \boldsymbol{\phi}_{\rm d}) + \boldsymbol{n}^{\text{foc}} + \beta \\ \boldsymbol{i}_{\rm c}^{\text{div}} &= \alpha\,\boldsymbol{h}_{\text{det}} \star \boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm u} + \boldsymbol{\phi}_{\rm div}, \boldsymbol{\phi}_{\rm d}) + \boldsymbol{n}^{\text{div}} + \beta \end{aligned} \tag{1}$$

where α is the incoming flux, h_c the coronagraphic “point spread function” (PSF) of the instrument (i.e. the response of a coronagraphic imaging system to a point source), h_det the known detector PSF, n^foc and n^div are the measurement noises, β is a uniform background (offset), and ⋆ denotes the discrete convolution operation. Such an imaging model can be used for any coronagraphic PSF expression h_c. The measurement noises n^foc and n^div comprise both photon and detector noises. Because calibration is assumed to be performed with high flux levels, we adopt a non-stationary white Gaussian model, which is a good approximation of a mix of photon and detector noises. Its variance is the sum of the photon and detector noise variances: $\boldsymbol{\sigma}^2_{\rm n}[t]=\boldsymbol{\sigma}^2_{\text{ph}}[t] + \sigma^2_{\text{det}}$ (Mugnier et al. 2004), with t the pixel position in the detector plane. The former can be estimated as the image itself thresholded to positive values, and the latter can be calibrated prior to the observations.
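As an illustration, the noise variance map described above can be sketched in a few lines of Python. This is a minimal sketch and not the implementation used in this work: it assumes the image is expressed in photo-electrons, so that the photon noise variance equals the positive part of the pixel value, and the function name `noise_variance_map` is hypothetical.

```python
def noise_variance_map(image, sigma_det):
    """Per-pixel noise variance: photon term plus detector term.

    image     : 2-D list of pixel values, assumed to be in photo-electrons
    sigma_det : detector noise standard deviation (electrons), calibrated beforehand
    """
    # The photon variance is estimated as the image thresholded to positive values;
    # the detector variance is the same for every pixel.
    return [[max(pixel, 0.0) + sigma_det ** 2 for pixel in row] for row in image]


variance = noise_variance_map([[100.0, -3.0], [0.0, 2500.0]], sigma_det=6.0)
```

Negative pixels (possible after background subtraction) contribute only the detector term, which keeps every variance strictly positive when it is later used as a weight.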

We adopt a maximum a posteriori (MAP) approach and estimate the aberrations φ_u and φ_d, the flux α, and the background β that minimize the neg-log-likelihood of the data, potentially penalized by regularization terms on φ_u and φ_d designed to enforce smoothness of the sought phases:

$$(\hat{\alpha}, \hat{\beta}, \hat{\boldsymbol{\phi}}_{\rm u}, \hat{\boldsymbol{\phi}}_{\rm d}) = \underset{\alpha, \beta, \boldsymbol{\phi}_{\rm u}, \boldsymbol{\phi}_{\rm d}}{\arg\min}\; J(\alpha, \beta, \boldsymbol{\phi}_{\rm u}, \boldsymbol{\phi}_{\rm d}) \tag{2}$$

where

$$\begin{aligned} J(\alpha, \beta, \boldsymbol{\phi}_{\rm u}, \boldsymbol{\phi}_{\rm d}) = {} & \frac{1}{2}\left\|\frac{\boldsymbol{i}_{\rm c}^{\text{foc}} - \left(\alpha\,\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm u},\boldsymbol{\phi}_{\rm d})+\beta\right)}{\boldsymbol{\sigma}_{\rm n}^{\text{foc}}} \right\|^2 \\ & + \frac{1}{2}\left\|\frac{\boldsymbol{i}_{\rm c}^{\text{div}} - \left(\alpha\,\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm u}+\boldsymbol{\phi}_{\rm div},\boldsymbol{\phi}_{\rm d})+\beta\right)}{\boldsymbol{\sigma}_{\rm n}^{\text{div}}}\right\|^2 \\ & + \mathcal{R}(\boldsymbol{\phi}_{\rm u}) + \mathcal{R}(\boldsymbol{\phi}_{\rm d}) \end{aligned} \tag{3}$$

where ∥x∥² denotes the sum of squared pixel values of map x, $\boldsymbol{\sigma}_{\rm n}^{\text{foc}}$ and $\boldsymbol{\sigma}_{\rm n}^{\text{div}}$ are the noise standard deviation maps of each image, and ℛ is a regularization metric for the phase.

Any aberration φ is expanded on a basis {Z_k} that is typically either Zernike polynomials or the pixel indicator functions in the corresponding pupil plane: $\boldsymbol{\phi} = \sum_k a_k \boldsymbol{Z}_k$, where the summation is, in practice, limited to the number of coefficients considered sufficient to correctly describe the aberrations. In this paper, the phase is expanded on a truncated Zernike basis. The impact of using a regularization metric with such a basis is studied later in this paper. In the MAP framework, the regularization metrics ℛ(φ_u) and ℛ(φ_d) are deduced from the assumed a priori statistics of φ_u and φ_d. Assuming these aberrations are zero-mean and Gaussian, and neglecting a priori correlations between Zernike modes, we obtain, for an estimation performed on N Zernike modes:

$$\mathcal{R}(\boldsymbol{\phi}_x)=\frac{1}{2}\boldsymbol{a}_x^{\rm t} R_{a_x}^{-1}\boldsymbol{a}_x = \frac{1}{2}\sum_{k=1}^N\frac{a_{x_k}^2}{\sigma_{x_k}^2}, \tag{4}$$

where $\sigma_{x_k}^2$ is the assumed phase variance per Zernike mode, $R_{a_x}$ the covariance matrix of the coefficients, and $\boldsymbol{a}_x$ an N-element vector containing the estimated Zernike coefficients $a_{x_k}$. Here x is either u (upstream) or d (downstream).
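The regularization term of Eq. (4) and its gradient with respect to the Zernike coefficients are simple enough to sketch directly. The Python below is illustrative only; the function names are hypothetical, and the diagonal covariance reflects the stated assumption of uncorrelated modes.

```python
def regularization(coeffs, variances):
    """R(phi_x) = 1/2 * sum_k a_k^2 / sigma_k^2 (diagonal prior covariance)."""
    return 0.5 * sum(a * a / v for a, v in zip(coeffs, variances))


def regularization_gradient(coeffs, variances):
    """Derivative of R with respect to each Zernike coefficient: a_k / sigma_k^2."""
    return [a / v for a, v in zip(coeffs, variances)]


penalty = regularization([2.0, 1.0], [4.0, 0.5])
gradient = regularization_gradient([2.0, 1.0], [4.0, 0.5])
```

Modes with a small assumed variance are penalized more strongly, which is what damps poorly constrained high-order coefficients.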

The minimization of the metric J(α, β, φ_u, φ_d) of Eq. (3) is performed by means of a limited-memory variable metric (BFGS) method (Press et al. 2007; Thiébaut 2002), which is a fast quasi-Newton type minimization method. It uses both gradients $\frac{\partial J}{\partial\boldsymbol{\phi}_{\rm u}}$ and $\frac{\partial J}{\partial\boldsymbol{\phi}_{\rm d}}$. Flux α and offset β are obtained analytically using the gradients $\frac{\partial J}{\partial \alpha}$ and $\frac{\partial J}{\partial \beta}$ (implementation details, including gradient expressions, can be found in Appendix A).
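For fixed phases, J is quadratic in α and β, so setting both partial derivatives to zero gives a 2 × 2 linear system (a weighted linear least-squares fit of the data against the model PSF). The Python sketch below illustrates one way to solve it; it is not taken from Appendix A, the function name `flux_and_offset` is hypothetical, and images are passed as flattened pixel lists for brevity.

```python
def flux_and_offset(images, models, variances):
    """Closed-form alpha and beta minimising the data terms of Eq. (3).

    images, models, variances: lists (one entry per image, e.g. focused and
    diversity) of flattened pixel lists; models hold h_det * h_c for the
    current phase estimates.
    """
    s_mm = s_m = s_1 = s_mi = s_i = 0.0
    for image, model, variance in zip(images, models, variances):
        for i, m, v in zip(image, model, variance):
            w = 1.0 / v  # inverse-variance weight of this pixel
            s_mm += w * m * m
            s_m += w * m
            s_1 += w
            s_mi += w * m * i
            s_i += w * i
    det = s_mm * s_1 - s_m * s_m
    alpha = (s_mi * s_1 - s_m * s_i) / det
    beta = (s_mm * s_i - s_m * s_mi) / det
    return alpha, beta


# Noiseless toy data built with alpha = 1e6 and beta = 5.
models = [[0.1, 0.3, 0.6], [0.2, 0.5, 0.3]]
images = [[1e6 * m + 5.0 for m in model] for model in models]
variances = [[max(i, 0.0) + 1.0 for i in image] for image in images]
alpha, beta = flux_and_offset(images, models, variances)
```

The regularization terms do not depend on α or β, so they drop out of this step; only the phases need to be handled by the iterative minimizer.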

Sauvage et al. (2012) established that a suitable diversity phase φ_div for COFFEE was a mix of defocus and astigmatism: $\boldsymbol{\phi}_{\rm div}=a_4^{\rm div}\boldsymbol{Z}_4+a_5^{\rm div}\boldsymbol{Z}_5$ with $a_4^{\rm div}=a_5^{\rm div}=80\text{ nm rms}$, introduced upstream of the coronagraph. We therefore use this diversity phase in the following.
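To give a feel for the size of this diversity, the sketch below evaluates φ_div at a pupil point and converts nanometres to radians. It is an illustration under stated assumptions: Zernike modes follow the Noll (1976) ordering and normalization (unit rms over the unit disc), the wavelength is the 635 nm laser quoted in Sect. 4.1, and the function names are hypothetical.

```python
import math

WAVELENGTH_NM = 635.0  # laser wavelength quoted for the bench in Sect. 4.1


def nm_to_rad(opd_nm, wavelength_nm=WAVELENGTH_NM):
    """Convert an optical path difference in nm to a phase in radians."""
    return 2.0 * math.pi * opd_nm / wavelength_nm


def diversity_phase(rho, theta, a4_nm=80.0, a5_nm=80.0):
    """phi_div = a4 Z4 + a5 Z5 (radians) at normalised radius rho and angle theta."""
    z4 = math.sqrt(3.0) * (2.0 * rho ** 2 - 1.0)            # defocus, Noll Z4
    z5 = math.sqrt(6.0) * rho ** 2 * math.sin(2.0 * theta)  # astigmatism, Noll Z5
    return nm_to_rad(a4_nm) * z4 + nm_to_rad(a5_nm) * z5


# Orthonormal modes add in quadrature: total diversity rms in nm.
total_rms_nm = math.hypot(80.0, 80.0)
phase_at_centre = diversity_phase(0.0, 0.0)
```

Under these assumptions the two 80 nm rms terms combine to roughly 113 nm rms of diversity.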

2.2. Coronagraphic imaging model

The imaging model used by COFFEE in the criterion minimization (Eq. (3)) requires a coronagraphic PSF expression. In this paper, we use the analytical coronagraphic imaging model developed by Sauvage et al. (2010), whose formalism is summarized in this section, where $\boldsymbol{r}$ is the pupil plane position vector, $r$ its modulus, and $\boldsymbol{\gamma}$ the focal plane position vector. The entrance pupil function P_u(r) is such that

$$\boldsymbol{P}_{\rm u}(\boldsymbol{r})=\boldsymbol{\Pi}\left(\frac{2r}{D_{\rm u}}\right)\boldsymbol{\Phi}(\boldsymbol{r}) \tag{5}$$

with $\boldsymbol{\Pi}\left(\frac{2r}{D_{\rm u}}\right) = 1$ for $r\leq \frac{D_{\rm u}}{2}$ and 0 otherwise, where D_u is the entrance pupil diameter and Φ is an apodization function. In this paper, we consider that the impact of amplitude aberrations is negligible, which is a reasonable assumption for a ground-based, high-contrast imaging system such as SPHERE. Considering only static aberrations (no residual turbulent aberrations), the electric field Ψ_A in the entrance pupil plane can be written as

$$\boldsymbol{\Psi}_{\rm A}(\boldsymbol{r})=\boldsymbol{P}_{\rm u}(\boldsymbol{r})\,{\rm e}^{j\boldsymbol{\phi}_{\rm u}(\boldsymbol{r})}. \tag{6}$$

The field amplitude Ψ_B(γ) in plane B can be calculated, following Sauvage et al. (2010), using the analytical coronagraphic imaging model (called the “perfect coronagraph model” hereafter):

$$\boldsymbol{\Psi}_{\rm B}(\boldsymbol{\gamma})=\text{FT}^{-1}(\boldsymbol{\Psi}_{\rm A}(\boldsymbol{r}))-\eta_0\,\text{FT}^{-1}(\boldsymbol{P}_{\rm u}(\boldsymbol{r})), \tag{7}$$

where η₀ is the scalar that minimizes the energy leaving focal plane B, whose analytical value is given by

$$\eta_0=\frac{1}{\mathcal{N}}\iint_{\rm S}\boldsymbol{\Psi}_{\rm A}^*(\boldsymbol{r})\,\boldsymbol{P}_{\rm u}(\boldsymbol{r})\,{\rm d}\boldsymbol{r}, \tag{8}$$

where

$$\mathcal{N}=\iint_{\rm S}\boldsymbol{P}_{\rm u}^*(\boldsymbol{r})\,\boldsymbol{P}_{\rm u}(\boldsymbol{r})\,{\rm d}\boldsymbol{r}. \tag{9}$$

It is worth mentioning that η₀ is the exact definition of the instantaneous Strehl ratio given by Born & Wolf (1989). Note that η₀ = 1 when there is no aberration upstream of the coronagraph (φ_u(r) = 0), so that Ψ_B = 0 in such a case. No aberration in the entrance pupil leads to no energy leaving plane B, and thus to a perfect extinction in the detector plane D.

Propagating the wave from plane B (Eq. (7)) to plane D, we can write the electric field Ψ_D(γ) in the detector plane:

$$\begin{aligned} \boldsymbol{\Psi}_{\rm D}(\boldsymbol{\gamma}) = {} & \text{FT}^{-1}\left\{\boldsymbol{P}_{\rm d}(\boldsymbol{r})\,{\rm e}^{j(\boldsymbol{\phi}_{\rm u}(\boldsymbol{r})+\boldsymbol{\phi}_{\rm d}(\boldsymbol{r}))}\right\} \\ & - \eta_0\,\text{FT}^{-1}\left\{\boldsymbol{P}_{\rm d}(\boldsymbol{r})\,{\rm e}^{j\boldsymbol{\phi}_{\rm d}(\boldsymbol{r})}\right\}, \end{aligned} \tag{10}$$

where P_d(r) is the Lyot stop pupil function: $\boldsymbol{P}_{\rm d}(\boldsymbol{r})=\boldsymbol{\Pi}\left(\frac{2r}{D_{\rm d}}\right)\boldsymbol{P}_{\rm u}(\boldsymbol{r})$, with D_d the Lyot stop pupil diameter (D_d ≤ D_u). For the sake of simplicity, we omit the spatial variables r and γ in the following. The coronagraphic PSF of the instrument, denoted by h_c, is the square modulus of Ψ_D:

$$\boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm u},\boldsymbol{\phi}_{\rm d}) = \left|\text{FT}^{-1}\left(\boldsymbol{P}_{\rm d}\,{\rm e}^{j(\boldsymbol{\phi}_{\rm u}+\boldsymbol{\phi}_{\rm d})}\right) - \eta_0\,\text{FT}^{-1}\left(\boldsymbol{P}_{\rm d}\,{\rm e}^{j\boldsymbol{\phi}_{\rm d}}\right)\right|^2. \tag{11}$$

In this paper, this expression of the coronagraphic PSF is the one used by COFFEE for estimating φ_u and φ_d; i.e., Eq. (11) is inserted into the imaging model (Eq. (1)) used in the criterion minimization described in Eq. (3).
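Equations (5) to (11) can be turned into a compact numerical sketch. The Python below is an illustration rather than the code used for this study: it uses a naive, unnormalized discrete Fourier transform on a tiny 16 × 16 grid so that it stays dependency-free (an FFT library would be used in practice, and the PSF is therefore defined only up to a constant scale), a uniform pupil without apodization, and hypothetical function names. η₀ is computed exactly as written in Eq. (8).

```python
import cmath
import math


def dft2(field):
    """Naive, unnormalised 2-D DFT with a +j kernel (stands in for FT^-1)."""
    n = len(field)
    w = [[cmath.exp(2j * math.pi * u * x / n) for x in range(n)] for u in range(n)]
    rows = [[sum(w[u][x] * row[x] for x in range(n)) for u in range(n)] for row in field]
    return [[sum(w[v][y] * rows[y][u] for y in range(n)) for u in range(n)] for v in range(n)]


def disc(n, radius):
    """Binary circular pupil Pi(2r/D) sampled on an n x n grid (radius in pixels)."""
    c = (n - 1) / 2
    return [[1.0 if math.hypot(x - c, y - c) <= radius else 0.0 for x in range(n)]
            for y in range(n)]


def perfect_coronagraph_psf(p_u, p_d, phi_u, phi_d):
    """Coronagraphic PSF h_c of Eq. (11) on a square grid (phases in radians)."""
    n = len(p_u)
    idx = [(y, x) for y in range(n) for x in range(n)]
    # Eq. (9): normalisation; Eq. (8): eta0 with Psi_A = P_u exp(j phi_u) from Eq. (6).
    norm = sum(p_u[y][x] ** 2 for y, x in idx)
    eta0 = sum((p_u[y][x] * cmath.exp(1j * phi_u[y][x])).conjugate() * p_u[y][x]
               for y, x in idx) / norm
    # The two pupil-plane fields whose transforms appear in Eq. (10).
    aberrated = [[p_d[y][x] * cmath.exp(1j * (phi_u[y][x] + phi_d[y][x]))
                  for x in range(n)] for y in range(n)]
    reference = [[p_d[y][x] * cmath.exp(1j * phi_d[y][x])
                  for x in range(n)] for y in range(n)]
    ft_a, ft_r = dft2(aberrated), dft2(reference)
    # Eq. (11): squared modulus of the detector-plane field.
    return [[abs(ft_a[y][x] - eta0 * ft_r[y][x]) ** 2 for x in range(n)] for y in range(n)]


n, r_u = 16, 4.0
p_u = disc(n, r_u)
# Lyot stop slightly smaller than the entrance pupil (D_d = 0.99 D_u).
p_d = [[a * b for a, b in zip(ra, rb)] for ra, rb in zip(disc(n, 0.99 * r_u), p_u)]
zero = [[0.0] * n for _ in range(n)]
c = (n - 1) / 2
# Arbitrary defocus-like upstream phase, for illustration only.
phi_u = [[0.3 * ((x - c) ** 2 + (y - c) ** 2) / r_u ** 2 for x in range(n)] for y in range(n)]

h_flat = perfect_coronagraph_psf(p_u, p_d, zero, zero)
h_aber = perfect_coronagraph_psf(p_u, p_d, phi_u, zero)
```

With no upstream aberration the two terms of Eq. (10) cancel and `h_flat` vanishes everywhere, whereas any non-uniform φ_u leaves residual light in `h_aber`.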

As described by Sauvage et al. (2010), this model, which analytically describes the impact of a coronagraph in an imaging system, considers that the coronagraph removes the projection of the incoming electric field on an Airy pattern, represented by the parameter η₀ (Eq. (8)). Since it does not assume small aberrations, it can be used for any wave-front error upstream of the coronagraph. The quality of the fit of this analytical imaging model with the ARPM coronagraph is discussed later in this paper (Sect. 3.5).

3. Performance assessment by numerical simulation

The aim of this section is to quantify the impact of each error source on COFFEE’s aberration estimation. Such a study shows COFFEE’s sensitivity to the classical error sources that limit the phase retrieval in a real system (and thus the final extinction of the coronagraph), which is useful for defining COFFEE’s upgrades. Likewise, it allows us to estimate the accuracy level expected on our AO bench. In this section, we present the evolution of the reconstruction error with respect to the incoming flux (Sect. 3.1), to the size of the source (Sect. 3.2), to an error made on the assumed diversity phase used in the reconstruction (Sect. 3.3), and to the number of Zernike modes used in the reconstruction (Sect. 3.4). For each error source, coronagraphic images are computed using the imaging model presented in Eq. (1), using the perfect coronagraph model to calculate the coronagraphic PSF h_c, whose expression is given in Eq. (11). COFFEE then performs the phase estimation using these two images. The compatibility of COFFEE with realistic coronagraphic images is studied as well (Sect. 3.5) by computing coronagraphic images using a realistic coronagraph model and then running COFFEE to estimate the aberrations both upstream and downstream of the coronagraph.

Table 1 gathers the parameters used for these simulations.

Table 1

COFFEE: simulation parameters used for the performance assessments of Sects. 3.1–3.3.

The chosen wave-front error (WFE) values upstream and downstream of the coronagraph for these simulations are typical of the aberrations that will be estimated on our AO bench in Sect. 4 (so that experimental results can be compared to the following simulations). Since these simulations are performed with a small number of Zernike modes (36), no regularization metric is needed.

To simulate realistic aberrations, we have considered that the variance per Zernike mode $\sigma_k^2$ decreases with the radial order n(k) of the considered Zernike mode k (Noll 1976):

$$\sigma_k^2 \propto \frac{1}{n(k)^2}. \tag{12}$$

This corresponds to a decrease in the static aberration spatial spectrum as $\frac{1}{|\nu|^2}$, where ν is the spatial frequency, which is a common assumption for mirror fabrication errors. To evaluate COFFEE’s performance, we define the reconstruction error ϵ_x (where x stands for u (upstream) or d (downstream)) as

$$\epsilon_x=\sqrt{\sum_{k=2}^{N-1}|a_k-\hat{a}_k|^2} \tag{13}$$

with a_k the Zernike coefficients (starting with k = 2, corresponding to tilt) used for the simulation, â_k the reconstructed Zernike coefficients, and N the number of Zernike modes. In this section, every reconstruction error value is an average computed from ten independent simulated phases.
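The variance profile of Eq. (12) and the error metric of Eq. (13) can be sketched as follows. This is an illustrative Python sketch with hypothetical function names; it assumes Noll's single-index ordering (k = 1 is piston) and, as an additional assumption not stated in the text, scales the profile so that the modes k ≥ 2 add up to a requested total WFE.

```python
import math


def radial_order(k):
    """Radial order n of the Zernike mode with Noll index k (k = 1 is piston)."""
    n = 0
    while (n + 1) * (n + 2) // 2 < k:
        n += 1
    return n


def variance_profile(n_modes, wfe_rms):
    """sigma_k^2 proportional to 1/n(k)^2 for k = 2..n_modes, total rms = wfe_rms."""
    raw = {k: 1.0 / radial_order(k) ** 2 for k in range(2, n_modes + 1)}
    scale = wfe_rms ** 2 / sum(raw.values())
    return {k: scale * v for k, v in raw.items()}


def reconstruction_error(true_coeffs, estimated_coeffs):
    """Root of the summed squared coefficient differences over the modes supplied."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(true_coeffs, estimated_coeffs)))


variances = variance_profile(n_modes=36, wfe_rms=80.0)  # nm^2 per mode
error = reconstruction_error([3.0, 0.0], [0.0, 4.0])
```

The caller chooses which modes enter `reconstruction_error`, so the summation bounds of Eq. (13) are set by the lists passed in.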

3.1. Noise propagation

The ultimate limitation of an instrument lies in the amount of noise in the images. In Fig. 2, we present the reconstruction error for the aberrations upstream (φ_u) and downstream (φ_d) of the coronagraph with respect to the total incoming flux. Photon noise and detector noise (σ_det = 6e⁻) are added in the coronagraphic images for simulation.

Fig. 2

Aberrations upstream (φ_u (WFE = 80 nm), top) and downstream (φ_d (WFE = 20 nm), bottom) of the coronagraph: reconstruction error (solid red line) as a function of the incoming flux α. For comparison, the theoretical $1/\alpha$ (cyan dashed line) and $1/\sqrt{\alpha}$ (magenta dashed line) behaviours are plotted for detector noise only and photon noise only, respectively.

The reconstruction error presented in Fig. 2 is proportional to $1/\alpha$ in the detector-noise-limited regime (low flux) and to $1/\sqrt{\alpha}$ in the photon-noise-limited regime (high flux). In this figure, it can be seen that for an incoming flux α ≥ 10⁶ photons, the reconstruction error ϵ_u for the phase upstream of the coronagraph is smaller than 1 nm rms. Thus, in a calibration process, where high values of flux (≥ 10⁶ photons) can be easily reached, COFFEE’s performance will not be significantly affected by noise.

It is noteworthy that the results of many similar simulations with various levels of upstream aberrations show that COFFEE’s reconstruction error does not depend on the amplitude of the aberrations upstream of the coronagraph, as long as the diversity phase amplitude is larger than the WFE of the aberrations to be estimated.

3.2. Impact of the source size on the reconstruction error

Our imaging model, presented in Sect. 2.1 (Eq. (1)), assumes an unresolved object. Thus, the presence of a real source with a given spatial extension will have an impact on the phase reconstruction, which is quantified here. We consider a Gaussian-shaped laser source, emitted from a single-mode fiber. Because of the coherence of the incoming light, it can be represented as a Gaussian amplitude in the entrance pupil plane (where COFFEE assumes a uniform amplitude). Knowing this, coronagraphic images are simulated by considering a small coherent Gaussian-shaped beam ($\text{FWHM} \leq 0.5\frac{\lambda}{D}$) on the coronagraph, and then processed by COFFEE.

Fig. 3

Reconstruction errors upstream (red line) and downstream (blue line) of the coronagraph as functions of the size of the source on the coronagraph.

Since the imaging model assumes an unresolved object, both reconstruction errors for the phases upstream and downstream of the coronagraph increase with the FWHM of the coherent object, as shown in Fig. 3, but remain low: for an FWHM smaller than $\frac{\lambda}{3D}$, the reconstruction error is sub-nanometric. The size of the laser source should thus not be a limitation for COFFEE: if this error turned out not to be negligible in the total error budget, it could be included in the imaging model used by COFFEE (Eq. (1)) as a non-uniform (Gaussian) entrance pupil function P_u(r).

3.3. Sensitivity to a diversity phase error

The diversity phase $\boldsymbol{\phi}_{\rm div}=a_4^{\rm div}\boldsymbol{Z}_4+a_5^{\rm div}\boldsymbol{Z}_5$ has been defined in Sect. 2.1. This phase φ_div is one of the inputs that COFFEE needs in order to perform phase retrieval, so it must be calibrated as accurately as possible. To optimize the use of COFFEE, the impact of an error on such a calibration is studied. In this section, we consider that the diversity phase used to create the diversity image is not perfectly known. The simulated coronagraphic diversity image is computed with a diversity phase $\boldsymbol{\phi}_{\rm div}'=\boldsymbol{\phi}_{\rm div}+\boldsymbol{\phi}_{\rm err}$, with φ_err a randomly generated phase of given rms value, and COFFEE’s phase reconstruction is done considering that the diversity phase is equal to φ_div. In Fig. 4, we see that the reconstruction error increases linearly with the calibration error on the diversity phase, with a slope of 0.5. Thus, the requirement on the calibration precision for the diversity phase is typically the precision wanted for the aberration measurement.
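The set-up of this test can be sketched as follows. The Python below only illustrates how a perturbed diversity phase of prescribed rms might be generated; it assumes orthonormal (Noll-normalized) Zernike modes, so that the rms of a phase is the root sum of squares of its coefficients, and it spreads φ_err over modes Z4 to Z36 with white Gaussian coefficients, a choice that is not specified in the text. All names are hypothetical.

```python
import math
import random


def random_coefficients(n_modes, rms, rng):
    """Random Zernike coefficients rescaled to a prescribed total rms."""
    raw = [rng.gauss(0.0, 1.0) for _ in range(n_modes)]
    norm = math.sqrt(sum(c * c for c in raw))
    return [rms * c / norm for c in raw]


rng = random.Random(0)                  # fixed seed for reproducibility
a_div = [80.0, 80.0] + [0.0] * 31       # assumed diversity: Z4, Z5, then Z6..Z36 (nm rms)
a_err = random_coefficients(len(a_div), rms=5.0, rng=rng)
# Coefficients actually used to simulate the diversity image; the
# reconstruction would still assume a_div.
a_div_true = [d + e for d, e in zip(a_div, a_err)]
```

Repeating this for several rms values of `a_err` and recording the reconstruction error would trace out a curve of the kind shown in Fig. 4.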

Fig. 4

Reconstruction errors upstream (solid red line) and downstream (solid blue line) of the coronagraph as functions of the error on the diversity phase.

3.4. Impact of aliasing

The phase estimation is performed here on a truncated Zernike basis. In real images (recorded from a bench), some speckles will originate in high-order aberrations. These aberrations, which cannot be fitted by the truncated Zernike basis, will have an impact on the phase estimation, called aliasing error hereafter. Thus, it is necessary to study this aliasing error as a function of the number of Zernike modes used in the phase reconstruction. Here, we generate a phase on a large number of Zernike modes, and compute the corresponding images using the perfect coronagraph model. Aberrations both upstream and downstream of the coronagraph are then estimated by COFFEE using an increasing number of Zernike modes. Since one of the aims of this simulation is to determine the size of the truncated Zernike basis to be used with experimental data recorded on an in-house bench, the noise level in the simulated images corresponds to the one we have on this bench. The total incoming flux is 5 × 10⁶ photons, and the detector noise is σ_det = 1e⁻ per pixel. Parameters used for this simulation are gathered in Table 2. This simulation has been done with and without a regularization metric, so that we can demonstrate the relevance of this metric on phase estimation.

Table 2

COFFEE: simulation parameters for studying the aliasing error.

Fig. 5

Reconstruction errors upstream (top) and downstream (bottom) of the coronagraph as functions of the number of reconstructed Zernike modes, with a regularization metric (solid blue line) and without (solid red line).

Figure 5 presents the evolution of the reconstruction errors when the number of reconstructed Zernike modes increases. Here, every reconstruction error (Eq. (13)) is calculated on a basis of 350 Zernike modes; thus, the error originates both in high-order aberrations, which are not considered by COFFEE because of the Zernike basis finite size (modelling error), and in the impact of these high-order aberrations on the estimated ones (aliasing). The WFE corresponding to the aberrations that are not estimated by COFFEE (from N to 350, where N varies between 15 and 275 according to Table 2) is called “unmodelled WFE” hereafter.

In the plot of the reconstruction error upstream of the coronagraph (Fig. 5, top), one can see that without a regularization metric, the reconstruction error increases for a large number of Zernike modes. An interpretation of this behaviour is the following: because high-order aberrations have a smaller variance, their associated speckle intensity is lower. Thus, owing to the photon and detector noise in the image, the SNR is smaller for these aberrations. Such behaviour leads to a trade-off between aliasing and noise amplification for the optimal number of Zernike modes (Fig. 5). The best number of Zernike modes is then a function of the aberration level (WFE) and spectrum, as well as of the level of noise. The use of a regularization metric allows us to avoid this noise amplification (Fig. 5): the reconstruction error roughly reaches a saturation level (rather than growing to very high values). Additionally, the use of regularization reduces the aliasing error, and avoids the need for the difficult and somewhat ad hoc choice of the number of Zernike modes for the reconstruction.

According to the results presented in Fig. 5, we have chosen to estimate the aberrations upstream and downstream of the coronagraph on 170 Zernike modes with the regularization metric of Eq. (4).

3.5. Model mismatch

In Sauvage et al. (2012), we demonstrated that ARPM images are compatible with the perfect coronagraph model and therefore with COFFEE estimation. The Roddier & Roddier phase mask (RRPM; Roddier & Roddier 1997; Guyon et al. 1999) consists of a π phase-shifting mask slightly smaller than the Airy disk. Additionally, the use of a circular prolate function as entrance pupil apodization Φ_P (ARPM), proposed by Soummer et al. (2003), leads in a perfect case (no aberrations upstream of the coronagraph) to a total suppression of signal in the detector plane. In the simulations presented hereafter, realistic ARPM coronagraphic images are computed following Soummer et al. (2007) to consider an accurate numerical representation of Lyot-style coronagraphs. Then, we use COFFEE to reconstruct both phases upstream and downstream of the coronagraph. Here, when using the formalism developed in Sect. 2.2, the prolate apodization function Φ_P is included in both simulation and reconstruction imaging models.

Fig. 6

Reconstruction error upstream of the coronagraph with respect to the WFE of the aberration upstream of the coronagraph.

Because the perfect coronagraph model is not exactly identical to an ARPM (although their responses to aberrations is very close), there is a model mismatch in the estimation of aberrations upstream of the coronagraph φ_u, which varies linearly with the WFE of φ_u, as shown in Fig. 6. The model mismatch can thus be quantified as 7.5% of the WFE rms value of φ_u, except for very small WFE (≤1 nm rms), where the variation is non-linear, but remains below 1 nm rms.

Since this model mismatch varies linearly with the WFE of φ_u, it should not limit the ability to compensate for the aberration upstream of an ARPM using COFFEE as focal plane wave-front sensor (WFS).
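The argument can be illustrated with a minimal sketch, assuming an idealized iterative correction in which each estimate is wrong by 7.5% of the current WFE (the slope of Fig. 6) and the estimated phase is removed completely. All other error terms (noise, aliasing, DM fitting) are ignored, and the function name is hypothetical.

```python
def residual_wfe(wfe0_nm, mismatch=0.075, n_iter=5):
    """Residual WFE (nm rms) after each idealized correction step.

    Assumes the estimation error equals `mismatch` times the current WFE
    and that the estimated phase is fully subtracted at every step.
    """
    history = [wfe0_nm]
    for _ in range(n_iter):
        history.append(mismatch * history[-1])
    return history


history = residual_wfe(80.0)
print([round(w, 3) for w in history])
```

Under these assumptions the residual shrinks geometrically, so the mismatch alone does not set a floor. Below about 1 nm rms the linear law no longer holds (Fig. 6), so the sketch should not be read quantitatively in that regime.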

4. Laboratory demonstration

In this section we present experimental validations of coronagraphic phase diversity. These validations are performed on the BOA bench, described in Sect. 4.1. Section 4.2 describes a carefully designed method developed to introduce calibrated static aberrations on the AO bench to be measured with COFFEE. The error made on the measurements of aberrations upstream of the coronagraph (NCPA) is quantified in Sect. 4.3. Section 4.4 presents the static aberration measurement performance, and Sect. 4.5 details the procedure for compensating for the measured aberrations.

4.1. Experimental setup

Fig. 7

Adaptive optics testbed schematic representation. M_i: fold mirrors; MP_i: parabolic mirrors; L_i: lenses (doublets); BS: beam splitter; TTM: Tip-Tilt mirror; DM: deformable mirror; RRPM: coronagraphic focal plane mask; Φ: prolate apodizer; WFS: AO wave-front sensor

Figure 7 shows the design of our in-house bench. The input beam, emitted from a fibered laser source (λ = 635 nm), comes through the prolate apodizer Φ, which is in the entrance pupil plane (P_u). The beam is reflected by the tip-tilt mirror (TTM) and then by the deformable mirror (DM, entrance pupil, D_u = 40 mm, 6 × 6 actuators). The beam-splitter sends a fraction of the beam to the AO wave-front sensor (Shack-Hartmann, 5 × 5 sub-apertures). On the other channel, the light is focused onto an RRPM, whose diameter is d_c = 18.1 μm (angular diameter $1.06\frac{\lambda}{D_{\rm u}}$). After going through the Lyot stop plane (P_d, with D_d = 0.99D_u), the beam is focused onto the camera (256 × 256 pixel images with an oversampling of 2.75, detector noise σ_det = 1e⁻). For faster computations, recorded images are re-binned to 128 × 128 pixel images with an oversampling of 1.38.
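The re-binning step can be sketched as follows. This is only an illustration: it assumes the binning sums each 2 × 2 block of pixels (the text does not say whether pixels are summed or averaged), and `rebin2x2` is a hypothetical helper name.

```python
import numpy as np


def rebin2x2(image):
    """Sum each 2 x 2 block of pixels; both image sides must be even."""
    ny, nx = image.shape
    return image.reshape(ny // 2, 2, nx // 2, 2).sum(axis=(1, 3))


image = np.ones((256, 256))      # stand-in for a recorded frame
binned = rebin2x2(image)         # 128 x 128 frame, total flux preserved
oversampling = 2.75 / 2          # 1.375, quoted as 1.38 in the text
```

Halving the number of pixels per side halves the sampling of the focal-plane image, which is why the oversampling drops from 2.75 to about 1.38.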

4.2. Introduction of calibrated aberrations

To evaluate COFFEE’s performance, we introduce calibrated aberrations on the bench using a process described in this section. We consider an aberration phase φ_cal to be introduced on BOA. First, since the phase is represented by the DM with a finite number of actuators (6 × 6), the introduced aberration will not match the aberration φ_cal perfectly, as illustrated in Fig. 8 in the case of a pure spherical aberration.

Fig. 8

Introduction of calibrated aberration on BOA: case of a pure spherical aberration. Left: theoretical wave-front (top) and DM introduced wave-front (bottom). Right: corresponding Zernike modes for the theoretical introduced aberration (solid red line) and the DM introduced aberration (dashed blue line).

Our aim here is to introduce, using the DM, the aberration closest to φ_cal. We let F be the DM influence matrix (obtained by calibration); any DM introduced aberration φ^DM can be described as a set of actuator voltages u (φ^DM = Fu). We are thus looking for the set u_cal which solves the least-squares problem

$$\boldsymbol{u}_{\rm cal}=\underset{\boldsymbol{u}}{\arg\min}\left\| \boldsymbol{F}\boldsymbol{u} - \boldsymbol{\phi}_{\rm cal} \right\|^2. \qquad (14)$$

The solution of this problem can be written as

$$\boldsymbol{u}_{\rm cal}=\boldsymbol{T}\boldsymbol{\phi}_{\rm cal}, \qquad (15)$$

with T the generalized inverse of matrix F. Using the interaction matrix D (resulting from calibration), we can compute the corresponding set of slopes s_cal (s_cal = Du_cal), which can then be used to modify the AO loop reference slopes s_ref. Thus, closing the AO loop with the reference slopes s_ref + s_cal, we introduce an aberration $\boldsymbol{\phi}^{\rm DM}_{\rm cal} = \boldsymbol{F}\boldsymbol{u}_{\rm cal}=\boldsymbol{F}\boldsymbol{T}\boldsymbol{\phi}_{\rm cal}$ on the bench, which is the best fit of φ_cal in the least squares sense.
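A minimal numerical sketch of Eqs. (14) and (15) is given below. The matrices are random stand-ins, not the calibrated BOA matrices, and the sizes (200 phase samples, 50 slopes) are assumptions chosen only so that the example runs.

```python
import numpy as np

rng = np.random.default_rng(0)
n_pix, n_act, n_slopes = 200, 36, 50       # hypothetical sizes; 6 x 6 actuators
F = rng.normal(size=(n_pix, n_act))        # stand-in influence matrix
D = rng.normal(size=(n_slopes, n_act))     # stand-in interaction matrix
phi_cal = rng.normal(size=n_pix)           # stand-in target aberration

T = np.linalg.pinv(F)                      # generalized inverse of F
u_cal = T @ phi_cal                        # Eq. (15): least-squares voltages
s_cal = D @ u_cal                          # slopes added to the reference slopes
phi_dm = F @ u_cal                         # aberration the DM actually produces

# The fitting residual is orthogonal to every influence function.
residual = phi_cal - phi_dm
```

The residual is the part of φ_cal that the finite number of actuators cannot reproduce, which is the effect illustrated in Fig. 8.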

We also have to consider that the bench BOA presents its own unknown static aberrations $\boldsymbol{\phi}^{\text{BOA}}_{\rm u}$ and $\boldsymbol{\phi}^{\text{BOA}}_{\rm d}$ upstream and downstream of the coronagraph (respectively). Thus, if a calibrated aberration φ_cal is introduced in the entrance pupil, aberrations φ_u upstream of the coronagraph will be

$$\boldsymbol{\phi}_{\rm u}=\boldsymbol{\phi}_{\rm cal}+\boldsymbol{\phi}^{\text{BOA}}_{\rm u}. \qquad (16)$$

To get rid of the unknown aberration $\boldsymbol{\phi}^{\text{BOA}}_{\rm u}$, we perform a differential phase estimation:

1. We introduce the aberration $\boldsymbol{\phi}^{\rm DM}_{\rm cal}$ on the bench. A phase $\hat{\boldsymbol{\phi}}_{\rm u}^+=\hat{\boldsymbol{\phi}}^{\rm DM}_{\rm cal}+\hat{\boldsymbol{\phi}}^{\rm BOA}_{\rm u}$ is estimated using focused and diverse images recorded on the camera.
2. The opposite aberration $-\boldsymbol{\phi}^{\rm DM}_{\rm cal}$ is then introduced. A phase $\hat{\boldsymbol{\phi}}_{\rm u}^-=-\hat{\boldsymbol{\phi}}^{\rm DM}_{\rm cal}+\hat{\boldsymbol{\phi}}^{\rm BOA}_{\rm u}$ is estimated.
3. The half difference $\hat{\boldsymbol{\phi}}^{\rm DM}_{\rm cal}=\frac{\hat{\boldsymbol{\phi}}_{\rm u}^+-\hat{\boldsymbol{\phi}}_{\rm u}^-}{2}$ is our estimate of φ_cal.
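The three steps above can be sketched numerically. The `estimate` function is a hypothetical stand-in for a phase estimator (it returns the true phase plus optional Gaussian noise), and the aberration values are random placeholders, not bench data.

```python
import numpy as np

rng = np.random.default_rng(1)
n_modes = 15
phi_dm_cal = rng.normal(scale=20.0, size=n_modes)  # introduced aberration (nm)
phi_boa = rng.normal(scale=10.0, size=n_modes)     # unknown bench aberration (nm)


def estimate(phi_true, noise_nm=0.0):
    """Stand-in estimator: the true phase plus Gaussian noise of given rms."""
    return phi_true + rng.normal(scale=noise_nm, size=phi_true.shape)


phi_plus = estimate(phi_dm_cal + phi_boa)      # step 1
phi_minus = estimate(-phi_dm_cal + phi_boa)    # step 2
phi_hat = 0.5 * (phi_plus - phi_minus)         # step 3: bench term cancels
phi_boa_hat = 0.5 * (phi_plus + phi_minus)     # half sum isolates the bench term
```

Any contribution that is identical in both measurements, such as the static bench aberration, cancels in the half difference. An estimation error that changes sign with the introduced aberration would not cancel this way.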

The first use of this process is to calibrate the diversity phase itself. Since this phase will be introduced using the AO system, the actually introduced diversity phase will not exactly match the theoretical mix of defocus and astigmatism. We introduce the aberrations φ_div and − φ_div on the bench using the AO system. These two aberrations are then estimated using classical phase diversity (no coronagraph), with a pure defocus of diversity phase introduced using a flat glass plate of known thickness e in a focused beam.

Such a process gives us an accurate estimation of the diversity phase really introduced on the bench, with an estimated accuracy of 4 nm rms on the introduced aberration. This calibration is then used in COFFEE’s estimations performed on experimental images.

4.3. Performance assessment: error budget

From simulations presented in Sect. 3, we establish an error budget for estimating aberrations upstream of the coronagraph using experimental data:

⋄ Photon and detector noise error: on the BOA bench, the typical incoming flux is f_BOA = 5 × 10⁶ photons. Knowing that we have photon noise and a detector noise with σ_det = 1e⁻, we can evaluate the noise error: ϵ_noise = 0.9 nm rms.
⋄ The diversity phase φ_div has been calibrated using classical phase diversity, using the process presented in Sect. 4.2. Such an estimation has been performed with an error of 4.0 nm rms (value calculated from an error budget evaluated for a classical phase diversity estimation on the BOA bench. Such accuracy has already been obtained on this bench by Sauvage et al. 2007). According to Sect. 3.3, this error on the diversity phase leads to an error ϵ_model = 2.0 nm rms.
⋄ The source is a coherent Gaussian-shaped beam whose FWHM is $0.27\frac{\lambda}{D}$ on the coronagraph. According to the simulations of Sect. 3.2, this leads to a reconstruction error: ϵ_obj = 0.7 nm rms.
⋄ Residual turbulent speckles, which originate in uncorrected turbulent aberrations, are not included in the imaging model. To measure the impact of these speckles on the reconstruction, several wave-fronts have been successively recorded using a commercial Shack-Hartmann wave-front sensor. From these acquisitions, we calculate the WFE of the residual turbulent phase: σ_{φ_turb} = 1.2 nm rms. This residual turbulence will create speckles on the detector, which will be considered by COFFEE as originating in NCPA. Thus, the residual turbulence error ϵ_turb made by COFFEE is estimated as ϵ_turb = σ_{φ_turb} = 1.2 nm rms.
⋄ Aliasing error, which originates in high-order aberrations, has been studied in Sect. 3.4. For a phase upstream of the coronagraph estimated on N = 170 Zernike modes, we have ϵ_aliasing = 18.3 nm rms.
⋄ From simulations, we know that the model mismatch is 7.5% of WFE. For this study, we will not estimate aberrations with a WFE stronger than 80 nm rms. For such a WFE, the model error is ϵ_model = 6.0 nm rms.

As one can see in Table 3, the error budget is mainly driven by the aliasing error. The second most important term is the model mismatch (even though it goes to zero with the WFE).
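As a rough check of how these terms combine, the sketch below sums the values listed above in quadrature. This assumes the error sources are statistically independent; the dictionary keys are informal labels, not the paper's notation, and the result is not claimed to reproduce the total of Table 3.

```python
import math

# Error terms quoted in the list above, in nm rms.
terms = {
    "noise": 0.9,
    "diversity phase": 2.0,
    "source size": 0.7,
    "residual turbulence": 1.2,
    "aliasing": 18.3,
    "model mismatch": 6.0,
}

# Quadratic sum, valid if the terms are independent (an assumption).
total = math.sqrt(sum(value ** 2 for value in terms.values()))
aliasing_share = terms["aliasing"] ** 2 / total ** 2

print(f"total = {total:.1f} nm rms")
print(f"aliasing share of the variance = {aliasing_share:.0%}")
```

Under this assumption the aliasing term accounts for most of the total variance, in line with the statement that the budget is mainly driven by aliasing.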

Table 3

COFFEE: error budget for the estimation of an aberration upstream of the coronagraph on BOA.

4.4. Measurement of aberrations upstream of the coronagraph

In this section, we introduce calibrated aberrations on the BOA bench upstream of the coronagraph, and then estimate them with COFFEE in order to evaluate its performance. In the course of this study, we realized that the position of the coronagraphic image on the detector (quantified by the tip-tilt downstream of the coronagraph) is a critical issue. Indeed, it turned out that COFFEE was able to perform phase retrieval only for downstream tip-tilt [a₂,a₃] values within the range [−100 nm rms; 100 nm rms] ($[-\frac{\lambda}{6D}; \frac{\lambda}{6D}]$). To get rid of this constraint, we have developed a method to perform a preliminary estimation of the tip-tilt downstream of the coronagraph. This method, which uses the diversity image, is fully described in Appendix B.

4.4.1. Measurement of tip-tilt upstream of the coronagraph

In this section, we present the estimation of a tilt aberration upstream of the coronagraph using COFFEE. Using the AO system, we introduce a tilt aberration by adding a constant value δs_TT to the AO wave-front sensor reference slopes s_ref, and then closing the AO loop on the slopes s_ref + δs_TT. To accurately calibrate the introduced tilt, for each position, we first estimate the aberrations using classical phase diversity (no coronagraph). Then, the RRPM is put in the focal plane, and the same operation is repeated: for each position, we record two images, and then estimate the aberrations using COFFEE.

Fig. 9

Estimation of a tilt aberration on BOA: calibration (solid blue line) and COFFEE’s estimation with bound on the tip-tilt downstream of the coronagraph (dashed crossed red line) and without boundaries (dashed diamond green line).

From the upstream tilt reconstruction performed by COFFEE (Fig. 9), we calculate an average reconstruction error: ϵ_tilt = 2.1 nm. Part of this error is due to an error on the estimation of tip-tilt downstream of the coronagraph. An improved estimation has been performed by setting boundaries on the downstream tip-tilt. Its value is evaluated before COFFEE’s estimation using the method described in Appendix B, with the diversity coronagraphic image recorded for an upstream tip-tilt close to 0 nm rms (centered coronagraph). Such an estimation process gives us an estimation of tip-tilt downstream of the coronagraph $\{a_2^{\text{do}},a_3^{\text{do}}\}$ with an accuracy of ± 1.5 nm rms. Using this estimation as the starting value for the minimization, and setting bounds of ± 1.5 nm rms on it, we processed the same experimental data. This, in turn, results in a better estimation of tilt upstream of the coronagraph (Fig. 9), with an average error ϵ_tilt = 1.5 nm, which is close to the expected error per Zernike mode given in Sect. 4.3 (ϵ′ = 1.6 nm rms).

4.4.2. NCPA measurements

In this section, we introduce aberrations upstream of the coronagraph. The aberration φ_cal is expanded on the first 15 Zernike modes (which is the largest number of modes we can properly describe with our 6 × 6 DM), and then we estimate these aberrations using COFFEE, following the process described in Sect. 4.2. To take the DM action into account on the introduced phase (illustrated in Fig. 8), aberrations φ_cal are first estimated with classical phase diversity (no phase mask in the coronagraphic focal plane, Sauvage et al. 2007). This estimation gives us a calibration of the introduced aberration, which is then used to evaluate the accuracy of COFFEE’s estimation.

Fig. 10

COFFEE: NCPA estimation of an introduced phase φ_cal on BOA. Top: for an aberration + φ_cal, recorded coronagraphic image from the bench (left) and computed image using the reconstructed aberration $\hat{\boldsymbol{\phi}}_{\rm u}^+$ (right) (log. scale, same range for both images). Middle: same images for an aberration − φ_cal introduced and a reconstructed aberration $\hat{\boldsymbol{\phi}}_{\rm u}^-$ (log. scale, same range for both images). Bottom: calibrated introduced aberration (left) and COFFEE estimated aberration (right).

At convergence of the reconstruction, a very good match can be observed between the experimental images and the ones computed for the estimated aberrations (Fig. 10, top and middle). This, in turn, results in a very good match between the aberrations measured by COFFEE (Fig. 10, right) and the introduced ones (Fig. 10, left).

From the experimental phase estimation presented in Fig. 10, we compute a reconstruction error between the aberration calibrated with classical phase diversity and COFFEE’s estimation:

$$\epsilon_{\text{exp}}=22.5\ \text{nm rms}. \qquad (17)$$

One can notice that this error is close to the expected error budget, i.e. that there is a good match between the performance assessment study carried out in Sect. 3 and the experimental results presented in this section.

4.5. Low-order NCPA compensation

Lastly, the ability of COFFEE to compensate for the aberrations upstream of the coronagraph is tested on BOA. In Sect. 4.4, the aberrations upstream of the coronagraph are expanded on 170 Zernike modes, in order to have the smallest reconstruction error (according to Sect. 3.4).

As previously mentioned, the compensation on BOA is limited to the 15th Zernike mode. Thus, what is required in a closed loop process is the most accurate estimation of 15 Zernike modes rather than an accurate measurement of every estimated Zernike mode. Using a basis of 36 Zernike modes for the reconstruction is sufficient to give an accurate estimation of the first 15 Zernike modes: the aliasing error, which is the most important error source, will mainly degrade the estimation accuracy of the reconstructed high orders (close to Z₃₆).

To demonstrate the ability of COFFEE to be used in a closed loop, we introduce a set of aberrations on the DM by modifying the reference slopes, as described in Sect. 4.2. Then, we use the pseudo-closed loop (PCL) method described in Sauvage et al. (2007). This iterative process has four steps; for the PCL iteration i:

1. acquisition of the focused $\boldsymbol{i}_{\rm c}^{f}$ and diverse $\boldsymbol{i}_{\rm d}^{f}$ images;
2. estimation of the aberration $\hat{\boldsymbol{\phi}}_{\rm u}^i$ upstream of the coronagraph;
3. computation of the corresponding reference slopes correction $\delta \boldsymbol{s}=g\boldsymbol{DT}\hat{\boldsymbol{\phi}}_{\rm u}^i$, where D is the interaction matrix and T the generalized inverse of the influence matrix F, both defined in Sect. 4.2, and g is the PCL gain;
4. closing of the AO loop on the modified reference slopes.

The computation time (step 2) varies from 1 min to 2.5 min, allowing us to compensate for quasi-static aberrations upstream of the coronagraph. This compensation process is limited by the estimation accuracy of the first 15 Zernike modes performed by COFFEE (which corresponds to the error budget established in Sect. 4.3), and by the ability of the DM to reproduce a given wave-front. Indeed, the correction introduced on the bench (steps 3 and 4 of the PCL process) is the best fit of the estimated phase $\hat{\boldsymbol{\phi}}_{\rm u}^i$ in the least-squares sense (as presented in Sect. 4.2). The difference between the estimated aberration and the actual introduced correction will thus limit the compensation performance of the PCL process. Considering these two limitations, one can compute the variance $\sigma_{\text{BOA}}^2$ (for the first 15 Zernike modes) that can be reached on the BOA bench:

$$\sigma_{\text{BOA}}^2=4.4\times 10^{-2}\ \text{rad rms}^2. \qquad (18)$$

The correction and stabilization of the NCPA variance can be seen in Fig. 11. One can see that the variance of the 15 corrected Zernike modes reaches the expected asymptotic value $\sigma_{\text{BOA}}^2$. This result is the very first demonstration of COFFEE’s ability to compensate for aberrations upstream of the coronagraph. A compensation at levels compatible with SPHERE or GPI-like instruments will require using a DM with many more actuators, and working on the reduction of the dominant term of the error budget, which is aliasing.
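The role of the DM fitting limit in the PCL process can be illustrated with a toy simulation. It is only a sketch under strong assumptions: the estimate is taken to be error-free, the influence matrix is a random stand-in, the sign convention of the correction is chosen for convenience, and the DM phase is applied directly instead of going through the slopes δs. The gain is the value quoted for Fig. 11.

```python
import numpy as np

rng = np.random.default_rng(2)
n_pix, n_act, gain = 120, 15, 0.5
F = rng.normal(size=(n_pix, n_act))    # stand-in influence matrix
T = np.linalg.pinv(F)                  # generalized inverse of F
phi = rng.normal(size=n_pix)           # initial aberration (arbitrary units)

# Mean-square value of the part of phi that the DM cannot reproduce.
floor = np.mean((phi - F @ (T @ phi)) ** 2)

mean_squares = [np.mean(phi ** 2)]
for _ in range(20):
    phi_hat = phi                             # idealized, error-free estimate
    phi = phi - gain * (F @ (T @ phi_hat))    # apply the DM best fit, scaled by g
    mean_squares.append(np.mean(phi ** 2))
```

In this idealized setting the correctable part of the aberration decays as (1 − g)^i, and the residual converges to the fitting floor rather than to zero. On the bench the floor also includes the estimation error of the budget in Sect. 4.3, which this sketch leaves out.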

Fig. 11

PCL on the bench BOA (g_PCL = 0.5): variance of the residual static aberrations upstream of the coronagraph for the 36 COFFEE estimated Zernike modes (solid red line) and the 15 corrected modes (solid blue line). The magenta dashed line represents the ultimate performance one can reach according to the error budget detailed in Sect. 4.3.

5. Conclusion

In this paper, we have presented a thorough simulation study (Sect. 3) and a first experimental validation (Sect. 4) of the coronagraphic wave-front sensor called COFFEE, which consists mainly in the extension of the phase diversity concept to a coronagraphic imaging system. From the validation and careful performance assessment of COFFEE, we showed that COFFEE is currently limited by the aliasing error, due to high-order aberrations, which are difficult to model with a Zernike basis.

In Sect. 4, we presented a first experimental validation of COFFEE using an ARPM. We introduced calibrated aberrations upstream of the coronagraph (NCPA), using the AO sub-system, and estimated them with COFFEE. The accuracy we obtained on these estimations shows a very good match with our error budget. Lastly, we used COFFEE in an iterative process to perform a preliminary validation of COFFEE’s ability to compensate for the aberrations upstream of the coronagraph.

Several perspectives are currently considered to optimize COFFEE: firstly, in order to minimize the impact of the aliasing error on the phase reconstruction, we plan to perform the phase reconstruction on a pixel-wise map, which is more suitable than a truncated Zernike basis. Secondly, we would like to improve the imaging model, both to make COFFEE work with coronagraphs other than the ARPM and to reduce the model error, which is currently the second most important one, even though it goes to zero with the WFE. Two solutions are considered. In the absence of residual turbulence, an accurate imaging model is obtained by propagating the electric field through each plane of the coronagraphic imaging system (Fig. 1) for an arbitrary focal plane coronagraphic mask. Such a method, where no model error needs to be considered, can be used for a laboratory calibration. Alternatively, a more accurate analytical imaging model, which could include a residual turbulent aberration, can be developed. Such a model will ultimately allow us to perform NCPA estimation on images from the sky. These improvements should allow us to estimate and compensate for the aberrations upstream of the coronagraph using COFFEE with a nanometric precision in a closed loop process.

A further perspective is to extend COFFEE to phase and amplitude aberration estimation, in order to create a dark hole region in the coronagraphic image.

Acknowledgments

The authors would like to thank Mamadou N’Diaye, Kjetil Dohlen and Thierry Fusco for stimulating discussions, as well as Marc Ferrari, David Mouillet and Jean-Luc Beuzit for their support, and the Région Provence-Alpes-Côte d’Azur for partial financial support of B.P.’s scholarship. This work has been partially funded by the European Commission under FP7 Grant Agreement No. 312430 Optical Infrared Coordination Network for Astronomy.

References

Baudoz, P., Boccaletti, A., Baudrand, J., & Rouan, D. 2006, in Direct Imaging of Exoplanets: Science & Techniques, Proc. IAU Colloq. 200, eds. C. Aime, & F. Vakili (Cambridge, UK: Cambridge University Press), 553
Baudoz, P., Mazoyer, J., Mas, M., Galicher, R., & Rousset, G. 2012, in Ground-based and Airborne Instrumentation for Astronomy IV, Proc. Soc. Photo-Opt. Instrum. Eng., 8446
Beuzit, J.-L., Feldt, M., Dohlen, K., et al. 2007, in Proc. Conference In the Spirit of Bernard Lyot: The Direct Detection of Planets and Circumstellar Disks in the 21st Century, ed. P. Kalas (University of California, Berkeley, CA, USA)
Bordé, P. J., & Traub, W. A. 2006, ApJ, 638
Born, M., & Wolf, E. 1989, Principles of Optics (Pergamon Press)
Give’on, A., Belikov, R., Shaklan, S., & Kasdin, J. 2007, Opt. Express, 15
Gonsalves, R. 1982, Opt. Eng., 21
Gratadour, D., Mugnier, L. M., & Rouan, D. 2005, A&A, 443, 357
Guyon, O., Roddier, C., Graves, J., et al. 1999, PASP, 111
Guyon, O., Matsuo, T., & Angel, R. 2009, ApJ, 693
Kalas, P., Graham, J. R., Chiang, E., et al. 2008, Science, 332
Lagrange, A.-M., Gratadour, D., Chauvin, G., et al. 2009, A&A, 493, L21
Macintosh, B. A., Graham, J. R., Palmer, D. W., et al. 2008, in Adaptive Optics Systems, Proc. Soc. Photo-Opt. Instrum. Eng., 7015
Marois, C., Macintosh, B., Barman, T., et al. 2008, Science, 322
Mugnier, L. M., Robert, C., Conan, J.-M., Michau, V., & Salem, S. 2001, J. Opt. Soc. Am. A, 18, 862
Mugnier, L. M., Fusco, T., & Conan, J.-M. 2004, J. Opt. Soc. Am. A, 21, 1841
Mugnier, L. M., Blanc, A., & Idier, J. 2006, in Advances in Imaging and Electron Physics, ed. P. Hawkes (Elsevier), 141, 1
Noll, R. J. 1976, J. Opt. Soc. Am., 66, 207
Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 2007, Numerical Recipes: the art of scientific computing (Cambridge University Press)
Roddier, F., & Roddier, C. 1997, PASP, 109
Sauvage, J.-F., Fusco, T., Rousset, G., & Petit, C. 2007, J. Opt. Soc. Am. A, 24, 2334
Sauvage, J.-F., Mugnier, L. M., Rousset, G., & Fusco, T. 2010, J. Opt. Soc. Am. A, 27, A157
Sauvage, J.-F., Mugnier, L., Paul, B., & Villecroze, R. 2012, Opt. Lett., 37, 4808
Soummer, R., Aime, C., & Falloon, P. 2003, A&A, 397, 1161
Soummer, R., Pueyo, L., Sivaramakrishnan, A., & Vanderbei, R. 2007, Opt. Express, 15
Thiébaut, E. 2002, in Astronomical Data Analysis II, Proc. Soc. Photo-Opt. Instrum. Eng., 4847, 174
Thiébaut, E., & Conan, J.-M. 1995, J. Opt. Soc. Am. A, 12, 485
Trauger, J., Give’on, A., Gordon, B., et al. 2010, in Techniques and Instrumentation for Detection of Exoplanets III, Proc. Soc. Photo-Opt. Instrum. Eng., 6693
Wallace, J. K., Burruss, R. S., Bartos, R. D., et al. 2010, in Adaptive Optics Systems II, Proc. Soc. Photo-Opt. Instrum. Eng., 7736

Appendix A: Implementation details

COFFEE performs a phase estimation by minimizing a criterion J whose expression is given by Eq. (3). To estimate φ_u and φ_d (expanded on a truncated Zernike basis), we need both gradients $\frac{\partial J}{\partial\boldsymbol{a}_{\rm u}}$ and $\frac{\partial J}{\partial\boldsymbol{a}_{\rm d}}$, where $\boldsymbol{a}_{\rm x} = \{a_{{\rm x}_1}, a_{{\rm x}_2}, \ldots, a_{{\rm x}_N}\}$ is the vector of Zernike coefficients for an aberration expanded on N Zernike modes (x stands for u (upstream) or d (downstream)).

Let us write the numerical expression of J^foc, using the notations defined in Sect. 2.1: $\begin{matrix} \begin{matrix} \end{matrix} J = \frac{1}{2} \sum_{t} | \frac{i_{c}^{foc} [t] - α . h_{\det} [t] ⋆ h_{c}^{foc} [t] - β}{σ_{n}^{foc} [t]} | 2 \begin{matrix} \end{matrix} \\ \begin{matrix} \end{matrix} + \frac{1}{2} \sum_{t} | \frac{i_{c}^{div} [t] - α . h_{\det} [t] ⋆ h_{c}^{div} [t] - β}{σ_{n}^{div} [t]} | 2 \\ \begin{matrix} \end{matrix} + ℛ_{φ_{u}} + ℛ_{φ_{d}} \\ \begin{matrix} \end{matrix} = J^{foc} + J^{div} + ℛ_{φ_{u}} + ℛ_{φ_{d}} . \end{matrix}$ $\appendix \setcounter{section}{1} \begin{equation} \begin{aligned} J=\,&\frac{1}{2}\sum_{t} \left |\frac{\boldsymbol{i}_{\rm c}^{\text{foc}}[t] -\alpha.\boldsymbol{h}_{\text{det}}[t]\star\boldsymbol{h}_{\rm c}^{\text{foc}}[t]-\beta}{\boldsymbol{\sigma}_{\rm n}^{\text{foc}}[t]} \right |^2\\[2mm] &+\frac{1}{2}\sum_{t} \left |\frac{\boldsymbol{i}_{\rm c}^{\text{div}}[t] - \alpha.\boldsymbol{h}_{\text{det}}[t]\star\boldsymbol{h}_{\rm c}^{\text{div}}[t]-\beta}{\boldsymbol{\sigma}_{\rm n}^{\text{div}}[t]} \right |^2\\[2mm] &+\mathcal{R}_{\boldsymbol{\phi}_{\rm u}}+\mathcal{R}_{\boldsymbol{\phi}_{\rm d}}\\[2mm] =\,& J^{\text{foc}}+J^{\text{div}}+\mathcal{R}_{\boldsymbol{\phi}_{\rm u}}+\mathcal{R}_{\boldsymbol{\phi}_{\rm d}}. \end{aligned} \end{equation}$ (A.1)With t the pixel position in the detector plane. $σ_{n}^{foc}$ $\hbox{$\boldsymbol{\sigma}_{\rm n}^{\text{foc}}$}$ and $σ_{n}^{div}$ $\hbox{$\boldsymbol{\sigma}_{\rm n}^{\text{div}}$}$ are the noise variance maps. Considering the expression of J, we derive J^foc, and then deduce the gradients expressions of J^div using a trivial substitution. 
Expressions of the regularization terms gradients $\frac{\partial ℛ_{φ_{x}}}{\partial a_{x}}$ $\hbox{$\frac{\partial \mathcal{R}_{\boldsymbol{\phi}_{\rm x}}}{\partial \boldsymbol{a}_{\rm x}}$}$ are given by $\frac{\partial ℛ_{φ_{x}}}{\partial a_{x}} = R_{a_{x}}^{-1} a_{x} .$ $\appendix \setcounter{section}{1} \begin{equation} \frac{\partial \mathcal{R}_{\boldsymbol{\phi}_{\rm x}}}{\partial \boldsymbol{a}_{\rm x}}=R_{\boldsymbol{a}_{\rm x}}^{-1}\boldsymbol{a}_{\rm x}. \end{equation}$ (A.2)The calculation of gradients $\frac{∂J}{\partial φ_{u}}$ $\hbox{$\frac{\partial J}{\partial\boldsymbol{\phi}_{\rm u}}$}$ and $\frac{∂J}{\partial φ_{d}}$ $\hbox{$\frac{\partial J}{\partial\boldsymbol{\phi}_{\rm d}}$}$ is done following Mugnier et al. (2001): first, we calculate the gradient of J^f with respect to the PSF h_c: $\frac{\partial J^{foc}}{\partial h_{c}^{foc}} = \frac{1}{{σ_{n}^{foc}}^{2}} [α h_{\det} {(α . h_{\det} ⋆ h_{c}^{foc} - {i_{c}^{foc}}^{)}}^{]} .$ $\appendix \setcounter{section}{1} \begin{equation} \frac{\partial J^{\text{foc}}}{\partial\boldsymbol{h}_{\rm c}^{\text{foc}}}= \frac{1}{{\boldsymbol{\sigma}_{\rm n}^{\text{foc}}}^2}\left[\alpha\boldsymbol{h}_{\text{det}}\left(\alpha.\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{foc}}-\boldsymbol{i}_{\rm c}^\text{foc}\right)\right]. \end{equation}$ (A.3)Then, the calculation consists in derivating the gradient of the PSF h_c with respect to phases φ_u [k] and φ_d [l] at pixels k, l in pupils upstream and downstream of the coronagraph, respectively, and applying the chain rule, as already done in a non-coronagraphic case, e.g. in Thiébaut & Conan (1995). The calculation of both gradients $\frac{\partial J^{foc}}{\partial φ_{u} [k]}$ $\hbox{$\frac{\partial J^{\text{foc}}}{\partial\boldsymbol{\phi}_{\rm u}[k]}$}$ and $\frac{\partial J^{foc}}{\partial φ_{d} [l]}$ $\hbox{$\frac{\partial J^{\text{foc}}}{\partial\boldsymbol{\phi}_{\rm d}[l]}$}$ gives

$$\begin{aligned}
\frac{\partial J^{\text{foc}}}{\partial\boldsymbol{\phi}_{\rm u}[k]} =\,& 2\,\Im\left\{\boldsymbol{\psi}^{*}[k]\left[\text{FT}\left(\frac{\partial J^{\text{foc}}}{\partial \boldsymbol{h}_{\rm c}^{\text{foc}}}(\boldsymbol{\Psi}-\eta_0\boldsymbol{\Psi}_{\rm d})\right)\right][k]\right\}\\
&-2\,\Re\left(\frac{\partial\eta_0}{\partial\boldsymbol{\phi}_{\rm u}[k]}\sum_{t}\frac{\partial J^{\text{foc}}}{\partial \boldsymbol{h}_{\rm c}^{\text{foc}}}\boldsymbol{\Psi}^{*}\boldsymbol{\Psi}_{\rm d}\right)\\
&+\frac{\partial|\eta_0|^2}{\partial\boldsymbol{\phi}_{\rm u}[k]}\sum_{t}\frac{\partial J^{\text{foc}}}{\partial \boldsymbol{h}_{\rm c}^{\text{foc}}}|\boldsymbol{\Psi}_{\rm d}|^{2},\\
\frac{\partial J^{\text{foc}}}{\partial\boldsymbol{\phi}_{\rm d}[l]} =\,& 2\,\Im\left((\boldsymbol{\psi}^{*}[l]-\eta_0^{*}\boldsymbol{\psi}_{\rm d}^{*}[l])\times\left\{\text{FT}\left[\frac{\partial J^{\text{foc}}}{\partial \boldsymbol{h}_{\rm c}^{\text{foc}}}(\boldsymbol{\Psi}-\eta_0\boldsymbol{\Psi}_{\rm d})\right]\right\}[l]\right),
\end{aligned}$$

where ℑ and ℜ denote the imaginary and real parts, respectively, and

$$\begin{aligned}
&\frac{\partial\eta_0}{\partial\boldsymbol{\phi}_{\rm u}}=j\boldsymbol{P}_{\rm u}^2{\rm e}^{j\boldsymbol{\phi}_{\rm u}}\\
&\boldsymbol{\psi}(\boldsymbol{\phi}_{\rm u},\boldsymbol{\phi}_{\rm d})=\boldsymbol{P}_{\rm d}{\rm e}^{j(\boldsymbol{\phi}_{\rm u}+\boldsymbol{\phi}_{\rm d})} \qquad \boldsymbol{\Psi}(\boldsymbol{\phi}_{\rm u},\boldsymbol{\phi}_{\rm d})=\text{FT}^{-1}(\boldsymbol{\psi})\\
&\boldsymbol{\psi}_{\rm d}(\boldsymbol{\phi}_{\rm d})=\boldsymbol{P}_{\rm d}{\rm e}^{j\boldsymbol{\phi}_{\rm d}} \qquad \boldsymbol{\Psi}_{\rm d}(\boldsymbol{\phi}_{\rm d})=\text{FT}^{-1}(\boldsymbol{\psi}_{\rm d}).
\end{aligned} \tag{A.6}$$

Since the phases are expanded on a Zernike basis, we need the gradients of J^foc with respect to the Zernike coefficients a_{x_i} of phase φ_x. These gradients are given by the expression (Mugnier et al. 2001):

$$\frac{\partial J^{\text{foc}}}{\partial a_{{\rm x}_i}} = \sum_{k}\frac{\partial J^{\text{foc}}}{\partial \boldsymbol{\phi}_{\rm x}[k]}Z_i[k]. \tag{A.7}$$

The flux α and the constant background β are also estimated analytically during the minimization, considering that

$$J^{\text{p}}=\frac{1}{2}\sum_{t} \left |\frac{-\boldsymbol{i}_{\rm c}^{\text{p}}[t] + \alpha\,(\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}})[t]+\beta}{\boldsymbol{\sigma}_{\rm n}^{\text{p}}[t]} \right |^2, \tag{A.8}$$

where p stands for “foc” (focused) or “div” (diverse). For the sake of simplicity, we shall omit the variable t. We have

$$\begin{aligned}
\frac{\partial J^{\text{p}}}{\partial \alpha}=\,& \alpha\sum\frac{(\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}})^2}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} +\beta\sum\frac{\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} -\sum\frac{(\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}})\,\boldsymbol{i}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}\\
\frac{\partial J^{\text{p}}}{\partial \beta}=\,& \alpha\sum\frac{\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} +\beta\sum\frac{1}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} -\sum\frac{\boldsymbol{i}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}\cdot
\end{aligned} \tag{A.9}$$

Setting both derivatives to zero gives, in matrix form:

$$\begin{pmatrix}
\sum\frac{(\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}})^2}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} & \sum\frac{\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}\\
\sum\frac{\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2} & \sum\frac{1}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}
\end{pmatrix}
\begin{pmatrix} \alpha\\ \beta \end{pmatrix}
=
\begin{pmatrix}
\sum\frac{(\boldsymbol{h}_{\text{det}}\star\boldsymbol{h}_{\rm c}^{\text{p}})\,\boldsymbol{i}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}\\
\sum\frac{\boldsymbol{i}_{\rm c}^{\text{p}}}{{\boldsymbol{\sigma}_{\rm n}^{\text{p}}}^2}
\end{pmatrix}. \tag{A.10}$$

A simple matrix inversion then gives the analytical estimate of the flux α and the background β at each iteration.
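The 2 × 2 system of Eq. (A.10) can be illustrated with a short NumPy sketch. This is only a minimal illustration under stated assumptions, not the implementation used in COFFEE: the function name `estimate_flux_background` and its arguments are hypothetical, and `model` is assumed to already hold the convolved PSF h_det ⋆ h_c^p sampled on the detector grid.

```python
import numpy as np

def estimate_flux_background(image, model, sigma):
    """Solve the 2x2 normal equations of Eq. (A.10) for (alpha, beta).

    image : recorded coronagraphic image i_c^p (2-D array)
    model : h_det * h_c^p, the unit-flux model image (assumed precomputed)
    sigma : noise standard deviation map sigma_n^p (same shape as image)
    """
    w = 1.0 / sigma**2  # inverse noise variance weights
    lhs = np.array([[np.sum(w * model**2), np.sum(w * model)],
                    [np.sum(w * model),    np.sum(w)]])
    rhs = np.array([np.sum(w * model * image), np.sum(w * image)])
    alpha, beta = np.linalg.solve(lhs, rhs)
    return alpha, beta

# Synthetic, noiseless check: an image built as 3 * model + 0.5
# should return alpha close to 3 and beta close to 0.5.
rng = np.random.default_rng(0)
demo_model = rng.random((16, 16))
demo_image = 3.0 * demo_model + 0.5
demo_alpha, demo_beta = estimate_flux_background(
    demo_image, demo_model, np.ones_like(demo_model))
```

Solving the system with `np.linalg.solve` rather than forming the inverse explicitly is a numerical convenience; for a 2 × 2 matrix either approach gives the same result.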

Appendix B: Tip-tilt estimation downstream of the coronagraph

The tip-tilt downstream of the coronagraph (which represents the image position on the detector) strongly limits COFFEE’s performance. Indeed, we found that the phase estimation is accurate only when −100 nm rms ≤ a_i ≤ 100 nm rms, where a_i is the Zernike coefficient for tip or tilt (i ∈ {2,3}). Beyond this range, COFFEE is unable to properly estimate the phases φ_u and φ_d. This restricts the use of COFFEE on a bench, since the PSF must lie within a narrow range of positions on the detector.

To remove this limitation, we have developed a simple and fast method for estimating the tip-tilt downstream of the coronagraph before COFFEE’s estimation, based on the diversity image. This image is created by adding a known aberration $\boldsymbol{\phi}_{\rm div}=a_4^{\rm div}Z_4+a_5^{\rm div}Z_5$ ($a_4^{\rm div}=a_5^{\rm div}=80$ nm rms) to φ_u. Since the amplitude of this aberration is large (σ_{φ_div} = 113 nm rms), the speckles in the coronagraphic diversity image mainly originate in this diversity aberration. This is illustrated in Fig. B.1, which shows two diversity images: one computed with randomly generated phases φ_u (WFE 30 nm rms) and φ_d (WFE 10 nm rms), and another computed with no aberrations other than the diversity ones.
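As a quick check of the quoted amplitude, and assuming that the Zernike modes are orthonormal over the pupil (the Noll 1976 normalization, which is an assumption made here for illustration), the rms value of the diversity phase is the quadratic sum of its two coefficients:

$$\sigma_{\boldsymbol{\phi}_{\rm div}} = \sqrt{\left(a_4^{\rm div}\right)^2+\left(a_5^{\rm div}\right)^2} = 80\sqrt{2}\ \text{nm rms} \approx 113\ \text{nm rms}.$$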

Fig. B.1

Coronagraphic diversity images computed for an aberration φ_u + φ_div upstream and φ_d downstream of the coronagraph (left), and for the diversity aberration φ_div only (right). The shape of both images is mainly driven by the diversity aberration.

As can be seen in Fig. B.1, the pattern produced by the diversity phase φ_div is clearly identifiable. Our method consists in searching for this known pattern (known because we introduce the phase φ_div ourselves) in the diversity image $\boldsymbol{i}_{\rm c}^{\text{div}}$ by comparing it with a theoretical diversity image $\boldsymbol{i}_{{\rm c}_{th}}^{\text{div}}$, calculated with no aberrations other than the diversity ones:

$$\boldsymbol{i}_{{\rm c}_{th}}^{\text{div}}=\boldsymbol{h}_{\text{det}}\star \boldsymbol{h}_{\rm c}(\boldsymbol{\phi}_{\rm div},\boldsymbol{\phi}_{\rm d}=0). \tag{B.1}$$

The comparison of $\boldsymbol{i}_{{\rm c}_{th}}^{\text{div}}$ with $\boldsymbol{i}_{\rm c}^{\text{div}}$ is performed using the method developed by Gratadour et al. (2005), which consists in minimizing the following criterion J_TT:

$$J_{\text{TT}}(x,y)=\left \|\frac{\boldsymbol{i}_{\rm c}^{\text{div}}(x_o,y_o) - \boldsymbol{i}_{{\rm c}_{th}}^{\text{div}}(x_o,y_o) \star \boldsymbol{\delta}(x_o-x,y_o-y)}{\boldsymbol{\sigma}_{\rm n}^{\text{div}}}\right\|^2, \tag{B.2}$$

where δ is the Dirac delta function. Minimizing J_TT gives the shift [x_M, y_M] between the two images. It is then possible to calculate the corresponding tip (a₂) and tilt (a₃) downstream of the coronagraph, knowing the image sampling s:

$$a_2 = \frac{\pi}{2s}x_M \qquad a_3 = \frac{\pi}{2s}y_M\cdot \tag{B.3}$$

Finally, these estimated tip-tilt values are given to COFFEE as an input of the minimization, where they are used as initial values for the phase reconstruction. On our experimental images, this method provides a fast preliminary estimation (~1 s for a 256 × 256 image) of the tip-tilt downstream of the coronagraph with an accuracy of 1.5 nm rms, which is more than sufficient compared with the level of accuracy (±100 nm rms) required by COFFEE.
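The shift search and the conversion of Eq. (B.3) can be sketched as follows. This is a simplified stand-in, not the sub-pixel method of Gratadour et al. (2005): it assumes a uniform noise map and periodic (circular) shifts, in which case minimizing J_TT over integer-pixel shifts is equivalent to locating the peak of the cross-correlation between the two images. The function names `estimate_shift` and `shift_to_tip_tilt` are hypothetical, and the returned coefficients are expressed in whatever unit Eq. (B.3) implies.

```python
import numpy as np

def estimate_shift(image, reference):
    """Integer-pixel shift (x_M, y_M) of `image` relative to `reference`.

    Assumes uniform noise and circular shifts, so the minimum of J_TT
    coincides with the maximum of the FFT-based cross-correlation.
    """
    xcorr = np.fft.ifft2(np.fft.fft2(image) *
                         np.conj(np.fft.fft2(reference))).real
    iy, ix = np.unravel_index(np.argmax(xcorr), xcorr.shape)
    ny, nx = image.shape
    # Map peak indices above half the array size to negative shifts.
    if iy > ny // 2:
        iy -= ny
    if ix > nx // 2:
        ix -= nx
    return int(ix), int(iy)

def shift_to_tip_tilt(x_m, y_m, sampling):
    """Apply Eq. (B.3): a2 = pi/(2s) * x_M, a3 = pi/(2s) * y_M."""
    factor = np.pi / (2.0 * sampling)
    return factor * x_m, factor * y_m

# Synthetic check: shift a random reference by 3 rows and -5 columns.
rng = np.random.default_rng(0)
demo_reference = rng.random((32, 32))
demo_image = np.roll(demo_reference, shift=(3, -5), axis=(0, 1))
demo_shift = estimate_shift(demo_image, demo_reference)
demo_a2, demo_a3 = shift_to_tip_tilt(*demo_shift, sampling=2.0)
```

A real implementation would need sub-pixel precision and the actual noise map σ_n^div to reach the accuracy quoted in the text.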

All Tables

Table 1

COFFEE: simulation parameters used for the performance assessments of Sects. 3.1–3.3.

Table 2

COFFEE: simulation parameters for studying the aliasing error.

Table 3

COFFEE: error budget for the estimation of an aberration upstream of the coronagraph on BOA.

All Figures

Fig. 1

Coronagraphic imaging instrument: principle.

Fig. 2

Aberrations upstream (φ_u (WFE = 80 nm), top) and downstream (φ_d (WFE = 20 nm), bottom) of the coronagraph: reconstruction error (solid red line) as a function of the incoming flux α. For comparison, the theoretical $1/\alpha$ (cyan dashed line) and $1/\sqrt{\alpha}$ (magenta dashed line) behaviours are plotted for detector noise only and photon noise only, respectively.

Fig. 3

Reconstruction errors upstream (red line) and downstream (blue line) of the coronagraph as functions of the size of the source on the coronagraph.

Fig. 4

Reconstruction errors upstream (solid red line) and downstream (solid blue line) of the coronagraph as functions of the error on the diversity phase.

Fig. 5

Reconstruction errors upstream (top) and downstream (bottom) of the coronagraph as functions of the number of reconstructed Zernike modes, with a regularization metric (solid blue line) and without (solid red line).

Fig. 6

Reconstruction error upstream of the coronagraph with respect to the WFE of the aberration upstream of the coronagraph.

Fig. 7

Adaptive optics testbed schematic representation. M_i: fold mirrors; MP_i: parabolic mirrors; L_i: lenses (doublets); BS: beam splitter; TTM: tip-tilt mirror; DM: deformable mirror; RRPM: coronagraphic focal plane mask; Φ: prolate apodizer; WFS: AO wave-front sensor.

Fig. 8

Introduction of calibrated aberration on BOA: case of a pure spherical aberration. Left: theoretical wave-front (top) and DM introduced wave-front (bottom). Right: corresponding Zernike modes for the theoretical introduced aberration (solid red line) and the DM introduced aberration (dashed blue line).

Fig. 9

Estimation of a tilt aberration on BOA: calibration (solid blue line) and COFFEE’s estimation with bounds on the tip-tilt downstream of the coronagraph (dashed crossed red line) and without bounds (dashed diamond green line).

Fig. 10

COFFEE: NCPA estimation of an introduced phase φ_cal on BOA. Top: for an aberration +φ_cal, recorded coronagraphic image from the bench (left) and computed image using the reconstructed aberration $\hat{\boldsymbol{\phi}}_{\rm u}^{+}$ (right) (log. scale, same range for both images). Middle: same images for an introduced aberration −φ_cal and a reconstructed aberration $\hat{\boldsymbol{\phi}}_{\rm u}^{-}$ (log. scale, same range for both images). Bottom: calibrated introduced aberration (left) and COFFEE estimated aberration (right).

Fig. 11

PCL on the bench BOA (g_PCL = 0.5): variance of the residual static aberrations upstream of the coronagraph for the 36 COFFEE estimated Zernike modes (solid red line) and the 15 corrected modes (solid blue line). The magenta dashed line represents the ultimate performance one can reach according to the error budget detailed in Sect. 4.3.

Fig. B.1

Coronagraphic diversity images computed for an aberration φ_u + φ_div upstream and φ_d downstream of the coronagraph (left), and for the diversity aberration φ_div only (right). The shape of both images is mainly driven by the diversity aberration.


[1] Baudoz, P., Boccaletti, A., Baudrand, J., & Rouan, D. 2006, in Direct Imaging of Exoplanets: Science & Techniques, Proc. IAU Colloq. 200, eds. C. Aime, & F. Vakili (Cambridge, UK: Cambridge University Press), 553

[2] Baudoz, P., Mazoyer, J., Mas, M., Galicher, R., & Rousset, G. 2012, in Ground-based and Airborne Instrumentation for Astronomy IV, Proc. Soc. Photo-Opt. Instrum. Eng., 8446

[3] Beuzit, J.-L., Feldt, M., Dohlen, K., et al. 2007, in Proc. Conference In the Spirit of Bernard Lyot: The Direct Detection of Planets and Circumstellar Disks in the 21st Century, ed. P. Kalas (University of California, Berkeley, CA, USA)

[4] Bordé, P. J., & Traub, W. A. 2006, ApJ, 638

[5] Born, M., & Wolf, E. 1989, Principles of Optics (Pergamon Press)

[6] Give’on, A., Belikov, R., Shaklan, S., & Kasdin, J. 2007, Opt. Express, 15

[7] Gonsalves, R. 1982, Opt. Eng., 21

[8] Gratadour, D., Mugnier, L. M., & Rouan, D. 2005, A&A, 443, 357

[9] Guyon, O., Roddier, C., Graves, J., et al. 1999, PASP, 111

[10] Guyon, O., Matsuo, T., & Angel, R. 2009, ApJ, 693

[11] Kalas, P., Graham, J. R., Chiang, E., et al. 2008, Science, 322

[12] Lagrange, A.-M., Gratadour, D., Chauvin, G., et al. 2009, A&A, 493, L21

[13] Macintosh, B. A., Graham, J. R., Palmer, D. W., et al. 2008, in Adaptive Optics Systems, Proc. Soc. Photo-Opt. Instrum. Eng., 7015

[14] Marois, C., Macintosh, B., Barman, T., et al. 2008, Science, 322

[15] Mugnier, L. M., Robert, C., Conan, J.-M., Michau, V., & Salem, S. 2001, J. Opt. Soc. Am. A, 18, 862

[16] Mugnier, L. M., Fusco, T., & Conan, J.-M. 2004, J. Opt. Soc. Am. A, 21, 1841

[17] Mugnier, L. M., Blanc, A., & Idier, J. 2006, in Advances in Imaging and Electron Physics, ed. P. Hawkes (Elsevier), 141, 1

[18] Noll, R. J. 1976, J. Opt. Soc. Am., 66, 207

[19] Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 2007, Numerical Recipes: The Art of Scientific Computing (Cambridge University Press)

[20] Roddier, F., & Roddier, C. 1997, PASP, 109

[21] Sauvage, J.-F., Fusco, T., Rousset, G., & Petit, C. 2007, J. Opt. Soc. Am. A, 24, 2334

[22] Sauvage, J.-F., Mugnier, L. M., Rousset, G., & Fusco, T. 2010, J. Opt. Soc. Am. A, 27, A157

[23] Sauvage, J.-F., Mugnier, L., Paul, B., & Villecroze, R. 2012, Opt. Lett., 37, 4808

[24] Soummer, R., Aime, C., & Falloon, P. 2003, A&A, 397, 1161

[25] Soummer, R., Pueyo, L., Sivaramakrishnan, A., & Vanderbei, R. 2007, Opt. Express, 15

[26] Thiébaut, E. 2002, in Astronomical Data Analysis II, Proc. Soc. Photo-Opt. Instrum. Eng., 4847, 174

[27] Thiébaut, E., & Conan, J.-M. 1995, J. Opt. Soc. Am. A, 12, 485

[28] Trauger, J., Give’on, A., Gordon, B., et al. 2010, in Techniques and Instrumentation for Detection of Exoplanets III, Proc. Soc. Photo-Opt. Instrum. Eng., 6693

[29] Wallace, J. K., Burruss, R. S., Bartos, R. D., et al. 2010, in Adaptive Optics Systems II, Proc. Soc. Photo-Opt. Instrum. Eng., 7736