Instantaneous phase retrieval with the vector vortex coronagraph

P. Riaud; D. Mawet; A. Magette

doi:10.1051/0004-6361/201219614

Home

All issues

Volume 545 (September 2012)

A&A, 545 (2012) A151

Full HTML

Free Access

Issue		A&A Volume 545, September 2012


Article Number		A151
Number of page(s)		32
Section		Astronomical instrumentation
DOI		https://doi.org/10.1051/0004-6361/201219614
Published online		24 September 2012

A&A 545, A151 (2012)

Theoretical and optical implementation

P. Riaud¹^,2, D. Mawet³^,4 and A. Magette⁵^,2

¹ 60 rue des bergers, 75015 Paris, France
e-mail: riaud.pierre@gmail.com
² Université de Liège, 17 Allée du 6 Août, 4000 Sart Tilman, Belgium
³ European Southern Observatory, Alonso de Cordóva 3107, Vitacura, Santiago, Chile
e-mail: dmawet@eso.org
⁴ NASA-Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, CA 91109, USA
⁵ Techspace-Aero, route de Liers 121, 4041 Milmort, Belgium
e-mail: arnaud.magette@techspace-aero.be

Received: 16 May 2012
Accepted: 19 July 2012

Abstract

Coronagraphs are used as high-contrast imaging tools. However, it is well-known that the achievable contrast is primarily limited by wavefront aberrations in the optical train. Various kinds of devices and methods have been proposed to correct and calibrate these errors and, hence, improve the efficiency of coronagraphs. Here, we present an innovative idea that allows instantaneous measuring of the phase and the amplitude of residual stellar speckles in coronagraphic images. The technique is based on the unique polarization properties of the vector vortex coronagraph, which serves as a new type of phase diversity (POAM diversity), as well an extension of the Nijboer-Zernike theory of aberrations. We also propose and discuss a simple practical optical implementation of the technique, which only requires polarization splitting at the back-end of any existing vector vortex coronagraph systems.

Key words: instrumentation: high angular resolution / instrumentation: adaptive optics / methods: numerical / techniques: polarimetric

© ESO, 2012

1. Introduction

The direct detection of exoplanets is limited by both the contrast and the angular separation between the planet and its host star. For example, in the visible, Earth-like exoplanets would be 10¹⁰ times fainter than their host stars and located within a fraction of an arcsecond even for nearby systems. Many coronagraphs have been proposed to suppress the starlight diffracted light, but all of them are limited by the imperfections of the entrance wavefront: residual speckle patterns are the dominant source of noise in the high-contrast imaging. Even when using high-order or extreme adaptive optics (XAO), the performance of coronagraphs is still limited by the phase and the amplitude knowledge used for the dark-hole generation, and maintenance while acquiring science data (Bordé & Traub 2006; Give’on et al. 2007). We propose to measure the speckle phase and amplitude simultaneously with the science integration using the polarization properties of the vectorial vortex coronagraph (VVC), Mawet et al. (2005), Mawet et al. (2009). For that, we split the output pupil field into its two orthogonal circular polarization components with a simple circular polarization splitter (see Sect. 6). In this paper, we analytically demonstrate, using the polar Nijboer-Zernike diffraction theory of light (Magette 2010), that the polarized VVC coronagraphic images present sufficient diversity to retrieve the phase and the amplitude information of the wavefront at the telescope entrance pupil. The main advantage of the proposed optical implementation and phase retrieval scheme is its simplicity and quasi-instantaneous nature. It allows minimizing the non-common path wavefront errors in the optical system. It also presents a high transmission coefficient, higher than 90%.

The retrieved wavefront complex amplitude of the telescope pupil can be used directly as a synthetic reference image with image subtraction or as a dynamic speckles calibration system using adaptive optics corrections (amplitude and phase).

The paper is organized as follows: first, we briefly present the Nijboer-Zernike diffraction theory for low and high levels of aberrations in Sect. 2. The extension of the NZ theory for the vortex coronagraph is presented in Sect. 3, followed by a detailed presentation of the phase and amplitude retrieval procedure in Sect. 4. The overall system architecture adopted for our speckles calibration system is presented in Sect. 6. Finally, the phase retrieval accuracy under “end-to-end” numerical simulations is presented in Sect. 7.

2. The Nijboer-Zernike theory

The Nijboer-Zernike (NZ) theory emerged from the work of Nijboer (1943) on the diffraction theory of aberrations in polar coordinates. Nijboer first introduced the relation between the diffraction equation and the optical aberrations expressed in the form of Zernike polynomials. However, the complexity of the equations forced him to limit his theory to small aberrations, typically smaller than one wave (Nijboer 1947). Janssen (2002) later completed Nijboer’s theory by using an explicit Bessel series representation for the diffraction integral (high-order Hankel transforms). He also proposed a convenient way to numerically compute the expressions that involve these Bessel series (the $V_{n}^{m}$ $\hbox{$V_n^m$}$ functions, discussed below). This extended NZ theory allows one to quickly and analytically compute the intensity point spread function (PSF) of any complex system with circular pupil from its known aberrations.

2.1. Fraunhofer diffraction integral of an aberrated pupil in polar coordinates

The expression of the complex amplitude in the image plane U(r,φ) as a function of the pupil complex amplitude P(ρ,θ) can be calculated by the well-known Fraunhofer diffraction integral, expressed here in polar coordinates: $U (r,φ) = \frac{1}{π} \int_{0}^{1} \int_{0}^{2 π} P (ρ,θ) e^{(- 2 i πrρ \cos (θ - φ))} d θρ d ρ .$ $\begin{equation} \label{diffr} U(r,\phi) = \frac{1}{\pi} \int_{0}^{1} \int_{0}^{2\pi}P(\rho,\theta) \: {\rm e}^{(-2 {\rm i} \pi \: r \: \rho \: \cos (\theta - \phi))}\: {\rm d} \theta \rho \: {\rm d} \rho. \\ \end{equation}$ (1)The integration limits are defined by the pupil function 0 ≤ r ≤ 1 and 0 ≤ θ ≤ 2π. The pupil function itself can be expressed in terms of Zernike polynomials using classical Zernike coefficients α, or the generalized coefficients β, defined as follows (Magette 2010): $P (ρ,θ) = e^{i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m} (ρ,θ)} / P (ρ,θ) = \sum_{n,m} β_{n}^{m} Z_{n}^{m} (ρ,θ) α_{n}^{m}, β_{n}^{m} \in C .$ $\begin{equation} \label{pup_expr} P(\rho,\theta) = {\rm e}^{{\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m(\rho,\theta)} / P(\rho,\theta) = \sum_{n,m} \beta_n^m Z_n^m(\rho,\theta) \qquad \alpha_n^m ,\,\beta_n^m \in C. \end{equation}$ (2)The Zernike polynomials $Z_{n}^{m} (ρ,θ)$ $\hbox{$Z_n^m(\rho,\theta)$}$ are defined as usual: $Z_{n}^{m} (ρ,θ) = R_{n}^{m} (ρ) {\begin{matrix} \cos (mθ), evenpolynomials \\ \sin (mθ), oddpolynomials \\ 1, whenm = 0. \end{matrix}$ $\begin{equation} Z_n^m(\rho,\theta)=R_n^m(\rho) \left\{ \begin{array}{l} \cos (m\theta), \: \text{even polynomials}\\ \sin (m\theta), \: \text{odd polynomials}\\ 1, \: \text{when m=0.}\\ \end{array} \right. \end{equation}$ (3)where $R_{n}^{m}$ $\hbox{$R_n^m$}$ is a radial polynomial defined as $R_{n}^{m} (ρ) = \sum_{s = 0}^{\frac{n - m}{2}} C_{z} (n,m,s) ρ^{n - 2 s} .$ $\begin{equation} \label{eq:zern_rad} R_n^m(\rho)=\sum^{\frac{n-m}{2}}_{s=0} C_z(n,m,s) \rho^{n-2s}. \end{equation}$ (4)The radial polynomials $R_{n}^{m} (ρ)$ $\hbox{$R_n^m(\rho)$}$ are even or odd in ρ depending on the n,m values. Note that the polynomials corresponding to m = 0 are treated as even polynomials since cos(0) = 1. Wavefront surface development with Zernike polynomials, and the full expression of the C_z(n,m,s) coefficients is detailed in Appendix A.

2.2. General Zernike coefficients $β_{n}^{m}$ $\hbox{$\mathsfsl{\beta_n^m}$}$

Under the small aberrations assumption, the simplified expression of the pupil aberrations $e^{i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m}} \approx 1 + i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m}$ $\hbox{${\rm e}^{{\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m}\approx 1+ {\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m$}$ is generally sufficient to describe the entrance wavefront, and simply connects the $α_{n}^{m}$ $\hbox{$\alpha_n^m$}$ and $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients by identification of the terms in Eqs. (2). The $α_{n}^{m}$ $\hbox{$\alpha_n^m$}$ and $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients capture both phase and amplitude aberrations.

The use of general Zernike coefficients $β_{m}^{n}$ $\hbox{$\beta_m^n$}$ ¹ is preferable since they corresponds to the proper aberration basis of the NZ theory. However, the physical interpretation of the $β_{m}^{n}$ $\hbox{$\beta_m^n$}$ coefficients is not as easy as for the usual $α_{m}^{n}$ $\hbox{$\alpha_m^n$}$ Zernike coefficients. Here, for the sake of completeness and accuracy, we chose to use the generalized Zernike coefficients $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ . Subsequently, the Fraunhofer diffraction integral given in the Eq. (1) can be seen as a linear system: $U (r,φ) = \sum_{n,m} β_{n}^{m} U_{n}^{m} (r,φ) U_{n}^{m} (r,φ) = \frac{1}{π} \int_{0}^{1} \int_{0}^{2 π} Z_{n}^{m} (ρ,θ) e^{- 2 i πrρ \cos (θ - φ)} ρ d ρ d θ .$ $\begin{equation} U(r,\phi) = \sum_{n,m} \beta_n^m U_n^m (r,\phi) \qquad U_n^m(r,\phi) = \frac{1}{\pi} \int_{0}^{1} \int_{0}^{2\pi}Z_n^m(\rho,\theta) \: {\rm e}^{-2 {\rm i} \pi \: r \: \rho \: \cos (\theta - \phi)} \: \rho \: {\rm d} \rho \: {\rm d} \theta. \end{equation}$ (5) $U_{n}^{m} (r,φ)$ $\hbox{$U_n^m(r,\phi)$}$ is the image plane complex amplitude corresponding to the Zernike polynomial (n,m).

For the sake of simplicity, we split the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients into two categories: $β_{cn}^{m}$ $\hbox{$\beta_{\rm cn}^m$}$ for even Zernike polynomials (cos), and $β_{sn}^{m}$ $\hbox{$\beta_{\rm sn}^m$}$ for odd Zernike polynomials (sin), $U (r,φ) = \sum_{n,m} (β_{cn}^{m} U_{n}^{m} (r,φ)_{c} + β_{sn}^{m} U_{n}^{m} (r,φ)_{s}) .$ $\begin{equation} \label{eq:fullUnm} U(r,\phi) = \sum_{n,m} \left(\beta_{\rm cn}^m \: U_n^m (r,\phi)_{\rm c} + \beta_{\rm sn}^m \: U_n^m (r,\phi)_{\rm s} \right). \end{equation}$ (6)Detailed analytical expressions of the $U_{n}^{m} (r,φ)$ $\hbox{$U_n^m (r,\phi)$}$ complex amplitudes are given in Appendix C.

3. Extension of the NZ theory for the vector vortex coronagraph

3.1. The vector vortex voronagraph

The VVC is a transparent focal plane phase-mask that creates two opposite phase screw dislocations Exp $[\pm i l_{p} φ^{]}$ $\hbox{$\left[\pm {\rm i} l_{\rm p} \phi\right]$}$ , with l_p the topological charge or the photon orbital angular momentum (POAM, Poynting 1909; Yao & Padgett 2011), and φ the azimuthal coordinate. When the phase singularity is centered on the PSF, it redirects the starlight outside the pupil where it can be blocked by a Lyot stop (Mawet et al. 2005).

3.2. Field expression at the coronagraph

The effect of a VVC with a topological charge l_p on the complex amplitude of the PSF can be written as $U_{v} (r,φ, l_{p}) = \exp [i {(l_{p} φ - π / 2^{)}}^{]} \cdot U (r,φ),$ $\begin{equation} \label{eq:amp_vort} U_v(r,\phi,l_{\rm p}) = {\rm exp}{\left[{\rm i} \left( l_{\rm p} \phi-\pi/2 \right)\right]} \cdot U(r,\phi), \end{equation}$ (7)where l_p = 2 and l_p = −2 correspond to the right-handed and left-handed POAM in circular polarization vector basis, respectively.

Using the expression for U(r,φ) previously defined, the complex amplitude in the coronagraph image plane, U_v(r,φ,l_p) becomes: $U_{v} (r,φ, l_{p}) = 2 \sum_{n,m} β_{cn}^{m} i^{m} V_{n,m}^{l_{p}} (r,φ) C_{m} (φ) + 2 \sum_{n,m} β_{sn}^{m} i^{m} V_{n,m}^{l_{p}} (r,φ) S_{m} (φ) .$ $\begin{equation} \label{eq:NZzern2psfB0} U_v(r,\phi,l_{\rm p}) = 2\sum_{n,m} \beta_{\rm cn}^m \: {\rm i}^m \: V_{n,m}^{l_{\rm p}}(r,\phi) C_m(\phi) + 2\sum_{n,m} \beta_{\rm sn}^m \: {\rm i}^m \: V_ {n,m}^{l_{\rm p}}(r,\phi) S_m(\phi). \end{equation}$ (8)The C_m(φ) and S_m(φ) functions are defined as $\begin{matrix} - m = 0 : C_{m} (φ) = 1 and S_{m} (φ) = 0 \\ - 0 < m < n : C_{m} (φ) = \cos (mφ) and S_{m} (φ) = \sin (mφ) \\ - m = n : C_{m} (φ) = (\cos (mφ) - i Sign (l_{p}) \sin (mφ)^{)} / 2 and S_{m} (φ) = (\sin (mφ) + i Sign (l_{p}) \cos (mφ)^{)} / 2. \end{matrix}$ $\begin{eqnarray} \label{ceq} && - \, m=0\textrm{:} \qquad \qquad C_m(\phi)=1 \quad {\rm and} \quad S_m(\phi)=0 \\ && - \, 0<m<n\textrm{:} \qquad \,C_m(\phi)=\cos(m\phi)\quad {\rm and} \quad S_m(\phi)=\sin(m\phi) \\ && - \, m=n\textrm{:} \qquad \qquad C_m(\phi)=\left(\cos(m\phi)-{\rm i}\: {\rm Sign}(l_{\rm p})\sin(m\phi)\right)/2\quad {\rm and} \quad S_m(\phi)=\left(\sin(m\phi)+{\rm i}\: {\rm Sign}(l_{\rm p})\cos(m\phi)\right)/2.~~~~~~~~~~ \end{eqnarray}$ r,φ are the polar coordinates in the the coronagraphic image plane, and l_p, Sign(l_p) are the VVC topological charge and the chirality of the modulation, respectively.

The $V_{n,m}^{l_{p}} (r,φ)$ $\hbox{$V_{n,m}^{l_{\rm p}}(r,\phi)$}$ functions are defined for cosine and sinus modes as follows (see Appendices D and E): $V_{n,m}^{l_{p}} (r,φ) = ϵ_{l_{p}} (- 1)^{(n + m) / 2} \frac{J_{n + 1} (2 πr)}{2 πr} e^{i l_{p} φ} ϵ_{l_{p}} = - i, l_{p} \neq 0 ϵ_{l_{p}} = 1, l_{p} = 0.$ $\begin{equation} \label{vnmdef} V_{n,m}^{l_{\rm p}}(r,\phi)=\epsilon_{l_{\rm p}} (-1)^{(n+m)/2} \frac{J_{n+1}(2\pi r)} {2\pi r} {\rm e}^{{\rm i} l_{\rm p}\phi} \qquad \epsilon_{l_{\rm p}}=-{\rm i} \:,\: l_{\rm p}\neq0 \qquad \epsilon_{l_{\rm p}}=1 \:,\: l_{\rm p}=0. \end{equation}$ (12)The impact of the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ on the intensity distribution is to azimuthally modulate the residual aberrations. Note that the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ are normalized in intensity (see Appendix F). Let us now extract the dominant term (n = N, m = 0) from the sum and rewrite Eq. (7) as $U_{v} (r,φ,N) = 2 β_{N}^{0} (d_{c}) . V_{N, 0}^{l_{p}} (r,φ) + 2 \sum_{n,m}^{~} i^{m} β_{cn}^{m} (d_{c}) V_{n,m}^{l_{p}} (r,φ) C_{m} (φ) + 2 \sum_{n,m}^{~} i^{m} β_{sn}^{m} (d_{c}) V_{n,m}^{l_{p}} (r,φ) S_{m} (φ),$ $\begin{equation} \label{Uv} U_v(r,\phi,N)= 2\beta_N^0(d_{\rm c}).V_{N,0}^{l_{\rm p}}(r,\phi) +2\sum_{n,m}^{\sim}{\rm i}^m\:\beta_{\rm cn}^m(d_{\rm c}) \: V_{n,m}^{l_{\rm p}}(r,\phi) C_m(\phi) +2\sum_{n,m}^{\sim}{\rm i}^m\: \beta_{\rm sn}^m(d_{\rm c}) \:V_{n,m}^{l_{\rm p}}(r,\phi) S_m(\phi), \end{equation}$ (13)where the ~ symbol means that the term (n = N, m = 0) is absent from the sum. $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ is larger than the other $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ . This separation allows us to virtually create a dominant linear term with respect to the optical aberrations. In other words, as we will see in the following section, the $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ extraction from the sum yields the coupled modal functions $V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}$ $\hbox{$V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}$}$ , present in the linear term of the final intensity expression, which have the key property $V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *} ≫ V_{n,m}^{l_{p}} V_{n,m}^{l_{p} *}$ $\hbox{$V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*} \gg V_{n,m}^{l_{\rm p}} \: V_{n,m}^{l_{\rm p}*} \:$}$ .

3.3. Expression of the coronagraphic intensity

The action of the VVC is to redirect the field amplitude outside of the relayed pupil, conjugate to the entrance pupil. The coronagraphic suppression of starlight is then obtained by inserting a diaphragm smaller than the pupil diameter into this pupil plane, called the “Lyot stop”. The coronagraphic intensity in the camera plane downstream from the Lyot stop plane is (see Appendix M) $\begin{matrix} I_{c} (r,φ, l_{p}) & = & {| U_{v} (r,φ, l_{p}) |}^{2} \to I_{c} (r,φ, l_{p}) \\ = & 4 {(β_{N}^{0} (d_{c})^{)}}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2} + f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] + f^{(2)} [(β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]} . \end{matrix}$ $\begin{eqnarray} \label{Ic} I_{\rm c}(r,\phi,l_{\rm p}) &=& \left| U_v(r,\phi,l_{\rm p})\right|^2 \quad \rightarrow \quad I_{\rm c}(r,\phi,l_{\rm p}) \nonumber \\ &=&4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2 +f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right] +f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right]. \end{eqnarray}$ (14)It is composed of three different terms:

$4 {}^{(}β_{N}^{0} (d_{c})^{)}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2}$ $\hbox{$4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2$}$
$f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})]$ $\hbox{$f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]$}$ : a linear function of inner products between the $β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m(d_{\rm c})$}$ coefficients and the $β_{N}^{0} (d_{c})$ $\hbox{$\beta_N^0(d_{\rm c})$}$ term.
$f^{(2)}^{[} (β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]}$ $\hbox{$f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right]$}$ : a term quadratic in the $β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m(d_{\rm c})$}$ coefficients and cos/sin cross terms.

Note that in practice, the Lyot stop is always slightly undersized compared to the pupil, which in the present NZ theory simply yields a normalization of the radial Zernike polynomials with the diaphragm size d_c < 1. The general $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients then become a function of (d_c): $(β_{n}^{m} (d_{c}))$ $\hbox{$(\beta_n^m(d_{\rm c}))$}$ (see Sect. 5.2).

4. NZ phase retrieval theory

The NZ phase retrieval method is based on the projection of the measured PSF on the basis of template modes, which leads to a system of decoupled linear equations. We first assume linearity: $I_{c} \approx 4 {(β_{N}^{0} (d_{c})^{)}}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2} + f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] f^{(2)} [(β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]} < f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] .$ $\begin{equation} \label{eq_approx} I_{\rm c}\approx 4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2+f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right] \qquad f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right]<f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]. \end{equation}$ (15)However, note that the quadratic term will be accounted for later on by a recursive corrector approach (see Sect. 4.3).

In practice, its implementation is a two-step process. On one hand, it requires projecting the measured PSF on the basis of radial template modes by means of a polar Fourier transform: $Ψ_{meas}^{m} (r, l_{p}) = \frac{1}{2 π} \int_{0}^{2 π} I_{meas} (r,φ, l_{p}) e^{i mφ} d φ,$ $\begin{equation} \label{transfp} \Psi_{\rm meas}^{m}(r,l_{\rm p})=\frac{1}{2\pi} \int_0^{2\pi}I_{\rm meas}(r,\phi,l_{\rm p}){\rm e}^{{\rm i} m\phi} {\rm d}\phi, \end{equation}$ (16)where I_meas(r,φ,l_p) is the measured coronagraphic image, which depends on l_p, the topological charge.

On the other hand, the same projection is performed analytically on the final intensity expression Eq. (14): $Ψ_{c}^{m} (r, l_{p}) = \frac{1}{2 π} \int_{0}^{2 π} I_{c} (r,φ, l_{p}) e^{i mφ} d φ,$ $\begin{equation} \label{transfpc} \Psi_{\rm c}^{m}(r,l_{\rm p})=\frac{1}{2\pi} \int_0^{2\pi}I_{\rm c}(r,\phi,l_{\rm p}){\rm e}^{{\rm i} m\phi} {\rm d}\phi, \end{equation}$ (17)where I_c(r,φ,l_p) is the analytical expression of the final coronagraphic image for the topological charge l_p.

Comparing both the measured PSF decomposition and the analytically modeled one leads to the formation of decoupled systems of linear equations.

4.1. Modal analysis of the analytical intensity equation

The analytical expression of the linear term f⁽¹⁾ of the coronagraphic intensity is $\begin{matrix} f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] = & \sum_{n,m}^{~} [A_{c}^{m} \cdot 8 ℜ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) \cdot C_{m} (φ)] - \sum_{n,m}^{~} [B_{c}^{m} \cdot 8 ℑ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) \cdot C_{m} (φ)] \\ + \sum_{n,m}^{~} [A_{s}^{m} \cdot 8 ℜ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) \cdot S_{m} (φ)] - \sum_{n,m}^{~} [B_{s}^{m} \cdot 8 ℑ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) \cdot S_{m} (φ)] \end{matrix}$ $\begin{eqnarray} \label{flvr} f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]= && \sum_{n,m}^{\sim}\left[A_{\rm c}^m\cdot 8\Re\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\cdot C_m(\phi)\right] - \sum_{n,m}^{\sim}\left[B_{\rm c}^m\cdot 8\Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\cdot C_m(\phi)\right] \nonumber\\ &&+\, \sum_{n,m}^{\sim}\left[A_{\rm s}^m\cdot 8\Re\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\cdot S_m(\phi)\right] - \sum_{n,m}^{\sim}\left[B_{\rm s}^m\cdot 8\Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\cdot S_m(\phi)\right] \end{eqnarray}$ (18) $\begin{matrix} A_{c}^{m} = ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) A_{s}^{m} = ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) \\ B_{c}^{m} = ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) B_{s}^{m} = ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) . \end{matrix}$ $\begin{eqnarray} \label{flvrb} &&A_{\rm c}^m=\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c})) \qquad A_{\rm s}^m=\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c})) \\[2mm] && B_{\rm c}^m=\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c})) \qquad B_{\rm s}^m=\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c})).~~~~~~~~~~~~~~ \end{eqnarray}$ ℜ and ℑ are the real and imaginary parts, respectively. The complete demonstration is detailed in Appendix M.

To facilitate the analytical computation of Eq. (17) using Eq. (18), we introduce the following functions: $Ψ_{n,m}^{l_{p}} = - 8 ϵ_{m}^{-1} ℑ [i^{m} V_{n,m}^{l_{p}} . V_{N, 0}^{l_{p} *}] χ_{n,m}^{l_{p}} = 8 ϵ_{m}^{-1} ℜ [i^{m} V_{n,m}^{l_{p}} . V_{N, 0}^{l_{p} *}] .$ $\begin{equation} \label{innerprod} \Psi_{n,m}^{l_{\rm p}}=-8 \epsilon_{m}^{-1}\Im\left[{\rm i}^m V_{n,m}^{l_{\rm p}} . V_{N,0}^{l_{\rm p}*}\right] \qquad \chi_{n,m}^{l_{\rm p}}=8 \epsilon_{m}^{-1}\Re\left[{\rm i}^m V_{n,m}^{l_{\rm p}} . V_{N,0}^{l_{\rm p}*}\right]. \end{equation}$ (21)The Ψ and χ functions correspond to the phase and to the amplitude aberration templates, respectively. The full analytical computation presented in Appendix N demonstrates that the measured intensity $Ψ_{meas}^{m} (r, l_{p})$ $\hbox{$\Psi_{\rm meas}^{m}(r,l_{\rm p})$}$ in the polar Fourier plane is a linear combination of the phase and amplitude aberration templates $Ψ_{n,m}^{l_{p}}$ $\hbox{$\Psi_{n,m}^{l_{\rm p}}$}$ and $χ_{n,m}^{l_{p}}$ $\hbox{$\chi_{n,m}^{l_{\rm p}}$}$ .

We now multiply this result with $Ψ_{n,m}^{l_{p}}$ $\hbox{$\Psi_{n,m}^{l_{\rm p}}$}$ and $χ_{n,m}^{l_{p}}$ $\hbox{$\chi_{n,m}^{l_{\rm p}}$}$ , which produces the “inner” products of the aberration templates, physically corresponding to the autocorrelation between the phase and amplitude aberration templates: $\begin{matrix} (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = \int_{- \infty}^{+ \infty} \int_{0}^{+ \infty} Ψ_{n,m}^{l_{p}} . χ_{n^{'},m}^{l_{p} *} r d r d l_{p} \\ (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = \frac{1}{2 | l_{p} |} \sum_{- | l_{p} |}^{+ | l_{p} |} \int_{0}^{+ \infty} Ψ_{n,m}^{l_{p}} . χ_{n^{'},m}^{l_{p} *} r d r \forall l_{p} \in N . \end{matrix}$ $\begin{eqnarray} \label{f_inner_p} &&\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\int_{-\infty}^{+\infty}\int_0^{+\infty}\Psi_{n,m}^{l_{\rm p}}.\chi_{n',m}^{l_{\rm p}*}r{\rm d}r {\rm d}l_{\rm p} \\ &&\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\frac{1} {2|l_{\rm p}|}\sum_{-|l_{\rm p}|}^{+|l_{\rm p}|}\int_0^{+\infty}\Psi_{n,m}^{l_{\rm p}}.\chi_{n',m}^{l_{\rm p}*}r{\rm d}r \qquad \forall l_{\rm p} \in N. \nonumber \end{eqnarray}$ (22)It can easily be demonstrated that these inner products have the following properties: $\begin{matrix} (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = (χ_{n,m}^{l_{p}}, Ψ_{n^{'},m}^{l_{p}}) \forall n, n^{'} (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = 0 \forall n, n^{'} = m,m + 2, ... \\ ({| V_{N, 0}^{l_{p}} |}^{2}, Ψ_{n^{'}, 0}^{l_{p}}) = 0 \forall n^{'} V_{n,m}^{l_{p}} = V_{n,m}^{- l_{p} *} . \end{matrix}$ $\begin{eqnarray} \label{f_inner_p2} &&\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\left(\chi_{n,m}^{l_{\rm p}}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \quad \forall n\:,\:n' \qquad \left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right) = 0 \quad \forall n\:,\:n' = m,m+2,...\\ \nonumber &&\left(\left| V_{N,0}^{l_{\rm p}}\right|^2\:,\:\Psi_{n',0}^{l_{\rm p}}\right) = 0 \quad \forall n'\qquad V_{n,m}^{l_{\rm p}} = V_{n,m}^{-l_{\rm p}*}. \nonumber \end{eqnarray}$ (23)Using the inner products, it is now possible to build a linear system of equations, $\begin{matrix} G_{n, n^{'}}^{m, l_{p}} (Ψ^{)} . u [β_{n}^{m}] = r_{n^{'}}^{m, l_{p}} (Ψ^{)} G_{n, n^{'}}^{m, l_{p}} (χ) . u [β_{n}^{m}] = r_{n^{'}}^{m, l_{p}} (χ) \\ G_{n, n^{'}}^{m, l_{p}} (Ψ^{)} = (Ψ_{n,m}^{l_{p}}, Ψ_{n^{'},m}^{l_{p}}) G_{n, n^{'}}^{m, l_{p}} (χ) = (χ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) r_{n^{'}}^{m, l_{p}} (Ψ^{)} = (Ψ_{meas}^{m}, Ψ_{n^{'},m}^{l_{p}}) r_{n^{'}}^{m, l_{p}} (χ) = (Ψ_{meas}^{m}, χ_{n^{'},m}^{l_{p}}) \end{matrix}$ $\begin{eqnarray} \label{lse1} && G_{n,n'}^{m,l_{\rm p}}\left(\Psi\right).u\left[\beta_n^m \right]=r_{n'}^{m,l_{\rm p}}\left(\Psi\right) \qquad G_{n,n'}^{m,l_{\rm p}}\left(\chi\right).u\left[\beta_n^m \right]=r_{n'}^{m,l_{\rm p}}\left(\chi\right) \\ && G_{n,n'}^{m,l_{\rm p}} \left(\Psi\right) = \left(\Psi_{n,m}^{l_{\rm p}}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \qquad G_{n,n'}^{m,l_{\rm p}} \left(\chi\right) = \left(\chi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right) \qquad r_{n'}^{m,l_{\rm p}}\left(\Psi\right) = \left(\Psi_{\rm meas}^{m}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \qquad r_{n'}^{m,l_{\rm p}}\left(\chi\right) = \left(\Psi_{\rm meas}^{m}\:,\:\chi_{n',m}^{l_{\rm p}}\right) \nonumber \end{eqnarray}$ (24)where G is a Gram matrix, and is defined with all possible inner products coefficients. $u [β_{n}^{m}]$ $\hbox{$u\left[\beta_n^m \right]$}$ is the vector containing the unknown coefficients $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ , and r is a vector with the polar Fourier transform image analysis coefficients. Note that after the proper integration of the inner products the G matrix includes all three Pancharatnam topological charges − l_p,0, + l_p.

Fig. 1

Schematic view of the NZ vortex phase-retrieval process. It consists of using the real input coronagraphic images, corresponding to l_p = 0, ± 2 (the l_p = ± 2 images are provided by polarization splitting, while the unpolarized image (l_p = 0) is directly given by the sum of the two polarized images) projected on aberrations templates, and finding the unknown coefficients by resolving a system of linear equations. Left: the three images needed for the full aberration analysis. Center: linear systems to retrieve the $β_{N}^{0} (d_{c})$ $\hbox{$\beta_N^0(d_{\rm c})$}$ and the f⁽¹⁾ terms. Right: the predictor-corrector approach to evaluate the quadratic correction $f^{(2)}^{[} (β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]}$ $\hbox{$f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right]$}$ .

The r vector must be defined for three different cases: radial modes (m = 0), purely cosine modes, and purely sine modes. For radial modes, we use the cosine description. The end of Appendix N details the analytical expressions for all coefficients used in the matrix G and the two vectors u and r.

4.2. The f⁽¹⁾ term retrieval: solving a linear system of equations

Introducting the inner product (see Eq. (21)) allows us to calculate the linear term of the intensity equations (see Eq. (18)). The vectorial vortex complex amplitude retrieval is sensitive to the amplitude and the phase effects in the image. In practice, and to accelerate the convergence of the algorithm, we must bind the real part of the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ parameters (amplitude effect) to the most probable range of values. Indeed, amplitude and phase effects in the final intensity images are completely indistinguishable and potentially, many solutions exist. However, if we restrict ( $ℜ (β_{n}^{m}) < 0.1$ $\hbox{$\Re\left(\beta_n^m \right)<0.1$}$ ) to a physical solution, which corresponds to small amplitude errors, the algorithm converges quickly.

4.2.1. The $β_{N}^{0}$ $\hbox{$\mathsf{\beta_N^0}$}$ (d_c) coefficient

The first term $β_{0}^{0} (d_{c})$ $\hbox{$\beta_0^0(d_{\rm c})$}$ (piston) is equal to zero in the ideal case (Mawet et al. 2005). For a single-dish telescope, the piston phase is a gauge invariant and we chose 0 for both the imaginary parts of $β_{0}^{0}$ $\hbox{$\beta_0^0$}$ and $β_{0}^{0} (d_{c})$ $\hbox{$\beta_0^0(d_{\rm c})$}$ , $ℜ (β_{0}^{0}) \approx 1 ℑ (β_{0}^{0}) = 0 ℜ (β_{0}^{0} (d_{c})) \approx 0 ℑ (β_{0}^{0} (d_{c})) = 0.$ $\begin{equation} \Re(\beta_0^0)\approx1 \quad \Im(\beta_0^0)=0 \qquad \Re(\beta_0^0(d_{\rm c}))\approx0 \quad \Im(\beta_0^0(d_{\rm c}))=0. \nonumber \end{equation}$ $β_{2}^{0} (d_{c})$ $\hbox{$\beta_2^0(d_{\rm c})$}$ , the defocus term, is generally small, and can be easily minimized in practice. The first dominant purely radial mode is thus the spherical aberration ( $β_{N}^{0} (d_{c}),N = 4$ $\hbox{$\beta_N^0(d_{\rm c}),N=4$}$ ). The first set of unknown coefficients to retrieve, $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ , corresponds to the main aberration. When the images are projected onto the radial mode template basis $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ , all $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients will be compared to this $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ term. As we developed in Eqs. (8), the C_m(φ) and S_m(φ) functions show a great variability for different m modes. Each case must then be processed separately (see the NZ retrieval diagram in Fig. 1).

4.2.2. Resolution of the system for m = 0

If m = 0, the vector of unknown coefficients u can be written as $u_{1} = [({β_{N}^{0}}^{)} / 2, A_{c}^{0} (0), ..., A_{c}^{0} (n_{\max})^{]} u_{2} = [B_{c}^{0} (0), ..., B_{c}^{0} (n_{\max})^{]},$ $\begin{equation} \label{lse2} u_1=\left[\left(\beta_N^0 \right)/2, A_{\rm c}^0(0),...,A_{\rm c}^0(n_{\rm max})\right]\qquad u_2=\left[B_{\rm c}^0(0),...,B_{\rm c}^0(n_{\rm max})\right], \end{equation}$ (25)where n_max is the maximum number of modes. The first system (u₁) allows determining the $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ coefficient. Previously, we fixed the piston term $ℜ {}^{(}β_{00} (d_{c})^{)} \approx 0$ $\hbox{$\Re\left(\beta_0^0(d_{\rm c})\right)\approx 0$}$ but in the practical real-life case (Fresnel diffraction and finite coronagraphic mask), this coefficient is not zero. This piston term corresponds to the mask limitation in the considered optical system (chromaticity, F-number, small Lyot dot in the center, Lyot stop in the pupil plane, etc.) and must be calibrated on the optical system before astronomical use. In that case, the complex value of the $β_{0}^{0}$ $\hbox{$\beta_0^0$}$ coefficient can be fixed in the diffraction model and the $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ coefficient required for the retrieval is known thanks to the resolution of the first equation with ${}^{(}{β_{N}^{0}}^{)} / 2$ $\hbox{$\left(\beta_N^0 \right)/2$}$ , $A_{c}^{0} (0)$ $\hbox{$A_{\rm c}^0(0)$}$ and $B_{c}^{0} (0)$ $\hbox{$B_{\rm c}^0(0)$}$ .

4.2.3. Resolution of the system for 0 < m < n

If 0 < m < n, the vector of unknown coefficients u given by Eq. (24) can be written as $u_{1} = [A_{c}^{m} (0), ..., A_{c}^{m} (n_{\max})] u_{2} = [B_{c}^{m} (0), ..., B_{c}^{m} (n_{\max})] u_{3} = [A_{s}^{m} (0), ..., A_{s}^{m} (n_{\max})] u_{4} = [B_{s}^{m} (0), ..., B_{s}^{m} (n_{\max})] .$ $\begin{equation} \label{lse2bis} u_1=\left[A^m_{\rm c}(0),...,A^m_{\rm c}(n_{\rm max})\right]\qquad u_2=\left[B^m_{\rm c}(0),...,B^m_{\rm c}(n_{\rm max})\right]\qquad u_3=\left[A^m_{\rm s}(0),...,A^m_{\rm s}(n_{\rm max})\right]\qquad u_4=\left[B^m_{\rm s}(0),...,B^m_{\rm s}(n_{\rm max})\right]. \end{equation}$ (26)

4.2.4. Resolution of the system for m = n

In the case where m = n, the vector of unknown coefficients u has to be calculated separately, and the vector can be written as: $\begin{matrix} u_{1} = [A_{c}^{n} (0) + A_{s}^{n} (0), ..., A_{c}^{n} (n_{\max}) + A_{s}^{n} (n_{\max})] u_{2} = [B_{c}^{n} (0) + B_{s}^{n} (0), ..., B_{c}^{n} (n_{\max}) + B_{s}^{n} (n_{\max})] \\ u_{3} = [A_{c}^{n} (0) - A_{s}^{n} (0), ..., A_{c}^{n} (n_{\max}) - A_{s}^{n} (n_{\max})] u_{4} = [B_{c}^{n} (0) - B_{s}^{n} (0), ..., B_{c}^{n} (n_{\max}) - B_{s}^{n} (n_{\max})] . \end{matrix}$ $\begin{eqnarray} \label{lse3} &&u_1=\left[A^n_{\rm c}(0)+A^n_{\rm s}(0),...,A^n_{\rm c}(n_{\rm max})+A^n_{\rm s}(n_{\rm max})\right]\qquad u_2=\left[B^n_{\rm c}(0)+B^n_{\rm s}(0),...,B^n_{\rm c}(n_{\rm max})+B^n_{\rm s}(n_{\rm max})\right]\\ &&u_3=\left[A^n_{\rm c}(0)-A^n_{\rm s}(0),...,A^n_{\rm c}(n_{\rm max})-A^n_{\rm s}(n_{\rm max})\right]\qquad u_4=\left[B^n_{\rm c}(0)-B^n_{\rm s}(0),...,B^n_{\rm c}(n_{\rm max})-B^n_{\rm s}(n_{\rm max})\right].\nonumber \end{eqnarray}$ (27)The complete demonstrations for the $A_{c}^{0}, B_{c}^{0}, A_{c}^{m}, B_{c}^{m}, A_{s}^{m}, B_{s}^{m}, A_{c}^{n}, B_{c}^{n}, A_{s}^{n}, B_{s}^{n}$ $\hbox{$A_{\rm c}^0,B_{\rm c}^0,A^m_{\rm c},B^m_{\rm c},A^m_{\rm s},B^m_{\rm s},A^n_{\rm c},B^n_{\rm c},A^n_{\rm s},B^n_{\rm s}$}$ coefficients are detailed in Appendix N.

4.3. The quadratic correction f⁽²⁾: the predictor-corrector approach

Keeping only the linear term of the final coronagraphic intensity allows us to simplify the retrieval process using linear algebra only. This approximation allows determining the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients with a precision of a few percents. However, in a coronagraphic system, coupling between phase and amplitude appears due to the crossed terms in “sin / cos”. These terms are also quadratic and are taken into account in the predictor-corrector analysis.

In this section, we introduce an improvement that allows us to account for the quadratic terms. When a coronagraphic device is used, $ℜ (β_{0}^{0})$ $\hbox{$\Re(\beta_0^0)$}$ is attenuated and the purely quadratic phase terms and the amplitude-phase coupled terms are of the same order of magnitude. They should thus be taken into account. Hereafter, we present a simple way of doing this using a predictor-corrector approach. This technique proceeds as follows: $I_{c} = f^{(1)} + f^{(2)} f^{(2)} < f^{(1)} .$ $\begin{equation} \label{pc_eqs1} I_{\rm c}=f^{(1)}+ f^{(2)} \qquad f^{(2)}< f^{(1)}. \end{equation}$ (28)The retrieval approach presented previously is based on the following simplification: I_c ≈ f⁽¹⁾.

Solving this equation leads to a first approximation $β_{n}^{' m}$ $\hbox{$\beta_n^{'m}$}$ of $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ , which corresponds to an image $I_{c}^{'}$ $\hbox{$I'_{\rm c}$}$ such that $I_{c}^{'} - f^{(2)} = f^{(1)}$ $\hbox{$ I'_{\rm c}- f^{(2)} = f^{(1)}$}$ .

$f^{(2)} = I_{c}^{'} - I_{c}$ $\hbox{$ f^{(2)}=I'_{\rm c}-I_{\rm c}$}$ is then defined based on approximating the exact quadratic term.

After several iterations, $β_{n}^{' m}$ $\hbox{$\beta_n^{'m}$}$ tends to $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ if, and only if, f⁽²⁾ < f⁽¹⁾.

The predictor-corrector approach as in the classical NZ phase retrieval (Magette 2010) leads to a better estimation of the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients.

4.4. Phase amplification

An optical vortex phase-mask “amplifies” the small input pupil phase error ( $i α_{n}^{m}$ $\hbox{${\rm i}\alpha_n^m$}$ ) on the output coronagraphic pupil. As already suggested in Sect. 4, the first term $ℜ (β_{0}^{0} (d_{c})) \approx 0$ $\hbox{$\Re(\beta_0^0(d_{\rm c}))\approx 0$}$ . This is why in the coronagraphic image plane all wavefront errors in the entrance pupil plane are enhanced by the phase-mask coronagraphic device. Indeed, in the proposed phase-retrieval analysis, we compare all Zernike polynomials with the first non-zero $β_{N}^{0} (d_{c})$ $\hbox{$\beta_N^0(d_{\rm c})$}$ term in the linear system equations to avoid numerical singularities. A good estimation of the $β_{N}^{0} (d_{c})$ $\hbox{$\beta_N^0(d_{\rm c})$}$ term remains mandatory to allow a correct wavefront retrieval.

4.5. The $β_{0}^{0}$ $\hbox{$\mathsf{\beta_0^0}$}$ coefficient

The $β_{0}^{0} (d_{c})$ $\hbox{$\beta_0^0(d_{\rm c})$}$ coefficient remains a free variable, and it is naturally obtained with the value of the nulling factor.

Indeed, the nulling coefficient ϵ_c can be defined as the sum of all residual aberrations in the coronagraphic pupil plane, $ϵ_{c} = \sum_{n,m} {| β_{cn}^{m} (d_{c}) + β_{sn}^{m} (d_{c}) |}^{2} .$ $\begin{equation} \epsilon_{\rm c}=\sum_{n,m} \left|\beta_{\rm cn}^m(d_{\rm c})+\beta_{\rm sn}^m(d_{\rm c})\right|^2. \end{equation}$ (29)We calibrate the nulling factor with an instrumental PSF of the optical system without any coronagraphic device (the l_p = 0 term). In our NZ development, the sum of the two circular polarization images can be used as l_p = 0 image. This composite image can be seen as the instrumental PSF from which the perfect Airy pattern |J₁(r)/r|² has been subtracted.

4.6. Optimum number of images for phase retrieval

In the classical phase diversity algorithm (Gonsalves 1982; Blanc et al. 2003), the estimation of the entrance aberrations from the sole focused image does not ensure the uniqueness of the solution. Indeed, two different aberrations can produce the same PSF in the image plane. At least, two images with known phase variations are needed to remove this indetermination. If the phase diversity is performed in the pupil plane as in Roddier & Roddier (1993), the result is the same: two images of defocused pupil are needed. The indetermination is fully removed if the two images are π phase-shifted. This is the case of phase retrieval using classical NZ theory (with focus variation ± f), and in this modified NZ theory for the VVC device (with POAM modulation ± l_p).

In coronagraphy, another indetermination appears: the residual speckle pattern changes in intensity with the entrance Strehl ratio due to the induced variation of the coronagraph rejection factor, but not in overall morphology. Therefore, the absolute values of $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ need to be retrieved using a good estimation of $β_{0}^{0}$ $\hbox{$\beta_0^0$}$ , $β_{N}^{0}$ $\hbox{$\beta_N^0$}$ (see the previous section). For that, a third image, such as a simple PSF is the key to ensure a true phase retrieval. Moreover, for space telescopes, the PSF allows taking “telescope breathing” (equivalent to Strehl variations) effects into account.

5. Phase retrieval in the presence of practical limitations

In addition to phase and amplitude wavefront errors, starlight suppression is also limited by intrinsic properties of the optical system and features of the coronagraphic device: for instance, the telescope central obscuration (see Appendix H.2) and support structures (see Appendix L). In the following, we review the impact of these characteristics on the phase retrieval process.

5.1. Annular pupil

For an on-axis telescope with a central obscuration ϵ (ϵ < 1), the full aberration retrieval can be obtained directly by replacing the C_z(n,m,s) Zernike coefficients (see Eq. (4)) with the C_z(n,m,s,ϵ) using the normalization described in Mahajan (1981a,b), $C_{z} (n,m,s) \to C_{z} (n,m,s,ϵ) R_{n}^{m} (ρ) \to R_{n}^{m} (ρ,ϵ) Z_{n}^{m} (ρ,θ) \to Z_{n}^{m} (ρ,θ,ϵ) V_{n,m}^{l_{p}} (r,φ) \to V_{n,m}^{l_{p},ϵ} (r,φ) .$ $\begin{equation} \label{annular_base} C_z(n,m,s) \rightarrow C_z(n,m,s,\epsilon) \qquad R_n^m(\rho)\rightarrow R_n^m(\rho,\epsilon)\qquad Z_n^m(\rho,\theta)\rightarrow Z_n^m(\rho,\theta,\epsilon)\qquad V_{n,m}^{l_{\rm p}}(r,\phi)\rightarrow V_{n,m}^{l_{\rm p},\epsilon}(r,\phi). \end{equation}$ (30)Even though the complex amplitude pupil retrieval process presented in this paper is fully applicable (see Appendix B) to on-axis telescopes, for the sake of simplicity, the ϵ coefficient is omitted.

5.2. Impact of the Lyot stop diameter

The main effect of the diaphragm d_c (Lyot stop) in the coronagraphic pupil plane (Lyot plane) is to reduce the coherent term of the diffraction by a factor ϵ_c. The second effect of the Lyot stop is to rescale the Zernike radial polynomial $R_{n}^{m}$ $\hbox{$R_n^m$}$ as $\sum_{n,m} β_{n}^{m} Z_{n}^{m} (ρ,θ) \to \sum_{n,m} β_{n}^{m} (d_{c}) Z_{n}^{m} (ρ,θ) d_{c} < 1.$ $\begin{equation} \label{scale_rho} \sum_{n,m} \beta_n^m Z_n^m(\rho,\theta) \rightarrow \sum_{n,m} \beta_n^m(d_{\rm c}) Z_n^m(\rho,\theta) \quad d_{\rm c}<1. \end{equation}$ (31)We derived a recurrence formula to calculate the $β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m(d_{\rm c})$}$ in Appendix I, yielding, with n = m,m + 2,...: $β_{n}^{m} (d_{c}) = \sum_{n^{'}} β_{n^{'}}^{m} \cdot [R_{n^{'}}^{n} (d_{c}) - R_{n^{'}}^{n + 2} (d_{c})^{]} β_{n}^{m} = (n + 1) \sum_{n^{'}} β_{n^{'}}^{m} (d_{c}) \cdot \frac{[R_{n}^{n^{'}} (d_{c}) - R_{n}^{n^{'} + 2} (d_{c})^{]}}{(n^{'} + 1)},$ $\begin{equation} \label{new_orth_finside} \beta_n^m(d_{\rm c})=\sum_{n'} \beta_{n'}^m \cdot \left[ R_{n'}^n(d_{\rm c})-R_{n'}^{n+2}(d_{\rm c}) \right] \qquad \beta_n^m= (n+1) \sum_{n'} \beta_{n'}^m(d_{\rm c}) \cdot \frac{\left[ R_{n}^{n'}(d_{\rm c})-R_{n}^{n'+2}(d_{\rm c}) \right]} {(n'+1)}, \nonumber \end{equation}$ where $R_{n}^{n + 2} (d_{c}) = 0$ $\hbox{$R_{n}^{n+2}(d_{\rm c})=0$}$ and $R_{n}^{n + 2} (1) = 1$ $\hbox{$R_{n}^{n+2}(1)=1$}$ . The $β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m(d_{\rm c})$}$ coefficients are simply normalized to the size of the diaphragm in the Lyot plane.

5.3. Lyot stop optimization

The Lyot plane field expression is presented in Appendix D under the assumption of Fraunhofer diffraction, and in Appendix H under the Fresnel propagation assumption, respectively. The Fresnel number $N = d_{c}^{2} / λz$ $\hbox{$N=d_{\rm c}^2/\lambda z$}$ is generally larger than 100, where d_c is the radius of the Lyot stop and z is the propagation distance. The Fresnel case thus only shows a slight blurring effect of the pseudo-Zernikes polynomials given in the Fraunhofer approximation. Therefore, the Fraunhofer diffraction is sufficient to describe coronagraphic aberration residuals in the pupil plane. As far as the pupil edge effect is concerned, the Lyot stop has to cover enough residual rings inside the pupil plane to ensure a proper nulling ratio (see Appendix K).

5.4. The imperfect vortex device

The VVC manufacturing imperfections lead to the following error terms:

The phase-shift error with respect to π (chromaticity).
The region in the center of the mask may present a deviation from the overall large-scale pattern called the region of disorientation (this defect largely depends on the technological approach chosen to manufacture the vortex device).
The finite size of the mask in the coronagraphic image plane.

These terms can be taken into account in the coronagraphic NZ retrieval by a simple Fourier simulation to estimate the $β_{0}^{0}$ $\hbox{$\beta^0_0$}$ term (see Appendices J and L for more details).

6. Practical implementation

In this section, we present the instrumental concept we propose to instantaneously acquire the phase-amplitude information about the residual wavefront errors. We propose to use a simple Wollaston device coupled to a pair of achromatic quarter-wave plates to separate the coronagraphic images into left and right circular polarizations, allowing for a simple optical implementation that minimizes non-common path errors. Figure 2 illustrates the optical implementation downstream of the Lyot stop.

Fig. 2

Polarization splitter analysis: possible implementation of the polarization splitter system used after the Lyot stop. An achromatic quarter-wave plate (quartz/MgF₂) and a Wollaston in MgF₂ combined here with a simple achromatic doublet. This simple optical implementation allows us to image the two residual coronagraphic images in the left and right circular polarization basis on the detector. The two images presented in this figure in logarithmic gray scale show the two polychromatic PSF (650–900 nm) obtained by this simple scheme for the two circular polarization images. Note that the PSF shows a small chromatic smearing residual on the x-axis (Strehl = 96.5%) due to the substantial wavelength bandpass. This polychromatic smearing is detailed in Appendix P.

6.1. Tolerancing of the optical design

To show the validity of our approach, we considered potential sources of disturbances one by one. First of all, the entrance pupil introduces its own time-evolving phase defects due to polishing errors and time-dependent thermal effects. The latter affect low-order Zernikes and will be simulated with a power spectral density (PSD) of f^-2. The coronagraphic device possesses its own limitation (manufacturing defect and chromatism, see Sect. 5.4), which need to be taken into account as well.

Thick optical devices such as quarter-wave plates, Wollaston and the lens/mirrors are the sources of four types of second-order aberrations:

Polishing errors: represented by a PSD with wavefront error (WFE) < λ/50 rms for low spatial frequencies, and < 0.2 nm rms in the roughness scale (see optical specifications in Riaud et al. 2003).
Non-common path aberration: due to the small beam separation angle, it is expected that only roughness will play a significant role; we chose to represent it with two phase screens of 0.2 nm rms each.
Light scattering in thick materials, ghost features: scattering limitation and ghost features of the proposed calibration analysis system are calculated directly with the scattering function (see the Appendix O) and the Zemax software.
Residual polarization ellipticity due to non-ideal retardation over the entire wavelength range: the quarter-wave plates were optimized Riaud (2003), and we obtained a plate thickness of 0.24104 mm and 0.30792 mm at 20°C for the MgF₂ and the quartz material, respectively. These thicknesses must be controlled to 1 μm with a Babinet compensator and the plate temperature must be controlled to ±0.5°C. The residual phase error becomes σ² = 0.021 radian for the entire spectral bandpass.

The numerical simulations show that the scattering in the glass material is about 10^-6−10^-7 of the entrance residual coronagraphic starlight, which is not a problem even for Earth-like planet detection provided that the first coronagraphic stage is efficient enough to remove most of the starlight (ϵ_c < 10^-4).

7. End-to-end modeling

This section presents thorough numerical simulations. The goal of these simulations is to show that the entrance pupil wavefront phase can be retrieved even in the presence of real-life optical imperfections, and to which level of accuracy it can be retrieved. Usually, we model a coronagraphic instrument using FFT-based optical propagation; three FFT are needed to fully simulate a coronagraphic image. Here, we used our NZ modal decomposition $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ of the entrance aberrations seen through the VVC to directly construct the final image. Before going into the details of the end-to-end modeling, we tested our analytical method vs. a FFT-based propagation prescription to show their equivalence.

7.1. Analytical vs. Fourier propagation

To study the accuracy of the modal decomposition with the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ analytical functions, we performed numerical simulations with FFT and the direct expression of modal functions. The entrance pupil is a perfect unobscured circular pupil affected by wavefront (polishing) errors described by a set of Zernikes up to n = 860 with a weighting of 1/(n + 1)², leading to an initial Strehl ratio of 95%. Note that the FFTs must be performed with large arrays (2K × 2K) or (4K × 4K) to minimize aliasing effects. Super-sampling is also used in the coronagraphic plane. All simulations are preformed for the following two cases: the β_n,m representation in the pupil plane ( $\sum β_{n}^{m} \cdot Z_{n}^{m}$ $\hbox{$\sum \beta^m_n \cdot Z_n^m$}$ ) and the α_n,m classical representation ( $e^{i \sum α_{n,m} Z_{n}^{m}}$ $\hbox{${\rm e}^{{\rm i}\sum \alpha_{n,m}Z^m_n}$}$ ). The Nijboer-Zernike uses the first pupil representation and is not limited to high Strehl ratio (see Appendix A for a full analysis). Figure 3 compares the analytical NZ functions under the two pupil representations. Note that FFT simulations always show a high-frequency residual noise. Moreover, the FFT propagation through the VVC acts as a high-pass filter, artificially minimizing the residual coronagraphic stellar flux in the final image near the center. The NZ $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ analytical functions do not present these problems: they are very fast and more accurate. For a complete comparison between the semi-analytical and pure numerical method, see Appendix G.

7.2. Summary of simulation parameters

We present realistic polychromatic simulations incorporating the various defects detailed above.

Using the first 860 Zernikes (40 complete modes), the input Strehl ratio is set to 95% @ 650 nm.
An imperfect phase-mask: where the s-transmittance is 97%, the p-transmittance is 98% and the local phase retardance is π ± Δφ, while following a quadratic law (see Eq. (P.1) in Appendix P.2).
The quarter-wave plates have an absolute phase dependence equal to (λ in μm): φq_λ = 0.3442 + 2.94887λ − 1.74126λ².
Common and non-common path errors are λ/71 rms @ 650 nm both (the total is λ/50 rms at 650 nm).
Polishing error: DSP f^-2 and non-common path error.
Polychromatic speckle smearing due to residual Wollaston chromatism given by the Zemax model.
Photon noise, readout noise (6 e⁻), full-well capacity of 10⁵e⁻, residual flat of 1% rms.

These inputs are commented on in Appendix P, where we also present the full sets of images.

Fig. 3

Numerical simulation illustrating the principle of Nijboer-Zernike retrieval applied on the vortex coronagraph. (Up to down), numerical simulation for each circular polarization with l_p = 2 and l_p = −2 respectively using the first 860 Zernike polynomials (40 complete modes) and a Lyot stop of 99%. The Strehl ratio of the PSF before coronagraphic filtering is 95%. The Lyot stop remove the strong diffraction value in ρ = 1 but allows us to show all images without scaling the $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ coefficients ( $β_{n}^{m} \to β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m \rightarrow \beta_n^m(d_{\rm c})$}$ ). (Left to right), final monochromatic (λ = 650 nm) coronagraphic image obtained with the sum of $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ analytical functions, the FFT of the direct sum of Zernike polynomials $(\sum β_{n}^{m} \cdot Z_{n}^{m})$ $\hbox{$(\sum \beta^m_n \cdot Z_n^m)$}$ in the pupil plane, and finally, the classical phase function obtained by $e^{i \sum α_{n}^{m} \cdot Z_{n}^{m}}$ $\hbox{${\rm e}^{{\rm i}\sum \alpha^m_n \cdot Z_n^m}$}$ . Due to the difference of basis, the final simulation with the classical phase function presents minor discrepancies in the speckle background (I_c < 10^-7, I_c is the coronagraphic final intensity). The image scales are not linear $(I_{c}^{1 / 4})$ $\hbox{$(I_{\rm c}^{1/4})$}$ .

7.3. The accuracy functions

To quantify the quality of the proposed modal decomposition based on the NZ theory, we define the following $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ function: $χ_{l}^{2} = \sum_{pixels} \frac{{| I_{FFT} - I_{V_{n,m}^{l_{p}}} |}^{2}}{I_{n}} \cdot$ $\begin{equation} \label{likelihood} \chi^2_l=\sum_{\rm pixels} \frac{\left|I_{\rm FFT}-I_{V^{l_{\rm p}}_{n,m}}\right|^2} {I_n}\cdot \end{equation}$ (32)This metric is applicable for all cases (monochromatic and polychromatic). But for a proper knowledge of the modal decomposition quality under adaptive optics correction (Krist et al. 2011), we can define the Err(σ_A) function $Err (σ_{A}) = \frac{σ ({| A_{FFT} - A_{V_{n,m}^{l_{p}}} |}^{2})}{\max (PSF)} \cdot$ $\begin{equation} \label{accuracy_ampl} Err(\sigma_{\rm A})=\frac{\sigma\left(\left|A_{\rm FFT}-A_{V^{l_{\rm p}}_{n,m}}\right|^2\right)}{\max (PSF)}\cdot \end{equation}$ (33)This function is applicable if we know the amplitude function $A_{FFT} / A_{V_{n,m}^{l_{p}}}$ $\hbox{$A_{\rm FFT}/A_{V^{l_{\rm p}}_{n,m}}$}$ of the coronagraphic residual. Indeed, this metric is only applicable in the monochromatic case. Numerical simulations for all Zernike modes for the monochromatic case are performed in Appendix G. This procedure allows us to estimate the robustness of our method in the two metrics. Table 1 presents the numerical accuracy of the NZ theory in the two presented metrics.

Table 1

NZ accuracy metrics (see Appendix G).

7.3.1. Polychromatic numerical simulations

Phase-mask coronagraphs are inherently chromatic, and so is the pure physical propagation process (see Fig. 4). The NZ phase retrieval must take that effect into account. A full set of simulated images is shown in Appendix P. We scanned all physical parameters (spectral bandpass and phase-shift error of the mask), including photon and detector noises below.

Fig. 4

Polychromatic coronagraphic simulation for l_p = 2 with Δλ = 250 nm of spectral bandpass and a phase error of Δφ = ± 0.001 radian for the vortex mask. We also include defects of our optical implementation. Left: without the speckle smearing. Right: with the Wollaston speckle smearing. The chromatic effect on the speckles is small in the two cases, the contrast remains high. The brightness scale is the same between the two images and is not linear.

7.3.2. RESULTS without photon noise

Here we test the maximum likelihood ( $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ minimization) of the VVC coronagraphic images with the sum of monochromatic $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ modal functions (see Appendix M). The first set of simulations is given without photon and readout noises. This process allows us to determine the global NZ phase retrieval behavior with respect to the wavelength bandpass and the phase-shift error on the coronagraphic device. Owing to the larger bandpass, we tuned the sampling of the NZ images. Numerical simulations are presented in Fig. 5, which shows the $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ value variation as a function of the wavelength bandpass and the maximum phase-shift error of the vortex phase-mask (10^-1/10^-2/10^-3 radian).

Fig. 5

$χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values as a function of the wavelength bandpass and a phase-mask with a 10^-1/10^-2/10^-3 radian of phase-shift error. The black lines correspond to $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ for the in: $β_{n}^{m} - \to out$ $\hbox{$\beta^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval, the gray lines are for the in: $α_{n}^{m} - \to out$ $\hbox{$\alpha^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval: the solid lines are for the “p” modes and the dashed lines are for the “m” modes. The x and y scales are common between the three images. The $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values for of 10^-2 rad for the mask error are poorer than in the previous simulations. The 10^-1 rad case presents significant limitations because of a strong peak in the image center (β_0,0 = 0.04). The dotted line with $Err (σ_{A}) = 10^{-10} / χ_{l}^{2} = 0.1$ $\hbox{$Err(\sigma_A)=10^{-10}/\chi^2_l=0.1$}$ and $Err (σ_{A}) = 10^{-9} / χ_{l}^{2} = 1$ $\hbox{$Err(\sigma_A)=10^{-9}/\chi^2_l=1$}$ shows the chromatic limit of a desired retrieval precision in the “AO metric”.

We notice that simulations are presented for the two input basis $α_{n}^{m}$ $\hbox{$\alpha^m_n$}$ and $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ , but the second pupil phase decomposition is better suited for the NZ analysis. These simulations also show that the bandpass sets a limit of Err(σ_A) = 10^-10 on the precision of the input electric field. The main result for our 95% of input Strehl ratio is that the NZ decomposition in the $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ basis must be narrow band (Δλ = 65 nm or R = 10) to obtain the desired precision on the wavefront error. Now, with the photon limited image sets, we are only interested in the in: $β_{n}^{m} - \to out$ $\hbox{$\beta^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ modal decomposition.

7.3.3. RESULTS on photon-noise limited images

The previous simulations shows that the monochromatic modal decomposition is good with a small $χ_{l}^{2} < 1$ $\hbox{$\chi^2_l<1$}$ for a Δλ < 100 − 150 nm. In this section, we present noisy simulations using the same monochromatic NZ modal functions to compare results with the previous ones. Figure 6 shows NZ retrieval results for coronagraphic images realistically limited by simulated detector (read-out,flat) and photon noises. A satisfying wavefront retrieval can be obtained after averaging at least 100 exposures. The gain for more exposures is only on the high order Zernike modes (n > 20).

To increase the precision of the retrieval with a broad bandwidth, we developed in Appendix E a polychromatic $V_{n,m}^{l_{p},λ} (r)$ $\hbox{$V_{n,m}^{l_{\rm p},\lambda}(r)$}$ set of functions. These functions take into account the contrast loss in the Bessel J rings due to the broad bandwidth. This effect would increase the robustness of the NZ retrieval. This new feature is added in the NZ algorithm.

Fig. 6

$χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values as a function of the signal-to-noise ratio in the input coronagraphic images for a in: $β_{n}^{m} - \to out$ $\hbox{$\beta^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval. We present the effect of the modal decomposition quality with respect to the phase-shift error of the mask. Several realistic cases are presented with 1/10/100/1000 exposures in the final image. A good wavefront retrieval is given after an average image of at least 100 exposures.

8. Discussion

Thanks to the phase retrieval technique, we have shown in the previous sections that using POAM on the starlight given by the VVC, it is possible to instantaneously measure the phase of residual stellar speckles in coronagraphic images and hence improve the sensitivity of high-contrast telescopes. The main limitations are essentially coming from the limited coronagraphic nulling due to the input Strehl ratio, and the mask chromaticity over broad bandwidths. To investigate the impact of these instrumental limitations, we considered various input Strehl ratios in our simulations and estimated the coronagraphic nulling ratio and the maximum wavelength bandpass that are necessary to provide an accuracy as good as Err(σ_A) = 10^-10. The results of this analysis are shown in Table 2. They were computed for an optimized high-contrast optical workbench for a best VVC (Δφ = 10^-3 radian).

Table 2

Maximum coronagraphic nulling ratio and wavelength bandpass achievable for various levels of input Strehl ratio.

Table 2 shows that our phase retrieval technique can still be used for a Strehl ratio as low as 95% with a mask chromaticity of 10^-3 < Δφ < 10^-2 radian provided that it is applied on a sufficiently narrow wavelength bandpass (65 nm). In practice, this means that the technique becomes more time-consuming because it must be applied on more spectral channels for decreasing Strehl ratios. Owing to the mask imperfection, a maximum nulling of 10^-5 on the stellar peak seems to be feasible for an input Strehl ratio of 99.6%. The proposed method can handle various instrumental limitations and is sufficiently flexible to be coupled with the EFC algorithm (Give’on et al. 2007) to minimize the electric field in a desired region of the focal plane.

To ensure optimized wavefront corrections in the entrance pupil if we use an EFC algorithm, we need to use the complete Zernike development in the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ basis. We can null all present residual speckles in the the final coronagraphic image, not an individual speckle or identifiable feature only. Indeed, our process directly gives the input pupil phase and amplitude by image analysis but this is not an EFC algorithm. On the other hand, we must modify the EFC algorithm with our formalism to increase speed-up and obtain a direct global electric field minimization problem on the entrance pupil plane. A previous paper (Riaud 2012) on a new deformable mirror architecture provides all mathematical tools to use this NZ theory in the speckle cancellation process. Finally, we notice that if the corrections are not exactly in the pupil plane, some Fresnel propagations occur and small corrections of the Fraunhofer diffraction are needed (see Appendix H for a complete mathematical demonstration).

9. Conclusion

We demonstrated a full analytical modal decomposition of the effect aberrations on the vectorial vortex phase-mask final image using polarization properties. This procedure can be used on a very stable coronagraphic system for detecting terrestrial planets in the visible around nearby stars such as TPF-C (and precursors). Indeed, this proposed simple optical implementation allows us to calibrate the residual speckle pattern directly on the final coronagraphic images. End-to-end simulations show that the common and non-common path errors (λ/50 rms at 650 nm) after the filtered pupil due to the beam separation system are not influencing the final images. Indeed, the level of precision can be as good as ≈0.1% for the retrieved phase and is only limited by the detector and the photon noises. We also presented the effect of the polychromatic coronagraphic images on the maximum likelihood between the Fourier and the modal simulations. A precise ( ≈ 1%) modal decomposition with a wavelength bandpass of Δλ < 140 nm can be obtained with the proposed optical implementation. Finally, the main source of error and performance limitation of the presented modal decomposition is the signal-to-noise ratio of the two coronagraphic images and the direct PSF image (l_p = 0). A very stable optical system is needed to stack several images ( ≈ 100 see Sect. 7.3.3) and reach the 10^-10 speckle level that opens the way to detect terrestrial planets in the visible with a space telescope.

An option is the direct phase correction with extreme adaptive optics (XAO) to increase the rejection factor, the main limitation will again be the phase retrieval precision.

¹

Details on the method for converting $α_{n}^{m}$ $\hbox{$\alpha_n^m$}$ to $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ for large aberrations (Strehl ratio > 50%) are given in Appendix A.

Acknowledgments

This work received the support of the University of Liège. This work was partly carried out at the European Southern Observatory (ESO) site of Vitacura (Santiago, Chile). The authors are grateful to C. Hanot (IAGL), D. Defrère (MPI-RA) for manuscript corrections. The authors wish to thank the referee Wesley Traub for his useful comments and corrections. The authors also acknowledge support from the Communauté française de Belgique – Actions de recherche concertées – Académie universitaire Wallonie-Europe. This idea dates back to 2005-2006 and the first author is grateful to section 17 and the CNAP French commissions for their outstanding recruitment work.

References

Allen, L., Courtial, J., & Padgett, M. 1999, Phys. Rev. E, 60, 74977503 [CrossRef] [Google Scholar]
Blanc, A., Fusco, T., Hartung, M., Mugnier, L. M., & Rousset, G. 2003, A&A, 399, 373 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Bohren, C. F., & Huffman, D. R. 1983, Absorption and scattering of light by small particles (New York: Wiley) [Google Scholar]
Bordé, P. J., & Traub, W. A. 2006, ApJ, 638, 488 [NASA ADS] [CrossRef] [Google Scholar]
Born, M., & Wolf, E. 1999, Principles of Optics (Cambridge University Press) [Google Scholar]
Give’on, A., Kern, B., Shaklan, S., Moody, D. C., & Pueyo, L. 2007, SPIE Conf. Ser., 6691, 66910A [Google Scholar]
Gonsalves, R. A. 1982, Opt. Eng., 21, 829 [Google Scholar]
Gradshteyn, I. S., & Ryzhik, I. M. 1994, Table of Integrals, Series and Products (Academic Press Inc), 1762 [Google Scholar]
Henyey, L., & Greenstein, J. 1941, ApJ, 93, 70 [NASA ADS] [CrossRef] [Google Scholar]
Janssen, A. 2002, JOSA A, 19, 849 [NASA ADS] [CrossRef] [Google Scholar]
Krist, J. E., Belikov, R., Pueyo, L., et al. 2011, in SPIE Conf. Ser., 8151, 81510E [Google Scholar]
Magette, A. 2010, Ph.D. Thesis: The International Liquid Mirror Telescope: optical testing and alignment using a Nijboer-Zernike aberration retrieval approch, IAGL, University of Liège, 1 [Google Scholar]
Mahajan, V. N. 1981a, J. Opt. Soc. Am., 71, 75 [Google Scholar]
Mahajan, V. N. 1981b, J. Opt. Soc. Am., 71, 1408 [Google Scholar]
Mawet, D., Riaud, P., Absil, O., & Surdej, J. 2005, ApJ, 633, 1191 [NASA ADS] [CrossRef] [Google Scholar]
Mawet, D., Serabyn, E., Liewer, K., et al. 2009, Opt. Expr., 17, 1902 [Google Scholar]
ming Dai, G. 2006, J. Opt. Soc. Am. A, 23, 539 [NASA ADS] [CrossRef] [Google Scholar]
Nijboer, B. 1943, Physica, 10, 679 [NASA ADS] [CrossRef] [Google Scholar]
Nijboer, B. 1947, Physica, 13, 605 [NASA ADS] [CrossRef] [Google Scholar]
Poynting, J. 1909, Proc. Roy. Soc. London Ser. A, 82, 560567 [NASA ADS] [Google Scholar]
Riaud, P. 2003, Ph.D. Thesis, Université Paris VI [Google Scholar]
Riaud, P. 2012, A&A, 545, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Riaud, P., & Schneider, J. 2007, A&A, 469, 355 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Riaud, P., Boccaletti, A., Baudrand, J., & Rouan, D. 2003, PASP, 115, 712 [NASA ADS] [CrossRef] [Google Scholar]
Roddier, C., & Roddier, F. 1993, J. Opt. Soc. Am. A, 10, 2277 [NASA ADS] [CrossRef] [Google Scholar]
Yao, A. M., & Padgett, M. J. 2011, Adv. Opt. Photon., 3, 161 [CrossRef] [Google Scholar]

Appendix A: Expression of the pupil aberration function

A.1. Zernike decomposition of the wavefront

The general pupil function is $P (ρ,θ) = e^{i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m} (ρ,θ)},$ $\begin{equation*} P(\rho,\theta) = {\rm e}^{{\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m(\rho,\theta)} , \end{equation*}$ where $Z_{n}^{m}$ $\hbox{$Z_n^m$}$ are the classical Zernike polynomials. When the aberration level is low (Strehl ratio > 90%), we can approximate this function by $P (ρ,θ) \approx 1 + i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m} (ρ,θ) = \sum_{n,m} β_{n}^{m} Z_{n}^{m} (ρ,θ) .$ $\appendix \setcounter{section}{1} \begin{equation} P(\rho,\theta) \approx 1+ {\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m(\rho,\theta) = \sum_{n,m} \beta_n^m Z_n^m(\rho,\theta). \end{equation}$ (A.1)The $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients are then directly related to the classical $α_{n}^{m}$ $\hbox{$\alpha_n^m$}$ coefficients. Since the Zernike polynomials form an orthogonal basis, the classical pupil function with $α_{n}^{m}$ $\hbox{$\alpha_n^m$}$ can always be fully described by a sum of Zernike functions, $e^{i \cdot \sum_{n,m} α_{n}^{m} Z_{n}^{m}} = \sum_{n,m} β_{n}^{m} Z_{n}^{m} \leftarrow \to \cos (\sum_{n,m} α_{n}^{m} Z_{n}^{m}) = \sum_{n,m} ℜ (β_{n}^{m}) Z_{n}^{m} \sin (\sum_{n,m} α_{n}^{m} Z_{n}^{m}) = \sum_{n,m} ℑ (β_{n}^{m}) Z_{n}^{m} .$ $\appendix \setcounter{section}{1} \begin{equation} {\rm e}^{{\rm i} \cdot \sum_{n,m} \alpha_n^m Z_n^m} = \sum_{n,m} \beta_n^m Z_n^m \quad \longleftrightarrow \quad \cos \left( \sum_{n,m} \alpha_n^m Z_n^m\right) = \sum_{n,m} \Re \left(\beta_n^m\right)Z_n^m \qquad \sin \left( \sum_{n,m} \alpha_n^m Z_n^m\right) = \sum_{n,m} \Im \left(\beta_n^m\right)Z_n^m . \end{equation}$ (A.2)Note that the Strehl ratio Sr can be calculated directly with the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients: $Sr = \frac{{| β_{0}^{0} |}^{2}}{\sum_{n,m} {| β_{n}^{m} |}^{2}},$ $\appendix \setcounter{section}{1} \begin{equation} Sr= \frac{\left|\beta_0^0\right|^2} {\sum_{n,m} \left|\beta_n^m\right|^2}, \end{equation}$ (A.3)By decomposing the real and imaginary parts of the pupil into Zernike polynomials, it is thus possible to obtain the real and imaginary parts of the β coefficients. If the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients represent the weights of the aberrations of the surface, the orthogonality relation yields $β_{n^{'}}^{m^{'}} = \frac{1}{2 π} \sum_{n,m} \int_{0}^{1} \int_{0}^{2 π} β_{n}^{m} Z_{n}^{m} (ρ,θ) \cdot Z_{n^{'}}^{m^{'}} (ρ,θ) ρ d ρ d θ$ $\appendix \setcounter{section}{1} \begin{equation} \beta_{n'}^{m'} = \frac{1}{2\pi} \sum_{n,m} \int_0^1 \int_0^{2\pi} \beta_n^m Z_n^m(\rho,\theta) \cdot Z_{n'}^{m'}(\rho,\theta) \rho {\rm d}\rho {\rm d}\theta \end{equation}$ (A.4)for the real and imaginary parts of Zernike polynomials.

High spatial frequency variations are difficult to represent with a limited sum of Zernike polynomials. More β coefficients than the number of α coefficients are then required to compute aberrations in the pupil. For coronagraphic imaging, where we use a diaphragm smaller than, or equal to, the pupil radius, the strong variations at the edge of the pupil are masked and the number of β and α coefficients are somewhat equal. The phase function φ_Π(ρ,θ) of the pupil in the β basis can be determined by $φ_{Π} (ρ,θ) = \arctan (\frac{\sum_{n,m} ℜ (β_{n}^{m}) Z_{n}^{m} (ρ,θ)}{\sum_{n,m} ℑ (β_{n}^{m}) Z_{n}^{m} (ρ,θ)}) \cdot$ $\appendix \setcounter{section}{1} \begin{equation} \phi_{\Pi}(\rho, \theta)=arctan\left( \frac{\sum_{n,m} \Re \left(\beta_n^m\right)Z_n^m(\rho, \theta)}{\sum_{n,m} \Im \left(\beta_n^m\right)Z_n^m(\rho, \theta)}\right)\cdot \end{equation}$ (A.5)Because the arctangent function is defined between ± π, the phase function needs to be unwrapped before it is decomposed, if the level of aberrations is relatively high (Strehl < 80%).

A.2. Radial Zernike functions calculation

$R_{n}^{m} (ρ) = \sum_{s = 0}^{\frac{n - m}{2}} C_{z} (n,m,s) ρ^{n - 2 s}$ $\appendix \setcounter{section}{1} \begin{equation} \label{eq:zern_rad1} R_n^m(\rho)=\sum^{\frac{n-m}{2}}_{s=0} C_z(n,m,s) \rho^{n-2s} \end{equation}$ (A.6)For a proper mathematical stability, the C_z(n,m,s) coefficients of the radial Zernike functions must be calculated recursively. The following radial Zernike functions can be calculated easily: $R_{n}^{n + 2} (ρ) = 0 R_{n}^{n} (ρ) = ρ^{n} R_{n}^{n - 2} (ρ) = n ρ^{n} - (n - 1) ρ^{n - 2} .$ $\appendix \setcounter{section}{1} \begin{equation} \label{eq:zern_rad2} R_n^{n+2}(\rho)=0 \quad R_n^n(\rho)=\rho^n \quad R_n^{n-2}(\rho)=n\rho^n-(n-1)\rho^{n-2} . \end{equation}$ (A.7)The low-order radial Zernike functions $R_{n}^{0}$ $\hbox{$R_n^0$}$ to $R_{n}^{n - 4}$ $\hbox{$R_n^{n-4}$}$ can be calculated with the following recurrence formula: $R_{n}^{m} (ρ) = \frac{n}{n^{2} - m^{2}} [(4 (n - 1) ρ^{2} - \frac{(n + m - 2)^{2}}{n - 2} - \frac{(n - m)^{2}}{n}) R_{n - 2}^{m} (ρ) - (\frac{(n - 2)^{2} - m^{2}}{n - 2}) R_{n - 4}^{m} (ρ)] .$ $\appendix \setcounter{section}{1} \begin{equation} \label{eq:zern_rad3} R_n^m(\rho)=\frac{n}{n^2-m^2}\left[\left( 4(n-1)\rho^2 -\frac{(n+m-2)^2}{n-2}-\frac{(n-m)^2}{n}\right)R_{n-2}^m(\rho) - \left( \frac{(n-2)^2-m^2}{n-2}\right)R_{n-4}^m(\rho)\right]. \end{equation}$ (A.8)

Appendix B: Expression of the Hankel transform H_m

B.1. The classical Zernike radial functions $R_{n}^{m} (ρ)$ $\hbox{$\mathsfsl{R_n^m(\rho)}$}$

To calculate the amplitude in the image plane in the presence of optical aberrations given by the Zernike polynomials, we need the direct mth order Hankel transform H_m for the radial coordinate. We have $\begin{matrix} H_{m} [R_{n}^{m} (ρ)] = \int_{0}^{1} ρ R_{n}^{m} (ρ) J_{m} (2 πrρ) d ρ \\ H_{m} [R_{n}^{m} (ρ)] = \sum_{s = 0}^{\frac{n - m}{2}} C_{z} (n,m,s) \int_{0}^{1} ρ^{n - 2 s + 1} J_{m} (2 πrρ) d ρ - \to H_{m} [R_{n}^{m} (ρ)] = \sum_{s = 0}^{\frac{n - m}{2}} f_{z} (n,m,s) . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{ht1} && H_m\left[R_n^m(\rho) \right]=\int_{0}^{1} \rho \, R_n^m(\rho) J_m(2 \pi r \rho) \: {\rm d} \rho\\ && \nonumber H_m\left[R_n^m(\rho) \right]=\sum^{\frac{n-m}{2}}_{s=0} C_z(n,m,s) \int_{0}^{1} \rho^{n-2s+1} J_m(2 \pi r \rho) \: {\rm d} \rho \quad \longrightarrow \quad \nonumber H_m\left[R_n^m(\rho) \right]=\sum^{\frac{n-m}{2}}_{s=0} f_z(n,m,s) . \end{eqnarray}$ (B.1)The radial integration between the ρ^{n − 2s + 1} polynomial and the Bessel function J_m can be calculated in the following way: $\int_{0}^{1} ρ^{n - 2 s + 1} J_{m} (2 πrρ) d ρ = \frac{(πr)^{m}}{2} \frac{Γ [\frac{m + n}{2} - s + 1^{]}}{Γ [m + 1] \cdot Γ [\frac{m + n}{2} - s + 2^{]}} \cdot 2 F_{1} (\frac{m + n}{2} - s + 1,m + 1, \frac{m + n}{2} - s + 2, - π^{2} r^{2}),$ $\appendix \setcounter{section}{2} \begin{equation} \label{ht2} \int_{0}^{1} \rho^{n-2s+1} J_m(2 \pi r \rho) \: {\rm d} \rho = \frac{(\pi r)^m} {2} \frac{\Gamma\left[\frac{m+n}{2}-s+1\right]} {\Gamma\left[m+1\right] \cdot \Gamma\left[\frac{m+n}{2}-s+2\right]}\cdot {_2}F_1\left(\frac{m+n}{2}-s+1, m+1,\frac{m+n}{2}-s+2, -\pi^2 r^2\right) , \end{equation}$ (B.2)where ${}_{2}F_{1}$ $\hbox{${_2}F_1$}$ is the Gauss hypergeometric function. The full expression of the f_z(n,m,s) function becomes $f_{z} (n,m,s) = \frac{(πr)^{m} (- 1)^{s} Γ [n - s + 1]}{2 Γ [m + 1] Γ [s + 1] Γ [\frac{n - m}{2} - s + 1^{]} Γ [\frac{n + m}{2} - s + 2^{]}} \cdot 2 F_{1} (\frac{m + n}{2} - s + 1,m + 1, \frac{m + n}{2} - s + 2, - π^{2} r^{2}) .$ $\appendix \setcounter{section}{2} \begin{equation} \label{ht3} f_z(n,m,s)=\frac{(\pi r)^m (-1)^s \Gamma\left[n-s+1\right]} {2 \Gamma\left[m+1\right] \Gamma\left[s+1\right] \Gamma\left[\frac{n-m}{2}-s+1\right] \Gamma\left[\frac{n+m}{2}-s+2\right]} \cdot {_2}F_1\left(\frac{m+n}{2}-s+1, m+1,\frac{m+n}{2}-s+2, -\pi^2 r^2\right) . \end{equation}$ (B.3)The Gauss hypergeometric function can be expressed in terms of Bessel J functions: $\frac{J_{m + 1} (2 πr)}{2 πr} = \frac{(πr)^{m}}{2 \cdot Γ [m + 2]} \cdot 2 F_{1} (m + 1,m + 1,m + 2, - π^{2} r^{2}) .$ $\appendix \setcounter{section}{2} \begin{equation} \label{ht4} \frac{J_{m+1}(2\pi r)}{2\pi r}=\frac{(\pi r)^m} {2 \cdot \Gamma\left[m+2\right]}\cdot {_2}F_1\left(m+1, m+1,m+2, -\pi^2 r^2\right) . \end{equation}$ (B.4)After a simple variable change, the f_z(n,m,s) function becomes $\begin{matrix} s = (n - m) / 2 - x \\ f_{z} (n,m,x) = \frac{Γ [\frac{n + m}{2} + x + 1^{]} (- 1)^{(n - m) / 2}}{(πr)^{x} Γ [x + 1] Γ [m + x + 1] Γ [\frac{n - m}{2} - x + 1^{]}} (P (m) \frac{J_{m + x + 1} (2 πr)}{2 πr} + Q (m) \frac{J_{m + x + 2} (2 πr)}{2 πr}), \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{ht5} && s=(n-m)/2-x \nonumber\\ && f_z(n,m,x) = \frac{\Gamma\left[\frac{n+m}{2}+x+1\right](-1)^{(n-m)/2}} { (\pi r)^x \Gamma\left[x+1\right] \Gamma\left[m+x+1\right] \Gamma\left[\frac{n-m}{2}-x+1\right]}\left( P(m) \frac{J_{m+x+1}(2\pi r)}{2\pi r} + Q(m) \frac{J_{m+x+2}(2\pi r)}{2\pi r}\right) , \end{eqnarray}$ (B.5)where P(m) and Q(m) are two polynomials (see Table B.1). In fact, the summation of two f_z functions (f_z(n,m,x) + f_z(n,m,x + 1)) allows us to simplify the general expression of the mth order Hankel transform. We use the following Bessel J transformation: $\frac{J_{m + x + 2} (2 πr)}{2 πr} = (\frac{m + x + 1}{πr}) \frac{J_{m + x + 1} (2 πr)}{2 πr} - \frac{J_{m + x} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{2} \begin{equation} \label{ht5b} \frac{J_{m+x+2}(2\pi r)}{2\pi r}=\left(\frac{m+x+1}{\pi r}\right) \frac{J_{m+x+1}(2\pi r)}{2\pi r}-\frac{J_{m+x}(2\pi r)}{2\pi r}\cdot \end{equation}$ (B.6)After Bessel J functions simplification, we obtain $H_{m} [R_{n}^{m} (ρ)] = (- 1)^{(n - m) / 2} \frac{J_{n + 1} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{2} \begin{equation} \label{ht6} H_m\left[R_n^m(\rho) \right]=(-1)^{(n-m)/2} \frac{J_{n+1}(2\pi r)} {2\pi r} \cdot \end{equation}$ (B.7)

Table B.1

Values of some P(m)/Q(m) polynomials.

B.2. The annular Zernike radial function $R_{n}^{m} (ρ,ϵ)$ $\hbox{$\mathsfsl{R_n^m(\rho,\epsilon)}$}$

For a telescope with a central obscuration ϵ (ϵ < 1), the aberration function can be obtained directly by replacement of the C_z(n,m,s) Zernike coefficients (see Eq. (4)) by the C_z(n,m,s,ϵ) using the normalization described in Mahajan (1981a,b). The expression of the f_z(n,m,s) must take into account the new normalization system ( $f_{a}^{n,m} (ϵ) / \sqrt{\sum_{i = 0}^{n} ϵ^{2 i}}$ $\hbox{$f^{n,m}_a(\epsilon)/\sqrt{\sum_{i=0}^{n}\epsilon^{2i}}$}$ ). The mth order Hankel transform becomes $H_{m} [R_{n}^{m} (ρ,ϵ)] = \sum_{x = 0}^{\frac{n - m}{2}} f_{z} (n,m,x,ϵ) (C_{z} (n,m,s,ϵ) = C_{z} (n,m,s) \frac{f_{a}^{n,m} (ϵ)}{\sqrt{\sum_{i = 0}^{n} ϵ^{2 i}}} \to f_{z} (n,m,x,ϵ) = f_{z} (n,m,x) \frac{f_{a}^{n,m} (ϵ)}{\sqrt{\sum_{i = 0}^{n} ϵ^{2 i}}}) \cdot$ $\appendix \setcounter{section}{2} \begin{equation} \label{hte1} H_m\left[R_n^m(\rho,\epsilon) \right]=\sum^{\frac{n-m}{2}}_{x=0} f_z(n,m,x,\epsilon) \qquad \left( C_z(n,m,s,\epsilon) = C_z(n,m,s) \frac{f^{n,m}_a(\epsilon)} {\sqrt{\sum_{i=0}^{n}\epsilon^{2i}} } \quad \rightarrow \quad f_z(n,m,x,\epsilon) = f_z(n,m,x) \frac{f^{n,m}_a(\epsilon)} {\sqrt{\sum_{i=0}^{n}\epsilon^{2i}} } \right)\cdot \end{equation}$ (B.8)During the Bessel J simplification process (see Eqs. (B.5), (B.6)), the presence of the $f_{a}^{n,m} (ϵ)$ $\hbox{$f^{n,m}_a(\epsilon)$}$ terms does not allows us a complete Bessel J low-order elimination (J_{m + x + 2} < J_n + 1). This effect is very small and a simple renormalization by $\sqrt{\sum_{i = 0}^{n} ϵ^{2 i}}$ $\hbox{$\sqrt{\sum_{i=0}^{n}\epsilon^{2i}}$}$ coefficients gives a good approximation of the mth order Hankel transform under annular Zernike decomposition, $\begin{matrix} H_{m} [R_{n}^{m} (ρ,ϵ)] \approx \frac{(- 1)^{(n - m) / 2}}{\sqrt{\sum_{i = 0}^{n} ϵ^{2 i}}} \frac{J_{n + 1} (2 πr)}{2 πr} (\forall n,m ϵ̇ \leq 0.4 err \leq 1 %^{)} . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{hte3} H_m\left[R_n^m(\rho,\epsilon) \right] \approx \frac{(-1)^{(n-m)/2}} {\sqrt{\sum_{i=0}^{n}\epsilon^{2i}} } \frac{J_{n+1}(2\pi r)}{2\pi r} \quad \left(\forall n,m \quad \dot\epsilon\leq 0.4 \quad err\leq 1\% \right) . \end{eqnarray}$ (B.9)

Appendix C: Expression of the U $\begin{matrix} m \\ n \end{matrix}$ $\hbox{${_n^m}$}$ functions

To calculate the amplitude in the coronagraphic image plane in the presence of optical aberrations $U_{n}^{m}$ $\hbox{$U_n^m$}$ , we use the diffraction integral in polar coordinates. Depending on the function parity (cos or sin), the $U_{n}^{m}$ $\hbox{$U_n^m$}$ functions given by the Eq. (6) becomes $\begin{matrix} U_{n}^{m} (r,φ)_{c} = \frac{1}{π} \int_{0}^{1} \int_{0}^{2 π} ρ R_{n}^{m} (ρ) (\cos (mθ) \cos (mφ) - \sin (mθ) \sin (mφ)) e^{- 2 i πrρ \cos (θ - φ)} d θ d ρ \\ U_{n}^{m} (r,φ)_{s} = \frac{1}{π} \int_{0}^{1} \int_{0}^{2 π} ρ R_{n}^{m} (ρ) (\sin (mθ) \cos (mφ) + \cos (mθ) \sin (mφ)) e^{- 2 i πrρ \cos (θ - φ)} d θ d ρ . \end{matrix}$ $\appendix \setcounter{section}{3} \begin{eqnarray} \label{UcUs} &&U_n^m (r,\phi)_{\rm c}= \frac{1}{\pi} \int_{0}^{1} \int_{0}^{2\pi} \rho R_n^m(\rho) (\cos(m\theta) \cos(m\phi) - \sin(m\theta) \sin(m\phi)) {\rm e}^{-2 {\rm i} \pi r \rho \cos (\theta-\phi)} {\rm d} \theta {\rm d} \rho~~~~~~~~~~~~~\\ &&U_n^m (r,\phi)_{\rm s}= \frac{1}{\pi} \int_{0}^{1} \int_{0}^{2\pi}\rho R_n^m(\rho) (\sin(m\theta) \cos(m\phi) + \cos(m\theta) \sin(m\phi)) {\rm e}^{-2 {\rm i} \pi r \rho \cos (\theta-\phi)} {\rm d} \theta {\rm d} \rho . \end{eqnarray}$ In both cases, the integral of the term sin(mθ) between 0 and 2π vanishes. Moreover, using the definition of the Bessel functions, we have $J_{m} (2 πrρ) {\begin{matrix} \cos (mφ) \\ \sin (mφ) \end{matrix} = \frac{(- 1)^{- m} i^{- m}}{2 π} \int_{0}^{2 π} e^{- 2 i πrρ \cos (θ - φ)} {\begin{matrix} \cos (mθ) \\ \sin (mθ) \end{matrix} d θ .$ $\appendix \setcounter{section}{3} \begin{equation} \label{eq:Bessel} J_m(2 \pi r \rho) \left\{ \begin{array}{l} \cos (m\phi)\\ \sin (m\phi)\\ \end{array} \right. = \frac{(-1)^{-m} {\rm i}^{-m}}{2\pi} \int_0^{2\pi} {\rm e}^{-2 {\rm i} \pi r \rho \cos (\theta-\phi)} \left\{ \begin{array}{l} \cos (m\theta)\\ \sin (m\theta)\\ \end{array} \right. {\rm d}\theta . \end{equation}$ (C.3)Substituting this expression into the two previous equations while using properties of the Bessel functions (Gradshteyn & Ryzhik 1994), and the radial result for the direct mth order Hankel transform H_m (see Eq. (B.7)), we finally obtain $U_{n}^{m} (r,φ)_{c} = 2 i^{m} \cos (mφ) (- 1)^{(n + m) / 2} \frac{J_{n + 1} (2 πr)}{2 πr} U_{n}^{m} (r,φ)_{s} = 2 i^{m} \sin (mφ) (- 1)^{(n + m) / 2} \frac{J_{n + 1} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{3} \begin{equation} U_n^m (r,\phi)_{\rm c} = 2 {\rm i}^m \cos(m\phi) (-1)^{(n+m)/2} \frac{J_{n+1}(2\pi r)} {2\pi r} \qquad U_n^m (r,\phi)_{\rm s} = 2 {\rm i}^m \sin(m\phi) (-1)^{(n+m)/2} \frac{J_{n+1}(2\pi r)} {2\pi r} \cdot \end{equation}$ (C.4)

Appendix D: Expression of the V $\begin{matrix} l_{p} \\ n, m \end{matrix}$ $\hbox{$\mathsfsl{_ {n,m}^{l_{\sf p}}}$}$ function (Fraunhofer diffraction)

In the classical Nijboer-Zernike theory without defocus the expression of the V_n,m is given directly by $V_{n,m} (r) = (- 1)^{(n + m) / 2} \frac{J_{n + 1} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{4} \begin{equation} \label{vnm1} V_{n,m}(r) = (-1)^{(n+m)/2} \frac{J_{n+1}(2\pi r)} {2\pi r}\cdot \end{equation}$ (D.1)With the VVC, the coronagraphic pupil expression is given by the inverse l_p + m order Hankel transform of the Bessel function included in the $U_{n}^{m} (r,φ)$ $\hbox{$U_n^m (r,\phi)$}$ , $\begin{matrix} Π_{c} (ρ,θ) = 2 π^{2} i^{m} (- 1)^{(n + m) / 2} H_{l_{p} \pm m}^{-1} [\frac{J_{n + 1} (2 πr)}{2 πr} e^{i (l_{p} φ - π / 2)} {\begin{matrix} \end{matrix}] \\ Π_{c} (ρ,θ) = 2 π^{2} i^{m} (- 1)^{(n + m) / 2} \frac{1}{π} \int_{0}^{+ \infty} \int_{0}^{2 π} \frac{J_{n + 1} (2 πr)}{2 πr} e^{i (l_{p} φ - π / 2)} {\begin{matrix} \end{matrix} e^{i 2 πrρ \cos (φ - θ)} d φr d r \end{matrix}$ $\appendix \setcounter{section}{4} \begin{eqnarray} && \Pi_{\rm c}(\rho,\theta)=2\pi^2 {\rm i}^m (-1)^{(n+m)/2} H_{l_{\rm p}\pm m}^{-1} \left[ \frac{J_{n+1}(2\pi r )} {2\pi r } {\rm e}^{{\rm i} (l_{\rm p} \phi-\pi/2)} \left\{ \begin{array}{l} \cos (m\phi)\\ \sin (m\phi)\\ \end{array} \right. \right]\\ && \Pi_{\rm c}(\rho,\theta)=2\pi^2 {\rm i}^m (-1)^{(n+m)/2} \frac{1}{\pi} \int_{0}^{+\infty} \int_{0}^{2\pi} \frac{J_{n+1}(2\pi r)} {2\pi r} {\rm e}^{{\rm i}(l_{\rm p} \phi-\pi/2)} \left\{ \begin{array}{l} \cos (m\phi)\\ \sin (m\phi)\\ \end{array} \right. {\rm e}^{{\rm i} 2\pi r \rho \cos(\phi-\theta)} {\rm d}\phi \: r{\rm d}r \nonumber \end{eqnarray}$ (D.2) $\begin{matrix} \int \begin{matrix} 2 π \\ 0 \end{matrix} e^{i (l_{p} φ - π / 2)} \cos (mφ) e^{i 2 πrρ \cos (φ - θ)} d φ = ϵ_{l_{p}} π (i^{l_{p} - m} J_{l_{p} - m} (2 πrρ) e^{i (l_{p} - m) θ} + i^{l_{p} + m} J_{l_{p} + m} (2 πrρ) e^{i (l_{p} + m) θ}) \\ \int \begin{matrix} 2 π \\ 0 \end{matrix} e^{i (l_{p} φ - π / 2)} \sin (mφ) e^{i 2 πrρ \cos (φ - θ)} d φ = \frac{ϵ_{l_{p}}}{i} π (i^{l_{p} + m} J_{l_{p} + m} (2 πrρ) e^{i (l_{p} + m) θ} - i^{l_{p} - m} J_{l_{p} - m} (2 πrρ) e^{i (l_{p} - m) θ}) \\ \int_{0}^{2 π} e^{i (l_{p} φ - π / 2)} e^{i 2 πrρ \cos (φ - θ)} d φ = ϵ_{l_{p}} 2 π (i^{l_{p}} J_{l_{p}} (2 πrρ) e^{i l_{p} θ}) ϵ_{l_{p}} = - i, l_{p} \neq 0 ϵ_{l_{p}} = 1, l_{p} = 0. \end{matrix}$ $\appendix \setcounter{section}{4} \begin{eqnarray} &&\int_{0}^{2\pi} {\rm e}^{{\rm i}(l_{\rm p} \phi-\pi/2)} \cos (m\phi) {\rm e}^{{\rm i} 2\pi r \rho \cos(\phi-\theta)} {\rm d}\phi= \epsilon_{l_{\rm p}}\pi \left({\rm i}^{l_{\rm p}-m} J_{l_{\rm p}-m}(2\pi r \rho)\: {\rm e}^{{\rm i} (l_{\rm p}-m) \theta} + {\rm i}^{l_{\rm p}+m} J_{l_{\rm p}+m}(2\pi r \rho)\: {\rm e}^{{\rm i} (l_{\rm p}+m) \theta} \right) \\ &&\int_{0}^{2\pi} {\rm e}^{{\rm i}(l_{\rm p} \phi-\pi/2)} \sin (m\phi) {\rm e}^{{\rm i} 2\pi r \rho \cos(\phi-\theta)} {\rm d}\phi=\frac{\epsilon_{l_{\rm p}}} {\rm i}\pi \left({\rm i}^{l_{\rm p}+m} J_{l_{\rm p}+m}(2\pi r \rho)\: {\rm e}^{{\rm i} (l_{\rm p}+m) \theta} - {\rm i}^{l_{\rm p}-m} J_{l_{\rm p}-m}(2\pi r \rho)\: {\rm e}^{{\rm i} (l_{\rm p}-m) \theta} \right) \\ &&\int_{0}^{2\pi} {\rm e}^{{\rm i}(l_{\rm p} \phi-\pi/2)} {\rm e}^{{\rm i} 2\pi r \rho \cos(\phi-\theta)} {\rm d}\phi= \epsilon_{l_{\rm p}} 2\pi \left({\rm i}^{l_{\rm p}} J_{l_{\rm p}}(2\pi r \rho)\: {\rm e}^{{\rm i} l_{\rm p}\theta}\right) \qquad \epsilon_{l_{\rm p}}=-{\rm i} \:,\: l_{\rm p}\neq0 \qquad \epsilon_{l_{\rm p}}=1 \:,\: l_{\rm p}=0 .~~~~~~~~~~~~~~~~~~~ \end{eqnarray}$ With l_p even (necessary conditions to obtain the VVC coronagraphic effect), we have $\begin{matrix} Π_{c} (ρ,θ) = i^{m} ζ_{n,m}^{l_{p}} (ρ,θ)_{c / s} \\ ζ_{n,m}^{l_{p}} (ρ,θ)_{c} = \frac{ϵ_{l_{p}}}{2} 2 π (- 1)^{(n + m) / 2} \int_{0}^{+ \infty} J_{n + 1} (2 πr) [i^{l_{p} - m} e^{i (l_{p} - m) θ} J_{l_{p} - m} (2 πrρ) + i^{l_{p} + m} e^{i (l_{p} + m) θ} J_{l_{p} + m} (2 πrρ)^{]} d r \\ ζ_{n,m}^{l_{p}} (ρ,θ)_{s} = \frac{ϵ_{l_{p}}}{2 i} 2 π (- 1)^{(n + m) / 2} \int_{0}^{+ \infty} J_{n + 1} (2 πr) [i^{l_{p} + m} e^{i (l_{p} + m) θ} J_{l_{p} + m} (2 πrρ) - i^{l_{p} - m} e^{i (l_{p} - m) θ} J_{l_{p} - m} (2 πrρ)^{]} d r \\ R_{n}^{| m |} (ρ) = (- 1)^{(n - | m |) / 2} 2 π \int_{0}^{+ \infty} J_{n + 1} (2 πr) J_{| m |} (2 πρr) d r 0 \leq ρ < d_{c} \\ ζ_{n,m}^{l_{p}} (ρ,θ)_{c} = \frac{ϵ_{l_{p}}}{2} (- 1)^{l_{p} / 2} i^{l_{p} - m} e^{i l_{p} θ} [R_{n}^{| l_{p} + m |} (ρ) e^{i mθ} + R_{n}^{| l_{p} - m |} (ρ) e^{- i mθ}] (0 < ρ < d_{c}) \\ ζ_{n,m}^{l_{p}} (ρ,θ)_{s} = \frac{ϵ_{l_{p}}}{2 i} (- 1)^{l_{p} / 2} i^{l_{p} - m} e^{i l_{p} θ} [R_{n}^{| l_{p} + m |} (ρ) e^{i mθ} - R_{n}^{| l_{p} - m |} (ρ) e^{- i mθ}] (0 < ρ < d_{c}) \\ ζ_{n,m}^{l_{p}} (ρ,θ)_{m = 0} = ϵ_{l_{p}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ} [R_{n}^{| l_{p} |} (ρ)] (0 < ρ < d_{c}) . \end{matrix}$ $\appendix \setcounter{section}{4} \begin{eqnarray} &&\Pi_{\rm c}(\rho,\theta)={\rm i}^m \zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm c/s} \\ &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm c}=\frac{\epsilon_{l_{\rm p}}} {2} 2\pi (-1)^{(n+m)/2} \int_{0}^{+\infty} J_{n+1}(2\pi r) \left[{\rm i}^{l_{\rm p}-m} \:{\rm e}^{{\rm i}(l_{\rm p}-m)\theta} J_{l_{\rm p}-m}(2 \pi r \rho) + {\rm i}^{l_{\rm p}+m} \:{\rm e}^{{\rm i}(l_{\rm p}+m)\theta} J_{l_{\rm p}+m}(2 \pi r \rho)\right] {\rm d}r \nonumber \\ &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm s}=\frac{\epsilon_{l_{\rm p}}} {2{\rm i}} 2\pi (-1)^{(n+m)/2} \int_{0}^{+\infty} J_{n+1}(2\pi r) \left[{\rm i}^{l_{\rm p}+m} \: {\rm e}^{{\rm i}(l_{\rm p}+m)\theta} J_{l_{\rm p}+m}(2 \pi r \rho) - {\rm i}^{l_{\rm p}-m} \:{\rm e}^{{\rm i}(l_{\rm p}-m)\theta} J_{l_{\rm p}-m}(2 \pi r \rho)\right] {\rm d}r \nonumber\\ && R_{n}^{|m|}(\rho)=(-1)^{(n-|m|)/2} \: 2\pi \int_0^{+\infty} J_{n+1}(2\pi r)J_{|m|}(2\pi \rho r) {\rm d}r \qquad 0\leq\rho<d_{\rm c} \nonumber \\ &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm c}=\frac{\epsilon_{l_{\rm p}}} {2} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}-m}\: {\rm e}^{{\rm i} l_{\rm p}\theta}\: \left[R_n^{|l_{\rm p}+m|}(\rho) \: {\rm e}^{{\rm i}m\theta} + R_n^{|l_{\rm p}-m|}(\rho) \: {\rm e}^{-{\rm i}m\theta} \right]\quad (0<\rho<d_{\rm c})\nonumber\\ &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm s}=\frac{\epsilon_{l_{\rm p}}} {2 {\rm i}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}-m}\: {\rm e}^{{\rm i} l_{\rm p}\theta}\: \left[R_n^{|l_{\rm p}+m|}(\rho) \: {\rm e}^{{\rm i}m\theta} - R_n^{|l_{\rm p}-m|}(\rho) \: {\rm e}^{-{\rm i}m\theta}\right]\quad (0<\rho<d_{\rm c})\nonumber\\ &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{m=0}= \epsilon_{l_{\rm p}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}} \: {\rm e}^{{\rm i} l_{\rm p}\theta}\: \left[R_n^{|l_{\rm p}|}(\rho) \right]\quad (0<\rho<d_{\rm c}) . \end{eqnarray}$ Outside the coronagraphic pupil geometric area, it is necessary to change variables in the integrals to ensure a proper convergence: n → l_p ± m l_p ± m → n + 2 ρ → 1/ρ.

For n = 0,m = 0,l_p = 2: $\begin{matrix} ζ_{0, 0}^{2} (ρ,θ)_{m = 0} = 0 \to β_{0}^{0} = 0 l_{p} = 2, 4, 6, 8 (0 < ρ < d_{c}) \\ ζ_{0, 0}^{2} (ρ,θ)_{m = 0} = - i R_{2}^{2} (1 / ρ) e^{i 2 θ} (ρ > d_{c}) (R_{n}^{n + | l_{p} |} (ρ) = 0) . \end{matrix}$ $\appendix \setcounter{section}{4} \begin{eqnarray} &&\zeta_{0,0}^{2}(\rho,\theta)_{m=0}=0 \rightarrow \beta_0^0=0 \quad l_{\rm p}=2,4,6,8\quad (0<\rho<d_{\rm c})\\ \nonumber &&\zeta_{0,0}^{2}(\rho,\theta)_{m=0}=-{\rm i} \:R_2^2(1/\rho) \: {\rm e}^{{\rm i}2\theta} \quad (\rho>d_{\rm c}) \qquad \left( R_n^{n+|l_{\rm p}|}(\rho)=0 \right) . \end{eqnarray}$ (D.8)These last two results correct typos in Eq. (C.7) in the appendix of (Mawet et al. 2005) for n = 0,m = 0.

Fig. D.1

Input classical Zernike $Z_{n}^{m}$ $\hbox{$Z_n^m$}$ .

Fig. D.2

$ℜ (ζ_{n,m}^{l_{p}} (ρ,θ))$ $\hbox{$\Re\left(\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)\right)$}$ function in the coronagraphic pupil.

Fig. D.3

$ℑ (ζ_{n,m}^{l_{p}} (ρ,θ))$ $\hbox{$\Im\left(\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)\right)$}$ function in the coronagraphic pupil.

Fig. D.4

U_v(r,φ),l_p = ± 2 functions in the coronagraphic image plane for the 10 Zernike polynomials (Z₂ − Z₁₁). Top: l_p = + 2, Bottom: l_p = −2. The vertical solid line separates the real part from the imaginary part of the two U_v(r,φ) functions.

Note that the pupil integration on the ρ coordinate is applied between 0 and d_c because of the presence of the Lyot stop, $β_{n}^{m} ζ_{n,m}^{l_{p}} (ρ)_{c / s} (0 < ρ < d_{c}) - \to β_{n}^{m} (d_{c}) ζ_{n,m}^{l_{p}} (ρ / d_{c})_{c / s} - \to Π_{c} (ρ,θ) = \sum_{n,m} β_{n}^{m} (d_{c}) i^{m} ζ_{n,m}^{l_{p}} (ρ / d_{c})_{c / s} (0 < ρ < 1) .$ $\appendix \setcounter{section}{4} \begin{equation} \beta_n^m \zeta_{n,m}^{l_{\rm p}}(\rho)_{\rm c/s}\quad (0<\rho<d_{\rm c})\quad \longrightarrow \quad \beta_n^m(d_{\rm c}) \zeta_{n,m}^{l_{\rm p}}(\rho/ d_{\rm c})_{\rm c/s}\quad \longrightarrow \quad\Pi_{\rm c}(\rho,\theta)=\sum_{n,m} \beta_n^m(d_{\rm c}) {\rm i}^m \zeta_{n,m}^{l_{\rm p}} (\rho/ d_{\rm c})_{\rm c/s} \quad (0<\rho<1) . \end{equation}$ (D.9)To retrieve the coronagraphic amplitude residual in the final imaging plane, we use the direct l_p + m order Hankel transform of the coronagraphic pupil plane, $\begin{matrix} U_{v} (r,φ) = H_{l_{p} \pm m} [Π_{c} (ρ,θ)] \\ U_{v} (r,φ)_{c} = ϵ_{l_{p}} \sum_{n,m} β_{n}^{m} (d_{c})_{c} 2 i^{m} (- 1)^{(n + m) / 2} e^{i l_{p} φ} \frac{J_{n + 1} (2 πr)}{2 πr} C_{m} (φ) U_{v} (r,φ)_{s} = ϵ_{l_{p}} \sum_{n,m} β_{n}^{m} (d_{c})_{s} 2 i^{m} (- 1)^{(n + m) / 2} e^{i l_{p} φ} \frac{J_{n + 1} (2 πr)}{2 πr} S_{m} (φ) \\ C_{m} (φ) = (e^{i mθ} + e^{- i mθ}) / 2 S_{m} (φ) = (e^{i mθ} - e^{- i mθ}) / 2 i . \end{matrix}$ $\appendix \setcounter{section}{4} \begin{eqnarray} &&U_v (r,\phi)= H_{l_{\rm p}\pm m} \left[\Pi_{\rm c}(\rho,\theta) \right]\\ &&U_v (r,\phi)_{\rm c}=\epsilon_{l_{\rm p}} \sum_{n,m} \beta_n^m(d_{\rm c})_{\rm c} \: 2{\rm i}^{m} (-1)^{(n+m)/2} \: {\rm e}^{{\rm i} l_{\rm p}\phi} \frac{J_{n+1}(2\pi r)} {2\pi r} C_m(\phi) \qquad U_v (r,\phi)_{\rm s}=\epsilon_{l_{\rm p}} \sum_{n,m} \beta_n^m(d_{\rm c})_{\rm s} \: 2{\rm i}^{m} (-1)^{(n+m)/2} \: {\rm e}^{{\rm i} l_{\rm p}\phi} \frac{J_{n+1}(2\pi r)} {2\pi r} S_m(\phi)\nonumber\\ &&C_m(\phi)=\left( {\rm e}^{{\rm i} m \theta} + {\rm e}^{-{\rm i} m \theta} \right)/2 \qquad S_m(\phi)=\left({\rm e}^{{\rm i} m \theta} - {\rm e}^{-{\rm i} m \theta}\right)/2{\rm i} .\nonumber \end{eqnarray}$ (D.10)We do not simplify ${}^{(}e^{i mθ} \pm e^{- i mθ})$ $\hbox{$\left({\rm e}^{{\rm i} m \theta} \pm {\rm e}^{-{\rm i} m \theta} \right)$}$ with 2.cos(mθ) or 2i.sin(mθ) because when n = m the Zernike polynomial $R_{n}^{| n + l_{p} |} (ρ) = 0$ $\hbox{$R_n^{|n+l_{\rm p}|}(\rho)=0$}$ or $R_{n}^{| - n - l_{p} |} (ρ) = 0$ $\hbox{$R_n^{|-n-l_{\rm p}|}(\rho)=0$}$ , $U_{v} (r,φ) = \sum_{n,m} 2 i^{m} (β_{n}^{m} (d_{c})_{c} C_{m} (φ) + β_{n}^{m} (d_{c})_{s} S_{m} (φ)) \cdot V_{n,m}^{l_{p}} (r,φ) V_{n,m}^{l_{p}} (r,φ) = ϵ_{l_{p}} (- 1)^{(n + m) / 2} e^{i l_{p} φ} \frac{J_{n + 1} (2 πr)}{2 πr}$ $\appendix \setcounter{section}{4} \begin{equation} \label{tr2} U_v (r,\phi) = \sum_{n,m} 2{\rm i}^{m} \left(\beta_n^m(d_{\rm c})_{\rm c} \: C_m(\phi) + \beta_n^m(d_{\rm c})_{\rm s} \: S_m(\phi)\right) \cdot V_{n,m}^{l_{\rm p}}(r,\phi) \qquad V_{n,m}^{l_{\rm p}}(r,\phi)=\epsilon_{l_{\rm p}} (-1)^{(n+m)/2} \: {\rm e}^{{\rm i} l_{\rm p}\phi} \frac{J_{n+1}(2\pi r)} {2\pi r} \end{equation}$ (D.11)

m = 0 − → C_m(φ) = 1 S_m(φ) = 0
0 < m < n − → C_m(φ) = cos(mφ) S_m(φ) = sin(mφ)
m = n − → C_m(φ) = ⁽cos(mφ) − iSign(l_p)sin(mφ)⁾/2 S_m(φ) = ⁽sin(mφ) + iSign(l_p)cos(mφ)⁾/2.

In the final imaging plane, a VVC device of order l_p adds a POAM l_p on the incoming residual light (Poynting 1909; Allen et al. 1999).

Appendix E: Polychromatic V $\begin{matrix} l_{p},λ \\ n, m \end{matrix}$ $\hbox{$\mathsfsl{_{n,m}^{l_{\sf p},\lambda}}$}$ expression

In the broadband case, we can use a polychromatic set of functions $V_{n,m}^{l_{p},λ}$ $\hbox{$V_{n,m}^{l_{\rm p},\lambda}$}$ to enable a better complex amplitude aberration retrieval. The main effect is to broaden the monochromatic $V_{n,m}^{l_{p}} (r)$ $\hbox{$V_{n,m}^{l_{\rm p}}(r)$}$ set of functions and to decrease the contrast of the Bessel J_n + 1 rings of the coronagraphic images. These two effects can be implemented with a simple Gaussian convolution of the monochromatic functions, $V_{n,m}^{l_{p},λ} (r) = e^{- r^{2} / 2 σ_{g}^{2}} \otimes V_{n,m}^{l_{p}} (r) (r \geq 0) - \to V_{n,m}^{l_{p},λ} (r) = ϵ_{l_{p}} (- 1)^{(n + m) / 2} e^{i l_{p} φ} \int_{0}^{+ \infty} e^{- x^{2} / 2 σ_{g}^{2}} \frac{J_{n + 1} (2 π (r - x))}{2 π (r - x)} d x .$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc} V_{n,m}^{l_{\rm p},\lambda}(r)= {\rm e}^{-r^2/2\sigma_g^2} \otimes V_{n,m}^{l_{\rm p}}(r) \qquad (r\geq 0) \quad \longrightarrow \quad V_{n,m}^{l_{\rm p},\lambda}(r)= \epsilon_{l_{\rm p}} (-1)^{(n+m)/2} \: {\rm e}^{{\rm i} l_{\rm p}\phi} \int_0^{+\infty} {{\rm e}^{-x^2/2\sigma_g^2} \frac{J_{n+1}(2\pi(r-x))}{2\pi(r-x)} {\rm d}x}. \end{equation}$ (E.1)Using the two following relations in the convolution $\frac{J_{n + 1} (2 π (r - x))}{2 π (r - x)} = \frac{1}{2 (n + 1)} [J_{n} (2 π (r - x)) + J_{n + 2} (2 π (r - x))] J_{n} (2 π (r - x)) = \sum_{k = 0}^{+ \infty} J_{n + k} (2 πr) . J_{k} (2 πx),$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc1} \frac{J_{n+1}(2\pi(r-x))} {2\pi(r-x)}= \frac{1}{2(n+1)} \left[ J_n(2\pi(r-x)) + J_{n+2}(2\pi(r-x))\right] \qquad \qquad J_n(2\pi(r-x))=\sum_{k=0}^{+\infty} {J_{n+k}(2\pi r).J_{k}(2\pi x)} , \end{equation}$ (E.2)we obtain $\int_{0}^{+ \infty} e^{- x^{2} / 2 σ_{g}^{2}} \frac{J_{n + 1} (2 π (r - x))}{2 π (r - x)} d x = \frac{1}{2 (n + 1)} \sum_{k = 0}^{+ \infty} \int_{0}^{+ \infty} e^{- x^{2} / 2 σ_{g}^{2}} J_{k} (2 πx) [J_{n + k} (2 πr) + J_{n + 2 + k} (2 πr)] d x .$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc2a} \int_0^{+\infty} {{\rm e}^{-x^2/2\sigma_g^2} \frac{J_{n+1}(2\pi(r-x))} {2\pi(r-x)} {\rm d}x}= \frac{1}{2(n+1)}\sum_{k=0}^{+\infty} {\int_0^{+\infty} {{\rm e}^{-x^2/2\sigma_g^2} J_{k}(2\pi x) \left[ J_{n+k}(2\pi r) + J_{n+2+k}(2\pi r)\right] {\rm d}x}} . \end{equation}$ (E.3)Following Gradshteyn & Ryzhik (1994), p1094, the infinite integral can be calculated analytically, $\int_{0}^{+ \infty} e^{- x^{2} / 2 σ_{g}^{2}} J_{k} (2 πx) d x = σ_{g} e^{- π^{2} σ_{g}^{2}} \sqrt{\frac{π}{2}} I_{k / 2} (π^{2} {σ_{g}^{2}}^{)},$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc2b} \int_0^{+\infty} {\rm e}^{-x^2/2\sigma_g^2} J_{k}(2\pi x) {\rm d}x = \sigma_g {\rm e}^{-\pi^2 \sigma_g^2} \sqrt{\frac{\pi}{2}} I_{k/2}\left(\pi^2 \sigma_g^2\right) , \end{equation}$ (E.4)where I_k/2 is the Bessel I function. Note that the two Bessel J functions can be compacted as follows: $[J_{n + k} (2 πr) + J_{n + 2 + k} (2 πr)] = 2 (n + k + 1) \frac{J_{n + k + 1} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc3} \left[ J_{n+k}(2\pi r) + J_{n+2+k}(2\pi r)\right]= 2 (n+k+1) \frac{J_{n+k+1}(2\pi r)} {2\pi r}\cdot \end{equation}$ (E.5)The polychromatic set of functions $V_{n,m}^{l_{p},λ}$ $\hbox{$V_{n,m}^{l_{\rm p},\lambda}$}$ then becomes $V_{n,m}^{l_{p},λ} = ϵ_{l_{p}} (- 1)^{(n + m) / 2} e^{i l_{p} φ} \sum_{k = 0}^{+ \infty} \sqrt{\frac{π}{2}} \frac{(n + k + 1)}{(n + 1)} σ_{g} \cdot e^{- π^{2} σ_{g}^{2}} \cdot I_{k / 2} (π^{2} σ_{g}^{2}) \cdot \frac{J_{n + k + 1} (2 πr)}{2 πr} \cdot$ $\appendix \setcounter{section}{5} \begin{equation} \label{polycc4} V_{n,m}^{l_{\rm p},\lambda}=\epsilon_{l_{\rm p}} (-1)^{(n+m)/2} \: {\rm e}^{{\rm i} l_{\rm p}\phi} \sum_{k=0}^{+\infty} \sqrt{\frac{\pi}{2}} \frac{(n+k+1)} {(n+1)} \sigma_g \cdot {\rm e}^{-\pi^2 \sigma_g^2} \cdot I_{k/2}(\pi^2 \sigma_g^2) \cdot \frac{J_{n+k+1}(2\pi r)} {2\pi r} \cdot \end{equation}$ (E.6)The Bessel I function decreases quickly with respect to the k variable. Therefore, the discrete summation on the first 20 to 30 k coefficients is sufficient to retrieve the final function to an accuracy of 10^-12. Note that the β_n,m coefficients retrieved in the polychromatic case correspond to the worst attenuation factor at the shortest wavelength.

Appendix F: Energy conservation

Each $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode must be normalized by its intensity for a proper retrieval analysis, $V_{n,m}^{l_{p}} (r,φ) - \to V_{n,m}^{l_{p}} (r,φ) / \sqrt{E (V_{n,m}^{l_{p}})} E (V_{n,m}^{l_{p}}) = \int_{φ = 0}^{2 π} \int_{r = 0}^{r_{\max}} r \cdot {| V_{n,m}^{l_{p}} (r,φ) |}^{2} d r d φ .$ $\appendix \setcounter{section}{6} \begin{equation} \label{modes_E1} V_{n,m}^{l_{\rm p}}(r,\phi) \longrightarrow V_{n,m}^{l_{\rm p}}(r,\phi) / \sqrt{E(V_{n,m}^{l_{\rm p}})} \qquad E(V_{n,m}^{l_{\rm p}})=\int_{\phi=0}^{2\pi} \int_{r=0}^{r_{\rm max}} r \cdot \left|V_{n,m}^{l_{\rm p}}(r,\phi) \right|^2 {\rm d}r {\rm d}\phi . \end{equation}$ (F.1) $E (V_{n,m}^{l_{p}}) = 2 r_{\max}^{2 n + 2} π^{2 n + 2} \frac{Γ [2 n + 2]}{Γ [n + 2]^{2} Γ [2 n + 3]} 2 F_{3} (n + 1,n + 3 / 2; n + 2,n + 2, 2 n + 3; 4 π^{2} {r_{\max}^{2}}^{)} E (V_{0, 0}^{l_{p}})_{r_{\max} \to + \infty} = 1.$ $\appendix \setcounter{section}{6} \begin{equation} \label{modes_E2} E(V_{n,m}^{l_{\rm p}})=2 r^{2n+2}_{\rm max} \pi^{2n+2} \frac{\Gamma[2n+2]}{\Gamma[n+2]^2\Gamma[2n+3]} {_2}F_3 \left(n+1,n+3/2;n+2,n+2,2n+3;4\pi^2 r^2_{\rm max}\right) \qquad E(V_{0,0}^{l_{\rm p}})_{r_{\rm max}\rightarrow +\infty} = 1. \end{equation}$ (F.2)Similarly, we obtain in the polychromatic case $V_{n,m}^{l_{p},λ} (r,φ) - \to V_{n,m}^{l_{p},λ} (r,φ) / \sqrt{E (V_{n,m}^{l_{p},λ})} E (V_{n,m}^{l_{p},λ}) = \sum_{k = 0}^{+ \infty} \frac{π (n + k + 1)^{2}}{2 (n + 1)^{2}} σ_{g}^{2} \cdot e^{- 2 π^{2} σ_{g}^{2}} \cdot {| I_{k / 2} (π^{2} {σ_{g}^{2}}^{)} |}^{2} E (V_{n + k,m}^{l_{p}}) .$ $\appendix \setcounter{section}{6} \begin{equation} \label{modes_E3} V_{n,m}^{l_{\rm p},\lambda}(r,\phi) \longrightarrow V_{n,m}^{l_{\rm p},\lambda}(r,\phi) / \sqrt{E\left(V_{n,m}^{l_{\rm p},\lambda}\right)} \qquad E\left(V_{n,m}^{l_{\rm p},\lambda}\right)=\sum_{k=0}^{+\infty} \frac{\pi (n+k+1)^2} {2(n+1)^2 } \sigma^2_g \cdot {\rm e}^{-2\pi^2 \sigma_g^2} \cdot \left|I_{k/2}\left(\pi^2 \sigma_g^2\right)\right|^2 E\left(V_{n+k,m}^{l_{\rm p}}\right) . \end{equation}$ (F.3)

Appendix G: Analytical vs. Fourier simulations

We performed numerical simulations to compare the robustness of the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ analytical functions in the monochromatic case to the classical FFT computations. These simulations were performed on 2K × 2K arrays using the “super-resolution” method presented in Sect. 7.1. A χ² and Err(σ_A) are calculated for each Zernike on a box centered on the coronagraphic image of 256 × 256 pixel square. We assume that each $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ function is optimized in terms of sampling, $χ^{2} = \sum_{pixels} \frac{{| I_{V_{n,m}^{l_{p}}} - I_{FFT} |}^{2}}{I_{FFT}} and Err (σ_{A}) = \frac{σ ({| A_{FFT} - A_{V_{n,m}^{l_{p}}} |}^{2})}{\max (PSF)} \cdot$ $\appendix \setcounter{section}{7} \begin{equation} \label{chi2pix} \chi^2=\sum_{\rm pixels}{\frac{\left|I_{V^{l_{\rm p}}_{n,m}}-I_{\rm FFT}\right|^2} {I_{\rm FFT}}}\quad {\rm and} \quad Err(\sigma_{\rm A})=\frac{\sigma\left(\left|A_{\rm FFT}-A_{V^{l_{\rm p}}_{n,m}}\right|^2\right)} {{\rm max}(PSF)} \cdot \end{equation}$ (G.1)The χ² is normalized to the number of pixels inside an annulus centered on the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode: typically, a χ² < 1 is considered as good. The following Figs. G.1 and G.2 show structuring features as a function of the Zernike mode parity number m.

Fig. G.1

Comparison between Fourier simulations for each Zernike polynomial (2-860) and $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode in terms of χ². Left: the direct summation in the pupil (β_n,m), right: the classical phase function (e^iα_n,m).

Fig. G.2

Comparison between Fourier simulations for each Zernike polynomial (2-860) and $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode in terms of Err(σ_A). Left: the direct summation in the pupil (β_n,m), right: the classical phase function (e^iα_n,m).

Appendix H: Expression of the pupil function in Fresnel diffraction ( $Π \begin{matrix} F \\ c \end{matrix}$ $\hbox{$\Pi\mathsfsl{_c^F}$}$ )

H.1. Circular pupil

This appendix presents the calculation of the Fresnel diffraction of the coronagraphic pupil $ζ_{n,m}^{l_{p}} (ρ,θ)_{c / s}$ $\hbox{$\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm c/s}$}$ . Indeed, due to the uncertainty of the Lyot stop position, the diffracted field must be treated according to the Fresnel diffraction theory, $\begin{matrix} ζ_{n,m}^{l_{p}} (ρ,θ)_{c / s} - \to ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'})_{c / s} \\ ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'}) = \frac{e^{i kz} . e^{i k ρ^{' 2} / 2 z}}{- i λz} \int_{0}^{2 π} \int_{0}^{+ \infty} e^{i k ρ^{2} / 2 z} ζ_{n,m}^{l_{p}} (ρ,θ) e^{i k ρ^{'} ρ \cos (θ - φ) / z} d θρ d ρ \\ ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'}) = i N e^{i kz} . e^{i πN ρ^{' 2}} \int_{0}^{2 π} \int_{0}^{+ \infty} e^{i πN ρ^{2}} ζ_{n,m}^{l_{p}} (ρ,θ) e^{i 2 πN ρ^{'} ρ \cos (θ - φ)} d θρ d ρ, \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr1} &&\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)_{\rm c/s} \longrightarrow \zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')_{\rm c/s}\\ &&\zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')=\frac{{\rm e}^{{\rm i} k z}.{\rm e}^{{\rm i} k \rho'^{2} /2z}}{-{\rm i} \lambda z} \int_{0}^{2\pi} \int_{0}^{+\infty} {\rm e}^{{\rm i} k \rho^{2}/2z} \zeta_{n,m}^{l_{\rm p}}(\rho,\theta) {\rm e}^{{\rm i} k \rho' \rho \cos(\theta-\phi)/z} {\rm d}\theta \: \rho {\rm d}\rho \nonumber\\ &&\zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')={\rm i}N {\rm e}^{{\rm i} k z}.{\rm e}^{{\rm i} \pi N \rho'^{2}} \int_{0}^{2\pi} \int_{0}^{+\infty} {\rm e}^{{\rm i} \pi N \rho^{2}} \zeta_{n,m}^{l_{\rm p}}(\rho,\theta) {\rm e}^{{\rm i} 2 \pi N \rho' \rho \cos(\theta-\phi)} {\rm d}\theta \: \rho {\rm d}\rho ,\nonumber \end{eqnarray}$ (H.1)where k = 2π/λ, z is the Fresnel distance propagation, and $N = d_{c}^{2} / λz$ $\hbox{$N=d_{\rm c}^2/\lambda z$}$ is the Fresnel number.

Inside the pupil plane, we have $\begin{matrix} ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'})_{c} = \frac{P_{F}^{l_{p}}}{2} \int_{0}^{d_{c}} e^{i πN ρ^{2}} [R_{n}^{| l_{p} + m |} (ρ) J_{l_{p} + m} (2 πNρ ρ^{'}) e^{i m θ^{'}} + R_{n}^{| l_{p} - m |} (ρ) J_{l_{p} - m} (2 πNρ ρ^{'}) e^{- i m θ^{'}}] ρ d ρ \\ ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'})_{s} = \frac{P_{F}^{l_{p}}}{2 i} \int_{0}^{d_{c}} e^{i πN ρ^{2}} [R_{n}^{| l_{p} + m |} (ρ) J_{l_{p} + m} (2 πNρ ρ^{'}) e^{i m θ^{'}} - R_{n}^{| l_{p} - m |} (ρ) J_{l_{p} - m} (2 πNρ ρ^{'}) e^{- i m θ^{'}}] ρ d ρ \\ P_{F}^{l_{p}} (ρ^{'}, θ^{'}) = 2 i πN e^{i kz} . e^{i πN ρ^{' 2}} ϵ_{l_{p}} (- 1)^{l_{p} / 2} i^{l_{p} - m} e^{i l_{p} θ^{'}} . \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr2a} &&\zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')_{\rm c}=\frac{P_F^{l_{\rm p}}} {2}\int_{0}^{d_{\rm c}} {\rm e}^{{\rm i} \pi N \rho^{2}} \left[R_n^{|l_{\rm p}+m|}(\rho) J_{l_{\rm p} + m}(2 \pi N \rho \rho' )\: {\rm e}^{{\rm i}m\theta'} + R_n^{|l_{\rm p}-m|}(\rho) J_{l_{\rm p}-m}(2 \pi N \rho \rho' )\: {\rm e}^{-{\rm i}m\theta'} \right] \rho {\rm d}\rho\\ &&\zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')_{\rm s}=\frac{P_F^{l_{\rm p}}} {2i}\int_{0}^{d_{\rm c}} {\rm e}^{{\rm i} \pi N \rho^{2}} \left[R_n^{|l_{\rm p}+m|}(\rho) J_{l_{\rm p} + m}(2 \pi N \rho \rho' )\: {\rm e}^{{\rm i}m\theta'} - R_n^{|l_{\rm p}-m|}(\rho) J_{l_{\rm p}-m}(2 \pi N \rho \rho' )\: {\rm e}^{-{\rm i}m\theta'} \right] \rho {\rm d}\rho~~~~~~~~~~~~~~~~~~~~~~~~\\ &&P_F^{l_{\rm p}}(\rho',\theta')=2{\rm i}\pi N {\rm e}^{{\rm i} k z}.{\rm e}^{{\rm i} \pi N \rho'^{2}} \epsilon_{l_{\rm p}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}-m}\: {\rm e}^{{\rm i}l_{\rm p}\theta'} .\nonumber \end{eqnarray}$ Outside the pupil plane, we have $\begin{matrix} ζ_{n,m}^{l_{p},F} {(ρ^{'}, θ^{'})}_{c} = \frac{P_{F}^{l_{p}}}{2} \int_{d_{c}}^{+ \infty} e^{i πN ρ^{2}} [R_{| l_{p} + m |}^{n + 2} (\frac{1}{ρ}) J_{l_{p} + m} (2 πNρ ρ^{'}) e^{i m θ^{'}} + R_{| l_{p} - m |}^{n + 2} (\frac{1}{ρ}) J_{l_{p} - m} (2 πNρ ρ^{'}) e^{- i m θ^{'}}] ρ d ρ \\ ζ_{n,m}^{l_{p},F} {(ρ^{'}, θ^{'})}_{s} = \frac{P_{F}^{l_{p}}}{2 i} \int_{d_{c}}^{+ \infty} e^{i πN ρ^{2}} [R_{| l_{p} + m |}^{n + 2} (\frac{1}{ρ}) J_{l_{p} + m} (2 πNρ ρ^{'}) e^{i m θ^{'}} - R_{| l_{p} - m |}^{n + 2} (\frac{1}{ρ}) J_{l_{p} - m} (2 πNρ ρ^{'}) e^{- i m θ^{'}}] ρ d ρ . \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr2a2} &&\zeta_{n,m}^{l_{\rm p},F}\left(\rho',\theta'\right)_{\rm c}=\frac{P_F^{l_{\rm p}}} {2}\int_{d_{\rm c}}^{+\infty} {\rm e}^{{\rm i} \pi N \rho^{2}} \left[R_{|l_{\rm p}+m|}^{n+2}\left(\frac{1}{\rho}\right) J_{l_{\rm p} + m}\left(2 \pi N \rho \rho' \right)\: {\rm e}^{{\rm i}m\theta'} + R_{|l_{\rm p}-m|}^{n+2}\left(\frac{1}{\rho}\right) J_{l_{\rm p}-m}\left(2 \pi N \rho \rho' \right)\: {\rm e}^{-{\rm i}m\theta'} \right] \rho {\rm d}\rho\\ &&\zeta_{n,m}^{l_{\rm p},F}\left(\rho',\theta'\right)_{\rm s}=\frac{P_F^{l_{\rm p}}} {2i}\int_{d_{\rm c}}^{+\infty} {\rm e}^{{\rm i} \pi N \rho^{2}} \left[R_{|l_{\rm p}+m|}^{n+2}\left(\frac{1}{\rho}\right) J_{l_{\rm p} + m}\left(2 \pi N \rho \rho' \right)\: {\rm e}^{{\rm i}m\theta'} - R_{|l_{\rm p}-m|}^{n+2}\left(\frac{1}{\rho}\right) J_{l_{\rm p}-m}\left(2 \pi N \rho \rho' \right)\: {\rm e}^{-{\rm i}m\theta'} \right] \rho {\rm d}\rho .~~~~~~~~~~~~~ \end{eqnarray}$ Inside the pupil plane, we have two terms: the Zernike polynomials, and a pupil edge effect. Here, we make use of Babinet’s principle of complementary screens to express the second integral as a classical Fresnel diffraction between 0 to d_c, $F_{n,m}^{l_{p}} (ρ^{'}) = \int_{0}^{+ \infty} e^{i πN ρ^{2}} R_{n}^{| l_{p} + m |} (ρ) J_{l_{p} + m} (2 πNρ ρ^{'}) ρ d ρ - \int_{d_{c}}^{+ \infty} e^{i πN ρ^{2}} J_{l_{p} + m} (2 πNρ ρ^{'}) ρ d ρ \to F_{n,m}^{l_{p}} (ρ^{'}) = F_{n,m}^{l_{p}} (ρ^{'}, 1) + F_{0, 0}^{l_{p}} (ρ^{'}, 2)$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr3} F_{n,m}^{l_{\rm p}}( \rho')= \int_{0}^{+\infty} \: {\rm e}^{{\rm i} \pi N \rho^{2}} R_n^{|l_{\rm p} + m|}(\rho) J_{l_{\rm p} + m}(2 \pi N \rho \rho' ) \rho {\rm d}\rho - \int_{d_{\rm c}}^{+\infty} \: {\rm e}^{{\rm i} \pi N \rho^{2}} J_{l_{\rm p} + m}(2 \pi N \rho \rho' ) \rho {\rm d}\rho \quad \rightarrow \quad F_{n,m}^{l_{\rm p}}( \rho') = F_{n,m}^{l_{\rm p}}( \rho',1) + F_{0,0}^{l_{\rm p}}( \rho',2) \end{equation}$ (H.6) $F_{n,m}^{l_{p}} (ρ^{'}, 1) = \int_{0}^{+ \infty} e^{i πN ρ^{2}} R_{n}^{| l_{p} + m |} (ρ) J_{l_{p} + m} (2 πNρ ρ^{'}) ρ d ρ F_{0, 0}^{l_{p}} (ρ^{'}, 2) = 1 + \int_{0}^{d_{c}} e^{i πN ρ^{2}} J_{l_{p}} (2 πNρ (1 - ρ^{'})) ρ d ρ .$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr3a} F_{n,m}^{l_{\rm p}}( \rho',1) = \int_{0}^{+\infty} \: {\rm e}^{{\rm i} \pi N \rho^{2}} R_n^{|l_{\rm p} + m|}(\rho) J_{l_{\rm p} + m}(2 \pi N \rho \rho' ) \rho {\rm d}\rho \qquad \qquad F_{0,0}^{l_{\rm p}}( \rho',2) = 1+ \int_{0}^{d_{\rm c}} \: {\rm e}^{{\rm i} \pi N \rho^{2}} J_{l_{\rm p}}(2 \pi N \rho (1-\rho') ) \rho {\rm d}\rho .\nonumber \end{equation}$ The first integral can be calculated with all Zernike polynomials: $\begin{matrix} F_{n,m}^{l_{p}} (ρ^{'}, 1) = \sum_{s = 0}^{\frac{n - | m + l_{p} |}{2}} C_{z} (n, | m + l_{p} |,s) \int_{0}^{+ \infty} ρ^{n - 2 s + 1} e^{i πN ρ^{2}} J_{l_{p} + m} (2 πNρ ρ^{'}) d ρ \\ F_{n,m}^{l_{p}} (ρ^{'}, 1) = \sum_{s = 0}^{\frac{n - | m + l_{p} |}{2}} C_{z} (n, | m + l_{p} |,s) (- i πN)^{(| l_{p} + m | - n - 2 + 2 s) / 2} \cdot \frac{Γ [(\frac{| l_{p} + m | + n + 2 - 2 s}{2}] * ρ^{' | l_{p} + m |}}{Γ [(| l_{p} + m | + 1]} 1 F_{1} (\frac{| l_{p} + m | + n + 2 - 2 s}{2}, | l_{p} + m | + 1, - i πN ρ^{' 2}), \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr3b} &&F_{n,m}^{l_{\rm p}}( \rho',1)= \sum^{\frac{n-|m+l_{\rm p}|}{2}}_{s=0} C_z(n,|m+l_{\rm p}|,s) \int_{0}^{+\infty} \: \rho^{n-2s+1 } {\rm e}^{{\rm i} \pi N \rho^{2}} J_{l_{\rm p} + m}(2 \pi N \rho \rho' ) {\rm d}\rho \\ &&F_{n,m}^{l_{\rm p}}( \rho',1)= \!\! \sum^{\frac{n-|m+l_{\rm p}|}{2}}_{s=0}\!\! C_z(n,|m+l_{\rm p}|,s) (-{\rm i}\pi N)^{(|l_{\rm p}+m|-n-2+2s)/2} \cdot \frac{\Gamma[(\frac{|l_{\rm p}+m|+n+2-2s} {2}]*\rho'^{|l_{\rm p}+m|}}{\Gamma[(|l_{\rm p}+m|+1]} {_1}F_1\left(\frac{|l_{\rm p}+m|+n+2-2s} {2},|l_{\rm p}+m|+1,-{\rm i}\pi N \rho'^2 \right), \nonumber \end{eqnarray}$ (H.7)where ${}_{1}F_{1}$ $\hbox{${_1}F_1$}$ is the confluent hypergeometric function.

The second integral only represents a diffraction effect to the J_{l_p} order, $\begin{matrix} F_{0, 0}^{l_{p}} (ρ^{'}, 2) = 1 + \int_{0}^{1} e^{i x^{2} u / 2} J_{l_{p}} (vx) x d x \\ x = ρ / d_{c} u = 2 πN v = 2 πN (1 - ρ^{'} / d_{c}) J_{2} (vx) = \frac{2}{vx} J_{1} (vx) - J_{0} (vx) (l_{p} = \pm 2) \\ F_{0, 0}^{l_{p}} (ρ^{'}, 2) = 1 - \int_{0}^{1} e^{i x^{2} u / 2} J_{0} (vx) x d x + \frac{2}{v} \int_{0}^{1} e^{i x^{2} u / 2} J_{1} (vx) d x . \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr3c} && F_{0,0}^{l_{\rm p}}( \rho',2)= 1 + \int_{0}^{1} {\rm e}^{{\rm i} x^2 u/2} J_{l_{\rm p}}(vx) x {\rm d}x \\ &&x=\rho/d_{\rm c} \quad u=2\pi N \quad v=2\pi N (1-\rho'/d_{\rm c}) \quad J_2(vx)=\frac{2} {vx} J_1(vx)-J_0(vx) \quad (l_{\rm p}=\pm 2)\nonumber\\ &&F_{0,0}^{l_{\rm p}}( \rho',2)= 1 - \int_{0}^{1} {\rm e}^{{\rm i} x^2 u/2} J_0(vx) x {\rm d}x + \frac{2} {v} \int_{0}^{1} {\rm e}^{{\rm i} x^2 u/2}J_1(vx) {\rm d}x .\nonumber \end{eqnarray}$ (H.8)We define the Lommel functions of two variable generally used in the Fresnel diffraction of a circular screen (Born & Wolf 1999): $\begin{matrix} U_{n} (u,v) = \sum_{j = 0}^{+ \infty} (- 1)^{j} {(\frac{u}{v})}^{n + 2 j} J_{n + 2 j} (v) V_{n} (u,v) = \sum_{j = 0}^{+ \infty} (- 1)^{j} {(\frac{v}{u})}^{n + 2 j} J_{n + 2 j} (v) \\ \int \begin{matrix} 1 \\ 0 \end{matrix} e^{i x^{2} u / 2} J_{0} (vx) x d x = \frac{e^{i u / 2}}{u} (U_{1} (u,v) - i U_{2} (u,v)) \frac{2}{v} \int_{0}^{1} e^{i x^{2} u / 2} J_{1} (vx) d x = \frac{2}{uv} (e^{- i v^{2} / 2 u} + (J_{0} (v) - V_{0} (u,v) - i V_{1} (u,v)) e^{i u / 2}) \\ F_{0, 0}^{l_{p}} (ρ^{'}, 2) = 1 + \frac{2 e^{- i v^{2} / 2 u}}{uv} + \frac{e^{i u / 2}}{u} (\frac{2 J_{0} (v)}{v} - \frac{2 V_{0} (u,v) - 2 i V_{1} (u,v)}{v} - U_{1} (u,v) + i U_{2} (u,v)) \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr3d} && U_n(u,v)=\sum_{j=0}^{+\infty} (-1)^j \left(\frac{u} {v}\right)^{n+2j} J_{n+2j}(v) \qquad V_n(u,v)=\sum_{j=0}^{+\infty} (-1)^j \left(\frac{v} {u}\right)^{n+2j} J_{n+2j}(v) \\ \label{fr3e} && \int_{0}^{1} {\rm e}^{{\rm i} x^2 u/2} J_0(vx) x {\rm d}x = \frac{{\rm e}^{{\rm i} u/2}} {u} \left(U_1(u,v)-{\rm i} U_2(u,v) \right) \qquad \frac{2} {v} \int_{0}^{1} {\rm e}^{{\rm i} x^2 u/2} J_1(vx) {\rm d}x =\frac{2} {uv}\left({\rm e}^{-{\rm i}v^2/2u} + \left( J_0(v)- V_0(u,v)- {\rm i} V_1(u,v) \right) {\rm e}^{{\rm i} u/2} \right)\nonumber\\[2mm] &&F_{0,0}^{l_{\rm p}}( \rho',2)=1+\frac{2 {\rm e}^{-{\rm i}v^2/2u}} {uv}+ \frac{{\rm e}^{{\rm i} u/2}} {u} \left(\frac{2 J_0(v)} {v}- \frac{2 V_0(u,v)- 2{\rm i} V_1(u,v)}{v} - U_1(u,v) + {\rm i} U_2(u,v) \right) \end{eqnarray}$ $F_{0, 0}^{l_{p}} (ρ^{'}, 1) = 0 - \to F_{0, 0}^{l_{p}} (ρ^{'}) = F_{0, 0}^{l_{p}} (ρ^{'}, 2) .$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr3f} F_{0,0}^{l_{\rm p}}( \rho',1)=0 \longrightarrow F_{0,0}^{l_{\rm p}}( \rho')= F_{0,0}^{l_{\rm p}}( \rho',2) . \end{equation}$ (H.11)We include the diaphragm edge effect due to the Fresnel propagation only in the first term $F_{0, 0}^{l_{p}}$ $\hbox{$F_{0,0}^{l_{\rm p}}$}$ . The coronagraphic pupil with Fresnel propagation thus becomes $\begin{matrix} Π_{c}^{F} (ρ^{'}, θ^{'})_{c} = 2 i πN ϵ_{l_{p}} e^{i kz} \sum_{n,m} β_{n}^{m} (d_{c}) e^{i πN ρ^{' 2}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ^{'}} \frac{1}{2} [F_{n,m}^{l_{p}} (ρ^{'}) e^{i m θ^{'}} + F_{n, - m}^{l_{p}} (ρ^{'}) e^{- i m θ^{'}}] \\ Π_{c}^{F} (ρ^{'}, θ^{'})_{s} = 2 i πN ϵ_{l_{p}} e^{i kz} \sum_{n,m} β_{n}^{m} (d_{c}) e^{i πN ρ^{' 2}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ^{'}} \frac{1}{2 i} [F_{n,m}^{l_{p}} (ρ^{'}) e^{i m θ^{'}} + F_{n, - m}^{l_{p}} (ρ^{'}) e^{- i m θ^{'}}] . \end{matrix}$ $\appendix \setcounter{section}{8} \begin{eqnarray} \label{fr4} &&\Pi_{\rm c}^F(\rho',\theta')_{\rm c}=2{\rm i}\pi N \epsilon_{l_{\rm p}} {\rm e}^{{\rm i} k z} \sum_{n,m} \beta_n^m(d_{\rm c}) {\rm e}^{{\rm i} \pi N \rho'^{2}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}}\: {\rm e}^{{\rm i}l_{\rm p}\theta'} \frac{1} {2} \left[F_{n,m}^{l_{\rm p}}( \rho')\: {\rm e}^{{\rm i}m\theta'} +F_{n,-m}^{l_{\rm p}}( \rho') {\rm e}^{-{\rm i}m\theta'}\right]\\ &&\Pi_{\rm c}^F(\rho',\theta')_{\rm s}=2{\rm i}\pi N \epsilon_{l_{\rm p}} {\rm e}^{{\rm i} k z} \sum_{n,m} \beta_n^m(d_{\rm c}) {\rm e}^{{\rm i} \pi N \rho'^{2}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}}\: {\rm e}^{{\rm i}l_{\rm p}\theta'} \frac{1} {2{\rm i}}\left[F_{n,m}^{l_{\rm p}}( \rho')\: {\rm e}^{{\rm i}m\theta'} +F_{n,-m}^{l_{\rm p}}( \rho'){\rm e}^{-{\rm i}m\theta'}\right] .~~~~~~~~~~~~~ \end{eqnarray}$

Fig. H.1

$ℜ (ζ_{n,m}^{l_{p},F} (ρ,θ))$ $\hbox{$\Re\left(\zeta_{n,m}^{l_{\rm p},F}(\rho,\theta)\right)$}$ function in the coronagraphic pupil with an extreme case N = 4 to highlight differences on Zernike polynomials. In the real case N > 1000, and only the $ζ_{0, 0}^{l_{p},F}$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}$}$ must be taken into account in the NZ theory.

Fig. H.2

$ℑ (ζ_{n,m}^{l_{p},F} (ρ,θ))$ $\hbox{$\Im\left(\zeta_{n,m}^{l_{\rm p},F}(\rho,\theta)\right)$}$ function in the coronagraphic pupil with an extreme case N = 4 to highlight significant differences on Zernike polynomials.

H.2. Annular pupil

A telescope with a central obscuration ϵ must be treated with annular Zernike polynomials, but the central obscuration creates a strong diffraction phenomenon at the edge of the pupil (1 > ρ > ϵ). The effect of the optical aberration (n ≠ 0, m ≠ 0) gives $ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'})_{c} / ζ_{n,m}^{l_{p},F} (ρ^{'}, θ^{'})_{s} n \neq 0,m \neq 0 R_{n}^{| l_{p} \pm m |} (ρ) - \to R_{n}^{| l_{p} \pm m |} (ρ,ϵ) ϵ < ρ < 1 R_{| l_{p} \pm m |}^{n + 2} (1 / ρ) - \to R_{| l_{p} \pm m |}^{n + 2} (1 / ρ) ρ > 1$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr_an1} \zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')_{\rm c} / \zeta_{n,m}^{l_{\rm p},F}(\rho',\theta')_{\rm s} \qquad n\neq0,m\neq0 \qquad R_n^{|l_{\rm p} \pm m|}(\rho) \longrightarrow R_n^{|l_{\rm p} \pm m|}(\rho,\epsilon) \qquad \epsilon<\rho<1 \qquad R_{|l_{\rm p} \pm m|}^{n+2}(1/\rho) \longrightarrow R_{|l_{\rm p} \pm m|}^{n+2}(1/\rho) \qquad \rho>1\nonumber \end{equation}$ the $ζ_{0, 0}^{l_{p},F} (ρ^{'}, θ^{'})$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}(\rho',\theta')$}$ must be calculated taking into account the diffraction of the central obscuration. The $F_{0, 0}^{l_{p}} (ρ^{'})$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho')$}$ function included in the $ζ_{0, 0}^{l_{p},F}$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}$}$ can thus be written $F_{0, 0}^{l_{p}} (ρ^{'}) = F_{0, 0}^{l_{p}} (ρ^{'}, 2) + F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 2) + F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 3) .$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr_an2} F_{0,0}^{l_{\rm p}}( \rho')= F_{0,0}^{l_{\rm p}}( \rho',2) + F_{0,0}^{l_{\rm p}}( \rho',\epsilon,2) + F_{0,0}^{l_{\rm p}}( \rho',\epsilon,3) . \end{equation}$ (H.14)

$F_{0, 0}^{l_{p}} (ρ^{'}, 2)$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho',2)$}$ : The Fresnel diffraction of the outer edge of the pupil as calculated previously
$F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 2)$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho',\epsilon,2)$}$ : The Fresnel diffraction of the the central obscuration edge inside the pupil $F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 2) = F_{0, 0}^{l_{p}} (1 - ϵ - ρ^{'}, 2)$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho',\epsilon,2)=F_{0,0}^{l_{\rm p}}( 1-\epsilon-\rho',2)$}$ , ρ′ > ϵ
$F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 3)$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho',\epsilon,3)$}$ : Starlight diffraction on a perfect circular pupil.

$F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 3) = \int_{0}^{+ \infty} ρ^{1 - l_{p}} e^{i πN (ρ / ϵ)^{2}} J_{l_{p}} (2 πNρ ρ^{'} / ϵ) d ρ - \to F_{0, 0}^{l_{p}} (ρ^{'},ϵ, 3) = \frac{i^{l_{p} / 2} ϵ^{l_{p}} Γ [l_{p} / 2 + 2]}{2 (πN)^{l_{p} / 2} ρ^{' l_{p}}} 1 F_{1} (l_{p} / 2 + 2, 3, - i πN ϵ^{2} / ρ^{' 2})$ $\appendix \setcounter{section}{8} \begin{equation} \label{fr_an3} F_{0,0}^{l_{\rm p}}( \rho',\epsilon,3)=\int_{0}^{+\infty} \: \rho^{1-l_{\rm p}} {\rm e}^{{\rm i} \pi N (\rho/\epsilon)^{2}} J_{l_{\rm p} }(2 \pi N \rho \rho'/\epsilon ) {\rm d}\rho \quad \longrightarrow \quad F_{0,0}^{l_{\rm p}}( \rho',\epsilon,3)=\frac{{\rm i}^{l_{\rm p}/2}\epsilon^{l_{\rm p}} \Gamma[l_{\rm p}/2+2]}{2(\pi N)^{l_{\rm p}/2}\rho'^{l_{\rm p}}} {_1}F_1 \left(l_{\rm p}/2+2,3,-{\rm i}\pi N \epsilon^2/\rho'^2 \right) \end{equation}$ (H.15)

Appendix I: Scaled radial Zernike polynomial

ming Dai (2006) gives the Zernike coefficients of the scaled pupil as a function of those of the unscaled pupil, $β_{n}^{m} (d_{c}) = d_{c}^{n} (β_{n}^{m} + (n + 1) \cdot \sum_{i = 1}^{\frac{n_{\max} - n}{2}} β_{n + 2 i}^{m} \sum_{j = 0}^{i} \frac{(- 1)^{i + j} (n + i + j)!}{(n + j + 1)! (i - j)! j!} d_{c}^{2 j}) .$ $\appendix \setcounter{section}{9} \begin{equation} \label{scale_rho1} \beta_n^m(d_{\rm c})=d_{\rm c}^n \left(\beta_n^m +(n+1)\cdot \sum_{i=1}^{\frac{n_{\rm max}-n} {2}} \beta_{n+2i}^m \sum_{j=0}^{i} \frac{(-1)^{{\rm i}+j} (n+i+j)!} {(n+j+1)!(i-j)!j!} d_{\rm c}^{2j}\right) . \end{equation}$ (I.1)This expression is somewhat complicated. In this Appendix, we propose to calculate the scaled coefficients in a simpler way. The main idea is to transform $R_{n}^{m} (ρ) \to R_{n}^{m} (ρ d_{c})$ $\hbox{$R_n^m(\rho)\rightarrow R_n^m(\rho d_{\rm c})$}$ with the coronagraphic diaphragm d_c < 1. We need to calculate a new orthogonality relation: $\begin{matrix} D_{n n^{'}}^{m} (d_{c}) = \int_{0}^{1} R_{n}^{m} (ρ) R_{n^{'}}^{m} (ρ d_{c}) ρ d ρ (\forall n, n^{'} = m,m + 2, ...) - \to β_{n}^{m} (d_{c}) = 2 (n + 1) \sum_{n^{'}} β_{n^{'}}^{m} \cdot D_{n n^{'}}^{m} (d_{c}) \\ R_{n^{'}}^{m} (ρ d_{c}) = (- 1)^{(n^{'} - m) / 2} \int_{0}^{+ \infty} J_{n^{'} + 1} (r) J_{m} (ρ d_{c} r) d r 0 \leq ρ < 1 \\ D_{n n^{'}}^{m} (d_{c}) = (- 1)^{(n^{'} - m) / 2} \int_{0}^{+ \infty} J_{n^{'} + 1} (r) [\int_{0}^{1} R_{n}^{m} (ρ) J_{m} (ρ d_{c} r) ρ d ρ] d r \\ D_{n n^{'}}^{m} (d_{c}) = (- 1)^{(n^{'} + n - 2 m) / 2} \int_{0}^{+ \infty} \frac{J_{n^{'} + 1} (r) . J_{n + 1} (d_{c} r)}{d_{c} r} d r \end{matrix}$ $\appendix \setcounter{section}{9} \begin{eqnarray} \label{new_orth} && D_{nn'}^m(d_{\rm c})=\int_0^1 R_n^m(\rho)R_{n'}^m(\rho d_{\rm c})\rho {\rm d}\rho \qquad \left(\forall n\:,\:n'=m,m+2,... \right) \quad\longrightarrow\quad \beta_n^m(d_{\rm c})= 2(n+1) \sum_{n'} \beta_{n'}^m \cdot D_{nn'}^m(d_{\rm c}) \\ \label{new_sc1} && R_{n'}^m(\rho d_{\rm c})=(-1)^{(n'-m)/2} \int_0^{+\infty} J_{n'+1}(r)J_{m}(\rho d_{\rm c} r) {\rm d}r \qquad 0\leq\rho<1 \nonumber \\ && D_{nn'}^m(d_{\rm c})=(-1)^{(n'-m)/2} \int_0^{+\infty}J_{n'+1}(r) \left[ \int_0^1 R_{n}^m(\rho) J_{m}(\rho d_{\rm c} r) \rho {\rm d}\rho \right]{\rm d}r\nonumber \\ && D_{nn'}^m(d_{\rm c})=(-1)^{(n'+n-2m)/2} \int_0^{+\infty} \frac{J_{n'+1}(r).J_{n+1}(d_{\rm c} r)} {d_{\rm c} r} {\rm d}r\nonumber \end{eqnarray}$ (I.2) $\begin{matrix} \frac{J_{n + 1} (d_{c} r)}{d_{c} r} = \frac{J_{n} (d_{c} r) + J_{n + 2} (d_{c} r)}{2 (n + 1)} \\ D_{n n^{'}}^{m} (d_{c}) = \frac{(- 1)^{(n^{'} + n - 2 m) / 2}}{2 (n + 1)} [\int_{0}^{+ \infty} J_{n^{'} + 1} (r) J_{n} (d_{c} r) d r + \int_{0}^{+ \infty} J_{n^{'} + 1} (r) J_{n + 2} (d_{c} r) d r] \\ D_{n n^{'}}^{m} (d_{c}) = \frac{R_{n^{'}}^{n} (d_{c}) - R_{n^{'}}^{n + 2} (d_{c})}{2 (n + 1)} \cdot \end{matrix}$ $\appendix \setcounter{section}{9} \begin{eqnarray} \label{new_sc2} &&\frac{J_{n+1}(d_{\rm c} r)} {d_{\rm c} r} = \frac{J_{n}(d_{\rm c} r)+J_{n+2}(d_{\rm c} r)} {2(n+1)}\nonumber \\[2mm] && D_{nn'}^m(d_{\rm c})=\frac{(-1)^{(n'+n-2m)/2}} {2(n+1)} \left[\int_0^{+\infty}J_{n'+1}(r) J_{n}(d_{\rm c} r) {\rm d}r + \int_0^{+\infty}J_{n'+1}(r)J_{n+2}(d_{\rm c} r) {\rm d}r \right]\nonumber \\[2mm] && D_{nn'}^m(d_{\rm c})=\frac{R_{n'}^n(d_{\rm c})-R_{n'}^{n+2}(d_{\rm c})} {2(n+1)} \cdot \end{eqnarray}$ (I.3)Finally, the scaled $β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m(d_{\rm c})$}$ can be calculated more easily than with the complicated formula in ming Dai (2006) and we obtain $β_{n}^{m} (d_{c}) = \sum_{n^{'}} β_{n^{'}}^{m} \cdot [R_{n^{'}}^{n} (d_{c}) - R_{n^{'}}^{n + 2} (d_{c})^{]} n = m,m + 2, ... n^{'} = n,n + 2, ...$ $\appendix \setcounter{section}{9} \begin{equation} \label{new_orth_f} \beta_n^m(d_{\rm c})=\sum_{n'} \beta_{n'}^m \cdot \left[ R_{n'}^n(d_{\rm c})-R_{n'}^{n+2}(d_{\rm c}) \right] \qquad n=m,m+2,... \quad n'=n,n+2,... \end{equation}$ (I.4)Similarly, $D_{n n^{'}}^{' m} (d_{c}) = \int_{0}^{1} R_{n}^{m} (ρ d_{c}) R_{n^{'}}^{m} (ρ) ρ d ρ (\forall n, n^{'} = m,m + 2, ...) - \to β_{n}^{m} = 2 (n + 1) \sum_{n^{'}} β_{n^{'}}^{m} (d_{c}) \cdot D_{n n^{'}}^{' m} (d_{c}) .$ $\appendix \setcounter{section}{9} \begin{equation} \label{new_orth3} D_{nn'}^{'m}(d_{\rm c})=\int_0^1 R_n^m(\rho d_{\rm c})R_{n'}^m(\rho)\rho {\rm d}\rho \qquad \left(\forall n\:,\:n'=m,m+2,... \right) \quad\longrightarrow\quad \beta_n^m= 2(n+1) \sum_{n'} \beta_{n'}^m (d_{\rm c})\cdot D_{nn'}^{'m}(d_{\rm c}) . \end{equation}$ (I.5)In Eq. (I.3), we commute n and n′ $D_{n n^{'}}^{' m} (d_{c}) = \frac{R_{n}^{n^{'}} (d_{c}) - R_{n}^{n^{'} + 2} (d_{c})}{2 (n^{'} + 1)} - \to β_{n}^{m} = (n + 1) \sum_{n^{'}} β_{n^{'}}^{m} (d_{c}) \cdot \frac{[R_{n}^{n^{'}} (d_{c}) - R_{n}^{n^{'} + 2} (d_{c})^{]}}{(n^{'} + 1)} n = m,m + 2, ...$ $\appendix \setcounter{section}{9} \begin{equation} \label{new_sc3} D_{nn'}^{'m}(d_{\rm c})=\frac{R_{n}^{n'}(d_{\rm c})-R_{n}^{n'+2}(d_{\rm c})} {2(n'+1)}\quad \longrightarrow \quad \beta_n^m= (n+1) \sum_{n'} \beta_{n'}^m(d_{\rm c}) \cdot \frac{\left[ R_{n}^{n'}(d_{\rm c})-R_{n}^{n'+2}(d_{\rm c}) \right]} {(n'+1)} \qquad n=m,m+2,... \end{equation}$ (I.6)

Appendix J: Vortex phase-mask nulling limitation

This section presents the main mathematical properties of an imperfect vortex device owing to manufacturing defects. In real life, the optical vortex phase-mask presents three problems:

The phase-shift error with respect to π (chromaticity).
The region in the center of the mask presents a deviation from the overall large-scale pattern, called region of disorientation.
The finite size of the mask in the coronagraphic image plane.

J.1. The phase-shift error

For manufacturing defects, a VVC M_V can be modeled in the following way: $\begin{matrix} M_{V} = C_{M} + γ_{M} \cdot e^{\pm i l_{p} φ} C_{M} = \frac{1}{2} (\sqrt{T_{s}} + \sqrt{T_{p}} . e^{i (π - Δ φ)}) γ_{M} = \frac{1}{2} (\sqrt{T_{s}} - \sqrt{T_{p}} . e^{i (π - Δ φ)}) \\ V_{n,m}^{l_{p}} (r,φ, M_{V}) = C_{M} \cdot V_{n,m}^{0} + γ_{M} \cdot V_{n,m}^{l_{p}}, \end{matrix}$ $\appendix \setcounter{section}{10} \begin{eqnarray} \label{imprefect_mask} &&M_V=C_M + \gamma_M \cdot {\rm e}^{\pm {\rm i} l_{\rm p}\phi} \qquad C_M= \frac{1} {2} \left(\sqrt{T_{\rm s}}+\sqrt{T_{\rm p}}.{\rm e}^{{\rm i}(\pi-\Delta\phi)}\right) \qquad \gamma_M = \frac{1} {2} \left(\sqrt{T_{\rm s}}-\sqrt{T_{\rm p}}.{\rm e}^{{\rm i}(\pi-\Delta\phi)}\right)~~~~~~~~~~~ \\ &&V_{n,m}^{l_{\rm p}}(r,\phi,M_V)=C_M \cdot V_{n,m}^{0} + \gamma_M \cdot V_{n,m}^{l_{\rm p}} , \end{eqnarray}$ where T_s and T_p are the polarization transmittance for s and p polarization, respectively (depolarization issue), Δφ is the local phase retardance deviation around π with respect to the wavelength. In these conditions, the radial mode templates $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ are modified by the C_M,γ_M terms. If C_M = 1,γ_M = 0 the output image intensity is completely described by the classical NZ theory (without coronagraph). In the general case, the complete coronagraphic intensity development becomes more complicated and shows several coupled terms such as $(V_{n,m}^{l_{p}} . V_{N, 0}^{0 *}), (V_{n,m}^{0} . V_{N, 0}^{l_{p} *}), (V_{n,m}^{0} . {V_{N, 0}^{0 *}}^{)} .$ $\appendix \setcounter{section}{10} \begin{equation} \label{mod_imperfect_v} \nonumber \left(V_{n,m}^{l_{\rm p}}.V_{N,0}^{0*}\right)\:,\: \left(V_{n,m}^{0}.V_{N,0}^{l_{\rm p}*}\right)\:,\:\left(V_{n,m}^{0}.V_{N,0}^{0*}\right) . \end{equation}$ Generally, the term C_M ≈ 0. Moreover, remember that the first step of the wavefront retrieval process only uses the linear $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ terms. A simple correction of the vortex phase e^± il_pφ by γ_M allows us to reject all C_M Zernike polynomial terms in the $β_{0}^{0}$ $\hbox{$\beta_0^0$}$ coefficient. The C_M term can be seen as no POAM (Poynting 1909; Allen et al. 1999) added by the vortex coronagraph. Unfortunately, a few part of aberrations pass through the coronagraphic device without speckle modulation because of the Pancharatnam topological charge l_p effect.

J.2. The central region of disorientation

The finite mask extension is easily taking into account in our Nijboer-Zernike development by cutting high-order rings of the Bessel function in the $V_{n,m}^{l_{p}} (r,φ)$ $\hbox{$V_{n,m}^{l_{\rm p}}(r,\phi)$}$ functions. The central region of disorientation is the most important effect on of the coronagraphic rejection. Indeed, most of the stellar energy to be suppressed will fall on it. The uncertainty for the diameter of the central region of disorientation allows us to use a simplistic hole model. The nulling factor can be calculated with the modified NZ theory and the maximum nulling factor N_v is given by $N_{v} = {| 2 π \int_{0}^{s} J_{1} (πu) J_{0} (πu) d u |}^{2} - \to N_{v} = {| 1 - J_{0} (πs)^{2} |}^{2},$ $\appendix \setcounter{section}{10} \begin{equation} \label{null_int1} N_v=\left| 2\pi \int_0^s J_1(\pi u) J_0(\pi u) {\rm d}u \right|^2 \quad \longrightarrow \quad N_v=\left| 1-J_0(\pi s)^2\right|^2, \end{equation}$ (J.3)where s is the radius of the disorientation region expressed in λ/d units.

Covering this central region of disorientation with an opaque mask (small Lyot-like coronagraph) allows us to reach the chromatic limit, but create diffraction effects at the edge of the Lyot stop. In our NZ simulation, we can take the presence of this diffraction into account by changing the $V_{0, 0}^{l_{p}}$ $\hbox{$V_{0,0}^{l_{\rm p}}$}$ function (in the perfect case: $V_{0, 0}^{l_{p}} = 0$ $\hbox{$V_{0,0}^{l_{\rm p}}=0$}$ ). Indeed, the maximum coronagraphic rejection without optical aberration is given by the $β_{0}^{0} (d_{c})$ $\hbox{$\beta_0^0(d_{\rm c})$}$ term, $\begin{matrix} ζ_{0, 0}^{l_{p}} (ρ, s_{1}, s_{2},θ) = ϵ_{l_{p}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ} 2 π \int_{0}^{+ \infty} [Π (0, s_{2}) - Π (0, s_{1})] J_{1} (2 πr) J_{l_{p}} (2 πrρ) d r \\ ζ_{0, 0}^{l_{p}} (ρ, s_{1}, s_{2},θ) = H_{l_{p}} [J_{1} (2 πr)] \otimes H_{l_{p}} [Π (0, s_{2}) - Π (0, s_{1})] \\ H_{l_{p}} [Π (0,s)] = \frac{2 (πs)^{l_{p} + 1} ρ^{l_{p}}}{Γ [l_{p} + 2]} 2 F_{1} (\frac{l_{p} + 2}{2}, \frac{l_{p} + 3}{2}, l_{p} + 1, (i πsρ)^{2}) H_{l_{p}} [J_{1} (2 πr)] = ϵ_{l_{p}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ} R_{0}^{| l_{p} |} (ρ) \\ ζ_{0, 0}^{l_{p}} (ρ, s_{1}, s_{2},θ) = 2 ϵ_{l_{p}} (- 1)^{l_{p} / 2} i^{l_{p}} e^{i l_{p} θ} \frac{ρ^{l_{p}}}{Γ [l_{p} + 2]} Π (0, 1) \\ \times [(π s_{2})^{l_{p} + 1} 2 F_{1} (\frac{l_{p} + 2}{2}, \frac{l_{p} + 3}{2}, l_{p} + 1, (i π s_{2} ρ)^{2}) - (π s_{1})^{l_{p} + 1} 2 F_{1} (\frac{l_{p} + 2}{2}, \frac{l_{p} + 3}{2}, l_{p} + 1, (iπ s_{1} ρ)^{2})], \end{matrix}$ $\appendix \setcounter{section}{10} \begin{eqnarray} \label{null_int2} &&\zeta_{0,0}^{l_{\rm p}}(\rho,s_1,s_2,\theta)= \epsilon_{l_{\rm p}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}} \:{\rm e}^{{\rm i}l_{\rm p}\theta} 2\pi \int_{0}^{+\infty} \left[ \Pi(0,s_2) - \Pi(0,s_1)\right] J_1(2\pi r) J_{l_{\rm p}}(2\pi r \rho) {\rm d}r \\ &&\zeta_{0,0}^{l_{\rm p}}(\rho,s_1,s_2,\theta)=H_{l_{\rm p}}\left[J_1(2\pi r)\right] \otimes H_{l_{\rm p}}\left[\Pi(0,s_2) - \Pi(0,s_1)\right]\nonumber \\[2mm] &&H_{l_{\rm p}}\left[\Pi(0,s)\right]=\frac{2 (\pi s)^{l_{\rm p}+1}\rho^{l_{\rm p}}} {\Gamma[l_{\rm p}+2]} {_2}F_1\left(\frac{l_{\rm p}+2}{2}, \frac{l_{\rm p}+3}{2},l_{\rm p}+1, ({\rm i}\pi s \rho)^2\right)\qquad \qquad H_{l_{\rm p}}\left[J_1(2\pi r)\right]=\epsilon_{l_{\rm p}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}} \:{\rm e}^{{\rm i}l_{\rm p}\theta} R_0^{|l_{\rm p}|}(\rho)\nonumber \\[2mm] \label{null_int22} &&\zeta_{0,0}^{l_{\rm p}}(\rho,s_1,s_2,\theta)=2 \epsilon_{l_{\rm p}} (-1)^{l_{\rm p}/2} {\rm i}^{l_{\rm p}} \:{\rm e}^{{\rm i}l_{\rm p}\theta} \frac{\rho^{l_{\rm p}}}{\Gamma[l_{\rm p}+2]} \Pi(0,1) \\ &&\quad \times \left[(\pi s_2)^{l_{\rm p}+1} {_2}F_1\left(\frac{l_{\rm p}+2}{2}, \frac{l_{\rm p}+3}{2},l_{\rm p}+1, ({\rm i}\pi s_2 \rho)^2\right) - (\pi s_1)^{l_{\rm p}+1} {_2}F_1\left(\frac{l_{\rm p}+2}{2}, \frac{l_{\rm p}+3}{2},l_{\rm p}+1, (i\pi s_1 \rho)^2\right)\right] , \nonumber \end{eqnarray}$ where the vortex mask extent is [s₁,s₂] , Π(0,1) is the unit box function and ${}_{2}F_{1}$ $\hbox{${_2}F_1$}$ is the Gauss hypergeometric function. The $ζ_{0, 0}^{l_{p}}$ $\hbox{$\zeta_{0,0}^{l_{\rm p}}$}$ function inside the pupil can be approximated by a simple parabolic function: 10^-5ρ² 0 < ρ < 1 l_p = 2 as a simple defocus.

Appendix K: Lyot stop optimization

The diaphragm in the coronagraphic pupil plane must be optimized to obtain a good ratio between the nulling factor and the planet throughput attenuation because of this Lyot stop. The total nulling factor ϵ_c can be defined as follows: $ϵ_{c} = \sum_{n,m} {| β_{cn}^{m} (d_{c}) + β_{sn}^{m} (d_{c}) |}^{2} .$ $\appendix \setcounter{section}{11} \begin{equation} \label{tr2bis} \epsilon_{\rm c}=\sum_{n,m} \left|\beta_{\rm cn}^m(d_{\rm c})+\beta_{\rm sn}^m(d_{\rm c})\right|^2 . \end{equation}$ (K.1)The coronagraphic pupil presents in real life three different diffraction patterns: a smooth ρ² residual light in the pupil owing to the small Lyot coronograph in the center (pseudo-apodization), a light modulation inside the pupil owing to optical aberrations, and finally a strong light gradient in the edge of the pupil with diffraction oscillations.

K.1. Residual optical aberrations

The diaphragm optimization in the presence of aberrations can be easily calculated as follows: $\frac{d E_{p}}{d x} = \frac{E (d_{c}) - E (d_{c} - d x)}{d x} - \to E_{p} (d_{c}) = \frac{1}{π d_{c}^{2}} \int_{0}^{2 π} \int_{0}^{d_{c}} ρ {| \sum_{n,m} \frac{1}{n^{pw}} i^{m} ζ_{n,m}^{l_{p}} |}^{2} d ρ d θ,$ $\appendix \setcounter{section}{11} \begin{equation} \frac{{\rm d}E_{\rm p}} {{\rm d}x}=\frac{E(d_{\rm c})-E(d_{\rm c}-{\rm d}x)} {{\rm d}x} \quad \longrightarrow \quad E_{\rm p}(d_{\rm c})= \frac{1} {\pi d_{\rm c}^2} \int_0^{2\pi} \int_0^{d_{\rm c}} \rho \left|\sum_{n,m} \frac{1} {n^{pw}} {\rm i}^m \zeta_{n,m}^{l_{\rm p}}\right|^2 {\rm d}\rho {\rm d}\theta , \end{equation}$ (K.2)where 1/n^pw is the power spectral density of the optical aberrations. We consider two extreme cases: pw = 1 (strong aberration residuals on an adaptive optics for example) and pw = 2 (optical polishing). Results are given in the two following figures (Fig. K.1):

Fig. K.1

Variation of the residual energy (dE_p/dx) in the residual optical aberrations as a function of the number of Zernike polynomials used (nz – without the piston term) and the power spectral density 1/n^pw. The figure shows two diaphragm values: when the gain is maximum and when it is equal to zero. The figure in 1/n presents strong residual oscillations owing to low-order aberrations.

K.2. Diffraction oscillations

We need to calculate the $ζ_{0, 0}^{l_{p},F} (ρ^{'})$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}( \rho')$}$ function (including $F_{0, 0}^{l_{p}} (ρ^{'})$ $\hbox{$F_{0,0}^{l_{\rm p}}( \rho')$}$ ) previously given in Appendix H. In the visible/near-infrared, these oscillations are generally limited to the region 0.8 < ρ < 1 (see numerical simulations in Riaud & Schneider 2007).

Fig. K.2

Fresnel diffraction of the Lyot stop with an optical vortex coronagraph. We present the real and the imaginary parts of the $ζ_{0, 0}^{l_{p},F} (ρ^{'})$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}( \rho')$}$ function for four different Fresnel number N. The last figure shows the residual intensity in the coronagraphic pupil plane when we choose a d_c radius of 86% (vertical dotted line).

To minimize the Fresnel residual diffraction in the coronagraphic pupil plane, it is necessary to increase the Fresnel number N. For high-contrast imaging devoted to the research of telluric planets around nearby stars (10^-10 contrast level), the diaphragm (86%) has to be positioned with ± 50 μm accuracy with respect to the real pupil position (N > 10 000) along the optical axis. To simplify the aberration retrieval process, the $ζ_{0, 0}^{l_{p}}$ $\hbox{$\zeta_{0,0}^{l_{\rm p}}$}$ only contains pseudo-apodization pattern owing to the small Lyot dot in the center of the mask and not the one caused by the propagation of the Lommel function of two variables given by the $F_{0, 0}^{l_{p}}$ $\hbox{$F_{0,0}^{l_{\rm p}}$}$ term. Indeed, these functions are somewhat negligible if we use the proper Lyot stop diameter (86% in the case presented here) and a good diaphragm position.

K.3. Annular pupil

The case of the annular pupil is relatively simple, because all simulations already given in the previous section can be used for a larger diameter of the Lyot stop. Concerning the optimization of the smaller diameter of the Lyot stop, the value is completely governed by the starlight diffraction of the telescope obscuration ( ∝ ρ^− l_p). For an obscuration ϵ = 0.14 (Very Large Telescope) an inner radius (d_in) of 0.3 and l_p = ± 2 allows one to obtain a total nulling factor of |ϵ²(ln(d_in) − ln(ϵ))|² ≈ 1/4500.

Appendix L: Spider diffraction in the Fresnel mode

A telescope with a central obscuration possesses a complex spider to hold the secondary mirror. We present here the Fresnel development of the residual diffraction caused by a thin spider in the coronagraphic pupil with the vortex coronagraph.

First of all, we calculate this effect in the classical cartesian coordinates (x,y) taking into account the vortex term (e^il_pθ) and after that, we transform the result into the polar coordinates. We follow the Fresnel notation given in the previous analytical section: ρ → ρ′, θ → θ′, and now x → x′, y → y′. A spider is defined as follows: x is ( ± th/2) the thichness of the spider in radius unit and y is (ϵ to 1) the spider length. The spider is on azimuthal position θ_n. Generally, a telescope possesses three or four spiders. For the VLT-UT telescopes, we sum in amplitude the diffraction contribution of four different spiders: n = 1,2,3,4 with azimuthal angle of 109 degres. The Fresnel diffraction U_n(x′,y′) of one spider becomes $\begin{matrix} U_{n} (x^{'}, y^{'}) = 1 - \frac{e^{i kz}}{i λz} \int_{- th / 2}^{+ th / 2} \int_{ϵ}^{1} e^{i k (x^{'} - x)^{2} / 2 z} e^{i k (y^{'} - y)^{2} / 2 z} e^{i l_{p} (θ_{n} - ATan (y / x))} d x d y \\ N_{x} = t h^{2} / (4 λz) N_{y} = (1 - ϵ)^{2} / (4 λz) ξ_{x} = \sqrt{\frac{k}{πz}} (x - x^{'}) ξ_{y} = \sqrt{\frac{k}{πz}} (y - y^{'}) . \end{matrix}$ $\appendix \setcounter{section}{12} \begin{eqnarray} \label{spider1} &&U_n(x',y')=1-\frac{{\rm e}^{{\rm i} k z}} {{\rm i}\lambda z} \int_{-th/2}^{+th/2} { \int_{\epsilon}^{1} {{\rm e}^{{\rm i}k(x'-x)^2/2z} {\rm e}^{{\rm i}k(y'-y)^2/2z} {\rm e}^{{\rm i} l_{\rm p} (\theta_n-ATan(y/x))}{\rm d}x} {\rm d}y}\\ &&N_x=th^2/(4\lambda z) \quad N_y=(1-\epsilon)^2/(4\lambda z) \quad \xi_x=\sqrt{\frac{k}{\pi z} }(x-x') \quad \xi_y=\sqrt{\frac{k}{\pi z}}(y-y') . \nonumber \end{eqnarray}$ (L.1)After some variable changes, we obtain the spider diffraction as a function of the modified Fresnel integrals C(ξ,l_p) and S(ξ,l_p) $U_{n} (x^{'}, y^{'}) = 1 - \frac{e^{i l_{p} θ_{n}}}{2 i} ([C (ξ_{x 2}, l_{p}) + i S (ξ_{x 2}, l_{p})^{]} - {[C (ξ_{x 1}, l_{p}) + i S (ξ_{x 1}, l_{p})^{]}}^{)} \cdot ([C (ξ_{y 2}, l_{p}) + i S (ξ_{y 2}, l_{p})^{]} - {[C (ξ_{y 1}, l_{p}) + i S (ξ_{y 1}, l_{p})^{]}}^{)}$ $\appendix \setcounter{section}{12} \begin{equation} \label{spider2} U_n(x',y')=1-\frac{{\rm e}^{{\rm i} l_{\rm p} \theta_n}}{2{\rm i}} \left(\left[C(\xi_{x2},l_{\rm p})+{\rm i}S(\xi_{x2},l_{\rm p})\right]-\left[C(\xi_{x1},l_{\rm p})+{\rm i}S(\xi_{x1},l_{\rm p})\right] \right) \cdot \left(\left[C(\xi_{y2},l_{\rm p})+{\rm i}S(\xi_{y2},l_{\rm p})\right]-\left[C(\xi_{y1},l_{\rm p})+{\rm i}S(\xi_{y1},l_{\rm p})\right] \right) \end{equation}$ (L.2)where $\begin{matrix} C (α_{x / y}, l_{p}) = \int_{0}^{α_{x / y}} \cos (π / 2 \cdot {ξ_{x / y}^{2}}^{)} \cos (l_{p} \cdot ArcTan (\frac{ξ_{y} \sqrt{(zλ / 2)} + y^{'}}{ξ_{x} \sqrt{(zλ / 2)} + x^{'}})) d ξ_{x / y} \\ S (α_{x / y}, l_{p}) = \int_{0}^{α_{x / y}} \sin (π / 2 \cdot {ξ_{x / y}^{2}}^{)} \sin (l_{p} \cdot ArcTan (\frac{ξ_{y} \sqrt{(zλ / 2)} + y^{'}}{ξ_{x} \sqrt{(zλ / 2)} + x^{'}})) d ξ_{x / y} . \end{matrix}$ $\appendix \setcounter{section}{12} \begin{eqnarray} \label{spider3} &&C(\alpha_{x/y},l_{\rm p})=\int_{0}^{\alpha_{x/y}} \cos\left(\pi/2\cdot \xi_{x/y}^2\right) \cos\left(l_{\rm p}\cdot {\rm ArcTan}\left(\frac{\xi_{y}\sqrt{(z\lambda/2)}+y'}{\xi_{x}\sqrt{(z\lambda/2)}+x'}\right)\right){\rm d}\xi_{x/y} \\[2mm] &&S(\alpha_{x/y},l_{\rm p})=\int_{0}^{\alpha_{x/y}} \sin\left(\pi/2\cdot \xi_{x/y}^2\right) \sin\left(l_{\rm p}\cdot {\rm ArcTan}\left(\frac{\xi_{y}\sqrt{(z\lambda/2)}+y'}{\xi_{x}\sqrt{(z\lambda/2)}+x'}\right)\right){\rm d}\xi_{x/y} .~~~~~~~~~~~~~~ \end{eqnarray}$ These modified Fresnel integrals are relatively complicated in cartesian coordinates. Now, we use the polar coordinates to calculate this diffraction in a more convenient way: $\begin{matrix} x \to ρ \sin (| θ_{n} - θ |) y \to ρ x^{'} \to ρ^{'} \sin (| θ_{n} - θ^{'} |) y^{'} \to ρ^{'} \\ U_{n} (ρ^{'}, θ^{'}) = 1 - \frac{e^{i l_{p} θ_{n}}}{2 i} ([C (ξ_{s 2}, l_{p}) + i S (ξ_{s 2}, l_{p})^{]} - {[C (ξ_{s 1}, l_{p}) + i S (ξ_{s 1}, l_{p})^{]}}^{)} \cdot ([C (ξ_{ρ 2}) + i S (ξ_{ρ 2})^{]} - {[C (ξ_{ρ 1}) + i S (ξ_{ρ 1})^{]}}^{)} . \end{matrix}$ $\appendix \setcounter{section}{12} \begin{eqnarray} \label{spider4} &&x \rightarrow \rho\sin(|\theta_n-\theta|) \quad y \rightarrow \rho \qquad x' \rightarrow \rho'\sin(|\theta_n-\theta'|) \quad y' \rightarrow \rho' \nonumber \\ &&U_n(\rho',\theta')=1-\frac{{\rm e}^{{\rm i} l_{\rm p} \theta_n}}{2{\rm i}} \left(\left[C(\xi_{s2},l_{\rm p})+{\rm i}S(\xi_{s2},l_{\rm p})\right]-\left[C(\xi_{s1},l_{\rm p})+{\rm i}S(\xi_{s1},l_{\rm p})\right] \right) \cdot \left(\left[C(\xi_{\rho2})+{\rm i}S(\xi_{\rho2})\right]-\left[C(\xi_{\rho1})+{\rm i}S(\xi_{\rho1})\right] \right) . \end{eqnarray}$ (L.5)In this coordinate transform y depends only on ρ and the diffraction integral on ρ′ does not depend on the θ coordinate. The Fresnel integrals become $\begin{matrix} C (ξ_{ρn}) = \int_{0}^{ξ_{ρn}} \cos (π / 2 \cdot ξ^{2}) d ξ ξ_{ρ 1} = - \sqrt{8 N_{y}} (1 / 2 + ρ^{'} / (1 - ϵ)) \\ S (ξ_{ρn}) = \int_{0}^{ξ_{ρn}} \sin (π / 2 \cdot ξ^{2}) d ξ ξ_{ρ 2} = \sqrt{8 N_{y}} (1 / 2 - ρ^{'} / (1 - ϵ)) . \end{matrix}$ $\appendix \setcounter{section}{12} \begin{eqnarray} \label{spider6a} &&C(\xi_{\rho n})=\int_{0}^{\xi_{\rho n}} \cos\left(\pi/2\cdot \xi^2\right) {\rm d}\xi \qquad \xi_{\rho1}=-\sqrt{8N_y}\left(1/2+\rho'/(1-\epsilon)\right) \nonumber\\ &&S(\xi_{\rho n})=\int_{0}^{\xi_{\rho n}} \sin\left(\pi/2\cdot \xi^2\right) {\rm d}\xi \qquad \xi_{\rho2}=\sqrt{8N_y}\left(1/2-\rho'/(1-\epsilon)\right) .\nonumber \end{eqnarray}$ In the coronagraphic pupil plane, we use a Lyot-stop for a proper starlight filtering, but we also notice that this diaphragm suppresses the Fresnel diffraction on the ρ′ axis. The spider creates only residual diffraction on the x′ coordinate. The Fresnel oscillations in the edges of the spider depend on the ρ′ and the θ′ coordinates. In these conditions, the modified Fresnel integrals must be simplified in the ArcTan() using exponential algebra for a fixed value of l_p: $\begin{matrix} C (ξ_{sn}, l_{p} = \pm 2) = \int_{0}^{ξ_{sn}} \cos (π / 2 \cdot ξ^{2}) (\frac{4}{zλ ξ^{2} - \cos (2 (| θ_{n} - θ^{'} |)) + \sqrt{8 zλ} ξ \sin (| θ_{n} - θ^{'} |) + 3} - 1) d ξ \\ S (ξ_{sn}, l_{p} = \pm 2) = \int_{0}^{ξ_{sn}} \sin (π / 2 \cdot ξ^{2}) (\frac{4 (\sqrt{zλ / 2} ξ + \sin (| θ - n - θ^{'} |)^{)}}{zλ ξ^{2} - \cos (2 (| θ_{n} - θ^{'} |)) + \sqrt{8 zλ} ξ \sin (| θ_{n} - θ^{'} |) + 3}) d ξ \end{matrix}$ $\appendix \setcounter{section}{12} \begin{eqnarray} \label{spider6b} &&C(\xi_{\rm sn},l_{\rm p}=\pm 2)=\int_{0}^{\xi_{\rm sn}} \cos\left(\pi/2\cdot \xi^2\right) \left(\frac{4}{z\lambda \xi^2-\cos(2(|\theta_n-\theta'|))+\sqrt{8z\lambda}\xi \sin(|\theta_n-\theta'|)+3}-1 \right) {\rm d}\xi ~~~~~~~~~~~~~~~~~~~~~~~~~\\[2mm] &&S(\xi_{\rm sn},l_{\rm p}=\pm 2)=\int_{0}^{\xi_{\rm sn}} \sin\left(\pi/2\cdot \xi^2\right) \left(\frac{4\left(\sqrt{z\lambda/2}\xi+\sin(|\theta-n-\theta'|)\right)}{z\lambda \xi^2-\cos(2(|\theta_n-\theta'|))+\sqrt{8z\lambda}\xi \sin(|\theta_n-\theta'|)+3} \right){\rm d}\xi \end{eqnarray}$ $ξ_{s 1} = - \sqrt{8 N_{x}} (1 / 2 + ρ^{'} \sin (| θ_{n} - θ^{'} |) / th) ξ_{s 2} = \sqrt{8 N_{x}} (1 / 2 - ρ^{'} \sin (| θ_{n} - θ^{'} |) / th) .$ $\appendix \setcounter{section}{12} \begin{equation} \xi_{s1}=-\sqrt{8N_x}\left(1/2+\rho'\sin(|\theta_n-\theta'|)/th\right) \qquad \xi_{s2}=\sqrt{8N_x}\left(1/2-\rho'\sin(|\theta_n-\theta'|)/th\right) .\nonumber \end{equation}$ The Fig. L.1 shows numerical simulation results for the vortex and simple Fresnel diffraction regime.

Fig. L.1

Spider Fresnel diffraction calculation for a VLT coronagraphic pupil (ϵ = 0.14) and the diameter of the coronagraphic pupil of 18 mm. The figure shows a comparison between the classical Fresnel diffraction (l_p = 0) and the optical propagation including the vortex (l_p = 2). The thickness of the spider is 225 μm.

Appendix M: Intensity in the final coronographic plane

Now, we present the coronagraphic intensity in the final plane using the $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ function definition provided by the Fraunhofer diffraction. Indeed, the Fresnel propagation of aberrations is somewhat complicated and in the high Fresnel number regime (N > 10 000) the image result is very close (err < 10^-5) to the Fraunhofer diffraction. We use here the Fraunhofer diffraction in the final coronagraphic intensity. The intensity on the final coronagraphic plane is given by the square modulus of the complex amplitude: $\begin{matrix} I_{c} (r,φ, l_{p}) = | U_{v} (r,φ, l_{p}) |^{2} \\ I_{c} (r,φ, l_{p}) = 4 {| ℜ (U_{a} + U_{b} + U_{c}) |}^{2} + 4 {| ℑ (U_{a} + U_{b} + U_{c}) |}^{2} \\ I_{c} (r,φ, l_{p}) = 4 ({| U_{a} |}^{2} + {| U_{b} |}^{2} + {| U_{c} |}^{2}) + 8 (ℜ (U_{a} . U_{b} + U_{a} . U_{c} + U_{b} . U_{c}) + ℑ (U_{a} . U_{b} + U_{a} . U_{c} + U_{b} . U_{c})) . \end{matrix}$ $\appendix \setcounter{section}{13} \begin{eqnarray} \label{I1} &&I_{\rm c}(r,\phi,l_{\rm p})=|U_v(r,\phi,l_{\rm p})|^2\\ &&I_{\rm c}(r,\phi,l_{\rm p})=4\left|\Re\left(U_a+U_b+U_{\rm c}\right)\right|^2+4\left|\Im\left(U_a+U_b+U_{\rm c}\right)\right|^2\nonumber\\ &&I_{\rm c}(r,\phi,l_{\rm p})=4\left(\left|U_a\right|^2+\left|U_b\right|^2+\left|U_{\rm c}\right|^2\right)+8\left(\Re\left(U_a.U_b+U_a.U_{\rm c}+U_b.U_{\rm c}\right)+\Im\left(U_a.U_b+U_a.U_{\rm c}+U_b.U_{\rm c}\right)\right) .\nonumber \end{eqnarray}$ (M.1)The Lyot stop action is equivalent to a normalization of the radial Zernike polynomials with the diaphragm size d_c < 1. This is a homothetic transformation in the ρ coordinate and $β_{n}^{m} \to β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m \rightarrow \beta_n^m(d_{\rm c})$}$ , $U_{a} = β_{N}^{0} (d_{c}) V_{N, 0}^{l_{p}} U_{b} = \sum_{n,m}^{~} i^{m} β_{cn}^{m} (d_{c}) V_{n,m}^{l_{p}} C_{m} U_{c} = \sum_{n,m}^{~} i^{m} β_{sn}^{m} (d_{c}) V_{n,m}^{l_{p}} S_{m} .$ $\appendix \setcounter{section}{13} \begin{equation} \label{I2} U_a=\beta_N^0(d_{\rm c}) \: V_{N,0}^{l_{\rm p}} \qquad U_b=\sum_{n,m}^{\sim} {\rm i}^m \beta_{\rm cn}^m (d_{\rm c})\: V_{n,m}^{l_{\rm p}} C_m \qquad U_{\rm c}=\sum_{n,m}^{\sim}{\rm i}^m \beta_{\rm sn}^m (d_{\rm c})\: V_{n,m}^{l_{\rm p}} S_m . \end{equation}$ (M.2)The coronagraphic intensity consists of a linear part f⁽¹⁾ and a quadratic part f⁽²⁾ as follows: $I_{c} (r,φ, l_{p}) = 4 {(β_{N}^{0} (d_{c})^{)}}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2} + f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] + f^{(2)} [(β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]} .$ $\appendix \setcounter{section}{13} \begin{equation} \label{I2b} I_{\rm c}(r,\phi,l_{\rm p}) =4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2+f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]+f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right] . \end{equation}$ (M.3)Developing the squared terms yields $\begin{matrix} f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] = \\ 8 \sum_{n,m}^{~} [(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c}))^{)} \cdot ℜ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) C_{m} (φ)] \\ - 8 \sum_{n,m}^{~} [(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c}))^{)} \cdot ℑ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) C_{m} (φ)] \\ + 8 \sum_{n,m}^{~} [(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c}))^{)} \cdot ℜ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) S_{m} (φ)] \\ - 8 \sum_{n,m}^{~} [(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c}))^{)} \cdot ℑ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *}) S_{m} (φ)] \\ f^{(1)} [β_{cn}^{m} (d_{c}), β_{sn}^{m} (d_{c})] = \sum_{n,m}^{~} [(A_{c}^{m} C_{m} (φ) + A_{s}^{m} S_{m} (φ)) 8 ℜ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *})] - \sum_{n,m}^{~} [(B_{c}^{m} C_{m} (φ) + B_{s}^{m} S_{m} (φ)) 8 ℑ (i^{m} V_{n,m}^{l_{p}} V_{N, 0}^{l_{p} *})] . \end{matrix}$ $\appendix \setcounter{section}{13} \begin{eqnarray} \label{I3} &&f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]=\\ &&\qquad 8\: \sum_{n,m}^{\sim} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))\right)\cdot \Re\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\:C_m(\phi)\right]\nonumber\\ &&\qquad -8\: \sum_{n,m}^{\sim} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))\right)\cdot \Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\:C_m(\phi)\right]\nonumber\\ &&\qquad +8\: \sum_{n,m}^{\sim} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))\right)\cdot \Re\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\:S_m(\phi)\right]\nonumber\\ &&\qquad -8\: \sum_{n,m}^{\sim} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))\right)\cdot \Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\:S_m(\phi)\right]\nonumber \\ \label{I3b} && f^{(1)}\left[\beta_{\rm cn}^m(d_{\rm c}),\beta_{\rm sn}^m(d_{\rm c})\right]= \sum_{n,m}^{\sim}\left[\left(A_{\rm c}^m C_m(\phi) + A_{\rm s}^m S_m(\phi)\right) 8\Re\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\right] - \sum_{n,m}^{\sim}\left[\left(B_{\rm c}^m C_m(\phi) + B_{\rm s}^m S_m(\phi) \right) 8\Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} \: V_{N,0}^{l_{\rm p}*}\right)\right] . ~~~~~~~~~~~~~ \end{eqnarray}$ The $A_{c}^{m}, A_{s}^{m}, B_{c}^{m}, B_{s}^{m}$ $\hbox{$A_{\rm c}^m,A_{\rm s}^m,B_{\rm c}^m,B_{\rm s}^m$}$ are defined as follows $\begin{matrix} A_{c}^{m} = ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) A_{s}^{m} = ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) \\ B_{c}^{m} = ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) B_{s}^{m} = ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) \\ f^{(2)} [(β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]} = \sum_{n,m}^{~} [(- 1)^{m} ({(β_{cn}^{m} (d_{c}))}^{2} C_{m}^{2} (φ) + {(β_{sn}^{m} (d_{c}))}^{2} S_{m}^{2} (φ)^{)} \cdot {(V_{n,m}^{l_{p}})}^{2}] \\ + 8 \sum_{n,m}^{~} [ℜ (i^{m} β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c})) + ℑ (i^{m} β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c})) \cdot {(V_{n,m}^{l_{p}})}^{2} C_{m} (φ) S_{m} (φ)] . \end{matrix}$ $\appendix \setcounter{section}{13} \begin{eqnarray} \label{I3C} &&A_{\rm c}^m=\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c})) \qquad A_{\rm s}^m=\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c})) \\[1mm] && B_{\rm c}^m=\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c})) \qquad B_{\rm s}^m=\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c})) \\ &&f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right] = \sum_{n,m}^{\sim}\left[ (-1)^m\left(\left(\beta_{\rm cn}^m(d_{\rm c})\right)^2 \:C^2_m(\phi) + \left(\beta_{\rm sn}^m(d_{\rm c})\right)^2\:S^2_m(\phi)\right)\cdot \left(V_{n,m}^{l_{\rm p}}\right)^2\right] \\ &&\hspace*{6.5cm}+ 8 \sum_{n,m}^{\sim}\left[\Re({\rm i}^m \beta_{\rm cn}^m(d_{\rm c}) \cdot \beta_{\rm sn}^m(d_{\rm c}))+\Im ({\rm i}^m \beta_{\rm cn}^m(d_{\rm c}) \cdot \beta_{\rm sn}^m(d_{\rm c}))\cdot \left(V_{n,m}^{l_{\rm p}}\right)^2 \:C_m(\phi)\:S_m(\phi)\right] . \nonumber \end{eqnarray}$

Appendix N: Derivation of $Ψ_{meas}^{m} (r, l_{p})$ $\hbox{$\mathsfsl{\Psi_{\sf meas}^{m}(r,l_{\sf p})}$}$

We now define two sets of functions $Ψ_{n,m}^{l_{p}}$ $\hbox{$\Psi_{n,m}^{l_{\rm p}}$}$ and $χ_{n,m}^{l_{p}}$ $\hbox{$\chi_{n,m}^{l_{\rm p}}$}$ : $Ψ_{n,m}^{l_{p}} (r) = - 8 ϵ_{m}^{-1} ℑ (i^{m} V_{n,m}^{l_{p}} . V_{N, 0}^{l_{p} *}) χ_{n,m}^{l_{p}} (r) = 8 ϵ_{m}^{-1} ℜ (i^{m} V_{n,m}^{l_{p}} . V_{N, 0}^{l_{p} *}),$ $\appendix \setcounter{section}{14} \begin{equation} \label{f1} \Psi_{n,m}^{l_{\rm p}}(r)=-8 \epsilon_{m}^{-1}\Im\left({\rm i}^m V_{n,m}^{l_{\rm p}} . V_{N,0}^{l_{\rm p}*}\right)\qquad \chi_{n,m}^{l_{\rm p}}(r)=8 \epsilon_{m}^{-1}\Re\left({\rm i}^m V_{n,m}^{l_{\rm p}}. V_{N,0}^{l_{\rm p}*}\right) , \end{equation}$ (N.1)with ϵ_m = 1 for m = 0 and 2 in all other cases.

We have already defined the inner product: $\begin{matrix} (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = \int_{- \infty}^{+ \infty} \int_{0}^{+ \infty} Ψ_{n,m}^{l_{p}} . χ_{n^{'},m}^{l_{p} *} r d r d l_{p} \forall l_{p} \in N - \to (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = \frac{1}{2 | l_{p} |} \sum_{- | l_{p} |}^{+ | l_{p} |} \int_{0}^{+ \infty} Ψ_{n,m}^{l_{p}} . χ_{n^{'},m}^{l_{p} *} r d r \\ (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = - 8 ϵ_{m}^{-1} \int_{- \infty}^{+ \infty} \int_{0}^{+ \infty} ℜ (i^{m}) ℑ (i^{m}) (V_{n,m}^{l_{p}} . V_{n^{'},m}^{l_{p} *} . {| V_{N, 0}^{l_{p} *} |}^{2}) r d r d l_{p} \\ (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = (χ_{n,m}^{l_{p}}, Ψ_{n^{'},m}^{l_{p}}) \forall n, n^{'} (Ψ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) = 0 \forall n, n^{'} = m,m + 2, ... ({| V_{N, 0}^{l_{p}} |}^{2}, Ψ_{n^{'}, 0}^{l_{p}}) = 0 \forall n^{'} . \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f_inner_p3} &&\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\int_{-\infty}^{+\infty}\int_0^{+\infty}\Psi_{n,m}^{l_{\rm p}}.\chi_{n',m}^{l_{\rm p}*}r{\rm d}r {\rm d}l_{\rm p} \qquad \forall l_{\rm p} \in N \quad \longrightarrow \quad \left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\frac{1} {2|l_{\rm p}|}\sum_{-|l_{\rm p}|}^{+|l_{\rm p}|}\int_0^{+\infty}\Psi_{n,m}^{l_{\rm p}}.\chi_{n',m}^{l_{\rm p}*}r{\rm d}r \\ &&\nonumber\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=-8 \epsilon_{m}^{-1}\int_{-\infty}^{+\infty}\int_0^{+\infty}\Re\left({\rm i}^m\right)\Im\left({\rm i}^m\right) \left( V_{n,m}^{l_{\rm p}} . V_{n',m}^{l_{\rm p}*} . \left|V_{N,0}^{l_{\rm p}*}\right|^2\right) r{\rm d}r {\rm d}l_{\rm p}\\ &&\nonumber\left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\left(\chi_{n,m}^{l_{\rm p}}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \quad \forall n\:,\:n' \qquad \left(\Psi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right) = 0 \quad \forall n\:,\:n' = m,m+2,...\qquad \left(\left| V_{N,0}^{l_{\rm p}}\right|^2\:,\:\Psi_{n',0}^{l_{\rm p}}\right) = 0 \quad \forall n'. \end{eqnarray}$ (N.2)The aberrated images are composed of several modes in sine and cosine functions. The first step of the image analysis is the decoupling of these modes, which can be performed with a polar Fourier transform F_p of the coronagraphic images. It is given by $F_{p} (I_{c}) = \frac{1}{2 π} \int_{0}^{2 π} I_{c} (r,φ, l_{p}) e^{i mφ} d φ Ψ_{c}^{m} (r, l_{p}) = F_{p} (I_{c}) Ψ_{meas}^{m} (r, l_{p}) = F_{p} (I_{meas}) .$ $\appendix \setcounter{section}{14} \begin{equation} F_{\rm p} \left(I_{\rm c}\right)=\frac{1}{2\pi} \int_0^{2\pi}I_{\rm c}(r,\phi,l_{\rm p}){\rm e}^{{\rm i} m\phi} {\rm d}\phi \qquad \qquad \Psi_{\rm c}^{m}(r,l_{\rm p})=F_{\rm p} \left(I_{\rm c}\right) \qquad \Psi_{\rm meas}^{m}(r,l_{\rm p})=F_{\rm p} \left(I_{\rm meas}\right) . \end{equation}$ (N.3)In the perfect case without photon and readout noise, the Nijboer-Zernike analysis shows that $Ψ_{meas}^{m} (r, l_{p}) = Ψ_{c}^{m} (r, l_{p})$ $\hbox{$\Psi_{\rm meas}^{m}(r,l_{\rm p})=\Psi_{\rm c}^{m}(r,l_{\rm p})$}$ .

For a linear approximation of the final coronagraphic intensity, $Ψ_{meas}^{m} (r, l_{p}) \approx \frac{1}{2 π} \int_{0}^{2 π} (4 {(β_{N}^{0} (d_{c})^{)}}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2} + f^{(1)} [β_{n}^{m} (d_{c})]) e^{i mφ} d φ .$ $\appendix \setcounter{section}{14} \begin{equation} \Psi_{\rm meas}^{m}(r,l_{\rm p})\approx\frac{1}{2\pi} \int_0^{2\pi} \left(4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2+f^{(1)}\left[\beta_n^m(d_{\rm c})\right] \right) {\rm e}^{{\rm i} m\phi} {\rm d}\phi . \end{equation}$ (N.4)Now, we can write the polar Fourier transform of the the final coronagraphic images using the Ψ,χ functions. $\begin{matrix} F_{p} (4 {(β_{N}^{0} (d_{c})^{)}}^{2} \cdot {| V_{N, 0}^{l_{p}} |}^{2}) = \frac{1}{2} {(β_{N}^{0} (d_{c})^{)}}^{2} χ_{N, 0}^{l_{p}} \\ F_{p} (C_{m} (φ)) = 1 (m = 0) F_{p} (C_{m} (φ)) = 1 / 2 (0 < m \leq n) \\ F_{p} (S_{m} (φ)) = 0 (m = 0) F_{p} (S_{m} (φ)) = i / 2 (0 < m \leq n) \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f2} &&F_{\rm p}\left(4\left( \beta_N^0(d_{\rm c})\right)^2 \cdot \left|V_{N,0}^{l_{\rm p}} \right|^2\right) = \frac{1} {2} \left( \beta_N^0(d_{\rm c})\right)^2 \chi_{N,0}^{l_{\rm p}}\\[1.5mm] && F_{\rm p}\left(C_m(\phi)\right) = 1 \quad (m=0) \qquad F_{\rm p}\left(C_m(\phi)\right) = 1/2 \quad (0<m\leq n)\\[1.5mm] && F_{\rm p}\left(S_m(\phi)\right) = 0 \quad (m=0) \qquad F_{\rm p}\left(S_m(\phi)\right) = i/2 \quad (0<m\leq n)~~~~~~~~~~~ \end{eqnarray}$ – m = 0 $\begin{matrix} Ψ_{meas}^{0} \approx \frac{1}{2} {(β_{N}^{0} (d_{c})^{)}}^{2} χ_{N, 0}^{l_{p}} & + \sum_{n}^{~} A_{c}^{0} (n) χ_{n, 0}^{l_{p}} + \sum_{n}^{~} B_{c}^{0} (n) Ψ_{n, 0}^{l_{p}} \\ Ψ_{meas}^{0} \approx \frac{1}{2} {(β_{N}^{0} (d_{c})^{)}}^{2} χ_{N, 0}^{l_{p}} & + \sum_{n}^{~} ([{(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{0} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{0} (d_{c}))^{)}}^{]} \cdot χ_{n, 0}^{l_{p}}) \\ + \sum_{n}^{~} ([{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{0} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{0} (d_{c}))^{)}}^{]} \cdot Ψ_{n, 0}^{l_{p}}) \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f3} \Psi_{\rm meas}^{0}\approx\frac{1} {2} \left( \beta_N^0(d_{\rm c})\right)^2 \chi_{N,0}^{l_{\rm p}}&&+ \sum_{n}^{\sim} A_{\rm c}^0(n) \chi_{n,0}^{l_{\rm p}} + \sum_{n}^{\sim} B_{\rm c}^0(n) \Psi_{n,0}^{l_{\rm p}}\nonumber \\ \Psi_{\rm meas}^{0}\approx\frac{1} {2} \left( \beta_N^0(d_{\rm c})\right)^2 \chi_{N,0}^{l_{\rm p}}&&+ \sum_{n}^{\sim}\left( \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^0(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^0(d_{\rm c}))\right)\right]\cdot \chi_{n,0}^{l_{\rm p}}\right) \nonumber \\ &&+\sum_{n}^{\sim}\left(\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^0(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^0(d_{\rm c}))\right)\right]\cdot \Psi_{n,0}^{l_{\rm p}}\right) \end{eqnarray}$ (N.8)– 0 < m < n $\begin{matrix} Ψ_{meas}^{m} & \approx \sum_{n} A_{c}^{m} (n) χ_{n,m}^{l_{p}} + \sum_{n} B_{c}^{m} (n) Ψ_{n,m}^{l_{p}} + i \sum_{n} A_{s}^{m} (n) χ_{n,m}^{l_{p}} + i \sum_{n} B_{s}^{m} (n) Ψ_{n,m}^{l_{p}} \\ Ψ_{meas}^{m} & \approx \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c}))^{)}}^{]} \cdot χ_{n,m}^{l_{p}} \\ + \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{m} (d_{c}))^{)}}^{]} \cdot Ψ_{n,m}^{l_{p}} \\ + i \sum_{n} [(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c}))^{)})^{]} \cdot χ_{n,m}^{l_{p}} \\ + i \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{m} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{m} (d_{c}))^{)}}^{]} \cdot Ψ_{n,m}^{l_{p}} \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f4} \Psi_{\rm meas}^{m}\; &&\approx \sum_{n}A^m_{\rm c}(n) \chi_{n,m}^{l_{\rm p}} +\sum_{n}B^m_{\rm c}(n) \Psi_{n,m}^{l_{\rm p}} + {\rm i}\sum_{n}A^m_{\rm s}(n) \chi_{n,m}^{l_{\rm p}} + {\rm i}\sum_{n}B^m_{\rm s}(n) \Psi_{n,m}^{l_{\rm p}}\nonumber \\ \Psi_{\rm meas}^{m}\; &&\approx\sum_{n} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))\right)\right]\cdot \chi_{n,m}^{l_{\rm p}} \nonumber \\ &&+\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^m(d_{\rm c}))\right)\right]\cdot \Psi_{n,m}^{l_{\rm p}}\nonumber\\ &&+{\rm i}\sum_{n} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))\right))\right]\cdot \chi_{n,m}^{l_{\rm p}}\nonumber\\ &&+{\rm i}\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^m(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^m(d_{\rm c}))\right)\right]\cdot \Psi_{n,m}^{l_{\rm p}} \end{eqnarray}$ (N.9)– m = n $\begin{matrix} Ψ_{meas}^{n} & \approx \sum_{n} A_{c}^{n} (n) χ_{n,n}^{l_{p}} + \sum_{n} B_{c}^{n} (n) Ψ_{n,n}^{l_{p}} - i Sign (l_{p}) \sum_{n} A_{s}^{n} (n) χ_{n,n}^{l_{p}} - i Sign (l_{p}) \sum_{n} B_{s}^{n} (n) Ψ_{n,n}^{l_{p}} \\ + \sum_{n} A_{s}^{n} (n) χ_{n,n}^{l_{p}} + \sum_{n} B_{s}^{n} (n) Ψ_{n,n}^{l_{p}} + i Sign (l_{p}) \sum_{n} A_{c}^{n} (n) χ_{n,n}^{l_{p}} + i Sign (l_{p}) \sum_{n} B_{c}^{n} (n) Ψ_{n,n}^{l_{p}} \\ Ψ_{meas}^{n} & \approx \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{n} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{n} (d_{c}))^{)}}^{]} \cdot χ_{n,n}^{l_{p}} \\ + \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{n} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{n} (d_{c}))^{)}}^{]} \cdot Ψ_{n,n}^{l_{p}} \\ + \sum_{n} [(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{n} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{n} (d_{c}))^{)})^{]} \cdot χ_{n,n}^{l_{p}} \\ + \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{n} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{n} (d_{c}))^{)}}^{]} \cdot Ψ_{n,n}^{l_{p}} \\ - i Sign (l_{p}) \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{n} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{n} (d_{c}))^{)}}^{]} \cdot χ_{n,n}^{l_{p}} \\ - i Sign (l_{p}) \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{cn}^{n} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{cn}^{n} (d_{c}))^{)}}^{]} \cdot Ψ_{n,n}^{l_{p}} \\ + i Sign (l_{p}) \sum_{n} [(ℜ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{n} (d_{c})) + ℑ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{n} (d_{c}))^{)})^{]} \cdot χ_{n,n}^{l_{p}} \\ + i Sign (l_{p}) \sum_{n} [{(ℜ (β_{N}^{0} (d_{c})) ℑ (β_{sn}^{n} (d_{c})) - ℑ (β_{N}^{0} (d_{c})) ℜ (β_{sn}^{n} (d_{c}))^{)}}^{]} \cdot Ψ_{n,n}^{l_{p}} . \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f5} \Psi_{\rm meas}^{n}\; &&\approx \sum_{n}A_{\rm c}^n(n) \chi_{n,n}^{l_{\rm p}} +\sum_{n}B_{\rm c}^n(n) \Psi_{n,n}^{l_{\rm p}}-{\rm i}\: {\rm Sign}(l_{\rm p}) \sum_{n}A_{\rm s}^n(n) \chi_{n,n}^{l_{\rm p}} -{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n}B_{\rm s}^n(n) \Psi_{n,n}^{l_{\rm p}} \nonumber \\[1mm] &&\quad +\sum_{n}A_{\rm s}^n(n) \chi_{n,n}^{l_{\rm p}} +\sum_{n}B_{\rm s}^n(n) \Psi_{n,n}^{l_{\rm p}}+{\rm i}\: {\rm Sign} (l_{\rm p}) \sum_{n}A_{\rm c}^n(n) \chi_{n,n}^{l_{\rm p}} +{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n}B_{\rm c}^n(n) \Psi_{n,n}^{l_{\rm p}}\nonumber \\[1mm] \Psi_{\rm meas}^{n}\; &&\approx\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^n(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^n(d_{\rm c}))\right)\right]\cdot \chi_{n,n}^{l_{\rm p}} \nonumber \\[1mm] &&+\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^n(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^n(d_{\rm c}))\right)\right]\cdot \Psi_{n,n}^{l_{\rm p}}\nonumber\\[1mm] &&+\sum_{n} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^n(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^n(d_{\rm c}))\right))\right]\cdot \chi_{n,n}^{l_{\rm p}}\nonumber\\[1mm] &&+\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^n(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^n(d_{\rm c}))\right)\right]\cdot \Psi_{n,n}^{l_{\rm p}}\nonumber\\[1mm] &&-{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^n(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^n(d_{\rm c}))\right)\right]\cdot \chi_{n,n}^{l_{\rm p}} \nonumber \\[1mm] &&-{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm cn}^n(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm cn}^n(d_{\rm c}))\right)\right]\cdot \Psi_{n,n}^{l_{\rm p}}\nonumber\\[1mm] &&+{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n} \left[\left(\Re(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^n(d_{\rm c}))+\Im(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^n(d_{\rm c}))\right))\right]\cdot \chi_{n,n}^{l_{\rm p}}\nonumber\\[1mm] &&+{\rm i}\: {\rm Sign}(l_{\rm p})\sum_{n}\left[\left(\Re(\beta_N^0(d_{\rm c}))\Im(\beta_{\rm sn}^n(d_{\rm c}))-\Im(\beta_N^0(d_{\rm c}))\Re(\beta_{\rm sn}^n(d_{\rm c}))\right)\right]\cdot \Psi_{n,n}^{l_{\rm p}}. \end{eqnarray}$ (N.10)Using the inner products between the $Ψ_{meas}^{n}$ $\hbox{$\Psi_{\rm meas}^{n}$}$ and the $Ψ_{n,m}^{l_{p}} / χ_{n,m}^{l_{p}}$ $\hbox{$\Psi_{n,m}^{l_{\rm p}}/\chi_{n,m}^{l_{\rm p}}$}$ functions, it is now possible to retrieve coefficients of the linear system of equation as follows: – m = 0 $\begin{matrix} (Ψ_{meas}^{0}, χ_{n^{'}, 0}^{l_{p}}) = \frac{1}{2} {(β_{N}^{0} (d_{c})^{)}}^{2} (χ_{N, 0}^{l_{p}}, χ_{n^{'}, 0}^{l_{p}}) + \sum_{n}^{~} A_{c}^{0} (n) (χ_{n, 0}^{l_{p}}, χ_{n^{'}, 0}^{l_{p}}) \\ (Ψ_{meas}^{0}, Ψ_{n^{'}, 0}^{l_{p}}) = \sum_{n}^{~} B_{c}^{0} (n) (Ψ_{n, 0}^{l_{p}}, Ψ_{n^{'}, 0}^{l_{p}}) \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f6a} &&\left(\Psi_{\rm meas}^{0}\:,\:\chi_{n',0}^{l_{\rm p}}\right)=\frac{1} {2} \left( \beta_N^0(d_{\rm c})\right)^2 \left(\chi_{N,0}^{l_{\rm p}}\:,\:\chi_{n',0}^{l_{\rm p}}\right) + \sum_{n}^{\sim} A_{\rm c}^0(n) \left(\chi_{n,0}^{l_{\rm p}}\:,\:\chi_{n',0}^{l_{\rm p}}\right)~~~~~~~~~~~~~~~\\ &&\left(\Psi_{\rm meas}^{0}\:,\:\Psi_{n',0}^{l_{\rm p}}\right)=\sum_{n}^{\sim} B_{\rm c}^0(n) \left(\Psi_{n,0}^{l_{\rm p}}\:,\:\Psi_{n',0}^{l_{\rm p}}\right) \end{eqnarray}$ – 0 < m < n $\begin{matrix} (Ψ_{meas}^{m}, χ_{n^{'},m}^{l_{p}}) = \sum_{n} (A_{c}^{m} (n) + i A_{s}^{m} (n)) (χ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) \\ (Ψ_{meas}^{m}, Ψ_{n^{'},m}^{l_{p}}) = \sum_{n} (B_{c}^{m} (n) + i B_{s}^{m} (n)) (Ψ_{n,m}^{l_{p}}, Ψ_{n^{'},m}^{l_{p}}) \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f6b} &&\left(\Psi_{\rm meas}^{m}\:,\:\chi_{n',m}^{l_{\rm p}}\right)=\sum_{n} \left(A^m_{\rm c}(n)+{\rm i}A^m_{\rm s}(n) \right) \left(\chi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right)\\ &&\left(\Psi_{\rm meas}^{m}\:,\:\Psi_{n',m}^{l_{\rm p}}\right)=\sum_{n} \left(B^m_{\rm c}(n)+{\rm i}B^m_{\rm s}(n) \right) \left(\Psi_{n,m}^{l_{\rm p}}\:,\: \Psi_{n',m}^{l_{\rm p}}\right) \end{eqnarray}$ – m = n $\begin{matrix} (Ψ_{meas}^{n}, χ_{n^{'},n}^{l_{p}}) = \sum_{n} ([A_{c}^{n} (n) + A_{s}^{n} (n)] + i Sign (l_{p}) {[A_{c}^{n} (n) - A_{s}^{n} (n)]}^{)} (χ_{n,n}^{l_{p}}, χ_{n^{'},n}^{l_{p}}) \\ (Ψ_{meas}^{n}, Ψ_{n^{'},n}^{l_{p}}) = \sum_{n} ([B_{c}^{n} (n) + B_{s}^{n} (n)] + i Sign (l_{p}) {[B_{c}^{n} (n) - B_{s}^{n} (n)]}^{)} (Ψ_{n,n}^{l_{p}}, Ψ_{n^{'},n}^{l_{p}}) \\ G . u [β_{n}^{m}] = r u [{β_{n}^{0}}^{]} = [A_{c}^{0} (n), B_{c}^{0} (n)] u [β_{n}^{m}] = [A_{c}^{m} (n), A_{s}^{m} (n), B_{c}^{m} (n), B_{s}^{m} (n)], u [β_{n}^{n}] = [A_{c}^{n} (n), A_{s}^{n} (n), B_{c}^{n} (n), B_{s}^{n} (n)] \\ G_{n, n^{'}}^{m, l_{p}} (Ψ^{)} = (Ψ_{n,m}^{l_{p}}, Ψ_{n^{'},m}^{l_{p}}) G_{n, n^{'}}^{m, l_{p}} (χ) = (χ_{n,m}^{l_{p}}, χ_{n^{'},m}^{l_{p}}) \\ r_{n^{'}}^{0, l_{p}} (Ψ^{)} = (Ψ_{meas}^{0}, Ψ_{n^{'}, 0}^{l_{p}}) r_{n^{'}}^{0, l_{p}} (χ) = (Ψ_{meas}^{0}, χ_{n^{'}, 0}^{l_{p}}) (m = 0) \\ r_{n^{'}}^{m, l_{p}} (Ψ^{)} = (Ψ_{meas}^{m}, Ψ_{n^{'},m}^{l_{p}}) r_{n^{'}}^{m, l_{p}} (χ) = (Ψ_{meas}^{m}, χ_{n^{'},m}^{l_{p}}) (0 < m < n) \\ r_{n^{'}}^{n, l_{p}} (Ψ^{)} = (Ψ_{meas}^{n}, Ψ_{n^{'},n}^{l_{p}}) r_{n^{'}}^{n, l_{p}} (χ) = (Ψ_{meas}^{n}, χ_{n^{'},n}^{l_{p}}) (m = n) . \end{matrix}$ $\appendix \setcounter{section}{14} \begin{eqnarray} \label{f6c} &&\left(\Psi_{\rm meas}^{n},\:\chi_{n',n}^{l_{\rm p}}\right)=\sum_{n} \left(\left[A^n_{\rm c}(n)+A^n_{\rm s}(n)\right]+{\rm i}\: {\rm Sign}(l_{\rm p})\left[A^n_{\rm c}(n)-A^n_{\rm s}(n)\right] \right) \left(\chi_{n,n}^{l_{\rm p}}\:,\:\chi_{n',n}^{l_{\rm p}}\right)\\[1mm] &&\left(\Psi_{\rm meas}^{n},\:\Psi_{n',n}^{l_{\rm p}}\right)=\sum_{n} \left(\left[B^n_{\rm c}(n)+B^n_{\rm s}(n)\right]+{\rm i}\: {\rm Sign}(l_{\rm p})\left[B^n_{\rm c}(n)-B^n_{\rm s}(n)\right] \right) \left(\Psi_{n,n}^{l_{\rm p}}\:,\: \Psi_{n',n}^{l_{\rm p}}\right) \\[1mm] \label{lse1bis} && G.u\left[\beta_n^m\right]=r \qquad \quad u\left[\beta_n^0\right] = [A_{\rm c}^0(n), B_{\rm c}^0(n)] \qquad u\left[\beta_n^m\right]=[A^m_{\rm c}(n), A^m_{\rm s}(n), B^m_{\rm c}(n), B^m_{\rm s}(n)],\qquad u\left[\beta_n^n\right]=[A^n_{\rm c}(n), A^n_{\rm s}(n), B^n_{\rm c}(n), B^n_{\rm s}(n)]\nonumber\\[1mm] && G_{n,n'}^{m,l_{\rm p}} \left(\Psi\right) = \left(\Psi_{n,m}^{l_{\rm p}}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \qquad G_{n,n'}^{m,l_{\rm p}} \left(\chi\right) = \left(\chi_{n,m}^{l_{\rm p}}\:,\:\chi_{n',m}^{l_{\rm p}}\right) \\[1mm] \label{lse2bis1} &&r_{n'}^{0,l_{\rm p}}\left(\Psi\right) = \left(\Psi_{\rm meas}^{0}\:,\:\Psi_{n',0}^{l_{\rm p}}\right) \qquad r_{n'}^{0,l_{\rm p}}\left(\chi\right) = \left(\Psi_{\rm meas}^{0}\:,\:\chi_{n',0}^{l_{\rm p}}\right) \quad (m=0)\nonumber \\[1mm] &&r_{n'}^{m,l_{\rm p}}\left(\Psi\right) = \left(\Psi_{\rm meas}^{m}\:,\:\Psi_{n',m}^{l_{\rm p}}\right) \qquad r_{n'}^{m,l_{\rm p}}\left(\chi\right) = \left(\Psi_{\rm meas}^{m}\:,\:\chi_{n',m}^{l_{\rm p}}\right) \quad (0<m<n)\nonumber\\[1mm] &&r_{n'}^{n,l_{\rm p}}\left(\Psi\right) = \left(\Psi_{\rm meas}^{n}\:,\:\Psi_{n',n}^{l_{\rm p}}\right) \qquad r_{n'}^{n,l_{\rm p}}\left(\chi\right) = \left(\Psi_{\rm meas}^{n}\:,\:\chi_{n',n}^{l_{\rm p}}\right) \quad (m=n) \: .\nonumber \end{eqnarray}$

Appendix O: Scattering function

The physical theory of scattering by small particles is the Mie theory (Bohren & Huffman 1983), but this mathematical development is too complicated to be used with spherical Bessel and infinite sum of Legendre polynomials. The Henyey-Greenstein model (Henyey & Greenstein 1941) is a useful approximation to describe the angular distribution of light scattered by small particles. At the beginning, this model was applied to scattering by interstellar dust clouds in the Milky-Way. It is now used in numerous situations, ranging from the scattering of light in biological tissues to scattering caused by bulk material in an optical system.

In this model, the angular distribution of scattered light is given by $S c (θ) = \frac{1}{4 π} \frac{1 - g^{2}}{{[1 + g^{2} - 2 g . \cos (θ)]}^{3 / 2}} \int_{0}^{2 π} \int_{0}^{π} S c (θ) \sin (θ) d θ d φ = 1.$ $\appendix \setcounter{section}{15} \begin{equation} \label{scat1} S{\rm c}(\theta)=\frac{1} {4\pi} \frac{1-g^2} {\left[1+g^2-2g.\cos(\theta)\right]^{3/2}} \quad \int_0^{2\pi}\int_0^{\pi} S{\rm c}(\theta) \sin(\theta) {\rm d}\theta {\rm d}\phi=1 .\nonumber \end{equation}$ The Henyey-Greenstein phase function Sc(θ) is governed by the variation of only one parameter, , and ranges from backscattering through isotropic scattering to forward scattering. The θ and φ variables are the classical angular coordinates. Forward scattering is θ = 0, while for back-scattering, θ = π. We see that the ratio of forward-to-back scattering is [(1 + g)/(1 − g)] ³.

For g > 0, forward scattering is dominant, while for g < 0, backscattering predominates. To use the H-G function for Monte Carlo models, we need the accumulated distribution: $\begin{matrix} P (\cos (θ)) = \int_{-1}^{\cos (θ)} \frac{(1 - g^{2}) d (\cos (θ))}{{[1 + g^{2} - 2 g . \cos (θ)]}^{3 / 2}} \\ P (\cos (θ)) = \frac{1 - g^{2}}{2 g} ({(1 + g^{2} - 2 g . \cos (θ)^{)}}^{- 1 / 2} - (1 + g)^{-1}) \cos (θ) = \frac{1}{2 g} (1 + g^{2} - {(\frac{1 - g^{2}}{1 + g (2 P - 1)})}^{2}) \cdot \end{matrix}$ $\appendix \setcounter{section}{15} \begin{eqnarray} \label{scat2} &&P(\cos(\theta))=\int_{-1}^{\cos(\theta)} \frac{(1-g^2) {\rm d}(\cos(\theta))}{\left[1+g^2-2g.\cos(\theta)\right]^{3/2}}\\ &&P(\cos(\theta))=\frac{1-g^2}{2g} \left( \left(1+g^2-2g.\cos(\theta)\right)^{-1/2}-(1+g)^{-1}\right) \qquad \cos(\theta)=\frac{1}{2g} \left(1+g^2-\left( \frac{1-g^2} {1+g(2P-1)}\right)^2 \right)\cdot \nonumber \end{eqnarray}$ (O.1)The Zemax software uses this approximation for scattering calculation, and we obtain

Fig. O.1

Non-sequential Zemax analysis of the proposed optical implementation after the Lyot stop to split the two circular polarizations. Left: the polychromatic ray tracing (650–900 nm) for one polarization, and right: the detector view of no-coherent intensity distribution in logarithmic scale. With these classical materials (MgF₂, quartz, infrasil) the maximum scattered light is 10^-6 of the residual coronagraphic intensity. In this simulation, we chose an anti-reflexion coating of 0.5% per surface.

Appendix P: Polychromatic simulations and speckles smearing

We present in this section a complete set of numerical simulations obtained in wide bandpass with an imperfect vortex phase-mask. The inputs of these simulations were given in Sect. 7.2:

The input Strehl ratio is 95% @ 650 nm with the first 860 Zernikes (40 complete modes).
An imperfect phase-mask: where the s-transmittance is 97%, the p-transmittance is 98% and the local phase retardance is π ± Δφ with a quadratic law for the phase error of the mask.
Common and non-common path error are λ/71 rms @ 650 nm (the total is λ/50 rms at 650 nm).
Polishing error: DSP f^-2 with a shift of 8.1% for non-common path error.
Polychromatic speckle smearing due to residual Wollaston chromatism given by the Zemax model.
Photon noise, readout noise (6 e⁻), full-well capacity of 10⁵e⁻, residual flat of 1% rms.

P.1. Telescope pupil and coronagraphic pupil phase screens

We present the entrance pupil for a perfect circular telescope (off-axis configuration) obtained by the direct summation of the first 860 Zernikes, and also the two phase-screens using a density spectral power function (f^-2) where f is the spatial frequency in the Fourier domain.

Fig. P.1

Left to right: the telescope entrance pupil phase obtained with the first 40 Zernike modes. The Strehl ratio is 95% @ 650 nm. In the center, the phase screen that simulates high-frequency polishing defaults for the l_p = 2 beam. Right, the phase screen for the l_p = −2 beam. These two last phase screens are the superposition of two different random simulations (common and non-common path errors). The last case (l_p = −2) is simulated with a shift of the second phase screen (8.1% in the y-axis with respect to the pupil diameter). The sigma of the two phase error screens for l_p = 2 and l_p = −2 are chosen equal to λ/50 rms @ 650 nm.

Fig. P.2

Set of numerical coronagraphic simulations with a maximum phase-shift (Δφ_max) on the vortex mask of 0.001 radian for each bandpass. We chose five different bandpasses: Δλ = 2 nm (corresponding to a single-mode Laser diode), 100 nm, 150 nm (R-filter), 200 nm, and finally 250 nm (R to I astronomical filters). These simulations show the anti-symmetric smearing (bottom: l_p = −2, top: l_p = 2 images) due to the Wollaston optical component and also a high-contrast in the residual speckle due to the wavelength filtering by the coronagraphic device. Indeed, only wavelengths with larger phase-shift errors on the phase-mask (λ_max,λ_min) have a significant effect on the final coronagraphic images. We recall that for λ_max the effect is less than λ_min due to a better Strehl ratio.

Fig. P.3

Same coronagraphic simulations with a maximum phase-shift of 0.01 radian.

Fig. P.4

Same coronagraphic simulations with a maximum phase-shift of 0.1 radian. A coronagraphic strong peak residual appears in the center of the image (β_0,0 ≠ 0).

P.2. Phase error on the vortex coronagraph

The π phase-shift of the vortex coronagraph is generally obtained by index material engineering as engraving a ramp in a material for scalar vortex or in liquid crystal polymer (LCP) using birefringent optical properties, see Mawet et al. (2009) to generate the proper phase helix. The refractive index of the material as a function of the wavelength follows a Sellmeiler law (λ²) (also for the LCP material). This property generates a π phase-shift with a quadratic dependance. In these conditions, the phase-shift error increases strongly at the edges of the chosen wavelength bandpass. To perform our numerical simulations, we chose the following phase-shift wavelength dependence: $Δ φ = Δ φ_{\max} \frac{4 (λ - λ_{0})^{2}}{(λ_{\max} - λ_{\min})^{2}} (λ < λ_{0}) Δ φ = - Δ φ_{\max} \frac{4 (λ - λ_{0})^{2}}{(λ_{\max} - λ_{\min})^{2}} (λ > λ_{0}),$ $\appendix \setcounter{section}{16} \begin{equation} \label{phase_m_shift} \Delta \phi= \Delta \phi_{\rm max} \frac{4(\lambda-\lambda_0)^2}{(\lambda_{\rm max}-\lambda_{\min})^2} \quad (\lambda<\lambda_0) \qquad \Delta \phi= -\Delta \phi_{\rm max} \frac{4(\lambda-\lambda_0)^2}{(\lambda_{\rm max}-\lambda_{\min})^2} \quad (\lambda>\lambda_0) , \end{equation}$ (P.1)where Δφ_max is the maximum phase-shift error in radian (0.001,0.01,0.1), λ₀ is the centering mask wavelength (where the phase-shift is π) and finally λ_max,λ_min corresponds to the wavelength bandpass edges.

P.3. Wollaston residual dispersion

Our optical implementation presents an artificial polychromatic speckle smearing in the x-axis due to the Wollaston residual dispersion. If we use the Sellmeiler law for the refractive index of the MgF₂ or the position of the spot diagram in the Zemax software, we can know the shift of the PSF as a function of the wavelength on the two coronagraphic images. The position of the PSF center in λ/d units is given with very good approximation ± 0.2 μm by the following function: $Δ x_{psf} = Sign (l_{p}) (0.5978 λ^{-1.85} - {1.3298}^{)} .$ $\appendix \setcounter{section}{16} \begin{equation} \label{eqsmear} \Delta x_{\rm psf}={\rm Sign}(l_{\rm p}) \left( 0.5978 \lambda^{-1.85}-1.3298 \right) . \end{equation}$ (P.2)The position of the PSF is completely anti-symmetric in the ± 0.01λ/d level with respect to the sign of the vortex charge l_p.

P.4. Coronagraphic vortex numerical simulations set

The Fourier simulation was performed on 2K × 2K arrays with an entrance perfect circular pupil of 255 pixels diameter. The chromatism was taken into account by changing the diameter of the entrance pupil with one pixel steps that correspond to ≈ 7 nm bandpass steps. For the smaller bandpass (2 nm – the Laser diode), the simulation was performed with 4K × 4K arrays and a circular pupil of 511 pixels diameter. The entrance pupil phase was scaled and a vortex coronagraphic mask was created for each wavelength with the proper s and p transmittance and the mask phase-shift error. The diaphragm was chosen to be just smaller the pupil diameter (98%, 2 pixels apart) and scaled with respect the wavelength. We chose this non-optimized configuration (see Appendix K) to use the β_n,m coefficients without a scaling factor. A phase screen error was added to each coronagraphic pupil (l_p = ± 2) to take the common on the non-common path error of the optical train (Wollaston and achromatic doublet) into account. This phase screen includes the shift of the two beams due to the polarization separation (non-common path error). The maximum shift between the two beams is 8.1% at the output of the final imaging lens doublet, this is the value adopted in our Fourier numerical simulations. Finally, we used Eq. (P.2) to produce the Wollaston chromatic smearing in the coronagraphic image plane by adding a tip in the coronagraphic pupil. These simulations complete the previous results presented in Sect. 7.2, and we show here some images obtained for three maximum phase-shift errors of Δφ_max = 0.001,0.01,0.1 radian and five different wavelength bandpasses Δλ = 2,100,150,200,250 nm, also including photon noise with 80 000 e⁻ on the maximum of coronagraphic image (80% of the full-well capacity), readout noise (6e⁻), and flat noise (1% rms).

Appendix Q: FFT vs. NZ for a “sin” ripple case

Fig. Q.1

$β_{n}^{m}$ $\hbox{$\beta_n^m$}$ modal decomposition for the sin function applied as wavefront error (both amplitude and phase). Up to down, we show the first 180 $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients for the frequencies f_x = 2,4,8.

We present in this appendix a complete set of numerical simulations obtained for a perfect case with a simple “sin” function in the entrance pupil plane. A sin modulation is proposed for both phase and amplitude. The pupil becomes $Π = Π (0, 1) [1 - A_{a} / 2 + A_{a} \sin (2 π f_{x} x) / 2 + i A_{Φ} \sin (2 π f_{x} x) / 2] A_{Φ} = 0.1 λ_{0} A_{a} = 0.01 f_{x} = 2, 4, 8 λ_{0} = 650 nm,$ $\appendix \setcounter{section}{17} \begin{equation} \label{sinx} \Pi=\Pi(0,1) \left[1 - A_a/2 + A_a \sin(2\pi f_x x)/2+ {\rm i} A_{\Phi} \sin(2\pi f_x x)/2 \right] \qquad A_{\Phi}=0.1\lambda_0 \quad A_a=0.01 \quad f_x=2,4,8 \quad \lambda_0=650~{\rm nm}, \end{equation}$ (Q.1)where A_Φ, A_a are the coefficients of the phase and amplitude errors applied in the pupil respectively, and f_x is the spatial frequency of the sin function. The direct $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ decomposition of the three sin modulations is presented in the following figure: the $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ modal decomposition is not well-suited for a “simple” sin function as wavefront error. Indeed, the term “simple” can be applied for the classical Fourier transform in cartesian coordinates (only one frequency in the Fourier plane), but not in the Zernike polynomials basis, where it is necessary to use many $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients. We propose to compare the FFT propagation with VVC and the NZ modal decomposition for these three spatial frequencies.

Fig. Q.2

Set of numerical monochromatic coronagraphic simulations with a vortex mask error of 0.001 radian. The input sin frequency is chosen as f_x = 2 (low-order wavefront error). Each graphic is organized as follows: Left to right: l_p = + 2 coronagraphic image, l_p = −2 coronagraphic image, difference between the two previous images with an amplification of intensity level of 100. Upper: pure amplitude wavefront error (A_a = 0.01), middle: pure phase error (A_Φ = 0.1λ₀), and bottom: both errors, amplitude and phase. The first nine sets of figures left is for the FFT propagation, whereas the nine sets of figures right is for direct $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ modal decomposition. The brightness scales are linear and are the same for the FFT and the NZ images sets. We notice that the amplitude modulation effects on the final coronagraphic images (three upper images) are smaller than the phase modulation error and the brightness scales for the amplitude set of simulations are also smaller to enhance the image contrast. Indeed, for the amplitude error, we chose 10^-7 ≤ I ≤ 10^-5 and for the phase error 10^-5 ≤ I ≤ 2.5 × 10^-4. The direct comparison between the FFT and NZ shows a very significant similarity in the direct images l_p = ± 2 and after substraction of the two images.

Fig. Q.3

Two sets of numerical simulations FFT vs. NZ with a sinus frequency of f_x = 4.

Fig. Q.4

Two sets of numerical simulations FFT vs. NZ with a sinus frequency of f_x = 8 (high-order wavefront error).

All Tables

Table 1

NZ accuracy metrics (see Appendix G).

In the text

Table 2

Maximum coronagraphic nulling ratio and wavelength bandpass achievable for various levels of input Strehl ratio.

In the text

Table B.1

Values of some P(m)/Q(m) polynomials.

In the text

All Figures

Fig. 1

Schematic view of the NZ vortex phase-retrieval process. It consists of using the real input coronagraphic images, corresponding to l_p = 0, ± 2 (the l_p = ± 2 images are provided by polarization splitting, while the unpolarized image (l_p = 0) is directly given by the sum of the two polarized images) projected on aberrations templates, and finding the unknown coefficients by resolving a system of linear equations. Left: the three images needed for the full aberration analysis. Center: linear systems to retrieve the $β_{N}^{0} (d_{c})$ $\hbox{$\beta_N^0(d_{\rm c})$}$ and the f⁽¹⁾ terms. Right: the predictor-corrector approach to evaluate the quadratic correction $f^{(2)}^{[} (β_{cn}^{m} (d_{c}))^{2}, (β_{sn}^{m} (d_{c}))^{2}, (β_{cn}^{m} (d_{c}) \cdot β_{sn}^{m} (d_{c}))^{]}$ $\hbox{$f^{(2)}\left[(\beta_{\rm cn}^m(d_{\rm c}))^2,(\beta_{\rm sn}^m(d_{\rm c}))^2,(\beta_{\rm cn}^m(d_{\rm c})\cdot\beta_{\rm sn}^m(d_{\rm c}))\right]$}$ .

In the text

Fig. 2

Polarization splitter analysis: possible implementation of the polarization splitter system used after the Lyot stop. An achromatic quarter-wave plate (quartz/MgF₂) and a Wollaston in MgF₂ combined here with a simple achromatic doublet. This simple optical implementation allows us to image the two residual coronagraphic images in the left and right circular polarization basis on the detector. The two images presented in this figure in logarithmic gray scale show the two polychromatic PSF (650–900 nm) obtained by this simple scheme for the two circular polarization images. Note that the PSF shows a small chromatic smearing residual on the x-axis (Strehl = 96.5%) due to the substantial wavelength bandpass. This polychromatic smearing is detailed in Appendix P.

In the text

Fig. 3

Numerical simulation illustrating the principle of Nijboer-Zernike retrieval applied on the vortex coronagraph. (Up to down), numerical simulation for each circular polarization with l_p = 2 and l_p = −2 respectively using the first 860 Zernike polynomials (40 complete modes) and a Lyot stop of 99%. The Strehl ratio of the PSF before coronagraphic filtering is 95%. The Lyot stop remove the strong diffraction value in ρ = 1 but allows us to show all images without scaling the $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ coefficients ( $β_{n}^{m} \to β_{n}^{m} (d_{c})$ $\hbox{$\beta_n^m \rightarrow \beta_n^m(d_{\rm c})$}$ ). (Left to right), final monochromatic (λ = 650 nm) coronagraphic image obtained with the sum of $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ analytical functions, the FFT of the direct sum of Zernike polynomials $(\sum β_{n}^{m} \cdot Z_{n}^{m})$ $\hbox{$(\sum \beta^m_n \cdot Z_n^m)$}$ in the pupil plane, and finally, the classical phase function obtained by $e^{i \sum α_{n}^{m} \cdot Z_{n}^{m}}$ $\hbox{${\rm e}^{{\rm i}\sum \alpha^m_n \cdot Z_n^m}$}$ . Due to the difference of basis, the final simulation with the classical phase function presents minor discrepancies in the speckle background (I_c < 10^-7, I_c is the coronagraphic final intensity). The image scales are not linear $(I_{c}^{1 / 4})$ $\hbox{$(I_{\rm c}^{1/4})$}$ .

In the text

Fig. 4

Polychromatic coronagraphic simulation for l_p = 2 with Δλ = 250 nm of spectral bandpass and a phase error of Δφ = ± 0.001 radian for the vortex mask. We also include defects of our optical implementation. Left: without the speckle smearing. Right: with the Wollaston speckle smearing. The chromatic effect on the speckles is small in the two cases, the contrast remains high. The brightness scale is the same between the two images and is not linear.

In the text

Fig. 5

$χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values as a function of the wavelength bandpass and a phase-mask with a 10^-1/10^-2/10^-3 radian of phase-shift error. The black lines correspond to $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ for the in: $β_{n}^{m} - \to out$ $\hbox{$\beta^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval, the gray lines are for the in: $α_{n}^{m} - \to out$ $\hbox{$\alpha^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval: the solid lines are for the “p” modes and the dashed lines are for the “m” modes. The x and y scales are common between the three images. The $χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values for of 10^-2 rad for the mask error are poorer than in the previous simulations. The 10^-1 rad case presents significant limitations because of a strong peak in the image center (β_0,0 = 0.04). The dotted line with $Err (σ_{A}) = 10^{-10} / χ_{l}^{2} = 0.1$ $\hbox{$Err(\sigma_A)=10^{-10}/\chi^2_l=0.1$}$ and $Err (σ_{A}) = 10^{-9} / χ_{l}^{2} = 1$ $\hbox{$Err(\sigma_A)=10^{-9}/\chi^2_l=1$}$ shows the chromatic limit of a desired retrieval precision in the “AO metric”.

In the text

Fig. 6

$χ_{l}^{2}$ $\hbox{$\chi^2_l$}$ values as a function of the signal-to-noise ratio in the input coronagraphic images for a in: $β_{n}^{m} - \to out$ $\hbox{$\beta^m_n \longrightarrow out$}$ : $β_{n}^{m}$ $\hbox{$\beta^m_n$}$ retrieval. We present the effect of the modal decomposition quality with respect to the phase-shift error of the mask. Several realistic cases are presented with 1/10/100/1000 exposures in the final image. A good wavefront retrieval is given after an average image of at least 100 exposures.

In the text

	Fig. D.1 Input classical Zernike $Z_{n}^{m}$ $\hbox{$Z_n^m$}$ .
In the text

	Fig. D.2 $ℜ (ζ_{n,m}^{l_{p}} (ρ,θ))$ $\hbox{$\Re\left(\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)\right)$}$ function in the coronagraphic pupil.
In the text

	Fig. D.3 $ℑ (ζ_{n,m}^{l_{p}} (ρ,θ))$ $\hbox{$\Im\left(\zeta_{n,m}^{l_{\rm p}}(\rho,\theta)\right)$}$ function in the coronagraphic pupil.
In the text

	Fig. D.4 U_v(r,φ),l_p = ± 2 functions in the coronagraphic image plane for the 10 Zernike polynomials (Z₂ − Z₁₁). Top: l_p = + 2, Bottom: l_p = −2. The vertical solid line separates the real part from the imaginary part of the two U_v(r,φ) functions.
In the text

	Fig. G.1 Comparison between Fourier simulations for each Zernike polynomial (2-860) and $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode in terms of χ². Left: the direct summation in the pupil (β_n,m), right: the classical phase function (e^iα_n,m).
In the text

	Fig. G.2 Comparison between Fourier simulations for each Zernike polynomial (2-860) and $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ mode in terms of Err(σ_A). Left: the direct summation in the pupil (β_n,m), right: the classical phase function (e^iα_n,m).
In the text

	Fig. H.1 $ℜ (ζ_{n,m}^{l_{p},F} (ρ,θ))$ $\hbox{$\Re\left(\zeta_{n,m}^{l_{\rm p},F}(\rho,\theta)\right)$}$ function in the coronagraphic pupil with an extreme case N = 4 to highlight differences on Zernike polynomials. In the real case N > 1000, and only the $ζ_{0, 0}^{l_{p},F}$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}$}$ must be taken into account in the NZ theory.
In the text

	Fig. H.2 $ℑ (ζ_{n,m}^{l_{p},F} (ρ,θ))$ $\hbox{$\Im\left(\zeta_{n,m}^{l_{\rm p},F}(\rho,\theta)\right)$}$ function in the coronagraphic pupil with an extreme case N = 4 to highlight significant differences on Zernike polynomials.
In the text

	Fig. K.1 Variation of the residual energy (dE_p/dx) in the residual optical aberrations as a function of the number of Zernike polynomials used (nz – without the piston term) and the power spectral density 1/n^pw. The figure shows two diaphragm values: when the gain is maximum and when it is equal to zero. The figure in 1/n presents strong residual oscillations owing to low-order aberrations.
In the text

	Fig. K.2 Fresnel diffraction of the Lyot stop with an optical vortex coronagraph. We present the real and the imaginary parts of the $ζ_{0, 0}^{l_{p},F} (ρ^{'})$ $\hbox{$\zeta_{0,0}^{l_{\rm p},F}( \rho')$}$ function for four different Fresnel number N. The last figure shows the residual intensity in the coronagraphic pupil plane when we choose a d_c radius of 86% (vertical dotted line).
In the text

	Fig. L.1 Spider Fresnel diffraction calculation for a VLT coronagraphic pupil (ϵ = 0.14) and the diameter of the coronagraphic pupil of 18 mm. The figure shows a comparison between the classical Fresnel diffraction (l_p = 0) and the optical propagation including the vortex (l_p = 2). The thickness of the spider is 225 μm.
In the text

Fig. O.1

Non-sequential Zemax analysis of the proposed optical implementation after the Lyot stop to split the two circular polarizations. Left: the polychromatic ray tracing (650–900 nm) for one polarization, and right: the detector view of no-coherent intensity distribution in logarithmic scale. With these classical materials (MgF₂, quartz, infrasil) the maximum scattered light is 10^-6 of the residual coronagraphic intensity. In this simulation, we chose an anti-reflexion coating of 0.5% per surface.

In the text

Fig. P.1

Left to right: the telescope entrance pupil phase obtained with the first 40 Zernike modes. The Strehl ratio is 95% @ 650 nm. In the center, the phase screen that simulates high-frequency polishing defaults for the l_p = 2 beam. Right, the phase screen for the l_p = −2 beam. These two last phase screens are the superposition of two different random simulations (common and non-common path errors). The last case (l_p = −2) is simulated with a shift of the second phase screen (8.1% in the y-axis with respect to the pupil diameter). The sigma of the two phase error screens for l_p = 2 and l_p = −2 are chosen equal to λ/50 rms @ 650 nm.

In the text

Fig. P.2

Set of numerical coronagraphic simulations with a maximum phase-shift (Δφ_max) on the vortex mask of 0.001 radian for each bandpass. We chose five different bandpasses: Δλ = 2 nm (corresponding to a single-mode Laser diode), 100 nm, 150 nm (R-filter), 200 nm, and finally 250 nm (R to I astronomical filters). These simulations show the anti-symmetric smearing (bottom: l_p = −2, top: l_p = 2 images) due to the Wollaston optical component and also a high-contrast in the residual speckle due to the wavelength filtering by the coronagraphic device. Indeed, only wavelengths with larger phase-shift errors on the phase-mask (λ_max,λ_min) have a significant effect on the final coronagraphic images. We recall that for λ_max the effect is less than λ_min due to a better Strehl ratio.

In the text

	Fig. P.3 Same coronagraphic simulations with a maximum phase-shift of 0.01 radian.
In the text

	Fig. P.4 Same coronagraphic simulations with a maximum phase-shift of 0.1 radian. A coronagraphic strong peak residual appears in the center of the image (β_0,0 ≠ 0).
In the text

	Fig. Q.1 $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ modal decomposition for the sin function applied as wavefront error (both amplitude and phase). Up to down, we show the first 180 $β_{n}^{m}$ $\hbox{$\beta_n^m$}$ coefficients for the frequencies f_x = 2,4,8.
In the text

Fig. Q.2

Set of numerical monochromatic coronagraphic simulations with a vortex mask error of 0.001 radian. The input sin frequency is chosen as f_x = 2 (low-order wavefront error). Each graphic is organized as follows: Left to right: l_p = + 2 coronagraphic image, l_p = −2 coronagraphic image, difference between the two previous images with an amplification of intensity level of 100. Upper: pure amplitude wavefront error (A_a = 0.01), middle: pure phase error (A_Φ = 0.1λ₀), and bottom: both errors, amplitude and phase. The first nine sets of figures left is for the FFT propagation, whereas the nine sets of figures right is for direct $V_{n,m}^{l_{p}}$ $\hbox{$V_{n,m}^{l_{\rm p}}$}$ modal decomposition. The brightness scales are linear and are the same for the FFT and the NZ images sets. We notice that the amplitude modulation effects on the final coronagraphic images (three upper images) are smaller than the phase modulation error and the brightness scales for the amplitude set of simulations are also smaller to enhance the image contrast. Indeed, for the amplitude error, we chose 10^-7 ≤ I ≤ 10^-5 and for the phase error 10^-5 ≤ I ≤ 2.5 × 10^-4. The direct comparison between the FFT and NZ shows a very significant similarity in the direct images l_p = ± 2 and after substraction of the two images.

In the text

	Fig. Q.3 Two sets of numerical simulations FFT vs. NZ with a sinus frequency of f_x = 4.
In the text

	Fig. Q.4 Two sets of numerical simulations FFT vs. NZ with a sinus frequency of f_x = 8 (high-order wavefront error).
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Allen, L., Courtial, J., & Padgett, M. 1999, Phys. Rev. E, 60, 74977503 [CrossRef] [Google Scholar]

[2] Blanc, A., Fusco, T., Hartung, M., Mugnier, L. M., & Rousset, G. 2003, A&A, 399, 373 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[3] Bohren, C. F., & Huffman, D. R. 1983, Absorption and scattering of light by small particles (New York: Wiley) [Google Scholar]

[4] Bordé, P. J., & Traub, W. A. 2006, ApJ, 638, 488 [NASA ADS] [CrossRef] [Google Scholar]

[5] Born, M., & Wolf, E. 1999, Principles of Optics (Cambridge University Press) [Google Scholar]

[6] Give’on, A., Kern, B., Shaklan, S., Moody, D. C., & Pueyo, L. 2007, SPIE Conf. Ser., 6691, 66910A [Google Scholar]

[7] Gonsalves, R. A. 1982, Opt. Eng., 21, 829 [Google Scholar]

[8] Gradshteyn, I. S., & Ryzhik, I. M. 1994, Table of Integrals, Series and Products (Academic Press Inc), 1762 [Google Scholar]

[9] Henyey, L., & Greenstein, J. 1941, ApJ, 93, 70 [NASA ADS] [CrossRef] [Google Scholar]

[10] Janssen, A. 2002, JOSA A, 19, 849 [NASA ADS] [CrossRef] [Google Scholar]

[11] Krist, J. E., Belikov, R., Pueyo, L., et al. 2011, in SPIE Conf. Ser., 8151, 81510E [Google Scholar]

[12] Magette, A. 2010, Ph.D. Thesis: The International Liquid Mirror Telescope: optical testing and alignment using a Nijboer-Zernike aberration retrieval approch, IAGL, University of Liège, 1 [Google Scholar]

[13] Mahajan, V. N. 1981a, J. Opt. Soc. Am., 71, 75 [Google Scholar]

[14] Mahajan, V. N. 1981b, J. Opt. Soc. Am., 71, 1408 [Google Scholar]

[15] Mawet, D., Riaud, P., Absil, O., & Surdej, J. 2005, ApJ, 633, 1191 [NASA ADS] [CrossRef] [Google Scholar]

[16] Mawet, D., Serabyn, E., Liewer, K., et al. 2009, Opt. Expr., 17, 1902 [Google Scholar]

[17] ming Dai, G. 2006, J. Opt. Soc. Am. A, 23, 539 [NASA ADS] [CrossRef] [Google Scholar]

[18] Nijboer, B. 1943, Physica, 10, 679 [NASA ADS] [CrossRef] [Google Scholar]

[19] Nijboer, B. 1947, Physica, 13, 605 [NASA ADS] [CrossRef] [Google Scholar]

[20] Poynting, J. 1909, Proc. Roy. Soc. London Ser. A, 82, 560567 [NASA ADS] [Google Scholar]

[21] Riaud, P. 2003, Ph.D. Thesis, Université Paris VI [Google Scholar]

[22] Riaud, P. 2012, A&A, 545, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]