Free Access
Issue
A&A
Volume 524, December 2010
Article Number A80
Number of page(s) 10
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/201014934
Published online 25 November 2010

© ESO, 2010

1. Introduction

A new concept of a quasi-achromatic nulling interferometer has been proposed in a preceding publication (Rouan & Pelat 2008,hereafter Paper I). The whole idea is based on cancelling the polynomial part, according to both the wavelength and spatial coordinates, of the starlight distribution at the centre of the focal plane of the interferometer. This objective was attained thanks to a particular setting of phase shifters on the interferometer pupil planes. The advantage of the proposed design depends on a rather simple arrangement, where the key device is a simple square grid of phase shifters (the so-called “chessboard”) on the pupil plane of the telescopes.

In Paper I, several possible chessboards were proposed and a co-axial (Michelson) setting of the telescopes favoured. Here we concentrate more deeply on two high-performance configurations, one designed for the co-axial case and the other for the multi-axial (Fizeau) case. In Sect. 2, we outline the principles that led to the phase distribution and spatial repartition of the chessboards’ cells. A first approximation of the system performances is also given, with particular emphasis on the principles that govern its effectiveness. In Sect. 3, detailed analytical calculations are carried out to determine the chessboards’ nulling performances with more precision and to explore their robustness to a systematic or random piston on one arm of the interferometer.

2. The chessboard design

We recall that our device consists of two square grids of phase shifters each formed of $n_0^2$ adjacent cells. Finding a high-performance chessboard setting implies closely inspecting the complex amplitude of the electrical field of the interferometer output equipped with this device. If A stands for this amplitude, the point spread function is given by PSF =  | A | 2. According to Fourier optics one gets A(u,v)=Sum(expUV·expPQ)sinc(πdn0λu)sinc(πdn0λv).\begin{equation} \label{eq1} A(u,v)=\Sum({\exp {\rm UV}}\cdot {\exp{\rm PQ}})\sinc\left(\frac{\pi d}{n_0\lambda}u\right)\sinc\left(\frac{\pi d}{n_0\lambda}v\right). \end{equation}(1)In this expression, u and v are the Cartesian angular coordinates of a light ray diffracted by the pupil plane, d is the size of the pupil, n0 the number of cells on one side of the chessboards, and by definition sinc(x) = sin(x) / x. The symbolic expressions expUV and expPQ stand for two square matrices both with the “chessboard” format: n0 × n0. The “dot” operation · between the two matrices is the element-to-element product not the matrices product. The term Sum(  ) realises the sum of all elements of the matrix under the scope of that operator.

The matrix expUV takes the positioning of the cells into account, namely the place where the phase shifters are put on the grid of the chessboards. It is given by expUV=ejπudδ/n0λejπvdδ/n0λ.\begin{equation} \textrm{expUV} = {\rm e}^{j\pi ud\delta/n_0\lambda}\otimes {\rm e}^{j\pi vd\delta/n_0\lambda}. \end{equation}(2)In this expression, δ is the displacement vector. It is a set of integers varying from −(n0−1) to +(n0−1) by step of 2, and the symbol  ⊗  is the tensor product between the two vectors. For example, if n0 = 4, we have δ = (−3, −1,1,3) and we get expUV==[].\begin{eqnarray} \textrm{expUV} &=& \left[\begin{array}{c} {\rm e}^{j\frac{3\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{-\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{-3\pi d}{n_0\lambda} v}\\ \end{array}\right] \left[\begin{array}{cccc} {\rm e}^{j\frac{-3\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{-\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{3\pi d}{n_0\lambda} u} \end{array}\right], \label{eq:expUV}\\ &=& \left[\begin{array}{cccc} {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u+3v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u+v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u-v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u-3v)} \end{array}\right]. \end{eqnarray}The second matrix expPQ, takes the values of the phase shifters into account. Adopting the usual convention that the exponential of a matrix is the matrix of the exponentials, we have expPQ=ejπPΛejπuD/λ+ejπQΛe+jπuD/λ.\begin{equation} \textrm{expPQ} = {\rm e}^{j\pi P\Lambda}{\rm e}^{-j\pi uD/\lambda} + {\rm e}^{j\pi Q\Lambda}{\rm e}^{+j\pi uD/\lambda}. \label{eq:expPQ} \end{equation}(5)The matrices P and Q are the chessboard matrices, of size n0 × n0. All their elements are integers that give, in units of λ0 / 2, the optical path difference (OPD) induced by the corresponding cell of the chessboard. For example, if P =  [ 0 ]  and Q =  [ 1 ] , there is no OPD on one telescope but one of λ0 / 2 on the other one, inducing a phase shift of π at λ0. This particular setting corresponds to a Bracewell nulling interferomer (Bracewell 1978). Finally, D is the distance between the two pupils’ centres and Λ is a chromatic variable Λ=λ0λ=11+Δλλ0,Δλ=λλ0.\begin{equation} \Lambda = \frac{\lambda_0}{\lambda} = \frac{1}{1+\frac{\Delta\lambda}{\lambda_0}}\,,\quad \Delta\lambda=\lambda-\lambda_0. \end{equation}(6)(The Λ variable was named Δ in Paper I.)

To obtain the best achromatism, we propose a progressive approach below where we first look for the best phase-shift distribution and second for their spatial distribution. We start our investigation with the mathematically simpler co-axial design.

2.1. The co-axial chessboards

In the co-axial configuration, the two pupils are superimposed thanks to some optical arrangement. A practical example is given in Fig. 1, where it includes an imaging lens in the last stage since any efficient detection requires concentrating the light beam on a small surface.

thumbnail Fig. 1

A possible optical scheme of the nulling interferometer in co-axial configuration.

To consider this design, we set D = 0 in (5). The expPQ matrix therefore reduces to ejπPΛ + ejπQΛ. As a first step we do not consider the phase shifters’ location on the pupils; i.e., we set the displacement vector δ to zero making all elements of expUV equal to one. Accordingly, the Sum( ) term in (1) takes the simpler form of Sum(ejπPΛ + ejπQΛ), which we call the chromatic term.

2.1.1. The multigrad solution

The idea, explained in Paper I, is to cancel the chromatic term up to the Nth order of its Taylor series. For this purpose, we need the Taylor series expansion of the elements of ejπPΛ and ejπQΛ matrices. For example, if ns stands for the numerical values contained in P, the Taylor expansion, relative to ε = Δλ / λ0, is given by ensΛ=k=01k!εkenspk(ns),\begin{equation} \textstyle {\rm e}^{j\pi n_s\Lambda}=\sum_{k=0}^\infty \frac{1}{k!}\varepsilon^k {\rm e}^{j\pi n_s} p_k(j\pi n_s), \end{equation}(7)where pk stands for a polynomial of degree k. (Note that pk is not a monomial because of the chain rule needed to compute the derivatives.) Of course, a similar formula exists for emsΛ involving the values ms contained in Q. Cancelling the chromatic term up to order N implies solving the equation s=1n02k=0N1k!εk(enspk(ns)+emspk(ms))=0,ork=0N1k!εks=1n02(enspk(ns)+emspk(ms))=0.\begin{eqnarray} \textstyle \sum_{s=1}^{n_0^2}\sum_{k=0}^N\frac{1}{k!}\varepsilon^k \bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)&=&0,\;\textrm{or}\\ \textstyle \sum_{k=0}^N\frac{1}{k!}\varepsilon^k \sum_{s=1}^{n_0^2}\bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)&=&0. \end{eqnarray}This polynomial is zero for any ε if, and only if, all its coefficients are zero. Therefore we have to solve the system s=1n02(enspk(ns)+emspk(ms))=0,\begin{equation} \textstyle \sum_{s=1}^{n_0^2}\bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)=0, \end{equation}(10)for k = 0,...,N. If cℓk stands for the coefficients of the polynomials pk, this reduces to =0kcℓk()s=1n02(ensns+emsms)=0.\begin{equation} \textstyle \sum_{\ell=0}^k c_{\ell k} (j\pi)^\ell\sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^\ell+{\rm e}^{j\pi m_s} m_s^\ell)=0. \end{equation}(11)This system can be solved by induction, because if a solution is found up to order k, the solution up to order k + 1 is found if s=1n02(ensnsk+1+emsmsk+1)=0.\begin{equation} \textstyle \sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^{k+1}+{\rm e}^{j\pi m_s} m_s^{k+1})=0. \end{equation}(12)Therefore, the chromatic term is nullified up to order N if one solves the following system of equations s=1n02(ensnsk+emsmsk)=0;k=0,...,N,\begin{equation} \textstyle \sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^k+{\rm e}^{j\pi m_s} m_s^k)=0\,;\quad k=0,\dots,N, \end{equation}(13)for the unknown integers ns and ms and the parameter n0.

As in a Bracewell interferometer, we impose the constraint that ens = +1 and ems = −1, therefore the ns and ms are respectively even and odd integers. As introduced in Paper I, Appendix C, one ends up with a special set of diophantine equation called a multigrad system, namely s=1n02nsk=s=1n02msk,k=1,...,N.\begin{equation} \textstyle \sum_{s=1}^{n_0^2} n_s^k=\sum_{s=1}^{n_0^2} m_s^k\,,\; k=1,\dots,N. \end{equation}(14)If two sets of integers, say  { ns }  and  { ms } , solve a multigrad system up to order N, they are called multigrad sets. This property is shortly written as $\{m_s\}\stackrel{N}{=}\{n_s\}$. Hereafter, the two sets are supposed to be of equal size, for short $\{m_s\}\stackrel{0}{=}\{n_s\}$ (adopting the convention that 00 = 1).

The key theorem is that if one has $\{n_s\}\stackrel{N}{=}\{m_s\}$, then it follows that $\{n_s,m_s+c\}\stackrel{N+1}{=}\{m_s,n_s+c\}$. This is true for any c, but is non-trivial only if c ≠ 0 and  { ms }  ≠  { ns } . In this way, two multigrad sets up to order N generate two new multigrad sets up to order N + 1. During this operation, the size of the sets is multiplied by two. But in order that the number of cells on the chessboards remains a square, the size of the sets must be multiplied by four. This is done by applying two times the key theorem. Moreover, if the sets  { ns }  and  { ms }  possess the desired parity, we keep it if we choose an odd constant c, say c = 1. We get the final result:

if $\{n_s\}\stackrel{N}{=}\{m_s\}$, then

$\{n_s,m_s+1,m_s+1,n_s+2\}\stackrel{N+2}{=}\{m_s,n_s+1,n_s+1,m_s+2\}$.

This remarkable property is at the heart of the scheme proposed below.

2.1.2. The iterative Bracewell scheme

This scheme runs as follows. One starts with a Bracewell nulling interferometer $\{0\}\stackrel{0}{=}\{-1\}$ and constructs the next one by two applications of the key theorem. In this way, one first gets $\{0,0,0,2\}\stackrel{2}{=}\{-1,1,1,1\}$. Indeed, we have (always with 00 = 1) 00+00+00+20=10+10+10+10,01+01+01+21=11+11+11+11,02+02+02+22=12+12+12+12.\begin{eqnarray} 0^0+0^0+0^0+2^0&=&-1^0+1^0+1^0+1^0,\nonumber\\ 0^1+0^1+0^1+2^1&=&-1^1+1^1+1^1+1^1,\nonumber\\ 0^2+0^2+0^2+2^2&=&-1^2+1^2+1^2+1^2. \end{eqnarray}(15)As expected, this new interferometer cancels the chromatic term up to order N = 2. To construct the next one, we add c = −1 instead of c = +1 in order to gain symmetry and in an effort to keep the z-heights of the cells as low as possible. Finally, to generate the successive phase chessboards, we propose the following iterative scheme P0={0},Q0={1},Pn+1={Pn,Qn±1,Qn±1,Pn±2},Qn+1={Qn,Pn±1,Pn±1,Qn±2},\begin{eqnarray} P_0&=&\set{0},\quad Q_0=\set{-1},\nonumber\\ P_{n+1} &=&\set{P_n,Q_n\pm1,Q_n\pm1,P_n\pm2},\nonumber\\ Q_{n+1} &=&\set{Q_n,P_n\pm1,P_n\pm1,Q_n\pm2}, \end{eqnarray}(16)where  ±  means (−1)n (i.e. +1 if n is even and −1 if it is odd). With this process, the chessboards Pn and Qn retain the shape of two squares possessing n0 = 2n cells on each of their sides. Their combined effect is to cancel the chromatic term up to order N = 2n. The higher n, the more efficent the corresponding nulling interferometer.

Table 1

Number of cells in the interferometer (Pn,Qn) according to the phase shift exp(jkπλ0 / λ).

The iterative scheme presented above leads to constructing the interferometer (Pn,Qn) possessing, for example, the phase-shift distribution shown on Table 1. We observe that the progression of the number of cells that produce the same phase shift follows that of the binomial coefficients in the polynomial expansion of (1 + x)2n + 1. This is easily proved by two applications of the recursive formula ${2n \choose q}={2n-1 \choose q-1}+{2n-1 \choose q}$, which leads to ${2n+1 \choose q}={2n-1\choose q-2}+2{2n-1 \choose q-1}+{2n-1 \choose q}$, which is our iterative scheme. (In these formulæ we adopt the convention that ${n \choose q}=0$ if q ≤ 0.) Table 2 gives the phase-shift, together with the corresponding number of cells (the formulæ are valid for two consecutives orders).

Table 2

Number of cells in (Pn,Qn) producing a given phase shift.

2.1.3. First estimation of the iterative Bracewell scheme performance

In this section we study the nulling efficiency of an interferometer built on this principle. To compare the interferometers (Pn,Qn) in the same conditions, differing only by their order n, the light collecting surface of their mirrors must be kept constant by reducing the size of the cells to a square of surface d2 / 22n. Because at this point we have not distributed the cells on the pupil plane, we only consider An, the sum of the amplitudes. This leads to the formula An=d222nejKπλ0/λq=02n+1(2n+1q)ejqπλ0/λ,=2d2ejKπλ0/λ(1+z2)2n+1,\begin{eqnarray} A_n &=& \frac{d^2}{2^{2n}}{\rm e}^{-jK\pi\lambda_0/\lambda}\sum_{q=0}^{2n+1}{\textstyle {2n+1 \choose q}}\,{\rm e}^{jq\pi\lambda_0/\lambda},\\ &=&2d^2{\rm e}^{-jK\pi\lambda_0/\lambda}\left(\frac{1+z}{2}\right)^{2n+1}, \end{eqnarray}where we have set z = eλ0 / λ and $K=2\bigl\lfloor\frac{n}{2}\bigr\rfloor+1$. The common phase factor ejKπλ0 / λ sets the OPD reference, and the function  ⌊ x ⌋  is the “floor” function (i.e. it returns the greatest integer less than or equal to x.)

If λ = λ0, then z = −1 and An = 0, therefore the nulling is perfect for any n. The device is strictly equivalent to a Bracewell nulling interferometer. Now, if λ ≠ λ0, since the module of the complex (1 + z) / 2 is less than one, An → 0 as n, the order of the interferometers, increases. This is true for all λ except for the values such that z = 1, i.e. λ = ∞,λ0 / 2,λ0 / 4,...,λ0 / 2k,... Aside from the actual distribution of cells on the pupils, we have obtained a quasi-achromatic nulling interferometer.

However, one must judge the perfomances of the device relative to some benchmark. We have demonstrated above that, for an on-axis object, our device has powerful nulling efficiency on a given set of wavelengths. This is of course different for any off-axis object such as a planet which is best detected when its position induces an OPD = Dθplanet = λ0 / 2. In that situation, the recombination, at λ0, is fully constructive for the planet. Therefore, the interferometer with a phase shift of π at λ0 added to one of its pupils will serve as a reference. In so doing, we transform the nulling interferometer into a non-destructive one for the wavelengths set that was just mentioned above. The phase addition on all the, say, odd phase shifters changes the sign of the odd, q = 2p + 1, binomial coefficients in the expression of An. If $A'_n$ stands for the function obtained in this way, one gets An=2d2ejKπλ0/λ(1z2)2n+1·\begin{equation} A'_n=2d^2{\rm e}^{-jK\pi\lambda_0/\lambda}\left(\frac{1-z}{2}\right)^{2n+1}\cdot \end{equation}(19)The only change between the expressions of An and $A'_n$ is the sign of z. If λ = λ0 / 2, then z = 1 and $A'_n=0$, we also have a nulling interferometer. Again, if λ ≠ λ0 / 2, since the modulus of the complex (1 − z) / 2 is also less than one, $A'_n\rightarrow 0$ as n increases. This is true for all λ except for the values that make z = −1; that is, λ = λ0,λ0 / 3,λ0 / 5,...,λ0 / (2k + 1),... The result is an associated “out of phase” nulling interferometer but, fortunately, less efficient than the original one. The off-axis planet will indeed suffer some attenuation but much less than the on-axis star, as the ratio of An to $A'_n$ demonstrates.

We define ρn as $A_n/A'_n$, and name it the relative nulling efficiency. One gets ρn(z)=(1+z1z)2n+1=ρ02n+1(z).\begin{equation} \rho_n(z) = \left(\frac{1+z}{1-z}\right)^{2n+1}=\rho_0^{2n+1}(z). \end{equation}(20)If  | ρ0 |  is less than one and n large enough, the on-axis nulling can be made more efficient than the off-axis one by any factor. Note that ρ0(z) is the relative nulling efficiency of the original Bracewell interferometer.

A deeper understanding of how the iterative Bracewell interferometers works is obtained through careful analysis of the transformation induced by ρ0 on the complex plane C of the Fresnel representation. In fact, under ρ0, the unit circle is transformed into the imaginary axis1. One gets ρ0(z)=jcot(12arg(z)),arg(z)=πλ0λ·\begin{equation} \rho_0(z) = j\cot\bigl(\frac{1}{2}\arg(z)\bigr)\,,\quad\arg(z)=\pi\frac{\lambda_0}{\lambda}\cdot \end{equation}(21)This means that ρ0(z), and the Bracewell relative nulling efficency that follows is obtained by a stereographic projection of z toward the real 1 (see Fig. 2). The image of the half-circles $\frac{\pi}{2}<\theta\pmod{2\pi}<\frac{3\pi}{2}$ is the segment (−j,j) shown in green in the figure. All points in this segment have a modulus less than one, therefore  | ρn(z) |  can be made as small as desired, provided one chooses an interferometer of sufficiently high-order n. This defines many wavelengths orders, the first one being: $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$. Within these bandwiths, an “in phase” object, say a star, is strongly attenuated while an “out of phase” one, say a planet, is much less affected. This property opens interesting perspectives for broadband spectroscopy, or detection, of exoplanets, all the more so for intensity, since the nulling behaves as  | ρ0 | 4n + 2.

From the dispersion of the optical paths difference ΔOPD produced by the chessboards, it is possible to obtain a very good approximation of the full width at half maximum (FWHM) of the planet attenuation spectrum within the full bandwidths, e.g. $(\frac{2}{3}\lambda_0,2\lambda_0)$. Because the OPDs distribution is binomial, one gets $\sigma(\textrm{OPD})=\sqrt{2n+1}\lambda_0/4$ or approximately $\Delta \textrm{OPD}\propto \sqrt{n}\lambda_0$. Now, ΔOPD = cΔt and $\Delta E=h\Delta\nu=hc\Delta\lambda/\lambda^2\approx hc\Delta\lambda/\lambda_0^2$. By applying the Heisenberg uncertainty principle, we have ΔtΔE ≥ h / 4π, from which we get Δλλ01n·\begin{equation} \frac{\Delta \lambda}{\lambda_0}\propto \frac{1}{\sqrt{n}}\cdot \end{equation}(22)The constant of proportionality should not be very different from one, see Fig. 7 for an example where it is approximately equal to $1/\!\sqrt{2}$. While the planet-to-star ratio greatly increases with the order n of the chessboards, the effective bandwidth reduction in $\sqrt{n}$ implies longer exposure times to get the full range of wavelengths.

thumbnail Fig. 2

Graphic determination of the relative nulling efficiency ρ0 of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = eπλ0 / λ toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (Pn,Qn) is $\rho_0^{2n+1}$. This quantity decreases to 0, as n increases where  | ρ0 | <1. That is when ρ0(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ0 relative to its star may be detected in many bandwidths whose first order is $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$.

Finally, the proposed scheme generates, from one order to the next, a fourfold increase in the number of cells. In Appendix A, we investigate to what extent so many cells are necessary.

2.1.4. Placing the cells on the chessboards

To actually build the interferometer, it is necessary to distribute the chessboards cells on the pupil plane, an operation that leads to a specific PSF on the image plane. The previous performance analysis is valid for a perfectly parallel light beam, therefore, for any distribution of cells, it is only relevant for the central pixel of the image. For the other pixels, it is necessary to ensure that the cells distribution generates a PSF that does not ruin the mighty nulling power of the chessboards as derived above. This can be done by extending the same idea to the spatial dimension of the amplitude A, as given by Eq. (1).

As mentioned in Paper I, we gain insight into this problem by performing a Taylor expansion of the expUV matrix to get expUV=k=01k!(jπdn0λ)k[UV]k.\begin{equation} \textrm{expUV} = \sum_{k=0}^\infty \frac{1}{k!}\Bigl(j\frac{\pi d}{n_0\lambda}\Bigr)^k [\textrm{UV}]^k. \label{eq:a6} \end{equation}(23)In this expression, the symbol [UV] stands for a positioning matrix of size n0 × n0. For example, in the case n0 = 4, we have [UV]=[3u+3vu+3vu+3v3u+3v3u+vu+vu+v3u+v3uvuvuv3uv3u3vu3vu3v3u3v].\begin{equation} [\textrm{UV}] = \left[\begin{array}{cccc} -3u+3v & -u+3v & u+3v & 3u+3v\\ -3u+v & -u+v & u+v & 3u+v\\ -3u-v & -u-v & u-v & 3u-v\\ -3u-3v & -u-3v & u-3v & 3u-3v\\ \end{array}\right]. \end{equation}(24)By convention the elements of  [ UV ] k are the elements of [UV] raised to the power k. Following the above example, we get [UV]k=[(3u+3v)k(u+3v)k(u+3v)k(3u+3v)k(3u+v)k(u+v)k(u+v)k(3u+v)k(3uv)k(uv)k(uv)k(3uv)k(3u3v)k(u3v)k(u3v)k(3u3v)k].\begin{equation} [\textrm{UV}]^k = \left[\begin{array}{cccc} (-3u+3v)^k & (-u+3v)^k & (u+3v)^k & (3u+3v)^k\\ (-3u+v)^k & (-u+v)^k & (u+v)^k & (3u+v)^k\\ (-3u-v)^k & (-u-v)^k & (u-v)^k & (3u-v)^k\\ (-3u-3v)^k & (-u-3v)^k & (u-3v)^k & (3u-3v)^k\\ \end{array}\right]. \label{eq:UVk} \end{equation}(25)Again, the key observation is to recognise that the elements of [UV]k are the sampled values of the 2-D polynomial (x + y)k. Therefore, if we were able to arrange the cells in such a way that the expPQ matrix would turn out to be the coefficients of a finite difference 2-D differential operator of order k + 1, the expression expPQ· [ UV ] k would be zero. Indeed all terms of degree r less than or equal to k in (23) would also be cancelled, since expPQ· [ UV ] r would be the finite difference derivative of order k + 1 of a 2-D polynomials of degree less than or equal to k.

It seems, however, impossible to attain this objective exactly, but it is possible to approach it fairly closely. The first-order approximation, according to ε = Δλ / λ0, of expPQ is expPQejπP(1εjπP)+ejπQ(1εjπQ).\begin{equation} \textrm{expPQ}\approx {\rm e}^{j\pi P}(1-\varepsilon j\pi P)+{\rm e}^{j\pi Q}(1-\varepsilon j\pi Q). \end{equation}(26)In the case where P is even and Q odd, this formula reduces to expPQ ≈ −εjπ(P − Q). It is not too difficult to find an arrangement of cells on the pupil plane that makes P − Q equal to a matrix whose elements are those of a finite difference differential operator (fiddop). We found that it is achievable by the following iterative scheme P0=[0],Q0=[1],Pn+1=[],Qn+1=[].\begin{eqnarray} P_0=[0],&& Q_0=[-1],\nonumber\\ P_{n+1}= \left[\begin{array}{cc} Q_n\pm 1 & P_n\pm 2\\ P_n & Q_n\pm 1 \end{array}\right], && Q_{n+1}= \left[\begin{array}{cc} P_n\pm 1 & Q_n\pm 2\\ Q_n & P_n\pm 1 \end{array}\right]. \end{eqnarray}(27)In this expression  ±  again means (−1)n. (This scheme is the same as the iterative Bracewell one discussed above. Also $P_n\pm1=-Q_n^{\rm T}$, $Q_n\pm1=-P_n^{\rm T}$, where e.g. $P_n^{\rm T}$ stands for the transposition of Pn.) It follows, by induction, that Pn − Qn is, as desired, a fiddop of order n. To start the induction, we have P1Q1=[0200][11-11]=[-1+1+1-1].\begin{equation} P_1-Q_1 = \left[\begin{array}{cc} 0&2\\0&0 \end{array}\right] - \left[\begin{array}{rr} \;\;1&1\\-1&1 \end{array}\right] = \left[\begin{array}{rr} -1&+1\\+1&-1 \end{array}\right]. \end{equation}(28)It is indeed a gradient operator that, as such, will cancel the first-order (constant) term in the Taylor expansion of expUV. More generally we have Pn+1Qn+1=[][]=[].\begin{eqnarray} P_{n+1}-Q_{n+1} &=& \left[\begin{array}{cc} Q_n\pm 1 & P_n\pm 2\\ P_n & Q_n\pm 1 \end{array}\right] - \left[\begin{array}{cc} P_n\pm 1 & Q_n\pm 2\\ Q_n & P_n\pm 1 \end{array}\right]\\ &=& \left[\begin{array}{cc} Q_n-P_n&P_n-Q_n\\P_n-Q_n&Q_n-P_n \end{array}\right]. \end{eqnarray}Therefore, Pn + 1 − Qn + 1 is the gradient of Pn − Qn and, since the later is a fiddop of order n, then Pn + 1 − Qn + 1 is a fiddop of order n + 1. It is remarkable that all lines and columns of Pn − Qn are a Prouhet-Thue-Morse sequence of −1 and +1 of length 2n, as is clear from the way the Pn − Qn were constructed.

We recall that the Prouhet-Thue-Morse sequence Pn is a string of length 2n whose kth element is given by Pn(k) = (−1)s(k) for 0 ≤ k < 2n, where s(k) is the sum of the binary digits of k; e.g. P3 =  +1, −1, −1, +1, −1, +1, +1, −1. This sequence is also constructed iteratively by P0 = +1 and Pn = Pn−1 |  | −Pn−1, where  |  |  is the concatenation operator (Prouhet 1851; Allouche & Mendès-France 2008).

2.2. The multi-axial chessboards

For this design, we set D ≠ 0 in (5) (see Fig. 3 for a practical realisation).

thumbnail Fig. 3

A possible optical scheme of the nulling interferometer in multi-axial configuration.

It is convenient to write the expPQ matrix as expPQ=cos(πuD)(ejπPΛ+ejπQΛ)+jsin(πuD)(ejπPΛ+ejπQΛ).\begin{eqnarray} \textrm{expPQ}&=& \cos(\pi u D)({\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda})\nonumber\\ &&+ j\sin(\pi u D)(-{\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda}). \end{eqnarray}(31)From this we get Sum(expPQ·expUV)=cos(πuD)Sum((ejπPΛ+ejπQΛ)·expUV)jsin(πuD)Sum((ejπPΛejπQΛ)·expUV).\begin{eqnarray} \Sum(\textrm{expPQ}\cdot\textrm{expUV})=&&\nonumber\\ \cos(\pi uD)&\!\Sum\bigl(({\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda})\cdot\textrm{expUV}\bigr)&\nonumber\\ -j\sin(\pi uD)&\!\Sum\bigl(({\rm e}^{j\pi P\Lambda}-{\rm e}^{j\pi Q\Lambda})\cdot\textrm{expUV}\bigr)\!\!&\!. \end{eqnarray}(32)In the first term of the right-hand side member, we recognise the co-axial chromatic term defined above. We know how to cancel it, up to a given polynomial order. One way to cancel the second term is to generate an electrical field amplitude A, which is antisymmetric with respect to u (direction of the interferometer base), and to perform a sum of the electric field prior to its quadratic detection. Such a sum can be achieved in an elegant and simple way thanks to a single-mode fiber optics. The efficiency of this cancelling device has actually been demonstrated (e.g., Wallner et al. 2004; Buisset et al. 2006).

To obtain A(−u,v) = A(u,v), and because the sin(πuD) function is odd, it is sufficient to make sure that Sum((ejπPΛ − ejπQΛ)·expUV(u,v)) is even. It is clear from Eq. (3) that expUV(−u,v) is equal to expUV(u,v) after a symmetry around the pupil’s y-axis. Therefore, if P and Q were left invariant following the same symmetry, our goal would be attained. Moreover, since the Michelson chromatic term must also be made as low as possible, one must use the co-axial chessboards, one way or the other. One solution is to put the co-axial chessboards, shrunken by a factor of two in the upper right corner of the multi-axial chessboards, for example, and to fill the empty cells by means of successive x and y symmetries. This setting already gives good results, but one can gain more nulling power if one requires that A(−u, −v) = A(u,v).

thumbnail Fig. 4

The $(P^*_3,Q^*_3)$ chessboards. The P2, Q2 chessboards from which they were built are in the top right corners. The 2 or (2k + 1)π phase shifts at λ0 are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown.

To achieve this objective, one starts with the co-axial chessboard shrunk by a factor of two, and one again put it in the upper right-hand corner of the multi-axial chessboard under construction. We fill the lower right-hand corner by a x-axis symmetry with a change of sign and finally we fill the remaining left-hand side half plane by a y-axis symmetry, again with a change in sign. We symbolically note ‘unfold’ this procedure. If Pn−1 and Qn−1 stands for two uni-axial chessboards, we construct the corresponding multi-axial chessboards $P_n^*$ and $Q_n^*$ by way of the operations Pn=unfold(Pn1),Qn=unfold(Qn1).\begin{equation} P^*_n=\textrm{unfold}(P_{n-1})\,,\quad Q^*_n=\textrm{unfold}(Q_{n-1}). \end{equation}(33)($P^*_n$ and $Q^*_n$ have the same number of cells as Pn and Qn). We have, for example, P1=[0000],Q1=[+1-1-1+1],\begin{equation} P_1^* =\left[\begin{array}{cc}0&0\\0&0\end{array}\right],\quad Q_1^* =\left[\begin{array}{rr}+1&-1\\-1&+1\end{array}\right], \end{equation}(34)P2=[-200+200000000+200-2],Q2=[-1-1+1+1-1+1-1+1+1-1+1-1+1+1-1-1].\begin{equation} P_2^* =\left[\begin{array}{cccc}-2&0&0&+2\\0&0&0&0\\0&0&0&0\\+2&0&0&-2\end{array}\right],\quad Q_2^* =\left[\begin{array}{rrrr}-1&-1&+1&+1\\-1&+1&-1&+1\\+1&-1&+1&-1\\+1&+1&-1&-1\end{array}\right]. \end{equation}(35)The “unfold” procedure offers many advantages.

  • 1.

    Since the combination (Pn−1,Qn−1) cancels the chromatic term up to order N = 2(n − 1), the $(P^*_n,Q^*_n)$ is expected to cancel it, at least up to the same order N = 2n − 2. In fact, the change in sign during the unfolding process implements the following property: if $\{P_n\}\stackrel{2n}{=}\{Q_n\}$, then $\{P_n,-P_n\}\stackrel{2n+1}{=}\{Q_n,-Q_n\}$. Therefore the pair $(P^*_n,Q^*_n)$ cancels the chromatic term up to order N = 2n − 1.

  • 2.

    We expect $P^*_n-Q^*_n$ to be, as Pn−1 − Qn−1, a fiddop of, at least, order n − 1. This is true if n is even, but thanks to the symmetries of the Prouhet-Thue-Morse sequence, this order is increased to n if n is odd.

  • 3.

    Finally, the symmetry of the multi-axial chessboards is such that it cancels the  [ UV ] k matrices when k is odd (see Eq. (25)).

Because of these supplementary properties, the chessboards $(P^*_n,Q^*_n)$ designed for the multi-axial case are expected to be just slightly less successful than their co-axial (Pn,Qn) counterparts.

Table 3

The nulling power of the chessboards considered in this paper.

Table 3 is a summary of the nulling properties of the chessboards considered in this paper.

3. Performance of the design

A first evaluation of the performances of the chessboards have been established in the preceding section; however, it is possible to obtain far more detailed information by working out the analytical expression of the electrical field amplitude. This is made possible because relatively simple operations allow passing, almost mechanically, from one chessboard of order n to the next one. These are, in the following order,

  • start with a Bracewell setting;

  • shrink the chessboards by a factor of two;

  • shift it to its proper place;

  • add a phase; and

  • perform symmetries with a change in sign.

All these operations, when performed on the pupil plane, have simple Fourier equivalents on the image plane. Therefore, a computation of the amplitude function can be considered. A straightforward, though lengthy, analytical work leads to the formulæAn=[Pnej2θuDd+Qnej2θuDd]sinc(12n1θu)sinc(12n1θv),\begin{equation} A_n = \left[P_n{\rm e}^{-j2\theta_u \frac{D}{d}} + Q_n{\rm e}^{j2\theta_u \frac{D}{d}}\right] \sinc\left(\frac{1}{2^{n-1}}\theta_u\right) \sinc\left(\frac{1}{2^{n-1}}\theta_v\right), \label{eq:champ} \end{equation}(36)Pn=12(Sn+Dn),Qn=12(SnDn),\begin{equation} P_n=\thalf\left(S_n+D_n\right),\quad Q_n=\thalf\left(S_n-D_n\right), \end{equation}(37)where Pn and Qn are, in this context, two functions on the image plane. They are not the chessboard matrices. The expressions for Sn and Dn differ according to the interferometer setting. In the co-axial case, we set D = 0 and obtain Sn=Sn,Dn=Dn,\begin{equation} S_n= S'_n,\quad D_n = D'_n, \end{equation}(38)while in the multi-axial one, D ≠ 0 and we find Sn=Sn′′(θu,θv,φ)+Sn′′(θu,θv,φ)+Sn′′(θu,θv,φ)+Sn′′(θu,θv,φ),\begin{eqnarray} S_n &=& S''_n(\theta_u,\theta_v,-\phi^*)+S''_n(-\theta_u,-\theta_v,-\phi^*)\nonumber\\ &&+S''_n(-\theta_u,\theta_v,\phi^*)+S''_n(\theta_u,-\theta_v,\phi^*), \end{eqnarray}(39)Dn=Dn′′(θu,θv,φ)+Dn′′(θu,θv,φ)+Dn′′(θu,θv,φ)+Dn′′(θu,θv,φ),\begin{eqnarray} D_n&=& D''_n(\theta_u,\theta_v,-\phi^*)+D''_n(-\theta_u,-\theta_v,-\phi^*)\nonumber\\ &&+ D''_n(-\theta_u,\theta_v,\phi^*)+D''_n(\theta_u,-\theta_v,\phi^*), \end{eqnarray}(40)Sn′′(θu,θv,φ)=14ej(θu+θv)Sn1(12θu,12θv,φ),Dn′′(θu,θv,φ)=14ej(θu+θv)Dn1(12θu,12θv,φ).\begin{eqnarray} S''_n(\theta_u,\theta_v,\phi^*) &=& \frac{1}{4}{\rm e}^{j(\theta_u+\theta_v)}S'_{n-1}\left(\frac{1}{2}\theta_u,\frac{1}{2}\theta_v,\phi^*\right),\\ D''_n(\theta_u,\theta_v,\phi^*) &=& \frac{1}{4}{\rm e}^{j(\theta_u+\theta_v)}D'_{n-1}\left(\frac{1}{2}\theta_u,\frac{1}{2}\theta_v,\phi^*\right). \end{eqnarray}The $S'_n$ and $D'_n$ functions reflect the self similarity of the iterative Bracewell scheme and may be called the “nulling functions”: Sn(θu,θv,φ)=(1+ej2φ)􏽑=0n1cos(φ+1(2)θu)×cos(φ+1(2)θv),Dn(θu,θv,φ)=(1ej2φ)􏽑=0n1sin(φ+1(2)θu)×sin(φ+1(2)θv).\begin{eqnarray} S'_n(\theta_u,\theta_v,\phi^*)&=&\left(1+{\rm e}^{j2\phi^*}\right)\tprod_{\ell=0}^{n-1}\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\nonumber\\ &&\times\cos\textstyle\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right),\\ D'_n(\theta_u,\theta_v,\phi^*)&=&\left(1-{\rm e}^{j2\phi^*}\right)\tprod_{\ell=0}^{n-1}\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\nonumber\\ &&\textstyle\times\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right). \end{eqnarray}For a given λ, the θu,θv are the spatial variables and φ ∗  is the chromatic variable (because it depends on λ0). We then obtain θu=π2dλu,θv=π2dλv,\begin{equation} \theta_u=\frac{\pi}{2}\frac{d}{\lambda}u\,,\quad \theta_v=\frac{\pi}{2}\frac{d}{\lambda}v, \end{equation}(45)φ={φifniseven,+φifnisodd.φ=π2λ0λ·\begin{equation} \phi^* = \left\{\begin{array}{ll} -\phi & \textrm{if } n \textrm{ is even,}\\ +\phi & \textrm{if } n \textrm{ is odd.} \end{array}\right. \quad \phi=\frac{\pi}{2}\frac{\lambda_0}{\lambda}\cdot \end{equation}(46)One recalls that: D is the separation between the two telescopes, d the mirror size, λ the wavelength considered, λ0 the wavelength for which the chessboards are tuned and u,v are the Cartesian angular coordinates of the light diffracted by the pupils. Figure 5 shows the theoretical image of a planetary system similar to ours seen with the $(P^*_7,Q^*_7)$ device. The image was calculated using these formulæ.

thumbnail Fig. 5

Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multi-axial $(P_n^*,Q_n^*)$ design of order n = 7. The system consists of a star and a 106 fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I0.25 filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ0.

To account for a possible path difference between the two arms of the interferometer, we introduce a systematic piston on one arm. If ϵ is the piston value, instead of (36), we have An=[Pnej2θuDd+Qnej(2θuDd+2φϵ)]sinc(12n1θu)\begin{eqnarray} A_n &=& \left[P_n{\rm e}^{-j2\theta_u \frac{D}{d}} + Q_n{\rm e}^{j(2\theta_u \frac{D}{d} + 2\phi_\epsilon)}\right] \sinc\left(\frac{1}{2^{n-1}}\theta_u\right)\nonumber\\ &&\times\, \sinc\left(\frac{1}{2^{n-1}}\theta_v\right), \label{eq:piston} \end{eqnarray}(47)withφϵ=πϵλ·\begin{equation} \textrm{with}\;\phi_\epsilon =\pi\frac{\epsilon}{\lambda}\cdot \end{equation}(48)

3.1. Extinction of the star, attenuation of the planet, and planet-to-starlight ratio

Let us define $R_n^*$ as the residual stellar light intensity. We have $R_n^*=|A_n|^2$. We consider this function successively for the co-axial and multi-axial cases below.

3.1.1. Co-axial case

Residual light from the star.

One finds, in the co-axial case Rn(u,v,λ;λ0)=(cosφϵcosφ􏽑=0n1cos(φ+1(2)θu)cos(φ+1(2)θv)sinφϵsinφ􏽑=0n1sin(φ+1(2)θu)sin(φ+1(2)θv))2×sinc2(12n1θu)sinc2(12n1θv).\begin{eqnarray} R_n^*(u,v,\lambda;\lambda_0,\epsilon)=\nonumber\\ \Bigl (\cos\phi_\epsilon\cos\phi^*&\!\!\tprod_{\ell=0}^{n-1}\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right)\nonumber\\ -\sin\phi_\epsilon\sin\phi^*&\!\!\tprod_{\ell=0}^{n-1}\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right)\Bigr)^2\nonumber\\ &\times\,\sinc^2\bigl(\frac{1}{2^{n-1}}\theta_u\bigr)\sinc^2\bigl(\frac{1}{2^{n-1}}\theta_v\bigr). \label{eq:starco} \end{eqnarray}(49)The starlight extinction spectrum shown in Fig. 6 is defined as the integration $\int_{\rm Airy} R_n^*(u,v,\lambda;\lambda_0,0)\,{\rm d}u{\rm d}v$. Of course, as expected, the nulling effect may reach considerable values over a large bandwidth. For example, an extinction factor of 105 can be easily achieved over the bandwidth 6.2–17.5 μm by an interferometer of order n = 7 (orange curve) tuned at λ0 = 9.05μm.

thumbnail Fig. 6

Extinction spectrum of a star by iterative Bracewell nulling uni-axial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The x-axis is the wavelength, either Δλ / λ0 (bottom) or λ in μm for λ0 = 9.05   μm (top). The $R_n^*$ integration has been performed over an Airy square of size 2λ0 / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The y-axis is the log  of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 105.

thumbnail Fig. 7

Attenuation spectrum of a planet following the same setting, except for the linear y-axis, as in Fig. 6. Near the wavelength λ0, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by $\textit{FWHM}=1/\sqrt{2n}$, where n is the interferometer order.

To understand more deeply how the nulling works, one considers $R_n^*$ at the centre of the image plane (i.e. θu = 0 and θv = 0), leading to Rn(λ;λ0)=(cosφϵcos2n+1φsinφϵsin2n+1φ)2.\begin{equation} R_n^*(\lambda;\lambda_0,\epsilon)=\left(\cos\phi_\epsilon\cos^{2n+1}\phi^*-\sin\phi_\epsilon\sin^{2n+1}\phi^*\right)^2. \end{equation}(50)Finally, we suppose a perfect design: ϵ = 0. One ends up with the chessboard’s extinction capability characterised by Rn(λ;λ0)=cos4n+2(π2λ0λ)·\begin{equation} R_n^*(\lambda;\lambda_0)=\cos^{4n+2}\left(\frac{\pi}{2}\frac{\lambda_0}{\lambda}\right)\cdot \end{equation}(51)As n goes to infinity, the starlight residual goes to zero at all wavelengths, except those where the cosine is equal to  ± 1; that is, λ = ∞,λ0 / 2,λ0 / 4,...,λ0 / 2k,...

Light from the planet.

Since it is off-axis, the planet wavefront is tilted relative to the pupil plane. As in Paper I, we assume that the planet was observed when this tilt induced a phase shift of π at λ0 between the two arms of the interferometer. If, on the pupil plane, we neglect the effect of the tilt, the residual planet intensity is given by $R_n^\oplus=r(\lambda)R_n^*(\phi'_\epsilon)$, where $\phi'_\epsilon=\phi_\epsilon+\pi$. In this expression r(λ) stands for the planet/star intensity ratio at λ, typically r ≈ 10-6 in the thermal infrared. At the image centre, we have Rn(λ;λ0)=r(λ)(sinφϵcos2n+1φcosφϵsin2n+1φ)2,\begin{equation} R_n^\oplus(\lambda;\lambda_0,\epsilon)=r(\lambda)\left(\sin\phi_\epsilon\cos^{2n+1}\phi^*-\cos\phi_\epsilon\sin^{2n+1}\phi^*\right)^2, \end{equation}(52)and, in the absence of piston, Rn(λ;λ0)=r(λ)sin4n+2(π2λ0λ)·\begin{equation} R_n^\oplus(\lambda;\lambda_0)=r(\lambda)\sin^{4n+2}\Bigl(\frac{\pi}{2}\frac{\lambda_0}{\lambda}\Bigr)\cdot \end{equation}(53)As n goes to infinity, the planet light also goes to zero at all wavelengths, except those where the sine is equal to  ± 1, that is: λ = λ0,λ0 / 3,...,λ0 / (2k + 1),... Figure 7 shows the attenuation spectrum of the planet.

Planet-to-starlight ratio.

To successfully detect and to do the spectroscopy of an exoplanet, one must reduce all sources of noise. For the lower levels n of our device, the starlight residuals play a predominant role over other sources of noise (such as the exozodiacal light for example). One must therefore consider the ratio $R_n^\oplus/R_n^*$, which gives the planet residual intensity relatives to the star residual intensity. This quantity may be called the planet-to-starlight ratio and denoted rn. Typically, if rn ≥ 1, the source of noise from the star ceases to be dominant and the planet is likely to be detected. We obtain rn(λ;λ0)=r(λ)(tanφϵ±tan2n+1φ1±tanφϵtan2n+1φ)2,\begin{equation} r_n(\lambda;\lambda_0,\epsilon) = r(\lambda)\left(\frac{\tan\phi_\epsilon\pm\tan^{2n+1}\phi}{1\pm\tan\phi_\epsilon\tan^{2n+1}\phi}\right)^2\,, \end{equation}(54)where again  ±  means + if n is even and − if it is odd. If n → ∞, and to the extent that  | tanφ | >1, then rn → rcot2φϵ. This expression reduces to rn ≈ r(λ)(λ / πϵ)2 for low piston value. For example, if r = 10-6, λ = 10μm, and ϵ = 3nm, we get rn ≈ 1. That means that a planet 106 times fainter than its star is likely to be detected at 10 μm even with a piston of 3nm. If the piston is increased by a factor of 3, then the planet-to-starlight ratio diminishes by approximately a factor of 10. Figure 8 shows the planet-to-starlight ratio for a zero piston.

This discussion is, for the central pixel and a zero piston, in accordance with the previous analysis in 2.1.3. In particular, we have r / rn =  | ρn | 2.

thumbnail Fig. 8

Planet-to-starlight ratio spectrum of a planet 106 times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective.

The planet-to-starlight ratio bandwidth.

The condition  | tanφ | >1 defines the planet-to-starlight ratio bandwidth or, for short, the contrast bandwidth. If ϵ = 0, it defines a first order where rn goes to infinity like tan4n + 2φ as n (the interferometer order) increases. The contrast bandwidth of the first order is π / 4 ≤ φ ≤ 3π / 4; which is, $\frac{2}{3}\lambda_0\le\lambda\le 2\lambda_0$. For example, if λ0 = 9μm, one gets the bandwidth 6μm ≤ λ ≤ 18μm, a range consistent with the 6–17 μm one demanded by the darwin spatial project.

Higher orders do indeed exists. The kth order is given by $\frac{2}{4k-1}\lambda_0\le\lambda\le\frac{2}{4k-3}\lambda_0$, k = 1,2,...; elsewhere rn goes to zero. If ϵ ≠ 0, the bandwidth is shrunk. One can show that the first order is now roughly given by φ1 ≤ φ ≤ π − φ1, with $\phi_1=\arctan(\tan^{-1/(2n-1)}\frac{\pi}{2}\epsilon/\lambda_0)$. (This formula is valid for ϵ ≠ 0 and a sufficiently high n, say n > 5.).

3.1.2. Multi-axial case

The discussion is about the same as the previous one; however, there is no simple formula like Eq. (49), the more general Eq. (47) must be used instead. If one considers the image centre, one obtains Rn(λ;ϵ,λ0)=cos2φϵcos4nφ+sin2φϵsin4nφ,Rn(λ;ϵ,λ0)=r(λ)(sin2φϵcos4nφ+cos2φϵsin4nφ).\begin{eqnarray} R_n^*(\lambda;\epsilon,\lambda_0)&=&\cos^2\phi_\epsilon\cos^{4n}\phi+\sin^2\phi_\epsilon\sin^{4n}\phi,\\ R_n^\oplus(\lambda;\epsilon,\lambda_0)&=&r(\lambda)\left(\sin^2\phi_\epsilon\cos^{4n}\phi+\cos^2\phi_\epsilon\sin^{4n}\phi\right). \end{eqnarray}From this one gets the planet-to-starlight ratio rn(λ;ϵ,λ0)=r(λ)tan2φϵ+tan4nφ1+tan2φϵtan4nφ·\begin{equation} r_n(\lambda;\epsilon,\lambda_0)=r(\lambda)\frac{\tan^2\phi_\epsilon+\tan^{4n}\phi}{1+\tan^2\phi_\epsilon\tan^{4n}\phi}\cdot \end{equation}(57)Again, if the piston ϵ is zero, the planet-to-starlight ratio goes to infinity like tan4nφ, as n, the interferometer order, increases. This defines the many spectral orders: $\frac{2}{4k-1}\lambda_0\le\lambda\le\frac{2}{4k-3}\lambda_0$, and the first one $\frac{2}{3}\lambda_0\le\lambda\le 2\lambda_0$ is the most extented.

If the piston is not zero, the planet-to-starlight ratio is approximately multiplied by a factor cot2φϵ ≈ (λ / πϵ)2 on approximately the same reduced bandwidth φ1 ≤ φ ≤ π − φ1 as in the co-axial case. Figure 9 shows this ratio for a piston of zero.

thumbnail Fig. 9

Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ0 / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics.

3.2. Introduction of a random piston

Given the rather tight specifications on the value of the acceptable piston, the new concept seems more suited to a spatial project. However, with the advent of modern fringe-tracking techniques, it is possible to consider a ground-based instrument located on a site of excellent quality, such as in Antarctica. In that case, the rapid atmospheric fluctuations must be taken into account, and the residual differential piston ϵ between the P and Q pupils after correction by a fringe tracker, must be considered as a random variable. In this situation, the residual light function $R^*_n$ must be replaced by its expectation $\e{R^*_n}$. To avoid repetition, we focus on the multi-axial case, because the co-axial one is very similar.

To compute $\e{R^*_n}$, we write Rn=cos2φϵcos4nφ+sin2φϵsin4nφ,=12(cos4nφ+sin4nφ)+12cos2φϵ(cos4nφsin4nφ),\begin{eqnarray} R^*_n &=&\cos^2\phi_\epsilon\cos^{4n}\phi+\sin^2\phi_\epsilon\sin^{4n}\phi,\\ &=&\thalf\left(\cos^{4n}\phi+\sin^{4n}\phi\right)\nonumber\\ &&+\, \thalf\cos2\phi_\epsilon\left(\cos^{4n}\phi-\sin^{4n}\phi\right), \end{eqnarray}E{Rn}=12(cos4nφ+sin4nφ)+12E{cos2φϵ}×(cos4nφsin4nφ).\begin{eqnarray} \e{R^*_n} &=&\thalf\left(\cos^{4n}\phi+\sin^{4n}\phi\right) + \thalf\e{\cos2\phi_\epsilon}\nonumber\\ &&\times\,\left(\cos^{4n}\phi-\sin^{4n}\phi\right). \end{eqnarray}(60)To evaluate E {  cos 2φϵ } , we take advantage of the fact that, if X is a random variable, then E {  cos X }  is the real part of its characteristic function Z(ω) = E { ejωX }  evaluated at ω = 1. Furthermore, if X is a Gaussian random variable of mean m and standard deviation s, its characteristic function is $Z(\omega)={\rm e}^{jm\omega-\half s^2\omega^2}$. By setting X = 2πϵ / λ, we obtain m = 2πμ / λ and s = 2πσ / λ, where μ is the mean and σ the standard deviation of ϵ. It follows that E{cos2φϵ}=exp(2π2σ2/λ2)cos(2πμ/λ).\begin{equation} \e{\cos2\phi_\epsilon} = \exp\left({-2\pi^2\sigma^2/\lambda^2}\right)\cos(2\pi\mu/\lambda)\,. \end{equation}(61)It is convenient to express this result in terms of the equivalent piston $\tilde{\epsilon}$ defined as $\cos2\phi_{\tilde{\epsilon}}=\e{\cos2\phi_\epsilon}$. An important example is the unbiased case where μ = 0. If σ / λ is small, one obtains E { cos2φϵ }  = exp(−2π2σ2 / λ2) ≈ 1−2(πσ / λ)2 and since $\cos2\phi_{\tilde{\epsilon}}\approx1-2(\pi\tilde{\epsilon}/\lambda)^2$, then the equivalent piston is equal to σ.

The main result of this section is that, in the case of Gaussian random fluctuations of the piston around zero, the expected residual light function is equal to its deterministic value for the standard deviation of the piston; that is, E{Rn(u,v,λ;λ0)}=Rn(u,v,λ;λ0).\begin{equation} \e{R^*_n(u,v,\lambda;\lambda_0,\epsilon)} = R^*_n(u,v,\lambda;\lambda_0,\sigma). \end{equation}(62)Figures 10 and 11 show the planet-to-starlight ratio (or its expectation) for a piston of 3 nm.

thumbnail Fig. 10

Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ0 = 9.05   μm.

thumbnail Fig. 11

Planet-to-starlight ratio spectrum. Same as Fig. 10 but for a multi-axial Fizeau setting.

4. Conclusion

This paper presented a more complete analysis of a very efficient nulling interferometer, which appears as an iterative version of the well known Bracewell nulling interferometer. With the help of extended analytical computations of the amplitude expression, we show that the device acts as an optical differential operator on the 3D light distribution, i.e., spatially and chromatically.

We considered two kinds of systems: the (Pn,Qn) chessboards designed for the Michelson setting and the $(P^*_n,Q^*_n)$ chessboards designed for the Fizeau one. The (Pn,Qn) cannot be used in the Fizeau case because their PSF are not axisymmetric. The $(P^*_n,Q^*_n)$, however, may be considered for a Michelson interferometer because their performances are just slightly inferior to their Fizeau equivalents and present the advantage of an axisymmetric PSF.

The net effect of the device discussed in this paper is to “push away” the stellar light from the centre of the image, leaving room for an “out of phase” object to appear in the area now left free. This takes place for a very broad range of wavelengths if one considers the planet-to-starlight ratio. A design of order n = 7 tuned at λ0 = 9.05μm can meet the darwin space project specifications, i.e., to cancel the starlight by a factor of 105 over a wavelength range of 6–17 μm. Since the planet light is also diminished, the order n should not be pushed far beyond the point where the planet-to-starlight ratio is equal to one. For example, with n = 7, the starlight residuals are reduced below the planet light on most of the darwin bandwidth, while one still has more than 50% of the planet light over 20% of the bandwidth. Therefore, the full nominal darwin bandwidth can be covered for the planet by adjusting the device successively to five different wavelengths λ0.

For an effective implementation of this device on the sky, a detailed study of the signal-to-noise ratio should be made prior to decide to what order the interferometer should be constructed. This choice must result from a compromise between the depth of the nulling and the effective planet-light bandwidth.

Imperfections in design have been considered in the form of a deterministic or a random piston between the two pupils of the interferometer. A systematic piston or equivalently a standard deviation of 3 nm seems to lead to acceptable performances in the case of the darwin project but also for a ground-based interferometer.


1

More precisely, ρ0 performs a Möbius transformation of C. In that case, it is a reflection (inversion) relative to the circle of center 1 and radius $\sqrt{2}$, followed by a reflection about the imaginary axis (see e.g. Ratcliffe, J. G. 1994,Chap. 4). This transformation leaves invariant the two points j and −j.

Acknowledgments

We are very grateful to Raphaël Galicher for a careful reading of the manuscript and to the referee whose comments have greatly helped clarify and improve this paper. This work received the support of PHASE, the high angular resolution partnership between ONERA, Observatoire de Paris, CNRS, and University Denis Diderot Paris 7.

References

  1. Allouche, J.-P., & Mendès-France, M. 2008, Monatshefte für Mathematik, 155, 301 [CrossRef] [Google Scholar]
  2. Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]
  3. Buisset, C., Rejeaunier, X., Rabbia, Y., et al. 2006, in Advances in Stellar Interferometry, ed. J. D. Monnier, M. Schöller, & W. C. Danchi, SPIE Conf. Proc., 6268, 626819 [Google Scholar]
  4. Prouhet, E. 1851, Comptes Rendus des Séances de l’Académie des Sciences, 33, 225 [Google Scholar]
  5. Ratcliffe, J. G. 1994, Foundations of Hyperbolic Manifold, 1st edn. (New York: Springer-Verlag) [Google Scholar]
  6. Rouan, D., & Pelat, D. 2008, A&A, 484, 581 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  7. Wallner, O., Perdigues Armengol, J. M., & Karlsson, A. L. 2004, in New Frontiers in Stellar Interferometry, ed. W. A. Traub. (Bellingham, WA: The International Society for Optical Engineering), SPIE Conf. Proc., 5491, 798 [Google Scholar]

Appendix A: Constraints on the number of cells

Here we try to find a lower limit to the number of cells S constituting the chessboards for a given even nulling order N. We recall that the numeric values contained in the Pn and Qn chessboards are even and odd integers, respectively.

We find that our design is optimal for N = 2 and 4, but the question remains open for N ≥ 6.

Cancelling up to the second nulling order.

If ns and ms are the P and Q matrices elements, we must have s=1Sns2=s=1Sms2=Σ,\appendix \setcounter{section}{1} \begin{equation} \tsum_{s=1}^S n_s^2 = \tsum_{s=1}^S m_s^2 = \Sigma, \end{equation}(A.1)where Σ is the common sum. Since the ns are even, Σ must be divisible by 4. Because ms is odd, we have ms = 2ks + 1 and s=1S(2ks+1)20(mod4),4s=1Sks2+4s=1Sks+S0(mod4),S0(mod4).\appendix \setcounter{section}{1} \begin{eqnarray} \tsum_{s=1}^S (2k_s+1)^2 &\equiv& 0\pmod{4},\\ 4\tsum_{s=1}^S k_s^2 + 4\tsum_{s=1}^S k_s + S &\equiv& 0\pmod{4},\\ S &\equiv& 0\pmod{4}. \end{eqnarray}Therefore the number of cells in each chessboard must be a non-zero multiple of 4. We have shown by the construction of the P1 and Q1 chessboards that S = 4 was sufficent, and this is the minimum number of cells required to cancel the chromatic term up to second order.

The above result is a consequence of the property that (2k + 1)2 ≡ 1(mod4). In fact one can show that (2k + 1)2 ≡ 1(mod8) that, in turn, is a particular case with p = 1 of the more general result (2k+1)2p1(mod2p+2),\appendix \setcounter{section}{1} \begin{equation} \label{eq:modulo} (2k+1)^{2^p}\equiv 1\pmod{2^{p+2}}\,, \end{equation}(A.5)which one can easily establish by induction.

Cancelling up to the fourth nulling order.

From (A.5) with p = 2, we obtain (2k + 1)4 ≡ 1(mod16). Then we deduce that S ≡ 0(mod16). Again, we show that the 16 cells in P2 and Q2 were sufficent to cancel the chromatic term up the fourth order, and it is the minimum.

Cancelling beyond the fourth nulling order.

One can use (A.5) to sets limits on the sixth order cancelled by P3 and Q3, but they are too loose. (Unless one can prove that, if 22n is a divisor of \appendix \setcounter{section}{1} $\sum_{s=1}^S(2k_s+1)^{2n}$ then, for n > 1, it is also a divisor of \appendix \setcounter{section}{1} $\sum_{s=1}^S(2k_s+1)^{2^{2n-2}}$). All that can be said is that the number of cells must be a multiple of 16, while P3 and Q3 possesses 64. Up to now, as soon as the nulling order N ≥ 5, no tighter limits than those deduced from (A.5) are known to the authors regarding the number of cells necessary to cancel the chromatic term up to that order.

All Tables

Table 1

Number of cells in the interferometer (Pn,Qn) according to the phase shift exp(jkπλ0 / λ).

Table 2

Number of cells in (Pn,Qn) producing a given phase shift.

Table 3

The nulling power of the chessboards considered in this paper.

All Figures

thumbnail Fig. 1

A possible optical scheme of the nulling interferometer in co-axial configuration.

In the text
thumbnail Fig. 2

Graphic determination of the relative nulling efficiency ρ0 of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = eπλ0 / λ toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (Pn,Qn) is $\rho_0^{2n+1}$. This quantity decreases to 0, as n increases where  | ρ0 | <1. That is when ρ0(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ0 relative to its star may be detected in many bandwidths whose first order is $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$.

In the text
thumbnail Fig. 3

A possible optical scheme of the nulling interferometer in multi-axial configuration.

In the text
thumbnail Fig. 4

The $(P^*_3,Q^*_3)$ chessboards. The P2, Q2 chessboards from which they were built are in the top right corners. The 2 or (2k + 1)π phase shifts at λ0 are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown.

In the text
thumbnail Fig. 5

Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multi-axial $(P_n^*,Q_n^*)$ design of order n = 7. The system consists of a star and a 106 fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I0.25 filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ0.

In the text
thumbnail Fig. 6

Extinction spectrum of a star by iterative Bracewell nulling uni-axial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The x-axis is the wavelength, either Δλ / λ0 (bottom) or λ in μm for λ0 = 9.05   μm (top). The $R_n^*$ integration has been performed over an Airy square of size 2λ0 / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The y-axis is the log  of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 105.

In the text
thumbnail Fig. 7

Attenuation spectrum of a planet following the same setting, except for the linear y-axis, as in Fig. 6. Near the wavelength λ0, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by $\textit{FWHM}=1/\sqrt{2n}$, where n is the interferometer order.

In the text
thumbnail Fig. 8

Planet-to-starlight ratio spectrum of a planet 106 times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective.

In the text
thumbnail Fig. 9

Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ0 / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics.

In the text
thumbnail Fig. 10

Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ0 = 9.05   μm.

In the text
thumbnail Fig. 11

Planet-to-starlight ratio spectrum. Same as Fig. 10 but for a multi-axial Fizeau setting.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.