The achromatic chessboard, a new concept of a phase shifter for nulling interferometry

D. Pelat; D. Rouan; D. Pickel

doi:10.1051/0004-6361/201014934

Home

All issues

Volume 524 (December 2010)

A&A, 524 (2010) A80

Full HTML

Free Access

Issue		A&A Volume 524, December 2010


Article Number		A80
Number of page(s)		10
Section		Astronomical instrumentation
DOI		https://doi.org/10.1051/0004-6361/201014934
Published online		25 November 2010

A&A 524, A80 (2010)

II. Theoretical performance assessment

D. Pelat¹, D. Rouan² and D. Pickel¹^,2

¹ LUTH, Observatoire de Paris, CNRS, Université Paris Diderot, 92190 Meudon, France
e-mail: didier.pelat@obspm.fr
² LESIA, Observatoire de Paris, CNRS, UPMC, Université Paris Diderot, 5 place Jules Janssen, 92190 Meudon, France
e-mail: daniel.rouan@obspm.fr; damien.pickel@obspm.fr

Received: 4 May 2010
Accepted: 11 August 2010

Abstract

Context. Nulling interferometry in the mid-IR using two telescopes (commonly referred to a Bracewell interferometer) is one possible way of directly detecting exoplanets in the habitable zone and their characterisation in terms of possible life signatures. A large wavelength domain is needed to simultaneously detect the infrared spectral features of a set of a bio-tracers. An achromatic phase shift of π is then required, and we previously presented a new concept for such a function that allows a simple design with only one device per beam. It is based on two cellular mirrors, called the chessboards, where each cell has a thickness that introduces, for any given central wavelength, a phase shift of (2k + 1)π or of 2kπ on the fraction of the wave it reflects.

Aims. We explore a more rigorous way to establish the optimum cell pattern design to attain the best theoretical performances for planet detection over a broad wavelength range. Two possible types of interferometres are now considered: on-axis and multi-axis.

Methods. We derived a rather simple iterative scheme for both designs, determining the thickness and XY position of the cells. The method confers to the chessboards a high degree of internal symmetry. Each design can be described as an iterative Bracewell interferometer characterised by an integer order. We demonstrate that their efficiencies increases with the power of that order.

Results. The device acts both spatially and versus wavelengths as an optical differential operator on the 3D light distribution. Its power is best understood in the on-axis case since its effect is topush awaythe stellar light from the centre over a very broad range of wavelengths, leaving space for anout of phase object to appear in the cleaned central region. We explore the theoretical performances for on-axis and multi-axis designs in the parameter space, and we especially compute the rejection factor for starlight and the attenuation factor for planet light and introduce the relative nulling efficiency metric. We show that, even with some realistic piston error added, the performances could meet the Darwin space project specifications for both designs, i.e., cancellation of the starlight by a factor of 10⁵ over a wavelength range of 6–17 μm.

Key words: instrumentation: interferometers / techniques: high angular resolution / techniques: interferometric / space vehicles: instruments / planetary systems

© ESO, 2010

1. Introduction

A new concept of a quasi-achromatic nulling interferometer has been proposed in a preceding publication (Rouan & Pelat 2008,hereafter Paper I). The whole idea is based on cancelling the polynomial part, according to both the wavelength and spatial coordinates, of the starlight distribution at the centre of the focal plane of the interferometer. This objective was attained thanks to a particular setting of phase shifters on the interferometer pupil planes. The advantage of the proposed design depends on a rather simple arrangement, where the key device is a simple square grid of phase shifters (the so-called “chessboard”) on the pupil plane of the telescopes.

In Paper I, several possible chessboards were proposed and a co-axial (Michelson) setting of the telescopes favoured. Here we concentrate more deeply on two high-performance configurations, one designed for the co-axial case and the other for the multi-axial (Fizeau) case. In Sect. 2, we outline the principles that led to the phase distribution and spatial repartition of the chessboards’ cells. A first approximation of the system performances is also given, with particular emphasis on the principles that govern its effectiveness. In Sect. 3, detailed analytical calculations are carried out to determine the chessboards’ nulling performances with more precision and to explore their robustness to a systematic or random piston on one arm of the interferometer.

2. The chessboard design

We recall that our device consists of two square grids of phase shifters each formed of $n_0^2$ adjacent cells. Finding a high-performance chessboard setting implies closely inspecting the complex amplitude of the electrical field of the interferometer output equipped with this device. If A stands for this amplitude, the point spread function is given by PSF = | A | ². According to Fourier optics one gets $A (u,v) = Sum (\exp UV \cdot \exp PQ) sinc (\frac{πd}{n_{0} λ} u) sinc (\frac{πd}{n_{0} λ} v) .$ $\begin{equation} \label{eq1} A(u,v)=\Sum({\exp {\rm UV}}\cdot {\exp{\rm PQ}})\sinc\left(\frac{\pi d}{n_0\lambda}u\right)\sinc\left(\frac{\pi d}{n_0\lambda}v\right). \end{equation}$ (1)In this expression, u and v are the Cartesian angular coordinates of a light ray diffracted by the pupil plane, d is the size of the pupil, n₀ the number of cells on one side of the chessboards, and by definition sinc(x) = sin(x) / x. The symbolic expressions expUV and expPQ stand for two square matrices both with the “chessboard” format: n₀ × n₀. The “dot” operation · between the two matrices is the element-to-element product not the matrices product. The term Sum( ) realises the sum of all elements of the matrix under the scope of that operator.

The matrix expUV takes the positioning of the cells into account, namely the place where the phase shifters are put on the grid of the chessboards. It is given by $expUV = e^{jπudδ / n_{0} λ} \otimes e^{jπvdδ / n_{0} λ} .$ $\begin{equation} \textrm{expUV} = {\rm e}^{j\pi ud\delta/n_0\lambda}\otimes {\rm e}^{j\pi vd\delta/n_0\lambda}. \end{equation}$ (2)In this expression, δ is the displacement vector. It is a set of integers varying from −(n₀−1) to +(n₀−1) by step of 2, and the symbol ⊗ is the tensor product between the two vectors. For example, if n₀ = 4, we have δ = (−3, −1,1,3) and we get $\begin{matrix} expUV & = \\ = & [\begin{matrix} \end{matrix}] . \end{matrix}$ $\begin{eqnarray} \textrm{expUV} &=& \left[\begin{array}{c} {\rm e}^{j\frac{3\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{-\pi d}{n_0\lambda} v}\\ {\rm e}^{j\frac{-3\pi d}{n_0\lambda} v}\\ \end{array}\right] \left[\begin{array}{cccc} {\rm e}^{j\frac{-3\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{-\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{\pi d}{n_0\lambda} u}&{\rm e}^{j\frac{3\pi d}{n_0\lambda} u} \end{array}\right], \label{eq:expUV}\\ &=& \left[\begin{array}{cccc} {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u+3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u+3v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u+v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u+v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u-v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u-v)}\\ {\rm e}^{j\frac{\pi d}{n_0\lambda}(-3u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(-u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(u-3v)}&{\rm e}^{j\frac{\pi d}{n_0\lambda}(3u-3v)} \end{array}\right]. \end{eqnarray}$ The second matrix expPQ, takes the values of the phase shifters into account. Adopting the usual convention that the exponential of a matrix is the matrix of the exponentials, we have $expPQ = e^{jπP Λ} e^{- jπuD / λ} + e^{jπQ Λ} e^{+ jπuD / λ} .$ $\begin{equation} \textrm{expPQ} = {\rm e}^{j\pi P\Lambda}{\rm e}^{-j\pi uD/\lambda} + {\rm e}^{j\pi Q\Lambda}{\rm e}^{+j\pi uD/\lambda}. \label{eq:expPQ} \end{equation}$ (5)The matrices P and Q are the chessboard matrices, of size n₀ × n₀. All their elements are integers that give, in units of λ₀ / 2, the optical path difference (OPD) induced by the corresponding cell of the chessboard. For example, if P = [ 0 ] and Q = [ 1 ] , there is no OPD on one telescope but one of λ₀ / 2 on the other one, inducing a phase shift of π at λ₀. This particular setting corresponds to a Bracewell nulling interferomer (Bracewell 1978). Finally, D is the distance between the two pupils’ centres and Λ is a chromatic variable $Λ = \frac{λ_{0}}{λ} = \frac{1}{1 + \frac{Δ λ}{λ_{0}}}, Δ λ = λ - λ_{0} .$ $\begin{equation} \Lambda = \frac{\lambda_0}{\lambda} = \frac{1}{1+\frac{\Delta\lambda}{\lambda_0}}\,,\quad \Delta\lambda=\lambda-\lambda_0. \end{equation}$ (6)(The Λ variable was named Δ in Paper I.)

To obtain the best achromatism, we propose a progressive approach below where we first look for the best phase-shift distribution and second for their spatial distribution. We start our investigation with the mathematically simpler co-axial design.

2.1. The co-axial chessboards

In the co-axial configuration, the two pupils are superimposed thanks to some optical arrangement. A practical example is given in Fig. 1, where it includes an imaging lens in the last stage since any efficient detection requires concentrating the light beam on a small surface.

Fig. 1

A possible optical scheme of the nulling interferometer in co-axial configuration.

To consider this design, we set D = 0 in (5). The expPQ matrix therefore reduces to e^jπPΛ + e^jπQΛ. As a first step we do not consider the phase shifters’ location on the pupils; i.e., we set the displacement vector δ to zero making all elements of expUV equal to one. Accordingly, the Sum( ) term in (1) takes the simpler form of Sum(e^jπPΛ + e^jπQΛ), which we call the chromatic term.

2.1.1. The multigrad solution

The idea, explained in Paper I, is to cancel the chromatic term up to the Nth order of its Taylor series. For this purpose, we need the Taylor series expansion of the elements of e^jπPΛ and e^jπQΛ matrices. For example, if n_s stands for the numerical values contained in P, the Taylor expansion, relative to ε = Δλ / λ₀, is given by $e^{jπ n_{s} Λ} = \sum_{k = 0}^{\infty} \frac{1}{k!} ε^{k} e^{jπ n_{s}} p_{k} (jπ n_{s}),$ $\begin{equation} \textstyle {\rm e}^{j\pi n_s\Lambda}=\sum_{k=0}^\infty \frac{1}{k!}\varepsilon^k {\rm e}^{j\pi n_s} p_k(j\pi n_s), \end{equation}$ (7)where p_k stands for a polynomial of degree k. (Note that p_k is not a monomial because of the chain rule needed to compute the derivatives.) Of course, a similar formula exists for e^jπm_sΛ involving the values m_s contained in Q. Cancelling the chromatic term up to order N implies solving the equation $\begin{matrix} \sum_{s = 1}^{n_{0}^{2}} \sum_{k = 0}^{N} \frac{1}{k!} ε^{k} (e^{jπ n_{s}} p_{k} (jπ n_{s}) + e^{jπ m_{s}} p_{k} (jπ m_{s})) & = & 0, or \\ \sum_{k = 0}^{N} \frac{1}{k!} ε^{k} \sum_{s = 1}^{n_{0}^{2}} (e^{jπ n_{s}} p_{k} (jπ n_{s}) + e^{jπ m_{s}} p_{k} (jπ m_{s})) & = & 0. \end{matrix}$ $\begin{eqnarray} \textstyle \sum_{s=1}^{n_0^2}\sum_{k=0}^N\frac{1}{k!}\varepsilon^k \bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)&=&0,\;\textrm{or}\\ \textstyle \sum_{k=0}^N\frac{1}{k!}\varepsilon^k \sum_{s=1}^{n_0^2}\bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)&=&0. \end{eqnarray}$ This polynomial is zero for any ε if, and only if, all its coefficients are zero. Therefore we have to solve the system $\sum_{s = 1}^{n_{0}^{2}} (e^{jπ n_{s}} p_{k} (jπ n_{s}) + e^{jπ m_{s}} p_{k} (jπ m_{s})) = 0,$ $\begin{equation} \textstyle \sum_{s=1}^{n_0^2}\bigl({\rm e}^{j\pi n_s} p_k(j\pi n_s)+{\rm e}^{j\pi m_s} p_k(j\pi m_s)\bigr)=0, \end{equation}$ (10)for k = 0,...,N. If c_ℓk stands for the coefficients of the polynomials p_k, this reduces to $\sum_{ℓ = 0}^{k} c_{ℓk} (jπ)^{ℓ} \sum_{s = 1}^{n_{0}^{2}} (e^{jπ n_{s}} n_{s}^{ℓ} + e^{jπ m_{s}} m_{s}^{ℓ}) = 0.$ $\begin{equation} \textstyle \sum_{\ell=0}^k c_{\ell k} (j\pi)^\ell\sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^\ell+{\rm e}^{j\pi m_s} m_s^\ell)=0. \end{equation}$ (11)This system can be solved by induction, because if a solution is found up to order k, the solution up to order k + 1 is found if $\sum_{s = 1}^{n_{0}^{2}} (e^{jπ n_{s}} n_{s}^{k + 1} + e^{jπ m_{s}} m_{s}^{k + 1}) = 0.$ $\begin{equation} \textstyle \sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^{k+1}+{\rm e}^{j\pi m_s} m_s^{k+1})=0. \end{equation}$ (12)Therefore, the chromatic term is nullified up to order N if one solves the following system of equations $\sum_{s = 1}^{n_{0}^{2}} (e^{jπ n_{s}} n_{s}^{k} + e^{jπ m_{s}} m_{s}^{k}) = 0; k = 0, ...,N,$ $\begin{equation} \textstyle \sum_{s=1}^{n_0^2}({\rm e}^{j\pi n_s} n_s^k+{\rm e}^{j\pi m_s} m_s^k)=0\,;\quad k=0,\dots,N, \end{equation}$ (13)for the unknown integers n_s and m_s and the parameter n₀.

As in a Bracewell interferometer, we impose the constraint that e^jπn_s = +1 and e^jπm_s = −1, therefore the n_s and m_s are respectively even and odd integers. As introduced in Paper I, Appendix C, one ends up with a special set of diophantine equation called a multigrad system, namely $\sum_{s = 1}^{n_{0}^{2}} n_{s}^{k} = \sum_{s = 1}^{n_{0}^{2}} m_{s}^{k},k = 1, ...,N .$ $\begin{equation} \textstyle \sum_{s=1}^{n_0^2} n_s^k=\sum_{s=1}^{n_0^2} m_s^k\,,\; k=1,\dots,N. \end{equation}$ (14)If two sets of integers, say { n_s } and { m_s } , solve a multigrad system up to order N, they are called multigrad sets. This property is shortly written as $\{m_s\}\stackrel{N}{=}\{n_s\}$ . Hereafter, the two sets are supposed to be of equal size, for short $\{m_s\}\stackrel{0}{=}\{n_s\}$ (adopting the convention that 0⁰ = 1).

The key theorem is that if one has $\{n_s\}\stackrel{N}{=}\{m_s\}$ , then it follows that $\{n_s,m_s+c\}\stackrel{N+1}{=}\{m_s,n_s+c\}$ . This is true for any c, but is non-trivial only if c ≠ 0 and { m_s } ≠ { n_s } . In this way, two multigrad sets up to order N generate two new multigrad sets up to order N + 1. During this operation, the size of the sets is multiplied by two. But in order that the number of cells on the chessboards remains a square, the size of the sets must be multiplied by four. This is done by applying two times the key theorem. Moreover, if the sets { n_s } and { m_s } possess the desired parity, we keep it if we choose an odd constant c, say c = 1. We get the final result:

if $\{n_s\}\stackrel{N}{=}\{m_s\}$ , then

$\{n_s,m_s+1,m_s+1,n_s+2\}\stackrel{N+2}{=}\{m_s,n_s+1,n_s+1,m_s+2\}$ .

This remarkable property is at the heart of the scheme proposed below.

2.1.2. The iterative Bracewell scheme

This scheme runs as follows. One starts with a Bracewell nulling interferometer $\{0\}\stackrel{0}{=}\{-1\}$ and constructs the next one by two applications of the key theorem. In this way, one first gets $\{0,0,0,2\}\stackrel{2}{=}\{-1,1,1,1\}$ . Indeed, we have (always with 0⁰ = 1) $\begin{matrix} 0^{0} + 0^{0} + 0^{0} + 2^{0} & = & - 1^{0} + 1^{0} + 1^{0} + 1^{0}, \\ 0^{1} + 0^{1} + 0^{1} + 2^{1} & = & - 1^{1} + 1^{1} + 1^{1} + 1^{1}, \\ 0^{2} + 0^{2} + 0^{2} + 2^{2} & = & - 1^{2} + 1^{2} + 1^{2} + 1^{2} . \end{matrix}$ $\begin{eqnarray} 0^0+0^0+0^0+2^0&=&-1^0+1^0+1^0+1^0,\nonumber\\ 0^1+0^1+0^1+2^1&=&-1^1+1^1+1^1+1^1,\nonumber\\ 0^2+0^2+0^2+2^2&=&-1^2+1^2+1^2+1^2. \end{eqnarray}$ (15)As expected, this new interferometer cancels the chromatic term up to order N = 2. To construct the next one, we add c = −1 instead of c = +1 in order to gain symmetry and in an effort to keep the z-heights of the cells as low as possible. Finally, to generate the successive phase chessboards, we propose the following iterative scheme $\begin{matrix} P_{0} & = & {0}, Q_{0} = {- 1}, \\ P_{n + 1} & = & {P_{n}, Q_{n} \pm 1, Q_{n} \pm 1, P_{n} \pm 2}, \\ Q_{n + 1} & = & {Q_{n}, P_{n} \pm 1, P_{n} \pm 1, Q_{n} \pm 2}, \end{matrix}$ $\begin{eqnarray} P_0&=&\set{0},\quad Q_0=\set{-1},\nonumber\\ P_{n+1} &=&\set{P_n,Q_n\pm1,Q_n\pm1,P_n\pm2},\nonumber\\ Q_{n+1} &=&\set{Q_n,P_n\pm1,P_n\pm1,Q_n\pm2}, \end{eqnarray}$ (16)where ± means (−1)ⁿ (i.e. +1 if n is even and −1 if it is odd). With this process, the chessboards P_n and Q_n retain the shape of two squares possessing n₀ = 2ⁿ cells on each of their sides. Their combined effect is to cancel the chromatic term up to order N = 2n. The higher n, the more efficent the corresponding nulling interferometer.

Table 1

Number of cells in the interferometer (P_n,Q_n) according to the phase shift exp(jkπλ₀ / λ).

The iterative scheme presented above leads to constructing the interferometer (P_n,Q_n) possessing, for example, the phase-shift distribution shown on Table 1. We observe that the progression of the number of cells that produce the same phase shift follows that of the binomial coefficients in the polynomial expansion of (1 + x)^2n + 1. This is easily proved by two applications of the recursive formula ${2n \choose q}={2n-1 \choose q-1}+{2n-1 \choose q}$ , which leads to ${2n+1 \choose q}={2n-1\choose q-2}+2{2n-1 \choose q-1}+{2n-1 \choose q}$ , which is our iterative scheme. (In these formulæ we adopt the convention that ${n \choose q}=0$ if q ≤ 0.) Table 2 gives the phase-shift, together with the corresponding number of cells (the formulæ are valid for two consecutives orders).

Table 2

Number of cells in (P_n,Q_n) producing a given phase shift.

2.1.3. First estimation of the iterative Bracewell scheme performance

In this section we study the nulling efficiency of an interferometer built on this principle. To compare the interferometers (P_n,Q_n) in the same conditions, differing only by their order n, the light collecting surface of their mirrors must be kept constant by reducing the size of the cells to a square of surface d² / 2²ⁿ. Because at this point we have not distributed the cells on the pupil plane, we only consider A_n, the sum of the amplitudes. This leads to the formula $\begin{matrix} A_{n} & = & \frac{d^{2}}{2^{2 n}} e^{- jKπ λ_{0} / λ} \sum_{q = 0}^{2 n + 1} (\begin{matrix} 2 n + 1 \\ q \end{matrix}) e^{jqπ λ_{0} / λ}, \\ = & 2 d^{2} e^{- jKπ λ_{0} / λ} {(\frac{1 + z}{2})}^{2 n + 1}, \end{matrix}$ $\begin{eqnarray} A_n &=& \frac{d^2}{2^{2n}}{\rm e}^{-jK\pi\lambda_0/\lambda}\sum_{q=0}^{2n+1}{\textstyle {2n+1 \choose q}}\,{\rm e}^{jq\pi\lambda_0/\lambda},\\ &=&2d^2{\rm e}^{-jK\pi\lambda_0/\lambda}\left(\frac{1+z}{2}\right)^{2n+1}, \end{eqnarray}$ where we have set z = e^{jπλ₀ / λ} and $K=2\bigl\lfloor\frac{n}{2}\bigr\rfloor+1$ . The common phase factor e^{−jKπλ₀ / λ} sets the OPD reference, and the function ⌊ x ⌋ is the “floor” function (i.e. it returns the greatest integer less than or equal to x.)

If λ = λ₀, then z = −1 and A_n = 0, therefore the nulling is perfect for any n. The device is strictly equivalent to a Bracewell nulling interferometer. Now, if λ ≠ λ₀, since the module of the complex (1 + z) / 2 is less than one, A_n → 0 as n, the order of the interferometers, increases. This is true for all λ except for the values such that z = 1, i.e. λ = ∞,λ₀ / 2,λ₀ / 4,...,λ₀ / 2k,... Aside from the actual distribution of cells on the pupils, we have obtained a quasi-achromatic nulling interferometer.

However, one must judge the perfomances of the device relative to some benchmark. We have demonstrated above that, for an on-axis object, our device has powerful nulling efficiency on a given set of wavelengths. This is of course different for any off-axis object such as a planet which is best detected when its position induces an OPD = Dθ_planet = λ₀ / 2. In that situation, the recombination, at λ₀, is fully constructive for the planet. Therefore, the interferometer with a phase shift of π at λ₀ added to one of its pupils will serve as a reference. In so doing, we transform the nulling interferometer into a non-destructive one for the wavelengths set that was just mentioned above. The phase addition on all the, say, odd phase shifters changes the sign of the odd, q = 2p + 1, binomial coefficients in the expression of A_n. If $A'_n$ stands for the function obtained in this way, one gets $A_{n}^{'} = 2 d^{2} e^{- jKπ λ_{0} / λ} {(\frac{1 - z}{2})}^{2 n + 1} \cdot$ $\begin{equation} A'_n=2d^2{\rm e}^{-jK\pi\lambda_0/\lambda}\left(\frac{1-z}{2}\right)^{2n+1}\cdot \end{equation}$ (19)The only change between the expressions of A_n and $A'_n$ is the sign of z. If λ = λ₀ / 2, then z = 1 and $A'_n=0$ , we also have a nulling interferometer. Again, if λ ≠ λ₀ / 2, since the modulus of the complex (1 − z) / 2 is also less than one, $A'_n\rightarrow 0$ as n increases. This is true for all λ except for the values that make z = −1; that is, λ = λ₀,λ₀ / 3,λ₀ / 5,...,λ₀ / (2k + 1),... The result is an associated “out of phase” nulling interferometer but, fortunately, less efficient than the original one. The off-axis planet will indeed suffer some attenuation but much less than the on-axis star, as the ratio of A_n to $A'_n$ demonstrates.

We define ρ_n as $A_n/A'_n$ , and name it the relative nulling efficiency. One gets $ρ_{n} (z) = {(\frac{1 + z}{1 - z})}^{2 n + 1} = ρ_{0}^{2 n + 1} (z) .$ $\begin{equation} \rho_n(z) = \left(\frac{1+z}{1-z}\right)^{2n+1}=\rho_0^{2n+1}(z). \end{equation}$ (20)If | ρ₀ | is less than one and n large enough, the on-axis nulling can be made more efficient than the off-axis one by any factor. Note that ρ₀(z) is the relative nulling efficiency of the original Bracewell interferometer.

A deeper understanding of how the iterative Bracewell interferometers works is obtained through careful analysis of the transformation induced by ρ₀ on the complex plane C of the Fresnel representation. In fact, under ρ₀, the unit circle is transformed into the imaginary axis¹. One gets $ρ_{0} (z) = j \cot (\frac{1}{2} \arg (z)), \arg (z) = π \frac{λ_{0}}{λ} \cdot$ $\begin{equation} \rho_0(z) = j\cot\bigl(\frac{1}{2}\arg(z)\bigr)\,,\quad\arg(z)=\pi\frac{\lambda_0}{\lambda}\cdot \end{equation}$ (21)This means that ρ₀(z), and the Bracewell relative nulling efficency that follows is obtained by a stereographic projection of z toward the real 1 (see Fig. 2). The image of the half-circles $\frac{\pi}{2}<\theta\pmod{2\pi}<\frac{3\pi}{2}$ is the segment (−j,j) shown in green in the figure. All points in this segment have a modulus less than one, therefore | ρ_n(z) | can be made as small as desired, provided one chooses an interferometer of sufficiently high-order n. This defines many wavelengths orders, the first one being: $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$ . Within these bandwiths, an “in phase” object, say a star, is strongly attenuated while an “out of phase” one, say a planet, is much less affected. This property opens interesting perspectives for broadband spectroscopy, or detection, of exoplanets, all the more so for intensity, since the nulling behaves as | ρ₀ | ^4n + 2.

From the dispersion of the optical paths difference ΔOPD produced by the chessboards, it is possible to obtain a very good approximation of the full width at half maximum (FWHM) of the planet attenuation spectrum within the full bandwidths, e.g. $(\frac{2}{3}\lambda_0,2\lambda_0)$ . Because the OPDs distribution is binomial, one gets $\sigma(\textrm{OPD})=\sqrt{2n+1}\lambda_0/4$ or approximately $\Delta \textrm{OPD}\propto \sqrt{n}\lambda_0$ . Now, ΔOPD = cΔt and $\Delta E=h\Delta\nu=hc\Delta\lambda/\lambda^2\approx hc\Delta\lambda/\lambda_0^2$ . By applying the Heisenberg uncertainty principle, we have ΔtΔE ≥ h / 4π, from which we get $\frac{Δ λ}{λ_{0}} \propto \frac{1}{\sqrt{n}} \cdot$ $\begin{equation} \frac{\Delta \lambda}{\lambda_0}\propto \frac{1}{\sqrt{n}}\cdot \end{equation}$ (22)The constant of proportionality should not be very different from one, see Fig. 7 for an example where it is approximately equal to $1/\!\sqrt{2}$ . While the planet-to-star ratio greatly increases with the order n of the chessboards, the effective bandwidth reduction in $\sqrt{n}$ implies longer exposure times to get the full range of wavelengths.

Fig. 2

Graphic determination of the relative nulling efficiency ρ₀ of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = e^{πλ₀ / λ} toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (P_n,Q_n) is $\rho_0^{2n+1}$ . This quantity decreases to 0, as n increases where | ρ₀ | <1. That is when ρ₀(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ₀ relative to its star may be detected in many bandwidths whose first order is $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$ .

Finally, the proposed scheme generates, from one order to the next, a fourfold increase in the number of cells. In Appendix A, we investigate to what extent so many cells are necessary.

2.1.4. Placing the cells on the chessboards

To actually build the interferometer, it is necessary to distribute the chessboards cells on the pupil plane, an operation that leads to a specific PSF on the image plane. The previous performance analysis is valid for a perfectly parallel light beam, therefore, for any distribution of cells, it is only relevant for the central pixel of the image. For the other pixels, it is necessary to ensure that the cells distribution generates a PSF that does not ruin the mighty nulling power of the chessboards as derived above. This can be done by extending the same idea to the spatial dimension of the amplitude A, as given by Eq. (1).

As mentioned in Paper I, we gain insight into this problem by performing a Taylor expansion of the expUV matrix to get $expUV = \sum_{k = 0}^{\infty} \frac{1}{k!} (j \frac{πd}{n_{0} λ})^{k} [UV]^{k} .$ $\begin{equation} \textrm{expUV} = \sum_{k=0}^\infty \frac{1}{k!}\Bigl(j\frac{\pi d}{n_0\lambda}\Bigr)^k [\textrm{UV}]^k. \label{eq:a6} \end{equation}$ (23)In this expression, the symbol [UV] stands for a positioning matrix of size n₀ × n₀. For example, in the case n₀ = 4, we have $[UV] = [\begin{matrix} - 3 u + 3 v & - u + 3 v & u + 3 v & 3 u + 3 v \\ - 3 u + v & - u + v & u + v & 3 u + v \\ - 3 u - v & - u - v & u - v & 3 u - v \\ - 3 u - 3 v & - u - 3 v & u - 3 v & 3 u - 3 v \end{matrix}] .$ $\begin{equation} [\textrm{UV}] = \left[\begin{array}{cccc} -3u+3v & -u+3v & u+3v & 3u+3v\\ -3u+v & -u+v & u+v & 3u+v\\ -3u-v & -u-v & u-v & 3u-v\\ -3u-3v & -u-3v & u-3v & 3u-3v\\ \end{array}\right]. \end{equation}$ (24)By convention the elements of [ UV ] ^k are the elements of [UV] raised to the power k. Following the above example, we get $[UV]^{k} = [\begin{matrix} (- 3 u + 3 v)^{k} & (- u + 3 v)^{k} & (u + 3 v)^{k} & (3 u + 3 v)^{k} \\ (- 3 u + v)^{k} & (- u + v)^{k} & (u + v)^{k} & (3 u + v)^{k} \\ (- 3 u - v)^{k} & (- u - v)^{k} & (u - v)^{k} & (3 u - v)^{k} \\ (- 3 u - 3 v)^{k} & (- u - 3 v)^{k} & (u - 3 v)^{k} & (3 u - 3 v)^{k} \end{matrix}] .$ $\begin{equation} [\textrm{UV}]^k = \left[\begin{array}{cccc} (-3u+3v)^k & (-u+3v)^k & (u+3v)^k & (3u+3v)^k\\ (-3u+v)^k & (-u+v)^k & (u+v)^k & (3u+v)^k\\ (-3u-v)^k & (-u-v)^k & (u-v)^k & (3u-v)^k\\ (-3u-3v)^k & (-u-3v)^k & (u-3v)^k & (3u-3v)^k\\ \end{array}\right]. \label{eq:UVk} \end{equation}$ (25)Again, the key observation is to recognise that the elements of [UV]^k are the sampled values of the 2-D polynomial (x + y)^k. Therefore, if we were able to arrange the cells in such a way that the expPQ matrix would turn out to be the coefficients of a finite difference 2-D differential operator of order k + 1, the expression expPQ· [ UV ] ^k would be zero. Indeed all terms of degree r less than or equal to k in (23) would also be cancelled, since expPQ· [ UV ] ^r would be the finite difference derivative of order k + 1 of a 2-D polynomials of degree less than or equal to k.

It seems, however, impossible to attain this objective exactly, but it is possible to approach it fairly closely. The first-order approximation, according to ε = Δλ / λ₀, of expPQ is $expPQ \approx e^{jπP} (1 - εjπP) + e^{jπQ} (1 - εjπQ) .$ $\begin{equation} \textrm{expPQ}\approx {\rm e}^{j\pi P}(1-\varepsilon j\pi P)+{\rm e}^{j\pi Q}(1-\varepsilon j\pi Q). \end{equation}$ (26)In the case where P is even and Q odd, this formula reduces to expPQ ≈ −εjπ(P − Q). It is not too difficult to find an arrangement of cells on the pupil plane that makes P − Q equal to a matrix whose elements are those of a finite difference differential operator (fiddop). We found that it is achievable by the following iterative scheme $\begin{matrix} P_{0} = [0], & Q_{0} = [- 1], \\ P_{n + 1} = [\begin{matrix} \end{matrix}], & Q_{n + 1} = [\begin{matrix} \end{matrix}] . \end{matrix}$ $\begin{eqnarray} P_0=[0],&& Q_0=[-1],\nonumber\\ P_{n+1}= \left[\begin{array}{cc} Q_n\pm 1 & P_n\pm 2\\ P_n & Q_n\pm 1 \end{array}\right], && Q_{n+1}= \left[\begin{array}{cc} P_n\pm 1 & Q_n\pm 2\\ Q_n & P_n\pm 1 \end{array}\right]. \end{eqnarray}$ (27)In this expression ± again means (−1)ⁿ. (This scheme is the same as the iterative Bracewell one discussed above. Also $P_n\pm1=-Q_n^{\rm T}$ , $Q_n\pm1=-P_n^{\rm T}$ , where e.g. $P_n^{\rm T}$ stands for the transposition of P_n.) It follows, by induction, that P_n − Q_n is, as desired, a fiddop of order n. To start the induction, we have $P_{1} - Q_{1} = [\begin{matrix} 0 & 2 \\ 0 & 0 \end{matrix}] - [\begin{matrix} 1 & 1 \\ -1 & 1 \end{matrix}] = [\begin{matrix} -1 & + 1 \\ + 1 & -1 \end{matrix}] .$ $\begin{equation} P_1-Q_1 = \left[\begin{array}{cc} 0&2\\0&0 \end{array}\right] - \left[\begin{array}{rr} \;\;1&1\\-1&1 \end{array}\right] = \left[\begin{array}{rr} -1&+1\\+1&-1 \end{array}\right]. \end{equation}$ (28)It is indeed a gradient operator that, as such, will cancel the first-order (constant) term in the Taylor expansion of expUV. More generally we have $\begin{matrix} P_{n + 1} - Q_{n + 1} & = & [\begin{matrix} \end{matrix}] - [\begin{matrix} \end{matrix}] \\ = & [\begin{matrix} \end{matrix}] . \end{matrix}$ $\begin{eqnarray} P_{n+1}-Q_{n+1} &=& \left[\begin{array}{cc} Q_n\pm 1 & P_n\pm 2\\ P_n & Q_n\pm 1 \end{array}\right] - \left[\begin{array}{cc} P_n\pm 1 & Q_n\pm 2\\ Q_n & P_n\pm 1 \end{array}\right]\\ &=& \left[\begin{array}{cc} Q_n-P_n&P_n-Q_n\\P_n-Q_n&Q_n-P_n \end{array}\right]. \end{eqnarray}$ Therefore, P_n + 1 − Q_n + 1 is the gradient of P_n − Q_n and, since the later is a fiddop of order n, then P_n + 1 − Q_n + 1 is a fiddop of order n + 1. It is remarkable that all lines and columns of P_n − Q_n are a Prouhet-Thue-Morse sequence of −1 and +1 of length 2ⁿ, as is clear from the way the P_n − Q_n were constructed.

We recall that the Prouhet-Thue-Morse sequence P_n is a string of length 2ⁿ whose kth element is given by P_n(k) = (−1)^s(k) for 0 ≤ k < 2ⁿ, where s(k) is the sum of the binary digits of k; e.g. P₃ = +1, −1, −1, +1, −1, +1, +1, −1. This sequence is also constructed iteratively by P₀ = +1 and P_n = P_n−1 | | −P_n−1, where | | is the concatenation operator (Prouhet 1851; Allouche & Mendès-France 2008).

2.2. The multi-axial chessboards

For this design, we set D ≠ 0 in (5) (see Fig. 3 for a practical realisation).

Fig. 3

A possible optical scheme of the nulling interferometer in multi-axial configuration.

It is convenient to write the expPQ matrix as

\begin{matrix} expPQ & = & \cos (πuD) (e^{jπP Λ} + e^{jπQ Λ}) \\ + j \sin (πuD) (- e^{jπP Λ} + e^{jπQ Λ}) . \end{matrix}

$\begin{eqnarray} \textrm{expPQ}&=& \cos(\pi u D)({\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda})\nonumber\\ &&+ j\sin(\pi u D)(-{\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda}). \end{eqnarray}$ (31)From this we get

\begin{matrix} Sum (expPQ \cdot expUV) = \\ \cos (πuD) & Sum ((e^{jπP Λ} + e^{jπQ Λ}) \cdot expUV) \\ - j \sin (πuD) & Sum ((e^{jπP Λ} - e^{jπQ Λ}) \cdot expUV) & . \end{matrix}

$\begin{eqnarray} \Sum(\textrm{expPQ}\cdot\textrm{expUV})=&&\nonumber\\ \cos(\pi uD)&\!\Sum\bigl(({\rm e}^{j\pi P\Lambda}+{\rm e}^{j\pi Q\Lambda})\cdot\textrm{expUV}\bigr)&\nonumber\\ -j\sin(\pi uD)&\!\Sum\bigl(({\rm e}^{j\pi P\Lambda}-{\rm e}^{j\pi Q\Lambda})\cdot\textrm{expUV}\bigr)\!\!&\!. \end{eqnarray}$ (32)In the first term of the right-hand side member, we recognise the co-axial chromatic term defined above. We know how to cancel it, up to a given polynomial order. One way to cancel the second term is to generate an electrical field amplitude A, which is antisymmetric with respect to u (direction of the interferometer base), and to perform a sum of the electric field prior to its quadratic detection. Such a sum can be achieved in an elegant and simple way thanks to a single-mode fiber optics. The efficiency of this cancelling device has actually been demonstrated (e.g., Wallner et al. 2004; Buisset et al. 2006).

To obtain A(−u,v) = A(u,v), and because the sin(πuD) function is odd, it is sufficient to make sure that Sum((e^jπPΛ − e^jπQΛ)·expUV(u,v)) is even. It is clear from Eq. (3) that expUV(−u,v) is equal to expUV(u,v) after a symmetry around the pupil’s y-axis. Therefore, if P and Q were left invariant following the same symmetry, our goal would be attained. Moreover, since the Michelson chromatic term must also be made as low as possible, one must use the co-axial chessboards, one way or the other. One solution is to put the co-axial chessboards, shrunken by a factor of two in the upper right corner of the multi-axial chessboards, for example, and to fill the empty cells by means of successive x and y symmetries. This setting already gives good results, but one can gain more nulling power if one requires that A(−u, −v) = A(u,v).

Fig. 4

The $(P^*_3,Q^*_3)$ chessboards. The P₂, Q₂ chessboards from which they were built are in the top right corners. The 2kπ or (2k + 1)π phase shifts at λ₀ are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown.

To achieve this objective, one starts with the co-axial chessboard shrunk by a factor of two, and one again put it in the upper right-hand corner of the multi-axial chessboard under construction. We fill the lower right-hand corner by a x-axis symmetry with a change of sign and finally we fill the remaining left-hand side half plane by a y-axis symmetry, again with a change in sign. We symbolically note ‘unfold’ this procedure. If P_n−1 and Q_n−1 stands for two uni-axial chessboards, we construct the corresponding multi-axial chessboards $P_n^*$ and $Q_n^*$ by way of the operations $P_{n}^{*} = unfold (P_{n - 1}), Q_{n}^{*} = unfold (Q_{n - 1}) .$ $\begin{equation} P^*_n=\textrm{unfold}(P_{n-1})\,,\quad Q^*_n=\textrm{unfold}(Q_{n-1}). \end{equation}$ (33)( $P^*_n$ and $Q^*_n$ have the same number of cells as P_n and Q_n). We have, for example, $P_{1}^{*} = [\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}], Q_{1}^{*} = [\begin{matrix} + 1 & -1 \\ -1 & + 1 \end{matrix}],$ $\begin{equation} P_1^* =\left[\begin{array}{cc}0&0\\0&0\end{array}\right],\quad Q_1^* =\left[\begin{array}{rr}+1&-1\\-1&+1\end{array}\right], \end{equation}$ (34) $P_{2}^{*} = [\begin{matrix} -2 & 0 & 0 & + 2 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ + 2 & 0 & 0 & -2 \end{matrix}], Q_{2}^{*} = [\begin{matrix} -1 & -1 & + 1 & + 1 \\ -1 & + 1 & -1 & + 1 \\ + 1 & -1 & + 1 & -1 \\ + 1 & + 1 & -1 & -1 \end{matrix}] .$ $\begin{equation} P_2^* =\left[\begin{array}{cccc}-2&0&0&+2\\0&0&0&0\\0&0&0&0\\+2&0&0&-2\end{array}\right],\quad Q_2^* =\left[\begin{array}{rrrr}-1&-1&+1&+1\\-1&+1&-1&+1\\+1&-1&+1&-1\\+1&+1&-1&-1\end{array}\right]. \end{equation}$ (35)The “unfold” procedure offers many advantages.

1.
Since the combination (P_n−1,Q_n−1) cancels the chromatic term up to order N = 2(n − 1), the is expected to cancel it, at least up to the same order N = 2n − 2. In fact, the change in sign during the unfolding process implements the following property: if $\{P_n\}\stackrel{2n}{=}\{Q_n\}$ , then $\{P_n,-P_n\}\stackrel{2n+1}{=}\{Q_n,-Q_n\}$ . Therefore the pair cancels the chromatic term up to order N = 2n − 1.
2.
We expect to be, as P_n−1 − Q_n−1, a fiddop of, at least, order n − 1. This is true if n is even, but thanks to the symmetries of the Prouhet-Thue-Morse sequence, this order is increased to n if n is odd.
3.
Finally, the symmetry of the multi-axial chessboards is such that it cancels the [ UV ] ^k matrices when k is odd (see Eq. (25)).

Because of these supplementary properties, the chessboards $(P^*_n,Q^*_n)$ designed for the multi-axial case are expected to be just slightly less successful than their co-axial (P_n,Q_n) counterparts.

Table 3

The nulling power of the chessboards considered in this paper.

Table 3 is a summary of the nulling properties of the chessboards considered in this paper.

3. Performance of the design

A first evaluation of the performances of the chessboards have been established in the preceding section; however, it is possible to obtain far more detailed information by working out the analytical expression of the electrical field amplitude. This is made possible because relatively simple operations allow passing, almost mechanically, from one chessboard of order n to the next one. These are, in the following order,

start with a Bracewell setting;
shrink the chessboards by a factor of two;
shift it to its proper place;
add a phase; and
perform symmetries with a change in sign.

All these operations, when performed on the pupil plane, have simple Fourier equivalents on the image plane. Therefore, a computation of the amplitude function can be considered. A straightforward, though lengthy, analytical work leads to the formulæ $A_{n} = [P_{n} e^{- j 2 θ_{u} \frac{D}{d}} + Q_{n} e^{j 2 θ_{u} \frac{D}{d}}] sinc (12^{n - 1} θ_{u}) sinc (12^{n - 1} θ_{v}),$ $\begin{equation} A_n = \left[P_n{\rm e}^{-j2\theta_u \frac{D}{d}} + Q_n{\rm e}^{j2\theta_u \frac{D}{d}}\right] \sinc\left(\frac{1}{2^{n-1}}\theta_u\right) \sinc\left(\frac{1}{2^{n-1}}\theta_v\right), \label{eq:champ} \end{equation}$ (36) $P_{n} = \frac{1}{2} (S_{n} + D_{n}), Q_{n} = \frac{1}{2} (S_{n} - D_{n}),$ $\begin{equation} P_n=\thalf\left(S_n+D_n\right),\quad Q_n=\thalf\left(S_n-D_n\right), \end{equation}$ (37)where P_n and Q_n are, in this context, two functions on the image plane. They are not the chessboard matrices. The expressions for S_n and D_n differ according to the interferometer setting. In the co-axial case, we set D = 0 and obtain $S_{n} = S_{n}^{'}, D_{n} = D_{n}^{'},$ $\begin{equation} S_n= S'_n,\quad D_n = D'_n, \end{equation}$ (38)while in the multi-axial one, D ≠ 0 and we find $\begin{matrix} S_{n} & = & S_{n}^{′′} (θ_{u}, θ_{v}, - φ^{*}) + S_{n}^{′′} (- θ_{u}, - θ_{v}, - φ^{*}) \\ + S_{n}^{′′} (- θ_{u}, θ_{v}, φ^{*}) + S_{n}^{′′} (θ_{u}, - θ_{v}, φ^{*}), \end{matrix}$ $\begin{eqnarray} S_n &=& S''_n(\theta_u,\theta_v,-\phi^*)+S''_n(-\theta_u,-\theta_v,-\phi^*)\nonumber\\ &&+S''_n(-\theta_u,\theta_v,\phi^*)+S''_n(\theta_u,-\theta_v,\phi^*), \end{eqnarray}$ (39) $\begin{matrix} D_{n} & = & D_{n}^{′′} (θ_{u}, θ_{v}, - φ^{*}) + D_{n}^{′′} (- θ_{u}, - θ_{v}, - φ^{*}) \\ + D_{n}^{′′} (- θ_{u}, θ_{v}, φ^{*}) + D_{n}^{′′} (θ_{u}, - θ_{v}, φ^{*}), \end{matrix}$ $\begin{eqnarray} D_n&=& D''_n(\theta_u,\theta_v,-\phi^*)+D''_n(-\theta_u,-\theta_v,-\phi^*)\nonumber\\ &&+ D''_n(-\theta_u,\theta_v,\phi^*)+D''_n(\theta_u,-\theta_v,\phi^*), \end{eqnarray}$ (40) $\begin{matrix} S_{n}^{′′} (θ_{u}, θ_{v}, φ^{*}) & = & \frac{1}{4} e^{j (θ_{u} + θ_{v})} S_{n - 1}^{'} (\frac{1}{2} θ_{u}, \frac{1}{2} θ_{v}, φ^{*}), \\ D_{n}^{′′} (θ_{u}, θ_{v}, φ^{*}) & = & \frac{1}{4} e^{j (θ_{u} + θ_{v})} D_{n - 1}^{'} (\frac{1}{2} θ_{u}, \frac{1}{2} θ_{v}, φ^{*}) . \end{matrix}$ $\begin{eqnarray} S''_n(\theta_u,\theta_v,\phi^*) &=& \frac{1}{4}{\rm e}^{j(\theta_u+\theta_v)}S'_{n-1}\left(\frac{1}{2}\theta_u,\frac{1}{2}\theta_v,\phi^*\right),\\ D''_n(\theta_u,\theta_v,\phi^*) &=& \frac{1}{4}{\rm e}^{j(\theta_u+\theta_v)}D'_{n-1}\left(\frac{1}{2}\theta_u,\frac{1}{2}\theta_v,\phi^*\right). \end{eqnarray}$ The $S'_n$ and $D'_n$ functions reflect the self similarity of the iterative Bracewell scheme and may be called the “nulling functions”: $\begin{matrix} S_{n}^{'} (θ_{u}, θ_{v}, φ^{*}) & = & (1 + e^{j 2 φ^{*}}) 􏽑_{ℓ = 0}^{n - 1} \cos (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{u}) \\ \times \cos (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{v}), \\ D_{n}^{'} (θ_{u}, θ_{v}, φ^{*}) & = & (1 - e^{j 2 φ^{*}}) 􏽑_{ℓ = 0}^{n - 1} \sin (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{u}) \\ \times \sin (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{v}) . \end{matrix}$ $\begin{eqnarray} S'_n(\theta_u,\theta_v,\phi^*)&=&\left(1+{\rm e}^{j2\phi^*}\right)\tprod_{\ell=0}^{n-1}\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\nonumber\\ &&\times\cos\textstyle\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right),\\ D'_n(\theta_u,\theta_v,\phi^*)&=&\left(1-{\rm e}^{j2\phi^*}\right)\tprod_{\ell=0}^{n-1}\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\nonumber\\ &&\textstyle\times\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right). \end{eqnarray}$ For a given λ, the θ_u,θ_v are the spatial variables and φ^∗ is the chromatic variable (because it depends on λ₀). We then obtain $θ_{u} = \frac{π}{2} \frac{d}{λ} u, θ_{v} = \frac{π}{2} \frac{d}{λ} v,$ $\begin{equation} \theta_u=\frac{\pi}{2}\frac{d}{\lambda}u\,,\quad \theta_v=\frac{\pi}{2}\frac{d}{\lambda}v, \end{equation}$ (45) $φ^{*} = {\begin{matrix} - φ & if n iseven, \\ + φ & if n isodd . \end{matrix} φ = \frac{π}{2} \frac{λ_{0}}{λ} \cdot$ $\begin{equation} \phi^* = \left\{\begin{array}{ll} -\phi & \textrm{if } n \textrm{ is even,}\\ +\phi & \textrm{if } n \textrm{ is odd.} \end{array}\right. \quad \phi=\frac{\pi}{2}\frac{\lambda_0}{\lambda}\cdot \end{equation}$ (46)One recalls that: D is the separation between the two telescopes, d the mirror size, λ the wavelength considered, λ₀ the wavelength for which the chessboards are tuned and u,v are the Cartesian angular coordinates of the light diffracted by the pupils. Figure 5 shows the theoretical image of a planetary system similar to ours seen with the $(P^*_7,Q^*_7)$ device. The image was calculated using these formulæ.

Fig. 5

Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multi-axial $(P_n^*,Q_n^*)$ design of order n = 7. The system consists of a star and a 10⁶ fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I^0.25 filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ₀.

To account for a possible path difference between the two arms of the interferometer, we introduce a systematic piston on one arm. If ϵ is the piston value, instead of (36), we have $\begin{matrix} A_{n} & = & [P_{n} e^{- j 2 θ_{u} \frac{D}{d}} + Q_{n} e^{j (2 θ_{u} \frac{D}{d} + 2 φ_{ϵ})}] sinc (\frac{1}{2^{n - 1}} θ_{u}) \end{matrix}$ $\begin{eqnarray} A_n &=& \left[P_n{\rm e}^{-j2\theta_u \frac{D}{d}} + Q_n{\rm e}^{j(2\theta_u \frac{D}{d} + 2\phi_\epsilon)}\right] \sinc\left(\frac{1}{2^{n-1}}\theta_u\right)\nonumber\\ &&\times\, \sinc\left(\frac{1}{2^{n-1}}\theta_v\right), \label{eq:piston} \end{eqnarray}$ (47) $with φ_{ϵ} = π \frac{ϵ}{λ} \cdot$ $\begin{equation} \textrm{with}\;\phi_\epsilon =\pi\frac{\epsilon}{\lambda}\cdot \end{equation}$ (48)

3.1. Extinction of the star, attenuation of the planet, and planet-to-starlight ratio

Let us define $R_n^*$ as the residual stellar light intensity. We have $R_n^*=|A_n|^2$ . We consider this function successively for the co-axial and multi-axial cases below.

3.1.1. Co-axial case

Residual light from the star.

One finds, in the co-axial case $\begin{matrix} R_{n}^{*} (u,v,λ; λ_{0},ϵ) = \\ (\cos φ_{ϵ} \cos φ^{*} & 􏽑_{ℓ = 0}^{n - 1} \cos (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{u}) \cos (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{v}) \\ - \sin φ_{ϵ} \sin φ^{*} & 􏽑_{ℓ = 0}^{n - 1} \sin (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{u}) \sin (φ^{*} + \frac{1}{(- 2)^{ℓ}} θ_{v}))^{2} \\ \times {sinc}^{2} (\frac{1}{2^{n - 1}} θ_{u}) {sinc}^{2} (\frac{1}{2^{n - 1}} θ_{v}) . \end{matrix}$ $\begin{eqnarray} R_n^*(u,v,\lambda;\lambda_0,\epsilon)=\nonumber\\ \Bigl (\cos\phi_\epsilon\cos\phi^*&\!\!\tprod_{\ell=0}^{n-1}\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\cos\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right)\nonumber\\ -\sin\phi_\epsilon\sin\phi^*&\!\!\tprod_{\ell=0}^{n-1}\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_u\right)\sin\left(\phi^*\!+\!\frac{1}{(-2)^\ell}\theta_v\right)\Bigr)^2\nonumber\\ &\times\,\sinc^2\bigl(\frac{1}{2^{n-1}}\theta_u\bigr)\sinc^2\bigl(\frac{1}{2^{n-1}}\theta_v\bigr). \label{eq:starco} \end{eqnarray}$ (49)The starlight extinction spectrum shown in Fig. 6 is defined as the integration $\int_{\rm Airy} R_n^*(u,v,\lambda;\lambda_0,0)\,{\rm d}u{\rm d}v$ . Of course, as expected, the nulling effect may reach considerable values over a large bandwidth. For example, an extinction factor of 10⁵ can be easily achieved over the bandwidth 6.2–17.5 μm by an interferometer of order n = 7 (orange curve) tuned at λ₀ = 9.05μm.

Fig. 6

Extinction spectrum of a star by iterative Bracewell nulling uni-axial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The x-axis is the wavelength, either Δλ / λ₀ (bottom) or λ in μm for λ₀ = 9.05 μm (top). The $R_n^*$ integration has been performed over an Airy square of size 2λ₀ / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The y-axis is the log of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 10⁵.

Fig. 7

Attenuation spectrum of a planet following the same setting, except for the linear y-axis, as in Fig. 6. Near the wavelength λ₀, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by $\textit{FWHM}=1/\sqrt{2n}$ , where n is the interferometer order.

To understand more deeply how the nulling works, one considers $R_n^*$ at the centre of the image plane (i.e. θ_u = 0 and θ_v = 0), leading to $R_{n}^{*} (λ; λ_{0},ϵ) = {(\cos φ_{ϵ} \cos^{2 n + 1} φ^{*} - \sin φ_{ϵ} \sin^{2 n + 1} φ^{*})}^{2} .$ $\begin{equation} R_n^*(\lambda;\lambda_0,\epsilon)=\left(\cos\phi_\epsilon\cos^{2n+1}\phi^*-\sin\phi_\epsilon\sin^{2n+1}\phi^*\right)^2. \end{equation}$ (50)Finally, we suppose a perfect design: ϵ = 0. One ends up with the chessboard’s extinction capability characterised by $R_{n}^{*} (λ; λ_{0}) = \cos^{4 n + 2} (\frac{π}{2} \frac{λ_{0}}{λ}) \cdot$ $\begin{equation} R_n^*(\lambda;\lambda_0)=\cos^{4n+2}\left(\frac{\pi}{2}\frac{\lambda_0}{\lambda}\right)\cdot \end{equation}$ (51)As n goes to infinity, the starlight residual goes to zero at all wavelengths, except those where the cosine is equal to ± 1; that is, λ = ∞,λ₀ / 2,λ₀ / 4,...,λ₀ / 2k,...

Light from the planet.

Since it is off-axis, the planet wavefront is tilted relative to the pupil plane. As in Paper I, we assume that the planet was observed when this tilt induced a phase shift of π at λ₀ between the two arms of the interferometer. If, on the pupil plane, we neglect the effect of the tilt, the residual planet intensity is given by $R_n^\oplus=r(\lambda)R_n^*(\phi'_\epsilon)$ , where $\phi'_\epsilon=\phi_\epsilon+\pi$ . In this expression r(λ) stands for the planet/star intensity ratio at λ, typically r ≈ 10^-6 in the thermal infrared. At the image centre, we have $R_{n}^{\oplus} (λ; λ_{0},ϵ) = r (λ) {(\sin φ_{ϵ} \cos^{2 n + 1} φ^{*} - \cos φ_{ϵ} \sin^{2 n + 1} φ^{*})}^{2},$ $\begin{equation} R_n^\oplus(\lambda;\lambda_0,\epsilon)=r(\lambda)\left(\sin\phi_\epsilon\cos^{2n+1}\phi^*-\cos\phi_\epsilon\sin^{2n+1}\phi^*\right)^2, \end{equation}$ (52)and, in the absence of piston, $R_{n}^{\oplus} (λ; λ_{0}) = r (λ) \sin^{4 n + 2} (\frac{π}{2} \frac{λ_{0}}{λ}) \cdot$ $\begin{equation} R_n^\oplus(\lambda;\lambda_0)=r(\lambda)\sin^{4n+2}\Bigl(\frac{\pi}{2}\frac{\lambda_0}{\lambda}\Bigr)\cdot \end{equation}$ (53)As n goes to infinity, the planet light also goes to zero at all wavelengths, except those where the sine is equal to ± 1, that is: λ = λ₀,λ₀ / 3,...,λ₀ / (2k + 1),... Figure 7 shows the attenuation spectrum of the planet.

Planet-to-starlight ratio.

To successfully detect and to do the spectroscopy of an exoplanet, one must reduce all sources of noise. For the lower levels n of our device, the starlight residuals play a predominant role over other sources of noise (such as the exozodiacal light for example). One must therefore consider the ratio $R_n^\oplus/R_n^*$ , which gives the planet residual intensity relatives to the star residual intensity. This quantity may be called the planet-to-starlight ratio and denoted r_n. Typically, if r_n ≥ 1, the source of noise from the star ceases to be dominant and the planet is likely to be detected. We obtain $r_{n} (λ; λ_{0},ϵ) = r (λ) {(\frac{\tan φ_{ϵ} \pm \tan^{2 n + 1} φ}{1 \pm \tan φ_{ϵ} \tan^{2 n + 1} φ})}^{2},$ $\begin{equation} r_n(\lambda;\lambda_0,\epsilon) = r(\lambda)\left(\frac{\tan\phi_\epsilon\pm\tan^{2n+1}\phi}{1\pm\tan\phi_\epsilon\tan^{2n+1}\phi}\right)^2\,, \end{equation}$ (54)where again ± means + if n is even and − if it is odd. If n → ∞, and to the extent that | tanφ | >1, then r_n → rcot²φ_ϵ. This expression reduces to r_n ≈ r(λ)(λ / πϵ)² for low piston value. For example, if r = 10^-6, λ = 10μm, and ϵ = 3nm, we get r_n ≈ 1. That means that a planet 10⁶ times fainter than its star is likely to be detected at 10 μm even with a piston of 3nm. If the piston is increased by a factor of 3, then the planet-to-starlight ratio diminishes by approximately a factor of 10. Figure 8 shows the planet-to-starlight ratio for a zero piston.

This discussion is, for the central pixel and a zero piston, in accordance with the previous analysis in 2.1.3. In particular, we have r / r_n = | ρ_n | ².

Fig. 8

Planet-to-starlight ratio spectrum of a planet 10⁶ times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective.

The planet-to-starlight ratio bandwidth.

The condition | tanφ | >1 defines the planet-to-starlight ratio bandwidth or, for short, the contrast bandwidth. If ϵ = 0, it defines a first order where r_n goes to infinity like tan^4n + 2φ as n (the interferometer order) increases. The contrast bandwidth of the first order is π / 4 ≤ φ ≤ 3π / 4; which is, $\frac{2}{3}\lambda_0\le\lambda\le 2\lambda_0$ . For example, if λ₀ = 9μm, one gets the bandwidth 6μm ≤ λ ≤ 18μm, a range consistent with the 6–17 μm one demanded by the darwin spatial project.

Higher orders do indeed exists. The kth order is given by $\frac{2}{4k-1}\lambda_0\le\lambda\le\frac{2}{4k-3}\lambda_0$ , k = 1,2,...; elsewhere r_n goes to zero. If ϵ ≠ 0, the bandwidth is shrunk. One can show that the first order is now roughly given by φ₁ ≤ φ ≤ π − φ₁, with $\phi_1=\arctan(\tan^{-1/(2n-1)}\frac{\pi}{2}\epsilon/\lambda_0)$ . (This formula is valid for ϵ ≠ 0 and a sufficiently high n, say n > 5.).

3.1.2. Multi-axial case

The discussion is about the same as the previous one; however, there is no simple formula like Eq. (49), the more general Eq. (47) must be used instead. If one considers the image centre, one obtains $\begin{matrix} R_{n}^{*} (λ; ϵ, λ_{0}) & = & \cos^{2} φ_{ϵ} \cos^{4 n} φ + \sin^{2} φ_{ϵ} \sin^{4 n} φ, \\ R_{n}^{\oplus} (λ; ϵ, λ_{0}) & = & r (λ) (\sin^{2} φ_{ϵ} \cos^{4 n} φ + \cos^{2} φ_{ϵ} \sin^{4 n} φ^{)} . \end{matrix}$ $\begin{eqnarray} R_n^*(\lambda;\epsilon,\lambda_0)&=&\cos^2\phi_\epsilon\cos^{4n}\phi+\sin^2\phi_\epsilon\sin^{4n}\phi,\\ R_n^\oplus(\lambda;\epsilon,\lambda_0)&=&r(\lambda)\left(\sin^2\phi_\epsilon\cos^{4n}\phi+\cos^2\phi_\epsilon\sin^{4n}\phi\right). \end{eqnarray}$ From this one gets the planet-to-starlight ratio $r_{n} (λ; ϵ, λ_{0}) = r (λ) \frac{\tan^{2} φ_{ϵ} + \tan^{4 n} φ}{1 + \tan^{2} φ_{ϵ} \tan^{4 n} φ} \cdot$ $\begin{equation} r_n(\lambda;\epsilon,\lambda_0)=r(\lambda)\frac{\tan^2\phi_\epsilon+\tan^{4n}\phi}{1+\tan^2\phi_\epsilon\tan^{4n}\phi}\cdot \end{equation}$ (57)Again, if the piston ϵ is zero, the planet-to-starlight ratio goes to infinity like tan⁴ⁿφ, as n, the interferometer order, increases. This defines the many spectral orders: $\frac{2}{4k-1}\lambda_0\le\lambda\le\frac{2}{4k-3}\lambda_0$ , and the first one $\frac{2}{3}\lambda_0\le\lambda\le 2\lambda_0$ is the most extented.

If the piston is not zero, the planet-to-starlight ratio is approximately multiplied by a factor cot²φ_ϵ ≈ (λ / πϵ)² on approximately the same reduced bandwidth φ₁ ≤ φ ≤ π − φ₁ as in the co-axial case. Figure 9 shows this ratio for a piston of zero.

Fig. 9

Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ₀ / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics.

3.2. Introduction of a random piston

Given the rather tight specifications on the value of the acceptable piston, the new concept seems more suited to a spatial project. However, with the advent of modern fringe-tracking techniques, it is possible to consider a ground-based instrument located on a site of excellent quality, such as in Antarctica. In that case, the rapid atmospheric fluctuations must be taken into account, and the residual differential piston ϵ between the P and Q pupils after correction by a fringe tracker, must be considered as a random variable. In this situation, the residual light function $R^*_n$ must be replaced by its expectation $\e{R^*_n}$ . To avoid repetition, we focus on the multi-axial case, because the co-axial one is very similar.

To compute $\e{R^*_n}$ , we write $\begin{matrix} R_{n}^{*} & = & \cos^{2} φ_{ϵ} \cos^{4 n} φ + \sin^{2} φ_{ϵ} \sin^{4 n} φ, \\ = & \frac{1}{2} (\cos^{4 n} φ + \sin^{4 n} φ^{)} \\ + \frac{1}{2} \cos 2 φ_{ϵ} (\cos^{4 n} φ - \sin^{4 n} φ^{)}, \end{matrix}$ $\begin{eqnarray} R^*_n &=&\cos^2\phi_\epsilon\cos^{4n}\phi+\sin^2\phi_\epsilon\sin^{4n}\phi,\\ &=&\thalf\left(\cos^{4n}\phi+\sin^{4n}\phi\right)\nonumber\\ &&+\, \thalf\cos2\phi_\epsilon\left(\cos^{4n}\phi-\sin^{4n}\phi\right), \end{eqnarray}$ $\begin{matrix} E {R_{n}^{*}} & = & \frac{1}{2} (\cos^{4 n} φ + \sin^{4 n} φ^{)} + \frac{1}{2} E {\cos 2 φ_{ϵ}} \\ \times (\cos^{4 n} φ - \sin^{4 n} φ^{)} . \end{matrix}$ $\begin{eqnarray} \e{R^*_n} &=&\thalf\left(\cos^{4n}\phi+\sin^{4n}\phi\right) + \thalf\e{\cos2\phi_\epsilon}\nonumber\\ &&\times\,\left(\cos^{4n}\phi-\sin^{4n}\phi\right). \end{eqnarray}$ (60)To evaluate E { cos 2φ_ϵ } , we take advantage of the fact that, if X is a random variable, then E { cos X } is the real part of its characteristic function Z(ω) = E { e^jωX } evaluated at ω = 1. Furthermore, if X is a Gaussian random variable of mean m and standard deviation s, its characteristic function is $Z(\omega)={\rm e}^{jm\omega-\half s^2\omega^2}$ . By setting X = 2πϵ / λ, we obtain m = 2πμ / λ and s = 2πσ / λ, where μ is the mean and σ the standard deviation of ϵ. It follows that $E {\cos 2 φ_{ϵ}} = \exp ({- 2 π^{2} σ^{2} / λ^{2}}^{)} \cos (2 πμ / λ) .$ $\begin{equation} \e{\cos2\phi_\epsilon} = \exp\left({-2\pi^2\sigma^2/\lambda^2}\right)\cos(2\pi\mu/\lambda)\,. \end{equation}$ (61)It is convenient to express this result in terms of the equivalent piston $\tilde{\epsilon}$ defined as $\cos2\phi_{\tilde{\epsilon}}=\e{\cos2\phi_\epsilon}$ . An important example is the unbiased case where μ = 0. If σ / λ is small, one obtains E { cos2φ_ϵ } = exp(−2π²σ² / λ²) ≈ 1−2(πσ / λ)² and since $\cos2\phi_{\tilde{\epsilon}}\approx1-2(\pi\tilde{\epsilon}/\lambda)^2$ , then the equivalent piston is equal to σ.

The main result of this section is that, in the case of Gaussian random fluctuations of the piston around zero, the expected residual light function is equal to its deterministic value for the standard deviation of the piston; that is, $E {R_{n}^{*} (u,v,λ; λ_{0},ϵ)} = R_{n}^{*} (u,v,λ; λ_{0},σ) .$ $\begin{equation} \e{R^*_n(u,v,\lambda;\lambda_0,\epsilon)} = R^*_n(u,v,\lambda;\lambda_0,\sigma). \end{equation}$ (62)Figures 10 and 11 show the planet-to-starlight ratio (or its expectation) for a piston of 3 nm.

Fig. 10

Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ₀ = 9.05 μm.

Fig. 11

Planet-to-starlight ratio spectrum. Same as Fig. 10 but for a multi-axial Fizeau setting.

4. Conclusion

This paper presented a more complete analysis of a very efficient nulling interferometer, which appears as an iterative version of the well known Bracewell nulling interferometer. With the help of extended analytical computations of the amplitude expression, we show that the device acts as an optical differential operator on the 3D light distribution, i.e., spatially and chromatically.

We considered two kinds of systems: the (P_n,Q_n) chessboards designed for the Michelson setting and the $(P^*_n,Q^*_n)$ chessboards designed for the Fizeau one. The (P_n,Q_n) cannot be used in the Fizeau case because their PSF are not axisymmetric. The , however, may be considered for a Michelson interferometer because their performances are just slightly inferior to their Fizeau equivalents and present the advantage of an axisymmetric PSF.

The net effect of the device discussed in this paper is to “push away” the stellar light from the centre of the image, leaving room for an “out of phase” object to appear in the area now left free. This takes place for a very broad range of wavelengths if one considers the planet-to-starlight ratio. A design of order n = 7 tuned at λ₀ = 9.05μm can meet the darwin space project specifications, i.e., to cancel the starlight by a factor of 10⁵ over a wavelength range of 6–17 μm. Since the planet light is also diminished, the order n should not be pushed far beyond the point where the planet-to-starlight ratio is equal to one. For example, with n = 7, the starlight residuals are reduced below the planet light on most of the darwin bandwidth, while one still has more than 50% of the planet light over 20% of the bandwidth. Therefore, the full nominal darwin bandwidth can be covered for the planet by adjusting the device successively to five different wavelengths λ₀.

For an effective implementation of this device on the sky, a detailed study of the signal-to-noise ratio should be made prior to decide to what order the interferometer should be constructed. This choice must result from a compromise between the depth of the nulling and the effective planet-light bandwidth.

Imperfections in design have been considered in the form of a deterministic or a random piston between the two pupils of the interferometer. A systematic piston or equivalently a standard deviation of 3 nm seems to lead to acceptable performances in the case of the darwin project but also for a ground-based interferometer.

¹

More precisely, ρ₀ performs a Möbius transformation of C. In that case, it is a reflection (inversion) relative to the circle of center 1 and radius $\sqrt{2}$ , followed by a reflection about the imaginary axis (see e.g. Ratcliffe, J. G. 1994,Chap. 4). This transformation leaves invariant the two points j and −j.

Acknowledgments

We are very grateful to Raphaël Galicher for a careful reading of the manuscript and to the referee whose comments have greatly helped clarify and improve this paper. This work received the support of PHASE, the high angular resolution partnership between ONERA, Observatoire de Paris, CNRS, and University Denis Diderot Paris 7.

References

Allouche, J.-P., & Mendès-France, M. 2008, Monatshefte für Mathematik, 155, 301 [CrossRef] [Google Scholar]
Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]
Buisset, C., Rejeaunier, X., Rabbia, Y., et al. 2006, in Advances in Stellar Interferometry, ed. J. D. Monnier, M. Schöller, & W. C. Danchi, SPIE Conf. Proc., 6268, 626819 [Google Scholar]
Prouhet, E. 1851, Comptes Rendus des Séances de l’Académie des Sciences, 33, 225 [Google Scholar]
Ratcliffe, J. G. 1994, Foundations of Hyperbolic Manifold, 1st edn. (New York: Springer-Verlag) [Google Scholar]
Rouan, D., & Pelat, D. 2008, A&A, 484, 581 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wallner, O., Perdigues Armengol, J. M., & Karlsson, A. L. 2004, in New Frontiers in Stellar Interferometry, ed. W. A. Traub. (Bellingham, WA: The International Society for Optical Engineering), SPIE Conf. Proc., 5491, 798 [Google Scholar]

Appendix A: Constraints on the number of cells

Here we try to find a lower limit to the number of cells S constituting the chessboards for a given even nulling order N. We recall that the numeric values contained in the P_n and Q_n chessboards are even and odd integers, respectively.

We find that our design is optimal for N = 2 and 4, but the question remains open for N ≥ 6.

Cancelling up to the second nulling order.

If n_s and m_s are the P and Q matrices elements, we must have $\sum_{s = 1}^{S} n_{s}^{2} = \sum_{s = 1}^{S} m_{s}^{2} = Σ,$ $\appendix \setcounter{section}{1} \begin{equation} \tsum_{s=1}^S n_s^2 = \tsum_{s=1}^S m_s^2 = \Sigma, \end{equation}$ (A.1)where Σ is the common sum. Since the n_s are even, Σ must be divisible by 4. Because m_s is odd, we have m_s = 2k_s + 1 and $\begin{matrix} \sum_{s = 1}^{S} (2 k_{s} + 1)^{2} & \equiv & 0 (\mod 4), \\ 4 \sum_{s = 1}^{S} k_{s}^{2} + 4 \sum_{s = 1}^{S} k_{s} + S & \equiv & 0 (\mod 4), \\ S & \equiv & 0 (\mod 4) . \end{matrix}$ $\appendix \setcounter{section}{1} \begin{eqnarray} \tsum_{s=1}^S (2k_s+1)^2 &\equiv& 0\pmod{4},\\ 4\tsum_{s=1}^S k_s^2 + 4\tsum_{s=1}^S k_s + S &\equiv& 0\pmod{4},\\ S &\equiv& 0\pmod{4}. \end{eqnarray}$ Therefore the number of cells in each chessboard must be a non-zero multiple of 4. We have shown by the construction of the P₁ and Q₁ chessboards that S = 4 was sufficent, and this is the minimum number of cells required to cancel the chromatic term up to second order.

The above result is a consequence of the property that (2k + 1)² ≡ 1(mod4). In fact one can show that (2k + 1)² ≡ 1(mod8) that, in turn, is a particular case with p = 1 of the more general result $(2 k + 1)^{2^{p}} \equiv 1 (\mod 2^{p + 2}),$ $\appendix \setcounter{section}{1} \begin{equation} \label{eq:modulo} (2k+1)^{2^p}\equiv 1\pmod{2^{p+2}}\,, \end{equation}$ (A.5)which one can easily establish by induction.

Cancelling up to the fourth nulling order.

From (A.5) with p = 2, we obtain (2k + 1)⁴ ≡ 1(mod16). Then we deduce that S ≡ 0(mod16). Again, we show that the 16 cells in P₂ and Q₂ were sufficent to cancel the chromatic term up the fourth order, and it is the minimum.

Cancelling beyond the fourth nulling order.

One can use (A.5) to sets limits on the sixth order cancelled by P₃ and Q₃, but they are too loose. (Unless one can prove that, if 2²ⁿ is a divisor of $\appendix \setcounter{section}{1} $\sum_{s=1}^S(2k_s+1)^{2n}$$ then, for n > 1, it is also a divisor of $\appendix \setcounter{section}{1} $\sum_{s=1}^S(2k_s+1)^{2^{2n-2}}$$ ). All that can be said is that the number of cells must be a multiple of 16, while P₃ and Q₃ possesses 64. Up to now, as soon as the nulling order N ≥ 5, no tighter limits than those deduced from (A.5) are known to the authors regarding the number of cells necessary to cancel the chromatic term up to that order.

All Tables

Table 1

Number of cells in the interferometer (P_n,Q_n) according to the phase shift exp(jkπλ₀ / λ).

In the text

Table 2

Number of cells in (P_n,Q_n) producing a given phase shift.

In the text

Table 3

The nulling power of the chessboards considered in this paper.

In the text

All Figures

	Fig. 1 A possible optical scheme of the nulling interferometer in co-axial configuration.
In the text

Fig. 2

Graphic determination of the relative nulling efficiency ρ₀ of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = e^{πλ₀ / λ} toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (P_n,Q_n) is $\rho_0^{2n+1}$ . This quantity decreases to 0, as n increases where | ρ₀ | <1. That is when ρ₀(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ₀ relative to its star may be detected in many bandwidths whose first order is $\frac{2}{3}\lambda_0<\lambda<2\lambda_0$ .

In the text

	Fig. 3 A possible optical scheme of the nulling interferometer in multi-axial configuration.
In the text

	Fig. 4 The chessboards. The P₂, Q₂ chessboards from which they were built are in the top right corners. The 2kπ or (2k + 1)π phase shifts at λ₀ are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown.
In the text

Fig. 5

Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multi-axial $(P_n^*,Q_n^*)$ design of order n = 7. The system consists of a star and a 10⁶ fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I^0.25 filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ₀.

In the text

Fig. 6

Extinction spectrum of a star by iterative Bracewell nulling uni-axial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The x-axis is the wavelength, either Δλ / λ₀ (bottom) or λ in μm for λ₀ = 9.05 μm (top). The $R_n^*$ integration has been performed over an Airy square of size 2λ₀ / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The y-axis is the log of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 10⁵.

In the text

	Fig. 7 Attenuation spectrum of a planet following the same setting, except for the linear y-axis, as in Fig. 6. Near the wavelength λ₀, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by $\textit{FWHM}=1/\sqrt{2n}$ , where n is the interferometer order.
In the text

Fig. 8

Planet-to-starlight ratio spectrum of a planet 10⁶ times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective.

In the text

	Fig. 9 Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ₀ / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics.
In the text

	Fig. 10 Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ₀ = 9.05 μm.
In the text

	Fig. 11 Planet-to-starlight ratio spectrum. Same as Fig. 10 but for a multi-axial Fizeau setting.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Allouche, J.-P., & Mendès-France, M. 2008, Monatshefte für Mathematik, 155, 301 [CrossRef] [Google Scholar]

[2] Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]

[3] Buisset, C., Rejeaunier, X., Rabbia, Y., et al. 2006, in Advances in Stellar Interferometry, ed. J. D. Monnier, M. Schöller, & W. C. Danchi, SPIE Conf. Proc., 6268, 626819 [Google Scholar]

[4] Prouhet, E. 1851, Comptes Rendus des Séances de l’Académie des Sciences, 33, 225 [Google Scholar]

[5] Ratcliffe, J. G. 1994, Foundations of Hyperbolic Manifold, 1st edn. (New York: Springer-Verlag) [Google Scholar]

[6] Rouan, D., & Pelat, D. 2008, A&A, 484, 581 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[7] Wallner, O., Perdigues Armengol, J. M., & Karlsson, A. L. 2004, in New Frontiers in Stellar Interferometry, ed. W. A. Traub. (Bellingham, WA: The International Society for Optical Engineering), SPIE Conf. Proc., 5491, 798 [Google Scholar]