Issue 
A&A
Volume 524, December 2010



Article Number  A80  
Number of page(s)  10  
Section  Astronomical instrumentation  
DOI  https://doi.org/10.1051/00046361/201014934  
Published online  25 November 2010 
The achromatic chessboard, a new concept of a phase shifter for nulling interferometry
II. Theoretical performance assessment
^{1}
LUTH, Observatoire de Paris, CNRS, Université Paris Diderot,
92190
Meudon,
France
email: didier.pelat@obspm.fr
^{2}
LESIA, Observatoire de Paris, CNRS, UPMC, Université Paris
Diderot, 5 place Jules
Janssen, 92190
Meudon,
France
email: daniel.rouan@obspm.fr; damien.pickel@obspm.fr
Received:
4
May
2010
Accepted:
11
August
2010
Context. Nulling interferometry in the midIR using two telescopes (commonly referred to a Bracewell interferometer) is one possible way of directly detecting exoplanets in the habitable zone and their characterisation in terms of possible life signatures. A large wavelength domain is needed to simultaneously detect the infrared spectral features of a set of a biotracers. An achromatic phase shift of π is then required, and we previously presented a new concept for such a function that allows a simple design with only one device per beam. It is based on two cellular mirrors, called the chessboards, where each cell has a thickness that introduces, for any given central wavelength, a phase shift of (2k + 1)π or of 2kπ on the fraction of the wave it reflects.
Aims. We explore a more rigorous way to establish the optimum cell pattern design to attain the best theoretical performances for planet detection over a broad wavelength range. Two possible types of interferometres are now considered: onaxis and multiaxis.
Methods. We derived a rather simple iterative scheme for both designs, determining the thickness and XY position of the cells. The method confers to the chessboards a high degree of internal symmetry. Each design can be described as an iterative Bracewell interferometer characterised by an integer order. We demonstrate that their efficiencies increases with the power of that order.
Results. The device acts both spatially and versus wavelengths as an optical differential operator on the 3D light distribution. Its power is best understood in the onaxis case since its effect is topush awaythe stellar light from the centre over a very broad range of wavelengths, leaving space for anout of phase object to appear in the cleaned central region. We explore the theoretical performances for onaxis and multiaxis designs in the parameter space, and we especially compute the rejection factor for starlight and the attenuation factor for planet light and introduce the relative nulling efficiency metric. We show that, even with some realistic piston error added, the performances could meet the Darwin space project specifications for both designs, i.e., cancellation of the starlight by a factor of 10^{5} over a wavelength range of 6–17 μm.
Key words: instrumentation: interferometers / techniques: high angular resolution / techniques: interferometric / space vehicles: instruments / planetary systems
© ESO, 2010
1. Introduction
A new concept of a quasiachromatic nulling interferometer has been proposed in a preceding publication (Rouan & Pelat 2008,hereafter Paper I). The whole idea is based on cancelling the polynomial part, according to both the wavelength and spatial coordinates, of the starlight distribution at the centre of the focal plane of the interferometer. This objective was attained thanks to a particular setting of phase shifters on the interferometer pupil planes. The advantage of the proposed design depends on a rather simple arrangement, where the key device is a simple square grid of phase shifters (the socalled “chessboard”) on the pupil plane of the telescopes.
In Paper I, several possible chessboards were proposed and a coaxial (Michelson) setting of the telescopes favoured. Here we concentrate more deeply on two highperformance configurations, one designed for the coaxial case and the other for the multiaxial (Fizeau) case. In Sect. 2, we outline the principles that led to the phase distribution and spatial repartition of the chessboards’ cells. A first approximation of the system performances is also given, with particular emphasis on the principles that govern its effectiveness. In Sect. 3, detailed analytical calculations are carried out to determine the chessboards’ nulling performances with more precision and to explore their robustness to a systematic or random piston on one arm of the interferometer.
2. The chessboard design
We recall that our device consists of two square grids of phase shifters each formed of adjacent cells. Finding a highperformance chessboard setting implies closely inspecting the complex amplitude of the electrical field of the interferometer output equipped with this device. If A stands for this amplitude, the point spread function is given by PSF =  A  ^{2}. According to Fourier optics one gets (1)In this expression, u and v are the Cartesian angular coordinates of a light ray diffracted by the pupil plane, d is the size of the pupil, n_{0} the number of cells on one side of the chessboards, and by definition sinc(x) = sin(x) / x. The symbolic expressions expUV and expPQ stand for two square matrices both with the “chessboard” format: n_{0} × n_{0}. The “dot” operation · between the two matrices is the elementtoelement product not the matrices product. The term Sum( ) realises the sum of all elements of the matrix under the scope of that operator.
The matrix expUV takes the positioning of the cells into account, namely the place where the phase shifters are put on the grid of the chessboards. It is given by (2)In this expression, δ is the displacement vector. It is a set of integers varying from −(n_{0}−1) to +(n_{0}−1) by step of 2, and the symbol ⊗ is the tensor product between the two vectors. For example, if n_{0} = 4, we have δ = (−3, −1,1,3) and we get The second matrix expPQ, takes the values of the phase shifters into account. Adopting the usual convention that the exponential of a matrix is the matrix of the exponentials, we have (5)The matrices P and Q are the chessboard matrices, of size n_{0} × n_{0}. All their elements are integers that give, in units of λ_{0} / 2, the optical path difference (OPD) induced by the corresponding cell of the chessboard. For example, if P = [ 0 ] and Q = [ 1 ] , there is no OPD on one telescope but one of λ_{0} / 2 on the other one, inducing a phase shift of π at λ_{0}. This particular setting corresponds to a Bracewell nulling interferomer (Bracewell 1978). Finally, D is the distance between the two pupils’ centres and Λ is a chromatic variable (6)(The Λ variable was named Δ in Paper I.)
To obtain the best achromatism, we propose a progressive approach below where we first look for the best phaseshift distribution and second for their spatial distribution. We start our investigation with the mathematically simpler coaxial design.
2.1. The coaxial chessboards
In the coaxial configuration, the two pupils are superimposed thanks to some optical arrangement. A practical example is given in Fig. 1, where it includes an imaging lens in the last stage since any efficient detection requires concentrating the light beam on a small surface.
Fig. 1 A possible optical scheme of the nulling interferometer in coaxial configuration. 

Open with DEXTER 
2.1.1. The multigrad solution
The idea, explained in Paper I, is to cancel the chromatic term up to the Nth order of its Taylor series. For this purpose, we need the Taylor series expansion of the elements of e^{jπPΛ} and e^{jπQΛ} matrices. For example, if n_{s} stands for the numerical values contained in P, the Taylor expansion, relative to ε = Δλ / λ_{0}, is given by (7)where p_{k} stands for a polynomial of degree k. (Note that p_{k} is not a monomial because of the chain rule needed to compute the derivatives.) Of course, a similar formula exists for e^{jπmsΛ} involving the values m_{s} contained in Q. Cancelling the chromatic term up to order N implies solving the equation This polynomial is zero for any ε if, and only if, all its coefficients are zero. Therefore we have to solve the system (10)for k = 0,...,N. If c_{ℓk} stands for the coefficients of the polynomials p_{k}, this reduces to (11)This system can be solved by induction, because if a solution is found up to order k, the solution up to order k + 1 is found if (12)Therefore, the chromatic term is nullified up to order N if one solves the following system of equations (13)for the unknown integers n_{s} and m_{s} and the parameter n_{0}.
As in a Bracewell interferometer, we impose the constraint that e^{jπns} = +1 and e^{jπms} = −1, therefore the n_{s} and m_{s} are respectively even and odd integers. As introduced in Paper I, Appendix C, one ends up with a special set of diophantine equation called a multigrad system, namely (14)If two sets of integers, say { n_{s} } and { m_{s} } , solve a multigrad system up to order N, they are called multigrad sets. This property is shortly written as . Hereafter, the two sets are supposed to be of equal size, for short (adopting the convention that 0^{0} = 1).
The key theorem is that if one has , then it follows that . This is true for any c, but is nontrivial only if c ≠ 0 and { m_{s} } ≠ { n_{s} } . In this way, two multigrad sets up to order N generate two new multigrad sets up to order N + 1. During this operation, the size of the sets is multiplied by two. But in order that the number of cells on the chessboards remains a square, the size of the sets must be multiplied by four. This is done by applying two times the key theorem. Moreover, if the sets { n_{s} } and { m_{s} } possess the desired parity, we keep it if we choose an odd constant c, say c = 1. We get the final result:
if , then
.
This remarkable property is at the heart of the scheme proposed below.
2.1.2. The iterative Bracewell scheme
This scheme runs as follows. One starts with a Bracewell nulling interferometer and constructs the next one by two applications of the key theorem. In this way, one first gets . Indeed, we have (always with 0^{0} = 1) (15)As expected, this new interferometer cancels the chromatic term up to order N = 2. To construct the next one, we add c = −1 instead of c = +1 in order to gain symmetry and in an effort to keep the zheights of the cells as low as possible. Finally, to generate the successive phase chessboards, we propose the following iterative scheme (16)where ± means (−1)^{n} (i.e. +1 if n is even and −1 if it is odd). With this process, the chessboards P_{n} and Q_{n} retain the shape of two squares possessing n_{0} = 2^{n} cells on each of their sides. Their combined effect is to cancel the chromatic term up to order N = 2n. The higher n, the more efficent the corresponding nulling interferometer.
Number of cells in the interferometer (P_{n},Q_{n}) according to the phase shift exp(jkπλ_{0} / λ).
The iterative scheme presented above leads to constructing the interferometer (P_{n},Q_{n}) possessing, for example, the phaseshift distribution shown on Table 1. We observe that the progression of the number of cells that produce the same phase shift follows that of the binomial coefficients in the polynomial expansion of (1 + x)^{2n + 1}. This is easily proved by two applications of the recursive formula , which leads to , which is our iterative scheme. (In these formulæ we adopt the convention that if q ≤ 0.) Table 2 gives the phaseshift, together with the corresponding number of cells (the formulæ are valid for two consecutives orders).
Number of cells in (P_{n},Q_{n}) producing a given phase shift.
2.1.3. First estimation of the iterative Bracewell scheme performance
In this section we study the nulling efficiency of an interferometer built on this principle. To compare the interferometers (P_{n},Q_{n}) in the same conditions, differing only by their order n, the light collecting surface of their mirrors must be kept constant by reducing the size of the cells to a square of surface d^{2} / 2^{2n}. Because at this point we have not distributed the cells on the pupil plane, we only consider A_{n}, the sum of the amplitudes. This leads to the formula where we have set z = e^{jπλ0 / λ} and . The common phase factor e^{−jKπλ0 / λ} sets the OPD reference, and the function ⌊ x ⌋ is the “floor” function (i.e. it returns the greatest integer less than or equal to x.)
If λ = λ_{0}, then z = −1 and A_{n} = 0, therefore the nulling is perfect for any n. The device is strictly equivalent to a Bracewell nulling interferometer. Now, if λ ≠ λ_{0}, since the module of the complex (1 + z) / 2 is less than one, A_{n} → 0 as n, the order of the interferometers, increases. This is true for all λ except for the values such that z = 1, i.e. λ = ∞,λ_{0} / 2,λ_{0} / 4,...,λ_{0} / 2k,... Aside from the actual distribution of cells on the pupils, we have obtained a quasiachromatic nulling interferometer.
However, one must judge the perfomances of the device relative to some benchmark. We have demonstrated above that, for an onaxis object, our device has powerful nulling efficiency on a given set of wavelengths. This is of course different for any offaxis object such as a planet which is best detected when its position induces an OPD = Dθ_{planet} = λ_{0} / 2. In that situation, the recombination, at λ_{0}, is fully constructive for the planet. Therefore, the interferometer with a phase shift of π at λ_{0} added to one of its pupils will serve as a reference. In so doing, we transform the nulling interferometer into a nondestructive one for the wavelengths set that was just mentioned above. The phase addition on all the, say, odd phase shifters changes the sign of the odd, q = 2p + 1, binomial coefficients in the expression of A_{n}. If stands for the function obtained in this way, one gets (19)The only change between the expressions of A_{n} and is the sign of z. If λ = λ_{0} / 2, then z = 1 and , we also have a nulling interferometer. Again, if λ ≠ λ_{0} / 2, since the modulus of the complex (1 − z) / 2 is also less than one, as n increases. This is true for all λ except for the values that make z = −1; that is, λ = λ_{0},λ_{0} / 3,λ_{0} / 5,...,λ_{0} / (2k + 1),... The result is an associated “out of phase” nulling interferometer but, fortunately, less efficient than the original one. The offaxis planet will indeed suffer some attenuation but much less than the onaxis star, as the ratio of A_{n} to demonstrates.
We define ρ_{n} as , and name it the relative nulling efficiency. One gets (20)If  ρ_{0}  is less than one and n large enough, the onaxis nulling can be made more efficient than the offaxis one by any factor. Note that ρ_{0}(z) is the relative nulling efficiency of the original Bracewell interferometer.
A deeper understanding of how the iterative Bracewell interferometers works is obtained through careful analysis of the transformation induced by ρ_{0} on the complex plane C of the Fresnel representation. In fact, under ρ_{0}, the unit circle is transformed into the imaginary axis^{1}. One gets (21)This means that ρ_{0}(z), and the Bracewell relative nulling efficency that follows is obtained by a stereographic projection of z toward the real 1 (see Fig. 2). The image of the halfcircles is the segment (−j,j) shown in green in the figure. All points in this segment have a modulus less than one, therefore  ρ_{n}(z)  can be made as small as desired, provided one chooses an interferometer of sufficiently highorder n. This defines many wavelengths orders, the first one being: . Within these bandwiths, an “in phase” object, say a star, is strongly attenuated while an “out of phase” one, say a planet, is much less affected. This property opens interesting perspectives for broadband spectroscopy, or detection, of exoplanets, all the more so for intensity, since the nulling behaves as  ρ_{0}  ^{4n + 2}.
From the dispersion of the optical paths difference ΔOPD produced by the chessboards, it is possible to obtain a very good approximation of the full width at half maximum (FWHM) of the planet attenuation spectrum within the full bandwidths, e.g. . Because the OPDs distribution is binomial, one gets or approximately . Now, ΔOPD = cΔt and . By applying the Heisenberg uncertainty principle, we have ΔtΔE ≥ h / 4π, from which we get (22)The constant of proportionality should not be very different from one, see Fig. 7 for an example where it is approximately equal to . While the planettostar ratio greatly increases with the order n of the chessboards, the effective bandwidth reduction in implies longer exposure times to get the full range of wavelengths.
Fig. 2 Graphic determination of the relative nulling efficiency ρ_{0} of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = e^{πλ0 / λ} toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (P_{n},Q_{n}) is . This quantity decreases to 0, as n increases where  ρ_{0}  <1. That is when ρ_{0}(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ_{0} relative to its star may be detected in many bandwidths whose first order is . 

Open with DEXTER 
Finally, the proposed scheme generates, from one order to the next, a fourfold increase in the number of cells. In Appendix A, we investigate to what extent so many cells are necessary.
2.1.4. Placing the cells on the chessboards
To actually build the interferometer, it is necessary to distribute the chessboards cells on the pupil plane, an operation that leads to a specific PSF on the image plane. The previous performance analysis is valid for a perfectly parallel light beam, therefore, for any distribution of cells, it is only relevant for the central pixel of the image. For the other pixels, it is necessary to ensure that the cells distribution generates a PSF that does not ruin the mighty nulling power of the chessboards as derived above. This can be done by extending the same idea to the spatial dimension of the amplitude A, as given by Eq. (1).
As mentioned in Paper I, we gain insight into this problem by performing a Taylor expansion of the expUV matrix to get (23)In this expression, the symbol [UV] stands for a positioning matrix of size n_{0} × n_{0}. For example, in the case n_{0} = 4, we have (24)By convention the elements of [ UV ] ^{k} are the elements of [UV] raised to the power k. Following the above example, we get (25)Again, the key observation is to recognise that the elements of [UV]^{k} are the sampled values of the 2D polynomial (x + y)^{k}. Therefore, if we were able to arrange the cells in such a way that the expPQ matrix would turn out to be the coefficients of a finite difference 2D differential operator of order k + 1, the expression expPQ· [ UV ] ^{k} would be zero. Indeed all terms of degree r less than or equal to k in (23) would also be cancelled, since expPQ· [ UV ] ^{r} would be the finite difference derivative of order k + 1 of a 2D polynomials of degree less than or equal to k.
It seems, however, impossible to attain this objective exactly, but it is possible to approach it fairly closely. The firstorder approximation, according to ε = Δλ / λ_{0}, of expPQ is (26)In the case where P is even and Q odd, this formula reduces to expPQ ≈ −εjπ(P − Q). It is not too difficult to find an arrangement of cells on the pupil plane that makes P − Q equal to a matrix whose elements are those of a finite difference differential operator (fiddop). We found that it is achievable by the following iterative scheme (27)In this expression ± again means (−1)^{n}. (This scheme is the same as the iterative Bracewell one discussed above. Also , , where e.g. stands for the transposition of P_{n}.) It follows, by induction, that P_{n} − Q_{n} is, as desired, a fiddop of order n. To start the induction, we have (28)It is indeed a gradient operator that, as such, will cancel the firstorder (constant) term in the Taylor expansion of expUV. More generally we have Therefore, P_{n + 1} − Q_{n + 1} is the gradient of P_{n} − Q_{n} and, since the later is a fiddop of order n, then P_{n + 1} − Q_{n + 1} is a fiddop of order n + 1. It is remarkable that all lines and columns of P_{n} − Q_{n} are a ProuhetThueMorse sequence of −1 and +1 of length 2^{n}, as is clear from the way the P_{n} − Q_{n} were constructed.
We recall that the ProuhetThueMorse sequence P_{n} is a string of length 2^{n} whose kth element is given by P_{n}(k) = (−1)^{s(k)} for 0 ≤ k < 2^{n}, where s(k) is the sum of the binary digits of k; e.g. P_{3} = +1, −1, −1, +1, −1, +1, +1, −1. This sequence is also constructed iteratively by P_{0} = +1 and P_{n} = P_{n−1}   −P_{n−1}, where   is the concatenation operator (Prouhet 1851; Allouche & MendèsFrance 2008).
2.2. The multiaxial chessboards
For this design, we set D ≠ 0 in (5) (see Fig. 3 for a practical realisation).
Fig. 3 A possible optical scheme of the nulling interferometer in multiaxial configuration. 

Open with DEXTER 
To obtain A(−u,v) = A(u,v), and because the sin(πuD) function is odd, it is sufficient to make sure that Sum((e^{jπPΛ} − e^{jπQΛ})·expUV(u,v)) is even. It is clear from Eq. (3) that expUV(−u,v) is equal to expUV(u,v) after a symmetry around the pupil’s yaxis. Therefore, if P and Q were left invariant following the same symmetry, our goal would be attained. Moreover, since the Michelson chromatic term must also be made as low as possible, one must use the coaxial chessboards, one way or the other. One solution is to put the coaxial chessboards, shrunken by a factor of two in the upper right corner of the multiaxial chessboards, for example, and to fill the empty cells by means of successive x and y symmetries. This setting already gives good results, but one can gain more nulling power if one requires that A(−u, −v) = A(u,v).
Fig. 4
The chessboards. The P_{2}, Q_{2} chessboards from which they were built are in the top right corners. The 2kπ or (2k + 1)π phase shifts at λ_{0} are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown. 

Open with DEXTER 
To achieve this objective, one starts with the coaxial chessboard shrunk by a factor of two, and one again put it in the upper righthand corner of the multiaxial chessboard under construction. We fill the lower righthand corner by a xaxis symmetry with a change of sign and finally we fill the remaining lefthand side half plane by a yaxis symmetry, again with a change in sign. We symbolically note ‘unfold’ this procedure. If P_{n−1} and Q_{n−1} stands for two uniaxial chessboards, we construct the corresponding multiaxial chessboards and by way of the operations (33)( and have the same number of cells as P_{n} and Q_{n}). We have, for example, (34)(35)The “unfold” procedure offers many advantages.

1.
Since the combination (P_{n−1},Q_{n−1}) cancels the chromatic term up to order N = 2(n − 1), the is expected to cancel it, at least up to the same order N = 2n − 2. In fact, the change in sign during the unfolding process implements the following property: if , then . Therefore the pair cancels the chromatic term up to order N = 2n − 1.
 2.
We expect to be, as P_{n−1} − Q_{n−1}, a fiddop of, at least, order n − 1. This is true if n is even, but thanks to the symmetries of the ProuhetThueMorse sequence, this order is increased to n if n is odd.
 3.
Finally, the symmetry of the multiaxial chessboards is such that it cancels the [ UV ] ^{k} matrices when k is odd (see Eq. (25)).
Because of these supplementary properties, the chessboards designed for the multiaxial case are expected to be just slightly less successful than their coaxial (P_{n},Q_{n}) counterparts.
The nulling power of the chessboards considered in this paper.
Table 3 is a summary of the nulling properties of the chessboards considered in this paper.
3. Performance of the design
A first evaluation of the performances of the chessboards have been established in the preceding section; however, it is possible to obtain far more detailed information by working out the analytical expression of the electrical field amplitude. This is made possible because relatively simple operations allow passing, almost mechanically, from one chessboard of order n to the next one. These are, in the following order,

start with a Bracewell setting;

shrink the chessboards by a factor of two;

shift it to its proper place;

add a phase; and

perform symmetries with a change in sign.
All these operations, when performed on the pupil plane, have simple Fourier equivalents on the image plane. Therefore, a computation of the amplitude function can be considered. A straightforward, though lengthy, analytical work leads to the formulæ(36)(37)where P_{n} and Q_{n} are, in this context, two functions on the image plane. They are not the chessboard matrices. The expressions for S_{n} and D_{n} differ according to the interferometer setting. In the coaxial case, we set D = 0 and obtain (38)while in the multiaxial one, D ≠ 0 and we find (39)(40)The and functions reflect the self similarity of the iterative Bracewell scheme and may be called the “nulling functions”: For a given λ, the θ_{u},θ_{v} are the spatial variables and φ^{ ∗ } is the chromatic variable (because it depends on λ_{0}). We then obtain (45)(46)One recalls that: D is the separation between the two telescopes, d the mirror size, λ the wavelength considered, λ_{0} the wavelength for which the chessboards are tuned and u,v are the Cartesian angular coordinates of the light diffracted by the pupils. Figure 5 shows the theoretical image of a planetary system similar to ours seen with the device. The image was calculated using these formulæ.
Fig. 5
Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multiaxial design of order n = 7. The system consists of a star and a 10^{6} fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I^{0.25} filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ_{0}. 

Open with DEXTER 
To account for a possible path difference between the two arms of the interferometer, we introduce a systematic piston on one arm. If ϵ is the piston value, instead of (36), we have (47)(48)
3.1. Extinction of the star, attenuation of the planet, and planettostarlight ratio
Let us define as the residual stellar light intensity. We have . We consider this function successively for the coaxial and multiaxial cases below.
3.1.1. Coaxial case
Residual light from the star.
One finds, in the coaxial case (49)The starlight extinction spectrum shown in Fig. 6 is defined as the integration . Of course, as expected, the nulling effect may reach considerable values over a large bandwidth. For example, an extinction factor of 10^{5} can be easily achieved over the bandwidth 6.2–17.5 μm by an interferometer of order n = 7 (orange curve) tuned at λ_{0} = 9.05μm.
Fig. 6
Extinction spectrum of a star by iterative Bracewell nulling uniaxial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The xaxis is the wavelength, either Δλ / λ_{0} (bottom) or λ in μm for λ_{0} = 9.05 μm (top). The integration has been performed over an Airy square of size 2λ_{0} / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The yaxis is the log of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 10^{5}. 

Open with DEXTER 
Fig. 7
Attenuation spectrum of a planet following the same setting, except for the linear yaxis, as in Fig. 6. Near the wavelength λ_{0}, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by , where n is the interferometer order. 

Open with DEXTER 
To understand more deeply how the nulling works, one considers at the centre of the image plane (i.e. θ_{u} = 0 and θ_{v} = 0), leading to (50)Finally, we suppose a perfect design: ϵ = 0. One ends up with the chessboard’s extinction capability characterised by (51)As n goes to infinity, the starlight residual goes to zero at all wavelengths, except those where the cosine is equal to ± 1; that is, λ = ∞,λ_{0} / 2,λ_{0} / 4,...,λ_{0} / 2k,...
Light from the planet.
Since it is offaxis, the planet wavefront is tilted relative to the pupil plane. As in Paper I, we assume that the planet was observed when this tilt induced a phase shift of π at λ_{0} between the two arms of the interferometer. If, on the pupil plane, we neglect the effect of the tilt, the residual planet intensity is given by , where . In this expression r(λ) stands for the planet/star intensity ratio at λ, typically r ≈ 10^{6} in the thermal infrared. At the image centre, we have (52)and, in the absence of piston, (53)As n goes to infinity, the planet light also goes to zero at all wavelengths, except those where the sine is equal to ± 1, that is: λ = λ_{0},λ_{0} / 3,...,λ_{0} / (2k + 1),... Figure 7 shows the attenuation spectrum of the planet.
Planettostarlight ratio.
To successfully detect and to do the spectroscopy of an exoplanet, one must reduce all sources of noise. For the lower levels n of our device, the starlight residuals play a predominant role over other sources of noise (such as the exozodiacal light for example). One must therefore consider the ratio , which gives the planet residual intensity relatives to the star residual intensity. This quantity may be called the planettostarlight ratio and denoted r_{n}. Typically, if r_{n} ≥ 1, the source of noise from the star ceases to be dominant and the planet is likely to be detected. We obtain (54)where again ± means + if n is even and − if it is odd. If n → ∞, and to the extent that  tanφ  >1, then r_{n} → rcot^{2}φ_{ϵ}. This expression reduces to r_{n} ≈ r(λ)(λ / πϵ)^{2} for low piston value. For example, if r = 10^{6}, λ = 10μm, and ϵ = 3nm, we get r_{n} ≈ 1. That means that a planet 10^{6} times fainter than its star is likely to be detected at 10 μm even with a piston of 3nm. If the piston is increased by a factor of 3, then the planettostarlight ratio diminishes by approximately a factor of 10. Figure 8 shows the planettostarlight ratio for a zero piston.
This discussion is, for the central pixel and a zero piston, in accordance with the previous analysis in 2.1.3. In particular, we have r / r_{n} =  ρ_{n}  ^{2}.
Fig. 8
Planettostarlight ratio spectrum of a planet 10^{6} times fainter than the star around which it revolves, seen through a uniaxial (Michelson) interferometer. When the value on the yaxis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective. 

Open with DEXTER 
The planettostarlight ratio bandwidth.
The condition  tanφ  >1 defines the planettostarlight ratio bandwidth or, for short, the contrast bandwidth. If ϵ = 0, it defines a first order where r_{n} goes to infinity like tan^{4n + 2}φ as n (the interferometer order) increases. The contrast bandwidth of the first order is π / 4 ≤ φ ≤ 3π / 4; which is, . For example, if λ_{0} = 9μm, one gets the bandwidth 6μm ≤ λ ≤ 18μm, a range consistent with the 6–17 μm one demanded by the darwin spatial project.
Higher orders do indeed exists. The kth order is given by , k = 1,2,...; elsewhere r_{n} goes to zero. If ϵ ≠ 0, the bandwidth is shrunk. One can show that the first order is now roughly given by φ_{1} ≤ φ ≤ π − φ_{1}, with . (This formula is valid for ϵ ≠ 0 and a sufficiently high n, say n > 5.).
3.1.2. Multiaxial case
The discussion is about the same as the previous one; however, there is no simple formula like Eq. (49), the more general Eq. (47) must be used instead. If one considers the image centre, one obtains From this one gets the planettostarlight ratio (57)Again, if the piston ϵ is zero, the planettostarlight ratio goes to infinity like tan^{4n}φ, as n, the interferometer order, increases. This defines the many spectral orders: , and the first one is the most extented.
If the piston is not zero, the planettostarlight ratio is approximately multiplied by a factor cot^{2}φ_{ϵ} ≈ (λ / πϵ)^{2} on approximately the same reduced bandwidth φ_{1} ≤ φ ≤ π − φ_{1} as in the coaxial case. Figure 9 shows this ratio for a piston of zero.
Fig. 9
Planettostarlight ratio spectrum. Same as Fig. 8 but for a multiaxial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ_{0} / D, where D is separation of the two mirrors. The abrupt cutoff at approximately 16.5μm is the long wavelength cut of the fibre optics. 

Open with DEXTER 
3.2. Introduction of a random piston
Given the rather tight specifications on the value of the acceptable piston, the new concept seems more suited to a spatial project. However, with the advent of modern fringetracking techniques, it is possible to consider a groundbased instrument located on a site of excellent quality, such as in Antarctica. In that case, the rapid atmospheric fluctuations must be taken into account, and the residual differential piston ϵ between the P and Q pupils after correction by a fringe tracker, must be considered as a random variable. In this situation, the residual light function must be replaced by its expectation . To avoid repetition, we focus on the multiaxial case, because the coaxial one is very similar.
To compute , we write (60)To evaluate E { cos 2φ_{ϵ} } , we take advantage of the fact that, if X is a random variable, then E { cos X } is the real part of its characteristic function Z(ω) = E { e^{jωX} } evaluated at ω = 1. Furthermore, if X is a Gaussian random variable of mean m and standard deviation s, its characteristic function is . By setting X = 2πϵ / λ, we obtain m = 2πμ / λ and s = 2πσ / λ, where μ is the mean and σ the standard deviation of ϵ. It follows that (61)It is convenient to express this result in terms of the equivalent piston defined as . An important example is the unbiased case where μ = 0. If σ / λ is small, one obtains E { cos2φ_{ϵ} } = exp(−2π^{2}σ^{2} / λ^{2}) ≈ 1−2(πσ / λ)^{2} and since , then the equivalent piston is equal to σ.
The main result of this section is that, in the case of Gaussian random fluctuations of the piston around zero, the expected residual light function is equal to its deterministic value for the standard deviation of the piston; that is, (62)Figures 10 and 11 show the planettostarlight ratio (or its expectation) for a piston of 3 nm.
Fig. 10
Planettostarlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ_{0} = 9.05 μm. 

Open with DEXTER 
Fig. 11
Planettostarlight ratio spectrum. Same as Fig. 10 but for a multiaxial Fizeau setting. 

Open with DEXTER 
4. Conclusion
This paper presented a more complete analysis of a very efficient nulling interferometer, which appears as an iterative version of the well known Bracewell nulling interferometer. With the help of extended analytical computations of the amplitude expression, we show that the device acts as an optical differential operator on the 3D light distribution, i.e., spatially and chromatically.
We considered two kinds of systems: the (P_{n},Q_{n}) chessboards designed for the Michelson setting and the chessboards designed for the Fizeau one. The (P_{n},Q_{n}) cannot be used in the Fizeau case because their PSF are not axisymmetric. The , however, may be considered for a Michelson interferometer because their performances are just slightly inferior to their Fizeau equivalents and present the advantage of an axisymmetric PSF.
The net effect of the device discussed in this paper is to “push away” the stellar light from the centre of the image, leaving room for an “out of phase” object to appear in the area now left free. This takes place for a very broad range of wavelengths if one considers the planettostarlight ratio. A design of order n = 7 tuned at λ_{0} = 9.05μm can meet the darwin space project specifications, i.e., to cancel the starlight by a factor of 10^{5} over a wavelength range of 6–17 μm. Since the planet light is also diminished, the order n should not be pushed far beyond the point where the planettostarlight ratio is equal to one. For example, with n = 7, the starlight residuals are reduced below the planet light on most of the darwin bandwidth, while one still has more than 50% of the planet light over 20% of the bandwidth. Therefore, the full nominal darwin bandwidth can be covered for the planet by adjusting the device successively to five different wavelengths λ_{0}.
For an effective implementation of this device on the sky, a detailed study of the signaltonoise ratio should be made prior to decide to what order the interferometer should be constructed. This choice must result from a compromise between the depth of the nulling and the effective planetlight bandwidth.
Imperfections in design have been considered in the form of a deterministic or a random piston between the two pupils of the interferometer. A systematic piston or equivalently a standard deviation of 3 nm seems to lead to acceptable performances in the case of the darwin project but also for a groundbased interferometer.
More precisely, ρ_{0} performs a Möbius transformation of C. In that case, it is a reflection (inversion) relative to the circle of center 1 and radius , followed by a reflection about the imaginary axis (see e.g. Ratcliffe, J. G. 1994,Chap. 4). This transformation leaves invariant the two points j and −j.
Acknowledgments
We are very grateful to Raphaël Galicher for a careful reading of the manuscript and to the referee whose comments have greatly helped clarify and improve this paper. This work received the support of PHASE, the high angular resolution partnership between ONERA, Observatoire de Paris, CNRS, and University Denis Diderot Paris 7.
References
 Allouche, J.P., & MendèsFrance, M. 2008, Monatshefte für Mathematik, 155, 301 [CrossRef] [Google Scholar]
 Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]
 Buisset, C., Rejeaunier, X., Rabbia, Y., et al. 2006, in Advances in Stellar Interferometry, ed. J. D. Monnier, M. Schöller, & W. C. Danchi, SPIE Conf. Proc., 6268, 626819 [Google Scholar]
 Prouhet, E. 1851, Comptes Rendus des Séances de l’Académie des Sciences, 33, 225 [Google Scholar]
 Ratcliffe, J. G. 1994, Foundations of Hyperbolic Manifold, 1st edn. (New York: SpringerVerlag) [Google Scholar]
 Rouan, D., & Pelat, D. 2008, A&A, 484, 581 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Wallner, O., Perdigues Armengol, J. M., & Karlsson, A. L. 2004, in New Frontiers in Stellar Interferometry, ed. W. A. Traub. (Bellingham, WA: The International Society for Optical Engineering), SPIE Conf. Proc., 5491, 798 [Google Scholar]
Appendix A: Constraints on the number of cells
Here we try to find a lower limit to the number of cells S constituting the chessboards for a given even nulling order N. We recall that the numeric values contained in the P_{n} and Q_{n} chessboards are even and odd integers, respectively.
We find that our design is optimal for N = 2 and 4, but the question remains open for N ≥ 6.
Cancelling up to the second nulling order.
If n_{s} and m_{s} are the P and Q matrices elements, we must have (A.1)where Σ is the common sum. Since the n_{s} are even, Σ must be divisible by 4. Because m_{s} is odd, we have m_{s} = 2k_{s} + 1 and Therefore the number of cells in each chessboard must be a nonzero multiple of 4. We have shown by the construction of the P_{1} and Q_{1} chessboards that S = 4 was sufficent, and this is the minimum number of cells required to cancel the chromatic term up to second order.
The above result is a consequence of the property that (2k + 1)^{2} ≡ 1(mod4). In fact one can show that (2k + 1)^{2} ≡ 1(mod8) that, in turn, is a particular case with p = 1 of the more general result (A.5)which one can easily establish by induction.
Cancelling up to the fourth nulling order.
From (A.5) with p = 2, we obtain (2k + 1)^{4} ≡ 1(mod16). Then we deduce that S ≡ 0(mod16). Again, we show that the 16 cells in P_{2} and Q_{2} were sufficent to cancel the chromatic term up the fourth order, and it is the minimum.
Cancelling beyond the fourth nulling order.
One can use (A.5) to sets limits on the sixth order cancelled by P_{3} and Q_{3}, but they are too loose. (Unless one can prove that, if 2^{2n} is a divisor of then, for n > 1, it is also a divisor of ). All that can be said is that the number of cells must be a multiple of 16, while P_{3} and Q_{3} possesses 64. Up to now, as soon as the nulling order N ≥ 5, no tighter limits than those deduced from (A.5) are known to the authors regarding the number of cells necessary to cancel the chromatic term up to that order.
All Tables
Number of cells in the interferometer (P_{n},Q_{n}) according to the phase shift exp(jkπλ_{0} / λ).
All Figures
Fig. 1 A possible optical scheme of the nulling interferometer in coaxial configuration. 

Open with DEXTER  
In the text 
Fig. 2 Graphic determination of the relative nulling efficiency ρ_{0} of a Bracewell nulling interferometer through a stereographic projection of the Fresnel vector z = e^{πλ0 / λ} toward point 1 on the real axis. The relative nulling efficiency of the iterative Bracewell scheme (P_{n},Q_{n}) is . This quantity decreases to 0, as n increases where  ρ_{0}  <1. That is when ρ_{0}(z) is in the (−j,j) segment. Therefore, a planet dephased by π at λ_{0} relative to its star may be detected in many bandwidths whose first order is . 

Open with DEXTER  
In the text 
Fig. 3 A possible optical scheme of the nulling interferometer in multiaxial configuration. 

Open with DEXTER  
In the text 
Fig. 4
The chessboards. The P_{2}, Q_{2} chessboards from which they were built are in the top right corners. The 2kπ or (2k + 1)π phase shifts at λ_{0} are indicated in the cells’ centres by their 2k (blue) or 2k + 1 (red) values. The value zero is not shown. 

Open with DEXTER  
In the text 
Fig. 5
Image of a star system seen through a Michelson interferometer d / D = 1 equipped with the multiaxial design of order n = 7. The system consists of a star and a 10^{6} fainter planet. The light from the star is shown in yellow and the one from the planet in blue. The image intensity I has been enhanced by a I^{0.25} filter to see the very low levels (here: the star). The effect of the device is to push away the light from the star, while that of the planet fits in the space left empty. To illustrate the achromatic effect, the observation is simulated for a wavelength detuned by 10% compared to the central wavelength λ_{0}. 

Open with DEXTER  
In the text 
Fig. 6
Extinction spectrum of a star by iterative Bracewell nulling uniaxial (Michelson) interferometers of order n = 0,1,...,12. The original Bracewell n = 0 is in black, order 1 in yellow, 2 in red, etc. The xaxis is the wavelength, either Δλ / λ_{0} (bottom) or λ in μm for λ_{0} = 9.05 μm (top). The integration has been performed over an Airy square of size 2λ_{0} / d where d is the size of the mirrors. The maximum bandwidth is indicated between the two dashed vertical lines, and it meets the darwin project nominal bandwidth indicated by the two broken vertical lines. The yaxis is the log of the integrated extinction factor, 0 means no extinction, and the horizontal line at −5 sets the nominal darwin extinction factor of 10^{5}. 

Open with DEXTER  
In the text 
Fig. 7
Attenuation spectrum of a planet following the same setting, except for the linear yaxis, as in Fig. 6. Near the wavelength λ_{0}, where the device is tuned, the planet only suffers very small attenuation. The full width at half maximum (FWHM) is given, with a very good approximation, by , where n is the interferometer order. 

Open with DEXTER  
In the text 
Fig. 8
Planettostarlight ratio spectrum of a planet 10^{6} times fainter than the star around which it revolves, seen through a uniaxial (Michelson) interferometer. When the value on the yaxis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective. 

Open with DEXTER  
In the text 
Fig. 9
Planettostarlight ratio spectrum. Same as Fig. 8 but for a multiaxial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ_{0} / D, where D is separation of the two mirrors. The abrupt cutoff at approximately 16.5μm is the long wavelength cut of the fibre optics. 

Open with DEXTER  
In the text 
Fig. 10
Planettostarlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ_{0} = 9.05 μm. 

Open with DEXTER  
In the text 
Fig. 11
Planettostarlight ratio spectrum. Same as Fig. 10 but for a multiaxial Fizeau setting. 

Open with DEXTER  
In the text 
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.