Issue |
A&A
Volume 524, December 2010
|
|
---|---|---|
Article Number | A80 | |
Number of page(s) | 10 | |
Section | Astronomical instrumentation | |
DOI | https://doi.org/10.1051/0004-6361/201014934 | |
Published online | 25 November 2010 |
The achromatic chessboard, a new concept of a phase shifter for nulling interferometry
II. Theoretical performance assessment
1
LUTH, Observatoire de Paris, CNRS, Université Paris Diderot,
92190
Meudon,
France
e-mail: didier.pelat@obspm.fr
2
LESIA, Observatoire de Paris, CNRS, UPMC, Université Paris
Diderot, 5 place Jules
Janssen, 92190
Meudon,
France
e-mail: daniel.rouan@obspm.fr; damien.pickel@obspm.fr
Received:
4
May
2010
Accepted:
11
August
2010
Context. Nulling interferometry in the mid-IR using two telescopes (commonly referred to a Bracewell interferometer) is one possible way of directly detecting exoplanets in the habitable zone and their characterisation in terms of possible life signatures. A large wavelength domain is needed to simultaneously detect the infrared spectral features of a set of a bio-tracers. An achromatic phase shift of π is then required, and we previously presented a new concept for such a function that allows a simple design with only one device per beam. It is based on two cellular mirrors, called the chessboards, where each cell has a thickness that introduces, for any given central wavelength, a phase shift of (2k + 1)π or of 2kπ on the fraction of the wave it reflects.
Aims. We explore a more rigorous way to establish the optimum cell pattern design to attain the best theoretical performances for planet detection over a broad wavelength range. Two possible types of interferometres are now considered: on-axis and multi-axis.
Methods. We derived a rather simple iterative scheme for both designs, determining the thickness and XY position of the cells. The method confers to the chessboards a high degree of internal symmetry. Each design can be described as an iterative Bracewell interferometer characterised by an integer order. We demonstrate that their efficiencies increases with the power of that order.
Results. The device acts both spatially and versus wavelengths as an optical differential operator on the 3D light distribution. Its power is best understood in the on-axis case since its effect is topush awaythe stellar light from the centre over a very broad range of wavelengths, leaving space for anout of phase object to appear in the cleaned central region. We explore the theoretical performances for on-axis and multi-axis designs in the parameter space, and we especially compute the rejection factor for starlight and the attenuation factor for planet light and introduce the relative nulling efficiency metric. We show that, even with some realistic piston error added, the performances could meet the Darwin space project specifications for both designs, i.e., cancellation of the starlight by a factor of 105 over a wavelength range of 6–17 μm.
Key words: instrumentation: interferometers / techniques: high angular resolution / techniques: interferometric / space vehicles: instruments / planetary systems
© ESO, 2010
1. Introduction
A new concept of a quasi-achromatic nulling interferometer has been proposed in a preceding publication (Rouan & Pelat 2008,hereafter Paper I). The whole idea is based on cancelling the polynomial part, according to both the wavelength and spatial coordinates, of the starlight distribution at the centre of the focal plane of the interferometer. This objective was attained thanks to a particular setting of phase shifters on the interferometer pupil planes. The advantage of the proposed design depends on a rather simple arrangement, where the key device is a simple square grid of phase shifters (the so-called “chessboard”) on the pupil plane of the telescopes.
In Paper I, several possible chessboards were proposed and a co-axial (Michelson) setting of the telescopes favoured. Here we concentrate more deeply on two high-performance configurations, one designed for the co-axial case and the other for the multi-axial (Fizeau) case. In Sect. 2, we outline the principles that led to the phase distribution and spatial repartition of the chessboards’ cells. A first approximation of the system performances is also given, with particular emphasis on the principles that govern its effectiveness. In Sect. 3, detailed analytical calculations are carried out to determine the chessboards’ nulling performances with more precision and to explore their robustness to a systematic or random piston on one arm of the interferometer.
2. The chessboard design
We recall that our device consists of two square grids of phase shifters each formed of
adjacent cells. Finding a
high-performance chessboard setting implies closely inspecting the complex amplitude of the
electrical field of the interferometer output equipped with this device. If
A stands for this amplitude, the point spread function is given by
PSF = | A | 2. According to Fourier optics one gets
(1)In this expression,
u and v are the Cartesian angular coordinates of a light
ray diffracted by the pupil plane, d is the size of the pupil,
n0 the number of cells on one side of the chessboards, and by
definition
sinc(x) = sin(x) / x.
The symbolic expressions expUV and expPQ stand for two square matrices both with the
“chessboard” format: n0 × n0. The
“dot” operation · between the two matrices is the element-to-element product not
the matrices product. The term Sum( ) realises the sum of all elements of the
matrix under the scope of that operator.
The matrix expUV takes the positioning of the cells into account, namely
the place where the phase shifters are put on the grid of the chessboards. It is given by
(2)In this expression,
δ is the displacement vector. It is a set of integers varying from
−(n0−1) to +(n0−1) by step
of 2, and the symbol ⊗ is the tensor product between the two vectors. For example, if
n0 = 4, we have
δ = (−3, −1,1,3)
and we get
The
second matrix expPQ, takes the values of the phase shifters into account.
Adopting the usual convention that the exponential of a matrix is the matrix of the
exponentials, we have
(5)The matrices
P and Q are the chessboard matrices, of size
n0 × n0. All their elements are
integers that give, in units of λ0 / 2, the
optical path difference (OPD) induced by the corresponding cell of the chessboard. For
example, if P = [ 0 ] and Q = [ 1 ] , there is no OPD
on one telescope but one of λ0 / 2 on the other
one, inducing a phase shift of π at λ0. This
particular setting corresponds to a Bracewell nulling interferomer (Bracewell 1978). Finally, D is the
distance between the two pupils’ centres and Λ is a chromatic variable
(6)(The Λ variable was
named Δ in Paper I.)
To obtain the best achromatism, we propose a progressive approach below where we first look for the best phase-shift distribution and second for their spatial distribution. We start our investigation with the mathematically simpler co-axial design.
2.1. The co-axial chessboards
In the co-axial configuration, the two pupils are superimposed thanks to some optical arrangement. A practical example is given in Fig. 1, where it includes an imaging lens in the last stage since any efficient detection requires concentrating the light beam on a small surface.
![]() |
Fig. 1 A possible optical scheme of the nulling interferometer in co-axial configuration. |
2.1.1. The multigrad solution
The idea, explained in Paper I, is to cancel the chromatic term up to the
Nth order of its Taylor series. For this purpose, we need the Taylor
series expansion of the elements of ejπPΛ and
ejπQΛ matrices. For example, if
ns stands for the numerical values
contained in P, the Taylor expansion, relative to
ε = Δλ / λ0,
is given by (7)where
pk stands for a polynomial of degree
k. (Note that pk is not
a monomial because of the chain rule needed to compute the derivatives.) Of course, a
similar formula exists for
ejπmsΛ
involving the values ms contained in
Q. Cancelling the chromatic term up to order N
implies solving the equation
This
polynomial is zero for any ε if, and only if, all its coefficients are
zero. Therefore we have to solve the system
(10)for
k = 0,...,N. If
cℓk stands for the coefficients of the
polynomials pk, this reduces to
(11)This system can be
solved by induction, because if a solution is found up to order k, the
solution up to order k + 1 is found if
(12)Therefore, the
chromatic term is nullified up to order N if one solves the following
system of equations
(13)for the unknown
integers ns and
ms and the parameter
n0.
As in a Bracewell interferometer, we impose the constraint that
ejπns = +1
and
ejπms = −1,
therefore the ns and
ms are respectively even and odd
integers. As introduced in Paper I, Appendix C, one ends up with a special set of
diophantine equation called a multigrad system, namely
(14)If two sets of
integers, say { ns } and
{ ms } , solve a multigrad system up to
order N, they are called multigrad sets. This property is shortly
written as
. Hereafter, the two sets are supposed to be of
equal size, for short
(adopting the convention that
00 = 1).
The key theorem is that if one has , then it follows that
. This is true for any
c, but is non-trivial only if c ≠ 0 and
{ ms } ≠ { ns } .
In this way, two multigrad sets up to order N generate two new
multigrad sets up to order N + 1. During this operation, the size of
the sets is multiplied by two. But in order that the number of cells on the chessboards
remains a square, the size of the sets must be multiplied by four. This is done by
applying two times the key theorem. Moreover, if the sets
{ ns } and
{ ms } possess the desired parity, we
keep it if we choose an odd constant c, say c = 1. We
get the final result:
if , then
.
This remarkable property is at the heart of the scheme proposed below.
2.1.2. The iterative Bracewell scheme
This scheme runs as follows. One starts with a Bracewell nulling interferometer
and constructs the
next one by two applications of the key theorem. In this way, one first gets
. Indeed, we have
(always with 00 = 1)
(15)As
expected, this new interferometer cancels the chromatic term up to order
N = 2. To construct the next one, we add c = −1
instead of c = +1 in order to gain symmetry and in an effort to keep
the z-heights of the cells as low as possible. Finally, to generate the
successive phase chessboards, we propose the following iterative scheme
(16)where
± means (−1)n (i.e. +1 if n is even
and −1 if it is odd). With this process, the chessboards
Pn and
Qn retain the shape of two squares
possessing n0 = 2n cells on each
of their sides. Their combined effect is to cancel the chromatic term up to order
N = 2n. The higher n, the more
efficent the corresponding nulling interferometer.
Number of cells in the interferometer (Pn,Qn) according to the phase shift exp(jkπλ0 / λ).
The iterative scheme presented above leads to constructing the interferometer
(Pn,Qn)
possessing, for example, the phase-shift distribution shown on Table 1. We observe that the progression of the number of
cells that produce the same phase shift follows that of the binomial coefficients in the
polynomial expansion of (1 + x)2n + 1. This
is easily proved by two applications of the recursive formula ,
which leads to
,
which is our iterative scheme. (In these formulæ we adopt the convention that
if
q ≤ 0.) Table 2 gives the
phase-shift, together with the corresponding number of cells (the formulæ are valid for
two consecutives orders).
Number of cells in (Pn,Qn) producing a given phase shift.
2.1.3. First estimation of the iterative Bracewell scheme performance
In this section we study the nulling efficiency of an interferometer built on this
principle. To compare the interferometers
(Pn,Qn)
in the same conditions, differing only by their order n, the light
collecting surface of their mirrors must be kept constant by reducing the size of the
cells to a square of surface
d2 / 22n.
Because at this point we have not distributed the cells on the pupil plane, we only
consider An, the sum of the amplitudes.
This leads to the formula where
we have set
z = ejπλ0 / λ
and
. The common phase factor
e−jKπλ0 / λ
sets the OPD reference, and the function ⌊ x ⌋ is the “floor”
function (i.e. it returns the greatest integer less than or equal to
x.)
If λ = λ0, then z = −1 and An = 0, therefore the nulling is perfect for any n. The device is strictly equivalent to a Bracewell nulling interferometer. Now, if λ ≠ λ0, since the module of the complex (1 + z) / 2 is less than one, An → 0 as n, the order of the interferometers, increases. This is true for all λ except for the values such that z = 1, i.e. λ = ∞,λ0 / 2,λ0 / 4,...,λ0 / 2k,... Aside from the actual distribution of cells on the pupils, we have obtained a quasi-achromatic nulling interferometer.
However, one must judge the perfomances of the device relative to some benchmark. We
have demonstrated above that, for an on-axis object, our device has powerful nulling
efficiency on a given set of wavelengths. This is of course different for any off-axis
object such as a planet which is best detected when its position induces an
OPD = Dθplanet = λ0 / 2.
In that situation, the recombination, at λ0, is fully
constructive for the planet. Therefore, the interferometer with a phase shift of
π at λ0 added to one of its pupils will
serve as a reference. In so doing, we transform the nulling interferometer into a
non-destructive one for the wavelengths set that was just mentioned above. The phase
addition on all the, say, odd phase shifters changes the sign of the odd,
q = 2p + 1, binomial coefficients in the expression
of An. If stands for the function
obtained in this way, one gets
(19)The only change between
the expressions of An and
is the sign of
z. If
λ = λ0 / 2, then
z = 1 and
, we also have a
nulling interferometer. Again, if
λ ≠ λ0 / 2, since the
modulus of the complex (1 − z) / 2 is also less than
one,
as
n increases. This is true for all λ except for the
values that make z = −1; that is,
λ = λ0,λ0 / 3,λ0 / 5,...,λ0 / (2k + 1),...
The result is an associated “out of phase” nulling interferometer but, fortunately, less
efficient than the original one. The off-axis planet will indeed suffer some attenuation
but much less than the on-axis star, as the ratio of
An to
demonstrates.
We define ρn as , and name it the
relative nulling efficiency. One gets
(20)If
| ρ0 | is less than one and n large
enough, the on-axis nulling can be made more efficient than the off-axis one by any
factor. Note that ρ0(z) is the relative
nulling efficiency of the original Bracewell interferometer.
A deeper understanding of how the iterative Bracewell interferometers works is obtained
through careful analysis of the transformation induced by ρ0
on the complex plane C of the Fresnel representation. In fact, under
ρ0, the unit circle is transformed into the imaginary
axis1. One gets
(21)This means that
ρ0(z), and the Bracewell relative nulling
efficency that follows is obtained by a stereographic projection of z
toward the real 1 (see Fig. 2). The image of the
half-circles
is the segment
(−j,j) shown in green in the figure. All points in this segment
have a modulus less than one, therefore
| ρn(z) | can be made
as small as desired, provided one chooses an interferometer of sufficiently
high-order n. This defines many wavelengths orders, the first one
being:
. Within these bandwiths, an “in
phase” object, say a star, is strongly attenuated while an “out of phase” one, say a
planet, is much less affected. This property opens interesting perspectives for
broadband spectroscopy, or detection, of exoplanets, all the more so for intensity,
since the nulling behaves as
| ρ0 | 4n + 2.
From the dispersion of the optical paths difference ΔOPD produced by the chessboards,
it is possible to obtain a very good approximation of the full width at half maximum
(FWHM) of the planet attenuation spectrum within the full bandwidths,
e.g. . Because the OPDs distribution is
binomial, one gets
or approximately
. Now,
ΔOPD = cΔt and
. By applying the Heisenberg
uncertainty principle, we have
ΔtΔE ≥ h / 4π,
from which we get
(22)The constant of
proportionality should not be very different from one, see Fig. 7 for an example where it is approximately equal to
.
While the planet-to-star ratio greatly increases with the order n of
the chessboards, the effective bandwidth reduction in
implies longer exposure times to get the full range of wavelengths.
![]() |
Fig. 2 Graphic determination of the relative nulling
efficiency ρ0 of a Bracewell nulling interferometer
through a stereographic projection of the Fresnel vector
z = eπλ0 / λ
toward point 1 on the real axis. The relative nulling efficiency of the
iterative Bracewell scheme
(Pn,Qn)
is |
Finally, the proposed scheme generates, from one order to the next, a fourfold increase in the number of cells. In Appendix A, we investigate to what extent so many cells are necessary.
2.1.4. Placing the cells on the chessboards
To actually build the interferometer, it is necessary to distribute the chessboards cells on the pupil plane, an operation that leads to a specific PSF on the image plane. The previous performance analysis is valid for a perfectly parallel light beam, therefore, for any distribution of cells, it is only relevant for the central pixel of the image. For the other pixels, it is necessary to ensure that the cells distribution generates a PSF that does not ruin the mighty nulling power of the chessboards as derived above. This can be done by extending the same idea to the spatial dimension of the amplitude A, as given by Eq. (1).
As mentioned in Paper I, we gain insight into this problem by performing a Taylor
expansion of the expUV matrix to get (23)In this
expression, the symbol [UV] stands for a positioning matrix of size
n0 × n0. For example, in the
case n0 = 4, we have
(24)By convention the
elements of [ UV ] k are the elements of [UV] raised to the
power k. Following the above example, we get
(25)Again, the key
observation is to recognise that the elements of [UV]k are
the sampled values of the 2-D polynomial
(x + y)k. Therefore,
if we were able to arrange the cells in such a way that the expPQ matrix would turn out
to be the coefficients of a finite difference 2-D differential operator
of order k + 1, the expression
expPQ· [ UV ] k would be zero. Indeed all terms of
degree r less than or equal to k in (23) would also be cancelled, since
expPQ· [ UV ] r would be the finite difference
derivative of order k + 1 of a 2-D polynomials of degree less than or
equal to k.
It seems, however, impossible to attain this objective exactly, but it is possible to
approach it fairly closely. The first-order approximation, according to
ε = Δλ / λ0,
of expPQ is (26)In the case where
P is even and Q odd, this formula reduces to
expPQ ≈ −εjπ(P − Q). It is not
too difficult to find an arrangement of cells on the pupil plane that makes
P − Q equal to a matrix whose elements are those of
a finite difference
differential
operator
(fiddop). We found that it is achievable by the following iterative
scheme
(27)In
this expression ± again means (−1)n. (This scheme is the
same as the iterative Bracewell one discussed above. Also
,
, where e.g.
stands for the
transposition of Pn.) It follows, by
induction, that
Pn − Qn
is, as desired, a fiddop of order n. To start the
induction, we have
(28)It is indeed a gradient
operator that, as such, will cancel the first-order (constant) term in the Taylor
expansion of expUV. More generally we have
Therefore,
Pn + 1 − Qn + 1
is the gradient of
Pn − Qn
and, since the later is a fiddop of order n, then
Pn + 1 − Qn + 1
is a fiddop of order n + 1. It is remarkable that all
lines and columns of
Pn − Qn
are a Prouhet-Thue-Morse sequence of −1 and +1 of length
2n, as is clear from the way the
Pn − Qn
were constructed.
We recall that the Prouhet-Thue-Morse sequence Pn is a string of length 2n whose kth element is given by Pn(k) = (−1)s(k) for 0 ≤ k < 2n, where s(k) is the sum of the binary digits of k; e.g. P3 = +1, −1, −1, +1, −1, +1, +1, −1. This sequence is also constructed iteratively by P0 = +1 and Pn = Pn−1 | | −Pn−1, where | | is the concatenation operator (Prouhet 1851; Allouche & Mendès-France 2008).
2.2. The multi-axial chessboards
For this design, we set D ≠ 0 in (5) (see Fig. 3 for a practical realisation).
![]() |
Fig. 3 A possible optical scheme of the nulling interferometer in multi-axial configuration. |


To obtain A(−u,v) = A(u,v), and because the sin(πuD) function is odd, it is sufficient to make sure that Sum((ejπPΛ − ejπQΛ)·expUV(u,v)) is even. It is clear from Eq. (3) that expUV(−u,v) is equal to expUV(u,v) after a symmetry around the pupil’s y-axis. Therefore, if P and Q were left invariant following the same symmetry, our goal would be attained. Moreover, since the Michelson chromatic term must also be made as low as possible, one must use the co-axial chessboards, one way or the other. One solution is to put the co-axial chessboards, shrunken by a factor of two in the upper right corner of the multi-axial chessboards, for example, and to fill the empty cells by means of successive x and y symmetries. This setting already gives good results, but one can gain more nulling power if one requires that A(−u, −v) = A(u,v).
![]() |
Fig. 4 The |
To achieve this objective, one starts with the co-axial chessboard shrunk by a factor of
two, and one again put it in the upper right-hand corner of the multi-axial chessboard
under construction. We fill the lower right-hand corner by a x-axis
symmetry with a change of sign and finally we fill the remaining
left-hand side half plane by a y-axis symmetry, again with a change in
sign. We symbolically note ‘unfold’ this procedure. If
Pn−1 and
Qn−1 stands for two uni-axial
chessboards, we construct the corresponding multi-axial chessboards
and
by way of the operations
(33)(
and
have the same number of cells
as Pn and
Qn). We have, for example,
(34)
(35)The “unfold” procedure
offers many advantages.
-
1.
Since the combination (Pn−1,Qn−1) cancels the chromatic term up to order N = 2(n − 1), the
is expected to cancel it, at least up to the same order N = 2n − 2. In fact, the change in sign during the unfolding process implements the following property: if
, then
. Therefore the pair
cancels the chromatic term up to order N = 2n − 1.
- 2.
We expect
to be, as Pn−1 − Qn−1, a fiddop of, at least, order n − 1. This is true if n is even, but thanks to the symmetries of the Prouhet-Thue-Morse sequence, this order is increased to n if n is odd.
- 3.
Finally, the symmetry of the multi-axial chessboards is such that it cancels the [ UV ] k matrices when k is odd (see Eq. (25)).
Because of these supplementary properties, the chessboards designed for the multi-axial case are expected
to be just slightly less successful than their co-axial
(Pn,Qn)
counterparts.
The nulling power of the chessboards considered in this paper.
Table 3 is a summary of the nulling properties of the chessboards considered in this paper.
3. Performance of the design
A first evaluation of the performances of the chessboards have been established in the preceding section; however, it is possible to obtain far more detailed information by working out the analytical expression of the electrical field amplitude. This is made possible because relatively simple operations allow passing, almost mechanically, from one chessboard of order n to the next one. These are, in the following order,
-
start with a Bracewell setting;
-
shrink the chessboards by a factor of two;
-
shift it to its proper place;
-
add a phase; and
-
perform symmetries with a change in sign.
All these operations, when performed on the pupil plane, have simple Fourier equivalents on
the image plane. Therefore, a computation of the amplitude function can be considered. A
straightforward, though lengthy, analytical work leads to the formulæ(36)
(37)where
Pn and
Qn are, in this context, two functions on
the image plane. They are not the chessboard matrices. The expressions for
Sn and
Dn differ according to the interferometer
setting. In the co-axial case, we set D = 0 and obtain
(38)while in the
multi-axial one, D ≠ 0 and we find
(39)
(40)
The
and
functions reflect the self
similarity of the iterative Bracewell scheme and may be called the “nulling functions”:
For
a given λ, the
θu,θv
are the spatial variables and φ ∗ is the
chromatic variable (because it depends on
λ0). We then obtain
(45)
(46)One recalls that:
D is the separation between the two telescopes, d the
mirror size, λ the wavelength considered, λ0
the wavelength for which the chessboards are tuned and u,v are the
Cartesian angular coordinates of the light diffracted by the pupils. Figure 5 shows the theoretical image of a planetary system similar
to ours seen with the
device. The image was calculated using these
formulæ.
![]() |
Fig. 5 Image of a star system seen through a Michelson interferometer
d / D = 1 equipped with the
multi-axial |
To account for a possible path difference between the two arms of the interferometer, we
introduce a systematic piston on one arm. If ϵ is the piston value, instead
of (36), we have (47)
(48)
3.1. Extinction of the star, attenuation of the planet, and planet-to-starlight ratio
Let us define as the residual stellar
light intensity. We have
. We consider this function
successively for the co-axial and multi-axial cases below.
3.1.1. Co-axial case
Residual light from the star.
One finds, in the co-axial case (49)The
starlight extinction spectrum shown in Fig. 6 is
defined as the integration
. Of course, as expected, the nulling
effect may reach considerable values over a large bandwidth. For example, an
extinction factor of 105 can be easily achieved over the bandwidth
6.2–17.5 μm by an interferometer of order n = 7
(orange curve) tuned at λ0 = 9.05μm.
![]() |
Fig. 6 Extinction spectrum of a star by iterative Bracewell nulling
uni-axial (Michelson) interferometers of order
n = 0,1,...,12.
The original Bracewell n = 0 is in black, order 1 in yellow, 2
in red, etc. The x-axis is the wavelength, either
Δλ / λ0
(bottom) or λ in μm for
λ0 = 9.05 μm
(top). The |
![]() |
Fig. 7 Attenuation spectrum of a planet following the same setting, except for the
linear y-axis, as in Fig. 6. Near the wavelength λ0, where the
device is tuned, the planet only suffers very small attenuation. The full width
at half maximum (FWHM) is given, with a very good
approximation, by |
To understand more deeply how the nulling works, one considers
at the centre of the
image plane (i.e. θu = 0 and
θv = 0), leading to
(50)Finally, we
suppose a perfect design: ϵ = 0. One ends up with the chessboard’s
extinction capability characterised by
(51)As n
goes to infinity, the starlight residual goes to zero at all wavelengths, except those
where the cosine is equal to ± 1; that is,
λ = ∞,λ0 / 2,λ0 / 4,...,λ0 / 2k,...
Light from the planet.
Since it is off-axis, the planet wavefront is tilted relative to the pupil plane. As
in Paper I, we assume that the planet was observed when this tilt induced a phase
shift of π at λ0 between the two arms of
the interferometer. If, on the pupil plane, we neglect the effect of the tilt, the
residual planet intensity is given by , where
.
In this expression r(λ) stands for the planet/star
intensity ratio at λ, typically r ≈ 10-6
in the thermal infrared. At the image centre, we have
(52)and, in the
absence of piston,
(53)As n
goes to infinity, the planet light also goes to zero at all wavelengths, except those
where the sine is equal to ± 1, that is:
λ = λ0,λ0 / 3,...,λ0 / (2k + 1),...
Figure 7 shows the attenuation spectrum of the
planet.
Planet-to-starlight ratio.
To successfully detect and to do the spectroscopy of an exoplanet, one must reduce
all sources of noise. For the lower levels n of our device, the
starlight residuals play a predominant role over other sources of noise (such as the
exozodiacal light for example). One must therefore consider the ratio
, which gives the planet
residual intensity relatives to the star residual intensity. This quantity may be
called the planet-to-starlight ratio and denoted
rn. Typically, if
rn ≥ 1, the source of noise from the
star ceases to be dominant and the planet is likely to be detected. We obtain
(54)where again
± means + if n is even and − if it is odd. If
n → ∞, and to the extent that
| tanφ | >1, then
rn → rcot2φϵ.
This expression reduces to
rn ≈ r(λ)(λ / πϵ)2
for low piston value. For example, if r = 10-6,
λ = 10μm, and ϵ = 3nm, we get
rn ≈ 1. That means that a planet
106 times fainter than its star is likely to be detected at
10 μm even with a piston of 3nm. If the piston is increased by a
factor of 3, then the planet-to-starlight ratio diminishes by approximately a factor
of 10. Figure 8 shows the planet-to-starlight
ratio for a zero piston.
This discussion is, for the central pixel and a zero piston, in accordance with the previous analysis in 2.1.3. In particular, we have r / rn = | ρn | 2.
![]() |
Fig. 8 Planet-to-starlight ratio spectrum of a planet 106 times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective. |
The planet-to-starlight ratio bandwidth.
The condition | tanφ | >1 defines the
planet-to-starlight ratio bandwidth or, for short, the contrast bandwidth. If
ϵ = 0, it defines a first order where
rn goes to infinity like
tan4n + 2φ as n (the
interferometer order) increases. The contrast bandwidth of the first order is
π / 4 ≤ φ ≤ 3π / 4;
which is, . For example, if
λ0 = 9μm, one gets the bandwidth
6μm ≤ λ ≤ 18μm, a range
consistent with the 6–17 μm one demanded by the darwin
spatial project.
Higher orders do indeed exists. The kth order is given by
,
k = 1,2,...;
elsewhere rn goes to zero. If
ϵ ≠ 0, the bandwidth is shrunk. One can show that the first order
is now roughly given by
φ1 ≤ φ ≤ π − φ1,
with
. (This formula is valid for
ϵ ≠ 0 and a sufficiently high n, say
n > 5.).
3.1.2. Multi-axial case
The discussion is about the same as the previous one; however, there is no simple
formula like Eq. (49), the more general
Eq. (47) must be used instead. If one
considers the image centre, one obtains From
this one gets the planet-to-starlight ratio
(57)Again, if the piston
ϵ is zero, the planet-to-starlight ratio goes to infinity like
tan4nφ, as n, the
interferometer order, increases. This defines the many spectral orders:
, and the first one
is the most extented.
If the piston is not zero, the planet-to-starlight ratio is approximately multiplied by a factor cot2φϵ ≈ (λ / πϵ)2 on approximately the same reduced bandwidth φ1 ≤ φ ≤ π − φ1 as in the co-axial case. Figure 9 shows this ratio for a piston of zero.
![]() |
Fig. 9 Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ0 / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics. |
3.2. Introduction of a random piston
Given the rather tight specifications on the value of the acceptable piston, the new
concept seems more suited to a spatial project. However, with the advent of modern
fringe-tracking techniques, it is possible to consider a ground-based instrument located
on a site of excellent quality, such as in Antarctica. In that case, the rapid atmospheric
fluctuations must be taken into account, and the residual differential piston
ϵ between the P and Q pupils after
correction by a fringe tracker, must be considered as a random variable. In this
situation, the residual light function must be
replaced by its expectation
. To avoid repetition, we focus on the
multi-axial case, because the co-axial one is very similar.
To compute , we write
(60)To
evaluate E { cos 2φϵ } , we take advantage
of the fact that, if X is a random variable, then
E { cos X } is the real part of its characteristic function
Z(ω) = E { ejωX }
evaluated at ω = 1. Furthermore, if X is a
Gaussian random variable of mean m and standard
deviation s, its characteristic function is
. By
setting X = 2πϵ / λ,
we obtain m = 2πμ / λ
and s = 2πσ / λ, where
μ is the mean and σ the standard deviation of
ϵ. It follows that
(61)It is convenient to
express this result in terms of the equivalent piston
defined as
. An important example is
the unbiased case where μ = 0. If
σ / λ is small, one obtains
E { cos2φϵ } = exp(−2π2σ2 / λ2) ≈ 1−2(πσ / λ)2
and since
, then the
equivalent piston is equal to σ.
The main result of this section is that, in the case of Gaussian random fluctuations of
the piston around zero, the expected residual light function is equal to its deterministic
value for the standard deviation of the piston; that is,
(62)Figures 10 and 11 show the
planet-to-starlight ratio (or its expectation) for a piston of 3 nm.
![]() |
Fig. 10 Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ0 = 9.05 μm. |
4. Conclusion
This paper presented a more complete analysis of a very efficient nulling interferometer, which appears as an iterative version of the well known Bracewell nulling interferometer. With the help of extended analytical computations of the amplitude expression, we show that the device acts as an optical differential operator on the 3D light distribution, i.e., spatially and chromatically.
We considered two kinds of systems: the
(Pn,Qn)
chessboards designed for the Michelson setting and the chessboards designed for the Fizeau one. The
(Pn,Qn)
cannot be used in the Fizeau case because their PSF are not axisymmetric. The
, however, may be considered for a Michelson
interferometer because their performances are just slightly inferior to their Fizeau
equivalents and present the advantage of an axisymmetric PSF.
The net effect of the device discussed in this paper is to “push away” the stellar light from the centre of the image, leaving room for an “out of phase” object to appear in the area now left free. This takes place for a very broad range of wavelengths if one considers the planet-to-starlight ratio. A design of order n = 7 tuned at λ0 = 9.05μm can meet the darwin space project specifications, i.e., to cancel the starlight by a factor of 105 over a wavelength range of 6–17 μm. Since the planet light is also diminished, the order n should not be pushed far beyond the point where the planet-to-starlight ratio is equal to one. For example, with n = 7, the starlight residuals are reduced below the planet light on most of the darwin bandwidth, while one still has more than 50% of the planet light over 20% of the bandwidth. Therefore, the full nominal darwin bandwidth can be covered for the planet by adjusting the device successively to five different wavelengths λ0.
For an effective implementation of this device on the sky, a detailed study of the signal-to-noise ratio should be made prior to decide to what order the interferometer should be constructed. This choice must result from a compromise between the depth of the nulling and the effective planet-light bandwidth.
Imperfections in design have been considered in the form of a deterministic or a random piston between the two pupils of the interferometer. A systematic piston or equivalently a standard deviation of 3 nm seems to lead to acceptable performances in the case of the darwin project but also for a ground-based interferometer.
More precisely, ρ0 performs a Möbius transformation of C. In
that case, it is a reflection (inversion) relative to the circle of center 1 and radius
,
followed by a reflection about the imaginary axis (see e.g. Ratcliffe, J. G. 1994,Chap. 4). This transformation leaves invariant the two
points j and −j.
Acknowledgments
We are very grateful to Raphaël Galicher for a careful reading of the manuscript and to the referee whose comments have greatly helped clarify and improve this paper. This work received the support of PHASE, the high angular resolution partnership between ONERA, Observatoire de Paris, CNRS, and University Denis Diderot Paris 7.
References
- Allouche, J.-P., & Mendès-France, M. 2008, Monatshefte für Mathematik, 155, 301 [CrossRef] [Google Scholar]
- Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]
- Buisset, C., Rejeaunier, X., Rabbia, Y., et al. 2006, in Advances in Stellar Interferometry, ed. J. D. Monnier, M. Schöller, & W. C. Danchi, SPIE Conf. Proc., 6268, 626819 [Google Scholar]
- Prouhet, E. 1851, Comptes Rendus des Séances de l’Académie des Sciences, 33, 225 [Google Scholar]
- Ratcliffe, J. G. 1994, Foundations of Hyperbolic Manifold, 1st edn. (New York: Springer-Verlag) [Google Scholar]
- Rouan, D., & Pelat, D. 2008, A&A, 484, 581 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Wallner, O., Perdigues Armengol, J. M., & Karlsson, A. L. 2004, in New Frontiers in Stellar Interferometry, ed. W. A. Traub. (Bellingham, WA: The International Society for Optical Engineering), SPIE Conf. Proc., 5491, 798 [Google Scholar]
Appendix A: Constraints on the number of cells
Here we try to find a lower limit to the number of cells S constituting the chessboards for a given even nulling order N. We recall that the numeric values contained in the Pn and Qn chessboards are even and odd integers, respectively.
We find that our design is optimal for N = 2 and 4, but the question remains open for N ≥ 6.
Cancelling up to the second nulling order.
If ns and
ms are the P and
Q matrices elements, we must have
(A.1)where Σ is the common
sum. Since the ns are even, Σ must be
divisible by 4. Because ms is odd, we have
ms = 2ks + 1
and
Therefore
the number of cells in each chessboard must be a non-zero multiple of 4. We have shown
by the construction of the P1 and
Q1 chessboards that S = 4 was sufficent,
and this is the minimum number of cells required to cancel the chromatic term up to
second order.
The above result is a consequence of the property that
(2k + 1)2 ≡ 1(mod4). In fact one can show that
(2k + 1)2 ≡ 1(mod8) that, in turn, is a particular case
with p = 1 of the more general result (A.5)which one can easily
establish by induction.
Cancelling up to the fourth nulling order.
From (A.5) with p = 2, we obtain (2k + 1)4 ≡ 1(mod16). Then we deduce that S ≡ 0(mod16). Again, we show that the 16 cells in P2 and Q2 were sufficent to cancel the chromatic term up the fourth order, and it is the minimum.
Cancelling beyond the fourth nulling order.
One can use (A.5) to sets limits on
the sixth order cancelled by P3 and
Q3, but they are too loose. (Unless one can prove that, if
22n is a divisor of then, for
n > 1, it is also a divisor of
). All
that can be said is that the number of cells must be a multiple of 16, while
P3 and Q3 possesses 64. Up to
now, as soon as the nulling order N ≥ 5, no tighter limits than those
deduced from (A.5) are known to the
authors regarding the number of cells necessary to cancel the chromatic term up to that
order.
All Tables
Number of cells in the interferometer (Pn,Qn) according to the phase shift exp(jkπλ0 / λ).
All Figures
![]() |
Fig. 1 A possible optical scheme of the nulling interferometer in co-axial configuration. |
In the text |
![]() |
Fig. 2 Graphic determination of the relative nulling
efficiency ρ0 of a Bracewell nulling interferometer
through a stereographic projection of the Fresnel vector
z = eπλ0 / λ
toward point 1 on the real axis. The relative nulling efficiency of the
iterative Bracewell scheme
(Pn,Qn)
is |
In the text |
![]() |
Fig. 3 A possible optical scheme of the nulling interferometer in multi-axial configuration. |
In the text |
![]() |
Fig. 4 The |
In the text |
![]() |
Fig. 5 Image of a star system seen through a Michelson interferometer
d / D = 1 equipped with the
multi-axial |
In the text |
![]() |
Fig. 6 Extinction spectrum of a star by iterative Bracewell nulling
uni-axial (Michelson) interferometers of order
n = 0,1,...,12.
The original Bracewell n = 0 is in black, order 1 in yellow, 2
in red, etc. The x-axis is the wavelength, either
Δλ / λ0
(bottom) or λ in μm for
λ0 = 9.05 μm
(top). The |
In the text |
![]() |
Fig. 7 Attenuation spectrum of a planet following the same setting, except for the
linear y-axis, as in Fig. 6. Near the wavelength λ0, where the
device is tuned, the planet only suffers very small attenuation. The full width
at half maximum (FWHM) is given, with a very good
approximation, by |
In the text |
![]() |
Fig. 8 Planet-to-starlight ratio spectrum of a planet 106 times fainter than the star around which it revolves, seen through a uni-axial (Michelson) interferometer. When the value on the y-axis is greater than 0, it is seen more light from the planet than from the star, but at the same time the effective bandwidth of the planet is reduced. A value slightly above 0 seems to be an acceptable objective. |
In the text |
![]() |
Fig. 9 Planet-to-starlight ratio spectrum. Same as Fig. 8 but for a multi-axial (Fizeau) setting. The integration has been performed by means of a single mode cylindrical fibre optics of radius λ0 / D, where D is separation of the two mirrors. The abrupt cut-off at approximately 16.5μm is the long wavelength cut of the fibre optics. |
In the text |
![]() |
Fig. 10 Planet-to-starlight ratio spectrum. Same as Fig. 8, but suffering a piston of 3 nm or a random unbiaised Gaussian piston of σ = 3nm, while the device is tuned at λ0 = 9.05 μm. |
In the text |
![]() |
Fig. 11 Planet-to-starlight ratio spectrum. Same as Fig. 10 but for a multi-axial Fizeau setting. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.