A&A 397, 1161-1172 (2003)
DOI: 10.1051/0004-6361:20021573
R. Soummer 1 - C. Aime 1 - P. E. Falloon 2
1 - UMR 6525 Astrophysique, Faculté des Sciences, Université de Nice Sophia Antipolis, Parc Valrose, 06108 Nice Cedex 2,
France
2 -
School of Physics, The University of Western Australia, Crawley, WA 6009, Australia
Received 16 September 2002 / Accepted 25 October 2002
Abstract
This paper generalizes to circular apertures the theoretical study of stellar coronagraphy with
prolate apodized rectangular entrance apertures of Aime et al. (2002). The main difference between the two
studies is that circular prolate spheroidal functions are used for a circular aperture instead of
linear prolate spheroidal functions for rectangular apertures. Owing to the radial property of the
problem, the solution to the general equation for coronagraphy is solved using a Hankel transform instead of
a product of Fourier transforms in the rectangular case. This new theoretical study permits a better
understanding of coronagraphy, stressing the importance of entrance pupil apodization. A comparison with the
classical unapodized Lyot technique is performed: a typical gain of 104 to 106 can be obtained
theoretically with this technique. Circular and rectangular apertures give overall comparable results: a
total extinction of the star light is obtained for Roddier & Roddier's phase mask technique whilst optimal
starlight rejections are obtained with a Lyot opaque mask. A precise comparison between a circular aperture
and a square aperture of same surface favors the use of a circular aperture for detection of extrasolar
planets.
Key words: instrumentation: high angular resolution - techniques: high angular resolution - stars: planetary systems
The interest in stellar coronagraphy is relatively recent (Bonneau et al. 1975; Smith & Terrile 1984; Malbet 1996; Beuzit et al. 1997; Mouillet et al. 1997), and nowadays mainly stimulated by the exoplanets searches. The direct imaging of an exoplanet is the most exciting objective for high contrast imaging techniques: it will provide valuable information such as its mass (removing the ambiguity due to the radial velocities technique), its albedo and its atmosphere chemical composition if spectral information can be obtained.
Several other scientific objectives, with different instrumental requirements, may also benefit from the techniques developed for high contrast imaging. Among them, the circumstellar disks for a better understanding of their dynamic and the physics of the grains (Mouillet 1997), the environment of evolved stars which could also include old planets, the Active Galactic Nucleis for a better imaging of the central jets. Since the first indirect exoplanet detection (Mayor & Queloz 1995), several techniques have been proposed to reach high contrast imaging (Gay & Rabbia 1996; Roddier & Roddier 1997; Rouan et al. 2000; Abe et al. 2001).
In a previous paper (Aime et al. 2002), we analyzed the theoretical aspects of the association of coronagraphy (Lyot et al.) with entrance aperture apodization and derived the integral equation for total coronagraphic extinction of the star light. This integral equation depends on the shape of the apodized aperture.
For the rectangular aperture case we analyzed (Aime et al. 2001), the apodization solution is given by the linear prolate spheroidal functions discovered by Slepian & Pollak (1961). These functions have already found many applications in optics, as well described by Frieden (1971). For coronagraphy, the relevant particularity is that these functions are invariant to a finite Fourier transform.
We showed that a theoretical total extinction can be obtained with a perfect Roddier & Roddier coronagraph. For Lyot coronagraphy, we showed that though a total extinction is not possible, very interesting solutions can be obtained with the prolate apodized coronagraph: much smaller masks than for the classical technique are required, and very high dynamic may be obtained, depending on the mask size and apodization. Although less powerful than the phase mask technique, the Prolate Apodized Lyot Coronagraph (PALC) was found to remain interesting because it is easier to implement and much less sensitive to chromatic effects in a wide band experiment. The technique for a square aperture was also compared to the Apodized Square Aperture concept that considers apodization alone (Nisenson & Papaliolios 2001; Soummer et al. 2002b).
The present paper generalizes the study to the case of coronagraphy with an apodized circular aperture. In this case, the solution for the apodization function is given by the circular prolate spheroidal functions developed by Slepian (1964) and Heurtley (1964). The complexity of the formalism was the occasion for us to go much deeper into the theoretical aspects of the problem, to find new analytical simplifications and bring a lot of interesting new results.
The present study is restricted to a perfect circular aperture, operating in space, for a monochromatic unresolved on-axis star. However, the equations considering a polychromatic source are also given and discussed. This is an aspect in which the PALC may perform better than the phase mask technique. The purpose of this study is to settle the theoretical background for circular apertures and give the ideal performance attainable for such a coronagraph. A more technical study of a realistic instrument will be realized in a future work and compared to the perfect case.
The general formalism to describe a coronagraphic experiment with an apodized aperture has been given by
Aime et al. (2002). They showed that a complete extinction of the starlight could be obtained with a coronagraph
implemented at the focus of an apodized telescope. Denoting P(x,y) the aperture transmission
function (
P(x,y)=1 inside the aperture, and 0 outside), and
an apodization function
(
), the total extinction is obtained if the following equation is satisfied:
With a rectangular aperture, the problem is simplified because this two dimensional equation separates into the product of two symmetric one dimensional equations. The apodization function solution is equal to 1 at the center of the rectangular aperture and decreases towards the edges as the product of two linear prolate functions aligned along the length and the width of the rectangle.
With a circular aperture, the problem is much more complicated since we cannot separate anymore the solution
into two linear conditions in x and y. Fortunately, the radial properties of the aperture and mask shape
make the solution possible: a two dimensional radial function
can be represented by its
radial cut
,
where
is the modulus of the position vector
.
For
the sake of simplicity, we will use the same notation for these two functions, the interpretation being
straightforward for the alert reader, depending on the context. In particular, the computations developed
using Fourier transforms in the rectangular separable case, are here solved by means of Hankel transforms.
We consider a circular aperture of diameter D apodized by a radial apodization function :
.
We consider the four successive planes denoted A (telescope pupil), B (telescope
focus, coronagraphic mask), C (relay pupil plane, Lyot Stop) and D (final focus plane)
The complex amplitude at the entrance apodized pupil plane is proportional to:
The wave propagation between each planes (A, B, C, D) writes as a scaled Fourier Transform (FT) and we
assume that the optical layout is properly designed to eliminate the quadratic phase terms associated with
the propagation of the waves (Goodman 1996). The FT of a radial function is also a radial function, which
radial cut (for )
can be expressed using the Hankel Transform (HT):
An optical system is then used to obtain an image of the telescope aperture in plane C. Here also, the
complex amplitude of the wave is written using a scaled HT of Eq. (5). For simplicity, we assume that the focal lengths of the successive optical systems are identical (if not, an appropriate change of variables leads to a similar result). In the relay pupil plane C, we denote by
the amplitude before the application of the diaphragm (Lyot Stop):
Note that the kernel
is analog to the sine cardinal kernel for the rectangular
aperture coronagraphy (Aime et al. 2002).
Considering for example Eq. (6), the coronagraphic effect appears as a subtraction of two wave amplitudes in the relay pupil plane. Though the coronagraphic effect is expected in the final focal plane D, it is here more convenient to consider the relay pupil plane. Ideally, we seek for an exact wave subtraction, within the entire exit pupil, or at least, the best subtraction possible.
The integral in Eq. (7) or Eq. (8) must be proportional to ,
to permit an exact
subtraction of the two wavefronts inside the aperture in the relay pupil plane: the apodization function
must be the eigenfunction of this kernel.
The solutions
are given by the circular prolate functions, which are the eigenfunctions of this
eigenvalue problem. We give a brief presentation of them in Appendix B.
For the apodization function
(normalized circular prolate function):
![]() |
Figure 1:
Roddier & Roddier coronagraphy with prolate entrance pupil apodization. A) Eigenvalue ![]() ![]() |
Open with DEXTER |
For Roddier & Roddier's coronagraphy, the
phase mask imposes
.
The amplitude inside the
exit pupil (Eq. (12)) becomes:
The transmission of the corresponding apodization is represented in amplitude in Fig. 1. This solution
is in excellent agreement with the solution obtained numerically by Guyon & Roddier (2000). A comparison between the
analytical and numerical solution is made in Soummer et al. (2002a). It is also necessary to know the energy
throughput of the entrance pupil. Using the orthogonality relation for circular prolate functions (Eq. (B.5)), the throughput (normalized to the throughput of the raw circular aperture) finds a simple
expression:
For the solution of Roddier & Roddier, we find
In the focal plane B (the phase mask plane), the amplitude of the wave (before the phase mask) can be
expressed analytically, using the invariance of the circular prolate functions to a finite HT (Eq. (B.1)):
This expression can also be used as a good approximation for the planet response. Indeed, for an off-axis
point source, situated at a distance well outside the coronagraphic mask, the extinction can be neglected and
the planet focal intensity is similar to the apodized aperture without coronagraph:
Note that the loss of resolution due to the apodization of the entrance pupil is very low (5%): the
first zero of the impulse response is at
,
instead of
for the classical
unapodized Airy's pattern (the apodization required for Roddier & Roddier is very light), and the equivalent
widths are almost the same to 1% precision.
![]() |
Figure 2:
Approximated solutions for Lyot coronagraphy with prolate apodization. The solutions are given for three arbitrary eigenvalues close to 1:
![]() |
Open with DEXTER |
For this total extinction solution (Roddier & Roddier), the wave amplitude in plane A is the truncated
central part of the prolate solution
(Eq. (2)). Considering Eq. (11), it
appears that in plane C, the wave before the Lyot stop is the complementary external part of the same prolate
(
for R&R). The conservation of energy imposes to the star energy (collected
in plane A), to be totally
rejected outside the pupil aperture in plane C (and eliminated by the Lyot stop).
This physical requirement can be checked mathematically, using the circular prolates properties, following
Frieden (1971). Using Eqs. (B.7) and (B.8), with
,
we can write:
![]() |
Figure 3: Lyot coronagraphy with increasing mask size (and increasing eigenvalues and apodization strength). Top: residual energy for the star, normalized to the entrance pupil transmission. Center: illustration of the loss of transmission for the approximate solutions due to the increasing strength of the apodization. Bottom: illustration of the loss of resolution due to the apodization. The full line represents the position of the first zero of the PSF and can be compared to the half width of the mask (Dashed line). The three values at a=1.96, 2.90 and 3.74 correspond to the examples shown in Fig. 2. |
Open with DEXTER |
For Lyot coronagraphy, the opaque mask imposes
.
The amplitude in the relay pupil plane
(Eq. (12)) becomes:
However, approximate solutions can be obtained for eigenvalues
close to 1 and finite mask size:
taking advantage of the rapid saturation of the eigenvalue curve (Fig. B.2), we can choose a mask
size a corresponding to an eigenvalue close to 1, so that the residual amplitude (Eq. (22))
remains close to zero. A corresponding prolate apodization exists but is no longer an exact solution and a
residual amplitude exists.
The surprising result is that the residual amplitude in the pupil plane (Eq. (22)) is itself
proportional to the initial prolate function, to the factor
.
The overall effect of Lyot
coronagraphy with prolate apodization is here simply an attenuation coefficient for the on-axis point source.
The residual star intensity is then itself apodized. Moreover, in plane D, the energy is maximally concentrated within a surface equivalent to the coronagraphic mask (prolate fundamental property Eqs. (B.7) and (B.8) (Slepian 1964; Frieden 1971). This is an important difference with classical Lyot coronagraphy, for which the residual pupil intensity in plane C is known to be maximum at the edge of the pupil (opposite effect to apodization). This is why prolate apodizations are the optimal solution for Lyot coronagraphy, in terms of maximum residual star energy concentration.
Following Eq. (22), the residual intensity
for the on-axis star, is then simply reduced by the factor
compared to the intensity with the apodizer alone (without the mask) Eq. (17):
The integrated residual energy is simply reduced by the factor
compared to the throughput Tof the apodizer alone (Eq. (14)):
The integrated star residual energy is not the most pertinent criterion to evaluate the performance, since
the high contrast imaging problem comes from the central source diffracted light. A better criterion is to
consider the fractional energy in the residual star diffraction wings, relatively to the instrument
throughput:
The parameters and performances for these different techniques are summarized in Table 1, including typical
values for the residual diffracted light at different positions (
,
and
).
![]() |
Figure 4:
Comparison of the fractional wings energy (in log scale) for classical unapodized Lyot and prolate apodized Lyot, as a function of the mask size. Full line (top curve): unapodized Lyot without Lyot stop reduction. The two similar curves underneath correspond to a Lyot Stop reduction of ![]() ![]() |
Open with DEXTER |
Technique | eigenvalue | Throughput | Mask | Residual | Wings | level @ | level @ | level @ |
![]() |
T | a | starlight |
![]() |
![]() |
![]() |
![]() |
|
R&R Prolate |
1/2 | 72.6% | 1.06/D | 0 | 0 | 0 | 0 | 0 |
Lyot Prolate | 0.9 | 41.6% | 1.96/D | 10-2 | 10-3 |
![]() |
10-6 | 10-7 |
Lyot Prolate | 0.99 | 25.7% | 2.90/D | 10-4 | 10-6 |
![]() |
![]() |
10-10 |
Lyot Prolate | 0.999 | 19.0% | 3.74/D | 10-6 | 10-9 |
![]() |
![]() |
10-13 |
Lyot 100% LS | - | 100% | 4/D |
![]() |
![]() |
![]() |
![]() |
![]() |
Lyot 90% LS | - | 81% | 4/D |
![]() |
![]() |
![]() |
![]() |
![]() |
Lyot 80% LS | - | 64% | 4/D |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Figure 5:
Amplitude aperture transmission (left) and PSF (right), represented in Plane B, for Roddier
& Roddier coronagraphy, and a circular (top) or square (bottom) aperture. The ![]() ![]() ![]() |
Open with DEXTER |
In this section, we compare the technique of coronagraphy with prolate apodization for square or circular
apertures of same collecting surface S, i.e. a square aperture of length
.
With Roddier & Roddier coronagraphy, both shapes can perform a total extinction: in the case of a square
aperture, the apodization solution consists of a product of linear prolate functions whilst for the
circular aperture, the apodization is a circular prolate function. An illustration of the solutions
for Roddier & Roddier is given in Fig. 5: the apodization (in amplitude) and the impulse
response without coronagraph (or the response for a planet well outside the mask). The corresponding phase
masks are also shown in the figures as a black disk and square. The surface of the circular mask (0.69/S)is slightly smaller than that of the square mask (0.72/S). The throughputs are similar, with again a slight
advantage for the circular solutions (
vs.
).
The comparison is also interesting for Lyot coronagraphy. Let us first remind the expression of the residual
star intensity for a square aperture and a Lyot coronagraph, already given by Aime et al. (2002). We assume a
symmetrical configuration for the square aperture of length L (square coronagraphic mask of size m with
the same eigenvalues
):
This can be explained easily as follows: comparing Eqs. (23) and (30), we see that the coronagraphic effect, for an on-axis point source, is simply the multiplication of the focal PSF without coronagraph by the factors
and
,
respectively for a square and circular aperture. These values are different for a same
throughput. This is clearly visible in Fig. 6 where the residual level intensity at the center
of the pattern are
for the square aperture and only
for the circular one. Then the behavior of the residual light is given by the telescope PSF: the square aperture presents a lower residual intensity in four quadrants, but two strong diffraction patterns along its main axes.
![]() |
Figure 6: Comparison of the residual star intensities (in log scale) for Lyot coronagraphy with prolate apodizations for square apertures (dashed lines) and circular apertures (full lines identical in the two figures). Curves are drawn for apertures of same surface and an identical throughput of 30%. Top: circular aperture and square aperture (axial cut). Bottom: circular aperture and square aperture (diagonal cut). |
Open with DEXTER |
![]() |
Figure 7:
Comparison of the residual star intensities for Lyot coronagraphy with prolate apodizations for square
apertures and circular apertures. The two apertures have the same surface, and the same intensity throughput ![]() |
Open with DEXTER |
The fractional wings energy, introduced above (Eq. (26)) can also be used as a criterion
for a comparison between the residual diffracted levels for the circular and square apertures. The
expressions for the fractional wings energy are respectively:
![]() |
Figure 8: Comparison for Lyot coronagraphy with prolate apodizations between square apertures and circular apertures (same surface) as a function of the throughput of the entrance apodization. The curve represents the ratio between the fractional wings energy for the square and the circular apertures. The circular aperture gives a lower residual energy in the diffraction wings, especially with increasing eigenvalues (respectively decreasing throughput). |
Open with DEXTER |
In this paper we have developed a theoretical study for coronagraphy of circular apertures with entrance pupil apodization, following the previous study for rectangular apertures (Aime et al. 2002). All the results for the rectangular configuration find their analog with circular apertures. The formalism is more complex, but the physical contents are similar. We have shown that the natural apodization functions for Lyot's coronagraphy and Roddier & Roddier's coronagraphy are the circular prolate spheroidal functions. We have proven that a total extinction of a monochromatic unresolved on-axis star can be obtained theoretically using a circular aperture apodized by circular prolate functions and Roddier & Roddier's phase mask coronagraphy.
The study we have made is for a monochromatic wave front. R&R's coronagraphy suffers from a double chromatism problem, the size chromatism and the phase shift chromatism compared to other phase shifting techniques, such as the wavelength-independent CIA (Gay & Rabbia 1996; Baudoz et al. 2000a; Baudoz et al. 2000b) or the less sensitive 4QC technique (Rouan et al. 2000), or the achromatic PKC (Abe et al. 2001, 2003). However, these techniques have other drawbacks: the CIA does not produces images, and the 4QC/PKC has a lower efficiency along the axis.
For Lyot coronagraphy, only a size chromatism exits: the mask size is adapted to the wavelength .
At the wavelength
,
the diffraction pattern is not of the same size and the mask size is not
correct. Equation (6) can be re-written, with the mask size:
For the achromatized Lyot technique, only a partial extinction of the star can be obtained, but there also, prolate apodizations give optimal results. Both the star and the planet appear in the final image with the same Point Spread Function (PSF) that is optimal in terms of concentration of light. The overall effect of Lyot's coronagraphy with prolate apodization is simply to reduce the magnitude difference between the planet and its parent star. Note that the multiple stage Lyot coronagraph proposed for rectangular aperture, is then also possible for circular aperture.
The reduction factor obtained with the prolate apodization is good enough to consider Lyot's technique for high contrast imaging, mainly because of its simplicity and its lower dependence on the deviation from the perfect case: chromatism, star diameter, telescope jitter.
The comparisons we made between circular and square aperture with the prolate coronagraphy technique are in
favor of the circular aperture. For a perfect prolate apodized coronagraph using the Roddier & Roddier phase
mask technique, the circular aperture provides a slightly better instrumental throughput for a smaller mask
size. For Lyot coronagraphy (PALC) the results appear even more favorable to the circular aperture that gives
a lower integrated residual energy in the diffraction wings, especially when a strong apodization is used.
For a given throughput, the square aperture may however permit a better planet detection in four quadrants
outside its main axial diffraction patterns. Because of its cross-like dead zone, delicate and time consuming
45 degrees telescope rotation will be needed to investigate the whole field. A circular aperture with a
slightly lower throughput will probably be able to detect directly the presence of a planet in a shorter
time. Moreover, the comparison we have done assume telescopes of same surface. For practical reason, it is
probable that a square telescope would be obtained by diaphragming a circular one, which diameter equals to
the diagonal of the square. In this case the circular aperture would be much more efficient. This comparison
between a square and circular aperture does not apply directly to rectangular apertures, detailed in a
previous study (Aime et al. 2002), since rectangular apertures present some specific interesting advantages (high
angular resolution in one direction, possible
implementation for example)
This paper intended to present a theoretical and analytical approach for coronagraphy without atmosphere, using apodized apertures. Several issues have not been tackled in this paper and remain to be studied numerically. In particular, numerical simulations will be needed to evaluate the sensitivity of these formal solutions to the physical parameters: wavelength dependence, wavefront errors, mask positioning errors, angular diameter of the star, etc. It will be also interesting to check if the technique can give valuable results from the ground using an apodized aperture with adaptive optics. All these points will be studied in a more technical future work, and some of them are already presented in Soummer et al. (2002a). A comparison with the other coronagraphic techniques is also necessary; a major point will be to include signal to noise ratio as the main comparison criterion.
Acknowledgements
The authors would like to thank André Ferrari for helpful discussions.
In Eq. (6), the two dimensional convolution product between the 2 radial functions
and
can be simplified, using the properties of FT and HT. The Fourier Transform
(FT) of the convolution product is a simple product between the FTs of each functions:
![]() |
(A.2) |
![]() |
(A.3) |
An analytical expression of the Kernel can be obtained with the help of Mathematica software
(Wolfram 1999):
Circular prolate functions where invented independently by Slepian (1964) and Heurtley (1964) and have found very interesting applications in Optics, for example for the study of confocal laser modes or wave aberrations. Theses applications of prolate functions in Optics are detailed in the excellent review paper by Frieden (1971). The short presentation of the mathematical properties of circular functions is drawn from Frieden's paper. However, notations have been adapted to fit the present application.
Circular prolate functions are the circular analogy to the linear prolate function, known since Niven (1880) and re-discovered by Slepian in 1959 (Slepian & Pollak 1961). Their properties are strictly analogous to those of the linear functions. In the circular case, Hankel Transforms (HT) replace the linear Fourier Transforms (FT). Circular prolate function are defined for any order N of HT, but in the present application we will only need to consider the HT of zero order, involving the Bessel J0 function that appears in Fourier Optics for Hankel Transforms. We will omit the subscripts N and n in the notations for clarity (the prolate function for n=0 is the only one with maximum at origin).
Circular prolate functions are defined by their invariance to a finite Hankel Transform (the finite HT of a
function is the HT of the truncated function):
Reciprocally, the circular have the following property for the infinite HT:
![]() |
Figure B.1: Illustration of the properties of prolate function. Top: invariance to the finite Hankel Transform (Eq. (B.1)); the HT of the truncated prolate gives the prolate itself. Bottom: the HT of the entire prolate gives the truncated prolate (Eq. (B.2)). |
Performing a finite HT on both side of Eq. (B.1), we have:
![]() |
(B.4) |
A given eigenvalue correspond to a unique mask size a and a unique prolate function, as illustrated in
Fig. B.2 (usually this curve is drawn as a function of the prolateness parameter
,
but for clarity we consider it as a function of the mask size (the telescope
diameter D is a constant).
The double orthogonality property of the linear prolates also exists for the circular prolates of orders mand n:
![]() |
(B.6) |
This property is useful for the computation of the transmissions of the prolate apodizations (with m=n=0).
An other fundamental property is the fractional energy within
which finds a simple
expression:
The functions
are solutions of a generalized spheroidal differential equation,
![]() |
(C.3) |
![]() |
(C.4) |
![]() |
(C.6a) | ||
![]() |
(C.6b) | ||
![]() |
(C.6c) |
![]() |
(C.7) |
![]() |
(C.8) |
![]() |
(C.9) |
![]() |
(C.10) |
![]() |
(C.11) |