A&A 449, 827-836 (2006)
DOI: 10.1051/0004-6361:20054180
D. Busonero1,2 - M. Gai1 - D. Gardiol1 - M. G. Lattanzi1 - D. Loreggia1
1 - National Institute of Astrophysics (INAF), Astronomical
Observatory of Turin,
via Osservatorio 20, 10025 Pino Torinese, Turin, Italy
2 -
University of Siena, Department of Physics, via Roma 56,
53100 Siena, Italy
Received 9 September 2005 / Accepted 20 October 2005
Abstract
Aims. Chromatic effects are usually associated with refractive optics, so reflective telescopes are assumed to be free from them. We show that all-reflective optics still bears significant levels of such perturbations, which are especially critical to modern micro-arcsecond astrometric experiments.
Methods. We analyze the image formation and measurement process to derive a precise definition of the chromatic variation of the image position, and we evaluate the key aspects of optical design with respect to chromaticity.
Results. The fundamental requirement related to chromaticity is the symmetry of the optical design and of the wavefront errors. Finally, we address some optical engineering issues, such as manufacturing and alignment, providing recommendations to minimize the degradation that chromaticity introduces into astrometry.
Key words: astrometry - methods: data analysis - space vehicles: instruments - techniques: high angular resolution - telescopes
This effect can be reduced by using more complex refractive systems (doublets, triplets, etc.), taking advantage of the different dispersion of glasses to achieve a certain degree of compensation. For most purposes, the only way to avoid chromatism is considered to be adoption of a completely reflecting design. Here, we show that, at the demanding level of modern astrometry, reflective optics is still affected by significant chromatic effects, which we will refer to as "chromaticity''. We investigate the source of the chromaticity and provide recommendations for its minimization in optical design, manufacturing, and alignment.
Currently, space astrometry experiments are being designed and implemented with the goal of micro-arcsecond (hereafter,
)
measurements. We
will refer to the framework of the Gaia mission, approved within the space
science program of ESA for launch before 2012. Built upon the implementation
of the Hipparcos (ESA 1997) concept, with the benefit of modern
technology and more advanced astrophysical understanding, Gaia aims to measure
absolute position, parallax and annual proper motion of
109 objects
with a typical accuracy of
15
for V=15 stars, with survey
completeness to V=18 and limiting magnitude V=20 (Perryman et al. 2001). Hipparcos's payload was already affected by chromaticity
at the mas-level, and during Gaia's design phase it soon became apparent that
its all-reflective optics would also be affected by chromaticity at a similar
level (Lattanzi et al. 1998). Therefore chromaticity in Gaia must be suppressed
by more than two orders of magnitude.
![]() |
Figure 1: A Gaia-like astrometric payload layout. Left: the set of two telescopes combined at mirror M4 (planar drawing). Right: individual telescope layout. LOS A and LOS B are the two lines of sight implemented in this two-telescope configuration. |
Open with DEXTER |
We describe how the aberrations of a realistic all-reflective optical system can lead to chromaticity as a consequence of light diffraction, even when propagation occurs in a practically non-dispersive medium as would be the case for payloads sent outside the Earth's atmosphere. Although our discussion focuses on Gaia, the concepts and considerations developed here about sources of chromatic errors and how to deal with them can apply to all experiments wishing to reach astrometric accuracy down to the mas-level and beyond.
The aspects evidenced by our study of chromaticity are of critical importance
for compiling detailed error budgets and deciding on actions for controlling
systematic errors. Also, the degree of detail in instrument and measurement
models required to keep chromaticity at the
-level can help
prevent (during design), monitor (throughout operations), and correct
(in data reduction) other possible sources of systematic errors associated
with an image profile and its temporal and spatial variations. A possible
method for chromaticity correction is described in Gai & Cancelliere (2005).
In Sect. 2, we recall the relevant basic principles of a global astrometric
mission like Gaia. In Sect. 3, we introduce
chromaticity for an all-reflecting optical system through a simplified
analytical model, which is also utilized to prove the linear dependence of
chromaticity on wavelength.
Section 4 presents the chromaticity analysis for a realistic representation of
the Gaia optical system. Section 5 provides prescriptions for minimizing
chromaticity in optical designs. In Sect. 6, we present the field-of-view
(FOV) distribution of chromaticity and methods for its reduction. Finally,
conclusions are drawn in Sect. 7.
In Europe, following formal approval in 2000, the European Space Agency (ESA) is developing its new astrometric mission Gaia, which is based on the same two-FOV's scanning satellite concept implemented on Hipparcos, and is meant to increase the measurement capabilities of its predecessor by about two orders of magnitude.
Two different optical systems were considered during the study phase,
so that we will not know the optical configuration that will fly on Gaia before
final selection of the industrial contractor (expected by early
2006). However, this is not a problem for the scope of this work as both
designs are based on all-reflective optics, and we can still refer to one
of them without loss of generality. We chose to refer to the configuration
known in Gaia jargon as the "Baseline Configuration''
(Perryman et al. 2001). It consists of the combination of two off-axis,
three-mirror monolithic telescopes mounted on a common optical bench, as
shown in Fig. 1 (left). The individual telescope has an effective focal length
(EFL) of 46.7 m and a rectangular collecting area of
,
resulting in a central lobe size (analogous to the Airy
diameter for circular pupils) of
mas at the effective
wavelength
nm. We constructed a computer model of
the Baseline Configuration by utilizing the CODE V optical modelling package
(ORA 1995), and optimized the system over a FOV of
by closely following the procedure described in
Loreggia et al. (2004). With reference to Fig. 1 (right), the
optical train of each telescope consists of seven elements: three mirrors (M1,
M2, M3) with optical power, three flat mirrors (M4, M5, M6) for beam-folding,
and the focal plane (FP). The FOV covers the angular ranges
in the along-scan direction and
in the across-scan direction;
the off-axis design is required to avoid vignetting. One of the possible
implementations investigated for the Gaia FP assembly calls for an array of 17
(along-scan)
10 (across-scan) CCDs working in time delay integration (TDI)
mode (Saint-Pe et al. 2000). Each along-scan strip of devices hosts: two chips for
the functions of target detection and confirmation, eleven CCDs for the
repeated astrometric measurement, four units for broadband photometry. The
rationale for photometry is both astrophysical (stellar classification) and
technical (chromaticity correction discussed below).
During each
transit, the image of a target on any of the eleven astrometric CCDs is binned
across scan, and centered along scan, providing independent one-dimensional
location estimates of comparable precision (Gai et al. 2001 and references therein). The composition of repeated along-scan measurements, taken for every
location on the sky with different orientations, allows the determination of
the bi-dimensional angular coordinates in a common reference system ("global
sphere reconstruction''). The evolution of the apparent source position during
the mission lifetime (5 years) yields absolute parallax and proper
motion.
Note that the relative orientation of the two telescopes, i.e. the
base angle (BA), is a key factor of the measurement process: the
on-sky separation between two stars entering the two different FOV's is
obtained from the measured difference in FP coordinates plus the BA;
therefore, configuration perturbations of a few nm, negligible as to the
optical response, are critical to astrometry as they can induce errors up to
several tens
.
Laser-based metrology lines are expected to monitor and (if necessary) to
control the BA stability that also sets stringent constraints on payload
thermal stability (Gardiol et al. 2004).
We verified the presence of chromaticity is mainly related to wavelength; sampling of the telescope point spread function (PSF) and dimensions of the read-out region of the FP play a secondary role in the actual amount of chromaticity displayed by the realization of any all-reflective optical design.
Also, we will see that for a symmetric PSF there is no
chromaticity apart from possible residual effects due to sampling and other
noise sources (e.g. photon noise). On the other hand, for an
asymmetric PSF, most of the
chromatic effect,
,
can be modelled for every location estimators
tested as a linear function of the change in effective wavelength
,
characterizing the spectral difference of the
sources, through a coefficient g(x,y) that depends on the local shape of the PSF, f(x,y), i.e. on the contributions of the aberrations over the FOV:
We illustrate relation (1) with the help of ad hoc analytical model. We choose an asymmetric, one-dimensional bell-shaped curve,
in the form of a deformed Gaussian profile, i.e. of different width on the
opposite sides of its peak value (at y=0):
![]() |
(3) |
For two different wavelengths
,
the normalization
condition requires that the maximum value decreases as the curve width
increases. This agrees with the physical condition that in the diffraction
limit, the PSF of an
optical system increases in linear size with the wavelength. In particular,
the PSF is a function of the wavelength
,
the position on the FP y,
and of the characteristic linear dimension D of the optical system, in a
well-defined form required for dimensional reasons:
![]() |
(4) |
Let us evaluate the behavior of, e.g. the center of gravity (COG) estimator,
for the model in Eq. (2) over the region of interest [-a,a]:
![]() |
|||
![]() |
|||
![]() |
(5) |
![]() |
(8) |
![]() |
Figure 2:
The LSF obtained from the Fraunhofer integral with the coma
term ![]() |
Open with DEXTER |
With our PSF modelling and measuring code we were able to investigate the
effects of limited detector dimensions (read-out region coincident with the
detector dimension) and sampling resolution (pixel size). We went on to extend
the detector dimension and to shrink the sampling step by two orders of
magnitude (CCD units of 50 cm on a side with
pixels) and
repeated the COG location measurements, which yielded:
.
This is a variation of only
compared to
the value above.
These experiments suggest, within the limits probed (the dimension of the "unlikely'' CCD simulated is very large but not infinite, and its pixels small but not infinitesimal), that chromaticity is an intrinsic property of all-reflective optical systems and that it can be approximated by a linear function of source effective wavelength. Therefore whatever the detector geometry and its spatial extension, different spectral type stars, set in the same nominal position in object space, do not have the same estimated position on the FP, even with the simplest possible algorithm; i.e. the same location on the sky is not uniquely mapped on the focal plane.
Chromaticity, a color-dependent position variation, is an important contribution to the systematic error and must be removed to preserve the desired mission accuracy.
![]() |
Figure 3: Upper: PSF for B3V (solid line) and M8V (dashed line) stars in a representative off-axis field position; lower: PSF difference. |
Open with DEXTER |
The application of the COG method for centering the Gaia-like PSF's of the two
reference stars yielded 20.112
for the blue star and 20.055
for the red star, respectively, a difference of 57 nm. At the
optical scale of Gaia, about 4''/mm, this results in more than 200
,
i.e. comparable with the random location error for stars brighter than
.
The Fourier integral in Eq. (12) is performed over the rectangular
pupil of the Gaia telescope, where the factor
is constant.
The pupil function
is expressed in terms of the wavefront
error (WFE) function,
,
via the relation
The expression for the monochromatic PSF,
,
is therefore
![]() |
(14) |
CODE V can compute the WFE distribution associated to a given field position for a selected optical configuration; it is thus possible to build the PSF for any desired source, using Eqs. (12)- (15). We use a numeric implementation of this model (Busonero et al. 2005) to provide the description of the imaging performance of the Gaia telescope with all the aberration terms given by CODE V. Additional contributions describing the realistic detector response and the effect of TDI observations can be included, as well as the across-scan binning used for Gaia, to build the recorded signal. For any source of known spectrum, it is possible to derive its effective wavelength, the detected signal, and the chromaticity with respect to the selected reference spectral type using the above expressions. In this way, the measurement of Gaia can, in principle, be made achromatic, as the position estimate is no longer function of the source spectral distribution. This correction requires both color information and knowledge of the local instrument response.
Table 1: Effects of individual Standard Zernike terms vs. the non-aberrated (ideal) case.
We then investigate how to minimize, by design, the chromatic errors in any given field position, analyzing the effect of partial or total suppression of selected aberration terms. Finally, we proceed to evaluate the overall field properties of chromaticity, verifying the possibility of compensating for it by taking measurements over a complete crossing of the focal plane, a possibility offered by scanning instruments like Gaia.
In optical engineering, typical expansions of the WFE are in terms of Zernike and Fringe Zernike polynomials, orthogonal, and normalized functions, if mapped on a circular pupil (Born & Wolf 1980). In our analysis we have used expansions with 21 Zernike terms; the set of standard Zernike functions are listed in Table 1 (for Fringe Zernike equivalents see Born & Wolf 1980). Note that the representation in Zernike polynomials is not optimal for non-circular pupils and more convenient expansions have been investigated (Gardiol et al. 2005).
In the following, we show how to identify the main individual contributors to chromaticity in the case of a
realistic representation of the Gaia telescope regardless of the choice of the
Zernike set, and to search for possible
correlations between chromaticity and parameters like rms WFE and image rms
width.
Chromaticity is again evaluated by taking the difference of the COG positions obtained for the adopted reference spectral types B3V and M8V. We used both monochromatic PSFs at the reference wavelengths associated with the two spectral types and the full polychromatic representation of Eq. (15), assuming the two stars behave like blackbodies. The latter description is more representative from the astrophysical standpoint, whereas the former is computationally much simpler. Preliminary experiments proved that the two methods can be considered equivalent when, as is the case here, the primary goal of the investigation is to evaluate of the properties of chromaticity and not its precise calculation.
The image rms width is derived from the monochromatic PSF at
,
roughly representative of solar type objects and sort of midway
between B3V and M8V types. The image COG displacement is always referred to
the ideal, non-aberrated case, which has an image rms width of
.
Each Zernike term is individually evaluated, with the coefficient set
to 0.1 (i.e. the small aberrations regime is assumed): then the WFE is built
from the selected term, and the PSF computed accordingly to the above
model. The results for the first 21 terms are listed in Table 1.
![]() |
Figure 4: Image COG variation ( upper panel) and chromaticity variation ( lower panel) vs. Standard Zernike aberration terms. |
Open with DEXTER |
In Fig. 4, we show the impact of each individual aberration on the COG (i.e. an effect corresponding to classical distortion) and on chromaticity. Some terms strongly contribute to both, but in general there is little correlation. Similarly, we found that there is no simple relationship between rms WFE, or image rms width (both including the contribution of all aberrations) and chromaticity.
The complex relationship between WFE, image rms width, and chromaticity can be understood in terms of symmetry, since the aberrations are in general bidimensional, whereas the parameters relevant to the Gaia measurement are mostly one-dimensional, i.e. referred to the scan (y-axis) direction. Thus, specific aberration terms may contribute to the WFE significantly, to across-scan image width and across-scan (x-axis) centroid displacement, with little impact on the location, noise and chromaticity, due to the across-scan binning of the images (see Sect. 2).
There is a trend toward increasing image rms width with rms WFE, but this is not a strict relation, and several cases of large WFE and small image width degradation are seen. This could be associated to aberrations inducing significant image degradation in the across scan direction only. The COG, in some cases, is affected by a large displacement associated to low chromaticity values. Thus, the images are translated with respect to the non-aberrated position by an amount that does not depend on the spectral distribution of the source. The case of classical distortion fits this description.
From Fig. 4, we see that significant contributions to chromaticity,
i.e.
,
come only from aberrations (Nos. 3, 6, etc.) with
a specific functional form: they are all the odd (sinusoidal) functions of the
angular coordinate, so that they have odd symmetry on the pupil plane. All of
the even (cosinusoidal) terms do not provide net chromaticity; values below
are the result of limited precision in the calculation
scheme that is implemented.
A similar analysis was carried on with the Fringe Zernike polynomials. Again, the critical terms for chromaticity we identified to be those associated to the odd parity portion of the WFE, i.e. the sinusoidal terms, as for the Standard Zernike expansion. Therefore, in the case of Gaia, the terms that should be minimized, by design, manufacturing, and alignment are those with odd parity (anti-symmetric) relative to the across-scan (x) axis.
Table 2: rms WFE, image rms width (for the two reference spectral types), and chromaticity at different field angle locations over the focal plane.
The nominal case provides an image rms width that is reasonably close to the diffraction limit in spite of non negligible rms WFE (40 nm, i.e.Suppression of the symmetric aberrations provides some improvement to the rms WFE. The variation of the image width is marginal and the initial chromaticity is mostly retained. Conversely, removing the anti-symmetric aberrations completely (case 4), we achieve significant improvement, as in the previous case, on the rms WFE, but not nearly as much on the image rms width; chromaticity, however, is reduced to zero, according to expectations. The rms sum of WFE in cases 3 and 4 restores the nominal value: the mutual orthogonality of symmetric and anti-symmetric function sets is preserved, even if this is no longer true for the individual functions within each set. Even when the symmetric aberrations are scaled to restore the initial WFE level of 40 nm (case 5), the chromaticity is still zero.
Besides, when a random subset of anti-symmetric aberrations (9 and 10) is retained, together with the symmetric terms, with the nominal coefficients (case 6), the result has a value that is comparable to the chromaticity in the nominal case (case 2) but with the opposite sign. Futhermore it leads to a dramatic degradation of both WFE and image quality.
Table 3: Changes in rms WFE, image rms width, and chromaticity for the 6 selected cases discussed in Sect. 5.
This shows that the optimization procedure in the ray tracing code actually achieves some partial compensation among different aberrations, providing some, although limited, benefit to chromaticity. On the other hand, standard optical design optimization procedures are based on improving the general image quality parameters such as WFE and spot diagrams. Therefore, it is possible to introduce custom merit functions in the optimisation procedure, which can include computation of the chromaticity or of the critical anti-symmetric contribution to WFE, with some averaging rule over the field. This does not necessarily modify the actual optical configuration in any significant way, since it is always convenient to start after standard optimization, but chromatic aberrations can be further reduced at the expense of the others and, possibly, of a small increase in overall WFE, which is acceptable in many cases. ![]() |
Figure 5: Modifications of the PSF profile due to the different weight of aberration terms for field position F4 of the Gaia-like astrometric optics considered here. |
Open with DEXTER |
![]() |
Figure 6: Chromaticity distribution over the astrometric field in the nominal configuration of the Gaia-like astrometric payload. |
Open with DEXTER |
![]() |
Figure 7: Transit level chromaticity vs. across scan field position in the nominal case and for increasing alignment error, (in arcsec). |
Open with DEXTER |
The transit-level chromaticity remains very close to zero in an across-scan
position of about
,
which appears to be a chromatic-free section
of the field; the residual has opposite signs on either side of this position.
This is due to the structure of the local chromaticity distribution
(Fig. 6), with alternate signs in each quadrant. The result
is that stars of a given spectral type in different regions of the field have
nearly opposite residual chromaticity. This aspect may be exploited for
further reduction of the residual chromaticity in the data reduction
phase. Upon definition of a threshold of acceptable residual chromaticity, at
transit level, it is possible to provide a specification for alignment,
referred to both initial telescope integration, and in-orbit re-alignment.
Table 4: Transit-averaged chromaticity as a function of re-alignment error.
Quantitative analysis and optimization may be performed on specific optical configurations, including manufacturing and alignment aspects based on realistic WFE data from manufacturers. The resulting minimization of the instrumental chromaticity is an improvement to the overall systematic error budget, which is desirable because correction procedures based on the science data are necessarily limited, and in some ways they subtract information.
Specific analyses have been performed with reference to the baseline configuration of the Gaia astrometric payload, but the assumptions, principles, and conclusions of our discussion can be applied to any high accuracy astrometric instrument. The chromatic error is defined as the difference in image photo-center location at different wavelengths, and the exact value depends on the selected measurement process, but the effect is unavoidable.
Independently from the selected WFE expansion, the terms relevant to chromaticity are those associated with anti-symmetry of the PSF in the FP and with anti-symmetric WFE contributions on the pupil plane. Symmetric terms only contribute to the astrometric noise by increasing the effective image width in the measurement direction. As the relation between rms WFE and chromaticity is complex, the specification of only the rms WFE is not a sufficient requirement for controlling chromaticity from the optical manufacturing standpoint.
The first prescription to optical manufacturers is to suppress or at least minimize the anti-symmetric terms. However, it is not necessary to set all chromatic terms to zero: an appropriate combination is still able to provide some local balancing. Standard optical design optimization techniques are able to provide reasonable results by applying the conventional image quality merit functions; optimal results on chromaticity require definition of ad hoc criteria, as much as on tolerancing. The distribution of chromaticity over the field inherits some symmetry properties from the optical system; deviations from symmetry are induced e.g. by manufacturing and alignment errors on each optical component.
In case of repeated measurements in different parts of the field, some chromaticity compensation is achieved in the data combination, depending on the symmetry of both instrument and measurement schemes. In the case of Gaia, each object observed by a symmetric (i.e. correctly re-aligned) telescope along a full transit provides a set of astrometric measurements affected by opposite chromatic errors in symmetric positions; transit-level composition is therefore likely to remove a large fraction of the local chromaticity. Any residual chromaticitymust be removed in the science data processing after the best implementation of the astrometric payload, to minimize the initial systematic error. This requires spectral information for each source and a good knowledge of the detailed instrument response (Gai & Cancelliere 2005).
Acknowledgements
We acknowledge the contribution of Dr. D. Carollo to initial investigations of the subject of chromaticity. The ORA team helped in the detailed operation and best usage of their CodeV package. Our activity on Gaia was partially supported by the Italian Space Agency, under research contracts ASI ARS 96-77 and ASI ARS 98-92. M.G.L. acknowledges the support of the Space Telescope Science Institute through the Institute's Visitor Program for 2005.