A Fourier domain model for estimating astrometry errors due to static and quasistatic optical surface errors
TMT Observatory CorporationInstrumentation Department,
1111 S. Arroyo Pkwy. Ste. 200,
Pasadena,
CA,
91107
USA
email: brente@caltech.edu
Received:
12
January
2013
Accepted:
25
February
2013
Context. The wavefront aberrations due to optical surface errors in adaptive optics systems and science instruments can be a significant error source for high precision astrometry.
Aims. This report derives formulas for evaluating these errors which may be useful in developing astrometry error budgets and optical surface quality specifications.
Methods. A Fourier domain approach is used, and the errors on each optical surface are modeled as “phase screens” with stationary statistics at one or several conjugate ranges from the optical system pupil. Three classes of error are considered: (i) errors in initially calibrating the effects of static surface errors; (ii) the effects of beam translation, or “wander,” across optical surfaces due to (for example) instrument boresighting error; and (iii) quasistatic surface errors which change from one observation to the next.
Results. For each of these effects, we develop formulas describing the position estimation errors in a single observation of a science field, as well as the differential error between two separate observations. Sample numerical results are presented for the three classes of error, including some sample computations for the Thirty Meter Telescope and the NFIRAOS firstlight adaptive optics system.
Key words: astrometry / instrumentation: high angular resolution / instrumentation: adaptive optics
© ESO, 2013
1. Introduction
High precision astrometry is becoming an increasingly important capability for large, and future extremely large, optical and nearinfrared astronomical telescopes (Yelda et al. 2010; Fritz et al. 2010; Trippe et al. 2010). Astrometric accuracies of 40 to 100 micro arc seconds (μas) have now been reported for observations both without and with adaptive optics (Lazorenko et al. 2009; Cameron et al. 2009), and requirements for future systems are specified in the 10 to 50 μas range (TMT Science Requirements Document, at http://www.tmt.org/documents). Much as for high contrast imaging, performance at this level will require the careful evaluation of the full range of possible error sources, some of which have been considered negligible up to now. One sample astrometry error budget includes at least 35 different terms, grouped into the five categories of: reference star catalog errors, atmospheric refraction effects, atmospherc turbulence effects, focal plane measurement errors, and optomechanical errors (Schoeck 2011). This final category places challenging requirements on the design, fabrication, and alignment of science instruments (and adaptive optical systems) in order to control this systematic error source.
The purpose of this paper is to describe a set of engineering formulas for estimating the astrometry errors due to surface errors in optomechanical systems. These formulas can be used to develop quantitative error budgets and the corresponding optical surface specifications. A Fourier domain approach is used, and the phase errors on each optical surface are modeled as “phase screens” with shiftinvariant statistics at one or several conjugate ranges from the optical system pupil. The image distortion errors inherit this property of stationarity, and some steps of the work consequently resemble previous analysis of tilt anisoplanatism in adaptive optical (AO) systems (Roddier et al. 1993). The errors are evaluated as weighted integrals of the power spectral densities (PSDs) of the errors on each optical surface, where the values of the weighting functions depend upon parameters including the telescope aperture diameter, science fieldofview (FoV) diameter, conjugate range to the optical surface, and the category of optical surface effect evaluated.
Section 2 formulates a model for the three different types of optical surface effects considered in the analysis. These are:

the static calibration undersampling error from using only a finite number of reference sources to sample the image distortion pattern in the focal plane;

the error due to random or systematic beam wander translation across the static errors on each surface, which could occur due to an instrument boresighting error or image derotation; and

quasistatic errors due to timevarying optical surface errors, for example a calibration error on a deformable mirror in a multiconjugate AO (MCAO) system.
We develop a notation to keep track of these various errors, both for a single observation of a science field and the differential error between two successive observations. We formulate a model for onsky calibration of the errors (using known reference stars in the image) in terms of removing the loworder modes of the image distortion pattern, such as global tip/tilt and plate scale.
The analytical methods used to evaluate these errors are developed in Sects. 3 through 5. Section 3 describes our standard geometrical optics model for the image distortion due to a set of optical surface errors. Section 4 transforms this model into the spatial frequency domain, and Sect. 5 then focuses the analysis on the special case of a circular, unobscured aperture and a circular FoV.
Section 6 next details the explicit formulas for each of the three different optical surface effects introduced in Sect. 2 and presents sample numerical results. These include: (i) a multiplicative “transfer function” relating the PSD of quasistatic errors on an optical surface to the resulting position estimation errors, and (ii) the errors due to beam wander or calibration undersampling of static optical surface errors with a power law PSD. Finally, Sect. 7 is a brief summary of the key results.
2. Astrometry error models and metrics
The twodimensional image distortion map at the focal plane of an instrument will be denoted Θ(ϕ,p), where ϕ is a twodimensional lineofsight (LOS) through the instrument optics, and p is a set of error profiles for each optical surface of the instrument. The error profiles are sums of static and quasistatic components, s and q. The lineofsight ϕ is a sum of o, the intended LOS for the origin of the science field, δ, the unintended shift in the LOS due to (for example) boresighting error, and α, the relative location of a particular object in the science field.
The estimate of the image distortion map will be denoted Θ_{c}(ϕ,p). This calibration estimate is based upon the measured image of an array of reference sources; it includes the effect of static error profiles s and can be translated to account for the known value of o, but it is ignorant of the error sources q and δ. The resulting twodimensional position estimation error in a single image of the science field takes the form (1)Similarly, the differential error between two separate images is given by (2)where the quasistatic surface errors and boresighting errors can take two different, uncorrelated values in the two exposures (but the static errors s remain the same).
Since we will be using a geometrical optics model, the distortion map Θ(ϕ,p) is linear in p. The formula for the error E can therefore be decomposed into the sum of 3 terms, namely: (3)Here Q is the twodimensional quasistatic error, W is the twodimensional error due to wander in the LOS, and C is the twodimensional calibration error. Similarly, the differential error ΔE can be written as (4)where the three individual terms are defined by the formulas We are interested in developing formulas for the meansquare value of f, where f is any of these six effects of optical surface errors. We write (8)where the angle brackets denote ensemble averaging over the statistics of the optical surface errors and the unknown line of sight errors δ. Note that measurement noise is not included in the analysis.
For some observations, it may be possible to estimate and partially correct for the effects of the surface errors using the known locations of astrometric reference stars in the science field. One star can be used to calibrate a global tip/tilt error, three stars can be used to calibrate plate scale and rotation, and so on. We model this image postprocessing in terms of a loworder mode removal operator P, defined by the formula (9)Here the { 0,1 } valued function Ω defines the science FoV, and the functions m_{j} are the orthogonal loworder distortion modes removed from the image. This is an idealized, best case correction, as it neglects possible errors due to measurement noise, imperfectly known positions of the reference stars, and the aliasing of higherorder distortion modes into an estimate obtained using a finite number of reference stars. If the modes are defined to be orthonormal over the FoV, it follows that the fieldaveraged, meansquare value of Pf is given by (10)Even if the error f is approximated as spatially stationary, the pointwise value of the image distortion variance is no longer spatially invariant (i.e., independent of α) after low order modes have been removed.
3. Image distortion maps due to optical surface errors
We will use firstorder, geometric optics to model the focal plane image distortions due to optical surface errors, an accurate approximation for the case of AOassisted observations with neardiffractionlimited resolution. The image distortion in the direction α due to the set of optical error profiles p is defined to be the rms bestfit tilt to the wavefront aberration in this direction, namely (11)Here r denotes coordinates in the pupil plane, A(r) is a { 0,1 } valued pupil function, and φ(r;α,p) is the wavefront aberration profile in the direction α due to the optical surface errors p. Other models for image motion as a function of the aberration φ are possible, for example the apertureaveraged value of the gradient, but the rms bestfit tilt is a good approximation to the measured motion of the central core of a wellcorrected PSF (Ellerbroek 2009).
If we restrict attention to the case of a circular, unobscured pupil function of diameter D, the first integral in Eq. (11) may be evaluated with the result (12)Next, the wavefront aberration profile φ will be modeled using firstorder optics as a sum of the contributions from each individual optical phase errors p_{j} at conjugate range h_{j}: (13)Substituting Eq. (13) into Eq. (12), we obtain the result (14)
4. Fourier domain analysis
The meansquare image distortion errors introduced in Sect. 2 can be evaluated most easily in the angular frequency domain if we assume that the optical phase errors p are spatially invariant, i.e., have shiftinvariant statistics that can be described in terms of a PSD function. Of course, this assumption introduces approximations at several different levels. Phase errors on actual optical surfaces cannot be fully defined in terms of a spatial PSD because (i) each optical surface is unique, not part of an infinite ensemble of possible surfaces; and (ii) optical surfaces have finite dimension. At the same time, the use of PSD models to specify and characterize optical surface errors has been established for at least several decades (Lawson et al. 1995), and the use of Fourier domain methods to study the performance of finite aperture adaptive optics systems and high contrast imaging systems has met with considerable success (Jolissaint et al. 2006). As with these previous efforts, the intent here is to develop practical formulas for optical surface specifications and error budgeting.
Using Eq. (14) above, the Fourier transform of the image distortion profile Θ is described by the expression (15)where κ is a twodimensional angular frequency variable. This result is essentially the Fourier convolution theorem. If the random optical phase errors p_{j} are shiftinvariant, then the PSD of Θ is expressed in terms of their power spectra Φ_{pj} by the formula (16)The quasistatic and beamwander errors Q, W, and ΔW introduced in Sect. 2 can be expressed as sums of the form (17)where the possibility of only a single term with w_{1} = 1 is included. Using the Fourier shift theorem we have (18)so that the power spectrum of f is given by (19)The twoaxis meansquare value of f is then (20)where tr denotes the trace of a matrix (recall that Φ_{f} is a 2 × 2 matrix).
PSD methods may also be used to evalute the expected value of the inner product integrals appearing in Eq. (10) above for the meansquare value of Pf, the position estimation error remaining after loworder mode removal. Using Parseval’s theorem, we have: (21)It remains to develop a Fourier domain model for the calibration errors C and ΔC. For this purpose, suppose that the image distortion function Θ(α;p) is sampled using an infinite grid of reference sources with onedimensional angular spacing d, and that the image distortion map is estimated from these samples using Fourier interpolation. The calibration Θ_{c}(α;p) is correct for the Fourier modes of the distortion with angular frequencies less than 1/2d in each component, but it aliases the higher frequency modes. More formally, (22)where the sum is taken over all pairs of integers j and k, and κ_{∞} = max( κ_{1}, ∥ κ_{2}). The Fourier transform of the calibration error is then given by (23)For a linear combination of image distortions defined as in Eq. (17) above, using Eq. (18) we obtain (24)From Eq. (23) (and intuition) it follows that the calibration error is not shiftinvariant, since the Fourier components of the error are not statistically independent at distinct angular frequencies. However, the meansquare value of the error can still be expressed in the Fourier domain as (25)on account of the Plancheral theorem. Using Eqs. (23), (24), and the fact that the distortion function Θis shift invariant, the meansquare calibration error now becomes (26)Here κmod d^{1} is the componentwise remainder of κ when divided by d^{1}. Equation (26) is a somewhat awkward expression, and we will only evaluate it explicitly for the calibration error C (i.e., f = Θ) as opposed to ΔC.
5. Specializing to circular apertures and fields
The special case of a circular, unobscured aperture function A(r) and a circular science FoV Ω(α) is generally sufficient for error budgeting purposes. The Fourier transform of the function A(r)r appearing in Eq. (15) then takes the form (27)where D is the aperture diameter, J_{2} is a Bessel function of the first kind, and φ is the angle of the vector ν. Substituting Eq. (27) back into Eq. (16) now yields (28)For a circular FoV of radius ρ, it is reasonable to decompose the image distortion profile in terms of the orthonormal basis functions where the functions Z_{i} are the orthonormal Zernike polynomials on the unit radius disk as used by Noll (1976). Using Eq. (21) and Noll’s formulas for ℱ(Z_{i}) in terms of Bessel functions, we obtain the result (31)for the projection of the function f onto the first N radial orders of the distortion modes. For example, N = 0 corresponds to global tip/tilt removal, N = 1 adds modes for plate scale, animorphic plate scale, and image rotation, and so on.
6. Final formulas and sample numerical results
Using the general results developed in Sects. 3 through 5 above, we can now derive and evaluate expressions for the three error terms of quasistatic distortion, beam wander across static surface distortions, and calibration undersampling error.
6.1. Quasistatic errors
In Sect. 2, the position estimation error due to the effects of the quasistatic optics figure error q is denoted Q, and the residual error after loworder mode removal is written as PQ. This corresponds to the case f = Θ, and by Eq. (19) the particular value of the LoS error o + δ is unimportant. Substituting Eqs. (20), (28), and (31) into Eq. (10) yields the result Here is the meansquare position estimation error due to the surface errors on optical surface number j at conjugate range h_{j}, D is the aperture diameter, ρ is the diameter of the FoV, ν = κ/h_{j}, and the mode removal filter ℳ is defined by (34)where M is the maximum radial order of the modes removed from the image distortion map.
Quasistatic wavefront errors are unpredictable almost by definition, so reliable information on their PSD may not necessarily be available. However, it is still possible to develop a “sensitivity” function to describe the magnitude of the error introduced by one unit of rms wavefront error as a function of spatial frequency. For the higherorder, tip/tilt/piston removed wavefront errors, this sensitivity function is given by the formula (35)where the factor of ℳ(1,Dν) is included to filter out the tip/tilt/piston modes of the wavefront error on a circular aperture of diameter D.
Figure 1 plots this sensitivity function (in units of μas per nm) for quasistatic wavefront errors on the deformable mirror DM11.2 in the NFIRAOS first light AO system for the Thirty Meter Telescope (Herriot et al. 2012). The relevant parameter values for this instrument are D = 30 m, ρ = 30 arcsec, h_{j} = 11.2 km, and N = 0 or 1. The maximum value of the sensitivity function is approximately 3 μas nm^{1} with global tiptilt removal (N = 0), and approximately 0.15 assuming that the plate scale and rotation modes (N = 1) can also be calibrated using reference stars. Results such as this can be used to help specify the requirements on deformable mirror stability in AO systems used for high precision astrometry.
Fig. 1 Sensitivity function relating quasistatic wavefront errors on the deformable mirror DM11.2 in the TMT AO system NFIRAOS to the resulting image distortion after loworder mode correction using reference stars. See the text for further parameter values. 

Open with DEXTER 
Finally, if the random quasistatic wavefront aberrations have the same variance and are uncorrelated for two separate science exposures, the value of the differential error due to the two independent quasistatic errors in two images is simply (36)
6.2. Error due to beam wander effects
The position estimation error in a single exposure due to a beam wander across the optical surface errors is the function W(α) defined as the first term within square brackets in Eq. (3). Using Eqs. (10), (20), (19), (16) and (31) above, we can again evaluate the meansquare value of the error as sum of contributions from each optical surface, with each contribution expressed as a weighted integral of the surface error PSD. For a fixed value of the beam wander δ, the contribution from surface number j is given by the formula (37)If the beam wander vector is treated as a zeromean, normally distibuted random error, the expected error averaged over all realizations of δ is then given by (38)where σ_{δ} is the oneaxis rms value of the beam wander.
A common functional form for the fabrication errors on an optical surface is a (twodimensional) power law PSD, Φ(ν) = cν^{−p}. If the optical surface is specified to have a pistontip/tiltfocus removed rms OPD equal to σ_{OPD} on a clear aperture of radius equivalent to R_{C} for the image of the surface in telescope object space, the coefficient c may be evaluated with the result (39)Here the quantities I_{1}′(p) and I_{n}(p) are similar to the integrals computed by Noll (1976) for the case of Kolmogorov turbulence with p = 11/3, and are given by the expressions By substituting Eq. (39) into Eq. (38) and applying the change of variable ν → Dν, we finally obtain (42)Equation (42) suggests the definition of a normalized sensitivity coefficient s_{PWj} according to the formula (43)We recall that D is the diameter of the telescope aperture, the rms optical path error over the clear aperture of surface j is σ_{OPD}, and the radius of this surface is equivalent to R_{C} when imaged into telescope object space. The normalization defined in Eq. (43) expresses the fact that the rms image distortion due to beam wander on surface j will scale linearly with the rms optical path error within the beamprint (i.e., with (D/R_{C})^{(p − 2)/2}σ_{OPD}), and inversely with the diameter of the telescope (D). According to Eq. (42), the resulting sensitivity coefficient s_{PWj} is a function of the surface error power law p, the order of loworder mode removal N, the normalized FoV diameter hρ/D, and the normalized beam wander hσ_{δ}/D.
Fig. 2 Normalized astrometry error for beam wander with global tip/tilt removed. See Eq. (43) for the definition of this sensitivity coefficient. 

Open with DEXTER 
Fig. 3 Normalized astrometry error for beam wander with global tip/tilt and plate scale removed. See Eq. (43) for the definition of this sensitivity coefficient. 

Open with DEXTER 
The trends in the error due to beam wander across optical surface errors are plotted in Figs. 2 and 3. From the plotted results, we see that

Larger random beam wanders always cause larger position estimation errors, although the trend begins to asymptote for normalized jitters beyond the range of 0.3–1.0;

Larger fields of view also cause a larger error, although this effect also begins to asymptote for normalized fields of view greater than about 1.0. This result implies that surface errors on optics located far from a pupil (i.e., with a large absolute value of h) have the greatest impact on astrometric accuracy.

The expontent of the PSD power law has a relatively smaller effect, with a steeper PSD (more negative exponent) generally yielding a modestly larger error;

The error is always reduced by removing the plate scale modes from the image distortion profile, with a greater reduction obtained for smaller normalized fields of view.
6.3. Calibration undersampling error
The last two terms to consider are the calibration errors C and ΔC. The function f consists of only a single term for C, but two terms for ΔC. On account of Eq. (26), this means that the formula for is much less tractable than for , so we have only considered the simpler case of the error in a single exposure^{1}.
Substituting Eq. (39) for a power law PSD into Eq. (26), the formula for the meansquare calibration error due to surface number j becomes (45)Note that domain of integration is still twodimensional, consisting of all spatial frequency vectors ν with ν_{∞} = max(ν_{1},ν_{2}) > (2h_{j}d/D)^{1}. Since the integrand in Eq. (45) is radially symmetric, we replace the square “hole” in the domain of integration with an equalarea disk so that the integral can be converted into polar coordinates and simplied. The final formula for the calibration error becomes (46)Figure 4 plots the calibration error sensitivity coefficient (47)as a function of the calibration Nyquist frequency D/(h_{j}d) for several different power laws p. We see that for calibration to provide a meaningful improvement, the calibration Nyquist frequency must exceed 1, or equivalently the reference source spacing d must satisfy d < D/h_{j}. The rms error then falls as the Nyquist frequency to the power −(p + 1)/2, as would be expected from Eq. (46) and the asymptotic behavior of the Bessel function J_{2}. Smoother optical surface errors, with larger values of p, are consequently easier to calibrate.
Fig. 4 Normalized rms position estimation error due calibration undersampling. See Eq. (47) for the definition of this sensitivity coefficient. 

Open with DEXTER 
7. Conclusions
We have derived a Fourier domain model for evaluating the position estimation errors due to optical surface errors in astronomical instrumentation and associated adaptive optics systems. Using analytical methods very similar to those used earlier to study adaptive optics, the errors can be evaluated as a sum of weighted integrals of the surface error PSD on each optical element. Separate formulas have been developed for: (i) quasistatic surface errors; (ii) the error due to beam wander across static surface errors; and (iii) the calibration undersampling error for a finite number of bright, regularly spaced reference sources. The benefits of onsky calibration using a number of known reference stars in the science image has also been modeled. It is hoped that these results may be useful for developing optical surface specifications and astrometry error budgets.
There are several aspects of this basic model which could be revisited for greater generality and improved accuracy. Characterizing quasistatic surface errors in terms of a PSD may be challenging, since these errors are difficult to model almost by definition. The assumption that surface errors are uncorrelated between different optical surfaces will not necessarily apply to MCAO system, although these correlations will not matter in the important special case that only the groundlayer DM is adjusted to compensate for the static errors on other surfaces. Finally, the current model for calibration, which is based upon a grid of regularly spaced, bright reference sources, is restricted to laboratory source simulators and cannot be used to evaluate the performance of more sophisticated onsky calibration methods using images of dense star fields. It remains to be seen whether Fourier domain methods are applicable in these cases with an acceptable level of accuracy, or if more computationally intensive approaches such as Monte Carlo simulations may be required instead.
According to Eq. (7), the error ΔC is identically zero if the offset vectors o and o′ are identical.
Acknowledgments
The basic model for astrometry errors described in Sect. 2 above was developed in conversations with Glen Herriot and Matthias Schoeck, and we would also like to thank the referee for providing many helpful suggestions in his review. The TMT Project gratefully acknowledges the support of the TMT collaborating institutions. They are the Association of Canadian Universities for Research in Astronomy (ACURA), the California Institute of Technology, the University of California, the National Astronomical Observatory of Japan, the National Astronomical Observatories of China and their consortium partners, and the Department of Science and Technology of India and their supported institutes. This work was supported as well by the Gordon and Betty Moore Foundation, the Canada Foundation for Innovation, the Ontario Ministry of Research and Innovation, the National Research
Council of Canada, the Natural Sciences and Engineering Research Council of Canada, the British Columbia Knowledge Development Fund, the Association of Universities for Research in Astronomy (AURA) and the US National Science Foundation.
References
 Cameron, P. B., Britton, M. C., & Kulkarni, S. R. 2009, AJ, 137, 83 [NASA ADS] [CrossRef] (In the text)
 Ellerbroek, B. L. 2009, in Adaptive Optics: Methods, Analysis and Applications, eds. B. L. Ellerbroek, & J. Christou (Washington D.C.: OSA) (In the text)
 Fritz, T., Gillessen, S., Trippe, S., et al. 2010, MNRAS, 401, 1177 [NASA ADS] [CrossRef] (In the text)
 Herriot, G., Andersen, D., Atwood, J., et al. 2012, in Adaptive Optics Systems III, eds. B. L. Ellerbroek, E. Marchetti, & J.P. Veran (Bellingham: SPIE), 84471 (In the text)
 Jolissaint, L., Veran, J.P., & Conan, R. 2006, J. Opt. Soc. Am A, 23, 382 [NASA ADS] [CrossRef] (In the text)
 Lawson, J. K., Wolfe, C. R., Manes, K. R., et al. 1995, in Optical Manufacturing and Testing, eds. V. J. Doherty, & H. P. Stahl (Bellingham: SPIE), 38 (In the text)
 Lazorenko, P. F., Mayor, M., Dominik, M., et al. 2009, A&A, 505, 903 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Noll, R. J. 1976, J. Opt. Soc. Am., 66, 207 [NASA ADS] [CrossRef] (In the text)
 Roddier, F., Northcott, M. J., Graves, J. E., McKena, D. L., & Roddier, D. 1993, J. Opt. Soc. Am. A, 10, 957 [NASA ADS] [CrossRef] (In the text)
 Schoeck, M. 2011, in Adaptive Optics for Extremely Large Telescopes 2, eds. J.P. Veran, T. Fusco, & Y. Clenet (Paris: ONERA), 721 (In the text)
 Trippe, S., Davies, R., Eisenhauer, F., et al. 2010, MNRAS, 402, 1126 [NASA ADS] [CrossRef] (In the text)
 Yelda, S., Lu, J. R., Ghez, A. M., et al. 2010, ApJ, 725, 331 [NASA ADS] [CrossRef] (In the text)
All Figures
Fig. 1 Sensitivity function relating quasistatic wavefront errors on the deformable mirror DM11.2 in the TMT AO system NFIRAOS to the resulting image distortion after loworder mode correction using reference stars. See the text for further parameter values. 

Open with DEXTER  
In the text 
Fig. 2 Normalized astrometry error for beam wander with global tip/tilt removed. See Eq. (43) for the definition of this sensitivity coefficient. 

Open with DEXTER  
In the text 
Fig. 3 Normalized astrometry error for beam wander with global tip/tilt and plate scale removed. See Eq. (43) for the definition of this sensitivity coefficient. 

Open with DEXTER  
In the text 
Fig. 4 Normalized rms position estimation error due calibration undersampling. See Eq. (47) for the definition of this sensitivity coefficient. 

Open with DEXTER  
In the text 