Free Access
Issue
A&A
Volume 567, July 2014
Article Number A75
Number of page(s) 13
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/201423940
Published online 16 July 2014

© ESO, 2014

1. Introduction

A particular interest of astronomical long-baseline optical interferometers is their ability to perform high accuracy angular astrometry (Shao & Colavita 1992). The basic idea is that astrometry can leverage on the high-resolution offered by baselines of several hundred meters. However, implementing astrometry is not straightforward. The practical measurement is a monitoring of the paths of the stellar lights inside the facility. More precisely, it is a measurement of the difference in optical path (OP) length between each arm of the interferometer. This is usually done with an internal metrology system, going from the interferometric lab up to the telescopes. The idea behind this is that an homodyne interferometer converts an angular position on the sky to an optical path difference (OPD). In the first-order approximation, this optical path difference is equal to the direction of the star in the sky (s) projected onto the baseline vector of the interferometer (B): OPD=s·B.\begin{equation} OPD=\vec s \cdot \vec B. \label{eq1} \end{equation}(1)Within this simple equation is hidden a lot of complexity. Above all, how is the baseline vector defined? The intuitive answer is to define the baseline by the vector that links the geographical position of the two telescopes. But unfortunately, it only makes sense in the case of two perfectly identical telescopes. This immediately implies some questions: What happens when the two telescopes are not identical? What happens if, for one reason or another, the optical alignment of the two telescopes is not identical?

When dealing with precision astrometry, this kind of problem is pivotal. The main scientific objective of the GRAVITY instrument is to probe the event horizon of the Galactic center black hole (Eisenhauer et al. 2011). It will have to reach unprecedented astrometric accuracy between two one-arcsec-separated objects on the order of 10 μas (ten part par million). According to Eq. (1), it means that the baseline of the Very Large Telescope Interferometer (VLTI) must be know to a submillimeter level.

To complicate the fact that the baseline must be known to an extreme precision, the literature defines several baselines (Colavita et al. 1999; Hrynevych et al. 2004; Sahlmann et al. 2013). From the simple definition of the baseline by Lawson (2000) (the distance between two apertures), emerged the idea of a wide angle baseline (WAB), a narrow angle baseline (NAB, Colavita 2009) and an IMaging Baseline (IMB, Woillez & Lacour 2013). The idea behind these definitions was that each baseline corresponds to a given use of the interferometer.

But all these baselines affect the astrometric measurement to various degrees. Woillez & Lacour (2013) used the so-called “limit points” to define the baselines vectors. Instead of defining the vector by its direction and length, the baseline vector is defined by the point from where it emerges to the point where it goes: the vector is also defined by its geographical position. In this paper, we establish how the position of the various limit points affect the OPD measurement. Errors due to optical aberrations are intrinsically linked to the position of the limit points, and we will see that they are an important part of astrometric limitations. Throughout this paper, we build an error budget, focusing on the biases that occur due to baseline errors and optical aberrations and showing how GRAVITY, through pupil control, plans to keep these bias below 10 μas. This paper does not intend to cover random process errors like photon noise, detector noise, or background noise.

This paper starts with a presentation of “The baseline paradox of optical interferometry” in Sect. 2. It shows in a pedagogical way how too simple an assumption on the concept of baseline leads to conflicting observations. In Sect. 3, we use a three-dimensional model of the interferometer to calculate the impact of the different baselines’ limit points on the OPD measurement. However, this analytical modeling is not enough to account for aberrations of any order higher than the tip-tilt, so in Sect. 4 we simplify the problem to two dimensions to investigate how these aberrations modify the OPD. The rest of the sections apply those results to the GRAVITY instrument. Section 5 describes how the instrument works and the technical choices made for the astrometric measurement. Section 6 puts concrete numbers on the errors and the way they affect astrometry. The end-product of this section is the error budget presented in Table 1. Section 7 concludes.

2. Necessity for multiple baselines

2.1. An apparent paradox

thumbnail Fig. 1

The baseline paradox. Consider a thought experiment where an interferometer is equipped with two masks that rotate in the pupil of both telescopes. By doing so, the distance between the two apertures changes with time while the telescopes stay firm. In the left panel, the observation is done on a single target, acquired on the axis of the telescopes (perpendicular to the plane of the aperture). The OPD measurement is done between the white-light fringe and the zero reference of the interferometer. The OPD does not change with the rotation of the masks. In the right panel, the interferometer is recording two fringe packets obtained by observing two stars in the same field. The difference in OPD is measured between the two white-light fringes. δOPD does change with the rotation of the masks.

Consider a two-telescope interferometer as depicted in Fig. 1. In front of each telescope there is an opaque mask with a circular aperture. The aperture is not centered with respect to the optical axis of the telescope. The two masks are rotating, moving the respective holes in different directions.

In the first case (left panel), the interferometer looks at a single star. This star is exactly in the direction of observation of each telescope. For an unresolved object, the incoming wavefront arrives perpendicularly to the pupil of the telescope1. As such, the masks cut out part of the incoming wavefront. But what matters is the cross-product between the aperture and the phase of the wavefront over the pupil: the wavefront being flat over the pupil, the result of the cross-product does not depend on the position of the hole. In fact, the phase averaged over the aperture is the same as it would have been if obtained without any mask. As a result, the position of the white-light fringe in the beam combiner is not affected by the rotation of the two masks, even though the distance between the two apertures changes. The optical path difference between the white-light fringe and the zero of the interferometer depends on the distance between the two telescopes (B1): OPD=s·B1.\begin{equation} OPD = \vec s \cdot \vec {B}_1 . \end{equation}(2)The zero of the interferometer is the point at the equal optical path distance of the pupils of the two telescopes.

In the second case (right panel of Fig. 1), we are not interested in the exact OPD of the fringes, but in the differential OPD between two fringe patterns. The interferometer is simultaneously observing two celestial sources of the same field. Equation (1) becomes a differential measurement: δOPD=(sp)·B2\begin{equation} \delta OPD=(\vec s- \vec p) \cdot \vec {B}_2\, \end{equation}(3)where (sp) is the difference in astrometric position between the two sources, and δOPD the difference in position between the two white-light fringes. In this case, the delay in optical length between the fringe patterns matches the distance between the two apertures, projected in the direction of the source. This time, the relevant baseline is the one that is modulated (B2), and the differential position of the fringes will move when the masks rotate.

The paradox is therefore that, on two identical interferometers, two behaviors for the fringes positions can coexist. In one case (single star on-axis), the fringe pattern is fixed in absolute position, while for the other case (two stars), one fringe pattern is moving with respect to the other. In other words, how is it possible that two different baselines B1 and B2 can simultaneously exist and apply to the same interferometer?

2.2. Solution

To face this paradox (Woillez & Lacour 2013) proposed several definition of baselines, which would apply to different kinds of interferometers (or different uses of the same interferometer, as shown above). The idea is that different baselines are used depending on the way the OPD is calculated. The wide angle baseline (WAB) is defined by the two limit points that are the pivot points of the telescopes (the center of rotation of the telescopes). The imaging baseline (IMB) is defined by the two limit points that are the barycentric mean of the effective pupils of each aperture. It must be used when observing at the same time multiple objects inside a same field of view. Last, the narrow angle baseline (NAB) is defined by the limit points that are the image of the metrology end points in primary space. In the example of the previous section, the B1 baseline is the WAB and B2 the IMB.

Some baselines are easier to work with than others. The baseline must be stable to give an accurate astrometric measurement. An ideal baseline would have two properties:

  • 1.

    it should be fixed with respect to the sidereal reference frame or,at least, fixed within a coordinate frame that can be accuratelyconverted into the sidereal frame (e.g., the terrestrial referenceframe);

  • 2.

    it should be fixed with respect to the interferometric plane where the white-light fringe is observed.

The word “fix” has the meaning that it does not change with respect to time, telescope pointing, or stellar object position. Alternatively, instead of being fixed the baseline “fix” can be replaced by “predictable”, although it will complicate data reduction. Requirement (1) is to be able to project the baseline in the reference frame of the astronomical object. Requirement (2) is to produce an OPD measurement independent of the position of the objects in the interferometric field. By definition the pivot points fulfill Req. (1), while the pupil barycenter fulfills Req. (2) (Woillez & Lacour 2013) . The NAB does not necessarily fulfill any of those two properties.

2.3. Practical examples

Depending on the type of interferometers, some baseline definitions are more useful than others. As an example, the WAB is zero for any kind of single telescope interferometer (like SAM on NACO; Tuthill et al. 2010), as well as an interferometer like the LBT (Hill et al. 2006): BWAB = 0. This is because the whole structure rotates around a single pivot point: each entrance aperture has the same pivot point. For these interferometers, the IMB, which is the distance between two apertures (LBT), holes (SAM), or mirrors (Michelson stellar interferometer), is the adequate baseline for interpreting the data.

Long-baseline interferometers with a single-beam combiner (for example, AMBER, PIONIER; Petrov et al. 2007; Le Bouquin et al. 2011) also require using the imaging baseline. However, the imaging baseline is hard to monitor because the effective pupil depends on the whole optical train, pupil vignetting, telescope pointing, etc. In other words, it does not follow Req. (1). Often and for convenience, data reduction softwares thus assume that the IMB is identical to the WAB. However, users of those instruments need to be aware that it introduces an error into their interferometric measurement.

Finally, in dual feed interferometers like PTI (Colavita et al. 1999), PRIMA (Delplancke 2008), GRAVITY (Eisenhauer et al. 2011), or ASTRA (Woillez et al. 2010), the metrological link monitors a third baseline often called the NAB. We show in the following that the NAB should closely match the IMB in order to link the OPD metrology measurement with the angular separation. Moreover, dual feed interferometers have two IMB, one for each combiner. This complicates the equations by adding quasi-static error terms in the OPD measurement.

3. Baseline limit-points and OPD measurement

3.1. The wide angle baseline

thumbnail Fig. 2

Schematic view of a single beam interferometer. The light paths are represented by light rays. The real light rays are reflected by mirrors up to the beam combiner inside the interferometric lab. However, in this simplified representation, all the mirrors are removed. Thus, the represented virtual light rays are straight lines from the star to the virtual images of the beam combiner. The mechanical structure of the interferometer is reduced to two pivot points, L1 and L2 which are the pivot points of the telescopes. The wide angle baseline (WAB) is represented as a vector between the two pivot points: BWAB=0L1L2\hbox{$\vec {B}_{\rm WAB} = \protect \overrightarrow{\vec {L}_1\vec{L}_2}$}. The OPD measurement is made between the beam combiners and the metrology end point. It is related to the stellar position through Eq. (8).

Figure 2 shows the simplest representation possible of a purely theoretical astrometric interferometer. The telescopes are represented by two geographically-localized points L1 and L2. The telescopes point by rotating around themselves, i.e., L1 and L2 are also the pivot points. All mirrors have been suppressed in the representation, and the optical trajectories are represented by virtual rays. Because of this, the beam combiner have two virtual images, one in each arm of the interferometer (in reality, thanks to folding mirrors, the fringes are – of course – physically located at the same position).

The interferometer metrology probes two optical length measurements, one for each telescope. The astrometric measurement uses the difference between the two: OPD=OP2INTOP1INT.\begin{equation} OPD = OP_{\rm 2-INT} - OP_{\rm 1-INT} . \label{OPDint} \end{equation}(4)The two values OP1 − INT and OP2 − INT are measured, to a constant, between the central fringe2 and the telescope pivot points L1 or L2. In all of Sect. 3, we assume that the metrology measurement is obtained with respect to the central fringe, i.e. without any additional OPD introduced at the beam combiner level.

Since by definition the central fringe is the fringe at equal optical path, we can establish, by the following equation, the relation between OPINT and OPEXT: OP1EXT+OP1INT=OP2EXT+OP2INT,\begin{equation} OP_{\rm 1-EXT} + OP_{\rm 1-INT} = OP_{\rm 2-EXT} + OP_{\rm 2-INT} \label{OPDequal} , \end{equation}(5)where the two numbers OP1 − EXT and OP2 − EXT are the optical path lengths up to the stellar emission, outside the interferometer.

Last, the equation that links the two external optical paths is the projection in the stellar direction of the three-dimensional vector delimited by the two pivot points. According to the baseline definitions in Woillez & Lacour (2013), this vector is the WAB: OP1EXTOP2EXT=s·0L1L2.\begin{equation} OP_{\rm 1-EXT} - OP_{\rm 2-EXT} = \vec s \cdot \overrightarrow{ \vec { L}_1\vec{L}_2} \label{OPDproj} . \end{equation}(6)In the whole paper, we use an arrow to represent a vector that goes from a point X to a point Y: 0XY=YX\hbox{$\overrightarrow{ \vec {XY } }= \vec { Y} - \vec { X }$}. Here, 0L1L2=L2L1=BWAB.\begin{equation} \overrightarrow{ \vec { L}_1\vec{L}_2 }= \vec { L}_2 - \vec { L}_1 = \vec {B}_{\rm WAB} . \end{equation}(7)The advantage of the WAB is that it is theoretically fixed with respect to the ground: it does not change with the pointing direction of the telescope.

Equations (4)(6) directly lead to the relation that links the OPD measurement to the stellar position: OPD=s·BWAB.\begin{equation} OPD = \vec { s } \cdot \vec {B}_{\rm WAB} . \label{OPDWAB} \end{equation}(8)

3.2. The narrow angle baseline

thumbnail Fig. 3

Schematic view of an astrometric interferometer. The light paths are virtual, as seen from the star, and all mirrors are removed, resulting in a straight line from the star to the beam combiner. Only the wide angle baseline (WAB) is represented, but all the baselines limit points are represented. L1 and L2 are the pivot points of the telescopes: BWAB=0L1L2\hbox{$\vec {B}_{\rm WAB} = \protect \overrightarrow{\vec {L}_1\vec{L}_2}$}. The M1 and M2 points are the end points of the metrology: BNAB=0M1M2\hbox{$\vec {B}_{\rm NAB} =\protect \overrightarrow{\vec{M}_1\vec{M}_2}$}. Last, IP1, IP2, IS1 and IS2 are the barycenters of the effective pupils of each beam combiners: BIMBS=0I1SI2S\hbox{$\vec {B}_{\rm IMB-S} = \protect \overrightarrow{\vec {I}_{\rm 1S}\vec{I}_{\rm 2S}}$} for the first combiner and BIMBP=0I1PI2P\hbox{$\vec{B}_{\rm IMB-P} =\protect \overrightarrow{\vec{I}_{\rm 1P}\vec{I}_{\rm 2P}}$} for the second. Upper panel: the metrology measurement is made between the beam combiners and the metrology end point. The end points are different from the pivot points, which justify in Eq. (13) the use of the narrow angle baseline instead of the WAB. Lower panel: full schematic representation of the interferometer, including the presence of the imaging limit points. Imaging limit points are necessary to account for error terms due to difference in the propagation direction of the metrology (m1S, m2S, m1P and m2P) with respect to the stellar beams (s and p). It introduces additional error terms that are stated in Eq. (21).

The upper panel of Fig. 3 increases the complexity with respect to Fig. 2 by including four measurements instead of two: the interferometer is now observing two different stars: p and s. The OP measurements are made between the white-light fringes and the end points of the metrology signal, defined in geographic space: M1 and M2. The end points can be inside the optical train. If this is the case, M1 and M2 are the virtual images of the metrology end points, as seen from the star (in the sidereal reference frame). Then, the OPD measurement is twice differential: δOPD=OP2SINTOP1SINT+OP1PINTOP2PINT.\begin{equation} \delta OPD = OP_{\rm 2S-INT} - OP_{\rm 1S-INT} + OP_{\rm 1P-INT} - OP_{\rm 2P-INT} . \label{eqOPDm} \end{equation}(9)Similarly to Sect. 3.1, we can combine Eq. (5), which links the internal OPs to the external OPs, to Eq. (6), which links the external OPs to the baseline vector 0M2M1\hbox{$ \overrightarrow{ \vec { M}_2 \vec{M}_1}$}, to derive OP2SINTOP1SINT=s·0M1M2OP2PINTOP1PINT=p·0M1M2.\begin{eqnarray} OP_{\rm 2S-INT} - OP_{\rm 1S-INT} = \vec s \cdot \overrightarrow{\vec {M}_1\vec {M}_2} \label{OPDMM1}& \\ OP_{\rm 2P-INT} - OP_{\rm 1P-INT} = \vec p \cdot \overrightarrow{\vec {M}_1\vec {M}_2} \label{OPDMM2}.& \end{eqnarray}Then, we can define the narrow angle baseline: 0M1M2=0M1L1+BWAB+0L2M2=BNAB.\begin{equation} \overrightarrow{\vec{M}_1\vec{M}_2} = \overrightarrow{ \vec{M}_1\vec{L}_1}+ \vec {B}_{\rm WAB}+ \overrightarrow{\vec{L}_2\vec{M}_2}=\vec{B}_{\rm NAB}. \end{equation}(12)As shown in Woillez & Lacour (2013) it is the one that must be used to derive the separation between the two stellar objects: δOPD=(sp)·BNAB.\begin{equation} \delta OPD = (\vec s - \vec p) \cdot \vec{B}_{\rm NAB} . \label{OPDNAB} \end{equation}(13)A difficulty with the narrow angle baseline is that the limit points (the end points of the metrology) are generally defined in the telescope reference frame. As a consequence, the NAB depends on the telescope pointing. Also, if the metrology end points are not physically located in the telescope frame, but inside the intereferometric optical train, the narrow angle baseline is more difficult to calculate. It is then defined by the virtual images of the metrology end point, as seen from outside the telescope.

3.3. The imaging baseline

It is necessary to introduce the notion of the imaging baseline once we want to consider the impact of the field position of the targets on the metrology. It can be traced back to the van-Cittert Zernike theorem, which links contrast and phase to the spatial geometry of the astrophysical object (Woillez & Lacour 2013).

For a dual beam interferometer like GRAVITY, there are two imaging baselines: one for each beam combiner. This is because the limit points of the imaging baseline are defined by the effective pupil, which is different from the telescope pupil. For a single-mode interferometer, the effective pupil can be properly determined. It is the acceptance mode of the single-mode fibers as seen from the star (viewed through the telescope pupil). For a dual beam combiner there are four acceptance modes, thus four limit points or two IMBs. In the lower panel of Fig. 3, the limit points are noted I1S and I2S for the first beam combiner, and I1P and I2P for the second. These limit points do not have to be colocated with the telescope pupil center: this is called a pupil positioning error, which can be both lateral and longitudinal.

The propagation directions of the metrology beams are represented in Fig. 3 by the vectors m1S, m2S, m1P, and m2P. The propagation directions of the incoming light of the stellar sources are still s and p. That means that, for example, the vector difference (m2Ss) characterizes the pointing error of the metrology of the first beam combiner on the second telescope toward the source denoted s. This is equivalent to a tip-tilt error.

The imaging baseline limit points have, by definition, an important property: the optical path length between the fringes and the limit points is the same for the metrology and for the stellar light. This is a property of the effective pupil: the pupil is field-independent3. As a consequence, OP1S − INT defined between the beam combiner and the imaging baseline limit point does not depend on the (m1Ss) tilt error. This is also the case for the three other optical path lengths: OP2S − INT, OP1P − INT, and OP2P − INT.

However, the δOPD measurement is done up to the metrology end point. Following the conventions from the lower panel of Fig. 3, Eq. (9) must be rewritten to include the optical path length corresponding to the projected distance between the imaging and metrology end points (the four m·IM terms). Thus, δOPD measured between M1 and M2 becomes δOPDM12=OP2SINT+m2S·0I2SM2OP1SINTm1S·0I1SM1+OP1PINT+m1P·0I1PM1\begin{eqnarray} \delta OPD_{M_{12}} &=& OP_{\rm 2S-INT} + \vec{m}_{\rm 2S} \cdot \overrightarrow{ \vec {I}_{\rm 2S}\vec{M}_2} \nonumber \\ &&-\,\, OP_{\rm 1S-INT} - \vec{m}_{\rm 1S} \cdot \overrightarrow{ \vec { I}_{\rm 1S}\vec{M}_1} \nonumber \\ &&+ \,\, OP_{\rm 1P-INT} + \vec{m}_{\rm 1P} \cdot \overrightarrow{ \vec { I}_{\rm 1P}\vec{M}_1} \nonumber \\ &&-\,\, OP_{\rm 2P-INT} - \vec{m}_{\rm 2P} \cdot \overrightarrow{ \vec { I}_{\rm 2P}\vec{M}_2} . \label{OPDII0} \end{eqnarray}(14)On the other hand, the internal and external optical path lengths must satisfy Eq. (5). In the same way that we obtained Eqs. (10) and (11), we can derive OP2SINTOP1SINT=OP2PINTOP1PINT=\begin{eqnarray} OP_{\rm 2S-INT} - OP_{\rm 1S-INT} &=& \vec s \cdot \overrightarrow{ \vec {I}_{\rm 1S}\vec {I}_{\rm 2S}} \label{OPDII1} \\ OP_{\rm 2P-INT} - OP_{\rm 1P-INT} &=& \vec p \cdot \overrightarrow{ \vec {I}_{\rm 1S}\vec {I}_{\rm 2S}} \label{OPDII2} , \end{eqnarray}where the vectors 0I1SI2S\hbox{$\overrightarrow{\vec{I}_{\rm 1S}\vec{I}_{\rm 2S}}$} and 0I1SI2S\hbox{$\overrightarrow{ \vec{I}_{\rm 1S}\vec{I}_{\rm 2S}}$} correspond to the two imaging baselines: 0I1SI2S=0I1PI2P=\begin{eqnarray} \overrightarrow{ \vec{I}_{\rm 1S}\vec{I}_{\rm 2S}}&=&\overrightarrow{ \vec {I}_{\rm 1S}\vec{L}_1} +\vec {B}_{\rm WAB} + \overrightarrow{ \vec {L}_2\vec{I}_{\rm 2S}}= \vec { B}_{\rm IMB-S} \label{OPDII3} \\ \overrightarrow{ \vec {I}_{\rm 1P}\vec{I}_{\rm 2P}}&=& \overrightarrow{ \vec{I}_{\rm 1P}\vec{L}_1} +\vec{B}_{\rm WAB} + \overrightarrow{ \vec {L}_2\vec{I}_{\rm 2P}}= \vec { B}_{\rm IMB-P} \label{OPDII4} . \end{eqnarray}Combining together Eqs. (14)(18), we can link the δOPDM12 measurement to the differential angular position (sp) and to the wide angle baseline BWAB: δOPDM12=(sp)·BWAB+m2S·0I2SM2m1S·0I1SM1+m1P·0I1PM1m2P·0I2PM2+s·(0I1SL1+0I2SL2)p·(0I1PL1+0I2PL2).\begin{eqnarray} \delta OPD_{M_{12}} &=& \left(\vec s - \vec p\right) \cdot \vec { B}_{\rm WAB} \nonumber \\ &&+\,\, \vec{m}_{\rm 2S} \cdot \overrightarrow{ \vec { I}_{\rm 2S}\vec{M}_2} - \vec { m}_{\rm 1S }\cdot \overrightarrow{ \vec {I}_{\rm 1S}\vec{M}_1} \nonumber \\ && + \,\, \vec{m}_{\rm 1P} \cdot \overrightarrow{ \vec { I}_{\rm 1P}\vec {M}_1} - \vec{m}_{\rm 2P} \cdot \overrightarrow{ \vec{I}_{\rm 2P}\vec {M}_2} \nonumber \\ && +\,\, \vec s \cdot \left(\overrightarrow{ \vec{I}_{\rm 1S}\vec{L}_1} + \overrightarrow{ \vec {I}_{\rm 2S}\vec{L}_2}\right) \nonumber \\ && - \,\,\vec p \cdot \left(\overrightarrow{ \vec { I}_{\rm 1P}\vec{L}_1} + \overrightarrow{ \vec {I}_{\rm 2P}\vec{L}_2}\right) . \end{eqnarray}(19)This equation simplifies to δOPDM12=(sp)·(BWAB+0M1L10M2L2)+(m2Ss)·0I2SM2(m1Ss)·0I1SM1\begin{eqnarray} \delta OPD_{M_{12}} &=& \left(\vec s - \vec p\right) \cdot \left(\vec{B}_{\rm WAB} + \overrightarrow{\vec{M}_1\vec{L}_1} - \overrightarrow{\vec{M}_2\vec{L}_2}\right) \nonumber \\ &&+\,\, \left(\vec{m}_{\rm 2S} - \vec s\right) \cdot \overrightarrow{\vec {I}_{2S}\vec{M}_2} - \left(\vec{m}_{\rm 1S} - \vec s\right) \cdot \overrightarrow{\vec {I}_{\rm 1S}\vec{M}_1} \nonumber \\ && +\,\, \left(\vec{m}_{\rm 1P} - \vec p\right) \cdot \overrightarrow{\vec {I}_{\rm 1P}\vec{M}_1} - \left(\vec{m}_{\rm 2P} - \vec p\right) \cdot \overrightarrow{\vec {I}_{\rm 2P}\vec{M}_2} \, \label{OPDIMGL} \end{eqnarray}(20)or toδOPDM12=(sp)·BNAB+(m2Ss)·0I2SM2(m1Ss)·0I1SM1\begin{eqnarray} \delta OPD_{M_{12}} &=& (\vec s - \vec p) \cdot \vec{B}_{\rm NAB} \nonumber \\ &&+\,\, (\vec{m}_{\rm 2S} - \vec s) \cdot \overrightarrow{\vec {I}_{2S}\vec{M}_2} - (\vec {m}_{\rm 1S} - \vec s) \cdot \overrightarrow{\vec {I}_{\rm 1S}\vec{M}_1} \nonumber \\ && + \,\, (\vec{m}_{\rm 1P} - \vec p) \cdot \overrightarrow{\vec {I}_{\rm 1P}\vec{M}_1} - (\vec {m}_{\rm 2P} - \vec p) \cdot \overrightarrow{\vec { I}_{\rm 2P}\vec{M}_2} . \label{OPDIMG} \end{eqnarray}(21)Hence, δOPDM12 results from the sum of two components. The first component, (spBNAB, is the main contributor to the full measurement. It reflects the assertion stated in Woillez & Lacour (2013) that the NAB baseline is the main baseline of a dual feed astrometric interferometer. However, several second-order terms are present because the imaging baseline limit points are not necessarily the same as the metrology end points (0IM00\hbox{$\overrightarrow{\vec {IM}} \neq \overrightarrow{ 0}$}). Since both 0IM\hbox{$\overrightarrow{\vec {IM}}$} and (ms) are small, the amplitude of the scalar product will be almost negligible compared to the total optical path difference. However, if no care is taken, it adds to the δOPDM12 in such a way that it will bias the estimation of the angular separation (sp).

It is possible to combine the second-order terms to show the influence of the differential length between the imaging baselines BIMB and the narrow angle baseline BNAB: δOPDM12=(sp)·BNAB+(m2S+m1S2s)·(BNABBIMBS)(m2P+m1P2p)·(BNABBIMBP)+(m2Sm1S4+m2Pm1P4)·(0I1SI1P+0I2SI2P)+(m2Sm2P4m1Sm1P4)·(0I1SM1+0I1PM1)+(m2Sm2P4m1Sm1P4)·(0I2SM2+0I2PM2).\begin{eqnarray} \delta OPD_{M_{12}} &=& \left(\vec s - \vec p\right) \cdot \vec{B}_{\rm NAB} \nonumber \\ &&+ \left(\frac{\vec{m}_{\rm 2S}+\vec{m}_{\rm 1S}}{2} - \vec s\right) \cdot \left( \vec{B}_{\rm NAB} - \vec { B}_{\rm IMB-S}\right) \nonumber \\ &&- \left(\frac{\vec{m}_{\rm 2P}+\vec{m}_{\rm 1P}}{2} - \vec p\right) \cdot \left( \vec{B}_{\rm NAB} - \vec { B}_{\rm IMB-P}\right) \nonumber \\ &&+ \left(\frac{\vec{m}_{\rm 2S}-\vec{m}_{\rm 1S}}{4} + \frac{\vec{m}_{\rm 2P}-\vec{m}_{\rm 1P}}{4} \right) \cdot \left( \overrightarrow{\vec {I}_{\rm 1S}\vec{I}_{\rm 1P}} + \overrightarrow{ \vec {I}_{\rm 2S}\vec{I}_{\rm 2P}}\right) \nonumber \\ &&+ \left(\frac{\vec{m}_{\rm 2S}-\vec{m}_{\rm 2P}}{4} - \frac{\vec{m}_{\rm 1S}-\vec{m}_{\rm 1P}}{4} \right) \cdot \left(\overrightarrow{\vec { I}_{\rm 1S}\vec{M}_1} +\overrightarrow{ \vec { I}_{\rm 1P}\vec{M}_1}\right)\nonumber \\ &&+ \left(\frac{\vec{m}_{\rm 2S}-\vec{m}_{\rm 2P}}{4} - \frac{\vec{m}_{\rm 1S}-\vec{m}_{\rm 1P}}{4} \right) \cdot \left(\overrightarrow{\vec { I}_{\rm 2S}\vec{M}_2} +\overrightarrow{\vec { I}_{\rm 2P}\vec{M}_2}\right) .\nonumber \\ \label{OPDIMG123} \end{eqnarray}(22)According to this formula, δOPDM12 can be equal to δOPD as spelled out in Eq. (13) if all the three following conditions are fulfilled:

  • 1.

    The baselines match each other:BNAB = BIMB − S = BIMB − P.

  • 2.

    The limit points of the two imaging baselines are superposed: I1PI1S = I2SI2P (equal to 0 assuming 1).

  • 3.

    The metrology beams have the same propagation angle on each telescope: m1Sm1P = m2Sm2P.

This is not a situation that we assume in the following GRAVITY error budget.

4. High-order optical aberrations

The notion of imaging baseline is intrinsically related to the presence of a pointing error, or tip tilt (ms ≠ 0). These tilt-tilt errors are equivalent to low-order optical aberrations and are considered in the previous section. In this section, we include the effect of the higher order Zernike polynomials.

To be exact, a full diffraction analysis would have to be performed over the whole metrology train. That would mean including a Fresnel analysis over the optics down to the fiber for the stellar light and up to the metrology sensor for the metrology beam. However, we can considerably simplify the problem by neglecting the effect of the z direction inside the optical train. It is equivalent to considering that the atmosphere, the metrology sensor, the pupil of the telescopes, the pupil of the instruments, and the optical aberrations are conjugated inside a single plane. In that plane, only the two lateral directions (x and y) matter. Then, the effect of the high-order aberrations can be seen as a problem of a two-dimensional coupling of the light onto the modes of the beam combiners.

4.1. Aberrations affecting the stellar light

In the case of a single-mode interferometer, the modes of the beam combiners can be identified as the modes of acceptance of the single-mode fibers. The phase delays due to the optical aberrations are therefore the argument of the coupling coefficient of the abnormal light onto the mode of each fiber: δΦ = arg(ρ), where δΦ is the phase error caused by the aberration, and ρ the coupling coefficient of the light into the fiber (Robieux 1959; Shaklan & Roddier 1988): ρ=!pupilE(x,y)EF(x,y)dxdy!pupil|E(x,y)s|2dxdy!pupil|EF(x,y)|2dxdy\begin{equation} \rho = \frac{\iint \limits_{\rm pupil} \! E(x,y) E_{\rm F}^*(x,y) \, \mathrm dx \mathrm dy }{\sqrt{\iint \limits_{\rm pupil} \! |E(x,y)_{\rm s}|^2 \, \mathrm dx \mathrm dy \iint \limits_{\rm pupil} \! |E_{\rm F}(x,y)|^2 \, \mathrm dx \mathrm dy}} \end{equation}(23)where E(x,y) is the incoming aberrated wavefront, and EF(x,y) is the acceptance mode for the electric field of the single-mode fiber. The cross-correlation is easier to understand in the focal plane, i.e. at the entrance of the fiber. However, according to the Parseval-Plancherel theorem, it can also be calculated in the pupil plane, which turns out to be more convenient for modeling the aberrations.

To calculate the impact on the OPD measurement, we have to consider the modes of all four fibers (EFs1, EFs2, EFp1, and EFp2) and of the four distinct incoming wavefront (Es1, Es2, Ep1, and Ep2). Then, the phase error on the OPD measurement will be δΦstellar=arg(pupilEs1(x,y)EFs1(x,y)dxdy)arg(pupilEs2(x,y)EFs2(x,y)dxdy)arg(pupilEp1(x,y)EFp1(x,y)dxdy)\begin{eqnarray} \delta \Phi_{\rm stellar}&=&\arg \left( \iint \limits_{\rm pupil} \! E_{\rm s1}(x,y) E^*_{\rm Fs1}(x,y) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&-\arg \left( \iint \limits_{\rm pupil} \! E_{\rm s2}(x,y) E^*_{\rm Fs2}(x,y) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&-\arg \left( \iint \limits_{\rm pupil} \! E_{\rm p1}(x,y) E^*_{\rm Fp1}(x,y) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&+\arg \left( \iint \limits_{\rm pupil} \! E_{\rm p2}(x,y) E^*_{\rm Fp2}(x,y) \, \mathrm dx \mathrm dy \right) \label{eq:phistel} , \end{eqnarray}(24)where the integration is done over the electric fields present inside the pupil.

4.2. Aberrations affecting the metrology light

While the phases measured inside the beam combiners are affected by the full aberration of the stellar light, the metrology value is modified solely by the optical aberrations inside the optical train. These aberrations can be quantified according to the Zernike polynomials Zn(x,y) (n according to numbering by Noll 1976) of variance unity. Each one of the four metrology beams can be affected by these aberrations. However, only a differential aberration will affect the differential metrology measurement. This is a beam-walk effect: the two stellar beams have different footprints on the optics, which can result in a differential optical aberration. Assuming that the aberrations are a sum of Zernike of amplitude ks1n\hbox{$k^n_{\rm s1}$}, ks2n\hbox{$k^n_{\rm s2}$}, kp1n\hbox{$k^n_{\rm p1}$}, and kp2n\hbox{$k^n_{\rm p2}$} (one for each one of the four metrology beams), we can derive a phase error to the metrology: δΦmet=receiverstel1n[(ks1nkp1n)Zn(x,y)]dxdy\begin{eqnarray} \delta \Phi_{\rm met}&=&\iint \limits_{\rm receivers\ tel1} \! \sum_n \left[(k^n_{\rm s1} - k^n_{\rm p1})\ Z^n(x,y) \right] \mathrm dx \mathrm dy \nonumber \\ &&-\iint \limits_{\rm receivers\ tel2} \! \sum_n \left[(k^n_{\rm s2} - k^n_{\rm p2})\ Z^n(x,y) \right] \mathrm dx \mathrm dy\label{eq:phimet} . \end{eqnarray}(25)The full metrology measurement is a phase measurement averaged over all the metrology receivers. This is a general case. Depending on the type of metrology system and position/number of receivers, this integral can be a simple summation of several measurements at given (x,y) positions.

5. The GRAVITY instrument

5.1. GRAVITY metrology principle

thumbnail Fig. 4

GRAVITY active subsystems involving the astrometric measurement. In red is the optical path of the reference star, in blue the optical path of the science star. The dotted lines are free space-light propagation, and the solid lines correspond to the single-mode fibers. The reference star is used to correct the atmospheric piston in real time by activating the internal piezo and/or the VLTI delay lines. In green is represented the metrology system, injected into the two integrated optics beam combiners (IO), and measured thanks to 4 metrology diodes inside the cryostat, and 4 metrology diodes on each telescope. The metrology measurements are used to control the GRAVITY differential delay lines to position the fringes on the science camera. Yellow represents the acquisition camera control loop that commands the tip-tilt and the pupil position. Orange is the open loop polarization control system.

As mentioned before, GRAVITY is a four-telescope interferometer for the VLTI. It is operating in the K band and is able to couple the four 8.2 m Unit Telescopes (UTs) or the four 1.8 m auxiliary telescopes (ATs). It is assisted with a visible and infrared adaptive optics (AO) system for the UTs. GRAVITY is equipped with two beam combiners in integrated optics (Jocou et al. 2012), one for a reference source and the other for the science target. In this dual mode, the goal of GRAVITY is to provide an accurate differential phase measurement to achieve astrometry with ten micro-arcsecond accuracy in a two-arcsecond field (4-arcsec field with the ATs).

The reference source is used to cancel atmospheric perturbations over the six baselines thanks to a dedicated fringe tracking system. To do so, the GRAVITY instrument will have a fast feedback control loop using steering mirrors on piezo actuators inside the cryostat (with a cutoff bandwidth close to 30 Hz) and a slow feedback (for offloading) to the main VLTI delay lines. GRAVITY will have to fringe track on a K = 10 unresolved object with 300 nm OPD residuals, setting the basic specification of the system.

The basic functions impending the OPD measurement are presented in Fig. 4. The metrology is injected into both beam combiners (Gillessen et al. 2012). On one beam combiner, the metrology is phase-modulated with respect to the other with a lithium niobate phase modulator. Going through the optics in the reverse direction of the stellar light, the metrology beams encounter in the cryostat (Haug et al. 2012): the fibered differential delay lines, the fibered polarization rotators, the pupil actuator mirror, and the fast piezo actuators. Before exiting the cryostat, part of the metrology light is recombined on internal metrology diodes to provide constant and reliable feedback for the differential fiber delay lines (without risk of beam interruption). The rest of the metrology light is back-propagated through the VLTI optical train, across all the optics including M1, up to receivers situated on each four spider arms of each of the four telescopes. For each telescope a phase value is extracted from the mean of the phase value of the four telescope diodes: ΦMi, with i the telescope number.

The stellar light is following a similar path in reverse. Once in the cryostat, after the pupil actuator mirror and before injection into the fluoride fibers, the astrophysical sources are separated with a roof mirror acting as a field separator (Pfuhl et al. 2012). The light is guided inside the single-mode fibers up to integrated optics beam combiners, whose outputs are dispersed and imaged on infrared cameras. On one hand, the light of the bright reference source is sent to a fast eAPD detector (the SAPHIRA detector, Finger et al. 2012) from which a phase measurement is read for each five wavelength channels of the fringe tracker: ΦP(λs). This phase is kept close to zero by the means of the fringe tracker commanding the piezo actuators. On the other hand, the light of the science target is send to a Hawaii 2, which is slower (DIT time close to the second) but larger (2k by 2k pixels) than the SAPHIRA detector. On the Hawaii 2, the light is spectrally dispersed giving a resolving power of Δλ/λ = 20, 500 or 4000 (Straubmeier et al. 2012). On this detector one obtains the phase ΦS(λs). For simplicity, we consider phase measurements on both detectors to be done at a common wavelength λs.

All these phase measurements are combined to give six differential OPD measurements: one for each baseline. For example, the optical delay δOPDM12 is obtained between telescopes 1 and 2 by the relation: δOPDM12=(ΦSΦP)λs2π+(ΦM2ΦM1)λm2π·\begin{equation} \delta OPD_{M_{12}} = \left(\Phi_{\rm S} - \Phi_{\rm P}\right) \frac {\lambda_{\rm s}}{2 \pi} + \left(\Phi_{M_2}-\Phi_{M_1}\right) \frac{\lambda_{\rm m}}{2 \pi}\cdot \label{eqOPDPhi} \end{equation}(26)

5.2. Dispersion in GRAVITY

In Eq. (26) is missing an important term caused by dispersion. Dispersion can have an important impact on the contrast of the white-light fringes (Coudé du Foresto et al. 1995). But it also have an important effect on the astrometry. It comes from the fact that the apparent optical path length depends on the wavelength, as the OP length is proportional to the index of refraction of the medium through which it propagates. It writes as OP(λ)=n(λ)·L,\begin{equation} OP(\lambda)=n(\lambda)\cdot L , \end{equation}(27)where OP(λ) is the apparent optical path length and L the physical length. For an addition of several media along the optical path length measured by the metrology, dispersion introduces an OPD error equal to D(λm,λs)=medΔmed·(nmedλmnmedλs),\begin{equation} D(\lambda_{\rm m},\lambda_{\rm s})=\sum_{\rm med} \Delta_{\rm med} \cdot\left(n^{\lambda_{\rm m}}_{\rm med}-n^{\lambda_{\rm s}}_{\rm med}\right) , \end{equation}(28)where Δmed is the difference of physical length of each optical medium, and nmedλ\hbox{$n^{\lambda}_{\rm med}$} is the refractive index of the corresponding medium at a wavelength λ.

In GRAVITY, the wavelength of the metrology (λm = 1.908 μm) is different from the wavelength of the stellar light (1.95 μm <λs< 2.45 μm). On the path of the metrology, there are three different media: air, vacuum (inside the cryostat), and fluoride glass (fibers). The dispersion of vacuum is zero, so the dispersion term can be written as D(λm,λs)=Δair·(nairλmnairλs)+Δfiber·(nfiberλmnfiberλs).\begin{equation} D(\lambda_{\rm m},\lambda_{\rm s})= \Delta_{\rm air} \cdot(n^{\lambda_{\rm m}}_{\rm air}-n^{\lambda_{\rm s}}_{\rm air}) + \Delta_{\rm fiber} \cdot \left(n^{\lambda_{\rm m}}_{\rm fiber}-n^{\lambda_{\rm s}}_{\rm fiber}\right) . \label{eqDisp} \end{equation}(29)Combining Eqs. (26) with (21), and including the effect of dispersion, we can now write down the analytical formula that links the phase measurements to the angular separation of the targets on sky: (sp)·BNAB=(ΦSΦP)λs2π+(ΦM2ΦM1)λm2πD(λm,λs)\begin{eqnarray} (\vec s - \vec p) \cdot \vec { B}_{\rm NAB} &=&(\Phi_{\rm S} - \Phi_{\rm P})\frac{\lambda_{\rm s}}{2 \pi} +(\Phi_{M_2}-\Phi_{M_1})\frac{\lambda_{\rm m}}{2 \pi} - D(\lambda_{\rm m},\lambda_{\rm s}) \nonumber \\ &&+ \,\, {\rm error\ terms\ \ (2^{\rm nd}\,order)}\, \label{eqOPDtotal} . \end{eqnarray}(30)It is not foreseen to calculate the value of the second-order error terms. The goal is to keep them low enough in the error budget. They come from baseline errors and optical aberrations, as spelled out in Eqs. (21), (24), and (25): errorterms=(m1Ss)·0I1SM1(m2Ss)·0I2SM2+(m2Pp)·0I2PM2(m1Pp)·0I1PM1.\begin{eqnarray} {\rm error\ terms}&=& (\vec{m}_{\rm 1S} - \vec{s}) \cdot\overrightarrow{ \vec{I}_{\rm 1S}\vec{M}_1} - (\vec{m}_{\rm 2S} - \vec{s}) \cdot \overrightarrow{\vec {I}_{\rm 2S}\vec{M}_2} \nonumber \\ && +\,\, (\vec{m}_{\rm 2P} - \vec{p}) \cdot \overrightarrow{\vec{I}_{\rm 2P}\vec{M}_2} - ( \vec{m}_{\rm 1P} - \vec{p}) \cdot \overrightarrow{\vec {I}_{\rm 1P}\vec{M}_1} . \nonumber \\ &&+ \,\,\delta \Phi_{\rm stellar}\frac{\lambda_{\rm s}}{2 \pi} +\delta \Phi_{\rm met}\frac{\lambda_{\rm m}}{2 \pi}\cdot\label{eqOPDerr} \end{eqnarray}(31)

5.3. Geographical position of the limit-points of GRAVITY

thumbnail Fig. 5

Baseline representation for a two-telescope interferometer according to the GRAVITY concept. The baselines are defined by their limit points. The WAB is defined by the two pivot points (which defines the center of the telescopes reference frames). The NAB is defined by the geometrical average of the position of the four metrology photodiodes situated on the spider arms. The coordinate of the NAB limit points are (x, y, z) = (0, 0, 10 m). The two imaging baselines IMB1 and IMB2 (one for each beam combiner) have limit points that correspond to the pupil position of their respective beam combiners, weighted laterally by the telescope aperture, as well as the electric-field acceptance of the single-mode fibers. Internal pupil tracking inside the GRAVITY instrument will track the spider arms to place the IMB limit points on the NAB limit point. Expected lateral position will be |x,y| = 0 ± 4 cm. Expected longitudinal position will be z = 0 ± 10 km.

The idea that drove the design of the GRAVITY instrument is therefore to have monitoring and steering possibilities for both the pupil and the field plane. This is paramount for keeping the error terms in Eq. (31) under control. Focal and pupil plane positions are simultaneously supervised during observation by a dedicated camera situated inside the cryostat (Amorim et al. 2012).

The feedback system of this camera – called the acquisition camera – is presented in yellow in Fig. 4. The first terms to minimize are the field terms: miSs and miPp for each telescope i. To do so, the targets are imaged in the H band on the acquisition camera, and the sources are maintained on the fibers thanks to the tri-axis piezo mirror situated in the pupil plane of the instrument. The second terms to minimize are the mismatch between the imaging baseline limit points and the metrology end points: i.e., 0IiSMi\hbox{$\overrightarrow{\vec { I}_{\it i\rm S}\vec{M}_i}$} and 0IiPMi\hbox{$\overrightarrow{\vec { I}_{\it i\rm P}\vec{M}_i}$}. To do so, four light sources (emitting at a wavelength of 1.2 μm) are placed on the spider arms of each telescope. They are imaged through a 2 × 2 lenslet Shack-Hartmann on the acquisition camera. The low-order wavefront sensor allows both a lateral (tip-tilt) and longitudinal (focus) error of the position of the pupil to be derived. The lateral position is controlled thanks to a pupil actuator mirror situated in a focal plane inside the GRAVITY cryostat. The longitudinal error is corrected thanks to the VLTI delay-line variable curvature mirrors (VCM).

With this setup, we can position the imaging baseline limit points close to the limit points of the two other baselines. Figure 5 gives a schematic representation of the expected position of the WAB, NAB, IMB1, and IMB2 limit points in telescope coordinates.

Of all limit-points, the WAB limit points, the pivot points of the telescopes are, in theory, the easiest to locate in geographical coordinates. They are the center of rotation of the telescope, which can be determined by external measurements. However, problems can come from bending of the structure that may hinder the notion of a fixed pivot point.

On the other hand, the problem of the NAB and IMB limit points are that they are defined in telescope frame coordinates. Because of the telescope optical train (notably M2), the telescope frame coordinates are not necessarily colinear to the celestial frame of the object: the telescope reference frame can be several arcsec away from the direction of the target. The terrestrial coordinates are defined as (u, v, w). The telescope coordinates are defined as (x, y, z), with (0, 0, 0) the coordinate of the pivot point, (x, y) the transverse coordinates, and (z) the longitudinal coordinates.

On the UTs, the metrology measurements will be obtained thanks to four photodiodes positioned on the spider arms, symmetrically around the pupil. The (xd,yd) coordinates are (1.945, 1.692), (1.945, 1.692), (1.945, 1.692), and (1.945, 1.692) in meters. The positions were chosen to cancel any small defocus in the non-common path (Sect. 5.4). The metrology measurement ΦM will be the mean of the four phase measurements. The resulting limit point of the NAB will thus be the mean position of the four metrology sensors. It gives a NAB limit point close to the center of the beam (| ⟨ xd,yd ⟩ | ≈ 0). In longitudinal coordinates, the spider arms are at z ≈ 10 m above the pivot point.

The uncertainty of the position of the imaging baseline limit points is larger due to the relay optics between the cryostat and the telescope pupil. As a result, the effective pupils can move with respect to the telescope pupil. However, on GRAVITY, the internal pupil tracking system will stabilize the effective pupils close to the telescope pupil. The imaging limit points are expected to be at | x,y | = 0 ± 4 cm from the central optical axis. The tracking in the longitudinal direction should also be able to perform with an expected accuracy of z = 10 km between the metrology end points and the imaging baseline limit points4.

5.4. GRAVITY vulnerability to high-order aberrations

The two incoming stellar fields Ep(x,y) and Es(x,y) are defined over the eight meters of the telescope pupil except for the central 1.116 m in diameter due to secondary mirror obscuration (0.1395 of the pupil radius). Their phases are affected by both the atmospheric turbulence and the optical aberrations in the optical train.

The electromagnetic fields accepted by the fibers are also defined in the pupil: EFp(x,y) and EFs(x,y). These fields have a near-Gaussian profile whose amplitude can be approximated by EFp(x,y)=EFs(x,y)=exp(x2+y22σF2),\begin{equation} E_{\rm Fp}(x,y) = E_{\rm Fs}(x,y)= \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) , \end{equation}(32)where σF was chosen to maximize the injected flux at optimal correction (for a perfectly flat incoming wavefront), and σF = 0.644573 in units of pupil radius. The phases of the EFs and EFp electric fields are only important relative to the phase of the incoming stellar fields (Es and Ep, respectively), so we suppose the phases of the acceptance fields equal to zero. The phase errors resulting from these fields are therefore δΦstellar=arg(0.1395<ρ<1Es1(x,y)exp(x2+y22σF2)dxdy)arg(0.1395<ρ<1Es2(x,y)exp(x2+y22σF2)dxdy)arg(0.1395<ρ<1Ep1(x,y)exp(x2+y22σF2)dxdy)+arg(0.1395<ρ<1Ep2(x,y)exp(x2+y22σF2)dxdy)\begin{eqnarray} \delta \Phi_{\rm stellar}&=&\arg \left( \iint \limits_{0.1395\,<\,\rho\,<\,1} \! E_{\rm s1}(x,y) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&-\arg \left( \iint \limits_{0.1395\,<\,\rho\,<\,1} \! E_{\rm s2}(x,y) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&-\arg \left( \iint \limits_{0.1395\,<\,\rho\,<\,1} \! E_{\rm p1}(x,y) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy \right) \nonumber \\ &&+\arg \left( \iint \limits_{0.1395\,<\,\rho\,<\,1} \! E_{\rm p2}(x,y) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy \right) \nonumber \end{eqnarray}where ρ=x2+y2\hbox{$\rho=\sqrt{x^2+y^2}$}.

GRAVITY will have four metrology receivers on each telescope. According to Eq. (25), the effect of internal optical aberrations is therefore δΦmet=14d=14n[(ks1nkp1nks2n+kp2n)Zn(xd,yd)]\begin{equation} \delta \Phi_{\rm met}=\frac{1}{4} \sum_{d=1}^4 \! \sum_n \left[(k^n_{\rm s1} - k^n_{\rm p1}-k^n_{\rm s2} + k^n_{\rm p2})\ Z^n(x_{\rm d},y_{\rm d}) \right] \end{equation}(33)where (xd,yd) are the positions of the four metrology sensors, and ks1n\hbox{$k^n_{\rm s1}$}, kp1n\hbox{$k^n_{\rm p1}$}, ks2n\hbox{$k^n_{\rm s2}$}, kp2n\hbox{$k^n_{\rm p2}$} the amplitude of the aberrations affecting each one of the four metrology beams.

The four (xd,yd) sensors’ coordinates can be chosen so that optical aberrations affecting both stellar and metrology beams can be fully compensated: δΦstellarλs + δΦmetλm = 0. Because of the limited number of metrology receivers, the number of Zernike polynomials whose effect can be nullified is limited. To cancel a polynomial n, the (xd,yd) coordinates must satisfy the relation 14d=14Zn(xd,yd)=!0.1395<ρ<1Zn(x,y)exp(x2+y22σF2)dxdy!0.1395<ρ<1exp(x2+y22σF2)dxdy,\begin{equation} \frac{1}{4} \sum_{d\,=\,1}^4 Z^n(x_{\rm d},y_{\rm d}) = \frac{ \iint \limits_{0.1395\,<\,\rho\,<\,1} \! Z^n(x,y) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy }{ \iint \limits_{0.1395\,<\,\rho\,<\,1} \! \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy } , \end{equation}(34)in the approximation of low-amplitude phase errors.

The choice made for GRAVITY was to position the metrology sensors to ensure that piston, tip, tilt, and defocus are canceled by the metrology. For example, the fourth polynomial, Z4(x,y)=23(x2+y2)3\hbox{$Z_4(x,y)=2\sqrt 3(x^2+y^2)-\sqrt 3$} (defocus, Noll 1976), puts a constraint on the distance between the center of the pupil to the position of the sensors: ρd2=ρ=0.139512πρ3exp(ρ22σF2)dρρ=0.139512πρexp(ρ22σF2)dρ\begin{equation} \rho_{\rm d}^2 =\frac{ \int_{\rho\,=\,0.1395}^1 \! 2\pi \rho^3\exp\left(-\frac{\rho^2}{2\sigma_{\rm F}^2}\right) \, \mathrm d\rho }{ \int_{\rho\,=\,0.1395}^1 \! 2\pi \rho \exp\left(-\frac{\rho^2}{2\sigma_{\rm F}^2}\right) \, \mathrm d\rho } \end{equation}(35)gives ρd = 0.644573 in unit of pupil radius5. However, depending on the position of the metrology sensor, the influence of the other Zernike polynomials in the optical train can be amplified. The idea for GRAVITY was to cancel the aberrations that can be caused by a simple displacement of the single-mode fibers in the focal plane. The influence of the higher order polynomials is investigated numerically in Sect. 6.4.1.

5.5. Polarization in GRAVITY

Much of the preceding discussion has revolved around the spatial properties of the GRAVITY instrument associated with the VLTI optical train, their differential effects on science and reference stars, and the metrology light. The discussion would not be complete without also addressing the polarization states of light. The combined optical train of the VLTI and the GRAVITY instrument is birefringent. This causes phase errors in two ways. First, the differential phase between the metrology beams for the science and reference stars is impacted (in any telescope) if they do not enter the optical train with identical polarization orientation. For the expected magnitude of birefringence, this phase error can be made negligible with an easily achievable alignment accuracy. A second, more subtle effect, results from the combination of a) differential birefringence between optical trains and b) different polarization states of the science and reference starlights. In fact, the light from gas orbiting the galactic center black hole (a prime science target for GRAVITY) is known to be partially linearly polarized.

Table 1

GRAVITY astrometric error budget (between UTs, for a single baseline).

A detailed account is beyond the scope of the present paper, but will be given in a forthcoming publication (Lazareff et al., in prep.). We plan to operate with the assumption of an unpolarized reference star, and a partially linearly polarized science target. Under that condition, fringes detected in a linear polarization (at output) perpendicular to either eigenvector are immune from the phase error. Then, the operation can be summarized as follows: i) define (in our particular context) eigenvectors of the optical train as linear polarizations (at input) that result in linear polarization at output. ii) Rotate the polarization controllers of GRAVITY to realize the orthogonality condition. Because the birefringence of each mirror train is a function of the pointing parameters, this alignment condition will be determined in situ within a short time interval from the actual observation or according to polarization modeling acquired during dedicated campaigns.

6. Astrometric error budget

The following sections derive GRAVITY’s astrometric error budget from the expression of the astrometric measurement established from Eqs. (30) and (31). The goal is to estimate the bias that may affect the measurement of sp. Section 6.1 considers the imprecisions on the BNAB baseline vector. Section 6.2 focuses on the combined effect of the tip-tilt error (the mss and mpp terms) with a pupil-matching error (0IM00\hbox{$\overrightarrow{\vec { IM}} \neq \overrightarrow{ 0}$}). Section 6.3 quantifies the errors brought by the dispersion in the interferometer. (The D(λm,λs) value.) Last, in Sect. 6.4, the errors caused by the high-order aberrations are computed: δΦstellar and δΦmet.

6.1. NAB determination in celestial coordinates

That the limit points of the metrology are not at the same position as the pivot points (0LM00\hbox{$\overrightarrow{\vec { LM}} \ne \overrightarrow{ 0}$} in Eq. (21)) mostly raises a problem of reference systems. The WAB baseline is defined in terrestrial coordinates and fixed in this frame of reference. On the other hand, the narrow angle baseline is defined in a different frame system: the telescope coordinates. Finally, the stellar object, (sp), is set in celestial coordinates.

The effect of the NAB on the OPD measurement appears in the first component of Eq. (20) : (sp).(BWAB+0M1L10M2L2)\hbox{$(\vec s - \vec p) . (\vec{B}_{\rm WAB} + \overrightarrow{\vec{M}_1\vec{L}_1} - \overrightarrow{\vec{M}_2\vec{L}_2})$}. Uncertainties come from

  • how well the 0ML\hbox{$\overrightarrow{\vec { ML}}$} vectors are known in terrestrial coordinates,

  • how well the BWAB is known in terrestrial coordinates,

  • how well the terrestrial coordinates are known with respect to sideral coordinates.

The first item is an issue of telescope pointing, telescope bending, telescope dilatation. The relative orientation between telescope and terrestrial coordinates can be obtained to high precision (and high repeatability) thanks to telescope encoders. The physical and optical pointing of the telescope are not necessarily the same. What matters for determining the 0LM\hbox{$\overrightarrow{\vec { LM}}$} vector in terrestrial coordinates is the physical pointing of the telescope. It is expected to have a reproducibility of roughly 10 arcsec. As a result, for GRAVITY, the error caused by the projection of the 10 m distance between the two limit points is on the order of 10 m × 10″ = 0.5 mm.

The second item is an issue of stability and measurement. Over time, the WAB can be averaged to a very high precision, if stable enough. The stability is a structural issue, inherent to the infrastructure of the interferometer. It is expected to be stable at the sub-millimeter level (except perhaps in the event of a big earthquake). In the error budget, we have assumed an uncertainty of 0.5 mm in the WAB.

The relative orientation between the reference frame of the stellar target and the terrestrial coordinates depends on i) the knowledge of the target coordinates and ii) the local stellar time (LST). We assume that time accuracy will be the limiting factor. Not because the LST is not precisely known, but simply because integration times on the science camera are several seconds. Putting a precise time stamp on the data will therefore be difficult, especially since it would have to be weighted by injection coefficients during the exposure. We estimate our uncertainties to 1/15 s, meaning 1″ on sky, knowing that the earth rotate by 15 arcsec per second. The error caused by an arcsec on the projection of a 100 m baseline is approximately 0.5 mm (decreasing with the proximity to the zenith).

With respect to a 100 m baseline, a 0.5 mm error in baseline will correspond to an error in the astrometric accuracy of 5 ppm, hence 5 μas for two stars separated by 1″. The NAB indetermination factors are summarized in Table 1.

6.2. IMB to NAB mismatch

The IMB has an effect on the astrometric measurement only if i) the metrology beam is not exactly pointing toward the stellar source (ms ≠ 0) and if ii) the limit points of the IMB and the NAB do not agree (0IM00\hbox{$\overrightarrow{\vec{IM}}\neq \overrightarrow{ 0}$}). The effect on the OPD measurements is an additional term for each telescope: (miSs)·0IiSMS(miPp)·0IiPMP\hbox{$(\vec {m}_{\it i\rm S} - \vec{s}) \cdot \overrightarrow{\vec {I}_{\it i\rm S}\vec{M}_{\rm S}} - (\vec {m}_{\it i\rm P} - \vec{p}) \cdot \overrightarrow{\vec {I}_{\it i\rm P}\vec{M}_{\rm P}}$}, with i the telescope number. Although it is technically possible to model its influence, owing to the complexity of this term, it is better to try to nullify it. This is achieved in GRAVITY thanks to the pupil-tracking camera mentioned in Sect. 5.3. The residuals are taken into account in the error budget.

The pupil-tracking camera will track the pupil both on the longitudinal and lateral directions. But owing to its design, the longitudinal accuracy is considerably lower than for the lateral. With the pupil tracking activated, the pupil vector 0IM\hbox{$\overrightarrow{\vec {IM}}$} is expected to be within 10 km in the longitudinal and 4 cm in the lateral direction (z = 0 ± 10 km and | x,y | = 0 ± 4 cm).

In the GRAVITY design (as shown in Fig. 5) the metrology beam is injected inside the IO beam combiner. As a result, by optimizing the injection of the stellar flux inside the fibers, GRAVITY will also make sure that the metrology beam has a propagation direction nearly parallel to the stellar light (but with opposite direction), so for optimum injection, ms = 0. Practically speaking, the GRAVITY acquisition camera should stabilize the injection during observation enough to be able to reach | ms | ≤ 0.2λ/D, or ≈ 10 mas at λ = 2.2μm.

We can split the OPD error induced by the IMB to NAB mismatch into two terms: longitudinal and lateral. The lateral contribution is the 4 cm of the pupil displacement projected toward the normalized (ms) vector: sin(10 mas) × 4 cm= 2 nm. The longitudinal contribution will be [1 − cos(10 mas)] × 10 km= 0.01 nm. Related to the total OPD expected between two target separated by 1′′ observed with a baseline of 100 m, it means that the IMB to NAB mismatching will result for GRAVITY in an additional astrometric error of 4 μas. To account for each of the four terms, this value is multiplied by 4\hbox{$\sqrt 4$} in Table 1.

6.3. Dispersion

According to Eq. (29) the dispersion term writes as D(λm,λs)=Δair·(nairλmnairλs)+Δfiber·(nfiberλmnfiberλs),\begin{equation} D(\lambda_{\rm m},\lambda_{\rm s})= \Delta_{\rm air} \cdot(n^{\lambda_{\rm m}}_{\rm air}-n^{\lambda_{\rm s}}_{\rm air}) + \Delta_{\rm fiber} \cdot (n^{\lambda_{\rm m}}_{\rm fiber}-n^{\lambda_{\rm s}}_{\rm fiber}) , \end{equation}(36)where Δair and Δfiber are the differential delays in air and glass, respectively. Ideally, the differential path is entirely compensated for in fibers: Δair ≈ 0 and ΔfiberδOPDM. This is not exactly the case owing to the non-common path in air or vacuum. The main errors come from uncertainties in the physical length of fiber (Δfiber) and of air (Δair). But uncertainties in the knowledge of the refractive indices (nλmnλs) can also be a problem, since they depend on environmental conditions.

Following the Owens (1967) formula, we computed the dispersion of air for standard atmospheric conditions. It gives nair1.9μmnair2.2μm=8×10-8,\begin{equation} n^{1.9~\rm \mu m}_{\rm air}-n^{2.2 ~\rm\mu m}_{\rm air}= 8\times 10^{-8} , \end{equation}(37)hence a dispersion effect of 80 pm for Δair = 1 mm. This is a conservative upper limit, since the two stellar beams share the same optics up to the cryostat. (The star separator is inside the cryostat.) In the data reduction software, air dispersion will therefore be neglected.

The fluoride fibers are, however, a lot more dispersive (Coudé du Foresto et al. 1995). Thus, their effect cannot be neglected. Because the fibers are inside the cryostat, we assume that the dispersion in the fibers will not change and that it can be precisely modeled. First laboratory results show dispersion coefficients on the order of nfiber1.9μmnfiber2.2μm=2×10-4.\begin{equation} n^{1.9 ~\rm \mu m}_{\rm fiber}-n^{2.2~\rm \mu m}_{\rm fiber}= 2\times 10^{-4} . \end{equation}(38)This is in accordance with Coudé du Foresto et al. (1995). For a physical length Δfiber = 1 mm, the correction term to the OPD is therefore on the order of 200 nm. With a knowledge of the fiber stretching of the differential delay lines of σfiber) = 10 μm, we can model the D(λm,λs) to a precision of 2 nm.

6.4. Optical aberrations

These aberrations can be split into two distinct categories:

  • The common-path aberrations that affect both stellar signals.This is the case of atmospheric perturbations, partially correctedby the AO on the UTs.

  • The non-common path aberrations that affect only one of the stars. This is typically a consequence of a beam-walk effect, when the two stellar beams do not have the same imprint on the optical surfaces.

We assume that the metrology will monitor the non-common path optical aberrations, but not the common-path aberrations. We neglect isoplanetism as being beyond the scope of this paper6.

6.4.1. Non-common path aberrations – beam-walk defects

The goal of the metrology is to monitor and account for the non-common optical path length between the two beam combiners. By construction, the metrology laser does perfectly probe any piston induced in the optical train. Higher orders are not necessarily measured as well. To simulate the effect of non-common path aberrations, we consider the influence of Zernike polynomial (n) individually, affecting only one beam of the metrology (ks2n=kp1n=kp2n=0\hbox{$k^n_{\rm s2}=k^n_{\rm p1}=k^n_{\rm p2}=0$}). Then, δΦmet=14d=14ks1nZn(xd,yd).\begin{equation} \delta \Phi_{\rm met}=\frac{1}{4} \sum_{d=1}^4 \! k^n_{\rm s1} \ Z^n(x_{\rm d},y_{\rm d}) . \end{equation}(39)The same aberration also affects the stellar light, but with a negative sign, Es1(x,y)=exp(ıks1nZn(x,y)),\begin{equation} E_{\rm s1}(x,y)=\exp (-\imath\, k^n_{\rm s1} Z^n(x,y)), \end{equation}(40)hence δΦstellar=arg(pupilexp(ıks1nZn(x,y))exp(x2+y22σF2)dxdy)\begin{equation} \delta \Phi_{\rm stellar}\!=\!\arg \left( \iint \limits_{\rm pupil} \! \! \exp (-\imath\, k^n_{\rm s1} Z^n(x,y)) \exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2}\right) \, \mathrm dx \mathrm dy \! \right) \end{equation}(41)giving an OPD error according to Eq. (31) of (δΦstellarλs2π+δΦmetλm2π)\hbox{$(\delta\Phi_{\rm stellar}\frac{\lambda_{\rm s}}{2\pi} +\delta \Phi_{\rm met}\frac{\lambda_{\rm m}}{2\pi})$} in nanometers.

The amplitude of the non-common path aberrations were measured during two test campaigns at Paranal in August and October 2010. They were used to validate the metrology principle. Two coherent beams were overlapped at an angle corresponding to the angular separation of two stars in the field of view of GRAVITY. The two planar waves interfere to create a characteristic fringe pattern inside the VLTI lab. This fringe pattern was projected into the VLTI, including beam expanders and delay lines, onto the pupil of UT4. Aberrations were measured on the M2 mirror from the scattered light, observed with a commercial IR camera mounted to the telescope structure. We found that deviations occurring at the edge of the pupil are smaller than 8 nm/2 nm for a time base of 30 min (for quadratic and forth order respectively, negligible above).

Table 2

Error caused by non-common path aberrations.

Table 2 are summarizes the errors on the OPD measurement for each Zernike polynomial up to the seventh order (Zernike # ≤36). Owing to the point symmetry of the fundamental mode of the fibers, as well as of the metrology sensor, most of the polynomials do not bias the measurement. Only the ones affecting the phase are shown in Table 2. The variance considered for each one of these Zernike is 8 nm for the quadratic order, 2 nm for the fourth order, and 1 nm for the sixth order. The quadratic sum of all these terms is shown in the last row, which must be multiplied by 2\hbox{$\sqrt 2$} to account for errors induced by the two arms of the interferometer.

6.4.2. Common-path aberrations – atmospheric perturbations

The atmosphere perturbations affect the phase of the electromagnetic field coming from the stars. If we neglect the influence of anisoplanetism, we can consider a similar electric field coming from both stars Ep(x,y) ∝ Es(x,y) ∝ E(x,y). The atmospheric perturbations are therefore common-path aberrations. They are not probed by the metrology (δΦmet = 0) and affect the optical path difference as δΦstellar=arg(pupilE(x,y)EFp(x,y)dxdy)arg(pupilE(x,y)EFs(x,y)dxdy)\begin{eqnarray} \delta \Phi_{\rm stellar} &= & \arg \left( \iint \limits_{\rm pupil} \! E(x,y) E^*_{\rm Fp}(x,y) \, \mathrm dx \mathrm dy \right) \nonumber \\ && - \arg \left( \iint \limits_{\rm pupil} \! E(x,y) E^*_{\rm Fs}(x,y) \, \mathrm dx \mathrm dy \right) \end{eqnarray}(42)where EFp(x,y)\hbox{$E^*_{\rm Fp}(x,y)$} and EFs(x,y)\hbox{$ E^*_{\rm Fs}(x,y)$} are the accepted mode of the fibers. The effect of atmospheric turbulence is computed here for one telescope alone, since the atmosphere on the two telescopes can be considered as uncorrelated.

In the absence of any non-common path aberrations (EFp(x,y)=EFs(x,y)\hbox{$ E^*_{\rm Fp}(x,y) = E^*_{\rm Fs}(x,y) $}), common path aberrations do not change the measured optical path length: δΦstellar = 0. It is therefore necessary to combine them with some level of field and/or pupil offset. Considering a field offset, i.e., a tip-tilt error, we used the following equations to simulate the acceptance fields: EFp(x,y)=exp(2πıΔp·xλsx2+y22σF2)EFs(x,y)=exp(x2+y22σF2)\begin{eqnarray} E_{\rm Fp}(x,y) &=&\exp\left(2\pi \imath \frac{\vec { \Delta p} \cdot x}{\lambda_{\rm s}}-\frac{x^2+y^2}{2\sigma_{\rm F}^2} \right)\\ E_{\rm Fs}(x,y) &=&\exp\left(-\frac{x^2+y^2}{2\sigma_{\rm F}^2} \right) \end{eqnarray}where Δp is the tip-tilt error. Such a tip-tilt error will necessarily be present because the alignment of the fiber cannot be perfectly on the star. For E(x,y), we used the YAO AO simulator7 to generate arrays of phase masks for three different conditions: i) observation with the UTs with the AO locked on a Kmag = 7 off-axis reference star; ii) observation with the UTs without AO; and iii) observation with the ATs8. The seeing conditions are average (r0 = 25 cm at λ = 0.5 μm). The results are presented in Table 3 for tip-tilt errors from 0 to 0.5λ/D.

Table 3

Error caused by atmospheric perturbations.

The OPD errors average with time. However, the coherence time of the common path aberrations depends highly on the seeing conditions, wind speed, etc. A rough, conservative solution is to consider a coherence time of 100 ms, hence a reduction by a factor 10 for a 10 s observation. In the case of the 0.2λ/D stated in Sect. 6.2, it means a 0.9 nm OPD error on the UTs and 12.3 nm error on the ATs. A factor 2\hbox{$\sqrt 2$} is needed to account for the atmospheric perturbations on each telescope. This conservative value is reported in Table 1.

It is important to note that the fiber positioning (responsible for the tip-tilt error) is crucial to an operation without AO. On the ATs, if the alignment is below 0.5λ/D, the phase measurement will wrap over 2π, preventing averaging of the error, and thus any realiable astrometric

7. Discussion and conclusion

7.1. Fifteen years of experimentation

The predominantly theoretical exercise of this paper is supported by 15 years of experimentation carried out on various interferometers. The first astrometric observations with a long baseline interferometer were achieved around 1999 on the Palomar Testbed Interferometer (Colavita et al. 1999), based on the theoretical work of Shao & Colavita (1992). Using small siderostats on 100-m-long baselines, the instrument reached 160 μas precision measurements on bright pairs (Shao et al. 1999). PTI was a technology demonstrator in preparation for the four 1.8 m telescopes planned to be added to the Keck Interferometer. The Keck Outrigger astrometry project was the first to tackle the issue of implementing astrometric baselines on large telescopes. The concept foresees transfer of the metrology end point to primary space by a dedicated baseline monitor (Hrynevych et al. 2004). However, the project was cancelled in 2006 before seeing first light. Started in 2008, and even though it never managed an on-sky astrometric demonstration, the ASTRA project of astrometric extension of the two ten-meter Keck Interferometer was the occasion to study the implementation of metrologies further for astrometric interferometer on large telescopes.

The design of PRIMA (Delplancke 2008), the VLTI equivalent to ASTRA, and the Keck Outriggers started in 2000. In 2011, during its first on-sky observations, PRIMA achieved an astrometric precision of 30 μas, but the astrometric accuracy was limited to about 3 mas, mainly because the metrology end points were located at the telescopes Coude foci, far from primary space (Sahlmann et al. 2013). A second on-sky demonstration run in 2013, with the metrology extended to the telescope secondary mirror, resulted in an astrometric accuracy of about 100 μas.

The GRAVITY metrology (Gillessen et al. 2012) follows a slightly different concept than the previous experiments. Instead of measuring the optical path difference in double pass between two telescopes for each star, as done for PRIMA and ASTRA, the GRAVITY metrology measures the optical path difference between the two beams for each telescope in single pass. And other than in PRIMA and ASTRA, the GRAVITY metrology end points are mounted above the primary mirror on the telescope spider, therefore covering the full optical path and providing a physical realization of the narrow-angle baseline endpoints. A second laser is sent back from the telescope to actively track the pupil on the metrology endpoints (Amorim et al. 2012).

7.2. On the GRAVITY astrometric error budget

A sophisticated error budget was a central design driver for GRAVITY. The idea is to cancel as many error terms as possible thanks to technical choices. For example, the metrology is sent up to the spider arms, above M1. It is located in geographical coordinates to minimize error when transposing the physical narrow-angle baseline to the sidereal reference frame (Sect. 6.1). The metrology is also sent by the same single-mode fibers that accept the star light. The difference in propagation direction between the metrology and the starlight is therefore minimized (Sect. 6.2).

The necessity of active control over the pupils (the imaging baseline limit points) is one of the main requirements given by the error budget. The imaging baselines introduce an error term if the instrument pupil and focal plane are not correctly stabilized to a given position (Sect. 3.3). This cannot be canceled out by technical design, so GRAVITY includes a pupil and field

tracking system inside its own cryostat. This system controls the position of the instrument pupil in real time with respect to four laser beacons on the spider arms of the telescopes. It also monitors the position of the stellar objects to keep them accurately positioned (within 0.2λ/D) on the fibers.

Field tracking is also fundamental to minimize the influence of common-path aberrations. At first sight, common path aberrations (e.g., atmospheric perturbations) do not matter. This is not the case if there is an additional error on the field tracking: it is the cross-term between field error and atmospheric perturbation that matters. In Sect. 6.4.2, we showed that atmospheric perturbations have the potential to be the major limit of the astrometric precision, depending on the level of star tracking. Moreover, we show that GRAVITY’s accuracy will depend on the level of AO correction. Notably, the absence of AO on the ATs does put a stringent requirement on the affordable tip-tilt errors for the system to work (see Table 3).

The main result of the error budget is that GRAVITY should fulfill its objective of 10 μas accuracy on the UTs if i) the telescope pupils are positioned within the instrument to an accuracy of 0.5% of their diameter (4 cm on M1) and ii) the two stellar objects are tracked within 0.2 λ/D.


1

We assume here that no atmospheric perturbations are present.

2

In the absence of dispersion, the central fringe is also the white light fringe, i.e. fringe with maximum contrast.

3

The demonstration for this is in the appendix of Woillez & Lacour (2013). Basically: from the imaging baseline perspective, any angular misalignment (star light versus metrology path coaligned with fringe sensor lobe) is interpretable as an external misalignment, leaving the internal metrology versus star light optical path identical.

4

z = 10 km in the 8 m beam is equivalent to z = 1 m in the 80 mm beam in the VLTI laboratory.

5

ρd = σF, a consequence of injection optimisation.

6

For PRIMA, analytical models predict a precision of 10 μas for one hour of observations between two targets separated by 10 arcsec (Sahlmann et al. 2013).

8

As yet, the ATs are not equipped with an AO system.

Acknowledgments

S.L. would like to thank J.D. Monnier for fruitful discussions about the impact of common-path aberrations.

References

  1. Amorim, A., Lima, J., Anugu, N., et al. 2012, in SPIE Conf. Ser., 8445, 844534 [Google Scholar]
  2. Colavita, M. M. 2009, New Astron. Rev., 53, 344 [NASA ADS] [CrossRef] [Google Scholar]
  3. Colavita, M. M., Wallace, J. K., Hines, B. E., et al. 1999, ApJ, 510, 505 [NASA ADS] [CrossRef] [Google Scholar]
  4. Coudé du Foresto, V., Perrin, G., & Boccas, M. 1995, A&A, 293, 278 [NASA ADS] [Google Scholar]
  5. Delplancke, F. 2008, New Astron. Rev., 52, 199 [NASA ADS] [CrossRef] [Google Scholar]
  6. Eisenhauer, F., Perrin, G., Brandner, W., et al. 2011, The Messenger, 143, 16 [NASA ADS] [Google Scholar]
  7. Finger, G., Baker, I., Alvarez, D., et al. 2012, in SPIE Conf. Ser., 8453, 84530T [Google Scholar]
  8. Gillessen, S., Lippa, M., Eisenhauer, F., et al. 2012, in SPIE Conf. Ser., 8445, 84451O [Google Scholar]
  9. Haug, M., Haussmann, F., Kellner, S., et al. 2012, in SPIE Conf. Ser., 8445, 84452V [Google Scholar]
  10. Hill, J. M., Green, R. F., & Slagle, J. H. 2006, in SPIE Conf. Ser., 6267, 62670Y [Google Scholar]
  11. Hrynevych, M. A., Ligon, III, E. R., & Colavita, M. M. 2004, in SPIE Conf. Ser. 5491, ed. W. A. Traub, 1649 [Google Scholar]
  12. Jocou, L., Perraut, K., Nolot, A., et al. 2012, in SPIE Conf. Ser., 8445, 84452X [Google Scholar]
  13. Lawson, P. R. 2000, Principles of Long Baseline Stellar Interferometry (Pasadena: NASA, JPL, CIT) [Google Scholar]
  14. Le Bouquin, J.-B., Berger, J.-P., Lazareff, B., et al. 2011, A&A, 535, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  15. Noll, R. J. 1976, J. Opt. Soc. America (1917−1983), 66, 207 [Google Scholar]
  16. Owens, J. C. 1967, Appl. Opt., 6, 51 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  17. Petrov, R. G., Malbet, F., Weigelt, G., et al. 2007, A&A, 464, 1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Pfuhl, O., Haug, M., Eisenhauer, F., et al. 2012, in SPIE Conf. Ser., 8445, 84451U [Google Scholar]
  19. Robieux, J. 1959, IEEE Transactions on Antennas and Propagation, 7, 118 [NASA ADS] [CrossRef] [Google Scholar]
  20. Sahlmann, J., Henning, T., Queloz, D., et al. 2013, A&A, 551, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Shaklan, S., & Roddier, F. 1988, Appl. Opt., 27, 2334 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  22. Shao, M., & Colavita, M. M. 1992, A&A, 262, 353 [NASA ADS] [Google Scholar]
  23. Shao, M., Boden, A. F., Colavita, M. M., et al. 1999, BAAS, 31, 1504 [NASA ADS] [Google Scholar]
  24. Straubmeier, C., Fischer, S., Araujo-Hauck, C., et al. 2012, in SPIE Conf. Ser., 8445, 84452R [Google Scholar]
  25. Tuthill, P., Lacour, S., Amico, P., et al. 2010, in SPIE Conf. Ser., 7735, 77351O [Google Scholar]
  26. Woillez, J., & Lacour, S. 2013, ApJ, 764, 109 [NASA ADS] [CrossRef] [Google Scholar]
  27. Woillez, J., Akeson, R., Colavita, M., et al. 2010, in SPIE Conf. Ser., 7734, 773412 [Google Scholar]

All Tables

Table 1

GRAVITY astrometric error budget (between UTs, for a single baseline).

Table 2

Error caused by non-common path aberrations.

Table 3

Error caused by atmospheric perturbations.

All Figures

thumbnail Fig. 1

The baseline paradox. Consider a thought experiment where an interferometer is equipped with two masks that rotate in the pupil of both telescopes. By doing so, the distance between the two apertures changes with time while the telescopes stay firm. In the left panel, the observation is done on a single target, acquired on the axis of the telescopes (perpendicular to the plane of the aperture). The OPD measurement is done between the white-light fringe and the zero reference of the interferometer. The OPD does not change with the rotation of the masks. In the right panel, the interferometer is recording two fringe packets obtained by observing two stars in the same field. The difference in OPD is measured between the two white-light fringes. δOPD does change with the rotation of the masks.

In the text
thumbnail Fig. 2

Schematic view of a single beam interferometer. The light paths are represented by light rays. The real light rays are reflected by mirrors up to the beam combiner inside the interferometric lab. However, in this simplified representation, all the mirrors are removed. Thus, the represented virtual light rays are straight lines from the star to the virtual images of the beam combiner. The mechanical structure of the interferometer is reduced to two pivot points, L1 and L2 which are the pivot points of the telescopes. The wide angle baseline (WAB) is represented as a vector between the two pivot points: BWAB=0L1L2\hbox{$\vec {B}_{\rm WAB} = \protect \overrightarrow{\vec {L}_1\vec{L}_2}$}. The OPD measurement is made between the beam combiners and the metrology end point. It is related to the stellar position through Eq. (8).

In the text
thumbnail Fig. 3

Schematic view of an astrometric interferometer. The light paths are virtual, as seen from the star, and all mirrors are removed, resulting in a straight line from the star to the beam combiner. Only the wide angle baseline (WAB) is represented, but all the baselines limit points are represented. L1 and L2 are the pivot points of the telescopes: BWAB=0L1L2\hbox{$\vec {B}_{\rm WAB} = \protect \overrightarrow{\vec {L}_1\vec{L}_2}$}. The M1 and M2 points are the end points of the metrology: BNAB=0M1M2\hbox{$\vec {B}_{\rm NAB} =\protect \overrightarrow{\vec{M}_1\vec{M}_2}$}. Last, IP1, IP2, IS1 and IS2 are the barycenters of the effective pupils of each beam combiners: BIMBS=0I1SI2S\hbox{$\vec {B}_{\rm IMB-S} = \protect \overrightarrow{\vec {I}_{\rm 1S}\vec{I}_{\rm 2S}}$} for the first combiner and BIMBP=0I1PI2P\hbox{$\vec{B}_{\rm IMB-P} =\protect \overrightarrow{\vec{I}_{\rm 1P}\vec{I}_{\rm 2P}}$} for the second. Upper panel: the metrology measurement is made between the beam combiners and the metrology end point. The end points are different from the pivot points, which justify in Eq. (13) the use of the narrow angle baseline instead of the WAB. Lower panel: full schematic representation of the interferometer, including the presence of the imaging limit points. Imaging limit points are necessary to account for error terms due to difference in the propagation direction of the metrology (m1S, m2S, m1P and m2P) with respect to the stellar beams (s and p). It introduces additional error terms that are stated in Eq. (21).

In the text
thumbnail Fig. 4

GRAVITY active subsystems involving the astrometric measurement. In red is the optical path of the reference star, in blue the optical path of the science star. The dotted lines are free space-light propagation, and the solid lines correspond to the single-mode fibers. The reference star is used to correct the atmospheric piston in real time by activating the internal piezo and/or the VLTI delay lines. In green is represented the metrology system, injected into the two integrated optics beam combiners (IO), and measured thanks to 4 metrology diodes inside the cryostat, and 4 metrology diodes on each telescope. The metrology measurements are used to control the GRAVITY differential delay lines to position the fringes on the science camera. Yellow represents the acquisition camera control loop that commands the tip-tilt and the pupil position. Orange is the open loop polarization control system.

In the text
thumbnail Fig. 5

Baseline representation for a two-telescope interferometer according to the GRAVITY concept. The baselines are defined by their limit points. The WAB is defined by the two pivot points (which defines the center of the telescopes reference frames). The NAB is defined by the geometrical average of the position of the four metrology photodiodes situated on the spider arms. The coordinate of the NAB limit points are (x, y, z) = (0, 0, 10 m). The two imaging baselines IMB1 and IMB2 (one for each beam combiner) have limit points that correspond to the pupil position of their respective beam combiners, weighted laterally by the telescope aperture, as well as the electric-field acceptance of the single-mode fibers. Internal pupil tracking inside the GRAVITY instrument will track the spider arms to place the IMB limit points on the NAB limit point. Expected lateral position will be |x,y| = 0 ± 4 cm. Expected longitudinal position will be z = 0 ± 10 km.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.