1 - Departament d'Astronomia i Meteorologia, Universitat de Barcelona, Av. Diagonal 647, 08028 Barcelona, Spain
2 - Lohrmann Observatory, Dresden Technical University, Mommsenstr. 13, 01062 Dresden, Germany

Abstract
Context. High accuracy astrometric instruments like Gaia aiming at an accuracy of 1 microarcsecond cannot be considered as point-like observers in the framework of relativistic modelling of observable quantities.
Aims. Special-relativistic effects on the imaging by a non-point-like arbitrarily moving optical instrument are discussed.
Methods. A special-relativistic reflection law for a mirror of arbitrary shape and motion is derived in the limit of geometrical optics. The aberration patterns are computed with ray tracing using a full special-relativistic model for two simple rotating optical instruments.
Results. The effect of special-relativistic reflection law on the photocenters of the aberration patterns of an optical system rotating with a moderate angular velocity of $60 \hbox{$^{\prime\prime}$ }/{\rm s}$ may be at the level of 1 microarcsecond if the system involves mirrors significantly inclined relative to the optical axis.
Conclusions. Special-relativistic optical modelling of future astrometric instruments is indispensable if a level of a few microarcseconds is envisaged.

We investigate possible relativistic effects on the imaging of an optical system with arbitrary motion. In the framework of relativity one usually considers point-like observers. The methods to calculate observed quantities for such observers are well known. It is common to assume that the actual instrumentation of the observer is so small that one considers the positions and velocities of each part of the instrument to be the same (and that single position and velocity is called the position and velocity of the observer). In reality even for an Earth-based telescope the velocities of different parts of the primary mirror in inertial coordinates (not rotating with the Earth) are slightly different. However, in the past the accuracy of observations was considered to be "too low'' and the size of the mirror "too small'' for those differences to be of practical relevance.

Due to recent technical developments especially for astrometric space missions like Gaia (Bienaymé & Turon 2002; Perryman et al. 2001; de Boer et al. 2000), JASMINE (Gouda et al. 2002) and SIM (Shao 1998) the situation has changed. In the case of Gaia, we deal with a scanning satellite which permanently rotates in space with a period of 6 h. The size of the primary mirror of Gaia is 1.4 m, comparable with the size of the spacecraft itself. The envisaged best accuracy of Gaia is a few $\mu$ as (and can be even below that limit in some favorable cases). Therefore, one cannot neglect a priori the difference of velocities of various parts of the instruments. It is our purpose to investigate these effects and estimate their magnitude for Gaia.

The general-relativistic model for Gaia has been formulated by Klioner (2004,2003). The model uses two principal relativistic reference systems: (1) the Barycentric Celestial Reference System (BCRS) and (2) the Center of Mass Reference System (CoMRS) of the satellite. The former is a global reference system with its origin at the barycenter of the solar system. It has been recommended by the International Astronomical Union for relativistic modelling of high-accuracy astronomical observations (Soffel et al. 2003). This reference system is used to model the dynamics of massive bodies, space vehicles (e.g., the Gaia satellite) and light rays within the Solar system. The final Gaia catalogue will contain coordinates of celestial objects in the BCRS. The CoMRS is the local relativistic reference system of the satellite. The theory of such local reference systems was laid down by Ni & Zimmermann (1978) and then elaborated by Klioner & Voinov (1993) and Klioner (2004). The gravitational influence of massive bodies is reduced in the CoMRS as much as possible and, according to the equivalence principle, is represented by tidal potentials. The CoMRS has its origin in the center of mass of the satellite and is kinematically non-rotating with respect to the BCRS. The CoMRS is physically adequate to model phenomena occurring in the immediate neighborhood of the satellite: attitude, the process of observation, etc. According to Klioner (2004) the metric tensor of the CoMRS differs from the Minkowski metric in three kinds of terms (the gravitational field of the satellite is too small and can be neglected safely): an inertial term due to non-gravitational accelerations of the satellite (for Gaia these accelerations can be relatively large during orbital maneuvers and only about $2\times10^{-13}~{\rm m}/{\rm s}^2$ in between, mainly due to solar pressure); an inertial term due to the slow rotation of the CoMRS relative to the co-moving Fermi-Walker transported locally inertial reference system (with an angular velocity of $\sim$ $3\times10^{-15}~{\rm s}^{-1}=2\hbox{$^{\prime\prime}$ }~{\rm per\ century}$ ); and tidal gravitational potentials (producing relative accelerations of at most $10^{-12}~{\rm m}/{\rm s}^2$ at a distance of 2.5 m from the satellite's center of mass). Simple calculations show that all these terms influence the CoMRS light propagation within a few meters from the satellite's center of mass at a level much lower than the goal accuracy of 1 $\mu$ as. Therefore, all these terms can be neglected for our purposes and one can consider the CoMRS for a sufficiently small interval of time as an inertial reference system of Special Relativity.

In Sect. 2 we summarize how to calculate the special-relativistic effects in the aberration patterns due to the rotation of the instrument. Section 3 is devoted to a description of ray tracing calculations of the relativistic effects in the aberration patterns for two simple optical systems. The details of the derivation of the special-relativistic deflection law are given in the appendix. There we also introduce a general theoretical scheme we use to treat arbitrarily-shaped and arbitrarily moving mirrors in special relativity.

2 General scheme of computing relativistic effects due to the rotation of an optical system

Our goal is to discuss and calculate the influence of relativistic effects on the imaging by an optical instrument with some non-inertial motion. We simplify our goal in several directions: (1) we consider here the case of optical instruments consisting of mirrors only (no lenses are considered), (2) we do not consider the effects of wave optics and work in the approximation of geometric optics (see, however, the note at the end of Sect. 4).

For an optical system consisting solely of a number of arbitrarily moving mirrors, the most important relativistic effect is the special-relativistic modification of the reflection law. That modified special-relativistic reflection law will produce a change in aberration patterns as compared to the patterns calculated by using the usual reflection law (here and below by "usual reflection law'' we mean that the angles between the normal to the surface of the mirror and the incoming and reflected light ray are equal: $\alpha^\prime=\alpha$ in Fig. A.1). These perturbed aberration patterns could affect astrometric measurements based on an interpretation of the images obtained in the instrument's focal plane.

2.1 Reflection law

First, we formulate the general principles allowing one to calculate the aberration patterns within the framework of Special Relativity. Given a mirror of arbitrary shape in arbitrary motion (see Sect. A.2 for a formal mathematical description of such an arbitrary mirror and Sect. 2.2 for a discussion of such mirrors from the physical point of view) and a light ray hitting the surface of the mirror at a given point and moment of time, we calculate the parameters of the outgoing (reflected) light ray. The simplified problem of a flat mirror moving with a constant velocity perpendicular to its surface has been considered by Einstein (1905) in the first paper on Special Relativity Theory. In the appendix the most general case of this problem within Special Relativity is considered in great detail. Slightly modifying the arguments of Einstein (1905) we first use Lorentz transformations to transform from a laboratory inertial reference system (t,xⁱ) to an inertial reference system (T,X^a) instantaneously co-moving with the element of the mirror where the reflection of a particular light ray occurs, then apply the known reflection law in that reference system and transform the reflected light ray back into the laboratory reference system. The relation of that scheme to direct calculations involving Maxwell's equations is also discussed in the appendix. In our calculations we recover a number of known results for various particular cases. An overview of these known results and the corresponding comparison are also given. The main formula used in all the ray tracing calculations of Sect. 3 is the relativistic reflection law given by Eq. (A.44).

2.2 Arbitrarily shaped and moving mirrors

A very important point of the whole scheme is that the shapes of the mirrors in laboratory coordinates (t,xⁱ) and, possibly, the time-dependence of these shapes are assumed to be given. We describe the shape of each mirror by a two-parameter family of worldlines of each individual particle of the mirror denoted as $x_m^i(t;\xi,\eta)$ . Here $\xi$ and $\eta$ are two continuous parameters "numbering'' the particles that constitute the surface of the mirror. Clearly, for fixed values of $\xi$ and $\eta$ , function $x^i_m(t;\xi,\eta)$ represents the (t,xⁱ)-parametrization of the world line of the corresponding particle. For fixed t the same function $x^i_m(t;\xi,\eta)$ represents the instantaneous position and shape of the mirror in the $t={\rm const.}$ hyperplane of the coordinates (t,xⁱ). In this case ( $t={\rm const.}$ ) the parameters $\xi$ and $\eta$ give a non-degenerated two-dimensional coordinate chart on the surface of the mirror. We consider $x^i_m(t;\xi,\eta)$ to be differentiable with respect to $\xi$ and $\eta$ . This means that the coordinate representation of the surface is a smooth two-dimensional surface for each moment of coordinate time t.

In general there is no inertial reference system where the whole system or any of its mirrors is at rest. In the special cases when such an inertial rest-frame of a mirror does exist, one should consider the shape of the mirror in that rest-frame. In the practical cases considered below such rest-frames do not exist. Moreover, the size of the mirrors is so large that we cannot assume that the velocities of all points of the mirror are approximately constant in any inertial reference system.

We do not consider the question of deformations of the mirrors due to their non-inertial (for example, rotational) motion (i.e., the relation between the intended shapes of the mirrors during their manufacturing and their shapes, e.g., in a rotating satellite, in coordinates (t,xⁱ)). The behaviour of a mirror as a physical body is a separate question, a rigorous relativistic treatment of which would require at least a special-relativistic theory of elasticity. As long as the angular velocity is constant the deformations and special-relativistic effects on the shape (e.g. Lorentz contraction) are also constant. In this case a rigidly rotating mirror can be considered to be Born-rigid (Pauli 1958, Sect. 45). We can also argue that the constant deformations are assumed to be properly taken into account during manufacturing so that the rotating mirrors have the assumed forms. One may argue that the mirrors could be made active to retain the prescribed form (which is the case for many larger Earth-bound instruments, but may appear to be a rather bizarre argument in some other cases).

2.3 Observable aberration patterns

The last issue is the definition of the observing (imaging) device. In analogy to our representation of the mirrors we first define a coordinate "plane'' $x_f^i(t;\zeta,\chi)$ in laboratory coordinates (t,xⁱ) that coincides with the focal "plane'' of the instrument in the Newtonian case. In many cases (e.g. for the case considered in Sect. 3 below) $x_f^i(t;\zeta,\chi)$ can be taken to be a plane in the considered coordinates (that is, for any moment of time there exist nⁱ(t) independent of $\zeta$ and $\chi$ such that $\vec{x}_f\cdot\vec{n}=0$ ). The aberration patterns we calculate below are defined as the set of points at which the light rays from a source hit that coordinate focal plane at some moment $t=t_{\rm obs}={\rm const.}$ Generally speaking the aberration patterns cannot be considered as "infinitely small''. This means that there is no inertial coordinate system in which the part of the detector (that is, of the focal "plane'') registering an aberration pattern can be considered at rest.

If the patterns are "small enough'' (which is the typical case for reasonable high-quality optical instruments) one could introduce an inertial reference system $(\tau,\rho^i)$ instantaneously co-moving with some central point of the aberration pattern and define the "observable'' pattern as a set of points at which the light rays from a source hit that coordinate focal plane at some moment $\tau=\tau_{\rm obs}={\rm const.}$ (here one should also take into account the relativistic effects in spatial coordinates and correspondingly treat Lorentz contraction etc.). First, although this approach seems to be more adequate for non-inertial motion it still gives a coordinate-dependent picture because of finite extension of the patterns. Second, we have explicitly checked that this additional Lorentz boost does not influence any of the figures and numerical results given below.

Note that we are interesting in prediction of the changes in the aberration patterns compared to the prediction made for the "same'' optical device without rotation and using Newtonian geometric optics (this latter prediction is typically available from the manufacturers of the instrumentation). From this point of view, our definition of "observed'' aberration pattern is adequate. In more realistic case one has to model the process of observation in much more detail (e.g., CCD orientation and position within the instrument, CCD clocking, averaging, TDI mode etc.). Such a detailed modelling is however unnecessary for the purposes of this paper.

Summarizing, our aberration pattern modelling consists of (1) fixing the models of the mirrors $x^i_m(t;\xi,\eta)$ and the focal plane $x_f^i(t;\zeta,\chi)$ , and (2) tracing a grid of incoming light rays, which interact with the optical system only at the moments of reflection according to (A.44), until the point of intersection with the focal plane $x_f^i(t;\zeta,\chi)$ , and (3) forming the aberration pattern itself and/or calculating its photocenter.

3 Relativistic astrometric effects due to rotational motion of the satellite

In order to evaluate the relativistic effects in the aberration patterns of planned scanning astrometric instruments, we consider an extended optical system rotating rigidly with a constant angular velocity relative to the inertial reference system (t,xⁱ). For a scanning astrometric satellite the real angular velocity is not constant (e.g., because of the required scanning law), but its changes are small and slow, and will be neglected here. Rigid rotation of the optical instrument means that the whole instrument is at rest in a reference system (t,yⁱ) related to the inertial laboratory reference system (t,xⁱ) as $y^i=R^i_{\ j}~x^j$ , $R^i_{\ j}$ being an orthogonal (rotational) matrix.

To calculate the aberration patterns of several optical systems discussed below we have developed a numerical ray tracing code in Java allowing us to calculate aberration patterns for an arbitrary optical system rigidly rotating in our laboratory coordinates. Each mirror in the system can be individually shaped and oriented in those coordinates. The code allows us to control all intermediate calculations as well as the overall numerical accuracy.

Parameters of the optical systems (size of the mirrors, focal distance, distance of the primary mirror from the rotational axis and angular velocity) considered in Sects. 3.1 and 3.2 below are chosen to qualitatively represent some principal features of planned astrometric missions like Gaia (Perryman et al. 2001) or JASMINE (Gouda et al. 2002), where a scanning satellite comprising two astrometric telescopes continuously rotates with an angular velocity of $\Omega \sim 60 \hbox{$^{\prime\prime}$ }/{\rm s}$ .

3.1 A one-mirror optical system

The first optical system that we will study consists of one rotating parabolic mirror. A diagram of this optical system is given on Fig. 1. The parabolic mirror M₁ is a square mirror of size 1.5 m $\times$ 1.5 m and a focal distance of $d_{\rm f}=46.67$ m. The receiver at the focal plane is considered to be 0.814 m $\times$ 0.814 m in size providing a field of view of $\sim$ $1\hbox{$^\circ$ }\times 1\hbox{$^\circ$ }$ . This roughly corresponds to the astrometric instruments of Gaia. The rotational axis goes through the origin ${\rm O}$ of our coordinates perpendicular to the plane of Fig. 1. The distance from ${\rm O}$ to the center of the primary mirror (being the vertex of the parabola) ${\rm P}_1$ is r = 1.5 m. The distance from ${\rm P}_1$ to the center of the focal plane ${\rm C}$ is obviously the focal distance $d_{\rm f}=46.67$ m. The whole optical system is rotating with respect to ${\rm O}$ with an angular velocity $\Omega =60 \hbox {$^{\prime \prime }$ }/{\rm s}$ . The optical axis of the system is defined as the path of the light ray which goes perpendicular to the surface of the primary mirror through its center provided that the system does not rotate (represented in Fig. 1 by the bold horizontal line going from ${\rm P}_1$ to C). Without rotation light rays parallel to the optical axis converge to the single point C in the focal plane. The direction of an incoming light ray is parameterized with two angles: the along scan angle $a_{\rm L}$ (this angle is changing continuously for a given source because of the rotation; see Fig. 1) and the across scan angle $a_{\rm C}$ . The along scan angle is the angle between the instantaneous directions of the optical axis and the incoming light ray projected into the plane containing the optical axis and perpendicular to the vector of angular velocity of the system (i.e., the plane of Fig. 1). The across scan angle is the angle between the instantaneous directions of the optical axis and the incoming light ray projected into the plane containing both the optical axis and the vector of angular velocity. The along scan and across scan angles are widely used in the context of scanning astrometric missions like HIPPARCOS (Perryman et al. 1997) and Gaia (Perryman et al. 2001).

In order to evaluate the effects due to the rotation of the instrument we calculate aberration patterns for different values of the field angles $a_{\rm L}$ and $a_{\rm C}$ as well as the differences of the photocenters for each considered case. To compute aberration patterns a rectangular grid of parallel incoming light rays with direction characterized by some given $a_{\rm L}$ and $a_{\rm C}$ is generated. These light rays are then traced through the optical system until they intersect the focal plane. The coordinates of the intersection points produce the corresponding aberration pattern in the focal plane (see, e.g., Figs. 2 and 4). The photocenter of a pattern is defined as the mean position of all points of that pattern.

$\begin{figure} \par\includegraphics[width=190pt,clip=true]{4320_2_1.eps}\par\vsk... ...degraphics[width=190pt,clip=true]{4320_2_3.eps}\par\vskip -2pt \par \end{figure}$	Figure 2: Aberration patterns for the one-mirror system: a non-rotating instrument ( upper panel), a rotating instrument considering the light propagation delays and using the usual reflection law (middle panel, and a rotating instrument considering both the light propagation delays and the relativistic reflection law ( lower panel).
Open with DEXTER

We distinguish between two different effects changing the aberration patterns (and their photocenters) of a rotating instrument compared to those of an identical non-rotating instrument. The first effect is the change of orientation of various reflecting surfaces during the time delays needed for a light ray to propagate from the primary mirror to the focal plane. The second effect is the difference between the usual reflection law and the relativistic one.

Clearly, the propagation delays are related only to the finiteness of the light velocity. The delays appear also in the non-rotating case, but can be completely ignored since the orientation of all reflecting surfaces is constant. For a rotating instrument the propagation delays mean, in particular, that the light rays producing an aberration pattern (that is, the light rays intersecting the focal plane at the same moment of time) hit the primary mirror (and, generally speaking, all other mirrors) at different times. The effect of propagation delays can be directly calculated in our ray tracing software by using a specially designed iterative scheme.

There are several effects related to the propagation delays such as just the constant shift of the aberration patterns due to the change of the orientation of the instrument during the propagation time: an image of a star observed at time $t_{\rm obs}$ is produced by the light rays from the star that hit the primary mirror at time $\sim$ $t_{\rm obs}-d_{\rm f}/c$ when the orientation of the mirror differed by $\sim$ $\Omega~d_{\rm f}/c$ from the orientation at $t_{\rm obs}$ . Similar constant shifts will be caused by intermediate mirrors and by the motion of the focal plane during the propagation delay: during the light propagation the focal plane is moving and the photon hits the focal plane at different positions which correspond to different positions on the sky. This can be computed as $\sim$ $\Omega~(d_{\rm f}-r)/c$ for the one-mirror system depicted in Fig. 1. Note that in the limit when the center of rotation is infinitely far from the instrument (that is, when all parts of the instrument effectively have the same velocity), these constant shifts are fully equivalent to the normal aberration of light. The constant shifts of the aberration patterns, that can be relatively large, lead only to a constant time shift in the orientation parameters of the satellite derived from astrometric observations: the orientation obtained from observations at $t_{\rm obs}$ is actually the orientation the satellite had some small earlier time interval. This has only slight consequences on the measurements for any existing or planned astrometric projects. However, the propagation delays also lead to a deformation of the aberration patterns that depends on the field angles. These aberration pattern deformations together with the deformations due to the relativistic reflection law can be important as illustrated below. The distortions of the shape of the patterns are caused by different velocities of different parts of both mirrors and slightly different incident angles for each mirror.

For the one-mirror case these effects are illustrated in Fig. 2. The nine patterns in each of the three panels correspond to nine combinations of the field angles with $a_{\rm L}=-30\hbox{$^\prime$ }, 0\hbox{$^\prime$ }, +30\hbox{$^\prime$ }$ (horizontal direction) and $a_{\rm C}=-30\hbox{$^\prime$ }, 0\hbox{$^\prime$ }, +30\hbox{$^\prime$ }$ (vertical direction). For the focal length $d_{\rm f}=46.67$ m, $30 \hbox{$^\prime$ }$ corresponds to about 407 mm in the focal plane coordinates. The size of the axes in focal plane coordinates is 0.5 mm $\times$ 0.5 mm for all patterns. The aberration patterns in the upper panel are calculated for a non-rotating instrument. In the middle panel the aberration patterns are obtained using the usual reflection law, but the effects of the light propagation delays are taken into account. In the lower panel both the light propagation delays and the relativistic reflection law are used. An extremely high angular velocity $\Omega = 5\times 10^9\ \hbox{$^{\prime\prime}$ }/{\rm s}$ is used to exaggerate the distortion and make it clearly visible. The three rightmost patterns in both the middle and the lower panels are much larger than all other patterns. These six patterns extend to the left from the edge of the figure by about 3 times the size of the horizontal axis in each pattern. These parts of the patterns are not shown in Fig. 2. The axes for each pattern are centered at the corresponding photocenter. Note that these photocenters are significantly shifted between the three panels due to the constant propagation time effects discussed above.

Since for the one-mirror instrument the angle of each light ray with respect to the normal to the mirror at each point of the surface is not greater than $30 \hbox{$^\prime$ }$ , the effect of the relativistic reflection law on aberration patterns is very small. At point ${\rm P}_1$ the velocity vector is perpendicular to the normal to the mirror. Therefore, at this point for any $a_{\rm L}$ and $a_{\rm C}$ the relativistic reflection law coincides with the usual one (see Eq. (A.44)). A light ray going through that point will intersect the focal plane at the same point for both the usual and relativistic reflection laws. The light rays of the same grid not going through ${\rm P}_1$ have different images when using the usual reflection law and the relativistic one.

For realistic $\Omega =60 \hbox {$^{\prime \prime }$ }/{\rm s}$ the mean shift of the photocenters due to the propagation delays amount to $\delta \overline{a}^{~\rm d}_{\rm L} = 18.3842$ $\mu$ as. Note that this number can be reproduced with good accuracy by $\Omega~(2d_{\rm f}-r)/c=18.3807$ $\mu$ as as discussed above. The field-angle dependent change of the photocenters is at the level of 0.001 $\mu$ as and is shown in Table 1. The change of the photocenters due to the relativistic reflection law is a shift in the along-scan direction $\delta a_{\rm L}\approx\delta \overline{a}^{~\rm r}_L = -0.0008$ $\mu$ as and is independent of $a_{\rm L}$ and $a_{\rm C}$ at the level of 0.0001 $\mu$ as.

Table 1: The shifts of the aberration patterns for the one-mirror optical system rotating at $\Omega =60 \hbox {$^{\prime \prime }$ }/{\rm s}$ after subtracting the mean value $\delta \overline{a}^{~\rm d}_{\rm L}+\delta \overline{a}^{~\rm r}_{\rm L}=18.3834$ $\mu$ as.

3.2 A two-mirror optical system

Real optical systems usually have more than one mirror. Often the instruments involve mirrors inclined by about $45\hbox{$^\circ$ }$ to the optical axis (i.e., Nasmith focus, beam combiners, beam splitters, etc.). In this case the effects of the relativistic reflection law on the aberration pattern are significantly larger than in the case discussed above. Here we consider an optical system consisting of one parabolic primary mirror and one flat secondary mirror as depicted in Fig. 3. A flat secondary mirror M₂ has been added to the optical system depicted in Fig. 1. The distance from ${\rm P}_1$ to center of the flat mirror ${\rm P}_2$ is d₁₂. The whole system is again rigidly rotating with a constant angular velocity $\Omega$ in laboratory coordinates. The flat mirror is inclined at an angle $\theta$ with respect to the optical axis of the primary mirror. The focal plane position depends on the angle $\theta$ . The distance from ${\rm P}_1$ to ${\rm P}_2$ is d₁₂=3 m, and the distance from ${\rm P}_1$ to the rotational axis ${\rm O}$ is r=1.5 m. The distance from ${\rm P}_2$ to the center C of the focal plane is $d_{\rm f} - d_{12} = d_{2f}=43.67$ m. The bold line in Fig. 3 representing the optical axis goes from ${\rm P}_1$ to ${\rm P}_2$ and then to the focal plane center C.

Table 2: The shifts of the aberration patterns for the two-mirror optical system rotating at $\Omega =60 \hbox {$^{\prime \prime }$ }/{\rm s}$ for three values of $\theta$ after subtracting the specified mean values $\delta\overline{a}_{\rm L}^{~\rm d}+\delta\overline{a}_{\rm L}^{~\rm r}$ .

We repeat the ray tracing calculations as described in Sect. 3.1 above with this additional flat mirror. We use three different configurations of the flat mirror with inclination angles $\theta=+45 \hbox{$^\circ$ }$ , $\theta=0$ , and $\theta=-45 \hbox{$^\circ$ }$ . Figure 4 shows the aberration patterns obtained with $\theta = 45\hbox {$^\circ $ }$ (again for a large angular velocity of $\Omega = 5\times 10^7 \hbox{$^{\prime\prime}$ }/\rm {s}$ , 100 times lower than for Fig. 2, was used in order to make the effects visible). The same 9 combinations of $a_{\rm L}$ and $a_{\rm C}$ , and the same size and centering of the axes are used for each panel as described above for Fig. 2. The upper panel shows the aberration patterns for a non-rotating instrument ( $\Omega=0$ ). These patterns are identical to those in the left panel of Fig. 1. Clearly, the aberration patterns for the rotating instrument, the middle and the lower panel look differently to Fig. 2. Numerical values of the shifts of the photocenters $\delta a_{\rm L}$ and $\delta a_{\rm C}$ for $\Omega =60 \hbox {$^{\prime \prime }$ }/{\rm s}$ are presented in Table 2 for inclination angles $\theta = 45 \hbox{$^\circ$ }, 0\hbox{$^\circ$ }, -45 \hbox{$^\circ$ }$ . The mean constant shift $\delta\overline{a}_{\rm L}^{~\rm d}$ of the patterns due to the light propagation delays and $\delta\overline{a}_{\rm L}^{~\rm r}$ due to the relativistic reflection law are given at the top of each table. The tables show the part of the total shifts dependent on the field angles. The position-dependent effects in $\delta a_{\rm L}^{~\rm d}$ and $\delta a_{\rm L}^{~\rm r}$ have opposite signs and are 2-3 times larger than the total shift $\delta a_{\rm L}=\delta a_{\rm L}^{~\rm d}+\delta a_{\rm L}^{~\rm r}$ . On the contrary, the effects in $\delta a_{\rm C}^{~\rm d}$ and $\delta a_{\rm C}^{~\rm r}$ are of the same sign and are about 2 times less than in the sum $\delta a_{\rm C}=\delta a_{\rm C}^{~\rm d}+\delta a_{\rm C}^{~\rm r}$ .

$\begin{figure} \par\includegraphics[width=190pt,clip=true]{4320_4_1.eps}\par\vsk... ...degraphics[width=190pt,clip=true]{4320_4_3.eps}\par\vskip -2pt \par \end{figure}$	Figure 4: Aberration patterns for the two-mirror system with $\theta = 45\hbox {$^\circ $ }$ : non-rotating instrument ( upper panel), rotating instrument considering the light propagation delays and using the usual reflection law (middle panel), and rotating instrument considering both the light propagation delays and the relativistic reflection law ( lower panel).
Open with DEXTER

As for the one-mirror system, for any value of $\theta$ the shifts due to the light propagation delays exceed the level of 1 $\mu$ as and amount to $\delta\overline{a}_{\rm L}^{~\rm d}\sim2 ~\hbox{$\mu$ as}$ . For the two-mirror system $\delta\overline{a}_{\rm L}^{~\rm d}$ is significantly lower than for the one-mirror system since the effects of the motion of the primary mirror and the motion of the focal plane largely compensate each other if just one intermediate mirror is present.

For $\theta=0$ the shifts due to the relativistic deflection law are again very small as was the case for the one-mirror system. The situation with these shifts is different for $\theta=\pm 45\hbox{$^\circ$ }$ where the mean shift $\delta\overline{a}_{\rm L}^{~\rm r}\sim0.3~\hbox{$\mu$ as}$ . For $\theta=\pm 45\hbox{$^\circ$ }$ all the light rays hit the flat surface at an angle of about $\alpha = \pm 45\hbox{$^\circ$ }$ with respect to the normal and the factor $\left\vert\sin\alpha\right\vert$ appearing in (A.60) is of the order of $1/\sqrt{2}\approx0.7$ . Each light ray of the grid hits the mirror at a slightly different value of $\alpha$ , but the main perturbation due to the relativistic reflection law can be estimated considering the light ray going along the optical axis. Using (A.60) we obtain

$\displaystyle \left\vert \delta_i \right\vert \simeq \left\vert 2\ \frac{v_i}{c}\ \frac{d_{i\rm f}}{d_{\rm f}} \ \sin \theta_i\ \sin \varphi_i\ \right\vert.$

(2)

The presence of the factor $d_{i\rm f}/d_{\rm f}$ in (1) and (2) can be explained by a small perturbation $\Delta$ of the propagation direction of a light ray by a mirror located at a distance $d_{i\rm f}$ from the focal plane causing a linear shift in the focal plane $d_{i\rm f}~\Delta$ which is efficiently interpreted as an angular shift of $d_{i\rm f}/d_{\rm f}~ \Delta$ . In the more general case when the intermediate reflecting surfaces are not flat, Eq. (2) is no longer valid, but gives a reasonable idea of the magnitude of the effect provided that all reflecting surfaces are not too different from a flat mirror. The cumulative effect of a series of (almost) flat mirrors will not be a direct addition of all $\delta_i$ since the relativistic perturbation may occur at different planes. An analytic expression in vector form can be derived for the combined effect, but since the resulting formula is complicated and still a rough approximation it will not be discussed here. Equation (2) also has been checked for some other optical systems involving more reflecting surfaces of different shapes, sizes and velocities. A good agreement with the numbers from numerical ray tracing was obtained in all cases.

4 Concluding remarks

We have considered in detail the main relativistic effect on the imaging by a rotating optical system which is produced by the relativistic modification of the reflection law. We have considered two simple optical systems containing one and two mirrors. Although the size of the primary mirror, the focal length and the angular velocity of rotation of both systems were defined to agree with the corresponding parameters of Gaia, it is not clear how large these effects will be for the real optical scheme of Gaia. We have seen that the effects are small for the one-mirror system and that they may amount of 0.3 $\mu$ as for the two-mirror system. For a real Gaia optical scheme the effect may be much larger because of the presence of several inclined mirrors. The two examples of a rotating optical system considered above do not allow us to predict the relativity-induced photocenter shifts for a real optical system like Gaia. A detailed calculation of the photocenter shifts in principle can be done using the ray tracing software developed for this investigation.

The part of the effect that does not depend on the position in the focal plane can be interpreted as a constant change in the orientation of the satellite (as discussed at the end of the previous Section for propagation delay effects). Moreover, if a satellite (like Gaia) has two optically different telescopes, the difference in the main effects for these two telescopes can be interpreted as a change in the angle between the two instruments.

In this paper we confined ourselves to ray tracing in the geometric optics limit. A more strict way to analyze the imaging by a rotating optical system is to apply wave optics and calculate corresponding intensity patterns (PSF or similar characteristics). The intensity patterns would then allow us to predict the observable shifts of the photocenters more reliably than the aberration patterns used in this paper. Preliminary calculation with a simplified model fosters the hope that at optical wavelengths the differences in the photocenter shifts calculated from ray tracing and from wave optics are negligible. However, the effects of propagation delays due to the rotation of the telescope may play a role. This deserves separate investigation.

References

Online Material

Appendix A: Reflection of a light ray by an arbitrarily moving mirror

A.1 Notation and conventions

We summarize the most important notation and conventions used throughout the paper:

A.2 Coordinate representation of an arbitrary moving mirror

Let us consider an inertial reference system of Special Relativity (t,xⁱ). We define an arbitrary mirror in arbitrary motion by a bundle of particles moving along worldlines

Here we do not pay attention to any physical properties of the mirror as a "physical body'' (elasticity, deformations, etc.). We just consider that (A.1) formally defines the position of each point of the mirror at each moment of time. The source of information for $x_m^i(t;\xi,\eta)$ for realistic mirrors and the plausibility of these representation of an arbitrarily shaped and arbitrarily moving mirror is discussed in Sect. 2 above.

Starting from (A.1) for any fixed time t at any fixed point of the mirror characterized by some values of $\xi$ and $\eta$ we have two three-dimensional vectors tangent to the surface of the mirror at the considered point as

$\displaystyle l^i={\partial\over\partial\xi}~x_m^i(t;\xi,\eta),$			(A.2)
$\displaystyle m^i={\partial\over\partial\eta}~x_m^i(t;\xi,\eta).$			(A.3)

A.3 Transforming the mirror surface from one inertial reference system to another

Let us now define another reference system (T,X^a) moving with constant velocity vⁱ with respect to (t,xⁱ). The coordinates (T,X^a) and (t,xⁱ) are related by a Lorentz transformation of the form

$\displaystyle c~t=\Lambda^0_0~c~T+\Lambda^0_a~X^a ,$			(A.6)
$\displaystyle x^i=\Lambda^i_0~c~T+\Lambda^i_a~X^a.$			(A.7)

$\displaystyle \Lambda^0_0$	=	$\displaystyle \gamma,$	(A.8)
$\displaystyle \Lambda^0_a$	=	$\displaystyle k^a~\gamma,$	(A.9)
$\displaystyle \Lambda^i_0$	=	$\displaystyle k^i~\gamma,$	(A.10)
$\displaystyle \Lambda^i_a$	=	$\displaystyle \delta^{ia} + \frac{\gamma^2}{1+\gamma}~k^i~k^a ,$	(A.11)
$\displaystyle \gamma$	=	$\displaystyle \left(1-\vec{k}\cdot\vec{k} \right)^{-\frac{1}{2}} ,$	(A.12)
$\displaystyle \vec{k}$	=	$\displaystyle \frac{1}{c}~\vec{v} .$	(A.13)

$\displaystyle c~T=\bar{\Lambda}^0_0~c~t+\bar{\Lambda}^0_i~x^i,$			(A.14)
$\displaystyle X^a=\bar{\Lambda}^a_0~c~t+\bar{\Lambda}^a_i~x^i.$			(A.15)

In the reference system (T,X^a) the mirror can be also represented in the same form as in Sect. A.2

$\displaystyle L^a={\partial\over\partial\xi}~X_m^a(T;\xi,\eta),$			(A.17)
$\displaystyle M^a={\partial\over\partial\eta}~X_m^a(T;\xi,\eta),$			(A.18)
$\displaystyle N^a=\varepsilon_{abc}~L^b~M^c.$			(A.19)

$\displaystyle c~T = \bar{\Lambda}^{0}_{0}~c~t + \bar{\Lambda}^{0}_{i}~x_m^{i}(t;\xi,\eta),$			(A.21)
$\displaystyle X^a_m(T;\xi,\eta) = \bar{\Lambda}^{a}_{0}~c~t + \bar{\Lambda}^{a}_{i}~x_m^{i}(t;\xi,\eta).$			(A.22)

V^a_m	=	$\displaystyle c~\frac{\bar{\Lambda}^a_0+\bar{\Lambda}^a_i~k_m^i}{\bar{\Lambda}^0_0 + \bar{\Lambda}^0_i~k_m^i},$	(A.23)
L^a	=	$\displaystyle \bar{S}^a_i~l^i$	(A.24)
M^a	=	$\displaystyle \bar{S}^a_i~m^i$	(A.25)
$\displaystyle \bar{S}^a_i$	=	$\displaystyle \bar{\Lambda}^a_i-\bar{\Lambda}^0_i\frac{\bar{\Lambda}^a_0+\bar{\Lambda}^a_j~k_m^j}{\bar{\Lambda}^0_0+\bar{\Lambda}^0_j~k_m^j},$	(A.26)

vⁱ_m	=	$\displaystyle c~\frac{\Lambda^i_0+\Lambda^i_a~K_m^a}{\Lambda^0_0 + \Lambda^0_a~K^a_m},$	(A.27)
lⁱ	=	Sⁱ_a L^a,	(A.28)
mⁱ	=	Sⁱ_a M^a,	(A.29)
Sⁱ_a	=	$\displaystyle \Lambda^i_a-\Lambda^0_a\frac{\Lambda^i_0+\Lambda^i_b~K^b_m}{\Lambda^0_0+\Lambda^0_b~K^b_m},$	(A.30)

$\displaystyle \bar{S}^b_j~\bar{S}^c_k\varepsilon_{abc}={1\over \gamma~(1-\vec{k}\cdot\vec{k}_m)}~~S^i_a\varepsilon_{ijk},$			(A.31)
$\displaystyle S^j_b~S^k_c\varepsilon_{ijk}=\gamma~(1-\vec{k}\cdot\vec{k}_m)~\bar{S}^a_i\varepsilon_{abc}.$			(A.32)

$\displaystyle N^a={1\over \gamma~(1-\vec{k}\cdot\vec{k}_m)}~ S^i_a~n^i,$			(A.33)
$\displaystyle n^i=\gamma~(1-\vec{k}\cdot\vec{k}_m)~ \bar{S}^a_i~N^a.$			(A.34)

$\begin{displaymath} \varepsilon_{ajc}~\delta^{kb}+ \varepsilon_{kac}~\delta^{jb}+ \varepsilon_{jkc}~\delta^{ab}= \varepsilon_{ajk}~\delta^{bc}. \end{displaymath}$

(A.35)

A.4 Observable and coordinate normal vectors

Let us consider an infinitely small element of the mirror which is characterized by infinitely small intervals around some fixed values of $\xi$ and $\eta$ . The velocity of the element is $v_m^i(t;\xi,\eta)$ in the laboratory reference system (t,xⁱ). Let us now identify the constant velocity vⁱ of the reference system (T,X^a) relative to (t,xⁱ) with $v_m^i(t;\xi,\eta)$ of the considered point given by $\xi$ and $\eta$ and at some fixed moment of time: $v^i\equiv v_m^i(t;\xi,\eta)$ . Then (T,X^a) is a momentarily co-moving inertial reference system of the considered infinitesimal element of the mirror. The coordinate basis of (T,X^a) gives an orthonormal tetrad of an observer co-moving with the considered element of the mirror. That reference system can be used to describe the results of instantaneous observations made by that observer.

In particular, N^a is the observable normal vector which will be used below to formulate the reflection law for the light rays as seen by the co-moving observer. From now on, N^a is always used in this sense (that is, we always put $\vec{k}_m=\vec{k}$ ). Normalizing the vectors one can see that the unit vector $\hat{\vec{N}}=\vec{N}/\vert\vec{N}\vert$ normal to the surface as seen by an observer instantaneously co-moving with a particular point of the mirror relates to the normal unit vector $\hat{\vec{n}}=\vec{n}/\vert\vec{n}\vert$ seen by an observer at rest relative to (t,xⁱ) as

$\displaystyle \hat{\vec{N}}=\frac{1}{ \sqrt{1 - (\vec{k} \cdot \hat{\vec{n}}) ^... ...\vec{n}}- (\vec{k}\cdot\hat{\vec{n}})~ \frac{\gamma}{1+\gamma}~ \vec{k}\right),$			(A.36)
$\displaystyle \hat{\vec{n}}= \frac{1}{ \sqrt{1 + \gamma^2(\vec{k}\cdot\hat{\vec... ...vec{N}}+ (\vec{k}\cdot\hat{\vec{N}})~ \frac{\gamma^2}{1+\gamma}\vec{k}\right) .$			(A.37)

A.5 Wave vectors in the two inertial reference systems

In order to consider the light reflection from the mirror we first need to relate the wave vectors of the incoming and outgoing light rays in the two considered coordinate systems. In the reference system (t,xⁱ)the incoming light ray is characterized by its null wave vector $p^\mu$ ( $\eta_{\mu\nu}~p^\mu~p^\nu=0$ ). The unit light ray direction $\sigma^i$ ( $\vec{\sigma}\cdot\vec{\sigma}=1$ ) in that reference system is related to $p^\mu$ as $\sigma^i=p^i/p^0$ . In the reference system (T,X^a) the null wave vector of the same light ray is $P^\alpha$ , and the unit light ray direction $\Sigma^a=P^a/P^0$ ( $\vec{\Sigma}\cdot\vec{\Sigma}=1$ ). The frequencies f and F of the light in the corresponding reference systems are linearly proportional to p⁰ and P⁰, respectively.

The wave vectors $p^\mu$ and $P^\alpha$ are related by the Lorentz transformation

$\displaystyle \Sigma^a={\overline\Lambda^a_0+\overline\Lambda^a_i~\sigma^i\over \overline\Lambda^0_0+\overline\Lambda^0_i~\sigma^i},$			(A.39)
$\displaystyle F=\left(\overline\Lambda^0_0+\overline\Lambda^0_i~\sigma^i\right)~f.$			(A.40)

A.6 Reflection as seen by an instantaneously co-moving observer

For an observer instantaneously co-moving with the element of the mirror where the light ray is reflected the following simple reflection law is valid (in an inertial reference system of Special Relativity for a mirror at rest)

$\displaystyle F^\prime$	=	F,	(A.41)
$\displaystyle \vec{\Sigma}^\prime$	=	$\displaystyle \vec{\Sigma}-2~(\hat{\vec{N}}\cdot\vec{\Sigma})~\hat{\vec{N}},$	(A.42)

We consider this reflection law as given, but is well known the method to derive it from Maxwell equations for electromagnetic field for a mirror at rest (Jackson 1975). In the instantaneously co-moving reference system (T,X^a) the coordinate velocity of the reflecting point vanishes but its acceleration may differ from zero. However, the acceleration cannot affect the instantaneous process of reflection considering of the equivalence principle as long as the conditions for geometrical optics are satisfied, i.e., as long as the amplitude, polarization and wave vector of an electromagnetic wave do not change significantly over a distance determined by the wavelength $\lambda$ . This implies that the acceleration a_m of the mirror should satisfy a constraint of the form $a_m\ll c^2/\lambda$ (see, Mashhoon (2005) for a detailed discussion of accelerated observers in special relativity).

A.7 Reflection as seen by a laboratory observer

Now combining the reflection law in reference system (T,X^a) with the transformations discussed in Sects. A.3-A.5 one gets the reflection law as seen in reference system (t,xⁱ) where the mirror is arbitrarily moving

$\displaystyle f^\prime$	=	$\displaystyle f~ {1+\left(\vec{k} \cdot \hat{\vec{n}}\right)~[~\hat{\vec{n}}\cd... ...- 2~\vec{\sigma} \right)~] \over 1-\left(\vec{k} \cdot \hat{\vec{n}}\right)^2},$	(A.43)
$\displaystyle \vec{\sigma}^\prime$	=	$\displaystyle \frac{\left( 1-\left(\vec{k} \cdot \hat{\vec{n}}\right)^2 \right)... ...^2 - 2(\vec{k} \cdot \hat{\vec{n}}) \ (\vec{\sigma} \cdot \hat{\vec{n}}) }\cdot$	(A.44)

This can be used to derive the 4-momentum or 4-velocity of a particle $p^{\prime \mu}$ after a completely elastic collision with a surface of infinite mass:

$\displaystyle p^{\prime 0}$	=	$\displaystyle p^{0} - 2 \vec{k}\cdot \hat{\vec{n}} \ \left( \frac{ \vec{p} \cdo... ... \hat{\vec{n}}\ p^{0} }{1- \left( \vec{k}\cdot \hat{\vec{n}}\right)^2} \right),$	(A.45)
$\displaystyle p^{\prime i}$	=	$\displaystyle p^i - 2 \hat{n}^i \ \left( \frac{ \vec{p} \cdot \hat{\vec{n}}- \v... ... \hat{\vec{n}}\ p^{0} }{1- \left( \vec{k}\cdot \hat{\vec{n}}\right)^2} \right),$	(A.46)

Let us note two important properties of (A.43)-(A.44), also applicable to (A.45)-(A.46):

Multiplying both sides of (A.44) by $\hat{\vec{n}}$ and using the following definitions for the angles between vectors (see Fig. A.1)

$\displaystyle -{\vec{\sigma}}\cdot\hat{\vec{n}}$	=	$\displaystyle \cos\alpha,$	(A.47)
$\displaystyle \vec{\sigma}^\prime \cdot \hat{\vec{n}}$	=	$\displaystyle \cos \alpha^\prime,$	(A.48)
$\displaystyle \vec{k} \cdot \hat{\vec{n}}$	=	$\displaystyle k \cos\left(\varphi - \frac{\pi}{2}\right) = k~\sin \varphi,$	(A.49)

$\displaystyle f^\prime$	=	$\displaystyle f~{1+2~k~\sin\varphi~\cos\alpha+k^2~\sin^2\varphi \over 1-k^2~\sin^2\varphi},$	(A.50)
$\displaystyle \cos \alpha^\prime$	=	$\displaystyle \frac{ 2~ k~ \sin \varphi + \left(1 + k^2~\sin^2 \varphi\right)~ \cos \alpha } {1 + k^2 \sin^2 \varphi + 2~k~\sin \varphi \cos \alpha }\cdot$	(A.51)

$\displaystyle \sin \alpha^\prime$ = $\displaystyle \sin\alpha~ {1-k^2~\sin^2\varphi \over 1+2~k~\sin\varphi~\cos\alpha+k^2~\sin^2\varphi}\cdot$

(A.52)

Angles $\alpha$ , $\alpha^\prime$ and $\varphi$ are illustrated in Fig. A.1. The angle $\alpha$ lies between 0 and $\pi/2$ (since we always consider that the incoming light ray comes to the mirror from one particular side of the tangent plane to the mirror's surface at the point of reflection). For the same reason we have $0\le\alpha^\prime\le\pi/2$ . Angle $\varphi$ lies between $-\pi/2$ and $\pi/2$ . It is negative if the angle between $\vec{k}$ and $\hat{\vec{n}}$ is greater than $\pi/2$ and positive otherwise.

Additionaly, our central results have been derived (A.50)-(A.51) directly from Maxwell's equations by a principle of phase matching: the phase of the incoming wave should agree with the phase of the outgoing one (e.g., Jackson (1975), Sect. 7.3). This generalizes the work of Bolotovskii & Stolyarov (1989) for a flat mirror moving with constant velocity. For an accelerated mirror such a treatment, however, is meaningful only as long as the conditions for geometrical optics are satisfied.

A.8 Particular case of a flat mirror moving with a constant velocity

As a particular example let us apply the developed scheme to a flat mirror moving at constant velocity in reference frame (t,xⁱ). The mathematical expression for that is a worldline Eq. (A.1) of the form

$\begin{displaymath} \vec{x}_m(t,\xi,\eta)=\vec{x}_{m0}+\vec{l}~\xi+\vec{m}~\eta+\vec{v}_m~t, \end{displaymath}$

(A.53)

$\begin{displaymath} \vec{X}_m(T,\xi,\eta)=\vec{X}_{m0}+\vec{L}~\xi+\vec{M}~\eta+\vec{V}_m~T, \end{displaymath}$

(A.54)

Since for a flat mirror $\vec{l}$ and $\vec{m}$ are constants, the unit normal vector $\hat{\vec{n}}$ is also a constant. Since $\vec{v}_m$ is also time-independent the same reflection law described by (A.44) or (A.51) is valid for any point on the mirror and at any moment of time. (A.51) coincides with the results of Gjurchinovski (2004). Our framework is more general than that of Gjurchinovski (2004) since we do not assume the vectors $\vec{\sigma}$ , $\hat{\vec{n}}$ , $\vec{k}$ and $\vec{\sigma}^\prime$ to be coplanar, and our derivation is valid for an arbitrary mirror in arbitrary motion.

A.9 Low velocity limit

It is useful to derive the first-order expansion of (A.43)-(A.51) in powers of v_m/c since in practice the velocity of the mirror will be small compared to the light velocity. One gets

$\displaystyle f^\prime$	=	$\displaystyle f~\left(1-2~(\vec{\sigma}~\hat{\vec{n}})~(\vec{k}\cdot\hat{\vec{n}})+{\cal O}(c^{-2})\right),$	(A.55)
$\displaystyle \vec{\sigma}^\prime$	=	$\displaystyle \vec{\sigma}-2~(\vec{\sigma} \cdot \hat{\vec{n}})~\vec{\hat{n}}$
		$\displaystyle +2~(\vec{k}\cdot\hat{\vec{n}})~\left[\left(1-2~(\vec{\sigma}\cdot... ...vec{n}}+(\vec{\sigma}\cdot\hat{\vec{n}})~\vec{\sigma}\right] +{\cal O}(c^{-2}),$	(A.56)

$\displaystyle f^\prime$	=	$\displaystyle f~\left(1+2~k~\sin\varphi~\cos\alpha+{\cal O}(c^{-2})\right),$	(A.57)
$\displaystyle \cos \alpha^\prime$	=	$\displaystyle \cos\alpha+2~k~\sin\varphi~\sin^2\alpha+{\cal O}(c^{-2}),$	(A.58)
$\displaystyle \sin \alpha^\prime$	=	$\displaystyle \sin\alpha-k~\sin\varphi~\sin2\alpha+{\cal O}(c^{-2}).$	(A.59)

$\begin{displaymath} \alpha^\prime-\alpha=-2~k~\sin\varphi~\sin\alpha+{\cal O}(c^{-2}). \end{displaymath}$

(A.60)

Relativistic effects on imaging by a rotating optical system

1 Introduction