A&A 485, 363-376 (2008)
DOI: 10.1051/0004-6361:20078631

Weak lensing goes bananas: what flexion really measures

P. Schneider¹ - X. Er^1,2

1 - Argelander-Institut für Astronomie, Universität Bonn, Auf dem Hügel 71, 53121 Bonn, Germany
2 - Max-Planck-Institut für Radioastronomie, Auf dem Hügel 69, 53121 Bonn, Germany

Received 7 September 2007 / Accepted 27 February 2008

Abstract
In weak gravitational lensing, the image distortion caused by shear measures the projected tidal gravitational field of the deflecting mass distribution. To lowest order, the shear is proportional to the mean image ellipticity. If the image sizes are not small compared to the scale over which the shear varies, higher-order distortions occur, called flexion. For ordinary weak lensing, the observable quantity is not the shear, but the reduced shear, owing to the mass-sheet degeneracy. Likewise, the flexion itself is unobservable. Instead, higher-order image distortions measure the reduced flexion, i.e., derivatives of the reduced shear. We derive the corresponding lens equation in terms of the reduced flexion and calculate the resulting relation between brightness moments of source and image. Assuming an isotropic distribution of source orientations, estimates for the reduced shear and flexion are obtained and then tested with simulations. In particular, the presence of flexion affects the determination of the reduced shear. The results of these simulations yield the amount of bias of the estimators as a function of the shear and flexion. We point out and quantify a fundamental limitation of the flexion formalism in terms of the product of reduced flexion and source size. If this product increases above the derived threshold, multiple images of the source are formed locally, and the formalism breaks down. Finally, we show how a general (reduced) flexion field can be decomposed into its four components. Two of them are due to a shear field, carrying an E- and B-mode in general. The other two components do not correspond to a shear field, and they can also be split up into corresponding E- and B-modes.

Key words: gravitational lensing - galaxies: evolution - galaxies: statistics - cosmology: diffuse radiation

1 Introduction

Weak gravitational lensing provides a powerful tool for studying the mass distribution of clusters of galaxies, as well as the large scale structure in the Universe (see Mellier 1999; Bartelmann & Schneider 2001; Refregier 2003; Schneider 2006; Munshi et al. 2006, for reviews on weak lensing). It has led to constraints on cosmological parameters, such as those characterizing structure formation and the mass density of the Universe.

In weak lensing, one employs the fact that the image ellipticity of a distant source is modified by the tidal gravitational field of the intervening matter distribution. Based on the assumption that the orientation of distant sources is random, the ellipticity of each image yields an unbiased estimate of the line-of-sight integrated tidal field, usually called shear in lensing. The shear thus carries information about the properties of the mass distribution. Formally, the shear is described in terms of a first-order expansion of the lens equation, i.e., the locally linearized lens equation. This yields a valid description of the mapping from the image to the source sphere, as long as the images are small compared to the length scale on which the shear varies. However, this linear approximation breaks down for larger sources, or does so in regions of the lens plane where the shear varies rapidly. The most visible failure of the linearized lens equation is the occurrence of giant arcs, which in most cases correspond actually to multiple images of a background source. To model them, the full lens equation needs to be studied. However, there is an intermediate regime where the linearized lens equation breaks down, although (locally) no multiple images are formed - the arclets regime. Arclets are fairly strongly distorted images of background sources (Fort et al. 1988; Fort & Mellier 1994), though they do not correspond to multiple images.

Arclets are the most natural application for flexion. Flexion has been introduced by Goldberg & Bacon (2005) and Bacon et al. (2006), and describes the lowest-order deviation of the lens mapping from its linear expansion; it has also been termed ``sextupole lensing'' and been treated by Irwin & Shmakova (2005, 2006, and references therein). It corresponds to the derivative of the shear; in combination with a strong shear, this can deform round images into arclets, giving rise to images which resemble the shape of a banana. In their original paper, Goldberg & Bacon (2005) considered only a single component of flexion which, however, only provides an incomplete description of shear derivatives. In Bacon et al. (2006), the need for a second flexion component was recognized.

In the first part of this paper, we present the general theory of flexion; in contrast to earlier work, we explicitly consider the quantities that can be actually observed, by accounting for the mass-sheet degeneracy (Falco et al. 1985; Gorenstein et al. 1988). That is, a change of the surface mass density $\kappa$ of the form $\kappa \to \lambda\kappa+(1-\lambda)$ leaves the shape of all observed images invariant. In usual weak lensing, this is accounted for by recognizing that not the shear $\gamma$ can be obtained from observations, but only the reduced shear $g=\gamma/(1-\kappa)$ (Schneider & Seitz 1995). The difference of shear and reduced shear is typically small, in particular in applications of cosmic shear, since along most lines-of-sight, the value of $\kappa$ is very much smaller than unity. In applications of flexion, however, we expect that the surface mass density no longer is very small; for instance, arclets occur in the inner parts of clusters where $\kappa\ga 0.1$ . Therefore, the difference between shear and reduced shear can no longer be neglected. Gradients of the shear are not directly observable; only derivatives of the reduced shear are, and thus we define the (reduced) flexion in terms of derivatives of g. In Sect. 2.1 we briefly recall the irreducible tensor components which are defined in term of their behavior under rotations of the coordinate system. It turns out that a complex notation for these tensor components is very useful. In Sect. 2.2 we expand the lens equation to second order, before deriving the corresponding lens equation (and relation for the local Jacobian) which is invariant under mass-sheet transformations. The second-order term in this lens equation is fully described by our reduced flexion components G₁ and G₃.

As is known from usual weak lensing studies, a measured shear is not necessarily accounted for by an (equivalent) surface mass density. Since the shear is a two-component quantity, it has one degree of freedom more than the $\kappa$ field. Therefore, shear fields are decomposed into E- and B-modes (Crittenden et al. 2002; Schneider et al. 2002), where the former are due to a $\kappa$ field, whereas the latter describes the remaining (``curl'') part. A similar situation occurs in flexion, which has four components. Therefore, in Sect. 3 we consider the decomposition of a general flexion field into contributions due to the gradient of the shear and those not related to the shear field. The former one can then be further subdivided into flexion resulting from an E- and B-mode shear field. We carry out this decomposition for the flexion as well as for the reduced flexion.

In Sect. 4 we then define brightness moments of sources and images and derive the transformation laws between them. This approach is very similar to the HOLICs approach developed by Okura et al. (2007a,b) and later considered by Goldberg & Leonard (2007), except that we explicitly write all relations in terms of the reduced shear and the reduced flexion. Generalizing the usual assumption that the expectation value of the source ellipticity is zero - due to the phase averaging over source orientations - to the expectation values of all source shape parameters which are not invariant under coordinate rotations (as appropriate for a statistically isotropic Universe), we obtain in Sect. 5 estimates for the reduced shear and reduced flexion in terms of the brightness moments of the images. In Sect. 6 we perform a number of numerical experiments to test the validity of our approach and the accuracy of the estimators derived. In particular, we point out that there is a fundamental limit where the theory of flexion has to break down - the second-order lens equation is non-linear and will in general have critical curves, leading to multiple images of the source (or parts of it). If the source is cut by a caustic, different parts of it will have different numbers of images, and the assumption of random source orientation (which underlies all weak lens applications) will break down - the caustic introduces a preferred orientation into the source plane. In Appendix B we provide a full classification of the critical curves of the second-order lens equation and use these results in order to obtain the maximum source size (for given values of the reduced flexion) for which the flexion concept still makes sense. We discuss our results in Sect. 7.

2 Complex lensing notation

Like in many other instances in weak lensing, flexion is best described by using complex notation, which we shall briefly introduce next and which will be used for vectors and tensor components throughout this paper.

2.1 Irreducible tensor components

For a two-dimensional vector $\mbox{\boldmath$x$ }=(x_1,x_2)$ , we define the complex number $x=x_1+{\rm i} x_2$ . Under rotations of the coordinate system by an angle $\varphi$ , x gets multiplied by the phase factor ${\rm e}^{-{\rm i}\varphi}$ . For a tensor of second rank, whose Cartesian components are Q_ij, we define the complex numbers $Q_2=Q_{11}-Q_{22}+{\rm i}(Q_{12}+Q_{21})$ and $Q_0=Q_{11}+Q_{22}+{\rm i}(Q_{12}-Q_{21})$ . A rotation of the coordinate systems by an angle $\varphi$ multiplies Q₂ by the phase factor ${\rm e}^{-2{\rm i}\varphi}$ , whereas Q₀ remains unchanged. This is most easily seen by considering that the prototype of a second rank tensor is Q_ij=x_i y_j, where $\mbox{\boldmath$x$ }$ and $\mbox{\boldmath$y$ }$ are vectors; the foregoing statements are then obtained by noting that the complex numbers xy and x^* y are multiplied by ${\rm e}^{-2{\rm i}\varphi}$ and 1, respectively, under coordinate rotations. According to this transformation behavior, we shall loosely speak about Q₀ as a spin-0 quantity, whereas x and Q₂ are spin-1 and spin-2 quantities, respectively.

We shall be dealing only with totally symmetric tensors. If Q_ijis symmetric, then

$\begin{displaymath}Q_2:=Q_{11}-Q_{22}+2{\rm i}Q_{12} \; ; \;\; Q_0:=Q_{11}+Q_{22}. \end{displaymath}$

(1)

If T_ijk is a symmetric third-rank tensor, we define its spin-3 and spin-1 components as

T₃	:=	$\displaystyle T_{111}-3T_{122}+{\rm i}\left(3T_{112}-T_{222}\right) ;$
T₁	:=	$\displaystyle T_{111}+T_{122} +{\rm i}\left(T_{112}+T_{222}\right) .$	(2)

Furthermore, if F_ijkl denotes a symmetric fourth-rank tensor, we decompose it into its spin-4, spin-2 and spin-0 components, respectively,

F₄	:=	$\displaystyle F_{1111}-6F_{1122}+F_{2222}+4{\rm i}\left(F_{1112}-F_{1222}\right) ;$
F₂	:=	$\displaystyle F_{1111}-F_{2222}+2{\rm i}\left(F_{1112}+F_{1222}\right) ;$
F₀	:=	F₁₁₁₁+2F₁₁₂₂+F₂₂₂₂ .	(3)

Apart from notational simplicity, the complex lensing notation provides a check for the validity of equations. In a valid equation, each term has to have the same spin. The product of a spin-m and a spin-n quantity has spin m+n. The complex conjugate of a spin-nquantity has spin -n.

2.2 Second-order expansion of the local lens equation

In weak lensing, the lens equation is linearized locally by writing the relative source coordinate $\mbox{\boldmath$\beta$ }$ in terms of the image position $\mbox{\boldmath$\theta$ }$ as $\beta_i=\theta_i-\psi_{,ij}\theta_j$ , where $\psi$ is the deflection potential, indices separated by a comma denote partial derivatives with respect to $\theta_i$ , and summation over repeated indices is implied. Note that the form of this equation implies that the origin of the lens plane, $\mbox{\boldmath$\theta$ }=0$ , is mapped onto the origin of the source plane. The surface mass density $\kappa$ and the complex shear $\gamma$ at the origin are given in terms of the deflection potential, $\kappa=(\psi_{,11}+\psi_{,22})/2$ , $\gamma=(\psi_{,11}-\psi_{,22})/2 +{\rm i}\psi_{,12}$ , being spin-0 and spin-2 fields, respectively. In our complex notation, the locally linearized lens equation reads

$\begin{displaymath}\beta=(1-\kappa)\theta-\gamma\theta^* . \end{displaymath}$

(4)

We next generalize this result to a second-order local expansion of the lens equation, which in Cartesian coordinates reads $\beta_i=\theta_i-\psi_{,ij}\theta_j-\psi_{,ijk}\theta_j\theta_k$ /2. The third-order derivatives of $\psi$ are related to the gradient of $\kappa$ and $\gamma$ . To write these derivatives also in complex form, we define the differential operators

$\begin{displaymath}\nabla_{\rm c}:={\partial\over \partial \theta_1}+{\rm i}{\pa... ...rtial \theta_1}-{\rm i}{\partial\over \partial \theta_2} \cdot \end{displaymath}$

(5)

The differential operator $\nabla_{\rm c}$ turns a spin-n field into a spin-(n+1) field, whereas $\nabla_{\rm c}^*$ reduces the spin by one unit. One finds, for example,

$\displaystyle \nabla_{\rm c}\kappa$	=	$\displaystyle {1\over 2}\left[ \psi_{,111}+\psi_{,122}+{\rm i}\left(\psi_{,112}+\psi_{,222}\right) \right] ;$
$\displaystyle \nabla_{\rm c}\gamma$	=	$\displaystyle {1\over 2}\left[ \psi_{,111}-3\psi_{,122}+{\rm i}\left(3\psi_{,112}-\psi_{,222}\right) \right] ;$
$\displaystyle \nabla_{\rm c}^*\gamma$	=	$\displaystyle \nabla_{\rm c}\kappa ,$	(6)

and we recognize the combinations of third derivatives of $\psi$ which form the spin-1 and spin-3 combinations defined in (2). The final relation in (6) is the relation between first derivatives of $\kappa$ and $\gamma$ found by Kaiser (1995), here expressed in compact form. It expresses the fact that the third-order derivatives of the deflection potential can be summarized in the spin-3 field ${\cal G}\equiv\nabla_{\rm c}\gamma$ and the spin-1 field ${\cal F}\equiv\nabla_{\rm c}^*\gamma$ , where we introduced the usual notation for the two flexion quantities. The second-order lens equation in our complex notation then reads

$\begin{displaymath}\beta=(1-\kappa)\theta-\gamma\theta^* - {1\over 4}{\cal F}^*~... ... 2}{\cal F}~\theta\theta^* - {1\over 4}{\cal G}~(\theta^*)^2 . \end{displaymath}$

(7)

Since this is no longer a linear equation, a source at $\beta$ may have more than one image. In fact, up to four images of a source can be obtained, as can be seen for the special case of $\gamma=0={\cal F}$ and by placing the source at $\beta=0$ . In this case, if we set ${\cal G}=\vert{\cal G}\vert{\rm e}^{3{\rm i}\zeta}$ , then one solution is $\theta=0$ , and the other three are $\theta=4(1-\kappa)/\vert{\cal G}\vert~{\rm e}^{{\rm i}\varphi}$ , with $\varphi=\zeta$ , $\varphi=\zeta+2\pi/3$ and $\varphi=\zeta+4\pi/3$ . Of course, the origin for the occurrence of these solutions lies in the fact that ${\cal G}$ is a spin-3 quantity. We shall later need the Jacobian determinant $\det \mathcal A$ of this lens equation, which is

$\displaystyle \det\mathcal A$	=	$\displaystyle (1-\kappa)^2- \gamma\gamma^* +\mbox{\boldmath$\theta$ }\cdot \nabla\left[ (1-\kappa)^2-\gamma\gamma^* \right] + {\cal O}(\theta^2)$
	=	$\displaystyle (1-\kappa)^2- \gamma\gamma^* -\theta\left[ (1-\kappa){\cal F}^* +{\gamma^{\cal F}+\gamma~{\cal G}^\over 2} \right]$
		$\displaystyle -\theta^\left[ (1-\kappa){\cal F} +{\gamma^{\cal G}+\gamma~{\cal F}^*\over 2} \right] + {\cal O}(\theta^2)\;,$	(8)

where the first expression is just the first-order Taylor expansion of the Jacobian around the origin, and in the second step we made use of the relation $\mbox{\boldmath$\theta$ }\cdot\nabla=(\theta\nabla_{\rm c}^*+\theta^*\nabla_{\rm c})/2$ . We point out that (8) is not the full Jacobian of the lens Eq. (7), but only its first-order expansion; the full Jacobian contains quadratic terms in $\theta$ . We will return to this important issue further below.

2.3 Accounting for the mass-sheet degeneracy

The observables of a gravitational lens system are unchanged if the surface mass density $\kappa$ is transformed as $\kappa(\mbox{\boldmath$\theta$ }) \to \kappa'(\mbox{\boldmath$\theta$ })=\lambda\kappa(\mbox{\boldmath$\theta$ }) +(1-\lambda)$ (Gorenstein et al. 1988). In the case of weak lensing, the shape of images is unchanged under this transformation (Schneider & Seitz 1995). Because of this mass-sheet degeneracy, the shear is not an observable in weak lensing, but only the reduced shear $g=\gamma/(1-\kappa)$ . In fact, since we expect that the most promising applications of flexion will come from situations where $\kappa$ is not much smaller than unity, the distinction between shear and reduced shear is likely to be more important for flexion than for the usual weak lensing applications. Hence, at best we can expect from higher-order shape measurements to obtain an estimate for the reduced shear and its derivatives. For this reason, we shall rewrite the foregoing expressions in terms of the reduced shear.

The mass-sheet transformation is equivalent to an isotropic scaling of the source plane coordinates. Hence, we divide (7) by $(1-\kappa)$ to obtain

		$\displaystyle \hat\beta\equiv{\beta\over (1-\kappa)}=\theta-g\theta^* - \Psi_1^~\theta^2 -2\Psi_1 ~\theta\theta^ - \Psi_3 ~(\theta^*)^2$
		$\displaystyle {\rm with} \;\; \Psi_1={1\over 4}{{\cal F}\over(1-\kappa)} \; ; \; \; \Psi_3={1\over 4}{{\cal G}\over (1-\kappa)}\cdot$	(9)

We will now express the coefficients in the lens Eq. (9) in terms of the derivatives of the reduced shear,

$\begin{displaymath}G_1\equiv \nabla_{\rm c}^* g={{\cal F}+g{\cal F}^*\over (1-\k... ...iv \nabla_{\rm c} g={{\cal G}+g{\cal F}\over (1-\kappa)} \cdot \end{displaymath}$

(10)

The expression for ${\cal F} / (1-\kappa)$ in terms of the reduced shear and its derivatives has been derived by Kaiser (1995); in our notation it reads

$\begin{displaymath}{{\cal F}\over(1-\kappa)}\equiv -\nabla_{\rm c} \ln(1-\kappa)... ...arrow \; \Psi_1={G_1-g G_1^*\over 4\left(1-gg^*\right)} \cdot \end{displaymath}$

(11)

The expression for the derivative of $\gamma$ in terms of the reduced shear can be easily obtained from differentiating the definition $\gamma=(1-\kappa)g$ ,

		$\displaystyle {\nabla_{\rm c}\gamma\over (1-\kappa)}={{\cal G}\over (1-\kappa)}... ...a_{\rm c}\kappa\over(1-\kappa)} =G_3 - {g\left(G_1-g G_1^\right)\over 1-g g^}$
		$\displaystyle \Rightarrow \;\; \Psi_3={G_3\over 4} - {g\left(G_1-g G_1^\right)\over 4\left(1-g g^\right)} \cdot$	(12)

The derivatives G_1,3 of the reduced shear are those quantities we can hope to observe; to distinguish them from ${\cal F}$ and ${\cal G}$ , one might call G_1,3 the reduced flexion.

The Jacobian determinant $\det\hat\mathcal A$ of the mapping between the image position $\theta$ and the rescaled source position $\hat \beta$ then becomes

		$\displaystyle \det\hat\mathcal A={\det\mathcal A\over (1-\kappa)^2} =1-g g^* -\eta^\theta-\eta\theta^ ,$
		$\displaystyle {\rm where}~~\eta=\nabla_{\rm c}^g - {g \left(\nabla_{\rm c}^g\... ... 2} +{g^* \nabla_{\rm c} g \over 2}= G_1 -{g G_1^* \over 2} + {g^* G_3 \over 2}$	(13)

is a spin-1 quantity. Again, (13) is valid only to linear order in $\theta$ . Note that a similar equation for the determinant was obtained in Okura et al. (2007a; their Eq. (A1)), but they consider only the case of $\vert g\vert\ll 1$ ; this has also consequences for the relations between source and image brightness moments, to be derived further below.

3 Compatibility relations

3.1 Compatibility relations for flexion

Flexion has a total of four components, namely the real and imaginary parts of ${\cal F}$ and ${\cal G}$ . A measurement of flexion will thus yield four components, and we might ask whether these components are independent. We recall a similar situation in shear measurements. The shear has two components; on the other hand, the shear is defined as second partial derivatives of the deflection potential, which is a single scalar field. Therefore, the two shear components cannot be mutually independent if they are due to a gravitational lensing signal. Of course, the measured shear is not guaranteed to satisfy the condition that the two shear components can be derived from a single scalar deflection potential, since observational noise or intrinsic alignments of galaxies may affect the measured shear field. Therefore, one has introduced the notion of E- and B-modes in shear measurements (Crittenden et al. 2002). The E-mode shear is the one that can be written in terms of a deflection potential, whereas the B-mode shear cannot.

Formally, the E- and B-mode decomposition can be written in terms of a complex deflection potential $\psi(\theta)=\psi^{\rm E}(\theta)+{\rm i}\psi^{\rm B}(\theta)$ and a complex surface mass density $\kappa=\kappa^{\rm E} + {\rm i}\kappa^{\rm B}$ (Schneider et al. 2002). Each component of $\psi$ satisfies its own Poisson equation, $\nabla^2\psi^{\rm E}=2\kappa^{\rm E}$ , $\nabla^2\psi^{\rm B} =2\kappa^{\rm B}$ . Making use of this decomposition, the shear becomes

$\displaystyle \gamma$	=	$\displaystyle \gamma_1+{\rm i}\gamma_2=\left(\psi_{,11}-\psi_{,22}\right)/2+{\rm i}\psi_{,12}$
	=	$\displaystyle \left[ {1\over 2}\left(\psi^{\rm E}_{,11}-\psi^{\rm E}_{,22}\righ... ...}_{,12} +{1\over 2}\left(\psi^{\rm B}_{,11}-\psi^{\rm B}_{,22}\right) \right] .$	(14)

The distinction between E- and B-mode shear can be obtained by considering second partial derivatives of the shear components. Taking the derivative of (14), one obtains

$\displaystyle {\cal F}$	=	$\displaystyle \nabla_{\rm c}^*\gamma =(1/2)\left(\psi^{\rm E}_{,111}+\psi^{\rm E}_{,122}- \psi^{\rm B}_{,112}-\psi^{\rm B}_{,222} \right)$
		$\displaystyle +({\rm i}/2)\left(\psi^{\rm E}_{,112}+\psi^{\rm E}_{,222}+ \psi^{\rm B}_{,111}+\psi^{\rm B}_{,122} \right)$
	=	$\displaystyle \kappa^{\rm E}_{,1}-\kappa^{\rm B}_{,2} +{\rm i}\left(\kappa^{\rm E}_{,2}+\kappa^{\rm B}_{,1}\right) ,$	(15)

which can be expressed in more compact form as

$\begin{displaymath}{\cal F}=\nabla_{\rm c}\left(\kappa^{\rm E}+{\rm i}\kappa^{\rm B}\right) =\nabla_{\rm c}\kappa\;. \end{displaymath}$

(16)

A further derivative yields for the components

$\displaystyle {\cal F}_{1,1}$	=	$\displaystyle \kappa^{\rm E}_{,11}-\kappa^{\rm B}_{,12} ;\quad {\cal F}_{1,2}=\... ...{\rm B}_{,22} ;\quad {\cal F}_{2,1}=\kappa^{\rm E}_{,12}+\kappa^{\rm B}_{,11} ;$
$\displaystyle {\cal F}_{2,2}$	=	$\displaystyle \kappa^{\rm E}_{,22}+\kappa^{\rm B}_{,12} .$	(17)

However, it is easier to consider directly the complex derivative of ${\cal F}$ , from which we obtain

$\begin{displaymath}\nabla^*_{\rm c}{\cal F}=\nabla^*_{\rm c}\nabla^*_{\rm c}\gam... ... F}_{2,2} +{\rm i}\left({\cal F}_{2,1}-{\cal F}_{1,2}\right) . \end{displaymath}$

(18)

Thus, if the shear field is a pure E-mode field, $\nabla^*_{\rm c}\nabla^*_{\rm c}\gamma$ is real. An imaginary part of $\nabla^*_{\rm c}\nabla^*_{\rm c}\gamma$ is due to a B-mode field. This then yields the local distinction between E- and B-mode shear.

Since the flexion has four components, whereas the lens can be described by a single scalar field, we expect that there are three constraint relations a flexion field has to satisfy if it is due to a lensing potential. In fact, even if we leave the shear field arbitrary (that is, even if we allow it to be composed of E- and B-modes), then we expect two constraint equations, since the flexion field has two components more than the shear field. These constraint equations are easy to obtain. First, if the flexion field is due to a shear field, then we have

$\begin{displaymath}\nabla_{\rm c}\nabla^*_{\rm c}\gamma=\nabla^*_{\rm c}\nabla_{... ...{\cal H}:= \nabla_{\rm c}{\cal F}-\nabla^*_{\rm c}{\cal G}=0 , \end{displaymath}$

(19)

where we defined the spin-2 quantity ${\cal H}$ . It may describe contributions to the flexion which are not caused by a shear field, such as due to noise, intrinsic source alignments or higher-order terms (such as lens-lens coupling) in the propagation equation for light bundles. As a spin-2 field, a non-zero ${\cal H}$ can be decomposed into its E- and B-modes. If ${\cal H}\equiv 0$ , then the spin-3 flexion ${\cal G}$ is completely determined by the spin-1 flexion ${\cal F}$ up to an additive constant, as can be best seen in Fourier space, for which (19) yields $\hat{\cal G}(\ell)=-{\rm i}\hat\gamma(\ell)~\ell =(\ell/\ell^*)\hat{\cal F}(\ell)$ . Second, if the flexion field is solely caused by a gravitational lens effect, i.e., by a pure E-mode shear field, then $\nabla^*_{\rm c}{\cal F}$ is real, i.e.,

$\begin{displaymath}{\cal F}_{\rm i}:=\nabla^*_{\rm c}{\cal F}-\nabla_{\rm c}{\cal F}^*=0 . \end{displaymath}$

(20)

Thus, flexion from a pure E-mode shear field is characterized by the three constraint equations ${\cal H}\equiv 0$ and ${\cal F}_{\rm i}\equiv0$ , where the former is a two-component equation.

3.2 The axially-symmetric case

To illustrate these compatibility relations, we consider the case of an axi-symmetric flexion field. For that, we introduce polar coordinates $(\theta,\varphi)$ ; hence, in this subsection only, $\theta$ is the radial coordinate, not a complex number. The gradient operators then become

$\begin{displaymath}\nabla_{\rm c}={\rm e}^{{\rm i}\varphi}\left({\partial\over \... ...m i}\over \theta} {\partial\over \partial\varphi}\right) \cdot \end{displaymath}$

We first assume that the flexion derives from a shear field, which in the axi-symmetric case takes the form $\gamma(\theta,\varphi)=\Gamma(\theta)~{\rm e}^{2{\rm i}\varphi}$ . In the case of a pure E-mode shear, $\Gamma(\theta)$ is real, whereas for pure B-modes, $\Gamma$ is imaginary. The two flexions then read

$\displaystyle {\cal F}(\theta,\varphi)$ = $\displaystyle \nabla_{\rm c}^*\gamma= {\rm e}^{{\rm i}\varphi}\left({{\rm d}\Gamma\over {\rm d}\theta}+{2\Gamma\over\theta}\right) ;$
$\displaystyle {\cal G}(\theta,\varphi)$ = $\displaystyle \nabla_{\rm c}\gamma= {\rm e}^{3{\rm i}\varphi}\left({{\rm d}\Gamma\over {\rm d}\theta}-{2\Gamma\over\theta}\right) \cdot$	(21)

A further differentiation then yields the result that

$\begin{displaymath}\nabla_{\rm c}{\cal F} ={\rm e}^{2{\rm i}\varphi}\left({{\rm ... ...eta} -{4\Gamma\over \theta^2}\right)=\nabla_{\rm c}^*{\cal G}, \end{displaymath}$

so that the function ${\cal H}$ defined in (19) vanishes, ${\cal H}\equiv 0$ .

$\begin{figure} \par\includegraphics[width=4.5cm,height=4.5cm]{z2e.eps}\hspace*{4... ...s}\hspace*{4mm} \includegraphics[width=4.5cm,height=4.5cm]{z2bh.eps}\end{figure}$

Figure 1: The four different flexion fields discussed in the text. The upper left (right) panel shows the flexion corresponding to an axially-symmetric E-mode (B-mode) shear field, where arrows indicate the spin-1 flexion and the skeletons the spin-3 flexion component. In the lower left (right) panel, the flexion fields are displayed which are not due to a shear field, but a non-zero E-mode (B-mode) ${\cal H}$ field.

Open with DEXTER

If flexion does not derive from a shear field, then ${\cal H}\ne 0$ ; for example, this is the case if $\nabla_{\rm c}{\cal F}=-\nabla_{\rm c}^*{\cal G}={\cal H}/2$ , which we shall take in the following. Owing to their spin properties, we can write

$\begin{eqnarray*}{\cal F}(\theta,\varphi)&=&F(\theta)~{\rm e}^{{\rm i}\varphi} ;... ...{\cal G}(\theta,\varphi)&=&G(\theta)~{\rm e}^{3{\rm i}\varphi} , \end{eqnarray*}$

which then leads to the differential equations

$\begin{displaymath}{{\rm d}F\over{\rm d}\theta}-{F\over\theta}={H\over 2}\; ;\quad {{\rm d}G\over{\rm d}\theta}+{3G\over\theta}=-{H\over 2} \end{displaymath}$

with the solutions

$\displaystyle F(\theta)={F_0\theta\over \theta_0} + {\theta\over 2}\int_{\theta_0}^\theta{\rm d}\theta'\; {H(\theta')\over \theta'} ;$
$\displaystyle G(\theta)={G_0\theta_0^3\over \theta^3} - {1\over 2\theta^3}\int_{\theta_0}^\theta{\rm d}\theta'\; \theta^{\prime 3}~H(\theta') ,$			(22)

where F₀ and G₀ are constants of integration. We further can distinguish between the cases of ${\cal H}$ being an E-mode field, in which case $H(\theta)$ is real, or a B-mode field, when $H(\theta)$ is imaginary, or a superposition of both.

As an explicit example, we consider the isothermal case. For a singular isothermal sphere with Einstein radius $\theta_{\rm E}$ , one then has

$\begin{displaymath}\gamma=-{1\over 2}~{\theta_{\rm E}\over \theta}~ {\rm e}^{2{\... ...2}~{\theta_{\rm E}\over \theta^2}~ {\rm e}^{3{\rm i}\varphi} . \end{displaymath}$

(23)

A further differentiation then shows that

$\begin{displaymath}\nabla_{\rm c}{\cal F} ={3\over 2}~{\theta_{\rm E}\over \theta^3}~{\rm e}^{2{\rm i}\varphi} =\nabla_{\rm c}^*{\cal G} , \end{displaymath}$

again confirming that ${\cal H}=0$ . The corresponding case for a B-mode shear field is obtained by multiplying all expressions in (23) by ${\rm e}^{{\rm i}\pi/2}={\rm i}$ .

To obtain a similar example for the case that flexion is not derived from a shear field, we choose first a pure E-mode spin-2 field for ${\cal H}$ ,

$\begin{displaymath}{\cal H}={3\theta_{\rm E}\over \theta^3}~{\rm e}^{2{\rm i}\varphi} . \end{displaymath}$

By appropriately choosing the integration constants in (22), the flexions then become

$\begin{displaymath}{\cal F}=-{\theta_{\rm E}\over 2\theta^2}~{\rm e}^{{\rm i}\va... ...=-{3\theta_{\rm E}\over 2\theta^2}~{\rm e}^{3{\rm i}\varphi} . \end{displaymath}$

(24)

Thus, the flexion fields are very similar to those given in (23), except that the relative signs are different. An analogous case for a pure B-mode ${\cal H}$ is obtained by multiplying the foregoing expressions by ${\rm e}^{{\rm i}\pi/2}={\rm i}$ . A graphical illustration of the four different cases is provided in Fig. 1.

3.3 Compatibility relations for reduced flexion

Turning now to the reduced flexion, the compatibility equations can be obtained as follows. First, if the flexion is due to a shear field, we have

$\begin{displaymath}H:=\nabla_{\rm c}G_1-\nabla^*_{\rm c}G_3=0 , \end{displaymath}$

(25)

as follows from the definition (10) of the two flexion components in terms of the reduced shear. Again, if this equation is satisfied, G₃ is completely determined by G₁, up to an additive constant. Second, if the flexion is caused by a pure E-mode shear, i.e., if the shear is due to a real surface mass density, then we employ the quantity $\ln~ (1-\kappa)$ , which is real and invariant under mass-sheet transformations, up to an additive constant. Therefore, $K_2\equiv-\nabla^*_{\rm c}\nabla_{\rm c}\ln~ (1-\kappa)$ must be real. We find:

		$\displaystyle K_2=\nabla_{\rm c}^\left(\nabla_{\rm c}\kappa\over 1-\kappa\right) = \nabla_{\rm c}^\left[ {1\over 1-gg^}\left(G_1-G_1^g\right) \right]$
		$\displaystyle ={\left[ \nabla_{\rm c}^G_1\!-\!g\left(\nabla_{\rm c}G_1\right)^... ...2 g^+g G_1 G_3^* \!-\! G_1 G_1^* -g^2G_1^* G_3^\right)\over (1\!-\!gg^)^2} ,$	(26)

so that a flexion coming from an E-mode shear field satisfies K₂=K₂^*.

We point out that the foregoing relation suggests a natural way to use flexion for finite-field mass reconstructions in weak lensing. Seitz & Schneider (2001) formulated the finite-field mass reconstruction from measured reduced shear in terms of a von Neumann boundary value problem for $K=-\ln~ (1-\kappa)$ , whose solution determines K up to an additive constant. The ``source'' for $\nabla^2 K$ was determined by the reduced shear and its derivatives, and is given by (26). In Seitz & Schneider (2001), the derivatives of the reduced shear were obtained by finite differencing of g. If flexion is measured, one can replace the ``source'' for $\nabla^2 K$ by a weighted sum of the differentiated reduced shear field and the combination (K₂+K₂^*)/2 of the flexion field, with the weights chosen according to the estimated noise properties of both contributions.

4 Brightness moments of source and image

We consider an image of a source, and denote the brightness distribution of the source by $I^{\rm s}(\beta)$ . Since surface brightness is conserved by lensing, the brightness distribution of the image is $I(\theta)=I^{\rm s}(\beta(\theta))$ . Since the scaling of the source plane is unobservable, we shall only work in the following in terms of the scaled source plane coordinates, and therefore drop the hat on $\beta$ , as well as on $\mathcal A$ .

We define the origin of the image (or lens) plane as the center-of-light of the image under consideration, i.e. we require

$\begin{displaymath}\int{\rm d}^2\theta\;\theta~I(\theta) =0 . \end{displaymath}$

(27)

Let $F(\beta)$ be a function of the source coordinate; we define the operator ${\rm Mom}[F(\beta)]$ as

$\displaystyle {\rm Mom}[F(\beta)]$	=	$\displaystyle \int {\rm d}^2\beta\; F(\beta)~I^{\rm s}(\beta) =\int {\rm d}^2\theta\;{\det\mathcal A}(\theta)~F(\beta(\theta))~I(\theta)$
	$\textstyle \approx$	$\displaystyle \int {\rm d}^2\theta\;\left(1-g g^* -\eta^\theta-\eta\theta^\right) ~F(\beta(\theta))~I(\theta) ,$	(28)

where here and in the following, we use the linear approximation for $\det \mathcal A$ . In particular, setting F=1, one finds that

$\displaystyle {\rm Mom}[1]$	$\textstyle \equiv$	$\displaystyle S_0 = \!\!\int\! {\rm d}^2\beta\; I^{\rm s}(\beta) = \!\! \int \!{\rm d}^2\theta\;\left(1-g g^* -\eta^\theta-\eta\theta^\right) ~I(\theta)$
	=	$\displaystyle \left(1-g g^*\right)~S =\det\mathcal A_0 ~ S\;,$	(29)

since first-order moments of the light distribution in the lens plane vanish, due to our choice (27) of the coordinate system. Here, S is the flux of the lensed image, so that $S=S_0/\det\mathcal A_0$ , as usual, where $\det\mathcal A_0$ is the Jacobian at the origin $\theta=0$ .

4.1 Centroid shift

The origin of the coordinates in the source plane is the image of the origin in the lens plane as mapped with the lens equation. In particular, this does not coincide with the center-of-light of the source, which is given by $\bar\beta\equiv {\rm Mom}[\beta]/S_0$ , or

$\displaystyle {\bar\beta}$	=	$\displaystyle {1\over S_0} \!\int\! \!{\rm d}^2\beta\; \beta~I^{\rm s}(\beta) =... ...} \!\!\int \!\!{\rm d}^2\theta\;\left(1-g g^* -\eta^\theta-\eta\theta^\right)$
		$\displaystyle \times\left[ \theta-g\theta^* - \Psi_1^~\theta^2 -2\Psi_1 ~\theta\theta^ - \Psi_3 ~(\theta^*)^2 \right] ~I(\theta) .$	(30)

Expanding the integrand, we note that terms linear in $\theta$ vanish, due to (27). Defining the second-order brightness moments of the image in the form

$\begin{displaymath}Q_2\equiv {1\over S}\int{\rm d}^2\theta\;\theta^2~I(\theta) ;... ...uiv {1\over S}\int{\rm d}^2\theta\;\theta~\theta^*~I(\theta) , \end{displaymath}$

(31)

we obtain for the source centroid shift

$\displaystyle {\bar\beta}$	=	$\displaystyle {3 G_1 g^-5 G_1^-2 g G_3^\over 4(1-g g^)} Q_2$
		$\displaystyle +{4 g G_1^* + g^2 G_3^-G_3 g^ -G_1 (3+g g^)\over 2(1-g g^)} Q_0$
		$\displaystyle +{5 g G_1 -3 g^2 G_1^* -(1-3 g g^) G_3\over 4(1-g g^)} Q_2^* .$	(32)

We now write these equations in a more compact form; for this, we define the matrix $\tens{G}$ by $\tens{G}^{\rm T}=(G_3^*,G_1^*,G_1,G_3)$ , where the ``T'' denotes the transpose of the matrix. Then,

$\begin{displaymath}\bar\beta=\tens{B} \tens{G} , \end{displaymath}$

(33)

where the coefficients of $\tens{B}=(b_1,b_2,b_3,b_4)$ are given by

b₁	=	$\displaystyle { g^2 Q_0-g Q_2\over 2(1-g g^)};\quad b_2={8g Q_0-5 Q_2-3 g^2 Q_2^\over 4(1-g g^*)} ;$
b₃	=	$\displaystyle {3 g^* Q_2-2(3+g g^)Q_0+5 g Q_2^\over 4(1-g g^*)} ;$
b₄	=	$\displaystyle {(3 g g^* -1)Q_2^-2 g^ Q_0 \over 4(1-g g^*)} \cdot$	(34)

The centroid shift in the source plane is thus given by the product of the derivatives of the reduced shear (expressed by G₁ and G₃) and the area of the image, which is proportional to Q₀ and Q₂. Of course, since the reduced shear and its derivatives are not directly observable, the centroid shift in unobservable as well. To get an order-of-magnitude estimate of $\bar\beta$ , we assume that the source has a linear angular size $\Theta_{\rm s}$ , consider the reduced shear to be of order unity, and let $\Theta_{\rm c}$ be the angular scale on which the reduced shear varies. Then,

$\begin{displaymath}G_n={\cal O}\left(1\over \Theta_{\rm c}\right) \; ;\;\; Q_n={... ...\cal O}\left(\Theta_{\rm s}^2\over \Theta_{\rm c}\right) \cdot \end{displaymath}$

(35)

4.2 Transformation of second-order brightness moments

Next we consider the second-order brightness moments of the source, defined as $Q_2^{\rm s}={\rm Mom}[(\beta-\bar\beta)^2]/S_0 ={\rm Mom}[\beta^2]/S_0-\bar\beta^2$ and $Q_0^{\rm s}={\rm Mom}[(\beta-\bar\beta)(\beta-\bar\beta)^*]/S_0={\rm Mom}[\beta\beta^*]/S_0-\bar\beta\bar\beta^*$ . By defining the third-order brightness moments of the image through

$\begin{displaymath}T_3\equiv {1\over S}\int{\rm d}^2\theta\;\theta^3~I(\theta) ;... ...v {1\over S}\int{\rm d}^2\theta\;\theta^2~\theta^*~I(\theta) , \end{displaymath}$

(36)

we obtain

$\displaystyle Q_2^{\rm s}$	=	$\displaystyle Q_2 - 2g Q_0 + g^2 Q_2^* +{2 g^* G_1-3 G_1^-g G_3^\over 2(1-g g^*)} T_3$
		$\displaystyle +{8 g G_1^-(4+3 g g^)G_1-g^* G_3+2 g^2 G_3^\over2(1-g g^)} T_1$
		$\displaystyle +{(7+g g^)g G_1-7g^2 G_1^ +(3 g g^-1) G_3-g^3 G_3^ \over2(1-g g^)} T_1^$
		$\displaystyle +{(1-2 g g^)g G_3-3 g^2 G_1 +2 g^3 G_1^\over2(1-g g^)} T_3^ -\bar\beta^2 ,$	(37)

		$\displaystyle Q_0^{\rm s} = -g^* Q_2+(1+g {g^})Q_0-g Q_2^$
		$\displaystyle +{6 g^* G_1^* + (3 g g^-1) G_3^ - 4 {g^}^2 G_1\over 4(1-g g^)} T_3$
		$\displaystyle +{2 {g^}^2 G_3+(11+3 g g^)g^* G_1-(7 +9g g^)G_1^-(1+3 g g^)g G_3^ \over 4(1-g g^*)} T_1$
		$\displaystyle + {2 g^2 G_3^* + (11+ 3 g g^) g G_1^ - (1+3 g g^)g^ G_3 -(7+9 g g^)G_1\over 4(1-g g^)} T_1^*$
		$\displaystyle +{6 g G_1-4 g^2 G_1^* - (1- 3 g g^)G_3\over 4(1-g g^)} T_3^* -\bar\beta\bar\beta^*.$	(38)

Note that $Q_0^{\rm s}$ is real. In a more compact notation, (37) reads

$\begin{displaymath}Q_2^{\rm s}=Q_2 - 2g Q_0 + g^2 Q_2^* +A \tens{G}-{\bar\beta}^2\;, \end{displaymath}$

(39)

where the matrix $\tens{A}=(a_1,a_2,a_3,a_4)$ has coefficients

a₁	=	$\displaystyle {-g^3 T_1^+2 g^2 T_1 -g T_3 \over 2(1-gg^)} ;$
a₂	=	$\displaystyle {2 g^3 T_3^* -7 g^2 T_1^* + 8 g T_1 -3 T_3 \over 2(1-gg^*)} ;$
a₃	=	$\displaystyle {-3 g^2 T_3^+g(7+ g g^) T_1^* -(4+3 g g^)T_1+2 g^T_3 \over 2(1-gg^*)} ;$
a₄	=	$\displaystyle {g(1-2 g g^)T_3^-(1- 3 g g^)T_1^ - g^* T_1 \over 2(1-gg^*)}\cdot$	(40)

4.3 Transformation of third-order brightness moments

We now define the third-order brightness moments of the source, separated into a spin-3 and a spin-1 component,

$\displaystyle T_3^{\rm s}$	=	$\displaystyle { {\rm Mom}[\left(\beta-\bar\beta\right)^3]\over S_0} ={ {\rm Mom}[\beta^3]\over S_0}-3\bar\beta~{ {\rm Mom}[\beta^2]\over S_0}$
		$\displaystyle +3\bar\beta^2~{ {\rm Mom}[\beta]\over S_0} -\bar\beta^3 ={ {\rm Mom}[\beta^3]\over S_0}-3\bar\beta Q_2^{\rm s} - \bar\beta^3 ,$	(41)

where we used that ${\rm Mom}[\beta^2]/S_0=Q_2^{\rm s}+\bar\beta^2$ and ${\rm Mom}[\beta\beta^*]/S_0=Q_0^{\rm s}+\bar\beta \bar\beta^*$ . Similarly, we obtain

$\displaystyle T_1^{\rm s}$	=	$\displaystyle { {\rm Mom}[\left(\beta-\bar\beta\right)^2 (\beta^-\bar\beta^)]\over S_0} ={ {\rm Mom}[\beta^2\beta^*]\over S_0}-2Q_0^{\rm s}\bar\beta$
		$\displaystyle -Q_2^{\rm s}\bar\beta^-\bar\beta^2\bar\beta^ .$	(42)

Defining the fourth-order brightness moments of the image by

F₀	=	$\displaystyle {1\over S}\int{\rm d}^2\theta\;(\theta \theta^)^2 I(\theta) ;\;\; F_2={1\over S}\int{\rm d}^2\theta\; \theta^3 \theta^~ I(\theta) ;$
F₄	=	$\displaystyle {1\over S}\int{\rm d}^2\theta\; \theta^4~ I(\theta) ,$	(43)

where F_n is a spin-n quantity, we obtain for the third-order moments of the source:

$\begin{displaymath}{\cal T}^{\rm s} = \tau + \tens{C}~\tens{G}~+{\cal O}(\bar\beta ^3)\;, \end{displaymath}$

(44)

where the matrix ${\cal T}^{\rm s}$ is defined by its transpose ${\cal T}^{\rm s, T}=\left( T_3^{s*},T_1^{s*}, T_1^{s}, T_3^{s}\right)$ . The elements of $\tau$ are

$\displaystyle \tau_1$	=	T₃^-3 g^ T₁^+3 g^2 T₁- g^*3 T₃ ;
$\displaystyle \tau_2$	=	-g T₃^* +(1+2 g g^)T₁^ - g^(2 +g g^)T₁ +g^*2 T₃ ;
$\displaystyle \tau_3$	=	$\displaystyle \tau_2^\; ;\;\; \tau_4=\tau_1^ ,$	(45)

where the last two relations are obvious. The $4\times 4$ matrix $\tens{C}$ is given explicitly in Appendix A; each of its elements consists of a sum of terms proportional to fourth-order brightness moments, F_n, and terms proportional to squares of second-order brightness moments. Okura et al. (2007a) and Goldberg & Leonard (2007) have derived expressions similar to (44), though using a number of simplifying assumptions (such as $\vert g\vert\ll 1$ ) and (in the latter paper), not considering the reduced flexion.

We will now consider the order-of-magnitudes of the various terms appearing in (39) and (44). Assuming that the third-order moments of the sources are small, then the third-order moments of the image are given by the product of $\tens{C}$ and $\tens{G}$ . With $\tens{G}={\cal O}\left(1/\Theta_{\rm c}\right)$ and $\tens{C}={\cal O} \left(\Theta_{\rm s}^4\right)$ , we find that $T={\cal O} \left(\Theta_{\rm s}^4/\Theta_{\rm c}\right)= {\cal O} \left(\Theta_{\rm s}^3\right)~ \left(\Theta_{\rm s}/\Theta_{\rm c}\right)$ . To get an estimate of the size of the various terms in (39), we note that the first three terms on the right-hand side (those proportional to the Q_n) are of order ${\cal O}\left(\Theta_{\rm s}^2\right)$ , whereas $\tens{A}\tens{G}={\cal O}\left(\Theta_{\rm s}^4/\Theta_{\rm c}\right) ~{\cal O}\left(1/\Theta_{\rm c}\right)$ and $\bar\beta^2={\cal O}\left(\Theta_{\rm s}^4/\Theta_{\rm c}^2\right)$ . Hence, the last two terms are of equal magnitude in general, each of them being smaller than the first three terms by a factor $\left(\Theta_{\rm s}/\Theta_{\rm c}\right)^2$ . Only if the source is of the same order as the scale over which the reduced shear varies do the last two terms in (39) contribute. In (44), we have neglected the terms $\bar\beta^3$ , since they are two powers of $\left(\Theta_{\rm s}/\Theta_{\rm c}\right)$ smaller than the terms written down.

5 Shear and flexion estimates

5.1 Estimate of the reduced shear

We see that (44) is a linear equation for $\tens{G}$ , which can thus be solved,

$\begin{displaymath}\tens{G}=\tens{C}^{-1}\left( {\cal T}^{\rm s}-\tau \right)\; . \end{displaymath}$

(46)

Inserting this into (39) then yields

$\begin{displaymath}Q_2^{\rm s}=Q_2 - 2g Q_0 + g^2 Q_2^* +\tens{A}~ \tens{C}^{-1}... ...al T}^{\rm s}-\tau \right) -\left(\tens{B}\tens{G}\right)^2 . \end{displaymath}$

(47)

We are thus left with a single complex equation for g, which contains the observable brightness moments of the image, as well as the unobservable brightness moments of the source. This equation can be used to estimate the reduced shear if we make assumptions concerning the properties of the source brightness moments. We assume that the sources are oriented randomly, which implies that all quantities with spin unequal zero have a vanishing expectation value. That is, we set $Q_2^{\rm s}=0$ , ${\cal T}^{\rm s}=0$ , to arrive at

$\begin{displaymath}Q_2 - 2g Q_0 + g^2 Q_2^* = \tens{A}~ \tens{C}^{-1}\tau + \left(\tens{B}\tens{C}^{-1}\tau\right)^2 =: Y(g) , \end{displaymath}$

(48)

where we have indicated that the right-hand side depends on the reduced shear (in fact it does so in a very complex manner). However, since we have argued above that the terms on the left-hand are much larger than those on the right-hand side, an iterative solution of this equation is suggested. Assume the right-hand side is given, then we get the solutions

$\begin{displaymath}g={\chi\over \vert\chi\vert^2}\left(1\pm\sqrt{1-\vert\chi\ver... ...^*\over Q_0}}\right), \;\; {\rm where}\;\; \chi={Q_2\over Q_0} \end{displaymath}$

(49)

is the complex ellipticity of the image. Obviously, there are two solutions g for a given value of Y. This situation is similar to that of ``ordinary'' weak lensing, where this ambiguity also occurs: as shown by Schneider & Seitz (1995), from shape measurements of background galaxies, and cannot distinguish locally between an estimate g and 1/g^*=g/|g|². The same occurs here; we therefore assume that we pick one of the two solutions, say the one corresponding to the ``-'' sign; this then yields for small shear $g\approx \chi/2$ . It should be stressed that flexion impacts the determination of shear from the second-order brightness moments, due to its impact on higher-order brightness moments; hence, in general the determination of shear and flexion are coupled.

We start the iteration by setting Y₀=0. This yields a first-order solution for the estimate of g,

$\begin{displaymath}g_0={\chi\over \vert\chi\vert^2}\left(1-\sqrt{1-\vert\chi\vert^2}\right) . \end{displaymath}$

(50)

We then use the iteration equations

$\begin{displaymath}Y_n=Y(g_{n-1}) \; ; \;\; g_n={\chi\over \vert\chi\vert^2}\left(1-\sqrt{1-\vert\chi\vert^2+{Y_n\chi^*\over Q_0}}\right) \cdot \end{displaymath}$

(51)

This procedure converges quickly to one of the two solutions (g,G₁,G₃); the other solution is obtained by taking the ``+'' sign in the above equations.

Of course, our approach of setting $Q_2^{\rm s}=0$ yields a biased estimator for g; this is true even in the absence of flexion (e.g., Schneider & Seitz 1995). The reason is that, although the expectation value of $Q_2^{\rm s}$ vanishes, the resulting estimator for g is a non-linear function of $\chi^{\rm s}=Q_2^{\rm s}/Q_0^{\rm s}$ and thus biased. The bias depends on the ellipticity distribution of the sources. It should be stressed, however, that a modified definition of image ellipticity exist such that its expectation value is an unbiased estimate of the reduced shear (Seitz & Schneider 1997).

5.2 Estimates for the reduced flexion

The flexion estimator is given by (46). Since the matrix $\tens{C}$ contains many terms, this is a fairly complicated equation in general. A simpler estimate is obtained if we assume that the reduced shear is small, $\vert g\vert\ll 1$ , in which case the matrix $\tens{C}$ simplifies considerably - see Appendix. Furthermore, if we assume that the brightness moments of spin $\ne 0$ are much smaller than the corresponding ones with spin 0, then we find the simple relations

$\begin{displaymath}T_1^{\rm s} \approx T_1-{9 F_0-12 Q_0^2 \over 4}G_1 ;\quad T_3^{\rm s} \approx T_3- {3 F_0 \over 4} G_3 . \end{displaymath}$

(52)

If we then set the $T_n^{\rm s}=0$ , as would be true for the expectation value, then we obtain as estimates for the reduced flexion

$\begin{displaymath}G_1 \approx {4 \over 9F_0-12Q_0^2}T_1 ; \quad G_3 \approx {4\over 3 F_0}T_3. \end{displaymath}$

(53)

Thus, the flexion is then given by the third-order brightness moments of the image, divided by a quantity that just depends on the size of the image. Similar relations to (53) have been given in Goldberg & Leonard (2007), whereas Okura et al. (2007a) obtain a different expression for G₁. We will check the accuracy of (53) in Sect. 6 below.

A more accurate estimate is obtained if we consider the reduced shear as well as the ratios of non-zero spin brightness moments to zero spin moments (such as |Q₂/Q₀| or |F_2,4/F₀|) to be of order $\delta$ , and then expand the flexion to first order in the (small) parameter $\delta$ to obtain

		$\displaystyle G_1 = {4T_1\over 9F_0-12Q_0^2} + {4\left[ 2 F_2^+3 F_0g^ -2 Q_0(2 g^Q_0+Q_2^) \right]\over 9 F_0(4Q_0^2-3F_0)} T_3$
		$\displaystyle + {4\left[ 3 F_0 g-8 F_2 -4 Q_0(g Q_0 -4 Q_2) \right] \over 9(3 F_0-4 Q_0^2)^2} T_1^* + {4 F_4 T_3^*\over 9 F_0(4 Q_0^2-3 F_0)} \;,$
		$\displaystyle G_3 = {4 T_3\over 3 F_0}+{8(5 F_2-9 Q_0Q_2)T_1 \over9 F_0(4 Q_0^2-3 F_0)} +{28 F_4 T_1^*\over 9 F_0(4 Q_0^2-3 F_0)} \cdot$	(54)

6 Numerical tests of flexion estimators

In this section we describe some simulations that we have performed in order to test the behavior of the estimators given in the previous section.

6.1 Description of the simulations

We model the sources as elliptical Gaussians, truncated at three times the scale ``radius'' $\Theta_{\rm s}$ chosen such that the area of a source is independent of its ellipticity. The ellipticity of the sources follows a Gaussian distribution, with a dispersion of $\chi^{\rm s}$ of R=0.4 (i.e., we use the same ellipticity distribution as in Schneider & Seitz 1995). However, for reasons explained in the next section, we truncate the intrinsic ellipticity distribution at $\vert\chi^{\rm s}\vert\le 0.9$ . For each source, we map a grid of pixels from the lens plane to the source plane using the lens equation to obtain the brightness distribution in the lens plane. From this distribution, the brightness moments of the image are measured. A shift in the lens plane coordinates is applied as to satisfy (27). We then apply the shear and flexion estimators described above to the resulting brightness moments Q_n, T_n and F_n. The shear and flexion estimates are then averaged over the Gaussian ellipticity distribution of the sources, in particular over their random orientation.

It should be noted that flexion is a dimensional quantity $\propto$ $\Theta_{\rm c}^{-1}$ . As can be checked explicitly from Sect. 4, the way flexion appears in the equations is always with one order higher in the source (or image) size than the other terms in the equations. As an example, we consider (44); the left-hand side and the first term on the right-hand side are $\propto$ $\Theta_{\rm s}^3$ , whereas the coefficients of the matrix $\tens{C}\propto \Theta_{\rm s}^4$ . This then implies that the accuracy of the flexion estimates does not depend on the magnitude of the flexion and the source size individually, but only on the product $G_n\Theta_{\rm s}$ . Therefore, the following results are quoted always in terms of this product.

$\begin{figure} \par\includegraphics[angle=270,width=8cm,clip]{ccc.ps}\hspace*{4m... ....ps}\hspace*{4mm} \includegraphics[angle=270,width=8cm,clip]{cce.ps}\end{figure}$

Figure 2: Constraints on the combination of source size and reduced flexion for the validity of the concept of flexion. Each curve shows the dividing line between a circular source of limiting isophote $\Theta$ being cut by a caustic (above the curve) or not (below the curve); in the former case, the assumptions underlying the flexion concept break down. The different curves in each panel are for different values of g, chosen as g=0.4,0.2,0.1,0.05,0, as indicated by different line types. Without loss of generality, we choose g to be real and non-negative. The four panels differ in the phase of the reduced flexion, as indicated. E.g., in the upper left panel, the phases of G₁, G₃ are the same as that of g.

Open with DEXTER

6.2 Multiple images, and the breakdown of flexion

$\begin{figure} \par\includegraphics[width=8.5cm,clip]{errorg.ps}\hspace*{4mm} \includegraphics[width=8.5cm,clip]{errorg13.ps} \end{figure}$

Figure 3: Accuracy of the estimates for reduced shear and flexion. The left panel shows contour of constant fractional error of $5\%$ , $10\%$ and $15\%$ , on the estimate of the reduced shear g, as a function of $G_i\Theta _{\rm s}$ , where we chose g=0.05 as input value, and assumed the phases of G₁, G₃ to be the same as that of g. The estimate was obtained by solving the iteration equations given in Sect. 5. The right panel shows the fractional error levels at 3, 5, and 10% for the reduced flexion, as quantified by (55), where the estimate was obtained again with the iterative procedure. In both cases, we assumed circular sources.

Open with DEXTER

$\begin{figure} \par\includegraphics[width=8.5cm,clip]{cg1.ps}\hspace*{4mm} \includegraphics[width=8.5cm,clip]{cg3.ps}\end{figure}$

Figure 4: Comparison of the reduced flexion estimators (53) with the full expression (46) and the input value. The horizontal and vertical axis show $G_i \Theta _{\rm s}, i=1,3$ . For both panels, we take g=0.05, and G₃=0 (G₁=0) for the left (right) panel. The line indicates the input value, the plus symbols show the simplified reduced flexion estimate (53), and the crosses result from the full expression of reduced flexion (46). As can be seen from the left-hand panel, the full estimator for the reduced flexion yields a more biased result that the approximate expression (53); we have not found a reasonable explanation for this behavior.

Open with DEXTER

As we mentioned before, the lens Eq. (7) can give rise to multiple images. As can be seen from the example given after (7), if the flexion is sufficiently small, all but one of these multiple images will be located at a large distance from the origin, and the central image of an extended source will be isolated. In this case, this central, or primary, image (the shape of which we measure here) is not crossed by a critical curve, and thus the source is not crossed by a caustic. The multiple images at large distances from the origin then result from the low-order Taylor expansion of the lens equation, which most likely breaks down at these image positions anyway; hence, these additional images are of no relevance. If, however, the flexion becomes sufficiently large - or if the source is large enough - this is no longer the case, and the multiple images of an extended source will merge. If that happens, the whole method of determining shear and derivatives thereof from brightness moments will break down. This can be most easily seen by considering the caustic curve cutting the source. Different parts of the source will be mapped onto a different number of image points in the lens plane, and the caustic curve introduces a direction into the situation. Hence, the assumption of an isotropic orientation of sources can no longer be employed. Mathematically, this can be seen from (28); there, the transformation between source and image plane no longer is correct if multiple images do occur. More precisely, the transformation between source and image coordinates in the calculation of the brightness moments implicitly assumes that within the limiting isophote of the primary image, the lens equation is invertible. Owing to what was said above, the condition that the central image is isolated (so that locally no multiple images occur) can be expressed solely by the products $G_n\Theta_{\rm s}$ . These products approximately measure the fractional change of the reduced shear across the image of a source.

In our simulations we can check whether a critical curve crosses our central image, just by controlling the sign of the Jacobian determinant (the true one, not the linear approximation Eq. (13)). If the source size becomes too large, some points in the image will have a negative Jacobian. In the Appendix B, we consider the critical curves and caustics of the lens Eq. (9), which allows us to determine the regions in flexion space where no local multiple imaging occurs. Some examples of this are plotted in Fig. 2. Each panel shows the dividing line between parameter pairs $(G_1\Theta,G_3\Theta)$ for a circular source of limiting isophotal radius $\Theta$ ; below the curves, no local multiple images occur, whereas for parameter pairs above the lines, the flexion formalism using moments necessarily breaks down. The different lines in each panel correspond to different values of g. The occurrence of critical curves also is the reason why we truncated the intrinsic ellipticity distribution of the sources in the simulations, since in the limit of $\vert\chi^{\rm s}\vert \to 1$ , keeping the source area fixed, there will be orientation angles for which the source will hit a caustic.

6.3 Estimates of the accuracy

We now present some results of our numerical simulation regarding the accuracy with which the reduced shear and flexion can be obtained with our moment approach. For given input values of g, G₁ and G₃, we either measure the brightness moments for a single circular source, or average the results over an ellipticity distribution, as described above. It should be noted that we have to deal with a 5-dimensional parameter space, namely the 3 complex parameters g, G₁ and G₃, minus one overall phase that can be chosen, e.g., to make g real and positive. Thus, instead of sampling the parameter space comprehensively, we only give a few selected results.

We start by considering a circular source, and determine the effect of flexion on the determination of the reduced shear. The left-hand panel of Fig. 3 shows contours of constant fractional deviation $\Delta g/g$ , in the flexion parameter plane. Here it is assumed that the phase of both flexion components is the same as that of g (as would be the case in an axially-symmetric lens potential). Errors of order 5% occur already for $\sqrt{\vert G_1^2+G_3^2\vert}\Theta_{\rm s}\sim 0.03$ , and the fractional error increases approximately linearly with the strength of flexion (or with the source size), although it does not scale equally with both flexion components. The reason for this effect has been mentioned before - flexion affects the transformation between source and image quadrupole moments, as can be seen in (37).

In Fig. 4, we show the expectation value of the reduced flexion components, as a function of the input flexion. The expectation value has been determined by averaging over an isotropic ensemble of elliptical sources, as described before. The left and right panel show the behavior of the expectation value of G₁ and G₃, respectively, where the other flexion component was set to zero. The dashed curve shows the identity, the plus symbols were obtained by using the approximate estimator (53), whereas the crosses show the expectation values as obtained by employing the full expression (46), where the corresponding value of gwas obtained by the iterative process described in Sect. 5. It is reassuring that the expectation value closely traces the input value, i.e., that the estimates have a fairly small bias. Furthermore, we see that the approximate estimator (53) performs remarkably well. It is seen that the estimates for G₃ behave better than those for G₁. This can also be seen from the right-hand panel of Fig. 3, where we plot contours of constant fractional error

$\begin{displaymath}\Delta G:=\sqrt{\left\vert {\Delta G_1\over G_1} \right\vert^2 +\left\vert {\Delta G_3\over G_3} \right\vert^2} , \end{displaymath}$

(55)

where $\Delta G_n$ is the deviation of the estimate of G_n from its input value. For simplicity, we have assumed a circular source. We see that the accuracy decreases much faster with increasing G₁ than with increasing G₃. The reason for that may be related to the fact that the estimator of G₁ is more strongly affected by the non-linearity of the equations, as can also be seen in (54).

7 Conclusions and further work

In this paper, we have studied the effect of flexion in weak gravitational lensing. The main results are summarized as follows:

Owing to the mass-sheet degeneracy, flexion itself cannot be determined, but only reduced flexion. We have therefore written the second-order lens equation (which contains the derivatives of the reduced shear, i.e., flexion) as well as the relations between the brightness moments of source and image strictly in terms of the reduced shear and the reduced flexion.
We pointed out that a general flexion field can be decomposed into a pair of components which is due to a shear field, i.e., its derivatives, and a pair of components not related to shear. The former pair can be further separated into flexion due to an E- and B-mode shear, with only the E-mode flexion expected to arise from gravitational lensing. For the second pair of components, no physical interpretation is available; if they arise in measurements, they are most likely due to noise or intrinsic shape effects of sources. General relations to separate these components are given.
We derived the relations between low-order brightness moments of source and image, taking into account that the presence of flexion leads to a centroid shift, and it also affects the relation between second-order brightness moments - and thus the estimate of the reduced shear. Hence, the presence of flexion has an impact on the shear measurements. Starting from these moment equations, we obtained approximate estimates for the reduced shear and flexion.
We pointed out a limit where the flexion formalism ceases to be valid, namely when the product of source size and flexion is sufficiently large that parts of the source are multiply imaged locally, i.e., where a caustic cuts through the source. We quantified this with numerical simulations, and also provided a complete classification of the critical curves of the second-order lens equation employed in flexion studies.
We performed a number of numerical experiments to study the bias of the reduced shear and flexion estimators. However, due to the high dimensionality of parameter space, no comprehensive study was presented here. We also pointed out that only the product of flexion and source size matters in the accuracy of estimates.

The possible occurrence of critical curves in highly distorted images may provide a serious obstacle to applications of flexion. Perhaps the most promising application of flexion measurements are those in regions where the shear field varies on small scales, i.e., close to galaxies (and thus can be used for galaxy-galaxy lensing) or in the inner regions of clusters. However, if one finds a strongly distorted image of a background galaxy as in the case of the arclet A5 in Abell 370 (Fort et al. 1988), how can one be sure that it is not due to a merged double image of the source? Using flexion for studying small-scale structure in mass distributions can therefore be affected by the occurrence of multiple imaging.

Similar to the situation in shear measurements, the moment approach for flexion as presented here must be modified in several ways to be applicable to real data. First, brightness moments must be weighted in order not to be dominated by the very noisy outer regions of the image. As is known from shear measurements, such a weighting affects the relation between source and image brightness moments. Secondly, one needs to account for the effects of a point-spread function. Both of these modifications were successfully achieved for second-order brightness moments by Kaiser et al. (1995; see also Luppino & Kaiser 1997). Goldberg & Leonard (2007) consider these effects in the context of flexion. It should be noted, though, that their consideration of the PSF effects is restricted to unweighted moments, for which these effects are given by a simple convolution. In the case of weighted brightness moments, however, the PSF effects are much more subtle. Okura et al. (2007b) indeed developed a PSF correction scheme similar to that of Kaiser et al. (2005), now accounting for higher-order brightness moments of the images and the PSF; their application of this method to synthetic data and to images of the cluster A1689 is encouraging.

But even disregarding these complications, the present paper only scratches the surface in investigating estimators for reduced flexion and their properties. As mentioned before, the second-order lens equation contains five essential parameters. The bias of an estimator for reduced shear and flexion will depend on these parameters, as well as on the intrinsic ellipticity (and higher-order moments) distribution of sources. One might ask whether it is possible to find an unbiased flexion estimator, such as was possible to construct for the reduced shear. Unfortunately, we have been unable to make analytic progress: even for a circular Gaussian source, the brightness moments of the image cannot be calculated analytically. Our ray-tracing algorithm with which we conducted our numerical simulations is almost certainly sub-optimal; a more advanced method should be developed to reduce the numerical efforts in calculating brightness moments. Beside the bias, it would be interesting to calculate the variance of the various estimators, or more precisely, their covariance.

It may turn out that measurements of flexion, and PSF corrections, are more conveniently done with shapelets, as was originally considered by Goldberg & Bacon (2005), Bacon et al. (2006) and Massey et al. (2007a). Even if this turns out to be the case (see Leonard et al. 2007, for an application of flexion measurements in the galaxy cluster A1689), the moment approach provides a more intuitive picture of the effects of flexion. In addition, the weak lensing community has profited substantially from the existence of several different methods to measure shear (see Heymans et al. 2006; Massey et al. 2007b, for the first results of a comprehensive Shear TEsting Programme, in which these various methods are studied and compared); therefore, the development of different techniques for measuring flexion will certainly be of interest once the flexion method is put to extensive use.

Acknowledgements

We thank Jan Hartlap and Ismael Tereno for useful comments on this paper. This work was supported by the Deutsche Forschungsgemeinschaft under the project SCHN 342/6-1 and the TR33 ``The Dark Universe''. X.E. was supported for this research through a stipend from the International Max-Planck Research School (IMPRS) for Radio and Infrared Astronomy at the University of Bonn.

Appendix A: The matrix C

$\begin{figure} \par\includegraphics[width=6.7cm,clip]{hcri.eps}\hspace*{4mm} \includegraphics[width=6.7cm,clip]{hcau.eps}\end{figure}$

Figure A.1: The critical curves ( left-hand panel) and caustics ( right-hand panel) of the lens Eq. (9) for the cases of hyperbolic critical curves, as described in Sect. B.2. The parameters chosen here are g=0.05, $G_1=0.07+0.015{\rm i}$ , $G_3=0.03+0.005{\rm i}$ . A circular source is mapped onto two images, as indicated. If the source size were increased, it would hit the caustic, the two images would merge, and the flexion concept would break down. The unit of the reduced flexion is the inverse of the unit in which coordinates are measured.

Open with DEXTER

$\begin{figure} \par\includegraphics[width=6.7cm,clip]{pcri.eps}\hspace*{4mm} \includegraphics[width=6.7cm,clip]{pcau.eps}\end{figure}$	Figure A.2: Same as Fig. A.1, but for the parabolic case, with parameters g=0.05, G₁=-0.04, G₃=0.112.
Open with DEXTER

In this Appendix, we list the coefficients of the matrix $\tens{C}$ which occurs in (44):

4(1-gg^*) C₁₁	=	-2g F₂^* +(9g g^-3) F₀ +6 g^(1-2 g g^*) F₂
		+g^2(5 g g^-3) F₄ +6 g Q₂^* Q₀ -12 g g^* Q₀²
		+(3 -9 g g^) Q₂^ Q₂ + 6 g^* (4 g g^*-1) Q₀ Q₂
		+3 g^2(1- 3 g g^) Q₂²
4(1-gg^*) C₁₂	=	5 g F₄^* -2(5+6 g g^) F₂^ +9 g^(3+g g^) F₀
		-2 g^2(12+g g^) F₂ +7 g^3 F₄ -9 g Q₂^2
		+6(3 +4 g g^) Q₂^ Q₀ -12 g^(3+g g^) Q₀²
		-3 g^(5+3 g g^) Q₂^* Q₂ +6 g^2(8+g g^) Q₀ Q₂
		-15 g^*3 Q₂²
4(1-gg^*) C₁₃	=	-7 F₄^* +26 g^* F₂^* -36 g^2 F₀ +22 g^3 F₂ -5 g^*4 F₄
		+ 15 Q₂^2 -54 g^ Q₂^* Q₀ +48 g^2 Q₀² +24 g^2 Q₂^* Q₂
		-42 g^3 Q₀ Q₂ +9 g^4 Q₂²
4(1-gg^*) C₁₄	=	-2 g^* F₄^* +6 g^2 F₂^ - 6 g^3 F₀ +2 g^4 F₂
		+6 g^* Q₂^2 -18 g^2 Q₂^* Q₀ +12 g^*3 Q₀²
		+6 g^3 Q₂^ Q₂ -6 g^*4 Q₀ Q₂
4(1-gg^*) C₂₁	=	2 g² F₂^* -6 g² g^* F₀+[4 g g^(1+g g^)-2] F₂
		+2 g^(1-2 g g^) F₄ -6 g² Q₂^* Q₀
		+4 g (1+2 g g^) Q₀² +6 g² g^ Q₂^* Q₂
		+[2-4 g g^(3+2 g g^)] Q₀ Q₂+2 g^* (4 g g^*-1) Q₂²
4(1-gg^*) C₂₂	=	-5 g² F₄^* +2 g (7+4 g g^) F₂^
		-3[3+g g^(8+g g^)] F₀+2 g^(8+5 g g^) F₂
		-7 g^2 F₄ + 9 g² Q₂^2 -2 g (13+8 g g^) Q₂^ Q₀
		+4[3+g g^(8+g g^)] Q₀²
		+[5+g g^(16+3 g g^)] Q₂^* Q₂
		-2 g^(16+11 g g^) Q₀ Q₂ +15 g^*2 Q₂²
4(1-gg^*) C₂₃	=	7g F₄^* -2(4 +9 g g^) F₂^ +3 g^(7+5 g g^) F₀
		-2 g^2(9+2 g g^) F₂+5 g^3 F₄ -15 g Q₂^2
		+(16+38 g g^) Q₂^ Q₀ - 4 g^(7 +5 g g^) Q₀²
		- g^(13+11 g g^) Q₂^* Q₂ +2 g^2(17+4 g g^) Q₀ Q₂
		-9 g^*3 Q₂²
4(1-gg^*) C₂₄	=	(3 g g^-1) F₄^-6 g g^2 F₂^ +3 g^2(1 + g g^) F₀
		$\displaystyle -2 g^{3} F_2 + (1\! -\!7 g g^) Q_2^{2} +2 g^(2+7 g g^) Q_2^ Q_0$
		- 4 g^2(2+g g^) Q₀²-3 g^2(1+g g^) Q₂^* Q₂
		+6 g^*3 Q₀ Q₂.

The other eight elements follow trivially from the foregoing ones, since the second half of the matrix is just the complex conjugate one of the first half, i.e., C₄₄=C₁₁^*, C₃₄=C₂₁^* etc., or in general, C_ij=C_5-i,5-j^*.

$\begin{figure} \par\includegraphics[width=6.9cm,clip]{ecri.eps}\hspace*{4mm} \includegraphics[width=6.9cm,clip]{ecau.eps}\end{figure}$	Figure A.3: Same as Fig. A.1, but for the elliptical case, with parameters $g=0.05,G_1=0.015+0.035{\rm i}, G_3=0.19+0.105{\rm i}$ .
Open with DEXTER

Appendix B: Critical curves and caustics

In this Appendix we consider the critical curves of the lens Eq. (9), with the goal of finding the pricipal limits of the applicability of the flexion formalism - which necessarily breaks down if parts of the source are multiply imaged. For this, we need to derive the full Jacobian, which can most easily be obtained from considering $\theta$ and $\theta^*$ as independent variables, and then use $\partial/\partial\theta_1 = \partial/\partial\theta + \partial/\partial\theta^*$ , $\partial/\partial\theta_2 = {\rm i}\left(\partial/\partial\theta - \partial/\partial\theta^*\right)$ , which can be inverted to yield $\partial/\partial\theta=\nabla_{\rm c}^*/2$ , $\partial/\partial\theta^*=\nabla_{\rm c}/2$ . With these relations, one finds that $\det\mathcal A=(\partial\beta/\partial\theta) (\partial\beta^*/\partial\theta^*... ...eta~\nabla_{\rm c}\beta^* -\nabla_{\rm c}\beta~\nabla_{\rm c}^*\beta^*\right)/4$ . Carrying out these derivatives, the Jacobian becomes

$\begin{displaymath}\det\mathcal A=1-g g^* -\eta^*\theta-\eta\theta^* + A^*\theta^2+B\theta\theta^* +A(\theta^*)^2 , \end{displaymath}$

(B.1)

with

$\displaystyle A=4\left(\Psi_1^2-\Psi_1^\Psi_3\right) ;\quad B=4\left(\Psi_1\Psi_1^-\Psi_3\Psi_3^*\right) ;$
$\displaystyle \eta=4\Psi_1+2g\Psi_1^+2 g^\Psi_3 .$			(B.2)

Note that A is a spin-2 quantity, whereas B is a real scalar, i.e., has spin-0. In the generic case, the critical curves ( $\det\mathcal A=0$ ) are conical sections, which may be degenerate, though. We will now perform a complete classification of cases that can occur, as well as to derive the critical curve(s) in parametric form; the caustics are then obtained by inserting the parametric form of the critical curves into the second-order lens equation. As we shall see, the type of conical section is determined, amongst other parameters, by the discriminant

$\begin{displaymath}\Delta=B^2-4 A A^* . \end{displaymath}$

(B.3)

B.1 Zero discriminant

We start with the case that $\Delta=0$ , which implies B²=4 A A^*, or $B=\pm 2\vert A\vert$ . The case A=0=B either implies that $\Psi_1=0=\Psi_3$ , in which case also $\eta=0$ so that no critical curves occur, or that $\Psi_3=\Psi_1^2/\Psi_1^*$ , for which $\eta\ne 0$ in general. In this case, the critical curve is a straight line, satisfying $\eta^*\theta+\eta\theta^*=1-g g^*$ . As can be seen by inspection, it reads

$\begin{displaymath}\theta={1-g g^*\over 2\eta^*}+{\rm i}\lambda\eta ,\quad -\infty<\lambda<\infty . \end{displaymath}$

(B.4)

If $A\ne 0$ , the phase of A is defined. Since it is a spin-2 quantity, we write $A=\vert A\vert~{\rm e}^{2{\rm i}\varphi_A}$ . Furthermore, we introduce the rotation $\theta=x~{\rm e}^{{\rm i}\varphi_A}$ . Then the equation $\det\mathcal A=0$ for the critical curve reads, after dividing (B.1) by |A|,

$\begin{displaymath}\left(x\pm x^*\right)^2=\nu^* x+\nu x^*+{g g^* -1\over \vert ... ...\; \nu={\eta~{\rm e}^{-{\rm i}\varphi_A} \over \vert A\vert} , \end{displaymath}$

(B.5)

and the sign on the left-hand side of the equation depends on the sign of B, where we used $B=\pm 2\vert A\vert$ . The parametric form of the critical curve, which takes the form of a parabola, can then be written as

$\displaystyle \theta={2~{\rm e}^{{\rm i}\varphi_A}\over (\nu^-\nu)} \left(2\lambda^2-\lambda \nu+{1-g g^\over 2\vert A\vert}\right) ;$
$\displaystyle \theta={2~{\rm e}^{{\rm i}\varphi_A}\over (\nu^+\nu)} \left({1-g g^\over 2\vert A\vert}-{\rm i}\lambda\nu - 2\lambda^2\right) ,$			(B.6)

where the first (second) equation applies for B>0 (B<0). Note that the parabola degenerates into a straight line if $\nu$ is real (for B>0) or purely imaginary (for B<0).

B.2 Non-zero discriminant

If $\Delta\ne 0$ , we can perform a translation to eliminate the linear term in $\det \mathcal A$ . Hence we define $\theta=\theta_0+\vartheta$ and choose $\theta_0$ such that terms linear in $\vartheta$ vanish. We then obtain for $\theta_0$ and for the critical curve condition

$\begin{displaymath}\theta_0={B\eta-2 A\eta^*\over \Delta}\;;\quad A^* \vartheta^2+B\vartheta\vartheta^*+A(\vartheta^*)^2=C\;, \end{displaymath}$

(B.7)

with

C	=	$\displaystyle {B \eta\eta^-A(\eta^)^2-A^\eta^2\over \Delta}+g g^-1$
	=	$\displaystyle -{1\over \Delta}~\left(g A^* +g^* A+B\right)^2 =: -{1\over \Delta}~V^2 ,$	(B.8)

where the second step was obtained by inserting the expression for $\eta$ in terms of the $\Psi$ 's, and in the final one we defined Vas the expression in the parenthesis.

As the first case, we consider A=0 and $B\ne 0$ (the case A=0=Bwas treated above), which implies that $\Psi_1=0$ and $B=-4\Psi_3\Psi_3^*<0$ . The equation for the critical curve then reduces to $B\vert\vartheta\vert^2=C$ . Furthermore, $\Delta=B^2$ , and C=-1. Thus, the critical curve is a circle of radius $1/(2\vert\Psi_3\vert)$ and center $\theta_0$ , or $\theta=\theta_0+{\rm e}^{{\rm i}\lambda}/(2\vert\Psi_3\vert)$ , $0\le \lambda<2\pi$ .

We now consider the case $A\ne 0$ ; then the phase $\varphi_A$ of A is defined, as used before. Introducing a rotation by defining $\vartheta=x~{\rm e}^{{\rm i}{\varphi_A}}$ , the equation for the critical curve becomes

$\begin{displaymath}\vert A\vert\left[ x^2+\left(x^*\right)^2 \right]+B x x^* =\l... ...\vert A\vert\right)x_1^2+\left(B-2\vert A\vert\right)x_2^2=C . \end{displaymath}$

(B.9)

The presence and topology of critical curves now depends on the signs of $\Delta$ and C. We first consider the case C=0; then, if $\Delta>0$ , no critical curves occur, except for the isolated point x=0. If $\Delta<0$ , the critical curves are two straight lines, as can be obtained from (B.7): inserting the ansatz $\vartheta=\lambda~{\rm e}^{{\rm i}\zeta}$ , one obtains ${\rm e}^{2{\rm i}(\zeta-\varphi_A)} =(-B\pm{\rm i}\sqrt{-\Delta})/(2\vert A\vert)$ . Thus, the critical curves are parametrized as

$\displaystyle \theta=\theta_0+\lambda~{\rm e}^{{\rm i}\varphi_A} \sqrt{-B\pm{\rm i}\sqrt{-\Delta}\over 2\vert A\vert} ;$
$\displaystyle -\infty<\lambda<\infty.$			(B.10)

For the case of $C\ne 0$ , the consideration of (B.9) yields the result that for $\Delta<0$ , the critical curves consist of two hyperbolae. From (B.8) we see that negative $\Delta$ implies C>0. Also note that $\Delta<0$ implies that 2|A|-B>0, 2|A|+B>0. The critical curves then read

$\displaystyle \theta=\theta_0+{ {\rm e}^{{\rm i}\varphi_A}~ V\over\sqrt{-\Delta... ...2\vert A\vert+B}} +{\rm i}~ {\sinh\lambda\over \sqrt{2\vert A\vert-B}}\right) ;$
$\displaystyle -\infty<\lambda<\infty.$			(B.11)

For the other case, $\Delta>0$ , we find from (B.8) that C<0. If $B\pm 2\vert A\vert>0$ , we then see from (B.9) that no critical curves exist. If $B\pm2\vert A\vert<0$ , which in particular implies B<0, the critical curve is an ellipse parametrized as

$\displaystyle \theta=\theta_0+{ {\rm e}^{{\rm i}\varphi_A}~ V\over\sqrt{\Delta}... ...-2\vert A\vert-B}} +{\rm i}~ {\sin\lambda\over \sqrt{2\vert A\vert-B}}\right) ;$
$\displaystyle 0\le \lambda < 2\pi .$			(B.12)

This concludes the classification of critical curves of the lens Eq. (9). The caustics are obtained by inserting the parametrized form of the critical curves into the lens equation. In order to see whether a critical curve cuts through the primary image of a circular source of outer isophotal radius $\Theta$ , we calculate the minimum value $\beta_{\rm min}$ of $\vert\beta(\lambda)\vert$ along the caustics. If $\beta_{\rm min}>\Theta$ , the image is not cut by a critical curve. For an elliptical critical curve, the maximum source size allowed is $\beta_{\rm min}$ ; these values are plotted in Fig. 2. In the cases where two critical curves exist (e.g., two straight lines or hyperbolae), the situation is slightly more complicated. Consider, e.g., the case of two straight critical curves. Only those sections of them that are closer to the origin are relevant for this consideration, since if the primary image of the source is not cut by these closer sections of critical curves, it will still be an isolated image; the caustics coming from the outer sections of the critical curves correspond to multiply imagedsource sections of secondary images. Accounting for this complication, the maximum sources size have been obtained, as plotted in Fig. 2.

References

Bacon, D. J., Goldberg, D. M., Rowe, B. T. P., & Taylor, A. N. 2006, MNRAS, 365, 414 [NASA ADS] [CrossRef] (In the text)
Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [NASA ADS] [CrossRef] (In the text)
Crittenden, R. G., Natarajan, P., Pen, U.-L., & Theuns, T. 2002, ApJ, 568, 20 [NASA ADS] [CrossRef] (In the text)
Falco, E. E., Gorenstein, M. V., & Shapiro, I. I. 1985, ApJ, 289, L1 [NASA ADS] [CrossRef] (In the text)
Fort, B., & Mellier, Y. 1994, A&AR, 5, 239 [NASA ADS] (In the text)
Fort, B., Prieur, J. L., Mathez, G., Mellier, Y., & Soucail, G. 1988, A&A, 200, L17 [NASA ADS] (In the text)
Goldberg, D. M., & Bacon, D. J. 2005, ApJ, 619, 741 [NASA ADS] [CrossRef] (In the text)
Goldberg, D. M., & Leonard, A. 2007, ApJ, 660, 1003 [NASA ADS] [CrossRef] (In the text)
Gorenstein, M. V., Shapiro, I. I., & Falco, E. E. 1998, ApJ, 327, 693 [NASA ADS] [CrossRef]
Irwin, J., & Shmakova, M. 2005, New Astron. Rev., 49, 83 [NASA ADS] [CrossRef] (In the text)
Irwin, J., & Shmakova, M. 2006, ApJ, 645, 17 [NASA ADS] [CrossRef] (In the text)
Kaiser, N. 1995, ApJ, 439, 1 [NASA ADS] [CrossRef] (In the text)
Kaiser, N., Squires, G., & Broadhurst, T. 1995, ApJ, 449, 460 [NASA ADS] [CrossRef] (In the text)
Leonard, A., Goldberg, D. M., Haaga, J. L., & Massey, R. 2007, ApJ, 666, 51 [NASA ADS] [CrossRef] (In the text)
Luppino, G. A., & Kaiser, N. 1997, ApJ, 475, 20 [NASA ADS] [CrossRef] (In the text)
Massey, R., Rowe, B., Refregier, A., Bacon, D. J., & Bergé, J. 2007a, MNRAS, 380, 229 [NASA ADS] [CrossRef] (In the text)
Massey, R., Heymans, C., Bergé, J., et al. 2007b, MNRAS, 376, 13 [NASA ADS] [CrossRef] (In the text)
Mellier, Y. 1999, ARA&A, 37, 127 [NASA ADS] [CrossRef] (In the text)
Munshi, D., Valageas, P., Van Waerbeke, L., & Heavens, A. 2006 [arXiv:astro-ph/0612667] (In the text)
Okura, Y., Umetsu, K., & Futamase, T. 2007a, ApJ, 660, 995 [NASA ADS] [CrossRef] (In the text)
Okura, Y., Umetsu, K., & Futamase, T. 2007b [arXiv:0710.2262] (In the text)
Refregier, A. 2003, ARA&A, 41, 645 [NASA ADS] [CrossRef] (In the text)
Schneider, P. 2006, in Schneider, P., Kochanek, C. S., & Wambsganss, J.: Gravitational Lensing: Strong, Weak & Micro, Lecture Notes of the 33rd Saas-Fee Advanced Course, ed. G. Meylan, P. Jetzer, & P. North (Berlin: Springer-Verlag), 269 (In the text)
Schneider, P., & Seitz, C. 1995, A&A, 294, 411 [NASA ADS] (In the text)
Schneider, P., Van Waerbeke, L., & Mellier, Y. 2002, A&A, 389, 729 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
Seitz, C., & Schneider, P. 1997, A&A, 318, 687 [NASA ADS] (In the text)
Seitz, S., & Schneider, P. 2001, A&A, 374, 740 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)

$\displaystyle {\bar\beta}$	=	$\displaystyle {3 G_1 g^-5 G_1^-2 g G_3^\over 4(1-g g^)} Q_2$
		$\displaystyle +{4 g G_1^* + g^2 G_3^-G_3 g^ -G_1 (3+g g^)\over 2(1-g g^)} Q_0$
		$\displaystyle +{5 g G_1 -3 g^2 G_1^* -(1-3 g g^) G_3\over 4(1-g g^)} Q_2^* .$	(32)

$\displaystyle Q_2^{\rm s}$	=	$\displaystyle Q_2 - 2g Q_0 + g^2 Q_2^* +{2 g^* G_1-3 G_1^-g G_3^\over 2(1-g g^*)} T_3$
		$\displaystyle +{8 g G_1^-(4+3 g g^)G_1-g^* G_3+2 g^2 G_3^\over2(1-g g^)} T_1$
		$\displaystyle +{(7+g g^)g G_1-7g^2 G_1^ +(3 g g^-1) G_3-g^3 G_3^ \over2(1-g g^)} T_1^$
		$\displaystyle +{(1-2 g g^)g G_3-3 g^2 G_1 +2 g^3 G_1^\over2(1-g g^)} T_3^ -\bar\beta^2 ,$	(37)

		$\displaystyle Q_0^{\rm s} = -g^* Q_2+(1+g {g^})Q_0-g Q_2^$
		$\displaystyle +{6 g^* G_1^* + (3 g g^-1) G_3^ - 4 {g^}^2 G_1\over 4(1-g g^)} T_3$
		$\displaystyle +{2 {g^}^2 G_3+(11+3 g g^)g^* G_1-(7 +9g g^)G_1^-(1+3 g g^)g G_3^ \over 4(1-g g^*)} T_1$
		$\displaystyle + {2 g^2 G_3^* + (11+ 3 g g^) g G_1^ - (1+3 g g^)g^ G_3 -(7+9 g g^)G_1\over 4(1-g g^)} T_1^*$
		$\displaystyle +{6 g G_1-4 g^2 G_1^* - (1- 3 g g^)G_3\over 4(1-g g^)} T_3^* -\bar\beta\bar\beta^*.$	(38)

4(1-gg^*) C₁₁	=	-2g F₂^* +(9g g^-3) F₀ +6 g^(1-2 g g^*) F₂
		+g^2(5 g g^-3) F₄ +6 g Q₂^* Q₀ -12 g g^* Q₀²
		+(3 -9 g g^) Q₂^ Q₂ + 6 g^* (4 g g^*-1) Q₀ Q₂
		+3 g^2(1- 3 g g^) Q₂²
4(1-gg^*) C₁₂	=	5 g F₄^* -2(5+6 g g^) F₂^ +9 g^(3+g g^) F₀
		-2 g^2(12+g g^) F₂ +7 g^3 F₄ -9 g Q₂^2
		+6(3 +4 g g^) Q₂^ Q₀ -12 g^(3+g g^) Q₀²
		-3 g^(5+3 g g^) Q₂^* Q₂ +6 g^2(8+g g^) Q₀ Q₂
		-15 g^*3 Q₂²
4(1-gg^*) C₁₃	=	-7 F₄^* +26 g^* F₂^* -36 g^2 F₀ +22 g^3 F₂ -5 g^*4 F₄
		+ 15 Q₂^2 -54 g^ Q₂^* Q₀ +48 g^2 Q₀² +24 g^2 Q₂^* Q₂
		-42 g^3 Q₀ Q₂ +9 g^4 Q₂²
4(1-gg^*) C₁₄	=	-2 g^* F₄^* +6 g^2 F₂^ - 6 g^3 F₀ +2 g^4 F₂
		+6 g^* Q₂^2 -18 g^2 Q₂^* Q₀ +12 g^*3 Q₀²
		+6 g^3 Q₂^ Q₂ -6 g^*4 Q₀ Q₂
4(1-gg^*) C₂₁	=	2 g² F₂^* -6 g² g^* F₀+[4 g g^(1+g g^)-2] F₂
		+2 g^(1-2 g g^) F₄ -6 g² Q₂^* Q₀
		+4 g (1+2 g g^) Q₀² +6 g² g^ Q₂^* Q₂
		+[2-4 g g^(3+2 g g^)] Q₀ Q₂+2 g^* (4 g g^*-1) Q₂²
4(1-gg^*) C₂₂	=	-5 g² F₄^* +2 g (7+4 g g^) F₂^
		-3[3+g g^(8+g g^)] F₀+2 g^(8+5 g g^) F₂
		-7 g^2 F₄ + 9 g² Q₂^2 -2 g (13+8 g g^) Q₂^ Q₀
		+4[3+g g^(8+g g^)] Q₀²
		+[5+g g^(16+3 g g^)] Q₂^* Q₂
		-2 g^(16+11 g g^) Q₀ Q₂ +15 g^*2 Q₂²
4(1-gg^*) C₂₃	=	7g F₄^* -2(4 +9 g g^) F₂^ +3 g^(7+5 g g^) F₀
		-2 g^2(9+2 g g^) F₂+5 g^3 F₄ -15 g Q₂^2
		+(16+38 g g^) Q₂^ Q₀ - 4 g^(7 +5 g g^) Q₀²
		- g^(13+11 g g^) Q₂^* Q₂ +2 g^2(17+4 g g^) Q₀ Q₂
		-9 g^*3 Q₂²
4(1-gg^*) C₂₄	=	(3 g g^-1) F₄^-6 g g^2 F₂^ +3 g^2(1 + g g^) F₀
		$\displaystyle -2 g^{3} F_2 + (1\! -\!7 g g^) Q_2^{2} +2 g^(2+7 g g^) Q_2^ Q_0$
		- 4 g^2(2+g g^) Q₀²-3 g^2(1+g g^) Q₂^* Q₂
		+6 g^*3 Q₀ Q₂.