Ambiguities in gravitational lens models: the density field from the source position transformation

Sandra Unruh; Peter Schneider; Dominique Sluse

doi:10.1051/0004-6361/201629048

Volume 601 (May 2017)

A&A, 601 (2017) A77

Issue		A&A Volume 601, May 2017


Article Number		A77
Number of page(s)		13
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/201629048
Published online		08 May 2017

A&A 601, A77 (2017)

Sandra Unruh¹, Peter Schneider¹ and Dominique Sluse²

¹ Argelander-Institut für Astronomie, Universität Bonn, Auf dem Hügel 71, 53121 Bonn, Germany
e-mail: sandra@astro.uni-bonn.de; peter@astro.uni-bonn.de
² STAR Institute, Quartier Agora, Allée du six Août, 19c, University of Liège, 4000 Liège, Belgium
e-mail: dsluse@ulg.ac.be

Received: 3 June 2016
Accepted: 24 February 2017

Abstract

Strong gravitational lensing is regarded as the most precise technique to measure the mass in the inner region of galaxies or galaxy clusters. In particular, the mass within one Einstein radius can be determined with an accuracy of the order of a few percent or better, depending on the image configuration. For other radii, however, degeneracies exist between galaxy density profiles, precluding an accurate determination of the enclosed mass. The source position transformation (SPT), which includes the well-known mass-sheet transformation (MST) as a special case, describes this degeneracy of the lensing observables in a more general way. In this paper we explore properties of an SPT, removing the MST to leading order, that is, we consider degeneracies which have not been described before. The deflection field $\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta})$ resulting from an SPT is not curl-free in general, and thus not a deflection that can be obtained from a lensing mass distribution. Starting from a variational principle, we construct lensing potentials that give rise to a deflection field $\tilde{\boldsymbol{\alpha}}$, which differs from $\hat{\boldsymbol{\alpha}}$ by less than an observationally motivated upper limit. The corresponding mass distributions from these “valid” SPTs are studied: their radial profiles are modified relative to the original mass distribution in a significant and non-trivial way, and originally axi-symmetric mass distributions can obtain a finite ellipticity. These results indicate a significant effect of the SPT on quantitative analyses of lens systems. We show that the mass inside the Einstein radius of the original mass distribution is conserved by the SPT; hence, as is the case for the MST, the SPT does not affect the mass determination at the Einstein radius. Furthermore, we analyse a degeneracy between two lens models, empirically found previously, and show that this degeneracy can be interpreted as being due to an SPT. Thus, degeneracies between lensing mass distributions are not just a theoretical possibility, but do arise in actual lens modeling.

Key words: cosmological parameters / gravitational lensing: strong

© ESO, 2017

1. Introduction

Strong gravitational lensing provides a highly valuable tool to obtain mass properties of galaxies and galaxy clusters (see, e.g., Bartelmann 2010; Kochanek 2006, and references therein). In particular, multiple image systems yield strong constraints on the mass distribution. The mass enclosed within the Einstein radius presents the most robust galaxy mass estimate currently available. Furthermore, the shape of the mass distribution (e.g., ellipticity, orientation) is well defined.

However, mass estimates for radii smaller or larger than the Einstein radius are less accurate. If only a finite set of individual lensed compact images is observed, too few observational constraints are available and certainly no unique radial mass profile can be found. The situation changes somewhat if extended source components are lensed, in which case the constraints on the mass distribution are much more stringent. Nonetheless, even if we could find a mass model which reproduces all constraints perfectly, such a mass model would not be unique either. The reason for this degeneracy has been known since 1985 (Falco et al. 1985) and is called the mass-sheet transformation (MST). If a given surface mass density κ(θ) reproduces all observational constraints, then the whole family of mass models,

$$\kappa_\lambda(\boldsymbol{\theta}) = \lambda\,\kappa(\boldsymbol{\theta}) + (1-\lambda), \tag{1}$$

will do the same. In particular, the MST leaves all observables invariant except the time delay¹. The transformation (1) rescales the slope of the density profile by a constant factor λ. This directly affects mass measurements outside the Einstein radius θ_E and the determination of the Hubble constant H₀.
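The invariance under Eq. (1) can be checked numerically with a minimal sketch. It assumes a one-dimensional singular isothermal sphere as a toy lens (not a model used in this paper) and uses the standard fact that the deflection belonging to κ_λ is λα(θ) + (1 − λ)θ. The names `alpha_sis`, `alpha_mst` and the values of `THETA_E`, `beta` and `lam` are illustrative choices only.

```python
import math

THETA_E = 1.0  # Einstein radius in arbitrary angular units (illustrative)


def alpha_sis(theta):
    """1D deflection of a singular isothermal sphere (assumed toy lens)."""
    return THETA_E * math.copysign(1.0, theta)


def alpha_mst(theta, lam):
    """Deflection belonging to kappa_lambda = lam * kappa + (1 - lam)."""
    return lam * alpha_sis(theta) + (1.0 - lam) * theta


beta = 0.3                                  # source position, |beta| < THETA_E
images = [beta + THETA_E, beta - THETA_E]   # the two images of the SIS
lam = 0.8                                   # MST parameter (illustrative)

# Source position recovered from each image, before and after the MST.
beta_original = [t - alpha_sis(t) for t in images]
beta_transformed = [t - alpha_mst(t, lam) for t in images]

print(beta_original)      # both entries agree with beta up to rounding
print(beta_transformed)   # both entries agree with lam * beta up to rounding
```

Both images map back to one common source position in either case; the MST only rescales that unobservable position by λ, which is why image positions alone cannot fix λ.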

Schneider & Sluse (2013, hereafter SS13) presented two mass profiles (namely, a Hernquist profile plus a modified Navarro, Frenk and White profile, as well as a power-law mass profile) which showed almost the same imaging properties, although they are not exactly related through an MST. Following this unexpected result it became apparent that an even more general invariance transformation than the MST exists. The so-called source-position transformation (SPT) was finally introduced in Schneider & Sluse (2014, hereafter SS14).

For isolated individual images many ambiguities for the lens equation exist. Local transformations of the lensing mass distribution, which still reproduce the positional constraints from the lensed images, lead to an infinite number of mass models (see e.g., Saha & Williams 1997; Diego et al. 2005; Coe et al. 2008; Liesenborgs & De Rijcke 2012). The MST as given in Eq. (1) is a global transformation and equivalent to an isotropic uniform stretching of the source plane by a constant factor λ. The SPT is based on a more general (global) transformation of the source plane coordinates. Such transformations $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$, where $\hat{\boldsymbol{\beta}}$ denotes the transformed source position, give rise to a new deflection law $\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}) = \boldsymbol{\theta} - \hat{\boldsymbol{\beta}}(\boldsymbol{\theta} - \boldsymbol{\alpha}(\boldsymbol{\theta}))$. The new deflection law $\hat{\boldsymbol{\alpha}}$ will in general not be a gradient field and thus cannot be obtained from the deflection caused by a lens. However, if the curl component of $\hat{\boldsymbol{\alpha}}$ is sufficiently small, then one may find a lensing mass distribution which yields a deflection law which is very close to $\hat{\boldsymbol{\alpha}}$, so close that it cannot be observationally distinguished from $\hat{\boldsymbol{\alpha}}$. In this paper we will explore this possibility, which of course depends on the SPT $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$. In particular, if this deformation is “too strong”, then the resulting $\hat{\boldsymbol{\alpha}}$ cannot be approximated with the deflection due to a lens – this will restrict the freedom in choosing transformations $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$.

The outline of the paper is as follows. In Sect. 2 we will recapitulate the principle of the SPT. We characterize the deviation of the deflection law from a gradient field quantitatively in Sect. 3 by finding a gravitational potential $\tilde\psi$ such that $\tilde{\boldsymbol{\alpha}} = \nabla\tilde\psi$ is as close as possible to the SPT-transformed deflection law $\hat{\boldsymbol{\alpha}}$. To do so, we will start from a variational principle and show that the modified deflection potential $\tilde\psi$ has to fulfill Neumann boundary conditions. Those can be solved using a Green’s function, and the solution will be given explicitly for a circular region. Furthermore, a numerical approach will be presented to find degenerate deflection laws and their corresponding mass profiles. By considering a specific deformation function $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$ and assuming a positional accuracy on lensed image positions typical of the Hubble Space Telescope (HST), we will present in Sect. 4 the implications of the “allowed” SPTs on current mass profile determinations, regarding the radial mass profile and the angular structure of the lens. Different diagnostics for the change of the mass profile by an SPT, and how it can be distinguished from an MST, will be explored in Sect. 5 in terms of the aperture mass. Finally, we will discuss our findings in Sect. 6.

2. The principle of the source position transformation

In the following we will describe the principle of the SPT and its properties. For a more detailed account the reader is referred to SS14. We use standard gravitational lensing notation throughout this paper (see, e.g., Schneider 2006).

In general, a surface mass density distribution κ(θ) gives rise to a deflection law α(θ), where θ is the angular position in the lens plane, i.e., the observer’s sky. The mass distribution or convergence κ is defined as the ratio of projected surface mass density to the critical surface mass density, where the latter depends only on the angular diameter distances of lens and source. If that mass distribution is sufficiently concentrated (i.e., typically κ(θ) ≳ 1 for some region in the lens plane) a source may have multiple images, depending on its position relative to the deflector on the sky. Then, the source located at the (unobservable) position β will have its images at locations described by the solutions θ_i = β + α(θ_i) of the lens equation. Since multiple images are from the same source, we can deduce the constraints on the deflection law α(θ) as

$$\boldsymbol{\theta}_i - \boldsymbol{\alpha}(\boldsymbol{\theta}_i) = \boldsymbol{\theta}_j - \boldsymbol{\alpha}(\boldsymbol{\theta}_j), \tag{2}$$

or likewise for an alternative deflection law $\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta})$ as

$$\boldsymbol{\theta}_i - \hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}_i) = \boldsymbol{\theta}_j - \hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}_j), \tag{3}$$

for all i<j, such that α(θ) as well as $\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta})$ yield exactly the same sets of multiple images. If such equivalent deflection laws exist, they will correspond to source positions β = θ−α(θ) or $\hat{\boldsymbol{\beta}} = \boldsymbol{\theta} - \hat{\boldsymbol{\alpha}}(\boldsymbol{\theta})$, respectively (see Fig. 1).

Fig. 1

Illustration of the source position transformation. A source at β causes multiple images θ in the lens plane under the deflection law α. The same multiple images are obtained from a source at $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$, provided the deflection law is changed to $\hat{\boldsymbol{\alpha}}$, according to Eq. (4).

We can now consider a one-to-one mapping $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$ that connects the original source coordinates to the new ones. This allows us to define the transformed deflection law as

$$\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}) = \boldsymbol{\alpha}(\boldsymbol{\theta}) + \boldsymbol{\beta} - \hat{\boldsymbol{\beta}}(\boldsymbol{\beta}) = \boldsymbol{\theta} - \hat{\boldsymbol{\beta}}\bigl(\boldsymbol{\theta} - \boldsymbol{\alpha}(\boldsymbol{\theta})\bigr), \tag{4}$$

where in the last step we inserted the original lens equation β = θ−α(θ).
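Equation (4) translates directly into code. The following minimal sketch assumes a singular isothermal sphere as toy lens and a hypothetical radial deformation of the source plane with strength `F2`; neither is taken from the text, and all names are illustrative. It evaluates the transformed deflection at the two images of one source and recovers the transformed source position from each of them.

```python
import math

THETA_E = 1.0   # Einstein radius (illustrative units)
F2 = 0.4        # strength of the hypothetical source-plane deformation


def alpha(theta):
    """Deflection of a singular isothermal sphere (assumed toy lens)."""
    x, y = theta
    r = math.hypot(x, y)
    return (THETA_E * x / r, THETA_E * y / r)


def beta_hat(beta):
    """Hypothetical one-to-one mapping of the source plane."""
    bx, by = beta
    stretch = 1.0 + 0.5 * F2 * (bx * bx + by * by) / THETA_E ** 2
    return (stretch * bx, stretch * by)


def alpha_hat(theta):
    """Eq. (4): alpha_hat(theta) = theta - beta_hat(theta - alpha(theta))."""
    ax, ay = alpha(theta)
    bx, by = beta_hat((theta[0] - ax, theta[1] - ay))
    return (theta[0] - bx, theta[1] - by)


# A source inside the Einstein radius and its two SIS images.
beta = (0.24, 0.18)
b = math.hypot(*beta)
unit = (beta[0] / b, beta[1] / b)
images = [((b + THETA_E) * unit[0], (b + THETA_E) * unit[1]),
          ((b - THETA_E) * unit[0], (b - THETA_E) * unit[1])]

# Transformed source position inferred from each image via alpha_hat.
new_sources = []
for theta in images:
    ah = alpha_hat(theta)
    new_sources.append((theta[0] - ah[0], theta[1] - ah[1]))

print(new_sources)      # the two entries coincide up to rounding
print(beta_hat(beta))   # and agree with the mapped source position
```

Both images lead to the same transformed source position, so the image constraint of Eq. (3) holds by construction for any one-to-one mapping.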

Hence, any bijective (i.e., one-to-one) function $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$ leads to an SPT which leaves the condition (2) invariant. Moreover, as can be deduced from the Jacobian $\hat{\mathcal{A}} = \partial\hat{\boldsymbol{\beta}}/\partial\boldsymbol{\theta} = (\partial\hat{\boldsymbol{\beta}}/\partial\boldsymbol{\beta})(\partial\boldsymbol{\beta}/\partial\boldsymbol{\theta})$ of the modified lens equation, the relative magnification matrices and the relative image shapes between image pairs of the same source $\hat{\boldsymbol{\beta}}$ remain unchanged. However, the Jacobian $\hat{\mathcal{A}}$ will not be symmetric in general, and therefore $\hat{\boldsymbol{\alpha}}$ cannot be written as the gradient of a deflection potential $\hat\psi$ (i.e., $\hat{\boldsymbol{\alpha}}$ is not a curl-free field). This implies that no corresponding mass distribution $\hat\kappa$ exists that yields a deflection angle $\hat{\boldsymbol{\alpha}}$, in general. However, it was shown in SS14 that the asymmetric part of the Jacobian can be small in realistic cases; this will be explored more quantitatively in Sect. 3. In the special case that the lens is axisymmetric and the transformation $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$ corresponds to a radial stretching of the form

$$\hat{\boldsymbol{\beta}} = f(|\boldsymbol{\beta}|)\,\boldsymbol{\beta}, \tag{5}$$

the SPT is an exact invariance transformation: in this case, the Jacobian $\hat{\mathcal{A}}$ is symmetric, and for every transformation (5) and its corresponding deflection law $\hat{\boldsymbol{\alpha}}$ there exists a corresponding axi-symmetric mass distribution $\hat\kappa$.
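The role of the symmetry of the Jacobian can be made concrete with a small numerical experiment. The sketch below is only an illustration under stated assumptions: the lens is a singular isothermal sphere with an optional external shear `gamma`, the radial stretching of Eq. (5) uses the hypothetical choice f = 1 + F2 |β|² / (2 θ_E²), and the curl is estimated by central differences at a single test point.

```python
import math

THETA_E = 1.0   # Einstein radius (illustrative units)
F2 = 0.4        # strength of the radial stretching (hypothetical)


def make_alpha_hat(gamma):
    """Build alpha_hat for an SIS with external shear gamma (toy lens)."""
    def alpha_hat(x, y):
        r = math.hypot(x, y)
        # beta = theta - alpha(theta), with alpha = SIS + shear deflection
        bx = x - (THETA_E * x / r + gamma * x)
        by = y - (THETA_E * y / r - gamma * y)
        # radial stretching, Eq. (5), with f = 1 + F2 |beta|^2 / (2 THETA_E^2)
        f = 1.0 + 0.5 * F2 * (bx * bx + by * by) / THETA_E ** 2
        return (x - f * bx, y - f * by)
    return alpha_hat


def curl(field, x, y, h=1e-5):
    """Central-difference estimate of d(a_y)/dx - d(a_x)/dy."""
    day_dx = (field(x + h, y)[1] - field(x - h, y)[1]) / (2.0 * h)
    dax_dy = (field(x, y + h)[0] - field(x, y - h)[0]) / (2.0 * h)
    return day_dx - dax_dy


point = (0.8, 0.5)
curl_axisymmetric = curl(make_alpha_hat(0.0), *point)   # axisymmetric lens
curl_with_shear = curl(make_alpha_hat(0.1), *point)     # symmetry broken

print(curl_axisymmetric, curl_with_shear)
```

For the axisymmetric lens the estimated curl vanishes to within the finite-difference error, whereas the sheared lens yields a small but clearly non-zero curl at the same point.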

Provided the curl component of $\hat{\boldsymbol{\alpha}}$ is small, we expect that there exists a mass distribution $\tilde\kappa$ whose corresponding deflection law $\tilde{\boldsymbol{\alpha}}$ will be very similar to $\hat{\boldsymbol{\alpha}}$, in the sense that their difference is smaller than the astrometric accuracy of current observations. In this case, the SPT will be, for all practical purposes, a global invariance transformation for lenses.

3. The transformed mass distribution

3.1. The general method

Since the deflection law $\hat{\boldsymbol{\alpha}}$ of Eq. (4) is not a gradient field, it does not correspond to a deflection field caused by a gravitational lens. However, if the curl component of $\hat{\boldsymbol{\alpha}}$ is sufficiently small, one may be able to find a deflection potential $\tilde\psi$ and a corresponding deflection law $\tilde{\boldsymbol{\alpha}} = \nabla\tilde\psi$ such that the difference between $\hat{\boldsymbol{\alpha}}$ and $\tilde{\boldsymbol{\alpha}}$ is small, for example smaller than the astrometric accuracy of current observations. Since only the region of the lens plane where multiple images occur is constrained by lensing observations, the difference $\hat{\boldsymbol{\alpha}} - \tilde{\boldsymbol{\alpha}}$ needs to be small only in a finite region, which we denote as $\mathcal{U}$.

We thus consider the “action”

$$S = \int_{\mathcal{U}} \mathrm{d}^2\theta\; \bigl|\nabla\tilde\psi - \hat{\boldsymbol{\alpha}}\bigr|^2, \tag{6}$$

for which we want to find a minimum.

Using this particular variational principle is just one possibility of finding $\tilde{\boldsymbol{\alpha}}$. One could also apply the Helmholtz theorem and decompose $\hat{\boldsymbol{\alpha}}$ into its irrotational (curl-free) and solenoidal (divergence-free) part. This would lead to similar but not identical results for $\tilde{\boldsymbol{\alpha}}$, thus not changing the main conclusions of this paper². Another possible ansatz would be to find a gradient deflection angle such that its maximum deviation from $\hat{\boldsymbol{\alpha}}$ would be minimized; however, the solution of this problem seems to be much more difficult to find than our variational principle.

Equation (6) can be minimized by considering small variations of $\tilde\psi \to \tilde\psi + \delta\tilde\psi$, and finding the conditions for which the action is stationary for all variations $\delta\tilde\psi$. Up to linear terms in $\delta\tilde\psi$, we find

$$\begin{aligned} S + \delta S &= \int_{\mathcal{U}} \mathrm{d}^2\theta\; \bigl|\nabla\tilde\psi + \nabla(\delta\tilde\psi) - \hat{\boldsymbol{\alpha}}\bigr|^2 \\ &= S + 2\int_{\mathcal{U}} \mathrm{d}^2\theta\; \nabla(\delta\tilde\psi)\cdot\bigl(\nabla\tilde\psi - \hat{\boldsymbol{\alpha}}\bigr) \\ &= S + 2\int_{\partial\mathcal{U}} \mathrm{d}s\; \delta\tilde\psi\,\bigl(\nabla\tilde\psi - \hat{\boldsymbol{\alpha}}\bigr)\cdot\boldsymbol{n} - 2\int_{\mathcal{U}} \mathrm{d}^2\theta\; \delta\tilde\psi\,\bigl(\nabla^2\tilde\psi - \nabla\cdot\hat{\boldsymbol{\alpha}}\bigr), \end{aligned} \tag{7}$$

where we made use of the Gauß divergence theorem. The boundary curve of $\mathcal{U}$ is denoted as $\partial\mathcal{U}$, ds is the line element of the boundary curve, and n(s) the outward directed normal vector. Requiring δS = 0 leads to the Neumann problem

$$\nabla^2\tilde\psi = \nabla\cdot\hat{\boldsymbol{\alpha}} =: 2\hat\kappa \quad\text{and}\quad \nabla\tilde\psi\cdot\boldsymbol{n} = \hat{\boldsymbol{\alpha}}\cdot\boldsymbol{n}, \tag{8}$$

where the first equation is required for all points $\boldsymbol{\theta}\in\mathcal{U}$, and the second one for all points on the boundary $\partial\mathcal{U}$. The solution $\tilde\psi$ of Eq. (8) is specified only up to an additive constant, since a constant in the deflection potential does not affect the deflection angle.
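A Neumann problem admits a solution only if its source term and its boundary data are compatible. As a short worked check (a standard argument, added here for illustration and not part of the original derivation), applying the divergence theorem to both sides of Eq. (8) shows that this condition is met for any smooth $\hat{\boldsymbol{\alpha}}$:

```latex
\begin{align}
\int_{\mathcal{U}} \mathrm{d}^2\theta\; \nabla^2\tilde\psi
  &= \oint_{\partial\mathcal{U}} \mathrm{d}s\; \nabla\tilde\psi\cdot\boldsymbol{n}
  && \text{(divergence theorem for } \nabla\tilde\psi\text{)} \\
\int_{\mathcal{U}} \mathrm{d}^2\theta\; \nabla\cdot\hat{\boldsymbol{\alpha}}
  &= \oint_{\partial\mathcal{U}} \mathrm{d}s\; \hat{\boldsymbol{\alpha}}\cdot\boldsymbol{n}
  && \text{(divergence theorem for } \hat{\boldsymbol{\alpha}}\text{)}
\end{align}
```

Inserting the two conditions of Eq. (8) into the first identity turns it into the second one, which holds identically. The source term $2\hat\kappa$ and the boundary values $\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{n}$ are therefore always consistent with each other.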

In order to solve the system (8), we can either use standard numerical methods for such boundary value problems, or we can obtain the solution by means of a Green’s function. Both methods will be explored in this section.

3.2. Solving the Neumann problem numerically

We defined the convergence of the transformed deflection law to be $\hat\kappa = \nabla\cdot\hat{\boldsymbol{\alpha}}/2$. The curl component of $\hat{\boldsymbol{\alpha}}$ is reasonably small if the difference between $\hat{\boldsymbol{\alpha}}$ and its closest curl-free approximation (which is $\tilde{\boldsymbol{\alpha}}$) is smaller than a chosen astrometric accuracy ε_acc,

$$\bigl|\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}) - \tilde{\boldsymbol{\alpha}}(\boldsymbol{\theta})\bigr| = \bigl|\Delta\boldsymbol{\alpha}(\boldsymbol{\theta})\bigr| < \varepsilon_{\mathrm{acc}}, \tag{9}$$

for all $\boldsymbol{\theta}\in\mathcal{U}$. To solve the system (8) numerically, we set up a successive overrelaxation method (SOR; Press et al. 1996, their Sect. 19.5) on a square grid to calculate $\tilde\psi$. SOR is a convergent iterative process based on the extrapolation of the Gauß-Seidel method, and it is a standard method to solve boundary value problems (see, e.g., Seitz & Schneider 2001). Using a second-order accurate finite differencing scheme, the deflection law $\tilde{\boldsymbol{\alpha}}$ is then derived from the deflection potential $\tilde\psi$.

The lens is located at the center of the grid, chosen to be also the origin of the coordinate system. The grid has a side length of 4 θ_E to cover the relevant area in which multiple images occur, i.e., it covers the region within 2θ_E from the lens center.

The SOR involves the calculation of a weighted average between the previous iterate $\tilde\psi_{i,k}^{(m-1)}$ and the computed Gauß-Seidel iterate $\tilde\Psi_{i,k}^{(m)}$ successively for each component,

$$\tilde\psi_{i,k}^{(m)} = \omega\,\tilde\Psi_{i,k}^{(m)} + (1-\omega)\,\tilde\psi_{i,k}^{(m-1)}, \tag{10}$$

where $\tilde\psi_{i,k}^{(m)}$ is the value of $\tilde\psi$ for the grid point (i,k) in iteration m, and ω is the extrapolation parameter. The parameter ω is chosen such that it accelerates the rate of convergence of the iterative variable to the solution; in this work we apply

$$\omega = \frac{2}{1 + \pi/(\mathcal{N}-1)}, \tag{11}$$

where $\mathcal{N}\times\mathcal{N}$ is the total number of grid points. Initially, all $\tilde\psi_{i,k}$ are set to zero. In each iteration m, the Gauß-Seidel iterate $\tilde\Psi_{i,k}^{(m)}$ is calculated as follows (a fourth-order accurate finite differencing is used):

$$\begin{aligned} \tilde\Psi_{i,k}^{(m+1)} = {} & -\frac{1}{60}\Bigl(\tilde\psi_{i+2,k}^{(m)} + \tilde\psi_{i-2,k}^{(m)} + \tilde\psi_{i,k+2}^{(m)} + \tilde\psi_{i,k-2}^{(m)}\Bigr) \\ & + \frac{16}{60}\Bigl(\tilde\psi_{i+1,k}^{(m)} + \tilde\psi_{i-1,k}^{(m)} + \tilde\psi_{i,k+1}^{(m)} + \tilde\psi_{i,k-1}^{(m)}\Bigr) \\ & - \frac{12}{60}\,h^2\,\bigl[\nabla\cdot\hat{\boldsymbol{\alpha}}\bigr]_{i,k}, \end{aligned} \tag{12}$$

where h is the spacing of grid points. The divergence of $\hat{\boldsymbol{\alpha}}$ is calculated with a fourth-order accurate finite differencing method for each grid point; for points on the boundary of the grid and the neighboring row and column, a second-order accurate finite differencing scheme is employed. Convergence is reached when two requirements are met: (i) at least $40\,\mathcal{N}$ iterations have been made; and (ii) the maximum difference $(\tilde\psi_{i,k}^{(m)} - \tilde\psi_{i,k}^{(m-1)})_{\max}$ between two iterations increases. Typically, slightly more than $40\,\mathcal{N}$ iterations are needed to reach convergence. If the process converges, the values of $\tilde\psi$ at the four corners are calculated by extrapolation.
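The update of Eq. (10) with the extrapolation parameter of Eq. (11) can be sketched in a few lines. The example below is a deliberately simplified illustration and not the scheme used in this work: it assumes a second-order five-point stencil instead of the fourth-order stencil of Eq. (12), prescribes Dirichlet boundary values instead of the Neumann condition of Eq. (8), and runs a fixed number of sweeps. The function name `sor_poisson` and the test problem (∇²ψ = 4 with exact solution ψ = x² + y² on the unit square) are hypothetical choices.

```python
import math


def sor_poisson(n, source, boundary, sweeps):
    """Solve laplace(psi) = source on the unit square with SOR.

    Simplifications relative to the text: second-order five-point stencil
    and Dirichlet boundary values instead of the Neumann condition.
    """
    h = 1.0 / (n - 1)
    omega = 2.0 / (1.0 + math.pi / (n - 1))      # Eq. (11)
    psi = [[0.0] * n for _ in range(n)]           # initial guess: zero
    for i in range(n):
        for k in range(n):
            if i in (0, n - 1) or k in (0, n - 1):
                psi[i][k] = boundary(i * h, k * h)
    for _ in range(sweeps):
        for i in range(1, n - 1):
            for k in range(1, n - 1):
                # Gauss-Seidel iterate from the five-point Laplacian
                gs = 0.25 * (psi[i + 1][k] + psi[i - 1][k]
                             + psi[i][k + 1] + psi[i][k - 1]
                             - h * h * source(i * h, k * h))
                # Eq. (10): weighted average with the previous iterate
                psi[i][k] = omega * gs + (1.0 - omega) * psi[i][k]
    return psi, h


n = 17
psi, h = sor_poisson(n,
                     source=lambda x, y: 4.0,
                     boundary=lambda x, y: x * x + y * y,
                     sweeps=40 * n)   # minimum sweep count quoted in the text
max_error = max(abs(psi[i][k] - ((i * h) ** 2 + (k * h) ** 2))
                for i in range(n) for k in range(n))
print(max_error)
```

Because the five-point stencil is exact for quadratic functions, the remaining error in this test problem measures only the convergence of the iteration, not the discretization.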

Fig. 2

Illustration of the extrapolation method used with the SOR method (Sect. 3.2) to calculate $\tilde{\boldsymbol{\alpha}}$. Based on the calculation of $\tilde{\boldsymbol{\alpha}}$ on two grids with indices (i,k) and (I,K), we can retrieve $\tilde{\boldsymbol{\alpha}}^{\mathrm{true}}$ with a minimum accuracy Δα using the scheme described in the figure.

We consider that the typical accuracy on the image position of observed lens systems is of the order 5 mas, implying that ε_acc in Eq. (9) should be of the same order (this choice will be discussed in Sect. 4). Thus, the numerical error of our method has to be well below 1 mas ≈ $10^{-3}\,\theta_{\mathrm{E}}$ for typical galaxy scale lenses, which is quite stringent. Increasing the grid size yields a strong increase in computational time, which scales roughly as $\mathcal{N}^3$. Therefore, we added an extrapolation method to the standard SOR to increase accuracy with a more reasonable increase in computational time. The principle of our extrapolation scheme is displayed in Fig. 2 and is based on the observation that the error |Δα| of the computed value $\tilde{\boldsymbol{\alpha}}(h)$ scales as $h^2 \propto \mathcal{N}^{-2}$. This can be seen in the top panel of Fig. 3 where we applied our numerical scheme to the case of a non-singular isothermal sphere, i.e., where the true solution is known analytically. In this case, the deflection law $\hat{\boldsymbol{\alpha}}$ is a pure gradient field, and thus $\boldsymbol{\alpha} = \hat{\boldsymbol{\alpha}} = \tilde{\boldsymbol{\alpha}}$. Using this scaling behavior we can extrapolate to the true deflection $\tilde{\boldsymbol{\alpha}}^{\mathrm{true}}$, which would be obtained in the limit h → 0, for every grid point

$$\tilde{\boldsymbol{\alpha}}_{i,k}(h) = \tilde{\boldsymbol{\alpha}}_{i,k}^{\mathrm{true}} + \boldsymbol{E}_{i,k}^{\mathrm{num}}\,h^2, \tag{13}$$

where E^num is the numerical error³. However, the asymptotic deflection $\tilde{\boldsymbol{\alpha}}^{\mathrm{true}}$ and the value of the numerical error E^num are unknown in general. We can determine the two unknowns by calculating $\tilde{\boldsymbol{\alpha}}$ for two different values of h, i.e., for different $\mathcal{N}$. Hence, we calculate $\tilde{\boldsymbol{\alpha}}$ on two grids, of $\mathcal{N}_1 = 2N$ and $\mathcal{N}_2 = N$ points. The coordinates of the first and second grid are denoted respectively with indices (I,K) and (i,k) and we have to match every grid point (i,k) with its corresponding position (I,K). Then we can obtain the true value $\tilde{\boldsymbol{\alpha}}^{\mathrm{true}}$,

$$\tilde{\boldsymbol{\alpha}}_{i,k}^{\mathrm{true}} = \frac{4\,\tilde{\boldsymbol{\alpha}}_{I,K}\!\left(\frac{h}{2}\right) - \tilde{\boldsymbol{\alpha}}_{i,k}(h)}{3}, \tag{14}$$

as indicated in Fig. 2.
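The effect of Eq. (14) can be illustrated on a one-dimensional stand-in problem. The sketch below does not use the SOR solution itself; it assumes a cored isothermal potential of the hypothetical form ψ = θ_E (θ² + θ_c²)^{1/2} and obtains the deflection from a second-order central difference, whose error also scales as h². All names and parameter values are illustrative.

```python
import math

THETA_E = 1.0   # Einstein radius (illustrative units)
THETA_C = 0.1   # core radius (illustrative)


def psi(theta):
    """Assumed cored isothermal potential along one axis."""
    return THETA_E * math.sqrt(theta * theta + THETA_C * THETA_C)


def alpha_exact(theta):
    """Analytic derivative of psi, used only to measure the error."""
    return THETA_E * theta / math.sqrt(theta * theta + THETA_C * THETA_C)


def alpha_numeric(theta, h):
    """Second-order central difference; its error scales as h**2."""
    return (psi(theta + h) - psi(theta - h)) / (2.0 * h)


theta, h = 1.0, 0.1
coarse = alpha_numeric(theta, h)             # spacing h
fine = alpha_numeric(theta, h / 2.0)         # spacing h / 2
extrapolated = (4.0 * fine - coarse) / 3.0   # Eq. (14)

error_fine = abs(fine - alpha_exact(theta))
error_extrapolated = abs(extrapolated - alpha_exact(theta))
print(error_fine, error_extrapolated)
```

The leading h² term cancels in the combination of Eq. (14), so the extrapolated value is considerably more accurate than the result on the finer grid alone, at the price of one additional coarse-grid evaluation.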

Incorporating this extrapolation method in the code decreases the numerical error for the grid point numbers that are used ($\mathcal{N} \sim 400$) roughly by a factor of 10³, as can be seen in the lower panel of Fig. 3, which also shows that the numerical error with this extrapolation scheme decreases much faster with $\mathcal{N}$ than without. The largest numerical deviation (Δα)_max can be found near the center of the grid. This is as expected, since the deviation from $\tilde{\boldsymbol{\alpha}}^{\mathrm{true}}$ depends on higher-order derivatives, which for the chosen lens model are largest near the center. However, multiple images near the center of the lens are usually highly demagnified and rarely observable for galaxies as lenses (see, e.g., Hezaveh et al. 2015; Winn et al. 2004) and are therefore not relevant.

Fig. 3

Maximum difference of $|\hat{\boldsymbol{\alpha}} - \tilde{\boldsymbol{\alpha}}| = |\Delta\boldsymbol{\alpha}|$ for a non-singular isothermal sphere with core radius θ_c = 0.1 θ_E as a function of the number of grid points $\mathcal{N}$ used in the numerical solution. Blue dots are the numerical results, whereas the curves present power-law fits to these points with h being the spacing of grid points. In the top panel, the results are shown for the “standard” method, where the numerical error scales as $\mathcal{N}^{-2}$. Incorporating the extrapolation scheme, the numerical error decreases much faster with the number of grid points, as can be seen in the lower panel (note the different scale for the y-axis in the upper and lower panel). For the typical values used in this paper ($\mathcal{N} \sim 400$), a gain in accuracy by three orders of magnitude is obtained with extrapolation with only a modest increase of computational cost (~ 25%). Since extrapolation includes calculating $\tilde{\boldsymbol{\alpha}}$ twice with grid points 2N and N, values are only shown up to $\mathcal{N}_2 = 500$, which corresponds to $\mathcal{N}_1 = 1000$ in the graph above.

We have also tested the accuracy of the numerical implementation. One method is to check whether $\nabla\cdot\hat{\boldsymbol{\alpha}}(\boldsymbol{\theta}) = \nabla\cdot\tilde{\boldsymbol{\alpha}}(\boldsymbol{\theta})$ is valid for the whole grid for any deformation function $\hat{\boldsymbol{\beta}}(\boldsymbol{\beta})$. In all our calculations, deviations $|\hat\kappa - \tilde\kappa|$ are always smaller than $10^{-4}$ if the corner regions, i.e., |θ| ≥ 2 θ_E, are excluded from our analysis. Thus, we only consider the behavior of the mass profile in the circular region |θ|/θ_E < 2, where numerical errors in $|\hat{\boldsymbol{\alpha}} - \tilde{\boldsymbol{\alpha}}|$ do not exceed $10^{-6}\,\theta_{\mathrm{E}}$.

3.3. Solution by means of a Green’s function

A different approach is to find a solution of Eq. (8) by means of a Green’s function. For that, we make use of Green’s second theorem, considering a function h(θ),

$$\int_{\mathcal{U}} \mathrm{d}^2\theta\; \bigl[\tilde\psi\,\nabla^2 h - h\,\nabla^2\tilde\psi\bigr] = \int_{\partial\mathcal{U}} \mathrm{d}s\; \bigl[\tilde\psi\,\nabla h\cdot\boldsymbol{n} - h\,\nabla\tilde\psi\cdot\boldsymbol{n}\bigr]. \tag{15}$$

We choose the function h(θ) ≡ H(ϑ;θ) depending on the vector ϑ such that it obeys the following equations:

$$\begin{aligned} \nabla_\theta^2 H(\boldsymbol{\vartheta};\boldsymbol{\theta}) &= \delta(\boldsymbol{\theta}-\boldsymbol{\vartheta}) - \frac{1}{A} && \text{for } \boldsymbol{\theta}\in\mathcal{U}, \\ \nabla_\theta H(\boldsymbol{\vartheta};\boldsymbol{\theta})\cdot\boldsymbol{n} &= 0 && \text{for } \boldsymbol{\theta}\in\partial\mathcal{U}, \end{aligned} \tag{16}$$

where A is the area of $\mathcal{U}$, and ϑ is a point within $\mathcal{U}$. The term 1/A in Eq. (16) is needed to satisfy the divergence theorem applied to the vector field ∇h, which requires

$$\int_{\mathcal{U}} \mathrm{d}^2\theta\; \nabla^2 h = \int_{\partial\mathcal{U}} \mathrm{d}s\; \nabla h\cdot\boldsymbol{n}; \tag{17}$$

the conditions (16) set both sides of this equation to zero. With (16), Eq. (15) simplifies to

$$\begin{aligned} \tilde\psi(\boldsymbol{\vartheta}) &= \langle\tilde\psi\rangle + \int_{\mathcal{U}} \mathrm{d}^2\theta\; H(\boldsymbol{\vartheta};\boldsymbol{\theta})\,\nabla^2\tilde\psi - \int_{\partial\mathcal{U}} \mathrm{d}s\; H(\boldsymbol{\vartheta};\boldsymbol{\theta})\,\nabla\tilde\psi\cdot\boldsymbol{n} \\ &= \langle\tilde\psi\rangle + 2\int_{\mathcal{U}} \mathrm{d}^2\theta\; H(\boldsymbol{\vartheta};\boldsymbol{\theta})\,\hat\kappa(\boldsymbol{\theta}) - \int_{\partial\mathcal{U}} \mathrm{d}s\; H(\boldsymbol{\vartheta};\boldsymbol{\theta})\,\hat{\boldsymbol{\alpha}}\cdot\boldsymbol{n}, \end{aligned} \tag{18}$$

where $\langle\tilde\psi\rangle$ is the average of $\tilde\psi$ on $\mathcal{U}$, and we used Eq. (8) in the last step. We note that the integral

$$f(\boldsymbol{\theta}) = \int_{\mathcal{U}} \mathrm{d}^2\vartheta\; H(\boldsymbol{\vartheta};\boldsymbol{\theta}) \tag{19}$$

is a constant, since ∇²f(θ) = 0 and n·∇f = 0 on the boundary of $\mathcal{U}$. Therefore, if we integrate Eq. (18) over $\mathcal{U}$, the two integrals on the r.h.s. compensate each other, due to the divergence theorem, so that this solution is consistent.

Whereas for a general region $\mathcal{U}$ it will be difficult to obtain a solution of Eq. (16) for H(ϑ;θ), such a solution is analytically known if $\mathcal{U}$ is a circle of radius R. In this case,

$$H(\boldsymbol\vartheta;\boldsymbol\theta) = \frac{1}{4\pi}\left[\ln\frac{|\boldsymbol\vartheta-\boldsymbol\theta|^2}{R^2} + \ln\left(1 - \frac{2\boldsymbol\vartheta\cdot\boldsymbol\theta}{R^2} + \frac{|\boldsymbol\vartheta|^2|\boldsymbol\theta|^2}{R^4}\right)\right] - \frac{|\boldsymbol\vartheta|^2+|\boldsymbol\theta|^2}{4\pi R^2},$$ (20)

which has the properties that

$$\begin{aligned} &\nabla^2_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta) = \delta(\boldsymbol\vartheta-\boldsymbol\theta) - \frac{1}{\pi R^2} = \nabla^2_\theta H(\boldsymbol\vartheta;\boldsymbol\theta) \quad \text{for } \boldsymbol\vartheta,\boldsymbol\theta\in\mathcal{U}, \\ &\nabla_\theta H(\boldsymbol\vartheta;\boldsymbol\theta)\cdot\boldsymbol{n}(\boldsymbol\theta) = 0 \quad \text{for } \boldsymbol\theta\in\partial\mathcal{U}. \end{aligned}$$

Hence, Eq. (20) indeed satisfies the conditions (16).
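The two properties of Eq. (20) can be spot-checked numerically. The following is a minimal sketch, assuming plain finite differences and arbitrary test points; the function names and step sizes are illustrative choices and not part of the original analysis.

```python
import math

def green_h(vt, th, R=1.0):
    """Neumann Green's function H(vt; th) of Eq. (20) for a disc of radius R."""
    dx, dy = vt[0] - th[0], vt[1] - th[1]
    v2 = vt[0] ** 2 + vt[1] ** 2
    t2 = th[0] ** 2 + th[1] ** 2
    dot = vt[0] * th[0] + vt[1] * th[1]
    direct = math.log((dx * dx + dy * dy) / R**2)              # logarithmic pole at th = vt
    image = math.log(1.0 - 2.0 * dot / R**2 + v2 * t2 / R**4)  # regular for vt, th inside the disc
    return (direct + image) / (4.0 * math.pi) - (v2 + t2) / (4.0 * math.pi * R**2)

def laplacian_theta(vt, th, R=1.0, h=1e-3):
    """Five-point finite-difference Laplacian of H with respect to th (away from the pole)."""
    x, y = th
    ring = (green_h(vt, (x + h, y), R) + green_h(vt, (x - h, y), R)
            + green_h(vt, (x, y + h), R) + green_h(vt, (x, y - h), R))
    return (ring - 4.0 * green_h(vt, (x, y), R)) / h**2

def normal_derivative(vt, phi, R=1.0, h=1e-5):
    """Central-difference radial derivative of H at the boundary point R (cos phi, sin phi)."""
    c, s = math.cos(phi), math.sin(phi)
    outer = green_h(vt, ((R + h) * c, (R + h) * s), R)
    inner = green_h(vt, ((R - h) * c, (R - h) * s), R)
    return (outer - inner) / (2.0 * h)

vt = (0.3, -0.2)  # arbitrary interior point
print("Laplacian:", laplacian_theta(vt, (-0.4, 0.5)), " expected:", -1.0 / math.pi)
print("max |dH/dn| on boundary:", max(abs(normal_derivative(vt, 0.5 * k)) for k in range(12)))
```

Away from the pole the Laplacian should approach −1/(πR²), and the normal derivative on the circle should vanish to within the finite-difference error.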

With this explicit solution, we can now calculate the deflection angle corresponding to the potential $\tilde\psi$, $\tilde{\boldsymbol\alpha} = \nabla\tilde\psi$, by obtaining the gradient of H,

$$\nabla_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta) = \frac{1}{2\pi}\left(\frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta|^2} - \frac{\boldsymbol\vartheta}{R^2} + \frac{|\boldsymbol\theta|^2\boldsymbol\vartheta - R^2\boldsymbol\theta}{R^4 - 2R^2\,\boldsymbol\vartheta\cdot\boldsymbol\theta + |\boldsymbol\vartheta|^2|\boldsymbol\theta|^2}\right).$$ (21)

Then,

$$\tilde{\boldsymbol\alpha}(\boldsymbol\vartheta) = 2\int_{\mathcal{U}} \mathrm{d}^2\theta\, \nabla_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta)\,\hat\kappa(\boldsymbol\theta) - \int_{\partial\mathcal{U}} \mathrm{d}s\, \nabla_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta)\,\hat{\boldsymbol\alpha}\cdot\boldsymbol{n},$$ (22)

where we have to deal with a pole in the first term of Eq. (21). Using a conformal mapping as described in Appendix A, we can handle this pole numerically. The pole of the third term lies outside the circle, so it is never reached for $\boldsymbol\vartheta\in\mathcal{U}$. However, if θ is on the circle (as occurs in the line integral in Eq. (22)), the third term can become quite large; hence, for points ϑ near the boundary, special care is needed to obtain an accurate solution.
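As a cross-check of Eq. (21), the analytic gradient can be compared with a finite-difference gradient of Eq. (20). The sketch below is illustrative only: the helper names, the test points and the step size are assumptions. It also illustrates the remark about the boundary: a short calculation suggests that for |θ| = R the third term of Eq. (21) coincides with the first one, so both grow as ϑ approaches θ.

```python
import math

def green_h(vt, th, R=1.0):
    """Neumann Green's function of Eq. (20) for a disc of radius R."""
    d2 = (vt[0] - th[0]) ** 2 + (vt[1] - th[1]) ** 2
    v2 = vt[0] ** 2 + vt[1] ** 2
    t2 = th[0] ** 2 + th[1] ** 2
    dot = vt[0] * th[0] + vt[1] * th[1]
    logs = math.log(d2 / R**2) + math.log(1.0 - 2.0 * dot / R**2 + v2 * t2 / R**4)
    return logs / (4.0 * math.pi) - (v2 + t2) / (4.0 * math.pi * R**2)

def grad_h_terms(vt, th, R=1.0):
    """The three vector terms inside the parentheses of Eq. (21), without the 1/(2 pi) prefactor."""
    dx, dy = vt[0] - th[0], vt[1] - th[1]
    d2 = dx * dx + dy * dy
    v2 = vt[0] ** 2 + vt[1] ** 2
    t2 = th[0] ** 2 + th[1] ** 2
    dot = vt[0] * th[0] + vt[1] * th[1]
    denom = R**4 - 2.0 * R**2 * dot + v2 * t2
    first = (dx / d2, dy / d2)
    second = (-vt[0] / R**2, -vt[1] / R**2)
    third = ((t2 * vt[0] - R**2 * th[0]) / denom, (t2 * vt[1] - R**2 * th[1]) / denom)
    return first, second, third

def grad_h(vt, th, R=1.0):
    """Analytic gradient of H with respect to vt, Eq. (21)."""
    terms = grad_h_terms(vt, th, R)
    return tuple(sum(t[i] for t in terms) / (2.0 * math.pi) for i in range(2))

def grad_h_numeric(vt, th, R=1.0, h=1e-6):
    """Central-difference gradient of Eq. (20) with respect to vt."""
    gx = (green_h((vt[0] + h, vt[1]), th, R) - green_h((vt[0] - h, vt[1]), th, R)) / (2.0 * h)
    gy = (green_h((vt[0], vt[1] + h), th, R) - green_h((vt[0], vt[1] - h), th, R)) / (2.0 * h)
    return gx, gy

vt, th = (0.3, -0.2), (-0.4, 0.5)
print("analytic:", grad_h(vt, th))
print("numeric: ", grad_h_numeric(vt, th))
```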

This Green’s function approach has several advantages over using the SOR method on a grid. First, the region on which the Neumann problem is defined can be chosen as a circle, instead of a rectangle for the SOR method. Second, the solution by means of the Green’s function yields higher accuracy, because it is not limited by the accuracy of finite differencing. Third, if one is interested in the deflection only at a few points, this can be calculated much faster than with the SOR method, which necessarily calculates the solution over the whole region.

3.4. Interpretation

The expression (18) for the deflection potential, or the expression (22) for the deflection angle, contains quite a number of terms. In order to obtain a better understanding of the various terms, we consider again the case where the deflection angle $\hat{\boldsymbol\alpha}$ is a pure gradient field, in which case $\tilde{\boldsymbol\alpha} = \hat{\boldsymbol\alpha} \equiv \boldsymbol\alpha$. Then the deflection angle at a point $\boldsymbol\vartheta\in\mathcal{U}$ can be decomposed into a deflection α_in which is caused by matter inside $\mathcal{U}$, and one due to matter outside $\mathcal{U}$, denoted by α_out. Thus we expect that

$$\boldsymbol\alpha(\boldsymbol\vartheta) = \boldsymbol\alpha_{\rm in}(\boldsymbol\vartheta) + \boldsymbol\alpha_{\rm out}(\boldsymbol\vartheta) = \frac{1}{\pi}\int_{\mathcal{U}} \mathrm{d}^2\theta\, \kappa(\boldsymbol\theta)\,\frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta|^2} + \boldsymbol\alpha_{\rm out}(\boldsymbol\vartheta).$$ (23)

Comparing Eq. (23) with Eq. (22), we find that

$$\boldsymbol\alpha(\boldsymbol\vartheta) = \boldsymbol\alpha_{\rm in}(\boldsymbol\vartheta) + \boldsymbol{A}(\boldsymbol\vartheta) - \boldsymbol{B}_{\rm in}(\boldsymbol\vartheta) - \boldsymbol{B}_{\rm out}(\boldsymbol\vartheta),$$ (24)

where

$$\begin{aligned} \boldsymbol{A}(\boldsymbol\vartheta) &= \int_{\mathcal{U}} \mathrm{d}^2\theta\, \kappa(\boldsymbol\theta)\left(2\nabla_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta) - \frac{1}{\pi}\frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta|^2}\right), \\ \boldsymbol{B}_{\rm in,out}(\boldsymbol\vartheta) &= \int_{\partial\mathcal{U}} \mathrm{d}s\, \nabla_\vartheta H(\boldsymbol\vartheta;\boldsymbol\theta)\;\boldsymbol\alpha_{\rm in,out}\cdot\boldsymbol{n}, \end{aligned}$$ (25)

where we split the deflection angle on the boundary into terms due to matter inside and outside $\mathcal{U}$. Both of the terms A and B_in are due to matter inside $\mathcal{U}$, whose deflection is covered entirely by the first term α_in, so that we expect that

$$\boldsymbol{A}(\boldsymbol\vartheta) = \boldsymbol{B}_{\rm in}(\boldsymbol\vartheta).$$ (26)

In Appendix B we show explicitly that this relation holds for the case of a circular region for which H is given by Eq. (20). Hence, Eq. (24) then provides a clean separation of the deflection angle coming from the inner mass distribution (α_in) and that coming from matter outside $\mathcal{U}$, given by −B_out. This relation may be of practical relevance for the numerical calculation of the lensing properties from a complicated mass distribution, for which the lensing quantities are only needed inside a limited region. Instead of calculating, for every point inside $\mathcal{U}$, a two-dimensional integral of the surface mass density κ over the whole lens plane, one can proceed as follows: first, one can restrict the integration range to the region $\mathcal{U}$ to get the contribution α_in. Second, one can calculate the contribution α_out for points on the boundary by integrating over the outer region of the lens in terms of a two-dimensional integral. Third, the contribution α_out for points inside the region $\mathcal{U}$ can then be obtained by a one-dimensional integration over the boundary curve.

In general, if α is given on the boundary, it contains contributions from both the inner and the outer part. In other words, the split of B into B_in and B_out is not provided in that case. The term A then compensates for the contribution B_in of B.

4. Illustrative example – a quadrupole lens and an isotropic SPT

Our goal is to find criteria allowing us to assess whether an SPT-transformed deflection law is valid (i.e. deviates from a gradient field by less than ε_acc), using the methods explained in the previous section. Thus, we set an upper limit on how much the transformed deflection law $\hat{\boldsymbol\alpha}$ is allowed to differ from its closest curl-free approximation $\tilde{\boldsymbol\alpha}$ before leading to a non-negligible shift of the lensed images. Since observed lens systems are usually fit by simple mass models with only a small number of free parameters, we do not expect the fit to be perfect. We always have to deal with observational uncertainties as well as the presence of substructure (Xu et al. 2010; Bradač et al. 2004; Kochanek & Dalal 2004; Mao & Schneider 1998) and line-of-sight inhomogeneities (Xu et al. 2012; Metcalf 2005). Therefore, we cannot reproduce observed positions better than a few milliarcseconds with a smooth mass model. Hence, as long as | Δα(θ) | is less than the smallest angular scale on which modeling with a smooth mass model is still meaningful, differences are of no practical relevance (SS14).

We need to choose a lens model to explore how seriously the SPT may affect lens modeling. First, we consider a situation similar to SS14, namely a quadrupole lens with external shear γ_p,

$$\boldsymbol\alpha = \bar\kappa(|\boldsymbol\theta|)\,\boldsymbol\theta - \begin{pmatrix} \gamma_{\rm p} & 0 \\ 0 & -\gamma_{\rm p} \end{pmatrix}\boldsymbol\theta,$$ (27)

which is deformed by an SPT corresponding to a radial stretching, as in Eq. (5). Specifically, we choose

$$\hat{\boldsymbol\beta}(\boldsymbol\beta) = \left(1 + \frac{f_2}{2\theta_{\rm E}^2}\,\beta^2\right)\boldsymbol\beta.$$ (28)

This deformation function is the lowest-order expansion of more general stretching functions, and its leading-order term is chosen such as to not yield an MST, to cleanly separate the effect of the MST from that of the more general SPT in this study. Furthermore, we choose as specific lens model a non-singular isothermal sphere (NIS), described by the mean convergence profile

$$\bar\kappa = \theta_{\rm E}\,\frac{1}{\sqrt{\theta_{\rm c}^2 + \theta^2}},$$ (29)

where θ_c is the core radius. For the rest of this paper, we fix the core to be θ_c = 0.1θ_E.
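The model of Eqs. (27)–(29) is simple enough to be written down directly. The sketch below assumes the SPT definition of SS14, $\hat{\boldsymbol\alpha}(\boldsymbol\theta) = \boldsymbol\theta - \hat{\boldsymbol\beta}(\boldsymbol\theta - \boldsymbol\alpha(\boldsymbol\theta))$, which is not restated in this section; angles are in units of θ_E, and all function names are illustrative. The half-curl is evaluated by central differences as a rough indicator of how far $\hat{\boldsymbol\alpha}$ is from a gradient field.

```python
import math

THETA_E = 1.0            # angles measured in units of theta_E
THETA_C = 0.1 * THETA_E  # core radius fixed as in the text

def kappa_bar(theta):
    """Mean convergence of the NIS, Eq. (29)."""
    return THETA_E / math.sqrt(THETA_C**2 + theta**2)

def alpha(th, gamma_p):
    """Deflection of the quadrupole lens, Eq. (27)."""
    kb = kappa_bar(math.hypot(th[0], th[1]))
    return (kb * th[0] - gamma_p * th[0], kb * th[1] + gamma_p * th[1])

def beta_hat(beta, f2):
    """Radial stretching of the source plane, Eq. (28)."""
    stretch = 1.0 + 0.5 * f2 * (beta[0] ** 2 + beta[1] ** 2) / THETA_E**2
    return (stretch * beta[0], stretch * beta[1])

def alpha_hat(th, gamma_p, f2):
    """SPT-transformed deflection, assuming alpha_hat = theta - beta_hat(theta - alpha)."""
    a = alpha(th, gamma_p)
    bh = beta_hat((th[0] - a[0], th[1] - a[1]), f2)
    return (th[0] - bh[0], th[1] - bh[1])

def half_curl(field, th, h=1e-5):
    """(d field_2/d theta_1 - d field_1/d theta_2) / 2 by central differences."""
    d2_dx = (field((th[0] + h, th[1]))[1] - field((th[0] - h, th[1]))[1]) / (2.0 * h)
    d1_dy = (field((th[0], th[1] + h))[0] - field((th[0], th[1] - h))[0]) / (2.0 * h)
    return 0.5 * (d2_dx - d1_dy)

point = (0.2, 0.15)
print("curl/2 of alpha    :", half_curl(lambda t: alpha(t, 0.1), point))
print("curl/2 of alpha_hat:", half_curl(lambda t: alpha_hat(t, 0.1, 0.55), point))
```

In this sketch the original deflection is curl-free to within numerical error, the transformed one is not, and the curl vanishes again when the external shear is switched off.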

To get a quantitative estimate on how large deviations of $\tilde{\boldsymbol\alpha}$ from $\hat{\boldsymbol\alpha}$ are tolerable before the lensing properties of the SPT deviate markedly from the original lens model, we take the Hubble Space Telescope (HST) as example. We estimate that the highest astrometric accuracy that can be achieved corresponds to about a tenth of a pixel in the ACS camera, or $\Delta\theta \approx 5\,{\rm mas} \approx 5\times10^{-3}\,\theta_{\rm E}$, where the last expression accounts for the fact that the typical Einstein radii of galaxy-scale lenses are of order one arcsecond. Thus, if the solution $\tilde{\boldsymbol\alpha}$ satisfies the condition (9) with $\epsilon_{\rm acc} = 5\times10^{-3}\,\theta_{\rm E}$ over the region | θ | ≤ 2 θ_E, we call the corresponding SPT allowed or valid.

4.1. Impact on the deflection law

The model we consider has two free parameters, the distortion parameter f₂ in the SPT (28), and the strength γ_p of the external shear. We start by exploring this parameter space to find the combinations that yield allowed transformations, using the methods described in the previous section. In Fig. 4, we display the maximum deviation |Δα|_max as a function of these two parameters. It shows a wide range of allowed parameter combinations, where the allowed range of f₂ decreases with increasing external shear. The white regions in Fig. 4 denote parameter combinations where |Δα|_max > 0.005 θ_E, and which are therefore not allowed according to our accuracy criterion.

Fig. 4

Values of |Δα| _max are plotted against the parameters f₂ from (28) and external shear strength γ_p. The colored region indicates allowed pairs of parameters that fulfill the |Δα| < 5 × 10^-3θ_E-criterion. For obtaining this figure, we used the SOR method.

In SS14, we speculated that the curl of $\hat{\boldsymbol\alpha}$ may yield a good indication for the deviation of the SPT-transformed deflection field from a gradient field. In this case, the curl $\hat\kappa_{\rm I} = \nabla\times\hat{\boldsymbol\alpha}/2$, which describes the antisymmetric part of the Jacobian, could be used as a proxy for | Δα |. For a quadrupole lens of the form (27) and the deformation law (28), the curl $\hat\kappa_{\rm I}$ is given in Eq. (42) of SS14,

$$\hat\kappa_{\rm I} \approx -\frac{\gamma_{\rm p}}{2}\, f_2 \left(\frac{\theta}{\theta_{\rm E}}\right)^2 \left[\gamma_{\rm p}^2 - (1-\bar\kappa)(2\gamma_{\rm m} + 1 - \bar\kappa) + 2\gamma_{\rm m}\gamma_{\rm p}\cos(2\varphi)\right]\sin 2\varphi,$$ (30)

where (θ, φ) are polar coordinates in the lens plane and $\gamma_{\rm m}(|\boldsymbol\theta|) = \kappa(|\boldsymbol\theta|) - \bar\kappa(|\boldsymbol\theta|)$ is the shear caused by the NIS lens.

Fig. 5

Values of $|\hat\kappa_{\rm I}|_{\rm max}$ are plotted against the parameters f₂ from (28) and external shear strength γ_p. The colored region indicates allowed pairs of parameters that were chosen such that they roughly correspond to $|\Delta\boldsymbol\alpha| < 5\times10^{-3}\,\theta_{\rm E}$.

Figure 5 shows the maximum of $\hat\kappa_{\rm I}$ as a function of external shear γ_p and deformation “strength” f₂, which indeed is very similar to Fig. 4. The actual difference between those two approaches is seen in Fig. 6. An approximately linear correlation with an expected but modest scatter can be seen. In fact, from that figure we obtain for our specific model that

$$0.16\,|\hat\kappa_{\rm I}|_{\rm max} \lesssim \frac{|\Delta\boldsymbol\alpha|_{\rm max}}{\theta_{\rm E}} \lesssim 0.3\,|\hat\kappa_{\rm I}|_{\rm max}.$$ (31)

For other models, the relation between | Δα |_max and $|\hat\kappa_{\rm I}|_{\rm max}$ will be different; nevertheless, we see that the curl of $\hat{\boldsymbol\alpha}$ indeed provides a useful indication for the validity of an SPT, since calculating $\hat\kappa_{\rm I}$ is much easier than obtaining the numerical solution for $\tilde{\boldsymbol\alpha}$.
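The use of the curl as a quick validity proxy can be sketched as follows: Eq. (30) is evaluated on a polar grid over | θ | ≤ 2 θ_E, and the empirical relation (31), which holds for this specific model only, brackets | Δα |_max. The grid resolution and function names are assumptions made for illustration; γ_m is obtained from Eq. (29) through the standard axisymmetric relation κ = κ̄ + (θ/2) dκ̄/dθ.

```python
import math

THETA_E = 1.0
THETA_C = 0.1 * THETA_E

def kappa_bar(theta):
    """Mean convergence of the NIS, Eq. (29)."""
    return THETA_E / math.sqrt(THETA_C**2 + theta**2)

def gamma_m(theta):
    """gamma_m = kappa - kappa_bar = (theta/2) d kappa_bar / d theta for the NIS."""
    return -0.5 * THETA_E * theta**2 / (THETA_C**2 + theta**2) ** 1.5

def kappa_hat_i(theta, phi, f2, gamma_p):
    """Curl term of Eq. (30)."""
    w = 1.0 - kappa_bar(theta)
    gm = gamma_m(theta)
    bracket = gamma_p**2 - w * (2.0 * gm + w) + 2.0 * gm * gamma_p * math.cos(2.0 * phi)
    return -0.5 * gamma_p * f2 * (theta / THETA_E) ** 2 * bracket * math.sin(2.0 * phi)

def max_abs_kappa_hat_i(f2, gamma_p, theta_max=2.0, n_r=400, n_phi=180):
    """Maximum of |kappa_hat_I| on a polar grid covering |theta| <= theta_max."""
    best = 0.0
    for i in range(1, n_r + 1):
        theta = theta_max * i / n_r
        for j in range(n_phi):
            phi = 2.0 * math.pi * j / n_phi
            best = max(best, abs(kappa_hat_i(theta, phi, f2, gamma_p)))
    return best

def delta_alpha_bracket(k_max):
    """Range of |Delta alpha|_max implied by Eq. (31); valid for this model only."""
    return 0.16 * k_max * THETA_E, 0.3 * k_max * THETA_E

k_max = max_abs_kappa_hat_i(f2=0.55, gamma_p=0.1)
low, high = delta_alpha_bracket(k_max)
print("max |kappa_hat_I| =", k_max)
print("|Delta alpha|_max between", low, "and", high, "theta_E")
```

For f₂ = 0.55 and γ_p = 0.1, the accuracy limit of 5 × 10⁻³ θ_E falls inside the bracket returned by this sketch, in line with this pair lying at the edge of the allowed region.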

Fig. 6

For every allowed combination f₂ and γ_p the values of | Δα |_max (Fig. 4) are plotted against $|\hat\kappa_{\rm I}|_{\rm max}$ (Fig. 5). A clear correlation between these two quantities can be seen.

Figure 7 illustrates how a specific deflection law in a region | θ | ≤ 2 θ_E is affected by an SPT. It shows | Δα(θ) | for a quadrupole lens with external shear γ_p = 0.1 and deformation strength f₂ = 0.55, which is the highest allowed for this value of the external shear strength and thus is expected to show the largest deviations of $\hat\kappa$ from the original mass profile. The figure shows that the largest deviations occur at an angle of 45° with respect to the external shear. This pattern, which is shown for one specific pair of f₂ and γ_p, is qualitatively the same for all f₂-γ_p-combinations.

Fig. 7

Map of | Δα(θ) | is shown for f₂ = 0.55 and γ_p = 0.1. The strong changes in the corners, i.e. θ > 2 θ_E, are biased by large numerical uncertainty and should be neglected.

4.2. Implications for the convergence

We show in Fig. 8 the comparison between original (κ) and SPT-transformed mass distribution ($\hat\kappa$) for three different allowed pairs of parameters, f₂ = 0.55 and γ_p = 0.1 (the same combination of parameters as in Fig. 7), f₂ = −0.55 and γ_p = 0.1, and f₂ = 1.2 and γ_p = 0.05. The lower panel of Fig. 8 shows the change of the radial profile as $\hat\kappa/\kappa_{\rm original}$.

Fig. 8

Upper panel shows the mass profile of the original NIS lens (solid curve), and that of three SPT-transformed lenses, with parameters f₂ and γ_p indicated by the labels. For all of these three models, $\Delta\alpha_{\rm max} \approx \epsilon_{\rm acc} = 5\times10^{-3}\,\theta_{\rm E}$. Since the transformed mass distributions have a finite ellipticity, the density is plotted as a function of the geometric mean of the major and minor semi-axis of the best-fitting ellipse to an isodensity contour, except for the case with negative f₂, for which the outer isodensity contours are not closing around the lens center; in this special case, the x-axis corresponds to the θ₁-axis. The convergence changes by up to 28% for radii smaller than 1 θ_E; at larger radii, a positive f₂ yields a significantly smaller convergence. Negative f₂ show an essentially mirrored behavior compared to positive f₂. This leads to a convergence $\hat\kappa$ that may not decrease monotonically. The lower panel shows the ratio between transformed and original mass profile.

The divergence of $\hat{\boldsymbol\alpha}$ (i.e. $\nabla\cdot\hat{\boldsymbol\alpha} = 2\hat\kappa$) was calculated analytically in SS14 (see their Eq. (41)) and can be used to compare our numerical results to the analytic solution. Specialized to our case, it reads

$$\begin{aligned} \hat\kappa = {}& \kappa_{\rm NIS} + \frac{f_2}{2}\left(\frac{\theta}{\theta_{\rm E}}\right)^2 \\ &\times \Bigl( \gamma_{\rm m}\bigl[2\gamma_{\rm p}^2 + 3(1-\bar\kappa)^2\bigr] - 2(1-\bar\kappa)\bigl[(1-\bar\kappa)^2 + 2\gamma_{\rm p}^2\bigr] \\ &\quad + \bigl[5\gamma_{\rm p}(1-\bar\kappa)^2 - 6\gamma_{\rm p}\gamma_{\rm m}(1-\bar\kappa) + \gamma_{\rm p}^3\bigr]\cos 2\varphi + \gamma_{\rm p}^2\gamma_{\rm m}\cos 4\varphi \Bigr), \end{aligned}$$ (32)

where again (θ, φ) are polar coordinates in the lens plane. The change $\Delta\kappa = \hat\kappa - \kappa_{\rm NIS}$ is proportional to the stretching parameter f₂, so that Δκ(−f₂) = −Δκ(f₂). This behavior can be seen in Fig. 8. Indeed, we checked that all numerically obtained deflection angles $\tilde{\boldsymbol\alpha}$ are such that their corresponding surface mass densities agree with the analytical prediction (32). For example, the numerical result for the parameter combination γ_p = 0.1 and f₂ = 0.55 deviates by less than $3\times10^{-3}$ from the analytical solution.

As seen from Eq. (32), the resulting mass distribution $\hat\kappa$ is no longer axi-symmetric; instead, its isodensity contours are nearly elliptical (due to the term proportional to cos(2φ)) with a small boxiness (due to the term proportional to cos(4φ)). Hence, we define the distance from the center generally as the geometric mean $\sqrt{\theta_1\theta_2}$ using the 1- and 2-axis of the elliptical isodensity contours. However, for sufficiently negative f₂, the outer isodensity contours are no longer concentric, i.e., they are not closed curves around the center of the lens. In addition, for large negative values of f₂, the radial profile can become non-monotonic. We consider such a behavior as non-physical, i.e., such resulting models will be irrelevant in practice.

The ellipticity of the transformed mass profiles is non-negligible as shown in Fig. 9 where ϵ, defined as the axis ratio 1- over 2-axis, is plotted as a function of radius. Integrating the analytic representation (32) of $\hat\kappa$ up to 1 θ_E it can be shown that the mass enclosed within the Einstein radius of the original lens is conserved, independent of the chosen mass profile κ(θ) (see Appendix C).
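The conservation of the enclosed mass can be illustrated numerically for the NIS by integrating the SPT-induced term of Eq. (32) over a disc. In the sketch below, the Einstein radius of the original lens is taken as the radius where κ̄ = 1, which for Eq. (29) is $\sqrt{\theta_{\rm E}^2 - \theta_{\rm c}^2}$ and thus slightly smaller than the parameter θ_E; this identification, the quadrature scheme and all names are assumptions of the example, not the derivation of Appendix C.

```python
import math

THETA_E = 1.0
THETA_C = 0.1 * THETA_E

def kappa_bar(theta):
    """Mean convergence of the NIS, Eq. (29)."""
    return THETA_E / math.sqrt(THETA_C**2 + theta**2)

def gamma_m(theta):
    """gamma_m = kappa - kappa_bar = (theta/2) d kappa_bar / d theta for the NIS."""
    return -0.5 * THETA_E * theta**2 / (THETA_C**2 + theta**2) ** 1.5

def kappa_nis(theta, phi=0.0):
    """Convergence of the original NIS lens."""
    return kappa_bar(theta) + gamma_m(theta)

def delta_kappa(theta, phi, f2, gamma_p):
    """SPT-induced change kappa_hat - kappa_NIS according to Eq. (32)."""
    w = 1.0 - kappa_bar(theta)
    gm = gamma_m(theta)
    monopole = gm * (2.0 * gamma_p**2 + 3.0 * w**2) - 2.0 * w * (w**2 + 2.0 * gamma_p**2)
    quadrupole = (5.0 * gamma_p * w**2 - 6.0 * gamma_p * gm * w + gamma_p**3) * math.cos(2.0 * phi)
    boxiness = gamma_p**2 * gm * math.cos(4.0 * phi)
    return 0.5 * f2 * (theta / THETA_E) ** 2 * (monopole + quadrupole + boxiness)

def disc_integral(func, radius, n_r=1000, n_phi=32):
    """Integral of func(theta, phi) over a disc: Simpson in radius, periodic trapezoid in angle."""
    h = radius / n_r  # n_r must be even for Simpson's rule
    total = 0.0
    for i in range(n_r + 1):
        theta = i * h
        weight = 1.0 if i in (0, n_r) else (4.0 if i % 2 else 2.0)
        ring = sum(func(theta, 2.0 * math.pi * j / n_phi) for j in range(n_phi))
        total += weight * theta * ring * (2.0 * math.pi / n_phi)
    return total * h / 3.0

theta_ein = math.sqrt(THETA_E**2 - THETA_C**2)  # radius where kappa_bar = 1
mass_original = disc_integral(kappa_nis, theta_ein)
mass_change = disc_integral(lambda t, p: delta_kappa(t, p, 0.55, 0.1), theta_ein)
print("original mass (units of theta_E^2):", mass_original)
print("relative change from the SPT      :", mass_change / mass_original)
```

The relative change returned by this sketch is consistent with zero to within the quadrature error, while the same integral taken to other radii is generally non-zero.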

Fig. 9

Radial dependence of the axis ratio ϵ. In the unperturbed case the isodensity contours are circular, i.e., $\epsilon(\sqrt{\theta_1\theta_2}) = 1$. For radii $\sqrt{\theta_1\theta_2} < 1\,\theta_{\rm E}$, the SPT-transformed mass distribution shows deviations of up to 5% from circularity, whereas for larger radii the deviations can be up to 20%. The convergence map for f₂ = −0.55 and γ_p = 0.1 does not show concentric isodensity contours for radii larger than 1.3 θ_E and therefore no ellipticity as a function of radius can be determined.

5. Characterization of the modified mass distribution

The SPT leads to a modified deflection angle of the lens which yields exactly the same astrometric and photometric observational properties as the original mass distribution. For those modified profiles $\hat{\boldsymbol\alpha}$ for which a deflection potential $\tilde\psi$ can be found such that the corresponding difference Δα is sufficiently small, the modified surface mass density $\hat\kappa$ provides a viable alternative to the original mass model κ of the lens. In this section we want to consider a diagnostic for the change of the mass profile, both regarding the radial slope and the angular structure of the lens. Since the strong lensing properties of the lens can only be probed in the inner part of the mass distribution, we will apply these diagnostics only to those regions where multiple images can occur, i.e., | θ | ≲ 2θ_E.

5.1. Radial mass profile

The SPT changes the radial mass profile of the lens. We consider situations in which the original lens is described by a “simple” mass distribution, i.e., an NIS. Combined with a “mild” SPT the resulting $\hat\kappa$ remains simple, e.g., still shows closed, concentric isodensity contours.

The mass-sheet transformation is a special case of the SPT, and it is well known that the MST changes the radial profile of the lens. In order to highlight the new feature of the SPT not contained in the MST, we aim at a measure for the radial profile which is invariant under the MST. The MST transforms all derivatives of κ by a constant factor λ, hence it leaves the ratio of derivatives unchanged. Consequently, one possible diagnostic for the effect of the SPT is the radial profile of such ratios, e.g., ⟨κ⟩′′/⟨κ⟩′.

In particular, if the original mass profile is a power law, ⟨κ⟩(θ) ∝ θ^{− s}, then we have θ ⟨κ⟩′′/⟨κ⟩′ = −(s + 1); hence, any deviation from this constant value indicates the effect of the SPT on the modified mass profile $\hat\kappa$. However, if there is no analytical expression of κ and $\hat\kappa$, the ratio of derivatives is very sensitive to numerical noise, and therefore of little practical interest. We therefore consider hereafter alternative tests.

Noting that the MST yields a multiplication of 1−κ(θ) by λ, the ratio

$$R_\kappa(\theta) = \frac{1 - \langle\kappa\rangle(\theta)}{\langle\kappa\rangle'(\theta)},$$ (33)

is well defined for monotonically decreasing mass profiles and invariant under the MST. Figure 10 shows R_κ for the NIS and various SPT-transformed models. The variations of R_κ are particularly significant above one Einstein radius, in regions where the SPT-transformed profiles also deviate more strongly from the original profile. Despite the fact that $\hat\kappa$ deviates from κ_original by more than 20% within θ_E when f₂ = 1.2 (Fig. 8), the most extreme changes of R_κ(θ) reach no more than ~10% within one Einstein radius. As expected, R_κ deviates more strongly from the original profile when | f₂ | is large. Negative values of f₂ (not shown) are qualitatively similar (but mirrored w.r.t. $R^{\rm NIS}_\kappa$) to the situation encountered for positive f₂. However, at radii θ ~ 1.2 θ_E, $\hat\kappa$ stops being monotonically decreasing, and R_κ diverges.
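The MST invariance of R_κ is easy to verify numerically. The following sketch assumes the standard form of the MST, κ_λ = λκ + (1 − λ), and uses the NIS convergence that follows from Eq. (29) via κ = κ̄ + (θ/2) dκ̄/dθ; the derivative in Eq. (33) is approximated by a central difference, and the names are illustrative.

```python
import math

THETA_E = 1.0
THETA_C = 0.1 * THETA_E

def kappa_nis(theta):
    """Convergence of the NIS implied by the mean convergence of Eq. (29)."""
    return THETA_E * (THETA_C**2 + 0.5 * theta**2) / (THETA_C**2 + theta**2) ** 1.5

def mst(kappa, lam):
    """Mass-sheet transformed profile, assuming kappa_lambda = lam * kappa + (1 - lam)."""
    return lambda theta: lam * kappa(theta) + (1.0 - lam)

def r_kappa(kappa, theta, h=1e-5):
    """R_kappa of Eq. (33), with the derivative taken by central differences."""
    slope = (kappa(theta + h) - kappa(theta - h)) / (2.0 * h)
    return (1.0 - kappa(theta)) / slope

for theta in (0.5, 1.0, 1.5):
    print(theta, r_kappa(kappa_nis, theta), r_kappa(mst(kappa_nis, 0.84), theta))
```

Both columns agree to within the finite-difference error, since numerator and denominator of Eq. (33) are each multiplied by λ.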

Fig. 10

Top: quantity R_κ(θ) (Eq. (33)) calculated for an NIS with external shear γ_p (cf. Sect. 4; solid black) and for various SPT-transformed models with SPT of the form 1 + f₂/ 2 (β/θ_E)² (Eq. (28)). The range of positive values of f₂ allowed by $|\Delta\alpha_{\rm max}| < 5\times10^{-3}\,\theta_{\rm E}$ (Fig. 4) is explored for two different choices of the shear: γ_p = 0.05 (blue) and γ_p = 0.1 (red). While R_κ is conserved under an MST, it is not under an SPT, with deviations that can reach tens of percent. Bottom: for each curve of the top panel, we show the difference between R_κ of the original NIS model and of the SPT-transformed model.

Another possibility to characterize the radial profile change is through the aperture mass (see Schneider 1996). Consider a function $U(\theta;\theta_0) = \theta_0^{-2}\,u(\theta/\theta_0)$ such that u(x) is non-zero only for x ≤ 1; hence, θ₀ characterizes the range of support of U(θ;θ₀). Furthermore, we require that the filter function U has a vanishing two-dimensional integral over its support, which means that

$$\int_0^1 \mathrm{d}x\; x\,u(x) = 0.$$

Then we define the aperture mass as

$$M_{\rm ap}(\theta_0) = \int \mathrm{d}^2\theta\; \kappa(\boldsymbol\theta)\,U(|\boldsymbol\theta|;\theta_0) = 2\pi\int \mathrm{d}\theta\;\theta\,\langle\kappa\rangle(\theta)\,U(\theta;\theta_0).$$ (34)

The mass-sheet transformation (1) leads to multiplication of M_ap by a factor λ, whereas the additive term in Eq. (1) drops out, due to the compensated nature of the filter function U. Thus, if we consider the ratio of the aperture mass for two different scale lengths θ₀, the factor λ drops out, and this ratio M_ap(θ₁) /M_ap(θ₂) is invariant under the MST.

Consider again a power-law density profile, ⟨κ⟩(θ) = (1−s/ 2)(θ/θ_E)^{− s}, with 0 < s < 2, where θ_E is the Einstein radius in case of axi-symmetry. Then,

$$M_{\rm ap} = (2-s)\,\pi\left(\frac{\theta_0}{\theta_{\rm E}}\right)^{-s}\int_0^1 \mathrm{d}x\; x^{1-s}\,u(x),$$

and M_ap(θ₁) /M_ap(θ₂) = (θ₁/θ₂)^{− s}. We thus define the effective slope

$$s_{\rm eff} := \frac{\ln\left[M_{\rm ap}(\theta_1)/M_{\rm ap}(\theta_2)\right]}{\ln(\theta_2/\theta_1)},$$ (35)

so that for a mass profile of the form ⟨κ⟩(θ) = a + bθ^{− s}, s_eff = s.

One can think of a number of appropriate weight functions u(x) and aperture scales θ_i to characterize the modified mass profile. The simplest form would be the sum of two delta functions, u(x) = δ(x−x₀)−x₀δ(x−1), with x₀ < 1, for which M_ap(θ₀) = 2πx₀[⟨κ⟩(x₀θ₀)−⟨κ⟩(θ₀)]. Furthermore, choosing θ₂ = θ₁/x₀, the ratio of aperture masses becomes

$$\frac{M_{\rm ap}(\theta_1)}{M_{\rm ap}(\theta_1/x_0)} = \frac{\langle\kappa\rangle(x_0\theta_1) - \langle\kappa\rangle(\theta_1)}{\langle\kappa\rangle(\theta_1) - \langle\kappa\rangle(\theta_1/x_0)}.$$ (36)

In the case of (1−x₀) ≪ 1, the expression (35) becomes

$$s_{\rm eff} = -1 - \frac{\theta_1\,\langle\kappa\rangle''(\theta_1)}{\langle\kappa\rangle'(\theta_1)} + \mathcal{O}\left([1-x_0]^2\right).$$ (37)

Hence, we see that in this case s_eff depends just on the ratio of second to first derivative, and reduces to s for a power-law mass profile with slope s.
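The two-delta filter lends itself to a compact numerical check of Eqs. (36) and (37). The sketch below is illustrative: the profiles, their parameter values and the function names are assumptions. For a power law with an additive constant, Eq. (36) should return s for any x₀, and for a non-power-law profile the value for x₀ close to 1 should approach the derivative expression of Eq. (37).

```python
import math

def s_eff_delta(kappa, theta1, x0):
    """Effective slope, Eq. (35), for the two-delta filter with theta2 = theta1 / x0 (Eq. (36))."""
    ratio = (kappa(x0 * theta1) - kappa(theta1)) / (kappa(theta1) - kappa(theta1 / x0))
    return math.log(ratio) / math.log(1.0 / x0)

def s_eff_limit(kappa, theta1, h=1e-3):
    """Leading term of Eq. (37), -1 - theta kappa'' / kappa', from central differences."""
    k_minus, k_zero, k_plus = kappa(theta1 - h), kappa(theta1), kappa(theta1 + h)
    first = (k_plus - k_minus) / (2.0 * h)
    second = (k_plus - 2.0 * k_zero + k_minus) / h**2
    return -1.0 - theta1 * second / first

def power_law(theta, s=0.7, a=0.2, b=0.5):
    """Hypothetical test profile a + b theta^(-s)."""
    return a + b * theta ** (-s)

def nis(theta, theta_e=1.0, theta_c=0.1):
    """NIS convergence implied by Eq. (29), used here as a non-power-law example."""
    return theta_e * (theta_c**2 + 0.5 * theta**2) / (theta_c**2 + theta**2) ** 1.5

print("power law, x0 = 0.5  :", s_eff_delta(power_law, 1.3, 0.5))
print("NIS, x0 = 0.999      :", s_eff_delta(nis, 1.0, 0.999))
print("NIS, derivative limit:", s_eff_limit(nis, 1.0))
```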

More practical choices of u would be such that the profile is probed over an annulus around the Einstein radius θ_E. For example, one could use the compensated filter function

$$u(x) = \begin{cases} \dfrac{1}{x}\,(x-x_0)\,(2x-x_0-1)\,(x-1) & \text{for } x_0\le x\le 1, \\ 0 & \text{else}. \end{cases}$$ (38)

Figure 11 shows M_ap as a function of θ₀, fixing x₀ = 1/2 in Eq. (38). As expected, for two profiles transformed into each other via an MST, the ratio of aperture masses is independent of θ₀ and equals λ. Conversely, M_ap(θ₀) of the SPT-transformed profiles intersects the aperture mass “function” of the original profile (i.e. NIS), and the ratio between the two curves changes with θ₀. The radius at which the curves intersect is almost independent of the value of f₂. This can easily be deduced from the apparent self-similarity of the SPT-transformed mass density profiles (Fig. 8) for various values of f₂.
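The aperture-mass diagnostics of Eqs. (34), (35) and (38) can be sketched in a few lines. The quadrature (composite Simpson rule), the test profile and the function names below are assumptions chosen for illustration; the MST is again assumed to have the standard form κ_λ = λκ + (1 − λ).

```python
import math

def u_filter(x, x0=0.5):
    """Compensated filter function of Eq. (38)."""
    if x0 <= x <= 1.0:
        return (x - x0) * (2.0 * x - x0 - 1.0) * (x - 1.0) / x
    return 0.0

def simpson(func, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of intervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0

def aperture_mass(kappa, theta0, x0=0.5):
    """M_ap of Eq. (34) for an axisymmetric profile, written in terms of x = theta / theta0."""
    return 2.0 * math.pi * simpson(lambda x: x * kappa(x * theta0) * u_filter(x, x0), x0, 1.0)

def s_eff(kappa, theta1, theta2, x0=0.5):
    """Effective slope of Eq. (35)."""
    ratio = aperture_mass(kappa, theta1, x0) / aperture_mass(kappa, theta2, x0)
    return math.log(ratio) / math.log(theta2 / theta1)

def power_law(theta, s=0.8, a=0.3, b=0.5):
    """Hypothetical test profile a + b theta^(-s)."""
    return a + b * theta ** (-s)

lam = 0.84
mst_profile = lambda theta: lam * power_law(theta) + (1.0 - lam)
print("integral of x u(x):", simpson(lambda x: x * u_filter(x), 0.5, 1.0))
print("s_eff             :", s_eff(power_law, 2.0, 1.0))
print("MST ratio of M_ap :", aperture_mass(mst_profile, 1.0) / aperture_mass(power_law, 1.0))
```

In this sketch the filter integrates to zero, the effective slope recovers s despite the additive constant, and the MST rescales M_ap by λ, so ratios of aperture masses at two radii are unaffected.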

Figure 11 motivates a choice of radii corresponding to extrema of M_ap(θ₀) to calculate aperture mass ratios, such as θ₁ = 2 θ_E and θ₂ = θ_E. Then, M_ap(θ₁) will probe the annulus θ_E < θ < 2 θ_E, while for θ₂ ~ θ_E the annulus 0.5 θ_E ≤ θ ≤ θ_E would be probed. Figure 12 shows normalized aperture mass ratios M_ap(θ₁) /M_ap(θ₂) for the various SPT-transformed profiles studied in the previous section. We see that larger aperture mass ratios are found for larger values of f₂. In addition, the ratio depends only weakly on the shear amplitude γ_p, which means that the radial deformation of the mass profile produced by the SPT is mostly governed by the amplitude of f₂.

Fig. 11

Top: M_ap(θ₀) (Eq. (34)) as a function of the “aperture” θ₀. The filter function u(x) defined by Eq. (38), using x₀ = 0.5, has been used such that for M_ap(θ₀), the annulus [θ₀/ (2θ_E),θ₀/θ_E] is probed. The black curve shows M_ap for the NIS profile and the blue curves for the SPT-transformed profiles with γ_p = 0.05 and various values of f₂. The green curve shows M_ap(θ₀) for an MST transformed version of the NIS profile. Bottom: ratio between M_ap derived for the various transformed profiles and for the original NIS profile.

Fig. 12

Ratios of aperture mass between θ₁ = 2 θ_E and θ₂ = θ_E for SPT-transformed profiles with various values of f₂. The ratios of aperture masses are normalized by the corresponding aperture ratios estimated for the original NIS profile (horizontal bar). The blue diamonds are for a shear γ_p = 0.05, and the red squares when γ_p = 0.1.

5.2. A specific lens model

Here, we apply the previous tests to three mass density profiles studied in SS13 and SS14, and used in those papers to illustrate degeneracies produced by the MST and the SPT. The reference model is a composite model consisting of the sum of a (spherically symmetric) Hernquist component to describe the baryonic component, and a (spherical) generalized Navarro-Frenk-White (gNFW) density profile to describe the dark matter component of the galaxy. In addition, an external shear of amplitude γ_p = 0.1 is considered. Complex sets of lensed images from an ensemble of sources were generated with that model, and found to be all equally well reproduced by two (single) power-law profiles: a global power law, with an almost isothermal density slope γ′ = 1.98 (hereafter model M1), and a local power-law profile with a core radius θ_c = 0.1′′ and a slope γ′= 2.2 (hereafter model M2). We have applied the tests introduced in the previous subsection to these profiles to identify the nature of the degeneracy between the models. Figure 13 shows the difference in R_κ between the original profile and the transformed ones, i.e., ΔR_κ = R_κ(original)−R_κ(transformed). For comparison, we also show ΔR_κ obtained for an SPT with f₂ = 0.11. This figure suggests that indeed, this degeneracy is similar to an SPT.

The other diagnostic we present consists in calculating the aperture mass M_ap of the profiles. Figure 14 shows M_ap as a function of θ₀. In addition to the aperture mass calculated for the individual profiles, we also show the aperture mass corresponding to two different MST-transformed mass density profiles. As explained in SS13, model M1 is close to an MST transformed version of the composite model⁴ with λ = 0.84. On the other hand, Fig. 4 of SS14 shows that the MST contribution to M2 corresponds to λ = 0.932. Figure 14 is qualitatively similar to Fig. 11 but there is an offset of M_ap for M1 and M2 compared to the composite model. The reason is probably that the M1 and M2 profiles are transformed versions of the composite model via both an MST and an SPT. The MST contribution with λ = 0.84 is larger for M1 than for M2, for which λ ~ 0.93.

Fig. 13

Difference between R_κ calculated for three different pairs of profiles: In blue, an NIS and an SPT-transformed model with f₂ = 0.11 and γ_p = 0.1; in magenta, a composite Hernquist+gNFW model and a power-law model (M1); in red, Hernquist+gNFW model and a cored power-law model (M2). The shape of ΔR_κ for the models presented in SS13 and SS14 is qualitatively similar to that observed for the fiducial SPT model presented in Sect. 4.

Fig. 14

Top: M_ap(θ₀) (Eq. (34)) as a function of θ₀, for the composite Hernquist+gNFW model (black), for the power-law model M1 (blue), and the cored power-law model M2 (red). The dashed red (blue) profile shows M_ap(θ₀) for an MST-transformed version of the composite model with λ = 0.93 (resp. 0.84). Bottom: ratio of M_ap(θ₀) between the “transformed” models and the composite model. The dashed curves correspond to MST-transformed versions of the composite model, and represent the contribution of the MST to M1 and M2. The solid red and blue curves suggest that the remainder of the degeneracy can be associated with an SPT.

6. Discussion – Implications of the SPT for strong lensing

In this paper we have studied several aspects of the SPT, an invariance transformation of the deflection angle that leaves all multiple-image properties of gravitational lenses invariant. The central question of whether there exists a gravitational lensing potential which gives rise to a deflection angle sufficiently close to the SPT-transformed one (which in general is not curl-free) has been explored for a particular class of lens models, namely an NIS with external shear and an SPT given as a radial stretching of the source plane. The radial stretching deformation was chosen such that the classical MST did not contribute to altering the original deflection, since we are only interested in higher-order effects that go beyond the well-known MST. This example has shown that, for a large range of parameter pairs (external shear and distortion parameter of the radial stretching), there indeed exist lensing potentials whose associated deflection is sufficiently close to the one obtained from the SPT that the two cannot be distinguished observationally. We conducted this study by formulating an action as the integral over the squared difference of these two deflection angles, yielding a Neumann problem. We gave a detailed description of how this problem can be solved; these methods are expected to be useful for future theoretical studies and applications of the SPT.

We have considered only one criterion for the validity of an SPT, namely that the corresponding gradient deflection field does not deviate from the SPT-transformed deflection by more than 5 × 10⁻³ θ_E. Changing this observationally motivated limit to a different value will modify the space of allowed parameter combinations. For the example considered here, we expect that the allowed range of the stretching parameter f₂ for a given external shear will be proportional to the allowed maximum deviation of these two deflection angles.

We point out that our method of obtaining a gradient deflection law in the form of the variational principle (6) does not necessarily yield an “optimal” modified deflection. As briefly discussed in Sect. 3.1, one could imagine alternative constraints for finding a gradient deflection “close to $\hat{\boldsymbol\alpha}$”. In particular, finding a gradient field whose maximum deviation from $\hat{\boldsymbol\alpha}$ is minimized over the region of relevance would be a promising ansatz which, however, is analytically challenging, if at all doable. Nevertheless, the solution obtained in this paper yields valuable insight into the freedom of lens model choices offered by the SPT.

The properties of the mass distribution resulting from an SPT were also studied in detail. In contrast to the MST, which is a special case of an SPT, the more general SPT gives rise to non-monotonic changes in the radial mass profile, and to the generation of a finite ellipticity even if the original mass distribution was axi-symmetric. Hence, the SPT offers a much larger range of mass profile modifications which leave all strong lensing observables invariant than does the MST. This more complex class of invariance transformations is of particular interest because it may be of great relevance when trying to fit real lens systems (which are expected to have a rather complex mass profile; see, e.g., Xu et al. 2016) with simple lens models. The fact that simple mass models yield satisfactory fits even in cases with a rich observed image structure may be a consequence of some SPT which transforms the deflection of the true mass distribution into that of a simple mass model. Ignoring the potential complexity of the real mass distribution, and thus the possibility that the SPT may be acting, may lead to biases in estimates of physical parameters of the lens system.

We have defined several diagnostic quantities which can distinguish a general SPT from a pure MST. Applying these diagnostics to the special case of nearly degenerate lens models studied in two earlier papers, we conclude that this degeneracy can to a large degree be accounted for by an MST, but that a non-negligible contribution comes from a more general SPT. Hence, an SPT has been found “empirically”, even before the concept of the SPT was developed. In that sense, the SPT is not just a “theoretical possibility” for obtaining different but observationally equivalent mass models, but describes degeneracies which actually occur in real lens modeling.

¹ Although time delay ratios stay constant.

² E.g., the last term in Eq. (21) would be missing.

³ We note that this extrapolation has to be carried out with the deflection angle, not with the potential, since the latter is determined only up to an additive constant – which may depend on the iteration step m and the number of grid points.

⁴ In SS13, we reported λ = β_fid/β_PL = 1.19, which is the inverse of the λ used here, defined such that κ_PL = λκ_fid + (1−λ).

Acknowledgments

We would like to thank Bastian Orthen for valuable discussions, Olivier Wertz for valuable comments on this paper, and the referee, Matthias Bartelmann, for his constructive comments and advice. Part of this work was supported by the German Deutsche Forschungsgemeinschaft, DFG project numbers SL 172/1-1 and SCHN 342/13-1. Sandra Unruh is a member of the International Max Planck Research School (IMPRS) for Astronomy and Astrophysics at the Universities of Bonn and Cologne. Dominique Sluse is supported by a Back to Belgium grant from the Belgian Federal Science Policy (BELSPO).

References

Bartelmann, M. 2010, Class. Quantum Grav., 27, 233001
Bradač, M., Schneider, P., Lombardi, M., et al. 2004, A&A, 423, 797
Coe, D., Fuselier, E., Benítez, N., et al. 2008, ApJ, 681, 814
Diego, J. M., Sandvik, H. B., Protopapas, P., et al. 2005, MNRAS, 362, 1247
Falco, E. E., Gorenstein, M. V., & Shapiro, I. I. 1985, ApJ, 289, L1
Hezaveh, Y. D., Marshall, P. J., & Blandford, R. D. 2015, ApJ, 799, L22
Kochanek, C. S. 2006, in Saas-Fee Advanced Course 33: Gravitational Lensing: Strong, Weak and Micro, eds. G. Meylan, P. Jetzer, P. North, et al., 91
Kochanek, C. S., & Dalal, N. 2004, ApJ, 610, 69
Liesenborgs, J., & De Rijcke, S. 2012, MNRAS, 425, 1772
Mao, S., & Schneider, P. 1998, MNRAS, 295, 587
Metcalf, R. B. 2005, ApJ, 629, 673
Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 1996, Numerical recipes in C (New York: Cambridge University Press)
Saha, P., & Williams, L. L. R. 1997, MNRAS, 292, 148
Schneider, P. 2006, in Saas-Fee Advanced Course 33: Gravitational Lensing: Strong, Weak and Micro, eds. G. Meylan, P. Jetzer, P. North, et al., 1
Schneider, P., & Sluse, D. 2013, A&A, 559, A37
Schneider, P., & Sluse, D. 2014, A&A, 564, A103
Seitz, S., & Schneider, P. 2001, A&A, 374, 740
Winn, J. N., Rusin, D., & Kochanek, C. S. 2004, Nature, 427, 613
Xu, D. D., Mao, S., Cooper, A. P., et al. 2010, MNRAS, 408, 1721
Xu, D. D., Mao, S., Cooper, A. P., et al. 2012, MNRAS, 421, 2553
Xu, D., Sluse, D., Schneider, P., et al. 2016, MNRAS, 456, 739

Appendix A: Practical integration of Eq. (22) in a circular region

Calculating the deflection angle $\nabla\tilde\psi$ in Eq. (18) by integrating the product $\nabla_{\boldsymbol\vartheta} H\,\hat\kappa$ over the circle poses a challenge, due to the pole of the first term in Eq. (21). To integrate over this pole, polar coordinates centered on the pole position ϑ need to be chosen. This can be done by a translation of the integration variable to x = θ − ϑ, and integrating in the polar coordinates of x. However, the integration range of the polar angle will depend on |x|, according to the geometrical overlap of circles centered on the origin and those centered on θ.

A better method is obtained by a conformal mapping of the form

$$x=\frac{\theta-\vartheta}{R-\vartheta^* \theta/R},$$ (A.1)

where we now use complex notation, i.e., x, ϑ and θ are complex numbers with components ϑ = ϑ₁ + iϑ₂ etc. and an asterisk denotes complex conjugation. This conformal mapping maps the circle |θ| < R onto the unit circle |x| < 1, and the singularity point θ = ϑ is mapped onto the origin x = 0. For example, setting $\theta = R\,{\rm e}^{{\rm i}\varphi}$, we get

$$x=\frac{R\,{\rm e}^{{\rm i}\varphi}-\vartheta}{R-\vartheta^* {\rm e}^{{\rm i}\varphi}} ={\rm e}^{{\rm i}\varphi}\,\frac{R-\vartheta\, {\rm e}^{-{\rm i}\varphi}}{\left(R-\vartheta\, {\rm e}^{-{\rm i}\varphi}\right)^*},$$

from which it is immediately seen that |x| = 1. The inverse of the transformation (A.1) is readily obtained,

$$\theta=\frac{R x+\vartheta}{1+\vartheta^* x/R},$$ (A.2)

from which one can easily check that the unit circle |x| = 1 is mapped onto the circle |θ| = R. In components, Eq. (A.2) reads

$$\begin{aligned} \theta_1&=\frac{R x_1+\vartheta_1(1+|x|^2)+[x_1(\vartheta_1^2-\vartheta_2^2)+2\vartheta_1\vartheta_2x_2]/R} {1+2\boldsymbol\vartheta\cdot\boldsymbol x/R+|\boldsymbol\vartheta|^2 |\boldsymbol x|^2/R^2}, \\ \theta_2&=\frac{R x_2+\vartheta_2(1+|x|^2)+[2\vartheta_1\vartheta_2 x_1 - x_2(\vartheta_1^2-\vartheta_2^2)]/R} {1+2\boldsymbol\vartheta\cdot\boldsymbol x/R+|\boldsymbol\vartheta|^2 |\boldsymbol x|^2/R^2}. \end{aligned}$$ (A.3)

The Jacobi determinant of the transformation x → θ, needed for the integration, is

$$\left| \frac{\partial \boldsymbol\theta}{\partial \boldsymbol x}\right| =\frac{R^2 (R^2-|\boldsymbol\vartheta|^2)^2}{(R^2+2 R\, \boldsymbol\vartheta\cdot\boldsymbol x+|\boldsymbol\vartheta|^2 |\boldsymbol x|^2)^2},$$ (A.4)

which is non-zero for all x inside the unit circle and |ϑ| < R.

As a sanity check, we calculate the area of the circle in the transformed coordinates,

$$\begin{aligned} R^2\pi&=\int_{\mathcal U}\mathrm{d}^2\theta = \int_{\mathcal C}\mathrm{d}^2 x\;\left| \frac{\partial \boldsymbol\theta}{\partial \boldsymbol x}\right| \\ &=R^2 (R^2-|\boldsymbol\vartheta|^2)^2\int_0^1\mathrm{d} x\;x \int_0^{2\pi}\frac{\mathrm{d} \varphi}{(R^2+2 R\, \boldsymbol\vartheta\cdot\boldsymbol x+|\boldsymbol\vartheta|^2 |\boldsymbol x|^2)^2}. \end{aligned}$$ (A.5)

The inner integral yields 2π(R² + |x|²|ϑ|²)/(R² − |x|²|ϑ|²)³, the outer integral then gives π/(R² − |ϑ|²)², and we re-obtain the area πR².
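The area check of Eq. (A.5) can also be done numerically. The following is a minimal sketch, assuming a plain midpoint rule in |x| and a uniform grid in the polar angle; the function names and the test values of ϑ and R are illustrative choices, not part of the method described in the paper.

```python
import numpy as np


def to_x(theta, vartheta, R):
    """Conformal mapping of Eq. (A.1); all positions are complex numbers."""
    return (theta - vartheta) / (R - np.conj(vartheta) * theta / R)


def to_theta(x, vartheta, R):
    """Inverse mapping of Eq. (A.2)."""
    return (R * x + vartheta) / (1.0 + np.conj(vartheta) * x / R)


def jacobian(x, vartheta, R):
    """Jacobi determinant |d theta / d x| of Eq. (A.4)."""
    dot = (np.conj(vartheta) * x).real  # scalar product vartheta . x
    denom = R**2 + 2.0 * R * dot + abs(vartheta)**2 * np.abs(x)**2
    return R**2 * (R**2 - abs(vartheta)**2)**2 / denom**2


def disk_area(vartheta, R, n_r=400, n_phi=400):
    """Area of the disk |theta| < R, integrated over the unit disk in x."""
    r = (np.arange(n_r) + 0.5) / n_r                 # radial midpoints in (0, 1)
    phi = 2.0 * np.pi * np.arange(n_phi) / n_phi     # uniform polar angles
    rr, pp = np.meshgrid(r, phi, indexing="ij")
    x = rr * np.exp(1j * pp)
    integrand = jacobian(x, vartheta, R) * rr        # d^2x = |x| d|x| d(phi)
    return integrand.sum() * (1.0 / n_r) * (2.0 * np.pi / n_phi)


R_DISK = 1.5
VARTHETA = 0.6 + 0.3j   # arbitrary test point with |vartheta| < R
print(disk_area(VARTHETA, R_DISK), np.pi * R_DISK**2)
```

The two printed numbers should agree to within the discretization error of the radial midpoint rule, independently of the chosen pole position ϑ.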

In complex notation, the singular term in Eq. (21) reads

$$\frac{1}{(\vartheta-\theta)^*}=\frac{x + |x|^2\vartheta/R}{R\, |x|^2 \left(|\vartheta|^2/R^2-1\right)},$$

yielding

$$\left|\frac{\partial \boldsymbol\theta}{\partial \boldsymbol x}\right| \frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta |^2} =\frac{R^2(|\boldsymbol\vartheta|^2-R^2)}{\left(R^2+2 R\, \boldsymbol\vartheta\cdot\boldsymbol x+|\boldsymbol\vartheta|^2 |\boldsymbol x|^2\right)^2}\left(\frac{R}{|\boldsymbol x|^2}\,\boldsymbol x +\boldsymbol\vartheta\right).$$ (A.6)

We can check the consistency of this expression by calculating the deflection angle of a uniform disk with surface mass density κ₀, which reads

$$\begin{aligned} \boldsymbol\alpha(\boldsymbol\vartheta)&=\frac{\kappa_0}{\pi}\int_{\mathcal U}\mathrm{d}^2\theta\; \frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta |^2} =\frac{\kappa_0}{\pi}\int_{\mathcal C}\mathrm{d}^2 x\; \left|\frac{\partial \boldsymbol\theta}{\partial \boldsymbol x}\right| \frac{\boldsymbol\vartheta-\boldsymbol\theta}{|\boldsymbol\vartheta-\boldsymbol\theta |^2} \\ &=\frac{R^2\left(|\boldsymbol\vartheta|^2-R^2\right) \kappa_0}{\pi}\int_0^1\mathrm{d} x\;x \int_0^{2\pi} \frac{\mathrm{d}\varphi}{\left(R^2+2 R\, \boldsymbol\vartheta\cdot\boldsymbol x+|\boldsymbol\vartheta|^2 x^2\right)^2}\left(\frac{R}{x^2}\,\boldsymbol x +\boldsymbol\vartheta\right) \\ &=\frac{R^2\left(|\boldsymbol\vartheta|^2-R^2\right) \kappa_0}{\pi}\int_0^1\mathrm{d} x\;x\, \left(\frac{-2 \pi}{\left(R^2-|\boldsymbol\vartheta|^2 x^2\right)^2}\right)\boldsymbol\vartheta \\ &= \frac{R^2\left(|\boldsymbol\vartheta|^2-R^2\right) \kappa_0}{\pi} \left(\frac{\pi}{R^2 \left(|\boldsymbol\vartheta|^2-R^2\right)}\right)\boldsymbol\vartheta=\kappa_0\,\boldsymbol\vartheta, \end{aligned}$$ (A.7)

as expected.
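The uniform-disk check of Eq. (A.7) can be reproduced numerically from the right-hand side of Eq. (A.6). The sketch below again assumes a midpoint rule in |x| and a uniform grid in the polar angle, and uses complex numbers for two-dimensional vectors; the function name and the test values are illustrative assumptions.

```python
import numpy as np


def disk_deflection(vartheta, R, kappa0, n_r=400, n_phi=400):
    """Deflection of a uniform disk at vartheta, using Eq. (A.6) in x-space."""
    r = (np.arange(n_r) + 0.5) / n_r                 # radial midpoints in (0, 1)
    phi = 2.0 * np.pi * np.arange(n_phi) / n_phi     # uniform polar angles
    rr, pp = np.meshgrid(r, phi, indexing="ij")
    x = rr * np.exp(1j * pp)
    dot = (np.conj(vartheta) * x).real               # scalar product vartheta . x
    denom = R**2 + 2.0 * R * dot + abs(vartheta)**2 * rr**2
    prefactor = R**2 * (abs(vartheta)**2 - R**2) / denom**2
    vector = R * x / rr**2 + vartheta                # (R / |x|^2) x + vartheta
    # The factor rr from d^2x = |x| d|x| d(phi) cancels the 1/|x| behaviour
    # of the first term, so the integrand stays finite at the origin.
    integrand = prefactor * vector * rr
    integral = integrand.sum() * (1.0 / n_r) * (2.0 * np.pi / n_phi)
    return kappa0 / np.pi * integral


R_DISK = 1.5
KAPPA0 = 0.7
VARTHETA = 0.6 + 0.3j   # arbitrary test point with |vartheta| < R
alpha = disk_deflection(VARTHETA, R_DISK, KAPPA0)
print(alpha, KAPPA0 * VARTHETA)
```

Because the pole has been mapped to the origin of the x-plane, no special treatment of the singularity is needed in this sketch, which is the practical benefit of the mapping (A.1).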

Appendix B: Proof of the relation (26) for a circular region

In this section we again use the complex notation for two-dimensional vectors, in terms of which the vector field (21) reads

$$\nabla_{\boldsymbol\vartheta} H(\boldsymbol\vartheta;\boldsymbol\theta) \to \frac{1}{2\pi}\left(\frac{1}{\vartheta^*-\theta^*}-\frac{\theta} {R^2-\vartheta^*\theta}-\frac{\vartheta}{R^2}\right).$$ (B.1)

Therefore, we obtain for the fields A and B_in the complex expressions

$$\begin{aligned} A(\vartheta)&=-\frac{1}{\pi}\int_{\mathcal U} \mathrm{d}^2\theta\;\kappa(\boldsymbol\theta)\, \left(\frac{\theta}{R^2-\vartheta^*\theta}+\frac{\vartheta}{R^2}\right), \\ B_{\rm in}(\vartheta)&=\frac{1}{2\pi}\int_{\partial \mathcal U}\mathrm{d} s\;\left(\frac{1}{\vartheta^*-\theta^*}-\frac{\theta} {R^2-\vartheta^*\theta}-\frac{\vartheta}{R^2}\right)\, a_{\rm in}, \end{aligned}$$ (B.2)

where a_in = α_in·n is defined on the boundary of $\mathcal U$. We first calculate this product, noting that for a boundary point $R\,{\rm e}^{{\rm i}\varphi}$ of the circle, $n = {\rm e}^{{\rm i}\varphi}$. Specializing the complex expression for α_in as given in Eq. (23) to a point on the boundary, ϑ = Rn, we obtain

$$\begin{aligned} a_{\rm in}(R n)&=\boldsymbol\alpha_{\rm in}\cdot n =\frac{\alpha_{\rm in}/n+\alpha_{\rm in}^* n}{2}\\ &=\frac{1}{2\pi}\int_{\mathcal U} \mathrm{d}^2\theta\;\kappa(\boldsymbol\theta) \left(\frac{1}{R-n\, \theta^*}+\frac{n}{R n-\theta}\right). \end{aligned}$$ (B.3)

We now insert this expression into Eq. (B.2). The line element on the boundary is written as ds = R dφ = −i R dn/n, and the integral over n extends over the unit circle. With θ = Rn on the boundary, we then find

$$\begin{aligned} B_{\rm in}(\vartheta)&=\frac{R}{2\pi}\int_{\mathcal U} \mathrm{d}^2\theta\;\kappa(\boldsymbol\theta) \,\frac{-{\rm i}}{2\pi} \oint\frac{\mathrm{d} n}{n}\left(\frac{1}{R-n\theta^*}+\frac{n}{R n-\theta}\right) \\ &\quad\times \left(\frac{n}{n\vartheta^*-R}-\frac{n}{R-n\vartheta^*}-\frac{\vartheta}{R^2}\right). \end{aligned}$$ (B.4)

The inner integrand is an analytic function of n inside the unit circle, except at the poles at n = 0 and at n = θ/R (ϑ and θ are both inside the circle). Applying the residue theorem, the integral can thus be evaluated. The first pole yields a contribution −ϑ/R³, whereas the second pole results in the expression

$$\frac{\theta}{(\theta\vartheta^*-R^2)R} - \frac{\theta}{(R^2-\theta\vartheta^*)R}-\frac{\vartheta}{R^3}.$$

Adding up these two contributions then yields

$$B_{\rm in}(\vartheta)=\frac{1}{\pi}\int_{\mathcal U} \mathrm{d}^2\theta\;\kappa(\boldsymbol\theta)\left(-\frac{\theta} {R^2-\vartheta^*\theta}-\frac{\vartheta}{R^2}\right),$$ (B.5)

which we see agrees with the expression for A in Eq. (B.2). Thus we have shown explicitly that for the kernel function (20), the relation (26) holds.
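The equality B_in = A can be checked numerically for a simple case. The sketch below assumes a unit point mass at θ₀, i.e., κ(θ) = δ(θ − θ₀), so that the area integrals in Eqs. (B.2) and (B.3) reduce to evaluating the integrand at θ₀; the boundary integral of Eq. (B.2) is then done with a uniform grid in φ. The point-mass choice, the function names and the test values are assumptions made for illustration.

```python
import numpy as np


def a_field(vartheta, theta0, R):
    """A(vartheta) of Eq. (B.2) for a unit point mass at theta0."""
    return -(theta0 / (R**2 - np.conj(vartheta) * theta0)
             + vartheta / R**2) / np.pi


def b_in_field(vartheta, theta0, R, n_phi=2000):
    """B_in(vartheta) of Eq. (B.2), integrating numerically over the boundary."""
    phi = 2.0 * np.pi * np.arange(n_phi) / n_phi
    n = np.exp(1j * phi)          # outward unit normal in complex notation
    theta_b = R * n               # boundary points
    # Normal component of the deflection on the boundary, Eq. (B.3),
    # evaluated for kappa = delta(theta - theta0).
    a_in = (1.0 / (R - n * np.conj(theta0))
            + n / (R * n - theta0)) / (2.0 * np.pi)
    # Kernel of Eq. (B.1) evaluated at the boundary points.
    kernel = (1.0 / (np.conj(vartheta) - np.conj(theta_b))
              - theta_b / (R**2 - np.conj(vartheta) * theta_b)
              - vartheta / R**2) / (2.0 * np.pi)
    ds = R * 2.0 * np.pi / n_phi  # line element ds = R d(phi)
    return np.sum(kernel * a_in) * ds


R_DISK = 1.0
THETA0 = 0.3 + 0.2j      # position of the point mass, inside the circle
VARTHETA = -0.4 + 0.5j   # evaluation point, inside the circle
print(b_in_field(VARTHETA, THETA0, R_DISK), a_field(VARTHETA, THETA0, R_DISK))
```

Since any κ can be written as a superposition of such point masses, agreement for arbitrary θ₀ and ϑ inside the circle is equivalent to the statement proven above.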

Appendix C: Conservation of mass inside the Einstein radius under SPT

Starting from Eq. (32) we can infer the mass inside the Einstein radius by performing an integration up to θ_E. First, we consider the case of an axisymmetric lens model, i.e., γ_p = 0. Thus, the integral we have to solve is

$$\begin{aligned} \hat{M}_{\gamma_{\rm p}=0}(\le\theta_{\rm E}) &= 2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta \, \hat\kappa(\theta) \\ &= M(\le\theta_{\rm E}) + f_2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 ( 1 - \bar\kappa )^2 [3 (\kappa - \bar\kappa) - 2 (1 - \bar\kappa)]. \end{aligned}$$ (C.1)

To show that mass is conserved in case of an SPT, the integral in Eq. (C.1) has to vanish for arbitrary mass profiles κ(θ). We note that $\bar\kappa$ is given by

$$\bar\kappa (\theta) = \frac{2}{\theta^2} \int_0^\theta \mathrm{d} \theta' \; \theta' \, \kappa(\theta'),$$ (C.2)

from which follows

$$\bar\kappa' = \frac{2}{\theta} (\kappa - \bar\kappa),$$ (C.3)

of which we will make use in the next step. We perform an integration by parts for the last term in Eq. (C.1),

$$\begin{aligned} 2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 (1 - \bar\kappa)^3 &= \left[ 2\, \frac{\theta^4}{4} (1-\bar\kappa)^3 \right]^{\theta_{\rm E}}_0 + 2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \frac{\theta^4}{4} \, 3 (1-\bar\kappa)^2 \bar\kappa' \\ &= 3 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 \, (1-\bar\kappa)^2 (\kappa - \bar\kappa), \end{aligned}$$ (C.4)

where we used $\bar\kappa(\theta_{\rm E}) = 1$. This result matches exactly the first term in the integral of Eq. (C.1). Hence, in the case γ_p = 0 the mass inside the Einstein ring is conserved under an SPT.

Next, we consider the case for γ_p ≠ 0. To integrate $\hat\kappa$ over the Einstein radius we make use of the result above, i.e., $\hat{M}_{\gamma_{\rm p}=0}(\le\theta_{\rm E}) = M(\le\theta_{\rm E})$. Since the two latter parts of Eq. (32) are proportional to cos(2φ) or cos(4φ), respectively, they do not contribute to an integral over a circular area. Hence, we calculate

$$\begin{aligned} \hat{M}(\le\theta_{\rm E}) &= 2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta \, \hat\kappa(\theta) \\ &= M(\le\theta_{\rm E}) + f_2\, \gamma_{\rm p}^2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 [2 (\kappa - \bar\kappa) - 4 (1 - \bar\kappa)]. \end{aligned}$$ (C.5)

Again, we consider first the last term in the integral of Eq. (C.5), and we make use of Eq. (C.3) and $\bar\kappa(\theta_{\rm E}) = 1$. Then, we obtain

$$\begin{aligned} 4 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 (1 - \bar\kappa) &= \left[ 4\, \frac{\theta^4}{4} (1-\bar\kappa) \right]^{\theta_{\rm E}}_0 + 4 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \frac{\theta^4}{4} \, \bar\kappa' \\ &= 2 \int_0^{\theta_{\rm E}} \mathrm{d} \theta \; \theta^3 \, (\kappa - \bar\kappa). \end{aligned}$$ (C.6)

This matches the first term in the integral exactly and therefore, the integral vanishes. Thus, also in the case of an external shear γ_p the mass enclosed in the Einstein ring is conserved, i.e., $\hat{M}(\le\theta_{\rm E}) = M(\le\theta_{\rm E})$, independent of the choice of mass model κ.
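The vanishing of the two integrals in Eqs. (C.1) and (C.5) can be verified numerically for a concrete profile. The sketch below assumes a hypothetical cored isothermal-like convergence κ(θ) = A/√(θ² + θ_c²), for which the mean convergence and the Einstein radius have closed forms; the profile, the parameter values and the function names are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

# Hypothetical cored isothermal-like profile (an assumption for illustration).
AMPLITUDE = 0.6
THETA_CORE = 0.1
# kappa_bar(theta) = 2A / (sqrt(theta^2 + theta_c^2) + theta_c), so that
# kappa_bar(theta_E) = 1 gives theta_E = 2 sqrt(A (A - theta_c)).
THETA_E = 2.0 * np.sqrt(AMPLITUDE * (AMPLITUDE - THETA_CORE))


def kappa(theta):
    """Convergence of the hypothetical profile."""
    return AMPLITUDE / np.sqrt(theta**2 + THETA_CORE**2)


def kappa_bar(theta):
    """Mean convergence inside theta, Eq. (C.2), in closed form."""
    return 2.0 * AMPLITUDE / (np.sqrt(theta**2 + THETA_CORE**2) + THETA_CORE)


def midpoint_integral(func, a, b, n=20000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    h = (b - a) / n
    t = a + (np.arange(n) + 0.5) * h
    return np.sum(func(t)) * h


def integrand_c1(theta):
    """Integrand of the f_2 term in Eq. (C.1)."""
    kb = kappa_bar(theta)
    return theta**3 * (1.0 - kb)**2 * (3.0 * (kappa(theta) - kb) - 2.0 * (1.0 - kb))


def integrand_c5(theta):
    """Integrand of the f_2 gamma_p^2 term in Eq. (C.5)."""
    kb = kappa_bar(theta)
    return theta**3 * (2.0 * (kappa(theta) - kb) - 4.0 * (1.0 - kb))


integral_c1 = midpoint_integral(integrand_c1, 0.0, THETA_E)
integral_c5 = midpoint_integral(integrand_c5, 0.0, THETA_E)
print(kappa_bar(THETA_E), integral_c1, integral_c5)
```

Both integrals should vanish to within the discretization error, while the individual terms of each integrand do not; changing the upper limit away from θ_E breaks the cancellation, since the boundary terms in Eqs. (C.4) and (C.6) then no longer vanish.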

All Figures

Fig. 1

Illustration of the source position transformation. A source at β causes multiple images θ in the lens plane under the deflection law α. The same multiple images are obtained from a source at $\hat{\boldsymbol\beta}(\boldsymbol\beta)$, provided the deflection law is changed to $\hat{\boldsymbol\alpha}$, according to Eq. (4).

Fig. 2

Illustration of the extrapolation scheme used in the SOR method (Sect. 3.2) to calculate $\tilde{\boldsymbol\alpha}$. Based on the calculation of $\tilde{\boldsymbol\alpha}$ on two grids with indices (i,k) and (I,K), we can retrieve $\tilde{\boldsymbol\alpha}^{\rm true}$ with a minimum accuracy Δα using the scheme described in the figure.

Fig. 3

Maximum difference $|\hat{\boldsymbol\alpha} - \tilde{\boldsymbol\alpha}| = |\Delta\boldsymbol\alpha|$ for a non-singular isothermal sphere with core radius θ_c = 0.1 θ_E as a function of the number of grid points $\mathcal{N}$ used in the numerical solution. Blue dots are the numerical results, whereas the curves present power-law fits to these points with h being the spacing of grid points. In the top panel, the results are shown for the “standard” method, where the numerical error scales as $\mathcal{N}^{-2}$. Incorporating the extrapolation scheme, the numerical error decreases much faster with the number of grid points, as can be seen in the lower panel (note the different scale for the y-axis in the upper and lower panel). For the typical values used in this paper ($\mathcal{N}\sim 400$), a gain in accuracy by three orders of magnitude is obtained with extrapolation with only a modest increase of computational cost (~25%). Since extrapolation includes calculating $\tilde{\boldsymbol\alpha}$ twice, with grid points 2N and N, values are only shown up to $\mathcal{N}_2 = 500$, which corresponds to $\mathcal{N}_1 = 1000$ in the graph above.

Fig. 4

Values of |Δα|_max are plotted against the parameter f₂ from Eq. (28) and the external shear strength γ_p. The colored region indicates allowed pairs of parameters that fulfill the criterion |Δα| < 5 × 10⁻³ θ_E. For obtaining this figure, we used the SOR method.

Fig. 5

Values of $\|\ki\|_\mathrm{max}$ are plotted against the parameter f₂ from Eq. (28) and the external shear strength γ_p. The colored region indicates allowed pairs of parameters that were chosen such that they roughly correspond to |Δα| < 5 × 10⁻³ θ_E.

Fig. 6

For every allowed combination of f₂ and γ_p, the values of |Δα|_max (Fig. 4) are plotted against $\|\ki\|_\mathrm{max}$ (Fig. 5). A clear correlation between these two quantities can be seen.

Fig. 7

Map of |Δα(θ)| for f₂ = 0.55 and γ_p = 0.1. The strong changes in the corners, i.e., θ > 2 θ_E, are biased by large numerical uncertainty and should be neglected.

Fig. 8

The upper panel shows the mass profile of the original NIS lens (solid curve), and that of three SPT-transformed lenses, with parameters f₂ and γ_p indicated by the labels. For all three of these models, Δα_max ≈ ε_acc = 5 × 10⁻³ θ_E. Since the transformed mass distributions have a finite ellipticity, the density is plotted as a function of the geometric mean of the major and minor semi-axes of the best-fitting ellipse to an isodensity contour, except for the case with negative f₂, for which the outer isodensity contours do not close around the lens center; in this special case, the x-axis corresponds to the θ₁-axis. The convergence changes by up to 28% for radii smaller than 1 θ_E; radii larger than that show a significantly smaller convergence for a positive f₂. Negative f₂ show an essentially mirrored behavior compared to positive f₂. This leads to a convergence $\hat\kappa$ that may not decrease monotonically. The lower panel shows the ratio between the transformed and the original mass profile.

Fig. 9

Radial dependence of the axis ratio ϵ. In the unperturbed case the isodensity contours are circular, i.e., $\epsilon(\sqrt{\theta_1\theta_2}) = 1$. The SPT-transformed mass distribution shows, for radii $\sqrt{\theta_1\theta_2} < 1\,\theta_{\rm E}$, deviations of up to 5% from circularity, whereas for larger radii the deviations can be up to 20%. The convergence map for f₂ = −0.55 and γ_p = 0.1 does not show concentric isodensity contours for radii larger than 1.3 θ_E and therefore no ellipticity as a function of radius can be determined.

Fig. 10

Top: quantity R_κ(θ) (Eq. (33)) calculated for an NIS with external shear γ_p (cf. Sect. 4; solid black) and for various SPT-transformed models with an SPT of the form 1 + (f₂/2)(β/θ_E)² (Eq. (28)). The range of positive values of f₂ allowed by |Δα|_max < 5 × 10⁻³ θ_E (Fig. 4) is explored for two different choices of the shear: γ_p = 0.05 (blue) and γ_p = 0.1 (red). While R_κ is conserved under an MST, it is not under an SPT, with deviations that can reach tens of percent. Bottom: for each curve of the top panel, we show the difference between R_κ of the original NIS model and of the SPT-transformed model.

Fig. 11

Top: M_ap(θ₀) (Eq. (34)) as a function of the “aperture” θ₀. The filter function u(x) defined by Eq. (38), using x₀ = 0.5, has been used such that for M_ap(θ₀), the annulus [θ₀/(2θ_E), θ₀/θ_E] is probed. The black curve shows M_ap for the NIS profile and the blue curves for the SPT-transformed profiles with γ_p = 0.05 and various values of f₂. The green curve shows M_ap(θ₀) for an MST-transformed version of the NIS profile. Bottom: ratio between M_ap derived for the various transformed profiles and for the original NIS profile.

Fig. 12

Ratios of aperture mass between θ₁ = 2 θ_E and θ₂ = θ_E for SPT-transformed profiles with various values of f₂. The ratios of aperture masses are normalized by the corresponding aperture ratios estimated for the original NIS profile (horizontal bar). The blue diamonds are for a shear γ_p = 0.05, and the red squares for γ_p = 0.1.


