TDCOSMO

Matthew R. Gomer; Dominique Sluse; Lyne Van de Vyvere; Simon Birrer; Frederic Courbin

doi:10.1051/0004-6361/202244324

Home

All issues

Volume 667 (November 2022)

A&A, 667 (2022) A86

Full HTML

Open Access

Issue		A&A Volume 667, November 2022


Article Number		A86
Number of page(s)		17
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/202244324
Published online		09 November 2022

A&A 667, A86 (2022)

VIII. A key test of systematics in the hierarchical method of time-delay cosmography

Matthew R. Gomer¹, Dominique Sluse¹, Lyne Van de Vyvere¹, Simon Birrer²^,3^,4 and Frederic Courbin⁵

¹ STAR Institute, Quartier Agora, Allée du Six Août, 19c, 4000 Liège, Belgium
e-mail: mgomer@uliege.be
² Kavli Institute for Particle Astrophysics and Cosmology and Department of Physics, Stanford University, Stanford, CA 94305, USA
³ SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
⁴ Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY 11794, USA
⁵ Institute of Physics, Laboratory of Astrophysics, Ecole Polytechnique Fédérale de Lausanne (EPFL), Observatoire de Sauverny, 1290 Versoix, Switzerland

Received: 22 June 2022
Accepted: 1 September 2022

Abstract

The largest source of systematic errors in the time-delay cosmography method likely arises from the lens model mass distribution, where an inaccurate choice of model could in principle bias the value of H₀. A Bayesian hierarchical framework has been proposed which combines lens systems with kinematic data, constraining the mass profile shape at a population level. The framework has been previously validated using a small sample of lensing galaxies drawn from hydro-simulations. The goal of this work is to expand the validation to a more general set of lenses consistent with observed systems, as well as confirm the capacity of the method to combine two lens populations: one which has time delay information and one which lacks time delays and has systematically different image radii. For this purpose, we generated samples of analytic lens mass distributions made of baryons+dark matter and fit the subsequent mock images with standard power-law models. Corresponding kinematics data were also emulated. The hierarchical framework applied to an ensemble of time-delay lenses allowed us to correct the H₀ bias associated with model choice to find H₀ within 1.5σ of the fiducial value. We then combined this set with a sample of corresponding lens systems which have no time delays and have a source at lower z, resulting in a systematically smaller image radius relative to their effective radius. The hierarchical framework has successfully accounted for this effect, recovering a value of H₀ which is both more precise (σ ∼ 2%) and more accurate (0.7% median offset) than the time-delay set alone. This result confirms that non-time-delay lenses can nonetheless contribute valuable constraining power to the determination of H₀ via their kinematic constraints, assuming they come from the same global population as the time-delay set.

Key words: gravitational lensing: strong / cosmological parameters / methods: numerical / galaxies: elliptical and lenticular, cD

© M. R. Gomer et al. 2022

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1. Introduction

A precise determination of the Hubble constant H₀ is a widely sought after goal, now with multiple mature measurement methods. Some methods measure the early universe and recover H₀ (e.g., Aiola et al. 2020; Planck Collaboration VI 2020; Abbott et al. 2018), while some use standard candles to measure H₀ locally (e.g., Riess et al. 2022; Freedman et al. 2020). There is a discrepancy between these measurements which may require the addition of new physics into the standard cosmological model (e.g., Poulin et al. 2019), as extensive considerations of possible unaccounted-for systematic effects in one or more methods of H₀ determination have not provided an explanation yet (e.g., Freedman 2021; Huang et al. 2020; Kenworthy et al. 2019; Addison et al. 2018). Additional independent measurement methods may help diagnose the tension, and should be explored before making claims of new physics. Many such novel methods have been developed, such as using water megamasers (Pesce et al. 2020) or gravitational sirens (Abbott et al. 2017) to directly measure distance. One such distance-ladder-independent method is the gravitational lensing time-delay method.

The gravitational lensing method measures distances through the use of time delays between different images of the same variable source. The combination of the observed time delays with the lens mass model gives a time-delay distance:

$\begin{matrix} D_{Δ t} = (1 + z_{d}) \frac{D_{d} D_{s}}{D_{ds}} \propto H_{0}^{- 1}, \end{matrix}$ $\begin{aligned} D_{\Delta t} = (1+z_{\rm d}) \frac{D_{\rm d} D_{\rm s}}{D_{\rm ds}}\propto H_0^{-1}, \end{aligned}$ (1)

where z_d is the redshift of the lens (deflector), and D_d, D_s, and D_ds are the angular diameter distances from the observer to the lens, observer to the source, and deflector to the source, respectively. Distance (i.e., H₀) measurements can be combined between numerous systems to increase the statistical precision. The method requires high-quality imaging data, long-term time-delay monitoring, and precise spectroscopy for kinematic measurements. Large collaborations leading the efforts with this method include COSMOGRAIL (Courbin et al. 2018; Bonvin et al. 2019; Millon et al. 2020a), SHARP (Spingola et al. 2018; Chen et al. 2019), STRIDES (Treu et al. 2018; Anguita et al. 2018; Shajib et al. 2019, 2020), and H0LiCOW (Suyu et al. 2017; Wong et al. 2020; Rusu et al. 2020), which together form the TDCOSMO umbrella collaboration (Millon et al. 2020b).

The primary challenges for the lensing method are degeneracies, whereby the same lensing observables (e.g., image positions and relative fluxes) are recovered from a set of mass distributions. Lensing information alone has no ability to constrain which distribution within the set is the truth. Of particular interest is the mass sheet degeneracy (MSD, Falco et al. 1985), a specific case of the more general source position transformation (SPT, Schneider & Sluse 2013, 2014; Unruh et al. 2017; Wertz et al. 2018). The MSD is extensively studied and well-documented in the literature (e.g., Saha & Williams 2006; Xu et al. 2016; Birrer et al. 2016; Sonnenfeld 2018; Birrer et al. 2020; Kochanek 2020, 2021; Gomer & Williams 2020; Blum et al. 2020; Yıldırım et al. 2021; Birrer 2021). The MSD is described through the mass sheet transformation (MST), in which a mass distribution κ(θ) can be scaled by a factor of λ with the addition of a corresponding constant, analogous with the addition of a uniform sheet of mass, into any one realization from a set of transformed distributions:

$\begin{matrix} κ_{λ} (θ) = λ κ (θ) + (1 - λ), \end{matrix}$ $\begin{aligned} \kappa _{\lambda }(\theta ) = \lambda \kappa (\theta ) + (1-\lambda ), \end{aligned}$ (2)

where we have used the λ subscript to indicate the quantity has been transformed by an MST. The physical nature of a mass sheet is often conceptually separated into two different phenomena: an “external” mass sheet coming from additional mass along the line of sight, and an “internal” mass sheet which encodes the possibility of an alternative lens profile shape. This work concerns the internal mass sheet. Together with a corresponding transformation of the unobserved source position β_λ = λβ, the MST leaves image positions and relative fluxes unaffected, while leading to a multiplicative constant on the time-delay distance measurement D_Δtλ = λ⁻¹D_Δt. Since the recovered value of H₀ is inversely proportional to the time-delay distance, a determination of H₀ from a mass-sheet-transformed model results in a biased value of H_0λ = λH₀. As such, the capacity of lens modeling to recover an accurate measure of H₀ is inherently a problem of constraining λ.

Since lensing information alone is unable to constrain λ, one must turn to external information. The impact of the MST can be mitigated via stellar velocity dispersion measurements, which provide an independent measurement of mass (Treu & Koopmans 2002a,b; Koopmans et al. 2003; Barnabe & Koopmans 2007). This addition requires that a given mass distribution reproduces both the lensing data and kinematic data, which places constraints on the profile shape, and therefore the range of allowable λ.

To further constrain λ, it is common to incorporate existing knowledge about galaxy structure through the implementation of a parametric profile. Early-type galaxy mass profiles are known to be well-approximated as a power law with respect to radius with approximately isothermal slope, a phenomenon sometimes referred to as the bulge-halo conspiracy (Gavazzi et al. 2007; Suyu et al. 2009; Koopmans et al. 2009; Humphrey & Buote 2010; Auger et al. 2010; Barnabè et al. 2011; Dutton & Treu 2014). By specifically using a power law to fit a lens, a modeler fixes λ by restricting the fit to the κ_λ which matches the power-law profile form over the relevant radii. As such, the choice of a specific family of model, power-law or otherwise, restricts the range of possible MSTs which fit the data. If a modeler uses a power law to fit a lens which in reality differs from a power law, the MST which causes the profile to best match a power law is implicitly applied, which can result in a biased value of H₀. One approach to account for this effect is to use multiple different families of models for the same lens, which is why H0LiCOW uses both a power-law and a two-component composite model to fit time-delay lenses for cosmography (Suyu et al. 2014; Rusu et al. 2020). TDCOSMO’s exploration of systematics found agreement in H₀ when using a power law or composite model in the fit, providing supporting evidence that this MST effect is not severe in real lenses (Millon et al. 2020b). When using a power-law model for 6 lenses supplemented with aperture kinematic measurements, H0LiCOW found $H_{0} = 73 . 3_{- 1.8}^{+ 1.7}$ $H_0 = 73.3_{-1.8}^{+1.7}$ km s⁻¹ Mpc⁻¹ (Wong et al. 2020).

Birrer et al. (2020, hereafter TDC4) introduced a technique to supplement the parametric method by explicitly folding the effect of the MSD into a hierarchical analysis of the lens systems. This method is able to combine sets of lens modeling results together, provided that the sets of lens galaxies come from the same overall galaxy population. The method applies an MST to the lens modeling results on a population level, serving to capture systemic departure from the initial parametric model. To account for the difference of lensing impact parameter arising from having different lens and source redshifts, the amplitude of the MST is allowed to vary as a function of the ratio between the effective radius and Einstein radius. By directly parameterizing and including this effect, this approach allows the velocity dispersion data of the collective set to constrain the space of possible MSTs. Specifically, systems from the Sloan Lens ACS Survey (SLACS, Bolton et al. 2006, 2008; Shajib et al. 2021), which have lens models and velocity dispersions, but lack time delays, are able to be included in the analysis to place constraints on the population-scale MST, given the hypothesis holds that this population comes from the same global population as that of the time-delay lenses. We detail this method in the following section.

The hierarchical framework has been validated for a population of time-delay lenses by TDC4 based on the simulated lenses from Rung 3 of the time delay lens modeling challenge (TDLMC; Ding et al. 2018, 2021a). The value of H₀ was inferred by modeling hydrodynamically simulated lensed systems where the lensing galaxy was drawn from a set of 16 massive elliptical galaxies from Illustris-TNG and from zoom-in simulations. The resulting value of H₀ ( $68 . 4_{- 3.7}^{+ 3.4}$ $68.4_{-3.7}^{+3.4}$ km s⁻¹ Mpc⁻¹ when using a uniform prior for anisotropy radius) was consistent with the fiducial input value (65.413 km s⁻¹ Mpc⁻¹) with most of the uncertainty coming from limited constraints on the anisotropy of the velocity dispersion. However, the TDLMC Rung 3 lenses are known to have numerical effects in the central region, for which the authors caution against their strict interpretation as realistic lenses (Ding et al. 2021a). We also note that the TDLMC comparison does not test the inclusion of a corresponding non-time-delay lens population. With this in mind, the efficacy of the hierarchical method for real systems is supported, but not confirmed. A test of the method using analytically created mock systems which do not have such artificial cores would supplement this result, which is one goal of this work.

Combining the seven TDCOSMO systems, TDC4 recovered a value of $H_{0} = 74 . 5_{- 6.1}^{+ 5.6}$ $H_0 = 74.5_{-6.1}^{+5.6}$ km s⁻¹ Mpc⁻¹, which includes the MSD in the error budget. This is consistent with the Wong et al. (2020) value, but with widened uncertainty. Seeking better constraints, 33 non-time-delay lenses from SLACS (Shajib et al. 2021) were implemented into the hierarchical framework, under the assumption that the SLACS and TDCOSMO lenses are part of the same population. Interestingly, the resulting value of H₀ moves downward to $67 . 4_{- 3.2}^{+ 4.1}$ $67.4_{-3.2}^{+4.1}$ km s⁻¹ Mpc⁻¹. While these two values are statistically consistent, the hierarchical method nevertheless relies on different assumptions than other methods and so it must be further tested, motivating the experiment in this paper.

The main concern is that if there were some systematic difference between the SLACS and TDCOSMO lens populations, the primary assumption of the hierarchical framework would be flawed, and the result would be unreliable. Unfortunately, we do not have access to the true mass distributions of these galaxies, and as such we cannot confirm or reject the possibility that both samples are drawn from the same population. We can, however, perform targeted tests of systematics; a natural place to begin comes from the fact that the SLACS and TDCOSMO lenses have different redshifts, and therefore probe different radii relative to the effective radius. Ergo, the second goal of this paper is to test the capacity of the hierarchical framework to correctly capture and account for this change in local lensing effect which stems from a difference in redshift. This goal can be addressed by by creating mock lenses consistent with the SLACS and TDCOSMO lens populations and combining them together hierarchically.

In summary, in this work we further test the hierarchical framework by modeling a large sample of mock lensed systems meant to represent those of the TDCOSMO galaxies, constructed as analytical two-component light+dark matter mass distributions. Specifically, we test the capacity of the framework in two respects. Firstly, we wish to confirm that the framework still performs well with intrinsic mass distributions which are less cored in their center than galaxies from hydro-simulations. Secondly, we wish to confirm that the inclusion of the comparison sample of non-time-delay lens systems is correctly accounted for without introducing a bias arising from the change in redshift (i.e., lensing impact parameter). To address this second respect, we construct our lens population such that it is consistent with the TDCOSMO population at one source redshift, and consistent with the SLACS population when the source redshift is changed.

We first describe the hierarchical framework and experiment setup in more detail in Sect. 2, then outline our mock creation in Sect. 3, present our results in Sect. 4 and discuss these results in Sect. 5.

2. Hierarchical framework and experiment setup

The hierarchical framework is detailed in TDC4, but we summarize the main components here. The key idea of the hierarchical framework is that information on the amplitude of the MST can be retrieved by considering a population of lens systems with kinematic data. While single-aperture kinematics can place limited constraints on λ for individual systems, the framework is able to leverage the additional statistical constraining power of a population of systems.

To gain a physical intuition of the method, consider that when an individual lens is fit as a power law (PL), the actual constraint is that the lens profile is a PL+MST with an unknown λ. However, these galaxies as a population are thought to follow approximately the same global mass profile (with a characteristic radius that scales with the luminous effective radius, θ_eff¹), meaning the λ necessary to match a given lens to a PL is purely a function of which part of the profile the Einstein radius (θ_Ein) lands upon. Kinematics also probe some radial range of this global profile, depending on the size of the measurement aperture relative to θ_eff, but unlike lensing information, kinematics are sensitive to λ, as they measure the true mass within a spherical shell. As such, the combination of many systems together can be thought of as probing the global profile at a range of radii with both lensing and kinematic data, where the measurements from one system can help support the unconstrained parts of another system. This combined information allows one to infer the population-scale λ behavior associated with the mapping of the global profile into a PL.

One can show that the MST-invariant term extracted from a lensing measurement is:

$\begin{matrix} ξ = θ_{Ein} \frac{α_{E}^{″}}{1 - κ_{E}}, \end{matrix}$ $\begin{aligned} \xi =\theta _{\rm Ein}\frac{\alpha^{\prime \prime }_{\rm E}}{1-\kappa _{\rm E}}, \end{aligned}$ (3)

where $α_{E}^{″}$ $\alpha^{\prime\prime}_{\rm E}$ is the circularly averaged radial second derivative of the deflection evaluated at θ_Ein, and κ_E is the mean convergence of an annulus at θ_Ein (Sonnenfeld 2018; Kochanek 2020). From this point of view, the hierarchical inference takes a set of ξ constraints at different Einstein radii and uses the kinematic information to provide the conversion from the MST-agnostic description into the true mass distribution with an associated λ, which is allowed to be different for different Einstein radii.

The inference works by ascribing a distribution of MSTs to the galaxy population, where the λ_int distribution is described with Gaussian standard deviation σ(λ_int). The “int” denotation references the conceptual separation between an internal mass sheet transformation as described here from an external mass sheet, owing to convergence along the line of sight κ_ext. To account for differences in the radii at which the global mass profile is probed, a linear scaling with the effective radius relative to the Einstein radius is included as

$\begin{matrix} λ_{int} (θ_{eff} / θ_{Ein}) = λ_{int, 0} + α_{λ} (\frac{θ_{eff}}{θ_{Ein}} - 1) \end{matrix}$ $\begin{aligned} \lambda _{\rm int}(\theta _{\rm eff}/\theta _{\rm Ein}) = \lambda _{\rm int,0}+\alpha _{\lambda } \left(\frac{\theta _{\rm eff}}{\theta _{\rm Ein}}-1\right) \end{aligned}$ (4)

with slope α_λ and intercept λ_int, 0. We note the distinction between the set of λ_int values each lens within a population individually undergoes and the intercept of our linear parameterization of the population, λ_int, 0, a descriptive parameter we recover. This scaling allows for lenses which are probed at different radii to have different MST amplitudes to match the power-law shape. The constraint on λ_int comes from kinematic information, for which even lenses without time delays can provide meaningful constraints. Lens population kinematics are included in a spherical Jeans framework parameterizing the distribution of anisotropy a_ani (further detailed in Sect. 3.4) as Gaussian with mean ⟨a_ani⟩ and standard deviation σ(a_ani). In total, the hierarchical framework recovers values for λ_int, 0, σ(λ_int), α_λ, ⟨a_ani⟩ and σ(a_ani) to describe the distribution of values within the population and place an informed constraint on H₀.

The hierarchical inference has been successfully tested on numerically simulated TDLMC profiles, which have artificial cores arising from numerical effects. However, it has not yet been confirmed to be effective for analytical profiles which do not have such cores. In addition to testing this, we sought to test the capacity of the method to incorporate an external non-time-delay lens population which is similar to the time-delay lens population, except with a different relative impact parameter, (i.e., different θ_eff/θ_Ein), as was done for real systems in TDC4.

To control for all other variables, we chose for the two lens populations to be identical, except for the source redshift, for which we selected one value for each population. Given the same lens surface mass density Σ(θ), a larger source redshift z_s decreases the critical lensing density Σ_crit and hence increase the convergence κ(θ):

$\begin{matrix} Σ_{crit} = \frac{c^{2}}{4 π G} \frac{D_{s}}{D_{d} D_{ds}}, κ (θ, z) = \frac{Σ (θ)}{Σ_{crit} (z_{d}, z_{s})} \cdot \end{matrix}$ $\begin{aligned} \Sigma _{\rm crit} = \frac{c^2}{4\pi G}\frac{D_{\rm s}}{D_{\rm d} D_{\rm ds}}, \qquad \kappa (\theta ,z) = \frac{\Sigma (\theta )}{\Sigma _{\rm crit}(z_{\rm d},z_{\rm s})}\cdot \end{aligned}$ (5)

This increased κ leads to a larger Einstein radius, changing which part of the profile is probed by lensing. If a profile has a changing shape with radius, probing different radii could change the resulting fit, resulting in a different MST and ultimately in a change in H₀. As such, the expected change in λ_int with respect to impact parameter encodes two effects: the change in redshift leading to a different probed radius of a given profile, and the variety of profile shapes arising from the set of two-component mass distributions at a given redshift. The hierarchical setup assumes that these effects collectively are well-described as linear with Gaussian scatter (Eq. (4)). If the combined effect is not well-described by this parameterization, there will be nonrandom departures from the linear description, resulting in an incorrect scatter estimation and a biased H₀. However if the linear parameterization is a good description of the combined effect, such that the variations between systems can be captured by the Gaussian scatter, the effect is accounted for and the correct H₀ will be recovered. As such, an accurate H₀ recovery in this experiment represents supporting evidence for the versatility of the hierarchical method. On the other hand, if the method were to fail in the case where the only difference between populations comes from source redshifts, the hierarchical framework in its current form would be unlikely to be reliable for the more complicated real population.

This experiment is not necessarily intended to perfectly emulate the data sets for either SLACS or TDCOSMO specifically. Instead, the intention is to perform a targeted test of the effect of probed radius within the hierarchical framework, using a set of lenses engineered to come from the same population.

3. Creating mocks

We constructed a set of two-component profiles consistent with the observed lens population. For each profile, we simulated two mock HST-quality images corresponding to the lens with two different source redshifts. Each lens image was modeled using a power-law profile for the lensing galaxy. Mock kinematic measurements were calculated and combined with the lensing posteriors in a hierarchical framework. This section details how these mass profiles were created (Sect. 3.1), how the population of these profiles compares to the observed lenses (Sect. 3.2), how mock images were created and fit (Sect. 3.3), and how kinematics were simulated (Sect. 3.4).

3.1. Mass distribution

We constructed lenses using a two-component composite mass model consisting of a Chameleon profile (Dutton et al. 2011; Suyu et al. 2014) representing the light distribution and an NFW (Navarro et al. 1996, 1997) component representing the dark matter. We assumed that the mass distribution of the baryon component corresponds to the light distribution of observed systems. Most observed light profiles of early-type galaxies are modeled as Sérsic profiles, for which the corresponding surface mass density is given as

$\begin{matrix} Σ (θ_{x}, θ_{y}) = Σ_{eff} \exp {- b_{n} [{(\frac{\sqrt{q θ_{x}^{2} + θ_{y}^{2} / q}}{θ_{eff}})}^{1 / n} - 1]}, \end{matrix}$ $\begin{aligned} \Sigma (\theta _x,\theta _{ y}) = \Sigma _{\rm eff} \,\mathrm{exp}\left\{ -b_n\left[\left(\frac{\sqrt{q \theta _x^2+\theta _{ y}^2/q}}{\theta _{\rm eff}}\right)^{1/n}-1\right]\right\} , \end{aligned}$ (6)

with axis ratio q where θ_eff is the effective radius enclosing half of the total light and n is the Sérsic index (Sérsic 1963; Ciotti & Bertin 1999). The constant b_n is set to ensure the half-light property of θ_eff given n. Unfortunately the elliptical Sérsic profile requires numerical integrals in the lens deflection calculation and as such is computationally expensive in this context. We therefore chose to represent the baryonic matter as a Chameleon profile (Dutton et al. 2011), which is a profile designed to closely match a Sérsic profile at lensing radii using a combination of two softened isothermal profiles (Kassiola & Kovner 1993),

$\begin{matrix} Σ (θ_{x}, θ_{y}) = & \frac{Σ_{0}}{(1 + q)} [\frac{1}{\sqrt{q θ_{x}^{2} + θ_{y}^{2} / q + 4 w_{c}^{2} / {(1 + q)}^{2}}} \\ - \frac{1}{\sqrt{q θ_{x}^{2} + θ_{y}^{2} / q + 4 w_{t}^{2} / {(1 + q)}^{2}}}] \end{matrix}$ $\begin{aligned} \Sigma (\theta _x,\theta _{ y})=&\frac{\Sigma _0}{(1+q)} \left[\frac{1}{\sqrt{q \theta _x^2+\theta _{ y}^2/q+4{ w}_c^2/(1+q)^2}}\right.\nonumber \\&\left. -\frac{1}{\sqrt{q \theta _x^2+\theta _{ y}^2/q+4{ w}_{\rm t}^2/(1+q)^2}}\right] \end{aligned}$ (7)

with parameters w_c and w_t with w_t > w_c. The Chameleon profile is often used for this purpose in the context of lensing (e.g., Suyu et al. 2014; Birrer et al. 2019; Rusu et al. 2020; Shajib et al. 2020; Van de Vyvere et al. 2022a).

Given a Sérsic index and θ_eff, one can use the prescription of Dutton et al. (2011) to construct an analogous Chameleon profile. We note that this prescription claims that the Chameleon profile matches the enclosed mass of the Sérsic profile to within a few percent within the [0.5θ_Ein, 3θ_Ein] range, but actually requires a modification to reach this reported precision, which we have implemented and discussed in Appendix A. With this, we were able to construct a Chameleon profile which closely matches a provided Sérsic.

The NFW profile is described in 3D mass density as

$\begin{matrix} ρ (r) = \frac{ρ_{0}}{(r / r_{s}) {(1 + r / r_{s})}^{2}} \end{matrix}$ $\begin{aligned} \rho (r) = \frac{\rho _0}{(r/r_{\rm s})(1+r/r_{\rm s})^2} \end{aligned}$ (8)

with scale radius r_s and scale density ρ₀. Deflection angles and lens potentials for an elliptical NFW in projection can be calculated. Following Golse & Kneib (2002), ellipticity is added in the lens potential. The parameters for these Chameleon+NFW profiles must be selected to result in a population which is consistent with the observed SLACS and TDCOSMO lensing galaxies (Dutton & Treu 2014; Shajib et al. 2021).

3.2. Creating a lens population

To build up a realistic population of lens systems, we examined the existing population of lens galaxies which have been modeled as a composite profile. We considered the subset of 14 SLACS lenses used by TDC4 which have been modeled by Shajib et al. (2021) and therefore have fit constraints on their slope, γ. These lenses also have stellar masses measured via stellar population analysis using a Salpeter IMF (Auger et al. 2009). Combining these measurements, we examined the distribution of several observed properties (i.e., axis ratio of the light q_L, θ_eff, θ_Ein) as well as some recovered properties: axis ratio of the mass (q_m), mass within θ_Ein (M_Ein), total mass in baryons (M_bar), baryon fraction within θ_Ein (f_bar), and the MST-invariant ξ = 2(γ − 2) for a power law. We chose to plot ξ as recovered from the lensing power-law fits rather than γ directly to better reflect that the fit slope is a property recovered from lensing and may not correspond to the true local slope at the Einstein radius (Kochanek 2020). Additionally, we compiled the same properties for the TDCOSMO set of 7 lenses, although we lack measurements for q, M_bar or f_bar, and plotted the two populations as histograms in Fig. 1.

Fig. 1.

Histograms depict the distribution of observed SLACS (purple) and TDCOSMO (orange) lens properties. Our simulated populations (shown as KDEs of corresponding colors) were created by constructing a large number of profiles and selecting a subset which simultaneously match the 5 target parameter distributions, indicated by the green borders. The simulated SLACS and TDCOSMO populations are the same profiles, only differing by a changed source redshift. For parameters where the two sets are identical, a black line is used instead of orange or purple.

We sought to create a synthetic population of two-component profiles which closely matches both observed populations. To create a two-component profile, 7 parameters (omitting centroid positions and position angles) had to be given values. For the baryon component we have Σ_eff, θ_eff, n, and q_L. For the dark matter halo component we have ρ₀, r_s and q_h. Furthermore, we also must select redshifts for the source of each lens.

The first simplifying assumption we made is that the two components have the same axis ratios (q = q_L = q_h). We also set the position angles and centroids to be aligned. In reality, these components need not be aligned (Gomer & Williams 2018; Shajib et al. 2019), and have been allowed to vary in lensing models (e.g., Rusu et al. 2020; Millon et al. 2020b). However, since the primary focus for this work is the effect of the change in Einstein radius regarding the radial profile, rather than the effects of the angular structure of the lens, we fixed the components to be aligned. This assumption is supported by the joint lensing+dynamics analysis of Shajib et al. (2021), who find that centroids of light and mass match within 218 pc (68% confidence) and position angles align within 10 deg for most SLACS galaxies, consistent with earlier results for quasar lenses (Keeton et al. 1998; Yoo et al. 2006; Sluse et al. 2012). Numerical results using the Illustris simulation also support this level of alignment (Xu et al. 2017). We therefore chose to adopt this simple angular structure for the input mocks, although the lens model we use to fit the systems does have additional azimuthal freedom through external shear.

The intention of our mock population is that it resemble the SLACS distributions for a particular source redshift, z_s, and the TDCOSMO distributions after a change to another z_s. In reality, both populations exist over a range of z_s and lens redshift z_d, shown in Fig. 2. Generally, the TDCOSMO set has larger z_s and z_d. For the targeted experiment in this work, we selected just one z_d to be the same for both sets to set aside the effects of a changing angular scale (which affects image resolution and kinematic aperture size) and a changing cosmological scale factor (which changes comoving values of effective radii). We selected z_d = 0.25, which is higher than the typical SLACS value and lower than the typical TDCOSMO value. Strictly speaking, this lens redshift is below the lowest TDCOSMO z_d, but serves as a nice middle ground which is capable of retaining a Σ_crit for each set which is consistent with each respective population. We selected just two z_s: one to represent each of our two populations. We selected a z_s for what we call the “SLACS-like” set to be 0.6 and a z_s for what we call the “TDCOSMO-like” set to be 2.0. These redshift selections result in Σ_crit values which are close to the medians of those of the respective SLACS and TDCOSMO populations. These calculations use a flat universe with H₀ = 70 km s⁻¹ Mpc⁻¹ and Ω_M = 0.3.

Fig. 2.

Redshift comparison with the observed lens population. Left: the distributions of source redshifts against lens redshifts for the populations of observed SLACS (purple) and TDCOSMO (orange) lenses. Our selections of source and lens redshifts are indicated as dashed lines. Right: these selections result in the dashed-line Σ_crit values, which are consistent with the observed populations.

For the baryonic mass component, we set the Sérsic index of the light component to be n = 4, equivalent to a de Vaucouleurs profile (Sérsic 1963). We then used the prescription described in Appendix A to create a Chameleon profile from an input Σ_eff and θ_eff.

Even once a model family is chosen, there still is considerable freedom in the handful of parameters required to make a lens. Some parameters (such as q) can be directly observed and inserted into a model, while others (such as NFW scale radius r_s) do not easily convert into the corresponding observable constraint (like θ_Ein, which indirectly is determined by r_s, in conjunction with other parameters). We wish to correctly probe the space of observed values, but must create our mocks using parameter values which are sometimes not observable. To solve this problem, we randomly sampled the input parameters over a range, resulting in a large set of profiles with would-be observed values. Comparison to the observed distribution provides a set of target parameters, from which one can select only the realizations which match the targets to our desired precision. This was done by defining a χ² for each targeted quantity and selecting a subset of profiles below a threshold total χ². We selected a population which targets the observable quantities q, θ_Ein, and θ_eff²³.

After selecting a number of profiles which met our targets, we arrived at a population of 465 lens profiles, which are a good match to the observed lens populations⁴, with distributions of parameter values approximated via Kernel Density Estimation (KDE) in Fig. 1. Aside from the intended match to the target parameters, we were also able to compare some nonobservable quantities which were not targeted directly, such as baryon fraction or total mass. These quantities also match well between the observed and simulated populations, with the possible exception that the recovered values of ξ were closer to isothermal than some of the observed SLACS lenses, which recover steeper slopes (recall that ξ = 0 corresponds to an isothermal slope). Nevertheless, our simulated distribution overlaps significantly with the real lenses. In particular we have reproduced the difference in impact parameter between SLACS and TDCOSMO lenses: Fig. 3 shows that TDCOSMO-like lenses have images forming at a larger distance (smaller θ_eff/θ_Ein) than SLACS-like systems. This agreement with the observed data is important for testing the relevance of the parameterization of the MST with radius (Eq. (4)) used in the hierarchical framework.

Fig. 3.

Scaled radial quantity used by the hierarchical framework to parameterize radial dependence for MST, plotted for the observed population (histograms) and the simulated profiles used in this work (KDE curves).

We plot the convergence profiles as a function of radius for our mock populations in Fig. 4. In comparison with the hydrodynamically simulated lenses from the TDLMC previously used to validate the hierarchical framework, our analytical profiles are less cored, with slight differences in shape at larger radii.

Fig. 4.

Convergence profiles for 20 analytical Chameleon+NFW profiles used in this paper (purple: SLACS-like; orange: TDCOSMO-like) alongside 16 profiles from hydro-simulations used in the TDLMC (green). Colored bands correspond to the 16th and 84th percentiles. Vertical colored lines indicate the median Einstein radius of the corresponding population. The diagonal black line indicates an isothermal slope for comparison.

We now have a population of mock lens profiles which are comparable to both observed populations, except that the only difference between our SLACS-like and TDCOSMO-like populations is that we have placed the sources at two specific source redshifts. By drawing a random subset from this population, we created a set of lenses to be modeled while still maintaining confidence that our set is realistic.

3.3. Creating mock images and fits

We followed the setup of Van de Vyvere et al. (2022a) to create HST-like mock images using lenstronomy⁵ (Birrer & Amara 2018; Birrer et al. 2021). The images were created on a 151 × 151 pixel grid with a resolution of 0.08 arcsec and were convolved by the same PSF as used in the TDLMC (Ding et al. 2021a). We included noise using a zero-point of 26 mag, with an exposure corresponding to 5400 s and sky brightness of 22.3 mag arcsec⁻² with a read out noise of 21 e⁻, characteristic of the HST WFC3 camera in the F160W filter (Dressel 2012).

For each profile, two images were created: one SLACS-like image with z_s = 0.6 and one TDCOSMO-like image with z_s = 2.0. In all cases, we restricted our analysis to quadruply imaged configurations. The source was placed randomly within the caustic area in the same location for both source redshifts, which required rescaling the source position by the same factor as the increased caustic size for the higher-z_s TDCOSMO-like set. The source light was a circular Sérsic profile with n = 3, θ_eff = 0.1″ and a total unlensed magnitude of 20.5. Lacking an AGN in reality, the SLACS-like set was not given a point source or time delays. Meanwhile for the TDCOSMO-like set we included a central point source representing an AGN with an unlensed magnitude of 21. These magnitudes were selected to match the observed contrast between the quasar and host galaxy (Van de Vyvere et al. 2022a) and correspond to a system approximately 0.5 mag brighter than the mean intrinsic magnitude of the TDCOSMO lenses measured by Ding et al. (2021b). Time delays were included, centered on their true values, with an uncertainty of 2% or 1 day (whichever is larger) for likelihood calculation. For both sets of lenses, light from the lens galaxy was not included in an effort to focus the exploration on the changing Einstein radius rather than any possible effects of imperfect lens light subtraction. The end result for each lens was two mock high-quality HST images with different Einstein radii probing different parts of the same lens profile. Ten example lenses are shown in Fig. 5.

Fig. 5.

Images (log brightness) and residuals (in units of σ) of 10 systems at 2 source redshifts. Left: SLACS analog with z_s = 0.6 and no point sources, Right: TDCOSMO analog with z_s = 2.0. Lower-left scale bars correspond to 1″.

The images were fit via Particle Swarm Optimization (PSO, Kennedy & Eberhart 1995; Shi & Eberhart 1998) using lenstronomy with a Sérsic model for the source light and a Power-law Elliptical Mass Distribution (PEMD) + external shear model for the mass. Because external shear was included in the fit model but absent in the mock creation, the fit model has more azimuthal freedom than the input. This reduces the possibility for a bias to arise due to insufficient azimuthal freedom (Kochanek 2021). The central 0.4″ were masked in the fitting process to stand in for the sometimes-uncertain nature of lens light subtraction in the centermost region, remaining agnostic to the possiblity of a fifth image (although our PEMD model does not produce a fifth image). We also masked all pixels with intensity below twice the background level, as in Van de Vyvere et al. (2022a). Because this work uses mocks where the source properties are known, we were able to initialize a preliminary fitting using the correct source, but we did not fix the source to the truth during the fitting procedure. We fixed the slope to isothermal for this preliminary fit. A secondary fit started from this result and allows all parameters to vary. From the second result, a Markov chain Monte Carlo (MCMC) via emcee (Goodman & Weare 2010; Foreman-Mackey et al. 2013) was used to sample the posterior distribution of the lens parameters and estimate uncertainties.

All systems were successfully fit with the PEMD+shear models, with reduced χ² < 1.06 in all cases. Before accounting for the MSD, these fits recovered H₀ with a median of 58.6 km s⁻¹ Mpc⁻¹, which is biased by approximately 16% from the input value, with approximately 3% scatter. An example comparison between the input and fit profiles for a single pair of systems is given in Appendix B. The next steps were to add kinematic information to break the MSD and combine the posteriors together hierarchically.

3.4. Kinematics implementation

Stellar kinematic information is used in conjunction with lensing information to break the MSD. To do the same in this work, we had to create for each lens a predicted velocity dispersion corresponding to a mock aperture measurement.

The hierarchical framework uses spherical Jeans modeling as detailed in Binney & Mamon (1982), Birrer et al. (2020). This approximation is commonly employed in lens modeling (e.g., Treu & Koopmans 2002a; Koopmans et al. 2003; Suyu et al. 2010; Sonnenfeld et al. 2012; Wong et al. 2017; Birrer et al. 2019; Rusu et al. 2020) and calculates the velocity dispersion in the limit of a relaxed, spherically symmetric, nonrotating system. Radial ( $σ_{r}^{2}$ $\sigma_{\rm r}^2$ ) and tangential dispersions ( $σ_{t}^{2}$ $\sigma_{\rm t}^2$ ) are described by the spherical Jeans equation:

$\begin{matrix} \frac{\partial (ρ_{*} (r) σ_{r}^{2} (r))}{\partial r} + \frac{2 β_{ani} (r) ρ_{*} (r) σ_{r}^{2} (r)}{r} = - ρ_{*} (r) \frac{\partial Φ (r)}{\partial r} \end{matrix}$ $\begin{aligned} \frac{\partial (\rho _{*}(r)\sigma _{\rm r}^2(r))}{\partial r} + \frac{2\beta _{\rm ani}(r)\rho _{*}(r)\sigma _{\rm r}^2(r)}{r} = - \rho _{*}(r) \frac{\partial \Phi (r)}{\partial r} \end{aligned}$ (9)

for a given stellar distribution ρ_*(r) and gravitational potential Φ(r) where the anisotropy β_ani(r) is defined as

$\begin{matrix} β_{ani} (r) \equiv 1 - \frac{σ_{t}^{2} (r)}{σ_{r}^{2} (r)} \cdot \end{matrix}$ $\begin{aligned} \beta _{\rm ani}(r)\equiv 1-\frac{\sigma ^2_{\rm t}(r)}{\sigma ^2_{\rm r}(r)}\cdot \end{aligned}$ (10)

The luminosity-weighted projected velocity dispersion is given as

$\begin{matrix} Σ_{*} (R) σ_{s}^{2} (R) = 2 \int_{R}^{\infty} (1 - β_{ani} (r) \frac{R^{2}}{r^{2}}) \frac{ρ_{*} (r) σ_{r}^{2} (r) r d r}{\sqrt{r^{2} - R^{2}}}, \end{matrix}$ $\begin{aligned} \Sigma _*(R)\sigma _{\rm s}^2(R) = 2 \int _{R}^{\infty } \left( 1- \beta _{\rm ani}(r) \frac{R^2}{r^2}\right) \frac{\rho _{*}(r)\sigma _{\rm r}^2(r) r \mathrm{d}r}{\sqrt{r^2-R^2}}, \end{aligned}$ (11)

where R is the 2D projected radius and Σ_*(R) is the 2D projected stellar density. Finally, to produce an observable measure, one needs to integrate this quantity over an aperture 𝒜 and convolve by a PSF 𝒫:

$\begin{matrix} {(σ^{P})}^{2} = \frac{\int_{A} [Σ_{*} (R) σ_{s}^{2} (R) * P] d A}{\int_{A} [Σ_{*} (R) * P] d A} \cdot \end{matrix}$ $\begin{aligned} (\sigma ^P)^2 = \frac{\int _{\fancyscript {A}} \left[\Sigma _{*}(R)\sigma _{\rm s}^2(R) *\fancyscript {P}\right]\mathrm{d} A}{\int _{\fancyscript {A}} \left[\Sigma _{*}(R) *\fancyscript {P}\right]\mathrm{d}A}\cdot \end{aligned}$ (12)

All told, one can calculate the observed aperture velocity dispersion given a mass distribution, a light distribution, and a description of the orbital anisotropy. One common parameterization of the anisotropy is to use

$\begin{matrix} β_{ani} (r) = \frac{r^{2}}{r_{ani}^{2} + r^{2}}, \end{matrix}$ $\begin{aligned} \beta _{\rm ani}(r)=\frac{r^2}{r_{\rm ani}^2+r^2}, \end{aligned}$ (13)

which describes the orbits as isotropic at small radii and radial at large radii, with transition radius r_ani (Osipkov 1979; Merritt 1985). It is also useful for our purposes to describe this transition radius relative to the effective radius using the anisotropy parameter

$\begin{matrix} a_{ani} \equiv \frac{r_{ani}}{r_{eff}} \cdot \end{matrix}$ $\begin{aligned} a_{\rm ani} \equiv \frac{r_{\rm ani}}{r_{\rm eff}}\cdot \end{aligned}$ (14)

The spherical Jeans framework has the advantage that it is tractable to calculate quickly for a large number of models, although this is a simplification of the true dynamics of galaxies. We initially experimented with using the more general Jeans anisotropic modeling (JAM; Cappellari 2008, 2020; Shajib 2019), which is starting to be explored for lens systems (e.g. using mock JWST kinematics, Yıldırım et al. 2020, 2021), to calculate an input velocity dispersion while keeping the hierarchical modeling framework as spherical Jeans. However, there are subtleties regarding the differences between the two methods which make interpretation difficult. The main challenge is that the anisotropy is parameterized differently in JAM ( $β = 1 - σ_{z}^{2} / σ_{r}^{2}$ $\beta=1-\sigma^2_{z}/\sigma^2_{\rm r}$ , instead with respect to the z-coordinate of a cylindrical coordinate system), which can change the aperture-measured velocity dispersion. Therefore, the experiment would not be a test of the hierarchical framework’s multiple-redshift capabilities, but instead a test of the accuracy of anisotropy assumptions. Determination of an optimal description of anisotropy for lensing work proves to be a significantly more involved task than the scope of this work, and requires a focused study of its own. In the future, it would be preferable to upgrade the hierarchical framework to work with JAM, but the challenge is that JAM is too computationally expensive to sample within an MCMC. Another project is currently underway to attempt to speed up this calculation through the use of a neural network to precalculate the posterior space for MCMC sampling (paper in prep.). In this present work, we simply used spherical Jeans modeling in both the input and in the hierarchical fitting.

Regarding previous work using Jeans modeling within the hierarchical framework, we note that the kinematics calculation of Birrer et al. (2020), Birrer & Treu (2021) approximated the light distribution as a Hernquist profile for the ρ_* tracer population in Eq. (9). Since our light distribution is a Chameleon profile, we updated lenstronomy so that it is capable of calculating kinematics for Chameleon profiles (see Appendix A for details). This way, we avoid any potential bias that could come from a mismatch between the true light distribution and the one used to calculate the kinematics.

The specific kinematic setup used in this work is the same throughout. For each lens we calculated mock velocity dispersions within 3 apertures with radii of 0.67″, 1.33″, and 2″, convolved with a seeing of 0.7″. Each dispersion was given an uncertainty of 10%. For real systems, IFU spectroscopy is capable of measuring 2D kinematic constraints (such as in MaNGA e.g., Bevacqua et al. 2022) which correspond to multiple aperture constraints (Shajib et al. 2020). Some of the existing lens set already has such constraints via MUSE, typically made with ≃5% precision (see Table E.1 of TDC4 for the SLACS set). Furthermore, resolved kinematics should be possible with JWST (Yıldırım et al. 2020) and adaptive-optics IFU data such as MUSE (Bacon et al. 2010), KCWI (Morrissey et al. 2018) and OSIRIS (Larkin et al. 2006), providing even higher precision constraints for some subset of the lens sample. Indeed, IFU kinematics are anticipated within the TDCOSMO forecast using the hierarchical framework (Birrer & Treu 2021). As such, our kinematic setup uses a conservative estimate of the constraining power of real observations. For our mock kinematics, we must set an input anisotropy a_ani. For real systems a_ani must take on a range of values but the exact physical distribution is not well-known. To focus the experiment on the effect of changing impact parameter rather than the parameterization of anisotropy, we set input a_ani = 1 for all systems. The limitations of this assumption are further discussed in Sect. 5.3.1. This describes the mock measured velocity dispersions calculated for the true profile.

We must also calculate the predicted velocity dispersions from the lensing model. This was done using the same spherical Jeans modeling for the PEMD fit profile, resulting in a measured velocity dispersion for each aperture. The posteriors of the lensing fit were sampled, with uncertainties in the fit slope and Einstein radius included. We also introduced a 10% uncertainty to the effective radius in this sampling. Considering the effect of the MST on velocity dispersion, we note that the infinite uniform sheet term in Eq. (2) does not affect kinematics. As such, the effect of the MST on the velocity dispersion is a modification by a constant factor of $\sqrt{λ_{int}}$ $\sqrt{\lambda_{\mathrm{int}}}$ which follows from the application of the coefficient λ_int to the density in Eq. (11) (or from the simple scaling that v² ∝ GM/R). The hierarchical framework seeks to match the predicted and measured velocity dispersions, sampling over the parameters which describe the population-level a_ani and λ_int.

A summary table of the parameters used for mock lens creation is made available. See Appendix C.

4. Results

We now present results for an ensemble of lens systems. We used the publicly available hierArc package⁶ (Birrer et al. 2020) to sample over the posteriors from the PEMD fits to each system. From the lensing posteriors and kinematic information, hierArc samples for H₀, using a linear parameterization of the population-scale MST (Eq. (4)) with intercept λ_int, 0 and slope α_λ, with a spread of σ(λ_int). Additionally, the distribution of anisotropy is described as Gaussian with median ⟨a_ani⟩ and standard deviation σ(a_ani). This work uses uniform priors for all of these parameters with bounds listed in Appendix C. One can compare the results from including just the lenses from the one or the other source redshift, or a combination using the populations from both source redshifts.

4.1. 20 systems

The results for 20 Chameleon+NFW lenses are plotted in Fig. 6. A comparison of the SLACS-only population, the TDCOSMO-only population, and the combined populations indicates that the combination can increase the precision and accuracy on the final value of H₀, even though the systems span different ranges of θ_eff/θ_Ein.

Fig. 6.

Results of the hierarchical framework using 20 Chameleon+NFW lenses for the population with SLACS source redshift and no time delays (purple), the population with TDCOSMO source redshift (orange) and the combination of both populations (blue).

The TDCOSMO-only set recovered a median H₀ which is higher than the fiducial value of H₀ by approximately 6.2% (1.5σ). The broad width of the posterior is largely due to the degeneracy with ⟨a_ani⟩, which was poorly constrained from a single-redshift set of lenses.

The SLACS-only population had no point sources, therefore no time delays; as a result, it could not place a constraint on H₀ alone, as evidenced by the wide and unconstrained purple contours in the first column of Fig. 6. Rather, the contribution of these systems is to inform the parameters other than H₀: those concerning λ_int and a_ani. This information is combined with the TDCOSMO systems (which themselves bring information about the H₀ posterior) to break the MSD. This combination enabled the fit to recover an unbiased H₀ value of $70 . 6_{- 1.7}^{+ 2.0}$ $70.6^{+2.0}_{-1.7}$ km s⁻¹ Mpc⁻¹. As a particularly nice visual example, the panel which shows α_λ and λ_int, 0 (middle row, second column) illustrates how these parameters, while degenerate within the SLACS posteriors, were nonetheless able to inform the TDCOSMO set to recover an accurate description of λ_int, 0. As in TDC4, the combined-population result was shifted downward, but still consistent with the TDCOSMO-only value of H₀.

4.2. 40 systems

While 20 systems is a reasonable short-term goal of TDCOSMO, a larger number of lenses will soon become available with the advent of LSST, Euclid, and larger telescopes. Increased sample size should increase the precision on H₀, but intrinsic scatter in λ_int may inhibit this precision increase. We took this opportunity to test if an increased number of lens systems within the hierarchical framework can correctly quantify and account for the intrinsic scatter in λ_int. As such, we doubled the number of systems to 40 at each redshift.

With 40 systems at each redshift, we recovered the results in Fig. 7. As intended, the combined SLACS+TDCOSMO value errors did indeed shrink, zeroing in on the correct value of H₀, with a recovered value of $69 . 5_{- 1.3}^{+ 1.2}$ $69.5^{+1.2}_{-1.3}$ km s⁻¹ Mpc⁻¹.

Fig. 7.

Effect of the number of lenses: the hierarchical result of a population of Chameleon+NFW lenses, combining the fits from the TDCOSMO and SLACS source redshifts. The blue is the same as that in Fig. 6, using 20 lenses from each redshift. The pink distribution corresponds to the same setup, but with 40 lenses each.

4.3. Single-redshift test

We were curious about the importance of probing a range of redshifts, that is to say spread over a greater range of impact parameter. To discuss this, we must compare against the case in which the lenses come from the same redshift. In Appendix D, we show the results of a test where we set the systems to the same redshift and ran hierArc. With 20 time delay and 20 non-time-delay lenses all at the SLACS z_s, the result was $H_{0} = 69 . 7_{- 1.6}^{+ 1.6}$ $H_0=69.7^{+1.6}_{-1.6}$ km s⁻¹ Mpc⁻¹.

5. Discussion

The goal of this project has been to test the systematics of the hierarchical framework. The framework has already been tested on a single sample of time-delay lens systems from hydrodynamical simulations; we have expanded this analysis to a different input model using two-component analytical profiles consistent with the observed lens population. Furthermore, we have expanded the analysis to test the effect of combining the sample with an analogous population of non-time-delay lenses with different source redshifts.

5.1. Prehierarchical result

To better understand the results of the hierarchical fitting, it can be useful to compare to the lensing results alone, before the MST is accounted for. Figure 8 shows the lensing-only recovered value of H₀ for 20 systems at each z_s, expressed as H_0,true/H_0,model = λ_int, as a function of the radial scaling parameterization θ_eff/θ_Ein − 1. We note that since the SLACS-like systems have no point sources, they do not recover H₀ values, and so we have instead plotted the results from the test where the same 20 lenses were given point sources in Sect. 4.3. The hierarchical parameters of λ_int, 0, σ(λ_int), and α_λ reflect the summary statistics of a linear-fit description of this plot. We can see λ_int has a median of approximately 1.18 and scatter of 0.03, which should be approximately the truth values of λ_int, 0 and σ(λ_int), assuming α_λ = 0 (which seems reasonable given the weak correlation, Pearson R = −0.18).

Fig. 8.

Lensing-only values of H₀ (i.e., λ_int) as a function of radial impact parameter defined in Eq. (4), which assumes this relationship is described as linear with Gaussian scatter. The SLACS-z_s systems are indicated in red rather than purple because they are alternate versions of the same systems but with point sources and time delays (such that a value for H₀ can be recovered and plotted).

The lensing data is supplemented with kinematic data, and so it can be enlightening to see the velocity dispersion values from the model fits to the lensing data. Before any MST correction was applied, and setting a_ani to the truth value of 1, we found that all fits from both systems recovered model velocity dispersions (σ^P) which were approximately 7.5% lower than the true values. For individual systems this bias value is contained within the uncertainties of individual σ^P measurements (10% uncertainties), but at a population level systems require an average λ_int > 1 to reproduce the (observed) lens velocity dispersion compared to the power-law model. This illustrates the necessity to use information from the population of lenses rather than individually.

5.2. Performance of the hierarchical framework

We first consider the capacity of the hierarchical framework to reproduce H₀ from a population at a single source redshift and therefore a narrow range of θ_eff/θ_Ein. Considering the case with 20 systems using only the TDCOSMO-like source redshift (orange set in Fig. 6), the spread in anisotropy radius ⟨a_ani⟩ propagates into 4.4% uncertainty in H₀, with a true value which lies within the 95% confidence interval of of the posterior. When expanded to 40 systems (not plotted), this resulted in $H_{0} = 72 . 3_{- 1.7}^{+ 2.3}$ $H_0 = 72.3^{+2.3}_{-1.7}$ km s⁻¹ Mpc⁻¹, again with the true value within the 95% confidence interval, but now with an uncertainty of only 2.8%. It is interesting that the result lies approximately 1.5σ away from the truth in both cases, even as the value of σ narrows. This may indicate that discrepancies still could exist within the assumptions in the framework at the ≃1σ level, but 1.5σ is not severe enough to claim significance. Nevertheless, this precision represents a substantial improvement over the MST-biased results from lens modeling alone, which had a median bias of 16% below the fiducial value. This illustrates the success of the hierarchical framework in breaking the MSD for a single population.

We further consider the capacity of the hierarchical framework to account for a change in the impact parameter for a set of otherwise similar lenses. The SLACS-like population is identical to the TDCOSMO-like except that they have no point sources and have a smaller z_s resulting in a larger θ_eff/θ_Ein. Here we compare the result from the single-redshift population to that of the combined population. The combined-population constraints on ⟨a_ani⟩ are more informative than either individual set. The degeneracy between ⟨a_ani⟩ and H₀, visible in the TDCOSMO-only set of Fig. 6, shows how this improved anisotropy constraint leads to a more precise H₀ (2.6% scatter for 20 systems). The accuracy is also improved: our combined set for 20 systems was centered just 0.3σ away from the correct value. When we increased the number of systems to 40, the precision and accuracy improved further, to a final value of $69 . 5_{- 1.3}^{+ 1.2}$ $69.5^{+1.2}_{-1.3}$ km s⁻¹ Mpc⁻¹ (0.7% median offset, 1.8% scatter). The increase in accuracy comes primarily from tighter constraints on λ_int, 0 as the large number of systems is better able to probe the systematic behavior. Nonzero σ(λ_int) was also recovered, meaning the framework was able to capture the presence of the intrinsic variance in the input population. Based on Fig. 8, this scatter is anticipated to be approximately 3%, which was well-matched by the recovered value of σ(λ_int). Because the hierarchical framework explicitly captures this scatter through σ(λ_int), intrinsic scatter in the distribution of λ_int does not represent a systematic floor for H₀, that is to say it does not preclude the recovery of H₀ at a better precision than the intrinsic scatter, and so one can expect the precision to improve as $\sqrt{(} N)$ $\sqrt(N)$ . Importantly, the correct distribution of a_ani was recovered, which forced H₀ to the correct value. The power to constrain a_ani comes from having multiple-aperture kinematic measurements, providing a probe of the behavior of velocity dispersion with radius. In summary, we confirm that the hierarchical framework is capable of incorporating lens populations from different source redshifts.

When considering the test where all lenses are placed at the same redshift, we find that the framework successfully recovered H₀ in this test as well. We conclude that the method can result in an accurate value of H₀ when using either a set of lenses at the same redshift or combining sets from different redshifts.

Birrer & Treu (2021) forecasted the improvement in precision achieved within the hierarchical framework by increasing the size of the lens population, assuming improved constraining power of the velocity dispersion from integral field unit (IFU) spectroscopic data. Unlike this work, Birrer & Treu (2021) did not create and fit mocks, but simply assumed Gaussian constraining power for each individual lens. With such a setup, they found that with 40 time-delay lenses and up to 200 non-time-delay lenses, the precision can be expected to increase to 1.2 − 1.5%, depending on the exact setup. Our result using simulated mocks from start to finish found 1.8% precision with 40 time-delay and 40 non-time-delay lenses, consistent with this forecast.

We note that the general behavior of having a lower value of H₀ for the combined sample than the TDCOSMO-only sample qualitatively resembles the result from TDC4, where the same hierarchical framework was used on the observed lenses. TDC4 found $H_{0} = 74 . 5_{- 6.1}^{+ 5.6}$ $H_0 = 74.5^{+5.6}_{-6.1}$ km s⁻¹ Mpc⁻¹ when using 7 TDCOSMO lenses alone, but found that the value shifted to $67 . 4_{- 3.2}^{+ 4.1}$ $67.4^{+4.1}_{-3.2}$ km s⁻¹ Mpc⁻¹ when incorporating 33 SLACS lenses. Speaking in terms of the downward uncertainty of the TDCOSMO-only set, the value shifted by 1.1σ, whereas for this work we saw a similar 1.3σ shift. While this work cannot confirm or refute the mutual similarity between the real lens populations, it is reassuring to see consistent results with the real population, as it affirms that the observed shift is not necessarily an indicator of dissimilar populations. Such mutual similarity between the populations is the key assumption required to make use of the SLACS lenses hierarchically; this work finds no evidence against this assumption.

5.3. Limitations

5.3.1. Mock data

The results of this project look optimistic for the hierarchical framework, but we would be remiss to not discuss the differences between this work and TDC4, as well as some of the present limitations of our method. Firstly, some choices made for the mock data quality may differ slightly from the real lens population. This work has a larger number of systems compared to TDC4, which at the time included 7 TDCOSMO lenses and 33 SLACS lenses of which only 9 had multiple-aperture velocity dispersion measurements, with approximately 5% precision on σ^P. Here we used 20 (or 40) of each set, all of which have 3 kinematic apertures with 10% precision on σ^P. Image data quality in this work (e.g., source brightness, noise, exposure time, etc.) was modeled after the high S/N template from Van de Vyvere et al. (2022a), and as such are higher-than-average quality compared with current single-band data. We also used transparent lenses, which assumes perfect lens light subtraction. On the other hand, we used single-band images while the observed multiband imaging add additional constraining power and can help deblend lens and source light (e.g., Shajib et al. 2020; Schmidt et al. 2022). This work is intended to be forward-looking, and the sample of real lenses will grow over time to better match these settings in the future.

By design, this work uses only two source redshifts and one lens redshift to more accurately probe what is thought to be the key difference between the SLACS and TDCOSMO lenses. The real lens population spans a range of z_s and z_d, but since our populations already share similar distributions of θ_eff/θ_Ein, implementing this change would be unlikely to make a significant difference. However, if there are different selection effects between the observed SLACS and TDCOSMO samples, or changes to the global profile shape associated with galaxy evolution between the two sample lens redshifts, these effects are not captured in this work.

Regarding our choice of profile, the Chameleon+NFW profiles match many aspects of the observed systems, but may not perfectly describe the truth. Our SLACS-like ξ distribution has less scatter and is more isothermal than the observed set. Additionally, the fact that our PL lensing fits resulted in a different H₀ value than the truth may indicate a different behavior than the real lens population, which find similar values of H₀ whether a PL or composite model is used (Millon et al. 2020b). However, to call this a discrepancy assumes that fitting our systems with composite models would return the correct H₀ (no MST), which we do not necessarily claim. At any rate, we can state that the hierarchical framework successfully accounts for the MSD using either our analytical mocks or the hydro-simulated mocks of the TDLMC, supporting the claim that it is reliable for the true population of (unknown) lens profiles.

A salient difference between our mocks and real galaxies is their elliptical shape, while real galaxies harbor more complex azimuthal structures. The level of impact these structures can have on H₀ inference is debated. Early-type galaxies can be triaxial (van de Ven et al. 2009; Chang et al. 2013; Weijmans et al. 2014), and can have 2D mass projections which can change in ellipticity or position angle with radius (Kormendy et al. 2009). We initially sought to create lenses from 3D triaxial profiles for this work, but challenges with the kinematics calculation forced us to simplify to 2D elliptical profiles. Furthermore, galaxies have been known to have “disky” or “boxy” isophotes, corresponding to an n = 4 multipole perturbation to their mass distributions (Hao et al. 2006). Finally, in the ΛCDM paradigm there are expected to be a large number of subhalos within any given galactic halo which add substructure to the mass distribution (Gilman et al. 2020). Additional angular complexity of a lens can lead to a bias in H₀ when a model is not adequately azimuthally complex (Kochanek 2021). The effects of some of these angular structures in the context of lensing measurements of H₀ are explored in other works (e.g., substructure by Gilman et al. 2020, multipole by Van de Vyvere et al. 2022a, isodensity twists by Van de Vyvere et al. 2022b). However, it is clear that any substructures are a subdominant factor to the elliptical shape of the mass distribution as evidenced by the success of cosmography-grade parametric lens models. As such, our simple elliptical lenses are likely a reasonable approximation for this work, for which the purpose is to probe the effect of a changing radial structure.

Along the same lines, our spherical Jeans framework used to calculate both the input and the fit velocity dispersion is a simplification. There is an inconsistency between using an elliptical mass distribution for the lens and a spherical mass distribution for the velocity. Yıldırım et al. (2020) found, by using JAM in the spherical limit, that this approximation did not bias the resulting D_Δt measurement for their single system, although a formal inclusion of λ_int is not incorporated. This spherical assumption is common in joint lens+kinematics modeling, in which kinematics are meant only to provide a single integrated measure of mass, but may become more important as spatially resolved kinematics become more widely attainable. Perhaps of greater concern is the description of anisotropy through the single value of a_ani. This parameterization is known to be a simplification which does not accurately match N-body simulations (Mamon & Łokas 2005). Even so, it is unrealistic to expect each galaxy to have the same value of a_ani as has been set in this work as well as in Birrer & Treu (2021). Preliminary tests with changing the input distribution of a_ani have found that H₀ is still recovered well when a representative prior is used, but it is possible for biases to arise if the prior is misinformed (similar to the findings of Birrer et al. 2016, 2020). There is also room to alter and explore the kinematics settings (e.g., number of apertures, radii, seeing, uncertainties) to learn if there is an optimal strategy for an ideal kinematics setup. A more thorough quantification of these effects is a target for a future paper. A future goal would be to incorporate the JAM framework into hierArc, which at present may be too computationally demanding.

5.3.2. MST parameterization

Here we describe an ancillary result with possible implications for the theoretical description of the MST. We sought to supplement the empirical success of the linear parameterization of λ_int with impact parameter with an analytical description. The intention was to plot a “truth” theoretical relation in Fig. 8 representing a mapping from Chameleon+NFW to PEMD profiles, and determine if a linear parameterization was appropriate. We performed some tests using the ξ values of each profile to predict the λ_int one would recover from a PL fit (since the PL fit should recover the same ξ as the true profile). However, we found that the predicted H₀ using ξ differs from the actual value recovered from the mock fitting by approximately 3% (median). After considerable exploration, we find that approximately 60% of this discrepancy can be explained by effects of ellipticity: ξ is defined using a spherical lens for which the image radius is unambiguous. However, the remaining discrepancy is still not fully explained, and may be related to other degeneracies in the parameters, possibly dependent on image configuration. This discussion will be the subject of a separate publication (Gomer et al., in prep.). For this current work, this simply means that we cannot reliably use this ξ description to represent a theoretical target.

Finally, we mention in passing that the way the MST is parameterized physically in 3D kinematics is open to interpretation. In this work, the velocity dispersion is modified by a constant factor of $\sqrt{λ_{int}}$ $\sqrt{\lambda_{\mathrm{int}}}$ , noting that the infinite sheet term in Eq. (2) does not affect kinematics. An alternative description by (Blum et al. 2020) describes the MST as a cored profile, which is well-behaved at infinity and would leave a detectable imprint on kinematics (Yıldırım et al. 2021). Other descriptions are possible, as the deprojection of the MST is not unique. This work is confined to the first description, but future versions of hierArc could implement other parameterizations.

6. Conclusion

In this work, we conducted an experiment by creating a population of pairs of mocks of the same lens profile using two different source redshifts, representing the difference between the SLACS and TDCOSMO lens populations, controlling for any other possible differences. These images were constructed with two-component profiles, but fit as a power law, resulting in a ∼15% biased recovery of H₀. The posteriors from these fits were then combined together and supplemented with spherical Jeans velocity dispersion measurements using the hierarchical framework of Birrer et al. (2020). Our resulting value of H₀ using a single population of TDCOSMO-like lenses was consistent with the fiducial H₀ (using 20 lenses: 4.2% precision, 1.5σ median offset). When information from the SLACS-like population of lenses was used to supplement the recovery, H₀ was recovered more precisely and more accurately (using 20 lenses from each set: 2.6% precision, 0.3σ median offset). This precision can be further improved as the number of systems grows (using 40 lenses from each set: 1.8% precision, 0.4σ median offset).

Even though the intrinsic profile was more complex than the power-law model used to fit it, we confirm that the hierarchical framework was able to combine systems from different redshifts and adjust the parameterization of λ_int as a function of θ_eff/θ_Ein in a way which results in an unbiased H₀. While there may be other possible parameterizations, the linear function with respect to scaled effective radius seems acceptable, provided the data quality is good and the number of lenses is large.

There are still a number of open questions for the hierarchical framework, such as the effect of the distribution of a_ani and the proper way to parameterize the anisotropy of galaxies, or the effect of different 3D physical interpretations of the MSD. These questions will be further explored in future work, but this work validates the utility of the method and serves as an important first step.

¹

Our notation is to use θ to refer to a 2D projected radius in angular units, R to refer to the same quantity in physical kpc, and r to refer to the unprojected 3D radius.

²

Since we targeted each radius both in terms of physical size and angular size, the effective radius and Einstein radius were effectively weighted twice the weight of q.

³

The Chameleon profile drops more quickly at large radii than the Sérsic it is meant to emulate. As such, the mass at infinity of the baryon component may not perfectly match a Sérsic. For θ_eff (and R_eff), the value of integrated light at infinity of the Sérsic profile was used, rather than the artificially truncated Chameleon. The other quantities which are not defined using behavior at infinity are expected to be sufficiently accurate: the Einstein radius changes by about 0.01″ comparing a Sérsic+NFW to a Chameleon+NFW. As such, for calculation of the quantities other than θ_eff (and R_eff), the Chameleon profile was used directly.

⁴

The only minor difference arises because of the z_d = 0.25 compromise, which resulted in the physical scale conversion being different for the TDCOSMO lenses than for our lenses, and so even though the Einstein radius and effective radius match well in terms of kpc, our synthetic values are slightly larger than the median TDCOSMO values in terms of arcseconds.

⁵

https://github.com/sibirrer/lenstronomy

⁶

https://github.com/sibirrer/hierArc

⁷

https://github.com/mattgomer/lenstronomy-profile-troubleshooting

Acknowledgments

The authors thank Sherry Suyu, Akın Yıldırım, Martin Millon, Alessandro Sonnenfeld, and Matt Auger for their constructive discussion regarding this paper. This work uses the following Python packages: Python (Oliphant 2007; Millman & Aivazis 2011), Astropy (Astropy Collaboration 2013, 2018), Numpy (van der Walt et al. 2011), Scipy (Virtanen et al. 2020), Matplotlib (Hunter 2007), Pandas (McKinney 2010; The pandas development team 2020), and Seaborn (Waskom 2021). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 787886).

References

Abbott, B. P., Abbott, R., & Abbott, T. D. 2017, Nature, 551, 85 [NASA ADS] [CrossRef] [Google Scholar]
Abbott, T. M. C., Abdalla, F. B., Annis, J., et al. 2018, MNRAS, 480, 3879 [Google Scholar]
Addison, G. E., Watts, D. J., Bennett, C. L., et al. 2018, ApJ, 853, 119 [Google Scholar]
Aiola, S., Calabrese, E., Maurin, L., et al. 2020, JCAP, 2020, 047 [Google Scholar]
Anguita, T., Schechter, P. L., Kuropatkin, N., et al. 2018, MNRAS, 480, 5017 [NASA ADS] [Google Scholar]
Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]
Auger, M. W., Treu, T., Bolton, A. S., et al. 2009, ApJ, 705, 1099 [Google Scholar]
Auger, M. W., Treu, T., Bolton, A. S., et al. 2010, ApJ, 724, 511 [NASA ADS] [CrossRef] [Google Scholar]
Bacon, R., Accardo, M., & Adjali, L. 2010, in Ground-based and Airborne Instrumentation for Astronomy III, eds. I. S. McLean, S. K. Ramsay, & H. Takami, SPIE Conf. Ser., 7735, 773508 [NASA ADS] [Google Scholar]
Barnabe, M., & Koopmans, L. V. E. 2007, ApJ, 666, 726 [NASA ADS] [CrossRef] [Google Scholar]
Barnabè, M., Czoske, O., Koopmans, L. V. E., Treu, T., & Bolton, A. S. 2011, MNRAS, 415, 2215 [Google Scholar]
Bevacqua, D., Cappellari, M., & Pellegrini, S. 2022, MNRAS, 511, 139 [NASA ADS] [CrossRef] [Google Scholar]
Binney, J., & Mamon, G. A. 1982, MNRAS, 200, 361 [Google Scholar]
Birrer, S. 2021, ApJ, 919, 38 [NASA ADS] [CrossRef] [Google Scholar]
Birrer, S., & Amara, A. 2018, Phys. Dark Univ., 22, 189 [Google Scholar]
Birrer, S., & Treu, T. 2021, A&A, 649, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Birrer, S., Amara, A., & Refregier, A. 2016, JCAP, 2016, 020 [CrossRef] [Google Scholar]
Birrer, S., Treu, T., Rusu, C. E., et al. 2019, MNRAS, 484, 4726 [Google Scholar]
Birrer, S., Shajib, A. J., Galan, A., et al. 2020, A&A, 643, A165 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Birrer, S., Shajib, A., Gilman, D., et al. 2021, J. Open Sour. Softw., 6, 3283 [NASA ADS] [CrossRef] [Google Scholar]
Blum, K., Castorina, E., & Simonović, M. 2020, ApJ, 892, L27 [Google Scholar]
Bolton, A. S., Burles, S., Koopmans, L. V. E., Treu, T., & Moustakas, L. A. 2006, ApJ, 638, 703 [NASA ADS] [CrossRef] [Google Scholar]
Bolton, A. S., Burles, S., Koopmans, L. V. E., et al. 2008, ApJ, 682, 964 [Google Scholar]
Bonvin, V., Millon, M., Chan, J. H. H., et al. 2019, A&A, 629, A97 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Cappellari, M. 2008, MNRAS, 390, 71 [NASA ADS] [CrossRef] [Google Scholar]
Cappellari, M. 2020, MNRAS, 494, 4819 [NASA ADS] [CrossRef] [Google Scholar]
Chang, Y.-Y., van der Wel, A., Rix, H.-W., et al. 2013, ApJ, 773, 149 [NASA ADS] [CrossRef] [Google Scholar]
Chen, G. C. F., Fassnacht, C. D., Suyu, S. H., et al. 2019, MNRAS, 490, 1743 [NASA ADS] [CrossRef] [Google Scholar]
Ciotti, L., & Bertin, G. 1999, A&A, 352, 447 [NASA ADS] [Google Scholar]
Courbin, F., Bonvin, V., Buckley-Geer, E., et al. 2018, A&A, 609, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Ding, X., Treu, T., Shajib, A. J., et al. 2018, ArXiv e-prints [arXiv:1801.01506] [Google Scholar]
Ding, X., Treu, T., Birrer, S., et al. 2021a, MNRAS, 503, 1096 [Google Scholar]
Ding, X., Treu, T., Birrer, S., et al. 2021b, MNRAS, 501, 269 [Google Scholar]
Dressel, L. 2012, Wide Field Camera, 3, 5 [NASA ADS] [Google Scholar]
Dutton, A. A., & Treu, T. 2014, MNRAS, 438, 3594 [NASA ADS] [CrossRef] [Google Scholar]
Dutton, A. A., Brewer, B. J., Marshall, P. J., et al. 2011, MNRAS, 417, 1621 [NASA ADS] [CrossRef] [Google Scholar]
Falco, E. E., Gorenstein, M. V., & Shapiro, I. I. 1985, ApJ, 289, L1 [Google Scholar]
Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
Freedman, W. L. 2021, ApJ, 919, 16 [NASA ADS] [CrossRef] [Google Scholar]
Freedman, W. L., Madore, B. F., Hoyt, T., et al. 2020, ApJ, 891, 57 [Google Scholar]
Gavazzi, R., Treu, T., Rhodes, J. D., et al. 2007, ApJ, 667, 176 [Google Scholar]
Gilman, D., Birrer, S., & Treu, T. 2020, A&A, 642, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Golse, G., & Kneib, J. P. 2002, A&A, 390, 821 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gomer, M., & Williams, L. L. R. 2020, JCAP, 2020, 045 [CrossRef] [Google Scholar]
Gomer, M. R., & Williams, L. L. R. 2018, MNRAS, 475, 1987 [NASA ADS] [CrossRef] [Google Scholar]
Goodman, J., & Weare, J. 2010, Commun. Appl. Math. Comput. Sci., 5, 65 [Google Scholar]
Hao, C. N., Mao, S., Deng, Z. G., Xia, X. Y., & Wu, H. 2006, MNRAS, 370, 1339 [CrossRef] [Google Scholar]
Huang, C. D., Riess, A. G., Yuan, W., et al. 2020, ApJ, 889, 5 [Google Scholar]
Humphrey, P. J., & Buote, D. A. 2010, MNRAS, 403, 2143 [Google Scholar]
Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]
Kassiola, A., & Kovner, I. 1993, ApJ, 417, 450 [Google Scholar]
Keeton, C. R., Kochanek, C. S., & Falco, E. E. 1998, ApJ, 509, 561 [NASA ADS] [CrossRef] [Google Scholar]
Kennedy, J., & Eberhart, R. 1995, Proceedings of ICNN’95 – International Conference on Neural Networks, 4, 1942 [CrossRef] [Google Scholar]
Kenworthy, W. D., Scolnic, D., & Riess, A. 2019, ApJ, 875, 145 [NASA ADS] [CrossRef] [Google Scholar]
Kochanek, C. S. 2020, MNRAS, 493, 1725 [NASA ADS] [CrossRef] [Google Scholar]
Kochanek, C. S. 2021, MNRAS, 501, 5021 [Google Scholar]
Koopmans, L. V. E., Treu, T., Fassnacht, C. D., Blandford, R. D., & Surpi, G. 2003, ApJ, 599, 70 [NASA ADS] [CrossRef] [Google Scholar]
Koopmans, L. V. E., Bolton, A., Treu, T., et al. 2009, ApJ, 703, L51 [NASA ADS] [CrossRef] [Google Scholar]
Kormendy, J., Fisher, D. B., Cornell, M. E., & Bender, R. 2009, ApJS, 182, 216 [Google Scholar]
Larkin, J., Barczys, M., Krabbe, A., et al. 2006, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, eds. I. S. McLean, & M. Iye, 6269, 62691A [Google Scholar]
Mamon, G. A., & Łokas, E. L. 2005, MNRAS, 363, 705 [Google Scholar]
Merritt, D. 1985, AJ, 90, 1027 [Google Scholar]
Millman, K. J., & Aivazis, M. 2011, Comput. Sci. Eng., 13, 9 [NASA ADS] [CrossRef] [Google Scholar]
Millon, M., Courbin, F., Bonvin, V., et al. 2020a, A&A, 640, A105 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Millon, M., Galan, A., Courbin, F., et al. 2020b, A&A, 639, A101 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Morrissey, P., Matuszewski, M., Martin, D. C., et al. 2018, ApJ, 864, 93 [Google Scholar]
Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 [Google Scholar]
Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]
Oliphant, T. E. 2007, Comput. Sci. Eng., 9, 10 [NASA ADS] [CrossRef] [Google Scholar]
Osipkov, L. P. 1979, Pisma v Astronomicheskii Zhurnal, 5, 77 [Google Scholar]
Pesce, D., Braatz, J., Reid, M., et al. 2020, H₀2020: Assessing Uncertainties in Hubble’s Constant Across the Universe, 20 [Google Scholar]
Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Poulin, V., Smith, T. L., Karwal, T., & Kamionkowski, M. 2019, Phys. Rev. Lett., 122, 221301 [Google Scholar]
Riess, A. G., Yuan, W., Macri, L. M., et al. 2022, ApJ, 934, L7 [NASA ADS] [CrossRef] [Google Scholar]
Rusu, C. E., Wong, K. C., Bonvin, V., et al. 2020, MNRAS, 498, 1440 [Google Scholar]
Saha, P., & Williams, L. L. R. 2006, ApJ, 653, 936 [NASA ADS] [CrossRef] [Google Scholar]
Schmidt, T., Treu, T., Birrer, S., et al. 2022, MNRAS, submitted [arXiv:2206.04696] [Google Scholar]
Schneider, P., & Sluse, D. 2013, A&A, 559, A37 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Schneider, P., & Sluse, D. 2014, A&A, 564, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Sérsic, J. L. 1963, Bol. Assoc. Argent. Astron. Plata Argent., 6, 41 [Google Scholar]
Shajib, A. J. 2019, MNRAS, 488, 1387 [NASA ADS] [CrossRef] [Google Scholar]
Shajib, A. J., Birrer, S., Treu, T., et al. 2019, MNRAS, 483, 5649 [Google Scholar]
Shajib, A. J., Birrer, S., Treu, T., et al. 2020, MNRAS, 494, 6072 [Google Scholar]
Shajib, A. J., Treu, T., Birrer, S., & Sonnenfeld, A. 2021, MNRAS, 503, 2380 [Google Scholar]
Shi, Y., & Eberhart, R. 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360), 69 [CrossRef] [Google Scholar]
Sluse, D., Chantry, V., Magain, P., Courbin, F., & Meylan, G. 2012, A&A, 538, A99 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Sonnenfeld, A. 2018, MNRAS, 474, 4648 [Google Scholar]
Sonnenfeld, A., Treu, T., Gavazzi, R., et al. 2012, ApJ, 752, 163 [Google Scholar]
Spingola, C., McKean, J. P., Auger, M. W., et al. 2018, MNRAS, 478, 4816 [NASA ADS] [CrossRef] [Google Scholar]
Suyu, S. H., Marshall, P. J., Blandford, R. D., et al. 2009, ApJ, 691, 277 [Google Scholar]
Suyu, S. H., Marshall, P. J., Auger, M. W., et al. 2010, ApJ, 711, 201 [Google Scholar]
Suyu, S. H., Treu, T., Hilbert, S., et al. 2014, ApJ, 788, L35 [NASA ADS] [CrossRef] [Google Scholar]
Suyu, S. H., Bonvin, V., Courbin, F., et al. 2017, MNRAS, 468, 2590 [Google Scholar]
Treu, T., & Koopmans, L. V. E. 2002a, ApJ, 575, 87 [NASA ADS] [CrossRef] [Google Scholar]
Treu, T., & Koopmans, L. V. E. 2002b, MNRAS, 337, L6 [NASA ADS] [CrossRef] [Google Scholar]
Treu, T., Agnello, A., Baumer, M. A., et al. 2018, MNRAS, 481, 1041 [NASA ADS] [CrossRef] [Google Scholar]
The pandas development team 2020, https://doi.org/10.5281/zenodo.3509134 [Google Scholar]
Unruh, S., Schneider, P., & Sluse, D. 2017, A&A, 601, A77 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
van de Ven, G., Mandelbaum, R., & Keeton, C. R. 2009, MNRAS, 398, 607 [NASA ADS] [CrossRef] [Google Scholar]
Van de Vyvere, L., Gomer, M. R., Sluse, D., et al. 2022a, A&A, 659, A127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Van de Vyvere, L., Sluse, D., Gomer, M. R., & Mukherjee, S. 2022b, A&A, 663, A179 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Comput. Sci. Eng., 13, 22 [Google Scholar]
Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Meth., 17, 261 [Google Scholar]
Waskom, M. L. 2021, J. Open Sour. Softw., 6, 3021 [NASA ADS] [CrossRef] [Google Scholar]
Weijmans, A.-M., de Zeeuw, P. T., Emsellem, E., et al. 2014, MNRAS, 444, 3340 [Google Scholar]
Wertz, O., Orthen, B., & Schneider, P. 2018, A&A, 617, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
McKinney, W. 2010, in Proceedings of the 9th Python in Science Conference, eds. S. van der Walt, & J. Millman, 56 [Google Scholar]
Wong, K. C., Suyu, S. H., Auger, M. W., et al. 2017, MNRAS, 465, 4895 [NASA ADS] [CrossRef] [Google Scholar]
Wong, K. C., Suyu, S. H., Chen, G. C. F., et al. 2020, MNRAS, 498, 1420 [Google Scholar]
Xu, D., Sluse, D., Schneider, P., et al. 2016, MNRAS, 456, 739 [Google Scholar]
Xu, D., Springel, V., Sluse, D., et al. 2017, MNRAS, 469, 1824 [NASA ADS] [CrossRef] [Google Scholar]
Yıldırım, A., Suyu, S. H., & Halkola, A. 2020, MNRAS, 493, 4783 [Google Scholar]
Yıldırım, A., Suyu, S. H., Chen, G. C. F., & Komatsu, E. 2021, A&A, submitted [arXiv:2109.14615] [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Yoo, J., Kochanek, C. S., Falco, E. E., & McLeod, B. A. 2006, ApJ, 642, 22 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Troubleshooting Chameleon profiles

A.1. Sérsic-matching modification

Initially we intended to use a Sérsic profile directly for the baryon mass component, matching the observed family of light distributions. Unfortunately, the Sérsic profile is computationally prohibitive to implement as a lens. Instead, we followed the practice of Dutton et al. (2011) and Suyu et al. (2014) to create a Chameleon profile which emulates the Sérsic profile by using a combination of two cored isothermal profiles. In the notation of Dutton & Treu (2014), Eq. 7 reads:

$\begin{matrix} Σ (R) = \frac{Σ_{0}}{1 - α} (\frac{R_{0}}{\sqrt{R^{2} + R_{0}^{2}}} - \frac{R_{0}}{\sqrt{R^{2} + {(R_{0} / α)}^{2}}}) \cdot \end{matrix}$ $\begin{aligned} \Sigma (R) = \frac{\Sigma _0}{1-\alpha }\left(\frac{R_0}{\sqrt{R^2+R_0^2}}-\frac{R_0}{\sqrt{R^2+(R_0/\alpha )^2}}\right)\cdot \end{aligned}$ (A.1)

Dutton et al. (2011) provide a prescription to convert a given Sérsic profile into a Chameleon profile based on a polynomial fit. When we followed this prescription, we found we were unable to match our input Sérsic profile to the provided precision. Instead, we found that, at least for a Sérsic with n = 4, we had to manually modify the ratio of the two core radii, dividing α by a factor of 1.5. Only after this modification do our profiles match Fig. A.2 of Dutton et al. (2011). This effect is shown in Fig. A.1. We have not tested the general case. We suspect that there is a transcription error in the polynomial provided within the manuscript, but it is also possible that there is an error in our implementation, so we have made available a Python notebook which shows our calculations⁷. At any rate, the Chameleon profiles we use in this work are constructed from our modified prescription and match Sérsic profiles to the precision shown in Fig. A.1.

Fig. A.1.

Comparing the matching Chameleon profile to the input Sérsic profile. The modification we have made to the prescription better reproduces the corresponding figure from Dutton et al. (2011).

A.2. Multiple Gaussian expansion

While troubleshooting the kinematics calculation within lenstronomy, we discovered a discrepancy can be introduced when the light profile is modeled as a Multiple Gaussian Expansion (MGE) to approximate the profile rather than the analytical profile. The MGE is used because it is simple to deproject, and a 3D mass distribution is required for kinematics. However, lenstronomy uses a 1D MGE for its kinematic calculation because it assumes spherical Jeans, which can lead to inaccuracies in deprojection when starting from an elliptical mass distribution. Namely, we found that the MGE introduces an excess of light at large radii in 3D when the lens is not circular. This can introduce a bias in the calculated σ_v at large radii, which can bias the aperture-weighed v_ap by approximately 3% (see Fig. A.2). The effect worsens for larger radii and for more elliptical lenses (this example used q = 0.8). To avoid this, lenstronomy has been updated to include the 3D profile for the Chameleon profile, such that the MGE is no longer necessary, but this serves as a cautionary tale to beware of subtle effects when calculating kinematics of lens models.

Fig. A.2.

Use of the 1D MGE introduces an excess of light in 3D at large radius for an elliptical lens. This causes the σ_v calculation to be biased by approximately 3% near the aperture radius of 1 arcsecond.

Appendix B: Single fit example

As a demonstration of the MST, we plot the result from lensing fits to a single profile (created with a source placed at both z_s). Figure B.1 shows that the best-fit PEMD result is not equivalent to the convergence profile, but rather has been transformed by an MST. This results in a shallower slope than the truth. Kinematics are required to correct the effect through a determination of λ_int; the lensing data alone returns a biased value of H₀.

Fig. B.1.

Example of a single profile, placed at each redshift and fit as a PEMD. Top: Convergence profiles as a function of radius; solid curves indicate input convergence profiles while dashed lines indicate convergence profiles corresponding to the PEMD fit. Bottom: residuals showing the differences between the input and PEMD fit profiles. Vertical lines correspond to the Einstein radii of the corresponding redshift.

Appendix C: Fitting parameter bounds

The summarized parameters for lens construction and the uniform prior bounds used in this work are displayed in Table C.1.

Table C.1.

Parameters for the mock creation and hierArc priors for this work.

Appendix D: Single-redshift test

Because the hierarchical framework is able to combine systems which span a range of impact parameters into an unbiased H₀ result, we were curious if the range of spread was beneficial to the success of the method. We modified the set of 20 SLACS-like images from Sect. 4.1 by adding point sources with time delay information. For a set with no point sources, we use the remaining 20 SLACS-like images created in Sect. 4.2. As such, all lenses in this test have the same redshifts, with the case with no point sources simply corresponding to a different realization of profiles drawn from the same global distribution. We combine these sets together using the hierarchical framework and plot the results in Fig. D.1.

Fig. D.1.

Hierarchical results from lens populations without point sources (purple) and with point sources (red) are plotted, as well as their combination (blue). Unlike Fig. 6, the red population has the same source redshift as the purple population.

We find that the result is similar to Sect. 4.1, with the combination of populations successfully recovering H₀ with 2.2% precision. One minor difference is that the intrinsic scatter σ(λ_int) is now consistent with zero; the framework captures the fact that this population of lenses has less intrinsic variation than the case where it is being probed at a wider range of different radii.

All Tables

Table C.1.

Parameters for the mock creation and hierArc priors for this work.

In the text

All Figures

Fig. 1.

Histograms depict the distribution of observed SLACS (purple) and TDCOSMO (orange) lens properties. Our simulated populations (shown as KDEs of corresponding colors) were created by constructing a large number of profiles and selecting a subset which simultaneously match the 5 target parameter distributions, indicated by the green borders. The simulated SLACS and TDCOSMO populations are the same profiles, only differing by a changed source redshift. For parameters where the two sets are identical, a black line is used instead of orange or purple.

In the text

Fig. 2.

Redshift comparison with the observed lens population. Left: the distributions of source redshifts against lens redshifts for the populations of observed SLACS (purple) and TDCOSMO (orange) lenses. Our selections of source and lens redshifts are indicated as dashed lines. Right: these selections result in the dashed-line Σ_crit values, which are consistent with the observed populations.

In the text

	Fig. 3. Scaled radial quantity used by the hierarchical framework to parameterize radial dependence for MST, plotted for the observed population (histograms) and the simulated profiles used in this work (KDE curves).
In the text

Fig. 4.

Convergence profiles for 20 analytical Chameleon+NFW profiles used in this paper (purple: SLACS-like; orange: TDCOSMO-like) alongside 16 profiles from hydro-simulations used in the TDLMC (green). Colored bands correspond to the 16th and 84th percentiles. Vertical colored lines indicate the median Einstein radius of the corresponding population. The diagonal black line indicates an isothermal slope for comparison.

In the text

	Fig. 5. Images (log brightness) and residuals (in units of σ) of 10 systems at 2 source redshifts. Left: SLACS analog with z_s = 0.6 and no point sources, Right: TDCOSMO analog with z_s = 2.0. Lower-left scale bars correspond to 1″.
In the text

	Fig. 6. Results of the hierarchical framework using 20 Chameleon+NFW lenses for the population with SLACS source redshift and no time delays (purple), the population with TDCOSMO source redshift (orange) and the combination of both populations (blue).
In the text

	Fig. 7. Effect of the number of lenses: the hierarchical result of a population of Chameleon+NFW lenses, combining the fits from the TDCOSMO and SLACS source redshifts. The blue is the same as that in Fig. 6, using 20 lenses from each redshift. The pink distribution corresponds to the same setup, but with 40 lenses each.
In the text

Fig. 8.

Lensing-only values of H₀ (i.e., λ_int) as a function of radial impact parameter defined in Eq. (4), which assumes this relationship is described as linear with Gaussian scatter. The SLACS-z_s systems are indicated in red rather than purple because they are alternate versions of the same systems but with point sources and time delays (such that a value for H₀ can be recovered and plotted).

In the text

	Fig. A.1. Comparing the matching Chameleon profile to the input Sérsic profile. The modification we have made to the prescription better reproduces the corresponding figure from Dutton et al. (2011).
In the text

	Fig. A.2. Use of the 1D MGE introduces an excess of light in 3D at large radius for an elliptical lens. This causes the σ_v calculation to be biased by approximately 3% near the aperture radius of 1 arcsecond.
In the text

Fig. B.1.

Example of a single profile, placed at each redshift and fit as a PEMD. Top: Convergence profiles as a function of radius; solid curves indicate input convergence profiles while dashed lines indicate convergence profiles corresponding to the PEMD fit. Bottom: residuals showing the differences between the input and PEMD fit profiles. Vertical lines correspond to the Einstein radii of the corresponding redshift.

In the text

	Fig. D.1. Hierarchical results from lens populations without point sources (purple) and with point sources (red) are plotted, as well as their combination (blue). Unlike Fig. 6, the red population has the same source redshift as the purple population.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Abbott, B. P., Abbott, R., & Abbott, T. D. 2017, Nature, 551, 85 [NASA ADS] [CrossRef] [Google Scholar]

[2] Abbott, T. M. C., Abdalla, F. B., Annis, J., et al. 2018, MNRAS, 480, 3879 [Google Scholar]

[3] Addison, G. E., Watts, D. J., Bennett, C. L., et al. 2018, ApJ, 853, 119 [Google Scholar]

[4] Aiola, S., Calabrese, E., Maurin, L., et al. 2020, JCAP, 2020, 047 [Google Scholar]

[5] Anguita, T., Schechter, P. L., Kuropatkin, N., et al. 2018, MNRAS, 480, 5017 [NASA ADS] [Google Scholar]

[6] Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[7] Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]

[8] Auger, M. W., Treu, T., Bolton, A. S., et al. 2009, ApJ, 705, 1099 [Google Scholar]

[9] Auger, M. W., Treu, T., Bolton, A. S., et al. 2010, ApJ, 724, 511 [NASA ADS] [CrossRef] [Google Scholar]

[10] Bacon, R., Accardo, M., & Adjali, L. 2010, in Ground-based and Airborne Instrumentation for Astronomy III, eds. I. S. McLean, S. K. Ramsay, & H. Takami, SPIE Conf. Ser., 7735, 773508 [NASA ADS] [Google Scholar]

[11] Barnabe, M., & Koopmans, L. V. E. 2007, ApJ, 666, 726 [NASA ADS] [CrossRef] [Google Scholar]

[12] Barnabè, M., Czoske, O., Koopmans, L. V. E., Treu, T., & Bolton, A. S. 2011, MNRAS, 415, 2215 [Google Scholar]

[13] Bevacqua, D., Cappellari, M., & Pellegrini, S. 2022, MNRAS, 511, 139 [NASA ADS] [CrossRef] [Google Scholar]

[14] Binney, J., & Mamon, G. A. 1982, MNRAS, 200, 361 [Google Scholar]

[15] Birrer, S. 2021, ApJ, 919, 38 [NASA ADS] [CrossRef] [Google Scholar]

[16] Birrer, S., & Amara, A. 2018, Phys. Dark Univ., 22, 189 [Google Scholar]

[17] Birrer, S., & Treu, T. 2021, A&A, 649, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[18] Birrer, S., Amara, A., & Refregier, A. 2016, JCAP, 2016, 020 [CrossRef] [Google Scholar]

[19] Birrer, S., Treu, T., Rusu, C. E., et al. 2019, MNRAS, 484, 4726 [Google Scholar]

[20] Birrer, S., Shajib, A. J., Galan, A., et al. 2020, A&A, 643, A165 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[21] Birrer, S., Shajib, A., Gilman, D., et al. 2021, J. Open Sour. Softw., 6, 3283 [NASA ADS] [CrossRef] [Google Scholar]

[22] Blum, K., Castorina, E., & Simonović, M. 2020, ApJ, 892, L27 [Google Scholar]

[23] Bolton, A. S., Burles, S., Koopmans, L. V. E., Treu, T., & Moustakas, L. A. 2006, ApJ, 638, 703 [NASA ADS] [CrossRef] [Google Scholar]

[24] Bolton, A. S., Burles, S., Koopmans, L. V. E., et al. 2008, ApJ, 682, 964 [Google Scholar]

[25] Bonvin, V., Millon, M., Chan, J. H. H., et al. 2019, A&A, 629, A97 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[26] Cappellari, M. 2008, MNRAS, 390, 71 [NASA ADS] [CrossRef] [Google Scholar]

[27] Cappellari, M. 2020, MNRAS, 494, 4819 [NASA ADS] [CrossRef] [Google Scholar]

[28] Chang, Y.-Y., van der Wel, A., Rix, H.-W., et al. 2013, ApJ, 773, 149 [NASA ADS] [CrossRef] [Google Scholar]

[29] Chen, G. C. F., Fassnacht, C. D., Suyu, S. H., et al. 2019, MNRAS, 490, 1743 [NASA ADS] [CrossRef] [Google Scholar]

[30] Ciotti, L., & Bertin, G. 1999, A&A, 352, 447 [NASA ADS] [Google Scholar]

[31] Courbin, F., Bonvin, V., Buckley-Geer, E., et al. 2018, A&A, 609, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[32] Ding, X., Treu, T., Shajib, A. J., et al. 2018, ArXiv e-prints [arXiv:1801.01506] [Google Scholar]

[33] Ding, X., Treu, T., Birrer, S., et al. 2021a, MNRAS, 503, 1096 [Google Scholar]

[34] Ding, X., Treu, T., Birrer, S., et al. 2021b, MNRAS, 501, 269 [Google Scholar]

[35] Dressel, L. 2012, Wide Field Camera, 3, 5 [NASA ADS] [Google Scholar]

[36] Dutton, A. A., & Treu, T. 2014, MNRAS, 438, 3594 [NASA ADS] [CrossRef] [Google Scholar]

[37] Dutton, A. A., Brewer, B. J., Marshall, P. J., et al. 2011, MNRAS, 417, 1621 [NASA ADS] [CrossRef] [Google Scholar]

[38] Falco, E. E., Gorenstein, M. V., & Shapiro, I. I. 1985, ApJ, 289, L1 [Google Scholar]

[39] Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]

[40] Freedman, W. L. 2021, ApJ, 919, 16 [NASA ADS] [CrossRef] [Google Scholar]

[41] Freedman, W. L., Madore, B. F., Hoyt, T., et al. 2020, ApJ, 891, 57 [Google Scholar]

[42] Gavazzi, R., Treu, T., Rhodes, J. D., et al. 2007, ApJ, 667, 176 [Google Scholar]

[43] Gilman, D., Birrer, S., & Treu, T. 2020, A&A, 642, A194 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[44] Golse, G., & Kneib, J. P. 2002, A&A, 390, 821 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[45] Gomer, M., & Williams, L. L. R. 2020, JCAP, 2020, 045 [CrossRef] [Google Scholar]

[46] Gomer, M. R., & Williams, L. L. R. 2018, MNRAS, 475, 1987 [NASA ADS] [CrossRef] [Google Scholar]

[47] Goodman, J., & Weare, J. 2010, Commun. Appl. Math. Comput. Sci., 5, 65 [Google Scholar]

[48] Hao, C. N., Mao, S., Deng, Z. G., Xia, X. Y., & Wu, H. 2006, MNRAS, 370, 1339 [CrossRef] [Google Scholar]

[49] Huang, C. D., Riess, A. G., Yuan, W., et al. 2020, ApJ, 889, 5 [Google Scholar]

[50] Humphrey, P. J., & Buote, D. A. 2010, MNRAS, 403, 2143 [Google Scholar]

[51] Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]

[52] Kassiola, A., & Kovner, I. 1993, ApJ, 417, 450 [Google Scholar]

[53] Keeton, C. R., Kochanek, C. S., & Falco, E. E. 1998, ApJ, 509, 561 [NASA ADS] [CrossRef] [Google Scholar]

[54] Kennedy, J., & Eberhart, R. 1995, Proceedings of ICNN’95 – International Conference on Neural Networks, 4, 1942 [CrossRef] [Google Scholar]

[55] Kenworthy, W. D., Scolnic, D., & Riess, A. 2019, ApJ, 875, 145 [NASA ADS] [CrossRef] [Google Scholar]

[56] Kochanek, C. S. 2020, MNRAS, 493, 1725 [NASA ADS] [CrossRef] [Google Scholar]

[57] Kochanek, C. S. 2021, MNRAS, 501, 5021 [Google Scholar]

[58] Koopmans, L. V. E., Treu, T., Fassnacht, C. D., Blandford, R. D., & Surpi, G. 2003, ApJ, 599, 70 [NASA ADS] [CrossRef] [Google Scholar]

[59] Koopmans, L. V. E., Bolton, A., Treu, T., et al. 2009, ApJ, 703, L51 [NASA ADS] [CrossRef] [Google Scholar]

[60] Kormendy, J., Fisher, D. B., Cornell, M. E., & Bender, R. 2009, ApJS, 182, 216 [Google Scholar]

[61] Larkin, J., Barczys, M., Krabbe, A., et al. 2006, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, eds. I. S. McLean, & M. Iye, 6269, 62691A [Google Scholar]

[62] Mamon, G. A., & Łokas, E. L. 2005, MNRAS, 363, 705 [Google Scholar]

[63] Merritt, D. 1985, AJ, 90, 1027 [Google Scholar]

[64] Millman, K. J., & Aivazis, M. 2011, Comput. Sci. Eng., 13, 9 [NASA ADS] [CrossRef] [Google Scholar]

[65] Millon, M., Courbin, F., Bonvin, V., et al. 2020a, A&A, 640, A105 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[66] Millon, M., Galan, A., Courbin, F., et al. 2020b, A&A, 639, A101 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[67] Morrissey, P., Matuszewski, M., Martin, D. C., et al. 2018, ApJ, 864, 93 [Google Scholar]

[68] Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 [Google Scholar]

[69] Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]

[70] Oliphant, T. E. 2007, Comput. Sci. Eng., 9, 10 [NASA ADS] [CrossRef] [Google Scholar]

[71] Osipkov, L. P. 1979, Pisma v Astronomicheskii Zhurnal, 5, 77 [Google Scholar]

[72] Pesce, D., Braatz, J., Reid, M., et al. 2020, H₀2020: Assessing Uncertainties in Hubble’s Constant Across the Universe, 20 [Google Scholar]

[73] Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[74] Poulin, V., Smith, T. L., Karwal, T., & Kamionkowski, M. 2019, Phys. Rev. Lett., 122, 221301 [Google Scholar]

[75] Riess, A. G., Yuan, W., Macri, L. M., et al. 2022, ApJ, 934, L7 [NASA ADS] [CrossRef] [Google Scholar]

[76] Rusu, C. E., Wong, K. C., Bonvin, V., et al. 2020, MNRAS, 498, 1440 [Google Scholar]

[77] Saha, P., & Williams, L. L. R. 2006, ApJ, 653, 936 [NASA ADS] [CrossRef] [Google Scholar]

[78] Schmidt, T., Treu, T., Birrer, S., et al. 2022, MNRAS, submitted [arXiv:2206.04696] [Google Scholar]

[79] Schneider, P., & Sluse, D. 2013, A&A, 559, A37 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[80] Schneider, P., & Sluse, D. 2014, A&A, 564, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[81] Sérsic, J. L. 1963, Bol. Assoc. Argent. Astron. Plata Argent., 6, 41 [Google Scholar]

[82] Shajib, A. J. 2019, MNRAS, 488, 1387 [NASA ADS] [CrossRef] [Google Scholar]

[83] Shajib, A. J., Birrer, S., Treu, T., et al. 2019, MNRAS, 483, 5649 [Google Scholar]

[84] Shajib, A. J., Birrer, S., Treu, T., et al. 2020, MNRAS, 494, 6072 [Google Scholar]

[85] Shajib, A. J., Treu, T., Birrer, S., & Sonnenfeld, A. 2021, MNRAS, 503, 2380 [Google Scholar]

[86] Shi, Y., & Eberhart, R. 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360), 69 [CrossRef] [Google Scholar]

[87] Sluse, D., Chantry, V., Magain, P., Courbin, F., & Meylan, G. 2012, A&A, 538, A99 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[88] Sonnenfeld, A. 2018, MNRAS, 474, 4648 [Google Scholar]

[89] Sonnenfeld, A., Treu, T., Gavazzi, R., et al. 2012, ApJ, 752, 163 [Google Scholar]

[90] Spingola, C., McKean, J. P., Auger, M. W., et al. 2018, MNRAS, 478, 4816 [NASA ADS] [CrossRef] [Google Scholar]

[91] Suyu, S. H., Marshall, P. J., Blandford, R. D., et al. 2009, ApJ, 691, 277 [Google Scholar]

[92] Suyu, S. H., Marshall, P. J., Auger, M. W., et al. 2010, ApJ, 711, 201 [Google Scholar]

[93] Suyu, S. H., Treu, T., Hilbert, S., et al. 2014, ApJ, 788, L35 [NASA ADS] [CrossRef] [Google Scholar]

[94] Suyu, S. H., Bonvin, V., Courbin, F., et al. 2017, MNRAS, 468, 2590 [Google Scholar]

[95] Treu, T., & Koopmans, L. V. E. 2002a, ApJ, 575, 87 [NASA ADS] [CrossRef] [Google Scholar]

[96] Treu, T., & Koopmans, L. V. E. 2002b, MNRAS, 337, L6 [NASA ADS] [CrossRef] [Google Scholar]

[97] Treu, T., Agnello, A., Baumer, M. A., et al. 2018, MNRAS, 481, 1041 [NASA ADS] [CrossRef] [Google Scholar]

[98] The pandas development team 2020, https://doi.org/10.5281/zenodo.3509134 [Google Scholar]

[99] Unruh, S., Schneider, P., & Sluse, D. 2017, A&A, 601, A77 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[100] van de Ven, G., Mandelbaum, R., & Keeton, C. R. 2009, MNRAS, 398, 607 [NASA ADS] [CrossRef] [Google Scholar]

[101] Van de Vyvere, L., Gomer, M. R., Sluse, D., et al. 2022a, A&A, 659, A127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[102] Van de Vyvere, L., Sluse, D., Gomer, M. R., & Mukherjee, S. 2022b, A&A, 663, A179 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[103] van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Comput. Sci. Eng., 13, 22 [Google Scholar]

[104] Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Meth., 17, 261 [Google Scholar]

[105] Waskom, M. L. 2021, J. Open Sour. Softw., 6, 3021 [NASA ADS] [CrossRef] [Google Scholar]

[106] Weijmans, A.-M., de Zeeuw, P. T., Emsellem, E., et al. 2014, MNRAS, 444, 3340 [Google Scholar]

[107] Wertz, O., Orthen, B., & Schneider, P. 2018, A&A, 617, A140 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[108] McKinney, W. 2010, in Proceedings of the 9th Python in Science Conference, eds. S. van der Walt, & J. Millman, 56 [Google Scholar]

[109] Wong, K. C., Suyu, S. H., Auger, M. W., et al. 2017, MNRAS, 465, 4895 [NASA ADS] [CrossRef] [Google Scholar]

[110] Wong, K. C., Suyu, S. H., Chen, G. C. F., et al. 2020, MNRAS, 498, 1420 [Google Scholar]

[111] Xu, D., Sluse, D., Schneider, P., et al. 2016, MNRAS, 456, 739 [Google Scholar]

[112] Xu, D., Springel, V., Sluse, D., et al. 2017, MNRAS, 469, 1824 [NASA ADS] [CrossRef] [Google Scholar]

[113] Yıldırım, A., Suyu, S. H., & Halkola, A. 2020, MNRAS, 493, 4783 [Google Scholar]

[114] Yıldırım, A., Suyu, S. H., Chen, G. C. F., & Komatsu, E. 2021, A&A, submitted [arXiv:2109.14615] [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[115] Yoo, J., Kochanek, C. S., Falco, E. E., & McLeod, B. A. 2006, ApJ, 642, 22 [NASA ADS] [CrossRef] [Google Scholar]