Open Access
Issue
A&A
Volume 678, October 2023
Article Number A125
Number of page(s) 22
Section Cosmology (including clusters of galaxies)
DOI https://doi.org/10.1051/0004-6361/202346510
Published online 13 October 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

Modern cosmological models show that gravitational collapse drives cosmic structure formation through a hierarchical assembly process in which objects merge into larger and larger structures. Clusters of galaxies sit at the endpoint of this process, so the formation and evolution of the cluster population directly traces the growth of cosmic structure over time. This information can be used to constrain cosmological models. For example, the number of clusters as a function of mass and redshift is highly sensitive to the underlying cosmological parameters (e.g., White & Rees 1978; Perrenod 1980; Voit 2005; Allen et al. 2011).

The matter content of clusters – composed of dark matter (DM; 85%), ionised hot gas in the intracluster medium (12%), and stars (∼3%) – reflects that of the Universe. Clusters are typically detected through their baryonic components. Studies based on samples from X-ray and Sunyaev-Zeldovich (SZ) surveys have been very successful at providing cosmological constraints (e.g., Pacaud et al. 2006; Vikhlinin et al. 2009; Hasselfield et al. 2013; Bocquet et al. 2019; Salvati et al. 2022), and increasingly wide-field optical surveys have become competitive in recent years (e.g., Rozo et al. 2010; Hamana et al. 2015; Lesci et al. 2022). However, leveraging the data from these surveys requires linking the baryonic observables (optical richness, X-ray luminosity, and SZ flux) to the underlying mass. Today, survey sample sizes are sufficiently large that the dominant uncertainties in cosmological parameter estimation with clusters lie in systematic effects in the mass estimates and the selection function (e.g., Pratt et al. 2019, and references therein).

The vast amount of matter contained in clusters bends the light of background galaxies. This coherent deflection of the path of light from background sources by an intervening mass is termed gravitational lensing (e.g., Schneider et al. 1992; Pyne & Birkinshaw 1993). At large scales, the distortion in the shapes of the background galaxies is detectable only statistically, and is consequently termed weak gravitational lensing. The image distortions due to the weak lensing effect can be characterised by the shear (a warping of the background source images) and the convergence (a magnification effect on the same). Both shear and convergence can provide insights into the statistical properties of the weak lensing field, and it has been shown that they contain precisely the same information (e.g., Schneider et al. 2002; Pires et al. 2020). The weak lensing effect is directly sensitive to the total projected mass along the line of sight (LOS). As such, it is an attractive method for cluster detection, potentially paving the way towards true mass-selected samples. However, the sensitivity of cluster detection through the weak lensing signal depends critically on the number of background sources that are available to be lensed by the intervening matter. Only recently have deep, wide-field surveys yielded the background source densities needed to detect substantial numbers of objects.

A number of methods have been developed to detect galaxy clusters through their weak lensing shear or convergence signal in optical imaging data. Application of a simple Gaussian filter to the convergence, combined with thresholding, was widely used in early optical surveys (e.g., White et al. 2002; Miyazaki et al. 2002, 2007; Hamana et al. 2004; Tang & Fan 2005; Gavazzi & Soucail 2006; Fan et al. 2010; Shan et al. 2012, 2018). Such methods demonstrated that it was possible to detect clusters through their weak lensing signal, while motivating the development of new, more efficient, approaches. In the subsequent development of optimal filtering techniques, the filter kernel was adapted to include information on the shape of the expected halo profile, while excluding the shape noise and the contribution of large-scale structure (LSS) to the noise budget (e.g., Hennawi & Spergel 2005; Maturi et al. 2005; Wittman et al. 2006).

The widely used aperture mass (AM) technique, introduced by Schneider (1996), consists of convolving the lensing signal with a filter function of a specific scale. A number of new filters have since been developed to maximise the effectiveness of the AM method; they have been tested on simulations and applied to various optical surveys (e.g., Schneider 1996; Schneider et al. 1998; Jarvis et al. 2004; Schirmer et al. 2004, 2007; Hetterscheidt et al. 2005; Hennawi & Spergel 2005; Maturi et al. 2005, 2007; Pace et al. 2007; Dietrich & Hartlap 2010; Hamana et al. 2012, 2020; Lin et al. 2016; Miyazaki et al. 2018; Oguri et al. 2021).

The filter function is a key component of the AM method and must be designed to obtain the optimal signal-to-noise ratio (S/N) at a given scale. A defining characteristic of the above approaches is that they all operate on a single scale. As such, their detection efficiency is highly dependent on the filter design and on its relation to the size of the structures we want to detect. While some studies have proposed using multi-scale wavelet filters to de-noise the convergence map (Starck et al. 2006; Lanusse et al. 2016), they were not optimised for cluster detection because they do not take the LSS contribution into account. Nevertheless, such multi-scale de-noising techniques have been shown to be a promising approach for cluster detection (Leonard et al. 2015).

Motivated by the potential of current and upcoming deep wide-field optical surveys with sufficient background source densities, such as the Hyper Supreme-Cam (HSC) survey (Aihara et al. 2018), the Legacy Survey of Space and Time (LSST1; Closson Ferguson et al. 2009; Ivezić et al. 2019), Euclid2 (Laureijs et al. 2011), and the Roman Space Telescope3 (formerly WFIRST; Spergel et al. 2015), we revisit here the question of cluster detection through the weak lensing effect. We perform a quantitative comparison of existing single-scale detection methods, including a complete analysis of the filters. We introduce a new multi-scale detection approach based on the wavelet transform applied to the convergence. We chose to focus on the convergence because this quantity explicitly traces the total matter distribution integrated along the LOS and is computationally less expensive to analyse, making it ideal for application to upcoming large-scale survey data. We quantify the performance of our new multi-scale approach by applying it to the DUSTGRAIN-pathfinder simulations detailed in Giocoli et al. (2018), which feature source densities similar to those expected from the Euclid survey. We find that the new multi-scale method operating on the convergence is faster and more efficient at detecting clusters than currently used single-scale methods operating on the shear.

This paper is organised as follows. In Sect. 2 we summarise the key aspects of gravitational lensing. The mock dataset DUSTGRAIN-pathfinder simulations are described in Sect. 3. Section 4 introduces the AM formalism and several commonly used AM filters. A description of the wavelet formalism is also provided. Section 5 details the detection procedure that we use to compare the different filters. An analysis of the different options in the implementation of the detection algorithm is provided in Sect. 6. In Sect. 7 we provide the details of the matching procedure that we developed to allow for a fair comparison of the methods. Finally, the performance of the detection methods is evaluated in Sect. 8, and we conclude in Sect. 9.

2. Weak gravitational lensing theory

2.1. Weak lensing theory

The gravitational field of massive objects affects the path of light in their vicinity. Thus, the light from background galaxies is deflected as it travels towards us and their images appear distorted. These distortions, or (reduced) shear, are a direct observable and are an imprint of the intervening large-scale matter distribution. Different structures, such as clusters of galaxies, filaments, or even individual galaxies, can act as lenses and create this warping effect.

We summarise here the gravitational lens theory that is sufficient for the treatment of lensing by galaxy clusters (see e.g., Bartelmann & Schneider 2001). We consider a lens at angular position θ and at distance Dl from the observer. Its surface mass density, Σ(θ), is integrated from its 3D mass density, ρ(θ, z), along the LOS:

Σ ( θ ) = 0 + ρ ( θ , z ) d z . $$ \begin{aligned} \Sigma (\boldsymbol{\theta }) = \int _{0}^{+\infty } \rho (\boldsymbol{\theta },z) \mathrm{d}z . \end{aligned} $$(1)

From the surface mass density, we can define the lensing potential, ψ(θ), according to the position of the background galaxy sources with distance Ds from the observer and distance Dls from the lens:

ψ ( θ ) = 4 G c 2 D l D ls D s R 2 Σ ( θ ) ln ( | θ θ | ) d 2 θ . $$ \begin{aligned} \psi (\boldsymbol{\theta }) = \frac{4 G}{c^2}\frac{D_{\rm l} D_{\rm ls}}{D_{\rm s}} \int _{ \mathbb {R}^2}^ \Sigma ({\boldsymbol{\theta }}^\prime ) \ln {( |\boldsymbol{\theta }^{\prime }-\boldsymbol{\theta } |)}\, \mathrm{d}^2{\boldsymbol{\theta}}^{\prime }. \end{aligned} $$(2)

When light rays travel close to a lens, they are bent, leading to image distortions and, potentially, to multiple images. These images appear with a deflection angle, α(θ), induced by the lensing potential, ψ(θ):

α ( θ ) = ψ ( θ ) . $$ \begin{aligned} \alpha (\boldsymbol{\theta }) = \nabla \psi (\boldsymbol{\theta }). \end{aligned} $$(3)

Here α(θ) is the difference between the angular position, β, where the images would be without the lens and the observed position, θ. This is summarised by the lens equation:

α ( θ ) = θ β . $$ \begin{aligned} \alpha (\boldsymbol{\theta }) = \boldsymbol{\theta } - \boldsymbol{\beta }. \end{aligned} $$(4)

From Eqs. (2)–(4), the deviation, β, of the observed image with respect to the undistorted image can be derived:

A β θ = δ ij 2 ψ ( θ ) θ i θ j = ( 1 κ γ 1 γ 2 γ 2 1 κ + γ 1 ) . $$ \begin{aligned} A \equiv \frac{\partial \boldsymbol{\beta }}{\partial \boldsymbol{\theta }} = \delta _{ij} - \frac{\partial ^2 \psi (\boldsymbol{\theta })}{\partial \theta _i \partial \theta _j} = \begin{pmatrix} 1 - \kappa - \gamma _1&- \gamma _2 \\ - \gamma _2&1 - \kappa + \gamma _1 \end{pmatrix}. \end{aligned} $$(5)

Here

γ 1 = 1 2 ( 1 2 2 2 ) ψ and $$ \begin{aligned}&\gamma _1 = \frac{1}{2} ( \partial _1^2 - \partial _2^2) \psi \ \ \mathrm{and}\end{aligned} $$(6)

γ 2 = 1 2 ψ $$ \begin{aligned}&\gamma _2 = \partial _1\partial _2\psi \end{aligned} $$(7)

correspond to the two components of the shear γ, and

κ = 1 2 ( 1 2 + 2 2 ) ψ $$ \begin{aligned} \kappa = \frac{1}{2} ( \partial _1^2 + \partial _2^2) \psi \end{aligned} $$(8)

corresponds to the convergence that is the shape contraction or dilation of the image.

From the above, we can express the convergence as a function of the critical value of the surface mass density, Σcrit:

κ ( θ ) = Σ ( θ ) Σ crit , $$ \begin{aligned} \kappa (\boldsymbol{\theta }) = \frac{\Sigma (\boldsymbol{\theta })}{\Sigma _{\rm crit}}, \end{aligned} $$(9)

with

Σ crit = c 2 4 π G D s D l D ls . $$ \begin{aligned} \Sigma _{\rm crit} = \frac{c^2}{4\pi G} \frac{D_{\rm s}}{D_{\rm l} D_{\rm ls}}. \end{aligned} $$(10)

Thus, the convergence is a tracer of the total matter distribution integrated along the LOS, weighted by the redshift-dependent factor 1/Σcrit.

The direct observable is the reduced shear, g, that is derived from the ellipticities of the observed background galaxies. The reduced shear, g, is defined as follows:

g = γ 1 κ . $$ \begin{aligned} g = \frac{\gamma }{1 - \kappa } .\end{aligned} $$(11)

In the weak lensing regime (κ ≪ 1), it approximates the shear, γ.

2.2. Application

In practice, we observe a distorted image, from which we measure the galaxy ellipticities and estimate the shear components, γ1 and γ2. Kaiser & Squires (1993) introduced the mass inversion technique, which involves the computation of the convergence map κ from the measured shear field.

We can consider a complex notation to represent the shear field, γ = γ1 + iγ2, and the convergence field, κ = κE + iκB, with κE and κB, called E- and B-modes. Then, taking the Fourier transform of Eqs. (6)–(8), we obtain

γ ̂ = P ̂ κ ̂ , $$ \begin{aligned}&\hat{\gamma } = \hat{P}\, \hat{\kappa },\end{aligned} $$(12)

with : P ̂ = P ̂ 1 + i P ̂ 2 and , $$ \begin{aligned}&{\mathrm{with\!:}\,} \hat{P} = \hat{P}_1 + i\,\hat{P}_2 \mathrm {\,\,and\,}, \end{aligned} $$(13)

P ̂ 1 = k 1 2 k 2 2 k 1 2 + k 2 2 , $$ \begin{aligned}&\hat{P}_1 = \frac{ k_1^2 - k_2^2}{k_1^2 + k_2^2},\end{aligned} $$(14)

P ̂ 2 = 2 k 1 2 k 2 2 k 1 2 + k 2 2 , $$ \begin{aligned}&\hat{P}_2 = \frac{ 2\, k_1^2 k_2^2}{ k_1^2 + k_2^2}, \end{aligned} $$(15)

where the hat symbol refers to the Fourier transform, and ki is the wave number at the angular position, θi.

Considering the conjugate P ̂ = P ̂ 1 i P ̂ 2 $ \hat{P}* = \hat{P}_1 - i \hat{P}_2 $, we can reconstruct κE and κB from the complex shear γ, obtaining

κ ̂ E = P ̂ 1 γ ̂ 1 + P ̂ 2 γ ̂ 2 $$ \begin{aligned}&\hat{\kappa }_E = \hat{P}_1 \hat{\gamma }_1 + \hat{P}_2 \hat{\gamma }_2 \end{aligned} $$(16)

κ ̂ B = P ̂ 2 γ ̂ 1 + P ̂ 1 γ ̂ 2 . $$ \begin{aligned}&\hat{\kappa }_B = -\hat{P}_2 \hat{\gamma }_1 + \hat{P}_1 \hat{\gamma }_2. \end{aligned} $$(17)

Given that ψ is a scalar potential, it can be shown that weak lensing does not in principle produce B-modes. Thus, B-modes can be used to estimate the level of the noise in the data.

Figure 1 shows an example of simulated shear and convergence maps without shape noise derived from the DUSTGRAIN-pathfinder simulation (described in Sect. 3) covering a field of 5° ×5°. The left and middle panels correspond to the two components of the shear, γ1 and γ2 and the right panel to the corresponding E-mode convergence map, κE. Although both the shear and the convergence can be used for cluster detection, the convergence is easier and computationally less expensive to analyse because it is a scalar field that is proportional to the projected matter distribution.

thumbnail Fig. 1.

Simulated shear maps covering a field of 5° ×5° (left and middle panels) and the corresponding E-mode convergence map (right). The E-mode convergence map directly traces the projected matter distribution, the overdensities appearing as bright structures.

3. Weak lensing simulations

3.1. The DUSTGRAIN N-body simulations

We make use of a suite of cosmological N-body simulations called The DUSTGRAIN-pathfinder (see Giocoli et al. 2018, for a detailed description). This DM-only simulation traces the collisionless evolution of 7833 DM particles with a mass mCDM = 8.1 × 1010h−1 M, contained within a periodic cosmological box of side 750 h−1 Mpc. In the present work, we use a subset of the full DUSTGRAIN-pathfinder runs consisting of 256 realisations sharing the same standard cosmological parameters in agreement with Planck Collaboration XIII (2016): ΩM = ΩCDM + Ωb + Ων = 0.31345, Ωb = 0.0481, ΩΛ = 0.68655, h = 0.6731, and σ8 = 0.847. The different LOS realisations were obtained by randomising the stacked comoving cosmological boxes through combinations of the following procedures: (i) changing the sign of the Cartesian coordinates, (ii) redefining the position of the observer, and (iii) modifying the order of the axes in the coordinate system. By construction, these variations preserve the clustering properties of the particle distribution at the scale of the comoving simulation snapshot.

3.2. Mock dark matter halo catalogues

The mock DM halo catalogues were generated by identifying the DM haloes in the DUSTGRAIN simulation through a friends-of-friends algorithm (Davis et al. 1985) with linking distance λ = 0.16 × d, where d is the mean separation distance between particles. The SUBFIND algorithm (Springel et al. 2001) was then used to evaluate the standard parameters of each friends-of-friends-identified halo: the redshift, z, and the virial mass, M200c, and radius R200c, corresponding to the mass and radius of a spherical region around the fiducial centre of each halo enclosing 200 times the critical density of the Universe. Thus, for each of the 256 realisations, the catalogue is composed of the positions (i.e. the right ascension and the declination) of the identified DM haloes, their estimated redshift, zl, and their virial mass, M200c, and radius, R200c. This DM halo catalogue is used in our study for the matching procedure to quantify the purity and completeness of each selected sample.

3.3. Mock galaxy catalogues

The particles stored in 21 different snapshots were used to construct continuous past light cones from z = 0 to z = 4 using the MAPsim pipeline (Giocoli et al. 2015, 2017). The routine extracts the positions of each particle to recreate the past light cone and at the same time, the particles are binned to build 27 different lens planes to recompose the projected matter density distribution. Then, the shear of each galaxy is computed by projection, using the Born approximation, which assumes unperturbed light paths to integrate the lensing distortions (Bartelmann & Schneider 2001). The redshift distribution of the galaxies, n(z), was built to follow a realistic distribution approaching the one expected for the Euclid wide survey (e.g., Cropper et al. 2013). Each realisation contains around 3 million galaxies in a field of view of 5° ×5°, with a galaxy distribution extending up to z = 4. This leads to a galaxy density of about ng = 30 gal. arcmin−2, appropriate for the Euclid wide survey.

Noise was added to the shear to mimic realistic surveys. Uncertainties in the shear (referred to as ‘shape noise’) arise from a combination of the unavoidable intrinsic shape of the galaxies (referred to as the ‘intrinsic shape noise’), and measurement errors that include, among other factors, uncertainties in the galaxy shape measurement and the point spread function correction. The intrinsic shape of the galaxies being the dominant component and the galaxies being randomly distributed, the shape noise can be modelled as an additive noise. This can be well approximated by a Gaussian distribution with a mean of μ = 0 and a standard deviation of σϵ = 0.26 (e.g., Leauthaud et al. 2007; Schrabback et al. 2015, 2018). For each galaxy, the shape noise was therefore included in the catalogue by adding Gaussian noise to the two components of the shear. For each of the 256 realisations, the galaxy catalogue is composed of: (i) the positions (i.e. the right ascension and the declination) of the galaxies, (ii) the two components of the shear, and (iii) the redshift. These noisy galaxy catalogues are then used as the inputs to evaluate the detection algorithms described below.

4. Filters

In this section we present several different filters that have been used for weak lensing cluster detection in previous studies, and we compare them to the wavelet filters that we use in our new multi-scale detection approach. A complete analysis and comparison of the different filters both in real and Fourier space is also provided.

Although used since the very first weak lensing detection algorithms (e.g., White et al. 2002; Miyazaki et al. 2002, 2007; Hamana et al. 2004; Tang & Fan 2005; Gavazzi & Soucail 2006; Fan et al. 2010; Shan et al. 2012, 2018), we excluded the Gaussian filter from this analysis because it behaves as a low-pass filter and is thus not optimal for cluster detection. Although low-pass filters appear to be natural tools for reducing the shape noise that dominates at high-frequencies, they are not adapted to diminish the low-frequency signal coming from LSS, which causes many spurious peaks (see e.g., Maturi et al. 2005). Since the use of such low-pass filters is sub-optimal for the current application, we focus our analysis on pass-band filters that are better able to target the signal around a given scale.

4.1. The aperture mass formalism

The AM method (Schneider 1996) is commonly used to reconstruct maps of the projected matter distribution at a given scale. The AM map can be evaluated at a position θo by convolving the convergence field, κ, with a filter, U, such that

M ap ( θ o ) = R 2 κ ( θ ) U ( | θ θ o | ) d 2 θ , $$ \begin{aligned} M_{\rm ap}(\boldsymbol{\theta _{\rm o}}) = \int _{\mathbb{R} ^2} \kappa (\boldsymbol{\theta })\, U(|\, \boldsymbol{\theta } \, -\, \boldsymbol{\theta _{\rm o}} \,|)\, \mathrm{d}^2 \boldsymbol{\theta }, \end{aligned} $$(18)

where U must be compensated; that is,

0 θ ap θ U ( θ ) d θ = 0 $$ \begin{aligned} \int _{0}^{\theta _{\rm ap}} \theta ^{\prime } \, U(\theta ^{\prime })\,\mathrm{d}\theta ^{\prime } = 0 \end{aligned} $$(19)

must be fulfilled within the aperture of radius θap. Alternatively, the AM map can also be computed from the tangential component of the shear, γt, which is a quantity that can directly be computed from γ1 and γ2:

γ t = γ 1 cos ( ϕ ) + γ 2 sin ( ϕ ) , $$ \begin{aligned} \gamma _{\rm t} = \gamma _1 \cos (\mathrm \phi ) + \gamma _2 \sin (\mathrm \phi ), \end{aligned} $$(20)

where ϕ is the polar angle ϕ(θ, θ0) relative to the centre of the aperture, θ0. The AM map at position θo can then be obtained by convolving the tangential shear with a filter, Q, such that

M ap ( θ o ) = R 2 γ t ( θ ) Q ( | θ θ o | ) d 2 θ , $$ \begin{aligned} M_{\rm ap}({\boldsymbol \theta _o})= \int _{\mathbb{R} ^2} \gamma _t(\boldsymbol{\theta })\, Q(|\, \boldsymbol{\theta } - {\boldsymbol \theta _{\rm o}} \,|)\, \mathrm{d}^2 {\boldsymbol \theta }, \end{aligned} $$(21)

the filters Q and U being linked by the relations (Schneider & Bartelmann 1997)

Q ( θ ) = 2 θ 2 0 θ θ U ( θ ) d θ U ( θ ) $$ \begin{aligned} Q(\theta ) = \frac{2}{\theta ^2} \int _{0}^{\theta } \theta ^{\prime } U(\theta ^{\prime })\,\mathrm{d}\theta ^{\prime } - U(\theta ) \end{aligned} $$(22)

and

U ( θ ) = 2 θ + Q ( θ ) θ d θ Q ( θ ) . $$ \begin{aligned} U(\theta ) = 2 \int _{\theta }^{ +\infty } \frac{Q(\theta ^{\prime })}{\theta ^{\prime }} \,\mathrm{d}\theta ^{\prime } - Q(\theta ). \end{aligned} $$(23)

The performance of the AM technique at finding clusters on a map depends on the exact choice of filter, and on its capacity to isolate the lensing signal from the shape noise and the LSS component. The optimal filters U or Q should be designed to fulfil two properties. In real-space, the filters should be local, that is, decrease smoothly to zero within a finite radius. Ideally, they should fit the shape of the structures of interest. Moreover, they should be local in Fourier space, focusing on a particular angular scale. Ideally, their size should match as closely as possible the size of the structures of interest.

A number of functions or families of functions have been proposed (e.g., Schneider 1996; Schirmer 2004; Schirmer et al. 2007; Jarvis et al. 2004; Hamana et al. 2012; Miyazaki et al. 2018). In the present study, we consider the following four well-known AM filters.

The first is the original AM family of filters defined in the paper by Schneider (1996, hereafter S96):

U S 96 ( θ ) = { 1 x [ 0 , ν 1 R ] , 1 1 c [ ν 1 R ( x ν 1 R ) 2 + ( ν 1 R ) 2 ] x [ ν 1 R , ν 2 R ] , b R 3 ( R x ) 2 ( x α R ) x [ ν 2 R , R ] , 0 x > R , $$ \begin{aligned} U_{\rm S96}(\theta ) = \left\{ \begin{array}{ll} 1&{x}\in [0, \nu _1R],\\ \frac{1}{1-c} \big [\frac{\nu _1R}{\sqrt{(x-\nu _1R)^2+(\nu _1R)^2}} \big ]&{x} \in [\nu _1R,\ \nu _2R], \\ \frac{b}{R^3} (R-x)^2\,(x-\alpha R)&{x} \in [\nu _2R, R], \\ 0&{x} > R, \end{array} \right. \end{aligned} $$(24)

where parameters α, b, and c ensure that the filter is compensated and continuous in real space. The parameters ν1, ν2, and R define the angular extent of the filter and must be adapted to the size and shape of the clusters we want to detect. We implemented the different parameterisations presented in Schneider (1996) and we use ν1 = 0.1, ν2 = 0.9, R = 9′, α = 0.8531, b = −329.8, and c = 0.2415, which we found to maximise the purity and completeness. The corresponding QS96 can be calculated analytically with Eq. (22).

The second is the filter developed by Schirmer (2004, hereafter TANH):

Q TANH ( θ ) = 1 1 + e 6 150 θ R + e 47 + 50 θ R tanh ( θ x c R ) θ x c R , $$ \begin{aligned} Q_{\mathrm{TANH}}(\theta ) = \frac{1}{1+ {e}^{6{-}150\,\frac{\theta }{R}}+ {e}^{-47+50\,\frac{\theta }{R}}} \frac{\tanh \left(\frac{\theta }{x_{\rm c} R}\right)}{\frac{\theta }{x_{\rm c}R}}, \end{aligned} $$(25)

where R is the truncation radius and xc (dimensionless) defines the width of the filter. This filter was directly derived from the shape of the tangential shear of the Navarro-Frenk-White (NFW) profile (Navarro et al. 1996). Following Schirmer et al. (2007), we used xc = 0.1. We fixed the truncation radius R = 7′ as the optimal choice to maximise the number of detections, following the procedure described by Hetterscheidt et al. (2005). The corresponding UTANH can be computed numerically using Eq. (23).

The third is the filter proposed by Jarvis et al. (2004, hereafter J04):

U J 04 ( θ ) = 1 2 π σ 2 ( 1 θ 2 2 σ 2 ) exp ( θ 2 2 σ 2 ) , $$ \begin{aligned}&U_{\rm J04}(\mathrm \theta ) = \frac{1}{\mathrm{2\pi \sigma ^2}}\, \left(1-\frac {\theta ^2}{2 \sigma ^2}\right)\, \exp {\left(-\frac{\theta ^2}{2 \sigma ^2}\right)}, \end{aligned} $$(26)

Q J 04 ( θ ) = 1 2 π σ 2 θ 2 2 σ 2 exp ( θ 2 2 σ 2 ) , $$ \begin{aligned}&Q_{\rm J04}(\mathrm \theta ) = \frac{1}{2\pi \sigma ^2}\frac{\theta ^2}{2 \sigma ^2} \exp {\left(-\frac{\theta ^2}{2 \sigma ^2}\right)}, \end{aligned} $$(27)

where UJ04 was defined following Van Waerbeke (1998) as the second derivative of a Gaussian function with standard deviation σ. The UJ04 is also known as the Mexican hat wavelet filter. In the following mass aperture computation, we use σ = 4′, corresponding to an apparent angular radius of about 4′, and a truncation radius R = 20′, as suggested by Leonard et al. (2012) to minimise oscillations and high frequency mode contamination.

The fourth is the filter introduced by Hamana et al. (2012) and used by Miyazaki et al. (2018, hereafter M18), which is defined as a truncated Gaussian filter:

U M 18 ( θ ) = 1 π θ s 2 exp ( θ 2 θ s 2 ) U o , $$ \begin{aligned} U_{\rm M18}(\theta ) = \frac{1}{\pi \,\theta _{\rm s}^2}\,\exp {\left(\frac{\theta ^2}{\theta _{\rm s}^2}\right)}\,-\, U_{\rm o}, \end{aligned} $$(28)

where U0 is a parameter to ensure that the filter U is compensated and

Q M 18 ( θ ) = 1 π θ 2 [ 1 + ( 1 θ θ s ) 2 ] exp ( θ 2 θ s 2 ) , $$ \begin{aligned} Q_{\rm M18}(\theta ) = \frac{1}{\pi \,\theta ^2}\bigg [1\,+\,\left(1-\frac{\theta }{\theta _{\rm s}}\right)^2\bigg ]\,\exp {\left(\frac{\theta ^2}{\theta _{\rm s}^2}\right)}, \end{aligned} $$(29)

with θs being defined as the angular scale of the aperture. In practice, the choice of θs influences the angular radius of the cluster the filter targets, but the relation is not direct. Following Miyazaki et al. (2018), we use θs = 1.5′, and UM18(θ) = 0 if θ > R, with a truncation radius R = 15′. We fixed this value of R to reduce the impact of the truncation on the filter behaviour as it derives from a Gaussian filter. Indeed, the filter UM18 reaches negative values at UM18(R) and is subsequently forced to 0 by the truncation. This step in the function results in contamination in the signal. As the choice of truncation radius strongly impacts the computation time, there is a clear trade off between the computation time and the filter behaviour.

4.2. The wavelet formalism

The wavelet formalism can also be used to reconstruct maps of the projected matter distribution at a given scale (see e.g., Leonard et al. 2012; Pires et al. 2012). In Leonard et al. (2012), the authors showed that wavelet filter functions at a given scale are formally identical to AM filter functions at that scale. Similar to the AM functions, many different wavelet functions exist, including starlet, Mexican hat, Morlet, and biorthogonal (see e.g., Starck et al. 1998, 2006, for reviews). Many wavelet functions have been specifically designed to fulfil the properties of localisation in real and Fourier spaces, which is a distinct advantage compared to the AM approach.

A wavelet map of the convergence κ at position θ and scale a can be computed as

W a ( θ ) = 1 a κ ( t ) Ψ a ( t θ ) d t , $$ \begin{aligned} W_{a}(\theta ) = \frac{1}{\sqrt{a}} \int \kappa (t)\, \Psi _{a}(\,t-\theta \,) \,\mathrm{d}t, \end{aligned} $$(30)

where Ψa is the wavelet filter of scale a. All wavelet filters are defined such that they respect the condition of zero mean. Therefore, they are compensated by definition.

Another advantage of the wavelet formalism is that it can decompose a given image into several complementary scale components. This is possible using the wavelet transform, for which fast algorithms exist. For our study, we use the isotropic un-decimated wavelet transform, also called the starlet transform (e.g., Starck et al. 1998, 2006), which is able to simultaneously compute several wavelet maps on dyadic scales. The starlet functions are isotropic, which makes them well suited to extracting galaxy clusters, which appear roughly circular in the convergence maps. As such functions decompose an image into several complementary scales (see Sect. 4.3), they are ideal for undertaking a multi-scale analysis and for targeting structures of different angular sizes. The starlet transform decomposes the convergence as follows:

κ ( θ ) = C J ( θ ) + i = 1 J W i ( θ ) , $$ \begin{aligned} \kappa (\theta ) = C_J(\theta ) + \sum _{i=1}^J W_{i}(\theta ) ,\end{aligned} $$(31)

where J is the number of scales of the decomposition, CJ is the corresponding smoothed version of the convergence κ, and Wi are the wavelet maps targeting clusters with an apparent angular radius of 2i − 1 times the size of the pixel. As a consequence, the wavelet scales are fixed by the pixel size, and, by definition, increase as a power of two.

In the starlet transform, the wavelet functions are defined as the difference between two B3-spline functions at different resolutions. Their application to the convergence map was shown to be equivalent to applying the following AM filter (Leonard et al. 2012):

U W i ( u ) = 1 9 [ 93 | u | 3 64 ( | 1 2 u | 3 + ( 1 2 + u ) 3 ) + 18 ( | 1 u | 3 + | 1 + u | 3 ) 1 2 ( | 2 u | 3 + | 2 + u | 3 ) ] , $$ \begin{aligned}&U_{W_i}(u) = \frac{1}{9}\, \bigg [\,93\, \left|u\right|^3\, - \,64\, \left(\,\left|\frac{1}{2}\,-\,u\,\right|^3\,+\,\left(\,\frac{1}{2}\,+\,u\,\right)^3\right) \nonumber \\&+ 18\,\left(\, \left|\,1\,-\,u\,\right|^3\, +\, \left|\,1+ \,u\,\right|^3\,\right) - \frac{1}{2}\,\left(\,\left|\,2-\,u\,\right|^3+\left|\, 2+\,u\,\right|^3\right) \bigg ], \end{aligned} $$(32)

where

u = θ 2 i x , $$ \begin{aligned} u = \frac{\theta }{2^i x}, \end{aligned} $$(33)

and x is the pixel size. The new multi-scale detection method that we introduce in Sect. 5.3 takes advantage of this multi-scale decomposition, the wavelet transform being able rapidly to decompose the convergence into different scales.

4.3. Filter analysis

The performance of each detection method is closely linked to the capacity of the filter functions to reduce the shape noise and the contribution from LSS, while simultaneously minimising the signal loss. In the following, we analyse the properties of the filters described above both in real and Fourier spaces to help in the interpretation of the results presented in Sect. 8.

In Fig. 2 we compare the different filter functions Q and U, defined in Sects. 4.1 and 4.2, in real space. This comparison underlines the differences in filter shape. In particular, we focus our attention on their compensated and local properties. The dotted blue curves correspond to the AM functions defined in Schneider (1996). The filter US96 is compensated and local by design. The dash-dotted red curves correspond to the AM functions defined in Schirmer (2004). The filter UTANH drops rapidly to zero, but it is not local and is only compensated at infinity. The localisation is imposed by a truncation radius at R = 7′. The dash-dotted purple curves correspond to the AM functions defined in Jarvis et al. (2004). This filter UJ04 is also not local, and is also only compensated at infinity since it tends to zero when θ tends to +∞. In practice, the filter is truncated at R = 20′ to have a local support. The dashed green curves correspond to the AM functions defined in Miyazaki et al. (2018). In the definition of UM18, U0 ensures that the filter is compensated within an aperture of radius R. However, the QM18 function is non-local, which can introduce errors when truncating the aperture. The solid orange curves correspond to the wavelet filter function UW4 defined in Eq. (32) for i = 4 (hereafter W4). The wavelet filter is shown to be compensated within an aperture radius equal to two times the filter radius.

thumbnail Fig. 2.

Comparison of different filter functions, U (left) and Q (right), for AM map reconstruction. The figure shows in blue (dotted lines) the AM filter functions defined by Schneider (1996, S96) with R = 9′; in red (dash-dotted lines) the AM filter functions defined in Schirmer (2004, TANH) with R = 7′; in purple (dash-dotted lines) the AM filter functions defined in Jarvis et al. (2004, J04) with σ = 4′; and in green (dashed lines) the AM filter functions defined by Miyazaki et al. (2018, M18). The filter UM18 is also a compensated filter thanks to the U0 term, but due to the high truncation radius we used, this is not distinctly visible. Finally, the AM filter functions corresponding to the wavelet filter functions at scale i = 4 are displayed in orange (solid lines). All the filters are normalised to have a maximum amplitude of 1.

It is also instructive to study the representation of the filter functions in Fourier space as this allows us to fully understand how the filters are designed to locate structures. In Fourier space, an optimal band-pass filter should be local, in order to select only a specific range of frequencies around the scale of interest, and thus reduce the noise and the contribution from LSS. The design of AM filters can be delicate, and some such filters suffer from oscillations in Fourier space owing to truncations applied in real space that deteriorate the band-pass ability of the filter (see e.g., Leonard et al. 2012, for more details). In contrast, many wavelet functions have been specifically designed to fulfil the properties of localisation in real and Fourier space.

To analyse the filters precisely in Fourier space, we studied their impulse response. For this purpose, we simulated a null convergence map with a single peak at the centre. Then we passed this convergence map through the four AM filters and each of the wavelet functions, and computed the corresponding power spectra in Fourier space. The resulting power spectra are compared in Fig. 3. This representation highlights each characteristic filter radius, corresponding to the characteristic size of the structures the filter targets. The 3′ radius of the S96 filter falls between that of the W3 and W4 wavelet filters. The TANH filter, with a radius of about 2′, matches that of the W3 wavelet filter. The J04 filter, which has a radius of about 4′, matches that of the W4 wavelet filter. Finally, the M18 filter can be compared to a combination of the W3 and W4 wavelet filters. In the Fourier domain, we can clearly see the band-pass behaviour of the different functions. In particular, the S96 and TANH filters show oscillatory behaviour, which can be explained by their design.

thumbnail Fig. 3.

Fourier space comparison of several AM filters and wavelet filters, expressed as a function of wave number k in arcmin−1 and filter radius in arcmin. The dotted blue line (upper panel) represents the S96 filter, the dash-dotted red and purple lines (middle panels) represent the TANH and J04 filters, and the dashed green line (lower panel) corresponds to the M18 filter. The solid lines show the different wavelet filters: the teal, green, orange, and red light lines correspond to the W2, W3, W4, and W5 wavelet filters with radii of 1 . 17 $ 1{{\overset{\prime}{.}}}17 $, 2 . 32 $ 2{{\overset{\prime}{.}}}32 $, 4 . 68 $ 4{{\overset{\prime}{.}}}68 $, and 9 . 37 , $ 9{{\overset{\prime}{.}}}37, $ respectively. For comparison purposes, the J04 and M18 filters are forced to the same amplitude as the W4 function to illustrate their behaviour in terms of filter radius.

This Fourier space representation further shows that the selected scales of the wavelet filter appear to be well defined. In the wavelet formalism, the wavelet filter functions have complementary dyadic scales, defined by the pixel resolution as explained in Sect. 4.2. Their comparison with the four AM filters highlights that a wavelet filter at a given scale can directly be compared to an AM filter. However, unlike AM filters, which focus essentially on one scale, the wavelet formalism allows us to extract signal at more than one complementary scale. Generally, the standard AM methods target clusters located at relatively low redshift with a large apparent angular size. Thus, typical AM filter radii are taken between 1′ and 5′ (e.g., Hamana et al. 2012) to maximise the S/N for clusters with a similar angular extent. In contrast, the lensing signal from clusters with different apparent angular sizes can easily be targeted using the wavelet formalism.

5. Detection algorithms

In this section we discuss the implementation of the detection approach we used to compare the filters. The detection algorithms are all based on the same basic principles. The lensing signal is first convolved with a filter function of a specific scale (described in the previous section) to reconstruct E- and B-mode AM maps. Thresholding is then applied to the E-mode AM map to locate the overdensity peaks corresponding to clusters.

5.1. Binning and filtering

In the following, we describe the steps used to produce the E- and B-mode AM maps on which the detection is performed. As explained in Sect. 3.3, the noisy galaxy catalogues are the basic inputs for the detection algorithms. These contain the shear at the discrete positions of each of the galaxies.

At this stage the convolution by the filter function can be undertaken in real space, on the shear directly at the position of the galaxies, or in Fourier space, by binning the shear on a regular grid to build a shear map. A third possibility is to reconstruct the convergence from the shear maps and then to perform the filter convolution on the resulting convergence map. A comparison of these approaches is provided in Sects. 6.1 and 6.2, where we show that we obtain statistically similar results irrespective of approach. For the following, we use the third approach because it is computationally faster, in particular for wavelet filters, with the use of the wavelet transform.

We first binned the observed galaxy ellipticities on a regular grid to create what we refer to as the noisy shear maps. In our study, the simulated field being 5° ×5°, we decided to bin the galaxies in a grid of 512 × 512 pixels, yielding a pixel size of 0 . 58 $ 0{{\overset{\prime}{.}}}58 $. On average, with a galaxy density of 30 galaxies per arcmin2, about 10 galaxies fall into each pixel. The standard method for binning the shear on a regular grid consists of simply calculating the average shear per pixel (see e.g., Kaiser & Squires 1993; van Waerbeke 2000; Pires et al. 2020). In this paper, we introduce a new binning strategy to be closer to the real space approach. This consists of summing the shear of all the galaxies that fall into each pixel. Thus, the shear in each pixel is effectively weighted by the number of galaxies in that pixel. This binning strategy is more adapted to weak lensing detection because additional information on the galaxy density is included in each pixel. The resulting shear map can then be normalised by the mean number of galaxies per pixel to facilitate the comparison with other approaches. However, this global normalisation has no impact on the detection because it affects the E- and B-modes in a similar way. The impact of these two different binning approaches is discussed in more detail in Sect. 6.1. The disadvantage of this approach is that the weak lensing signal depends significantly on the number of galaxies per pixel, complicating its interpretation in terms of mass. However, such calculations can be undertaken in a second step, after detection.

We then applied the Kaiser & Squires inversion described in Sect. 2.2 to the noisy shear maps γ1 and γ2, to reconstruct the E- and B-mode convergence maps. Convolution by the different filters was then performed both on the E- and B-mode convergence maps, to produce the corresponding AM maps.

In the following, we refer to maps of the projected matter distribution at a given scale as AM maps. In practice, these can be obtained by applying AM filters or wavelet filters to the shear or to the convergence. The procedure to detect the haloes on these E- and B-mode AM maps using a single scale is identical, regardless of the method that has been used to produce them.

5.2. Single-scale detection procedure

Once the E- and B-mode AM maps at a specific scale are produced, the detection procedure starts by applying a thresholding step. Since the weak lensing effect produces only E-modes, the B-modes are simply due to the noise and can therefore be used to evaluate the level of the noise in the E-mode AM map. As proposed by Miyazaki et al. (2018), we used the maximum pixel value in the B-modes to define the threshold that is then applied to the E-mode AM map. We then only keep the pixels in the E-mode AM map that are greater than this threshold, setting the other pixels to zero.

In their study of the HSC data, Miyazaki et al. (2018) find that this threshold corresponds to 4.7 times the standard deviation of the noise. The standard deviation depends on the filter under consideration and is computed from the B-mode AM map. For the DUSTGRAIN-pathfinder simulation we use here, the equivalent average thresholds for the 256 realisations correspond to 4.09, 4.37, 4.01, 4.35, 4.57, 4.35; and 4.04 times the standard deviation for the S96, TANH, J04, M18, W2, W3, and W4 filters, respectively.

The final detection step is to identify the peaks in each of the thresholded E-mode AM maps. A peak is defined as a pixel whose value is greater than its eight nearest neighbours. The peaks at the border of the map over a width equal to the filter radius are discarded to avoid contamination due to boundary effects introduced by the convolution step. Following this procedure, we obtain a list of detections and associated peak coordinates. While this procedure is sufficient for single-scale methods, it needs to be further developed in the multi-scale case to deal with multiple detections, as we describe below.

5.3. Multi-scale detection procedure

In the detection method based on the wavelet transform, we chose to decompose the lensing signal into J = 5 scales (see Eq. (31)), as this will encompass a maximum of possible galaxy cluster apparent angular sizes. Figure 4 shows the result of such a decomposition applied to the noisy convergence map of a 5° ×5° simulated field. The top left panel shows a noisy version of the E-mode convergence map displayed on the right panel of Fig. 1, and the other panels show the wavelet maps corresponding to scales i = 1–5 (W1–W5). Each wavelet map gives details of the original convergence map at different scales. The first scale (W1) is mostly dominated by the noise. In contrast, the scales from W2 to W5 shed light on different signals within the input E-mode convergence map. In our analysis, we decide to only keep the scales W2, W3 and W4. We remove scale W1 because it is dominated by noise. Scales W5 and above have also been removed because these filter sizes greatly exceed the expected angular size of the clusters we want to detect.

thumbnail Fig. 4.

Wavelet decomposition of an E-mode convergence map (including shape noise), covering a simulated field of 5° ×5°. The upper-left panel corresponds to the E-mode convergence map to be decomposed. The other panels correspond to the wavelet maps at scale W 1 = 1 . 17 $ W1 = 1{{\overset{\prime}{.}}}17 $ (upper-middle panel), scale W 2 = 2 . 32 $ W2 = 2{{\overset{\prime}{.}}}32 $ (upper-right panel), scale W 3 = 4 . 68 $ W3 = 4{{\overset{\prime}{.}}}68 $ (lower-left panel), scale W 4 = 9 . 37 $ W4 = 9{{\overset{\prime}{.}}}37 $ (lower-middle panel), and scale W 5 = 18 . 75 $ W5 = 18{{\overset{\prime}{.}}}75 $ (lower-right panel). The wavelet maps are sensitive to structures of different apparent angular sizes. The W1 map (upper-middle panel) appears to be dominated by the shape noise. The wavelet decomposition also includes the smoothed version, which we excluded from this representation.

On the remaining scales W2, W3, and W4, we apply the single-scale detection procedure described above to extract the local maxima at each individual scale (hereafter W234). We now have to deal with an issue that is specific to the multi-scale approach. A signal from a given object can be detected at several scales, and with a position that varies slightly from one scale to another. We refer to these henceforth as multiple detections. Once we have obtained a peak candidate list for each scale, it is important to identify such multiple detections and to recombine them. This is achieved by computing the separation distance between all the pairs of detections on consecutive scales. If two peaks on different scales have a separation distance smaller than the larger of the two filter radii, we consider them to be a multiple detection. These are then recombined, and the position of the recombined peak is set by the position of the detection at the finest scale. If there are more than two peaks, we repeat this procedure for all the detection peaks and all the scales consecutively. We keep track of the index i of the finest scale where the recombined detection appears, as this will be considered as the scale at which the signal was detected for the matching procedure described below in Sect. 7. We also saved the detections at each individual wavelet scale to study their complementarity before combination and to allow a comparison of their performance with respect to the corresponding AM filters.

6. Analysis of implementation options for the detection algorithms

In practice, the detection is undertaken on a catalogue of galaxies, for each of which a noisy shear measurement is available. As described above, this is typically undertaken through the convolution of the shear with a compensated filter. This convolution can be performed in real space, directly at the position of the galaxies (e.g., Maturi et al. 2005; Wittman et al. 2006; Miyazaki et al. 2018; Hamana et al. 2020), or in Fourier space, binning the shear on a regular grid (e.g., Hennawi & Spergel 2005; Leonard et al. 2015; Oguri et al. 2021). Moreover, some cluster weak lensing detection algorithms are based on the convolution of the convergence (derived from the shear and also defined on a regular grid) by a compensated filter (e.g., White et al. 2002; Hamana et al. 2004; Tang & Fan 2005; Miyazaki et al. 2007). In this section we explore how binning impacts the signal, and whether the choice to use the shear or the convergence has any effect on the detection efficiency.

6.1. Detection in real space versus Fourier space

In the limit of a perfectly homogeneous distribution of background sources, the detection of convergence peaks, whether in real space or in Fourier space, depends only on the number density of these sources. However, as the background sources are not homogeneously distributed, an additional uncertainty is introduced into the method by the binning. This can impact the position and the S/N of the detections. This point has not been fully studied here because the distribution of the background sources is homogeneous in the simulations. However, as the number of background sources per pixel varies slightly for statistical reasons, our study already provides some insights on the impact of the background source distribution.

6.1.1. Binning strategy

The computation time for AM-based detection algorithms is considerable for the real space approach. Computation time is significantly reduced for AM-based detection algorithms when using the Fourier approach. However, there is a loss of resolution compared to the real space approach because the galaxies must be binned on a shear map before performing the Fourier transform, instead of using their exact position. This loss of resolution introduces a loss of power for the high S/N pixels in the shear map, affecting the efficiency of the detection method. An additional disadvantage of the Fourier approach is that there are artefacts at the border of the field due to the periodic assumption of the discrete Fourier transform. Such effects can be significantly mitigated by removing the image boundaries or by dealing correctly with borders during the mass inversion, as proposed in Pires et al. (2020).

The impact of the loss of resolution when working in Fourier space depends on the map resolution (i.e. the number of pixels of the map), and can be mitigated by using an appropriate value. Indeed, there is a trade-off to be made on the map resolution. Decreasing the resolution too much leads to power leakage, which can affect the detection method. On the other hand, finer binning can oversample the underlying data, and can introduce missing data into the map.

Both real space and Fourier space approaches can be found in the literature (e.g., Hennawi & Spergel 2005; Gavazzi & Soucail 2006; Miyazaki et al. 2018; Hamana et al. 2020; Oguri et al. 2021). In this paper we have introduced an additional Fourier space approach with a different binning strategy as explained in Sect. 5.1.

6.1.2. Impact on the signal

We first assessed the impact of the choice of a real or Fourier space approach on the signal conservation, when using an appropriate map resolution. We considered three different implementations: (i) application of an AM filter to the shear at the positions of the galaxies, directly in real space; (ii) application of an AM filter in Fourier space to the binned shear obtained by averaging the signal from the galaxies that fall in each pixel (averaged pixel binning); and (iii) application of an AM filter in Fourier space to the binned shear obtained by summing the signal from the galaxies that fall in each pixel (summed pixel binning).

First, we studied the differences between the three implementations through their spectra in the Fourier domain. For this test, we used the filter proposed by Jarvis et al. (2004) because it can be easily applied to the shear, either at the position of the galaxies or binned on a regular grid. From the shear catalogue of a 5° ×5° field without noise, we computed the E-mode AM maps for the three different implementations and derived the corresponding power spectrum.

The results are presented in the left-hand panel of Fig. 5, with the power spectra shown for implementations 1, 2, and 3. The power spectra are very similar, showing that on average the signal is equally well preserved by all three approaches. To quantitatively compare the implementations, we computed the integral of the power spectra. Compared to the real space approach, we find that the power spectrum integral of the Fourier space implementation suffers from a 1.2% loss for summed pixel binning, and a 1.5% loss for averaged pixel binning.

thumbnail Fig. 5.

Comparison of the real and Fourier space implementations. Left: power spectra of a 5° ×5° AM map with 512 × 512 pixels obtained using the real space implementation (blue) and using the Fourier space implementations (orange and green lines for summed and averaged pixel binning, respectively). Compared to the real space implementation, there is a 1.2% and 1.5% loss of power for the Fourier space implementation with summed and averaged pixel binning, respectively. Right: detections obtained on the same field when the computation is undertaken in real space (blue circles) or in Fourier space using either the summed pixel binning (golden circles) or the averaged pixel binning (green circles). The Fourier space implementation with summed pixel binning is very close to the real space implementation in terms of detections. When the binning is performed by averaging the shear values, there are fewer detections, and about 30% of these detections are different from those obtained with the real space implementation.

6.1.3. Impact on the detections

We now analyse the impact of the three implementation choices discussed above on the detection efficiency. For each of the three implementations, we built the E- and B-mode AM map and applied the detection procedure described in Sect. 5.2 using the Jarvis et al. (2004) AM filter. This was undertaken on all 256 realisations. The right-hand panel of Fig. 5 compares the detections obtained with the three different implementations for one of the realisations.

When averaging these results over the 256 realisations, we found that the mean number of detections using the real space approach (49.2 ± 0.6) is very close to that from the Fourier space approach with summed pixel binning (47.9 ± 0.5). In contrast, the Fourier space approach with averaged pixel binning gives a smaller mean number of detections (43.4 ± 0.5). Moreover, a closer look on the detections made by the different approaches reveals noticeable differences. We found that 28.2%  ±  2.1% of the detections obtained from the averaged pixel binning approach differ from those made by the real space approach. In contrast, with the summed pixel binning approach only 7.6%±1.3% of the detections are different.

These results show that the binning strategy has a non-negligible impact on the detection efficiency, both in terms of number and distribution of the detections. In light of these results, we conclude that the Fourier space approach with summed pixel binning strategy agrees very well with the real space approach, producing results that are statistically similar. In the following, we decided to perform all tests and comparisons using the third implementation (i.e. the Fourier space approach using a summed pixel binning strategy) because it is computationally less demanding. It has the additional advantage of providing a substantial speed increase with respect to detection in real space: for our applications, we find a speed increase of two orders of magnitude, in agreement with the study by Leonard et al. (2012).

6.2. Detection using shear map versus convergence map

Both shear and convergence can give insights into the statistical properties of the weak lensing field, and indeed, it can be shown that they contain precisely the same information (e.g., Schneider et al. 2002; Pires et al. 2020). The (reduced) shear is a direct observable and is usually preferred for reasons of simplicity. However, the convergence has the key advantage that κE encapsulates all the lensing signal, while it is inevitably shared between γ1 and γ2 in the shear. In this connection, the convergence is more adapted for galaxy cluster detection because it explicitly traces the total matter distribution integrated along the LOS. A further advantage of using the convergence is that it is computationally less expensive to analyse.

Although the reconstruction of the convergence field from survey data is a difficult task, Pires et al. (2020) show that the lensing signal is preserved in the convergence maps provided the mass inversion is performed without noise regularisation, and systematic effects such as irregular sampling and complex survey geometries are well controlled.

The AM formalism can be expressed either on the shear or on the convergence maps. We now assess the difference between these two approaches. We first compared them in Fourier space. For each realisation of the noisy shear catalogue, we produced the E- and B-mode AM maps using the J04 AM filter applied to either the shear or the convergence maps. Then, we computed and compared the power spectra. The result of this comparison for one realisation is shown in the left-hand panel of Fig. 6, where we see that the power spectra obtained from the shear aperture map and the convergence aperture map are in good agreement. Averaging over all 256 realisations, we measure a discrepancy of 0.03% in the power spectrum integral between the shear and the convergence approaches. This difference can be explained by residual border effects that are not totally suppressed when removing the image boundaries.

thumbnail Fig. 6.

Comparison of the shear and convergence approaches. Left: power spectra of a 5° ×5° AM map with 512 × 512 pixels obtained by applying the AM J04 filter to the shear maps (blue line) and to the E-mode convergence map (orange dots) computed from the same data as in Fig. 5. The power spectra agree perfectly. Right: detections obtained on the same field when the convolution by the J04 AM filter is performed on the shear map (blue circles) and on the convergence map (golden circles). There is a one-to-one correspondence between the detections.

We then compared the two approaches in terms of detections, applying the detection procedure described in Sect. 5.2. The results, shown in the right-hand panel of Fig. 6, highlight the perfect agreement between the approaches. This result remains valid for wavelet filters that are formally identical to AM filters (see Leonard et al. 2012 for more details). However, while the AM formalism can be applied to either the shear or the convergence, the wavelet transform is not designed to process spin-2 fields (i.e. with two independent components) such as the shear field. So, the shear and convergence approaches being equivalent, we decided to perform all the tests and comparisons applying the AM and Wavelet filters to the convergence maps, to be able to use the wavelet transform.

7. Matching

Once the peaks are detected in the E-mode AM maps, they must be cross-matched with the position of the haloes in the DM halo catalogue. This matching procedure is essential to characterise the performance of the different methods, and the results can be biased if not undertaken correctly. This is particularly true when the filter characteristics differ significantly between methods. We first identify two main points in the matching procedure that can significantly increase the number of false detections.

The first concerns the characteristics of the halo catalogue used for matching. The DM halo catalogue contains objects down to a mass of M200c = 1012h−1M and up to a redshift of z = 3.6. For the cosmology under consideration, a halo catalogue with these characteristics contains about 15, 000 haloes in a field of 5° ×5°. Obviously, only a small fraction of these haloes will be detectable through the weak lensing effect, which is only sensitive to the most massive clusters with sufficient galaxy sources behind them to trace their mass distribution. Changing the characteristics of the halo catalogue, such as imposing limits in mass and/or redshift, changes the performance of the methods with respect to a number of key points (e.g., matching, false associations).

The second is the association distance. Typically, the matching is performed within a fixed physical, comoving, or angular radius centred on the candidate peak, the association distance of which is usually optimised for a given filter (e.g., Hennawi & Spergel 2005; Gavazzi & Soucail 2006; Miyazaki et al. 2018; Hamana et al. 2020; Oguri et al. 2021). However, when comparing different filters, this association distance must be adapted for each detection method to allow a fair comparison.

In the following, we address the first issue by applying a pre-selection to the halo catalogue based on the theoretical S/N of the detection of an NFW halo profile by weak lensing. Regarding the second issue, we developed a method that allows us to adapt the matching distance to the filter characteristics used in the detection method.

7.1. Halo catalogue pre-selection by S/N

Imposing a pre-selection on the DM halo catalogue, by removing those haloes that are unlikely to be detected through weak lensing, reduces significantly the number of false associations. We defined our pre-selection by deriving a measure of the halo detectability in the form of the theoretical S/N of the detection by weak lensing (e.g., Hamana et al. 2004; Hetterscheidt et al. 2005; Bergé et al. 2010; Andreon & Bergé 2012). We followed the approach proposed by Bergé et al. (2010). Assuming the filter perfectly represents the signal κ, the S/N of an halo of mass M200c and redshift zl can be expressed as

ν = n g σ ϵ κ ( θ ) 2 d θ 2 , $$ \begin{aligned} \nu = \frac{\sqrt{\mathrm n_{\rm g}}}{\sigma _{\epsilon }} \sqrt{\int \int \kappa ({\boldsymbol \theta })^2 \,\mathrm d{\boldsymbol \theta }^2}, \end{aligned} $$(34)

where σϵ is the mean intrinsic shape noise of the source galaxies and ng the mean density of background sources. Assuming the mass of the cluster follows an NFW distribution (Navarro et al. 1996) its density profile is given by

ρ ( r ) = ρ s ( r / r s ) ( 1 + r / r s ) 2 , $$ \begin{aligned} \rho (r) = \frac{\rho _{\rm s}}{(r/r_{\rm s})\,(1 + r/r_{\rm s})^2}, \end{aligned} $$(35)

where ρs is the characteristic density and rs = R200c/c is the scale radius. The surface mass density Σ, projected along the LOS, can be written as (see Bartelmann & Schneider 2001)

Σ ( x ) = 2 r s ρ s g ( x ) , $$ \begin{aligned} \Sigma (x) = 2 \, r_{\rm s} \, \rho _{\rm s} \, g(x), \end{aligned} $$(36)

where x = r/rs is a dimensionless radius. The function g is defined with respect to the distance to the halo centre and the concentration, c. If c > 1, it can be written as (Wright & Brainerd 2000)

g ( x ) = 1 1 + c { c 2 x 2 ( 1 x 2 ) ( 1 + c ) + 1 ( 1 x 2 ) 3 / 2 arccosh ( x 2 + c x ( 1 + c ) ) ( x < 1 ) , c 2 + 1 3 ( 1 + 1 1 + c ) ( x = 1 ) , c 2 x 2 ( 1 x 2 ) ( 1 + c ) 1 ( x 2 1 ) 3 / 2 arccos ( x 2 + c x ( 1 + c ) ) ( 1 < x < c ) , 0 ( x > c ) . $$ \begin{aligned} g(x) = \frac{1}{1+c}\left\{ \begin{array}{ll} - \frac{\sqrt{c^2-x^2}}{(1-x^2)(1+c)} + \frac{1}{(1-x^2)^{3/2}}\mathrm{arccosh} \left(\frac{x^2+c}{x(1+c)}\right)&{(x}< 1),\\ \frac{\sqrt{c^2+1}}{3}\left(1+\frac{1}{1+c}\right)&{ (x}\,=\,1), \\ - \frac{\sqrt{c^2-x^2}}{(1-x^2)(1+c)} - \frac{1}{(x^2-1)^{3/2}}\mathrm{arccos} \left( \frac{x^2+c}{x(1+c)} \right)&{(1 < x < c),} \\ 0&{(x>c).} \end{array} \right. \end{aligned} $$(37)

The concentration parameter c is derived using the semi-analytical model introduced by Diemer & Joyce (2019), as implemented in COLOSSUS (Diemer 2018). The S/N of the halo can then be written as (Bergé et al. 2010)

ν = Z 2 2 π n g σ ϵ r s 2 ρ s D d Σ 0 c x g ( x ) 2 d x . $$ \begin{aligned} \nu =\, \langle Z \rangle \,2\sqrt{2\,\pi } \,\frac{\sqrt{n_{\rm g}}}{\sigma _{\epsilon }} \frac{r_{\rm s}^2\, \rho _{\rm s}}{D_{\rm d} \Sigma _{\infty }} \sqrt{\int _0^{\,c} x g(x)^2 \mathrm{d}x}. \end{aligned} $$(38)

The weight parameter Z is designed to take into account the impact of the distribution of the sources (e.g., Seitz & Schneider 1997). the angular-diameter distance to the lens is described by Dd, and Σ is the value of Σcrit for a source at a redshift of infinity. For each halo in the catalogue, we computed its theoretical S/N (ν) assuming a Euclid-like redshift source distribution pzs (derived from the simulation), a galaxy density of ng = 30 gal.arcmin−2 and a shape noise of σϵ = 0.26.

The left-hand panel of Fig. 7 shows the resulting ideal weak lensing detection selection function in the mass-redshift plane for the characteristics of the simulation. The figure is obtained by generating a grid in the mass-redshift plane and computing the S/N value for each point of the grid using Eq. (38). The colour map highlights the increase in S/N towards the upper-left corner, corresponding to the detection of massive clusters at low redshift. The solid white lines correspond to S/N contours of level S/N = [0, 1, 2, 3, 4, 5, 6, 7, 10, 12, 15, 20, 25, 30, 35], which have been interpolated from the (z, M) grid points.

thumbnail Fig. 7.

Application of a theoretical S/N pre-selection to the halo catalogue. Left: weak-lensing selection function for clusters of galaxies, in the redshift–mass plane. The selection function has been computed assuming a Euclid-like redshift distribution with ng = 30 gal.arcmin−2 and σϵ = 0.26. Contours denote the theoretical S/N for an NFW profile. Right: example of halo catalogue pre-selection using the theoretical S/N, for one 5° ×5° field. The light blue dots correspond to all the haloes in the field, and the dark blue dots are the selected haloes with S/N > 2.

However, the above only yields an average value for the S/N of detection assuming a mean source distribution and an optimal filter. In practice, the filters are never perfectly adapted to the cluster shape as assumed in the theoretical S/N estimation. Furthermore, the detection S/N depends on the redshift distribution of the sources that are behind the cluster. The galaxies behind the cluster being few in number, small variations in their redshift can artificially boost or decrease the weak lensing signal.

We applied a conservative threshold to the 256 DM halo catalogues by keeping all the haloes with S/N > 2. Although a halo with S/N = 2 is unlikely to be detected, we applied this value to take into account the variance in the detection due to the distribution of the sources, and the use of an ideal filter. Below this value we do not expect to detect many clusters. We discuss this choice of pre-selection threshold further in Sect 8.3.2. With this pre-selection, in practice, from a typical 5° ×5° field, only 2–3% of the 15 000 haloes in the catalogue are kept. The remaining 300 − 400 haloes are then the basis for the matching procedure. The right-hand panel of Fig. 7 shows the selected haloes in the mass-redshift plane for one 5° ×5° field.

7.2. Matching procedure

The matching procedure between the catalogue haloes and the peak candidates is a complex task. The typical approach consists of evaluating the distance between the candidate peak positions and the closest halo, and to consider these matched if the distance is below a given maximum matching distance (MMD).

In practice, most matching methods simply draw a circle of a given apparent angular radius around the candidate peak and match the closest halo that falls in the circle (e.g., Hamana et al. 2004, 2015, 2020; Gavazzi & Soucail 2006). However, in some procedures the matching is more refined, drawing a circle of a given comoving distance (in Mpc) around the halo and then matching the detection with highest S/N within this radius (e.g., Miyazaki et al. 2018; Oguri et al. 2021). The latter is more precise, but requires access to the redshift of the haloes in the catalogue.

Our study is further complicated by the characteristics of the different filter functions, and the inherent multi-scale nature of the wavelet approach. We therefore used a different method, drawing on SZ cluster survey matching procedures (see e.g., Planck Collaboration XXIX 2014). In our study, the matching is undertaken by computing the apparent angular distance D (hereafter the association distance) between each candidate peak and each halo in the DM halo catalogue, after S/N pre-selection. This association distance is then normalised by the apparent halo radius θ200c. This normalised distance D/θ200c, together with the MMD defined below, are our matching criteria. Together, they take into account the filter size and have the effect of favouring the closest and most massive clusters in the matching.

The matching procedure is implemented iteratively. For N detections and M haloes, we obtain an array of N × M measured distances. We start the iterative process for a given candidate peak by selecting the minimum normalised distance D/θ200c among all the measured normalised distances. The detection and halo corresponding to this minimum normalised distance are considered as matched and the corresponding row and column are removed from the array. The procedure is repeated until we reach the defined MMD. This procedure is perfectly suited to cross-match detections with simulated halo catalogues, yielding excellent results compared to typical methods. However, for application to cross-matching with real cluster catalogues (e.g., optical, X-ray, etc.) access to an estimate of the R200c (or M200c) of the haloes is required.

7.3. Maximum matching distance

The choice of the MMD is critical, and must take the positional precision of the detection method into account, as larger filter sizes will lead to a loss of precision on the coordinates of the peak centre. At the same time, if the chosen MMD is too large, false associations may be introduced even after pre-selecting the halo catalogue, while associations may be missed if the MMD is too small. Therefore, the MMD must be optimised and adapted to the detection method in question.

For this study, we developed an empirical method that is able to adapt the MMD to the detection method. Our approach derives the MMD from the uncertainty in the position of the detection introduced by the size of the filter. In practice, this is undertaken by performing an association procedure on all the 256 fields, which corresponds to applying the matching procedure without any limit in the matching distance.

Figure 8 shows an example of the distribution of the corresponding association distances for all the 256 realisations (about 75 000 haloes after catalogue pre-selection) for the filter W3, for which the filter radius equals 2 . $ 2{{\overset{\prime}{.}}} $32. Each blue dot corresponds to a pairing between a peak detection and its closest halo. The upper-left panel shows the absolute association distance D, expressed in arcminutes, as a function of the association distance D normalised by the angular halo radius θ200c. We clearly distinguish two distinct groups of associations. In the upper group, the detections are outside the characteristic halo radius. Therefore, we can consider this group to be false associations. The lower group is composed of associations for which the peak position is within the θ200c radius of the halo and can thus be considered as true associations. To isolate the correct associations and define a correct MMD, we use a different visualisation of the results. In the lower-right panel of Fig. 8, the blue dots represent a unique peak detection-halo pair whose association distance D has been decomposed in terms of RA and Dec (we show only the central 20 × 20 arcmin). The high-density central region corresponds to peak detections that have been correctly matched to an underlying halo. The dispersion in this high-density central region in the RA-Dec plane highlights the uncertainties in the position of the peak detection introduced by the size of the filter.

thumbnail Fig. 8.

Dispersion of the association distance (without maximum association distance) for the W3 wavelet filter, considering all 256 realisations. In all the panels, the blue dots correspond to an association between a peak detection and its closest halo. Upper left: absolute association distance, D, as a function of its normalised expression with respect to the characteristic halo radius, θ200c. Lower right: zoomed-in view of the central 20 × 20 arcmin distribution of absolute association distances expressed in RA and Dec, to highlight their dispersion. The two remaining panels show the projection of the absolute association distance distribution onto the RA and Dec axes: the projection onto the Dec axis (lower-left panel) and the projection onto the RA axis (upper-right panel). The RA and Dec axes are zoomed around the central area. On these two panels, the dashed orange line shows the result of a fit by a Gaussian function whose parameters (μ, σ) are labelled ‘Mean’ and ‘Stddev’.

To highlight the properties of the dispersion, we projected the distribution on both RA and Dec axes, displayed for the W3 filter in the upper-right and lower-left panels of Fig. 8, respectively. The positional uncertainty is well approximated by a Gaussian distribution. In practice, we obtain the same parameters if we fit a 2D Gaussian distribution to the RA-Dec plane or a 1D Gaussian distribution separately to the RA and Dec components. From the fit we extracted the standard deviation σ, which corresponds to the dispersion of the positional uncertainty of the detection with respect to the true halo position. The Gaussian fit shows a y-offset that can be explained by randomly matched haloes. The Gaussian fit also shows an x-offset corresponding, to the mean of the Gaussian distribution μ, which is due to the pixelisation. From these parameters, we defined the MMD to be 5σ  +  |μ|. We then applied this procedure to all the filters, and the resulting MMDs for each detection method are shown in Table 1. Comparing these values with the values of the filter radii shown in Fig. 3, we see that the MMD is directly proportional to the filter radius, as expected.

Table 1.

Estimated maximum matching distance (MMD) for different filter functions, evaluated using the procedure described in Sect. 7.3.

Once the MMDs are defined, we can apply the matching procedure defined in Sect. 7.2. Figure 9 shows the results of the matching for the W3 filter. The blue dots still refer to the associations from all the detections and their closest halo. The orange dots correspond to the detections that have been successfully matched. Considering only the pairs for which the association distance is lower than the MMD allows us to separate the two groups of associations distinctly. This further allows us to build two catalogues: one of matched detections, and another of unmatched detections.

thumbnail Fig. 9.

Matched detections for the W3 filter. The two panels are similar to Fig. 8. The blue dots correspond to an association between a peak detection and its closest halo. The orange dots correspond to the detections with an absolute association distance lower than the defined MMD.

7.4. Multi-scale matching procedure

In the multi-scale approach, a same cluster can be detected on several successive scales, complicating the matching analysis. To assess the performance of the multi-scale approach, it is important to recombine the multiple detections and to associate them with one single scale (the finest scale), as explained in Sect. 5.3.

In practice, the multi-scale matching procedure works as follows. Starting with the finest scale, the matching procedure is performed using the appropriate matching distance defined in Table 1. Associations at that scale are considered to be unique, and these haloes are removed from the catalogue that is used on subsequent scales. This procedure is repeated for each subsequent scale.

The final result is a catalogue of associated and non-associated detections for each wavelet scale. These are important to measure the individual performance of each wavelet scale, and to compare their overall contribution in terms of detections and associations. The final catalogues of associated and non-associated detections are then obtained by concatenation of the individual single-scale catalogues.

8. Results and performance

We now assess and compare the performance of the detection methods. We start by quantifying the completeness and purity of each sample, as commonly done by other studies (e.g., Miyazaki et al. 2018; Euclid Collaboration 2019; Hamana et al. 2020). Then we analyse in more detail our detections by performing an analysis of the distribution of the matched detections in terms of redshift, mass and scale. We then quantify the number of false detections due to the shape noise for each method. Finally, we undertake a characterisation and a classification of the matched and unmatched detections. For the wavelet filters, we undertake the above both on one single scale (W4), and using the full multi-scale approach (W234).

8.1. Detection method performance

8.1.1. Completeness

To quantify the performance of the different methods and to compare them, we used the completeness, C, defined as follows:

C = Number of matched detections Number of clusters in the halo catalogue · $$ \begin{aligned} C= \frac{\mathrm{Number\,\, of\,\, matched \,\, detections}}{\mathrm{Number\,\, of\,\, clusters\,\, in\,\, the\,\, halo\,\, catalogue}}\cdot \end{aligned} $$(39)

We note that the completeness depends on the characteristics of the halo catalogue used for matching. Changing the pre-selection S/N in the halo catalogue will change the values of the completeness for each method. However, this does not affect the comparison between the methods.

To further develop our analysis, we computed the completeness for each detection method using different S/N thresholds in the halo catalogue. For each method, we computed the completeness of the 256 realisations considering only the haloes in the catalogue whose theoretical S/N is above a given value. We repeated this operation for different S/N limits. The results, averaged over the 256 realisations, are shown in Table 2. We see that considering the haloes with S/N > 2 results in a completeness below 25% for all the detection methods. As expected, the completeness increases with the halo pre-selection S/N threshold, and almost all of the highest S/N clusters are detected by the different detection methods. Even if the overall behaviour is comparable, this quantification shows that the detection methods based on the W234 and M18 filters outperform those based on the S96, TANH, and J04 filters in terms of completeness.

Table 2.

Completeness of the different detection methods for different S/N halo pre-selections, in percent.

8.1.2. Purity

Although the completeness for a given S/N halo selection is a good indicator of the performance of the different detection methods, it does not provide any information on the proportion of unmatched detections. That is why a measure of the purity of the sample is also needed to refine the comparison. The purity, P, is defined as

P = Number of matched detections Total number of detections · $$ \begin{aligned} P = \frac{\mathrm{Number\,\, of\,\, matched \,\,detections}}{\mathrm{Total\,\, number\,\, of\,\, detections}}\cdot \end{aligned} $$(40)

This quantity measures the proportion of matched detections in the sample (e.g., Euclid Collaboration 2019) and can be used to evaluate the false detection rate. We computed the purity of the samples obtained from each detection method for a fixed catalogue pre-selection threshold of S/N > 2. This threshold was chosen so as to not decrease the purity artificially as explained in Sect. 7.1. The results for the different detection methods, averaged over the 256 realisations, are summarised in Table 3. The estimated purity for all detection methods is comparable, at around 85%. This is somewhat lower than the expected number from the false detections estimated in Sect. 8.2.1. This led us to perform a case-by-case analysis of the unmatched detections, as detailed in Sect. 8.3.2. In particular, we observed that for some detections the matching procedure has failed or is too conservative. Although this obviously has an impact on the estimated purity, it does not affect the comparison between the methods because it affects all of them to the same extent.

Table 3.

Detection method performance.

8.1.3. Purity versus completeness

Comparing the results in Table 3, we see that the detection methods based on the S96, TANH, J04, and W4 filters lead to higher purity and lower completeness. Although the completeness is not directly linked to the number of detections, but rather to the number of matched detections, the lower completeness for these filters is consistent with their lower mean number of detections compared to the two other methods, and also with their Fourier space representation, whose band-pass width is narrower. We note that the wavelet filter W4 alone attains similar performance to the single-scale filters S96, TANH and J04. Conversely, the completeness is higher and the purity is lower for the M18 and W234 filters, which use larger band-pass filters. However, the multi-scale approach spans a wider range of possible cluster scales, and therefore allows us to reach a higher completeness. As a consequence, the multi-scale approach results in a higher mean number of detections for the same purity. Compared to the M18 AM filter, for instance, the multi-scale approach yields 25% more detections at a purity of ∼85%.

Table 3 also shows that some detection methods are more efficient in terms of completeness, while others are more efficient in terms of purity. Since the purity and completeness cannot be compared separately, a fair comparison between the detection methods requires an analysis of the evolution of the completeness as a function of the purity. We therefore computed this for each detection method using the pre-selected S/N > 2 catalogue for the matching. In practice, we varied the threshold in the detection procedure described in Sect. 5.2 in steps of 5% and obtained new values of the completeness and purity for each new detection threshold value.

The results, averaged over the 256 realisations, are shown in Fig. 10. We see that the multi-scale approach outperforms the other detection methods at all values of purity. Compared to the detection method based on the TANH filter, there is an increase in the completeness of up to 65% at low purity. The increase in completeness is less for the detection method based on the M18 filter, but it remains significant in the range of purities studied. We also note that the performance of the wavelet filter W4 alone and the single-scale filter J04 are comparable.

thumbnail Fig. 10.

Completeness as a function of purity for the detection methods based on the S96 (blue), TANH (red), J04 (purple), and M18 (green) AM filters, the W4 wavelet filter (light blue), and the W234 wavelet filter (orange). The cross markers represent precisely where the methods are evaluated in terms of purity and completeness. The purity and the completeness are computed using a S/N > 2 halo pre-selection and averaging over the 256 realisations.

8.2. Analysis of the detections

Here we estimate the number of false detections due to the shape noise, and characterise the matched detections in terms of mass, redshift and scale.

8.2.1. False detections

In the following we define false detections as those detections not related to any lensing signal (i.e. overdensity), and therefore due only to the noise. Then, we quantify the number of false detections due to the shape noise for each detection method. For this purpose, we randomised the orientation of the sources in the noisy galaxy catalogues to suppress the lensing signal and to generate pure noise galaxy catalogues. We then derived the associated convergence maps as described in Sect. 5 and applied the various detection methods. For each method, we then computed the mean false detection rate averaged over the 256 realisations. This was obtained from the ratio of the number of detections due purely to the shape noise to the number of detections obtained when the signal is not suppressed. The resulting false detection rate due to the shape noise is given, for each detection method, in the second column of Table 4.

Table 4.

Classification of the detections.

The false detection rate due to shape noise is 1.1% for the J04 detection method, 1.8% for the detection methods based on the S96, TANH and W234 filters, and 3% for the M18 detection method. The impact of the shape noise depends on the filter. The noise level is lower at large scales compared to smaller scales, which explains why the J04 detection method is less sensitive to the noise compared to the other methods. In contrast, the M18 detection method is the most sensitive because it includes small scales. Similarly, the oscillation behaviour of the S96 and TANH filters increases their sensitivity to the noise, by integrating smaller scales. The W234 detection method also includes small scales, but the impact of the shape noise is reduced owing to its multi-scale detection procedure. Despite these differences, the number of false detections due to the shape noise is small, irrespective of detection method.

8.2.2. Characterisation of the matched detections

Once the matching is completed, we characterise the matched detections by plotting their distribution in the redshift-mass plane. Figure 11 shows the distribution of the clusters detected by the five different detection methods in the redshift-mass plane. The figure is obtained from one single realisation, but is representative of the full 256 realisation sample. The matching is performed using only the haloes with S/N > 2.

thumbnail Fig. 11.

Distribution of the matched clusters in the z − M plane for one 5° ×5° patch, using the S96 AM filter in the upper-left panel, the TANH AM filter in the upper-right panel, the J04 AM filter in the middle-left panel, the M18 filter in the middle-right panel, the W4 wavelet filter in the bottom-left panel, and the W234 wavelet filters in the bottom-right panel. The light blue dots correspond to all the haloes within the halo catalogue, and the dark blue dots to the haloes we have pre-selected to have S/N > 2. The overplotted orange crosses correspond to the haloes that have been matched with a detection, and in parentheses we note the number of matches for each method on one patch. This patch was chosen to be representative of the performance of the different methods in terms of detection number.

As expected, all the detection methods are most sensitive to the signal from high mass clusters lying at low redshift. Comparison shows that the multi-scale detection method W234 has more matched detections than the others, in agreement with the completeness results presented in Table 2. In addition, almost all the clusters detected by the S96, TANH, J04, and M18 detection methods are also detected by the multi-scale detection procedure. Compared to the four other detection methods, the multi-scale approach generally detects more clusters at lower masses, and up to higher redshifts. In Appendix A we further analyse the multi-scale approach by investigating the contribution of each wavelet scale to the total number of matched detections.

A striking aspect of Fig. 11 is that some haloes with a very high predicted S/N are not detected, regardless of the method used. We performed a case-by-case analysis of these haloes and we found several explanations for this. In some cases, the detection is missed because of the shape noise. In other cases, the detection is missed because the lensing signal is decreased by the redshift distribution of the sources behind the cluster (through Eqs. (9) and (10)). In some other cases, the cluster is detected but the matching procedure fails because the normalised distance is not always the best criterion.

8.3. Classification of the detections

We now undertake a further analysis of the matched and unmatched detections by classifying them into different categories. In particular, Fig. 12 summarises these categories and the results for the W234 method, after detection and matching of all 256 realisations.

thumbnail Fig. 12.

Characterisation of the detections from the W234 method. The matched detections (blue segments) are categorised as correctly matched with single or multiple haloes, or incorrectly matched as defined in Sect. 8.3.1. The unmatched detections are sub-categorised as blended signal, boosted signal from the source distribution, false detections, or others, as defined in Sect. 8.3.2. The results are averaged over the 256 realisations.

8.3.1. Classification of the matched detections

The matched detections can be classified into two categories.

True detections correctly matched. Almost all the matched detections correspond to correct associations to a single halo of the catalogue (‘Single matched’ in Fig. 12). However, it is inevitable that some true detections can be correctly matched but should in fact be associated with two (or possibly more) haloes whose lensing signal is merged. This may occur when two haloes are aligned in the LOS or when their apparent angular distance is small compared to the size of the filter. In this case, the matching procedure does not differentiate and the association is undertaken simply with the closest halo. We estimate that about 5% of the matched detections fall in this category by searching for multiple haloes in the vicinity of the matched detections (‘Multiple Matched (LOS)’ in Fig. 12). These imperfect associations can slightly reduce the estimated completeness, but all the methods are impacted in the same way by this effect.

False detections incorrectly matched. Among the matched detections, a small fraction are in fact randomly-matched false detections. These are detections due to the shape noise that are matched owing to being close to a halo in the catalogue. Their number depends both on the number of false detections (see Sect. 8.2.1) and on the false association rate (given in Table 5). In this study, we intentionally keep this number very small for all the detection methods (less than 0.3%). However, one must be careful because both the detection and matching procedures can make this number become large. Thus, it is essential to always compute this number to fully assess the performance of the detection methods.

Table 5.

False association rate for different filter functions.

8.3.2. Classification of the unmatched detections

Here we focus on the detections that are not matched with a halo, to fully complete the analysis of the different detection methods. These unmatched detections can be classified into several categories.

False detections unmatched correctly. The most intuitive class for of unmatched detections is that composed of those due to the shape noise (see Sect. 8.2.1.) Beyond the small fraction that is incorrectly matched owing to being located close to a halo in the catalogue (see above), almost all of these false detections are not matched. Their number thus corresponds approximately to the false detection rate, and is given in the second column of Table 4. This number is very small compared to the estimated purity in Sect. 8.1, and other explanations need to be found to explain the remaining ∼15% of the unmatched detections.

Boosted lensing signal unmatched incorrectly. One possible explanation for the remaining unmatched detections is that some of these detections should be associated with haloes that have been removed from the halo catalogue after the S/N pre-selection. This could happen because the S/N estimated for each halo in the catalogue to perform the selection is theoretical. It is computed assuming an average theoretical source density and a theoretical distribution, pzs, for the redshift of the sources. In practice the sources behind a cluster are few in number. As a consequence their number and their redshift distribution can deviate significantly from the theoretical values, and can in some cases boost the lensing signal of a cluster, regardless of its mass and redshift. At the same time, selecting haloes with a much lower S/N in the halo catalogue serves to increase the false association rate, and so the number of false detections matched incorrectly (see Sect. 8.3.1). Table 5 gives the false association rate when using a S/N > 2 pre-selection in the halo catalogue. This remains below 15% for all detection methods. Thus, the choice of imposing a S/N > 2 pre-selection in the halo catalogue is a trade-off between the false association rate and the missed association rate.

We estimated the fraction of unmatched detections due to a boosted lensing signal by undertaking the matching procedure a second time for all 256 realisations, but with a halo pre-selection of S/N > 1.5. The results for each method is given in the third column of Table 4.

Blended lensing signal unmatched incorrectly. A further explanation for the remaining unmatched detections is the case of two haloes contributing to the same detection:

  1. Blended LOS: The blended lensing signal can be caused by LOS alignments (e.g., filamentary structures). This means that the detection is coming from two haloes with S/N < 2 that are aligned along the LOS and whose lensing signals cannot be separated without considering the redshift distribution of the background galaxies in the detection step.

  2. Blended haloes: The blended lensing signal can also be linked to the size of the filter. Indeed, some unmatched detections can come from two haloes whose apparent distance is small compared to the filter radius. The lensing signal of these two haloes being merged within the AM map, there is a single detection associated with the two haloes, the position of which depends on the mass and redshift of the two haloes. In this case, the detection is unmatched when the theoretical S/N of the two haloes is less than 2, or when the position of the detection is too far away from the centre of the two haloes.

We estimated the proportion of blended lensing signal falling in the two categories by searching for multiple haloes with S/N > 1.5 in the vicinity of the unmatched detections. The proportion of unmatched detections due to the blending of the signal coming from two (or possibly more) haloes is given in the fourth column of Table 4. In this class of unmatched detections, we found that about a third of them are due to intrinsic LOS alignments.

A small fraction of unmatched detections do not fall into the above three categories. The fraction for each method is listed as ‘others’ in the last column of Table 4. Most of these remaining detections can be matched with haloes with S/N < 1.5. But further decreasing the halo pre-selection threshold increases the false association rate, making this classification very uncertain.

9. Summary and conclusions

The sensitivity of wide-field optical surveys now allows for the blind detection of galaxy clusters through their weak lensing signal. This situation will vastly improve with the launch of Euclid and the Roman Space Telescope. However, the construction of weak-lensing-selected galaxy cluster catalogues requires the improvement or development of new galaxy cluster detection algorithms able to cope with the increasingly large volume of data.

In this paper we have introduced a new, fast, multi-scale approach, based on wavelet filters operating on complementary scales, for detecting galaxy clusters through their weak lensing signal. This new method, W234, was compared to four commonly used approaches (S96, TANH, J04, and M18) based on AM filters. The comparison was performed on the same set of Euclid wide survey-like mocks obtained from the DUSTGRAIN-pathfinder simulations, composed of 256 realisations of a field of 5° × 5°, corresponding to a total area of 6400 deg2.

We first undertook a thorough examination of the filter characteristics in real and Fourier space. This allowed us to demonstrate the equivalence between some of the commonly used AM filters (S96, TANH, J04, and M18) and wavelet filters at individual scales; moreover, we show that the M18 AM filter is comparable to a combination of two wavelet filters. We then investigated different options for implementing the detection methods. Applying the detection methods to the 256 realisations, we find:

  • Similar results are obtained when applying the detection algorithms to the shear directly at the galaxy positions in real space or to the binned shear in Fourier space. Binning a 5° × 5° field into 512 × 512 bins, the power spectrum integral of the AM map suffers a maximum 1.5% loss when passing from real to Fourier space.

  • The binning scheme that is adopted can have an impact on the detection efficiency in terms of the number and distribution of detections. In particular, binning by averaging the shear in each pixel yields a smaller number of detections than binning by summing the shear, and a significant fraction of these detections (∼28%) are different from those detected in the real space approach. In contrast, the summed pixel binning yields very similar results, in terms of both number and distribution, to application to the shear in real space.

  • Application of the detection methods to the shear or the convergence yields similar results. In particular, the power spectrum loss on the AM map is negligible when passing from the shear to the convergence, and the number and distribution of detections agree exactly.

To quantitatively compare the results obtained from the different filters, each having different characteristic radii, we developed a new adaptive matching procedure to match the detections to the haloes in the simulations. We find:

  • The characteristics of the halo catalogue used for the matching can have a dramatic effect on the number of false associations. We thus applied a pre-selection to the halo catalogue based on the theoretical weak lensing S/N expected for an NFW halo, and kept only haloes with S/N > 2 for the subsequent matching step.

  • The association distance is a critical parameter when dealing with filters that have different characteristic radii. It must take both the spatial resolution of the filter and the angular size of the haloes into account. We obtained an optimal matching distance for the detections from each filter by considering the absolute association distance, D, of all haloes as a function of the association distance normalised by the characteristic radius of each halo, D/θ200c. Then, decomposing the distribution of the absolute association distance in terms of RA and Dec and fitting with a Gaussian, we obtained an optimal MMD for each filter.

  • For the multi-scale wavelet detections, a halo might be detected on more than one successive scale. We thus carried out a special matching procedure for the detections from the wavelet filters, building a cumulative catalogue from the finest to the coarsest scales.

We then compared the performance of the detection methods, after running them on the 256 simulations and using the matching procedure developed in this paper. We find:

  • The completeness is a strong function of the halo catalogue pre-selection threshold. However, irrespective of the exact S/N pre-selection threshold, the S96, TANH, and J04 filters exhibit a lower completeness than the M18 and W234 filters. In contrast, the J04 filter exhibits slightly higher purity at the expense of completeness.

  • The evolution of the completeness as a function of purity shows that the multi-scale approach outperforms the methods based on AM filters at all values of purity.

  • The multi-scale method detects practically all the detections from the individual AM approaches, with the addition of supplemental detections, particularly at lower masses and higher redshifts.

Finally, we performed an exhaustive analysis of the matched and unmatched detections for each method, using a S/N > 2 pre-selection on the halo catalogue. We found similar results for all methods. For the W234 method, we find:

  • More than 82% of the detections are correctly matched, and of these, the overwhelming majority (95%) are due to a single halo. For our matching procedure, correctly matched detections due to multiple haloes along the LOS account for less than 5% of the total. Furthermore, because our detection and matching methods aim at high purity, the number of incorrectly matched haloes is negligible (< 1%).

  • About 17% of the detections are unmatched. Only a small fraction of these (10%) are due to false detections. The remainder can be explained mostly by haloes below the S/N threshold whose signal has been boosted because of a fortuitous redshift distribution of background sources (37%), or a blended lensing signal from LOS alignments, or angular separation distances smaller than the filter size (32%). The final 20% of the unmatched detections can likely be associated with haloes with S/N < 1.5 that do not appear in the catalogue due to the pre-selection.

In this work we have shown that a multi-scale approach is faster and gives better results, in terms of purity and completeness, than the currently used AM methods. We have also introduced a new adaptative matching procedure that allows a fair comparison between detection methods that operate on different scales. A significant advantage of our method is its computational speed. Compared to methods that apply single-scale AM filters to the shear directly at the galaxy positions, our multi-scale method applied to the convergence yields a gain of two to three orders of magnitude in speed.

Based on these results, we aim to apply our multi-scale approach to current and future large-area survey data, which will allow the construction of large catalogues of clusters selected through their weak lensing signal. This next step also involves new challenges. As detailed in Hamana et al. (2020), the presence of foreground galaxies that are not part of the cluster can dilute the lensing signal. This dilution effect contaminates the weak lensing dataset and analyses. One way to tackle this issue is to use photometric redshifts to select sources and minimise contamination. By taking the redshift distribution within the dataset into account, the tomographic approach reduces the impact of the dilution effect. This solution has already shown promising results for the detection of galaxy clusters in the HSC survey (e.g., Hamana et al. 2020; Oguri et al. 2021). The tomographic approach may also yield coarse information on the redshift of the detected sources. We are confident that a multi-scale approach combined with tomography will improve on the results shown here. Application to upcoming deep, large-area surveys will allow us to develop our method further.


Acknowledgments

The authors would like to thank Joel Bergé for providing his code to estimate the theoretical S/N of detection of a halo by weak lensing, Jean-Baptiste Melin for his advices and Peter Schneider for his helpful comments on the filters. G.L. acknowledges funding from CNES, the French space agency. G.C. acknowledges support from INAF theory Grant 2022: Illuminating Dark Matter using Weak Lensing by Cluster Satellites.

References

  1. Aihara, H., Armstrong, R., Bickerton, S., et al. 2018, PASJ, 70, S8 [NASA ADS] [Google Scholar]
  2. Allen, S. W., Evrard, A. E., & Mantz, A. B. 2011, ARA&A, 49, 409 [Google Scholar]
  3. Andreon, S., & Bergé, J. 2012, A&A, 547, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Bartelmann, M., & Schneider, P. 2001, Phys. Rep., 340, 291 [Google Scholar]
  5. Bergé, J., Amara, A., & Réfrégier, A. 2010, ApJ, 712, 992 [CrossRef] [Google Scholar]
  6. Bocquet, S., Dietrich, J. P., Schrabback, T., et al. 2019, ApJ, 878, 55 [Google Scholar]
  7. Closson Ferguson, H., Armus, L., Borne, K., et al. 2009, AAS Meeting Abstr., 213, 460.07 [NASA ADS] [Google Scholar]
  8. Cropper, M., Hoekstra, H., Kitching, T., et al. 2013, MNRAS, 431, 3103 [Google Scholar]
  9. Davis, M., Efstathiou, G., Frenk, C. S., & White, S. D. M. 1985, ApJ, 292, 371 [Google Scholar]
  10. Diemer, B. 2018, ApJS, 239, 35 [NASA ADS] [CrossRef] [Google Scholar]
  11. Diemer, B., & Joyce, M. 2019, ApJ, 871, 168 [NASA ADS] [CrossRef] [Google Scholar]
  12. Dietrich, J. P., & Hartlap, J. 2010, MNRAS, 402, 1049 [Google Scholar]
  13. Euclid Collaboration (Adam, R., et al.) 2019, A&A, 627, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Fan, Z., Shan, H., & Liu, J. 2010, ApJ, 719, 1408 [Google Scholar]
  15. Gavazzi, R., & Soucail, G. 2006, A&A, 462, 459 [Google Scholar]
  16. Giocoli, C., Metcalf, R. B., Baldi, M., et al. 2015, MNRAS, 452, 2757 [Google Scholar]
  17. Giocoli, C., Di Meo, S., Meneghetti, M., et al. 2017, MNRAS, 470, 3574 [NASA ADS] [CrossRef] [Google Scholar]
  18. Giocoli, C., Baldi, M., & Moscardini, L. 2018, MNRAS, 481, 2813 [Google Scholar]
  19. Hamana, T., Takada, M., & Yoshida, N. 2004, MNRAS, 350, 893 [NASA ADS] [CrossRef] [Google Scholar]
  20. Hamana, T., Oguri, M., Shirasaki, M., & Sato, M. 2012, MNRAS, 425, 2287 [Google Scholar]
  21. Hamana, T., Sakurai, J., Koike, M., & Miller, L. 2015, PASJ, 67, 34 [NASA ADS] [Google Scholar]
  22. Hamana, T., Shirasaki, M., & Lin, Y.-T. 2020, PASJ, 72, 78 [NASA ADS] [CrossRef] [Google Scholar]
  23. Hasselfield, M., Hilton, M., Marriage, T. A., et al. 2013, JCAP, 2013, 008 [Google Scholar]
  24. Hennawi, J., & Spergel, D. 2005, ApJ, 624, 59 [NASA ADS] [CrossRef] [Google Scholar]
  25. Hetterscheidt, M., Erben, T., Schneider, P., et al. 2005, A&A, 442, 43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]
  27. Jarvis, M., Bernstein, G., & Jain, B. 2004, MNRAS, 352, 338 [Google Scholar]
  28. Kaiser, N., & Squires, G. 1993, ApJ, 404, 441 [Google Scholar]
  29. Lanusse, F., Starck, J. L., Leonard, A., & Pires, S. 2016, A&A, 591, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, arXiv e-prints [arXiv:1110.3193] [Google Scholar]
  31. Leauthaud, A., Massey, R., Kneib, J.-P., et al. 2007, ApJS, 172, 219 [NASA ADS] [CrossRef] [Google Scholar]
  32. Leonard, A., Pires, S., & Starck, J.-L. 2012, MNRAS, 423, 3405 [NASA ADS] [CrossRef] [Google Scholar]
  33. Leonard, A., Lanusse, F., & Starck, J.-L. 2015, MNRAS, 449, 1146 [NASA ADS] [CrossRef] [Google Scholar]
  34. Lesci, G. F., Marulli, F., Moscardini, L., et al. 2022, A&A, 659, A88 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. Lin, C.-A., Kilbinger, M., & Pires, S. 2016, A&A, 593, A88 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. Maturi, M., Meneghetti, M., Bartelmann, M., Dolag, K., & Moscardini, L. 2005, A&A, 442, 851 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  37. Maturi, M., Schirmer, M., Meneghetti, M., Bartelmann, M., & Moscardini, L. 2007, A&A, 462, 473 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Miyazaki, S., Hamana, T., Shimasaku, K., et al. 2002, ApJ, 580, L97 [NASA ADS] [CrossRef] [Google Scholar]
  39. Miyazaki, S., Hamana, T., Ellis, R. S., et al. 2007, ApJ, 669, 714 [NASA ADS] [CrossRef] [Google Scholar]
  40. Miyazaki, S., Oguri, M., Hamana, T., et al. 2018, PASJ, 70, S27 [NASA ADS] [Google Scholar]
  41. Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 [Google Scholar]
  42. Oguri, M., Miyazaki, S., Li, X., et al. 2021, PASJ, 73, 817 [NASA ADS] [CrossRef] [Google Scholar]
  43. Pacaud, F., Pierre, M., Réfrégier, A., et al. 2006, MNRAS, 372, 578 [NASA ADS] [CrossRef] [Google Scholar]
  44. Pace, F., Maturi, M., Meneghetti, M., et al. 2007, A&A, 471, 731 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Perrenod, S. C. 1980, ApJ, 236, 373 [CrossRef] [Google Scholar]
  46. Pires, S., Leonard, A., & Starck, J.-L. 2012, MNRAS, 423, 983 [NASA ADS] [CrossRef] [Google Scholar]
  47. Pires, S., Vandenbussche, V., Kansal, V., et al. 2020, A&A, 638, A141 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Planck Collaboration XXIX. 2014, A&A, 571, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  49. Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  50. Pratt, G. W., Arnaud, M., Biviano, A., et al. 2019, Space Sci. Rev., 215, 25 [Google Scholar]
  51. Pyne, T., & Birkinshaw, M. 1993, ApJ, 415, 459 [NASA ADS] [CrossRef] [Google Scholar]
  52. Rozo, E., Wechsler, R. H., Rykoff, E. S., et al. 2010, ApJ, 708, 645 [Google Scholar]
  53. Salvati, L., Saro, A., Bocquet, S., et al. 2022, ApJ, 934, 129 [NASA ADS] [CrossRef] [Google Scholar]
  54. Schirmer, M. 2004, PhD Thesis, Rheinische Friedrich-Wilhelms-Universität Bonn, Germany [Google Scholar]
  55. Schirmer, M., Erben, T., Schneider, P., Wolf, C., & Meisenheimer, K. 2004, A&A, 420, 75 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  56. Schirmer, M., Erben, T., Hetterscheidt, M., & Schneider, P. 2007, A&A, 462, 875 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  57. Schneider, P. 1996, MNRAS, 283, 837 [Google Scholar]
  58. Schneider, P., & Bartelmann, M. 1997, MNRAS, 286, 696 [Google Scholar]
  59. Schneider, P., Ehlers, J., & Falco, E. E. 1992, Gravitational Lenses (Berlin Heidelberg New York: Springer-Verlag) [Google Scholar]
  60. Schneider, P., van Waerbeke, L., Jain, B., & Kruse, G. 1998, MNRAS, 296, 873 [NASA ADS] [CrossRef] [Google Scholar]
  61. Schneider, P., van Waerbeke, L., Kilbinger, M., & Mellier, Y. 2002, A&A, 396, 1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  62. Schrabback, T., Hilbert, S., Hoekstra, H., et al. 2015, MNRAS, 454, 1432 [Google Scholar]
  63. Schrabback, T., Schirmer, M., van der Burg, R. F. J., et al. 2018, A&A, 610, A85 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  64. Seitz, C., & Schneider, P. 1997, A&A, 318, 687 [NASA ADS] [Google Scholar]
  65. Shan, H., Kneib, J.-P., Tao, C., et al. 2012, ApJ, 748, 56 [NASA ADS] [CrossRef] [Google Scholar]
  66. Shan, H., Liu, X., Hildebrandt, H., et al. 2018, MNRAS, 474, 1116 [Google Scholar]
  67. Springel, V., White, S. D. M., Tormen, G., & Kauffmann, G. 2001, MNRAS, 328, 726 [Google Scholar]
  68. Spergel, D., Gehrels, N., Baltay, C., et al. 2015, arXiv e-prints [arXiv:1503.03757] [Google Scholar]
  69. Starck, J., Murtagh, F., & Bijaoui, A. 1998, Image Processing and Data Analysis: The Multiscale Approach (Cambridge University Press) [CrossRef] [Google Scholar]
  70. Starck, J. L., Pires, S., & Réfrégier, A. 2006, A&A, 451, 1139 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  71. Tang, J. Y., & Fan, Z. H. 2005, ApJ, 635, 60 [NASA ADS] [CrossRef] [Google Scholar]
  72. Van Waerbeke, L. 1998, A&A, 334, 1 [Google Scholar]
  73. van Waerbeke, L. 2000, MNRAS, 313, 524 [CrossRef] [Google Scholar]
  74. Vikhlinin, A., Kravtsov, A. V., Burenin, R. A., et al. 2009, ApJ, 692, 1060 [Google Scholar]
  75. Voit, G. M. 2005, Rev. Mod. Phys., 77, 207 [Google Scholar]
  76. White, S. D. M., & Rees, M. J. 1978, MNRAS, 183, 341 [Google Scholar]
  77. White, M., van Waerbeke, L., & Mackey, J. 2002, ApJ, 575, 640 [CrossRef] [Google Scholar]
  78. Wittman, D., Dell’Antonio, I. P., Hughes, J. P., et al. 2006, ApJ, 643, 128 [NASA ADS] [CrossRef] [Google Scholar]
  79. Wright, C. O., & Brainerd, T. G. 2000, ApJ, 534, 34 [Google Scholar]

Appendix A: Contribution of each wavelet scale

In this section we study the contribution of each wavelet scale to the matched detections made by the multi-scale approach.

Figure A.1 shows the distribution of the matched clusters in the z − M plane for the W2 wavelet filter (upper-left panel), the W3 wavelet filter (upper-right panel) and the W4 wavelet filter (lower-left panel). The lower-right panel shows the distribution of the matched clusters in the z − M plane for the multi-scale detection method. The colours allow us to see the contribution of each scale after recombination of the multiple detections. The different panels also highlight the differences in terms of targeted clusters by the different wavelet scales. In particular, we see the complementarity of the different wavelet filters.

thumbnail Fig. A.1.

Distribution of the matched clusters in the z-M plane for one 5° ×5° patch, obtained using the W2 wavelet filter (upper-left panel), W3 wavelet filter (upper-right panel), W4 wavelet filter (lower-left panel), and the multi-scale approach (lower-right panel).

If we look at the number of matched detections for each wavelet scale after recombination, we see that the proportion of detections made at each scale does not vary much from one realisation to another. On average, the W2 wavelet filter provides up to 29%±0.39, W3 around 33%±0.57, and W4 around 38%±0.42 of the matched detections. Hence, the three wavelet scales are useful and contribute similarly to the total number of matched detections.

All Tables

Table 1.

Estimated maximum matching distance (MMD) for different filter functions, evaluated using the procedure described in Sect. 7.3.

Table 2.

Completeness of the different detection methods for different S/N halo pre-selections, in percent.

Table 3.

Detection method performance.

Table 4.

Classification of the detections.

Table 5.

False association rate for different filter functions.

All Figures

thumbnail Fig. 1.

Simulated shear maps covering a field of 5° ×5° (left and middle panels) and the corresponding E-mode convergence map (right). The E-mode convergence map directly traces the projected matter distribution, the overdensities appearing as bright structures.

In the text
thumbnail Fig. 2.

Comparison of different filter functions, U (left) and Q (right), for AM map reconstruction. The figure shows in blue (dotted lines) the AM filter functions defined by Schneider (1996, S96) with R = 9′; in red (dash-dotted lines) the AM filter functions defined in Schirmer (2004, TANH) with R = 7′; in purple (dash-dotted lines) the AM filter functions defined in Jarvis et al. (2004, J04) with σ = 4′; and in green (dashed lines) the AM filter functions defined by Miyazaki et al. (2018, M18). The filter UM18 is also a compensated filter thanks to the U0 term, but due to the high truncation radius we used, this is not distinctly visible. Finally, the AM filter functions corresponding to the wavelet filter functions at scale i = 4 are displayed in orange (solid lines). All the filters are normalised to have a maximum amplitude of 1.

In the text
thumbnail Fig. 3.

Fourier space comparison of several AM filters and wavelet filters, expressed as a function of wave number k in arcmin−1 and filter radius in arcmin. The dotted blue line (upper panel) represents the S96 filter, the dash-dotted red and purple lines (middle panels) represent the TANH and J04 filters, and the dashed green line (lower panel) corresponds to the M18 filter. The solid lines show the different wavelet filters: the teal, green, orange, and red light lines correspond to the W2, W3, W4, and W5 wavelet filters with radii of 1 . 17 $ 1{{\overset{\prime}{.}}}17 $, 2 . 32 $ 2{{\overset{\prime}{.}}}32 $, 4 . 68 $ 4{{\overset{\prime}{.}}}68 $, and 9 . 37 , $ 9{{\overset{\prime}{.}}}37, $ respectively. For comparison purposes, the J04 and M18 filters are forced to the same amplitude as the W4 function to illustrate their behaviour in terms of filter radius.

In the text
thumbnail Fig. 4.

Wavelet decomposition of an E-mode convergence map (including shape noise), covering a simulated field of 5° ×5°. The upper-left panel corresponds to the E-mode convergence map to be decomposed. The other panels correspond to the wavelet maps at scale W 1 = 1 . 17 $ W1 = 1{{\overset{\prime}{.}}}17 $ (upper-middle panel), scale W 2 = 2 . 32 $ W2 = 2{{\overset{\prime}{.}}}32 $ (upper-right panel), scale W 3 = 4 . 68 $ W3 = 4{{\overset{\prime}{.}}}68 $ (lower-left panel), scale W 4 = 9 . 37 $ W4 = 9{{\overset{\prime}{.}}}37 $ (lower-middle panel), and scale W 5 = 18 . 75 $ W5 = 18{{\overset{\prime}{.}}}75 $ (lower-right panel). The wavelet maps are sensitive to structures of different apparent angular sizes. The W1 map (upper-middle panel) appears to be dominated by the shape noise. The wavelet decomposition also includes the smoothed version, which we excluded from this representation.

In the text
thumbnail Fig. 5.

Comparison of the real and Fourier space implementations. Left: power spectra of a 5° ×5° AM map with 512 × 512 pixels obtained using the real space implementation (blue) and using the Fourier space implementations (orange and green lines for summed and averaged pixel binning, respectively). Compared to the real space implementation, there is a 1.2% and 1.5% loss of power for the Fourier space implementation with summed and averaged pixel binning, respectively. Right: detections obtained on the same field when the computation is undertaken in real space (blue circles) or in Fourier space using either the summed pixel binning (golden circles) or the averaged pixel binning (green circles). The Fourier space implementation with summed pixel binning is very close to the real space implementation in terms of detections. When the binning is performed by averaging the shear values, there are fewer detections, and about 30% of these detections are different from those obtained with the real space implementation.

In the text
thumbnail Fig. 6.

Comparison of the shear and convergence approaches. Left: power spectra of a 5° ×5° AM map with 512 × 512 pixels obtained by applying the AM J04 filter to the shear maps (blue line) and to the E-mode convergence map (orange dots) computed from the same data as in Fig. 5. The power spectra agree perfectly. Right: detections obtained on the same field when the convolution by the J04 AM filter is performed on the shear map (blue circles) and on the convergence map (golden circles). There is a one-to-one correspondence between the detections.

In the text
thumbnail Fig. 7.

Application of a theoretical S/N pre-selection to the halo catalogue. Left: weak-lensing selection function for clusters of galaxies, in the redshift–mass plane. The selection function has been computed assuming a Euclid-like redshift distribution with ng = 30 gal.arcmin−2 and σϵ = 0.26. Contours denote the theoretical S/N for an NFW profile. Right: example of halo catalogue pre-selection using the theoretical S/N, for one 5° ×5° field. The light blue dots correspond to all the haloes in the field, and the dark blue dots are the selected haloes with S/N > 2.

In the text
thumbnail Fig. 8.

Dispersion of the association distance (without maximum association distance) for the W3 wavelet filter, considering all 256 realisations. In all the panels, the blue dots correspond to an association between a peak detection and its closest halo. Upper left: absolute association distance, D, as a function of its normalised expression with respect to the characteristic halo radius, θ200c. Lower right: zoomed-in view of the central 20 × 20 arcmin distribution of absolute association distances expressed in RA and Dec, to highlight their dispersion. The two remaining panels show the projection of the absolute association distance distribution onto the RA and Dec axes: the projection onto the Dec axis (lower-left panel) and the projection onto the RA axis (upper-right panel). The RA and Dec axes are zoomed around the central area. On these two panels, the dashed orange line shows the result of a fit by a Gaussian function whose parameters (μ, σ) are labelled ‘Mean’ and ‘Stddev’.

In the text
thumbnail Fig. 9.

Matched detections for the W3 filter. The two panels are similar to Fig. 8. The blue dots correspond to an association between a peak detection and its closest halo. The orange dots correspond to the detections with an absolute association distance lower than the defined MMD.

In the text
thumbnail Fig. 10.

Completeness as a function of purity for the detection methods based on the S96 (blue), TANH (red), J04 (purple), and M18 (green) AM filters, the W4 wavelet filter (light blue), and the W234 wavelet filter (orange). The cross markers represent precisely where the methods are evaluated in terms of purity and completeness. The purity and the completeness are computed using a S/N > 2 halo pre-selection and averaging over the 256 realisations.

In the text
thumbnail Fig. 11.

Distribution of the matched clusters in the z − M plane for one 5° ×5° patch, using the S96 AM filter in the upper-left panel, the TANH AM filter in the upper-right panel, the J04 AM filter in the middle-left panel, the M18 filter in the middle-right panel, the W4 wavelet filter in the bottom-left panel, and the W234 wavelet filters in the bottom-right panel. The light blue dots correspond to all the haloes within the halo catalogue, and the dark blue dots to the haloes we have pre-selected to have S/N > 2. The overplotted orange crosses correspond to the haloes that have been matched with a detection, and in parentheses we note the number of matches for each method on one patch. This patch was chosen to be representative of the performance of the different methods in terms of detection number.

In the text
thumbnail Fig. 12.

Characterisation of the detections from the W234 method. The matched detections (blue segments) are categorised as correctly matched with single or multiple haloes, or incorrectly matched as defined in Sect. 8.3.1. The unmatched detections are sub-categorised as blended signal, boosted signal from the source distribution, false detections, or others, as defined in Sect. 8.3.2. The results are averaged over the 256 realisations.

In the text
thumbnail Fig. A.1.

Distribution of the matched clusters in the z-M plane for one 5° ×5° patch, obtained using the W2 wavelet filter (upper-left panel), W3 wavelet filter (upper-right panel), W4 wavelet filter (lower-left panel), and the multi-scale approach (lower-right panel).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.