YOLO–CL: Galaxy cluster detection in the SDSS with deep machine learning

Kirill Grishin; Simona Mei; Stéphane Ilić

doi:10.1051/0004-6361/202345976

Volume 677 (September 2023)

A&A, 677 (2023) A101

Open Access

Issue		A&A Volume 677, September 2023


Article Number		A101
Number of page(s)		13
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/202345976
Published online		12 September 2023

Kirill Grishin¹, Simona Mei¹,² and Stéphane Ilić³,¹,⁴,⁵

¹ Université Paris Cité, CNRS(/IN2P3), Astroparticule et Cosmologie, 10 rue Alice Domon et Léonie Duquet, 75013 Paris, France
e-mail: grishin@apc.in2p3.fr; mei@apc.in2p3.fr
² Jet Propulsion Laboratory, Cahill Center for Astronomy & Astrophysics, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, California, USA
³ Université PSL, Observatoire de Paris, Sorbonne Université, CNRS, LERMA, 75 Avenue Denfert-Rochereau, 75014 Paris, France
⁴ CNES, Centre National d’Études Spatiales, 18 Av. Edouard Belin, Toulouse, France
⁵ IJCLab, Université Paris-Saclay, CNRS/IN2P3, IJCLab, rue André Ampère, Campus de la Faculté des Sciences, 91405 Orsay, France

Received: 23 January 2023
Accepted: 10 May 2023

Abstract

Galaxy clusters are powerful probes for cosmological models. Next-generation, large-scale optical and infrared surveys are poised to reach unprecedented depths and, thus, they require highly complete and pure cluster catalogs, with a well-defined selection function. We have developed a new cluster detection algorithm named YOLO for CLuster detection (YOLO–CL), which is a modified version of the state-of-the-art object detection deep convolutional network named You only look once (YOLO) that has been optimized for the detection of galaxy clusters. We trained YOLO–CL on the red-sequence Matched-filter Probabilistic Percolation (redMaPPer) cluster catalog, based on Sloan Digital Sky Survey (SDSS) color images. We find that YOLO–CL detects 95–98% of the redMaPPer clusters, with a purity of 95–98%, that is calculated by applying the network to SDSS blank fields. When compared to the Meta-Catalog of X-Ray Detected Clusters of Galaxies 2021 (MCXC2021) X-ray catalog in the SDSS footprint, YOLO–CL recovers all clusters at L_X ≳ 2–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙, R₅₀₀≳0.75–0.8 Mpc and 0.4 ≲ z ≲ 0.6. When compared to the redMaPPer detection of the same MCXC2021 clusters, we find that YOLO–CL is more complete than redMaPPer, which means that the neural network has indeed improved the cluster detection efficiency of its training sample. In fact, YOLO–CL detects ~98% of the MCXC2021 clusters with an X-ray surface brightness of I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.3 ≲ z ≲ 0.6; while redMaPPer detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 55 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.5 ≲ z ≲ 0.6. The YOLO–CL selection function is approximately constant with redshift, with respect to the MCXC2021 cluster X-ray surface brightness. 
YOLO–CL exhibits a high level of performance when compared to traditional detection algorithms applied to SDSS. Deep learning networks display a strong advantage over traditional galaxy cluster detection techniques because they do not require galaxy photometric and photometric redshift catalogs. This eliminates systematic uncertainties that may be introduced during source detection and photometry, as well as photometric redshift measurements. Our results show that YOLO–CL is an efficient alternative to traditional cluster detection methods. In general, this work shows that it is worth exploring the performance of deep convolutional networks for future cosmological cluster surveys, such as the Rubin/Legacy Survey of Space and Time (Rubin/LSST), Euclid, and Roman Space Telescope surveys.

Key words: large-scale structure of Universe / galaxies: clusters: general / catalogs

© The Authors 2023

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model.

1 Introduction

Clusters of galaxies are powerful probes for constraining cosmological models. In fact, since they are the largest and most massive gravitationally bound systems in the Universe, their abundances probe the growth history of structures (e.g., Allen et al. 2011). Future large-scale surveys, such as the Dark Energy Survey¹ (Abbott et al. 2018), the Dark Energy Spectroscopic Instrument² (Dey et al. 2019), the Vera C. Rubin Observatory³ (formerly Large Synoptic Survey Telescope, Kahn 2018), the Euclid satellite⁴ (Laureijs et al. 2011), and the Nancy Grace Roman Space Telescope (Eifler et al. 2021), will use large cluster samples as cosmological probes. Thus, they will require the development of fast and efficient cluster detection algorithms. These surveys will reach unprecedented depths and will require highly complete and pure cluster catalogs, with a well-defined selection function.

Cluster detection algorithms have been developed by the astronomical community at different wavelengths. The detection in optical and near-infrared bandpasses is mainly based on the search for spatial overdensities of a given class (quiescent, line-emitters, massive, etc.) of galaxies (e.g., Gladders & Yee 2005; Knobel et al. 2009; Muzzin et al. 2012; Sobral et al. 2010; Bayliss et al. 2011; Rykoff et al. 2014; Wylezalek et al. 2013, 2014), while detections in the X-rays and submillimeter rely on the assumption of model profiles that fit the data, for instance, a characteristic galaxy cluster luminosity and a radial profile (e.g., Olsen et al. 2007; Grove et al. 2009; Planck Collaboration XXVII 2016; Böhringer et al. 2004; Marriage et al. 2011; Predehl et al. 2021).

In the local Universe and up to z ~ 1, the reference cluster catalog over recent decades has been the all-sky ROSAT (Röntgensatellit) X-ray catalog (Ebeling et al. 1998; Voges et al. 1999; Böhringer et al. 2004). However, over the past ten years, optical and millimeter-wave surveys have provided large cluster samples that have been used to constrain our cosmological model parameters (Rozo et al. 2010; Allen et al. 2011; Hasselfield et al. 2013; Bleem et al. 2015; Planck Collaboration Int. XXVI 2015; de Haan et al. 2016; Costanzi et al. 2021; Chiu et al. 2023). Cluster surveys from the X-ray survey eRosita (extended Röntgen Survey with an Imaging Telescope Array; Merloni et al. 2012), the next-generation cosmic microwave background (CMB) surveys (e.g., Simons Observatory; Ade et al. 2019), and Euclid (e.g., Ascaso et al. 2015; Euclid Collaboration 2019), as well as the Nancy Grace Roman Space Telescope (Eifler et al. 2021) and the Rubin Observatory Legacy Survey of Space and Time⁵ (LSST; e.g., Ivezic et al. 2019) will extend these cluster samples to lower masses and higher redshifts. In the high-redshift Universe (z ≳ 1.5), cluster detection will be mainly performed by optical and infrared surveys, combined with radio and far-infrared observations. At these epochs, clusters have been predicted to be less massive (Chiang et al. 2013). This means that the X-ray and the SZ (Sunyaev-Zel’dovich) effect signals are fainter (e.g., Ascaso et al. 2017), limiting the performance of X-ray and SZ cluster detection algorithms.

The use of deep machine learning (ML) algorithms in various areas of astrophysics has been rising for the past decade, with applications ranging from the analysis of galaxy surveys (see, e.g., Huertas-Company & Lanusse 2023, for a recent review) and photometric redshift estimations (see, e.g., Henghes et al. 2021, for a recent review) to dark matter map reconstructions (e.g., Jeffrey et al. 2020). In particular, convolutional neural networks (CNN) have proved especially useful in object detection and characterization (e.g., Huertas-Company et al. 2015, 2018; Dimauro et al. 2018; Pasquet et al. 2019; Zanisi et al. 2021; Euclid Collaboration 2022, 2023a,b; Davidzon et al. 2022), as well as galaxy cluster detections (e.g., Chan & Stott 2019; Bonjean 2020; Hurier et al. 2021; Lin et al. 2021).

Many object detection algorithms have been developed in the field of deep ML (see the recent reviews of Zou et al. 2019 and Zaidi et al. 2021), most of which have not been applied in the field of astrophysics. In this paper, we use as a basis the architecture of the well-known detection-oriented deep machine learning neural network “You only look once” (YOLO, Redmon et al. 2015; Redmon & Farhadi 2016) to detect clusters of galaxies in the Sloan Digital Sky Survey (SDSS⁶) and assess its efficiency. The YOLO algorithm has been developed for a very wide range of real-life situations, for example, for face detection, the analysis of medical images, and self-driving cars. Its latest implementations, among which YOLOv3 developed by Redmon & Farhadi (2018), are particularly efficient for multiple object detection and well-adapted to cluster detection.

Our YOLO network adapted for the detection of galaxy clusters, which we call YOLO for CLuster detection (YOLO–CL), performs well with respect to traditional cluster detection algorithms in obtaining dependable cluster catalogs with high levels of completeness and purity. Our YOLO–CL cluster catalogs have a purity of 95–98% on blank fields and a completeness of ~98% for X-ray detected clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6, and of ~100% for clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.3 ≲ z ≲ 0.6. Our selection function is flat as a function of redshift, when considering X-ray mean surface brightness.

In Sect. 2, we describe the data and the catalog used for the training and validation of our network. Section 3 presents our network implementation and how we build our cluster catalog. In Sect. 4, we compare our results with the training cluster catalog and an X-ray cluster catalog. We discuss and summarize our results in Sects. 5 and 6, respectively.

2 Observational dataset

For the past two decades, the SDSS has been the largest imaging and spectroscopic survey of the local Universe (York et al. 2000). It uses a dedicated 2.5-m wide field-of-view optical telescope, located at the Apache Point Observatory, and has provided astronomers with a tremendous amount of data. This wealth of data has consistently yielded cosmological constraints via the various SDSS Data Releases (DR), so far culminating in the 17th data release (DR17, Abdurro’uf et al. 2022).

To train and test the application of YOLO to cluster detection, we focus on the most complete and pure SDSS cluster catalog (see also Sect. 5), namely, the red-sequence Matched-filter Probabilistic Percolation (redMaPPer) DR8 (Data Release 8) catalog from Rykoff et al. (2014). The redMaPPer algorithm is a red-sequence cluster finder specifically designed for large photometric surveys. It was applied to the ~10 000 square degrees of the SDSS DR8 data release, yielding a catalog⁷ of 26 111 clusters over the redshift range z ∈ [0.08, 0.55]. With respect to the Meta-Catalog of X-Ray Detected Clusters of Galaxies (MCXC, Piffaretti et al. 2011), the redMaPPer catalog was found to be 100% complete up to z = 0.35 above the X-ray temperature T_X ≳ 3.5 keV and luminosity L_X ≳ 2 × 10⁴⁴ erg s⁻¹, decreasing to 90% completeness at L_X ~ 10⁴³ erg s⁻¹. Overall, 86% of the redMaPPer clusters are correctly centered with respect to their X-ray centers (Rykoff et al. 2014). All redMaPPer rich clusters (λ > 100) are detected in the X-ray ROSAT All Sky Survey (Voges et al. 1999).

To train and validate our network, we excluded clusters at redshifts z < 0.2, because they cover regions of the sky that are larger than the cutout images that we adopted in this work for computational efficiency. Ultimately, we worked with a final sample of 24 406 clusters, whose distribution is shown in Fig. 1. For each cluster, the algorithm provides its position, the richness λ⁸ as a proxy for cluster mass, and a list of cluster members (Rozo & Rykoff 2014).

For the network training and validation, we retrieved JPEG versions of the original SDSS DR16 raw images for each redMaPPer cluster, and constructed color images using the ImgCutout web service⁹ by querying the SDSS Catalog Archive Server databases. These images are based on the g, r, and i-band FITS corrected frame files from the Science Archive Server, and the color images were built using the conversion algorithm¹⁰ based on Lupton et al. (2004). These three bandpasses are sufficient to identify passive early-type galaxies in clusters at z < 1. Figure 2 shows an example of such cut-out images.

Fig. 1

redMaPPer sample of 24 406 clusters used to train and validate our network. Top: sky map of the positions of the redMaPPer clusters in celestial coordinates, where the color indicates the photometric redshift of the cluster as estimated by the redMaPPer algorithm. Bottom: training and validation redMaPPer sample redshift (left) and richness (right) distribution.

3 `YOLO–CL`: our `YOLO` network for galaxy cluster detections

3.1 The `YOLO` network

The state-of-the-art deep convolutional network You only look once (YOLO, Redmon et al. 2015) offers real-time object detection. Competing architectures in the ML literature tend to first apply a “localizer” network on a given image, at multiple locations and on multiple scales, and assign a detection probability. The high-probability regions of the image are considered as detections, and are then classified using a separate network. The YOLO architecture uses a different approach: it applies a single neural network to the full image, combining the detection and classification into a single process. This gives the network several advantages over classifier-based systems, because its predictions take into account the global context of the image. It also has the advantage of making predictions with a single network evaluation, unlike systems such as Region Based Convolutional Neural Networks (R-CNN, Girshick et al. 2013, and following iterations Fast and Faster R-CNN), which require thousands of evaluations for a single image. This can make YOLO execution times several orders of magnitude faster than those of R-CNN and Fast R-CNN.

In practice, the network divides the image into an S × S grid of regions (or cells), within which the detection and classification are performed. YOLO predicts B bounding boxes¹¹ per region, with their associated “objectness” probability (i.e., how confident we are that there is an object in the box) and “class probabilities” (i.e., for a set of classes, the respective probabilities that the potential object belongs to them). Both B and S are hyperparameters of the network and can be adjusted by the user.

The predicted bounding boxes and their associated probabilities are returned by the network in the following format: $\left(x, y, w, h, C, p(c_1), \ldots, p(c_n)\right),$ (1)

where (x, y) are the coordinates of the box center, w and h its width and height, C the objectness, and p(c₁), …, p(c_n) are the probabilities (summing to one) that the object in the box belongs respectively to the classes c₁, …, c_n.
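As an illustration of the format of Eq. (1), the following minimal sketch unpacks a raw prediction array into box parameters, objectness, and class probabilities. The array layout `(S, S, B, 5 + n)`, the function name `unpack_predictions`, and the toy values are assumptions made for this example; they are not taken from any specific YOLO implementation.

```python
import numpy as np

def unpack_predictions(pred):
    """Split an (S, S, B, 5 + n) array into the fields of Eq. (1).

    Assumed layout of the last axis: x, y, w, h, C, p(c_1), ..., p(c_n).
    """
    boxes = pred[..., 0:4]        # (x, y, w, h) for every cell and box
    objectness = pred[..., 4]     # C
    class_probs = pred[..., 5:]   # p(c_1), ..., p(c_n)
    return boxes, objectness, class_probs

# Toy example: S = 4 grid, B = 3 boxes per cell, n = 2 classes.
S, B, n = 4, 3, 2
rng = np.random.default_rng(0)
pred = rng.random((S, S, B, 5 + n))
# Normalise the class probabilities so that they sum to one, as in the text.
pred[..., 5:] /= pred[..., 5:].sum(axis=-1, keepdims=True)

boxes, objectness, class_probs = unpack_predictions(pred)
print(boxes.shape, objectness.shape, class_probs.shape)
```

With a single object class, as adopted later for YOLO–CL, the class-probability slice carries no information and can be dropped.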

When training a YOLO network using a set of images (with their associated “true” bounding boxes), we optimize the following multi-part loss function ℒ (Redmon et al. 2015): $\mathcal{L} = \mathcal{L}_{\rm bbox} + \mathcal{L}_{\rm obj} + \mathcal{L}_{\rm class}.$ (2)

The first term of Eq. (2) is the “bounding box loss” as follows: $\begin{aligned} \mathcal{L}_{\rm bbox} = {} & \alpha_{\rm coord} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\rm obj} \left[ \left(x_i - \hat{x}_i\right)^2 + \left(y_i - \hat{y}_i\right)^2 \right] \\ & + \alpha_{\rm coord} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\rm obj} \left[ \left(\sqrt{w_i} - \sqrt{\hat{w}_i}\right)^2 + \left(\sqrt{h_i} - \sqrt{\hat{h}_i}\right)^2 \right], \end{aligned}$ (3)

where the (x, y) coordinates represent the center of the box relative to the bounds of the grid cell and w and h are the width and height of the box. The symbol $\mathbb{1}_{i}^{\rm obj}$ denotes whether an object appears in cell i and $\mathbb{1}_{ij}^{\rm obj}$ denotes that the jth bounding box predictor in cell i is “responsible” for that prediction. In these equations (and those below), the variables with a hat over them are the “true values” that the network is learning.

The second term is the “objectness loss” as follows: $\mathcal{L}_{\rm obj} = \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\rm obj} \left(C_i - \hat{C}_i\right)^2 + \alpha_{\rm noobj} \sum_{i=0}^{S^2} \sum_{j=0}^{B} \mathbb{1}_{ij}^{\rm noobj} \left(C_i - \hat{C}_i\right)^2,$ (4)

where C is the objectness defined in Eq. (1). Finally, the last term represents the “classification loss” as follows: $\mathcal{L}_{\rm class} = \sum_{i=0}^{S^2} \mathbb{1}_{i}^{\rm obj} \sum_{c \in {\rm classes}} \left(p_i(c) - \hat{p}_i(c)\right)^2,$ (5)

where p_i(c) is the probability that the object in cell i belongs to class c. The α_coord and the α_noobj coefficients appearing in the previous formulas can be changed to give more weight to certain components of the total loss. We chose to set α_coord = α_noobj = 1.
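To make the structure of Eqs. (2)–(5) concrete, the following NumPy sketch evaluates the three loss terms on small arrays. It is a schematic illustration under stated assumptions, not the training code of the network: the array layout `(S*S, B, 5 + n)`, the function name `yolo_loss`, and the choice of reading the per-cell class probabilities from the first box of each cell are all assumptions made here.

```python
import numpy as np

def yolo_loss(pred, true, obj_mask, alpha_coord=1.0, alpha_noobj=1.0):
    """Sum-of-squares loss following Eqs. (2)-(5).

    pred, true : arrays of shape (S*S, B, 5 + n) with the layout of Eq. (1);
                 `true` holds the hatted (ground-truth) quantities.
                 Widths and heights must be non-negative.
    obj_mask   : boolean array of shape (S*S, B); True where box j of cell i
                 is responsible for an object.
    """
    obj = obj_mask.astype(float)
    noobj = 1.0 - obj

    # Eq. (3): centre term plus square-root width/height term.
    d_xy = ((pred[..., 0:2] - true[..., 0:2]) ** 2).sum(axis=-1)
    d_wh = ((np.sqrt(pred[..., 2:4]) - np.sqrt(true[..., 2:4])) ** 2).sum(axis=-1)
    l_bbox = alpha_coord * (obj * (d_xy + d_wh)).sum()

    # Eq. (4): objectness term for boxes with and without objects.
    d_c = (pred[..., 4] - true[..., 4]) ** 2
    l_obj = (obj * d_c).sum() + alpha_noobj * (noobj * d_c).sum()

    # Eq. (5): class term, counted once per cell that contains an object.
    # Assumption: class probabilities are shared by all boxes of a cell,
    # so they are read from the first box.
    cell_has_obj = obj_mask.any(axis=1).astype(float)
    d_p = ((pred[:, 0, 5:] - true[:, 0, 5:]) ** 2).sum(axis=-1)
    l_class = (cell_has_obj * d_p).sum()

    return l_bbox + l_obj + l_class

# One cell, one box, one class: only the x coordinate is off by 0.1.
true = np.array([[[0.5, 0.5, 0.25, 0.25, 1.0, 1.0]]])
pred = true.copy()
pred[0, 0, 0] = 0.6
mask = np.array([[True]])
print(yolo_loss(pred, true, mask))
```

In this toy case only the first bracket of Eq. (3) contributes, so the loss is the squared offset of the x coordinate.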

The first version of the YOLO network used a Darknet-19 neural network architecture, which contains 19 layers, as the feature extractor. Its second version (YOLO9000, Redmon & Farhadi 2016) added 11 more layers to Darknet-19, reaching a total of 30. These first architectures had difficulty detecting small objects because of the coarseness of their S × S grid.

We base our cluster detection network on the third iteration of YOLO, YOLOv3 (Redmon & Farhadi 2018), which represents a significant improvement over the first two versions. Several other YOLO versions were developed afterwards, but we will consider their application only in future work. In YOLOv3, the feature extractor was replaced by Darknet-53, which uses residual connections. The new extractor uses 53 convolution layers, with consecutive 3 × 3 and 1 × 1 convolution layers followed by a skip connection (introduced by ResNet, He et al. 2015) that helps the activations propagate through deeper layers without vanishing gradients. With 53 additional layers for the detection, YOLOv3 totals 106 fully convolutional layers. Its larger size makes it slower than previous iterations, but significantly enhances its accuracy.

Moreover, YOLOv3 introduces a multi-scale feature in the detection process. In practice, instead of producing a single “feature map” (to be fed to the detection part of the network) at a single S × S resolution, Darknet-53 produces three different maps at three different levels of resolution. The underlying idea here is to provide the detection network with feature maps at three different scales: the coarser the map, the bigger the objects that it will detect.

YOLO networks have been used in applications in astrophysics to detect galaxies and other sources (González et al. 2018; He et al. 2021), and astrophysical transients (Li et al. 2022).

Fig. 2

SDSS image cutout of a redMaPPer cluster in our sample. The yellow box corresponds to the minimal rectangle encompassing all redMaPPer cluster members, which is the box used to train YOLO–CL. In cyan, the box detected by our network YOLO–CL, with the associated confidence level in the top left corner. The image size is 13.5 × 13.5 arcmin² and the pixel size is 0.396 arcsec.
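The training box described in this caption, the minimal rectangle encompassing all cluster members, can be sketched as follows. The function name `enclosing_box` and the toy member positions are hypothetical; the sketch assumes member positions already converted to pixel coordinates and returns the box in the (x, y, w, h) format of Eq. (1).

```python
def enclosing_box(member_xy):
    """Smallest axis-aligned rectangle containing all (x, y) member positions.

    Returns (x_center, y_center, width, height) in the same pixel units.
    """
    xs = [x for x, _ in member_xy]
    ys = [y for _, y in member_xy]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)
    return ((x_min + x_max) / 2, (y_min + y_max) / 2,
            x_max - x_min, y_max - y_min)

# Toy member positions in pixels (invented for illustration).
members = [(10, 20), (30, 40), (20, 25)]
box = enclosing_box(members)
print(box)
```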

3.2 Network optimization for galaxy cluster detection

3.2.1 Modifications to the original `YOLOv3` network

To optimize YOLOv3 for galaxy cluster detection, we applied several modifications to its standard architecture, and we call our new implementation YOLO–CL (YOLO for CLuster detection). YOLO–CL is based on a TensorFlow implementation of the YOLO architecture¹². The network was trained on a NVIDIA Tesla V100-SXM2-32GB GPU, equipped with 32 GB of memory.

Our first modification is the definition of a single object class, clusters, instead of the multiple classes of the original network. We therefore removed the ℒ_class term, defined in Eq. (5), from the original loss function of Eq. (2). This results in fewer network weights, which leads to a lighter, faster, and easier-to-train network.

Then, we replaced the standard YOLO bounding box loss of Eq. (3) with the so-called generalized Intersection over Union (gIoU) loss of Rezatofighi et al. (2019). One of the main weaknesses of the traditional IoU metric is that it has a plateau (equal to 0) when the true and predicted bounding boxes do not overlap, which makes it impossible to optimize the corresponding loss term because of the vanishing gradient. The gIoU addresses this weakness by amending the IoU as follows: $\mathrm{gIoU} = \mathrm{IoU} + \frac{\mathcal{U}}{\mathcal{A}_c} - 1.$ (6)

The IoU is the ratio of the area of the intersection of the two boxes to the area of their union. A value of 1 corresponds to a perfect agreement, while a value that tends towards 0 indicates increasingly disjointed boxes and/or significantly different sizes. 𝒰 and 𝒜_c are respectively the area of the union of the two boxes and the area of the smallest box enclosing both boxes. This allows the gIoU to extend smoothly into negative values for boxes that are disjointed, and to tend towards −1 for more and more distant boxes (as IoU = 0 and 𝒰 ≪ 𝒜_c). The gIoU was shown to yield better performance for multiple object detection compared to the standard bounding box metrics.
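The behavior of Eq. (6) can be checked with a short sketch. For simplicity, the boxes are given here by their corner coordinates `(x_min, y_min, x_max, y_max)` rather than by the center, width, and height of Eq. (1); this convention and the function name `giou` are choices made for this example. Boxes are assumed to have non-zero area.

```python
def giou(box_a, box_b):
    """Generalised IoU of two boxes given as (x_min, y_min, x_max, y_max)."""
    ax0, ay0, ax1, ay1 = box_a
    bx0, by0, bx1, by1 = box_b

    area_a = (ax1 - ax0) * (ay1 - ay0)
    area_b = (bx1 - bx0) * (by1 - by0)

    # Intersection area (zero if the boxes do not overlap).
    iw = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    ih = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = iw * ih

    union = area_a + area_b - inter  # U in Eq. (6)
    # Area of the smallest box enclosing both boxes: A_c in Eq. (6).
    enclosing = (max(ax1, bx1) - min(ax0, bx0)) * (max(ay1, by1) - min(ay0, by0))

    iou = inter / union
    return iou + union / enclosing - 1.0

print(giou((0, 0, 1, 1), (0, 0, 1, 1)))      # identical boxes
print(giou((0, 0, 1, 1), (2, 0, 3, 1)))      # disjoint, close
print(giou((0, 0, 1, 1), (100, 0, 101, 1)))  # disjoint, distant
```

Identical boxes give a gIoU of 1, while the two disjoint pairs give negative values that approach −1 as the separation grows, which is the property that keeps the gradient informative where the plain IoU is flat.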

Table 1

Settings used for the YOLO–CL training.

3.2.2 Hyperparameter optimization

Our hyperparameters were tested for best performance in optimizing the completeness and purity of our final cluster catalog. Completeness and purity are the two parameters that characterize a galaxy cluster catalog and the algorithm selection function as a function of the cluster redshift and physical properties (such as mass, richness, X-ray luminosity, etc.). The catalog completeness quantifies the fraction of true clusters that are detected by the algorithm. The catalog purity quantifies the fraction of detected clusters that are true, instead of false detections. In machine learning literature, the completeness corresponds to the recall, and the purity to the precision.
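The two definitions above translate directly into counts of detections. The sketch below uses invented numbers purely for illustration; the function name `completeness_purity` is hypothetical.

```python
def completeness_purity(n_true_detected, n_true_total, n_false_detections):
    """Completeness (recall) and purity (precision) from detection counts."""
    completeness = n_true_detected / n_true_total
    purity = n_true_detected / (n_true_detected + n_false_detections)
    return completeness, purity

# Illustrative numbers only: 950 of 1000 true clusters recovered,
# together with 50 false detections.
completeness, purity = completeness_purity(950, 1000, 50)
print(completeness, purity)
```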

The dimension of the first network layer sets an upper limit for the amount of information that is used by the network, while also defining the size and complexity of the architecture. We start with SDSS images with size 2048 × 2048 pixels, which corresponds to ~13.5 × 13.5 arcmin² images and to twice a typical cluster virial radius of 1 Mpc at z = 0.3, the SDSS average redshift. We resized each image to the dimensions of the network's first layer by average pooling. In order to explore a trade-off between performance and precision, we consider two different input layer sizes, namely, 1024 × 1024 and 512 × 512 pixels. We keep the same stride parameters as in the original YOLOv3 publication, namely 8, 16, and 32.
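The resizing and the resulting detection grids can be illustrated with a short sketch. It assumes average pooling over non-overlapping blocks and a native pixel size of 0.396 arcsec (the value quoted in the caption of Fig. 2); the function names `average_pool` and `grid_sizes` are hypothetical, and the actual resizing code of the network may differ.

```python
import numpy as np

def average_pool(image, out_size):
    """Downsample a square (N, N, channels) image to (out_size, out_size, channels)
    by averaging non-overlapping blocks. N must be a multiple of out_size."""
    n = image.shape[0]
    if image.shape[1] != n or n % out_size != 0:
        raise ValueError("image must be square with N a multiple of out_size")
    f = n // out_size
    return image.reshape(out_size, f, out_size, f, -1).mean(axis=(1, 3))

def grid_sizes(input_size, strides=(8, 16, 32)):
    """Side length of the detection grid at each stride."""
    return [input_size // s for s in strides]

NATIVE_SIZE = 2048           # pixels per side of the original cutout
NATIVE_PIXEL_ARCSEC = 0.396  # native pixel size, from the Fig. 2 caption

for size in (512, 1024):
    factor = NATIVE_SIZE // size
    print(size, grid_sizes(size), NATIVE_PIXEL_ARCSEC * factor)

# Small check of the pooling on a 4 x 4 single-channel image.
small = np.arange(16, dtype=float).reshape(4, 4, 1)
pooled = average_pool(small, 2)
```

Under these assumptions, the 512 × 512 input yields grids of 64, 32, and 16 cells per side with an effective pixel size of about 1.58 arcsec, and the 1024 × 1024 input yields grids of 128, 64, and 32 cells per side with about 0.79 arcsec per pixel.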

Multiple detections of the same object are discarded by applying a gIoU threshold of 0.5, which is similar to the Intersection-Over-Union (IoU) threshold in the original YOLO and YOLOv3 publications (Redmon et al. 2015; Redmon & Farhadi 2018). The IoU and gIoU measure the region of overlap of the bounding boxes that define two different detections (see their definition in Sect. 3.2.1). The choice of a threshold of 0.5 means that when the gIoU of two bounding boxes exceeds 0.5, we consider that they define the same object; in that case, we keep the highest-probability detection and discard the other. If the gIoU is smaller than 0.5, we consider that the two objects are different. Table 1 shows the settings used for the network training.
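The duplicate removal described above can be sketched as a greedy suppression loop. The greedy ordering by decreasing probability is the standard non-maximum suppression scheme and is an assumption here, as are the function names and the corner-based box convention `(x_min, y_min, x_max, y_max)`; the text only specifies the gIoU threshold of 0.5 and that the highest-probability detection is kept.

```python
def giou(a, b):
    """Generalised IoU of two boxes given as (x_min, y_min, x_max, y_max)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    enclosing = (max(a[2], b[2]) - min(a[0], b[0])) * (max(a[3], b[3]) - min(a[1], b[1]))
    return inter / union + union / enclosing - 1.0

def suppress_duplicates(detections, threshold=0.5):
    """Keep the most probable detection of each group of overlapping boxes.

    detections : list of (probability, box) tuples.
    """
    kept = []
    # Visit detections from the most to the least probable.
    for prob, box in sorted(detections, key=lambda d: d[0], reverse=True):
        # Keep a box only if it is not a duplicate of any box already kept.
        if all(giou(box, kept_box) <= threshold for _, kept_box in kept):
            kept.append((prob, box))
    return kept

detections = [
    (0.9, (0, 0, 10, 10)),
    (0.8, (1, 1, 11, 11)),    # strongly overlaps the first box
    (0.7, (50, 50, 60, 60)),  # far from the other two
]
kept = suppress_duplicates(detections)
print([prob for prob, _ in kept])
```

In this toy example the second box has a gIoU of about 0.66 with the first one and is therefore discarded, while the distant third box is kept.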

3.3 Training and validation

We trained YOLO–CL with about half (~ 12 000) of our selected redMaPPer cluster images and the same number of random SDSS field images of the same size. The training sample is split into subsets (batches) that are simultaneously processed by the network. Our original SDSS color image cutouts are centered on the redMaPPer cluster positions. This original centering does not have an impact on cluster detection, which does not depend on the position of the cluster in the image. In fact, when we train the network, we apply data augmentation to the original cutouts, including translation, flipping, and cropping, which change the initial cluster position in the image. This prevents the network from learning that an object at the center of the image belongs to the “cluster” class and, instead, forces it to focus on the relevant features associated with clusters.
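As an illustration of why augmentation removes the centering of the cluster, the sketch below applies a horizontal flip and an integer translation to an image and updates the bounding box accordingly. The function names, the corner-based box convention `(x_min, y_min, x_max, y_max)` in pixel coordinates, and the zero filling of uncovered pixels are assumptions made for this example, not a description of the augmentation code used for the training.

```python
import numpy as np

def flip_horizontal(image, box):
    """Mirror the image left-right and update the box accordingly."""
    width = image.shape[1]
    x0, y0, x1, y1 = box
    return image[:, ::-1], (width - x1, y0, width - x0, y1)

def translate(image, box, dx, dy):
    """Shift the image by integer (dx, dy) pixels, filling uncovered pixels
    with zeros, and shift (and clip) the box by the same amount."""
    h, w = image.shape[:2]
    if abs(dx) >= w or abs(dy) >= h:
        raise ValueError("shift must be smaller than the image size")
    shifted = np.zeros_like(image)
    src_x, dst_x = slice(max(0, -dx), min(w, w - dx)), slice(max(0, dx), min(w, w + dx))
    src_y, dst_y = slice(max(0, -dy), min(h, h - dy)), slice(max(0, dy), min(h, h + dy))
    shifted[dst_y, dst_x] = image[src_y, src_x]
    x0, y0, x1, y1 = box
    new_box = (min(max(x0 + dx, 0), w), min(max(y0 + dy, 0), h),
               min(max(x1 + dx, 0), w), min(max(y1 + dy, 0), h))
    return shifted, new_box

# Toy image with a single bright pixel at row 1, column 1.
image = np.zeros((4, 6))
image[1, 1] = 1.0
box = (1, 1, 2, 2)

flipped, flipped_box = flip_horizontal(image, box)
shifted, shifted_box = translate(image, box, dx=2, dy=1)
print(flipped_box, shifted_box)
```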

Our validation sample consists of the remaining cluster images and an equivalent number of random field images. We calculate the validation loss at the end of each training epoch. We started by setting a learning rate of 10⁻⁸, which grows slowly to 10⁻⁴ during the four warm-up epochs and then slowly decreases to 10⁻⁶.
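A learning rate schedule with these three values can be sketched as follows. The text specifies only the starting value, the peak reached after four warm-up epochs, and the final value; the linear warm-up, the cosine decay, the total number of epochs, and the function name `learning_rate` are assumptions made for this illustration.

```python
import math

def learning_rate(epoch, total_epochs, warmup_epochs=4,
                  lr_start=1e-8, lr_peak=1e-4, lr_end=1e-6):
    """Learning rate at a given (possibly fractional) epoch.

    Assumed shape: linear warm-up from lr_start to lr_peak, followed by a
    cosine decay from lr_peak to lr_end at the last epoch.
    """
    if total_epochs <= warmup_epochs:
        raise ValueError("total_epochs must exceed warmup_epochs")
    if epoch < warmup_epochs:
        return lr_start + (lr_peak - lr_start) * epoch / warmup_epochs
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    return lr_end + 0.5 * (lr_peak - lr_end) * (1.0 + math.cos(math.pi * progress))

# Hypothetical run of 100 epochs, sampled at a few epochs.
for epoch in (0, 2, 4, 50, 100):
    print(epoch, learning_rate(epoch, total_epochs=100))
```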

Figure 3 shows the loss function for the training and validation sets for initial input images of size 512 × 512 pixels and 1024 × 1024 pixels. In both cases, there is a good agreement between the training and the validation loss function, excluding significant overfitting and confirming our network stability.

At each epoch, the network output is a catalog of detection positions, bounding boxes, and their probability of being a cluster (hereafter, the detection probability). To build a sample of detected cluster candidates, we had to apply a probability cut by choosing only detections with the probability of belonging to the class “cluster” (see Eq. (5)) that are higher than a given threshold. To define which epoch and which detection probability threshold to apply for our final cluster sample, we evaluated the performance of our network with respect to our YOLO–CL catalog completeness and purity for each epoch and then chose the epoch in which we obtain the largest completeness and purity. In this optimization, the redMaPPer cluster catalog is the true one.

Figure 4 shows the completeness and purity of the detected cluster candidate sample as a function of the detection threshold when using the 512 × 512 and the 1024 × 1024 resampled SDSS images, in each case for the epoch at which we obtain the highest completeness and purity. This leads to the selection of different cluster samples for different resolutions. With the 512 × 512 resolution, we miss 982 clusters found with the 1024 × 1024 resolution (~8% of the validation sample) and with the 1024 × 1024 resolution we miss 280 clusters found with the 512 × 512 resolution (~2% of the validation sample). In both cases, we build our final detection catalog by choosing the threshold that optimizes both completeness and purity at the same time and that corresponds to the intersection of the completeness and purity curves in the figure. When using the 512 × 512 and the 1024 × 1024 resampled SDSS images, we obtain a completeness and purity of 95 and 98%, respectively, at the optimal threshold of 93 and 60%, respectively. Our YOLO–CL final detection catalog (hereafter, the cluster catalog, with the caveat that the final detections are not confirmed galaxy clusters but cluster candidates with a given probability of being a cluster) includes the detections obtained when applying the above optimal thresholds.
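The choice of the threshold at the intersection of the two curves can be sketched as follows. The curves used here are invented linear toy functions, not the measured ones, and the function name `crossing_threshold` is hypothetical; on a sampled grid, the intersection is approximated by the threshold at which the two curves are closest.

```python
import numpy as np

def crossing_threshold(thresholds, completeness, purity):
    """Threshold at which the completeness and purity curves are closest."""
    thresholds = np.asarray(thresholds, dtype=float)
    diff = np.abs(np.asarray(completeness, dtype=float)
                  - np.asarray(purity, dtype=float))
    return thresholds[np.argmin(diff)]

# Toy curves: completeness falls and purity rises with the threshold.
thresholds = np.linspace(0.0, 1.0, 11)
completeness = 1.0 - 0.1 * thresholds
purity = 0.9 + 0.1 * thresholds

best = crossing_threshold(thresholds, completeness, purity)
print(best)
```

For these toy curves the two quantities are both equal to 0.95 at a threshold of 0.5, which is the value selected.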

Fig. 3

Mean training (blue) and validation (orange) loss for YOLO–CL when using 512 × 512 (left), and 1024 × 1024 (right) resampled SDSS images. The vertical bars show the 1σ standard deviation of the validation loss. The training and validation loss functions converge in a smooth way. The good agreement between training and validation loss excludes significant overfitting and confirms the network stability in both cases.

Fig. 4

YOLO–CL cluster catalog completeness and purity as a function of the detection threshold when using 512 × 512 (left), and 1024 × 1024 (right) resampled SDSS images. When using the 512 × 512 and the 1024 × 1024 resampled SDSS images, we obtain a completeness and purity of 95 and 98%, respectively, at the optimal (see text) threshold of 93 and 60%, respectively. Overall, YOLO–CL has a very good performance in the redshift range covered by redMaPPer.

4 Comparison to redMaPPer and X-ray cluster detections

In this section, we compare our YOLO–CL cluster catalog with our original redMaPPer catalog and X-ray cluster detections in the SDSS footprint.

4.1 Comparison to redMaPPer detections

Figures 5 and 6 show the fraction of redMaPPer clusters detected by YOLO–CL (i.e., the YOLO–CL completeness with respect to redMaPPer) as a function of redshift and richness, respectively, when using the 512 × 512 and the 1024 × 1024 resampled SDSS images. When using 512 × 512 resampled SDSS images, YOLO–CL detects ~98% of the redMaPPer clusters at z ≳ 0.3 and λ ≳ 40 (corresponding to M₂₀₀ ~ 10^14.3h⁻¹M_⊙ from Simet et al. 2017). For lower redshift (0.2 < z ≲ 0.3) and richness (20 < λ ≲ 40), the completeness is still very high at ~92–93%. When using 1024 × 1024 resampled SDSS images, the performance of the network as a function of richness is similar, with a completeness of ~98%; however, the completeness as a function of redshift is different. At z ≲ 0.5, YOLO–CL detects > 98% of the redMaPPer clusters, and at higher redshift the completeness drops to ~92%. This is synthesized in Fig. 7, which shows the YOLO–CL sample completeness as a function of both redshift and richness.

Figure 8 shows the angular distance between the YOLO–CL cluster centers¹³ and redMaPPer cluster centers. The median distance between our cluster centers and those of redMaPPer is ~5.6 ± 2.9 kpc, which is a very accurate recovery of the redMaPPer cluster centers.

When checking whether undetected clusters were found within detected cluster bounding boxes, we found that only ~1% of the redMaPPer clusters missed by YOLO–CL are missed because they lie within the bounding box of another YOLO–CL detection (i.e., because they are superposed on the line of sight). This underlines the efficiency of the network in object separation. The remaining 99% of the missed clusters are either at λ ≲ 40 or z ≳ 0.5, in the regime where redMaPPer detections are less complete because of the SDSS depth (Rozo & Rykoff 2014).

Concerning the false positive detections in the random blank fields (2–5% of the detections), a visual inspection shows galaxy groups, in some cases large and crowded star fields, and, in one case, a globular cluster. Since the redMaPPer catalog completeness decreases rapidly for low redMaPPer richness and low X-ray temperature and luminosity, as well as for clusters at z ≳ 0.4 (Rozo & Rykoff 2014), some of our false positive detections might be clusters or groups with low masses that are not detected by redMaPPer. This is further explored in the next section.

Fig. 5

YOLO–CL completeness with respect to the redMaPPer cluster catalog as a function of redshift, when using the 512 × 512 (left) and the 1024 × 1024 (right) resampled SDSS images. When using 512 × 512 and 1024 × 1024 resampled SDSS images, YOLO–CL reaches a completeness of ≳98% for z ≳ 0.3 and z ≲ 0.4, respectively. In the other redshift ranges, the completeness is ~92–94%. Overall, YOLO–CL performs very well in the redshift range covered by redMaPPer.

Fig. 6

YOLO–CL completeness with respect to the redMaPPer cluster catalog as a function of the redMaPPer richness, when using the 512 × 512 (left) and the 1024 × 1024 (right) resampled SDSS images. When using 512 × 512 and 1024 × 1024 resampled SDSS images, YOLO–CL reaches a completeness of ~98% for λ ≳ 40. At lower richness, the completeness is ~92–94%. Overall, YOLO–CL performs very well in the richness range covered by redMaPPer.

Fig. 7

YOLO–CL cluster sample completeness for redMaPPer detections for the images resampled to 512 × 512 (left) and to 1024 × 1024 (right) as a function of both richness and redshift. This figure synthesizes the conclusions of Figs. 5 and 6 as a function of both variables. On the right of each figure is the completeness scale.

4.2 Comparison to the MCXC2021 X-ray catalog

To validate the performance of YOLO–CL on a cluster sample independent of redMaPPer, we applied our network to galaxy clusters detected by X-ray emission and published in the MCXC2021 catalog¹⁴, the updated version of the MCXC catalog (Piffaretti et al. 2011). X-ray detections confirm cluster detections as virialized dark matter haloes by their hot gas emission. We expect that this is not a mass-selected sample because X-ray-selected samples are biased towards relaxed cool core clusters (e.g., Rossetti et al. 2016).

We built SDSS g, r, and i-band color images for all MCXC2021 clusters in the SDSS footprint (927/1841 clusters) by generating 2048 × 2048 pixel (13.5 × 13.5 arcmin) cutouts with SkyServer¹⁵, following the same procedure as for redMaPPer clusters. From this sample, we excluded the clusters that are only partially covered by the SDSS footprint. As a comparison, we show the efficiency of redMaPPer on the same cluster sample. To minimize selection bias, for this comparison we use the complete redMaPPer catalog (i.e., without any cut in richness; E. Rykoff & E. Rozo, priv. comm., and Rykoff et al. 2016). We crossmatched the MCXC2021 and redMaPPer catalogs within a range of 0.1 in redshift and a radius of 5 arcmin in position.
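
A positional and redshift crossmatch of this kind can be sketched as follows. This is only an illustration: it assumes that the redshift criterion means |Δz| ≤ 0.1 and that the nearest candidate within 5 arcmin is kept, neither of which is specified in the text, and the function names and toy coordinates are hypothetical.

```python
import math

def angular_sep_arcmin(ra1, dec1, ra2, dec2):
    """Great-circle separation in arcmin (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(a)))) * 60.0

def crossmatch(reference, candidates, max_sep_arcmin=5.0, max_dz=0.1):
    """For each reference cluster, return the index of the nearest candidate
    inside the separation and redshift windows, or None if there is none.

    Both catalogs are lists of (ra_deg, dec_deg, z) tuples.
    """
    matches = []
    for ra, dec, z in reference:
        best, best_sep = None, max_sep_arcmin
        for j, (ra_c, dec_c, z_c) in enumerate(candidates):
            if abs(z - z_c) > max_dz:
                continue  # outside the assumed redshift window
            sep = angular_sep_arcmin(ra, dec, ra_c, dec_c)
            if sep <= best_sep:
                best, best_sep = j, sep
        matches.append(best)
    return matches

# Toy catalogs (coordinates and redshifts are made up)
xray = [(150.00, 2.00, 0.30), (200.00, 45.00, 0.50)]
optical = [(150.05, 2.00, 0.32), (150.00, 2.01, 0.55), (200.50, 45.00, 0.50)]
matches = crossmatch(xray, optical)
```

The double loop is quadratic in catalog size, which is acceptable for about a thousand X-ray clusters; a spatial index would be preferable for much larger catalogs.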

Figure 9 shows the YOLO–CL and redMaPPer cluster catalog completeness in redshift bins with respect to the MCXC2021 catalog as a function of X-ray luminosity, L_X, derived cluster mass, M₅₀₀, and radius, R₅₀₀¹⁶. YOLO–CL recovers all clusters at L_X ≳ 1–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.75–0.8 Mpc and z ≳ 0.4. At lower luminosity, mass, radius, and redshift, its performance worsens. redMaPPer recovers all clusters at L_X ≳ 3–9 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–6 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.8–1.2 Mpc and z ≳ 0.4.
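
The completeness "above a given" luminosity, mass, or radius is a cumulative quantity: for each threshold, it is the detected fraction of all reference clusters at or above that threshold. A minimal sketch under that reading, with hypothetical names and toy values:

```python
def completeness_above(values, detected, thresholds):
    """Completeness of the subsample with value >= threshold, for each threshold.

    Returns None for thresholds above which no reference cluster remains.
    """
    out = []
    for t in thresholds:
        selected = [d for v, d in zip(values, detected) if v >= t]
        out.append(sum(selected) / len(selected) if selected else None)
    return out

# Toy X-ray luminosities in units of 1e44 erg/s (illustrative values only)
lx = [0.3, 0.8, 1.5, 2.5, 4.0, 9.0]
hit = [False, True, False, True, True, True]
curve = completeness_above(lx, hit, [0.1, 1.0, 2.0, 10.0])
```

In this toy sample the curve reaches unity at the third threshold, which is the kind of limit quoted in the text as "recovers all clusters at L_X ≳ ...".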

To better compare YOLO–CL to redMaPPer, Fig. 10 shows the MCXC2021 cluster detection completeness as a function of surface brightness I_X,500¹⁷ and redshift. The X-ray surface brightness quantifies the X-ray luminosity in a given area, and depends on the cluster luminosity and compactness, combining the information from the cluster L_X and R₅₀₀ shown in Fig. 9.
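
To make the dependence on L_X and R₅₀₀ explicit, the definition of I_X,500 can be written out as below. This is a simplified sketch rather than the exact catalog computation: it assumes bolometric quantities and neglects K-corrections and band conversions, and the symbols F_X,500 (flux within R₅₀₀), θ₅₀₀ (angular radius), D_A, and D_L (angular-diameter and luminosity distances) are introduced here for illustration.

```latex
\begin{align}
I_{X,500} &= \frac{F_{X,500}}{\pi\,\theta_{500}^{2}}, \qquad
F_{X,500} \simeq \frac{L_X}{4\pi D_L^{2}}, \qquad
\theta_{500} = \frac{R_{500}}{D_A}, \\
I_{X,500} &\simeq \frac{L_X}{4\pi^{2} R_{500}^{2}}\,\frac{D_A^{2}}{D_L^{2}}
 = \frac{L_X}{4\pi^{2} R_{500}^{2}\,(1+z)^{4}},
\end{align}
```

where the last step uses D_L = (1+z)² D_A. Under these assumptions, I_X,500 scales as L_X / R₅₀₀² and is subject to the usual (1+z)⁻⁴ cosmological surface brightness dimming; θ₅₀₀ must be converted from radians to arcmin to obtain the units used in the text.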

Our YOLO–CL network detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6, and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at z ≳ 0.3. redMaPPer detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 55 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6, and ~100% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.5 ≲ z ≲ 0.6.

Figure 8 also shows the angular distance between YOLO–CL and MCXC2021 cluster centers. The median distance is 261 ± 327 kpc. This large dispersion is consistent with the positional precision of most of the MCXC2021 clusters, which were detected by ROSAT with a 2.3 arcmin angular resolution. This corresponds to ~600 kpc at the median MCXC2021 cluster redshift of z ~ 0.3 for our sample within the SDSS footprint.
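
The conversion from an angular scale to a projected physical size depends on the adopted cosmology. The sketch below assumes a flat ΛCDM model with H₀ = 70 km s⁻¹ Mpc⁻¹ and Ω_m = 0.3; these values and the function names are assumptions made for illustration and are not necessarily those used in this work.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s

def angular_diameter_distance_mpc(z, h0=70.0, omega_m=0.3, n_steps=1000):
    """Angular-diameter distance in Mpc for a flat LambdaCDM cosmology.

    h0 and omega_m are assumed values, not taken from the paper.
    The comoving-distance integral is evaluated with the trapezoidal rule.
    """
    def inv_e(zp):
        return 1.0 / math.sqrt(omega_m * (1.0 + zp) ** 3 + (1.0 - omega_m))

    dz = z / n_steps
    integral = 0.5 * (inv_e(0.0) + inv_e(z))
    integral += sum(inv_e(i * dz) for i in range(1, n_steps))
    comoving = (C_KM_S / h0) * integral * dz
    return comoving / (1.0 + z)

def arcmin_to_kpc(theta_arcmin, z, **cosmo):
    """Projected physical size in kpc subtended by theta_arcmin at redshift z."""
    theta_rad = math.radians(theta_arcmin / 60.0)
    return angular_diameter_distance_mpc(z, **cosmo) * theta_rad * 1000.0

# ROSAT resolution quoted in the text, at the median sample redshift
size_kpc = arcmin_to_kpc(2.3, 0.3)
```

With the assumed parameters, the result is close to the ~600 kpc quoted above; other reasonable choices of H₀ and Ω_m shift it by a few percent.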

From this comparison, YOLO–CL is more efficient than redMaPPer in detecting MCXC2021 clusters, which means that when optimizing YOLO–CL in terms of completeness and purity we improved cluster detection with respect to our training sample. This means that the network recovers cluster features that redMaPPer does not recover in clusters with lower X-ray surface brightness. It is also interesting that the YOLO–CL selection function is approximately constant with redshift, with respect to the X-ray surface brightness.

Fig. 8

Distribution of the angular distance between cluster centers detected by YOLO–CL and redMaPPer (top panel), and YOLO–CL and the MCXC2021 clusters (bottom panel).

5 Discussion and conclusions

Our deep convolutional network YOLO–CL shows high completeness and purity in detecting galaxy clusters in the SDSS footprint. When compared to the existing redMaPPer catalog, we obtained cluster catalogs with a completeness and purity of 95–98% for our optimal thresholds when using the 512 × 512 and the 1024 × 1024 resampled SDSS images. The X-ray parameter that we found most interesting is the X-ray surface brightness, which defines a clear threshold above which YOLO–CL and redMaPPer are 100% complete. When compared to the MCXC2021 X-ray detected clusters, YOLO–CL detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² and z ≳ 0.3. The lower detection rates for clusters at lower redshift could be explained by their large angular size, which exceeds the size of the image cutouts.

Several other SDSS cluster catalogs have been published, using different methods: the MaxBCG (Koester et al. 2007), WHL09/12 (Wen et al. 2009, 2012), GMBCG (Hao et al. 2010), and AMF (Adaptive Matched Filter cluster finder, Szabo et al. 2011) catalogs. We used the same methodology described for YOLO–CL and redMaPPer in Sect. 4.2 to assess the performance of these methods on the MCXC2021 cluster catalog and we present our results in Figs. 11 and 12. MaxBCG recovers clusters with a ~70% completeness at 0.2 < z < 0.3 in the entire range of L_X, M₅₀₀, and R₅₀₀ that we cover here, with a 100% recovery only for R₅₀₀ ≳ 1.4 Mpc. At higher redshift, the completeness drops to ≲10%. GMBCG recovers clusters with a ~100% completeness only at 0.2 < z < 0.3 and L_X ≳ 10 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 8.5 × 10¹⁴M_⊙, R₅₀₀ ≳ 1.3 Mpc. At a redshift of 0.3 < z < 0.5, its average completeness is 50–70%. AMF recovers clusters with a ~100% completeness at 0.2 < z < 0.3 for L_X ≳ 2 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 4 × 10¹⁴M_⊙, and R₅₀₀ ≳ 1 Mpc. At higher redshift, it recovers clusters with a ~100% completeness at L_X ≳ 10 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 6–8 × 10¹⁴M_⊙, R₅₀₀ ≳ 1–1.3 Mpc. WHL09 has a performance similar to AMF, and WHL12 is the most complete of those traditional methods, with a completeness of ≳80% in the entire L_X, M₅₀₀, and R₅₀₀ ranges at 0.2 ≲ z ≲ 0.6. This last method reaches a completeness of ~100% for L_X ≳ 0.8–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.8–0.9 Mpc at z < 0.5, and for L_X ≳ 10 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 7 × 10¹⁴M_⊙, R₅₀₀ ≳ 1.2 Mpc at 0.5 < z < 0.6.

Figure 12 summarizes the performance of these algorithms. While the MaxBCG, GMBCG, and WHL09 cluster catalogs are much less complete than the redMaPPer and YOLO–CL catalogs, the AMF cluster catalog results are very similar to redMaPPer’s when considering the X-ray surface brightness. WHL12 shows a completeness very similar to YOLO–CL. In fact, 82% of the clusters found by YOLO–CL and not found by redMaPPer are also found in WHL12.
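
An overlap statistic such as the 82% quoted above can be expressed with set operations on the identifiers of the recovered reference clusters. A minimal sketch, with a hypothetical function name and made-up identifiers:

```python
def recovered_elsewhere_fraction(found_a, found_b, found_c):
    """Among objects found by A but not by B, the fraction also found by C.

    Each argument is a set of reference-cluster identifiers.
    Returns None when A finds nothing that B misses.
    """
    only_a = found_a - found_b
    if not only_a:
        return None
    return len(only_a & found_c) / len(only_a)

# Toy identifiers (illustrative only, not real MCXC2021 entries)
yolo = {"c1", "c2", "c3", "c4", "c5", "c6"}
redmapper = {"c1", "c2"}
whl12 = {"c1", "c3", "c4", "c5"}
frac = recovered_elsewhere_fraction(yolo, redmapper, whl12)
```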

This confirms that the redMaPPer catalog that we have used to train our network is very good in terms of the recovery of X-ray cluster detections and underlines the high performance of YOLO–CL with respect to traditional cluster detection methods applied to SDSS. Unfortunately, we cannot complete our comparison using both completeness and purity because the estimates of purity for each method are not homogeneous.

Together with its high performance in terms of completeness and purity, a strong advantage of galaxy cluster detection by deep learning networks is that clusters can be found without the need to measure galaxy photometry and photometric redshifts. In fact, the direct use of color images allows us to skip the preparation of photometric and photometric redshift catalogs, and eliminates the systematic uncertainties that can be introduced during this process. This advantage has been pointed out in Chan & Stott (2019), where the authors introduced, for the first time, the use of deep learning for cluster detection in SDSS with the development of Deep-CEE (Deep Learning for Galaxy Cluster Extraction and Evaluation). Deep-CEE is based on Faster region-based convolutional neural networks, trained on the cluster catalog of Wen et al. (2012) in the redshift range 0.05 ≤ z < 0.8. As a proof of concept, they obtained completeness and purity of ~75 and ~80% when optimizing both on their validation sample and the redMaPPer catalog, respectively.

The use of convolutional networks with color images also allows us to equally focus on the two main aspects of galaxy clusters that are used for detection: (i) the fact that they are galaxy overdensities and (ii) the same distance and redshift (and therefore similar colors) of cluster members. The vast majority of existing detection algorithms focus on one of these aspects more than on the other. Color images preserve the information about both galaxy positions and colors, without assumptions on the significance of the overdensity or galaxy colors.

We conclude that our YOLO–CL network exhibits a higher level of performance in terms of completeness and purity in detecting galaxy clusters when compared to traditional cluster detection algorithms applied to SDSS images and catalogs. A strong advantage of deep learning networks is that clusters can be found without any need for measuring galaxy photometry and photometric redshifts, and without the biases inherent in galaxy detection and in these two measurements.

Fig. 9

YOLO–CL (continuous lines) and redMaPPer (dashed lines) cluster detection completeness above a given X-ray luminosity, L_X (left panel), M₅₀₀ (middle panel), and R₅₀₀ (right panel). YOLO–CL recovers all clusters at L_X ≳ 1–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.75–0.8 Mpc and z ≳ 0.4. At lower luminosity, mass, radius and redshift, its performance worsens. The redMaPPer algorithm recovers all clusters at L_X ≳ 3–9 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–6 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.8–1.2 Mpc and z ≳ 0.4. At high redshifts both YOLO–CL and redMaPPer demonstrate a similar performance that is limited by the SDSS depth.

Fig. 10

YOLO–CL and redMaPPer MCXC2021 cluster detection completeness as a function of redshift and X-ray surface brightness. Left: YOLO–CL detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² and z ≳ 0.3. Right: redMaPPer detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 55 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.5 ≲ z ≲ 0.6. On the right of each figure, we give the completeness scale. From this comparison, YOLO–CL is more complete than redMaPPer in detecting MCXC2021 clusters.

Fig. 11

Fraction of MCXC2021 clusters recovered by traditional cluster detection methods in the SDSS (see text), from top to bottom: MaxBCG, GMBCG, AMF, WHL09, and WHL12. The details of cluster recovery in each case are given in the text. In all cases, except WHL12, their completeness is worse than that reached by redMaPPer and YOLO–CL. These results, compared with Fig. 9, underline the high performance of YOLO–CL with respect to traditional cluster detection methods in optical images.

Fig. 12

Completeness of the MaxBCG, GMBCG and AMF galaxy cluster catalogs as a function of redshift and mean X-ray surface brightness, I_X,500. On the right of each figure is the completeness scale. All traditional cluster detection algorithms applied to SDSS are less complete than redMaPPer and YOLO–CL (see text), except AMF, which has a performance similar to redMaPPer, and WHL12, which has a performance similar to YOLO–CL.

6 Summary

We apply the YOLO object detection deep convolutional network to the detection of galaxy clusters in the SDSS survey. Our network implementation, YOLO–CL, is a modification of the original YOLO v3 implementation to optimize galaxy cluster detection.

YOLO–CL was trained and validated using three-color images (in the g, r, i bandpasses) of 24 406 detections from the redMaPPer cluster catalog and an equivalent number of SDSS blank field images. In the validation, we obtained an estimate of our network cluster catalog completeness and purity. To assess our sample completeness with respect to X-ray detected clusters, we compared our YOLO–CL detections to the MCXC2021 cluster catalog within the SDSS footprint.

Our results show the following:

When validated on the redMaPPer catalog, YOLO–CL detects 95–98% and ≳98% of the redMaPPer clusters when using 512 × 512 and 1024 × 1024 pixel resampled SDSS images, respectively. It reaches a purity of 95 and 98%, calculated by applying the network to 512 × 512 and 1024 × 1024 pixel resampled SDSS blank fields.
When compared to the redMaPPer detection of the same X-ray detected MCXC2021 clusters, YOLO–CL is more complete at lower L_X, M₅₀₀, and R₅₀₀ than redMaPPer. This means that the neural network improved the cluster detection efficiency of its training sample. In fact, YOLO–CL recovers all clusters at L_X ≳ 1–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.75–0.8 Mpc and z ≳ 0.4. At lower luminosity, mass, radius, and redshift, its performance degrades. In comparison, redMaPPer recovers all clusters at L_X ≳ 3–9 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–6 × 10¹⁴M_⊙, R₅₀₀ ≳ 0.8–1.2 Mpc, and z ≳ 0.4.
YOLO–CL detects clusters of lower X-ray surface brightness I_X,500 than redMaPPer. In fact, YOLO–CL detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² and z ≳ 0.3, while redMaPPer detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 55 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.5 ≲ z ≲ 0.6.
The YOLO–CL selection function is approximately constant with redshift, with respect to the MCXC2021 cluster X-ray surface brightness.
When comparing to other traditional detection algorithms applied to the SDSS, we confirm that redMaPPer is an excellent choice for training our network, in terms of the recovery of X-ray cluster detections. This comparison also highlights the high performance of YOLO–CL with respect to most of the traditional cluster detection methods in optical images.

Our results show that our YOLO–CL cluster sample has a very high completeness when compared to redMaPPer and other traditional cluster detection algorithms in recovering the X-ray-detected MCXC2021 clusters. YOLO–CL also shows a very high level of purity, measured using SDSS blank field images. As pointed out in the first implementation of a deep convolutional network for galaxy cluster detection (Chan & Stott 2019), deep learning networks have a strong advantage for galaxy cluster detection over traditional techniques because they do not need galaxy photometric and photometric redshift catalogs. This eliminates the systematic uncertainties that can be introduced during the source detection, as well as the measurements of photometry and photometric redshifts, and focuses the detection method on the two main aspects of galaxy cluster detection, namely the search for overdensities of galaxies that have similar colors because they are at the same redshift, without assumptions on the significance of the overdensity or galaxy colors. Our results highlight another advantage: a higher cluster catalog completeness than traditional cluster detection algorithms applied to SDSS and a very high purity. Interestingly, the YOLO–CL selection function is approximately constant with redshift, with respect to the X-ray surface brightness.

We conclude that deep convolutional networks for galaxy cluster detection are an efficient alternative to traditional cluster detection methods, and it is worth exploring their performance for future cosmological cluster catalogs for large-scale surveys, such as Rubin/LSST, Euclid, and the Roman Space Telescope.

Acknowledgements

We thank Université Paris Cité (UPC), which funded KG’s Ph.D. research, and University Paris Science & Lettres (PSL), which funded SI’s postdoctoral research. We thank our PSL and UPC colleagues, and collaborators from the LightOn (https://lighton.ai/) company and the École Normale Supérieure (ENS), Laurent Daudet, Florent Krzakala, and Amelie Chatelain, for fruitful discussions. We thank Jean-Baptiste Melin and James Bartlett for useful discussions and help in choosing the X-ray catalog that we use in this paper. We thank Alex Saro for an insightful question that improved our comparison to the redMaPPer algorithm. We thank Eli Rykoff for providing the complete redMaPPer catalog. We gratefully acknowledge support from the CNRS/IN2P3 Computing Center (Lyon - France) for providing computing and data-processing resources needed for this work. This research has made use of the M2C Galaxy Cluster Database, constructed as part of the ERC project M2C (The Most Massive Clusters across cosmic time, ERC-Adv grant No. 340519). This work was supported by the French Space Agency (CNES). We thank the anonymous referee for her/his careful reading of the manuscript and useful suggestions that helped to improve the paper.

References

Abbott, T. M. C., Abdalla, F. B., Allam, S., et al. 2018, ApJS, 239, 18
Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35
Ade, P., Aguirre, J., Ahmed, Z., et al. 2019, J. Cosmol. Astropart. Phys., 2019, 056
Allen, S. W., Evrard, A. E., & Mantz, A. B. 2011, ARA&A, 49, 409
Ascaso, B., Benítez, N., Fernández-Soto, A., et al. 2015, MNRAS, 452, 549
Ascaso, B., Mei, S., Bartlett, J. G., & Benítez, N. 2017, MNRAS, 464, 2270
Bayliss, K. D., McMahon, R. G., Venemans, B. P., Ryan-Weber, E. V., & Lewis, J. R. 2011, MNRAS, 413, 2883
Bleem, L. E., Stalder, B., de Haan, T., et al. 2015, ApJS, 216, 27
Böhringer, H., Schuecker, P., Guzzo, L., et al. 2004, A&A, 425, 367
Bonjean, V. 2020, A&A, 634, A81
Chan, M. C., & Stott, J. P. 2019, MNRAS, 490, 5770
Chiang, Y.-K., Overzier, R., & Gebhardt, K. 2013, ApJ, 779, 127
Chiu, I. N., Klein, M., Mohr, J., & Bocquet, S. 2023, MNRAS, 522, 1601
Costanzi, M., Saro, A., Bocquet, S., et al. 2021, Phys. Rev. D, 103, 043522
Davidzon, I., Jegatheesan, K., Ilbert, O., et al. 2022, A&A, 665, A34
de Haan, T., Benson, B. A., Bleem, L. E., et al. 2016, ApJ, 832, 95
Dey, A., Schlegel, D. J., Lang, D., et al. 2019, AJ, 157, 168
Dimauro, P., Huertas-Company, M., Daddi, E., et al. 2018, MNRAS, 478, 5410
Ebeling, H., Edge, A. C., Böhringer, H., et al. 1998, MNRAS, 301, 881
Eifler, T., Miyatake, H., Krause, E., et al. 2021, MNRAS, 507, 1746
Euclid Collaboration (Adam, R., et al.) 2019, A&A, 627, A23
Euclid Collaboration (Bretonnière, H., et al.) 2022, A&A, 657, A90
Euclid Collaboration (Bisigello, L., et al.) 2023a, MNRAS, 520, 3529
Euclid Collaboration (Humphrey, A., et al.) 2023b, A&A, 671, A99
Girshick, R., Donahue, J., Darrell, T., & Malik, J. 2013, ArXiv e-prints [arXiv:1311.2524]
Gladders, M. D., & Yee, H. K. C. 2005, ApJS, 157, 1
González, R. E., Muñoz, R. P., & Hernández, C. A. 2018, Astron. Comput., 25, 103
Grove, L. F., Benoist, C., & Martel, F. 2009, A&A, 494, 845
Hao, J., McKay, T. A., Koester, B. P., et al. 2010, ApJS, 191, 254
Hasselfield, M., Hilton, M., Marriage, T. A., et al. 2013, J. Cosmol. Astropart. Phys., 2013, 008
He, K., Zhang, X., Ren, S., & Sun, J. 2015, ArXiv e-prints [arXiv:1512.03385]
He, Z., Qiu, B., Luo, A. L., et al. 2021, MNRAS, 508, 2039
Henghes, B., Pettitt, C., Thiyagalingam, J., Hey, T., & Lahav, O. 2021, MNRAS, 505, 4847
Huertas-Company, M., & Lanusse, F. 2023, PASA, 40, e001
Huertas-Company, M., Gravet, R., Cabrera-Vives, G., et al. 2015, ApJS, 221, 8
Huertas-Company, M., Primack, J. R., Dekel, A., et al. 2018, ApJ, 858, 114
Hurier, G., Aghanim, N., & Douspis, M. 2021, A&A, 653, A106
Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111
Jeffrey, N., Lanusse, F., Lahav, O., & Starck, J.-L. 2020, MNRAS, 492, 5023
Kahn, S. 2018, COSPAR Scientific Assembly, 42, E1.16–5.18
Knobel, C., Lilly, S. J., Iovino, A., et al. 2009, ApJ, 697, 1842
Koester, B. P., McKay, T. A., Annis, J., et al. 2007, ApJ, 660, 239
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193]
Li, X., Bianco, F. B., Dobler, G., et al. 2022, AJ, 164, 250
Lin, Z., Huang, N., Avestruz, C., et al. 2021, MNRAS, 507, 4149
Lupton, R., Blanton, M. R., Fekete, G., et al. 2004, PASP, 116, 133
Marriage, T. A., Acquaviva, V., Ade, P. A. R., et al. 2011, ApJ, 737, 61
Merloni, A., Predehl, P., Becker, W., et al. 2012, ArXiv e-prints [arXiv:1209.3114]
Muzzin, A., Wilson, G., Yee, H. K. C., et al. 2012, ApJ, 746, 188
Olsen, L. F., Benoist, C., Cappi, A., et al. 2007, A&A, 461, 81
Pasquet, J., Bertin, E., Treyer, M., Arnouts, S., & Fouchez, D. 2019, A&A, 621, A26
Piffaretti, R., Arnaud, M., Pratt, G. W., Pointecouteau, E., & Melin, J. B. 2011, A&A, 534, A109
Planck Collaboration XXVII. 2016, A&A, 594, A27
Planck Collaboration Int. XXVI. 2015, A&A, 582, A29
Predehl, P., Andritschke, R., Arefiev, V., et al. 2021, A&A, 647, A1
Redmon, J., & Farhadi, A. 2016, ArXiv e-prints [arXiv:1612.08242]
Redmon, J., & Farhadi, A. 2018, ArXiv e-prints [arXiv:1804.02767]
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. 2015, ArXiv e-prints [arXiv:1506.02640]
Rezatofighi, H., Tsoi, N., Gwak, J., et al. 2019, ArXiv e-prints [arXiv:1902.09630]
Rossetti, M., Gastaldello, F., Ferioli, G., et al. 2016, MNRAS, 457, 4515
Rozo, E., & Rykoff, E. S. 2014, ApJ, 783, 80
Rozo, E., Rykoff, E. S., Koester, B. P., et al. 2009, ApJ, 703, 601
Rozo, E., Wechsler, R. H., Rykoff, E. S., et al. 2010, ApJ, 708, 645
Rykoff, E. S., Rozo, E., Busha, M. T., et al. 2014, ApJ, 785, 104
Rykoff, E. S., Rozo, E., Hollowood, D., et al. 2016, ApJS, 224, 1
Simet, M., McClintock, T., Mandelbaum, R., et al. 2017, MNRAS, 466, 3103
Sobral, D., Best, P. N., Geach, J. E., et al. 2010, MNRAS, 404, 1551
Szabo, T., Pierpaoli, E., Dong, F., Pipino, A., & Gunn, J. 2011, ApJ, 736, 21
Voges, W., Aschenbach, B., Boller, T., et al. 1999, A&A, 349, 389
Wen, Z. L., Han, J. L., & Liu, F. S. 2009, ApJS, 183, 197
Wen, Z. L., Han, J. L., & Liu, F. S. 2012, ApJS, 199, 34
Wylezalek, D., Galametz, A., Stern, D., et al. 2013, ApJ, 769, 79
Wylezalek, D., Vernet, J., De Breuck, C., et al. 2014, ApJ, 786, 17
York, D. G., Adelman, J., Anderson, J. E., Jr., et al. 2000, AJ, 120, 1579
Zaidi, S. S. A., Samar Ansari, M., Aslam, A., et al. 2021, ArXiv e-prints [arXiv:2104.11892]
Zanisi, L., Huertas-Company, M., Lanusse, F., et al. 2021, MNRAS, 501, 4359
Zou, Z., Chen, K., Shi, Z., Guo, Y., & Ye, J. 2019, ArXiv e-prints [arXiv:1905.05055]

¹

https://www.darkenergysurvey.org/

²

https://www.desi.lbl.gov/

³

https://www.vro.org/

⁴

https://www.euclid-ec.org/

⁵

https://lsst.slac.stanford.edu/

⁶

https://classic.sdss.org/

⁷

Version 6.3 of the catalog, from the VizieR archive: https://vizier.cds.unistra.fr

⁸

The cluster richness is defined as the number of cluster members above a given luminosity. For redMaPPer, it is defined as a sum of the probability of being a cluster member over all galaxies in a cluster field (Rozo et al. 2009).

⁹

http://skyserver.sdss.org/dr16/en/help/docs/api.aspx#imgcutout

¹⁰

Detailed here: https://www.sdss.org/dr16/imaging/jpg-images-on-skyserver

¹¹

For the sake of completeness, we note here that the YOLO network does not actually predict the positions and dimensions of the bounding boxes directly, but rather offsets from a fixed set of B boxes called “anchors”, which act as priors to facilitate the training of the network. Those anchor boxes are usually derived from the training set by running a k-means clustering algorithm (with k = B) on the set of true bounding boxes.

¹²

https://github.com/YunYang1994/TensorFlow2.0-Examples

¹³

Defined as the center of the bounding box that hosts the cluster detection.

¹⁴

https://www.galaxyclusterdb.eu/m2c/

¹⁵

http://skyserver.sdss.org/dr16/en/help/docs/api.aspx#imgcutout

¹⁶

M₅₀₀ is defined as the mass within the circular region of radius R₅₀₀ containing a mean mass density equal to five hundred times the critical density of the Universe at a given redshift. The luminosity L_X is the luminosity L₅₀₀ in the same region.

¹⁷

Defined as a mean X-ray flux within the region containing a mass density equal to five hundred times the critical density of the Universe at a given redshift divided by its angular circular area.

Fig. 5

YOLO–CL completeness with respect to the redMaPPer cluster catalog as a function of redshift, when using the 512 × 512 (left) and the 1024 × 1024 (right) resampled SDSS images. When using 512 × 512 and 1024 × 1024 resampled SDSS images, YOLO–CL reaches a completeness of ≳98% for z ≳ 0.3 and z ≲ 0.4, respectively. In the other redshift ranges, the completeness is of ~92–94%. Overall, YOLO–CL has a very good performance in the redshift range covered by redMaPPer.

In the text

Fig. 6

YOLO–CL completeness with respect to the redMaPPer cluster catalog as a function of the redMaPPer richness, when using the 512 × 512 (left) and the 1024 × 1024 (right) resampled SDSS images. When using 512 × 512 and 1024 × 1024 resampled SDSS images, YOLO–CL reaches a completeness of ~98% for λ ≳ 40. At lower richness the completeness is of ~92–94%. Overall, YOLO–CL has a very good performance in the richness range covered by redMaPPer.

In the text

	Fig. 7 `YOLO–CL` cluster sample completeness for redMaPPer detections for the images resampled to 512 × 512 (left) and to 1024 × 1024 (right) as a function of both richness and redshift. This figure synthesizes the conclusions of Figs. 4 and 5 as a function of both variables. On the right of each figure is the completeness scale.
In the text

	Fig. 8 Distribution of the angular distance between cluster centers detected by `YOLO–CL` and redMaPPer (top panel), and `YOLO–CL` and the MCXC2021 clusters (bottom panel).
In the text

Fig. 9

YOLO–CL (continuous lines) and redMaPPer (dashed lines) cluster detection completeness above a given X-Ray luminosity, L_XL_X (left panel), M₅₀₀ (middle panel), and R₅₀₀ (right panel). YOLO–CL recovers all clusters at L_X ≳ 1–3 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳ 2–3 × 10¹⁴M_⊙,R₅₀₀ ≳ 0.75–0.8 Mpc and z ≳ 0.4. At lower luminosity, mass, radius and redshift, its performance worsens. The redMaPPer algorithm recovers all clusters at L_X ≳ 30–90 × 10⁴⁴ erg s⁻¹, M₅₀₀ ≳2–6× 10¹⁴M_⊙, R₅₀₀ ≳ 0.8–1.2 Mpc and z ≳ 0.4. At high redshifts both YOLO–CL and redMaPPer demonstrate a similar performance that is limited by the SDSS depth.

In the text

Fig. 10

YOLO–CL and redMaPPer MCXC2021 cluster detection completeness as a function of redshift and X-ray surface brightness. Left: YOLO–CL detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 30 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² and z ≳ 0.3. Right: redMaPPer detects ~98% of the MCXC2021 clusters with I_X,500 ≳ 55 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.2 ≲ z ≲ 0.6 and ~100% of the MCXC2021 clusters with I_X,500 ≳ 20 × 10⁻¹⁵ erg s⁻¹ cm⁻² arcmin⁻² at 0.5 ≲ z ≲ 0.6. On the right of each figure, we give the completeness scale. From this comparison, YOLO–CL is more complete than redMaPPer in detecting MCXC2021 clusters.

Fig. 11

Fraction of MCXC2021 clusters recovered by traditional cluster detection methods in the SDSS (see text), from top to bottom: MaxBCG, GMBCG, AMF, WHL09, and WHL12. The details of cluster recovery in each case are given in the text. In all cases, except WHL12, their completeness is worse than that reached by redMaPPer and YOLO–CL. These results, compared with Fig. 8, outline the high performance of YOLO–CL with respect to traditional cluster detection methods in optical images.

Fig. 12

Completeness of the MaxBCG, GMBCG and AMF galaxy cluster catalogs as a function of redshift and mean X-ray surface brightness, I_X,500. On the right of each figure is the completeness scale. All traditional cluster detection algorithms applied to SDSS are less complete than redMaPPer and YOLO–CL (see text), except AMF, which has a performance similar to redMaPPer, and WHL12, which has a performance similar to YOLO–CL.


[1] Abbott, T. M. C., Abdalla, F. B., Allam, S., et al. 2018, ApJS, 239, 18

[2] Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35

[3] Ade, P., Aguirre, J., Ahmed, Z., et al. 2019, J. Cosmol. Astropart. Phys., 2019, 056

[4] Allen, S. W., Evrard, A. E., & Mantz, A. B. 2011, ARA&A, 49, 409

[5] Ascaso, B., Benítez, N., Fernández-Soto, A., et al. 2015, MNRAS, 452, 549

[6] Ascaso, B., Mei, S., Bartlett, J. G., & Benítez, N. 2017, MNRAS, 464, 2270

[7] Bayliss, K. D., McMahon, R. G., Venemans, B. P., Ryan-Weber, E. V., & Lewis, J. R. 2011, MNRAS, 413, 2883

[8] Bleem, L. E., Stalder, B., de Haan, T., et al. 2015, ApJS, 216, 27

[9] Böhringer, H., Schuecker, P., Guzzo, L., et al. 2004, A&A, 425, 367

[10] Bonjean, V. 2020, A&A, 634, A81

[11] Chan, M. C., & Stott, J. P. 2019, MNRAS, 490, 5770

[12] Chiang, Y.-K., Overzier, R., & Gebhardt, K. 2013, ApJ, 779, 127

[13] Chiu, I. N., Klein, M., Mohr, J., & Bocquet, S. 2023, MNRAS, 522, 1601

[14] Costanzi, M., Saro, A., Bocquet, S., et al. 2021, Phys. Rev. D, 103, 043522

[15] Davidzon, I., Jegatheesan, K., Ilbert, O., et al. 2022, A&A, 665, A34

[16] de Haan, T., Benson, B. A., Bleem, L. E., et al. 2016, ApJ, 832, 95

[17] Dey, A., Schlegel, D. J., Lang, D., et al. 2019, AJ, 157, 168

[18] Dimauro, P., Huertas-Company, M., Daddi, E., et al. 2018, MNRAS, 478, 5410

[19] Ebeling, H., Edge, A. C., Böhringer, H., et al. 1998, MNRAS, 301, 881

[20] Eifler, T., Miyatake, H., Krause, E., et al. 2021, MNRAS, 507, 1746

[21] Euclid Collaboration (Adam, R., et al.) 2019, A&A, 627, A23

[22] Euclid Collaboration (Bretonnière, H., et al.) 2022, A&A, 657, A90

[23] Euclid Collaboration (Bisigello, L., et al.) 2023a, MNRAS, 520, 3529

[24] Euclid Collaboration (Humphrey, A., et al.) 2023b, A&A, 671, A99

[25] Girshick, R., Donahue, J., Darrell, T., & Malik, J. 2013, ArXiv e-prints [arXiv:1311.2524]

[26] Gladders, M. D., & Yee, H. K. C. 2005, ApJS, 157, 1

[27] González, R. E., Muñoz, R. P., & Hernández, C. A. 2018, Astron. Comput., 25, 103

[28] Grove, L. F., Benoist, C., & Martel, F. 2009, A&A, 494, 845

[29] Hao, J., McKay, T. A., Koester, B. P., et al. 2010, ApJS, 191, 254

[30] Hasselfield, M., Hilton, M., Marriage, T. A., et al. 2013, J. Cosmol. Astropart. Phys., 2013, 008

[31] He, K., Zhang, X., Ren, S., & Sun, J. 2015, ArXiv e-prints [arXiv:1512.03385]

[32] He, Z., Qiu, B., Luo, A. L., et al. 2021, MNRAS, 508, 2039

[33] Henghes, B., Pettitt, C., Thiyagalingam, J., Hey, T., & Lahav, O. 2021, MNRAS, 505, 4847

[34] Huertas-Company, M., & Lanusse, F. 2023, PASA, 40, e001

[35] Huertas-Company, M., Gravet, R., Cabrera-Vives, G., et al. 2015, ApJS, 221, 8

[36] Huertas-Company, M., Primack, J. R., Dekel, A., et al. 2018, ApJ, 858, 114

[37] Hurier, G., Aghanim, N., & Douspis, M. 2021, A&A, 653, A106

[38] Ivezic, Z., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111

[39] Jeffrey, N., Lanusse, F., Lahav, O., & Starck, J.-L. 2020, MNRAS, 492, 5023

[40] Kahn, S. 2018, COSPAR Scientific Assembly, 42, E1.16–5.18

[41] Knobel, C., Lilly, S. J., Iovino, A., et al. 2009, ApJ, 697, 1842

[42] Koester, B. P., McKay, T. A., Annis, J., et al. 2007, ApJ, 660, 239

[43] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193]

[44] Li, X., Bianco, F. B., Dobler, G., et al. 2022, AJ, 164, 250

[45] Lin, Z., Huang, N., Avestruz, C., et al. 2021, MNRAS, 507, 4149

[46] Lupton, R., Blanton, M. R., Fekete, G., et al. 2004, PASP, 116, 133

[47] Marriage, T. A., Acquaviva, V., Ade, P. A. R., et al. 2011, ApJ, 737, 61

[48] Merloni, A., Predehl, P., Becker, W., et al. 2012, ArXiv e-prints [arXiv:1209.3114]

[49] Muzzin, A., Wilson, G., Yee, H. K. C., et al. 2012, ApJ, 746, 188

[50] Olsen, L. F., Benoist, C., Cappi, A., et al. 2007, A&A, 461, 81

[51] Pasquet, J., Bertin, E., Treyer, M., Arnouts, S., & Fouchez, D. 2019, A&A, 621, A26

[52] Piffaretti, R., Arnaud, M., Pratt, G. W., Pointecouteau, E., & Melin, J. B. 2011, A&A, 534, A109

[53] Planck Collaboration XXVII. 2016, A&A, 594, A27

[54] Planck Collaboration Int. XXVI. 2015, A&A, 582, A29

[55] Predehl, P., Andritschke, R., Arefiev, V., et al. 2021, A&A, 647, A1

[56] Redmon, J., & Farhadi, A. 2016, ArXiv e-prints [arXiv:1612.08242]

[57] Redmon, J., & Farhadi, A. 2018, ArXiv e-prints [arXiv:1804.02767]

[58] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. 2015, ArXiv e-prints [arXiv:1506.02640]

[59] Rezatofighi, H., Tsoi, N., Gwak, J., et al. 2019, ArXiv e-prints [arXiv:1902.09630]

[60] Rossetti, M., Gastaldello, F., Ferioli, G., et al. 2016, MNRAS, 457, 4515

[61] Rozo, E., & Rykoff, E. S. 2014, ApJ, 783, 80

[62] Rozo, E., Rykoff, E. S., Koester, B. P., et al. 2009, ApJ, 703, 601

[63] Rozo, E., Wechsler, R. H., Rykoff, E. S., et al. 2010, ApJ, 708, 645

[64] Rykoff, E. S., Rozo, E., Busha, M. T., et al. 2014, ApJ, 785, 104

[65] Rykoff, E. S., Rozo, E., Hollowood, D., et al. 2016, ApJS, 224, 1

[66] Simet, M., McClintock, T., Mandelbaum, R., et al. 2017, MNRAS, 466, 3103

[67] Sobral, D., Best, P. N., Geach, J. E., et al. 2010, MNRAS, 404, 1551

[68] Szabo, T., Pierpaoli, E., Dong, F., Pipino, A., & Gunn, J. 2011, ApJ, 736, 21

[69] Voges, W., Aschenbach, B., Boller, T., et al. 1999, A&A, 349, 389

[70] Wen, Z. L., Han, J. L., & Liu, F. S. 2009, ApJS, 183, 197

[71] Wen, Z. L., Han, J. L., & Liu, F. S. 2012, ApJS, 199, 34

[72] Wylezalek, D., Galametz, A., Stern, D., et al. 2013, ApJ, 769, 79

[73] Wylezalek, D., Vernet, J., De Breuck, C., et al. 2014, ApJ, 786, 17

[74] York, D. G., Adelman, J., Anderson, J. E., Jr., et al. 2000, AJ, 120, 1579

[75] Zaidi, S. S. A., Samar Ansari, M., Aslam, A., et al. 2021, ArXiv e-prints [arXiv:2104.11892]

[76] Zanisi, L., Huertas-Company, M., Lanusse, F., et al. 2021, MNRAS, 501, 4359

[77] Zou, Z., Chen, K., Shi, Z., Guo, Y., & Ye, J. 2019, ArXiv e-prints [arXiv:1905.05055]
