Issue |
A&A
Volume 660, April 2022
|
|
---|---|---|
Article Number | A29 | |
Number of page(s) | 19 | |
Section | Galactic structure, stellar clusters and populations | |
DOI | https://doi.org/10.1051/0004-6361/202142347 | |
Published online | 05 April 2022 |
The Cetus-Palca stream: A disrupted small dwarf galaxy
A prequel to the science possible with WEAVE with precise spectro-photometric distances⋆
1
Instituto de Astrofísica de Canarias, 38205 La Laguna, Tenerife, Spain
e-mail: gthomas@iac.es
2
Universidad de La Laguna, Dpto. Astrofísica, 38206 La Laguna, Tenerife, Spain
Received:
1
October
2021
Accepted:
1
December
2021
We present a new fully data-driven approach to derive spectro-photometric distances based on artificial neural networks. The method was developed and tested on Sloan Extension for Galactic Understanding and Exploration survey (SEGUE) data and will serve as a reference for the Contributed Data Product SPDIST of the William Hershel Telescope Enhanced Area Velocity Explorer (WEAVE) survey. With this method, the relative precision of the distances is of ∼13%. The catalogue of more than 300 000 SEGUE stars for which we have derived spectro-photometric distances is publicly available on the Vizier service of the Centre de Données de Strasbourg. With this 6D catalogue of stars with positions, distances, line-of-sight velocity, and Gaia proper motions, we were able to identify stars belonging to the Cetus stellar stream in the integrals of motion space. Guided by the properties we derived for the Cetus stream from this 6D sample, we searched for additional stars from the blue horizontal and red giant branches in a 5D sample. We find that the Cetus stream and the Palca overdensity are two parts of the same structure, which we therefore propose to rename the Cetus-Palca stream. We find that the Cetus-Palca stream has a stellar mass of ≃1.5 × 106 M⊙ and presents a prominent distance gradient of 15 kpc over the ∼100° that it covers on the sky. Additionally, we also report the discovery of a second structure almost parallel to the Cetus stream and covering ∼50° of the sky, which could potentially be a stellar stream formed by the tidal disruption of a globular cluster that was orbiting around the Cetus stream progenitor.
Key words: Galaxy: halo / methods: data analysis / galaxies: dwarf / Galaxy: kinematics and dynamics / stars: distances / catalogs
The catalogue is only available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/cat/J/A+A/660/A29
© ESO 2022
1. Introduction
Large astrometric and spectroscopic surveys, such as Gaia (Gaia Collaboration 2016, 2018), the William Hershel Telescope Enhanced Area Velocity Explorer (WEAVE; Dalton et al. 2012), the Dark Energy Spectroscopic Instrument (DESI; Flaugher & Bebek 2014), Sloan Digital Sky Survey (SDSS; Kollmeier et al. 2017), Large Sky Area Multi-Object Fibre Spectroscopic Telescope (LAMOST; Zhao et al. 2012), or the Hectochelle in the Halo at High Resolution (H3; Conroy et al. 2019a), are providing chemical and kinematic properties for a tremendous number of individual stars in our Galaxy. With these data, it is possible to reconstruct the formation history of the Milky Way in detail (e.g. Belokurov et al. 2018; Haywood et al. 2018; Kruijssen et al. 2020; Naidu et al. 2020; Fernández-Alvar et al. 2021; Ishigaki et al. 2021; Lane et al. 2022; Malhan et al. 2021; Simpson et al. 2021) and to compare it to the formation histories of similar galaxies in large-scale cosmological simulations. Precise distances of individual stars are a crucial ingredient in the determination of the structure and kinematic properties of the Milky Way stellar component; for example to transform Gaia proper motions into physical tangential velocities. However, despite the great advances brought about by the European Space Agency’s Gaia mission to provide parallaxes for more than 1.4 billions stars (Gaia Collaboration 2016, 2020), these are not of sufficient precision to measure the distances of stars beyond ∼10 kpc (e.g. Ibata et al. 2017), such as those located in the outer disc or in most of the stellar halo; other methods are therefore needed to enable an investigation of the properties of the outskirts of the Milky Way.
Several methods have been developed to infer distances, either using only photometric data (e.g. Jurić et al. 2008; Ivezić et al. 2008; Deason et al. 2011; Ibata et al. 2017; Sesar et al. 2017; Thomas et al. 2019; Conroy et al. 2021) or a combination of spectroscopic and photometric data (e.g. Xue et al. 2014; Coronado et al. 2018; McMillan et al. 2018; Queiroz et al. 2018; Hogg et al. 2019; Cargile et al. 2020). Some developed methods statistically infer the distances of the stars based on the assumptions made on the global distribution of the stars in the Galaxy (e.g. Bailer-Jones 2015; Bailer-Jones et al. 2018; Queiroz et al. 2018; Anders et al. 2019; Pieres et al. 2020). However, these methods depend sensitively on the adopted Galactic spatial distribution prior (Hogg et al. 2019), which is still not precisely known, especially in the stellar halo, as different tracers yield different trends (Thomas et al. 2018; Fukushima et al. 2019). The methods to obtain photometric distances do not require (expensive) spectroscopic measurements, but are often limited to specific stellar populations. These are not always easy to identify photometrically, leading to potentially strong biases. Spectrophotometric methods tend to be more accurate because they use both photometric and spectroscopic information. However, most of them are still focused on specific stellar populations (K-giants: Xue et al. 2014, main sequence (MS) stars: Coronado et al. 2018, red giant branch (RGB) stars: Hogg et al. 2019), and the majority of them are based on theoretical spectro-photometric relations (e.g. Xue et al. 2014; McMillan et al. 2018; Queiroz et al. 2018; Cargile et al. 2020).
Thanks to the Gaia early third data release (EDR3 Gaia Collaboration 2020), it is now possible to develop new methods to derive spectro-photometric distances using the large number of stars with spectroscopic data and precise parallaxes; this can potentially be done for stars from different stellar evolutionary phases and without relying on theoretical spectro-photometric models. Here, we present a new data-driven method to precisely measure distances using spectroscopic and photometric information using machine learning (ML) techniques. This method is heavily based on the previous work described in Thomas et al. (2019); the major difference is the use here of spectroscopic parameters (gravity, effective temperature and metallicity), which improve the distance measurements, especially for giant stars, and the absence of the u-band photometry from the Canada-France-Imaging-Survey (CFIS/UNIONS Ibata et al. 2017). Our primary objective was to develop the method and the algorithm that will be used to provide spectro-photometric distances for the stars observed by WEAVE as the WEAVE Contributed Data Product1 (CDP) SPDIST. However, at the time of writing, WEAVE is not yet operational. Therefore, we developed our method using stars observed by the Sloan Extension for Galactic Understanding and Exploration survey (SEGUE Yanny et al. 2009), because among the publicly available spectroscopic datasets it is the closest to what WEAVE will deliver in terms of instrumental specifications in its low-resolution mode. The dataset, the architecture of the algorithm, and the validation of the precision on the distance achieved with this method are presented in Sect. 2. We applied this algorithm to derive spectro-photometric distances for more than 300 000 stars present in SEGUE, the catalogue of which is publicly available at Centre de Données de Strasbourg (CDS). In Sect. 3, we present the identification of the Cetus stream as a structure in the integral-of-motion space using 6D information for SEGUE stars, including the spectro-photometric distances derived in this work. In Sect. 4 we use the information gained by integrating the orbit of the stars of the Cetus stream as a guide to expand the search to the more numerous sample of stars with 5D information: first we focus on blue horizontal branch (BHB) stars (in Sect. 4.2.2), and then on the different stellar populations of Cetus present in Gaia EDR3. Finally, we present our conclusions in Sect. 5.
2. Determination of spectro-photometric distances
In this section, we present a new ML-based method to determine the distance of individual stars through their spectro-photometric parameters. Contrary to Cargile et al. (2020), who used mock data to fit their spectro-photometric distance relation, this method uses a data-driven approach.
2.1. Data
We train and apply the method to the stars observed by the SEGUE survey (Yanny et al. 2009). From the SEGUE stellar catalogue, only stars with a photometric counterpart in the second data release of the Pan-STARRS1 3π Steradian survey (PS1 Chambers et al. 2016) and in Gaia EDR3 are used hereafter. More specifically, we used the forced-WARP PSF photometry in the griz-bands from PS1 (Magnier et al. 2020a) and the G-band from EDR3. The PS1 y-band is not used because of its shallowness compared to the other bands, and Gaia GBP and GRP are not used because of the excess in colour at the faintest magnitudes correlated with large uncertainties. Although, this is not very important for SEGUE stars, because this survey has a depth of G ∼ 18, it is important to take this point into account at an this stage because WEAVE is expected to go down to the Gaia magnitude limit in the Galactic archaeology subsurvey of the thick disc and stellar halo at low spectral resolution, as presented in the WEAVE science case.
All the stars are corrected for foreground extinction assuming the E(B − V) values given by Schlegel et al. (1998) at their positions, with the reddening conversion coefficients for the griz-bands given by Schlafly & Finkbeiner (2011) for a reddening parameter of Rv = 3.1, while for the Gaia G filter, we follow Sestito et al. (2019) by adopting the coefficients from Marigo et al. (2008). It should be noted that some stars from this cross-matched catalogue are affected by crowding in dense regions (i.e. globular clusters) especially with the PS1 forced-WARP photometry used here. Therefore, to prevent any biases generated by those stars, and to remove extended objects such as background galaxies, we keep only the stars with |STARGAL| < 3. The STARGAL parameter is the median (in sigma) of the difference between the Kron and the PSF photometry compared to their uncertainties added in quadrature for all single exposures of a single object (see Magnier et al. 2020a,b).
2.2. The method
Our algorithm uses the spectroscopically determined effective temperature (Teff), surface gravity (log(g)), and metallicity ([Fe/H]) with a combination of photometric colours to estimate the absolute magnitude in the Gaia G-band for each star, and therefore its distance, through an artificial neural network (ANN). To perform this so-called regression, the inputs of the ANN are the TEFFADOP, LOGGADOP, and FEHANNRR parameters from the SEGUE Stellar Parameters Pipeline (SSPP Lee et al. 2008), and all the possible colours that can be produced from the g0,r0, i0, z0, and G0 extinction-corrected photometry2. We choose not to use the FEHADOP metallicity because it is a combination of metallicity derived by different methods, which tend to erase the signal in the tails of the distribution (see their Appendix A of Starkenburg et al. 2017). Following Starkenburg et al. (2017), we decided to use the FEHANNRR parameter because it provides a more robust estimate, particularly in the low-metallicity regime, where the bulk of the halo stars are found. In order to improve the performance of the ANN and to limit the potential biases, all the input parameters are normalised to have a distribution with a mean equal to zero and a standard deviation of one with respect to the training sample.
To train and test an ANN, it is advantageous to use a training set that is as large as possible. However, the training set has to be composed of stars with precise parameters in order to prevent the algorithm from learning a false relation due to objects with imprecise parameters. Therefore, we imposed a signal-to-noise ratio (S/N) cut of S/N ≥ 20 for the stars in the training sample. This threshold was chosen because at lower S/N, the distribution of the uncertainties on the parameters given by the SSPP as a function of the S/N is irregular, indicating that the parameters in that region are not reliable (see discussion in Thomas et al. 2019).
Because the surface gravity plays a key role in disentangling MS and RGB stars, which at the same colour have very different absolute magnitudes, we only consider stars with uncertainties δlog(g) < 0.2 dex for the training sample, which is just below the typical internal uncertainty on LOGGADOP (0.19 dex according to Lee et al. 2008).
Furthermore, we kept only stars with photometric uncertainties in all the bands better than 0.1 mag.
The absolute magnitudes on which the ANN is trained are obtained from Gaia parallaxes (ϖ), such as MG = G0 + 5 + 5log10(ϖ [mas]/1000). We correct Gaia parallaxes from the zero-point off-set as a function of colour and sky location, following Lindegren et al. (2021). This method imposes a positive zero-point-corrected parallax on the stars of the training sample. Thus, we only keep stars with a minimum parallax of ϖ = 0.1 mas yr−1, as stars with a smaller parallax tend to have unrealistic absolute magnitudes (Luri et al. 2018; Thomas et al. 2019).
As show by Luri et al. (2018), the inversion of the parallax to obtain the distance (and so the absolute magnitude), is only valid for relative parallax uncertainties ϖ/δϖ≥5 (or a relative distance precision of better than 20%). For the dwarf stars (log(g)≥3.5), which are largely from the MS, this is not a problem because 104 516 dwarf stars respect this criterion, covering a wide range of metallicity and temperature. However, as mentioned by Thomas et al. (2019), using this relative parallax precision criterion also on giant stars (log(g) < 3.5) would limit the sample to only about 3000 stars. Furthermore, these are not even representative of the full giant sample of the SEGUE dataset, as most are subgiant stars with log(g)∼3. The reason for this difference between dwarf and giant stars is that at a similar apparent magnitude, the giants are more distant than the dwarfs, which leads to a higher uncertainty on their parallax. Thus, following Hogg et al. (2019) and Thomas et al. (2019), instead of imposing a relative precision cut on the parallax for giant stars, we keep only those with δϖ< 0.07 mas.
We also used 4625 of the 6036 K-giants from the catalogue of Xue et al. (2014) (the other 1400 do not respect the photometric uncertainty criteria in all bands), which have distance measurements with a relative precision of 16%. This additional dataset marks a great improvement compared to Thomas et al. (2019), especially for the intrinsically brightest stars in SEGUE, by adding a large number of giant stars with precise distances. For the K-giants from the Xue et al. (2014) catalogue that pass the δϖ< 0.07 mas cut, we used the (maximum likelihood) distance modulus (DM) provided by Xue et al. (2014) rather than using the Gaia parallax to infer the absolute magnitude used to train the ANN. A total of 14 826 giant stars compose the training set, of which 10 201 have been selected based on their Gaia parallax.
The BHB stars present in the training sample, which can be identified through their location on the colour–magnitude diagram (CMD) in Fig. 1, have large uncertainties on their parallax, leading to a broad range of ‘true’ absolute magnitudes. We were surprised to find that our algorithm is able to recover a BHB, likely thanks to the input spectroscopic surface gravity. However, the intrinsic brightness tends to be underestimated, such that the distance of these BHBs is overestimated by a factor of ∼1.1 when compared to the distances obtained from the relation of Deason et al. (2011) based on SDSS photometry in the g and r-bands. We therefore decided to add BHB stars from the catalogue of Xue et al. (2008) to the training sample, with the absolute magnitude derived using the relation of Deason et al. (2011). This finally leads to a training sample composed of 14 826 giant stars, 104 321 dwarf stars, and 1376 BHB stars. The impact of adding the K-giants of Xue et al. (2014) and the BHBs of Xue et al. (2008) to the training sample on the precisions of the predicted distances is discussed in Appendix A.
![]() |
Fig. 1. Colour–magnitude diagram of the training sample with the expected absolute magnitude in the Gaia G-band shown in the left panel and the absolute magnitude predicted by the algorithm in the right panel. It is striking to see the RC on the predicted CMD so clearly (highlighted by the red box), while this feature it is less obvious on the expected CMD. We also highlighted with the blue rectangle the location of the BHB stars, which were added in the training sample as explained in Sect. 2.2. |
By itself, the architecture of the ANN is rather classical. It is composed of four hidden layers composed of respectively 2048, 512, 64, and 32 neurons using a rectified linear unit (ReLU) activation function built with the KERAS package (Chollet 2015). Due to the large number of possible outliers (especially for giant stars), the cost function used by the ANN is a mean squared-error (MSE) coupled to an adaptive moment estimation (also known as an Adam) optimisation method (Kingma & Ba 2014) to prevent falling to a local minimum.
The difference between the predicted and the ‘true’ absolute magnitude (used to train the ANN) is shown in Fig. 2. The residual is of σ = 0.27 mag, corresponding to a relative distance precision of 13%, and does not show any trend with absolute magnitude. Notwithstanding the inhomogeneities of the training sample, it seems reasonable to think that the 13% precision is an upper limit when comparing the CMD built from the ‘true’ absolute magnitude (Fig. 1, left) and that made from the predicted absolute magnitude (Fig. 1, right). Indeed, not only can our algorithm recover and enhance the standard distance candles that are the BHB stars (within the blue rectangle) and the red clump (within the red rectangle), but even the MS is better characterised, with two almost parallel well-defined branches corresponding to the Galactic disc and to the Galactic halo/Gaia-Enceladus-Sausage (Haywood et al. 2018). The only population for which our algorithm fails to improve the absolute magnitude prediction are the white dwarfs. This is due to an incorrect estimation of the surface gravity for the white dwarfs by the SSPP, with log(g)≃5 cm s−2 instead of log(g) = 7 − 9 cm s−2 (e.g. Tremblay et al. 2013). The precision of the spectro-photometric distances as a function of the input spectroscopic parameters is discussed in Appendix B.
![]() |
Fig. 2. Comparison of the ‘true’ (MG, true) and predicted (MG, pred) absolute magnitude for the stars of the training sample. Left panel: the true and derived quantities plotted against each other. The one-to-one relation is shown as a solid line, and the dashed and dotted lines correspond to the 1 − σ and 2 − σ deviations, respectively. Right panel: the distribution of the difference between the true and derived absolute magnitude, with a Gaussian fit overlaid. A scatter of 0.27 mag corresponds to a relative uncertainty on the distance of 13%. |
Once the ANN was trained, we applied it to all stars from the initial cross-match SEGUE-PS1-EDR3 catalogue. The catalogue, composed of 308 692 stars with 13% precise distance, is available at the CDS.
2.3. Verification of the distance precision
Though the distances of MS stars are, by construction of our training sample, well established, this is not necessarily the case for the giants, where a certain amount of doubt can persist regarding their precision. Here it is not possible to use globular clusters to obtain independent verification of the prediction of our algorithm because of the low number of globular cluster stars observed by SEGUE whose photometry is not significantly affected by crowding. However, a significant number of SEGUE stars are part of the Sagittarius (Sgr) stream, which covers a broad range of distances (e.g. Koposov et al. 2012; Belokurov et al. 2014); we can therefore use this structure to verify the precision of our distances.
We identify star members of the Sgr stream through their phase-space position (see Sect. 3). The residual contaminants are removed using the selection criteria based on proper motions and position on the sky described in Ibata et al. (2020). This leads to 888 SEGUE stars identified as part of the Sgr stream, mainly those located between 0° < RA < 50° and 100° < RA < 200°, all of them being giant stars (log(g) < 3.5) either on the BHB or on the RGB.
To determine the DM variation along the Sgr stream, we use RRLyrae stars, because of their small typical uncertainties on the distance, which are estimated to be 3% (Hernitschek et al. 2018). We applied the selection criteria of Ibata et al. (2020) in terms of position on the sky and proper motions to a catalogue of candidate RR Lyrae stars built as the union of the Gaia DR2 SOS gaiadr2.vari_rrlyrae (Holl et al. 2018; Gaia Collaboration 2019; Clementini et al. 2019), the stars classified as RRLyrae of a, b, c, or d type in the general variability catalogues gaiadr2.vari_classifier_result, and the PS1 RRLyrae of ab or cd type by Sesar et al. (2017)3. This leaves us with 10 384 potential Sagittarius stream RR Lyrae. The DM is determined assuming an absolute magnitude of MG = 0.69 mag calculated from the MG− [Fe/H] relation of Muraveva et al. (2018) for a mean metallicity of [Fe/H]= − 1.3 (Ibata et al. 2020). The variation of the Sgr DM as a function of the position along the stream was determined by fitting a cubic spline to the RR Lyrae track (Fig. 3, top), where Λ⊙ is the longitude along the Sagittarius stream orbital plane (Majewski et al. 2003).
![]() |
Fig. 3. Top panel: distance modulus of the Sgr RR Lyrae as a function of longitude along the plane of the Sagittarius stream, as defined by Majewski et al. (2003) (in grey). The blue error bars show the mean and standard deviation of the RR Lyrae DM as a function of the Λ⊙ position, and are used to define the Sgr DM track represented by the red cubic spline function. Bottom panels: difference between the absolute magnitude derived by the DM ridgeline and the absolute magnitude predicted by our algorithm as a function of the predicted absolute magnitude for stars along the Sgr stream (left). On the right panel, the red histogram shows the normalised residuals of the DM of the RR Lyrae to the fitted ridgeline along the Sgr stream. |
This DM track was then used to determine the expected absolute magnitude of the 888 SEGUE stars along the Sgr stream. The difference between this and the absolute magnitudes predicted by our method is shown in the lower panel of Fig. 3. The standard deviation is of 0.35 mag, which is larger than the 0.27 mag found previously. However, this also includes the intrinsic width of the Sgr stream, as well as the scatter introduced by variations in the RRLyrae metallicity and the MG − [Fe/H] calibration itself. The dispersion of the Sagittarius stream RRL absolute magnitudes around the DM track amounts to 0.17 mag (red histogram in Fig. 3). By deconvolving the dispersion of the absolute magnitudes of SEGUE giant members of the Sagittarius stream with this value, we obtain 0.31 mag, leading to uncertainties on the distance of 14%, with a negligible bias of 0.054 mag; this confirms the good accuracy of our algorithm. A 14% relative uncertainty on distance is probably an upper limit, as one can appreciate a small amount of contamination even after the position and proper motion cleaning, where this would lead to a larger spread around the DM track. For the remainder of the paper, we continue assuming typical uncertainties of 13%.
We note that if we were to use an absolute magnitude of MG = 0.64 mag for the RR Lyrae stars as suggested by Iorio & Belokurov (2019) and Vasiliev et al. (2021), the dispersion in DM would not vary, but the bias would increase to 0.097 mag. It should be noted here that the trend is similar for the BHBs and RGBs of the SEGUE sample. However, for the BHB, the Sgr DM variation that we fitted on the RRLyrae leads to a scatter in absolute magnitude of the BHBs of 0.3 mag, instead of the typical scatter of 0.1 mag found by Deason et al. (2011). This is again a consequence of the scatter of the fitted DM variation in the RRLyrae, of the intrisinc scatter of the Sgr stream, and of the uncertainties on our estimation of spectro-photometric distances.
3. Scientific application: Exploration of the integral of motion-space
The catalogue of distances obtained in Sect. 2.2 enables the determination of the full six dynamical dimensions for a large number of distant stars with SEGUE spectroscopic and Gaia EDR3 proper motion measurements, without being restricted to a few specific stellar populations such as BHB, RC, or RR Lyrae stars. This allows exploration of the structures present in the outer disc–stellar halo integral of motion space (also called phase-space), similar to that recently conducted by Naidu et al. (2020) with the H3 survey (Conroy et al. 2019b). The integrals of motion are particularly useful for identifying past accretion events, because the stars originating from the same object are still clustered in phase-space, even several gigayears after the spatial coherence of the progenitor stellar component has been lost (Helmi & White 1999; Jean-Baptiste et al. 2017).
For the stars of the cross-matched SEGUE-PS1-EDR3 catalogue, we obtain the Cartesian Galactocentric positions and velocities in a right-handed system, assuming that the circular velocity rotation at the solar radius (R⊙ = 8.129 kpc, Z⊙ = 20.4 pc GRAVITY Collaboration 2018) is v0 = 229.0 km s−1 (Eilers et al. 2019), with the Sun peculiar motions from Schönrich et al. (2010) (U⊙, V⊙, W⊙) = (11.1, 12.24, 7.25) km s−1. The corresponding integral-of-motion quantities are computed using the Stäeckel approximation method from the AGAMA Python package (Vasiliev 2018). The potential used for this computation is based on the GALPOT potential (Dehnen & Binney 1998; McMillan 2017), which is composed of three exponential discs, representing the stellar thin and thick discs, the interstellar medium, the bulge, and the dark matter halo, whose parameters, listed in Table 1, have been slightly adjusted to fit the rotation curve observed by Eilers et al. (2019), as shown in Fig. 4.
![]() |
Fig. 4. Rotation curve of the Milky Way model used in this work (thick line) together with the measured rotation curve of our Galaxy by Eilers et al. (2019) (red dots). The thin blue lines show the rotation velocity at the solar radius (229 km s−1 at 8.129 kpc). |
Parameters of the GALPOT potential listed here, composed of three disc components corresponding to the thin and thick discs and the interstellar medium, and two halo components corresponding to the bulge and the dark matter halo.
Figure 5 (left) shows the distribution of the azimuthal (Jϕ) and vertical (Jz) actions of the SEGUE stars. One can see a substructure around Jϕ ≃ −3000 kpc km s−1 at low Jz, which corresponds to the Galactic disc, as confirmed by the corresponding position on the energy-vertical angular momentum4 diagram (Fig. 5, right) close to a circular orbit (represented by the green line). In this last panel, one can also see a structure around Lz ∼ 2000 kpc km s−1 and E ∼ −0.7 × 105 km2 s−2, which corresponds to Sequoia (Myeong et al. 2018a,b, 2019). In addition to the disc, an elongated overdensity in the azimuthal and vertical action plane is clearly visible (highlighted by the red ellipse; see the right panel of Fig. 5 for the location of the stars in E − Lz). The large majority of the stars forming this structure are part of the Sagittarius stream (Ibata et al. 1994), as is visible in Fig. 6, where we compare their positions, heliocentric distance, proper motions, line-of-sight, and Galactocentric radial velocities to those of the Sgr stream identified previously by Ibata et al. (2020) and Vasiliev et al. (2021).
![]() |
Fig. 5. Azimuthal and vertical actions of the stars from the SEGUE dataset (left), and corresponding energy-vertical angular momentum diagram (right). The green lines mark the locus of circular orbits in the Milky Way model. The red ellipse on the left panel shows clearly visible structure corresponding to the Sagittarius stream. The stars inside that selection are highlighted in red in the right panel. The stars between the red and blue ellipses in the left panel are used to estimate the background contamination, which is used in Fig. 6. |
However, as we can see from the different panels of Fig. 6, aside from the Sgr stream, our action-space selection encompasses stars from the foreground and background (which includes other structures too, like the Gaia-Enceladus-Sausage; see Naidu et al. 2020) and a clear substructure at RA = 10 − 35°, which stands out in radial velocity and in the two proper-motion components. To gauge the contribution from foreground and background stars, we overlay on Fig. 6 (shown by the black contour) the distribution of the stars lying at the edge of the action-space selected area (i.e. between the red and blue ellipses on Fig. 5): here one can see stars mostly belonging to the Gaia-Enceladus-Sausage (see e.g. Naidu et al. 2020; Myeong et al. 2018c), but also a few stars with low Jz belonging to the Sgr stream not included in our ellipsoid selection. Nevertheless, the group of 91 stars between RA = 10 − 35° and Galactocentric radial velocity between Vrad = −50 and 270 km s−1 (within the orange box in the lower panel of Fig. 6 and highlighted with star symbols in the others) have a clearly different distribution from both the stars of the Sgr stream and the contamination. Comparing the position on the sky and the distances of those stars to previous works (Newberg et al. 2009; Koposov et al. 2012; Yam et al. 2013), we conclude that those stars are part of the Cetus stellar stream. We refer to those stars as the Cetus stream spectroscopic sample in the remainder of the paper.
![]() |
Fig. 6. Position on the sky, heliocentric distance, proper motions, line-of-sight, and Galactocentric radial velocities of the stars from the structure that we identified in Fig. 5 (red symbols). The light-blue and purple points are the stars identified as part of the Sagittarius stream by Ibata et al. (2020) and Vasiliev et al. (2021), respectively. Lower panel: another structure, highlighted by the orange rectangle, stands out around RA ∼ 25°, and is made up of stars that have clearly distinct galactocentric radial velocities from both the Sagittarius stream and the contaminating background stars (mostly originating from the Gaia-Enceladus-Sausage) shown by the black contours in each panel. The stars from this structure, which appears to be the Cetus stream, are shown by the orange stars in the other panels; they also form a distinctly visible structure in μα, * and μδ, * |
Despite sharing similar positions in energy, and vertical and azimuthal actions, the spatial track of the Cetus stream differs from that of the Sgr stream by ∼60°, as already mentioned in these previous studies. This example clearly shows the limitation of the identification of stellar structures with only the energy–angular momentum diagram or in actions-space, without considering the action-angles, because these methods do not use the full 6D information available.
The metallicity distribution (obtained using the FEHADOP parameter) of the 91 stars in the Cetus stream spectroscopic sample is shown in the upper panel of Fig. 7. The mean metallicity is [Fe/H] = −1.93 with a dispersion σ[Fe/H] = 0.21 dex, consistent with the values found by Yam et al. (2013) and Yuan et al. (2019). These values are insensitive to being derived from the full sample of 91 stars or by applying a 3-σ clipping, which removes four stars.
![]() |
Fig. 7. Upper panel: metallicity distribution of the Cetus stream spectroscopic sample. The grey histogram shows the metallicity of the four stars that were rejected by the sigma-clipping method. Lower panel: distance distribution of the Cetus stream spectroscopic sample. The distribution is clearly asymmetric, with a tail towards shorter distances, which could be an indication of a distance gradient along the stream. |
The mean heliocentric distance we obtain for the Cetus stream spectroscopic sample is 31.2 kpc with a total dispersion of 4.2 kpc, including the distance uncertainties, which are consistent with previous measurements (Newberg et al. 2009; Koposov et al. 2012; Yam et al. 2013; Yuan et al. 2019) made in the same part of the sky, and with various stellar tracers (RGBs, BHBs, and K-giants). However, the distance distribution (Fig. 7, lower panel) appears asymmetric, with a tail pointing towards shorter distances (∼20 kpc), which could be an indication of a distance variation along the stream. Unfortunately, the spectroscopic Cetus sample is not sufficient to measure a distance gradient. In the following section, we expand the search for Cetus stream member stars using 5D information from Gaia EDR3 data.
4. The Cetus stream in Gaia EDR3
In this section, we expand the search for Cetus member stars to the 5D Gaia EDR3 sample in order to identify the edges of the stream, measure its distance gradient, and estimate the stellar mass of its progenitor. To this aim, we first use the 6D information from the Cetus stream spectroscopic sample to guide our search; specifically, we integrate the orbits of these stars to define the area of the sky likely to contain its tidal debris (Sect. 4.1). The search for Cetus stream member stars is first carried out on BHB stars, whose absolute magnitude (and therefore distance) can be derived from photometric colours with good precision. For example, such a relation exists for SDSS (g − r) (Deason et al. 2011); as no such relation is currently available for the Gaia bands, and we do not want to be limited by the SDSS coverage, we first provide a re-calibration of the Deason et al. (2011) relation to the Gaia EDR3 photometric system (Sect. 4.2.1) and then use the likely BHBs identified in Gaia EDR3 data to analyse the Cetus stream, and to define its extent, distance gradient, and track on the sky (Sect. 4.2.2). In Sect. 4.3, the properties derived in Sect. 4.2.2 are used to include stars in other evolutionary phases, assign probabilities of membership, and estimate the mass of the progenitor.
4.1. The orbital plane of the Cetus stream
The track of stellar streams emanating from the disruption of globular clusters or small dwarf galaxies is well approximated by the orbital path of their progenitor (e.g. Malhan & Ibata 2018 and Ibata et al. 2021 used this method to detect new stellar streams in Gaia data). Recent simulations of Chang et al. (2020) show that the track of the Cetus stream follows the orbit of its progenitor over two wraps, even if it originated from a dwarf galaxy with a 2 × 109 M⊙ total mass. Therefore, it is possible to use the orbit of the stars of the Cetus stream spectroscopic sample to define its orbital plane, and thus the area of the sky where tidal debris is to be searched for.
The orbits were integrated backward and forward over 1 Gyr in the same MW potential used in the previous section. The variation in Galactocentric distance as a function of time is shown in Fig. 8. Over the 87 orbits, one star is clearly not related to the Cetus stream, with a completely different apocenter from all the others. The orbits of the majority of the other stars have a similar common behaviour (56/86, highlighted in blue in the figure), with a pericenter of ∼15 kpc, an apocenter at ∼37 kpc, an orbital inclination of ∼65° , and an orbital period of ≃600 Myr, which is slightly lower than the value of 70 Myr found by Yam et al. (2013), betraying a more than probable common origin. Therefore, only two periods, one forward and one backward (i.e. between ±600 Myr) of the 56 orbits with a common behaviour are used to define the orbital plane of the Cetus stream. The differences of most of the other 30 orbits from this common behaviour can be explained by the fact that these stars are not part of the Cetus stream. Indeed, the current galactocentric radial velocity of 14 of them exceeds 150 km s−1, contrary to the stars showing a common orbital behaviour, whose galactocentric radial velocities range from 0 to 150 km s−1. For ten others, the current distance ranges between 18 kpc and 25 kpc, while the current distance of the stars with a common orbital behaviour is between 25 kpc and 40 kpc. For the six others, the deviating orbital behaviour could be a consequence of the combined uncertainties on the observables: proper motions (PM), distance, and radial velocity). We note that it is also possible that the departure from the common behaviour of all these stars is caused by the simple assumption we made that the Cetus stream is a fully coherent structure disrupted in an adiabatic process, which neglects for example the perturbations generated by a massive Large Magellanic Cloud (LMC). Interestingly, removing those 30 stars does not change the mean metallicity that we find for the Cetus stream of [Fe/H] = −1.93 ± 0.21, while the mean heliocentric distance increases slightly and the dispersion is reduced (from 31.2 to 32.0 kpc and from 4.2 to 2.8 kpc, respectively).
![]() |
Fig. 8. Orbits of the Cetus stream spectroscopic sample stars integrated over 1 Gyr backward and forward. The blue orbits are the ones used to find the natural coordinates of the Cetus stellar stream. |
Although the ‘natural’ coordinates of a stellar stream are usually expressed using a great circle rotation (i.e. Ibata et al. 2001; Koposov et al. 2010, 2019; Shipp et al. 2019), in the case of the Cetus stream, we find that a small circle is better suited. Indeed, in a Galactocentric frame, the projected coordinates will follow a great circle, but this is no longer the case if the observer is located on Earth, and a constant offset is visible in the coordinate perpendicular to the stream. We find that the ‘natural’ coordinates of the Cetus stream (ϕ1, ϕ2) are well expressed by a small circle with a pole centred on (RA, Dec) = (125.1809832°, 15.91290743°) and an offset to the great circle of 14.36°. The transformation of coordinates is performed with the GREATCircleICRSFRAME class present in the GALA package (Price-Whelan 2017), with ϕ1, ϕ2) = (0,0) corresponding to (RA, Dec) = (22.11454259°, −6.7038421°), where we think the centre of the progenitor of the Cetus stream is located (see Sect. 4.2.2).
Hereafter, we limit the search of stars belonging to the Cetus stream to the region between |ϕ2|< 35°, because this latter contains the 56 orbits used to define the stream orbital plane. Moreover, we exclude the region at |b|< 10° because of the high extinction and the large number of disc stars. The corresponding search area is highlighted in grey in Fig. 9.
![]() |
Fig. 9. Aitoff projection of the sky in Galactic coordinates; the area used to search for the Cetus stream is represented in grey. The region near the Galactic disc (|b|< 10°), within the dashed lines, is excluded from the search because of the high extinction and the large number of contaminant disc stars. The blue lines correspond to the orbits of the stars from the Cetus spectroscopic sample used to find the plane of the Cetus stream. The current location of these stars is indicated by the orange stars. |
4.2. Tracing the Cetus stream with BHB stars
Blue horizontal branch stars are very often used to estimate the distance of substructures because they are relatively bright stars and their absolute magnitude is constant to first approximation (Mg ∼ 0.5 in the SDSS g-band; Deason et al. 2011). Therefore, we first use this stellar population to investigate the extent of the Cetus stellar stream and to measure the distance gradient along it.
4.2.1. BHBs distance calibrated for Gaia EDR3
Deason et al. (2011) found that the absolute magnitude of BHB stars, and therefore their heliocentric distances, are a function of their colour, with a narrow scatter around the relation derived by these authors (∼0.1 mag). This relation was calibrated for the SDSS photometric system; however, SDSS only covers a fraction of the orbital plane of the Cetus stream. In order to exploit the full sky coverage afforded by Gaia, we re-calibrate the Deason et al. (2011) relation in the Gaia EDR3 photometric system.
To this aim, we used the sample of BHBs identified spectroscopically by Xue et al. (2008) in SEGUE, adopt their SDSS photometry to derive their DM using the relation by Deason et al. (2011), and use it to compute their absolute magnitude in the Gaia G-band, MG, BHB. As shown in Fig. 10, MG, BHB can be expressed as:
![]() |
Fig. 10. Colour–absolute magnitude relation for BHB stars in the Gaia EDR3 photometric system. The orange line corresponds to the relation found in this work (Eq. (1)) by fitting a polynomial on the BHB population by Xue et al. (2011) (grey points). The stars in the blue area are excluded from the fit as they are likely misidentified as BHB. Lower panels: residuals of the absolute magnitude to the fitted relation. |
Here we excluded the stars with MG > −1.3(GBP − GRP)0 + 1.1, because they clearly deviate from the general trend, possibly because the Xue et al. (2008) BHB sample is not 100% pure (see Starkenburg et al. 2019). It has to be noted that 2160 of the 2362 stars used are located in the range of validity of the Deason et al. (2011) relation, namely −0.25 ≤ (g − r)0 ≤ 0.0.
The scatter between the absolute magnitude estimated by our re-calibration and that by Deason et al. (2011) is σΔMG = 0.04 mag, which is smaller than the intrinsic precision of 0.1 mag of the Deason et al. (2011) relation, and with a negligible offset (< 0.005 mag). This means that our method has a relative distance precision similar to the Deason et al. (2011) calibration, i.e. 5%.
4.2.2. Identification of BHB candidates along the Cetus stream
In order to search for BHB candidates along the Cetus stream, we selected Gaia EDR3 sources with GBP − GRP)0 ≤ 0.5 located in the plane of the Cetus stream (|ϕ2|< 35°). We removed sources that had the duplicate flag on or a spurious astrometric solution (with the re-normalised unit weight error, RUWE > 1.4, Lindegren, document GAIA-C3-TN-LU-L-124-01), and those located in high-extinction areas (E(B − V) > 0.3) and those with high uncertainties on their parallax measurement (ϖ > 0.4 mas).
As the Cetus stream has a pericenter of ∼10 kpc (Fig. 8), we applied a generous cut on the parallax such that stars with ϖ − 3δϖ< 1/5 mas are kept; this removes a large portion of disc contamination. To limit contamination, we also removed stars located within: 15 times the half-light radius of any globular clusters around the MW (using the values given by Harris 2010); 5 times the half-light radius of the dwarf galaxies listed in the catalogue of McConnachie (2012), an ellipse centred on M 31 with a semi-major axis of 3° and a semi-minor axis of 1.5° oriented at 38° (de Vaucouleurs 1958) and within the inner 1° of M 33; an ellipse centred on the LMC with semi-major and minor axes of 26° and 11.6° respectively, and in an ellipse centred on the SMC with a semi-major axis of 20° and a semi-minor axis of 8°. For M 31, M 33, the LMC, and the SMC, these values have been chosen by visual inspection.
The distance distribution of the selected blue point sources is shown in Fig. 11a as a function of longitude along the Cetus stream orbital plane, ϕ1. These contain not only BHB stars, but also contamination from blue stragglers (BSs), white dwarfs (WDs), young MS turn-off stars (y-MSTOs), and distant quasi-stellar objects (QSOs; e.g. Sirko et al. 2004; Vickers et al. 2012; Fukushima et al. 2018). We removed the known QSOs compiled by Liao et al. (2019). As for removing the majority of the stellar contaminants, we perform a series of gradually more restrictive kinematic selections. Such a process allows in a first step to detect the full extent of the stream without cutting its edges, by performing a very restrictive selection, and then to clean the sample in a second step. Stars in a stream generally do not have large motions in the direction perpendicular to the orbit, unless there are strong perturbers (e.g. Erkal et al. 2019), and have a common direction of movement parallel to the orbit. For these reasons, based on the orbits of the Cetus stream spectroscopic sample, we performed a first broad kinematic selection, which is visible in Fig. 11e, such as (μϕ1,corr < 0.0 mas yr−1 and − 1.3 mas yr−1 < μϕ2,corr < 1.8 mas yr−1), where μϕ1, corr and μϕ2, corr are the proper motions along the Cetus coordinates, corrected for the Sun reflex motion using the individual distances estimated from Eq. (1) and the solar motion used in Sect. 3. As visible in Fig. 11b, we can already clearly identify the stream structure between −40° < ϕ1 < 40°. This wide generous selection still includes contamination, such as background/foreground BHBs from the Galactic discs and stellar halo, as well as blue stars misidentified as BHBs.
![]() |
Fig. 11. Distance distribution of selected BHB stars through a succession of increasingly restrictive kinematic selections. Panel a: distance distribution of BHB stars prior to any kinematic selections, where the orange stars show the positions of the 56 spectroscopic stars whose orbits have been used to define the orbital plane of the Cetus stream. Panel b: same distribution for stars after a first broad kinematic cut performed in the corrected proper motion, illustrated by the black lines in panel e, where the blue lines show the orbits of the spectroscopic stars used to define this cut. Panel c: distance distribution of BHB stars after an additional wide selection based on their non-corrected proper motions, as shown in panel f (see Sect. 4.2.1). Panel d: distribution of the BHBs of the Cetus stream (in purple) after two additional more restrictive selections to remove the contamination of foreground and background stars performed in the non-corrected proper motion space shown in panel g and in the position-corrected parallel proper motion space shown in panel h. In all panels, ϕ1 and ϕ2 refer to the ‘natural’ coordinates of the Cetus stream (see Sect. 4.1). |
As distances (and corrected proper motions) for misidentified BHBs will be incorrect, we also look into non-corrected proper motions. Based on the orbits of the stars derived earlier, we see that a second broad selection, keeping stars with 3.3 mas yr−1 > μphi, 2 > 0.04 ⋅ ϕ2 − 0.9 mas yr−1, as shown in Fig. 11f, removes a small fraction (≃16%) of the sample, but improves the detection of the edges of the stream around ϕ1 ≃ ±50° (Fig. 11c). In equatorial coordinates, this region corresponds to an area located between −10° ⪅RA ⪅ 50° and −90° ⪅Dec ⪅ 55°, extending the previous detection of the Cetus stream (Newberg et al. 2009; Yam et al. 2013; Yuan et al. 2019) over several tens of degrees in the equatorial Southern Hemisphere, overlapping with the position of the Palca overdensity discovered by Shipp et al. (2018), as predicted recently by Chang et al. (2020). Therefore, to avoid future confusions, we propose to rename the stream the Cetus-Palca stream. However, in neither Figs. 11b nor c do we see the trace of the northern (Galactic) counterpart detected by Yuan et al. (2019) in phase space, which should be located around ϕ1 ∼ 150° (l ∼ 50°, b ∼ 50°). The detection of clear edges to the Cetus-Palca stream, and the absence of broadening toward them in Fig. 11c, may suggest that the northern detection of Yuan et al. (2019) is an artefact, or not related to the Cetus-Palca stream. However, this northern counterpart is expected to be spatially diffuse, and it may not appear as a structure distinguishable from the background or foreground in our selection.
To clean out the broadly selected sample, we applied two more restrictive selections, once again in kinematic space, because selecting in position and distance spaces could lead to cutting the stream before its actual edges. The first cut was done in (non-corrected) proper-motion space, as the Cetus-Palca stream stands out from most of the Milky Way contaminants, as highlighted by the blue ellipse in Fig. 11g. The stars within this ellipse are indicated by the blue points in Figs. 11d and f. As this sample still contains a large amount of contaminants, mostly from the disc, we look at the position-parallel proper-motion space (Fig. 11h) to select the BHB stars that are part of the same structure as the stars from the spectroscopic sample. The final cleaned sample of BHB stars that are part of the Cetus-Palca stream is shown in purple in Fig. 11e. We can clearly see the Cetus-Palca stream between −40° ≤ϕ1 ≤ 35°. Around ϕ1 = 0°, the stream appears wider compared to the other sections, which could indicate the position where the leading and trailing arms meet, and so where the progenitor would likely be if it were completely disrupted. This motivated our choice for the location of the origin of the ϕ1 axis in Sect. 4.1.
With the final BHB selection, one can see a clean, almost linear distance gradient along the Cetus-Palca stream, with a distance varying between ∼25 kpc at the edge of the leading arm (ϕ1 ≃ −40°) to ∼37 kpc at the edge of the trailing arm (phi1 ≃ 35°). This explains the asymmetric distribution in the distance found with the Cetus-Palca stream spectroscopic sample, because SEGUE covers only the leading arm (ϕ1 < 0°) of the Cetus-Palca stream.
4.3. Tracing the Cetus-Palca stream with different stellar populations
As noted in the previous section, BHB stars are perfect tracers of the position and the distance gradient along the Cetus-Palca stream. However, these stars only represent a small fraction of the total number of stars of either a dwarf galaxy or a globular cluster, which renders them unusable for measuring the stellar mass of an object like Cetus-Palca. Therefore, to further explore the properties of the Cetus-Palca stream, we expand the analysis to other stellar populations. For this analysis, we used the information gained in the previous section to select potential members of Cetus-Palca.
The initial selection of stars is similar to that done for the BHBs, except that the colour selection criteria is broader, with −0.5 ≤ (GBP − GRP)0 ≤ 1.7. Moreover, we restricted the analysis to −75° < ϕ1 < 75° and to stars with ϖ > −0.4 mas. As we found that Cetus-Palca is always located at distances farther than 20 kpc, we imposed that all stars should have ϖ < 1/5 mas (rather than ϖ − 3δϖ< 1/5 mas for the BHBs), again in order to remove stars that are very likely from the disc, while keeping as many stars of the halo as possible. Finally, we restrict the analysis to the proper motion plane −3 mas yr−1 ≤ μϕ, 1 ≤ 2 mas yr−1 and −2 mas yr−1 ≤ μϕ, 2 ≤ 3 mas yr−1.
Our adopted approach to select stars that are likely members of the Cetus-Palca stream is largely inspired by the methods used to statistically separate the stars of a dwarf galaxy from the background and foreground contamination (e.g. Martin et al. 2013b; Longeard et al. 2018; Pace & Li 2019; McConnachie & Venn 2020a,b; Battaglia et al. 2022). The general premise of those methods is that any star in Gaia is either a member of a stellar structure (e.g. a dwarf galaxy or a stellar stream) or of the Milky Way foreground or background. Therefore, the unmarginalised likelihood ℒ of a given star can be defined as
where ℒsub and ℒMW are respectively the likelihoods of the stellar substructure and of the MW foreground and background, and fsub is the fraction of stars in the stellar structure.
ℒsub and ℒMW can be decomposed as spatial (ℒs) and kinematic (ℒPM) components. However, contrary to dwarf galaxies that cover a small area of the sky and for which there is no spatial variation of the measured proper motions (at first order and with the current precision of the instrument), this is not the case for an elongated stellar stream such as Cetus-Palca. Similarly, the proper motions of the MW stars vary strongly with position on the sky. Therefore, we decoupled the kinematic component in two terms, where the first one ℒPM1 is the likelihood of the proper motion at a given position along the first axis, and ℒPM2 is the likelihood of the proper motion at a given position along a second perpendicular axis. As a result, we obtain ℒsub and ℒMW expressed as:
where the index x refers either to the MW or to the satellite. Contrary to the works mentioned above, we did not use colour–magnitude information to compute the probability of being a member of Cetus-Palca because we want to use this as an independent validation check of our method, but also because we use the luminosity function to measure the stellar mass of Cetus-Palca.
For this analysis, we work with the equatorial coordinates both for position and proper motions (RA, Dec, and μδ) because we can assume, in a first approximation, that the variation of
along the right ascension axis is decoupled from the variation of μδ along the declination axis. This would not be the case if we worked in the Cetus-Palca coordinates, because we would have to introduce a correlation matrix for the proper motion perpendicular to the plane of the stream (μϕ2) to include its variation along ϕ1 and along ϕ2.
For the stream likelihood (ℒsub), the positional proper motion likelihoods in the right ascension and declination planes ℒPM1, sat and ℒPM2, sat are computed from the BHBs identified as being part of the Cetus-Palca (see Sect. 4.2.2). For both planes, we modelled the variation of the proper motion with the position by a third-order polynomial fitted on the distribution of the Cetus-Palca BHBs. In this way, at every position, the distribution of proper motion is modelled by a Gaussian function with a width equal to the residual BHB distribution with respect to the polynomial fit (σ = 0.197 mas yr−1 for the right ascension plane and σ = 0.143 mas yr−1 for the declination plane). As it has been found in previous works that the morphology of the Cetus-Palca stream is rather complex (Newberg et al. 2009; Yam et al. 2013; Yuan et al. 2019), and because the foreground and background MW contamination is largely dominating the signal, we assumed a uniform distribution for the spatial likelihood for the stream. For the MW foreground and background distribution, because it largely dominates the signal, we constructed the spatial and the two kinematics likelihood components directly from the distribution of the selected Gaia stars, and smoothed them over 1° spatially and over 0.05 mas yr−1 in proper motion space.
The fraction of stars that are part of Cetus-Palca (fsub) was found by exploring the parameter space of the posterior distribution with EMCEE (Foreman-Mackey et al. 2013). The posterior distribution of the data (D), given a model with a fraction of stars in Cetus-Palca fsub, is defined such as P(D|fsub)∝ℒ × P(fsub), where P(fsub) is a uniform flat prior on fsub between 0 and 1. Doing so, we found a fraction of stars in Cetus-Palca equal to fsub = 0.00837 ± 0.00016.
Once the value of fsub is found, it is possible to compute the probability of belonging to Cetus-Palca (PCetus) for each star, as
The spatial distribution of the 17 990 stars with PCetus ≥ 0.2 is shown in the lower panel of Fig. 12. The selection still contains contamination from the MW disk, the LMC, and the Sgr stream, but we can clearly see the extent of the Cetus-Palca stream in the bottom panel, whose position is highlighted by the blue rectangle in the upper panel. We arbitrarily divided the stream into seven boxes of equal area, for which we present the respective CMDs in Fig. 13. The RGB of the Cetus-Palca stream is clearly visible in each of these boxes, and has a very different signature from the background, as shown by the CMDs of the two control areas (purple rectangle in Fig. 12). The CMDs of boxes 3 and 4 are by far the most populated. These two boxes cover the region where we believe the centre of the progenitor to be (Sect. 4.2.2). For each box, we manually adjusted the DM of a 14 Gyr MIST isochrone with [Fe/H] = −1.93 (Dotter 2016; Choi et al. 2016) to the CMD. The age of the isochrone was chosen because it gives the best fit to the colour extent of the BHBs, and the metallicity corresponds to that which we measured with the spectroscopic sample (Sect. 5). The variation in distance measured in this way is highly consistent with that measured from the BHBs.
![]() |
Fig. 12. Lower panel: position of the stars with PCetus ≥ 0.2. Upper panel: The black circles show the position of the BHB of Cetus-Palca identified in Sect. 4.2.2, and the yellow stars show the position of the 56 stars of the Cetus-Palca spectroscopic sample with common orbital behaviour. The blue rectangle highlights the position of the Cetus-Palca stream visible in the lower panel. The stream is decomposed into seven boxes of equal area used to measure the spatial variation of the CMDs in Fig. 13. The cyan polygon highlights the position of the potential globular cluster stream that we find in this work. The two purple dashed rectangles highlight the region located outside the Cetus-Palca stream used to show the MW foreground and background CMD. The red dashed ellipses show the positions of two additional structures visible in the lower panel. The light purple band shows the track of the Sgr stream. Finally, the black and white 2D histogram in the background shows the number of good observations along the scan direction of Gaia (ASTROMETRIC_N_GOOD_OBS_AL). |
The co-added CMD of the seven boxes where we identified Cetus-Palca is shown in panel a of Fig. 14, where an MIST isochrone with a distance modulus of 17.7 mag is overlaid. To take into account the variation in distance along the Cetus-Palca stream, before co-addition, the various CMDs were offset by the difference between the DM indicated in each panel of Fig. 13 and the mean distance modulus of Cetus-Palca of 17.7 mag. The RGB and BHB are clearly visible, while the contamination from MW stars is relatively low, because the disc MS is barely detectable. In this figure, the black presents the RGB selection box whose luminosity function is shown in panel b. We compared the luminosity function of the observed RGB to a fiducial RGB population with a Kroupa (2001) mass distribution to adjust the total luminous mass of Cetus-Palca. For this, we only used the stars brighter than G0 = 18.5, because the completeness drops rapidly at fainter magnitudes. This corresponds to the threshold magnitude up to which Gaia can be considered as complete in this region of the sky (Boubert & Everall 2020). We performed 100 random realisations to take into account the photometric uncertainties of each star. In doing so, we find that the resulting stellar mass ranges between 1.3 − 1.6 × 106 M⊙, with a median of 1.4 × 106 M⊙, consistent with the upper mass range found by Yuan et al. (2019). If we were to consider stars with PCetus ≥ 0.1, we would obtain a very similar result, with a stellar mass between 1.3 − 1.7 × 106 M⊙. This is in the same order of magnitude as ‘classical’ dwarf spheroidal galaxies like Sextans or Sculptor, which have stellar masses of 0.4 × 106 and 7.8 × 106 M⊙, respectively (Irwin & Hatzidimitriou 1995; Łokas 2009; de Boer et al. 2012). According to the stellar mass–metallicity relation for Local Group dwarf galaxies from Kirby et al. (2013), such a massive galaxy should have a metallicity of [Fe/H] = −1.63 ± 0.16. This is consistent with the metallicity that we measured for the Cetus-Palca stream in Sect. 3, though it is towards the lower limit of the observed stellar mass–metallicity relation by Kirby et al. (2013). However, this relation has large uncertainties in the metal-poor regime, and a galaxy with a similar stellar mass, namely Andromeda III, is also found to be more metal poor than predicted by this relation, with [Fe/H] = −1.84 ± 0.05 (Kirby et al. 2013).
![]() |
Fig. 13. Colour–magnitude diagrams of the different spatial regions shown in Fig. 12 in which the black line shows a MIST fiducial isochrone for a 14 Gyr-old population with a metallicity of [Fe/H] = −1.93, shifted by the DM indicated in the legends. For structures 1 and 2, the DM is 17.7. The colours of the symbols reflects the colour coding of the structures highlighted in Fig. 12. |
![]() |
Fig. 14. Panel a: CMD of the stars located in the Cetus-Palca selection box (blue rectangle in Fig. 12) overlaid with a 14 Gyr isochrone with a metallicity of [Fe/H] = −1.93 (in orange). The black plain polygon shows the location of the RGB stars with the region where Gaia is complete, while the region where Gaia is not complete is shown by the dashed line polygon. Panel b: luminosity function of the RGB used to estimate the stellar mass of the Cetus-Palca stream (in blue), with the area affected by the drop in completeness (in grey). The orange histogram shows the luminosity function of a 1.5 × 106 M⊙, [Fe/H] = −1.93, 14 Gyr-old RGB population following a Kroupa mass function. |
4.4. Detection of a potential stream of stream
In addition to the Cetus-Palca stream, another, fainter stream-like structure parallel to the Cetus-Palca stream is visible in Fig. 12 located around ϕ2 ≃ 10° (highlighted by the cyan polygon in the upper panel). This structure is also visible in the distribution of the BHBs we identified as members of Cetus-Palca (black circles in the upper panel of Fig. 12). The position of this structure does not coincide with an area observed more than others by Gaia and where the completeness could be higher, as illustrated by the number of good observations in the lateral-scan direction of the Gaia satellite, ASTROMETRIC_N_GOOD_OBS_AL, (2D histogram in the upper panel of Fig. 12). This structure seems to be spatially decoupled from the position of the Cetus-Palca stream and the CMD of the stars in that region (whose area is equal to that of an individual box along Cetus-Palca) is more populated than in any of the neighbouring boxes.
The small width of this stream (≈300 pc) could indicate that it might be formed by the tidal debris of a disrupted globular cluster that was orbiting around the progenitor of Cetus-Palca. The number of stars with spectroscopic measurements that fall on this feature and share a common orbital behaviour to Cetus-Palca is too small (7 stars) to establish its nature with certainty. The low scatter in metallicity (0.08 dex around a value [Fe/H] = −1.93 dex) would support the globular cluster hypothesis. However, the distance gradient along this feature and the kinematics of its stars are very similar to those of the rest of the Cetus-Palca spectroscopic sample. Only the line-of-sight velocity might be different from those of the other stars located along the main Cetus-Palca stream but with so few measurements, this is not a significant result. Therefore, it is not possible exclude the possibility that this second structure is actually another appendix of the Cetus-Palca stream itself. Such a structure could result, for example, from the non-linear perturbations caused by the LMC (see Erkal et al. 2019; Shipp et al. 2019; Vasiliev et al. 2021for discussions on the influence of the LMC on stellar streams).
The spatial location and the distance of ≃31 kpc of this secondary stream are consistent with the already known Triangulum/Pisces stream (Tri/Psc; Bonaca et al. 2012; Martin et al. 2013a, 2014). However, the secondary stream found in our study is significantly more extended than previously found (≃50° against 12° in Bonaca et al. 2012 and 13° in Martin et al. 2014). The presence of this secondary stream on Fig. 12 along the Cetus-Palca stream suggests that the Tri/Psc and the Cetus-Palca streams share a common origin, as recently proposed by Bonaca et al. (2021) and Yuan et al. (2021).
To confirm (or disprove) the hypothesis that the Tri/Psc stream formed by the disruption of a globular cluster that was initially around the progenitor of Cetus-Palca, spectroscopic follow-up is required to confirm that those stars do not present a scatter in metallicity, have a similar kinematic behaviour, and present the CNONaAl abundances anti-correlation typical of old globular clusters (Carretta et al. 2010; Bastian & Lardo 2018). Should it be confirmed that this second structure is a second stream, to the best of our knowledge, this would be the first detected tidal stream of an object orbiting around a dwarf galaxy that itself is currently being disrupted. Such a discovery would be particularly useful in constraining the initial profile of the dark matter halo of the Cetus-Palca dwarf galaxy (Malhan et al. 2021).
Two other stream-like structures are visible in Fig. 12, illustrated by the red ellipses in the upper panel. However, regarding the CMD of Structure 1 in Fig. 13 we can see that it is a composition of the CMDs of the two control regions, which indicates that it is a structure of the disc, likely an artefact due to the high number of stars in that region close to the plane of the disc. For Structure 2, a RGB more distant than that of Cetus-Palca is visible, which actually corresponds to the Sgr stream, whose track is shown by the light purple band in Fig. 12.
5. Summary and conclusions
We present a new method to derive accurate spectro-photometric distances with a high relative precision based heavily on supervised machine learning. Specifically, the method uses the griz photometric bands from the Pan-STARRS 3π survey, the G-band from Gaia EDR3, and the effective temperature, surface gravity, and metallicity derived from a given spectroscopic survey. This method will be used to derive distances for the stars observed by the Galactic Archaeology WEAVE surveys (Dalton et al. 2012) through the SPDIST Contributed Data Product. Meanwhile, we applied the technique to SDSS and SEGUE data and derive the distance of approximately 300 000 stars in different evolutionary phases; the catalogue is available on the Vizier service of the CDS. In its application to SDSS and SEGUE data, the relative precision on the spectro-photometric distance is of 13%.
With the 6D sample obtained using these spectro-photometric distances, SEGUE line-of-sight velocities, and Gaia EDR3 proper motions, we searched the integrals-of-motion plane and found a structure corresponding to the Cetus-Palca stellar stream (Newberg et al. 2009), hidden below the location of Sagittarius in the azimuthal and vertical actions (and energy and vertical angular momentum). However, this stellar structure is clearly distinct from Sagittarius in its proper motion and radial velocity in the RA range 10 − 35°.
The good distance precision allowed us to integrate the orbits of these stars and to find the plane of the Cetus-Palca stream, which we then used to perform a detailed analysis, expanding the search for the Cetus-Palca stream to the whole sky using the 5D Gaia EDR3 sample. We measured a total extent of the stream of ≃100° on the sky over 0 to 100° in RA and over −80 to 65° in Dec, overlapping with the Palca over-density detected by Shipp et al. (2018) in the Dark Energy Survey, as predicted by Chang et al. (2020). However, we did not find a northern counterpart, as suggested in the work of Yuan et al. (2019), possibly because of its diffuseness. The detected stream covers a heliocentric distance range of between 25 and 40 kpc, is metal poor, and presents a small scatter in metallicity ([Fe/H] = −1.93 ± 0.21) typical of a dwarf galaxy and consistent with previous measurements.
Our analysis of the luminosity function leads us to estimate the stellar mass of the Cetus-Palca progenitor at 1.5 × 106 M⊙. This stellar mass is consistent with expectation from the stellar mass–metallicity relation of Local Group dwarf galaxies.
We also report the discovery of a second structure almost parallel to the Cetus-Palca stream, covering ∼50° of the sky and located at a similar heliocentric distance to Cetus-Palca. This second structure appears to be a longer segment of the already known Tri/Psc stream. The finding of this stream along the Cetus-Palca stream suggests that it is emanating from a globular cluster that was initially orbiting around the Cetus-Palca progenitor and is now completely disrupted, as recently proposed by Bonaca et al. (2021) and Yuan et al. (2021).
Acknowledgments
The authors thank Rodrigo Ibata and Eduardo Balbinot for useful discussions, as well as the anonymous referee for comments that increased the clarity of the publication. GT acknowledges support from the Agencia Estatal de Investigación (AEI) of the Ministerio de Ciencia e Innovación (MCINN) under grant FJC2018-037323-I. The authors acknowledge financial support through the grant (AEI/FEDER, UE) AYA2017-89076-P, as well as by the Ministerio de Ciencia, Innovación y Universidades (MCIU), through the State Budget and by the Consejería de Economía, Industria, Comercio y Conocimiento of the Canary Islands Autonomous Community, through the Regional Budget.
References
- Anders, F., Khalatyan, A., Chiappini, C., et al. 2019, A&A, 628, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bailer-Jones, C. A. L. 2015, PASP, 127, 994 [Google Scholar]
- Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Mantelet, G., & Andrae, R. 2018, AJ, 156, 58 [Google Scholar]
- Bastian, N., & Lardo, C. 2018, ARA&A, 56, 83 [Google Scholar]
- Battaglia, G., Taibi, S., Thomas, G. F., & Fritz, T. K. 2022, A&A, 657, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Belokurov, V., Koposov, S. E., Evans, N. W., et al. 2014, MNRAS, 437, 116 [NASA ADS] [CrossRef] [Google Scholar]
- Belokurov, V., Erkal, D., Evans, N. W., Koposov, S. E., & Deason, A. J. 2018, MNRAS, 478, 611 [Google Scholar]
- Bonaca, A., Geha, M., & Kallivayalil, N. 2012, ApJ, 760, L6 [NASA ADS] [CrossRef] [Google Scholar]
- Bonaca, A., Naidu, R. P., Conroy, C., et al. 2021, ApJ, 909, L26 [NASA ADS] [CrossRef] [Google Scholar]
- Boubert, D., & Everall, A. 2020, MNRAS, 497, 4246 [NASA ADS] [CrossRef] [Google Scholar]
- Cargile, P. A., Conroy, C., Johnson, B. D., et al. 2020, ApJ, 900, 28 [NASA ADS] [CrossRef] [Google Scholar]
- Carretta, E., Bragaglia, A., Gratton, R. G., et al. 2010, A&A, 516, A55 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Chambers, K. C., Magnier, E. A., Metcalfe, N., et al. 2016, ArXiv e-prints [arXiv:1612.05560] [Google Scholar]
- Chang, J., Yuan, Z., Xue, X.-X., et al. 2020, ApJ, 905, 100 [NASA ADS] [CrossRef] [Google Scholar]
- Choi, J., Dotter, A., Conroy, C., et al. 2016, ApJ, 823, 102 [Google Scholar]
- Chollet, F. 2015, Keras, GitHub, https://github.com/fchollet/keras [Google Scholar]
- Clementini, G., Ripepi, V., Molinaro, R., et al. 2019, A&A, 622, A60 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Conroy, C., Bonaca, A., Cargile, P., et al. 2019a, ApJ, 883, 107 [NASA ADS] [CrossRef] [Google Scholar]
- Conroy, C., Naidu, R. P., Zaritsky, D., et al. 2019b, ApJ, 887, 237 [NASA ADS] [CrossRef] [Google Scholar]
- Conroy, C., Naidu, R. P., Garavito-Camargo, N., et al. 2021, Nature, 592, 534 [NASA ADS] [CrossRef] [Google Scholar]
- Coronado, J., Rix, H.-W., & Trick, W. H. 2018, MNRAS, 481, 2970 [NASA ADS] [CrossRef] [Google Scholar]
- Dalton, G., Trager, S. C., Abrams, D. C., et al. 2012, Proc SPIE, 8446, 84460P [NASA ADS] [CrossRef] [Google Scholar]
- de Boer, T. J. L., Tolstoy, E., Hill, V., et al. 2012, A&A, 539, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- de Vaucouleurs, G. 1958, ApJ, 128, 465 [NASA ADS] [CrossRef] [Google Scholar]
- Deason, A. J., Belokurov, V., & Evans, N. W. 2011, MNRAS, 416, 2903 [NASA ADS] [CrossRef] [Google Scholar]
- Dehnen, W., & Binney, J. 1998, MNRAS, 294, 429 [Google Scholar]
- Dotter, A. 2016, ApJS, 222, 8 [Google Scholar]
- Eilers, A.-C., Hogg, D. W., Rix, H.-W., & Ness, M. K. 2019, ApJ, 871, 120 [Google Scholar]
- Erkal, D., Belokurov, V., Laporte, C. F. P., et al. 2019, MNRAS, 487, 2685 [Google Scholar]
- Fernández-Alvar, E., Allende Prieto, C., Beers, T. C., et al. 2016, A&A, 593, A28 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Fernández-Alvar, E., Kordopatis, G., Hill, V., et al. 2021, MNRAS, 508, 1509 [CrossRef] [Google Scholar]
- Flaugher, B., & Bebek, C. 2014, Proc. SPIE, 9147, 91470S [NASA ADS] [CrossRef] [Google Scholar]
- Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
- Fukushima, T., Chiba, M., Homma, D., et al. 2018, PASJ, 70, 69 [NASA ADS] [CrossRef] [Google Scholar]
- Fukushima, T., Chiba, M., Tanaka, M., et al. 2019, PASJ, 71, 72 [NASA ADS] [CrossRef] [Google Scholar]
- Gaia Collaboration (Prusti, T., et al.) 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Eyer, L., et al.) 2019, A&A, 623, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaia Collaboration (Brown, A. G. A., et al.) 2020, A&A, 649, A1; erratum: 2021, A&A, 650, C3 [Google Scholar]
- GRAVITY Collaboration (Abuter, R., et al.) 2018, A&A, 615, L15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Harris, W. E. 2010, ArXiv e-prints [arXiv:1012.3224] [Google Scholar]
- Haywood, M., Di Matteo, P., Lehnert, M. D., et al. 2018, ApJ, 863, 113 [Google Scholar]
- Helmi, A., & White, S. D. M. 1999, MNRAS, 307, 495 [CrossRef] [Google Scholar]
- Hernitschek, N., Cohen, J. G., Rix, H.-W., et al. 2018, ApJ, 859, 31 [NASA ADS] [CrossRef] [Google Scholar]
- Hogg, D. W., Eilers, A.-C., & Rix, H.-W. 2019, AJ, 158, 147 [Google Scholar]
- Holl, B., Audard, M., Nienartowicz, K., et al. 2018, A&A, 618, A30 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ibata, R. A., Gilmore, G., & Irwin, M. J. 1994, Nature, 370, 194 [Google Scholar]
- Ibata, R., Irwin, M., Lewis, G., Ferguson, A. M. N., & Tanvir, N. 2001, Nature, 412, 49 [Google Scholar]
- Ibata, R. A., McConnachie, A., Cuillandre, J.-C., et al. 2017, ApJ, 848, 129 [NASA ADS] [CrossRef] [Google Scholar]
- Ibata, R., Bellazzini, M., Thomas, G., et al. 2020, ApJ, 891, L19 [NASA ADS] [CrossRef] [Google Scholar]
- Ibata, R., Malhan, K., Martin, N., et al. 2021, ApJ, 914, 123 [NASA ADS] [CrossRef] [Google Scholar]
- Iorio, G., & Belokurov, V. 2019, MNRAS, 482, 3868 [NASA ADS] [CrossRef] [Google Scholar]
- Irwin, M., & Hatzidimitriou, D. 1995, MNRAS, 277, 1354 [NASA ADS] [CrossRef] [Google Scholar]
- Ishigaki, M. N., Hartwig, T., Tarumi, Y., et al. 2021, MNRAS, 506, 5410 [NASA ADS] [CrossRef] [Google Scholar]
- Ivezić, Ž., Sesar, B., Jurić, M., et al. 2008, ApJ, 684, 287 [Google Scholar]
- Jean-Baptiste, I., Di Matteo, P., Haywood, M., et al. 2017, A&A, 604, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Jurić, M., Ivezić, Ž., Brooks, A., et al. 2008, ApJ, 673, 864 [Google Scholar]
- Kingma, D. P., & Ba, J. 2014, ArXiv e-prints [arXiv:1412.6980] [Google Scholar]
- Kirby, E. N., Cohen, J. G., Guhathakurta, P., et al. 2013, ApJ, 779, 102 [Google Scholar]
- Kollmeier, J. A., Zasowski, G., Rix, H. W., et al. 2017, SDSS-V: Pioneering Panoptic Spectroscopy [arXiv:1711.03234] [Google Scholar]
- Koposov, S. E., Rix, H.-W., & Hogg, D. W. 2010, ApJ, 712, 260 [Google Scholar]
- Koposov, S. E., Belokurov, V., Evans, N. W., et al. 2012, ApJ, 750, 80 [NASA ADS] [CrossRef] [Google Scholar]
- Koposov, S. E., Belokurov, V., Li, T. S., et al. 2019, MNRAS, 485, 4726 [Google Scholar]
- Kroupa, P. 2001, MNRAS, 322, 231 [NASA ADS] [CrossRef] [Google Scholar]
- Kruijssen, J. M. D., Pfeffer, J. L., Chevance, M., et al. 2020, MNRAS, 498, 2472 [NASA ADS] [CrossRef] [Google Scholar]
- Lane, J. M. M., Bovy, J., & Mackereth, J. T. 2022, MNRAS, 510, 5119 [NASA ADS] [CrossRef] [Google Scholar]
- Lee, Y. S., Beers, T. C., Sivarani, T., et al. 2008, AJ, 136, 2022 [Google Scholar]
- Liao, S.-L., Qi, Z.-X., Guo, S.-F., & Cao, Z.-H. 2019, Res. Astron. Astrophys., 19, 029 [CrossRef] [Google Scholar]
- Lindegren, L., Klioner, S. A., Hernández, J., et al. 2021, A&A, 649, A2 [EDP Sciences] [Google Scholar]
- Łokas, E. L. 2009, MNRAS, 394, L102 [NASA ADS] [CrossRef] [Google Scholar]
- Longeard, N., Martin, N., Starkenburg, E., et al. 2018, MNRAS, 480, 2609 [NASA ADS] [CrossRef] [Google Scholar]
- Luri, X., Brown, A. G. A., Sarro, L. M., et al. 2018, A&A, 616, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Magnier, E. A., Chambers, K. C., Flewelling, H. A., et al. 2020a, ApJS, 251, 3 [NASA ADS] [CrossRef] [Google Scholar]
- Magnier, E. A., Sweeney, W. E., Chambers, K. C., et al. 2020b, ApJS, 251, 5 [NASA ADS] [CrossRef] [Google Scholar]
- Majewski, S. R., Skrutskie, M. F., Weinberg, M. D., & Ostheimer, J. C. 2003, ApJ, 599, 1082 [NASA ADS] [CrossRef] [Google Scholar]
- Malhan, K., & Ibata, R. A. 2018, MNRAS, 477, 4063 [Google Scholar]
- Malhan, K., Yuan, Z., Ibata, R. A., et al. 2021, ApJ, 920, 51 [NASA ADS] [CrossRef] [Google Scholar]
- Marigo, P., Girardi, L., Bressan, A., et al. 2008, A&A, 482, 883 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Martin, C., Carlin, J. L., Newberg, H. J., & Grillmair, C. 2013a, ApJ, 765, L39 [NASA ADS] [CrossRef] [Google Scholar]
- Martin, N. F., Ibata, R. A., McConnachie, A. W., et al. 2013b, ApJ, 776, 80 [NASA ADS] [CrossRef] [Google Scholar]
- Martin, N. F., Ibata, R. A., Rich, R. M., et al. 2014, ApJ, 787, 19 [Google Scholar]
- McConnachie, A. W. 2012, AJ, 144, 4 [Google Scholar]
- McConnachie, A. W., & Venn, K. A. 2020a, AJ, 160, 124 [Google Scholar]
- McConnachie, A. W., & Venn, K. A. 2020b, Res. Notes Am. Astron. Soc., 4, 229 [Google Scholar]
- McMillan, P. J. 2017, MNRAS, 465, 76 [NASA ADS] [CrossRef] [Google Scholar]
- McMillan, P. J., Kordopatis, G., Kunder, A., et al. 2018, MNRAS, 477, 5279 [NASA ADS] [CrossRef] [Google Scholar]
- Muraveva, T., Delgado, H. E., Clementini, G., Sarro, L. M., & Garofalo, A. 2018, MNRAS, 481, 1195 [Google Scholar]
- Myeong, G. C., Evans, N. W., Belokurov, V., Amorisco, N. C., & Koposov, S. E. 2018a, MNRAS, 475, 1537 [NASA ADS] [CrossRef] [Google Scholar]
- Myeong, G. C., Evans, N. W., Belokurov, V., Sanders, J. L., & Koposov, S. E. 2018b, ApJ, 856, L26 [NASA ADS] [CrossRef] [Google Scholar]
- Myeong, G. C., Evans, N. W., Belokurov, V., Sanders, J. L., & Koposov, S. E. 2018c, ApJ, 863, L28 [NASA ADS] [CrossRef] [Google Scholar]
- Myeong, G. C., Vasiliev, E., Iorio, G., Evans, N. W., & Belokurov, V. 2019, MNRAS, 488, 1235 [Google Scholar]
- Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48 [Google Scholar]
- Newberg, H. J., Yanny, B., & Willett, B. A. 2009, ApJ, 700, L61 [NASA ADS] [CrossRef] [Google Scholar]
- Pace, A. B., & Li, T. S. 2019, ApJ, 875, 77 [NASA ADS] [CrossRef] [Google Scholar]
- Pieres, A., Girardi, L., Balbinot, E., et al. 2020, MNRAS, 497, 1547 [NASA ADS] [CrossRef] [Google Scholar]
- Price-Whelan, A. M. 2017, J. Open Source Softw., 2, 388 [NASA ADS] [CrossRef] [Google Scholar]
- Queiroz, A. B. A., Anders, F., Santiago, B. X., et al. 2018, MNRAS, 476, 2556 [Google Scholar]
- Schlafly, E. F., & Finkbeiner, D. P. 2011, ApJ, 737, 103 [Google Scholar]
- Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 [Google Scholar]
- Schönrich, R., Binney, J., & Dehnen, W. 2010, MNRAS, 403, 1829 [NASA ADS] [CrossRef] [Google Scholar]
- Sesar, B., Hernitschek, N., Mitrović, S., et al. 2017, AJ, 153, 204 [NASA ADS] [CrossRef] [Google Scholar]
- Sestito, F., Longeard, N., Martin, N. F., et al. 2019, MNRAS, 484, 2166 [NASA ADS] [CrossRef] [Google Scholar]
- Shipp, N., Drlica-Wagner, A., Balbinot, E., et al. 2018, ApJ, 862, 114 [Google Scholar]
- Shipp, N., Li, T. S., Pace, A. B., et al. 2019, ApJ, 885, 3 [NASA ADS] [CrossRef] [Google Scholar]
- Simpson, J. D., Martell, S. L., Buder, S., et al. 2021, MNRAS, 507, 43 [NASA ADS] [CrossRef] [Google Scholar]
- Sirko, E., Goodman, J., Knapp, G. R., et al. 2004, AJ, 127, 899 [NASA ADS] [CrossRef] [Google Scholar]
- Starkenburg, E., Martin, N., Youakim, K., et al. 2017, MNRAS, 471, 2587 [NASA ADS] [CrossRef] [Google Scholar]
- Starkenburg, E., Youakim, K., Martin, N., et al. 2019, MNRAS, 490, 5757 [Google Scholar]
- Thomas, G. F., McConnachie, A. W., Ibata, R. A., et al. 2018, MNRAS, 481, 5223 [NASA ADS] [CrossRef] [Google Scholar]
- Thomas, G. F., Annau, N., McConnachie, A., et al. 2019, ApJ, 886, 10 [NASA ADS] [CrossRef] [Google Scholar]
- Tremblay, P.-E., Ludwig, H.-G., Steffen, M., & Freytag, B. 2013, A&A, 559, A104 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Vasiliev, E. 2018, ArXiv e-prints [arXiv:1802.08255] [Google Scholar]
- Vasiliev, E., Belokurov, V., & Erkal, D. 2021, MNRAS, 501, 2279 [NASA ADS] [CrossRef] [Google Scholar]
- Vickers, J. J., Grebel, E. K., & Huxor, A. P. 2012, AJ, 143, 86 [NASA ADS] [CrossRef] [Google Scholar]
- Xue, X. X., Rix, H. W., Zhao, G., et al. 2008, ApJ, 684, 1143 [Google Scholar]
- Xue, X.-X., Rix, H.-W., Yanny, B., et al. 2011, ApJ, 738, 79 [NASA ADS] [CrossRef] [Google Scholar]
- Xue, X.-X., Ma, Z., Rix, H.-W., et al. 2014, ApJ, 784, 170 [NASA ADS] [CrossRef] [Google Scholar]
- Yam, W., Carlin, J. L., Newberg, H. J., et al. 2013, ApJ, 776, 133 [NASA ADS] [CrossRef] [Google Scholar]
- Yanny, B., Newberg, H. J., Johnson, J. A., et al. 2009, ApJ, 700, 1282 [NASA ADS] [CrossRef] [Google Scholar]
- Yuan, Z., Smith, M. C., Xue, X.-X., et al. 2019, ApJ, 881, 164 [NASA ADS] [CrossRef] [Google Scholar]
- Yuan, Z., Malhan, K., Sestito, F., et al. 2021, The Complexity of the Cetus Stream Unveiled from the Fusion of STREAMFINDER and StarGO [arXiv:2112.05775] [Google Scholar]
- Zhao, G., Zhao, Y., Chu, Y., Jing, Y., & Deng, L. 2012, LAMOST Spectral Survey [Google Scholar]
Appendix A: Spectro-photometric distances with a different training set
Here, we present the improvement on the distance measurement made the by the addition of the K-giant catalogue of Xue et al. (2014) and of the BHB catalogue from Xue et al. (2008) to the training sample of the ANN used to derive spectro-photometric distances, which motivated our choices in the construction of the final training sample used by the ANN presented in Sect. 2.
Figure A.1 presents the expected and predicted CMDs for the different training samples made with the addition of K-giants and BHBs to stars selected from their parallax. The general accuracy of the ANN is the same for the different training samples because they are composed mainly of MS stars (> 80%, but the actual number differ slightly between the training samples), and these are the same between the different training samples. However, the global shape of the CMDs is different, especially for the top part of the RGB (MG ⪅ −1). Indeed, without the inclusion of the K-giants to the training sample, the predicted absolute magnitudes tend to a plateau around MG = −1, and the RGB is globally less well-defined. This is because in the training sample based only on the parallaxes, the stars on the RGB have a low relative precision on their parallax. Nevertheless, it is interesting to see that for all cases, the ANN improves the shape of the predicted CMD compared to the expected one, with a clear separation between giants and subgiants for example (around 0.8≤(BP-RP)0 ≤ 1.2 and 2≤ MG ≤ 4).
![]() |
Fig. A.1. Expected and predicted colour-magnitude diagram for the different training set we explored. Upper panels show the expected (BP-RP)0 – MG CMDs for a training sample composed only of stars that respect the parallax criteria presented in Sect. 2.2 (on the left), with the inclusion of K-giants (in the middle), and with the inclusion of both K-giants and BHBs (on the right). We only show the stars in common between the different samples, because the number of stars used to train the ANN are different from one training sample to another because of the different constraints used to create them. The lower panels show the corresponding CMDs with the absolute magnitude MG predicted by the ANN. We can clearly see the improvement made by the inclusion of the different catalogues on the global shape of the CMD, especially from the K-giants. |
The inclusion of the BHBs in the training sample has a small effect in that it leads to a better definition of the horizontal branch. However, its effect is far from marginal, since without its inclusion the absolute magnitude predicted for the BHB stars is on average ≃0.2 magnitude fainter than measured with the distance calibration of Deason et al. (2011), leading to systematic overestimation of the distance (or accuracy error) by ≃10%.
Appendix B: Precision of the spectro-photometric distances as function of Teff, log(g), and [Fe/H]
The mean relative precision on the distance predicted by the ANN presented in Sect. 2.2 is 13%, or 0.27 mag, but the actual precision varies for stars with different effective temperature, surface gravity, and metallicity. To analyse the precision of the spectro-photometric distances over these three parameters, we selected stars of the training sample located in different regions of the parameter space, and we fitted a Gaussian function to their ΔMG = Mtrue − Mpredicted histogram. The variation of the precision of the residual absolute magnitude with the effective temperature, surface gravity, and metallicity is presented in Fig. B.1.
![]() |
Fig. B.1. Variation in the residual between the ‘true’ and predicted absolute magnitudes with the effective temperature (upper panel), the surface gravity (middle panel), and the metallicity (lower panel). The dotted line shows cases where the predicted and the ‘true’ absolute magnitude are equal. The red continuous line shows the variation in this residual as a function of these parameters. In the middle panel, the blue dots show the evolution of the residual for the K-giant stars of the catalogue of Xue et al. (2014), and the green triangle shows it for the 229 stars between 2.5 < log(g) < 3.0 dex and with relative precision on their parallax of ϖ/δϖ≥10. |
B.1. Precision as a function of the effective temperature
The variation of the residual between the ‘true’ and predicted absolute magnitude as a function of the effective temperature is presented in the upper panel of Fig. B.1, where the bins in effective temperature have a width of 500 K. One can see that between 4000 < Teff < 7500 K, the residual is relatively flat with a typical scatter of 0.26 mag, slightly rising to a scatter of 0.36 mag at Teff = 7250 K due to the low number of stars in that range, making a good fit to the distance in that range more difficult. Beyond 7500 K, the precision increases drastically, with the scatter between the ‘true’ and the predicted absolute magnitude going down to 0.05 mag. This is because in that range, more than two-thirds of the stars of the training sample are BHBs that have a low intrinsic scatter (< 0.1 mag Deason et al. 2011), making it easier to obtain a precise predicted absolute magnitude for these stars.
B.2. Precision as function of the surface gravity
The middle panel of Fig. B.1 shows the evolution of the residual absolute magnitude with the surface gravity, where the bins in surface gravity have a width of log(g) = 0.5 dex. The residual is relatively constant over the range of surface gravity covered by SEGUE, except toward log(g) = 0.5 dex where the predicted magnitudes are overestimated by 0.18 mag and the scatter of the residual goes to 0.42 mag. This is due to the very low number of stars in that range and to the poor determination of the surface gravity by the SSPP in that range, as illustrate the numerous clumps of stars with exactly the surface gravity values in that surface gravity range. It should be noted here that the trend for the K-giants, shown by the blue dots, is similar to the trend of the global training set for log(g) > 2.5 dex because they constitute the majority of the training sample in that range. Between 2.5 < log(g) < 3.0 dex, the difference between the ‘true’ and the predicted absolute magnitude is also abnormally high, with a scatter of 0.58 mag. In that range of surface gravity, the large majority of the stars of the training sample (87%) have been selected based on their absolute parallax precision (δϖ< 0.07 mas), and are not part of the K-giant catalogue of Xue et al. (2014) or of the BHB catalogue of Xue et al. (2008). For this reason, the ‘true’ absolute magnitudes of the stars of the training sample are not well-defined in that surface gravity range, with an average uncertainty on the ‘true’ absolute magnitude of δMG = 0.94 mag, rather than δMG = 0.26 mag for the global training sample. This low precision of the ‘true’ absolute magnitudes alone explains the increase in the scatter of the residual. However, the precision on the predicted absolute magnitude in that range of surface gravity is likely similar to that obtained for the rest of the stars because we found a scatter of the residual of 0.30 mag for the 229 (over 3, 715) stars with ϖ/δϖ≥10 located in that region, but with predicted absolute magnitudes systematically underestimated by 0.1 mag, as showed by the green triangle in Fig. B.1. In comparison, for the 355 K-giants located in that region, the predicted absolute magnitudes are systematically overestimated by 0.2 mag, with a scatter on the residual of 0.37 mag. However, this comparison has to be taken with caution because the K-giants in that region have a typical uncertainty on their ‘true’ absolute magnitude of δMG = 0.42 mag, which is higher than for the rest of the K-giants (MG = 0.32 mag). Regarding the results of this analysis, we cannot definitively conclude that the precision on the absolute magnitude (distance) between 2.5 < log(g) < 3.0 dex is similar to that obtained for other surface gravity ranges, and the distances estimated for the stars in that range also have to be taken with caution.
B.3. Precision as function of the metallicity
The variation residual absolute magnitude between the ‘true’ and the predicted one as a function of the metallicity is presented in the lower panel of Fig. B.1, where the bins in metallicity have a width of [Fe/H] = 0.1. The scatter is constant (typically of 0.28 mag) over almost the full range of metallicities, with a small increase towards the low metallicity edges caused by the low number of stars in that region and the imprecision of the metallicities predicted by the SSPP for extremely metal-poor stars (see Fernández-Alvar et al. 2016; Starkenburg et al. 2017). Around [Fe/H]≃ − 0.1, the scatter on the residual is of 0.17 mag, which is lower than for other metallicities because in that region the stars of the training sample have a typical uncertainty on the ‘true’ absolute magnitude of δMG = 0.15 mag, which is lower than for the rest of the stars of the training sample, which have δMG = 0.26 mag.
All Tables
Parameters of the GALPOT potential listed here, composed of three disc components corresponding to the thin and thick discs and the interstellar medium, and two halo components corresponding to the bulge and the dark matter halo.
All Figures
![]() |
Fig. 1. Colour–magnitude diagram of the training sample with the expected absolute magnitude in the Gaia G-band shown in the left panel and the absolute magnitude predicted by the algorithm in the right panel. It is striking to see the RC on the predicted CMD so clearly (highlighted by the red box), while this feature it is less obvious on the expected CMD. We also highlighted with the blue rectangle the location of the BHB stars, which were added in the training sample as explained in Sect. 2.2. |
In the text |
![]() |
Fig. 2. Comparison of the ‘true’ (MG, true) and predicted (MG, pred) absolute magnitude for the stars of the training sample. Left panel: the true and derived quantities plotted against each other. The one-to-one relation is shown as a solid line, and the dashed and dotted lines correspond to the 1 − σ and 2 − σ deviations, respectively. Right panel: the distribution of the difference between the true and derived absolute magnitude, with a Gaussian fit overlaid. A scatter of 0.27 mag corresponds to a relative uncertainty on the distance of 13%. |
In the text |
![]() |
Fig. 3. Top panel: distance modulus of the Sgr RR Lyrae as a function of longitude along the plane of the Sagittarius stream, as defined by Majewski et al. (2003) (in grey). The blue error bars show the mean and standard deviation of the RR Lyrae DM as a function of the Λ⊙ position, and are used to define the Sgr DM track represented by the red cubic spline function. Bottom panels: difference between the absolute magnitude derived by the DM ridgeline and the absolute magnitude predicted by our algorithm as a function of the predicted absolute magnitude for stars along the Sgr stream (left). On the right panel, the red histogram shows the normalised residuals of the DM of the RR Lyrae to the fitted ridgeline along the Sgr stream. |
In the text |
![]() |
Fig. 4. Rotation curve of the Milky Way model used in this work (thick line) together with the measured rotation curve of our Galaxy by Eilers et al. (2019) (red dots). The thin blue lines show the rotation velocity at the solar radius (229 km s−1 at 8.129 kpc). |
In the text |
![]() |
Fig. 5. Azimuthal and vertical actions of the stars from the SEGUE dataset (left), and corresponding energy-vertical angular momentum diagram (right). The green lines mark the locus of circular orbits in the Milky Way model. The red ellipse on the left panel shows clearly visible structure corresponding to the Sagittarius stream. The stars inside that selection are highlighted in red in the right panel. The stars between the red and blue ellipses in the left panel are used to estimate the background contamination, which is used in Fig. 6. |
In the text |
![]() |
Fig. 6. Position on the sky, heliocentric distance, proper motions, line-of-sight, and Galactocentric radial velocities of the stars from the structure that we identified in Fig. 5 (red symbols). The light-blue and purple points are the stars identified as part of the Sagittarius stream by Ibata et al. (2020) and Vasiliev et al. (2021), respectively. Lower panel: another structure, highlighted by the orange rectangle, stands out around RA ∼ 25°, and is made up of stars that have clearly distinct galactocentric radial velocities from both the Sagittarius stream and the contaminating background stars (mostly originating from the Gaia-Enceladus-Sausage) shown by the black contours in each panel. The stars from this structure, which appears to be the Cetus stream, are shown by the orange stars in the other panels; they also form a distinctly visible structure in μα, * and μδ, * |
In the text |
![]() |
Fig. 7. Upper panel: metallicity distribution of the Cetus stream spectroscopic sample. The grey histogram shows the metallicity of the four stars that were rejected by the sigma-clipping method. Lower panel: distance distribution of the Cetus stream spectroscopic sample. The distribution is clearly asymmetric, with a tail towards shorter distances, which could be an indication of a distance gradient along the stream. |
In the text |
![]() |
Fig. 8. Orbits of the Cetus stream spectroscopic sample stars integrated over 1 Gyr backward and forward. The blue orbits are the ones used to find the natural coordinates of the Cetus stellar stream. |
In the text |
![]() |
Fig. 9. Aitoff projection of the sky in Galactic coordinates; the area used to search for the Cetus stream is represented in grey. The region near the Galactic disc (|b|< 10°), within the dashed lines, is excluded from the search because of the high extinction and the large number of contaminant disc stars. The blue lines correspond to the orbits of the stars from the Cetus spectroscopic sample used to find the plane of the Cetus stream. The current location of these stars is indicated by the orange stars. |
In the text |
![]() |
Fig. 10. Colour–absolute magnitude relation for BHB stars in the Gaia EDR3 photometric system. The orange line corresponds to the relation found in this work (Eq. (1)) by fitting a polynomial on the BHB population by Xue et al. (2011) (grey points). The stars in the blue area are excluded from the fit as they are likely misidentified as BHB. Lower panels: residuals of the absolute magnitude to the fitted relation. |
In the text |
![]() |
Fig. 11. Distance distribution of selected BHB stars through a succession of increasingly restrictive kinematic selections. Panel a: distance distribution of BHB stars prior to any kinematic selections, where the orange stars show the positions of the 56 spectroscopic stars whose orbits have been used to define the orbital plane of the Cetus stream. Panel b: same distribution for stars after a first broad kinematic cut performed in the corrected proper motion, illustrated by the black lines in panel e, where the blue lines show the orbits of the spectroscopic stars used to define this cut. Panel c: distance distribution of BHB stars after an additional wide selection based on their non-corrected proper motions, as shown in panel f (see Sect. 4.2.1). Panel d: distribution of the BHBs of the Cetus stream (in purple) after two additional more restrictive selections to remove the contamination of foreground and background stars performed in the non-corrected proper motion space shown in panel g and in the position-corrected parallel proper motion space shown in panel h. In all panels, ϕ1 and ϕ2 refer to the ‘natural’ coordinates of the Cetus stream (see Sect. 4.1). |
In the text |
![]() |
Fig. 12. Lower panel: position of the stars with PCetus ≥ 0.2. Upper panel: The black circles show the position of the BHB of Cetus-Palca identified in Sect. 4.2.2, and the yellow stars show the position of the 56 stars of the Cetus-Palca spectroscopic sample with common orbital behaviour. The blue rectangle highlights the position of the Cetus-Palca stream visible in the lower panel. The stream is decomposed into seven boxes of equal area used to measure the spatial variation of the CMDs in Fig. 13. The cyan polygon highlights the position of the potential globular cluster stream that we find in this work. The two purple dashed rectangles highlight the region located outside the Cetus-Palca stream used to show the MW foreground and background CMD. The red dashed ellipses show the positions of two additional structures visible in the lower panel. The light purple band shows the track of the Sgr stream. Finally, the black and white 2D histogram in the background shows the number of good observations along the scan direction of Gaia (ASTROMETRIC_N_GOOD_OBS_AL). |
In the text |
![]() |
Fig. 13. Colour–magnitude diagrams of the different spatial regions shown in Fig. 12 in which the black line shows a MIST fiducial isochrone for a 14 Gyr-old population with a metallicity of [Fe/H] = −1.93, shifted by the DM indicated in the legends. For structures 1 and 2, the DM is 17.7. The colours of the symbols reflects the colour coding of the structures highlighted in Fig. 12. |
In the text |
![]() |
Fig. 14. Panel a: CMD of the stars located in the Cetus-Palca selection box (blue rectangle in Fig. 12) overlaid with a 14 Gyr isochrone with a metallicity of [Fe/H] = −1.93 (in orange). The black plain polygon shows the location of the RGB stars with the region where Gaia is complete, while the region where Gaia is not complete is shown by the dashed line polygon. Panel b: luminosity function of the RGB used to estimate the stellar mass of the Cetus-Palca stream (in blue), with the area affected by the drop in completeness (in grey). The orange histogram shows the luminosity function of a 1.5 × 106 M⊙, [Fe/H] = −1.93, 14 Gyr-old RGB population following a Kroupa mass function. |
In the text |
![]() |
Fig. A.1. Expected and predicted colour-magnitude diagram for the different training set we explored. Upper panels show the expected (BP-RP)0 – MG CMDs for a training sample composed only of stars that respect the parallax criteria presented in Sect. 2.2 (on the left), with the inclusion of K-giants (in the middle), and with the inclusion of both K-giants and BHBs (on the right). We only show the stars in common between the different samples, because the number of stars used to train the ANN are different from one training sample to another because of the different constraints used to create them. The lower panels show the corresponding CMDs with the absolute magnitude MG predicted by the ANN. We can clearly see the improvement made by the inclusion of the different catalogues on the global shape of the CMD, especially from the K-giants. |
In the text |
![]() |
Fig. B.1. Variation in the residual between the ‘true’ and predicted absolute magnitudes with the effective temperature (upper panel), the surface gravity (middle panel), and the metallicity (lower panel). The dotted line shows cases where the predicted and the ‘true’ absolute magnitude are equal. The red continuous line shows the variation in this residual as a function of these parameters. In the middle panel, the blue dots show the evolution of the residual for the K-giant stars of the catalogue of Xue et al. (2014), and the green triangle shows it for the 229 stars between 2.5 < log(g) < 3.0 dex and with relative precision on their parallax of ϖ/δϖ≥10. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.