Open Access
Issue
A&A
Volume 687, July 2024
Article Number A76
Number of page(s) 24
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202349026
Published online 27 June 2024

© The Authors 2024

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

The vastness of the Universe and the sheer number of galaxies it contains makes it necessary to group those showing similar characteristics and study the groups’ properties as a whole rather than investigating each object individually. Such an approach is also encouraged in the modern context by the skyrocketing capacity of data collection, which will reach unprecedented heights in the coming years with such missions as Euclid (Racca et al. 2016) and LSST (LSST Science Collaboration 2017). As an increasing amount of data is being collected, automated classification procedures are becoming a necessity, whether it be as a means to study galaxies or, on a more practical aspect, as a tool to automatically and efficiently label and store new observations within large surveys.

Galaxies showcase a wide variety of shapes and structures, and they were initially used to propose the first classification system by Hubble (1926), distinguishing elliptical, lenticular, spiral, and irregular galaxies. The system was later refined by de Vaucouleurs (1959) and Sandage (1961), introducing further discrimination. Morphological classification of galaxies is still commonly used in the present day (e.g. Baillard et al. 2011; Cheng et al. 2023; Huertas-Company & Lanusse 2023) because it correlates well with certain physical properties, such as colour (e.g. Moutard et al. 2016b; Siudek et al. 2022b) or stellar content (e.g. Kennicutt 1992). Nonetheless, such classifications remain somewhat subjective and are not directly inferred from the physical characteristics of galaxies. In addition, morphological classifications are mostly limited to the local Universe, where galaxies’ shapes and structures are well resolved and defined. Beyond the local Universe, only the Hubble Space Telescope and the recently launched James Webb Space Telescope are able to provide images with sufficient spatial resolution to resolve the structure of small high-redshift galaxies (e.g. Huertas-Company et al. 2015, 2024). In addition to morphology, photometric properties and spectral features have proven to be useful attributes to separate different populations of galaxies. The Baldwin-Phillips-Terlevich (BPT) diagram (Baldwin et al. 1981) is widely used to distinguish between star-forming galaxies, Seyfert galaxies, and low-ionisation nuclear emission-line regions (LINERS) using [O III], [N II], Hα, and Hβ line ratios. However, the application of such traditional diagnostics is limited at redshifts z > 0.4 since optical emission lines such as Hα and [NII] are shifted out in the near-infrared. Other diagnostic diagrams, such as the mass-excitation diagram (Juneau et al. 2011), have been introduced as an alternative for intermediate redshifts. Rest-frame colour magnitudes are also used to separate red, passive “green valley” galaxies and blue star-forming galaxies in colour-colour diagrams such as the (NUV-r)-(r-K) (hereafter NUVrK) introduced in Arnouts et al. (2013). But such classification methods can hardly distinguish more than a few classes and only make use of a small number of parameters, which cannot realistically reflect all the diversity of galaxies in the Universe (e.g. Siudek et al. 2018b, 2022a).

An alternative approach is to use raw spectra, which encompass a great amount of information related to the intrinsic physical characteristics of galaxies, as the basis for classification. Kennicutt (1992) built an atlas of galaxy spectra but did not introduce a new classification scheme per se and used morphological types instead. In Dobos et al. (2012), they built a template of mean galaxy spectra, which has the advantage of increasing the signal-to-noise ratio and thus revealing structures that could be undetectable on individual spectra. However, they did not build their classification using the raw spectra but instead used a colour, morphology, and nuclear activity classification from the work of Lee et al. (2008) and extended it to further separate spectral line strengths. Similarly, Wang et al. (2018) proposed an atlas of composite spectra of galaxies but based their classification on spectral features and diagnostic diagrams. In fact, basing a classification on raw spectra is a challenging task due to the large number of parameters (i.e. the monochromatic fluxes), which requires the use of advanced machine-learning techniques and statistical methods specialised in performing such tasks (Fraix-Burnet et al. 2015).

Supervised classification techniques are used in a wide variety of applications in astrophysics (see Huertas-Company & Lanusse 2023, for a review). However, these techniques require learning labelled databases, and therefore, they require a priori knowledge about the expected classes, which we do not have for galaxy spectra. Unsupervised methods, on the other hand, are more suited for this task. They are designed to look for discriminant patterns in the data and find clusters without prior knowledge and expectations. The k-means algorithm (MacQueen 1967) is perhaps one of the most popular unsupervised classification techniques, and it sorts objects using centroids and distance minimisation. Sánchez Almeida et al. (2010) used the k-means algorithm to classify galaxy spectra from the Sloan Digital Sky Survey (SDSS), but De et al. (2016) argue that the method is not suited for high-dimensional clustering, as it suffers from the curse of dimensionality (Bellman 1957). This issue was avoided in Marchetti et al. (2013) by combining a principal component analysis (PCA) for dimension reduction and the k-means algorithm to obtain a spectral classification of galaxies from the VIMOS Public Extragalactic Redshift Survey (VIPERS). However, the PCA extracts axes of maximum variance and does not necessarily form a discriminant sub-space with its components (Chang 1983). In addition, as in all unsupervised approaches, the number of clusters in the k-means algorithm must be given as an input and should be chosen according to objective criteria (as in De et al. 2016).

The Fisher-EM algorithm (Bouveyron & Brunet 2012) deals with all the issues above. It is based on a Gaussian mixture model (GMM) in a sub-space optimised for the clustering using an objective criterion to decide altogether the best statistical model, the latent discriminant sub-space, and the number of clusters. This method is shown to be very efficient with high-dimensional data, and it outperforms the most commonly used methods, including k-means (Bouveyron & Brunet 2012). In astrophysics, Fisher-EM was used by Siudek et al. (2018a) to classify a dataset of 50 000 galaxies and broad-line active galactic nuclei (AGNs) on the basis of 12 photometric magnitudes and spectroscopic redshift. They obtain 11 classes of galaxies with distinct properties and an additional class of AGNs. The same approach was able to re-create the spectral diversity using only photometric information (Siudek et al. 2018a), to reveal populations hardly distinguishable based on the standard methods (Lisiecki et al. 2023; Siudek et al. 2023a, b), and to study synergies between galaxy populations at two cosmic epochs (Turner et al. 2021). The power of Fisher-EM was revealed in high dimension with the spectral classification of a dataset of 700 000 local galaxies from the SDSS (Fraix-Burnet et al. 2021). It leads to 86 classes with little dispersion in most classes. together with the identification of weird, noisy, or misclassified spectra in some specific classes. As shown in Dubois et al. (2022) using simulated galaxy spectra, Fisher-EM is capable of retrieving classes of galaxies sharing similar physical properties on the sole basis of the spectra and in a fully data-driven manner.

This present work aims at extending the classification of galaxy spectra of Fraix-Burnet et al. (2021) to higher redshifts using the VIPERS database. Our goal is similar to that of Marchetti et al. (2013) but we use a larger dataset and the more sophisticated Fisher-EM algorithm.

This paper is organised as follows: in Sect. 2, we present the data. In Sect. 3, we describe the classification algorithm Fisher-EM as well as the data preparation procedure. In Sect. 4, we present the classification results and study the evolution of the classes throughout cosmic time. In Sect. 5, we discuss the limitations of this work, and finally, the summary is given in Sect. 6. Throughout the paper, we assume the cosmological parameters with H0 = 70 km s−1, ΩM = 0.3, and ΩΛ = 0.7.

2. Data

This study uses the public data release 2 (PDR-2) of VIPERS, a spectroscopic survey carried at the ESO 8m Very Large Telescope targetting galaxies from the CFHTLS Wide Photometric survey (Scodeggio et al. 2018). The original dataset contains 86 775 spectra of objects brighter than 22.5 mag in the z ∼ 1 vicinity. The spectra cover wavelengths from 5500A to 9500A, and were observed at a resolution of R = 220 within the W1 and W4 fields, covering an area of ∼24 deg2 (for further details on the survey, see Garilli et al. 2014; Guzzo et al. 2014; Marchetti et al. 2017). They were shifted back to their rest-frame and resampled using the same method as in De et al. (2016) and Fraix-Burnet et al. (2021), which consists in doubling the sampling rate beforehand to preserve spectral features. The large range of redshifts results in a common rest-frame wavelength range from 3900 to 4300 Å. All spectra were normalised by their rest-frame average value between 4150 Å and 4250 Å, a spectral region with no prominent features and common to the entire dataset.

In addition to the spectroscopic measurements, we use the same physical parameters as used in Siudek et al. (2018b) for the interpretation of the results. They include spectral features (4000 Å break, hereafter D4000, emission-line measurements), observed magnitudes from the VIPERS Multi-Lambda Survey (Moutard et al. 2016b), absolute magnitudes, stellar masses and star-formation rates (SFRs) derived from spectral energy distribution (SED) fitting with the code “Le Phare” (Arnouts et al. 1999; Ilbert et al. 2006). For further information on the SED fitting process, please refer to Moutard et al. (2016a).

In this work, we excluded stars and limited the study to galaxies observed in the redshift range 0.4 < z < 1.2. We also excluded galaxies whose spectroscopic redshift measurements are not secure, that is, obtained from a single or weak features for which the photometric redshift is absent or incompatible1 (Scodeggio et al. 2018). The remaining galaxies have a spectroscopic redshift confidence level of either 99% or  90% but within at least 2σ of the photometric redshift measurement. The resulting subset contains 79 224 galaxies with a median redshift of 0.7.

3. Method

3.1. Masks and redshift bins

The VIPERS observations are ground-based, and the raw spectra are therefore contaminated by skylines. The VIPERS-DR2 provides cleaned spectra, from which skylines have been subtracted. These cleaned spectra are masked and reconstructed in regions where the sky residuals are above an empirically chosen threshold (Marchetti et al. 2017). The spectra reconstruction is done using a principal component analysis (PCA) and is not recommended for scientific analysis as it is not capable of accurately reconstructing spectral features other than continuum. We also note that the reconstructed chunks visually stand out from the rest of the spectra by their lack of noise, as well as the sporadic presence of some peculiar features. Such particularities could be interpreted by the classification algorithm as discriminative features, although they are non-physical and irrelevant to the study. To prevent this, we decided to clip these masked regions off in the clustering analysis. However, the classification algorithm requires that all spectra must be sampled identically, meaning that if one spectrum has a masked portion, this mask has to be transposed to all other spectra too. To minimise the share of masked wavelengths, it is thus necessary to split the dataset into smaller samples2 defined by bins of redshift. This splitting is also made necessary to increase the overlap between spectra (see Sect. 2), and has the interesting benefit of allowing an epoch-by-epoch analysis and study of how galaxies change over cosmic time.

We chose to define the redshift bins such that the cosmic-time step between two consecutive bins is constant. The smaller the time step is, the less information is lost from masking and de-redshifting. But decreasing the time step also decreases the number of galaxies in each bin. As such, a compromise must be found between conserving as much information as possible and having large enough bins to properly sample the diversity and proportion of the populations of galaxies. The latter is difficult to assess without a priori knowledge of the expected populations of galaxies. However, for each bin the sample size is also constrained from an algorithmic standpoint. In fact, most generative clustering methods require the sample size n to be greater than the number of sampled wavelength p (for an overview of this problem, see Chap. 18 of Hastie et al. 2009). While Fisher-EM can overcome that limit, we chose to use it as a lower bound for the sample size. Under the constraints of minimising the constant time-step between successive bins and having n > p for each sample, the optimum was found to be 26 bins separated by a time-step of 163 Myr (Table 1). Despite our efforts to limit the loss of information due to masking, the mask-rate remains close to 50%, meaning that roughly half of the spectra are masked before being classified. However, we show in Sect. 4.1 that our method manages to consistently overcome this limitation. Because the VIPERS dataset is not uniformly distributed in redshift, the sample size varies by one order of magnitude between the smallest (bin 1 with 491 galaxies) and biggest bin (bin 11 with 5504 galaxies).

Table 1.

Characteristics of the sub-samples: range of redshift and epoch, rest-frame spectral range, masking rate, sample size, and number of classes yielded.

3.2. The Fisher-EM algorithm

We used the unsupervised clustering algorithm Fisher-EM (Bouveyron & Brunet 2012) to classify the galaxy spectra. This algorithm is particularly well suited for high-dimensional data, as it outperforms most common unsupervised classifiers that suffer from the curse of dimensionality. Fisher-EM models the data with a GMM designed on the assumption that the data can be represented as resulting from a total probability density function f that can be written as the weighted sum of K multivariate Gaussian probability density functions Φ (Eq. (1)):

f ( K , θ ) = i = 1 K π i Φ ( μ i , Σ i ) , $$ \begin{aligned} f(K, \theta ) = \sum _{i=1}^K \pi _i \Phi (\mu _i, \Sigma _i), \end{aligned} $$(1)

with θ the parameters to be estimated, that is, for each cluster i, its weight is πi, its mean is μi and its covariance matrix is Σi.

The GMM parameters are commonly estimated using the expectation-maximisation (EM) algorithm, but it does not perform well with high dimensional data. Fisher-EM offers a workaround to this issue by providing a dimension reduction step to the regular EM algorithm, where the data are projected onto a discriminative latent sub-space of dimension d = K − 1. It assumes the existence of d latent variables in which most of the discriminative information is encoded such that the observed data Y and latent data X are linked by a linear transformation (Eq. (2)):

Y = U X + ϵ , $$ \begin{aligned} Y = UX + \epsilon , \end{aligned} $$(2)

with as U the projection matrix and ϵ as a noise term assumed to be Gaussian and centred around zero. This approach not only reduces the dimension by a great amount (two orders of magnitude in this study, p ≈ 103 and d ≈ 10) but also increases the separability of the classes much more effectively than a regular PCA. This is achieved by optimising the projection matrix U such that it maximizes the ratio of between-class variance to within-class variance, thus making the clusters compact and distinct from each other.

As such, Fisher-EM combines GMM and dimension reduction into a single model that is called a discriminative latent mixture (DLM) model. Its parameters are estimated with a 3-step loop: (i) in the E-step, the membership probabilities of all the observations are computed given the current estimation of θ; (ii) the F-step computes the projection matrix U of the discriminative latent sub-space; and (iii) the M-step updates the estimation of the model parameters θ given the new membership probabilities computed in the E-step and the new projection matrix U computed in the F-step.

3.3. Model selection

There are 12 variations of the DLM model, and they were obtained by adding more or fewer constraints on the degrees of freedom of the model: AB, ABk, AjB, AjBk, AkB, AkBk, AkjB, AkjBk, DB, DBk, DkB, and DkBk. Each model has its own specificities and may better fit certain datasets. For example, the DkBk model is the most complex, with no constraints on the DLM parameters. The DkB model assumes the noise is identical for each cluster, and the DBk model assumes that the covariance matrix is the same across all clusters. The combination of both previous assumptions results in the DB model. Similarly, assuming that the covariance matrix is diagonal leads to the AkjBk model, and adding an isotropic condition on the covariance matrix leads to the AkBk model. A detailed mathematical description of all DLMs can be found in Bouveyron & Brunet (2012).

The free parameters (i.e. the DLM model, its parameters θ, and the number of clusters K) are optimised by maximising the integrated completed likelihood (ICL), a penalised variation of the regular likelihood suited for high dimensional clustering (Biernacki et al. 2000). We find that the best forming DLM model on the VIPERS spectra for all samples is DkBk, and we therefore use it for all the classifications in this study. We note that this model is the most complex available in Fisher-EM, and offers a cluster-dependent modelling of the variance and noise. On the contrary, the optimal number of clusters differs from one sample to another.

3.4. Classification procedure

Each sample was individually classified using Fisher-EM. For all the bins, the number of classes (that we will call the main classes) was rather low, with significant dispersion, so that we performed a sub-classification to increase the discrimination between spectra. This is justified by the fact that the classes are found in a discriminative latent sub-space specifically optimised for the sample under study. The larger and more diverse the sample is, the harder it is to find a projection matrix that takes into account all the discriminative features of the spectra. Thus, it only separates the populations of spectra with the most obvious differences within the sample (see e.g. Fraix-Burnet et al. 2021). With sub-classification, a more specific latent sub-space is found for each main class, and a much more refined classification is made possible. We found that it was not necessary to sub-classify all the main classes, as some of them were smaller with lesser dispersion. As such, only the classes larger than 5% of the sample size were sub-classified.

4. Results

4.1. Classification of the samples

The classification procedure described in Sect. 3.4 yields from 20 to 43 classes per sample, with the exception of eight classes (Table 1) for the fourth bin where the location of the masks completely covers up the Hγ and Hβ lines, as well as partially the O III line, hence significantly reducing the discriminative information available within the spectra of that sample. Indeed, the most influential factor for the number of classes is found to be the sample size, with a Pearson correlation factor of r = 0.75. The larger the sample size, the more diversity in the spectra from a statistical standpoint, leading to additional classes. The spectral coverage is another defining factor for the number of classes, as it dictates the characteristic features visible in the spectra. However, because of the spectral overlap of the samples as well as the intrinsic redundancy of information within spectra, it is found not to affect the number of classes significantly (r = −0.1). The masks, while they can be problematic when masking out important emission or absorption lines, do not affect much the classifications (r = 0.05) indicating again that there are enough redundancies and correlations between data points.

The resulting classifications can be visualised by the mean spectra of the classes, which gave us a first insight into the kind of galaxies that each class separates. The results for all the samples can be found in Appendix A. We here detail the first sample (bin z = 0.41) and its 20 classes (Fig. A.1) for illustration. Aside from a few outlier classes, the spectra are rather well distributed among the classes, and the within-class dispersions are relatively low in most cases, indicating that the classes do, in fact, efficiently gather similar spectra. In this sample, the Hβ and O III lines are present and play a significant discriminative role in the classification process. On a side note, computing the mean spectra greatly improves the signal-to-noise ratio (S/N), and reveals smaller features like Hδ (see the mean spectrum C2 on the right bottom panel in Fig. A.1). Essentially three main types of classes emerge visually: red passive galaxies (e.g. classes A and D), blue galaxies with intense emission lines (e.g. classes B and C), and outliers (the less populated classes characterised by spectra with low S/N or fringing, e.g. class A1). Within each of these three visual categories, further segregation arises, mostly based on continuum slope and line intensities. We can note, however, that the dispersion in line intensities is generally higher than in the continuum.

The same kind of analysis performed on the 25 other samples (Appendix A) confirms that the classifications properly gather galaxies sharing similar spectral features with satisfying dispersion given the S/N of the data. In particular, the classes separate the slope of the continuum as well as the relative importance of the prominent lines and the Balmer break. However, the discriminative features gradually evolve as we progress through the bins due to the change in the rest-frame spectral window.

To go one step further in understanding the results, we can look at the distribution of various properties within the classes. Some of these properties, as described in Sect. 2, are directly derived from the VIPERS spectra, such as equivalent line widths or Balmer break. It can be expected that these properties could be well separated in the classes due to the information being straightforwardly available within the spectroscopic data on which the classification is based. Some other properties, however, originate from additional data, different spectral regions, or complex inference processes. Typically, spectral lines in galaxies are often used as diagnostics to identify the source of ionisation, hence separating AGNs from star-forming galaxies. The BPT diagram is perhaps the most commonly used AGN diagnostic tool (Baldwin et al. 1981), but it makes use of the N II and Hα lines, which are unfortunately out of scope within the VIPERS spectra. This is a common issue that affects higher redshift galaxies, and alternative diagnostic tools have therefore been developed. Among them, the mass-excitation (MEx; Juneau et al. 2011) diagram, which uses the O III and Hβ lines equivalent width, and the stellar mass. This diagram was usable with the VIPERS data up to z = 0.9, after which the O III line is no longer available. As a result, MEx diagrams were established for 19 out of the 26 bins (Appendix B). For instance, the MEx diagram for the first bin at z = 0.41 (Fig. B.1) shows that the sample is mostly bimodal, and is distributed in the star-forming and intermediate region, with very little AGNs. We can see that most of the classes isolate either galaxies from one region or the other. However, most of the classes in this sample that end up in the intermediate region show no obvious sign of emission on their mean spectrum. Still, this indicates a successful segregation of the stellar mass despite the fact the spectra were normalised.

Several other diagrams could be investigated (e.g. colour-colour, colour-magnitude) to assess the relevance of the classification from a physical standpoint, but we would remain severely limited by the fact that in total, 754 classes for the whole set of redshift bins were yielded. One of the initial motivation of classifying galaxies is to simplify the process of studying their general properties. It does not replace a thorough study of one object, but it brings a general understanding of many galaxies at once. However, so far, with a total of 754 classes, this goal is not exactly fulfilled. In the next section we propose another approach to synthetise our results.

4.2. Links between classes of successive epochs

Links between classes of successive epochs were constructed with the k-nearest-neighbours algorithm (KNN) in order to follow the evolution of a given class over cosmic time, from 5 Gyr (z = 1.2) to 9 Gyr (z = 0.4) after the Big Bang. The KNN algorithm is a supervised classification algorithm that attributes a class to an object based on the majority vote of the k closest (in terms of Euclidean distance) objects for which the class is known. Here we chose k = n $ k=\sqrt{n} $ where n is the number of spectra of the corresponding sample. The Euclidean distance between two spectra s and t is defined as d st 2 = l = 1 p ( f ls 2 f lt 2 ) $ d^2_{st}=\sum\nolimits_{l=1}^p\left(f^2_{ls}-f^2_{lt}\right) $ where p is the number of wavelengths in the common spectral range and fls the monochromatic flux of spectrum s at wavelength index l. By applying this method to the average spectrum of a class Ki at epoch Ti, and comparing it to all the spectra at epoch Ti − 1 we are able to find which class Ki − 1 is the closest to Ki. In other words, the mean spectrum of a class Ki is compared to the k closest spectra of the previous epoch Ti − 1, and the connection is made with the class Ki − 1 having the largest number of spectra within these k closest ones (highest vote). Naturally there is no such value for the first bin at the highest redshift. This method implies that the class Ki has one and only one precursor class, while class Ki − 1 can have from zero to several successor classes. In this study, epoch Ti corresponds to a later time than epoch Ti − 1 and thus to a lower redshift. Repeating this process on subsequent bins creates a divergent tree-like structure (Fig. 1) that highlights the change of galaxy population flow found automatically by the algorithm.

thumbnail Fig. 1.

Tree-like structure highlighting the links between the galaxy classes from a redshift of z = 1.2 down to z = 0.4. The black line is the complete classification tree. Each vertical step in the tree corresponds to a certain epoch, linearly sampled from 4 Gyr after the Big Bang (bottom of the tree) to 9 Gyr (top of the tree). Each node represents a class, and similarity links from epoch to epoch were retrieved using KNN (Sect. 4.2). The branches that are studied in this paper as examples of spectral evolution are shown with a colour characteristics of their main branches: reds for passive galaxies (noted P*), greens for the star forming ones (SF*) and blues for the very blue spectra (VB*). The nomenclature is explained in the text.

The tree is robust in the sense that 16% of the majority votes in KNN are lower than 50%, 6% below 40%, and 2% below 30%. Table 1 provides the fraction of classes within each bin that are connected to the previous epoch with a vote score higher than 50%. The low-majority connections do not show any trend on the tree with respect to dead branches or redshift (hence bin size). These dead branches occur when a class at epoch Ti − 1 is not associated (i.e. elected by KNN) with any class at Ti. This can happen when the successor is not present in the sample (incompleteness), or not in sufficient number (for the majority vote) or because the previous association was not “correct” enough (due to noise for instance). In this section, we validate whether the algorithm has found the physically driven pattern and thus the evolutionary paths of galaxy populations.

The structure visible in Fig. 1 can be divided into three independent main sub-structures that do not share any common ancestor. Our nomenclature follows the main physical properties of the galaxy populations within each branch that are discussed below: i) the main branch of star-forming galaxies (hereafter SF tree), ii) the right branch of passive galaxies (hereafter P tree), and iii) the smallest branch of the bluest galaxies at the left (hereafter VB tree). The two main branches, SF tree and P tree, become more complex (i.e. the new sub-populations are emerging) when moving to lower redshift. Such a pattern could be connected to the increase in the diversity of galaxy population as time passes but this is probably mainly created by the building method of the tree (see above). Over cosmic time, galaxies can undergo various processes, including mergers, interactions, and quenching of star formation, which can be reflected by a more diverse mix of galaxy types when transiting to the lower-redshift population. Interestingly, the VB tree does not present evolutionary sequence and disappears at z = 0.9, that is, existing only for a 1 Gyr at the epoch when the Universe was half of its age (but see Sects. 5.1.3 and 5.3).

The relative population of each tree in our sample, epoch by epoch (Fig. 2), shows that at higher redshifts, the VB tree is the most populated, but its size quickly diminishes down to zero at z = 0.9. The SF tree, on the other hand, sees its population increase at the same pace as that of the VB tree decreases, suggesting the population of galaxies found in VB at higher redshifts are transferred to the SF tree at lower redshifts. Finally, the share of P-tree galaxies remain quite constant around 25%.

thumbnail Fig. 2.

Share of the VIPERS galaxies among the P, SF, and VB trees as a function of redshift. The value has been normalised to exclude the few classes that do not belong in either of the three trees in the first two bins.

The mean stacked spectra computed for each of the three trees (Fig. 3) justify our nomenclature and the nature of the galaxy populations. The stacked spectrum of P-tree galaxies (see the top panel in Fig. 3) do not have any prominent emission lines, and is characterised by a redder continuum and absorption lines, typical of passive galaxies. A strong D40003 suggests an older stellar population, possibly indicating an ageing or evolved stellar population. The presence of old stellar population is further strengthen by the presence of the strong Hδ, an indicator of the recent star formation history of a galaxy. The presence of NaD absorption lines at 5896 Å suggests the existence of cool, neutral gas along the line of sight. This could be indicative of the interstellar medium within these galaxies. The presence of Fe lines (around 4200 Å and 5100 Å) in absorption indicating a relatively high-metallicity and the other features (strong D4000, Hδ in absorption) suggest that P-tree population gathers galaxies that are currently quiescent in terms of the recent star formation. The presence of non-negligible O II emission may indicate some low-level, ongoing star formation, but it does not dominated the stacked spectrum.

thumbnail Fig. 3.

Mean spectra (black) of the P tree (top), SF tree (middle), and VB tree (bottom). The dispersion (10% and 90% quantiles) are shown in grey. Some observed emission and absorption lines are highlighted with vertical dashed lines, and the corresponding source is shown at the top. We note that all three panels have the same vertical scale, and that OII line in the bottom panel is cropped out to focus on the continuum and the dispersion. The mean spectrum of the VB tree is limited to a narrower spectral range than the two other trees since it only includes galaxies of redshift z > 0.9.

The mean spectrum of the SF-tree galaxies (see the middle panel in Fig. 3) contains galaxies with a rather blue continuum and pronounced emission lines. Strong O II, O III, and Hβ lines indicate that these galaxies are actively forming stars. The VB tree, although consisting of 4 independent structures, gathers very blue spectra with extremely intense emission features. Similarly to the SF tree, these galaxies are characterised by prominent O II emission line, but we cannot compare O III and Hβ, which are already outside the optical windows at this redshift. However, there are visible differences between the spectra of the VB tree in respect to SF tree, such as: stronger O II emission, prominent and broad NeIII emission, Hγ in emission. The differences can be visualised on the spectra at the highest redshift (z = 1.1763, Fig. A.26): The classes for VB1, VB2, VB3, and VB4 are respectively E4, A6, A1, and A2, while the class for SF is C6. Both stacked spectra (Fig. 4) show Mg II in absorption suggesting the presence of cooler, lower-density gas in the interstellar medium of galaxies in VB and SF trees. The spectral differences (O III strength and Ne III emission) could be indicative of variations in the underlying physical processes, such as the degree of star formation activity or the presence of different ionisation sources, possibly including contributions from an active galactic nucleus (AGN) in galaxies within VB tree. The [Ne III]λ3869 is present in high excitation galaxies and might by boost by non-thermal, obscured AGN (e.g. Levesque & Richardson 2014). The presence of VB tree might be an important key in understanding the evolution in the ionisation conditions of the gas-phase medium between z ∼ 2 and z = 0 (e.g. Steidel et al. 2014; Sanders et al. 2016, 2018; Strom et al. 2017).

thumbnail Fig. 4.

Mean spectra of the branches within the P tree (top), SF tree (middle), and VB tree (bottom). The branch nomenclature can be visualised in Fig. 1. The mean spectra were computed from the very top of the branch down to the common root, so that for instance the mean spectrum of PA1 includes the PA and P sections. The dispersion is not shown here for the sake of clarity. Some observed emission and absorption lines are highlighted with vertical dashed lines, and the corresponding source is shown at the top. We note that the vertical scale varies from one panel to another to fully include the emission lines within the plot.

Within these three trees lie many intricate structures. In this paper we illustrate the power of the tree-like structure with a few example branches, as highlighted in Fig. 1. Structurally, the VB tree is the simplest and can be synthesised by four branches that will be referred to as VB1-4. Next, the P tree has a more complex tree-like structure. It takes root with a common ancestor at z = 1.2 and a single main branch (P) up until z = 0.76, where it divides into two branches (PA and PB) that then separate again into two branches each at z = 0.54 (PA1 and PA2) and z = 0.64 (PB1 and PB2). Lastly, the SF tree is significantly more complex with three main branches (SFA, SFB, and SFC) emerging very early on: at z = 1.13 for SFA and z = 0.93 for SFB and SFC. Similarly to the P tree, each of these branches then divides into several sub-branches. SFA is the smallest branch, with only 2 sub-branches emerging from it (SFA1 and SFA2), which appear at z = 1.04 and disappear respectively at z = 0.82 and z = 0.70. Similarly, SFC divides into two sub-branches (SFC1 and SFC2) at z = 0.85, which die out at z = 0.64 and z = 0.52. Lastly, the SFB branch divides into a larger number of sub-branches (SFB1, SFB2a, SFB2b, SFB3, and SFB4) at redshifts ranging from z = 0.61 to z = 0.70. The mean spectra of the all these branches (Fig. 4) show that the branches bring a finer degree of spectral discrimination with their own specificities.

The construction of the tree thus simplifies the classification results by connecting similar classes present at different epochs. The initial 754 classes are reduced to a total of 3 trees and 14 branches, allowing the analysis of the full dataset and the interpretation of the classification results in an evolutionary perspective.

5. Discussion

5.1. Physical properties of the branches

5.1.1. Passive tree

The spectra of the galaxies gathered in the P branches have a red continuum, a strong D4000 break, and mostly no emission lines, indicative of old stellar populations. Kauffmann et al. (2003) show that there is a bimodal distribution of D4000 within the SDSS spectra, which separates galaxies with old stellar populations (D4000 > 1.5) from those with more recent star formation (D4000 < 1.5). As suggested by the mean spectra, the vast majority of classes from the P tree have D4000 values typical of galaxies populated by old stellar populations (Fig. 5). The first fork PA-PB at z = 0.76 separates two populations of galaxies with distinct D4000 values (Fig. 5). In PA, we find galaxies with D4000 mostly ranging from 1.5 to 1.7, while those with a 4000 Å break above 1.7 are found in PB. Interestingly, Haines et al. (2017) observe that many massive blue galaxies within the VIPERS dataset are being quenched around z ≈ 0.7, which is possibly what is seen in the PA branch. Further down the PA branch, two paths emerge. One of them (PA1) shows a decrease in D4000 break suggesting that this branch can gather galaxies that experienced merging processes and the inflow of the cool gas has triggered the recent star formation. The galaxies PA1, besides D4000 reaching value typical for blue star-forming galaxies, are characterised by bluer continuum and noticeable [O II] emission with still Hδ absorption. In fact, the D4000 in PA1 shows a consistent decrease over several successive epochs. This branch shows characteristics of post-starburst galaxies (e.g. Setton et al. 2022, 2023), and could correspond to the quenched massive blue galaxies mentioned above. The other sub-branch, PA2, does not share these features and instead reaches D4000 levels comparable to the PB branch, as well as an overall redder spectrum. PA2, PB1, and PB2 are very similar, and mostly segregate different levels of ageing of the stellar populations, with PA1 being the younger, and PB2 the oldest (except for the last redshift bins). This may be explained by varying degrees of dust attenuation, but further insights are given below through the evolution of the properties along the branches. At the last stages of the life of PA2, PB1, and PA1 that occur at z ∼ 0.44 − 0.5 (Fig. 5) the D4000 drops significantly suggesting that these galaxies have experienced a recent star formation activity due to the tidal stripping, merger activity or AGN activity that have reshaped their properties. However, we stress that the ends of branches typically contain only a few galaxies with a lot of dispersion, and may not really corresponding to the physical processes driving their change.

thumbnail Fig. 5.

Distribution of the median values of D4000 among the classes in the P, SF, and VB branches. The error bars show the 25% and 75% quantiles, and the dashed line shows the separation threshold between passive and star-forming galaxies from Kauffmann et al. (2003).

The NUVrK colour diagram (Fig. 6) provides information relative to the star formation activity. It is in agreement with the above interpretation: (i) the classes of the P tree are in majority located in the red region of the diagram and (ii) the PA1 branch evolved from the green region to blue region with decreasing redshift, (iii) the ends of branches PA1, PB1, and PA2 follow the conclusion of recent star formation activity by moving to bluer colours.

thumbnail Fig. 6.

Distribution of the classes of P, SF, and VB branches in the NUVrK diagram. According to the classification of Davidzon et al. (2016), passive galaxies lie above the upper line, galaxies with intense star formation lie below the lower line, and the intermediate zone contains galaxies with a low SFR.

The stellar mass evolution (Fig. 7) where the high redshift galaxies are associated with a higher stellar mass, with log(Mstar/M ∼ 11), can be naturally explained by selection bias in a magnitude limited survey. At high redshifts, only massive galaxies get targeted, and low-mass galaxies are only targeted at low redshifts. This also explains the anti-correlation between SFR and stellar mass, as a higher SFR makes the galaxy appear brighter, and it gets targeted despite having a low stellar mass.

thumbnail Fig. 7.

Distribution of the median values of stellar mass among the classes in the P, SF, and VB branches. The error bars show the 25% and 75% quantiles.

5.1.2. Star-forming tree

Galaxies found in the SF tree show more complex spectral features than in the P tree, typically found in star-forming galaxies. In addition to a much bluer continuum and a small D4000 break (see Fig. 5), we observe several significant emission lines (OII; Hγ; Hβ; OIII) in all the SF tree, with varying intensities and ratios across the branches (see Fig. 4). The SFA branches are the ones with the most intense emission lines and the bluest continuum, and both SFA1 and SFA2 have very similar features. Nonetheless, SFA2 contains slightly bluer galaxies with more intense [OII] emission than SFA1. In comparison, the galaxies in the SFC branches have less strong emission features, a slightly redder continuum, and also show strong Hδ absorption. We observe the same characteristics in SFBs but to a greater extreme; the emission lines get much fainter, and the spectra redder. In particular, the spectra of galaxies found in SFB1 and SFB3 resemble that of the least quiescent P-branch, PA1.

The classes of the SF tree are, as opposed to the P tree, characterised by a D4000 systematically smaller than 1.5 (Fig. 5), typical of galaxies with recent star formation events. There is nonetheless a significant amount of diversity within that tree. While the uncertainty on the estimations of the SFR are too high to observe any significant differences between the branches, the NUVrK diagram (Fig. 6) is quite informative in that regard: The SFA and SFC branches are associated with the higher end of the SFR, whereas the PB branches are more located in the intermediate region. In particular, the spectra of galaxies found in SFB1 and SFB3 resemble that of the PA1 branch, which was interpreted as a class of currently quenching galaxies.

Interestingly, we observe that overall, SF galaxies see their stellar mass decrease at smaller redshifts (Fig. 7). Additionally, the branches isolate distinct ranges of stellar masses, with a clear anti-correlation between SFR and stellar mass. The SFA branches, which are the most star forming, are also the less massive, while the SFB, which are the least star forming, are the most massive. This observation can likely be attributed to ‘downsizing’ (Cowie et al. 1996). Generally, SF galaxies are also significantly less massive than the P galaxies.

5.1.3. Very-Blue branches

Structurally, the VB branches are different from Ps and SFs as they do not originate from the same ancestor. They are actually 4 independent branches that span through several redshifts down to z = 0.9, and while smaller than the P and SF trees, the VB branches still gather a significant amount of galaxies, especially at higher redshifts (Fig. 2). It is to be noted that despite not sharing a common ancestor, the four VB branches show very similar characteristics: The four branches separate different levels of blueness of the continuum (VB1 being the bluest, and VB4 the less blue) and line intensity (in the same order).

At first glance, these branches can be seen as ‘extreme’ versions of the SF tree. They are characterised by a particularly blue continuum, low D4000 values (Fig. 5), and very intense emission lines (Fig. 4). They are all located on the region of the NUVrK diagram associated with the highest SFR (Fig. 6). This interpretation might be supported by the fact that the VB branches are well integrated in the reverse tree even serving as precursors of some SF branches (Fig. C.1, see Sect. 5.3). However, we note a significant NeIII emission, which is usually associated with nuclei activity. It is difficult to draw clear conclusions on that aspect since MgII and NeV emissions would also be expected in stronger intensity than NeIII, but is not really observed here.

The trend between stellar mass and SFR observed for the P and SF galaxies also holds for the VB galaxies: They are the most star-forming, and also the less massive at a given redshift (Fig. 7). Contrary to the SF tree, however, it is difficult to really see a trend with redshift since the VB tree branches die out at z ≈ 0.9.

On a last note, the fact that this tree disappears at z = 0.9 is more likely to be a consequence of the representativeness of the dataset rather than an actual evolutionary meaning. It is possible that there is somewhat of a selection bias that limits the amount of VB galaxies in the dataset at lower redshift. This also limits its correlation with AGNs as most of VIPERS AGNs are rather populating lower redshifts (z < 0.7; see Fig. 6 in Siudek et al. 2023b). It is also probable that the disappearance of VB spectra at z = 0.9 might be due to the lost connection between the VB and SF branches when the O III line is masked.

5.2. Comparison with local galaxies

Fraix-Burnet et al. (2021) classified the SDSS dataset into four main classes, which were then sub-classified resulting in their final classification with 86 classes. These four main classes essentially consist of: (A) a class of red quiescent galaxies, (B) a class of galaxies with bluer continuum and some emission lines, (C) a diverse class with a lot of dispersion both in continuum shapes and line intensities, and (D) a small class of galaxies with a particularly blue continuum and very intense emission lines. These main classes are in a way reminiscent of the P, SF, and VB categorisations that emerged from the construction of the evolutionary tree.

After normalising the mean spectra of the 4 main SDSS classes on the same spectral region as for VIPERS, it is clear (Fig. 8) that the P tree and SF tree match remarkably well with the SDSS-A and SDSS-B classes respectively, both from a continuum and a line intensity standpoint. The continuum slope of the VB tree also matches perfectly with that of the SDSS-D class, although the latter has a lot more visible lines. Overall, this suggests that we are witnessing the same main categories of galaxy emerge from the analysis both in the z ∼ 1 vicinity and in the local Universe, similarly to what is found by Turner et al. (2021) when comparing the evolution of galaxy populations found within SDSS and VIPERS datasets. Nonetheless, there is no equivalent of class SDSS-C, but it is difficult to draw any conclusions due to its diversity in continuum shapes. Still, it seems to be characterised by noticeable line broadening, which we do not retrieve within the VIPERS classification (the VIPERS broad-line AGN were excluded from this analysis).

thumbnail Fig. 8.

Same figure as in Fig. 3 but with an additional layer showing the mean spectra of the SDSS main classes A, B, and D. The SDSS spectra were re-normalised in the same spectral region than the VIPERS spectra to make the comparison possible.

This simple comparison could be improved by lowering the resolution of the SDSS mean spectra down to the VIPERS resolution (R = 220). Additionally, the SDSS spectra appear to be generally redder, which could be explained by cosmic expansion. Still, the comparison portrayed here is good enough to conclude of a remarkable match between the SDSS main classes and the VIPERS sub-trees. In any case, this comparison could be refined with the SDSS sub-classes and the KNN procedure used in the present work, but this would be more rewarding by adding intermediate redshift data between SDSS and VIPERS. This is our goal for future work.

5.3. Limitations

The difficulty in the study of spectra at different redshifts is the lack of a large common wavelength range. Binning is the only option but this reduces the size of the samples for the clustering analysis. Despite the large VIPERS dataset, its large redshift range imposes a significant number of bins and thus limits the diversity of spectra within each bin. This implies that some class present at a given redshift might be absent in neighbouring ones due to the limited sample size. This could have a non-negligible impact on the tree structure.

In a given sample, sub-classification allows for a finer detection of typical spectra. Indeed, sub-classification can be pursued at more than two levels like it has been performed in this work. The number of levels is mainly given by a sufficiently low dispersion within classes, but there is no objective criterion apart from the scientific goal and interpretation. The level of sub-classification might affect the tree structure.

The tree was built using the Euclidean distance between the whole spectra with a majority vote through the KNN algorithm. However, the Euclidean distance tends to favour the continuum over narrow features because the continuum has more redundancies. In addition, the spectra contain several hundreds of monochromatic fluxes while the distance metrics suffer from the curse of dimensionality.

To optimise the overlap between the mean spectra of the classes, we were obliged to compare consecutive redshift bins. This does not allow complex structures to emerge, such as branch merging, or links that skip an epoch. Making this possible, in addition to changes in the distance computation, would likely help reduce the number of dead branches, which are very numerous in the current version of the tree. Some of them may have an actual physical meaning, being that this type of galaxy is no longer observed at lower redshifts. But so far, many of the dead branches end up in small classes of outliers spectra that seem to contain calibration issues or much noise (see for instance class D in Fig. A.19, the dead end of VP4).

All these limitations have solutions for improvement that we will investigate in future works. However, it is important to note that the tree established in this paper is not an evolutionary tree of galaxies in the sense of the phylogeny (e.g. Fraix-Burnet et al. 2012, 2019; Fraix-Burnet 2017) but rather a chronological evolution of the classes of galaxy spectra. For that one would need properties that characterise the evolutionary status of galaxies and can be given at any redshift. This might prove difficult using optical spectra only since the number of common features in the dataset tends towards zero with an increasing redshift range.

The simple approach used in this paper cannot build a converging tree. This means that we could not expect to get the exact reverse tree of Fig. 1 beginning the links from the lower redshifts as reference instead of the higher redshifts as we show in the present paper (Fig. C.1). We find the interpretation of the reverse tree more difficult since, chronologically, the new “dead ends” occur from nothing. Of course this could probably better be explained by the sample incompleteness exactly as for the tree in Fig. 1. Nevertheless, the branches identified on the tree (Fig. 1) in this paper are rather nicely recovered on the reverse tree (Fig. C.1) with two main branches, one for the P and one for the SF, the VB appearing as “precursors” of SF.

6. Conclusion

In this work, we successfully used the discriminant latent sub-space GMM clustering algorithm Fisher-EM on a large dataset of 79 224 optical spectra of galaxies with redshift of 0.4 < z < 1.2 from the VIPERS DR2. We split the dataset into 26 samples by bins of redshift. A classification was obtained on each individual sample followed by a one-level sub-classification, yielding in total 754 classes of spectra with 20 to 43 classes per sample, except for one sample that has only eight classes. Subsequently, links between the classes of the 26 samples were established using the KNN algorithm, resulting in a tree-like structure that highlights evolutionary paths of the classes.

Three main categories of classes were highlighted by the evolutionary tree through three sub-trees: (i) red passive galaxies, (ii) blue star-forming galaxies, and (iii) very blue galaxies with intense emission features. The mean spectra of these three types match remarkably well with those of the SDSS main classes A, B, and D from Fraix-Burnet et al. (2021) at z < 0.25.

As demonstrated in a previous paper (Dubois et al. 2022), the classes obtained with Fisher-EM do contain physical specificities. Here, we characterised the classes according to properties such as D4000 break, colour, SFR, and stellar mass, and we visualised their evolution through our tree. Indeed, the three sub-trees show complex branching structures that highlight different physical and evolutionary behaviours. For instance, different intensities of star formation are distinguished within the SF tree, and quenching events might be identified in a sub-branch of the P tree. Also, while the VB tree is indubitably associated with very high SFRs, some spectral features could indicate the potential presence of AGNs within the classes, but the presence of AGNs remains inconclusive due to the absence of Mg and NeV emission lines.

The present work has innovated in two ways: 1) implementing the first unsupervised classification of galaxy spectra at redshifts around one and 2) the construction of a tree to depict the evolution of the classes by their similarity. While the classification synthesises the diversity of spectra of galaxies, the tree summarises the diversification of galaxies through epochs. This novel approach to galaxy classification can easily be extended to higher redshifts with future surveys. Most notably, it will be possible to use the JWST spectroscopic observations of very high redshift galaxies, thus helping shed light on the questions that are left surrounding the intricate mechanisms governing galaxy formation and evolution.


2

In this paper, dataset refers to all the VIPERS spectra, and sample refers to the spectra of a single bin of redshift.

3

The D4000 break is a feature in the spectrum associated with the Balmer break, often used as an indicator of the stellar population age.

Acknowledgments

We warmly thank the referee for very insightful comments that helped clarify some important points. This paper uses data from the VIMOS Public Extragalactic Redshift Survey (VIPERS). VIPERS has been performed using the ESO Very Large Telescope, under the “Large Programme” 182.A-0886. The participating institutions and funding agencies are listed at http://vipers.inaf.it. We are grateful to Thibaud Moutard for allowing us to use some unpublished data from the VIPERS survey. This work has been supported by the Polish National Agency for Academic Exchange (Bekker grant BPN/BEK/2021/1/00298/DEC/1), the European Union’s Horizon 2020 Research and Innovation programme under the Maria Sklodowska-Curie grant agreement (No. 754510).

References

  1. Arnouts, S., Cristiani, S., Moscardini, L., et al. 1999, MNRAS, 310, 540 [Google Scholar]
  2. Arnouts, S., Le Floc’h, E., Chevallard, J., et al. 2013, A&A, 558, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Baillard, A., Bertin, E., de Lapparent, V., et al. 2011, A&A, 532, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Baldwin, J. A., Phillips, M. M., & Terlevich, R. 1981, PASP, 93, 5 [Google Scholar]
  5. Bellman, R. 1957, Dynamic Programming (Princeton, New Jersey: Princeton University Press), XXV, 342 [Google Scholar]
  6. Biernacki, C., Celeux, G., & Govaert, G. 2000, IEEE Trans. Pattern Anal. Mach. Intell., 22, 719 [Google Scholar]
  7. Bouveyron, C., & Brunet, C. 2012, Statist. Comput., 22, 301 [CrossRef] [Google Scholar]
  8. Chang, W.-C. 1983, Appl. Stat., 32, 267 [Google Scholar]
  9. Cheng, T. Y., Domínguez Sánchez, H., Vega-Ferrero, J., et al. 2023, MNRAS, 518, 2794 [Google Scholar]
  10. Cowie, L. L., Songaila, A., Hu, E. M., & Cohen, J. G. 1996, ApJ, 112, 839 [CrossRef] [Google Scholar]
  11. Davidzon, I., Cucciati, O., Bolzonella, M., et al. 2016, A&A, 586, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. De, T., Fraix Burnet, D., & Chattopadhyay, A. K. 2016, Commun. Stat. - Theory Methods, 45, 2638 [CrossRef] [Google Scholar]
  13. de Vaucouleurs, G. 1959, Handbuch der Physik, 53, 275 [Google Scholar]
  14. Dobos, L., Csabai, I., Yip, C.-W., et al. 2012, MNRAS, 420, 1217 [Google Scholar]
  15. Dubois, J., Fraix-Burnet, D., Moultaka, J., Sharma, P., & Burgarella, D. 2022, A&A, 663, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Fraix-Burnet, D. 2017, Phylogenetic Tools in Astrophysics (John Wiley& Sons, Ltd.) [Google Scholar]
  17. Fraix-Burnet, D., Chattopadhyay, T., Chattopadhyay, A. K., Davoust, E., & Thuillard, M. 2012, A&A, 545, A80 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Fraix-Burnet, D., Thuillard, M., & Chattopadhyay, A. K. 2015, Front. Astron. Space Sci., 2, 3 [Google Scholar]
  19. Fraix-Burnet, D., D’Onofrio, M., & Marziani, P. 2019, A&A, 630, A63 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  20. Fraix-Burnet, D., Bouveyron, C., & Moultaka, J. 2021, A&A, 649, A53 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Garilli, B., Guzzo, L., Scodeggio, M., et al. 2014, A&A, 562, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  22. Guzzo, L., Scodeggio, M., Garilli, B., et al. 2014, A&A, 566, A108 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. Haines, C. P., Iovino, A., Krywult, J., et al. 2017, A&A, 605, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Hastie, T., Tibshirani, R., & Friedman, J. 2009, The Elements of Statistical Learning (New York, NY, USA: Springer) [Google Scholar]
  25. Hubble, E. P. 1926, ApJ, 64, 321 [Google Scholar]
  26. Huertas-Company, M., & Lanusse, F. 2023, PASA, 40, e001 [NASA ADS] [CrossRef] [Google Scholar]
  27. Huertas-Company, M., Gravet, R., Cabrera-Vives, G., et al. 2015, ApJS, 221, 8 [NASA ADS] [CrossRef] [Google Scholar]
  28. Huertas-Company, M., Iyer, K. G., Angeloudi, E., et al. 2024, A&A, 685, A48 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Ilbert, O., Arnouts, S., McCracken, H. J., et al. 2006, A&A, 457, 841 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Juneau, S., Dickinson, M., Alexander, D. M., & Salim, S. 2011, ApJ, 736, 104 [Google Scholar]
  31. Kauffmann, G., Heckman, T. M., White, S. D. M., et al. 2003, MNRAS, 341, 33 [Google Scholar]
  32. Kennicutt, R. C., Jr. 1992, ApJS, 79, 255 [NASA ADS] [CrossRef] [Google Scholar]
  33. Lee, J. H., Lee, M. G., Park, C., & Choi, Y.-Y. 2008, MNRAS, 389, 1791 [NASA ADS] [CrossRef] [Google Scholar]
  34. Levesque, E. M., & Richardson, M. L. A. 2014, ApJ, 780, 100 [Google Scholar]
  35. Lisiecki, K., Małek, K., Siudek, M., et al. 2023, A&A, 669, A95 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. LSST Science Collaboration (Marshall, P., et al.) 2017, arXiv e-prints [arXiv:1708.04058] [Google Scholar]
  37. MacQueen, J. 1967, Proc. 5th Berkeley Symp. Math. Stat. Probab., Univ. Calif. 1965/66, 1, 281 [Google Scholar]
  38. Marchetti, A., Granett, B. R., Guzzo, L., et al. 2013, MNRAS, 428, 1424 [Google Scholar]
  39. Marchetti, A., Garilli, B., Granett, B. R., et al. 2017, A&A, 600, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Moutard, T., Arnouts, S., Ilbert, O., et al. 2016a, A&A, 590, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Moutard, T., Arnouts, S., Ilbert, O., et al. 2016b, A&A, 590, A102 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Racca, G. D., Laureijs, R., Stagnaro, L., et al. 2016, Proc. SPIE, 9904, 99040O [Google Scholar]
  43. Sánchez Almeida, J., Aguerri, J. A. L., Muñoz-Tuñón, C., & de Vicente, A. 2010, ApJ, 714, 487 [Google Scholar]
  44. Sandage, A. 1961, The Hubble Atlas of Galaxies (Washington: Carnegie Institution) [Google Scholar]
  45. Sanders, R. L., Shapley, A. E., Kriek, M., et al. 2016, ApJ, 816, 23 [Google Scholar]
  46. Sanders, R. L., Shapley, A. E., Kriek, M., et al. 2018, ApJ, 858, 99 [NASA ADS] [CrossRef] [Google Scholar]
  47. Scodeggio, M., Guzzo, L., Garilli, B., et al. 2018, A&A, 609, A84 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Setton, D. J., Verrico, M., Bezanson, R., et al. 2022, ApJ, 931, 51 [NASA ADS] [CrossRef] [Google Scholar]
  49. Setton, D. J., Dey, B., Khullar, G., et al. 2023, ApJ, 947, L31 [NASA ADS] [CrossRef] [Google Scholar]
  50. Siudek, M., Malek, K., Pollo, A., et al. 2018a, ArXiv e-prints [arXiv:1805.09905] [Google Scholar]
  51. Siudek, M., Małek, K., Pollo, A., et al. 2018b, A&A, 617, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Siudek, M., Lisiecki, K., Mezcua, M., et al. 2022a, ArXiv e-prints [arXiv:2211.11792] [Google Scholar]
  53. Siudek, M., Małek, K., Pollo, A., et al. 2022b, A&A, 666, A131 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  54. Siudek, M., Lisiecki, K., Krywult, J., et al. 2023a, MNRAS, 523, 4294 [NASA ADS] [CrossRef] [Google Scholar]
  55. Siudek, M., Mezcua, M., & Krywult, J. 2023b, MNRAS, 518, 724 [Google Scholar]
  56. Steidel, C. C., Rudie, G. C., Strom, A. L., et al. 2014, ApJ, 795, 165 [Google Scholar]
  57. Strom, A. L., Steidel, C. C., Rudie, G. C., et al. 2017, ApJ, 836, 164 [NASA ADS] [CrossRef] [Google Scholar]
  58. Turner, S., Siudek, M., Salim, S., et al. 2021, MNRAS, 503, 3010 [NASA ADS] [CrossRef] [Google Scholar]
  59. Wang, L.-L., Luo, A.-L., Shen, S.-Y., et al. 2018, MNRAS, 474, 1873 [Google Scholar]

Appendix A: Mean spectra of all bin classifications

thumbnail Fig. A.1.

Mean spectra (black) of the classes obtained on the first bin at z = 0.41. The dispersion (10% and 90% quantiles) are shown in grey, and the masks are in red. Each panel shows one class, and the label is displayed in the top-left corner and size in the top right. The letter in the class label corresponds to the first run of the algorithm on this bin, and the number corresponds to the sub-classification of these main classes.

thumbnail Fig. A.2.

Stacked spectra of the classes of bin 2 (see Fig. A.1 for further information).

thumbnail Fig. A.3.

Stacked spectra of the classes of bin 3 (see Fig. A.1 for further information).

thumbnail Fig. A.4.

Stacked spectra of the classes of bin 4 (see Fig. A.1 for further information).

thumbnail Fig. A.5.

Stacked spectra of the classes of bin 5 (see Fig. A.1 for further information).

thumbnail Fig. A.6.

Stacked spectra of the classes of bin 6 (see Fig. A.1 for further information).

thumbnail Fig. A.7.

Stacked spectra of the classes of bin 7 (see Fig. A.1 for further information).

thumbnail Fig. A.8.

Stacked spectra of the classes of bin 8 (see Fig. A.1 for further information).

thumbnail Fig. A.9.

Stacked spectra of the classes of bin 9 (see Fig. A.1 for further information).

thumbnail Fig. A.10.

Stacked spectra of the classes of bin 10 (see Fig. A.1 for further information).

thumbnail Fig. A.11.

Stacked spectra of the classes of bin 11 (see Fig. A.1 for further information).

thumbnail Fig. A.12.

Stacked spectra of the classes of bin 12 (see Fig. A.1 for further information).

thumbnail Fig. A.13.

Stacked spectra of the classes of bin 13 (see Fig. A.1 for further information).

thumbnail Fig. A.14.

Stacked spectra of the classes of bin 14 (see Fig. A.1 for further information).

thumbnail Fig. A.15.

Stacked spectra of the classes of bin 15 (see Fig. A.1 for further information).

thumbnail Fig. A.16.

Stacked spectra of the classes of bin 16 (see Fig. A.1 for further information).

thumbnail Fig. A.17.

Stacked spectra of the classes of bin 17 (see Fig. A.1 for further information).

thumbnail Fig. A.18.

Stacked spectra of the classes of bin 18 (see Fig. A.1 for further information).

thumbnail Fig. A.19.

Stacked spectra of the classes of bin 19 (see Fig. A.1 for further information)

thumbnail Fig. A.20.

Stacked spectra of the classes of bin 20 (see Fig. A.1 for further information)

thumbnail Fig. A.21.

Stacked spectra of the classes of bin 21 (see Fig. A.1 for further information)

thumbnail Fig. A.22.

Stacked spectra of the classes of bin 22 (see Fig. A.1 for further information)

thumbnail Fig. A.23.

Stacked spectra of the classes of bin 23 (see Fig. A.1 for further information)

thumbnail Fig. A.24.

Stacked spectra of the classes of bin 24 (see Fig. A.1 for further information)

thumbnail Fig. A.25.

Stacked spectra of the classes of bin 25 (see Fig. A.1 for further information)

thumbnail Fig. A.26.

Stacked spectra of the classes of bin 26 (see Fig. A.1 for further information)

Appendix B: MEx diagrams

thumbnail Fig. B.1.

MEx diagrams of the 20 classes from the first bin, at z = 0.41. Each panel shows one class, whose name and sizes are displayed at the top and bottom left corner. The black line delimits the star-forming region (bottom), AGN region (top), and intermediate region (centre). The black contours show the distribution of the bin, and the red contours show the distribution of the class.

thumbnail Fig. B.2.

MEx diagram of the classes of bin 2 (see Fig. B.1 for further information).

thumbnail Fig. B.3.

MEx diagram of the classes of bin 3 (see Fig. B.1 for further information).

thumbnail Fig. B.4.

MEx diagram of the classes of bin 4 (see Fig. B.1 for further information).

thumbnail Fig. B.5.

MEx diagram of the classes of bin 5 (see Fig. B.1 for further information).

thumbnail Fig. B.6.

MEx diagram of the classes of bin 6 (see Fig. B.1 for further information).

thumbnail Fig. B.7.

MEx diagram of the classes of bin 7 (see Fig. B.1 for further information).

thumbnail Fig. B.8.

MEx diagram of the classes of bin 8 (see Fig. B.1 for further information).

thumbnail Fig. B.9.

MEx diagram of the classes of bin 9 (see Fig. B.1 for further information).

thumbnail Fig. B.10.

MEx diagram of the classes of bin 10 (see Fig. B.1 for further information).

thumbnail Fig. B.11.

MEx diagram of the classes of bin 11 (see Fig. B.1 for further information).

thumbnail Fig. B.12.

MEx diagram of the classes of bin 12 (see Fig. B.1 for further information).

thumbnail Fig. B.13.

MEx diagram of the classes of bin 13 (see Fig. B.1 for further information).

thumbnail Fig. B.14.

MEx diagram of the classes of bin 14 (see Fig. B.1 for further information).

thumbnail Fig. B.15.

MEx diagram of the classes of bin 15 (see Fig. B.1 for further information).

thumbnail Fig. B.16.

MEx diagram of the classes of bin 16 (see Fig. B.1 for further information).

thumbnail Fig. B.17.

MEx diagram of the classes of bin 17 (see Fig. B.1 for further information).

thumbnail Fig. B.18.

MEx diagram of the classes of bin 18 (see Fig. B.1 for further information).

thumbnail Fig. B.19.

MEx diagram of the classes of bin 19 (see Fig. B.1 for further information).

Appendix C: Reverse tree

thumbnail Fig. C.1.

Comparison between trees obtained from increasing or decreasing redshifts. The colours are the same as in Fig. 1. Left: Tree obtained by starting from the higher redshift (same as in Fig. 1). Right: Reverse tree obtained by starting from the lowest redshift (see Sect. 5.3).

All Tables

Table 1.

Characteristics of the sub-samples: range of redshift and epoch, rest-frame spectral range, masking rate, sample size, and number of classes yielded.

All Figures

thumbnail Fig. 1.

Tree-like structure highlighting the links between the galaxy classes from a redshift of z = 1.2 down to z = 0.4. The black line is the complete classification tree. Each vertical step in the tree corresponds to a certain epoch, linearly sampled from 4 Gyr after the Big Bang (bottom of the tree) to 9 Gyr (top of the tree). Each node represents a class, and similarity links from epoch to epoch were retrieved using KNN (Sect. 4.2). The branches that are studied in this paper as examples of spectral evolution are shown with a colour characteristics of their main branches: reds for passive galaxies (noted P*), greens for the star forming ones (SF*) and blues for the very blue spectra (VB*). The nomenclature is explained in the text.

In the text
thumbnail Fig. 2.

Share of the VIPERS galaxies among the P, SF, and VB trees as a function of redshift. The value has been normalised to exclude the few classes that do not belong in either of the three trees in the first two bins.

In the text
thumbnail Fig. 3.

Mean spectra (black) of the P tree (top), SF tree (middle), and VB tree (bottom). The dispersion (10% and 90% quantiles) are shown in grey. Some observed emission and absorption lines are highlighted with vertical dashed lines, and the corresponding source is shown at the top. We note that all three panels have the same vertical scale, and that OII line in the bottom panel is cropped out to focus on the continuum and the dispersion. The mean spectrum of the VB tree is limited to a narrower spectral range than the two other trees since it only includes galaxies of redshift z > 0.9.

In the text
thumbnail Fig. 4.

Mean spectra of the branches within the P tree (top), SF tree (middle), and VB tree (bottom). The branch nomenclature can be visualised in Fig. 1. The mean spectra were computed from the very top of the branch down to the common root, so that for instance the mean spectrum of PA1 includes the PA and P sections. The dispersion is not shown here for the sake of clarity. Some observed emission and absorption lines are highlighted with vertical dashed lines, and the corresponding source is shown at the top. We note that the vertical scale varies from one panel to another to fully include the emission lines within the plot.

In the text
thumbnail Fig. 5.

Distribution of the median values of D4000 among the classes in the P, SF, and VB branches. The error bars show the 25% and 75% quantiles, and the dashed line shows the separation threshold between passive and star-forming galaxies from Kauffmann et al. (2003).

In the text
thumbnail Fig. 6.

Distribution of the classes of P, SF, and VB branches in the NUVrK diagram. According to the classification of Davidzon et al. (2016), passive galaxies lie above the upper line, galaxies with intense star formation lie below the lower line, and the intermediate zone contains galaxies with a low SFR.

In the text
thumbnail Fig. 7.

Distribution of the median values of stellar mass among the classes in the P, SF, and VB branches. The error bars show the 25% and 75% quantiles.

In the text
thumbnail Fig. 8.

Same figure as in Fig. 3 but with an additional layer showing the mean spectra of the SDSS main classes A, B, and D. The SDSS spectra were re-normalised in the same spectral region than the VIPERS spectra to make the comparison possible.

In the text
thumbnail Fig. A.1.

Mean spectra (black) of the classes obtained on the first bin at z = 0.41. The dispersion (10% and 90% quantiles) are shown in grey, and the masks are in red. Each panel shows one class, and the label is displayed in the top-left corner and size in the top right. The letter in the class label corresponds to the first run of the algorithm on this bin, and the number corresponds to the sub-classification of these main classes.

In the text
thumbnail Fig. A.2.

Stacked spectra of the classes of bin 2 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.3.

Stacked spectra of the classes of bin 3 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.4.

Stacked spectra of the classes of bin 4 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.5.

Stacked spectra of the classes of bin 5 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.6.

Stacked spectra of the classes of bin 6 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.7.

Stacked spectra of the classes of bin 7 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.8.

Stacked spectra of the classes of bin 8 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.9.

Stacked spectra of the classes of bin 9 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.10.

Stacked spectra of the classes of bin 10 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.11.

Stacked spectra of the classes of bin 11 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.12.

Stacked spectra of the classes of bin 12 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.13.

Stacked spectra of the classes of bin 13 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.14.

Stacked spectra of the classes of bin 14 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.15.

Stacked spectra of the classes of bin 15 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.16.

Stacked spectra of the classes of bin 16 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.17.

Stacked spectra of the classes of bin 17 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.18.

Stacked spectra of the classes of bin 18 (see Fig. A.1 for further information).

In the text
thumbnail Fig. A.19.

Stacked spectra of the classes of bin 19 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.20.

Stacked spectra of the classes of bin 20 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.21.

Stacked spectra of the classes of bin 21 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.22.

Stacked spectra of the classes of bin 22 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.23.

Stacked spectra of the classes of bin 23 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.24.

Stacked spectra of the classes of bin 24 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.25.

Stacked spectra of the classes of bin 25 (see Fig. A.1 for further information)

In the text
thumbnail Fig. A.26.

Stacked spectra of the classes of bin 26 (see Fig. A.1 for further information)

In the text
thumbnail Fig. B.1.

MEx diagrams of the 20 classes from the first bin, at z = 0.41. Each panel shows one class, whose name and sizes are displayed at the top and bottom left corner. The black line delimits the star-forming region (bottom), AGN region (top), and intermediate region (centre). The black contours show the distribution of the bin, and the red contours show the distribution of the class.

In the text
thumbnail Fig. B.2.

MEx diagram of the classes of bin 2 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.3.

MEx diagram of the classes of bin 3 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.4.

MEx diagram of the classes of bin 4 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.5.

MEx diagram of the classes of bin 5 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.6.

MEx diagram of the classes of bin 6 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.7.

MEx diagram of the classes of bin 7 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.8.

MEx diagram of the classes of bin 8 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.9.

MEx diagram of the classes of bin 9 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.10.

MEx diagram of the classes of bin 10 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.11.

MEx diagram of the classes of bin 11 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.12.

MEx diagram of the classes of bin 12 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.13.

MEx diagram of the classes of bin 13 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.14.

MEx diagram of the classes of bin 14 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.15.

MEx diagram of the classes of bin 15 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.16.

MEx diagram of the classes of bin 16 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.17.

MEx diagram of the classes of bin 17 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.18.

MEx diagram of the classes of bin 18 (see Fig. B.1 for further information).

In the text
thumbnail Fig. B.19.

MEx diagram of the classes of bin 19 (see Fig. B.1 for further information).

In the text
thumbnail Fig. C.1.

Comparison between trees obtained from increasing or decreasing redshifts. The colours are the same as in Fig. 1. Left: Tree obtained by starting from the higher redshift (same as in Fig. 1). Right: Reverse tree obtained by starting from the lowest redshift (see Sect. 5.3).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.