Open Access
Issue
A&A
Volume 698, June 2025
Article Number A132
Number of page(s) 30
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202554372
Published online 11 June 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model.

Open Access funding provided by Max Planck Society.

1. Introduction

To properly understand the evolution of any class of astrophysical objects through cosmic time, it is necessary not only to study individual sources in detail, but also to consider their behaviour as a (statistical) population. Demographic studies further our understanding of the distinct phases in the evolution of a type of object because extensive samples can provide enough statistics to show hidden trends. However, to build reliable statistical samples, large collaborations must be involved in a systematic approach to observe numerous sources over the entire sky. This paper introduces a sample of point-like X-ray detected sources that spans a wide redshift range (0 ≤ z < 5.8), for which optical spectra were obtained systematically with the Sloan Digital Survey (SDSS) telescope, supplemented by spectroscopic redshift information from the literature.

The first step in building this catalogue involved the detection of a large sample of X-ray sources (Brandt & Hasinger 2005). X-ray astronomy has developed more recently than optical astronomy (Elvis 2020), and the first major wide-area survey conducted in X-rays was the ROSAT (Truemper 1982) All-Sky Survey in 1990. ROSAT observed the sky in the 0.1 − 2.4 keV band and detected more than 105 unique sources. This sample is more than 100 times larger than previous X-ray surveys (Boller et al. 2016). Since then, XMM-Newton (Jansen et al. 2001) and Chandra (Weisskopf et al. 2002) have been observing in the X-rays with their larger collecting area and better spatial resolution. Although these observatories increased the number of known X-ray emitting sources significantly, their small fields of view were not designed for an all-sky survey. They therefore covered limited patches of the sky (the overall coverage of XMM-Newton with 4XMM is 1 383 deg2 and that of Chandra with CSC2 is 783 deg2; see Merloni et al. 2024 and references therein). The different settings for each observation also make any statistical population analysis more challenging because it is a complex task to account for the different calibrations, exposure times, depths, and sensitivities.

The extended ROentgen Survey with an Imaging Telescope Array (eROSITA, Predehl et al. 2021) on board the Spectrum Roentgen Gamma satellite (SRG, Sunyaev et al. 2021) is the most powerful wide-field X-ray survey telescope to date and was designed to provide X-ray spectroscopy and imaging of the entire sky. It was optimised to deliver a large effective area and field of view in the soft X-ray band while scanning the whole sky and to have an angular resolution that is good enough to distinguish point-like and large extended X-ray sources such as clusters of galaxies. The first release of the eROSITA all-sky survey data (Merloni et al. 2024) increased the number of known X-ray sources in the literature by > 60%. It detected almost one million objects, including stars, compact objects, galaxies, active galactic nuclei (AGNs), and clusters of galaxies.

Access to information at longer wavelengths allows the interpretation of the data for any X-ray catalogue. In particular, the information provided by optical spectroscopy (e.g. the redshift) permits a fit of the observed X-ray spectra with a correct model, and it also allows us to fix the basic source distance and luminosity for each object. The SDSS (York et al. 2000) has been providing astronomical optical-to-IR data for more than two decades. This laid the basis for several ground-breaking demographic studies of millions of sources from different astronomical classes in the Milky Way and beyond (see Ahumada et al. 2020 for the 16th data release and Almeida et al. 2023 for the 18th data release).

The multi-object spectroscopic capability of the SDSS (a few tens up to a few hundred objects per square degree can be observed simultaneously) matches the expected sky density of the X-ray sources that are detected at the typical depth of the eROSITA all-sky survey well (Merloni et al. 2012). The programme SPectroscopic IDentification of eROSITA Sources (SPIDERS, Dwelly et al. 2017; Comparat et al. 2020) was devised more than a decade ago, with the ultimate goal of providing SDSS optical spectra for a large number of X-ray sources that were detected by eROSITA so that their highly energetic processes (e.g. matter accretion, hot gas emission) could be studied, together with some of the physical properties that are often obtained from the emission and absorption lines in the optical-to-IR domain, such as density, metallicity, the ionisation parameter, and the contribution from star formation (e.g. Kewley et al. 2019).

In this paper, we present the value-added catalogue that was produced based on the optical spectra obtained from X-ray sources in the eFEDS field (eROSITA Final Equatorial Depth Survey, Brunner et al. 2022). The eFEDS observations covered ∼140 deg2 and were carried out as part of eROSITA performance verification to demonstrate that eROSITA was able to meet its all-sky survey science goals. As a pilot survey, eFEDS X-ray observations in this field are ∼40% deeper than the planned all-sky survey (Predehl et al. 2021; Brunner et al. 2022; Merloni et al. 2024). This allows more detailed studies of each object and the detection of a larger sample of faint objects. The eFEDS field location was selected to exploit existing observations at other wavelengths, which enabled the timely identification of the X-ray counterparts (Salvato et al. 2022).

At the time of its completion (November 2019), eFEDS was the X-ray survey medium-depth field with the largest number of detected sources in a contiguous footprint (see Fig. 14 from Brunner et al. 2022). Since the eFEDS data were published, many studies on the X-ray (e.g. Brunner et al. 2022; Liu et al. 2022b; Comparat et al. 2023; Schwope et al. 2024; Waddell et al. 2024; Nandra et al. 2025) and multi-wavelength (e.g. Salvato et al. 2022; Bulbul et al. 2022; Toba et al. 2022; Schneider et al. 2022; Klein et al. 2022; Li et al. 2024; Igo et al. 2024) properties of the sample were published.

One great advantage of this large dataset is that it was obtained with only one instrument for each wavelength range. This instrumental uniformity allows a more consistent statistical comparison of the objects because technical differences in calibration and the observing method do not have to be considered. A large and uniform sample allows the analysis of the parameter space through different bins of the main properties that drive the evolution of any class of astrophysical object. One possible technique that empowers this type of analysis is stacking. We used it to analyse the optical spectra of AGNs and quasars (QSOs)1 because they are the most common population of X-ray point-like sources that were found at the depth of eFEDS (Menzel et al. 2016; Liu et al. 2022b). Some of the key measurable features that influence the AGN evolution and can be studied with this catalogue are the luminosity, the black hole mass, and the fraction of the contribution from the host galaxy to the overall emission (e.g. Brandt & Alexander 2015; Merloni 2016), and the level of intervening obscuration by gas and dust. These measured quantities allowed us to address questions regarding the growth of supermassive black holes (SMBH, e.g. Brandt & Hasinger 2005; Alexander & Hickox 2012), the role of obscuration in observations of AGNs (e.g. Hickox & Alexander 2018), the coevolution of SMBHs and their host galaxies (e.g. Hopkins et al. 2008; Hickox et al. 2009; Kormendy & Ho 2013), and the comprehension of AGNs as the standard unification model and an evolutionary model (e.g. Antonucci 1993; Netzer 2015). For non-AGNs, the eFEDS catalogue also provides large samples that allow us to perform reliable statistical studies.

This manuscript is structured as follows. Section 2 presents the datasets of the eFEDS field, considering the X-ray data of eROSITA, the optical-to-IR spectroscopic data of SDSS, and the collection of spectroscopic redshifts obtained from the cross-match with other surveys that have been conducted in the eFEDS field. Section 3 describes the visual inspection process with which we confirmed the SDSS pipeline attribution of the redshift to the point-like X-ray detected sources and the method we followed to compile the spectroscopic redshifts from the available literature sources. Section 4 presents the analysis of the different classes of objects we found in the SDSS/eFEDS catalogue and discusses some of the statistical properties of the spectroscopic sample. The focus lies on the completeness and purity of the catalogue. A qualitative analysis of the diversity of the extragalactic point-like sources is presented in Sect. 5. We focus on AGNs and present stacks with a high signal-to-noise ratio as a function of redshift and optical colour. Section 6 summarises the catalogue and provides an outlook of the ongoing and future observations of eROSITA targets with SDSS-V. We also make the catalogues that we compiled and used for the plots available. Their data models are described in the appendix (see Appendix A.1 and A.3). Throughout this manuscript, we adopt a flat ΛCDM cosmology with ΩM = 0.3 and H0 = 70 km s−1 Mpc−1. Unless stated otherwise, the photometric magnitudes are given in the AB system (Oke & Gunn 1983). We use the term spec-z for a catalogue entry that provides a redshift measurement (and optionally, a broad classification) that was originally derived from (usually optical) spectroscopy.

2. Data

In this Section, we present the observations of the eFEDS field. We describe the eROSITA X-ray observations, the SDSS optical spectroscopic observations, and the compilation of redshifts for the detected sources considering multi-wavelength data from several surveys.

2.1. The eFEDS field: X-ray observations and multi-wavelength counterparts

The eROSITA X-ray telescope operates in the 0.2 − 8.0 keV band, with the main sample comprising the objects detected in the 0.2 − 2.3 keV band (Brunner et al. 2022), where the instrument is most sensitive (see Fig. 10 from Predehl et al. 2021). To verify the calibration and performance of eROSITA before starting the all-sky scanning operations, eROSITA observed a small region of the sky with long exposures. The eROSITA Final Equatorial Depth Survey (eFEDS) field comprises a region of ∼140 deg2 centred at RA = 136° and Dec = +2° (see Fig. 1). The eFEDS field was observed by eROSITA for about 360 ks (100 hours) in total between November 3 and 7, 2019, performing uniform exposures of ∼2.2 ks (∼1.2 ks after correcting for telescope vignetting), ∼40% deeper than the originally planned observations of the all-sky survey in 8 scans. This procedure produced a source catalogue with limiting flux of F0.5 − 2 keV ∼ 6.5 × 10−15 erg s−1 cm−2, sufficiently deep to allow for the detection of large statistical samples of different types of astronomical objects (e.g. stars, compact objects, galaxies, clusters of galaxies, and AGNs). The limiting flux of eFEDS reaches ∼15 − 20 times fainter sources than the one achieved by the previous X-ray all-sky survey, ROSAT, for a similar soft X-ray band (0.1 − 2.4 keV, Boller et al. 2016).

thumbnail Fig. 1.

Layout of the eFEDS field for the SPIDERS targeting and observations. The eROSITA X-ray coverage of the eFEDS field is shown in green, with the region with > 500 s of effective exposure depth shown with connected green points. SDSS-V plates are indicated as large black circles. The eFEDS Main X-ray sample is marked with small orange dots. The main sample sources having reliable spectroscopic redshifts (NORMQ = 3, see Table 2) are shown in dark blue, with red points indicating cases with unreliable redshift estimates (NORMQ < 3). The black diamonds indicate the objects that were detected in the X-ray as point-like sources, but which have counterparts suggesting that the source is likely associated with a galaxy cluster (CLUSTER_CLASS ≥ 4 from Salvato et al. 2022). The horizontal dashed line indicates the cut in δ(J2000) = − 2°, below which almost no SDSS plate observations are available, and which increases the spectroscopic completeness.

The eROSITA/eFEDS main sample includes 27 368 point-like, and 542 extended X-ray sources (total 27 910 sources, Brunner et al. 2022), following a classification entirely based on the detected X-ray morphology. Out of these extended sources, 102 were confirmed to be galaxy clusters in Liu et al. (2022a). Among the point-like sources, a small number of misclassified galaxy clusters were identified in Bulbul et al. (2022, see Sect. 5).

2.2. SDSS spectroscopic observations of the eFEDS field

The 2.5 m telescope located at the Apache Point Observatory (Gunn et al. 2006) observed a large portion of the Northern sky with the fibre system of the SDSS and the Baryon Oscillation Spectroscopic Survey (BOSS, Gunn et al. 2006; Dawson et al. 2013; Smee et al. 2013), considering different targeting programs to obtain significant samples of several types of objects. This instrument is well suited to the task because of its high multiplex, wide field-of-view, broad bandpass, and high efficiency. We have concentrated spectroscopic follow-up efforts on the part of the X-ray source population having optical counterparts with magnitude 16 < r < 22 AB.

This section provides a brief recap of the spectroscopic observations obtained in the eFEDS field over several generations of the SDSS project. We can break this down into the following four phases most relevant to eFEDS:

The SDSS-I, II, and III spectroscopy was obtained before the launch of SRG/eROSITA, but has serendipitously observed many objects that were discovered later to be optical counterparts to eFEDS X-ray sources. For these, we have used data products distributed as part of SDSS DR16 (Ahumada et al. 2020).

Within a rectangular region tightly bounding the eFEDS X-ray coverage (126° < α(J2000) < 146.2°, −3.2° < δ(J2000) <  + 6.2°, totalling 189.6 deg2), there are 61 430 spectra listed in the DR16 catalogue, corresponding to 54 743 unique astrophysical objects. These spectra are distributed fairly evenly over the eFEDS field, except for the South-West corner, which lies outside the footprint of the BOSS survey and is sparsely populated (see Fig. 5 of Almeida et al. 2023). We consider only the best quality spectrum per object (SPECPRIMARY = 1), and reject spectra having low signal-to-noise ratio2 (S/N, in this case SN_MEDIAN_ALL < 1) or unconstrained redshift uncertainties (Z_ERR ≤ 0). This gives a sample of 49 306 spec-z, of which 17 906 were obtained with the original SDSS spectrograph, and 31 400 with the BOSS spectrograph (Smee et al. 2013). The majority (91%) of these spectra were obtained as part of SDSS programmes carrying out galaxy and QSO redshift surveys (SDSS-Legacy, BOSS), and most of the remainder are associated with the SEGUE survey (Yanny et al. 2009). A relatively small fraction of the objects associated with these spectra are expected to be detectable in the X-rays by a survey as deep as eFEDS (see e.g. Vulic et al. 2022; Schneider et al. 2022, for the cases of non-active galaxies and stars, respectively). In contrast, the dedicated eFEDS plate observations, obtained as part of the SDSS-IV and SDSS-V surveys, have primarily targeted optical counterparts of eFEDS X-ray sources. We refer the reader to (Abdurro’uf et al. 2022, their Sect. 7.5) for a description of the SDSS-IV/eFEDS dataset (consisting of seven observed plates, with up to 1000 BOSS spectra obtained per plate). The dataset contains 6159 science spectra, although a significant minority of them (24%) are of lower quality (SN_MEDIAN_ALL < 1). The vast majority of targets were selected as counterparts of eFEDS X-ray point-like sources and candidate clusters of galaxies.

The SDSS-V/eFEDS target selection and dataset (comprising 37 plates) is described by Almeida et al. (2023, their Sects. 7.3 and 9.1). This project, part of the Black Hole Mapper survey, was executed in the early phases of the SDSS-V survey, before the availability of the new robotic fibre positioner, which enabled an automated transition from the traditional plate-based system (Almeida et al. 2023). A number of plates were specifically designed, constrained by the amount of dark observing time available to the project, the desire to minimise the total number of drilled plug plates, and the overall capabilities of the plate system to place fibres on naturally clustered targets. To reach the highest completeness possible, targets lacking existing high-quality spectroscopic observations (either from previous SDSS generations or other telescopes) were given a higher chance of receiving a fibre. During this phase, up to 500 BOSS fibres were available per plate (including 80 reserved for sky observations, and 20 fibres placed on spectrophotometric calibration stars). The eFEDS/SDSS-V dataset contains spectra for 13 269 science targets, of which 12 446 (94%) are optical counterparts to eFEDS X-ray sources. Of these observed X-ray targets, roughly three-quarters are counterparts to point-like X-ray sources (i.e. AGN candidates), and the remainder are candidate members of X-ray or optically selected galaxy clusters.

We note that the target selection for the dedicated SDSS-IV/V plates in the eFEDS field was reliant on early reductions of the eROSITA/eFEDS X-ray dataset (version ‘c940 V2T’), and early attempts at cross-matching with optical-to-infrared counterparts provided by the DESI Legacy Survey (DR8; Dey et al. 2019), SDSS DR13 (Albareti et al. 2017), and the Hyper Suprime-Cam Subaru Strategic Project (HSC-SSP DR2; Aihara et al. 2019). However, the differences between the X-ray catalogue (and counterparts) used for SDSS target selection, and that presented by Salvato et al. (2022) are modest. For example, of the 9199 optical counterparts to point-like X-ray sources that were spectroscopically observed as part of the SDSS-V plate programme, 7584 (82%) are still considered to be the ‘best’ counterpart to an eROSITA Main sample source in the eROSITA EDR catalogue (Brunner et al. 2022; Salvato et al. 2022). For the remaining 18% of that sample, either their X-ray detection is no longer considered significant, or an alternative counterpart is now preferred. The early (c940 V2T) X-ray catalogue and counterparts are not considered further in this work; we derive our science catalogue (see Sect. 3), solely from X-ray sources that are included in the most recent (and reliable) eFEDS Main catalogue (Brunner et al. 2022), using the best optical-to-IR counterparts presented by Salvato et al. (2022).

Figure 1 shows the layout of the eFEDS field, indicating the circular plates of SDSS-IV/V observations and the sources that belong to the SPIDERS program.

Figure 2 shows the time difference between the observations of SDSS and eROSITA for each source, both in the observed frame (left panel) and in the rest-frame (right panel). SDSS I-III observations that lie in the eFEDS field (negative values) will account for fewer objects and larger time spans, since these were not a follow-up of the eROSITA detections. SDSS IV-V observations were taken with a maximum of one year and a half after the eROSITA observations and represent the majority of the observations described in this manuscript. Having access to the information of when the data was taken can be interesting for looking for (correlated) variability, or for the study of changing-look AGNs (see e.g. Ricci & Trakhtenbrot 2023). We also emphasise that some objects were observed multiple times by SDSS: 2451 objects have two optical spectra, 341 objects have three available spectra, and 73 objects have more than three spectra. These objects are displayed in Fig. 3, where we show the difference in time between the repeated observations of the same object. These multiple observations can be used in the search for variability when comparing the different SDSS spectra and looking for variations in their emission line fluxes and shapes, or also in their continuum shape.

thumbnail Fig. 2.

Time difference between observations from SDSS and eROSITA. The top panel shows MJDSDSS–MJDeROSITA– in years. We highlight the different generations of SDSS as shaded regions, with SDSS I-III observations covering negative values (SDSS spectroscopy preceding eROSITA observations), while SDSS IV–V observations are to the right of the dashed line, indicating spectroscopic observations following the X-ray ones. The bottom panel displays the same time difference divided by (1 + z), showing the time difference in the rest-frame.

thumbnail Fig. 3.

Difference in time between couples of SDSS spectroscopic observations of the same X-ray source, in years. The sources with 2 observations (blue), 3 observations (orange), and more than 3 observations (green) are summed up in this histogram display.

2.3. Non-SDSS spectroscopic redshifts in the eFEDS field

The eFEDS field has been observed by many spectroscopic surveys that are now public, most notably SDSS, GAMA, WiggleZ, 2SLAQ, LAMOST, and Gaia RVS, targeting a mixture of stars, galaxies, and QSOs. The number of spectroscopic redshifts available within the eFEDS field aggregated across these surveys numbers in the hundreds of thousands. Many of these spectroscopic redshifts are of high quality and can be used for science applications, in particular where one primarily only needs a redshift and a broad classification. However, a careful collation and homogenisation of the existing spectroscopy catalogues is first needed to provide a reliable compendium of these data, as described below. The spectroscopic redshift catalogues used in this work are summarised in Table 1. In each case, we quote the number of selected spectroscopic redshifts that lie within a rectangular region tightly bounding the eFEDS X-ray coverage. We have assigned each of these spectra a normalised redshift quality grade (NORMQ) according to the criteria described below. We expand further on the use and meaning of NORMQ in Sect. 3.

Table 1.

Summary of catalogues and data sources providing spec-z for this project.

GAMA: The location of the eFEDS field was chosen, in part, to overlap with the ‘GAMA09’ sub-field of the Galaxy And Mass Assembly project (GAMA, Driver et al. 2009). The GAMA project obtained highly complete optical spectroscopy (spanning 3750–8850 Å) of (relatively) bright galaxies (r < 19.8) to intermediate redshifts (z ∼ 0.3) obtained with the 2dF/AAOmega instrument (Saunders et al. 2004) at the Anglo Australian Telescope. The GAMA survey covers over 60 deg2 of eFEDS (Liske et al. 2015), and so is expected to be particularly informative for both X-ray emitting galaxy clusters and the low-redshift, low-luminosity end of the AGN X-ray population. Here we use the catalogue of spectroscopic redshifts released as part of GAMA DR4 (Driver et al. 2022), selecting 74 926 spec-z entries having a spectroscopic quality grade (‘NQ’) of at least 2. We translate the GAMA quality grades into our normalised quality scheme (NORMQ) via NORMQ = NQ − 1.

WiggleZ: The WiggleZ survey (Drinkwater et al. 2010) primarily targeted UV-bright emission line galaxies at intermediate redshifts (0.2 < z < 1) as part of a cosmology redshift survey, also using 2dF/AAOmega. Approximately half of the eFEDS field is covered by the WiggleZ survey footprint. We select 20 922 spec-z entries from the final WiggleZ data release having a quality grade ‘Q’ of at least 2. We translate the WiggleZ quality grades into our normalised quality scheme (NORMQ) via NORMQ = Q − 1.

2SLAQ: The 2SLAQ survey (Croom et al. 2009) targeted luminous red galaxies and optically selected QSO candidates with the 2dF instrument (spanning 3700–7900 Å, Lewis et al. 2002), and its footprint partially overlaps the eFEDS field. We selected 990 spec-z entries from the 2SLAQ QSO catalogue (v1.2, Croom et al. 2009) having quality grades (‘qual2df’) equal to 1 or 2 (and so ignoring 2SLAQ targets lacking a spec-z, and any spec-z that were derived from SDSS data). We translate the highest 2SLAQ quality grade, qual2df = 1, to NORMQ = 3, and qual2df = 2 to NORMQ = 2.

6dFGS: The Six Degree Field Galaxy Survey (6dFGS, Jones et al. 2009) provides redshifts for a sample of well-resolved low redshift galaxies, derived from optical spectra (spanning at least 4000–7500 Å), which were obtained using the Six Degree Field multi-object spectrograph at the UK Schmidt Telescope. The 6dFGS overlaps with the part of the eFEDS survey below δ = 0°. We select 379 spec-z from the final 6dFGS data release, having spectroscopic quality grade (‘Q’) in the range 3 ≤ Q ≤ 6. For 6dFGS spec-z with Q equal to 6 or 4 we assign NORMQ = 3, and NORMQ = 2 for the remainder.

LAMOST: The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) survey has observed many bright objects (stars, galaxies, QSOs) within the eFEDS field. We select spec-z derived from low-resolution LAMOST spectra (spanning 3690–9100 Å), as reported by the LAMOST DR7_v2.0 data release (Luo et al. 2022). We apply the following selection criteria to reject low S/N and potentially problematic spec-z: S/N in the r band (‘snrr’) of at least 5, non-null redshift, and a well-constrained redshift uncertainty (‘z_err’), resulting in 62 997 spec-z. For LAMOST spectra having snnr > 10, z_err < 0.002 we assign NORMQ = 3, and assign NORMQ = 2 for the remainder.

Gaia RVS: The Gaia Radial Velocity Spectrograph (RVS, Sartoretti et al. 2018) has measured the radial velocities of bright stars over the full sky, covering a narrow wavelength range around the CaII IR triplet (8470–8710 Å). We select 15 568 RVS measurements from Gaia EDR3 (Gaia Collaboration 2021), all of which we assign NORMQ = 3.

FAST and Hectospec: The Reference Catalog of Spectral Energy Distributions of galaxies project (RCSED) has compiled spectroscopic measurements for a large (4M objects) galaxy sample (Chilingarian et al. 2017). We selected RCSED spec-z that had been derived from the FAst Spectrograph for the Tillinghast Telescope (FAST, Mink et al. 2021), and the Multi-Mirror Telescope Hectospec public archive (Fabricant et al. 2005), reported as sub-samples of the ‘v2’ RCSED database (Chilingarian et al. 2024). We retain 375 FAST and 352 Hectospec spec-z within the eFEDS footprint, requiring snr_median > 2; all of which we assign NORMQ = 2.

2MRS: The 2MASS Redshift Survey obtained spectroscopy for bright NIR-selected galaxies over the whole sky, using various facilities. We selected 152 spec-z from the v2.4 release of 2MRS (Huchra et al. 2012), all of which we assign NORMQ = 3.

SIMBAD and NED: To collate additional archival spectroscopy from smaller-scale surveys and reported observations of individual objects in the eFEDS field, we have exploited spec-z information provided by the Simbad database (Wenger et al. 2000) and the NASA Extragalactic Database (NED, Mazzarella & NED Team 2017). In each case, the archives were queried on 25 November 2021. To reduce duplication, we have attempted to filter out SIMBAD and NED entries that are associated with any of the spectroscopic surveys listed above. We selected 18 094 and 51 131 spec-z entries from SIMBAD and NED, respectively. We translate Simbad quality metric grades A and B to NORMQ = 3. We assign NORMQ = 2 for any lower quality Simbad spec-zs and for all spec-z originating from NED.

3. Methods for compiling and validating the catalogue

In this section, we describe the visual inspection procedure for the SDSS optical spectra, the collation of those visual inspections, and then finally how we have compiled the SDSS and non-SDSS redshift information into single redshift estimates per object in the eFEDS optical counterparts catalogues.

3.1. Visual inspection of the SDSS spectroscopy

The BOSS redshift pipeline was originally designed and optimised for broad-line QSOs and passive red galaxies. To verify the accuracy of such pipeline on the X-ray selected sources detected by eROSITA, we undertook an extensive visual inspection process of the entire collection of SDSS spectra of eFEDS point-like sources (14 895 spectra of the 13 079 sources followed-up with SDSS). In this section, we briefly describe the process and summarise the main lessons learned. For a more detailed analysis of the quality and failure rate of the photometric redshifts in comparison to spec-z, we refer the reader to Salvato et al. (2022).

The SDSS optical spectroscopic data reduction pipeline (idlspec2d, Bolton et al. 2012) is used to fit a set of model spectral templates to each observed spectrum. The outputs include parameters that best fit redshift (and its uncertainty), object classification, and redshift warning flags. For the vast majority of SDSS spectra that are not flagged by the pipeline as having uncertain redshifts, these pipeline-derived parameters are accurate (as confirmed by inspection, or additional spectroscopic observations; Bolton et al. 2012). However, for a minority of spectra (as discussed below), the pipeline redshift estimates can lead to catastrophic failures. Human visual inspection is required to identify and correct these failures. In addition, we can potentially use visual inspection to increase our confidence in lower-quality pipeline redshifts, thus increasing the size of the usable sample.

The visual inspection (VI) tools used by the SPIDERS team are described in Dwelly et al. (2017, see their Sect. 3.5). In summary, a web tool is used to organise the efforts of a small team of volunteers (drawn from within the authors), to inspect a subset of spectra that require validation. The inspectors are presented with an interactive graphical visualisation of each spectrum overlaid with the best-fitting pipeline model (redshift+template). The inspectors submit a response giving their opinion of the spectrum’s true redshift and classification, as well as an encoded measure of their confidence in the redshift (see Table 2).

Table 2.

Criteria used to grade the visual inspections of the SDSS spectra.

We emphasise that we did not use artificial intelligence techniques while performing our visual inspection. However, since the SDSS pipeline is based on machine learning, the results from the visual inspection (especially the corrections of the bad performances of the SDSS pipeline) can be later implemented in the training sample so the SDSS pipeline can be more reliable when dealing with the next and more numerous data releases.

3.1.1. Details of the visual inspections and their collation

The visual inspections are used to assign redshifts, classifications, and a simple normalised redshift quality metric (NORMQ) for each spectrum. Our definition of NORMQ is as follows: 3 – a secure spectroscopic redshift determined either via visual inspection or by a redshift fitting algorithm, 2 – a lower confidence spectroscopic redshift, e.g. derived from low S/N data or from a single emission line, 1 – a spectrum is available but it does not provide useful redshift constraints, –1 a spectrum visually determined to be Blazar-like, usually without strong redshift constraints, and finally, 0 – no spectroscopy is available.

We have carried out two discrete rounds of visual inspections of SDSS spectra in the eFEDS field: phase (I) inspection of SDSS-IV/eFEDS spectra obtained in March 2020 plus archival SDSS-DR16 spectra located near X-ray sources in the eFEDS field, and phase (II) inspection of SDSS-V/eFEDS spectra obtained between Dec. 2020 – May 2021. For the pre-SDSS-V dataset (phase I) we carried out 16.3k inspections of 12.6k spectra, including 2.7k spectra examined by multiple inspectors. For the SDSS-V dataset (phase II) we performed 4.1k inspections of 3.6k unique spectra, focussing our inspection efforts primarily on spectra that lay in ‘higher risk’ regions of the parameter space. For example, we require complete coverage of the lowest and highest S/N tails of the population, spectra towards the extremes of the redshift range, and any spectrum with a pipeline redshift warning flag.

An important step before exploiting the VI information was to collate and consolidate multiple inspections into a single estimate of redshift, classification, and confidence per spectrum. This task was performed separately for each of phase (I) and (II). As a result of this collation step, a second round of re-inspection and homogenisation was done (by a small number of the most experienced inspectors) for a few hundred spectra that had received conflicting VIs or that had been flagged as worthy of further attention.

The spectra were assigned collated NORMQ grades according to the hierarchical scheme given in Table 2.

3.1.2. Results of collated VIs

The main goals of the visual inspection of the objects are to guarantee that the redshifts attributed to the sources are reliable and to understand the possible failures of the SDSS pipeline to trace the potentially problematic cases in the next data releases. Therefore, it is important to evaluate the performance of both the SDSS pipeline and its warning flag for unreliable redshifts (ZWARNING ≠ 0), and of the visual inspection procedure and its warning flag (NORMQ < 3, see Table 2).

thumbnail Fig. 4.

Diagnostic plot of the visual inspection. As a function of the median signal-to-noise ratio provided by the SDSS pipeline, we show the cumulative distribution (top) and cumulative normalised fraction (bottom) of different samples defined based on their SDSS pipeline (ZWARNING) vs. VI redshift measurement quality (NORMQ). The black dotted line represents the cases that are lost for science since their redshifts are not reliable according to the VI process (NORMQ < 3). The blue dashed curve shows the cases for which SDSS data and/or pipeline fit are flagged as being problematic in some way(ZWARNING ≠ 0); the difference between the blue and the black lines shows the recovery of the VI. The red line indicates the most problematic cases, in which the visually inspected redshift is not reliable (NORMQ < 3), but there was no indication from the SDSS pipeline that the redshift could be wrongly assessed (ZWARNING = 0). The yellow dash-dotted line shows the other cases in which the redshift could not be recovered (NORMQ < 3), but for which the SDSS pipeline indicated some problem (ZWARNING ≠ 0). The combination of the red and yellow samples gives the total number of objects lost for science (black line). The green line represents the cases in which VI was needed (ZWARNING ≠ 0) and successful (NORMQ = 3).

Figure 4 displays a performance check of the visual inspection compared to the cases where SDSS did or did not identify a failure in its redshift attribution. The top panel shows the cumulative distribution of the median SDSS S/N (SN_MEDIAN_ALL), while the bottom panel exhibits the cumulative fraction3 of spectra normalised by the number of spectra with the same S/N. Here, the total sample is based on the full counterpart catalogue, with 27 369 sources. As expected, the total number of sources for each of the cases drops as the S/N increases, but the cumulative fraction increases from S/N ≳ 6 since the increase of total sources is much less significant for higher S/N, making these populations more relevant. Before the VI process (blue dashed curve), ∼10% of all spectra have ZWARNING ≠ 0, a fraction that drops to only ∼3% for S/N > 2 and stays around such proportion for higher S/N (> 10). The visual inspection allows the recovery of most failures at both high and low S/N, as shown by the green curve in comparison to the blue one. Even after the visual inspection, at S/N > 10 there are ∼0.3% cases in which the redshift is not obtained. This result was expected from the X-ray selection, since the optical observation of some AGNs can present a typical power-law continuum, but without emission lines that allow tracing the redshift (typically blazars; see also Dwelly et al. 2017).

In summary, we highlight the importance of the visual inspection to recover redshifts and provide confidence to the measurements for ∼99% of S/N > 2 spectra, and ∼94% for S/N > 0.2. In general, this exercise also demonstrates that the BOSS pipeline is very efficient in deriving reliable redshifts for the X-ray selected sources, with a significant fraction of pipeline failures identified by the pipeline itself and corrected by our VI process (552 sources or ∼2% of the total sample, green line). The residual number of truly problematic cases (i.e., unreliable redshifts without any pipeline warning, red line) is small, plateauing at about 1% for the whole sample and dropping to a sub-percent level for S/N > 2 (∼0.4% for the total sample, equivalent to 118 objects).

Another feature provided by the visual inspection is the attribution of classes to the objects. Table 3 presents the number of sources and percentages for the total sample and in parenthesis for the cases with reliable redshift (NORMQ = 3). The majority of the X-ray point-like sources are composed of AGNs with broad emission lines (e.g. Menzel et al. 2016), for simplicity classified here just as QSO. A small fraction of these objects also possess broad absorption lines, being classified as QSO_BAL (e.g. Weymann et al. 1991; Rankine et al. 2020). We emphasise that all these objects with a clear absorption component in their broad emission lines have a high confidence in their estimated redshift. The extragalactic sources with narrow lines in emission or absorption are all classified as GALAXIES. Despite their X-ray point-like emission indicating the presence of an active supermassive black hole, if the AGN is obscured, the optical spectra can resemble star-forming galaxies, Type 2 AGNs (hence the need of optical diagnostic diagrams such as in e.g. Baldwin et al. 1981; Veilleux & Osterbrock 1987; Cid Fernandes et al. 2011; Mazzolari et al. 2024), or passive galaxies (Fiore et al. 2003, see also Sect. 5). We also note that, since the X-ray observations were not taken simultaneously with the optical follow-up, there could be intrinsic variabilities in the accretion phase of the observed AGNs. All Galactic sources are classified as STARS, though they could be objects in different stages of stellar evolution, for example, cataclysmic variables, neutron stars, and white dwarfs (see e.g. Schneider et al. 2022; Schwope et al. 2024). These objects were identified in the visual inspection through stellar templates but mainly due to their continuum shape. The objects classified as BLAZARS have a power-law continuum with no emission lines and, as mentioned before, do not have a reliable redshift. Note that with this classification of BLAZARS, if a bona-fide blazar with prominent lines is observed, for example, an FSRQ (see Padovani et al. 2017, and references therein), it will have an associated redshift, and it will be classified as a GALAXY. Also, weak-line quasars (e.g. Diamond-Stanic et al. 2009; Ni et al. 2018, 2022) could fall into this category. Approximately 93% of the sources classified as UNKNOWN have a noisy spectrum (S/N < 2) that challenges a confident identification of their features. The remaining cases exhibit only one emission line or an apparent overlap of sources, so the attributed redshift has low confidence.

Table 3.

Classes attributed in the visual inspection of the eFEDS/SDSS spectra.

3.2. Building a compilation of all available spectroscopic redshifts in the eFEDS field

In this section, we describe how the wealth of spectroscopic information in the eFEDS field has been combined and homogenised into a single catalogue, providing a best estimate of classification, redshift and confidence per unique astrophysical object.

The eFEDS spec-z compilation presented here is intended to be as complete and versatile as possible, with only minimal (spatial and quality) down-selections from parent samples. Therefore, this compilation includes many spec-z for non-X-ray sources, in addition to the many counterparts to eROSITA sources that are the main focus of this paper.

In cases where multiple data sources offer spec-z info for a single astrophysical object, we determine a single ‘best’ spec-z. We use the DESI Legacy Imaging Survey DR9 optical-to-IR catalogue (Dey et al. 2019, hereafter ‘LS9’) to define the list of all unique astrophysical objects within the eFEDS field.

The process of merging the multiple spectroscopic catalogues into a single spec-z measurement per astrophysical object is carried out by a dedicated Python package (efeds_speccomp4). The algorithm for merging is as follows: (i) Filter or down-select each input catalogue by the criteria described in Sect. 2.3, (ii) normalise the heterogeneous spectral measurements (redshift, classifications, spectral quality) from each input catalogue onto a common system, (iii) attempt to associate each input spec-z with any close LS9 objects (after removing TYPE=DUP entries, since they have no optical flux assigned to them), using a radial match5 (after applying proper motions to move the LS9 objects to the approximate spectral epoch), (iv) collate and rank (by standardised quality grade) all spec-z measurements for each unique LS9 object, then determine a final redshift, class, and normalised quality per LS9 object from amongst the top-ranked spec-zs for that object.

The last step (iv) can be broken into several steps, as follows. For each unique LS9 object having at least one matching spec-z: (a) collect the set of all available spec-zi for this LS9 object (subscript i indexes over the available input spec-z), (b) if at least one of the spec-zi has been visually inspected, then discard any that do not have a VI, (c) discard any spec-zi that have a normalised quality grade lower than the maximum available normalised quality, (d) set a flag if there is significant scatter amongst the redshifts of the remaining spec-zi, (e) set another flag if there is significant scatter amongst the normalised quality of the remaining spec-zi (f) choose the ‘best’ remaining spec-zi, best according to the RANK of the input catalogue from which it was taken (see Table 1), (g) assign the SPECZ_REDSHIFT, SPECZ_NORMC, SPECZ_NORMQ equal to those of the chosen speczi, best, set SPECZ_RANK = 1 for these spec-z entries, (h) for the few spec-z which have no match to any LS9 object, we make no attempt to collate multiple measurements of the same astrophysical object, and they are propagated into the output catalogue without change.

The result of this process is a spec-z compilation catalogue containing a total of 334 258 spec-z entries, corresponding to 222 654 unique LS9 objects. The catalogue also contains 1409 ‘orphan’ spec-z which cannot be associated with any LS9 object. The data model for the spec-z compilation catalogue is described in Appendix A.3.

3.3. The eFEDS main source sample with updated spectroscopic redshifts

We matched the optical positions (taken from LS9) of the spectroscopic compilation catalogue to the optical positions of Main sample counterparts from Salvato et al. (2022, taken from LS8) using a conservative matching radius of 1 arcsec.

We found spectroscopic redshifts for 13 079/27 369 (∼48%) of the counterparts from the eFEDS main sample, of which 12 011 (∼44% of the main sample and ∼92% of the assigned redshifts) have the highest spectroscopic quality rank (NORMQ = 3). The number of objects with spectroscopic redshifts is approximately twice that presented in the earlier (pre-SDSS-V) work of (Salvato et al. 2022). Table 1 gives the number of ‘leading’ spec-z provided by each contributing (sub-)survey for eFEDS main sample X-ray sources. The objects with spec-z generally correspond to the optical brighter end of the sample, as we describe in greater detail below (Sect. 4). The data format of this catalogue is described in Appendix A.4.

4. Statistical properties of the spectroscopic sample

4.1. Colour-colour diagram and X-ray to optical distribution

The spectroscopic classes defined in the previous section (see Table 3) can also be visualised in a colour-colour diagram, as shown in Fig. 5. This colour-colour diagram with the filters grz from Legacy Survey (Dey et al. 2019) and W1 from WISE (Toba et al. 2022) demonstrates the diversity of the X-ray selected sample, with Galactic objects below the black line and extragalactic objects above it, as proposed by Salvato et al. (2022). Although there is a tendency of the GALAXIES (which encompasses optical passive galaxies, star-forming galaxies, and Type 2 AGNs) to be located in the redder part of the diagram (right), and of the QSOs to be on the blue part of the diagram (left), the X-ray selection is effective at detecting broad-line emitters with red colours, complementing the QSO colour-selected in previous generations of SDSS as in Vanden Berk et al. 2001. This capability can be seen by the orange squares in the right part of Fig. 5, showing an advantage in comparison to optical-only selections of AGNs (as also seen in e.g. Menzel et al. 2016). Regarding the rarer types of AGN that were classified with the visual inspection (BAL_QSO and BLAZARS), they are mostly found in a region where typical quasars lie, but also with some redder objects. We also plot Other Galactic sources, which are either stars or compact objects that have been detected as point-like sources with eROSITA and have a reliable counterpart but were not (yet) observed with SDSS. Some of the Other Galactic objects lie in the extragalactic locus of the plot, but we do not have their spectra available to see if their redshift is properly assigned.

thumbnail Fig. 5.

Colour-colour diagram of the sources with reliable redshifts (NORMQ = 3), colour-coded by their class as in Table 3. For the colours g − r and z − W1, g, r, and z come from the Legacy Survey, while W1 comes from WISE. Orange squares represent broad emission line objects (QSO), while pink diamonds show the objects that have a broad line with an absorption feature (QSO_BAL). Green triangles indicate extragalactic objects with narrow lines (GALAXY), and purple stars represent Galactic objects (STAR). The blue triangles represent BLAZARS, although these sources do not have reliable redshifts. Yellow circles indicate Other Galactic objects (z < 0.001) that have a reliable counterpart, though they were not observed with SDSS. Both BLAZARS and Other Galactic objects do not have NORMQ = 3. The black line divides objects between extragalactic (above) and Galactic (below) as in Salvato et al. (2022). Histograms on the top and right show the distribution of each photometric colour per class.

Figure 6 displays the optical (r-band) magnitude vs. X-ray flux of the sample. Investigation of possible trends comparing the X-ray emission and an optical filter dates back to at least Maccacaro et al. (1988), with other large compilations found in e.g. Brusa et al. (2010), or Civano et al. (2016). We adapted the equations that would define the locus of AGNs in the plane with the Legacy Survey DR8 r-band (mr, central wavelength at 6420 Å and FWHM 1480 Å, see Dey et al. 2019) and the X-ray flux in the main eROSITA band (FX, 0.2 − 2.3 keV, considering the spectra to have a power-law index Γ = 2 and a Galactic column density NH = 3 × 1020 cm−3 as in Liu et al. (2022b); see references therein for the motivation behind such values), with the flux ratios (FX/FO) corresponding to 10 and 0.1, respectively:

thumbnail Fig. 6.

X-ray flux vs. optical magnitude diagram of the sources with reliable redshifts after the visual inspection, except for BLAZARS and Other Galactic sources which have NORMQ ≠ 3. The sources are colour-coded as in Fig. 5. The region between the two lines defines the locus of quasars in this plane, adapted from Maccacaro et al. (1988).

m r 1 = 2.5 log ( F X ) 16.19 m r 2 = 2.5 log ( F X ) 11.19 . $$ \begin{aligned} m_{r1}&= -2.5\, \log (F_{\rm X}) - 16.19 \nonumber \\ m_{r2}&= -2.5\, \log (F_{\rm X}) - 11.19. \end{aligned} $$(1)

Hence, in comparison with the classes attributed via visual inspection (Table 3), the AGN locus defined by Eq. (1) (Fig. 6) is consistent with the distribution of QSOs, BAL_QSOs, and BLAZARs. Regarding the GALAXIES, ∼65% lie in the AGN locus, as expected from having a mixture of Type 2 AGNs, which should be in the AGN locus, but also star-forming and passive galaxies in the optical domain, which would lie below it. Most of the objects classified as STARS with reliable redshifts are also inside the quasar locus (∼63%), but this result is also expected since this class encompasses cataclysmic variables, which are expected to have a relatively strong soft X-ray emission compared to stars and X-ray binaries. For completeness, we also display the Galactic objects (z < 0.001) with spectroscopic redshifts from other surveys than SDSS; they populate the region well below the AGN locus defined by Eq. (1).

A similar analysis can be done based on Fig. 7, where the ratio of the X-ray flux and the optical flux from the r-band are displayed. The Other Galactic objects that were not observed with SDSS reach lower values of the X-ray flux in comparison to the optical flux, while the objects that have a reliable redshift after visual inspection have somewhat similar distributions of the FX/FO ratio, with the exception of GALAXIES that show a larger tail to lower values of the ratio.

thumbnail Fig. 7.

Distribution of the ratio of the X-ray flux (FX, flux at 0.2–2.3 keV from eROSITA) to the optical flux (FO, flux from the r-band magnitude from the Legacy Survey DR8). The classes are colour-coded as in Figs. 5 and 6. The vertical lines correspond to Eq. (1).

4.2. Redshift distribution, completeness

The distribution of the spec-z compilation in both r-band magnitude and redshift is illustrated in Fig. 8, with the redshift spanning the range of 0 ≲ z ≤ 5.8. We plot the absolute value of the redshift since Galactic objects could have negative values that correspond to their relative velocities more than the cosmic expansion, so their redshift does not necessarily represent their distance. As seen in the upper histogram, there are two main groups of sources, with Galactic objects corresponding to |z|≲10−3 and the extragalactic objects with redshifts higher than this cut. Among the Galactic objects, there is a wide spread in the observed r-band magnitudes, with SDSS providing most of the spec-z of fainter objects while surveys such as LAMOST and Gaia cover most of the bright sources. The main surveys that contribute to the redshifts of extragalactic objects in the eFEDS field are SDSS, GAMA, and WiggleZ; the cyan points represent all other surveys from Table 1 that did not have a significant number of sources when considered individually.

thumbnail Fig. 8.

r-band magnitude as a function of the absolute value of the redshift, colour-coded by the survey that provided the redshift measurement (see Nlead, X − ray in Table 1). The upper histogram shows the redshift distribution with regard to the number of counts divided by the eFEDS area (140 deg2 as in the green contour of Fig. 1, Brunner et al. 2022), while the histogram on the right displays the number of counts for r-band distribution. The absolute value of the redshift is plotted to allow the display of Galactic objects (|z|≲10−3) with negative radial velocities.

The quality of a sample can be determined by analysing its reliability and completeness. Reliability is the capability of selecting the aimed astrophysical objects without contamination of other types of objects in the sample, while completeness refers to the ability to select all the objects of such type in the sample without missing, for example, low-luminosity or highly obscured objects (Hickox & Alexander 2018). In our case, the quantity of interest is the measurement of a spectroscopic redshift.

The visual inspections of the eFEDS spectra account for the reliability by assessing a quality flag to the determined redshift (NORMQ) and a class that indicates the nature of the observed spectrum (see Sect. 3.1). To address the completeness of the sample, we can compare the sample that has reliable spec-z with other subsets of the main eFEDS sample of the X-ray selected sources and their optical counterparts (see Salvato et al. 2022). From the main eFEDS X-ray sample (27 369 sources), 13 079 objects (48%) have a spectroscopic redshift, with 14 895 spectra. Among those, 12 011 objects (13 674 spectra) have a reliable redshift according to visual inspection, corresponding to 44% of the counterpart sample.

Figure 9 assesses the completeness of sub-samples derived from the eFEDS catalogue. The two panels show the distribution of the r-band magnitude from Legacy Survey DR8 (LS8), with the top panel showing the differential spectroscopic completeness as a function of magnitude, while the bottom panel displays the cumulative one as a function of the magnitude limit of the sample. The peaks of the distributions can be explained by the limitations of the instruments used to obtain spec-z since sensitive spectrographs cannot observe too bright objects to avoid cross-talk effects on the spectra extracted on the CCD; this effect truncates the measured magnitudes and is more clearly visible in the upper panel. As expected, the sub-sample will be more complete as one cleans more of the comparison sample, indicating useful selections for dealing with the data. Comparing the different line styles, the total sample (filled lines) shows the lowest fractions in terms of completeness, since it is more polluted and includes low-quality observations. The completeness is improved if we consider purer (smaller) samples satisfying other quality criteria, such as having a reliable counterpart (dashed lines, CTP_QUALITY > 1), a threshold on the X-ray detection likelihood (dotted lines, DET_LIKE > 10), and a cut in declination as indicated in Fig. 1 (dash-dotted lines, DEC > 2). The dip seen at r ∼ 16 − 17 between spec-z from SDSS and other surveys was seen also in Figure 8, since BOSS design is aimed at observing fainter (and extragalactic) sources in comparison to other surveys that target mainly Galactic sources, such as LAMOST or Gaia.

thumbnail Fig. 9.

Differential (top) and cumulative (bottom) spectroscopic completeness fractions as a function of the r-band magnitude (top) or r-band magnitude limit (bottom). Blue lines are for all objects with a spectroscopic redshift from any survey (see Sect. 3.2), orange for those whose spec-z was obtained from SDSS, and green for other surveys. Red curves indicate the cases of high confidence for the redshift estimate after visual inspection. For each colour, the different parent samples are represented with different line styles: full sample (solid), only objects with reliable counterparts (CTP_QUALITY > 1; dashed), objects with reliable counterparts and a highly reliable X-ray detection (DET_LIKE > 10; dotted) and the most conservative case of objects with a reliable counterpart, reliable X-ray detection, and within the SDSS field coverage (Dec > 2; dot-dashed). The vertical grey line indicates r = 21.38, where the most complete sub-sample of objects with redshift from SDSS reaches its maximum fraction (dash-dotted orange line).

Considering only objects with SDSS redshift (orange lines), for r < 21.38 (below which the completeness drops rapidly), the total sample (filled line) will have spectroscopic completeness of 60%, rising to 66% for the sample with reliable counterparts (dashed line), to 70% for the sample with reliable optical counterparts and high X-ray detection likelihood (dotted line), and increasing to 72% for the sample that also accounts for the declination cut (dash-dotted line). At the same magnitude threshold, the full sample with spectroscopic redshifts (including non-SDSS observations; blue) reaches 81% of the contemplated objects, while the sample with reliable redshifts (red) reaches 77%.

Figures 10 and 11 present the spectroscopic completeness as a function of the location of sources in the colour-colour and flux-magnitude diagrams discussed above (Sect. 4.1). Figure 10 is colour-coded based on the density of sources with (CTP_QUALITY > 1) per bin, and the grey lines indicate such distribution in contours of 1, 2, and 3σ. The analysis of the completeness can be made based on Fig. 11, which is colour-coded by the spectroscopic success rate, i.e. the ratio of the number of sources with reliable redshift (NORMQ = 3) to the number of sources with a reliable optical counterpart (CTP_QUALITY > 1) per bin. The contours of Fig. 10 are plotted in Fig. 11 to facilitate the comparison. It is clear from the top right of the left panel and the top left of the right panel that the SDSS observations of the sources with reliable counterparts do not cover the faint end of the distributions, due to instrumental limitations. The Galactic bulk of the left panel is less populated in Fig. 11 when compared to Fig. 10, indicating that we are probably missing faint red stars, which are too faint for the Galactic programs used as LAMOST and Gaia (see Fig. 8). Sources at r ∼ 16 have also lower spectroscopic completeness, but this is due to a combination of instrumental limitations and samples being observed (SDSS did not prioritise faint stars, while LAMOST and Gaia did) as exemplified in Fig. 9.

thumbnail Fig. 10.

Colour-colour and flux-magnitude diagrams for the objects from the full counterpart sample with a reliable photometric counterpart (CTP_QUALITY > 1, 24 774 sources), colour-coded by the data density per bin. The left panel presents z-W1 versus g − r, where g, r, and z are obtained from Legacy Survey DR8, and W1 is from WISE Survey. The line separates extragalactic sources on the top from Galactic sources on the bottom, according to Salvato et al. (2022). The right panel shows the r-band magnitude from the Legacy Survey DR8 versus the X-ray flux in the 0.2–2.3 keV band from eROSITA. The two lines adapted from Maccacaro et al. (1988) indicate the locus of quasars. Both panels exhibit their 1, 2, and 3σ contours in grey.

thumbnail Fig. 11.

Colour-colour and flux-magnitude diagrams of the sources from the SDSS/eFEDS sample with reliable redshift (12 011 sources) in comparison to those with a reliable photometric counterpart (24 774 sources), as in Fig. 10. The colour-code is based on the spectroscopic success rate (i.e. N(NORMQ = 3)/N(CTP_QUALITY > 1)). The grey contours indicate the same contours as in Fig. 10 for the overall distribution of the X-ray sources with a reliable optical counterpart.

Figure 12 shows the source distribution in the X-ray luminosity vs. redshift plane. On the top panel we show the distribution for the objects with available spec-z. On the bottom panel, for each of the 24 774 objects with secure counterparts, we either assume the spectroscopic redshift, if available, or the photometric redshift as computed in Salvato et al. (2022). The plane is colour-coded by the spectroscopic completeness, i.e. the ratio between the number of sources with spec-z and the total number of objects with either a photometric or spectroscopic redshift in the counterpart sample. The X-ray luminosity was calculated from the eROSITA flux in the band between 0.2–2.3 keV (assuming a power-law spectrum with a spectral index Γ = 2.0 and galactic absorption of NH = 3 × 1020 cm−2 as described in Brunner et al. 2022) and the redshift, therefore not considering a K-correction. The dashed line indicates the approximate flux limit of eFEDS at 6.5 × 10−15erg s−1 cm−2 (Brunner et al. 2022), while the solid line indicates the knee of the soft X-ray AGN luminosity function (L*) according to the LADE model from Aird et al. (2015), after converting the Chandra band to the eROSITA band with the previous spectral index assumption. We display the X-ray AGN luminosity function knee to show that our sample covers both its faint and bright ends up to a redshift significantly larger than one.

thumbnail Fig. 12.

Observed X-ray (0.2 − 2.3 keV) luminosity distribution as a function of redshift. The top panel is colour-coded according to the number density in the logarithmic scale, while the bottom panel is colour-coded by spectroscopic completeness. The dashed line indicates the flux limit of the eROSITA survey, while the filled line indicates the location of the knee of the soft X-ray luminosity function according to the LADE model of Aird et al. (2015). The dotted horizontal line indicates the limit of L0.2 − 2.3keV ≤ 1042 erg s−1 applied in Sect. 5 for a pure AGN selection.

5. SDSS spectral properties of extragalactic point-like sources: AGN stacks

In this section, we provide a global assessment of the physical and spectral properties of the AGNs in the SDSS/eFEDS sample. We consider only objects that have reliable redshift (NORMQ = 3) and optical counterpart (CTP_QUALITY > 1), and X-ray spectroscopic information from the Liu et al. (2022b) catalogue. After such cuts, our sample consists of 10 295 AGN candidates.

Among the advantages of the X-ray selection of AGNs is that it provides one of the most reliable AGN samples, since other astrophysical processes generating X-rays in a galaxy, such as X-ray binaries or hot gas, will be weak in comparison (e.g. Hickox & Alexander 2018). The majority of inactive galaxies have soft X-ray luminosities L0.5 − 2.0 keV ≤ 1042 erg s−1 (Nandra et al. 2002; Menzel et al. 2016). A more detailed assessment of non-AGN contamination at lower luminosity necessitates reliable estimates of the hosts’ stellar masses and star formation rates (Lehmer et al. 2016; Igo et al. 2024). Lacking that information for all the objects in our sample, we adopted here a conservative cut of L0.2 − 2.3 keV > 1042 erg s−1 to guarantee that our AGN selection is as pure as possible. This excluded 598 objects.

In addition to this cut, to account for a pure AGN selection, one should also account for misidentifications between the X-ray detection and the optical counterparts when associating the X-ray source with observations in other wavelengths (Bulbul et al. 2022). To guarantee that our sample is not selecting AGNs inside galaxy clusters, whose X-ray emission could be entangled with the emission of the hot gas within the cluster, we use only objects with CLUSTER_CLASS < 4 according to the optical counterpart matching described in Salvato et al. (2022). This excludes 42 sources. Finally, we only consider sources with z > 0.001 (based on Fig. 8) to guarantee that we are considering extragalactic sources, which excluded 301 objects from the total initial number of AGN candidates. All the above-mentioned cuts, which have some degree of overlap, result in a sub-sample with 9660 objects discussed in this section (94% of the total sample of AGN candidates).

Another advantage of AGN X-ray selection is the ability to detect both unobscured or obscured sources6 (e.g. Ricci et al. 2022), with obscuration depending not only on the orientation of the dusty torus but also on the host galaxy and large-scale environment where the AGN resides (see e.g. Brandt & Alexander 2015; Gilli et al. 2022; Andonie et al. 2024). As shown in Liu et al. (2022b), the eROSITA/eFEDS sample is biased towards unobscured AGNs due to the lower sensitivity of the pass-band beyond 2.3 keV (∼92% of the sources with NH < 1021.5 cm−2), which makes its Main Sample (Brunner et al. 2022) focus on softer X-ray bands than other instruments. The Hard Sample, presented in Waddell et al. (2024), shows that eROSITA can detect obscured AGNs with column density up to 1024 cm−2, though the Hard Sample contains much fewer objects than the Main Sample. Hence, the objects detected at lower redshift do not have strong obscuration, while those at higher redshift have lower photon counts. Another instrumental limitation to account for is the fact that both eROSITA and SDSS are flux-limited surveys, so faint sources that could be associated with low-luminosity AGNs should be underrepresented (see the discussion about Figs. 10 and 11 for all the objects in the sample).

The X-ray selection overcomes some of the limitations of optical selection, especially for cases with obscuring material and faint or distant AGNs. Moreover, since the X-ray emission is less affected by the host-galaxy emission, it allows one to observe galaxies with different morphologies and in different stages of their evolution while still being able to separate the emission associated with the AGN from host galaxy emission. Therefore, an X-ray-selected AGN sample will show a high degree of diversity in their optical properties, with passive galaxies, star-forming galaxies, LINERs, Seyferts (see e.g. Heckman 1980 and Kewley et al. 2006 for the optical classification of local AGNs), and quasars. Even in the case of quasars, the X-ray detections allow the selection of both blue and reddened quasars (e.g. Brusa et al. 2003; Menzel et al. 2016).

5.1. Stacked spectral templates: Method

An example of the diversity of the AGNs in the eFEDS sample is shown in Fig. 13, where we have the objects displayed over the whole locus of the extragalactic sources of the colour-colour diagram of gr-zW1 (as presented in Fig. 5), colour-coded according to their intrinsic X-ray luminosity as measured by eROSITA (Liu et al. 2022b). Following the templates used in Salvato et al. (2022), we show lines of different colours indicating tracks of different types of galaxies’ SED between redshift of 0 and 3. The black dashed arrow indicates the direction in which the quasar SED models are shifted after applying increasing reddening. These tracks indicate the diversity of AGNs in our sample, and our objects can be described by at least one of such tracks when applying the correct redshift and reddening. Objects with lower X-ray luminosity populate the locus of (low-redshift) lenticular and elliptical galaxies; in contrast, the more intense X-ray emission is associated with bluer quasars, as expected.

thumbnail Fig. 13.

Colour-colour diagram with the AGN sample colour-coded by the average of the intrinsic X-ray luminosity (in the 0.2 − 2.3 keV band) per bin, following Liu et al. (2022b). The solid black line differentiates extragalactic objects on top from Galactic objects at the bottom according to Salvato et al. (2022). The coloured tracks indicate the colour-colour evolution with redshift of the SED templates from Salvato et al. (2022), with z = 0 shown as a circle. The purple track represents a spiral galaxy, brown represents a lenticular galaxy, orange represents an elliptical galaxy, red represents a Seyfert 2, and pink represents a quasar. The squares indicate the values of the redshift of z = 0.2,  0.4,  0.6,  1.0,  1.6,  2.0,  3.0. The black dashed line indicates the trend of the quasar SED when more reddening is applied.

After recognising the diversity of AGNs in our sample in colours and galaxy type, it is interesting to compare the spectral properties of these sources and describe the general features of the sample. In this paper, we will only analyse the (stacked) optical spectra of the 9660 SDSS AGNs qualitatively, and the quantitative analysis with measurements of the features in each individual spectrum and their classification will be provided in a further paper (Aydar et al. in prep.). Since the stacks mix different classes of AGN, one should only consider trends such as comparing stellar populations and AGN power-law contributions to the observed continuum, the influence of the host galaxy, and line profiles, with caution to not overinterpret this averaged information.

The stacking technique consists of obtaining the median distribution of multiple spectra, and its main benefit is the increase of the S/N per pixel, which can unveil features in the spectra such as faint emission lines (see e.g. Sect. 5.1 of Comparat et al. 2020). To define classes for the stacking by relying on information that does not depend directly on the spectral fitting of the sources, we considered the parameter space defined by redshift and g–r colour. Figure 14 shows the cuts we used to separate the objects into four redshift bins (low-z, z < 0.5; mid-z, 0.5 ≤ z < 1.0; high-z, 1.0 ≤ z < 2.0; and very high-z, z ≥ 2.0) as vertical lines. Then, for each redshift bin, we divided the sub-samples into three colour bins that have approximately the same number of objects per bin, with the intermediate colour bins for each redshift range as 0.72 ≤ (g − r)≤1.1; 0.15 ≤ (g − r)≤0.45; 0.21 ≤ (g − r)≤0.37; and 0.043 ≤ (g − r)≤0.20 for z < 0.5; 0.5 ≤ z < 1.0; 1.0 ≤ z < 2.0; and z ≥ 2.0, respectively. The similar number of objects stacked within each redshift bin provides spectra with a similar S/N, so larger uncertainties would indicate more diverse spectra being stacked and not just a statistical feature.

thumbnail Fig. 14.

Colour (g-r) and redshift distribution of the data used in the stacks from Figs. 17-24. The stacks considered four redshift bins separated by vertical lines, with triangles at z < 0.5, circles at 0.5 ≤ z < 1.0, squares at 1.0 ≤ z < 2.0, and diamonds at z ≥ 2.0. Each redshift bin was divided into three colour ranges with approximately the same number of sources per colour bin, shown as blue, green, and red.

To exemplify the groups of objects used for the stacks, in Fig. 15, we show the average X-ray luminosity in the soft band of eROSITA (Liu et al. 2022b) as a function of the average redshift and respective standard deviations for each of the three colour bins in each of the four redshift bins. The flux limitation of eROSITA and the soft X-ray luminosity function knee of Aird et al. (2015, LADE model) are also shown in Fig. 12. We observe a trend in which the redder objects have a lower average X-ray luminosity than the bluer ones, which could be associated with obscuration. This behaviour changes for the z > 2 bin, though we can also note that the average redshift of the redder objects in such bin is higher than for the bluer objects, which could be indicating a selection effect of detecting the high-luminosity end of the red population.

thumbnail Fig. 15.

Mean intrinsic X-ray (0.2 − 2.3 keV) luminosity distribution as a function of mean redshift for the stacking bins of Fig. 14, with the dashed and filled curves as in Fig. 12.

With the X-ray information available, we can also analyse the behaviour of the X-ray absorbing column density7 for each of the colour and redshift bins. In Fig. 16, we see the trend of the low- and mid-z red bins having higher values of the median column density distribution, indicating that these objects are more obscured than the blue and intermediate colours. For the high- and very high-z bins, however, the distributions are more similar, with the blue and red medians overlapping in the high-z panel, while the blue and intermediate colour medians overlap for the very high-z panel. The overall column density is larger for higher redshifts, as expected, since the observed X-ray photons from further objects will be harder and, therefore, able to penetrate through thicker absorbers.

thumbnail Fig. 16.

X-ray column density cumulative distribution per redshift and colour bin, as in Fig. 14. The vertical lines indicate the median of the column density for each bin, with the colours corresponding to the colour bin (red for the redder bin, green for the intermediate colour and blue for the bluer objects).

5.2. Stacked spectral templates: Results

Figure 17 shows the median stacking of the low-z objects (z < 0.5), with the upper panel showing the spectra of the blue, red and intermediate colours, and the lower panel showing the jackknife errors from the stack as filled lines and the number of objects per wavelength (as 1 / N $ 1/\sqrt{\mathrm{N}} $) as dashed lines. Figure 18 shows a zoomed-in version of Fig. 17 to allow focus on more details of each individual stack and the comparison between the different colours. The other redshift intervals of 0.5 ≤ z < 1.0 (mid-z), 1.0 ≤ z < 2.0 (high-z), and z ≥ 2.0 (very high-z) are shown in Figs. 1920, 2122, and 2324, respectively. In each redshift sample, the spectra are normalised in a fixed wavelength interval with no prominent emission lines, shown as a grey-shaded region. The main emission lines are highlighted with dashed vertical lines and the respective atomic transition is shown to the right of the line; the lines in Figs. 18 and 20, and in Figs. 22 and 24 are the same, to allow a comparison of the low- and high-redshift cases. We are showing in the colourful part of the spectra the wavelengths that contain more than 50% of the total spectra included, while the grey extremities of the spectra show the wavelengths containing less than half of the total spectra in the stack.

thumbnail Fig. 17.

Median stacked spectra for AGNs with z < 0.5, with the flux normalised at 5100 Å. The total sub-sample of z < 0.5 AGNs was divided into three colour bins with 614 sources each. The top panel shows with coloured lines the wavelength coverage where more than 50% of the total individual spectra contribute, with the remaining wavelength with less than half of the spectra in the stack shown in grey; the colour scheme is the same as Fig. 14. Some of the main emission and absorption lines are indicated with dashed vertical lines. The grey-filled line indicates the wavelength range around 5100 Å whose average flux was used for the normalisation. The bottom panel shows the errors calculated via the Jack Knife method in the stacking procedure divided by the median stack, and the dashed lines show 1 / N $ 1/\sqrt{\mathrm{N}} $, where N is the number of individual sources per stacked pixel.

thumbnail Fig. 18.

Zoomed-in version of the stacked spectra from Fig. 17. The emission and absorption lines are clearer, and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis and as a grey-filled vertical line.

thumbnail Fig. 19.

Similar to Fig. 17, but for 0.5 ≤ z < 1.0 AGNs. The blue and red stacks contain 870 sources each, while the intermediate stack in green has 869.

thumbnail Fig. 20.

Zoomed-in version of the stacked spectra from Fig. 19. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis and as a grey-filled vertical line. The wavelength range is the same as in Fig. 18 to facilitate the comparison of the low-redshift stacks.

thumbnail Fig. 21.

Similar to Fig 17, but for 1.0 ≤ z < 2.0 AGNs normalised with regard to the flux at 3000 Å. The blue and red stacks contain 1271 sources each, while the intermediate stack in green has 1270. The black line indicates the QSO spectrum from Vanden Berk et al. (2001).

thumbnail Fig. 22.

Zoomed-in version of the stacked spectra from Fig. 21. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis.

thumbnail Fig. 23.

Similar to Fig. 17, but for z ≥ 2.0 AGNs normalised with regard to the flux at 2000 Å. The green and red stacks contain 465 sources each, while the blue stack has 466. The black line indicates the QSO spectrum from Vanden Berk et al. (2001).

thumbnail Fig. 24.

Zoomed-in version of the stacked spectra from Fig. 23. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis. The wavelength range is the same as in Fig. 22 to facilitate the comparison of the high-redshift stacks.

Out of the total AGN sample described in this section, one object is not considered in the stacks because it does not have an available g-band. This source has the highest redshift of the SDSS/eFEDS catalogue, with z = 5.8 after VI.

5.2.1. Low-z AGNs (z< 0.5)

In the stacked spectra of the lowest redshift bin of z < 0.5 from Figs. 17 and 18, we can see the presence of both emission and absorption lines. Looking at the full spectra, normalised at 5100 Å, we see that the red stack is more dominated by the host galaxy, with the blue stack being more quasar-dominated and the intermediate stack in green making a transition between such extremes. The continuum of the red stack has the shape typical of stellar populations, while the blue stack is flatter in the logarithmic scale, as expected for AGNs.

To analyse the difference between the lines, looking at the zoomed-in plot is more instructive, where the normalisations are made in a part of the continuum closer to the wavelength of the lines. Hence, we see in Fig. 18 that the Mg II λ2799 emission, typical from quasar-dominated objects, despite being close to the limit of reliability of the spectra, is stronger in the blue than in the intermediate stack, and is not present for the red stack due to these spectra being more dominated by the host galaxy contribution than an unobscured broad-line AGN. All stacks show [Ne V] λ3426 (and faint λ3346), though the coronal lines of [Fe VII] λ3760 and [Fe X] λ6374 are not distinguishable from the noise and, therefore, undetected. The [O II] λ3727, 3729 doublet and [Ne III] λ3869 line, often used in diagnostic diagrams to distinguish nuclear activity from star formation (Kewley & Dopita 2002; Mazzolari et al. 2024), are quite similar for the red and intermediate stacks, being weaker for the blue stack, as expected since the blue stack is more quasar-dominated and less affected by the emission from the stellar populations of the host galaxy as the red stack. Confirming such a trend, the absorption lines of Ca KH and Na I D are similar for the red and intermediate stack (the red being more pronounced compared to the continuum than the intermediate for the Ca lines) while being weaker for the blue stack. As for the Balmer line Hδ, it is in absorption for the red stack, in absorption with a faint emission peak for the intermediate stack, and within the noise for the blue stack, while the Hγ is in emission for the blue and intermediate stacks, but not identified in the red stack due to the more significant contribution of the host galaxy.

Regarding the BPT lines (Baldwin et al. 1981; Veilleux & Osterbrock 1987), the [O I] λ6300 line and the [S II] λ6717, 6731 doublet are similar for all the stacks. The [O III] λ4959, 5007 doublet has the same flux intensity pattern as the Hα and [N II] λ6584 complex, being stronger for the blue stack and weaker for the red stack, with the intermediate colour between such extremes. The Hβ line, however, is stronger and broader in the blue stack, being less intense and narrower in the intermediate stack, and almost disappears in the red stack. In the [O III] λ5007 line, it is possible to see the asymmetry of the line shape through the broader left wing, which could indicate the presence of outflows.

In addition, the Fe II pseudo-continuum emission is visible at both sides of Hβ (4440 − 4680 and 5100 − 5400 Å) for the clue stack and becomes faint and mixed with the stellar continuum for the red stack, as pointed out for example by Negrete et al. (2018), and quantified for instance by Bon et al. (2020).

Therefore, all characteristics of the lines discussed above agree with the scenario in which the red stack is more dominated by stellar populations in the host galaxies, and the blue stack is more quasar-dominated, with the intermediate colours showing contributions from both quasar and star-formation activity. It is also worth pointing out that the dispersion of the blue stack is significantly larger than the ones of the intermediate and red stack. Since all stacks have the same number of spectra, this is explained by the fact that the blue stack contains a more diverse set of spectra. This was confirmed with a visual inspection, where the main difference between the blue spectra is the power-law index of the continuum. Also, the red stack contains more passive galaxies, which generally have more uniform and similar spectra.

5.2.2. Mid-z AGNs (0.5 ≤ z < 1.0)

A similar conclusion derived from the analysis of the z < 0.5 stack can also be reached in the 0.5 ≤ z < 1.0 stack shown in Figs. 19 and 20, where the red stack contains a more considerable contribution from the host galaxy while the blue stack is more AGN-dominated, with the intermediate colours in between. However, in this case, the AGN component is clearly identified in all the stacks due to the strong Mg II emission. At this intermediate redshift, we see the presence of typical AGN lines such as C II] λ2326 and [Ne IV] λ2439, and they appear more intense for the red stack due to the lower number of individual sources at this wavelength, which is also seen by the fact that the red stack seems noisier for shorter wavelengths. Once again, it is possible to detect the [Ne V] doublet, and this time, the [Fe VII] λ3760 coronal line is also visible, though very faint compared to the noise, indicating that it is likely present only in a few sources. The same trend of the red stack being more host-galaxy dominated is seen in the [O II] and [Ne III] emission lines and the Ca KH absorption lines. The Hδ line is in absorption for the red stack and within the noise for the intermediate and blue stacks.

All of the BPT lines ([S II], [N II], Hα, [O I], [O III], and Hβ) are in the wavelength range with more than half of the total spectra, but the Hβ shows the same trend as before of being broader and more intense for the blue stack compared to the red, with the intermediate stack as a transition. However, in this redshift range, the intermediate stack has the strongest [O III] λ4959, 5007 emission, with the asymmetry of the line (a possible indicator of widespread outflows) being once again noticeable.

5.2.3. High-z AGNs (1 ≤ z < 2)

Figures 21 and 22 show the stacked spectra for 1.0 ≤ z < 2.0, where the most prominent optical emission lines used in AGN diagnostic diagrams fall outside the wavelength coverage, but the broad emission lines typical from quasars are present. At those redshifts (and luminosities), the spectra are almost all QSO-dominated, as seen from the broad emission lines. For comparison, we also plot the quasar template obtained from the stack of observed spectra of SDSS 1 QSOs from Vanden Berk et al. (2001), selected based on the optical SDSS filter system. The quasar template mainly overlaps with the blue stack but is bluer at the edges of the covered wavelength range, indirectly suggesting the efficiency of the X-ray selection in detecting reddened quasars.

Regarding the Fe II pseudo-continuum emission, since we do not detect faint objects (that could include host-galaxy dominated AGNs), the optical Fe II becomes stronger as seen in the Vanden Berk spectra of Fig. 21.

On the zoomed-in version of the stack, some emission lines typical from quasars are seen, as C III] λ1909, Si III] λ1892, faint Al III λ1857, N III] λ1750, O III] λ1665, He II λ1640, and C IV λ1549. As expected from quasars, they all have similar shapes and intensities when the continuum is normalised close to their wavelength. The other lines that are typically seen on quasars, such as CII], [Ne IV], and Mg II, are also very similar, as is the typical AGN line of [Ne V]. Despite the difference in the continuum, which was already expected from the colour binning, the other difference between the stacks appears in the [O II] and [Ne III] emission lines, which trace star formation, being stronger for the red stack, consistent with the lower redshift bins. The red stack shows broad Mg II, but narrow Hβ (with a possible broad base). This should be attributed to the fact that in such different wavelength ranges, the type of AGNs dominating the stack should be different, with more AGN-dominated objects in the higher redshift and lower wavelengths in comparison to the host galaxy-dominated objects in the lower redshift and higher wavelength. This trend is also clear when comparing the red stacks at lower redshift (Fig. 17) to the ones at higher redshift (Figs. 21 and 23).

5.2.4. Very high-z AGNs (z > 2)

Finally, Figs. 23 and 24 show the stacked spectra of the z ≥ 2.0 bin, also together with the quasar template from Vanden Berk et al. (2001). Except for the larger wavelengths, once again, the quasar template overlaps the blue stack, as expected, given the flux limit of eFEDS and the prevalence of unobscured AGNs at the highest luminosities. More lines can be identified, such as the blend of Si IV and O IV] at λ1400, C II λ1335, O I λ1305, N V λ1240, Lyα, and O IV λ1033, 1038.

6. Outlook and conclusions

We reported on one of the largest systematic spectroscopic follow-up studies in X-ray survey fields, that was driven by the SDSS follow-up of eROSITA sources in the eFEDS field. The X-ray spectra cannot be interpreted without the redshift of each object. We therefore compiled all available redshifts for the eROSITA/eFEDS sources and described the compilation in Sect. 2. We considered surveys including SDSS, GAMA, WiggleZ, Gaia, and LAMOST (see Table 1 for a summary). Out of the total 27 369 point-like X-ray sources detected with eROSITA in the eFEDS field, 13 079 objects have an associated spectroscopic redshift. Some objects were observed more than once, and this resulted in 14 895 total spectra of stars, galaxies, and QSOs.

With the initial estimates of the redshifts obtained from archival public data, the SDSS spectroscopic follow-up allowed us to confirm the reliability of the redshifts and of the SDSS pipeline through a meticulous visual inspection process that we described in Sect. 3. Using strict quality cuts that were mainly informed by this inspection process, we confirmed reliable redshift estimates for 12 011 objects. We were able to reliably recover the redshifts for 99% of the sample with S/N > 2. The remaining cases are either spectra without clear emission or absorption lines or objects overlapping within the fibre. The reliable assessment of the redshift was more difficult for noisier spectra, but we were still able to recover 94% of the sample with S/N > 0.2. The visual selection process also allowed us to assign a (coarse) class for each spectrum and to further study the behaviour of the populations in different parameter spaces. This showed us that, as expected, our sample of point-like X-ray sources is dominated by AGNs. Only ∼3% of the SDSS spectroscopic sample consists of Galactic objects. The goal of this VI approach was to use human power to understand potential failures and improvements of the SDSS pipeline. This set of data might be very useful for training machine-learning techniques for the larger amount of data that will be available in the next data releases.

After flagging the reliable redshifts, we discussed the completeness and purity of the spectroscopic catalogue in Sect. 4. As expected, our sample is less complete towards the fainter end of the optical and X-ray fluxes. For X-ray sources with a very reliable counterpart (CTP_QUALITY> 1) alone, with a highly significant X-ray detection (DET_LIKE> 10), or within the footprints of the SDSS observed plates (DEC> 2°), we defined a cleaner sample for which the spectroscopic completeness reached 81% for r < 21.38. The completeness decreased markedly below this limit. To observe significantly fainter objects with similar success rates, the next generation of large area surveys that are conducted with 4m telescopes such as DESI (DESI Collaboration 2022) and 4MOST (de Jong et al. 2019) are required.

From the qualitative analysis of the stacked spectra we presented in Sect. 5, the diversity of the spectral types of the X-ray selected AGN in the eFEDS field becomes clear. The examples showed obscured (narrow-line galaxies) and unobscured AGNs, with different fractions of contribution from the host galaxy and from blue and red quasars. Other types of spectra, such as those from passive galaxies without emission lines, will not be evident through the stacking technique when they are mixed with spectra that present emission lines. These cases can be found after a visual inspection of the eFEDS individual spectra, as in the case of the AGNs within clusters of galaxies (CLUSTER_CLASS ≥ 4).

The large statistical sample presented here shows that robust trends connect the optical colour and the average X-ray column density in a given redshift interval. The sample diversity indicates its potential for the study of AGN demography and the evolution of AGNs across cosmic time through an individual study of each object or through population studies with approaches similar to those presented here.

Faithful to its nature as a pathfinder (and performance verification), the eFEDS survey paves the way to a much larger and ambitious spectroscopic follow-up program in SDSS-V BHM (Kollmeier et al., in prep.). This aims to obtain about 400k optical spectra by 2027 with a similar quality as those presented here for the X-ray sources that are detected in the eROSITA all-sky survey.

Future work (Aydar et al. in prep.) will present the optical spectral fitting of each object in eFEDS. We will provide measurements of the continuum and line features to estimate the physical properties of AGNs such as the black hole mass, the host-galaxy properties and their contribution to the observed emission, the AGN classification, the presence of outflows, and the study of specific emission lines such as coronal lines (i.e. emission lines with an ionisation potential ≥ 100 eV). Combined with those obtained from the X-ray observations (e.g. X-ray luminosity, column density, and photon index), these properties will provide a better understanding of the role of obscuration in the AGN classification, the setting of possible calibrations for scaling relations and black hole mass estimates, and a large and uniform sample that provides reliable statistics of the AGN populations according to different physical parameters.

Data availability

Table eFEDS_VAC143_AGN181_CTP17_PipelineRedshift.fits described in the Appendix A.1 and Table spectra_compilation_eFEDS_v1.4.3.fits described in the Appendix A.3 are available at the CDS via anonymous ftp to cdsarc.cds.unistra.fr (130.79.128.5) or via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/698/A132. These catalogues are also available at the eROSITA-DE Early Data Release website.


1

In this paper, we refer to AGNs and QSOs interchangeably. Historically, QSOs can refer to “bright AGNs”, although there is no standard luminosity definition (see e.g. Peterson 1997).

2

The S/N means the median SDSS signal-to-noise ratio over all valid pixels in the observed wavelength range 3280–9860 Å.

3

By cumulative fraction we mean the cumulative distribution of each sub-sample divided by the total number of objects above each given S/N threshold.

5

The matching radius per input catalogue used when compiling the spectroscopic compilation is given in Table 1 These radii were chosen to either match the fibre size, the typical object size, or were based on an empirical estimate of the consistency of the published coordinates w.r.t. LS9.

6

Obscured AGN refers to objects with NH ≥ 1022 cm−2, while unobscured AGN refers to those with lower values of the column density.

7

The column density values estimated by Liu et al. (2022b) did not account for the more reliable redshift values published in this paper, so we are using the updated version of the eROSITA/eFEDs AGN catalogue available at https://erosita.mpe.mpg.de/edr/eROSITAObservations/Catalogues/liuT/eFEDS_AGN_spec_V10.html

Acknowledgments

This work is based on data from eROSITA, the primary instrument aboard SRG, a joint Russian-German science mission supported by the Russian Space Agency (Roskosmos), in the interests of the Russian Academy of Sciences represented by its Space Research Institute (IKI), and the Deutsches Zentrum für Luft- und Raumfahrt (DLR). The SRG spacecraft was built by Lavochkin Association (NPOL) and its subcontractors, and is operated by NPOL with support from the Max Planck Institute for Extraterrestrial Physics (MPE). The development and construction of the eROSITA X-ray instrument was led by MPE, with contributions from the Dr. Karl Remeis Observatory Bamberg & ECAP (FAU Erlangen-Nürnberg), the University of Hamburg Observatory, the Leibniz Institute for Astrophysics Potsdam (AIP), and the Institute for Astronomy and Astrophysics of the University of Tübingen, with the support of DLR and the Max Planck Society. The Argelander Institute for Astronomy of the University of Bonn and the Ludwig Maximilians Universität Munich also participated in the science preparation for eROSITA. The eROSITA data shown here were processed using the eSASS software system developed by the German eROSITA consortium. Funding for the Sloan Digital Sky Survey V has been provided by the Alfred P. Sloan Foundation, the Heising-Simons Foundation, the National Science Foundation, and the Participating Institutions. SDSS acknowledges support and resources from the Center for High-Performance Computing at the University of Utah. The SDSS web site is www.sdss.org. SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration, including the Carnegie Institution for Science, Chilean National Time Allocation Committee (CNTAC) ratified researchers, Caltech, the Gotham Participation Group, Harvard University, Heidelberg University, The Flatiron Institute, The Johns Hopkins University, L’Ecole polytechnique fédérale de Lausanne (EPFL), Leibniz-Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Extraterrestrische Physik (MPE), Nanjing University, National Astronomical Observatories of China (NAOC), New Mexico State University, The Ohio State University, Pennsylvania State University, Smithsonian Astrophysical Observatory, Space Telescope Science Institute (STScI), the Stellar Astrophysics Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Illinois at Urbana-Champaign, University of Toronto, University of Utah, University of Virginia, Yale University, and Yunnan University. CA acknowledges the support of the Excellence Cluster ORIGINS, which is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC-2094 – 390783311. FEB acknowledges support from ANID-Chile BASAL CATA FB210003, FONDECYT Regular 1241005, and Millennium Science Initiative, AIM23-0001. MB acknowledges support from the European Union’s Innovative Training Network (ITN) funded by the Marie Sklodowska-Curie Actions in Horizon 2020 No 860744 (BiD4BEST). WB acknowledges the Eberly Endowment at Penn State. YD acknowledges financial support from a Fondecyt postdoctoral fellowship (3230310). LHG acknowledges funding from ANID programs: FONDECYT Iniciación 11241477, Millennium Science Initiative Program NCN2023_002, and Millennium Science Initiative AIM23-0001. TM is supported by UNAM-DGAPA by PAPIIT IN114423. TM also thanks JSPS for its support under the Invitational Fellowships for Research in Japan program (L-24523) during his sabbatical stay at Kyoto University. CAN and HJIM acknowledge the support from projects CONAHCyT CBF2023-2024-1418, PAPIIT IA104325 and IN119123. CR acknowledges support from Fondecyt Regular grant 1230345, ANID BASAL project FB210003 and the China-Chile joint research fund. RA was supported by NASA through the NASA Hubble Fellowship grant #HST-HF2-51499.001-A awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555. MK is supported by DLR grant FKZ 50 OR 2307.

References

  1. Abazajian, K. N., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2009, ApJS, 182, 543 [Google Scholar]
  2. Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [NASA ADS] [CrossRef] [Google Scholar]
  3. Ahumada, R., Prieto, C. A., Almeida, A., et al. 2020, ApJS, 249, 3 [Google Scholar]
  4. Aihara, H., AlSayyad, Y., Ando, M., et al. 2019, PASJ, 71, 114 [Google Scholar]
  5. Aird, J., Coil, A. L., Georgakakis, A., et al. 2015, MNRAS, 451, 1892 [Google Scholar]
  6. Albareti, F. D., Allende Prieto, C., Almeida, A., et al. 2017, ApJS, 233, 25 [Google Scholar]
  7. Alexander, D. M., & Hickox, R. C. 2012, New Astron. Rev., 56, 93 [Google Scholar]
  8. Almeida, A., Anderson, S. F., Argudo-Fernández, M., et al. 2023, ApJS, 267, 44 [NASA ADS] [CrossRef] [Google Scholar]
  9. Andonie, C., Alexander, D. M., Greenwell, C., et al. 2024, MNRAS, 527, L144 [Google Scholar]
  10. Antonucci, R. 1993, ARA&A, 31, 473 [Google Scholar]
  11. Baldwin, J. A., Phillips, M. M., & Terlevich, R. 1981, PASP, 93, 5 [Google Scholar]
  12. Blanton, M. R., Bershady, M. A., Abolfathi, B., et al. 2017, AJ, 154, 28 [Google Scholar]
  13. Boller, T., Freyberg, M. J., Trümper, J., et al. 2016, A&A, 588, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Bolton, A. S., Schlegel, D. J., Aubourg, É., et al. 2012, AJ, 144, 144 [NASA ADS] [CrossRef] [Google Scholar]
  15. Bon, N., Marziani, P., Bon, E., et al. 2020, A&A, 635, A151 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Brandt, W. N., & Alexander, D. M. 2015, A&ARv, 23, 1 [Google Scholar]
  17. Brandt, W. N., & Hasinger, G. 2005, ARA&A, 43, 827 [NASA ADS] [CrossRef] [Google Scholar]
  18. Brunner, H., Liu, T., Lamer, G., et al. 2022, A&A, 661, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  19. Brusa, M., Comastri, A., Mignoli, M., et al. 2003, A&A, 409, 65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  20. Brusa, M., Civano, F., Comastri, A., et al. 2010, ApJ, 716, 348 [NASA ADS] [CrossRef] [Google Scholar]
  21. Bulbul, E., Liu, A., Pasini, T., et al. 2022, A&A, 661, A10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  22. Chilingarian, I. V., Zolotukhin, I. Y., Katkov, I. Y., et al. 2017, ApJS, 228, 14 [NASA ADS] [CrossRef] [Google Scholar]
  23. Chilingarian, I., Borisov, S., Goradzhanov, V., et al. 2024, in Astromical Data Analysis Software and Systems XXXI, eds. B. V. Hugo, R. Van Rooyen, & O. M. Smirnov, ASP Conf. Ser., 535, 179 [Google Scholar]
  24. Cid Fernandes, R., Stasińska, G., Mateus, A., & Vale Asari, N. 2011, MNRAS, 413, 1687 [Google Scholar]
  25. Civano, F., Marchesi, S., Comastri, A., et al. 2016, ApJ, 819, 62 [Google Scholar]
  26. Comparat, J., Eckert, D., Finoguenov, A., et al. 2020, Open J. Astrophys., 3, 13 [Google Scholar]
  27. Comparat, J., Luo, W., Merloni, A., et al. 2023, A&A, 673, A122 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Croom, S. M., Richards, G. T., Shanks, T., et al. 2009, MNRAS, 392, 19 [CrossRef] [Google Scholar]
  29. Dawson, K. S., Schlegel, D. J., Ahn, C. P., et al. 2013, AJ, 145, 10 [Google Scholar]
  30. de Jong, R. S., Agertz, O., Berbel, A. A., et al. 2019, The Messenger, 175, 3 [NASA ADS] [Google Scholar]
  31. DESI Collaboration (Abareshi, B., et al.) 2022, AJ, 164, 207 [NASA ADS] [CrossRef] [Google Scholar]
  32. Dey, A., Schlegel, D. J., Lang, D., et al. 2019, AJ, 157, 168 [Google Scholar]
  33. Diamond-Stanic, A. M., Fan, X., Brandt, W. N., et al. 2009, ApJ, 699, 782 [NASA ADS] [CrossRef] [Google Scholar]
  34. Drinkwater, M. J., Jurek, R. J., Blake, C., et al. 2010, MNRAS, 401, 1429 [CrossRef] [Google Scholar]
  35. Driver, S. P., Norberg, P., Baldry, I. K., et al. 2009, Astron. Geophys., 50, 5.12 [NASA ADS] [CrossRef] [Google Scholar]
  36. Driver, S. P., Bellstedt, S., Robotham, A. S. G., et al. 2022, MNRAS, 513, 439 [NASA ADS] [CrossRef] [Google Scholar]
  37. Dwelly, T., Salvato, M., Merloni, A., et al. 2017, MNRAS, 469, 1065 [Google Scholar]
  38. Eisenstein, D. J., Annis, J., Gunn, J. E., et al. 2001, AJ, 122, 2267 [Google Scholar]
  39. Eisenstein, D. J., Weinberg, D. H., Agol, E., et al. 2011, AJ, 142, 72 [Google Scholar]
  40. Elvis, M. 2020, Nat. Astron., 4, 23 [Google Scholar]
  41. Fabricant, D., Fata, R., Roll, J., et al. 2005, PASP, 117, 1411 [NASA ADS] [CrossRef] [Google Scholar]
  42. Fiore, F., Brusa, M., Cocchia, F., et al. 2003, A&A, 409, 79 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Gaia Collaboration (Brown, A. G. A., et al.) 2021, A&A, 649, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Gilli, R., Norman, C., Calura, F., et al. 2022, A&A, 666, A17 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Gunn, J. E., Siegmund, W. A., Mannery, E. J., et al. 2006, AJ, 131, 2332 [NASA ADS] [CrossRef] [Google Scholar]
  46. Heckman, T. M. 1980, A&A, 87, 152 [Google Scholar]
  47. Hickox, R. C., & Alexander, D. M. 2018, ARA&A, 56, 625 [Google Scholar]
  48. Hickox, R. C., Jones, C., Forman, W. R., et al. 2009, ApJ, 696, 891 [Google Scholar]
  49. Hopkins, P. F., Cox, T. J., Kereš, D., & Hernquist, L. 2008, ApJS, 175, 390 [Google Scholar]
  50. Huchra, J. P., Macri, L. M., Masters, K. L., et al. 2012, ApJS, 199, 26 [Google Scholar]
  51. Igo, Z., Merloni, A., Hoang, D., et al. 2024, A&A, 686, A43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Jansen, F., Lumb, D., Altieri, B., et al. 2001, A&A, 365, L1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Jones, D. H., Read, M. A., Saunders, W., et al. 2009, MNRAS, 399, 683 [Google Scholar]
  54. Kewley, L. J., & Dopita, M. A. 2002, ApJS, 142, 35 [Google Scholar]
  55. Kewley, L. J., Groves, B., Kauffmann, G., & Heckman, T. 2006, MNRAS, 372, 961 [Google Scholar]
  56. Kewley, L. J., Nicholls, D. C., & Sutherland, R. S. 2019, ARA&A, 57, 511 [Google Scholar]
  57. Klein, M., Oguri, M., Mohr, J. J., et al. 2022, A&A, 661, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  58. Kormendy, J., & Ho, L. C. 2013, ARA&A, 51, 511 [Google Scholar]
  59. Lehmer, B. D., Basu-Zych, A. R., Mineo, S., et al. 2016, ApJ, 825, 7 [Google Scholar]
  60. Lewis, I. J., Cannon, R. D., Taylor, K., et al. 2002, MNRAS, 333, 279 [NASA ADS] [CrossRef] [Google Scholar]
  61. Li, J., Silverman, J. D., Merloni, A., et al. 2024, MNRAS, 527, 4690 [Google Scholar]
  62. Liske, J., Baldry, I. K., Driver, S. P., et al. 2015, MNRAS, 452, 2087 [Google Scholar]
  63. Liu, A., Bulbul, E., Ghirardini, V., et al. 2022a, A&A, 661, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  64. Liu, T., Buchner, J., Nandra, K., et al. 2022b, A&A, 661, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  65. Luo, A. L., Zhao, Y. H., Zhao, G., et al. 2022, VizieR Online Data Catalog: V/156 [Google Scholar]
  66. Maccacaro, T., Gioia, I. M., Wolter, A., Zamorani, G., & Stocke, J. T. 1988, ApJ, 326, 680 [NASA ADS] [CrossRef] [Google Scholar]
  67. Mazzarella, J. M., & NED Team 2017, in Astroinformatics, eds. M. M. Brescia, S. G. Djorgovski, E. D. Feigelson, et al., 325, 379 [Google Scholar]
  68. Mazzolari, G., Übler, H., Maiolino, R., et al. 2024, A&A, 691, A345 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  69. Menzel, M. L., Merloni, A., Georgakakis, A., et al. 2016, MNRAS, 457, 110 [NASA ADS] [CrossRef] [Google Scholar]
  70. Merloni, A. 2016, Lecture Notes in Physics (Berlin: Springer Verlag), 905, 101 [NASA ADS] [CrossRef] [Google Scholar]
  71. Merloni, A., Predehl, P., Becker, W., et al. 2012, ArXiv e-prints [arXiv:1209.3114] [Google Scholar]
  72. Merloni, A., Lamer, G., Liu, T., et al. 2024, A&A, 682, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  73. Mink, J., Brown, W. R., Chilingarian, I. V., et al. 2021, AJ, 161, 3 [NASA ADS] [Google Scholar]
  74. Nandra, K., Mushotzky, R. F., Arnaud, K., et al. 2002, ApJ, 576, 625 [CrossRef] [Google Scholar]
  75. Nandra, K., Waddell, S. G. H., Liu, T., et al. 2025, A&A, 693, A212 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Negrete, C. A., Dultzin, D., Marziani, P., et al. 2018, A&A, 620, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  77. Netzer, H. 2015, ARA&A, 53, 365 [Google Scholar]
  78. Ni, Q., Brandt, W. N., Luo, B., et al. 2018, MNRAS, 480, 5184 [NASA ADS] [Google Scholar]
  79. Ni, Q., Brandt, W. N., Luo, B., et al. 2022, MNRAS, 511, 5251 [NASA ADS] [CrossRef] [Google Scholar]
  80. Oke, J. B., & Gunn, J. E. 1983, ApJ, 266, 713 [NASA ADS] [CrossRef] [Google Scholar]
  81. Padovani, P., Alexander, D. M., Assef, R. J., et al. 2017, A&ARv, 25, 2 [Google Scholar]
  82. Peterson, B. M. 1997, An Introduction to Active Galactic Nuclei (Cambridge University Press) [Google Scholar]
  83. Predehl, P., Andritschke, R., Arefiev, V., et al. 2021, A&A, 647, A1 [EDP Sciences] [Google Scholar]
  84. Rankine, A. L., Hewett, P. C., Banerji, M., & Richards, G. T. 2020, MNRAS, 492, 4553 [CrossRef] [Google Scholar]
  85. Ricci, C., & Trakhtenbrot, B. 2023, Nat. Astron., 7, 1282 [Google Scholar]
  86. Ricci, C., Ananna, T. T., Temple, M. J., et al. 2022, ApJ, 938, 67 [NASA ADS] [CrossRef] [Google Scholar]
  87. Salvato, M., Wolf, J., Dwelly, T., et al. 2022, A&A, 661, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  88. Sartoretti, P., Katz, D., Cropper, M., et al. 2018, A&A, 616, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Saunders, W., Bridges, T., Gillingham, P., et al. 2004, SPIE Conf. Ser., 5492, 389 [NASA ADS] [Google Scholar]
  90. Schneider, P. C., Freund, S., Czesla, S., et al. 2022, A&A, 661, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  91. Schwope, A., Kurpas, J., Baecke, P., et al. 2024, A&A, 686, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  92. Smee, S. A., Gunn, J. E., Uomoto, A., et al. 2013, AJ, 146, 32 [Google Scholar]
  93. Strauss, M. A., Weinberg, D. H., Lupton, R. H., et al. 2002, AJ, 124, 1810 [Google Scholar]
  94. Sunyaev, R., Arefiev, V., Babyshkin, V., et al. 2021, A&A, 656, A132 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  95. Toba, Y., Liu, T., Urrutia, T., et al. 2022, A&A, 661, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  96. Truemper, J. 1982, Adv. Space Res., 2, 241 [Google Scholar]
  97. Vanden Berk, D. E., Richards, G. T., Bauer, A., et al. 2001, AJ, 122, 549 [Google Scholar]
  98. Veilleux, S., & Osterbrock, D. E. 1987, ApJS, 63, 295 [Google Scholar]
  99. Vulic, N., Hornschemeier, A. E., Haberl, F., et al. 2022, A&A, 661, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  100. Waddell, S. G. H., Nandra, K., Buchner, J., et al. 2024, A&A, 690, A132 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  101. Weisskopf, M. C., Brinkman, B., Canizares, C., et al. 2002, PASP, 114, 1 [NASA ADS] [CrossRef] [Google Scholar]
  102. Wenger, M., Ochsenbein, F., Egret, D., et al. 2000, A&AS, 143, 9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  103. Weymann, R. J., Morris, S. L., Foltz, C. B., & Hewett, P. C. 1991, ApJ, 373, 23 [NASA ADS] [CrossRef] [Google Scholar]
  104. Yanny, B., Rockosi, C., Newberg, H. J., et al. 2009, AJ, 137, 4377 [Google Scholar]
  105. York, D. G., Adelman, J., Anderson, Jr., J. E., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Description of released datasets

In this section, we describe the contents and format of the data files released alongside this work.

A.1. Catalogues

The main catalogues used for this paper are shown in Table A.1, including those released here for the first time. The file eFEDS_VAC143_AGN181_CTP17_PipelineRedshift.fits is a compilation of the works by Brunner et al. (2022), Salvato et al. (2022), Liu et al. (2022b), and Almeida et al. (2023) with the data that was considered for the plots in this manuscript.

Table A.1.

File name, number of entries, and reference for the catalogues that were used in this paper.

A.2. SDSS-V spectroscopy obtained in the eFEDS field, with visual inspection information

This data table was originally released as part of SDSS DR18 (Almeida et al. 2023), but was not fully described. Therefore, in Table A.2, we provide a complete column-by-column description.

Table A.2.

Data model for the SDSS-V/eFEDS dataset - a catalogue of spectra obtained during the SDSS-V/eFEDS campaign, including visual inspection information.

A.3. The eFEDS spectroscopic compilation (all spec-z)

Notes on content and usage. This catalogue contains a row for each pairing of an original spec-z and an object from the DESI Legacy Imaging Survey DR9 optical-to-IR catalogue (Dey et al. 2019, LS9). The algorithm deliberately allows a single spec-z to be associated with multiple LS9 objects. In some cases, this is because the spectroscopic aperture covers more than one astrophysical object, but in other cases, it appears that a single astrophysical object has been split into two or more entries in the LS9 catalogue. Note, selecting entries with ORIG_LS_RANK_CTP = 1 gives only the nearest LS9 object (to the SPECZ_RA, SPECZ_DEC coordinates). Selecting rows with SPECZ_RANK = 1 gives only a single best REDSHIFT, CLASS, NORMQ per astrophysical object, which is generally what is desired for science analyses. Selecting rows with SPECZ_NORMQ = 3 retains only the spectroscopic redshifts that we consider to have the highest reliability and quality. In Table A.3 we describe the column by column content.

A.4. The eFEDS Main sample catalogue with spectroscopic information

This catalogue table is presented in section 3.3. The data table was originally released as part of SDSS DR18 (Almeida et al. 2023), but was not fully described. Therefore, in Table A.4, we provide a complete column-by-column description.

Table A.3.

Data model for the eFEDS spectroscopic compilation (all spec-z).

Table A.4.

Data model for the eFEDS Main X-ray counterpart catalogue combined with the spectroscopic compilation.

All Tables

Table 1.

Summary of catalogues and data sources providing spec-z for this project.

Table 2.

Criteria used to grade the visual inspections of the SDSS spectra.

Table 3.

Classes attributed in the visual inspection of the eFEDS/SDSS spectra.

Table A.1.

File name, number of entries, and reference for the catalogues that were used in this paper.

Table A.2.

Data model for the SDSS-V/eFEDS dataset - a catalogue of spectra obtained during the SDSS-V/eFEDS campaign, including visual inspection information.

Table A.3.

Data model for the eFEDS spectroscopic compilation (all spec-z).

Table A.4.

Data model for the eFEDS Main X-ray counterpart catalogue combined with the spectroscopic compilation.

All Figures

thumbnail Fig. 1.

Layout of the eFEDS field for the SPIDERS targeting and observations. The eROSITA X-ray coverage of the eFEDS field is shown in green, with the region with > 500 s of effective exposure depth shown with connected green points. SDSS-V plates are indicated as large black circles. The eFEDS Main X-ray sample is marked with small orange dots. The main sample sources having reliable spectroscopic redshifts (NORMQ = 3, see Table 2) are shown in dark blue, with red points indicating cases with unreliable redshift estimates (NORMQ < 3). The black diamonds indicate the objects that were detected in the X-ray as point-like sources, but which have counterparts suggesting that the source is likely associated with a galaxy cluster (CLUSTER_CLASS ≥ 4 from Salvato et al. 2022). The horizontal dashed line indicates the cut in δ(J2000) = − 2°, below which almost no SDSS plate observations are available, and which increases the spectroscopic completeness.

In the text
thumbnail Fig. 2.

Time difference between observations from SDSS and eROSITA. The top panel shows MJDSDSS–MJDeROSITA– in years. We highlight the different generations of SDSS as shaded regions, with SDSS I-III observations covering negative values (SDSS spectroscopy preceding eROSITA observations), while SDSS IV–V observations are to the right of the dashed line, indicating spectroscopic observations following the X-ray ones. The bottom panel displays the same time difference divided by (1 + z), showing the time difference in the rest-frame.

In the text
thumbnail Fig. 3.

Difference in time between couples of SDSS spectroscopic observations of the same X-ray source, in years. The sources with 2 observations (blue), 3 observations (orange), and more than 3 observations (green) are summed up in this histogram display.

In the text
thumbnail Fig. 4.

Diagnostic plot of the visual inspection. As a function of the median signal-to-noise ratio provided by the SDSS pipeline, we show the cumulative distribution (top) and cumulative normalised fraction (bottom) of different samples defined based on their SDSS pipeline (ZWARNING) vs. VI redshift measurement quality (NORMQ). The black dotted line represents the cases that are lost for science since their redshifts are not reliable according to the VI process (NORMQ < 3). The blue dashed curve shows the cases for which SDSS data and/or pipeline fit are flagged as being problematic in some way(ZWARNING ≠ 0); the difference between the blue and the black lines shows the recovery of the VI. The red line indicates the most problematic cases, in which the visually inspected redshift is not reliable (NORMQ < 3), but there was no indication from the SDSS pipeline that the redshift could be wrongly assessed (ZWARNING = 0). The yellow dash-dotted line shows the other cases in which the redshift could not be recovered (NORMQ < 3), but for which the SDSS pipeline indicated some problem (ZWARNING ≠ 0). The combination of the red and yellow samples gives the total number of objects lost for science (black line). The green line represents the cases in which VI was needed (ZWARNING ≠ 0) and successful (NORMQ = 3).

In the text
thumbnail Fig. 5.

Colour-colour diagram of the sources with reliable redshifts (NORMQ = 3), colour-coded by their class as in Table 3. For the colours g − r and z − W1, g, r, and z come from the Legacy Survey, while W1 comes from WISE. Orange squares represent broad emission line objects (QSO), while pink diamonds show the objects that have a broad line with an absorption feature (QSO_BAL). Green triangles indicate extragalactic objects with narrow lines (GALAXY), and purple stars represent Galactic objects (STAR). The blue triangles represent BLAZARS, although these sources do not have reliable redshifts. Yellow circles indicate Other Galactic objects (z < 0.001) that have a reliable counterpart, though they were not observed with SDSS. Both BLAZARS and Other Galactic objects do not have NORMQ = 3. The black line divides objects between extragalactic (above) and Galactic (below) as in Salvato et al. (2022). Histograms on the top and right show the distribution of each photometric colour per class.

In the text
thumbnail Fig. 6.

X-ray flux vs. optical magnitude diagram of the sources with reliable redshifts after the visual inspection, except for BLAZARS and Other Galactic sources which have NORMQ ≠ 3. The sources are colour-coded as in Fig. 5. The region between the two lines defines the locus of quasars in this plane, adapted from Maccacaro et al. (1988).

In the text
thumbnail Fig. 7.

Distribution of the ratio of the X-ray flux (FX, flux at 0.2–2.3 keV from eROSITA) to the optical flux (FO, flux from the r-band magnitude from the Legacy Survey DR8). The classes are colour-coded as in Figs. 5 and 6. The vertical lines correspond to Eq. (1).

In the text
thumbnail Fig. 8.

r-band magnitude as a function of the absolute value of the redshift, colour-coded by the survey that provided the redshift measurement (see Nlead, X − ray in Table 1). The upper histogram shows the redshift distribution with regard to the number of counts divided by the eFEDS area (140 deg2 as in the green contour of Fig. 1, Brunner et al. 2022), while the histogram on the right displays the number of counts for r-band distribution. The absolute value of the redshift is plotted to allow the display of Galactic objects (|z|≲10−3) with negative radial velocities.

In the text
thumbnail Fig. 9.

Differential (top) and cumulative (bottom) spectroscopic completeness fractions as a function of the r-band magnitude (top) or r-band magnitude limit (bottom). Blue lines are for all objects with a spectroscopic redshift from any survey (see Sect. 3.2), orange for those whose spec-z was obtained from SDSS, and green for other surveys. Red curves indicate the cases of high confidence for the redshift estimate after visual inspection. For each colour, the different parent samples are represented with different line styles: full sample (solid), only objects with reliable counterparts (CTP_QUALITY > 1; dashed), objects with reliable counterparts and a highly reliable X-ray detection (DET_LIKE > 10; dotted) and the most conservative case of objects with a reliable counterpart, reliable X-ray detection, and within the SDSS field coverage (Dec > 2; dot-dashed). The vertical grey line indicates r = 21.38, where the most complete sub-sample of objects with redshift from SDSS reaches its maximum fraction (dash-dotted orange line).

In the text
thumbnail Fig. 10.

Colour-colour and flux-magnitude diagrams for the objects from the full counterpart sample with a reliable photometric counterpart (CTP_QUALITY > 1, 24 774 sources), colour-coded by the data density per bin. The left panel presents z-W1 versus g − r, where g, r, and z are obtained from Legacy Survey DR8, and W1 is from WISE Survey. The line separates extragalactic sources on the top from Galactic sources on the bottom, according to Salvato et al. (2022). The right panel shows the r-band magnitude from the Legacy Survey DR8 versus the X-ray flux in the 0.2–2.3 keV band from eROSITA. The two lines adapted from Maccacaro et al. (1988) indicate the locus of quasars. Both panels exhibit their 1, 2, and 3σ contours in grey.

In the text
thumbnail Fig. 11.

Colour-colour and flux-magnitude diagrams of the sources from the SDSS/eFEDS sample with reliable redshift (12 011 sources) in comparison to those with a reliable photometric counterpart (24 774 sources), as in Fig. 10. The colour-code is based on the spectroscopic success rate (i.e. N(NORMQ = 3)/N(CTP_QUALITY > 1)). The grey contours indicate the same contours as in Fig. 10 for the overall distribution of the X-ray sources with a reliable optical counterpart.

In the text
thumbnail Fig. 12.

Observed X-ray (0.2 − 2.3 keV) luminosity distribution as a function of redshift. The top panel is colour-coded according to the number density in the logarithmic scale, while the bottom panel is colour-coded by spectroscopic completeness. The dashed line indicates the flux limit of the eROSITA survey, while the filled line indicates the location of the knee of the soft X-ray luminosity function according to the LADE model of Aird et al. (2015). The dotted horizontal line indicates the limit of L0.2 − 2.3keV ≤ 1042 erg s−1 applied in Sect. 5 for a pure AGN selection.

In the text
thumbnail Fig. 13.

Colour-colour diagram with the AGN sample colour-coded by the average of the intrinsic X-ray luminosity (in the 0.2 − 2.3 keV band) per bin, following Liu et al. (2022b). The solid black line differentiates extragalactic objects on top from Galactic objects at the bottom according to Salvato et al. (2022). The coloured tracks indicate the colour-colour evolution with redshift of the SED templates from Salvato et al. (2022), with z = 0 shown as a circle. The purple track represents a spiral galaxy, brown represents a lenticular galaxy, orange represents an elliptical galaxy, red represents a Seyfert 2, and pink represents a quasar. The squares indicate the values of the redshift of z = 0.2,  0.4,  0.6,  1.0,  1.6,  2.0,  3.0. The black dashed line indicates the trend of the quasar SED when more reddening is applied.

In the text
thumbnail Fig. 14.

Colour (g-r) and redshift distribution of the data used in the stacks from Figs. 17-24. The stacks considered four redshift bins separated by vertical lines, with triangles at z < 0.5, circles at 0.5 ≤ z < 1.0, squares at 1.0 ≤ z < 2.0, and diamonds at z ≥ 2.0. Each redshift bin was divided into three colour ranges with approximately the same number of sources per colour bin, shown as blue, green, and red.

In the text
thumbnail Fig. 15.

Mean intrinsic X-ray (0.2 − 2.3 keV) luminosity distribution as a function of mean redshift for the stacking bins of Fig. 14, with the dashed and filled curves as in Fig. 12.

In the text
thumbnail Fig. 16.

X-ray column density cumulative distribution per redshift and colour bin, as in Fig. 14. The vertical lines indicate the median of the column density for each bin, with the colours corresponding to the colour bin (red for the redder bin, green for the intermediate colour and blue for the bluer objects).

In the text
thumbnail Fig. 17.

Median stacked spectra for AGNs with z < 0.5, with the flux normalised at 5100 Å. The total sub-sample of z < 0.5 AGNs was divided into three colour bins with 614 sources each. The top panel shows with coloured lines the wavelength coverage where more than 50% of the total individual spectra contribute, with the remaining wavelength with less than half of the spectra in the stack shown in grey; the colour scheme is the same as Fig. 14. Some of the main emission and absorption lines are indicated with dashed vertical lines. The grey-filled line indicates the wavelength range around 5100 Å whose average flux was used for the normalisation. The bottom panel shows the errors calculated via the Jack Knife method in the stacking procedure divided by the median stack, and the dashed lines show 1 / N $ 1/\sqrt{\mathrm{N}} $, where N is the number of individual sources per stacked pixel.

In the text
thumbnail Fig. 18.

Zoomed-in version of the stacked spectra from Fig. 17. The emission and absorption lines are clearer, and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis and as a grey-filled vertical line.

In the text
thumbnail Fig. 19.

Similar to Fig. 17, but for 0.5 ≤ z < 1.0 AGNs. The blue and red stacks contain 870 sources each, while the intermediate stack in green has 869.

In the text
thumbnail Fig. 20.

Zoomed-in version of the stacked spectra from Fig. 19. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis and as a grey-filled vertical line. The wavelength range is the same as in Fig. 18 to facilitate the comparison of the low-redshift stacks.

In the text
thumbnail Fig. 21.

Similar to Fig 17, but for 1.0 ≤ z < 2.0 AGNs normalised with regard to the flux at 3000 Å. The blue and red stacks contain 1271 sources each, while the intermediate stack in green has 1270. The black line indicates the QSO spectrum from Vanden Berk et al. (2001).

In the text
thumbnail Fig. 22.

Zoomed-in version of the stacked spectra from Fig. 21. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis.

In the text
thumbnail Fig. 23.

Similar to Fig. 17, but for z ≥ 2.0 AGNs normalised with regard to the flux at 2000 Å. The green and red stacks contain 465 sources each, while the blue stack has 466. The black line indicates the QSO spectrum from Vanden Berk et al. (2001).

In the text
thumbnail Fig. 24.

Zoomed-in version of the stacked spectra from Fig. 23. The emission and absorption lines are clearer and the differences between the stacked colours are enhanced. The wavelength of the flux used for the normalisation of each plot is indicated in the ordinate axis. The wavelength range is the same as in Fig. 22 to facilitate the comparison of the high-redshift stacks.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.