Open Access
Issue
A&A
Volume 670, February 2023
Article Number A125
Number of page(s) 28
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202245072
Published online 15 February 2023

The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1. Introduction

Nebular attenuation1 due to dust grains is ubiquitous in astronomical observations. Besides the Galactic extinction, observations of emission lines from extragalactic H II regions inevitably suffer from different degrees of intrinsic attenuation inside their host galaxies. The overall attenuation of the emission is usually described by a parameterized nebular attenuation curve, whose shape is determined by the dust composition, grain size distribution, and geometry. Understanding the attenuation curve is important for accurate measurements of fluxes of emission lines and their derived physical properties, for example, star formation rate, metallicity, nitrogen-to-oxygen ratio (N/O), ionization parameter, among others (e.g., Pagel et al. 1979; Alloin et al. 1979; Diaz et al. 1991; Thurston et al. 1996; Dopita et al. 2000; Kennicutt & Evans 2012). Its shape also provides information about the dust sizes and compositions around emission-line regions (e.g., Spitzer 1978; Draine 2003; Flaherty et al. 2007). In addition, by comparing the average nebular attenuation curve with the average stellar attenuation curve, we can gain information about the different dust geometries around young stellar populations and older stellar populations (e.g., Charlot & Fall 2000; Inoue 2005; Chevallard et al. 2013).

Various attempts have been made to measure the nebular attenuation in both low-redshift and high-redshift emission-line galaxies (e.g., Reddy & Steidel 2004; Wild et al. 2011b; Price et al. 2014; Reddy et al. 2020; Rezaee et al. 2021). Many recent works suggest that the average nebular attenuation curve found in external galaxies has a similar shape to the average Galactic extinction curve with some slight differences (Wild et al. 2011b; Reddy et al. 2020; Rezaee et al. 2021). Although considerable dispersion around the average curve depending on galaxy properties is expected (e.g., Zafar et al. 2015; Fitzpatrick & Massa 2007; Salim & Narayanan 2020), due to the limitations of the methods and samples, this potential variation has not been investigated in depth so far. Nevertheless, there is evidence that simply applying the average Galactic extinction curve to extragalactic H II regions is inaccurate as the residual line ratio between the extinction-corrected data and the predictions from photoionization models show an unexpected dependence on the Balmer decrement (see Fig. 14 in Ji & Yan 2022). Furthermore, there could be a bias in the derived attenuation due to the complicated dust geometry, which depends on the aperture size (Vale Asari et al. 2020).

Most of the current studies on the optical attenuation curve rely on Balmer lines. In H II regions, the ratios of the intrinsic fluxes of Balmer lines are roughly fixed and are relatively insensitive to variations in temperature and density (e.g., flux H α / flux H β 2 . 86 0.11 + 0.18 $ \mathrm{flux}_{\mathrm{H}\alpha}/\mathrm{flux}_{\mathrm{H}\beta} \approx 2.86^{+0.18}_{-0.11} $ for an H II region with nH ≲ 106 cm−3 and 5000 K ≲ T ≲ 20 000 K, Osterbrock & Ferland 2006). Therefore, any significant deviation from the theoretical ratios for Balmer lines can be interpreted as the effect of dust attenuation. However, this method can only cover a relatively small range of wavelength in the optical (roughly 4000–6600 Å). Also, higher order Balmer lines (blueward of Hγ with upper levels at n > 5) are relatively weak in observations, both because they are intrinsically weak and because they are more attenuated. In addition, absorption features in the blue part of the optical spectra add further uncertainties to the measurements of weak Balmer lines. These make this method inapplicable for a large number of galaxies whose emission-line spectra have intermediate to low signal-to-noise ratios (S/Ns). Stacking galaxy spectra could partly circumvent this problem, but this would also miss a lot of information about the variation of the nebular attenuation curve in different galaxies or regions.

Finally, it is often implicitly assumed in extragalactic studies that all of the other nebular emission lines follow the same attenuation curve probed by Balmer lines. There are two major caveats associated with this assumption. First, the attenuation for emission lines not lying in the spectral range covered by Balmer lines is not well constrained. Second, this assumption requires that all emission lines come from the same spatial location and thus experience the same level of attenuation, whose validity is rarely carefully evaluated.

Compared to high-order Balmer lines, strong optical-to-near infrared (NIR) forbidden lines cover a much larger spectral range and are easier to measure. However, the intrinsic ratios of these lines strongly depend on nebular parameters such as the metallicity and ionization parameter. To determine the intrinsic ratios of attenuation-sensitive forbidden lines, one has to rely on theoretical or empirical calibrations of nebular parameters, which could bring large systematic uncertainties since different calibrations generally do not agree with each other (e.g., Kewley & Ellison 2008; Kewley et al. 2019; Ji & Yan 2022). The key question is whether it is possible to find an empirical way to constrain these nebular parameters without specifying their values. One potential and indirect solution is to compare the attenuated spectra from galaxies with similar physical properties including redshifts, stellar masses, star formation rates (SFRs), and so forth, but different degrees of attenuation. The nebular parameters of these galaxies are likely similar given the galaxy scaling relations, and thus the differences in the attenuation-sensitive strong line ratios are likely mainly driven by dust attenuation. Wild et al. (2011a), for example, used a similar pair-matching approach to empirically measure the stellar attenuation curve in external galaxies. Still, one needs to be cautious about the scatter in the adopted galaxy scaling relations and how they transfer into variations in the galaxy spectra. Also, the uncertainties in the controlled physical parameters (e.g., stellar masses, SFRs, etc.) require careful treatments.

In this work, we adopt a slightly different approach by taking the observed quantities rather than the derived ones as the controlled physical parameters. We start with the assumption that, within the same observed star-forming (SF) region, all emission lines follow the same attenuation curve scaled by the same reddening parameter [AV or E(B − V)]. By using a 3D line-ratio space spanned by three attenuation-insensitive line ratios, [N II]λ6583/Hα, [S II]λλ6716, 6731/Hα, and [O III]λ5007/Hβ, we constrain the nebular parameters of observed SF spatial pixels (spaxels) and thus the intrinsic ratios of forbidden lines. We show that small regions in this 3D line-ratio space correspond to a set of nearly constant nebular parameters, including metallicity (or O/H) and ionization parameter. In comparison, the variation of the dust attenuation inside these small regions can still be significant, driving most of the variations in attenuation-sensitive line ratios. This allows us to construct relative attenuation curves using a large number of extragalactic SF regions with detectable strong forbidden lines and the two strongest Balmer lines, Hα and Hβ. However, our results show that a single attenuation curve is insufficient to describe all emission lines, meaning this fundamental assumption of many extragalactic emission line studies breaks down. Based on our findings, we discuss potential explanations and suggest ways to move forward for future emission-line studies.

The layout of this paper is as follows. In Sect. 2, we introduce the observational data we use. In Sect. 3, we detail the method we use to measure the nebular attenuation curve. In Sect. 4, we show our results and compare the attenuation curve we derive with those in the literature. In Sect. 5, we examine some physical models and observational effects that could potentially explain our results. In Sect. 6, we estimate the impact of the line-specific attenuation on nebular diagnostics. We discuss the robustness of our method in Sect. 7, and draw our conclusions in Sect. 8. We also present several tests on different linear regression algorithms and emission line measurements in Appendices A and B. In Appendix C, we apply our method to a data set with high spatial resolutions from a single galaxy, IC 342. For convenience, we only show the wavelength of any given forbidden line when first introduced, unless there are other emission lines with different wavelengths but produced by the same kind of ion.

2. Data

Our main sample is drawn from the 11th product launch of the Mapping Nearby Galaxies at Apache Point Observatory survey (MaNGA, Bundy et al. 2015; Yan et al. 2016a), which is equivalent to the MaNGA products within the 17th public data release of the Sloan Digital Sky Survey (SDSS DR17, Abdurro’uf 2022). MaNGA is one of the three core programs of SDSS-IV (Blanton et al. 2017). It collected integral field unit (IFU) data using the 2.5 m Sloan telescope (Gunn et al. 2006) for over 10 000 galaxies at 0.01 < z < 0.14 with a nearly flat stellar mass distribution in 109 < M*/M < 1011 (Wake et al. 2017). Its IFUs are hexagonal bundles of optical fibers with numbers ranging from 19 to 127 (Drory et al. 2015), and a three-point dithering scheme was adopted as the main observing strategy (Law et al. 2015). Meanwhile, MaNGA used another set of small fiber bundles to target standard stars for flux calibrations, achieving an absolute calibration uncertainty less than 5% for 89% of the wavelength range (Yan et al. 2016b). The spectra were obtained through the BOSS spectrographs with a spectral resolution of R ∼ 2000 and a wavelength coverage of 3622 Å < λ < 10 354 Å (Smee et al. 2013). The observed spectra were first reduced by the Data Reduction Pipeline (DRP, Law et al. 2016, 2021a) and then processed by the Data Analysis Pipeline (DAP, Westfall et al. 2019; Belfiore et al. 2019) to produce science products including measurements of emission line fluxes. Throughout this work, we used the Gaussian fitted emission line fluxes from the DAP (see Appendix B for the results based on summed fluxes).

We selected our sample using the optical diagnostic diagrams. Specifically, we used the 3D diagnostic diagrams introduced by Ji & Yan (2020), which combines the traditional [N II]- and [S II]-based BPT diagrams (Baldwin et al. 1981; Veilleux & Osterbrock 1987). The selection function is given by

P 1 < 1.57 P 2 2 + 0.53 P 2 0.48 , $$ \begin{aligned} P_1 < -1.57 P_2^2 + 0.53 P_2 - 0.48, \end{aligned} $$(1)

where

P 1 = 0.63 N 2 + 0.51 S 2 + 0.59 R 3 , $$ \begin{aligned} P_1 = 0.63 N2 + 0.51 S2 + 0.59 R3, \end{aligned} $$(2)

and

P 2 = 0.63 N 2 + 0.78 S 2 . $$ \begin{aligned} P_2 = -0.63 N2 + 0.78 S2. \end{aligned} $$(3)

Here we have used N2, S2, and R3 to denote the decimal logarithms of [N II]/Hα, [S II]/Hα, and [O III]/Hβ. Before applying this criterion, we excluded spaxels that were flagged by the DAP as problematic or have S/N in Hα, Hβ, [O III], [N II], [S II], [O II]λλ3726, 3729, or [S III]λλ9069, 9531 smaller than 3. We included [O II] and [S III] during the sample selection as they are important for determining the blue end and red end of the attenuation curve.

Our selection criterion corresponds to a roughly fixed fraction of star-formation contribution based on the theoretical models of Ji & Yan (2020)2. It also allows us to include some SF spaxels in the outskirt of galaxies that would have been missed by the widely adopted demarcation lines (e.g., Kewley et al. 2001; Kauffmann et al. 2003) based on the 2D diagrams. Regardless, we note that our conclusions remain largely unchanged if we use the traditional demarcation lines for selection instead. We also tried the empirical selection of SF spaxels based on gas kinematics (Law et al. 2021b), which again produced similar results. Our final sample consists of ∼2.4 × 106 spaxels. Figure 1 shows the distribution of our sample in the 3D diagnostic diagram. For now we do not apply any further cut to the data. We check the effect of additional selection criteria on our results in Sect. 7.5.

thumbnail Fig. 1.

Distribution of the MaNGA sample viewed in a 3D line-ratio space and in one of its 2D projections. Upper left: 3D density map of the MaNGA sample, where SF spaxels are colored from black to white and other ionized regions are colored from purple to yellow. A photoionization model computed by Ji & Yan (2020) using CLOUDY is also shown, whose color coding indicates the metallicity of the simulated H II region. The blue cube is an exaggerated illustration of the 3D bins we used in this work. Upper right: the same photoionization model is shown, but color coded according to the ionization parameter of the simulated H II region. Bottom: a 2D projection of the 3D space that corresponds to an “edge-on” view of the data (color map) and model (blue). The two axes, P1 and P2, are linear combinations of the original three axes, which are denoted as N2, S2, and R3. For clarity, we only show the part of the model that covers the middle 98% of the data along the line of sight. It is clear from this view that the model surface cuts through the center of the SF spaxel distribution.

Apart from our main sample, we also made use of the MaNGA observation of a nearby galaxy IC 342, which is one of the ancillary targets in MaNGA. IC 342 is a nearby (3.3 Mpc, Saha et al. 2002) massive (M* = 109.95M, Zibetti et al. 2009) SF spiral galaxy. At the distance of IC 342, MaNGA has a spatial resolution of ∼32 pc, which is much higher than the typical resolution of 1–2 kpc for the MaNGA main sample. With IC 342, we checked the attenuation law on scales close to the size of individual H II regions. Although we only have one such nearby target in MaNGA, it provides important evidence on the scale-dependence of the nebular attenuation law. We show the analyses of the IC 342 data in Appendix C.

3. Method

In this section we introduce the method we used to measure the nebular attenuation. For a given emission line with intrinsic flux fλ0, the attenuated flux fλ is given by

f λ = 10 0.4 A λ f λ 0 , $$ \begin{aligned} f_\lambda = 10^{-0.4 A_{\lambda }}f_{\lambda 0}, \end{aligned} $$(4)

where Aλ is the wavelength-dependent attenuation in magnitudes. The ratio of two emission lines can then be expressed as

f λ 1 / f λ 2 = 10 0.4 ( A λ 1 A λ 2 ) f λ 1 0 / f λ 2 0 . $$ \begin{aligned} f_{\lambda _1}/f_{\lambda _2} = 10^{-0.4 (A_{\lambda _1} -A_{\lambda _2}) }f_{\lambda _1 0}/f_{\lambda _2 0}. \end{aligned} $$(5)

If the intrinsic flux ratio, fλ10/fλ20, is known, one can calculate the difference in attenuation between the two emission lines using the observed flux ratio. In SF H II regions, the intrinsic flux ratios of Balmer lines are roughly constants (Osterbrock & Ferland 2006). For example, under the case B recombination, the intrinsic ratio between the first two Balmer lines is roughly 2.86. Using the observed fluxes of Balmer lines, one can then measure the relative attenuation at their wavelengths, which is one of the widely adopted methods to determine the nebular attenuation curve.

Apart from Balmer lines, there are many strong forbidden lines from optical to NIR that can be observed in H II regions, such as [O II]λλ3726, 3729, [O III]λ5007, [N II]λ6583, [S II]λλ6716, 6731, [S III]λλ9069, 9531, etc. The intrinsic ratio between any two forbidden lines, however, is not a constant and varies among H II regions. In principle, the intrinsic ratio is set by the physical parameters of H II regions. To a first approximation, the variations in the intrinsic line ratios are driven by the variation in the metallicity and ionization parameter. Here the metallicity is defined as the overall chemical abundance in the gas phase, which is usually represented by the oxygen abundance, 12+log(O/H). The ionization parameter, U, is defined as the relative strength of the ionizing radiation, or Φ 0 n H c $ \frac{\Phi _0}{n_{\mathrm{H}} c} $, where Φ0 is the flux of the ionizing photons, nH is the volume density of hydrogen, and c is the speed of light. One way to understand this approximation is to look at the BPT diagrams, where pairs of forbidden-to-Balmer line ratios with similar wavelengths are used. The ratios between these emission lines are insensitive to attenuation and are close to their intrinsic ratios, if one assumes that they follow the same attenuation curve. In these diagrams, it is shown that photoioniztion model grids of varying metallicity and ionization parameter can well cover the data locus of SF regions (e.g., Dopita et al. 2000; Kewley et al. 2001, 2019; D’Agostino et al. 2019)3.

As shown by Ji & Yan (2020), 2D diagnostic diagrams might suffer from projection effects, making it difficult to tell whether a model grid fits the data distributions in two or more diagrams in a consistent manner. By combining more than two line ratios to form a high-dimensional diagram, one can put a much more stringent constraint on the position of the ideal model that self-consistently predicts all the line ratios. In Fig. 1, we plotted our best-fit photoionization model grid for MaNGA SF spaxels (Ji & Yan 2020) by varying the metallicity and ionization parameter of a simulated H II region using the photoionization code CLOUDY (Ferland et al. 2017). Comparing the area covered by the photoionization model and the spatial distribution of the SF spaxels, we see that the location of any given datum in this line-ratio space is, to a good approximation, determined by a specific combination of a given metallicity and ionization parameter. Variations in other nebular parameters (e.g., hardness of the ionizing spectrum, abundance patterns for secondary elements, etc.) manifest themselves as scatters around the model surface, and can be covered by a more complicated model grid in principle, if we allow changes in these parameters.

With this approximation, we can rewrite Eq. (5) as

log f λ 1 f λ 2 = log ( f λ 1 f λ 2 ) 0 0.4 ( A λ 1 A λ 2 ) = F λ 1 , λ 2 ( O / H , U , . . . ) + A λ 1 A λ 2 A H α A H β log f H α / f H β 2.86 = F λ 1 , λ 2 ( O / H , U , . . . ) + m λ 1 , λ 2 log f H α / f H β 2.86 , $$ \begin{aligned}&\log \frac{f_{\lambda _1}}{f_{\lambda _2}} = \log {\left(\frac{f_{\lambda _1}}{f_{\lambda _2}}\right)}_0 -0.4(A_{\lambda _1} - A_{\lambda _2})\nonumber \\&\qquad \quad =F_{\lambda _1, \lambda _2}({\mathrm{O/H}, U, ...}) + \frac{A_{\lambda _1} - A_{\lambda _2}}{A_{\rm H\alpha } - A_{\rm H\beta }}\log \frac{f_{\rm H\alpha }/f_{\rm H\beta }}{2.86}\\&\qquad \quad =F_{\lambda _1, \lambda _2}({\mathrm{O/H}, U, ...}) +m_{\lambda _1, \lambda _2}\log \frac{f_{\rm H\alpha }/f_{\rm H\beta }}{2.86},\nonumber \end{aligned} $$(6)

where Fλ1, λ2(O/H, U, ...) describes the intrinsic ratios between selected lines as a function of the metallicity, ionization parameter, and other nebular parameters of H II regions. Also, we have used the relation between the fluxes of the first two Balmer lines

log ( f H α / f H β / 2.86 ) = 0.4 ( A H α A H β ) . $$ \begin{aligned} \log (f_{\rm H\alpha }/f_{\rm H\beta }/2.86) = -0.4(A_{\rm H\alpha } - A_{\rm H\beta }). \end{aligned} $$(7)

If we define

l λ A λ A V , $$ \begin{aligned} l_\lambda \equiv \frac{A_\lambda }{A_V}, \end{aligned} $$(8)

to describe the normalized attenuation curve, we have

m λ 1 , λ 2 = l λ 1 l λ 2 l H α l H β . $$ \begin{aligned} m_{\lambda _1, \lambda _2}=\frac{l_{\lambda _1}-l_{\lambda _2}}{l_{\rm H\alpha }-l_{\rm H\beta }}. \end{aligned} $$(9)

Here mλ1, λ2 describes the relative attenuation difference between λ1 and λ2 with respect to the attenuation difference between Hα and Hβ. The key of our method is to constrain the variation in Fλ1, λ2(O/H, U, ...), making it negligible compared to the variation in attenuation. If we achieve this, we can easily derive the relative attenuation, mλ1, λ2, by fitting a linear model to the “reddening relation” between log f λ 1 f λ 2 $ \log \frac{f_{\lambda _1}}{f_{\lambda _2}} $ and log f H α f H β $ \log \frac{f_{\mathrm{H\alpha}}}{f_{\mathrm{H\beta}}} $.

We summarize our method as follows. First, we selected three line ratios that are usually thought to be attenuation-insensitive, which are [N II]/Hα, [S II]/Hα, and [O III]/Hβ. Second, we binned our sample in the 3D space spanned by log([N II]/Hα), log([S II]/Hα), and log([O III]/Hβ) (or N2, S2, and R3 for short) as illustrated in Fig. 1. Following our argument about the variation of line ratios in the 3D diagram, there is no degeneracy between O/H and U in this space. Therefore, there exists inverse functions to describe the metallicity and ionization parameter with these line ratios, that is, ( O / H ) F 0 1 ( N 2 , S 2 , R 3 ) $ (\mathrm{O/H}) \approx F^{-1}_{0}(N2, S2, R3) $ and U F 1 1 ( N 2 , S 2 , R 3 ) $ U \approx F^{-1}_{1}(N2, S2, R3) $. Thus, any small variation in O/H or U corresponds to small variations in line ratios. For example,

δ ( O / H ) c 0 ( δ N 2 ) + c 1 ( δ S 2 ) + c 2 ( δ R 3 ) , $$ \begin{aligned} \delta (\mathrm{O/H}) \approx c_0 (\delta N2) + c_1 (\delta S2) + c_2 (\delta R3), \end{aligned} $$(10)

where c0, c1, and c2 are coefficients also depending on N2, S2, and R3. We can then rewrite the intrinsic variations for attenuation sensitive line ratios as

δ F λ 1 , λ 2 a 0 δ ( O / H ) + a 1 δ U + i a i δ ( secondary parameters ) c 0 ( δ N 2 ) + c 1 ( δ S 2 ) + c 2 ( δ R 3 ) . $$ \begin{aligned}&\delta F_{\lambda _1, \lambda _2} \approx a_0 \delta (\mathrm{O/H}) + a_1 \delta U + \sum _i\ a_i\delta (\mathrm{secondary\ parameters})\nonumber \\&\qquad \ \ \approx c_0^{\prime } (\delta N2) + c_1^{\prime } (\delta S2) + c_2^{\prime } (\delta R3). \end{aligned} $$(11)

Again, ai and c i $ c_i^{\prime} $ are coefficients. It is now clear that if we make the size of each bin small enough, the variation in the metallicity, ionization parameter, and other nebular parameters would be small as well. In addition, since we binned the data in a line-ratio space that is insensitive to attenuation, we did not constrain the variation in attenuation in each bin. As a result, the variation of the second term in the rightmost hand side of Eq. (6) dominates over that of the first term. Finally, we performed a linear regression in each bin, treating log(fλ1/fλ2) as a linear function of log(fHα/fHβ/2.86). The slope of the relation then gives the relative attenuation with respect to the Balmer lines, as shown in Eq. (9).

We emphasize that our method does not have any dependence on photoionization models, as the selection is completely based on the data. Our only assumption is that the three line ratios we use are able to constrain the major parameters4 of SF spaxels, which fix their intrinsic line ratios.

In principle, the smaller the size of the bin, the more accurate our method becomes. However, reducing the bin size also reduces the number of data points within each bin. Therefore, uncertainties associated with small number statistics would become important for bins that are too small. In practice, we tried different bin sizes and eventually used a size of 0.0167 × 0.0167 × 0.0167 dex3 for each bin. With this binning scheme, we have ∼3000 bins with more than 180 data points within each bin. Meanwhile, the ±2σ range of log(fHα/fHβ/2.86) is typically 0 . 20 0.02 + 0.04 dex $ 0.20_{-0.02}^{+0.04}\, \mathrm{dex} $ in each bin. Before fitting the reddening relations, we removed 3D bins with fewer than 180 data points. We also tried other cuts that select bins with more than 100 data points and 300 data points. Since the MaNGA data are more densely populated at high metallicities in the 3D line ratio space, the cut on the number of data points in each bin influences the average metallicity of our sample. We discuss the effect of different binning scheme and bin selection criteria in detail in Sect. 7.2.

Prior to binning data in 3D, we applied an attenuation correction to line ratios that form the 3D space, assuming a Fitzpatrick (1999) extinction curve with RV = 3.1 (F99 hereafter), which is an initial guess as to the shape of the attenuation curve. This step should in principle have little effect on the derived attenuation. However, as we show in Sect. 4, our results seem to indicate different attenuation curves for different species of lines. If this is true, both the correction we applied and the assumption of the attenuation-insensitive lines become questionable, which we further discuss in Sects. 5 and 7. There is another effect associated with our choice of the line ratios that form the 3D space. Results obtained using forbidden lines that are both involved in the 3D space could be correlated by construction. This effect is discussed in Sects. 4 and 7.

The method we used to perform the linear regression for Eq. (6) is a maximum likelihood method taking into account uncertainties in both variables as well as the intrinsic scatter in the dependent variable. The natural logarithm of the likelihood function for a given data point, i, is given by

ln F likelihood , i = 0.5 ( y i m x i b ) 2 / ( m 2 σ x i 2 + σ y i 2 + σ 0 2 ) = 0.5 χ i 2 , $$ \begin{aligned}&\ln F_{\mathrm{likelihood},i} = -0.5({ y}_i-mx_i-b)^2/(m^2\sigma _{x_i}^2 + \sigma _{{ y}_i}^2 + \sigma _0^2)\nonumber \\&\qquad \qquad \quad = -0.5 \chi ^2_i, \end{aligned} $$(12)

where m and b are the slope and intercept, and σxi, σyi, and σ0 are the measurement uncertainties in xi, yi, and the intrinsic scatter in y (which is the same for all data points during the fit), respectively5. We did not consider any intrinsic scatter in x, as the Balmer ratios are relatively insensitive to temperature and density variations (Osterbrock & Ferland 2006). Still, we discuss the effect of including it in Sect. 7.1. We estimated σxi and σyi by propagating the measurement uncertainty of individual emission line. Given the test made by Belfiore et al. (2019) on emission line measurements in MaNGA, we multiplied the uncertainties reported by DAP by a factor of 1.25. We note that this adjustment does not have any significant impact on our results. We started by adopting an initial guess for the intrinsic scatter σ0 = 0, and minimized the total χ2 using the MINIMIZE function in SCIPY to obtain the slope and intercept with the Nelder & Mead (1965) method. We then checked whether the following condition is satisfied:

i = 1 N χ i 2 / ( N 2 ) 1 , $$ \begin{aligned} \sum _{i=1}^{N} \chi ^2_i/(N-2) \le 1, \end{aligned} $$(13)

where N is the number of data points. If not, we changed the intrinsic scatter by an increment of 0.0001 and recomputed the slope and intercept. We repeated the above procedure until Eq. (13) is satisfied. The method is essentially identical to the one adopted by Tremaine et al. (2002). We note that there are other ways to estimate the slope and intercept when both heteroscedastic uncertainties and intrinsic scatter are present (see e.g., Kelly 2007). We have verified the robustness of this method and present the tests in Appendix A.

4. Results

In this section we compute the relative nebular attenuation “seen” by emission lines from different species of ions and atoms. We investigate three categories of lines according to their production mechanisms as well as the ionization energies (IE) of their corresponding ions: 1) Balmer lines, including Hα, Hβ, Hγ, and Hδ; 2) high ionization lines (IE > 13.6 eV), including [Ne III]λλ3869, 3967, [O III]λ5007, and [S III]λλ9069, 9531; 3) low ionization lines (IE ≲ 13.6 eV), including [O II]λλ3726, 3729, [N II]λ6583, [S II]λλ6716, 6731, and [O I]λ6300. We summarized our results in Table 1. For the rest of the paper, we use mline1, line2 to represent the slope of the reddening relation, log(fline1/fline2) versus log(fHα/fHβ), and use m line 1 , line 2 $ m^{\prime}_{\rm line _1, line _2} $ to represent the measured median value of the slope.

Table 1.

Median slopes of the reddening relations [log(Emission line ratio) versus log(Hα/Hβ/2.86)] derived for a sample of 3D bins.

4.1. Attenuation seen by Balmer lines

We calculated two slopes related to Balmer lines: 1) the slope of log(Hα/Hγ/2.86×0.469) vs. log(Hα/Hβ/2.86) relation (or mHα, Hγ); 2) the slope of log(Hα/Hδ/2.86×0.259) vs. log(Hα/Hβ/2.86) relation (or mHα, Hδ). Here we have assumed the intrinsic Balmer ratios to follow the Case B approximation values at ne ∼ 100 cm−3 and T ∼ 104 K (Osterbrock & Ferland 2006). We divided the line ratios by their intrinsic values so that the intercepts, log(line ratioobs./line ratiotheor.)|Hα/Hβ = 2.86, ideally should be zero. We note that there is a small covariance between the independent and dependent variables due to the common term of Hα, which is taken into account by introducing an extra term of −2mCov(x, y) in the likelihood function (see Appendix A for more details). We also required that the S/N of all emission lines used to be greater than 3. We calculated these slopes in two different ways. The first way is the traditional method (hereafter TM), which uses the entire sample for the calculation. The second way is our new method, where we calculated slopes in 3D bins and used the median as the representative value. For Balmer lines, these two methods should give us identical results, which serves as a sanity check for our method.

The left panel of Fig. 2 shows a fitting example in one of the 3D bins, where a clear linear relation between log(Hα/Hβ) and log(Hα/Hγ) is present. The best-fit linear model shows a small uncertainty of 0.046 on the slope, and is in good agreement with the prediction from an F99 extinction curve with RV = 3.1. Figure 3 shows the distributions of the slopes, intercepts, and intrinsic scatters of the two reddening relations. For an F99 extinction curve with RV = 3.1, the expected slopes are m H α , H γ F 99 = 1.395 $ m^{\mathrm{F99}}_{\mathrm{H\alpha ,H\gamma}} = 1.395 $ and m H α , H δ F 99 = 1.563 $ m^{\mathrm{F99}}_{\mathrm{H\alpha ,H\delta}} = 1.563 $. When RV = 4.05, these values become m H α , H γ F 99 = 1.384 $ m^{\mathrm{F99}}_{\mathrm{H\alpha ,H\gamma}} = 1.384 $ and m H α , H δ F 99 = 1.542 $ m^{\mathrm{F99}}_{\mathrm{H\alpha ,H\delta}} = 1.542 $. In comparison, the slopes obtained by TM are mHα, Hγ = 1.3400 ± 0.0005 and mHα, Hδ = 1.5683 ± 0.0009, where the uncertainties are calculated with a Markov chain Monte Carlo (MCMC) method using the EMCEE package in PYTHON (Foreman-Mackey et al. 2013). If we change the S/N threshold for Hγ and Hδ from 3 to 10, the resulting slopes become mHα, Hγ = 1.3293 ± 0.0005 and mHα, Hδ = 1.503 ± 0.001. While the median slopes given by our new method are m H α , H γ = 1.343 ± 0.002 $ m^\prime_{\mathrm{H\alpha ,H\gamma}} = 1.343{\pm} 0.002 $ and m H α , H δ = 1.545 ± 0.003 $ m^\prime_{\mathrm{H\alpha ,H\delta}} = 1.545{\pm} 0.003 $, where the uncertainties correspond to the 68% confidence intervals derived by using the biweight estimator of Beers et al. (1990). As a sanity check, we also ran an MCMC analysis and confirmed the uncertainties given by the MCMC were in good agreement with those returned by the biweight estimator. The 1σ widths (or the standard deviations) of the slope distributions are 0.08 and 0.16, respectively.

thumbnail Fig. 2.

Fitting examples in one of the 3D bins. The dashed red lines are the best-fit lines given by the maximum-likelihood method. The orange shaded regions indicate the 1σ uncertainties of the linear models. The dash-dotted blue lines are obtained by fixing the slopes using the values from an F99 extinction curve with RV = 3.1 during the fit. The error bars indicate the measurement uncertainties in different logarithmic line ratios. Left: reddening relation between Hα/Hβ and Hα/Hγ. Middle: reddening relation between Hα/Hβ and [S III]/[O III]. Right: reddening relation between Hα/Hβ and [N II]/[O II].

thumbnail Fig. 3.

Relative attenuation probed by Balmer lines. The first row shows the results obtained by fitting a linear function to the log(Hα/Hγ/2.86×0.469) vs. log(Hα/Hβ/2.86) relation within each 3D bin we defined. From left to right, we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-green lines indicate the values obtained by the traditional method where the whole sample is used for a single fit. The dashed-blue lines correspond to the values given by the F99 extinction curve. The second row shows similar information, but is obtained by fitting a linear function to the log(Hα/Hδ/2.86×0.259) versus log(Hα/Hβ/2.86) relation.

The results given by TM and our new method are close to each other. Our new method inevitably produces larger uncertainties. On the one hand, the intrinsic scatter in line ratios can have a larger influence on the derived slopes for bins with relatively small number of data points. On the other hand, there could be genuine variation in the shape of the attenuation curve depending on the nebular parameters or host galaxy properties, which are not included in the calculation of the uncertainties (see Sect. 7.5). The slopes derived from TM cannot be described by any single F99 extinction curve with a fixed RV. Given the dependence of these slopes on the S/N cut, it is possible that the sample has intrinsic variations in RV that depends on galaxy properties (see Salim & Narayanan 2020). Regardless, the corrections based on the F99 extinction curve with RV = 3.1 would only introduce a bias less than 2% for Hα/Hγ and Hα/Hδ at AV = 1 mag. A similar conclusion can be drawn for the median slopes derived with the new method. These results confirm that the attenuation probed by Balmer lines in galaxies can be approximately described by an average MW extinction curve (e.g., Wild et al. 2011b; Reddy et al. 2020; Rezaee et al. 2021).

The intercepts obtained by two methods also show good consistency. It is noteworthy that the intercepts we derived are slightly larger than 0. This could be due to the fact that the average intrinsic Balmer ratios are different from the values we assumed. Regardless, this offset does not affect the slopes we derived. We also notice that the median intrinsic scatter in y-axis [i.e., log(Hα/Hγ/2.86×0.469) or log(Hα/Hδ/2.86×0.259)] in each bin is small compared to the change in y. This ensures that the variation in y within each 3D bin is indeed mostly driven by the variation in x, or log(Hα/Hβ/2.86).

4.2. Attenuation seen by high ionization lines

Using our 3D binning method, we calculated slope of the log([S III]/[O III]) versus log(Hα/Hβ) relation as well as the slope of the log([O III]/[Ne III]) versus log(Hα/Hβ) relation. Since DAP ties the attenuation-uncorrected fluxes of the [S III] doublet, making f[SIII]λ9531/f[SIII]λ9069 ≈ 2.439, the derived slope distributions for these two lines are identical. Considering that the attenuation at NIR is relatively weak and is less dependent on wavelength, tying the observed fluxes of these two lines should not have a significant effect on our measured nebular attenuation. For [Ne III] doublet, DAP also sets a fixed ratio for the attenuation-uncorrected fluxes, making f[NeIII]λ3869/f[NeIII]λ3967 ≈ 3.33. Although the wavelengths of [Ne III] lines are much shorter compared to [S III], the wavelength separation of the [Ne III] doublet is much smaller. The F99 extinction curve predicts that ( m [ OIII ] , [ NeIII ] λ 3869 F 99 m [ OIII ] , [ NeIII ] λ 3967 F 99 ) / m [ OIII ] , [ NeIII ] F 99 9 % $ (m^{\mathrm{F99}}_{\mathrm{[OIII],[NeIII]}\lambda 3869} - m^{\mathrm{F99}}_{\mathrm{[OIII],[NeIII]}\lambda 3967})/m^{\mathrm{F99}}_{\mathrm{[OIII],[NeIII]}}\approx 9\% $. This is much smaller than the width of the slope distribution we derived below. Since the measurement uncertainties of [Ne III] are relatively large, an S/N cut of 3 already removes a large number of data points. To have enough 3D bins, we lowered the minimum number of spaxels required for each bin from 180 to 100. Even with this lowered cut, we only had 336 qualified 3D bins to derive the median reddening relation for log([O III]/[Ne III]).

In the middle panel of Fig. 2, we plotted the fitting result for the log([S III]/[O III]) versus log(Hα/Hβ) relation in one of the 3D bins. The overall scatter along the y axis is larger compared to the case of Balmer lines, and a large intrinsic scatter of 0.06 is returned by the fitting function, indicating there is still remaining variations in the intrinsic ratio of these forbidden lines. Regardless, the best-fit linear model matches the prediction from the F99 curve well in this case. Figure 4 shows the distribution of the slopes we derived. For the [S III] doublet, we show the result of [S III]λ9531. For the [Ne III] doublet, we show the result of [Ne III]λ3869. The slope distributions derived from the other [S III] and [Ne III] lines are identical to these. We found that the 68% confidence intervals for the central locations of the two slopes distributions are m [ SIII ] , [ OIII ] = 1.660 ± 0.005 $ m^\prime_{\mathrm{[SIII],[OIII]}} = 1.660{\pm} 0.005 $ and m [ OIII ] , [ NeIII ] = 0.68 ± 0.06 $ m^\prime_{\mathrm{[OIII],[NeIII]}} = 0.68{\pm} 0.06 $. Meanwhile, the standard deviations of the two slope distributions are 0.27 and 1.04, respectively. Although we applied an S/N cut to [Ne III] lines, the resulting slope distribution is still very wide. Part of the reason could be due to our inclusion of the 3D bins having fewer data points. In addition, the measurements of [Ne III] fluxes could be influenced by the nearby absorption features in observed spectra. Despite the large uncertainties, our derived slopes still lie close to the values given by the F99 extinction curve with RV = 3.1, which are m [ SIII ] , [ OIII ] F 99 = 1.695 $ m^{\mathrm{F99}}_{\mathrm{[SIII],[OIII]}} = 1.695 $ and m [ OIII ] , [ NeIII ] F 99 = 0.81 $ m^{\mathrm{F99}}_{\mathrm{[OIII],[NeIII]}} = 0.81 $, after averaging the values for the two lines in each doublet. At AV = 1 mag, using the F99 corrections would cause a bias of 1.3% for [S III]/[O III] and a bias of 5% for [O III]/[Ne III]. Therefore, the attenuation for both Balmer lines and high ionization lines in our sample galaxies can be approximately described by the F99 extinction curve. With the inclusion of the [S III] doublet, we were able to constrain the nebular attenuation curve redward of Hα.

thumbnail Fig. 4.

Relative attenuation probed by high ionization lines. The first row shows the results obtained by fitting a linear function to the log([S III]/[O III]) vs. log(Hα/Hβ/2.86) relation within each 3D bin we defined. From left to right: we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-blue lines correspond to the values given by the F99 extinction curve. The second row shows similar information, but is obtained by fitting a linear function to the log([Ne III]/[O III]) versus log(Hα/Hβ/2.86) relation.

From Fig. 4, one can see that the intrinsic scatter found in the reddening relations for high ionization lines is much larger compared to those for Balmer lines, which contributes to the wider slope distributions. For log([O III]/[Ne III]), the median intrinsic scatter is 0.118 dex. The standard deviation of log(Hα/Hβ) in each 3D bin is typically 0.05 dex. Combining this value with m[OIII],[NeIII] ≈ 0.8, one expects that the attenuation-driven variation in log([O III]/[Ne III]) in each 3D bin is smaller than the intrinsic scatter. This further explains why the estimation of m[OIII],[NeIII] is more uncertain.

4.3. Attenuation seen by low ionization lines

We examined the attenuation seen by low ionization line ratios in this subsection. Specifically, we used line ratios involving [O II] to constrain the attenuation curve blueward of Hδ.

The right panel of Fig. 2 shows a reddening relation between log([N II]/[O II]) and log(Hα/Hβ) in one of the 3D bins. In this case, unlike the linear models for Balmer lines and high ionization lines, the best-fit model shows a clear offset from the line predicted by the F99 curve. We plotted the distributions of the linear parameters of all 3D bins in Fig. 5. Our method yielded m [ NII ] , [ OII ] = 1.525 ± 0.002 $ m^\prime_{\mathrm{[NII],[OII]}} = 1.525{\pm} 0.002 $, and the standard deviation of the slope distribution, σstd, is 0.13. In comparison, the F99 extinction curve predicts that m [ NII ] , [ OII ] F 99 = 1.844 $ m^{\mathrm{F99}}_{\mathrm{[NII],[OII]}} = 1.844 $. Our derived slope is smaller than the corresponding F99 value by roughly 2.4σstd or 126σ′, where σ′ is the uncertainty of the median. This difference is large enough to drive a systematic bias of 13% in the attenuation-corrected [N II]/[O II] at AV = 1 mag, if one uses the F99 extinction curve. Furthermore, the derived slope is significantly smaller than m H α , H δ $ m^\prime_{\mathrm{H\alpha ,H\delta}} $, which cannot be explained by any single extinction curve. Since λ[NII] > λHα and λ[OII] < λHδ, for any given extinction curve that predicts the extinction to monotonically increase with decreasing wavelength in the optical, one expects AHδ − AHα < A[OII] − A[NII] and thus mHα, Hδ < m[NII],[OII], contrary to the observed relation. Therefore, our result implies that the F99 extinction curve correctly describes the attenuation probed by Balmer lines and high ionization lines, but overpredicts the attenuation probed by low ionization lines. If we interpret this result as [N II] and [O II] both having a different magnitude of attenuation compared to that of Balmer lines, that is, AV, Low ≠ AV, Balmer, then from the definition of the reddening relation, we have

A V , Low m [ NII ] , [ OII ] m [ NII ] , [ OII ] F 99 A V , Balmer 0.83 A V , Balmer . $$ \begin{aligned} A_{V,\mathrm{Low}} \approx \frac{m^{\prime }_{\rm [NII],[OII]}}{m^\mathrm{F99}_{\rm [NII],[OII]}}A_{V,\mathrm{Balmer}}\approx 0.83A_{V,\mathrm{Balmer}}. \end{aligned} $$(14)

thumbnail Fig. 5.

Relative attenuation probed by low ionization lines. We fit a linear function to the log([N II]/[O II]) versus log(Hα/Hβ/2.86) relation within each 3D bin. From left to right, we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-blue lines correspond to the values given by the F99 extinction curve.

However, as we discuss in Sect. 7.3, simply using different AV cannot explain our results and one should not use this derived “apparent AV” to do corrections. Another complicating factor is that different low ionization lines do not necessarily have the same AV either (see Sects. 4.4 and 5.2).

Another way to see this discrepancy is to inspect log([N II]/[O II]) corrected by the F99 extinction curve as a function of log(Hα/Hβ) in each 3D bin. Figure 6 shows such an example. We removed the fitted intercept for the log([N II]/[O II]) versus log(Hα/Hβ/2.86) relation in each 3D bin (which effectively removed the intrinsic value of log([N II]/[O II]) at zero attenuation), and plotted all the intercept-removed data in this figure. Clearly, if [N II]/[O II] is corrected by the F99 extinction law before fitting, the overall slope becomes negative. This implies that the F99 extinction law overcorrects [N II]/[O II] in our sample. Meanwhile, the distribution of m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ is relatively tight and the intrinsic scatter is small, indicating that the intrinsic scatter plays a minor role in affecting our measurement of the slope. In Fig. 6, we only show data with high S/N in Hβ. This is because the measurement uncertainty in log(Hα/Hβ/2.86) can make the attenuation-corrected trend to appear steeper than it should be as the measured value of log(Hα/Hβ/2.86) is used to derive the correction. This effect is negligible when the S/N of Hβ is high. However, we note that if there is intrinsic scatter along the x axis, a similar effect can occur even at high S/N. We discuss the effect associated with the potential intrinsic scatter in detail in Sect. 7.1.

thumbnail Fig. 6.

Reddening relation between log([N II]/[O II]) and log(Hα/Hβ/2.86). Only data with S/N > 30 in Hβ are included. We obtained the intrinsic log([N II]/[O II]) value at log(Hα/Hβ/2.86) = 0 and subtracted it from the observed log([N II]/[O II]) for each 3D bin. The black contours show the distribution of the data without any attenuation correction. Whereas the blue contours show the density distribution of the data with their log([N II]/[O II]) corrected by the F99 extinction curve. The contour levels trace the number density of the data points and are equally spaced on a log scale. The outermost contour and the innermost contour enclose 90% and 10% of the data, respectively. The dashed red line and the dashed green line trace the median trends in the 3D bins for the two data sets. The dash-dotted line is a horizontal line for reference.

We made a similar measurement using another combination of low ionization lines, [S II] and [O II], and found that m [ SII ] , [ OII ] $ m^\prime_{\mathrm{[SII] ,[OII]}} $ is also significantly smaller than the prediction of the F99 curve, as shown in Table 1. One might wonder if this discrepancy is due to some hidden bias induced by our 3D binning method. Indeed, [N II] and [S II] are among the emission lines we used to construct the 3D space. As a sanity check, we computed mHγ, [OII] using our 3D binning method. Both Hγ and [O II] are not involved in the construction of the 3D bins. We found that m H γ , [ OII ] = 0.161 ± 0.003 $ m^\prime_{\mathrm{H}\gamma ,[\mathrm{OII}]} = 0.161{\pm} 0.003 $ (whose σstd is 0.14), which is much smaller than m H γ , [ OII ] F 99 = 0.439 $ m^{\mathrm{F99}}_{\mathrm{H}\gamma ,[\mathrm{OII}]} = 0.439 $. This again indicates that Balmer lines and low ionization lines (in this case the [O II] line) are not subject to the same attenuation law.

4.4. Line ratios involving different species of ions

In previous subsections, we found evidence suggesting different attenuation laws for different species of lines. To further investigate this issue, we explored slopes for reddening relations involving combinations of Balmer lines, high ionization lines, and low ionization lines (which we call hybrid line ratios hereafter).

We first checked [S III]/Hα, which is a hybrid line ratio combining a high ionization line and a Balmer line. Judging from our previous measurements, one might expect that the derived slope for this line ratio should be consistent with the prediction of the F99 curve. However, we found that m [ SIII ] , H α = 0.600 ± 0.006 $ m^\prime_{\mathrm{[SIII] ,H\alpha }} = 0.600{\pm} 0.006 $ (the σstd is 0.28), which deviates from the F99 value, m [ SIII ] , H α F 99 = 0.811 $ m^{\mathrm{F99}}_{\mathrm{[SIII] ,H\alpha }} = 0.811 $, by 0.75σstd or 38σ′. The difference indicates a fractional bias of 8% for the attenuation-corrected [S III]/Hα at AV = 1 mag, if one uses the F99 extinction curve for corrections. In fact, for almost all of the hybrid line ratios we checked in Table 1, the slopes of their reddening relations appear considerably lower than the corresponding F99 values.

Another noticeable effect is related to the additivity of the slope measurements. Ideally, one expects if line ratios A, B, and C satisfy log(A) = log(B)+log(C), then m A = m B + m C $ m^\prime_{A}=m^\prime_{B}+m^\prime_{C} $. All line ratios should follow the law of additivity if they are subject to the same attenuation curve. From Table 1, we see that some slopes do show additivity. For example, m [ SIII ] , H β = m [ SIII ] , H α + m H α , H β = m [ SIII ] , H α + 1 $ m^\prime_{\mathrm{[SIII] ,H\beta }} = m^\prime_{\mathrm{[SIII] ,H\alpha }} + m^\prime_{\mathrm{H\alpha ,H\beta }} = m^\prime_{\mathrm{[SIII] ,H\alpha }} + 1 $, m [ SIII ] , [ OII ] m [ SIII ] , [ OIII ] + m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[SIII] ,[OII] }} \approx m^\prime_{\mathrm{[SIII] ,[OIII] }}+m^\prime_{\mathrm{[OIII] ,[OII] }} $, etc. However, there are also a few cases where the additivity seems violated. For instance, m [ SIII ] , [ OIII ] > m [ SIII ] , H α + m H α , H β + m H β , [ OIII ] m [ SIII ] , H α + 0.89 $ m^\prime_{\mathrm{[SIII] ,[OIII] }} > m^\prime_{\mathrm{[SIII] ,H\alpha }} + m^\prime_{\mathrm{H\alpha , H\beta}} + m^\prime_{\mathrm{H\beta , [OIII]}} \approx m^\prime_{\mathrm{[SIII] ,H\alpha }} + 0.89 $, where the difference is roughly 0.16, equivalent to 1.5σstd or 87σ′ (σstd and σ′ are calculated from the distribution of m[SIII],[OIII] − m[SIII],Hα − mHα, Hβ). Also, m [ NII ] , [ OII ] > m [ NII ] , [ OIII ] + m [ OIII ] , [ OII ] m [ OIII ] , [ OII ] + 0.89 $ m^\prime_{\mathrm{[NII] ,[OII] }} > m^\prime_{\mathrm{[NII] ,[OIII] }} + m^\prime_{\mathrm{[OIII] , [OII]}} \approx m^\prime_{\mathrm{[OIII] ,[OII] }} + 0.89 $, where the difference is roughly 0.15 (1.3σstd or 80σ′). Interestingly, the additivity problem seems to be related to the slopes that are apparently lower than the corresponding F99 values.

One related question is whether the additivity problem comes from the bias during the fitting process associated with incorrectly estimated measurement uncertainties. In Fig. 7, we plot distributions of m[SIII],[OIII] − m[SIII],Hα − mHα, Hβ using samples selected based on different S/N cuts. It is clear from this figure that the additivity for these line ratios is asymptotically recovered at high S/N. In particular, we found m [ SIII ] , H α $ \rm m^\prime_{\mathrm{[SIII] ,H\alpha }} $ increases noticeably with increasing S/N, while m [ SIII ] , [ OIII ] $ \rm m^\prime_{\mathrm{[SIII] ,[OIII] }} $ and m H α , [ OIII ] $ \rm m^\prime_{\mathrm{H\alpha ,[OIII]}} $ remains roughly the same. If the input measurement uncertainties are biased, the best-fit parameters returned by our likelihood function would also be biased (see Appendix A), which could lead to violation of the additivity if the bias is different for different lines. On the other hand, the bias becomes negligible when the S/N is very high. However, this explanation seems inapplicable to the case of m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $. Even after we increased the S/N cut to 20 for [N II], [O II], [O III], and Hβ, m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ is still greater than m [ NII ] , [ OIII ] + m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OIII]}} + m^\prime_{\mathrm{[OIII],[OII]}} $ by 0.12 (2.1σstd or roughly 60σ′). Also, the bias does not originate from the data lost from the S/N cut, as we performed tests with mock data and found this effect was negligible even when the S/N limit was set to 3. In Sect. 7.5 we show the additivity bias is actually more related to the surface brightness of the lines.

thumbnail Fig. 7.

Distributions of the slope differences under different S/N cuts. The black solid histogram shows data with S/N of [S III], [O III], and Hβ greater than 3; the dashed green histogram shows data with S/N of the aforementioned lines greater than 10; the dotted red histogram shows data with S/N of the aforementioned lines greater than 15. The slope difference peaks toward smaller values with the increasing S/N limit. However, the difference is actually due to the dependence of the slope on the surface brightness of the lines.

The slopes of the hybrid line ratios could also be affected by our construction of the 3D bins. Since we restricted the variation of [N II]/Hα, [S II]/Hα, and [O III]/Hβ in each bin, the derived slopes for certain combinations of lines are tied together. This tying should not introduce any bias if all line ratios follow the same attenuation curve, but would become problematic if different lines are attenuated by different curves. As an example, since log([S III]/Hα) = log([S III]/[N II]) + log([N II]/Hα), and log([N II]/Hα) varies little in each bin, we have forced m [ SIII ] , H α m [ SIII ] , [ NII ] $ m^\prime_{\mathrm{[SIII] ,H\alpha }} \approx m^\prime_{\mathrm{[SIII] ,[NII] }} $. Therefore, the measurement of m[SIII],Hα is affected by m[SIII],[NII]. Even if [S III] indeed shares the same attenuation law with Balmer lines, the involvement of the low ionization line [N II] could contribute to the deviation from the F99 value. In other words, if the observed [N II] and Hα no longer share the same attenuation law, binning data using log([N II]/Hα) can modify the measured slope for certain hybrid line ratios. In addition, our assumption of negligible variations of nebular parameters might become invalid in such cases. Mathematically, our construction of the likelihood function becomes not precise, as the single parameter σ0 might no longer be sufficient to describe the intrinsic variation of line ratios.

Indeed, from Table 1, one can see that if a reddening relation directly or indirectly involves a low ionization line, the resulting median slope tends to be significantly lower than the expected value from the F99 curve. Interestingly, even the low ionization lines might not share the same attenuation law. One example is provided by [S III]/[O I], whose median slope is slightly smaller than that of [S III]/[N II]. If both [O I] and [N II] are subject to the same attenuation curve, we should have m [ SIII ] , [ OI ] > m [ SIII ] , [ NII ] $ m^\prime_{\mathrm{[SIII] ,[OI] }} > m^\prime_{\mathrm{[SIII] ,[NII] }} $. Since [O I] is an even lower ionization line compared to [N II], this discrepancy indicates a further flattening of the attenuation curve at lower ionization. There is, however, a special case where the hybrid line ratio, [N II]/Hγ, gives the slope close to the F99 value. Following the same argument we use in the preceding paragraphs, we can understand this case as follows. Even if [N II]/Hα is biased and is not attenuation-free, Hα/Hγ would still follow the unbiased attenuation curve since the slope measurements for Balmer lines do not rely on 3D binning. Given that log([N II]/Hγ) = log(Hα/Hγ) + log([N II]/Hα), and the fact that we have forced m [ NII ] , H α 0 $ m^\prime_{\mathrm{[NII],H\alpha}}\approx 0 $ in each 3D bin, it is no surprise m [ NII ] , H γ m H α , H γ m H α , H γ F 99 $ m^\prime_{\mathrm{[NII],H\gamma}}\approx m^\prime_{\mathrm{H\alpha,H\gamma}}\approx m^{F99}_{\mathrm{H\alpha,H\gamma}} $.

Finally, another explanation could be that both high ionization lines and Balmer lines follow the F99 curve, but their magnitudes of attenuation differ systematically. Thus, when measuring the reddening of their combined line ratios, the result deviates from the F99 curve.

In summary, most of the hybrid line ratios we check show slopes deviating from the F99 values and some of them violate additivity as well. There could be biases associated with measurement uncertainties, which, however, cannot explain all hybrid line ratios. Since there is already some evidence that different species of lines might follow different attenuation curves, our constructions of the 3D bins could also bias the slope measurements for certain hybrid lines. To better understand these effects, we investigate some physical models that create different attenuation laws for different lines in the next section.

5. Physical model

In this section we present some physical models to explain how different observed emission lines can have different attenuation laws. In addition, we compare the model predictions with the slopes measured in our sample.

5.1. Bias from low-resolution observations

When dealing with extragalactic observations, one usually has very limited knowledge about the geometry of the dust distribution inside the corresponding galaxy. Still, indirect evidence concerning the dust distribution can be derived by comparing the attenuation of different sources. The attenuation probed by nebular emissions from SF regions is usually larger than that probed by starlight (e.g., Fanelli et al. 1988; Calzetti et al. 1994; Kashino et al. 2013; Price et al. 2014; Pannella et al. 2015; Li et al. 2021), which is attributed to the fact that the neutral gas and dust are more concentrated and clump around individual H II region (Charlot & Fall 2000; Wild et al. 2011b; Chevallard et al. 2013). Therefore, one usually assumes that emission from H II regions is “extincted” by the foreground dust rather than “attenuated” due to a complex mix of sources and dust, which is, however, not necessarily valid for unresolved “H II regions” (Pellegrini et al. 2020a).

Within each H II region, different emission lines originate from different optical depths due to their different ionization potentials. The Balmer lines are produced by recombination throughout the ionized shell. In comparison, the high ionization lines and low ionization lines are mainly produced in the inner part and outer part of the H II region, respectively. Since dust grains also exist inside the ionized shell, one might wonder whether the internal differential attenuation within the H II region can result in different attenuation laws for different lines. However, as shown by Bottorff et al. (1998), even for a highly ionized H II region with log U ∼ −2.0 and an Orion-type dust composition, the maximum optical depth at λ ∼ 0.1 μm is still smaller than 1. For the emission lines we are concerned about, the corresponding optical depths are much smaller and the dust attenuation effect should be negligible within most H II regions.

The above scenario is applicable to radiation-bounded H II regions with ionized shells well confined by the background molecular clouds. In observations, however, people found that Lyman continuum photons (Lyc) can leak from dense clouds and create emission line regions in the surrounding lower density regions (e.g., Reynolds 1990; McKee & Williams 1997; Ferguson et al. 1996; Zurita et al. 2000; Haffner et al. 2009; Howard et al. 2018), which can extend over a few kiloparsecs (see e.g., Zurita et al. 2002; Seon 2009; Belfiore et al. 2022). These low-density regions are usually referred to as the diffuse ionized gas (DIG). The ratios between low ionization lines and Balmer lines are generally higher in DIG, and emission lines produced in DIG are also less attenuated compared to H II regions.

Apart from the leaked Lyc, there is another important contributor to the emissions in DIG. As shown by Zhang et al. (2017), the leaked Lyc from H II regions cannot explain the high [O III]/Hβ observed in DIG for massive galaxies. Thus, one needs extra ionizing sources apart from H II regions to produce the observed [O III] fluxes. As suggested by Flores-Fajardo et al. (2011), the hot low-mass evolved stars (HOLMES) are likely to play an important role in ionizing DIG. Specifically, HOLMES create high [O III]/Hβ due to their harder spectra compared to H II regions. It is important to note that the enhancement of [O III]/Hβ in DIG depends on the stellar mass (or metallicity) of the galaxy, which has been confirmed on both kiloparsec and subkiloparsec scales (Zhang et al. 2017; Belfiore et al. 2022). For low-mass galaxies, [O III]/Hβ barely changes from H II regions to DIG. Whilst for high-mass galaxies, [O III]/Hβ has a conspicuous increase toward DIG. This is because low-metallicity H II regions (which are typically found in low-mass galaxies) have much higher [O III]/Hβ ratios compared to high-metallicity H II regions (as can be seen in the BPT diagrams), while DIG in general covers a relatively narrow range in [O III]/Hβ.

The above evidence suggests that both low and high ionization lines could experience different amount of attenuation compared to Balmer lines. Since the MaNGA data we use have a typical resolution of 1–2 kpc, it is possible that a considerable amount of diffuse emission has been blended with the emission from the nearby H II regions. Due to the contribution of less-attenuated DIG, we expect the measured slopes for line ratios involving low ionization lines and high ionization lines to be different from the true slopes. This appears to be consistent with our interpretation in the previous section. In addition, this effect is likely metallicity-dependent.

Figure 8 shows how the measured slopes for Hα/Hγ, [S III]/[O III], [N II]/[O II], and [O III]/[O II] correlate with the median gas-phase metallicity as well as the median Hα luminosity surface density (corrected using the F99 curve) within each bin. We derived the metallicities following the Bayesian method of Ji & Yan (2022). In short, we first computed the likelihood of the observed [N II]/Hα, [S II]/Hα, and [O III]/Hβ given the photoionization model of Ji & Yan (2020). Combining the likelihood with a flat prior in the logarithmic space, we then obtained the posterior distribution of the metallicity for each datum. Finally, the metallicity of each datum was calculated as the weighted mean using the posterior distribution. We see that mHα,Hγ has no dependency on the metallicity. While m[NII],[OII] appears to have a sharp decline at very high metallicities, there seems to be no obvious trend at lower metallicities6. In comparison, the slopes for line ratios involving [O III] have a noticeable change with the metallicity. For high metallicity galaxies, [O III] tends to have a larger contribution from the DIG, and we see it appears less attenuated as reflected by decreasing m[SIII],[OIII] and increasing m[OIII],[OII]. However, it is worth noting that m[SIII],[OIII] and m[OIII],[OII] deviate from the F99 values at low metallicities, where the bias brought by the DIG is supposed to be lower compared to high metallicities. The contribution of DIG to [S III] and [O II] could be the complicating factor in this explanation, which we further explore in the next subsection. In addition, Fig. 8 also shows that the metallicity strongly correlates with the median Hα surface brightness of individual 3D bins. Since the Hα surface brightness is related the level of the DIG contamination (Oey et al. 2007; Zhang et al. 2017), the trends we see in Fig. 8 could also be caused by the overall change in the DIG contamination. We further discuss the effect of selecting the sample based on the Hα surface brightness in Sect. 7.5.

thumbnail Fig. 8.

Derived slopes for different line ratios as a function of the gas-phase metallicity. The y axes correspond to slopes of the log(fline1/fline2) versus log(//2.86) relations. The x axes correspond to metallicities derived by using Bayesian inference with a fiducial photoionization model. Each data point corresponds to a 3D bin with number of spaxels greater than 180. The 3D bins are color-coded according to the median Hα surface brightness (which is corrected by the Balmer decrement assuming an F99 extinction curve) of their spaxels. Dashed black lines and dotted black lines correspond to the median slopes and the values predicted by an F99 extinction curve, respectively.

Besides the potential DIG contamination, there is another effect that could be important due to the limited spatial resolution. For a single H II region or an association of spatially close H II regions, it is possible that not all of its emission is attenuated by the same amount. One can imagine a dust screen covers part but not all of the H II region. For such a “partially covered H II region”, the effective integrated Balmer decrement depends on the relative contributions from the covered and uncovered parts to the observed emission line spectrum. If the covered fraction varies in different H II regions, there would be an effective variation in the observed Balmer decrement, which can also give rise to line-specific effective attenuation or differential attenuation.

In summary, observations that are unable to resolve individual H II regions could lead to differential attenuation inconsistent with a unified curve either due to the distribution of dust or due to the contamination from the DIG, even though there is an underlying extinction curve at work. In the next subsection, we try setting up a few toy models to reproduce the attenuation we measured for different emission lines in the MaNGA data.

5.2. Attenuation model

We started by assuming a simple cloud model, which consists of two separate components that could have different optical depths and emission line fluxes. The first component corresponds to high attenuation, which we denote as CHigh; the second component corresponds to low attenuation, which we denote as CLow7. Figure 9 shows a cartoon illustration of the two-component model. Attenuated emission lines from both CHigh and CLow are mixed in observations. In this model, CHigh is a single dusty H II region or a part of an H II region that is more obscured, and CLow is the DIG surrounding the H II region or a part of the H II region that is less obscured. By definition, AV, High > AV, Low. We then defined the fractional contribution to the total intrinsic Hα flux from CHigh to be η ≡ fHα, 0(High)/fHα, 0(total) = fHα, 0(High)/[fHα, 0(High) + fHα, 0(Low)] (0 ≤ η ≤ 1).

thumbnail Fig. 9.

Cartoon illustration of a two-component attenuation model. The model consists of a more attenuated component with high AV and a less attenuated component with low AV. The emission lines from both components are mixed in observations. While the high AV component is (part of) a dusty H II region, the low AV component could be a part of the H II region that is less dusty, the DIG around the H II region, or a combination of both.

The first case we considered is where both CHigh and CLow have the same intrinsic line ratios, which corresponds to a partially covered H II region. In addition, we fixed AV, High and AV, Low for simplicity. The only thing we varied was η, the fractional flux contributed by CHigh. As a result, the observed attenuation or Balmer decrement would follow the change in η. The observed Hα flux is

f H α , obs = f H α , 0 ( total ) [ η 10 0.4 A V , High l H α + ( 1 η ) 10 0.4 A V , Low l H α ] . $$ \begin{aligned} f_{\rm H\alpha , obs} = f_{\rm H\alpha , 0 (total)}[\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm H\alpha }} + (1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm H\alpha }}]. \end{aligned} $$(15)

Since fHβ, 0 = fHα, 0/2.86, we have

( f H α / f H β / 2.86 ) obs = η 10 0.4 A V , High l H α + ( 1 η ) 10 0.4 A V , Low l H α η 10 0.4 A V , High l H β + ( 1 η ) 10 0.4 A V , Low l H β . $$ \begin{aligned} (f_{\mathrm{H}\alpha } /f_{\mathrm{H}\beta }/2.86)_{\rm obs}=\frac{\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm H\alpha }} + (1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm H\alpha }}}{\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm H\beta }} + (1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm H\beta }}}. \end{aligned} $$(16)

We note that both lHα and lHβ are given by an underlying true attenuation curve, which we assume to be the F99 curve here. It is clear from the above equation that the effective V-band attenuation derived from the observed Balmer decrement, AV, eff, changes from AV, Low to AV, High as η changes from 0 to 1. In general, the observed flux ratio of two lines with wavelengths λ1 and λ2 is given by

( f λ 1 / f λ 2 ) obs = ( f λ 1 f λ 2 ) 0 η 10 0.4 A V , High l λ 1 + ( 1 η ) 10 0.4 A V , Low l λ 1 η 10 0.4 A V , High l λ 2 + ( 1 η ) 10 0.4 A V , Low l λ 2 , $$ \begin{aligned} (f_{\lambda _1} /f_{\lambda _2})_{\rm obs}=(\frac{f_{\lambda _1}}{f_{\lambda _2}})_0\frac{\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm \lambda _1}} + (1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm \lambda _1}}}{\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm \lambda _2}} + (1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm \lambda _2}}}, \end{aligned} $$(17)

where we have used the assumption that the intrinsic line ratio is the same for CHigh and CLow.

What we measured in observations is how log(fλ1/fλ2)obs changes with log(fHα/fHβ/2.86)obs, which is supposed to give us l λ 1 l λ 2 l H α l H β $ \frac{l_{\lambda _1}-l_{\lambda _2}}{l_{\mathrm{H\alpha}}-l_{\mathrm{H\beta}}} $. However, if the change in AV, eff is primarily driven by the change in the covering fraction, or η, we expect a modified attenuation curve to be observed. The top panel of Fig. 10 shows such an example. Instead of specifying AV, High and AV, Low, we set log(fHα/fHβ/2.86)obs to 0.4 and 0 for CHigh and CLow, respectively. We see that the final m[NII],[OII] and m[OIII],[OII] are lower than the expected values given by the F99 curve and are closer to the median value we measured in MaNGA. The values of the Balmer decrement for CHigh and CLow are the only parameters that determine the final slope in this model. Surprisingly, this oversimplified model can already produce some changes in the measured attenuation curve. However, the weakness of this model is apparent. The result depends on the wavelengths of the lines involved rather than the species of ions that produce them. As a consequence, the slopes for Balmer lines such as mHα,Hγ and mHα,Hδ become lower than their true values, as shown in the top leftmost panel of Fig. 10, contrary to the observed results. On the other hand, for the high ionization line ratio m[SIII],[OIII], this model yields a slope that is higher than the true value, different from the observations.

thumbnail Fig. 10.

Effective attenuation laws generated by different models. Row a) shows several logarithmic line ratio versus logarithmic Balmer derement relations predicted by a partially covered cloud model. For comparison, we also plotted the relation predicted by the F99 extinction curve and the median relation we found in MaNGA. Row b) shows the relations predicted by a two-component model with different intrinsic line ratios. We made the H II component to have overall lower intrinsic line ratios (relative to Hα) compared to the DIG component. Row c) shows a similar model as Row b), but the underlying true attenuation curve is replaced with the CCM89 extinction curve. All y axes have arbitrary normalization.

To solve the discrepancy associated with m[SIII],[OIII], we added new parameters to the model by setting different intrinsic line ratios for CHigh and CLow. This model simulates the situation where two kinds of ionized regions, H II regions (CHigh) and DIG (CLow) are mixed in observations. If we denote rλ, High and rλ, Low as the intrinsic values of (fλ/fHα) for CHigh and CLow, Eq. (17) becomes

( f λ 1 / f λ 2 ) obs = r λ 1 , High η 10 0.4 A V , High l λ 1 + r λ 1 , Low ( 1 η ) 10 0.4 A V , Low l λ 1 r λ 2 , High η 10 0.4 A V , High l λ 2 + r λ 2 , Low ( 1 η ) 10 0.4 A V , Low l λ 2 . $$ \begin{aligned} (f_{\lambda _1} /f_{\lambda _2})_{\rm obs}=\frac{r_{\lambda _1, \mathrm{High}}\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm \lambda _1}} + r_{\lambda _1, \mathrm{Low}}(1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm \lambda _1}}}{r_{\lambda _2, \mathrm{High}}\eta 10^{-0.4 A_{V, \mathrm{High}}l_{\rm \lambda _2}} + r_{\lambda _2, \mathrm{Low}}(1-\eta )10^{-0.4 A_{V, \mathrm{Low}}l_{\rm \lambda _2}}}. \end{aligned} $$(18)

The intrinsic flux ratio rλ, High(Low) should depend on the species of the ion as well as nebular parameters such as the metallicity, but for now we chose a set of values that could roughly reproduce the slopes we measured in MaNGA, which is listed in Table 2. For each line, we set a value for log(rλ, High/rλ, Low). From Table 2, we see that in order to fit the observed slopes, we need to make CHigh (H II component) have overall lower intrinsic forbidden line ratios compared to CLow (DIG component), which is qualitatively consistent with observations (e.g., Zhang et al. 2017; Belfiore et al. 2022)8.

Table 2.

Logarithmic difference between the intrinsic line strengths of CHigh and CLow in a DIG contamination model.

There is, however, still a discrepancy for mHα,Hγ. One might wonder whether we can achieve better agreement with the observation by changing the underlying attenuation curve. Indeed, the bottom panels of Fig. 10 shows that if we instead assume a Cardelli et al. (1989) extinction curve with RV = 3.1 (CCM89) as the true underlying extinction curve, mHα, Hγ gets slightly closer to the observed values. Adjusting the Balmer decrement values for CHigh and CLow can further improve the fit. In addition, we changed log(r[SIII],High/r[SIII],Low) from −0.20 to −0.30 in order to fit m[SIII],[OIII] in this case. To determine the proper physical models and model parameters, we need evidence from high-resolution observations. One example using the MaNGA observation of the galaxy IC 342 is discussed in Appendix C.

The above models assume that the two components, CHigh and CLow have fixed magnitudes of attenuation so that the attenuation differences among H II regions are solely driven by the change in the weights of the two components. In reality, one expects the magnitude of the attenuation of the two (if not more) components also change. We can try going to another extreme by assuming that CHigh and CLow have fixed weights in their contributions to Hα and the attenuation differences among different MaNGA spaxels are solely driven by the change in the magnitudes of the attenuation for CHigh and CLow. If, for example, we keep AV, Low constant but vary AV, High, one expects AV, eff to first increase with increasing AV, High but then start to decrease at some point due to the reduction of the observed fluxes from CHigh. This makes the Balmer decrement not a monotonic function of AV, High. Besides the Balmer decrement, other line ratios would also reach an extreme value and then turn around. Figure 11 shows such a toy model for the attenuation of [N II]/[O II]. The turn-around point depends on the value of η. When η = 0.5, the turn around happens when the observed Balmer decrement changes by ∼0.05 dex from the value under AV, Low; for η = 0.75, the turn around happens when the observed Balmer decrement changes by ∼0.10 dex from the value under AV, Low. We did not observe this behavior clearly in our sample, indicating that this is not a dominant effect but could contribute to the scatter in the attenuation.

thumbnail Fig. 11.

Two-component attenuation model for [N II]/[O II], where only the attenuation of the more attenuated component is varied. Different sets of points correspond to models with different intrinsic flux contributions from the two components as well as different AV, Low. The arrow shows the direction of increasing AV, High in the model.

On the other hand, if we vary AV, High and AV, Low by the same amount, there is no change in the attenuation curve. This is because changing both AVs by the same amount effectively changes the column density of the common foreground dust screen.

The observations likely include both changes in the intrinsic flux ratios between CHigh and CLow and variations in the magnitudes of AV, High and AV, Low. Within our 3D bins, the latter effect might be present as scatters rather than dominating the trend for different reddening relations. To check whether these toy models are applicable to realistic data, we examined the observational evidence in the next subsection.

5.3. Comparison with observations

In Sect. 5.1, we showed how the slopes of different reddening relations change with the gas-phase metallicity in Fig. 8. Among the slope measurements we showed, m[SIII],[OIII] and m[OIII],[OII] exhibit the most clear dependence on the metallicity, which we suspected to be related to the DIG contamination on the observed [O III]. Previous studies show the [O III]/Hβ difference between the H II region and the DIG increases with increasing galaxy masses (Zhang et al. 2017; Belfiore et al. 2022), and thus increasing metallicity. The key question is whether the two-component model can explain the trends for m[SIII],[OIII] and m[OIII],[OII] given the observational evidence on the change of the [O III]/Hβ difference.

Intuitively, when the observed [O III]/Hβ has a larger contribution from the DIG, one might expect [O III] becomes less attenuated compared to other lines as the DIG contains less dust compared to H II regions. As a consequence, m[SIII],[OIII] ≡ (A[SIII] − A[OIII])/(AHα − AHβ) would decrease and m[OIII],[OII] ≡ (A[OIII] − A[OII])/(AHα − AHβ) would increase, assuming the intrinsic line ratios, rλ, High/rλ, Low, of other lines remain unchanged. Whereas as we have pointed out, this does not explain why m[SIII],[OIII] and m[OIII],[OII] deviate more from the expected F99 values at lower metallicities where the DIG has more similar intrinsic line ratios as H II regions.

Using our two-component model, we find the opposite of the observed trends for m[SIII],[OIII] and m[OIII],[OII]. When we fixed rλ, High/rλ, Low for other line ratios while increasing rλ, High/rλ, Low for [O III] (which effectively reduces the difference in [O III]/Hβ between H II regions and the DIG), m[SIII],[OIII] became smaller and m[OIII],[OII] became larger. This is because in our two-component model, the observed change in the Balmer decrement is driven by the change of the fractional contributions from the H II component and the DIG component. As the Balmer decrement becomes larger, the H II component gradually dominates and the observed line ratios approach the intrinsic line ratios of the H II component. Therefore, once the relative strength of [O III] increases in the H II component (i.e., less DIG contamination), our model actually predicts a boost in the increase of total [O III] when the observed Balmer decrement increases. As a result, if only the change in [O III]/Hβ is considered, the two-component model would fail to describe the metallicity dependence of m[SIII],[OIII] and m[OIII],[OII].

However, a complicating factor here is that [O III] is not the only line whose relative strength in H II regions and DIG changes with the metallicity. In fact, the value of [O II]/Hβ in the DIG is also likely a function of the stellar mass and metallicity (Zhang et al. 2017). When there are other line ratios changing together with [O III]/Hβ, the final slope for the reddening relation would depend on the both the intrinsic shape of the true extinction curve and the relative change in different line ratios. In principle we can tune the model predictions to follow the observed trends of m[SIII],[OIII] and m[OIII],[OII], but without further evidence from resolved observations we opt for not adding extra parameters related to different line ratios so as to avoid overfitting. Furthermore, in MaNGA SF spaxels, the metallicity is found to correlate with the Hα surface brightness, which is negatively correlated with the DIG contamination (Oey et al. 2007; Zhang et al. 2017). Therefore, the overall level of the DIG contamination could also change with the metallicity of the MaNGA SF spaxels, which is not captured by our simple toy model.

In summary, in order to explain the observed dependence of the reddening relations on the metallicity, our model requires the relative strengths of various lines to change in the DIG along with [O III]/Hβ. This extra assumption needs verification from resolved observations of H II regions with a range of host galaxy masses and metallicities and is beyond the scope of the current paper.

A more direct test of the scale-dependent attenuation model is to measure the attenuation of different emission lines using IFU data with high spatial resolutions. In Appendix C, we apply our method to the high-resolution IFU data within a single galaxy, IC 342. While the effect of multicomponent attenuation appears to vanish in this case, the uncertainties of the derived slopes are large due to the small number of spaxels. It would be interesting to determine a physical scale, around which the attenuations of different emission lines start to have good agreements. We plan to further investigate this problem using a larger set of high-resolution IFU data in future work.

Thus far, we have discussed the physical picture for the multicomponent attenuation, but we have not discussed its impact on observational studies. In the next section, we detail the biases from improper attenuation corrections.

6. Impact on nebular diagnositcs

There are many widely used strong-line metallicity calibrations and ionization parameter calibrations relying on attenuation-sensitive line ratios. For example, the calibration of N/O in H II regions relies on the attenuation corrected [N II]/[O II] (Thurston et al. 1996), the R23 metallicity estimator involves the attenuation corrected [O II]/Hβ (Pagel et al. 1979), the ionization parameter calibration usually involves the attenuation corrected [O III]/[O II] or [S III]/[S II] (Diaz et al. 1991; Kewley & Dopita 2002), and so forth.

If the attenuation correction is biased, the derived nebular parameters based on the line ratios above would be biased. Our results have shown that using a single attenuation curve for correcting emission lines from different transitions is inaccurate. However, how to quantify this impact is nontrivial.

If we ignored the underlying dust model and thought different lines were simply described by different attenuation curves, the fractional bias in nebular diagnostics from using a single attenuation curve was in general around 10%–20% depending on the calibration method and the overall attenuation. For example, our results show the F99 extinction curve with RV = 3.1 fits the attenuation of Balmer line ratios well but does not fits the attenuation of [N II]/[O II]. From Table 1, one can see using the F99 curve would underestimate the intrinsic [N II]/[O II], creating an average systematic bias of 0.05 dex in N/O derived from [N II]/[O II] using Thurston et al. (1996)’s prescription at AV = 1, where AV is derived from the Balmer decrement. As another example, using the F99 curve to correct [S III]/[S II] would underestimate the ionization parameter by 0.08 dex at AV = 1 (Diaz et al. 1991). Compared to the systematic difference between different strong-line calibrations (Kewley et al. 2019), the amount of bias estimated this way is not very significant.

However, as we have shown in Sect. 5, the potential existence of unresolved multicomponent attenuation cannot be simply described by different attenuation curves. In our two-component dust model, one expect the attenuation correction bias is small when the observed fluxes are dominated by CHigh or CLow, but becomes larger when CHigh and CLow have similar contributions to the observed fluxes. Using the same model as we constructed in Sect. 6, where the intrinsic log(Hα/Hβ/2.86) is set to 0.4 for CHigh and 0 for CLow, we can calculate how the line ratios corrected based on the apparent log(Hα/Hβ/2.86) differ from the intrinsic line ratios. For a partial coverage dust model, the bias due to the F99 correction in [N II]/[O II] is 0.06 dex when the apparent AV = 1. For a H II + DIG model assuming the parameters in Table 2, the bias becomes 0.25 dex at AV = 1, which would significantly underestimate the intrinsic N/O by 44%. Meanwhile, there is also a bias in the corrected Hα flux, which is roughly 0.23 dex at AV = 1 in both cases, which would bias the estimated SFR by 0.16 dex or 0.23 dex depending on whether the less attenuated component is from DIG or H II regions. We note that the bias estimation strongly relies on the dust model and could vary in galaxies. For example, the larger the attenuation difference between CHigh and CLow or the larger the line ratio difference between CHigh and CLow, the larger the resulting bias. Regardless, estimating the bias using dust models are more physically motivated than simply using different attenuation curves.

We thereby conclude that the impact from using a single attenuation curve depends on the dust model and the physical conditions in galaxies. In our two-component model, the impact would be negligible if a partial coverage scenario is assumed, but would be significant if the differently attenuated components have different intrinsic line ratios. Again, to constrain the dust model, we need high-resolution data in future work.

To proceed further from the physical picture we proposed, it is important to verify the assumptions we made in our method. In the next section, we discuss the validity of our method in detail.

7. Discussion

In this section we examine three important assumptions we made in this work. First, we discuss the form of the likelihood function and check the effect of the intrinsic scatter. Second, we discuss the 3D binning scheme and check its impact on the results. Third, we check the sample selection effect and its implications.

7.1. Impact of the intrinsic scatter in the Balmer decrement

Our likelihood function, Eq. (12), considers measurement errors on both axes, but assumes the intrinsic scatter is only in y, the dependent variable. When there is truly no intrinsic scatter in x, our function can well reproduce the true slope, intercept, and the intrinsic scatter in y, as is shown in Appendix A.

We also performed tests where intrinsic scatter was manually added to x. In such cases, ignoring the intrinsic scatter in x during the fits results in an underestimation of the true slope. Also, it is noteworthy that not a great amount of the intrinsic scatter is needed to result in considerable change in the derived slope. For example, a Gaussian intrinsic scatter with σ ≈ 0.022 in the Balmer decrement is enough to make one underestimate the m[NII],[OII] by ∼0.3, changing it from the F99 value to the median value we measured in MaNGA.

Despite its important effect, there is much evidence against the presence of the intrinsic scatter in x, the Balmer decrement. The most likely origin of the intrinsic scatter in x is the variation in the intrinsic Hα/Hβ due to the temperature and density variation (Osterbrock & Ferland 2006). However, by our construction of the 3D bins, variations in the metallicity and ionization parameter are negligibly small within each bin, which makes the temperature variations to be small as well. In fact, according to our CLOUDY model, even when we consider a wide range of metallicity and ionization parameter, with 7.39 ≤ 12 + log(O/H) ≤ 9.19 and −4.0 ≤ log(U)≤ − 1.5, the maximum variation in Hα/Hβ is only 0.022 dex from minimum to maximum. The expected variation in each bin is much smaller and cannot explain the discrepancy between the F99 values and our measurements. Furthermore, we performed another test where we added an extra dimension using the density-sensitive log([S II]λ6716/[S II]λ6731) and constructed 4D bins, within which the density variations are well constrained. The results based on the 4D bins are nearly identical to the ones based on the 3D bins, meaning that density variations do not introduce any noticeable intrinsic scatter.

Even if we assume the intrinsic scatter in the Balmer decrement were somehow responsible for the discrepancy between the observed slopes and F99, we would have to adopt different values of intrinsic scatter for different slope measurements. For example, while making m[NII],[OII] agree with F99 requires an intrinsic scatter of 0.022 dex for the Balmer decrement, m[OIII],[OII] needs a significantly different intrinsic scatter of 0.034 dex. In addition, slopes of Balmer line ratios and high ionization line ratios are already in good agreement with F99 and only allow an intrinsic scatter < 0.01 dex. These results are not self-consistent, as the intrinsic scatter in x should only depend on the data distribution along the x axis, which is the same for all the slope measurements using the same set of 3D bins.

However, according to Fig. 3, our likelihood function predicts nonzero intrinsic scatters for Hα/Hγ and Hα/Hδ, which seems to contradict our assumption. We note that the origin of this intrinsic scatter might come from the imperfect estimations on the measurement errors. As shown in Fig. 12, the derived intrinsic scatters for these Balmer ratios are negatively correlated with the S/N of Balmer lines. Hα/Hδ also shows larger intrinsic scatter compared to Hα/Hγ, which could come from the overall poorer detection on Hδ. Specifically, the errors resulted from the stellar continuum subtraction could play a role, which is supposed to be anticorrelated with the S/N. It is possible that the likelihood function attempts to compensate the effects from the total errors with the intrinsic scatter. If it is true, one expects Hα/Hβ to show even smaller intrinsic scatter (< 0.01), as Hβ is has overall higher S/N than Hγ9.

thumbnail Fig. 12.

Intrinsic scatters in y for the Balmer reddening relations as functions of the S/N of the weaker Balmer lines. Each point represents a 3D bin, and its median S/N is plotted. Left: how the derived intrinsic scatter in log(Hα/Hγ) varies with the S/N of Hγ. Right: how the derived intrinsic scatter in log(Hα/Hδ) varies with the S/N of Hδ.

We used a series of tests to further check if there could be any intrinsic scatter in Hα/Hβ. First, we found that when there was significant intrinsic scatter in x that was ignored, the measured intrinsic scatter for y would be overestimated, and the measured slope would be underestimated. Second, we measured a new reddening relation between log([N II]/[O II]) and log(Hα/Hγ). The F99 value for this relation is 1.321, whereas the value based on our previous measurements, m [ NII ] , [ OII ] / m H α , H γ $ m^\prime_{\mathrm{[NII],[OII]}}/m^\prime_{\mathrm{H\alpha,H\gamma}} $, is 1.134. When we ignored the intrinsic scatter in Hα/Hγ, we obtained a slope of 1.08, which is indeed an underestimation. If we included the intrinsic scatter in Hα/Hγ in the likelihood function, we obtained a slope of 1.145, which is more consistent with the value based on our previous measurements. In addition, the derived intrinsic scatter for log([N II]/[O II]) is 0.033, in good agreement with our previous measurement, which is 0.032. If Hα/Hβ has significant intrinsic scatter, one expects the intrinsic scatters in both log([N II]/[O II]) and log(Hα/Hγ) were overestimated previously. As a result, using the overestimated intrinsic scatter in Hα/Hγ, one should overestimate the slope of the log([N II]/[O II]) versus log(Hα/Hγ) relation. In other words, if the intrinsic scatter in Hα/Hβ is truly responsible for making m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ deviate from the F99 value, our derived slope for the log([N II]/[O II]) versus log(Hα/Hγ) relation should actually be larger than the F99 value. From our tests, we see that the intrinsic scatter in Hα/Hβ, even if present, should be negligibly small.

Finally, we would like to comment on the general approach to linear regression problems with measurement uncertainties and intrinsic scatters. The method we adopted from Tremaine et al. (2002) cannot simultaneously estimate intrinsic scatters in both axes. Only when one knows the intrinsic scatter in one axis, can one use this method to obtain an unbiased estimation of the intrinsic scatter in the other axis. The general approach by Kelly (2007) also does not deal with the case where both axes have intrinsic scatters. However, in principle one can further generalize Kelly (2007)’s approach. Another question is whether a Gaussian component is enough to describe the intrinsic scatter, especially when it is related to measurement errors. We intend to further explore the possibility of a general solution in future work.

7.2. Impact of the binning method on the derived slopes

Most of our slope measurements are based on cubic bins in the log([N II]/Hα)-log([S II]/Hα)-log([O III]/Hβ) space with number of spaxels greater than 180. We performed a series of tests to see how the size and the number cut for the 3D bins influence our results.

On the one hand, one expects the larger the bin size, the greater the intrinsic scatter within each bin, which increases the uncertainty of the measured slope. To test this effect, we made the volume of each bin eight times as big (with a bin size of 0.034 × 0.034 × 0.034 dex3), but keep the cut on the number of spaxels. While we found m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ remains roughly unchanged, the σstd of the slope distribution increases from 0.13 to 0.17. For reddening relations that have shallower slopes and thus are more susceptible to the intrinsic scatter, one might expect the median slopes are also affected by the bin size. For example, m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ has a significant change from 0.46 to 0.30 after we adopted larger bins. However, the change in the median metallicity could play an important role in the change as well. Since the high metallicity MaNGA spaxels are more densely populated in the line-ratio space, using larger bins while keeping the cut on the number of spaxels allows more low metallicity bins to be considered while reducing the total number of high metallicity bins. This tends to lower the median slope for [O III]/[O II] (see e.g., Fig. 8).

Our conclusions also remain valid for measurements based on smaller bins. We performed another test by making the volume of each bin 5.7 times as small (with a bin size of 0.0084 × 0.0084 × 0.0116 dex3). In this case no bin has more than 180 spaxels. By lowering the number cut to 55, we obtained 2783 bins, whose m [ NII ] , [ OII ] = 1.514 ± 0.004 $ m^\prime_{\mathrm{[NII],[OII]}} = 1.514{\pm}0.004 $ and m [ OIII ] , [ OII ] = 0.508 ± 0.004 $ m^\prime_{\mathrm{[OIII],[OII]}} = 0.508{\pm}0.004 $ are again close to our results from the original binning. Since each small-size bin contains fewer spaxels, the statistical uncertainty of the linear fitting becomes larger in each bin. One can see there are different uncertainties for large bins and small bins. Regardless, we have shown that our conclusions remain valid over a wide range in bin size.

On the other hand, changing the cut on the number of spaxels also has an effect on the median and the width of the slope distribution. We expect that increasing (decreasing) the lower limit of the cut reduces (enlarges) the width of the slope distribution while biasing the median slope to the high (low) metallicity value. To check this, we kept the size of each bin unchanged and performed two additional sets of fitting with bins having more than 300 data points and 100 data points, which include 527 bins and 7287 bins, respectively. We obtained m [ NII ] , [ OII ] = 1.51 $ m^\prime_{\mathrm{[NII],[OII]}} = 1.51 $ and σstd;[NII],[OII] = 0.10 for bins having more than 300 data points, and m [ NII ] , [ OII ] = 1.51 $ m^\prime_{\mathrm{[NII],[OII]}} = 1.51 $ and σstd;[NII],[OII] = 0.16 for bins having more than 100 data points. Whereas m [ OIII ] , [ OII ] = 0.53 $ m^\prime_{\mathrm{[OIII],[OII]}} = 0.53 $ and σstd;[OIII],[OII] = 0.10 for bins having more than 300 data points, and m [ OIII ] , [ OII ] = 0.40 $ m^\prime_{\mathrm{[OIII],[OII]}} = 0.40 $ and σstd;[OIII],[OII] = 0.23 for bins having more than 100 data points. These results are consistent with our expectations. Specifically, m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ is increasingly reflecting the low metallicities regions as we lowered the number cut.

In brief, both the size of the 3D bins and the spaxel number cut could affect the width and the median values of the slope distribution, with the latter effect being mainly a reflection of the metallicity dependence of certain slopes. Nevertheless, our main conclusion about the inconsistent attenuation laws between different species of lines still holds, nearly irrespective of the binning method.

7.3. Bias from an attenuation dependent line-ratio space

The next important effect associated with our binning method is the correlation between different slopes. As we have seen in Sect. 4, since we constructed bins by constraining the variation in [N II]/Hα, [S II]/Hα, and [O III]/Hβ, slopes of some line ratios are tied together. This would not be a problem if all emission lines are seeing the same attenuation curve. However, if there are different attenuation laws for different lines due to, for example, contamination from the DIG, our assumption breaks down since [N II]/Hα, [S II]/Hα, and [O III]/Hβ no longer form an attenuation-insensitive space. This could bias the slope measurements. In addition, there are problems associated with the additivity for certain line ratios, especially the hybrid line ratios, which could also be related to the attenuation dependence of the 3D space.

To understand this effect, we performed a simple test to recompute m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ with a semi-empirical data set. To generate the data set, we first extracted the observed Hα and Hβ fluxes from the spaxels that were identified as SF regions in MaNGA and set them as the attenuated fluxes with no uncertainties for the simulated data. We then treated the Hα and Hβ fluxes corrected by the F99 extinction curve as the true unattenuated fluxes. The simulated “observed” fluxes were computed by adding Gaussian errors to the attenuated fluxes using the uncertainty estimations of DAP.

Next, we set the observed [N II]/Hα, [S II]/Hα, and [O III]/Hβ ratios as the intrinsic ratios for the simulated data set. Combining these with the intrinsic fluxes of Balmer lines, we obtained the intrinsic fluxes of [N II], [S II], and [O III]. However, when computing the attenuated fluxes for these lines, we used different attenuation curves for different lines. Specifically, we made l [ NII ] = 0.827 l [ NII ] F 99 $ l_{\mathrm{[NII]}}=0.827\ l^{\mathrm{F99}}_{\mathrm{[NII]}} $ and l [ SII ] = 0.827 l [ SII ] F 99 $ l_{\mathrm{[SII]}}=0.827\ l^{\mathrm{F99}}_{\mathrm{[SII]}} $, and l [ OIII ] = 0.948 l [ OIII ] F 99 $ l_{\mathrm{[OIII]}} = 0.948\ l^{\mathrm{F99}}_{\mathrm{[OIII]}} $. Here we assumed all low ionization lines share the same attenuation curve, while all high ionization lines share a different one. This assumption is oversimplified, but it can make the expected slopes including m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ to be close to our measured values in Table 1. Measurement errors were subsequently added to these lines.

The intrinsic fluxes for [O II], on the other hand, were set by a theoretical photoionization model. With the SF-ionized model computed by Ji & Yan (2020), we estimated the metallicity and ionization parameter for every SF spaxel using the assumed true values of [N II]/Hα, [S II]/Hα, and [O III]/Hβ. Then for each spaxel, we find the model prediction for [N II]/[O II] using the estimated metallicity and ionization parameter. Combining [N II]/[O II] with the intrinsic fluxes of [N II], we obtained the intrinsic fluxes for [O II]. The attenuated [O II] was then computed by setting l [ OII ] = 0.827 l [ OII ] F 99 $ l_{\mathrm{[OII]}}=0.827l^{\mathrm{F99}}_{\mathrm{[OII]}} $. After we obtained the simulated observed line ratios, we added small intrinsic scatters to these line ratios. Next, we corrected the manually attenuated [N II]/Hα, [S II]/Hα, and [O III]/Hβ using the F99 curve and created 3D bins. As the final step, we remeasured m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ based on the semi-empirical data set.

As expected, the resulting slopes deviate from the F99 values due to our modification of the attenuation law. Interestingly, they are not entirely consistent with the expected values from the modified curves either. We found m [ NII ] , [ OII ] 1.59 $ m^\prime_{\mathrm{[NII],[OII]}} \approx 1.59 $ and m [ OIII ] , [ OII ] 0.61 $ m^\prime_{\mathrm{[OIII],[OII]}} \approx 0.61 $, whereas the modified curves predict m [ NII ] , [ OII ] 1.53 $ m^\prime_{[\rm NII],[OII]} \approx 1.53 $ and m [ OIII ] , [ OII ] 0.45 $ m^\prime_{\mathrm{[OIII],[OII]}} \approx 0.45 $. In addition, the additivity of slopes is violated and m [ NII ] , [ OII ] m [ NII ] , [ OIII ] m [ OIII ] , [ OII ] 0.09 $ m^\prime_{\mathrm{[NII],[OII]}}-m^\prime_{\mathrm{[NII],[OIII]}}-m^\prime_{\mathrm{[OIII],[OII]}}\approx 0.09 $. We note that m [ NII ] , [ OIII ] 0.89 m [ NII ] , [ OIII ] F 99 $ m^\prime_{\mathrm{[NII],[OIII]}}\approx0.89\approx m^{\mathrm{F99}}_{\mathrm{[NII],[OIII]}} $ even after we modified the attenuation law. In comparison, the expected modified value for m [ NII ] , [ OIII ] $ m^\prime_{\mathrm{[NII],[OIII]}} $ is 1.07. This indicates m [ NII ] , [ OIII ] $ m^\prime_{\mathrm{[NII],[OIII]}} $ is likely fixed by our construction of the 3D bins and the preapplied extinction correction based on the F99 curve, making it almost always follow the F99 extinction curve.

When generating the semi-empirical data, we noticed another effect associated with the measurement errors. If we made the propagated measurement errors for the logarithmic line ratios Gaussians rather than made the flux errors Gaussians, the additivity seemed to be retrieved, although the slopes were still different from the expected values. In such a case, m [ NII ] , [ OII ] 1.53 $ m^\prime_{\mathrm{[NII],[OII]}} \approx 1.53 $, m [ OIII ] , [ OII ] 0.68 $ m^\prime_{\mathrm{[OIII],[OII]}} \approx 0.68 $, and | m [ NII ] , [ OII ] m [ NII ] , [ OII ] m [ NII ] , [ OIII ] | < 0.01 $ |m^\prime_{\mathrm{[NII],[OII]}}-m^\prime_{\mathrm{[NII],[OII]}}-m^\prime_{\mathrm{[NII],[OIII]}}| < 0.01 $. This effect suggests the loss of the additivity is associated with the error distributions. If the flux errors are Gaussians and the propagated errors for logarithmic line ratios deviate from Gaussians significantly, the likelihood function seems to bias the results and violate the additivity. Once we manually set the measurement errors to be 1/2 of the current values, the additivity was satisfied to a good approximation even if we assumed Gaussian errors for line fluxes. However, as we have seen in Sect. 4.4, increasing the S/N does not improve the additivity for [N II]/[O II],[O III]/[O II], and [N II]/[O III] in the MaNGA sample. In addition, we did not find such a bias associated with the error distribution in another simulated data set in Appendix A even after we manually lowered the S/N of lines to be close to 3.

In fact, by checking the dependence of the derived slopes on various physical parameters, we find the surface brightness of Hα seems more relevant to the additivity issue compared to the S/N, which we discuss in Sect. 7.5. All the evidence implies that although our semi-empirical data set can provide hints on the effects of binning biases, it is not realistic enough to reproduce our measurements with the MaNGA main sample.

In summary, when there exists different attenuation laws for different lines, the final slopes derived from 3D bins are affected but could still be different from the expected values from the modified curves. Meanwhile, the assumption of the error distribution could lead to the loss of the additivity of slopes, but it cannot explain the results we found in our sample.

7.4. Reliability of the BPT diagrams

A fundamental assumption of our 3D binning method is that the line ratio variations in the 3D-BPT space are reflecting the intrinsic variations in the metallicity, ionization parameter, and other physical parameters (except the attenuation) in SF regions. This assumption is widely adopted since the theoretical photoionization models with varying nebular parameters can well reproduce the shapes of the SF loci in the BPT diagrams (e.g., Kewley et al. 2001, 2019; Dopita et al. 2013; D’Agostino et al. 2019). In addition, although photoionization models show significant degeneracy between the metallicity and ionization parameter in the 2D-BPT diagrams, the degeneracy disappears in the 3D-BPT space (Vogt et al. 2014; Ji & Yan 2020). However, there are still several caveats we would like to discuss.

First, the SF loci in the BPT diagrams could be affected by the DIG contamination. As shown by Sanders et al. (2020), the resolved H II regions in the CHemical Abundances Of Spirals survey (CHAOS, Berg et al. 2015; Croxall et al. 2015, 2016; Berg et al. 2020) show a systematic offset from the MaNGA SF spaxels in the [S II]-based BPT diagram. If the DIG contamination is strong enough to shift observed SF regions in the BPT diagrams, there would be important systematic uncertainties when people use these line ratios to constrain metallicities and ionization parameters. However, as noted by Mannucci et al. (2021), the aperture effect for H II regions become important for high-resolution studies. When a single H II region is not fully sampled in space, the line ratios would depend on the size of the aperture and the results cannot be directly compared with photoionization models that calculate the integrated emission line spectra for the whole H II region. After taking the aperture effect into account, Mannucci et al. (2021) argue the DIG contamination only has a secondary effect on the BPT diagnostics of MaNGA SF spaxels. Therefore, the DIG contamination is likely not strong enough to shift the whole SF locus in the line-ratio space. Whereas the DIG contamination on the observed attenuation within small regions of the line-ratio space cannot be ruled out.

Second, there could be degeneracy between the evolutionary stage and the metallicity of H II regions. By modeling the time evolution of an H II region, Pellegrini et al. (2020b) were able to cover a large area in the [N II]-based BPT diagrams with a photoionization model grid at fixed metallicity. If the observed H II regions really show a significant variation in their evolutionary stages, they would show scatter similar to the metallicity variation in BPT diagrams (see also Byler et al. 2017). Still, the level of potential degeneracy introduced by this effect is still unclear. In surveys like MaNGA, the unresolved H II regions are effectively light-weighted ionized regions. Since H II regions at different evolutionary phases are characterized by different brightnesses, it is likely that the observations are biased toward some average phases, which might alleviate this degeneracy.

Finally, this work also raises a question about whether the BPT space is truly attenuation-free at low spatial resolution. Although we started our calculation with the attenuation-free assumption, the results we obtained suggest differential attenuation that depends on the species of lines, undermining the initial assumption we made. Under different attenuation curves, variations in metallicities and ionization parameters are no longer well constrained in individual bins, which is reflected by the results we show in the previous subsection. Still the amount of bias associated with this process depends on the mechanism that results in different apparent attenuation laws, which could be much more complicated than the DIG contamination model we proposed.

To conclude, despite being powerful diagnostic tools, BPT diagrams are still not fully understood in terms of their applicability and their implication. A large spectroscopic sample of well-resolved H II regions and more sophisticated photoionization models are both needed to solve the aforementioned problems.

7.5. Dependence of the derived attenuation law on galaxy properties

As summarized by Salim & Narayanan (2020), observations have shown the shape of the attenuation curve depends on a number of galaxy properties including the metallicity, stellar mass, inclination, dust column density, etc. Thus far, we have seen the relative attenuation we derived shows dependencies on the metallicity and the S/N of the lines. Some of the dependencies might come from the true variation of the attenuation law, while others might be related to the observational bias induced by differently attenuated components. In this subsection we examine these potential dependencies by checking how the sample selections based on the Hα surface brightness (ΣHα), inclination (b/a), and AV influence our results. The Hα surface brightness is anticorrelated with the contamination from the DIG (Oey et al. 2007; Zhang et al. 2017), and it is also partly related to the S/N of lines and the metallicity of the galaxy. Therefore, the observational bias associated with the unresolved DIG could be reflected by the dependence of the attenuation on ΣHα. The inclination and AV are closely related to the column density of the dust and could reflect the intrinsic variations of the attenuation curve due to dust geometries and compositions.

Table 3 summarizes the resulting slopes for several line ratios with different sample selection criteria. We inspected two sets of line ratios: 1) [S III]/[O III], [S III]/Hα, and Hα/[O III], 2) [N II]/[O II], [N II]/[O III], [O III]/[O II]. Ideally, slopes for each set of line ratios should satisfy additivity. When doing the sample selection, we made sure the number of spaxels selected by each selection criterion is larger than a minimum number (roughly 60 ∼ 90). Meanwhile, we used the same set of 3D bins when making cut on the same physical parameter to control the metallicity effect on the resulting slopes, though the spaxels in each bin could be somewhat different. We chose not to divide the sample according to AV, of which the reason will be discussed later in this subsection.

Table 3.

Median slopes of reddening relations of different line ratios derived from samples selected by ΣHα and b/a.

For the sample selection based on ΣHα, we made two samples with ΣHα < 1039 erg s−1 kpc−2 and ΣHα > 2 × 1039 erg s−1 kpc−2, where ΣHα is corrected for extinction using the F99 extinction curve. One can see for the low ΣHα sample, m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $ is much larger than m [ SIII ] , [ OIII ] F 99 $ m^{\mathrm{F99}}_{\mathrm{[SIII],[OIII]}} $ and m [ SIII ] , [ OIII ] > m [ SIII ] , H α + m H α , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} > m^\prime_{\mathrm{[SIII],H\alpha}} + m^\prime_{\mathrm{H\alpha ,[OIII]}} $. On the other hand, m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ is slightly closer to m [ NII ] , [ OII ] F 99 $ m^{\mathrm{F99}}_{\mathrm{[NII],[OII]}} $ at low ΣHα, but has much larger uncertainty. Meanwhile, at low ΣHα, m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ even becomes negative. In fact, all measured slopes except mHα, [OIII] and m[NII],[OIII] show wide distributions at low ΣHα, which are reflected by the uncertainties of the medians. For m H α , [ OIII ] $ m^\prime_{\mathrm{H\alpha,[OIII]}} $ and m [ NII ] , [ OIII ] $ m^\prime_{\mathrm{[NII],[OIII]}} $, their measured values are fixed by our construction of the 3D bins (see Sect. 7.3).

At high ΣHα, both sets of line ratios follow the additivity better, although m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ deviates more from the F99 value. The behavior of the first set of line ratios involving [S III] and [O III] seems to support the scenario of the DIG contamination, which should be reduced in brighter regions dominated by H II regions. As ΣHα increases, both m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $ and m [ SIII ] , H α $ m^\prime_{\mathrm{[SIII],H\alpha}} $ become much closer to the expected F99 values. While for the second set of line ratios, despite the retrieval of its additivity at high ΣHα, m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ remain biased at high ΣHα, although m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ does get closer to the corresponding F99 value. One might wonder whether the slopes of Balmer lines also change with ΣHα. We found both m H α , H γ $ m^\prime_{\mathrm{H\alpha,H\gamma}} $ and m H α , H δ $ m^\prime_{\mathrm{H\alpha,H\delta}} $ to slightly decrease at high ΣHα, but m H α , H δ $ m^\prime_{\mathrm{H\alpha,H\delta}} $ is still greater than m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ by roughly 0.05, meaning they still cannot be fitted by the same attenuation curve.

In Sect. 4.4, we found the S/N cut was able to improve the additivity for the first set of line ratios, but failed for the second set of line ratios. Compared to the cut on S/N, the cut on ΣHα provides an overall better recovery of the additivity, but the physical picture behind is still unclear. In addition, the DIG contamination scenario is not able to explain the deviations of m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ from the F99 values. We also ran another test that divided the sample according to the equivalent width of Hα, which can also be used to remove the DIG (Cid Fernandes et al. 2011). However, we obtained similar results where m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ deviate from the F99 value. Still, geometric effects caused by partially covered H II regions could create a modified attenuation curve at low resolution without the DIG (see Fig. 9 and Fig. 10). Then the question becomes how the dust geometry would be different for different lines in bright H II regions. Again high-resolution data are the key to fully understanding this problem, which we will explore in future work.

For the sample selection based on b/a, we made two samples with b/a < 0.6 and b/a > 0.7, respectively. This time the differences between the slopes derived from the two samples are smaller. The high inclination sample (b/a < 0.6) shows higher m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $ and m [ SIII ] , H α $ m^\prime_{\mathrm{[SIII],H\alpha }} $, corresponding to a larger RV under the parameterization of Fitzpatrick (1999). If these differences are truly driven by the change of the attenuation curve rather than the observational bias, it means the more inclined regions with larger dust column density exhibits a grayer attenuation curve, which is qualitatively consistent with the conclusion of Salim & Narayanan (2020). However, m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII] }} $ also become larger at high inclination, which requires a lower RV instead. This suggests either the intrinsic variation in the shape of the attenuation curve due to the change of the inclination is overwhelmed by the observation bias, or the intrinsic variation cannot be completely characterized by the variation in RV.

Finally, we checked the dependence of our results on AV. According to Salim & Narayanan (2020), AV is a more fundamental parameter that correlates with the variation of the attenuation curve, which has been extensively investigated in works using galaxy spectral energy distribution (SED) fitting. Still, the observational determination of the AV dependence based on galaxy SED fitting might be subject to systematic biases from correlated uncertainties (Qin et al. 2022). We calculated AV using the Balmer decrement assuming the F99 extinction curve. However, we cannot directly construct subsamples with AV like we did with the previous two parameters. This is because AV is exactly the x axis in the reddening relation we defined. Making cuts on the observed independent variable with measurement errors can easily bias the linear regression results, which is similar to the “missing data problem” (see e.g., Gelman et al. 1995; Little & Rubin 2002; Kelly 2007). Indeed, we found in many cases that within a 3D bin, the slopes given by our maximum-likelihood method using the high AV spaxels and low AV spaxels are both larger than when using all spaxels. Therefore, instead of making subsamples, we checked how the derived slopes depend on the median AV within each bin. We found the slope versus AV relations are very similar to the slope versus metallicity relations. In fact, as shown by Fig. 13, the median AV shows a strong positive correlation with the metallicity, indicating more metal-rich clouds are also dustier on average. If the changes in the slopes are mainly driven by the change in RV due to increasing AV, one expects in Fig. 8, m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $ should increase with increasing metallicity (and AV) and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $ should decrease with increasing metallicity (and AV) as the attenuation curve become grayer. Given the opposite trends we obtained, the metallicity effect is more likely to be responsible for the changes of slopes. Due to the tight correlation between the median AV and metallicity, we cannot find any clear dependence of the slopes on the median AV at fixed metallicities.

thumbnail Fig. 13.

Correlation between the metallicity and the V-band attenuation AV for the 3D bins that contain more than 180 spaxels. The V-band attenuation is derived from the Balmer decrement. For each 3D bin, the median values of 12+log(O/H) and AV are plotted. The vertical and horizontal errorbars show the typical (median) standard deviations of AV and 12+log(O/H), respectively, in each 3D bin.

In summary, due to the observational bias possibly related to the contamination of the DIG, the relative attenuation of forbidden lines show complex dependencies on various galaxy properties. Specifically, increasing ΣHα seems to improve the additivity of the slopes of the reddening relations, but the deviation from the expected F99 curve remains for m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ and m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $. Changing b/a and AV also affects the derived slopes, but the results are likely dominated by the observational bias rather than the intrinsic variation in RV.

8. Summary and conclusions

In this work we study the nebular attenuation probed by different optical-to-NIR emission lines in MaNGA via a novel method. This method bins observed SF spaxels in a 3D line-ratio space spanned by [N II]/Hα, [S II]/Hα, and [O III]/Hβ. In theory, within a small region in this attenuation-insensitive space, variations in nebular parameters such as the metallicity and ionization parameter are well constrained, whereas the magnitude of nebular attenuation is free to vary. Therefore, variations in the attenuation-sensitive line ratios are dominated by nebular attenuation in each small 3D bin. By fitting linear models to reddening relations between logarithmic attenuation-sensitive line ratios and logarithmic Balmer decrements, we measured the nebular attenuation curve using different species of lines. We summarize our results as follows.

First, reddening relations of high ionization line ratios, including [S III]/[O III] and [O III]/[Ne III], shows median slopes consistent with the predictions of a Fitzpatrick (1999) extinction with RV = 3.1. The same attenuation curve can well describe the attenuation of Balmer lines as well. In contrast, line ratios involving low ionization lines appear to follow a grayer attenuation curve inconsistent with high ionization line ratios and Balmer line ratios. As a result, a single attenuation curve cannot simultaneously explain the reddening of high ionization lines, Balmer lines, and low ionization lines.

Second, we suspect the low spatial resolution of MaNGA causes this discrepancy and propose a toy model that can produce different observed attenuation laws for different species of lines. In the toy model, two distinct line-emitting components with different levels of attenuation and different intrinsic line ratios are mixed and their contributions to the integrated line fluxes vary in different observations. The DIG contamination could fit into this scenario and explain the variations of the attenuation curves as a result of the different intrinsic line ratios in the DIG and H II regions, but the model parameters are not well constrained and high-resolution data are needed for verification. We found evidence in the MaNGA ancillary observations of the nearby galaxy IC 342 that at a higher spatial resolution, all emission lines follow an attenuation curve more consistent with the Fitzpatrick (1999) one. However, the uncertainties of these measurements are too large and we need a larger volume of data to confirm these results.

Third, the impact of the multicomponent dust attenuation we discovered on nebular diagnostics has a nontrivial dependence on the dust model and the physical conditions in galaxies. If one uses a single attenuation curve determined by Balmer lines to correct observed forbidden line ratios, the corrected line ratios and nebular parameters inferred from these line ratios could be biased by 0.06–0.25 dex when the apparent AV = 1, depending on the assumptions in the attenuation model.

Finally, we checked the assumptions we made in our measurements including the intrinsic scatter, the binning method, and the sample selection. Although the intrinsic scatter in the Balmer decrement, if present, could change our slope measurements, our tests rule out any significant amount of intrinsic scatter in Hα/Hβ in our sample. We noticed the reddening relations of some line ratios are correlated due to our construction of the 3D bins. If our 3D space is no longer attenuation-free as indicated by the different attenuation laws, there would be biases associated with the measurements. Still, the main conclusion about the differential attenuation remains qualitatively unchanged, unless the 3D BPT diagrams we used are not reliable tools to reveal the overall variations in the metallicity and ionization parameter. In addition, we found strong dependence of the measured slopes on the Hα surface brightness. At high Hα surface brightness, the reddening of line ratios related to [S III] converges to the predictions of the Fitzpatrick (1999) extinction curve, whereas the reddening of line ratios involving [O II] remain different from the predictions of this curve. This result cannot be fully explained by the DIG contamination, implying the different dust geometries inside H II regions might also play a role.

Our results challenge the widely adopted assumption that different emission lines should follow the same attenuation law on all spatial scales, which is the basis for a large number of nebular diagnostics including the BPT diagnostics and metallicity calibrations. However, since the physical picture behind this phenomenon is still unclear, currently we do not have a reliable and well-tested way to make corrections to the observed emission lines, and we will try to address this issue in future work. The key to solving this issue is to perform similar analyses on large spectroscopic data sets with high spatial resolution and wide spectral coverage. Upcoming surveys such as AMASE (Yan et al. 2020) and LVM (Kollmeier et al. 2017) will provide valuable data for further investigations. Meanwhile, the PHANGS-MUSE (Emsellem et al. 2022) data, despite covering a narrower spectral range, could provide hints on the spatial distribution of dust. In addition, PHANGS-MUSE covers a few low ionization lines such as [O I]λ6300, which could be used to compare with the attenuation measurements at MaNGA’s resolution. We will compare the two data sets in a future paper.


1

Throughout this work, we use the word “attenuation” to indicate the potential presence of complex cloud geometries around the source as well as the scattering of light into and out of the line of sight. While the word “extinction” describes the simpler scenario where there is effectively a dust screen between the observer and the foreground source.

2

We note that limited by MaNGA’s spatial resolution, we cannot select individual H II regions. Therefore, each spaxel in our sample showing ionization compatible with star formation possibly includes multiple H II regions and some diffuse ionized regions.

3

However, we note that some observed line ratios, such as [O I]λ6300/Hα and [S III]/[S II], cannot be correctly reproduced by current photoionization models (see e.g., Mingozzi et al. 2020; Law et al. 2021b).

4

These include more than just the metallicity and ionization parameter, as the binning is done in 3D. Additionally, we tried using a 4D space by including the stellar mass, which correlates with the variation in the N/O versus O/H relations (Belfiore et al. 2017; Schaefer et al. 2022), and the results remain essentially the same. The drawback of higher-dimensional binning is having fewer data points in each bin and thus having larger statistical uncertainties.

5

The numerator of χ2 indicates the separation is calculated along the y axis in this linear model. However, we note that constructing the likelihood function from the perspective of orthogonal distance regression (ODR) using, for example, Eq. (32) of Hogg et al. (2010), leads to the same final expression for the χ2 term.

6

We note that the metallicity measurements at the highest metallicity might have slightly larger systematic uncertainty due to the extrapolation of the stellar spectral energy distribution in the model (Ji & Yan 2020).

7

This model shares some similarities with the Charlot & Fall (2000) model, where two different attenuation components are present.

8

We note that the assumed line ratios for the two components will affect the resulting slopes, and some fine-tuning is required to make it work approximately.

9

One might also wonder about the impact from flux calibrations. Since we defined the reddening relations in logarithmic line-ratio spaces, any systematic bias in flux calibrations would only change the intercept rather than the slope.

10

According to Kelly (2007), from the perspective of Bayesian statistics, this likelihood function corresponds to the special case where the prior for the intrinsic distribution of x is a uniform distribution.

11

Although here we assumed the error distributions of the logarithmic line ratios were Gaussian, assuming Gaussian errors for line fluxes lead to the same conclusion.

12

Here we used the PYTHON port of Kelly (2007)’s IDL package LINMIX_ERR shared by: https://github.com/jmeyers314/linmix.

13

According to the code descriptions of DAP, there is poor agreement between the reported nonparametric errors and those from a Monte Carlo simulation.

14

One might wonder whether the variation in the physical scales of the spaxels might impact our results. The redshift distribution of our sample is relatively narrow, with 68% of our sample spaxels lie in the range of 0.022 – 0.045, corresponding to physical scales ranging from 1.2 kpc to 2.4 kpc. Meanwhile, the median standard deviation of the physical scale in 3D bins is roughly 0.5 kpc. If we narrow the redshift range by removing spaxels with z < 0.017 or z > 0.050, our conclusion still stays the same.

Acknowledgments

The authors thank Kathryn Kreckel for providing the observational data of IC 342 in the data release of SDSS-IV MaNGA. The work described in this paper was partially supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China [Project No: CUHK 14302522]. RY also acknowledges support by the Hong Kong Global STEM Scholar scheme, by the Hong Kong Jockey Club Charities Trust through the JC STEM Lab of Astronomical Instrumentation, and by the Direct Grant of CUHK Faculty of Science. MB acknowledges support from FONDECYT regular grant 1211000 and by the ANID BASAL project FB210003. RR thanks to Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq, Proj. 311223/2020-6, 304927/2017-1 and 400352/2016-8), Fundação de amparo à pesquisa do Rio Grande do Sul (FAPERGS, Proj. 16/2551-0000251-7 and 19/1750-2), and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES, Proj. 0001). RAR acknowledges the support from Conselho Nacional de Desenvolvimento Científico e Tecnológico and Fundação de Amparo à pesquisa do Estado do Rio Grande do Sul. Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the US Department of Energy Office of Science, and the Participating Institutions. SDSS acknowledges support and resources from the Center for High-Performance Computing at the University of Utah. The SDSS web site is https://www.sdss.org. SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, the Chilean Participation Group, the French Participation Group, Harvard-Smithsonian Center for Astrophysics, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU)/University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatório Nacional/MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University.

References

  1. Abdurro’uf, Accetta K., Aerts, C., et al. 2022, ApJS, 259, 35 [NASA ADS] [CrossRef] [Google Scholar]
  2. Alloin, D., Collin-Souffrin, S., Joly, M., & Vigroux, L. 1979, A&A, 78, 200 [Google Scholar]
  3. Baldwin, J. A., Phillips, M. M., & Terlevich, R. 1981, PASP, 93, 5 [Google Scholar]
  4. Beers, T. C., Flynn, K., & Gebhardt, K. 1990, AJ, 100, 32 [Google Scholar]
  5. Belfiore, F., Maiolino, R., Tremonti, C., et al. 2017, MNRAS, 469, 151 [Google Scholar]
  6. Belfiore, F., Westfall, K. B., Schaefer, A., et al. 2019, AJ, 158, 160 [CrossRef] [Google Scholar]
  7. Belfiore, F., Santoro, F., Groves, B., et al. 2022, A&A, 659, A26 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Berg, D. A., Skillman, E. D., Croxall, K. V., et al. 2015, ApJ, 806, 16 [NASA ADS] [CrossRef] [Google Scholar]
  9. Berg, D. A., Pogge, R. W., Skillman, E. D., et al. 2020, ApJ, 893, 96 [NASA ADS] [CrossRef] [Google Scholar]
  10. Blanton, M. R., Bershady, M. A., Abolfathi, B., et al. 2017, AJ, 154, 28 [Google Scholar]
  11. Bottorff, M., Lamothe, J., Momjian, E., et al. 1998, PASP, 110, 1040 [Google Scholar]
  12. Bundy, K., Bershady, M. A., Law, D. R., et al. 2015, ApJ, 798, 7 [Google Scholar]
  13. Byler, N., Dalcanton, J. J., Conroy, C., & Johnson, B. D. 2017, ApJ, 840, 44 [Google Scholar]
  14. Calzetti, D., Kinney, A. L., & Storchi-Bergmann, T. 1994, ApJ, 429, 582 [Google Scholar]
  15. Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 [Google Scholar]
  16. Charlot, S., & Fall, S. M. 2000, ApJ, 539, 718 [Google Scholar]
  17. Chevallard, J., Charlot, S., Wandelt, B., & Wild, V. 2013, MNRAS, 432, 2061 [CrossRef] [Google Scholar]
  18. Cid Fernandes, R., Stasińska, G., Mateus, A., & Vale Asari, N. 2011, MNRAS, 413, 1687 [Google Scholar]
  19. Croxall, K. V., Pogge, R. W., Berg, D. A., Skillman, E. D., & Moustakas, J. 2015, ApJ, 808, 42 [NASA ADS] [CrossRef] [Google Scholar]
  20. Croxall, K. V., Pogge, R. W., Berg, D. A., Skillman, E. D., & Moustakas, J. 2016, ApJ, 830, 4 [NASA ADS] [CrossRef] [Google Scholar]
  21. D’Agostino, J. J., Kewley, L. J., Groves, B., et al. 2019, ApJ, 878, 2 [CrossRef] [Google Scholar]
  22. Diaz, A. I., Terlevich, E., Vilchez, J. M., Pagel, B. E. J., & Edmunds, M. G. 1991, MNRAS, 253, 245 [NASA ADS] [CrossRef] [Google Scholar]
  23. Dopita, M. A., Kewley, L. J., Heisler, C. A., & Sutherland, R. S. 2000, ApJ, 542, 224 [NASA ADS] [CrossRef] [Google Scholar]
  24. Dopita, M. A., Sutherland, R. S., Nicholls, D. C., Kewley, L. J., & Vogt, F. P. A. 2013, ApJS, 208, 10 [NASA ADS] [CrossRef] [Google Scholar]
  25. Draine, B. T. 2003, ApJ, 598, 1017 [NASA ADS] [CrossRef] [Google Scholar]
  26. Drory, N., MacDonald, N., Bershady, M. A., et al. 2015, AJ, 149, 77 [CrossRef] [Google Scholar]
  27. Emsellem, E., Schinnerer, E., Santoro, F., et al. 2022, A&A, 659, A191 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Fanelli, M. N., O’Connell, R. W., & Thuan, T. X. 1988, ApJ, 334, 665 [Google Scholar]
  29. Ferguson, A. M. N., Wyse, R. F. G., Gallagher, J. S. III., & Hunter, D. A. 1996, AJ, 111, 2265 [NASA ADS] [CrossRef] [Google Scholar]
  30. Ferland, G. J., Chatzikos, M., Guzmán, F., et al. 2017, Rev. Mex. Astron. Astrof., 53, 385 [Google Scholar]
  31. Fitzpatrick, E. L. 1999, PASP, 111, 63 [Google Scholar]
  32. Fitzpatrick, E. L., & Massa, D. 2007, ApJ, 663, 320 [Google Scholar]
  33. Flaherty, K. M., Pipher, J. L., Megeath, S. T., et al. 2007, ApJ, 663, 1069 [CrossRef] [Google Scholar]
  34. Flores-Fajardo, N., Morisset, C., Stasińska, G., & Binette, L. 2011, MNRAS, 415, 2182 [Google Scholar]
  35. Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
  36. Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. 1995, Bayesian Data Analysis (Chapman and Hall/CRC) [CrossRef] [Google Scholar]
  37. Gunn, J. E., Siegmund, W. A., Mannery, E. J., et al. 2006, AJ, 131, 2332 [NASA ADS] [CrossRef] [Google Scholar]
  38. Haffner, L. M., Dettmar, R. J., Beckman, J. E., et al. 2009, Rev. Mod. Phys., 81, 969 [CrossRef] [Google Scholar]
  39. Hogg, D. W., Bovy, J., & Lang, D. 2010, ArXiv e-prints [arXiv:1008.4686] [Google Scholar]
  40. Howard, C. S., Pudritz, R. E., Harris, W. E., & Klessen, R. S. 2018, MNRAS, 475, 3121 [NASA ADS] [CrossRef] [Google Scholar]
  41. Inoue, A. K. 2005, MNRAS, 359, 171 [NASA ADS] [CrossRef] [Google Scholar]
  42. Ji, X., & Yan, R. 2020, MNRAS, 499, 5749 [NASA ADS] [CrossRef] [Google Scholar]
  43. Ji, X., & Yan, R. 2022, A&A, 659, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Kashino, D., Silverman, J. D., Rodighiero, G., et al. 2013, ApJ, 777, L8 [Google Scholar]
  45. Kauffmann, G., Heckman, T. M., Tremonti, C., et al. 2003, MNRAS, 346, 1055 [Google Scholar]
  46. Kelly, B. C. 2007, ApJ, 665, 1489 [Google Scholar]
  47. Kelly, B. C. 2012, Statistical Challenges in Modern Astronomy V (New York: Springer), 147 [NASA ADS] [CrossRef] [Google Scholar]
  48. Kennicutt, R. C., & Evans, N. J. 2012, ARA&A, 50, 531 [NASA ADS] [CrossRef] [Google Scholar]
  49. Kewley, L. J., & Dopita, M. A. 2002, ApJS, 142, 35 [Google Scholar]
  50. Kewley, L. J., & Ellison, S. L. 2008, ApJ, 681, 1183 [Google Scholar]
  51. Kewley, L. J., Dopita, M. A., Sutherland, R. S., Heisler, C. A., & Trevena, J. 2001, ApJ, 556, 121 [Google Scholar]
  52. Kewley, L. J., Nicholls, D. C., & Sutherland, R. S. 2019, ARA&A, 57, 511 [Google Scholar]
  53. Kollmeier, J. A., Zasowski, G., Rix, H. W., et al. 2017, ArXiv e-prints [arXiv:1711.03234] [Google Scholar]
  54. Law, D. R., Yan, R., Bershady, M. A., et al. 2015, AJ, 150, 19 [CrossRef] [Google Scholar]
  55. Law, D. R., Cherinka, B., Yan, R., et al. 2016, AJ, 152, 83 [Google Scholar]
  56. Law, D. R., Ji, X., Belfiore, F., et al. 2021a, ApJ, 915, 35 [NASA ADS] [CrossRef] [Google Scholar]
  57. Law, D. R., Westfall, K. B., Bershady, M. A., et al. 2021b, AJ, 161, 52 [NASA ADS] [CrossRef] [Google Scholar]
  58. Li, N., Li, C., Mo, H., et al. 2021, ApJ, 917, 72 [NASA ADS] [CrossRef] [Google Scholar]
  59. Little, R. J. A., & Rubin, D. B. 2002, Statistical Analysis with Missing Data (New York: Wiley) [CrossRef] [Google Scholar]
  60. Mannucci, F., Belfiore, F., Curti, M., et al. 2021, MNRAS, 508, 1582 [NASA ADS] [CrossRef] [Google Scholar]
  61. McKee, C. F., & Williams, J. P. 1997, ApJ, 476, 144 [CrossRef] [Google Scholar]
  62. Mingozzi, M., Belfiore, F., Cresci, G., et al. 2020, A&A, 636, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  63. Nelder, J. A., & Mead, R. 1965, Comput. J., 7, 308 [Google Scholar]
  64. Oey, M. S., Meurer, G. R., Yelda, S., et al. 2007, ApJ, 661, 801 [NASA ADS] [CrossRef] [Google Scholar]
  65. Osterbrock, D. E., & Ferland, G. J. 2006, Astrophysics of Gaseous Nebulae and Active Galactic Nuclei (Sausalito, CA: University Science Books) [Google Scholar]
  66. Pagel, B. E. J., Edmunds, M. G., Blackwell, D. E., Chun, M. S., & Smith, G. 1979, MNRAS, 189, 95 [NASA ADS] [CrossRef] [Google Scholar]
  67. Pannella, M., Elbaz, D., Daddi, E., et al. 2015, ApJ, 807, 141 [Google Scholar]
  68. Pellegrini, E. W., Rahner, D., Reissl, S., et al. 2020a, MNRAS, 496, 339 [Google Scholar]
  69. Pellegrini, E. W., Reissl, S., Rahner, D., et al. 2020b, MNRAS, 498, 3193 [NASA ADS] [CrossRef] [Google Scholar]
  70. Press, W. H., Flannery, B. P., & Teukolsky, S. A. 1986, Numerical Recipes. The Art of Scientific Computing (Cambridge: University Press) [Google Scholar]
  71. Price, S. H., Kriek, M., Brammer, G. B., et al. 2014, ApJ, 788, 86 [NASA ADS] [CrossRef] [Google Scholar]
  72. Qin, J., Zheng, X. Z., Fang, M., et al. 2022, MNRAS, 511, 765 [NASA ADS] [CrossRef] [Google Scholar]
  73. Reddy, N. A., & Steidel, C. C. 2004, ApJ, 603, L13 [NASA ADS] [CrossRef] [Google Scholar]
  74. Reddy, N. A., Shapley, A. E., Kriek, M., et al. 2020, ApJ, 902, 123 [NASA ADS] [CrossRef] [Google Scholar]
  75. Reynolds, R. J. 1990, IAU Symp., 139, 157 [NASA ADS] [Google Scholar]
  76. Rezaee, S., Reddy, N., Shivaei, I., et al. 2021, MNRAS, 506, 3588 [CrossRef] [Google Scholar]
  77. Saha, A., Claver, J., & Hoessel, J. G. 2002, AJ, 124, 839 [NASA ADS] [CrossRef] [Google Scholar]
  78. Salim, S., & Narayanan, D. 2020, ARA&A, 58, 529 [NASA ADS] [CrossRef] [Google Scholar]
  79. Sánchez, S. F., Kennicutt, R. C., Gil de Paz, A., et al. 2012, A&A, 538, A8 [Google Scholar]
  80. Sanders, R. L., Jones, T., Shapley, A. E., et al. 2020, ApJ, 888, L11 [Google Scholar]
  81. Schaefer, A. L., Tremonti, C., Kauffmann, G., et al. 2022, ApJ, 930, 160 [NASA ADS] [CrossRef] [Google Scholar]
  82. Seon, K.-I. 2009, ApJ, 703, 1159 [NASA ADS] [CrossRef] [Google Scholar]
  83. Smee, S. A., Gunn, J. E., Uomoto, A., et al. 2013, AJ, 146, 32 [Google Scholar]
  84. Spitzer, L. 1978, Physical Processes in the Interstellar Medium (New York: Wiley) [Google Scholar]
  85. Thurston, T. R., Edmunds, M. G., & Henry, R. B. C. 1996, MNRAS, 283, 990 [NASA ADS] [CrossRef] [Google Scholar]
  86. Tremaine, S., Gebhardt, K., Bender, R., et al. 2002, ApJ, 574, 740 [NASA ADS] [CrossRef] [Google Scholar]
  87. Vale Asari, N., Wild, V., de Amorim, A. L., et al. 2020, MNRAS, 498, 4205 [NASA ADS] [CrossRef] [Google Scholar]
  88. Veilleux, S., & Osterbrock, D. E. 1987, ApJS, 63, 295 [Google Scholar]
  89. Vogt, F. P. A., Dopita, M. A., Kewley, L. J., et al. 2014, ApJ, 793, 127 [NASA ADS] [CrossRef] [Google Scholar]
  90. Wake, D. A., Bundy, K., Diamond-Stanic, A. M., et al. 2017, AJ, 154, 86 [Google Scholar]
  91. Westfall, K. B., Cappellari, M., Bershady, M. A., et al. 2019, AJ, 158, 231 [Google Scholar]
  92. Wild, V., Groves, B., Heckman, T., et al. 2011a, MNRAS, 410, 1593 [NASA ADS] [Google Scholar]
  93. Wild, V., Charlot, S., Brinchmann, J., et al. 2011b, MNRAS, 417, 1760 [NASA ADS] [CrossRef] [Google Scholar]
  94. Yan, R., Newman, J. A., Faber, S. M., et al. 2006, ApJ, 648, 281 [NASA ADS] [CrossRef] [Google Scholar]
  95. Yan, R., Bundy, K., Law, D. R., et al. 2016a, AJ, 152, 197 [Google Scholar]
  96. Yan, R., Tremonti, C., Bershady, M. A., et al. 2016b, AJ, 151, 8 [Google Scholar]
  97. Yan, R., Bershady, M. A., Smith, M. P., et al. 2020, SPIE Conf. Ser., 11447, 114478Y [NASA ADS] [Google Scholar]
  98. Zafar, T., Møller, P., Watson, D., et al. 2015, A&A, 584, A100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Zhang, K., Yan, R., Bundy, K., et al. 2017, MNRAS, 466, 3217 [NASA ADS] [CrossRef] [Google Scholar]
  100. Zibetti, S., Charlot, S., & Rix, H.-W. 2009, MNRAS, 400, 1181 [NASA ADS] [CrossRef] [Google Scholar]
  101. Zurita, A., Beckman, J. E., Rozas, M., & Ryder, S. 2002, A&A, 386, 801 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  102. Zurita, A., Rozas, M., & Beckman, J. E. 2000, A&A, 363, 9 [Google Scholar]

Appendix A: Linear regression with measurement errors and intrinsic scatter

Fitting a straight line to a data set that has errors on both dependent and independent variables have been a problem discussed by many authors with different solutions (e.g., Press et al. 1986; Tremaine et al. 2002; Kelly 2007; Hogg et al. 2010). Since this process is of great significance for our analyses, we examine the robustness of our adopted method through some ideal simulations in this appendix.

For our situation, the problem can be described as follows. There is a data set with two observed parameters, x and y (i.e., logarithmic line ratios and logarithmic Balmer decrement), which we believe to be linearly correlated according to the theory. However, there are measurement errors for both parameters, of which the variances are estimated to be σ x 2 $ \sigma _x^2 $ and σ y 2 $ \sigma_y ^2 $, respectively. Besides known observational uncertainties, there are also intrinsic variations in y (caused by variations in some nebular parameters) that have an unknown variance σ 0 2 $ \sigma_0 ^2 $. Meanwhile, the intrinsic variations in x is negligible (since the Balmer line ratios are roughly constants in H II regions). The goal is to find a linear model, y = mx + b that best represents the underlying true relation.

The simple least squares method does not work because of the extra uncertainty in x. A widely adopted solution is to construct and numerically maximize a likelihood function of the model parameters. Our discussion in this appendix will mainly focus on methods based on likelihood functions. For a good review of other methods, see e.g., Kelly (2007, 2012).

Once we have the proper likelihood function, we can numerically maximize it and find the best-fit model. However, the exact form of the likelihood function as well as the maximization method varies in the literature. In the main paper, we followed the method of Tremaine et al. (2002), whose logarithmic likelihood function is given by

ln F 0 = 0.5 ( y m x b ) 2 / ( m 2 σ x 2 + σ y 2 + σ 0 2 ) = 0.5 χ 2 . $$ \begin{aligned} \ln F_{0} = -0.5(y-mx-b)^2/(m^2\sigma _x^2 + \sigma _y^2 + \sigma _0^2) = -0.5 \chi ^2. \end{aligned} $$(A.1)

One immediately notices that the above equation is monotonically increasing with increasing σ 0 2 $ \sigma_0 ^2 $. Thus, if it is directly maximized, the solution always diverges with σ 0 2 $ \sigma _0^2 \rightarrow \infty $. Tremaine et al. (2002)’s maximization method is to start by setting σ 0 2 =0 $ \sigma _0^2 = 0 $. After obtaining the first set of solution, (m, b), one checks the corresponding χ2/(N − 2), where N is the number of data points. If χ2/(N − 2) > 1, increase σ 0 2 $ \sigma_0 ^2 $ by a small amount and recompute (m, b). This step is repeated until χ2/(N − 2)≤1.

There are, however, other potential forms of the likelihood function. For example,

ln F 1 = 0.5 ( y m x b ) 2 / ( m 2 σ x 2 + σ y 2 + σ 0 2 ) 0.5 ln ( m 2 σ x 2 + σ y 2 + σ 0 2 ) $$ \begin{aligned} \ln F_{1}&= -0.5(y-mx-b)^2/(m^2\sigma _x^2 + \sigma _y^2 + \sigma _0^2) \nonumber \\&-0.5 \ln (m^2\sigma _x^2 + \sigma _y^2 + \sigma _0^2) \end{aligned} $$(A.2)

and

ln F 2 = 0.5 ( y m x b ) 2 / ( m 2 σ x 2 + σ y 2 + σ 0 2 ) 0.5 ln [ ( m 2 σ x 2 + σ y 2 + σ 0 2 ) / ( 1 + m 2 ) ] . $$ \begin{aligned} \ln F_{2}&= -0.5(y-mx-b)^2/(m^2\sigma _x^2 + \sigma _y^2 + \sigma _0^2)\nonumber \\&-0.5 \ln [(m^2\sigma _x^2 + \sigma _y^2 + \sigma _0^2)/(1+m^2)]. \end{aligned} $$(A.3)

Equation A.2 is equivalent to the logarithm of Equation 24 in Kelly (2007)10. Besides the χ2 term, F1 includes an extra term related to the total variance, which is not a constant as it involves m and σ 0 2 $ \sigma_0 ^2 $. With the variance term, one can now directly maximize the likelihood function as it is no longer a monotonic function of σ 0 2 $ \sigma_0 ^2 $. Equation A.3 is similar to Equation A.2, but it evaluates the variance perpendicular to the linear model rather than along the y axis, thus introducing the 1 + m2 term (this term cancels out in the χ2 term but not in the variance term, see e.g., Equation 32 of Hogg et al. 2010). Similar to Equation A.2, Equation A.3 can also be directly maximized.

To compare the above three likelihood methods, we set up a test based on the observed data in MaNGA. First, we generated a subsample by choosing a random spaxel classified as an SF region and selecting spaxels that lie within 0.05 dex to this spaxel in the 3D line-ratio space. By the end of this step, we obtained ∼7000 spaxels that are close to each other in the 3D line-ratio space. Second, we randomly selected 200 spaxels from this subsample. The logarithmic Balmer decrements of these spaxels were used as the intrinsic values for the independent variable, x0. Their measurement uncertainties provided by DAP were also used to calculate σx. The final simulated values for x were obtained by using random variables drawn from the normal distribution N( x 0 , σ x 2 ) $ N(x_0, \sigma_x^2) $11. The intrinsic values for the dependent variable was given by y0 = mtruex0 + btrue, where mtrue was set to m [ OIII ] , [ OII ] F 99 = 0.95 $ m^{\mathrm{F99}}_{\mathrm{[OIII],[OII]}}=0.95 $ and btrue was set to −0.5. We then used the measurement uncertainties of log([O III]/[O II]) to represent σy, while setting the intrinsic scatter in y to σ0 = 0.015. The final simulated values for y were drawn from the random distribution y 0 +N(0, σ y 2 )+N(0, σ 0 2 ) $ y_0 + N(0,\sigma _y^2) + N(0,\sigma _0^2) $. We then used the aforementioned three likelihood functions to derive m, b, and σ0. The second step was repeated 5,000 times and distributions of these model parameters were compared.

Figure A.1 shows the comparisons between model parameters estimated by different likelihood functions. Our default function, F0, best reproduces the true parameters for m, b, and σ0. Although it underestimated σ0 by yielding σ0, measured = 0 for ∼30% of the data, this does not have any noticeable impact on estimations of m and b. In comparison, F1 (F2) tends to underestimate (overestimate) m and overestimate (underestimate) b. In addition, both F1 and F2 overestimate σ0.

thumbnail Fig. A.1.

Distributions of slopes, intercepts, and intrinsic scatters estimated by different likelihood functions for the same simulated data set. In each panel, the vertical dashed line indicates the true value of the corresponding parameter for the simulated data set. Clearly F0 best reproduces the true parameters of the linear relation. It is also the default likelihood function we used in the main paper.

The above maximum-likelihood methods all require a good understanding of the measurement uncertainty. To check the impact of measurement uncertainties, we performed additional tests where these uncertainties were incorrectly estimated but were still fed into the likelihood functions. Table A.1 summarizes the impact of the biased measurement uncertainties on the derived median slopes. We found that when σx was overestimated or σy was underestimated, the resulting median m had an increase, making F0 overestimate mtrue. On the other hand, when σy was overestimated or σx was underestimated, the resulting median m had a decrease, making F0 underestimate mtrue. Interestingly, when both σx and σy were overestimated by the same fractional amount, the median m remained roughly the same. For example, in one of our tests where σx and σy were both overestimated by 25%, the median m returned by F0 only differed from mtrue by ∼3%. In contrast, when both σx and σy were underestimated by 25%, the median m estimated by F0 differed from mtrue by ∼20%. One can see that the fractional changes in the median m given by F1 and F2 when the measurement uncertainties are incorrectly estimated are actually smaller than those given by F0 in some cases. Still, F0 is the only likelihood function that recovers mtrue when there is no bias in the measurement uncertainty. Overall F0 appears slightly more robust against any bias in σy rather σx, probably because σ0 can be adjusted to partly compensate the biased σy.

Table A.1.

Median slopes given by different likelihood functions when measurement uncertainties are correctly or incorrectly estimated. (mtrue = 0.95)

Apart from the above maximum-likelihood methods, there is a Bayesian method based on Gaussian mixture models introduced by Kelly (2007), which outperforms other methods according to Kelly (2007). Basically, Kelly (2007) approximates the intrinsic distribution of x, denoted as ψ, as Gaussians and incorporate them into the likelihood function. The posterior distributions of m, b, and ψ can be jointly inferred from the data.

We performed another test to see if our likelihood function produced similar results to those given by Kelly (2007)’s estimator. We set up a new simulated data set similar to the one produced by Kelly (2007) by drawing the intrinsic values of x from the probability distribution function (PDF)

p ( ξ ) e ξ ( 1 + e 2.75 ξ ) 1 . $$ \begin{aligned} p(\xi ) \propto e^{\xi }(1+e^{2.75\xi })^{-1}. \end{aligned} $$(A.4)

This PDF can be approximated by a Gaussian distribution, N(μ = −0.493, σ2 = 1.12). The true slope and intercept are set to mtrue = 0.5 and btrue = 1, which gives the intrinsic distribution of y. The intrinsic scatter σ0 is described by another Gaussian distribution, N(0, σ 0 2 = 0.75 2 ) $ N(0, \sigma _0^2 = 0.75^2) $. Measurement uncertainties of x and y are independently drawn from a scaled inverse-χ2 distribution whose degrees of freedom ν = 5 and scale parameters sx = 0.5σ for x and sy = 0.5σ0 for y. We drew 50 data points each trial and performed a total of 5,000 trials to derived distributions of slopes, intercepts, and intrinsic scatter using F0 as well as Kelly (2007)’s estimator.

Similar to Kelly (2007)’s test, when applying his estimator, we assumed the intrinsic distribution for x was known and used the Gaussian distribution N(μ = −0.493, σ2 = 1.12) as an approximation in Kelly (2007)’s Equation 16 to obtain the likelihood function

ln F 3 = 0.5 [ ( z ζ ) T V 1 ( z ζ ) ] 0.5 | V | , $$ \begin{aligned} \ln F_{3} = -0.5[(\mathbf{z}-\zeta )^TV^{-1}(\mathbf{z}-\zeta )] -0.5 |V|, \end{aligned} $$(A.5)

where z = (y, x), ζ = (mtrueμ + btrue, μ), and the covariance matrix V = ( m true 2 σ 2 + σ y 2 + σ 0 2 m true σ 2 m true σ 2 σ 2 + σ x 2 ) $ V = \left( \begin{array}{c} m_{\mathrm{true}}^{2} \sigma^{2} + \sigma_{y}^{2} + \sigma_{0}^{2} \quad m_{\mathrm{true}} \sigma^{2} \\ \qquad m_{\mathrm{true}} \sigma^{2} \qquad\quad \sigma^{2} + \sigma_{x}^{2}\\ \end{array}\right) $. Following Kelly (2007), we simply performed maximum-likelihood estimations in this test rather than derived posteriors. Similar to F1 and F2, this likelihood function can be directly maximized to estimate m and b.

Figure A.2 shows the linear regression results we obtained with F0 and F3. It is clear that both methods not only recover the true value for each parameter equally well, but also yield similar dispersions for the distributions of the parameters. We also tried different values for the input parameters to simulate other data sets as Kelly (2007) did, and still found the two methods gave nearly identical distributions. In addition to the simulated data, we applied Kelly (2007)’s Bayesian method to several observed reddening relations in MaNGA, and again found results almost identical to what we have obtained12. We thereby concluded our adopted method is equally good as the method recommended by Kelly (2007) in terms of the output.

thumbnail Fig. A.2.

Distributions of slopes, intercepts, and intrinsic scatters estimated by our default likelihood function F0 and Kelly (2007)’s likelihood function F3 for the same simulated data set. In each panel, the vertical dashed line indicates the true value of the corresponding parameter for the simulated data set.

Finally, in the aforementioned tests the measurement errors are not correlated, which apply to most of our results in the main paper. In cases when x and y contain a common line ratio, one needs to replace the variance with a covariance matrix in the likelihood function. For example, for some of the slope measurements in the main paper, both x and y have the log(Hα) term. When measuring, for example, mHα/Hγ, we simply replaced the variance term with m 2 σ x 2 + σ y 2 2mVar[ log(Hα)]+ σ 0 2 $ m^2\sigma _x^2 + \sigma _y^2 - 2mVar[\log({\rm H}\alpha)] + \sigma _0^2 $, where Var[log(Hα)] is the variance of log(Hα). However, since Hα generally has much higher S/N compared to other lines, adding Var[log(Hα)] has little impact on the overall results.

Appendix B: Results based on nonparametric emission-line measurements

This work heavily relies on emission line measurements of the observed spectra. The MaNGA data were analyzed by DAP, which produced emission line measurements by using Gaussian models as templates for emission lines and fit them simultaneously with the stellar continuum models (Westfall et al. 2019; Belfiore et al. 2019). We note that there are other pipelines and codes that use different fitting schemes and might produce subtle differences for the emission line measurements (see e.g., Appendix of Belfiore et al. 2019). In addition, a single Gaussian component might not be optimal for emission line fitting in cases where multiple kinematic components are blended in observations. Belfiore et al. (2019) carefully assessed the quality of the emission line measurements of DAP, confirming the S/N reported by DAP is an appropriate metric and the emission lines are statistically well fitted by Gaussian models for S/N < 30. When S/N > 30, line profiles start to deviate from Gaussians, but the reported Gaussian fluxes are still consistent with the non-parametric fluxes obtained by direct summing. Therefore, significant bias in flux measurements and uncertainty estimations seem unlikely in our sample.

Regardless, we performed a test based on the nonparametric emission line measurements provided by DAP. These emission lines fluxes were obtained by first subtracting the best-fit stellar continuum models from the spectra, and then subtracting a baseline local to each emission line, and finally directly integrating the residual fluxes with predefined passbands (Yan et al. 2006; Westfall et al. 2019). The corresponding errors were estimated via propagation of pixelwise errors but was not fully tested or calibrated. Thus, Westfall et al. (2019) warned the usage of these error measurements13. In our test, we used the nonparametric fluxes and measurement errors to replace the Gaussian values and remeasure slopes for [N II]/[O II], [S III]/[O III], and [O III]/[O II], and we summarized the results in Table B.1. Similar to what we did in the main paper, we applied an S/N cut of 3 for every emission line involved. This selection criterion left us ∼8.1 × 105 spaxels, which are considerably fewer compared to the Gaussian case (∼2.4 × 106). This is because the estimated uncertainties for nonparametric fluxes are systematically higher than those for Gaussian fluxes due to the significant noise contribution from pixels far away from line centers. We included all 3D bins with number of spaxels greater than 80 and ended up with 1,589 bins. Slightly adjusting this bin-selection criterion does not change the overall conclusion.

Table B.1.

Median slopes measured from different data sets

We notice m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ given by nonparametric measurements is considerably larger than that given by Gaussian measurements (still it is significantly different from the values predicted by the F99 curve). For comparison, in Table B.1 we also list the results given by the Gaussian measurements using the same sample that was selected by the S/N cuts on nonparametric measurements. These Gaussian results also show some differences from those in the main paper, which could be due to the fact that the sample here was not selected using the S/N cuts on Gaussian measurements. While the nonparametric fluxes are in good agreement with Gaussian fluxes, the nonparametric flux uncertainties are overall higher. Recalling the test on measurement uncertainties in Appendix A, we suspected the difference is mainly driven by the systematically higher non-parametric flux uncertainties. Indeed, once we replaced the nonparametric uncertainties with the Gaussian uncertainties, but still using the nonparametric fluxes, the resulting slopes were lowered. Since the Gaussian uncertainties reported by MaNGA were well tested and calibrated while the non-parametric uncertainties were not (Westfall et al. 2019; Belfiore et al. 2019), we consider the Gaussian results to be more trustworthy.

Finally, how the different emission-line fitting methods will affect our results statistically is beyond the scope of the current paper. Still we think it is important to fully test this point in future works and apply our slope-measuring method to other large astronomical data sets.

Appendix C: Results derived from high-resolution observations on IC 342

The main MaNGA sample has a typical spatial resolution of 1 – 2 kpc14. However, MaNGA also observes the nearby galaxy IC 342 with a much higher resolution of ∼32 pc, which is enough to resolve many large H II regions and separate them from the surrounding DIG. If our speculation of a resolution-dependent attenuation law is correct, we expect to measure reddening relations that are compatible with an average MW extinction curve either described by F99 or CCM89, no matter the lines involved are low ionization lines or not. Since there are considerably fewer spaxels for IC 342 in MaNGA, we increased the sizes of the 3D bins and adjusted them based on the S/N of lines. We first selected data points with more than 3σ detections on Hα, Hβ, [N II], [S II], and [O III], and then removed non-H II regions using the 3D diagnostic diagram. This step results in a total of 56,425 spaxels. When measuring reddening relations involving other lines, we applied extra cuts to remove data points with S/N < 3 for these extra lines.

Figure C.1 shows the distributions of m[NII],[OII], m[SIII],[OIII], m[OIII],[OII], and m[SIII],[OII], and Table C.1 summarizes their median values and uncertainties. The numbers of available data points for these measurements are 10,063, 50,545, 10,063, and 10,058, respectively. For reddening relations of [N II]/[O II], [O III]/[O II], and [S III]/[O II], we selected 3D bins with the number of spaxels greater than 25. While for the reddening relation of [S III]/[O III], we selected 3D bins with the number of spaxels greater than 100. The final number of 3D bins we used for [N II]/[O II], [S III]/[O III], [O III]/[O II], and [S III]/[O II] is 107, 155, 107, and 107, respectively. Clearly the S/N cut in [O II] removes a large number of data points. If we remeasured m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $ using a sample with S/N cuts in both [S III] and [O II] (which has 10,058 data points), the median value changes from 1.58 ± 0.07 to 1.80 ± 0.13.

thumbnail Fig. C.1.

Slope distributions for different reddening relations in IC 342. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). Dashed lines mark the medians of the distributions (red), values predicted by the F99 extinction curve (blue), and values predicted by the CCM89 extinction curve (green). The median values found in the MaNGA main sample are indicated by the dotted purple lines.

Table C.1.

Median slopes for different reddening relations in IC342 and the MaNGA main sample

Despite having much wider distributions compared to the MaNGA main sample, the median slopes derived from the aforementioned line ratios seem to be more consistent with the predictions of the F99 extinction curve. Due to the limited number of spaxels in IC 342, we made larger 3D bins compared to the MaNGA main sample, which inevitably increased the uncertainties of our measurements. We made 20 bins along each of the three axes of the 3D space, covering the whole volume occupied by all the spaxels. This results in a bin size of ∼0.028 × 0.045 × 0.100 dex3 when measuring m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $, m [ OIII ] , [ OII ] $ m^\prime_{\mathrm{[OIII],[OII]}} $, and m [ SIII ] , [ OII ] $ m^\prime_{\mathrm{[SIII],[OII]}} $, and a bin size of ∼0.051 × 0.063 × 0.103 dex3 when measuring m [ SIII ] , [ OIII ] $ m^\prime_{\mathrm{[SIII],[OIII]}} $. We tried remeasuring the slopes of the MaNGA main sample using the bin size adopted for IC 342. The resulting m [ NII ] , [ OII ] $ m^\prime_{\mathrm{[NII],[OII]}} $ is still significantly different from the F99 value (in fact, the deviation becomes slightly larger with these larger bins). Therefore, it seems the different bin sizes are not the main reason for the different measurements we obtained.

In Figure C.2, we show how the three BPT line ratios change with the Balmer decrement as well as the Hα surface brightness in IC 342. Here we did no apply any S/N cut to the data. It is clear that [N II] and [S II] are relatively strong compared to Hα when the Balmer decrement or the Hα surface brightness is low and the emission is dominated by the DIG. In comparison, [O III] is also stronger when the Balmer decrement or the Hα surface brightness is low, but the scatter in [O III]/Hβ is large. One can imagine that in low-resolution observations, differently attenuated regions with different line ratios are thus mixed, which could lead to the differential attenuation as we discussed. Still, IC 342 is only one example and we need more high-resolution observations to quantify this effect.

thumbnail Fig. C.2.

Relations between forbidden line-to-Balmer line ratios and the Balmer decrement in IC 342. Data points are color-coded according to the surface brightness of Hα line. No S/N cut was applied to the data.

Overall, limited by the large standard deviations of the slope distributions, we cannot draw a strong conclusion for the attenuation law in IC 342. Currently there are not enough data from major IFU surveys to allow us to repeat the same statistical analysis we performed on the MaNGA data. The Calar Alto Legacy Integral Field Area survey (CALIFA, Sánchez et al. 2012), for example, includes more nearby galaxies with higher spatial resolution compared to MaNGA, but its data volume is much smaller and its wavelength range does not cover [S III]. As another example, the Physics at High Angular resolution in Nearby GalaxieS - Multi Unit Spectroscopic Explorer survey (PHANGS-MUSE, Emsellem et al. 2022) is able to provide resolved observations of H II regions, but the sample size is even more limited and its wavelength range does not cover [O II] and [S III]λ9531, the stronger line in the [S III] doublet. Regardless, it would be interesting to check the attenuation of low ionization lines including [N II], [S II], and [O I] using these data at the cost of larger statistical uncertainties, which we plan to investigate in future work. Upcoming surveys such as SDSS-V/Local Volume Mapper (LVM, Kollmeier et al. 2017) and the Affordable Multi-Aperture Spectroscopy Explorer (AMASE, Yan et al. 2020) can provide much stronger constraints on the multicomponent dust attenuation on even smaller physical scales, which we also plan to use in the future.

All Tables

Table 1.

Median slopes of the reddening relations [log(Emission line ratio) versus log(Hα/Hβ/2.86)] derived for a sample of 3D bins.

Table 2.

Logarithmic difference between the intrinsic line strengths of CHigh and CLow in a DIG contamination model.

Table 3.

Median slopes of reddening relations of different line ratios derived from samples selected by ΣHα and b/a.

Table A.1.

Median slopes given by different likelihood functions when measurement uncertainties are correctly or incorrectly estimated. (mtrue = 0.95)

Table B.1.

Median slopes measured from different data sets

Table C.1.

Median slopes for different reddening relations in IC342 and the MaNGA main sample

All Figures

thumbnail Fig. 1.

Distribution of the MaNGA sample viewed in a 3D line-ratio space and in one of its 2D projections. Upper left: 3D density map of the MaNGA sample, where SF spaxels are colored from black to white and other ionized regions are colored from purple to yellow. A photoionization model computed by Ji & Yan (2020) using CLOUDY is also shown, whose color coding indicates the metallicity of the simulated H II region. The blue cube is an exaggerated illustration of the 3D bins we used in this work. Upper right: the same photoionization model is shown, but color coded according to the ionization parameter of the simulated H II region. Bottom: a 2D projection of the 3D space that corresponds to an “edge-on” view of the data (color map) and model (blue). The two axes, P1 and P2, are linear combinations of the original three axes, which are denoted as N2, S2, and R3. For clarity, we only show the part of the model that covers the middle 98% of the data along the line of sight. It is clear from this view that the model surface cuts through the center of the SF spaxel distribution.

In the text
thumbnail Fig. 2.

Fitting examples in one of the 3D bins. The dashed red lines are the best-fit lines given by the maximum-likelihood method. The orange shaded regions indicate the 1σ uncertainties of the linear models. The dash-dotted blue lines are obtained by fixing the slopes using the values from an F99 extinction curve with RV = 3.1 during the fit. The error bars indicate the measurement uncertainties in different logarithmic line ratios. Left: reddening relation between Hα/Hβ and Hα/Hγ. Middle: reddening relation between Hα/Hβ and [S III]/[O III]. Right: reddening relation between Hα/Hβ and [N II]/[O II].

In the text
thumbnail Fig. 3.

Relative attenuation probed by Balmer lines. The first row shows the results obtained by fitting a linear function to the log(Hα/Hγ/2.86×0.469) vs. log(Hα/Hβ/2.86) relation within each 3D bin we defined. From left to right, we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-green lines indicate the values obtained by the traditional method where the whole sample is used for a single fit. The dashed-blue lines correspond to the values given by the F99 extinction curve. The second row shows similar information, but is obtained by fitting a linear function to the log(Hα/Hδ/2.86×0.259) versus log(Hα/Hβ/2.86) relation.

In the text
thumbnail Fig. 4.

Relative attenuation probed by high ionization lines. The first row shows the results obtained by fitting a linear function to the log([S III]/[O III]) vs. log(Hα/Hβ/2.86) relation within each 3D bin we defined. From left to right: we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-blue lines correspond to the values given by the F99 extinction curve. The second row shows similar information, but is obtained by fitting a linear function to the log([Ne III]/[O III]) versus log(Hα/Hβ/2.86) relation.

In the text
thumbnail Fig. 5.

Relative attenuation probed by low ionization lines. We fit a linear function to the log([N II]/[O II]) versus log(Hα/Hβ/2.86) relation within each 3D bin. From left to right, we plot the distributions of the slope, intercept, and the intrinsic scatter. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). The dashed-red lines indicate the median values among our 3D bins. The dashed-blue lines correspond to the values given by the F99 extinction curve.

In the text
thumbnail Fig. 6.

Reddening relation between log([N II]/[O II]) and log(Hα/Hβ/2.86). Only data with S/N > 30 in Hβ are included. We obtained the intrinsic log([N II]/[O II]) value at log(Hα/Hβ/2.86) = 0 and subtracted it from the observed log([N II]/[O II]) for each 3D bin. The black contours show the distribution of the data without any attenuation correction. Whereas the blue contours show the density distribution of the data with their log([N II]/[O II]) corrected by the F99 extinction curve. The contour levels trace the number density of the data points and are equally spaced on a log scale. The outermost contour and the innermost contour enclose 90% and 10% of the data, respectively. The dashed red line and the dashed green line trace the median trends in the 3D bins for the two data sets. The dash-dotted line is a horizontal line for reference.

In the text
thumbnail Fig. 7.

Distributions of the slope differences under different S/N cuts. The black solid histogram shows data with S/N of [S III], [O III], and Hβ greater than 3; the dashed green histogram shows data with S/N of the aforementioned lines greater than 10; the dotted red histogram shows data with S/N of the aforementioned lines greater than 15. The slope difference peaks toward smaller values with the increasing S/N limit. However, the difference is actually due to the dependence of the slope on the surface brightness of the lines.

In the text
thumbnail Fig. 8.

Derived slopes for different line ratios as a function of the gas-phase metallicity. The y axes correspond to slopes of the log(fline1/fline2) versus log(//2.86) relations. The x axes correspond to metallicities derived by using Bayesian inference with a fiducial photoionization model. Each data point corresponds to a 3D bin with number of spaxels greater than 180. The 3D bins are color-coded according to the median Hα surface brightness (which is corrected by the Balmer decrement assuming an F99 extinction curve) of their spaxels. Dashed black lines and dotted black lines correspond to the median slopes and the values predicted by an F99 extinction curve, respectively.

In the text
thumbnail Fig. 9.

Cartoon illustration of a two-component attenuation model. The model consists of a more attenuated component with high AV and a less attenuated component with low AV. The emission lines from both components are mixed in observations. While the high AV component is (part of) a dusty H II region, the low AV component could be a part of the H II region that is less dusty, the DIG around the H II region, or a combination of both.

In the text
thumbnail Fig. 10.

Effective attenuation laws generated by different models. Row a) shows several logarithmic line ratio versus logarithmic Balmer derement relations predicted by a partially covered cloud model. For comparison, we also plotted the relation predicted by the F99 extinction curve and the median relation we found in MaNGA. Row b) shows the relations predicted by a two-component model with different intrinsic line ratios. We made the H II component to have overall lower intrinsic line ratios (relative to Hα) compared to the DIG component. Row c) shows a similar model as Row b), but the underlying true attenuation curve is replaced with the CCM89 extinction curve. All y axes have arbitrary normalization.

In the text
thumbnail Fig. 11.

Two-component attenuation model for [N II]/[O II], where only the attenuation of the more attenuated component is varied. Different sets of points correspond to models with different intrinsic flux contributions from the two components as well as different AV, Low. The arrow shows the direction of increasing AV, High in the model.

In the text
thumbnail Fig. 12.

Intrinsic scatters in y for the Balmer reddening relations as functions of the S/N of the weaker Balmer lines. Each point represents a 3D bin, and its median S/N is plotted. Left: how the derived intrinsic scatter in log(Hα/Hγ) varies with the S/N of Hγ. Right: how the derived intrinsic scatter in log(Hα/Hδ) varies with the S/N of Hδ.

In the text
thumbnail Fig. 13.

Correlation between the metallicity and the V-band attenuation AV for the 3D bins that contain more than 180 spaxels. The V-band attenuation is derived from the Balmer decrement. For each 3D bin, the median values of 12+log(O/H) and AV are plotted. The vertical and horizontal errorbars show the typical (median) standard deviations of AV and 12+log(O/H), respectively, in each 3D bin.

In the text
thumbnail Fig. A.1.

Distributions of slopes, intercepts, and intrinsic scatters estimated by different likelihood functions for the same simulated data set. In each panel, the vertical dashed line indicates the true value of the corresponding parameter for the simulated data set. Clearly F0 best reproduces the true parameters of the linear relation. It is also the default likelihood function we used in the main paper.

In the text
thumbnail Fig. A.2.

Distributions of slopes, intercepts, and intrinsic scatters estimated by our default likelihood function F0 and Kelly (2007)’s likelihood function F3 for the same simulated data set. In each panel, the vertical dashed line indicates the true value of the corresponding parameter for the simulated data set.

In the text
thumbnail Fig. C.1.

Slope distributions for different reddening relations in IC 342. For each distribution, we show the 68% confidence interval of the median [CI(Mdn)] and the standard deviation (SD). Dashed lines mark the medians of the distributions (red), values predicted by the F99 extinction curve (blue), and values predicted by the CCM89 extinction curve (green). The median values found in the MaNGA main sample are indicated by the dotted purple lines.

In the text
thumbnail Fig. C.2.

Relations between forbidden line-to-Balmer line ratios and the Balmer decrement in IC 342. Data points are color-coded according to the surface brightness of Hα line. No S/N cut was applied to the data.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.