Issue
A&A
Volume 675, July 2023
BeyondPlanck: end-to-end Bayesian analysis of Planck LFI
Article Number A14
Number of page(s) 18
Section Cosmology (including clusters of galaxies)
DOI https://doi.org/10.1051/0004-6361/202243160
Published online 28 June 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

One of the most important sources of information on the early universe is cosmic microwave background (CMB; Penzias & Wilson 1965). By mapping and characterizing the statistical properties of this signal over recent decades, cosmologists have constrained both the composition and evolution of the universe to less than one percent accuracy (e.g., Bennett et al. 2013; Planck Collaboration I 2020).

As shown by the COBE-FIRAS experiment (Mather et al. 1994), the frequency spectrum of the CMB may be described to a very high level of precision in terms of a blackbody with a mean temperature of TCMB = 2.7255 K (Fixsen 2009). As such, its peak intensity occurs at 161 GHz, and the primary frequency range considered by most CMB experiments is therefore around 30–300 GHz. In addition to the CMB, a diverse range of astrophysical emission mechanisms contribute to the observed signal at these frequencies, which are of both Galactic and extragalactic origin. For intensity, the main contributors are synchrotron, free–free, CO, thermal dust, and anomalous microwave emission; whereas for the polarization, we see that synchrotron and thermal dust emission are dominant (e.g., Bennett et al. 2013; Planck Collaboration IV 2020, and references therein).

At the foreground minimum, around 80 GHz, the CMB anisotropies dominate over the combined foreground amplitude over most of the sky (Planck Collaboration Int. XLVI 2016), and CMB temperature extraction is therefore relatively straightforward. The same does not hold true for polarization, even at the foreground minimum, the sum of synchrotron and thermal dust emission is greater than the polarized CMB by almost a full order of magnitude on large angular scales. Polarized foreground estimation therefore plays a critically important role in contemporary cosmology, as such observations may contain signatures of primordial gravitational waves created during the Big Bang (e.g., Kamionkowski & Kovetz 2016, and references therein). They can thereby provide a unique observational window on inflation and physics at the Planck energy scale.

However, the amplitude of the primordial gravitational wave signal is expected to be smaller than 10–100 nK on large angular scales (Tristram et al. 2021). This, combined with a plethora of confounding instrumental effects, such as temperature-to-polarization leakage (Paradiso et al. 2023), correlated noise (Ihle et al. 2023), and calibration uncertainties (Gjerløw et al. 2023), makes high-precision CMB polarization science a particularly difficult challenge. Furthermore, as summarized by the Planck team in a document (titled “Lessons learned from Planck1) the quality of current state-of-the-art CMB observations is limited by the interplay between instrumental and foreground effects. This insight has formed the basis for the BEYONDPLANCK project (BeyondPlanck Collaboration 2023), which aims to implement an end-to-end Bayesian CMB analysis framework that jointly accounts for both systematic effects and astrophysical foregrounds, starting from raw time-ordered data. This framework employs an explicit parametric model that jointly accounts for cosmological, astrophysical, and instrumental parameters. These parameters are sampled with Markov Chain Monte Carlo methods, such as Gibbs sampling, implemented in the Commander software (Eriksen et al. 2004, 2008; Galloway et al. 2023).

The BEYONDPLANCK results are described in a suite of 17 companion papers (see BeyondPlanck Collaboration 2023 and references therein), each focusing on a particular aspect of the analysis. The current paper focuses on polarized foreground characterization, both in terms of algorithms and results, with special attention paid to the spectral properties of polarized synchrotron emission on large angular scales. The current BEYONDPLANCK analysis considers only the Planck LFI observations in terms of time-ordered data, although selected preprocessed external data sets are also included to break critical degeneracies, specifically, WMAP measurements between 33 and 61 GHz (Bennett et al. 2013), Planck HFI measurements at 353 and 857 GHz (Planck Collaboration I 2020; Planck Collaboration Int. LVII 2020), and the Haslam 408 MHz measurements (Haslam et al. 1982), the latter two are only included in temperature. Overall, the main emphasis of the present analysis lies in frequencies below the foreground minimum, between 30 and 70 GHz, along with the spectral behavior of polarized synchrotron and thermal dust emission within this critically important frequency range. The present analysis is the first to combine high-resolution Planck measurements with low-resolution WMAP observations into a single coherent model, when estimating both foreground amplitudes and spectral parameters.

The rest of this paper is structured as follows: In Sect. 2, we briefly review the BEYONDPLANCK data model, focusing on the aspects that are relevant for the polarized foreground analysis. In Sect. 3, we review the data sets that are included in the current analysis. In Sect. 4 we describe the basic algorithms, and connect these to the larger Gibbs sampling framework outlined in BeyondPlanck Collaboration (2023) Our results are presented in Sect. 5 and our conclusions in Sect. 6.

2. BEYONDPLANCK data model

As described by BeyondPlanck Collaboration (2023), the main goal of this project is to perform end-to-end Bayesian CMB analysis, building upon well-established statistical methods. The first step in any such Bayesian analysis is to write down an explicit parametric data model that will be fit to the observations via posterior mapping techniques. For BEYONDPLANCK, we adopted the following model for this purpose:

d j , t = g j , t P t p , j [ B p p , j symm c M cj ( β p , Δ bp j ) a p c + B j asymm ( s j orb + s j fsl ) ] + s j , t 1 hz + n j , t corr + n j , t w , $$ \begin{aligned} d_{j,t}&= g_{j,t}\mathbf P _{tp,j}\left[\mathbf B ^{\mathrm{symm} }_{pp\prime ,j}\sum _{c} \mathbf{M }_{cj}(\beta _{p\prime }, \Delta _{\mathrm{bp} }^{j})a^c_{p\prime } + \mathbf B ^{\mathrm{asymm} }_{j}\left(s^{\mathrm{orb} }_{j} + s^{\mathrm{fsl} }_{j}\right)\right] \nonumber \\&\quad + s^{\mathrm{1hz} }_{j,t} + n^{\mathrm{corr} }_{j,t} + n^{\mathrm{w} }_{j,t}, \end{aligned} $$(1)

where j represents a radiometer (or detector) label, t indicates a single time sample, p denotes a single pixel on the sky, and c represents one single astrophysical signal component. Furthermore, dj, t denotes the measured time-ordered data; gj, t denotes the instrumental gain; Ptp, j is a pointing matrix; Bpp′,j denotes beam convolution; Mcj(βp, Δbp) denotes a foreground mixing matrix that depends on some set of spectral parameters, β, and instrumental bandpass specification, Δbp; a p c $ a^c_{p}$ $ is the amplitude of an astrophysical component, c, in pixel p, measured at the same reference frequency as the mixing matrix M, and expressed in brightness temperature units; s j , t orb $ s^{\mathrm{orb}}_{j,t} $ is the orbital CMB dipole signal; s j , t fsl $ s^{\mathrm{fsl}}_{j,t} $ denotes the contribution from far side-lobes; s j , t 1 hz $ s^{\mathrm{1hz}}_{j,t} $ denotes the contribution from electronic 1 Hz spikes; n j , t corr $ n^{\mathrm{corr}}_{j,t} $ denotes correlated instrumental noise; and n j , t w $ n^{\mathrm{w}}_{j,t} $ is uncorrelated instrumental noise with (diagonal) time-domain covariance matrix N j w $ \mathbf{N}^{\mathrm{w}}_j $. For further details regarding any of these objects, we refer the interested reader to BeyondPlanck Collaboration (2023) and references therein.

2.1. Astrophysical sky model

The current paper focuses primarily on diffuse astrophysical foregrounds, which, for practical purposes are stationary in time, and we are therefore not interested in the time-domain aspects of the model. For this reason, we rewrite Eq. (1) into the following compact form:

m ν , p = [ B p p symm c M c ( β p , Δ bp ) a p c ] + n p w , $$ \begin{aligned} m_{\nu ,p} = \left[\mathbf B ^{\mathrm{symm} }_{pp\prime }\sum _{c} \mathbf{M }_{c}(\beta _{p\prime }, \Delta _{\mathrm{bp} })a^c_{p\prime }\right] + n^{\mathrm{w} }_{p}, \end{aligned} $$(2)

where mν, p is a binned sky map derived by co-adding all radiometer data within a single frequency channel and we focus here on conditioning the all time-domain parameters:

( j ν P j t ( N j w ) 1 P j ) m ν = j P j t ( N j w ) 1 r j . $$ \begin{aligned} \left(\sum _{j \in \nu } \mathbf P _j^t (\mathbf N ^{\mathrm{w} }_{j})^{-1} \mathbf P _j\right) {\boldsymbol{m}}_{\nu } = \sum _j \mathbf P _j^t (\mathbf N ^{\mathrm{w} }_j)^{-1}{\boldsymbol{r}}_j. \end{aligned} $$(3)

Here, rj represents the cleaned and calibrated time-ordered data for the detector j, as defined by Eq. (76) in BeyondPlanck Collaboration (2023), and full marginalization over time-domain parameters is done iteratively through Gibbs sampling (see Sect. 4.1 for further details).

For the purposes of this study, we are particularly interested in the total sky signal, which may be written as follows:

s = M a c = 1 N comp a c [ U f c ( ν ; β ) τ ( ν ) d ν ] . $$ \begin{aligned} {\boldsymbol{s}} = \mathbf M {\boldsymbol{a}} \equiv \sum _{c=1}^{N_{\mathrm{comp} }}{\boldsymbol{a}}_c\, \left[\,U \int f_c(\nu ; \beta )\, \tau (\nu )\,\mathrm{d} \nu \right]. \end{aligned} $$(4)

Here, each astrophysical signal component, c, is associated with an overall amplitude parameter, a; a unit conversion factor, U, for going from either thermodynamic or intensity units to brightness temperature; a spectral energy density, fc, describing the intensity of the component relative to some reference frequency, ν0; and some set of spectral parameters, β, which characterize the frequency dependence of the various emission mechanisms. Finally, τ represents an instrumental bandpass response function that describes the detector sensitivity as a function of frequency. Again, we refer to BeyondPlanck Collaboration (2023) for further details.

2.2. Polarized sky model

We assume in this paper that the polarized microwave sky can be well approximated by three physically distinct components; synchrotron emission, thermal dust emission, and CMB. A spinning dust or anomalous microwave emission (AME) component is also commonly included in similar studies, however, with an upper limit of its polarization fraction at < 1% (Génova-Santos et al. 2017; Macellari et al. 2011; Herman et al. 2023) and we neglect it in this study.

Each component exhibits a distinctly different behavior both in terms of spatial structure and frequency dependence and must be described by some unique set of spectral parameters, β. The choice of spectral parameters is both important and nontrivial. On the one hand, it is important that the adopted parametric model for each component is able to provide an acceptable goodness-of-fit, typically as measured by some χ2 statistic. On the other hand, it is also important that the model is not too flexible, as degeneracies between components may increase the effective noise level to arbitrary high levels. Generally speaking, degeneracies also tend to exacerbate instrumental systematic errors, by attempting to accommodate small signal discrepancies within the unconstrained parameters in the signal model. Choosing an appropriate parametric model is thus a delicate trade-off between allowing for a sufficient level of flexibility to model the real sky, while avoiding the introduction of too many parameters that would otherwise increase both systematic and statistical uncertainties to untenable levels.

For BEYONDPLANCK, we adopted the following parametric model for the polarized microwave sky:

s RJ = a CMB x 2 e x ( e x 1 ) 2 ( e x 0 1 ) 2 x 0 2 e x 0 , $$ \begin{aligned} {\boldsymbol{s}}_{\mathrm{RJ} }&= \,{\boldsymbol{a}}_{\mathrm{CMB} }\, \frac{x^2e^{x}}{\left(e^{x}-1\right)^2} \frac{\left(e^{x_0}-1\right)^2}{x_0^2e^{x_0}},\end{aligned} $$(5)

+ a s ( ν ν 0 , s ) β s , $$ \begin{aligned}&\quad + {\boldsymbol{a}}_{\mathrm{s} }\, \left(\frac{\nu }{\nu _{0,\mathrm{s} }} \right)^{\beta _{\mathrm{s} }},\end{aligned} $$(6)

+ a d ( ν ν 0 , d ) β d + 1 e h ν 0 , d / k T d 1 e h ν / k T d 1 , $$ \begin{aligned}&\quad + {\boldsymbol{a}}_{\mathrm{d} }\,\left(\frac{\nu }{\nu _{0,\mathrm{d} }}\right)^{\beta _{\mathrm{d} }+1} \frac{e^{h\nu _{0,\mathrm{d} }/kT_{\mathrm{d} }}-1}{e^{h\nu /kT_{\mathrm{d} }}-1}, \end{aligned} $$(7)

where all vectors are expressed in brightness (Rayleigh-Jeans) temperature units; x = hν/kT0 (where h and k are Planck’s and Boltzmann’s constants respectively and T0 is the CMB monopole of 2.7255 K; Fixsen 2009); and ν0, c is the reference frequency for component c, a is its amplitude, βs is the synchrotron power-law index, and βd and Td are the thermal dust power-law index and temperature, respectively. Even this minimal model, which adopts a power-law SED for synchrotron and modified blackbody SED for thermal dust, contains far too many parameters to be fit per pixel with the BEYONDPLANCK data set, and several informative priors will be imposed to regularize the system, as discussed in detail below.

2.2.1. Synchrotron emission

The first foreground component, and the brightest Galactic emission mechanism between 30 and 70 GHz, is synchrotron radiation. This emission is generated when relativistic cosmic-ray electrons ejected from supernovae gyrate in the Galactic magnetic field. From both temperature and polarization observations, the synchrotron SED is known to be well approximated by a power-law over several decades in frequency (e.g., Lawson et al. 1987; Reich & Reich 1988; Platania et al. 2003; Davies et al. 2006; Gold et al. 2009), at least from 1 GHz to 100 GHz, although some analyses also claim evidence for a slight spectral steepening towards higher frequencies (Kogut 2012; Jew et al. 2019). The Galprop model (Orlando et al. 2018) is a physically motivated model that takes into account quantities such as the electron temperature (Te) and large-scale magnetic field distributions, and this physical model also predicts SED flattening below 1 GHz depending on the model parameters. However, because the flattening occurs at frequencies well below 10 GHz, this is not relevant for the current analysis, which only considers frequencies above 30 GHz, and we therefore assume a straight power-law SED model for synchrotron emission in the following.

Next, several analyses have reported significant spatial variations in βs (e.g., Fuskeland et al. 2014, 2021; Krachmalnicoff et al. 2018). In general, these analyses report flatter spectral indices (around βs = −2.8) at low Galactic latitudes, and steeper ones at high Galactic latitudes (around βs = −3.1). This picture appears roughly consistent with similar results derived from intensity observations (e.g., Vidal et al. 2015; Platania et al. 1998; Lawson et al. 1987; Reich & Reich 1988). On the other hand, when analyzing low-resolution WMAP polarization data with full correlated noise propagation, Dunkley et al. (2009b) found only minor differences between low and high Galactic latitudes.

As shown in the following, the BEYONDPLANCK data combination2 has in general low statistical power to probe spatial variations when simultaneously sampling TOD-parameters. The combination of a reduced number of observing bands to that of previous Planck and WMAP component-separation efforts, additional free instrumental parameters catalyzes degeneracies. We therefore partition the sky into four large disjoint regions, as shown in the top panel of Fig. 1, and fit one spectral index for each region. For reference, the middle panel in this figure shows the Planck 2018 synchrotron amplitude map (Planck Collaboration IV 2020) with region outlines and a corresponding processing mask used in BEYONDPLANCK.

thumbnail Fig. 1.

Top: Final region map used in the sampling procedure for the synchrotron spectral index. Middle: Planck 2018 polarized synchrotron amplitude map with the corresponding spectral index processing mask for BEYONDPLANCK and a region outline that divides the sky into six disjoint regions. This was further reduced to four, motivated primarily by the low S/N apparent in some of the regions. The mask is chosen to reduce temperature-to-polarization leakage bias, and to exclude bright point sources. Bottom: Planck 2018 polarized thermal dust amplitude with the corresponding processing mask.

The region set was defined as follows: We started from the 24-region partitioning defined by Fuskeland et al. (2014), which itself was based on the nine year WMAP polarization analysis mask (Bennett et al. 2013). We then ran preliminary analyses to determine the effective posterior width resulting from each region. If this was found to be strongly prior-dominated, we merged neighboring regions. This process resulted in four main regions, which we will refer to as (1) High Latitudes; (2) the Galactic Spur or simply Spur; (3) the Galactic Center; and (4) the Galactic plane. As we will elaborate on in Sect. 5.3.1, even after this process, the high-latitude region does not have a sufficient signal-to-noise ratio (S/N) to significantly constrain βs without suffering from detrimental degeneracies with instrumental parameters such as gain coefficients. The reason for this is illustrated in the middle panel of Fig. 1; the synchrotron emission in this region is generally very faint and there is very little leverage to estimate spectral variations as a function of frequency.

Since the spectral index is fit uniformly over large sky regions, it is important to impose a processing mask during the posterior evaluation to avoid pixels with poor goodness-of-fit from contaminating surrounding pixels. To this end, we constructed a dedicated spectral index processing mask for synchrotron emission by first fitting the component amplitudes given our chosen prior. We then smoothed these map to 4deg FWHM and constructed a corresponding mask by thresholding on the strongest 5% of the signal to remove the brightest part of the Galactic plane. Next, we combined the resulting mask with a smoothed χ2 map with a threshold at the strongest 10% of the signal to remove strong sources such as Tau A and their surrounding area. Finally, we used an unsmoothed χ2 map in order to remove particularly bright compact sources. The resulting mask is visualized along with the region outlines in the middle panel of Fig. 1.

We also note that synchrotron emission is sensitive to Faraday rotation (e.g., Beck et al. 2013) in areas with high electron densities and magnetic fields, which is the case near the Galactic center. This effect is caused by circular birefringence, where left- and right-handed circularly polarized emission traverse the magnetic field of an ionized medium at different velocities, generating a net linear polarization signal with a polarization angle proportional to the field strength. However, since this effect is proportional to the squared wavelength of the radiation, it is most prominent at frequencies below 5 GHz, and it is negligible for most of the sky above 30 GHz. In this study, we do not make any corrections for Faraday rotation to any frequency channel. However, as shown in Sect. 5, our synchrotron constraints for the Galactic center region are contaminated by systematic effects and these are most likely due to a combination of instrumental and astrophysical mismodeling effects.

2.2.2. Thermal dust emission

The second most significant polarized foreground component between 30 and 70 GHz is thermal dust emission generated by interstellar dust grains that collectively account for ≈1% of the mass of the interstellar medium (see, e.g., Hensley & Draine 2020 for a recent review of relevant physics and observations constraints). The size of these grains typically varies from a few angstroms to a few tenths of a μm, and they are heated up by the interstellar radiation field to a temperature of about 20 K. This heat is then re-emitted thermally with a peak frequency between 1000 and 2000 GHz (as defined in brightness temperature units). Furthermore, due to paramagnetic dissipation resulting from interactions between rotating grains and the local magnetic field, the dust grains tend to align with their short axes parallel to the local magnetic field. In turn, this alignment can induce a polarization signal with a polarization fraction of 20% or more, as reported by Planck Collaboration XI (2020).

The most commonly used thermal dust SED model in CMB studies is that of a modified blackbody function, as defined in Eq. (7). This SED has three free parameters: (1) the amplitude ad, which traces the surface density of dust particles along each line of sight; (2) a spectral index, βd, that quantifies the low-frequency slope of the SED and depends on the physical composition of the dust grains; and (3) the dust temperature, Td, which affects the peak location of the modified blackbody function and depends on the local radiation field strength at any given position. There are more complex, and perhaps more realistic, representations that exist, including multi-component modified blackbody (e.g., Finkbeiner et al. 1999; Meisner & Finkbeiner 2015) or physical dust grain models (Guillet et al. 2018).

The particular data combination used in the BEYONDPLANCK analysis has a very low S/N for constraining thermal dust SED variations, whether they are of frequency or spatial origin, as we are working at synchrotron dominated frequencies. In this paper, we only consider one single free parameter for the thermal dust SED, namely, a spatially constant value of βd. The bottom panel in Fig. 1 shows the Planck thermal dust amplitude map (Planck Collaboration IV 2020) with its corresponding processing mask used in the sampling procedure. This mask was generated by thresholding the 10% largest values of the smoothed χ2 map as described in the last section. The thermal dust temperature map is fixed pixel-by-pixel to that derived from Planck temperature observations between 30 and 857 GHz by Planck Collaboration Int. LVII (2020).

We made no attempts to account for SED variations along each line of sight. As emphasized by Tassis & Pavlidou (2015), the sum of two modified blackbodies is not a modified blackbody, and spatial variations in either βd or Td along each line of sight will therefore necessarily break the current model. This effect is particularly important in terms of the polarization, since the magnetic field direction also varies along the line of sight, potentially resulting in different SEDs for the two Stokes parameters, Q and U. However, these variations are far too weak to be measured with the current data set.

3. Data selection

As described by BeyondPlanck Collaboration (2023), a primary motivation for the BEYONDPLANCK project is to establish a common analysis platform for past, current, and future CMB observations that support end-to-end analysis, from raw time-ordered data to final high-level products such as astrophysical component maps and cosmological parameters. To support and guide the algorithm development process, the Planck LFI observations were chosen as a first real-world application for this framework, a choice that was primarily motivated by the fact that this data set is already well known to the collaboration members and, secondly, because of its modest data volume and relatively benign instrumental systematic effects.

At the same time, it is clear that the LFI data are not able to independently constrain all the relevant astrophysical components. There are at least five significant diffuse components between 30 and 70 GHz; CMB, synchrotron, free-free, AME, and thermal dust emission, while the LFI data only comprise three independent frequency channels. The LFI data must therefore be augmented with external observations in order to derive a statistically non-degenerate model. In principle, the Planck HFI data (Planck Collaboration III 2020) would be an ideal match, providing strong constraints on CMB, free-free and thermal dust emission. However, if these data were to be included in the BEYONDPLANCK analysis in their entirety, they would dominate over the LFI observations in terms of total sensitivity, which would then undermine the main purpose of the current presentation, namely, a focus on the algorithms themselves. For this reason, we chose to include only the Planck HFI 857 GHz channel in temperature and the 353 GHz channel in polarization to constrain the amplitude of thermal dust emission. In both cases, we adopted the latest rendition of these maps published by the Planck team, as derived by the Planck Data Release 4 (DR4) pipeline, also known as NPIPE (Planck Collaboration Int. LVII 2020).

The same argument applies to the WMAP K-band channel (Bennett et al. 2013). While this channel provides strong constraints on both AME and polarized synchrotron emission, its statistical sensitivity is so high that it would rival that of the Planck LFI 30 GHz channel if included in the BEYONDPLANCK analysis, which again would undermine the main purpose of the current work. For this reason, we include only the WMAP Ka-, Q-, and V-band channels in the following, centered on 33, 41, and 61 GHz, respectively3.

In addition, we included the Haslam 408 MHz survey to constrain synchrotron emission in temperature (see Andersen et al. 2023). This results in a total of eight frequency channels in intensity, and seven frequency channels in polarization, which, in principle, should be sufficient to constrain a model with five intensity components and three polarization components.

One of the important novel algorithmic developments (described in Sect. 4) is true multi-resolution component separation for both linear and non-linear foreground parameters. Specifically, we consider in the following only the low-resolution WMAP polarization data at a HEALPix4 (Górski et al. 2005) resolution of Nside = 16, for which dense pixel-pixel noise covariance matrices are available (Bennett et al. 2013). This ensures that we have (at least nominally) a complete noise description for all frequency channels relevant for CMB extraction and that the small-scale polarization results are entirely dominated by Planck LFI.

All data except the Planck DR4 353 GHz channel were analyzed as provided by the respective team without any preprocessing or smoothing. The 353 GHz channel, however, is smoothed to an angular resolution of 10′ and re-pixelized at a resolution of Nside = 1024, corresponding to a pixel size of 3 . 4 $ 3{{\overset{\prime}{.}}}4 $. This is done both in order to speed up the analysis process (BeyondPlanck Collaboration 2023; Galloway et al. 2023) and to suppress small-scale correlated noise that would otherwise lead to a significant χ2 excess. Table 1 provides an overview over all data sets included in the current analysis.

Table 1.

Overview of all included data bands in the BEYONDPLANCK polarization analysis.

4. Methods

The next aim is to fit the data model in Eq. (1) to the data listed in Table 1. We define ω = {g, ncorr, Δbp, a, β, …} as the set of all free parameters in this model of both instrumental and astrophysical origin. Our main goal is then to map out the corresponding joint posterior distribution, which is given by Bayes’ theorem,

P ( ω d ) = P ( d ω ) P ( ω ) P ( d ) L ( ω ) P ( ω ) . $$ \begin{aligned} P(\omega \mid \boldsymbol{d}) = \frac{P(\boldsymbol{d}\mid \omega )P(\omega )}{P(\boldsymbol{d})} \propto \mathcal{L} (\omega )P(\omega ). \end{aligned} $$(8)

In this expression, P(ω ∣ d) is the posterior distribution and it describes our knowledge about ω after performing the experiment in question; ℒ(ω)P(d ∣ ω) is called the likelihood and it quantifies the information content in d regarding ω; and P(ω) is called the prior and it quantifies our beliefs regarding ω before doing the experiment. The prior can also be used actively to regularize specific degeneracies in the model.

This posterior distribution is large and complex with billions of free parameters. Attempting to directly map out the full posterior distribution by brute-force is infeasible. Instead, we resort to Markov chain Monte Carlo methods and draw samples from the posterior distribution. In particular, we find that the Gibbs sampling algorithm (Geman & Geman 1984) is particularly well suited to handle the complexity of this model. We therefore adopt the Commander CMB Gibbs sampling framework as the starting point for our analysis. This software was first introduced by Eriksen et al. (2004) for optimal CMB power spectrum estimation applications, building on ideas originally suggested by Jewell et al. (2004) and Wandelt et al. (2004), which were later generalized to also account for joint CMB and component separation by Eriksen et al. (2008) and Seljebotn et al. (2019). In this operational mode, Commander played a central role in the official Planck analysis, as summarized by Planck Collaboration I (2014, 2016, 2020) and references therein. BEYONDPLANCK has now generalized this framework further to also account for low-level TOD processing and mapmaking, effectively turning the entire CMB analysis challenge into one global problem.

4.1. Gibbs sampling

Gibbs sampling formalizes the idea of iterative analysis within a rigorous statistical language. In short, the theory of Gibbs sampling states that samples from a complex joint distribution may be drawn by iteratively sampling from each (typically simpler) conditional distribution. The full BEYONDPLANCK Gibbs chain (BeyondPlanck Collaboration 2023) illustrates how this is done in practice, written as

g P ( g d , ξ n , Δ bp , a , β , C ) , $$ \begin{aligned} \boldsymbol{g}&\,\leftarrow P(\boldsymbol{g}&\,\mid&\,\boldsymbol{d},&\,&\,\xi _n,&\,\Delta _{\mathrm{bp} },&\,\boldsymbol{a},&\,\beta ,&\,C_{\ell }),\end{aligned} $$(9)

n corr P ( n corr d , g , ξ n , Δ bp , a , β , C ) , $$ \begin{aligned} \boldsymbol{n}_{\mathrm{corr} }&\,\leftarrow P(\boldsymbol{n}_{\mathrm{corr} }&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,&\,\xi _n,&\,\Delta _{\mathrm{bp} },&\,\boldsymbol{a},&\,\beta ,&\,C_{\ell }),\end{aligned} $$(10)

ξ n P ( ξ n d , g , n corr , Δ bp , a , β , C ) , $$ \begin{aligned} \xi _n&\,\leftarrow P(\xi _n&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,\boldsymbol{n}_{\mathrm{corr} },&\,&\,\Delta _{\mathrm{bp} },&\,\boldsymbol{a},&\,\beta ,&\,C_{\ell }),\end{aligned} $$(11)

Δ bp P ( Δ bp d , g , n corr , ξ n , a , β , C ) , $$ \begin{aligned} \Delta _{\mathrm{bp} }&\,\leftarrow P(\Delta _{\mathrm{bp} }&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,\boldsymbol{n}_{\mathrm{corr} },&\,\xi _n,&\,&\,\boldsymbol{a},&\,\beta ,&\,C_{\ell }),\end{aligned} $$(12)

a P ( a d , g , n corr , ξ n , Δ bp , β , C ) , $$ \begin{aligned} \boldsymbol{a}&\,\leftarrow P(\boldsymbol{a}&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,\boldsymbol{n}_{\mathrm{corr} },&\,\xi _n,&\,\Delta _{\mathrm{bp} },&\,&\,\beta ,&\,C_{\ell }),\end{aligned} $$(13)

β P ( β d , g , n corr , ξ n , Δ bp , a , C ) , $$ \begin{aligned} \beta&\,\leftarrow P(\beta&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,\boldsymbol{n}_{\mathrm{corr} },&\,\xi _n,&\,\Delta _{\mathrm{bp} },&\,\boldsymbol{a},&\,&\,C_{\ell }) ,\end{aligned} $$(14)

C P ( C d , g , n corr , ξ n , Δ bp , a , β , ) . $$ \begin{aligned} C_\ell&\,\leftarrow P(C_\ell&\,\mid&\,\boldsymbol{d},&\,\boldsymbol{g},&\,\boldsymbol{n}_{\mathrm{corr} },&\,\xi _n,&\,\Delta _{\mathrm{bp} },&\,\boldsymbol{a},&\,\beta ,&\,)&. \end{aligned} $$(15)

Here, the symbol ← means setting the variable on the left-hand side equal to a sample from the distribution on the right-hand side. For convenience, we also define the notation “ω \ ξ” to imply the set of parameters in ω except ξ.

The parameters not already introduced are noise power spectrum parameters (ξn), bandpass corrections (Δbp), and the CMB power spectrum (C). Since the current paper focuses on polarized foregrounds, we are here primarily interested in the amplitude parameters, a, sampled in Eq. (13), and the spectral parameters, β, sampled in Eq. (14). We will therefore describe these two sampling steps in detail in the following, and we refer the interested reader to BeyondPlanck Collaboration (2023) and references therein for details regarding the remaining steps.

4.2. Signal amplitude sampling

We first consider the signal amplitude conditional distribution, P(a ∣ d, ω \ a), which has already been a primary focus of interest for a long line of studies, including Jewell et al. (2004), Eriksen et al. (2004, 2008), and Seljebotn et al. (2019). In the following, we give a brief summary of these developments and also highlight two important novel features introduced in the current work.

The appropriate starting point for this conditional distribution is the data model described in Eq. (2). We first note that a contains all signal amplitude maps, using some appropriate linear basis, stacked column-wise, such that a = [aCMB, as, ad]t, where the t superscript denotes the transpose operator. Further, we adopt a spherical harmonics basis to describe each diffuse component, such that a i ={ a i,lm T , a i,lm E , a i,lm B } $ {\boldsymbol{a}}_i = \{a^T_{i,\ell m},a_{i,\ell m}^E,a_{i,\ell m}^B\} $ contains the various temperature and polarization spherical harmonics coefficients (Zaldarriaga & Seljak 1997) for component i. Second, for notational convenience we combine the beam and mixing matrix operators in Eq. (2) into one joint linear operator, Aν ≡ BνMν, such that this equation may be written succinctly as

m ν = A ν a + n ν . $$ \begin{aligned} {\boldsymbol{m}}_{\nu } = \mathbf{A }_{\nu }\boldsymbol{a} + {\boldsymbol{n}}_{\nu }. \end{aligned} $$(16)

In noting that the noise, nν = mν − Aνa, is assumed to be Gaussian distributed with vanishing mean and a known covariance matrix, Nν, Bayes’ theorem then allows us to write the conditional distribution of interest in the following form:

P ( a d , ω a ) P ( d ω ) P ( a ) $$ \begin{aligned} P(\boldsymbol{a}\mid \boldsymbol{d}, \omega \setminus \boldsymbol{a})&\propto P(\boldsymbol{d}\mid \omega )P(\boldsymbol{a})\end{aligned} $$(17)

P ( m a ) P ( a ) $$ \begin{aligned}&\propto P(\boldsymbol{m}\mid \boldsymbol{a})P(\boldsymbol{a})\end{aligned} $$(18)

( ν exp ( 1 2 ( m ν A ν a ) t N ν 1 ( m ν A ν a ) ) ) × exp ( 1 2 ( a a ¯ ) t S ν 1 ( a a ¯ ) ) . $$ \begin{aligned}&\propto \left(\prod _{\nu }\mathrm \exp \left(-\frac{1}{2}({\boldsymbol{m}}_{\nu }-\mathbf{A }_{\nu }\boldsymbol{a})^t\mathbf N _{\nu }^{-1}({\boldsymbol{m}}_{\nu }-\mathbf{A }_{\nu }\boldsymbol{a})\right)\right)\nonumber \\&\times \mathrm \exp \left(-\frac{1}{2}(\boldsymbol{a}-\bar{\boldsymbol{a}})^t\mathbf S _{\nu }^{-1}(\boldsymbol{a}-\bar{\boldsymbol{a}})\right). \end{aligned} $$(19)

Here, the second line holds because TOD binning is a deterministic operation when conditioning on ω (BeyondPlanck Collaboration 2023), and the set of binned sky maps, m, is a sufficient statistic for a; no additional information in the TOD can possibly provide more knowledge about the foreground amplitude maps beyond that stored in m if ω is exactly known. In the last line, we adopt a Gaussian signal prior with mean a ¯ $ \bar{\boldsymbol{a}} $ and covariance matrix S, discussed further below. In this paper, we set a ¯ = 0 $ \bar{\boldsymbol{a}}=0 $, and only use S for smoothing purposes. Since the product of two Gaussian distributions is another Gaussian, P(a ∣ d, ω \ a) is also Gaussian, and the appropriate sampling algorithm is given by a standard multivariate Gaussian (discussed in detail in Appendix A in BeyondPlanck Collaboration 2023). In particular, a proper sample may be drawn by solving the following linear equation for a:

( S 1 + ν A ν t N ν 1 A ν ) a = ν A ν t N ν 1 m ν + ν A ν t N ν 1 / 2 η ν + S 1 / 2 η 0 , $$ \begin{aligned} \biggl (\mathbf S ^{-1} + \sum _{\nu }\mathbf A ^t_{\nu }\mathbf N _{\nu }^{-1}\mathbf{A }_{\nu }\biggr )\,\boldsymbol{a} = \sum _\nu \mathbf{A }_{\nu }^t\mathbf N _\nu ^{-1}{\boldsymbol{m}}_{\nu } + \sum _{\nu }\mathbf{A }_{\nu }^t\mathbf N _{\nu }^{-1/2}\eta _{\nu } + \mathbf S ^{-1/2}\eta _{0}, \end{aligned} $$(20)

where ην and η0 are random Gaussian vectors of independent N(0, 1) variates. Because of its broad dimensionality, this equation must in practice be solved using iterative linear algebra techniques, and we use a preconditioned conjugate gradient solver for this purpose (e.g., Shewchuk 1994).

This exact equation was discussed in detail by Seljebotn et al. (2019), who also introduced two novel and efficient preconditioners for partial-sky observations based on pseudo-inverse and multi-grid techniques. However, in the present work, for which no analysis mask is imposed during the main Gibbs sampling analysis, we adopt the simpler diagonal pre-conditioner described by Eriksen et al. (2008), which converges slightly faster with full-sky data and also has a lower computational cost per iteration. The two main novel features regarding Eq. (20) introduced by BEYONDPLANCK (as described by Andersen et al. (2023) for temperature and by this paper for polarization) concern the data and model representation and active use of spatial priors.

4.2.1. Vector representation and object-oriented programming

Equation (20) is written in a general vector form, without reference to any specific vector representation or basis, or to any specific form of either the effective mixing matrix, Aν, or noise covariance matrix, Nν. All of these, in principle, may be chosen per component and frequency channel. As detailed by Galloway et al. (2023), we have implemented such a level of flexibility in the most recent version of Commander by adopting an object-oriented programming style, where separate classes are defined for each object. For instance, the current implementation contains specific classes for three general types of components, namely, diffuse components (for which a is represented in terms of spherical harmonic coefficients, am), point source objects (for which a is represented in terms of a single flux density per compact source), and fixed spatial templates (for which a is represented in a single multiplicative amplitude per template). Any combination of such objects may all be fit simultaneously and jointly through Eq. (20). It is also relatively straightforward to add new types of objects as the need arises. Two examples of classes that might be important for future applications include pixel- or needlet-based components (e.g., Marinucci et al. 2008), which could be useful for modeling partial sky experiments.

Each of the instrumental objects were also implemented in terms of individual classes. This is particularly relevant for the noise covariance matrix, Nν, which may have very different representations for different experiments. For example, in the current analysis, we modeled the Planck LFI noise as a sum of correlated and white noise, where the correlated noise is treated as a stochastic variable in the Gibbs chain, and sampled over directly, while the white noise uncertainty is propagated analytically through a diagonal Nν. This approach supports, for the first time, propagation of both correlated and white noise at all angular resolutions. For WMAP, however, we do not yet have access to time-ordered data within our framework, so Nν is defined in terms of the precomputed dense low-resolution noise covariance matrices provided by the WMAP team (Bennett et al. 2013), which is computationally feasible because this data is smoothed to Nside = 16. For this reason, WMAP contributes only to large angular scales in the current analysis. Finally, for the Planck 353 GHz measurements, which are essential for modeling polarized thermal dust emission at full angular resolution, we are for now only able to propagate white noise uncertainties with a diagonal Nν as given by Planck Collaboration Int. LVII (2020). The result of this is that any systematic effects native to this band will be propagated to lower frequencies because it serves as the primary source of signal to the thermal dust component. Of course, this will necessarily lead to an underestimation of dust-related uncertainties; hence, modeling Planck HFI observations in the time domain is clearly a high-priority issue for future works, but outside the scope of the current project.

In summary, the first critically important novel feature provided by the current Commander implementation is the ability to operate with fundamentally different types of data sets within one analysis and thereby exploit complementary features from each data set to break degeneracies within the full model. At the time of publication, there is direct support for only three types of noise covariance matrices, namely diagonal matrices, WMAP-style dense Stokes QU matrices, and diagonal matrices with explicit marginalization over low- modes. However, using the new infrastructure described here and by Galloway et al. (2023), it is straightforward to add support for other types of matrices. Possible useful examples include block-diagonal or banded covariance matrices, or noise covariance matrices with specific subspaces projected out through operator based filters. The latter could be particularly useful for ground-based experiments that tend to have limited statistical sensitivity on large angular scales, but with high systematic uncertainties. We hope that such features can be implemented and made publicly available by third-party authors as Open Source contributions (Gerakakis et al., in prep.).

4.2.2. Gaussian spatial priors

The second novel feature supported by the latest Commander implementation is the use of active spatial priors for the diffuse components. For practical purposes, we currently support only Gaussian priors, as described via Eq. (19), as non-Gaussian priors would lead to a prohibitively high computational cost associated with the current conjugate gradient-based approach. In practice, imposing a spatial prior is therefore equivalent to specifying a prior mean, a ¯ $ \bar{\boldsymbol{a}} $, and covariance matrix, S, for each component. In the current polarization-oriented analysis, however, we do not wish to enforce any informative priors on any of the three free components (polarized CMB, synchrotron or thermal dust emission), so we set a ¯ = 0 $ \bar{\boldsymbol{a}}=0 $ for all three. Instead, we used S to adjust the allowed level of fluctuations around zero for each component as a function of angular scale; this is numerically equivalent to choosing an appropriate effective smoothing scale for each component and might, for the purposes of this paper, be considered more of a technical issue than a prior in the normal sense. For an example of an application of active priors, however, see Andersen et al. (2023), in which proper informative spatial priors are imposed on both free-free and anomalous microwave emission.

The remaining question is how to choose a specific form of S for each component. There are two main requirements for this choice. First, S is the covariance of a, and therefore dictates the overall fluctuation level of the fitted components. A large value of S implies a weak prior, and the fitted component will then be dominated by the data-driven likelihood term in Eq. (19), while a low value of S implies a strong prior, resulting in values close to the prior mean. In practice, we want the prior to play a limited role where the data have a large S/N, but play a stronger role where the data are noise-dominated. It is therefore in general desirable to choose scale-dependent priors, in which the smoothing becomes gradually stronger; a prime example of this is a standard Gaussian smoothing kernel.

As already mentioned, we adopt spherical harmonics as our basis set for a, writing each diffuse polarized signal component as:

a X ( n ̂ ) = m a m X Y m ( n ̂ ) . $$ \begin{aligned} a_{X}(\hat{n}) = \sum _{\ell m} a^{X}_{\ell m} Y_{\ell m}(\hat{n}). \end{aligned} $$(21)

For each component, we therefore also define an angular power spectrum prior of the form:

D ̂ X = | a m X | 2 ( + 1 ) / 2 π , $$ \begin{aligned} \hat{D}^{X}_{\ell } = \left < |a^{X}_{\ell m}|^2\right> \,\ell (\ell +1)/2\pi , \end{aligned} $$(22)

where X = {E, B}. We note that this power spectrum prior representation is conceptually similar to the assumptions made by SMICA (Cardoso et al. 2008), one of the four CMB extraction codes used by Planck.

We adopt the following prior variances for each of the three astrophysical components in the current analysis:

D ̂ CMB 1 ( ) = 0 μ K 2 , $$ \begin{aligned}&\hat{D}^{-1}_{\mathrm{CMB} }(\ell ) = 0\,\upmu \mathrm{K} ^2,\end{aligned} $$(23)

D ̂ s ( ) = 200 e ( + 1 ) σ 2 ( 30 ) μ K 2 , $$ \begin{aligned}&\hat{D}_{\mathrm{s} }(\ell ) = 200\, \mathrm{e} ^{-\ell (\ell +1)\sigma ^2(30^\prime )} \,\upmu \mathrm{K} ^2,\end{aligned} $$(24)

D ̂ d ( ) = 500 e ( + 1 ) σ 2 ( 10 ) μ K 2 , $$ \begin{aligned}&\hat{D}_{\mathrm{d} }(\ell ) = 500\, \mathrm{e} ^{-\ell (\ell +1)\sigma ^2(10^\prime )} \,\upmu \mathrm{K} ^2, \end{aligned} $$(25)

where σ 2 ( θ FWHM )(8ln2)π/(18060) θ FWHM 2 $ \sigma^2(\theta_{\mathrm{FWHM}})\equiv (8\ln 2)\, \pi/(180\cdot60)\theta_{\mathrm{FWHM}}^2 $ is the standard deviation of a Gaussian distribution expressed in terms of a full width at half maximum, θFWHM, in arcminutes.

We note that for the CMB component, the inverse variance is set to zero, which is equivalent to an infinite variance – or simply no prior at all. For the polarized synchrotron and thermal dust amplitudes, the priors correspond to Gaussian smoothing of 30′ and 10′ FWHM, respectively. The overall scaling factors are chosen to be 200 and 500 μK2, respectively, which correspond to the observed angular power spectrum of each component on large angular scales. In these cases, the priors are used to apodize the resulting foreground amplitude maps with Gaussian smoothing kernels to avoid ringing around bright sources and the Galactic plane and unphysical degeneracies at high multipoles.

Future analyses may proceed to use S to directly estimate the angular power spectrum of each foreground component, fully analogous to the CMB case (Colombo et al. 2023; Paradiso et al. 2023). This will then both alleviate the need of specifying the prior parameters by hand before executing the analysis, and it will ensure optimal smoothing properties for each component, resulting in minimal high- degeneracies between the various components.

Finally, we conclude this section by emphasizing that the above priors are, in fact, only priors and not deterministic smoothing operators. Thus, the resulting synchrotron and thermal dust component maps will be determined by the properties of the observed data wherever the data are stronger than the prior. This is important to bear in mind for instance when trying to estimate the angular power spectrum of the resulting maps; the component maps, a, that result from solving Eq. (20) correspond to a model of the sky without instrumental beam convolution, but with spatially varying noise properties, depending on the local S/N of the data. The angular resolutions of the synchrotron and thermal dust maps are not given precisely by a Gaussian beam of 30 and 10′, respectively, but will generally be higher where the data are sufficiently strong. As such, the behavior of these foreground maps is conceptually similar to the GNILC algorithm (Remazeilles et al. 2011), in which a spatially varying angular resolution also results from S/N variations. Likewise, it is also important to note that the noise properties of these maps are highly non-trivial, and the only statistically robust way of assessing and propagating their uncertainties is through the ensemble of sky map samples produced by the algorithm itself.

4.3. Spectral parameter sampling

The second of the two main conditional distributions discussed in this paper is P(β ∣ d, ω \ β), which describes the foreground SED parameters. In general, a and β are strongly correlated for a given component, especially for high S/N data. For temperature-oriented foreground analysis, the BEYONDPLANCK Gibbs sampler therefore implements a special-purpose sampling step for these parameters, by exploiting the definition of a conditional distribution, P(a, β ∣ d) = P(a ∣ d, β)P(β ∣ d) (Stivoli et al. 2010; BeyondPlanck Collaboration 2023; Andersen et al. 2023). That is, we first sampled spectral parameters from the marginal distribution with respect to a, and then sampled a conditionally with respect to β. The resulting algorithm is thus effectively an independence sampler in {a, β} (i.e., a sampler where the new proposal does not depend on the previous) with an internally vanishing Markov chain correlation length. In this case, any long-term Markov chain correlations come from degeneracies with other parameters.

However, as described by Stivoli et al. (2010), this algorithm is only computationally practical for observations with the same angular resolution, as the computational expense for evaluating the marginal distribution P(β ∣ d) otherwise becomes prohibitively high. In practice, this means that all data maps must be smoothed to a common angular resolution. For temperature data, this is not a major problem, but for polarization analysis it is non-trivial, since the smoothing operation correlates the instrumental noise. In addition, the data combination used in the current paper involves low-resolution WMAP data with dense noise covariance matrices, and smoothing all data to this resolution is impractical.

Since we focus on polarization in this work, we instead adopted a standard Metropolis-within-Gibbs sampler for β. In this case, the appropriate conditional posterior distribution may be derived from Eq. (16) by noting that the instrumental noise is assumed to be Gaussian distributed with covariance matrix, Nν, and recalling that the amplitude is for the moment assumed to be perfectly known. Therefore,

P ( β d , a ) P ( d a , β ) P ( β ) [ ν exp ( 1 2 ( d ν A ν ( β ) a ) t N ν 1 ( d ν A ν ( β ) a ) ) ] P ( β ) , $$ \begin{aligned}&P(\beta \mid \boldsymbol{d}, \boldsymbol{a}) \propto P(\boldsymbol{d}\mid \boldsymbol{a}, \beta ) P(\beta )\nonumber \\&\quad \propto \Bigg [\prod _{\nu } \exp \left(-\frac{1}{2}\left({\boldsymbol{d}}_{\nu }-\mathbf{A }_{\nu }(\beta )\boldsymbol{a}\right)^t\mathbf N _{\nu }^{-1}\left({\boldsymbol{d}}_{\nu }-\mathbf{A }_{\nu }(\beta )\boldsymbol{a}\right)\right)\Bigg ]P(\beta ), \end{aligned} $$(26)

where d = {dν} is the set of all available frequency maps (which within the larger BEYONDPLANCK framework may be a specific set of frequency map sky samples), and P(β) is a user-defined prior, typically a Gaussian distribution with physically motivated mean and standard deviation.

We note that Eq. (26) is also written with a general vector notation and makes no reference to a specific vector basis for either a or β; it therefore allows the various data sets to be defined across different basis sets. The only requirement is that there must exist a well-defined mapping between β and the effective mixing matrix at each frequency, Aν(β). This generality

Algorithm 1Metropolis algorithm used for spectral index sampling.

is precisely what is needed to analyze multi-resolution observations jointly, for instance high-resolution LFI data together with low-resolution WMAP data.

As noted in Sect. 2.2.1, the S/N of the Planck and WMAP polarization data are modest, so we have defined broad regions for βs, as shown in Fig. 1. To sample these values, we employ a standard Metropolis algorithm, as outlined in Algorithm 1, using Eq. (26) as target distribution and a standard symmetric Gaussian proposal distribution, T(β(i) ∣ β(i − 1))∼N(β(i − 1), Cβ), where Cβ is a tunable proposal matrix. When evaluating Eq. (26), we first propose βs, i independently for each region, i, and then smooth the resulting map with a 10° FWHM Gaussian beam to suppress edge effects. This smoothed map is then used to evaluate the mixing matrix Aν at the appropriate resolution for each frequency channel.

To assess the goodness-of-fit locally across the sky, we then compute a χ2 map per pixel. Since WMAP is pixelized with Nside = 16, the χ2 map is too. The sum of this map, after application of an optional analysis mask, is used to evaluate the exponent in Eq. (26) and the corresponding Metropolis acceptance rate.

As summarized in Algorithm 1, this method has two free tunable parameters, the proposal matrix, Cβ, and the number of Metropolis steps per main Gibbs iteration, n. Before starting a full Gibbs analysis, we perform a tuning run using a diagonal Cβ and adjust the diagonal elements until the resulting accept rate is between 0.2 and 0.8. Once that happens, we run a longer chain with fixed Cβ and replace this matrix with the covariance matrix from the resulting samples. Finally, we run a third chain, computing the Markov autocorrelation length from the resulting chain, and setting n such that the empirical correlation between samples β(i) and β(i + n) is less than 0.1. These tuning steps are only performed once for each analysis setup, and files are stored on disk for subsequent runs.

4.4. Spectral index priors

The only missing part of the algorithm at this point is the specification of the priors. For the synchrotron and thermal dust spectral indices in the main analysis, Gaussian priors of βs = −3.3 ± 0.1 and βd = 1.56 ± 0.10, respectively. The former are motivated by the Planck LFI 2018 likelihood analysis, which finds a linear scaling factor of α = 0.058 ± 0.004 between 30 and 70 GHz. Accounting for the bandpasses of the respective channels (Planck Collaboration II 2020; Svalheim et al. 2023), the quoted scaling factor translates into a spectral index of βs = −3.3. The thermal dust spectral index prior is also informed by the official Planck analysis Planck Collaboration X (2016) and, in this case, the BEYONDPLANCK data sets do not add any useful independent information to complement the original work because of the absence of the high S/N HFI measurements.

Many alternative synchrotron priors were explored during the course of the project, varying both the mean and width. In general, these all led to more significant residuals than the final choice. Regarding the prior width, we note that σβ = 0.1 corresponds to only 1σ, and the full 3σ confidence interval therefore spans from −3.6 to −3.0, all of which will be covered during the full Gibbs run. As discussed by Herman et al. (2023), shallower indices than β = −3.0 are very difficult to accommodate with both LFI and WMAP without introducing an additional low-frequency dust correlated component, for instance polarized AME. Consequently, a broader prior width of σβ = 0.2, which permits synchrotron indices as shallow as βs = −2.7, leads to large foreground excesses that only can be accommodated by the CMB component within the current model. The result is an obvious foreground bias in the large-scale CMB extraction and a corresponding excess in estimates of the optical depth of reionization (Paradiso et al. 2023). For further discussions regarding different priors, see Sect. 4.2.2.

We treat the four regions differently in terms of synchrotron priors. While the Spur and Galactic plane regions are fitted using both the likelihood and prior as described above, we only include the prior for the high-Galactic and Galactic Center regions. This is because of very strong degeneracies with respect to instrumental systematic effects in these regions. Specifically, the Galactic Center region is particularly susceptible to bandpass mismatch effects, as the bandpass leakage corrections for the 30 GHz channel are of order unity in this region (Svalheim et al. 2023), while the High Latitude region is particularly sensitive to relative gain uncertainties and degeneracies with the CMB quadrupole (Gjerløw et al. 2023). To prevent potential residual systematic errors from contaminating these two regions, we only marginalize over the prior in these cases. Although it is clearly non-ideal, we consider this approach to be preferable to simply fixing the synchrotron index at some given value, as was done for most previous Planck polarization analyses, for instance, Planck Collaboration Int. XLVI (2016). Ultimately, this is a concession that the current data set are not sufficiently strong to uniquely and independently determine both synchrotron and CMB components without priors, and additional measurements from high sensitivity low-frequency experiments such as C-BASS (Jew et al. 2019) and QUIJOTE (Génova-Santos et al. 2015) will be extremely valuable in further improving the quality of the LFI and WMAP data sets in the future.

5. Results

5.1. Goodness-of-fit

Before turning our attention to the final astrophysical products, we assess the goodness-of-fit of the fitted sky model. Our first statistic is the total χ2 per pixel, summed over frequencies and Stokes Q and U parameters and averaged over Gibbs samples, as shown in Fig. 2. To aid in the visual interpretation, this map is plotted in the form ( χ 2 n d . o . f . ) / 2 n d . o . f . $ (\chi^2-n_{\mathrm{d.o.f.}})/\sqrt{2n_{\mathrm{d.o.f.}}} $, where nd.o.f. = 20 798 is an estimate of the total number of degrees of freedom (i.e., number of full-frequency data pixels summed over frequencies minus the number of fitted parameters.) within each Nside = 16 pixel. If the model performs as expected, this quantity should have zero mean and unit standard deviation. Overall, we see that our model appears to perform well at high Galactic latitudes5, while the Galactic plane shows clear excess χ2. This behavior is not surprising, as the Galactic Center is far more complex and difficult to model, both in terms of astrophysics and instrumental effects. We do note that the morphological structure of the excess χ2 appears to be correlated with Galactic emission, rather than instrumental effects.

thumbnail Fig. 2.

Normalized and reduced χ2 per pixel, summed over Stokes Q and U and averaged over all Gibbs samples.

Another useful statistic for evaluating the model goodness-of-fit is data-minus-model residual maps per frequency, rν = dν − sν. These residual maps provide a visual summary of remaining systematics on a band-to-band basis, which is useful when physically interpreting specific artefacts in the χ2 map. Figure 3 shows such residual maps for the Planck frequency bands, smoothed to 2deg FWHM to suppress white noise. Generally, these maps indicate an excellent model performance, with amplitudes of ≲3 μK. High Galactic latitudes appear consistent with noise, while small deviations are seen around the Galactic Center, with a morphology that may suggest residual bandpass-induced temperature-to-polarization leakage (Paradiso et al. 2023).

thumbnail Fig. 3.

Posterior mean total data-minus-model residual maps (rν = dν − sν) in Q and U for BEYONDPLANCK LFI 30 GHz (top left), 44 GHz (top right), 70 GHz (bottom left), and 353 GHz (bottom right). All maps are smoothed to a common angular resolution of 2° FWHM.

The 44 GHz channel exhibits the largest relative variations. This is expected from algorithmic arguments, since we have noted that this channel does not dominate the determination of any single fitted astrophysical component. In contrast, the 30 GHz channel dominates the synchrotron determination, the 70 GHz channel dominates the CMB determination, and the 353 GHz channel dominates the thermal dust determination. As such, any excess fluctuations in these channels will rather be interpreted as signal belonging to whichever foreground component that uses this as a reference band. For component separation purposes, it is clearly advantageous to have multiple frequency maps with comparable S/N per astrophysical component, as instrumental artefacts are then much easier to identify. Employing the full set of Planck frequency maps in a future analysis will improve these results, and provide many more internal cross-checks. However, we once again stress that the main aim of the current study is not to present a new state-of-the-art sky model, but rather to demonstrate the BEYONDPLANCK algorithm.

Figure 4 shows the remaining residual maps, namely, in the WMAP Ka, Q, and V bands. In all three cases, we see coherent large-scale structures that are clearly morphologically inconsistent with instrumental noise. These residual maps appear visually similar to the set of correction templates presented by Jarosik et al. (2007) that account for known transmission imbalance between the A- and B-sides of the WMAP instrument6, as shown in the second and fourth column of Fig. 4. In our study, we did not apply any explicit corrections for these templates; hence, they appear in the frequency residual maps. However, they are accounted for in the WMAP covariance matrices, and the corresponding modes are therefore appropriately down-weighed when fitting the astrophysical parameters with Eqs. (20) and (26). Transmission imbalance effects will therefore not bias any astrophysical results, but only result in larger uncertainties. At the same time, these residual maps clearly suggest how a future joint time-domain processing of WMAP and Planck will be able to constrain the WMAP transmission imbalance parameters with high precision. Once that happens, the corresponding spatial modes will no longer need to be algebraically projected out, as is effectively done now; however, they may rather be used for scientific inference in the future, on the same footing as any other mode. This work has already started and preliminary results are discussed by Watts et al. (2023).

thumbnail Fig. 4.

Posterior mean total data-minus-model residual maps (rν = dν − sν) in Q (first column) and U (third column) for all included WMAP bands in the two left-most columns, at Nside = 16, masked with the processing mask applied in the pipeline. The second and fourth columns show corresponding transmission imbalance template maps for one of the differencing assemblies (e.g., Q{1, 2}, V{1, 2}) per residual as derived by Jarosik et al. (2007); we note that these templates are each associated with an unknown scaling amplitude that may take either sign.

5.2. Amplitude results

We now turn our attention to the astrophysical results, and start with the posterior amplitude maps, a. First, Fig. 5 shows the thermal dust amplitude, plotted at an angular resolution of 10′ FWHM at Nside = 1024 for the polarization amplitude P = Q 2 + U 2 $ {P=\sqrt{Q^2+U^2}} $, as well as Q and U averaged over an ensemble of Gibbs samples. The right column shows the corresponding posterior distribution standard deviation per pixel, with values peaking around 3 μKRJ. As mentioned in the previous subsection, the thermal dust amplitude is for all practical purposes determined by the pre-computed HFI 353 GHz band in the BEYONDPLANCK processing and the LFI bands have little influence. As a result, the thermal dust standard deviation maps shown in Fig. 5 are essentially given deterministically by the input 353 GHz standard deviation map. These uncertainties are underestimated in the Galactic plane, where systematic effects must be significant. Overall, the polarized thermal dust amplitude map is in good agreement with previous results.

thumbnail Fig. 5.

Polarized thermal dust amplitude maps (left column) plotted in terms of the polarization amplitude P = Q 2 + U 2 $ P=\sqrt{Q^2+U^2} $ (top row) and the Stokes Q (middle row) and U parameters (bottom row). Corresponding posterior standard deviation maps (right column). All maps are evaluated at a reference frequency of 353 GHz in units of μKRJ and averaged over all available Gibbs samples. The effective angular resolution is 10′ FWHM.

Figure 6 shows corresponding results for the polarized synchrotron amplitude map. At high Galactic latitudes, we see that the standard deviation of this component traces the LFI 30 GHz scanning strategy, and is correspondingly largely determined by instrumental white noise. However, in this case, the Galactic plane is in fact dominated by temperature-to-polarization leakage due to bandpass and gain uncertainties, resulting in a morphology that matches the 30 GHz intensity map.

thumbnail Fig. 6.

Polarized synchrotron amplitude maps (left column) plotted in terms of the polarization amplitude P = Q 2 + U 2 $ P=\sqrt{Q^2+U^2} $ (top row) and the Stokes Q (middle row) and U parameters (bottom row). Corresponding posterior standard deviation maps (right column). All maps are evaluated at a reference frequency of 30 GHz in units of μKRJ and averaged over all available Gibbs samples. The effective angular resolution is 1° FWHM.

In Fig. 7, we show Stokes parameter maps of the difference between the BEYONDPLANCK map of polarized synchrotron amplitude and two independent synchrotron tracers. The top panel shows differences with respect to the full WMAP K-band frequency map at 23 GHz. To account for its different effective frequency, we scale the K-band map according to a power-law model with βs = −3.1, or, explicitly, by a factor of 0.38. The two data sets appear to agree reasonably well, with a certain degree of diffuse large scale structure. Considering that the effective frequency difference between K-band and 30 GHz is only 7 GHz, spatial variations in the synchrotron spectral index are unlikely to be relevant, as a difference of Δβs = 0.02 only translates into approximately 0.5 μK in the relevant regions.

thumbnail Fig. 7.

Difference between the BEYONDPLANCK polarized synchrotron amplitude and the raw WMAP K-band map (Bennett et al. 2013), with the latter scaled to 30 GHz assuming a spectral index of βs = −3.1 (top panel). Similar difference between the BEYONDPLANCK and Planck DR4 (Planck Collaboration Int. LVII 2020) synchrotron amplitude maps (bottom panel). Left and right columns show Stokes Q and U parameters respectively, and all maps are smoothed to a common angular resolution of 3° FWHM.

The bottom panel of Fig. 7 shows similar difference maps with respect to the polarized synchrotron amplitude map derived from Planck DR4 (Planck Collaboration Int. LVII 2020). In this case, the high-latitude residuals are dominated by the Planck scanning strategy, with an overall morphology that closely matches the LFI gain residual template produced by Planck Collaboration II (2020) and discussed by Gjerløw et al. (2023) in a BEYONDPLANCK context. At the same time, no striking correlations are seen between the K-band and Planck DR4 residual maps, and the overall levels of variation in these two difference maps are comparable.

5.3. Spectral index results

5.3.1. Spectral index regions

Before presenting the synchrotron spectral index posterior distribution, we revisit the final choice of sampling regions and priors by considering a preliminary analysis configuration with nine disjoint regions rather than four, and a spectral index prior of βs = −2.8 ± 0.1, rather than βs = −3.3 ± 0.1. The results are summarized in Fig. 8, which shows βs as a function of iteration for a Gibbs chain that explored only the {a, β} sub-space. That is, all TOD-related parameters were kept fixed in this particular run, in order to highlight the conditional S/N of the foreground sector. Each region is labeled with its corresponding posterior mean and standard deviation after a burn-in period of 25 Gibbs samples, and accompanied by a color-coded region map for visualization purposes.

thumbnail Fig. 8.

Synchrotron spectral index as a function of Gibbs iteration for an analysis that includes nine disjoint regions (see inset sky map). The prior is, in this case, βs = −2.8 ± 0.1, and all regions are constrained using the full posterior distribution.

Starting with the most visually striking result, we see that the Galactic Center region immediately converges to a very low mean value of βs = −4.15 ± 0.05, well outside the range of previously reported values. Of course, we already know that this region is associated with high χ2 values (see Fig. 2), and in particular shows clear evidence of temperature-to-polarization leakage when compared to WMAP and Planck DR4 (see Fig. 7). Rather than letting these systematic errors potentially contaminate other parameters through an unphysical synchrotron spectral index fit, we instead assign the spectral index of this region a physically meaningful value, as defined by the prior. However, we do not fix it, but rather draw a new value from the prior in every sample, and thereby marginalize over the prior. Technically speaking, this is done by omitting the likelihood term in Eq. (26) when computing the Metropolis acceptance probability.

Next, we see that the Gum Nebula region also converges to a notably low value, and this is most likely related to the same χ2 excess that is seen for the Galactic Center. Furthermore, we note that the Gum Nebula region, as outlined in Fig. 1, exhibits very little synchrotron signal and is therefore particularly prone to residual systematic bias when applying weak prior constraints.

Most of the remaining regions fluctuate around values that are at least nominally consistent with previous analyses reported in the literature. We do note, however, that the two Southern Hemisphere regions return values that are high, at βs = −2.70 ± 0.07 and βs = −2.88 ± 0.07, respectively, while most of the remaining ones lie around βs ≈ −3.1. Thus, even when measured conditionally with respect to the TOD parameters, there is slight evidence for a positive bias of Δβs ≈ 0.1 or more with respect to the Northern Hemisphere. Returning once again to Fig. 1, we see that all high-latitude regions exhibit very a low S/N, as the instrumental noise is comparable to, or dominates over, the synchrotron amplitude in most pixels. These regions are therefore all particularly susceptible to residual systematic errors or prior volume effects, or both (e.g., Dunkley et al. 2009b). Indeed, when sampling jointly over the full range of time-ordered systematic corrections, we find that these regions converge to βs ≳ −2.5. As in the case of the Galactic Center, we do not take this as evidence for a truly flatter spectral index in these regions, but rather as an indication that there are low-level residual systematics present in BEYONDPLANCK, WMAP, or Planck DR4 353 GHz (or possibly all of them) at a level that is sufficient to bias the spectral index in the faintest regions.

We conclude that dividing into nine sky regions is sub-optimal (not to mention, 24 regions, which was the starting point of the analysis; see Sect. 2.2.1), as most regions do not have sufficient S/N to independently constrain βs, so they are highly susceptible to residual systematic uncertainties. In particular, small changes in the absolute calibration can introduce confusing CMB dipole leakage at high latitudes, while bandpass variations can cause problems near the Galactic Center.

We therefore reduced the number of disjoint regions and combined the four high Galactic latitude regions into one, and then we merged all the regions along the galactic plane, except for the Galactic center. Even in this minimal configuration, the high Galactic latitude region is not well constrained (and would be even less so with high latitudes subdivided into northern and southern hemispheres, which was observed to be the case in a separate test); thus, we sampled this from the prior alone (as we did for the Galactic Center). This leaves only two regions (the Galactic Spur and the Galactic Plane) to be sampled properly with the full posterior distribution, and, fortunately, both of these appear to be both signal-dominated and stable with respect to instrumental parameter variations.

5.3.2. Synchrotron spectral index results

We are now finally ready to present the main results of this paper, namely, the constraints on the synchrotron spectral index. Starting with the individual Gibbs samples, Fig. 9 shows trace-plots for each of the four regions. The first half shows the first Gibbs chain and the second half shows the second Gibbs chain, both after discarding ten samples for burn-in. Overall, we see that the correlation length is modest, namely, in 30–50 samples. The chains mix well, and appear at least visually to be statistically stationary; it is not easy to identify the point at which the two chains are joined, which typically is the case if there are long-term drifts. For completeness, Fig. 10 shows posterior mean and standard deviation sky maps evaluated from these chains.

thumbnail Fig. 9.

Synchrotron spectral index as a function of Gibbs iteration binned using the final BEYONDPLANCK analysis configuration with four disjoint regions and a prior of βs = −3.3 ± 0.1. Dotted lines indicate regions that are sampled exclusively from the prior distribution, while solid lines indicate regions that are sampled from the full posterior distribution.

thumbnail Fig. 10.

Posterior mean (left panel) and standard deviation (right panel) maps of the spectral index of polarized synchrotron emission. We note that a prior of βs = −3.3 ± 0.1 is applied to all four regions, but only the Galactic Spur and Galactic Plane regions are constrained with data through the likelihood. See Sect. 5.3.2 for further discussion.

We also see that the Spur and Galactic Plane regions, which are the only two regions for which βs is actually fitted, have shallower mean spectral indices than the prior of βs = −3.3. This may seem somewhat paradoxical, since it is then natural to ask why the prior was not set to β = −3.1. As discussed earlier, the reasons are two-fold. Firstly, the High Latitude region actually does appear to prefer a steeper spectral index, as indicated by the Planck likelihood analysis (Planck Collaboration V 2020), AME constraints (Herman et al. 2023), and our own preliminary studies. At the same time, this region is also both the most important region for CMB purposes, and it is prior-dominated. It is therefore particularly important that the prior works well for this region. Secondly, the prior only has a very mild effect on the two signal-dominated regions anyway, precisely because of their higher statistical weight. This is explicitly demonstrated in Fig. 11, which compares conditional distributions for the Spur region with two different priors centered on β = −3.3 (blue) and −2.8 (red), respectively. (In this case, the TOD parameters are kept fixed, and the P(a, β|d, ωTOD) distribution is explored for one arbitrarily chosen realization of ωTOD.) Here we see that shifting the prior mean by as much as Δpβ = 0.5 only affects the posterior by Δβ ≲ 0.1. For a more realistic possible prior shift of Δpβ = 0.2, the final posterior shifts will be Δβ ≲ 0.03, which is small compared to the overall variations seen in Fig. 9. In short, the Spur and Galactic plane regions are signal-dominated, and the prior is of limited importance.

thumbnail Fig. 11.

Normalized histogram of synchrotron spectral index (βs) for the Spur region using two different priors. The solid lines show the marginal distribution of spectral index values without TOD sampling using a prior (dotted lines) of βs = −2.8 (red), and βs = −3.3 (blue).

Figure 12 compares the posterior distributions for these two regions. For these, we find posterior mean and standard deviations of β s Spur = 3.17 ± 0.06 $ {\beta_{\mathrm{s}}^{\mathrm{Spur}}=-3.17\pm0.06} $ and β s Plane = 3.03 ± 0.07 $ {\beta_{\mathrm{s}}^{\mathrm{Plane}}=-3.03\pm0.07} $. Both of these values are consistent with earlier constraints in the literature, suggesting a steepening of the spectral index from low to high latitudes (e.g., Fuskeland et al. 2021; Krachmalnicoff et al. 2018). Similarly, Dunkley et al. (2009a) report a variation of Δβs = 0.08 between low and high Galactic latitudes using the WMAP data, while we find a variation of Δβs = 0.14 between the Galactic plane and the Spur using both WMAP and LFI data. The steepening is however only statistically significant at the 2σ level as determined in the current analysis.

thumbnail Fig. 12.

Normalized histogram of synchrotron spectral index (βs) for the Spur and Plane regions over the 500 ensemble Gibbs samples, with the corresponding prior, P(βs).

Next, to illustrate the importance of marginalization over TOD parameters, Fig. 13 compares the full marginal posterior distribution (thick blue histogram) with a similar posterior distribution that fixes the TOD parameters at one arbitrary Gibbs sample (thin blue histogram). Thus, the former marginalizes over the full BEYONDPLANCK data model, while the latter only marginalizes over foreground parameters and white noise. The relative widths of the two distributions clearly demonstrate the importance of accounting uncertainties in the full parameter sets and, correspondingly, the advantage of joint global parameter estimation as well.

thumbnail Fig. 13.

Normalized histogram of synchrotron spectral index (βs) for the Spur region using a prior of βs = −3.3. The bold line shows the full BEYONDPLANCK posterior distribution including TOD sampling and the thin line shows the corresponding posterior distribution when conditioning on TOD parameters.

As an additional validation of these results, we replaced the three BEYONDPLANCK-processed LFI frequency map samples with the corresponding preprocessed Planck DR4 maps (Planck Collaboration Int. LVII 2020). In this case, we find spectral indices of β s Spur = 3.20 ± 0.06 $ {\beta_{\mathrm{s}}^{\mathrm{Spur}}=-3.20\pm0.06} $ and β s Plane = 3.06 ± 0.06 $ {\beta_{\mathrm{s}}^{\mathrm{Plane}}=-3.06\pm0.06} $, respectively, which are individually statistically consistent with the BEYONDPLANCK results at the 0.5σ level.

5.3.3. Thermal dust spectral index

For polarized thermal dust emission, we fit only one power-law index, βd, across the full sky, while fixing the dust temperature on the latest Planck estimate (Planck Collaboration Int. LVII 2020). This is exclusively due to a limited S/N and not a statement regarding the complexity of the true sky. In this case, we adopt a prior of βd = 1.56 ± 0.10, motivated by the most recent Planck HFI results (Planck Collaboration IV 2020; Planck Collaboration Int. LVII 2020).

The resulting posterior distribution is shown in Fig. 14, which may be reasonably approximated as a Gaussian with βd = 1.62 ± 0.04. This mean value is thus slightly steeper than expected based on HFI, with a statistical significance of about 1.5σ. Furthermore, the uncertainty is significantly smaller than the prior width, which suggests that the result is indeed data-driven, even when marginalizing over the full BEYONDPLANCK instrument model.

thumbnail Fig. 14.

Normalized histogram of thermal dust spectral index (βd) over the 500 ensemble Gibbs samples, with corresponding prior.

While we caution against over-interpreting the significance of this result, we do note that a possible spectral steepening in the thermal dust SED around 100 GHz would have dramatic consequences for future high-sensitivity B-mode experiments. Thus, understanding whether this result is due to a statistical fluke, instrumental modeling errors (e.g., because of an overly simplistic bandpass correction model), or actual astrophysics is an important goal for future analysis. In this respect, including Planck HFI frequencies between 100 and 217 GHz will certainly prove informative in a future analysis.

6. Summary and conclusions

The two main goals of this paper are to introduce a Bayesian sampling algorithm for polarized CMB foreground models as embedded within the end-to-end BEYONDPLANCK framework, as well as to present the first results from this pipeline as applied to the Planck LFI data set. This is the first time a joint global parametric model that accounts for both instrumental and astrophysical parameters has been fit to the Planck LFI data within a single joint posterior distribution, allowing for seamless end-to-end error propagation. This is also the first time a joint analysis of the Planck LFI and WMAP data has resulted in physically meaningful spectral indices for both polarized synchrotron and thermal dust emission. This analysis thus paves the way for future analyses that would be expected to integrate and analyze more data sets.

Indeed, we stress that the current analysis configuration has specifically been designed to demonstrate the properties and performance of the algorithm itself, not to derive a new best-fit sky model. Specifically, critically important data sets, such as WMAP K-band and Planck HFI, have been intentionally omitted, precisely because of their high S/N; if they had been included, the results would have been dominated by WMAP and HFI methodology. Including these data sets, along with other important ones such as C-BASS (Jew et al. 2019) or QUIJOTE (Génova-Santos et al. 2015) will be done in future work, either by members within the current BEYONDPLANCK team or by external researchers, and either by starting from time-ordered or from pre-pixelized sky maps. In many respects, the current analysis configuration may quite possibly represent one of the most difficult challenges that the BEYONDPLANCK pipeline will face, since it is the least constrained. Future analyses will always have access to more data and the resulting sky models will therefore be less degenerate.

With that important caveat in mind, the most particularly notable highlights from the current analysis include the following:

  1. We constrain the spectral index of polarized synchrotron emission, βs, in two large and disjoint regions of the sky, covering the Galactic Spur and the Galactic Plane, with best-fit values of β s Spur = 3.17 ± 0.06 $ \beta_{\mathrm{s}}^{\mathrm{Spur}}=-3.17\pm 0.06 $ and β s Plane = 3.03 ± 0.07 $ \beta_{\mathrm{s}}^{\mathrm{Plane}}=-3.03\pm 0.07 $, respectively. These results are statistically consistent with previous WMAP-only results (Dunkley et al. 2009b). The current analysis finds some evidence for spatial variation between spur and plane in βs, but only statistically significant at the 2σ level. At the same time, we note that the high Galactic latitude region has a S/N that is too low to support any robust conclusions regarding βs, while the Galactic Center region exhibits too strong residual systematic effects.

  2. We constrain the spectral index of thermal dust emission between 30 and 70 GHz to βd = 1.62 ± 0.04, which is somewhat steeper than that previously reported by Planck HFI (Planck Collaboration IV 2020; Planck Collaboration Int. LVII 2020) of βd ≈ 1.56, but still statistically consistent.

  3. Through a joint analysis of the Planck and WMAP data, we have been able to isolate and highlight the effect of transmission imbalance in the WMAP observations. This strongly suggests that a future joint analysis of Planck and WMAP data in the time-domain will be able to constrain the WMAP transmission imbalance parameters to a high accuracy. This work has already started, as discussed by Watts et al. (2023).

Figure 15 provides an overview of the main polarized microwave components in the frequency range from 10 to 1000 GHz, as described by the posterior BEYONDPLANCK results and our assumed sky model with the addition of spinning dust with a polarization fraction of 1% (Génova-Santos et al. 2017; Herman et al. 2023). Here, each component is represented in terms of the standard deviation of the polarization amplitude evaluated over 88% (top edge of each band) and 27% (bottom edge of each band) of the sky, with all WMAP and Planck frequency bands marked as vertical columns. To illustrate the importance of detailed foreground modeling and error propagation, the predicted levels of CMB BB power for tensor-to-scalar ratios of r = 10−2 and r = 10−4 are marked by diffuse gray regions. In order to achieve a significant measurement of this signal, exquisite control over both foreground contamination and systematic effects and their interplay will be essential. We believe that the analysis framework presented in this paper and a suite of companion papers can play an important role in this work, by providing a common and statistically well-defined analysis platform for past, current, and future CMB experiments.

thumbnail Fig. 15.

Polarization amplitude root-mean-square (RMS) as a function of frequency and astrophysical component and polarized emission. Each component map is smoothed to a common angular resolution of 1° FWHM, and the lower and upper edges of each band are defined by the RMS outside masks covering 27% and 88% of the sky, respectively. An artificial broadening has been applied to the low-amplitude tails of the foreground spectra to visually indicate uncertainties in regions of low S/N for the relevant components. The EE CMB spectrum is generated from ideal CMB simulations based on the best-fit PlanckΛCDM model, while the BB limits are derived with tensor-to-scalar ratios of r = 10−2 and 10−4, respectively. Additionally, we visualize an upper limit on the polarization fraction of spinning dust of 1% as has been reported in literature (Génova-Santos et al. 2017; Macellari et al. 2011; Herman et al. 2023).


2

Note that the BEYONDPLANCK data combination does not include the WMAP K-band sky, as this channel would otherwise compete with Planck LFI 30 GHz in terms of total S/N, and we are chiefly interested in the impact of the LFI data in this study.

3

The W-band channel is omitted because it is known to be contaminated by systematic residuals; see Bennett et al. (2013), Watts et al. (2023) for details.

5

We note that it is very difficult to compute nd.o.f. rigorously due to the presence of active priors, since each prior-constrained model parameter only contributes with a fraction of a degree-of-freedom. As such, the important feature of Fig. 2 is its spatial structure, not the absolute zero-level. The uniform and slightly negative bias at high Galactic latitudes is thus simply an indication that nd.o.f. is very slightly over-estimated; if it were due to over-estimating the white noise rms level, which is the only other possible explanation for a χ2 deficit, the Planck scanning strategy would have been visually apparent.

6

Note that Jarosik et al. (2007) provide two correction templates per WMAP differencing assembly, and both Q- and V-bands are therefore associated with four templates each. Only one of these are shown in Fig. 4 for intuition purposes; the other templates look qualitatively similar.

Acknowledgments

We thank Prof. Pedro Ferreira and Dr. Charles Lawrence for useful suggestions, comments and discussions. We also thank the entire PLANCK and WMAP teams for invaluable support and discussions, and for their dedicated efforts through several decades without which this work would not be possible. The current work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement numbers 776282 (COMPET-4; BEYONDPLANCK), 772253 (ERC; BITS2COSMOLOGY), and 819478 (ERC; COSMOGLOBE). In addition, the collaboration acknowledges support from ESA; ASI and INAF (Italy); NASA and DoE (USA); Tekes, Academy of Finland (grant no. 295113), CSC, and Magnus Ehrnrooth foundation (Finland); RCN (Norway; grant nos. 263011, 274990); and PRACE (EU).

References

  1. Andersen, K. J., Herman, D., Aurlien, R., et al. 2023, A&A, 675, A13 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Beck, R., Anderson, J., Heald, G., et al. 2013, Astron. Nach., 334, 548 [NASA ADS] [CrossRef] [Google Scholar]
  3. Bennett, C. L., Larson, D., Weiland, J. L., et al. 2013, ApJS, 208, 20 [Google Scholar]
  4. BeyondPlanck Collaboration (Andersen, K. J., et al.) 2023, A&A, 675, A1 (BeyondPlanck SI) [Google Scholar]
  5. Cardoso, J., Martin, M., Delabrouille, J., Betoule, M., & Patanchon, G. 2008, IEEE J. Selected Topics Signal Proces., 2, 735 [NASA ADS] [CrossRef] [Google Scholar]
  6. Colombo, L. P. L., Eskilt, J. R., Paradiso, S., et al. 2023, A&A, 675, A11 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  7. Davies, R. D., Dickinson, C., Banday, A. J., et al. 2006, MNRAS, 370, 1125 [NASA ADS] [CrossRef] [Google Scholar]
  8. Dunkley, J., Komatsu, E., Nolta, M. R., et al. 2009a, ApJS, 180, 306 [NASA ADS] [CrossRef] [Google Scholar]
  9. Dunkley, J., Spergel, D. N., Komatsu, E., et al. 2009b, ApJ, 701, 1804 [NASA ADS] [CrossRef] [Google Scholar]
  10. Eriksen, H. K., O’Dwyer, I. J., Jewell, J. B., et al. 2004, ApJS, 155, 227 [Google Scholar]
  11. Eriksen, H. K., Jewell, J. B., Dickinson, C., et al. 2008, ApJ, 676, 10 [Google Scholar]
  12. Finkbeiner, D. P., Davis, M., & Schlegel, D. J. 1999, ApJ, 524, 867 [NASA ADS] [CrossRef] [Google Scholar]
  13. Fixsen, D. J. 2009, ApJ, 707, 916 [Google Scholar]
  14. Fuskeland, U., Wehus, I. K., Eriksen, H. K., & Næss, S. K. 2014, ApJ, 790, 104 [Google Scholar]
  15. Fuskeland, U., Andersen, K. J., Aurlien, R., et al. 2021, A&A, 646, A69 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Galloway, M., Andersen, K. J., Aurlien, R., et al. 2023, A&A, 675, A3 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Geman, S., & Geman, D. 1984, IEEE Trans. Pattern Anal. Mach. Intell., 6, 721 [CrossRef] [Google Scholar]
  18. Génova-Santos, R., Rubiño-Martín, J. A., Rebolo, R., et al. 2015, MNRAS, 452, 4169 [CrossRef] [Google Scholar]
  19. Génova-Santos, R., Rubiño-Martín, J. A., Peláez-Santos, A., et al. 2017, MNRAS, 464, 4107 [Google Scholar]
  20. Gjerløw, E., Ihle, H. T., Galeotta, S., et al. 2023, A&A, 675, A7 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Gold, B., Bennett, C. L., Hill, R. S., et al. 2009, ApJS, 180, 265 [NASA ADS] [CrossRef] [Google Scholar]
  22. Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 [Google Scholar]
  23. Guillet, V., Fanciullo, L., Verstraete, L., et al. 2018, A&A, 610, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  24. Haslam, C. G. T., Salter, C. J., Stoffel, H., & Wilson, W. E. 1982, A&AS, 47, 1 [NASA ADS] [Google Scholar]
  25. Hensley, B. S., & Draine, B. T. 2020, ApJ, 895, 38 [NASA ADS] [CrossRef] [Google Scholar]
  26. Herman, D., Hensley, B., Andersen, K. J., et al. 2023, A&A, 675, A15 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Ihle, H. T., Bersanelli, M., Franceschet, C., et al. 2023, A&A, 675, A6 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Jarosik, N., Barnes, C., Greason, M. R., et al. 2007, ApJS, 170, 263 [NASA ADS] [CrossRef] [Google Scholar]
  29. Jew, L., Taylor, A. C., Jones, M. E., et al. 2019, MNRAS, 490, 2958 [NASA ADS] [CrossRef] [Google Scholar]
  30. Jewell, J., Levin, S., & Anderson, C. H. 2004, ApJ, 609, 1 [Google Scholar]
  31. Kamionkowski, M., & Kovetz, E. D. 2016, ARA&A, 54, 227 [NASA ADS] [CrossRef] [Google Scholar]
  32. Kogut, A. 2012, ApJ, 753, 110 [Google Scholar]
  33. Krachmalnicoff, N., Carretti, E., Baccigalupi, C., et al. 2018, A&A, 618, A166 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  34. Lawson, K. D., Mayer, C. J., Osborne, J. L., & Parkinson, M. L. 1987, MNRAS, 225, 307 [Google Scholar]
  35. Macellari, N., Pierpaoli, E., Dickinson, C., & Vaillancourt, J. E. 2011, MNRAS, 418, 888 [NASA ADS] [CrossRef] [Google Scholar]
  36. Marinucci, D., Pietrobon, D., Balbi, A., et al. 2008, MNRAS, 383, 539 [NASA ADS] [CrossRef] [Google Scholar]
  37. Mather, J. C., Cheng, E. S., Cottingham, D. A., et al. 1994, ApJ, 420, 439 [Google Scholar]
  38. Meisner, A. M., & Finkbeiner, D. P. 2015, ApJ, 798, 88 [Google Scholar]
  39. Orlando, E., Johannesson, G., Moskalenko, I. V., Porter, T. A., & Strong, A. 2018, Nucl. Part. Phys. Proc., 297, 129 [CrossRef] [Google Scholar]
  40. Paradiso, S., Colombo, L. P. L., Andersen, K. J., et al. 2023, A&A, 675, A12 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Penzias, A. A., & Wilson, R. W. 1965, ApJ, 142, 419 [CrossRef] [Google Scholar]
  42. Planck Collaboration I. 2014, A&A, 571, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Planck Collaboration I. 2016, A&A, 594, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Planck Collaboration X. 2016, A&A, 594, A10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  45. Planck Collaboration I. 2020, A&A, 641, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  46. Planck Collaboration II. 2020, A&A, 641, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Planck Collaboration III. 2020, A&A, 641, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Planck Collaboration IV. 2020, A&A, 641, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  49. Planck Collaboration V. 2020, A&A, 641, A5 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  50. Planck Collaboration XI. 2020, A&A, 641, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Planck Collaboration Int. XLVI. 2016, A&A, 596, A107 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Planck Collaboration Int. LVII. 2020, A&A, 643, A42 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Platania, P., Bensadoun, M., Bersanelli, M., et al. 1998, ApJ, 505, 473 [NASA ADS] [CrossRef] [Google Scholar]
  54. Platania, P., Burigana, C., Maino, D., et al. 2003, A&A, 410, 847 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  55. Reich, P., & Reich, W. 1988, A&AS, 74, 7 [NASA ADS] [Google Scholar]
  56. Remazeilles, M., Delabrouille, J., & Cardoso, J.-F. 2011, MNRAS, 418, 467 [Google Scholar]
  57. Seljebotn, D. S., Bærland, T., Eriksen, H. K., Mardal, K. A., & Wehus, I. K. 2019, A&A, 627, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  58. Shewchuk, J. R. 1994, An Introduction to the Conjugate Gradient Method Without the Agonizing Pain (Edition 1/14) [Google Scholar]
  59. Stivoli, F., Grain, J., Leach, S. M., et al. 2010, MNRAS, 408, 2319 [CrossRef] [Google Scholar]
  60. Svalheim, T. L., Zonca, A., Andersen, K. J., et al. 2023, A&A, 675, A9 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  61. Tassis, K., & Pavlidou, V. 2015, MNRAS, 451, L90 [NASA ADS] [CrossRef] [Google Scholar]
  62. Tristram, M., Banday, A. J., Górski, K. M., et al. 2021, Phys. Rev. D, 105, 083524 [Google Scholar]
  63. Vidal, M., Dickinson, C., Davies, R. D., & Leahy, J. P. 2015, MNRAS, 452, 656 [NASA ADS] [CrossRef] [Google Scholar]
  64. Wandelt, B. D., Larson, D. L., & Lakshminarayanan, A. 2004, Phys. Rev. D, 70, 083511 [Google Scholar]
  65. Watts, D. J., Galloway, M., Ihle, H. T., et al. 2023, A&A, 675, A16 (BeyondPlanck SI) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  66. Zaldarriaga, M., & Seljak, U. 1997, Phys. Rev. D, 55, 1830 [Google Scholar]

All Tables

Table 1.

Overview of all included data bands in the BEYONDPLANCK polarization analysis.

All Figures

thumbnail Fig. 1.

Top: Final region map used in the sampling procedure for the synchrotron spectral index. Middle: Planck 2018 polarized synchrotron amplitude map with the corresponding spectral index processing mask for BEYONDPLANCK and a region outline that divides the sky into six disjoint regions. This was further reduced to four, motivated primarily by the low S/N apparent in some of the regions. The mask is chosen to reduce temperature-to-polarization leakage bias, and to exclude bright point sources. Bottom: Planck 2018 polarized thermal dust amplitude with the corresponding processing mask.

In the text
thumbnail Fig. 2.

Normalized and reduced χ2 per pixel, summed over Stokes Q and U and averaged over all Gibbs samples.

In the text
thumbnail Fig. 3.

Posterior mean total data-minus-model residual maps (rν = dν − sν) in Q and U for BEYONDPLANCK LFI 30 GHz (top left), 44 GHz (top right), 70 GHz (bottom left), and 353 GHz (bottom right). All maps are smoothed to a common angular resolution of 2° FWHM.

In the text
thumbnail Fig. 4.

Posterior mean total data-minus-model residual maps (rν = dν − sν) in Q (first column) and U (third column) for all included WMAP bands in the two left-most columns, at Nside = 16, masked with the processing mask applied in the pipeline. The second and fourth columns show corresponding transmission imbalance template maps for one of the differencing assemblies (e.g., Q{1, 2}, V{1, 2}) per residual as derived by Jarosik et al. (2007); we note that these templates are each associated with an unknown scaling amplitude that may take either sign.

In the text
thumbnail Fig. 5.

Polarized thermal dust amplitude maps (left column) plotted in terms of the polarization amplitude P = Q 2 + U 2 $ P=\sqrt{Q^2+U^2} $ (top row) and the Stokes Q (middle row) and U parameters (bottom row). Corresponding posterior standard deviation maps (right column). All maps are evaluated at a reference frequency of 353 GHz in units of μKRJ and averaged over all available Gibbs samples. The effective angular resolution is 10′ FWHM.

In the text
thumbnail Fig. 6.

Polarized synchrotron amplitude maps (left column) plotted in terms of the polarization amplitude P = Q 2 + U 2 $ P=\sqrt{Q^2+U^2} $ (top row) and the Stokes Q (middle row) and U parameters (bottom row). Corresponding posterior standard deviation maps (right column). All maps are evaluated at a reference frequency of 30 GHz in units of μKRJ and averaged over all available Gibbs samples. The effective angular resolution is 1° FWHM.

In the text
thumbnail Fig. 7.

Difference between the BEYONDPLANCK polarized synchrotron amplitude and the raw WMAP K-band map (Bennett et al. 2013), with the latter scaled to 30 GHz assuming a spectral index of βs = −3.1 (top panel). Similar difference between the BEYONDPLANCK and Planck DR4 (Planck Collaboration Int. LVII 2020) synchrotron amplitude maps (bottom panel). Left and right columns show Stokes Q and U parameters respectively, and all maps are smoothed to a common angular resolution of 3° FWHM.

In the text
thumbnail Fig. 8.

Synchrotron spectral index as a function of Gibbs iteration for an analysis that includes nine disjoint regions (see inset sky map). The prior is, in this case, βs = −2.8 ± 0.1, and all regions are constrained using the full posterior distribution.

In the text
thumbnail Fig. 9.

Synchrotron spectral index as a function of Gibbs iteration binned using the final BEYONDPLANCK analysis configuration with four disjoint regions and a prior of βs = −3.3 ± 0.1. Dotted lines indicate regions that are sampled exclusively from the prior distribution, while solid lines indicate regions that are sampled from the full posterior distribution.

In the text
thumbnail Fig. 10.

Posterior mean (left panel) and standard deviation (right panel) maps of the spectral index of polarized synchrotron emission. We note that a prior of βs = −3.3 ± 0.1 is applied to all four regions, but only the Galactic Spur and Galactic Plane regions are constrained with data through the likelihood. See Sect. 5.3.2 for further discussion.

In the text
thumbnail Fig. 11.

Normalized histogram of synchrotron spectral index (βs) for the Spur region using two different priors. The solid lines show the marginal distribution of spectral index values without TOD sampling using a prior (dotted lines) of βs = −2.8 (red), and βs = −3.3 (blue).

In the text
thumbnail Fig. 12.

Normalized histogram of synchrotron spectral index (βs) for the Spur and Plane regions over the 500 ensemble Gibbs samples, with the corresponding prior, P(βs).

In the text
thumbnail Fig. 13.

Normalized histogram of synchrotron spectral index (βs) for the Spur region using a prior of βs = −3.3. The bold line shows the full BEYONDPLANCK posterior distribution including TOD sampling and the thin line shows the corresponding posterior distribution when conditioning on TOD parameters.

In the text
thumbnail Fig. 14.

Normalized histogram of thermal dust spectral index (βd) over the 500 ensemble Gibbs samples, with corresponding prior.

In the text
thumbnail Fig. 15.

Polarization amplitude root-mean-square (RMS) as a function of frequency and astrophysical component and polarized emission. Each component map is smoothed to a common angular resolution of 1° FWHM, and the lower and upper edges of each band are defined by the RMS outside masks covering 27% and 88% of the sky, respectively. An artificial broadening has been applied to the low-amplitude tails of the foreground spectra to visually indicate uncertainties in regions of low S/N for the relevant components. The EE CMB spectrum is generated from ideal CMB simulations based on the best-fit PlanckΛCDM model, while the BB limits are derived with tensor-to-scalar ratios of r = 10−2 and 10−4, respectively. Additionally, we visualize an upper limit on the polarization fraction of spinning dust of 1% as has been reported in literature (Génova-Santos et al. 2017; Macellari et al. 2011; Herman et al. 2023).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.