A new approach for the statistical denoising of Planck interstellar dust polarization data

Bruno Regaldo-Saint Blancard; Erwan Allys; François Boulanger; François Levrier; Niall Jeffrey

doi:10.1051/0004-6361/202140503

Home

All issues

Volume 649 (May 2021)

A&A, 649 (2021) L18

Full HTML

Open Access

Issue		A&A Volume 649, May 2021


Article Number		L18
Number of page(s)		9
Section		Letters to the Editor
DOI		https://doi.org/10.1051/0004-6361/202140503
Published online		01 June 2021

A&A 649, L18 (2021)

Letter to the Editor

A new approach for the statistical denoising of Planck interstellar dust polarization data^⋆

Bruno Regaldo-Saint Blancard¹^,2, Erwan Allys¹, François Boulanger¹, François Levrier¹ and Niall Jeffrey¹^,3

¹ Laboratoire de Physique de l’École Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, 75005 Paris, France
e-mail: bruno.regaldo@phys.ens.fr
² Observatoire de Paris, PSL University, Sorbonne Université, LERMA, 75014 Paris, France
³ Department of Physics & Astronomy, University College London, Gower Street, London WC1E 6BT, UK

Received: 5 February 2021
Accepted: 13 May 2021

Abstract

Dust emission is the main foreground for cosmic microwave background polarization. Its statistical characterization must be derived from the analysis of observational data because the precision required for a reliable component separation is far greater than what is currently achievable with physical models of the turbulent magnetized interstellar medium. This Letter takes a significant step toward this goal by proposing a method that retrieves non-Gaussian statistical characteristics of dust emission from noisy Planck polarization observations at 353 GHz. We devised a statistical denoising method based on wavelet phase harmonics (WPH) statistics, which characterize the coherent structures in non-Gaussian random fields and define a generative model of the data. The method was validated on mock data combining a dust map from a magnetohydrodynamic simulation and Planck noise maps. The denoised map reproduces the true power spectrum down to scales where the noise power is an order of magnitude larger than that of the signal. It remains highly correlated to the true emission and retrieves some of its non-Gaussian properties. Applied to Planck data, the method provides a new approach to building a generative model of dust polarization that will characterize the full complexity of the dust emission. We also release PyWPH, a public Python package, to perform GPU-accelerated WPH analyses on images.

Key words: dust, extinction / polarization / methods: statistical / cosmic background radiation

^⋆

The public Python package PyWPH is available at https://github.com/bregaldo/pywph.

© B. Regaldo-Saint Blancard et al. 2021

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

The quest of primordial B-modes in the polarization of the cosmic microwave background (CMB) faces a major challenge, namely the accurate characterization of one of its main foregrounds: the polarized emission of interstellar dust from our Galaxy (BICEP2/Keck and Planck Collaborations 2015). To rightfully claim any detection of the cosmological signal, one has to fully take into account the complexity of the Galactic contribution to the sky emission (Hensley & Bull 2018; Clark & Hensley 2019; Pelgrims et al. 2020; Planck Collaboration XI 2020). The statistical characterization of Galactic polarized foregrounds is also important to extract the CMB lensing potential from sky observations (Beck et al. 2020).

The Galactic contribution is sourced by the thermal emission of nonspherical dust grains in the diffuse interstellar medium (ISM). The mixture of dust and gas is described as a turbulent magnetized fluid, and the grains tend to align statistically with their long axes perpendicular to the local magnetic field, thus polarizing the emission (Planck Collaboration XII 2020). The physical and chemical processes regulating and structuring the diffuse ISM are nonlinearly coupled (Draine 2011), leading to the emergence of strong scale couplings, which may be evidenced by the highly non-Gaussian statistics observed for many tracers (Burkhart et al. 2009).

Capturing non-Gaussianity is therefore essential to characterize the dust signal. To tackle this issue, several authors have introduced machine learning algorithms that need to be trained (Aylor et al. 2020; Krachmalnicoff & Puglisi 2021; Petroff et al. 2020; Thorne et al. 2021). Another approach, which does not involve any learning steps, uses statistics defined from nonlinear transforms, namely the wavelet scattering transform and the wavelet phase harmonics (WPH). These describe the couplings between scales in non-Gaussian fields, and they have been applied to ISM data (Allys et al. 2019; Regaldo-Saint Blancard et al. 2020; Saydjari et al. 2021) as well to study the large-scale structure of the Universe (Allys et al. 2020; Cheng et al. 2020). They have been successfully used to characterize simulated and observational data with a high signal-to-noise ratio (S/N), and to build generative models for these data. However, these approaches face difficulties with noisy data, which is the case of Planck polarization observations. In this letter, we introduce a statistical denoising method using the WPH statistics to retrieve the statistical properties of the noise-free dust emission.

In Sect. 2 we present our statistical denoising method and show how well it performs on noisy simulated data of the polarized emission from dust. In Sect. 3, we apply it to the Chamaeleon-Musca 353 GHz polarized emission, as observed by the Planck satellite, and discuss our results. In Sect. 4 we conclude and suggest a few perspectives for follow-up studies. This Letter also includes two appendices. Appendix A introduces the WPH statistics and the denoising procedure in detail. Appendix B presents a first comparison with other methods. We also provide a public Python package to perform GPU-accelerated WPH analyses on images called PyWPH¹.

2. Method and validation

2.1. Description of the method

We observe a noisy map d that is modeled as follows:

$\begin{matrix} d = s + n, \end{matrix}$ $\begin{aligned} \boldsymbol{d} = \boldsymbol{s} + \boldsymbol{n}, \end{aligned}$ (1)

with s being the target truth map and n being an additive noise signal. The noise n is an unknown realization of a random field N. We assume that we know N, meaning that we are able to generate as many independent realizations of it as needed. We call {n₁, …, n_M} M of these.

We introduce a method to retrieve the statistical properties of s while denoising d. This statistical denoising consists in iteratively deforming d to build a denoised map $\tilde{s}$ $\tilde{\boldsymbol{s}}$ such that the ${\tilde{s} + n_{i}}_{i}$ $\{\tilde{\boldsymbol{s}} + {\boldsymbol{n}_i}\}_i$ maps and the d map become “close enough” in a given statistical space. It employs the possibility introduced in Zhang & Mallat (2021) to build a generative model from WPH statistics, that is, the ability to generate new random realizations based on statistical constraints. In the following, we call ϕ the operator that computes a set of summary statistics from a given map. The algorithm consists in minimizing the following loss function:

$\begin{matrix} L_{M} (u) = \frac{1}{M} \sum_{i = 1}^{M} | | ϕ (u + n_{i}) - ϕ (d) | |^{2}, \end{matrix}$ $\begin{aligned} \mathcal{L} _M(\boldsymbol{u}) = \frac{1}{M}\sum _{i = 1}^M\left||\phi \left(\boldsymbol{u} + \boldsymbol{n}_i\right) - \phi \left(\boldsymbol{d}\right)\right||^2, \end{aligned}$ (2)

where ∥ ⋅ ∥ denotes the Euclidean norm. We choose u₀ = d to initialize the optimizer. The denoised map $\tilde{s}$ $\tilde{\boldsymbol{s}}$ corresponds to an approximate minimum obtained by performing this optimization in pixel space, using an LBFGS optimizer (Byrd et al. 1995). We note that a limitation of this algorithm relies on the (ideal) assumption that we know N. In practice, any non-modeled statistical property of the noise will be considered to be part of the signal s.

The choice of the operator ϕ is obviously paramount to the quality of the method and must be tailored to the properties of s and n. In the context of ISM polarization data, we expect s to be relatively regular and to exhibit non-Gaussian signatures due to the interactions between scales (e.g., filamentary structures), while the noise is expected to be highly irregular and close to a (possibly spatially-varying) Gaussian white noise.

In this work, the ϕ operator computes WPH statistics. These statistics are designed to characterize the coherent structures that appear in non-Gaussian random fields, by quantifying the phase alignment between different scales (Zhang & Mallat 2021; Mallat et al. 2020). Their ability to define a generative model has been studied in particular by Allys et al. (2020), who demonstrate quantitatively that they include most of the information captured by various other statistics, such as the power spectrum, the bispectrum, or the Minkowski functionals. These statistics were applied to real-valued data. Here we extend them to complex-valued polarization maps Q + iU. Statistics derived from this complex variable efficiently characterize the full complexity of the polarization signal (Regaldo-Saint Blancard et al. 2020). We refer readers to Appendix A for a presentation of the WPH statistics and a detailed description of the denoising procedure.

2.2. Validation on a simulation

In this section, we assess the performance of our denoising algorithm by applying it to mock data d = s + n, emulating a noisy Q + iUPlanck polarization signal. We show figures of merit based on power spectra and probability distribution functions (PDFs) of the increments of the maps. These are only shown for Q, but they are similar for U. We postpone figures of merit based on other kinds of statistics, including the WPH statistics, to a future study.

Here s is a simulated Q + iU map, representing a typical linear polarization signal produced by the thermal emission of dust in the diffuse ISM. It was built from the same magnetohydrodynamic simulation as the one used in Regaldo-Saint Blancard et al. (2020) and following the same procedure. We refer to this Letter for more details on the construction of this simulated map. We simulated a Planck observation of dust polarization using Planck instrumental noise maps introduced in Sect. 3 (Planck Collaboration III 2020). We have a total of 300 realizations of this noise at our disposal. We picked one of these, called n, to build d, and we used the remaining M = 299 noise maps, labeled {n₁, …, n_M}, for the denoising algorithm.

We define the S/N of d as the ratio of the standard deviations of ℜ(s) and ℜ(n) (in the following, the real part ℜ of a complex Q + iU map refers to its Q map). We adjusted this S/N by scaling s, so that the impact of the noise “resembles” that on the Planck map presented in Sect. 3. This was not straightforward since the power spectrum of the simulated map s has a different slope than that of the estimate of the power spectrum of the dust emission from the Planck map (see Figs. 2 and 5). We decided to adjust the S/N so that the scale k_i, at which the power spectra of n and s intersect, coincides with the one from the Planck map. Figures 2 and 5 show that k_i/(2π)≈0.8 px⁻¹. This procedure leads to S/N ≈ 0.4 for the simulated map, which is more than two times lower than the S/N of the observational map discussed in Sect. 3.

We show in Fig. 1 (top) the simulated Q maps corresponding to the truth s (left), the noise n (middle), and the resulting noisy map d (right). The map ℜ(s) exhibits coherent structures such as filaments and large smooth regions that are characteristic of its non-Gaussian statistical properties. On the other hand, ℜ(n) seems close to an inhomogeneous white Gaussian noise. The spatial inhomogeneity appears as variations of the local standard deviation due to the Planck satellite scanning strategy. Finally, in ℜ(d), coherent structures at intermediate and small scales are hard or impossible to identify.

Fig. 1.

Top row: simulated Q maps corresponding to the truth s (left), the noise n (middle), and the resulting noisy map d (right). Bottom row: denoised simulated Q map $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ (left) next to the difference noisy-denoised $R (d - \tilde{s})$ $\Re(\boldsymbol{d}-\tilde{\boldsymbol{s}})$ (middle) and denoising error $R (s - \tilde{s})$ $\Re(\boldsymbol{s}-\tilde{\boldsymbol{s}})$ (right). Units are arbitrary, but kept consistent between the maps.

We applied our denoising method to the simulated noisy map d and show, in Fig. 1 (bottom), the resulting denoised Q map $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ (left) next to the difference between the noisy and denoised maps $R (d - \tilde{s})$ $\Re(\boldsymbol{d} - \tilde{\boldsymbol{s}})$ (middle) and the denoising error $R (s - \tilde{s})$ $\Re(\boldsymbol{s}-\tilde{\boldsymbol{s}})$ (right). The map $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ shows that the noise level has been drastically reduced and that we were able to recover the filamentary structure down to a minimum scale. We can identify the smooth regions of ℜ(s) even if there still remains a visible noise. The similarities between $R (d - \tilde{s})$ $\Re(\boldsymbol{d} - \tilde{\boldsymbol{s}})$ and ℜ(n) are striking. The local variations of the standard deviation of the noise are clearly recovered, demonstrating that the inhomogeneity of the noise is not an issue for our method. Finally, $R (s - \tilde{s})$ $\Re(\boldsymbol{s} - \tilde{\boldsymbol{s}})$ exhibits some remaining structures that match the thinnest filaments appearing in ℜ(s), on top of a more diffuse background. This indicates that down to a minimum scale, below which the algorithm struggles to recover features, most of the structures are efficiently reconstructed.

Figure 2 compares the power spectra of the six maps shown in Fig. 1 plus the cross-spectrum between the denoised and truth maps. These power spectra were estimated computing the mean square moduli of the Fourier components of the maps binned on linearly spaced isotropic wavenumber bins, and the uncertainties correspond to the standard deviation of the mean. The cross-spectrum was computed similarly except for the bins above k/(2π) = 0.14 px⁻¹ that were logarithmically spaced in order to lower the statistical variance². We first point out that the power spectrum of d coincides with the sum of those of s and n because of the statistical independence between s and n. The power spectra of $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ and ℜ(s) are in very good agreement with each other up to 0.18 px⁻¹, at which scale the noise power is ten times that of the signal. At smaller scales, where the noise dominates the signal even more, our algorithm is not able to retrieve the true power spectrum but gets closer to it. The power spectrum of $R (d - \tilde{s})$ $\Re(\boldsymbol{d} - \tilde{\boldsymbol{s}})$ coincides with that of ℜ(n) at small scales, and this agreement progressively worsens toward larger scales. This large-scale behavior shows that our algorithm does not remove the already negligible noise, as shown by the superposition of the power spectrum of $R (s - \tilde{s})$ $\Re(\boldsymbol{s}-\tilde{\boldsymbol{s}})$ and that of ℜ(n) at these scales. The cross-spectrum $\tilde{s} \times s$ $\tilde{\boldsymbol{s}}\times\boldsymbol{s}$ is slightly below the power spectrum of s at intermediate scales and this discrepancy increases toward the smallest scales. At intermediate scales, where the power spectrum of $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ matches that of ℜ(s), we suspect this discrepancy to stem from differences between the phases of the Fourier components of s and $\tilde{s}$ $\tilde{\boldsymbol{s}}$ that would deserve a further quantification. Nevertheless, the production of a denoised map whose power spectrum coincides with that of s, even though n is ten times more powerful than s, and that retains a significant correlation with s is a striking success of our method.

Fig. 2.

Q maps power spectra for d, s, $\tilde{s}$ $\tilde{\boldsymbol{s}}$ , n (all in solid lines), $d - \tilde{s}$ $\boldsymbol{d}-\tilde{\boldsymbol{s}}$ , and $s - \tilde{s}$ $\boldsymbol{s}-\tilde{\boldsymbol{s}}$ , and cross-spectrum between the Q maps of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ and s (all in dashed lines). Bottom panel: ratio of the power spectra of d and $\tilde{s}$ $\tilde{\boldsymbol{s}}$ with that of s.

To better characterize the non-Gaussianity of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ , we computed the PDFs of the increments of Q for the noisy, denoised, and truth maps, and we plot them in Fig. 3 for three scalar lags. The increment δQ_l(r) for a scalar lag l at a position r is the set of differences δQ_l(r) = Q(r)−Q(r + l) with l ≤ |l|< l + 1 in pixel units. These statistics are usually computed on velocity maps for the tails of the PDFs that characterize the intermittent dissipation of turbulence in the ISM (Hily-Blant et al. 2008; Hily-Blant & Falgarone 2009). Contrary to the case of the noisy map, the distributions of increments for ℜ(s) are far from Gaussian for every lag. This is a clear signature of the non-Gaussianity of the data as we expect Gaussian-distributed increments for homogeneous Gaussian data. Our method recovers these statistics and their non-Gaussian tails with limited distortion for each lag, demonstrating its efficiency in retrieving non-Gaussianity in the data. A more thorough characterization of this non-Gaussianity has been postponed for further studies.

Fig. 3.

PDFs of the increments of Q for d (noisy), s (truth), and $\tilde{s}$ $\tilde{\boldsymbol{s}}$ (denoised), computed for three logarithmically spaced lags going from 2 to 32 pixels. For each lag, the increments were normalized by the standard deviation σ_l of the Gaussian that fits the core of the PDF of the noisy map. Bottom panels: ratio of the PDFs of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ with those of s for each lag.

3. Application to Planck polarization data

3.1. Presentation of the data

We now apply our denoising method to a Q + iU polarization map of the Chamaeleon-Musca region observed at 353 GHz with the Planck satellite (PR3 data³, Planck Collaboration III 2020). At this frequency, the CMB is negligible and the dominant components are the dust emission and the noise (see Fig. 34 of Planck Collaboration IV 2020). We consider the Q and U maps corresponding to the full mission, and those corresponding to the two half-missions, as well as the 300 end-to-end simulated Q and U maps of the noise and the systematics of the instrument for the full mission (the ones used in Sect. 2). We projected all of these maps on 512 × 512 grids with a pixel size of 2.35′, centered on the region of the Chamaeleon-Musca clouds at Galactic coordinates (l, b) = (300.26°;, − 16.77°;). We used a Gnomonic projection through the HEALPix/healpy⁴ package (Górski et al. 2005; Zonca et al. 2019).

In Fig. 4, we show the projected full-mission Q map that we aim to denoise (left), as well as the denoised map discussed in the next section (middle). The same maps for U are given in Fig. B.2.

Fig. 4.

Noisy (left) and denoised (middle) Q maps of the Chamaeleon-Musca region as observed by the Planck satellite at 353 GHz. The noisy maps were denoised as described in Sect. 2.1. We also show the corresponding GNILC map (right) for reference.

3.2. Denoising results

We applied the denoising method presented in Sect. 2 to the full mission Q + iU map, using the corresponding 300 noises. Figures 4 and B.2 (middle) show the resulting denoised Q and U maps, respectively. The overall noise level has been clearly mitigated although subtle residuals of the patterns due to the scanning strategy remain, and we can now discern a more complex layout of structures even in the regions where the signal is weak.

In Fig. 5, we show a power spectrum analysis of the denoising of the Q map, comparing the power spectra of the noisy and denoised maps ℜ(d) and $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ , of one noise map ℜ(n), and of the difference between the noisy and denoised maps $R (d - \tilde{s})$ $\Re(\boldsymbol{d} - \tilde{\boldsymbol{s}})$ , as well as the cross-spectrum between the two half-missions maps. This cross-spectrum gives an estimate for the power spectrum of the truth signal since the noise signals are independent. It is satisfactory to see that the power spectrum of $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ is consistent with this cross-spectrum for scales down to k/(2π)∼3.4 deg⁻¹. Also, $R (d - \tilde{s})$ $\Re(\boldsymbol{d} - \tilde{\boldsymbol{s}})$ behaves similarly to what we have observed in Sect. 2.2, with a power spectrum consistent with that of ℜ(n) when the noise has a larger power than the signal, and it falls below what is expected when the emission of the dust begins to dominate. Similar results are obtained for U.

Fig. 5.

Q maps power spectra for d, $\tilde{s}$ $\tilde{\boldsymbol{s}}$ , n (all in solid lines), and $d - \tilde{s}$ $\boldsymbol{d}-\tilde{\boldsymbol{s}}$ , and cross-spectrum between d₁ and d₂ (both in dashed lines). This cross-spectrum estimates the power spectrum of the dust emission. Also shown in a solid line is the power spectrum of the GNILC Q map, corresponding to the same Chamaeleon-Musca field.

Figure 6 compares the PDFs of the increments of Q for the noisy and denoised maps, and for the same three scalar lags as in Fig. 3. Based on the results of Sect. 2, we expect the PDFs of the denoised map to be good estimates for the statistics of the truth map. These results show important differences between the noisy and denoised maps that are again a signature of the non-Gaussianity of the truth signal. We find similar results for the increments of U.

Fig. 6.

PDFs of the Chamaeleon-Musca Q maps increments for d (noisy), $\tilde{s}$ $\tilde{\boldsymbol{s}}$ (denoised), and GNILC map, computed for three logarithmically spaced lags ranging from 4.7′ to 75.2′. For each lag, the increments were normalized by the standard deviation σ_l of the Gaussian that fits the core of the PDF of the noisy map.

The literature on image denoising is rich and abundant (see for instance Buades et al. 2010; Dabov et al. 2007; Zhang et al. 2017), and a thorough comparison of our method with other denoising algorithms would be beyond the scope of this Letter. Nevertheless, to give some elements of comparison, we compare our method in Appendix B to the following: (1) Wiener filtering and sampling methods, which are widely used in astrophysics; and (2) the GNILC method (Planck Collaboration Int. XLVIII 2016), which was used on Planck polarization data and provides a local smoothing kernel in order to optimally remove the noise. For the Wiener approach, we find that neither the Wiener filtered image nor realizations drawn from the Wiener posterior distribution are able to retrieve the PDFs of the increments, even when the true power spectrum was given as input. The comparison with GNILC shows that our method recovers the true power spectrum and the PDFs of the increments better.

4. Conclusion and prospects

We have introduced a new method for the denoising of Planck polarization data based on the generation of random synthetic maps from WPH statistics that characterize the non-Gaussianity of the dust emission. This method takes advantage of the strong statistical differences between the signal of interest(non-Gaussian and regular) and the noise (close to Gaussian and irregular) by performing an optimization that constrains the statistical properties of the denoised map plus noise realizations.

We applied our method to mock Q + iU noisy data designed to emulate typical Planck polarization maps of dust in the diffuse ISM. The denoised map has a power spectrum that coincides with the true power spectrum down to a minimum scale where the power of the noise is ten times that of the signal, while being highly correlated with the truth signal. It recovers the PDFs of the increments for various isotropic lags, demonstrating that our method is able to retrieve non-Gaussianity in the data. Finally, we applied this method to a 353 GHz Planck observation of the Chamaeleon-Musca field.

Our method has been introduced as a statistical denoising method, but it should be more generally applicable to component separation problems. In particular, we expect this method to be efficient at disentangling dust emission from the CMB, cosmic infrared background, and noise in Planck maps, provided that we have relevant models of each of these contaminants at our disposal. Therefore, it could hopefully enhance the scientific outcome of Planck data.

One of the main motivations of our work is to build a generative model of the dust polarization signal that takes into account the non-Gaussianity of the data. Such a model derived from the analysis of Planck data may be used to simulate the dust polarization sky. Our denoising algorithm constitutes a step toward this modeling, in the sense that WPH statistics of the dust emission, corrected to a first approximation from noise contamination, may be computed from the denoised map. A natural extension of this work would be to quantitatively assess how well we can recover the WPH statistics of the truth map with this approach. This will be the subject of a future paper.

¹

https://github.com/bregaldo/pywph

²

We recall that this cross-spectrum, called $CS [R (s), R (\tilde{s})]$ $\mathrm{CS}[\Re(\boldsymbol{s}), \Re(\tilde{\boldsymbol{s}})]$ , is related to the power spectra by the following relation: $PS [R (s - \tilde{s})] = PS [R (s)] + PS [R (\tilde{s})] - 2 CS [R (s), R (\tilde{s})]$ ${\mathrm{PS}[\Re(\boldsymbol{s}-\tilde{\boldsymbol{s}})] = \mathrm{PS}[\Re(\boldsymbol{s})] + \mathrm{PS}[\Re(\tilde{\boldsymbol{s}})] - 2\mathrm{CS}[\Re(\boldsymbol{s}), \Re(\tilde{\boldsymbol{s}})]}$ .

³

https://wiki.cosmos.esa.int/planck-legacy-archive/index.php/Main_Page

⁴

http://healpix.sourceforge.net

⁵

These moments do not depend on the r variable because of the homogeneity of X.

Acknowledgments

We thank the anonymous referee for a very helpful report. This work was undertaken in parallel to a similar project led by J.-M. Delouis on the full sky, and we acknowledge a fruitful interaction with his team. We gratefully acknowledge P. Lesaffre for helping us with the computation of the PDFs of the increments. We also acknowledge fruitful discussions with J.-F. Cardoso, S. Mallat, and T. Marchand. This research was supported by the Agence Nationale de la Recherche (project BxB: ANR-17-CE31-0022). B. Regaldo-Saint Blancard acknowledges support from the Centre National d’Etudes Spatiales (CNES).

References

Allys, E., Levrier, F., Zhang, S., et al. 2019, A&A, 629, A115 [CrossRef] [EDP Sciences] [Google Scholar]
Allys, E., Marchand, T., Cardoso, J. F., et al. 2020, Phys. Rev. D, 102, 103506 [CrossRef] [Google Scholar]
Alsing, J., Heavens, A., & Jaffe, A. H. 2017, MNRAS, 466, 3272 [NASA ADS] [CrossRef] [Google Scholar]
Aylor, K., Haq, M., Knox, L., Hezaveh, Y., & Perreault-Levasseur, L. 2020, MNRAS, 500, 3889 [Google Scholar]
Beck, D., Errard, J., & Stompor, R. 2020, JCAP, 2020, 030 [Google Scholar]
BICEP2/Keck and Planck Collaborations 2015, Phys. Rev. Lett., 114, 101301 [Google Scholar]
Buades, A., Coll, B., & Morel, J. M. 2010, SIAM Rev., 52, 113 [CrossRef] [Google Scholar]
Burkhart, B., Falceta-Gonçalves, D., Kowal, G., & Lazarian, A. 2009, ApJ, 693, 250 [NASA ADS] [CrossRef] [Google Scholar]
Byrd, R., Lu, P., Nocedal, J., & Zhu, C. 1995, SIAM J. Sci. Comput., 16, 1190 [Google Scholar]
Cheng, S., Ting, Y.-S., Ménard, B., & Bruna, J. 2020, MNRAS, 499, 5902 [CrossRef] [Google Scholar]
Clark, S. E., & Hensley, B. S. 2019, ApJ, 887, 136 [Google Scholar]
Dabov, K., Foi, A., Katkovnik, V., & Egiazarian, K. 2007, IEEE Trans. Image Process., 16, 2080 [Google Scholar]
Draine, B. T. 2011, Physics of the Interstellar and Intergalactic Medium (Princeton: Princeton University Press) [Google Scholar]
Elsner, F., & Wandelt, B. D. 2013, A&A, 549, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 [NASA ADS] [CrossRef] [Google Scholar]
Hensley, B. S., & Bull, P. 2018, ApJ, 853, 127 [Google Scholar]
Hily-Blant, P., & Falgarone, E. 2009, A&A, 500, L29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hily-Blant, P., Falgarone, E., & Pety, J. 2008, A&A, 481, 367 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Jasche, J., & Lavaux, G. 2015, MNRAS, 447, 1204 [NASA ADS] [CrossRef] [Google Scholar]
Jeffrey, N., Abdalla, F. B., Lahav, O., et al. 2018a, MNRAS, 479, 2871 [Google Scholar]
Jeffrey, N., Heavens, A. F., & Fortio, P. D. 2018b, Astron. Comput., 25, 230 [Google Scholar]
Kodi Ramanah, D., Lavaux, G., & Wandelt, B. D. 2019, MNRAS, 490, 947 [Google Scholar]
Krachmalnicoff, N., & Puglisi, G. 2021, ApJ, 911, 42 [Google Scholar]
Mallat, S., Zhang, S., & Rochette, G. 2020, Information and Inference: A Journal of the IMA, 9, 721 [Google Scholar]
Pelgrims, V., Ferrière, K., Boulanger, F., Lallement, R., & Montier, L. 2020, A&A, 636, A17 [EDP Sciences] [Google Scholar]
Petroff, M. A., Addison, G. E., Bennett, C. L., & Weiland, J. L. 2020, ApJ, 903, 104 [Google Scholar]
Planck Collaboration III. 2020, A&A, 641, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Planck Collaboration IV. 2020, A&A, 641, A4 [CrossRef] [EDP Sciences] [Google Scholar]
Planck Collaboration XI. 2020, A&A, 641, A11 [CrossRef] [EDP Sciences] [Google Scholar]
Planck Collaboration XII. 2020, A&A, 641, A12 [CrossRef] [EDP Sciences] [Google Scholar]
Planck Collaboration Int. XLVIII. 2016, A&A, 596, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Regaldo-Saint Blancard, B., Levrier, F., Allys, E., Bellomi, E., & Boulanger, F. 2020, A&A, 642, A217 [EDP Sciences] [Google Scholar]
Saydjari, A. K., Portillo, S. K. N., Slepian, Z., et al. 2021, ApJ, 910, 122 [Google Scholar]
Thorne, B., Knox, L., & Prabhu, K. 2021, MNRAS, 504, 2603 [Google Scholar]
Wandelt, B. D., Larson, D. L., & Lakshminarayanan, A. 2004, Phys. Rev. D, 70, 083511 [NASA ADS] [CrossRef] [Google Scholar]
Wiener, N. 1949, Extrapolation, Interpolation, and Smoothing of Stationary Time Series (Cambridge, MA: MIT Press), 7 [CrossRef] [Google Scholar]
Zaroubi, S., Hoffman, Y., Fisher, K. B., & Lahav, O. 1995, ApJ, 449, 446 [NASA ADS] [CrossRef] [Google Scholar]
Zhang, S., & Mallat, S. 2021, Appl. Comput. Harmonic Anal., 53, 199 [Google Scholar]
Zhang, K., Zuo, W., Chen, Y., Meng, D., & Zhang, L. 2017, IEEE Trans. Image Process., 26, 3142 [NASA ADS] [CrossRef] [Google Scholar]
Zonca, A., Singer, L., Lenz, D., et al. 2019, J. Open Sour. Softw., 4, 1298 [Google Scholar]

Appendix A: The WPH statistics

In this appendix, we introduce the WPH statistics used in this Letter, referring to Allys et al. (2020) and Mallat et al. (2020) for additional details. We also describe the detailed denoising procedure introduced in Sect. 2.1. We call X a given random field and x one of its realizations.

A.1. Bump-steerable wavelets

The WPH statistics rely on a bank of wavelets that are dilations and rotations of a mother bump-steerable wavelet ψ defined in Fourier space as follows (Mallat et al. 2020):

$\begin{matrix} \hat{ψ} (k) = & exp (\frac{- {(k - ξ_{0})}^{2}}{ξ_{0}^{2} - {(k - ξ_{0})}^{2}}) \cdot 1_{[0, 2 ξ_{0}]} (k) \\ \times {cos}^{L - 1} (arg (k)) \cdot 1_{[0, π / 2]} (| arg (k) |), \end{matrix}$ $\begin{aligned} \hat{\boldsymbol{\psi }} (\boldsymbol{k}) =&\exp \left(\frac{-(k - \xi _0)^2}{\xi _0^2 - (k-\xi _0)^2}\right) \cdot \mathbf 1 _{[0, 2\xi _0]}(k)\nonumber \\&\times \cos ^{L-1} (\arg (\boldsymbol{k})) \cdot \mathbf 1 _{[0, \pi /2]}(|\arg (\boldsymbol{k})|), \end{aligned}$ (A.1)

with k = ∥k∥, 1_A(x) being the indicator function that returns 1 if x ∈ A and 0 otherwise, and ξ₀ = 0.85π being the central wavenumber of the mother wavelet. The dilated and rotated wavelets are defined as follows:

$\begin{matrix} ψ_{ξ_{j, θ}} (r) = 2^{- j} ψ (2^{- j} {rot}_{- θ} [r]), \end{matrix}$ $\begin{aligned} \boldsymbol{\psi }_{\boldsymbol{\xi }_{j,\theta }}(\boldsymbol{r}) = 2^{-j}\boldsymbol{\psi }\left(2^{-j}\mathrm{rot}_{-\theta }[\boldsymbol{r}]\right), \end{aligned}$ (A.2)

with j being the index of the dilation by a factor 2^j and θ being the angle of rotation. We call ξ_j, θ = 2^−jξ₀u_θ with u_θ = cos θ u_x + sin θ u_y the central wavevector of the wavelet obtained by such a transform. In practice, we consider J dilation indices j ranging from 0 to J − 1, and we divide 2π into 2L evenly spaced rotation angles θ. The bank of wavelets is thus the set of 2JL wavelets ${ψ_{ξ_{j, θ}} : j = 0, \dots, J - 1, θ = 0, \frac{π}{L}, \dots, \frac{(2 L - 1) π}{L}}$ ${\{\boldsymbol{\psi}_{\boldsymbol{\xi}_{j,\theta}} : j=0, \dots, J - 1, \theta=0, \frac{\pi}{L}, \dots,\frac{(2L-1)\pi}{L}\}}$ , and these wavelets cover most of Fourier space with their respective band-passes.

In this Letter, we work with 512 × 512 maps and choose J = 8 and L = 8. We show in Fig. A.1 one example wavelet from the bank.

Fig. A.1.

Real part (left) and imaginary part (right) of a 512 × 512 bump-steerable wavelet ψ_6, π/4. The wavelet is centered in the middle of the map for a better visualization.

A.2. WPH moments

The WPH moments of X are covariances of the phase harmonics of the wavelet transform of X. With {ψ_ξ₁, …, ψ_{ξ_N}}, our bank of wavelets labeled by their central wavevectors ξ_i, the wavelet transform of X corresponds to the set {X ⋆ ψ_ξ₁, …, X ⋆ ψ_{ξ_N}}, where ⋆ denotes the convolution operation. These convolutions amount to a local band-pass filtering of X on the scales probed by each of the wavelets. For a homogeneous X, the WPH moments of X are as follows:

$\begin{matrix} C_{ξ_{i}, p_{i}, ξ_{j}, p_{j}} (τ) = Cov ({[X ⋆ ψ_{ξ_{i}} (r)]}^{p_{i}}, {[X ⋆ ψ_{ξ_{j}} (r + τ)]}^{p_{j}}), \end{matrix}$ $\begin{aligned} C_{\boldsymbol{\xi }_i,p_i,\boldsymbol{\xi }_j,p_j}(\boldsymbol{\tau }) = \mathrm{Cov} \left(\left[\boldsymbol{X} \star \boldsymbol{\psi }_{\boldsymbol{\xi }_i}(\boldsymbol{r})\right]^{p_i},\left[\boldsymbol{X} \star \boldsymbol{\psi }_{\boldsymbol{\xi }_j}(\boldsymbol{r} + \boldsymbol{\tau })\right]^{p_j}\right), \end{aligned}$ (A.3)

with z ↦ [z]^p = |z|⋅e^ip arg(z) being the phase harmonic operator⁵.

These WPH moments are able to capture interactions between different scales of X thanks to the phase harmonic operator. Indeed, the covariance between X ⋆ ψ_{ξ_i} and X ⋆ ψ_{ξ_j} vanishes when the wavelets ψ_{ξ_i} and ψ_{ξ_j} have nonintersecting band-passes, and it is otherwise a function of the power spectrum of X only. With proper p_i and p_j indices, the phase harmonic operator can make [X ⋆ ψ_{ξ_i}]^p_i and [X ⋆ ψ_{ξ_j}]^p_j comparable in the sense that they share common spatial frequencies, allowing an extraction of high-order information through their covariance.

We discretize the τ variable similarly to Allys et al. (2020). This introduces a grid of τ_n, α vectors for each wavelet ψ_j, θ, defined as

$\begin{matrix} τ_{n, α} = 3 n 2^{j} u_{θ + α}, \end{matrix}$ $\begin{aligned} \boldsymbol{\tau }_{n,\alpha } = 3n2^j\boldsymbol{u}_{\theta + \alpha }, \end{aligned}$ (A.4)

with n = 0, …, Δ_n and α ∈ { − π/4, 0, π/4, π/2}. To avoid a redundancy in the information contained in the coefficients, we discarded the translations for which n > min(J − 1 − j, Δ_n).

Allys et al. (2020) identified a relevant set of WPH moments to build a generative model of real-valued simulated data of the large-scale structure of the Universe, which are highly non-Gaussian at late times and at scales smaller than 100 h⁻¹ Mpc. In the present work, we have adopted the same corresponding WPH statistics and applied them similarly on complex-valued data, except for the scaling moments discussed below. In practice we estimate a set of moments that can be divided into the following five categories:

– the S^(1, 1) moments, of the form C_{ξ, 1, ξ, 1}, at every τ_n, α with Δ_n = 2. They capture the power spectrum information of the ξ band-pass.

– the S^(0, 0) moments, of the form C_{ξ, 0, ξ, 0}, at every τ_n, α with Δ_n = 2. They capture information related to the sparsity of the data in the ξ band-pass.

– the S^(0, 1) moments, of the form C_{ξ, 0, ξ, 1}, at τ = 0 only. They capture information related to the couplings between the scales of the same ξ band-pass.

– the C^(0, 1) moments, of the form C_{ξ₁, 0, ξ₂, 1}, considering 0 ≤ j₁ < j₂ ≤ J − 1 and $| θ_{1} - θ_{2} | \leq \frac{π}{2}$ ${\lvert\theta_1-\theta_2\rvert\leq \frac{\pi}{2}}$ , at every τ_n, α with Δ_n = 2 when θ₁ = θ₂ and at τ = 0 when θ₁ ≠ θ₂. They capture information related to the correlation between local levels of oscillation for the scales in the ξ₁ and ξ₂ band-passes.

– the C^phase moments, of the form C_{ξ₁, 1, ξ₂, p₂} with p₂ = ξ₁/ξ₂ > 1, considering 0 ≤ j₁ < j₂ ≤ J − 1 and θ₁ = θ₂, at every τ_n, α with Δ_n = 2. They capture information related to the statistical phase alignment of oscillations between the scales in the ξ₁ and ξ₂ band-passes.

We call ${\tilde{C}}_{ξ_{1}, p_{1}, ξ_{2}, p_{2}} (τ)$ $\tilde{C}_{\boldsymbol{\xi}_1,p_1,\boldsymbol{\xi}_2,p_2}(\boldsymbol{\tau})$ the normalized estimates of the WPH moments C_{ξ₁, p₁, ξ₂, p₂}(τ). In practice they are defined as follows for a map x:

$\begin{matrix} {\tilde{C}}_{ξ_{1}, p_{1}, ξ_{2}, p_{2}} (τ) = \frac{〈 x^{(ξ_{1}, p_{1})} (r) \bar{x^{(ξ_{2}, p_{2})}} (r + τ) 〉}{\sqrt{〈 | x_{0}^{(ξ_{1}, p_{1})} |^{2} 〉 〈 {| x_{0}^{(ξ_{2}, p_{2})} |}^{2} 〉}}, \end{matrix}$ $\begin{aligned} \tilde{C}_{\boldsymbol{\xi }_1,p_1,\boldsymbol{\xi }_2,p_2}(\boldsymbol{\tau }) = \frac{\big \langle \boldsymbol{x}^{(\boldsymbol{\xi }_1, p_1)}\left(\boldsymbol{r}\right)\overline{\boldsymbol{x}^{(\boldsymbol{\xi }_2, p_2)}}\left(\boldsymbol{r}+\boldsymbol{\tau }\right)\big \rangle }{\sqrt{\big \langle |{\boldsymbol{x}_0^{(\boldsymbol{\xi }_1, p_1)}}|^2\big \rangle \big \langle |{\boldsymbol{x}_0^{(\boldsymbol{\xi }_2, p_2)}}|^2\big \rangle }}, \end{aligned}$ (A.5)

where the brackets stand for a spatial mean on the r variable, the bar denotes the complex conjugate, and $x^{(ξ, p)} = {[x ⋆ ψ_{ξ}]}^{p} - 〈 [x_{0} ⋆ ψ_{ξ})]^{p} 〉$ ${\boldsymbol{x}^{(\boldsymbol{\xi},p)} = [\boldsymbol{x} \star \boldsymbol{\psi}_{\boldsymbol{\xi}}]^{p} - \big \langle[\boldsymbol{x}_0 \star \boldsymbol{\psi}_{\boldsymbol{\xi}})]^{p}\big \rangle}$ . These estimates depend on a reference map x₀. We found that the choice of x₀ can have a significant impact on the results of our denoising method as explained below.

A.3. Scaling moments

Similarly to Allys et al. (2020), we complement these statistics with a small number of coefficients, the estimates of the so-called scaling moments L_j, p that better constrain the largest scales that are not probed by the WPH moments. These moments were constructed from a bank of scaling functions {φ_j}_{0 ≤ j ≤ J − 1}, which correspond to dilations of an isotropic Gaussian filter φ defined in Fourier space by the following:

$\begin{matrix} \hat{φ} (k) = exp (- \frac{{| | k | |}^{2}}{2 σ^{2}}), \end{matrix}$ $\begin{aligned} \hat{\varphi }({\boldsymbol{k}}) = \exp \left(-\frac{||{\boldsymbol{k}}||^2}{2 \sigma ^2} \right), \end{aligned}$ (A.6)

with σ = 0.496 × 2^−0.55ξ₀. For a real-valued random field X, the scaling moments of X are as follows:

$\begin{matrix} L_{j, 0} = Cov [| X ⋆ φ_{j} |, | X ⋆ φ_{j} |], \end{matrix}$ $\begin{aligned}&L_{j,0} = \mathrm{Cov} \left[|\boldsymbol{X} \star \varphi _{j} |,|\boldsymbol{X} \star \varphi _j |\right],\end{aligned}$ (A.7)

$\begin{matrix} L_{j, p} = Cov [{(X ⋆ φ_{j})}^{p}, {(X ⋆ φ_{j})}^{p}] (for p > 0) . \end{matrix}$ $\begin{aligned}&L_{j,p} = \mathrm{Cov} \left[\left(\boldsymbol{X} \star \varphi _{j} \right)^{p},\left(\boldsymbol{X} \star \varphi _j \right)^{p} \right]\,(\mathrm{for}\,p>0). \end{aligned}$ (A.8)

In practice, we only estimated the scaling moments with p ∈ {0, 1, 2, 3} and 2 ≤ j ≤ J − 2. We define the corresponding normalized estimates computed for a map x by

$\begin{matrix} {\tilde{L}}_{j, p} = \frac{⟨ | x^{(j, p)} |^{2} ⟩}{⟨ | x_{0}^{(j, p)} |^{2} ⟩}, \end{matrix}$ $\begin{aligned} \tilde{L}_{j,p} = \frac{\langle |\boldsymbol{x}^{(j,p)}|^2 \rangle }{\langle |{\boldsymbol{x}}_0^{(j,p)}|^2 \rangle }, \end{aligned}$ (A.9)

with

$\begin{matrix} x^{(j, 0)} = | (x - ⟨ x_{0} ⟩) ⋆ φ_{j} | - ⟨ | (x - ⟨ x_{0} ⟩) ⋆ φ_{j} | ⟩, \end{matrix}$ $\begin{aligned}&\boldsymbol{x}^{(j,0)} = |\left(\boldsymbol{x} - \langle \boldsymbol{x}_0\rangle \right)\star \varphi _j |- \langle |\left(\boldsymbol{x} - \langle \boldsymbol{x}_0\rangle \right)\star \varphi _j |\rangle , \end{aligned}$ (A.10)

$\begin{matrix} x^{(j, p)} = {((x - ⟨ x_{0} ⟩) ⋆ φ_{j})}^{p} - ⟨ {((x - ⟨ x_{0} ⟩) ⋆ φ_{j})}^{p} ⟩ (for p > 0) . \end{matrix}$ $\begin{aligned}&\boldsymbol{x}^{(j,p)} = (\left(\boldsymbol{x} - \langle \boldsymbol{x}_0\rangle \right)\star \varphi _j )^p - \langle (\left(\boldsymbol{x} - \langle \boldsymbol{x}_0\rangle \right)\star \varphi _j )^p \rangle \,(\mathrm{for}\,p>0). \end{aligned}$ (A.11)

These normalized estimates depend on the same reference map x₀ introduced for the estimates of the WPH moments of X. As we deal with complex maps x in this work, we computed these estimates for their real and imaginary parts separately.

A.4. Denoising procedure

We applied our denoising method to a simulated noisy map d in two steps. We performed a first denoising of d using a ϕ operator that only takes into account the estimates of the S^(1, 1) moments, related to the power spectrum information, and the estimates of the scaling moments, and by normalizing the estimates choosing u₀ = d. This first step yields a map ${\tilde{s}}_{0}$ $\tilde{\boldsymbol{s}}_0$ that has a power spectrum that is much more consistent with the truth map s. Then, we performed a second denoising of d using a ϕ operator that includes the whole set of estimates of WPH moments and scaling moments, but here normalizing these estimates with $u_{0} = {\tilde{s}}_{0}$ $\boldsymbol{u}_0 = \tilde{\boldsymbol{s}}_0$ . We call $\tilde{s}$ $\tilde{\boldsymbol{s}}$ the output of this second step. This particular choice for the normalization of the estimates has proven to be more efficient at retrieving the power spectrum and the PDFs of the increments of the truth map s.

In Table A.1, we show the resulting number of statistical coefficients per class of moments computed for a complex 512 × 512 pixels map x and for the chosen parameters. In this Letter, the ϕ operator used for the second step of the procedure computes a set of 11 176 statistical coefficients (of which 40 correspond to the estimates of the scaling moments), which corresponds to approximately 4% of the number of pixels of the input map.

Table A.1.

Number of statistical coefficients per class of moments for the parameters of this Letter.

Appendix B: Comparison to other methods

B.1. Wiener filtering and posterior sampling

Wiener filtering is a typical approach to signal inference for problems of the form given by Eq. (1). In this case, the Wiener filtered reconstruction ${\tilde{s}}_{W}$ $\tilde{\mathbf{s}}_{\mathrm{W}}$ (Wiener 1949; Zaroubi et al. 1995) is given by

$\begin{matrix} {\tilde{s}}_{W} = W d = S (S + N)^{- 1} d, \end{matrix}$ $\begin{aligned} \tilde{\mathbf{s }}_{\rm W} = \mathbf W \mathbf d = \mathbf S \big (\mathbf S + \mathbf N \big )^{-1} \mathbf d , \end{aligned}$ (B.1)

where S = ⟨ss^†⟩ and N = ⟨nn^†⟩ are the signal and noise covariance matrices, respectively. For brevity we have assumed ⟨s⟩=⟨n⟩ = 0. This is the linear filter that gives the expected minimum variance of residuals (Wd − s) for known S and N.

Furthermore, it is both the maximum a posteriori and mean posterior solution if the noise and signal are drawn from Gaussian distributions. That is, if the real-valued data are distributed with a likelihood

$\begin{matrix} p (d | s, N) = \frac{1}{\sqrt[]{(\det 2 π N)}} \exp [- \frac{1}{2} {(d - s)}^{†} N^{- 1} (d - s)], \end{matrix}$ $\begin{aligned} p(\mathbf d | \mathbf s , \mathbf N ) = \frac{1}{\root \of {(\mathrm{det} 2 \pi \mathbf N )}} \mathrm{exp} \Big [ - \frac{1}{2} (\mathbf d - \mathbf s )^{\dagger } \mathbf N ^{-1} (\mathbf d - \mathbf s ) \Big ], \end{aligned}$ (B.2)

and the prior distribution for the real-valued signal is

$\begin{matrix} p (s | S) = \frac{1}{\sqrt[]{(\det 2 π S)}} exp [- \frac{1}{2} s^{†} S^{- 1} s], \end{matrix}$ $\begin{aligned} p(\mathbf s | \mathbf S ) = \frac{1}{\root \of {(\mathrm{det} 2 \pi \mathbf S )}} \exp \Big [ - \frac{1}{2} \mathbf s ^{\dagger } \mathbf S ^{-1} \mathbf s \Big ] \ , \end{aligned}$ (B.3)

then the Wiener posterior distribution is given by

$\begin{matrix} p (s | S, N, d) & \propto p (d | s, N) p (s | S) \\ \propto exp [- \frac{1}{2} {(s - W d)}^{†} (S^{- 1} + N^{- 1}) (s - W d)] . \end{matrix}$ $\begin{aligned} p(\mathbf s | \mathbf S , \mathbf N , \mathbf d )&\propto p(\mathbf d | \mathbf s , \mathbf N ) p(\mathbf s | \mathbf S ) \nonumber \\&\propto \exp \Big [- \frac{1}{2} (\mathbf s - \mathbf W \mathbf d )^{\dagger } (\mathbf S ^{-1} + \mathbf N ^{-1}) (\mathbf s - \mathbf W \mathbf d )\Big ]. \end{aligned}$ (B.4)

For the problem described in this work, the noise covariance N can be estimated from the samples of noise realizations and the signal covariance S can either be assumed or jointly estimated from the data (Wandelt et al. 2004).

We avoided inverting the dense W matrix by implementing a messenger field approach, first described by Elsner & Wandelt (2013) and now widely used in cosmology (e.g., Jasche & Lavaux 2015; Alsing et al. 2017; Jeffrey et al. 2018a,b; Kodi Ramanah et al. 2019). We assumed that the underlying signal is homogeneous and isotropic, such that the signal covariance S is diagonal in harmonic space with elements corresponding to the one-dimensional power spectrum. The uncorrelated noise gives a diagonal noise covariance N in pixel space, so that the messenger field can efficiently iterate between harmonic and pixel space.

As we are concerned with polarization data, we used a spin-2 harmonic transformation between {s_Q, s_U} and ${{\tilde{s}}_{E}, {\tilde{s}}_{B}}$ $\{\widetilde{\mathbf{s}}_E, \widetilde{\mathbf{s}}_B\}$ , using the curl-free E-mode and divergence-free B-mode representation. The signal covariance in harmonic space is therefore a concatenation of {S_E, S_B}. This formulation preserves the relevant Q − U correlation.

Even if the Wiener filtered reconstructed signal ${\tilde{s}}_{W}$ $\tilde{\mathbf{s}}_{\mathrm{W}}$ is the maximum of the posterior distribution p(s|d), functions $f ({\tilde{s}}_{W})$ $f(\tilde{\mathbf{s}}_{\mathrm{W}})$ do not correspond to the maximum p(f(s)|d). For example, the two-point statistics (e.g., variance or power spectra) of ${\tilde{s}}_{W}$ $\tilde{\mathbf{s}}_{\mathrm{W}}$ are not unbiased estimates of the power spectrum of s. Instead, we can draw sample images s_i from the posterior p(s|d) (Eq. (B.4)), so that the transformed samples f(s_i) are correctly distributed according to p(f(s)|d). These realizations are generated by amending the messenger field algorithm (Elsner & Wandelt 2013).

Figure B.1 shows the Wiener filtered reconstruction of the simulated Q map (left panel). The low amplitude power spectrum of the residual map $R (s - {\tilde{s}}_{W})$ $\Re(\boldsymbol{s} - \tilde{\boldsymbol{s}}_{\mathrm{W}})$ demonstrates that the pixel values are close to the truth, whereas the power spectrum of $R ({\tilde{s}}_{W})$ $\Re(\tilde{\boldsymbol{s}}_{\mathrm{W}})$ is biased low (center panel), which is as expected. To compare the methods, we compared with functions of samples s_i from the Wiener posterior. Though not plotted here, as we input the true power spectrum, which was not done for the WPH denoising method, the realizations s_i did have the correct power spectra with relatively small sample variance.

Fig. B.1.

Left: Wiener filtered map ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ . Middle: same as Fig. 2, but for ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ . The power spectrum of ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ is suppressed as expected. Right: same as Fig. 3, but for l = 2 px with the Wiener filter (mean) map and a Wiener posterior sample.

As a goal of this denoising work is to retain the statistical non-Gaussianity in the signal, which could be represented in the WPH coefficients, we again compared the PDF of increments δQ_l. The right panel of Fig. B.1 shows the comparison between the data ℜ(d), the signal ℜ(s), the Wiener filtered (mean) map $R ({\tilde{s}}_{W})$ $\Re(\tilde{\boldsymbol{s}}_{\mathrm{W}})$ , and a realization $R ({\tilde{s}}_{W, sample})$ $\Re(\tilde{\boldsymbol{s}}_{\mathrm{W,sample}})$ drawn from the Wiener posterior. We see that neither the Wiener mean map $R ({\tilde{s}}_{W})$ $\Re(\tilde{\boldsymbol{s}}_{\mathrm{W}})$ nor the sample $R ({\tilde{s}}_{W, sample})$ $\Re(\tilde{\boldsymbol{s}}_{\mathrm{W,sample}})$ capture the non-Gaussianity in terms of δQ_l increments as successfully as our WPH denoising method. Indeed, for |δQ_l|/σ_l ⪆ 1, there is a clear divergence from the true PDF, showing that the tails of the true statistics are not recovered at all.

Furthermore, inspection of the WPH coefficients of the Wiener posterior samples ϕ(s_i) shows them to be noticeably further from those of the truth map than $ϕ (\tilde{s})$ $\phi(\tilde{\boldsymbol{s}})$ . These preliminary results are encouraging, but not surprising. The Gaussian prior distribution in the statistical model of the Wiener posterior leads to a poor recovery of the non-Gaussian structures that are intrinsic to the polarized dust emission.

B.2. GNILC

The GNILC method is a wavelet-based component separation method that is designed to extract the emission of the Galactic foregrounds from the Planck full-sky maps. It has been applied to polarization maps Q and U (Planck Collaboration IV 2020; Planck Collaboration XII 2020) to disentangle the thermal dust polarization emission from the CMB polarization and instrumental noise over the entire sky. We show in Figs. 4 and B.2 (right column) the resulting Q and U maps, respectively, for the Chamaeleon-Musca field observed at 353 GHz. We note that at this frequency, the CMB can be safely ignored, so that the main effect of the GNILC algorithm is to denoise the dust emission. In these maps, we clearly see that the smallest scales, most contaminated by the noise, have been filtered out. Compared to GNILC, our denoised maps thus include a wider variety of structures at intermediate and small scales.

Fig. B.2.

Same as Fig. 4, but for U.

A quantitative comparison based on the power spectrum and the PDFs of the increments for the Q maps is shown in Figs. 5 and 6. The power spectrum of the GNILC map plummets at small scales and exhibits a lack of power compared to that of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ and the cross-spectrum d₁ × d₂. The PDFs of the increments also show important discrepancies, especially at the tails of the PDFs, between the $\tilde{s}$ $\tilde{\boldsymbol{s}}$ map and the GNILC map. Assuming that the statistics of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ give a relevant order of magnitude of the true statistics, based on the validation on simulated data, this suggests a distortion of these statistics by the GNILC method. This quantitative comparison demonstrates the superiority of our method for recovering the true power spectrum and, a priori, the PDFs of the increments.

All Tables

Table A.1.

Number of statistical coefficients per class of moments for the parameters of this Letter.

In the text

All Figures

Fig. 1.

Top row: simulated Q maps corresponding to the truth s (left), the noise n (middle), and the resulting noisy map d (right). Bottom row: denoised simulated Q map $R (\tilde{s})$ $\Re(\tilde{\boldsymbol{s}})$ (left) next to the difference noisy-denoised $R (d - \tilde{s})$ $\Re(\boldsymbol{d}-\tilde{\boldsymbol{s}})$ (middle) and denoising error $R (s - \tilde{s})$ $\Re(\boldsymbol{s}-\tilde{\boldsymbol{s}})$ (right). Units are arbitrary, but kept consistent between the maps.

In the text

Fig. 2.

Q maps power spectra for d, s, $\tilde{s}$ $\tilde{\boldsymbol{s}}$ , n (all in solid lines), $d - \tilde{s}$ $\boldsymbol{d}-\tilde{\boldsymbol{s}}$ , and $s - \tilde{s}$ $\boldsymbol{s}-\tilde{\boldsymbol{s}}$ , and cross-spectrum between the Q maps of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ and s (all in dashed lines). Bottom panel: ratio of the power spectra of d and $\tilde{s}$ $\tilde{\boldsymbol{s}}$ with that of s.

In the text

Fig. 3.

PDFs of the increments of Q for d (noisy), s (truth), and $\tilde{s}$ $\tilde{\boldsymbol{s}}$ (denoised), computed for three logarithmically spaced lags going from 2 to 32 pixels. For each lag, the increments were normalized by the standard deviation σ_l of the Gaussian that fits the core of the PDF of the noisy map. Bottom panels: ratio of the PDFs of $\tilde{s}$ $\tilde{\boldsymbol{s}}$ with those of s for each lag.

In the text

	Fig. 4. Noisy (left) and denoised (middle) Q maps of the Chamaeleon-Musca region as observed by the Planck satellite at 353 GHz. The noisy maps were denoised as described in Sect. 2.1. We also show the corresponding GNILC map (right) for reference.
In the text

Fig. 5.

Q maps power spectra for d, $\tilde{s}$ $\tilde{\boldsymbol{s}}$ , n (all in solid lines), and $d - \tilde{s}$ $\boldsymbol{d}-\tilde{\boldsymbol{s}}$ , and cross-spectrum between d₁ and d₂ (both in dashed lines). This cross-spectrum estimates the power spectrum of the dust emission. Also shown in a solid line is the power spectrum of the GNILC Q map, corresponding to the same Chamaeleon-Musca field.

In the text

	Fig. 6. PDFs of the Chamaeleon-Musca Q maps increments for d (noisy), $\tilde{s}$ $\tilde{\boldsymbol{s}}$ (denoised), and GNILC map, computed for three logarithmically spaced lags ranging from 4.7′ to 75.2′. For each lag, the increments were normalized by the standard deviation σ_l of the Gaussian that fits the core of the PDF of the noisy map.
In the text

	Fig. A.1. Real part (left) and imaginary part (right) of a 512 × 512 bump-steerable wavelet ψ_6, π/4. The wavelet is centered in the middle of the map for a better visualization.
In the text

	Fig. B.1. Left: Wiener filtered map ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ . Middle: same as Fig. 2, but for ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ . The power spectrum of ${\tilde{s}}_{W}$ $\tilde{\boldsymbol{s}}_{\mathrm{W}}$ is suppressed as expected. Right: same as Fig. 3, but for l = 2 px with the Wiener filter (mean) map and a Wiener posterior sample.
In the text

	Fig. B.2. Same as Fig. 4, but for U.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Allys, E., Levrier, F., Zhang, S., et al. 2019, A&A, 629, A115 [CrossRef] [EDP Sciences] [Google Scholar]

[2] Allys, E., Marchand, T., Cardoso, J. F., et al. 2020, Phys. Rev. D, 102, 103506 [CrossRef] [Google Scholar]

[3] Alsing, J., Heavens, A., & Jaffe, A. H. 2017, MNRAS, 466, 3272 [NASA ADS] [CrossRef] [Google Scholar]

[4] Aylor, K., Haq, M., Knox, L., Hezaveh, Y., & Perreault-Levasseur, L. 2020, MNRAS, 500, 3889 [Google Scholar]

[5] Beck, D., Errard, J., & Stompor, R. 2020, JCAP, 2020, 030 [Google Scholar]

[6] BICEP2/Keck and Planck Collaborations 2015, Phys. Rev. Lett., 114, 101301 [Google Scholar]

[7] Buades, A., Coll, B., & Morel, J. M. 2010, SIAM Rev., 52, 113 [CrossRef] [Google Scholar]

[8] Burkhart, B., Falceta-Gonçalves, D., Kowal, G., & Lazarian, A. 2009, ApJ, 693, 250 [NASA ADS] [CrossRef] [Google Scholar]

[9] Byrd, R., Lu, P., Nocedal, J., & Zhu, C. 1995, SIAM J. Sci. Comput., 16, 1190 [Google Scholar]

[10] Cheng, S., Ting, Y.-S., Ménard, B., & Bruna, J. 2020, MNRAS, 499, 5902 [CrossRef] [Google Scholar]

[11] Clark, S. E., & Hensley, B. S. 2019, ApJ, 887, 136 [Google Scholar]

[12] Dabov, K., Foi, A., Katkovnik, V., & Egiazarian, K. 2007, IEEE Trans. Image Process., 16, 2080 [Google Scholar]

[13] Draine, B. T. 2011, Physics of the Interstellar and Intergalactic Medium (Princeton: Princeton University Press) [Google Scholar]

[14] Elsner, F., & Wandelt, B. D. 2013, A&A, 549, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[15] Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 [NASA ADS] [CrossRef] [Google Scholar]

[16] Hensley, B. S., & Bull, P. 2018, ApJ, 853, 127 [Google Scholar]

[17] Hily-Blant, P., & Falgarone, E. 2009, A&A, 500, L29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[18] Hily-Blant, P., Falgarone, E., & Pety, J. 2008, A&A, 481, 367 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[19] Jasche, J., & Lavaux, G. 2015, MNRAS, 447, 1204 [NASA ADS] [CrossRef] [Google Scholar]

[20] Jeffrey, N., Abdalla, F. B., Lahav, O., et al. 2018a, MNRAS, 479, 2871 [Google Scholar]

[21] Jeffrey, N., Heavens, A. F., & Fortio, P. D. 2018b, Astron. Comput., 25, 230 [Google Scholar]

[22] Kodi Ramanah, D., Lavaux, G., & Wandelt, B. D. 2019, MNRAS, 490, 947 [Google Scholar]

[23] Krachmalnicoff, N., & Puglisi, G. 2021, ApJ, 911, 42 [Google Scholar]

[24] Mallat, S., Zhang, S., & Rochette, G. 2020, Information and Inference: A Journal of the IMA, 9, 721 [Google Scholar]

[25] Pelgrims, V., Ferrière, K., Boulanger, F., Lallement, R., & Montier, L. 2020, A&A, 636, A17 [EDP Sciences] [Google Scholar]

[26] Petroff, M. A., Addison, G. E., Bennett, C. L., & Weiland, J. L. 2020, ApJ, 903, 104 [Google Scholar]

[27] Planck Collaboration III. 2020, A&A, 641, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[28] Planck Collaboration IV. 2020, A&A, 641, A4 [CrossRef] [EDP Sciences] [Google Scholar]

[29] Planck Collaboration XI. 2020, A&A, 641, A11 [CrossRef] [EDP Sciences] [Google Scholar]

[30] Planck Collaboration XII. 2020, A&A, 641, A12 [CrossRef] [EDP Sciences] [Google Scholar]

[31] Planck Collaboration Int. XLVIII. 2016, A&A, 596, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[32] Regaldo-Saint Blancard, B., Levrier, F., Allys, E., Bellomi, E., & Boulanger, F. 2020, A&A, 642, A217 [EDP Sciences] [Google Scholar]

[33] Saydjari, A. K., Portillo, S. K. N., Slepian, Z., et al. 2021, ApJ, 910, 122 [Google Scholar]

[34] Thorne, B., Knox, L., & Prabhu, K. 2021, MNRAS, 504, 2603 [Google Scholar]

[35] Wandelt, B. D., Larson, D. L., & Lakshminarayanan, A. 2004, Phys. Rev. D, 70, 083511 [NASA ADS] [CrossRef] [Google Scholar]

[36] Wiener, N. 1949, Extrapolation, Interpolation, and Smoothing of Stationary Time Series (Cambridge, MA: MIT Press), 7 [CrossRef] [Google Scholar]

[37] Zaroubi, S., Hoffman, Y., Fisher, K. B., & Lahav, O. 1995, ApJ, 449, 446 [NASA ADS] [CrossRef] [Google Scholar]

[38] Zhang, S., & Mallat, S. 2021, Appl. Comput. Harmonic Anal., 53, 199 [Google Scholar]

[39] Zhang, K., Zuo, W., Chen, Y., Meng, D., & Zhang, L. 2017, IEEE Trans. Image Process., 26, 3142 [NASA ADS] [CrossRef] [Google Scholar]

[40] Zonca, A., Singer, L., Lenz, D., et al. 2019, J. Open Sour. Softw., 4, 1298 [Google Scholar]

A new approach for the statistical denoising of Planck interstellar dust polarization data⋆

1. Introduction

2. Method and validation

2.1. Description of the method

2.2. Validation on a simulation

3. Application to Planck polarization data

3.1. Presentation of the data

3.2. Denoising results

4. Conclusion and prospects

Acknowledgments

References

Appendix A: The WPH statistics

A.1. Bump-steerable wavelets

A.2. WPH moments

A.3. Scaling moments

A.4. Denoising procedure

Appendix B: Comparison to other methods

B.1. Wiener filtering and posterior sampling

B.2. GNILC

All Tables

All Figures

A new approach for the statistical denoising of Planck interstellar dust polarization data^⋆