Free Access
Issue
A&A
Volume 546, October 2012
Article Number A114
Number of page(s) 10
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/201118234
Published online 17 October 2012

© ESO, 2012

1. Introduction

1.1. Literature overview

The gamma-ray sky has been studied with unprecedented sensitivity and image capability thanks to the Large Area Telescope (LAT), the main instrument of the Fermi Gamma-ray Space Telescope (Atwood et al. 2009), in an energy range between 20 MeV to greater than 300 GeV. The detection of gamma-ray point sources is difficult for two main reasons: the Poisson noise and the instrument’s point spread function (PSF). The Poisson noise is due to the fluctuations in the number of detected photons. Moreover, the effect of Poisson noise is strong because of the weakness of the fluxes of celestial gamma rays, especially outside the Galactic plane and far away from intense sources. The PSF width is strongly energy-dependent, varying from about 3.5 at 100 MeV to less than (68% containment) at 10 GeV. Owing to large-angle multiple scattering in the tracker, the PSF has broad tails, for which the 95%/68% containment ratio may be as large as 3.

An extensive literature exists on Poisson noise removal and the interested reader may refer to Schmitt et al. (2011) and Starck et al. (2010) for a thorough review. Motivated by new X-ray and gamma-ray data challenges, several restoration methods have been released in astrophysics that are based on wavelets (Movit 2009; Starck et al. 2009; Faÿ et al. 2011; Schmitt et al. 2010) or the Bayesian machinery (Conrad et al. 2007; Norris et al. 2010). Wavelets have also been used for source detection in Fermi data (Abdo et al. 2010), and a first Poisson denoising algorithm for spherical data was proposed in Schmitt et al. (2010). Starck et al. (2009) developed a denoising approach that effectively handles multichannel data acquired on a cartesian grid, and where the third dimension can be any physically meaningful index such as wavelength, energy, or time. While traditional techniques integrate over all the third dimension in order to improve the signal-to-noise ratio (S/N) of the sources, their approach allows us to detect the sources while preserving all of their spectral information without sacrificing any the sensitivity. Nonetheless, none of these methods take into account the point spread function (PSF) of the instrument. Deconvolution can be very helpful in many situations such as source identification or flux estimation in the low energy bands where the resolution degrades severely. To the best of our knowledge, no general method for both multichannel denoising and deconvolving on the sphere has been developed in the literature.

1.2. Contributions

We propose a general framework for denoising and deconvolution that complies with all the above requirements, namely deals with

  • 1.

    multichannel data;

  • 2.

    acquired on the sphere;

  • 3.

    and contaminated by Poisson noise.

Our approach builds upon the concept of variance stabilization applied to the spherical wavelet transform coefficients (Starck et al. 2006). This approach gives a multiscale representation of the Poisson data with variance-stabilized coefficients, which can be treated as if they were contaminated by a zero-mean white Gaussian noise. The developed algorithms are validated on simulated Fermi HEALPix multichannel cubes (nside = 256) with energy bands ranging from 50 MeV to 50 GeV.

1.3. Paper organization

The rest of the paper is organized as follows. In Sect. 2, we present a new wavelet transform for multichannel data on the sphere, and we show how the variance stabilization transform can be introduced into the decomposition. Section 3 details the way that this transform can be used for Gaussian and Poisson noise removal. Section 4 describes our deconvolution algorithm on the sphere for both mono-channel and multichannel spherical data. In Sect. 5, we finally draw some conclusions and give possible perspectives of this work.

1.4. Notations

For a real discrete-time filter whose impulse response is h [i] , \hbox{$\bar{h}[i]=h[-i], ~ i \in \mathbb{Z}$} is its time-reversed version. The discrete circular convolution product of two signals is written   , where the term circular represents periodic boundary conditions. The symbol δ [i]  is the Kronecker delta.

For the wavelet representation, the low-pass analysis filter is denoted h and the high-pass is taken as g = δ − h throughout the paper. We denote the up-sampled version of h as h ↑ j [l]  = h [l]  if l/2j ∈ Z and 0 otherwise. We define \hbox{$h^{(j)} = \bar{h}^{\uparrow j-1}\star\ldots\star\bar{h}^{\uparrow 1}\star \bar{h}$} for \hbox{$j\geqslant 1$} and h(0) = δ.

The scaling and wavelet functions used for the analysis (respectively, synthesis) are denoted φ (with φ(x2)=kh[k]φ(xk),xR and kZ\hbox{$\phi(\frac{x}{2}) = \sum_k h[k] \phi(x-k), x \in {\mathbb{R}} ~ {\rm and} ~ k \in {\mathbb{Z}}$}) and ψ (with ψ(x2)=kg[k]φ(xk),xR \hbox{$\psi(\frac{x}{2}) = \sum_k g[k] \phi(x-k), x \in {\mathbb{R}} ~ $}and   k ∈ Z) (respectively, 􏽥φ\hbox{$\widetilde{\phi}$} and 􏽥ψ\hbox{$\widetilde{\psi}$}).

2. Multiscale representation for multichannel spherical data with poisson noise

2.1. Fast undecimated 2D-1D wavelet decomposition/reconstruction on the sphere

Our goal is to analyze multichannel data acquired on a sphere with a non-isotropic two-and-one-dimensional (2D-1D) wavelet, where the two first dimensions are spatial (longitude and latitude) and the third dimension is either the time or the energy. Since the dimensions do not have the same physical meaning, it appears natural that the wavelet scale along the third dimension (energy or time) should not be connected to the spatial scale. Hence, we define the wavelet function as ψ(kθ,kϕ,kt)=ψ(θφ)(kθ,kϕ)ψ(t)(kt),\begin{equation} \psi( k_{\theta},k_{\varphi}, k_{t}) = \psi^{(\theta \phi)} (k_{\theta}, k_{\varphi}) \psi^{(t)} (k_t) , \end{equation}(1)where ψ(θφ) is the spherical two dimensional (2D) spatial wavelet and ψ(t) the one-dimensional (1D) wavelet along the third dimension. Similarly to Starck et al. (2009), we consider only isotropic and dyadic spatial scales. We build the discrete 2D-1D wavelet decomposition by first taking a spherical 2D undecimated wavelet transform for each kt, followed by a 1D wavelet transform for each spatial wavelet coefficient along the third dimension.

Hence, for a given multichannel data set on the sphere Y [kθ,kϕ,kt]  and after applying first the 2D spherical undecimated wavelet transform, we have the reconstruction formula Y[kθ,kϕ,kt]=aJ1[kθ,kϕ,kt]+j1=1J1wj1[kθ,kϕ,kt],kt,\begin{eqnarray} \label{reconstiuwt} Y[k_{\theta},k_{\varphi},k_{t}] && = a_{J_1}[k_{\theta},k_{\varphi},k_{t}] \nonumber\\&&\quad+ \sum_{j_1=1}^{J1}w_{j_1}[k_{\theta},k_{\varphi},k_{t}], \qquad \forall k_{t} , \end{eqnarray}(2)where J1 is the number of spatial scales, aJ1 is the (spatial) approximation subband, and {wj1}j1=1J1\hbox{$\{w_{j_1}\}_{j_1=1}^{J_1}$} are the (spatial) detail subbands. To simplify the notations in the sequel, we replace the two spatial indices by a single index kr, which corresponds to the pixel index. Equation (2)now reads Y[kr,kt]=aJ1[kr,kt]+j1=1J1wj1[kr,kt],kt.\begin{equation} \label{reconstiuwtpix} Y[k_{r},k_{t}] = a_{J_1}[k_{r},k_t] + \sum_{j_1=1}^{J1}w_{j_1}[k_{r},k_{t}], \qquad \forall k_{t} . \end{equation}(3)For each spatial location kr and each 2D wavelet scale j1, we then apply a 1D wavelet transform along t on the spatial wavelet coefficients wj1 [kr,·]  such that wj1[kr,kt]=wj1,J2[kr,kt]+j2=1J2wj1,j2[kr,kt],(kr,kt),\begin{equation} \label{decompwavj1} w_{j_1}[k_r,k_t] = w_{j_1,J_2} [k_r,k_t] + \sum_{j_2=1}^{J_2}w_{j_1,j_2}[k_r,k_t], \quad \forall(k_r,k_t), \end{equation}(4)where J2 is the number of scales along t. The approximation spatial subband aJ1 is processed in a similar way, hence yielding aJ1[kr,kt]=aJ1,J2[kr,kt]+j2=1J2wJ1,j2[kr,kt],(kr,kt).\begin{equation} a_{J_1}[k_r,k_t] = a_{J_1,J_2} [k_r,k_t] + \sum_{j_2=1}^{J_2} w_{J_1,j_2}[k_r,k_t], \qquad \forall(k_r,k_t) .\label{decompaj} \end{equation}(5)Inserting Eqs. (4)and (5)into (3), we obtain the 2D-1D spherical undecimated wavelet representation of Y: Y[kr,kt]=aJ1,J2[kr,kt]+j1=1J1wj1,J2[kr,kt]+j2=1J2wJ1,j2[kr,kt]+j1=1J1j2=1J2wj1,j2[kr,kt].\begin{eqnarray} \label{decomp2d1d} Y[k_r,k_t] &&= a_{J_1,J_2}[k_r,k_t] + \sum_{j_1=1}^{J_1}w_{j_1,J_2} [k_r,k_t] \nonumber\\ &&\quad + \sum_{j_2=1}^{J_2} w_{J_1,j_2} [k_r,k_t]+ \sum_{j_1=1}^{J_1}\sum_{j_2=1}^{J_2} w_{j_1,j_2} [k_r,k_t] . \end{eqnarray}(6)In this expression, four kinds of coefficients can be distinguished:

  • Detail-detail coefficients (\hbox{$j_1 \leqslant J_1$} and \hbox{$j_2 \leqslant J_2$}): wj1,j2[kr,·]=(δ1D)(h1D(j21)aj11[kr,·]h1D(j21)aj1[kr,·]).\begin{eqnarray} \label{detaildetail} w_{j_1,j_2}[k_r,\cdot] &=& (\delta - \bar{h}_{\mathrm{1D}}) \star (h_{\mathrm{1D}}^{(j_2-1)} \star a_{j_1-1}[k_r,\cdot] \notag\\&&- h_{\mathrm{1D}}^{(j_2-1)} \star a_{j_1}[k_r,\cdot]) . \end{eqnarray}(7)

  • Approximation-detail coefficients (j1 = J1 and \hbox{$j_2 \leqslant J_2$}): wJ1,j2[kr,·]=h1D(j21)aJ1[kr,·]h1D(j2)aJ1[kr,·].\begin{equation} \label{approxdetail} w_{J_1,j_2}[k_r,\cdot] = h_{\mathrm{1D}}^{(j_2-1)} \star a_{J_1}[k_r,\cdot] - h_{\mathrm{1D}}^{(j_2)} \star a_{J_1}[k_r,\cdot] . \end{equation}(8)

  • Detail-approximation coefficients (\hbox{$j_1 \leqslant J_1$} and j2 = J2): wj1,J2[kr,·]=h1D(J2)aj11[kr,·]h1D(J2)aj1[kr,·].\begin{equation} \label{detailapprox} w_{j_1,J_2}[k_r,\cdot] = h_{\mathrm{1D}}^{(J_2)} \star a_{j_1-1}[k_r,\cdot] - h_{\mathrm{1D}}^{(J_2)} \star a_{j_1}[k_r,\cdot] . \end{equation}(9)

  • Approximation-approximation coefficients (j1 = J1 and j2 = J2): aJ1,J2[kr,·]=h1D(J2)aJ1[kr,·].\begin{equation} \label{approxapprox} a_{J_1,J_2}[k_r,\cdot] = h_{\mathrm{1D}}^{(J_2)} \star a_{J_1}[k_r,\cdot] . \end{equation}(10)

2.2. Multi-scale variance stabilizing transform on the sphere (MS-VSTS)

Schmitt et al. (2010) proposed a multiscale variance stabilizing transform adapted for Poisson spherical data. This transform was dubbed the multi-scale variance stabilizing transform on the Sphere (MS-VSTS). The MS-VSTS is a multi-scale decomposition method designed for Poisson noise that is based on a variance stabilizing transform(VST). Poisson noise is indeed signal-dependent, which complicates its removal. The aim of a VST is to get rid of this signal-dependence by transforming a Poisson distribution into a Gaussian distribution of known variance. In a nutshell, the MS-VSTS consists of plugging a VST into a multi-scale transform – the isotropic undecimated wavelet transform on the sphere (IUWTS) – in order to realize (approximately) Gaussian zero-mean multiscale coefficients with constant variance. The noise on coefficients can then be easily removed using Gaussian denoising methods such as wavelet shrinkage, as we see in the next section.

The MS-VSTS scheme is defined recursively by inserting a (nonlinear) square-root VST into the IUWTS steps, that is IUWTS{={\begin{eqnarray} \label{eq27} \text{IUWTS} &&\left\{\begin{array}{ccc} a_j & = & h_{j-1} \star a_{j-1} \\[.5mm] d_j & = & a_{j-1} - a_j \end{array}\right. \Longrightarrow \nonumber\\[1.5mm] &&\qquad \begin{array}{c}\text{MS-VSTS} \\ \text{(VST + IUWTS)} \end{array} \left\{\begin{array}{ccc} a_j & = & h_{j-1} \star a_{j-1} \\[.5mm] d_j & = & T_{j-1}(a_{j-1}) - T_j(a_j) , \end{array}\right. \end{eqnarray}(11)where Tj is the VST operator on scale jTj(aj)=b(j)sign(aj+c(j))|aj+c(j)|,\begin{equation} \label{eq28} T_j(a_j) = b^{(j)} \mathrm{sign}(a_j+c^{(j)})\sqrt{|a_j + c^{(j)}|} , \end{equation}(12)with the VST constants b(j) and c(j) that depend solely on the filter h and the scale level j. Zhang et al. (2008) showed that the MS-VSTS detail coefficients dj on locally homogeneous parts of the underlying intensity signal follow asymptotically a zero-mean normal distribution with an intensity-independent variance that relies only on the filter h and the current scale j. Consequently, for a given h, both the stabilized variances and the constants b(j) and c(j) can be pre-computed once and for all (Schmitt et al. 2010).

2.3. Multichannel MS-VSTS

We now extend the MS-VSTS machinery to the multichannel case. This amounts to plugging the VST into the spherical 2D-1D undecimated wavelet transform introduced in Sect. 2.1. This again gives rise to four types of coefficients that take the following forms:

  • Detail-detail coefficients (\hbox{$j_1 \leqslant J_1$} and \hbox{$j_2 \leqslant J_2$}): wj1,j2[kr,·]=(δ1D)(Tj11,j21(h1D(j21)aj11[kr,·])Tj1,j21(h1D(j21)aj1[kr,·])).\begin{eqnarray} \label{detaildetailmsvst} w_{j_1,j_2}[k_r,\cdot] &&= (\delta - \bar{h}_{\mathrm{1D}}) \star \big( T_{j_1-1,j_2-1} \big({h}_{\mathrm{1D}}^{(j_2-1)}\nonumber\\[1mm] &&\quad \star a_{j_1-1}[k_r,\cdot] \big) - T_{j_1,j_2-1}\big(h_{\mathrm{1D}}^{(j_2-1)} \star a_{j_1}[k_r,\cdot]\big)\big) . \end{eqnarray}(13)

  • Approximation-detail coefficients (j1 = J1 and \hbox{$j_2 \leqslant J_2$}): wJ1,j2[kr,·]=TJ1,j21(h1D(j21)aJ1[kr,·])TJ1,j2(h1D(j2)aJ1[kr,·]).\begin{eqnarray} \label{approxdetailmsvst} w_{J_1,j_2}[k_r,\cdot] &&= T_{J_1,j_2-1}\parenth{h_{\mathrm{1D}}^{(j_2-1)} \star a_{J_1}[k_r,\cdot]} \nonumber\\[1mm] &&\quad- T_{J_1,j_2}\parenth{h_{\mathrm{1D}}^{(j_2)} \star a_{J_1}[k_r,\cdot]} . \end{eqnarray}(14)

  • Detail-approximation coefficients (\hbox{$j_1 \leqslant J_1$} and j2 = J2): wj1,J2[kr,·]=Tj11,J2(h1D(J2)aj11[kr,·])Tj1,J2(h1D(J2)aj1[kr,·]).\begin{eqnarray} \label{detailapproxmsvst} w_{j_1,J_2}[k_r,\cdot] &&= T_{j_1-1,J_2} \parenth{h_{\mathrm{1D}}^{(J_2)} \star a_{j_1-1}[k_r,\cdot]} \nonumber\\[1mm] &&\quad- T_{j_1,J_2} \parenth{h_{\mathrm{1D}}^{(J_2)} \star a_{j_1}[k_r,\cdot]} . \end{eqnarray}(15)

  • Approximation-approximation coefficients (j1 = J1 and j2 = J2): aJ1,J2[kr,·]=h1D(J2)aJ1[kr,·].\begin{equation} \label{approxapproxmsvst} a_{J_1,J_2}[k_r,\cdot] = h_{\mathrm{1D}}^{(J_2)} \star a_{J_1}[k_r,\cdot] . \end{equation}(16)

In summary, all 2D-1D wavelet coefficients \hbox{$\{w_{j_1,j_2}\}_{j_1 \leqslant J_1,j_2 \leqslant J_2}$} are now stabilized, and the noise in all these wavelet coefficients is a zero-mean Gaussian with known variance that depends solely on h on the resolution levels (j1,j2). As before, these variances can be easily tabulated.

3. Application to multichannel denoising

We define X to be the noiseless data and Y their observed noisy version. In the case of the additive zero-mean white Gaussian noise, we have \hbox{$Y \sim \mathcal{N}(X,\sigma^2)$}, and for the Poisson noise \hbox{$Y \sim \mathcal{P}(X)$}. The main objective behind denoising is to estimate X from Y.

3.1. Warm-up: Gaussian noise

We start with the simple and instructive case where the noise in Y is additive white Gaussian. As the spherical 2D-1D undecimated wavelet transform described in Sect. 2.1 is linear, the noise remains Gaussian in the transform domain. Therefore, the thresholding strategies that have been developed for wavelet Gaussian denoising can be applied to the spherical 2D-1D wavelet transform. Denoting the thresholding operator as TH, the denoised 2D-1D estimate of X obtained by thresholding the wavelet coefficients in Eq. (6)reads 􏽥X[kr,kt]=aJ1,J2[kr,kt]+j1=1J1TH(wj1,J2[kr,kt])+j2=1J2TH(wJ1,j2[kr,kt])+j1=1J1j2=1J2TH(wj1,j2[kr,kt]).\begin{eqnarray} \label{denoisegaussmc} \widetilde{X}[k_r,k_t] &=& a_{J_1,J_2}[k_r,k_t] + \sum_{j_1=1}^{J_1}\text{TH}(w_{j_1,J_2} [k_r,k_t]) \nonumber\\ &&\quad+ \sum_{j_2=1}^{J_2} \text{TH}(w_{J_1,j_2} [k_r,k_t]) \nonumber\\ &&\quad+ \sum_{j_1=1}^{J_1}\sum_{j_2=1}^{J_2} \text{TH}(w_{j_1,j_2} [k_r,k_t]) . \end{eqnarray}(17)A typical choice of TH is the hard thresholding operator parametrized by the scalar threshold \hbox{$\tau \geqslant 0$}, i.e. TH(x)={if|x|<τ,otherwise.\begin{equation} \label{hardthresh} \text{TH}(x) = \begin{cases} 0 & \text{if } |x| < \tau, \\ x & \text{otherwise}. \end{cases} \end{equation}(18)The threshold τ is typically chosen to be between three and five times the noise standard deviation.

3.2. Poisson noise

The treatment of Poisson noise is far more complicated than its Gaussian counterpart, the main reason being that the noise variance is equal to its mean. This is where the MS-VSTS comes into play. The role of the MS-VSTS is indeed to get rid of this dependence of the variance on the mean by ensuring that the transformed coefficients are Gaussian with constant variance (without loss of generality, this variance can be assumed to be equal to 1). In other words, after the MS-VSTS, we are brought to a Gaussian denoising problem where standard thresholding approaches apply.

Nevertheless, denoising is not straightforward because there is no explicit reconstruction formula available because of the form of the non-linear stabilization equations above. Formally, the stabilizing operators Tj1,j2 and the convolution operators along the spatial and the third dimensions do not commute, even though the filter bank satisfies the exact reconstruction formula. To circumvent this difficulty, we propose to solve this reconstruction problem by advocating an iterative reconstruction scheme.

3.3. Iterative reconstruction

We define W to be the transform operator associated with the 2D-1D IUWTS described in Sect. 2.1, and R to be its inverse transform. We define ℳ to be the multiresolution support, which is determined by the set of significant coefficients detected among WY at each scale (j1,j2) and location (kr,kt), i.e. ={(j1,j2,kr,kt)|(WY)j1,j2[kr,kt]issignificant}.\begin{equation} \label{supportmr} \mathcal{M} = \{(j_1,j_2,k_r,k_t) \big | (\mathbf{W}Y)_{j_1,j_2}[k_r,k_t] \text{ is significant}\}. \end{equation}(19)We define M to be the orthogonal projector onto ℳ, i.e. ∀d(Md)j1,j2[kr,kt]={if(j1,j2,kr,kt),otherwise.\begin{equation} (\mathbf{M}d)_{j_1,j_2}[k_r,k_t] = \begin{cases} (\mathbf{W}Y)_{j_1,j_2}[k_r,k_t] & \text{if } (j_1,j_2,k_r,k_t) \in \mathcal{M}, \\ d_{j_1,j_2}[k_r,k_t] & \text{otherwise.} \end{cases} \end{equation}(20)Our goal is to seek a solution 􏽥X\hbox{$\widetilde{X}$} that preserves the significant structures of the original data by reproducing exactly the same wavelet coefficients as those of the input data Y, but only on scales and at positions where significant coefficients have been detected. Furthermore, as Poisson intensity functions are positive by nature, a positivity constraint is imposed on the solution. It is clear that there are many solutions satisfying the positivity and multiresolution support consistency requirements, e.g. Y itself. Thus, our reconstruction problem based solely on these constraints is an ill-posed inverse problem that must be regularized. Typically, the solution in which we are interested must be sparse by involving the lowest budget of wavelet coefficients. Therefore, our reconstruction is formulated as a constrained sparsity-promoting minimization problem over the transform coefficients dmindd1subjectto{\begin{eqnarray} \label{minpoisson} &&\min_{d} \| d \|_1 \text{ subject to } \nonumber\\[0.5mm] &&\qquad\begin{cases} d_{j_1,j_2}[k_r,k_t] = (\mathbf{W}Y)_{j_1,j_2}[k_r,k_t], \forall (j_1,j_2,k_r,k_t) \in \mathcal{M}, \\ X \geqslant 0 . \end{cases} \end{eqnarray}(21)and the intensity estimate 􏽥X\hbox{$\widetilde{X}$} is reconstructed as 􏽥X=R˜d\hbox{$\widetilde{X} = \mathbf{R} \tilde{d}$}, where ˜d\hbox{$\tilde{d}$} is a global minimizer of Eq. (21). We recall that  ∥·∥ 1 =  ∑ j1,j2,kr,kt | dj1,j2 [kr,kt]  |  is the 1-norm playing the role of regularization and is well-known to promote sparsity (Donoho 2004). This problem can be solved efficiently using the hybrid steepest descent algorithm (Yamada 2001; Zhang et al. 2008), and requires about ten iterations in practice. Transposed into our context, its main steps can be summarized as follows:

where P+ is the orthogonal projector onto the positive orthant, and STβn is the entry-wise soft-thresholding operator with threshold βn, i.e. for x ∈ R, STβn(x) = max(0,1 − βn/|x|)x.

The final multichannel MS-VSTS Poisson noise removal algorithm is summarized in the following steps:

thumbnail Fig. 1

Result of the multichannel Poisson denoising algorithm on simulated Fermi data over the energy band 220–360 MeV. Top: simulated intensity skymap. Middle: simulated noisy skymap. Bottom: denoised skymap. Maps are on a logarithmic scale.

thumbnail Fig. 2

Result of the multichannel Poisson denoising algorithm on simulated Fermi data over the energy band 589–965 MeV. Top: simulated intensity skymap. Middle: simulated noisy skymap. Bottom: denoised skymap. Maps are on a logarithmic scale.

3.4. Experiments

The multichannel MS-VSTS algorithm has been applied to a simulated Fermi data set, with 14 energy bands between 50 MeV and 1.58 GeV. Figures 1 and 2 depict the denoising results for two energy bands. The algorithm is able to recover most of the sources, even the faint ones, on each energy band. Even more importantly, the 2D-1D MS-VSTS denoising algorithm allows us to recover the spectral information for each spatial position, as can be seen from Fig. 3.

4. Deconvolution of spherical data with Poisson noise

We now introduce a wavelet deconvolution approach for monochannel and multichannel data on the sphere with Poisson noise. The main idea underlying the method is to apply the MS-VSTS method described above. We first introduce the deconvolution problem and then describe how the MS-VSTS can be used to solve the deconvolution problem.

4.1. Problem statement

Many problems in signal and image processing can be cast as an inversion of a linear system Y=HX+ε,\begin{equation} \label{linsyst} Y = \mathbf{H} X + \varepsilon , \end{equation}(22)where \hbox{$X \in \mathcal{X}$} is the data to recover, \hbox{$Y \in \mathcal{Y}$} is the degraded noisy observation, ε is an additive noise, and \hbox{$\mathbf{H}{:} \mathcal{X} \to \mathcal{Y}$} is a bounded linear operator that is typically ill-behaved because it models an acquisition process that encounters loss of information. When H is the identity, it is just a denoising problem that can be treated with the previously described methods. Inverting Eq. (22)is usually an ill-posed problem, which means that there is no unique and stable solution.

Our objective is to remove the effect of the instrument’s PSF. In our case, H is the convolution by a blurring kernel (i.e. PSF) operator that causes Y to lack the high frequency content of X. Furthermore, since the noise is Poisson, ε has a variance profile HX. The problem at hand is then a deconvolution problem in the presence of Poisson noise.

We therefore need to both regularize the problem and handle the Poisson statistics of the noise. To regularize this inversion problem and reduce the space of candidate solutions, one has to add some prior knowledge of the typical structure of the original data X. This prior information accounts for the smoothness of the solution and can range from the uniform smoothness assumption to more complex knowledge of the geometrical structures of X.

In our LAT realistic simulations, the PSF width depends strongly on the energy, from at 50 MeV to better than at 10 GeV and above. Figure 4 shows the normalized profiles of the PSF for different energy bands.

4.2. Monochannel deconvolution

We first consider the single-channel case. In the literature, several algorithms have been proposed to perform image deconvolution on a cartesian grid. The Richardson-Lucy algorithm is certainly the most famous in astrophysics. In this paper, we propose a regularized Richardson-Lucy algorithm to deconvolve data on the sphere data.

thumbnail Fig. 3

Spectrum of a single gamma-ray point source recovered using the multichannel MS-VSTS denoising algorithm. Top: single gamma-ray source from simulated Fermi data integrated along the energy axis (left: simulated source; middle: noisy source; right: denoised source). Bottom: spectrum of the center of the point source: intensity as a function of the energy band with 14 energy bands between 50 MeV and 50 GeV (black: true simulated spectrum; cyan: restored spectrum from our denoising algorithm.

The Richardson-Lucy algorithm originates from a fixed-point equation obtained by maximizing the Poisson likelihood with respect to X while preserving positivity. This entails a multiplicative update rule, starting at n = 0 and X(0) = 1 and iterating X(n+1)=X(n)(HT(YHX(n))),\begin{equation} \label{JVCmethod} X^{(n+1)} = X^{(n)} \otimes \parenth{ \mathbf{H}^{T} (Y {\mathbf{H} X^{(n)}})}, \end{equation}(23)where  ⊗  (respectively  \hbox{$\odiv$}) represents the element-wise multiplication (respectively division) between two vectors, and HT is the transpose of H whose action on an image consists in convolving it with the time-reversed version of the PSF associated with H. However, it is well-known that owing to the lack of regularization, the Richardson-Lucy algorithm tends to amplify the noise after a few iterations.

We define R(n) as the residual at iteration nR(n)=YHX(n),\begin{equation} \label{residual} R^{(n)} = Y- \mathbf{H} X^{(n)}, \end{equation}(24)where R(n) can be written as the sum of its IUWTS detail subband \hbox{$\{d_j\}_{1 \leqslant j \leqslant J}$} and the last approximation subband aJ, that is, R(n)[kr]=aJ[kr]+j=1Jdj[kr],kr.\begin{equation} \label{waveletres} R^{(n)}[k_r] = a_J [k_r] + \sum_{j=1}^{J} d_j [k_r], \quad \forall k_r. \end{equation}(25)The wavelet transform provides a means of extracting only the significant structures from the residual at each iteration. With increasing number of iterations, a large part of the residual becomes statistically insignificant. The regularized significant residual is then, for a location kr(n)[kr]=aJ[kr]+(j,kr)dj[kr],\begin{equation} \label{signifgauss} \bar{R}^{(n)}[k_r] = a_J [k_r] + \sum_{(j,k_r) \in \mathcal{M}} d_j [k_r] , \end{equation}(26)where ℳ is the multiresolution support defined in a similar way to Eq. (19). The regularized Richardson-Lucy scheme then becomes X(n+1)=P+(X(n)(HT((HX(n)+(n))HX(n)))).\begin{equation} \label{regularizedJVC} X^{(n+1)} = \mathbf{P}_+\parenth{X^{(n)} \otimes \parenth{ \mathbf{H}^{T} \parenth{({\mathbf{H} X^{(n)}}+\bar{R}^{(n)}) {\mathbf{H} X^{(n)}}}}} . \end{equation}(27)This algorithm is similar to Murtagh et al. (1995), except that the à trous wavelet transform is replaced by the undecimated wavelet transform. In the next subsection, we describe how the same algorithm can be extended to the multi-channel case.

thumbnail Fig. 4

Normalized profile of the PSF for different energy bands as a function of the angle in degree. Black: 50−82 MeV. Cyan: 220−360 MeV. Orange: 960−1.6 GeV. Blue: 4.2−6.9 GeV. Green: 19−31 GeV.

thumbnail Fig. 5

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 82–134 MeV. Maps are on a logarithmic scale.

thumbnail Fig. 6

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 220–360 MeV. Maps are on a logarithmic scale.

thumbnail Fig. 7

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 360–589 MeV. Maps are on a logarithmic scale.

thumbnail Fig. 8

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 589–965 MeV. Maps are on a logarithmic scale.

4.3. Multichannel deconvolution

As the PSF is channel-dependent, the convolution observation model is now Y[·,kt]=HktX[·,kt]+ε[·,kt]\begin{equation*} Y[\cdot,k_t] = \mathbf{H}_{k_t} X[\cdot,k_t] + \varepsilon[\cdot,k_t] \end{equation*}in each channel kt, where Hkt is the (spatial) convolution operator in channel kt with known PSF.

The same recipe as in the monochannel case applies with the notable difference that the spherical 2D-1D MS-VSTS is used instead of its monochannel counterpart. The multichannel multiresolution support ℳ is obtained after thresholding these coefficients.

We now define H to be the multichannel convolution1 operator, which acts on a 2D-1D multichannel spherical data set X by applying Ht to each channel X [·,kt]  independently2. The regularized multichannel Richardson-Lucy scheme is then X(n+1)=P+(X(n)(HT((HX(n)+(n))HX(n)))),\begin{equation} \label{regularizedmcdeconv} X^{(n+1)} = \mathbf{P}_+\parenth{X^{(n)} \otimes \parenth{ \mathbf{H}^{T} \parenth{({\mathbf{H} X^{(n)}}+\bar{R}^{(n)}) {\mathbf{H} X^{(n)}}}}} , \end{equation}(28)where \hbox{$\bar{R}^{(n)}$} is the regularized (significant) residual (n)[kr,kt]=aJ1,J2[kr,kt]+(j1,j2,kr,kt)wj1,j2[kr,kt].\begin{equation} \begin{split} \label{signifresmc} \bar{R}^{(n)}[k_r,k_t] = a_{J_1,J_2} [k_r,k_t] + \sum_{(j_1,j_2,k_r,k_t) \in \mathcal{M}} w_{j_1,j_2}[k_r,k_t] . \end{split} \end{equation}(29)

4.4. Experiments

The algorithm was applied to the seven energy bands (50 MeV–1.58 GeV) of our simulated Fermi data set. Figures 5 to 8 display the deconvolution results for four energy bands. Figure 9 shows the performance of the multichannel MS-VSTS deconvolution algorithm for a single point source. The deconvolution not only effectively removes the blur and recovers sharply localized point sources, but also allows us to restore all of the spectral information. To get a better visual impression of the performance of the deconvolution algorithm, Fig. 10 depicts the result of the algorithm on a single HEALPix face covering the Galactic plane. We find that the deconvolution is remarkably effective: our MS-VSTS multichannel deconvolution algorithm manages to remove a large part of the blur introduced by the PSF.

5. Conclusion

This paper extends the MS-VSTS framework to deal with monochannel deconvolution, multichannel denoising and multichannel deconvolution. Unlike the monochannel MS-VSTS, the multichannel MS-VSTS fully exploits the information in the 2D-1D data set and allows us to recover the spectral information on the sources. As the PSF strongly depends on the energy, it is very important to have a multichannel method for deconvolution. Multichannel deconvolution using MS-VSTS removes a large part of the PSF blur and significantly improves the sharpness of the spatial localization of point sources.

The software related to this paper, MRS/Poisson, and its full documentation will be included in the next version of ISAP (Interactive Sparse astronomical data Analysis Packages) via the ISAP web site3.

thumbnail Fig. 9

Profile of a single gamma-ray point source recovered using the multichannel MS-VSTS deconvolution algorithm. Top: single gamma-ray point source on simulated (blurred) Fermi data (energy band: 360–589 MeV) (left: simulated blurred source; middle: blurred noisy source; right: deconvolved source). Bottom: profile of the point source (cyan: simulated spectrum; black: restored spectrum from the deconvolved source.

thumbnail Fig. 10

View on a single HEALPix face. Result of an application of the deconvolution algorithm to the Galactic plane. Top left: simulated Fermi Poisson intensity. Top right: simulated Fermi noisy data. Bottom: Fermi data deconvolved with multichannel MS-VSTS. Energy band: 360–589 MeV. Pictures are on a logarithmic scale.


1

Strictly speaking, this is a slight abuse of terminology since the kernel is not channel-invariant.

2

If X were to be vectorized by stacking the channels in a long column vector, H would be a block-diagonal matrix whose blocks are the circulant matrices Hkt.

Acknowledgments

Some of the results in this paper have been derived using the Healpix (Górski et al. 2005) and the MR/S software (Starck et al. 2006). This work was supported by the French National Agency for Research (ANR-08-EMER-009-01) and the European Research Council grant SparseAstro (ERC-228261).

References

  1. Abdo, A. A., Ackermann, M., Ajello, M., et al. 2010, ApJS, 188, 405 [NASA ADS] [CrossRef] [Google Scholar]
  2. Atwood, W. B., Abdo, A. A., Ackermann, M., et al. 2009, ApJ, 697, 1071 [NASA ADS] [CrossRef] [Google Scholar]
  3. Conrad, J., Scargle, J., & Ylinen, T. 2007, in The First GLAST Symposium, eds. S. Ritz, P. Michelson, & C. A. Meegan, AIP Conf. Ser., 921, 586 [Google Scholar]
  4. Donoho, D. 2004, For Most Marge Underdetermined Systems of Linear Equations, the minimal l1-norm solution is also the sparsest solution, Tech. Rep., Department of Statistics of Stanford University [Google Scholar]
  5. Faÿ, G., Delabrouille, J., Kerkyacharyan, G., & Picard, D. 2011, [arXiv:1107.5658] [Google Scholar]
  6. Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 [NASA ADS] [CrossRef] [Google Scholar]
  7. Movit, S. 2009, in BAAS, 41, AAS Meeting Abstracts, 332.06 [Google Scholar]
  8. Murtagh, F., Starck, J.-L., & Bijaoui, A. 1995, A&AS, 112, 179 [NASA ADS] [Google Scholar]
  9. Norris, J. P., Gehrels, N., & Scargle, J. D. 2010, ApJ, 717, 411 [NASA ADS] [CrossRef] [Google Scholar]
  10. Schmitt, J., Starck, J. L., Casandjian, J. M., Fadili, J., & Grenier, I. 2010, A&A, 517, A26 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Schmitt, J., Starck, J.-L., Fadili, J., & Digel, S. 2011, in Advances in Machine Learning and Data Mining for Astronomy, eds. M. Way, J. Scargle, K. Ali, & A. Srivastava (Chapman and Hall) [Google Scholar]
  12. Starck, J.-L., Moudden, Y., Abrial, P., & Nguyen, M. 2006, A&A, 446, 1191 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Starck, J., Fadili, J. M., Digel, S., Zhang, B., & Chiang, J. 2009, A&A, 504, 641 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Starck, J.-L., Murtagh, F., & Fadili, M. 2010, Sparse Image and Signal Processing (Cambridge University Press) [Google Scholar]
  15. Yamada, I. 2001, in Inherently Parallel Algorithm in Feasibility and Optimization and their Applications (Elsevier), 473 [Google Scholar]
  16. Zhang, B., Fadili, J., & Starck, J.-L. 2008, IEEE Trans. Image Proc., 11, 1093 [Google Scholar]

All Figures

thumbnail Fig. 1

Result of the multichannel Poisson denoising algorithm on simulated Fermi data over the energy band 220–360 MeV. Top: simulated intensity skymap. Middle: simulated noisy skymap. Bottom: denoised skymap. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 2

Result of the multichannel Poisson denoising algorithm on simulated Fermi data over the energy band 589–965 MeV. Top: simulated intensity skymap. Middle: simulated noisy skymap. Bottom: denoised skymap. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 3

Spectrum of a single gamma-ray point source recovered using the multichannel MS-VSTS denoising algorithm. Top: single gamma-ray source from simulated Fermi data integrated along the energy axis (left: simulated source; middle: noisy source; right: denoised source). Bottom: spectrum of the center of the point source: intensity as a function of the energy band with 14 energy bands between 50 MeV and 50 GeV (black: true simulated spectrum; cyan: restored spectrum from our denoising algorithm.

In the text
thumbnail Fig. 4

Normalized profile of the PSF for different energy bands as a function of the angle in degree. Black: 50−82 MeV. Cyan: 220−360 MeV. Orange: 960−1.6 GeV. Blue: 4.2−6.9 GeV. Green: 19−31 GeV.

In the text
thumbnail Fig. 5

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 82–134 MeV. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 6

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 220–360 MeV. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 7

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 360–589 MeV. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 8

Result of the multichannel deconvolution algorithm for different energy bands. Top: simulated (blurred) intensity skymap. Middle: blurred and noisy skymap. Bottom: deconvolved skymap. Energy band: 589–965 MeV. Maps are on a logarithmic scale.

In the text
thumbnail Fig. 9

Profile of a single gamma-ray point source recovered using the multichannel MS-VSTS deconvolution algorithm. Top: single gamma-ray point source on simulated (blurred) Fermi data (energy band: 360–589 MeV) (left: simulated blurred source; middle: blurred noisy source; right: deconvolved source). Bottom: profile of the point source (cyan: simulated spectrum; black: restored spectrum from the deconvolved source.

In the text
thumbnail Fig. 10

View on a single HEALPix face. Result of an application of the deconvolution algorithm to the Galactic plane. Top left: simulated Fermi Poisson intensity. Top right: simulated Fermi noisy data. Bottom: Fermi data deconvolved with multichannel MS-VSTS. Energy band: 360–589 MeV. Pictures are on a logarithmic scale.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.