A&A 391, 789-794 (2002)
DOI: 10.1051/0004-6361:20020821
R. Vio1,3 - L. Tenorio2 - W. Wamsteker3
1 - Chip Computers Consulting s.r.l., Viale Don L. Sturzo 82, S.Liberale di Marcon, 30020 Venice, Italy
2 - Department of Mathematical and Computer Sciences, Colorado School of Mines, Golden CO 80401, USA
3 - ESA-VILSPA, Apartado 50727, 28080 Madrid, Spain
Received 15 January 2002 / Accepted 31 May 2002
Abstract
Point-source contamination in high-precision Cosmic Microwave Background (CMB) maps severely affects
the precision of cosmological parameter estimates. Among the methods that have been proposed for
source detection, the family of pseudo-filters optimizes a measure of signal-to-noise and amplitude-scale
relation. In this paper we show that these filters are in fact only restrictive cases of a more general
class of matched
filters that optimize signal-to-noise ratio and that have, in general, better source detection
capabilities, especially for lower amplitude sources.
These conclusions are confirmed by some numerical experiments.
Key words: methods: data analysis - methods: statistical
The separation of different physical components is an important issue in the analysis of Cosmic Microwave Background (CMB) data. Among the foreground components, point-sources deserve especial attention given their extreme non-Gaussianity and highly variable spectral index. A brief summary of different methods can be found in Sanz et al. (2001) (henceforth SHM).
Methods used to detect point sources should act as high-pass filters to detect the high frequency structure introduced by the sources in the CMB data while masking other foreground contamination (like dust, synchrotron and free-free emission) characterized by lower frequencies. In principle, the good space/frequency characteristics of wavelets should make these functions an attractive choice for such tasks (e.g., Cayón et al. 2000; Vielva et al. 2001a,b, and references therein). Indeed, wavelets have been proved optimal for detection of point like singularities - at least for one-dimensional signals. Point sources in CMB maps, however, are not point singularities because the signal is smoothed by the beam of the antenna; sources are expected to have the shape of the antenna's pattern. The question is then how to include this beam profile information in the analysis. SHM considered this question and were lead to define optimal scale-dependent filters, that they named pseudo-filters, for source detection in CMB maps. However, we show that nothing seems to be gained with these filters and that other simpler techniques lead to better source detection methods.
We first set up the basic framework. Although the detection of point-like sources in CMB maps is a two-dimensional problem, we present our arguments in Rn, as in SHM, because the same methods may be used in other applications.
The sources are assumed to be point-like signals convolved with the beam of the measuring instrument and are
thus assumed to have a known profile
.
The signal
,
,
is modeled as
![]() |
(2) |
![]() |
(4) |
A classical method used to estimate source locations is based on identifying peaks in the cross-correlation
function
The cross-correlation function (5) does not take
into account the background characteristics.
This is a great disadvantage in cases where the power spectrum
is known or a good estimate is available.
In Sects. 2 and 3 we consider other methods that take into account this information
and that may be considered extensions of the cross-correlation method.
The basic procedure we consider is as follows. The signal is first filtered to enhance the sources
with respect to the background. This is done by cross-correlating the signal
with a filter
as in
(5) (with
in place of
).
The source locations are then determined by selecting the peaks in the filtered signal
that are above a selected threshold. Finally, the source amplitudes are estimated with the
values of the filtered signal at the estimated locations. The question we consider first is the selection of an
optimal filter
for such procedure.
We consider the general family of spherically symmetric filters
of the form
![]() |
= | ![]() |
|
= | ![]() |
(8) |
The first constraint on the filter concerns the second stage of the procedure; the source locations are assumed known
and the objective is to estimate the amplitudes. Given the assumed distance between sources, it is enough to consider
a field
as in (1) with a single source at the origin,
.
To estimate its amplitude we ask that
be an unbiased estimator of A - i.e.,
- so that
is required to satisfy the equation
Since
is chosen so that
is a minimum variance linear - in
- unbiased
estimator of A, it follows that (Gauss-Markov theorem)
is the (generalized) least squares estimate
of A achieved by the filter
For white noise,
,
filter (12) simplifies to
Having recognized
as a least squares estimator of A, we close this section with some remarks from
least squares methodology that we consider relevant. First note that, regardless of the spectrum P(q), the source
profile
properly normalized, that is
for
,
also provides an
unbiased estimator
of A. This is the (ordinary) least squares estimate that does not take into
account the covariance of the background; it is unbiased but not minimum variance. However, it is well known
that when the covariance is actually estimated from the data, the ordinary least squares estimate may
be better than the generalized one (e.g., Draper & Smith 1998). In other words,
uncertainities in the spectrum estimates may lead to worse amplitude estimates than those obtained with
the simpler cross-correlation filter. Uncertainties in the spectrum will also affect the selection of a
detection threshold.
Note also that the unbiasesness of
as an estimator of A depends on knowing the correct source
location, it is not necessarily unbiased once the source locations are estimated from the data. This is shown
in Sect. 4.2.
To determine an optimal filter ,
SHM minimize the variance of the filtered field subject
to two constraints: first,
is required to be, as in the previous section,
an unbiased estimator of A for some known
.
For the second constraint
is selected so
that
has a local maximum at scale R0. This constraint translates to
b ![]() ![]() |
|
c ![]() ![]() |
(18) |
We have seen that the second constraint (16) does not increase the detection level when the source
locations
are known. But this is not surprising since the constraint is defined to take advantage of the known source
scale to help determine source locations. We will show that even when source
location uncertainty is taken into account, (16) does not improve on
the simpler filter
based on the single contraint (10). In other words, enough information
about the scale of the source is already included in the derivation of the matched filter.
Therefore, it seems that nothing is gained with pseudo-filters.
In principle, as explained by SHM, constraint (16) can be used to test if a detection
corresponds to a true source by checking for its maximum at scale R0. But finding a spike at the correct
scale is not enough to make sure it is not a spurious noise artifact, we have to follow it across scales to
make sure that it scales appropriately. However, there is no evidence, either theoretical or based on
numerical simulations, that such a procedure with pseudo-filters is more effective than other simpler
approaches such as, for example,
classical tests (e.g.,
test) on the residuals after the subtraction of detected sources from the
signal. In any case, a similar scale tracking can also be designed for the matched filter:
given that the source
profile is assumed known, the matched filter leads to an amplitude versus scale dependence (see example in Sect.
4.2) that can be
determined and used for such a test. This functional dependence provides more
information than the existence of a local maxima and should improve source location estimates.
![]() |
= | ![]() |
|
= | ![]() |
(20) |
Gaussian sources
![]() |
(21) |
For a Gaussian profile and a power-law spectrum, Eq. (12) leads to
![]() |
(24) |
![]() |
(25) |
Figure 1 shows the filters
and
and their corresponding
Fourier transforms. It shows that the two filters are quite different. For example,
to provide filtered sources with the scale R0,
has to pass higher
frequencies than
.
This can be a problem for signals contaminated by high frequency noise.
![]() |
Figure 1:
Filters
![]() ![]() |
Open with DEXTER |
S/N=1 | S/N=2 | S/N=3 | |||||||||
![]() |
![]() |
![]() |
A | ![]() |
![]() |
A | ![]() |
![]() |
A | ||
![]() |
23 | 10 | 1.83 | 87 | 10 | 1.10 | 100 | 10 | 0.97 | ||
(15) | (5) | (1.94) | (80) | (5) | (1.14) | (100) | (5) | (0.98) | |||
![]() |
31 | 5 | 1.55 | 97 | 5 | 1.02 | 100 | 5 | 0.98 |
Table 1 shows the average number of correct and incorrect detections obtained with
and
and a
fixed
threshold for signal-to-noise (S/N) ratios equal to 1, 2 and 3. The two filters give equivalent
results for higher S/N sources. We see that
leads to a higher number of
correct detections and a lower number of
incorrect ones. But a proper comparison should take into account that the filters require different
thresholds. The second row for
shows the corresponding results when the threshold is chosen
to lead to the same average number of rejections as
.
For low S/N sources
leads again to a
higher number
of correct detections while for larger S/N they give similar results.
To compare amplitude estimates that include location uncertainty, we take the average of the amplitudes (since all
the generated sources have the same amplitude) of all detections. The results are shown in Table 1.
The errors in the amplitudes are of the order of 0.5% or less. We see that amplitude estimates are biased when the
source locations are estimated, and that the bias is larger for .
For low S/N the amplitude is
overestimated
because high peaks are easier to detect and noise peaks are incorrectly classified as sources. For high S/N sources
we also have centering problems but this time the smaller amplitude in the noise peaks leads to underestimated
source amplitudes. We can draw similar conclusions from the results of two-dimensional simulations
shown in Tables 2 and 3.
To conclude, note that the spatial support of the filters is an important factor when the assumption of
nonoverlapping filters is
invalid.
The support must be small compared to the distance between the sources.
Figures 2-3 show (for n=1,2) the cumulative energies
![]() |
(26) |
![]() |
(27) |
S/N=1 | S/N=2 | S/N=3 | ||||||
![]() |
![]() |
A | ![]() |
A | ![]() |
A | ||
![]() |
0.89 | 1.14 | 1.00 | 1.01 | 1.00 | 1.00 | ||
![]() |
0.99 | 1.03 | 1.00 | 1.00 | 1.00 | 1.00 |
S/N=1 | S/N=2 | S/N=3 | ||||||
![]() |
![]() |
A | ![]() |
A | ![]() |
A | ||
![]() |
0.38 | 1.79 | 0.93 | 1.11 | 1.00 | 1.03 | ||
![]() |
0.40 | 1.78 | 0.93 | 1.11 | 1.00 | 1.03 | ||
![]() |
0.47 | 1.60 | 0.97 | 1.07 | 1.00 | 1.02 |
![]() |
Figure 2:
Cumulative energies ![]() ![]() ![]() ![]() ![]() |
Open with DEXTER |
![]() |
Figure 3: Same as Fig. 2 but with n=2. |
Open with DEXTER |
The first constraint is typical in matched filter methodology where S/Nis maximized. The optimal filter provides unbiased least squares estimates of source amplitudes at known source locations. By the optimality of least squares, these amplitude estimates can not be improved with any other unbiased linear filter. However, uncertainities in source locations introduce a bias in amplitude estimates. Amplitudes are overestimated at low S/N and underestimated at high S/N. A second constraint introduced by SHM is designed to improve estimates of source locations by maximizing amplitudes at the correct scale. We found that this constraint does not lead to better estimates as compared to those obtained with a simpler matched filter, especially for lower S/N sources. For high S/N sources the results of the two methods are the same. These results contradict previous optimality studies of wavelet based filters and pseudo-filters for detection of point-sources of known profile in an isotropic CMB background of known spectrum.