A&A 370, 649-671 (2001)
DOI: 10.1051/0004-6361:20010092
F. G. Boese - S. Doebereiner
Max-Planck-Institut für extraterrestrische Physik, 85741 Garching, Germany
Received 31 October 2000 / Accepted 9 January 2001
Abstract
The point source parameter estimation via Maximum Likelihood Estimation in
ROSAT data is documented. The infall of X-ray photons is treated as a
Poisson process. Based on the Poisson model, the log-likelihood function for
unbinned data is derived. The adopted source and background models are
explained. In the chosen parameterization, the Maximum Likelihood equations
and their numerical solution are expounded. The performance of the method is
illustrated by an example and tests on simulated observations.
Key words: X-ray: general, space vehicles: instrumentation: detectors, techniques: image processing
In a class of contemporary X-ray observatories, the observing instruments consist
of one or more sets of nested telescopes and one or more detectors in the focal
plane. The detectors are often designed to register single X-ray photons. Several
geometrical and physical quantities of individual X-ray photons are recorded, see
Trümper (1983). In
the case of the ROSAT observatory, these are (a) the time of photon arrival, (b)
the impact location in the detector, (c) the amplitude of the detector signal,
and (d) the infalling direction
.
Depending on the ROSAT observation mode, the instrument's optical axis is,
relative to the observed target, nominally fixed or moving in time. During an
observation time interval, a number of photons is collected. After data preparation,
the observed sample forms the data set to be analyzed.
The majority of all observed celestial sources are point sources. The capacity of analyzing point sources with respect to (a) X-ray flux, (b) celestial direction and (c) extent (if larger than but akin to point sources) thus makes it possible to treat a large percentage of all observed sources routinely.
In this work, the technical details of the statistical inference from data to point-like and circular disk shaped X-ray sources are investigated. To X-ray flux, celestial direction, and source extent correspond point source parameters. Estimation of these parameters using Maximum Likelihood Estimation (MLE) will be described in general and also some of the peculiarities of the implementation within the ROSAT data analysis system EXSAS. The latter software system assists in analyzing ROSAT observational data, see Zimmermann et al. (1998). It is distributed to more than 300 observers in 75 institutes worldwide.
Up to December 2000, about
scientific papers based on ROSAT data
appeared in journals. Among the many ROSAT catalogues
are the ROSAT All-Sky Survey Bright Source Catalogue with
sources
and the Source Catalogue from Pointed PSPC Observations with
sources,
see Voges et al. (1999). A considerable percentage of all
related data analysis was carried out with the SASS or EXSAS analysis
systems - an imperative reason to document the underlying method.
Apart from a preliminary conference report by Cruddace et al.
(1987), no technical description exists, internally or publically.
In Sect. 2, the observed ROSAT sample is presented. In Sect. 3, the source model is introduced and the logarithmic likelihood functional for unbinned data is derived from basic principles in considerable generality. Background model and point source model are set up in Sect. 4. The maximum likelihood equations are compiled in Sect. 5. Statistical errors for the estimated source parameters will be discussed in Sect. 6. The iterative solution of the maximum likelihood equations is treated in Sect. 7. A point source estimation example is contributed and performance tests on simulated observations are presented in Sect. 8. The finishing Sect. 9 contains the discussion followed by the conclusions in Sect. 10. An Appendix, Sect. 11, explains EXSAS commands related to the MLE of point sources.
The ROSAT satellite was launched on July 1, 1990 into a circular orbit with an
inclination of
and an altitude of 580 km. The last observation
was made on December 18, 1998. The ROSAT detectors were (a) the Positional Sensitive
Proportional Counter (PSPC), (b) the High Resolution Imager (HRI), and (c) the
Wide Field Camera (WFC). Two telescopes were in operation: the larger one for the PSPC
and HRI, the smaller for the WFC.
During an observation interval
of length
starting
at time
and ending at time
,
the instrument consisting
of a telescope and a detector registers a number
of photons
. The mth photon,
,
comes from the
celestial direction (Xm,Ym), arrives at time t=tm, and hits the
instrument's focal plane at the detector location (xm,ym). The photon
causes a detector signal amplitude Am.
Upon completion of the observation and data pre-processing, we are given a
6-variate time-ordered photon series
Source spectra can be formed when the detector discriminates between photon energies.
Often, detector amplitudes are the primary measured quantities and not the photon
energies. Then, spectra are estimated by statistical inference in the detector's
amplitude space
to be presented here.
Let the instrument be designed for the photon energy space
of width
.
A single infalling photon of energy
causes an electrical impulse of amplitude A containing a superimposed (small) random amplitude
in the instrument's amplitude space
of width
.
The response distribution to a Dirac delta input single line photon spectrum
,
of energy E has the probability density
| |
Figure 1:
The
|
| Open with DEXTER | |
According to (1), a photon
in ROSAT data
has six coordinates t to A. Four of them, t,X,Y,A, are for MLE of prime significance
and span the photon space
.
This is the space in which all calculations
are done, where the sought distributions are accommodated. To understand this space
thoroughly, we follow the photon's optical path through the telescope to the detector
plane. Its relation to the (X,Y) celestial directions space will be explained.
The equatorial system with mean epoch 2000 is taken in ROSAT as the principal astronomical coordinate system.
Disregarding all distances from Earth's centre to celestial objects leads to
the unit sphere
in 3-space referred to equatorial coordinates
.
The choice of the distance unit is unimportant.
The null direction
is the vernal equinox,
the angle
is the right ascension and the angle
the declination. Seen by a ROSAT detector, there is a photon count rate
distribution
representing the celestial directional distribution
in X-rays at time t, in photon detector amplitude A, and coming from direction
.
ROSAT took samples from this celestial distribution
.
Count
rates will be formally introduced in (18).
ROSAT's pointing direction
moves in time
t due to (a) a possible survey motion, (b) a normally applied wobbling motion, and (c)
an unavoidable stochastical motion caused by the attitude control loop dynamics and, at that, excited
by external stochastical forces. The wobbling motion will be explained in Sect. 2.3.3.
The direct approach to calculate with data on the sphere
was not
taken. Instead, spherical data are mapped to a tangential plane referred to
Cartesian coordinates. In this plane, all directional calculations are actually effectuated.
At the end of the analysis, the calculated results are mapped back to the sphere, i.e.
expressed in equatorial coordinates
. We shall encounter the mapping details
by explaining the geometry and kinematics of the data acquisition.
Consider the pointing direction
as it evolves in
time t. For any fixed time, a tangential Cartesian coordinate system
with origin at
p' is erected. The Z axis goes through the origin of the unit
sphere and the point p'. The X and Y axes lie in the tangential plane
with tangency point p'. The plane
is termed sky space. It is thus
adapted to the individual pointed observation. To explain the axes' directions, consider
the two oriented, mutually orthogonal circles of constant right ascension
and constant declination
through p',
The two coordinates in the sky space
are termed sky longitude X
and sky latitude Y. Both coordinates are measured in angular units, in
units of
arcsec. This unit is appropriate for the spatial resolutions
of the three ROSAT detectors.
A fixed celestial direction
is mapped by A(t) at time t to the sky point
(X(t),Y(t)). All ROSAT detectors have fields of view in the order
.
Evaluating A(t) for small fields of view, i.e. for
yields
| |
Figure 2:
The images of the circles
|
| Open with DEXTER | |
The one-to-one image
of
under the attitude mapping A(t)
is distorted in shape by the aereal distortion factor
appearing in
From (8), it follows near the origin
(X,Y)=(0,0) in the sky space
,
i.e. for
,
Photon impact locations in the detector plane
are measured in the plane Cartesian
detector coordinate system
.
The origin,
,
is the piercing
point of the instrument's optical axis in the detector plane. The detector field
contains the field of view
and is parallel to the axes in
,
i.e.
.
Figure 3 shows the field of view (black) of the ROSAT PSPC detector.
The detector field
is a bit larger than the smallest square parallel to the axes
containing
.
![]() |
Figure 3:
The PSPC field of view |
| Open with DEXTER | |
Taking into account the different units in both spaces leads to
the invertible linear mapping between sky space
and detector space
Assume a time-constant nominal pointing direction p'0 and imagine
a source which falls on a rib in Fig. 3. Such a source is totally censored.
In order to lessen the detrimental censoring by the ribs, a periodic wobbling motion
w(t) in saw-tooth form and with a time-constant tangential vector W is
superimposed to the effect that the said photons are now distributed along a line
segment crossing the rib (when the rib and W are not parallel),
The sample
from (1) lies in the instrument's photon space
.
This is
the Cartesian product of the observation interval
,
the detector field
,
the sky space
,
and the detector's amplitude space
,
As mentioned in Sect. 1, some state-of-the art detectors register the X-ray photons one after the other in time, resolve them with respect to energy and infall direction.
A given instrument has its individual photon space. Its dimension may be lower,
equal, or larger than that of (15). The constitutive factor spaces may be
different from the factors
in (15). In this section,
is the photon
space of any astronomical photon counting instrument.
However, to be not overly abstract in this section, a paradigmatic choice is made: photons
possess an arrival time t on a certain (X,Y) target space and transport an
energy E. Let
be a sample of size M in this section.
It is appropriate to view the infall of photons as a stochastic
counting process in the instrument's photon space
.
As X-ray photons are observed one by one (by introductory assumption), a stochastical point
of view is adopted in the pertinent parts of this article.
A Poisson emitter in photon space
is the stochastic X-ray source model
adopted for ROSAT data and defined by (P1), (P2) in (16) below.
Let
be an arbitrary set in photon space and
be the random
number of photons falling into
.
According to the definition,
of a Poisson process, see Parzen (1962,
p. 118), two properties of the process
are implied:
The local mean count rate,
,
introduced in (17)
depends on all four photon parameters spanning
and is assumed to be bounded,
In possession of the sample
and having specified the parent source
model, we have to decide by which statistical estimation procedure we wish to
infer
from
.
Among several standard estimation procedures at disposal, estimation via MLE has proven to be appropriate in astronomical application, see Cash (1979), Strong (1985) and also in X-ray astronomy with ROSAT, see Cruddace et al. (1987).
In X-ray observations, one has often to cope with weak sources. Due to observation time restrictions, not more than a handful of photons are collected from weak sources. Any loss of data information due to binning is unacceptable. Technically spoken, many bins without counts are the rule. As one might expect from (P2) in (16) and as we shall see in (22), the statistics associated with MLE for photon counting is based on the Poisson distribution - a distribution for rare events. It is applicable for small occupation numbers. So, the choice of MLE is almost mandatory for ROSAT data under small source sample sizes and/or high precision requirements.
Within the MLE we have to establish the log-likelihood function, L, for
our photon counting experiment. While the likelihood
is in our context the
prime probability quantity, it is technically preferable to assign this role to
the exponent
.
The L, seen as a functional of the hypothetical count
rate, is proportional to the logarithm of the probability that the observation
emerged from the assumed count rate, given the observation data.
To start with the derivation of L, imagine a partition
of the instrument's
photon space
into finitely many bins
.
The independent increment
property (P1) in (16) allows us to partition the photon space
arbitrarily
into finitely many parts of positive 4-volume and to have identical independent
distribution forms in each bin
.
We are free to design
bins in form of Cartesian products so that the partition
has the bins
We wish to get rid of the loss of information incurred by the finite binning (21).
To this end, we consider an infinite sequence
of
binnings of the photon space
with the infinite resolution property
The same observation appears then in infinitely many binnings. We wish to investigate
the likelihood
in the passage
.
We split the probability
in (22) into three factors. The first one,
,
comprises all normalization factors and the remaining two are the contributions from the
void and occupied bins,
1. The normalization factor. Taking the logarithm and summing over all bins
leads with the total number,
,
of expected counts from (17)
immediately to
2. The contribution of the void bins. By definition,
3. The contribution due to the count rate. By hypothesis,
is continuous. Applying
the mean value theorem to the integral expressions
in (22) entails in terms
of the general point (t,X,Y,E) in the photon space
and the observed photon positions
4. The contribution due to the bin volumes. The product
depends on the occupation numbers, on the binning
,
but not on the count rate
.
Obviously, for any positive number M of total observed counts holds
The renormalization quantity
was purposely defined so that
we have with the number of expected counts
from (25)
The Maximum likelihood principle directs us to choose the
maximizing count rate,
,
as estimate for
.
Generally, the maximizer
need neither to be unique nor existent - an unhappy situation with unpleasant
numerical consequences, when really given.
The
from (31) is the general log-likelihood functional for a
Poisson counting process in the paradigmatic instrument photon space
for the hypothetical
count rate
from (18), (20), given the
photon data sample
from (1).
As explained, the integral in (31) is the total (not necessarily integer)
number,
,
of the expected photons falling into the entire photon space
and thus the model counter part of M, the number of observed photon counts,
In some cases, the model count rate depends in a special way on all parameters.
Let the full model parameter vector
be split into two groups
p1,p2 spanning
such that the count rate
factorizes,
The likelihood function
for
from (31) becomes in terms of
the likelihood functions of the two count rate factors,
with
the associated expected counts
from (32)
The parameterization of
is still open. It is desirable to have the number
of expected photons,
,
among the parameters as the counter part for the number
of observed photons M. This can be achieved upon the factorization
.
For notational convenience the prime on
is dropped again.
In this parameterization and notation, we arrive at the representation
Often, if not always, the count rate is conceived as an additive mixture
of several sources. In the simplest case of a binary mixture, one discerns
the source count rate of proper interest,
,
and the remaining count rate,
called then background count rate
.
We split the count rate
accordingly with (unknown) mixture proportions
The derivation of L for unbinned photon data is basically known, see Cash (1979) for a less elaborate but excellent treatment. Some details are worked out here.
In practice, the resolution limit is given by the detector binning
.
The
limiting likelihood
from (31),(35) for unbinned data
can only be reached up to a certain binning error
which we now wish
to assess.
We do this by observing first the photons with the fictitious detector of
unrestricted resolution. Then the ideal observation is compared with the real
observation in the finite binning
used in the data analysis. In the latter,
only the bin centre
of the bin into which the individual photon
falls is retained leading to uncertain photon positions. Hence, a finite photon
binning entails unavoidably the first error of the list
Let
be a binning of
not finer than
with bin numbers I,J,K,L'.
We approximate the bin-averaged count rates,
For a given binning
and a chosen tolerance
,
the third and fifth line
in (41) to establish observation feasibility margins on the variability
of the count rate
in time, space, and energy in terms of
first and second partial derivatives of the count rate
.
The replacement of bin-averaged count rates
by the right-hand
side of (40), as done in EXSAS, yields an enormous gain in
calculation speed. A further gain in speed is brought about by the adaption of the
photon space to the task. Joint light curve, spectral, and directional
source estimations were possible with (31) but were not included in EXSAS.
Spectral and light curve estimation is delegated to other EXSAS tasks. Therefore,
the photon space is in time and detector signal amplitudes A (replacing the energy
E from Sect. 3.1) trivially binned, L'=1 and K=1. In other words, photons
of all arrival times and of all detector signal amplitudes falling into selectable
intervals can be included,
The point source detection sensitivity depends on the imaging properties of the
instrument and the background against which a point source is to be detected.
For the ROSAT detectors, four background count sources are relevant: (a) instrumental
background, (b) charged particles induced background, (c) scattered solar
radiation, and (d) diffuse celestial X-ray background. The contributions from
these background sources for the three ROSAT detectors are compiled in the
instrument description, see Trümper (1991a) and the detector specific papers.
For instance, the ROSAT PSPC detector has no instrumental background. The particle
induced background contributes
over
of the observation time and is uniformly distributed over the detector
field
.
As in the general case (36), two separate types/classes of photons are distinguished, background photons and source photons. The first ones are those which are defined not to come from the observation target. The definition itself will be made in Sect. 4.1.3.
Apart from the parameterization in (36) we assume in (18) an additive
two-component-mixture of a local mean background count rate
and a
local mean source count rate component
in the count rate vector
,
According to the ROSAT photon space (15), we parameterize the three
in (43) by amplitude A instead of energy E. The transition from count
rates to counts follows. With the trivial time and amplitude binning from (42),
and the fine binning (21) of the sky space we have with
the
hypothetical bin counts
To find an initial estimate on source number and location is the first challenging task within MLE.
Point sources appear in spatial count space as local count enhancements in a form
given by the PSF in effect. In a brute-force approach, source regions of a form
and size compatible with the PSF are hypothesized and all their translations and
dilatations in a specified search region
are tested at a selectable significance
level to contain count variations. Testing all the hypothetical source regions
guarantees that no one can be overlooked. The form of the region, however, is
kept fixed.
First, specify the region of interest
.
Small hypothetical
source squares
parallel to the axes, called windows in some
publications, of linear half-width
around a central pixel
position (i,j)
together with a background pixel neighborhood
around the source
square are chosen. Denote by
the total exposure
times for the respective suffix regions. The number of counts
,
,
falling into
and the number of counts,
,
falling into
are
determined,
![]() |
Figure 4:
The |
| Open with DEXTER | |
The exposure time histogram
has the binning from (46).
Figure 5 shows an example of this histogram for the PSPC detector.
![]() |
Figure 5:
The
|
| Open with DEXTER | |
Does the central pixel (i,j) belong to a source region? In order to lend this
question a statistically testable form, we formulate a null hypothesis in terms
of the spatial count rates
from (45) but
without the spatial integration in the suffix regions from (48),
Let
be
the related mean counts from (17),(45). According to (P1),(P2)
from (16), the probability
to observe
counts
in the region
conditional upon
being fixed is binomial.
This probability and the cumulative probability,
,
to observe
or more source counts are given by
![]() |
Figure 6:
The likelihood sequences Lk for
|
| Open with DEXTER | |
The evaluation of the binomial cumulative probability
from (53)
becomes impractical for large count numbers n and k. The univariate version of
the Central Limit Theorem supplies us with the classical asymptotical approximation for
(with the notorious slow convergence speed
)
in terms of the univariate
normal density. Under the change
of the independent variable t, the
approximation can be rewritten in terms of the density
,
of the
distribution
with one degree of freedom. This approximation is used in the central region
.
Additionally, a lower bound sequence
of excellent approximation quality is used. Figure 7 gives
the error sequence
with lk being the said approximation.
For high counts in
(see (57)), the likelihoods Li,j are approximated
by the tail probability of the
distribution with one degree of freedom,
![]() |
Figure 7:
The likelihood error sequence
|
| Open with DEXTER | |
Each pixel position (i,j) satisfying (52) becomes a point of the
support of the (first) likelihood histogram
In the second step, the background estimate
is found.
S:=S1 circular disks around the source positions with radii
and a union
thereof serve to define the background domain
,
Figure 8 shows an example for the background photons (black dots) corresponding to
,
introduced in (2) in binned form
.
![]() |
Figure 8:
Censored
|
| Open with DEXTER | |
A bivariate least square cubic spline surface
over
solving the optimization problem,
The spline surface
is the adopted count distribution estimate
over the sky field. So, we assume the
in (45) known.
Knowing the background estimate
,
the source determination
can be redone. This is the third step. From
the background
histogram
of the same format as
from (46) is formed. The
background regions and background counts are redefined,
The image of a celestial point source in a fixed celestial direction
at infinite distance, is, by definition, the point spread function (PSF).
In the case of the ROSAT detectors, the PSFs are parameterized by the (a)
off-axis angle
and (b) the photon detector amplitude A. The off-axis angle
is the angle which subtends the PSF centre with the instrument's optical axis.
As a compromise between practicability (in terms of computing time) of the resulting
MLE procedure and model precision, a bivariate normal circular symmetric distribution density,
p, defined over the whole sky space
is used to represent the measured point
spread function in EXSAS for the MLE of single X-ray point source parameters.
In statistical interpretation, the probability dP to find a photon in sky space
in a box around the central point (X,Y) with infinitesimal side lengths dX and
dY when a photon arrived at the point (x,y) in the detector plane causing a
detector amplitude A is d
.
The spread
parameter
appearing in (64) will be explained in the sequel.
More precise point spread function models of the ROSAT instruments are used in
EXSAS for other analysis purposes, see Boese (2000). For instance, the distribution
tail in the measured PSF is higher than that of (64). The celestial
local mean source count rate
in (64) entails the count
rate
in sky space
To account for extended sources, the spread parameter
,
in fact a
double variance parameter, containing additively the nonnegative source
extent parameter
,
has been introduced. Though the source parameter
does not estimate the full morphology
of extended sources, it offers the azimuthally averaged part of the
morphology as a parameter of source extendedness. In operational terms,
a radial two-dimensional Gaussian photon distribution is fitted to an extended
source photon distribution.
Apart from the form of the distribution density
,
two further functions characterize the instrument. One of them is the instrument's
vignetting function,
,
fulfilling
A prominent property of the Poisson process is that it is preserved under random
sampling. We view the (non-)vignetting as a realization of random sampling, and
the infalling local mean source count rate
diminishes under vignetting
from the on-axis maximal count rate
to the lower vignetted count
rate
,
According to the source model (64), we have four single-source parameters
,
representing in turn: the source position
,
vignetted source count rate, and source extent parameter.
Depending on the context, we regard either
or
from (66) as the
basic count rate parameter. Only the first one is observable, and the one of eventuel physical
interest is
.
The optical axis moves across the Sky in time. Correspondingly, a celestial X-ray
source in a fixed celestial direction traces in the detector plane a source trajectory
during the observation interval
.
The motion
in time-parameterized form and
parameterized by the off-axis-angle
,
viewed as a random variate with distribution
,
for the individual trajectory
are
As mentioned in Sect. 2.3.3, in pointing mode, the motion (67) is
nominally constant apart from a standardly applied deliberate small wobbling motion
of the optical axis and superimposed by a still smaller stochastic motion due to
stochastic forces. So,
is expected to be unimodal with a mode position near
,
the source position
in detector coordinates.
In survey mode, the source under consideration executes (under the ROSAT survey scanning
scheme) a series of neighbouring parallel tracks in the field of view
.
Hence, a density
close to the uniform density
is expected. To save
execution time within MLE in EXSAS, no off-axis histogram approximating
has been
constructed and used. Equally, the individual density
was not explicitly
replaced by more general and thus less close approximations to
.
The underlying
rationale will be discussed.
Substituting the decomposition (43) into (31) gives the likelihood
functional L from (31) in the parameterized form
Conceptually, the number
of expected background photons in (68) is constant with respect to all four source parameters; to source
position
,
mean local source count rate
,
and to the source extent parameter
.
This is also in practice so, save for cases in which the background
region
from (60) is small compared to the field of
view
.
The (real) number
of expected source photons in (68) depends
in source positions near the boundary
of the field of view
remarkably on the
source parameters - as the following arguments clarify.
Assuming a continuous local mean count rate
with respect to all four arguments,
the mean value theorem applied to the integral
gives with an average
mean local source count rate
(to be determined) and a mean off-axis angle
(to be determined) the exact representation
Under the trival binning (42) with respect to time and amplitude, no
resolution with respect to these variables is achieved and only the time- and
amplitude-averaged parts
of a source count distribution
can be estimated -
a limitation without detrimental consequences when one aims at celestial directions and fluxes
only.
We reverse the argument when we state: for
For a source possessing all three properties (S1) to (S3), i.e. for stationary
amplitude-uniform point sources, one arrives with an unknown mean off-axis angle
and a constant on-axis source count rate
at
It is justified to ignore censoring by the field of view
when the source
position
lies in the central part of the field of view
.
The
fraction of photons not falling into the field of view,
Under the approximations (69),(73) for stationary amplitude-uniform
point sources, it follows the
log-likelihood representation
The mean off-axis angle
is taken constant with respect to all four source parameters.
All that follows will be based on this form of the log-likelihood function L. The form of L in (75) or (74) is the one used in EXSAS.
R. A. Fisher coined in 1921 the term "likelihood'' and in 1922 the "maximum likelihood estimate''. Readers interested in the historical development are referred to Aldrich (1977), Edwards (1997), Hald (1999), and the literature cited therein.
Having specified the distributional form of the underlying source count distribution
density in (75) up to the source parameter vector
Among several point estimation procedures, we chose estimation via the Maximum Likelihood Principle.
The latter directs us to choose
so that with L from (75) holds
In view of the differentiability of L with respect to
,
the equations for
By Sylvester's famous algebraic criterion, we know that a real
matrix
is negative definite if and only if all its four principal minors d1 to d4
are negative,
We come briefly to the non-generic case. There may be special cases
where the observation dictates a dedicated analysis effort. When
is
a boundary point of
at least one component of
assumes a
boundary value. The ML equations are to be altered accordingly,
Sources of interest lie seldom near the boundary of the field of view
.
However, a vanishing source extent parameter
is not uncommon. The
same applies to a vanishing estimate for the source count estimate
in
weak source count enhancements, cf. Sect. 6.1. The second condition in (83) is
equivalent to
and is fulfilled for a high background or small
number of source counts within the observation time.
In both cases, the corresponding inequalities in (83) are to be satisfied. Estimations for sources near the boundary
of the field of view become difficult because of data censoring and tend
to be unreliable and are not pursued any further here.
To solve the ML Eqs. (79) and to assign error bars, all partial derivatives
of first and second order are needed. We consider
as a constant with respect to the point source parameters
,
and
arrive with the denotations of (75) and
as
well as
at the representation for the four first
partial derivatives of L,
Two questions are fundamental for the physical interpretation of the source parameter estimates:
Let
be the requested significance level of the test and let gM(l)
be the sample frequency function of the test statistics l from (90).
The
-quantile,
,
is the solution of the equation for
To avoid the appearance of the sample frequeny function from (91), we resort again to large sample sizes.
According to Doob (1934), Wilks (1938), for sample size
the distribution
gM(l) tends to the
distribution with
degrees of freedom with
More delicate are the error bars from (b) in (87). The defining maximum
relation (77) for the ML source estimate vector
defines a random
vector when the photon sample
is viewed as a set of random vectors. As such,
possesses a sampling distribution, say
given by (91)
with l as upper integration limit. Then, one can calculate the mean vector
,
its sample counterpart
,
and the elements Cj,k of the
covariance matrix
,
For large sample sizes, see Wald (1943), the distribution
tends
to a multivariate normal distribution with zero mean and covariance matrix with elements
according to (96) at
,
Let
be a nonvanishing (column) p-vector and
a real, symmetric,
positive definite, simple
matrix. Then for any positive number h>0 the
following equality holds:
In EXSAS, the
-error bars are calculated. As usual,
codifies the
significance level
This level corresponds exactly to the quantile
value
.
The statistical interpretation of the EXSAS error bars,
,
for the unknown true point source parameter sk is then:
in
of all MLEs, the containment relation
holds good - under the caveat of
a large number of source photons.
All X-ray sources are variable on source-specific time scales. A certain source
found in one observation may not show up in another observation. The task is
then to prove quantitatively the non-detectability of a source in a given direction.
We wish to assert: based on the observation, with a selectable probability
,
a
hypothetical source cannot have a count rate
exceeding an upper
limit count rate
.
Assume a (hypothetical) source position
with source extent parameter
is given. For such a source with these parameters and an unknown source
count rate
,
EXSAS offers the (approximate) calculation of the upper limit of
a confidence interval,
,
containing the unknown
true source count rate parameter
with a selectable confidence level,
,
see the EXSAS command (125). The statistical interpretation is: the unknown
true source count rate
obeys in the fraction
of all MLEs the inequality
The subject of estimator variances in astronomical context is a recurrent theme. The works of Avni (1976), Cash (1978), Strong (1985) belong to a larger body of literature.
Given an initial estimate,
,
for the source parameter vector
,
we wish to set up a numerical iterative algorithm that generates a sequence of
refined estimates
The first step towards this goal consists in writing three of the four equations
in (79) in fixed-point form. The third equation thereof,
,
the local mean count rate equation, will be treated differently.
As mentioned in Sect. 5.1,
entails the vanishing maximizer
.
In the remaining case
a positive
local mean count rate
maximizes. In this case, we obtain
from (79) the system of four equations,
three of them in fixed-point form,
The local mean count rate equation will be treated separately. Given
,
we
consider
as an equation for
alone and denote by
The monotonicity property
Lemma 1 (MCR existence, uniqueness, enclosure)
The local mean count rate equation
Once an enclosing interval
for the source count rate
is available, the length of the enclosing interval can be shrunk to any given
length
by bisection or otherwise.
Using
in the third
equation,
,
of (106), its approximative solution to precision
is denoted by
.
Now, given an initial estimate
formed with that
,
its successor estimate
is calculated by performing the iteration step from the kth iterate to the (k+1)th,
![]() |
(118) |
Again, the success of the iteration hinges on the closeness of the initial estimate
to the unknown true maximizer. Therefore, EXSAS uses an elaborate process
to find positional components in the initial estimate.
![]() |
Figure 9:
|
| Open with DEXTER | |
In practice, MLE was tested innumerable times. The ROSAT All-Sky Survey (RASS) described by Voges (1992,1999) is a monumental instance. Also, several thousands of pointed ROSAT observations were processed by MLE as described, see for example the group of contributions prefaced by Trümper (1991b).
We shall illustrate the performance of the MLE twofold. Firstly, we apply MLE to a field with many point sources. Secondly, MLE was tested on simulated data. Simulated observations give the possibility to compare estimated parameter values with true values under the conditions of the implemented MLE (and test) software.
Figure 9 shows a
field around the ecliptic north pole
observed during the ROSAT All Sky Survey in the energy band
[0.1,2.35] keV. The exposure
time of the image centre was 15.5 ks. Only the photons which fell into the inner
19 arcmin of the ROSAT PSPC field of view were selected resulting in the image
Fig. 9. The darker strips emanating from the central yet darker region are caused
by a varying exposure time due to the ROSAT survey attitude. Figure 10 shows
the same region augmented by point source positions. The little dark circles
indicate the 545 point source positions found by MLE.
![]() |
Figure 10: The ecliptic north polar region as in Fig. 9. Circles indicate point source positions as found by MLE |
| Open with DEXTER | |
Many tests with simulated observations were performed in order to study the performance behaviour of the implemented MLE software in the high-dimensional parameter space spanned by the source photon parameters and additional hidden internal parameters. Among the varied input quantities in the test series were: source strength, source offset, background strength, and cut-off radius. Several dozens of output quantities were collected and plotted as function of the input quantity. As an typical example, we report on the source position error as a function of the source strength.
Figure 11 shows the stochastic process of the horizontal positional errors
between the ML estimate
and the known true source position
component
as a function of the source photon count number
as parameter.
The plot is scaled such that all errors eX fall within the vertical plot range.
Evidently, the sample standard deviation
becomes smaller from left to right in
Fig. 11, i.e. for increasing source photon numbers
.
This checks with the
expected
behaviour for large number of photons
.
Each of the 500 black dots means (a) the creation of a photon series
from (1) simulating a source with a random source position
which is (b) afterwards
subjected to a MLE analysis carried out with the help of the EXSAS commands described in
the Appendix. Judged by eye, no bias around the horizontal axis eX=0 seems to be
present for source photons above
.
An analogous picture is expected for the
vertical positional error
in Fig. 12. Again, all errors eY
fall within the vertical plot range which is dilated by a factor 2 compared to Fig. 11. This stretching is due to the randomly large errors eY for small number of
source photons, cf. eY=0.45 image pixels for the upper left dot. A closer look
reveals
as expected.
![]() |
Figure 11:
Horizontal positional errors
|
| Open with DEXTER | |
![]() |
Figure 12:
Vertical positional errors
|
| Open with DEXTER | |
A similar sliding window technique as the one described in Sect. 4.1.2 has been used in the analysis of EINSTEIN data, see Harris & Irwin (1984).
Already in the title, the application of the MLE is restricted to single sources. Thus, pairs of poorly resolved sources, less frequent triples of neighbouring sources or, generally, clusters of several sources are not admissible. In this context, distances are measured in units of the PSF width. The source parameters of source clusters must be estimated jointly. In a different EXSAS application, MLE for source groups is possible under the command name FIT/MULTI_SOURCE on image data.
At the end of Sect. 3.1, the motivations for selecting MLE were named. Besides ML estimation, source parameter estimation based on maximum entropy or wavelets are known and in use - to name only some recent methods. For the application of wavelet decomposition to ROSAT data, see Lazzati et al. (1999). A comparison of different estimation procedures is desirable, a task on its own, and far outside the scope of this article. An estimation procedure that is superior in all important respects seems not to exist.
The comparison of the hypothetical count rate distribution is made in the space
of the observational data. This is the conventional strategy to circumvent the
(ill-posed) inverse problem commonly referred to as deconvolution. After having
estimated source parameters
,
the
conversion to the source count rate
and source flux is to be done. As this
task is outside of MLE, details are not described in this work.
In Sect. 1, the Standard Analysis Software System (SASS) , cf. Voges et al. (1999, p. 223), was mentioned in passing. All ROSAT catalogues were produced with SASS. Meanwhile also a ROSAT all-sky survey of faint sources has been made public. For a list of all ROSAT catalogues see Voges et al. (2001). In relation to SASS, the EXSAS system is an extension of the former.
One implementation feature in EXSAS and SASS is not satisfactory. As mentioned, MLE on unbinned photon data is appropriate for weak sources and works down to, say, thrice as many photons per source as there are source parameters (i.e. 12 photons). On the other hand, the source parameter error bars need a large number of photons per source.
In practice no unrealistic error bar sizes were observed. Nevertheless, a better approximation to the small sample frequency distribution is a desideratum - a convergence proof for the estimate series (114) is another.
Error assignement in small sample situations is not a privilege of observational astronomy. Attempts to attack the problem are known. This is, however, not the place to review the current situation.
As remarked in its second paragraph, Sect. 8 provides only an illustration of performance and precision issues. A full account, however desirable, does not fit into the frame of this work.
Finally, we state two perturbation problems. The instrument's PSF in effect in observation is known to be different from the one used in data analysis, here via MLE. Also, the background component obtained from the estimation cannot be regarded the best possible. So, we have to raise the problem of stability of MLE against PSF variation and against background variation.
Let
,
be the ML estimates of the source parameter vector with
PSF1,PSF2 in effect, respectively. Does the convergence in probability
take place for
? If the answer is
affirmative, how fast is the convergence? Practicable measures of sensitivity
are to be constructed. Replacing in the problem statement "PSF'' by "background''
yields the problem statement on background. Both affirmative answers lie at the
heart of observational astronomy, known to be utterly succesful. But a formal
problem solution tells you how precise PSF and background have to be modelled.
For clarity, we repeat the conditions for the application of the likelihood function L from (31) or (35) for a sample of single photons as follows.
The sample must be drawn from a population which follows a Poisson distribution with respect to the parameters to be estimated. No assumption on the geometrical form of the sample in the associated photon space is made. Thus no restriction of the instrument's PSF is imposed. Equally, no (non-trivial) lower or upper bounds on the sample size and photons per source exist for MLE as method. However, the statistical quality of the parameter estimates increase with increasing source photon numbers. The observed photon positions are imprecise due to binning. In the strict sense, the resulting MLE estimates are thus at least imprecise by half a bin size (plus an implementation-dependent additive).
The MLE is qualified by method properties for high energy astronomy observations with their notorious small numbers of photons per source.
Acknowledgements
The authors would like to thank H.-U. Zimmermann for many constructive comments and for painstakingly reading an earlier version of the manuscript. The authors are also indebted to J. Englhauser and R. Supper for support. Special thanks go to J. Trümper. The constructive criticism of the Reviewer and the Deputy Editor led to many ameliorations.
comprises the operations described in Sect. 4.1.2. The input data consist of up to
six photon count histograms
from (46) and corresponding exposure
histograms
.
The output is the source lists
from (59).
File names and remaining parameters, among them the significance level, L0, and
the dilatation exponent D are collected in a steering parameter file. Parameter
files also control the other commands. The command
produces the background count estimate
as described in Sect. 4.1.4
in histogram form
.
The input consists of up to six histograms
and the
allied source lists
.
One of the parameters is the factor
from (60). Having the background available, a modification of the command (119) called
can be executed. The input files are those of the command (119) supplemented
by the background histograms
.
The result files are the source lists
from (63). The command
carries out the union in (63). After the execution of the foregoing sequence of commands, all provisions are made to perform the MLE in narrow sense embodied in the command
The command
performs the sequence of the commands (119) to (123) in one command. This command is simpler in application but allows the user less control over parameter settings.
For sources with existence likelihoods
below the user-selected
likelihood threshold
,
upper confidence limits,
,
on the source
count rate
for a selectable confidence level
are found with
the command
The input consists of the photon sample
from (1), the background
histogram
from (62) and a list of positions
,
specified in equatorial coordinates, with or without the associated source extent
parameters
according to the chosen options.
The user may keep fixed separately: source position
,
source extent
parameter
(and source count rate
too). Two tables with source
parameters form the output. One of them contains all sources, i.e. those with
source existence likelihoods
.
The strong sources defined by
are collected in a second table.
The command
simulates point sources. The underlying PSF density is a bivariate
normal density with circular symmetry, see (64). With the output sample
,
the
histogram
from (46) and an attitude table, the input to run the
above commands is provided.