A&A 468, 501-514 (2007)

DOI: 10.1051/0004-6361:20064927

## Unbinned maximum-likelihood estimators for low-count data

##### Applications to faint X-ray spectra in the Taurus molecular cloud

**K. Arzner**

^{1}, M. Güdel^{1}, K. Briggs^{1}, A. Telleschi^{1}, M. Schmidt^{2}, M. Audard^{3}, L. Scelsi^{4}, and E. Franciosini^{4}^{1}Paul Scherrer Institut, 5232 Villigen, Switzerland

e-mail: arzner@astro.phys.ethz.ch

^{2}Institute for Data Analysis and Process Design, Zurich University of Applied Sciences, Postfach 805, 8401 Winterthur, Switzerland

^{3}Columbia Astrophysical Laboratory, 550 West 120th St, MC 5247, New York, NY 10027, USA

^{4}Dipartimento di Scienze Fisiche e Astronomiche, Piazza del Parlamento 1, 90134 Palermo, Italy

(Received 27 January 2006 / Accepted 6 September 2006)

** Abstract **

Traditional binned statistics such as suffer from information loss and arbitrariness of the binning
procedure, which is especially important at low count rates as encountered in the *XMM-Newton* Extended
Survey of the Taurus Molecular Cloud (XEST). We point out that the
underlying statistical quantity (the log likelihood *L*) does not require any binning beyond
the one implied by instrumental readout channels, and we propose to use it for low-count data.
The performance of *L* in the model classification and point estimation problems is explored
by Monte-Carlo simulations of *Chandra* and *XMM-Newton* X-ray spectra, and is compared to the
performances of the binned Poisson statistic (*C*), Pearson's and Neyman's ,
the Kolmogorov-Smirnov, and Kuiper's statistics.
It is found that the unbinned log likelihood *L* performs best with regard to the expected chi-square distance between
true and estimated spectra, the chance of a successful identification among discrete candidate models,
the area under the receiver-operator curve of reduced (two-model) binary classification problems,
and generally also with regard to the mean square errors of individual spectrum parameters.
The
statistics should only be used if more than 10 (15) predicted counts
per bin are available. From the practical point of view, the computational cost of evaluating *L*
is smaller than for any of the alternative methods if the forward model is specified in terms
of a Poisson intensity and normalization is a free parameter. The maximum-*L* method is applied to 14 XEST observations,
and confidence regions are discussed. The unbinned results are compared to binned XSPEC results,
and found to generally agree, with exceptions explained by instability under re-binning and by
background fine structures. In particular, HO Tau is found by the unbinned method to be rather
cool (*kT* ~ 0.2 keV), which may be a sign of shock emission.
The maximum-*L* method has no lower limit on the available counts, and allows to treat weak sources
which are beyond the means of binned methods.

**Key words:**methods: statistical

**--**X-rays: stars

**--**stars: formation

**©**

*ESO 2007*