EDP Sciences
The XMM-Newton extended survey of the Taurus molecular cloud
Press Release
Free access
Volume 468, Number 2, June III 2007
The XMM-Newton extended survey of the Taurus molecular cloud
Page(s) 501 - 514
DOI http://dx.doi.org/10.1051/0004-6361:20064927

A&A 468, 501-514 (2007)
DOI: 10.1051/0004-6361:20064927

Unbinned maximum-likelihood estimators for low-count data

Applications to faint X-ray spectra in the Taurus molecular cloud
K. Arzner1, M. Güdel1, K. Briggs1, A. Telleschi1, M. Schmidt2, M. Audard3, L. Scelsi4, and E. Franciosini4

1  Paul Scherrer Institut, 5232 Villigen, Switzerland
    e-mail: arzner@astro.phys.ethz.ch
2  Institute for Data Analysis and Process Design, Zurich University of Applied Sciences, Postfach 805, 8401 Winterthur, Switzerland
3  Columbia Astrophysical Laboratory, 550 West 120th St, MC 5247, New York, NY 10027, USA
4  Dipartimento di Scienze Fisiche e Astronomiche, Piazza del Parlamento 1, 90134 Palermo, Italy

(Received 27 January 2006 / Accepted 6 September 2006)

Traditional binned statistics such as $\chi^2$ suffer from information loss and arbitrariness of the binning procedure, which is especially important at low count rates as encountered in the XMM-Newton Extended Survey of the Taurus Molecular Cloud (XEST). We point out that the underlying statistical quantity (the log likelihood L) does not require any binning beyond the one implied by instrumental readout channels, and we propose to use it for low-count data. The performance of L in the model classification and point estimation problems is explored by Monte-Carlo simulations of Chandra and XMM-Newton X-ray spectra, and is compared to the performances of the binned Poisson statistic (C), Pearson's $\chi^2$ and Neyman's $\chi^2_N$, the Kolmogorov-Smirnov, and Kuiper's statistics. It is found that the unbinned log likelihood L performs best with regard to the expected chi-square distance between true and estimated spectra, the chance of a successful identification among discrete candidate models, the area under the receiver-operator curve of reduced (two-model) binary classification problems, and generally also with regard to the mean square errors of individual spectrum parameters. The $\chi^2 (\chi^2_{\rm N})$ statistics should only be used if more than 10 (15) predicted counts per bin are available. From the practical point of view, the computational cost of evaluating L is smaller than for any of the alternative methods if the forward model is specified in terms of a Poisson intensity and normalization is a free parameter. The maximum-L method is applied to 14 XEST observations, and confidence regions are discussed. The unbinned results are compared to binned XSPEC results, and found to generally agree, with exceptions explained by instability under re-binning and by background fine structures. In particular, HO Tau is found by the unbinned method to be rather cool (kT ~ 0.2 keV), which may be a sign of shock emission. The maximum-L method has no lower limit on the available counts, and allows to treat weak sources which are beyond the means of binned methods.

Key words: methods: statistical -- X-rays: stars -- stars: formation

© ESO 2007