Issue |
A&A
Volume 659, March 2022
|
|
---|---|---|
Article Number | A66 | |
Number of page(s) | 17 | |
Section | Numerical methods and codes | |
DOI | https://doi.org/10.1051/0004-6361/202142444 | |
Published online | 08 March 2022 |
Exploring X-ray variability with unsupervised machine learning
I. Self-organizing maps applied to XMM-Newton data★
1
Istituto di Astrofisica Spaziale e Fisica Cosmica (INAF IASF-MI),
20133
Milano,
Italy
e-mail: milos.kovacevic@inaf.it; milosh.kovacevic@gmail.com
2
Center for Astro, Particle and Planetary Physics (CAP 3), New York University,
Abu Dhabi, UAE
e-mail: mp5757@nyu.edu
3
Physics and Astronomy Department Galileo Galilei, University of Padova,
Vicolo dell’Osservatorio 3,
35122,
Padova, Italy
4
Département de Physique, Université de Montréal,
Montreal,
Quebec
H3T 1J4, Canada
5
Istituto Nazionale di Fisica Nucleare - Sezione di Padova,
Via Marzolo 8,
35131
Padova, Italy
6
INFN, Sezione di Pavia,
via A. Bassi 6,
27100
Pavia, Italy
Received:
14
October
2021
Accepted:
26
January
2022
Context. XMM-Newton provides unprecedented insight into the X-ray Universe, recording variability information for hundreds of thousands of sources. Manually searching for interesting patterns in light curves is impractical, requiring an automated data-mining approach for the characterization of sources.
Aims. Straightforward fitting of temporal models to light curves is not a sure way to identify them, especially with noisy data. We used unsupervised machine learning to distill a large data set of light-curve parameters, revealing its clustering structure in preparation for anomaly detection and subsequent searches for specific source behaviors (e.g., flares, eclipses).
Methods. Self-organizing maps (SOMs) achieve dimensionality reduction and clustering within a single framework. They are a type of artificial neural network trained to approximate the data with a two-dimensional grid of discrete interconnected units, which can later be visualized on the plane. We trained our SOM on temporal-only parameters computed from ⪆105 detections from the Exploring the X-ray Transient and variable Sky catalog.
Results. The resulting map reveals that the ≈2500 most variable sources are clustered based on temporal characteristics. We find distinctive regions of the SOM map associated with flares, eclipses, dips, linear light curves, and others. Each group contains sources that appear similar by eye. We single out a handful of interesting sources for further study.
Conclusions. The condensed view of our dataset provided by SOMs allowed us to identify groups of similar sources, speeding up manual characterization by orders of magnitude. Our method also highlights problems with fitting simple temporal models to light curves and can be used to mitigate them to an extent. This will be crucial for fully exploiting the high data volume expected from upcoming X-ray surveys, and may also help with interpreting supervised classification models.
Key words: methods: statistical / methods: miscellaneous / catalogs / astronomical databases: miscellaneous / X-rays: general / methods: data analysis
The movie associated to Fig. 12 is available at https://www.aanda.org
© ESO 2022
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.