Open Access
Issue
A&A
Volume 666, October 2022
Article Number A147
Number of page(s) 16
Section Catalogs and data
DOI https://doi.org/10.1051/0004-6361/202243895
Published online 19 October 2022

© P. Mas-Buitrago et al. 2022

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1 Introduction

Ultracool dwarfs (UCDs) range from M7 V to the extended L, T, and Y spectral types, and include very low mass stars, brown dwarfs (BDs), and planetary-mass objects. With effective temperatures of Teff ≲ 3000 K (Kirkpatrick 2005), they constitute about 15% of the stellar population in the solar neighbourhood and are abundant throughout the Galaxy (Gillonetal. 2016; Bochanski et al. 2010; Kirkpatrick et al. 2019). Since UCDs are fainter and smaller than solar-like stars, it is easier to detect transits around them, which gives UCDs a relevant role in the search for Earth-like exoplanets (Gillon et al. 2017). As demonstrated in Solano et al. (2019), there is still room to discover nearby UCDs, empowering the discovery and study of new habitable exoplanets in the solar neighbourhood. Furthermore, they play a relevant role in the study of Galactic kinematics and in understanding the boundary between stellar and substellar objects.

In addition to many thousands of late-M dwarfs, the current UCD discoveries include around 2000 L and 700 T dwarfs. In tandem, more than 20 Y dwarfs are currently known. In recent years, these discoveries have primarily been driven by wide-field optical and infrared imaging surveys such as the Deep Near Infrared Survey of the Southern Sky (DENIS; Epchtein et al. 1997), the Sloan Digital Sky Survey (SDSS; York et al. 2000), the Two-Micron All Sky Survey (2MASS; Skrutskie et al. 2006), the UKIRT Infrared Deep Sky Survey (UKIDSS; Lawrence et al. 2007), the Wide-Field Infrared Sky Explorer (WISE; Wright et al. 2010), the Visible and Infrared Survey Telescope for Astronomy (VISTA; Cross et al. 2012), and the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS; Chambers et al. 2016). The more recent Javalambre Physics of the Accelerated Universe Astrophysical Survey (J-PAS; Benitez et al. 2014) and the Javalambre Photometric Local Universe Survey (J-PLUS; Cenarro et al. 2019), conceived for the photometric calibration of the former, offer a unique filter system of 56 and 12 filters, respectively. This large number of filters allows for an accurate estimation of stellar parameters such as the effective temperature, which is crucial for classifying an object as a UCD.

In the years to come, future surveys will produce enormous volumes of data that will compromise traditional processing methods. To overcome this bottleneck, refined machine learning (ML) approaches have gained momentum over the last few years. It is possible that ML will be crucial when it comes to analysing and making predictions as to these volumes of data. This set of techniques has indeed already obtained successful results in several areas of astrophysics. For instance, artificial neural networks (ANNs) have been used to estimate the effective temperature and metallicity of a large subset of stars using broad- and intermediate-band J-PLUS optical photometry (Whitten et al. 2019), or to classify galaxies based on their morphology (Naim et al. 1995). Nowadays, there is also a variety of ML techniques dedicated to the automated classification of large datasets, as in Wang et al. (2022), where they build a support vector machine (SVM, Cortes & Vapnik 2009) to perform a star-galaxy-QSO classification using the 12 J-PLUS optical bands.

The main goal of the study presented here is to extend the search for UCDs initiated in Solano et al. (2019) to the J-PLUS second data release (J-PLUS DR2), using a revised Virtual Observatory1 (VO) methodology. In addition, we scout two secondary scenarios. The first one is about exploring the capability to reproduce this search with a purely ML-based methodology that relies only on J-PLUS optical photometry. For this purpose, we constructed a two-step ML method based on the principal component analysis (PCA) and the SVM algorithms. The second is to develop an algorithm capable of searching for flares on Hα and Ca ii H and K emission lines, again using only the J-PLUS photometry.

This article is organised as follows: in Sect. 2 we describe the J-PLUS survey. Section 3 is dedicated to explaining the methodology we have followed to identify the candidate ultracool dwarfs. In Sect. 4, we discuss and analyse different properties of our candidates. In Sect. 5, we compare the candidates found using our methodology with the ultracool dwarfs already found in the J-PLUS field. Sections 6 and 7 are devoted to the ML-based approach and the flares search, respectively. Finally, in Sect. 8 we give a short summary of this work and present our conclusions.

2 J-PLUS

J-PLUS is a multi-filter survey conducted from the Observatorio Astrofísico de Javalambre (OAJ; Cenarro et al. 2014) in Teruel, Spain. Since it was primarily conceived to ensure the photometric calibration of J-PAS, it uses the second largest telescope at the OAJ, which is the 0.83 m Javalambre Auxiliary Survey Telescope (JAST80). J-PLUS is covering thousands of square degrees of the sky using the panoramic wide-field (2 deg2 field of view) camera T80Cam (Marín-Franch et al. 2015), which is equipped with a CCD of 9.2k × 9.2k pixels and a pixel scale of 0.55 arcsec pix−1.

While J-PAS will use an unprecedented system of 56 narrow band filters in the optical, the J-PLUS filter system is composed of four broad (gSDSS, rSDSS, iSDSS, and zSDSS), two intermediate (uJAVA and J0861) and six narrow (J0378, J0395, J0410, J0430, J0515, and J0660) band optical filters. The transmission curves, as well as additional information of these filters, can be found at the Filter Profile Service maintained by the Spanish Virtual Observatory2.

The J-PLUS DR2, available since November 2020, comprises 1088 fields, covering 2176 deg2, observed in all the mentioned optical bands. Figure 1 shows the sky coverage of this release. López-Sanjuan et al. (2021) presents the updated photometric calibration for the DR2, that was improved by including the metallicity information from LAMOST DR5 in the stellar locus estimation. The limiting magnitudes of the 12 bands can be consulted in the Table 1 of the same paper.

3 Methodology

We divided the sky coverage of J-PLUS DR2 in 37 regions of 20 × 20 deg2. To cope with the fact that queries to the J-PLUS archive are limited to 1 million objects, we decided to tessellate each region into smaller circular subregions of 1 deg radius. We made use of TOPCAT3 (Taylor 2005) to cross-match each tessellated region with the J-PLUS DR2 sky coverage in order to avoid searching regions of the sky that are not covered by it.

We used the package STILTS4 (Taylor 2006) to query the J-PLUS DR2 database through the Virtual Observatory TAP protocol. This allowed us to write ADQL5 code to search over all 20 × 20 deg2 regions iteratively. A typical ADQL query example looks like this:

SELECT objs.filter_id,objs.alpha_j2000,

      objs.delta_j2000,objs.class_star,

      objs.mag_aper_6_0,objs.mag_err_aper_6_0,

      objs.mask_flags,imgs.aper_cor_6_0,

      imgs.aper_cor_err_6_0

FROM jplus.MagABSingleObj as objs,

      jplus.TileImage as imgs

WHERE objs.tile_id = imgs.tile_id

AND objs.alpha_j2000 between 2 and 5

AND objs.delta_j2000 between 2 and 3

AND objs.flags=0

AND objs.filter_id between 1 and 4

AND objs.class_star>0.1

In our case, we used the 6 arcsec diameter aperture photometry, since the aperture correction to pass 6 arcsec aperture magnitudes to total magnitudes for point-like sources is available in the J-PLUS DR2 database. We constrained the search to records with good photometric conditions by imposing flags=0 (no SExtractor flags6). Since object detection is performed independently on each filter, this means that for each source the flags=0 condition is applied at the filter level. We also required class_star > 0.1. We were not very restrictive with class_star (SExtractor stellarity index) in ordernot to loose faint sources that may appear as extended objects.

For each 20 × 20 deg2 region, we concatenated the data for the corresponding circular subregions into a single table and removed duplicated instances (tessellated areas may overlap). As UCDs emit most of their flux at longer wavelengths, for the methodology described in Sects. 3.1, 3.2 and 3.3, we only considered the relevant filters for these objects, i.e., the reddest ones (filter IDs 1-4 and 10-12 in the J-PLUS DR2 database, see Table 1). Even so, we stored the data for all filters separately, as we required them for the flare detection work-flow described in Sect. 7. Finally, we used the CDS X-Match service7 in TOPCAT with Gaia EDR3 J2016 (reference epoch 2016.0), using a 3 arcsec radius, to obtain the astrometric information. In those cases where more than one counterpart exists in the search region, only the nearest one was considered. In Sects. 3.1, 3.2 and 3.3, we describe the analysis carried out for each 20 × 20 deg2 region separately.

thumbnail Fig. 1

Sky coverage of J-PLUS DR2 (yellow) in α, δ coordinates (centred on α = 0 deg, δ = 0 deg, with α rising to the left). The SDSS DR9 footprint is superimposed in blue. The purple line represents the border of the J-PLUS DR1 coverage map.

Table 1

J-PLUS filter information, taken from the J-PLUS DR2 database, sorted from shortest to longest wavelength.

3.1 Parallax-based selection

From the cross-matched sample, we only kept sources with relative errors of less than 20% in parallax and less than 10% in both G and GRP photometry. With these objects, we constructed a colour-magnitude diagram (see the left panel of Fig. 2), where the absolute Gaia magnitude in the G band was estimated using MG=G+5  logϖ+5,${M_G} = G + 5\,\,\log \varpi + 5,$(1)

where G is the Gaia apparent magnitude and ϖ is the parallax in arcseconds. To obtain a shortlist of candidate UCDs, we adopted a colour cut of GGRP > 1.3 mag, which corresponds to spectral types M5 V or later according to the updated version of Table 5 in Pecaut & Mamajek (2013)8, and an absolute magnitude limit of MG > 5 mag to leave aside the red giant branch.

thumbnail Fig. 2

Location of the objects shortlisted as candidate UCDs via astrometric selection, for an example 20 × 20 deg2 region, in a colour-magnitude (left) and a reduced proper motion (right) diagram built using Gaia EDR3 sources with parallaxes larger than 10 mas (dark blue dots). The vertical and horizontal red lines mark the boundaries for a source to be shortlisted as candidate UCD. Sources fulfilling these conditions are overplotted in yellow.

3.2 Proper motion-based selection

Ultracool dwarfs may have photometric and morphological properties similar to those of objects such as giants, quasi-stellar objects (QSOs) or distant luminous red galaxies (e.g. Caballero et al. 2008; Theissen et al. 2016, 2017). Assuming nearby objects will have high proper motions, reduced proper motion diagrams are a reliable tool for discriminating between nearby stellar populations and distant sources.

From the cross-matched sample introduced in Sect. 3, we only kept sources with a relative error of less than 20% in both proper motion components and less than 10% in both G and GRP photometry. Furthermore, we only took into account sources with non-zero proper motion, i.e., sources with, at least, one of the proper motion components greater (in absolute value) than three times the associated error.

The right panel of Fig. 2 shows the reduced proper motion diagram defined as: HG=G+5  logμ+5,${H_G} = G + 5\,\,\log \mu + 5,$(2)

where G is the Gaia apparent magnitude and μ is the total proper motion in mas yr−1. Of these sources, we filtered out those already pre-selected in the parallax-guided analysis described in Sect. 3.1 and shortlisted as candidate UCDs those fulfilling the condition GGRP > 1.3 mag, and with a reduced proper motion HG > 22 mag to leave aside the red giant branch.

As discussed in Sect. 3, the cross-match with Gaia EDR3 J2016 is done using a 3 arcsec radius. Since J-PLUS DR2 is based on images collected from November 2015 to February 2020, we might miss some objects with a proper motion larger than 750 mas yr−1, as they could fall outside this 3 arcsec radius. However, we decided not to increase the radius to avoid finding erroneous counterparts.

3.3 Photometry-based selection

In the first two criteria (colour-magnitude and reduced proper motion diagrams), we are imposing parallax and proper motion constraints respectively, which makes these methods dependent on Gaia astrometric information. This means that objects with good photometry but poor astrometry will be excluded from the lists of candidate UCDs. To solve this limitation, in this section we describe a method solely dependent on photometric information. This procedure consisted of two separate steps. First, we built a colour-colour diagram with the purpose of defining a colour cut to identify the UCD locus. Then, we applied this criterion to each 20 × 20 deg2 region independently to obtain a shortlist of candidate UCDs.

To build the colour-colour diagram, we first searched in J-PLUS DR2 for true extended sources, defined as sources having class_star <0.01 Likewise, true point sources were defined as sources with class_star > 0.99. Then, we performed a cross-match with 2MASS and built a JKs (2MASS) vs. rz colour–colour diagram to separate the two types of sources. As discussed in Sect. 3.2, QSOs may have morphometric properties similar to those of UCDs, so it is crucial to also discriminate between these two types in the colour-colour diagram.

Figure 3 shows the different types of objects in a colour-colour diagram. For the sample of QSOs, we cross-matched the SDSS-DR12 Quasar Catalog9 with the J-PLUS DR2. To define the UCD locus, we overplotted in this diagram the candidate UCDs obtained by the methods described in Sects. 3.1 and 3.2 for the region α: 0–20 deg; δ: 0–20 deg. As a compromise to balance the extended object contamination and the loss of candidate UCDs, we defined the UCD locus as the region fulfilling rz > 2.2 mag and applied this criterion to all the sources of each 20 × 20 deg2 region. Of the sources fulfilling it, we filtered out those already pre-selected in the analysis described in Sects. 3.1 and 3.2 and shortlisted the remaining ones as candidate UCDs.

thumbnail Fig. 3

Colour-colour diagram built using true extended (dark blue) and true point (red) sources. Yellow dots represent the sample of 10481 QSOs. Purple dots represent the shortlisted candidate UCDs obtained by parallax-guided and proper motion-guided methods. The vertical grey line marks the rz > 2.2 mag limit for a source to be shortlisted as candidate UCD.

3.4 VOSA filtering

To estimate physical properties, such as effective temperature, luminosity or radius of the shortlisted objects described in the previous sections, we made use of the tool VOSA10 (Bayo et al. 2008). This is a tool developed and maintained by the Spanish Virtual Observatory11 which fits observational data to different collections of theoretical models. An example of VOSA Spectral Energy Distribution (SED) fitting can be found in Fig. 4. Before doing the fit, we built the observational SEDs using the J-PLUS photometric information as well as additional photometry from the 2MASS, UKIDSS, WISE, and VISTA infrared surveys, and from the SDSS data release 12 optical catalogue, available in VOSA.

In our analysis, we used the BT-Settl (CIFIST) collection of theoretical models (Allard et al. 2012; Caffau et al. 2011). Thus, the effective temperature estimated by VOSA is discretised due to the step adopted in the CIFITS grid of models (100 K). We also assumed a surface gravity logg in the range 4.5–5.5 and solar metallicity. The limiting magnitude (5σ, 3 arcsec diameter aperture) of J-PLUS DR2 is 20.5 [AB] in the z band (López-Sanjuan et al. 2021). If we take, for example, the object TVLM 891-15871, which is one of the objects in the UCD catalogue presented in Reylé (2018) with the brightest absolute magnitude (11.36 [AB]) in the z band, we see that it could be detected at a maximum distance of ~680 pc. This leads us to expect a maximum distance of about 650–700 pc to find UCDs in the J-PLUS DR2.

Extinction plays a fundamental role in shaping the SED and, therefore, in the estimation of physical parameters (Laugalys & Straižys 2002; Straižys et al. 2002). Considering the maximum distance at which UCDs can be detected with J-PLUS, we adopted a range of values between AV = 0 mag and 0.5 mag. We relied on the calibration described in Table 1 of Solano et al. (2021) to adopt a temperature cutoff of 2900 K for UCDs if the BT-Settl (CIFIST) models are used in VOSA. The goodness of fit of the SED in VOSA can be assessed with the vgfb parameter, a pseudo-reduced; χ2 internally used by VOSA that is calculated by forcing σ(Fobs) > 0.1 × Fobs, where σ(Fobs) is the error in the observed flux (Fobs). Only sources with good SED fitting (vgfb < 12) were kept.

After applying these effective temperature and vgfb conditions, we used TOPCAT to remove the objects with a non-zero confusion flag (cc_flg) in 2MASS, so as to ensure that objects are not contaminated or biased due to the proximity to a nearby source of equal or greater brightness. Moreover, we used the Aladin sky atlas (Bonnarel et al. 2000) to carry out a visual inspection of the coldest objects, in order to discard any problem related to blending or contamination by nearby objects. Finally, we ended up with 9810 final candidate UCDs. For the record, we checked that 204 of these objects have a renormalised unit weight error (RUWE; Lindegren et al. 2018) greater than 1.4 in Gaia EDR3, which could mean that the source is affected by close binary companions. These objects were not removed since a binarity analysis is performed in Sect. 4.3.

As we use multiple detection methods in our methodology, distinct candidate UCDs may have been detected by different methods, or by several of them. Figure 5 shows the breakdown of the 9810 candidate UCDs according to the methods by which they have been detected. The fact that 2100 objects are only detected by the photometric methodology (‘diag’ bar in Fig. 5) and 4530 are only detected by the astrometric methodology (‘par’, ‘pm’, and ‘par&pm’ bars in Fig. 5) argues for the complementary nature of both approaches. Considering each method separately, we detected 6086 candidates with parallax-based selection, 6338 with proper motion-based selection, and 5280 with photometry-based selection.

thumbnail Fig. 4

Example of an automatically generated SED fitting with VOSA. The blue spectrum represents the theoretical model that fits best, while red dots represent the observed photometry. The inverted yellow triangle indicates that the photometric value corresponds to an upper limit. These points are not considered in the fitting process.

4 Analysis

4.1 Temperatures and distances

Figure 6 shows that the distribution of effective temperatures for our candidate UCDs is not the same depending on whether they have been detected by astrometric methodology (blue) or not. To prove this, we performed a two-sample Kolgomorov–Smirnov test on the two samples, which returned a p value = 3.66 × 10−15, rejecting the possibility that both samples are coming from the exact same distribution. The number of cold objects (Teff ≤ 2200 K) is clearly higher in the only-photometry detected distribution (yellow). Most of our candidates (~86%) have Teff ≥ 2700 K, a clear consequence of the working wavelength, since UCDs peak in the near-infared, and J-PLUS covers only up to the z filter (λeff = 8940.28 Å).

For the distance distribution of our candidate UCDs (Fig. 7), we only considered the candidates with a relative error of less than 20% in parallax (6086 objects), so we can rely on the inverse of the parallax as a distance estimator (Luri et al. 2018). In our case, as mentioned in Sect. 3, the parallax are those of Gaia EDR3. About 70% of the objects lie in the 96 < D(pc) < 222 region (1σ limits), with a maximum and minimun distance of 471 pc and 11 pc, respectively. This upper limit is consistent with the value estimated in Sect. 3.4. We found 68 nearby objects, at distances smaller than 40 pc, that will be further discussed in Sect. 5.2. Figure 8 gives a more in-depth view of the characteristics of our candidate UCDs. As expected, most of the cooler candidates are detected at closer distances and tend to have lower bolometric luminosity.

thumbnail Fig. 5

Breakdown of our candidate UCDs according to the methods by which they have been detected. ‘diag’ label represents photometry-based selection, while ‘par’ and ‘pm’ labels represent selections based on parallax and proper motion, respectively. The ‘all’ label comprises the candidate UCDs identified with the three approaches.

thumbnail Fig. 6

Teff distribution for our candidate UCDs. In yellow, we show the candidates that were only detected by photometry. In blue we show the candidates that were, at least, detected by astrometric methodology.

thumbnail Fig. 7

Distance distribution for our candidate UCDs with an error of less than 20% in Gaia EDR3 J2016 parallax. The mean value (red vertical line) of the distribution is 159 pc, with the closest and farthest objects at 11 pc and 471 pc.

thumbnail Fig. 8

Bolometric luminosity (in solar units) vs. distance diagram of our candidate UCDs with good parallaxes. The points are colour-coded by temperature.

4.2 Kinematics

Stellar kinematics is a reliable proxy for segregating large-scale galactic populations (thin disk, thick disk, and halo) (Burgasser et al. 2015). Using Gaia EDR3 proper motions and parallaxes, we computed the tangential velocities of our candidate UCDs as vtan = 4.74μd, where vtan is given in km s−1, μ is the total proper motion in arcsec yr−1 and d is the distance in pc. For a correct estimation of the tangential velocity, we only considered candidates that met both conditions described in Sects. 3.1 and 3.2 for good parallax and proper motion (4714). Figure 9 shows the distribution of tangential velocities for these candidates, with a mean value of vtan = 39.78 km s−1, a median value of vtan = 33.99 km s−1, and a dispersion of σtan = 24.85 km s−1. Even taking into account objects located at the long tail of the distribution (134 objects, representing 2.8% of the total, with vtan > 100 km s−1), these values agree with previous calculations for UCDs (Faherty et al. 2009).

Torres et al. (2019, Fig. 10) shows a breakdown of the tangential velocity based on the membership in the thin disk, the thick disk or the halo. Relying on these values, we can segregate our candidate UCDs into thin disk (vtan ≤ 85 km s−1), thick disk (85 < vtan < 155 km s−1), and halo (vtan ≥ 155 km s−1) populations. We found 4441, 268 and five candidate UCDs in these intervals, respectively. According to Kilic et al. (2017), the corresponding ages are 6.8–7.0 Gyr (thin disk), 7.4–8.2 Gyr (thick disk), and 12.53.4+1.4$12.5_{ - 3.4}^{ + 1.4}$ Gyr (halo).

Three of the potential halo members show a very high tangential velocity. Two of them, with Simbad identifiers 2MASS J18030236+7557587 and 2MASS J13155851+2814524, are not far from the thick disk-halo threshold, with tangential velocities of vtan = 176.25 km s−1 and vtan = 177.47 km s−1, respectively. Furthermore, one of the objects has vtan = 206.16 km s−1, which significantly exceeds the limit. This object, at a distance of 179 pc, is reported as an M7 in the catalogue provided by Ahmed & Warren (2019) with the id J132625.03+333506.7. Due to its high tangential velocity, we conclude this object could be a potential member of the Galactic halo. We used the (JKs, iJ) colour-colour diagram presented in Lodieu et al. (2017) to study the metallicity of this object. With values of JKs = 0.77 and iJ = 3.29, the object exhibits subdwarf behaviour (low metallicity). Figure 10 shows the mean and standard deviation of the tangential velocity for each value of the effective temperature. There is no evidence of correlation between effective temperature and tangential velocity among our candidates.

To study the possible membership of our candidate UCDs to nearby young associations, we relied on BANYAN Σ12 (Gagné et al. 2018), a Bayesian analysis tool to identify members of young associations. Modelled with multivariate Gaussians in six-dimensional XYZUVW space, BANYAN Σ can derive membership probabilities for all known and well-characterised young associations within 150 pc. As we found no radial velocity data available for any of the 4714 candidate UCDs with good parallax and proper motion, we introduced the sky coordinates, proper motion, and parallax of these objects as input parameters to the algorithm.

For 4666 of the candidate UCDs, the algorithm predicted that most of them are field stars. However, it gave a high Bayesian probability for 48 objects to belong to a young association, in 30 of the cases with a probability greater than 95%. In more detail, the algorithm mapped 34 candidate UCDs to the Pisces-Eridanus stellar stream (Meingast et al. 2019), five to the Argus Association (Zuckerman 2018), four to the AB Doradus Moving Group (Zuckerman et al. 2004), two to the Columba association (Torres et al. 2008), and one each to the Tucana-Horologium (Torres et al. 2000), β Pictoris (Zuckerman et al. 2001), and Carina-Near (Zuckerman et al. 2006) associations. We verified all these 48 objects have tangential velocities typical of the thin disk, with mean vtan = 16.37 km s−1 and standard deviation σ = 6.17 km s−1. As mentioned in BANYAN Σ, a high membership probability in a young association does not guarantee that the star is a true member, or young, so further follow-up would be needed to demonstrate the youth of the object.

thumbnail Fig. 9

Tangential velocity distribution for our candidate UCDs with good parallax and pm conditions. The mean value (red vertical line) of the distribution is 39.78 km s−1, with a maximum value of 206 km s−1.

thumbnail Fig. 10

Mean tangential velocity for each value of effective temperature of our candidate UCDs with reliable parallax and proper motion. The error bars represent the standard deviation.

4.3 Binarity

We conducted a search for binary systems among our candidate UCDs in two ways. We searched for unresolved binaries using a methodology based purely on the photometry of our objects. Using the complementary photometry functionality of VOSA, we selected only the candidates fulfilling three conditions. First, with an excess detected by VOSA in any filter in the infrared. We discarded WISE W3 and W4 due to their poor angular resolution and sensitivity. Second, with good photometry in both 2MASS (Qfl = A) and WISE (cc_flags = 0 and ph_qual = A or B). Third, with at least three good photometric points in the infrared, apart from the detected excess.

After applying these conditions, we ended up with 291 objects with an excess in the infrared that could be ascribed to circumstellar material or to the presence of a close ultracool companion. Then, we used the binary fit functionality of VOSA to fit the observed SED of these 291 objects using the linear combination of two theoretical models. After this, we ended up with 122 candidate UCDs for which the infrared excess detected is nicely reproduced by performing a two-body fit, suggesting the existence of an unresolved companion.

In parallel to this, we looked for Gaia companions of our candidate UCDs at large angular separations, using only those with reliable parallax and proper motion (4714). Firstly, we cross-matched these sources with Gaia EDR3 J2016 to get all the objects separated a maximum of 180 arcsec in the sky (maximum separation allowed by the X-match service in TOPCAT) from each of our candidate UCDs. Then, we established a conservative upper limit of 100 000 au for the projected physical separation between a candidate and its companion. Finally, we relied on the conditions presented in Smart et al. (2019) to ensure that the companion shares a parallax and proper motion similar to that of our candidate UCD:

  • Δϖ < max[1.0,3σϖ]

  • Δ(µα cos δ) < 0.1µα cos δ

  • Δμδ < 0.1μδ

where ϖ and μ are the parallax and proper motion of our candidate UCDs, respectively. After applying these criteria, we ended up with 73 candidate UCDs with one Gaia companion and another five candidate UCDs with two Gaia companions identified. Of these 78 objects, six are already tabulated as known binary systems by the Washington Double Star catalogue (WDS; Mason et al. 2001). Table 2 lists the coordinates (J2000), parallaxes, proper motions, angular separations ρ and projected physical separations s of the six known systems. A table with the same information for the identified multiple systems that are not tabulated by the WDS is accesible through the catalogue described in Appendix A.

A deeper knowledge of the Gaia companion may allow us to infer properties, such as metallicity, of our candidate UCD. We only found spectral types in Simbad for two of the detected companions, with spectral types F2 and K3V. To obtain information about the rest of the companions, we first made use of VOSA to get an estimate of their effective temperature. Then, we relied on the updated version of Table 5 in Pecaut & Mamajek (2013) to map these effectives temperatures to the spectral types of the companions. As result, we ended up with four F-type, one G-type, 16 K-type and 42 M-type stars among the companions with good SED fitting in VOSA. For the rest of the companions, we obtained a bad SED fitting in VOSA (vgfb > 12), so we could not get an estimation of the effective temperature.

Table 2

Identified binary systems that are already tabulated by the WDS.

5 Known ultracool dwarfs

5.1 Recovered known UCDs

Here, we assess the number of known UCDs found in the J-PLUS DR2 field and the fraction of them that were recovered using our methodology. For this analysis, we used nine catalogues and services: SIMBAD13 (Wenger et al. 2000), Zhang et al. (2009), Zhang et al. (2010), Schmidt et al. (2010). Skrzypek et al. (2016), Smart et al. (2017), Reylé (2018), Best et al. (2018), and Ahmed & Warren (2019). Using the SIMBAD TAP service14 through TOPCAT, we selected objects with spectral types M7 V, M8 V, M9 V or labelled as brown dwarfs. A total of 18 282 objects were recovered. Also, from Best et al. (2018) we chose the 2090 objects having spectral type M7 or later. As all the 33 665, 14915, 1886, 1361, 806, 484 and 129 objects in the Ahmed & Warren (2019), Reylé (2018), Smart et al. (2017), Skrzypek et al. (2016), Zhang et al. (2010), Schmidt et al. (2010), and Zhang et al. (2009) catalogues, respectively, are within our scope (spectral type M7 or later), we included them in their entirety.

To select only the known UCDs that lie in the region of the sky covered by J-PLUS DR2 we made use of TOPCAT and its nearMOC functionality, which indicates whether a given sky position either falls within, or is within a certain distance of the edge of, a given MOC. The MOC15 (Multi-Order Coverage Map) is an encoding method dedicated to VO applications or data servers which allows to manage and manipulate any region of the sky, defining it by a subset of regular sky tessellation using the HEALPix method (Górski et al. 2005). Out of a total of 5817 objects lying in the J-PLUS DR2 field of view, we ended up with 4734 known UCDs with photometry in the relevant J-PLUS filters described in Sect. 3 (see Table 1), which are reduced to 4649 objects after removing those with non-zero confusion and contamination flags in 2MASS. From this set, 1983 were recovered using our methodology and 2666 were not. We conducted an in-depth analysis of the 2666 UCDs following the two methodologies (astrometric and photometric) separately, to see in which steps of the process these objects are discarded.

In short, of this 2666 unrecovered objects, 1520 are lost because they do not meet our parallax, proper motion, and photometry constraints, while another 119 are discarded in the GGRP and rz cuts. The remaining 1027 are lost in the tem-perature/vgfb cutoff after the analysis with VOSA, some due to a bad SED fitting (vgfb > 12) and most of them due to an estimated temperature higher than 2900 K. We have checked the latter and the vast majority of them are M7 V from Simbad that lie at the temperature limit, with estimated temperatures of 3000–3100 K.

thumbnail Fig. 11

Distance distribution for previously reported (blue) and new (yellow) candidate UCDs with good parallax conditions.

5.2 New candidate UCDs vs. previously known

In this section, we analyse the differences between previously known UCDs and the remaining candidate UCDs among our sample. For this, we cross-matched our candidate UCDs with the known UCDs sample described in Sect. 5. As indicated above, of the 9810 candidates identified by the proposed VO methodology, only 1983 were previously reported as UCD. This amounts to a total of 7827 new candidate UCDs in the sky coverage of J-PLUS DR2, which represents an increase of about 135% (7827/5817) in the number of UCDs for this area.

Figure 11 shows the distance distribution for our candidate UCDs, with good parallax conditions, discriminated by colour according to whether or not they were previously reported as UCD. It is clear that the new candidates detected are, on average, more distant, driven by the improvement of the quality of parallaxes with Gaia EDR3. Of the 68 nearby objects found at distances smaller than 40 pc, eight have not been previously reported as UCD. To check whether these objects could have been missed by other photometric surveys due to anomalies in their colours, we constructed a colour-colour diagram using JKs (2MASS) and GGRP (Gaia) colours. Figure 12 shows that this is not the case for any of these objects (black diamonds in the diagram).

A more in-depth view of this is the distance vs. effective temperature diagram shown in Fig. 13. Here, we can see how previously reported candidate UCDs tend to be at shorter distances for any value of the effective temperature. This trend is more clearly observed for higher temperature values, where the diagram shows how the new candidate UCDs cover the range of distances of the previously reported candidates and extend it to larger values, suggesting that our methodology allows us to go further in the search for new UCDs.

Going further, in Fig. 14 we plot the absolute proper motions δ| and |µα cos δ| for our candidate UCDs with good proper motion conditions. It shows how the new candidate UCDs detected extend to smaller values of proper motion. Especially for values of proper motion of less than 15 mas yr−1, the number of new candidates is significantly higher than the number of previously reported candidate UCDs, which reflects the improvement of the quality of proper motions with Gaia EDR3.

thumbnail Fig. 12

JKs (2MASS) vs. GGRP (Gaia) diagram of our candidate UCDs with good 2MASS photometric quality (Qflg=A) in J and Ks bands. Black diamonds represent our eight new nearby candidate UCDs at distances d < 40 pc. Green squares stand for new candidate UCDs with tangential velocities vtan > 100 km s−1. Red circles represent candidate UCDs with a possible membership in a nearby young association.

thumbnail Fig. 13

Distance vs. effective temperature diagram for previously reported (blue) and new (yellow) candidate UCDs with good parallax conditions. The vertical dashed line indicates the lower limit of effective temperature for M-type dwarfs (2359 K) according to Pecaut & Mamajek (2013).

thumbnail Fig. 14

Absolute proper motion components for previously reported (blue) and new (yellow) candidate UCDs with good proper motion conditions.

6 Machine learning analysis

The filter system of J-PLUS offers a sufficiently high-dimensional space to reliably use ML techniques. We explored the ability to reproduce the presented search for candidate UCDs with a purely ML-based methodology that uses only J-PLUS photometry. Because the sample is strongly imbalanced, as a first step in the candidate UCDs identification, we proposed a filtering strategy to discard the objects that differ the most from the UCDs using the PCA algorithm. Then, with the reduced sample, SVM models were trained and fine-tuned to maximise the identification of candidate UCDs.

Principal component analysis (Hotelling 1933), one of the most popular linear dimensionality reduction algorithms, is a non-parametric method that aims to reduce a complex data set to a lower dimension by identifying the axes that account for the largest amount of variance. The unit vectors defining each of these axes are called principal components. PCA works on the assumption that principal components with larger associated variance encompass the underlying structure of the data set in order to find the best basis for re-expressing it. The expectation behind this method, as with any method of dimensionality reduction, is that the entire data set can be well characterised along a small number of dimensions (principal components). By projecting the data set onto the hyperplane defined by these principal components, you ensure that the projection will preserve as much variance as possible.

The selection of PCA in our approach instead of other nonlinear dimensionality reduction techniques, such as t-Distributed Stochastic Neighbor Embedding (t-SNE; van der Maaten & Hinton 2008) or Uniform Manifold Approximation and Projection (UMAP; McInnes et al. 2018), is mainly based on (1) the computational efficiency, since PCA allows projecting new data along the new axes without having to reapply the algorithm, and (2) the deterministic nature of the PCA solution, i.e., different runs of PCA on a given dataset will always produce the same results. These properties of PCA are crucial in our proposal, since we use the 2D representation of PCA to perform the filtering as a first step in our ML task.

Support vector machine is a supervised (requires labelled training data) ML algorithm that has been widely used in classification and regression problems (Sarro et al. 2013; González-Marcos et al. 2017). The origin of this algorithm dates back to the late 70s, when Vapnik (1979) delved into the statistical learning theory. The idea behind SVM is to find a hyperplane that separates data into two classes while maximising a marging, defined as the distance from the hyperplane to the closest point across both classes. Thus, the SVM chooses the best separating hyperplane as the one that maximises the distance to these points, so the decision surface is fully specified by a subset of points on the inner edge of each class, known as support vectors. The SVM is a linear classifier, so if the data is not linearly separable in the instance space, we can gain linear separation by mapping the data to a higher dimensional space. To do so, different kernels are used, such as the polynomial or the radial basis function (RBF), since the kernel trick allows us to define a high-dimensional feature space without actually storing these features.

6.1 PCA cut

In our methodology, we used J-PLUS DR2 data from one of the 20 × 20 deg2 mentioned in Sect. 3. We selected as features seven different J-PLUS colours built with the most relevant filters for UCDs, i.e., the reddest ones (see Table 1): iz, ri, iJ0861, J0861 − z, (iz)2, (ri)2, and rz. We discarded the filter J0660 because the available photometry in this filter is less abundant than in the others. Thus, we first built these variables from the J-PLUS photometry, discarding objects with no information in any of the required filters, and labelled the instances as positive or negative class using the candidate UCDs obtained with the previous methodology. After this, we ended up with a sample composed of 317 UCDs and 495 274 non-UCD objects.

To perform the PCA, we first divided the sample into training (70%) and test (30%) sets using stratified sampling to ensure that these sets are representative of the overall population (have the same percentage of samples from each target class as the complete set). Thus, we trained the PCA model using the training set, obtaining that 93% of the sample’s variance lied along the two first principal components. Projecting the training data onto the hyperplane defined by these two principal components, the vast majority of non-UCD objects are clearly separated from the UCDs. Thus, it is possible to make a first cut in the identification of UCDs with this 2D projection, by defining a decision threshold (purple line in the figure) and keeping only the objects that fall on the UCD side. Figure 15 shows the same projection for the entire sample (training + test). After this cut, we reduced our sample to 317 and 29 732 UCD and non-UCD objects, respectively, achieving a 94% reduction on the negative class. Despite still being strongly imbalanced, this reduced sample has a better balance between the negative and positive class, which facilitates better results when using the SVM.

6.2 SVM model

To develop the SVM model, we used the reduced sample obtained in the PCA filtering, keeping the same training and test set structure. We used the test set for the validation of the classification model. The seven J-PLUS colours described in Sect. 6.1 were used as features in the training step.

Then, we conducted a search for the SVM’s optimal hyperparameters on the training test. To do this, we created a grid for the SVM kernel and hyperparameters and did an exhaustive search over this parameter space using the GridSearchCV class from the scikit-learn package, which optimises the hyperparameters of an estimator by k-fold cross-validation using any score to evaluate the performance of the model. In our case, we used the recall score, which measures the ability of the classifier to find all the positive instances, since our priority is to identify as many candidate UCDs as possible. For the GridSearchCV class, we used ten k-folds and set the hyperparameter class_weight to ‘balanced’ to address the imbalance by adjusting the weights inversely proportional to the class frequencies. In the grid of hyperparameters, we tested the regularisation parameter C for values of 1, 10, 100 and 1000, and the kernel scale γ of the RBF kernel for 0.001, 0.01, 0.1, 1, 10 and 100.

After this search for the optimal hyperparameters, we obtained the best recall score with an RBF kernel and hyperparameters C = 10 and γ = 0.001, with a total recall of 92% on the test set. Figure 16 shows the confusion matrix on the test set. The confusion matrix is a performance measurement in machine learning classification that compares the labels predicted by the model (x-axis) with the ground-truth labels in the data set (y-axis). The most important thing to note here is that the SVM model manages to recover nearly all positive instances, which is our main priority, as we do not want to lose any candidate UCD in the process. Also, the SVM performs very well at identifying True Negatives (TN, negative instances predicted as negative). In conclusion, the model allows us to filter out the vast majority of non-UCD objects, while keeping almost all the candidate UCDs. However, the class imbalance of the data causes the number of False Positives (FP, negative instances predicted as positive) to be larger than the number of True Positives (TP, positive instances predicted as positive). This makes the analysis with VOSA still necessary to differentiate the final candidate UCDs.

thumbnail Fig. 15

Projection of the sample used in the ML methodology onto the hyperplane defined by the first two principal components, with an explained variance ratio of 93%. Points are colour-coded according to their class, UCD (yellow) or non-UCD (dark blue). The purple line represents the decision threshold used to make a first cut at identifying UCDs, keeping only the objects that fall on the UCD side.

6.3 Blind test

To validate the classifier’s performance on unseen data, we applied our ML methodology on the J-PLUS DR2 data from another of the 20 × 20 deg2 regions containing 607 801 objects with good photometry in all relevant filters. Firstly, we used the same PCA model fitted with the previous region to perform the PCA filtering on this new region, reducing the total number of instances to 51 343. We used the previously fitted SVM model to predict over this reduced set, obtaining a recall score of 91%. Figure 17 shows the confusion matrix for this blind test. Thus, we ended up with 2606 (2379 + 227) objects to be analysed with VOSA for the final UCD identification, which means the SVM model achieved to discard ~95% (1 − 2379/51 094) of the non-UCD objects that pass the PCA filtering.

We used the objects analysed with VOSA in the VO methodology for this same region to make a thorough analysis of our ML method. Thus, we found that, of these objects, the PCA filter removes those with Teff ≳ 4100 K, so this first cut is able to purge the initial set of the hottest objects. The ML methodology is more restrictive in terms of photometric quality, as it is only applicable to the objects with photometry in all the filters used to build the input features. This means that all the final candidate UCDs with no photometry in any of these filters (around 50% for this region), obtained with the VO methodology, are not captured by the ML procedure. In summary, we concluded that the ML methodology is more efficient in the sense that it allows for a greater number of true negatives (non-UCD objects) to be discarded prior to analysis with VOSA, although it is a more restrictive method as it relies only on the photometry of the J-PLUS filters used. Another advantage of the proposed ML approach is that it consists of a single process instead of the three separate ones required in the VO methodology.

thumbnail Fig. 16

Confusion matrix for the test set, with a recall of 92%.

thumbnail Fig. 17

Confusion matrix for the blind test, with a recall of 91%.

Table 3

Objects with strong flux excess in Ha (filter J0660) or Ca ii H and K (filter J0395) emission lines, identified with our Python algorithm.

7 Detection of strong emission line emitters

Strong emission lines have been detected serendipitously in UCD optical spectra, both as transient flaring phenomena (Liebert et al. 1999, 2003; Martin & Ardila 2001; Schmidt et al. 2007) as well as steady features (Schneider et al. 1991; Mould et al. 1994; Martin et al. 1999; Burgasser et al. 2011). Stellar flares, events powered by the sudden release of magnetic energy, that is converted to kinetic energy of electrons and ions due to magnetic reconnection in the stellar atmosphere, are a common phenomenon around M dwarfs. Works such as those presented in Hambaryan et al. (2004), Berger et al. (2010) and Route & Wolszczan (2016) have confirmed that optical, radio and X-ray flares do occur in UCDs.

We decided to focus our search for strong emission on the Ha and Ca ii H and K lines, important chromospheric activity indicators (Cincunegui et al. 2007), which correspond to filters 11.0 (J0660) and 7.0 (J0395) in the J-PLUS filter system, respectively. Since this is a rare phenomenon, we decided to conduct this search on a larger sample of objects, including all the objects that met the GGRP and rz colour criteria presented in Sects. 3.1, 3.2 and 3.3. Therefore, since we did not apply the effective temperature cutoff, the search also covered spectral types hotter than those of the UCDs.

With this purpose, we developed a Python algorithm that detects any drop in magnitude in filters J0395 and J0660. Firstly, the algorithm joins the J-PLUS DR2 photometry obtained in the search described in Sect. 3 to the shortlisted objects obtained with the methodology described in Sects. 3.1, 3.2 and 3.3. Then, object by object, it computes the magnitude ratio between the filter of interest and all its neighbours. We chose as neighbours the filters 6.0, 8.0 and 9.0 for the filter 7.0 (Ca ii H and K) and the filters 1.0 and 3.0 for the 11.0 (Hα). If this ratio is lower than a fixed threshold value (entered by the user) for any neighbouring filter, the algorithm recognises a possible strong line emitter and plots the photometry of the object. For the object to be recognisable, we need at least photometry in one of the neighbouring filters, so we can detect this emission peak. The algorithm receives as input a file with the candidate UCDs photometry and returns both the plotted photometry of the objects with possible strong emission and a table with the computed magnitude drop for each of them. We were permissive with the fixed threshold, so as not to discard any interesting object, and imposed a value of 0.96. Then, we visually inspected all the possible strong emitters detected by the algorithm given this threshold.

Finally, we ended up with eight objects that exhibit significant emission peaks in the filters of interest, that are presented in Table 3. We used VOSA to estimate the effective temperature of these objects and found only one UCD, with Teff = 2500 K, among the eight objects (fifth object in Table 3). The remaining seven objects have estimated effective temperatures (see Table 3) typical of mid-M dwarfs (Zhang et al. 2018). Figure 18 shows the photometry of the object with the highest line emission excess (first object in Table 3). Also, in Fig. 19 we include images from the J-PLUS DR2 archive with the emission in different filters for the object with highest excess activity in the Ca ii H and K (first object in Table 3) and Ha (seventh object in Table 3) emission lines. With this analysis, we underline the possibility of systematically detecting strong emission lines in UCDs and earlier M-type stars with photometric surveys such as J-PLUS.

For the fifth object listed in Table 3, namely LP 310-34, we carried out a follow-up optical spectroscopy monitoring study. Five exposures of half an hour integration time each were obtained on January 12th, 2020 in service time (proposal 60-299, PI Martín) with ALFOSC attached to the Nordic Optical Telescope in La Palma. The grism number 4 and the slit with of 1.0 arcsec were selected providing a dispersion of 3.75 Å pixel−1 and a resolving power of R = 700. Our spectra confirm that it is a very late M dwarf (dM8) with Hα in emission (Schmidt et al. 2007). We measured an Hα equivalent width of −14.6 Å, using the Gaussian profile integration option available in the IRAF task splot applied to the co-added spectrum of the five exposures. Individual measurements of the equivalent width in each spectrum ranged from –7.0 to –20.7 Å, suggesting variability in the strength of the Hα emission. This level of Hα emission is not uncommon among late-M dwarfs (Martín et al. 2010; Pineda et al. 2016). No other emission lines were detected in our spectra.

One of the new strong line-emission candidates (sixth object in Table 3) was observed on April 21st, 2022 with the long-slit low-resolution mode of the SpeX instrument (Rayner et al. 2003) at the NASA Infrared Telescope Facility (IRTF, program 2022A011, PI A. Burgasser). Preliminary analysis of the data indicates that the near-infrared spectrum is well matched by a M5 dwarf template (A. Burgasser, private communication). Further details of these observations and additional spectroscopic follow-up of the J-PLUS candidates presented in this work is planned for a future paper.

This study suggests that our J-PLUS search for strong emission lines may be revealing previously unknown sporadic very strong activity in otherwise normal late-M dwarfs. It is worth noting that our search for strong line emitters has detected as many objects with Ca ii H and K excess than with Hα excess, and no object showing both excesses simultaneously. Events of strong Ca ii H and K line emission in normal late-M dwarfs may have important implications for studies of exoplanetary space weather and habitability (Yamashiki et al. 2019).

thumbnail Fig. 18

Example of a detected strong excess in the filter 7.0 (J0395), which corresponds to Ca ii H and K emission lines. The figure shows the J-PLUS photometry of the first object in Table 3, with error bars representing the error in the magnitude. The algorithm detects that the magnitude in the filter J0395 is 0.8 times the magnitude in the filter 9.0 (J0430), and recognises this as a possible emission line. In this case, the threshold value for the excess detection was 0.96.

thumbnail Fig. 19

Images from the J-PLUS DR2 archive with the photometry in different filters for two of the strong line emitters detected. The first row corresponds to an excess in the Ca ii H and K (filter J0395) emission lines (first object in Table 3), with images in the filters u, J0378, J0395, J0410, and J0430 (from left to right). The second row shows an excess in the Hα (filter J0660) emission line (seventh object in Table 3), with images in the filters J0515, r, J0660, i, and J0861 (from left to right). For both objects, all images shown were taken within a time interval of about 40 min.

8 Conclusions and future work

Using a Virtual Observatory methodology, we provide a catalogue of 9810 candidate UCDs over the entire sky coverage of J-PLUS DR2. With 7827 previously not reported as UCD, we show there is still room for the discovery of these objects even with a small telescope such as the JAST80. Our main goal is to consolidate and further develop a search methodology, introduced in Solano et al. (2019), to be used for deeper and larger surveys such as J-PAS and Euclid, both being an ideal scenario for the study and discovery of UCDs thanks to their unprecedented photometric system of 54 narrow-band filters and excellent sensitivity, respectively. Further confirmation by spectroscopy of the UCD nature of these candidates goes beyond the scope of this study. However, the candidate UCDs that are reported in Simbad, but are not in our sample of known UCDs (see Sect. 5), mostly present spectral type M6 V or are left out because they lack the luminosity class, so we expect the degree of contamination to be small.

The use of different approaches based on astrometry and photometry tends to minimise the drawbacks and biases associated to the search of ultracool objects: photometric-only selected samples may leave out peculiar UCDs not following the canonical trend in colour-colour diagrams and they can also be affected by extragalactic contamination. Proper motion searches may ignore objects with small values of projected velocity in the plane of the sky. Regarding parallax-based searches, they will be limited to the brightest objects with parallax values from Gaia.

Based on our kinematics study, almost all our candidate UCDs can be considered thin disk members, with 268 of them being potential members of the thick disk. Also, five of the candidates are likely to belong to the Galactic halo. Using the BANYAN Σ tool, we find 48 candidate UCDs with a high Bayesian probability of belonging to seven different young moving associations, in 30 of the cases with a probability greater than 95%. A further spectroscopic follow-up will be required to search for spectral signatures of youth. In the binarity analysis, we find 122 possible unresolved companions among our candidate UCDs. Searching for wide Gaia companions of our candidate UCDs, we find 78 possible multiple systems (73 binary + 5 triple), six of them already tabulated by the WDS. We use VOSA to get an estimation of the effective temperature of the wide Gaia companions identified in all the systems, finding that most of them are M-type stars.

Among the non-recovered known UCDs that lie in the sky coverage of J-PLUS DR2, we find that more than half are lost due to lack of photometric or astrometric information with enough quality. The remaining objects are discarded due to our conservative temperature cutoff at 2900 K or a bad SED fitting (vgfb > 12). Compared to previously reported candidates, the new ones are on average more distant and extend to smaller values of proper motion.

We achieve promising results when reproducing the search for UCDs with a purely ML-based methodology. In this approach, we find crucial the preliminary PCA filtering to deal with the strong imbalance of the data and discard the hottest objects. This allows us to significantly reduce the negative class and improve the classification capability of the posterior SVM model. Using the developed ML methodology to predict on unseen data, we are able to recover 91% of the candidate UCDs found with the VO methodology, discarding a larger number of true negatives (non-UCD objects) before the analysis with VOSA in a faster way. This is a significant achievement, since the main bottleneck of the VO methodology is the high number of objects to be analysed with VOSA.

In this line, the real turning point would be to develop a ML methodology that more significantly filters the number of objects we need to analyse with VOSA for the final UCD identification. This is not a straightforward task due to the imbalance of the data and because the analysis with VOSA is based on complex theoretical models. To this end, we are exploring the use of independent component analysis in the initial filtering and ensemble learning in the classification step.

Finally, we develop an algorithm capable of detecting strong emission line emitters in the optical range. We identify four objects with strong excess in the filter corresponding to the Ca ii H and K emission lines and four other objects with excess emission in the Hα filter.

Acknowledgements

This research has made use of the Spanish Virtual Observatory (https://svo.cab.inta-csic.es) project funded by MCIN/AEI/10.13039/501100011033/ through grant PID2020-112949GB-I00 and MDM-2017-0737 at Centro de Astrobiología (CSIC-INTA), Unidad de Excelencia María de Maeztu. This work presents results from the European Space Agency (ESA) space mission Gaia. Gaia data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). The Gaia mission website is https://www.cosmos.esa.int/gaia. The Gaia archive website is https://archives.esac.esa.int/gaia. This research has made extensive use of the SIMBAD database (Wenger et al. 2000), VizieR catalogue access tool (Ochsenbein et al. 2000), Aladin sky atlas (Bonnarel et al. 2000) provided by CDS, Strasbourg, France, and of the TOPCAT (Taylor 2005) and STILTS (Taylor 2006) tools. We also made use of VOSA and the SVO Filter Profile Service, developed under the Spanish Virtual Observatory project. We made extensive use of Python throughout the entire process, including the packages pandas (https://github.com/pandas-dev/pandas), seaborn (Waskom 2021), numpy (Harris et al. 2020), matplotlib (Hunter 2007), scikit-learn (Pedregosa et al. 2012), plotly (https://github.com/plotly/plotly.py), scipy (Jones et al. 2001), astropy (Astropy Collaboration 2013), and mocpy (Baumann et al. 2020). E.M. was supported by the Agencia Estatal de Investigación del Ministerio de Ciencia e Innovación (AEI-MCINN) under grant PID2019-109522GB-C53. We want to thank Jonathan Gagné for the development of the BANYAN Σ tool (http://www.exoplanetes.umontreal.ca/banyan/). PMB is funded by INTA through grant PRE-OVE. F.J.E. acknowledges financial support by the Spanish grant MDM-2017-0737 at Centro de Astrobiología (CSIC-INTA), Unidad de Excelencia María de Maeztu. Funding for the J-PLUS project has been provided by the Governments of Spain and Aragón through the Fondo de Inversiones de Teruel; the Aragonese Government through the Research Groups E96, E103, E16_17R, and E16_20R; the Spanish Ministry of Science, Innovation and Universities (MCIU/AEI/FEDER, UE) with grants PGC2018-097585-B-C21 and PGC2018-097585-B-C22; the Spanish Ministry of Economy and Competitiveness (MINECO/FEDER, UE) under AYA2015-66211-C2-1-P, AYA2015-66211-C2-2, AYA2012-30789, and ICTS-2009-14; and European FEDER funding (FCDD10-4E-867, FCDD13-4E-2685). The Brazilian agencies FAPERJ and FAPESP as well as the National Observatory of Brazil have also contributed to this project. This work is based on observations made with the JAST80 telescope at the Observatorio Astrofísico de Javalambre (OAJ) in Teruel, owned, managed and operated by the Centro de Estudios de Física del Cosmos de Aragón (CEFCA) and data obtained with ALFOSC on the Nordic Optical Telescope (NOT) at the Observatorio del Roque de los Muchachos (La Palma, Spain). ALFOSC was provided by the Instituto de Astrofísica de Andalucía (IAA) under joint agreement with the University of Copenhaguen and NOTSA. We thank the OAJ Data Processing and Archiving Unit (UPAD) for reducing and calibrating the OAJ data used in this work. R.A.D. acknowledges support from the Conselho Nacional de Desenvolvimento Científico e Tecnológico – CNPq through BP grant 308105/2018-4, and the Financiadora de Estudos e Projetos – FINEP grants REF. 1217/13 – 01.13.0279.00 and REF 0859/10 – 01.10.0663.00 for hardware support for the J-PLUS project through the National Observatory of Brazil. PC acknowledges financial support from the Government of Comunidad Autónoma de Madrid (Spain), via postdoctoral grant ‘Atracción de Talento Investigador’ 2019-T2/TIC-14760. We thank the IRTF observers (D. Chih-Chun Hsu, N. Lodieu, C. Theissen, J.-Y. Zhang) for program 2022-A011 (PI A. Burgasser). The authors wish to recognise and acknowledge the very significant cultural role and reverence that the summit of Maunakea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain.

Appendix A Virtual Observatory compliant, online catalogue

To help the astronomical community use our catalogue of candidate UCDs, we provide an archive system that can be accessed from a webpage16 or through a Virtual Observatory Cone-Search17. The archive system implements a very simple search interface that allows queries by coordinates and radius as well as by other parameters of interest. The user can also select the maximum number of sources (with values from ten to unlimited). The result of the query is a HTML table with all the sources found in the archive fulfilling the search criteria. The result can also be downloaded as a VOTable or a CSV file. Detailed information on the output fields can be obtained placing the mouse over the question mark located close to the name of the column. The archive also implements the SAMP18 (Simple Application Messaging) Virtual Observatory protocol. SAMP allows Virtual Observatory applications to communicate with each other in a seamless and transparent manner for the user. This way, the results of a query can be easily transferred to other VO applications, such as, for instance, TOPCAT.

References

  1. Ahmed, S., & Warren, S.J. 2019, A&Amp;A, 623, A127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Allard, F., Homeier, D., & Freytag, B. 2012, Philos. Trans. Roy. Soc. Lond. A, 370, 2765 [NASA ADS] [Google Scholar]
  3. Astropy Collaboration (Robitaille, T.P., et al.) 2013, A&Amp;A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Baumann, M., Boch, T., Fernique, P., Nebot, A., & Pineau, F.X. 2020, in Astronomical Data Analysis Software and Systems XXIX, eds. R. Pizzo, E.R. Deul, J.D. Mol, J. de Plaa, & H. Verkouter, Astronomical Society of the Pacific Conference Series, 527, 693 [NASA ADS] [Google Scholar]
  5. Bayo, A., Rodrigo, C., Barrado, Y. Navascués, D., et al. 2008, A&Amp;A, 492, 277 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Benitez, N., Dupke, R., Moles, M., et al. 2014, ArXiv e-prints, [arXiv:1403.5237] [Google Scholar]
  7. Berger, E., Basri, G., Fleming, T.A., et al. 2010, ApJ, 709, 332 [NASA ADS] [CrossRef] [Google Scholar]
  8. Best, W.M.J., Magnier, E.A., Liu, M.C., et al. 2018, ApJS, 234, 1 [Google Scholar]
  9. Bochanski, J.J., Hawley, S.L., Covey, K.R., et al. 2010, AJ, 139, 2679 [NASA ADS] [CrossRef] [Google Scholar]
  10. Bonnarel, F., Fernique, P., Bienaymé, O., et al. 2000, A&Amp;As, 143, 33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Burgasser, A.J., Sitarski, B.N., Gelino, C.R., Logsdon, S.E., & Perrin, M.D. 2011, ApJ, 739, 49 [NASA ADS] [CrossRef] [Google Scholar]
  12. Burgasser, A.J., Logsdon, S.E., Gagné, J., et al. 2015, ApJS, 220, 18 [NASA ADS] [CrossRef] [Google Scholar]
  13. Caballero, J.A., Burgasser, A.J., & Klement, R. 2008, A&Amp;A, 488, 181 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Caffau, E., Ludwig, H.G., Steffen, M., Freytag, B., & Bonifacio, P. 2011, Sol. Phys., 268, 255 [NASA ADS] [CrossRef] [Google Scholar]
  15. Cenarro, A.J., Moles, M., Mann-Franch, A., et al. 2014, in Observatory Operations: Strategies, Processes, and Systems V, eds. A.B. Peck, C.R. Benn, & R.L. Seaman, 9149, International Society for Optics and Photonics (SPIE), 553 [Google Scholar]
  16. Cenarro, A.J., Moles, M., Cristobal-Hornillos, D., et al. 2019, A&Amp;A, 622, A176 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Chambers, K.C., Magnier, E.A., Metcalfe, N., et al. 2016, ArXiv e-prints, [arXiv:1612.05560] [Google Scholar]
  18. Cincunegui, C., Diaz, R.F., & Mauas, P.J.D. 2007, A&Amp;A, 469, 309 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  19. Cortes, C., & Vapnik, V. 2009, Chem. Biol. Drug Des., 297, 273 [Google Scholar]
  20. Cross, N.J.G., Collins, R.S., Mann, R.G., et al. 2012, A&Amp;A, 548, A119 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  21. Epchtein, N., de Batz, B., Capoani, L., et al. 1997, The Messenger, 87, 27 [NASA ADS] [Google Scholar]
  22. Faherty, J.K., Burgasser, A.J., Cruz, K.L., et al. 2009, AJ, 137, 1 [NASA ADS] [CrossRef] [Google Scholar]
  23. Gagné, J., Mamajek, E.E., Malo, L., et al. 2018, ApJ, 856, 23 [CrossRef] [Google Scholar]
  24. Gillon, M., Jehin, E., Lederer, S.M., et al. 2016, Nature, 533, 221 [NASA ADS] [CrossRef] [Google Scholar]
  25. Gillon, M., Triaud, A.H.M.J., Demory, B.-O., et al. 2017, Nature, 542, 456 [NASA ADS] [CrossRef] [Google Scholar]
  26. González-Marcos, A., Sarro, L.M., Ordieres-Meré, J., & Bello-Garaa, A. 2017, MNRAS, 465, 4556 [CrossRef] [Google Scholar]
  27. Górski, K.M., Hivon, E., Banday, A.J., et al. 2005, ApJ, 622, 759 [CrossRef] [Google Scholar]
  28. Hambaryan, V., Staude, A., Schwope, A.D., et al. 2004, A&Amp;A, 415, 265 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Harris, C.R., Millman, K.J., van der Walt, S.J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
  30. Hotelling, H. 1933, J. Educ. Psychol., 24, 447 [Google Scholar]
  31. Hunter, J.D. 2007, Comput. Sci. Eng., 9, 90 [Google Scholar]
  32. Jones, E., Oliphant, T., & Peterson, P. 2001, SciPy: Open Source Scientific Tools for Python [Google Scholar]
  33. Kilic, M., Munn, J.A., Harris, H.C., et al. 2017, ApJ, 837, 162 [NASA ADS] [CrossRef] [Google Scholar]
  34. Kirkpatrick, J.D. 2005, ARA&Amp;A, 43, 195 [NASA ADS] [CrossRef] [Google Scholar]
  35. Kirkpatrick, J.D., Martin, E.C., Smart, R.L., et al. 2019, ApJS, 240, 19 [NASA ADS] [CrossRef] [Google Scholar]
  36. Laugalys, V., & Straižys, V. 2002, Baltic Astron., 11, 205 [NASA ADS] [Google Scholar]
  37. Lawrence, A., Warren, S.J., Almaini, O., et al. 2007, MNRAS, 379, 1599 [NASA ADS] [CrossRef] [Google Scholar]
  38. Liebert, J., Kirkpatrick, J.D., Reid, I.N., & Fisher, M.D. 1999, ApJ, 519, 345 [NASA ADS] [CrossRef] [Google Scholar]
  39. Liebert, J., Kirkpatrick, J.D., Cruz, K.L., et al. 2003, AJ, 125, 343 [NASA ADS] [CrossRef] [Google Scholar]
  40. Lindegren, L., Hernandez, J., Bombrun, A., et al. 2018, A&Amp;A, 616, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Lodieu, N., Espinoza Contreras, M., Zapatero Osorio, M.R., et al. 2017, A&Amp;A, 598, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. López-Sanjuan, C., Yuan, H., Vazquez Ramio, H., et al. 2021, A&Amp;A, 654, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Luri, X., Brown, A.G.A., Sarro, L.M., et al. 2018, A&Amp;A, 616, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. Marin-Franch, A., Taylor, K., Cenarro, J., Cristobal-Hornillos, D., & Moles, M. 2015, in IAU General Assembly, 29, 2257381 [Google Scholar]
  45. Martin, E.L., & Ardila, D.R. 2001, AJ, 121, 2758 [NASA ADS] [CrossRef] [Google Scholar]
  46. Martin, E.L., Basri, G., & Zapatero Osorio, M.R. 1999, AJ, 118, 1005 [NASA ADS] [CrossRef] [Google Scholar]
  47. Martin, E.L., Phan-Bao, N., Bessell, M., et al. 2010, A&Amp;A, 517, A53 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  48. Mason, B.D., Wycoff, G.L., Hartkopf, W.I., Douglass, G.G., & Worley, C.E. 2001, AJ, 122, 3466 [NASA ADS] [CrossRef] [Google Scholar]
  49. McInnes, L., Healy, J., Saul, N., & Großberger, L. 2018, J. Open Source Softw., 3, 861 [CrossRef] [Google Scholar]
  50. Meingast, S., Alves, J., & Fürnkranz, V. 2019, A&Amp;A, 622, L13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Mould, J., Cohen, J., Oke, J.B., & Reid, N. 1994, AJ, 107, 2222 [NASA ADS] [CrossRef] [Google Scholar]
  52. Naim, A., Lahav, O., Sodré, L.J., & Storrie-Lombardi, M.C. 1995, MNRAS, 275, 567 [NASA ADS] [CrossRef] [Google Scholar]
  53. Ochsenbein, F., Bauer, P., & Marcout, J. 2000, A&Amp;As, 143, 23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  54. Pecaut, M.J., & Mamajek, E.E. 2013, ApJS, 208, 9 [NASA ADS] [CrossRef] [Google Scholar]
  55. Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2012, ArXiv e-prints, [arXiv:1201.0490] [Google Scholar]
  56. Pineda, J.S., Hallinan, G., Kirkpatrick, J.D., et al. 2016, ApJ, 826, 73 [NASA ADS] [CrossRef] [Google Scholar]
  57. Rayner, J.T., Toomey, D.W., Onaka, P.M., et al. 2003, PASP, 115, 362 [NASA ADS] [CrossRef] [Google Scholar]
  58. Reylé, C. 2018, A&Amp;A, 619, L8 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  59. Route, M., & Wolszczan, A. 2016, ApJ, 830, 85 [NASA ADS] [CrossRef] [Google Scholar]
  60. Sarro, L.M., Debosscher, J., Neiner, C., et al. 2013, A&Amp;A, 550, A120 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  61. Schmidt, S.J., Cruz, K.L., Bongiorno, B.J., Liebert, J., & Reid, I.N. 2007, AJ, 133, 2258 [NASA ADS] [CrossRef] [Google Scholar]
  62. Schmidt, S.J., West, A.A., Hawley, S.L., & Pineda, J.S. 2010, AJ, 139, 1808 [NASA ADS] [CrossRef] [Google Scholar]
  63. Schneider, D.P., Greenstein, J.L., Schmidt, M., & Gunn, J.E. 1991, AJ, 102, 1180 [NASA ADS] [CrossRef] [Google Scholar]
  64. Skrutskie, M.F., Cutri, R.M., Stiening, R., et al. 2006, AJ, 131, 1163 [NASA ADS] [CrossRef] [Google Scholar]
  65. Skrzypek, N., Warren, S.J., & Faherty, J.K. 2016, A&Amp;A, 589, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  66. Smart, R.L., Marocco, F., Caballero, J.A., et al. 2017, MNRAS, 469, 401 [NASA ADS] [CrossRef] [Google Scholar]
  67. Smart, R.L., Marocco, F., Sarro, L.M., et al. 2019, MNRAS, 485, 4423 [NASA ADS] [CrossRef] [Google Scholar]
  68. Solano, E., Martin, E.L., Caballero, J.A., et al. 2019, A&Amp;A, 627, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  69. Solano, E., Galvez-Ortiz, M.C., Martin, E.L., et al. 2021, MNRAS, 501, 281 [Google Scholar]
  70. Straižys, V., Cernis, K., Kazlauskas, A., & Laugalys, V. 2002, Baltic Astron., 11, 231 [Google Scholar]
  71. Taylor, M.B. 2005, in Astronomical Data Analysis Software and Systems XIV, eds. P. Shopbell, M. Britton, & R. Ebert, Astronomical Society of the Pacific Conference Series, 347, 29 [NASA ADS] [Google Scholar]
  72. Taylor, M.B. 2006, in Astronomical Data Analysis Software and Systems XV, eds. C. Gabriel, C. Arviset, D. Ponz, & S. Enrique, Astronomical Society of the Pacific Conference Series, 351, 666 [NASA ADS] [Google Scholar]
  73. Theissen, C.A., West, A.A., & Dhital, S. 2016, AJ, 151, 41 [NASA ADS] [CrossRef] [Google Scholar]
  74. Theissen, C.A., West, A.A., Shippee, G., Burgasser, A.J., & Schmidt, S.J. 2017, AJ, 153, 92 [NASA ADS] [CrossRef] [Google Scholar]
  75. Torres, C.A.O., da Silva, L., Quast, G.R., de la Reza, R., & Jilinski, E. 2000, AJ, 120, 1410 [NASA ADS] [CrossRef] [Google Scholar]
  76. Torres, C.A.O., Quast, G.R., Melo, C.H.F., & Sterzik, M.F. 2008, in Handbook of Star Forming Regions, II, ed. B. Reipurth, 5, 757 [NASA ADS] [Google Scholar]
  77. Torres, S., Cantero, C., Rebassa-Mansergas, A., et al. 2019, MNRAS, 485, 5573 [NASA ADS] [CrossRef] [Google Scholar]
  78. van der Maaten, L., & Hinton, G. 2008, J. Mach. Learn. Res., 9, 2579 [Google Scholar]
  79. Vapnik, V.N. 1979, Estimation of Dependences Based on Empirical Data (in Russian) (USSR: Nauka) [Google Scholar]
  80. Wang, C., Bai, Y., López-Sanjuan, C., et al. 2022, A&Amp;A, 659, A144 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  81. Waskom, M. 2021, J. Open Source Softw., 6, 3021 [CrossRef] [Google Scholar]
  82. Wenger, M., Ochsenbein, F., Egret, D., et al. 2000, A&Amp;AS, 143, 9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  83. Whitten, D.D., Placco, V.M., Beers, T.C., et al. 2019, A&Amp;A, 622, A182 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  84. Wright, E.L., Eisenhardt, P.R.M., Mainzer, A.K., et al. 2010, AJ, 140, 1868 [NASA ADS] [CrossRef] [Google Scholar]
  85. Yamashiki, Y.A., Maehara, H., Airapetian, V., et al. 2019, ApJ, 881, 114 [NASA ADS] [CrossRef] [Google Scholar]
  86. York, D.G., Adelman, J., Anderson, John E., J., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]
  87. Zhang, Z.H., Pokorny, R.S., Jones, H.R.A., et al. 2009, A&Amp;A, 497, 619 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  88. Zhang, Z.H., Pinfield, D.J., Day-Jones, A.C., et al. 2010, MNRAS, 404, 1817 [NASA ADS] [Google Scholar]
  89. Zhang, Z., Pinfield, D., Galvez-Ortiz, M., et al. 2018, MNRAS, 479, 1383 [NASA ADS] [CrossRef] [Google Scholar]
  90. Zuckerman, B. 2018, ApJ, 870, 27 [NASA ADS] [CrossRef] [Google Scholar]
  91. Zuckerman, B., Song, I., Bessell, M.S., & Webb, R.A. 2001, ApJ, 562, L87 [NASA ADS] [CrossRef] [Google Scholar]
  92. Zuckerman, B., Song, I., & Bessell, M.S. 2004, ApJ, 613, L65 [NASA ADS] [CrossRef] [Google Scholar]
  93. Zuckerman, B., Bessell, M.S., Song, I., & Kim, S. 2006, ApJ, 649, L115 [NASA ADS] [CrossRef] [Google Scholar]

All Tables

Table 1

J-PLUS filter information, taken from the J-PLUS DR2 database, sorted from shortest to longest wavelength.

Table 2

Identified binary systems that are already tabulated by the WDS.

Table 3

Objects with strong flux excess in Ha (filter J0660) or Ca ii H and K (filter J0395) emission lines, identified with our Python algorithm.

All Figures

thumbnail Fig. 1

Sky coverage of J-PLUS DR2 (yellow) in α, δ coordinates (centred on α = 0 deg, δ = 0 deg, with α rising to the left). The SDSS DR9 footprint is superimposed in blue. The purple line represents the border of the J-PLUS DR1 coverage map.

In the text
thumbnail Fig. 2

Location of the objects shortlisted as candidate UCDs via astrometric selection, for an example 20 × 20 deg2 region, in a colour-magnitude (left) and a reduced proper motion (right) diagram built using Gaia EDR3 sources with parallaxes larger than 10 mas (dark blue dots). The vertical and horizontal red lines mark the boundaries for a source to be shortlisted as candidate UCD. Sources fulfilling these conditions are overplotted in yellow.

In the text
thumbnail Fig. 3

Colour-colour diagram built using true extended (dark blue) and true point (red) sources. Yellow dots represent the sample of 10481 QSOs. Purple dots represent the shortlisted candidate UCDs obtained by parallax-guided and proper motion-guided methods. The vertical grey line marks the rz > 2.2 mag limit for a source to be shortlisted as candidate UCD.

In the text
thumbnail Fig. 4

Example of an automatically generated SED fitting with VOSA. The blue spectrum represents the theoretical model that fits best, while red dots represent the observed photometry. The inverted yellow triangle indicates that the photometric value corresponds to an upper limit. These points are not considered in the fitting process.

In the text
thumbnail Fig. 5

Breakdown of our candidate UCDs according to the methods by which they have been detected. ‘diag’ label represents photometry-based selection, while ‘par’ and ‘pm’ labels represent selections based on parallax and proper motion, respectively. The ‘all’ label comprises the candidate UCDs identified with the three approaches.

In the text
thumbnail Fig. 6

Teff distribution for our candidate UCDs. In yellow, we show the candidates that were only detected by photometry. In blue we show the candidates that were, at least, detected by astrometric methodology.

In the text
thumbnail Fig. 7

Distance distribution for our candidate UCDs with an error of less than 20% in Gaia EDR3 J2016 parallax. The mean value (red vertical line) of the distribution is 159 pc, with the closest and farthest objects at 11 pc and 471 pc.

In the text
thumbnail Fig. 8

Bolometric luminosity (in solar units) vs. distance diagram of our candidate UCDs with good parallaxes. The points are colour-coded by temperature.

In the text
thumbnail Fig. 9

Tangential velocity distribution for our candidate UCDs with good parallax and pm conditions. The mean value (red vertical line) of the distribution is 39.78 km s−1, with a maximum value of 206 km s−1.

In the text
thumbnail Fig. 10

Mean tangential velocity for each value of effective temperature of our candidate UCDs with reliable parallax and proper motion. The error bars represent the standard deviation.

In the text
thumbnail Fig. 11

Distance distribution for previously reported (blue) and new (yellow) candidate UCDs with good parallax conditions.

In the text
thumbnail Fig. 12

JKs (2MASS) vs. GGRP (Gaia) diagram of our candidate UCDs with good 2MASS photometric quality (Qflg=A) in J and Ks bands. Black diamonds represent our eight new nearby candidate UCDs at distances d < 40 pc. Green squares stand for new candidate UCDs with tangential velocities vtan > 100 km s−1. Red circles represent candidate UCDs with a possible membership in a nearby young association.

In the text
thumbnail Fig. 13

Distance vs. effective temperature diagram for previously reported (blue) and new (yellow) candidate UCDs with good parallax conditions. The vertical dashed line indicates the lower limit of effective temperature for M-type dwarfs (2359 K) according to Pecaut & Mamajek (2013).

In the text
thumbnail Fig. 14

Absolute proper motion components for previously reported (blue) and new (yellow) candidate UCDs with good proper motion conditions.

In the text
thumbnail Fig. 15

Projection of the sample used in the ML methodology onto the hyperplane defined by the first two principal components, with an explained variance ratio of 93%. Points are colour-coded according to their class, UCD (yellow) or non-UCD (dark blue). The purple line represents the decision threshold used to make a first cut at identifying UCDs, keeping only the objects that fall on the UCD side.

In the text
thumbnail Fig. 16

Confusion matrix for the test set, with a recall of 92%.

In the text
thumbnail Fig. 17

Confusion matrix for the blind test, with a recall of 91%.

In the text
thumbnail Fig. 18

Example of a detected strong excess in the filter 7.0 (J0395), which corresponds to Ca ii H and K emission lines. The figure shows the J-PLUS photometry of the first object in Table 3, with error bars representing the error in the magnitude. The algorithm detects that the magnitude in the filter J0395 is 0.8 times the magnitude in the filter 9.0 (J0430), and recognises this as a possible emission line. In this case, the threshold value for the excess detection was 0.96.

In the text
thumbnail Fig. 19

Images from the J-PLUS DR2 archive with the photometry in different filters for two of the strong line emitters detected. The first row corresponds to an excess in the Ca ii H and K (filter J0395) emission lines (first object in Table 3), with images in the filters u, J0378, J0395, J0410, and J0430 (from left to right). The second row shows an excess in the Hα (filter J0660) emission line (seventh object in Table 3), with images in the filters J0515, r, J0660, i, and J0861 (from left to right). For both objects, all images shown were taken within a time interval of about 40 min.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.