Properties of ultracool dwarfs with Gaia
An assessment of the accuracy for the temperature determination
^{1}
Dpt. de Inteligencia Artificial, UNED, Juan del Rosal 16,
28040
Madrid,
Spain
email:
lsb@dia.uned.es
^{2}
Dpt. Statistics and Operations Research, University of Cádiz,
Campus Universitario Río San Pedro s/n, 11510 Puerto Real, Cádiz, Spain
email:
angel.berihuete@uca.es
^{3}
Calar Alto Observatory, Centro Astronómico Hispano Alemán, C/ Jesús Durbán
Remón, 04004
Almería,
Spain
^{4}
Depto. Astrofísica, Centro de Astrobiología (INTACSIC), ESAC
campus, PO Box 78, 28691
Villanueva de la Cañada,
Spain
^{5}
Department Astronomia i Meteorologia ICCUBIEEC,
Martí i Franquès 1,
Barcelona
08028,
Spain
Received: 20 June 2012
Accepted: 23 November 2012
Context. The Gaia catalogue will contain observations and physical parameters of a vast number of objects, including ultracool dwarf stars, which we define here as stars with a temperature below 2500 K.
Aims. We aimed to assess the accuracy of the Gaia T_{eff} and log (g) estimates as derived with current models and observations.
Methods. We assessed the validity of several inference techniques for deriving the physical parameters of ultracool dwarf stars: Gaussian processes, support vector machines, knearest neighbours, kernel partial least squares and Bayesian estimation. In addition, we tested the potential benefits of data compression for improving robustness and speed. We used synthetic spectra derived from ultracool dwarf models to construct (train) the regression models. We derived the intrinsic uncertainties of the best inference models and assessed their validity by comparing the estimated parameters with the values derived in the bibliography for a sample of ultracool dwarf stars observed from the ground.
Results.We estimated the total number of ultracool dwarfs per spectral subtype, and obtained values that can be summarised (in orders of magnitude) as 400 000 objects in the M5−L0 range, 600 objects between L0 and L5, 30 objects between L5 and T0, and 10 objects between T0 and T8. A bright ultracool dwarf (with T_{eff} = 2500 K and log (g) = 3.5) will be detected by Gaia out to approximately 220 pc, while for T_{eff} = 1500 K (spectral type L5) and the same surface gravity, this maximum distance reduces to 10−20 pc. We found the crossvalidation RMSE prediction error to be 10 K for regression models based on the knearest neighbours and 62 K for Gaussian process models in the faintest limit (Gaia magnitude G = 20). However, these values correspond to the evaluation of the regression models with independent test sets of synthetic spectra of the same model families as used in the training phase (internal errors). For the knearest neighbours model, this seems an overly optimistic error estimate due to the use of a dense grid of examples in the training set, together with a relatively high signaltonoise ratio for the endofmission data. The RMSE of the prediction deduced from groundbased spectra of ultracool dwarfs simulated at the Gaia spectral range and resolution, and for a Gaia magnitude G = 20 is 213 K and 266 K for the models based on knearest neighbours and Gaussian process regression, respectively. These are total errors in the sense that they include the internal and external errors, with the latter caused by the inability of the synthetic spectral models (used for the construction of the regression models) to exactly reproduce the observed spectra, and by the large uncertainties in the current calibrations of spectral types and effective temperatures. We found maximumlikelihood methods (minimum χ^{2}, knearest neighbours, and Bayesian estimation with flat priors) to be biased in the L0T0 range in that they systematically assign a temperature around 1700 K. Finally, the likelihood landscape is significantly multimodal in spectra with realistic noise.
Key words: methods: data analysis / methods: statistical / catalogs / brown dwarfs / stars: fundamental parameters
© ESO, 2013
1. Introduction
Fig. 1 Normalised sample spectra from the BTSettl library. The top row shows simulated Gaia RP spectra of BTSettl models for T_{eff} = 500, 1000, 1500, 2000, and 2500 K. The vertical axis is proportional to the number of photons detected in each wavelength bin. The line colours reflect the various values of log (g) available in the library of models according to the colour scale on the right. The bottom row shows the original spectra with the same temperatures as in the top row, log (g) = 5.0 and solar metallicity. 

Open with DEXTER 
In this work we define an ultracool star as a star with an effective temperature below 2500 K (spectral type M8). The goal of this paper is to assess the detectability of this type of object with the Gaia spacecraft. Gaia is a mission of the European Space Agency that will produce very accurate astrometry and parallaxes for a significant fraction of the galactic population (de Bruijne 2012), thus helping to considerably improve our knowledge of a plethora of astronomical topics, from stellar evolution to exoplanets. In particular, Gaia data will improve our understanding of the nature of ultracool dwarfs by providing distances and therefore luminosities for the nearest objects. The 2MASS AllSky survey (Cutri et al. 1996) began a new era in the discovery and characterisation of verylate spectral type stars and brown dwarfs, allowing the identification of two new types: L (Kirpatrick et al. 1999; Martín et al. 1997) and T (Kirpatrick et al. 1999; Burgasser et al. 2002), and paving the way for even cooler objects, the Y type (Burningham et al. 2008; Kirkpatrick et al. 2012). Subsequent surveys have discovered hundreds of cool objects, but the comprehensive understanding of their nature relies in modelling from internal structure to atmospheres. This can only be achieved with precise data, including accurate distances. Here, Gaia will truly play the role of a Rosetta stone.
The work presented here was developed in the framework of the eighth coordination unit (CU8) of the Gaia Data Processing and Analysis Consortium (DPAC^{1}), which is devoted to determining astrophysical parameters. Gaia is expected to detect and characterise one billion sources, and hence, automatic procedures for the reduction and processing of these data are essential.
The DPAC consortium is, in charge of the design, development, and operation of this data processing and analysis chain aimed of producing the Gaia catalogue (intermediate releases and final catalogue) from the telemetry data (see Mignard et al. 2008 for a more detailed introduction to the DPAC).
Since potential Gaia targets include very different astrophysical scenarios, from unresolved galaxies and quasars to asteroids, specialised modules have been designed and implemented within the DPAC to characterise the various object types.
The astrophysical parameters in CU8 are determined by various modules integrated in the the Apsis pipeline. Apsis includes an initial classification into broad object categories (BailerJones et al. 2008), modules for derivating stellar parameters from (amongst other observables) very low and mediumresolution spectra (Liu et al. 2012; RecioBlanco et al. 2006, respectively), and specialised modules for characterising of unresolved galaxies (Tsalmantza et al. 2012), quasars, or peculiar types of stars such as emissionline stars (Blomme et al. 2011), or cool stars.
In particular, a specific module of the Gaia processing pipeline is devoted to characterising ultracool dwarfs (UCDs), which constitutes a regression problem in which the source parameters are estimated from observational data. In the next section, we describe this module and estimate the number of sources that will be detected as a function of spectral type. In Sect. 3 we briefly describe the statistical techniques explored in the search for an optimum model for the regression problem of determining the source parameters, together with the experiments carried out in order to select amongst them. In Sect. 4 we describe the results obtained by these models when they are applied to simulated Gaia spectra of wellknown ultracool dwarfs observed with ground telescopes. These results provide a prelaunch approximation to the expected accuracy of the Gaia parameter estimates (mainly for the effective temperature). Finally, Sect. 5 summarises the main results of this work and describes the experiments that are currently being carried out to complete this study.
2. Gaia sample of ultracool dwarfs
2.1. Brief description of the Gaia capabilities
The Gaia astrometric mission was approved by the European Space Agency in 2000 and the construction of the spacecraft and payload is ongoing for a launch in mid 2013. Gaia will continuously scan the entire sky for five years, yielding positional and velocity measurements with the accuracies needed to produce a stereoscopic and kinematic census of about one billion stars throughout our Galaxy and beyond. The stellar survey will be complete to Gaia magnitude G = 20 mag, with a precision of 24 μ as at magnitude V = 15 for a solartype star (G2V). Gaia will be equipped with two spectrophotometers operating in the 330−680 nm range (blue photometer or BP) and in the 640−1000 nm range (red photometer or RP). Both spectrophotometers are based on a dispersiveprism approach, and the spectral resolutions are, in both cases, wavelength dependent. The RP has a varying resolution from 7 nm pixel^{1} at 640 nm to 15 nm pixel^{1} at 1000 nm, while the BP photometer resolution reaches from 4 to 32 nm pixel^{1} in its wavelength range. The details of the Gaia passband G and the photometric performances of the Gaia instruments are summarised in Jordi et al. (2010).
2.2. Ultracool dwarfs with Gaia
The spectral energy distributions (SEDs) of ultracool dwarfs all peak in the infrared range, and we do not expect any significant flux in the BP range (see Fig. 1). Therefore, we will mainly be concerned with detecting and characterising of these stars using RP spectra.
A rough estimate of the number of ultracool dwarfs that will be detected by Gaia per bin of spectral type can be obtained asuming a local volume density such as the one compiled by Caballero et al. (2008). These authors compiled (or derived) local densities of latetype stars and brown dwarfs between spectral types M3 and T8 from the literature, to provide estimates of contamination rates by these objecs in deep photometric surveys that searched for substellar objects or highredshift quasars. The overall shape of the local density as a function of spectral type shows three local maxima at M3, L5, and T8, and reaches a minimum of 0.22 × 10^{3} objects per cubic parsec at spectral types T0T1. For each spectral type, we computed the distance at which a mainsequence object of the corresponding Iband absolute magnitude would reach the Gaia detection limit of G = 20 using the Gaia object generator (GOG). This is one of the three Gaia generators of simulated data that also include GASS (telemetry generator) and GIBIS (image generator). The GOG is a tool designed to obtain directly simulated catalogue and main database data for the Gaia satellite, passing through the entire mission data reduction chain (Robin et al. 2012). The outputs are astrometric, photometric, and spectroscopic epoch and final data. To simulate the main database data lifecycle, GOG uses error models whose formulas are coded using the current knowledge of the Gaia mission performances. In this work, we used GOG simulations of two synthetic libraries of ultracool dwarf spectra to derive the maximum distance at which an ultracool dwarf can be detected as a function of the Iband absolute magnitude, and other properties of the Gaia UCD sample.
The first library is a composite of the AMESCond and AMESDusty models described in Allard et al. (2001). The validity ranges for these models are T_{eff}< 1400 K (AMESCond) and T_{eff}> 1700 K (AMESDusty). Therefore, there is a gap in the validity (not in the coverage) in effective temperature between 1400 and 1700 models. Models in the interregnum are available in both model families (and hence, no gap in coverage exists), and will be used in this work to interpolate between the validity domains. The second library is the BTSettl family of models (Allard et al. 2012), valid across the entire range of effective temperatures. Figure 2 shows the evolutionary tracks for the BTSettl library in the T_{eff}log (g) space for a range of masses between 0.0005 and 1.4 M_{⊙}. Throughout this work we measure effective temperatures in Kelvin and gravities in cm s^{2}. The simulation of the synthetic spectra is carried out in practice by the socalled coordination unit 2 (CU2) of the Gaia DPAC. In most of this work we concentrated the results obtained with the BTSettl library of models which produced better fits to the observed spectra used for validation in Sect. 4.
Fig. 2 Evolutionary tracks in the T_{eff}log (g) space for ultracool dwarfs according to the BTSettl library. Each line corresponds to a different mass in the range 0.0005–1.4 M_{⊙} as labelled below selected tracks. Filled circles represent individual models in the grid. These are coloured according to the decimal logarithm of the age in Gigayears as indicated by the colour scale on the right. Effective temperatures are measured in Kelvin and gravities in cm s^{2}. 

Open with DEXTER 
These two model libraries will also be used to define the mapping between the source parameters and Gaia observations described in Sect. 3. Figure 1 shows a sample of spectra from the BTSettl library together with the simulated Gaia RP spectra for a range of temperatures between 500 K and 2500 K. We obtained Gaia simulated data using the GOG. In the simulation of the libraries, CU2 takes into account the evolutionary tracks provided with the models.
Fig. 3 Maximum distances at which an ultracool dwarf can be detected by Gaia at the limiting magnitud G = 20 as a function of its absolute magnitude in the I band. These have been derived from BTSettl models (filled circles) and the continuous lines represent the interpolation used in deriving the expected counts per spectral type bin in Table 1. The black continuous line corresponds to log (g) = 5.0 and the blue line to log (g) = 3.5. The top axis shows the effective temperature measured in Kelvin for a log (g) = 5.0 object with the absolute I magnitudes shown in the x axis, according to the BTSettl models. The T_{eff} – M_{I} mapping is only bivalued below 600 K. 

Open with DEXTER 
Predicted number of counts assuming solar metallicy and auxiliary relations.
Fig. 4 Predicted number of counts per spectral type bin a) and per apparent G mag b), in logarithmic units. The black line in the left panel corresponds to the derivation based on the relation between spectral type and Iband absolute magnitude included in Caballero et al. (2008), while the blue continuous line corresponds to the relation derived from the BTSettl model family and the SLC calibration. The two horizontal (dashed) lines indicate the levels of predicted counts equal to one and ten. The righthand side plot has been obtained assuming the relation between effective temperature and Iband absolute magnitude derived from the BTSettl models and the SLC calibration. 

Open with DEXTER 
Figure 3 shows the maximum distances to mainsequence UCDs as a function of their Iband absolute magnitude. The expected number of detectable mainsequence objects in each spectral type bin can be estimated multiplying a volume density estimate (in our case, the one in Caballero et al. 2008,for spectral types between M3 and T8) by the volume of a sphere with a radius equal to the maximum distance at which an UCD corresponding to the spectral type under consideration can be detected (assuming solar metallicity). In computing these expected number counts per spectral type bin, we need to define a relationship between absolute Iband magnitude and spectral type. We used two such relations. The first one is included in Table 3 of Caballero et al. (2008) (and reproduced in Table 1 for convenience). The second relationship is derived from the Iband magnitudes and effective temperatures of the BTSettl model family, combined with the calibration of effective temperatures with spectral types by Stephens et al. (2009). We used the analytic formula based on optical spectral types of L dwarfs and infrared spectral types of the T dwarfs. This calibration (hereafter refered to as SLC calibration) is valid in the M6 to T8 range. This analysis results in the values listed in Table 1 and illustrated in Fig. 4a. Table 1 lists the relationship between spectral type and absolute magnitude in the Iband given in Caballero et al. (2008) in Col. 2 for reference. This results in the estimated number of counts under the column header Counts_{CBK08}. The relationship between spectral type and absolute magnitude in the Iband implicit to the BTSettl model family is included in Col. 3. This results in the estimated number of counts under the column header Counts_{BT − Settl}. The SLC spectral typeeffective temperature calibration (see Sect. 4) is included in Col. 6, and the (G_{BP}G_{RP}) colour index computed from the noiseless GOG simulations of BTSettl model atmospheres is included in Col. 7. Since the volume densities tabulated in Caballero et al. (2008) refer only to the main sequence, these estimates do not take into account the potential detection of lowgravity objects (i.e., essentially very young objects). The final increase in the expected number of counts is due to the steep increase in the volume density for spectral types later than T0. In deriving these estimates we only used BTSettl models of solar metallicity. The expected number of detections per apparent G magnitude is shown in Fig. 4b.
2.3. Selection criteria and contamination rates
In the Gaia processing pipeline a source will only be characterised as an ultracool dwarf if the following conditions are met:

1.
Its estimated distance is smaller than the maximum distance atwhich an ultracool star can be detected. This is the distance atwhich the brightest ultracool star would have a Gaia limitingmagnitude of G = 20. We estimated the maximum brightness of an ultracool star from the BTSettl models, and it corresponds to the hottest model with the lowest surface gravity.

2.
The source is fainter (in the G band) than the brightests model placed at the same distance as estimated by Gaia.

3.
The BPRP colour index is higher (redder) than the minimum (bluest) colour index found in the model libraries.

4.
The celestial coordinates and proper motions are not consistent with solar system Keplerian motions.
Sources that fulfil these criteria (within some margin that depends on the measurement uncertainties) are subsequently processed to estimate their effective temperature and surface gravity as described in Sect. 3. Objects not detected in the BP band (but fulfilling all other criteria) will nevertheless be selected because the nondetection is itself an indication of a red spectrum. This could imply a potential contamination of the UCD sample by faint, nearby objects close to the Gaia limiting magnitude and colour indices that are positive (thus excluding white dwarfs), but bluer than the bluest UCD. The software that implements the analysis of ultracool dwarfs distinguishes between the selection module that decides whether to process a given Gaia source, and the processing module that estimates the effective temperature and gravity of the selected sources.
To define the boundaries of brightness, colour index, and distance that define the region of ultracool sources, we analysed the GOG simulations of the AMESCond, AMESDusty and BTSettl model libraries (although the latter is prefered, and the former have not been implemented in the selection software). Using GOG, we found that the most distant ultracool object in the library (a source with T_{eff} = 2500 K and log (g) = 3.5 at G = 20) corresponds to a distance of 373 pc. Thus, the first criterion limits the processing of sources by the UCD module to sources within this radius (this is necessary optimise the processing time per source). The second criterion examines the brightness of each detected source and compares it with the aforementioned brightest model placed at the same distance. We expect a strong contamination from mainsequence stars with temperatures above the 2500 K limit because lowgravity sources are brighter than mainsequence ones for temperatures above 1600 K. In fact, we derived from the GOG simulation of the model libraries that a dwarf star of ≈4180 K has the same G mag as the lowest gravity, hottest ultracool giant considered in this work (T_{eff} = 2500 K, log (g) = 3.5). Thus, mainsequence objects up to this temperature will be selected by the selection module according to this criterium despite their higher temperatures (although the colour index criterium will reject a fraction of these hotter stars; see below). We could have used the thresholds corresponding to the mainsequence objects, but this would have resulted in the potential loss of lowgravity sources in the selection process. We opted for an inclusive set of criteria in spite of the expected high contamination rates by dwarfs hotter than 2500 K. To estimate the contamination rate (caused by the criteria but also to the measurement errors), we conducted a numerical experiment in which we populated a onekiloparsec cube with sources uniformly distributed in space (this is much larger than actually needed in view of the estimated maximum distance to an UCD). The total number of sources in each temperature bin was generated using a probability density function derived by interpolating and normalising the volume densitites tabulated in Caballero et al. (2008). Since these correspond to mainsequence sources, we neglected the contribution from ultracool giants in the following estimates.
For each star, and given the effective temperature and the distance to the centre of the cube, we computed the apparent G mag and the (V − I) colour index by interpolating in the BTSettl library restricted to dwarfs with values of log (g) ≈ 5.5. We subsequently used the current estimates of the uncertainties in the Gaia magnitudes and parallaxes (see Jordi et al. 2010 and de Bruijne 2009 respectively) to generate mock measurements of the distances and G mags by sampling from normal distributions with the prescribed uncertainties. We did not use a full covariance matrix since none was available at the time of writing. We show in Fig. 5 the error models that we used in the simulations. The uncertainty estimate in the G mag is based on Eq. (6) in Jordi et al. (2010) and on the mission parameters available at the time of writing. We included a calibration error σ_{cal} = 30 mmag for a single transit. This value is only an educated guess since the final σ_{cal} can only be estimated during the operational phase. The uncertainty in the parallax is based on Eq. (1) in de Bruijne (de Bruijne 2009; see de Bruijne 2012 for a more recent review without analytic expressions for the astrometric uncertainty).
Fig. 5 Current estimates of the endofmission uncertainties in the measurements of the G apparent magnitude (black line) and the parallax (dashed and dotted lines) as a function of the G apparent magnitude. The dashed line corresponds to (V − I) = 4 and the dotted line to (V − I) = 7.5, a plausible range for the V − I colour index according to the model libraries. 

Open with DEXTER 
The resulting sample of stars in the kiloparsec cube was then examined to determine the properties of the subsample that fulfilled the criteria enumerated above (except for the nonKeplerian motion). We found that none of the ultracool dwarfs thus generated is missed in the selection process due to errors in the measurements. This was expected given i) the inclusiveness of the criteria; ii) the relative proximity of these faint sources to the Sun; and iii) the fact that we only generated mainsequence objects in the simulation (and not lowgravity young sources that would be closer to the selection thresholds; this question will be reexamined during the software validation phase with real Gaia data and may result in more inclusive thresholds if we identify examples of this kind missed by the selection module). The uncertainties in the G magnitude and colour index typical of UCDs according to the simulations are negligible in this context.
The (G_{BP}–G_{RP}) colour index thresholds in the selection criteria are defined by the bluest model in the GOG simulations of the model libraries. For the BTSettl library of models and metallicities [M/H] between −2 and 0.5, we find that the (G_{BP}–G_{RP}) colour index is in the 4.1–14.5 range. The distribution of effective temperatures in the resulting sample of nonUCD stars that fulfil the criteria is shown in Fig. 6. The shape is determined by the combination of the various selection criteria and the uncertainties in the measurements.
Fig. 6 Histogram of the temperatures of nonUCD stars that pass the selection criteria of the Gaia UCD module. 

Open with DEXTER 
Figure 6 implies a contamination rate of abound 85%. This is explained by the exponential increase in volume densities for the mid to lateM spectral types. To take this high contamination rate into account, we generated regression models in Sect. 3 that are capable of predicting effective temperatures up to 4000 K, and not only in the UCD regime below 2500 K. Therefore, even if the selection module selects these contaminants, the processing module will allow for the obtention of purer samples of UCDs by filtering out objects with estimated temperatures above a given threshold.
In computing the contamination rate we did not consider extragalactic contaminants because their parallax measurements would be incompatible with the selection criteria defined above.
3. Methodology
As mentioned in Sect. 1, Gaia is expected to detect and process a number of sources close to one billion. This order of magnitude necessitates automating the detection and characterization processes. The main objective of the eighth coordination unit of the Gaia DPAC is to implement and evaluate automated procedures to derive the astrophysical parameters of the sources detected by the Gaia instruments. This necessarily involves techniques developed in the fields of statistical learning and data mining. In the framework of statistical learning methods, this is accomplished with regression models that are constructed from sets of examples (the socalled training set) that link the independent variables (the Gaia observations in our case) with the dependent variables that we wish to infer. In this section we describe several techniques taken from the field of statistical learning, aimed at providing a reproducible, systematic characterization of the Gaia UCD candidate sources.
In collecting the aforementioned sets of examples, libraries of stellar models and associated synthetic spectra provide a homogeneous and consistent set of examples that uniformly cover the parameter spaces under consideration. In contrast, catalogues of observed spectra with associated astrophysical parameters derived from them tend to be fragmentary in nature: each collection covers only a reduced range of parameters, and it is necessary to combine several catalogues to obtain a coverage that, even then, may contain gaps and/or insufficiently sampled regions (this is especially true for the parameter log (g), which is missing from most compilations). Each catalogue has its own observational setup, systematic errors, and selection effects. Also, observational biases may favour the abundance of examples in particular regions of the parameter space, which then translate into systematic biases in the predictions. Furthermore, parameter estimation across catalogues can be inconsistent, and we indeed find slightly different spectral types assigned to the same source in different catalogues, which reveals a certain degree of subjectivity in the assignements. With all these considerations in mind (reproducibility, consistency, homogeneity, and uniformity of the training set), we prefered to construct our regression models from the aforementioned synthetic libraries. These libraries parameterise the models with physical magnitudes (effective temperatures, gravities, and metallicities) and not with spectral types.
Model libraries are nevertheless imperfect in the sense that they do not reproduce each and every spectral feature in the real spectra of UCDs or its exact dependence on the physical parameters. The problem of the potential mismatch between stellar model libraries and observations appears whenever physical parameters are to be produced, beyond phenomenological descriptions such as spectral types. Spectral types can be inferred without the need for synthetic models, while going from spectra (or spectral types) to physical parameters requires model libraries to interpret them. In this work we do not attempt to build regression models to infer spectral types. Given the low spectral resolution of the Gaia red spectrophotometer (RP), and its spectral coverage, most of the spectral features used to decide the spectral type remain unresolved or unobserved, and thus, spectral types derived from them would be of limited use.
The systematic effects of the selection of model libraries can be characterised to some extent by comparing the predictions from different model families like those presented in the previous section (AMESCond, AMESDusty, and BTsettl). In Sect. 4 we also attempt to validate our models with an external set of effective temperatures derived from groundbased observations and spectral types via a calibration that, inevitably, encompasses another synthetic stellar library. The two families of models introduced in the previous section (AMESCond and Dusty, and the BTSettl models) are used to define the relationship between Gaia observables (the RP spectrum) and the parameters that we intend to estimate, namely T_{eff} and log (g). This relationship is captured in a regression model (not to be mistaken for the physical models of the stellar atmospheres and the resulting synthetic spectra) constructed from the set of examples defined by the two synthetic spectral libraries. Each spectrum in the libraries (e.g. those represented in the lower row of Fig. 1) is characterised by the set of physical parameters that identifies the stellar atmosphere used to synthesize it. This set of examples composed of the spectrum plus the corresponding physical parameters is referred to in the following as the training set. The set of independent examples used to assess the accuracy of the models will be referred to as the test set.
As mentioned before, the training set is constructed using the AMESCond, Dusty, and BTSettl model libraries (restricted to effective temperatures below 4000 K) and simulating Gaia observations of these synthetic models using GOG (version 8.0). Gaia has a nominal duration of five years. During this period, a source is observed on average approximately 70 times. Each time a source is observed (i.e., each transit), an epoch RP spectrum is obtained. The characteristics of this spectrum depend on the instrument design including prism and CCDs, and the transit geometric properties. In general, we can assume that the spectrum will be spread along 60 spectral bands with a nonuniform dispersion. In each transit, the position of the source (continuous) spectrum may vary with respect to the discrete CCD pixel array depending on the transit geometric details. This transit dependence results in different spectra obtained for each epoch, because the wavelength coverage of each pixel is different. This subresolution information can then be used to produce oversampled combined spectra. In Fig. 1 and in the experiments carried out in this work that are described in the following sections we assumed an endofmission oversampling factor of three, resulting in a total number of flux bins of 180.
3.1. Regression models
In this section we describe three different types of statistical regression models: knearest neighbours (Cover & Hart 1967), Gaussian processes (GPs; see e.g. Bishop 2006; Rasmussen & Williams 2006), and Bayesian inference (GPs see e.g. Sivia & Skilling 2006). A previous analysis also included support vector machines (Vapnik 1995; Cortes & Vapnik 1995) and kernel partial least squares (KPLS, Rosipal et al. 2001).
knearest neighbours estimation (kNN) is by far the simplest model and derives the parameter values as the weighted average of the elements in the training set that are closest to the input spectrum in a given metric. In our case, the euclidean distance is used to define proximity and the weights are defined as the inverse of this distance. A maximumlikelihood estimate of the uncertainties in the estimated parameters can be computed by modelling the distribution of inverse distances to the nearest neighbours (under the assumptions that the model grid is sufficiently dense and the neighbourhood has the appropriate size to sample a unimodal likelihood). It has the disadvantage that the full training set has to be stored and accessed each time that the regression model is used to predict the physical parameters (T_{eff} and log (g)) of a source, and furthermore, it is severely affected by the socalled curse of dimensionality. This is reflected in the need for exponentially growing training set sizes as the problem input dimensionality increases. The exponential growth is required to ensure that the nearest neighbours are sufficiently close to provide an accurate estimate of the parameters.
Support vector machines (SVMs) and GPs are two examples of kernel methods. These methods transform the regression problem into a dual representation where the constituents of the model are no longer the input features (or nonlinear mappings thereof) but their scalar products expressed as kernel functions. More details of this dual representation can be found in the textbooks by Bishop (2006) and Hastie et al. (2001).
A GP is defined as a probability distribution over functions such that the joint probability of the random variables defined by their evaluations at a certain set of input vectors (the training set in our case) is Gaussian. If we assume that the nature of our problem is such that the probabilistic distribution for both the targets of our training set and the Gaia observations to be characterised is well captured by a multivariate Gaussian distribution, we can construct the model by computing its covariance matrix. It turns out that we can calculate it by evaluating the kernel functions at the input vectors of the training set. These kernel functions encode both the assumed error model for our determination of the target values in the training set and the length scale for the correlations between the examples in the training set. In the full Bayesian treatment these two parameters can be considered hyperparameters and are marginalised out. Here we have determined optimal values for them through exhaustive crossvalidation experiments (see below). For the SVM, the model representation is sparse in the sense that it only depends on kernel evaluations on a small subset of the training set (the support vectors). In regression problems, the support vectors are those that lie within the boundaries of, or outside a socalled insensitive tube. Support vector regression involves the search for best (minimum error) solution in a space of hyperparameters, similar in nature to those discussed for GPs.
In this section we describe a parallel aproach to parameter estimation based on Bayes’ theorem. The main advantage of Bayesian parameter estimation stems from the fact that it provides not only an estimate of the parameters but also a full multivariate probability density distribution for the set of parameters given the observations.
We assume that we observe the spectrum s of an ultracool dwarf. In the Bayesian framework, we seek the probability density function (PDF) of the physical parameters of the UCD, given this spectrum s. Bayes’ theorem provides this PDF as (1)where θ = (T_{eff},log (g)) is the vector of parameters that we intend to derive, π(θ) is the prior probability distribution of these parameters, and f_{θ}(s) is the likelihood of the spectrum given the parameters.
The denominator in the righthand side of Eq. (1) m(s) is the prior predictive distribution or evidence defined as (2)and can be viewed as a normalisation constant.
The likelihood term encompasses a predictive model for the spectra given the parameters, together with a probabilistic error model. In our case, we used the two stellar libraries mentioned above (COND + DUSTY and BTSettl) to build the predictive model. This is captured in two threelayer perceptrons (a kind of neural network) each trained with the stellar spectra of the corresponding library. The neural network captures a multivariate regression model that can be used to perform an interpolation. Extensive experiments to derive the optimal network architecture result in hidden layers of 20 hidden units for the two model libraries. With it, we can generate output synthetic spectra for any input value of θ within the grid boundaries. The neural network exactly predicts the spectra in the COND + DUSTY and BTSettl grids, and smoothly interpolates for intermediate values. The unique flux predicted by the neural network for each wavelength in the spectrum can be turned into a probabilistic statement by convolving it with the predicted measurement errors in the current Gaia model. At present, the error model for the RP spectra consists of a Gaussian distribution with a covariance Σ that depends on a series of instrument parameters, on the number of transits, and on the flux itself (see Jordi et al. 2010, discussed above). Thus, (3)with x_{θ} being the prediction from the neural network for the parameter set θ, and f_{(xθ,Σ)}(s), the probability density function of a normal distribution 𝒩(x_{θ},Σ) evaluated at the observed spectrum s.
Nested sampling (Skilling 2006) is a Monte Carlo procedure used to calculate m(s) that can extract samples from π(θs) as a byproduct of these calculations. The algorithm exploits the relation between the likelihood f_{θ}(s) and the prior volume X defined by(4)i.e., the volume of π(θ) over the region of parameter space contained within the λ likelihood isocontour f_{θ}(s) = λ. Our prior density π(·) can be set in two ways: i) according to a bidimensional uniform distribution in (400,3500) × (3.5,5.5), or ii) using a physical prior based on the temperature histograms discussed in the previous section, and a log (g) prior that favours values typical for the field mainsequence dwarfs. We here studied the predictions of a Bayesian module for regression based on a flat prior in both parameters T_{eff} and log (g). This represents the framework for the analysis of the influence of physical priors (based for example on estimated detection rates such as those presented in Sect. 2 or on the astrometric and photometric data provided by Gaia) to be included in a subsequent paper in this series.
We used ellipsoidal sampling (Shaw et al. 2007) to estimate the parameters T_{eff} and log (g). This method is a variant of nested sampling that approximates the isolikelihood of the point to be replaced in the nested sampling step by an ndimensional ellipsoid (ϵ_{N}) derived from the covariance matrix of the current active points (see Skilling 2006, for more details). Since our stellar models are restricted in parameter space to the region 500 ≤ T_{eff} ≤ 3500, 3.5 ≤ log g ≤ 5.5, points drawn from ϵ_{N} but outside of the parameter space will be discarded.
Even though ellipsoidal sampling is aimed at simplifying the computation of the evidence, it is also possible to derive posterior probabilities from it as a byproduct. Once the algorithm has converged, the resulting sample can be interpreted as a sample from the posterior probability if we weight the importance of each point by a factor p_{i} defined as (5)with w_{i} = 1/2(X_{i − 1} − X_{i + 1}). The justification for this weighting scheme is described in Skilling (2006). Thereafter, we can obtain summary statistics such as the mean of the posterior probability density of effective temperatures using the classical firstorder moment of the distribution:
with T_{eff,i} the effective temperature value in the ith sample, and p_{i}, the weights calculated according to Eq. (5). We use this summary statistic (the firstorder moment) to estimate T_{eff}, and will discuss the effect of using other alternatives such as the posterior mode in a subsequent paper. It is planned that the Gaia catalogue will contain the samples obtained for each UCD candidate.
Ellipsoidal sampling was preferred over simpler algorithms such as MetropolisHastings or nested sampling because tests carried out with these algorithms and several proposal densities produced too low acceptance rates because of the particular shape of likelihood landscape. This results in posterior samples that are not independent. These problems are more severe in ongoing applications of the software to highly multidimensional related problems such as the estimation of star+disk parameters in premainsequence systems.
3.2. Preprocessing
Before constructing the models, the spectra were normalised to yield an area equal to 1 to have an appropriate scale of values that is robust to noise and isolated outliers. This normalisation removes the information relative to the integrated energy flux that, combined with the distance measured by Gaia, can provide indications of the T_{eff} and log (g) values of the source. This information was incorporated at a later stage, and also in the consistency checks.
Other preprocessing steps were explored to determine their impact on the performance of the algorithms. In particular, we tested denoisification strategies based on wavelet decomposition and moving averages. The GOG allows generating noisy simulated spectra for any number of transits and apparent magnitude. In the experimental setup we included GOG simulations for a set of four apparent G magnitudes (G = 15, 18, 19 and 20) and two number of transits (28 and 70). Seventy transits is an estimate of the average number of transits after five years of observations (the nominal Gaia lifetime), whilst 28 transits corresponds to the average after two years of observations, when a reliable evaluation of the algorithms can be attempted. The number of transits strongly depends on the position of the source on the sky (see e.g. Lindegren et al. 2012).
Several wavelet bases were tried in the experiments including several orders of the Daubechies, Coiflet, bestlocated and least asymmetric wavelets. In addition to denoisification strategies, we explored two data compression approaches: the wellknown principal component analysis (Pearson 1901) and the local approach based on diffusion maps (Coifman & Lafon 2006).
3.3. Internal validation of the regression models
The experiments described in this section were carried out with regression models trained with the socalled nominal dataset. This comprised GOG simulations of the original spectra in the synthetic spectral libraries (restricted to effective temperatures below 4000 K), and it corresponds to the nodes of a grid in the space of T_{eff} and log (g), with grid spacings of 100 K in T_{eff} and 0.5 dex in log (g). The performance of the regression models was measured by analysing the distribution of residual errors in the socalled random dataset. This comprises spectra linearly interpolated from the nominal grid at values of the physical parameters (T_{eff} and log (g)) randomly spread in the evolutionary tracks provided by the authors of the libraries of stellar models. The regression models can be understood as continuous nonlinear mappings between the 180dimensional space of spectra and the onedimensional space of the parameter under consideration. The random dataset is not an independent test set because it is derived from the nominal dataset using multilinear interpolation. In this sense, the performance evaluation measures not only the ability of the regression models to reproduce the training set, but unfortunately also the fidelity to the multilinear interpolation between the grid nodes. We refer to the performance measures described in this section as internal errors, in the sense that they measure the ability of the regression models to reproduce the mapping between spectra and astrophysical parameters that is inherent to the training set in the UCD domain (even if the performance also measures the fidelity to the multilinear interpolation). Internal errors, thus, evaluate only the regression model and its robustness against noise, but not the validity of the training sets or their ability to reproduce observed spectral features. In Sect. 4 we analyse the performance of the regression models by applying them to observed spectra (downgraded to the spectral resolution of the Gaia instruments). These performance measures, thus, comprise the internal errors and the ability of the training sets to reproduce the spectra of real UCDs.
Fig. 7 Crossvalidation errors for endofmission spectra of UCD stars at G = 20. The upper panels show the errors in the T_{eff} estimates for the GP model (left) and the knearest neighbours model (right). The lower panels show the corresponding errors in the log (g) estimates. The colour code for the error scale is shown in the righthand side of each row. Effective temperatures are measured in Kelvin and gravities in cm/s^{2}. 

Open with DEXTER 
The complete description and analysis of the preliminary set of experiments described in the previous paragraphs is beyond the scope of this article, but the conclusions derived from it can be summarised as follows:

Residual errors increase near the boundaries of the training sets.The region below 500 K was particularlyproblematic because prediction errors were unacceptably largeand including it in the training sets also degraded the performancefor higher temperatures. It was therefore removed from thetraining sets.

Denoisification is best achieved by using a moving average filter as measured by the root mean squared reconstruction error (RMSE), and the mean and median reconstruction error.

The lowest RMSE obtained with wavelet denoising is achieved with the Coiflet mother wavelet of the order of 18; similar RMSE are obtained with the best located wavelet of the order of 14.

Training the algorithms with a noisy training set adjusted to the signaltonoise ratios of the test spectrum yields better results than training with noiseless spectra and denoisifying the test spectrum, regardless of the denoisification strategy (moving averages or wavelets).

Prediction errors obtained with KPLS models are significantly larger (RMSE systematically above 200 K for all G mags and numbers of transits) than those obtained with GP, SVMs, or kNN models;

Prediction errors from crossvalidation experiments with the various models (except KPLS) and noiseless spectra are all the same within the experimental uncertainty as measured by the standard deviation of the RMSE sample obtained from ten experiments of tenfold cross validation. The RMSE error for the noiseless training set is 7 K.

kNN models systematically outperform both SVM and GP models when applied to noisy spectra in the estimation of T_{eff} values. Figure 7 shows a typical case where both the training (nominal) set and the test (random) set were simulated with noise properties for G = 20 and 70 transits. The left column shows the distribution of residuals for the GP model (T_{eff} and log (g) in the upper and lower panel, respectively) and the right column shows the same scatter plots for the kNN model. The RMSE in the T_{eff} predictions is 10 K for the kNN model and 62.3 for the GP model. The RMSE of the kNN model is unrealistically low as shown in the next section, and due in part to the high density of examples in the training set, in this context of crossvalidation experiments to derive internal errors. In this setup, where we assumed that the training set of synthetic spectra reproduces the expected observed spectra well, the nearest neighbours are as close as allowed by the density of the grid of training examples (given the relatively high signaltonoise ratio of the endofmission spectra). The performance of the SVM model is overall very similar to that of the GP model.

Reducing the input space dimensionality with PCA (preserving 95% of the variance) deteriorates the model performances by up to 100% with respect to the complete input spectra. This is so except for the lower signaltonoise ratios (G = 20) and in the combination of PCA and kNN, where the RMSE improves by a 30%. The improvement decreases as the noise diminishes and for G = 15, the two models (with and without PCA compression) show the same RMSE. Data compression with the nonlinear technique known as diffusion maps
Fig. 8 Effective temperatures derived from the best χ^{2} fits to BTSettl models as a function of the spectral type assigned in the literature. Black circles correspond to the compilation by Leggett, red circles to the compilation of Keck LRIS spectra by Reid, orange circles to the NIRSPEC compilation, and blue ones to the IRTF compilation. The grey continuous line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra. The dashed lines represent the same calibration ±250 K. The righthand panel shows the residuals (T_{eff}_{(predicted)} − T_{eff}_{(SLC)}) with respect to the calibration.
Open with DEXTER (Coifman & Lafon 2006) results in similar performances as the PCA approach at a much higher computational cost.
In Sect. 4 we concentrate on the external validation of the kNN and GP models only, because the SVM model performances are remarkably similar to the GP models, but do not provide estimates of the prediction uncertainty in a straightforward way. As a result of the previous considerations, we have several regression models based on the knearest neighbours algorithm and GP. There is a version of each model for the 28 and 70 transits cases, and for the various signaltonoise ratios that correspond to values of the G mag G = 15, 18, 19, and 20. For each of these cases we have a model trained in the full input space, or on a PCA compressed version of it. All parameters of each model, such as the number of nearest neighbours or the kernel and noise parameters for the GP models, are determined using tenfold crossvalidation of the models obtained in an exhaustive exploration of the parameter space. The resulting models are used in Sect. 4 to analyse the expected uncertainties when applied to spectra of ultracool dwarfs observed from ground and simulated with GOG.
Unfortunately, the internal validation described above for the kNN and GP models overestimates the real accuracy because it does not include several systematic effects. The physical models used in constructing of the training set (AMESCond, AMESDusty, and BTSettl) do not exactly reproduce all spectral features and their correlation with the physical parameters encountered in reality. This may be due to several reasons, such as incomplete line lists, inaccurate line/band opacities, or mathematical simplifications introduced for the sake of tractability (e.g. in the dust cloud formation and convection, in the diffusion of chemical species, in the departures from equilibrium between the dust and gas phases, or in the neglect of rotation). To account for these systematic errors, a second battery of experiments is discussed in Sect. 4 where groundbased spectra of wellknown ultracool stars are degraded to the Gaia resolution and are convolved with its spectral response and error model using GOG. We aim at estimating the error that affects each of the regression models in the various T_{eff} and log (g) ranges covered by the empirical spectral libraries. The spectra used for the external validation of the regression models are described in detail in the next section.
4. Validation with real spectra
In this section, we use groundbased spectra to estimate the total errors of the regression models. These total errors include the internal validation errors described in the previous section, and errors due to the imperfect representation of real spectra by synthetic models. The groundbased spectra used for validating the regression models are the compilations by Reid^{2} (Burgasser et al. 2000; Delfosse et al. 1997; Gizis et al. 2000b,a; Kirkpatrick et al. 1999, 2000; Reid et al. 1999, 2000; Strauss et al. 1999; Tsvetanov et al. 2000), Leggett^{3} (Chiu et al. 2006; Golimowski et al. 2004; Knapp et al. 2004), the NIRSPEC Brown Dwarf Spectroscopic Survey^{4} (McLean et al. 2003), and the IRTF spectral library^{5} (Cushing et al. 2005; Rayner et al. 2009). None of the observed spectra covers the full Gaia wavelength range,which is especially due to the lack of optical observations at the coolest end. Therefore, it was necessary to complete them with models before simulating the Gaia observations of these stars for ad hoc G mag (and therefore noise levels) and total number of transits. This was accomplished by degrading the resolution of the model spectra to that of the observed spectrum and finding the model that yields the minimum χ^{2} fit in the wavelength range of overlap restricted to the RP passband (after removing artefacts and wavelength ranges with poor signaltonoise ratios). This model was then used to complete the observed spectrum. We used all available models from the COND, DUSTY, and BTSettl libraries to find the bestfitting model to each of the observations, and in all cases the BTSettl model produced a χ^{2} statistic superior or comparable to the COND/DUSTY models. Spectra with gaps, small overlap with the RP band, or very low signaltonoise ratios were removed from this empirical validation set to avoid biases.
Fig. 9 T_{eff} predictions obtained by the kNN algorithm based on the BTSettl grid of models for the four libraries of groundbased spectra listed in Sect. 4 for G = 15 (top row) and G = 20 (bottom row). The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrarred spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. 

Open with DEXTER 
The best χ^{2} fits to the full resolution spectra give us a lower limit to the errors attainable in the Gaia T_{eff} estimates. Figure 8 shows in the yaxis the effective temperatures derived from the best χ^{2} fits described in the previous paragraph as a function of the spectral types assigned in the libraries. We introduced a small jitter (characterised by a standard deviation equal to 30 K) in the values of the fitted temperature to enhance the visibility of the stars with the same spectral type. In the same plot we have included the calibration of effective temperatures with spectral types by Stephens et al. (2009) as a continuous line. It corresponds to the calibration derived from optical spectral types of L dwarfs and infrared spectral types of the T dwarfs, valid in the M6 to T8 range (the SLC calibration). The SLC calibration relies on model atmospheres described in Saumon & Marley (2008), and references therein. The dashed lines correspond to the same calibration plus/minus 250 K. The righthand panel shows the difference between the predicted temperature and the temperature obtained applying the SLC calibration to the spectral types assigned in the compilations, as a function of the latter. The scatter of T_{eff} values in Fig. 8 around the SLC calibration reflects errors of the regression process, but also the uncertainty in the calibration between spectral types and effective temperatures. In this work, we are taking the effective temperature derived through the SLC relation as the ground truth. However, the SLC relation, derived from highresolution infrared spectra, which are much more appropriate for determining an effective temperature than the Gaia spectrophotometry, has an intrinsic scatter estimated by the authors as ≈100 K (although the article lacks details about how the scatter was estimated, and visual inspection of the plots suggests a much higher scatter at least in the L sequence). Furthermore, Reylé et al. (2011) compared a calibration derived from the BTSettl model library (the one used in this work) with several other calibrations for the M spectral subsequence in the literature, and found variations of the order of 200−300 K for each spectral subtype. Bayo et al. (2011) showed even higher values of the scatter for the same spectral type range (they derived the T_{eff} values from global fits to the SED). This shows that the calibration of the relationship between effective temperatures and spectral types is a problem that is not yet fully solved, and that the exact T_{eff} values derived for an UCD depend on the calibration used, with typical uncertainties of a few hundred Kelvin. The predictions of the regression modules presented in this section have to be judged in this context. In this respect, our regression models for the Gaia data have the advantage that they will provide UCD temperatures derived consistently from the BTSettl model family.
Three stars lacked precise spectral subtypes in the spectral compilations, namely 2MASS 1237392+652615, SDSS1346464–003150, and SDSS1624144 + 002916. We assumed them to be T6.5 (Kirkpatrick et al. 2011), T6.5 (Reylé et al. 2010), and T6 (Kirkpatrick et al. 2011). Figure 8 shows a tendency to assign effective temperatures around 1700 K for sources with spectral type L.
Table 2 lists the mean difference μ between the χ^{2}T_{eff} estimates and the effective temperatures derived from the SLC calibration and the spectral types cited in the spectral compilations (hereafter bias); the standard deviation with respect to the biascorrected mean (σ) displayed by the four compilations; and the RMSE without correcting for the mean bias. We list these values for the two model families (AMES Cond + Dusty and BTSettl) used in this work. All RMSE values are given in Kelvin.
Fig. 10 T_{eff} predictions obtained by the GP model based on the BTSettl grid of models for the four libraries of groundbased spectra listed in Sect. 4 for G = 15 (top row) and G = 20 (bottom row). The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. 

Open with DEXTER 
Average bias (μ) and standard deviation of the χ^{2} effective temperature (K) fits.
The compilations by Reid and Leggett have a good overall wavelength coverage in the RP range, and spectra with poor coverage were removed from the validation set. Since the expected emission in the wavelength regions of the RP range not covered by the observed spectra (bluewards of 750 nm) is negligible in the T_{eff} and log (g) parameter ranges under consideration, we expect the completions to have little or no effect on the subsequent parameter estimation with the models described in the previous section. The NIRSPEC library is in the opposite case, with a majority of spectra starting around 1.1 μm. At this wavelength, the RP transmission is close to zero, and therefore, the input for the GOG simulations of the Gaia RP spectra in the wavelength region where the transmission is high comes only from a synthetic model (the best χ^{2} match) that was actually used during the training phase of the algorithms. As a consequence, the errors in the parameter estimates for these stars are overly optimistic. The IRTF library is an intermediate case with most spectra covering the wavelength region above 0.8 μm where most of the source flux is concentrated. Thus, the completion will not have a relevant impact on the resulting simulated RP spectrum.
Figures 9 and 10 show the predictions of the kNN and GP models for the empirical libraries described in previous paragraphs. The predicted values of T_{eff} are compared with the effective temperature assigned by the SLC calibration to the spectral type provided by the empirical libraries. The model used in all panels corresponds to the one trained with noisy spectra of G = 20 and 70 transits. The upper panels describe the performance of the model when applied to noisy spectra corresponding to G = 15 and 70 transits whilst the lower panel shows the results for the same model applied to GOG simulations at G = 20 and 70 transits. For each spectrum simulated by GOG, we constructed ten noisy replicates using the Gaia error model currently available, and predict the values of T_{eff} and log (g) for each one of them. The scatter in the predictions for these noisy replicates is better visible in the lower panels (spectra simulated at G = 20).
Root mean square errors (RMSE) of the GP and kNN models for the prediction of T_{eff}, applied to the compilations of empirical spectra.
Table 3 collects the root mean squared errors (RMSE) of the T_{eff} estimates (expressed in Kelvin) obtained by the GP and kNN models trained with a collection of synthetic spectra of the BTSettl library simulated for an apparent magnitude G = 20. The models are then applied to the four libraries of empirical spectra simulated at 28 and 70 transits, and for apparent magnitudes G = 15, 18, and 20. The prefix PCA refers to the models built for the input space of principal componets. The values in parenthesis correspond to the RMSE after correcting for the systematic bias in GP predictions. We also include the RMSE of the Bayesian inference modules built from the BTSettl and COND/DUSTY model libraries for comparison.
4.1. Bayesian inference
Fig. 11 T_{eff} predictions obtained from the ellipsoidal samplings (BTSettl) for the four empirical libraries of groundbased spectra. The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. The top row corresponds to the spectra simulated at G = 15 and the bottom row corresponds to the G = 20 replicates. 

Open with DEXTER 
Figure 11 shows the T_{eff} predictions obtained from the ellipsoidal samplings, using the neural network trained with the BTSettl grid of models and the current error model for the Gaia RP spectra. The predictions are obtained as before, for ten replicates of each GOG simulation of an empirical spectrum. It shows a tendency to predict effective temperatures around 1700 K for stars with spectral type L. Furthermore, there are indications of a second attractor slightly above 1000 K for stars and brown dwarfs with spectral types between L5 and T5. These systematic biases can also be recognised in Fig. 8 (representing the T_{eff} estimates from χ^{2} fits) and 9 (representing the T_{eff} estimates from knearest neighbours), but not in Fig. 10 which corresponds to the GP model. The fact that the Bayesian inference shows the same kind of biases as nearest neighbours and the χ^{2} fits is not surprising because Bayesian inference is equivalent to maximum likelihood estimation under flat priors such as those used in this work. Minimum χ^{2} fits and nearest neighbours are special cases of maximum likelihood estimation.
To understand the nature of this trend to concentrate the predictions by the Bayes module around 1700−1800 K, we plot in Fig. 12 the best fits produced by the Bayesian inference module (blue), the model corresponding to the spectral type assigned in the spectral compilations (orange), and finally, one of the ten noisy replicates of the spectrum simulated with GOG for G = 20 (black), for two stars with spectral types L1 (left) and T0 (right), both of which are predicted to have temperatures between 1600 and 1700 K. For these two stars we also plot the original spectra for reference (bottom row).
The reason for this systematic effect seems related to the fact that between 1600 and 1800 K, the model spectra undergo, especially at log (g) = 5.0, rapid changes with temperature, whilst before and after this range, we find plateaux where the spectra have only a mild dependence with temperature (see Fig. 13). From the sampling perspective, relatively strong changes in the proposed temperatures falling within the plateaux regions result in small changes in the likelihood. Sampling in the 1600−1800 K region on the contrary provides a more varied range of models susceptible to better fit the observed noisy spectrum.
Fig. 12 (Top panel, black continuous lines) GOG simulations of spectra contained in the Legget compilation corresponding to 2MASS0345 + 25 (left) and SDSS1511 + 06 (right) for G = 20 and 70 transits. The blue line represents in both panels the model corresponding to the mode of the posterior probability as derived using ellipsoidal sampling. In orange, the model that corresponds to the effective temperature derived from the spectral type (L1 and T0 respectively) using the SLC calibration. The lower row shows the original spectra completed with the best χ^{2} model. 

Open with DEXTER 
Fig. 13 BTSettl model spectra between 1200 and 2000 K (the range of temperatures where the Bayesian module shows a clear tendency to concentrate predictions around 1600−1800 K) and log (g) = 5.0. Each panel shows spectra in an interval of temperatures of 200 K. The colour code represents the increment in effective temperature with respect to the lowest temperature covered in the panel. Blue lines correspond to this lowest temperature and red continuous lines correspond to the lowest temperature plus 200 K. 

Open with DEXTER 
Fig. 14 Loglikelihood landscapes for a G = 20 noisy replicate of the spectrum of SDSS0107 (left), and the BTSettl model for 1550 K and log (g) = 5.0 (right). 

Open with DEXTER 
The detailed analysis of the predictions reveals a large variance within some of the blocks of ten noisy replicates of a given spectrum. A good example of this kind of problems is provided by SDSS J010752.33 + 004156.1, where six estimates cluster around 1036 K whilst the remaining four cluster around 1664 K. As a reference, the spectral type quoted for this object is L5.5 or, equivalently, 1554 K according to the SLC calibration. This apparent inconsistency is due to the multimodal posterior distribution, which is closely related to the likelihood landscape for flat priors and noisy spectra. Figure 14a shows the loglikelihood distribution (derived from one noisy GOG simulation of the spectrum in the Leggett compilation) as a function of the parameters for SDSS J010752.33+004156.1. For comparison, we show the equivalent plot for the model in the BTSettl grid corresponding to θ = (1550 K,5.0) (Fig. 14b). Whilst the loglikelihood landscape for the BTSettl model is unimodal, the loglikelihood landscape for the noisy spectrum of SDSS J010752.33 + 004156.1 shows maxima of comparable height at 1036 and 1664 K (in addition to other local maxima in the range 1000 K <T_{eff}< 1500 K, 3.5 < log (g) < 4.0). Depending on the particular realisation of the noise, the ellipsoidal sampling will converge to different maxima. In practice, we find cases of clear bimodality in spectra of stars with spectral types between L1 and T2 (T_{eff} between 1200 and 2000 K).
This problem is easily solved by using priors based on the additional information provided by the Gaia astrometric measurements, as suggested in Sect. 3. For this case, Fig. 15 shows a particular choice of the prior that would discard all local maxima that are inconsistent with the Gaia photometry/astrometry, under the assumption of negligible circumstellar and interstellar extinction. In it, we represent the distribution of absolute G mags of the BTSettl models as a function of the dependent parameters T_{eff} and log (g). The continuous line shows the 1Σ countour of the twodimensional Gaussian prior (with Σ being the covariance matrix). The prior is fully defined by the mean μ_{prior} and covariance Σ_{prior} of the Gaussian prior distribution. Given a set of astrometric, photometric and spectrophotometric observations such as those simulated for G = 20.0 with GOG for SDSS J010752.33+004156.1, and the photometric and astrometric errors shown in Fig. 5, we calculate an absolute G mag equal to , where the uncertainties are derived from the values shown in Fig. 5 multiplied by five. We used five times the nominal uncertainties in the apparent G mag and π (the parallax) to account for the potential mismatch of the modelpredicted magnitudes with respect to the real distribution. We did not take the LutzKelker bias into account in the computation of the uncertainties of the absolute G magnitude. Models within the uncertainties in the absolute G mag () are shown in Fig. 15 as black circles. We used the T_{eff} and log (g) parameters of these models to propose the values of μ_{prior} and Σ_{prior} that were used to draw the 1Σ isocountour in Fig. 15. This prior is eleven orders of magnitude larger at the 1664 K maximum than at the 1036 K one, and thus renders this local maximum insignificant in the posterior probability density distribution. There is a narrow local maximum at T_{eff} ≈ 1500 K and log (g) ≈ 4.9 in Fig. 14 where the prior is only one order of magnitude smaller than that corresponding to the 1664 K, but this maximum is never significantly sampled in any of our ten replicates due to its narrowness.
Fig. 15 Absolute G mags for the BTSettl model library as a function of the physical parameters T_{eff} and log (g) (see colour code at the righthand side of the scatter plot). Models within (corresponding to SDSS J010752.33+004156.1 simulated at G = 20.0) are shown as black circles. The ellipse shows the 1Σ isocontour of the Gaussian physical prior described in the text. 

Open with DEXTER 
Fig. 16 T_{eff}log (g) predictions for the empirical spectral libraries obtained with the GP model (left), the kNN model (middle), and the Bayesian inference (right). The colour code is the same as in Fig. 8. The ellipses correspond to the covariance estimated from ten noisy replicates of the GOG simulated spectra (G = 15). 

Open with DEXTER 
Fig. 17 log (g) predictions for the empirical spectral libraries obtained with the GP model (left), the kNN model (middle), and the Bayesian inference (right), as a function of the log (g) value of the minimum χ^{2} fit (jittered with a Gaussian distribution of σ = 0.2). The colour code is the same as in Fig. 8. 

Open with DEXTER 
4.2. Model selection
Ellipsoidal sampling allows estimating the evidence given an observed spectrum and a model choice (COND+DUSTY or BTSettl), as defined in Eq. (2). Therefore, it is possible to compare these two model families from a Bayesian perspective. M_{0} denotes the BTSettl model library and M_{1} the COND + DUSTY library. If these were the only two alternatives, then the Bayes factor (BF) in support of the M_{0} model would be defined as the ratio of the respective marginal densities (evidences) of the data for the two models, (6)If π_{0} and π_{1} denote the respective prior probabilities (in our case π_{0} = π_{1} = 1/2), the posterior probability of M_{0} is given by (7)We obtain that 73.4% of the spectra support the BTSettl model library against 26.6% supporting the CONDDUSTY combination. For this reason we have exemplified the regression results with figures and discussion related to the models obtained with the BTSettl library.
4.3. Estimates of the gravity
Figure 16 shows the T_{eff} – log (g) diagrams obtained from the three models discussed in this section. The ellipses represent the covariance estimated from the ten noisy replicates at G = 15 of each GOG simulations of the empirical spectra. Since we do not have a compilation of surface gravities available to assess the overall validity of the predictions, we compare the regression values with those obtained from the χ^{2} fitting to the full resolution spectra. Figure 17 shows this comparison. The χ^{2} fit values are jittered with a Gaussian distribution of standard deviation equal to 0.2 to enhance the visibility. If the χ^{2} estimates are taken as targets, only the GP model can be used to obtain very rough estimates of the gravity and to tag lowgravity candidates.
The only star in our samples with an indication of low gravity in the comments section of the Dwarf Archives^{6} is the NIRSPEC target star 2MASS J1726000 + 153819. For this star, both the kNN model and the Bayesian estimate agree to assign a value of log (g) = 3.5, while the GP model assigns a higher value log (g) = 4.1.
5. Conclusions
We have presented the module that will be in charge of detecting and characterising ultracool dwarfs in the Gaia database. The module is subject to change and improvement, but this implementation provides the baseline performance that can be expected from it.
We used the current instrument models and the estimated spatial densities by Caballero et al. (2008) to predict the expected number of ultracool dwarfs per spectral type bin. We found that Gaia will be able to detect significant numbers (around or above ten detections) for UCDs of spectral types hotter than L67 v. We also used the BTSettl library of synthetic spectra to define selection criteria for the UCD module such that no UCD is missed due to measurement errors. Given the spatial densities estimated by Caballero et al. (2008), we derived contamination rates from stars hotter than 2500 K in the resulting samples.
We conducted an extensive study to find the best statistical regression model of the relationship between the observed Gaia RP spectrum and the source physical parameters (T_{eff} and log (g)). We evaluated several alternatives in view of their internal and external errors. The internal errors were estimated with cross validation experiments with a dataset interpolated from the nominal grid of models provided by the libraries of synthetic spectra. The external validation was carried out by applying the regression models to an independent set of UCD spectra observed from the ground. All these experiments were carried out on GOG simulations of the fullresolution spectra for a number of apparent G mags and numbers of transits.
As a result, we found that the expected endofmission error of the UCD module for the faintest detectable UCDs (G = 20) is 210 K for the kNN module and 260 for the GP module (207 K if a bias correction is applied). These performances are approximately constant as a function of G, at least down to G = 15 (a typical apparent magnitude for the brightest UCDs in the Gaia catalogue), and are remarkably close to the performance of a simple χ^{2} fit with the fullresolution spectrum.
The Bayesian inference of the source parameters shows systematic deviations in the distribution of predicted temperatures, which are also apparent (although less conspicuous) in the χ^{2} fits and kNN predictions. It is also severely affected by the multimodality of the likelihood maps. The application of physical priors and advanced sampling techniques capable of identifying multiple modes in the posterior will be the subject of a forthcoming paper in this series.
The logg predictions are characterised by a typical RMSE of 0.2 dex for the GP module (0.7 dex for the kNN module) as measured by the crossvalidation experiments. Unfortunately, these error estimates are overly optimistic because they are derived from testing the regression modules on synthetic spectra and not on observed spectra of real UCDs. The log (g) estimates for the empirical spectra observed from the ground, although broadly consistent with the expected distribution of values in the samples of empirical spectra, prove that the errors quoted above are indeed extremely optimistic. The GP module will undoubtedly benefit from a more realistic (i.e. wavelengthdependent) treatment of the noise parameter.
Acknowledgments
The authors wish to acknowledge the Coordination Unit 2 of the Gaia DPAC for the use of the GOG simulator, and Rosanna Sordo for their kind help and guidance with the simulation of both synthetic and empirical spectra at the Gaia instrumental characteristics. We would also like to thank the referee, Coryn BailerJones, for the insightful comments that significantly improved the first version of the manuscript. LS aknowledges José Caballero for his suggestions regarding the estimation of the number counts of detectable UCDs according to his volume density estimation. This research has been supported by the Spanish Ministry of Science through grants AyA201124052, AyA201021161C0202, AyA200914648C0201, CONSOLIDER CSD200600070, CSD200700050, and PRICITS2009/ESP1496.
References
 Allard, F., Hauschildt, P. H., Alexander, D. R., Tamanai, A., & Schweitzer, A. 2001, ApJ, 556, 357 [NASA ADS] [CrossRef] (In the text)
 Allard, F., Homeier, D., & Freytag, B. 2012, Roy. Soc. London Philos. Trans. Ser. A, 370, 2765 [NASA ADS] [CrossRef] (In the text)
 BailerJones, C. A. L., Smith, K. W., Tiede, C., Sordo, R., & Vallenari, A. 2008, MNRAS, 391, 1838 [NASA ADS] [CrossRef] (In the text)
 Bayo, A., Barrado, D., Stauffer, J., et al. 2011, A&A, 536, A63 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Bishop, C. M. 2006, Pattern Recognition and Machine Learning (Information Science and Statistics) (Secaucus, NJ, USA: SpringerVerlag New York, Inc.) (In the text)
 Blomme, R., Frémat, Y., Lobel, A., & Martayan, C. 2011, in EAS Publ. Ser., 45, 373 (In the text)
 Burgasser, A. J., Kirkpatrick, J. D., Reid, I. N., et al. 2000, AJ, 120, 473 [NASA ADS] [CrossRef] (In the text)
 Caballero, J. A., Burgasser, A. J., & Klement, R. 2008, A&A, 488, 181 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Chiu, K., Fan, X., Leggett, S. K., et al. 2006, AJ, 131, 2722 [NASA ADS] [CrossRef] (In the text)
 Coifman, R., & Lafon, S. 2006, Appl. Comput. Harm. Anal., 21, 5 [CrossRef] (In the text)
 Cortes, C., & Vapnik, V. 1995, Machine Learning, 20, 273, 10.1007/ BF00994018 (In the text)
 Cover, T. M., & Hart, P. E. 1967, IEEE Trans. Inf. Theory, 13, 21 [NASA ADS] [CrossRef] (In the text)
 Cushing, M. C., Rayner, J. T., & Vacca, W. D. 2005, ApJ, 623, 1115 [NASA ADS] [CrossRef] (In the text)
 de Bruijne, J. H. J. 2009, Gaia astrometric performance: summer2009 status, ESA/ESTEC, Tech. rep. (In the text)
 de Bruijne, J. H. J. 2012, Ap&SS, 341, 31 [NASA ADS] [CrossRef] (In the text)
 Delfosse, X., Tinney, C. G., Forveille, T., et al. 1997, A&A, 327, L25 [NASA ADS] (In the text)
 Gizis, J. E., Monet, D. G., Reid, I. N., Kirkpatrick, J. D., & Burgasser, A. J. 2000a, MNRAS, 311, 385 [NASA ADS] [CrossRef] (In the text)
 Gizis, J. E., Monet, D. G., Reid, I. N., et al. 2000b, AJ, 120, 1085 [NASA ADS] [CrossRef] (In the text)
 Golimowski, D. A., Leggett, S. K., Marley, M. S., et al. 2004, AJ, 127, 3516 [NASA ADS] [CrossRef] (In the text)
 Hastie, T., Tibshirani, R., & Friedman, J. H. 2001, The elements of statistical learning: data mining, inference, and prediction: with 200 fullcolor illustrations (New York: SpringerVerlag), 533 (In the text)
 Jordi, C., Gebran, M., Carrasco, J. M., et al. 2010, A&A, 523, A48 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Kirkpatrick, J. D., Reid, I. N., Liebert, J., et al. 1999, ApJ, 519, 802 [NASA ADS] [CrossRef] (In the text)
 Kirkpatrick, J. D., Reid, I. N., Liebert, J., et al. 2000, AJ, 120, 447 [NASA ADS] [CrossRef] (In the text)
 Kirkpatrick, J. D., Cushing, M. C., Gelino, C. R., et al. 2011, ApJS, 197, 19 [NASA ADS] [CrossRef] (In the text)
 Knapp, G. R., Leggett, S. K., Fan, X., et al. 2004, AJ, 127, 3553 [NASA ADS] [CrossRef] (In the text)
 Lindegren, L., Lammers, U., Hobbs, D., et al. 2012, A&A, 538, A78 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Liu, C., BailerJones, C. A. L., Sordo, R., et al. 2012, MNRAS, 426, 2463 [NASA ADS] [CrossRef] (In the text)
 McLean, I. S., McGovern, M. R., Burgasser, A. J., et al. 2003, ApJ, 596, 561 [NASA ADS] [CrossRef] [MathSciNet] (In the text)
 Mignard, F., BailerJones, C., Bastian, U., et al. 2008, in IAU Symp. 248, eds. W. J. Jin, I. Platais, & M. A. C. Perryman, 224 (In the text)
 Pearson, K. 1901, Philos. Mag., 2, 559 [CrossRef] (In the text)
 Rasmussen, C., & Williams, C. 2006, Gaussian processes for machine learning, Adaptive computation and machine learning (MIT Press) (In the text)
 Rayner, J. T., Cushing, M. C., & Vacca, W. D. 2009, ApJS, 185, 289 [NASA ADS] [CrossRef] (In the text)
 RecioBlanco, A., Bijaoui, A., & de Laverny, P. 2006, MNRAS, 370, 141 [NASA ADS] [CrossRef] (In the text)
 Reid, I. N., Kirkpatrick, J. D., Gizis, J. E., & Liebert, J. 1999, ApJ, 527, L105 [NASA ADS] [CrossRef] [PubMed] (In the text)
 Reid, I. N., Kirkpatrick, J. D., Gizis, J. E., et al. 2000, AJ, 119, 369 [NASA ADS] [CrossRef] (In the text)
 Reylé, C., Delorme, P., Willott, C. J., et al. 2010, A&A, 522, A112 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Reylé, C., Rajpurohit, A. S., Schultheis, M., & Allard, F. 2011, in Stellar Systems, and the Sun, eds. C. JohnsKrull, M. K. Browning, & A. A. West, ASP Conf. Ser., 448, 16th Cambridge Workshop on Cool Stars, 929 (In the text)
 Robin, A. C., Luri, X., Reylé, C., et al. 2012, A&A, 543, A100 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Rosipal, R., Be, P. P., Trejo, L. J., et al. 2001, J. Mach. Learn. Res., 2, 97 (In the text)
 Saumon, D., & Marley, M. S. 2008, ApJ, 689, 1327 [NASA ADS] [CrossRef] (In the text)
 Shaw, J., Bridges, M., & Hobson, M. 2007, MNRAS, 378, 1365 [NASA ADS] [CrossRef] (In the text)
 Sivia, D., & Skilling, J. 2006, Data analysis: a Bayesian tutorial, Oxford science publications (Oxford University Press) (In the text)
 Skilling, J. 2006, Bayesian Anal., 1, 833 [CrossRef] [MathSciNet] (In the text)
 Stephens, D. C., Leggett, S. K., Cushing, M. C., et al. 2009, ApJ, 702, 154 [NASA ADS] [CrossRef] (In the text)
 Strauss, M. A., Fan, X., Gunn, J. E., et al. 1999, ApJ, 522, L61 [NASA ADS] [CrossRef] (In the text)
 Tsalmantza, P., Karampelas, A., Kontizas, M., et al. 2012, A&A, 537, A42 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
 Tsvetanov, Z. I., Golimowski, D. A., Zheng, W., et al. 2000, ApJ, 531, L61 [NASA ADS] [CrossRef] [PubMed] (In the text)
 Vapnik, V. N. 1995, The nature of statistical learning theory (New York, NY, USA: SpringerVerlag New York, Inc.) (In the text)
All Tables
Average bias (μ) and standard deviation of the χ^{2} effective temperature (K) fits.
Root mean square errors (RMSE) of the GP and kNN models for the prediction of T_{eff}, applied to the compilations of empirical spectra.
All Figures
Fig. 1 Normalised sample spectra from the BTSettl library. The top row shows simulated Gaia RP spectra of BTSettl models for T_{eff} = 500, 1000, 1500, 2000, and 2500 K. The vertical axis is proportional to the number of photons detected in each wavelength bin. The line colours reflect the various values of log (g) available in the library of models according to the colour scale on the right. The bottom row shows the original spectra with the same temperatures as in the top row, log (g) = 5.0 and solar metallicity. 

Open with DEXTER  
In the text 
Fig. 2 Evolutionary tracks in the T_{eff}log (g) space for ultracool dwarfs according to the BTSettl library. Each line corresponds to a different mass in the range 0.0005–1.4 M_{⊙} as labelled below selected tracks. Filled circles represent individual models in the grid. These are coloured according to the decimal logarithm of the age in Gigayears as indicated by the colour scale on the right. Effective temperatures are measured in Kelvin and gravities in cm s^{2}. 

Open with DEXTER  
In the text 
Fig. 3 Maximum distances at which an ultracool dwarf can be detected by Gaia at the limiting magnitud G = 20 as a function of its absolute magnitude in the I band. These have been derived from BTSettl models (filled circles) and the continuous lines represent the interpolation used in deriving the expected counts per spectral type bin in Table 1. The black continuous line corresponds to log (g) = 5.0 and the blue line to log (g) = 3.5. The top axis shows the effective temperature measured in Kelvin for a log (g) = 5.0 object with the absolute I magnitudes shown in the x axis, according to the BTSettl models. The T_{eff} – M_{I} mapping is only bivalued below 600 K. 

Open with DEXTER  
In the text 
Fig. 4 Predicted number of counts per spectral type bin a) and per apparent G mag b), in logarithmic units. The black line in the left panel corresponds to the derivation based on the relation between spectral type and Iband absolute magnitude included in Caballero et al. (2008), while the blue continuous line corresponds to the relation derived from the BTSettl model family and the SLC calibration. The two horizontal (dashed) lines indicate the levels of predicted counts equal to one and ten. The righthand side plot has been obtained assuming the relation between effective temperature and Iband absolute magnitude derived from the BTSettl models and the SLC calibration. 

Open with DEXTER  
In the text 
Fig. 5 Current estimates of the endofmission uncertainties in the measurements of the G apparent magnitude (black line) and the parallax (dashed and dotted lines) as a function of the G apparent magnitude. The dashed line corresponds to (V − I) = 4 and the dotted line to (V − I) = 7.5, a plausible range for the V − I colour index according to the model libraries. 

Open with DEXTER  
In the text 
Fig. 6 Histogram of the temperatures of nonUCD stars that pass the selection criteria of the Gaia UCD module. 

Open with DEXTER  
In the text 
Fig. 7 Crossvalidation errors for endofmission spectra of UCD stars at G = 20. The upper panels show the errors in the T_{eff} estimates for the GP model (left) and the knearest neighbours model (right). The lower panels show the corresponding errors in the log (g) estimates. The colour code for the error scale is shown in the righthand side of each row. Effective temperatures are measured in Kelvin and gravities in cm/s^{2}. 

Open with DEXTER  
In the text 
Fig. 8 Effective temperatures derived from the best χ^{2} fits to BTSettl models as a function of the spectral type assigned in the literature. Black circles correspond to the compilation by Leggett, red circles to the compilation of Keck LRIS spectra by Reid, orange circles to the NIRSPEC compilation, and blue ones to the IRTF compilation. The grey continuous line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra. The dashed lines represent the same calibration ±250 K. The righthand panel shows the residuals (T_{eff}_{(predicted)} − T_{eff}_{(SLC)}) with respect to the calibration. 

Open with DEXTER  
In the text 
Fig. 9 T_{eff} predictions obtained by the kNN algorithm based on the BTSettl grid of models for the four libraries of groundbased spectra listed in Sect. 4 for G = 15 (top row) and G = 20 (bottom row). The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrarred spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. 

Open with DEXTER  
In the text 
Fig. 10 T_{eff} predictions obtained by the GP model based on the BTSettl grid of models for the four libraries of groundbased spectra listed in Sect. 4 for G = 15 (top row) and G = 20 (bottom row). The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. 

Open with DEXTER  
In the text 
Fig. 11 T_{eff} predictions obtained from the ellipsoidal samplings (BTSettl) for the four empirical libraries of groundbased spectra. The colour code is the same as used in Fig. 8. The xaxis shows the spectral types gathered from the literature cited in the spectral compilations. The grey line shows the T_{eff}spectral type calibration by Stephens et al. (2009) from optical and infrared spectra (± 250 K, grey dashed lines). The righthand panels show the residuals with respect to the calibration as in Fig. 8. The top row corresponds to the spectra simulated at G = 15 and the bottom row corresponds to the G = 20 replicates. 

Open with DEXTER  
In the text 
Fig. 12 (Top panel, black continuous lines) GOG simulations of spectra contained in the Legget compilation corresponding to 2MASS0345 + 25 (left) and SDSS1511 + 06 (right) for G = 20 and 70 transits. The blue line represents in both panels the model corresponding to the mode of the posterior probability as derived using ellipsoidal sampling. In orange, the model that corresponds to the effective temperature derived from the spectral type (L1 and T0 respectively) using the SLC calibration. The lower row shows the original spectra completed with the best χ^{2} model. 

Open with DEXTER  
In the text 
Fig. 13 BTSettl model spectra between 1200 and 2000 K (the range of temperatures where the Bayesian module shows a clear tendency to concentrate predictions around 1600−1800 K) and log (g) = 5.0. Each panel shows spectra in an interval of temperatures of 200 K. The colour code represents the increment in effective temperature with respect to the lowest temperature covered in the panel. Blue lines correspond to this lowest temperature and red continuous lines correspond to the lowest temperature plus 200 K. 

Open with DEXTER  
In the text 
Fig. 14 Loglikelihood landscapes for a G = 20 noisy replicate of the spectrum of SDSS0107 (left), and the BTSettl model for 1550 K and log (g) = 5.0 (right). 

Open with DEXTER  
In the text 
Fig. 15 Absolute G mags for the BTSettl model library as a function of the physical parameters T_{eff} and log (g) (see colour code at the righthand side of the scatter plot). Models within (corresponding to SDSS J010752.33+004156.1 simulated at G = 20.0) are shown as black circles. The ellipse shows the 1Σ isocontour of the Gaussian physical prior described in the text. 

Open with DEXTER  
In the text 
Fig. 16 T_{eff}log (g) predictions for the empirical spectral libraries obtained with the GP model (left), the kNN model (middle), and the Bayesian inference (right). The colour code is the same as in Fig. 8. The ellipses correspond to the covariance estimated from ten noisy replicates of the GOG simulated spectra (G = 15). 

Open with DEXTER  
In the text 
Fig. 17 log (g) predictions for the empirical spectral libraries obtained with the GP model (left), the kNN model (middle), and the Bayesian inference (right), as a function of the log (g) value of the minimum χ^{2} fit (jittered with a Gaussian distribution of σ = 0.2). The colour code is the same as in Fig. 8. 

Open with DEXTER  
In the text 