Gaia Data Release 3 Catalogue Validation

Context. The third Gaia data release (DR3) provides a wealth of new data products. The early part of the release, Gaia EDR3, already provided the astrometric and photometric data for nearly two billion sources. The full release now adds improved parameters compared to Gaia DR2 for radial velocities, astrophysical parameters, variability information, light curves, and orbits for Solar System objects. The improvements are in terms of the number of sources, the variety of parameter information, precision, and accuracy. For the first time, Gaia DR3 also provides a sample of spectrophotometry and spectra obtained with the Radial Velocity Spectrometer, binary star solutions, and a characterisation of extragalactic object candidates. Aims. Before the publication of the catalogue, these data have undergone a dedicated transversal validation process. The aim of this paper is to highlight limitations of the data that were found during this process and to provide recommendations for the usage of the catalogue. Methods. The validation was obtained through a statistical analysis of the data, a confirmation of the internal consistency of different products, and a comparison of the values to external data or models. Results. Gaia DR3 is a new major step forward in terms of the number, diversity, precision, and accuracy of the Gaia products. As always in such a large and complex catalogue, however, issues and limitations have also been found. Detailed examples of the scientific quality of the Gaia DR3 release can be found in the accompanying data-processing papers as well as in the performance verification papers. Here we focus only on the caveats that the user should be aware of to scientifically exploit the data.


Introduction
This paper describes the validation of the third Gaia data release, Gaia DR3 (Gaia Collaboration et al. 2016; Vallenari 2022). The validation of the astrometric and photometric content can be found in the Gaia EDR3 validation paper (Fabricius et al. 2021). We focus here on the new products of Gaia DR3, which are summarised in Vallenari (2022). The main new products of Gaia DR3 are the radial velocities, as well as line broadening and G_RVS magnitude, astrophysical parameters, variable stars, Solar System objects, and for the first time, spectra (both from the spectrophotometric instrument and from the Radial Velocity Spectrometer (RVS)), non-single stars, and quasar (QSO) and galaxy candidates, and associated characterisation. The processing papers 1 and the on-line documentation 2 describe the data and their internal validation in detail. The performance verification papers 1 highlight the overall quality of the catalogue. In this paper, we focus on presenting the caveats that the Gaia DR3 users should be aware of. Although the scientific validation process has confirmed the high quality of the Gaia EDR3 data, certain issues remain, and there are caveats. We highlight them here and provide advice to the users.
The approach followed by the validation presented in this paper is a transverse analysis of the properties of the catalogue content. Tests are either internal (including overall statistics, correlations, and clustering analysis between catalogue entries) or use external data, a Galaxy model, or clusters. The comparison with a Galaxy model was made using the Gaia object generator (GOG20) as a reference model or the Gaia universe model snapshot (GUMS20, which contains the intrinsic properties of the objects generated by GOG). They are described in detail in the on-line documentation 3 and were released with the Gaia EDR3 set of catalogues 4 .
Although the validation tests were designed to be transverse, we organise the paper by product for convenience. We therefore discuss in turn the Radial Velocity Spectrometer products (Sect. 2: radial velocities, line broadening, G_RVS magnitude, and RVS spectra), the low-resolution (Blue and Red Photometers, BP

Fig. 1. Differences in radial velocities of the members of close pairs of sources as a function of the square of the angular separation in arcsec² (top) and after filtering (bottom). The red lines enclose one of the criteria that were used to filter the problematic cases.

Radial Velocity Spectrometer
While only radial velocities were provided in Gaia DR2, more products from the Radial Velocity Spectrometer (RVS) are available in Gaia DR3 within the gaia_source table: the radial velocities (radial_velocity) down to G_RVS = 14, the spectral line broadening (vbroad), and the magnitude G_RVS estimated from the flux of the RVS spectra (grvs_mag). Moreover, a subset of RVS spectra is available through the rvs_mean_spectrum datalink 5 .

Radial velocity
The radial velocity (RV) data are presented in detail in Katz et al. (2022). The radial velocities of hot stars are discussed specifically in Blomme et al. (2022).

Radial velocity contaminants
During the internal validation of a preliminary version of the catalogue, we detected erroneous radial velocities due to nearby bright contaminant sources, which was a well-known issue for Gaia DR2 (Boubert et al. 2019; Seabroke et al. 2021). This is illustrated in Fig. 1 (top), where we took all pairs of sources whose components are closer than 10 arcsec and plotted the absolute value of their difference in radial velocity versus the square of their angular separation. In this plot, optical pairs should contribute a constant density of points at a given ordinate, while physical binaries should contribute pairs with small RV differences. If in a given transit the dispersion direction of the spectra is oriented close to the line joining the two sources, the lines from both sources will be present in both spectra. If in addition the lines from the neighbouring source are confused with the lines of the (fainter) target source, the target is given an erroneous RV, which differs from the RV of its neighbour in proportion to the separation. This will normally just result in an outlying observation, but if a particular scan direction dominates, the final radial velocity difference will become 145 km s⁻¹ for each arcsecond of separation. In the upper panel of Fig. 1, we see a rounded front, a parabola, closely matching the predicted 145 km s⁻¹ arcsec⁻¹ dependence. This is a clear sign that source confusion is occurring. We also note a second, weaker front below the first. It corresponds to a similar effect, but the two sources are separated by 1.8 arcsec in the direction perpendicular to the scan, which is the limit at which two observations have independent data acquisition.
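The expected shape of the front can be sketched numerically. The snippet below is only an illustration, not the validation code: the function name and the sample separations are hypothetical, and only the 145 km s⁻¹ arcsec⁻¹ slope is taken from the text. Because Fig. 1 uses the squared separation as abscissa, a velocity difference that is linear in separation traces a parabola-like front.

```python
import numpy as np

# Slope quoted in the text: spurious RV offset per arcsecond of separation
# when one scan direction dominates.
SLOPE_KMS_PER_ARCSEC = 145.0


def spurious_rv_difference(separation_arcsec):
    """Predicted |RV difference| in km/s for a fully confused pair."""
    return SLOPE_KMS_PER_ARCSEC * np.asarray(separation_arcsec, dtype=float)


# Hypothetical separations, chosen only for illustration.
separations = np.array([0.5, 1.0, 2.0, 4.0])
sep_squared = separations**2            # abscissa used in Fig. 1
delta_rv = spurious_rv_difference(separations)

for s2, dv in zip(sep_squared, delta_rv):
    print(f"sep^2 = {s2:5.2f} arcsec^2 -> |dRV| = {dv:6.1f} km/s")
```

In the (separation², |∆RV|) plane the front therefore follows |∆RV| = 145 × sqrt(separation²).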
As a result, we have a population of false high radial velocities, but also sources with biased radial velocities that are not necessarily very high. Most of these problematic sources have been filtered out of the data released in Gaia DR3 based on the separation and magnitude difference of the pair for sources with |RV| > 200 km s⁻¹ (see Katz et al. 2022). This filtering causes the fronts to almost disappear (Fig. 1, bottom), although some hints of them still remain.
In the top panel of Fig. 1, a vertical band of sources with very large radial velocity differences at very small separations is visible as well. From the pairs with separations smaller than 1.6 arcsec (corresponding to 2.56 arcsec² in these plots) and with velocity differences above 500 km s⁻¹ or below −500 km s⁻¹ (enclosed within the red lines), we filtered out the members with a higher RV, which in general correspond to the faint member of the pair. This is a total of 57 stars.
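As a minimal sketch of this selection, assuming that "higher RV" is read as the larger absolute radial velocity (an interpretation, not stated explicitly in the text), the criterion could be written as follows. The function name and return convention are hypothetical.

```python
import numpy as np


def flag_close_pair_outliers(sep_arcsec, rv_a, rv_b,
                             max_sep=1.6, min_abs_diff=500.0):
    """Per pair, return the member to drop: 0 (a), 1 (b), or -1 (keep both)."""
    sep = np.asarray(sep_arcsec, dtype=float)
    rv_a = np.asarray(rv_a, dtype=float)
    rv_b = np.asarray(rv_b, dtype=float)

    # Thresholds from the text: separation < 1.6 arcsec, |dRV| > 500 km/s.
    suspicious = (sep < max_sep) & (np.abs(rv_a - rv_b) > min_abs_diff)
    # Assumption: the member with the larger |RV| is the spurious one.
    drop_b = np.abs(rv_b) > np.abs(rv_a)
    return np.where(suspicious, np.where(drop_b, 1, 0), -1)
```

Pairs wider than 1.6 arcsec are left untouched by this criterion; they are handled by the separate filter based on separation and magnitude difference.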
We also used the binary catalogue by El-Badry et al. (2021) to test the internal consistency of the radial velocities. More than 100 000 pairs with a probability of 90% of being bound according to this catalogue have a radial velocity for both members in Gaia DR3. The comparison of their radial velocities before filtering (Fig. 2, left) indicates that most of the pairs closely follow the one-to-one line (agreement of RV), but some sources have suspiciously high radial velocities, especially the secondary members. This sample also shows correlated velocity differences and separations in a plot similar to Fig. 1. Some problematic high radial velocities still remain in the catalogue after filtering (Fig. 2, right panel).
A total of 770 sources have |radial_velocity| > 600 km s⁻¹. We expect most of them to be real high-velocity stars, but due to the low signal-to-noise ratio of most of the spectra, it is difficult to know which fraction of the measurements is truly spurious. We can stack all of them, however, to improve the summed signal. All the spectra were corrected for radial velocity. If the RV value that was used was the correct value, the stacked spectra should have strong lines in the expected places such as the calcium triplet lines. This is shown in Fig. 3. However, if the radial velocity that was used for the correction was incorrect, the triplet lines are shifted and appear in the wrong place: for RV > 600 km s⁻¹, they are at least 1.7 nm to the left of the expected position, while for RV < −600 km s⁻¹, they appear at least 1.7 nm to the right (dashed lines). These secondary peaks are also seen in the figure. They are less sharp because the incorrect velocity corrections range from 600 to 900 km s⁻¹ (i.e. between 1.7 and 2.6 nm), so they do not all peak in the same place. In any case, the figure clearly shows that most measurements are good, that is, most of the sources are real high-velocity targets.
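The quoted 1.7 to 2.6 nm offsets follow directly from the Doppler relation ∆λ = λ v / c. The short check below is an illustration only; it uses the non-relativistic formula and approximate rest wavelengths of the Ca II triplet (849.8, 854.2, and 866.2 nm), which are not given in the text and are supplied here as an assumption.

```python
C_KM_S = 299_792.458  # speed of light in km/s

# Approximate rest wavelengths of the Ca II triplet in nm (assumed values).
CA_TRIPLET_NM = (849.8, 854.2, 866.2)


def doppler_shift_nm(rest_wavelength_nm, velocity_km_s):
    """Non-relativistic wavelength shift for a given radial velocity."""
    return rest_wavelength_nm * velocity_km_s / C_KM_S


for v in (600.0, 900.0):
    shifts = [doppler_shift_nm(lam, v) for lam in CA_TRIPLET_NM]
    print(f"v = {v:5.0f} km/s ->", [round(s, 2) for s in shifts], "nm")
```

At v/c of about 0.003 the relativistic correction is negligible compared to the width of the stacked features.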
Radial velocities are provided down to G_RVS = 14. However, a few of these sources are fainter than G = 16 (see Fig. 4). A high (G_RVS − G) could indicate contamination from nearby bright stars (affecting the RV estimation), but also (for the faintest G_RVS) an under-subtraction of the background. We recommend caution for radial velocities with (G_RVS − G) < −3.

Radial velocity systematics
Comparison with external catalogues shows that the zero-point offset of the radial velocities is smaller than 0.1 km s⁻¹ with respect to the radial velocity standard catalogue of Soubiran et al. (2018), Carmenes (Lafarga et al. 2020), and SIM (Makarov & Unwin 2015), but it is about −0.2 km s⁻¹ with respect to GALAH DR3 (Zwitter et al. 2021), APOGEE DR16 (Ahumada et al. 2020), and GES DR3 (Gilmore et al. 2012). The number of 5σ outliers is smaller than 3%. The radial velocity zero-point shows a decrease with metallicity in all surveys illustrated in Fig. 5. The global change in radial velocity with magnitude is not consistent across the surveys. However, Katz et al. (2022) used subsamples on which they found a consistent trend between APOGEE and GALAH and propose a magnitude-term correction. For stars with rv_template_teff > 8500 K, the correction derived in Blomme et al. (2022) is to be used instead.

Fig. 4. Uncertainty in radial velocity as a function of G.
The comparison of the median radial velocity with the GOG model does not show any significant systematic difference throughout the sky, at least none that can be attributed to the data themselves. Figure 6 shows the median value of the radial velocity throughout the whole sky and per magnitude bin for DR3, EDR3, and GOG20. From G = 4 to G = 15, the DR3 values agree with GOG20 at the level of 1.5 km s⁻¹ or lower. We note that it was at the level of 1 km s⁻¹ or lower with EDR3. The EDR3 values are not reliable at G > 13 because there are too few stars. This limit is pushed to G > 15 for DR3. The dependence on the G magnitude seen in the data is not predicted by the model and might indicate some systematics in the data or in the model that are not yet understood.

Radial velocity uncertainties
The different methods that were used to compute the radial velocity (indicated by rv_method_used, see Katz et al. 2022) lead to different error distributions as a function of magnitude (Fig. 4). In particular, the switch between the two methods occurs at G_RVS = 12, which produces the plume of large errors at G ∼ 12.
We tested the uncertainties on radial velocity given by radial_velocity_error using 48 944 stars in 804 open clusters in which at least ten members were brighter than G_RVS = 14  2021). These clusters are typically closer and more populated than average: 70% are located within 2 kpc of the Sun, and 50% of them have more than 140 identified members. We computed the difference ∆RV between the radial velocity of each star and the bulk cluster radial velocity (defined as the median of the RVS radial velocities), and we compared this value with the nominal uncertainty of each star. The results are shown in Fig. 7. In the ideal case, ∆RV/radial_velocity_error should follow a normal distribution (centred on zero and with a dispersion of 1), but Fig. 7 shows that bright stars (with high rv_expected_sig_to_noise and small radial_velocity_error) tend to have a much broader dispersion than faint stars. It should be noted that several effects can broaden the distribution: the gravitational redshift and convective blueshift, which affect stars of different spectral types differently, unrecognised binaries, non-members that are still present in the sample, and the intrinsic internal dispersion of the clusters. While some of these effects are difficult to quantify, the internal velocity dispersion is about 0.5-1 km s⁻¹ (see e.g. Torres et al. 2021). It is difficult to produce diagnostics on a per-cluster basis because the effect is revealed in a statistical way. However, the pattern seems to be identical for all clusters, and this is in favour of an instrumental effect.
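The test statistic described above can be sketched as follows. This is a simplified illustration with hypothetical function names: it ignores the intrinsic cluster dispersion and the other broadening effects listed in the text, and it uses a MAD-based dispersion estimate as one possible robust choice rather than the estimator actually used for Fig. 7.

```python
import numpy as np


def normalised_rv_residuals(rv, rv_error):
    """(RV - cluster median RV) / nominal uncertainty, per cluster member."""
    rv = np.asarray(rv, dtype=float)
    rv_error = np.asarray(rv_error, dtype=float)
    return (rv - np.median(rv)) / rv_error


def robust_dispersion(x):
    """Median absolute deviation scaled to a Gaussian sigma (factor 1.4826)."""
    x = np.asarray(x, dtype=float)
    return 1.4826 * np.median(np.abs(x - np.median(x)))
```

For well-estimated uncertainties, `robust_dispersion(normalised_rv_residuals(rv, rv_error))` should be close to 1; values well above 1 point to underestimated errors or to additional physical scatter.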
Comparison with external catalogues and with the wide binary catalogue of El-Badry et al. (2021) confirms that the errors are underestimated for G_RVS < 12, but also for T_eff < 4500 K and T_eff > 6000 K (Fig. 8). As the external catalogues provide different estimates of the error underestimation because of the uncertainties on their own error estimates, we used the wide binaries to estimate a correction. To limit the impact of the gravitational redshift, we selected pairs of stars with similar colours and magnitudes, within 0.1 mag in both G_BP − G_RP and G. To avoid the additional dependence on the temperature, we selected only systems in which both components lay within 4500 < rv_template_teff < 6000 K. We further removed 5σ radial velocity outliers. This led to a total of 2452 systems that could be used. Following the APOGEE and GALAH comparison, we split the fit into bright and faint regimes at G_RVS = 12 mag (which corresponds to the magnitude separation between the different methods that were used to derive the radial velocity) and fitted a second-order polynomial to the factor f_σ to apply to the standard deviation, by maximising the product of the likelihoods of the normalised velocity differences (Eq. 2) to be normally distributed. The coefficients we obtained are illustrated in Fig. 8 and are provided in Table 1. The bright side is not constrained, so that the fit should not be extrapolated to G_RVS < 8 mag. Based on the comparison with Soubiran et al. (2018), the value at G_RVS = 8 mag seems to be a good estimate for G_RVS < 8 mag. The T_eff ranges showing a strong departure in Fig. 8 correspond to systematic offsets between rv_template_teff and the GALAH temperature. For cool stars, the deviation with APOGEE is found only for T_eff < 4000 K.
The effect of the random temperature template mismatch is included in the correction provided in Table 1, but not the systematics, as we used an internal comparison. In the range 4500 < rv_template_teff < 6000 K, the median absolute deviation between rv_template_teff and the GALAH T_eff is 250 K.
When the correction of Table 1 was applied to wide binaries with a hotter or cooler T_eff template, no correlation of an additional factor with rv_template_teff was detected.
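A minimal sketch of how the Table 1 correction might be applied is given below. It assumes that the coefficients enter as f_σ = a + b G_RVS + c G_RVS², an ordering that is not spelled out in the surviving text but that makes the two regimes agree at G_RVS = 12 (both give f_σ ≈ 0.98). The function names are hypothetical.

```python
# Coefficients (a, b, c) from Table 1, assumed to enter as
# f_sigma = a + b * G_RVS + c * G_RVS**2.
BRIGHT = (0.318, 0.3884, -0.02778)   # G_RVS < 12
FAINT = (16.554, -2.4899, 0.09933)   # G_RVS > 12


def rv_error_factor(grvs_mag):
    """Multiplicative factor f_sigma to apply to radial_velocity_error."""
    # The bright end is unconstrained: hold the G_RVS = 8 value below 8 mag,
    # as suggested by the comparison with Soubiran et al. (2018).
    g = max(float(grvs_mag), 8.0)
    a, b, c = BRIGHT if g < 12.0 else FAINT
    return a + b * g + c * g * g


def corrected_rv_error(radial_velocity_error, grvs_mag):
    """Nominal uncertainty rescaled by f_sigma."""
    return rv_error_factor(grvs_mag) * radial_velocity_error
```

With these coefficients the factor is about 1.65 at G_RVS = 8 and about 1.16 at G_RVS = 14.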

Vbroad
The estimation of the line-broadening parameter, vbroad, is detailed in Fremat et al. (2022). The comparison of the spectral line-broadening parameter vbroad with external catalogues shows that values lower than ∼10 km s⁻¹ are systematically overestimated, while higher values tend to be underestimated for FGK stars, as illustrated in Fig. 9, which shows a similar behaviour as the comparison with GALAH (Zwitter et al. 2021). More details about the validation of the spectral line-broadening parameter can be found in Fremat et al. (2022).

Table 1. Coefficients of the polynomial fitted to f_σ.
Range         a         b          c
G_RVS < 12    0.318     0.3884     -0.02778
G_RVS > 12    16.554    -2.4899    0.09933

Fig. 8 (panels: f_σ versus G_RVS and f_σ versus temperature). In green we over-plot f_σ estimated from the wide binaries (Eq. 1 and Table 1).

Grvs magnitude
The estimation of the G_RVS magnitude, grvs_mag, is detailed in Sartoretti et al. (2022). The comparisons of G_RVS with the Hipparcos magnitude and Tycho2 colours indicate no saturation issues. The comparison of G_RVS with the Gaia G magnitude and G_BP − G_RP colour shows a change in behaviour at G_RVS > 12. To illustrate this, we used here a sample of solar metallicity dwarfs selected from APOGEE DR16 (Ahumada et al. 2020) that had low extinction (A_0 < 0.05 mag according to Lallement et al. 2019). An empirical robust spline regression was derived to model the global relation of G − G_RVS versus G_BP − G_RP. The residuals from this spline are plotted as a function of magnitude in Fig. 10. The effect appears to be much larger than the internal variations observed with the G, G_BP, and G_RP magnitudes (Fabricius et al. 2021), but is still only at the 10 mmag level.
Figure 11 shows the relative difference between DR3 and GOG20 in the G = 12 to 13 magnitude range. The agreement is very good, except in the bulge and Galactic plane, where the excess of simulated stars is large. This is expected as only sources with unblended spectra were used to estimate G_RVS (Sartoretti et al. 2022). Exploring these maps at other magnitude bins shows that the completeness for G_RVS measurements is still high at G = 14 outside of the Galactic plane, and at fainter magnitudes, the star counts start to drop. In the Galactic plane, the data already start to be incomplete at G = 11.

RVS spectra
The main properties of the RVS spectra, available through the rvs_mean_spectrum datalink table, are described in Seabroke et al. (2022). The sky distribution of the sources with spectra, presented in Fig. 12, is non-uniform. There are patches with a higher density of sources, and some regions are basically empty. More details about how this sample was selected are given in Seabroke et al. (2022).
The continuum calibration of the spectra was performed using different methods, which resulted in different continuum levels. For faint targets, G_RVS > 12 (or rv_method_used=2), the method set the median value of the flux at 1, which forces the continuum to be slightly above this value (see Fig. 13, top). For brighter targets (or rv_method_used=1), the continuum is slightly below 1 (see Fig. 13, middle). When the target is red, the

Spectrophotometry
Gaia DR3 provides low-resolution spectral data for about 220 million sources for the first time. These data consist of two sets of coefficients with the corresponding uncertainties and correlation matrices, available in the xp_continuous_mean_spectrum table through the datalink interface 5 . One set of coefficients is for the BP instrument, and the other set is for the RP instrument. The only exception is DR3 source_id = 5405570973190252288. This very red and faint source only has an RP spectrum. The coefficients are those of the expansion in basis functions of the internal spectrum, which is expressed in units of electrons per second per pseudo-wavelength within the Gaia aperture as a function of pseudo-wavelength (De Angeli et al. 2022). Externally calibrated spectra can be obtained through the GaiaXPy tool 6 (see also the cosmos pages 7 for the configuration files that allow producing these externally calibrated spectra). For a subset of the spectra with G < 15, these externally calibrated sampled spectra are available directly in the xp_sampled_mean_spectrum table through the datalink interface. However, the direct usage of the coefficients is strongly recommended (De Angeli et al. 2022). GaiaXPy can also be used to transform an external spectrum into the Gaia XP (shortcut for BP and/or RP) continuous representation. A detailed description of the data is provided by De Angeli et al.
Figure 14 shows the sky density distribution for all sources with xp_continuous_mean_spectrum in Gaia DR3. In addition to the natural variation in source density, several distinct regions with a lower source density are seen. The natural variation includes high densities along the Galactic plane and in particular towards the Galactic centre, and decreasing densities towards the Galactic poles. The artificial low-density patterns in the sky distribution of the sources result from the selection process of spectra to be published in Gaia DR3, in particular, the requirement of at least 15 observations. Figure 15 shows the distribution of the sources with XP spectra in the colour-apparent magnitude diagram. The natural increase in the number of sources at fainter apparent magnitudes is clearly visible. In addition to this, artificial structures are superimposed. For a G magnitude brighter than 17.65, all available XP spectra are included in Gaia DR3, while for fainter sources, only a subset is included, with a focus on red sources. This results in the break in the distribution at G = 17.65, and in the generally smaller number of sources at fainter magnitudes, with a larger proportion of red sources. A detailed description of the selection process is provided by De Angeli et al. (2022).

Tests of source coefficients
The first test we performed on the coefficients of the XP spectra determined the stability of the representation of internal spectra. The internal BP and RP spectra are represented by a linear combination of basis functions, and the integrated flux of a source is thus a linear combination of the integrals of the basis functions over the entire real axis. The absolute values of these integrals are about one, therefore we might expect the absolute values of the coefficients of the spectrum of any particular source to not be significantly higher than the integrated flux of the source. Coefficients that are very large compared to the integrated flux would indicate that the source spectrum is a linear combination of basis functions that mostly cancel each other out, and would thus be an indicator of an unstable representation of the internal spectrum. We compared the absolute values of all coefficients of all sources with the integrated flux, and they are all lower than 3.8 times the integrated flux. In most cases, the values are significantly lower. We therefore see no indications for excessively large contributions from different basis functions to internal spectra that are cancelling each other out.
The basis functions used to represent BP and RP spectra are constructed such that they are efficient in representing typical stellar spectra (Carrasco et al. 2021; De Angeli et al. 2022). As a consequence, the broad structure of a spectrum is represented by the low-order basis functions, and detailed spectral patterns are represented by higher-order basis functions. The absolute values of the XP source coefficients should therefore decrease in general with the order of the coefficient. Figure 16 shows the distributions of the BP and RP coefficients, normalised with respect to the L2 norm, for all sources in Gaia DR3. In both instruments, most coefficients of the majority of XP spectra are close to zero. To study this further, we compared the sum of absolute values for the first five coefficients with the sum of the remaining higher-order coefficients. We computed the difference between the first and the second sum, with the uncertainty on the difference, and considered sources for which the difference was smaller than five times the error. Of all the sources with XP spectra in Gaia DR3, 26037 sources have BP and 5470 sources have RP spectra that fulfil this criterion. The majority of these sources are concentrated in the Galactic plane and towards the direction of the Galactic centre. These sources may therefore be affected by crowding, resulting in a contamination of the spectra by flux from nearby sources and thus unexpected spectral shapes that require an unusual combination of basis functions. The larger number of BP sources as compared to RP may result from the larger number of faint sources in BP.
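The criterion described above can be sketched as follows. This is a simplified illustration with a hypothetical function name: it treats the coefficient errors as uncorrelated when propagating the uncertainty on the difference, whereas the published data include full correlation matrices.

```python
import numpy as np


def has_unusual_coefficient_decay(coeffs, coeff_errors, n_low=5, n_sigma=5.0):
    """True if low-order coefficients do not clearly dominate the high-order ones."""
    coeffs = np.asarray(coeffs, dtype=float)
    errs = np.asarray(coeff_errors, dtype=float)

    low = np.sum(np.abs(coeffs[:n_low]))     # first five coefficients
    high = np.sum(np.abs(coeffs[n_low:]))    # remaining higher-order terms
    # Assumption: uncorrelated errors, so the variances simply add.
    sigma = np.sqrt(np.sum(errs**2))
    return bool((low - high) < n_sigma * sigma)
```

A source is flagged when the difference between the two sums is smaller than five times its uncertainty, which also covers the case where the high-order sum exceeds the low-order one.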
The xp_summary table contains two parameters specifying the number of relevant coefficients for each source in BP and RP, respectively (bp_n_relevant_bases and rp_n_relevant_bases). All coefficients with indices larger than the specified number are considered to be consistent with being zero. Figure 17 shows the histogram of the number of relevant bases for BP and RP.
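Truncating a spectrum at the number of relevant bases then amounts to setting the higher-order coefficients to zero. The helper below is a hypothetical sketch of that operation on a plain coefficient array, assuming one-based coefficient indices so that the first n coefficients are kept; it is not the GaiaXPy implementation.

```python
import numpy as np


def truncate_coefficients(coeffs, n_relevant_bases):
    """Return a copy with all coefficients beyond the first n set to zero."""
    truncated = np.array(coeffs, dtype=float)  # np.array makes a copy
    truncated[n_relevant_bases:] = 0.0
    return truncated
```

For example, `truncate_coefficients(bp_coeffs, bp_n_relevant_bases)` would keep only the BP coefficients regarded as significant, where `bp_coeffs` stands for the BP coefficient vector of one source.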
No source has zero or 54 relevant coefficients. In the first case, this is because 55 coefficients are assigned as relevant if no coefficients are found to be relevant, in which case the spectrum is consistent with random noise alone. The lack of 54 relevant coefficients arises because only one last coefficient

Tests of the spectral shape
Due to the lower instrumental response at its edges, the flux values of the sampled internal spectra should be lower in the outer samples than in the central samples for sources with significant flux. These regions with low fluxes at either side are referred to as the wings of the spectra. To evaluate the behaviour of the spectra at the wings at either side, we integrated the fluxes over the pseudo-wavelength ranges [−∞, 0] and [0, 5] and compared them to the integrated flux over the interval [5, 10]. Analogously, on the other side of the spectra, the integrals over the pseudo-wavelength intervals [60, ∞] and [55, 60] were compared with the integral over the interval [50, 55]. For the comparison, the difference between the integrals was computed and normalised with respect to its uncertainty. The four normalised differences are smaller than five for 204 to 796 RP sources and for 88 to 7411 BP sources. As was already the case for the test of the decreasing coefficients, a small part of these sources is homogeneously distributed in the sky, while the majority is concentrated in the Galactic plane and in the direction of the Galactic centre. This indicates crowding and the resulting contamination of the XP spectra with flux from nearby stars as a reason for the non-decreasing spectral wings. The larger number of BP spectra in this test that do not meet the threshold as compared to RP spectra may be a result of the larger number of faint spectra in BP. Figure 18 shows the distribution of sources with normalised differences larger than five when all XP coefficients are used and when the representation is truncated at xp_n_relevant_bases. The truncation results in an increase in the number of sources above the chosen threshold, in particular, for very red sources. A possible reason might be that the truncation results in an underestimated error.
Noise may cause parts of the spectrum to be negative, in particular, for faint sources. In order to determine the number of negative values in the sampled XP spectra, we defined the negativity z of a spectrum. This measure is zero if the sampled XP spectrum is positive at all values of the pseudo-wavelength u, and one if it is negative at all values of u. Figure 19 shows the distribution of sources as a function of the L1 norm of the spectrum and the value of z for BP and RP. The majority of sources follows a general trend of low negativity for large L1 norms and increasing negativity and a wider spread in the distribution as the norm decreases. The latter case corresponds to faint sources with increasing negativity due to noise. The more pronounced tail of sources with small L1 norms in BP results from the larger number of faint BP spectra as compared to RP. Only a small fraction of sources clearly lies beyond the general relation between the L1 norm and z. These outliers in general result from an over-subtraction of the background in the spectra, shifting its overall flux level towards negative values.
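The defining equation of z did not survive in this copy of the text. As an illustration only, the sketch below uses one simple measure that has the stated limits (zero for an all-positive spectrum, one for an all-negative one), namely z = (1 − Σf / Σ|f|) / 2 on a uniformly sampled spectrum. This form is an assumption and may differ from the definition actually used.

```python
import numpy as np


def negativity(flux):
    """Illustrative negativity: 0.5 * (1 - sum(f) / sum(|f|))."""
    flux = np.asarray(flux, dtype=float)
    l1_norm = np.sum(np.abs(flux))   # discrete L1 norm of the sampled spectrum
    if l1_norm == 0.0:
        raise ValueError("spectrum is identically zero")
    return float(0.5 * (1.0 - np.sum(flux) / l1_norm))
```

With this form, z equals the fraction of the L1 norm carried by negative samples, so a spectrum with flux values (3, −1) has z = 0.25.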

Wiggling patterns
We tested whether truncation efficiently removes unnecessary wiggling patterns in the XP spectra. Here we considered 6377 main-sequence star members (Cantat-Gaudin et al. 2020) of 17 open clusters 8 . By considering only the members of these open clusters, we ensured that our testing sample is composed of stars with metallicities similar to the solar value. We employed XP spectra that were externally calibrated by GaiaXPy with the default constant wavelength step used for the xp_sampled_mean_spectrum table.
First, we defined a coefficient that measures the wiggling level in XP spectra. For each ith wavelength sample of the spectrum, we calculated a quantity δ_i^n, where f_i is the flux associated with the ith wavelength sample. The wiggling coefficient is then defined as the average of the δ_i^n across the entire spectrum or a portion of it,
$w_n = \frac{1}{N} \sum_{i} \delta_i^n$, (4)
where N is the number of wavelengths over which δ_i^n is determined.
The wiggling coefficient w_3 was calculated for each star within the spectral range [450, 900] nm. This coefficient is higher when the spectra contain more undesired wiggles, but also for later spectral types, whose spectra typically contain more molecular bands. In order to ensure that we probed the wiggling and not real spectral features, we therefore defined a differential coefficient
$\Delta w_3 = \log_{10}(w_3^{j}) - \log_{10}(\overline{w}_3)$, (5)
where w_3^j is the coefficient defined in Eq. 4 and measured for the jth star, while \overline{w}_3 is the coefficient measured on the average spectrum calculated over a sample of 100 dwarf stars with |phot_g_mean_mag − phot_g_mean_mag_j| < 0.01 mag and |bp_rp − bp_rp_j| < 0.005. By averaging the spectra of stars with very similar phot_g_mean_mag and bp_rp, we obtained a single spectrum that contained the typical absorption features that can be found in spectra of stars similar to the jth star and that was cleaned from wiggles. Therefore, ∆w_3 is truly representative of the actual wiggling shown by the jth spectrum, without the contribution from molecular bands or any other spectral feature.
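The procedure can be sketched in code under one explicit assumption: the per-sample quantity δ_i^n is not recoverable from this copy of the text, so the example substitutes a hypothetical stand-in, the relative deviation of sample i from the mean of the samples n steps to either side. Only the averaging over samples and the logarithmic difference with respect to the mean comparison spectrum follow the description above.

```python
import numpy as np


def wiggling_coefficient(flux, n=3):
    """Average of a stand-in delta_i^n over the spectrum (assumed definition)."""
    flux = np.asarray(flux, dtype=float)
    centre = flux[n:-n]                                   # samples i
    neighbours = 0.5 * (flux[:-2 * n] + flux[2 * n:])     # mean of i-n and i+n
    delta = np.abs(centre - neighbours) / np.abs(centre)  # hypothetical delta_i^n
    return float(np.mean(delta))


def differential_wiggling(star_flux, reference_fluxes, n=3):
    """log10(w_n of the star) - log10(w_n of the mean comparison spectrum)."""
    mean_reference = np.mean(np.asarray(reference_fluxes, dtype=float), axis=0)
    return (np.log10(wiggling_coefficient(star_flux, n))
            - np.log10(wiggling_coefficient(mean_reference, n)))
```

Here `reference_fluxes` stands for the spectra of the comparison dwarfs sampled on the same wavelength grid as the star. Averaging them before measuring w_n suppresses uncorrelated wiggles while keeping genuine spectral features common to that spectral type.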
Figure 20 shows the cumulative histograms of the differential wiggling coefficients ∆w_3 derived for stars in different bins of phot_g_mean_mag. The coefficients derived from non-truncated spectra are plotted in the left panel, and those from truncated spectra are shown in the right panel. While fainter stars tend to have larger ∆w_3 in their non-truncated spectra due to the lower signal-to-noise ratio, this dependence is significantly reduced by truncation.
Wiggling in XP spectra might be enhanced by strong spectral features. It is especially important to test this possibility in spectra of young accretors, which are typically characterised by strong Hα emission lines. Therefore, we used XP spectra from 197 members of the star-forming regions Chamaeleon I, IC 348, Lupus, NGC 2024, NGC 2068, ONC, Ophiuchus, and R Coronae Australis. We defined a coefficient that measures the height of the Hα line,
$H_{\mathrm{H}\alpha} = f_{\mathrm{H}\alpha} / \langle f \rangle$,
where f_Hα is the flux measured at 656 nm, corresponding to the centre of the Hα line, while the mean flux ⟨f⟩ in the denominator is measured at the base of the Hα line, that is, at all wavelengths within 626-636 nm and within 676-686 nm.
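As an illustration, the Hα height coefficient can be computed from a sampled, externally calibrated spectrum as sketched below. The function name is hypothetical, and the flux at the line centre is taken from the wavelength sample closest to 656 nm, which is a simplification for grids that do not contain 656 nm exactly.

```python
import numpy as np


def halpha_height(wavelength_nm, flux):
    """Flux at 656 nm divided by the mean flux at the base of the line."""
    wl = np.asarray(wavelength_nm, dtype=float)
    flux = np.asarray(flux, dtype=float)

    # Sample closest to the centre of the H-alpha line.
    f_line = flux[np.argmin(np.abs(wl - 656.0))]
    # Base of the line: 626-636 nm and 676-686 nm, as stated in the text.
    base = ((wl >= 626.0) & (wl <= 636.0)) | ((wl >= 676.0) & (wl <= 686.0))
    return float(f_line / np.mean(flux[base]))
```

A featureless flat spectrum gives a height of 1, so values above 1 indicate emission relative to the neighbouring continuum.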
The differential wiggling coefficient ∆w_10 (see Eqs. 4 and 5) is shown in Fig. 21 (left panel) as a function of H_Hα for non-truncated spectra. Each error bar represents the standard deviation of the coefficient w_10 measured on the comparison sample of 100 dwarfs. The plot shows that wiggling increases with the height of the Hα line. Therefore, we conclude that the presence of strong spectral features enhances wiggling in XP spectra.
In order to test whether truncation is able to fix or alleviate the problem, we repeated the experiment on truncated XP spectra. The results are shown in the right panel of Fig. 21, which indicates that truncation does not significantly remove the additional wiggling produced by strong Hα lines. Instead, we found that the height of the Hα line is affected by truncation. We observe that for 23% of the stars with H_Hα > 1.1 in the non-truncated spectra, H_Hα is reduced by more than 5% by truncation.

Tests of the integrated fluxes
The XP integrated photometry and the spectra follow different calibration procedures that only have low-level processing steps in common. Although some differences might occur between the integrated fluxes from the XP spectra and the integrated photometry, mainly because of potential differences in passband calibration and noise, we expect the two processes to give comparable results in principle. In order to test this, we computed the ratio of the photometric flux to the flux integrated from the spectrum.
The distribution of this ratio shows that most sources have a value close to one (Fig. 22). For BP, however, there is a significant population of sources with values higher than one. This might be a result of a threshold of one electron/s that was applied in the selection of transits in the integrated photometry. Transits with a flux below this threshold were excluded from the computation of the mean flux, resulting in a biased mean flux for faint sources (Riello et al. 2021). This threshold was not applied in the computation of the mean spectra, thus avoiding the bias towards too high fluxes and leading to a better behaviour at low BP fluxes for the integrated flux from the spectra.
We compared the flux uncertainties derived from the photometric fluxes with those derived from the spectra. Although these flux uncertainties are similar, those derived from the photometric calibration tend to be slightly larger than those derived from the spectra. The ratio of the uncertainty on the XP photometric fluxes to the uncertainty on the fluxes obtained by integrating the spectra is shown in Fig. 23 for BP and RP as a function of BP and RP photometric magnitude. The shift towards larger photometric errors is clear, together with a dependence on the source magnitude. This behaviour might result from underestimated uncertainties, in particular for low-order coefficients in the source representation (De Angeli et al. 2022), to which the integration of the spectra is particularly sensitive. The distribution of the uncertainty ratio has strong tails towards extreme values. The photometric uncertainties of hundreds of sources are 100-1000 times larger than those derived from the spectra.

Uncertainties of the XP coefficients
The analysis that we performed on the coefficients of the XP spectra tested whether their uncertainties were evaluated correctly. To do this, we compared pairs of stars using a chi-square,
$$\chi^2 = (X_1 - X_2)^{T} \, (C_1 + C_2)^{-1} \, (X_1 - X_2), \qquad (7)$$
where X 1 and X 2 are the coefficients of the two stars in either the BP or RP channel, while C 1 and C 2 are the associated covariance matrices. In order to ensure that we compared stars with the same metallicity and reddening, we applied Eq. 7 only to pairs of stars belonging to the same open cluster (i.e. membership probability ≥ 0.7 from Cantat-Gaudin et al. 2020). We also excluded all stars with ruwe > 1.4 from the comparison (to remove binaries) and stars belonging to open clusters younger than 100 Myr. This latter selection was necessary to avoid contamination due to differential extinction. Furthermore, the two stars must have G magnitudes and G BP − G RP colours that were consistent within their uncertainties. By applying all these selection criteria, we obtained a controlled sample of 1560 stellar pairs whose χ 2 values we were able to derive.
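The pair comparison can be sketched as follows, assuming that the chi-square is the quadratic form of the coefficient difference with the inverse of the summed covariance matrices, as the definitions of X 1, X 2, C 1, and C 2 above suggest. The function names are hypothetical, and a plain Gaussian elimination replaces whatever linear algebra routine was actually used.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-300:
            raise ValueError("singular matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def pair_chi2(x1, c1, x2, c2):
    """Return (x1 - x2)^T (c1 + c2)^-1 (x1 - x2) for one pair of stars."""
    d = [u - v for u, v in zip(x1, x2)]
    csum = [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(c1, c2)]
    y = solve_linear(csum, d)  # y = (c1 + c2)^-1 d, without explicit inverse
    return sum(di * yi for di, yi in zip(d, y))


# Toy example with two coefficients and correlated errors.
cov = [[1.0, 0.5], [0.5, 1.0]]
chi2 = pair_chi2([1.0, 1.0], cov, [0.0, 0.0], cov)
```

Solving the linear system instead of inverting the summed covariance matrix is a design choice of this sketch; it gives the same quadratic form.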
The χ 2 values were then used to calculate the associated p-values (the null hypothesis being that χ 2 follows the expected chi-square distribution, the number of degrees of freedom being the number of coefficients) separately for the BP and RP channels. If the χ 2 indeed followed a chi-square distribution, the p-values should be distributed uniformly. The cumulative histograms of the p-values are shown in Fig. 24.
As a second test, we applied a more stringent criterion in the selection of the pairs that were to be tested. Specifically, we imposed that the G, G BP and G RP magnitudes of the two stars must be consistent within their uncertainties. This reduced our sample to 501 pairs. The corresponding p-value distributions are plotted as orange lines in Fig. 24: 25% and 26% of these pairs fail our test in BP and RP, respectively. When we applied the further condition that the magnitudes G 1 and G 2 must be fainter than 16 mag, we further reduced the sample to 437 pairs. The difference between this new sample, which is plotted as green lines in Fig. 24, and the previous sample is too small to observe significant effects in the p-value distribution. The fraction of pairs that fail our test decreased to 22% and 24% in BP and RP, respectively.
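The conversion from χ 2 to p-values and a simple check of their uniformity can be sketched as below. The function names are hypothetical. The survival function uses the standard recurrence of the regularised incomplete gamma function, and the uniformity measure is the usual Kolmogorov-Smirnov distance to the uniform distribution; the text does not state which criterion defines a failed test, so none is coded here.

```python
import math


def chi2_sf(x, k):
    """Survival function (p-value) of a chi-square with k degrees of freedom."""
    if x <= 0.0:
        return 1.0
    z = x / 2.0
    if k % 2:                      # odd k: start from k = 1
        a, q = 0.5, math.erfc(math.sqrt(z))
    else:                          # even k: start from k = 2
        a, q = 1.0, math.exp(-z)
    # Recurrence Q(a + 1, z) = Q(a, z) + z^a e^-z / Gamma(a + 1), up to a = k/2.
    while a < k / 2.0:
        q += math.exp(a * math.log(z) - z - math.lgamma(a + 1.0))
        a += 1.0
    return q


def ks_distance_from_uniform(p_values):
    """Largest gap between the empirical CDF of the p-values and y = x."""
    p = sorted(p_values)
    n = len(p)
    return max(max((i + 1) / n - v, v - i / n) for i, v in enumerate(p))


# Toy example: p-values of three chi-square values with 4 degrees of freedom.
p_vals = [chi2_sf(c, 4) for c in (1.0, 4.0, 9.0)]
distance = ks_distance_from_uniform(p_vals)
```

If the uncertainties were estimated correctly, the distance would shrink towards zero as the number of pairs grows; an excess of small p-values points to underestimated covariances.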
Finally, we studied the p-value distributions obtained from the pairs composed of stars with the same number of bp_n_relevant_bases and rp_n_relevant_bases. In this way, we compared spectra with similar wiggling levels. Applying these criteria for the two bands separately, we obtained a sample of 148 and 109 pairs for BP and RP, respectively. The cumulative distributions are plotted in Fig. 24 as red lines. The fraction of pairs that do not pass the test decreases to 21% for BP and 15% for RP, but it is still significant.
In order to estimate how much the errors are underestimated, we multiplied the covariance matrix by various factors and then repeated the experiment on the 437 pairs that are fainter than G=16 mag. The resulting p-value distributions are shown in Fig. 25. The figure shows that the variances are underestimated by a factor that is between 1.2 and 1.5.
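The effect of this rescaling on the test statistic can be written explicitly. Assuming that the same factor f is applied to the covariance matrices C 1 and C 2 of both stars (the symbol f is introduced here for illustration only), the chi-square of each pair is simply divided by f:

```latex
\chi^2_f
  = (X_1 - X_2)^{T} \, (f\,C_1 + f\,C_2)^{-1} \, (X_1 - X_2)
  = \frac{1}{f}\,(X_1 - X_2)^{T} \, (C_1 + C_2)^{-1} \, (X_1 - X_2)
  = \frac{\chi^2}{f}.
```

Under this assumption, a variance inflation of f = 1.2 to 1.5 corresponds to uncertainties (standard deviations) that are underestimated by a factor of about 1.1 to 1.2.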
A more detailed study of the coefficient error underestimation is presented in De Angeli et al. (2022), who divided the data into two groups of transits for the same source and compared the obtained values. They show that the error underestimation depends on the coefficients. The lower-order coefficients lead to the highest underestimation.

Comparison with external spectra
Figure 26 shows the median flux difference, normalised by the errors, between the XP sampled and the CALSPEC spectra (Bohlin et al. 2014) for the sources in common. A dip at ∼600 nm is visible. Figure 27 presents the median normalised flux difference within 560 < λ < 620 nm with both CALSPEC and NGSL (Heap & Lindler 2016) as a function of magnitude. It shows that the strength of this dip is magnitude dependent and has a saturation effect. Figure 26 seems to suggest a difference in flux level between BP and RP, but it is not statistically significant in the CALSPEC or the NGSL sample. However, when the MILES library is used (Falcón-Barroso et al. 2011) and the MILES spectra are normalised to the absolute flux of the XP spectra in the common wavelength range, this difference in flux level becomes significant. The bluest wavelengths show a colour-dependent trend that is illustrated in Fig. 28. See also Montegriffo et al. (2022) for a discussion of these features.
Fig. 25. Same as Fig. 24 for the 437 pairs that are fainter than G=16 mag, but have a covariance matrix (Cov) of one to three times its original value for the two bands BP (left) and RP (right).

Stellar astrophysical parameters
An overview of the Gaia DR3 astrophysical products produced by 13 different modules is presented in Creevey (2022). The non-stellar content part of the astrophysical parameters is discussed in Sect. 5. This section focuses on the stellar content, which is presented in detail in Fouesneau (2022).
Fig. 27. Median flux difference, normalised by the errors, within 560 < λ < 620 nm between the XP sampled spectra and the CALSPEC (black dots) or NGSL (grey dots) spectra as a function of G magnitude.
Fig. 28. Median flux difference, normalised by the errors, within λ < 350 nm between the XP sampled spectra and the CALSPEC (black dots) or NGSL (grey dots) spectra as a function of G BP − G RP colour.
The astrophysical parameters are available in two tables: astrophysical_parameters and astrophysical_parameters_supp. We present here only the validation results of the main parameters. In particular, the specialised modules (ESP; Creevey 2022) are barely discussed here. The Outlier Analysis tables (oa_neuron_information, oa_neuron_xp_spectra) are not discussed here either. They were successfully checked for internal but not external consistency. Tests on the ESP and OA modules can be found in Fouesneau (2022). GSP-Phot parameters (Andrae 2022) were derived using several spectral libraries (MARCS, PHOENIX, A, and OB). The values obtained with these different libraries are presented in astrophysical_parameters_supp, while astrophysical_parameters contains the values that were obtained with the library that was selected as the best one, which is indicated in the field libname_gspphot. We mainly discuss the best library results of GSP-Phot here. In this section, we compare the parameters with the Galaxy model, using GUMS as a reference. In contrast to GOG, GUMS contains most of the astrophysical parameters, but they are error free.

DSC
The development of the discrete source classifier (DSC) was mainly driven by extragalactic source completeness (see the on-line documentation and Delchambre 2022). The purity for QSOs and galaxies is discussed in Sect. 5. We did not find correlations between the classprob_dsc_binarystar probabilities and the Multiple Source Classifier (MSC) results or known binaries. White dwarfs are also often confused with hot main-sequence stars. We therefore advise against using the physical binary and white dwarf class probabilities.

Extinction
The extinction is provided as the monochromatic extinction A 0 at 541.4 nm by GSP-Phot (azero_gspphot), the hot star module ESP-HS (azero_esphs), and the multiple source module MSC (azero_msc). GSP-Phot and ESP-HS also provide A G and E(G BP − G RP ).
The well-known and expected temperature-extinction degeneracy is discussed in Andrae (2022), as is the effect of imposing the extinction to be positive on the mean of low-extinction regions. Andrae (2022) also showed that the GSP-Phot extinction values azero_gspphot are globally overestimated with a saturation at 10 mag (by construction) in their comparison with Bayestar19 (Green et al. 2019). We find the same trend in our comparison with the monochromatic extinction A 0 at 550 nm derived from APOGEE, Gaia, and 2MASS by Lallement et al. (2018) [...] expected deviation for large extinctions between A V and A 0 (see Sect. 11.2.3.1.4 of the on-line documentation). We further confirmed this overestimation of the GSP-Phot extinction values with the bstep extinctions provided with GALAH DR3 (Buder et al. 2021) and the Lallement et al. (2019) 3D extinction map. It is also confirmed with clusters in Fouesneau (2022). For nearby stars, the extinction stays low because of the ad hoc extinction prior (Andrae 2022). However, at high Galactic latitudes, 22% of the stars have azero_gspphot_lower > 0.16, the highest extinction value expected according to the map of Schlegel et al. (1998). As illustrated in Fig. 30, high extinction values occur at the bottom of the main sequence and for red giants (which can be confused with extincted hot stars), but also for some stars near G BP − G RP ∼0.2 and ∼2, which also has an impact on the temperature that is estimated for these stars.
Figure caption (beginning not recovered): [...] 2019)) with a parallax relative precision lower than 10%, colour-coded with the mean extinction azero_gspphot. The colour is saturated as black for values higher than 1 mag. In this low extinction sample, M G is simply G + 5 + 5 log(ϖ/1000).
The global overestimation of GSP-Phot extinction values naturally leads to an overestimation of the total Galactic extinctions provided in the table total_galactic_extinction_map_opt. This is shown in the comparison with Planck (Planck Collaboration et al. 2016) in Fig. 31. As a0_uncertainty provides the error on the mean and can become very small when the number of stars becomes high, we recommend using a0_uncertainty × √num_tracers_used instead. While the overall appearance of the Galactic extinction map is as expected (large-scale dust filaments are clearly visible in the approximate expected relative intensity; see Delchambre 2022), the A 0 estimates are systematically overestimated in a large region of about 20 deg around the Galactic centre. At high Galactic latitudes, the uncertainties are large enough for the A 0 overestimation not to be visible in Fig. 31. For the multiple source classifier (MSC), 37% of the sources in the nearby star sample (ϖ > 20 mas), for which no significant extinction is expected, have a lower extinction value azero_msc_lower > 0.05. When compared to Traven et al. (2020), the MSC extinction is also globally overestimated. We advise that the GSP-Phot extinctions be preferred even for binary stars. For the hot-star module, the extinction azero_esphs can reach very high values for white dwarfs that are treated as hot main-sequence stars. We recommend filtering them out using a colour-absolute magnitude diagram.
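The recommended rescaling of the map uncertainty amounts to converting an error on the mean back into a per-tracer scatter. A minimal sketch, in which the function name is hypothetical and the two arguments stand for the columns a0_uncertainty and num_tracers_used:

```python
import math


def a0_scatter(a0_uncertainty, num_tracers_used):
    """Return a0_uncertainty * sqrt(num_tracers_used) for one map pixel."""
    if num_tracers_used < 1:
        raise ValueError("num_tracers_used must be at least 1")
    return a0_uncertainty * math.sqrt(num_tracers_used)


# A pixel with 100 tracers and an error on the mean of 0.01 mag.
scatter = a0_scatter(0.01, 100)
```

For a pixel with many tracers, the rescaled value is much larger than the quoted error on the mean and is less likely to suggest a spurious significance of the A 0 estimate.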
Another estimate of the extinction is provided through the diffuse interstellar band (DIB) at 862 nm that is present in the RVS spectra: dibew_gspspec. The details of the measurement are presented in Recio-Blanco (2022), and details of its performance are reported in Schultheis (2022). The DIB equivalent width correlates well with the extinction (Schultheis 2022). The large number of outliers in wavelength seen with wide binaries (Appendix A) is due to wavelength clusters around 862.5 and 861.8 nm, most of which are removed when only sources are kept that have dibqf_gspspec < 2. This selection criterion is globally recommended for the DIB parameters (Schultheis 2022).

Teff, logg, metallicity, and abundances
The comparison of the astrophysical parameters with the GUMS model is satisfactory within the model uncertainties outside the Galactic plane. The exception is the GSP-Phot metallicity, which follows extinction patterns, as expected from the extinction-temperature degeneracy.
Stars with 3000 < T eff < 8000 K are analysed by GSP-Phot using the MARCS and PHOENIX spectral libraries. Comparing the values of T eff , we find a median difference T eff (MARCS − PHOENIX) = −63 K (median absolute deviation, MAD = 145 K) due to the different temperature scales. T eff estimated using the best library compilation presents nonphysical clustering of points on the Hertzsprung-Russell (HR) diagram due to edge effects at the library borders. This is most evident for the OB library border at 15000 K.
Figure 32 shows the comparison of the atmospheric parameters for the sources derived by both GSP-Phot (Andrae 2022) and GSP-Spec (Recio-Blanco 2022). The agreement between the two methods is reasonable for the temperature, but there is an offset in the surface gravity for small log g and a very large dispersion for the global metallicity.
Figure 33 shows the comparison of the atmospheric parameters derived by GSP-Phot and GSP-Spec with APOGEE DR16 (Ahumada et al. 2020). The plots for GALAH DR3 look similar. The large dispersion of the GSP-Phot metallicity explains Fig. 32. The GSP-Spec T eff is slightly overestimated for T eff > 5500 K versus APOGEE and GALAH, while GSP-Phot is not. GSP-Phot log g is overestimated for small log g. GSP-Spec log g is globally underestimated, and a correction is proposed in Recio-Blanco (2022). GSP-Spec median offsets and dispersion versus external catalogues after the proposed corrections are quite good and are provided in Recio-Blanco (2022). We also caution the user against using GSP-Spec log g values for AGB stars. All the GSP-Spec parameters are found to be correlated with magnitude and metallicity. Figure 35 illustrates this correlation for teff_gspspec using APOGEE DR16. The plot is similar to that for GALAH DR3. This correlation with magnitude and metallicity leads to other unexpected correlations with extinction or sky position that are shown in the comparison with the GUMS model.
Fig. 32. Density plot of the comparison of the temperature (left), surface gravity (middle), and global metallicity (right) provided by GSP-Phot (y-axis) and GSP-Spec (x-axis). GSP-Phot has been filtered with parallax_over_error>5 and teff_gspphot<10000. GSP-Spec parameters have been filtered with flags_gspspec[1,4,8,13]=0 for T eff , flags_gspspec[2,5,8,13]=0 for log g, and flags_gspspec[3,6,8]=0 for [M/H]. The dashed green line shows the one-to-one correspondence. The median absolute deviation is indicated in each panel.
We also tested the correlation of the abundances with magnitude, temperature, log g, and logchisq_gspspec using open clusters and present it for NGC 7789 in Fig. 36. The plots show clear positive trends between mh_gspspec and the two stellar parameters teff_gspspec and logg_gspspec. This correlation is similar to that obtained with magnitude, as expected for a cluster in which temperature and gravity are correlated with magnitude. It is also consistent with what is seen in the comparison with external catalogues (Fig. 35). For alphafe_gspspec, we observe correlations of the opposite sign. The [M/H] and [α/Fe] calibrations proposed in Recio-Blanco (2022) alleviate these trends, but they do not remove them completely.
[Figure residue: teff_gspspec − T eff (APOGEE) as a function of G RVS ; caption not recovered.]
The GSP-Spec abundances do not correlate very well with external catalogue values in general (Recio-Blanco 2022). However, calibration formulas are proposed in Recio-Blanco (2022), and Gaia Collaboration & Recio-Blanco (2022) showed that they allow retrieving the expected chemo-kinematical correlations in the disc.
GSP-Spec ANN metallicities (available in astrophysical_parameters_supp) have underestimated uncertainties (Table A.1) and are offset by ∼-0.2 with respect to APOGEE DR16.See Recio-Blanco (2022) for a proposed calibration of the GSP-Spec ANN parameters as well.
The MSC metallicity and gravity are overestimated compared to those obtained with GALAH (Traven et al. 2020). Hot stars are found to be assigned a temperature of about 7500 K due to the empirical calibration based on APOGEE. Poor convergence of the MSC values can be identified through low logposterior_msc values. The test using wide binaries (Table A.1) indicates a low number of outliers for the metallicity, but the errors are so large that most of the possible values are covered. We advise using the MSC parameters with caution in general (see also Fouesneau 2022 and the on-line documentation).
Tests using wide binaries (Table A.1) and open clusters show a strong underestimation of the errors for all GSP-Phot parameters with an associated large number of outliers. For GSP-Spec, they show an underestimation of the errors for mh_gspspec, alphafe_gspspec, cafe_gspspec, and crfe_gspspec, while for some other elements, the uncertainties are slightly overestimated. GSP-Spec values were discretised at two decimals, except for dibew_gspspec, which was discretised at three decimals, and T eff , which was stored as an integer. This might cause some parameters to have similar upper and lower values, in which case the discretisation step should be used as an uncertainty estimate.
The published GSP-Phot MCMC samples contain 2000 points for G < 12, but only the last 100 points are made available for G > 12, except for a random 1% subset that was given the full 2000 points. When only 100 points are available, the upper or lower values of the GSP-Phot parameters, which were determined on the full 2000 steps, may not be fully consistent with the MCMC. This inconsistency is an indication of convergence issues. However, failed convergence usually does not relate to strong outliers, which are cases when the MCMC has converged to a very different solution. We find that 18% of the 2000-point chains present some problems such as multiple solutions, local maxima of the posterior probability, or edge effects. The MSC errors were inflated in post-processing, therefore the MSC MCMCs are not consistent with the provided upper or lower values.

Distance and absolute magnitude
The global distribution of GSP-Phot distances against parallax is shown in Fig. 37, where we consider the sources with ϖ/σ ϖ > 5. While a large fraction of sources follows the inverse parallax curve, 37% are 5σ outliers (considering only the parallax error). We measured the clustering in this space using the Kullback-Leibler divergence (KLD; Kullback & Leibler 1951; Fabricius et al. 2021), which is higher away from the plane and in particular around the Large and Small Magellanic Clouds (LMC and SMC).
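A 5σ outlier in this sense can be identified with the following minimal sketch. It assumes that the distance is given in pc (as for distance_gspphot), that the parallax and its error are in mas, and that only the parallax error enters the normalisation, as stated above. The function names are hypothetical.

```python
def parallax_deviation_sigma(distance_pc, parallax_mas, parallax_error_mas):
    """Deviation between the inverse distance and the parallax,
    in units of the parallax error."""
    if distance_pc <= 0.0 or parallax_error_mas <= 0.0:
        raise ValueError("distance and parallax error must be positive")
    # 1000 / d[pc] is the parallax in mas implied by the distance.
    return (1000.0 / distance_pc - parallax_mas) / parallax_error_mas


def is_distance_outlier(distance_pc, parallax_mas, parallax_error_mas,
                        n_sigma=5.0):
    """True if the distance deviates from the parallax by more than n_sigma."""
    dev = parallax_deviation_sigma(distance_pc, parallax_mas,
                                   parallax_error_mas)
    return abs(dev) > n_sigma


# A star at ~1 kpc according to its parallax, but with a 500 pc distance.
flagged = is_distance_outlier(500.0, 1.0, 0.02)
```

The threshold n_sigma is left as a parameter because a stricter or looser cut may be appropriate depending on the application.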
Strong outliers in the GSP-Phot distances are seen in the comparison of the estimates of the two wide binary components (Appendix A), even though the relative precision of the parallax is better than 20% in this sample. A comparison with the known cluster distances of Fouesneau (2022) shows that the distances are systematically underestimated at large distances and poor relative parallax precision. This is also confirmed with the APOGEE DR16 red clump sample. This seems to be due to a too strong prior (Andrae 2022). The MSC distances distance_msc are shown by Fouesneau (2022) to present a higher dispersion than the GSP-Phot distances, even for known binaries.
The GSP-Phot absolute magnitude estimate mg_gspphot is compared to the absolute magnitude computed directly from the parallax for a sample of stars with negligible extinction in Fig. 38. It shows the combination of distance outliers (leading to the strong outliers) and extinction overestimation for stars with M G ≳ 7 (see Fig. 30), leading to a bias. Moreover, mg_gspphot is not correctly estimated in stars farther away than 1-2 kpc as an effect of the underestimated distance. This is clearly illustrated in Fig. 39, where the distance modulus (m − M) is derived from distance_gspphot. We recommend using the deviation between the GSP-Phot distances and the parallax to filter GSP-Phot outliers. For a number of applications, it may be preferable to use the parallax to estimate the distance and absolute magnitude (see Luri et al. 2018) over the GSP-Phot estimates.
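For stars with negligible extinction and a precise parallax, the absolute magnitude computed directly from the parallax and the distance modulus implied by a distance can be sketched as follows. The parallax is assumed to be in mas and the distance in pc; the function names are hypothetical, and no extinction correction is applied.

```python
import math


def absolute_magnitude_from_parallax(g_mag, parallax_mas):
    """M_G = G + 5 + 5 log10(parallax / 1000), with the parallax in mas."""
    if parallax_mas <= 0.0:
        raise ValueError("parallax must be positive")
    return g_mag + 5.0 + 5.0 * math.log10(parallax_mas / 1000.0)


def distance_modulus(distance_pc):
    """m - M = 5 log10(d) - 5, with the distance d in pc."""
    if distance_pc <= 0.0:
        raise ValueError("distance must be positive")
    return 5.0 * math.log10(distance_pc) - 5.0


# A G = 10 mag star with a parallax of 10 mas, that is, at 100 pc.
m_g = absolute_magnitude_from_parallax(10.0, 10.0)
mu = distance_modulus(100.0)
```

This simple inversion is only appropriate when the relative parallax uncertainty is small; otherwise the treatment discussed by Luri et al. (2018) is needed.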

Stellar evolution parameters
The stellar evolution parameters radius, mass, age, evolution stage, and gravitational redshift are provided by the FLAME module (Creevey 2022). They are derived either from GSP-Phot parameters (fields named _flame in the table astrophysical_parameters) or from GSP-Spec parameters (fields named _flame_spec in astrophysical_parameters_supp). For the mass, age, and evolutionary stage, they use solar metallicity evolution models. The estimates of these parameters for non-solar metallicity stars should therefore be used with caution.
Figure 40 shows the comparison between the FLAME radius and the radius from the JSDC stellar diameter catalogue (Bourges et al. 2017, v2, selecting stars with χ 2 < 2) for stars with relative parallax uncertainties smaller than 10%. The parallax is used to transform the JSDC angular diameter into radius. The radius derived by FLAME using GSP-Spec T eff , radius_flame_spec, is overestimated for blue main-sequence stars and for red giants, but it is underestimated for very red giants (G BP − G RP > 2.2). The radius derived by FLAME using GSP-Phot T eff , radius_flame, has the same properties, but fewer outliers than radius_gspphot provided directly by GSP-Phot because FLAME directly uses the parallax to derive the luminosity for this sample with a good parallax signal-to-noise ratio (see flags_flame). We therefore recommend using radius_flame for a radius estimation.
Masses from FLAME compare well with asteroseismic estimates for dwarfs and subgiants (using Serenelli et al. 2017; Godoy-Rivera et al. 2021), but strong outliers are seen for giants (using Yu et al. 2018). Comparison with the GUMS model confirms the presence of a high-mass tail in the FLAME data that is not predicted by the model. This tail is present in all Galactic directions, even at high latitudes. It is associated with an excess of young stars. These young (< 2 Gyr) and massive (> 2 M⊙) stars are on the giant branch. We therefore recommend using the FLAME masses of sources whose flags_flame[_spec] first character is 1 (giant flag) only within the 1-2 M⊙ range, with caution, and taking their large uncertainties into account.
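This recommendation can be expressed as a simple selection. The sketch below assumes that flags_flame is available as a two-character string (an integer is zero-padded to two characters) and that the mass is expressed in solar masses; both the function name and this handling of the flag are assumptions made for illustration.

```python
def flame_mass_usable(mass_flame, flags_flame):
    """Apply the giant-flag mass cut to one source.

    Sources whose first flag character is '1' (giant flag) are kept
    only if their mass lies within 1-2 solar masses. Other sources
    are not affected by this particular cut.
    """
    flags = str(flags_flame).zfill(2)  # assumption: two-character flag
    if flags[0] == "1":
        return 1.0 <= mass_flame <= 2.0
    return True


# A flagged giant with a 3 solar-mass estimate is rejected.
keep = flame_mass_usable(3.0, "10")
```

The cut does not make the retained masses accurate; their large uncertainties still need to be propagated.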
The overestimation of the GSP-Phot extinction for low-mass stars (M G ≳ 7, Fig. 30) also has an impact on the FLAME parameters. The impact on masses M ≲ 0.7 M⊙ is illustrated in the Appendix. Strong outliers in the luminosity of giants are visible in the APOGEE red clump sample. A few mismatches of the evolutionary stage may also occur for giants that are confused with dwarfs with high extinction. They can be spotted in an HR diagram using an independent extinction estimate.
The gravitational redshift determined by FLAME is compared to the one used in GALAH DR3 (Zwitter et al. 2021) in Fig. 41 for sources with a gravitational redshift error from FLAME lower than 1 km s −1 . The gravitational redshift based on GSP-Spec (gravredshift_flame_spec) has fewer outliers than the redshift based on GSP-Phot (gravredshift_flame), but it has a small bias of 0.05 km s −1 , corresponding to the bias in log g discussed above.

QSO and galaxies
Gaia DR3 includes two tables of extragalactic candidate sources, one for quasars and one for galaxies, called qso_candidates and galaxy_candidates (QSO and galaxy tables, for simplicity). These tables contain two main types of added value columns: on the one hand, we can use the different labels that are provided to tune the purity-to-completeness ratio of the sample, and on the other hand, each table also contains physical properties of the objects such as redshift, size, or variability. The astrophysical parameters (Creevey 2022) associated with these tables, that is, classification and redshifts, are described in Delchambre (2022), the surface brightness profiles are described in Ducourant (2022), and the variability is presented in Rimoldini et al. (2022) and in Carnerero et al. (2022) for AGNs. Moreover, a global analysis of these tables is presented in Gaia Collaboration & Bailer-Jones (2022). It is worth noting that these tables have been constructed with the aim of completeness, and as we show below, this means that their default purity is rather low. However, it is possible to obtain a high-purity subsample (Gaia Collaboration & Bailer-Jones 2022), as discussed below.

Purity
The different labels that are included within the QSO and galaxy tables can be used to create subsamples with different properties (see Sect. 8 of Gaia Collaboration & Bailer-Jones 2022 for a selection leading to 94% and 95% purity in the QSO and galaxy tables, respectively). The QSO and galaxy tables contain four common labels: vari_best_class_name = "AGN"/"GALAXY" (classification according to the stellar variability patterns; Rimoldini et al. 2022), classlabel_dsc = "quasar"/"galaxy" (from the discrete source classifier; Creevey 2022), classlabel_dsc_joint = "quasar"/"galaxy" (similar to the previous label, but more restrictive since it requires DSC-Specmod and DSC-Allosmod to agree, both with a score higher than 50%), and classlabel_oa (assigned by the self-organising map, SOM; Creevey 2022). It is worth noting that the results of the SOM were not used to construct these tables. In other words, this label was attached to the sources that were previously selected as candidates by other means. In addition to these shared class labels, we can also identify unique labels for each table. In the QSO table we have access to the astrometric_selection_flag (ASF), which allows us to select only sources with a high probability of being quasars based on their astrometry. We can also use the source_selection_flags to isolate the QSOs in the qso_catalogue_name table (bit 3 set to 1), hereafter called QuasarObject list, which effectively corresponds to the sources that are found in well-known QSO catalogues that had enough raw data to be processed successfully by the QSO pipeline. Finally, in the galaxy table, we can select the sources whose morphology was fit reliably. We refer to this subset as EO solution.
In Fig. 42 we show the astrometric properties, normalised by their formal uncertainties, of the different subsamples described above. Because extragalactic sources are so far away from the Sun, the astrometry of these objects might be expected to be dominated by observational errors. This is what we indeed observe for some of the subsamples, for example those in panels a and c, and also in panel e in the QSOs and panel b in the galaxies. However, the other subsamples show clear deviations from the expected standard normal probability distribution. These deviations in the DSC and OA subsamples are due mostly to the astrometric signal of the Magellanic Clouds and the Galactic disc (Fig. 43). We note, however, that while ∼94% of the sources in the QSO table have a 5p or 6p astrometric solution, the galaxy table is mostly dominated by 2p sources (∼71%). Therefore, the conclusions we draw concern only a portion of the galaxy table. In either case, it is clear that some subsamples, namely vari_best_class_name and classlabel_dsc_joint, are purer than others (classlabel_dsc and classlabel_oa) that were built for completeness.
The sky distribution of the sources in these subsamples is presented in Fig. 43. These plots are difficult to relate directly to the purity as some modules remove the LMC and SMC and the disc plane by force or using criteria that depend on the density, while some others, such as DSC, do not. Moreover, a constant misclassification rate over the sky leads to a higher density of misclassified objects where the object density is higher. However, in the LMC and SMC areas, more than 10% of the sources are in the qso_candidates table, so here the classification does not work well.
A good idea of the main stellar types of the stellar contaminants in the QSO and galaxy tables can be obtained by positioning those with a relative parallax uncertainty lower than 20% in an HR diagram in Fig. 44. It shows that the QSO candidate stellar contaminants are mainly white dwarfs and stars with G BP − G RP ∼0.4, while galaxy candidate stellar contaminants are mainly stars with G BP − G RP ∼1.4 or 0.8. Fig. 44 also shows that the criteria for the purer samples proposed in Gaia Collaboration & Bailer-Jones (2022) are efficient, but still retain a few contaminants. We use here host_galaxy_detected='true' instead of host_galaxy_flag < 6 as the latter leads to eight times more contaminants in our sample. These additional contaminants come from the EO input catalogue, but no host galaxy has been detected for them.

Morphological parameters
We compared the extended object morphological parameters of the galaxy table (Ducourant 2022) with the GAMA (Kelvin et al. 2012) and Dark Energy Survey (DES; Tarsitano et al. 2018) Sérsic profiles and with the SDSS DR16 (Ahumada et al. 2020) de Vaucouleurs profiles. Figure 45 illustrates the comparison with the DES Sérsic profiles. Saturation of the effective radius radius_sersic and radius_de_vaucouleurs at 8000 mas and of the Sérsic n-index at 8 is visible. It corresponds to the boundaries of the algorithm. Compared to DES, the Gaia DR3 index seems to be spread out more or less uniformly, with DES preferring n = 4. Essentially, the comparison with external catalogues of galaxy profiles shows an overestimation of the Sérsic index and an underestimation of the ellipticity. Both are a consequence of the fact that Gaia observes a smaller area around the galaxies than the external catalogues do; its measurements are therefore concentrated on the central regions and biased towards bulges (Ducourant 2022).
The morphological parameters are accompanied by their formal uncertainties.Since these are estimated from the variance resulting from the search for a minimum in the residuals between model and observations, the provided uncertainties reflect the quality of the convergence rather than the precision of the estimation.In consequence, a fraction of sources may appear to have extremely small uncertainties while in reality, this is just the byproduct of a correlation with the convergence velocity.

Redshifts
The provided galaxy redshift upper and lower values do not correspond to confidence intervals, but to prediction limits based on machine-learning. Still, the comparison with external catalogues indicates that (redshift_ugc_upper − redshift_ugc_lower)/2 gives a good estimate of the uncertainty. A redshift peak at about 0.07 (red arrow in the orange histogram in Fig. 46) is found, which corresponds either to very bright galaxies or to stellar contaminants with convergence issues. The redshift range 0.070-0.071 should therefore be ignored (Delchambre 2022). A global overestimation of the redshifts for bright sources (G < 19) is also observed.
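The two recommendations above can be sketched in a few lines of Python. This is only an illustration: the function name is hypothetical, the arguments are assumed to hold the redshift_ugc, redshift_ugc_lower, and redshift_ugc_upper values of one source, and treating the boundaries of the 0.070-0.071 range as inclusive is our assumption.

```python
def ugc_redshift_summary(redshift_ugc, redshift_ugc_lower, redshift_ugc_upper):
    """Return (sigma, usable) for one galaxy candidate.

    sigma  : half-width of the prediction limits, used as an uncertainty proxy
    usable : False when the redshift falls in the 0.070-0.071 peak
             (boundaries taken as inclusive, which is an assumption)
    """
    sigma = (redshift_ugc_upper - redshift_ugc_lower) / 2.0
    in_peak = 0.070 <= redshift_ugc <= 0.071
    return sigma, not in_peak


# Example with made-up values
sigma, usable = ugc_redshift_summary(0.15, 0.12, 0.18)
```

The half-width is only a proxy for the uncertainty, since the limits are prediction limits and not confidence intervals.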
The QSO redshifts are log-normally distributed. To compare them to the literature, we therefore used Z = log(redshift_qsoc + 1), which is normally distributed with a standard deviation of σ = (log(redshift_qsoc_upper + 1) − log(redshift_qsoc_lower + 1))/2 (Delchambre 2022). The comparison with LQAC5 (Souchay et al. 2019) presents 33% of outliers at 5σ, which reduces to 8% when the flag flags_qsoc=0 or flags_qsoc=16 is used. This is due to the degeneracies between spectral lines and redshift in the XP spectra (see Delchambre 2022; Gaia Collaboration & Bailer-Jones 2022). A peak, this time at about 0.08, is also visible in the redshift distribution of the QSO (red arrow in the blue histogram in Fig. 46). The reason for this peak is that the MgII emission line is misclassified as Hβ, a characteristic emission line of this specific redshift range (Delchambre 2022). However, only a small number of sources contribute to this peak, and most of them have a nonzero flags_qsoc.
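A minimal sketch of this comparison is given below. The function names are hypothetical, the natural logarithm is assumed (the number of σ of a deviation does not depend on the base), and the uncertainty of the literature redshift z_ref is neglected, which is a simplification on our side.

```python
import math


def qsoc_log_redshift(redshift_qsoc, redshift_qsoc_lower, redshift_qsoc_upper):
    """Return (Z, sigma) with Z = log(z + 1) and sigma the half-width in log space."""
    z_log = math.log(redshift_qsoc + 1.0)
    sigma = (math.log(redshift_qsoc_upper + 1.0)
             - math.log(redshift_qsoc_lower + 1.0)) / 2.0
    return z_log, sigma


def is_outlier(redshift_qsoc, redshift_qsoc_lower, redshift_qsoc_upper,
               z_ref, n_sigma=5.0):
    """True when the literature redshift z_ref deviates by more than n_sigma."""
    z_log, sigma = qsoc_log_redshift(redshift_qsoc, redshift_qsoc_lower,
                                     redshift_qsoc_upper)
    return abs(z_log - math.log(z_ref + 1.0)) > n_sigma * sigma


def has_recommended_flag(flags_qsoc):
    """Flag selection quoted in the text: flags_qsoc = 0 or flags_qsoc = 16."""
    return flags_qsoc in (0, 16)
```

Applying has_recommended_flag before counting outliers corresponds to the selection shown with the dotted blue line in Fig. 46.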

Non-single stars
Gaia DR3 provides four tables for non-single stars (NSSs). The table nss_two_body_orbit contains orbital two-body models, covering astrometric (Halbwachs 2022; Holl et al. 2022b), spectroscopic (Gosset 2022; Damerdji et al. 2022), and eclipsing (Siopis 2022) binaries as well as combinations of these. The model that is used is indicated in the field nss_solution_type, and the parameters that are solved for a given solution are described in the bit_index field. The tables nss_acceleration_astro and nss_non_linear_spectro contain astrometric (Halbwachs 2022) and spectroscopic (Gosset 2022) acceleration solutions, and nss_vim_fl contains variability-induced mover (VIM) solutions (Halbwachs 2022). Gaia Collaboration & Arenou (2022) also present the overall content of these non-single star tables.

Astrometric orbital elements
The orbital solutions for the astrometric binaries are presented using the so-called Thiele-Innes coefficients. They express the orbital motion of the photocentre on the sky with a linear formulation. These coefficients replace the more usual Campbell elements a 0 , i, ω, and Ω, which are semi-major axis, inclination, longitude of periastron, and position angle of the ascending node, respectively. The relations between the two parameter sets are described in Halbwachs (2022). In the transformation from Thiele-Innes to Campbell coefficients, it may be useful to use Monte Carlo simulations that take the correlation matrix into account instead of using local linear approximation formulas. In 87% of the NSS sample, Gaussian errors in Thiele-Innes coefficients are transformed into asymmetric distributions for at least one of the Campbell elements. Mostly in the case of very low eccentricities, however, a number of sources show a significance parameter that disagrees with the signal-to-noise ratio that can be derived from Monte Carlo simulations (Fig. 47). This seems to be due to an overestimation of the Thiele-Innes coefficient errors. Despite this, the local linear approximation formulas work well in deriving the error on the a 0 parameter even with a very strong overestimation of the Thiele-Innes coefficient errors. Because Orbital solutions are filtered to have significance > 5, a Gaussian error model for a 0 is reasonable for them, whereas OrbitalTargetedSearch solutions still need to be filtered. Due to an issue with the significance of AstroSpectroSB1 (see the on-line documentation), it needs to be verified that the signal-to-noise ratio is higher than 5. The issue is also present for the spectroscopic part of the AstroSpectroSB1 solutions, for which local linear approximation errors on a 1 can be used as soon as the resulting signal-to-noise ratio is confirmed to be higher than 5. The local linear approximation formulas for the Thiele-Innes coefficients can be found in the appendix of Halbwachs (2022). Overall, to handle the Thiele-Innes coefficients, usual Monte Carlo techniques such as MCMC should not be used. Codes using automatic differentiation such as ADMB (Fournier et al. 2012) and TMB (Kristensen et al. 2016) have been tested to work fine for signal-to-noise ratios higher than 5.
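For orientation, the relation between the two parameter sets can be sketched with the standard textbook formulas for the coefficients A, B, F, and G. We assume here that these formulas match the conventions of Halbwachs (2022), which should be checked before use. The function names are hypothetical, angles are in radians, and the node is placed in [0, π), the usual convention when radial velocities do not lift the ambiguity. The sketch converts point estimates only and does not propagate uncertainties.

```python
import math


def campbell_to_thiele_innes(a0, inc, omega, node):
    """Campbell elements (angles in radians) -> Thiele-Innes A, B, F, G."""
    co, so = math.cos(omega), math.sin(omega)
    cn, sn = math.cos(node), math.sin(node)
    ci = math.cos(inc)
    A = a0 * (co * cn - so * sn * ci)
    B = a0 * (co * sn + so * cn * ci)
    F = a0 * (-so * cn - co * sn * ci)
    G = a0 * (-so * sn + co * cn * ci)
    return A, B, F, G


def thiele_innes_to_campbell(A, B, F, G):
    """Thiele-Innes A, B, F, G -> (a0, inc, omega, node), node in [0, pi)."""
    u = 0.5 * (A * A + B * B + F * F + G * G)
    v = A * G - B * F                      # equals a0^2 * cos(inc)
    a0 = math.sqrt(u + math.sqrt(max((u + v) * (u - v), 0.0)))
    inc = math.acos(max(-1.0, min(1.0, v / (a0 * a0))))
    omega_plus_node = math.atan2(B - F, A + G)
    omega_minus_node = math.atan2(-B - F, A - G)
    omega = 0.5 * (omega_plus_node + omega_minus_node)
    node = 0.5 * (omega_plus_node - omega_minus_node)
    if node < 0.0:                         # (omega, node) and (omega + pi, node + pi)
        node += math.pi                    # give the same A, B, F, G
        omega += math.pi
    return a0, inc, omega % (2.0 * math.pi), node
```

The conversion is degenerate for a vanishing semi-major axis and for exactly face-on orbits, where only ω + Ω or ω − Ω is defined, so these cases need separate handling.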
The covariance matrix for very low eccentricity solutions may be problematic. In these cases, the eccentricity and periastron time should be set to zero. For AstroSpectroSB1 with eccentricity and argument of periastron fixed to zero (bit_index=65435), c_thiele_innes is fixed to the non-circular value instead of zero. The statistical properties of the distribution of the orbital elements are discussed in the appendix of Gaia Collaboration & Arenou (2022).

External comparisons
The comparison with external catalogues 14 shows that the orbital parameters agree well with literature values when the periods are consistent. It also confirms that the center_of_mass_velocity agrees better with literature binary values than the gaia_source.radial_velocity does. The strongest disagreements with external catalogues on the radial velocity semi-amplitude of the primary are for stars that are known to be SB2 (double-line spectroscopic binary), but are treated as SB1 (single-line) by NSS (Gosset 2022).
The comparison of the literature orbits with astrometric acceleration solutions in the nss_acceleration_astro table indicates that a significant fraction might have had an orbital solution and that some Acceleration7 could have been Acceleration9. This is intrinsic to the decision chain explained in Halbwachs (2022). The acceleration values disagree with the expectations from the known orbits and the Gaia observation times. The acceleration values should therefore be used with caution.
The NSS parallaxes show a median difference with the gaia_source parallaxes that is smaller than a few µas. The HR diagram derived using NSS parallaxes (orbital and acceleration) is slightly sharper than the diagram derived with the gaia_source parallaxes, which indicates that the parallaxes are slightly more precise. The (statistical) improvement of the solutions does not guarantee that the accelerations are all physical, however. When they are compared to the long-term proper motion provided in the Hipparcos-Gaia catalogue of accelerations (Kervella et al. 2022; Gaia Collaboration & Arenou 2022), the NSS proper motions improve on the gaia_source proper motions for the orbital solutions (moving from 21% of 5σ outliers to 9%), but not for the acceleration solutions (which have a much higher median signal-to-noise ratio of the proper motion anomaly than the orbital solutions), for which the comparison is slightly worse (moving from 80% outliers to 85%). This highlights that proper motion and accelerations may both have absorbed the orbital motion.
The temperature ratio of eclipsing binaries corresponds well to the ratio derived by Eker et al. (2014), except for sources with a low g_luminosity_ratio. The correspondence with the MSC temperature ratio is poor, but these ratios are to be used with caution (see Section 4.3). The uncertainties on the eclipsing binary inclinations are suspiciously small.

Spurious solutions and error rescaling
To achieve the required radial velocity precision, the precise position of the spectra on the focal plane at each epoch needs to be known. For this purpose, the expected astrometric position as given by the predicted standard astrometric motion is used, rather than the measured astrometric position at the epoch, which would not be precise enough. However, if the astrometric motion is perturbed, or if the astrometric solution is not correct, then the computed epoch RV will absorb this astrometric perturbation. This means that the epoch radial velocities could increase by up to ≈ 0.146 × astrometric_excess_noise (km/s) in the case of binaries. In the best case, this would add an unmodelled additional dispersion, and in the worst case, possibly a small trend. This may lead to spurious short-period and large-ruwe SB1 solutions. The absence of a gaia_source.radial_velocity value for an SB1 solution should warn the user: the source might have been considered peculiar, potentially SB2, too hot, too cool, with emission lines, or contaminated by a nearby star. Radial velocity variations can also be due to stellar pulsations instead of an orbital motion (Gaia Collaboration & Arenou 2022), so that the variability information should also be checked for suspicious solutions.
14 Pourbaix (2000); Jancart et al. (2005)
While the errors were rescaled according to the goodness of fit for the astrometric solutions (Orbital, AstroSpectroSB1, VIM, and acceleration), this is not the case for the others. Because the mean goodness-of-fit distribution of SB2 and eclipsing solutions is quite large, we recommend rescaling the formal uncertainties for these solutions. The goodness_of_fit provided for SB2 solutions can deviate by up to 1.6 from the one that can be recomputed using obj_func.
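As a worked example of the bound quoted above for the epoch radial velocities, the following sketch evaluates 0.146 × astrometric_excess_noise. The function name is hypothetical, and we assume that astrometric_excess_noise is expressed in mas, as in gaia_source.

```python
def max_epoch_rv_bias_kms(astrometric_excess_noise_mas):
    """Approximate upper bound (km/s) on the epoch RV perturbation.

    Uses the factor 0.146 km/s per mas quoted in the text; the input is
    assumed to be astrometric_excess_noise in mas.
    """
    return 0.146 * astrometric_excess_noise_mas


# A source with an excess noise of 0.5 mas (made-up value)
bias = max_epoch_rv_bias_kms(0.5)  # about 0.07 km/s
```

Comparing this bound with the semi-amplitude of an SB1 solution gives a quick indication of whether the astrometric perturbation could matter for that source.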

Variability
Gaia DR3 provides variability information for about 11.8 million sources, including 10.5 million variable sources of about 30 types of variability (Eyer et al. 2022) and 1.3 million sources (variable or not) in the Gaia Andromeda Photometric Survey (GAPS) (Evans et al. 2022). Time-series photometry is released for all these 11.8 million sources in the epoch_photometry datalink table; their statistical parameters and the links to the other variability tables in which they may appear are listed in the vari_summary table. The variability signals associated with galaxies in the galaxy_candidates table are mostly artefacts due to their extension (Holl et al. 2022a), and these sources are therefore not in the vari_summary or epoch_photometry tables. Here, we present a brief overview of some issues we found during the scientific validation. For further details, we suggest that readers consult the on-line documentation 2 and papers 1 .
A number of sources show more than one type of variability. While most overlaps between different classes can be explained scientifically, some stars have contradicting classifications. For example, 3159 sources are classified as both long-period and short-timescale variables. Detailed analyses of the final classification for these sources are provided in Lebzelter et al. (2022).
Intensity-averaged magnitudes in the BP (int_average_bp) and RP (int_average_rp) bands for four and two RR Lyrae stars, respectively, have unreliable negative values reaching BP = −88 ± 22 mag. These six sources are faint RR Lyrae variables (G ∼ 18.5 − 19 mag) for which the specific objects study pipeline for Cepheids and RR Lyrae stars (SOS Cep&RRL; Clementini et al. 2022) failed to fit data points with the model line. The values that were provided were accordingly unreliable. However, other parameters that were calculated for these stars, such as intensity-averaged G magnitudes and pulsation periods, are correct. It was therefore decided to include these sources in the DR3 sample of RR Lyrae stars despite the incorrect BP and RP intensity-averaged magnitude estimates.
For 286 RR Lyrae stars, absorption in the G passband (g_absorption) reaches unreliably high values from 10 to 3367 mag. This is likely caused by the imprecise estimation of the G RP magnitudes for the faint sources (see Clementini et al. 2022 for further details).

Solar System objects
Gaia DR3 provides information for 158 152 Solar System objects (SSO) with more than 20 million observations (epoch astrometry). A large data-set of ultra-accurate observations like this is made available in a single day for the first time.
The sample contains 156 801 known numbered minor planets, 1 320 unmatched moving objects, and 31 natural satellites of planets. The source selection is described in Tanga (2022).
All the main categories of Solar System bodies are present among the known numbered minor planets: 447 Near-Earth asteroids (NEAs), 154 771 main-belt asteroids (MBAs), and a total of 1 551 Jupiter Trojans, Centaurs, and more distant objects. Figure 48 shows the different categories in the semi-major axis and eccentricity plane.
The table sso_source in the Gaia archive contains the number of observations for each source. We would like to point out that the count of the number of observations is incorrect for four sources. The explanation for how to obtain the correct number of observations is provided in the on-line documentation.

Unmatched sources and natural satellites of planets
A small subsample of the data consists of 1 320 objects that were considered unknown at the time of processing. We refer to them as unmatched sources. Tanga (2022) performed a search to identify how many unmatched sources can now be identified (February 2022), and they found an identification for 712 sources. We cannot exclude either that some of the still-unmatched sources will be identified or linked to known objects when the observations are sent to the Minor Planet Center 15 . All the sources will still appear as unmatched in the sso_source and sso_observations tables in the Gaia archive. The sample also contains natural satellites of planets for the first time. For a complete description of the process of selection, we refer to Tanga (2022) (Sect. 3.1).

Orbit determination process
We used an orbit determination process to assess the quality of the data. This process is similar to the process that was carried out to validate Gaia DR2 (Gaia Collaboration et al. 2018; Arenou et al. 2018).
We selected Gaia observations only for every known numbered minor planet in Gaia DR3. We used a modified version of the OrbFit software 16 to fit the orbits to Gaia observations alone. It is important to note that this software is completely independent from everything that runs in the Gaia data processing, and we improved it to fully exploit the accuracy of Gaia observations.
The results of the orbital fit can be summarised as an orbit (if the fit converges), post-fit residuals in the (α cos(δ), δ) space and in the (AL, AC) space, and rejection of poor-quality or mistakenly linked observations. The modified version of the OrbFit software makes use of a non-linear weighted least-squares algorithm to fit the orbits. The weight matrix for Gaia is the quadratic sum of the systematic and random matrices, available in the sso_observation table.
We also corrected the observations for the light bending. This is a different approach than was applied in the validation of Gaia DR2.

Orbit determination results: Orbit failures
The orbit fit procedure worked for almost all the known sources. It only failed for 198 objects. The reasons for this vary: the time spanned by the observations was too short, too few observations were available, or a combination of the two, as shown in Fig. 49.
The quality of the observations is not affected by the nonconvergence of the orbit. They were therefore all accepted and are available in the sso_observations table. The orbit-determination process is based on finding and removing bad-quality observations, so that they do not affect the goodness of fit. The orbit-determination software we used to validate the data rejects observations with χ 2 > 25 (5σ). This can happen because the quality of the data is not as expected, because the weights that are used are too low, or because the observations do not belong to the object. The latter case is called mistaken linkage or incorrect identification. We decided to remove from the data only the observations for which the absolute value of the along-scan post-fit residuals was higher than 250 mas and for which the absolute value of the across-scan post-fit residuals was higher than 2 500 mas. In this way, we cleaned the database from possible contaminants. At the same time, we wished to keep the largest possible number of observations so that the community could search for interesting features (e.g. the presence of satellites). As a consequence, the sample can still contain some contaminants. For example, we may only have removed part of a transit, but we decided to adopt a unique approach that is valid for all the observations. Some observations can also be rejected during the orbit-determination process, but this does not affect the overall quality of the data.
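The χ 2 > 25 rejection test can be illustrated for a single two-dimensional residual. This is a sketch under stated assumptions and not the OrbFit implementation: we assume that the error model is the sum of a random and a systematic 2×2 covariance matrix, that the weight matrix is its inverse, and that residuals and covariances are expressed in the same frame and units. All function names are hypothetical.

```python
def chi2_2d(residual, cov_random, cov_systematic):
    """chi^2 = r^T (C_random + C_systematic)^-1 r for a 2D residual r."""
    rx, ry = residual
    c11 = cov_random[0][0] + cov_systematic[0][0]
    c12 = cov_random[0][1] + cov_systematic[0][1]
    c22 = cov_random[1][1] + cov_systematic[1][1]
    det = c11 * c22 - c12 * c12
    if det <= 0.0:
        raise ValueError("total covariance matrix is not positive definite")
    # Closed-form inverse of the symmetric 2x2 matrix applied to r
    return (c22 * rx * rx - 2.0 * c12 * rx * ry + c11 * ry * ry) / det


def is_rejected(residual, cov_random, cov_systematic, threshold=25.0):
    """True when the observation exceeds the chi^2 threshold (25 in the text)."""
    return chi2_2d(residual, cov_random, cov_systematic) > threshold
```

With a random covariance of diag(1, 100) mas² and no systematic term, a residual of (6, 0) mas gives χ 2 = 36 and is rejected, whereas adding a systematic term of diag(3, 0) mas² lowers it to χ 2 = 9. This illustrates why observations can be rejected when the adopted uncertainties are too small.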
After removing the bad-quality observations, we analysed the post-fit residuals in the along-scan and across-scan directions. Figure 50 shows the histogram of the post-fit residuals along-scan (∆AL) and across-scan (∆AC). These residuals are obtained as a rotation of the residuals in α cos(δ) and δ, where the rotation angle is the position angle as given in the sso_observations table. The whole procedure has been described in Gaia Collaboration et al. (2018).
The mean of the post-fit along-scan residuals is 0.03 mas, and the standard deviation is slightly larger than 5 mas. This is exactly what we expected as a result of the orbit-determination fit (we recall that we discarded observations at the 5σ level). Post-fit residuals in the across-scan direction are expected to be far larger than the corresponding along-scan residuals as a result of the geometry of the spacecraft observations (Gaia Collaboration et al. 2018), as the histogram in Fig. 50 shows. The mean in this case is close to 13 mas, which shows that the across-scan observations still contain a small bias. The standard deviation is larger than 200 mas. This is close to what we expected.
We now examine the (∆AL, ∆AC) post-fit residuals as a function of the G magnitude (Fig. 51). For very bright sources (G < 13 mag), a full two-dimensional window is transmitted, which means that across-scan information is available, corresponding to what we show in Fig. 51, where across-scan residuals are at the milliarcsecond level when G < 13 mag. Figure 51 shows the increase in along-scan residuals when the source is fainter (G > 19 mag). They almost reach the detectability limit, but usually remain very small (inside the [−10, 10] mas interval) for all the other sources.
Additional information about residuals and a comparison with Gaia DR2 are available in Tanga (2022), even though the authors used a different set of residuals that they obtained as a result of the internal process of the observations and not from the validation. It has been proved in the same paper that these residuals and the corresponding orbit can be considered equivalent to those that were computed during the validation process.
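The rotation from (α cos(δ), δ) residuals to (∆AL, ∆AC) can be sketched as follows. We assume that the position angle gives the along-scan direction measured from north towards east and is expressed in degrees; the sign chosen for the across-scan axis is also an assumption and should be checked against Gaia Collaboration et al. (2018). The function name is hypothetical.

```python
import math


def to_scan_frame(d_alpha_star, d_delta, position_angle_deg):
    """Rotate residuals (d_alpha*cos(delta), d_delta) into (AL, AC).

    Assumes the position angle is counted from north towards east and
    points along the scan direction; the AC sign is an assumed convention.
    """
    pa = math.radians(position_angle_deg)
    d_al = d_alpha_star * math.sin(pa) + d_delta * math.cos(pa)
    d_ac = -d_alpha_star * math.cos(pa) + d_delta * math.sin(pa)
    return d_al, d_ac
```

Because this is a pure rotation, the length of the residual vector is unchanged; only its split between the precise along-scan axis and the much less precise across-scan axis changes.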

Orbit accuracy: Comparison with known catalogues
The post-fit accuracy of the semi-major axis (σ a ) is a good estimator of the orbit quality. We compared the post-fit σ a obtained using Gaia observations alone with that available from the JPL Small Body Database 17 , which makes use of all the available observations; see Fig. 52. The black line in the figure is the bisector of the first quadrant: the orbits of the objects below the line have a better uncertainty using Gaia observations alone. It is clear that Gaia DR3 alone is still not enough to reach the final accuracy expected for Gaia (Gaia Collaboration et al. 2018, Fig. 32), but the number of orbits for which the accuracy is now better using Gaia alone has largely increased from Gaia DR2.
17 https://ssd.jpl.nasa.gov/

Conclusions
The third data release of Gaia, DR3, provides a very large amount of new data. This complex and diverse dataset has a number of caveats that the users should be aware of. In this paper we summarised the main issues we found during the transversal validation, and we provided links to the relevant papers or documentation and recommendations. In particular, we highlighted that flags provided with the data products should be used whenever available (e.g. flags_gspspec, flags_flame, dibqf_gspspec, and flags_qsoc). We warned about the error underestimation of the XP coefficients, GSP-Phot parameters, GSP-Spec ANN, spectroscopic and eclipsing binary solutions with a poor goodness-of-fit, and we provided a correction formula for the radial velocity error estimates. The DSC white dwarf and binary star classifications should not be used. A number of parameters were highlighted as to be used with caution (MSC parameters, mh_gspphot, distance_gspphot, mg_gspphot, FLAME mass and ages for giants, radial velocities for (G RVS − G) < −3, and astrometric binary acceleration values). The corrections proposed in Recio-Blanco (2022) should be applied to the GSP-Spec parameters. Some systematics such as those presented for vbroad, the XP spectra dip, or extinction overestimation are to be taken into account according to the science case. Filters need to be applied to the QSO and galaxy candidates to have purer samples (Gaia Collaboration & Bailer-Jones 2022). Monte Carlo techniques should not be used with the Thiele-Innes astrometric binary orbital parameters.
This paper focused on limitations in the data released in Gaia DR3. We encourage a study of the other papers that accompany this data release for an overview of the high quality of the Gaia products and a glance at the wonderful science outcomes that can be expected from this wealth of data. We hope that this paper will help users to find their way through the data so that they can make the best of it.

Fig. 2. Radial velocity of the primary star against the secondary for stars of binary pairs from the El-Badry et al. (2021) catalogue before (left) and after (right) filtering.

Fig. 3. Stacked spectra for all the sources with a radial velocity >600 km s −1 (421 sources, top) and <-600 km s −1 (349 sources, bottom). Solid vertical lines indicate the position of the calcium triplet, and dashed lines show the same lines shifted by 1.7 nm, indicating where the spectral line would be if the radial velocity correction were incorrect by 600 km s −1 .

Fig. 5. Variation in radial velocity difference with APOGEE DR16 as a function of the APOGEE metallicity.

Fig. 7. Radial velocity uncertainties tested with open clusters. Top panel: Absolute value of the difference between the radial velocity of a star and its cluster median |∆RV| normalised by the radial_velocity_error. The black line is the lowess (locally weighted scatterplot smoothing). The slope of the lowess for lower values of radial_velocity_error indicates that the errors can be underestimated at the bright end (but see the text for a discussion). Bottom panel: Difference between the radial velocity of a star and its cluster median ∆RV normalised by the radial velocity error in different radial velocity error bins.

Fig. 8. Standard error factor f σ that should be applied to radial_velocity_error as a function of magnitude (left) and temperature (right), estimated from the comparison with GALAH. In green we over-plot f σ estimated from the wide binaries (Eq. 1 and Table 1).

Fig. 9. Comparison of the spectral line broadening parameter with the De Medeiros et al. (2014) catalogue of FGK stars, colour-coded by the template temperature.

Fig. 10. Residuals from a global relation G RVS − G = f (G BP − G RP ) for a sample of APOGEE low-extinction solar metallicity dwarfs.

Fig. 11. Relative difference of the number of stars with a G RVS value between DR3 and GOG20 (DR3-GOG20)/DR3 in the magnitude range 12 < G < 13 in Galactic coordinates. -1 (+1) corresponds to a deficit (an excess) of 100% in DR3 data with regard to the GOG20 model.

Fig. 12. Galactic distribution of the sources for which RVS spectra are available in the HEALPix map of order 6. White patches are regions without sources.

Fig. 14. Density of Gaia DR3 sources in the sky with available xp_continuous_mean_spectrum in Galactic coordinates.

Fig. 15. Magnitude-colour diagram for sources for which xp_continuous_mean_spectrum is available in Gaia DR3.

Fig. 16. Source mean spectrum coefficients for all sources in the xp_continuous_mean_spectrum table for BP (top) and RP (bottom). The colour index indicates the source density.

Fig. 17. Number of relevant bases in the xp_summary table for BP (blue) and RP (red).

Fig. 18. Comparison of the number of sources failing the small wings test in BP (blue/cyan) and RP (red/magenta) when all source coefficients are considered (solid lines) or only truncated coefficients are taken into account (dashed lines) as a function of colour.

Fig. 19. Distribution of sources in the z-L 1 norm plane for BP (top panel) and RP (bottom panel).

Fig. 20. Cumulative histogram of the differential wiggling coefficient ∆w 3 measured for stars with 0.5<bp_rp<0.7 and within different bins of phot_g_mean_mag. The left panel shows ∆w 3 values measured in the non-truncated spectra, and the right panel shows these values for truncated spectra.

Fig. 21. Differential wiggling ∆w 10 as a function of H Hα . Non-truncated spectra are shown on the left, and truncated spectra are shown on the right.

Fig. 22. Histogram of the ratio r of the photometric and spectrum flux in BP (left) and RP (right).

Fig. 23. Distribution of sources in decadic logarithm of the ratio of photometric and spectroscopic uncertainty and XP magnitude for BP (top panel) and RP (bottom panel).

Fig. 24. Zoom on the p-value distributions obtained for the two bands BP (left) and RP (right). The pairs with a p-value below 0.01 failed the test.

Fig. 26. Median flux difference, normalised by the errors, between the XP sampled spectra and the CALSPEC spectra as a function of wavelength. Dotted lines correspond to the 1σ confidence interval.
10 DSC: Discrete source classifier; GSP-Phot: General stellar parametriser from photometry; GSP-Spec: General stellar parametriser from spectroscopy; FLAME: Final luminosity age mass estimator; ESP-CS: Extended stellar parametriser for cool stars; ESP-UCD: Extended stellar parametriser for ultra-cool dwarfs; ESP-HS: Extended stellar parametriser for hot stars; ESP-ELS: Extended stellar parametriser for emission line stars; MSC: Multiple star classifier; QSOC: QSO classifier; UGC: Unresolved galaxy classifier; OA: Outlier analysis; TGE: Total Galactic extinction.

Fig. 29. Density plot of the comparison of the monochromatic extinctions of GSP-Phot with those derived by Lallement et al. (2018). The green line corresponds to the 1.02 relation that is expected given the slight wavelength difference between the two A 0 .
shown in Fig. 29.The Lallement et al. (2018) A 0 are consistent with the A V values from StarHorse provided with APOGEE DR16 (Queiroz et al. 2020) with only the ex-

Fig. 30. Hertzsprung-Russell diagram of low-extinction stars (A 0 < 0.05 mag according to Lallement et al. (2019)) with a parallax relative precision lower than 10%, colour-coded with the mean extinction azero_gspphot. The colour is saturated as black for values higher than 1 mag. In this low extinction sample, M G is simply G + 5 + 5 log(ϖ/1000).

Fig. 34. Comparison of mh_gspphot with literature values from open clusters. We plot the median value for each cluster. The error bars show the dispersion around the median. The red line indicates the zero value.

Fig. 35. Correlation between the GSP-Spec parameters and magnitude (left) and metallicity (right) illustrated here with the temperature residuals compared to APOGEE DR16.

Fig. 37. Global distribution of distance_gspphot against parallax for sources with ϖ/σ_ϖ > 5. The solid black line represents the 1/parallax relation. The map (l, b) on the right shows the sky distribution of the clustering between the two parameters.

Fig. 38. Density plot of the difference between mg_gspphot and the absolute magnitude computed directly with the parallax for a sample of stars with negligible extinction (A 0 < 0.05 according to Lallement et al. (2019)) and a parallax_over_error > 10.

Fig. 39. Colour-magnitude diagram of NGC 6791 (left panel), mg_gspphot vs G BP − G RP (central panel), and G vs distance modulus (m-M) derived from distance_gspphot (right panel). The blue line in the right panel shows the literature value, and the green lines in the left and central panels show the PARSEC isochrone, which has the same parameters as the cluster.

Fig. 40. Comparison between JSDC radius and the FLAME radii based on GSP-Phot (left) or GSP-Spec (right), colour-coded with the G BP − G RP colour.

Fig. 41. Density plot of the comparison between the GALAH gravitational redshift and the FLAME redshifts based on GSP-Phot (left) or GSP-Spec (right).

Fig. 42. Astrometric properties of the different subsamples contained within the QSO (top) and galaxy (bottom) candidate tables. Each panel contains the distribution of parallaxes and proper motions, normalised to their errors. The grey line corresponds to a normal distribution.

Fig. 43. Sky distribution of the different subsamples contained within the QSO (left) and galaxy (right) candidate tables.

Fig. 44. Gaia DR3 low-extinction HR diagram (grey scale). The position of sources in the QSO (top) and galaxy (bottom) candidate tables with parallax_over_error > 5 is overplotted with a red scaling with the square root of the number of sources. Colour points correspond to the stricter selection of candidates proposed in Gaia Collaboration & Bailer-Jones (2022).

Fig. 45. Comparison of Sérsic index (panel a) and ellipticity (panel b) from DES with the Gaia measurements.

Fig. 46. Redshift distribution of the QSO (blue) and galaxy (orange) candidate sources. The dotted blue line corresponds to the quasars that were selected with the recommended QSOC redshift flags (flags_qsoc=0 or flags_qsoc=16). The two peaks marked by the red arrows are discussed in the text.

Fig. 47. Density plot of the signal-to-noise ratio of the semi-major axis of the photocentre orbit (a 0 ) derived from a Monte Carlo method as a function of the value provided in the field significance.

Fig. 48. Asteroid population in Gaia DR3 in the (a, e) plane, where a is the semi-major axis in au and e is the eccentricity of the minor planets. The legend shows the different categories: blue squares for NEAs, red stars for MBAs, and green dots for Jupiter Trojans. For the sake of clarity, the plot does not show Centaurs and more distant objects.

Fig. 49. Time spanned by the observations in Gaia DR3 (in days) vs the number of observations for each known source. The red stars represent the objects for which the orbit determination process did not converge.

Fig. 50. Histogram of post-fit residuals of the selected observations in the along-scan (left) and across-scan (right) directions.

Fig. 51. Density plot of the post-fit residuals as a function of the G magnitude in the along-scan (left) and across-scan (right) directions.

Table 1. Coefficients to derive the standard error factor f σ that should be applied to radial_velocity_error according to Eq. 1 for G RVS > 8 mag.