Predicted asteroseismic detection yield for solar-like oscillating stars with PLATO

Aims. In this work, we determine the expected yield of detections of solar-like oscillations for the targets of the foreseen PLATO ESA mission. Our estimates are based on a study of the detection probability, which takes into account the properties of the target stars, using the information available in the PIC 1.1.0, including the current best estimate of the signal-to-noise ratio (S / N). The stellar samples, as deﬁned for this mission, include those with the lowest noise level (P1 and P2 samples) and the P5 sample, which has a higher noise level. For the P1 and P2 samples, the S / N is high enough (by construction) that we can assume that the individual mode frequencies can be measured. For these stars, we estimate the expected uncertainties in mass, radius, and age due to statistical errors induced by uncertainties from the observations only. Methods. We used a formulation from the literature to calculate the detection probability. We validated this formulation and the underlying assumptions with Kepler data. Once validated, we applied this approach to the PLATO samples. Using again Kepler data as a calibration set, we also derived relations to estimate the uncertainties of seismically inferred stellar mass, radius, and age. We then applied those relations to the main sequence stars with masses equal to or below 1.2 M (cid:12) belonging to the PLATO P1 and P2 samples and for which we predict a positive seismic detection. Results. We found that we can expect positive detections of solar-like oscillations for more than 15000 FGK stars in one single ﬁeld after a two-year observation run. Among them, 1131 main sequence stars with masses of ≤ 1 . 2 M (cid:12) satisfy the PLATO requirements for the uncertainties of the seismically inferred stellar masses, radii, and ages. The baseline observation programme of PLATO consists of observing two ﬁelds of similar size (one in the southern hemisphere and one in the northern hemisphere) for two years apiece. Accordingly, the expected seismic yields of the mission amount to over 30000 FGK dwarfs and subgiants, with positive detections of solar-like oscillations. This sample of expected solar-like oscillating stars is large enough to enable the PLATO mission’s stellar objectives to be amply satisﬁed. Conclusions. The PLATO mission is expected to produce a catalog sample of extremely well seismically characterized stars of a quality that is equivalent to the Kepler Legacy sample, but containing a number that is about 80 times greater, when observing two PLATO ﬁelds for two years apiece. These stars are a gold mine that will make it possible to make signiﬁcant advances in stellar modelling.


ABSTRACT
In this work, we determine the expected yield of detections of solar-like oscillations for the targets of the foreseen PLATO ESA mission.Our estimates are based on a study of the detection probability, which takes into account the properties of the target stars, using the information available in the PIC 1.1.0,including the current best estimate of the signal-to-noise ratio (S/N).The stellar samples, as defined for this mission, include those with the lowest noise level (P1 and P2 samples) and the P5 sample, which has a higher noise level.For the P1 and P2 samples, the S/N is high enough (by construction) that we can assume that the individual mode frequencies can be measured.For these stars, we estimate the expected uncertainties in mass, radius, and age due to statistical errors induced by uncertainties from the observations only.We used a formulation from the literature to calculate the detection probability.We validated this formulation and the underlying assumptions with Kepler data.Once validated, we applied this approach to the PLATO samples.Using again Kepler data as a calibration set, we also derived relations to estimate the uncertainties of seismically inferred stellar mass, radius, and age.We then applied those relations to the main sequence stars with masses equal to or below 1.2 M ⊙ belonging to the PLATO P1 and P2 samples and for which we predict a positive seismic detection.We found that we can expect positive detections of solar-like oscillations for more than 15 000 FGK stars in one single field after a two-year observation run.Among them, 1131 main sequence stars with masses of ≤ 1.2M ⊙ satisfy the PLATO requirements for the uncertainties of the seismically inferred stellar masses, radii, and ages.The baseline observation programme of PLATO consists of observing two fields of similar size (one in the southern hemisphere and one in the northern hemisphere) for two years apiece.Accordingly, the expected seismic yields of the mission amount to over 30000 FGK dwarfs and subgiants, with positive detections of solar-like oscillations.This sample of expected solar-like oscillating stars is large enough to enable the PLATO mission's stellar objectives to be amply satisfied.The PLATO mission is expected to produce a catalog sample of extremely well seismically characterized stars of a quality that is equivalent to the Kepler LEGACY sample, but containing a number that is about 80 times greater, when observing two PLATO fields for two years apiece.These stars are a gold mine that will make it possible to make significant advances in stellar modelling.

Introduction
The PLAnetary Transits and Oscillations of stars (PLATO) mission is the ESA Cosmic Vision M3 mission and its launch is scheduled for the end of 2026.Its main objectives are 1) the detection and accurate and precise characterisation of exoplanets down to Earth-size planets in the habitable zone of solar-like stars, 2) the accurate and precise determinations of the basic parameters of their host stars (mass, radius, age, etc), and 3) careful statistical analyses of the above characteristics in order to better understand the formation and evolution of planetary and stellar systems (hereafter, stellar systems).Sufficiently precise determinations of the characteristics of these stellar systems require very high-quality photometry carried out continuously over long periods of time, hence, the need for a space mission.Furthermore, the required accuracy calls for improvements of the stellar models used to estimate the age of the star-planet system.Improving the processes of stellar modelling is thus an intrinsic main objective of the PLATO mission (Rauer et al. 2014(Rauer et al. , 2023)).The science operation phase of PLATO is planned to last for four years with a possible extension of 4.5 years.The baseline is two long-pointings (LOPs) observing one field for two years apiece.PLATO will collect high-precision photometric lightcurves of thousands of stars, which will be of particular interest for asteroseismological studies.To reach its objectives, the PLATO mission has defined a core programme with several types of stellar samples.Here, we focus on the P1, P2, and P5 samples.The P1 and P2 samples (hereafter, P1P2) consist in the brightest PLATO targets which will be observed with a 25 s cadence.The P1 sample (resp., P2) includes at least 15 000 (resp., 1000) dwarf and subgiant stars (types F5 to K7), with V ≤ 11 mag (resp.V ≤ 8.5) observed over the mission and a noise level of < 50 ppm h 1/2 .The noise level of those samples has been adapted to enable precise seismological studies.The P5 sample contains at least 245 000 dwarf and subgiant stars (F5-K7), with V ≤ 13 mag cumulative over two target fields.Sampling of these light curves will be 600 s, but it is planned to acquire light curves with a shorter cadence of 50 s for the brightest targets in P5 or for targets of particular interest.For that reason, we also consider the stars of the P5 sample.Input information about the stars in those samples has been gathered in the Input PLATO Catalogue (PIC, Montalto et al. (2021); Nascimbeni et al. (2022) ).Here, we use version PICv1.0.0.For more details about the PLATO project, we refer to Rauer et al. (2014) and Rauer et al. (2023).
Asteroseismology, namely, the detection, measurement, and analysis of stellar oscillations, is one of the main tools that will be used in the framework of the PLATO mission to achieve these objectives.The mission is indeed designed to detect stellar oscillation modes for different classes of stars, including solar-type ones which are its prime targets for exoplanet detection.Asteroseismology can be used to infer the stellar properties, either by measuring global seismic parameters such as ν max , the frequency at which the oscillation modes have their maximum amplitude, and/or ∆ν, the large frequency separation, which characterizes the pattern of power spectra of solar-like oscillations.In ideal conditions, we can precisely measure the frequencies, amplitudes, and widths of individual oscillation modes, which then provide tighter constraints on the stellar mass, radius, and age.
Starting with the Sun, two decades of observations have demonstrated that solar-like oscillations offer a powerful way to derive precise stellar masses, radii, densities, and ages, provided that high-quality seismic parameters are available (see for instance the reviews by Chaplin & Miglio 2013;Christensen-Dalsgaard 2016, 2018;García & Ballot 2019;Jackiewicz 2021;Serenelli et al. 2021).However, this requires short-cadence (less than 1 mn for dwarfs and subgiants), ultra-high photometric precision (at the level of parts-per-million or ppm), and nearlyuninterrupted long-duration (from months to years) monitoring.
Here, we investigate the seismic performances of the core programme of the PLATO mission; more precisely, our goal is twofold: 1) to obtain an estimate of the number of stars of the PLATO catalogue for which solar-like oscillations can be detected and 2) to obtain an estimate of the uncertainties on the stellar mass, radius, and age inferences in cases where the data are of high enough quality that individual modes can be detected and their frequencies be measured.By uncertainties here we mean those statistical errors which can only be decreased with higher quality observations (i.e. higher S/N values and longer observation times).We then estimate the (statistical) uncertainties on stellar mass, radius, and age, which result from the propagation of observational errors on the seismic data.We stress that systematic errors and/or biases must be added to the statistical errors to obtain the final error budget.In the present case, systematic errors and/or biases mostly depend on our ability to improve our stellar modelling.This is only briefly discussed at the end of the paper.
The paper is organized as follows.In Sect.2, we detail the theoretical approach we used to derive the probability of detection of solar-like oscillations.The calculation is based on (Chaplin et al. 2011b, hereafter C11) 's methodology which was developed for studying the asteroseismic potential of the Kepler mission and later on used for the TESS (Campante et al. (2016), Schofield et al. (2019)) and CHEOPS (Moya et al. 2018) missions.In order to validate our own computations, we use several samples of stars observed by the NASA Kepler mission (Borucki et al. 2010;Koch et al. 2010) with and without detected solar-like oscillations.These test samples are presented in Sect.3, together with the results about the reliability of the detection probability.In Sect.4, we present our approach to estimate the uncertainties in the seismic inferences of stellar masses, radii, and ages (hereafter, MRA) when assuming that frequencies of individual modes can be measured.In Sect.5, we present the computations of the detection probabilities for stars in the P1P2 and P5 samples.Those calculations predict the number of stars in each sample for whichvwe expect to detect solar-like oscillations in the core programme of PLATO.As assumed by the PLATO consortium, we consider two observational conditions: 1) we take the adopted noise level arising from observations from nominal 24 cameras at the beginning of life (BOL), so there is no degradation of the instrument; and 2) we take the noise level arising from observations by 22 cameras only at the end of of life (EOL), allowing for some degradation of the instrument as usually taken as reference by ESA.We use the noise level given in the PIC that takes into account the fact that each target is observed by either 6, 12, 18, or 24 cameras (Montalto et al. 2021;Nascimbeni et al. 2022) .The calculations are carried out for one LOP which will continuously observe the same field for at least two years.In Sect.6, for stars in the P1P2 samples with positive seismic detection, we compute the expected uncertainties of the individual frequencies and deduce the resulting MRA uncertainties using the approach described in Sect. 4.
In this section, we derive the formalism for computing the detection probability of solar-like oscillations, P det , in the photometric power density spectra.It is the probability that such oscillations are detected globally in the power spectrum, not to be confused with the probability to detect and measure the properties of individual oscillation modes.P det is calculated according to Eqs. 26 and 28 of C11.The statistics of the power spectrum is a χ 2 with 2N b degrees of freedom, where N b is the number of independent bins in the envelope band, δν env , of the power spectrum.The probability of having a peak above a given level s, due to noise only, in the binned power spectrum is then given by Eq. 1 of Appourchaux (2004), namely, where S is the mean of the power spectrum and Γ(n) is the Gamma function.
Here we seek the probability for any value s ′ to be larger than a given level s for a binned power spectrum normalised to the noise level, S /N, With the change of variable u ′ = u/(S /N), the probability is given by which is the probability that the normalised power take any value x ′ = (S /N) ′ larger than a given level x = S /N.For x → 0, thatis S → 0, corresponding to no signal in the power spectrum, the above probability is P = 1 as expected.We note that for convenience, C11 considered a slightly different formulation and calculated the probability that any value x ′ = [(S + N)/N] ′ is larger than a given level x = (S + N)/N, so that in absence of signal S , x = 1, an approach which we use in the following.
As in C11, using the above equation, we first calculate a signal threshold S thres such that the probability for any value S ≥ S thres to be due to pure noise is smaller than a predefined value p fa (false alarm probability).In practice, we remain very conservative and choose a very small value for this false alarm probability, p fa = 0.1%.
In a second step, following C11, we now consider that there is some seismic signal S mod in the power spectrum.We obtain the probability of detection of this signal within the false alarm probability p fa as the probability of being above the threshold defined previously (Eq.3), but with a new normalisation by S mod + N, Choosing as before u ′ = u/(S mod + N) as the new variable, we obtain which finally becomes where and (S /N) thres is given by the solution of Eq. ( 3) for P = p fa .We note that Eqs. ( 6)-( 7) are equivalent to Eqs. ( 28) and ( 29) of C11.
In practice, the probability P det can be computed for any particular target of PLATO or any other mission, such as Kepler, as soon as we have determined: (i) the signal-to-noise ratio (S/N) in the power spectrum (S /N) mod and (ii) the number N b of bins to consider in the envelope band of the oscillation modes.The derivation of these two quantities is detailed below.The last technical difficulty is then the computation of the Γ function for the expected very large values of N b , which we perform using the classical asymptotic approximation, which is valid for large values of n.
We consider that there is detection of power excess due to solar-like oscillations when the probability P det > 0.99.Power spectra showing peaks above S thres as defined by Eq. (3) for p fa = 0.1%, but with P det ≤ 0.99 are considered as potentially indicating solar-like oscillations, but with too little confidence to derive any global seismic parameters, let alone properties of individual modes.This is a very conservative position, so that the resulting detection probability can be considered as a lower limit of what can be expected with PLATO.
2.1.1Global S/N in the power spectrum: The global S/N entering Eq. ( 7) is calculated as where P tot is the total power density in the oscillations and N tot the total power density in the noise, both quantities being estimated near ν max , the frequency where the oscillations reach their maximum amplitude.In the following, we detail how these two quantities were calculated.
2.1.2Oscillation power density P tot : The calculation of the oscillation power density P tot (in ppm 2 /µHz) can be performed using a formulation as established by C11 restricted to the calculation of the probability of a global detection of power excess due to solar-like oscillations.The power density (in ppm 2 /µHz) is given by (Eq.19 in C11): where ∆ν is in µHz , V 2 mod ≃ 3.1 is the average visibility calculated for a set of 4 modes ℓ = 0 − 3 (Ballot et al. 2011), and A max (in ppm) is the maximum oscillation amplitude of modes, reached at frequency ν ≃ ν max .The scaling laws for the determination of ∆ν are given in Appendix A. The estimate of A max is discussed in Sect.2.2 and Appendix A.
2.1.3Noise power density N tot : We must also estimate the total noise power density, N tot .It is composed on the one hand of the instrumental noise, N inst , which includes photon noise and all other instrumental contributors to the noise, and on the other hand of the stellar intrinsic noise, N gran , which we assume is dominated by granulation noise at the relevant frequencies.The total noise power density is then given by: Following (Samadi et al. 2019, hereafter S19), the granulation noise power density, N gran , is calculated as a scaling power law of ν max (i.e.Eq. 36 in S19) , as in Kallinger et al. (2014).The instrumental noise, N inst , including photon noise as well as all other sources of noise from the instrument or from the background (satellite jitter, readout noise, digitisation noise, stellar background, zodiacal light, etc), will be a major contributor to the total noise entering the calculation of the (S /N) mod term involved in Eq. ( 7).It depends on which sample of stars is considered: Kepler stars or PLATO targets and will discussed in Sect.2.2 below.

Number of bins in the oscillation mode envelope:
Once P tot and N tot , and thus (S /N) mod are determined, we only need to determine the number of frequency bins N b in the oscillation envelope in order to apply Eqs. ( 3)-( 6)-( 7).This number is given by where T obs is the total time interval of the photometric monitoring, in seconds, δν env is the frequency range over which the oscillations are present in the power spectrum, in µHz, and Int denotes the integer part.
The parameter δν env entering Eq. ( 11) is essential, because with the observing time, it controls the number of degrees of freedom of the χ 2 statistics followed by the power spectrum.

Adopted inputs for the calculation of the detection probability P det
In order to compute the global detection probability P det , we must provide as input the S/N, namely, (S /N) mod , the width of the envelope of the oscillations, δν env , and the observation time, T obs , for each target.The S/N, here, (S /N) mod in Eq. ( 8) involves the power P tot .We estimate P tot at its maximum namely, at ν max leading (S /N) mod at its maximum which we denote hereafter (S /N) max for clarity.We then use Eq. ( 9) assuming the existence of a regular pattern in a power spectrum every ∆ν and not individual modes.

Amplitude at maximum power
The amplitude of maximum power, A max , involved in P tot is obtained in the literature under the form of an empirical relation depending on some combination of the global parameters among stellar mass, stellar radius, effective temperature and/or equivalently the seismic global parameters ν max , and ∆ν.We consider two relations, respectively given by C11 and S19.Both relations are scaled to the solar values.In Appendix A, we show that our adopted recalibrated theoretical values for A max , A max,scal as given in Table 1, are in good agreement with the observed amplitudes at maximum power A max,obs , as derived by Lund et al. (2017) (their Table 3) for the Kepler LEGACY sample.In Table 1, we show how the stellar mass is derived from the scaling relation Eq. ( 32) of Appendix A, relating the mass M to the effective temperature T eff and ν max , namely, For the Kepler samples, ν max values are taken from the measurements by Lund et al. (2017).For the PLATO targets, in Sect. 5 and 6, we will take the stellar radii and effective temperatures from PICv1.1.0to derive ν max .For the solar values, we adopt A max,bol,⊙ = 2.53 ppm (rms value, see Michel et al. 2009), ν max,⊙ = 3090 µHz, ∆ν ⊙ = 135.1 µHz, and T eff,⊙ = 5777 K throughout.

Instrumental noise
As discussed in Sect.2.1, we need to estimate the instrumental noise, N inst , in the power spectrum.The Kepler instrumental noise used by C11 is taken from Gilliland et al. (2010), namely, where N rand , the total random noise per time interval of the data series (in ppm 2 ) is given by with α = 10 5 1.28 10 −0.4 (12−K p) .(15) In the above equations, K p is the Kepler magnitude of the star, dt is the cadence of the photometric series, namely, dt = 58.85seconds for the Kepler short cadence mode.The remaining factor of 2 in Eq. ( 13) accounts for the choice of a single-sided power spectrum in C11, a convention that we adopt for the remainder of the paper.
For the PLATO calculations in Sects.5 and 6, we use the noise level, N inst , that is available for each target in the PLATO catalogue, taking into account all instrumental sources of noise according to the most up-to-date understanding of the instrument (Rauer et al. (2023) and Sect.5).

Width of the envelope of the oscillations, δν env
The parameter δν env can be measured by the width of the assumed Gaussian-like shape envelope of the oscillations in the power spectrum.Several formulations have been suggested for δν env in the literature.They generally take the form of a scaling relation of the type δν env = aν b max .The coefficients a and b are obtained by fitting the Kepler data.Their values differ according to whether one considers MS stars of spectral type G, K or hotter stars of spectral type F or subgiants, or red giants (Kim & Chang (2021) and references therein).For instance, Kim & Chang (2021) found slightly different values depending on the formulation assumed for the granulation noise background.We find that these different relationships remain within the upper and lower limits δν env = ν max and δν env = ν max /2, respectively.Specificlly at low ν max (i.e. for more evolved stars), the curves are close to ν max /2, while the curves approach ν max at higher ν max (i.e. for younger stars).In the following, we therefore consider both cases δν env = ν max and δν env = ν max /2 , but keep the conservative case δν env = ν max /2 for our baseline and estimate the changes in the detection predictions when using δν env = ν max .

Main sequence versus subgiant stars
For purpose of discussion presented later on in this work, we distinguish the cases of main sequence stars (MS stars) and subgiants.From a stellar evolutionary point of view, the subgiant phase starts when there is not enough hydrogen left at the centre to produce nuclear energy and kinetic pressure to sustain gravity.We use the central hydrogen mass fraction, X c , to define a threshold.The MS stars are then defined with X c ≥ 10 −6 .According to our stellar models for the range of mass of interest here and the adopted solar chemical composition, the subgiants satisfy The transition between MS tars and subgiants is located in a HR diagram in Fig. 11 in Appendix C. Mass subsamples: In the present work, we consider that only MS stars with M ≤ 1.6 M ⊙ can show solar-like oscillations whereas subgiant being evolved and therefore cooler, can still oscillate with solar-like oscillations while being more massive.We therefore study more carefully the sample of MS stars with seismic masses below 1.6 M ⊙ while no mass restriction is made for the subgiants.The mass threshold corresponds approximatively to the transition between stars with no convective core like the Sun and stars with a convective core in the MS.We also draw a specific attention to the subsample of main-sequence (MS) stars with seismic masses M ≤ 1.2 M ⊙ .The reason is that the stellar requirements of the PLATO mission are established for a star like the Sun (in mass and age).
3 Validation of the calculation of the detection probability with Kepler stars The formalism described in Sect. 2 must be validated before being applied to the stars of the PLATO Input catalogue.Here we use several samples of stars observed by Kepler, in order to verify the performance of this formalism in predicting detectability of solar-like oscillations.In other words, we checked with which confidence level Eqs. ( 3)-( 6)-( 7), with the above prescriptions for P tot , N tot and N b , can predict whether or not solar-like oscillations can be detected.For that purpose, we use two types of Kepler stars, those for which such oscillations were or were not detected.The confidence level will be measured in terms of false positive and false negative predicted detections.
3.1 Kepler data sets for calibration and validation of theoretical calculations In order to build a final sample of stars with detected oscillations with all the necessary parameters available, we considered the set of 413 stars common to M22 (updated ν max and ∆ν) and S17 (observing intervals and grid-based inferred stellar mass and radius).In the following we use the updated ν max and ∆ν from M22.In Appendix B, we look at the impact of choosing the values of ν max , ∆ν, and T eff from S17 instead of M22.

Sample 2: large sample of stars with no oscillations detected by Kepler
As a second sample, we consider the list of 990 Kepler shortcadence main-sequence solar-like stars for which analyses revealed no detected oscillations, as published by (Mathur et al. 2019, hereafter M19).For each star in sample 2, the value of ν max is computed according to where the surface gravity g = GM/R 2 and T eff are taken from M19.1 .Stellar masses are obtained from the seismic scaling law Eq.( 12).This provides δν env .The observation time, T obs , is taken from KASOC (Kepler Asteroseismic Science Operations Center provides asteroseismological data).
Here again, we focused on stars with masses M < 1.6 M ⊙ .The resulting set of 833 stars constitutes our final sample of Kepler non-oscillating stars (sample 2).

Kepler LEGACY sample
Finally, we need to validate and calibrate the calculation of the oscillation maximum amplitude A max , as detailed in Appendix A, as well as Libbrecht (1992)'s relation between individual mode frequencies, linewidths, and S/Ns (see Appendix D), We then used the Kepler Legacy sample, which is composed of 66 main-sequence stars with the highest quality of seismic data (Lund et al. 2017) (in the following).For those stars, individual modes are identified.Indeed solar-like oscillation modes can be described by spherical harmonics with spherical degree ℓ and azimuthal order m for their surface geometry and by the radial order n labelling the overtones of a given ℓ, m mode.When rotation is not taken into account or cannot be detected seismically, the modes are m-degenerate and the mode frequencies do not depend on m.This is the case here, so for each individual mode ℓ, n, the frequency, amplitude and line width are measured with the highest precision.For those stars, the observed values of ν max are taken from L17.

Results of the validation: Performance of the detection probability approach
Our approach for calculating the probability to globally detect solar-like oscillations was tested against the above Kepler samples.Using the formalism described in Sect.2.1 and the various needed inputs as explained in Sect.2.2, we assessed on the one hand the fraction of Kepler targets with detected oscillations for which we predict no detection (false negatives) and on the other hand the fraction of Kepler targets with no detected oscillations for which we predict detection (false positives).
In prevision of the investigation for the PLATO case, we made as our baseline the conservative choices of a positive detection when P det > 0.99 and δν env = ν max /2.As summarized in Table 17 in Appendix B, considering the total population of 1349 Kepler stars (MS stars with masses M < 1.6M ⊙ and subgiants of all masses, hereafter R 1 sample) with both predicted false seismic positive (186 stars) and negative (40 stars) detections of oscillations in the baseline conditions leads to an underestimate of the number of real detections (false negative) by ∼ 3% for the PLATO samples.On the other hand, we can see in Table 17 that one overestimates the number of real detection (false positive) by 14%.If one considers only MS stars with masses M ≤ 1.2M ⊙ , we overestimate the number of real detection by 7%.These tests using Kepler results confirm that our approach is valid within the quoted uncertainties and will be used for the PLATO targets in Sect.5.The detailed results of the calculations of the above results, as well as justifications of the choices made for defining our baseline, are given in Appendix B.
We must stress here that the high percentage (14%) of false positive detection for the R 1 sample is mostly due to subgiant stars with masses larger than 1.6 M ⊙ .If we consider a subsample of stars including both MS stars and subgiants with masses less than 1.6M ⊙ , the percentage of false positive detection of 14% decreases to ∼ 9%.
False positive detections can be due to actual amplitudes being lower than predicted.Several reasons have been put forward to explain lower-than-expected amplitudes for Kepler stars, including significant magnetic activity (Chaplin et al. (2011a); M19) and low metallicity (Samadi et al. (2010); M19).This likely depends on the properties of the stars themselves (mass, luminosity, temperature, rotation, magnetism, etc.).M19 provided the iron-to-hydrogen mass fraction [Fe/H] and the photometric proxy for magnetic activity S ph (Garcia et al. 2010;Mathur 2014;Santos et al. 2023) which measures the amplitude of the spot modulation in the light curves and must be considered as a lower limit of the stellar activity (Salabert et al. 2016(Salabert et al. , 2017)).M19 then found that the probability of non detection of solartype oscillations is ∼ 98.7% when S ph > 2000 ppm (for reference, M19 gave S ph,min = 67.4ppm and S ph,max = 314.5 ppm at the minimum and maximum of activity for the Sun and that the solar oscillation amplitudes decrease by 12.5 % from minimum to maximum of activity).Concerning the impact of metallicity, the magnitude of the amplitude decrease due to a low metallicity remains uncertain and solar-like oscillations have been detected for some metal poor stars.So there is no clear one-toone correspondence between metallicity and non detection of solar-like oscillation.Nevertheless it is still of interest to look at the 146 stars with false positive detection that have values for S ph and [Fe/H] in the M19 sample.This subsample includes 100 subgiants among which 72 with masses larger than 1.6 M ⊙ .We then note that none of those 72 subgiants have high stellar activity and only 38 of them are metal poor compared to the Sun ([Fe/H]< −0.1).We also considered the whole sample of 146 stars with false positive detection and available values for S ph and [Fe/H] and found 82 stars with a high activity level (S ph > 2000) or [Fe/H] < −0.1.Removing those 82 stars from the original sample of 186 stars with false positive detection leaves 104 stars.Using 104 stars instead of the original 186 stars with false positive detections, we find that the percentage of false positive for the R 1 sample drops to ∼ 7.7%.
In any case, taking into account stellar activity and metallicity, and/or additional specific properties of the stars to explain the whole sample of false detections would deserve further investigation but is out of scope of the present paper.Since such detailed information are not yet available for the PIC 1.1.0stars we will therefore keep a conservative value of 14% for the false positive uncertainty.Accordingly we later give the values of X together with their uncertainties under the form X +3% −14% for the PLATO subsample of MS stars with masses M < 1.6M ⊙ and subgiants of all masses.However, as an optimist remark, let us stress that stars in the M19 sample were observed over one month only.As the S/N increases over time, we expect to reach smaller amplitudes, everything else equal, with the PLATO mission and therefore a smaller percentage of false positive detection due to too small oscillation amplitudes.

Uncertainties on stellar properties in cases of individual frequency measurements
For estimating the seismic MRA inference performances, we go on to consider the case when the mode frequencies can be measured individually.We derived empirical relations giving the mass and radius relative uncertainties as a function of the uncertainty δν ℓ=1,max of the ℓ = 1 mode closest to ν max , the frequency at maximum power (those modes have the smallest uncertainties).For that purpose, we used the stellar evolution code CES-TAM (Morel & Lebreton 2008;Marques et al. 2013) to build a set of stellar models of MS stars with masses of M ≤ 1.2M ⊙ , for which we numerically computed the individual frequencies using the ADIPLS code (Christensen-Dalsgaard 2008).The frequency uncertainties are obtained from a rescaling of the frequency uncertainties derived for a 'degraded Sun' (Lund et al. 2017).We then used the above frequency set for each synthetic star in the inference code AIMS (Rendle et al. 2019;Lund & Reese 2018) updated for the present purpose by one of the coauthors (D.Reese) in order to infer the MRA and their statistical uncertainties.It is known that non seismic constraints play only a minor role when the inference includes a large number (e.g. a few dozen) of highly precise individual frequencies.We nevertheless include uncertainties for non-seismic constraints: a generic 70 K as an uncertainty for the effective temperature and 0.05 dex for the metallicity expected from individual spectroscopic study (e.g see the PASTEL catalogue, Soubiran et al. ( 2022)).For Sun-like stars, differential studies with respect to the Sun are even more precise and accurate (Morel et al. 2021).
On purpose, we did not introduce any systematic errors, so that we can estimate the seismic performances specifically due to the quality of the data.We then established a correlation between the MRA uncertainties and the frequency uncertainty δν ℓ=1,max which was then fitted.We found the following fitted relations (see Eq. ( 18) in Appendix C): where δν ℓ=1,max is in µHz.All details of these calculations are presented in Appendix C. The MRA relative uncertainties (Eq.18) can then be seen as a lower limit of what can be achieved given the observational constraints (which depend on the observing conditions).Realistic uncertainties require to ad d systematic errors to obtain the final error budget.This is out of scope of the present work (but see the discussion in Sect.7).
For the PLATO targets, we determine the (theoretical) uncertainties on the individual frequencies using the Libbrecht (1992) formula (Eq.45 in Appendix D), σ Libb , which depends on the S/N for that particular mode and on the duration of the observation and has proven to yield the right order of magnitude.The S/N in such a case is given by the the power per resolved mode -instead of the global power density in the oscillation envelope as before-over the background noise.The power per resolved mode is related to the height of the mode.Accordingly, the power density per resolved mode is derived from (S19) and Lochard (2003) for a single-sided spectrum (see also Appourchaux 2004): where Γ max is the mode linewidth at ν ≈ ν max , expressed in µHz.
For ℓ = 1 modes, the square visibility is V 1 = 1.5 (Ballot et al. 2011).The computation of A max is described in Appendix A (see in particular Table 12).Estimates of the mode linewidths are obtained by a fit of measurements in Lund et al. (2017) as a function of effective temperature (see Appendix D, and in particular Table 19 for details).The Libbrecht (1992) formula predicts the uncertainty for a single, isolated mode peak such as the ℓ = 0 modes.For higher ℓ degree modes, one should take into account the fact that they are comprised of multiple components that might not be resolved; and moreover, the exact appearance of the non-radial modes will depend on the angle of inclination presented by the star, currently unknown for the PLATO targets.We therefore rather adopt an empirical approach using real data from the Kepler mission: we establish in Appendix D a relation between the theoretical Libbrecht uncertainty of a ℓ = 1 mode at ν max , σ Libb,ℓ=1,max , and the measured frequency uncertainty δν l=1,max for the same mode for stars of the Kepler Legacy sample.As found in Appendix D (Fig. 16), the ratio δν ℓ=1,max /σ Libb,ℓ=1,max tends to decrease with the effective temperature of the star and the decrease is significant over the effective temperature interval found for the PLATO targets.The fit of the ratio δν ℓ=1,max /σ Libb,ℓ=1,max as a function of effective temperature for the stars of Kepler LEGACY sample is shown in Appendix D (see Eq. 47) and yields valid for 5000 < T eff ≤ 6200 K.This roughly corresponds to the effective temperature range of the PLATO targets for which we will compute those uncertainties later on.For each PLATO target, we will compute σ Libb,ℓ=1,max then derive δν l=1,max using Eq.20, before using Eq. ( 18) for the stellar mass and radius uncertainties.

Expected solar-like oscillations with PLATO
In this section, we estimate the number of stars for which solarlike oscillations are expected to be detected with the PLATO mission.For this purpose and as already mentioned, we took for each PLATO target star the stellar radius and effective temperature from PICv1.1.0(Montalto et al. 2021;Nascimbeni et al. 2022).The luminosity is then derived as: We consider, on one hand, the P1P2 sample and, on the other, the P5 sample, both for one LOP.We removed the hot stars that appear in the instability strip using the criterion from C11 log T eff > log(8907) − 0.093 log(L/L ⊙ ). ( This eliminated only a few stars because the criterion on the temperature of the hot side adopted to construct the PIC is much more severe.We also removed the early red giants from the PIC sample, based on their location in a theoretical Hertzsprung-Russell (HR) diagram.These stars are included in a specific scientific validation sub-catalogue of the PIC (Aerts et al. 2023).
PLATO performances for such evolved stars were assessed by Miglio et al. (2017).These stars are located at the base of the red giant branch in a HR diagram which we use as an empirical criterion to remove them.We define the criterion in terms of luminosity and effective temperature by computing and plotting a set of evolutionary tracks with different masses and locating the onset of the red giant branch in the HR diagram.This leads us to remove stars when they satisfy: After the removal of hot stars and evolved ones, we are left with a set of 7,009 stars in our P1P2 sample and 130 140 stars in our P5 sample for one LOP.The stellar mass was derived from the seismic scaling relation Eq. ( 12) where ν max is evaluated here according to Eq. ( 34): Appendix A offers more details.The mass is used only to consider various subsamples of stars when analysing the results of the calculations.MS stars with masses larger than 1.6 M ⊙ are too hot and therefore unlikely to show solar-like oscillations, except perhaps for the stars with high metallicity.In absence of information about metallicity at the present time, hereafter we exclude MS stars with predicted seismic masses M > 1.6M ⊙ .
The detection probability, P det , is obtained using Eq. ( 6) which involves the (S/N) max (Eq.8), the observing time T obs and the width of Gaussian-like envelope of the oscillation power spectrum δν env .The amplitudes, A max , used to compute (S /N) max are taken according to Table 1.In the calculation of N tot (Eq.10), we used for N inst the PLATO (random and systematic residuals) noise level included in the PIC1.1.0, N PIC , for EOL conditions, which was then converted in ppm 2 /µHz.For the BOL conditions, we used the data provided by one of the co-authors (Börner et al. 2023) .Because the convention in our calculation is a single-sided spectrum as for the power density, we take N inst = 2N PIC .We then added the single-sided stellar granulation background noise as used in S19 (see Sect. 2).
To remain conservative, we kept only those stars for which the probability of the signal be due to noise is 0.1% or less and of those stars we kept only stars for which the probability of the signal being due to solar-like oscillation is larger than 99 %.

Expected solar-like oscillations within the PLATO P1P2 sample
The predicted numbers of P1P2 target stars with positive oscillation detection obtained in different mass subsamples are collected in Tables 2 and 3. When we apply the (1 LOP, BOL, δν env = ν max /2) conditions after two years of observation, we expect to detect solar-like oscillations for at least 5839 stars (of which 2732 MS stars with M ≤ 1.6M ⊙ and 3107 subgiants.). Figure 1 show the distributions of those stars with expected detected solar-like oscillations as a function of log T eff , stellar mass and radius.One expects to detect solar-like oscillations in a subsample of 1245 stars with M/M ⊙ ≤ 1.2 after two years of observation in BOL conditions (Fig. 2).When restricted to MS stars with masses ≤ 1.2M ⊙ , the subsample counts 1016 stars.Those stars are small and therefore of the utmost interest for detecting small planets.On the stellar side, more massive stars are likely prone to large systematic uncertainties because they developed a convective core, the exact extent of which is unknown.Stars with masses M ∼ 1.2M ⊙ might also develop a convective core but it is small enough that convective overshoot does not contribute significantly to the total error budget on the age.
• Impact of BOL/EOL conditions: assuming EOL conditions instead of BOL ones, we would lose a few tens of percent of stars, mostly at small mass and radius (Table 2).
• Impact of T obs : as can be expected and be seen in Table 3, the observing time plays an important role.The increase of   the number of stars with predicted positive seismic detection is a factor ∼ 5-7 greater when increasing the observing time from 30 days to two years.After 30 days, one expects 1877 stars (among which 1596 subgiants), number which increases up to 5858 stars (among which 3107 subgiants) after two years of observation.Not surprisingly, stars for which we might not typically detect solar-like oscillations for too short an observing time are low-mass, MS stars because their oscillation amplitudes are too small.It is also interesting to consider the distributions of stars with positive seismic detection of solar-like oscillations with their magnitudes.Solar oscillations for stars with magnitude > 10.5 will be detected only after about one year of observations.• Impact of uncertainties in the probability calculations: the uncertainties on the number of stars with positive seismic detection due to the uncertainty on δν env (see Sect.2.5) can be estimated from Table 3. Denoting by D the number of stars with positive seismic detection, the impact of δν env uncertainty can be estimated as (D ν max /2 − D ν max )/7009, where 7009 is the total number of stars in the initial sample.This yields ∼ 4% and 7.5% when considering, respectively the sample of stars with masses ≤ 1.2M ⊙ and the sample of all mass stars with positive seismic detection over 730 days in BOL conditions .
We obtain an order of magnitude of the uncertainties on the number of positive detections, say X, by considering the underestimate due to false negative and the overestimate due to false positive detections.We then used the false negative and positive detection rates derived for our samples 1 and 2 of Kepler stars in the option δν env = ν max /2 (second column of Table 17 in Appendix B).This yields X +3% −14% for the sample of MS stars with M < 1.6M ⊙ and subgiants and X +3% −7% for the sample of stars with M ≤ 1.2M ⊙ when δν env = ν max /2.Such uncertainties are provided in Table 4.In percentages, Table 4 indicates a predicted seismic positive detection rate for stars with M ≤ 1.2M ⊙ between ∼ 55-61% (resp.65-74%) of the whole sample of 2099 stars with masses ≤ 1.2M ⊙ in BOL conditions after two years of observation taking into account uncertainties due to ν max /2 (resp.δν env = ν max ).In the same conditions but for the sample of MS stars with masses M < 1.6M ⊙ and subgiants with all masses, the same uncertainties yield predicted seismic positive detections at the level of 71-86% (resp.76-100%) of the whole sample of 7009 stars in the initial sample taking into account uncertainties due to ν max /2 (resp.δν env = ν max ) We note that uncertainties in the probability calculation and the number of stars with expected solar-like oscillation detection can also come from the fact that we used PIC1.1.0radius and effective temperature to compute the global seismic parameters and derive the seismic mass.

Expected solar-like oscillations within PLATO P5 sample
We carried out the same probability calculation as for the P1P2 sample after removing the same types of stars and assuming again a positive detection for P det > 0.99 (1 LOP, BOL, δν env = ν max /2) conditions.We found that the number of expected postive seismic detections amounts to 9 486 for the sample of MS stars with M/M ⊙ < 1.6 and subgiants after two years of observation (Table 5).Here again, we observe a drastic increase in positive seismic detections with the observing time.This is illustrated in Fig. 3.This figure shows the distributions in T eff , mass, and radius of P5 MS stars with masses M < 1.6M ⊙ and subgiants with all masses with expected positive detections for different observing durations.The number of stars significantly increases when T obs increases beyond 1 year, in particular toward stars with smaller radii.The subgiants outnumber significantly the MS stars for the whole sample of stars with M < 1.6M ⊙ (Table 5).As expected, after only 90 days of observation, only 81 stars with masses M ≤ 1.2M ⊙ have a seismic positive detection, all subgiants because their amplitudes (roughly ∝ L/M) are the highest.
Figure 4 compares the distributions of the PLATO noise level, N PIC , taken from the PIC1.1.0between BOL and EOL conditions for stars with expected positive detection of solar-like oscillations in the case of (1 LOP, δν env = ν max /2) for 730 days of observation.By construction, following the PLATO ESA requirements, the stars belonging to the P1P2 sample have N PIC ≤ 50 ppm h 1/2 whereas the stars with higher noise levels constitute the P5 sample.This was based on the EOL conditions.Assuming that the more optimistic BOL conditions hold, we find that 115 MS stars with mass M ≤ 1.2M ⊙ in the P5 sample have N PIC ≤ 50 ppm.h 1/2 and could be reclassified as P1 stars, increasing the number of P1P2 stars from 1016 (Table 2) to 1131 -that-is an increase of the positive detection rate from 15% to above 17%.

PLATO seismic performances for MRA inferences in the P1P2 sample
For the subset P1P2 stars with expected solar-like oscillations, the detection and highly precise measurement of individual frequencies for a significant number of modes is ensured by the selection of a high S/N by construction.This will allow us to satisfy the requirements that must be achieved by the PLATO mission (PLATO Science Requirements Document PTO-EST-SCI-RS-0150, ESA document, June 2021) which are: a mass uncertainty better than 15%, a radius uncertainty lower than 2% and an age uncertainty as low as 10% for a star like the Sun or the PLATO reference star defined as 1M ⊙ , 1R ⊙ , and T eff = 6000K.

PLATO seismic performances for oscillation frequencies for the P1P2 sample
For the PLATO targets, assuming individual frequencies are available, we can only determine the (theoretical) uncertainty on the frequencies using the Libbrecht (1992) formula (Eq.45), σ Libb,ℓ=1,max .We then use Eq. ( 20) to relate δν ℓ=1,max to σ ℓ=1,max for each target.We computed σ Libb,ℓ=1,max then δν ℓ=1,max for each target of our P1P2 sample of stars with expected positive seismic detection in BOL condition for 730 days and 90 days.We also included the case where we take into account the scatter in the relation Eq. ( 20) and the measurement uncertainties in the mode linewidths derived from the Kepler stars which leads to multiplying by √ 3 the Libbrecht's uncertainties σ Libb,ℓ=1,max (Appendix D).The corresponding distributions of frequency uncertainties δν ℓ=1,max are shown in Fig. 5.In the case (BOL, 730 days, δν env = ν max /2) conditions, the bulk of uncertainties are concentrated below 0.1 µHz.Including the scatter in the relation Eq. ( 20) shifts up the maxima of the distributions by about 0.07 µHz.The uncertainties remain below the PLATO requirement of frequency uncertainties 0.3-0.5 µHz.In case of 3 months observations, the shift is higher, about 0.1-0.13µHz and the bulk of uncertainties reach 0.2-0.3µHz.When the mode linewidth are increased by a factor  shifted with the bulk of uncertainties concentrating around 0.1 µHz.We also computed the frequency uncertainties in EOL conditions for 730 days but the associated degradation of the signal has only a small impact and is not shown.

PLATO seismic performances for stellar MRA inferences for the P1P2 sample
We now turn to the MRA uncertainties from seismic inferences resulting from the error propagation due to δν ℓ=1,max .We focused on stars with masses M ≤ 1.2M ⊙ .We used Eq. ( 18) to estimate the mass and radius uncertainties as discussed in Sect. 4. We also used the constraint δν ℓ=1,max as a proxy for the constraint on the age uncertainty.The condition δM/M ≤ 3% (often used to obtain an age uncertainty at the level of 10%) can also be added as an additional constraint.
We derived the numbers of P1P2 stars with a positive seismic detection while adding successive constraints on the uncertainties giving rise to three cases as follows: We consider the frequency uncertainties δν ℓ=1,max as given by Eq. ( 20) or Eq. ( 47) without (case a) and with (case b) a +0.5 shift due to the scatter in the fitted relation between δν ℓ=1,max and σ Libb,ℓ=1,max (Appendix D).The results are listed in Table 6.
Figure 6 shows the distributions of MS stars with masses M ≤ 1.2M ⊙ corresponding to the cases listed in Table 6 as   Figure 7 shows the histogram of the evolution of the number of these stars when the observation time increases from 90 days to 730 days, assuming (1 LOP, BOL) conditions for MS stars with M/M ⊙ ≤ 1.2, δM/M ≤ 15%; δR/R ≤ 2%, and δν ℓ=1,max ≤ 0.2µHz.The gain of stars satisfying the PLATO requirements is particularly significant for stars like the reference star when going from 90 days to a year.We note that no such star is found when observing over a short period of time of 30 days.
Taking into account uncertainties in frequencies and mode linewidths Γ values: The above numbers of stars remain unchanged when including ∼ 5-σ uncertainties due to scatter in Eq. ( 18) (Fig. 14 in Appendix C): δM/M(1 ± 0.5%); δR/R(1 ± 0.1%).In contrast, in the extreme case where one takes into account a scatter of +0.5 in the δν ℓ=1,max /σ Libb,ℓ=1,max fitted relation for all stars, the number of MS stars with M ≤ 1.2M ⊙ satisfying the PLATO requirement decreases by about 25%.
The impact of degrading the frequency uncertainty δν ℓ=1,max by √ 3 for stars with T eff ≤ 5650K (due to uncertainties in the values the mode linewidths, as per Appendix D), instead of the original values of δν ℓ=1,max -leads to a decrease of the number of stars by 38% in case IIb with (BOL, 730d, δν env = ν max /2) conditions.
We then conclude that even in the conservative case we considered for the detection probability and in the above worst cases for the frequency uncertainties, the PLATO mission should yield a set of MS stars with a seismic characterisation between ∼ 80 and 100 times the Kepler Legacy sample assuming two observ-4800 5000 5200 5400 5600 5800 6000 6200 6400  6.
ing fields (i.e.doubling the number of stars obtained for 1 LOP) and depending on whether we assume δν env = ν max /2 or ν max .

Summary and discussion
The present study is part of the scientific preparation for the ESA's PLATO mission, which will be launched towards the end of 2026.In this work, we calculate the theoretical probability of detecting solar-like oscillations for stars belonging to the PLATO Input Catalogue (PIC).More specifically, we considered bright stars (magnitude of 11 and brighter) of the FGK spectral type on the main sequence (masses lower than 1.6 M ⊙ ) and the subgiant branch.The calculation takes into account the estimated noise level for each individual star, provided by the PIC.
Our results indicate that the proportion of stars with positive detections of solar-like oscillations lies within a range of 70% to 100% for a continuous observation of two years.The lower (upper) value of this range comes from the assumed narrow (wide) bandwidth of the oscillation spectrum in the Fourier domain for 4800 5000 5200 5400 5600 5800 6000 6200 6400 T eff (K)  6.
each star, which is the main uncertainty in the calculation.It also depends on the beginning-of-life or end-of-life conditions of the PLATO instrument.The CoRoT and Kepler missions have taught us that individual oscillation modes can be detected for stars with a noise level of 50 ppm in one hour or less.For the stars in our sample that satisfy this criterion and with positive seismic detection we have estimated the uncertainty in the individual frequency measurements at the maximum of the power spectral density for each star based on the results of the Kepler mission.This enabled us to assess the propagation of this statistical error on the seismic inference of the mass, radius, and age of each star.We found that ∼ 47-61% of the sample of MS stars with masses M ≤ 1.2M ⊙ with statistical uncertainties below the PLATO requirements of 15%, 2% for the stellar masses and radii, respectively and satisfying oscillation frequency uncertainty ≤ 0.2µHz at maximum power density amplitude as a proxy for 10% uncertainty of the age of a Sun-like star.Those uncertainties are small enough that they leave margins for including systematic errors while still keeping the total error budget satisfying the PLATO requirements.We note that the masses used to define various mass samples, especially the sample represented by M ≤ 1.2M ⊙ , are seismic masses derived from scaling relations.As such, they are approximated as are the number of targets found in each mass subsample but the order of magnitude remains correct.
We also stress that for a few stars, it may be expected that additional errors can come from unexpected issues in the data acquisition or in the variable behavior of the star (such as magnetic activity, Pérez Hernández et al. 2019;Karoff et al. 2019;Thomas et al. 2021;Santos et al. 2023) that can alter the measurements of the frequencies and therefore the MRA uncertainties.From the Kepler experience, this could add an uncertainty up to 0.3 µHz to the statistical uncertainties but it is difficult at this stage to foresee for which stars in the PIC this can happen and this was ignored here.
For the P5 sample, the noise level is higher-again by construction-than for the P1P2 sample in the same (EOL or BOL) conditions.We find a percentage of 7.3-4.3% P5 stars with expected positive seismic detection after 730 days of observation in BOL and EOL conditions, respectively.Among those, a percentage of 0.5-0.2% of P5 MS stars with masses M ≤ 1.2M ⊙ are expected to show positive seismic detections.Among them, 115 P5 MS stars with masses M ≤ 1.2M ⊙ and with a PIC noise level lower than 50 ppm .h 1/2 in BOL conditions satisfy the above PLATO requirements, which means that those stars could be re-classified as P1 stars.
Accordingly, and as a whole, the calculations yield a total of 1131 MS stars with masses M ≤ 1.2M ⊙ for which one expects a positive seismic detection and seismic analyses providing mass, radius, and age satisfying the above PLATO requirements in BOL consitions after two years of observation for one LOP.The stars of this sample are plotted in a HR diagram in Fig. 8.For each target, the luminosity is derived with Eq. ( 21) and the stellar radius and the effective temperature, and their uncertainties taken for the PICv1.1.0.Uncertainties are plotted for three stars as representative of the typical PIC uncertainties in the HR locations.Overplotted over the PLATO targets locations, evolutionary tracks of stellar models cover the mass range of [0.8 − 1.2]M ⊙ .The stellar models were built with the CESTAM code (Morel & Lebreton 2008;Marques et al. 2013) assuming the solar relative chemical abundances AGSS09 (Asplund et al. 2009) with different initial values for X 0 , Y 0 , Z 0 (namely, the mass fractions of hydrogen X 0 , helium Y 0 , and metallicity representing all heavier chemical elements collectively counted as Z 0 ).The convection is described with the classical MLT formulation (Cox & Giuli 1968) involving the mixing length parameter, α MLT (a free parameter representing the efficient of convective transport in 1D stellar models).Otherwise, the input physical assumptions are similar to those of the reference model A described in Lebreton & Goupil (2014).The evolutionary tracks in Fig. 8 were computed until an age of 14 Gyr (on purpose greater than the age of the Universe) for the lowest mass stellar models or stopped at an arbitrary phase of the red giant branch for the most massive ones.Hence assuming one single chemical composition and α MLT values-usually taken as for the Sun-would clearly not reproduce the whole extended region in the HR occupied by the PLATO targets with the lowest masses.This remains true even taking into account the observational uncertainties in luminosity and effective temperature and the fact that several stars might belong to binary systems.
The anticipated PLATO sample of ∼ 2793 well characterized main sequence stars with masses of M < 1.6M ⊙ in one LOP will contribute to the PLATO set of best seismically characterized stars and is roughly 42 times larger than the Kepler Legacy sample and will complement the latter in providing tight constraints on stellar modelling.All these results will be reviewed and revised when confronted in 2.5 years from now with the real PLATO data after launch.
In the present work, we have purposefully estimated only the statistical uncertainties on the frequencies and resulting MRA seismic inferences in order to appreciate the added value due to the expected high quality of the PLATO photometric data.The total error budget, however, must include the uncertainties due to the various systematic errors that can be identified but not fully corrected.Of all stellar properties, the stellar age is by far the most challenging to determine accurately.In most cases indeed, stellar ages of single, field stars can only be determined through stellar modelling (Soderblom 2010 Thanks to the high quality of data acquired by space missions such as CoRoT (Baglin et al. 2009) and Kepler, and currently TESS (Ricker et al. 2015), theoretical studies were initiated in order to identify and quantify the impact of the main systematic errors that contribute the most to the error budget.Studies have been carried out as pure theoretical investigations and hare and hounds exercises (e.g.Appourchaux et al. 2006;Lebreton et al. 2014a;Reese et al. 2016;Cunha et al. 2021) or by modelling specific or sets of stars with the best seismic observations such as the small set of CoRoT stars or the larger set of Kepler LEGACY stars (e.g.Appourchaux et al. 2008;Benomar et al. 2010;Metcalfe et al. 2012;Lebreton & Goupil 2014;Silva Aguirre et al. 2015, 2017;Creevey et al. 2017;Bellinger et al. 2017;Nsamba et al. 2018;Valle et al. 2020;Ong et al. 2021;Farnir et al. 2020;Bétrisey et al. 2022Bétrisey et al. , 2023, among others), among others).At present, due to lack of accuracy, the age uncertainties for solarlike filed stars can increase roughly by ∼ 5 to 25% depending on poorly modelled physical processes and unknown initial chemical composition and to ∼ 15%-50% for a more massive star with a convective core.
We consider below two illustrative cases : the degraded Sun and the two best studied stars of the Legacy sample.

The 'degraded' Sun
A natural test of the accuracy of seismic modelling and characterisation is to look at the Sun-as-a-star and compare the results of the seismic MRA inferences to the independently known values of the mass, radius, and age of the Sun.This is now done routinely when inferring the MRA for various sets of Kepler stars.We therefore carried out several MRA inferences using the data of the so-called 'degraded Sun' of Lund et al. (2017).The frequencies of the 'degraded' Sun and their uncertainties were built to match the quality of the Kepler Legacy sample (δν ℓ=1,max ∼ 0.15µHz with oscillation modes in the range ℓ = 0, n = 16 − 27; ℓ = 1, n = 15 − 27; ℓ = 2, n = 16 − 24; ℓ = 3, n = 20 , (Lund et al. 2017)).Here again we used a gridbased approach (GBM) with the AIMS code.The observational constraints besides the oscillation frequencies and their uncertainties were taken as: T eff = 5777 ± 77K, [Fe/H] = 0 ± 0.1, ν max = 3090 ± 3.9µHz.We compare the results obtained when using two different grids of stellar models.The (Cunha et al. 2021, hereafter C21) for which we recall that the stellar models were computed with MESA evolutionary code (Paxton et al. 2018, and references therein) and the frequencies were computed using with the oscillation code ADIPLS (Christensen-Dalsgaard 2008).The second grid of stellar models (hereafter Mo23) was computed by one of the co-authors (namely, N. Moedas, for more details see Moedas et al., in prep.) using also the MESA code and the frequencies were computed using the GYRE oscillation code (Townsend & Teitler 2013).The input physics, values of free parameters and the reference solar relative chemical abundances differ between both grids.This allows to assess the impact of the main uncertainties in modelling solar-like stars.Intended to be applied to real stars unlike C21, Mo23 used a more updated stellar physics (similar to models D1 in Moedas et al. (2022), namely, it included atomic diffusion of chemical elements that helioseismology taught us is crucial for the modelling of solar-like stars.The reference solar abundances are AGSS09 (Asplund et al. 2009) in Mo23 whereas it is GS98 (Grevesse & Sauval 1998) in C21.For sake of simplicity, Mo23 kept the mixing length value of α MLT fixed to the calibrated solar value whereas C21 let the convection parameter be adjusted in the fitting process.Because here we dealt with real stars, we had to include surface-effect corrections and adopted the Ball & Gizon (2014b)'s correction in the AIMS inferences with both grids.
Following Reese et al. (2016) and C21, in the particular case of the Sun, we can measure the biases (or departure from accuracy) with d X,rel ≡ (X fit − X true )/X true for X = M, R, A. We wish then to compare these values to the relative statistical uncertainties δ X ≡ |δX|/X fit (%) (where |δX| corresponds to one standard deviation) provided by the MRA inferences with the AIMS' code.Finally it is also informative to estimate how large are the departures from accuracy compared to statistical uncertainties derived from the GBM approach d X,norm ≡ |(X fit −X true )|/δX since we will have only access to the last ones in most PLATO stars.The departure between the seismically inferred values for the solar MRA and the known solar values (taken here as 4.6 ± 0.4 Gyr for the solar age (Houdek & Gough 2007) as measured by d X,rel are given in Table 7 for the two grids.As expected the accuracy is much higher in the case of the Mo23 grid than for the C21 grid mainly because the C21 grid does not included atomic diffusion.These figures are similar to the values derived in other works which all adopt various different input assumptions and which fall in the range (∼ 0.3 − 4%) for the mass, (∼ 0.1 − 2%) for the radius, and for the age ∼ 2 − 9% when atomic diffusion is included and ∼ 15 − 16% when atomic diffusion is not included (e.g.Silva Aguirre et al. 2017;Creevey et al. 2017;Rendle et al. 2019;Jiang & Gizon 2021;Aguirre Børsen-Koch et al. 2022;Metcalfe et al. 2023).
On the other hand, the 1σ uncertainties given by the inference calculations as measured by δ X are comparable between the two grids (in the typical range 0.5-4% and 3-8% for the mass and age), relatively independently of the accuracy.Accordingly the departures from accuracy as measured in terms of δX uncertainties (d X,norm , last column of Table 7) significantly differ between the two calculations and between the fitted parameters MRA.In addition in both cases, the inaccuracies of the derived luminosities, which were not included as input constraints amount to 5% in both cases.As we want to decrease the inaccuracies at the level of or below the statistical uncertainties, this shows that there is room for improvements in the inference process or in the present solar modelling.This also emphasized the importance of building a set of stars with determination of mass and/or radius, and or age (benchmark stars) independently of stellar modelling as is done for the GAIA mission (Heiter et al. (2015b) Heiter et al. (2015a)) and in preparation for the PLATO mission (Maxted & Creevey 2023).In the case of the 'degraded Sun', the net error budget for the age remains close to ∼ 10 % accuracy.One must nevertheless keep in mind that free parameters entering the solar and stellar modelling (namely, the initial helium abundance, convective efficiency parameter, α MLT ) are calibrated for the Sun so that the solar model reproduces the radius, luminosity at the age of the Sun.On the seismic side, the surface corrections of the frequency are designed for the theoretical frequencies of the stellar models to match the observed ones.Inaccuracy in the solar modelling are then either compensated or minimized by such procedures.This cannot be done for other solar-like stars and one must either attribute arbitrarily the solar values to the free parameters or adjust them during the fitting process for other stars.This can lead to hidden inaccuracies.What can then be and is done is rather studying the sensitivity of the fitted results to changes in the physical description or the values of the free parameters.This is what was carried out for the two brightest (V ∼ 6) solar-like stars from the Kepler LEGACY, which we discuss below.

Best studied stars of the Kepler Legacy
The two best studied stars of the Legacy sample, the stars 16 Cyg A (KIC12069424) and B (KIC12069449) belong to a multiple system and show solar-like oscillations (Metcalfe et al. 2012;Lund et al. 2017).They are bright stars for which interferometric radii are available.We can also assume that they were born with the same chemical composition and have the same age.Unlike the Sun, we have no independent measurements of the masses and independent determinations of their ages.On the other hand, the information of a common age and interferometric radii can act as validation of the inferred results and assessment of the accuracy of the MRA inferences.The most recent studies dedicated to 16 Cyg A,B were those of Bazot (2020), Farnir et al. (2020), Nsamba et al. (2022), Buldgen et al. (2022) who provided references to former works.The uncertainties are found of the order of 4% and 15% for the masses and ages, respectively.The interferometric radii are well reproduced with uncertainties of 2%.As an illustration, we carried out seismic inferences for both stars with the two already mentioned grids C21 and Mo23.We inferred the MRA for each Kepler star independently using again the Ball & Gizon (2014a) surface-effect correction for the frequencies.The observational constraints are listed in Table 8.The sample of frequencies are those provided by Lund et al. (2017).As can be seen in Table 9, the uncertainties derived from the MRA inference processes are small, namely, ∼ 1% or below in all MRA cases and with both grids.They are smaller for the C21 grid than the Mo23 grid.At that level of relative uncertainties, this is likely due to a difference in the properties of the grids such as the number of adjusted free parameters or to a difference in density of stellar models in the parameter space around the studied stars.This would deserve further investigation but is out of scope in the present study.As for the accuracy, the relative departure of the inferred radius of each star from its respective interferometric radius is slightly above 1% for both grids (Table 10), showing that the radius is well constrained by seismology, rather independently of the physical description of the stellar models in the grids.Table 10 also gives ∆A = (A X − < A >)/δA, the relative differences between the age of each star and their average age, (A X − < A >) for X = 16CygA and 16CygB and < A >= (A 16CygA + A 16Cyg B )/2 compared to the relative uncertainties of the inferred ages, δA.This shows that the departure from a common age is of the order of the inference uncertainties.
Along with the Sun, the 16Cyg system is often used to test the sensitivity of the inferred results to the use of new/updated inference approaches or new and updated physical processes implemented in stellar modelling (e.g.Bellinger et al. (2016)  The centroid values fall in the same ranges as found by previous authors.However they were not able to find stellar models of both stars with a common age and the same chemical composition while assuming the same physical description.They had to give up either the assumption of the same chemical composition or assume that the stars undergo different efficiency of the atomic diffusion, probably counteracted by additional transport processes yet to be identified.In our illustrative case, we give in Table 9 the relative differences of the median values for the mass, radius, and age of each star resulting from the inferences using the two grids Mo23 and C21.As is well-known and already seen above with the 'degraded Sun' discussion, the age is the most affected by differences between the two grids.Here again the main reason is the inclusion or not of atomic diffusion.This illustrates the lessons we can learn from the study of seismically well characterized stars.

Conclusion
We estimated the detection probability of solar-like oscillations for the target stars of the ESA project PLATO as provided by the version 1.1.0 of the PLATO input catalogue.The targets belong to different samples: stars with the lowest expected noise level constitute the P1P2 sample (main sequence and subgiant FGK stars with magnitude less or equal to 11) and the P5 sample contains similar types of stars but with a higher noise level.A positive detection was assumed whenever the probability that the signal is due to noise is less or equal to 0.1% and the probability of the signal be due to solar like oscillation is larger than 99%.We then found that we can expect positive detections of Table 10: Relative differences (%) between the interferometric radius and the corresponding inferred radius ∆R = (R int − R X )/R int for each star.Relative differences between ages of both stars compared to the inference uncertainties ∆A ≡ (|A 16CygX − < A 16Cyg > |/δA where < A 16Cyg > is the mean of ages of 16 Cyg A and 16Cyg B.
Mo23 C21 ∆R 1.27-1.241.27-1.32∆A 0.65-0.781.19-1.00 16 CygA 16 CygB DM (%) 3.26 2.73 DR (%) 1.14 0.97 DA (%) -13.00 -13.43 solar-like oscillations for 5858 +176 −879 stars in the P1P2 sample in one single field after a two-year run of observation assuming the instrument remains nominal over the two years.The given uncertainties are due to false negative and false positive detections as calibrated with Kepler data and likely mostly due the fact that we could not take into account the stellar activity or a non solar chemical composition.For the P5 sample, we find a positive detection of 9491 +285 −1424 stars in the same observing conditions and assuming the same relative uncertainty percentages.As a whole, we can expect more than 15000 stars with solar-like oscillations to be compared to the Kepler solar-like oscillating (main sequence and subgiant stars) sample of 624 stars (M22).
The S/Ns of the targets in the P1P2 sample is (by construction of the sample) high enough that individual mode frequencies can be measured with high precision.For the P1P2 targets for which we predicted a positive seismic detection, we computed the expected frequency uncertainties.We used the error propagation due to those frequency uncertainties to estimate the relative uncertainties that we must expect for the seismically inferred masses, radii, and ages of those targets.
Focusing on main sequence stars with masses of ≤ 1.2M ⊙ , we found that about 1131 stars satisfy the PLATO requirements for the uncertainties of the seismically inferred stellar masses, radii, and ages in one single field after a two-year run of observation.Those stars will constitute an enlarged set of well characterized stars, compared to the Kepler LEGACY sample, which contains 66 stars, out of which about 31 main sequence stars with mass ≤ 1.2M ⊙ .
We note that the PLATO mission is expected to operate for four years, with possible extensions over 4.5 more years.This will make possible to more than double the number of detection of solar-like oscillators or to increase signicantly the precision of the measurements (of frequencies and then of MRA inferences), depending on whether we observe several fields or remain longer on one field.
We must stress that to the above uncertainties, we must add uncertainties due to systematic errors that mostly arise from imperfect physical description of our stellar models.Those can contribute up to 5 to 10% to the age uncertainties depending on the type of stars.Ongoing theoretical works are therefore currently addressing the main problems of inaccuracy.Tests and validations of improvements in the physical description of stellar models must use the best seismically characterized stars at our disposal.While the well-characterized stars of the Kepler Legacy sample helped us to identify such stellar modelling biases and offering a path to solving them, further advances are currently limited by the small number of stars able to bring tight constraints on the various modelling issues of solar-like oscillating stars.It is therefore one of the key goals of the PLATO mission to collect a sufficiently large number of stars with the highest quality data that can serve as benchmark stars or calibration stars to improve stellar modelling.The expected sample of PLATO solar-like oscillators will provide a much larger diversity of well-characterized stars than that available today.This will result in a larger and denser parameter space in terms of mass, age, chemical composition, and rotation rate.This will then make it possible to reduce the uncertainties in stellar modelling, particularly with regard to the internal transport processes that mainly affect the determination of stellar ages.
acknowledgements We would like to thank the anonymous referee for his pertinent comments, which helped to improve the manuscript.This work presents results from the European Space Agency (ESA) space mission PLATO.The PLATO payload, the PLATO Ground Segment, and PLATO data processing are joint developments of ESA and the PLATO Mission Consortium (PMC).Funding for the PMC is provided at national levels, in particular by countries participating in the PLATO Multilateral Agreement (Austria, Belgium, Czech Republic, Denmark, France, Germany, Italy, Netherlands, Portugal, Spain, Sweden, Switzerland, Norway, and United Kingdom) and institutions from Brazil.Members of the PLATO Consortium can be found at https://platomission.com/.The ESA PLATO mission website is https://www.cosmos.esa.int/plato.We thank the teams working for PLATO for all their work.M.C. acknowledges the support of Fundac ¸ão para a Ciência e Tecnologia FCT/MCTES, Portugal, throughnational funds by these grants UIDB/04434/2020, UIDP/04434/2020.FCT, 2022.Kepler short cadence data.Top: S17's catalogue (S17's table 3).Bottom: Legacy data (L17).Dashed lines are linear fits.
It is known that the global seismic scaling relations Eqs. ( 25) and Eqs. ( 26) are accurate at the level of a few percents (Huber et al. (2017), S17 and references therein).This is enough for our purpose since masses and radii are only used here to delimitate some sub-samples of stars of interest.When ν max is not known, it is derived by inverting Eq. ( 33) The seismic parameter ν max can be obtained from a seismic analysis (e.g. from Kepler data) and T eff is obtained from a spectroscopic study.Because for the PLATO targets, ν max are not yet known, we use the stellar radius and T eff from the PIC to derive ν max .

A.2 Oscillation power amplitude A max
Both formulations, Eqs. ( 9) and ( 19), for the power, P tot , involve the oscillation maximum amplitude A max .In order to adopt a theoretical relation for the amplitudes, we compare the amplitudes given by two semi-empirical relations derived in C11 and S19 with the measured amplitudes, A max,obs , of the Kepler Legacy sample by Lund et al. (2017) (their Table 3).We recall that this sample is composed of 66 main-sequence stars with the highest quality of seismic data, and for which individual oscillation modes are identified and their frequencies, amplitudes, and line widths are measured with the highest level of precision.
On the theoretical side, C11 derived the relation (Eq. 9 in C11): where  12 with the observed amplitudes A max,obs for the LEGACY sample from (Lund et al. 2017). and S19 used the relation between A max , ν max , ∆ν ⊙ , and T eff established by Corsaro et al. (2013) based on their model 4β (Eq.19 , Table 3 in Corsaro et al. (2013)) valid for 4000µHz > ν max > 150µHz.We eliminate ∆ν/∆ν ⊙ with Eq. ( 29).With s = 0.82, a = 1, this yields the second relation that we will consider (S19): Both relations involve the effective temperature T eff , C11 also involves the stellar radius and S19 involves ν max .We also need the stellar mass to discriminate between low and high mass cases.
The stellar radius and the stellar mass are given by the scaling relation Eq. ( 32). Figure 10 compares the amplitudes A max,scal as given in Table 12 to the observed amplitudes A max,obs for the Legacy sample.The comparison leads us to adopt as best estimate the purely empirical amplitude relations as given in Table 12.As can be seen in this figure, our empirical calibration of A max is quite satisfactory.The rescaling factors 1.19 and 1.31 in Table 12 can be explained as arising from uncertainties in the estimate of the Kepler noise.The threshold M = 1.15M ⊙ is arbitrary but reflects the fact that the scatter in A max is larger for stars in the high mass regime corresponding to higher effective temperature for which measurements of frequencies, linewidths, and amplitudes are more difficult (Appourchaux et al. 2012).On the theoretical side, the 1.15M ⊙ threshold corresponds to the discrimination between main sequence stars with and without a convective core.

B Validation for the probability of detection
In order to validate our calculations, we computed the detection probability for the samples of Kepler stars with known detection (sample 1) on one hand and the sample of stars for which no solar-like oscillations were detected (sample 2) on the other hand.For sample 1, we estimate the percentages of false negative detections (i.e. the number of stars which we predict no seismic detection).For sample 2, we estimate the percentages of false positive detections (i.e. the number of stars from which we predict a seismic detection).For the results, we consider 3 types of mass regimes: R1 (MS stars with M < 1.6M ⊙ and subgiants all masses), R2 (stars with M ≤ 1.2), R3 (MS stars with B.1 Probability of detection for the sample 1 and sample 2 When computing the detection probability for sample 1 (413 stars), we assumed as our baseline the following input assumptions: ν max , ∆ν and T eff are from M22, masses are derived with Eq.27 and δν env = ν max /2.The yields are listed in Table 13.We find, in the R1 mass regime when P det ≤ 0.99, a percentage of about ∼ 10% of false negative detections when the detection probability is P det ≤ 0.99 and ∼ 5% when P det ≤ 0.90.The choice of P det ≤ 0.99 obviously predicts a larger number of false negative detections compared to the choice of P det ≤ 0.90 and is therefore more conservative for estimating the percentage of positive detections.As a sanity check, we verified that all stars in the Legacy sample are found with a detection probability equal to 1 as expected.
We next considered the 990 Kepler stars in sample 2. For that sample, we assumed as our baseline the following input assumptions: ν max derived from log g and T eff given by M19, ∆ν from the scaling relation Eq.29 with a = 1, s = 0.82, masses are derived with Eq.32 and δν env = ν max /2.Table 14 provides the yields of the calculations of the detection probability for those stars.Assuming a seismic positive detection when P det > 0.99, we find that the percentage of false positive detections is ∼ 20%, 11%, and 7% for the R1, R2, and R3 mass regimes, respectively .When smoothing the PSD in order to decrease the noise level in case of a low S/N, it is found that the measured amplitudes are decreased by about 6% compared to the true amplitudes (Lund et al. 2017).Assuming detection when P det > 0.99, the impact of decreasing our theoretical amplitudes by 6% leads to a percentage of false positive detections ∼ 17%, 10%, 6% for the R1, R2 and R3 mass regimes, respectively, which is slightly lower that for the baseline case (∼ 21%, 11% and 7% in Table 13).The impact is not important and we will no longer consider the case of decreasing the amplitudes.Here the choice of P det > 0.99 rather than 0.90 provides a lower number of false positive detections.
Hence, in both cases above, P det > 0.99 is the conservative choice, namely, lowering the number of positive detections and maximising the number of false detections.We therefore adopted a detection threshold at 99% when considering the PLATO targets.
There are a number of assumptions behind the above calculations which lead to some uncertainties in the results.In the next sections, we estimate the impact of the main sources of uncertainties in predicting the detection probability (assuming the conservative case of a detection threshold at 99%).

B.2 Impact of uncertainties in the input quantities
We estimated the impact on the detection probability of the uncertainties in the various input quantities.We carried out the calculations, changing one or several input quantities at a time and the results must be compared to the results obtained with the baseline cases.
We assessed first the impact on the number of false negative detections with sample 1 when using the values of ν max , ∆ν, and T eff from the S17 instead of using those of M22.The results are listed in Table 15.We note that the number of stars-in each mass regime-can change depending on the adopted assumptions, specifically because the seismic mass is computed using a scaling relation which involves ν max , ∆ν, and T eff .The false negative detections in case of the S17 input parameters are ∼ 11%, 14%, 16% for R1, R2, and R3, respectively (Table 15), namely, slightly greater than ∼ 10%, 6%, and 21%, respectively, when using the set ν max , ∆ν and T eff from M22 in Table 14.The impact is higher when considering the smaller sample of MS stars with M ≤ 1.2 (R3) than the whole sample (R1).As a second type of uncertainty, we considered the changes when one uses the scaling relation Eq. (29) (with s = 0.82 and a = 1) for ∆ν, ∆ν scaling , which is what we have to use in absence of an observed value (case of sample 2 and Plato targets) in Appendix A. We computed the detection probability using our adopted sample of oscillating stars, sample 1.As can be seen from Table 15, we find false negative detections at the level of 8%, 4%, and 21% for the R1, R2, and R3 mass regimes, respectively.These figures must be compared with 10%, 6%, and 21%, respectively found with the baseline case (Table 14).Here the changes in the false negative percentages come from the fact that changing the values of ν max and ∆ν impacts the mass derived with the scaling relation Eq. ( 12) and, thus, the number of stars in each subsample.The impact remains small.The third type of uncertainty concerns our choice for the width of the envelope of the oscillations in a power spectrum.We then compare the yields obtained assuming the baseline case δν env = ν max = 1/2 and the more optimist case δν env = ν max for sample 1.In Table 15, we can see that the false negative detections are about 2.5 smaller than when assuming δν env = ν max /2.One then underestimates the detection rates when assuming δν env = ν max compared to the δν env = ν max /2 case.The comparison between the two options for δν env can also be made with sample 2 for the false positive detection rates.One finds a higher false positive detection rate when assuming δν env = ν max (Table 16) instead of δν env = ν max /2 (Table 14).Therefore, we overestimate the positive detection rate when assuming δν env = ν max compared with the δν env = ν max /2 case.As a net result and to remain conservative, we chose the option which gives a lower false positive detection rate to the cost of a higher false negative rate.This justifies taking δν env = ν max /2 for our baseline condition.We will nevertheless provide also the results in the more optimistic case δν env = ν max for the PLATO targets.

B.3 Global uncertainties for the predicted percentage of predicted seismic detection
In anticipation of the investigation for the PLATO case, we considered as a single sample the total sample of 1349 Kepler stars with detected solar-like oscillation (413 stars, sample 1) and with no detection of solar-like oscillation (936 stars from sample 2) in the R1 mass regime and similarly the total sample of 620 stars in the R2 mass regime and the total sample of 412 stars in the R3 mass sample.According to Table 17, the calculations predicting the number of seismic positive detections applied to PLATO targets in Sect. 5 underestimate it by 3.0% for all mass regimes and overestimate it by 14%, 7%, 7% for the R1, R2, and R3 mass regimes respectively in the baseline conditions.We then used those rates to estimate the uncertainties on the predicted number of positive detections, say X, for the PLATO targets in Sect.5, as follows: X +3% −14% for R1, X +3% −7% for R2, and X +3% −7% for R3.C Uncertainties of the inferred mass, radius, and age We seek relative uncertainties for the mass, radius, and age, δM/M, δR/R, δA/A, which can be computed for the PIC targets.
In accordance with the stellar requirements of the PLATO mission (which is the most challenging goal), we focus exclusively on the MS stars with masses ≤ 1.2M ⊙ .We want to estimate the purely statistical uncertainties for the seismically inferred masses, radii, and ages generated by  the propagation of seismic observational uncertainties only.In order to eliminate as much as possible all systematic errors (which will be briefly discussed in the conclusions/discussion section), we built a set of synthetic stars which masses and ages covering the ranges of interest here: (M/M ⊙ , A(Gyr)) = (0.9, 3.), (1.0, 2), (1.0, 4.57), (1.0, 9.0), (1.03, 4.6), (1.08, 4.6), (1.15, 2.0), (1.15, 9.0) and infer their seismic masses, radii, and ages by means of a routinely used a grid-based approach, as described in Cunha et al. (2021).

C.1 A set of synthetic stars and their frequencies
We built the stellar models of the synthetic stars with the evolutionary code CESTAM (Morel & Lebreton 2008;Marques et al. 2013) with the input physics as much as possible similar to that of the stellar models included in the input grid.The locations of those fictitious stars in the HR diagram are shown in Fig. 11.
The frequencies for each synthetic star are calculated with ADIPLS (Christensen-Dalsgaard 2008).The adopted uncertainties on the frequencies are scaled with respect to those of the 'degraded Sun' (corresponding to the Sun seen as a star) (Lund et al. 2017) which was used as a reference for the studies of the Kepler Legacy sample in Silva Aguirre et al. (2017).For the degraded Sun, the ℓ = 1 mode frequency closest to ν max (ν max,⊙ = 3090µHz) with the smallest uncertainty (δν ℓ=1,max,⊙ = 0.057µHz) is ν ℓ=1,max,⊙ = 2963.3µHz.
For each synthetic star, the frequency uncertainty for each frequency ν nl is taken as δν n,ℓ = x × δν n,ℓ,⊙ .We infer the MRA uncertainties for the cases x = 1, 3, 5, 7, 10, which cover the range of uncertainties for the PLATO P1P2 sample.
For sake of simplicity, we keep the same number and types of modes as given above for which frequencies are computed in all considered cases, although this number decreases when the S/N decreases.Some impact of such degradation is discussed in Cunha et al. (2021).

C.2 MRA inference for the set of synthetic stars
The stellar MRA and their uncertainties are obtained with the inference code AIMS (Rendle et al. 2019;Lund & Reese 2018).AIMS reads as an input a precomputed grid of stellar models.
For convenience, we adopt the stellar grid described and used in Cunha et al. (2021).
The observational constraints for each star are T eff , L/L ⊙ , [Fe/H], and the frequencies of the individual modes ℓ = 0 (n = 16 − 27), 1 (n = 16 − 27), 2 (n = 15 − 27) for the degree ℓ and radial order n similar to the synthetic star known as Zebedee (a young Sun) studied in Cunha et al. (2021).In order to eliminate systematic errors that are not relevant in this section, we take as central values the exact values of T eff , L/L ⊙ , [Fe/H] from the models of the synthetic stars.The uncertainties (1σ) adopted are 70K, 0.3, 0.05 respectively, as expected at the time of PLATO launch.
For also inferred the stellar mass, radius and age, and their uncertainties for the Zebedee case (real values of the corresponding stellar model: M/M ⊙ , A(Gyr) = 1.0165, 3.085).In that particular case, the uncertainties are taken from Cunha et al. (2021).Since the input frequencies for that synthetic star included surfaceeffect corrections according to the formulation of Ball & Gizon (2014a,b), we also included the surface effect correction according to the same prescription when inferring the stellar parameters for Zebedee with the AIMS code.The results compare well with the results published in Cunha et al. (2021) and we do not show them.The variations of the relative uncertainties for the inferred mass, radius, and age are shown as a function of the uncertainties δν ℓ=1,max in Figs. 12 and 13.As expected the MRA uncertainties increase with increasing δν ℓ=1,max .It can be seen that the precision for the inferred mass and radius is so high that a comfortable margin is left for systematic errors that must be added quadratically in order to obtain realistic mass and radius uncertainties while still satisfying the PLATO requirements.This is also true of stars like the Sun (in mass and age).Indeed for such a Sun-like star, with a frequency uncertainty δν ℓ=1,max = 0.2µHz, the relative age uncertainty is below 5%.
In view of application to the PLATO samples in Sect.6, we show the same results in Fig. 14 than in Fig. 12 and Fig. 13, respectively, but without the mass and age information.We then carried out linear fits of the general trends for the mass and radius uncertainties.For the mass and radius relative uncertainties (in %), the fits give: δM/M = 2.083 δν ℓ=1,max + 0.046 δR/R = 0.707 δν ℓ=1,max + 0.149 , where δν ℓ=1,max is in µHz.The above relations are used for the PIC targets in Sect. 5.The scatter (about 0.5% and 0.1% for the the mass and radius relative uncertainties, respectively) due to different masses and ages remains acceptable for our purposes.We also see that there is too large a scatter for enabling a meaningful fit for δA/A.This would be possible for the absolute age uncertainty, δA.We actually found δA = 0.702 δν ℓ=1,max + 0.054 , but for the PIC targets, the age is unknown and we can only use the relative age uncertainty so we will rather adopt an alternative criterion.As shown in Fig. 14 for δA/A, the relative statistical uncertainty amounts roughly to 7% when δν ℓ=1,Libb,ν max < 0.2µHz.We then adopt the criterion: to select the cases for which relative age uncertainties of < 10% can be expected.This is in accordance with Appourchaux (2020).This leaves as much margin to allow for systematic errors (that must be added in quadrature to obtain the final uncertainties).This is a challenge that has driven -and still drivesmany theoretical studies in the community (see Sect. 7 for a brief discussion).
D Relation between σ Libb,ℓ=1 and δν ℓ=1,max In absence of observations as it is the case today for the PLATO targets, a convenient way of estimating the uncertainties of individual frequencies of solar-like oscillation mode is the generally 5000 5200 5400 5600 5800 6000 6200 6400 T eff (K) accepted Libbrecht (1992) relation: where β = 1/(S /N) is the inverse of the S/N, Γ (in s −1 ) is the FWHM linewidth of the mode, and T obs (in s) the duration of the observation.It is well accepted that this statistical estimate of the frequency uncertainties of solar-like oscillation modes represents well the reality.
To proceed further, we denote σ Libb,ℓ=1,max the Libbrecht (1992) frequency uncertainty on the frequency of a ℓ = 1 mode closest to ν max .It is evaluated for each star with (S /N) max given by Eq. ( 8) and Γ in Table 19 below.D.1 Deriving Γ ℓ=1,max for estimating σ Libb,ℓ=1,max For each star, we take from Lund et al. (2017) the values of the linewidth, Γ ℓ=1,max , of the ℓ = 1 mode with the frequency closest to ν max .Those values are represented in Fig. 15 as function of the effective temperature.We restrict the case to P1P2 stars with mass ≤ 1.2M ⊙ which are the PLATO targets for which we will estimate the MRA uncertainties in Sect.6.One clearly notes two regimes: one regime at high effective temperature where Γ ℓ=1,max increases almost linearly with the effective temperature and one regime at low effective temperature where the scatter and the large uncertainties prevent from establishing a trend with T eff .In the low regime, we therefore keep Γ ℓ=1,max constant at the lowest value Γ 0 and consider also the case when 3Γ 0 for T eff ≤ 5650K (Fig. 15).We then adopt for Γ ℓ=1,max (in µ Hz) the scaling relation adapted from Appourchaux et al. (2012)   quency uncertainty, δν max .We then need to convert σ Libb,ℓ=1,max into the frequency uncertainty δν ℓ=1,max .Again here we use the Kepler Legacy data to carry out that calibration.
For each star, Fig. 16 displays the ratio δν ℓ=1,max /σ Libb,ℓ=1,max as a function of the effective temperature.There is a clear trend: the ratio decreases with the effective temperature.For convenience, we derive a linear fit to represent that trend about that fit.The linear fit gives: δν ℓ=1,max σ Libb,ℓ=1,max = 4.89 − 4.18 T eff 6000K (47) valid for 5000 < T eff ≤ 6200 K.The scatter about the linear relation is about ±0.5.In Sect.6.2, we will therefore consider the effect of adding ∼ 0.5 to the above linear relation on the results for the PLATO targets.

Figure 1 :
Figure1: Histograms of the number of stars from the P1P2 sample (subgiants of all masses and MS stars with masses M/M ⊙ < 1.6) with a probability of > 99 % of positive detection of solarlike oscillation in the case of (1 LOP, BOL,ν env = 0.5ν max ).Top: Distribution in T eff , middle in stellar mass, bottom in stellar radius.The color code represents the assumed observing duration.

Figure 3 :
Figure3: Histograms of the number of stars (MS stars with M < 1.6M ⊙ and subgiants) from the P5 sample with an expected detection of solar-like oscillations with at least 99% probability assuming 730, 365, 90 and 30 days of observations.Conditions are (1 LOP, BOL, δν env = 0.5ν max ).

Figure 5 :
Figure 5: Distribution of frequency uncertainties δν ℓ=1,max for the sample of P1P2 stars with expected positive seismic detection.The color code corresponds to different mass samples as indicated in the top right panel.The top panels correspond to the frequency uncertainties computed with Eq. (20) for observation lengths of 730d (top-left panel) and 90 days (top-right panel).The bottom panels show the frequency uncertainties distribution when one takes into account the scatter in the relation obtained with Eq. (20) (left panel) or the uncertainties in the mode linewidths at low effective temperature width, as per Appendix D (right panel).

Figure 6 :
Figure 6: Histograms in T eff (top), and mass (bottom) for MS stars with M/M ⊙ ≤ 1.2 in the P1P2 sample assuming (1 LOP, BOL, 730 days, δν env = ν max /2) conditions with expected detection of solar-like oscillations and satisfying cases IIa and IIIa listed in Table6.

Figure 11 :
Figure11: Evolutionary tracks for masses from 0.8 to 1.15M ⊙ in step of 0.05M ⊙ (blue solid curves) in a HR diagram.The black dots indicate the location of the synthetic targets stars studied here.The magenta dashed line shows the location of models with their central hydrogen content X c ∼ 10 −6 which we take here as the transition between MS stars and subgiant phases.The transition follows the empirical relation : log L/L ⊙ = 10 (log T eff − 3.7532) + 0.25 for that range of mass given the adopted chemical composition and physics of the stellar models.

Figure 12 :
Figure12: Relative uncertainties for the masses (top) and radii (bottom) with increasing frequency uncertainties for all synthetic stars as a function of δν ℓ=1,max .The color code is given in the panels as (M/M ⊙ , age (Gyr)).The solid blue corresponds to a Sun.The solid curve with black dots represents a case similar to Zebedee (1.0165 M ⊙ , 3.085 Gyr) inCunha et al. (2021).

Figure 13 :
Figure 13: Same as in Fig.12 but for the age uncertainties (top: relative, bottom: absolute).

Figure 15 :
Figure 15: Variation of Γ with the effective temperature.Blue crosses are the observed values while the dashed line represents the computed Γ values.

Table 1 :
Lund et al. (2017)formulation for A max,scal (A max,C11 and A max,S19 are obtained using the scaling relations as in C11 and S19, respectively in Appendix.A.As described in Sect.2.2 and detailed in Appendix A, the proportionality factors are calibrated to the observations inferred byLund et al. (2017)(ν max in µHz, the mass, M, in solar units).

Table 2 :
Numbers of stars in the P1P2 sample in 1 LOP with expected detection of solar-like oscillations after 730 days of observation and assuming δν env = ν max /2.Stellar masses, M, and radii R in solar units.

Table 3 :
Number of stars in the P1P2 sample in 1 LOP with expected detection of solar-like oscillations (P dec > 0.99) assuming δν env = ν max /2.The numbers without parenthesis correspond to stars with estimated seismic masses ≤ 1.2M ⊙ , whereas the numbers in parenthesis correspond to stars with all masses.

Table 4 :
Uncertainties in the number of P1P2 stars with predicted positive seismic detection after 730 days of observations in (1 LOP, BOL) conditions.

Table 5 :
Numbers of stars in the P5 sample with expected positive detection (P dec > 0.99) of solar-like oscillations in (1 LOP, BOL, δν env = 0.5ν max ) conditions.Masses and radii are given in

Table 6 :
Number of MS dwarfs with M ≤ 1.2M ⊙ stars in the P1P2 sample with expected positive seismic detection and satisfying different cases of MRA uncertainties.Assumed conditions are (1 LOP, BOL, 730 days, δν env = ν max /2).Labels a) and b) refer to frequency uncertainties assumed without and with a +0.5 shift due to the scatter in the fitted relation between δν ℓ=1,max and σ Libb,ℓ=1,max (Eq.D.3).We expect that the number of stars decrease when adding new constraints.Cases I do not reduce the number of stars compared with the initial sample of P1P2 MS stars with masses M ≤ 1.2M ⊙ with positive seismic detection.This means that the main constraints are the S/Ns imposed by design and the detection probability.The PLATO requirements for the mass, radius and age uncertainties (case II) of P1P2 stars are automatically satisfied in the PIC, provided the oscillations are detected.

Table 7 :
MRA inference for the 'degraded Sun' of one standard deviation or 68% credible intervals about the median values.grid δ X (%) d X,rel (%) d X,

Table 9 :
Results of MRA inferences (%) for 16 Cyg A and B: 68% credible intervals about the median.16 Cyg A δν ℓ=1,max = ences of ± 3.7% and ± 7% for the mass and age of 16 CygA and for CygB.

Table 11 :
Sensitivity to different input physics, number and values of free parameters and reference solar chemical composition DM ≡

Table 13 :
Numbers of false negative detections for sample 1 with known detected oscillations assuming the baseline conditions.Number of stars predicted with NO seismic detection.

Table 14 :
Numbers of false positive detections for the stars of sample 2 assuming the baseline case for that sample.Numbers of stars predicted with seismic positive detection.

Table 15 :
False negative detections for sample 1.Each row corresponds to one change compared to the baseline case.Figures in parenthesis correspond to the total number of stars in each mass regimes and considered case.Masses, M, are in solar units.

Table 16 :
False positive (P det > 0.99) for sample 2. The baseline is assumed except for δν env = ν max .Masses, M, in solar unit.

Table 17 :
Number of stars and associated percentages of predicted false negative and positive detections when considering the whole Kepler population with and without real detection in each regimes and the baseline assumptions.Masses, M, are in

Table 18 :
Number of stars and associated percentages of predicted false negative and positive detections when considering the whole Kepler population with and without real detection in each regimes and the baseline assumptions except for δν env = ν max .Masses, M, are in solar units.