Open Access
Issue
A&A
Volume 692, December 2024
Article Number A150
Number of page(s) 20
Section Planets, planetary systems, and small bodies
DOI https://doi.org/10.1051/0004-6361/202451537
Published online 09 December 2024

© The Authors 2024

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

The properties of planetary systems are known to be linked to those of their host stars. This relationship is particularly evident in the well-established correlation between the stellar metallicity and prevalence of giant planets. Close-in giant planets are more likely to be found around metal-rich stars (Gonzalez 1997; Fischer & Valenti 2005; Johnson et al. 2010; Buchhave et al. 2012; Thorngren et al. 2016), and giants in general have lower eccentricities when orbiting metal-poor stars (Dawson & Murray-Clay 2013; Buchhave et al. 2018). Further, the characteristic radius valley – a decrease in the occurrence rates of planets between super-Earths and sub-Neptunes (Fulton et al. 2017) – also appears to evolve over time. While warm super-Earths with periods between 10–100 days have a nearly constant occurrence rate of −0.4 < [Fe/H] < 0.4, warm sub-Neptune occurrence rates double over this range (Petigura et al. 2018). Orbital periods of small planets around metal-poor stars tend to be longer (Beaugé & Nesvorný 2013; Wilson et al. 2018), whereas close-in (P < 10 days) small (<2 M) planets are more likely to be found around metal-rich stars (Mulders et al. 2016; Lu et al. 2020). In addition to metallicity, planet properties also correlate with stellar age, effective temperature, and mass. According to Howard et al. (2012) and Mulders et al. (2015), the number of close-in planets, ranging from Earth- to Neptune-sized, decreases as stellar mass and temperature increase. Yang et al. (2020) and He et al. (2021) report that the fraction of stars hosting planets changes from roughly 0.3 for F-type stars to around 0.8 for K-type stars.

Across the Milky Way, stellar populations exhibit significant variations in age and chemical composition (Gilmore & Reid 1983; Prochaska et al. 2000; Reddy et al. 2006; Schuster et al. 2006; Bensby et al. 2014; Masseron & Gilmore 2015). This suggests that the local Galactic environment may significantly impact planetary populations (Nielsen et al. 2023; Boettner et al. 2024). Within the solar neighbourhood, distinct stellar populations can be classified into three categories: the thin disk, which is generally composed of young stars with roughly solar-like metallicities; the thick disk, which hosts an older, more metalpoor but noticeably a-enriched population (Wallerstein 1962); and the halo, which hosts some of the oldest and most metal-poor stars, with metallicities ranging from −7 < Fe/H < −0.5 (Beers & Christlieb 2005; Frebel & Norris 2015; Nordlander et al. 2017; Starkenburg et al. 2018). The majority of exoplanet studies have so far been focused on planets orbiting nearby thin disk stars (Bashi & Zucker 2019), primarily with stellar [Fe/H] > −0.5 and solar-like α abundances. These stars are more common in the solar neighbourhood, and on average are brighter than their older counterparts, naturally promoting their planetary companions for transit and radial velocity detections. Because of this, the effects of the Galactic environment on planet demographics has so far been largely unexplored.

More recently, however, the impact of stellar abundance variations on planet formation has attracted growing interest. Theoretical studies by Bitsch & Battistini (2020) and Cabral et al. (2023) have analysed the effects of chemical abundances from the GALAH and APOGEE spectroscopic surveys on the chemical composition of planetary building blocks (PBBs). Bitsch & Battistini (2020) find that while PBB composition inside the water ice line (T > 150 K) does not change significantly with increasing [Fe/H], outside the ice line, water ice content drastically decreases from nearly 50% at [Fe/H] = −0.4 to 6% at [Fe/H] = 0.4. This decrease is caused by an increasing C/O ratio that binds oxygen in gaseous CO and CO2. They predict that planets that formed outside the ice line will have a strong dependence of water ice content on host star metallicity, with more metal-poor stars hosting more water-rich planets.

Some studies suggest that a high α abundance directly affects the chemical composition of the PBBs, even at a fixed [Fe/H] value. Larger α contents may lead to more icy PBBs, which aids planet formation and can partially compensate for low [Fe/H] metallicity (Bashi & Zucker 2019; Bashi et al. 2020; Morbidelli & Rickman 2015; Gundlach & Blum 2015). Cabral et al. (2023) have found the water content of PBBs to be decreasing with increasing [Fe/H], and that the bimodal distribution of stars in the [Fe/H] − [α/Fe] plane (Wallerstein 1962) leads to a bimodal distribution in PBB compositions, where most chemical species have a α-content dependence. Similar results were found by Santos et al. (2017) and Cabral et al. (2019), suggesting that PBB chemical composition might strongly depend on the α abundance and thus the Galactic origin. However, the exact effect of α elements is not entirely clear. Based on the APOGEE sample, Cabral et al. (2023) found that thin disk stars host more water-rich PBBs, while their GALAH sample showed the opposite. Furthermore, whether the water abundance of PBBs is a good proxy for planetary water contents is still debated. Observationally, Bashi & Zucker (2022) found that close-in super-Earths are more likely to be hosted around thin disk stars rather than thick disk stars, but their sample was limited to only eight planets that were found around thick disk hosts. Adibekyan et al. (2012) have found that planet-hosting stars have a higher abundance of α elements, and Chen et al. (2022) further suggested that the ratio of super-Earths to sub-Neptunes decreases significantly with increasing [α/Fe] in stars, meaning sub-Neptunes are more common around thick disk stars. Zink et al. (2023) analysed the occurrence rate of planets around FGK dwarf stars observed by the K2 mission, finding that stars with large galactic oscillation amplitudes host fewer super-Earths and subNeptunes. They demonstrate that the metallicity gradient with galactic scale height alone cannot explain this observed trend, and suggest that the increased relative alpha-element abundance of these high amplitude stars may influence planet formation. It has also been suggested that while giant planets around sub-solar metallicity stars are rare, they are more common around thick disk stars (Haywood 2008, 2009).

The detection of a large population of close-in giant planets orbiting high-metallicity stars has drawn considerable research attention to such high-metallicity systems since the early days of exoplanet discoveries (Santos et al. 2004; Fischer & Valenti 2005; Johnson et al. 2010; Mortier et al. 2013; Petigura et al. 2018). As a result, studies of low-metallicity systems have largely played a secondary role. However, this focus has begun to shift in recent years, with increasing attention being paid to planets around low-metallicity stars. A long-standing idea is the existence of a minimum metallicity requirement for planet formation. Johnson & Li (2012) argued for a critical metallicity of [Fe/H]crit ∽−1.5 + log(r/1AU). Observationally, similar limits have been proposed around [Fe/H] = −0.5 (Mortier et al. 2012; Mordasini et al. 2012) and −0.7 (Boley et al. 2021). Specifically for hot Jupiters (P < 10 days, R between 0.8–2 RJupiter), Boley et al. (2021) found an upper limit on the occurrence rate of 0.18% for stars with −2 < [Fe/H] < −0.6 based on a study of 11 125 halo stars from the TESS survey. The lowest known metallicity of a hot Jupiter host is [Fe/H] = −0.6 (Hellier et al. 2014). A suggested reason for this is the shortened protoplan- etary disk lifetime in a metal-poor region (Kornet et al. 2005; Ercolano & Clarke 2010; Yasui et al. 2010, see however Winter et al. 2024). Andama et al. (2024) studied planetesimal formation in low-metallicity environments using the 1D chemocomp code. They found that the amount of planetesimals (in terms of mass) decreases with decreasing disk and host star metallicity, especially for low-viscosity disks. The total planetesimal mass drops by multiple orders of magnitude when going from [Fe/H] = 0 to [Fe/H] = −0.4. The drop is weaker for slightly higher viscosity, more turbulent disks, but it is still about one order of magnitude, with a lower limit of [Fe/H] = −0.6. Low metallicity environments therefore present an ideal environment to study the early stages of planet formation.

The PLAnetary Transits and Oscillations of stars (PLATO) mission is a planned European Space Agency space telescope set to launch in 2026, with the primary mission target being the search for terrestrial exoplanets in the habitable zones of Sun-like stars using the transit method. PLATO’s unique design, featuring 24 ‘normal’ cameras and two ‘fast’ cameras with a higher cadence (Rauer et al. 2014, 2016), covers a field of view (FoV) of 49° × 49° per pointing, which is about 20 times larger than Kepler (Nascimbeni et al. 2022). PLATO’s mission target is to observe at least 245 000 FGK dwarf stars with a magnitude V < 13, 15 000 of which are high priority targets with a magnitude V < 11, as well as a further 5000 M dwarfs with V < 16 (Montalto et al. 2021).

In this study, we aim to assess the capabilities of PLATO to study exoplanet populations across Galactic environments. We dynamically classified the likely target stars of the two proposed PLATO long-duration observation fields into three categories – their thin disk, thick disk, and halo membership – based on the Gaia survey. Using the New Generation Planetary Population Synthesis (NGPPS) dataset, based on the Bern planet formation model (Emsenhuber et al. 2021b,a), we simulated the planetary populations in each Galactic component. We then estimated the detection efficiency of PLATO for these planet populations as a function of both the instrumental, planetary, and stellar properties, based on a model by Börner et al. (2022), assuming a nominal PLATO pointing time of 2 years per field.

The structure of this paper is as follows. In Sect. 2 we describe the selection of our stellar sample, and the classification of stars into their respective Galactic components. In Sect. 3 we detail the PLATO detection efficiency model, the NGPPS planet population dataset, and the methodology for creating mock planet populations and observations. In Sect. 4 we present the results of our analysis, including the detection efficiencies, planet occurrence rates, and expected PLATO planet yields for the two long-observation fields. Finally, in Sect. 5 we discuss the implications of our findings for understanding exoplanet demographics and the impact of Galactic environment and stellar properties on planet formation. A summary of our findings can be found in Sect. 6.

2 Stellar populations

2.1 Sample selection and characterisation

The starting point for our analysis is the all-sky PLATO input catalogue (asPIC 1.1; Montalto et al. 2021). The aim of the PLATO mission is the search for terrestrial exoplanets in the habitable zone around solar-type stars, and the asPIC catalogue encompasses the most promising targets for this is goal, based on the Gaia DR2 catalogue (Gaia Collaboration 2016, 2018). They have been selected to lie within a specific region of the Gaia colour-magnitude diagram to ensure they are mostly main-sequence FGK stars, with an additional smaller sample optimised for M-type dwarfs. A magnitude limit is imposed, optimised to maximise the probability of detecting terrestrial planets in the habitable zone. Montalto et al. (2021) also verified that this selection process does not introduce any bias in the metallicity distribution of the sample.

More specifically, the catalogue encompasses 2 675 539 stars, including 2 378 177 FGK dwarfs and subgiants brighter than a visual apparent magnitude of V < 13 with a median distance of 428 pc, and 297 362 M dwarfs brighter than V < 16 with a median distance of 146 pc. Montalto et al. (2021) determine the stellar fundamental parameters (effective temperature Teff, radius R, and mass M) for all target stars, and estimate overall uncertainties on these parameters to be 4% for the Teff, 9% for R and 11% for M. In our analysis, we restrict ourselves to the FGK sub-sample, requiring non-zero estimates for both mass, radius and effective temperature.

We cross-match these stars with the Gaia DR3 catalogue using the cross-match table provided by the (Gaia Collaboration 2023) and retrieve parallaxes, proper motions, and radial velocities (Lindegren et al. 2021; Seabroke et al. 2021). Stars with uncertainties exceeding 20% in any of these parameters are removed.

For this sample, we obtain metallicity [Fe/H] and α-element abundance [α/Fe] from the Gaia DR3 medium-resolution RVS (Radial Velocity Spectrometer) spectra, using the best quality sample as defined by Recio-Blanco et al. (2023)1. This covers about 15% of the sample. If RVS estimates are unavailable, the [Fe/H] values are filled in using the machine learning-based estimates by Andrae et al. (2023), which make use of the low- resolution XP spectra published with Gaia DR3, if possible. We do the same for the surface gravity log ɡ, and remove stars without a metallicity or surface gravity estimate. After these steps, 1 830 288 stars remain (68.4% of the original sample).

We supplemented this information with [Fe/H], α-element abundance and log ɡ estimates from high-resolution spectra found in the APOGEE-DR17 (Abdurro’uf et al. 2022) and GALAH DR3 (Buder et al. 2021) spectroscopic survey databases, if available. We applied the following quality selection: For APOGEE, we require that none of the STAR_WARN, M_H_WARN, or ALPHA_M_WARN quality flags are set. For GALAH, we do the same with the flag_sp, flag_fe_h, and flag_alpha_fe flags. This allows us to retrieve information for a sub-sample of 73 632 target stars from APOGEE and 14 347 stars from GALAH. A discussion on the difference between the APOGEE/GALAH estimates and the Gaia values for these properties can be found in Appendix B.

2.2 Galactic component classification

To classify the stellar sample into three Galactic components: thin disk, thick disk, and halo, we follow the probabilistic approach outlined by Bensby et al. (2003, 2014), which has previously been applied in the context of exoplanet host star classification (Bashi et al. 2020; Chen et al. 2021; Bashi & Zucker 2022). This method relies solely on stellar kinematics, specifically the Galactic velocities (ULSR, VLSR, WLSR) in the Local Standard of Rest (LSR). We assume that the Galactic velocities in the LSR follow a multivariate normal distribution, characterised by the probability density function: fC(ULSR,VLSR,WLSR)=N([ 0Vasym 0 ],[ σUσVσW ]),${f_{\rm{C}}}\left( {{U_{{\rm{LSR}}}},{V_{{\rm{LSR}}}},{W_{{\rm{LSR}}}}} \right) = {\cal N}\left( {\left[ {\matrix{ 0 \cr {{V_{{\rm{asym }}}}} \cr 0 \cr } } \right],\left[ {\matrix{ {{\sigma _U}} \cr {{\sigma _V}} \cr {{\sigma _W}} \cr } } \right]} \right),$(1)

where fC represents the probability density of the star belonging to component C. The kinematic parameter Vasym, σU, σV, and σW differ for each Galactic component, and also depend on the star’s Galactocentric radius (R) and height (Z). For this study, we utilise the revised kinematic characteristics presented by Chen et al. (2021).

To perform the classification, we compute the relative Galactic component membership probabilities, TD/D (thick disk-to- thin disk) and TD/H (thick disk-to-halo), for each star: TDD=XTDXDfTDfD,TDH=XTDXHfTDfH,${{{\rm{TD}}} \over D} = {{{X_{{\rm{TD}}}}} \over {{X_{\rm{D}}}}} \cdot {{{f_{{\rm{TD}}}}} \over {{f_{\rm{D}}}}},\quad {{{\rm{TD}}} \over H} = {{{X_{{\rm{TD}}}}} \over {{X_{\rm{H}}}}} \cdot {{{f_{{\rm{TD}}}}} \over {{f_{\rm{H}}}}},$(2)

where X represents the fraction of stars belonging to a given component and is similarly dependent on R and Z.

Based on the membership probabilities, we categorise the stars into one of three components (Bensby et al. 2014): (i) Thin disk star if TD/D < 0.1 (i.e. the thin disk membership probability is ten times larger than the thick disk membership probability), (ii) Thick disk star if TD/D > 10 and TD/H > 10, and (iii) Halo star if TD/H < 0.1. We apply these large probability ratios in order to reduce contamination. Values that fall between these limits are considered to be not clearly classifiable. They are labelled thick disk candidate (if 0.1 < TD/D < 10) and halo candidate (if 0.1 < TD/H < 10). About 1/3 of stars fall into these unclear candidate categories and will not be considered in our exoplanet population analysis.

The Galactic velocities and coordinates are calculated using the galpy python package (Bovy 2015). The corresponding Toomre diagram is shown in Fig. 1. For the all-sky sample, about 63.4% of stars are classified as thin disk stars, and 36.2% are classified as either thick disk candidates or thick disk stars. The halo candidate and halo star sub-sample makes up about 0.4% of stars, totalling about 6,900 stars out of the approximately 2 million. A detailed breakdown can be found in Table 1.

thumbnail Fig. 1

Toomre diagram of the all-sky sample. The stars are coloured according to their classification into the different Galactic components, as described in Sect. 2.2.

Table 1

Number of stars per stellar population in the all-sky sample, and in the southern (LOPS2) and northern (LOPN1) long-observation fields.

2.3 Chemical abundances: [Fe/H] and [α /Fe] distributions

Different Galactic components exhibit distinct stellar populations, characterised not only by kinematic variations (Gilmore & Reid 1983) but also by chemical composition differences (Bensby et al. 2014; Masseron & Gilmore 2015; Hawkins et al. 2015). These differences have the potential to significantly impact the exoplanet populations associated with each component (Bashi & Zucker 2022; Nielsen et al. 2023; Cabral et al. 2023; Boettner et al. 2024). Ideally, classifying stars into specific Galactic components should consider both kinematic and chemical information, as relying on a single method can result in significant contamination (Bensby et al. 2014; Bashi & Zucker 2019).

Since only about 5% of the sample have [Fe/H] and [α/Fe] estimates from APOGEE or GALAH high-resolution spectra, we cannot uniformly apply this additional condition in our classification across the full sample. However, we can utilise this subset to confirm the effectiveness of the kinematic classification, and loosely quantify contamination.

In terms of metallicity, halo stars are the most metal-poor, followed by thick disk and thin disk stars (Bland-Hawthorn & Gerhard 2016). This trend is also observed in our kinematically classified sample. In Figure 2, we show the metallicity distribution of the stars with [Fe/H] estimates available from APOGEE or GALAH. The thin disk population peaks around [Fe/H] = 0, the thick disk population around [Fe/H] =−0.4. The halo sub-sample only contains 176 stars, but has a clear bimodal distribution, with peaks at [Fe/H]= −0.7 and [Fe/H]= −1.2. The lower metallicity population constitutes primarily accreted stars, while the higher metallicity population are stars formed in situ and expelled onto halo-like orbits by the last major merger (Helmi et al. 2018; Di Matteo et al. 2019).

There is a large overlap in terms of metallicity between the thin disk and thick disk. Because of that, their populations are better distinguished through their distribution in the [Fe/H]−[α/Fe] plane (Prochaska et al. 2000; Reddy et al. 2003, 2006), which is well-known to be bimodal (Wallerstein 1962). We show the distribution of the APOGEE and GALAH sub-samples in this plane in Fig. 3. The kinematically classified stars fall into three distinct clusters, which are largely consistent with the thin disk/thick disk/halo expectation: high metallicity ([Fe/H] ∼ 0) and low α ([α/Fe] ∼ 0) for the thin disk, lower metal- licity ([Fe/H] ∼ −0.4) and high α ([α/Fe] ∼ 0.2) for the thick disk, and low metallicity ([Fe/H] ∼< 0.5) for halo stars (see also Table 2).

To quantify the level of contamination between the kinematically defined populations, we independently classify all stars possessing both metallicity and α-abundance estimates from high-resolution spectra into the three Galactic components. This chemical classification is based on the criteria defined by Horta et al. (2023), as visualised in Fig. 3. The resulting confusion matrix, presented in Fig. 4, reveals that 93.6% of targets share consistent chemical and dynamical classifications. However, this high agreement is largely driven by the significant imbalance in the dataset towards thin disk stars, which are inherently more common.

More importantly, approximately 30% of stars dynamically classified as belonging to the thick disk are chemically classified into the thin disk. This overlap is also visible in the metallic- ity distribution shown in Fig. 2, where the thick disk population exhibits a more pronounced tail towards higher metallicity values ([Fe/H] > 0) than one would expect for a thick disk sample, indicative of thin disk contamination. Similarly, around 33% of dynamically classified halo stars are chemically identified as thick disk stars, and a further 10% as thin disk stars. However, these stars consistently have lower metallicities ([Fe/H] < −0.3) compared to the average metallicity of the thick disk and thin disk sample.

thumbnail Fig. 2

Histogram of the [Fe/H] metallicity distribution for the thin disk, thick disk, and halo components for the all-sky sample. Shown are only those stars that have reliable estimates from APOGEE or GALAH.

thumbnail Fig. 3

Scatter plot of the all-sky sample in the metallicity − α element plane, categorised by their classification into the thin disk, thick disk, and halo component. Only those stars with corresponding APOGEE or GALAH estimates are shown. Panel a: APOGEE. Panel b: GALAH. Darker shades represent a higher density of points. Halo stars are marked as diamonds. Note that for the APOGEE sample, we show [α/M], which may slightly differ from [α/Fe]. The (dashed) chemical separation lines between halo (left section), thick disk (upper right section) and thin disk (lower right section) are based on Horta et al. (2023).

Table 2

Median metallicity and α-element abundance from APOGEE- DR17 and GALAH DR3 for the different stellar populations; the values were calculated for the all-sky sample.

2.4 PLATO fields: LOPS2 and LOPN1

The PLATO mission is designed to observe for at least 4 years in total. The observing strategy is yet to be finalised, but it is likely that PLATO will observe at least two fields during its Long-Duration Observation Phase (LOP). The minimum observation time for a LOP field is one year, with a planned time of at least two years (Rauer et al. 2014, 2016). For this study, we will assume a nominal observation strategy with two years being dedicated to the observation of the southern long-observation field (LOPS2), and a further two years to the provisional northern long-observation field (LOPN1).

PLATO’s optical design includes 24 identical “normal” cameras (and an additional two “fast” with a higher cadence), grouped into arrangements of six, which will observe overlapping parts of the total field. This unique constellation results in a 49° × 49° (approximately 5% of the entire sky) square-shaped FoV, with parts being observed by 6, 12, 18, or 24 cameras (Nascimbeni et al. 2022). Additionally, these observations can be complemented by shorter “Step and stare” observations of about three months each, which could increase the coverage to 40% of the sky.

The first long duration field LOPS2, located in the Southern Hemisphere, has recently been selected. It is located at l = 255.9375° and b = −24.62432° in Galactic coordinates. The preliminary northern field LOPN1 is located at l=81.56250° and b=24.62432° (Nascimbeni et al. 2022). The northern field encompasses the entire Kepler field and north TESS Continuous Viewing Zone (CVZ), whereas the southern field covers a large fraction of the south TESS CVZ.

We use a modified version of the publicly available platopoint code to select the targets in the two LOP fields from the all-sky catalogue. The location and geometry of the fields are shown in Fig. 5. Combining both fields, a total of 255 980 target stars are covered (14.0% of our sample). Of those, around 165 000 stars are classified as thin disk stars, 8000 as thick disk stars, and about 500 stars falling into the halo category (see also Table 1).

thumbnail Fig. 4

Confusion matrix comparing the dynamical and chemical classifications for stars possessing both metallicity and α-abundance estimates from APOGEE or GALAH. Each tile presents the total number of stars (top), precision (middle, fraction normalised along columns), and recall (bottom, fraction normalised along rows). For instance, 3242 stars are consistently classified as belonging to the thick disk by both methods. This corresponds to 68.15% of targets that were kinematically classified as thick disk stars, and 58.79% of targets that were chemically classified as such.

thumbnail Fig. 5

Scatter plot of the all-sky sample in Galactic coordinates (Aitoff projection). Stars that fall into the southern (LOPS2) and northern (LOPN1) long-observation fields are colour-coded by the number of cameras observing the target.

3 Detection and exoplanet population models

3.1 PLATO detection efficiency model

The efficiency of planet detection depends greatly on the instrument, the detection pipeline, and the properties of the stellar system, including stellar magnitude, orbital inclination, etc. A typical approach to assess detection efficiencies for similar instruments, like Kepler and TESS, is to use simulated or observed light curves, inject an artificial transit, and measure the recovery rate for those artificial planets. For this purpose, light curves are usually pre-processed and de-trended, after which transit detection is performed using either the box least squares (BLS, Kovács et al. 2002) or transit least squares (TLS, Hippke & Heller 2019) algorithm.

For this study, we take a simplified approach similar to the one employed by Matuszewski et al. (2023). We model the detection efficiency (DE) as a function of the signal-to-noise ratio (S/N) DE(S/N). We define the S/N as S/N=δσntransit ${\rm{S}}/{\rm{N}} = {\delta \over \sigma }\sqrt {{n_{{\rm{transit }}}}} $(3)

(see also Boley et al. 2021) under the assumption of purely white noise. Here, δ is the transit depth, σ2 is the variance of the detrended light curve excluding the transit, and ntransit is the number of in-transit data points.

The next step is to link the signal-to-noise ratio to the detection efficiency. In the case of Kepler, Fressin et al. (2013) employed a linear function. This function assigns a detection efficiency of 0% for S/N values below 5, a detection efficiency of 100% for S/N values above 16, and a linear increase in between these two points. However, Christiansen et al. (2015) suggested that a Gamma distribution provides a better fit to the Kepler stellar sample.

We thus assume that the detection efficiency DE(S/N) is given by a cumulative Gamma distribution, DE(S/N)=cbaΓ(a)0S/Nta1exp(t/b)dt,${\rm{DE}}({\rm{S}}/{\rm{N}}) = {c \over {{b^a}\Gamma (a)}}\int_0^{{\rm{S}}/{\rm{N}}} {{t^{a - 1}}} \exp ( - t/b){\rm{d}}t,$(4)

with parameters a = 30.87, b = 0.271, c = 0.940 found by Christiansen (2017) for Kepler, although the specific parameter for PLATO will likely differ from those values.

With this parameterisation, DE ≤ 0.01 for an S/N ≤ 5.29, and DE ≥ 0.90 for S/N ≥ 11.13. This is slightly more conservative than the assumption in the PLATO Red Book, which assumes a detection efficiency of virtually 100% for S/N > 10, based on Jenkins et al. (1996, 2010) and Fressin et al. (2013).

We detail the definitions of δ, σ and ntransit in the next sections. The detection efficiencies for a variety of orbital periods and planet radii are shown in Fig. 6.

thumbnail Fig. 6

Contour plot of the detection efficiency as a function of Rp and P. The fixed parameters are Rstar = R, Mstar = M, V = 11, ncameras = 24, cos i = 0, σstar = 10−5, u1 = 0.4 and u2 = 0.15. The white marker is located at P = 1 year, R = 1R.

3.2 Transit depth δ

We calculate the transit depth with a correction for limb darkening using the analytical treatment by Heller (2019). The transit depth is given by the following equation: δ=(RpRs)2IbIA,$\delta = {\left( {{{{R_{\rm{p}}}} \over {{R_{\rm{s}}}}}} \right)^2} \cdot {{{I_b}} \over {{{\langle I\rangle }_A}}},$(5)

where Rp and Rs are the radii of the planet and star, respectively. The impact parameter-dependent flux covered by the planet during mid-transit is denoted as Ib, while ⟨IA represents the average intensity across the stellar disk.

Assuming a quadratic limb darkening law, Heller (2019) find Ib=1u1(11b2)u2(11b2)2${I_b} = 1 - {u_1}\left( {1 - \sqrt {1 - {b^2}} } \right) - {u_2}{\left( {1 - \sqrt {1 - {b^2}} } \right)^2}$(6) IA=1u13u26,${\langle I\rangle _A} = 1 - {{{u_1}} \over 3} - {{{u_2}} \over 6},$(7)

where b = a/Rs cos i is the impact parameter, a is the semi-major axis of the planet’s orbit, and u1, u2 are the limb darkening coefficients. The limb-darkening coefficients depend on Teff and log ɡ, we estimate the values for our sample based on the grid provided for PLATO by Morello et al. (2022). your language preference (US or UK)

3.3 Light curve variance σ2

The primary metric to quantify the light curve variance σ2 is the combined differential photometric precision (CDPP), which consists of two components: an instrument-related component and a stellar component. Again, assuming white noise, we can write σ2=σinstrument 2+σstar 2${\sigma ^2} = \sigma _{{\rm{instrument }}}^2 + \sigma _{{\rm{star }}}^2$.

To quantify the instrumental contribution, we implement the model described by Börner et al. (2022) for the PINE software, which was also for the PLATO field selection process (Nascimbeni et al. 2022). In short, the noise budget is composed of three major components σinstrument 2=σphoton 2+σrandom 2+σsystematic 2,$\sigma _{{\rm{instrument }}}^2 = \sigma _{{\rm{photon }}}^2 + \sigma _{{\rm{random }}}^2 + \sigma _{{\rm{systematic }}}^2,$(8)

where σphoton is the photon noise of the target, σrandom are random noise components including readout and smearing noise, while σsystematic describes systematic components including jitter and breathing noise. For a particular target, the instrumental noise is mostly influenced by the star’s magnitude and the number of cameras observing it.

For faint stars (V > 12), random noise is the primary noise component, while systematic noise becomes dominant for brighter stars (V < 8). In between these two regimes, photon noise dominates. According to the PLATO design specifications, the maximum random noise allowed for a V = 11 star in the end-of-lifetime (EOL) configuration observed with at least 22 cameras is 50 parts per million (ppm) for an integration time of one hour (Börner et al. 2022). This takes account of a failure of up to two cameras during the mission’s lifetime.

The stellar component of the noise, σstar, is influenced by the inherent variability of the star due to factors like granulation, oscillations, and activity, as well as any residual uncertainties after the light curve de-trending process. Studies by Morris et al. (2020) and Heller et al. (2022) estimate the stellar variability for PLATO targets to be between 100 and 200 ppm in a one-hour window. Similar values have been found by Zhang et al. (2024) for the LAMOST-Kepler common star sample. With a cadence of 25s per exposure for the normal cameras (Börner et al. 2022), this amounts to a stellar noise standard deviation between 8 and 17 ppm if the signal is integrated over one hour. These estimates align with those by Matuszewski et al. (2023) and Van Cleve & Caldwell (2016). For this study, we adopt a value of 10 ppm across all stars. The effects of stellar variability on the number of exoplanet detections is further elaborated on in Appendix C.

3.4 Transit observations ntransit

The number of in-transit data points is given by ntransit =Ntransit ttransit tintegration ,${n_{{\rm{transit }}}} = {N_{{\rm{transit }}}} \cdot {{{t_{{\rm{transit }}}}} \over {{t_{{\rm{integration }}}}}},$(9)

where Ntransit is the number of observed transits, ttransit is the transit duration and tintegration = 1 refers to the integration time specified for the noise model (Börner et al. 2022).

Assuming an eccentricity of zero, the number of guaranteed transits is simply given by Ntransit = tpoint P .${N_{{\rm{transit }}}} = \left\lfloor {{{{t_{{\rm{point }}}}} \over P}} \right\rfloor .$(10)

Here, tpoint is the pointing duration of a particular field (nominal two years for the long observation fields) and P the orbital period of the planet. The transit duration is ttransit =Pπarcsin(Rsa{ [ 1+(RpRs) ]2[ (aRs)cosi ]2 }1/2),${t_{{\rm{transit }}}} = {P \over \pi }\arcsin \left( {{{{R_{\rm{s}}}} \over a}{{\left\{ {{{\left[ {1 + \left( {{{{R_{\rm{p}}}} \over {{R_{\rm{s}}}}}} \right)} \right]}^2} - {{\left[ {\left( {{a \over {{R_{\rm{s}}}}}} \right)\cos i} \right]}^2}} \right\}}^{1/2}}} \right),$(11)

where Rp and Rs are the radii of the planet and star, respectively; a is the semi-major axis; and i is the orbital inclination (Seager & Mallén-Ornelas 2003).

Aside from the guaranteed number of transits (Ntransit), an additional transit may be observed, depending on the timing of the initial transit. The overall detection efficiency of a target is calculated as a weighted average of the detection using N_transit and N_transit + 1 transits. The probability of observing an additional transit is pN+1=tpoint (modP)P.${p_{{\rm{N}} + 1}} = {{{t_{{\rm{point }}}}(\,\bmod \,P)} \over P}.$(12)

Finally, for a detection to be considered valid, at least two transits are required. Otherwise, the corresponding detection efficiency is set to zero.

3.5 The NGPPS planet population dataset

In order to study the planet populations in the different stellar populations, and estimate planet yields for PLATO, we employ the New Generation Planet Population Synthesis (NGPPS) dataset, based on the third generation Bern global model for planet formation and evolution (Emsenhuber et al. 2021b). The model is based on the core-accretion paradigm, encompassing the growth of planets from 1 to 100 embryos (each about 0.01 M) into planets ranging from Earth-like to Super-Jupiter giants. The planet embryos grow in fluid-like protoplanetary disks described by gas and planetesimal fields. Direct N-body interactions between growing planets are also included, which increases the number of massive planets compared to isolated evolution.

The model simulations are initialised when planet embryos have formed, bypassing the dust-to-planet embryo growth stages. The number and distribution of embryos are free parameters. The formation phase lasts 20 Myr, after which planets evolve in isolation, influenced by tidal migration and atmospheric escape. This phase duration is longer than typical protoplanetary disk lifetimes, encompassing most growth stages but not fully simulating the late-stage giant impact phase, potentially underestimating rocky planet masses. Emsenhuber et al. (2021b) found planet formation mostly completes within ∼ 1 AU after 20 Myr, while more distant planets remain underdeveloped.

A wide range of physical processes and parameters are included in the model. Among the varied parameters, the initial number of embryos and stellar metallicity are crucial. The number of embryos (NEmbryo) directly influences the final number and properties of the planetary systems, with Emsenhuber et al. (2021a) studying scenarios with 1, 10, 20, 50, and 100 embryos. For consistency, we focus on the 10–100 embryos range. Stellar metallicity is directly related to the dust-to-gas ratio, critical for planet formation.

Emsenhuber et al. (2021b) and Burn et al. (2021) have performed extensive population runs, classifying planets into categories based on their final masses. The host star mass is fixed at 1 M in Emsenhuber et al. (2021a), while Burn et al. (2021) extend the analysis to stars with 10, 30, 50, and 70% of the Sun’s mass, finding systematic correlations between higher [Fe/H] and total disk mass with an increased number of giant planets.

In this study, we focus on FGK-type stars, and therefore, we utilise the 1 M model. Emsenhuber et al. (2021a) conducted 1000 simulations for this model, varying stellar metallicity, among other parameters, for each initial embryo count. This population of 1000 systems serves as the foundation for populating our stellar sample with target planets, enabling us to investigate the planetary population.

3.6 Creating mock populations and observations

To create mock planet populations for our stellar sample, we follow these steps:

(i) Assign planetary systems: we randomly assign each star in the stellar sample a planetary system from the 1000 planetary systems in the NGPPS population (with NEmbryo fixed). Since planetary system architecture is strongly correlated with host star metallicity, we use an assignment probability log10p=α([Fe/H]star[Fe/H]system )2.${\log _{10}}p = - \alpha \sqrt {{{\left( {{{[{\rm{Fe}}/{\rm{H}}]}_{{\rm{star}}}} - {{[{\rm{Fe}}/{\rm{H}}]}_{{\rm{system }}}}} \right)}^2}} .$(13)

with α = 10, so that each 0.1 dex difference in metallicity makes the assignment 10 times less likely. Emsenhuber et al. (2021a) initially assumed a Gaussian metallicity distribution (with mean µ = −0.02 and standard deviation σ = 0.22), so we divide by this probability to avoid skewing our sample towards more common systems in the NGPPS dataset.

(ii) Assign system inclination: We assign each system a random inclination by drawing a sample of cos i from a uniform distribution [0, 1].

(iii) Roche limit check: For consistency, we remove any planet whose semi-major axis falls within the Roche limit dRoche =2.44Rstar (ρstar ρplanet )1/3. ${d_{{\rm{Roche }}}} = 2.44{R_{{\rm{star }}}}{\left( {{{{\rho _{{\rm{star }}}}} \over {{\rho _{{\rm{planet }}}}}}} \right)^{1/3}}{\rm{. }}$(14)

(iv) Metallicity limit (optional): Since NGPPS is restricted to a metallicity range of [−0.6, 0.5], most low-metallicity stars will be assigned systems with metallicity around −0.6. Alternatively, following Andama et al. (2024)’s suggestion that stars with [Fe/H] < −0.6 are unlikely to form planetesimals, we can remove planets from such systems.

To create mock observations from these mock populations, we calculate the detection efficiency for each planet and randomly select a sub-sample based on those probabilities. We assume an eccentricity of zero for all orbits, and assign a detection efficiency of zero to non-transiting planets.

4 Results

4.1 Detection efficiency as a function of stellar and planetary properties

For a fixed planet radius and orbital period, the detection efficiency is primarily determined by the stellar magnitude, stellar radius, and to a lesser extent, the number of cameras observing the target. The detection efficiency is higher for larger planets, shorter orbital periods, smaller stars, and brighter stars.

The distribution of detection efficiencies for all LOPS2 and LOPN1 targets, assuming each star hosts an Earth analogue (1 Earth radius, 1-year orbital period) with a transit aligned with the line of sight (cos i = 0), is shown in Fig. 7a. The distribution is strongly bimodal, with the vast majority of targets assigned a near-zero detection efficiency. Those stars are primarily faint (V ≥ 12) or large (R ≥ 1.5 R). High detection efficiencies (>0.9), are found for bright (V ≤ 9.5), small (R < 1R) stars. Intermediate values are relatively rare, since the detection efficiency rises from near-zero to almost one just by doubling the signal-to-noise ratio from 5 to 11 (see Sect. 3.1).

We define “detectable planets” as those with a detection efficiency of at least 1/Nsample, where Nsample is the number of targets in the thin disk, thick disk, or stellar halo population to which the host belongs. This threshold guarantees that, on average, at least one planet would be found if each target star in the sample possessed a transiting planet with the same orbital period and radius.

In Fig. 7b, we show the fraction of detectable planets as a function of planet radius for the thin disk, thick disk, and halo populations, assuming a 100-day orbital period and cos i = 0. For smaller planets (R ≤ 1 R), the detection efficiency is nearly uniform across all populations. However, for larger planets, the halo population exhibits a smaller fraction of detectable planets compared to the thin and thick disk populations. This difference arises because the halo population contains a bigger proportion of larger or fainter stars, which decreases detectability for a planet of a given size.

Figure 8 displays the fraction of detectable planets as a function of both planet radius and orbital period for all LOPS2 and LOPN1 targets, as well as for the high signal-to-noise PLATO P1 sample. The P1 sample contains all targets with the additional requirements that V ≤ 11 and that the random noise level is below 50ppm in one hour. Across the LOPS2 and LOPN1 fields, 29 238 target stars fulfil this requirement. For the entire sample, an Earth analogue would be detectable around 10–20% of stars. For the P1 sample, this number increases to 50%.

thumbnail Fig. 7

Detection efficiency for the LOPS2 + LOPN1 targets. (a) The distribution of detection efficiencies calculated for the targets, assuming the transit of an Earth analogue (P = 1 year, R = 1R) with cos i = 0. The colour of the bars encodes the mean magnitude of the stars within each bin. (b) The fraction of detectable planets (defined as planets with a detection efficiency >1/Nsample) for the thin disk, thick disk and halo populations, as a function of radius. The detection efficiencies are calculated for cos i = 0 and P = 100 days.

thumbnail Fig. 8

Fraction of detectable planets for the whole sample and P1 subsample, as a function of planet radius and orbital period. Detectable planets are defined as planets with a detection efficiency ≥1/Nsample, assuming cos i = 0 for all planets. The white marker is located at P = 1 year, R = 1 R. (a) All LOPS2 + LOPN1 targets. (b) P1 sample (V ≤ 11 and random noise ≤50 ppm integrated over one hour).

Table 3

Planet categories as defined in Emsenhuber et al. (2021a).

4.2 Planet occurrence rates and multiplicities across stellar populations

To investigate the planet populations around different types of stars, we categorise the planets in our mock populations into five types based on their mass: Earth, super-Earth, Neptunian, sub-giant, and giant. This is the same classification as done by Emsenhuber et al. (2021b), and used in our previous work (Boettner et al. 2024). The exact definitions can be found in Table 3.

For each planet type, we calculate the key metrics: total number of planets, fraction of systems with a planet of the given type, occurrence rate (the total number of planets of this type divided by the total number of target stars), and multiplicity (the average number of planets of a given type around a star hosting at least one such planet). An example for a single mock population is shown in Table 4.

These metrics are calculated for each stellar population (thin disk, thick disk and halo) and for different assumptions about the initial number of embryos in the planet formation simulations (Table 5).

For the NEmbryo = 100 simulations, we find that Earth-like planets are the most common across all stellar populations. The occurrence rate and multiplicity increases with decreasing metallicity in the stellar populations, with an increase in the occurrence rate from 4.56 in the thin disk to 6.02 in the halo. More massive planets become a lot rarer (in comparison to Earth-like planets), as the average metallicity of the stellar population decreases. While in the thin disk, there are roughly as many Earth-like planets as there are super-Earths, they outnumber the super-Earths by a factor of seven in the halo. This trend becomes more extreme as the planet mass increases. For the halo population, there are predicted to be 1000 times as many Earth-like planets as there are giant planets.

The distribution of planets changes with the assumption of the initial number of embryos. While Earth-like planets are the most common type for the NEmbryo = 100 simulation, there are more super-Earths than Earths in the NEmbryo = 10 case. As the number of initial embryos decreases, the number of sub-giant and giant planets in the halo increases, since there is proportionally more material available for them to form and grow. We show the occurrence rate, as a function of planet type and stellar population, for the NEmbryo = 10 and NEmbryo = 100 case in Fig. 9.

We also investigate the effect of a metallicity limit for planet formation. Andama et al. (2024) suggest planetesimal formation is severely inhibited for [Fe/H] < −0.6. As an extreme case, we remove all planets around these low metallicity stars, which has a strong influence on the planet population in the halo. For NEmbryo = 10, the number of super-Earths decreases by a factor of eleven. The same holds true for Earth-like planets in the NEmbryo = 100 case. The number of giant planets is less affected by this limit, with a decrease from 34 to 14 in the NEmbryo = 10 case, and no changes in the NEmbryo = 100. This is caused by the fact that massive planets are inherently rare around these low-metallicity stars in the NGPPS population, independent of a metallicity threshold.

4.3 Expected PLATO planet yields

To estimate the yield of planet detections for PLATO, we combine the planet populations from Sect. 4.2 with the detection efficiencies from Sect. 4.1. We consider various scenarios, varying the number of initial embryos in the planet formation model (10, 20, 50, and 100) and whether we impose a minimum metallicity threshold of [Fe/H] = −0.6, below which planet formation is assumed to be suppressed.

The resulting planet yields, expressed as the mean number of expected detections for different planet types and stellar populations, are presented in Fig. A.1. These estimates are for a 2+2 year observation strategy across the LOPS2 and LOPN1 fields. The heat maps show the expected number of detections as a function of orbital period and planet radius.

In general, we find that the highest planet yields are for superEarths around thin disk stars, with the number of detections decreasing for larger planets, and for planets around thick disk and halo stars. The most populated bin for thin and thick disks are planets with R = 2–10 R, and orbital periods between 2 and 5 days. For halo stars, detections are more likely for planets with a similar radius but longer orbital periods of 20–50 days. The yields are also sensitive to the assumed number of initial embryos. A larger number of embryos leads to more planets, although fewer embryos lead on average to more massive, easily detectable planets. As a result, the NEmbгyo = 10 and 20 simulation runs lead to the largest number of expected detections at 14 481 and 16 361 respectively. The NEmbryo = 100 case on the other hand leads to 13 857 expected detections (Table 6).

Imposing a minimum metallicity threshold of [Fe/H] = −0.6, has a large effect on the detection rate around halo stars, as expected (Table 7). While up to 84 planets can be expected around halo stars without the limit, this number decreases to less than 10 if the metallicity threshold is imposed. In the NEmbryo = 100 case, 23 out of the 300 mock observations contain zero detection of planets around halo stars, if no metallicity limit is imposed. With the metallicity limit, 141 out of 300 planets yield non-detections. This means that while it is likely that PLATO will detect the first exoplanet around a halo star, it is by no means a given. Moreover, a non-detection would not necessarily provide strong evidence for a lower metallicity limit for planet formation around halo stars.

Table 4

Key planet population metrics, calculated for a single mock population with NEmbryos = 100 across all targets in the LOPS2 and LOPN1 fields.

Table 5

Median number of planets for the targets in the LOPS2 and LOPN1 fields across 300 mock simulations, by the number of embryos assumed in the NGPPS populations.

thumbnail Fig. 9

Occurrence rates of different planet types per stellar population, for varying NEmbryo. (a) NEmbryo = 10. (b) NEmbryo = 100.

Table 6

Median number of expected planet detections in the different Galactic components across the LOPS2 and LOPN1 fields, for a varying number of initial embryos NEmbryo.

5 Discussion

5.1 General findings and comparison to previous work

Our estimates predict the detection of between 13 000 and 17 000 planets for a 2+2 year observation strategy across the LOPS2 and LOPN1 fields. This number only accounts for stars classified as thin disk, thick disk, or halo, representing about 51% of the targets in the full asPIC catalogue. The vast majority (>90%) of these detections are expected in the thin disk, with 400–1000 planets predicted for the thick disk and between 1 and 80 planets for the halo.

Most detected planets are likely to be super-Earths and subNeptunes (R = 2–10 R) with orbital periods between 1 and 100 days. These planets strike a balance between occurrence rate and detectability. We also anticipate 20–130 Earth-like planets with orbital periods of 250–500 days, some of which will be rocky planets in the habitable zone, PLATO’s primary target. Our estimates are compatible with Heller et al. (2022), who predict 11–34 planets with radii between 0.5 and 1.5 R in the habitable zone. Our larger upper limit estimate results from the fact that we use a simpler model for stellar variability and planet detection, as well as a broader mass bin (M = 0.5–2M) for Earth-like planets.

Our estimates align with Matuszewski et al. (2023), who used a similar methodology based on the NGPPS sample. We find similar numbers for the planet yield, with a slightly lower number for us (when scaled to the full asPIC sample) due to our treatment of the stellar radius distribution and limb-darkening effects. It is worth stressing that final planet yield strongly depends on the assumed underlying planet population. Using the NGPPS populations, we find an overall occurrence rate of 4.6 for planets with M > 0.5 M and P < 500 days in the NEmbryo = 100 case, and an occurrence rate of 2.6 in the NEmbryo = 10 case. This is in contrast to empirical estimates by Hsu et al. (2019) and Kunimoto & Matthews (2020). Hsu et al. (2019) find an upper limit on the occurrence rate of about 5 planets per star, primarily due to large uncertainty on the occurrence rate of small (0.5 R < R < 1 R) planets with orbits between 64 and 500 days. Exploring this part of the parameter space is the prime goal of PLATO. Kunimoto & Matthews (2020) find a much lower occurrence rate of around one planet per star for orbital periods <400 days, since they expect a much lower fraction of small (<2 R) planets. While the occurrence rate of super-Earths with radii between 2 and 3 R are similar between the different estimates, NGPPS predicts a larger fraction of close-in planets (P < 10 days), which results in a large fraction of our predicted planet yield being these types of planets.

While the number of planets formed in the simulation varies with the initial number of embryos, the expected number of planet detections remains relatively consistent. The model predicting the highest number of planets (NEmbryo = 100) actually yields the lowest number of expected detections. This is because the same amount of disk mass is distributed among more planets in this model, resulting in smaller, less massive planets that are harder to detect. While the median planet radius falls within the super-Earth range, regardless of initial number of embryos, the median radius for the NEmbryo = 10 case (R¯=3.1R$\bar R = 3.1\,{R_ \oplus }$, counting only planets with M > 0.5M) is 50% larger than for NEmbryo = 100 (R¯=2.1R$\bar R = 2.1\,{R_ \oplus }$).

Table 7

Median number of expected planet detections across the LOPS2 and LOPN1 fields, enforcing the metallicity limit. Planets around stars with [Fe/H] < −0.6 are removed.

5.2 Exoplanet demographics in the Galactic context

We find characteristic differences in planet populations across Galactic stellar populations, consistent with our previous simulation-based approach (Boettner et al. 2024). The fraction of low-mass planets (Earth and super-Earth) increases as the average metallicity of the stellar population decreases, matching observational findings that average planet masses increase with host star metallicity (Sousa et al. 2019), and that giant planets are more common around metal-rich stars (Thorngren et al. 2016).

The metal-rich thin disk exhibits a diverse exoplanet population, with Earth-like planets or super-Earths being the most common, depending on the assumed NEmbryo, and hosted in around 90% of systems. More massive planets (Neptunians, Subgiants, giants) are rarer, but still found in about 20% of systems. Compared to the original NGPPS population, our thin disk population is very similar in terms of fraction of planet-hosting systems, occurrence rate, and multiplicity for Earth-like planets, Neptunians, Sub-giants, and giants. This is expected since the NGPPS sample was created to reflect the exoplanet population in the direct solar neighbourhood.

The fraction of giant-hosting systems drops sharply in more metal-poor regions, from 0.16 in the thin disk to 0.004 in the halo. The Earth-hosting fraction remains largely unchanged, while the occurrence rate and multiplicity of Earth-like planets increase. This agrees with previous studies finding that metal-poor stars host a more homogeneous, lower-mass planet population (Petigura et al. 2018).

The number of giant planets in metal-poor populations depends strongly on the assumed initial number of embryos. Fewer embryos lead to more giant planets due to more available material for single planets, while more embryos lead to more Earth-like planets and fewer giants. For the NEmbryo = 100 case, the occurrence rate of Earth-like planets increases with decreasing [Fe/H], while the occurrence rate of super-Earths decreases sharply. For NEmbryo = 10 case, the opposite is true (see Fig. 9). In fact, while Earth-like planets are most common in the thin disk, thick disk, and halo for the NEmbryo = 100 case, super-Earths are the most common in the NEmbryo = 10 case. For NEmbryo = 20, the most common type switches, with Earth-like planets being most common in the thin disk and super-Earths being the most common in the thick disk and halo.

5.3 The impact of host star chemistry on exoplanet populations

The effect of a star’s chemical composition, particularly the abundance of α-elements, on planet formation is an active area of research. Theoretical studies suggest that the abundance of water ice in planetary building blocks, and potentially in planets themselves, depends on both the metallicity and alpha-element content of the host star (Bitsch & Battistini 2020; Cabral et al. 2023). Observationally, there are indications that the relative abundance of super-Earths and sub-Neptunes may vary with α-element abundance (Chen et al. 2022), and that giant planets might be more frequent around α-enriched, metal-poor stars (Haywood 2008, 2009), compared to α-poor, metal-poor stars. However, current observational evidence is limited.

PLATO is poised to significantly expand our understanding in this area by providing a large sample of planets around stars with diverse chemical compositions. We expect PLATO to detect between 400 and 1000 planets around thick disk stars, with an average metallicity of −0.3 to −0.4 and an α-elemental abundance around 0.17. This would be a major increase compared to the eight planets around seven thick disk stars found by Bashi & Zucker (2022) for the Kepler sample. The vast majority of these planets are predicted to be short-period (2–20 days) superEarths and sub-Neptunes (R = 2–10 R), though with the caveat that is type of planet is over-represented in the NGPPS population compared to other estimates (see Sect. 5.1). Bashi & Zucker (2022) find for their limited sample of planets with R = 1–2 R and P = 1–100 days have an occurrence rate around thin disk stars that is more than twice the occurrence rate around thick disk stars. We find a much smaller deviation of only around 10% in the NEmbryo = 100 case, and basically no deviation in the NEmbryo = 10 case. The increased number statistics that PLATO provides will help establish if the findings by Bashi & Zucker (2022) are a statistical fluke, or if the dynamical history of the star has a direct effect on the occurrence rate as they have suggested.

Within the NGPPS population there is only a small hint of a radius valley (Fulton et al. 2017) between super-Earths (R between 1 and 1.7 R) and sub-Neptunes (R between 2.1 and 3.5 R) with periods <100 days (see also Burn et al. 2021, with improvements by Burn et al. 2024). This valley is largely unchanging with metallicity, and between thin disk and thick disk, in contrast to the metallicity evolution suggested by Chen et al. (2022). They find that the valley disappears for lower metal- licity hosts, as the number of super-Earths decreases. For fixed [Fe/H], they further find that the ratio of super-Earths and subNeptunes decreases as the α-elemental abundance increases. The regime probed by PLATO is therefore ideal for studying the evolution of the planet radius valley, and analysing the effects of chemical abundances on the formation of rocky versus gaseous and water-rich versus dry worlds.

We expect to find a smaller number of up to 100 of Neptune and sub-giant-sized planets in the thick disk, mostly with periods <50 days. Additionally, we anticipate detecting up to a dozen giant planets around thick disk stars, although roughly half of these are predicted to have longer orbital periods (>100 days). However, it is worth noting that the fraction of high metallicity stars ([Fe/H] > 0) is elevated in our thick disk sample, due to thin disk contamination, which is likely to artificially increase the occurrence rate of large planets in this sample. Nonetheless, comparing the occurrence rates of large planets around low- metallicity stars will help us understand if high α-content aids planet formation, as suggested by Gundlach & Blum (2015).

5.4 Planets around stars with sub-solar metallicity

The formation of planets around low-metallicity stars is a topic of ongoing research. Theoretical and observational evidence suggests a potential minimum metallicity threshold for planet formation, with estimates ranging from [Fe/H] = −0.5 (Mortier et al. 2012; Mordasini et al. 2012) to −1.5 (Johnson & Li 2012). To date, only a handful of planets around low-metallicity stars have been discovered. A recent study by Andama et al. (2024) suggests that planetesimal formation, is significantly hindered in low-metallicity environments, particularly for [Fe/H] < −0.6.

PLATO has the potential to greatly improve our understanding of this regime of planet formation. A lower metallicity limit, as suggested by Andama et al. (2024), would greatly affect planet yield in low-metallicity environments. Across the LOPS2 and LOPN1 fields, we were able to classify 3451 target stars with [Fe/H] < −0.6 into the thin disk, thick disk or halo. The expected number of planet detections around these targets varies widely. From zero in the presence of a metallicity limit, to around 30 in the NEmbryo = 100 case. This value can reach up to 400 in the NEmbryo = 20 case, which has on average more massive, more easily detectable planets, especially in low- metallicity disks where the available solid material is limited. The detection (or non-detection) of planets around these stars can yield valuable constraints for planet formation models in the low-metallicity regime.

We predict between 1 and 80 planet detections around halo stars, with metallicities < −0.3. Most of these planets are expected to be super-Earths with periods between 2 and 50 days. We want to especially highlight a special target list of 47 halo targets in the high-priority, high-S/N PLATO P1 sample, which are prime targets in the search for planets around halo stars. The sample has an estimated median metallicity of −1.27, although metallicity estimates less reliable for values this low. The highest metallicity star in this sample has [Fe/H] = −0.324. A single star in this sample (Gaia DR3 ID: 4676601464106231040) has an APOGEE-confirmed metallicity estimate of [Fe/H] = −1.467. The full target list can be found in Tables D.1 and D.2.

6 Conclusion

In this study, we have demonstrated the potential of the upcoming PLATO mission to greatly improve our understanding of exoplanet demographics across diverse Galactic stellar populations. By combining asPIC with a probabilistic kinematic classification scheme and the NGPPS planet population synthesis models, we have estimated the PLATO exoplanet yields for the thin disk, thick disk, and stellar halo across the LOPS2 and LOPN1 fields for a 2+2 year observation strategy. Our primary findings are:

  1. PLATO is expected to detect a vast number of exoplanets, primarily super-Earths and sub-Neptunes, orbiting stars within the thin disk, thick disk, and halo. We predict the detection of at least 13 000 planets across these populations, with the majority residing in the thin disk. This extensive dataset will enable detailed studies of exoplanet occurrence rates, architectures, and properties as a function of Galactic location and stellar population.

  2. The exoplanet populations exhibit distinct characteristics across different Galactic components. The thin disk is expected to host a diverse range of planets, including a significant number of habitable-zone rocky planets. The thick disk, with its lower average metallicity, is predicted to have a higher relative fraction of low-mass planets (Earth-like and super-Earths) and a lower occurrence rate of giant planets.

  3. PLATO will be able to observe at least 400 planets around stars in the α-enriched thick disk. These observations will enable detailed investigations of the relationship between α- element abundance, the properties of planet building blocks, and planet populations. This will provide insights into the role of α elements in planet formation and the potential diversity of planet compositions across different Galactic environments.

  4. We predict the detection of several dozen planets around halo stars, with [Fe/H] < −0.3, and potentially hundreds with [Fe/H] < −0.6 across the thin disk, thick disk, and halo. These observations will help constrain the possible existence of a minimum metallicity threshold for planet formation.

  5. We have identified a list of 47 targets that have been kinematically classified as halo stars in the high-priority, high-S/N PLATO P1 sample across the LOPS2 and LOPN1 fields, making them prime candidates in the search for planets around low-metallicity stars.

These results demonstrate the potential of PLATO to significantly advance our understanding of planet formation and evolution across different stellar populations in the Milky Way. The list of halo target stars identified by us can be found in Tables D.1 and D.2.

Data availability

The full catalogue of stars, classified by their Galactic component membership, a target list of the LOPS2 and LOPN1 fields, as well as the halo special target list, can be found at https://doi.org/10.5281/zenodo.11428968. The analysis code, including notebooks to recreate all figures in this work, are located at https://github.com/ChrisBoettner/plato.

Acknowledgements

C.B. thanks the Young Academy Groningen for their generous support through an interdisciplinary PhD fellowship. P.D. acknowledges support from the NWO grant 016.VIDI.189.162 (“ODIN”) and warmly thanks the European Commission’s and University of Groningen’s CO-FUND Rosalind Franklin program. We want to thank Alexandre Emsenhuber for providing us with the Monte Carlo variable dataset used for the original NGPPS sample, and the Data & Analysis Center for Exoplanets (dace.unige.ch) for seamless access to the NGPPS and observational exoplanet data. Valerio Nascimbeni and Hugh Osborn have helped with access to the platopoint code used in this analysis. We further want to thank Else Starkenburg for her helpful comments. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. This work made use of the Third Data Release of the GALAH Survey (Buder et al. 2021). The GALAH Survey is based on data acquired through the Australian Astronomical Observatory, under programs: A/2013B/13 (The GALAH pilot survey); A/2014A/25, A/2015A/19, A2017A/18 (The GALAH survey phase 1); A2018A/18 (Open clusters with HERMES); A2019A/1 (Hierarchical star formation in Ori OB1); A2019A/15 (The GALAH survey phase 2); A/2015B/19, A/2016A/22, A/2016B/10, A/2017B/16, A/2018B/15 (The HERMES-TESS program); and A/2015A/3, A/2015B/1, A/2015B/19, A/2016A/22, A/2016B/12, A/2017A/14 (The HERMES K2-follow- up program). We acknowledge the traditional owners of the land on which the AAT stands, the Gamilaraay people, and pay our respects to elders past and present. This paper includes data that has been provided by AAO Data Central (datacentral.org.au). Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss4.org. SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics | Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional / MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University. The analysis has been performed using Python, and the scientific data analysis libraries numpy (Harris et al. 2020), scipy (Virtanen et al. 2020), scikit-learn (Pedregosa et al. 2011) and pandas (The pandas development team 2023). Astronomical calculations and data retrieval were performed using astropy (Astropy Collaboration 2022), astroquery (Ginsburg et al. 2019) and galpy (Bovy 2015). Visualisation were made using matplotlib (Caswell et al. 2023) and seaborn (Waskom 2021). The PLATO targets were selected using a modified version of platopoint by Hugh P. Osborn. The code used for this analysis can be found at https://github.com/ChrisBoettner/plato.

Appendix A Additional figure

thumbnail Fig. A.1

Heat maps of the expected number of observed planets in across the LOPS2 and LOPN1 fields for a 2+2 year observation strategy, as a function of orbital period and planet radius. The numbers in each bin represent the mean and the interquartile range (16th to 84th percentile) for 300 mock observations, for NEmbryo = [10, 50, 100] (top to bottom). The heat maps on the right side (b, d, f) enforce a minimum stellar metallicity of [Fe/H] = −0.6, below which no planets can form as suggested by Andama et al. (2024). Note that these estimates are only for the targets we were able to classify into one of the three components (see Table 1), not considering stars that were categorised as thick disk or halo candidates, or those removed in the sample selection process (Sect. 2.1).

Appendix B Comparison of Gaia metallicity and surface gravity with APOGEE and GALAH

We compare the metallicity estimates derived from Gaia (using medium-resolution spectra, and XGBoost, using the quality flags described in Sect. 2.1) with those from the APOGEE and GALAH sub-samples, which are based on high-resolution spectra. The Gaia estimates show good agreement with the high- resolution values within the metallicity range of −2 to 0.8 (Fig. B.1). The mean metallicity for the Gaia, APOGEE and GALAH estimates are −0.096, −0.085 and −0.100, respectively. For surface gravity, the Gaia estimates align well with the APOGEE values between 2 and 4.8. For the GALAH sample, the values match between 3 and 4.6. For values below three, the Gaia estimates are systematically biased towards larger values (Fig. B.2).

thumbnail Fig. B.1

Gaia metallicity estimates for our sample compared to values determined from APOGEE and GALAH. Panel (a): APOGEE. Panel (b): GALAH. The Gaia estimates show a larger fraction of low- and high-metallicity stars, while the APOGEE and GALAH samples are more centralised around 0. The colour represents the density of points, while the grey dashed line shows the x=y diagonal.

thumbnail Fig. B.2

Gaia surface gravity estimates for our sample compared to values determined from APOGEE and GALAH. Panel (a): APOGEE. Panel (b): GALAH. Note the artefacts around 3.7–4.7 cm/s2, where the Gaia results have a tendency to underestimate log g for some targets.

Appendix C Effects of stellar variability

Variations in stellar flux due to intrinsic effects like granulation are an inherent source of noise in exoplanet transit photometry and spectroscopy. This stellar variability is widely recognised as a major factor limiting the detection capabilities of Kepler, which failed to identify a true Earth analogue due to underestimating the activity of FGK stars compared to the Sun (Jenkins 2002; Gilliland et al. 2011). The heightened stellar activity in these stars resulted in a photometric precision of 30 ppm over 6.5 hours for Kepler, falling short of the 20 ppm goal for a Kp = 12 star (Jenkins & Dunnuck 2011).

While stellar variability poses a significant challenge, studies suggest it should not prevent PLATO from detecting Earth-like planets (Hippke & Angerhausen 2015; Morris et al. 2020). The extent to which a transit is affected by stellar variability depends on various factors, including the properties of the star and planet, as well as the de-trending method employed. Morris et al. (2020) conducted a comprehensive analysis using the PLATO Solar-like Light-curve Simulator (PSLS, Samadi et al. 2019) to assess this impact.

For this study, we adopted a simplified approach, assuming a stellar variability of 10 ppm for a light curve integrated over one hour. In comparison to systematic, random, and photon noise, this contribution is relatively minor for most targets in the LOPS2 and LOPN1 fields, where the overall noise ranges from 30 ppm to 400 ppm (Fig. C.1). Stellar variability primarily becomes a concern for detecting small planets at large orbital distances.

Figure C.2 illustrates how the number of detected planets is influenced by stellar variability. Our simple model exhibits a linear decrease in the number of detected planets with increasing stellar variability between 0 and 200 ppm, with a slope of 0.001 ppm−1. Even at a variability of 200 ppm, approximately 80% of planets are still detectable. The missed planets are primarily Earth-like planets with orbital periods exceeding 100 days.

thumbnail Fig. C.1

Distribution of estimated total noise (systematic + random + photon), excluding stellar variability, for the target stars in the LOPS2 and LOPN1 fields. The noise modelling is described in Sect. 3.3.

thumbnail Fig. C.2

Effects of stellar variability on the number of exoplanet detections. The values are normalised to the number of detections without a stellar variability component (σstar = 0). The grey vertical line corresponds to the 10ppm we assume for this work.

Appendix D Halo special target list

Tables D.1 and D.2 contain the targets that have been kinematically classified as halo stars, and fall within the high-priority PLATO P1 sample.

Table D.1

Special target list for halo stars in the LOPS2 field.

Table D.2

Same as Table D.1, but for the LOPN1 field.

References

  1. Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [CrossRef] [Google Scholar]
  2. Adibekyan, V. Zh., Delgado Mena, E., Sousa, S. G., et al. 2012, A&A, 547, A36 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Andama, G., Mah, J., & Bitsch, B. 2024, A&A, 683, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Andrae, R., Rix, H.-W., & Chandra, V. 2023, ApJ, 267, L8 [Google Scholar]
  5. Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
  6. Bashi, D., & Zucker, S. 2019, ApJ, 158, 61 [CrossRef] [Google Scholar]
  7. Bashi, D., & Zucker, S. 2022, MNRAS, 510, 3449 [NASA ADS] [CrossRef] [Google Scholar]
  8. Bashi, D., Zucker, S., Adibekyan, V., et al. 2020, A&A, 643, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Beaugé, C., & Nesvorný, D. 2013, ApJ, 763, 12 [Google Scholar]
  10. Beers, T. C., & Christlieb, N. 2005, ARA&A, 43, 531 [NASA ADS] [CrossRef] [Google Scholar]
  11. Bensby, T., Feltzing, S., & Lundström, I. 2003, A&A, 410, 527 [CrossRef] [EDP Sciences] [Google Scholar]
  12. Bensby, T., Feltzing, S., & Oey, M. S. 2014, A&A, 562, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Bitsch, B., & Battistini, C. 2020, A&A, 633, A10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  14. Bland-Hawthorn, J., & Gerhard, O. 2016, ARA&A, 54, 529 [Google Scholar]
  15. Boettner, C., Dayal, P., Trebitsch, M., et al. 2024, A&A, 686, A167 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  16. Boley, K. M., Wang, J., Zinn, J. C., et al. 2021, ApJ, 162, 85 [CrossRef] [Google Scholar]
  17. Börner, A., Paproth, C., Perez, J. C., et al. 2022, PLATO’s Signal and Noise Budget (Berlin: Springer) [Google Scholar]
  18. Bovy, J. 2015, ApJS, 216, 29 [NASA ADS] [CrossRef] [Google Scholar]
  19. Buchhave, L. A., Latham, D. W., Johansen, A., et al. 2012, Nat, 486, 375 [NASA ADS] [CrossRef] [Google Scholar]
  20. Buchhave, L. A., Bitsch, B., Johansen, A., et al. 2018, ApJ, 856, 37 [Google Scholar]
  21. Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506, 150 [NASA ADS] [CrossRef] [Google Scholar]
  22. Burn, R., Schlecker, M., Mordasini, C., et al. 2021, A&A, 656, A72 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. Burn, R., Mordasini, C., Mishra, L., et al. 2024, Nat. Astron., 8, 463 [Google Scholar]
  24. Cabral, N., Lagarde, N., Reylé, C., Guilbert-Lepoutre, A., & Robin, A. C. 2019, A&A, 622, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  25. Cabral, N., Guilbert-Lepoutre, A., Bitsch, B., Lagarde, N., & Diakite, S. 2023, A&A, 673, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Caswell, T. A., Elliott Sales de Andrade, Lee, A., et al. 2023, https://doi.org/18.5281/zenodo.18152882 [Google Scholar]
  27. Chen, D.-C., Xie, J.-W., Zhou, J.-L., et al. 2021, ApJ, 909, 115 [NASA ADS] [CrossRef] [Google Scholar]
  28. Chen, D.-C., Xie, J.-W., Zhou, J.-L., et al. 2022, ApJ, 163, 249 [CrossRef] [Google Scholar]
  29. Christiansen, J. L. 2017, Kepler Sci. Doc. KSCI-19110-001, 18 [Google Scholar]
  30. Christiansen, J. L., Clarke, B. D., Burke, C. J., et al. 2015, ApJ, 810, 95 [NASA ADS] [CrossRef] [Google Scholar]
  31. Dawson, R. I., & Murray-Clay, R. A. 2013, ApJ, 767, L24 [Google Scholar]
  32. Di Matteo, P., Haywood, M., Lehnert, M. D., et al. 2019, A&A, 632, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. Emsenhuber, A., Mordasini, C., Burn, R., et al. 2021a, ApJ, 656, A70 [Google Scholar]
  34. Emsenhuber, A., Mordasini, C., Burn, R., et al. 2021b, A&A, 656, A69 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  35. Ercolano, B., & Clarke, C. J. 2010, MNRAS, 402, 2735 [Google Scholar]
  36. Fischer, D. A., & Valenti, J. 2005, ApJ, 622, 1102 [NASA ADS] [CrossRef] [Google Scholar]
  37. Frebel, A., & Norris, J. E. 2015, ARA&A, 53, 631 [NASA ADS] [CrossRef] [Google Scholar]
  38. Fressin, F., Torres, G., Charbonneau, D., et al. 2013, ApJ, 766, 81 [NASA ADS] [CrossRef] [Google Scholar]
  39. Fulton, B. J., Petigura, E. A., Howard, A. W., et al. 2017, ApJ, 154, 109 [Google Scholar]
  40. Gaia Collaboration (Prusti, T., et al.) 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Gaia Collaboration (Vallenari, A., et al.) 2023, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  43. Gilliland, R. L., Chaplin, W. J., Dunham, E. W., et al. 2011, ApJS, 197, 6 [NASA ADS] [CrossRef] [Google Scholar]
  44. Gilmore, G., & Reid, N. 1983, MNRAS, 202, 1025 [Google Scholar]
  45. Ginsburg, A., Sipocz, B. M., Brasseur, C. E., et al. 2019, AJ, 157, 98 [NASA ADS] [CrossRef] [Google Scholar]
  46. Gonzalez, G. 1997, MNRAS, 285, 403 [Google Scholar]
  47. Gundlach, B., & Blum, J. 2015, ApJ, 798, 34 [Google Scholar]
  48. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
  49. Hawkins, K., Jofré, P., Masseron, T., & Gilmore, G. 2015, MNRAS, 453, 758 [NASA ADS] [CrossRef] [Google Scholar]
  50. Haywood, M. 2008, A&A, 482, 673 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  51. Haywood, M. 2009, ApJ, 698, L1 [NASA ADS] [CrossRef] [Google Scholar]
  52. He, M. Y., Ford, E. B., & Ragozzine, D. 2021, ApJ, 161, 16 [CrossRef] [Google Scholar]
  53. Heller, R. 2019, A&A, 623, A137 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  54. Heller, R., Harre, J.-V., & Samadi, R. 2022, A&A, 665, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  55. Hellier, C., Anderson, D. R., Collier Cameron, A., et al. 2014, MNRAS, 440, 1982 [NASA ADS] [CrossRef] [Google Scholar]
  56. Helmi, A., Babusiaux, C., Koppelman, H. H., et al. 2018, Nature, 563, 85 [Google Scholar]
  57. Hippke, M., & Angerhausen, D. 2015, ApJ, 810, 29 [NASA ADS] [CrossRef] [Google Scholar]
  58. Hippke, M., & Heller, R. 2019, A&A, 623, A39 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  59. Horta, D., Schiavon, R. P., Mackereth, J. T., et al. 2023, MNRAS, 520, 5671 [NASA ADS] [CrossRef] [Google Scholar]
  60. Howard, A. W., Marcy, G. W., Bryson, S. T., et al. 2012, ApJS, 201, 15 [Google Scholar]
  61. Hsu, D. C., Ford, E. B., Ragozzine, D., & Ashby, K. 2019, ApJ, 158, 109 [CrossRef] [Google Scholar]
  62. Jenkins, J. M. 2002, ApJ, 575, 493 [Google Scholar]
  63. Jenkins, J. M., & Dunnuck, J. 2011, Proc. SPIE, 8146, 814602 [NASA ADS] [CrossRef] [Google Scholar]
  64. Jenkins, J. M., Doyle, L. R., & Cullers, D. K. 1996, Icarus, 119, 244 [NASA ADS] [CrossRef] [Google Scholar]
  65. Jenkins, J. M., Caldwell, D. A., Chandrasekaran, H., et al. 2010, ApJ, 713, L87 [Google Scholar]
  66. Johnson, J. L., & Li, H. 2012, ApJ, 751, 81 [NASA ADS] [CrossRef] [Google Scholar]
  67. Johnson, J. A., Aller, K. M., Howard, A. W., & Crepp, J. R. 2010, PASP, 122, 905 [Google Scholar]
  68. Kornet, K., Bodenheimer, P., Rózyczka, M., & Stepinski, T. F. 2005, A&A, 430, 1133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  69. Kovács, G., Zucker, S., & Mazeh, T. 2002, A&A, 391, 369 [Google Scholar]
  70. Kunimoto, M., & Matthews, J. M. 2020, ApJ, 159, 248 [CrossRef] [Google Scholar]
  71. Lindegren, L., Bastian, U., Biermann, M., et al. 2021, A&A, 649, A4 [EDP Sciences] [Google Scholar]
  72. Lu, C. X., Schlaufman, K. C., & Cheng, S. 2020, ApJ, 160, 253 [CrossRef] [Google Scholar]
  73. Masseron, T., & Gilmore, G. 2015, MNRAS, 453, 1855 [CrossRef] [Google Scholar]
  74. Matuszewski, F., Nettelmann, N., Cabrera, J., Börner, A., & Rauer, H. 2023, A&A, 677, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  75. Montalto, M., Piotto, G., Marrese, P. M., et al. 2021, A&A, 653, A98 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Morbidelli, A., & Rickman, H. 2015, A&A, 583, A43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  77. Mordasini, C., Alibert, Y., Benz, W., Klahr, H., & Henning, T. 2012, A&A, 541, A97 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  78. Morello, G., Gerber, J., Plez, B., et al. 2022, Res. Notes AAS, 6, 248 [CrossRef] [Google Scholar]
  79. Morris, B. M., Bobra, M. G., Agol, E., Lee, Y. J., & Hawley, S. L. 2020, MNRAS, 493, 5489 [NASA ADS] [CrossRef] [Google Scholar]
  80. Mortier, A., Santos, N. C., Sozzetti, A., et al. 2012, A&A, 543, A45 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  81. Mortier, A., Santos, N. C., Sousa, S. G., et al. 2013, A&A, 557, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  82. Mulders, G. D., Pascucci, I., & Apai, D. 2015, ApJ, 798, 112 [Google Scholar]
  83. Mulders, G. D., Pascucci, I., Apai, D., Frasca, A., & Molenda-Zakowicz, J. 2016, ApJ, 152, 187 [Google Scholar]
  84. Nascimbeni, V., Piotto, G., Börner, A., et al. 2022, A&A, 658, A31 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  85. Nielsen, J., Gent, M. R., Bergemann, M., Eitner, P., & Johansen, A. 2023, A&A, 678, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  86. Nordlander, T., Amarsi, A. M., Lind, K., et al. 2017, A&A, 597, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  87. Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, J. Mach. Learn. Res., 12, 2825 [Google Scholar]
  88. Petigura, E. A., Marcy, G. W., Winn, J. N., et al. 2018, ApJ, 155, 89 [CrossRef] [Google Scholar]
  89. Prochaska, J. X., Naumov, S. O., Carney, B. W., McWilliam, A., & Wolfe, A. M. 2000, ApJ, 120, 2513 [Google Scholar]
  90. Rauer, H., Catala, C., Aerts, C., et al. 2014, Exp. Astron., 38, 249 [Google Scholar]
  91. Rauer, H., Aerts, C., Cabrera, J., & PLATO Team 2016, Astron. Nachr., 337, 961 [Google Scholar]
  92. Recio-Blanco, A., de Laverny, P., Palicio, P. A., et al. 2023, A&A, 674, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  93. Reddy, B. E., Tomkin, J., Lambert, D. L., & Allende Prieto, C. 2003, MNRAS, 340, 304 [Google Scholar]
  94. Reddy, B. E., Lambert, D. L., & Allende Prieto, C. 2006, MNRAS, 367, 1329 [Google Scholar]
  95. Samadi, R., Deru, A., Reese, D., et al. 2019, A&A, 624, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  96. Santos, N. C., Israelian, G., & Mayor, M. 2004, A&A, 415, 1153 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  97. Santos, N. C., Adibekyan, V., Dorn, C., et al. 2017, A&A, 608, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  98. Schuster, W. J., Moitinho, A., Márquez, A., Parrao, L., & Covarrubias, E. 2006, A&A, 445, 939 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Seabroke, G. M., Fabricius, C., Teyssier, D., et al. 2021, A&A, 653, A160 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  100. Seager, S., & Mallén-Ornelas, G. 2003, ApJ, 585, 1038 [Google Scholar]
  101. Sousa, S. G., Adibekyan, V., Santos, N. C., et al. 2019, MNRAS, 485, 3981 [NASA ADS] [CrossRef] [Google Scholar]
  102. Starkenburg, E., Aguado, D. S., Bonifacio, P., et al. 2018, MNRAS, 481, 3838 [NASA ADS] [CrossRef] [Google Scholar]
  103. The pandas development team 2023, https://doi.org/18.5281/zenodo.8364959 [Google Scholar]
  104. Thorngren, D. P., Fortney, J. J., Murray-Clay, R. A., & Lopez, E. D. 2016, ApJ, 831, 64 [NASA ADS] [CrossRef] [Google Scholar]
  105. Van Cleve, J. E., & Caldwell, D. A. 2016, Kepler Instrument Handbook, Technical report, Kepler [Google Scholar]
  106. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261 [Google Scholar]
  107. Wallerstein, G. 1962, ApJS, 6, 407 [NASA ADS] [CrossRef] [Google Scholar]
  108. Waskom, M. 2021, JOSS, 6, 3021 [NASA ADS] [CrossRef] [Google Scholar]
  109. Wilson, R. F., Teske, J., Majewski, S. R., et al. 2018, ApJ, 155, 68 [CrossRef] [Google Scholar]
  110. Winter, A. J., Benisty, M., & Andrews, S. M. 2024, ApJ, 972, L9 [NASA ADS] [CrossRef] [Google Scholar]
  111. Yang, J.-Y., Xie, J.-W., & Zhou, J.-L. 2020, ApJ, 159, 164 [CrossRef] [Google Scholar]
  112. Yasui, C., Kobayashi, N., Tokunaga, A. T., Saito, M., & Tokoku, C. 2010, ApJ, 723, L113 [NASA ADS] [CrossRef] [Google Scholar]
  113. Zhang, J., Xiang, M., Yu, J., et al. 2024, ApJS, 272, 40 [CrossRef] [Google Scholar]
  114. Zink, J. K., Hardegree-Ullman, K. K., Christiansen, J. L., et al. 2023, ApJ, 165, 262 [CrossRef] [Google Scholar]

1

Specifically, this means that first 13 GSP-Spec quality flags for the RVS spectra are equal to zero. For more information, see Appendix C in Recio-Blanco et al. (2023).

All Tables

Table 1

Number of stars per stellar population in the all-sky sample, and in the southern (LOPS2) and northern (LOPN1) long-observation fields.

Table 2

Median metallicity and α-element abundance from APOGEE- DR17 and GALAH DR3 for the different stellar populations; the values were calculated for the all-sky sample.

Table 3

Planet categories as defined in Emsenhuber et al. (2021a).

Table 4

Key planet population metrics, calculated for a single mock population with NEmbryos = 100 across all targets in the LOPS2 and LOPN1 fields.

Table 5

Median number of planets for the targets in the LOPS2 and LOPN1 fields across 300 mock simulations, by the number of embryos assumed in the NGPPS populations.

Table 6

Median number of expected planet detections in the different Galactic components across the LOPS2 and LOPN1 fields, for a varying number of initial embryos NEmbryo.

Table 7

Median number of expected planet detections across the LOPS2 and LOPN1 fields, enforcing the metallicity limit. Planets around stars with [Fe/H] < −0.6 are removed.

Table D.1

Special target list for halo stars in the LOPS2 field.

Table D.2

Same as Table D.1, but for the LOPN1 field.

All Figures

thumbnail Fig. 1

Toomre diagram of the all-sky sample. The stars are coloured according to their classification into the different Galactic components, as described in Sect. 2.2.

In the text
thumbnail Fig. 2

Histogram of the [Fe/H] metallicity distribution for the thin disk, thick disk, and halo components for the all-sky sample. Shown are only those stars that have reliable estimates from APOGEE or GALAH.

In the text
thumbnail Fig. 3

Scatter plot of the all-sky sample in the metallicity − α element plane, categorised by their classification into the thin disk, thick disk, and halo component. Only those stars with corresponding APOGEE or GALAH estimates are shown. Panel a: APOGEE. Panel b: GALAH. Darker shades represent a higher density of points. Halo stars are marked as diamonds. Note that for the APOGEE sample, we show [α/M], which may slightly differ from [α/Fe]. The (dashed) chemical separation lines between halo (left section), thick disk (upper right section) and thin disk (lower right section) are based on Horta et al. (2023).

In the text
thumbnail Fig. 4

Confusion matrix comparing the dynamical and chemical classifications for stars possessing both metallicity and α-abundance estimates from APOGEE or GALAH. Each tile presents the total number of stars (top), precision (middle, fraction normalised along columns), and recall (bottom, fraction normalised along rows). For instance, 3242 stars are consistently classified as belonging to the thick disk by both methods. This corresponds to 68.15% of targets that were kinematically classified as thick disk stars, and 58.79% of targets that were chemically classified as such.

In the text
thumbnail Fig. 5

Scatter plot of the all-sky sample in Galactic coordinates (Aitoff projection). Stars that fall into the southern (LOPS2) and northern (LOPN1) long-observation fields are colour-coded by the number of cameras observing the target.

In the text
thumbnail Fig. 6

Contour plot of the detection efficiency as a function of Rp and P. The fixed parameters are Rstar = R, Mstar = M, V = 11, ncameras = 24, cos i = 0, σstar = 10−5, u1 = 0.4 and u2 = 0.15. The white marker is located at P = 1 year, R = 1R.

In the text
thumbnail Fig. 7

Detection efficiency for the LOPS2 + LOPN1 targets. (a) The distribution of detection efficiencies calculated for the targets, assuming the transit of an Earth analogue (P = 1 year, R = 1R) with cos i = 0. The colour of the bars encodes the mean magnitude of the stars within each bin. (b) The fraction of detectable planets (defined as planets with a detection efficiency >1/Nsample) for the thin disk, thick disk and halo populations, as a function of radius. The detection efficiencies are calculated for cos i = 0 and P = 100 days.

In the text
thumbnail Fig. 8

Fraction of detectable planets for the whole sample and P1 subsample, as a function of planet radius and orbital period. Detectable planets are defined as planets with a detection efficiency ≥1/Nsample, assuming cos i = 0 for all planets. The white marker is located at P = 1 year, R = 1 R. (a) All LOPS2 + LOPN1 targets. (b) P1 sample (V ≤ 11 and random noise ≤50 ppm integrated over one hour).

In the text
thumbnail Fig. 9

Occurrence rates of different planet types per stellar population, for varying NEmbryo. (a) NEmbryo = 10. (b) NEmbryo = 100.

In the text
thumbnail Fig. A.1

Heat maps of the expected number of observed planets in across the LOPS2 and LOPN1 fields for a 2+2 year observation strategy, as a function of orbital period and planet radius. The numbers in each bin represent the mean and the interquartile range (16th to 84th percentile) for 300 mock observations, for NEmbryo = [10, 50, 100] (top to bottom). The heat maps on the right side (b, d, f) enforce a minimum stellar metallicity of [Fe/H] = −0.6, below which no planets can form as suggested by Andama et al. (2024). Note that these estimates are only for the targets we were able to classify into one of the three components (see Table 1), not considering stars that were categorised as thick disk or halo candidates, or those removed in the sample selection process (Sect. 2.1).

In the text
thumbnail Fig. B.1

Gaia metallicity estimates for our sample compared to values determined from APOGEE and GALAH. Panel (a): APOGEE. Panel (b): GALAH. The Gaia estimates show a larger fraction of low- and high-metallicity stars, while the APOGEE and GALAH samples are more centralised around 0. The colour represents the density of points, while the grey dashed line shows the x=y diagonal.

In the text
thumbnail Fig. B.2

Gaia surface gravity estimates for our sample compared to values determined from APOGEE and GALAH. Panel (a): APOGEE. Panel (b): GALAH. Note the artefacts around 3.7–4.7 cm/s2, where the Gaia results have a tendency to underestimate log g for some targets.

In the text
thumbnail Fig. C.1

Distribution of estimated total noise (systematic + random + photon), excluding stellar variability, for the target stars in the LOPS2 and LOPN1 fields. The noise modelling is described in Sect. 3.3.

In the text
thumbnail Fig. C.2

Effects of stellar variability on the number of exoplanet detections. The values are normalised to the number of detections without a stellar variability component (σstar = 0). The grey vertical line corresponds to the 10ppm we assume for this work.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.