Open Access
Issue
A&A
Volume 688, August 2024
Article Number A38
Number of page(s) 13
Section Galactic structure, stellar clusters and populations
DOI https://doi.org/10.1051/0004-6361/202449744
Published online 30 July 2024

© The Authors 2024

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

The Milky Way’s disc, similar to those of most other spiral galaxies, is warped. The outer disc is bent upwards in roughly the direction that the Sun is currently moving, and downwards on the opposite side. This was first discovered in 21 cm line measurements of the distribution of H I gas in the Galaxy (Kerr & Hindman 1957; Burke 1957; Westerhout 1957) before being found in the stellar population (Djorgovski & Sosin 1989; Freudenreich 1996) including the Cepheid population, where accurate distance estimation is possible to large distances (e.g. Chen et al. 2019; Skowron et al. 2019). Similarly, warps have been found to be common features in both the gas (e.g. Bosma 1981) and stars (e.g. Sánchez-Saavedra et al. 1990) of spiral galaxies.

The warp influences the movement of stars and leaves a signature in Milky Way stellar kinematics. Systematic vertical velocity trends, dependent on Galactic positions or angular momentum, have long been attributed to the warp (e.g. Dehnen 1998; Drimmel et al. 2000; Poggio et al. 2020; Cheng et al. 2020). For a warp that was not precessing, we would simply expect a net upward motion of stars moving towards the current higher end of the Milky Way’s warp (and downwards for stars moving towards the lower end) in a predictable way. However, a warp that is precessing relatively quickly around the galaxy does, at a certain radius, begin to overtake the stars orbiting around the centre of the Galaxy, and therefore the sign of the stars’ vertical velocity reverses. This is a trend very clearly seen for the Milky Way’s stars by Cheng et al. (2020).

Most kinematic models of the warp assume that the velocity of stars towards or away from the centre of the Galaxy is irrelevant. This assumption underlies “tilted ring” models of galactic warps, which are commonly used for analysis of both external galaxies (e.g. Rogstad et al. 1974; Briggs 1990) and the Milky Way (e.g. Dehnen et al. 2023). It is also an assumption that is often put into models that apply Jeans’ first equation to study the Milky Way’s warp (e.g. Drimmel et al. 2000; Poggio et al. 2020). However, a net radial velocity1 for a set of stars does correspond to a net movement within the warped disc, which one would expect to correspond to a net motion vertically. As shown by Cheng et al. (2020), this naturally enters into Jeans’ first equation and can be treated quantitatively in the corresponding models.

With Gaia data release 2 (DR2: Gaia Collaboration 2016, 2018a) providing new insight into these stellar kinematics, the warp has been found to be precessing relatively rapidly in the prograde direction (the direction of the disc’s rotation) at rates of 10−14 km s−1 kpc−1 (Poggio et al. 2018; Cheng et al. 2020). We note that Chrobáková & López-Corredoira (2021) dispute this finding, though their estimate of 4 4 + 6 $ 4^{+6}_{-4} $ km s−1 kpc−1 is consistent with either a stationary or rapidly precessing warp.

For a sample of Cepheids from Gaia data release 3 (DR3: Gaia Collaboration 2023a), Dehnen et al. (2023) found a similarly high prograde precession rate of 9 − 15 km s−1 kpc−1. This is a much smaller number of stars, but with much more accurate and precise distance estimates, and can be considered an independent data sample and analysis. It also specifically probes the warp in the population of young stars, as Cepheids have ages of a few hundred million years or less. This precession is unexpected in the context of the standard picture of a warp made up of tilted rings of stars, in which the precession is expected to be retrograde and much slower.

In most warp models, the line of nodes, ϕw, along which the warped disc intersects z = 0, is a parameter used to describe the orientation of the warp. A simple approximation that is suitable for relatively local samples, made for example by Poggio et al. (2018) and Cheng et al. (2020), is that ϕw is a constant. However, Briggs (1990) found that the outer parts of warps in external galaxies are twisted, and that ϕw forms a leading spiral in the outer part of the warp (this is known as “Briggs’ rule”). For more extended samples, the twist of the warp can be expected to be significant, and indeed both Chen et al. (2019) and Dehnen et al. (2023) found that their Cepheid sample’s warp followed Briggs’ rule beyond 12 kpc from the Galactic centre, but the latter study also found that the warp precession rate decreased outwards such that it could unwind the twisted warp in ∼100 Myr.

The cause of the Milky Way’s warp remains uncertain. Theories include that it is the torque that arises from a misalignment between the Galactic disc and the dark matter halo (e.g. Dubinski & Chakrabarty 2009), accretion of intergalactic matter onto the disc (e.g. Ostriker & Binney 1989), internal amplification of a small misalignment between the inner and outer disc (Sellwood & Debattista 2021), or the interaction with the Large Magellanic Cloud (e.g. Kerr 1957; Burke 1957) or with the Sagittarius dwarf galaxy (e.g. Bailin 2003). A better understanding of the warp’s behaviour and kinematics is needed for us to compare the predictions of these theories to observations.

With the increased data set provided by Gaia DR3, we are in a position to investigate the warp of the Milky Way over a large area, and to investigate the azimuthal variation of the warp’s kinematic signature. We focus purely on the kinematics of the sample because the physical distribution of the sample seen in the Gaia data is enormously affected by the dust extinction of the disc, which is not well understood. We therefore leave the study of the physical distribution of the warp to others, and will compare the results from our kinematic study to the physical warp found by these works. While doing this we are mindful that any limitations of the kinematic model will be translated into an alteration of the shape of the warp in the model. To keep a clear separation between results from kinematic models, and more direct measurements of the current physical shape of the warp, we use the subscript phys when describing the latter.

Our ability to trace the kinematics of the warp across a large area of the disc allows us to investigate its properties in great detail. It allows us to, for the first time, investigate the twist of the warp’s line of nodes in the outer galaxy. We explore the precession rate and line of nodes of the warp, and how they vary with radius. We also examine the effect of including the radial velocity of stars in our kinematic models, following the pioneering study by Cheng et al. (2020). These authors put all the stars in a given radial range in a single bin, which was characterised by its centre. The influence of the radial velocity on the model vertical velocity is zero along the line of nodes, and the centres of these bins tend to lay along the Sun’s azimuth in the disc, which Cheng et al. (2020) had fixed as the line of nodes. Looking at the stars binned in azimuth gives us a much better lever arm to study the radial velocity’s effect on the vertical velocity in this model, and we show that this effect is significant.

In this paper we describe how we used Gaia DR3 data to study the kinematic signature of the Milky Way’s warp, and to measure its precession rate and line of nodes under different assumptions. Section 2 describes the data sample taken from Gaia DR3, and Sect. 3 describes the kinematic models we used to fit these data. In Sect. 4 we show how the various models fitted the measured average vertical velocities of stars across the Milky Way’s disc, before looking at the properties of these models in Sect. 5. Finally, we discuss these results in context and conclude the paper in Sects. 6 and 7, respectively.

2. Data

To study the kinematic signature of the Milky Way warp we need to know stars’ three-dimensional positions and velocities, which Gaia DR3 provides us to good accuracy for roughly 33 million stars. Since deriving the heliocentric distances from Gaia’s parallaxes alone leads to biased and increasingly uncertain estimates at large distances, we used the Bailer-Jones et al. (2021) photogeometric distances which combine Gaia measured parallaxes with the colour and magnitudes of the star, and a Bayesian prior. The ADQL query used to obtain the data is given in Appendix A. To prevent the very worst distance errors from affecting our results we excluded stars for which the relative parallax error is large (ϖ/σϖ > 1), and for which the Gaia astrometric quality indicator RUWE > 1.4 (Lindegren et al. 2021).

We analysed the data in galactocentric cylindrical coordinates (R, ϕ, z), where the position and velocity of the Sun with respect to the Galaxy’s centre were taken to be the astropy default values2. These are that the Sun is at a cylindrical galactocentric radius R = 8.122 kpc, (GRAVITY Collaboration 2018) and z = 20.8 pc above the Galactic plane, with velocity v = (vR, ⊙, vϕ, ⊙, vz, ⊙) = (−12.9, −245.6, 7.78) km s−1 (Drimmel & Poggio 2018; Reid & Brunthaler 2004). The Sun’s position in galactocentric azimuth is defined to be exactly ϕ = 180°. Since we used a standard right-handed coordinate system, the Milky Way disc’s typical sense of rotation is negative, that is, disc stars are rotating towards lower angles in azimuth. Where we cite studies that have used different conventions for the Sun’s position in ϕ, and for the direction of rotation, the values we quote in this paper are converted into our coordinate system.

We removed contaminants from the sample that come from the Large and the Small Magellanic Clouds (LMC and SMC), and 47 Tucanae. All stars within 12° and 6° respectively of the centres of the LMC and SMC respective on-sky positions3 were removed. For 47 Tucanae we removed stars in the range of R ∈ (6.6, 6.9) kpc and ϕ ∈ (200° ,205° ) with z < −2500 kpc.

The stars in our sample were divided up into bins in galactocentric radii and azimuth. In radii, the bins were 500 pc intervals beginning at R = 4 kpc and extending outwards to the edge of our sample, while the azimuth intervals were 10° wide and extend from ϕ = 120° −240°. We disregarded any bin that contains fewer than 50 stars so that we could determine the average positions and velocities and their uncertainties from a sufficiently large sample. This leaves us with a sample of 26 891 917 stars in 327 bins. A map of the bins included in our sample, coloured by the number of stars in that bin, can be seen in Fig. 1.

thumbnail Fig. 1.

Distribution (left) and average vertical velocity (right) of our sample over the Galactic plane. This is binned, with each bin covering 0.5 kpc in R and 10° in ϕ, both in galactocentric coordinates, with stars in the disc rotating in the negative ϕ direction. We note that moving radially outwards from the position of the Sun v ¯ z $ \bar{v}_z $ rises to around 10 km s−1 before falling to below zero, but that this behaviour disappears for ϕ ≳ 220°.

For each component of velocity, we used the median as our average value, which we preferred to the mean as it is more robust to outliers and contamination. We derived uncertainties on the average of each velocity component v ¯ i $ \bar{v}_i $ using non-parametric bootstrapping. For each bin we resampled the velocities 1000 times, and determined the median value in each case. Our statistical uncertainty σ v ¯ i , stat $ \sigma_{\bar{v}_i, \mathrm{stat}} $ is then the standard deviation of the distribution of resampled medians.

As tests of the robustness of our results, we have looked at data samples with a more conservative cut of 1000 stars per bin, and tests in which we consider stars within limited ranges of R. In this latter case we looked at both 2 kpc annuli in 1 kpc steps with the outer R ranging from 13 to 16 kpc and at samples of stars with R less than a threshold value which ranges from 13 to 18 kpc. We discuss the outputs of these tests when describing our results.

The unprecedented number of stars in Gaia DR3 for which we have full six dimensional phase space measurements allowed us to probe the warp’s kinematic signature as a function of azimuth, and Fig. 1 also shows the average vertical velocity of the sample as a function of position in the Galaxy. It is immediately obvious that the stars in the range 10 ≲ R ≲ 15 kpc are moving upwards across most of the area covered, while those outside ∼15 kpc are moving downwards. We can also see that beyond ϕ ≈ 220° the stars in the range 10 ≲ R ≲ 15 kpc are not moving upwards. As we will discuss through the rest of this paper, these two clear features of the data would appear to indicate that the warp is precessing quickly, and that the line of nodes of the warp, as defined by kinematics, lies closer to ϕ ≈ 135° than the value measured from the positions of stars of around 180°.

3. The warp models

3.1. Warp shape

The mean vertical position of the plane of the disc in our model, z0, depends on both R and ϕ, and we separated these dependencies out, adopting

z 0 ( R , ϕ ) = h ( R ) s ( ϕ , R ) , $$ \begin{aligned} z_0(R,\phi ) = h(R) s(\phi , R), \end{aligned} $$(1)

where h(R) sets the greatest displacement due to the warp at a given R, while s(ϕ, R) is a function that takes values in the range −1 ≤ s ≤ 1.

For h(R), numerical experiment led us to use a piecewise-linear model, with the free parameters being the peak heights at 1 kpc steps from R = 7 to 16 kpc. Inside 6 kpc we assume zero warp, while beyond 16 kpc we extrapolate h(R) linearly from the last segment (or set it as constant if the final segment had a negative gradient – our results do not depend on this approximation). We denote these h7, h8, …, h16. For our robustness tests where there is a cut in R we use the same h(R) model, but limit it to the radii where the model affects the smaller dataset.

It has been common in studies of the Milky Way’s warp to assume that the warp’s dependence on azimuth, s(ϕ, R) in Eq. (1), is a simple sinusoidal function with no dependence on R (see Drimmel et al. 2000; Cheng et al. 2020, and others). We followed this precedent and adopt the azimuthal dependence

s ( ϕ ) = sin ( ϕ ϕ w ω p t ) , $$ \begin{aligned} s(\phi )=\sin (\phi - \phi^\prime _{ w} - \omega _{\rm p}t), \end{aligned} $$(2)

where ϕ w $ \phi^\prime_{\it w} $ is the line of nodes, and ωp is the warp’s precession rate. We present results using this model in this study. However, with the Gaia DR3 data we can go further than this, and we therefore investigated the radial change (or “twist”) of the line of nodes.

We therefore extended our model to allow both ϕ w $ \phi^\prime_{\it w} $ and ωp to vary linearly with R. Specifically, we took

s ( ϕ , R ) = sin ( ϕ ϕ w ( R ) ω p ( R ) t ) , $$ \begin{aligned} s(\phi , R)=\sin \left(\phi - \phi^\prime _{ w}(R) - \omega _{\rm p}(R)\,t\right), \end{aligned} $$(3)

where

ϕ w ( R ) = ϕ w , 0 + ϕ w , 1 ( R R s ) ω p ( R ) = ω p , 0 + ω p , 1 ( R R s ) , $$ \begin{aligned}&\phi^\prime _{ w}(R) = \phi^\prime _{{ w},0} + \phi _{{ w},1}(R-R_{\rm s})\nonumber \\&\omega _{\rm p}(R) = \omega _{\rm p,0} + \omega _{\rm p,1}(R-R_{\rm s}), \end{aligned} $$(4)

where Rs is a fixed at 12 kpc. This is purely a numerical convenience that ensures that ϕ w,0 $ \phi^\prime_{{\it w},0} $ and ωp, 0 took values that are easy to interpret (i.e. the values of the respective parameters at R = 12 kpc). We note that, given our formulation of the problem, typical values of ϕ w (R) $ \phi^\prime_{\it w}(R) $ correspond to the line of nodes on the far side of the Galactic disc. However, by symmetry the other line of nodes in our model is always at ϕ w = ϕ w + 180 ° $ \phi_{\it w} = \phi^\prime_{\it w} + 180^\circ $, so it is this value that we quote, along with ϕ w,0 = ϕ w,0 + 180 ° $ \phi_{{\it w},0} = \phi^\prime_{{\it w},0} + 180^\circ $ for models where the line of nodes varied with R.

3.2. Modelling the kinematic signature

Qualitatively, we can understand that stars moving from the lower part of the warp to the higher part will, today, be observed with a net upward motion. For a warp that varies sinusoidally with ϕ, this velocity will be a maximum at the line of nodes, and zero at the peak or trough of the warp. A first complication to this is that, for a precessing warp, the warp may be catching up with the stars. This can cause stars that appear to be heading towards the higher part of the warp to have a net downward motion. A subtler effect is that stars moving inwards or outwards should also follow the warp shape, and this could also affect the expected velocities of the stars.

We can create a quantitive model for the expected velocities in idealised models using Jeans’ first equation (Jeans 1915), which is an integral of the collisionless Boltzmann equation to give, effectively, the continuity equation. In cylindrical polar coordinates (R, z, ϕ) we can write it as

0 = n t + ( n v ¯ R ) R + 1 R ( n v ¯ ϕ ) ϕ + ( n v ¯ z ) z , $$ \begin{aligned} 0 = \frac{\partial n}{\partial t} + \frac{\partial (n \bar{v}_{\rm R})}{\partial R} + \frac{1}{R} \frac{\partial (n \bar{v}_\phi )}{\partial \phi } + \frac{\partial (n \bar{v}_z)}{\partial z}, \end{aligned} $$(5)

where n is the number density of stars in the disc. We assumed that n follows a double exponential profile for stars in the disc

n ( R , z ) = f ( R ) g ( z ) = n 0 exp ( | z | z h R R h ) , $$ \begin{aligned} n(R, z^\prime ) = f(R)g(z^\prime ) = n_0 \exp \left(-\frac{|z^\prime |}{z_{\rm h}} - \frac{R}{R_{\rm h}}\right), \end{aligned} $$(6)

where the scale height zh and scale length Rh are parameters that ultimately disappear at the end of the model derivation. The term z′ is the distance along the z-axis to the warped midplane of the disc at z0 (Eq. (1)), and is given by

z = z z 0 = z h ( R ) s ( ϕ , R ) . $$ \begin{aligned} z^\prime = z - z_0 = z - h(R)s(\phi , R). \end{aligned} $$(7)

The standard approach to warp modelling in the Milky Way has been to assume a rigidly processing warp, and that the effect of vR is negligible, yielding (Drimmel et al. 2000; Poggio et al. 2020)

v ¯ z , mod = ( v ¯ ϕ R ω p ) h ( R ) s ϕ · $$ \begin{aligned} \bar{v}_{z,\mathrm{mod}} = \left(\frac{\bar{v}_\phi }{R} - \omega _{\rm p} \right)h(R) \frac{\partial s}{\partial \phi }\cdot \end{aligned} $$(8)

However, as we noted above, Cheng et al. (2020) showed how to include the effect of vR motion in this equation, and we can follow their derivation, with the same assumptions that v ¯ ϕ / ϕ = 0 $ \partial\bar{v}_\phi/\partial\phi=0 $ and v ¯ z / z = 0 $ \partial\bar{v}_z/\partial z = 0 $, to arrive at an expression for v ¯ z $ \bar{v}_z $

v ¯ z , mod = ( v ¯ ϕ R ω p ) h ( R ) s ϕ + v ¯ R h ( R ) R s ( ϕ , R ) + v ¯ R h ( R ) s ( ϕ , R ) R · $$ \begin{aligned} \bar{v}_{z,\mathrm{mod}} =&\left(\frac{\bar{v}_\phi }{R} - \omega _{\rm p} \right)h(R) \frac{\partial s}{\partial \phi } + \bar{v}_{\rm R}\frac{\partial h(R)}{\partial R}s(\phi , R)\nonumber \\&+ \bar{v}_{\rm R} h(R) \frac{\partial s(\phi , R)}{\partial R}\cdot \end{aligned} $$(9)

This differs from the equation derived by Cheng et al. (2020, their Eq. (9)), by the addition of the final term which includes s ( ϕ , R ) R $ \frac{\partial s(\phi, R)}{\partial R} $. This new term was required when we introduced the twist to the line of nodes and precession rate as given in Eqs. (3) and (4).

Again, we note that the parameters of the model, such as h(R) and ϕw, are properties of a kinematic model which, if the model is correct, correspond to the physical height of the warp and its line of nodes. The model is inevitably incomplete, and there are clear degeneracies between h and ωp implicit in Eq. (8). We draw a distinction between these model parameters and these properties measured in other studies by referring to the physical warp height and line of nodes as hphys and ϕw, phys, respectively.

3.3. Models summary

The modelling approach described above leaves us with four different models that we fitted to the data. The names we give them through this study are:

  1. Classical – Assume that we should neglect the vR terms, and fix ωp and ϕw as a single values (Eqs. (2) and (8)).

  2. Classical twisted – Assume that we should neglect the vR terms, but allow ωp and ϕw to vary linearly with radius (Eqs. (3) and (8)).

  3. vR term – Include the vR terms, and fix ωp and ϕw as a single values (Eqs. (2) and (9)).

  4. vR term twisted – Include the vR terms, and allow ωp and ϕw to vary linearly with radius (Eqs. (3) and (9)).

For all of these models we treat v ¯ z $ \bar{v}_z $ as the quantity of interest, for which we have measured values and their statistical uncertainties. The statistical uncertainties on v ¯ ϕ $ \bar{v}_\phi $ and v ¯ R $ \bar{v}_{\mathrm{R}} $ enter as, effectively, uncertainties in the model v ¯ z , mod $ \bar{v}_{z,\mathrm{mod}} $ velocity,

σ v ¯ z , mod = σ v ¯ ϕ h ( R ) R s ϕ $$ \begin{aligned} \sigma _{\bar{v}_{z,\mathrm{mod}}} = \sigma _{\bar{v}_\phi }\frac{h(R)}{R} \frac{\partial s}{\partial \phi } \end{aligned} $$(10)

and

σ v ¯ z , mod = σ v ¯ ϕ h ( R ) R s ϕ + σ v ¯ R ( h ( R ) R s ( ϕ , R ) + h ( R ) s ( ϕ , R ) R ) , $$ \begin{aligned} \sigma _{\bar{v}_{z,\mathrm{mod}}} = \sigma _{\bar{v}_\phi } \frac{h(R)}{R} \frac{\partial s}{\partial \phi } +\sigma _{\bar{v}_{\rm R}}\left(\frac{\partial h(R)}{\partial R}s(\phi , R) + h(R) \frac{\partial s(\phi , R)}{\partial R}\right), \end{aligned} $$(11)

for the Classical and vR term models, respectively. We added these in quadrature to the measurement uncertainty v ¯ z $ \bar{v}_z $, along with a systematic uncertainty σ v ¯ z , sys = 0.7 $ \sigma_{\bar{v}_{z,\mathrm{sys}}}=0.7 $ km s−1 to give us

σ v ¯ z , tot = σ v ¯ z , stat 2 + σ v ¯ z , mod 2 + σ v ¯ z , sys 2 . $$ \begin{aligned} \sigma _{\bar{v}_{z,\mathrm{tot}}} = \sqrt{\sigma _{\bar{v}_{z,\mathrm{stat}}}^2 + \sigma _{\bar{v}_{z,\mathrm{mod}}}^2 + \sigma _{\bar{v}_{z,\mathrm{sys}}}^2}. \end{aligned} $$(12)

The systematic uncertainty allows for the imperfections in both the model and the data, and prevents the fit being dominated by the few bins near the Sun that contain over a million stars (see Fig. 1) and therefore have tiny statistical uncertainties. The value of 0.7 km s−1 was chosen such that simple models are, on average, around 1σ from the measurement in each bin. Changing this value by a factor of 2 in either direction did not change our results substantially.

To find the best fitting parameters of the model, and their associated uncertainties, we used a Markov chain Monte Carlo approach using the Goodman & Weare (2010) sampling scheme as applied in the Python package emcee (Foreman-Mackey et al. 2013). The log-probability that we sample was then simply

ln p = 1 2 ( v ¯ z , model v ¯ z , data ) 2 σ v ¯ z , tot 2 + const , $$ \begin{aligned} \ln p = -\frac{1}{2} \sum \frac{(\bar{v}_{z,\mathrm{model}} - \bar{v}_{z,\mathrm{data}})^2}{\sigma _{\bar{v}_{z,\mathrm{tot}}}^2} + {\mathrm{const} }, \end{aligned} $$(13)

which is equivalent to taking a flat prior on all of our parameters. We ran 50 walkers for 20 000 steps, discarding the first 20% as burn-in.

4. Fit to the data

The models we used in this study have ten parameters that describe the greatest height of the warp as a function of R (h7, …, h16) and either two or four which describe the position of the line of nodes and the pattern speed of the warp, with only the two twisted models having four parameters (Eq. (4)).

Appendix B gives the best fitting values of the parameters and their uncertainties in numerical form, while we present them graphically in this main body of the paper. In Figs. 2 and 3 we show the fits to the data. In these figures, as in all figures showing model parameters, we represent the uncertainty by randomly drawing 100 samples of the MCMC chain and plotting them as semi-transparent lines. The uncertainties quoted in Appendix B are taken from the 15.9 and 84.1 percentiles of the MCMC chain after burn-in. Experiments in which we limit our sample to bins with 1000 or more stars give results that are very similar to those shown here, except with larger uncertainties, and these results are shown as an additional table in the appendix.

thumbnail Fig. 2.

Fits to the v ¯ z $ \bar{v}_z $ data for Classical models. Each panel shows data for 10° sectors of the Galaxy, and plots the average vz in 0.5 kpc bins, with associated uncertainties (statistical and systematic). The model fits with a single value of ϕw and ωp are plotted in black, while those of the twisted model are in light blue. Overall both models provide reasonable fits, without major differences.

thumbnail Fig. 3.

Fits to the v ¯ z $ \bar{v}_z $ data for the vR term models. As in Fig. 2, each panel shows data for 10° sectors of the Galaxy, and plots the average vz in 0.5 kpc bins, with associated uncertainties (statistical and systematic). Note that the uncertainties are larger here because we include uncertainties propagated from the vR uncertainties. The models with a single value of ϕw and ωp are plotted in orange, while those of the twisted models are in green. Both models provided reasonable fits, but note in particular that model v ¯ z $ \bar{v}_z $ values are all around zero at larger R.

All four kinematic models capture at least some of the features of the data, with the characteristic rise and fall of v ¯ z $ \bar{v}_z $ as seen by Cheng et al. (2020) found from ϕ = 120° to ∼210°. For the Classical models, the data for ϕ ≳ 210 values were not perfectly fitted by the model. The model flattens towards zero in the outer radii, while the data is almost all below zero. This is somewhat improved for the twisted model, but not for the bins with the largest values of ϕ. For the vR term models the model v ¯ z $ \bar{v}_z $ flatten out to zero in the outer radii in a way that the data does not. The twisted versions of these models provide the best fit of all our models at ϕ > 210, but this comes at the cost of the worst fit of all the models at low ϕ.

In Fig. 4 we show a quantitative comparison of the quality of fit for the four models. We computed a quantity analogous to the chi-squared statistic per degree of freedom, χ ν 2 $ \chi^2_\nu $, for each model

χ ν 2 = 1 N bin N param ( v ¯ z , model v ¯ z , data ) 2 σ v ¯ z , tot 2 , $$ \begin{aligned} \chi ^2_\nu = \frac{1}{N_{\rm bin} - N_{\rm param}} \sum \frac{(\bar{v}_{z,\mathrm{model}} - \bar{v}_{z,\mathrm{data}})^2}{\sigma _{\bar{v}_{z,\mathrm{tot}}}^2}, \end{aligned} $$(14)

thumbnail Fig. 4.

Quantitative comparison of the four models. We compute a quantity analogous to the chi-squared statistic per degree of freedom, χ ν 2 $ \chi^2_\nu $, for each model, and plot the distribution of these values for the final 50 000 steps of the MCMC chain. We note that the Classical models perform significantly better than the vR term.

where Nbin = 327 is the number of bins for our data and the number of parameters Nparam = 12 or 14 for the different models. We emphasise that setting the systematic uncertainty that we apply to be 0.7 km s−1 was guided by the premise that this would lead to χ ν 2 $ \chi^2_\nu $ ≈ 1 for simple models, so the absolute value here is far less important than the relative values for the different models. The Classical models perform significantly better than the vR term models. We also see that the twisted models perform better than the single value models, but that the improvement is not as great as the difference between the vR term and Classical models. These relative results are not changed if we change the systematic uncertainty by a factor of 2 in either direction.

In summary, while all four model provided fits to the kinematic data that seem reasonable to the eye, a quantitative comparison showed that the Classical models perform significantly better than the vR term ones.

5. Properties of the models

We now look at the properties of the models, bearing in mind that the Classical models provide the better fit to the data.

5.1. Warp amplitude

In Fig. 5 we show the maximum amplitudes of the warp in our models, and how it compares with other measurements of the warp amplitude found in the literature. We divide this into results that were found purely from kinematics (labelled h) and from photometric data and star counts (labelled hphys). The two Classical models reach ∼3 kpc at R ≈ 14 kpc, and then remain around the same height (with significant uncertainty) further out. The two vR term models both stay at much smaller heights, reaching only ≲0.3 kpc by R ≈ 13 kpc and then noticeably decreasing further out, down to almost zero in the case of the untwisted model.

thumbnail Fig. 5.

Maximum warp amplitude as a function of radius for our four models (top panel), and for a selection of recent estimates for the older population – rather than in Cepheids or other young populations – in the literature (lower panel). Two of these studies looked at the lopsidedness of the warp and in these cases we give both the warp upwards and downwards, indicated by the two lines for Amôres et al. (2017, with the upward warp being the greater) and the uncertainty bar for the Red Giant Branch (RGB) results of Romero-Gómez et al. (2019, with the downward warp being the greater value).

The literature values we show come from studies which looked at the bulk of the population or specified older populations (López-Corredoira et al. 2002; Amôres et al. 2017; Cheng et al. 2020; Chrobáková et al. 2020; Romero-Gómez et al. 2019; Uppal et al. 2024), as opposed to those which looked at the gas or young populations, such as Cepheid variables, that tend to be less warped than all of these except the Chrobáková et al. (2020) estimate (h typically around 0.5 − 0.8 kpc at R = 14 kpc for Cepheids, e.g. Skowron et al. 2019; Dehnen et al. 2023; Cabrera-Gadea et al. 2024). We pick out specifically the results using red-giant branch (RGB) stars from Romero-Gómez et al. (2019). We chose results for 6 Gyr old stars from Amôres et al. (2017), corresponding to their second-oldest age bin. Their oldest age bin (8.3 Gyr) has significantly larger uncertainties and even steeper warps. Of the results shown, only ours and that from the Cheng et al. (2020) study comes purely from kinematic data, while the other studies used photometric data and star counts to trace the warp shape.

The Classical models have a warp amplitude that lies somewhere in the middle of these models, a stronger warp at ∼14 kpc than in all but the Amôres et al. (2017) case, but flattening out to be fairly similar to both the López-Corredoira et al. (2002) and Cheng et al. (2020). This flattening is not something one would expect for a typical warp, and does come with significant uncertainties. The vR term models have a much smaller amplitude than all but the Chrobáková et al. (2020) warp, and do not look like a warp in the sense that they do not tend towards greater heights at greater radii.

Our Classical models are close to the results from Cheng et al. (2020), which in one sense is unsurprising, as we both fitted the kinematic data alone. However, one might have expected the results from our vR term models to be more similar to theirs, since they also used these models in their analysis. The reason for this difference is probably that we analysed the data in several azimuthal bins at each radius, while Cheng et al. (2020) combined data at all azimuths into a single radial bin, thus reducing the effect of the vR term on their model.

Our robustness tests in which we cut the data at different maximum values of R produced very similar height profiles to those found here for all models, and we show these in the appendix. Our robustness tests in which we look at 2 kpc wide rings in R had slightly low values of h at R = 16 in the Classical models, with large uncertainties. The results from the vR term models looking at 2 kpc wide rings had greater h than those we see here, but this reflects a weakness of this piecewise-linear model applied to this limited sample, rather than a worrying feature of the data analysis. We discuss this, and show results from all of these robustness tests in Appendix B.

In summary, the Classical models, in addition to providing a better fit to the data, also provided a warp amplitude that is more in line with the literature. The vR term models, as well as being a worse fit for the data, do not resemble warps as we would expect them. We draw the conclusion that the Classical models are much more likely to be a helpful representation of the warp in the Milky Way.

5.2. Line of nodes

The positions of the line of nodes for each of our four models are shown in Fig. 6. We plot it from R = 6 kpc outwards, as our model warp has zero amplitude inside this radius. In both the cases where the line of nodes is held at a fixed value, it lies around ϕ = 140°. This reflects the clear signature in the data (Fig. 1) that towards ϕ = 230° the vertical velocity is close to zero. The Classical twisted model has a line of nodes that forms a trailing spiral, while the vR term twisted model’s line of nodes is a leading spiral that is much more tightly wound. In both cases these cross 140° at R ≈ 12 kpc, which is the radius where the velocity signal of the warp appears to be strongest.

thumbnail Fig. 6.

Position of the line of nodes for our Classical (left) and vR term models. The Classical twisted model has a line of nodes that forms a trailing spiral, while the vR term twisted model’s line of nodes is a leading spiral that is much more wound up.

This line of nodes is significantly different from that found in the literature using the current positions of stars. Most early 21st-century studies of the Milky Way’s warp place ϕw, phys very close to 180° (e.g. Drimmel et al. 2000; López-Corredoira et al. 2002; Levine et al. 2006) but some more recent work has placed it closer to 165° in both the Cepheid population (e.g. Skowron et al. 2019) and the older population (e.g. Momany et al. 2006; Amôres et al. 2017).

Both Chen et al. (2019) and Dehnen et al. (2023) find a line of nodes for samples of Cepheids that is close to 180° for R = 12 kpc, but then decreases almost linearly in ϕ (a leading spiral) to 140° around R = 16 kpc. This is measured using the positions of the Cepheids by both studies, while Dehnen et al. (2023) find the same result again when they also use the velocities to measure the warp’s properties from the angular momenta of the stars around the Galactic centre4. Inside 12 kpc, the warp measured using only the positions of the Cepheids is instead a trailing spiral in both studies. The warp measured by Dehnen et al. (2023) using angular momentum is generally too small inside 12 kpc to draw any conclusions about its line of nodes there.

Perhaps most relevantly for understanding our results, using Gaia DR2 Romero-Gómez et al. (2019) find that, for red giant stars, their estimates of the position of the line of nodes using the positions (ϕw, phys ∼ 180° −200°) and kinematics (ϕw ∼ 160° −170°) of the stars are rather different from one another. They attributed this to the warp being lopsided, with the line of nodes being offset from the maximum vertical velocity.

The robustness tests we conducted suggest that the trailing spiral of the line of nodes seen in the Classical twisted model is not a secure result from our data. In Fig. 7 we show two examples of results from these tests which illustrate this (more complete figures can be found in Appendix B). When we divided the data into 2 kpc annuli and fitted a Classical model to each, we found that the different lines of nodes appeared to be forming a leading spiral. When we varied the outermost radius of the data we fitted, we found that the twist of the line of nodes swings between trailing for the R < 13 and 14 kpc samples, to leading for the R < 15 and 16 kpc samples, then back to trailing for the R < 17 and 18 kpc samples. We therefore cannot make a strong statement about the twist of the line of nodes.

thumbnail Fig. 7.

Position of the line of nodes for two of our robustness tests. The left panel shows the results of our Classical model when we cut the data into 2 kpc wide rings from 11 − 13 kpc to 14 − 16 kpc. The right panel shows our Classical twisted model when we cut the data at different maximum radii from 13 to 18 kpc. To avoid clutter in the figures we only show the line of nodes for the best fitting model for each sample. It is worth noting that in the right panel the models for data with R < 15 or 16 kpc are so similar that they sit on almost perfectly top of each other. The different trends seen for the different samples make it hard to draw concrete conclusions about the twist of the line of nodes.

We can however say with some confidence that the kinematically estimated line of nodes based on this population is clearly leading that found in the positions of similar samples (or from Cepheids) and that this result is robust against selection of different subsets of our data.

5.3. Precession rate

Figure 8 shows the precession rate of the line of nodes for each of our four models. The Classical models had a precession rate of ωp ≈ −13.5 km s−1 kpc−1 (prograde with the disc rotation). This would be equivalent of a full rotation around the galaxy in around half a Gyr. The Classical twisted model had negligible change in precession rate with radius.

thumbnail Fig. 8.

Precession of the line of nodes for our four models. Note that the disc is spinning in the negative sense, so the two Classical models are both precessing in the prograde sense, with very similar precession rates. The Classical twisted model has negligible change in precession rate with radius. The two vR term models have very different precession rates with the vR term twisted model having a dramatic change in precession rate with radius.

This relatively fast prograde precession of the warp is consistent with the results from Poggio et al. (2020) and Cheng et al. (2020). It is consistent with results for Cepheids from Dehnen et al. (2023) at R ≈ 12 kpc, but they found that the precession speed decreased to ∼6 km s−1 kpc−1 for R ≳ 13 kpc. Our Classical twisted model allows for this possibility, but does not find it.

The vR term model had a retrograde precession of around 8 km s−1 kpc−1, while the vR term twisted model had a dramatic change in precession rate with radius of ∼3 km s−1 kpc−2, changing sign from prograde at lower radii to retrograde for R ≳ 12 kpc. This seems rather unlikely, and we consider it be a consequence of the fitting algorithm hunting down a mathematically acceptable solution for a model where a physically meaningful one was not available.

Our robustness tests reinforce these results, with the tests in which we set a maximum R value all being consistent with these values. The tests with 2 kpc wide rings in R are the same for the Classical models, but the vR models become more consistent with the Classical models. As is the case for the h(R) in these models, this reflects a weakness of this piecewise-linear model applied to this limited sample.

5.4. Results summary

The Classical models provided a better fit to the data, with an amplitude and shape consistent with typical warps, and a precession speed consistent with other measures in the literature. Their line of nodes did not lie where one would expect given results in the literature based on the positions of stars. The vR term models provide a worse fit to the data, with an amplitude and shape that are not consistent with typical warps and, in the twisted case, a precession speed that varies dramatically with radius.

6. Discussion

6.1. The vR term

Cheng et al. (2020) introduced the idea that one could include the velocity of stars radially in the disc (vR) in the kind of Jeans modelling of the warp that we have used here. While they had a smaller dataset, and combined stars at all azimuths (ϕ) into a single bin at each radius, we have exploited the larger dataset from Gaia DR3 to look at the vertical velocity of the warp as a function of ϕ as well as R.

It is important to recognise that the magnitude of the vR terms’ effect on the model vertical velocity (Eq. (9)) is 90° out of phase with the effect of the other term (the only term in the Classical models, Eq. (8)). The Classical models have greatest v ¯ z , mod $ \bar{v}_{z,\mathrm{mod}} $ at the line of nodes because their value depends on the derivative of warp height with ϕ, which is greatest there. The vR terms, in contrast, have their maximum effect at the ϕ value where the warp height is maximum, because their value depends on the derivative of warp height with R. This means that the influence of the vR terms is zero at the line of nodes.

Our results with the Classical models are very similar to those of Cheng et al. (2020), while those with the vR term models are very different. The most likely explanation for this is that when the Cheng et al. (2020) study combined all azimuths into a single bin at each radius, they treated the stars as all being at the median azimuth of the bin. This will be close to 180° which they assumed was the location of the line of nodes. This means that the vR terms will have had a small effect in their fits. In contrast, because we have fit the data in bins of ϕ at each radius, the vR terms have a much greater effect on our fits across the disc.

We should be a little cautious, because as Gaia Collaboration (2023b) point out, vR estimates are prone to systematic errors across the disc. We can expect that our use of the Bailer-Jones et al. (2021) Bayesian distances has reduced these errors, and since our robustness tests with much more local samples come to similar results, we are confident that including these vR terms makes the fit to the data worse, even when we have allowed the properties to the warp to vary with radius in a way that, especially for the precession of the warp, seems physically implausible. It seems therefore that including these terms makes our model of the warp significantly worse.

The radial motions of stars are not the only things omitted from our Classical models that might be important for analysing large scale trends due to the warp. We ignore any possible variation of warp amplitude with time, and our variation with ϕ is a simple sinusoid. Cabrera-Gadea et al. (2024) found that for a sample of Cepheids the lopsidedness of the warp (parameterised as an m = 2 mode) was an important factor, while the change in amplitude was not. We defer investigation of these for the bulk population to future work.

As Box (1976) noted: all models are wrong, but some are useful. Our warp models cannot capture the full complexity of the warp, but they can be useful in understanding the warp’s properties. The assumption underlying our models is that the warp is fixed, except for its precession. We therefore require that a star moving radially “knows about” the height of the warp at the radii it is moving towards. The radial motions only exist because the warp is not fixed, and the outer parts of the disc have a complicated structure including spiral arms. Introducing these radial motions into our models is therefore a way of trying to capture the complexity of the warp. However, it seems that this is not a successful way of doing so.

6.2. The inconsistent line of nodes

The line of nodes in our Classical models lies near ϕ = 140° at R = 12 kpc. As discussed in Sect. 5 this is not the same as the line of nodes found by numerous studies of the warp in the Milky Way, whether of position of the bulk population or young populations such as Cepheids, which tend to be closer to 165° −180°.

From this we conclude that the observed structure and kinematics of the bulk population in the outer disc are inconsistent with the picture of a fixed, precessing, warped disc. This stands in contrast to the younger Cepheid population for which Dehnen et al. (2023) found warps defined by the positions or angular momenta of the stars were much more consistent – though not perfectly consistent, as they discuss. We note also that Cabrera-Gadea et al. (2024) found that the line of maximum z velocity for the Cepheid population, which for our model would be the line of nodes, trailed the line of nodes found from the positions of stars by ∼20°, the opposite of the trend we see in the older population.

Our approach within this paper has been to look only at the kinematics of the sample. This is a pragmatic choice, since it avoids dealing with the dust extinction within the Milky Way, but it has great scientific merit to look at the two probes (density distribution and kinematics) separately, and see if they are consistent. Doing so allows us to check our assumptions, and we can see that they do not hold up.

We are not the first to note that the line of nodes determined from kinematics is offset from the Galactic anticentre, with Poggio et al. (2018) noting that the maximum in v ¯ z $ \bar{v}_z $ was at ϕ < 180° and Romero-Gómez et al. (2019) placing it in the range 160° −170°. Our study places the line of nodes even further from the anti-centre than these studies. This is appears to be because we fitted a model to the kinematics alone, and this increases the significance that v ¯ z 0 $ \bar{v}_{z} \approx 0 $ at ϕ ≈ 220° even at R ∼ 12 kpc where we see the strongest positive v ¯ z $ \bar{v}_{z} $ elsewhere in the disc. This is 90° away from the line of nodes in our warp model.

6.3. The disturbed Milky Way

The Milky Way’s disc is not a particularly calm and settled system. The spiral arms create large disturbances in the radial velocities of stars within the disc (e.g. Gaia Collaboration 2018b). Even before Gaia DR2, disturbance of the vertical velocity of stars across the disc had been and found and associated with “corrugations” or “bending” and “breathing” modes of the disc (Widrow et al. 2014; Schönrich & Dehnen 2018). One of the first discoveries with Gaia DR2 was the “phase-spiral” (Antoja et al. 2018) which is a disturbance seen in the density in the z − vz plane of stars in the Solar neighbourhood. Further studies have shown that this is how far from the Solar neighbourhood this feature can be found (Xu et al. 2020), and that it grows particularly strong when looking specifically at stars with orbital guiding centre radii around 10 kpc (angular momentum ∼2300 km s−1 kpc, Alinder et al. 2023).

Investigation of the outer disc by Gaia Collaboration (2021) and McMillan et al. (2022) showed that there is a bifurcation of the velocity distribution at radii near 12 kpc, which varies as we look around the disc. This is near the peak of the warp signal in our data.

None of this behaviour is captured by a simple warp model of the kind we have used here. Perhaps, therefore, we should not be surprised by the fact that our kinematic model of the warp is not consistent with the positions of stars in the outer disc. None-the-less, we have quantified this inconsistency and are left with quite a stringent constraint for any model of the creation of the Milky Way’s warp. A range of such models have been proposed in the recent literature, most focussing on the LMC and its effect on the dark-matter halo (e.g. Han et al. 2023), the Sagittarius dwarf (e.g. Poggio et al. 2021) or a combination of both (e.g. Laporte et al. 2018). To match what we observe in the Milky Way, these models need to have warps that appear to be precessing fast in the prograde sense, and the line of nodes in the kinematics of the bulk population must lead the line of nodes in the positions of the bulk population by a substantial amount.

In this sense, we also see a difference between the warp seen in the Cepheid population, which is young, and the bulk population, which is older. The results of Dehnen et al. (2023) and Cabrera-Gadea et al. (2024) suggest that the line of nodes in simple kinematic models of the Cepheid population are roughly consistent with that in position, or that they trail it, while our results have a line of nodes in the kinematic model that leads one found in position. It may be that the Cepheids, a dynamically cold population, had a different response to perturbation than the dynamically hotter bulk population. Perhaps a more likely explanation is that the Cepheids more closely represent the disturbance of the gas disc from which they formed, while the bulk population is dominated by stars that were already in the disc when it was disturbed, and therefore we see the response of the collisionless component of the disc to the disturbance.

The difference between the responses to a Sagittarius-like perturber of the gas and stellar discs in a Milky-Way-like galaxy was investigated by Tepper-García et al. (2022). They refer to the variation in v ¯ z $ \bar{v}_z $ for stars and gas across the disc as a corrugation, and note that they are initially in phase, but move apart after 500 − 700 Myr. They argue that the difference between the two disturbances may help to determine the age of the perturbation. In this context the difference between the warp seen in the Cepheids and the bulk population may be a sign that the warp is due to a relatively old perturbation of the Milky Way’s disc, but we will defer more quantitive statements about this to future work.

7. Conclusions

In this study we have used Gaia DR3 data to study the vertical motion of stars affected by the Milky Way’s warp. The exquisite quality of the data allowed us to fit a kinematic model to observations spanning a significant fraction of the Milky Way’s disc. We have used this to investigate the position and behaviour of the warp, under the assumption that it is a fixed, precessing, warped disc.

The kinematic signature of the warp in the bulk population is consistent with a warp that is precessing at around 13 km s−1 kpc−1 in a prograde direction with a line of nodes around ϕ = 140°, which is 40° prograde of the Sun’s position. We cannot come to any robust conclusions about how this line of nodes varies with radius, because different subsamples of our data lead us to differing results.

Including the expected effects of radial motions on the continuity-equation-based model of the warp makes the fit to the data worse. We conclude that this is not a helpful addition to models of the warp’s kinematics.

The kinematic signature of the warp, under the assumption that it is fixed and precessing, does not line up with the position of the warp seen in star counts. We are forced to conclude that this model is not sufficient to describe the system, and that this provides a challenge to models of the formation of the warp in the Milky Way. The difference between this behaviour and that of the Cepheid population may provide a clue as to the nature and date of the last major perturbation of the Milky Way’s disc.


1

Here, as throughout the paper, we use “radial” velocity to mean inwards or outwards in the disc. For the component of velocity towards or away from the Sun, we use the term “line-of-sight” velocity.

4

This is neither a measurement using purely physical positions nor purely kinematics.

Acknowledgments

The authors are grateful to the anonymous referee for suggestions which strengthened this work, to Walter Dehnen for discussing his work on the warp with us, and providing us with a pre-publication copy of his paper. P.M. thanks Eloisa Poggio and Joss Bland-Hawthorn for helpful advice. The authors gratefully acknowledge support from project grants from the Swedish Research Council (Vetenskaprådet, Reg: 2017-03721; 2021-04153). Some of the computations in this project were completed on computing equipment bought with a grant from The Royal Physiographic Society in Lund. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. This research has made use of NASA’s Astrophysics Data System. This paper made use of the following software packages for Python: Numpy (Harris et al. 2020), AstroPy (Astropy Collaboration 2013, 2018, 2022), emcee (Foreman-Mackey et al. 2013), SciPy (Virtanen et al. 2020), Pandas (McKinney 2010), and Matplotlib (Hunter 2007).

References

  1. Alinder, S., McMillan, P. J., & Bensby, T. 2023, A&A, 678, A46 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  2. Amôres, E. B., Robin, A. C., & Reylé, C. 2017, A&A, 602, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  3. Antoja, T., Helmi, A., Romero-Gómez, M., et al. 2018, Nature, 561, 360 [Google Scholar]
  4. Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  5. Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]
  6. Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
  7. Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Demleitner, M., & Andrae, R. 2021, AJ, 161, 147 [Google Scholar]
  8. Bailin, J. 2003, ApJ, 583, L79 [NASA ADS] [CrossRef] [Google Scholar]
  9. Bosma, A. 1981, AJ, 86, 1825 [NASA ADS] [CrossRef] [Google Scholar]
  10. Box, G. E. P. 1976, J. Am. Stat. Assoc., 71, 791 [CrossRef] [Google Scholar]
  11. Briggs, F. H. 1990, ApJ, 352, 15 [NASA ADS] [CrossRef] [Google Scholar]
  12. Burke, B. F. 1957, AJ, 62, 90 [Google Scholar]
  13. Cabrera-Gadea, M., Mateu, C., Ramos, P., et al. 2024, MNRAS, 528, 4409 [NASA ADS] [CrossRef] [Google Scholar]
  14. Chen, X., Wang, S., Deng, L., et al. 2019, Nat. Astron., 3, 320 [NASA ADS] [CrossRef] [Google Scholar]
  15. Cheng, X., Anguiano, B., Majewski, S. R., et al. 2020, ApJ, 905, 49 [Google Scholar]
  16. Chrobáková, Ž., & López-Corredoira, M. 2021, ApJ, 912, 130 [CrossRef] [Google Scholar]
  17. Chrobáková, Ž., Nagy, R., & López-Corredoira, M. 2020, A&A, 637, A96 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Dehnen, W. 1998, AJ, 115, 2384 [NASA ADS] [CrossRef] [Google Scholar]
  19. Dehnen, W., Semczuk, M., & Schönrich, R. 2023, MNRAS, 523, 1556 [NASA ADS] [CrossRef] [Google Scholar]
  20. Djorgovski, S., & Sosin, C. 1989, ApJ, 341, L13 [NASA ADS] [CrossRef] [Google Scholar]
  21. Drimmel, R., & Poggio, E. 2018, Res. Notes Am. Astron. Soc., 2, 210 [Google Scholar]
  22. Drimmel, R., Smart, R. L., & Lattanzi, M. G. 2000, A&A, 354, 67 [NASA ADS] [Google Scholar]
  23. Dubinski, J., & Chakrabarty, D. 2009, ApJ, 703, 2068 [NASA ADS] [CrossRef] [Google Scholar]
  24. Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
  25. Freudenreich, H. T. 1996, ApJ, 468, 663 [NASA ADS] [CrossRef] [Google Scholar]
  26. Gaia Collaboration (Prusti, T., et al.) 2016, A&A, 595, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  27. Gaia Collaboration (Brown, A. G. A., et al.) 2018a, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  28. Gaia Collaboration (Katz, D., et al.) 2018b, A&A, 616, A11 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  29. Gaia Collaboration (Antoja, T., et al.) 2021, A&A, 649, A8 [EDP Sciences] [Google Scholar]
  30. Gaia Collaboration (Vallenari, A., et al.) 2023a, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  31. Gaia Collaboration (Drimmel, R., et al.) 2023b, A&A, 674, A37 [CrossRef] [EDP Sciences] [Google Scholar]
  32. Goodman, J., & Weare, J. 2010, Commun. Appl. Math. Comput. Sci., 5, 65 [Google Scholar]
  33. GRAVITY Collaboration (Abuter, R., et al.) 2018, A&A, 615, L15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  34. Han, J. J., Conroy, C., & Hernquist, L. 2023, Nat. Astron., 7, 1481 [CrossRef] [Google Scholar]
  35. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [Google Scholar]
  36. Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [Google Scholar]
  37. Jeans, J. H. 1915, MNRAS, 76, 70 [Google Scholar]
  38. Kerr, F. J. 1957, AJ, 62, 93 [NASA ADS] [CrossRef] [Google Scholar]
  39. Kerr, F. J., & Hindman, J. V. 1957, PASP, 69, 558 [NASA ADS] [CrossRef] [Google Scholar]
  40. Laporte, C. F. P., Johnston, K. V., Gómez, F. A., Garavito-Camargo, N., & Besla, G. 2018, MNRAS, 481, 286 [Google Scholar]
  41. Levine, E. S., Blitz, L., & Heiles, C. 2006, ApJ, 643, 881 [NASA ADS] [CrossRef] [Google Scholar]
  42. Lindegren, L., Klioner, S. A., Hernández, J., et al. 2021, A&A, 649, A2 [EDP Sciences] [Google Scholar]
  43. López-Corredoira, M., Cabrera-Lavers, A., Garzón, F., & Hammersley, P. L. 2002, A&A, 394, 883 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. McKinney, W. 2010, in Proceedings of the 9th Python in Science Conference, eds. S. van der Walt, & J. Millman, 56 [Google Scholar]
  45. McMillan, P. J., Petersson, J., Tepper-Garcia, T., et al. 2022, MNRAS, 516, 4988 [NASA ADS] [CrossRef] [Google Scholar]
  46. Momany, Y., Zaggia, S., Gilmore, G., et al. 2006, A&A, 451, 515 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Ostriker, E. C., & Binney, J. J. 1989, MNRAS, 237, 785 [NASA ADS] [CrossRef] [Google Scholar]
  48. Poggio, E., Drimmel, R., Lattanzi, M. G., et al. 2018, MNRAS, 481, L21 [Google Scholar]
  49. Poggio, E., Drimmel, R., Andrae, R., et al. 2020, Nat. Astron., 4, 590 [NASA ADS] [CrossRef] [Google Scholar]
  50. Poggio, E., Laporte, C. F. P., Johnston, K. V., et al. 2021, MNRAS, 508, 541 [NASA ADS] [CrossRef] [Google Scholar]
  51. Reid, M. J., & Brunthaler, A. 2004, ApJ, 616, 872 [Google Scholar]
  52. Rogstad, D. H., Lockhart, I. A., & Wright, M. C. H. 1974, ApJ, 193, 309 [Google Scholar]
  53. Romero-Gómez, M., Mateu, C., Aguilar, L., Figueras, F., & Castro-Ginard, A. 2019, 53rd ESLAB Symposium: The Gaia Universe, held 8–12 April, 2019 at ESTEC/ESA, Noordwijk, The Netherlands, 20 [Google Scholar]
  54. Schönrich, R., & Dehnen, W. 2018, MNRAS, 478, 3809 [Google Scholar]
  55. Sellwood, J. A., & Debattista, V. P. 2021, MNRAS, 510, 1375 [NASA ADS] [CrossRef] [Google Scholar]
  56. Skowron, D., Skowron, J., Mróz, P., et al. 2019, Acta Astron., 69, 305 [NASA ADS] [Google Scholar]
  57. Sánchez-Saavedra, M., Battaner, E., & Florido, E. 1990, MNRAS, 246, 458 [Google Scholar]
  58. Tepper-García, T., Bland-Hawthorn, J., & Freeman, K. 2022, MNRAS, 515, 5951 [CrossRef] [Google Scholar]
  59. Uppal, N., Ganesh, S., & Schultheis, M. 2024, MNRAS, 527, 4863 [Google Scholar]
  60. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Meth., 17, 261 [Google Scholar]
  61. Westerhout, G. 1957, Bull. Astron. Inst. Neth., 13, 312 [Google Scholar]
  62. Widrow, L. M., Barber, J., Chequers, M. H., & Cheng, E. 2014, MNRAS, 440, 1971 [Google Scholar]
  63. Xu, Y., Liu, C., Tian, H., et al. 2020, ApJ, 905, 6 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: ADQL query

Our sample is selected from the https://gea.esac.esa.int/archive/ table gaiadr3.gaia_source_lite, cross-matched with its table external.gaiaedr3_distance which contains the Bailer-Jones et al. (2021) distance estimates. We use the following ADQL query: We make the final selections to remove stars associated with 47 Tucanae, on parallax uncertainty and RUWE within our data processing.

SELECT source_id, ra, dec,
parallax, parallax_error, pmra, pmdec,
phot_g_mean_mag, bp_rp, ruwe,
r_med_photogeo, r_lo_photogeo, r_hi_photogeo,
radial_velocity, radial_velocity_error
FROM external.gaiaedr3_distance
JOIN gaiadr3.gaia_source_lite
USING (source_id)
WHERE 0=CONTAINS(POINT("ICRS",ra,dec),
    CIRCLE("ICRS",81.28,-69.78,12))
AND 0=CONTAINS(POINT("ICRS",ra,dec),
    CIRCLE("ICRS",13.1583,-72.8003,6))
AND radial_velocity IS NOT NULL
AND parallax IS NOT NULL

Appendix B: Best fitting parameter values and robustness tests

We present the best fitting parameters from all of our models in Table B.1. These correspond to the values that we illustrate graphically in Figs. 5, 6, and 8.

Table B.1.

Parameters of the best fitting models for our main sample with associated uncertainties.

The results of our robustness test with a minimum of 1000 stars per spatial bin are in Tab. B.2, and it is clear that the results are consistent with the full sample, but with much larger uncertainties in the outer bins and in ωp for the vR term models. An increase in uncertainty is, of course, to be expected since there is less data to constrain the model.

Table B.2.

Parameters of the best fitting models for tests using only spatial bins with 1000 stars or more in the fit.

Figures B.1 & B.2 show the results of our tests with samples selected by Galactocentric radius, either divided into 2 kpc wide annuli (Fig. B.1) or with an upper limit on R (Fig. B.2). In each case we show h(R), ϕw(R) and ωw(R) for the best fitting model with indicative error bars that are symmetrical around the best fit.

thumbnail Fig. B.1.

Maximum warp heights (h – left group of panels), line of nodes (ϕw – centre group of panels), and warp pattern speed (ωp, right group of panels) as a function of radius for our four models, where the data have been binned into overlapping 2 kpc rings in R, as indicated. Note that the line of nodes appears to be twisting in a prograde direction in all cases.

thumbnail Fig. B.2.

Maximum warp heights (h – left group of panels), line of nodes (ϕw – centre group of panels), and warp pattern speed (ωp, right group of panels) as a function of radius for our four models, where the data are truncated at different radii, as indicated.

The maximum warp heights as a function of radius, h(R), are consistent with the values found for our main sample for the two Classical models, with a slight tendency in some cases for lower warp heights in the outer parts of the warp. These still appear to be consistent within the uncertainties. The two vR term models are very much consistent for all the samples truncated at different R, as seen in Fig. B.2, but this is not the case for the samples divided into 2 kpc wide annuli, as seen in Fig. B.1 (lower left panels). In this case the warp heights are significantly higher than for vR term models fitted to the other samples – much closer to the values found for the Classical models. It is noticeable that these fits are inconsistent with one another where they overlap, and that the radial gradient of each is negative. Because we have to start this model at a non-zero height (to allow for normal behaviour in a warp at these radii), the fitting algorithm is able to find solutions with a significant height but never with a positive gradient in h(R). This allows it to find these solutions without ‘paying the price’ of a positive h(R) gradient elsewhere in the disc that would presumably harm the fit with the data. Note that the vR term models – unlike the Classical models – explicitly include the gradient of the warp height with radius in the model (eq. 9). These models are completely unrealistic physically, and should be thought of as a numerical difficulty introduced by using the piecewise linear model for h(R) for these samples.

The lines of nodes for the two Classical models applied to the different data samples tell an interesting, hard to disentangle, story which we discuss in Sect. 5.2. The samples divided into 2 kpc wide annuli (Fig. B.1) appear to show a leading spiral which joins up neatly and consistently for the twisted model. This is the exact opposite of that found with our full sample or when we set a stricter limit of 1000 stars per bin. We can gain a bit more insight by looking at results for the Classical twisted model when we truncate the sample at different radii (upper row, fourth panel of Fig. B.2, also shown as a polar plot in Fig. 7). Depending on the cut we take, we can either end up with a leading or trailing spiral of the line of nodes. This does not follow a simple trend, being leading if we truncate at R = 13 or 14 kpc, trailing if we truncate at 15 or 16 kpc, and leading again with truncations at larger radii.

These results lead us to conclude that our analysis is not capable of providing a robust answer to the question of the twist of the warp’s line of nodes.

The results for the pattern speed of the warp for the different data samples are, for the Classical models, consistent with our main samples’ values, sometimes with large uncertainties. The pattern speed found for 2 kpc annuli with the vR terms are, as with the values of h(R), closer to those for the full sample with the Classical models. This is again due to the numerical difficulty introduced by using the piecewise linear model for h(R) for these samples – there is a strong correlation between h(R) and ωp in the models (eqns. 8 & 9) so a problem with the description of h(R) transfers naturally to a problem with ωp.

All Tables

Table B.1.

Parameters of the best fitting models for our main sample with associated uncertainties.

Table B.2.

Parameters of the best fitting models for tests using only spatial bins with 1000 stars or more in the fit.

All Figures

thumbnail Fig. 1.

Distribution (left) and average vertical velocity (right) of our sample over the Galactic plane. This is binned, with each bin covering 0.5 kpc in R and 10° in ϕ, both in galactocentric coordinates, with stars in the disc rotating in the negative ϕ direction. We note that moving radially outwards from the position of the Sun v ¯ z $ \bar{v}_z $ rises to around 10 km s−1 before falling to below zero, but that this behaviour disappears for ϕ ≳ 220°.

In the text
thumbnail Fig. 2.

Fits to the v ¯ z $ \bar{v}_z $ data for Classical models. Each panel shows data for 10° sectors of the Galaxy, and plots the average vz in 0.5 kpc bins, with associated uncertainties (statistical and systematic). The model fits with a single value of ϕw and ωp are plotted in black, while those of the twisted model are in light blue. Overall both models provide reasonable fits, without major differences.

In the text
thumbnail Fig. 3.

Fits to the v ¯ z $ \bar{v}_z $ data for the vR term models. As in Fig. 2, each panel shows data for 10° sectors of the Galaxy, and plots the average vz in 0.5 kpc bins, with associated uncertainties (statistical and systematic). Note that the uncertainties are larger here because we include uncertainties propagated from the vR uncertainties. The models with a single value of ϕw and ωp are plotted in orange, while those of the twisted models are in green. Both models provided reasonable fits, but note in particular that model v ¯ z $ \bar{v}_z $ values are all around zero at larger R.

In the text
thumbnail Fig. 4.

Quantitative comparison of the four models. We compute a quantity analogous to the chi-squared statistic per degree of freedom, χ ν 2 $ \chi^2_\nu $, for each model, and plot the distribution of these values for the final 50 000 steps of the MCMC chain. We note that the Classical models perform significantly better than the vR term.

In the text
thumbnail Fig. 5.

Maximum warp amplitude as a function of radius for our four models (top panel), and for a selection of recent estimates for the older population – rather than in Cepheids or other young populations – in the literature (lower panel). Two of these studies looked at the lopsidedness of the warp and in these cases we give both the warp upwards and downwards, indicated by the two lines for Amôres et al. (2017, with the upward warp being the greater) and the uncertainty bar for the Red Giant Branch (RGB) results of Romero-Gómez et al. (2019, with the downward warp being the greater value).

In the text
thumbnail Fig. 6.

Position of the line of nodes for our Classical (left) and vR term models. The Classical twisted model has a line of nodes that forms a trailing spiral, while the vR term twisted model’s line of nodes is a leading spiral that is much more wound up.

In the text
thumbnail Fig. 7.

Position of the line of nodes for two of our robustness tests. The left panel shows the results of our Classical model when we cut the data into 2 kpc wide rings from 11 − 13 kpc to 14 − 16 kpc. The right panel shows our Classical twisted model when we cut the data at different maximum radii from 13 to 18 kpc. To avoid clutter in the figures we only show the line of nodes for the best fitting model for each sample. It is worth noting that in the right panel the models for data with R < 15 or 16 kpc are so similar that they sit on almost perfectly top of each other. The different trends seen for the different samples make it hard to draw concrete conclusions about the twist of the line of nodes.

In the text
thumbnail Fig. 8.

Precession of the line of nodes for our four models. Note that the disc is spinning in the negative sense, so the two Classical models are both precessing in the prograde sense, with very similar precession rates. The Classical twisted model has negligible change in precession rate with radius. The two vR term models have very different precession rates with the vR term twisted model having a dramatic change in precession rate with radius.

In the text
thumbnail Fig. B.1.

Maximum warp heights (h – left group of panels), line of nodes (ϕw – centre group of panels), and warp pattern speed (ωp, right group of panels) as a function of radius for our four models, where the data have been binned into overlapping 2 kpc rings in R, as indicated. Note that the line of nodes appears to be twisting in a prograde direction in all cases.

In the text
thumbnail Fig. B.2.

Maximum warp heights (h – left group of panels), line of nodes (ϕw – centre group of panels), and warp pattern speed (ωp, right group of panels) as a function of radius for our four models, where the data are truncated at different radii, as indicated.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.