A general basis set algorithm for galactic haloes and discs

E. J. Lilley; G. van de Ven

doi:10.1051/0004-6361/202245730

Home

All issues

Volume 672 (April 2023)

A&A, 672 (2023) A91

Full HTML

Open Access

Issue		A&A Volume 672, April 2023


Article Number		A91
Number of page(s)		21
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/202245730
Published online		06 April 2023

A&A 672, A91 (2023)

A general basis set algorithm for galactic haloes and discs

E. J. Lilley and G. van de Ven

Department of Astrophysics, University of Vienna, Türkenschanzstraße 17, 1180 Vienna, Austria
e-mail: edward.lilley@univie.ac.at; glenn.vandeven@univie.ac.at

Received: 19 December 2022
Accepted: 14 February 2023

Abstract

We present a unified approach to (bi-)orthogonal basis sets for gravitating systems. Central to our discussion is the notion of mutual gravitational energy, which gives rise to a ‘self-energy inner product’ on mass densities. We consider a first-order differential operator that is self-adjoint with respect to this inner product, and prove a general theorem that gives the conditions under which a (bi-)orthogonal basis set arises by repeated application of this differential operator. We then show that these conditions are fulfilled by all the families of analytical basis sets with infinite extent that have been discovered to date. The new theoretical framework turns out to be closely connected to Fourier-Mellin transforms, and it is a powerful tool for constructing general basis sets. We demonstrate this by deriving a basis set for the isochrone model and demonstrating its numerical reliability by reproducing a known result concerning unstable radial modes.

Key words: galaxies: halos / galaxies: structure / methods: numerical

© The Authors 2023

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

Orthogonal basis sets play a key role in the efficient calculation of the gravitational potential of perturbed, isolated mass distributions. They can also be of great value when investigating the stability of dynamical models for galaxies. Both these topics have attracted renewed interest recently in light of the mounting observational evidence that the Milky Way and other galaxies are not as symmetric in shape as assumed previously (Vera-Ciro & Helmi 2013; Law & Majewski 2010), and moreover that they may not be in exact dynamical equilibrium (Erkal et al. 2021; Petersen & Peñarrubia 2021). A small sample of recent applications of basis sets includes: efficiently reconstructing individual trajectories in time-varying snapshots of N-body simulations of dark matter haloes (Lowing et al. 2011; Sanders et al. 2020; Petersen et al. 2022a); flexible non-parametric models for the Milky Way (Garavito-Camargo et al. 2021); and a wide variety of perturbation calculations (Hamilton et al. 2018; Fouvry & Prunet 2022).

The development of the so-called biorthogonal basis sets began with Clutton-Brock (1972, 1973), who introduced two remarkable analytical sets of potential-density pairs based on the Kuzmin (1956) disc and Plummer (1911) sphere, respectively. These mathematical discoveries (along with some later results discussed below), while fortunate, are limited. It has long been recognised that to make best use of the basis set technique, one would prefer complete freedom in choosing the zeroth-order (as well as underlying coordinate system and geometry), while making minimal sacrifices regarding computational efficiency.

To this end, there are basically three possible directions of generalisation. One might hope to have the good fortune of finding other ‘analytical’ basis sets, taking some known model as the zeroth-order potential-density pair and then hoping that by some ingenious change of variables or integral transform a set of orthogonal higher-order functions could be written down. This approach is limited but has provided a handful of further results in both spherical polar coordinates (Hernquist & Ostriker 1992; Zhao 1996; Rahmati & Jalali 2009; Lilley et al. 2018b,a) and for infinitesimally thin discs (Kalnajs 1976; Qian 1993). Generally speaking, for both spheres and thin discs, basis sets exist for some double power laws and for certain types of exponential distributions of mass.

Secondly, one could posit an arbitrary sequence of nonorthogonal potential-density pairs, and from them derive an orthogonal set using the Gram-Schmidt algorithm. This is the approach of Saha (1993), Robijn & Earn (1996). The downsides are the large number of expensive numerical integrations required to compute the required inner products, the numerical instability inherent to the Gram-Schmidt process, and the uncertain completeness or convergence properties of the resulting orthogonal basis.

Lastly, the strategy devised by Weinberg (1999), Petersen et al. (2022b) generalises the original result of Clutton-Brock (1973) directly by noticing that the potential-density relation takes the form of a Sturm-Liouville eigenfunction equation with a certain weight function; by choosing a different weight function and using a numerical Sturm-Liouville solver, a different set of eigenfunctions (and hence basis set) can be found. This approach has the upside that certain guarantees about completeness| and convergence can be made, but the downside that the resulting eigenfunctions must be tabulated numerically on a coordinate grid.

In this paper we describe a different generalisation of Clutton-BrockÆs original results – we jettison the eigenfunction equation but retain a three-term recurrence relation. Essentially our approach is motivated by the observation that the extant basis sets¹ so far described in the literature admit the curious property of ‘tridiagonality’ with respect to a radial derivative operator. That is, for a given density basis function ρ_n(r) (suppressing the angular indices and coordinates), the following holds, $r \frac{\partial ρ_{n}}{\partial r} = a_{n} ρ_{n - 1} + b_{n} ρ_{n} + c_{n} ρ_{n + 1},$ $r{{\partial {\rho _n}} \over {\partial r}} = {a_n}{\rho _{n - 1}} + {b_n}{\rho _n} + {c_n}{\rho _{n + 1}},$ (1)

where a_n, b_n and c_n are constants. This may seem to be merely a curiosity, but upon further reflection it motivates a far-reaching generalisation: armed with just knowledge of an arbitrary (smooth) zeroth-order basis element, the tridiagonality property (Eq. (1)) allows us to build up an entire ladder of basis elements recursively, using just one additional integral perrecur-sive step. The resulting basis elements are linear combinations of derivatives of the zeroth order and hence require no further interpolation. Along these lines, in Sect. 2 we present an algorithm to generate general basis sets from arbitrary zeroth-order potential-density pairs.

Underlying this main result is a link to the general theory of orthogonal polynomials, which motivates us to claim completeness of the resulting basis sets. This theoretical background is discussed in Sect. 3, where we introduce the Fourier-Mellin transform, and show a correspondence between tridiagonal orthogonal basis sets and orthogonal polynomials in the transformed space. Key to this link is the notion of the gravitational self-energy inner product, and an operator (𝒟) that is self-adjoint with respect to it.

The new approach was, in part, first suggested implicitly by Kalnajs (1976), who introduced the Fourier-Mellin transform (but not naming it as such) in the case of thin discs², but nevertheless only used it to rederive the Clutton-Brock (1972) basis set. Those results are partially repeated (using our updated notation) in Sect. 4.2, where we show that a formalism equivalent to the spherical case exists for thin discs in cylindrical polar coordinates, along with a similar self-adjoint operator (𝒜).

As further motivation for our new algorithm, in Sect. 4 we demonstrate concretely how the formalism applies to some existing basis sets in the literature. Specifically we show that the two major families of basis sets – corresponding to double power laws in the spherical (Lilley et al. 2018a) and thin disc (Qian 1993) scenarios (along with their various limiting forms) – both possess the tridiagonality property, and hence each admit a representation in terms of a polynomial in 𝒟 or 𝒜, respectively.

In Sect. 5, we return to the general algorithm described in Sect. 2, and discuss the numerical and computational issues that arise when trying to implement it in practice. In particular it is necessary to find a fast, stable method to evaluate the requisite numerical integrals. This is most easily accomplished using Gauss-Laguerre quadrature in the transformed (Fourier–Mellin) space, first computing the underlying system of orthogonal polynomials. The recommended procedure is illustrated with the case of the isochrone model, which we use in Sect. 5.4 to recover a known result about unstable radial modes.

Finally in Sect. 6, we discuss some of the geometric ideas underlying the new formalism. We outline how our results might be extended to other geometries or coordinate systems relevant to the study of realistic galaxies, and give an outlook on future work to be done in the area.

2 Description of algorithm

First we make some new definitions as well as recapitulating the standard terminology. We define the ‘self-energy inner product’ 〈·, ·〉 on mass densities $〈 ρ_{1}, ρ_{2} 〉 = \int d^{3} r \int d^{3} r^{'} \frac{ρ_{1} (r) \bar{ρ_{2} (r^{'})}}{‖ r - r^{'} ‖} .$ $\left\langle {{\rho _1},{\rho _2}} \right\rangle = \int {{{\rm{d}}^3}{\bf{r}}\int {{{\rm{d}}^3}{\bf{r'}}} {{{\rho _1}\left( {\bf{r}} \right)\overline {{\rho _2}\left( {{\bf{r'}}} \right)} } \over {\left\| {{\bf{r}} - {\bf{r'}}} \right\|}}} .$ (2)

This is sometimes referred to as the mutual gravitational potential energy of ρ₁ with respect to ρ₂. Of course, the total self-energy is just ‖ρ‖² = 〈ρ,ρ〉, which here must clearly always be real and positive (although the normal convention is for this quantity to be negative, the overall choice of sign is irrelevant for our purposes). It is important that Eq. (2) obeys the standard properties of an inner product: linear in its first and conjugate linear in its second argument. Generically we allow mass densities to be complex-valued, as it eases some of the following derivations; however the entire formalism (necessarily) also works in the case of purely real mass densities. We are also not limited to densities with finite total mass, only finite total self-energy³. Finally we note that if we have a solution to Poisson’s equation for ρ₁ and ρ₂, finding their gravitational potentials to be Φ₁ and Φ₂, then (using Green’s identities) we can rewrite the inner product (Eq. (2)) as $〈 ρ_{1}, ρ_{2} 〉 = \frac{1}{4 π} \int d^{3} r \nabla Φ_{1} \cdot \bar{\nabla Φ_{2}},$ $\left\langle {{\rho _1},{\rho _2}} \right\rangle = {\textstyle{1 \over {4\pi }}}\int {{{\rm{d}}^3}{\bf{r}}\,\nabla {{\rm{\Phi }}_1} \cdot \overline {\nabla {{\rm{\Phi }}_2}} ,}$ (3)

or alternatively as $〈 ρ_{1}, ρ_{2} 〉 = - \int d^{3} r Φ_{1} \bar{ρ_{2}} .$ $\left\langle {{\rho _1},{\rho _2}} \right\rangle = - \int {{{\rm{d}}^3}{\bf{r}}\,{{\rm{\Phi }}_1}\overline {{\rho _2}} .}$ (4)

We set the gravitational constant G = 1 throughout. Now we introduce both spherical polar coordinates (r, φ, ϑ) and cylindrical polar coordinates (R, φ, z), the latter being used here only in the situation where the mass density is confined to a thin disc aligned with the z-axis. We define two important operators, $𝒟 = i (r \partial_{r} + \frac{5}{2})$ ${\rm{D}} = {\rm{i}}\,\,\left( {r{\partial _{\rm{r}}} + {5 \over 2}} \right)$ (5)

and $𝒜 = i (R \partial_{R} + \frac{3}{2}) .$ ${\rm{A}} = {\rm{i}}\,\,\left( {R{\partial _{\rm{R}}} + {3 \over 2}} \right).$ (6)

These have the important property of being self-adjoint with respect to the inner product (Eq. (2)) (see Appendix A for a proof), that is $〈 𝒟 f, g 〉 = 〈 f, 𝒟 g 〉,$ $\left\langle {{\rm{D}}f,g} \right\rangle = \left\langle {f,Dg} \right\rangle ,$ (7)

and (when f and ɡ are thin discs) $〈 𝒜 f, g 〉 = 〈 f, 𝒜 g 〉 .$ $\left\langle {{\rm{A}}f,g} \right\rangle = \left\langle {f,Ag} \right\rangle .$ (8)

Our standard notation for basis sets is as follows. We denote by {ρ_nlm}a complete basis for the set of smooth mass densities satisfying: ${‖ ρ ‖}^{2} < \infty (finite self-energy),$ ${\left\| \rho \right\|^2} gt; \infty \quad \quad \left( {{\rm{finite}}\,{\rm{self - energy}}} \right),$ (9)

The set {ρ_nlm} is assumed orthogonal with respect to Eq. (2), $〈 ρ_{n l m}, ρ_{n^{'} l^{'} m^{'}} 〉 = N_{n l m} δ_{n l m}^{n^{'} l^{'} m^{'}}, N_{n l m} = - K_{n l} N_{n l} .$ $\left\langle {{\rho _{nlm}},{\rho _{n'l'm'}}} \right\rangle = {N_{nlm}}\delta _{nlm}^{n'l'm'},\quad \quad {N_{nlm}} = - {K_{nl}}{N_{nl}}.$ (10)

These basis functions are the product of radial and angular components, $\begin{array}{l} Φ_{n l m} (r) = Φ_{n l} (r) Y_{l m} (\hat{r}), \\ ρ_{n l m} (r) = K_{n l} ρ_{n l} (r) Y_{l m} (\hat{r}), \end{array}$ $\matrix{ {{{\rm{\Phi }}_{nlm}}\left( {\bf{r}} \right) = {{\rm{\Phi }}_{nl}}\left( r \right)\,\,{Y_{lm}}\left( {{\bf{\hat r}}} \right),} \hfill \cr {{\rho _{nlm}}\left( {\bf{r}} \right) = {K_{nl}}{\rho _{nl}}\left( r \right)\,\,{Y_{lm}}\left( {{\bf{\hat r}}} \right),} \hfill \cr }$ (11)

which satisfy $\begin{array}{l} \nabla^{2} Φ_{n l m} = 4 π ρ_{n l m}, \\ \nabla_{l}^{2} Φ_{n l} = 4 π K_{n l} ρ_{n l} . \end{array}$ $\matrix{ {{\nabla ^2}{{\rm{\Phi }}_{nlm}} = 4\pi {\rho _{nlm}},} \hfill \cr {\,\,\,\nabla _l^2{{\rm{\Phi }}_{nl}} = 4\pi {K_{nl}}{\rho _{nl}}.} \hfill \cr }$ (12)

where K_nl are constants factored out of ρ_nl just to simplify the expressions; and $\nabla_{l}^{2}$ $\nabla _l^2$ is the radial part of the Laplacian when operating on (radial functions) × (a spherical harmonic of order l): $\nabla_{l}^{2} = r^{- 2} \frac{d}{d r} (r^{2} \frac{d}{d r}) - \frac{l (l + 1)}{r^{2}} .$ $\nabla _l^2 = {r^{ - 2}}{{\rm{d}} \over {{\rm{d}}r}}\left( {{r^2}{{\rm{d}} \over {{\rm{d}}r}}} \right) - {{l\left( {l + 1} \right)} \over {{r^2}}}.$ (13)

The purely radial functions ρ_nl (r) and Φ_nl(r) are real-valued, and satisfy a biorthogonality relation $\int_{0}^{\infty} d r r^{2} Φ_{n l} ρ_{n l} = N_{n l} δ_{n n^{'}},$ $\int_0^\infty {{\rm{d}}r\,{r^2}\,\,{{\rm{\Phi }}_{nl}}\,{\rho _{nl}}} = {N_{nl}}{\delta _{nn'}},$ (14)

and it is for this reason that such basis sets are traditionally referred to as ‘bi-orthogonal’. We take Y_lm throughout to be a unit-normalised (complex) spherical harmonic. If non-orthonormal spherical harmonics are employed then N_nlm must contain the appropriate factor that normalises them. The radial functions Φ_nl and ρ_nl are typically functions of the quantity r/r_s, where r_s is some ‘scalelength’ with units of length; we generally use r_s = 1 implicitly⁴.

An analogous notational convention is used throughout for the case of a thin disc. We write {σ_nm} to represent a complete basis, where $\nabla^{2} (ψ_{n m} (R, z) e^{i m φ}) = σ_{n m} (R, z) = σ_{n m} (R) e^{i m φ} δ (z) .$ ${\nabla ^2}\left( {{\psi _{nm}}\left( {R,z} \right){{\rm{e}}^{{\rm{i}}m\varphi }}} \right) = {\sigma _{nm}}\left( {R,z} \right) = {\sigma _{nm}}\left( R \right){{\rm{e}}^{{\rm{i}}m\varphi }}\,\delta \left( z \right).$ (15)

In an abuse of notation we suppress the z-dependence and elide the quantities which have subscript nm, writing the potential in the disc plane as $ψ_{n m} (R) = ψ_{n m} (R) e^{i m φ} .$ ${\psi _{nm}}\left( {\bf{R}} \right) = {\psi _{nm}}\left( R \right)\,\,{{\rm{e}}^{{\rm{i}}m\varphi }}.$ (16)

We now describe a natural method for deriving basis sets with any smooth analytical zeroth-order element. We shall focus on the spherical case, and afterwards describe the (slight) changes required in the thin disc case.

The first step is to choose a suitable zeroth-order potential, which we denote Φ (r). This can be chosen according to the problem at hand, the only requirements being that it must be a smooth spherically symmetric function of r, and the potential-density pair must have finite total gravitational self-energy. Starting from Φ we must then invent a function Φ_0l(r) that provides the zeroth radial order for the higher multipoles indexed by l. This function must satisfy two boundary conditions⁵: Φ_0l ~ r^l as r → 0 and Φ_0l ~ r^−l−1 as r → ∞. One way to achieve this is to take $Φ_{0 l} (r) = r^{l} {[Φ (r)]}^{2 l + 1},$ ${{\rm{\Phi }}_{{\rm{0}}l}}\left( r \right) = {r^l}{\left[ {{\rm{\Phi }}\left( r \right)} \right]^{2l + 1}},$ (17)

but any choice with the correct asymptotic behaviour will do just as well⁶. Once Φ_0l is chosen, the corresponding density multipoles ρ_0l are fully determined by $\nabla_{l}^{2} Φ_{0 l} = 4 π K_{0 l} ρ_{0 l},$ $\nabla _l^2{{\rm{\Phi }}_{0l}} = 4\pi {K_{0l}}{\rho _{0l}},$ (18)

where K_0l is an arbitrary constant chosen to simplify the algebra.

The defining relation for the basis set with zeroth order ρ_0l is the differential-recurrence relation, $ρ_{n + 1, l} = (r \partial_{r} + \frac{5}{2}) ρ_{n l} + β_{n l} ρ_{n - 1, l},$ ${\rho _{n + 1,l}} = \left( {r{\partial _r} + {5 \over 2}} \right){\rho _{nl}} + {\beta _{nl}}{\rho _{n - 1,l}},$ (19)

with initial conditions ρ_−1,l = 0, and where β_nl are some (as yet undetermined) constants. Note that the operator applied to the ρ_nl term on the RHS is equal to −i𝒟 (Eq. (5)). We can immediately write down a similar recurrence for the potential elements, $Φ_{n + 1, l} = (r \partial_{r} + \frac{1}{2}) Φ_{n l} + β_{n l} Φ_{n - 1, l},$ ${{\rm{\Phi }}_{n + 1,l}} = \left( {r{\partial _r} + {1 \over 2}} \right){{\rm{\Phi }}_{nl}} + {\beta _{nl}}{{\rm{\Phi }}_{n - 1,l}},$ (20)

due to the commutation relation between 𝒟 and the radial Laplacian $\nabla_{l}^{2}$ $\nabla _l^2$ (see Appendix B). By taking the inner product of Eq. (19) with both ρ_n+1,l and ρ_n−1,l, and exploiting the self-adjointness property (Eq. (7)), we find that the constants β_nl are given by $β_{n l} = \frac{{‖ ρ_{n l} ‖}^{2}}{{‖ ρ_{n - 1, l} ‖}^{2}} .$ ${\beta _{nl}} = {{{{\left\| {{\rho _{nl}}} \right\|}^2}} \over {{{\left\| {{\rho _{n - 1,l}}} \right\|}^2}}}.$ (21)

This is just the ratio of the gravitational self-energy of the nth and (n − 1)th basis elements. Because the RHS of Eq. (19) depends only on the nth and lower elements, we can now build up the entire sequence of basis elements by alternating applications of Eqs. (19) and (21).

This deceptively simple algorithm leaves some unresolved issues: firstly, whether these basis sets are truly complete; secondly, how we deal with the differentiation required in Eq. (19); and thirdly, whether the numerical integrals in Eq. (21) are stable.

We can give at least convincing heuristic answers to these questions. The question of completeness we consider in the course of the theoretical discussion in Sect. 3.2. The repeated differentiation will in general require some form of ‘symbolic’ or ‘automatic’ differentiation, which we discuss in Sect. 5.3 –unless the specific form of the zeroth-order allows for a simplification. The question of numerically calculating the recurrence coefficients β_nl is thorny, and we return to it in Sect. 5 after developing in Sect. 3 the theoretical machinery that links these basis sets to the theory of general orthogonal polynomials.

Our resulting basis elements are linear combinations of the higher-derivatives of the zeroth-order functions: {𝒟ⁿρ_0lm} in the case of the density, and {𝒟ⁿΦ_0lm} in the case of the potential. This means that, given a closed form zeroth-order, all higher elements are generated through differentiation – no numerical interpolation is required, unlike Weinberg (1999)’s algorithm based on Sturm–Liouville eigenfunctions. In fact, given a particular zeroth-order, a basis computed via the Sturm–Liouville approach will not in general coincide with the basis set developed from our own algorithm, except for certain special cases that are known to obey eigenfunction equations (for example the Zhao 1996 basis sets).

In addition, unlike Saha (1993), we are able to avoid the brute force approach of Gram Schmidt orthogonalisation (with complexity O(n²) in the number of inner products, and uncertain numerical stability). This is due to the self-adjointness of the operator 𝒟, which ensures that each basis set maps onto an underlying orthogonal polynomial in Fourier-Mellin space, a mathematical connection elaborated upon in Sect. 3. Thus we can reuse the large body of literature regarding the construction of general orthogonal polynomials, the most important property being that any set of orthogonal polynomials obeys a three-term recurrence relation - this relation is transferred over to the basis set, manifesting as the differential-recurrence relation (19).

Lastly we note that in the case of a thin disc the surface densities σ_nm have fundamental differential-recurrence relation $σ_{n + 1, m} = (R \partial_{R} + \frac{3}{2}) σ_{n m} + β_{n m} σ_{n - 1, m},$ ${\sigma _{n + 1,m}} = \left( {R{\partial _R} + {\textstyle{3 \over 2}}} \right){\sigma _{nm}} + {\beta _{nm}}{\sigma _{n - 1,m}},$ (22)

where the operator applied to σ_nm on the RHS is now -i𝒜 (Eq. (6)); but the algorithm is otherwise identical to the spherical case. In both the spherical and thin disc case the algorithm can be initialised by choosing either the zeroth-order potential or the zeroth-order density; but starting with a density may be more difficult in the thin disc case, as analytical potential-density pairs are harder to come by the required boundary conditions on ψ_0m (with azimuthal index m standing in for l) are unchanged, as is the requirement of smoothness and finite self-energy.

3 Theoretical background

3.1 Functional calculus of 𝒟 and the Fourier–Mellin transform

Consider the eigenfunctions of 𝒟, which we denote Ψ_s. These satisfy ${𝒟Ψ}_{s} = s Ψ_{s}, s \in ℝ$ ${\rm{D}}{{\rm{\Psi }}_s} = s{{\rm{\Psi }}_s},\quad \quad s\, \in \,{\rm{R}}$ (23)

and have the form $Ψ_{s} (r) = r^{- i s - 5 / 2} .$ ${{\rm{\Psi }}_s}\left( r \right) = {r^{ - {\rm{i}}s - {5 \mathord{\left/ {\vphantom {5 2}} \right. \kern-\nulldelimiterspace} 2}}}.$ (24)

We combine this with a spherical harmonic to define the ‘𝒟-eigenbasis’ $Ψ_{s l m} (r) = Ψ_{s} (r) Y_{l m} (\hat{r}) .$ ${{\rm{\Psi }}_{slm}}\left( {\bf{r}} \right) = {{\rm{\Psi }}_s}\left( r \right)\,\,{Y_{lm}}\left( {{\bf{\hat r}}} \right).$ (25)

Now let F(r) be a general mass density, and F_1m(r) its spherical multipole moments. Then the expansion coefficient of F in the 𝒟-eigenbasis is (see Appendix C for proof) $〈 F, Ψ_{s l m} 〉 = \frac{4 π}{k_{l} (i s)} ℳ_{r} {F_{l m} (r)} (5 / 2 + i s),$ $\left\langle {F,{{\rm{\Psi }}_{slm}}} \right\rangle = {{4\pi } \over {{k_l}\left( {{\rm{i}}s} \right)}}{M_r}\left\{ {{F_{lm}}\left( r \right)} \right\}\left( {{5 \mathord{\left/ {\vphantom {5 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \right),$ (26)

where K_l(is) is defined through $K_{l} (s) = (l + 1 / 2 + s) (l + 1 / 2 - s),$ ${K_l}\left( s \right) = \left( {l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2} + s} \right)\left( {l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2} - s} \right),$ (27)

and ℳ_r is the Mellin transform, $ℳ_{r} {f (r)} (s) = \int_{0}^{\infty} r^{s - 1} f (r) d r .$ ${{\rm{M}}_r}\left\{ {f\left( r \right)} \right\}\left( s \right) = \int_0^\infty {{r^{s - 1}}\,f\left( r \right)\,{\rm{d}}r} .$ (28)

We subsequently refer (with some precedent) to the combination (Eq. (26)) of taking multipole moments and a Mellin transform as the three-dimensional ‘Fourier–Mellin transform’. We can re-express F in terms of its Fourier–Mellin expansion coefficients using the Mellin inversion theorem (via an appropriate change of variable), $F (r) = \frac{1}{8 π^{2}} \sum_{l m} \int_{- \infty}^{\infty} d s K_{l} (i s) Ψ_{s l m} (r) 〈 F, Ψ_{s l m} 〉 .$ $F\left( {\bf{r}} \right) = {1 \over {8{\pi ^2}}}\sum\limits_{lm} {\int_{ - \infty }^\infty {{\rm{d}}s\,{K_l}\left( {{\rm{i}}s} \right)\,{{\rm{\Psi }}_{slm}}\left( {\bf{r}} \right)\left\langle {F,{{\rm{\Psi }}_{slm}}} \right\rangle } .}$ (29)

where the inverse Mellin transform ℳ⁻¹ is $ℳ_{s}^{- 1} {g (s)} (r) = \frac{1}{2 π i} \int_{c - i \infty}^{c + i \infty} r^{- s} g (s) d s$ ${\rm{M}}_s^{ - 1}\left\{ {g\left( s \right)} \right\}\left( r \right) = {1 \over {2\pi {\rm{i}}}}\int_{c - {\rm{i}}\infty }^{c + {\rm{i}}\infty } {{r^{ - s}}\,g\left( s \right)\,{\rm{d}}s}$ (30)

for some constant c, the choice of which does not affect any of our results. The mutual gravitational energy of two general mass densities F and G can therefore be expressed as $〈 F, G 〉 = \frac{1}{8 π^{2}} \sum_{l m} \int_{- \infty}^{\infty} d s K_{l} (i s) 〈 F, Ψ_{s l m} 〉 〈 Ψ_{s l m}, G 〉 .$ $\left\langle {F,G} \right\rangle = {1 \over {8{\pi ^2}}}\sum\limits_{lm} {\int_{ - \infty }^\infty {{\rm{d}}s\,\,{K_l}\left( {{\rm{i}}s} \right)\left\langle {F,{{\rm{\Psi }}_{slm}}} \right\rangle \left\langle {{{\rm{\Psi }}_{slm}},G} \right\rangle } }$ (31)

Because 𝒟 is self-adjoint the spectral theorem applies, and we can consider arbitrary bounded complex-valued functions of 𝒟. The Fourier–Mellin transform can be viewed as the (unitary) map to the space in which 𝒟 acts as a multiplication operator. In practice though, we can limit ourselves to considering polynomials in 𝒟.

The formalism developed above also applies mutatis mutandis to the thin disc case. The derivation is now mostly the same as that found in Kalnajs (1971,1976), but we update his notation. Our self-adjoint operator A has eigenfunctions Σ_s satisfying $\begin{matrix} 𝒜 \sum_{s} = s \sum_{s}, \\ \sum_{s} (R) = R^{- i s - 3 / 2}, \\ \sum_{s m} (R) = \sum_{s} (R) e^{i m φ} . \end{matrix}$ $\matrix{ {A{\sum _s} = s{\sum _s},} \cr {{\sum _s}\left( R \right) = {R^{ - {\rm{i}}s - {3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}}}} , \cr {{\sum _{sm}}\left( {\bf{\rm R}} \right) = {\sum _s}\left( R \right){{\rm{e}}^{{\rm{i}}m\varphi }}} \cr } .$ (32)

The functions Σ_sm(R) are Kalnajs’ ‘logarithmic spirals’⁷. For a general razor-thin mass density σ(R) we have the thin disc version of the Fourier–Mellin transform (see Appendix C.2 for proof), $〈 σ, \sum_{s m} 〉 = \frac{π}{K_{m} (i s)} ℳ_{R} {σ_{m} (R)} (3 / 2 + i s),$ $\left\langle {\sigma ,{\sum _{sm}}} \right\rangle = {\pi \over {{K_m}\left( {{\rm{i}}s} \right)}}{M_R}\left\{ {{\sigma _m}\left( R \right)} \right\}\left( {{3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \right),$ (33)

where σ_m(R) are the cylindrical multipoles of σ(R), and $K_{m} (i s) = {| \frac{Γ (\frac{m + 3 / 2 + i s}{2})}{Γ (\frac{m + 1 / 2 + i s}{2})} |}^{2} .$ ${K_m}\left( {{\rm{i}}s} \right) = {\left| {{{\Gamma \left( {{{m + {3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \over 2}} \right)} \over {\Gamma \left( {{{m + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \over 2}} \right)}}} \right|^2} .$ (34)

3.2 Tridiagonality and polynomials

Associated to each of our basis sets is a polynomial we refer to as the ‘index-raising polynomial’ – depending on the normalisation we write either ρ_n1(s) or ρ_nl(s) (for a polynomial of degree n in the variable s). The general result proved below is that applying the nth-degree polynomial with argument 𝒟 to the zeroth-order density element gives the nth-order density element. It may help with interpretation to note that these polynomials in a sense ‘live’ in the Fourier–Mellin space introduced in the previous section.

There are several related statements that one can make about a given basis set and its associated index-raising polynomial:

The tridiagonality of the density basis functions {ρ_nlm} with respect to the operator 𝒟;
The expressibility of each basis function in terms of a polynomial in 𝒟 applied to the lowest-order basis function, with these polynomials obeying a three-term recurrence relation;
The orthogonality of ρ_nl(s) with respect to a weight function w_l(s) given in terms of the Mellin transform of ρ_0l;
The orthogonality of the basis functions {ρ_nlm} with respect to the self-energy inner product 〈· , ·〉.

Below we show that the first and second statements are equivalent. We also find that the third statement implies the fourth, and the second and fourth together imply the third. However, while it is easy to show that the third statement implies the second, the converse is much harder. Favard’s theorem guarantees that a set of polynomials obeying a three-term recurrence relation is orthogonal with respect to some measure, however this is a difficult computation and is not what we actually want. In practice we want the freedom to specify zeroth-order basis elements, not the recurrence coefficients themselves.

Therefore, to construct an arbitrary basis set we impose the first and fourth statements. Then the second and third statements (which provide the polynomials P_nl(s) or p_nl(s)) are a useful representation of the underlying basis set, which we exploit in order to solve numerical issues in the implementation described in Sect. 5.

The idea of finding orthogonal polynomials from tridiagonal matrices or operators is not new; in the finite-dimensional case the corresponding matrix is called a ‘Jacobi matrix’, and gives rise to polynomials of discrete argument. Our work invokes the infinite-dimensional case, in which a ‘Jacobi operator’ (here 𝒟 or 𝒜) operates on an infinite sequence of functions, which we generally assume to be a complete orthogonal set that spans the relevant function space. Such infinite-dimensional Jacobi operators are studied in Granovskii & Zhedanov (1986), Ismail & Koelink (2011), Dombrowski (1985), and our set-up mimics the development given in the first paper, with the difference that our 𝒟 and 𝒜 are taken as given and do not arise from any Lie algebraic considerations⁸.

As in the previous section, we give the main derivations in the case of spherical polar coordinates; the thin disc case then follows with little modification.

3.2.1 Polynomials from tridiagonality

We show that any set of densities {ρ_nlm} that is tridiagonal with respect to D gives rise to an expression for each ρ_nlm in terms of an index-raising polynomial in 𝒟, of the form $ρ_{n l m} = P_{n l} (𝒟) ρ_{0 l m} .$ ${\rho _{nlm}} = {P_{nl}}\left( D \right){\rho _{0lm}} .$ (35)

By ‘tridiagonality’ we mean that the following expression holds, $𝒟 ρ_{n l m} = a_{n l} ρ_{n - 1, l m} + b_{n l} ρ_{n l m} + c_{n l} ρ_{n + 1, l m},$ $D{\rho _{nlm}} = {a_{nl}}{\rho _{n - 1,lm}} + {b_{nl}}{\rho _{nlm}} + {c_{nl}}{\rho _{n + 1,lm}} ,$ (36)

for some constants a_nl, b_nl and c_nl. First, define $χ_{n l m} = 𝒟^{n} ρ_{0 l m} .$ ${\chi _{nlm}} = {D^n}{\rho _{0lm}} .$ (37)

From Eq. (36), there exists an expansion of χ_nlm of the form $χ_{n l m} = \sum_{j = 0}^{n} B_{n l j} ρ_{j l m} .$ ${\chi _{nlm}} = \sum\limits_{j = 0}^n {{B_{nlj}}{\rho _{jlm}}} .$ (38)

Then by inverting B_njl(interpreted as a matrix with respect to the n j indices) it is evidently possible to write an expansion for ρ_nlm of the form $ρ_{n l m} = \sum_{j = 0}^{n} A_{n l} χ_{j l m} .$ ${\rho _{nlm}} = \sum\limits_{j = 0}^n {{A_{nl}}{\chi _{jlm}}} .$ (39)

Now make the definition $P_{n l} (s) = \frac{〈 Ψ_{s l m}, ρ_{n l m} 〉}{〈 Ψ_{s l m}, ρ_{0 l m} 〉} .$ ${P_{nl}}\left( s \right) = {{\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{nlm}}} \right\rangle } \over {\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{0lm}}} \right\rangle }} .$ (40)

To prove that ρ_nl(s) is a polynomial, take the Fourier–Mellin expansion of Eq. (39), $\begin{array}{l} 〈 Ψ_{s l m}, ρ_{n l m} 〉 = \sum_{j = 0}^{n} A_{n j l} 〈 Ψ_{s l m}, χ_{j l m} 〉 \\ = \sum_{j = 0}^{n} A_{n j l} s^{j} 〈 Ψ_{s l m}, ρ_{0 l m} 〉, \end{array}$ $\matrix{ {\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{nlm}}} \right\rangle = \sum\limits_{j = 0}^n {{A_{njl}}\left\langle {{{\rm{\Psi }}_{slm}},{\chi _{jlm}}} \right\rangle } } \hfill \cr {\quad \quad \quad \quad \,\,\,\, = \sum\limits_{j = 0}^n {{A_{njl}}{s^j}\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{0lm}}} \right\rangle ,} } \hfill \cr }$ (41)

where the second equality uses the self-adjointness property (Eq. (7)) as well as the definition of the eigenbasis (Eq. (25)). Dividing through by 〈Ψ_slm,ρ_0lm〉 then gives P_nl(s) as a polynomial in s with (as yet undetermined) coefficients A_njl. But from the definition (Eq. (39)), we see that P_nl(𝒟) is just the operator expression for ρ_nlm that raises the radial index from 0 to n, which is Eq. (35). To find the three-term recurrence relation for P_nl(s), take the Fourier–Mellin expansion of Eq. (36), divide through by 〈Ψ_slm,ρ_0lm〉, and rearrange, giving $P_{n + 1, l} (s) = \frac{s - b_{n l}}{c_{n l}} P_{n l} (s) - \frac{a_{n l}}{c_{n l}} P_{n - 1, l} (s) .$ ${P_{n + 1,l}\left( s \right)} = {{s - {b_{nl}}} \over {{c_{nl}}}}{P_{nl}}\left( s \right) - {{{a_{nl}}} \over {{c_{nl}}}}{P_{n - 1,l}\left( s \right)} .$ (42)

For the converse statement, substituting 𝒟 for s in the above recurrence and left-applying to ρ_0lm trivially recovers the tridiagonality property (we must also take the initial conditions P_0l = 1 and P_−1l = 0).

3.2.2 Orthogonal polynomials

From Favard’s theorem we know that the P_nl(𝒟) are a system of orthogonal polynomials, as they satisfy a three-term recurrence relation (42). However, in order to actually construct the orthogonalising weight function, we first assume that the underlying basis functions are already orthogonal. It follows that $\begin{array}{l} 〈 ρ_{n l m}, ρ_{n' l' m'} 〉 = δ_{l l'} δ_{m m'} \int_{- \infty}^{\infty} d s ω_{l} (s) P_{n l} (s) \bar{P_{n' l} (s)} \\ \propto δ_{n n'} δ_{l l'} δ_{m m'}, \end{array}$ $\matrix{ {\left\langle {{\rho _{nlm}},{\rho _{n\prime l\prime m\prime }}} \right\rangle = {\delta _{ll\prime }}{\delta _{mm\prime }}\int_{ - \infty }^\infty {{\rm{d}}s{\omega _l}\left( s \right){P_{nl}}\left( s \right)\overline {{P_{n\prime l}}\left( s \right)} } } \hfill \cr {\quad \quad \quad \quad \quad \propto {\delta _{nn\prime }}{\delta _{ll\prime }}{\delta _{mm\prime }},} \hfill \cr }$ (43)

where the (positive, real-valued) weight function w_l(s) is related to the zeroth-order density basis function ρ_0l by $ω_{l} (s) \frac{2 K_{0 l}^{2}}{K_{l} (i s)} {| ℳ_{r} {ρ_{0 l} (r)} (5 / 2 + i s) |}^{2},$ ${\omega _l}\left( s \right){{2K_{0l}^2} \over {{K_l}\left( {{\rm{i}}s} \right)}}{\left| {{M_r}\left\{ {{\rho _{0l}}\left( r \right)} \right\}\left( {{5 \mathord{\left/ {\vphantom {5 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \right)} \right|^2},$ (44)

the proof of which is in Appendix D. The orthogonality relation (43) works in both directions: if we instead assume that P_nl(s) are orthogonal with respect to (a given) w_l(s), then the orthogonality of the ρ_nlm follows.

In fact, P_nl(s) can be written in terms of purely real polynomials p_nl(s), which are also orthogonal with respect to $\int_{- \infty}^{\infty} ω_{l} (s) p_{n l} (s) p_{n' l} (s) d s \propto δ_{n n'},$ $\int_{ - \infty }^\infty {{\omega _l}\left( s \right){p_{nl}}\left( s \right){p_{n\prime l}}\left( s \right){\rm{d}}s \propto {\delta _{nn\prime }},}$ (45)

with $P_{n l} (s) \propto i^{- n} p_{n l} (s) .$ ${P_{nl}}\left( s \right) \propto {{\rm{i}}^{ - n}}{p_{nl}}\left( s \right) .$ (46)

It is often more convenient in applications to deal with these real-valued polynomials. Without loss of generality (up to normalisation) we can take the polynomials p_nl(s) to be monic⁹, obeying a three-term recurrence relation $p_{n + 1, l} (s) = s p_{n l} (s) - β_{n l} p_{n - 1, l} (s) .$ ${p_{n + 1,l}}\left( s \right) = s\,\,{p_{nl}}\left( s \right) - {\beta _{nl}}\,\,{p_{n - 1,l}}\left( s \right) .$ (47)

In this way we only have to consider the single sequence of recurrence coefficients β_nl. According to this normalisation the Pnl(s) therefore obey the recurrence $p_{n + 1, l} (s) = - i s P_{n l} (s) + β_{n l} P_{n - 1, l} (s) .$ ${p_{n + 1,l}}\left( s \right) = - {\rm{i}}s\,\,{P_{nl}}\left( s \right) + {\beta _{nl}}\,\,{P_{n - 1,l}}\left( s \right) .$ (48)

Replacing s with 𝒟 and applying to ρ_0l on the right then leads to the defining recurrence for the density basis elements (Eq. (19)). Alternatively we can express the density and potential directly in terms of p_nl(s), $\begin{array}{l} Φ_{n l m} = i^{- n} p_{n l} (i (r \partial_{r} + 1 / 2)) Φ_{0 l m}, \\ ρ_{n l m} = i^{- n} p_{n l} (i (r \partial_{r} + 5 / 2)) ρ_{o l m} . \end{array}$ $\matrix{ {{{\rm{\Phi }}_{nlm}} = {{\rm{i}}^{ - n}}{p_{nl}}\left( {{\rm{i}}\left( {r{\partial _r} + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right)} \right){{\rm{\Phi }}_{0lm}},} \hfill \cr {{\rho _{nlm}} = {{\rm{i}}^{ - n}}{p_{nl}}\left( {{\rm{i}}\left( {r{\partial _r} + {5 \mathord{\left/ {\vphantom {5 2}} \right. \kern-\nulldelimiterspace} 2}} \right)} \right){\rho _{olm}}} . \hfill \cr }$ (49)

3.2.3 Disc case

As expected, similar results apply in the case of thin discs. Take {σ_nm} to be a set of infinitesimally thin surface densities that are tridiagonal with respect to the operator 𝒜 and orthogonal with respect to 〈·, ·〉. We have index-raising polynomials P_nm(s), defined by $P_{n m} (s) = \frac{〈 \sum_{s m}, σ_{n m} 〉}{〈 \sum_{s m}, σ_{0 m} 〉}$ ${P_{nm}}\left( s \right) = {{\left\langle {{\sum _{sm}},{\sigma _{nm}}} \right\rangle } \over {\left\langle {{\sum _{sm}},{\sigma _{0m}}} \right\rangle }}$ (50)

This gives rise to a representation of the basis functions via repeated application of the operator 𝒜, $\begin{array}{l} σ_{n m} = P_{n m} (𝒜) σ_{0 m}, \\ ψ_{n m} = P_{n m} (𝒜 - i) ψ_{0 m .} \end{array}$ $\matrix{ {{\sigma _{nm}} = {P_{nm}}\left( A \right){\sigma _{0m}},} \hfill \cr {{\psi _{nm}} = {P_{nm}}\left( {A - {\rm{i}}} \right){\psi _{0m.}}} \hfill \cr }$ (51)

The orthogonality relation can be written $\begin{array}{l} 〈 σ_{n m}, σ_{n' m'} 〉 = δ_{m m'} \int_{- \infty}^{\infty} d s Ω_{m} (s) P_{n m} (s) \bar{P_{n' m} (s)} \\ \propto δ_{n n'} δ_{m m'} \end{array}$ $\matrix{ {\left\langle {{\sigma _{nm}},{\sigma _{n\prime m\prime }}} \right\rangle = {\delta _{mm\prime }}\int_{ - \infty }^\infty {{\rm{d}}s\,{{\rm{\Omega }}_m}\left( s \right){P_{nm}}\left( s \right)\overline {{P_{n\prime m}}\left( s \right)} } } \hfill \cr {\quad \quad \quad \quad \,\,\, \propto {\delta _{nn\prime }}{\delta _{mm\prime }}} \hfill \cr }$ (52)

where $Ω_{m} (s) = \frac{{| ℳ_{R} {σ_{0 m} (R)} (3 / 2 + i s) |}^{2}}{4 π K_{m} (i s)} .$ ${{\rm{\Omega }}_m}\left( s \right) = {{{{\left| {{M_R}\left\{ {{\sigma _{0m}}\left( R \right)} \right\}\left( {{3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \right)} \right|}^2}} \over {4\pi {K_m}\left( {{\rm{i}}s} \right)}} .$ (53)

The details of this derivation are in Appendix D.2. As before we can instead use real-valued polynomials p_nm(s), also orthogonal with respect to Ω_m(s). The potential and surface density in terms of p_nm(s) are $\begin{array}{l} ψ_{n m} = i^{- n} p_{n m} (i (R \partial_{R} + 1 / 2)) ψ_{0 m}, \\ σ_{n m} = i^{- n} p_{n m} (i (R \partial_{R} + 3 / 2)) σ_{0 m .} \end{array}$ $\matrix{ {{\psi _{nm}} = {{\rm{i}}^{ - n}}{p_{nm}}\left( {{\rm{i}}\left( {R{\partial _R} + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right)} \right){\psi _{0m}},} \hfill \cr {{\sigma _{nm}} = {{\rm{i}}^{ - n}}{p_{nm}}\left( {{\rm{i}}\left( {R{\partial _R} + {3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}} \right)} \right){\sigma _{0m.}}} \hfill \cr }$ (54)

In general it is difficult to find the z-dependence of the potential for thin discs analytically, although in exceptional cases there may be a simple solution (for example the Kuzmin-Toomre discs). In any case because p_nm(𝒜) acts by differentiation with respect to R alone, this guarantees that if the z-dependence of the zeroth-order potential is known, then the correct the z-dependence is preserved in all higher-order potential basis elements. This will have important implications when considering the extension of our results to the Robijn & Earn (1996) method for thickened-disc basis sets, however we do not pursue this in the present work.

3.3 Completeness

We make some informal comments about the completeness of a general basis set {ρ_nlm}, derived from a zeroth-order ρ_0l(r) as described above. The completeness of the angular part of each basis (the spherical harmonics) is taken as given.

The question then of whether a set {ρ_0l, 𝒟ρ_0l, 𝒟²ρ_0l,…} forms a complete basis for (the th multipole of) the space of mass densities is the same as asking whether ρ_0l is a ‘cyclic vector’ for the operator 𝒟. This is related to the completeness of the associated orthogonal polynomials p_nl(s), as powers of 𝒟 correspond to powers of s; so we require that the monomials sⁿ (weighted by w_l(s)) form a complete basis for functions on the interval (−∞, ∞). This is achieved if w_l(s) is nonzero everywhere. By the definition of w_l(s), this then requires the Mellin transform of ρ_0l to be nonzero everywhere, which in turn requires that 𝒟ⁿρ_0l be non-vanishing everywhere for all n (Marín & Seubert 2006).

Therefore, to be a valid zeroth-order density, ρ_0l must fulfil the following: $\begin{array}{l} {‖ 𝒟^{n} ρ_{0 l} ‖}^{2} < \infty, \\ 𝒟^{n} ρ_{0 l} (r) = 0 at only isolated r . \end{array}$ $\matrix{ {{{\left\| {{D^n}{\rho _{0l}}} \right\|}^2}\,\, < \infty }, \hfill \cr {{D^n}{\rho _{0l}}\left( r \right) = 0\,{\rm{at}}\,{\rm{only}}\,{\rm{isolated}}\,r.} \hfill \cr }$ (55)

These conditions are required to hold for all n ∈ ℕ; restricting to n = 0 gives the conditions (9) on ‘representable’ mass densities. While these conditions are fairly restrictive, in general any reasonable analytical potential-density pairs will satisfy them; in particular those described in the following section whose corresponding basis sets or index-raising polynomials have closed form expressions.

4 Application to known basis sets

In Sect. 3, we developed a theoretical justification for the simple algorithm described in Sect. 2. We now provide further motivation by applying the formalism to some concrete examples of basis sets from the literature. Remarkably, all known analytical spherical (resp. thin disc) basis sets of infinite extent have a representation in terms of 𝒟 (resp. 𝒜). In fact, it is extremely theoretically suggestive that these previously described analytical basis sets have index-raising polynomials that can be written in terms of known classical orthogonal polynomials. The expressions we derive below for the various basis sets’ index-raising polynomials may appear complicated; however the presence of a classical polynomial indicates simply that in each case the recurrence coefficient β_nl (Eq. (19)) can be written as a rational combination of the given basis set’s fixed shape parameters.

4.1 Spherical case

4.1.1 Clutton-Brock’s Plummer basis set

The simplest possible useful basis set in spherical polar coordinates is that of Clutton-Brock (1973), which uses the Plummer (1911) model as its zeroth-order. By making an appropriate variable substitution, Clutton-Brock transformed the Poisson equation for the radial components (Eq. (12)) into the defining second-order differential equation for the Gegenbauer polynomials (DLMF, Sect. 18.8). Each radial density and potential component is proportional to just one polynomial, $\begin{array}{l} Φ_{n l}^{CB73} (r) = \frac{- r^{l}}{{(1 + r^{2})}^{l + 1 / 2}} C_{n}^{(l + 1)} (\frac{r^{2} - 1}{r^{2} + 1}), \\ ρ_{n l}^{CB73} (r) = \frac{- (2 n + 2 l + 3) (2 n + 2 l + 1) Φ_{n l}^{CB73} (r)}{4 π {(1 + r^{2})}^{2}}, \end{array}$ $\matrix{ {{\rm{\Phi }}_{nl}^{{\rm{CB73}}}\left( r \right) = {{ - {r^l}} \over {{{\left( {1 + {r^2}} \right)}^{l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}}}}C_n^{\left( {l + 1} \right)}\left( {{{{r^2} - 1} \over {{r^2} + 1}}} \right),} \hfill \cr {\rho _{nl}^{{\rm{CB73}}}\left( r \right) = {{ - \left( {2n + 2l + 3} \right)\left( {2n + 2l + 1} \right){\rm{\Phi }}_{nl}^{{\rm{CB73}}}\left( r \right)} \over {4\pi {{\left( {1 + {r^2}} \right)}^2}}},} \hfill \cr }$ (56)

and the normalisation constant is¹⁰ $\int_{0}^{\infty} r^{2} d r Φ_{n l}^{CB73} (r) ρ_{n l}^{CB73} (r) = - \frac{(2 n + 2 l + 3) (2 n + 2 l + 1) (n + 2 l + 1)!}{2^{4 l + 6} (n + l + 1) n! {(l!)}^{2}} .$ $\int_0^\infty {{r^2}{\rm{d}}r{\rm{\Phi }}_{nl}^{{\rm{CB73}}}\left( r \right)\rho _{nl}^{{\rm{CB73}}}\left( r \right) = - {{\left( {2n + 2l + 3} \right)\left( {2n + 2l + 1} \right)\left( {n + 2l + 1} \right)!} \over {{2^{4l + 6}}\left( {n + l + 1} \right)n!{{\left( {l!} \right)}^2}}}} .$ (57)

This basis set is in fact a special case of the family described in Sect. 4.1.2, but as it is the simplest (and earliest) of all the spherical basis sets we present it in some depth as a didactic example.

Plugging $ρ_{0 l}^{CB 73}$ $\rho _{0l}^{{\rm{CB}}73}$ into the definition of the weight function (Eq. (44)), we find that $\begin{array}{l} ω_{l}^{CB73} (s) = \frac{| Γ (1 / 4 + l / 2 + i s / 2) Γ (5 / 4 + l / 2 + i s / 2) |^{2}}{8 π^{2} Γ {(l + 1 / 2)}^{2}} \\ = \frac{w (s / 2; 1 / 4 + l / 2, 5 / 4 + l / 2)}{8 π^{2} Γ {(l + 1 / 2)}^{2}}, \end{array}$ $\matrix{ {\omega _l^{{\rm{CB73}}}\left( s \right) = {{\left| {\Gamma \left( {{1 \mathord{\left/ {\vphantom {1 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2} + {{{\rm{i}}s} \mathord{\left/ {\vphantom {{{\rm{i}}s} 2}} \right. \kern-\nulldelimiterspace} 2}} \right)\Gamma \left( {{5 \mathord{\left/ {\vphantom {5 {4 + {l \mathord{\left/ {\vphantom {l {2 + {{{\rm{i}}s} \mathord{\left/ {\vphantom {{{\rm{i}}s} 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2 + {{{\rm{i}}s} \mathord{\left/ {\vphantom {{{\rm{i}}s} 2}} \right. \kern-\nulldelimiterspace} 2}}}}}} \right. \kern-\nulldelimiterspace} {4 + {l \mathord{\left/ {\vphantom {l {2 + {{{\rm{i}}s} \mathord{\left/ {\vphantom {{{\rm{i}}s} 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2 + {{{\rm{i}}s} \mathord{\left/ {\vphantom {{{\rm{i}}s} 2}} \right. \kern-\nulldelimiterspace} 2}}}}}} \right)} \right|}^2 \over {8{\pi ^2}\Gamma {{\left( {{{l + 1} \mathord{\left/ {\vphantom {{l + 1} 2}} \right. \kern-\nulldelimiterspace} 2}} \right)}^2}}}} \hfill \cr {\quad \quad \quad \,\,\,\, = {{w\left( {{s \mathord{\left/ {\vphantom {s {2;{1 \mathord{\left/ {\vphantom {1 {4 + {l \mathord{\left/ {\vphantom {l {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}}}} \right. \kern-\nulldelimiterspace} {4 + {l \mathord{\left/ {\vphantom {l {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}}}}}} \right. \kern-\nulldelimiterspace} {2;{1 \mathord{\left/ {\vphantom {1 {4 + {l \mathord{\left/ {\vphantom {l {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}}}} \right. \kern-\nulldelimiterspace} {4 + {l \mathord{\left/ {\vphantom {l {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2,{5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} + {l \mathord{\left/ {\vphantom {l 2}} \right. \kern-\nulldelimiterspace} 2}}}}}}}} \right)} \over {8{\pi ^2}\Gamma {{\left( {l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right)}^2}}}} \hfill \cr } ,$ (58)

where w(x; a, b) is the weight function for the continuous Hahn polynomials p_n(x; a, b) (Appendix E.1). It can be verified that each basis element $ρ_{n l}^{CB 73}$ $\rho _{nl}^{{\rm{CB}}73}$ and $Φ_{n l}^{CB 73}$ ${\rm{\Phi }}_{nl}^{{\rm{CB}}73}$ indeed Fourier-Mellin-transforms into a single continuous Hahn polynomial. Specifically, we find that $\begin{array}{l} P_{n l}^{CB73} (s) = A_{n l} i^{n} p_{n} (\frac{s}{2}; \frac{1}{4} + \frac{l}{2}, \frac{5}{4} + \frac{l}{2}), \\ A_{n l} = \frac{\sqrt{π} Γ (l + 1 / 2) (n + 2 l + 1)!}{2^{2 l} (2 n + 2 l + 1) l! Γ {(n + l + 1 / 2)}^{2}} . \end{array}$ $\matrix{ {P_{nl}^{{\rm{CB73}}}\left( s \right) = {A_{nl}}{{\rm{i}}^n}{p_n}\left( {{s \over 2};{1 \over 4} + {l \over 2},{5 \over 4} + {l \over 2}} \right),} \hfill \cr {\quad \quad {A_{nl}} = {{\sqrt \pi {\rm{\Gamma }}\left( {l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right)\left( {n + 2l + 1} \right)!} \over {{2^{2l}}\left( {2n + 2l + 1} \right)l!{\rm{\Gamma }}{{\left( {n + l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right)}^2}}}} \hfill \cr } .$ (59)

Looking at the definition of a continuous Hahn polynomial (Eq. (E.1)), we find a hypergeometric function that terminates after n terms, but where the argument s appears as a ‘parameter’. Given how this relates to the definition of the density elements, this means that $\begin{array}{l} ρ_{n l}^{CB73} = B_{n l 3} F_{2} (\begin{matrix} - n, n + 2 l + 2, l / 2 + 1 / 4 + i D / 2 \\ l + 1 / 2, l + 3 / 2 \end{matrix} | 1) ρ_{0 l}^{CB73}, \\ B_{n l} = {(- 1)}^{n} (_{n}^{n + 2 l + 1}), \end{array}$ $\matrix{ {\rho _{nl}^{{\rm{CB73}}} = {B_{nl 3}}{F_2}\left( {\left. {\matrix{ {{{ - n,n + 2l + 2,{l \mathord{\left/ {\vphantom {l {2 + {1 \mathord{\left/ {\vphantom {1 {4 + {\rm{i}}{\cal D}}}} \right. \kern-\nulldelimiterspace} {4 + {\rm{i}}{\cal D}}}}}} \right. \kern-\nulldelimiterspace} {2 + {1 \mathord{\left/ {\vphantom {1 {4 + {\rm{i}}{\cal D}}}} \right. \kern-\nulldelimiterspace} {4 + {\rm{i}}{\cal D}}}}}} \mathord{\left/ {\vphantom {{ - n,n + 2l + 2,{l \mathord{\left/ {\vphantom {l {2 + {1 \mathord{\left/ {\vphantom {1 {4 + {\rm{i}}{\cal D}}}} \right. \kern-\nulldelimiterspace} {4 + {\rm{i}}{\cal D}}}}}} \right. \kern-\nulldelimiterspace} {2 + {1 \mathord{\left/ {\vphantom {1 {4 + {\rm{i}}{\cal D}}}} \right. \kern-\nulldelimiterspace} {4 + {\rm{i}}{\cal D}}}}}} 2}} \right. \kern-\nulldelimiterspace} 2}} \cr {l + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2},l + {3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}} \cr } } \right|1} \right)\rho _{0l}^{{\rm{CB73}}},} \hfill \cr {{B_{nl}} = {{\left( { - 1} \right)}^n}\left( {_n^{n + 2l + 1}} \right),} \hfill \cr }$ (60)

where the operator 𝒟 alarmingly also appears as a parameter; each term in the sum is proportional to a Pochhammer symbol whose argument involves 𝒟. However these are unproblematic to evaluate, as they expand to $\begin{array}{l} {(l / 2 + 1 / 4 + i 𝒟 / 2)}_{j} = (l / 2 - 1 - r \partial_{r} / 2 + j - 1) \\ \times (l / 2 - 1 - r \partial_{r} / 2 + j - 2) \times ... (l / 2 - 1 - r \partial_{r} / 2), \end{array}$ $\matrix{ {{{\left( {l/2 + 1/4 + i{\cal D}/2} \right)}_j} = \left( {l/2 - 1 - r{\partial _r}/2 + j - 1} \right)} \hfill \cr {\,\,\,\,\,\, \times \left( {l/2 - 1 - r{\partial _r}/2 + j - 2} \right) \times ...\left( {l/2 - 1 - r{\partial _r}/2} \right),} \hfill \cr }$ (61)

and each occurrence of r∂_r then operates to the right on $ρ_{0 l}^{CB 73}$ $\rho _{0l}^{{\rm{CB}}73}$ (r) in the expected fashion. The index-raising polynomials (of argument 𝒟 or 𝒜) in the remainder of this section are evaluated in a similar way.

4.1.2 The double power law basis sets

Practically all known double power law basis sets in spherical polar coordinates are contained within one super-family described in Lilley et al. (2018a, containing within it the basis sets of Clutton-Brock 1973; Hernquist & Ostriker 1992; Zhao 1996; Rahmati & Jalali 2009; Lilley et al. 2018b). There are two free parameters (α and v) controlling both the asymptotic power law slope and turnover. We refer to the expressions given in Lilley et al. (2018a) for the potential, density and normalisation constants (Eqs. (30)–(33) of that work), and label them with the superscript LSE. The zeroth-order has $ρ_{0 l}^{LSE} ~ r^{- 2 + 1 / α + l}$ $\rho _{0l}^{{\rm{LSE}}}\~{r^{{{ - 2 + 1} \mathord{\left/ {\vphantom {{ - 2 + 1} {\alpha + l}}} \right. \kern-\nulldelimiterspace} {\alpha + l}}}}$ , and $ρ_{0 l}^{LSE} ~ r^{- 3 - v / α - l}$ $\rho _{0l}^{{\rm{LSE}}}\~{r^{{{ - 3 - v} \mathord{\left/ {\vphantom {{ - 3 - v} {\alpha - l}}} \right. \kern-\nulldelimiterspace} {\alpha - l}}}}$ as r → ∞. Inserting into the definition of the weight function (Eq. (44)) and writing µ = α(1 + 2l), we find that $ω_{l}^{LSE} (s) = {(\frac{μ}{4 π Γ (μ + v)})}^{2} w (α s; \frac{μ}{2}, \frac{μ}{2} + v),$ $\omega _l^{{\rm{LSE}}}\left( s \right) = {\left( {{\mu \over {4\pi \Gamma \left( {\mu + v} \right)}}} \right)^2}w\left( {\alpha s;{\mu \over 2},{\mu \over 2} + v} \right),$ (62)

which is again proportional to a continuous Hahn weight function (Appendix E.1). Explicitly for the index-raising polynomials we have $\begin{array}{l} P_{n l}^{LSE} (s) = \frac{K_{n l}^{LSE}}{K_{0 l}^{LSE}} \frac{i^{n} n!}{{(μ)}_{n} {(μ + v)}_{n}} p_{n} (α s; \frac{μ}{2}, \frac{μ}{2} + v) \\ = \frac{{(- 1)}^{n} K_{n l}^{LSE}}{K_{0 l}^{LSE}}_{3} F_{2} (\begin{matrix} - n, n + 2 μ + 2 v - 1, μ / 2 + i α s \\ μ, μ + v \end{matrix} | 1) . \end{array}$ $\matrix{ {P_{nl}^{{\rm{LSE}}}\left( s \right) = {{K_{nl}^{{\rm{LSE}}}} \over {K_{0l}^{{\rm{LSE}}}}}{{{{\rm{i}}^n}n!} \over {{{\left( \mu \right)}_n}{{\left( {\mu + v} \right)}_n}}}{p_n}\left( {\alpha s;{\mu \over 2},{\mu \over 2} + v} \right)} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = {{{{\left( { - 1} \right)}^n}K_{nl}^{{\rm{LSE}}}} \over {K_{0l}^{{\rm{LSE}}}}}_3{F_2}\left( {\left. {\matrix{ { - n,n + 2\mu + 2v - 1,\mu /2 + i\alpha s} \cr {\mu ,\mu + v} \cr } } \right|1} \right).} \hfill \cr }$ (63)

4.1.3 The cuspy-exponential basis sets

These basis sets were not mentioned in Lilley et al. (2018a) and are therefore newly presented in the literature¹¹; but they are a straightforward derivation from the double power law result, obtained by letting the parameter v and the scalelength simultaneously tend to infinity. The result is a family of basis sets with both an exponential fall-off and a central cusp in density, both controlled by the parameter α − hence the nickname ‘cuspy-exponential’ (CE). The lowest order density function is $ρ_{00}^{CE} \propto r^{- 2 + 1 / α} e^{- r^{1 / α}}$ $\rho _{00}^{{\rm{CE}}} \propto {r^{{{ - 2 + 1} \mathord{\left/ {\vphantom {{ - 2 + 1} \alpha }} \right. \kern-\nulldelimiterspace} \alpha }}}{{\rm{e}}^{ - {r^{{1 \mathord{\left/ {\vphantom {1 \alpha }} \right. \kern-\nulldelimiterspace} \alpha }}}}}$ . Important cases are α = 1/2 which gives a Gaussian, and α = 1 which is a density familiar to chemists as the Slater-type orbital. We use the superscript CE for these basis functions. The density and potential are (with µ = α(1 + 2l)) $\begin{array}{l} ρ_{n l}^{CE} (r) = 2 {(- 1)}^{n} r^{l - 2 + 1 / α} e^{- r^{1 / α}} [L_{n}^{(μ)} (2 r^{1 / α}) + L_{n - 1}^{(μ)} (2 r^{1 / α})] \\ Φ_{0 l}^{CE} (r) = \frac{μ γ (μ, r^{1 / α})}{r^{l + 1}}, \\ Φ_{n l}^{CE} - Φ_{n + 1, l}^{CE} = \frac{2 n! {(- 1)}^{n}}{{(μ + 1)}_{n}} r^{l} e^{- r^{1 / α}} L_{n}^{(μ)} (2 r^{1 / α}), \end{array}$ $\matrix{ {\,\,\,\,\,\,\,\,\,\,\,\rho _{nl}^{{\rm{CE}}}\left( r \right) = 2{{\left( { - 1} \right)}^n}{r^{l - 2 + 1/\alpha }}{{\rm{e}}^{ - {r^{1/\alpha }}}}\left[ {L_{n}^{\left( \mu \right)}\left( {2{r^{1/\alpha }}} \right)} + {L_{n - 1}^{\left( \mu \right)}\left( {2{r^{1/\alpha }}} \right)} \right]} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\Phi _{0l}^{{\rm{CE}}}\left( r \right) = {{\mu \gamma \left( {\mu ,{r^{1/\alpha }}} \right)} \over {{r^{l + 1}}}},} \hfill \cr {\Phi _{nl}^{{\rm{CE}}} - \Phi _{n + 1,l}^{{\rm{CE}}} = {{2n!{{\left( { - 1} \right)}^n}} \over {{{\left( {\mu + 1} \right)}_n}}}{r^l}{{\rm{e}}^{ - {r^{1/\alpha }}}}L_n^{\left( \mu \right)}\left( {2{r^{1/\alpha }}} \right),} \hfill \cr }$ (64)

where γ(µ, z) is the (lower) incomplete Gamma function and $L_{n}^{(μ)}$ $L_n^{\left( \mu \right)}$ (z) is a Laguerre polynomial. The relevant constants are $\begin{array}{l} N_{n l}^{CE} = \frac{α Γ (μ + 1)}{2 μ - 1}, \\ K_{n l}^{CE} = \frac{- n! Γ (μ + 1)}{8 μ α^{2} Γ (n + μ)} . \end{array}$ $\matrix{ {N_{nl}^{{\rm{CE}}} = {{\alpha \Gamma \left( {\mu + 1} \right)} \over {2\mu - 1}},} \hfill \cr {K_{nl}^{{\rm{CE}}} = {{ - n!\Gamma \left( {\mu + 1} \right)} \over {8\mu {\alpha ^2}\Gamma \left( {n + \mu } \right)}}.} \hfill \cr }$ (65)

We can apply the limiting procedure directly to $P_{n l}^{LSE}$ $P_{nl}^{{\rm{LSE}}}$ (s), and the calculation is simpler than for the basis functions themselves. The operator 𝒟 does not depend on the scalelength, and hence is unaffected by the limiting procedure. So we need only consider the limit in v. The result is proportional to a Meixner-Pollaczek polynomial $P_{n}^{(μ / 2)}$ $P_n^{\left( \mu /2 \right)}$ (z; ϕ, Appendix E.2), $P_{n l}^{CE} (s) = \frac{K_{n l}^{CE}}{K_{0 l}^{CE}} \frac{i^{n} n!}{{(μ)}_{n}} P_{n}^{(μ / 2)} (α s; \frac{π}{2}) .$ $P_{nl}^{{\rm{CE}}}\left( s \right) = {{K_{nl}^{{\rm{CE}}}} \over {K_{0l}^{{\rm{CE}}}}}{{{{\rm{i}}^n}n!} \over {{{\left( \mu \right)}_n}}}P_n^{\left( {\mu /2} \right)}\left( {\alpha s;{\pi \over 2}} \right).$ (66)

4.2 Thin disc case

4.2.1 Clutton-Brock’s Kuzmin-Toomre basis set

The Kuzmin–Toomre model (Kuzmin 1956; Toomre 1963) is the simplest power law for the infinitesimally thin disc. This model provides the zeroth-order for a basis set introduced by Clutton-Brock (1972). This basis set turns out to be a special case of Qian’s family (Sect. 4.2.2), but here at least we can write down simple expressions in terms of a single Gegenbauer polynomial $C_{n}^{(α)} (x)$ $C_n^{\left( \alpha \right)}\left( x \right)$ (Aoki & Iye 1978), so it is worth recording the results separately. The density and potentials in the plane are $\begin{matrix} ψ_{n m}^{CB72} (R, φ) = \frac{- e^{i m φ} R^{m}}{{(1 + R^{2})}^{m + 1 / 2}} C_{n}^{(m + 1 / 2)} (\frac{R^{2} - 1}{R^{2} + 1}), \\ σ_{n m}^{CB72} (R, φ) = - \frac{n + m + 1 / 2}{2 π} \frac{ψ_{n m}^{CB72} (R, φ)}{1 + R^{2}}, \end{matrix}$ $\matrix{ {\psi _{nm}^{{\rm{CB72}}}\left( {R,\varphi } \right) = {{ - {{\rm{e}}^{{\rm{i}}m\varphi }}{R^m}} \over {{{\left( {1 + {R^2}} \right)}^{m + 1/2}}}}C_n^{\left( {m + 1/2} \right)}\left( {{{{R^2} - 1} \over {{R^2} + 1}}} \right),} \cr {\sigma _{nm}^{{\rm{CB72}}}\left( {R,\varphi } \right) = - {{n + m + 1/2} \over {2\pi }}{{\psi _{nm}^{{\rm{CB72}}}\left( {R,\varphi } \right)} \over {1 + {R^2}}},} \cr }$ (67)

and the normalisation constant in the orthogonality relation is $N_{n m}^{CB72} = \int d^{2} R σ_{n m}^{CB72} ψ_{n m}^{CB72} = \frac{- π Γ (n + 2 m + 1)}{2^{4 m + 2} n! Γ {(m + 1 / 2)}^{2}} .$ $N_{nm}^{{\rm{CB72}}} = \int {{{\rm{d}}^2}{\bf{R}}\sigma _{nm}^{{\rm{CB72}}}\psi _{nm}^{{\rm{CB72}}} = {{ - \pi \,\Gamma \left( {n + 2m + 1} \right)} \over {{2^{4m + 2}}n!\Gamma {{\left( {m + 1/2} \right)}^2}}}.}$ (68)

The corresponding Fourier-Mellin weight function (Eq. (53)) is then $Ω_{m}^{CB72} (s) = \frac{{| Γ (1 / 2 + m + i s) |}^{2}}{2^{2 m + 5} π^{2} Γ {(m + 1 / 2)}^{2}},$ $\Omega _m^{{\rm{CB72}}}\left( s \right) = {{\,{{\left| {\Gamma \left( {1/2 + m + {\rm{i}}s} \right)} \right|}^2}} \over {{2^{2m + 5}}{{\pi}^2}\Gamma {{\left( {m + 1/2} \right)}^2}}},$ (69)

which is proportional to the weight function for a Meixner-Pollaczek polynomial (Appendix E.2) with parameters λ = m + 1/2 and ϕ = π/2. So the index-raising polynomials have a simple expression in terms of the Meixner-Pollaczek polynomials, $P_{n m}^{CB72} (s) = i^{n} P_{n}^{(m + 1 / 2)} (s; π / 2) .$ $P_{nm}^{{\rm{CB72}}}\left( s \right) = {{\rm{i}}^n}P_n^{\left( {m + 1/2} \right)}\left( {s;\pi /2} \right).$ (70)

4.2.2 Qian’s k-basis sets

The family of basis sets introduced by Qian (1993) is a generalisation of Clutton-Brock (1972), allowing for an arbitrary generalised Kuzmin–Toomre model to be the zeroth order.

That is, the zeroth-order density functions are (using the superscript Q) $\begin{matrix} σ_{0 m}^{Q} (R, φ) = \sqrt{π} \frac{Γ (m + k + 3 / 2)}{Γ (m + k + 1)} \frac{R^{m} e^{i m φ}}{{(1 + R^{2})}^{m + k + 3 / 2}}, \\ ψ_{0 m}^{Q} (R, φ) = π B (m + \frac{1}{2}, \frac{1}{2}) \frac{R^{m} e^{i m φ}}{{(1 + R^{2})}^{m + 1 / 2}} \\ \times_{2} F_{1} (\begin{matrix} - k, m + 1 / 2 \\ m + 1 \end{matrix} | \frac{R^{2}}{1 + R^{2}}) . \end{matrix}$ $\matrix{ {\sigma _{0m}^Q\left( {R,\varphi } \right) = \sqrt \pi {{\Gamma \left( {m + k + 3/2} \right)} \over {\Gamma \left( {m + k + 1} \right)}}{{{R^m}{{\rm{e}}^{im\varphi }}} \over {{{\left( {1 + {R^2}} \right)}^{m + k + 3/2}}}},} \cr {\psi _{0m}^Q\left( {R,\varphi } \right) = \pi {\rm{B}}\left( {m + {1 \over 2},{1 \over 2}} \right){{{R^m}{{\rm{e}}^{im\varphi }}} \over {{{\left( {1 + {R^2}} \right)}^{m + 1/2}}}}} \cr {{ \times _2}{F_1}\left( {\left. {\matrix{ { - k,m + 1/2} \cr {m + 1} \cr } } \right|{{{R^2}} \over {1 + {R^2}}}} \right).} \cr }$ (71)

Here B(x, y) = Γ(x)Γ(y)/Γ(x + y) is the standard Beta function, and the prefactors have been chosen so that all derived expressions are compatible with those in Qian (1993). The higher-order potential and density functions that Qian provides are given in terms of very complicated recursion relations, that are only valid when k is an integer. However there is no such limitation in our representation. The weight function is proportional to that for a continuous Hahn polynomial p_n(s; a, b, Appendix E.1), and so the index-raising polynomial is $P_{n m}^{Q} (s) = \frac{i^{n}}{{(m + k + 1)}_{n}} p_{n} (\frac{s}{2}; \frac{m}{2} + \frac{1}{4}, \frac{m}{2} + k + \frac{3}{4}) .$ $P_{nm}^Q\left( s \right) = {{{{\rm{i}}^n}} \over {{{\left( {m + k + 1} \right)}_n}}}{p_n}\left( {{s \over 2};{m \over 2} + {1 \over 4},{m \over 2} + k + {3 \over 4}} \right).$ (72)

We therefore have closed form expressions for $σ_{n m}^{Q}$ $\sigma _{nm}^{\rm{Q}}$ and $ψ_{n m}^{Q}$ $\psi _{nm}^{\rm{Q}}$ , that are valid for all real values of k, as long as the zeroth-order model has finite total self-energy. The original Clutton-Brock (1972) basis set (Sect. 4.2.1) is recovered when k = 0. The normalisation constant for the orthogonality relation can be derived from that of the continuous Hahn polynomials, and is $\begin{array}{l} N_{n m}^{Q} = \int_{0}^{\infty} R d R σ_{n m}^{Q} ψ_{n m}^{Q} \\ = \frac{π^{2} Γ (m + n + \frac{1}{2}) Γ (2 k + m + n + \frac{3}{2})}{2 n! (2 k + 2 m + 2 n + 1) Γ (2 k + 2 m + n + 1)} . \end{array}$ $\matrix{{N_{nm}^Q = \int_0^\infty {R\,{\rm{d}}R\,\sigma _{nm}^Q\,\psi _{nm}^Q} } \hfill \cr {\,\,\,\,\,\,\,\,\,\, = {{{\pi ^2}\Gamma \left( {m + n + {1 \over 2}} \right)\Gamma \left( {2k + m + n + {3 \over 2}} \right)} \over {2n!\left( {2k + 2m + 2n + 1} \right)\Gamma \left( {2k + 2m + n + 1} \right)}}.} \hfill \cr }$ (73)

4.2.3 Qian’s Gaussian basis set

A Gaussian density profile is another plausible model for the density of a galactic disc, and such a basis set was also studied by Qian (1993). Just as we derived the cuspy-exponential basis sets of Sect. 4.1.3 from the double power law result by taking the infinite limit of the shape parameter v, it turns out that Qian’s basis set for the Gaussian disc can be derived by taking the limit k → ∞ in the corresponding expressions (Eq. (71)) for the generalised Kuzmin-Toomre basis set of Sect. 4.2.2. The zeroth-order density and potential are (using the superscript G) $\begin{array}{l} σ_{0 m}^{G} (R, φ) =_{k \to \infty}^{\lim} {k^{\frac{m}{2}} σ_{0 m}^{Q} (\frac{R}{\sqrt{k}}, φ)} \\ = \sqrt{π} R^{m} e^{- R^{2}} e^{i m φ}, \\ ψ_{0 m}^{G} (R, φ) =_{k \to \infty}^{\lim} {k^{\frac{m}{2}} ψ_{0 m}^{Q} (\frac{R}{\sqrt{k}}, φ)} \\ = π B (m + \frac{1}{2}, \frac{1}{2}) R^{m} e^{i m φ}_{1} F_{1} (\begin{matrix} m + 1 / 2 \\ m + 1 \end{matrix} | - R^{2}) . \end{array}$ $\matrix{ {\sigma _{0m}^{\rm{G}}\left( {R,\varphi } \right) = _{k \to \infty }^{\lim }\left\{ {{k^{{m \over 2}}}\sigma _{0m}^Q\left( {{R \over {\sqrt k }},\varphi } \right)} \right\}} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \sqrt \pi {R^m}{{\rm{e}}^{ - {R^2}}}{{\rm{e}}^{{\rm{i}}m\varphi }},} \hfill \cr {\psi _{0m}^{\rm{G}}\left( {R,\varphi } \right) = _{k \to \infty }^{\lim }\left\{ {{k^{{m \over 2}}}\psi _{0m}^Q\left( {{R \over {\sqrt k }},\varphi } \right)} \right\}} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \pi {\rm{B}}\left( {m + {1 \over 2},{1 \over 2}} \right){R^m}{{\rm{e}}^{{\rm{i}}m\varphi }}_1{F_1}\left( {\left. {\matrix{ {m + 1/2} \cr {m + 1} \cr } } \right| - {R^2}} \right).} \hfill \cr }$ (74)

The function denoted ₁F₁ is a confluent hypergeometric (Kummer) function, that reduces to combinations of modified Bessel functions for any given m. At zeroth-order we have the well-known result that the potential of a plain Gaussian disc involves a single modified Bessel function, $ψ_{00}^{G} (R) = π^{2} I_{0} (R^{2} / 2) e^{- R^{2} / 2} .$ $\psi _{00}^{\rm{G}}\left( R \right) = {\pi ^2}{I_0}\left( {{{{R^2}} \mathord{\left/ {\vphantom {{{R^2}} 2}} \right. \kern-\nulldelimiterspace} 2}} \right)\,{{\rm{e}}^{{{ - {R^2}} \mathord{\left/ {\vphantom {{ - {R^2}} 2}} \right. \kern-\nulldelimiterspace} 2}}}.$

Again Qian gives the higher-order potential and densities only as complicated recursion relations. However, explicit expressions follow upon taking the limit k → ∞ in Eq. (72). We find that $\begin{matrix} P_{n m}^{G} (s) =_{k \to \infty}^{\lim} {P_{n m}^{Q} (s)} = i^{- n} P_{n}^{(m / 2 + 1 / 4)} (\frac{s}{2}; \frac{π}{2}), \\ N_{n m}^{G} =_{k \to \infty}^{\lim} {k^{m + \frac{1}{2}} N_{n m}^{Q}} = \frac{π^{2} Γ (n + m + 1 / 2)}{2^{m + 3 / 2} n!}, \end{matrix}$ $\matrix{ {P_{nm}^{\rm{G}}\left( s \right) = _{k \to \infty }^{\lim }\left\{ {P_{nm}^{\rm{Q}}\left( s \right)} \right\} = {{\rm{i}}^{ - n}}P_n^{\left( {m/2 + 1/4} \right)}\left( {{s \over 2};{\pi \over 2}} \right),} \cr {N_{nm}^{\rm{G}} = _{k \to \infty }^{\lim }\left\{ {{k^{m + {1 \over 2}}}N_{nm}^Q} \right\} = {{{\pi ^2}\Gamma \left( {n + m + 1/2} \right)} \over {{2^{m + 3/2}}n!}},} \cr }$ (75)

where $P_{n}^{(m / 2 + 1 / 4)} (s / 2; π / 2)$ $P_n^{\left( {{m \mathord{\left/ {\vphantom {m 2}} \right. \kern-\nulldelimiterspace} 2} + {1 \mathord{\left/ {\vphantom {1 4}} \right. \kern-\nulldelimiterspace} 4}} \right)}\left( {{s \mathord{\left/ {\vphantom {s 2}} \right. \kern-\nulldelimiterspace} 2};{\pi \mathord{\left/ {\vphantom {\pi 2}} \right. \kern-\nulldelimiterspace} 2}} \right)$ is a Meixner-Pollaczek polynomial (Appendix E.2). Then Eqs. (74) and (75) can be combined to find $\begin{array}{l} σ_{n m}^{G} (R, φ) =_{k \to \infty}^{\lim} {k^{\frac{m}{2}} σ_{n m}^{Q} (\frac{R}{\sqrt{k}}, φ)} \\ =_{k \to \infty}^{\lim} {k^{\frac{m}{2}} P_{n m}^{Q} (𝒜) σ_{0 m}^{Q} (\frac{R}{\sqrt{k}}, φ)} \\ = P_{n m}^{G} (𝒜) σ_{0 m}^{Q} (R, φ), \end{array}$ $\matrix{ {\sigma _{nm}^{\rm{G}}\left( {R,\varphi } \right) = _{k \to \infty }^{\lim }\left\{ {{k^{{m \over 2}}}\sigma _{nm}^Q\left( {{R \over {\sqrt k }},\varphi } \right)} \right\}} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = _{k \to \infty }^{\lim }\left\{ {{k^{{m \over 2}}}P_{nm}^Q\left( A \right)\sigma _{0m}^Q\left( {{R \over {\sqrt k }},\varphi } \right)} \right\}} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = P_{nm}^{\rm{G}}\left( A \right)\sigma _{0m}^Q\left( {R,\varphi } \right),} \hfill \cr }$ (76)

which works because the factor of $1 / \sqrt{k}$ ${1 \mathord{\left/ {\vphantom {1 {\,\,\sqrt k }}} \right. \kern-\nulldelimiterspace} {\,\,\sqrt k }}$ cancels out in 𝒜; there is a similar expression for the potential functions.

4.2.4 Exponential disc

Interestingly, there is another thin disc model which has classical index-raising polynomials: we briefly sketch the derivation for an exponential disc.

We require all density components to fall off exponentially like e^−R but also to behave like an interior multipole as R → 0, so as a zeroth-order ansatz for the density we take simply $σ_{0 m}^{\exp} (R) = R^{m} e^{- R} .$ $\sigma _{0m}^{\exp }\left( R \right) = {R^m}{{\rm{e}}^{ - R}}.$ (77)

This gives a weight function (via Eq. (53)) proportional to that for a continuous Hahn polynomial¹². Thus the index-raising polynomials can be written down explicitly as $P_{n m}^{\exp} (s) = i^{- n} p_{n} (s / 2; m / 2 + 1 / 4, m / 2 + 5 / 4),$ $P_{nm}^{\exp }\left( s \right) = {{\rm{i}}^{ - n}}{p_n}\left( {s/2;m/2 + 1/4,m/2 + 5/4} \right),$ (78)

along with closed form expressions for the recurrence coefficient and normalisation constant. The remaining complication is the zeroth-order potential. The m = 0 case is awkward but classical (Binney & Tremaine 1987, Ch. 2) and uses modified Bessel functions, $ψ_{00}^{\exp} (R) = - π R [I_{0} (\frac{R}{2}) K_{1} (\frac{R}{2}) - I_{1} (\frac{R}{2}) K_{0} (\frac{R}{2})] .$ $\psi _{00}^{\exp }\left( R \right) = - \pi R\left[ {{I_0}\left( {{R \over 2}} \right){K_1}\left( {{R \over 2}} \right) - {I_1}\left( {{R \over 2}} \right){K_0}\left( {{R \over 2}} \right)} \right].$ (79)

Deriving expressions when m > 0 is trickier – we give the details in Appendix F –but it can be accomplished with the following differential-recurrence relation: $ψ_{0, m + 1}^{\exp} (R) = \frac{(R \partial_{R} - m - 1) (\partial_{R} - m / R)}{2 m + 3} ψ_{0 m}^{\exp} (R) .$ $\psi _{0,m + 1}^{\exp }\left( R \right) = {{\left( {R{\partial _R} - m - 1} \right)\left( {{\partial _R} - m/R} \right)} \over {2m + 3}}\psi _{0m}^{\exp }\left( R \right).$ (80)

Some examples of the potential basis elements $ψ_{n m}^{exp} (R)$ $\psi _{nm}^{{\rm{exp}}}\left( R \right)$ are plotted in Fig. 2.

5 Numerical implementation

At the end of Sect. 2 we mentioned the main obstacles to the effective implementation of the new algorithm – primarily the numerical stability when computing the coefficients β_nl, but also the need to compute repeated radial derivatives of the zeroth-order elements.

For the recurrence coefficients β_nl the difficulty is that naively computing the integrals (Eq. (21)) become computationally expensive very quickly with increasing order n (and to some extent also with l). Therefore it is essential to pick a numerical integration method that is fast without sacrificing accuracy. Unfortunately due to the total freedom in choice of zeroth-order ρ_0l, it is difficult to find a quadrature scheme for the integrals (Eq. (21)) that is optimal in general.

Fortunately, due to the link to the polynomials p_nl(s) developed in Sect. 3, we can take advantage of the extensive literature on the construction of general orthogonal polynomials. Following Gautschi (1985) we have two options: either the discretised Stieltjes procedure or the modified Chebyshev algorithm. As it happens, computing the recurrence coefficients naively as in Eq. (21) is directly analogous to using the discretised Stieltjes procedure, except that now we perform the integrals in Fourier– Mellin space. This turns out to be the better option numerically, as the modified Chebyshev algorithm runs into floating point issues sooner due to catastrophic cancellation of terms. However, for completeness we describe both algorithms (Sects. 5.1 and 5.2). We also discuss computer-assisted techniques for performing the repeated differentiations (Sect. 5.3).

All these methods are illustrated throughout for a basis set constructed to have the isochrone model (Henon 1959) as its zeroth-order, and we follow up the numerical discussion with a demonstration of the validity of the isochrone-adapted basis set (Sect. 5.4); however the underlying methods we describe are applicable to any suitable zeroth-order model. The potential, density and polynomial weight function for the isochrone model are as follows: $\begin{array}{l} Φ_{0 l}^{iso} (r) = \frac{- r^{l}}{{(1 + \sqrt{1 + r^{2}})}^{2 l + 1}}, \\ ρ_{0 l}^{iso} (r) = \frac{(2 l + 1) r^{l} [(2 l + 3) (1 + \sqrt{1 + r^{2}}) + (2 l + 2) r^{2}]}{{(1 + \sqrt{1 + r^{2}})}^{2 l + 3} {(1 + r^{2})}^{3 / 2}}, \\ ω_{l}^{iso} (s) = \frac{{(2 l + 1)}^{2}}{2^{2 l + 6} π^{2}} {| \frac{Γ (l + 3 / 2 + i s) Γ (l / 2 + 1 / 4 + i s / 2)}{Γ (3 l / 2 + 7 / 4 + i s / 2)} |}^{2} . \end{array}$ $\matrix{ {\Phi _{0l}^{{\rm{iso}}}\left( r \right) = {{ - {r^l}} \over {{{\left( {1 + \sqrt {1 + {r^2}} } \right)}^{2l + 1}}}},} \hfill \cr {\rho _{0l}^{{\rm{iso}}}\left( r \right) = {{\left( {2l + 1} \right){r^l}\left[ {\left( {2l + 3} \right)\left( {1 + \sqrt {1 + {r^2}} } \right) + \left( {2l + 2} \right){r^2}} \right]} \over {{{\left( {1 + \sqrt {1 + {r^2}} } \right)}^{2l + 3}}{{\left( {1 + {r^2}} \right)}^{3/2}}}},} \hfill \cr {\omega _l^{{\rm{iso}}}\left( s \right) = {{{{\left( {2l + 1} \right)}^2}} \over {{2^{2l + 6}}{\pi ^2}}}{{\left| {{{\Gamma \left( {l + 3/2 + is} \right)\Gamma \left( {l/2 + 1/4 + is/2} \right)} \over {\Gamma \left( {3l/2 + 7/4 + is/2} \right)}}} \right|}^2}.} \hfill \cr }$ (81)

The precise l-dependence of these expressions is of course arbitrary to some extent, but we have made a suitable natural choice.

5.1 Discretised Stieltjes procedure

The sequence of recurrence coefficients β_nl that we need to compute can be expressed as the ratio of two integrals, $β_{n l} = \frac{I_{n l}}{I_{n - 1, l}}, where I_{n l} = 〈 ρ_{n l}, ρ_{n l} 〉 = {‖ ρ_{n l} ‖}^{2},$ ${\beta _{nl}} = {{{I_{nl}}} \over {{I_{n - 1,l}}}},{\rm{where}}\,{I_{nl}} = \left\langle {{\rho _{nl}},{\rho _{nl}}} \right\rangle = {\left\| {{\rho _{nl}}} \right\|^2},$ (82)

and so for each higher n we need one additional evaluation of I_nl. Evaluations of β_nl alternate with applications of the recurrence relation (47) to find the next basis element ρ_nl. Once sufficient β_nl have been found, the potential or density functions ρ_nlm and Φ_nlm, can be evaluated via their own recurrences as described in Sect. 2.

The difficulty then is in finding an appropriate strategy to compute the integrals I_nl. We opt to evaluate them in Fourier– Mellin-space, using the polynomials p_nl(s) directly, and making use of the fact that the integral can be written $I_{n l} = \int_{- \infty}^{\infty} ω_{l} (s) {[p_{n l} (s)]}^{2} d s .$ ${I_{nl}} = \int_{ - \infty }^\infty {{\omega _l}\left( s \right){{\left[ {{p_{nl}}\left( s \right)} \right]}^2}{\rm{d}}s.}$ (83)

Therefore, the first step is to determine the weight function w_l(s). This can be found in terms of the (Fourier-)Mellin transform of either the zeroth-order potential or the density, $\begin{array}{l} ω_{l} (s) = \frac{2 K_{0 l}^{2}}{{(l + 1 / 2)}^{2} + s^{2}} {| ℳ_{r} {ρ_{0 l} (r)} (5 / 2 + i s) |}^{2} \\ = \frac{{(l + 1 / 2)}^{2} + s^{2}}{8 π^{2}} {| ℳ_{r} {Φ_{0 l} (r)} (1 / 2 + i s) |}^{2} . \end{array}$ $\matrix{{{\omega _l}\left( s \right) = {{2K_{0l}^2} \over {{{\left( {l + 1/2} \right)}^2} + {s^2}}}{{\left| {{{\cal M}_r}\left\{ {{\rho _{0l}}\left( r \right)} \right\}\left( {5/2 + {\rm{i}}s} \right)} \right|}^2}} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\, = {{{{\left( {l + 1/2} \right)}^2} + {s^2}} \over {8{\pi ^2}}}{{\left| {{{\cal M}_r}\left\{ {{\Phi _{0l}}\left( r \right)} \right\}\left( {1/2 + {\rm{i}}s} \right)} \right|}^2}.} \hfill \cr }$ (84)

The Mellin transform is perhaps one of the less familiar integral transforms, but in practice a wide variety of Mellin transforms can be found in closed form (helped especially by computer algebra systems), in part because with a logarithmic change of variable it can be written as a Fourier transform. All the polynomial weight functions considered in this paper can be found symbolically using MATHEMATICA¹³. Numerical evaluation of the Mellin transform is also an option – by transformation to the Fourier transform and approximation using fast Fourier transform methods – however we do not pursue this further in the present work.

Now we consider the asymptotic behaviour of the weight function w_l(s) as s → ±∞. The smoothness requirement (Eq. (9)) on ρ_0l forces w_l(s) to decay faster than any power of s, that is, at least exponentially. We expect that $ω_{l} (s) \sim {| s |}^{b} e^{- a | s |} a s s \to \pm \infty,$ ${\omega _l}\left( s \right) \sim {\left| s \right|^b}{{\rm{e}}^{ - a\left| s \right|}}as\,s \to \pm \infty ,$ (85)

so we need to determine the decay constant a. In the case of our isochrone basis set this asymptotic behaviour is derived from the behaviour of the complex gamma function at infinity (DLMF, Sect. 5.11.9), giving w_l(s) ~ |s|⁻¹ e^−π|s|, or a = π. When w_l(s) can be written down, it is usually simple to read off the decay constant a; for example, the double power law basis sets (Sect. 4.1.2) have a = α.

The at-least-exponential decay of the weight function suggests that the appropriate discretization scheme for Eq. (83) is Gauss–Laguerre quadrature. To implement this for the isochrone case, rewrite Eq. (83) to pull out a factor of e^−πs, and use the symmetry of the integrand to change the domain of integration to (0, ∞) (defining x = πs), $I_{n l} = \frac{2}{π} \int_{0}^{\infty} e^{- x} \underset{\sim x^{2 n - 1} as x \to \infty}{\underset{︸}{e^{x} ω_{l} (x / π) {[p_{n l} (x / π)]}^{2}}} d x$ ${I_{nl}} = {2 \over \pi }\int_0^\infty {{{\rm{e}}^{ - x}}\underbrace {{{\rm{e}}^x}{\omega _l}\left( {x/\pi } \right){{\left[ {{p_{nl}}\left( {x/\pi } \right)} \right]}^2}}_{ \sim {x^{2n - 1}}{\rm{as}}\,x \to \infty }dx}$ (86)

We can then implement Gauss-Laguerre quadrature of order v, as a weighted sum over evaluation points x_jv, which are the roots of the vth Laguerre polynomial L_v(x): $I_{n l} = \frac{2}{π} \sum_{j = 1}^{v} w_{j v} e^{x_{j v}} ω_{l} (x_{j v} / π) {[p_{n l} (x_{j v} / π)]}^{2},$ ${I_{nl}} = {2 \over \pi }\sum\limits_{j = 1}^v {{w_{jv}}{{\rm{e}}^{{x_{jv}}}}{\omega _l}\left( {{x_{jv}}/\pi } \right){{\left[ {{p_{nl}}\left( {{x_{jv}}/\pi } \right)} \right]}^2},}$ (87) $\begin{matrix} w_{j v} = \frac{x_{j v}}{{[(v + 1) L_{v + 1} (x_{j v})]}^{2}}, \\ L_{v} (x_{j v}) = 0, j = 1 \dots v . \end{matrix}$ $\matrix{ {{w_{jv}} = {{{x_{jv}}} \over {{{\left[ {\left( {v + 1} \right){L_{v + 1}}\left( {{x_{jv}}} \right)} \right]}^2}}},} \cr {{L_v}\left( {{x_{jv}}} \right) = 0,\quad j = 1 \ldots v.} \cr }$

The quadrature rule of order v integrates polynomials exactly up to order 2v − 1, so to compute I_nl with the isochrone weight function we would expect to need at least v ≥ n. (An acceptable rule of thumb is that I_nl requires v = max(n + l, 10).) It may be necessary to compute the weights w_jv and roots x_jv to a higher order of precision internally using arbitrary-precision arithmetic, but this is not a bottleneck in practice – and typically Gauss-Laguerre quadrature is implemented as a library function whose implementation details are hidden. In this way we can get, for example, 50 orders of β_nl to floating-point precision in under a tenth of a second using one core of a modern CPU.

The radial parts of some examples of potential elements in the isochrone basis set are plotted in Fig. 1.

Fig. 1

Radial parts of the isochrone potential basis $Φ_{n l}^{iso} (r)$ ${\rm{\Phi }}_{nl}^{{\rm{iso}}}\left( r \right)$ for n = 0, 1, 2, 3 and l = 0 (top) and l = 1 (bottom). The potentials have been unit-normalised.

5.2 Modified Chebyshev algorithm

This is an alternative method described in Gautschi (1985), which we find to be less numerically stable in practice. However we describe it here for completeness, as it may yet find some usefulness (for example to facilitate finding exact expressions for the recurrence coefficients in certain cases).

Gautschi’s modified Chebyshev algorithm prescribes the ‘modified moments ‘¹⁴ ${\tilde{μ}}_{k l} \equiv \int_{- \infty}^{\infty} ω_{l} (s) {\tilde{p}}_{k l} (s) .$ ${\tilde \mu _{kl}} \equiv \int_{ - \infty }^\infty {{\omega _l}\left( s \right){{\tilde p}_{kl}}\left( s \right).}$ (88)

Here, ${\tilde{p}}_{k l} (s)$ ${\tilde p_{kl}}\left( s \right)$ are some auxiliary set of (monic) polynomials, orthogonal with respect to a symmetric measure on the interval (−∞, ∞), and obeying a three-term recurrence relation $\begin{array}{l} {\tilde{p}}_{- 1, i} (s) = 0 \\ {\tilde{p}}_{k + 1, l} (s) = s {\tilde{p}}_{k l} (s) - {\tilde{β}}_{k l} {\tilde{p}}_{k - 1, i} (s) . \end{array}$ $\matrix{ {{{\tilde p}_{ - 1,i}}\left( s \right) = 0} \hfill \cr {{{\tilde p}_{k + 1,l}}\left( s \right) = s\,{{\tilde p}_{kl}}\left( s \right) - {{\tilde \beta }_{kl}}{{\tilde p}_{k - 1,i}}\left( s \right).} \hfill \cr }$ (89)

By symmetry this means that ${\tilde{μ}}_{k l}$ ${{\tilde \mu }_{kl}}$ are nonzero only for even k. In principle the choice of auxiliary polynomial is wide open, but the obvious choice in our case (for ease and stability of computation) is the monic Hermite polynomials $H e_{k} (s)$ $H{e_k}\left( s \right)$ , for which ${\tilde{β}}_{k l} = k$ ${{\tilde \beta }_{kl}} = k$ . We can then proceed to find the ‘mixed’ moments $σ_{j k l} = \int_{- \infty}^{\infty} p_{j l} (s) {\tilde{p}}_{k l} (s) ω_{l} (s) d s$ ${\sigma _{jkl}} = \int_{ - \infty }^\infty {{p_{jl}}\left( s \right){{\tilde p}_{kl}}\left( s \right){\omega _l}\left( s \right){\rm{d}}s}$ (90)

via a system of recurrence relations that produces the desired recurrence coefficients β_nl as a byproduct: $\begin{array}{l} σ_{0 k l} = {\tilde{μ}}_{k l}, \\ σ_{j k l} = σ_{j - 1, k + 1, l} - β_{j - 1, l} σ_{j - 2, k, l} + {\tilde{β}}_{k l} σ_{j - 1, k - 1, l,} \\ β_{n l} = \frac{σ_{n m l}}{σ_{n - 1, n - 1, l}} . \end{array}$ $\matrix{ {{\sigma _{0kl}} = {{\tilde \mu }_{kl}},} \hfill \cr {{\sigma _{jkl}} = {\sigma _{j - 1,k + 1,l}} - {\beta _{j - 1,l}}{\sigma _{j - 2,k,l}} + {{\tilde \beta }_{kl}}{\sigma _{j - 1,k - 1,l,}}} \hfill \cr {{\beta _{nl}} = {{{\sigma _{nml}}} \over {{\sigma _{n - 1,n - 1,l}}}}.} \hfill \cr }$ (91)

In practice (for our isochrone basis set) we find that σ_jkl suffers from catastrophic cancellation beyond approximately j = 20. Alternatively if the modified moments ${\tilde{μ}}_{k l}$ ${{\tilde \mu }_{kl}}$ are known in closed form then this method is convenient for finding ‘exact’ recurrence coefficients. This turns out to be the case for the isochrone basis set, for which see Appendix G.

Fig. 2

Radial parts of the exponential disc basis $ψ_{n m}^{\exp} (r)$ $\psi _{nm}^{\exp }\left( r \right)$ for n = 0, 1, 2, 3 and m = 0 (top) and m = 1 (bottom). The potentials have been unit-normalised.

5.3 Repeated differentiation

There are three classes of algorithm for computer-assisted differentiation: 1. finite-differencing, 2. symbolic differentiation, and 3. automatic differentiation. The first of these we can discount pretty much immediately as being wildly numerically unstable and expensive compared to the other two.

The second, symbolic differentiation via computer algebra, is potentially competitive at low expansion orders, but it is hard to predict the degree of blow-up in the number of algebraic terms. It depends strongly on the precise form of the function that is being differentiated. In practice we find that efficient application of symbolic differentiation at high expansion orders requires alternating between differentiation and algebraic simplification.

Of course, one may also attempt symbolic differentiation by hand, attempting to find simplifications that reduce the tower of applications of 𝒟ⁿ to a simpler form – whether this is possible also depends on the form of Φ_0l and ρ_0l. Many of the basis sets considered in Sect. 4 have simple closed forms (at all orders) due to fortunate simplification in repeated differentiation. For example, taking the double power law basis (Sect. 4.1.2) with parameters α = 1/2, v = p − 3/2 and n = l = 0 (and labelling each density function with p), we have $ρ_{p + 1} = p^{- 1} (p - 5 / 4 - i𝒟 / 2) ρ_{p}$ ${\rho _{p + 1}} = {p^{ - 1}}\left( {p - {5 \mathord{\left/ {\vphantom {5 4}} \right. \kern-\nulldelimiterspace} 4} - {{{\rm{iD}}} \mathord{\left/ {\vphantom {{{\rm{iD}}} 2}} \right. \kern-\nulldelimiterspace} 2}} \right){\rho _p}$ (92)

Using this identity in Eq. (63) then leads (after some further simplification) to a known closed form expression for $ρ_{n l}^{LSE}$ $\rho _{nl}^{{\rm{LSE}}}$ . However it is likely to be difficult to find easy differentiation formulas in general. For our isochrone basis set, the method we give in Appendix G for computing the modified moments can be adapted to find expressions for the higher-order derivatives, but the result is complicated and of dubious numerical stability.

The third method, automatic differentiation (AD) is what we find to be most competitive in practice. This is a general term referring to a class of algorithms implemented entirely at the software library level, that provides an evaluation of the derivative at a single point given only knowledge of the chain rule and the differentiation rules for primitive arithmetic operations and standard mathematical library functions. Essentially, the function to be differentiated is written in ordinary code, and the AD algorithm ‘automatically’ deduces the correct sequence of chain rule steps to carry out. For our purposes we require higher-order derivatives; while applying an AD algorithm to itself works in principle (and often works in practice) it is very inefficient, as the AD logic itself must be differentiated. It is better to use an AD implementation that natively understands higher-order derivatives.

As we are coding in the JULIA programming language, we use a suitable library called TAYLORSERIES.JL (Benet & Sanders 2019). A special variable t(N) is instantiated that represents the first N terms of an (abstract) Taylor series. Given a point r₀, we can use t + r₀ as the argument of any ordinary mathematical function¹⁵; the result is the first N coefficients of the Taylor series around r₀ that approximates that function. For example, setting N = 3 and r₀ = 1.0 and using the potential of the isochrone model (Eq. (81)) as our function, the computer prints a data structure representing the following truncated Taylor series, $Φ_{00}^{iso} (t (3) + 1.0) = - 0.4142 + 0.1213 t - 0.0052 t^{2} - 0.0225 t^{3}$ ${\rm{\Phi }}_{00}^{{\rm{iso}}}\left( {t\left( 3 \right) + \left. {1.0} \right) = - 0.4142 + 0.1213t - 0.0052{t^2} - 0.0225{t^3}} \right.$

When it comes to the actual implementation, we have two choices, which we find to have similar efficiency in practice. The first option begins with computing the vector of derivatives (at a point r₀) up to some maximum order N all in one go, $V_{l} = (Φ_{0 l m}, 𝒟 Φ_{0 l m}, 𝒟^{2} Φ_{0 l m}, \dots, 𝒟^{N} Φ_{0 l m})$ ${{\bf{V}}_l} = \left( {{{\rm{\Phi }}_{0lm}},D{{\rm{\Phi }}_{0lm}},{D^2}{{\rm{\Phi }}_{0lm}}, \ldots ,{D^N}{{\rm{\Phi }}_{0lm}}} \right)$ (93)

In fact, because 𝒟 can be expressed as a single differentiation with respect to a transformed variable (via r d/dr = d/ds, where s = log r), V_l can be obtained directly from a single N-term Taylor series evaluation. Separately, we derive from β_nl the matrix elements (A_l)_nj = A_njl in the expansion $Φ_{n l m} = \sum_{j = 0}^{n} A_{n j l} 𝒟^{j} Φ_{0 l m}$ ${{\rm{\Phi }}_{nlm}} = \sum\limits_{j = 0}^n {{A_{njl}}{D^j}{{\rm{\Phi }}_{0lm}}}$ (94)

To evaluate a vector of potential functions at a single point, $Φ_{n l m} = (Φ_{0 l m}, Φ_{1 l m}, \dots, Φ_{N l m})$ ${{\bf{\Phi }}_{nlm}} = \left( {{{\rm{\Phi }}_{0lm}},{{\rm{\Phi }}_{1lm}}, \ldots ,{{\rm{\Phi }}_{Nlm}}} \right)$ (95)

we perform the contraction Φ_l= A_l · V_l. At each different point r₁, we have to re-compute V but not A.

The second option is to use the recurrence relation directly (Eq. (19)) or (Eq. (20)). Because we know ahead of time that we want N iterations of the recurrence relation, we set up the Taylor series t(N), and use r₀ + t(N) as the dependent variable. The length of the series then shrinks as we go up the ladder of basis function evaluations. In practice this second method seems to be marginally slower than the first one, as more operations on the abstract Taylor series need to be performed.

5.4 Unstable modes of a spherical system

It is important to check whether a basis set constructed according to the prescriptions of Sect. 5 actually works in practice. One simple approach might be to just construct n basis functions, and integrate up the n × n square of inner products, testing whether orthogonality is achieved to a given floating-point precision. However, we know that it is possible to construct basis sets that are genuinely orthogonal but whose expansions of realistic mass densities fail to converge in practice, or display other undesirable numerical effects¹⁶. Therefore we choose to demonstrate the validity of our approach by reproducing a physical result from the literature – the unstable radial mode of the isochrone model.

We use the discretised Stieltjes method described in Sect. 5.1, where the basis set is adapted to the isochrone model at zeroth order. However the specific adaptation is not the crucial part; for this particular application only the perturbing density needs to be accurately resolved by the basis elements, so the key feature required of the basis set is only that it has the correct asymptotic behaviour. To this end, we adapt the code and method of Fouvry & Prunet (2022) to show that the same unstable mode is recovered by our isochrone-adapted basis set. The part of the code that implements the basis set has been provided online in a GIT repository¹⁷.

The details of the computation can be found in Fouvry & Prunet (2022). In brief, we start with knowledge of an isotropic distribution function that solves the collisionless Boltzmann equation for the isochrone potential. We also have the corresponding action and angle coordinates (J, w) as a function of position and momentum, which for the isochrone potential are known in closed form. Then, each potential basis element must be Fourier-transformed with respect to the angle coordinates ${\hat{Φ}}_{n l m}^{n} (J) = {(2 π)}^{- 3} \int d^{3} w e^{- i n \cdot w} Φ_{n l m} (J, w),$ ${\rm{\hat \Phi }}_{nlm}^{\bf{n}}\left( {\bf{J}} \right) = {\left( {2\pi } \right)^{ - 3}}\int {{{\rm{d}}^3}{\bf{w}}\,{{\rm{e}}^{ - {\rm{i}}{\bf{n}} \cdot {\bf{w}}}}\,\,{{\rm{\Phi }}_{nlm}}\left( {{\bf{J}},{\bf{w}}} \right),}$ (96)

out of which a matrix M is formed¹⁸, ${(M)}_{n_{1} l_{1}, n_{2} l_{2}} = {(2 π)}^{3} \sum_{n} \int d^{3} J {\hat{Φ}}_{n_{1} l_{1} 0}^{n} \bar{{\hat{Φ}}_{n_{2} l_{2} 0}^{n}} R_{n} (s),$ ${\left( {\rm{M}} \right)_{{n_1}{l_1},{n_2}{l_2}}} = {\left( {2\pi } \right)^3}\sum\limits_{\bf{n}} \int {{{\rm{d}}^3}{\bf{J}}\,\,{\rm{\hat \Phi }}_{{n_1}{l_1}0}^{\bf{n}}\,\,\overline {{\rm{\hat \Phi }}_{{n_2}{l_2}0}^{\bf{n}}} {R_{\bf{n}}}\left( s \right),}$ (97)

where the R_n(s) represents the collisionless Boltzmann operator for a perturbation with growth rate proportional to e^st. The unstable growing mode then corresponds to a solution A of the matrix equation $(M+𝕀) \cdot A = 0,$ $\left( {{\rm{M + I}}} \right) \cdot {\bf{A}} = 0 ,$ (98)

with the vector of coefficients A = (A_nl) giving the expansion of the mode δΦ with respect to the basis {Φ_nlm}, $δ Φ= \sum_{n l} A_{n l} Φ_{n l 0} .$ $\delta {\rm{\Phi = }}\sum\limits_{nl} {{A_{nl}}{{\rm{\Phi }}_{nl0}}} .$ (99)

A plot of this mode is shown in Fig. 3. The maximum expansion orders were n_max = 6 and l_max = 2, with a scale length of r_s = 1 and a maximum resonance number of $n_{1}^{\max} = 10$ $n_1^{\max } = 10$ . All our other integration parameters are identical to those in Fouvry & Prunet (2022, Appendix C), where a matching result was obtained using the Clutton-Brock (1973) basis set with n_max = 100 and r_s = 20 – the mode shape also agreeing with the original result of Saha (1991). As mentioned previously, it is not strictly necessary to exactly match the zeroth-order element of the basis set to the underlying equilibrium model. However the basis elements must have the correct asymptotic behaviour, so using the isochrone-adapted basis set guarantees that this condition is satisfied. Nevertheless, our results do hint that accurate mode recovery may be possible with many fewer basis elements when the basis is suitably adapted, although we hesitate to draw any firm conclusions until a more systematic comparison can be drawn.

Calculating the matrix M is very computationally expensive, as it requires multiple truncated infinite summations, over several indices (n, l, and the vector of wavenumbers n). It also requires two nested integrations, as the Fourier transform (Eq. (96)) must also be performed numerically. In the general non-isochrone case a third level of integration is required, because the action and angle coordinates are no longer known in closed form. Any method of reducing this computational effort is therefore desirable. It is possible that judicious choice of basis elements and application of their differential-recursion relation Eq. (20) may ameliorate these calculations, but further investigation is needed.

6 Discussion and conclusions

We have reformulated the study of bi-orthogonal basis sets using the language of Fourier–Mellin transforms. This unexpected development unifies many previous results into a coherent theoretical framework. The general idea of generating new potential-density pairs from old by differentiation is not entirely new. Traditionally this is accomplished by differentiating with respect to the model’s scalelength – in particular, Aoki & Iye (1978) found compact expressions for the thin disc basis of Clutton-Brock (1972) by repeatedly applying the operator a∂_a (for a the scalelength) and orthogonalising the resulting sequence of potential-density pairs by the Gram-Schmidt process. |Subsequently de Zeeuw & Pfenniger (1988), in the course of deriving a series of ellipsoidal potential-density pairs, noted that the operators r∂_r and $\nabla^{2}$ ${\nabla ^2}$ obey an important commutation relation (which we re-derive in Appendix B). Therefore Aoki & Iye (1978)’s result – and by extension our algorithm presented here – can be expressed in terms of the coordinates alone, without reference to an arbitrary scalelength.

The formalism developed in Sects. 2–3 deserves some further interpretation. In particular, the operator 𝒟 on which the whole development hinges may appear to have been plucked out of thin air, but it is in fact no accident: 𝒟 is precisely the inflnites-imal generator of the scaling symmetry of the self-energy inner product (Eq. (2)). To see this, let S_t be a ‘radial scaling’ operator, $(S_{t} f) (r) = t^{- 5 / 2} f (t r) .$ $\left( {{S_t}f} \right)\left( {\bf{r}} \right) = {t^{{{ - 5} \mathord{\left/ {\vphantom {{ - 5} 2}} \right. \kern-\nulldelimiterspace} 2}}}f\left( {t{\bf{r}}} \right) .$ (100)

As is immediately evident from dimensional analysis, this preserves the self-energy, $〈 S_{t} f, S_{t} g 〉 = 〈 f, g 〉 .$ $\left\langle {{S_t}f,{S_t}g} \right\rangle = \left\langle {f,g} \right\rangle .$ (101)

The operator 𝒟 is now defined in terms of the infinitesimal generator of S_t, ${(𝒟 f) (r) \equiv i \frac{d}{d t} (S_{t} f) (r) |}_{t = 0} .$ ${\left. {\left( {Df} \right)\left( {\bf{r}} \right) \equiv {\rm{i}}{{\rm{d}} \over {{\rm{d}}t}}\left( {{S_t}f} \right)\left( {\bf{r}} \right)} \right|_{t = 0}} .$ (102)

Differentiating Eq. (101) with respect to the parameter t, it is immediately evident that 𝒟 is self-adjoint¹⁹. In Sect. 3.1, we implicitly invoked Stone’s theorem from functional analysis to provide a Fourier-like transform whose integral kernel is the eigenfunction of a self-adjoint operator. In our case the operator is 𝒟, the eigenfunction is Ψ_s (Eq. (24)), and the resulting integral transform is exactly the radial part of the Fourier-Mellin transform that we defined in Eq. (26). The spherical harmonics arise from a similar argument applied to the generators of the coordinate rotations²⁰.

This line of reasoning suggests that it may be worthwhile to look for other symmetries of the self-energy inner product, perhaps arising from other coordinate systems or geometries in which the Laplacian separates. Given a set of three mutually commuting operators arising from three symmetries of the self-energy, we would expect to be able to construct a basis set formalism similar to that of the present work. To sketch out what this looks like in full generality, let τ be a suitable self-adjoint operator according to the criteria just described (restricting to one spatial dimension for the sake of discussion). Then the self-adjointness condition (Eq. (7)) combined with the properties of the inner product (Eq. (2)) implies that $\nabla^{2} τ = τ^{*} \nabla^{2},$ ${\nabla ^2}\tau = {\tau ^ * }{\nabla ^2} ,$ (103)

where τ^* is the Hermitian adjoint of τ with respect to the ordinary inner product on L² functions (Eq. (A.3)). If we suppose further that we have found a set of orthogonal potential functions {Φ_n}, with an index-raising polynomial p_n(s) such that $Φ_{n} = p_{n} (τ) Φ_{0},$ ${{\rm{\Phi }}_n} = {p_n}\left( \tau \right){{\rm{\Phi }}_0},$ (104)

then the associated density functions (obeying $\nabla^{2} Φ_{n} = ρ_{n}$ ${\nabla ^2}{{\rm{\Phi }}_n} = {\rho _n}$ ) are given by $ρ_{n} = p_{n} (τ^{*}) ρ_{0} .$ ${\rho _n} = {p_n}\left( {{\tau ^ * }} \right){\rho _0} .$ (105)

There are further simplifications involved in Sect. 3, which come about essentially because 𝒟 = 𝒟^* + c for some constant c, which means that the eigenfunctions of 𝒟 and 𝒟^* are the same up to a constant shift in the eigenvalue. Generically we would expect a different relationship between τ and τ^*.

The task remaining, which we leave to future efforts, is therefore to classify the symmetries of the self-energy inner product, in order to develop expansions that are usefully adapted to different coordinate systems and geometries. In a sense, the ‘holy grail’ would be the construction of an expansion adapted to the confocal ellipsoidal coordinate system, appropriate for studying the equilibrium dynamics of ellipsoidal galaxies²¹.

Some symmetries are already known. For example, in Cartesian coordinates (x, y, z) we trivially have the three cardinal translations (x → x + a etc.). Writing down their associated infinitesimal generators X = i∂_x, Y = i∂_y and Z = i∂_z, their joint eigenfunction e^{ik r} is just the kernel of the standard Fourier transform, with the wavevector k taking the role of the (continuous) eigenvalue. The Fourier transform would therefore play the same role in the resulting basis set formalism as the Fourier-Mellin transform did in ours (Sect. 3). Poisson solvers directly using the Fourier transform are ubiquitous in astrophysical applications, so it would be interesting to construct a set of ‘Cartesian’ basis functions and compare their performance with the current state of the art.

Other symmetries are known from classical potential theory. Firstly, the Kelvin transform; this is an inversion in a sphere and preserves the self-energy up to a sign (Kalnajs 1976). However, it is not a continuous symmetry, so there is no associated infinitesimal self-adjoint operator. Secondly, a symmetry that takes spheres to concentric ellipsoids (sometimes called homeoids); this maps the spherical radius to an ‘ellipsoidal’ radius, $r \mapsto m = \sqrt{x^{2} / a^{2} + y^{2} / b^{2} + z^{2} / c^{2}}$ $r \mapsto m = \sqrt {{{{x^2}} \mathord{\left/ {\vphantom {{{x^2}} {{a^2} + {{{y^2}} \mathord{\left/ {\vphantom {{{y^2}} {{b^2} + {{{z^2}} \mathord{\left/ {\vphantom {{{z^2}} {{c^2}}}} \right. \kern-\nulldelimiterspace} {{c^2}}}}}} \right. \kern-\nulldelimiterspace} {{b^2} + {{{z^2}} \mathord{\left/ {\vphantom {{{z^2}} {{c^2}}}} \right. \kern-\nulldelimiterspace} {{c^2}}}}}}}} \right. \kern-\nulldelimiterspace} {{a^2} + {{{y^2}} \mathord{\left/ {\vphantom {{{y^2}} {{b^2} + {{{z^2}} \mathord{\left/ {\vphantom {{{z^2}} {{c^2}}}} \right. \kern-\nulldelimiterspace} {{c^2}}}}}} \right. \kern-\nulldelimiterspace} {{b^2} + {{{z^2}} \mathord{\left/ {\vphantom {{{z^2}} {{c^2}}}} \right. \kern-\nulldelimiterspace} {{c^2}}}}}}}}$ . It has long been known that this transformation preserves the mutual self-energy of any two charge or mass densities (Carlson 1961), up to a constant factor that is essentially just an elliptical integral of the three semi-axes (a, b, c). We can use this to transform any purely spherical basis set²² into one stratified on concentric ellipsoids. It is important to note, however, that the concentric ellipsoids in this transformation are distinct from the confocal ellipsoids inherent in the ellipsoidal coordinate system that is more dynamically relevant due to its relationship to the Stäckel potentials (de Zeeuw 1985; de Zeeuw et al. 1986).

Also, we would like to mention some gaps in our analysis. While we purport in this work to provide a general theory of orthogonal basis sets, there are some aspects that are still not fully characterised. Firstly, it is clear from Sect. 4 that there exists a connection between basis sets that have a classical index-raising polynomial P_n(s), and those whose potential and density elements are known in closed form (that is, possessing a recurrence relation independent of 𝒟 or 𝒜). However, the exact nature of this connection is unknown, although it is likely related to the fact that the Hahn-type polynomials appearing in the various index-raising polynomials obey second-order difference equations²³. Secondly, we do not touch on the issue of basis sets appropriate for finite-radius systems. This was approached by Kalnajs (1976) in the case of thin discs, using a formalism initially similar to our own. There are also contributions from Polyachenko & Shukhman (1981) for finite spheres, and from Tremaine (1976) for finite elliptical discs. In general, it appears to be straightforward to construct basis sets for finite systems out of polynomials or Bessel functions, but it would be useful to make a concrete connection between the finite bases of Kalnajs (1976) and our new formalism. A more rigorous form of the argument about completeness in Sect. 3.3 would also be desirable, as would a quantitative comparison with basis sets computed via the Sturm-Liouville approach of Weinberg (1999).

We end with some broader speculation. It is possible that applications may be found for the general ideas developed here, beyond the solution of Poisson’s equation. In physics we are often required to compute the inverse of Hermitian operators with a continuous spectrum – a well-known example being the Schrödinger operator for certain boundary conditions and choices of potential. These operators could conceivably be supplied with a set of (adapted) orthogonal basis functions, by identifying a suitable commuting set of self-adjoint operators and then diagonalising their cyclic vectors. Any such basis set then provides an infinite series representation of the Green’s function of the underlying Hermitian operator²⁴ where the coordinates appear multiplicatively separated in each term. A use for such series representations may be found in various applications. The appearance of tridiagonal Jacobi operators in particular may presage links to similar numerical methods in quantum mechanics (Alhaidari et al. 2008; Ismail & Koelink 2011).

Fig. 3

Recovery of the unstable radial mode of the isotropic isochrone model. The mode is recovered well despite the low (n_max = 6) number of basis functions used.

Acknowledgements

E.L. and G.vdV. acknowledge funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 724857 (Consolidator Grant ArcheoDyn). We thank Jean-Baptiste Fouvry for granting permission to adapt his computer code for the purposes of our Sect. 5.4; and also to the referee Michael Petersen for numerous helpful suggestions that have strengthened the results of this paper.

Appendix A Self-adjointness of 𝒟

Let f, ɡ be densities that are non-zero on a δ-dimensional hyperplane in three-dimensional space (δ ≤ 3). Then $〈 f, g 〉 = \int d^{δ} r \int d^{δ} r' f (r) \bar{g (r')} G (r, r'),$ $\left\langle {f,g} \right\rangle = \int {{{\rm{d}}^\delta }{\bf{r}}} \int {{d^\delta }{\bf{r}}\prime } f\left( {\bf{r}} \right)\overline {g\left( {{\bf{r}}\prime } \right)} G\left( {{\bf{r}},{\bf{r}}\prime } \right),$ (A.1)

where the (three-dimensional Newtonian) Green’s function G is $G (r, r') = {‖ r - r' ‖}^{- 1} = {(r^{2} + r'^{2} - 2 r r' \cos ϕ)}^{- 1 / 2},$ $G\left( {{\bf{r}},{\bf{r}}\prime } \right) = {\left\| {{\bf{r}} - {\bf{r}}\prime } \right\|^{ - 1}} = {\left( {{r^2} + r{\prime ^2} - 2rr\prime \,\cos \,\phi } \right)^{ - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}},$ (A.2)

and ϕ is the angle between the two position vectors. Also define the ordinary L² inner product, $(f, g) = \int d^{δ} r f (r) \bar{g (r)} .$ $\left( {f,g} \right) = \int {{d^\delta }{\bf{r}}f\left( {\bf{r}} \right)\overline {g\left( {\bf{r}} \right)} } .$ (A.3)

We write θ = r∂_r and $θ^{'} = r^{'} \partial_{r^{'}}$ $\theta ' = r'{\partial _{r'}}$ . Preliminaries: first note that $(θ + θ') G = - G,$ $\left( {\theta + \theta \prime } \right)G = - G,$ (A.4)

and also note that (from integration by parts on r) $(f + θ_{g}) + (θ f, g) = - δ (f, g) .$ $\left( {f + {\theta _g}} \right) + \left( {\theta f,g} \right) = - \delta \left( {f,g} \right).$ (A.5)

So we compute $\begin{array}{l} 〈 f, θ g 〉 = \int d^{δ} r \int d^{δ} r' f (r) G (r, r') θ' \bar{g (r')} \\ = - \int d^{δ} r \int d^{δ} r' [f (r) (δ G (r, r') \bar{g (r')} + \bar{g (r')} θ' G (r, r')] \\ = \int d^{δ} r \int d^{δ} r' [(1 - δ) f (r) G (r, r') \bar{g (r')} + f (r) \bar{g (r')} θ G (r, r')] \\ = \int d^{δ} r \int d^{δ} r' [(1 - 2 δ) f (r) G (r, r') \bar{g (r')} - \bar{g (r')} G (r, r') θ f (r)] \\ = (1 - 2 δ) 〈 f, g 〉 - 〈 θ f, g 〉, \end{array}$ $\matrix{ {\left\langle {f,{\theta g}} \right\rangle = \int {{{\rm{d}}^\delta }{\bf{r}}} \int {{d^\delta }{\bf{r}}\prime } f\left( {\bf{r}} \right)G\left( {{\bf{r}},{\bf{r}}\prime } \right)\theta \prime \overline {g\left( {{\bf{r}}\prime } \right)} } \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = - \int {{{\rm{d}}^\delta }{\bf{r}}} \int {{d^\delta }{\bf{r}}\prime \left[ {f\left( {\bf{r}} \right)\left( {\delta G\left( {{\bf{r}},{\bf{r}}\prime } \right)\overline {g\left( {{\bf{r}}\prime } \right)} + \overline {g\left( {{\bf{r}}\prime } \right)} \theta \prime G\left( {{\bf{r}},{\bf{r}}\prime } \right)} \right.} \right]} } \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \int {{{\rm{d}}^\delta }{\bf{r}}} \int {{d^\delta }{\bf{r}}\prime } \left[ {\left( {1 - \delta } \right)f\left( {\bf{r}} \right)G\left( {{\bf{r}},{\bf{r}}\prime } \right)\overline {g\left( {{\bf{r}}\prime } \right)} + f\left( {\bf{r}} \right)\overline {g\left( {{\bf{r}}\prime } \right)} \theta G\left( {{\bf{r}},{\bf{r}}\prime } \right)} \right]} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \int {{{\rm{d}}^\delta }{\bf{r}}} \int {{d^\delta }{\bf{r}}\prime } \left[ {\left( {1 - 2\delta } \right)f\left( {\bf{r}} \right)G\left( {{\bf{r}},{\bf{r}}\prime } \right)\overline {g\left( {{\bf{r}}\prime } \right)} - \overline {g\left( {{\bf{r}}\prime } \right)} G\left( {{\bf{r}},{\bf{r}}\prime } \right)}\theta f\left( {\bf{r}} \right) \right]} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \left( {1 - 2\delta } \right)\left\langle {f,g} \right\rangle - \left\langle {\theta f,g} \right\rangle ,} \hfill \cr }$ (A.6)

where to obtain the final result we applied (A.5), then (A.4), and then (A.5) again. So define 𝒟 in a δ-dependent way, as $𝒟=i (θ + δ - 1 / 2),$ ${\rm{D = i}}\left( {\theta + \delta - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right),$ (A.7)

and we can see that $〈 f, 𝒟_{g} 〉 = 〈 𝒟 f, g 〉 .$ $\left\langle {f,{{\rm{D}}_g}} \right\rangle = \left\langle {{\rm{D}}\,f,g} \right\rangle .$ (A.8)

Setting δ = 3 (no restriction to a hyperplane) gives the appropriate result for spherical geometry. For thin discs we have 𝒜 = 𝒟|_δ=2. We could also consider δ = 1 for an infinite line density.

Appendix B Commutator of 𝒟 and $\nabla^{2}$ ${\nabla ^2}$

Working on a δ-dimensional hyperplane again, write the potential using the Green’s function (A.2), $Φ (r) = \int d^{δ} r' ρ (r') G (r, r') .$ ${\rm{\Phi }}\left( {\bf{r}} \right) = \int {{d^\delta }{\bf{r}}\prime \rho \left( {{\bf{r}}\prime } \right)G\left( {{\bf{r}},{\bf{r}}\prime } \right)} .$ (B.1)

Now apply θ, giving $\begin{array}{l} θ Φ (r) = \int d^{δ} r' ρ (r') [G (r, r') + θ' G (r, r')] \\ = - Φ (r) + \int d^{δ} r' G (r, r') (δ + θ') ρ (r') \\ = (δ - 1) Φ (r) + \int d^{δ} r' G (r, r') θ' ρ (r'), \end{array}$ $\matrix{ {\theta {\rm{\Phi }}\left( {\bf{r}} \right) = \int {{d^\delta }{\bf{r}}\prime } \rho \left( {{\bf{r}}\prime } \right)\left[ {G\left( {{\bf{r}},{\bf{r}}\prime } \right) + \theta \prime G\left( {{\bf{r}},{\bf{r}}\prime } \right)} \right]} \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\, = - {\rm{\Phi }}\left( {\bf{r}} \right) + \int {{d^\delta }{\bf{r}}\prime G\left( {{\bf{r}},{\bf{r}}\prime } \right)\left( {\delta + \theta \prime } \right)\rho \left( {{\bf{r}}\prime } \right)} } \hfill \cr {\,\,\,\,\,\,\,\,\,\,\,\,\,\, = \left( {\delta - 1} \right){\rm{\Phi }}\left( {\bf{r}} \right) + \int {{d^\delta }{\bf{r}}\prime G\left( {{\bf{r}},{\bf{r}}\prime } \right)\theta \prime \rho \left( {{\bf{r}}\prime } \right),} } \hfill \cr }$ (B.2)

where we used (A.4) and then (A.5). Note that in the spherical (δ = 3) case this is equivalent to calculating the following commutator, $[\nabla_{l}^{2}, θ] = \nabla_{l}^{2} θ - θ \nabla_{l}^{2} = 2 \nabla_{l}^{2},$ $\left[ {\nabla _l^2,\theta } \right] = \nabla _l^2\theta - \theta \nabla _l^2 = 2\nabla _l^2,$ (B.3)

which can be shown directly by differentiation and the Leibniz rule; however (B.3) is inapplicable to the thin disc (δ = 2) case, so the previous derivation in terms of the Green’s function is required. Now, writing these results in terms of the self-adjoint operator 𝒟, we have $(𝒟+ (1 - δ) i) Φ= \int d^{δ} r' G (r, r') 𝒟' ρ (r') .$ $\left( {{\rm{D + }}\left( {1 - \delta } \right){\rm{i}}} \right){\rm{\Phi = }}\int {{d^\delta }{\bf{r}}\prime G\left( {{\bf{r}},{\bf{r}}\prime } \right){\rm{D}}\prime \rho \left( {{\bf{r}}\prime } \right)} .$ (B.4)

Specialising to the spherical case, if for some basis set {ρ_n} there exists a suitable index-raising polynomial P_n(s), we have $ρ_{n} = P_{n} (𝒟) ρ_{0}$ ${\rho _n} = {P_n}\left( {\rm{D}} \right){\rho _0}$ (B.5)

for the density functions, and $Φ_{n} = P_{n} (𝒟 - 2 i) Φ_{0}$ ${{\rm{\Phi }}_n} = {P_n}\left( {{\rm{D - 2i}}} \right){{\rm{\Phi }}_0}$ (B.6)

for the potentials. Analogously, in the thin disc case, we have $σ_{n} = P_{n} (𝒜) σ_{0}$ ${\sigma _n} = {P_n}\left( {\rm{A}} \right){\sigma _0}$ (B.7)

and $ψ_{n} = P_{n} (𝒜 - i) ψ_{0} .$ ${\psi _n} = {P_n}\left( {{\rm{A}} - {\rm{i}}} \right){\psi _0}.$ (B.8)

Appendix C The Fourier-Mellin transform

We develop expressions for the forwards and reverse Fourier-Mellin transform, and the corresponding orthogonality relation. A similar procedure is followed for both the spherical and the thin disc cases.

Appendix C.1 Spherical case

We work in spherical polar coordinates (r,ϑ, φ), with $r = r \hat{r}$ ${\bf{r}} = r{\bf{\hat r}}$ . Our density basis function for the Fourier-Mellin transform is Ψ_slm, defined in (25). The corresponding potential, obeying $\nabla^{2} ϕ_{s l m} = 4 π ψ_{s l m}$ ${\nabla ^2}{\phi _{slm}} = 4\pi {{\rm{\psi }}_{slm}}$ , is $ϕ_{s l m} (r) = \frac{- 4 π}{k_{l} (i s)} r^{- i s - 1 / 2} Y_{l m} (\hat{r}),$ ${\phi _{slm}}\left( {\bf{r}} \right) = {{ - 4\pi } \over {{k_l}\left( {{\rm{i}}s} \right)}}{r^{ - {\rm{i}}s - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}}\,{Y_{lm}}\left( {{\bf{\hat r}}} \right),$ (C.1)

where K_l(is) is defined in (27). The expansion of an arbitrary mass density F with respect to the Ψ_slm-basis is the Fourier-Mellin transform of F: $\begin{array}{l} 〈 F, Ψ_{s l m} 〉 = - \int d^{3} r F (r) \bar{ϕ_{s l m} (r)} \\ = \frac{4 π}{K_{l} (i s)} \int_{0}^{\infty} r^{2} d r r^{i s - 1 / 2} \int d^{2} \hat{r} \bar{Y_{l m} (\hat{r})} F (r) \\ = \frac{4 π}{K_{l} (i s)} ℳ_{r} {F_{l m} (r)} (5 / 2 + i s), \end{array}$ $\matrix{ {\left\langle {F,{{\rm{\Psi }}_{slm}}} \right\rangle = - \int {{{\rm{d}}^3}{\bf{r}}\,F} \left( {\bf{r}} \right)\overline {{\phi _{slm}}\left( {\bf{r}} \right)} } \hfill \cr {\quad \quad \quad \,\,\, = {{4\pi } \over {{K_l}\left( {{\rm{i}}s} \right)}}\int_0^\infty {{r^2}dr\,{r^{{\rm{i}}s - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}}} \,\int {{{\rm{d}}^2}{\bf{\hat r}}} \overline {{Y_{lm}}\left( {{\bf{\hat r}}} \right)}F\left( {\bf{r}} \right)} \hfill \cr {\quad \quad \quad \,\,\, = {{4\pi } \over {{K_l}\left( {{\rm{i}}s} \right)}}{{\rm{M}}_r}\left\{ {{F_{lm}}\left( r \right)} \right\}\left( {{5 \mathord{\left/ {\vphantom {5 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \right),} \hfill \cr }$ (C.2)

where $F_{l m} (r) = \int d^{2} \hat{r} \bar{Y_{l m} (\hat{r})} F (r)$ ${F_{lm}}\left( r \right) = \int {{{\rm{d}}^2}{\bf{\hat r}}\overline {{Y_{lm}}\left( {{\bf{\hat r}}} \right)} F\left( {\bf{r}} \right)}$ are the spherical multipole moments of F. Inverting this using the Mellin inversion theorem (30) (choosing the constant c = 5/2 in the integral), we have $F (r) = \frac{1}{8 π^{2}} \sum_{l m} \int_{- \infty}^{\infty} d s K_{l} (i s) Ψ_{s l m} (r) 〈 F, Ψ_{s l m} 〉 .$ $F\left( {\bf{r}} \right) = {1 \over {8{\pi ^2}}}\sum\limits_{lm} {\int_{ - \infty }^\infty {ds\,{K_l}\left( {{\rm{i}}s} \right)} \,\,{{\rm{\Psi }}_{slm}}} \left( {\bf{r}} \right)\,\left\langle {F,{{\rm{\Psi }}_{slm}}} \right\rangle .$ (C.3)

The potential corresponding to the density F can be expressed similarly by replacing Ψ_slm(r) in (C.3) by its potential ϕ_slm(r). Finally, the mutual energy of two densities F₁ and F₂ is $〈 F_{1}, F_{2} 〉 = \frac{1}{8 π^{2}} \sum_{l m} \int_{- \infty}^{\infty} d s K_{l} (i s) 〈 F_{1}, Ψ_{s l m} 〉 〈 Ψ_{s l m}, F_{2} 〉$ $\left\langle {{F_1},{F_2}} \right\rangle = {1 \over {8{\pi ^2}}}\sum\limits_{lm} {\int_{ - \infty }^\infty {ds\,{K_l}\left( {{\rm{i}}s} \right)} \,} \left\langle {{F_1},{{\rm{\Psi }}_{slm}}} \right\rangle \left\langle {{{\rm{\Psi }}_{slm}},{F_2}} \right\rangle$ (C.4)

and the Fourier-Mellin basis functions satisfy the orthogonality relation $〈 Ψ_{s l m}, Ψ_{t λ μ} 〉 = \frac{8 π^{2}}{K_{l} (i s)} δ_{m μ} δ_{l λ} δ (s - t) .$ $\left\langle {{{\rm{\Psi }}_{slm}},{{\rm{\Psi }}_{t\lambda \mu }}} \right\rangle = {{8{\pi ^2}} \over {{K_l}\left( {{\rm{i}}s} \right)}}{\delta _{m\mu }}{\delta _{l\lambda }}\,\delta \left( {s - t} \right).$ (C.5)

Appendix C.2 Disc case

We work in cylindrical polar coordinates (R, φ, z), with $R = R \hat{R}$ ${\bf{R}} = R{\bf{\hat R}}$ . Define 𝒜 = i(R∂_R + 3/2). Then for two arbitrary thin disc densities σ₁,σ₂ ∞ δ(z) we have 〈𝒜σ₁, σ₂〉 = 〈σ₁, 𝒜σ₂〉, so 𝒜 is self-adjoint (see Appendix A for proof, setting δ = 2 at the end to give the thin disc case). The eigenfunctions of 𝒜 are Σ_s(R) = R^−is−3/2 with real eigenvalue s. We then adjoin a cylindrical harmonic to form the basis functions (Kalnajs’ logarithmic spirals) ${\sum_{s m} (R) = \sum_{s} (R) e^{i m φ} = R^{- i s - 3 / 2} e}^{i m φ} .$ ${\sum\nolimits_{sm} {\left( {\bf{R}} \right) = \sum\nolimits_s {\left( R \right)} \,{{\rm{e}}^{{\rm{i}}m\varphi }} = {R^{ - is - {3 \mathord{\left/{\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}}}\,{\rm{e}}} ^{{\rm{i}}m\varphi }}.$ (C.6)

Using Toomre’s Hankel-transform method we can find the potential corresponding to this density, which is²⁵ $\begin{matrix} ψ_{s m} (R, φ, 0) = \frac{- π}{K_{m} (i s)} R^{- i s - 1 / 2} e^{i m φ}, & K_{m} (i s) = {| \frac{Γ (\frac{m + 3 / 2 + i s}{2})}{Γ (\frac{m + 1 / 2 + i s}{2})} |}^{2} \geq 0, \end{matrix}$ $\matrix{ {{\psi _{sm}}\left( {R,\varphi ,0} \right) = {{ - \pi } \over {{K_m}\left( {{\rm{i}}s} \right)}}{R^{ - {\rm{i}}s - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}}\,\,{{\rm{e}}^{{\rm{i}}m\varphi }},} & {{K_m}\left( {{\rm{i}}s} \right) = {{\left| {{{{\rm{\Gamma }}\left( {{\textstyle{{m + {3 \mathord{\left/ {\vphantom {3 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \over 2}}} \right)} \over {{\rm{\Gamma }}\left( {{\textstyle{{m + {3 \mathord{\left/ {\vphantom {1 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \over 2}}} \right)}}} \right|}^2} \ge 0,} \cr }$ (C.7)

so that in the plane (that is, first acting with the full Laplacian, then afterwards setting z = 0) we have ${\nabla^{2} ψ_{s m} (r) |}_{z = 0} = 4 π \sum_{s m} (R) .$ ${\left. {{\nabla ^2}{\psi _{sm}}\left( {\bf{r}} \right)} \right|_{z = 0}} = 4\pi \sum\nolimits_{sm} {\left( {\bf{R}} \right)} .$ (C.8)

Now we compute the thin disc Fourier-Mellin transform for an arbitrary thin disc density σ, $\begin{array}{l} 〈 σ, \sum_{s m} 〉 = - \int d^{3} r σ (R) δ (z) \bar{ψ_{s m} (r)} \\ = \frac{π}{K_{m} (i s)} \int_{0}^{\infty} R d R R^{i s - 1 / 2} σ_{m} (R) \int_{0}^{2 π} d ϕ e^{- i m φ} σ (R) \\ = \frac{π}{K_{m} (i s)} ℳ_{R} {σ_{m} (R)} (3 / 2 + i s), \end{array}$ $\matrix{ {\left\langle {\sigma ,{\sum _{sm}}} \right\rangle = - \int {{{\rm{d}}^3}{\bf{r}}\,\sigma \left( {\bf{R}} \right)\delta \left( z \right)\overline {{\psi _{sm}}\left( {\bf{r}} \right)} } } \hfill \cr {\quad \quad \quad \,\, = {\pi \over {{K_m}\left( {{\rm{i}}s} \right)}}\int_0^\infty {R\,dR\,{R^{{\rm{i}}s - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}}}\,{\sigma _m}\left( R \right)\,\int_0^{2\pi } {d\phi \,{{\rm{e}}^{ - {\rm{i}}m\varphi }}\sigma \left( {\bf{R}} \right)} } } \hfill \cr {\quad \quad \quad \,\, = {\pi \over {{K_m}\left( {{\rm{i}}s} \right)}}{{\rm{M}}_R}\left\{ {{\sigma _m}\left( R \right)} \right\}\left( {{3 \mathord{\left/ {\vphantom {3 {2 + {\rm{i}}s}}} \right. \kern-\nulldelimiterspace} {2 + {\rm{i}}s}}} \right),} \hfill \cr }$ (C.9)

where σ_m(R) are the cylindrical multipoles of σ(R). Using the Mellin inversion theorem to invert this transform (30) (with constant c = 3/2) gives $σ (R) = \frac{1}{4 π^{3}} \sum_{m = - \infty}^{\infty} \int_{- \infty}^{\infty} d s K_{m} (i s) \sum_{s m} (R) 〈 σ, \sum_{s m} 〉 .$ $\sigma \left( {\bf{R}} \right) = {1 \over {4{\pi ^3}}}\sum\limits_{m = - \infty }^\infty {\int_{ - \infty }^\infty {ds\,{K_m}\left( {{\rm{i}}s} \right)} } {\sum _{sm}}\left( {\bf{R}} \right)\left\langle {\sigma ,{\sum _{sm}}} \right\rangle .$ (C.10)

Therefore the mutual energy of two thin disc densities can be expressed as $〈 σ_{1}, σ_{2} 〉 = \frac{1}{4 π^{3}} \sum_{m = - \infty}^{\infty} \int_{- \infty}^{\infty} d s K_{m} (i s) 〈 σ_{1,} \sum_{s m} 〉 〈 \sum_{s m}, σ_{2} 〉 .$ $\left\langle {{\sigma _1},{\sigma _2}} \right\rangle = {1 \over {4{\pi ^3}}}\sum\limits_{m = - \infty }^\infty {\int_{ - \infty }^\infty {ds\,{K_m}\left( {{\rm{i}}s} \right)\left\langle {{\sigma _{1,}}{\sum _{sm}}} \right\rangle } } \left\langle {{\sum _{sm}},{\sigma _2}} \right\rangle .$ (C.11)

We also have the orthogonality relation $〈 \sum_{s m}, \sum_{t μ} 〉 = \frac{4 π^{3}}{K_{m} (i s)} δ_{m μ} δ (t - s) .$ $\left\langle {{\sum _{sm}},{\sum _{t\mu }}} \right\rangle = {{4{\pi ^3}} \over {{K_m}\left( {{\rm{i}}s} \right)}}{\delta _{m\mu }}\,\delta \left( {t - s} \right).$ (C.12)

As noted in Sect. 3.2.3, these results are independent of the z-dependence of the potential away from the disc plane.

Appendix D Orthogonality relation

Appendix D.1 Spherical case

For the inner product of any two density basis functions ρ_nlm we have $\begin{array}{l} 〈 ρ_{n l m}, ρ_{n' l' m'} 〉 = \frac{δ_{l l'} δ_{m m'}}{8 π^{2}} \int_{- \infty}^{\infty} d s K_{l} (i s) 〈 Ψ_{s l m}, ρ_{n l m} 〉 〈 ρ_{n' l m}, Ψ_{s l m} 〉 \\ = \frac{δ_{l l'} δ_{m m'}}{8 π^{2}} \int_{- \infty}^{\infty} d s K_{l} (i s) 〈 Ψ_{s l m}, P_{n l} (𝒟) ρ_{0 l m} 〉 〈 P_{n' l} (𝒟) ρ_{0 l m}, Ψ_{s l m} 〉 \\ = \frac{δ_{l l'} δ_{m m'}}{8 π^{2}} \int_{- \infty}^{\infty} d s K_{l} (i s) \bar{P_{n l} (s)} P_{n' l} (s) {| 〈 Ψ_{s l m}, ρ_{0 l m} 〉 |}^{2} . \end{array}$ $\matrix{ {\left\langle {{\rho _{nlm}},{\rho _{n\prime l\prime m\prime }}} \right\rangle = {{{\delta _{ll\prime }}{\delta _{mm\prime }}} \over {8{\pi ^2}}}\int_{ - \infty }^\infty {ds\,\,{K_l}\left( {{\rm{i}}s} \right)\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{nlm}}} \right\rangle \left\langle {{\rho _{n\prime lm}},{{\rm{\Psi }}_{slm}}} \right\rangle } } \hfill \cr {\quad \quad \quad \quad \,\,\,\,\,\, = {{{\delta _{ll\prime }}{\delta _{mm\prime }}} \over {8{\pi ^2}}}\int_{ - \infty }^\infty {ds\,\,{K_l}\left( {{\rm{i}}s} \right)\left\langle {{{\rm{\Psi }}_{slm}},{P _{nl}}\left( D \right){\rho _{0lm}}} \right\rangle \left\langle {{P_{n\prime l}}\left( D \right){\rho _{0lm}},{{\rm{\Psi }}_{slm}}} \right\rangle } } \hfill \cr {\quad \quad \quad \quad \,\,\,\,\,\,\, = {{{\delta _{ll\prime }}{\delta _{mm\prime }}} \over {8{\pi ^2}}}\int_{ - \infty }^\infty {ds\,\,{K_l}\left( {{\rm{i}}s} \right)\overline {{P_{nl}}\left( s \right)} {P_{n\prime l}}\left( s \right){{\left| {\left\langle {{{\rm{\Psi }}_{slm}},{\rho _{0lm}}} \right\rangle } \right|}^2}} } \hfill \cr } .$ (D.1)

The non-polynomial factors in the above expression are collected into a weight function w_l(s), which can be written explicitly in terms of the Mellin transform of the zeroth-order density (or a similar expression in terms of the zeroth-order potential - see (84)), $\begin{array}{l} ω_{l} (s) = \frac{K_{l} (i s)}{8 π^{2}} {| 〈 ρ_{0 l m}, Ψ_{s l m} 〉 |}^{2} \\ = \frac{2 K_{0 l}^{2}}{K_{l} (i s)} {| ℳ_{r} {ρ_{0 l} (r)} (5 / 2 + i s) |}^{2} . \end{array}$ $\matrix{ {{\omega _l}\left( s \right) = {{{K_l}\left( {{\rm{i}}s} \right)} \over {8{\pi ^2}}}{{\left| {\left\langle {{\rho _{0lm}},{{\rm{\Psi }}_{slm}}} \right\rangle } \right|}^2}} \hfill \cr {\quad \quad \, = {{2K_{0l}^2} \over {{K_l}\left( {{\rm{i}}s} \right)}}{{\left| {{M_r}\left\{ {{\rho _{0l}}\left( r \right)} \right\}\left( {{5 \mathord{\left/ {\vphantom {5 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \right)} \right|}^2}} \hfill \cr } .$ (D.2)

We also assume we have found the (real) monic polynomials orthogonal with respect to the weight function w_l(s), writing them as p_nl(s), so that $\int_{- \infty}^{\infty} ω_{l} (s) p_{n l} (s) p_{n' l} (s) = δ_{n n'} h_{n l} .$ $\int_{ - \infty }^\infty {{\omega _l}\left( s \right){p_{nl}}\left( s \right){p_{n\prime l}}\left( s \right) = {\delta _{nn\prime }}{h_{nl}}} .$ (D.3)

Now write P_nl(s) in terms of p_nl(s) as $P_{n l} (s) = i^{- n} p_{n l} (s),$ ${P_{nl}}\left( s \right) = {{\rm{i}}^{ - n}}{p_{nl}}\left( s \right),$ (D.4)

so that the orthogonality relation for the p_nlm becomes $\begin{array}{l} 〈 ρ_{n l m}, ρ_{n' l' m'} 〉 = δ_{l l'} δ_{m m'} \int_{- \infty}^{\infty} d s ω_{l} (s) \bar{P_{n l} (s)} P_{n' l} (s) \\ = δ_{l l'} δ_{m m'} {(- i)}^{- n} i^{- n'} \int_{- \infty}^{\infty} ω_{l} (s) p_{n l} (s) p_{n' l} (s) \\ = δ_{l l'} δ_{m m'} δ_{n n'} h_{n l} . \end{array}$ $\matrix{ {\left\langle {{\rho _{nlm}},{\rho _{n\prime l\prime m\prime }}} \right\rangle = {\delta _{ll\prime }}{\delta _{mm\prime }}\,\,\,\int_{ - \infty }^\infty {ds\,\,{\omega _l}\left( s \right)\overline {{P_{nl}}\left( s \right)} \,{P_{n\prime l}}\left( s \right)} } \hfill \cr {\quad \quad \quad \quad \,\,\,\,\,\, = {\delta _{ll\prime }}{\delta _{mm\prime }}{{\left( { - {\rm{i}}} \right)}^{ - n}}{{\rm{i}}^{ - n\prime }}\,\,\,\int_{ - \infty }^\infty {{\omega _l}\left( s \right)\,\,{p_{nl}}\left( s \right)\,\,{p_{n\prime l}}\left( s \right)} } \hfill \cr {\quad \quad \quad \quad \,\,\,\,\,\, = {\delta _{ll\prime }}{\delta _{mm\prime }}{\delta _{nn\prime }}{h_{nl}}} \hfill \cr }.$ (D.5)

We have that P_nl(𝒟) is a real operator, because $\begin{array}{l} \bar{P_{n l} (𝒟)} = {(- i)}^{- n} p_{n l} (\bar{𝒟}) \\ = {(- i)}^{- n} p_{n l} (- D) \\ = {(- i)}^{- n} {(- 1)}^{- n} p_{n l} (𝒟) \\ = i^{- n} p_{n l} (𝒟) \\ = P_{n l} (𝒟) . \end{array}$ $\matrix{ {\overline {{P_{nl}}\left( D \right)} = {{\left( { - {\rm{i}}} \right)}^{ - n}}\,\,{p_{nl}}\left( {\overline D } \right)} \hfill \cr {\quad \quad \quad = {{\left( { - {\rm{i}}} \right)}^{ - n}}\,\,{p_{nl}}\left( { - D} \right)} \hfill \cr {\quad \quad \,\,\,\,\,\, = {{\left( { - {\rm{i}}} \right)}^{ - n}}\,\,{{\left( { - 1} \right)}^{ - n}}\,\,{p_{nl}}\left( D \right)} \hfill \cr {\quad \,\,\,\,\,\,\,\,\,\,\,\, = {{\rm{i}}^{ - n}}\,\,{p_{nl}}\left( D \right)} \hfill \cr {\quad \quad \,\,\,\,\,\, = {P_{nl}}\left( D \right)} \hfill \cr } .$ (D.6)

This ensures that applying P_nl(𝒟) to a real function (such as p_0l(r)) gives a real result. Note that we used p_nl(−x) = (−l)ⁿp_nl(x), which is true for any orthogonal polynomial where the weight function and domain of integration are both symmetric.

Appendix D.2 Thin disc case

For σ_nm = P_nm(𝒜)σ_0m we have the orthogonality relation $\begin{array}{l} 〈 σ_{n m}, σ_{n' m'} 〉 = \frac{δ_{m m'}}{4 π^{3}} \int_{- \infty}^{\infty} d s K_{m} (i s) 〈 P_{n m} (𝒜) σ_{0 m}, \sum_{s m} 〉 〈 \sum_{s m}, P_{n' m} (𝒜) σ_{0 m} 〉 \\ = \frac{δ_{m m'}}{4 π^{3}} \int_{- \infty}^{\infty} d s K_{m} (i s) {| 〈 σ_{0 m}, \sum_{s m} 〉 |}^{2} P_{n m} (s) \bar{P_{n' m} (s)} \\ = δ_{m m'} \int_{- \infty}^{\infty} d s Ω_{m} (s) P_{n m} (s) \bar{P_{n' m} (s)}, \end{array}$ $\matrix{ {\left\langle {{\sigma _{nm}},{\sigma _{n\prime m\prime }}} \right\rangle = {{{\delta _{mm\prime }}} \over {4{\pi ^3}}}\int_{ - \infty }^\infty {ds\,\,{K_m}\left( {{\rm{i}}s} \right)\left\langle {{P_{nm}}\left( A \right){\sigma _{0m}},{\sum _{sm}}} \right\rangle \left\langle {{\sum _{sm}},{P_{n\prime m}}\left( A \right){\sigma _{0m}}} \right\rangle } } \hfill \cr {\quad \quad \quad \quad \,\,\, = {{{\delta _{mm\prime }}} \over {4{\pi ^3}}}\int_{ - \infty }^\infty {ds\,\,{K_m}\left( {{\rm{i}}s} \right)\,\,{{\left| {\left\langle {{\sigma _{0m}},{\sum _{sm}}} \right\rangle } \right|}^2}{P_{nm}}\left( s \right)\overline {{P_{n\prime m}}\left( s \right)} } } \hfill \cr {\quad \quad \quad \quad \,\,\, = {\delta _{mm\prime }}\int_{ - \infty }^\infty {ds{{\rm{\Omega }}_m}\left( s \right)\,\,{P_{nm}}\left( s \right)\,\,\overline {{P_{n\prime m}}\left( s \right)} ,} } \hfill \cr }$ (D.7)

where the weight function can be written in terms of either the zeroth-order potential or density, $\begin{array}{l} Ω_{m} (s) = \frac{K_{m} (i s)}{4 π^{3}} {| ℳ_{R} {ψ_{0 m} (R)} (1 / 2 + i s) |}^{2} \\ = \frac{{| ℳ_{R} {σ_{0 m} (R)} (3 / 2 + i s) |}^{2}}{4 π K_{m} (i s)} \end{array}$ $\matrix{ {{{\rm{\Omega }}_m}\left( s \right) = {{{K_m}\left( {{\rm{i}}s} \right)} \over {4{\pi ^3}}}{{\left| {{M_R}\left\{ {{\psi _{0m}}\left( R \right)} \right\}\left( {{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \right)} \right|}^2}} \hfill \cr {\quad \quad \,\,\,\, = {{{{\left| {{M_R}\left\{ {{\sigma _{0m}}\left( R \right)} \right\}\left( {{3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2} + {\rm{i}}s} \right)} \right|}^2}} \over {4\pi {K_m}\left( {{\rm{i}}s} \right)}}} \hfill \cr }$ (D.8)

Appendix E Classical polynomials

Here we record two types of orthogonal polynomial that are used in Sect. 4 –the continuous Hahn and the Meixner-Pollaczek polynomials. We summarise only the properties that are relevant for our purposes, and direct the reader to other sources for more comprehensive information (DLMF, Sect. 18.19).

These two polynomials are perhaps obscure compared to the well-known classical polynomials of Jacobi, Laguerre and Hermite. However, a slight generalisation of the notion of ‘classical’ leads to the Askey scheme (Koekoek et al. 2010), according to which the continuous Hahn and Meixner-Pollaczek polynomials lie just one level above the Jacobi polynomials. Like the standard classical polynomials, all Askey polynomials possess 1. closed form expressions in terms of hypergeometric functions, and 2. three-term recurrence relations with simple expressions for the recurrence coefficients. The latter property means that detailed knowledge about the polynomials is usually unnecessary, and the end-user can just plug in the recurrence formulas (E.4) and (E.11).

Appendix E.1 Continuous Hahn

The continuous Hahn polynomials conventionally take four real parameters, usually written in terms of two complex parameters: $(a, b, \bar{a}, \bar{b})$ $\left( {a,b,\bar a,\bar b} \right)$ . We restrict ourselves to the case of two real parameters²⁶, so $a = \bar{a}$ $a = \bar a$ and $b = \bar{b}$ $b = \bar b$ , and an explicit representation in terms of a terminating ₃F₂ hypergeometric series is $\begin{array}{l} p_{n} (s; a, b) = i^{n} \frac{{(2 a)}_{n} {(a + b)}_{n}}{n!}_{3} F_{2} (\begin{matrix} - n, n + 2 a + 2 b - 1, a + i s \\ 2 a, a + b \end{matrix} | 1) . \end{array}$ $\matrix{ {{p_n}\left( {s;a,b} \right) = {{\rm{i}}^n}{{{{\left( {2a} \right)}_n}{{\left( {a + b} \right)}_n}} \over {n!}}_3{F_2}\left( {\left. {\matrix{ { - n,n + 2a + 2b - 1,a + {\rm{i}}s} \cr {2a,a + b} \cr } } \right|1} \right).} \hfill \cr }$ (E.1)

The orthogonality relation is $\begin{matrix} \int_{- \infty}^{\infty} p_{n} (s; a, b) p_{m} (s; a, b) d s = δ_{n m} h_{n} (a, b), & where & h_{n} (a, b) = \frac{2 π Γ (n + 2 a) Γ (n + 2 b) Γ {(n + a + b)}^{2}}{n! (2 n + 2 a + 2 b - 1) Γ (n + 2 a + 2 b - 1)} . \end{matrix}$ $\matrix{ {\int_{ - \infty }^\infty {{p_n}\left( {s;a,b} \right){p_m}\left( {s;a,b} \right)ds = {\delta _{nm}}{h_n}\left( {a,b} \right),} } & {{\rm{where}}} & {{h_n}\left( {a,b} \right) = {{2\pi \,\Gamma \left( {n + 2a} \right)\Gamma \left( {n + 2b} \right)\Gamma {{\left( {n + a + b} \right)}^2}} \over {n!\left( {2n + 2a + 2b - 1} \right)\Gamma \left( {n + 2a + 2b - 1} \right)}}.} \cr }$ (E.2)

Note that p_n(s; a, b) is a real-valued polynomial in s of degree n, symmetric in the parameters a and b, despite the fact that s appears (abnormally) in the ‘parameter’ part of the hypergeometric function. Like any orthogonal polynomial on a symmetric interval, each individual polynomial is either an even or an odd function, according to the parity relation p_n(−s; a, b) = (−1)ⁿp_n(s; a, b). We also define the monic form of the polynomials, $\begin{matrix} {\hat{p}}_{n} (s; a, b) = p_{n} (s; a, b) / k_{n} (a, b), & where & k_{n} (a, b) = \frac{{(n + 2 a + 2 b - 1)}_{n}}{n!} . \end{matrix}$ $\matrix{ {{{{{\hat p}_n}\left( {s;a,b} \right) = {p_n}\left( {s;a,b} \right)} \mathord{\left/ {\vphantom {{{{\hat p}_n}\left( {s;a,b} \right) = {p_n}\left( {s;a,b} \right)} {{k_n}\left( {a,b} \right),}}} \right. \kern-\nulldelimiterspace} {{k_n}\left( {a,b} \right),}}} & {{\rm{where}}} & {{k_n}\left( {a,b} \right) = {{{{\left( {n + 2a + 2b - 1} \right)}_n}} \over {n!}}.} \cr }$ (E.3)

The monic form obeys the three-term recurrence relation $\begin{array}{l} \begin{matrix} {\hat{p}}_{- 1} (s; a, b) = 0; & {\hat{p}}_{0} (s; a, b) = 1; \end{matrix} \\ \begin{matrix} {\hat{p}}_{n + 1} (s; a, b) = s {\hat{p}}_{n} (s; a, b) - β_{n} (a, b) {\hat{p}}_{n - 1} (s; a, b) & where & β_{n} (a, b) = \frac{n (n + 2 a - 1) (n + 2 b - 1) (b + 2 a + 2 b - 2)}{4 (2 n + 2 a + 2 b - 3) (2 n + 2 a + 2 b - 1)} . \end{matrix} \end{array}$ $\matrix{ {\matrix{ {{{\hat p}_{ - 1}}\left( {s;a,b} \right) = 0;} & {{{\hat p}_0}\left( {s;a,b} \right) = 1;} \cr } } \hfill \cr {\matrix{ {{{\hat p}_{n + 1}}\left( {s;a,b} \right) = s\,{{\hat p}_n}\left( {s;a,b} \right) - {\beta _n}\left( {a,b} \right){{\hat p}_{n - 1}}\left( {s;a,b} \right)} & {{\rm{where}}} & {{\beta _n}\left( {a,b} \right) = {{n\left( {n + 2a - 1} \right)\left( {n + 2b - 1} \right)\left( {b + 2a + 2b - 2} \right)} \over {4\left( {2n + 2a + 2b - 3} \right)\left( {2n + 2a + 2b - 1} \right)}}.} \cr } } \hfill \cr }$ (E.4)

Appendix E.2 Meixner-Pollaczek

The Meixner-Pollaczek polynomials are another set of orthogonal polynomials on the interval (−∞, ∞), depending on two real parameters λ and ϕ, and have an explicit representation in terms of a terminating₂ F₁ hypergeometric function $P_{n}^{(λ)} (x; ϕ) = \frac{{(2 λ)}_{n} e^{i n ϕ}}{n!} 2 F_{1} (\begin{matrix} - n, λ + i x \\ 2 λ \end{matrix} | 1 - e^{- 2 i ϕ}) .$ $P_n^{\left( \lambda \right)}\left( {x;\phi } \right) = {{{{\left( {2\lambda } \right)}_n}{{\rm{e}}^{{\rm{i}}n\phi }}} \over {n!}}2{F_1}\left( {\left. {\matrix{ { - n,\lambda + {\rm{i}}x} \cr {2\lambda } \cr } } \right|1 - {{\rm{e}}^{ - 2{\rm{i}}\phi }}} \right).$ (E.5)

The orthogonality relation is $\begin{matrix} \int_{- \infty}^{\infty} P_{n}^{(λ)} (s; ϕ) P_{m}^{(λ)} (s; ϕ) d s = δ_{n m} h_{n}^{(λ)} (ϕ), & where & h_{n}^{(λ)} (ϕ) = \frac{2 π Γ (n + 2 λ)}{{(2 \sin ϕ)}^{2 λ} n!} . \end{matrix}$ $\matrix{ {\int_{ - \infty }^\infty {P_n^{\left( \lambda \right)}\left( {s;\phi } \right)P_m^{\left( \lambda \right)}\left( {s;\phi } \right)ds = {\delta _{nm}}h_n^{\left( \lambda \right)}\left( \phi \right),} } & {{\rm{where}}} & {h_n^{\left( \lambda \right)}\left( \phi \right) = {{2\pi \,\Gamma \left( {n + 2\lambda } \right)} \over {{{\left( {2\sin \phi } \right)}^{2\lambda }}n!}}.} \cr }$ (E.6)

Note that once again the variable x appears in the parameter part of the hypergeometric function. The weight function is $w^{(λ)} (x; ϕ) = {| Γ (λ + i x) |}^{2} e^{(2 ϕ - π) x} .$ ${w^{\left( \lambda \right)}}\left( {x;\phi } \right) = {\left| {\Gamma \left( {\lambda + {\rm{i}}x} \right)} \right|^2}{{\rm{e}}^{\left( {2\phi - \pi } \right)x}}.$ (E.7)

In the case that the parameter ϕ = π/2, the Meixner-Pollaczek polynomials can be derived from the continuous Hahn polynomials in two different ways (DLMF, Sect. 18.21): if the two parameters (of the latter) differ by one half $p_{n} (x; a, a + 1 / 2) = \frac{{(n + 4 a)}_{n}}{2^{2 n}} P_{n}^{(2 a)} (2 s; \frac{π}{2}),$ ${p_n}\left( {x;a,a + {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-\nulldelimiterspace} 2}} \right) = {{{{\left( {n + 4a} \right)}_n}} \over {{2^{2n}}}}P_n^{\left( {2a} \right)}\left( {2s;{\pi \over 2}} \right),$ (E.8)

or if the second parameter is taken to infinity $\lim_{b \to \infty} {\frac{p_{n} (x; a, b)}{{(a + b)}_{n}}} = P_{n}^{(a)} (s; \frac{π}{2}) .$ $\mathop {\lim }\limits_{b \to \infty } \left\{ {{{{p_n}\left( {x;a,b} \right)} \over {{{\left( {a + b} \right)}_n}}}} \right\} = P_n^{\left( a \right)}\left( {s;{\pi \over 2}} \right).$ (E.9)

The monic form is $\begin{matrix} {\hat{P}}_{n}^{(λ)} (s; ϕ) = P_{n}^{(λ)} (s; ϕ) / k_{n} (ϕ), & where & k_{n} (ϕ) = \frac{{(2 \sin ϕ)}^{n}}{n!}, \end{matrix}$ $\matrix{ {{{\hat P_n^{\left( \lambda \right)}\left( {s;\phi } \right) = P_n^{\left( \lambda \right)}\left( {s;\phi } \right)} \mathord{\left/ {\vphantom {{\hat P_n^{\left( \lambda \right)}\left( {s;\phi } \right) = P_n^{\left( \lambda \right)}\left( {s;\phi } \right)} {{k_n}\left( \phi \right),}}} \right. \kern-\nulldelimiterspace} {{k_n}\left( \phi \right),}}} & {{\rm{where}}} & {{k_n}\left( \phi \right) = {{{{\left( {2\sin \phi } \right)}^n}} \over {n!}},} \cr }$ (E.10)

and for the case ϕ = π/2 the three-term recurrence relation is $\begin{array}{l} \begin{matrix} {\hat{P}}_{- 1}^{(λ)} (s; π / 2) = 0; & {\hat{P}}_{0}^{(λ)} (s; π / 2) = 1; \end{matrix} \\ \begin{matrix} {\hat{P}}_{n + 1}^{(λ)} (s; π / 2) = s {\hat{P}}_{n}^{(λ)} (s; π / 2) - β_{n}^{(λ)} {\hat{P}}_{n - 1}^{(λ)} (s; π / 2), & where & β_{n}^{(λ)} = \frac{n (n + 2 λ - 1)}{4} . \end{matrix} \end{array}$ $\matrix{ {\matrix{ {\hat P{ - 1}^{\left( \lambda \right)}\left( {s;\pi /2} \right) = 0;} & {\hat P_0^{\left( \lambda \right)}\left( {s;\pi /2} \right) = 1;} \cr } } \hfill \cr {\matrix{ {\hat P_{n + 1}^{\left( \lambda \right)}\left( {s;\pi /2} \right) = s\,\hat P_n^{\left( \lambda \right)}\left( {s;\pi /2} \right) - \beta n^{\left( \lambda \right)}\hat P_{n - 1}^{\left( \lambda \right)}\left( {s;\pi /2} \right),} & {{\rm{where}}} & {\beta _n^{\left( \lambda \right)} = {{n\left( {n + 2\lambda - 1} \right)} \over 4}.} \cr } } \hfill \cr }$ (E.11)

Appendix F Exponential disc potential

We find the potential multipoles corresponding to the exponential disc density given in (77), using Toomre’s Hankel transform method as a starting point. Applying the Toomre method to the disc density gives an auxiliary function $g_{m} (k) = - 2 π \int_{0}^{\infty} R σ_{m}^{\exp} (R) J_{m} (k R) d R = - \frac{2^{2 + m} \sqrt{π} Γ (m + 3 / 2) k^{m}}{{(1 + k^{2})}^{m + 3 / 2}},$ ${g_m}\left( k \right) = - 2\pi \int_0^\infty {R\sigma _m^{\exp }\left( R \right){J_m}\left( {kR} \right)dR = - {{{2^{2 + m}}\sqrt \pi \Gamma \left( {m + {3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}} \right){k^m}} \over {{{\left( {1 + {k^2}} \right)}^{{{m + 3} \mathord{\left/ {\vphantom {{m + 3} 2}} \right. \kern-\nulldelimiterspace} 2}}}}}} ,$ (F.1)

from which the potential is found via $ψ_{m}^{\exp} (R) = \int_{0}^{\infty} g_{m} (k) J_{m} (k R) d k .$ $\psi _m^{\exp }\left( R \right) = \int_0^\infty {{g_m}\left( k \right){J_m}\left( {kR} \right)dk.}$ (F.2)

Surprisingly, this integral does not appear in the standard tables, and computer algebra provides an unsatisfactory result involving a Meijer G-function. The m = 0 case is given (79), but to derive the higher-orders we need to combine two basic ideas. Firstly, Lynden-Bell (1989) shows how to (in effect) raise the angular index m of the RHS of (F.2), using an operator (modifying his notation) $Δ_{m} = R^{m} \partial_{R} R^{- m} = \partial_{R} + (1 - m) / R$ ${{\rm{\Delta }}_m} = {R^m}{\partial _R}{R^{ - m}} = {\partial _R} + {{\left( {1 - m} \right)} \mathord{\left/ {\vphantom {{\left( {1 - m} \right)} R}} \right. \kern-\nulldelimiterspace} R}$ (F.3)

that obeys (for generic ψ_m and ɡ_m) $Δ_{m} ψ_{m} = - \int_{0}^{\infty} k g_{m} (k) J_{m + 1} (k R) d k .$ ${{\rm{\Delta }}_m}{\psi _m} = - \int_0^\infty {k\,{g_m}} \left( k \right){J_{m + 1}}\left( {kR} \right)dk.$ (F.4)

Secondly, inspired by the use of the operator θ = R∂_R in the main part of the present work, we apply it to (F.2) and perform some integration by parts to find (writing θ_k = k∂_k) $(θ + 1) ψ_{m} = - \int_{0}^{\infty} θ_{k} (g_{m} (k)) J_{m} (k R) d k .$ $\left( {\theta + 1} \right){\psi _m} = - \int_0^\infty {{\theta _k}\left( {{g_m}\left( k \right)} \right){J_m}\left( {kR} \right)dk.}$ (F.5)

It remains to apply linear combinations of θ and Δ_m to (F.2), and then rearrange the terms inside the integral sign according to our knowledge of ɡ_m(k) such that only a term proportional to ɡ_m+1(k) J_m+1(kR) remains on the RHS. The result is the recursion relation given in (80).

Appendix G Exact moments for the isochrone

Using the expressions for the isochrone model in (81), we seek the modified moments of self-energy ${\tilde{μ}}_{j l} = 〈 {\tilde{p}}_{j l} (𝒟) ρ_{0 l}^{iso}, ρ_{0 l}^{iso} 〉 = \int_{- \infty}^{\infty} d s ω_{l}^{iso} (s) {\tilde{p}}_{j l} (s) = - \int_{0}^{\infty} d r r^{2} ρ_{0 l}^{iso} {\tilde{p}}_{j l} (𝒟 - 2 i) Φ_{0 l}^{iso} .$ ${\tilde \mu _{jl}} = \left\langle {{{\tilde p}_{jl}}\left( {\cal D} \right)\rho _{0l}^{{\rm{iso}}},\rho _{0l}^{{\rm{iso}}}} \right\rangle = {\int_{ - \infty }^\infty {ds\,\omega _l^{{\rm{iso}}}\left( s \right)\tilde p} _{jl}}\left( s \right) = - \int_0^\infty {dr\,{r^2}\rho _{0l}^{{\rm{iso}}}{{\tilde p}_{jl}}\left( {{\cal D} - 2{\rm{i}}} \right)\Phi _{0l}^{{\rm{iso}}}.}$ (G.1)

The auxiliary polynomials p̃_jl(s) here are the monic Hermite polynomials²⁷. To facilitate variable substitutions in this integral, it is useful to rewrite both $Φ_{0 l}^{iso}$ $\Phi _{0l}^{{\rm{iso}}}$ and $ρ_{0 l}^{iso}$ $\rho _{0l}^{{\rm{iso}}}$ in rationalised-surd form, $\begin{array}{l} Φ_{0 l}^{iso} (r) = - \frac{{(1 - \sqrt{1 + r^{2}})}^{1 + 2 l}}{r^{2 + 3 l}}, \\ ρ_{0 l}^{iso} (r) = - \frac{(1 + 2 l) {(1 - \sqrt{1 + r^{2}})}^{2 + 2 l}}{4 π r^{4 + 3 l} {(1 + r^{2})}^{3 / 2}} [1 + 2 (1 + l) \sqrt{1 + r^{2}}] . \end{array}$ $\matrix{ {\Phi _{0l}^{{\rm{iso}}}\left( r \right) = - {{{{\left( {1 - \sqrt {1 + {r^2}} } \right)}^{1 + 2l}}} \over {{r^{2 + 3l}}}},} \hfill \cr {\rho _{0l}^{{\rm{iso}}}\left( r \right) = - {{\left( {1 + 2l} \right){{\left( {1 - \sqrt {1 + {r^2}} } \right)}^{2 + 2l}}} \over {4\pi {r^{4 + 3l}}{{\left( {1 + {r^2}} \right)}^{{3 \mathord{\left/ {\vphantom {3 2}} \right. \kern-\nulldelimiterspace} 2}}}}}\left[ {1 + 2\left( {1 + l} \right)\sqrt {1 + {r^2}} } \right].} \hfill \cr }$ (G.2)

We also define an auxiliary quantity K_jl, $K_{j l} = (1 + 2 l) \int_{0}^{1} d t {(1 + t)}^{- 5 / 2 - 3 l} {(1 - t)}^{l + 1 / 2} t^{1 + 2 l + j} = \frac{(1 + 2 l) B (l + 3 / 2, 2 l + j + 2)}{2^{3 l + 5 / 2}}_{2} F_{1} (\begin{matrix} l + 3 / 2, 3 l + 5 / 2 \\ 3 l + j + 7 / 2 \end{matrix} | \frac{1}{2}) .$ ${K_{jl}} = \left( {1 + 2l} \right)\int_0^1 {dt} {\left( {1 + t} \right)^{{{ - 5} \mathord{\left/ {\vphantom {{ - 5} {2 - 3l}}} \right. \kern-\nulldelimiterspace} {2 - 3l}}}}{\left( {1 - t} \right)^{{{l + 1} \mathord{\left/ {\vphantom {{l + 1} 2}} \right. \kern-\nulldelimiterspace} 2}}}{t^{1 + 2l + j}} = {{\left( {1 + 2l} \right){\rm{B}}\left( {{{l + 3} \mathord{\left/ {\vphantom {{l + 3} {2,2l + j + 2}}} \right. \kern-\nulldelimiterspace} {2,2l + j + 2}}} \right)} \over {{2^{{{3l + 5} \mathord{\left/ {\vphantom {{3l + 5} 2}} \right. \kern-\nulldelimiterspace} 2}}}}}_2{F_1}\left( {\left. {\matrix{ {{{l + 3} \mathord{\left/ {\vphantom {{l + 3} {2,3l + {5 \mathord{\left/ {\vphantom {5 2}} \right. \kern-\nulldelimiterspace} 2}}}} \right. \kern-\nulldelimiterspace} {2,3l + {5 \mathord{\left/ {\vphantom {5 2}} \right. \kern-\nulldelimiterspace} 2}}}} \cr {{{3l + j + 7} \mathord{\left/ {\vphantom {{3l + j + 7} 2}} \right. \kern-\nulldelimiterspace} 2}} \cr } } \right|{1 \over 2}} \right).$ (G.3)

In fact K_jl can always be reduced (by a computer algebra system such as MATHEMATICA) to a form a + b/π with a, b rational. Writing the integral for the zeroth-order self-energy µ_0l with the variable substitution $t = 1 / \sqrt{1 + r^{2}}$ $t = 1/\sqrt {1 + {r^2}}$ , we find that $\begin{array}{l} μ_{0 l} = - \int_{0}^{\infty} d r r^{2} ρ_{0 l} Φ_{0 l} \\ = (1 + 2 l) \int_{0}^{1} d t t^{1 + 2 l} {(1 - t)}^{1 / 2 + l} {(1 + t)}^{- 5 / 2 - 3 l} [t + 2 (l + 1)] \\ - K_{1 l} + 2 (l + 1) K_{0 l} . \end{array}$ $\matrix{ {{\mu _{0l}} = - \int_0^\infty {dr\,{r^2}{\rho _{0l}}{\Phi _{0l}}} } \hfill \cr { = \left( {1 + 2l} \right)\int_0^1 {dt\,{t^{1 + 2l}}{{\left( {1 - t} \right)}^{{1 \mathord{\left/ {\vphantom {1 {2 + l}}} \right. \kern-\nulldelimiterspace} {2 + l}}}}{{\left( {1 + t} \right)}^{{{ - 5} \mathord{\left/ {\vphantom {{ - 5} {2 - 3l}}} \right. \kern-\nulldelimiterspace} {2 - 3l}}}}\left[ {t + 2\left( {l + 1} \right)} \right]} } \hfill \cr { - {K_{1l}} + 2\left( {l + 1} \right){K_{0l}}.} \hfill \cr }$ (G.4)

To find the higher-order moments, consider the following polynomial Q_jl(t) of degree 2 j – 1, $Q_{j l} (t) = {[Φ_{0 l}^{iso} (r (t))]}^{- 1} {\tilde{p}}_{j l} (𝒟 - 2 i) Φ_{0 l}^{iso} (r (t)),$ ${Q_{jl}}\left( t \right) = {\left[ {\Phi _{0l}^{{\rm{iso}}}\left( {r\left( t \right)} \right)} \right]^{ - 1}}{\tilde p_{jl}}\left( {{\cal D} - 2{\rm{i}}} \right)\Phi _{0l}^{{\rm{iso}}}\left( {r\left( t \right)} \right),$ (G.5)

and note that r∂_r = –t(1 – t²)∂_t. Using the recurrence relation (89) for the auxiliary polynomials p̃_jl(s) we can therefore write Q_jl(t) recursively as $\begin{array}{l} Q_{0 l} (t) = 1, \\ Q_{1 l} (t) = \frac{i}{2} (1 + 2 l) (2 t - 1), \\ Q_{j l} (t) = [Q_{1 l} (t) - i t (1 - t^{2}) \partial_{t}] Q_{j - 1, l} (t) - {\tilde{β}}_{j - 1, l} Q_{j - 2, l} (t) . \end{array}$ $\matrix{ {{Q_{0l}}\left( t \right) = 1,} \hfill \cr {{Q_{1l}}\left( t \right) = {{\rm{i}} \over 2}\left( {1 + 2l} \right)\left( {2t - 1} \right),} \hfill \cr {{Q_{jl}}\left( t \right) = \left[ {{Q_{1l}}\left( t \right) - {\rm{i}}t\left( {1 - {t^2}} \right){\partial _t}} \right]{Q_{j - 1,l}}\left( t \right) - {{\tilde \beta }_{j - 1,l}}{Q_{j - 2,l}}\left( t \right).} \hfill \cr }$ (G.6)

Writing out the polynomial explicitly as $Q_{j l} (t) = \sum_{k = 0}^{2 j - 1} q_{j k l} t^{k}$ ${Q_{jl}}\left( t \right) = \sum\nolimits_{k = 0}^{2j - 1} {{q_{jkl}}{t^k}}$ , we have the following recurrence on the coefficients q_jkl, $\begin{array}{l} q_{j k l} = 0 when k < 0 or k > 2 j - 1, \\ q_{00 l} = 1, \\ q_{10 l} = \frac{- i}{2} (1 + 2 l), \\ q_{11 l} = i (1 + 2 l), \\ q_{j k l} = i [(1 + 2 l) q_{j - 1, k - 1, l} - (1 / 2 + l + k) q_{j - 1, k l} + (k - 2) q_{j - 1, k - 2, l}] - {\tilde{β}}_{j - 1} q_{j - 2, k l} . \end{array}$ $\matrix{ {{q_{jkl}} = 0\,{\rm{when}}\,k alt; 0\,{\rm{or}}\,k > 2j - 1,} \hfill \cr {{q_{00l}} = 1,} \hfill \cr {{q_{10l}} = {{ - {\rm{i}}} \over 2}\left( {1 + 2l} \right),} \hfill \cr {{q_{11l}} = {\rm{i}}\left( {1 + 2l} \right),} \hfill \cr {{q_{jkl}} = {\rm{i}}\left[ {\left( {1 + 2l} \right){q_{j - 1,k - 1,l}} - \left( {{1 \mathord{\left/ {\vphantom {1 {2 + l + k}}} \right. \kern-\nulldelimiterspace} {2 + l + k}}} \right){q_{j - 1,kl}} + \left( {k - 2} \right){q_{j - 1,k - 2,l}}} \right] - {{\tilde \beta }_{j - 1}}{q_{j - 2,kl}}.} \hfill \cr }$ (G.7)

Now insert this into the integral for μ̃_jl, finally giving us the modified moments ${\tilde{μ}}_{j l} = \sum_{k = 0}^{2 j - 1} q_{j k l} (K_{k + 1, l} + 2 (l + 1) K_{k l}) .$ ${\tilde \mu _{jl}} = \sum\limits_{k = 0}^{2j - 1} {{q_{jkl}}\left( {{K_{k + 1,l}} + 2\left( {l + 1} \right){K_{kl}}} \right).}$ (G.8)

Appendix G.1 Specific exact coefficient expressions

Plugging the expression for the modified moments into the modified Chebyshev method described in Sect. 5.2, we can get exact expressions for the recurrence coefficients β_nl. Setting $b_{l} = B_{1 / 2} (3 l + \frac{5}{2}, - l - \frac{1}{2})$ ${b_l} = {B_{1/2}}\left( {3l + {5 \over 2}, - l - {1 \over 2}} \right)$ (for B_z(a, b) an incomplete Beta function), the first few are $\begin{array}{l} β_{0 l} = \frac{{(2 l + 1)}^{2} (2 l)! Γ (l + \frac{1}{2}) (4^{- l} - 2 (2 l + 1) b_{l})}{24 π Γ (3 l + \frac{3}{2})} \\ β_{1 l} = \frac{(2 l + 1) (2 l + 3) (4 l - 2^{2 l + 1} (2 l + 1) (8 l + 5) b_{l} + 1)}{4^{l + 2} (2 l + 1) b_{l} - 8} \\ β_{2 l} = \frac{4 l (3 l (20 l + 39) + 67) + 4^{l + 1} {(2 l + 1)}^{2} b_{l} (4 l (l (8 l - 23) - 58) - 4^{l} (2 l - 1) (2 l (16 l (4 l + 9) + 95) + 35) b_{l} - 93) + 55}{16 (4 l + 4^{l + 1} {(2 l + 1)}^{2} b_{l} (4^{l} (8 l + 5) b_{l} - 3) + 1)} \end{array}$ $\matrix{ {{\beta _{0l}} = {{{{\left( {2l + 1} \right)}^2}\left( {2l} \right)!\Gamma \left( {l + {1 \over 2}} \right)\left( {{4^{ - l}} - 2\left( {2l + 1} \right){b_l}} \right)} \over {24\pi \Gamma \left( {3l + {3 \over 2}} \right)}}} \hfill \cr {{\beta _{1l}} = {{\left( {2l + 1} \right)\left( {2l + 3} \right)\left( {4l - {2^{2l + 1}}\left( {2l + 1} \right)\left( {8l + 5} \right){b_l} + 1} \right)} \over {{4^{l + 2}}\left( {2l + 1} \right){b_l} - 8}}} \hfill \cr {{\beta _{2l}} = {{4l\left( {3l\left( {20l + 39} \right) + 67} \right) + {4^{l + 1}}{{\left( {2l + 1} \right)}^2}{b_l}\left( {4l\left( {l\left( {8l - 23} \right) - 58} \right) - {4^l}\left( {2l - 1} \right)\left( {2l\left( {16l\left( {4l + 9} \right) + 95} \right) + 35} \right){b_l} - 93} \right) + 55} \over {16\left( {4l + {4^{l + 1}}{{\left( {2l + 1} \right)}^2}{b_l}\left( {{4^l}\left( {8l + 5} \right){b_l} - 3} \right) + 1} \right)}}} \hfill \cr }$

References

Alhaidari, A. D., Yamani, H. A., Heller, E. J., & Abdelmonem, M. S., 2008, The J-Matrix Method (Netherlands: Springer) [CrossRef] [Google Scholar]
Aoki, S., & Iye, M. 1978, PASJ, 30, 519 [Google Scholar]
Benet, L., & Sanders, D. P. 2019, J. Open Source Softw., 4, 1043 [NASA ADS] [CrossRef] [Google Scholar]
Binney, J., & Tremaine, S. 1987, Galactic Dynamics (Princeton, NJ, Princeton University Press), 747 [Google Scholar]
Carlson, B. C. 1961, J. Math. Phys., 2, 441 [CrossRef] [Google Scholar]
Clutton-Brock, M. 1972, Ap&SS, 16, 101 [NASA ADS] [CrossRef] [Google Scholar]
Clutton-Brock, M. 1973, Ap&SS, 23, 55 [NASA ADS] [CrossRef] [Google Scholar]
de Zeeuw, T. 1985, MNRAS, 216, 273 [NASA ADS] [CrossRef] [Google Scholar]
de Zeeuw, T., & Pfenniger, D. 1988, MNRAS, 235, 949 [NASA ADS] [CrossRef] [Google Scholar]
de Zeeuw, T., Peletier, R., & Franx, M. 1986, MNRAS, 221, 1001 [NASA ADS] [CrossRef] [Google Scholar]
Dombrowski, J. 1985, Pac. J. Math., 120, 47 [Google Scholar]
Earn, D. J. D. 1996, ApJ, 465, 91 [NASA ADS] [CrossRef] [Google Scholar]
Erkal, D., Deason, A. J., Belokurov, V., et al. 2021, MNRAS, 506, 2677 [NASA ADS] [CrossRef] [Google Scholar]
Fouvry, J.-B., & Prunet, S. 2022, MNRAS, 509, 2443 [NASA ADS] [Google Scholar]
Garavito-Camargo, N., Besla, G., Laporte, C. F. P., et al. 2021, ApJ, 919, 109 [NASA ADS] [CrossRef] [Google Scholar]
Gautschi, W. 1985, J. Comput. Appl. Math., 12–13, 61 [Google Scholar]
Granovskii, Y. I., & Zhedanov, A. S. 1986, Sov. Phys. J., 29, 387 [NASA ADS] [CrossRef] [Google Scholar]
Hamilton, C., Fouvry, J.-B., Binney, J., & Pichon, C. 2018, MNRAS, 481, 2041 [NASA ADS] [CrossRef] [Google Scholar]
Henon, M. 1959, Ann. Astrophys., 22, 126 [NASA ADS] [Google Scholar]
Hernquist, L., & Ostriker, J. P. 1992, ApJ, 386, 375 [Google Scholar]
Ismail, M. E., & Koelink, E. 2011, Adv. Appl. Math., 46, 379 [Google Scholar]
Kalnajs, A. J. 1971, ApJ, 166, 275 [CrossRef] [Google Scholar]
Kalnajs, A. J. 1976, ApJ, 205, 745 [CrossRef] [Google Scholar]
Koekoek, R., Lesky, P. A., & Swarttouw, R. F. 2010, Hypergeometric Orthogonal Polynomials and Their q-Analogues (Berlin, Heidelberg: Springer) [CrossRef] [Google Scholar]
Kuzmin, G. G. 1956, Publ. Tartu Astrofizica Observ., 33, 75 [NASA ADS] [Google Scholar]
Law, D. R., & Majewski, S. R. 2010, ApJ, 714, 229 [Google Scholar]
Lilley, E. J. 2020, PhD thesis, University of Cambridge [Google Scholar]
Lilley, E. J., Sanders, J. L., & Evans, N. W. 2018a, MNRAS, 478, 1281 [NASA ADS] [CrossRef] [Google Scholar]
Lilley, E. J., Sanders, J. L., Evans, N. W., & Erkal, D. 2018b, MNRAS, 476, 2092 [NASA ADS] [CrossRef] [Google Scholar]
Lowing, B., Jenkins, A., Eke, V., & Frenk, C. 2011, MNRAS, 416, 2697 [NASA ADS] [CrossRef] [Google Scholar]
Lynden-Bell, D. 1989, MNRAS, 237, 1099 [NASA ADS] [CrossRef] [Google Scholar]
Marín, J., & Seubert, S. M. 2006, J. Math. Anal. Applic., 320, 599 [CrossRef] [Google Scholar]
Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]
Olver, F. W. J., Daalhuis, A. B. O., Lozier, D. W., et al. 2022, NIST Digital Library of Mathematical Functions, Release 1.1.7 of 2022-10-15 [Google Scholar]
Petersen, M. S., & Peñarrubia, J. 2021, Nat. Astron., 5, 251 [NASA ADS] [CrossRef] [Google Scholar]
Petersen, M. S., Peñarrubia, J., & Jones, E. 2022a, MNRAS, 514, 1266 [CrossRef] [Google Scholar]
Petersen, M. S., Weinberg, M. D., & Katz, N. 2022b, MNRAS, 510, 6201 [NASA ADS] [CrossRef] [Google Scholar]
Plummer, H. C. 1911, MNRAS, 71, 460 [Google Scholar]
Polyachenko, V. L., & Shukhman, I. G. 1981, Soviet Ast., 25, 533 [NASA ADS] [Google Scholar]
Qian, E. E. 1993, MNRAS, 263, 394 [CrossRef] [Google Scholar]
Rahmati, A., & Jalali, M. A. 2009, MNRAS, 393, 1459 [NASA ADS] [CrossRef] [Google Scholar]
Robijn, F. H. A., & Earn, D. J. D. 1996, MNRAS, 282, 1129 [NASA ADS] [CrossRef] [Google Scholar]
Saha, P. 1991, MNRAS, 248, 494 [NASA ADS] [CrossRef] [Google Scholar]
Saha, P. 1993, MNRAS, 262, 1062 [CrossRef] [Google Scholar]
Sanders, J. L., Lilley, E. J., Vasiliev, E., Evans, N. W., & Erkal, D. 2020, MNRAS, 499, 4793 [CrossRef] [Google Scholar]
Toomre, A. 1963, ApJ, 138, 385 [NASA ADS] [CrossRef] [Google Scholar]
Tremaine, S. D. 1976, MNRAS, 175, 557 [NASA ADS] [CrossRef] [Google Scholar]
Vera-Ciro, C., & Helmi, A. 2013, ApJ, 773, L4 [Google Scholar]
Weinberg, M. D. 1999, AJ, 117, 629 [NASA ADS] [CrossRef] [Google Scholar]
Zhao, H. 1996, MNRAS, 278, 488 [Google Scholar]

¹

Of analytical form and infinite extent.

²

Throughout this work we will use ‘thin disc’ to refer to an idealised infinitesimally thin disc; basis sets for discs with nonzero thickness are out of scope, albeit an important future direction of research.

³

Many popular mass laws have infinite mass but finite self-energy, for example the NFW model (Navarro et al. 1997).

⁴

Explicit length units can be reintroduced by writing Φ_nl(r/r_s) and ρ_nl(r/r_s) and then adding the correct number of powers of r_s in whatever expression they are used. Note that such r_s-dependency cancels out in the operators 𝒟 and 𝒜.

⁵

For models with infinite enclosed mass the potential can contain an additional factor of log r as r → ∞.

⁶

Other choices may be preferable from an analytical point of view, for example Φ_0l = r^lΦ(r^2l+1) or Φ_0l = r^lΦ(r)/(1 + r)^2l, the latter suggested by Saha(1993).

⁷

See for example Kalnajs (1971, Eq. (14)), set u = log R and relabel θ → φ and α → −s. The RHS there is then proportional to our Σ_sm(R), apart from a factor of R^−3/2.

⁸

The operators 𝒟 and 𝒜 do in fact arise as the generators of symmetries of the self-energy inner product; see the discussion in Sect. 6.

⁹

‘Monic’ meaning that the term of highest-degree has coefficient 1. Note that in Sect. 4 the polynomials are not necessarily in monic form.

¹⁰

This corrects a typo in Clutton-Brock (1973).

¹¹

See Lilley (2020, Ch. 6) for a detailed derivation.

¹²

Unfortunately generalising the exponent to $e^{- R^{1 / α}}$ ${{\rm{e}}^{ - {R^{1/\alpha }}}}$ gives no similarly simple result.

¹³

Some MATHEMATICA code demonstrating this is included in the repository at https://github.com/ejlilley/basis

¹⁴

This is in distinction to Chebyshev’s original algorithm, which uses the raw moments $μ_{k} = \int s^{k} ω (s)$ ${\mu _k} = \int {{s^k}} \omega \left( s \right)$ ds.

¹⁵

That is, a function that accepts and returns a floating-point value.

¹⁶

For example, the ‘defective’ NFW basis set constructed in Lilley (2020, Ch. 2), which does not converge with the addition of higherorder angular terms. See also Saha (1991), who suggests that “glitches and generally anomalous behaviour” in the recovery of modes may be related to the form of the chosen basis functions – this should be systematically investigated.

¹⁷

https://github.com/ejlilley/basis

¹⁸

The azimuthal index m is set to zero as it does not affect the final result.

¹⁹

Multiplication by i in the definition of 𝒟 makes it a self-adjoint rather than a skew-symmetric operator.

²⁰

The standard construction does not use the φ- and ϑ-generators directly, as they do not commute; instead the operators representing the total angular momentum L = ||L|| and the z-component L_z are used. The spherical harmonics are then the joint eigenfunctions of L and L_z.

²¹

Limited work on perturbation analysis has been done for the fully ellipsoidal case, including for example Tremaine (1976). There is also some existing work on (non-orthogonal) spheroidal basis sets (Earn 1996; Robijn & Earn 1996).

²²

For example, setting l = m = 0 in any spherical basis set considered in this paper.

²³

Contrast the second-order differential equations obeyed by the polynomials (Gegenbauer etc.) appearing in the expressions of many of the known basis sets.

²⁴

In the case of the Laplacian this is a multipole-like expansion, ${‖ r - r^{'} ‖}^{- 1} = \sum_{n l m} Φ_{n l m} (r) \bar{Φ_{n l m} (r^{'})}$ ${\left\| {{\bf{r}} - {\bf{r'}}} \right\|^{ - 1}} = \sum\limits_{nlm} {{{\rm{\Phi }}_{nlm}}} \left( {\bf{r}} \right)\overline{{\bf{\Phi }}_{nlm}}\left( {{\bf{r'}}} \right)$ .

²⁵

Kalnajs defines a similar quantity K(α, m), related to our K_m(s) by K_m(is) = 1/ (2K(s, m)).

²⁶

Under this parameter restriction these polynomials are sometimes referred to as the continuous symmetric Hahn polynomials.

²⁷

The choice of auxiliary polynomial does affect the values of the modified moments μ̃_jl but in principle it does not affect the final value of the recurrence coefficients β_jl, other than indirectly via its effect on the numerical stability of the algorithm.

All Figures

	Fig. 1 Radial parts of the isochrone potential basis $Φ_{n l}^{iso} (r)$ ${\rm{\Phi }}_{nl}^{{\rm{iso}}}\left( r \right)$ for n = 0, 1, 2, 3 and l = 0 (top) and l = 1 (bottom). The potentials have been unit-normalised.
In the text

	Fig. 2 Radial parts of the exponential disc basis $ψ_{n m}^{\exp} (r)$ $\psi _{nm}^{\exp }\left( r \right)$ for n = 0, 1, 2, 3 and m = 0 (top) and m = 1 (bottom). The potentials have been unit-normalised.
In the text

	Fig. 3 Recovery of the unstable radial mode of the isotropic isochrone model. The mode is recovered well despite the low (n_max = 6) number of basis functions used.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Alhaidari, A. D., Yamani, H. A., Heller, E. J., & Abdelmonem, M. S., 2008, The J-Matrix Method (Netherlands: Springer) [CrossRef] [Google Scholar]

[2] Aoki, S., & Iye, M. 1978, PASJ, 30, 519 [Google Scholar]

[3] Benet, L., & Sanders, D. P. 2019, J. Open Source Softw., 4, 1043 [NASA ADS] [CrossRef] [Google Scholar]

[4] Binney, J., & Tremaine, S. 1987, Galactic Dynamics (Princeton, NJ, Princeton University Press), 747 [Google Scholar]

[5] Carlson, B. C. 1961, J. Math. Phys., 2, 441 [CrossRef] [Google Scholar]

[6] Clutton-Brock, M. 1972, Ap&SS, 16, 101 [NASA ADS] [CrossRef] [Google Scholar]

[7] Clutton-Brock, M. 1973, Ap&SS, 23, 55 [NASA ADS] [CrossRef] [Google Scholar]

[8] de Zeeuw, T. 1985, MNRAS, 216, 273 [NASA ADS] [CrossRef] [Google Scholar]

[9] de Zeeuw, T., & Pfenniger, D. 1988, MNRAS, 235, 949 [NASA ADS] [CrossRef] [Google Scholar]

[10] de Zeeuw, T., Peletier, R., & Franx, M. 1986, MNRAS, 221, 1001 [NASA ADS] [CrossRef] [Google Scholar]

[11] Dombrowski, J. 1985, Pac. J. Math., 120, 47 [Google Scholar]

[12] Earn, D. J. D. 1996, ApJ, 465, 91 [NASA ADS] [CrossRef] [Google Scholar]

[13] Erkal, D., Deason, A. J., Belokurov, V., et al. 2021, MNRAS, 506, 2677 [NASA ADS] [CrossRef] [Google Scholar]

[14] Fouvry, J.-B., & Prunet, S. 2022, MNRAS, 509, 2443 [NASA ADS] [Google Scholar]

[15] Garavito-Camargo, N., Besla, G., Laporte, C. F. P., et al. 2021, ApJ, 919, 109 [NASA ADS] [CrossRef] [Google Scholar]

[16] Gautschi, W. 1985, J. Comput. Appl. Math., 12–13, 61 [Google Scholar]

[17] Granovskii, Y. I., & Zhedanov, A. S. 1986, Sov. Phys. J., 29, 387 [NASA ADS] [CrossRef] [Google Scholar]

[18] Hamilton, C., Fouvry, J.-B., Binney, J., & Pichon, C. 2018, MNRAS, 481, 2041 [NASA ADS] [CrossRef] [Google Scholar]

[19] Henon, M. 1959, Ann. Astrophys., 22, 126 [NASA ADS] [Google Scholar]

[20] Hernquist, L., & Ostriker, J. P. 1992, ApJ, 386, 375 [Google Scholar]

[21] Ismail, M. E., & Koelink, E. 2011, Adv. Appl. Math., 46, 379 [Google Scholar]

[22] Kalnajs, A. J. 1971, ApJ, 166, 275 [CrossRef] [Google Scholar]

[23] Kalnajs, A. J. 1976, ApJ, 205, 745 [CrossRef] [Google Scholar]

[24] Koekoek, R., Lesky, P. A., & Swarttouw, R. F. 2010, Hypergeometric Orthogonal Polynomials and Their q-Analogues (Berlin, Heidelberg: Springer) [CrossRef] [Google Scholar]

[25] Kuzmin, G. G. 1956, Publ. Tartu Astrofizica Observ., 33, 75 [NASA ADS] [Google Scholar]

[26] Law, D. R., & Majewski, S. R. 2010, ApJ, 714, 229 [Google Scholar]

[27] Lilley, E. J. 2020, PhD thesis, University of Cambridge [Google Scholar]

[28] Lilley, E. J., Sanders, J. L., & Evans, N. W. 2018a, MNRAS, 478, 1281 [NASA ADS] [CrossRef] [Google Scholar]

[29] Lilley, E. J., Sanders, J. L., Evans, N. W., & Erkal, D. 2018b, MNRAS, 476, 2092 [NASA ADS] [CrossRef] [Google Scholar]

[30] Lowing, B., Jenkins, A., Eke, V., & Frenk, C. 2011, MNRAS, 416, 2697 [NASA ADS] [CrossRef] [Google Scholar]

[31] Lynden-Bell, D. 1989, MNRAS, 237, 1099 [NASA ADS] [CrossRef] [Google Scholar]

[32] Marín, J., & Seubert, S. M. 2006, J. Math. Anal. Applic., 320, 599 [CrossRef] [Google Scholar]

[33] Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]

[34] Olver, F. W. J., Daalhuis, A. B. O., Lozier, D. W., et al. 2022, NIST Digital Library of Mathematical Functions, Release 1.1.7 of 2022-10-15 [Google Scholar]

[35] Petersen, M. S., & Peñarrubia, J. 2021, Nat. Astron., 5, 251 [NASA ADS] [CrossRef] [Google Scholar]

[36] Petersen, M. S., Peñarrubia, J., & Jones, E. 2022a, MNRAS, 514, 1266 [CrossRef] [Google Scholar]

[37] Petersen, M. S., Weinberg, M. D., & Katz, N. 2022b, MNRAS, 510, 6201 [NASA ADS] [CrossRef] [Google Scholar]

[38] Plummer, H. C. 1911, MNRAS, 71, 460 [Google Scholar]

[39] Polyachenko, V. L., & Shukhman, I. G. 1981, Soviet Ast., 25, 533 [NASA ADS] [Google Scholar]

[40] Qian, E. E. 1993, MNRAS, 263, 394 [CrossRef] [Google Scholar]

[41] Rahmati, A., & Jalali, M. A. 2009, MNRAS, 393, 1459 [NASA ADS] [CrossRef] [Google Scholar]

[42] Robijn, F. H. A., & Earn, D. J. D. 1996, MNRAS, 282, 1129 [NASA ADS] [CrossRef] [Google Scholar]

[43] Saha, P. 1991, MNRAS, 248, 494 [NASA ADS] [CrossRef] [Google Scholar]

[44] Saha, P. 1993, MNRAS, 262, 1062 [CrossRef] [Google Scholar]

[45] Sanders, J. L., Lilley, E. J., Vasiliev, E., Evans, N. W., & Erkal, D. 2020, MNRAS, 499, 4793 [CrossRef] [Google Scholar]

[46] Toomre, A. 1963, ApJ, 138, 385 [NASA ADS] [CrossRef] [Google Scholar]

[47] Tremaine, S. D. 1976, MNRAS, 175, 557 [NASA ADS] [CrossRef] [Google Scholar]

[48] Vera-Ciro, C., & Helmi, A. 2013, ApJ, 773, L4 [Google Scholar]

[49] Weinberg, M. D. 1999, AJ, 117, 629 [NASA ADS] [CrossRef] [Google Scholar]

[50] Zhao, H. 1996, MNRAS, 278, 488 [Google Scholar]

A general basis set algorithm for galactic haloes and discs

1 Introduction

2 Description of algorithm

3 Theoretical background

3.1 Functional calculus of 𝒟 and the Fourier–Mellin transform

3.2 Tridiagonality and polynomials

3.2.1 Polynomials from tridiagonality

3.2.2 Orthogonal polynomials

3.2.3 Disc case

3.3 Completeness

4 Application to known basis sets

4.1 Spherical case

4.1.1 Clutton-Brock’s Plummer basis set

4.1.2 The double power law basis sets

4.1.3 The cuspy-exponential basis sets

4.2 Thin disc case

4.2.1 Clutton-Brock’s Kuzmin-Toomre basis set

4.2.2 Qian’s k-basis sets

4.2.3 Qian’s Gaussian basis set

4.2.4 Exponential disc

5 Numerical implementation

5.1 Discretised Stieltjes procedure

5.2 Modified Chebyshev algorithm

5.3 Repeated differentiation

5.4 Unstable modes of a spherical system

6 Discussion and conclusions

Acknowledgements

Appendix A Self-adjointness of 𝒟

Appendix B Commutator of 𝒟 and ∇2

Appendix C The Fourier-Mellin transform

Appendix C.1 Spherical case

Appendix C.2 Disc case

Appendix D Orthogonality relation

Appendix D.1 Spherical case

Appendix D.2 Thin disc case

Appendix E Classical polynomials

Appendix E.1 Continuous Hahn

Appendix E.2 Meixner-Pollaczek

Appendix F Exponential disc potential

Appendix G Exact moments for the isochrone

Appendix G.1 Specific exact coefficient expressions

References

All Figures

Appendix B Commutator of 𝒟 and $\nabla^{2}$ ${\nabla ^2}$