Broyden’s method for the solution of the multilevel non-LTE radiation transfer problem

S. Nicolas; L. Bigarré; F. Paletou

doi:10.1051/0004-6361/201015923

Home

All issues

Volume 527 (March 2011)

A&A, 527 (2011) A1

Full HTML

Free Access

Issue		A&A Volume 527, March 2011


Article Number		A1
Number of page(s)		4
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201015923
Published online		18 January 2011

A&A 527, A1 (2011)

Research Note

Broyden’s method for the solution of the multilevel non-LTE radiation transfer problem

S. Nicolas, L. Bigarré and F. Paletou

Institut de Recherche en Astrophysique et Planétologie, Université de Toulouse, CNRS, 14 av. E. Belin, 31400 Toulouse, France
e-mail: fpaletou@ast.obs-mip.fr

Received: 13 October 2010
Accepted: 12 December 2010

Abstract

This study concerns the fast and accurate solution of multilevel non-LTE radiation transfer problems. We propose and evaluate an alternative iterative scheme to the classical MALI method. Our study is instead based on the application of Broyden’s method for the solution of nonlinear systems of equations. Comparative tests, in 1D plane-parallel geometry, of the popular MALI method and our alternative method are discussed. The Broyden method is typically 4.5 times faster than MALI. This makes it also fairly competitive with the Gauss-Seidel and Successive Over-Relaxation methods developed after MALI.

Key words: radiative transfer / methods: numerical

© ESO, 2011

1. Introduction

The solution of the non-LTE multilevel-atom radiative transfer problem is a classical one in astrophysics. Indeed, consistently with the departure of the source functions from Planck functions, the assumption of non-LTE implies that the population density of the atomic or molecular levels considered departs from what can be derived at LTE in a straightforward manner with Saha and Boltzmann relations (see e.g., Mihalas 1978).

In the non-LTE case, one has on the contrary to solve simultaneously and self-consistently for a set of N_T equations of radiative transfer together with N_L equations of statistical equilibrium (hereafter ESE) that describe the detailed balanced of excitation and de-excitation processes between every atomic or molecular levels. Because absorption and stimulated emission radiative rates explicitely depend on the radiation field, which itself depends on the level populations, this problem is intrinsically a search for the solution of coupled nonlinear equations.

Since the beginning of numerical radiative transfer in the late 60’s, the two most popular methods used for tackling this problem have been the complete linearization method of Auer & Mihalas (1969) and the accelerated Λ-iteration-based scheme called MALI (Rybicki & Hummer 1991). Despite their apparent differences, they however have in common that basically they deal with linearized equations. An interesting comparative study of these two approaches has been made by Socas-Navarro & Trujillo Bueno (1997).

In this study, we investigate the use of a quasi-Newton numerical method for the solution of the nonlinear ESE. Our choice is Broyden’s method (1965), whose elements will be presented in Sect. 2. To the best of our knowledge, Koesterke et al. (1992) were the first to bring this numerical scheme into the field of radiation transfer. Their study was presented in the context of the modelling of spherically expanding atmospheres of hot and massive Wolf-Rayet stars. Broyden’s method was more recently invoked in the context of the coupled-escape probability method (Elitzur & Asensio Ramos 2006).

Besides the required algebra and the mention of caveats with regard to the implementation of the method, it remains difficult however to figure out from Koesterke et al. (1992) the actual performance of this approach. The authors also briefly compared their method with another one, but they mentioned a significant speed-up, which is provided by Broyden algorithm for large N_L atomic models. In particular, it is a pity that no comparison with the MALI method was made yet, because it is contemporary with the publication of Rybicki & Hummer (1991). This evaluation is the aim of the present work.

2. The numerical scheme

For an N_L-level atomic model, the ESE will are generally written as a set of elementary equations: $\begin{matrix} \sum_{j < i} [n_{i} A_{ij} - (n_{j} B_{ji} - n_{i} B_{ij}) J̅ ij] \\ - \sum_{j > i} [n_{j} A_{ji} - (n_{i} B_{ij} - n_{j} B_{ji}) J̅ ij] \\ + \sum_{j \neq i} (n_{i} C_{ij} - n_{j} C_{ji}) = 0, \end{matrix}$ $\begin{eqnarray} \sum\limits_{j<i} [n_i A_{ij}-(n_j B_{ji}-n_i B_{ij}) \bar{J}_{ij}]\nonumber\\ -\sum\limits_{j>i} [n_j A_{ji}-(n_i B_{ij}-n_j B_{ji})\bar{J}_{ij}]\nonumber\\ +\sum\limits_{j\neq i} (n_i C_{ij}-n_j C_{ji})=0 \, , \label{eq:ese} \end{eqnarray}$ (1)where the A_ij and B_ij stand for the spontaneous emission respectively, and the absorption and stimulated emission rates, n_i represents the population density for each energy level, and $\bar{J}_{ij}$ is the scattering integral for each radiatively allowed transition we will consider. Besides the radiative processes, the C_ij are collisional excitation and de-excitation rates. In general, these rates depend on the electronic density, so that if the latter is not known a priori, terms like n_iC_ij are nonlinear in the population densities. Below we will consider only cases for which the collisional rates are known a priori.

The scattering integral entering the ESE is formally written as $J̅ ij = Λ_{ij} [S_{ij}],$ $\begin{equation} \bar{J}_{ij} = \Lambda_{ij} [S_{ij}], \end{equation}$ (2)where, assuming complete redistribution in frequency, the source function is defined as $S_{ij} = \frac{n_{i} A_{ij}}{n_{j} B_{ji} - n_{i} B_{ij}} \cdot$ $\begin{equation} {S}_{ij} = { {n_i A_{ij}} \over {n_j B_{ji}-n_i B_{ij}}} \cdot \end{equation}$ (3)A large system of Eqs. (1) is homogeneous unless one of the equations is replaced by a constraint equation like, for instance, a conservation equation of the form $\sum_{j = 1}^{N_{L}} n_{j} = n_{t} .$ $\begin{equation} \sum\limits_{j=1}^{N_{\rm L}}n_j = n_t . \label{eq:cons} \end{equation}$ (4)

2.1. Broyden’s algorithm

The system of Eqs. (1) and (4) can be reformulated by defining a function F that acts on the set of $n^{(τ)} = (n_{1}^{(τ)}, ..., n_{i}^{(τ)}, ..., n_{N_{L}}^{(τ)})$ $n^{(\tau)}=(n_1^{(\tau)},...,n_i^{(\tau)},...,n_{N_L}^{(\tau)})$ , where τ is a discrete depth along the opacity scale used to sample our slab or atmosphere. F is defined as $\begin{matrix} F_{i}^{(τ)} = & \sum_{j < i} & [n_{i}^{(τ)} A_{ij} - (n_{j}^{(τ)} B_{ji} - n_{i}^{(τ)} B_{ij}) J̅ ij] \\ - \sum_{j > i} & [n_{j}^{(τ)} A_{ji} - (n_{i}^{(τ)} B_{ij} - n_{j}^{(τ)} B_{ji}) J̅ ij] \\ + \sum_{j \neq i} \end{matrix}$ $\begin{eqnarray} \lefteqn \,F_i^{(\tau)}= & \sum\limits_{j<i} & [n_i^{(\tau)}A_{ij}-(n_j^{(\tau)}B_{ji}-n_i^{(\tau)}B_{ij})\bar{J}_{ij}]\nonumber\\ & -\sum\limits_{j>i} & [n_j^{(\tau)}A_{ji}-(n_i^{(\tau)}B_{ij}-n_j^{(\tau)}B_{ji})\bar{J}_{ij}]\nonumber\\ & +\sum\limits_{j\neq i} & (n_i^{(\tau)}C_{ij}-n_j^{(\tau)}C_{ji}) \label{eq:F1} \end{eqnarray}$ (5)for i ≠ N_L and, if i = N_L $F_{i}^{(τ)} = \sum_{j = 1}^{N_{L}} n_{j}^{(τ)} - n_{t} .$ $\begin{equation} F_i^{(\tau)}=\sum\limits_{j=1}^{N_L}n_j^{(\tau)}-n_t . \label{eq:F2} \end{equation}$ (6)Computations of F require repeated evaluations of the scattering integrals $\bar{J}_{ij}$ – Eq. (2) – which we perform here using the well-know short-characteristics method with monotonic parabolic interpolation introduced by Auer & Paletou (1994).

In that frame, and using the Sherman-Morrison formula (see e.g., Press et al. 1992), which provides an analytical formula for the direct computation of the inverse of the Broyden matrix, our algorithm consists of the following steps. We choose an initial vector n₀, at every τ depth, and an initial Broyden matrix B₀; then we compute $B_{0}^{-1}$ $B_{0}^{-1}$ . The iterative scheme is $δ n_{k} = - B_{k}^{-1} F (n_{k}),$ $\begin{equation} \delta n_k=-B_k^{-1}F(n_k) , \end{equation}$ (7)then update $n_{(k + 1)} = n_{k} + δ n_{k},$ $\begin{equation} n_{(k+1)}=n_k+ \delta n_k , \end{equation}$ (8)then compute $δ F_{k} = F (n_{(k + 1)}) - F (n_{k}),$ $\begin{equation} \delta F_k=F(n_{(k+1)})-F(n_k) , \end{equation}$ (9)and finally, update $B_{(k + 1)}^{-1} = B_{k}^{-1} + \frac{(δ n_{k} - B_{k}^{-1} δ F_{k}) δ n_{k}^{T} B_{k}^{-1}}{(δ n_{k}^{T} B_{k}^{-1}) δ F_{k}},$ $\begin{equation} B_{(k+1)}^{-1}=B_k^{-1}+\frac{(\delta n_k-B_k^{-1} \delta F_k)\delta n_k^T B_k^{-1}}{(\delta n_k^TB_k^{-1})\delta F_k} , \end{equation}$ (10)while ∥F∥ ₂ > ε. In practice, we chose ε = 10^-2, which guarantees that we reached a fully converged state (see also Fig. 1).

Fig.1

Typical relative error, R_k (thin), convergence error, C_k (thick) and F_k (dotted) for MALI (dash) and Broyden (full) vs. the number of iterations for both schemes.

2.2. Initialization

The proper initialization of the Broyden scheme is a critical issue. We employed the following method, which was tested as suitable both from the standpoint of an adequate start and from the one of an acceptable computing time.

Before starting the iterative scheme, we assume LTE populations for our model-atom. In this way, the grand function F defined by Eqs. (5) and (6) can be fully evaluated. Then, we compute an initial Jacobian, B₀, using the finite difference scheme fdjac described by Press et al. (1992).

In the figure below with regard to the timing properties of Broyden’s method vs. MALI, we will always include the specific time necessary for the evaluation of B₀.

3. Comparison of Broyden vs. MALI

We adopted the popular “flavour” of the MALI method with a diagonal approximate operator as described by Rybicki & Hummer (1991) and without acceleration of convergence schemes.

In order to compare the properties of the Broyden scheme with those of MALI, we adopted a 1D semi-infinite plane parallel slab model of τ_max = 10¹⁰, discretised in a various number of points per decade in optical depth, using also a 3 to 10 energy level H-atom, inspired by the classic benchmark proposed by Avrett (1968; see also Léger & Paletou 2007). As in the latter, the slab temperature was fixed at 5000 K and the collisional rates set at 10⁵ s^-1. We also adopted the definitions initially proposed by Auer et al. (1994) for the relative error, from one iteration (k) to another:

$R_{k} = {| | \frac{n_{(k)} - n_{(k - 1)}}{n_{(k)}} | |}_{\infty},$ $\begin{equation} R_k = {\left| \left| { {n_{(k)} - n_{(k-1)}} \over {n_{(k)}} } \right| \right|_{\infty}} \, , \end{equation}$ (11)

and for the “convergence error”:

$C_{k} = {| | \frac{n_{(k)} - n_{(\infty)}}{n_{(\infty)}} | |}_{\infty},$ $\begin{equation} C_k = {\left| \left| { {n_{(k)} - n_{(\infty)}} \over {n_{(\infty)}} } \right| \right|_{\infty}} , \end{equation}$ (12)

where n_(∞) is the “fully converged” solution obtained for a given method and model after a substantial number of iterations. We also introduce the quantity

$F_{k} = ∥ F_{(k)} ∥_{2},$ $\begin{equation} F_k = {\| F_{(k)} \|_{2}} , \end{equation}$ (13)

i.e., the Euclidian norm of F, the function defined by Eqs. (5) and (6). Note that $F_{k}^{(M)}$ $F_k^{(M)}$ for the MALI method is defined by a modified Eq. (5) following the preconditioning strategy proposed by Rybicky & Hummer (1991).

Fig.2

Convergence error, C_k, for MALI (dashed) and Broyden (full) are displayed vs. the computing time for a 5-level H atom and five (thin) and height (thick) points per decade in optical depth.

3.1. Convergence

In Fig. 1 we display the rates of convergence of the Broyden and MALI methods. The convergence error C_k for each method was computed with population densities obtained once F_k < 10^-2 in both cases. The case used here is a 5-level H atom with a 1D slab discretized by a 5 points per decade grid in optical depth. It is worth noting that to reach F_k < 10^-2 and a well-converged solution, one should iterate until one reaches R_k values as small as 10^-10 typically. In terms of the number of iterations, Broyden typically beats MALI by more than one order of magnitude. However, the quite distinct nature of each method makes this comparison incomplete. Below we perform this analysis but we will compare the respective computing times.

3.2. Sensitivity to the spatial (optical depth) refinement

In Fig. 2 we turn to an analysis of the respective timing properties of Broyden and MALI. We show that Broyden, again, always beats MALI by a typical factor of the order of about 4–5 in time. This is also the case when the spatial grid refinement is increased from 5 to 8 points per decade, for instance. It is important to note that timings given for Broyden include the computation of the initial matrix B₀. This is why the rates of convergence displayed for the Broyden method do not start at t = 0 in Fig. 2.

3.3. Sensitivity to the number of transitions

A next important point to investigate regards the advantage of Broyden against MALI when an increasing number of atomic transitions is considered. Again, as demonstrated in Fig. 3, the full Broyden iterative process is always significantly faster than MALI. In general, the gain in total computing time with the Broyden method is of the order of 4–5. This is less than the the gain of the order of 8 already reported by Koesterke et al. (1992), although their method of reference was presumably different from MALI.

Fig.3

Respective computing times vs. number of levels of the H-atom model, for MALI (dashed) and Broyden (full). The dash-dotted curve corresponds to the time required for the evaluation of the initial Jacobian.

3.4. Discussion

We are aware that the MALI method can be accelerated by specific schemes (see e.g., Auer 1991). But the most significant improvements in the field of iterative methods for the non-LTE radiative transfer problem were introducted by the Gauss-Seidel (GS) and successive over-relaxation (SOR) methods (Trujillo Bueno & Fabiani Bendicho 1995). It was already shown, for instance, that SOR always beats both Jacobi (i.e., accelerated Λ-iteration with the diagonal of the full Λ operator as an approximate operator) and GS, even when Ng acceleration of convergence is applied.

Beyond the fact that Broyden is significantly faster than MALI, we can also add that Broyden is as competitive as the SOR method, according to Paletou & Léger (2007; see their Table 1 where comparable timing and the corresponding iteration numbers were given for MALI, as we used in the present study, GS and SOR).

The Broyden method is also potentially more advantageous than MALI and GS/SOR because of its intrinsic capability to deal with the self-consistent evaluation of the electron density in a multilevel non-LTE problem, if necessary – a problem for which MALI additionally needs a Newton-Raphson scheme, as proposed by Heinzel (1995) and Paletou (1995).

Another important point is that, as indicated in our Fig. 3, a great deal of time of our Broyden code is spent on the computation of the initial Jacobian, a task which can be performed with great advantage using parallel computing. The inner structure of the fdjac routine indeed permits parallelization with a high scalability.

As a final comment, it is also important to consider that Broyden’s method can be easily implemented to already existing codes without the need of modifying the formal solver, unlike the GS/SOR methods.

4. Conclusion

We propose an alternative method for the solution of the non-LTE multilevel radiation transfer problem. It is based on Broyden’s method for the solution of nonlinear systems of equations. The method is easy to implement and is about of factor of 4.5 times faster than the well-known MALI method. Another advantage is that it does not require any modification of usual formal solvers, as is the case for GS-SOR methods developed after MALI. It is also potentially very well-suited for parallel computing. Further tests will include the self-consistent treatment of the ionization balance, which is usually treated together with MALI with a Newton-Raphson scheme. In a next step, we will consider more demanding models such as H₂O, for instance.

References

Auer, L. H. 1991, in Stellar Atmospheres: Beyond Classical Models, NATO ASI Series (Dordrecht: Reidel) [Google Scholar]
Auer, L. H., & Mihalas, D. 1969, ApJ, 158, 641 [NASA ADS] [CrossRef] [Google Scholar]
Auer, L. H., & Paletou, F. 1994, A&A, 285, 675 [NASA ADS] [Google Scholar]
Auer, L. H., Fabiani Bendicho, P., & Trujillo Bueno, J. 1994, A&A, 292, 599 [NASA ADS] [Google Scholar]
Broyden, C. G. 1965, Math. Comp., 19, 577 [Google Scholar]
Elitzur, M., & Asensio Ramos, A. 2006, MNRAS, 365, 779 [Google Scholar]
Heinzel, P. 1995, A&A, 299, 563 [NASA ADS] [Google Scholar]
Koesterke, L., Hamman, W.-R., & Kosmol, P. 1992, A&A, 255, 490 [NASA ADS] [Google Scholar]
Mihalas, D. 1978, Stellar Atmospheres (San Francisco: Freeman) [Google Scholar]
Paletou, F. 1995, A&A, 302, 587 [NASA ADS] [Google Scholar]
Paletou, F., & Léger, L. 2007, JQSRT, 103, 57 [Google Scholar]
Press, W. H., Teukolsky, S. A., Vetterling, W. T., et al. 1992, Numerical recipes (Cambridge: University Press) [Google Scholar]
Rybicki, G. B., & Hummer, D. G. 1991, A&A, 245, 171 [NASA ADS] [Google Scholar]
Socas-Navarro, H., & Trujillo Bueno, J. 1997, ApJ, 490, 383 [NASA ADS] [CrossRef] [Google Scholar]
Trujillo Bueno, J., & Fabiani Bendicho, P. 1995, ApJ, 455, 646 [NASA ADS] [CrossRef] [Google Scholar]

All Figures

	Fig.1 Typical relative error, R_k (thin), convergence error, C_k (thick) and F_k (dotted) for MALI (dash) and Broyden (full) vs. the number of iterations for both schemes.
In the text

	Fig.2 Convergence error, C_k, for MALI (dashed) and Broyden (full) are displayed vs. the computing time for a 5-level H atom and five (thin) and height (thick) points per decade in optical depth.
In the text

	Fig.3 Respective computing times vs. number of levels of the H-atom model, for MALI (dashed) and Broyden (full). The dash-dotted curve corresponds to the time required for the evaluation of the initial Jacobian.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Auer, L. H. 1991, in Stellar Atmospheres: Beyond Classical Models, NATO ASI Series (Dordrecht: Reidel) [Google Scholar]

[2] Auer, L. H., & Mihalas, D. 1969, ApJ, 158, 641 [NASA ADS] [CrossRef] [Google Scholar]

[3] Auer, L. H., & Paletou, F. 1994, A&A, 285, 675 [NASA ADS] [Google Scholar]

[4] Auer, L. H., Fabiani Bendicho, P., & Trujillo Bueno, J. 1994, A&A, 292, 599 [NASA ADS] [Google Scholar]

[5] Broyden, C. G. 1965, Math. Comp., 19, 577 [Google Scholar]

[6] Elitzur, M., & Asensio Ramos, A. 2006, MNRAS, 365, 779 [Google Scholar]

[7] Heinzel, P. 1995, A&A, 299, 563 [NASA ADS] [Google Scholar]

[8] Koesterke, L., Hamman, W.-R., & Kosmol, P. 1992, A&A, 255, 490 [NASA ADS] [Google Scholar]

[9] Mihalas, D. 1978, Stellar Atmospheres (San Francisco: Freeman) [Google Scholar]

[10] Paletou, F. 1995, A&A, 302, 587 [NASA ADS] [Google Scholar]

[11] Paletou, F., & Léger, L. 2007, JQSRT, 103, 57 [Google Scholar]

[12] Press, W. H., Teukolsky, S. A., Vetterling, W. T., et al. 1992, Numerical recipes (Cambridge: University Press) [Google Scholar]

[13] Rybicki, G. B., & Hummer, D. G. 1991, A&A, 245, 171 [NASA ADS] [Google Scholar]

[14] Socas-Navarro, H., & Trujillo Bueno, J. 1997, ApJ, 490, 383 [NASA ADS] [CrossRef] [Google Scholar]

[15] Trujillo Bueno, J., & Fabiani Bendicho, P. 1995, ApJ, 455, 646 [NASA ADS] [CrossRef] [Google Scholar]