A new nonlocal thermodynamical equilibrium radiative transfer method for cool stars

J. Lambert; E. Josselin; N. Ryde; A. Faure

doi:10.1051/0004-6361/201322852

Home

All issues

Volume 580 (August 2015)

A&A, 580 (2015) A50

Full HTML

Free Access

Issue		A&A Volume 580, August 2015


Article Number		A50
Number of page(s)		12
Section		Numerical methods and codes
DOI		https://doi.org/10.1051/0004-6361/201322852
Published online		29 July 2015

A&A 580, A50 (2015)

Method and numerical implementation

J. Lambert¹^,2, E. Josselin², N. Ryde¹ and A. Faure³

¹ Lund Observatory, Box 43, 221 00 Lund, Sweden
e-mail: julien.lambert@astro.lu.se
² Laboratoire Univers et Particules de Montpellier (LUPM), UMR 5299, CNRS, Université Montpellier 2 − CC72 Place Eugène Bataillon, 34095 Montpellier Cedex 5, France
³ Institut de Planétologie et d’Astrophysique de Grenoble (IPAG), UMR 5109, Université Joseph Fourier, CNRS, OSUG, 38041 Grenoble Cedex 9, France

Received: 15 October 2013
Accepted: 2 April 2015

Abstract

Context. The solution of the nonlocal thermodynamical equilibrium (non-LTE) radiative transfer equation usually relies on stationary iterative methods, which may falsely converge in some cases. Furthermore, these methods are often unable to handle large-scale systems, such as molecular spectra emerging from, for example, cool stellar atmospheres.

Aims. Our objective is to develop a new method, which aims to circumvent these problems, using nonstationary numerical techniques and taking advantage of parallel computers.

Methods. The technique we develop may be seen as a generalization of the coupled escape probability method. It solves the statistical equilibrium equations in all layers of a discretized model simultaneously. The numerical scheme adopted is based on the generalized minimum residual method.

Results. The code has already been applied to the special case of the water spectrum in a red supergiant stellar atmosphere. This demonstrates the fast convergence of this method, and opens the way to a wide variety of astrophysical problems.

Key words: radiative transfer / stars: atmospheres / methods: numerical

© ESO, 2015

1. Introduction

Radiative transfer plays a central role in astrophysics. The solution of the radiative transfer equation (RTE) is required to properly interpret any spectrum observed from an astrophysical object. Furthermore the radiation field often represents an important ingredient in hydrodynamics, for instance, through the energy budget (radiative heating and/or cooling) and the radiative pressure. However, as we discuss below, while the standard expression of the RTE is an ordinary differential equation of the 1st order, it may be highly nonlinear and nonlocal, especially because of scattering processes. The solution of the RTE thus requires specific procedures. Furthermore, with the development of infrared (IR) astronomy (e.g., from VLT, CRIRES in the near-IR to ALMA through Herschel/HIFI), one is now often confronted with very large-scale problems resulting from molecular spectra, which most techniques developed so far cannot handle.

The techniques for solving the RTE may be divided into a few strategies. Among approximate methods, one that is widely used, relies on the escape probability technique (e.g., Castor 1970), which does not take the nonlocality of the RTE into account. Secondly, Monte Carlo methods can be used because of the probabilistic nature of scattering processes. These methods can successfully be adapted to any geometry, but may be unadapted when large optical depths are met (see, e.g., Juvela 2005, for a recent review). We do not discuss these two classes of methods hereafter.

To properly treat the nonlocality and the nonlinearity of the RTE, it is usually rewritten in the form of a diffusion integral equation, which is then solved iteratively¹. This integral form is usually referred to as the Λ operator. The form involves a kernel, which determines the mathematical nature of the integral equation. For example, in the case of an assumed plane parallel geometry, the kernel involves the exponential integral function, and the RTE then becomes a weakly singular Fredholm equation of the second type (this singularity is an important aspect, which will be treated below). It can then be formally solved thanks to appropriate algorithms, such as the Atkinson algorithm (Ahues et al. 2002). Since the inversion of the Λ operator is usually numerically prohibitive, one needs an iteration scheme. It is well known that the iteration with the complete Λ operator at best converges very slowly or even stabilizes far from the real solution. A major step in solving the RTE has been the introduction of the so-called approximate lambda iteration (ALI), initially by Cannon (1973), then called the operator splitting method. The ALI is in fact an application of the Jacobi method to the RTE, as shown by Olson et al. (1986), who greatly improved the mathematical understanding of the problem. In these methods, the choice of the approximate operator (denoted Λ^⋆) is crucial. The most common choice for Λ^⋆ is the diagonal part of Λ, which ensures a rapid convergence if Λ is diagonally dominant matrix (Gershgorin’s theorem). ALI has been applied to a large variety of problems, including 3D polarized transfer (Štěpán & Trujillo Bueno 2013). This latter work also emphasizes the potential progress that can be made by taking advantage of massively parallel computing.

Meanwhile, new mathematical techniques have been applied to the RTE problem, especially the Gauss-Seidel method and the successive over relaxation (SOR) method, which significantly improve the convergence properties (Trujillo Bueno & Fabiani Bendicho 1995) that are determined by the spectral radius of their amplification matrix. Besides these, complete linearization methods have also been developed, the well-known case being MULTI (Scharmer & Carlsson 1985). Despite apparent different formulations, the two approaches are essentially the same, as shown by Socas-Navarro & Trujillo Bueno (1997).

Indeed, all these methods are based on linearization techniques. In particular, the dependence of the radiation field on the populations of the energy levels is approximated. Furthermore, all these methods are stationary (in the case of SOR, this is true if the relaxation parameter is constant over iterations), meaning that the spectral radius is constant. As the error vector e^m at the mth iteration is proportional to the power m of the spectral radius of the amplification matrix | λ_max | ^m, the convergence may thus be very slow in some cases. For the Lambda iteration, the spectral radius is close to 1 when the photon destruction probability ϵ ≪ 1 and/or the total optical depth is ≫1, explaining why it falsely converges. This is illustrated in Figs. 1 and 2, where the classical Eddington problem (two-level atom with constant ϵ) is solved with Λ iteration (LI) and ALI, respectively. The false convergence of LI and the very low convergence of ALI are illustrated in cases B. Indeed, their rate of convergence is very slow for strongly nonlocal thermodynamical equilibrium (non-LTE) problems (non-LTE parameter ϵ ≪ 1, ϵ being the fraction of the source function given by the Planck function, or the photon destruction probability) and to the number of points per decade of optical depth. In the latter case (i.e., a fine sampling of the stellar atmosphere), this problem of slow convergence is well explained by Fabiani Bendicho et al. (1997), who also showed that the multigrid method can nicely circumvent this problem. This fine sampling is however rarely required, especially as classical model atmospheres are static. An excellent discussion of the general problem of false convergence is given in Chap. 13 in Hubeny & Mihalas (2014).

The iterative schemes are usually combined with a technique of acceleration of convergence (Auer 1991). These fall into two main categories. The first is based on the minimization of the residual, which is the case of the Ng acceleration (Ng 1974). The second class relies on minimization with respect to a set of conjugate vectors. This was first applied to the RTE by Klein et al. (1989; see also Dickel & Auer 1994).

The second class of techniques (the minimization with respect to a set of conjugate vectors) leads naturally to more general, nonlinear, nonstationary methods, such as the conjugate gradient and the generalized minimum residual method (GMRES). To our knowledge, only the conjugate gradient has been applied to the RTE, by Paletou & Anterrieu (2009).

The aim of our work is to develop a new nonstationary method for solving the RTE, which fully takes its nonlinearity into account. This new method thus opens the possibility to explore the mathematical consequences of this intrinsic property of the RTE. Furthermore, circumventing the possible slow or false convergence of stationary methods should allow an appropriate treatment of some specific cases, such as, e.g., shocked regions, which require a fine sampling and present a large dynamics of optical depth values. Finally, as the coding strategy of this method has been conceived as a parallel code from the beginning, it is applicable to large-scale systems such as molecular spectra, and this was indeed our initial motivation (a posteriori parallelization is often less efficient, or even impossible).

The derivation of the equations, including the Jacobian matrix coefficients, used in the nonlinear method is presented in Sect. 2. The implementation of this method and its parallelization are described in Sect. 3. The application of the code to a specific problem facilitates the determination of its convergence properties, as shown in Sect. 4.

Fig. 1

Spectral radius of amplification matrix for the Λ iteration method. Convergence is shown for two spectral radii, marked with letters A (upper panel) and B (lower panel) in the right-hand panels. The thin black lines correspond to successive iterations; the true solution is given by the orange line.

Fig. 2

Same as Fig. 1, except for ALI with Λ^⋆ = diagonal of Λ.

2. Global strategy

Contrary to the two-level atom problem with a constant destruction probability, which can be directly solved by direct inversion (or through the analytic Eddington solution). Multi-level atom or molecule problems require an iterative determination for two reasons. Firstly, the multi-level problem is nonlinear. Secondly, the radiative transfer equation is nonlocal, and one has to propagate the modification of the field during iterations. As we discussed in the previous section, however, this stationary iteration may be problematic for large-scale systems. The basic concept of the strategy we adopt is to suppress the iterations due to the nonlocality of the RTE by considering the full system (spatially) and computing the radiation field exactly (assuming given populations) with an analytic formulation of the mean intensity field, which explicitly depends on populations.

2.1. Formulation of the problem

For clarity, we hereafter consider the special case of plane parallel geometry. In this context the 1D RTE is $μ \frac{d I_{ν}}{d s} = η_{ν}^{Cont} + η_{ν}^{Line} - (χ_{ν}^{Cont} + {χ_{ν}^{Line}}^{)} I_{ν},$ $\begin{equation} \label{eq:Radiativ transfer equationCH2} \mu \frac{{\rm d}I_\nu}{{\rm d}s}=\eta_\nu^\mathrm{Cont}+\eta_\nu^\mathrm{Line}-\left(\chi_\nu^\mathrm{Cont}+ \chi_\nu^\mathrm{Line}\right)I_\nu , \end{equation}$ (1)where I_ν is the specific intensity, η and χ are the emission and extinction coefficients, and s is the geometrical path. We explicitly separate the continuum and the line processes (all demonstrations and symbols can be found in Appendices A and C, respectively). Assuming complete redistribution, we introduce classical expressions for line processes, $\begin{matrix} χ_{ν}^{Line} & = & φ_{ν}^{ul} χ_{ul} = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} (B_{lu} n_{l} - B_{ul} n_{u}) \\ η_{ν}^{Line} & = & φ_{ν}^{ul} η_{ul} = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} A_{ul} n_{u,} \end{matrix}$ $% subequation 914 0 \begin{eqnarray} \label{eq:def1a} \chi_\nu^\mathrm{Line} & =& \phi^{ul}_\nu \chi_{ul} = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} (B_{lu}n_{l}-B_{ul}n_{u})\\ \label{eq:def1b} \eta_\nu^\mathrm{Line} & = &\phi^{ul}_\nu \eta_{ul} = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi}A_{ul}n_{u,} \end{eqnarray}$ where u and l refer to the upper and lower levels, respectively. Hereafter, if the distinction between upper and lower levels is not required, the indices i and j are used. The normalized line profile is $φ_{ν}^{ul}$ $\hbox{$\phi^{ul}_\nu$}$ and A and B are the Einstein coefficients.

Following the strategy of Gonzalez Garcia et al. (2008), we then introduce a unique optical depth scale for all frequencies, e.g,. the commonly used τ_V scale (optical depth at 500 nm). Equation (1)then becomes $μ \frac{d I_{ν}}{d τ_{V}} = [ξ_{ν} + E_{ij} ζ φ_{ν}^{ij} (x_{j} - x_{i})^{]} I_{ν} - D_{ij} x_{i} ζ φ_{ν}^{ij} - ξ_{ν} S_{ν},$ $\begin{equation} \label{eq:Radiativ transfer equationvar} \mu\frac{{\rm d}I_\nu}{{\rm d}\tau_\mathrm{V}}=\left[\xi_\nu + E_{ij}\zeta\phi^{ij}_\nu(x_j-x_i)\right]I_\nu - D_{ij}x_i\zeta\phi^{ij}_\nu-\xi_\nu S_\nu , \end{equation}$ (3)where

$\begin{matrix} ξ_{ν} = \frac{{χ_{ν_{ij}}^{Cont}}_{˜}}{\begin{matrix} ˜ \\ χ_{V} \end{matrix}} \\ ζ = \frac{1}{\begin{matrix} ˜ \\ χ_{V} \end{matrix}} \frac{n^{k}}{n^{H}} \\ E_{ij} = \frac{h ν_{ij}}{4 π} B_{ij} g_{i} \\ D_{ij} = \frac{h ν_{ij}}{4 π} A_{ij} g_{i} \\ x_{i} = \frac{n_{i}}{n^{k}} \frac{1}{g_{i}} \cdot \end{matrix}$ $% subequation 971 0 \begin{eqnarray} &&\xi_{\nu} =\frac{\tilde{\chi}_{\nu_{ij}}^\mathrm{Cont}}{\tilde{\chi}_\mathrm{V}} \\ &&\zeta =\frac{1}{\tilde{\chi}_\mathrm{V}}\frac{n^{k}}{n^{H}} \\ &&E_{ij} =\frac{h\nu_{ij}}{4\pi}B_{ij}g_i \\ &&D_{ij} =\frac{h\nu_{ij}}{4\pi}A_{ij}g_i \\ &&x_i =\frac{n_i}{n^k}\frac{1}{g_i}\cdot \end{eqnarray}$ The notation $\begin{matrix} ˜ \\ . \end{matrix}$ $\hbox{$\tilde{.}$}$ means that the quantity is expressed per hydrogen atom. Details can be found in Appendix A. As implicitly assumed in Eq. (3), we do not consider line overlap. This assumption is justified in the case of infrared molecular lines. Otherwise, a summation over transitions (at least adjacent ones) would be required.

This formulation depends explicitly on the populations of each energy level of the species considered. These populations are solutions of the statistical equilibrium equation system², which is for each level n_i, the detailed balance of incoming and outgoing (de-)excitation processes, defined by radiative (Einstein coefficients), and collisional rates (n_col being the local density of the colliding partner). Thus the statistical equilibrium equations take the form $\begin{matrix} \frac{d n_{i}}{d t} & = & n_{i} [\sum_{j < i} A_{ij} + \sum_{j \neq i} (B_{ij} {𝒥̅}_{ij} + n_{col} C_{ij})] \end{matrix}$ $\begin{eqnarray} \frac{{\rm d}n_i}{{\rm d}t}& = &n_i \left[\sum_{j<i}{A_{ij}+ \sum_{j\ne i}{\left(B_{ij}{\bar{\mathcal{J}}}_{ij}+ n_{col} C_{ij} \right)}}\right]\nonumber\\ & &- \left[\sum_{j>i}{n_j A_{ji}+ \sum_{j\ne i}{n_j \left(B_{ji}{\bar{\mathcal{J}}}_{ji} +n_{col}C_{ji} \right)}}\right]=0 \label{eq:ES} , \end{eqnarray}$ (5)where the mean radiation field $\hbox{${\bar{\mathcal{J}}}_{ij}({\tau }_{\rm V})={\bar{\mathcal{J}}}_{ji}({\tau }_{\rm V})$}$ depends on the solution of the RTE, ${𝒥̅}_{ij} (τ_{V}) = \frac{1}{2} \int_{0}^{\infty} φ_{ν}^{ij} (τ_{V}) \int_{-1}^{+ 1} I_{ν} (τ_{V},μ) d μ d ν .$ $\begin{equation} \label{eq:jbar} {\bar{\mathcal{J}}}_{ij}(\tau_\mathrm{V}) = \frac{1}{2} \int_{0}^{\infty}{ \phi^{ij}_\nu (\tau_\mathrm{V}) \int_{-1}^{+1} {I_\nu(\tau_\mathrm{V},\mu){\rm d}\mu} {\rm d}\nu} . \end{equation}$ (6)Because this system of equations is degenerate, one of them is replaced by the normalization equation $\sum_{i}^{N} n_{i} = n^{k}$ $\hbox{$\sum_i^N{n_i}=n^k$}$ , where n^k is the total density of the species k.

To express each equation of the statistical equilibrium system with the same unknowns as in Eq. (3), we rewrite the statistical equilibrium with the variables x_i defined in Eq. (4). Moreover, for numerical reasons, we make a variable change for the mean intensity, ${𝒥̅}_{ij} (τ_{V}) = \frac{2 h ν_{ij}^{3}}{c^{2}} {𝒵̅}_{ij} (τ_{V}) .$ $\begin{equation} \label{eq:defZ} {\bar{\mathcal{J}}}_{ij}(\tau_\mathrm{V}) = \frac{2h\nu_{ij}^3}{c^2}{\bar{\mathcal{Z}}}_{ij}(\tau_\mathrm{V}) . \end{equation}$ (7)This leads to a matrix formulation of the statistical equilibrium, with an upper triangular part corresponding to incoming processes, and a lower triangular part corresponding to outgoing processes. Finally, thanks to the Einstein relations ( $B_{ul} = (c^{2} / (2 h ν_{ul}^{3})) A_{ul}$ $\hbox{$B_{ul}=(c^2/(2h\nu_{ul}^3))A_{ul}$}$ and B_lu = (g_u/g_l)B_ul), the system of Eqs. (5)becomes $\begin{matrix} x_{i} [\sum_{j < i} g_{i} A_{ij} (1 + {𝒵̅}_{ij}) + \sum_{j > i} g_{j} A_{ji} {𝒵̅}_{ij} + \sum_{j \neq i} n_{col} g_{i} C_{ij}] \end{matrix}$ $\begin{eqnarray} && x_i \left[\sum_{j\,<\,i}{g_i {A}_{ij}\left(1+{\bar{\mathcal{Z}}}_{ij}\right)} + \sum_{j\,>\,i}{g_j {A}_{ji}{\bar{\mathcal{Z}}}_{ij}} + \sum_{j \ne i}{n_{\rm col} g_i {C}_{ij}}\right]\nonumber\\ &&- \left[\sum_{j\,<\,i}{x_j g_j {A}_{ij}{\bar{\mathcal{Z}}}_{ij}} + \sum_{j \,>\, i}{x_j g_j {A}_{ji}\left(1+{\bar{\mathcal{Z}}}_{ji}\right)} + \sum_{j \,\ne\, i}{n_{\rm col} g_i {C}_{ij}} \right]=0 \label{eq:ES2} . \end{eqnarray}$ (8)Because radiative transfer is a nonlocal problem, an integral formulation of the mean radiation field with an explicit dependence on populations is required.

2.2. The mean radiation field

To derive the mean radiation field, the first step consists of deriving the formal solution of the RTE. A classical way to obtain an integral form of the formal solution of a differential equation is to use the Green’s functions G_μν. This method consists of replacing the source term by a Dirac function. Taking the boundary conditions into account, the equation is then solved using its Laplace transform. The complete solution is then obtained after integration of this Green’s function, that is, performing a summation over each point source, according to the superposition principle (which is valid if the differential operator is linear). Thus, the integral form of I_ν(τ_V,μ) can be obtained by the determination of the Green’s function of Eq. (3). This Green’s function is easily computable for spherical and other geometries using some symbolic tools such as Mathematica, making it possible to generalize the method for 2D or 3D problems. Knowing this Green’s function, we can directly obtain the formal integral form of the specific intensity, which is $I_{ν} (τ_{V},μ) = \int_{τ_{V}^{Min}}^{τ_{V}^{Max}} [D_{ij} x_{i} (t) ζ (t) φ_{ν}^{ij} (t) + ξ_{ν} (t) S_{ν} (t)^{]} G_{μν} (τ_{V}; t) d t .$ $\begin{equation} \label{eq:Inu} I_{\nu}(\tau_\mathrm{V},\mu)= \int^{\tau^\mathrm{Max}_\mathrm{V}}_{\tau^\mathrm{Min}_\mathrm{V}}{\left[ D_{ij}x_i(t)\zeta(t)\phi^{ij}_\nu(t) +\xi_\nu(t) S_\nu(t)\right]\mathscr{G}_{\mu \nu}(\tau_\mathrm{V};t){\rm d}t} . \end{equation}$ (9)We assume the line profile does not depend on μ. This expression is thus not applicable in the case of anisotropic scattering or in the presence of radiation fields, but is valid in the case of classical static model atmospheres. The formal expression of the mean intensity, $\hbox{${\bar{\mathcal{J}}}_{ij}({\tau }_\mathrm{V})$}$ , requires an integration over solid angles and frequencies according to Eq (6). We assume that D_ijx_iζ and ξ_νS_ν are angular independent (physically this corresponds to the assumption of isotropic scattering) and frequency independent over the line width (ξ_νS_ν → ξ_ijS_{ν_ij}). This is a very weak assumption because the continuum is nearly constant on the scale of a line width. Then, we can extract the source terms from the integral over solid angles and frequencies. Moreover, we separate line and continuum contributions by splitting the integral, which leads to $\begin{matrix} {𝒥̅}_{ij} (τ_{V}) & = & \int \begin{matrix} τ_{V}^{Max} \\ τ_{V}^{Min} \end{matrix} ξ_{ij} (t) S_{ν_{ij}} (t) \\ \times [\frac{1}{2} \int_{0}^{\infty} φ_{ν}^{ij} (τ_{V}) [\int_{-1}^{+ 1} G_{μν} (τ_{V}; t) d μ] d ν] d t \\ + D_{ij} \int_{τ_{V}^{Min}}^{τ_{V}^{Max}} x_{i} (t) ζ (t) \\ \times [\frac{1}{2} \int_{0}^{\infty} φ_{ν}^{ij} (τ_{V}) φ_{ν}^{ij} (t) [\int_{-1}^{+ 1} G_{μν} (τ_{V}; t) d μ] d ν] d t \end{matrix}$ $\begin{eqnarray} {\bar{\mathcal{J}}}_{ij}({\tau }_\mathrm{V}) & =&\int^{{\tau}^\mathrm{Max}_\mathrm{V}}_{{\tau }^\mathrm{Min}_\mathrm{V}}{\xi_{{ij}}(t) S_{\nu_{ij}}(t)}\nonumber\\ && \times \left[\frac{1}{2}\int_{0}^{\infty}{ \phi^{ij}_\nu (\tau_\mathrm{V}) \left[\int_{-1}^{+1}{ \mathscr{G}_{\mu \nu}(\tau_\mathrm{V};t){\rm d}\mu}\right]{\rm d}\nu}\right]{\rm d}t\nonumber\\ &&+D_{ij}\int^{{\tau}^\mathrm{Max}_\mathrm{V}}_{{\tau}^\mathrm{Min}_\mathrm{V}}{x_i(t)\zeta(t)}\nonumber\\ & & \times \left[\frac{1}{2}\int_{0}^{\infty}{ \phi^{ij}_\nu (\tau_\mathrm{V}) \phi^{ij}_{\nu}(t) \left[ \int_{-1}^{+1}{\mathscr{G}_{\mu \nu}(\tau_\mathrm{V};t){\rm d}\mu}\right]{\rm d}\nu}\right]{\rm d}t \end{eqnarray}$ (10)Then, after determination of the Green’s function in a given geometry, one obtains an integral form for the different contributors to the radiation field. We assume that the continuum contribution is in LTE (S_{ν_ij} = B_{ν_ij}(T)). Moreover, we consider, as boundary conditions, an incoming field $I_{-}^{Ext}$ $\hbox{$I_-^\mathrm{Ext}$}$ from the inner atmosphere. Then, $𝒥̅ ij (τ_{V}) = 𝒥̅ \begin{matrix} Ext \\ ij \end{matrix} (τ_{V}) + 𝒥̅ \begin{matrix} Cont \\ ij \end{matrix} (τ_{V}) + 𝒥̅ \begin{matrix} Line \\ ij \end{matrix} (τ_{V})$ $\begin{equation} \bar{\mathcal{J}}_{ij}(\tau_\mathrm{V})= \bar{\mathcal{J}}_{ij}^\mathrm{Ext}(\tau_\mathrm{V}) +\bar{\mathcal{J}}_{ij}^\mathrm{Cont}(\tau_\mathrm{V}) +\bar{\mathcal{J}}_{ij}^\mathrm{Line}(\tau_\mathrm{V}) \end{equation}$ (11)where, $\begin{matrix} 𝒥̅ \begin{matrix} Ext \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V}) & = \\ 𝒥̅ \begin{matrix} Cont \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V}) & = \\ {𝒥̅}_{ij}^{Line} [x_{i}, x_{j}] (τ_{V}) & = \end{matrix}$ $% subequation 1199 0 \begin{eqnarray} \bar{\mathcal{J}}^\mathrm{Ext}_{ij}[x_i,x_j] (\tau_\mathrm{V}) & =&I_-^\mathrm{Ext}\beta_-[x_i,x_j](\tau_\mathrm{V})~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \label{eq:Jtota} \\ \bar{\mathcal{J}}^\mathrm{Cont}_{ij}[x_i,x_j] (\tau_\mathrm{V}) & =&\int^{{\tau}^\mathrm{Max}_\mathrm{V}}_{{\tau}^\mathrm{Min}_\mathrm{V}}{\xi_{{ij}}(t)B_{\nu_{ij}}(T(t)) L_1 [x_i,x_j](t,\tau_\mathrm{V}){\rm d}t} \label{eq:Jtotb} \\ {\bar{\mathcal{J}}}^\mathrm{Line}_{ij}[x_i,x_j] (\tau_\mathrm{V}) &=&D_{ij}\int^{{\tau}^\mathrm{Max}_\mathrm{V}}_{{\tau}^\mathrm{Min}_\mathrm{V}}{x_i(t)\zeta(t) K_1 [x_i,x_j](t,\tau_\mathrm{V}){\rm d}t,} \label{eq:Jtotc} \end{eqnarray}$ where β₋ [x_i,x_j] (τ_V) is the escape probability from position τ_V in the atmosphere, $β_{-} [x_{i}, x_{j}] (τ_{V}) = \frac{1}{2} \int_{0}^{+ \infty} φ_{ν}^{ij} (τ_{V}) \times E_{2} (τ^{Tot} [x_{i}, x_{j}] (τ_{V})^{)} d ν .$ $\begin{equation} \label{eq:beta} \beta_-[x_i,x_j](\tau_\mathrm{V}) = \frac{1}{2}\int_{0}^{+\infty}{\phi^{ij}_\nu(\tau_\mathrm{V})}\\ \times { E_2 \left(\tau^\mathrm{Tot}[x_i,x_j] (\tau_\mathrm{V})\right) {\rm d}\nu} . \end{equation}$ (13)The kernels L₁ [x_i,x_j] (t,τ_V) and K₁ [x_i,x_j] (t,τ_V) are, respectively: $\begin{matrix} L_{1} & [x_{i}, x_{j}] (t, τ_{V}) = \frac{1}{2} \int_{0}^{+ \infty} φ_{ν}^{ij} (τ_{V}) \\ \times E_{1} (- ε (t, τ_{V}) {(τ^{Tot} [x_{i}, x_{j}] (t) - τ^{Tot} [x_{i}, x_{j}] {(τ_{V})}^{)}}^{)} d ν \\ K_{1} & [x_{i}, x_{j}] (t, τ_{V}) = \frac{1}{2} \int_{0}^{+ \infty} φ_{ν}^{ij} (τ_{V}) φ_{ν}^{ij} (t) \\ \times E_{1} (- ε (t, τ_{V}) {(τ^{Tot} [x_{i}, x_{j}] (t) - τ^{Tot} [x_{i}, x_{j}] {(τ_{V})}^{)}}^{)} d ν, \end{matrix}$ $% subequation 1243 0 \begin{eqnarray} L_1&[x_i,x_j](t,\tau_\mathrm{V}) = \frac{1}{2}\int_{0}^{+\infty}{\phi^{ij}_\nu(\tau_\mathrm{V})}\nonumber\\ & {\times E_1\left(-\varepsilon(t,\tau_\mathrm{V})\left(\tau^\mathrm{Tot}[x_i,x_j](t) -\tau^\mathrm{Tot}[x_i,x_j]\left(\tau_\mathrm{V}\right)\right)\right) {\rm d}\nu} \\ K_1&[x_i,x_j](t,\tau_\mathrm{V}) = \frac{1}{2}\int_{0}^{+\infty}{\phi^{ij}_\nu(\tau_\mathrm{V})\phi^{ij}_\nu(t)}\nonumber\\ & {\times E_1\left(-\varepsilon\left(t,\tau_\mathrm{V}\right)\left(\tau^\mathrm{Tot}[x_i,x_j](t) -\tau^\mathrm{Tot}[x_i,x_j]\left(\tau_\mathrm{V}\right)\right)\right) {\rm d}\nu,} \end{eqnarray}$ where the total optical depth is given by $\begin{matrix} τ_{ν}^{Tot} (τ_{V}) & = & \int \begin{matrix} τ_{V} \\ 0 \end{matrix} (χ_{ν}^{Cont} + {χ_{ν}^{Line}}^{)} d t \\ = & \int_{0}^{τ_{V}} (ξ_{ν} (t) + E_{ij} ζ (t) φ_{ν}^{ij} (t) [x_{j} (t) - x_{i} (t)]^{)} d t . \end{matrix}$ $\begin{eqnarray} \tau_\nu^\mathrm{Tot}(\tau_\text{V}) & =&\int_{0}^{\tau_\text{V}}{ \left( \chi_\nu^\text{Cont}+ \chi_\nu^\text{Line} \right) {\rm d}t}\nonumber \\ & =&\int_{0}^{\tau_\mathrm{V}}{\left(\xi_\nu(t) + E_{ij}\zeta(t)\phi^{ij}_\nu(t)[x_j(t)-x_i(t)]\right) {\rm d}t} . \end{eqnarray}$ (15)We do not include an external radiation field. In the plane parallel geometry considered here, it would require us to take additional effects such as limb darkening in realistic cases into account (e.g., illumination by a planet or a companion). We have introduced the function ε(t,τ_V) = 1 if t < τ_V and −1 if t > τ_V to split the integration over μ according to its sign. The physical meaning of the kernels L₁(t,τ_V) and K₁(t,τ_V) can be understood as the probability density for photons, emitted in a layer t, to contribute to the line excitation in the layer τ_V.

We now have an analytic expression of the mean intensity with an explicit dependence on populations. We now have to include this expression in the equations of statistical equilibrium to close the system and form a system of coupled integral equations (a Fredholm-like equation system).

2.3. The multi-zone statistical equilibrium equation system

The inversion of the system of statistical equilibrium equations allows us to obtain the populations also including the formal solution of RTE. But because it is nonlocal, the knowledge of $\hbox{$\bar{\mathcal{J}}$}$ requires the knowledge of the populations everywhere in the atmosphere because of the integral over the optical depth t in Eq. (12).

For this reason, one needs to solve the statistical equilibrium everywhere in the atmosphere simultaneously. The first step consists of rewriting Eq. (5)in a matrix form. A vector x(τ_V) containing all the populations of the considered species in a given layer, normalized to the density in that layer, is built. The size of this vector is thus equal to N, the number of energy levels of the considered species. We introduce the rate matrix $\begin{matrix} R [x] (τ_{V}) & = & A^{T} ◦ {(L_{1} + Z̅ [x] (τ_{V}))}^{T} \end{matrix}$ $\begin{eqnarray} \mathbf{R}[\vec{x}]({\tau }_\mathrm{V})& = & \vec{A}^T \circ \left( \mathbf{L}_1 +\bar{\mathbf{Z}}[\vec{x}]({\tau }_\mathrm{V})\right)^T \nonumber\\ &&+\mathrm{diag(\vec{g})} \!\cdot\! \left[\vec{A}\!\circ\!s \bar{\mathbf{Z}}[\vec{x}]({\tau }_\mathrm{V})\right] \cdot \mathrm{diag(\vec{g})}^{-1}\!+\!\mathbf{C}^T({\tau }_\mathrm{V}) \label{eq:matriceR} . \end{eqnarray}$ (16)Where L₁ is a lower triangular matrix of ones, A is a vector containing the spontaneous de-excitation coefficients (A_ul), Z is a matrix containing the radiation field (see Eq. (7)), C a matrix containing the collisional rates (n_colC_ij), and g is a vector containing the statistical weights (g_i).

Now the statistical equilibrium system can be written as, $M^{T} [x] (τ_{V}) x (τ_{V}) = 0,$ $\begin{equation} \label{eq:ES4} \mathbf{M}^{T}[\vec{x}]({\tau }_\mathrm{V}) \vec{x}({\tau }_\mathrm{V}) = \mathbf{0} , \end{equation}$ (17)with $M [x] (τ_{V}) = [diag (u \cdot R [x] (τ_{V})) - R [x] (τ_{V})] \cdot diag (g),$ $\begin{equation} \label{eq:matriceM} \mathbf{M}[\vec{x}](\tau_\mathrm{V})=\left[\mathrm{diag(\vec{u} \cdot \mathbf{R}[\vec{x}]({\tau }_\mathrm{V}))} - \mathbf{R}[\vec{x}]({\tau }_\mathrm{V})\right] \cdot \mathrm{diag(\vec{g})} , \end{equation}$ (18)where u is a ones vector. After discretization of the medium into P layers indexed by l, the vectorial function x(τ_V) becomes x^l and Eq. (16)becomes $\begin{matrix} R^{l} (x^{1}, x^{2}, ..., x^{P}) = A^{T} ◦ {(L_{1} + Z̅ l (x^{1}, x^{2}, ..., x^{P})^{)}}^{T} \\ + diag (g) \cdot [A ◦ Z̅ l (x^{1}, x^{2}, ..., x^{P})^{]} \cdot {diag (g)}^{-1} + {[C^{l}]}^{T} \end{matrix}$ $\begin{eqnarray} &&\mathbf{R}^l(\vec{x}^1,\vec{x}^2,...,\vec{x}^P) = \mathbf{A}^T \circ \left( \mathbf{L}_1 +\bar{\mathbf{Z}}^l(\vec{x}^1,\vec{x}^2,...,\vec{x}^P)\right)^T \nonumber\\ &&+\mathrm{diag(\vec{g})} \cdot \left[\vec{A}\circ \bar{\mathbf{Z}}^l(\vec{x}^1,\vec{x}^2,...,\vec{x}^P)\right] \cdot \mathrm{diag(\vec{g})}^{-1}+\left[\mathbf{C}^{l}\right]^T \end{eqnarray}$ (19)as the radiation field, and thus the matrix Z depends implicitly on the populations in all layers, and the statistical equilibrium system can be rewritten, ${[M^{l} (x^{1}, x^{2}, ..., x^{P})^{]}}^{T} x^{l} = 0.$ $\begin{equation} \label{eq:MatEqStat} \left[\mathbf{M}^l(\vec{x}^1,\vec{x}^2,...,\vec{x}^P)\right]^{T}\vec{x}^l = \mathbf{0} . \end{equation}$ (20)To take care of the coupling between the layers and close the system to solve Eq. (20)for all l in a unique step, we build the multi-zone statistical equilibrium system: ${(\begin{matrix} M^{1} (X^{)} & 0 & 0 \\ 0 & {\begin{matrix} . \end{matrix}}^{.} . & 0 \\ 0 & 0 & M^{p} (X^{)} \end{matrix})}^{T} (\begin{matrix} x^{1} \\ \begin{matrix} . \\ . \\ . \end{matrix} \\ x^{P} \end{matrix}) = (\begin{matrix} 0 \\ \begin{matrix} . \\ . \\ . \end{matrix} \\ 0 \end{matrix}) .$ $\begin{equation} \label{eq:MatEqStat2} \left( \begin{array}{ccc} {{\mathbf M}}^1\left({\vec X}\right) & 0 & 0\\ 0 & \ddots & 0\\ 0 & 0 & {{\mathbf M}}^p\left({\vec X}\right) \end{array} \right)^{T} \left( \begin{array}{c} \vec{x}^1\\ \vdots \\ \vec{x}^P \end{array} \right)= \left( \begin{array}{c} \mathbf{0} \\ \vdots \\ \mathbf{0} \end{array} \right) . \end{equation}$ (21)We introduce the vector X = (x¹,x²,...,x^P) and the block-diagonal matrix Γ to rewrite Eq. (21)as a multi-D nonlinear function, ${[Γ (X)]}^{T} X = F (X) = 0.$ $\begin{equation} \label{eq:fonction} \left[\boldsymbol{\Gamma}(\vec{X})\right]^{T}\vec{X}=\mathbf{F}(\vec{X})=\mathbf{0} . \end{equation}$ (22)With an initial X₀ within the convergence radius, we can solve this nonlinear equation with a classical scheme such as the Newton method. An important question, however, is to determine whether the above system has a solution and whether the solution we expect to get is physical (unique and positive).

2.4. Existence, uniqueness and positivity of the solution

From a physical point of view, one naturally expects a unique and positive solution of the RTE. Reflecting on this problem is in fact a way to verify whether it is mathematically well formulated. Indeed, as stated above, the coupling between the populations and the radiation field is nonlinear. Without linearization, the solution can be regarded as the equilibrium state of Markov processes. Furthermore, because of radiative selection rules and collisional propensity rules, many factors are very small, and may numerically tend toward zero. The problem is thus potentially ill-conditioned.

The formulation adopted here is a multi-zone statistical equilibrium. The problem of uniqueness and positivity of the solution of the mono-zone statistical equilibrium has been addressed by Damgaard et al. (1992) and Rybicki (1997). As this system of equations is degenerate, one equation is usually replaced by a normalization condition, namely the conservation equation, to overcome the singularity of the associated matrix. Then, Damgaard et al. (1992) show that the solution is positive if all the rates are strictly non-zero. Yet certain transition probabilities vanish, physically or numerically. As long as all the levels are connected, just one nonzero cofactor guarantees the regularity of the matrix. More generally, Rybicki (1997) demonstrated that even for the system without a normalization condition, positive states can link (and be linked) only to positive states. Then, if the statistical equilibrium equations can be transformed into a set of uncoupled irreducible subproblems with a strictly positive normalization condition, the solution is positive and unique. In other words, the uniqueness and positivity properties of the general linear statistical equilibrium equations relate to the underlying connectivity properties of the states.

In our multi-zone formulation, the coupling between statistical equilibria in every atmospheric layer is strongly nonlinear. That is, the system cannot be split into uncoupled sets of equations, i.e., a set of uncoupled irreducible subproblems. This may impact the number of required normalization conditions, while only one per atmospheric layer (the conservation equation) is available. The uniqueness of the solution may then be demonstrated through the search for a nonlinear fixed point as the solution of the equilibrium of a Markov chain (P. Azerad, priv. comm.). A forthcoming paper will be devoted to this. At least, as the properties of connectivity still apply, the positivity of the solution is guaranteed.

3. Implementation

3.1. Nonlinear solving

In the method developed here, the computation of the statistical equilibrium is based on its exact formulation and explicitly includes radiative transfer effects through the mean radiation field $\hbox{$\bar{\mathcal{J}}$}$ in Eq. (22). Finding the root of this function naturally leads to the exact solution to the statistical equilibrium, and avoids any false convergence that may be obtained with stationary iterative methods.

The computation of the norm of a function, $F : {(R^{+})}^{N \times P} \to R^{N \times P},$ $\begin{equation} \mathbf{F}:\left({\mathbb{R}^{+}}\right)^{N \times P} \rightarrow \mathbb{R}^{N \times P} , \end{equation}$ (23)N and P being the number of considered energy levels and the number of layers in the model atmosphere, respectively, is equivalent to an optimization problem, that is, the search for the minimum of the L₂ norm of the function f, $\min_{X \in R^{N \times P}} f (X) = \frac{1}{2} F (X) F^{T} (X) = \frac{1}{2} {∥ F (X) ∥}_{L_{2}}^{2} .$ $\begin{equation} \min_{\vec{X} \in \mathbb{R}^{N\times P}} f(\vec{X})=\frac{1}{2} \textbf{F}(\vec{X})\textbf{F}^{T}(\vec{X})=\frac{1}{2}{\|\textbf{F}(\vec{X})\|}^2_{L_2} . \end{equation}$ (24)The classical way to minimize this function is to move along a vector δX in a descending direction, using the Newton’s method. Indeed, for δX = X_k−X_k−1 = −J^-1·F, the gradient along the vector δX is $\begin{matrix} \nabla f (X_{k})^{T} δ X & = & [F (X_{k})^{T} J (X_{k})^{]} [- J (X_{k})^{-1} F (X_{k})^{]} \\ = \end{matrix}$ $\begin{eqnarray} \nabla f(\vec{X}_k)^{T} \delta \vec{X} &=& \left[\textbf{F}(\vec{X}_k)^{T} \textbf{J}(\vec{X}_k)\right]\left[-\textbf{J}(\vec{X}_k)^{-1}\textbf{F}(\vec{X}_k)\right] \nonumber\\ & =& -\textbf{F}(\vec{X}_k)^{T} \textbf{F}(\vec{X}_k) = -{\|\textbf{F}(\vec{X})\|}^2_{L_2} < 0 \label{eq:opti} . \end{eqnarray}$ (25)Newton’s method is very efficient in approaching the solution as it has a quadratic convergence. For each successive Newton step, we have to solve the linear problem JδX_k = −F, where k is the index of the Newton iteration. However, with classical algebric methods, this operation leads to an $\hbox{$\mathcal{O}(N^3)$}$ algorithm, which is prohibitive in terms of computational time for large-scale systems. A robust and efficient way to accelerate this operation is to approximate the solution using a faster algorithm. We adopt GMRES, which is a subclass of Krylov subspace methods.

A Krylov subspace method begins with an initial guess $δ X_{k}^{0}$ $\hbox{$\delta \vec{X}_k^0$}$ . At the nth sub-iteration, $δ X_{k}^{n}$ $\hbox{$\delta \vec{X}_k^n$}$ is determined through a correction in the nth Krylov subspace, $𝒦_{k}^{n} = span (R, J R, ..., J^{n - 1} R); R = - F - J δ X_{k}^{n} .$ $\begin{equation} \mathcal{K}_k^n = \text{span}(\textbf{R}, \textbf{J}\textbf{R}, . . ., \textbf{J}^{n-1}\textbf{R}) ; \textbf{R} = -\textbf{F} - \textbf{J} \delta \vec{X}_k^n . \end{equation}$ (26)The main advantage of this method is that its implementation requires only some vector matrix (or vector transposed matrix) products. It leads to a complexity in $\hbox{$\mathcal{O}(n^2),$}$ which may be reduced to $\hbox{$\mathcal{O}(n)$}$ if the matrix is sparse. Moreover, the scheme needs $\hbox{$\mathcal{O}(kn)$}$ operations, but the algorithm can be restarted to minimize this contribution.

For each Newton step, we thus have to solve the linear problem JδX = −F, which will be rewritten in this section as Ax = b for simplicity. Stationary iterative methods (Jacobi, Gauss-Seidel, SOR) approximate the unknown vector x with a scheme x^(k) = Bx^(k−1) + c, where neither B nor c depend upon the iteration count k. We use the GMRES method, which is part of a class of (nonstationary) methods named Krylov subspace methods. The GMRES algorithm tries to evaluate the guess $\hbox{$\vec{x}_n \in \mathcal{K}_n$}$ (the nth order Krylov subspace) by solving an optimization problem (minimizing the residue ∥ Ax_n−b ∥ _L₂) like a least-square problem. The nth Krylov subspace is spanned over the n basis vectors, $𝒦_{n} = span (b, A b, A^{2} b ..., A^{n - 1} b) .$ $\begin{equation} \mathcal{K}_n = \text{span}(\vec{b}, \vec{A}\vec{b},\vec{A}^2\vec{b} . . ., \vec{A}^{n-1}\vec{b}) . \end{equation}$ (27)Then, the approximate solution is x_n = K_nc, where the Krylov matrix is K_n = ^[b,Ab,A²b...,Aⁿ⁻¹b^] and c is an appropriate vector such that $\begin{matrix} \min \end{matrix} ∥ A x_{n} - b ∥_{L_{2}} = \begin{matrix} \min \end{matrix} ∥ A K_{n} c - b ∥_{L_{2}} .$ $\begin{equation} \min{\|\vec{A}\vec{x}_n-\vec{b}\|_{L_2}}=\min{\|\vec{A}\mathbf{K}_n\vec{c}-\vec{b}\|_{L_2}} . \end{equation}$ (28)This vector is easily computed as it just requires a Matrix-vector product. This is particularly well suited for sparse matrices because this operation evolves roughly as the number of nonzero elements.

A straightforward method to find the least-square solution of this problem would be to compute the QR factorization of the matrix AK_n. Indeed, the normal equation A^TAx = A^Tb ⇔ Rx = Q^Tb can be solved by back substitution. However, this is both unstable and too expensive (the QR factorization requires 2mn² flops, d = Q^Tb, 2mn flops, and Rx = d, n² flops, where m is the iterate and n is the order of the Krylov subspace). Moreover this natural Krylov basis is ill-conditioned and is thus not a good choice. Indeed, the basis vectors have to be linearly independent, and the successive multiplications by A lead to vectors, which point (fastly) to the dominant eigenvector of A and are thus numerically collinear. For this reason, we build another space , $\hbox{$\mathcal{Q}_n$}$ , where the basis vectors are orthogonal (this orthogonalization of Krylov bases is named the Arnoldi method). During the Arnoldi step to build the orthogonal basis, the matrix H of the orthogonalization elements is built. This matrix has a special shape (Hessenberg shape), which is close to triangular (one subdiagonal more). Our minimization problem can be rewritten as $\begin{matrix} \min \end{matrix} ∥ A x_{n} - b ∥_{L_{2}} = \begin{matrix} \min \end{matrix} ∥ A Q_{n} y - b ∥_{L_{2}} = \begin{matrix} \min \end{matrix} ∥ Q_{n + 1} H_{n + 1} y - b ∥_{L_{2}},$ $\begin{equation} \min{\|\vec{A}\vec{x}_n-\vec{b}\|_{L_2}}\!=\!\min{\|\vec{A}\mathbf{Q}_n\vec{y}-\vec{b}\|_{L_2}}\!=\!\min{\|\mathbf{Q}_{n+1}\mathbf{H}_{n+1}\vec{y}-\vec{b}\|_{L_2}} , \end{equation}$ (29)where y designs the new unknown (Q_ny = x_n).

Furthermore, we have to combine this method with a global convergence strategy. Indeed, as one gets close to the solution, the choice of the descending direction may lead to spurious effects, especially if δX becomes very large. Then, to ensure the global convergence of the scheme, we use a combination of the line search and backtracking methods, which is an efficient method to solve nonlinear equations. Precisely, when f_k>f_k−1, δX may be too large. Then, δX is reduced by a factor λ, $X_{k} = X_{k - 1} + λδ X,$ $\begin{equation} \vec{X}_k=\vec{X}_{k-1}+\lambda \delta \vec{X} , \end{equation}$ (30)with λ chosen to minimize f(X_k−1 + λδX) to maintain the iteration in the descending direction.

3.2. Treatment of the singularity

The computation of $\hbox{$\bar{\mathcal{J}}$}$ goes through the integration of a singular function. Indeed, in the integrand of Eq. (12), the function E₁(x) has a logarithmic divergence at x = 0⁺, see Fig 3.

To integrate this kernel, we use the periodization method developed by Helluy et al. (1998), which consists of a high order quadrature method with a variable change. The new variable is a polynomial of degree k, properties of which improve on the order of rectangle rule. The advantage of this method is that it can handle singular functions. Indeed the error | E_N(f) | on an interval subdivided in N subintervals goes as ~C_k/N^γ, where γ → (2k−1) if f has a logarithmic singularity at one bound of the integration domain (see Table 1).

Table 1

Benchmark of the periodization method for a logarithmic singularity (from Helluy et al. 1998).

However the periodization method requires that the integration is done over the interval [0,1] and that the singularity is located at one extremum of this interval. We thus split the integration domain into two subdomains, $[τ_{V}^{in}, τ_{V}]$ $\hbox{$[{\tau}^{\rm in}_V,{\tau}_V]$}$ and $[τ_{V}, τ_{V}^{out}]$ $\hbox{$[{\tau}_V,{\tau}^{\rm out}_V]$}$ , and then normalize each subdomain to [0,1]. However, since the MPI strategy limits the number of model layers handled by each process, this can lead to the undersampling of one of the subdomains, when τ_V approaches one of the bounds of the total domain (i.e., the boundaries of the model atmosphere). We have thus tested this method under the same conditions as those met in our problem, without any resampling of the subdomains.

Fig. 3

Map of the integrand of $\hbox{$\bar{\mathcal{J}}_{ij}$}$ for a radiative transition (see Eq. (12c)). The diagonal is singular, and the quadrature in the periodization method produces a mesh refinement around it. The color encoding of the integrant is in logarithmic arbitrary units.

The result is displayed in Fig. 4. As expected, the relative error strongly increases when one of the subdomains includes too few model layers (typically fewer than 4) and becomes unacceptable for the extreme points. They will thus be considered ghost points in all subsequent computations. This does not affect the computation of the statistical equilibrium.

Fig. 4

Benchmarking of the periodization method applied to the integration with a logarithmically singular kernel (g(t) = a₀ + a₁t, K(x,t) = −γ−ln( | G(t)−G(x) |) and $G (x) = {^{\int}}_{0}^{t} g (s) d s$ $\hbox{$G(x)=\int_0^t{g(s){\rm d}s}$}$ ). The integration domain has been split into two subdomains (see text for details).

Because we can compute the nonlinear function F(X) with good accuracy, the next step to find its root is to compute the Jacobian matrix to solve the system through a Newton’s scheme.

3.3. Computation of the Jacobian matrix

The method presented here, and more generally the solving of nonlinear systems of equations, requires the computation of the Jacobian matrix. One of the advantages of the method we develop is that an analytical form can be derived. Here, the function F(X) is a set of N × P equations, N being the number of energy levels of the species considered, and P the number of layers in the discretized model atmosphere. The differentiation of $F_{i}^{l}$ $\hbox{$F_i^l$}$ over $x_{k}^{m}$ $\hbox{$x_k^m$}$ (i and k referring to energy levels, and l and m to atmosphere layers) leads to a set of integro-differential equations, with the following four cases: $\begin{matrix} if k = i & and l = m \\ \frac{\partial F_{i}^{l}}{\partial x_{i}^{l}} & = & \sum_{j \neq i}^{N} \frac{\partial 𝒥̅ \begin{matrix} l \\ ij \end{matrix}}{\partial x_{i}^{l}} ({x_{i}^{l} B_{ij} - x_{j}^{l} B_{ji}}^{)} + \sum_{j \neq i}^{N} (A_{ji} + B_{ji} 𝒥̅ ji + n_{col} C_{ji}), \\ if k \neq i & and l = m \\ \frac{\partial F_{i}^{l}}{\partial x_{k}^{l}} & = & \frac{\partial 𝒥̅ \begin{matrix} l \\ ik \end{matrix}}{\partial x_{k}^{l}} ({x_{k} B_{ki} - x_{i} B_{ik}}^{)} - (A_{ki} + B_{ki} 𝒥̅ ki + n_{col} C_{ki}), \\ if k = i & and l \neq m \\ \frac{\partial F_{i}^{l}}{\partial x_{i}^{m}} & = & \sum_{j \neq i}^{N} \frac{\partial 𝒥̅ \begin{matrix} l \\ ij \end{matrix}}{\partial x_{i}^{m}} ({x_{i} B_{ij} - x_{j} B_{ji}}^{)}, \\ if k \neq i & and t \neq m \\ \frac{\partial F_{i}^{l}}{\partial x_{k}^{m}} & = & \frac{\partial 𝒥̅ \begin{matrix} l \\ ik \end{matrix}}{\partial x_{k}^{m}} ({x_{k} B_{ki} - x_{i} B_{ik}}^{)} . \end{matrix}$ $% subequation 2162 0 \begin{eqnarray} \text{if } k = i &\text{ and } l=m \label{eq:jac1} \\ \frac{{\partial F_i^l}} {{\partial x_i^l}}& =& \sum\limits_{j \ne i}^N\frac{\partial \bar{\mathcal{J}}^l_{ij}}{\partial x_i^l} \left( {x_i^l B_{ij} - x_j^l B_{ji} } \right) \nonumber + \sum\limits_{j \ne i}^N{\left( A_{ji}+B_{ji}\bar{\mathcal{J}}_{ji}+n_{col} C_{ji} \right), } \nonumber \\ \text{if } k \neq i &\text{ and } l=m \label{eq:jac2} \\ \frac{{\partial F_i^l}} {{\partial x_k^l}}& =& \frac{\partial \bar{\mathcal{J}}^l_{ik}}{\partial x_k^l} \left( {x_k B_{ki} - x_i B_{ik} } \right) - {\left( A_{ki}+B_{ki}\bar{\mathcal{J}}_{ki}+n_{col} C_{ki} \right), } \nonumber \\ \text{if } k = i &\text{ and } l \neq m \label{eq:jac3} \\ \frac{{\partial F_i^l}} {{\partial x_i^m}} &=& \sum\limits_{j \ne i}^N \frac{\partial \bar{\mathcal{J}}^l_{ij}}{\partial x_i^m} \left( {x_i B_{ij} - x_j B_{ji} } \right), \nonumber \\ \text{if } k \neq i &\text{ and } t \neq m \label{eq:jac4} \\ \frac{{\partial F_i^l}} {{\partial x_k^m}} &=& \frac{\partial \bar{\mathcal{J}}^l_{ik}}{\partial x_k^m} \left( {x_k B_{ki} - x_i B_{ik} } \right). \nonumber \end{eqnarray}$ Equations (31a)and (31b)represent local couplings. Mathematically, they correspond to the diagonal and block-diagonal parts of the Jacobian, respectively. Physically, Eq. (31a)describes perturbations of the statistical equilibrium of a given level, by modifying the population of this level, within one atmospheric layer. Equation (31b)describes the perturbations of the statistical equilibrium of a given level, by modifying the population of other levels, still within one atmospheric layer. Equations (31c)and (31d)correspond to off-diagonal blocks and describe nonlocal perturbations, i.e., the influence of the populations in another layer. Naturally, in these last differentiations the coupling is purely radiative, whereas the first and second differentiations include collisional processes.

To determine the terms $\partial 𝒥̅ \begin{matrix} l \\ ik \end{matrix} / \partial x_{k}^{m}$ $\hbox{$\partial \bar{\mathcal{J}}^l_{ik}/\partial x_k^m$}$ , one needs to differentiate an integral expression over the variable $x_{k}^{m}$ $\hbox{$x_k^m$}$ . This derivation must be considered a functional derivative, and be performed before discretization (see Appendix B for the demonstration), $\frac{δ 𝒥̅ ij [x_{k}] (τ_{v})}{δ x_{k} (s^{)}} = \lim_{ε \to 0} \frac{𝒥̅ ij [x_{k} (τ_{V}) + εδ (τ_{V} - s)] - 𝒥̅ ij [x_{k} (τ_{V})]}{ε},$ $\begin{equation} \frac{{\delta \bar{\mathcal{J}}_{ij} \left[x_k\right]\left( \tau _{\text{v}} \right)}} {{\delta x_k \left( s \right)}}=\lim\limits_{\varepsilon \to 0}\frac{\bar{\mathcal{J}}_{ij} \left[x_k(\tau_\mathrm{V}) + \varepsilon \delta(\tau_\mathrm{V} - s)\right]-\bar{\mathcal{J}}_{ij} \left[x_k(\tau_\mathrm{V})\right]}{\varepsilon} , \end{equation}$ (32)where δ(τ_V−s) is the Dirac distrbution.

In the case τ_V ≠ s (l ≠ m), the derivatives for each component are $\begin{matrix} \begin{matrix} \begin{matrix} \end{matrix} \frac{δ 𝒥̅ \begin{matrix} Line \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V})}{δ x_{i} (s)} = D_{ij} ζ (s) K (s, τ_{V}) + D_{ij} ζ (s) E_{ij} \begin{matrix} \end{matrix} \\ \begin{matrix} \end{matrix} \times {\begin{matrix} \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} \end{matrix} \frac{δ 𝒥̅ \begin{matrix} Line \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V})}{δ x_{j} (s)} = - D_{ij} ζ (s) E_{ij} \begin{matrix} \end{matrix} \\ \begin{matrix} \end{matrix} \times {\begin{matrix} \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} \end{matrix} \frac{δ 𝒥̅ \begin{matrix} Cont \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V})}{δ x_{i} (s)} = ζ (s) E_{ij} \begin{matrix} \end{matrix} \\ \begin{matrix} \end{matrix} \times {\begin{matrix} \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} \end{matrix} \frac{δ 𝒥̅ \begin{matrix} Cont \\ ij \end{matrix} [x_{i}, x_{j}] (τ_{V})}{δ x_{j} (s)} = - ζ (s) E_{ij} \begin{matrix} \end{matrix} \\ \begin{matrix} \end{matrix} \times {\begin{matrix} \end{matrix} \end{matrix} \end{matrix}$ $% subequation 2251 0 \begin{eqnarray} \label{eq:jac11} \begin{split} \frac{\delta \bar{\mathcal{J}}_{ij}^\mathrm{Line}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_i(s)} &= D_{ij} \zeta(s) K(s,\tau_\mathrm{V}) + D_{ij} \zeta(s) E_{ij} \\ &\times \begin{cases} \displaystyle \int_k^{\tau_\mathrm{V}^\mathrm{Out}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k > \tau_\mathrm{V}\\ \displaystyle \int^k_{\tau_\mathrm{V}^\mathrm{In}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k < \tau_\mathrm{V}, \end{cases} \end{split} \\ \label{eq:jac22} \begin{split} \frac{\delta \bar{\mathcal{J}}_{ij}^\mathrm{Line}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_j(s)} &= -D_{ij} \zeta(s) E_{ij} \\ &\times \begin{cases} \displaystyle \int_k^{\tau_\mathrm{V}^\mathrm{Out}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k > \tau_\mathrm{V}\\ \displaystyle \int^k_{\tau_\mathrm{V}^\mathrm{In}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k < \tau_\mathrm{V}, \end{cases} \end{split} \\ \label{eq:jac33} \begin{split} \frac{\delta \bar{\mathcal{J}}_{ij}^\mathrm{Cont}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_i(s)} &= \zeta(s) E_{ij} \\ &\times \begin{cases} \displaystyle \int_k^{\tau_\mathrm{V}^\mathrm{Out}}{\xi_{{ij}}(t)B_{{ij}}(t) \tilde{L}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k > \tau_\mathrm{V}\\ \displaystyle \int^k_{\tau_\mathrm{V}^\mathrm{In}}{\xi_{{ij}}(t)B_{{ij}}(t) \tilde{L}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k < \tau_\mathrm{V}, \end{cases} \end{split} \\ \label{eq:jac44} \begin{split} \frac{\delta \bar{\mathcal{J}}_{ij}^\mathrm{Cont}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_j(s)} &= -\zeta(s) E_{ij} \\ & \times \begin{cases} \displaystyle \int_k^{\tau_\mathrm{V}^\mathrm{Out}}{\xi_{{ij}}(t)B_{{ij}}(t) \tilde{L}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k > \tau_\mathrm{V}\\ \displaystyle \int^k_{\tau_\mathrm{V}^\mathrm{In}}{\xi_{{ij}}(t)B_{{ij}}(t) \tilde{L}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }k < \tau_\mathrm{V}. \end{cases} \end{split} \end{eqnarray}$ However $\hbox{$\bar{\mathcal{J}}$}$ is not differentiable at x(t) = x(τ_V) because of the singularity, therefore Eq. (33a)is undetermined for s = τ_V. For these local terms, we thus need an approximate expression of the radiation field. We emphasize that we only apply this approximation to the computation of the Jacobian matrix.

Fig. 5

Pattern of the Jacobian matrix for 4 model atmosphere layers and 200 energy levels.

3.4. Sparsity of Jacobian matrix

In Sect. 3.1, we argue that GMRES is a well-adapted method for large-scale problems if the Jacobian matrix is sparse, as it only requires matrix-vector products, and no matrix inversion. The sparsity (or density) coefficient is defined by the ratio of nonzero elements over the total number of elements. The Jacobian matrix is composed by P × P submatrix of N × N elements leading to NP × NP elements. The block diagonal of this matrix is full because of the irreducibility of the collisional coupling within a given layer (no selection rules for collisions). The number of nonzero elements in the block-diagonal part is P × N². In the P(P−1) off-diagonal submatrices, the nonzero elements are only due to radiative coupling between different layers, and their number is thus linked to the selection rules for allowed transitions. These rules thus lead to a sparsity of each submatrix with a density ρ_mol = N_transitions/N². The density of the complete Jacobian matrix is then given by $ρ = \frac{P \times N^{2} + P (P - 1) \times N_{transitions}}{N^{2} P^{2}} = \frac{1 + ρ_{mol} (P - 1)}{P} \cdot$ $\begin{equation} \rho=\frac{P\times N^2+P(P-1)\times N_\text{transitions}}{N^2P^2}=\frac{1+\rho_\text{mol}{(P-1)}}{P}\cdot \end{equation}$ (34)Then, as ρ_mol(N) decreases rapidly with N (e.g., ρ_H₂O ~ 10^-3, for N = 411, see Figs. 6 and 7) and ρ is inversely proportional to P, the larger the system, the sparser the Jacobian matrix. The GMRES scheme is thus well adapted and asymptotically approaches the NP complexity.

Fig. 6

Example of the evolution of the off-diagonal, block-matrix sparsity (ρ_mol) according to the number of considered levels. This example is computed for the 411 levels of the ortho-H₂O molecule.

Fig. 7

Evolution of the Jacobian matrix sparsity according to the number of atmospheric layers for different values of ρ_mol.

3.5. Jacobian block-diagonal approximation

Assuming that τ_T(τ_V)−τ_T(t) ∝ (t−τ_V) (first order linearization) when t is close to τ_V, because the kernel decreases when | t−τ_V | increases, we assume that every function at t is equal to its value at τ_V.

Splitting the integral over t at τ_V and inverting the integrals over t and μ, Gonzalez Garcia et al. (2008) show that the mean intensity can be rewritten as $\begin{matrix} 𝒥̅ ij (τ_{V}) & = & \frac{x_{i} (τ_{V})}{x_{j} (τ_{V}) - x_{i} (τ_{V})} (1 - β_{-} (τ_{V}) - β_{+} (τ_{V})) \\ 𝒥̅ ij (τ_{V}) & = & τ_{\max} I_{ν} β (τ_{V}), \end{matrix}$ $% subequation 2527 0 \begin{eqnarray} \bar{\mathcal{J}}_{ij}(\tau_\mathrm{V}) &=&\frac{x_i(\tau_\mathrm{V})}{x_j(\tau_\mathrm{V})-x_i(\tau_\mathrm{V})}\left(1-\beta_-(\tau_\mathrm{V})-\beta_+(\tau_\mathrm{V})\right) \\ \bar{\mathcal{J}}_{ij}(\tau_\mathrm{V}) & = &\tau_{\max}I_\nu\beta(\tau_\mathrm{V}), \end{eqnarray}$ where β₋(τ_V) and β₊(τ_V) are inward and outward escape probabilities (defined in Gonzalez Garcia et al. 2008). The β(τ_V) invoke the function E₂(x), which is not singular and differentiable everywhere in the domain. This enables an analytic estimate of the bloc diagonal terms. The pattern of the Jacobian is displayed in Fig. 5.

3.6. The numerical code MOrad

The above strategy has been implemented in a numerical code named MOrad written in Fortran 90. It takes as inputs:

1.
A model atmosphere (temperature, atomic, molecular, andelectronic densities etc. as a function of geometrical thickness).
2.
Spectroscopic data (level energies, statistical weights, radiative rates).
3.
Collisional rates.

Then it computes the continuous opacities and uses an initial guess to calculate the function (22)and its Jacobian matrix. Choosing the initial guess X₀ is an important operation. In MOrad we tested the use of an LTE solution (Boltzmann distribution), and a 0 K solution (the molecules in the ground level). We find that the convergence rate is better with the “0 K” initial guess.

The singular integrations needed to compute $\hbox{$\bar{\mathcal{J}}$}$ are performed with the periodization method. The Green’s function needed to compute $\hbox{$\bar{\mathcal{J}}$}$ is stored at each point. The required space scales as the number of layers times the number of frequency points (in a parallel code; otherwise it would scale with the square of the number of layers). The evaluation of the function and the Jacobian are parallelized with MPI. At first glance, two parallelization strategies may be considered: either a parallelization over frequencies or over atmosphere layers. However, van Noort et al. (2002) show that the most efficient parallelization to reduce communications is spatial parallelization because of the strong local coupling between energy levels. The populations of all energy levels within each atmosphere layer should thus be computed by the same processor, with several adjacent layers possibly treated by the same processor. In MOrad we adopt a subdomain decomposition where each processor computes the physical values needed in one given layer and solves the local statistical equilibrium. Integrations are performed with a scalable algorithm, which minimizes global communications to preserve a good scalability.

The code has been interfaced with a nonlinear solver of the parallel and scalable library PETSc (see, e.g., Balay et al. 2013b,a, 1997). This solver takes as input the distributed function vector and the Jacobian matrix and returns the root of the function after preconditioning.

The main advantages of this library are its performance and its scalability. Moreover, PETSc provides access to many different nonlinear methods and preconditioning, which allows us a versatile choice of the best method to be adapted for a given physical problem. Other efficient libraries, such as the parallel IO library HDF5, are interfaced with MOrad to preserve the performances for large-scale problems and the portability of output data. Future planned developments include an MPI/OpenMP hybridization (openMP parallelization over frequencies).

4. Discussion

4.1. Example of application

To illustrate the applicability of MOrad, we compute the non-LTE departure coefficients of water molecules in a MARCS (Gustafsson et al. 2008) red supergiant model atmosphere. The model considered here has an effective temperature of 3500 K, a log g = 0, a solar metallicity, and a microturbulence of 2 km s^-1. These parameters correspond to typical red supergiant stars. We consider more than 800 rovibrational levels, leading to more than 330 000 transitions and 15 000 lines (see the Grotrian diagram of ortho H₂O in Fig. 8). The energy levels and the radiative coefficients are taken from Barber et al. (2006). The collisional rates for H₂O–H₂ and H₂O–e⁻ are taken from Faure & Josselin (2008).

Fig. 8

Grotrian diagram of the ortho-H₂O molecule showing the radiative transitions taken into account. The parameter J_n is the rotational angular momentum of H₂O. Colors indicate different vibrational states.

The code MOrad was launched on 47 processors using a GMRES Newton-Krylov method and a line-search global convergence strategy. We conducted preconditioning with the AMS method (Block Additive Schwarz method), where each subblock is preconditioned with an approximate iterative LU factorization method. We obtain the root X^⋆ of the function (∥ F(X^⋆) ∥ _∞ ≤ 10^-12) after three nonlinear iterations and a total time of code execution of less than ~1 h on the Alarik LUNARC system (see Lunarc 2013). The departure coefficients are presented in Fig. 9. The convergence rate is Q-quadratic, though a quadratic convergence rate would be expected. This may be because of an over reconditioning or a problem that is too simple (close to linear), but we obtain the same Q-quadratic convergence for a purely radiative problem.

Fig. 9

LTE departure coefficients of ortho-H₂O in a red supergiant model atmosphere.

A detailed analysis of the results is out of the scope of this paper and is devoted to a forthcoming paper; a preliminary analysis was done in Lambert et al. (2013). In particular a detailed discussion of the use of the super-level approximation, which seems appropriate for the vibrationally excited states, will be presented. In short, the non-LTE calculations lead to stronger lines, especially rotational lines in the fundamental vibrational state around 2 μm, compared to LTE computation.

4.2. Performance and complementarity of the method

The method presented here is naturally accurate as it is equivalent to finding the root of a function and thus avoids any false convergence (assuming the solution is unique)³. Furthermore, we emphasize that because the integration is analytic, the method leads naturally to null residue. Concerning the performance of the method, the performances of the method are difficult to assess. The convergence rate depends upon the choice of both the solver and the preconditioning. These choices are not unique and may vary according to the considered problem (species and model atmosphere). The memory demands scales with the dimensionality of the considered problem.

The computation of the function and the Jacobian matrix leads to a scheme with an asymptotic complexity in $\hbox{$\mathcal{O}(Nb_\text{freq} \times (N \times P)^2)$}$ . The complexity of the resolution scheme is more difficult to evaluate because the preconditioned Newton-Krylov method has a cost in N_freq × N² × P² (N_freq being the number of frequency points sampling the line profile, N the number of energy levels, and P the number of layers in the discretized atmosphere), but invokes another term that evolves as the square of the iteration index. This term is partially eliminated by restarting GMRES and is not really important in practice because the number of iterations is small after appropriate preconditioning.

Furthermore, the performance is better than $\hbox{$\mathcal{O}(n^2)$}$ and probably close to $\hbox{$\mathcal{O}(n)$}$ . Indeed, for a sparse matrix, GMRES evolves in $\hbox{$\mathcal{O}(n)$}$ . Because the only nonzero elements of the off-diagonal blocks of the Jacobian matrix are due to radiative coupling between layers, the sparsity of J evolves as the ratio of the number of radiative transitions over N². Taking a complexity in Nb_freq × N² × P² is then an superior limit, which underevaluates the method scalability in real cases.

As a comparison, the computation of the equivalent function of F(X) in MULTI has a complexity evolving in N × P because of the Scharmer operator, which performs no numerical integration to evaluate diffusion integrals but only one quadrature point. The system is solved with a Newton-Raphson scheme, which has a cost in N³ × P² when using the full matrix or N³ × P when using the local operator (block-diagonal matrix; M. Carlsson, priv. comm.). The theoretical asymptotic speed-up for MOrad compared to MULTI are summarized in Table 2.

Table 2

Theoretical asymptotic speed-up for two extreme cases in terms of ρ = sparsity(J)(0 ≤ ρ ≤ 1) for MOrad, compared to MULTI with either full or local operator.

From Table 2, one can see that our method is complementary to existing methods, such as direct linearization. Indeed, in some cases, with a large number of levels and a small number of layers, the expected time of computation is shorter.

5. Conclusion

We have developed a new method to solve the RTE in non-LTE conditions. Rather than solving the RTE itself, we solve the statistical equilibrium equation system, which includes an analytical formal solution of the RTE. One of the main advantages of this choice is that the coefficients of the associated Jacobian matrix can be computed exactly. With this approach, a nonlinear solver can be used. In other words, this method avoids any linearization or stationary iterative schemes, which may converge very slowly or even falsely in some extreme cases (a very low non-LTE parameter ϵ and/or highly discretized media). We chose a method based on the GMRES nonlinear method. It has been implemented in a code that uses the PETSc library. We pay special attention to the proper treatment of the singularity that arises in the integration of the mean radiation field.

Furthermore, this method is parallelized using MPI, which makes it applicable to large-scale systems such as molecules and atoms in stellar atmospheres or interstellar medium, as it has an excellent scalability with the number of energy levels. Indeed, our code has been successfully applied to the modeling of water in the atmospheres of red supergiants. The results will be presented in a forthcoming paper. Further developments of the code include hybrid parallelization, with openMP parallelization over frequencies. We then plan to make the code public.

¹

We already emphasize here that solving either the RTE explicitly or the statistical equilibrium equation is strictly identical, and both equations are of similar mathematical nature, as shown, for instance, in the Appendix of Hauschildt et al. (1995).

²

At steady-state. Chemical formation and destruction terms are not included.

³

The solution of the Eddington problem (the two-level atom in the isothermal optically thick medium) is recovered at the 1st iteration with preconditioning, and after five iterations without preconditioning (relative error <0.005 for ϵ = 10^-5 and τ_max = 1000). This is expected because the inversion of the matrix by the Krylov subspace method is well known to work in this kind of situation.

Acknowledgments

We are grateful to Pascal Azerad for fruitful discussions and Karin Ryde for proofreading the English in the manuscript. We acknowledge financial support from the Swedish Karl Tryggers Foundation under grants CTS 12:408 and CTS 13:388 and its investment in science and astrophysics. Moreover, we thank the French National Agency for Research (ANR) through program number ANR-06-BLAN-0105, and from “Programme National de Physique Stellaire” (PNPS) of CNRS/INSU, France. N.R. is a Royal Swedish Academy of Sciences Research Fellow supported by a grant from the Knut and Alice Wallenberg Foundation. Funds from Kungl. Fysiografiska Sällskapet i Lund and support from the Swedish Research Council, VR are gratefully acknowledged.

References

Ahues, M., D’Almeida, F., Largillier, A., Titaud, O., & Vasconcelos, P. 2002, J. Comput. Appl. Math., 140, 13 [NASA ADS] [CrossRef] [Google Scholar]
Auer, L. 1991, in NATO ASIC Proc. 341: Stellar Atmospheres − Beyond Classical Models, eds. L. Crivellari, I. Hubeny, & D. G. Hummer, 9 [Google Scholar]
Balay, S., Gropp, W. D., McInnes, L. C., & Smith, B. F. 1997, in Modern Software Tools in Scientific Computing, eds. E. Arge, A. M. Bruaset, & H. P. Langtangen (Birkhäuser Press), 163 [Google Scholar]
Balay, S., Brown, J., Buschelman, K., et al. 2013a, PETSc Users Manual, Tech. Rep. ANL-95/11 − Revision 3.4 (Argonne National Laboratory) [Google Scholar]
Balay, S., Brown, J., Buschelman, K., et al. 2013b, PETSc Web page, http://www.mcs.anl.gov/petsc [Google Scholar]
Barber, R. J., Tennyson, J., Harris, G. J., & Tolchenov, R. N. 2006, MNRAS, 368, 1087 [NASA ADS] [CrossRef] [Google Scholar]
Cannon, C. J. 1973, J. Quant. Spec. Radiat. Transf., 13, 1011 [NASA ADS] [CrossRef] [Google Scholar]
Castor, J. I. 1970, MNRAS, 149, 111 [NASA ADS] [Google Scholar]
Damgaard, P. H., Hjorth, P. G., & Thejll, P. A. 1992, A&A, 254, 422 [NASA ADS] [Google Scholar]
Dickel, H. R., & Auer, L. H. 1994, ApJ, 437, 222 [NASA ADS] [CrossRef] [Google Scholar]
Fabiani Bendicho, P., Trujillo Bueno, J., & Auer, L. 1997, A&A, 324, 161 [Google Scholar]
Faure, A., & Josselin, E. 2008, A&A, 492, 257 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gonzalez Garcia, M., Le Bourlot, J., Le Petit, F., & Roueff, E. 2008, A&A, 485, 127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hauschildt, P. H., Starrfield, S., Shore, S. N., Allard, F., & Baron, E. 1995, ApJ, 447, 829 [NASA ADS] [CrossRef] [Google Scholar]
Helluy, P., Maire, S., & Ravel, P. 1998, Comptes Rendus de l’Académie des Sciences Paris, Série Sciences Mathématiques, 327, 843 [Google Scholar]
Hubeny, I., & Mihalas, D. 2014, Theory of Stellar Atmospheres (Princeton University Press) [Google Scholar]
Juvela, M. 2005, A&A, 440, 531 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Klein, R. I., Castor, J. I., Dykema, P. G., Greenbaum, A., & Taylor, D. 1989, J. Quant. Spec. Radiat. Transf., 41, 199 [NASA ADS] [CrossRef] [Google Scholar]
Lambert, J., Josselin, E., Ryde, N., & Faure, A. 2013, in EAS Pub. Ser. 60, eds. P. Kervella, T. Le Bertre, & G. Perrin, 111 [Google Scholar]
Lunarc 2013, Alarik system details, http://www.lunarc.lu.se/Systems/AlarikDetails [Google Scholar]
Ng, K.-C. 1974, J. Chem. Phys., 61, 2680 [NASA ADS] [CrossRef] [Google Scholar]
Olson, G. L., Auer, L. H., & Buchler, J. R. 1986, J. Quant. Spec. Radiat. Transf., 35, 431 [NASA ADS] [CrossRef] [Google Scholar]
Paletou, F., & Anterrieu, E. 2009, A&A, 507, 1815 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Rybicki, G. B. 1997, ApJ, 479, 357 [NASA ADS] [CrossRef] [Google Scholar]
Scharmer, G. B., & Carlsson, M. 1985, J. Comput. Phys., 59, 56 [NASA ADS] [CrossRef] [Google Scholar]
Socas-Navarro, H., & Trujillo Bueno, J. 1997, ApJ, 490, 383 [NASA ADS] [CrossRef] [Google Scholar]
Trujillo Bueno, J., & Fabiani Bendicho, P. 1995, ApJ, 455, 646 [NASA ADS] [CrossRef] [Google Scholar]
Štěpán, J., & Trujillo Bueno, J. 2013, A&A, 557, A143 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
van Noort, M., Hubeny, I., & Lanz, T. 2002, ApJ, 568, 1066 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Formulation

One starts with the classical formulation of the radiative transfer equation in a plane parallel geometry, $μ \frac{d I_{ν}}{d s} = - (χ_{ν}^{Cont} + {χ_{ν}^{Line}}^{)} I_{ν} + η_{ν}^{Cont} + η_{ν}^{Line} .$ $\appendix \setcounter{section}{1} \begin{equation} \mu \frac{{\rm d}I_\nu}{{\rm d}s}=-\left(\chi_\nu^\text{Cont}+\chi_\nu^\text{Line}\right)I_\nu+ \eta_\nu^\text{Cont}+\eta_\nu^\text{Line} . \end{equation}$ (A.1)The monochromatic extinction and emissivity coefficients are expressed with an explicit dependence on levels populations as follows: $\begin{matrix} χ_{ν}^{Line} & = & φ_{ν}^{ul} χ_{ul} = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} (B_{lu} n_{l} - B_{ul} n_{u}) \\ η_{ν}^{Line} & = & φ_{ν}^{ul} η_{ul} = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} A_{ul} n_{u .} \end{matrix}$ $\appendix \setcounter{section}{1} % subequation 3008 0 \begin{eqnarray} \label{eq:def1a} \chi_\nu^\mathrm{Line} & = &\phi^{ul}_\nu \chi_{ul} = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} (B_{lu}n_{l}-B_{ul}n_{u})\\ \label{eq:def1b} \eta_\nu^\mathrm{Line} & = &\phi^{ul}_\nu \eta_{ul} = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi}A_{ul}n_{u.} \end{eqnarray}$ By inserting relative populations f_i = n_i/n^k ,where n^k is the total density of the species labeled by k, one gets $\begin{matrix} χ_{ν}^{Line} & = & φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} (B_{lu} f_{l} - B_{ul} f_{u}) \\ η_{ν}^{Line} & = & φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} A_{ul} f_{u .} \end{matrix}$ $\appendix \setcounter{section}{1} % subequation 3030 0 \begin{eqnarray} \label{eq:def1a} \chi_\nu^\mathrm{Line} & =& \phi^{ul}_\nu \frac{h{\nu_{ul }}}{4\pi} n^{\rm{k}}(B_{lu}f_{l}-B_{ul}f_{u})\\ \label{eq:def1b} \eta_\nu^\mathrm{Line} & =& \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}} A_{ul}f_{u.} \end{eqnarray}$ One brings up the new variable x_i = f_i/g_i, and using the relation B_ulg_u = B_lug_l, where g_i is the statistical weight of the level i, yields $\begin{matrix} χ_{ν}^{Line} & = & φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} (B_{lu} \frac{f_{l}}{g_{u}} - B_{ul} \frac{f_{u}}{g_{u}}) = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} B_{ul} (x_{l} - x_{u}) \\ η_{ν}^{Line} & = & φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} A_{ul} \frac{f_{u}}{g_{u}} = φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} A_{ul} x_{u .} \end{matrix}$ $\appendix \setcounter{section}{1} % subequation 3055 0 \begin{eqnarray} \label{eq:def1a} \chi_\nu^\mathrm{Line} & = &\phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}}g_{u}\left(B_{lu}\frac{f_{l}}{g_{u}}-B_{ul}\frac{f_{u}}{g_{u}}\right) = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}}g_{u}B_{ul}(x_{l}-x_{u}) \\ \label{eq:def1b} \eta_\nu^\mathrm{Line} & = & \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}} g_{u} A_{ul}\frac{f_{u}}{g_{u}} = \phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}} g_{u} A_{ul}x_{u.}~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \end{eqnarray}$ Including the formulation of $χ_{ν}^{Line}$ $\hbox{$\chi_\nu^\text{Line}$}$ and $η_{ν}^{Line}$ $\hbox{$\eta_\nu^\text{Line}$}$ in the radiative transfer equation, one obtains $\begin{matrix} μ \frac{d I_{ν}}{d s} & = & - (χ_{ν}^{Cont} + φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} B_{ul} (x_{l} - x_{u})) I_{ν} \\ + η_{ν}^{Cont} + φ_{ν}^{ul} \frac{h ν_{ul}}{4 π} n^{k} g_{u} A_{ul} x_{u} . \end{matrix}$ $\appendix \setcounter{section}{1} \begin{eqnarray} \mu \frac{{\rm d}I_\nu}{{\rm d}s} &=&-\left(\chi_\nu^\text{Cont}+\phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}}g_{u}B_{ul}(x_{l}-x_{u})\right)I_\nu \notag\\&& +\ \eta_\nu^\text{Cont}+\phi^{ul}_\nu \frac{h{\nu_{ul}}}{4\pi} n^{\rm{k}} g_{u} A_{ul}x_{u} . \end{eqnarray}$ (A.5)In order to create a transition independent scale, the previous equation is divided by an arbitrarily chosen continuum extinction coefficient χ_V, here χ_V ≡ χ_{ν = 500 nm}. Moreover we express it per hydrogen atom to bring out the ratio n^k/n^H for numerical reasons. This normalization is symbolized by a tilde (~) for other variables, $\begin{matrix} μ \frac{d I_{ν}}{χ_{V} d s} & = & - (\frac{{χ_{ν}^{Cont}}_{˜}}{\begin{matrix} ˜ \\ χ_{V} \end{matrix}} \frac{n^{H}}{n^{H}} + \frac{h ν_{ul}}{4 π} \frac{n^{k}}{n^{H}} \frac{1}{\begin{matrix} ˜ \\ χ_{V} \end{matrix}} B_{ul} g_{u} (x_{l} - x_{u}) φ_{ν}^{ul}) I_{ν} \\ + \frac{h ν_{ul}}{4 π} \frac{n^{k}}{n^{H}} \frac{1}{\begin{matrix} ˜ \\ χ_{V} \end{matrix}} A_{ul} g_{u} x_{u} φ_{ν}^{ul} + \frac{η_{ν}^{Cont}}{n^{H} \begin{matrix} ˜ \\ χ_{V} \end{matrix}} \frac{n^{H} {χ_{ν}^{Cont}}_{˜}}{χ_{ν}^{Cont}} \cdot \end{matrix}$ $\appendix \setcounter{section}{1} \begin{eqnarray} \mu \frac{{\rm d}I_\nu}{ \chi_{V} {\rm d}s}&=&- \left(\frac{\tilde{\chi}_\nu^\text{Cont}}{\tilde{\chi}_{V}}\frac{n^{\rm H}}{n^{\rm H}}+ \frac{h\nu_{ul}}{4\pi}\frac{n^{\rm{k}}}{n^{\rm H}} \frac{1}{\tilde{\chi}_{V}} B_{ul}g_u \left(x_l-x_u\right)\phi^{ul}_\nu \right) I_\nu \\\notag&&+ \frac{h\nu_{ul}}{4\pi} \frac{n^{\rm{k}}}{n^{\rm H}} \frac{1}{\tilde{\chi}_{V}} A_{ul}g_ux_u \phi^{ul}_\nu + \frac{\eta_\nu^\text{Cont}}{n^{\rm H} \tilde{\chi}_{V}} \frac{n^{\rm H} \tilde{\chi}_\nu^\text{Cont}}{\chi_\nu^\text{Cont}} \cdot \end{eqnarray}$ (A.6)With $ξ_{ν} = \frac{\begin{matrix} _{˜} \\ χ_{ν} \end{matrix}}{\begin{matrix} _{˜} \\ χ_{V} \end{matrix}}$ $\hbox{$\xi_\nu=\frac{\tilde{\chi}_\nu}{\tilde{\chi}_{V}}$}$ , $E_{ij} = \frac{h ν_{ij}}{4 π} B_{ij} g_{i}$ $\hbox{$E_{ij}=\frac{h\nu_{ij}}{4\pi}B_{ij}g_i$}$ , $ζ = \frac{n^{k}}{n^{H}} \frac{1}{\begin{matrix} _{˜} \\ χ_{V} \end{matrix}}$ $\hbox{$\zeta=\frac{n^{\rm{k}}}{n^{\rm H}} \frac{1}{\tilde{\chi}_{V}}$}$ , $D_{ij} = \frac{h ν_{ij}}{4 π} A_{ij} g_{i}$ $\hbox{$D_{ij}=\frac{h\nu_{ij}}{4\pi}A_{ij}g_i$}$ , $S_{ν} = \frac{η_{ν}^{Cont}}{χ_{ν}^{Cont}}$ $\hbox{$S_\nu=\frac{\eta_\nu^\text{Cont}}{\chi_\nu^\text{Cont}}$}$ , and dτ_V = −χ_Vds. Assuming that the continuum is formed in LTE ( $S_{ν}^{Cont} = B_{ν}$ $\hbox{$S_\nu^\text{Cont}=B_\nu$}$ ), one obtains $μ \frac{d I_{ν}}{d τ_{V}} = [ξ_{ν} + E_{ul} ζ φ_{ν}^{ul} (x_{l} - x_{u})^{]} I_{ν} - D_{ul} x_{u} ζ φ_{ν}^{ul} - ξ_{ν} B_{ν} .$ $\appendix \setcounter{section}{1} \begin{equation} \mu\frac{{\rm d}I_\nu}{{\rm d}\tau_{V}}=\left[\xi_\nu + E_{ul}\zeta\phi^{ul}_\nu(x_l-x_u)\right]I_\nu - D_{ul}x_u\zeta\phi^{ul}_\nu-\xi_\nu B_\nu . \end{equation}$ (A.7)

Appendix B: Jacobian matrix coefficients

We start with the analytic formulation of the mean intensity field for the line component (similar demonstration for contunium), i.e., $\bar{𝒥_{ij}} [x_{i}, x_{j}] (τ_{V}) = D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) x_{i} (t) K [x_{i}, x_{j}] (t, τ_{V}) d t .$ $\appendix \setcounter{section}{2} \begin{equation} \bar{\mathcal{J}_{ij}}[x_i,x_j](\tau_\mathrm{V}) = D_{ij}\int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} { \zeta(t) x_i(t) K[x_i,x_j](t,\tau_\mathrm{V}) {\rm d}t } . \end{equation}$ (B.1)Because the functional derivative of a functional product is $\frac{δ (F [y] G [y])}{δy (x)} = G [y] \frac{δF [y]}{δy (x)} + F [y] \frac{δG [y]}{δy (x)},$ $\appendix \setcounter{section}{2} \begin{equation} \frac{\delta (F[y]G[y]) }{\delta y(x)} = G[y]\frac{\delta F[y] }{\delta y(x)} + F[y]\frac{\delta G[y] }{\delta y(x)} , \end{equation}$ (B.2)we functionally derive the product x_i(t)K [x_i,x_j] over x_i(τ_V = s), which yields, $\begin{matrix} \frac{δ \bar{𝒥_{ij}} [x_{i}, x_{j}] (τ_{V})}{δ x_{i} (s)} & = & D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) \frac{δ x_{i} (t)}{δ x_{i} (s)} K [x_{i}, x_{j}] (t, τ_{V}) d t \\ + D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) x_{i} (t) \frac{δK [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} d t . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{eq:DJsplit} \frac{\delta \bar{\mathcal{J}_{ij}}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_i(s)} & = &D_{ij} \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\zeta(t) \frac{\delta x_i(t)}{\delta x_i(s)} K[x_i,x_j](t,\tau_\mathrm{V}) {\rm d}t} \notag\\&& + D_{ij} \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\zeta(t) x_i(t) \frac{\delta K[x_i,x_j](t,\tau_\mathrm{V})} {\delta x_i(s)} {\rm d}t} . \end{eqnarray}$ (B.3)Equation (B.3) can be rewritten as a sum of two integrals over t, I₁ and I₂.

The first term I₁ simplifies $\begin{matrix} I_{1} & = & D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) \frac{δ x_{i} (t)}{δ x_{i} (s)} K [x_{i}, x_{j}] (t, τ_{V}) d t \\ = & D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) δ (t - s) K [x_{i}, x_{j}] (t, τ_{V}) d t \\ = & D_{ij} ζ (s) K (s, τ_{V}) . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} I_1 & =& D_{ij} \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\zeta(t) \frac{\delta x_i(t)}{\delta x_i(s)} K[x_i,x_j](t,\tau_\mathrm{V}) {\rm d}t} \notag\\& =& D_{ij} \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\zeta(t) \delta{(t - s)} K[x_i,x_j](t,\tau_\mathrm{V}) {\rm d}t} \notag\\ & =& D_{ij} \zeta(s) K(s,\tau_\mathrm{V}) . \end{eqnarray}$ (B.4)This term is an indefinite form if s = τ_V, because the kernel K(x,y) is singular at x = y.

We consider here the case s ≠ τ_V (the case s = τ_V is detailed in Sect. 3.4).

The second term I₂ requires the computation of the functional derivative of the kernel, which is $K [x_{i}, x_{j}] (t, τ_{V}) = \frac{1}{2} \int_{0}^{\infty} φ_{ν} (τ_{V}) φ_{ν} (t) E_{1} (Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})^{)} d ν .$ $\appendix \setcounter{section}{2} \begin{equation} K[x_i,x_j](t,\tau_\mathrm{V}) = \frac{1}{2} \int_0^\infty {\phi_\nu(\tau_\mathrm{V}) \phi_\nu(t) E_1\left(\Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V}) \right) {\rm d}\nu} . \end{equation}$ (B.5)With the chain rule theorem of functional derivatives, we get $\frac{δf (F [y])}{δy (x)} = \frac{d f (F [y])}{d F [y]} \frac{δF [y]}{δy (x)} \cdot$ $\appendix \setcounter{section}{2} \begin{equation} \frac{\delta f(F[y]) }{\delta y(x)} = \frac{{\rm d} f(F[y]) }{{\rm d} F[y] } \frac{\delta F[y]}{\delta y(x)}\cdot \end{equation}$ (B.6)We then derive the kernel K, finding $\begin{matrix} \frac{δK [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} & = & \frac{1}{2} \int_{0}^{\infty} φ_{ν} (τ_{V}) φ_{ν} (t) \\ \times \frac{δ E_{1} (Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})^{)}}{δ x_{i} (s)} d ν \\ = & - \frac{1}{2} \int_{0}^{\infty} φ_{ν} (τ_{V}) φ_{ν} (t) \times E_{0} (Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})^{)} \\ \times \frac{δ Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} d ν . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{eq:Dkern} \frac{\delta K[x_i,x_j](t,\tau_\mathrm{V})}{\delta x_i(s)} & =& \frac{1}{2} \int_0^\infty \phi_\nu(\tau_\mathrm{V}) \phi_\nu(t)\notag\\&&\times \frac{ \delta E_1\left(\Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V}) \right)} {\delta x_i(s) } {\rm d}\nu \notag\\ & =& -\frac{1}{2} \int_0^\infty \phi_\nu(\tau_\mathrm{V}) \phi_\nu(t)\!\times \!E_0\left(\Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V}) \right) \notag\\&&\times\frac{ \delta \Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V})} {\delta x_i(s) }{\rm d}\nu . \end{eqnarray}$ (B.7)To simplify the calculus, we rewrite $Δ τ_{ν}^{Tot}$ $\hbox{$\Delta \tau^\mathrm{Tot}_\nu$}$ as $\begin{matrix} Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V}) & = & | \int_{t}^{τ_{V}} ξ_{ν} (s^{'}) + E_{ij} ζ (s^{'}) φ_{ν} (s^{'}) [x_{j} (s^{'}) - x_{i} (s^{'})^{]} d s^{'} | \\ = & \int \begin{matrix} τ_{V}^{Out} \\ τ_{V}^{In} \end{matrix} Θ (t, τ_{V}, s^{'}) \\ \times [ξ_{ν} (s^{'}) + E_{ij} ζ (s^{'}) φ_{ν} (s^{'}) {[x_{j} (s^{'}) - x_{i} (s^{'})^{]}}^{]} d s^{'}, \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \label{eq:Deltatau} \Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V}) & =& \left| \int_t^{\tau_\mathrm{V}}\! {\xi_\nu(s')\! +\! E_{ij}\zeta(s')\phi_\nu(s') \left[x_j(s')\!-\!x_i(s')\right] {\rm d}s'}\right|\notag \\ &=& \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} \Theta(t,\tau_\mathrm{V},s')\notag\\&&\times\left[\xi_\nu(s') + E_{ij}\zeta(s')\phi_\nu(s') \left[x_j(s')-x_i(s')\right]\right]{\rm d}s', \end{eqnarray}$ (B.8)where the distribution Θ(t,τ_V,s) = −2(θ(t−τ_V)−0.5)(θ(s−t)−θ(s−τ_V)), and θ(x) is the Heaviside function. This distribution can be rewritten, $Θ ({t, τ_{V},s}^{)} = {\begin{matrix} if s < τ_{V} \\ if s > τ_{V} . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{equation} \Theta \left( {t,\tau _V,s } \right) = \begin{cases} \Pi_{a,s}(t) & \text{if } s<\tau_\mathrm{V} \\ \Pi_{s,b}(t) & \text{if } s>\tau_\mathrm{V}. \end{cases} \end{equation}$ (B.9)We functionally derive Eq. (B.8), yielding $\begin{matrix} \frac{δ Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} & = & - \int_{τ_{V}^{In}}^{τ_{V}^{Out}} Θ (t, τ_{V}, s^{'}) E_{ij} ζ (s^{'}) φ_{ν} (s^{'}) \frac{δ x_{i} (s^{'})}{δ x_{i} (s)} d s^{'} \\ = & - \int_{τ_{V}^{In}}^{τ_{V}^{Out}} Θ (t, τ_{V}, s^{'}) E_{ij} ζ (s^{'}) φ_{ν} (s^{'}) δ (s - s^{'}) d s^{'} \\ = & - E_{ij} ζ (s) Θ (t, τ_{V},s) φ_{ν} (s) . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \frac{ \delta \Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V})} {\delta x_i(s) } \!& =& \!- \!\!\int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\Theta(t,\tau_\mathrm{V},s')E_{ij} \zeta(s') \phi_\nu(s') \frac{\delta x_i(s')}{\delta x_i(s) }{\rm d}s'}\notag\\ \!& =&\! \! - \! \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}}\! {\Theta(t,\tau_\mathrm{V},s')E_{ij} \zeta(s') \phi_\nu(s') \delta(s\! -\! s') {\rm d}s'} \notag\\ & = &- E_{ij} \zeta(s) \Theta(t,\tau_\mathrm{V},s) \phi_\nu(s). \end{eqnarray}$ (B.10)We insert this expression in Eq. (B.7) and we define the modified kernel $\begin{matrix} _{˜} \\ K_{0} \end{matrix} (t, τ_{V},s)$ $\hbox{$\tilde{K}_0(t,\tau_\mathrm{V},s)$}$ , $\begin{matrix} \frac{δK [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} & = & E_{ij} ζ (s) Θ (t, τ_{V},s) \frac{1}{2} \int_{0}^{\infty} φ_{ν} (τ_{V}) φ_{ν} (t) φ_{ν} (s) \\ \times E_{0} (Δ τ_{ν}^{Tot} [x_{i}, x_{j}] (t, τ_{V})^{)} d ν \\ = & E_{ij} ζ (s) Θ (t, τ_{V},s) \begin{matrix} ˜ \\ K_{0} \end{matrix} [x_{i}, x_{j}] (t, τ_{V},s) . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} \frac{\delta K[x_i,x_j](t,\tau_\mathrm{V})}{\delta x_i(s)} & =& E_{ij} \zeta(s) \Theta(t,\tau_\mathrm{V},s) \frac{1}{2} \int_0^\infty \phi_\nu(\tau_\mathrm{V}) \phi_\nu(t) \phi_\nu(s)\notag\\&&\times E_0\left(\Delta \tau^\mathrm{Tot}_\nu [x_i,x_j](t,\tau_\mathrm{V}) \right) {\rm d}\nu \notag\\ & =& E_{ij} \zeta(s) \Theta(t,\tau_\mathrm{V},s) \tilde{K}_0[x_i,x_j](t,\tau_\mathrm{V},s). \end{eqnarray}$ (B.11)The integral I₂ from Eq. (B.3) can be rewritten as $\begin{matrix} I_{2} & = & D_{ij} \int_{τ_{V}^{In}}^{τ_{V}^{Out}} ζ (t) x_{i} (t) \frac{δK [x_{i}, x_{j}] (t, τ_{V})}{δ x_{i} (s)} d t \\ = & D_{ij} E_{ij} ζ (s) \int_{τ_{V}^{In}}^{τ_{V}^{Out}} Θ (t, τ_{V},s) ζ (t) x_{i} (t) \begin{matrix} ˜ \\ K_{0} \end{matrix} [x_{i}, x_{j}] (t, τ_{V},s) d t, \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} I_2 & =& D_{ij} \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\zeta(t) x_i(t) \frac{\delta K[x_i,x_j](t,\tau_\mathrm{V})} {\delta x_i(s)} {\rm d}t}\nonumber \\ & =& D_{ij} E_{ij} \zeta(s) \int_{\tau_\mathrm{V}^\mathrm{In}}^{\tau_\mathrm{V}^\mathrm{Out}} {\Theta(t,\tau_\mathrm{V},s) \zeta(t) x_i(t) \tilde{K}_0[x_i,x_j](t,\tau_\mathrm{V},s) {\rm d}t} , \end{eqnarray}$ (B.12)which, according to the Θ properties, becomes $I_{2} = D_{ij} E_{ij} ζ (s) \times {\begin{matrix} if s > τ_{V} \\ if s < τ_{V} . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{equation} I_2= D_{ij} E_{ij} \zeta(s) \times \begin{cases} \displaystyle \int_s^{\tau_\mathrm{V}^\mathrm{Out}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \hspace*{-2mm}\text{if }s > \tau_\mathrm{V}\\ \displaystyle \int^s_{\tau_\mathrm{V}^\mathrm{In}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \hspace*{-2mm}\text{if }s < \tau_\mathrm{V}. \end{cases} \end{equation}$ (B.13)Finally, we get the derivative of the mean radiation field, $\begin{matrix} \frac{δ \bar{𝒥_{ij}} [x_{i}, x_{j}] (τ_{V})}{δ x_{i} (s)} = D_{ij} ζ (s) \\ (K (s, τ_{V}) + E_{ij} \times {\begin{matrix} \end{matrix}) . \end{matrix}$ $\appendix \setcounter{section}{2} \begin{eqnarray} && \frac{\delta \bar{\mathcal{J}_{ij}}[x_i,x_j](\tau_\mathrm{V}) } {\delta x_i(s)} = D_{ij} \zeta(s)\notag\\&& \left(K(s,\tau_\mathrm{V}) + E_{ij} \times \begin{cases} \displaystyle \int_s^{\tau_\mathrm{V}^\mathrm{Out}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }s > \tau_\mathrm{V}\\ \displaystyle \int^s_{\tau_\mathrm{V}^\mathrm{In}}{\zeta(t) x_i(t) \tilde{K}_0(t,\tau_\mathrm{V},s){\rm d}t} & \text{if }s < \tau_\mathrm{V} \end{cases} \right). \end{eqnarray}$ (B.14)

Appendix C: Definition of the variables

Table C.1

Physical quantities.

Table C.2

Functions, operators, and distributions.

All Tables

Table 1

Benchmark of the periodization method for a logarithmic singularity (from Helluy et al. 1998).

In the text

Table 2

Theoretical asymptotic speed-up for two extreme cases in terms of ρ = sparsity(J)(0 ≤ ρ ≤ 1) for MOrad, compared to MULTI with either full or local operator.

Physical quantities.

Functions, operators, and distributions.

In the text

All Figures

	Fig. 1 Spectral radius of amplification matrix for the Λ iteration method. Convergence is shown for two spectral radii, marked with letters A (upper panel) and B (lower panel) in the right-hand panels. The thin black lines correspond to successive iterations; the true solution is given by the orange line.
In the text

	Fig. 2 Same as Fig. 1, except for ALI with Λ^⋆ = diagonal of Λ.
In the text

	Fig. 3 Map of the integrand of $\hbox{$\bar{\mathcal{J}}_{ij}$}$ for a radiative transition (see Eq. (12c)). The diagonal is singular, and the quadrature in the periodization method produces a mesh refinement around it. The color encoding of the integrant is in logarithmic arbitrary units.
In the text

	Fig. 4 Benchmarking of the periodization method applied to the integration with a logarithmically singular kernel (g(t) = a₀ + a₁t, K(x,t) = −γ−ln( \| G(t)−G(x) \|) and $G (x) = {^{\int}}_{0}^{t} g (s) d s$ $\hbox{$G(x)=\int_0^t{g(s){\rm d}s}$}$ ). The integration domain has been split into two subdomains (see text for details).
In the text

	Fig. 5 Pattern of the Jacobian matrix for 4 model atmosphere layers and 200 energy levels.
In the text

	Fig. 6 Example of the evolution of the off-diagonal, block-matrix sparsity (ρ_mol) according to the number of considered levels. This example is computed for the 411 levels of the ortho-H₂O molecule.
In the text

	Fig. 7 Evolution of the Jacobian matrix sparsity according to the number of atmospheric layers for different values of ρ_mol.
In the text

	Fig. 8 Grotrian diagram of the ortho-H₂O molecule showing the radiative transitions taken into account. The parameter J_n is the rotational angular momentum of H₂O. Colors indicate different vibrational states.
In the text

	Fig. 9 LTE departure coefficients of ortho-H₂O in a red supergiant model atmosphere.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Ahues, M., D’Almeida, F., Largillier, A., Titaud, O., & Vasconcelos, P. 2002, J. Comput. Appl. Math., 140, 13 [NASA ADS] [CrossRef] [Google Scholar]

[2] Auer, L. 1991, in NATO ASIC Proc. 341: Stellar Atmospheres − Beyond Classical Models, eds. L. Crivellari, I. Hubeny, & D. G. Hummer, 9 [Google Scholar]

[3] Balay, S., Gropp, W. D., McInnes, L. C., & Smith, B. F. 1997, in Modern Software Tools in Scientific Computing, eds. E. Arge, A. M. Bruaset, & H. P. Langtangen (Birkhäuser Press), 163 [Google Scholar]

[4] Balay, S., Brown, J., Buschelman, K., et al. 2013a, PETSc Users Manual, Tech. Rep. ANL-95/11 − Revision 3.4 (Argonne National Laboratory) [Google Scholar]

[5] Balay, S., Brown, J., Buschelman, K., et al. 2013b, PETSc Web page, http://www.mcs.anl.gov/petsc [Google Scholar]

[6] Barber, R. J., Tennyson, J., Harris, G. J., & Tolchenov, R. N. 2006, MNRAS, 368, 1087 [NASA ADS] [CrossRef] [Google Scholar]

[7] Cannon, C. J. 1973, J. Quant. Spec. Radiat. Transf., 13, 1011 [NASA ADS] [CrossRef] [Google Scholar]

[8] Castor, J. I. 1970, MNRAS, 149, 111 [NASA ADS] [Google Scholar]

[9] Damgaard, P. H., Hjorth, P. G., & Thejll, P. A. 1992, A&A, 254, 422 [NASA ADS] [Google Scholar]

[10] Dickel, H. R., & Auer, L. H. 1994, ApJ, 437, 222 [NASA ADS] [CrossRef] [Google Scholar]

[11] Fabiani Bendicho, P., Trujillo Bueno, J., & Auer, L. 1997, A&A, 324, 161 [Google Scholar]

[12] Faure, A., & Josselin, E. 2008, A&A, 492, 257 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[13] Gonzalez Garcia, M., Le Bourlot, J., Le Petit, F., & Roueff, E. 2008, A&A, 485, 127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[14] Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[15] Hauschildt, P. H., Starrfield, S., Shore, S. N., Allard, F., & Baron, E. 1995, ApJ, 447, 829 [NASA ADS] [CrossRef] [Google Scholar]

[16] Helluy, P., Maire, S., & Ravel, P. 1998, Comptes Rendus de l’Académie des Sciences Paris, Série Sciences Mathématiques, 327, 843 [Google Scholar]

[17] Hubeny, I., & Mihalas, D. 2014, Theory of Stellar Atmospheres (Princeton University Press) [Google Scholar]

[18] Juvela, M. 2005, A&A, 440, 531 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[19] Klein, R. I., Castor, J. I., Dykema, P. G., Greenbaum, A., & Taylor, D. 1989, J. Quant. Spec. Radiat. Transf., 41, 199 [NASA ADS] [CrossRef] [Google Scholar]

[20] Lambert, J., Josselin, E., Ryde, N., & Faure, A. 2013, in EAS Pub. Ser. 60, eds. P. Kervella, T. Le Bertre, & G. Perrin, 111 [Google Scholar]

[21] Lunarc 2013, Alarik system details, http://www.lunarc.lu.se/Systems/AlarikDetails [Google Scholar]

[22] Ng, K.-C. 1974, J. Chem. Phys., 61, 2680 [NASA ADS] [CrossRef] [Google Scholar]

[23] Olson, G. L., Auer, L. H., & Buchler, J. R. 1986, J. Quant. Spec. Radiat. Transf., 35, 431 [NASA ADS] [CrossRef] [Google Scholar]

[24] Paletou, F., & Anterrieu, E. 2009, A&A, 507, 1815 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[25] Rybicki, G. B. 1997, ApJ, 479, 357 [NASA ADS] [CrossRef] [Google Scholar]

[26] Scharmer, G. B., & Carlsson, M. 1985, J. Comput. Phys., 59, 56 [NASA ADS] [CrossRef] [Google Scholar]

[27] Socas-Navarro, H., & Trujillo Bueno, J. 1997, ApJ, 490, 383 [NASA ADS] [CrossRef] [Google Scholar]

[28] Trujillo Bueno, J., & Fabiani Bendicho, P. 1995, ApJ, 455, 646 [NASA ADS] [CrossRef] [Google Scholar]

[29] Štěpán, J., & Trujillo Bueno, J. 2013, A&A, 557, A143 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[30] van Noort, M., Hubeny, I., & Lanz, T. 2002, ApJ, 568, 1066 [NASA ADS] [CrossRef] [Google Scholar]