A&A 485, 327-336 (2008)
DOI: 10.1051/0004-6361:20078911

A Boltzmann-kinetical description of an MHD shock with arbitrary field inclination

M. Siewert - H.-J. Fahr

Argelander Institut für Astronomie der Universität Bonn, Abteilung f. Astrophysik und Extraterrestrische Forschung, Auf dem Huegel 71, 53121 Bonn, Germany

Received 24 October 2007 / Accepted 27 March 2008

Abstract
Aims. We revisit the general problem of the anisotropic MHD shock for arbitrary magnetic field inclinations, where the jump conditions are underdetermined. To describe the transition region of the shock, we derive a variant of a kinetic Boltzmann-Vlasov equation previously used to describe the perpendicular shock in the absence of dissipative processes.
Methods. We derive effective force terms, for the kinetic equation, that are based on the conservation of the Chew-Goldberger-Low (CGL) MHD invariants which appear in the standard model for anisotropic MHD. This approach is based on a generalisation of the well-known equivalence between the first CGL invariant and the integral over the magnetic moments of the underlying particles.
Results. Assuming an arbitrary distribution function on the upstream side, we integrate the kinetic equation across the shock. This result allows us to establish further relations between the MHD velocity moments on both sides. Using this additional information, we close the anisotropic MHD jump conditions. In addition, the now unique solution of the jump conditions allows us to present explicit cuts through a representative Maxwellian distribution function on both sides of the shock. In the kinetic equation, one only requires two parameters that need to be derived from the classical jump conditions, the classical MHD compression ratio and an equivalent ratio for the magnetic field strengths.

Key words: plasmas - shock waves - magnetohydrodynamis (MHD) - sun: solar wind

1 Introduction

We published previously several kinetic studies of an ion plasma crossing an MHD shock, such as the bow shock of the Earth or the solar wind termination shock (Fahr & Siewert 2006; Siewert & Fahr 2007a,b), that attempted to improve our understanding of this area of magnetohydrodynamics. Essentially, MHD shocks have been described using one of the following approaches, each of which possesses its own, inherent flaws.

First, from the theoretical MHD side, there are Rankine-Hugoniot-like jump conditions, based on the conservation of moments such as the MHD mass and momentum fluxes, the energy flux, and the conservation of the Poynting vector flux (see, e.g. Vogl et al. 2003; Hudson 1970; Erkaev et al. 2000). This approach is based on MHD and describes only a few low-order velocity moments of the plasma flow instead of the full ion velocity distribution function $f(\vec {w})$ . Therefore, this approach is well suited to the analysis of Maxwell-Boltzmann-like distribution functions, where the entire function can be parameterised using only a few velocity moments, but may be inappropriate for, say, power-law distribution functions, as those found in most cosmic ray spectra (see, e.g. Fisk & Gloeckler 2006; Schlickeiser 2002). In addition, it turns out that, for an anisotropic plasma, the jump conditions are underdetermined, where one possible parameterisation is to assume that the downstream pressure anisotropy, $\lambda_2 = p_{\perp,2}/p_{\parallel,2}$ , is the free parameter (Erkaev et al. 2000). Since an anisotropic plasma may emerge in the presence of magnetic fields (Chew et al. 1956), this restriction can be traced back to the inherent MHD flaw, where one initially obtains an infinite hierarchy of moment equations (see e.g. Cercignani 1988), which needs to be truncated at an arbitrary point.

The second approach to this problem is based on experimental observations, followed by appropriate modelling of these data. The cluster mission (Escoubet et al. 1997) has observed the Earth bow shock for many years now, observing its highly nonstationary behaviour (see e.g. Lobzin et al. 2007). Another prominent set of shock-related data was taken by the Voyager 1 spacecraft, which, in late 2004, crossed the solar wind termination shock (Stone et al. 2005; Decker et al. 2005): the spacecraft observed power-law spectra of practically unchanging power indices across the shock (Cummings et al. 2006), in addition to a series of magnetic bumps and holes in the heliopause (Burlaga et al. 2006a,b), which could be explained in terms of plasma waves. Up to now, however, these experimental data can only be fitted using by an ad hoc model because a robust physical explanation of their origin does not exist yet.

A third approach to the understanding of understand MHD shocks has also emerged, that is based on numerical shock simulation models completed using powerful supercomputers (e.g. Hada et al. 2003; Scholer et al. 2003): this, in principle, allows to study not only the behaviour of the individual particles, but also the nonstationarity in the ``fine structure'' of the transition region (i.e. fields and distribution functions inside the shock). However, numerical simulations always require the introduction of boundary conditions, in addition to a numerically stable ``stepping scheme'', which, in principle, may introduce additional, unphysical terms in the underlying equations (see, e.g. Press 1987-2002, Chap. 19). A consistent description of the physical boundary conditions typically requires the simulation of a system that is much larger than the transition region of the shock itself, and resolving this region can be difficult. One example of such a situation is the heliospheric termination shock, where the heliosphere with an average radius of the order of 100 AU requires a grid size that is considerably different from the typical size of a shock, which is assumed to be of the order of a few gyroradii (Fahr & Siewert 2007; Scholer et al. 2003). For these reasons, in any large-scale simulations, it is possible to verify only the existence of a shock by using the Rankine-Hugoniot jump conditions. Results from such numerical simulations demonstrate a highly nonstationary behaviour, similar to in situ space observations, for which, however, no complete theoretical understanding exists.

Since all of these approaches are, in one way or another, incomplete, we developed an approach based on a kinetic Boltzmann-Vlasov equation, which might be able to fill the gaps between the competing descriptions. In contrast to common MHD, our approach provides a description of the entire distribution function $f(\vec {w})$ , which includes all forms of nonthermal distribution functions. So far, this model allowed us to explain the conserved power-law index observed by the Voyager 1 spacecraft (Siewert & Fahr 2007a,b), as well as the magnetic bumps and holes observed by the same spacecraft in the heliosheath (Fahr & Siewert 2007; Burlaga et al. 2006b). Our results imply that, to obtain a (quasi-)stationary transition region, one requires additional physical processes in addition to the deceleration of ions and the change in electric and magnetic fields across the shock. Since, in our approach, the conservation of the mass flow transforms into a set of specific mathematical conditions, the additional physical processes to be included must be of a rather specific form as well, unless the transition region is highly nonstationary (Siewert & Fahr 2007b). This result is agrees with the nonstationarity emerging in both numerical approaches and experimental observations, which also implies that additional, internal microphysics should be taken into account. In this study, we restrict ourselves to the solar wind termination shock, which is located approximately at a similar spatial position over a significant period of time. We reinvestigate our kinetic model of the MHD shock, and derive a new, improved form of the kinetic Boltzmann equation for the general shock, which is based on a new, systematic connection between the MHD view and the per-particle view of the shock. These arguments enable to be removed most of the physical and mathematical problems that emerged in previous studies (e.g. Fahr & Siewert 2006; Siewert & Fahr 2007a,b). As a side-result, we prove the equivalence of MHD invariants and single particle invariants, by generalising the conservation of the ``magnetic CGL-moments''.

2 The kinetic approach

2.1 The mathematical and physical decomposition of the Boltzmann equation

In Siewert & Fahr (2007b), we proved that the specific form of the Boltzmann equation which was used in earlier studies is inherently unable to describe the parallel MHD shock in the anisotropic CGL model. For this reason, we now rederive the Boltzmann equation, looking carefully for shortcomings and flaws in the up-to-now description, including missing terms.

We consider a more general form of the Boltzmann equation that we later develop to include nontrivial terms, such as wave-turbulence generation, diffusion and an exchange of particle number, energy and momentum between multiple particle populations, which occur when we consider processes such as charge exchange, ionization or wave-particle induced friction. Following the earlier studies of Fahr & Siewert (2006) and Siewert & Fahr (2007a,b), a sufficiently general form of the Boltzmann-Vlasov equation is given by

$\displaystyle % A_{\rm s}(\vec{w},s) \frac{\rm d}{{\rm d}s} f(\vec{w},s) = A_\p... ...l} f(\vec{w},s) + A_\perp(\vec{w},s) \frac{\rm d}{{\rm d}w_\perp} f(\vec{w},s),$

(1)

where the basic Boltzmann equation has been specialised to a spatially one-dimensional problem with the streamline coordinate s. The coefficients $A_\parallel$ and $A_\perp$ are, at this point, free functions, which need to be derived from the physical processes in the transition region of the shock.

In a more general approach, one has to consider small fluctuations, in terms of plasma waves and turbulence. In the presence of a stationary background magnetic field, these processes are described traditionally using the Fokker-Planck equation (see, e.g. Schlickeiser 1989; Chalov & Fahr 1998), which then introduces terms such as

$\begin{displaymath}\frac{1}{w^2} \frac{\rm d}{{\rm d}w} \left( w^2 D_{ww} \frac{\rm d}{{\rm d}w} f \right)\cdot \end{displaymath}$

(2)

Obviously, these terms lead to the emergence of second-order derivatives in Eq. (1),

$\displaystyle % A_{\rm s}(\vec{w},s) \frac{\rm d}{{\rm d}s} f(\vec{w},s)$	=	$\displaystyle A_\parallel(\vec{w},s) \frac{\rm d}{{\rm d}w_\parallel} f(\vec{w},s)$
		$\displaystyle + A_\perp(\vec{w},s) \frac{\rm d}{{\rm d}w_\perp} f(\vec{w},s)$
		$\displaystyle + A_{\parallel, \parallel}(\vec{w},s) \frac{{\rm d}^2}{{\rm d}w_\parallel^2} f(\vec{w},s)$
		$\displaystyle + A_{\perp, \perp}(\vec{w},s) \frac{{\rm d}^2}{{\rm d}w_\perp^2} f(\vec{w},s)$
		$\displaystyle + A_{\parallel, \perp}(\vec{w},s) \frac{{\rm d}^2}{{\rm d}w_\parallel \ {\rm d}w_\perp} f(\vec{w},s),$	(3)

as well as to further contributions to $A_\parallel$ and $A_\perp$ . For this reason, we call the coefficients A_i the first-order and second-order Fokker-Planck terms (which are not to be confused with the better known Fokker-Planck coefficients). However, in this study, we focus on the more basic situation of a wave-free plasma, where all second-order terms vanish, since in this special situation, the general form of the solution is already known (see Siewert & Fahr 2007b, with the main mathematical methods repeated later in this paper). Finally, we work with the gyroaveraged approach commonly found in literature, i.e. there are only two velocity components, the parallel velocity $w_\parallel$ , and the perpendicular velocity $w_\perp$ . Depending on the specific structure inside the transition region, this approach may be no longer justified; however, a more specific description of this configuration does not yet exist in an analytical approximation.

The Boltzmann equation may also be parameterised from a physical standpoint that emphasizes not the mathematical form but the physics behind it. We write this alternate representation in the form

$\displaystyle L_{\rm kin}[f]$	+	$\displaystyle L_{\rm F}[f]$
	+	$\displaystyle L_{\rm acc}[f]$
	+	$\displaystyle Q_{\rm source}[f,f^\ast] = 0.$	(4)

Here L_i is a linear differential operator acting on the distribution function; the subscripts stand for kinetic terms on the individual particle level, ``true'' Force terms, and pseudo forces within an accelerated reference frame. Finally, we take into account of a possible source/sink term, which represents a possible exchange of particles, energy, and momentum between different particle populations; this term may be, in principle, nonlinear and dependent on multiple distribution functions, such as ions and electrons. We denote the distribution function related to this additional population with an asterisk. In principle, the corresponding operator does not need to be linear, and is referred as Q.

2.2 The Fokker-Planck terms in the various reference frames

A general formulation of the collisionless Boltzmann equation (i.e. the Vlasov equation) in an accelerated reference frame is given by

$\begin{displaymath}\left(\vec{w} \cdot \vec{\nabla}_x\right) f + \left(\vec{F} \... ...{\rm d}\vec{U}}{{\rm d}t} \cdot \vec{\nabla}_{w}\right) f = 0, \end{displaymath}$

(5)

where ${\rm d}\vec{U}/{\rm d}t$ is the acceleration of the bulk plasma system that moves with the bulk velocity $\vec{U}$ , i.e. this term may be identified with $L_{\rm acc}$ . The first term of the left-hand side of the expression corresponds to the kinetic term $L_{\rm kin}$ , while the middle term may be identified with the electromagnetic force term $L_{\rm F,em}$ . Specializing to a one-dimensional problem, we obtain

$\begin{displaymath}L_{\rm kin}[f](\vec{w},s) = w_\parallel \cos\Theta_{B{\rm n}}\frac{\rm d}{{\rm d}s} f(w_\parallel,w_\perp) \end{displaymath}$

(6)

in the (accelerated) plasma frame, where $\Theta_{B{\rm n}}$ is the angle between the magnetic field and the shock normal. Since this is the only term containing spatial gradients of the distribution function, one obtains easily the final form of $A_{\rm s}$ ,

$\begin{displaymath}A_{\rm s} = w_\parallel \cos\Theta_{B{\rm n}}(s) = U_{\rm n}(w_\parallel,s), \end{displaymath}$

(7)

where we introduce the differential bulk velocity $U_{\rm n}(w_\parallel,s)$ . This means that any temporal derivative appearing in the equation may be transformed into a spatial derivative using

$\begin{displaymath}\frac{\rm d}{{\rm d}t} = w_\parallel \cos\Theta_{B{\rm n}}(s) \frac{\rm d}{{\rm d}s}, \end{displaymath}$

(8)

Then, as long as the Fokker-Planck terms $A_\parallel$ and $A_\perp$ are proportional to d/dt, is it possible to completely eliminate the factor $w_\parallel \cos\Theta_{B{\rm n}}(s)$ . We note that this point was missing in an earlier equation derived in Fahr & Siewert (2006), which produced a singularity under specific conditions. Deriving the ``real'' force term is more complicated because the electromagnetic tension forces (i.e. electric and magnetic force terms) must be considered. One possible alternative approach to this problem is

$\begin{displaymath}\frac{{\rm d}f}{{\rm d}t} = \sum_i \left( \frac{{\rm d}w_\pa... ...erp}{{\rm d}t}\right)_{i} \frac{\partial f}{\partial w_\perp}, \end{displaymath}$

(9)

where the sums are performed over all parameters which get modified by the shock (and no explicitly time-dependent terms do appear). Unfortunately, collecting these parameters is still rather complicated, as several nontrivial points have to be taken into account. For example, in pure MHD, the electromagnetic fields are considered to be frozen into the plasma on the far upstream and downstream sides of the plasma, which implies that this should be true also for the small transition region in which pure MHD no longer holds. Thus, the behaviour of the particles inside the shock is not arbitrary, but restricted by several pseudo-MHD properties.

In principle, there are two different ways to derive the factors ${\rm d}w_i/{\rm d}t$ in their final form. There is first the per-particle approach, where no a priori limits or averages are made, that considers a single particle, with three velocity coordinates including a gyroangle, and the full force acting on this particle. Then, one has to derive the modifications to this force caused by the local field modifications inside the shock, which in turn have to be derived from the particle behaviour inside the system. In a sufficiently general case, this system of equations is complicated and has to be solved numerically, i.e. the analytical forms of the kinetic terms cannot be derived.

The other approach is what we call the semikinetic approach. From the kinetic view, we borrow the idea of taking an individual particle (or, alternatively, a narrow region in velocity space), but we describe the force terms using MHD quantities. In other words, force terms and their corresponding energy and momentum exchange are implicitly included in MHD quantities such as the bulk velocity, the partial pressures, and the magnetic field tension. These quantities must then be parameterised in a way that is consistent with the MHD jump conditions, which leads to a consistent description of the transition region using MHD quantities only, where only a small part of the MHD parameters actually needs to be modelled. This produces, however, a complicated equation because all important MHD quantities must be represented, that is the magnetic field, partial pressures, bulk velocities, and mass density, which are all related in some way. Finally, one requires a systematic relation between the MHD quantities and the individual kinetic velocities, unless all particles react in an identical way to the shock, which would be unphysical. In the remaining part of this section, we introduce a formalism than enables this to be realised in a straightforward way.

We begin with a phenomenological motivation of this formalism. First, we would like to emphasize that, in a wide variety of physical systems, the magnetic moment of the individual particles is conserved, i.e.

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \mu \propto \frac{\rm d}{{\rm d}t} \frac{w_\perp^2}{B} = 0. \end{displaymath}$

(10)

For a broad distribution function $f(\vec {w})$ , the total magnetic moment of all particles is conserved, which is given by

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \mu_f = \frac{\rm d}{{\rm d}t} \int {\rm d}^3w \ \frac{w_\perp^2}{B} \ f(\vec{w}) = 0. \end{displaymath}$

(11)

However, this expression may be interpreted in terms of an MHD velocity moment, i.e.

$\displaystyle \frac{\rm d}{{\rm d}t} \mu_f$	=	$\displaystyle \frac{\rm d}{{\rm d}t} \frac{1}{B} \int {\rm d}^3w \ w_\perp^2 \ f(\vec{w})$
	=	$\displaystyle \frac{\rm d}{{\rm d}t} \frac{1}{B} \langle w_\perp^2\rangle$
	=	$\displaystyle \frac{\rm d}{{\rm d}t} \frac{p_\perp}{B}\cdot$	(12)

As a generalisation of this, any individual particle invariant that is proportional to $w_\parallel^i \ w_\perp^j$ may be trivially transformed into an MHD form, simply by applying

$\begin{displaymath}w_\parallel^i \ w_\perp^j \rightarrow \langle w_\parallel^i \ w_\perp^j\rangle. \end{displaymath}$

(13)

For the magnetic moment, the corresponding MHD invariant is given by

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \frac{p_\perp}{B} = {\rm const.} \end{displaymath}$

(14)

Now, it is known that, in the CGL model for anisotropic MHD systems (Chew et al. 1956), one obtains two adiabatic invariants that relate the different MHD parameters with each other. These two invariants are given by

$\begin{displaymath}\frac{\rm d}{{\rm d}t} {\rm CGL}_1 = \frac{\rm d}{{\rm d}t} \... ...f}}{\rho} = \frac{\rm d}{{\rm d}t} \frac{p_\perp}{B \rho} = 0, \end{displaymath}$

(15)

and

$\begin{displaymath}\frac{\rm d}{{\rm d}t} {\rm CGL}_2 = \frac{\rm d}{{\rm d}t} \frac{p_\parallel B^2}{\rho^3} = 0. \end{displaymath}$

(16)

Obviously, the first CGL invariant is given by a ``normalised'' form of the MHD magnetic moment.

In an inertial rest frame, the normalisation of the stationary distribution function is constant, i.e. $\dot{\rho}=0$ . In our formalism, however, we are working in an accelerated reference frame, where $\dot{\rho}\neq 0$ . Introducing the normalised distribution function

$\begin{displaymath}f_{\rm norm}(\vec{w}) = \frac{f(\vec{w})}{\rho}, \end{displaymath}$

(17)

we see that the perpendicular pressure is given by

$\begin{displaymath}p_\perp \propto \langle w_\perp^2\rangle \propto \int {\rm d}^3w \ w_\perp^2 \ f_{\rm norm}(\vec{w}), \end{displaymath}$

(18)

and the additional factor $\rho^{-1}$ in Eq. (15) cancels out the contributions from the acceleration of the system. In other words, Eq. (13) has to be generalised to

$\begin{displaymath}\frac{w_\parallel^i \ w_\perp^j}{\rho} \rightarrow \frac{\la... ...\rho} = \langle w_\parallel^{i} \ w_\perp^j\rangle_{\rm norm}, \end{displaymath}$

(19)

which is valid both in inertial and non-inertial rest frames.

Next, we demonstrate that, in the absence of stochastic processes, the opposite direction of Eq. (19) is also valid. In other words, it is possible, under certain conditions, to interpet an MHD invariant in terms of an integral over per-particle invariants. Taking, representatively, the second CGL invariant, we may write

$\displaystyle \frac{\rm d}{{\rm d}t} \frac{p_\parallel}{\rho} \frac{B^2}{\rho^2}$	=	$\displaystyle \frac{\rm d}{{\rm d}t} \int {\rm d}^3w \ w_\parallel^2 \frac{B^2}{\rho^2} \ f_{\rm norm}(\vec{w})$
	=	$\displaystyle \int {\rm d}^3w \ f_{\rm norm}(\vec{w}) \frac{\rm d}{{\rm d}t} \left(w_\parallel^2 \frac{B^2}{\rho^2} \right)$
	$\textstyle \quad+$	$\displaystyle \frac{B^2}{\rho^2} \int w_\parallel^2 \frac{\rm d}{{\rm d}t} \left( {\rm d}^3w \ f_{\rm norm}(\vec{w}) \right) = 0,$	(20)

Obviously, as soon as the second integral in this sum vanishes, the first integral vanishes as well, which requires that

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \left(w_\parallel^2 \frac{B^2}{\rho^2} \right) = 0. \end{displaymath}$

(21)

Since this approach is not restricted to the second CGL invariant alone, we may write our condition in the more general form

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \frac{\langle w_\parallel^i w_\perp^j\... ...rac{\rm d}{{\rm d}t} \frac{w_\parallel^i w_\perp^j}{\rho} = 0. \end{displaymath}$

(22)

In Appendix A, we demonstrate in detail how Eq. (22) is valid under the conditions presented at the start of this section.

Although we do not study the physical nature of Eq. (21), we emphasize that the mostly mathematical approach to this identification requires that such an invariant must exist. In Fahr & Siewert (2008), we identified this invariant in the solar wind, following from the divergence of the plasma stream and the corresponding Parker model for the frozen-in magnetic fields (Parker 1965). Using a general expression for the corresponding velocity modification,

$\begin{displaymath}\frac{\rm d}{{\rm d}t} w_\parallel = -w_\parallel \frac{\vec{... ...vec{\nabla}}{B} \left( \frac{\vec{U}\cdot\vec{B}}{B} \right), \end{displaymath}$

(23)

the reason for the resulting change of the parallel velocity component $w_\parallel$ is due to the recognition of the bulk velocity gradient parallel to the magnetic field $\vec{B}$ , by the particle, when it covers a path $w_\parallel$ per unit time of its motion parallel to $\vec{B}$ . However, evaluating this expression requires an intimate knowledge of the reaction of the frozen-in magnetic field, for which, inside the transition layer of the shock, no systematic theory yet exists.

Therefore, lacking any better description, we make the ad-hoc assumption that both CGL invariants are conserved inside the shock. This approach requires that the magnetic field must be changing slowly. In other words, the reorientation and condensation timescale $\tau_{\rm c}$ must be much larger than the gyration timescale,

$\begin{displaymath}\tau_{\rm c} = \frac{{\rm d}B}{{\rm d}t} / B \gg \tau_{\rm gyr}. \end{displaymath}$

(24)

Since all MHD quantities in the system are connected with each other, similar conditions must hold for the other MHD quantities appearing in the adiabatic invariants as well. From this requirement, it is automatically possible to derive another condition, namely the absence of particle-particle collisions. Since the conservation of the magnetic moment requires slow variations of all parameters, including the perpendicular particle velocities, any fast energy and momentum exchange mechanism (such as scattering) must be absent from the system, or the gyration of the individual particles would be significantly perturbed. The same requirement holds for the second CGL invariant to be valid (see e.g. Kulsrud 1983, p. 115).

Finally, we consider the Eqs. (21) and (10) to evaluate the temporal derivative and obtain the expressions

$\begin{displaymath}\left. \frac{{\rm d}w_\parallel}{{\rm d}t} \right\vert _{\rm CGL_2} = w_\parallel \frac{\rm d}{{\rm d}t} (\ln \rho - \ln B) \end{displaymath}$

(25)

and

$\begin{displaymath}\left. \frac{{\rm d}w_\perp}{{\rm d}t} \right\vert _{\mu} = \frac{w_\perp}{2} \frac{\rm d}{{\rm d}t} \ln B. \end{displaymath}$

(26)

These two Eqs. (26) and (25), are sufficient to describe the system. Then, collecting our results, the Fokker-Planck terms are given by

$\displaystyle A_{\rm s}$	=	$\displaystyle w_\parallel \cos\Theta_{B{\rm n}}$	(27)
$\displaystyle A_\parallel$	=	$\displaystyle -w_\parallel^2 \cos\Theta_{B{\rm n}}\frac{\rm d}{{\rm d}s} (\ln \rho - \ln B)$	(28)
$\displaystyle A_\perp$	=	$\displaystyle -\frac{w_\perp}{2} w_\parallel \cos\Theta_{B{\rm n}}\frac{\rm d}{{\rm d}s} \ln B.$	(29)

In these equations, the additional negative sign follows from the fact that, in Eq. (1), the force term was moved to the other side of the equation. From the argument following Eq. (8), the global factor $w_\parallel \cos\Theta_{B{\rm n}}$ may be removed, which allows us to write the simpler form

$\displaystyle A_{\rm s}$	=	1	(30)
$\displaystyle A_\parallel$	=	$\displaystyle -w_\parallel \frac{\rm d}{{\rm d}s} ( \ln \rho - \ln B )$	(31)
$\displaystyle A_\perp$	=	$\displaystyle -\frac{w_\perp}{2} \frac{\rm d}{{\rm d}s} \ln B.$	(32)

Then, we obtain the final form of the kinetic Boltzmann equation,

$\begin{displaymath}\frac{\rm d}{{\rm d}s} f = - w_\parallel \frac{\rm d}{{\rm d}... ...} \frac{\rm d}{{\rm d}s} \ln B \frac{\rm d}{{\rm d}w_\perp} f. \end{displaymath}$

(33)

This equation describes the full downstream distribution function at an MHD shock (instead of only a few, low-order velocity moments) using the assumption that the adiabatic invariants of the CGL theory also hold inside the transition region of the shock.

It it worth mentioning that this equation does not depend upon the magnetic field orientation, which is represented by the fact that no magnetic field projections ( $B_{\rm n}$ or $B_{\rm t}$ ) appears. Therefore, the terms related to this effect derived by Fahr & Siewert (2006) must be discarded on account of mixing the semikinetic approach with the full kinetic approach. Since both approaches rely on different amounts of averaging and other approximations, a self-consistent description of the shock must not mix these different representations.

2.3 Solutions of the improved Boltzmann equation

In Siewert & Fahr (2007b), we derived restrictions for the possible form of the Fokker-Planck terms A_i, based on the concept that the average parallel velocity $\langle w_\parallel\rangle$ vanishes in the plasma frame. At this point, we repeat the nontrivial parts relevant for the solution of our kinetic equation. First, we emphasize that the adiabatic invariants (Eqs. (15) and (16)) are only valid in the rest frame comoving with the system. This may be understood since we derived the Fokker-Planck terms $A_\parallel$ and $A_\perp$ using adiabatic invariants depending on the partial pressures $p_\parallel$ and $p_\perp$ . Conventionally, these partial pressures are taken in the ``natural'' rest frame that is comoving with the plasma, since the integral

$\begin{displaymath}\langle w^2 \rangle = \int {\rm d}w \ w^2 \ f(w) \end{displaymath}$

(34)

is not invariant under a coordinate transformation of the form $w \rightarrow w + U$ . Now, this specific reference frame is characterised by the fact that the velocity moment $\langle \vec{w}\rangle$ vanishes. Taking our specific choice of the velocity coordinate system, this translates into the requirements that

$\begin{displaymath}\langle \vec{w} \rangle = \left \langle \begin{array}{c} w... ...p\sin\phi \end{array}\right \rangle \stackrel{!}{=} \vec{0}, \end{displaymath}$

(35)

where the second and third terms vanish because the distribution function f does not depend upon the gyroangle $\phi$ . The only nontrivial part of this equation is $\langle w_\parallel\rangle=0$ .

In the absence of stochastic processes, any single point in phase space will remain forever a single point, which justifies the approach

$\begin{displaymath}f_2(w_\parallel,w_\perp) = f_1(\bar{w}_\parallel(\vec{w}), \bar{w}_\perp(\vec{w})), \end{displaymath}$

(36)

i.e. the statistical weights $f(\vec {w})$ at the individual points in velocity space are simply moved around, but not smeared out. Here, the subscripts 1 and 2 denote the upstream and downstream distribution functions, respectively. Next, we take into account that the absolute normalisation of the distribution function in an accelerated rest frame is not constant, which is conventionally parameterised using the MHD compression ratio x, i.e. $\rho_2 = x \rho_1$ . Then, we write down this relation using the full distribution functions,

$\displaystyle \rho_2$	=	$\displaystyle \int {\rm d}^3\bar{w} \ D \ f_1(\bar{w}_\parallel, \bar{w}_\perp)$
	$\textstyle \stackrel{!}{=}$	$\displaystyle x \rho_1 = x \int {\rm d}^3w \ f_1(w_\parallel, w_\perp),$	(37)

where x is an arbitrary positive number that has to be derived from the MHD jump conditions. Since this relation must not depend on the f_i or x, we see that

$\begin{displaymath}{\rm d}^3\bar{w} \ D \stackrel{!}{=} {\rm d}^3w \ x. \end{displaymath}$

(38)

In other words, the Jacobi determinant of the transformation must encode the compression ratio between the upstream and downstream sides in a specific way. This result means that D may not depend upon the particle velocities, and the relation connecting upstream and downstream variables must therefore be linear, such as

$\displaystyle w_\parallel$	=	$\displaystyle C_\parallel \bar{w}_\parallel + C_\parallel' \bar{w}_\perp$	(39)
$\displaystyle w_\perp$	=	$\displaystyle C_\perp' \bar{w}_\parallel + C_\perp \bar{w}_\perp.$	(40)

Now, all that remains to do is to determine the form of the coefficients C_i and B_i. To do this, we take Eq. (35) on the downstream side and express it in the integral form,

0	$\textstyle \stackrel{!}{=}$	$\displaystyle \int {\rm d}^3w \ w_\parallel \ f_2(w_\parallel,w_\perp)$
	=	$\displaystyle \int {\rm d}^3w \ w_\parallel \ f_1(\bar{w}_\parallel(\vec{w}), \bar{w}_\perp(\vec{w}))$
	=	$\displaystyle \int {\rm d}^3\bar{w} \ D \ (C_\parallel \bar{w}_\parallel + C_\parallel' \bar{w}_\perp) \ f_1(\bar{w}_\parallel, \bar{w}_\perp)$
	=	$\displaystyle 0 + B_\parallel \int {\rm d}^3\bar{w} \ D \ f_1(\bar{w}_\parallel, \bar{w}_\perp).$	(41)

Here, the first term vanishes because of our choice of the reference frame, while the second integral is always greater than zero. Therefore, to make the entire expression vanish, one requires $C_\parallel' = 0$ , i.e. the parallel velocities do not become mixed with the perpendicular ones. Using Eq. (38) and writing out the differential then allows to prove that the perpendicular velocities do not get parallel contributions either. This means that the most general, mass flow conserving transformation between the upstream and downstream coordinates is given by

$\displaystyle w_\parallel$ = $\displaystyle C_\parallel \bar{w}_\parallel$	(42)
$\displaystyle w_\perp$ = $\displaystyle C_\perp \bar{w}_\perp.$	(43)

Then, using Eq. (38), one immediately sees that

$\begin{displaymath}{\rm d}^3\bar{w} \ D = {\rm d}^3w \ C_\parallel C_\perp^2 = {\rm d}^3w \ x, \end{displaymath}$

(44)

where the additional factor $C_\perp$ follows from the fact that d $^3w \propto w_\perp$ in cylinder coordinates. In other words, we obtain the additional condition

$\begin{displaymath}C_\parallel C_\perp^2 = x. \end{displaymath}$

(45)

Now, taking Eqs. (36) and the most general coordinate transformation (i.e. Eqs. (42) and (43)), we see that the most general, mass flow conserving solution of Eq. (1) is given by

$\begin{displaymath}f_2(w_\parallel, w_\perp) = f_1\left(\frac{w_\parallel}{C_\parallel}, \frac{w_\perp}{C_\perp}\right), \end{displaymath}$

(46)

where the parameters C_i must be independent of $\vec{w}$ .

From this point, the rest of our formalism is rather straightforward mathematics. Taking Eqs. (42) and (43), inserting them into Eq. (1) and comparing coefficients then allows to reduce the partial differential equation to the two ordinary differential equations

$\begin{displaymath}\frac{{\rm d}C_\parallel}{{\rm d}s} = -\frac{A_\parallel}{w_\parallel} C_\parallel \end{displaymath}$

(47)

and

$\begin{displaymath}\frac{{\rm d}C_\perp}{{\rm d}s} = -\frac{A_\perp}{w_\perp} C_\perp. \end{displaymath}$

(48)

Since the coefficients C_i must not depend upon $\vec{w}$ , the Fokker-Planck terms A_i must be linear functions of w_i,

A_i (w_i,s) = a_i(s) w_i,

(49)

and the differential equations are formally solved by

$\begin{displaymath}C_{ i} = \exp \left( -\int a_{ i} \ {\rm d}s \right). \end{displaymath}$

(50)

Then, the Eqs. (49), (50) and (45), applied in this order, allow us to determine if a kinetic equation is able to describe an MHD shock while conserving the mass flow, and the downstream distribution function is given by Eq. (46).

Now, we may apply this formalism to the equation that we derived in this study. Obviously, the A_i given by Eqs. (31) and (32) fulfill Eq. (49), i.e. they are linear functions of their corresponding velocities. Since they are of the form ${\rm d}/{\rm d}s \ln g(s)$ , evaluating Eq. (50) is trivial as well because the exponential and logarithmic functions cancel out, leading to

$\begin{displaymath}C_\parallel = x \frac{B_1}{B_2} \end{displaymath}$

(51)

and

$\begin{displaymath}C_\perp = \sqrt{ \frac{B_2}{B_1} }, \end{displaymath}$

(52)

from which we automatically see that Eq. (45) is indeed fulfilled. We may also derive analytic relations between all upstream and downstream MHD moments in the plasma rest frame, which are equivalent to knowledge of the distribution function. According to Siewert & Fahr (2007b), this relation is given by

$\begin{displaymath}a_{ij,2} = \left\langle w_\parallel^i w_\perp^j\right\rangle_... ...lel^i C_\perp^j a_{ij,1} = x C_\parallel^i C_\perp^j a_{ij,1}. \end{displaymath}$

(53)

The downstream pressure anisotropy derived using this formalism is then

$\begin{displaymath}\lambda_2 = \frac{C_\perp^2}{C_\parallel^2} \lambda_1 = \left(\frac{B_2}{B_1} \right)^3 x^{-2} \lambda_1. \end{displaymath}$

(54)

For the parallel shock ( $\Theta_{B{\rm n}}= 0$ ), this relation simplifies to

$\begin{displaymath}\lambda_{2,\parallel} = \frac{\lambda_1}{x^2}, \end{displaymath}$

(55)

while for the perpendicular shock ( $\Theta_{B{\rm n}}= \pi/2$ ), where one obtains B₂ = x B₁ (Erkaev et al. 2000), the downstream pressure anisotropy is given by

$\begin{displaymath}\lambda_{2,\perp} = x \lambda_1. \end{displaymath}$

(56)

Obviously, the kinetic behaviour of an arbitrary distribution function across the shock depends on only two parameters, the MHD compression ratio x and the total magnetic field strength ratio B₂/B₁, i.e. on one parameter related to the massive particles, and one parameter related to the electromagnetic fields. It is noteworthy that the behaviour of the distribution function depends only on upstream and downstream quantities, and not on the fine structure of the shock itself. In conventional MHD, it is usually assumed that a few velocity moments of low order are sufficient to describe the behaviour of the system; in this light, our result may be interpreted in a way that, at least when considering shocks, these few lowest-order moments may be further boiled down to a single kinetic parameter. This result likewise hints that our kinetic equation successfully describes the non-MHD region of the shock in a quasi-MHD approximation, which is as close to MHD as possible, while leaving a sufficient number of degrees of freedom to include strictly non-MHD behaviour. We return to this point in a future study.

Table 1: Initial upstream and final downstream parameters for a single-fluid, ion-only plasma. The downstream pressure anisotropies $\lambda$ are not estimated, but are an exact result using Eq. (54). Normalised values are used.

Since this formalism was derived in the comoving reference frame of the plasma, which must always exist no matter how complicated the microphysics in the system may be, we call this the minimal kinetic extension. In other reference frames, this extension should be similarly applicable, although the transformation of the velocity moments between the different reference frames is mathematically complicated, and not pursued further in this study.

Using the treatment of the MHD shock presented in this paper, we show a few selected results. We represent the upstream distribution function using a bi-Maxwellian function,

$\begin{displaymath}f(\vec{w}) = \exp \left( -\frac{w_\parallel^2}{\theta_\parallel^2} -\frac{w_\perp^2}{\theta_\perp^2} \right), \end{displaymath}$

(57)

with

$\begin{displaymath}\theta_i = \frac{2 k T_i}{m_p} = \sqrt{ \frac{2 p_i}{\rho}}, \end{displaymath}$

(58)

where the second identity follows from the ideal gas equation. Then, we use the same upstream parameters as those used by Erkaev et al. (2000), i.e. the Alfvenic Mach number $M_{\rm A} = 2$ and the sonic parameter $A_{\rm s} = 0.01$ . Using these parameters, it is possible to derive a more conventional form of the upstream parameters, the magnetic field strength

$\begin{displaymath}B = \frac{\sqrt{4\pi \rho_1 U_1^2}}{M_{\rm a}} \end{displaymath}$

(59)

and the upstream perpendicular pressure,

$\begin{displaymath}p_\perp = A_{\rm s} \rho_1 U_1^2. \end{displaymath}$

(60)

In addition to the initial two dimensionless parameters, we use a more conventional approach to the upstream mass flow, by adopting the mass density $\rho_1 = 1~{\rm m}^{-3}$ and the upstream bulk velocity $U = U_{\rm n} = 10^5~{\rm m/s}$ . Since the solution of the anisotropic jump conditions does not depend on the mass flow $U_{\rm n} \rho$ (see, e.g. Vogl et al. 2003; Erkaev et al. 2000), any other choice of the parameters constituting the mass flow would lead to similar results. In contrast to this, we remark that the dimensionless parameters used earlier would influence the result. Finally, we assume that, on the upstream side, the plasma is perfectly isotropic, i.e. $\lambda_1 = 1$ , and that the magnetic field orientation with respect to the shock normal is characterised by the angle $\Theta_{B{\rm n}}$ . For convenience, these upstream parameters are collected in Table 1.

Next, we solve the anisotropic MHD jump conditions (Erkaev et al. 2000; Hudson 1970) for the upstream parameters given. In a classical (i.e. MHD-only) approach to this problem, the equations are underdetermined, and one is faced with one more downstream parameter than equations, which means that an additional equation must be derived using a different formalism. In this study, we follow Erkaev et al. (2000) and select the downstream pressure anisotropy $\lambda_2$ , which may be described using Eq. (54). Using this approach, we finally arrive at a unique solution for the downstream parameters, which is given in Table 1. Using this solution, we derive the MHD compression ratio x and the ``field compression ratio'' B₂/B₁, allowing us to determine the full distribution function on the downsteam side.

$\begin{figure} \par\includegraphics[width=8cm,clip]{8911fig1.eps}\end{figure}$	Figure 1: Representative cuts through the distribution function at $w_\perp = 0$ for the inclined shock, using an upstream bi-Maxwellian distribution function and the parameters from Table 1.
Open with DEXTER

$\begin{figure} \par\includegraphics[width=8cm,clip]{8911fig2.eps}\end{figure}$	Figure 2: Representative cuts through the distribution function at $w_\parallel = 0$ for the inclined shock, using an upstream bi-Maxwellian distribution function and the parameters from Table 1.
Open with DEXTER

Since the perpendicular shock has already been treated in Siewert & Fahr (2007a), and the current approach to the force terms leads to identical results, we focus on the inclined and parallel shocks. In Figs. 1 and 2, we present cuts through the distribution functions for an inclined shock ( $\Theta_{B{\rm n}}= 45^\circ$ ) on the upstream and downstream sides, at $w_\perp = 0$ and $w_\parallel = 0$ , respectively. These figures demonstrate a basic property of our solution (Eq. (46)), namely that the shock does not modify the basic shape of the distribution function, which is still of the characteristic Maxwellian form. Instead, it modifies the broadness of this distribution, i.e. the parameter $\theta_{ i}$ in Eq. (57), which is a function of the partial pressures. For the inclined shock, both components of the velocity are modified, which directly follows from the fact that, in this case, both coefficients C_i are not unity (i.e. in Eqs. (51) and (52), the magnetic field ratio is not 1 or x).

For the parallel shock, the situation is different. Since this approach is defined by $B=B_{\rm n}$ , and it follows from the MHD jump conditions that the normal magnetic field is conserved, the magnetc field ratios appearing in the coefficients C_i are unity, and $C_\perp = 1$ , which leads to a completely unmodified perpendicular velocity. This effect is demonstrated in Figs. 3 and 4, where we present, again, cuts through the upstream and downstream distribution functions for an (almost) parallel shock. This is similar to the earlier result obtained for the perpendicular shock (Siewert & Fahr 2007a), where the parallel velocity components remain untouched.

$\begin{figure} \par\includegraphics[width=8cm,clip]{8911fig3.eps}\end{figure}$	Figure 3: Representative cuts through the distribution function at $w_\perp = 0$ for the almost parallel shock ( $\Theta _{B{\rm n}}\simeq 1^\circ$ ), using an upstream bi-Maxwellian distribution function and the parameters from Table 1.
Open with DEXTER

$\begin{figure} \par\includegraphics[width=8cm,clip]{8911fig4.eps}\end{figure}$	Figure 4: Representative cuts through the distribution function at $w_\parallel = 0$ for the almost parallel shock ( $\Theta _{B{\rm n}}\simeq 1^\circ$ ), using an upstream bi-Maxwellian distribution function and the parameters from Table 1.
Open with DEXTER

Finally, we point out that Eq. (33) does not depend upon the behaviour of the MHD quantities inside the transition region, which is in excellent agreement with MHD, and which also implies that we indeed find a kinetic description for the MHD shock that depends essentially only on MHD quantities. However, we emphasize that this approach is, by no means, a globally complete description, but applies only under the restrictions imposed by the MHD approach to the shock. First of all, MHD requires that the electromagnetic fields are frozen-in, i.e. convected along with the background plasma. Without this requirement, there would be no motion perpendicular to the magnetic field lines, and no perpendicular shock either. Therefore, although the frozen-in field condition is derived within the framework of classical MHD, it must be valid even when all other MHD requirements fail, since otherwise, there would be no perpendicular shock. For this reason, it must be expected that the fields are still frozen-in into the system even in the transition layer of the shock, in the sense of a generalised frozen-in field condition.

In addition, the MHD approach to shocks requires that the system is charge-neutral, i.e. that there are no local electric currents present. However, in a more consistent description, one has to include both ions and electrons as separate, interacting fluids. On the other hand, the jump conditions are explicitly tailored to one single fluid, which is typically interpreted in terms of an ion flow, with the implicit asumption that the much lighter electrons are convected along with the rest of the system, and that quasineutrality is obtained. Therefore, to arrive at a more consistent description of the MHD shock, one requires a two-fluid generalization of the MHD jump conditions, including an MHD formulation of charge-neutrality on the upstream and downstream sides. This, however, opens yet another problem, namely the fact that, inside the transition region, where MHD is not applicable, quasineutrality may no longer be an absolute requirement. Considering that the electron distribution function may be quite different from the ion one, this is clearly not a trivial problem. We emphasize that even the particle-field interactions present in MHD may already be interpreted as a two-fluid system, with one fluid being composed of massive particles, and the other fluid of frozen-in fields. In light of this interpretation, interactions between multiple fluids should, in fact, be possible in the framework of MHD. Such an approach would allow to describe wave generation, by including the wavemodes as yet another separate fluid. On the kinetic level, such interactions between various components of the model is realised by upgrading the Boltzmann-Vlasov equation to a Fokker-Planck-like form, where the interactions are parameterised as diffusion coefficients. To the best of our knowledge, there exists no comparable systematic theory of interacting fluids on the MHD level yet. The closest thing to such a theory found in literature is two-fluid hydrodynamics (see Holzer & Axford 1970).

Clearly, a self-consistent solution to all of these problems may become complicated and is far beyond the scope of this present study. As a first step towards such a description, we are currently working on a multifluid generalisation of the classical MHD jump conditions. Although this work is close to completion, we point out that the ``initial problem'', i.e. the fact that the anisotropic jump conditions are not perfectly closed, appears to be only the literal tip of the iceberg; for multiple fluids, the amount of free parameters seems to be growing, which offers an excellent interface to include fluid-fluid interactions, in terms of additional conditions required to close the generalised jump conditions. In face of all these aspects, our current result must be interpreted as a working, self-consistent description of the classical, single-fluid MHD shock only, and as a basis for future work.

3 Applications and outlook

3.1 On the incompleteness of a single-fluid system

Taking Eq. (53), we see that the partial downstream pressures are given by

$\begin{displaymath}p_{\parallel,2} = x C_\parallel^2 p_{\parallel,1} \end{displaymath}$

(61)

and

$\begin{displaymath}p_{\perp,2} = x C_\perp^2 p_{\perp,1}. \end{displaymath}$

(62)

As demonstrated by Erkaev et al. (2000), the anisotropic MHD jump conditions are underdetermined, and one additional equation is required, which they associated with the downstream pressure anisotropy $\lambda_2$ . Now, however, the minimal kinetic extension gives us two additional equations, transforming the underdetermined system of equations into an overdetermined one. This follows from the fact that the downstream pressure anisotropy is defined by

$\begin{displaymath}\lambda_2 = \frac{ p_{\perp,2} }{ p_{\parallel,2} }, \end{displaymath}$

(63)

which is invariant under the transformation

$\begin{displaymath}p_{\parallel,\perp}' = C \cdot p_{\parallel,\perp}, \end{displaymath}$

(64)

indicating that the absolute normalisation of the pressures is not preserved by $\lambda_2$ , and that, taking $\lambda_2$ alone might result in a solution that does not satisfy Eqs. (61) and (62). Introducing the parameter (Erkaev et al. 2000)

$\begin{displaymath}\epsilon = 1 - 4 \pi \frac{p_\parallel - p_\perp}{B^2}, \end{displaymath}$

(65)

one may express most of the partial pressure terms in the jump conditions as a function of $\epsilon$ and $\lambda_2$ , with one isolated perpendicular pressure remaining, which enables us to derive one of the partial pressures from MHD, and leads to two determination conditions for $p_{\perp,2}$ . However, $\epsilon$ is invariant under Eq. (64), since such a renormalisation may also be interpreted in terms of $B'^2 \rightarrow C \cdot B^2$ . This is sufficient to prove that Erkaev et al. (2000) are unable to predict the correct normalisation of the partial pressures, and that just providing an expression for $\lambda_2$ alone is insufficient to arrive at a closed system of equations. This may be understood since any theory capable of predicting the downstream pressure anisotropy must also be able to predict both individual partial pressures, which results in two more equations instead of just one, replacing the previously underdetermined system of equations with an overdetermined system. A possible solution for this situation is the inclusion of electrons, which introduces additional equations and parameters that might lead to a more consistent description. Based on our current description of a single fluid shock, we are currently working on a consistent description of a multifluid shock, which may be used to explicitly model quasineutrality and possibly also stochastic interactions or hybrid fluid-particle descriptions in a more systematic way than commonly found in literature.

3.2 The parallel shock and the transition region

Taking the general anisotropic jump conditions (Erkaev et al. 2000) and specialising them to the parallel shock ( $B_{\rm t} = 0$ , $B_{\rm n} = {\rm const.}$ ), one obtains

$\displaystyle \left[[B_{\rm n}]\right]$	=	0	(66)
$\displaystyle \left[[\rho U_{\rm n}]\right]$	=	0	(67)
$\displaystyle \left[[ U_{\rm t} B_{\rm n}]\right]$	=	0	(68)
$\displaystyle \left[[ p_\parallel + \rho U_{\rm n}^2 ]\right]$	=	0	(69)
$\displaystyle \left[[\rho U_{\rm n} U_{t}]\right]$	=	0	(70)
$\displaystyle \left[\left[U_{\rm n} \left(\frac{3}{2} p_\parallel + p_\perp + \frac{U_{\rm n}^2}{2}\right)\right]\right]$	=	0,	(71)

Obviously, this relation does not contain any variable magnetic field terms, which may be interpreted in terms of the plasma following the field lines. In other words, pure MHD is unable to describe a parallel shock, and inside the transition region of the shock, pure MHD must no longer hold. The existence of a parallel shock is mostly accepted on the condition that many astrophysical shock configurations (see e.g. Treumann & Scholer 2002) do require this configuration. From a mathematical point of view, the parallel shock may be described in terms of the limit $\Theta_{B{\rm n}}\rightarrow 0^\circ$ . As it turns out, the downstream transversal magnetic field does not converge towards zero in this limit, and the perfectly parallel shock may be unphysical, being replaced instead with a shock where the magnetic field is parallel on the upstream side, but not on the downstream side. However, this approach requires considerably more work, as the conventional anisotropic Rankine-Hugoniot equations do not allow such a solution.

While, in principle, many interpretations of this behaviour are possible, perhaps the most straightforward idea is that the transition region of the MHD shock differs from ideal MHD predictions, and that some of the jump conditions have to be modified. Since energy and momentum are conserved quantities even outside of MHD, and the normal magnetic field conservation is related to many other, non-MHD plasmaphysical applications as well, the conservation of the transverse electric field is the only MHD jump conditions which may, perhaps, be modified by the shock. This jump condition is closely related to the so-called frozen-in field condition (Alfvén & Fälthammar 1963),

$\begin{displaymath}\partial_{\rm t} \vec{B} - \vec{\nabla} \times ( \vec{U} \times \vec{B} ) = 0, \end{displaymath}$

(72)

which is required to allow motion perpendicular to the magnetic field, as it is allowed in conventional MHD. Clearly, the time-dependent term, which is usually set to zero on the far upstream and downstream sides, must be nonzero inside the shock to preserve frozen-in fields. Since the presence of time-dependent terms in electrodynamical equations is usually interpreted in terms of plasma waves, this automatically hints that an MHD shock is a natural plasma wave generator. However, since plasma waves usually require a quasineutral system (i.e. where ions and electrons are present in equivalent quantities), one needs to extend the MHD jump conditions to include at least two particle flows (or, alternatively, two charge flows). As already mentioned, we are working on a consistent description of all these aspects. We expect that this description is able to describe the behaviour of the magnetic fields inside the transition region, shedding more light on the currently unsolved points related to the second CGL invariant.

4 Conclusions

In this study, we derived an improved version of an earlier kinetic Boltzmann equation derived by Fahr & Siewert (2006), which attempts to describe MHD shocks, such as the solar wind termination shock. Using a more strict approach in terms of reference frames and initial assumptions, we were able to eliminate the restrictions which emerged in the earlier studies, strengthening the connection between MHD and kinetic theory. This new equation fulfils the requirement derived by Siewert & Fahr (2007b) based on the conservation of the mass flow, which suggests that Eq. (33) is a self-consistent description of a basic, turbulence-free MHD shock that depends only on MHD upstream and downstream quantities, but not on the behaviour of the plasma in the transition region.

In addition, we derived what might turn out to be a new theory of per-particle invariants, derived from MHD invariants, generalising the well-known equivalence between the single-particle magnetic moment conservation and the equivalent MHD adiabatic invariant. While we have not yet been able to prove that this generalisation leads to physical expressions for all possible MHD invariants, we have found several arguments that strongly suggest that this approach works, at least, for the two adiabatic invariants appearing in the CGL theory. Our current work hints that the conservation of the second CGL invariant is related to a bulk velocity gradient parallel to $\vec{B}$ and the corresponding reaction of the frozen-in magnetic field (Fahr & Siewert 2008).

Acknowledgements

We are grateful for financial support to the DFG within the frame of the DFG-Project Fa 97/31-2.

Appendix A: Transformation of an MHD invariant in a per-particle invariant

In this appendix, we prove that Eq. (22) is always fulfilled, for arbitrary distribution functions f₁(w). Writing down the temporal derivative of this expression, one obtains the following requirement

$\begin{displaymath}\int {\rm d}^3w \left( \left(\frac{\rm d}{{\rm d}t} f_1(w)\ri... ...t} \ w_\parallel^i \ w_\perp^j \right) \right)\stackrel{!}{=} \end{displaymath}$ $\begin{displaymath}\quad\quad\quad \quad\quad\quad\int {\rm d}^3w f_1(w) \left(\frac{\rm d}{{\rm d}t} \ w_\parallel^i \ w_\perp^j \right). \end{displaymath}$

(A.1)

In other words, we require

$\begin{displaymath}\int {\rm d}^3w \left(\frac{\rm d}{{\rm d}t} f_1(w)\right) \ w_\parallel^i \ w_\perp^j \stackrel{!}{=} 0. \end{displaymath}$

(A.2)

To prove this, we begin by using the most simple distribution function,

$\begin{displaymath}f(\vec{w}) = \delta(\vec{w} - \vec{w}_0), \end{displaymath}$

(A.3)

which describes a single particle (or a single ``cell'' in velocity space). Inserting this into Eq. (A.2) leads to

$\begin{displaymath}\int {\rm d}^3w \left(\frac{\rm d}{{\rm d}t} \delta(\vec{w} -... ...c{w}_0)\right) \ w_\parallel^i \ w_\perp^j \stackrel{!}{=} 0. \end{displaymath}$

(A.4)

Here, the derivative of the delta function trivially vanishes because of

$\displaystyle \frac{\rm d}{{\rm d}t} \int {\rm d}^3w \ \delta(\vec{w} - \vec{w}_0) \ w_\parallel^i \ w_\perp^j$	=	$\displaystyle \frac{\rm d}{{\rm d}t} \ w_{\parallel,0}^i \ w_{\perp,0}^j$
	=	$\displaystyle \int {\rm d}^3w \ \delta(\vec{w} - \vec{w}_0) \ \frac{\rm d}{{\rm d}t} w_\parallel^i \ w_\perp^j.$	(A.5)

This automatically means that

$\begin{displaymath}\int {\rm d}^3w \left( \frac{\rm d}{{\rm d}t} \delta(\vec{w} - \vec{w}_0) \right) \ w_\parallel^i \ w_\perp^j = 0. \end{displaymath}$

(A.6)

For a system with N particles, the situation is more complicated. Here, the distribution function is given by

$\begin{displaymath}f_1(\vec{w}) = \sum_{n=1}^N \ \alpha_{\rm n} \ \delta(\vec{w} - \vec{w}_{0,n}), \end{displaymath}$

(A.7)

where

$\begin{displaymath}\sum_{n=1}^N \ \alpha_{\rm n}(t) = 1 \end{displaymath}$

(A.8)

and

$\begin{displaymath}\frac{\rm d}{{\rm d}t} \sum_{n=1}^N \ \alpha_{\rm n}(t) = 0. \end{displaymath}$

(A.9)

Inserting this distribution function into Eq. (A.2) leads to

$\displaystyle 0 \stackrel{!}{=} \sum_{n=1}^N \left[ \left( \frac{\rm d}{{\rm d}... ...\rm d}^3w \delta(\vec{w} - \vec{w}_{\rm n}) \ w_\parallel^i \ w_\perp^j \right.$			(A.10)
$\displaystyle \left. + \alpha_{\rm n}(t) \ \int {\rm d}^3w \left( \frac{\rm d}{... ... \delta(\vec{w} - \vec{w}_{\rm n}) \right) \ w_\parallel^i \ w_\perp^j \right].$			(A.11)

Here, the second term vanishes because of Eq. (A.6). The first term may be removed by noting that, when controlled by the collisionless Boltzmann-Vlasov equation, individual particles in a physical system do always follow determined trajectories, and that therefore the weight functions $\alpha_{\rm n}(t)$ are constant parameters on these trajectories as a consequence of Liouvilles theorem. For this reason, the first term also vanishes, and we have proven that any temporal change of a velocity moment in an N-particle system may be described using the sum of changes of the individual particles.

This approach works only in the absence of stochastical processes, which destroy the uniqueness of the particle trajectories. For this reason, it may be assumed that this approach works for a broad distribution function only when stochastical processes are still absent, as it is the case for the Boltzmann-Vlasov equation. We will present a more detailed analysis under which conditions this holds in a future publication.

References

Alfvén, H., & Fälthammar, C. G. 1963, Cosmical Electrodynamics, 2nd Ed. (Oxford: Clarendon Press) (In the text)
Burlaga, L. F., Ness, N. F., & Acuna, M. H. 2006a, ApJ, 642, 584 [NASA ADS] [CrossRef]
Burlaga, L. F., Ness, N. F., & Acuna, M. H. 2006b, Geophys. Res. Lett., 33, L21106 [NASA ADS] [CrossRef]
Cercignani, C. 1988, The Boltzmann Equation and Its Applications (New York: Springer-Verlag) (In the text)
Chalov, S. V., & Fahr, H. J. 1998, A&A, 335, 746 [NASA ADS]
Chew, G. F., Goldberger, M. L., & Low, F. E. 1956, Proc. R. Soc. London A, 236, 112 [NASA ADS] (In the text)
Cummings, A. C., Stone, E. C., McDonald, F. B., Heikkila, B. C., & Lal, N. 2006, in Physics of the inner Heliosheath, AIP Conf. Proc., 858, 86 (In the text)
Decker, R. B., Krimigis, S. M., Roelof, E. C., et al. 2005, Science, 309, 2020 [NASA ADS] [CrossRef]
Erkaev, N. V., Vogl, D. F., & Biernat, H. K. 2000, J. Plasma Physics, 64, 561 [NASA ADS] [CrossRef]
Escoubet, C. P., Schmidt, R., & Goldstein, M. L. 1997, Space Sci. Rev., 79, 11 [NASA ADS] [CrossRef] (In the text)
Fahr, H.-J., & Siewert, M. 2006, A&A, 458, 13 [NASA ADS] [CrossRef] [EDP Sciences]
Fahr, H.-J., & Siewert, M. 2007, ASTRA, 3, 21 [NASA ADS]
Fahr, H.-J., & Siewert, M. 2008, A&A, 484, L1 [NASA ADS] [CrossRef] [EDP Sciences] (In the text)
Fisk, L. A., & Gloeckler, G. 2006, ApJ, 640, L79 [NASA ADS] [CrossRef]
Hada, T., Onishi, M., Lembege, B., & Savoinin, P. 2003, J. Geophys. Res., 108, 1233 [CrossRef]
Holzer, T. E., & Axford, W. I. 1970, ARA&A, 8, 31 [NASA ADS] [CrossRef] (In the text)
Hudson, P. D. 1970, Planet. Space Sci., 18, 1611 [NASA ADS] [CrossRef]
Kulsrud, R. M. 1983, in Handbook of Plasma Physics, ed. M. N. Rosenbluth, & R. Z. Sagdeev (Amsterdam, North-Holland: Elsevier), 1, 115 (In the text)
Lobzin, V. V., Krasnoselskikh, V. V., Josqued, J.-M., et al. 2007, Geophys. Res. Lett., 34, L05107 [CrossRef] (In the text)
Parker, E. N. 1965, Plan. Sp. Sc., 13, 9 [NASA ADS] [CrossRef] (In the text)
Press, W. H. 1987-2002, Numerical recipes (New York: Cambridge University Press) (In the text)
Schlickeiser, R. 1989, ApJ, 336, 243 [NASA ADS] [CrossRef]
Schlickeiser, R. 2002, Cosmic Ray Astrophysics (Berlin: Springer Verlag)
Scholer, M., Shinohara, I., & Matsukiyo, S. 2003, J. Geophys. Res., 108, 1014 [CrossRef]
Siewert, M., & Fahr, H.-J. 2007a, A&A, 463, 799 [NASA ADS] [CrossRef] [EDP Sciences]
Siewert, M., & Fahr, H.-J. 2007b, A&A, 471, 7 [NASA ADS] [CrossRef] [EDP Sciences]
Stone, E. C., Cummings, A. C., McDonald, F. B., et al. 2005, Science, 309, 2017 [NASA ADS] [CrossRef]
Treumann, R. A., & Scholer, M. 2002, in The Century of Space Science (Norwell: Kluwer Academic Publishers), 1495 (In the text)
Vogl, D. F., Langmayr, D., Erkaev, N. V., et al. 2003, Planet. Space Sci., 51, 715 [NASA ADS] [CrossRef]