A&A 485, 327-336 (2008)
DOI: 10.1051/0004-6361:20078911
M. Siewert - H.-J. Fahr
Argelander Institut für Astronomie der Universität Bonn, Abteilung f. Astrophysik und Extraterrestrische Forschung, Auf dem Huegel 71, 53121 Bonn, Germany
Received 24 October 2007 / Accepted 27 March 2008
Abstract
Aims. We revisit the general problem of the anisotropic MHD shock for arbitrary magnetic field inclinations, where the jump conditions are underdetermined. To describe the transition region of the shock, we derive a variant of a kinetic Boltzmann-Vlasov equation previously used to describe the perpendicular shock in the absence of dissipative processes.
Methods. We derive effective force terms, for the kinetic equation, that are based on the conservation of the Chew-Goldberger-Low (CGL) MHD invariants which appear in the standard model for anisotropic MHD. This approach is based on a generalisation of the well-known equivalence between the first CGL invariant and the integral over the magnetic moments of the underlying particles.
Results. Assuming an arbitrary distribution function on the upstream side, we integrate the kinetic equation across the shock. This result allows us to establish further relations between the MHD velocity moments on both sides. Using this additional information, we close the anisotropic MHD jump conditions. In addition, the now unique solution of the jump conditions allows us to present explicit cuts through a representative Maxwellian distribution function on both sides of the shock. In the kinetic equation, one only requires two parameters that need to be derived from the classical jump conditions, the classical MHD compression ratio and an equivalent ratio for the magnetic field strengths.
Key words: plasmas - shock waves - magnetohydrodynamis (MHD) - sun: solar wind
We published previously several kinetic studies of an ion plasma crossing an MHD shock, such as the bow shock of the Earth or the solar wind termination shock (Fahr & Siewert 2006; Siewert & Fahr 2007a,b), that attempted to improve our understanding of this area of magnetohydrodynamics. Essentially, MHD shocks have been described using one of the following approaches, each of which possesses its own, inherent flaws.
First, from the theoretical MHD side, there are Rankine-Hugoniot-like jump conditions, based on the conservation of moments such as the MHD mass and momentum fluxes, the energy flux, and the conservation of the Poynting vector flux (see, e.g. Vogl et al. 2003; Hudson 1970; Erkaev et al. 2000). This approach is based on MHD and describes only a few low-order velocity moments of the plasma flow instead of the full ion velocity distribution function
.
Therefore, this approach is well suited to the analysis of Maxwell-Boltzmann-like distribution functions, where the entire function can be parameterised using only a few velocity moments, but may be inappropriate for, say, power-law distribution functions, as those found in most cosmic ray spectra (see, e.g. Fisk & Gloeckler 2006; Schlickeiser 2002). In addition, it turns out that, for an anisotropic plasma, the jump conditions are underdetermined, where one possible parameterisation is to assume that the downstream pressure anisotropy,
,
is the free parameter (Erkaev et al. 2000). Since an anisotropic plasma may emerge in the presence of magnetic fields (Chew et al. 1956), this restriction can be traced back to the inherent MHD flaw, where one initially obtains an infinite hierarchy of moment equations (see e.g. Cercignani 1988), which needs to be truncated at an arbitrary point.
The second approach to this problem is based on experimental observations, followed by appropriate modelling of these data. The cluster mission (Escoubet et al. 1997) has observed the Earth bow shock for many years now, observing its highly nonstationary behaviour (see e.g. Lobzin et al. 2007). Another prominent set of shock-related data was taken by the Voyager 1 spacecraft, which, in late 2004, crossed the solar wind termination shock (Stone et al. 2005; Decker et al. 2005): the spacecraft observed power-law spectra of practically unchanging power indices across the shock (Cummings et al. 2006), in addition to a series of magnetic bumps and holes in the heliopause (Burlaga et al. 2006a,b), which could be explained in terms of plasma waves. Up to now, however, these experimental data can only be fitted using by an ad hoc model because a robust physical explanation of their origin does not exist yet.
A third approach to the understanding of understand MHD shocks has also emerged, that is based on numerical shock simulation models completed using powerful supercomputers (e.g. Hada et al. 2003; Scholer et al. 2003): this, in principle, allows to study not only the behaviour of the individual particles, but also the nonstationarity in the ``fine structure'' of the transition region (i.e. fields and distribution functions inside the shock). However, numerical simulations always require the introduction of boundary conditions, in addition to a numerically stable ``stepping scheme'', which, in principle, may introduce additional, unphysical terms in the underlying equations (see, e.g. Press 1987-2002, Chap. 19). A consistent description of the physical boundary conditions typically requires the simulation of a system that is much larger than the transition region of the shock itself, and resolving this region can be difficult. One example of such a situation is the heliospheric termination shock, where the heliosphere with an average radius of the order of 100 AU requires a grid size that is considerably different from the typical size of a shock, which is assumed to be of the order of a few gyroradii (Fahr & Siewert 2007; Scholer et al. 2003). For these reasons, in any large-scale simulations, it is possible to verify only the existence of a shock by using the Rankine-Hugoniot jump conditions. Results from such numerical simulations demonstrate a highly nonstationary behaviour, similar to in situ space observations, for which, however, no complete theoretical understanding exists.
Since all of these approaches are, in one way or another, incomplete, we developed an approach based on a kinetic Boltzmann-Vlasov equation, which might be able to fill the gaps between the competing descriptions. In contrast to common MHD, our approach provides a description of the entire distribution function
,
which includes all forms of nonthermal distribution functions. So far, this model allowed us to explain the conserved power-law index observed by the Voyager 1 spacecraft (Siewert & Fahr 2007a,b), as well as the magnetic bumps and holes observed by the same spacecraft in the heliosheath (Fahr & Siewert 2007; Burlaga et al. 2006b). Our results imply that, to obtain a (quasi-)stationary transition region, one requires additional physical processes in addition to the deceleration of ions and the change in electric and magnetic fields across the shock. Since, in our approach, the conservation of the mass flow transforms into a set of specific mathematical conditions, the additional physical processes to be included must be of a rather specific form as well, unless the transition region is highly nonstationary (Siewert & Fahr 2007b). This result is agrees with the nonstationarity emerging in both numerical approaches and experimental observations, which also implies that additional, internal microphysics should be taken into account. In this study, we restrict ourselves to the solar wind termination shock, which is located approximately at a similar spatial position over a significant period of time. We reinvestigate our kinetic model of the MHD shock, and derive a new, improved form of the kinetic Boltzmann equation for the general shock, which is based on a new, systematic connection between the MHD view and the per-particle view of the shock. These arguments enable to be removed most of the physical and mathematical problems that emerged in previous studies (e.g. Fahr & Siewert 2006; Siewert & Fahr 2007a,b). As a side-result, we prove the equivalence of MHD invariants and single particle invariants, by generalising the conservation of the ``magnetic CGL-moments''.
In Siewert & Fahr (2007b), we proved that the specific form of the Boltzmann equation which was used in earlier studies is inherently unable to describe the parallel MHD shock in the anisotropic CGL model. For this reason, we now rederive the Boltzmann equation, looking carefully for shortcomings and flaws in the up-to-now description, including missing terms.
We consider a more general form of the Boltzmann equation that we later develop to include nontrivial terms, such as wave-turbulence generation, diffusion and an exchange of particle number, energy and momentum between multiple particle populations, which occur when we consider processes such as charge exchange, ionization or wave-particle induced friction. Following the earlier studies of Fahr & Siewert (2006) and Siewert & Fahr (2007a,b), a sufficiently general form of the Boltzmann-Vlasov equation is given by
In a more general approach, one has to consider small fluctuations, in terms of plasma waves and turbulence. In the presence of a stationary background magnetic field, these processes are described traditionally using the Fokker-Planck equation
(see, e.g. Schlickeiser 1989; Chalov & Fahr 1998), which then introduces terms such as
![]() |
(2) |
The Boltzmann equation may also be parameterised from a physical standpoint that emphasizes not the mathematical form but the physics behind it. We write this alternate representation in the form
A general formulation of the collisionless Boltzmann equation (i.e. the Vlasov equation) in an accelerated reference frame is given by
![]() |
(7) |
![]() |
(9) |
In principle, there are two different ways to derive the factors
in their final form. There is first the per-particle approach, where no a priori limits or averages are made, that considers a single particle, with three velocity coordinates including a gyroangle, and the full force acting on this particle. Then, one has to derive the modifications to this force caused by the local field modifications inside the shock, which in turn have to be derived from the particle behaviour inside the system. In a sufficiently general case, this system of equations is complicated and has to be solved numerically, i.e. the analytical forms of the kinetic terms cannot be derived.
The other approach is what we call the semikinetic approach. From the kinetic view, we borrow the idea of taking an individual particle (or, alternatively, a narrow region in velocity space), but we describe the force terms using MHD quantities. In other words, force terms and their corresponding energy and momentum exchange are implicitly included in MHD quantities such as the bulk velocity, the partial pressures, and the magnetic field tension. These quantities must then be parameterised in a way that is consistent with the MHD jump conditions, which leads to a consistent description of the transition region using MHD quantities only, where only a small part of the MHD parameters actually needs to be modelled. This produces, however, a complicated equation because all important MHD quantities must be represented, that is the magnetic field, partial pressures, bulk velocities, and mass density, which are all related in some way. Finally, one requires a systematic relation between the MHD quantities and the individual kinetic velocities, unless all particles react in an identical way to the shock, which would be unphysical. In the remaining part of this section, we introduce a formalism than enables this to be realised in a straightforward way.
We begin with a phenomenological motivation of this formalism. First, we would like to emphasize that, in a wide variety of physical systems, the magnetic moment of the individual particles is conserved, i.e.
![]() |
(11) |
![]() |
= | ![]() |
|
= | ![]() |
||
= | ![]() |
(12) |
![]() |
(14) |
In an inertial rest frame, the normalisation of the stationary distribution function is constant, i.e.
.
In our formalism, however, we are working in an accelerated reference frame, where
.
Introducing the normalised distribution function
![]() |
(17) |
![]() |
(18) |
Next, we demonstrate that, in the absence of stochastic processes, the opposite direction of Eq. (19) is also valid. In other words, it is possible, under certain conditions, to interpet an MHD invariant in terms of an integral over per-particle invariants. Taking, representatively, the second CGL invariant, we may write
Although we do not study the physical nature of Eq. (21), we emphasize that the mostly mathematical approach to this identification requires that such an invariant must exist. In Fahr & Siewert (2008), we identified this invariant in the solar wind, following from the divergence of the plasma stream and the corresponding Parker model for the frozen-in magnetic fields (Parker 1965). Using a general expression for the corresponding velocity modification,
Therefore, lacking any better description, we make the ad-hoc assumption that both CGL invariants are conserved inside the shock. This approach requires that the magnetic field must be changing slowly. In other words, the reorientation and condensation timescale
must be much larger than the gyration timescale,
![]() |
(24) |
Finally, we consider the Eqs. (21) and (10) to evaluate the temporal derivative and obtain the expressions
![]() |
= | ![]() |
(27) |
![]() |
= | ![]() |
(28) |
![]() |
= | ![]() |
(29) |
It it worth mentioning that this equation does not depend upon the
magnetic field orientation, which is represented by the fact that no
magnetic field projections (
or
)
appears. Therefore, the
terms related to this effect derived by Fahr & Siewert (2006) must be discarded on account of mixing the semikinetic approach with the full kinetic approach. Since both approaches rely
on different amounts of averaging and other approximations, a self-consistent description of the shock must not mix these different representations.
In Siewert & Fahr (2007b), we derived restrictions for the possible form of the Fokker-Planck terms Ai, based on the concept that the average parallel velocity
vanishes in the plasma frame.
At this point, we repeat the nontrivial parts relevant for the solution of our kinetic equation. First, we emphasize that the adiabatic invariants (Eqs. (15) and (16)) are only valid in the rest frame comoving with the system. This may be understood since we derived the Fokker-Planck terms
and
using adiabatic invariants depending on the partial pressures
and
.
Conventionally, these partial pressures are taken in the ``natural'' rest frame that is comoving with the plasma, since the integral
![]() |
(34) |
In the absence of stochastic processes, any single point in phase space will remain forever a single point, which justifies the approach
0 | ![]() |
![]() |
|
= | ![]() |
||
= | ![]() |
||
= | ![]() |
(41) |
![]() |
(44) |
From this point, the rest of our formalism is rather straightforward mathematics. Taking Eqs. (42) and (43), inserting them into Eq. (1) and comparing coefficients then allows to reduce the partial differential equation to the two ordinary differential equations
![]() |
(47) |
![]() |
(48) |
Now, we may apply this formalism to the equation that we derived in this study. Obviously, the Ai given by Eqs. (31) and (32) fulfill Eq. (49), i.e. they are linear functions of their corresponding velocities. Since they are of the form
,
evaluating Eq. (50) is trivial as well because the exponential and logarithmic functions cancel out, leading to
Table 1:
Initial upstream and final downstream parameters for a single-fluid,
ion-only plasma. The downstream pressure anisotropies
are not
estimated, but are an exact result using Eq. (54). Normalised
values are used.
Since this formalism was derived in the comoving reference frame of the plasma, which must always exist no matter how complicated the microphysics in the system may be, we call this the minimal kinetic extension. In other reference frames, this extension should be similarly applicable, although the transformation of the velocity moments between the different reference frames is mathematically complicated, and not pursued further in this study.
Using the treatment of the MHD shock presented in this paper, we show a few selected results. We represent the upstream distribution function using a bi-Maxwellian function,
![]() |
(58) |
![]() |
(59) |
![]() |
(60) |
Next, we solve the anisotropic MHD jump conditions (Erkaev et al. 2000; Hudson 1970) for the upstream parameters given. In a classical (i.e. MHD-only) approach to this problem, the equations are underdetermined, and one is faced with one more downstream parameter than equations, which means that an additional equation must be derived using a different formalism. In this study, we follow Erkaev et al. (2000) and select the downstream pressure anisotropy ,
which may be described using Eq. (54). Using this approach, we finally arrive at a unique solution for the downstream parameters, which is given in Table 1. Using this solution, we derive the MHD compression ratio x and the ``field compression ratio'' B2/B1, allowing us to determine the full distribution function on the downsteam side.
![]() |
Figure 1:
Representative cuts through the distribution function at
![]() |
Open with DEXTER |
![]() |
Figure 2:
Representative cuts through the distribution function at
![]() |
Open with DEXTER |
Since the perpendicular shock has already been treated in Siewert & Fahr (2007a), and the current approach to the force terms leads to identical results, we focus on the inclined and parallel shocks. In Figs. 1 and 2, we present cuts through the distribution functions for an inclined shock (
)
on the upstream and downstream sides, at
and
,
respectively. These figures demonstrate a basic property of our solution (Eq. (46)), namely that the shock does not modify the basic shape of the distribution function, which is still of the characteristic Maxwellian form. Instead, it modifies the broadness of this distribution, i.e. the parameter
in Eq. (57), which is a function of the partial pressures. For the inclined shock, both components of the velocity are modified, which directly follows from the fact that, in this case, both coefficients Ci are not unity (i.e. in Eqs. (51) and (52), the magnetic field ratio is not 1 or x).
For the parallel shock, the situation is different. Since this approach is defined by
,
and it follows from the MHD jump conditions that the normal magnetic field is conserved, the magnetc field ratios appearing in the coefficients C i are unity, and
,
which leads to a completely unmodified perpendicular velocity. This effect is demonstrated in Figs. 3 and 4, where we present, again, cuts through the upstream and downstream distribution functions for an (almost) parallel shock. This is similar to the earlier result obtained for the perpendicular shock (Siewert & Fahr 2007a), where the parallel velocity components remain untouched.
![]() |
Figure 3:
Representative cuts through the distribution function
at
![]() ![]() |
Open with DEXTER |
![]() |
Figure 4:
Representative cuts through the distribution function
at
![]() ![]() |
Open with DEXTER |
Finally, we point out that Eq. (33) does not depend upon the behaviour of the MHD quantities inside the transition region, which is in excellent agreement with MHD, and which also implies that we indeed find a kinetic description for the MHD shock that depends essentially only on MHD quantities. However, we emphasize that this approach is, by no means, a globally complete description, but applies only under the restrictions imposed by the MHD approach to the shock. First of all, MHD requires that the electromagnetic fields are frozen-in, i.e. convected along with the background plasma. Without this requirement, there would be no motion perpendicular to the magnetic field lines, and no perpendicular shock either. Therefore, although the frozen-in field condition is derived within the framework of classical MHD, it must be valid even when all other MHD requirements fail, since otherwise, there would be no perpendicular shock. For this reason, it must be expected that the fields are still frozen-in into the system even in the transition layer of the shock, in the sense of a generalised frozen-in field condition.
In addition, the MHD approach to shocks requires that the system is charge-neutral, i.e. that there are no local electric currents present. However, in a more consistent description, one has to include both ions and electrons as separate, interacting fluids. On the other hand, the jump conditions are explicitly tailored to one single fluid, which is typically interpreted in terms of an ion flow, with the implicit asumption that the much lighter electrons are convected along with the rest of the system, and that quasineutrality is obtained. Therefore, to arrive at a more consistent description of the MHD shock, one requires a two-fluid generalization of the MHD jump conditions, including an MHD formulation of charge-neutrality on the upstream and downstream sides. This, however, opens yet another problem, namely the fact that, inside the transition region, where MHD is not applicable, quasineutrality may no longer be an absolute requirement. Considering that the electron distribution function may be quite different from the ion one, this is clearly not a trivial problem. We emphasize that even the particle-field interactions present in MHD may already be interpreted as a two-fluid system, with one fluid being composed of massive particles, and the other fluid of frozen-in fields. In light of this interpretation, interactions between multiple fluids should, in fact, be possible in the framework of MHD. Such an approach would allow to describe wave generation, by including the wavemodes as yet another separate fluid. On the kinetic level, such interactions between various components of the model is realised by upgrading the Boltzmann-Vlasov equation to a Fokker-Planck-like form, where the interactions are parameterised as diffusion coefficients. To the best of our knowledge, there exists no comparable systematic theory of interacting fluids on the MHD level yet. The closest thing to such a theory found in literature is two-fluid hydrodynamics (see Holzer & Axford 1970).
Clearly, a self-consistent solution to all of these problems may become complicated and is far beyond the scope of this present study. As a first step towards such a description, we are currently working on a multifluid generalisation of the classical MHD jump conditions. Although this work is close to completion, we point out that the ``initial problem'', i.e. the fact that the anisotropic jump conditions are not perfectly closed, appears to be only the literal tip of the iceberg; for multiple fluids, the amount of free parameters seems to be growing, which offers an excellent interface to include fluid-fluid interactions, in terms of additional conditions required to close the generalised jump conditions. In face of all these aspects, our current result must be interpreted as a working, self-consistent description of the classical, single-fluid MHD shock only, and as a basis for future work.
Taking Eq. (53), we see that the partial downstream pressures are given by
![]() |
(63) |
![]() |
(65) |
Taking the general anisotropic jump conditions (Erkaev et al. 2000) and specialising them to the parallel shock (
,
), one obtains
![]() |
= | 0 | (66) |
![]() |
= | 0 | (67) |
![]() |
= | 0 | (68) |
![]() |
= | 0 | (69) |
![]() |
= | 0 | (70) |
![]() |
= | 0, | (71) |
While, in principle, many interpretations of this behaviour are possible,
perhaps the most straightforward idea is that the transition region
of the MHD shock differs from ideal MHD predictions, and that some of the jump conditions have to be modified. Since energy and momentum are conserved quantities
even outside of MHD, and the normal magnetic field conservation is related
to many other, non-MHD plasmaphysical applications as well, the conservation
of the transverse electric field is the only MHD jump conditions which
may, perhaps, be modified by the shock. This jump condition is closely
related to the so-called frozen-in field condition (Alfvén & Fälthammar 1963),
In this study, we derived an improved version of an earlier kinetic Boltzmann equation derived by Fahr & Siewert (2006), which attempts to describe MHD shocks, such as the solar wind termination shock. Using a more strict approach in terms of reference frames and initial assumptions, we were able to eliminate the restrictions which emerged in the earlier studies, strengthening the connection between MHD and kinetic theory. This new equation fulfils the requirement derived by Siewert & Fahr (2007b) based on the conservation of the mass flow, which suggests that Eq. (33) is a self-consistent description of a basic, turbulence-free MHD shock that depends only on MHD upstream and downstream quantities, but not on the behaviour of the plasma in the transition region.
In addition, we derived what might turn out to be a new theory of per-particle invariants, derived from MHD invariants, generalising the well-known equivalence between the single-particle magnetic moment conservation and the equivalent MHD adiabatic invariant. While we have not yet been able to prove that this generalisation leads to physical expressions for all possible MHD invariants, we have found several arguments that strongly suggest that this approach works, at least, for the two adiabatic invariants appearing in the CGL theory. Our current work hints that the conservation of the second CGL invariant is related to a bulk velocity gradient parallel to
and the corresponding reaction of the frozen-in magnetic field (Fahr & Siewert 2008).
Acknowledgements
We are grateful for financial support to the DFG within the frame of the DFG-Project Fa 97/31-2.
In this appendix, we prove that Eq. (22) is always fulfilled, for arbitrary distribution functions f1(w). Writing down the temporal derivative of this expression,
one obtains the following requirement
![]() ![]() |
(A.1) |
![]() |
(A.3) |
![]() |
(A.7) |
![]() |
(A.9) |
This approach works only in the absence of stochastical processes, which destroy the uniqueness of the particle trajectories. For this reason, it may be assumed that this approach works for a broad distribution function only when stochastical processes are still absent, as it is the case for the Boltzmann-Vlasov equation. We will present a more detailed analysis under which conditions this holds in a future publication.