Free Access
Issue
A&A
Volume 532, August 2011
Article Number A68
Number of page(s) 9
Section Astrophysical processes
DOI https://doi.org/10.1051/0004-6361/201117182
Published online 22 July 2011

© ESO, 2011

1. Introduction

The scattering and the spatial diffusion of high energy particles off magnetic turbulence play a crucial role in many fields of astrophysics. For instance, they are key ingredients of Fermi acceleration processes because they directly control the efficiency and the rate of particle acceleration. They determine the properties of the confinement of astrophysical objects from jets to galaxies and clusters of galaxies, and governs the transport of the particles through interplanetary, interstellar, or intergalactic space. Diffusion has long been described by a quasi-linear theory approach (Jokipii 1966, 1973), which allows us to calculate the diffusion coefficients when the turbulent field is significantly weaker than the background field. However, in many circumstances the level of turbulence turns out to be large so that this standard picture requires extension. Several studies have examined the transport properties in strong turbulence by means of numerical simulations, e.g. Giacalone & Jokipii (1999), Casse et al. (2002), Candia & Roulet (2004), and Fatuzzo et al. (2010). Most of these investigations have focused on the situation in which a large-scale turbulence cascades toward small dissipative scales – as in the Kolmogorov scheme – and in which particles interact with gyroresonant modes of the turbulence spectrum. From the point of view of the particle, the turbulence therefore occurs on large scales, as the coherence length of the magnetic field corresponds roughly to the maximal scale of the turbulent spectrum.

However, in a variety of physical situations, the Larmor radius of the particle can exceed the coherence scale of the turbulence. The transport of particles downstream of a relativistic shock wave provides a clear example of this situation. The mean field is there mostly transverse to the flow because of Lorentz tranform effects and shock compression, and the turbulence that is excited in the shock precursor is generated on microscopic plasma skin depth scales. In this case, perpendicular diffusion at high (possibly very high) rigidity controls the transport of the particles back and forth from the shock. More generally, the high rigidity regime likely plays an important role in the deconfinement process of particles of high energy, when their Larmor radius exceeds the size of the astrophysical system. However, this high rigidity regime has received little attention so far, except for the pioneering study of Shalchi & Dosch (2009). The pitch angle scattering rate is known to increase in proportion to the square of the particle energy in this limit, but the behavior of the transverse diffusion coefficient, which is crucial in the above contexts deserves a careful analysis. This analysis is the objective of the present paper. It will be found in particular that even a weak mean field, as measured relatively to the turbulent component, can affect the scaling of the perpendicular diffusion coefficient.

The present paper describes both a theoretical and a numerical study of diffusion at high rigidity. The theoretical aspects are discussed in Sect. 2, while the numerical simulations are presented in Sect. 3. Finally in Sect. 4 we summarize our results and discuss some applications.

2. Transport of high rigidity particles with a mean field

2.1. Notations and summary of previous results

The transport of particles in magnetostatic turbulence is characterized by the reduced rigidity ρ, the level of turbulence η, and the power spectrum of magnetic fluctuations in three dimensions (3D) S3d(k). These quantities are defined as ρLc=ϵec,\begin{equation} \rho \equiv \frac{\bar r_{\rm L}}{\ell_{\rm c}} = \frac{\epsilon}{e\bar B \ell_{\rm c}}, \end{equation}(1)where \hbox{$\bar r_{\rm L}$} denotes the Larmor radius of the particle in the total (mean B0 and turbulent δB) field \hbox{$\bar B$} where \hbox{$\boldsymbol{\bar B}^2\,\equiv\,\boldsymbol{B_0}^2 + \boldsymbol{\delta B}^2$}, ϵ the energy of the particle, and c the coherence length of the fluctuations.

The turbulence level η is defined as ηδB2δB2+B02,\begin{equation} \eta\,\equiv\,\frac{\langle \boldsymbol{\delta B}^2\rangle}{\langle \boldsymbol{\delta B}^2\rangle + \boldsymbol{B_0}^2}, \end{equation}(2)where η → 0 corresponds to weak turbulence and η → 1 corresponds to pure turbulence with no mean field.

The correlation function C(r) of the random field C(r)δB(x+r)δB(x)δB2,\begin{equation} C(\vec{r})\,\equiv\, \frac{\langle \boldsymbol{\delta B}(\boldsymbol{x}+\boldsymbol{r})\boldsymbol{\delta B}(\boldsymbol{x})\rangle}{\langle \boldsymbol{\delta B}^2\rangle}, \end{equation}(3)can be written in terms of the one-dimensional power spectrum S(k) ∝ k2S3d(k) C(r)=dkS(k)sin(kr)/(kr)dkS(k)·\begin{equation} C(r)\,=\,\frac{\int\mathrm{d}k\,S(k)\,\mathrm{sin}(kr)/(kr)} {\int\mathrm{d}k\,S(k)}\cdot\label{eq:corr-func} \end{equation}(4)Casse et al. (2002) defined the coherence length as the scale at which C(r) is maximum; if the power spectrum takes the form of a broad-band truncated power-law S(k) ∝ (k/kmin) − β for kmin ≤ k ≤ kmax and zero otherwise, one finds for the coherence length c0.77kmin-1\hbox{$\ell_{\rm c}\simeq 0.77 k_{\rm min}^{-1}$}. Alternatively, one can define the coherence length as we do here, to be c0+drC(r),\begin{equation} \ell_{\rm c}\,\equiv\, \int_{0}^{+\infty}{\rm d}r\,C(r), \label{eq:corr-length} \end{equation}(5)where one then derives in a straightforward way c=π21η0+dkk-1S(k),\begin{equation} \ell_{\rm c}\,=\,\frac{\pi}{2}\frac{1}{\eta}\int_0^{+\infty} {\rm d}k\,k^{-1}\,S(k), \end{equation}(6)and the presence of 1/η results from our choice of normalization for the power spectrum 0+dkS(k)η,\begin{equation} \int_0^{+\infty} {\rm d}k\,S(k)\,\equiv\,\eta, \end{equation}(7)where in practice the spectrum is bounded between kmin and kmax. Both definitions for c coincide to within a factor close to unity. As a function of the spectrum index β, the coherence length is close to either kmin-1\hbox{$k_{\rm min}^{-1}$} on larges scale for β > 1, or to kmax-1\hbox{$k_{\rm max}^{-1}$} on small scales for β < 1.

The scattering frequency νs is defined as the reciprocal of the decorrelation time of the pitch angle of the particle, the latter being defined relative to the direction of the mean field. As discussed in Casse et al. (2002), the scattering frequency can be written νsπ3c2LkL>1k-1S(k)dkS(k)dk,\begin{equation} \nu_{\rm s} \,\approx\, \frac{\pi}{3} \frac{c}{\bar r_{\rm L}^2}\frac{\int_{k\bar r_{\rm L} >1}\, k^{-1}S(k)\, {\rm d}k}{\int S(k)\,{\rm d}k}, \end{equation}(8)an expression that extends to the strong turbulence regime the results of the quasi-linear theory. This leads to the scalings νs23ηccρβ2(ρ1)νs23ηccρ2(ρ1).\begin{eqnarray} \nu_s & \simeq & \frac{2}{3}\eta\frac{c}{\ell_{\rm c}} \rho^{\beta-2} \quad (\rho \ll 1) \nonumber\\ \nu_s & \simeq & \frac{2}{3}\eta \frac{c}{\ell_{\rm c} \rho^2} \quad (\rho \gg 1). \end{eqnarray}(9)The Bohm scaling holds only in the very special case where β = 1. In addition to these quantities, the notion of correlation time also plays an important role because it measures the time beyond which a particle experiences a force that is decorrelated from the initial one, along the particle trajectory. It is then defined as τc0+C(|Δx(τ)|)dτ,\begin{equation} \label{eq:corr-time} \tau_{\rm c} \equiv \int_0^{+\infty} C(\vert \Delta x(\tau) \vert) {\rm d}\tau, \end{equation}(10)where Δx(τ) represents the displacement after a time τ in the turbulence. In quasi-linear theory, only the unperturbed trajectory is inserted into this definition, although one can extend that definition with a diffusive trajectory as we later indicate.

If a relativistic particle travels over a coherence length of the turbulent field without having displayed any wiggle, corresponding to the regime ρ ≫ 1, then τc ~ c/c. This correlation time is much shorter than the scattering time νs-1~η-1ρ2c/c\hbox{$\nu_{\rm s}^{-1}\sim \eta^{-1}\rho^2 \ell_{\rm c}/c$} in this regime. The correlation time τc can be recovered from Eq. (10) by using the ballistic approximation Δx(τ) ≃ , which is appropriate in this regime ρ ≫ 1, in which case Eq. (4) leads to τc=(π/2)(β1)β-1kmin-1/c~c/c\hbox{$\tau_{\rm c}=(\pi/2)(\beta-1)\beta^{-1}k_{\rm min}^{-1}/c\sim \ell_{\rm c}/c$}. We note that in the special case where the power-law index of turbulence β = 1 (Bohm regime) Eqs. (4) and (5) lead to c = (λmin/4)log (λmax/λmin), where λmin and λmax are the shortest and the longest wavelengths of turbulence.

If a particle experiences a chaotic motion on a length-scale smaller than c, corresponding to the regime ρ ≪ 1, then the estimate is more complicated to obtain but one finds that τc ~ ρβc/c as follows. Since the correlation time remains shorter than the scattering time, Casse et al. (2002) proposed a heuristic estimate in which decorrelation arises out of the small-scale modes with wavenumber k > kminρ-1, which give rise to gyroresonant interactions with the particle of rigidity ρ. The modes with wavelengths longer than the Larmor radius (i.e., k < kminρ-1) construct the field line to which the particle is attached, hence do not cause decorrelation on timescales shorter than the scattering time. The above correlation time is indeed shorter than the scattering time and increases with ρ. The heuristic estimate for ρ < 1 is consistent with quasi-linear theory when η ≪ 1 and with numerical results in the strong turbulence regime (Casse et al. 2002) can then be written as τc1ηck>kminρ-1dkk-1S(k),\begin{equation} \tau_{\rm c} \simeq \frac{1}{\eta c}\int_{k> k_{\rm min}\rho^{-1}} {\rm d}k\, k^{-1} S(k), \end{equation}(11)which bears some resemblance to the case discussed before for ρ ≫ 1, except that ρ explicitly enters the sinc function, since one must now follow the orbit of the particle around the field line, and the integral is limited to k > kminρ-1 for the reasons given above. The calculation then implies that τc ~ ρβc/c as announced. The particle trajectory undergoes decoherence before traveling c because of the large number of wiggles in the random field.

Thus, except for η ~ 1 and ρ ~ 1 for which the correlation time becomes comparable to the scattering time, a Markovian theory of the scattering process is appropriate, even if the turbulence is strong, stronger even than the mean field. This is an essential key for the present discussion.

Independently of the rigidity, the parallel diffusion coefficient is always given by D = c2/(3νs), even in the strong regime of turbulence. As for the transverse diffusion coefficient, in the strong regime at low rigidities, it does not follow a law similar to the quasi-linear result but is proportional to D (Casse et al. 2002) because of the magnetic field line wandering that transmits parallel diffusion in the transverse direction. Casse et al. (2002) found in particular that D = η2.3D at small rigidities, which rules out the conjecture of Bohm’s diffusion. In the next section, we discuss the transverse diffusion in the large rigidity regime.

2.2. Transverse diffusion at large rigidity

As mentioned previously, in the large rigidity regime ρ ≫ 1, the correlation time is (much) shorter than the scattering time, hence we expect to derive the parallel and transverse diffusion coefficients using a Markovian description of the trajectory. In particular when ρ ≫ 1, the velocity changes by 1/ρ only over a correlation time. This implies that significant changes in the velocity occur on timescales that are much longer than the correlation time. Therefore we can assimilate the effect of small-scale fluctuations to a fully decorrelated white noise on the relevant timescales.

To calculate the particle transport in a random field, one has to use the solution of the differential equation that governs the evolution of the particle velocity vddtv=[Ω̂0+δΩ̂(t)]·v.\begin{equation} \frac{\rm d}{{\rm d}t} \vec{v} = \left[{\boldsymbol {\hat \Omega_0}} + {\boldsymbol {\delta \hat \Omega(t)}}\right] \cdot \vec{v}. \end{equation}(12)The quantities Ω̂0\hbox{${\boldsymbol {\hat \Omega_0}}$} and \hbox{${\boldsymbol {\delta \hat \Omega}}(t)$} are rotation operators developed as linear combinations of the generators of the Lie algebra of the rotation group, 1\hbox{$\boldsymbol{\hat L_1}$}, 2\hbox{$\boldsymbol{\hat L_2}$}, 3\hbox{$\boldsymbol{\hat L_3}$}1:=(00000-1010),2:=(001000-100),3:=(0-10100000).\begin{equation} \hat L_1\, := \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \\ \end{pmatrix}, \hat L_2\, := \begin{pmatrix} 0 & 0 & 1 \cr 0 & 0 & 0\cr -1 & 0 & 0 \\ \end{pmatrix}, \hat L_3\, := \begin{pmatrix} 0 & -1 & 0 \cr 1 & 0 & 0 \cr 0 & 0 & 0 \\ \end{pmatrix}. \end{equation}(13)In detail, Ω̂0=Ω0B0ii/B0\hbox{$\boldsymbol{\hat\Omega_0}=\Omega_0 B_0^i \boldsymbol{\hat L_i}/B_0$}, where B0i\hbox{$B_0^i$} denotes the ith component of B0 and Ω0 ≡ c/rL,0 the Larmor pulsation defined with respect to the mean field. With this notation, \hbox{${\boldsymbol {\hat \Omega_0}} \cdot \vec{v} = \Omega_0 \vec{v}\times\vec{B_0}/B_0$}. The operator \hbox{${\boldsymbol {\delta \hat \Omega}}(t)$} is decomposed in a similar way as the generators of the rotation group, and δΩ ≡ c/rL, where rL is now measured relatively to δB.

To solve the equation of motion, one uses an auxiliary variable u that is defined as v(t)0(t)·u(t),\begin{equation} \vec{v}(t)\,\equiv\, {\boldsymbol{\hat R_0}}(t)\cdot\vec{u}(t), \end{equation}(14)where 0(t)exp(tΩ̂0).\begin{equation} {\boldsymbol{\hat R_0}}(t)\,\equiv\,\exp\left(t{\boldsymbol {\hat \Omega_0}}\right). \end{equation}(15)We then define ˜Ωˆ(t)0(t)-1·δΩ̂(t)·0(t),\begin{equation} \boldsymbol {\hat {\tilde \Omega}}(t) \equiv \boldsymbol {\hat R_0}(t)^{-1}\cdot \boldsymbol {\delta \hat \Omega}(t)\cdot \boldsymbol{\hat R_0}(t), \end{equation}(16)one finds that u(t) obeys ddtu=˜Ωˆ(t)·u.\begin{equation} \frac{\rm d}{{\rm d}t}\vec{u} = \boldsymbol{\hat{\tilde \Omega}}(t) \cdot \vec{u}. \end{equation}(17)This equation is solved as u(t)=𝒯exp[0t˜Ωˆ(t)dt]·u(0).\begin{equation} \vec{u} (t)\,=\,{\cal T} \exp\left[\int_0^t \boldsymbol{\hat{\tilde \Omega}}(t')\, {\rm d}t'\right] \cdot \vec{u}(0). \end{equation}(18)Because the operator in the exponent is time dependent, to preserve the exponential character of the solution, a time-ordering operator \hbox{$\cal{T}$} has to be introduced, as we now explain.

We note that u(0) = v(0), thus the solution for v is given by v(t)=0(t)·𝒯exp[0t˜Ωˆ(t)dt]·v(0).\begin{equation} \vec{v}(t)\,=\,\boldsymbol{\hat R_0}(t)\cdot {\cal T} \exp\left[\int_0^t \boldsymbol{\hat{\tilde \Omega}}(t')\, {\rm d}t'\right] \cdot \vec{v}(0). \label{eq:sol-v} \end{equation}(19)The regular part of the field generates the regular rotation matrix \hbox{$\boldsymbol{\hat R_0}(t)$}, while the exponential accounts for the effect of the turbulent part. The time-ordering operator \hbox{${\cal T}$} maintains the chronological order of the products in the non-commuting ˜Ωˆ(tk)\hbox{$\hat{\tilde \Omega}(t_k)$} in the expansion of the exponential operator, i.e. 𝒯Ω̂(t1)·Ω̂(t2)=Ω̂(t1)·Ω̂(t2)ift1>t2,=Ω̂(t2)·Ω̂(t1)ift2>t1,\begin{eqnarray} {\cal T} \boldsymbol{\hat \Omega}(t_1) \cdot\boldsymbol{ \hat \Omega}(t_2) &=& \boldsymbol{\hat \Omega}(t_1) \cdot \boldsymbol{\hat \Omega}(t_2) \, \, {\rm if} \, \, t_1>t_2, \nonumber\\ &=& \boldsymbol{\hat \Omega}(t_2) \cdot\boldsymbol{ \hat \Omega}(t_1) \, \, {\rm if} \, \, t_2>t_1, \end{eqnarray}(20)and so on for higher order products. Alternatively, the time-ordered expansion can be written as a Dyson series 𝒯exp[0t˜Ωˆ(t)dt]1+n=1n=+0tdt1...0tn1dtn˜Ωˆ(t1)...˜Ωˆ(tn).\begin{eqnarray} {\cal T} \exp\left[\int_0^t \boldsymbol{\hat{\tilde \Omega}}(t')\, {\rm d}t'\right]&\equiv& 1 \nonumber\\ &\!\!\!\!\! +&\sum_{n=1}^{n=+\infty} \int_{0}^{t}{\rm d}t_1\ldots\int_{0}^{t_{n-1}}{\rm d}t_n \,\boldsymbol{\hat{\tilde \Omega}}(t_1)\ldots\boldsymbol{\hat{\tilde \Omega}}(t_n).\nonumber\\ &&\label{eq:Dyson-series} \end{eqnarray}(21)We now use the following theorem that holds for a Gaussian stationary random process in the white noise limit. As discussed in detail in the Appendix, this is a direct generalization to any Lie algebra of a well-known result for a scalar random process, with no other restriction than the white noise assumption 𝒯exp[0t˜Ωˆ(t)dt]=\begin{eqnarray} &&\left\langle{\cal T} \exp\left[ \int_0^t \boldsymbol{\hat {\tilde \Omega} }(t')\, {\rm d}t' \right]\right\rangle =\nonumber \\ && \quad {\cal T}\exp\left[\frac{1}{2} \int_0^t {\rm d}t_1 \int_0^t {\rm d}t_2\, \left\langle \boldsymbol{\hat {\tilde \Omega}}(t_1) \cdot \boldsymbol{\hat{\tilde \Omega}}(t_2)\right\rangle\right]. \label{eq:av-Tord} \end{eqnarray}(22)Various properties of the turbulent field can be considered i.e. that is either isotropic with no helicity, isotropic with helicity, or anisotropic with rotation invariance in the transverse direction. All these cases can be easily treated, although we focus on two relevant cases: (A) 3D isotropic turbulence and (B) two-dimensional (2D) isotropic turbulence in the plane transverse to B0, with δB·B0 = 0.

We define the projection operators π̂\hbox{$\boldsymbol{\hat \pi_{\perp}}$} on the plane transverse to B0 and the projection operator π̂\hbox{$\boldsymbol{\hat \pi_{\parallel}}$} along B0. We now define the correlation function of the random rotation matrices δΩˆ\hbox{$\boldsymbol{\hat{\delta \Omega}}$}δΩˆ(t1)δΩˆ(t1)=δΩi(t1)δΩj(t2)ij,\begin{equation} \langle\boldsymbol{\hat{\delta \Omega}}(t_1)\boldsymbol{\hat{\delta \Omega}}(t_1)\rangle\,=\, \langle \delta\Omega^i(t_1)\delta\Omega^j(t_2)\rangle\boldsymbol{\hat L_i}\boldsymbol{\hat L_j}, \end{equation}(23)where δΩi(t1)δΩj(t2)=2τcδ(t1t2)[12δΩ2π̂ij+δΩ2π̂ij].\begin{equation} \langle \delta\Omega^i(t_1)\delta\Omega^j(t_2)\rangle\,=\,2\tau_{\rm c}\delta(t_1-t_2)\,\left[\frac{1}{2}\langle\delta\Omega_\perp^2\rangle\boldsymbol{\hat \pi_\perp}^{ij} + \langle\delta\Omega_\parallel^2\rangle\boldsymbol{\hat\pi_\parallel}^{ij}\right]. \end{equation}(24)The scalars δΩ2\hbox{$\langle\delta\Omega_\parallel^2\rangle$} and δΩ2\hbox{$\langle\delta\Omega_\perp^2\rangle$} characterize the relative strengths of the turbulence in the parallel (to B0) and perpendicular directions. In particular, for 3D isotropic turbulence, δΩ2=2δΩ2\hbox{$\langle\delta\Omega_\perp^2\rangle=2\langle\delta\Omega_\parallel^2\rangle$}, in which case the above correlator becomes proportional to the identity. Then, using the properties of π̂\hbox{$\boldsymbol{\hat\pi_\perp}$}, π̂\hbox{$\boldsymbol{\hat\pi_\parallel}$} and the i\hbox{$\boldsymbol{\hat L_i}$}, one finds δΩˆ(t1)δΩˆ(t1)=2τcδ(t1t2)×[δΩ2δΩ2π̂12δΩ2π̂],\begin{eqnarray} \langle\boldsymbol{\hat{\delta \Omega}}(t_1)\boldsymbol{\hat{\delta \Omega}}(t_1)\rangle &\,=\,& -2\tau_{\rm c}\delta(t_1-t_2) \nonumber\\ &&\,\,\times\,\left[ \langle\delta\Omega^2\rangle\boldsymbol{\hat{1}} - \langle\delta\Omega_\parallel\rangle^2\boldsymbol{\hat\pi_\parallel} -\frac{1}{2}\langle\delta\Omega_\perp^2\rangle\boldsymbol{\hat\pi_\perp}\right], \end{eqnarray}(25)where δΩ2δΩ2+δΩ2\hbox{$\langle\delta\Omega^2\rangle\equiv \langle\delta\Omega_\parallel\rangle^2+ \langle\delta\Omega_\perp^2\rangle$}. We note that the above correlation function holds for δΩˆ\hbox{$\boldsymbol{\hat{\delta \Omega}}$}, which should not be confused with ˜Ωˆ\hbox{$\boldsymbol{\hat{\tilde \Omega}}$}, the latter being the quantity of relevance for calculating the transport properties, as expressed in Eq. (22). However, ˜Ωˆ(t1)˜Ωˆ(t2)=et1Ω̂0δΩˆ(t1)e(t2t1)Ω̂0δΩˆ(t2)et2Ω̂0,\begin{equation} \langle\boldsymbol{\hat{\tilde \Omega}}(t_1)\boldsymbol{\hat{\tilde \Omega}}(t_2)\rangle = e^{-t_1\boldsymbol{\hat \Omega_0}}\langle\boldsymbol{\hat{\delta \Omega}}(t_1)e^{-(t_2-t_1)\boldsymbol{\hat \Omega_0}} \boldsymbol{\hat{\delta \Omega}}(t_2)\rangle e^{t_2\boldsymbol{\hat \Omega_0}}, \end{equation}(26)and, because \hbox{$\left[\boldsymbol{\hat\pi_\parallel},e^{t\boldsymbol{\hat \Omega_0}}\right]=\left[\boldsymbol{\hat\pi_\perp},e^{t\boldsymbol{\hat \Omega_0}}\right]=0$}, the correlation function for ˜Ωˆ\hbox{$\boldsymbol{\hat{\tilde \Omega}}$} is the same as that for δΩˆ\hbox{$\boldsymbol{\hat{\delta \Omega}}$}.

Using Eq. (22), one then finds the solution for v: v(t)=0(t)·exp[120tdt10tdt2˜Ωˆ(t1)˜Ωˆ(t2)]v(0),\begin{equation} \langle\vec{v}(t)\rangle\,=\,\boldsymbol{\hat R_0}(t)\cdot \exp\left[\frac{1}{2}\int_{0}^t{\rm d}t_1 \int_{0}^t{\rm d }t_2\,\langle\boldsymbol{\hat{\tilde \Omega}}(t_1)\boldsymbol{\hat{\tilde \Omega}}(t_2)\rangle\right]\vec{v}(0), \end{equation}(27)where the average is taken over the possible realizations of the turbulent field. This leads to v(t)=0(t)·exp[tτc(δΩ2δΩ2π̂12δΩ2π̂)]v(0).\begin{equation} \langle\vec{v}(t)\rangle\,=\,\boldsymbol{\hat R_0}(t)\cdot\exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle - \langle\delta\Omega_\parallel^2\rangle\boldsymbol{\hat\pi_\parallel}- \frac{1}{2}\langle\delta\Omega_\perp^2\rangle \boldsymbol{\hat\pi_\perp}\right)\right]\vec{v}(0). \end{equation}(28)Using the properties of π̂\hbox{$\boldsymbol{\hat\pi_\parallel}$} and π̂\hbox{$\boldsymbol{\hat\pi_\perp}$}, this can be rewritten as v(t)=0(t)·{exp[tτc(δΩ2δΩ2)]π̂+exp[tτc(δΩ212δΩ2)]π̂}v(0).\begin{eqnarray} \langle\vec{v}(t)\rangle&\,=\,& \boldsymbol{\hat R_0}(t)\cdot\Biggl\{\exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\langle\delta\Omega_\parallel^2\rangle\right)\right]\boldsymbol{\hat\pi_\parallel}\nonumber\\ && + \exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\frac{1}{2}\langle\delta\Omega_\perp^2\rangle\right)\right]\boldsymbol{\hat\pi_\perp}\Biggr\}\vec{v}(0). \end{eqnarray}(29)Therefore, one derives the general results v(0)v(t)=exp[tτc(δΩ2δΩ2)]v(0)2.\begin{equation} \langle v_\parallel(0) v_\parallel(t)\rangle \,=\, \exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\langle\delta\Omega_\parallel^2\rangle\right)\right] \langle v_\parallel(0)^2\rangle\label{eq:parv-corr}. \end{equation}(30)In the transverse direction, v(0)·v(t)=exp[tτc(δΩ212δΩ2)]×Tv(0)·0·v(0)=exp[tτc(δΩ212δΩ2)]\begin{eqnarray} \langle\boldsymbol{v_\perp}(0)\cdot\boldsymbol{v_\perp}(t)\rangle&=& \exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\frac{1}{2}\langle\delta\Omega_\perp^2\rangle\right)\right]\,\nonumber\\ &&\quad\quad\times\, ^{\rm T}\boldsymbol{v_\perp}(0)\cdot\boldsymbol{\hat R_0}\cdot\boldsymbol{v_\perp}(0)\nonumber\\ &=& \exp\left[-t\tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\frac{1}{2}\langle\delta\Omega_\perp^2\rangle\right)\right]\nonumber\\ &&\quad\quad\times\,\cos\left(\Omega_0 t\right) \,\langle \boldsymbol{v_\perp}(0)^2\rangle.\label{eq:perpv-corr} \end{eqnarray}(31)The last equality follows from developing the exponential \hbox{$\boldsymbol{\hat R_0}=\exp\left(t\boldsymbol{\hat \Omega_0}\right)$}, noting that \hbox{$\boldsymbol{\hat \Omega_0}=\Omega_0 \boldsymbol{\hat L_3}$} for B0 oriented along z, \hbox{$\boldsymbol{\hat L_3}^{2n}=(-1)^n\boldsymbol{\hat\pi_\perp}$}, \hbox{$\boldsymbol{\hat L_3}^{2n+1}=(-1)^n\boldsymbol{\hat L_3}$}, \hbox{$^{\rm T}\boldsymbol{v_\perp}(0)\cdot\boldsymbol{\hat\pi_\perp} \cdot\boldsymbol{v_\perp}(0)=\langle \boldsymbol{v_\perp}(0)^2\rangle$}, and \hbox{$^{\rm T}\boldsymbol{v_\perp}(0)\cdot\boldsymbol{\hat L_3} \cdot\boldsymbol{v_\perp}(0)=0$}.

The parallel D and perpendicular D ⊥  diffusion coefficients are directly obtained from the correlation functions of the velocity components after averaging over the initial velocities D=+0dtv(0)v(t),D=120+dtv(0)·v(t).\begin{eqnarray} D_\parallel&\,=\,& \int_0^{+\infty} {\rm d}t\, \langle v_\parallel(0)v_\parallel(t)\rangle,\nonumber\\ D_\perp&\,=\,& \frac{1}{2}\int_0^{+\infty} {\rm d}t\, \langle \boldsymbol{v_\perp}(0)\cdot\boldsymbol{v_\perp}(t)\rangle. \end{eqnarray}(32)Using Eqs. (30) and (31), this leads to D=13c2νs,D=\begin{eqnarray} D_\parallel&\,=\,& \frac{1}{3}\frac{c^2}{\nu_{\rm s}},\nonumber\\ D_\perp&\,=\,& \frac{1}{3}c^2\frac{\nu_\perp}{\nu_\perp^2+\Omega_0^2}, \label{eq: coeff-dperp} \end{eqnarray}(33)where νs=τc(δΩ2δΩ2),ν=τc(δΩ212δΩ2).\begin{eqnarray} \nu_{\rm s}&\,=\,& \tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\langle\delta\Omega_\parallel^2\rangle\right), \nonumber\\ \nu_\perp&\,=\,& \tau_{\rm c}\left(\langle\delta\Omega^2\rangle-\frac{1}{2} \langle\delta\Omega_\perp^2\rangle\right). \end{eqnarray}(34)These expressions for D ⊥  are formally similar to the results of the so-called classical diffusion theory, although they are obtained here under different physical assumptions; in particular, a strong turbulence situation is assumed.

In case (A), for 3D isotropic turbulence, δΩ2=2δΩ2=23δΩ2\hbox{$\langle\delta\Omega_\perp^2\rangle= 2\langle\delta\Omega_\parallel^2\rangle=\frac{2}{3}\langle\delta\Omega^2\rangle$}, so that ν=νs=23δΩ2τc=23ηρ2cc·\begin{equation} \nu_\perp=\nu_{\rm s}= \frac{2}{3}\langle\delta\Omega^2\rangle \tau_{\rm c} = \frac{2}{3}\frac{\eta}{\rho^2}\frac{c}{\ell_{\rm c}}\cdot \end{equation}(35)One may note that the expression for νs matches that derived from a random walk argument for pitch angle diffusion. We also note that the above calculation for νs may be applied to the regime ρ ≪ 1, as long as the correlation time is shorter than the scattering time. This is true in the case of νs = (2/3)ηρβ − 2, which is the standard quasi-linear theory result. The result for the perpendicular coefficient cannot, of course, be extended to the regime ρ ≪ 1, as the above calculation does not account for field line wandering.

In case (B), for 2D transverse isotropic turbulence, δΩ2=0\hbox{$\langle\delta\Omega_\parallel^2\rangle=0$}, δΩ2=δΩ2\hbox{$\langle\delta\Omega_\perp^2\rangle=\langle\delta\Omega^2\rangle$}, hence νs=2ν=δΩ2τc.\begin{equation} \nu_{\rm s}=2\nu_\perp=\langle\delta\Omega^2\rangle \tau_{\rm c}. \end{equation}(36)This demonstrates that the transverse diffusion coefficients follows the scalings, which we express here for case (A), i.e., isotropic turbulence DD12ccρ2/η(1ρ/B0),D\begin{eqnarray} D_\perp &\simeq& D_\parallel\,\simeq\,\frac{1}{2}c\ell_{\rm c}\rho^2/\eta\quad (1\ll\rho \ll \bar B/B_0),\nonumber\\ D_\perp &\simeq& \frac{2}{9}c\ell_{\rm c} \frac{\bar B^2}{B_0^2}\quad (\bar B/B_0\ll\rho).\label{eq:dperp_th} \end{eqnarray}(37)The transition between these two regimes takes place at \hbox{$\rho \sim \bar B/B_0\simeq \eta^{1/2}(1-\eta)^{-1/2}$}, corresponding to νs ~ Ω0. At larger rigidities, the perpendicular diffusion coefficient remains constant, while the parallel diffusion coefficient continues to increase as ρ2.

This result is supported by the numerical simulation that we now discuss.

3. Numerical simulation of the transport with a mean field for high rigidities

3.1. Numerical set up

A Monte Carlo strategy is adopted to measure the diffusion coefficients by integrating a large number of particle trajectories in given turbulent magnetic field configurations. Averages are then performed and statistical values of the diffusion coefficients deduced. The numerical set up is presented hereafter: we first discuss the construction of the magnetic field, then the integration of particle motion from Lorentz-Newton equation, and finally the estimates of the diffusion coefficients.

The total magnetic field is expressed as B = B0 + δB as before. The regular field is oriented along z, and δB is assumed to be isotropic in the three dimensions. An algorithm similar to Giacalone & Jokipii (1999) is used to construct the turbulent component of magnetic field δB by summing over plane wave modes (Nmod) with turbulent wavelengths extending from Lmin = 1 ≡ 2π/kmax to Lmax ≡ 2π/kmin, the power spectrum following a truncated power law between kminand kmax. In detail

δB(x)=nGn(kn)ξncos(kn·x+βn).\begin{eqnarray} \boldsymbol{\delta B}(\boldsymbol{x}) = \sum_n G_n(k_n) \boldsymbol{\xi_n}\cos \left(\boldsymbol{k_n}\cdot\boldsymbol{x}+\beta_n\right). \end{eqnarray}(38)With Fourier modes of amplitude Gn, and wave vectors kn=2πLnek\hbox{$\boldsymbol{k_n}=\frac{2\pi}{L_n}\boldsymbol{e_k}$} isotropically distributed, the unitary vector ξn is perpendicular to kn in order to ensure that ·δB=0, and βn ∈  [0,2π] represents the random phase. The power spectrum is normalized by the turbulence parameter η introduced earlier such that δB2=B02η/(1η)\hbox{$\langle\boldsymbol{\delta B}^2\rangle=B_0^2\eta/(1-\eta)$}. For definiteness, the mode amplitudes are constructed according to a Kolmogorov cascade with logarithmic spacing between wavenumbers: Gnkn5/3\hbox{$G_n \propto k_n^{-5/3}$}. We note that the details of the inertial range of the turbulence are not important because we are interested in the scattering properties at large rigidities, when the particle Larmor radius \hbox{$\bar r_{\rm L}$} is larger than all turbulent length-scales. For a detailed presentation of the numerical turbulent magnetic field construction, the reader is referred to Sect. 2.B of Casse et al. (2002) and Sect. 3 of Giacalone & Jokipii (1999).

Several tests of the dynamic range of turbulence Lmax/Lmin and the magnetic wave-modes Nmod were performed. The main difficulty is that the scattering timescales increase as a square of particle rigidity. For large rigidities, it is thus difficult to preserve the accuracy with time when achieving particle diffusion together with a realistic magnetic field model. To develop a simulation that operates over a few scattering times, one needs to achieve an integration time of at least \hbox{$100 \rho \bar r_{\rm L}/c$}, as in our simulations. One must also strike a compromise with the number of plane wave modes to save computational time; values of order 200–300 have emerged as a satisfactory compromise between accuracy and calculation time. To save computational time, and because the small scales of the turbulent cascade are of little influence, the dynamic range has been shortened to Lmin/Lmax = 0.1. Tests performed with a larger dynamic range have provided similar results; the highest accuracy is obtained when modes are concentrated on the largest scale. It is explained physically by the high energy particles interacting only with the largest magnetic structures.

Particle motion is solved using the Lorentz-Newton equation of motion that preserves its energy, hence its Lorentz factor γdvdt=qγmcv×(B0+δB).\begin{eqnarray} \frac{{\rm d}\boldsymbol{v}}{{\rm d}t } = \frac{q}{\gamma m c}\boldsymbol{v} \times \left(\boldsymbol{B_0}+ \boldsymbol{\delta B}\right). \label{eq:Lorentz-Newton} \end{eqnarray}(39)At this point, we define the numerical rigidity \hbox{$\rho' \equiv 2 \pi \bar r_{\rm L} / L_{\rm max}$}, which differs from the previous physical definition by a numerical factor of order unity, as discussed earlier. The exact relation between c and Lmax/2π depends on the dynamic range and the power-law index of turbulence. In the following, the conversion factor between both rigidities is derived using c ≃ 0.1Lmax, a good approximation for a Kolmogorov-type spectrum.

The numerical integration of Eq. (39) is performed using a Bulirsch-Stoer schema (Press et al. 1986). Once a large number of particle trajectories were calculated and stored, statistical averages instead being performed. Given the number of particles Np for each field realization and the number of field realizations Nfield, the diffusion tensor coefficient (i,j) is evaluated as Dij(t)=12NpNfieldn=1Nfieldk=1Np(xi(t)xi(t0))(xj(t)xj(t0))n,ktt0=ΔxiΔxj2Δt·\begin{eqnarray} D_{ij}(t) & = & \frac{1}{2N_{\rm p}N_{\rm field}}\sum_{n=1}^{N_{\rm field}}\sum_{k=1}^{N_{\rm p}} \frac{(x_i(t)-x_i(t_0))(x_j(t)-x_j(t_0))_{n,k}}{t-t_0} \nonumber\\ & = & \frac{\left\langle \Delta x_i \Delta x_j\right\rangle}{2\Delta t}\cdot \end{eqnarray}(40)The average is performed over different particle trajectories and different field realizations. For each value of ρ, we take Nfield × Np = 103 different trajectories with random initial velocity directions. The asymptotic value for t → ∞ (plateau) is roughly constant and defines the actual diffusion regime. It gives the diffusion coefficient as Dij as t → ∞, precisely when tνs-1\hbox{$t \gg \nu_{\rm s}^{-1}$}. This method of coefficient estimation appears precise enough for an integration involving 103 particles. A complementary technique consists of evaluating time correlations between velocities over particle trajectories. With 1000 particles in the transport regime studied here, this method is affected by numerical noise for the velocity correlation function, hence is not presented.

Two different cases were investigated numerically: a pure turbulence situation (B0 = 0) and a strong turbulent case with δB ≫ B0. Results are presented in the following sub-sections.

3.2. Pure turbulence B0 = 0

These simulations were performed to test the correctness and accuracy of the code. On theoretical grounds (see the appendix of Casse et al. 2002; Aloisio et al. 2004; Pelletier et al. 2009) and previous numerical works (Parizot 2004), we expect the diffusion coefficient to evolve as the square of energy (e.g. rigidity) when ρ′ ≫ 1.

Here we set η = 1 and δB isotropically distributed by construction, so that the three space directions are equivalent. The equivalence of the three directions was numerically verified in our simulations. The diffusion coefficient is evaluated as Diso=Δx2+Δy2+Δz26Δt·\begin{equation} D_{\rm iso} = \frac{\left\langle \Delta x^2\right\rangle+ \left\langle \Delta y^2 \right\rangle+ \left\langle \Delta z ^2\right\rangle}{6 \Delta t}\cdot \end{equation}(41)Figure 1 shows numerical values calculated for ρ′ going from 1 to 100. The diffusion coefficient is plotted in units of cLmax/(2π) as a function of rigidity ρ′. A power law is observed for 1 < ρ′ < 100, as predicted by the theory, namely Diso ∝ ρ′2 ∝ ϵ2. One may be able to discern a slight deviation at ρ′ close to 100. This purely numerical effect disappears when taking a larger number of magnetic wave-modes on scales close to Lmax by defining Lmin/Lmax ~ 1. We retain however the current field configuration, taking this effect into account when interpreting the results.

thumbnail Fig. 1

The diffusion coefficient variation is plotted in units of cLmax/(2π) as a function of rigidity ρ′ in pure turbulence (B0 = 0 or η = 1). The dashed line is drawn as a reference for a scaling Diso ∝ ρ′2. For ρ′ > 1, Diso is indeed proportional to ρ2.

3.3. Weak mean field B0 < δB

We now consider the case where a constant weak mean field B0 along the z direction is present. In this case, two different diffusion coefficients are defined D = Dzz and D ⊥  = (Dxx + Dyy)/2. Overall, we explored five different levels of turbulence η =  { 0.5,0.9,0.99,0.999,0.9999 } , spanning five orders of magnitude in δB2/B02\hbox{$\delta B^2/B_0^2$}. The rigidity ρ′ ranges from 1 to 100 for each value of η. At each calculation point  { η,ρ } , the coefficients are evaluated by averaging over 103 particles (10 particles  ×  100 field realizations), as before.

As shown in Fig. 2, the parallel diffusion coefficient retains the same dependence on rigidity as in pure turbulence, D ∝ ρ′2. For η > 0.5, the turbulence level has almost no influence on the value of D0.9cρL\hbox{$D_{\parallel} \simeq 0.9 c\bar r_{\rm L}\rho'$}. Therefore as expected, the mean field, as long as it remains weak enough, has no influence on the diffusion of particles along its direction.

thumbnail Fig. 2

The parallel diffusion coefficient D plotted in units of (cLmax/(2π)) as a function of ρ′ for different degrees of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}. Here D ∝ ρ2, as in the case of purely isotropic turbulence without mean field. As long as δB2/B021\hbox{$\delta B^2 / B_0^2\gg1$}, the strength of the turbulence does not influence the normalization of D.

The picture is different for the transverse coefficient when the particle rigidity becomes large. In Fig. 3, the simulated transverse diffusion coefficient is plotted as a function of rigidity ρ′ for different degrees of turbulence. In each case, its value saturates to a constant value when ρ′ ~ δB/B0. This value behaves proportionally to the turbulence degree; in detail, D0.13c(Lmax/2π)δB2/B02\hbox{$D_{\perp} \simeq 0.13 c (L_{\rm max}/2\pi)\delta B^2/B_0^2$}, in excellent agreement with our theoretical prediction from Eq. (37). Individual particle trajectories reveal a weakly perturbed helical path when ρ′ ≫ 1. Therefore, a strong small-scale turbulence acts as a collection of small-scale scattering centers, each producing a small deflection.

thumbnail Fig. 3

The transverse diffusion coefficient D ⊥  plotted in units of cLmax/(2π) as a function of ρ′ for different degrees of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}. The diffusion coefficient saturates at ρ′ ~ δB/B0. Below this value, its behavior is similar to the parallel diffusion coefficient. Beyond ρ′, its value becomes independent of particle rigidity.

According to the theory, D is the limit of a function c2g(t)/3 as t → ∞, precisely as t > ts, the function being g(t)=1eνstνs·\begin{equation} g_{\parallel}(t) = \frac{1-e^{-\nu_{\rm s}t}}{\nu_{\rm s}}\cdot \end{equation}(42)In a similar way D ⊥  is the limit of a function c2g ⊥ (t)/3 as t → ∞, in addition to when t > ts, the function being g(t)=νsΩ02+νs2{1eνst[cos(Ω0t)Ω0νssin(Ω0t)]}·\begin{equation} g_{\perp}(t) = \frac{\nu_{\rm s}} {\Omega_{0}^2 + \nu_{\rm s}^2}\left\{1-e^{-\nu_{\rm s}t}\left[\cos \left(\Omega_{0} t\right) -\frac{\Omega_{0}}{\nu_{\rm s}} \sin\left(\Omega_{0}t\right)\right]\right\}\cdot \end{equation}(43)

thumbnail Fig. 4

Transition toward parallel and perpendicular diffusion. Before reaching its asymptotic value for t > τs, the transverse diffusion rate decreases as in a sub-diffusive regime.

The numerical simulation reproduces these types of behavior, although the transverse evolution departs slightly from the above formula before reaching the scattering time τs. Nevertheless, the agreement between the theory and the numerical simulation holds during the linear growth at the beginning of the evolution and when the evolution approaches the asymptotic behavior. The numerical results confirm the theory we proposed in the previous section for the asymptotic regime. The scattering time is clearly the time beyond which spatial diffusion takes place. We can also note that there is a sub-diffusion regime before the settlement of the transverse diffusion regime.

The anisotropy ratio D ⊥ /D can be seen in Fig. 5 as a function of ρ′. When the turbulence level η is close to 1 and ρ′ is not too large, the transport appears isotropic D ⊥ /D ≃ 1. At higher rigidities, its behavior follows the law  ∝ ρ′ − 2 for all turbulence levels, illustrating the saturation of the transverse coefficient and in agreement with the theoretical prediction.

thumbnail Fig. 5

Anisotropy ratio D ⊥ /D as function of ρ′ for different levels of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}, as indicated by the various symbols. The dashed line provides a guide for a ρ-2 scaling.

3.4. Comparisons with previous results

Transverse diffusion at high rigidity, as far as we know, has been poorly studied in the literature. However, we can compare our results with several previous numerical and theoretical studies with different limits.

The seminal study of Giacalone& Jokipii (1999) focused on the propagation of mildly relativistic particles (E = 1 MeV to 1 GeV) in the interplanetary magnetic field (δB2~B02\hbox{$\delta B^2 \sim B_0^2$}). Their simulations provided results for ρ ≤ 1 and η ≤ 0.5. However, they also performed several simulations in which the particle energy and the coherence length remained fixed, while the turbulence level was varied. In particular, they examined the case rL,0/c = 10 for moderate values of δB2/B02\hbox{$\delta B^2/ B_0^2$} (Fig. 6 of their paper) in which D ⊥ /D is plotted as a function of λ/rL,0 (λ denoting the mean free path in the parallel direction). By inspecting their figure, one can see that they varied δB2/B02\hbox{$\delta B^2/ B_0^2$} from 0.05 to 30. As a result, they found a classical scattering theory scaling but no physical explanation was proposed. Strictly speaking, the classical theory is valid only for weak turbulence (δB2B02\hbox{$\delta B^2\ll B_0^2$}), which clearly does not apply to those simulations. The present theoretical framework provides a clear explanation of this result, which we confirmed with additional detailed numerical simulations. It is found, for instance, that particles with large rigidities do not interact directly with the magnetic field lines but experience an overall magnetic topology dominated by the mean field with “infinite” coherence length. As a result, the particles execute regular orbits around B0 and undergo random deflections on the coherence length-scale.

The simulations of Casse et al. (2002) investigated weak as well as strong turbulence regimes where δB2/B02[0.1,99]\hbox{$\delta B^2 / B_0^2 \in [0.1, 99]$}. An FFT algorithm was used to construct the magnetic field. For ρ′ > 1, these authors found evidence of anisotropic scattering D ⊥ /D < 1 for all turbulence levels. However, only three simulations points were computed in the high rigidity range and the estimate of the power law slope was inaccurate. Nevertheless, a reasonable agreement is obtained when comparing values of D and D ⊥  with the present results.

Parizot (2004) presented simulations of particle propagation in pure isotropic turbulence. The results in the regime rL ≫ c leads to a diffusion coefficient with a quadratic scaling, D ∝ E2, in agreement with our results from Sect. 3.2.

thumbnail Fig. 6

Ratio D/D as a function of ρ′ for η = 0.99, compared to theorectical predictions and other numerical simulations. Filled diamonds: our simulation results. Star symbols: results from Casse et al. 2002. Solid curve: present theoretical prediction with best-fit D from simulations (see Fig. 2). Dashed curve: present theoretical prediction with analytical D = c2/(3νs). Dot-dashed curve: analytical prediction from Shalchi & Dosch (2009), their Eq. (15).

Shalchi & Dosch (2009) derived an analytical expression for the diffusion anisotropy ratio D ⊥ /D in the framework of a non-linear guiding centretheory. They assume an isotropic turbulence δB with a mean field B0. No assumption was made about either the level of turbulence or about particle energy, hence their result should be valid for any particle rigidity and turbulent field strength. An expression of D ⊥ /D (Eq. (15) in their work) that depends on two parameters was obtained. The first parameter corresponds to the ratio of the mean free path (λ) along the mean field direction to the coherence length c of the turbulent field. The second parameter is the turbulence level δB2/B02=η/(1η)\hbox{$\delta B^2/ B_0^2 = \eta/(1-\eta)$}. Shalchi & Dosch (2009) thus find that the transport becomes highly anisotropic, meaning D ⊥ /D ≪ 1 when λ/c ≫ 1 and/or δB2/B02\hbox{$\delta B^2/ B_0^2$} is not too large (see Figs. 1 and 2 of their work). Therefore, our present conclusions agree with theirs, at least at a qualitative level. A detailed comparison would require us to define λ as a function of ρ′, which could be done by using our results of D for which ρ′ = (4πη/30)1/2(λ/c)1/2. With this substitution, we can directly compare their predictions to our results. In Fig. 6, we plot the ratio of diffusion coefficients as a function of ρ′ from our numerical simulations and compare these results to both the predictions of Shalchi & Dosch (2009) and the theoretical model developed in Sect. 2. Good agreement is found between the simulation results (diamond symbols) and our theory (solid curve and dashed curve); however, the predictions of Shalchi & Dosch (2009) disagree with the numerical results, increasingly so as the rigidity increases. In particular, their analysis predicts a scaling with a slope  − 2.4 instead of the value of  − 2 observed here. Repeating the same comparisons for each simulated value of δB2/B2, we were unable to find agreement between the predictions of Shalchi & Dosch (2009) and our simulations; the predicted values always lie below the numerical results, with a different power-law scaling, comprised between –2.5 for δB2/B02=1\hbox{$\delta B^2/ B_0^2 = 1$} and –2.4 for δB2/B02=104\hbox{$\delta B^2 / B_0^2 =10^4$}. At this point, it could be argued that our definition of λ as a function of ρ′ is inaccurate. However, on physical grounds, the scaling λ ∝ ϵ2 when \hbox{$\bar r_{\rm L} \gg \ell_{\rm c}$} remains robust. Therefore, the discrepancy between the power law scalings should not be affected by uncertainties in the numerical prefactors. We think that the “guiding center” assumption in their work is questionable.

4. Summary and some astrophysical applications

4.1. Summary

Our investigation of the diffusion process in small-scale turbulence with a mean field revealed that, despite its smallness, the mean field plays a role in transverse diffusion because the scattering frequency decreases like ϵ-2, whereas the Larmor frequency decreases like ϵ-1. Instead of finding a single diffusion coefficient that increases like ϵ2, we found an anisotropic diffusion with a transverse coefficient that reaches a limit value at large rigidities. The theory we proposed is based on a single assumption, namely that the correlation time is much smaller than the scattering time, which is valid for both small and large rigidities. The only regime where the theory fails is for a rigidity close to 1 and a high turbulence level; however, the interpolation is obvious. The theory allows us to derive a correct pitch-angle scattering rate and a correct parallel diffusion coefficient for every rigidity. It provides a transverse diffusion coefficient similar to the classical scattering theory formula, despite the arbitrary level of turbulence, which is a correct result for large rigidity. At low rigidity, the present theory is incorrect because it does not take into account the effect of field line wandering described in Casse et al. (2002).

4.2. Particle transport in relativistic shock environments

One major application of the diffusion theory in small-scale turbulence is the transport of supra-thermal particles in the vicinity of a relativistic shock. By crossing the shock transition, electrons and protons reach more or less the same characteristic energy  ⟨ ϵ ⟩  ~ γshmpc2 as revealed clearly by particle-in-cell simulations (e.g., Sironi & Spitkovsky 2011). There is a single plasma frequency ωp∗ ~ ωpi, where ωpi is the ion plasma frequency in the upstream or unshocked plasma. This length-scale characterizes the typical length scale of the microturbulence excited in the shock precursor, as transmitted downstream of the shock transition and viewed in the downstream rest frame. The generation of short scale intense micro-turbulence is possible only at low magnetizations of the upstream plasma (Sironi & Spitkovsky 2011), where the magnetization parameter σ is here defined as the flux of magnetic energy crossing the shock over the flux of matter energy, σB02sin2θB/4πρuc2\hbox{$\sigma \equiv B_0^2 \sin^2 \theta_B/4\pi \rho_{\rm u} c^2$} (where θB is the angle of the background magnetic field with the shock normal, and ρu the unshocked plasma mass density). However, this same level of magnetization also permits the efficient acceleration of particles through a first-order Fermi process at the shock front (Lemoine & Pelletier 2010, 2011). For larger magnetizations – the exact level depending on the shock Lorentz factor, see the above references – the Fermi process cannot develop because of a lack of efficient scattering in the microturbulence (Lemoine et al. 2006; Niemiec et al. 2006; Pelletier et al. 2009). In brief, the development of the Fermi process hinges on the development of micro-turbulence, which itself requires (in the absence of external sources of turbulence) a sufficiently low magnetization level. The situation in which particles are accelerated is by far the most interesting as it should produce directly observable signatures, in the form of radiation and possibly neutrinos.

The transport properties of these accelerated particles is then directly governed by the parallel and perpendicular diffusion coefficients in the limit of large rigidity, as discussed above. We assume that the microturbulence has a typical length-scale close to δ = c/ωp ∗  and that a fraction ϵB of shock dissipated energy is converted into electromagnetic turbulence, i.e. δB28π=2ϵBγsh2ρuc2,\begin{equation} \frac{\langle\delta B^2\rangle}{8\pi} = 2\epsilon_B\gamma_{\rm sh}^2 \rho_{\rm u} c^2, \end{equation}(44)where the rigidity of shock accelerated particles of energy ϵ is given by ρϵB1/2δcϵϵ·\begin{equation} \rho \approx \epsilon_B^{-1/2} \frac{\delta_*}{\ell_{\rm c}} \frac{\epsilon}{\langle\epsilon\rangle}\cdot \end{equation}(45)Current simulations indicate values of ϵB ~ 0.01 − 0.1, hence ρ > 1 and all the more so at high energy.

In this regime, the perpendicular diffusion coefficient that we discussed in the previous section becomes particularly relevant, as the mean magnetic field is mostly perpendicular to the shock normal in the downstream frame, since the transverse components (relatively to the shock normal) are increased by 22γsh\hbox{$2\!\sqrt{2}\gamma_{\rm sh}$}, while the parallel component remains the same as in the upstream frame. Therefore, perpendicular diffusion at high rigidity plays an essential role in the transport of particles in the downstream flow of a relativistic shock.

We consider the diffusive behavior of particles in the downstream rest frame. In this frame the shock front appears to move away with velocity Vshock ≃ c/3. Achieving Fermi cycles requires the particle to return to the shock front. The return time is then measured by identifying shock front with the particle mean displacements c3tret=2Dtret.\begin{equation} {c \over 3}t_{\rm ret} = \sqrt{2 D_{\perp} t_{\rm ret}}. \end{equation}(46)Therefore tret = 18D ⊥ /c2 and Fermi cycles are possible until tret is neither large nor too short. While the first case is constrained by confinement in the acceleration site, the second one is related to the diffusive approximation that is valid only when tret ≥ τs. Using the second limit to constrain diffusive returns, one obtain D ⊥ /D ≥ 1/6, equivalent to νs5Ω0\hbox{$\nu_{\rm s} \geq \sqrt{5} \Omega_0$} when D ⊥  is replaced by its expression from Eq. (33). Fiducial values for a relativistic shock in the interstellar medium provide an energy limit Elim ~ 1019 eV. This limit is somewhat irrelevant because tret ≫ Racc/c at this energy, where Racc/c is the shock dynamics timescale. Hence, the returns appear to be efficient when the condition τs < tret ≪ Racc/c is satisfied. Further investigation would require us to solve a kinetic equation taking into account acceleration, scattering and energy losses processes. Diffusion coefficients obtained in this work may be relevant to providing more realistic results. Previous works assumed Bohm diffusion or isotropic pitch-angle scattering.

Detailed discussion of the performance of the relativistic Fermi process is beyond of the scope of the present paper and is left to future work.

In certain astrophysical settings, the transverse diffusion may play a key role in the transport of particles upstream of a relativistic shock, most particularly if the shock propagates in a wind with a dominant toroidal field at large distances. These circumstances can be encountered in particular when a gamma-ray burst explodes in the wind of the progenitor, or at the termination shock of a pulsar wind.

4.3. High-energy cosmic rays

The above result about transverse diffusion has a broader application than Fermi acceleration at shocks, as it governs the confinement properties of any relativistic flow containing a small-scale turbulence, where “small” is measured relatively to the Larmor radius of the test particles propagating in this flow. This concerns in particular the propagation of very high-energy cosmic rays in our Galaxy. Assuming a coherence length of interstellar turbulence c ~ 10 − 100   pc, a mean field intensity of 3   μG approximately and a turbulent field of the same order, the rigidity of particles of energy E is given by \hbox{$\rho \simeq 2 (E/10^{17}\,{\rm eV})(\ell_{\rm c}/10\,{\rm pc})^{-1}(\bar B/5\,\mu{\rm G})^{-1}$}, while the Larmor radius \hbox{$r_{\rm L}\simeq 20\,{\rm pc}\,(E/10^{17}\,{\rm eV})(\bar B/5\,\mu{\rm G})^{-1}$}. Assuming η ≃ 0.5 in the Galaxy and using Eq. (33), the perpendicular mean free path is then of order λ ⊥  ~ 6   pc with these values of energy and magnetic field. This implies that the escape, or transport across the disk magnetic field of particles of energy  ≥ 1017   eV is governed by the perpendicular diffusion in the high rigidity regime discussed above. Quite interestingly, this energy range presumably corresponds to the transition between the Galactic and extragalactic cosmic-ray components in the all-particle spectrum.

Finally, one could mention another application of the present discussion, to the field of magnetic reconnection. There, transverse diffusion likely plays a role in the control of particle diffusion across the field lines with small-scale turbulence being associated with the dissipation of magnetic energy. The reconnection rate depends on two fundamental parameters (Lyutikov & Uzdensky 2003): magnetization and the Lundquist number that involves diffusion across field lines. In general, one assumes Bohm diffusion for simplicity but the present work provides the grounds for a more accurate estimate.

References

  1. Aloisio, R., & Berezinsky, V. 2004, ApJ, 612, 900 [NASA ADS] [CrossRef] [Google Scholar]
  2. Bell, A. R. 2004, MNRAS, 353, 550 [Google Scholar]
  3. Bell, A. R., & Lucek, S. G. 2001, MNRAS, 321, 433 [NASA ADS] [CrossRef] [Google Scholar]
  4. Candia, J., & Roulet, E. 2004, JCAP, 10, 007 [Google Scholar]
  5. Casse, F., Lemoine, M., & Pelletier, G. 2002, Phys. Rev. D, 65, 023002 [Google Scholar]
  6. Fatuzzo, M., Melia, F., Todd, E., Adams, F. C., et al. 2010, ApJ, 725, 515 [NASA ADS] [CrossRef] [Google Scholar]
  7. Frisch, U. 1966, Annales d’Asphysique, 29, 645 [NASA ADS] [Google Scholar]
  8. Giacalone, J., & Jokipii, J. R. 1999, ApJ, 520, 204 [Google Scholar]
  9. Jokipii, J. R. 1966, ApJ, 146, 480 [Google Scholar]
  10. Jokipii, J. R. 1973, ApJ, 183, 1029 [NASA ADS] [CrossRef] [Google Scholar]
  11. Lemoine, M., & Pelletier, G. 2010, MNRAS, 402, 321 [NASA ADS] [CrossRef] [Google Scholar]
  12. Lemoine, M., & Pelletier, G. 2011, MNRAS [arXiv:1102.1308] [Google Scholar]
  13. Lemoine, M., Pelletier, G., & Revenu, B. 2006, ApJL, 645, 129 [Google Scholar]
  14. Lyutikov, M., & Uzdensky, D. 2003, ApJ, 589, 893 [NASA ADS] [CrossRef] [Google Scholar]
  15. Niemiec, A., Ostrowsky, M., & Pohl, M. 2006, ApJ, 650, 1020 [NASA ADS] [CrossRef] [Google Scholar]
  16. Parizot, E. 2004, Nuc. Phys. B, 136, 169 [Google Scholar]
  17. Pelletier, G. 1977, J. Plasma Phys., 18, 49 [NASA ADS] [CrossRef] [Google Scholar]
  18. Pelletier, G., Lemoine, M., & Marcowith, A. 2009, MNRAS, 393, 587 [NASA ADS] [CrossRef] [Google Scholar]
  19. Press, W. H., Flannery, B. P., Teukolsky, S. A. , & Vetterling, W. T. 1986, Numerical Recipes (Cambridge: Cambridge Univ. Press) [Google Scholar]
  20. Shalchi, & Dosch 2009, Phys. Rev. D, 79, 083001 [Google Scholar]
  21. Sironi, L., & Spitkovsky, A. 2011, ApJ, 726, 75 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Average of the time ordered exponential

We solve the differential equation u˙=˜Ωˆ·u\appendix \setcounter{section}{1} \hbox{$\dot {\vec u} = \boldsymbol{\hat {\tilde \Omega}}\cdot \vec{u}$} in successive iterations that leads to a Dyson series, the average of which is composed of products of the form 2p(t)=0tdt10t1dt2...0t2p1dt2p˜Ωˆ(t1)·˜Ωˆ(t2)...˜Ωˆ(t2p),\appendix \setcounter{section}{1} \begin{equation} \boldsymbol{ \hat A}_{2p}(t) = \int_0^{t} {\rm d}t_1\int_0^{t_1} {\rm d}t_2\ldots\int_0^{t_{2p-1}} {\rm d}t_{2p}\, \langle\boldsymbol{\hat {\tilde\Omega}}(t_1) \cdot \boldsymbol{\hat {\tilde \Omega}}(t_2)\ldots \boldsymbol{\hat {\tilde \Omega}}(t_{2p})\rangle, \end{equation}(A.1)which can be compared to Eq. (21).

For a Gaussian process, each average of order 2p products can be divided into a sum of p products of second-order moments, the sum containing (2p − 1) !  !  terms. We assume a stationary random process such that the second order moment is an even function of the time difference. In the white noise limit, the “nested” and “crossed” averages vanish, only the “unconnected” averages remaining in the expansion. Nested terms contain products of the form  ⟨ X(ti)X(tl) ⟩  ⟨ X(tj)X(tk) ⟩  with ti ≥ tj ≥ tk ≥ tl, while crossed terms are of the form  ⟨ X(ti)X(tk) ⟩  ⟨ X(tj)X(tl) ⟩  with ti ≥ tj ≥ tk ≥ tl. These terms vanish as the various delta functions associated with the second order moments cancel each other as a result of the time ordering in the upper bounds of the integrals. Thus, only the unconnected average remains at each order 2p(t)=t0dt10t1dt2...0t2p1dt2p˜Ωˆ(t1)·˜Ωˆ(t2)...˜Ωˆ(t2p1)·˜Ωˆ(t2p).\appendix \setcounter{section}{1} \begin{eqnarray} \boldsymbol{\hat A}_{2p}(t) &=& \int_0^{t} {\rm d}t_1\int_0^{t_1} {\rm d}t_2...\int_0^{t_{2p-1}} {\rm d}t_{2p}\,\nonumber\\ && \quad \langle\boldsymbol{\hat {\tilde\Omega}}(t_1) \cdot \boldsymbol{\hat {\tilde \Omega}}(t_2)\rangle\ldots \langle\boldsymbol{\hat {\tilde \Omega}}(t_{2p-1})\cdot\boldsymbol{\hat {\tilde \Omega}}(t_{2p})\rangle. \end{eqnarray}(A.2)

We introduce the short-hand notation ˜Ωˆ(t1)·˜Ωˆ(t2)(t1t2)=2τcδ(t1t2)0\appendix \setcounter{section}{1} \hbox{$\langle\boldsymbol{\hat {\tilde\Omega}}(t_1) \cdot \boldsymbol{\hat {\tilde \Omega}}(t_2)\rangle \equiv \boldsymbol{\hat C}(t_1-t_2) = 2\tau_{\rm c} \delta(t_1-t_2) \boldsymbol{\hat C_0}$}. Then one can calculate Â2p(t) by recursion, starting from the last double integral in the product 2p(t)=(τct)pp!0p.\appendix \setcounter{section}{1} \begin{equation} \boldsymbol{\hat A}_{2p}(t) = \frac{(\tau_{\rm c} t)^p}{p!} \boldsymbol{\hat C}_{\vec 0}^p. \end{equation}(A.3)We consider now the integral of the second order moment (t)0tdt10tdt2˜Ωˆ(t1)·˜Ωˆ(t2)=2τct0.\appendix \setcounter{section}{1} \begin{equation} \boldsymbol{\hat K}(t) \equiv \int_0^t {\rm d}t_1 \int_0^t {\rm d}t_2 \,\langle\boldsymbol{\hat{\tilde\Omega}}(t_1) \cdot \boldsymbol{\hat {\tilde \Omega}}(t_2)\rangle = 2\tau_{\rm c} t \boldsymbol{\hat C_0}. \end{equation}(A.4)Therefore 2p(t)=12pp!p(t),\appendix \setcounter{section}{1} \begin{equation} \boldsymbol{\hat A}_{2p}(t) = \frac{1}{2^p p!} \boldsymbol{\hat K}^p(t), \end{equation}(A.5)hence summing all the terms of the series, p=0p=+2p(t)=exp[12(t)].\appendix \setcounter{section}{1} \begin{equation} \sum_{p=0}^{p=+\infty} \boldsymbol{\hat A}_{2p}(t) = \exp\left[\frac{1}{2} \boldsymbol{\hat K}(t)\right]. \end{equation}(A.6)Further details can be found in Frisch (1966) and Pelletier (1977).

All Figures

thumbnail Fig. 1

The diffusion coefficient variation is plotted in units of cLmax/(2π) as a function of rigidity ρ′ in pure turbulence (B0 = 0 or η = 1). The dashed line is drawn as a reference for a scaling Diso ∝ ρ′2. For ρ′ > 1, Diso is indeed proportional to ρ2.

In the text
thumbnail Fig. 2

The parallel diffusion coefficient D plotted in units of (cLmax/(2π)) as a function of ρ′ for different degrees of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}. Here D ∝ ρ2, as in the case of purely isotropic turbulence without mean field. As long as δB2/B021\hbox{$\delta B^2 / B_0^2\gg1$}, the strength of the turbulence does not influence the normalization of D.

In the text
thumbnail Fig. 3

The transverse diffusion coefficient D ⊥  plotted in units of cLmax/(2π) as a function of ρ′ for different degrees of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}. The diffusion coefficient saturates at ρ′ ~ δB/B0. Below this value, its behavior is similar to the parallel diffusion coefficient. Beyond ρ′, its value becomes independent of particle rigidity.

In the text
thumbnail Fig. 4

Transition toward parallel and perpendicular diffusion. Before reaching its asymptotic value for t > τs, the transverse diffusion rate decreases as in a sub-diffusive regime.

In the text
thumbnail Fig. 5

Anisotropy ratio D ⊥ /D as function of ρ′ for different levels of turbulence δB2/B02[1,9999]\hbox{$\delta B^2 / B_0^2 \in [1,9999]$}, as indicated by the various symbols. The dashed line provides a guide for a ρ-2 scaling.

In the text
thumbnail Fig. 6

Ratio D/D as a function of ρ′ for η = 0.99, compared to theorectical predictions and other numerical simulations. Filled diamonds: our simulation results. Star symbols: results from Casse et al. 2002. Solid curve: present theoretical prediction with best-fit D from simulations (see Fig. 2). Dashed curve: present theoretical prediction with analytical D = c2/(3νs). Dot-dashed curve: analytical prediction from Shalchi & Dosch (2009), their Eq. (15).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.