Free Access
Issue
A&A
Volume 530, June 2011
Article Number A148
Number of page(s) 23
Section The Sun
DOI https://doi.org/10.1051/0004-6361/201016426
Published online 27 May 2011

© ESO, 2011

1. Introduction

The sub-surface of the Sun is optically thick, preventing us from directly observing the interior layers. Understanding the properties of the plasma in these regions has consequences for the theories of convection, stability of sunspots, the dynamics of stratified convection, and others. Most current knowledge about convection comes primarily from computational work (e.g., Vögler et al. 2005; Benson et al. 2006; Rempel et al. 2009). Helioseismic inversions of the sub-surface flows will play an important role in constraining these theories.

A powerful way of imaging the solar interior is via inferences gathered from studying the statistics of the acoustic and surface gravity waves at the surface. Solar pressure and surface gravity modes are generated randomly by the vigorous turbulence in the upper convection zone. These oscillations are observed in the solar photosphere by measuring Doppler shifts of photospheric absorption lines. Forward modelling allows us to relate anomalies (like flows, thermal hot/cold spots etc.) to changes in helioseismic observables.

The aim of helioseismic inversions is to reveal the structure of the subsurface flows (rotation, meridional circulation, convection), magnetic fields, and to measure deviations in the plasma state parameters (temperature, density, pressure) from a quiet Sun average. In this paper, we focus on travel times (Duvall et al. 1993), i.e., quantities that emerge from fits to cross correlations of observed signals. Time-distance helioseismology is used to measure and interpret changes in travel times of seismic waves caused by inhomogeneities in the structure of the Sun (see review by Gizon et al. 2010). In recent years, time-distance helioseismology has been used to invert for near-surface flows (e.g., Gizon et al. 2000; Duvall & Gizon 2000; Zhao & Kosovichev 2004; Jackiewicz et al. 2008), for flows beneath sunspots (e.g., Duvall et al. 1996; Zhao et al. 2001; Couvidat et al. 2006; Cameron et al. 2008; Gizon et al. 2009; Moradi et al. 2010) and flows in their vicinity (Gizon et al. 2000), study the rotational gradient at the base of the convection zone (e.g., Hanasoge & Duvall 2009), etc.

Helioseismic inversions are performed using two principal methods: the regularised least squares (RLS) and optimally localised averaging (OLA). The RLS method (in time-distance helioseismology used for the first time by Kosovichev 1996) seeks to find the models of the solar interior, which provide the best least-squares fit to the measured travel-time maps, while regularising the solution (e.g., by requiring the smooth solution). The OLA method was developed for geoseismology (Backus & Gilbert 1968, 1970). A form suitable for use in helioseismology was devised by Pijpers & Thompson (1992), who formulated the subtractive-OLA method. SOLA is based on explicitly constructed spatially confined averaging kernels by taking linear combination of sensitivity kernels, while simultaneously keeping the error magnification small. The resulting coefficients are then used to linearly combine the travel-time maps and obtain the estimate for structure and magnitude of solar plasma perturbations. A SOLA-type inversion is the principal method discussed in the current paper. The SOLA has been used in time-distance local helioseismology in the past by Jackiewicz et al. (2007, 2008) who demonstrated the ability of SOLA inversions to reveal the structure of the 3D internal flows. An efficient approach to solve fully consistent SOLA inversions was introduced by Jackiewicz et al. (2011). In this paper we focus on inversions for three-dimensional vector flows on supergranular scales in the near-surface layers of the solar convection zone.

1.1. Validating helioseismic inversions

Two approaches have been used so far to validate time-distance inversions. The first approach consists of generating synthetic travel-time maps by convolving a (known) arbitrary frozen flow field with travel-time sensitivity kernels and then testing the inversion method using these synthetic travel times (e.g., Kosovichev & Duvall 1997). The second approach (Zhao et al. 2007, 2010) is to use evolving realistic numerical simulations of three-dimensional radiative convection, where the helioseismic waves are naturally excited by the convection.

The first approach is convenient, but may not represent a realistic situation, in particular because the prescribed subsurface structures are often too idealised. The second approach is preferable, but is limited by computing resources: only simulations of the very near-surface regions of the Sun are available today. Both approaches have shown that inversions are generally able to retrieve the horizontal components of velocity at supergranulation scales in the quiet Sun. However, some problems have been reported. For example, vertical velocities have been measured with the opposite sign near the surface (Zhao et al. 2007) and the structure and sign of the flows around sunspots vary with the inversion method used (see, e.g., Gizon et al. 2009; Moradi et al. 2010).

The aim of this paper is to validate a particular implementation of SOLA inversions for time-distance helioseismology using a mixed approach: we take a snapshot from a large-box realistic simulation of solar convection (Ustyugov 2008) to generate realistic travel-time maps by convolution with Born travel-time sensitivity kernels. A realistic noise component is added to the travel times. The travel-time maps are then inverted using a multichannel SOLA inversion and compared with the known flows. This approach allows us to investigate various types of biases in the results of the inversion and to develop a robust procedure to minimise them. The most serious bias comes from the natural correlations among the components of flows induced by mass conservation that translate into a cross-talk between the components of the inverted flow (as we shall see, this is important when retrieving the vertical flow). Using realistic numerical simulations of convective flow velocities as input is very useful to set the acceptable level of random noise of the inverted flow velocities. The use of travel times with realistic noise properties is important in order to derive realistic estimates of the noise of the inverted flow velocities.

2. Synthetic travel times

To construct synthetic travel times, we use a realistic hydrodynamic simulation (Ustyugov 2008) of the solar convection. The computational domain is a box 20 Mm in depth and 60 Mm in each horizontal direction. The simulation provides us with a reasonably realistic description of flows in the upper convection zone v = (vx,vy,vz). Throughout this paper the spatial coordinates are defined as x=(r,z),\begin{equation} \bx=(\br, z), \end{equation}(1)where r is the horizontal position vector and z is the height.

We choose a snapshot (t = 500 min, see Fig. 16 in the on-line supplement) of the above mentioned simulation for our inversion tests. Following Duvall et al. (1997), we consider different types of travel-time measurements between a surface point at position r and a concentric annulus or quadrants in order to measure travel times sensitive to flows in inward-outward (denoted by “oi”), west-east (“we”), and north-south (“ns”) directions. Travel-time maps are denoted by τa(r), where the superscript a is an integer that uniquely refers to a particular combination of choices in the data analysis: a type of geometry (oi, we, or ns), annulus radius (from 7.3 Mm to 29.2 Mm every 1.46 Mm), and a wave filter (here, ridge filters for one of f, p1, p2, p3, or p4 modes). Thus, index a refers to one of M = 3 × 16 × 5 = 240 possibilities. Additional information describing the measurement procedure in time-distance helioseismology is given by Gizon & Birch (2005).

Travel-time maps (travel times as functions of position vector r) are generated by convolving the convection snapshot with sensitivity kernels according to τa(r)=Ka(rr,z)·v(r,z)d2rdz+na(r),\begin{equation} \tau^a(\br) = \int_{\odot} \bK^a(\br'-\br,z) \cdot \bv(\br',z)\; \id^2\br' \, \id z + n^a(\br) , \label{eq:traveltimesdef} \end{equation}(2)where Ka=(Kxa,Kya,Kza)\hbox{$\bK^a = (K_x^a, K_y^a, K_z^a)$} is a vector travel-time sensitivity kernel (see, e.g., Fig. 17 in the on-line supplement), v is the velocity vector of convecting flows, and the volume integral is taken over the Sun. The noise component is denoted by na. The original horizontal size of the simulation box of 60 Mm was too small for our purpose. Since the simulated velocities are periodic in the horizontal directions, we copied the simulation box 10 × 10 times.

Kernels are computed using the single-scattering Born approximation (Birch & Gizon 2007) and depend on eigenmodes of a background 1D standard solar model (Christensen-Dalsgaard et al. 1996). The kernels are invariant under horizontal translations; horizontal averages of these kernels for various oscillation modes are displayed in Fig. 18 in the on-line supplement to display their sensitivity in depth. All kernels used in this study have sensitivities only in upper-most 10 Mm of the convection zone.

Solar waves are excited by the action of turbulent convection in the Sun and therefore travel times are inherently noisy. Following Gizon & Birch (2004), we use a realistic noise covariance matrix (see Fig. 19 in the on-line supplement), Λab(rr)=E[na(r)nb(r)],\begin{equation} \Lambda_{ab}(\br-\br') = E[ n^a(\br) n^b(\br')], \end{equation}(3)to generate realizations of the noise.

Each travel-time map has 400 × 400 pixels with spatial sampling of 1.46 Mm, corresponding to the pixel size of a Michelson-Doppler-Imager full-disc image (MDI, Scherrer et al. 1995). In total, we generate 240 different travel-time maps (one for each index a, see, e.g., Fig. 1). These travel-time maps have spatial power spectra that are similar to observed travel-time power spectra based on MDI data, as demonstrated in Fig. 2. However, our synthetic travel times have weaker power at low wave-numbers due to the complete lack of signal from the simulated convective velocities below k   R = 60 (only simulated noise is present).

thumbnail Fig. 1

Examples of synthetic travel-time maps used in this paper (top row) and observed travel times from SOHO/MDI (bottom row). Temporal length of observation is T = 6 h.

thumbnail Fig. 2

Comparison of azimuthally averaged spatial power spectra \hbox{$| \bar{\tau}^a(k) |^2$} of synthetic (solid) and observed (dashed) travel times displayed in Fig. 1 as a function of k   R, where k is the horizontal wave-number and R is the radius of the Sun.

We check to ensure that point-to-annulus sensitivity kernels and noise covariance matrices obey expected symmetries. Furthermore, the sensitivity kernels and covariance matrices are forced to decay smoothly to zero towards the edge of the computation box by multiplying with a smooth spatial function having zeros far from the region of interest. In addition, we make sure that the horizontal integral of each sensitivity kernel for vz is zero at all depths. This is a consequence of symmetries associated with point-to-point kernels for vz, which are insensitive to the mean value of vertical velocity (Birch & Gizon 2007).

Table 1

Notations.

3. Improved SOLA inversions

3.1. Real-space formulation

The inversion method we use here is an improved version of the subtractive optimally localised averages (SOLA) method described by Jackiewicz et al. (2007, 2008, 2011).

The SOLA algorithm describes how to optimally combine a given set of travel time measurements to infer the underlying properties of medium. In this paper, we wish to retrieve vαinv˜(x0)\hbox{$\vinv_\alpha(\bx_0)$}, i.e., an estimate of the α-component of the flow velocity in the neighbourhood of position x0 = (r0,z0) in the solar interior. In practice, we search for a linear combination of the travel-time measurements, vαinv˜(x0)=i,awaα(rir0;z0)τa(ri),\begin{equation} \vinv_\alpha (\bx_0) = \sum_{i, a} w^\alpha_a(\br_i-\br_0; z_0) \tau^a(\br_i) , \label{eq:v_tilde} \end{equation}(4)where waα(ri)\hbox{$w^\alpha_a(\br_i)$} are inversion weights to be determined. Combining Eqs. (2) and (4), we have vαinv˜(x0)=𝒦α(rr0,z;z0)·v(x)d2rdz\begin{eqnarray} \vinv_\alpha (\bx_0) & = & \int_\odot \bcK^\alpha(\br-\br_0, z; z_0) \cdot \bv(\bx) \; \id^2\br\,\id z \nonumber \\ & & + \sum_{i, a} w^\alpha_a(\br_i-\br_0; z_0) n^a(\br_i) , \label{eq:tilde_v_vector} \end{eqnarray}(5)where 𝒦α=(𝒦xα,𝒦yα,𝒦zα)\hbox{$\bcK^\alpha = (\cK_x^\alpha, \cK_y^\alpha, \cK_z^\alpha)$} is a vector averaging kernel with components 𝒦βα(r,z;z0)i,awaα(ri;z0)Kβa(rri,z).\begin{equation} \cK_\beta^\alpha (\br,z; z_0) \equiv \sum_{i,a} w^\alpha_a(\br_i; z_0) K_\beta^a(\br-\br_i,z). \label{eq:akerndefinition} \end{equation}(6)The expression d2r   dz means d3x.

The expectation value of inverted velocity vαinv˜(x0)\hbox{$\vinv_\alpha (\bx_0)$} may be expanded as E[vαinv˜]=vαinv+βαvαinv(β),\begin{equation} {\rm E}[\vinv_\alpha]= \vakern_\alpha + \sum\limits_{\beta \ne \alpha} {\vakern_\alpha}{}^{(\beta)} , \label{eq:v_expectation} \end{equation}(7)where vαinv(β)(x0)𝒦βα(rr0,z;z0)vβ(x)d2rdz,\begin{equation} {\vakern_\alpha}{}^{(\beta)} (\bx_0) \equiv \int_\odot \cK^\alpha_\beta(\br-\br_0, z; z_0) v_\beta(\bx) \; \id^2\br\,\id z , \label{eq:v_breve} \end{equation}(8)and vαinvvαinv(α)\hbox{$\vakern_\alpha \equiv {\vakern_\alpha}{}^{(\alpha)}$}. The form of Eq. (7) will be useful later in this paper to optimize the inversion. Notice that the second term on the right side of (7) represents leakage of other flow components into the inverted one. This cross-talk may be a significant source of bias and needs to be studied.

We search for inversion weights wα so that the vector averaging kernel \hbox{$\bcK^\alpha(\br-\br_0,z;z_0)$} resembles a user-supplied target \hbox{$\bcT^\alpha(\br-\br_0,z;z_0)$}. Because we wish to invert for the α-component of the flow, we choose the target with components: 𝒯βα(rr0,z;z0)=𝒯(rr0,z;z0)δαβ,\begin{equation} \cT^\alpha_\beta(\br-\br_0,z;z_0)=\cT(\br-\br_0,z;z_0) \delta_{\alpha\beta} , \end{equation}(9)where \hbox{$\cT(\br-\br_0,z;z_0)$} is a function that peaks around x0 = (r0,z0), diminishes rapidly away from that point, and has unit integral. δαβ is a Kronecker δ. Throughout this paper, we choose a simple Gaussian 𝒯(r,z;z0)=(4ln2)3/2π3/2sh2szexp[4ln2sh2r24ln2sz2(zz0)2],\begin{equation} \cT(\br,z;z_0)= \frac{(4\ln2)^{3/2}}{\pi^{3/2} s^2_h s_z} \exp{\left[-\frac{4\ln{2}}{s_h^2}\|\br\|^2-\frac{4\ln2}{s_z^2}(z-z_0)^2\right]} , \end{equation}(10)where sh and sz are full widths at half maximum (FWHM) of the Gaussian in horizontal and vertical directions respectively.

A successful inversion will return a value of vαinv(x0)\hbox{$\vakern_\alpha(\bx_0)$} close to the target velocity vαtgt(x0)𝒯(rr0,z;z0)vα(x)d2rdz,\begin{equation} \vtarg_\alpha (\bx_0) \equiv \int_\odot \cT(\br-\br_0, z; z_0) v_\alpha(\bx) \; \id^2\br\,\id z, \label{eq:v_hat} \end{equation}(11)and cross-talk velocities vαinv(β)(x0)\hbox{${\vakern_\alpha}{}^{(\beta)}(\bx_0)$} close to zero for β ≠ α.

A list of the notations used for the various flow velocities referred to in the inversion procedure is given in Table 1.

The problem to be solved to obtain the wα belongs to the class of constrained regularised optimisations. The terms to be regularised are the following:

  • The misfit, i.e., how far the averaging kernel is from the desired target function, χ2(wα;z0)=[𝒦αα(r,z;z0)𝒯(r,z;z0)]2d2rdz,\begin{equation} \chi^2(\weights^\alpha; z_0)=\int_\odot [\cK_\alpha^\alpha(\br,z; z_0) - \cT(\br,z; z_0)]^2 \; \id^2\br\,\id z , \label{eq:misfit} \end{equation}(12)where 𝒦αα\hbox{$\cK_\alpha^\alpha$} is an implicit function of the weights wα.

  • The cross-talk quantifying the leakage of the signal from other flow components into the inverted one, XT(wα;z0)=βα[𝒦βα(r,z;z0)]2d2rdz.\begin{equation} XT(\weights^\alpha; z_0)=\sum\limits_{\beta \ne \alpha} \int_\odot [\cK_\beta^\alpha(\br,z; z_0)]^2 \; \id^2\br\,\id z . \label{eq:crosstalk} \end{equation}(13)

  • The variance of vαinv˜\hbox{$\vinv_\alpha$}, which corresponds to the root-mean-square of noise-related fluctuations in inferred velocities. We refer to this quantity as predicted error hereafter, σα2(wα;z0)=i,j,a,bwaα(ri;z0)Λab(rirj)wbα(rj;z0).\begin{equation} \sigma^2_\alpha(\weights^\alpha; z_0)=\sum\limits_{i,j,a,b} w^\alpha_a(\br_i;z_0) \Lambda_{ab}(\br_i-\br_j) w^\alpha_b(\br_j;z_0) . \label{eq:noise} \end{equation}(14)

  • The ad hoc term quantifying the spread of the weights in space around r0, S(wα;z0)=a,i[waα(ri;z0)]2.\begin{equation} S(\weights^\alpha; z_0)=\sum\limits_{a,i} [w^\alpha_a(\br_i; z_0)]^2 . \label{eq:confinement} \end{equation}(15)The regularisation based on S ensures that weights decrease towards the edge of the horizontal domain in order to enforce the spatial locality of the inversion and to prevent the weights from oscillating in the spatial domain.

In practice, we search for the weights wα that minimise the cost function Fα(wα)=[𝒦αα(r,z;z0)𝒯(r,z;z0)]2d2rdz+νβα[𝒦βα(r,z;z0)]2d2rdz+ϵa,i[waα(ri;z0)]2+μi,j,a,bwaα(ri;z0)Λab(rirj)wbα(rj;z0).\begin{eqnarray} F^\alpha(\weights^\alpha) & = &\int_\odot [\cK_\alpha^\alpha(\br,z; z_0) - \cT(\br,z; z_0)]^2 \; \id^2\br\,\id z \nonumber \\ &&+\, \nu \,\sum\limits_{\beta \ne \alpha} \int_\odot [\cK_\beta^\alpha(\br,z; z_0)]^2 \; \id^2\br\,\id z \nonumber + \epsilon \sum\limits_{a,i} [w^\alpha_a(\br_i; z_0)]^2 \\ \label{eq:costfunction} &&+\, \mu \sum\limits_{i,j,a,b} w^\alpha_a(\br_i;z_0) \Lambda_{ab}(\br_i-\br_j) w^\alpha_b(\br_j;z_0) . \end{eqnarray}(16)Trade-off parameters μ, ν, and ϵ balance these terms. The strategy to be employed in order to set these values will be described in Sect. 3.3.

The inversion is subject to constraints d2rdz𝒦βα(r,z;z0)=δαβforallβ,\begin{equation} \int_\odot \id^2\br\,\id z \, \cK^\alpha_\beta(\br,z;z_0) = \delta_{\alpha \beta}\ \foral \beta , \label{eq:constraint} \end{equation}(17)in order to scale the amplitude of the inverted flow ˜vinv\hbox{$\bvinv$} appropriately.

By taking derivative of (16) with constraint (17) added with respect to the weights, the problem can be cast into a linear inverse problem, as explained by Pijpers & Thompson (1992) or Jackiewicz et al. (2011). In real space, the matrix to be inverted has (N2M + P)2 ≃ (107)2 elements, where N = 200 is the number of grid points in one horizontal direction, P = 3 is the number of physical unknowns (three velocity components), and M = 240 as already defined earlier.

3.2. Fourier-space (multichannel) formulation

The full problem written in real space is intractable to be solved using nowadays computers, because the matrix to be inverted is too large. Jackiewicz et al. (2011) found a solution to this problem by transforming to spatial Fourier space, where the inverse problem decouples as a consequence of the horizontal translation invariance of the sensitivity kernels. Thus, instead of a big linear inverse problem (inverting matrix having (107)2 elements), we solve 40   000 small linear inverse problems (inverting matrices having 2402 elements) in wave-vector space. This approach is called a multichannel inversion (Jensen et al. 1998). Here we summarise results of Jackiewicz et al. (2011) for the sake of completeness.

Following Jackiewicz et al. (2011), we use the following definition of the Fourier transform, such that any function f(r) and its 2D spatial Fourier transform \hbox{$\bar{f}(\bk)$} are related according to f(r)=hk2k(k)exp(ik·r),(k)=hx2(2π)2rf(r)exp(ik·r),\begin{eqnarray} f(\br) & = & h_k^2 \sum\limits_{\bk} \bar{f}(\bk) \exp{\left({\rm i} \bk \cdot \br \right)} , \\ \bar{f}(\bk) & = & \frac{h_x^2}{(2\pi)^2} \sum\limits_{\br} f(\br) \exp{\left(-{\rm i} \bk \cdot \br \right)} , \end{eqnarray}where hx = 1.46 Mm and hk = 0.022 rad/Mm are the grid spacings in the real and Fourier domains respectively.

For each non-vanishing wave-vector k = (kx,ky), the vector of weights W(k)=[α1(k)α2(k)...αM(k)]T\hbox{$W(\bk)=[\bar{w}^\alpha_1(\bk) \; \bar{w}^\alpha_2(\bk) \;\dots\; \bar{w}^\alpha_M(\bk)]^{\rm T}$} is the solution to the matrix equation, hk2N2A(k)W(k)=T(k)fork0.\begin{equation} {h_k^2 N^2\,A(\bk) W(\bk) }= T(\bk) \quad {\rm for\ } \bk \neq \bzero. \label{eq:problem-nonzerok} \end{equation}(20)Each matrix A(k) has M × M elements Aab given by Aab(k)=(2π)2+dzaα(k,z)bα(k,z)+(2π)2νβα+dzaβ(k,z)bβ(k,z)\begin{eqnarray} A_{ab}(\bk) &=& (2\pi)^2 \int\limits_{-\infty}^{+\infty} {\rm d}z {\bar{K}_\alpha^{a *}}(\bk,z) \bar{K}^b_\alpha (\bk,z) \nonumber \\ & &+ (2\pi)^2 \nu \sum\limits_{\beta \ne \alpha} \int\limits_{-\infty}^{+\infty} {\rm d}z {\bar{K}_\beta^{a *}}(\bk,z) \bar{K}^b_\beta (\bk,z) \nonumber \\ &&+ \mu \bar\Lambda_{ab}(\bk) + \epsilon \delta_{ab}. \label{eq:a_ca} \end{eqnarray}(21)The vector \hbox{$T(\bk)=[\bar{t}_1(\bk) \; \bar{t}_2(\bk) \;\dots\; \bar{t}_M(\bk)]^{\rm T}$} has M elements \hbox{$\bar{t}_a$} given by a(k;z0)=(2π)2+dzaα(k,z)𝒯̅(k,z;z0).\begin{equation} \bar{t}_a (\bk; z_0) = (2\pi)^2 \int\limits_{-\infty}^{+\infty} \id z \, {\bar{K}^{a *}_\alpha}(\bk,z) \bar{\cT}(\bk,z; z_0). \label{eq:t_c} \end{equation}(22)In addition, the matrix equation for the case k = 0 is [hk4N2A(0)CCT0][W(0)L]=[hk2T(0)Uα/(hk2N2)],\begin{equation} {\left[ \begin{array}{cc} h_k^4 N^2 A(\bzero) & C \\ C^{\rm T} & 0 \end{array} \right] \left[ \begin{array}{c} W(\bzero) \\ L \end{array} \right]} = {\left[ \begin{array}{c} h_k^2\, T(\bzero) \\ U^\alpha/(h_k^2 N^2) \end{array} \right],} \label{eq:problem-zerok} \end{equation}(23)where C is an M × P matrix with elements C=d2rdzKβa(r,z),\begin{equation} C_{a \beta} = \int_\odot \id^2 \br\, \id z \, K^a_\beta (\br,z),\label{eq:c_alpha} \end{equation}(24)L is a 1 × P vector of Lagrange multipliers, and the vector Uα is a 1 × P unit vector with components uβα=δαβ\hbox{$u^\alpha_\beta=\delta_{\alpha\beta}$}.

Solutions to the N − 1 Eq. (20) and the Eq. (23) give Fourier components of the weights, αa(k)\hbox{$\bar{w}^\alpha_a(\bk)$}. By taking an inverse Fourier transform we obtain waα(r)\hbox{$w^\alpha_a(\br)$}. Equation (4) then gives an estimate of the α-component of the flow, vαinv˜\hbox{$\vinv_\alpha$}.

The difference between the above equations and those in Jackiewicz et al. (2011) arises from additional constraint terms (13) and (15).

3.3. Picking trade-off parameters

Trade-off parameters μ, ν, and ϵ control the balance between various terms in the cost-function (16).

In practice, during the inversion for each flow component, we compute a grid of solutions by varying all three trade-off parameters. For each solution on this grid, we compute the misfit (12), predicted error of results (14), amount of cross-talk (13), and the spatial power of weights (15). All four quantities can be computed in spatial Fourier domain.

Standard optimisation methods to achieve optimality, such as L-curve analysis (see, e.g., Hansen 1998; Jackiewicz et al. 2008), are not particularly useful to our problem. The elbow of the L-curve, which is considered an optimal point in parameter space, was located where the predicted noise level was very large. We therefore developed our own strategy of selecting values for the trade-off parameters.

We start with parameter μ, which controls the trade-off between the misfit and random error. We select μ such that the noise level of the inverted quantity is less than some target value, chosen by the user, depending on the problem at hand. For example, one may choose a target noise level of 20 m/s to invert for horizontal flows in supergranules. A much lower noise level will be required to invert for the vertical component of the flow (Duvall & Birch 2010). The example noise levels σα are given in Table 2, where we give the predicted inversion error for vxinv˜\hbox{$\vinv_x$} and vzinv˜\hbox{$\vinv_z$} and the root-mean-square of the flow averaged with the target function (vxtgt\hbox{$\vtarg_x$} and vztgt\hbox{$\vtarg_z$}) and averaged with the resulting averaging kernel (vxinv\hbox{$\vakern_x$} and vzinv\hbox{$\vakern_z$}). It may be that this selection of μ leads to an averaging kernel which does not resemble the desired target function at all, in which case one would have to allow for a coarser spatial resolution.

Table 2

Predicted inversion errors and the expected magnitude of the velocities at the three different depths discussed throughout this paper.

For the given μ, we then choose ν so that the degree of cross-talk is less than 10-5 Mm-3. We observe that this constraint places the following upper bound on the magnitude of the cross-talk terms: max|βα𝒦βα(r,z;z0)|max|𝒦αα(r,z;z0)|<0.05.\begin{equation} \frac{{\rm max} \left|\sum\limits_{\beta \ne \alpha} \cK^\alpha_\beta(\br, z; z_0)\right|}{{\rm max\ }\left|\cK^\alpha_\alpha(\br, z; z_0)\right|} < 0.05 . \end{equation}(25)The parameter ϵ controls the degree of spatial confinement of the inversion weights and has an impact on the misfit and error as well. If ϵ is too large, the weights will be highly localised in space around the central point, but a large misfit will result.

For the particular cases that we study here, we find that localisation of weights is accomplished when S < 2 × 10-3   km2   s-2. The impact of this regularisation term is illustrated in Fig. 3. For the case shown, weights are confined to within a disc of radius  ~50 Mm.

thumbnail Fig. 3

Example inversion weights wax(x,y)\hbox{$w_a^x(x,y)$} for 1 Mm depth and effect of the minimisation of the spread S. A strong regularisation ϵ confines the inversion weights in the spatial domain (bottom row, ϵ = 100.5) compared to the case ϵ = 0 (top row). The cut at y = 0 (horizontal black line) is displayed on the right-hand-side panel, where the values of wax(x,y=0)\hbox{$w^x_a(x,y=0)$} were scaled by wax(x=0,y=0)\hbox{$w^x_a(x=0,y=0)$}.

3.4. The code

The code implementing the above described procedure is written in Matlab. Matlab provides a compromise between computational efficiency and the availability of higher-level software constructs that make the code lucid, modular, and easy to modify. Matlab-based code may be compiled into binary-executable form suitable for pipeline pseudo-automatic processing. The input is a text file, in which the user specifies all relevant parameters including the kernels to be used in the inversion and the set of trade-off parameters to be investigated. This allows for the development of user interfaces such as the web-based or graphical interface, which can serve as the user front-end of the inversion code.

The code is parallel and scales linearly with number of parallel jobs. The execution is fast, e.g., a set of inversions for one flow component involving 240 kernels, 200  ×  200 spatial points, and a grid of 200 trade-off parameter values takes around 6 h using 48 Opteron-2.3 GHz CPUs. When many CPUs are involved, the extensive input-output load becomes a bottle-neck and affects the total processing time.

4. Inversion for horizontal flow

We applied the above-described method to known synthetic travel times in order to validate the set-up and the performance of the inversion procedure. We focus first on the horizontal flow components, i.e., α = x or α = y. We compare two principal quantities: (1) the flow map that is obtained by convolving the known velocity field with the target function (vαtgt\hbox{$\vtarg_\alpha$}), i.e., the best-case inversion scenario, and (2) the flow map actually resulting from the inversion (vαinv˜\hbox{$\vinv_\alpha$}). In this manner, we may investigate in detail different sources of bias in the results.

The simulation convolved with the target function gives us vαtgt\hbox{$\vtarg_\alpha$}, from which we estimate the expected magnitude of the flow that we want to invert for (see Table 2). This places limits on the required noise level of the inversion so that the results have signal-to-noise ratios larger than 1. Requirements on the targeted error level fix the trade-off parameter μ.

The choice of the target function depends on the discretion and needs of the user. Here we focus on layers in the top few Mm, in particular on depths  − z0 = 1, 3.5, and 5.5 Mm. This set-up was selected because similar flow inversions were also performed and discussed by Jackiewicz et al. (2008), which makes it possible to compare the results of both methods. As discussed in the preceding sections, the outcome of the inversion is a set of weights waα(r)\hbox{$w^\alpha_a(\br)$}, which are used to combine the travel-time maps in order to obtain estimates of velocity vxinv˜(r)\hbox{$\vinv_x(\br)$} and vyinv˜(r)\hbox{$\vinv_y(\br)$}. Example weights for vxinv˜\hbox{$\vinv_x$} are displayed in Fig. 3.

All components of the averaging kernels 𝒦βx(r,z;z0)\hbox{$\cK^x_\beta(\br,z;z_0)$} are shown in Figs. 2022. We also show comparisons between inversions when the cross-talk is minimised and not minimised. The minimisation of the cross-talk with ν = 100 is very efficient. However, for depths of 3.5 and 5.5 Mm the minimisation of the cross-talk introduces some small artefacts in the inversion averaging kernel 𝒦xx\hbox{$\cK^x_x$}, which are a small price to pay.

Figure 4 shows vertical cuts through the target functions and the averaging kernels when the cross-talk is minimised. Except for the target depth of 5.5 Mm, the depth dependences of the averaging kernels 𝒦xx\hbox{$\cK^x_x$} resemble that of the target functions. The near-surface inversion at 1 Mm is dominated by the f-mode. The inversion at 3.5 Mm has equal contributions from f and p1, with opposite signs, as shown in Fig. 23 in the on-line supplement, where we plot the contribution of individual modes to the averaging kernels 𝒦xx\hbox{$\cK_x^x$} as a function of depth.

thumbnail Fig. 4

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for vx inversion with minimised cross-talk, using travel times averaged over 4 days, at three discussed depths (1, 3.5, and 5.5 Mm).

thumbnail Fig. 5

Comparison of inverted vx with input data. Top-row panels show the input flow field convolved with the target function and the averaging kernel respectively. Bottom rows show inversion results in cases when cross-talk is ignored (left) and minimised (right). Random errors of the inversion are given in Table 2.

thumbnail Fig. 6

Inversion biases for vx at 1 Mm depth. Left: noiseless vxinv\hbox{$\vakern_x$} with cross-talk minimised versus the ideal vxtgt\hbox{$\vtarg_x$}. The departure from slope unity (grey dashed line) is due to an imperfect match between \hbox{$\cK$} and \hbox{$\cT$}. Right: noisy vxinv˜\hbox{$\vinv_x$} versus noiseless vxinv\hbox{$\vakern_x$}. The results are plotted in two cases: when the cross-talk is ignored (grey  × ) and minimised (black  + ). The linear fit to the black crosses coincides with the dashed line of slope unity. The black dotted lines represent the predicted error of 14 m s-1, which is consistent with the observed scatter of the black crosses.

thumbnail Fig. 7

The azimuthally-averaged power spectra of various components of the vx inversions using travel times averaged over 4 days. For reference, we plot the power spectrum of vxinv\hbox{$\vakern_x$} (thick grey solid line). We plot also the power spectrum of vxinv˜\hbox{$\vinv_x$} (solid line) and the power spectrum of the noise (i.e., the power spectrum of vxinv˜vxinv\hbox{$\vinv_x-\vakern_x$}; dashed line) for the inversion where the cross-talk is minimised (black) and ignored (grey).

The validation of the vx inversion is demonstrated in Fig. 5. Here we plot the desired vxtgt\hbox{$\vtarg_x$} at three different depths and the inverted vxinv\hbox{$\vakern_x$} without noise contributions. These two are very close for the depths 1 and 3.5 Mm. The differences between vxtgt\hbox{$\vtarg_x$} and vxinv\hbox{$\vakern_x$} at depth 5.5 Mm are caused by an imperfect averaging kernel. When the random noise is added to the solution (bottom row of Fig. 5), we see that the inversions for vx at 1 Mm and 3.5 Mm are still very good, while the inverted vx at depth 5.5 Mm is dominated by noise. Minimising cross-talk does not improve the quality of the solution in this case.

In order to quantify the inversion biases, we compare directly the expected and inverted values for a set of spatial locations. We select points separated by 7.5 Mm in each horizontal coordinate. This sampling interval is equal to the half of the horizontal FWHM of the target function (thus the points are somewhat independent). In addition, to avoid possible edge effects, we cut the outer part of the horizontal plane so that only the central 200  ×  200 pixel patch is kept. The scatter plots comparing various inversion components are displayed in Fig. 6 for the depth of 1 Mm, the plots for the depth of 3.5 Mm are qualitatively similar. We estimate that for the depth of 1 Mm, the imperfect averaging kernel leads to an average underestimation of the horizontal flow components by some 20% (Fig. 6 left). Furthermore we note that in the case of the inversion for the horizontal flow components, the bias caused by the cross-talk is not important and the random noise level corresponds to the predicted value (Fig. 6 right). The results for the depth 3.5 Mm are similar, while the results for the depth 5.5 Mm are dominated by the random noise.

Table 3

Correlation analysis of various inversion components using synthetic travel times averaged over 4 days.

The noise domination of vx inversion at depth of 5.5 Mm is evident from the plot of the azimuthally averaged power spectra of the random noise and of the signal as a function of k   R (Fig. 7 right). For comparison, similar plots for the depths 1 Mm and 3.5 Mm are given also in Fig. 7, where the power spectrum of the signal is well above the random noise near supergranular spatial scales. The decrease in power of the signal at low kR in Fig. 7 is because the convection simulation does not contain these scales.

We showed that it is possible to retrieve vx and vy in the top 3.5 Mm without noticeable bias and within the predicted noise level of  ~25 m/s (for observing time T = 4 days) with 240 different travel-time measurements for ridges f to p4. In Table 3, we summarise our findings. We state the statistical quantities (correlation coefficient and the slope of the linear fit) comparing vαinv˜\hbox{$\vinv_\alpha$} and the corresponding vαinv\hbox{$\vakern_\alpha$}. We also show the signal-to-noise ratio of the inverted vαinv˜\hbox{$\vinv_\alpha$}.

The cross-talk is unimportant in inversions for the horizontal flow components. The cross-talk could come only from vz, which is weak. We estimate that the influence of the cross-talk is less than 5%. The inversion at 5.5 Mm is already dominated by noise. If we drop the requirement on horizontal resolution, inversions at a depth of 5.5 Mm are also possible (e.g., with a FWHM of sh = 25 Mm).

It is possible to perform inversions with less temporal averaging, but to obtain a reasonable signal-to-noise ratio, we would have to relax the demand on the match between the averaging kernel and the target function. Typically, this leads to side-lobes in the averaging kernel (especially in the z-direction), which may make interpretation of the results more difficult. Another possibility would be to include a larger number of independent travel-time measurements (and therefore more sensitivity kernels). These issues are being worked on.

5. Inversion for vertical flow

5.1. Specificity of inversions for vertical flow

Inversions for vz require a different methodology because sensitivity kernels Kza\hbox{$K_z^a$} have zero horizontal integrals at each depth (and therefore zero total integral) implying (see Eq.(6)) +d2r𝒦zz(r,z;z0)=0,forallz.\begin{equation} \int\limits_{-\infty}^{+\infty} \id^2\br\, \cK_z^z(\br,z';z_0) = 0,\ \foral z' . \label{eq:zero_vz_sensitivity} \end{equation}(26)Consequently, with measurements discussed here, it is impossible to retrieve horizontal average  ⟨ vz ⟩  of vertical flow and we may only invert for fluctuations vz −  ⟨ vz ⟩ . Equation (26) implies

d2rdz𝒦βz(r,z;z0)=0forβ=z\begin{equation} \int_\odot \id^2\br\,\id z' \, \cK^z_\beta(\br,z';z_0) = 0\ {\rm for\ } \beta=z\ \end{equation}(27)and thus Eq. (17) cannot be written for α = z. This implies in turn that the matrix Eq. (23) for k = 0 may not be written either.

5.2. Ignoring k=0

A solution to this issue is simply to replace Eq. (23) by za(0)0foralla.\begin{equation} \bar{w}^z_a(\bzero) \equiv 0\ \foral a . \label{eq:problem-zerok-new} \end{equation}(28)Since the averaging kernel 𝒦zz\hbox{$\cK_z^z$} must have zero integral, the averaging kernel will be offset by a small negative constant away from the central peak. Luckily, regularization by the term (15) will ensure that the averaging kernel will drop to zero at the edge of the inversion box. However, this solution is not quite satisfactory, because we do not have very much control over this extended negative surrounding sidelobe.

5.3. Target function with zero mean

It is more elegant to select a target function \hbox{$\cT(\br-\br_0,z;z_0)$} with vanishing horizontal integral at each depth. If this target function is to peak around x0 = (r0,z0), then this peak must be compensated by negative side-lobes in horizontal directions. These side-lobes have to be constructed in such a way that they do not lead to significant biases. We suggest to choose 𝒯(r,z;z0)=H(r)4ln2πszexp[4ln2sz2(zz0)2],\begin{equation} \cT(\br,z;z_0)=H(\br) \frac{\sqrt{4\ln2}}{\sqrt{\pi} s_z} \exp{\left[-\frac{4\ln2}{s_z^2}{(z-z_0)^2}\right]}\ , \end{equation}(29)where H(r)=4ln2πsh2exp[4ln2sh2r2]\begin{eqnarray} H(\br) & = & \frac{4\ln2}{\pi s_h^2} \exp{\left[-\frac{4\ln2}{s_h^2}{\|\br\|^2}\right]} \nonumber \\ & & - \frac{1}{c} \frac{4\ln2}{\pi (ns_h)^2} \exp{\left[-\frac{4\ln2}{(ns_h)^2} \left(\|\br\|-\frac{s_h}{2\sqrt{2\ln2}}\right)^2\right]}\cdot \label{eq:improved_target1} \end{eqnarray}(30)The constant c=exp(12n2)+π2n2[1+erf(1n2)]\begin{equation} c=\exp{\left( -\frac{1}{2n^2} \right)}+\sqrt{\frac{\pi}{2n^2}}\left[1+{\rm erf}{\left(\frac{1}{n\sqrt{2}} \right)} \right] \end{equation}(31)ensures that the horizontal integral of \hbox{$\cT$} is zero. The horizontal part of the target function is constructed from a Gaussian peaked at r0 = 0, from which a wide surrounding Gaussian annulus is subtracted. The free parameter n balances the width of a side-lobe compared to the width of the central peak.

The inversion is performed for each k ≠ 0 with this new target function, together with za(k=0)=0\hbox{$\bar{w}^z_a(\bk=\bzero)=0$}.

5.4. Both target functions provide similar answers

We compared the results obtained using the two target functions proposed above and found two conclusions.

The two solutions described above provide results which are very close (see, e.g., Fig. 24). However, the solution with the target function having zero horizontal average is elegant and provides more control over the solution averaging kernel.

By comparing the results obtained with various sizes of the negative annulus surrounding the central peak we found that the bias in the inverted flow caused by this negative sidelobe is negligible if n > 3 in Eq. (30).

The results presented in the following sections are obtained using the first formalism.

thumbnail Fig. 8

Horizontal spatial cross-correlation between vx and vz from the simulation in the top 1 Mm. These flow correlations together with a non-vanishing cross-talk averaging kernel 𝒦xz\hbox{$\cK_x^z$} may lead to a systematic bias of the inferred vz. The circle of radius of 15 Mm shows the width of the desired target function.

thumbnail Fig. 9

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for vz inversion using travel times averaged over 4 days. Compare with Fig. 4.

5.5. Validation of vertical flow inversion

As is evident already, inversions for vertical flow are not as seamless as in the case of horizontal components. This is mostly because the vertical flow is much weaker on supergranular scales than the horizontal flow. As shown in Table 2, the expected rms of the vertical flow in the top layers is of the order of 5 m s-1. Therefore, the predicted noise of the inverted vz has to be set to a much smaller value than for the horizontal velocities. As a result, the match between the desired target function and the resulting averaging kernel will have to be poorer.

Furthermore, it is absolutely crucial to minimise the cross-talk to avoid the leakage of the large horizontal velocities into the small inferred vertical velocity. Even an apparently negligible cross-talk averaging kernel could in the end cause a significant bias in the results. Minimising the cross-talk is especially important because of the natural correlations between the vertical and horizontal flow components in the mass-conserving flow of the supergranules (see Fig. 8). In the upper layers, horizontal outflows are associated with upflows. The structure of the cross-talk averaging kernel 𝒦xz\hbox{$\cK^z_x$} as shown in the top half of Fig. 10 in the case when cross-talk is not minimised then implies a negative bias in vz due to vzinv(x)\hbox{$\vakern_z{}^{(x)}$} and vzinv(y)\hbox{$\vakern_z{}^{(y)}$}.

thumbnail Fig. 10

All components of the averaging kernel for vz inversion at 1 Mm depth with a FWHM of sz = 1.1 Mm and sh = 15 Mm. Bottom row: with cross-talk minimised, top row: cross-talk is ignored. The cross-talk is presented in the form of 𝒦xz\hbox{$\cK_x^z$} and 𝒦yz\hbox{$\cK_y^z$} averaging kernel components. Random error of the results is 3 m s-1 when assuming data averaged over 4 days. Over-plotted contours, which are also marked on the colour bar for reference, denote the following: half-maximum of the kernel (white), half-maximum of the target function (black), and by grey lines  ± 5% of the maximum value of the kernel (solid and dotted, respectively).

thumbnail Fig. 11

Comparison of inverted vz with input data, compare with Fig. 5.  ⟨ vz ⟩  = 11 m s-1.

thumbnail Fig. 12

All components of the vz inversion at 1 Mm depth. Top row with the cross-talk ignored, bottom row with the cross-talk minimised. We demonstrate that if cross-talk is not addressed, horizontal components will leak into the inverted vz and cause a bias.

Figure 9 shows vertical cuts through the target functions and the averaging kernels when the cross-talk is minimised. The averaging kernels 𝒦zz\hbox{$\cK^z_z$} for the inversions at depth 3.5 and 5.5 Mm have sidelobes towards the surface. All components of the averaging kernel 𝒦βz(r,z;z0)\hbox{$\cK^z_\beta(\br,z;z_0)$} for vz are displayed in Fig. 10 (and Figs. 25, 26), again comparing the cases when the cross-talk is and is not minimised. The action of the cross-talk minimisation term is very efficient.

thumbnail Fig. 13

Inversion biases for vz at 1 Mm depth, similar to Fig. 6. While the black line (with minimised cross-talk) almost coincides with the dashed line of slope unity, the grey one (cross-talk ignored) indicates a horrible bias. The predicted error is 3 m s-1. The inversion is not sensitive to the horizontally averaged horizontal flow ⟨vz⟩ = 11 m s-1, therefore it was subtracted from vztgt\hbox{$\vtarg_z$} in this comparison.

thumbnail Fig. 14

The azimuthally-averaged power spectra of various components of the vz inversions using travel times averaged over 4 days. Compare with Fig. 7 for vx inversions. Notice that in the case of vzinv˜\hbox{$\vinv_z$} containing the cross-talk at 1 Mm depth, the excess in power around kR ~ 150 is not caused by the random noise, but by the bias coming from the cross-talk (not plotted separately). In the other two cases the random noise is a main cause why these inversions are not possible.

The validation of the vz inversion is demonstrated in Fig. 11. Here we plot vztgt\hbox{$\vtarg_z$} at three different depths and the inverted vzinv\hbox{$\vakern_z$} without noise contributions. These two are very similiar at the depth of 1 Mm. Small differences between vztgt\hbox{$\vtarg_z$} and vzinv\hbox{$\vakern_z$} caused by an imperfect averaging kernel are visible at the depth of 3.5 Mm, become more significant at depth 5.5 Mm. When random noise is added to the solution (bottom row of Fig. 11), we see that the inversion for vertical flow is possible at 1 Mm depth only if cross-talk is minimised. Inversions at depths 3.5 and 5.5 Mm are buried in random noise, where minimising cross-talk does not help. The magnitude of cross-talk at a depth 1 Mm is much larger than that of the vertical flow; further, cross-talk is highly anti-correlated with the vertical flow, which makes the correlation coefficient between vztgt\hbox{$\vtarg_z$} and vzinv˜\hbox{$\vinv_z$} close to  − 1. Note that a similarly high anti-correlation was measured by Zhao et al. (2007).

All components vzinv(β)\hbox{$\vakern_z{}^{(\beta)}$} and the random noise component in the inversion for vz at 1 Mm depth are displayed in Fig. 12. We see in the top row that the leakage of the horizontal components (vzinv(x)\hbox{$\vakern_z{}^{(x)}$} and vzinv(y)\hbox{$\vakern_z{}^{(y)}$}) covers up completely the weak signal of the vertical flow when the cross-talk is not minimised.

Biases in the vz inversion at a depth of 1 Mm may be quantified by directly comparing expected and inverted values for a set of spatial locations in the horizontal plane (Fig. 13), as in the case of the vx inversion (Sect. 4). The imperfect averaging kernel in this case does not cause any significant bias (Fig. 13 left). The vz inversion is not sensitive to the horizontally averaged vertical flow  ⟨ vz ⟩ , which is therefore subtracted from vztgt\hbox{$\vtarg_z$} in the corresponding plot. The effect of minimising cross-talk is shown in Fig. 13 left. The inverted vzinv˜\hbox{$\vinv_z$} is anti-correlated with vzinv\hbox{$\vakern_z$} expected from the inversion due to the leakage of the horizontal mass-conserving flow components into the vertical one.

Meaningful inversions for the vertical flow are possible in shallow near-subsurface layers of the convection zone only when the cross-talk between the vertical and the horizontal components is minimised. Vertical flow inversions on supergranular horizontal scales at depths greater than  ~1 Mm performed using f to p4 modes and travel time maps averaged over 4 days are dominated by random noise and the signal of the vertical flow may not therefore be inferred at all (see summary in Table 3). This fact is also demonstrated in Fig. 14, where we plot power spectra of individual inversion components as a function of the spatial scale. At depths larger than 1 Mm, the power of the signal is much less than the power of the random noise regardless of the spatial scale.

thumbnail Fig. 15

Comparison at three depths of statistical averages of velocity-vector over many flow realisations. The horizontal flow is displayed by arrows while the vertical flow is colour-coded. The reference arrow indicates 100 m s-1, the random errors of the inversions are given in Table 2. Compare to Figs. 5 and 11.

6. Beating the noise: statistical averaging

As demonstrated in previous sections, it is very difficult to have a meaningful inversion for solar flows on supergranular scales averaged over a short time, even in the shallow near-subsurface layers of the solar convection zone. The signal is overwhelmed by random noise. It is not feasible to average over longer times, because the expected lifetime of convection on supergranular scales is on the order of a day. By averaging over longer time, a significant portion of the scientifically useful information would be lost.

It seems feasible to solve the issue by averaging over many realisations of similar flow structures each averaged over short time (therefore noisy). This concept was already used by other authors (e.g., Duvall et al. 2006). As an example we note the possibility to measure the flows in many individual supergranules and to average over this sample in order to obtain the typical flow structure in an “average supergranule”, similar to Duvall & Birch (2010). The predicted error in the results after averaging scales as 1/𝒩\hbox{$1/\!\!\sqrt{\cal{N}}$}, where \hbox{$\cal{N}$} is the size of the ensemble of independent flow realisations. This scaling law allows us to relax the noise constraint on the inversion and therefore regularise more strongly about the misfit and cross-talk terms. See Table 2 for required random error of inversions results.

For instance, let us assume that we average over 104 independent inverted flows, each obtained by considering travel-time maps averaged over 6 h. Individual flow maps are noisy and therefore contain little useful information about underlying flows. This selection allows us to relax the constraint on the estimated error level by a factor of a 100 and therefore obtain a much better fit to the target function (see Fig. 27). With this set-up, the validation displayed in Fig. 15 shows almost perfect correspondence between expected vtgt and inverted ˜vinv\hbox{$\bvinv$} flows in the top 5.5 Mm of the convection zone. Inversion for vz at 5.5 Mm with a FWHM of sh = 15 Mm is an exception, because the selected target function cannot be matched by the averaging kernel. Different inversions for vz at this depth are possible with greater averaging, e.g., with sh = 25 Mm. It is still crucial to minimise cross-talk in the case of vz inversions to retrieve the correct answer. Power spectra of inversion components at all discussed depths are displayed in Fig. 28.

7. Conclusions

We improved and validated an inversion algorithm based on a SOLA inversion approach. The formalism, algorithm and the code is universal and can, in principle, be used to invert for any quantity describing inhomogeneities in the solar plasma, provided that the corresponding sensitivity kernels are available. The code is also ready to be used for application to real measurements. It will become part of the helioseismic pipeline running at German Data Center for SDO. We plan to use it to analyse all available SOHO/MDI and SDO/HMI data in order to routinely provide a tomographic image of the structure of the solar upper convection zone.

The code is absolutely scalable allowing to include more independent measurements (and therefore more sensitivity kernels) in order to further refine the precision of the results. Thanks to the decoupling, the problem will still be solvable even using nowadays computers.

We improved the inversions by introducing additional terms that allow to control and minimise some sources of bias in the results. Most importantly, we minimise the cross-talk between individual flow components, which is crucial especially for vz inversions. In principle, the formalism allows to minimise cross-talk between any selected quantities during the general inversion. The validation performed here silently assumes that the sensitivity kernels and noise covariance matrices are perfect. For the validation of the method and the code it is important if the sensitivity kernels and noise covariance matrices are solar-like. This allows us to study different sources of biases we may expect in the Sun. It is also very important that the sensitivity kernels used in the inversion contain all the details of the travel-time measurements, including the instrumental function affecting the solar oscillation power spectrum.

We found that by considering f to p4 frequency-averaged modes and supergranular spatial scales it is possible to perform reliable and trustworthy flow inversions of the travel-time maps averaged over a few days in the top 3.5 Mm layer of the convection zone in the case of horizontal vx and vy components, and in the top 1 Mm in the case of vertical vz component. Based on our experiment, we expect that using travel-time maps averaged over 4 days, it is possible to measure 3D velocities as weak as 10 m s-1 at the surface and horizontal velocities having amplitude 20 m s-1 in the top 5 Mm. We estimate, that using the travel-times averaged over 1 day, it should still be possible to measure all components of the supergranular flow at the surface and its horizontal components in the top 5 Mm. The cross-talk minimisation is crucial in order to measure the correct vertical velocity. Its presence in the results may explain the opposite sign of the vertical flow inversion discovered by Zhao et al. (2007).

By considering many flow realisations and statistical averaging we might go deeper in the convection zone to learn about the horizontal flow components and perform vertical flow inversions for the top few Mm depths.

Online material

thumbnail Fig. 16

Slices through the x and z components of the simulated flow v at depths 1 Mm, 3.5 Mm, and 5.5 Mm.

thumbnail Fig. 17

The point-to-annulus sensitivity kernels for three flow components computed for the f-mode, distance 28 Mm and outward-inward geometry. The white circle represents the location of the averaging annulus.

thumbnail Fig. 18

Horizontal averages of sensitivity kernels may serve useful when estimating which depths are easier to target. The trend for each mode/ridge was obtained by taking Kxa(r;z)d2r\hbox{$\int K_x^a(\br;z) \id^2\br$} and averaging over all as within the given mode.

thumbnail Fig. 19

An example noise-covariance matrix for f-mode travel times averaged over 6 h. In this plot, a stands for the combination of the f-mode, oi geometry, and annulus radius of 7.3 Mm, b stands for the combination of f-mode, we geometry, and annulus radius of 8.8 Mm.

thumbnail Fig. 20

All components of the averaging kernel for vx inversion at 1 Mm depth with a FWHM of sz = 1.1 Mm and sh = 15 Mm. Bottom row: with cross-talk minimised, top row: cross-talk is ignored. Random error of the results is 14 m s-1 when assuming data averaged over 4 days. Over-plotted contours, which are also marked on the colour bar for reference, denote the following: half-maximum of the kernel (white), half-maximum of the target function (red), and  ± 5% of the maximum value of the kernel (blue and green, respectively).

thumbnail Fig. 21

All components of the averaging kernel for vx inversion at 3.5 Mm depth with a FWHM of sz = 2.2 Mm and sh = 15 Mm. Random error of the results is 20 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

thumbnail Fig. 22

All components of the averaging kernel for vx inversion at 5.5 Mm depth with a FWHM of sz = 3.5 Mm and sh = 15 Mm. Random error of the results is 28 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

thumbnail Fig. 23

The contributions of particular modes to the horizontally averaged averaging kernel for vx inversions using travel times averaged over 4 days for depths 1 and 3.5 Mm. We do not display the inversion for the depth of 5.5 Mm, because it is heavily dominated by noise.

thumbnail Fig. 24

To solve the peculiarity of the vz inversion, we introduced two formalisms in Sect. 5.1. Here we plot performance of those. In black, the magnified section a y = 0 and z = z0 of different target functions are displayed, one with removed mean (1) and one constructed with negative side-lobes (2). The resulting averaging kernels are also plotted. It is evident that the resulting averaging kernels are qualitatively very similar even when different formalisms were used to compute them.

thumbnail Fig. 25

All components of the averaging kernel for vz inversion at 3.5 Mm depth with a FWHM of sz = 2.2 Mm and sh = 15 Mm. Random error of the results is 13 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

thumbnail Fig. 26

All components of the averaging kernel for vz inversion at 5.5 Mm depth with a FWHM of sz = 3.5 Mm and sh = 15 Mm. Random error of the results is 133 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

thumbnail Fig. 27

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for the vx (left) and vz (right) inversions using averaging over many flow realisations plotted along with the corresponding target functions at three discussed depths (1 Mm in blue, 3.5 Mm in green, and 5.5 Mm in red). Compare to Figs. 4 and 9 where the resemblance of the target functions is worse. The random error of the results is given in Table 2.

thumbnail Fig. 28

The azimuthally-averaged power spectra of the vz inversion components at depths of 1, 3.5 and 5.5 Mm for averaging over many flow representations. For reference, we plot the power spectrum of vzinv\hbox{$\vakern_z$} using the black solid line. Then we plot the power spectrum of vzinv˜\hbox{$\vinv_z$} (solid line) and power spectrum of the noise (i.e., the power spectrum of vzinv˜vzinv\hbox{$\vinv_z-\vakern_z$}; dashed line) for the inversion where the cross-talk is minimised (blue) and ignored (red). Compare to Fig. 14.

Acknowledgments

This study was supported by the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007–2013)/ERC grant agreement #210949, “Seismic Imaging of the Solar Interior”, to PI L. Gizon (Milestone #5). Authors would like to thank J. Jackiewicz for providing us with the noise covariance matrices. The flow sensitivity kernels were computed using the code written by A. C. Birch deployed within the HELAS project at http://www.mps.mpg.de/projects/seismo/NA4. M.Š. acknowledges a partial support through the Grant Agency of Academy of Sciences of the Czech Republic under grant IAA30030808.

References

  1. Backus, G. E., & Gilbert, J. F. 1968, Geophys. J., 16, 169 [NASA ADS] [CrossRef] [Google Scholar]
  2. Backus, G. E., & Gilbert, J. F. 1970, Roy. Soc. London Philos. Trans. Ser. A, 266, 123 [Google Scholar]
  3. Benson, D., Stein, R., & Nordlund, Å. 2006, in Solar MHD Theory and Observations: A High Spatial Resolution Perspective, ed. J. Leibacher, R. F. Stein, & H. Uitenbroek, ASP Conf. Ser., 354, 92 [Google Scholar]
  4. Birch, A. C., & Gizon, L. 2007, Astron. Nachr., 328, 228 [NASA ADS] [CrossRef] [Google Scholar]
  5. Cameron, R., Gizon, L., & Duvall, Jr., T. L. 2008, Sol. Phys., 251, 291 [NASA ADS] [CrossRef] [Google Scholar]
  6. Christensen-Dalsgaard, J., Dappen, W., Ajukov, S. V., et al. 1996, Science, 272, 1286 [CrossRef] [Google Scholar]
  7. Couvidat, S., Birch, A. C., & Kosovichev, A. G. 2006, ApJ, 640, 516 [NASA ADS] [CrossRef] [Google Scholar]
  8. Duvall, Jr., T. L., & Birch, A. C. 2010, ApJ, 725, L47 [NASA ADS] [CrossRef] [Google Scholar]
  9. Duvall, Jr., T. L., & Gizon, L. 2000, Sol. Phys., 192, 177 [NASA ADS] [CrossRef] [Google Scholar]
  10. Duvall, Jr., T. L.,Jefferies, S. M., Harvey, J. W., & Pomerantz, M. A. 1993, Nature, 362, 430 [NASA ADS] [CrossRef] [Google Scholar]
  11. Duvall, Jr., T. L., D’Silva, S., Jefferies, S. M., Harvey, J. W., & Schou, J. 1996, Nature, 379, 235 [NASA ADS] [CrossRef] [Google Scholar]
  12. Duvall, Jr., T. L.,Kosovichev, A. G., Scherrer, P. H., et al. 1997, Sol. Phys., 170, 63 [NASA ADS] [CrossRef] [Google Scholar]
  13. Duvall, Jr., T. L.,Birch, A. C., & Gizon, L. 2006, ApJ, 646, 553 [NASA ADS] [CrossRef] [Google Scholar]
  14. Gizon, L., & Birch, A. C. 2004, ApJ, 614, 472 [NASA ADS] [CrossRef] [Google Scholar]
  15. Gizon, L., & Birch, A. C. 2005, Liv. Rev. Sol. Phys., 2, 6 [Google Scholar]
  16. Gizon, L., Duvall, Jr., T. L., & Larsen, R. M. 2000, J. Astrophys. Astron., 21, 339 [Google Scholar]
  17. Gizon, L., Schunker, H., Baldner, C. S., et al. 2009, Space Sci. Rev., 144, 249 [NASA ADS] [CrossRef] [Google Scholar]
  18. Gizon, L., Birch, A. C., & Spruit, H. C. 2010, ARA&A, 48, 289 [Google Scholar]
  19. Hanasoge, S. M., & Duvall, T. L. 2009, ApJ, 693, 1678 [NASA ADS] [CrossRef] [Google Scholar]
  20. Hansen, P. C. 1998, Rank-deficient and discrete ill-posed problems: numerical aspects of linear inversion (Philadelphia, PA, USA: Society for Industrial and Applied Mathematics) [Google Scholar]
  21. Jackiewicz, J., Gizon, L., Birch, A. C., & Thompson, M. J. 2007, Astron. Nachr., 328, 234 [NASA ADS] [CrossRef] [Google Scholar]
  22. Jackiewicz, J., Gizon, L., & Birch, A. C. 2008, Sol. Phys., 251, 381 [Google Scholar]
  23. Jackiewicz, J., Birch, A. C., Gizon, L., Hanasoge, S., & Hohage, T. 2011, Sol. Phys., accepted [Google Scholar]
  24. Jensen, J. M., Jacobsen, B. H., & Christensen-Dalsgaard, J. 1998, in Structure and Dynamics of the Interior of the Sun and Sun-like Stars, ed. S. Korzennik, ESA SP, 418, 635 [NASA ADS] [Google Scholar]
  25. Kosovichev, A. G. 1996, ApJ, 461, L55 [NASA ADS] [CrossRef] [Google Scholar]
  26. Kosovichev, A. G., & Duvall, Jr., T. L. 1997, in SCORe’96: Solar Convection and Oscillations and their Relationship, ed. F. P. Pijpers, J. Christensen-Dalsgaard, & C. S. Rosenthal, Astrophys. Space Sci. Lib., 225, 241 [Google Scholar]
  27. Moradi, H., Baldner, C., Birch, A. C., et al. 2010, Sol. Phys., 267, 1 [NASA ADS] [CrossRef] [Google Scholar]
  28. Pijpers, F. P., & Thompson, M. J. 1992, A&A, 262, L33 [NASA ADS] [Google Scholar]
  29. Rempel, M., Schüssler, M., & Knölker, M. 2009, ApJ, 691, 640 [NASA ADS] [CrossRef] [Google Scholar]
  30. Scherrer, P. H., Bogart, R. S., Bush, R. I., et al. 1995, Sol. Phys., 162, 129 [Google Scholar]
  31. Ustyugov, S. D. 2008, in Subsurface and Atmospheric Influences on Solar Activity, ed. R. Howe, R. W. Komm, K. S. Balasubramaniam, & G. J. D. Petrie, ASP Conf. Ser., 383, 43 [Google Scholar]
  32. Vögler, A., Shelyag, S., Schüssler, M., et al. 2005, A&A, 429, 335 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. Zhao, J., & Kosovichev, A. G. 2004, ApJ, 603, 776 [NASA ADS] [CrossRef] [Google Scholar]
  34. Zhao, J., & Kosovichev, A. G., & Duvall, Jr., T. L. 2001, ApJ, 557, 384 [NASA ADS] [CrossRef] [Google Scholar]
  35. Zhao, J., Georgobiani, D., Kosovichev, A. G., et al. 2007, ApJ, 659, 848 [NASA ADS] [CrossRef] [Google Scholar]
  36. Zhao, J., Parchevsky, K. V., Hartlep, T., & Kosovichev, A. G. 2010, in Am. Astron. Soc. Meet. Abstr., 216, 319.05 [Google Scholar]

All Tables

Table 1

Notations.

Table 2

Predicted inversion errors and the expected magnitude of the velocities at the three different depths discussed throughout this paper.

Table 3

Correlation analysis of various inversion components using synthetic travel times averaged over 4 days.

All Figures

thumbnail Fig. 1

Examples of synthetic travel-time maps used in this paper (top row) and observed travel times from SOHO/MDI (bottom row). Temporal length of observation is T = 6 h.

In the text
thumbnail Fig. 2

Comparison of azimuthally averaged spatial power spectra \hbox{$| \bar{\tau}^a(k) |^2$} of synthetic (solid) and observed (dashed) travel times displayed in Fig. 1 as a function of k   R, where k is the horizontal wave-number and R is the radius of the Sun.

In the text
thumbnail Fig. 3

Example inversion weights wax(x,y)\hbox{$w_a^x(x,y)$} for 1 Mm depth and effect of the minimisation of the spread S. A strong regularisation ϵ confines the inversion weights in the spatial domain (bottom row, ϵ = 100.5) compared to the case ϵ = 0 (top row). The cut at y = 0 (horizontal black line) is displayed on the right-hand-side panel, where the values of wax(x,y=0)\hbox{$w^x_a(x,y=0)$} were scaled by wax(x=0,y=0)\hbox{$w^x_a(x=0,y=0)$}.

In the text
thumbnail Fig. 4

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for vx inversion with minimised cross-talk, using travel times averaged over 4 days, at three discussed depths (1, 3.5, and 5.5 Mm).

In the text
thumbnail Fig. 5

Comparison of inverted vx with input data. Top-row panels show the input flow field convolved with the target function and the averaging kernel respectively. Bottom rows show inversion results in cases when cross-talk is ignored (left) and minimised (right). Random errors of the inversion are given in Table 2.

In the text
thumbnail Fig. 6

Inversion biases for vx at 1 Mm depth. Left: noiseless vxinv\hbox{$\vakern_x$} with cross-talk minimised versus the ideal vxtgt\hbox{$\vtarg_x$}. The departure from slope unity (grey dashed line) is due to an imperfect match between \hbox{$\cK$} and \hbox{$\cT$}. Right: noisy vxinv˜\hbox{$\vinv_x$} versus noiseless vxinv\hbox{$\vakern_x$}. The results are plotted in two cases: when the cross-talk is ignored (grey  × ) and minimised (black  + ). The linear fit to the black crosses coincides with the dashed line of slope unity. The black dotted lines represent the predicted error of 14 m s-1, which is consistent with the observed scatter of the black crosses.

In the text
thumbnail Fig. 7

The azimuthally-averaged power spectra of various components of the vx inversions using travel times averaged over 4 days. For reference, we plot the power spectrum of vxinv\hbox{$\vakern_x$} (thick grey solid line). We plot also the power spectrum of vxinv˜\hbox{$\vinv_x$} (solid line) and the power spectrum of the noise (i.e., the power spectrum of vxinv˜vxinv\hbox{$\vinv_x-\vakern_x$}; dashed line) for the inversion where the cross-talk is minimised (black) and ignored (grey).

In the text
thumbnail Fig. 8

Horizontal spatial cross-correlation between vx and vz from the simulation in the top 1 Mm. These flow correlations together with a non-vanishing cross-talk averaging kernel 𝒦xz\hbox{$\cK_x^z$} may lead to a systematic bias of the inferred vz. The circle of radius of 15 Mm shows the width of the desired target function.

In the text
thumbnail Fig. 9

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for vz inversion using travel times averaged over 4 days. Compare with Fig. 4.

In the text
thumbnail Fig. 10

All components of the averaging kernel for vz inversion at 1 Mm depth with a FWHM of sz = 1.1 Mm and sh = 15 Mm. Bottom row: with cross-talk minimised, top row: cross-talk is ignored. The cross-talk is presented in the form of 𝒦xz\hbox{$\cK_x^z$} and 𝒦yz\hbox{$\cK_y^z$} averaging kernel components. Random error of the results is 3 m s-1 when assuming data averaged over 4 days. Over-plotted contours, which are also marked on the colour bar for reference, denote the following: half-maximum of the kernel (white), half-maximum of the target function (black), and by grey lines  ± 5% of the maximum value of the kernel (solid and dotted, respectively).

In the text
thumbnail Fig. 11

Comparison of inverted vz with input data, compare with Fig. 5.  ⟨ vz ⟩  = 11 m s-1.

In the text
thumbnail Fig. 12

All components of the vz inversion at 1 Mm depth. Top row with the cross-talk ignored, bottom row with the cross-talk minimised. We demonstrate that if cross-talk is not addressed, horizontal components will leak into the inverted vz and cause a bias.

In the text
thumbnail Fig. 13

Inversion biases for vz at 1 Mm depth, similar to Fig. 6. While the black line (with minimised cross-talk) almost coincides with the dashed line of slope unity, the grey one (cross-talk ignored) indicates a horrible bias. The predicted error is 3 m s-1. The inversion is not sensitive to the horizontally averaged horizontal flow ⟨vz⟩ = 11 m s-1, therefore it was subtracted from vztgt\hbox{$\vtarg_z$} in this comparison.

In the text
thumbnail Fig. 14

The azimuthally-averaged power spectra of various components of the vz inversions using travel times averaged over 4 days. Compare with Fig. 7 for vx inversions. Notice that in the case of vzinv˜\hbox{$\vinv_z$} containing the cross-talk at 1 Mm depth, the excess in power around kR ~ 150 is not caused by the random noise, but by the bias coming from the cross-talk (not plotted separately). In the other two cases the random noise is a main cause why these inversions are not possible.

In the text
thumbnail Fig. 15

Comparison at three depths of statistical averages of velocity-vector over many flow realisations. The horizontal flow is displayed by arrows while the vertical flow is colour-coded. The reference arrow indicates 100 m s-1, the random errors of the inversions are given in Table 2. Compare to Figs. 5 and 11.

In the text
thumbnail Fig. 16

Slices through the x and z components of the simulated flow v at depths 1 Mm, 3.5 Mm, and 5.5 Mm.

In the text
thumbnail Fig. 17

The point-to-annulus sensitivity kernels for three flow components computed for the f-mode, distance 28 Mm and outward-inward geometry. The white circle represents the location of the averaging annulus.

In the text
thumbnail Fig. 18

Horizontal averages of sensitivity kernels may serve useful when estimating which depths are easier to target. The trend for each mode/ridge was obtained by taking Kxa(r;z)d2r\hbox{$\int K_x^a(\br;z) \id^2\br$} and averaging over all as within the given mode.

In the text
thumbnail Fig. 19

An example noise-covariance matrix for f-mode travel times averaged over 6 h. In this plot, a stands for the combination of the f-mode, oi geometry, and annulus radius of 7.3 Mm, b stands for the combination of f-mode, we geometry, and annulus radius of 8.8 Mm.

In the text
thumbnail Fig. 20

All components of the averaging kernel for vx inversion at 1 Mm depth with a FWHM of sz = 1.1 Mm and sh = 15 Mm. Bottom row: with cross-talk minimised, top row: cross-talk is ignored. Random error of the results is 14 m s-1 when assuming data averaged over 4 days. Over-plotted contours, which are also marked on the colour bar for reference, denote the following: half-maximum of the kernel (white), half-maximum of the target function (red), and  ± 5% of the maximum value of the kernel (blue and green, respectively).

In the text
thumbnail Fig. 21

All components of the averaging kernel for vx inversion at 3.5 Mm depth with a FWHM of sz = 2.2 Mm and sh = 15 Mm. Random error of the results is 20 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

In the text
thumbnail Fig. 22

All components of the averaging kernel for vx inversion at 5.5 Mm depth with a FWHM of sz = 3.5 Mm and sh = 15 Mm. Random error of the results is 28 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

In the text
thumbnail Fig. 23

The contributions of particular modes to the horizontally averaged averaging kernel for vx inversions using travel times averaged over 4 days for depths 1 and 3.5 Mm. We do not display the inversion for the depth of 5.5 Mm, because it is heavily dominated by noise.

In the text
thumbnail Fig. 24

To solve the peculiarity of the vz inversion, we introduced two formalisms in Sect. 5.1. Here we plot performance of those. In black, the magnified section a y = 0 and z = z0 of different target functions are displayed, one with removed mean (1) and one constructed with negative side-lobes (2). The resulting averaging kernels are also plotted. It is evident that the resulting averaging kernels are qualitatively very similar even when different formalisms were used to compute them.

In the text
thumbnail Fig. 25

All components of the averaging kernel for vz inversion at 3.5 Mm depth with a FWHM of sz = 2.2 Mm and sh = 15 Mm. Random error of the results is 13 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

In the text
thumbnail Fig. 26

All components of the averaging kernel for vz inversion at 5.5 Mm depth with a FWHM of sz = 3.5 Mm and sh = 15 Mm. Random error of the results is 133 m s-1 when assuming data averaged over 4 days. For details see Fig. 20.

In the text
thumbnail Fig. 27

The cut through the x = y = 0 point of the averaging kernel (solid) and the respective target function (dashed) for the vx (left) and vz (right) inversions using averaging over many flow realisations plotted along with the corresponding target functions at three discussed depths (1 Mm in blue, 3.5 Mm in green, and 5.5 Mm in red). Compare to Figs. 4 and 9 where the resemblance of the target functions is worse. The random error of the results is given in Table 2.

In the text
thumbnail Fig. 28

The azimuthally-averaged power spectra of the vz inversion components at depths of 1, 3.5 and 5.5 Mm for averaging over many flow representations. For reference, we plot the power spectrum of vzinv\hbox{$\vakern_z$} using the black solid line. Then we plot the power spectrum of vzinv˜\hbox{$\vinv_z$} (solid line) and power spectrum of the noise (i.e., the power spectrum of vzinv˜vzinv\hbox{$\vinv_z-\vakern_z$}; dashed line) for the inversion where the cross-talk is minimised (blue) and ignored (red). Compare to Fig. 14.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.