Statistical deprojection of galaxy pairs
^{1}
LUTH, UMR CNRS 8102, Paris Observatory,
92195
Meudon Cedex, France
email: laurent.nottale@obspm.fr
^{2}
GEPI, UMR CNRS 8111, Paris Observatory,
92195
Meudon Cedex, France
email: pierre.chamaraux@obspm.fr
Received:
25
January
2018
Accepted:
16
February
2018
Aims. The purpose of the present paper is to provide methods of statistical analysis of the physical properties of galaxy pairs. We perform this study to apply it later to catalogs of isolated pairs of galaxies, especially two new catalogs we recently constructed that contain ≈1000 and ≈13 000 pairs, respectively. We are particularly interested by the dynamics of those pairs, including the determination of their masses.
Methods. We could not compute the dynamical parameters directly since the necessary data are incomplete. Indeed, we only have at our disposal one component of the intervelocity between the members, namely along the line of sight, and two components of their interdistance, i.e., the projection on the skyplane. Moreover, we know only one point of each galaxy orbit. Hence we need statistical methods to find the probability distribution of 3D interdistances and 3D intervelocities from their projections; we designed those methods under the term deprojection.
Results. We proceed in two steps to determine and use the deprojection methods. First we derive the probability distributions expected for the various relevant projected quantities, namely intervelocity v_{z}, interdistance r_{p}, their ratio, and the product , which is involved in mass determination. In a second step, we propose various methods of deprojection of those parameters based on the previous analysis. We start from a histogram of the projected data and we apply inversion formulae to obtain the deprojected distributions; lastly, we test the methods by numerical simulations, which also allow us to determine the uncertainties involved.
Key words: methods: data analysis / methods: statistical / catalogs / galaxies: groups: general
© ESO 2018
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0;), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
With the aim of using an improved method to study the dynamics of galaxy pairs, we constructed two new pair catalogs using welldefined criteria and improved observational data. The first is a catalog of ≈1000 pairs with high accuracy radial velocities (Chamaraux & Nottale 2016) extracted from Nilson’s Uppsala Galaxy Catalog (UGC; Nilson 1973), which has the advantage that it is complete in apparent diameter. The second is a catalog of ≈13 000 pairs (Nottale & Chamaraux 2018, the largest presently available to our knowledge), which makes use of the huge recent increase in astronomical data. We used the HyperLEDA database (2016, Makarov et al. 2014) to identify galaxy pairs and extract their parameters from the large surveys (e.g., SDSS; Alam et al. 2015 and 2MASS).
To improve our understanding of physics at extragalactic scales, the goal of this paper is to elaborate better methods of statistical analysis of galaxy pair data (which can also be applied to any type of pairs of astronomical objects). This is the goal of the present paper. We need to obtain statistical information on the physical characteristics of these pairs, particularly their masses through Kepler’s third law and the possible existence of anomalous dynamics in these pairs. Such a new dynamics is, according to various proposals, attributed to missing mass or dark matter (Bergstrom 2000), to modification of gravity (Sanders 2002), or to a new dark potential (Nottale 2011; Chavanis 2017a,b).
In order to compute those physical quantities, we have to know the 3D velocity difference between the pair members and their 3D interdistances. But we have at our disposal only one component of the velocity difference (along the line of sight) and two components of the interdistance (projection on the skyplane). Therefore one has to find statistical methods to obtain those 3D quantities from the projected quantities. We call these methods statistical deprojection. Moreover, we note that for each galaxy pair we can determine the various 3D parameters for only one point of each galaxy orbit and only one instant.
Because of these limitations, various methods of analysis have been devised (Chengalur et al. 1996; Peterson 1979; Faber & Gallagher 1979). However, as shown by Faber & Gallagher (1979), these methods remain unsatisfactory from a mathematical viewpoint. In particular, recovering the mass remains very uncertain. For these reasons, we propose new methods of deprojection, which we show here to yield more precise results for recovering the 3D unprojected parameters.
The paper is organized as follows. In Sect. 2 we first analyze the statistical projection process and derive the probability distributions expected for the various relevant projected quantities, namely, intervelocity, v_{z}; interdistance, r_{p}; their ratio, r_{p} ∕v_{z}, which can be used as signature of circular orbits; and the product, , which is involved in mass determination.
In Sect. 3, we develop statistical methods of deprojection of these parameters, that is, intervelocity, interdistance, and mass, based on the previous analysis. We start from a histogram of the projected data organized in various ways, i.e., constant bins, moving bins, and variable moving bins. Then we apply inversion formulae, which amount in some cases to matrix inversion, to obtain the deprojected distributions. Finally we test these methods using numerical simulations, which also allow us to determine the uncertainties involved.
Fig. 1 Theoretical expectation vs. numerical simulation of the statistical distribution of v_{z} ∕v, where v_{z} is the radial velocity difference between galaxies in randomly oriented pairs and v the true velocity difference. The number of simulated pairs is here N ≈ 10 000. Apart from statistical fluctuations, the obtained probability distribution of the radial velocity is constant in the range [0, v], in agreement with the theoretical expectation (given by the constant red line). 

Open with DEXTER 
2 Statistical analysis
Let us use a cylindrical coordinate system whose axis z is oriented from the observer to the galaxy pair. In this case the (x, y) plane is the plane of the sky. The position vector r is defined as (1)
while the velocity vector v is defined as (2)
The measured quantities are the position vector projected on the plane of the sky, r_{p} = (x, y), and the radial velocity v_{z}. Then r_{p}, v_{z}, and φ are known while z, v_{p}, and φ_{v} are unknown variables.
2.1 Probability distribution of radial velocity
Consider a randomly oriented 3D vector (x, y, z) of fixed length r. Let us determine the probability distribution of any of its projections, say z. The part of the sphere of radius r which projects between z and z + dz has a surface (3)
Since z = rcosθ, then d z = −rsinθ dθ, and we finally find that dS∕dz = 2πr. The probability distribution of the projection on any axis of a randomly oriented 3D vector of length r can then be derived (4)
and it is therefore constant when r is constant. Since z can vary from − r to + r, we verify that this probability distribution is correctly normalized.
This can be easily applied to the velocity vector. Given a fixed pair configuration (r, v) with random orientation, the probability distribution of the values of the radial velocity differences between the pair members is (5)
We note that here no member of the pair is priviledged, such that we consider only positive differences, which then vary between 0 and v.
We performed a numerical simulation of this projection of a given pair with random orientation. The obtained distribution of v_{z} confirms this expectation (Fig. 1).
2.2 Probability distribution of projected distance
We consider a pair of objects, the 3D interdistance between which is r, fixed. Let be the distance between the pair members projected on the plane of the sky. The part of the sphere corresponding to projected distances lying in the interval [r_{p}, r_{p} + dr_{p}] is made of two rings of width rdθ and radius r_{p}. Therefore its surface isdS = 4πr_{p}rdθ. Now, since r_{p} = rsinθ, we find (6)
The differential probability distribution of r_{p} is d P = dS∕(4πr^{2}). Then the normalized probability density of r_{p} values p(r_{p}), projected from a given r value, is written as (7)
We performed a numerical simulation of such a projection. The obtained distribution is in fair agreement with this theoretical expectation (Fig. 2).
More generally, we consider now a set of pairs whose distances between their members are distributed with a probability P_{r} (r) when r lies in the interval [r_{1}, r_{2}]. When the pair orientation is random, the expected probability distribution of the projected distances on the plane of the sky is now given by (8)
2.3 Probability distribution of ratio ζ = r_{p}∕v_{z}
This ratio can be used as signature of circular orbits. We give in Appendix A the probability distribution of a product and of the ratio of two variables, of which the individual probability distributions are known, and also that of the inverse of a variable. From these basic formulae, one can infer the probability distribution functions (PDFs) of various combinations of the variables observed for pairs [v_{z} and (x, y), involving r_{p}].
Since with 0 < r_{p} < 1 (for r = 1) and p(v_{z})= 1 with 0 < v_{z} < 1 (for v = 1), we obtain for the ratio ζ = r_{p}∕v_{z} [more generally, ζ = (r_{p}∕r)∕(v_{z}∕v)] (9)
Two cases must now be considered. When ζ > 1, the integration interval is reduced to [0, 1∕ζ]. One finds (11)
When ζ < 1, the integration interval is again [0, 1] and one finds (12)
We performed a numerical simulation of randomly oriented pairs with uncorrelated randomly oriented velocity differences. As can be seen in Fig. 3, the theoretically expected distribution agrees very well with the simulation.
In the case of circular orbits, once r_{p} is given, the possible values of v_{z}, instead of being uniformly distributed between 0 and v, are constrained to be smaller than v × (r_{p}∕r). As a consequence, the left part of the PDF of ζ = (r_{p}∕r)∕(v_{z}∕v) below ζ = 1 is expected to be empty, which achieves a possible statistical signature of circular orbits.
The probability distribution of the reverse function, χ = v_{z}∕r_{p} is easy to derive from these expressions and from Eq. (A.3). We find
It is therefore constant up to χ = 1, after which it decreases quickly toward 0. In the case of circular orbits, only the constant part of the PDF remains (0 < χ < 1), which means that the projection of v_{z}∕r_{p} becomes similar to that of v_{z}.
Fig. 2 Numerical simulation of the probability density distribution of r_{p}∕r, where r_{p} is the projection on the plane of the sky of the distance r between members of randomly oriented pairs (for a fixed 3D r value). The number of simulated pairs is N ≈ 10 000. Apart from statistical fluctuations, the obtained probability distribution agrees with the theoretical expectation (continuous red line, see text). 

Open with DEXTER 
Fig. 3 Numerical simulation of the density distribution of the ratio r_{p}∕v_{z}, where r_{p} is the distance between pair members projected on the plane of the sky and v_{z} is their radial velocity difference (here with r = 1 and v = 1). The number of simulated pairs is N = 4000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red continuous line, Eqs. (11) and (12)). In the case of circular orbits, the velocity and radial vectors are orthogonal, which implies r_{p} ∕v_{z} ≥ r∕v (see text), i.e., r_{p}∕v_{z} ≥ 1 in this figure. That achieves a clear signature of circular orbits, for which the left part of the figure (r_{p} ∕v_{z} < 1) below the sharp peak of the r_{p}∕v_{z} PDF is expected to be empty. 

Open with DEXTER 
2.4 Probability distribution of product
This product is involved in mass determination. Indeed, the orbits of isolated galaxy pairs are subjected to Kepler’s third law, (15)
where M = M_{1} + M_{2}, a is the semimajor axis of the orbit and T the period. Defining a characteristic velocity V = 2πa∕T, it yields the total mass of the system (16)
The perimeter of an ellipse varies between 4a (radial free fall) and 2πa (circular orbit). This perimeter is given by elliptic integrals, which can be approximated by a power series (17)
where e is the eccentricity of the ellipse, and therefore the average velocity is such that .
When the orbit is circular, r = a and v = V, such that in this case the total mass is given by (18)
Instead of r we have only access to r_{p}, and, instead of v, to v_{z}. This leads us to look for the probability distribution of .
Let us set , then , while we recall that when the radius is normalized to r = 1.
In the case of uncorrelated values of r and v, the general formula for the probability distribution of a product Eq. (A.1) then yields (19)
The integration limits are determined by the fact that 0 < v_{z} < 1 (for a normalized velocity v = 1) and hence U < 1; U^{2} − η^{2} must be > 0 due to the square root such that U > η. This integral can be integrated in terms of the hypergeometric function _{2} F_{1} (a, b;c;z) as follows: (20)
This formula is in excellent agreement with the result of a numerical simulation of the projections of r to r_{p} and v to v_{z} (Fig. 4).
Fig. 4 Analytical formula (see text) vs. numerical simulation of the density distribution of the product , where r_{p} is the distance between pair members projected on the plane of the sky and v_{z} is their radial velocity difference. This combination of variable is essential for the statistical determination of the total mass of the pair. The product is plotted here for r = 1 and v = 1, i.e., the plotted variable is actually . The number of simulated pairs is N ≈ 10 000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red line, Eq. (20)). 

Open with DEXTER 
3 Methods of statistical deprojection
The knowledge of the expected statistical distribution of the various variables or of their combination allows one to construct methods of deprojection from the observed subset of variables.
3.1 Deprojection of x, y, and v_{z}
3.1.1 Theoretical deprojection of v_{z}
The simplest method of deprojection deals with the value of a vector projected on a single axis. This is the case in particular for v_{z} (radial velocity difference). The various methods described in this work are also valid for deprojection of the individual observed variables x and y (projections of r on the plane of the sky along right ascension and declination).
We have seen that, in this case, the expected probability distribution for a random orientation and a given value of the unprojected variable is constant. Therefore, if P_{v} (v) is the probability distribution of the 3D velocity v, the probability distribution of v_{z} is given by Nottale (2011) (21)
This distribution can be easily inverted. One obtains a deprojection formula, that is, (22)
and similar formulae for x and y separately; the deprojection of their combination in is considered in the following.
Fig. 5 Illustration of the deprojection method for radial velocity (or any 1D variable projected from a 3D vector). For a bin (V _{i−1}, V _{i}) of width δV, where V _{i} = i × δV, N_{i} is the number of values contained in this bin in the histogram of the projected quantity. The number of objects that have a deprojected value v = V _{i} is given by the area of the rectangle of height (N_{i} − N_{i+1}) and of extent [0, V _{i}]. 

Open with DEXTER 
3.1.2 General algorithm of deprojection of v_{z}
The simplest way to achieve statistical deprojection of a single variable, such as the radial velocity v_{z}, consists of directly implementing Eq. (22). This formula means that if N_{v} pairs have a true velocity difference v, their radial velocities are uniformly distributed between 0 and v. The existence of these N_{v} objects at velocity v creates a jump δN = N_{v} in the observed distribution of v_{z}, N(v_{z}) = N_{tot}P(v_{z}). Therefore, the contribution of a particular true intervelocity v to the observed PDF of v_{z} is given by the surface of the rectangle of sides v × δN (see Fig. 5).
3.1.3 Various methods of deprojection of v_{z}
Differences on adjacent constant bins. The simplest way to implement Eq. (22) consists of
 (1)
constructing the histogram of radial (projected) velocities V _{r} in bins [V _{i−1}, V _{i}] of given width δV;
 (2)
computing the differences between the numbers in successive bins;
 (3)
multiplying by the rank i = V _{i}∕δV of the bin.
We obtainedin this way the deprojected numbers, N = V (δN^{p}∕δV), i.e., in terms of the elements of the histogram, . This numbershould be assigned to a bin of width δV and of mean deprojected velocity . Indeed, the projected velocity V _{i} = i δV is the limit between the bins of ranks i and i + 1 in the projected histogram, and the resulting N_{i} should be assigned to this velocity V _{i} in the mean, i.e., to a bin centered on this velocity and therefore shifted by δV∕2 with respect to the initial histogram.
We performed a numerical simulation in which we calculated the mean and standard deviations of 100 realizations of the deprojection (see Fig. 6). We recovered in these simulations the overall shape of the initial distribution (chosen to be a Gaussian of mean 190 km s^{−1} and standard deviation 90 km s^{−1}). In particular the existence and position of the main peak is recovered in a satisfactory way. We can see in Fig. 6 the improvement obtained when going from a catalog of ≈ 1000 pairs to ≈ 10 000, regarding the decrease of both the optimal bin width and dispersion of the various PDFs. A given realization of a catalog deprojection is expected to be contained (with a high probability) between the two ± 1σ lines, as supported by an example given in Fig. 7 (left part).
However, Eq. (22) presupposes a strictly monotonic decreasing distribution. Otherwise it may lead to obtainingnegative numbers. The problem is that there are fluctuations that may locally break this expected monotony.
For large enough bin sizes, it is clear that the monotony is preserved. Therefore, a possible way to solve this difficulty without losing too much resolution in the deprojection is to choose the smallest of these large enough bins. This value depends on the total number of pairs. In our numerical simulations, the optimal bin width was found to be ≈ 30 km s^{−1} for ≈1000 pairs and ≈ 20 km s^{−1} for ≈10 000 pairs.
Another possible method consists of correcting the negative numbers by substracting these numbers to the adjacent bins. This method is possible only for small deviations and reveals to be equivalent to the previous method.
Differences between constant intervals separated by two bins. Actually, the previous method where the difference is taken between two adjacent bins is not optimized and it can therefore be improved. It is more efficient (as in finite difference methods) to take differences between two intervals separated by one bin, . This improvement is based on thefact that f(x + dx) − f(x) = f′(x)dx + O(dx^{2}), while (f(x + dx) − f(x −dx))∕2 = f′(x)dx + O(dx^{3}).
We checked the method by a numerical simulation (see right Fig. 7). We find that the ± 1σ standard deviation on the PDF is half that obtained with the previous method (differences ), thus achieving a significant improvement.
Differences on constant moving bins. The method that uses constant bins has drawbacks: the result isdigitalized too much, has low resolution, and is too dependent on limits between bins.
Then we devised another method that deals with more information about the original radial velocity PDF and is more continuous. It amounts to performing a histogram of the projected velocities on a large enough moving bin w_{bin} shifted by a low value, e.g., δv = 1 km s^{−1}. We obtain a function that is both monotonous and quasicontinuous. An example of such a function is given in Fig. 8: it is obtained from the projection of ≈ 12 000 velocities having a Gaussian distribution of mean 150 km s^{−1} and standard deviation 50 km s^{−1}.
Then we applied on this function the deprojection formula (Eq. (22)), but with thesmall shift dv_{r} = δv. Then we performed a moving average of the resulting distribution using the original bin width w_{bin}.
Numerical simulations were achieved to validate the method. We first defined a given PDF of the 3D velocity v, performed a random realization of this PDF on N_{v} values v_{i}, randomly projected them to v_{ri}, and then we deprojected the distribution obtained using the moving bins method. The process was repeated N_{s} times to obtain the convenient statistics of the method.
We give in Fig. 9 the results obtained for an initial v distribution showing one unique peak described by a Gaussian for two different peak widths. The quality of the result depends on the width ofthe initial peak: for a large enough peak (left Fig. 9 and Fig. 10), the position, amplitude, and width of the peak are correctly recovered. When the peak is narrower (right Fig. 9), its position is correctly recovered by the deprojection, but its amplitude is slightly too low and correspondingly its width becomes too large (but only within one sigma).
Differences on varying moving bins. In order to account for this bias, we devised another more accurate method. The problem is that, when there is a peak in the PDF of the 3D true velocity v, it manifestsas a high value of the slope in the PDF of the radial (projected) velocity v_{r}. If the binwidth is too large, this slope is decreased by the smoothing out effect of the binning. As a consequence, the amplitude of the deprojected peak is too low. Thus we corrected this bias by using a moving bin of variable width, decreasing when the slope increases. A balance should be found for the variation of the binwidth, since a smaller bin also increases the fluctuations and therefore the final dispersion. This method gives very good results, since it allows us to recover multiple peaks present in the initial 3D distribution, as can be seen in Fig. 11.
Fig. 6 Numerical simulation of the intervelocity deprojection of a sample of 1200 pairs (left figure) and 13 000 pairs (right figure). The original 3D velocities have a Gaussian distribution of standard deviation σ_{v} = 90 km s^{−1} and peak velocity 190 km s^{−1} (cutoff on V = 0) [red curve]. We randomly project a random realization of this Gaussian distribution, then deproject it using the constant bin method with a bin of width 30 km s^{−1} (left figure) and 20 km s^{−1} (right figure, see text). The blue histogram is the mean result of 100 realizations of such a projection/deprojection and the black lines are the ± 1σ lines. 

Open with DEXTER 
Fig. 7 Various realizations of the numerical simulation of the intervelocity deprojection (sample of 13 000 pairs). Left figure: an example of a single realization. The original 3D velocities have a Gaussian distribution of standard deviation σ_{v} = 90 km s^{−1} and peak velocity 190 km s^{−1} (cutoff on V = 0) [red curve]. We randomly projected a random realization of this Gaussian distribution, then deprojected it using the constant bin method with a bin width 20 km s^{−1} (see text). The blue broken line is the obtained deprojection, compared with the ± 1σ lines estimated from 100 realizations (see Fig. 6). Right figure: various realizations using differences on nonadjacent bins (), for the same original velocity PDF. We randomly projected random realizations of this Gaussian distribution, then deprojected it using the constant bin method (blue histograms) with a bin width 20 km s^{−1} (see text). The dashed orange lines show the ± 1σ lines estimated from 100 realizations. The quality of the deprojection is improved by a factor of ≈ 2 with respect to the adjacent bin method. 

Open with DEXTER 
Fig. 8 Histogram of the projected velocities (from an initial Gaussian distribution of mean 150 km s^{−1} and standard deviation 50 km s^{−1}) on a moving bin w_{bin} shifted by differences between bin positions δv = 1 km s^{−1}. 

Open with DEXTER 
3.2 Deprojection of r_{p}
3.2.1 Theoretical deprojection
Although one can obtain the probability distribution of r from x and y separately, more information is contained in their combination , from which a better deprojection is expected to be achieved. This expectation is confirmed by the distribution of r_{p} for a given value of r, which shows a strong peak at r_{p} = r (Fig. 2). Assuming a probability distribution P_{r}(r) of the unprojected variable r, one expects for r_{p} a probability distribution, i.e., (23)
For example, if the unprojected probability distribution is constant between 0 and r_{0}, the projected distribution is given by (see Fig. 12) (24)
It does not seem possible to invert this formula analytically. However, it is very possible to construct an algorithm to perform this inversion numerically.
3.2.2 Deprojection method for r_{p}
Let i_{max} be the total number of bins in the histogram of r_{p} values. We denote the observed number of projected values r_{p} in the bin of rank i. This number is the sum of contributions of 3D values r ≥ r_{p} from bins of rank j ≥ i – we assume a same number of bins for the r and r_{p} histograms (see Fig. 12).
Let us derive the probability law for the various contributions. The probability distribution of r_{p} for a given r value has been shown (Sect. 2.2) to be (25)
for x = r_{p}∕r. We assume, as an approximation valid for a small enough bin width (i.e., a large enough number of bins), that the probability distribution of r remains constant in any bin. Then the probability density of given r values projected in the bin of rank i and relative width b is written as (26)
Concerning the r values pertaining to the bin of rank j, the normalized x values (i.e., r_{p} ≤ r) belong to only a total number of j bins, such that the relative bin width (normalized to a total range x = 0 to 1) is b = 1∕j.
Let us now show that the probability law of the contributions of the various r values (given by index j) can be recovered from that of the projected values r_{p} (given by index i).
Indeed, let us introduce the matrix (27)
for i≤ j. The other elements of this matrix (i > j) are zero. Let P_{ij} be the transpose of this matrix, i.e., P = π^{T}.
If the column vector represents the various projected numbers in the bin of rank i, they are obtained from the initial 3D column vector by the matrix product (28)
Therefore the initial unprojected probability distribution is recovered from a mere matrix inversion, i.e., (29)
For example, for three bins the relation between the probability distributions is written as
The deprojected reverse relation is, in this case,
This method is illustrated in Fig. 13, where we deproject a projected distribution N_{p} = arccosr_{p}, which is the expected function for a constant initial distribution N_{r} = cst (in the null bin width limit). The deprojected distribution is constant as expected, except for a small bias mainly involving the first and last bins (which is therefore easy to correct and disappears when the number of bins is increased). Another example is given in the numerical simulation of Fig. 14, where the initial 3D distribution is randomly drawn from a Gaussian probability density (10 000 points). This distribution is nicely recovered from the matrix inversion method. In this case the bias involving the extreme points is unobservable, since these values are almost vanishing.
Fig. 9 Deprojections by constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution (N_{v} ≈ 1200 points, N_{s} = 25 realizations). We show the mean deprojected distribution (black curve) and the ± 1σ limits (green curves) compared with the initial distribution (red curve). In the left figure, the peak position is 150 km s^{−1} and its standard deviation 50 km s^{−1}. The deprojection shows no bias with such a velocity peak width. In the right figure, the peak position is 150 km s^{−1} and the standard deviation 20 km s^{−1}. The deprojection shows a bias with such a narrow velocity peak width. The deprojected velocity peak is too low by ≈ 12% and too wide by ≈20%, although the peak position is precisely recovered to within ≈1%. 

Open with DEXTER 
Fig. 10 Deprojections by the constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution with peak position 150 km s^{−1} and standard deviation 50 km s^{−1} (N_{v} ≈ 12 000 points, N_{s} = 25 realizations). We show the mean deprojected distribution (black curve) and the ± 1σ limits (greencurves) compared with the initial distribution (red curve). The quality of the deprojection is clearly improved compared to the 1200 points case, in particular concerning the peak amplitude (Fig. 9). 

Open with DEXTER 
Fig. 11 Deprojection by a varying moving bin. The bin width varies between 20 and 40 km s^{−1} dependingon the slope. The obtained distribution is finally smoothed out by a bin width 20 km s^{−1}. We performed several realizations of a sample projected from an initial twoGaussian peaks distribution with respective peak positions 70 and 150 km s^{−1} and standard deviations 15 and 20 km s^{−1} (N_{v} ≈ 12 000 points, N_{s} = 25 realizations). We show the initial distribution as a red continuous curve. Despite the narrowness of the peaks, the quality of the deprojection is very good since the two peaks and the intermediate hollow are clearly identified at their true positions. 

Open with DEXTER 
3.3 Deprojection of mass
3.3.1 Theoretical deprojection
The total mass M = M_{1} + M_{2} of a pair of objects in relative Keplerian motion is given by (32)
where G is Newton’s constant of gravitation, a is the semimajor axis of the orbit of one object around the other, and T is the periodand V = 2πa∕T.
However, there are two drawbacks that deteriorate the available information on mass, since we have no direct access to a and V: first, we deal only with instantaneous interdistance r and velocity v; and second these 3D values are themselves projected to r_{p} and v_{z}, i.e., . The first drawback vanishes for circular orbits, but it can lead to some additional uncertainty in the elliptical case, since the relation between M and rv^{2} is written as (33)
where e is the eccentricity and ξ the parameter of the orbit, which is such that r = a(1 − ecosξ) and t = (T∕2π)(ξ − esinξ). The value of rv^{2}∕GM fluctuates between 1 − e and 1 + e while its time average varies from 1 to 0.5 when e varies from 0 to 1, respectively; it can be approximated by the relation ⟨rv^{2} ⟩∕GM = cos^{2}(πe∕4) up to some percents.
The mass distribution should therefore be statistically deprojected from the observed products . We established in Sect. 2.4 the expected PDF of the projection of a given value of rv^{2} (see Eq. (20) and Fig. 4).
From this formula, one can theoretically deproject any distribution of . Its analytical integration does not seem to be possible, but, as in the case of the deprojection of r_{p}, it is very possible to construct an algorithm to perform this inversion numerically. However, this deprojection is expected to be difficult, since the individual projection function vanishes for the original unprojected value rv^{2} (see Fig. 4), while it is constant for v_{z} (easier deprojection) and shows a divergent peak for r_{p} (best deprojection).
Fig. 12 Illustration of the method of deprojection of r_{p}, for 10 bins and an initial distribution P_{r}(r) = constant between 0 and r_{0}. We show how the initial unprojected number density in each given bin (this initial density is chosen to be constant = 1) is distributed among the original bin and those at smaller distances (areas contained between the broken lines). The red dashed line is the expected distribution [arccos(r_{p})] in the limit of vanishing bin width (infinite number of bins). Conversely, one can recover the initial unprojected distribution from the projecteddistribution by inverting such a decomposition (see text). There is just a small bias on the first and last bins. 

Open with DEXTER 
Fig. 13 Example of statistical deprojection of r_{p} for a constantPDF of r. We start froma projected distribution N_{p} = arccos(r_{p}) (inclined blue points) for 20 bins (see text). The deprojected probability distribution is shown as almost horizontal magenta points. 

Open with DEXTER 
Fig. 14 Example of deprojection of r_{p} for a projected distribution N_{p} obtained from a 3D (unprojected) initial Gaussian distribution N_{r} (black histogram, total 10 000 points) for 10 bins. The deprojected probability distribution (magenta points) is in good agreement with the initial probability distribution. 

Open with DEXTER 
Fig. 15 Example of deprojection (points) of a projected distribution (decreasing blue curve) obtained from a 3D (unprojected) initial distribution r v^{2} (red curve), for 10 bins. The initial distribution is built from a Gaussian velocity distribution of mean 200 km s^{−1} and dispersion 50 km s^{−1} and a Gaussian distance distribution of mean 0.3 Mpc and dispersion 0.05 Mpc. 

Open with DEXTER 
3.3.2 Deprojection method for
As in the deprojection method for r_{p}, let us introduce a projection matrix constructed from the projection function p_{η} (η) given in Eq. (20) and Fig. 4. We divide the η range into N bins. The projection matrix is written as (for its nonnull coefficients) (34)
where j = 1 to N and i = 1 to N + 1 − j (and A_{ji} = 0 for the remaining coefficients).
Then the deprojection matrix is obtained from this matrix by the following transformation: (35)
For example, for N = 3, the projection matrix is written as (36)
and the resulting deprojection matrix is written as (37)
We tested the method by numerical simulations in which we built a probability distribution for v and r (e.g., Gaussian distributions, see Fig. 15), then we projected these to v_{z} and r_{p} and computed the resulting distribution.Whatever the initial 3D distribution, the projected distribution is strongly decreasing, as expected from the shape of the projection function p_{η}(η) (see Fig. 4). This property is the reason why it is so difficult to estimate the pair mass (Faber & Gallagher 1979). However, despite this problem, our new method allows us to recover the 3D PDF of rv^{2} with a fair accuracy (see Figs. 15 and 16).
This yields the total mass M of the system for circular orbits, but in the general case, where r and v are instantaneous values on elliptical orbits, there is, as previously specified, an additional uncertainty to go from the rv^{2} distribution to the M distribution, which may be estimated from an evaluation of the eccentricity distribution. This will be studied in a forthcoming work.
Fig. 16 Simulation of 100 random projections of an initial distribution (given by the red curve, see Fig. 15), followed by their respective deprojections (individual points) using a 10 bin deprojection matrix (see text). This simulation allows us to establish the uncertainty of the deprojection method (dashed curves = ± 1σ). 

Open with DEXTER 
4 Conclusions
In this paper, we developed statistical methods of deprojection of the main physical parameters of pairs of astronomical objects (in particular, galaxy pairs). These methods are needed since, in the extragalactic domain, only one component of the intervelocityand two components of the interdistance are available on only one point of the orbit instead of the six (x_{k}, v_{k}) coordinates on the full orbit.
We analytically determined the various PDFs of the projected variables: radial velocity v_{z}, projected interdistance on the skyplane r_{p}, their ratio r_{p} ∕v_{z}, which can be used as signature of circular orbits, and , which intervenes in the calculation of the pair mass. These analytical solutions were validated by numerical simulations.
Then we described in detail the deprojection methods obtained by inversion of these projection functions. These methods were conceived in a digitalized way to deal with the real data that will be available in the form of histograms.
In this paper, we prepared the application to effective extragalactic galaxy pair data, and in particular to the two galaxy pair catalogs that we recently constructed, one of which contains 13 000 pairs (Nottale & Chamaraux 2018). These numerical simulations support the validity of our deprojection methods, and they also allow us to determine their error bars.
In a forthcoming work, we will apply these deprojection methods to the study of the physics of galaxy pairs, in particular to their dynamics. Such a study will then benefit from twin improvements to catalogs (membership criteria, data quality, and size) and to methods aiming at statistically recovering missing information.
Appendix A Various probability distributions
A.1 Probability distribution of a product of two variables
Consider two random variables x and y whose probability distribution p(x, y) is known. The probability distribution of the product η = xy is given by Rohatgi (1976) and Glen et al. (2004) (A.1)
It becomes in the uncorrelated case, where we have p_{xy} (x, y) = p_{x}(x) p_{y}(y), (A.2)
A.2 Probability distribution of the inverse of a variable
Let x be a random variable of probability distribution p_{x} (x). The probability distribution of its inverse X = 1∕x is written as (A.3)
A.3 Probability distribution of the ratio of two variables
From these two relations we easily derive the probability distribution of the ratio of two random variables, ζ = y∕x. We find (A.4)
Appendix B Circular orbits
We give here a criterion for identifying circular orbits (and more generally, all configurations when the vectors v and are perpendicular). We consider the scalar product of the position and velocity vectors, i.e., (B.1)
where ϕ is the angle between the two vectors. This relation is written in cylindrical coordinates as (B.2)
where ϕ_{p} is the angle between the projected vectors v_{p} and r_{p} on the plane of the sky. Since and , the ratio of the two observables v_{z} and r_{p} is written as (B.3)
When the velocity vector v is perpendicular to the position vector r, we have cosϕ = 0 and therefore this formula takes the simplified form (B.4)
The statistical distribution of the ratio v_{z}∕r_{p} is therefore expected to be very different between the orthogonal case and the general case, providing us with a statistical signature for circular orbits.
Let us establish the theoretical expectation of this distribution.
In the case when r and v are perpendicular, which corresponds to circular orbits and to special positions on elliptic orbits, the distributions of v_{z} and r_{p} become highly correlated. In particular, let us show that, in this case, v_{z} ∕v ≤ r_{p}∕r.
Let us set α = (xv_{y})∕(yv_{x}), we have , which may be written as α + 1∕α ≥ 2, i.e., (B.5)
The orthogonality of r and v writes xv_{x} + yv_{y} + zv_{z} = 0, such that (B.6)
Accounting for this relation, the inequality (B.5) becomes (B.7)
i.e., v_{z} r ≤ r_{p} v, QED.
This means that for given r, v and r_{p}, the possible values of v_{z} are no more uniform between 0 and v, but are limited to v_{z} ≤ v r_{p}∕r.
References
 Alam, S., Albareti, F. D., Allende Prieto, C., et al. 2015, ApJS, 219, 12 [NASA ADS] [CrossRef] [Google Scholar]
 Bergstrom, L. 2000, Rep. Prog. Phys, 63, 793 [NASA ADS] [CrossRef] [Google Scholar]
 Chamaraux, P., & Nottale, L. 2016, VizieR Online Data Catalog: J/other/AstBu/71.270 [Google Scholar]
 Chavanis, P. H. 2017a, Eur. Phys. J. Plus, 132, 286 [CrossRef] [Google Scholar]
 Chavanis, P. H. 2017b, ArXiv eprints [arXiv:1706.05900] [Google Scholar]
 Chengalur, J. N., Salpeter, E.E., & Terzian, Y. 1996, ApJ, 461, 546 [NASA ADS] [CrossRef] [Google Scholar]
 Faber, S. M., & Gallagher, J. S. 1979, ARA&A, 17, 135 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
 Glen, A. G., Leemis, L. M., & Drew, J. H. 2004, Comput. Stat. Data Anal., 44, 451 [CrossRef] [Google Scholar]
 HyperLEDA database 2016, http://leda.univlyon1.fr [Google Scholar]
 Makarov, D., Prugniel, P., Terekhova, N., Courtois, H., & Vauglin, I. 2014, A&A, 570, A12 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Nilson, P. N. 1973, Uppsala General Catalogue of Galaxies (UGC), Uppsala Obs. Ann., 6 [Google Scholar]
 Nottale, L., 2011, Scale Relativity and Fractal SpaceTime: A New Approach to Unifying Relativity and Quantum Mechanics (London: Imperial College Press), 764 [Google Scholar]
 Nottale, L., & Chamaraux, P. 2018, Astrophys. Bull., submitted [arXiv:1706.06482] [Google Scholar]
 Peterson, S. D. 1979, ApJS, 40, 527 [NASA ADS] [CrossRef] [Google Scholar]
 Rohatgi, V. K. 1976, An Introduction to Probability Theory, Mathematical Statistics (New York: Wiley) [Google Scholar]
 Sanders, R. H., & McGaugh, S. S. 2002, ARA&A, 40, 263 [NASA ADS] [CrossRef] [Google Scholar]
All Figures
Fig. 1 Theoretical expectation vs. numerical simulation of the statistical distribution of v_{z} ∕v, where v_{z} is the radial velocity difference between galaxies in randomly oriented pairs and v the true velocity difference. The number of simulated pairs is here N ≈ 10 000. Apart from statistical fluctuations, the obtained probability distribution of the radial velocity is constant in the range [0, v], in agreement with the theoretical expectation (given by the constant red line). 

Open with DEXTER  
In the text 
Fig. 2 Numerical simulation of the probability density distribution of r_{p}∕r, where r_{p} is the projection on the plane of the sky of the distance r between members of randomly oriented pairs (for a fixed 3D r value). The number of simulated pairs is N ≈ 10 000. Apart from statistical fluctuations, the obtained probability distribution agrees with the theoretical expectation (continuous red line, see text). 

Open with DEXTER  
In the text 
Fig. 3 Numerical simulation of the density distribution of the ratio r_{p}∕v_{z}, where r_{p} is the distance between pair members projected on the plane of the sky and v_{z} is their radial velocity difference (here with r = 1 and v = 1). The number of simulated pairs is N = 4000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red continuous line, Eqs. (11) and (12)). In the case of circular orbits, the velocity and radial vectors are orthogonal, which implies r_{p} ∕v_{z} ≥ r∕v (see text), i.e., r_{p}∕v_{z} ≥ 1 in this figure. That achieves a clear signature of circular orbits, for which the left part of the figure (r_{p} ∕v_{z} < 1) below the sharp peak of the r_{p}∕v_{z} PDF is expected to be empty. 

Open with DEXTER  
In the text 
Fig. 4 Analytical formula (see text) vs. numerical simulation of the density distribution of the product , where r_{p} is the distance between pair members projected on the plane of the sky and v_{z} is their radial velocity difference. This combination of variable is essential for the statistical determination of the total mass of the pair. The product is plotted here for r = 1 and v = 1, i.e., the plotted variable is actually . The number of simulated pairs is N ≈ 10 000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red line, Eq. (20)). 

Open with DEXTER  
In the text 
Fig. 5 Illustration of the deprojection method for radial velocity (or any 1D variable projected from a 3D vector). For a bin (V _{i−1}, V _{i}) of width δV, where V _{i} = i × δV, N_{i} is the number of values contained in this bin in the histogram of the projected quantity. The number of objects that have a deprojected value v = V _{i} is given by the area of the rectangle of height (N_{i} − N_{i+1}) and of extent [0, V _{i}]. 

Open with DEXTER  
In the text 
Fig. 6 Numerical simulation of the intervelocity deprojection of a sample of 1200 pairs (left figure) and 13 000 pairs (right figure). The original 3D velocities have a Gaussian distribution of standard deviation σ_{v} = 90 km s^{−1} and peak velocity 190 km s^{−1} (cutoff on V = 0) [red curve]. We randomly project a random realization of this Gaussian distribution, then deproject it using the constant bin method with a bin of width 30 km s^{−1} (left figure) and 20 km s^{−1} (right figure, see text). The blue histogram is the mean result of 100 realizations of such a projection/deprojection and the black lines are the ± 1σ lines. 

Open with DEXTER  
In the text 
Fig. 7 Various realizations of the numerical simulation of the intervelocity deprojection (sample of 13 000 pairs). Left figure: an example of a single realization. The original 3D velocities have a Gaussian distribution of standard deviation σ_{v} = 90 km s^{−1} and peak velocity 190 km s^{−1} (cutoff on V = 0) [red curve]. We randomly projected a random realization of this Gaussian distribution, then deprojected it using the constant bin method with a bin width 20 km s^{−1} (see text). The blue broken line is the obtained deprojection, compared with the ± 1σ lines estimated from 100 realizations (see Fig. 6). Right figure: various realizations using differences on nonadjacent bins (), for the same original velocity PDF. We randomly projected random realizations of this Gaussian distribution, then deprojected it using the constant bin method (blue histograms) with a bin width 20 km s^{−1} (see text). The dashed orange lines show the ± 1σ lines estimated from 100 realizations. The quality of the deprojection is improved by a factor of ≈ 2 with respect to the adjacent bin method. 

Open with DEXTER  
In the text 
Fig. 8 Histogram of the projected velocities (from an initial Gaussian distribution of mean 150 km s^{−1} and standard deviation 50 km s^{−1}) on a moving bin w_{bin} shifted by differences between bin positions δv = 1 km s^{−1}. 

Open with DEXTER  
In the text 
Fig. 9 Deprojections by constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution (N_{v} ≈ 1200 points, N_{s} = 25 realizations). We show the mean deprojected distribution (black curve) and the ± 1σ limits (green curves) compared with the initial distribution (red curve). In the left figure, the peak position is 150 km s^{−1} and its standard deviation 50 km s^{−1}. The deprojection shows no bias with such a velocity peak width. In the right figure, the peak position is 150 km s^{−1} and the standard deviation 20 km s^{−1}. The deprojection shows a bias with such a narrow velocity peak width. The deprojected velocity peak is too low by ≈ 12% and too wide by ≈20%, although the peak position is precisely recovered to within ≈1%. 

Open with DEXTER  
In the text 
Fig. 10 Deprojections by the constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution with peak position 150 km s^{−1} and standard deviation 50 km s^{−1} (N_{v} ≈ 12 000 points, N_{s} = 25 realizations). We show the mean deprojected distribution (black curve) and the ± 1σ limits (greencurves) compared with the initial distribution (red curve). The quality of the deprojection is clearly improved compared to the 1200 points case, in particular concerning the peak amplitude (Fig. 9). 

Open with DEXTER  
In the text 
Fig. 11 Deprojection by a varying moving bin. The bin width varies between 20 and 40 km s^{−1} dependingon the slope. The obtained distribution is finally smoothed out by a bin width 20 km s^{−1}. We performed several realizations of a sample projected from an initial twoGaussian peaks distribution with respective peak positions 70 and 150 km s^{−1} and standard deviations 15 and 20 km s^{−1} (N_{v} ≈ 12 000 points, N_{s} = 25 realizations). We show the initial distribution as a red continuous curve. Despite the narrowness of the peaks, the quality of the deprojection is very good since the two peaks and the intermediate hollow are clearly identified at their true positions. 

Open with DEXTER  
In the text 
Fig. 12 Illustration of the method of deprojection of r_{p}, for 10 bins and an initial distribution P_{r}(r) = constant between 0 and r_{0}. We show how the initial unprojected number density in each given bin (this initial density is chosen to be constant = 1) is distributed among the original bin and those at smaller distances (areas contained between the broken lines). The red dashed line is the expected distribution [arccos(r_{p})] in the limit of vanishing bin width (infinite number of bins). Conversely, one can recover the initial unprojected distribution from the projecteddistribution by inverting such a decomposition (see text). There is just a small bias on the first and last bins. 

Open with DEXTER  
In the text 
Fig. 13 Example of statistical deprojection of r_{p} for a constantPDF of r. We start froma projected distribution N_{p} = arccos(r_{p}) (inclined blue points) for 20 bins (see text). The deprojected probability distribution is shown as almost horizontal magenta points. 

Open with DEXTER  
In the text 
Fig. 14 Example of deprojection of r_{p} for a projected distribution N_{p} obtained from a 3D (unprojected) initial Gaussian distribution N_{r} (black histogram, total 10 000 points) for 10 bins. The deprojected probability distribution (magenta points) is in good agreement with the initial probability distribution. 

Open with DEXTER  
In the text 
Fig. 15 Example of deprojection (points) of a projected distribution (decreasing blue curve) obtained from a 3D (unprojected) initial distribution r v^{2} (red curve), for 10 bins. The initial distribution is built from a Gaussian velocity distribution of mean 200 km s^{−1} and dispersion 50 km s^{−1} and a Gaussian distance distribution of mean 0.3 Mpc and dispersion 0.05 Mpc. 

Open with DEXTER  
In the text 
Fig. 16 Simulation of 100 random projections of an initial distribution (given by the red curve, see Fig. 15), followed by their respective deprojections (individual points) using a 10 bin deprojection matrix (see text). This simulation allows us to establish the uncertainty of the deprojection method (dashed curves = ± 1σ). 

Open with DEXTER  
In the text 