Statistical deprojection of galaxy pairs

Aims. The purpose of the present paper is to provide methods of statistical analysis of the physical properties of galaxy pairs. We perform this study to apply it later to catalogs of isolated pairs of galaxies, especially two new catalogs we recently constructed that contain ≈ 1000 and ≈ 13000 pairs, respectively. We are particularly interested by the dynamics of those pairs, including the determination of their masses. Methods. We could not compute the dynamical parameters directly since the necessary data are incomplete. Indeed, we only have at our disposal one component of the intervelocity between the members, namely along the line of sight, and two components of their interdistance, i.e., the projection on the sky-plane. Moreover, we know only one point of each galaxy orbit. Hence we need statistical methods to ﬁnd the probability distribution of 3D interdistances and 3D intervelocities from their projections; we designed those methods under the term deprojection. Results. We proceed in two steps to determine and use the deprojection methods. First we derive the probability distributions expected for the various relevant projected quantities, namely intervelocity v z , interdistance r p , their ratio, and the product r p v 2 z , which is involved in mass determination. In a second step, we propose various methods of deprojection of those parameters based on the previous analy- sis. We start from a histogram of the projected data and we apply inversion formulae to obtain the deprojected distributions; lastly, we test the methods by numerical simulations, which also allow us to determine the uncertainties


Introduction
With the aim of using an improved method to study the dynamics of galaxy pairs, we constructed two new pair catalogs using well-defined criteria and improved observational data. The first is a catalog of ≈1000 pairs with high accuracy radial velocities (Chamaraux & Nottale 2016) extracted from Nilson's Uppsala Galaxy Catalog (UGC;Nilson 1973), which has the advantage that it is complete in apparent diameter. The second is a catalog of ≈13 000 pairs (Nottale & Chamaraux 2018, the largest presently available to our knowledge), which makes use of the huge recent increase in astronomical data. We used the Hyper-LEDA database ( , Makarov et al. 2014) to identify galaxy pairs and extract their parameters from the large surveys (e.g., SDSS; Alam et al. 2015 and 2MASS).
To improve our understanding of physics at extragalactic scales, the goal of this paper is to elaborate better methods of statistical analysis of galaxy pair data (which can also be applied to any type of pairs of astronomical objects). This is the goal of the present paper. We need to obtain statistical information on the physical characteristics of these pairs, particularly their masses through Kepler's third law and the possible existence of anomalous dynamics in these pairs. Such a new dynamics is, according to various proposals, attributed to missing mass or dark matter (Bergstrom 2000), to modification of gravity (Sanders 2002), or to a new dark potential (Nottale 2011;Chavanis 2017a,b).
In order to compute those physical quantities, we have to know the 3D velocity difference between the pair members and their 3D interdistances. But we have at our disposal only one component of the velocity difference (along the line of sight) and two components of the interdistance (projection on the skyplane). Therefore one has to find statistical methods to obtain those 3D quantities from the projected quantities. We call these methods statistical deprojection. Moreover, we note that for each galaxy pair we can determine the various 3D parameters for only one point of each galaxy orbit and only one instant.
Because of these limitations, various methods of analysis have been devised (Chengalur et al. 1996;Peterson 1979;Faber & Gallagher 1979). However, as shown by Faber & Gallagher (1979), these methods remain unsatisfactory from a mathematical viewpoint. In particular, recovering the mass remains very uncertain. For these reasons, we propose new methods of deprojection, which we show here to yield more precise results for recovering the 3D unprojected parameters.
The paper is organized as follows. In Sect. 2 we first analyze the statistical projection process and derive the probability distributions expected for the various relevant projected quantities, namely, intervelocity, v z ; interdistance, r p ; their ratio, r p /v z , which can be used as signature of circular orbits; and the product, r p v 2 z , which is involved in mass determination. In Sect. 3, we develop statistical methods of deprojection of these parameters, that is, intervelocity, interdistance, and mass, based on the previous analysis. We start from a histogram of the projected data organized in various ways, i.e., constant bins, moving bins, and variable moving bins. Then we apply inversion formulae, which amount in some cases to matrix inversion, Fig. 1. Theoretical expectation vs. numerical simulation of the statistical distribution of v z /v, where v z is the radial velocity difference between galaxies in randomly oriented pairs and v the true velocity difference. The number of simulated pairs is here N ≈ 10 000. Apart from statistical fluctuations, the obtained probability distribution of the radial velocity is constant in the range [0, v], in agreement with the theoretical expectation (given by the constant red line).
to obtain the deprojected distributions. Finally we test these methods using numerical simulations, which also allow us to determine the uncertainties involved.

Statistical analysis
Let us use a cylindrical coordinate system whose axis z is oriented from the observer to the galaxy pair. In this case the (x, y) plane is the plane of the sky. The position vector r is defined as x = r p cos ϕ, y = r p sin ϕ, z, while the velocity vector u is defined as The measured quantities are the position vector projected on the plane of the sky, r p = (x, y), and the radial velocity v z . Then r p , v z , and ϕ are known while z, v p , and ϕ v are unknown variables.

Probability distribution of radial velocity
Consider a randomly oriented 3D vector (x, y, z) of fixed length r. Let us determine the probability distribution of any of its projections, say z. The part of the sphere of radius r which projects between z and z + dz has a surface dS = −2πr p r dθ = −2πr 2 sin θ dθ = dS dz dz.
Since z = r cos θ, then dz = −r sin θ dθ, and we finally find that dS /dz = 2πr. The probability distribution of the projection on any axis of a randomly oriented 3D vector of length r can then be derived and it is therefore constant when r is constant. Since z can vary from −r to +r, we verify that this probability distribution is correctly normalized.
This can be easily applied to the velocity vector. Given a fixed pair configuration (r, u) with random orientation, the probability distribution of the values of the radial velocity differences between the pair members is We note that here no member of the pair is priviledged, such that we consider only positive differences, which then vary between 0 and v. We performed a numerical simulation of this projection of a given pair with random orientation. The obtained distribution of v z confirms this expectation (Fig. 1).

Probability distribution of projected distance
We consider a pair of objects, the 3D interdistance between which is r, fixed. Let r p = x 2 + y 2 be the distance between the pair members projected on the plane of the sky. The part of the sphere corresponding to projected distances lying in the interval [r p , r p + dr p ] is made of two rings of width rdθ and radius r p . Therefore its surface is dS = 4πr p rdθ. Now, since r p = r sin θ, we find The differential probability distribution of r p is dP = dS /(4πr 2 ). Then the normalized probability density of r p values p(r p ), projected from a given r value, is written as We performed a numerical simulation of such a projection. The obtained distribution is in fair agreement with this theoretical expectation (Fig. 2).
More generally, we consider now a set of pairs whose distances between their members are distributed with a probability P r (r) when r lies in the interval [r 1 , r 2 ]. When the pair orientation is random, the expected probability distribution of the projected distances on the plane of the sky is now given by 2.3. Probability distribution of ratio ζ = r p /v z This ratio can be used as signature of circular orbits. We give in Appendix A the probability distribution of a product and of the ratio of two variables, of which the individual probability distributions are known, and also that of the inverse of a variable. From these basic formulae, one can infer the probability distribution functions (PDFs) of various combinations of the variables observed for pairs [v z and (x, y), involving r p ].
Since p(r p ) = r p / 1 − r 2 p with 0 < r p < 1 (for r = 1) and p(v z ) = 1 with 0 < v z < 1 (for v = 1), we obtain for the ratio ζ = r p /v z [more generally, ζ = (r p /r)/(v z /v)] which is written as Two cases must now be considered. When ζ > 1, the integration interval is reduced to [0, 1/ζ]. One finds When ζ < 1, the integration interval is again [0, 1] and one finds We performed a numerical simulation of randomly oriented pairs with uncorrelated randomly oriented velocity differences. As can be seen in Fig. 3, the theoretically expected distribution agrees very well with the simulation.
In the case of circular orbits, once r p is given, the possible values of v z , instead of being uniformly distributed between 0 and v, are constrained to be smaller than v × (r p /r). As a consequence, the left part of the PDF of ζ = (r p /r)/(v z /v) below ζ = 1 is expected to be empty, which achieves a possible statistical signature of circular orbits.
The probability distribution of the reverse function, χ = v z /r p is easy to derive from these expressions and from Eq. (A.3). We find It is therefore constant up to χ = 1, after which it decreases quickly toward 0. In the case of circular orbits, only the constant part of the PDF remains (0 < χ < 1), which means that the projection of v z /r p becomes similar to that of v z . Fig. 3. Numerical simulation of the density distribution of the ratio r p /v z , where r p is the distance between pair members projected on the plane of the sky and v z is their radial velocity difference (here with r = 1 and v = 1). The number of simulated pairs is N = 4000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red continuous line, Eqs. (11) and (12)). In the case of circular orbits, the velocity and radial vectors are orthogonal, which implies r p /v z ≥ r/v (see text), i.e., r p /v z ≥ 1 in this figure. That achieves a clear signature of circular orbits, for which the left part of the figure (r p /v z < 1) below the sharp peak of the r p /v z PDF is expected to be empty.

Probability distribution of product
This product is involved in mass determination. Indeed, the orbits of isolated galaxy pairs are subjected to Kepler's third law, where M = M 1 + M 2 , a is the semimajor axis of the orbit and T the period. Defining a characteristic velocity V = 2πa/T , it yields the total mass of the system The perimeter of an ellipse varies between 4a (radial free fall) and 2πa (circular orbit). This perimeter is given by elliptic integrals, which can be approximated by a power series where e is the eccentricity of the ellipse, and therefore the average velocity V is such that When the orbit is circular, r = a and v = V, such that in this case the total mass is given by Instead of r we have only access to r p , and, instead of v, to v z . This leads us to look for the probability distribution of η = r p v 2 z . Let us set U = v 2 z , then p U (U) = 1/(2 √ U), while we recall that p(r p ) = r p / 1 − r 2 p when the radius is normalized to r = 1. In the case of uncorrelated values of r and v, the general formula for the probability distribution of a product Eq. (A.1) then yields vs. numerical simulation of the density distribution of the product r p v 2 z , where r p is the distance between pair members projected on the plane of the sky and v z is their radial velocity difference. This combination of variable is essential for the statistical determination of the total mass of the pair. The product is plotted here for r = 1 and v = 1, i.e., the plotted variable is actually (r p /r)(v z /v) 2 . The number of simulated pairs is N ≈ 10 000. Within statistical fluctuations, the obtained probability distribution agrees well with the theoretical expectation (red line, Eq. (20)).
The integration limits are determined by the fact that 0 < v z < 1 (for a normalized velocity v = 1) and hence U < 1; U 2 − η 2 must be >0 due to the square root such that U > η. This integral can be integrated in terms of the hypergeometric function 2 F 1 (a, b; c; z) as follows: This formula is in excellent agreement with the result of a numerical simulation of the projections of r to r p and v to v z (Fig. 4).

Methods of statistical deprojection
The knowledge of the expected statistical distribution of the various variables or of their combination allows one to construct methods of deprojection from the observed subset of variables.
3.1. Deprojection of x, y, and v z 3.1.1. Theoretical deprojection of v z The simplest method of deprojection deals with the value of a vector projected on a single axis. This is the case in particular for v z (radial velocity difference). The various methods described in this work are also valid for deprojection of the individual observed variables x and y (projections of r on the plane of the sky along right ascension and declination).
We have seen that, in this case, the expected probability distribution for a random orientation and a given value of the unprojected variable is constant. Therefore, if P v (v) is the probability distribution of the 3D velocity v, the probability distribution of v z is given by Nottale (2011) This distribution can be easily inverted. One obtains a deprojection formula, that is, and similar formulae for x and y separately; the deprojection of their combination in r p = x 2 + y 2 is considered in the following.

General algorithm of deprojection of v z
The simplest way to achieve statistical deprojection of a single variable, such as the radial velocity v z , consists of directly implementing Eq. (22). This formula means that if N v pairs have a true velocity difference v, their radial velocities are uniformly distributed between 0 and v. The existence of these N v objects at velocity v creates a jump δN = N v in the observed distribution of v z , N(v z ) = N tot P(v z ). Therefore, the contribution of a particular true intervelocity v to the observed PDF of v z is given by the surface of the rectangle of sides v × δN (see Fig. 5).

Various methods of deprojection of v z
Differences on adjacent constant bins. The simplest way to implement Eq. (22) consists of (1) constructing the histogram N p i of radial (projected) velocities V r in bins [V i−1 , V i ] of given width δV; (2) computing the differences (N p i − N p i−1 ) between the numbers in successive bins; (3) multiplying by the rank i = V i /δV of the bin. We obtained in this way the deprojected numbers, N = V(δN p /δV), i.e., in terms of the elements of the histogram, . This number should be assigned to a bin of width δV and of mean deprojected velocity V = (i + 1 2 ) δV. Indeed, the projected velocity V i = i δV is the limit between the bins of ranks i and i + 1 in the projected histogram, and the resulting N i should be assigned to this velocity V i in the mean, i.e., to a bin centered on this velocity and therefore shifted by δV/2 with respect to the initial histogram.
We performed a numerical simulation in which we calculated the mean and standard deviations of 100 realizations of the deprojection (see Fig. 6). We recovered in these simulations the overall shape of the initial distribution (chosen to be a Gaussian of mean 190 km s −1 and standard deviation 90 km s −1 ). In particular the existence and position of the main peak is recovered in a satisfactory way. We can see in Fig. 6 the improvement obtained when going from a catalog of ≈1000 pairs to ≈10 000, regarding the decrease of both the optimal bin width and dispersion of the  . We randomly projected a random realization of this Gaussian distribution, then deprojected it using the constant bin method with a bin width 20 km s −1 (see text). The blue broken line is the obtained deprojection, compared with the ±1σ lines estimated from 100 realizations (see Fig. 6). Right figure: various realizations using differences on non-adjacent bins (N p i+1 − N p i−1 ), for the same original velocity PDF. We randomly projected random realizations of this Gaussian distribution, then deprojected it using the constant bin method (blue histograms) with a bin width 20 km s −1 (see text). The dashed orange lines show the ±1σ lines estimated from 100 realizations. The quality of the deprojection is improved by a factor of ≈2 with respect to the adjacent bin method. various PDFs. A given realization of a catalog deprojection is expected to be contained (with a high probability) between the two ±1σ lines, as supported by an example given in Fig. 7 (left part) .
However, Eq. (22) presupposes a strictly monotonic decreasing distribution. Otherwise it may lead to obtaining negative numbers. The problem is that there are fluctuations that may locally break this expected monotony.
For large enough bin sizes, it is clear that the monotony is preserved. Therefore, a possible way to solve this difficulty without losing too much resolution in the deprojection is to choose the smallest of these large enough bins. This value depends on the total number of pairs. In our numerical simulations, the optimal bin width was found to be ≈30 km s −1 for ≈1000 pairs and ≈20 km s −1 for ≈10 000 pairs.
Another possible method consists of correcting the negative numbers by substracting these numbers to the adjacent bins. This method is possible only for small deviations and reveals to be equivalent to the previous method.
Differences between constant intervals separated by two bins. Actually, the previous method where the difference is taken between two adjacent bins is not optimized and it can therefore be improved. It is more efficient (as in finite difference methods) to take differences between two intervals separated by one bin, N p i+1 − N p i−1 . This improvement is based on the fact that f ( We checked the method by a numerical simulation (see right Fig. 7). We find that the ±1σ standard deviation on the PDF is half that obtained with the previous method (differences N p i − N p i−1 ), thus achieving a significant improvement. Differences on constant moving bins. The method that uses constant bins has drawbacks: the result is digitalized too much, has low resolution, and is too dependent on limits between bins.
Then we devised another method that deals with more information about the original radial velocity PDF and is more continuous. It amounts to performing a histogram of the projected velocities on a large enough moving bin w bin shifted by a low value, e.g., δv = 1 km s −1 . We obtain a function that is both monotonous and quasi-continuous. An example of such a function is given in Fig. 8: it is obtained from the projection of ≈12 000 velocities having a Gaussian distribution of mean 150 km s −1 and standard deviation 50 km s −1 . Then we applied on this function the deprojection formula P v dv r = −VdP v r (Eq. (22)), but with the small shift dv r = δv. Then we performed a moving average of the resulting distribution using the original bin width w bin .
Numerical simulations were achieved to validate the method. We first defined a given PDF of the 3D velocity v, performed a random realization of this PDF on N v values v i , randomly projected them to v ri , and then we deprojected the distribution obtained using the moving bins method. The process was repeated N s times to obtain the convenient statistics of the method.
We give in Fig. 9 the results obtained for an initial v distribution showing one unique peak described by a Gaussian for two different peak widths. The quality of the result depends on the width of the initial peak: for a large enough peak (left Fig. 9 and Fig. 10), the position, amplitude, and width of the peak are correctly recovered. When the peak is narrower (right Fig. 9), its position is correctly recovered by the deprojection, but its amplitude is slightly too low and correspondingly its width becomes too large (but only within one sigma).
Differences on varying moving bins. In order to account for this bias, we devised another more accurate method. The problem is that, when there is a peak in the PDF of the 3D true velocity v, it manifests as a high value of the slope in the PDF of the radial (projected) velocity v r . If the binwidth is too large, this slope is decreased by the smoothing out effect of the binning. As a consequence, the amplitude of the deprojected peak is too low. Thus we corrected this bias by using a moving bin of variable width, decreasing when the slope increases. A balance should be found for the variation of the binwidth, since a smaller bin also increases the fluctuations and therefore the final dispersion. This method gives very good results, since it allows us to recover multiple peaks present in the initial 3D distribution, as can be seen in Fig. 11.

Theoretical deprojection
Although one can obtain the probability distribution of r from x and y separately, more information is contained in their combination r p = x 2 + y 2 , from which a better deprojection is expected to be achieved. This expectation is confirmed by the distribution of r p for a given value of r, which shows a strong peak at r p = r (Fig. 2). Assuming a probability distribution P r (r) of the unprojected variable r, one expects for r p a probability distribution, i.e., P r p (r p ) = r p ∞ r p P r (r) dr For example, if the unprojected probability distribution is constant between 0 and r 0 , the projected distribution is given by (see Fig. 12) It does not seem possible to invert this formula analytically. However, it is very possible to construct an algorithm to perform this inversion numerically.

Deprojection method for r p
Let i max be the total number of bins in the histogram of r p values. We denote N p i the observed number of projected values r p in the bin of rank i. This number is the sum of contributions of 3D values r ≥ r p from bins of rank j ≥ i -we assume a same number of bins for the r and r p histograms (see Fig. 12).
Let us derive the probability law for the various contributions. The probability distribution of r p for a given r value has been shown (Sect. 2.2) to be for x = r p /r. We assume, as an approximation valid for a small enough bin width (i.e., a large enough number of bins), that the probability distribution of r remains constant in any bin. Then the probability density of given r values projected in the bin of rank i and relative width b is written as Concerning the r values pertaining to the bin of rank j, the normalized x values (i.e., r p ≤ r) belong to only a total number of j bins, such that the relative bin width (normalized to a total range x = 0 to 1) is b = 1/ j. Let us now show that the probability law of the contributions of the various r values (given by index j) can be recovered from that of the projected values r p (given by index i).
Indeed, let us introduce the matrix for i ≤ j. The other elements of this matrix (i > j) are zero. Let P i j be the transpose of this matrix, i.e., P = π T . If the column vector N p i represents the various projected numbers in the bin of rank i, they are obtained from the initial 3D column vector N r j by the matrix product Therefore the initial unprojected probability distribution is recovered from a mere matrix inversion, i.e., Fig. 9. Deprojections by constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution (N v ≈ 1200 points, N s = 25 realizations). We show the mean deprojected distribution (black curve) and the ±1σ limits (green curves) compared with the initial distribution (red curve). In the left figure, the peak position is 150 km s −1 and its standard deviation 50 km s −1 . The deprojection shows no bias with such a velocity peak width. In the right figure, the peak position is 150 km s −1 and the standard deviation 20 km s −1 . The deprojection shows a bias with such a narrow velocity peak width. The deprojected velocity peak is too low by ≈12% and too wide by ≈20%, although the peak position is precisely recovered to within ≈1%. Fig. 10. Deprojections by the constant moving bins method of several realizations of a sample projected from an initial Gaussian distribution with peak position 150 km s −1 and standard deviation 50 km s −1 (N v ≈ 12 000 points, N s = 25 realizations). We show the mean deprojected distribution (black curve) and the ±1σ limits (green curves) compared with the initial distribution (red curve). The quality of the deprojection is clearly improved compared to the 1200 points case, in particular concerning the peak amplitude ( Fig. 9).
For example, for three bins the relation between the probability distributions is written as The deprojected reverse relation is, in this case, This method is illustrated in Fig. 13, where we deproject a projected distribution N p = arccos r p , which is the expected function for a constant initial distribution N r = cst (in the null bin width limit). The deprojected distribution is constant as expected, except for a small bias mainly involving the first and last bins (which is therefore easy to correct and disappears when the number of bins is increased). Another example is given in Fig. 11. Deprojection by a varying moving bin. The bin width varies between 20 and 40 km s −1 depending on the slope. The obtained distribution is finally smoothed out by a bin width 20 km s −1 . We performed several realizations of a sample projected from an initial two-Gaussian peaks distribution with respective peak positions 70 and 150 km s −1 and standard deviations 15 and 20 km s −1 (N v ≈ 12 000 points, N s = 25 realizations). We show the initial distribution as a red continuous curve. Despite the narrowness of the peaks, the quality of the deprojection is very good since the two peaks and the intermediate hollow are clearly identified at their true positions. the numerical simulation of Fig. 14, where the initial 3D distribution is randomly drawn from a Gaussian probability density (10 000 points). This distribution is nicely recovered from the matrix inversion method. In this case the bias involving the extreme points is unobservable, since these values are almost vanishing.

Theoretical deprojection
The total mass M = M 1 + M 2 of a pair of objects in relative Keplerian motion is given by where G is Newton's constant of gravitation, a is the semimajor axis of the orbit of one object around the other, and T is the period and V = 2πa/T . Fig. 12. Illustration of the method of deprojection of r p , for 10 bins and an initial distribution P r (r) = constant between 0 and r 0 . We show how the initial unprojected number density in each given bin (this initial density is chosen to be constant = 1) is distributed among the original bin and those at smaller distances (areas contained between the broken lines). The red dashed line is the expected distribution [arccos(r p )] in the limit of vanishing bin width (infinite number of bins). Conversely, one can recover the initial unprojected distribution from the projected distribution by inverting such a decomposition (see text). There is just a small bias on the first and last bins. Fig. 13. Example of statistical deprojection of r p for a constant PDF of r. We start from a projected distribution N p = arccos(r p ) (inclined blue points) for 20 bins (see text). The deprojected probability distribution is shown as almost horizontal magenta points.
Fig. 14. Example of deprojection of r p for a projected distribution N p obtained from a 3D (unprojected) initial Gaussian distribution N r (black histogram, total 10 000 points) for 10 bins. The deprojected probability distribution (magenta points) is in good agreement with the initial probability distribution. Fig. 15. Example of deprojection (points) of a projected distribution r p v 2 z (decreasing blue curve) obtained from a 3D (unprojected) initial distribution r v 2 (red curve), for 10 bins. The initial distribution is built from a Gaussian velocity distribution of mean 200 km s −1 and dispersion 50 km s −1 and a Gaussian distance distribution of mean 0.3 Mpc and dispersion 0.05 Mpc. Fig. 16. Simulation of 100 random projections of an initial distribution (given by the red curve, see Fig. 15), followed by their respective deprojections (individual points) using a 10 bin deprojection matrix (see text). This simulation allows us to establish the uncertainty of the deprojection method (dashed curves = ±1σ).
where j = 1 to N and i = 1 to N + 1 − j (and A ji = 0 for the remaining coefficients).
Then the deprojection matrix is obtained from this matrix by the following transformation: For example, for N = 3, the projection matrix is written as We tested the method by numerical simulations in which we built a probability distribution for v and r (e.g., Gaussian distributions, see Fig. 15), then we projected these to v z and r p and computed the resulting r p v 2 z distribution. Whatever the initial 3D distribution, the projected distribution is strongly decreasing, as expected from the shape of the projection function p η (η) (see Fig. 4). This property is the reason why it is so difficult to estimate the pair mass (Faber & Gallagher 1979). However, despite this problem, our new method allows us to recover the 3D PDF of rv 2 with a fair accuracy (see Figs. 15 and 16).
This yields the total mass M of the system for circular orbits, but in the general case, where r and v are instantaneous values on elliptical orbits, there is, as previously specified, an additional uncertainty to go from the rv 2 distribution to the M distribution, which may be estimated from an evaluation of the eccentricity distribution. This will be studied in a forthcoming work.

Conclusions
In this paper, we developed statistical methods of deprojection of the main physical parameters of pairs of astronomical objects (in particular, galaxy pairs). These methods are needed since, in the extragalactic domain, only one component of the intervelocity and two components of the interdistance are available on only one point of the orbit instead of the six (x k , v k ) coordinates on the full orbit.
We analytically determined the various PDFs of the projected variables: radial velocity v z , projected interdistance on the sky-plane r p , their ratio r p /v z , which can be used as signature of circular orbits, and r p v 2 z , which intervenes in the calculation of the pair mass. These analytical solutions were validated by numerical simulations.
Then we described in detail the deprojection methods obtained by inversion of these projection functions. These methods were conceived in a digitalized way to deal with the real data that will be available in the form of histograms.
In this paper, we prepared the application to effective extragalactic galaxy pair data, and in particular to the two galaxy pair catalogs that we recently constructed, one of which contains 13 000 pairs (Nottale & Chamaraux 2018). These numerical simulations support the validity of our deprojection methods, and they also allow us to determine their error bars.
In a forthcoming work, we will apply these deprojection methods to the study of the physics of galaxy pairs, in particular to their dynamics. Such a study will then benefit from twin improvements to catalogs (membership criteria, data quality, and size) and to methods aiming at statistically recovering missing information.