Free Access
Issue
A&A
Volume 563, March 2014
Article Number A36
Number of page(s) 11
Section Cosmology (including clusters of galaxies)
DOI https://doi.org/10.1051/0004-6361/201321941
Published online 28 February 2014

© ESO, 2014

1. Introduction

Determining the value of the constitutive parameters of the Friedman equations is a problem of great observational and interpretative difficulty, but owing to its bearing on fundamental physics, it is one of great theoretical significance. Current estimations suggest that we live in a homogeneous and isotropic universe where baryonic matter is a minority (1/6) of all matter, matter is a minority (1/4) of all forms of energy, geometry is spatially flat, and cosmic expansion is presently accelerated (e.g. Komatsu et al. 2011; Blake et al. 2012; Reid et al. 2012; Sanchez et al. 2012; Anderson et al. 2012; Marinoni et al. 2012; Planck Collaboration 2014).

However evocative these cosmological results may be, incorporating them into a physical theory of the universe is a very perplexing problem. Virtually all the attempts to explain the nature of dark matter which is the indispensable ingredient in models of cosmic structure formation (Frenk & White 2012), as well as of dark energy, which is the physical mechanism that drives cosmic acceleration (Peebles & Ratra 2003), invoke exotic physics beyond current theories (Feng 2010; Copeland et al. 2006; Clifton et al. 2012). Measurements, on the other hand, are not yet precise enough to exclude most of these proposals (Amendola et al. 2013). The Wilkinson microwave anisotropy probe (WMAP, Komatsu et al. 2011) and the Planck mission Planck Collaboration (2014), for example, fix the parameters of a “power-law ΛCDM” model with impressive accuracy. This is a cosmological model characterized by a flat geometry, by a positive dark energy (DE) component ΩX with w = −1, and by primordial perturbations that are scalar, Gaussian, and adiabatic (Weinberg 2008). A combination of astrophysical probes are needed if we are to constrain deviations from this minimal model. Because they are sensitive to different sets of nuisance parameters and to different subsets of the full cosmological parameter set, different techniques provide consistency checks, lift parameter degeneracies, and enable stronger constraints and, in the end, a safer theoretical interpretation (Frieman et al. 2008).

A way to meet this challenge is by designing and exploring the potential of new cosmic probes. In this spirit, we have recently proposed to exploit the internal dynamics of isolated disc galaxies (Marinoni et al. 2008a) and of pairs of galaxies (Marinoni & Buzzi 2010) in order to set limits on relevant cosmological parameters. Here we demonstrate that precise and accurate cosmological information can be extracted from the large scale spatial distribution of galaxies if their clustering properties are characterized through the measurement of the clustering ratio η, the ratio between the correlation function, and the variance of the galaxy overdensity field smoothed on a scale R.

Gaining insight into the dark matter and dark energy sectors via the analysis of the 3D clustering of galaxies is a tricky task. There is no reason to expect (and there is observational evidence to the contrary) that the galaxies trace the underlying matter distribution exactly. A fundamental problem is thus understanding how to map the clustering of different galaxy types into the clustering of the underlying matter, which the theories most straightforwardly predict.

Most of the attempts to address this issue centre on the feasibility of reconstructing the biasing relation from independent observational evidence, for example weak lensing surveys (thus increasing the complexity of the observational programes) or on the possibility of considering biasing as a nuisance quantity that can be statistically marginalized (thus increasing the number of parameters of the model and degrading the predictability of the theory). The complementary line of attack taken in this paper consists in developing right from the start a bias-free statistical descriptor of clustering, that is an observable that can be directly compared to theoretical predictions.

The paper is organized as follows. We define the clustering ratio η in Sect. 2, and we discuss its theoretical properties in Sect. 3. In Sect. 4 we test its robustness and performances by analysing numerical simulations of the large scale structure of the universe. Cosmological constraints obtained from SDSS DR7 sample (Abazajian et al. 2009) are derived and discussed in Sect. 5. Conclusions are drawn in Sect. 6.

Throughout, the Hubble constant is parameterized via h = H0/100 km s-1 Mpc-1. Data analysis is not framed in any given fiducial cosmology; i.e., we compute the clustering ratio statistic in any tested cosmology.

2. A new cosmological observable: the galaxy clustering ratio

We characterize the inhomogeneous distribution of galaxies in terms of the local dimensionless density contrast δg(x)ρg(x)ρg(x)1,\begin{equation} \delta_g({\vec x}) \equiv \frac{ \rho_g({\vec x})}{\langle\rho_g({\vec x})\rangle}-1, \end{equation}(1)where ρg(x) is the comoving density of galaxies in real space, and where ⟨ ρg(x) ⟩ denotes the selection function, which is the expected mean number of galaxies at position x, given the selection criteria of the survey. Since the galaxy distribution is a stochastic point process, a spherical top-hat filter W of radius R is applied to generate a continuous, coarse-grained observable δg,R(x)=δg(y)W(|xy|/R)dy.\begin{equation} \delta_{g,R}({\vec x})=\int \delta_g({\vec y}) {\vec W}(|{x-\mathbf{y}}|/R) {\rm d}{\vec y}. \end{equation}(2)The second-order, one-point κ20,g,R=δg,R2(x)c\begin{equation} \kappa_{20,g,R}=\langle \delta_{g,R}^2({\vec x}) \rangle_{\rm c} \end{equation}(3)and the two-point κ11,g,R(r)=δg,R(x)δg,R(x+r)c\begin{equation} \kappa_{11,g,R}({\vec r}) = \langle \delta_{g,R}({\vec x})\delta_{g,R}({\vec x} +{\vec r}) \rangle_{\rm c} \end{equation}(4)cumulant moments are the lowest order, non-zero connected moments of the probability density functional (PDF) of the field δg,R(x) (Szapudi et al. 1992; Szapudi & Szalay 1997, 1998; Bernardeau 1996; Bernardeau et al. 2002; Bel & Marinoni 2012). If the PDF is stationary and isotropic, the ratio κ11,g,R(r)/κ20,g,R is equivalent to the one between the correlation function and the variance of the filtered field ηg,R(r,p)=ξg,R(r,p)σg,R2(p),\begin{equation} \eta_{g,R}(r,{\vec p})=\frac{\xi_{g,R}(r,{\vec p})}{\sigma^2_{g,R}({\vec p})}, \label{etagr} \end{equation}(5)where we have explicitly emphasized the dependence of the observable on the set (p) of cosmological parameters. This comes through because the statistical descriptor (5) can be estimated from data only once a comoving distance-redshift conversion model has been supplied.

2.1. Estimating the galaxy clustering ratio

The variance σg,R2\hbox{$\sigma^2_{g,R}$} and the correlation function ξg,R of a smoothed density field of galaxies can be estimated from the three-dimensional distribution of galaxies following the procedure outlined in Bel & Marinoni (2012). We assume that the random variable N models the number of galaxies within typical spherical cells (of constant comoving radius R) and consider the dimensionless galaxy excess δNN1,\begin{equation} \delta_N\equiv\frac{N}{\bar N}-1, \end{equation}(6)where \hbox{$\bar N$} is the mean number of galaxies contained in the cells.

To estimate the one-point second-order moment of the galaxy overdensity field (κ2=δN2c\hbox{$\kappa_{2} = \langle \delta_N^2 \rangle_{\rm c}$}), we fill the survey volume with the maximum number (nt) of non-overlapping spheres of radius R (whose centre is called a seed) and we compute κ̂2=1nti=1ntδN,i2,\begin{equation} \hat\kappa_{2}=\frac{1}{n_t}\sum_{i=1}^{n_t}\delta_{N,\,i}^2, \end{equation}(7)where δN,i is the dimensionless counts excess in the ith sphere.

The two-point second-order moment κ11 =  ⟨ δNiδNj ⟩ c follows from a generalization of this cell-counting process. To this purpose, we add a motif of isotropically distributed spheres around each seed and retain as proper seeds only those for which the spheres of the motifs lie completely within the survey boundaries. The centre of each new sphere is separated from the proper seed by the length r = nR (where n is a generic real parameter usually taken to be an integer without loss of generality), and the pattern is designed in such a way as to maximize the number of quasi non-overlapping spheres at the given distance r. In this study, as in Bel & Marinoni (2012), the maximum allowed overlap between contiguous spheres is set to 2% in volume. This value represents a good comprise between the need to maximize the number of the spheres that are correlated to a given seed and the volume probed by them. If there is a substantial overlap of the spheres of the motif, all the signal on the correlation scales r is fully sampled. However, this strategy is computationally expensive. On the other hand, if the number of spheres is too small, then a substantial fraction of the information available at a given correlation scale r is lost. We have verified that varying this threshold in the range 2−50% does not modify the estimation of one- and two-point statistical properties of the counts.

An estimator of the average excess counts in the i and j cells at separation r is κ̂11=1ntnmot{i=1ntδN,ij=1nmotδN,ij},\begin{equation} \hat\kappa_{11}=\frac{1}{n_t n_{\rm mot}}\left\lbrace\sum_{i=1}^{n_t}\delta_{N,i}\sum_{j=1}^{n_{\rm mot}}\delta_{N,ij}\right\rbrace, \end{equation}(8)where nmot is the number of spheres in the motif, and where δN,ji is the excess count in the motif’s cell j at distance r from the seed i.

It is natural, or at least convenient, to model the spatial point distribution of galaxies as a process resulting from the discrete sampling of an underlying continuous stochastic field λg(x). The quantity of effective physical interest to which we want to have access is thus \hbox{$\delta_{g,R}({\vec x})\equiv \Lambda_g({\vec x}) / {\bar \Lambda_g}-1$}, where Λg(x) = V(x)λ(x′)dx is the continuous limit of the discrete counts N in volume V. As a consequence, it is necessary to correct counting estimators for discreteness effects. To this purpose and following standard practice in the field, we model the sampling as a local Poisson process (LPP, Layser 1956). Specifically, we map moments of the discrete variable N into moments of its continuous limit by using Λgk=N(N1)...(Nk+1)(N)fk,\begin{equation} \langle\Lambda_g^k\rangle=\langle N(N-1)...(N-k+1)\rangle \equiv \langle(N)_f^k\rangle, \label{unpoint} \end{equation}(9)in the case of one-point statistics, and its generalization (Szapudi & Szalay 1997; Bel & Marinoni 2012) Λgk(x1)Λgq(x2)=(N1)fk(N2)fq,\begin{equation} \langle\Lambda_g^k({\vec x}_1)\Lambda_g^q({\vec x}_2)\rangle = \langle(N_1)_f^k(N_2)_f^q\rangle, \label{deuxpoint} \end{equation}(10)for the two-point case. As a result, the estimators of the variance and the correlation function, corrected for shot noise effects, are (Szapudi et al. 1992; Bel & Marinoni 2012)

σ̂ 2 g,R = κ̂ 2 -1 \begin{eqnarray} && \hat \sigma_{g,R}^2 = \hat\kappa_{2}-\bar N^{-1} \nonumber \\ && \hat \xi_{g,R} = \hat\kappa_{11}. \label{ccum} \end{eqnarray}(11)

By construction, the η observable is not sensitive to the shape of the radial selection function, if the density gradient is nearly constant on scales r ~ 2R. Sample-dependent corrections are mandatory, instead, if the survey geometry is not regular; i.e., a significant fraction of the counting cells overlap the survey boundaries. Bel et al. (2014), for example, show how to minimize the impact of radial (redshift) and angular incompleteness when the η estimator is applied to a survey, the VIMOS Public Extragalactic Redshift Survey (VIPERS, Guzzo et al. 2013), characterized by non-trivial selection functions.

The scaling of the galaxy clustering ratio ηg,R(nR) as a function of both R and n is shown in Fig. 1. That galaxies tend to cluster on small cosmic scales is revealed by the fact that the amplitude of η decreases monotonically as a function of the correlation scale r. In other terms, the probability of finding two cells with density contrasts of equal sign is suppressed as the separation between the cells increases. The trend of ηg,R(r) as a function of R (for a given fixed value of r) is the opposite, implying that the relative loss of power that results from filtering the field on larger and larger scales is stronger in one-point than in two-point statistics (at least when second-order moments are considered). We also note that, for constant values of n, ηg,R(nR) decreases monotonically as a function of the smoothing radius R only if the field is correlated on small scales, i.e. n < 4. For higher n values, in fact, the clustering ratio becomes sensitive to the baryon acoustic oscillations (BAO) imprinted on the large scale distribution of galaxies. As a consequence, a peak shows up at n = l/R, where l is the position of the BAO relative maximum in the correlation function of galaxies.

thumbnail Fig. 1

Redshift-space galaxy clustering ratio ηg,R(nR) (LHS of Eq. (21)) extracted from 160 independent LasDamas simulations of the distribution of luminous red galaxies (LRG, see Sect. 4). The observable ηg,R(nR) is shown as a function of the smoothing radius R for four different values of n as indicated in the key. Simulated observations are compared to the predicted amplitudes of the real-space mass clustering ratio ηR(nR) (RHS of Eq. (21)) for the same values of n. Dot-dashed lines show predictions obtained using a linear power spectrum, while solid lines are derived using the non-linear (halofit) model of Smith et al. (2003). Both the galaxy and mass clustering ratios are derived by adopting the same cosmological parameters of the LasDamas simulations. Error bars represent the uncertainties expected from a redshift survey covering a volume 160 times larger than that explored by the LRG sample used in this study.

3. Theoretical predictions for the amplitude of the clustering ratio

It is straightforward to predict the theoretical value of the second-order galaxy clustering ratio ηg,R. On large enough cosmic scales R, matter fluctuations are small and are described by the linear power spectrum Δ2(k,p)=4πAkns+3T2(k,p),\begin{equation} \Delta^2(k, {\vec p}) = 4\pi A k^{n_{\rm s}+3}T^2(k, {\vec p}), \label{plin} \end{equation}(12)where A is a normalization factor, ns the primordial spectral index, and T2 the transfer function. (In this study we assume the analytic approximation of Eisenstein & Hu 1998.) Accordingly, the amplitudes of the second-order statistics for mass evolve as a function of redshift (z) and scale (R) as σR2(z,p)=σ82(z=0)D2(z)R(p),\begin{equation} \sigma^2_R(z,{\vec p})=\sigma^2_8(z=0)D^2(z){\mathcal F}_R({\vec p}), \end{equation}(13)and ξR(r,z,p)=σ82(z=0)D2(z)𝒢R(r,p).\begin{equation} \xi_R(r,z, {\vec p})=\sigma_8^2(z=0) D^2 (z){\mathcal G}_R(r, {\vec p}). \end{equation}(14)Both these equations are normalized on the scale r8 = 8  h-1 Mpc, where D(z) represents the linear growing mode (Weinberg 2008), while the effects of filtering are incorporated in the functions R(p)=0+Δ2(k,p)2(kR)dlnk0+Δ2(k,p)2(kr8)dlnk\begin{equation} {\mathcal F}_R({\vec p})=\frac{\int_0^{+\infty}\Delta^2(k, {\vec p})\hat{W}^2(kR){\rm d}\ln k}{\int_0^{+\infty}\Delta^2(k,{\vec p})\hat{W}^2(kr_8){\rm d}\ln k} \label{ff} \end{equation}(15)𝒢R(r,p)=0+Δ2(k,p)2(kR)j0(kr)dlnk0+Δ2(k,p)2(kr8)dlnk\begin{equation} {\mathcal G}_R(r, {\vec p})=\frac{\int_0^{+\infty}\Delta^2(k, {\vec p}) \hat{W}^2(kR) j_{0}(kr) {\rm d}\ln k}{\int_0^{+\infty}\Delta^2(k, {\vec p})\hat{W}^2(kr_8){\rm d}\ln k} \label{gg} \end{equation}(16)where jn(x) is the spherical Bessel function of order n, and Ŵ is the Fourier transform of the window function. In this analysis we adopt a spherical top hat filter for which (kR)=3kRj1(kR).\begin{equation} \hat{W}(kR)=\frac{3}{kR}j_1(kR). \end{equation}(17)By analogy with the galaxy clustering ratio, we can now define the mass clustering ratio as ηR(r,p)=𝒢R(r,p)R(p)·\begin{equation} \eta_{R}(r, {\vec p})=\frac{{\mathcal G}_R(r, {\vec p})}{{\mathcal F}_R({\vec p})}\cdot \label{etar} \end{equation}(18)The relationship between the mass and galaxy clustering ratios in real space follows immediately once we specify how well the overall matter distribution is traced by its luminous subcomponent on a given scale R. On large enough scales, such as those explored in this paper, a local, deterministic, non-linear biasing scheme (Fry & Gaztañaga 1993), namely δg,R(x)=i=0N(bi,R/i!)δRi(x),\begin{equation} \delta_{g,R}({\vec x})=\sum_{i=0}^{N}(b_{i,R}/i!)\delta_R^i({\vec x}), \label{nlbias} \end{equation}(19)offers a fair description of the mapping between mass and galaxy density fields. The scale-dependent parameters bi,R are called biasing parameters, and within the limits σR2|b1,R/b2,R|\hbox{$\sigma_{R}^2 \ll |b_{1,R}/b_{2,R}|$} and ξR ≪ 1/b2 (see Eq. (28) of Bel & Marinoni 2012), one has

σg,R2(r)b1,R2σR2(r)\begin{eqnarray} && \sigma^2_{g,R}(\vr) \approx b_{1,R}^2 \sigma^2_{R}(\vr) \nonumber \\ && \xi_{g,R}(\vr) \approx b_{1,R}^2 \xi_{R}(\vr). \label{biassim} \end{eqnarray}(20)We therefore deduce that ηg,R(r,p)=ηR(r,p),\begin{equation} \eta_{g,R}(r, {\vec p}) = \eta_R(r, {\vec p}), \label{core} \end{equation}(21)a relation independent of the specific value of the bias amplitudes b1R.

The conditions under which Eq. (21) is derived are quite generic and are fulfilled once the field is filtered on sufficiently large scales R and/or once the second-order bias coefficient is negligible compared to the linear bias term. This last requirement is satisfied for large R (for example, Marinoni et al. 2005, 2008b), using VVDS galaxies at z = 1 (Le Fèvre et al. 2005), find b2,R/b1,R |  ~ 0.17±0.07 for R = 10  h-1 Mpc, a result implying that, at least on these scales, any eventual scale dependence of the biasing relation is also negligible.

thumbnail Fig. 2

Redshift-space galaxy clustering ratio ηg,R(nR) extracted from the LasDamas simulations (see Sect. 4) using R = 14  h-1 Mpc and n = 3 shown as a function of redshift. Markers represent the average measurement from 40 LasDamas mock catalogues, each of them simulating a redshift survey that has the same radial density profile and observational selection effects as the LRG sample of SDSS DR7. Simulated observations are compared to the predicted amplitudes of the real-space mass clustering ratio ηR(nR) (right-hand side of Eq. (21)). Dashed lines show predictions computed using a linear power spectrum, while solid lines are derived assuming the non-linear (halofit) model of Smith et al. (2003). Both the galaxy and mass clustering ratios are estimated by adopting the same cosmological parameters as the LasDamas simulations. Error bars represent the uncertainty from measurements in a single mock catalogue. They are obtained as the standard deviation of 40 independent measurements, therefore including the contribution due to cosmic variance.

The most interesting aspect of Eq. (18) is that, in the linear limit, it is effectively insensitive to redshift distortions, and, therefore, independent of their specific modelling. In other terms, the ratio ηR(r,p) between second-order statistics has identical amplitude in both real and redshift space. If the only net effect of peculiar velocities is to enhance the amplitudes of the density ripples in Fourier space, without any change in their phases or frequencies (as predicted, for example, by the Kaiser model, Kaiser 1987), then redshift distortions contribute equally to the numerator and denominator of Eq. (18). This simplifies the interpretation of the master Eq. (21): its left-hand side (LHS) can be estimated using galaxies in redshift space, while its right-hand side (RHS) can be theoretically evaluated in real space. This factorization property holds because the η statistic is only sensitive to the monopol of the power spectrum, i.e. to the quantity obtained by averaging the redshift-space anisotropic power spectrum Pz(k) over angles in k space (-11Pz(k,μk)dμk\hbox{$\int_{-1}^{1} P^z(k,\mu_k){\rm d}\mu_k$}, where μk is the cosine of the angle between the k vector and the line-of-sight). Indeed, as explained in Sect. 2.1, to estimate η, we position an isotropic distribution of cells (the motif) around each seed.

Another salient property is that, as long as a linear matter’s power spectrum is assumed, the mass clustering ratio ηR is effectively independent of the amplitude of the matter power spectrum normalization parameter A. In linear regime, i.e. when R is sufficiently large, the clustering ratio is also independent of redshift. This property is graphically illustrated in Fig. 2. This comes about because the only time-dependence appearing in the expression of the variance and correlation function of a smooth density field is through the growing mode D(z), a quantity that, on large linear scales, appears in the relevant equations as a multiplicative parameter that eventually factors out in the η ratio. Figure 2 shows that, on large scales and sufficiently low redshift, this property still holds even when a non-linear description of the matter power spectrum is adopted. Specifically, it shows that the mass clustering ratio predicted using the non-linear power spectrum of Smith et al. (2003) is effectively time-independent. In conclusion, for any given filtering (R) and correlation (r) scales, the mass clustering ratio ηR(r,p) behaves as a cosmic “standard of clustering”, a quantity that does not evolve across cosmic time.

The bias free property of Eq. (21) also deserves further comment. Since the smoothing and correlation are performed on fixed scales R and r, ηg,R is not a function but a number. Although one can estimate it using any given visible tracer of the large scale structure of the universe, this number captures second-order information about the clustering of the general distribution of mass. This conclusions holds as long as Eq. (19) provides a fair description of the nature of biasing on large scales R, which is as long as the matter – galaxy relation is deterministic and local. In this regard we remark that the galaxy density is likely to show some scatter around the dark matter density due to various physical effects. This stochasticity, however, is not expected to influence the accuracy of the Eq. (21) in a significant way. On large scales, the only effect stochasticity might introduce is a rescaling of the linear biasing parameter by a constant (Scherrer & Weinberg 1998). Because it is defined as a ratio, η is unaffected by such a systematic effect. A second source of concern comes from the fact that the η formalism is built out of the hypothesis that biasing is local; i.e., we can neglect the possible existence of scale dependent operators in the matter-galaxy density relations, at least on large smoothing scales R. A simple argument provides the guess for the sensitivity of the clustering ratio to this assumption. In the limit in which r > R, Ŵ2(kR)j0(kr) ~ j0(kr), the mass clustering ratio scales approximatively as ηR(nR,p)Δ2(kj0,p)Δ2(kW0,p),\begin{equation} \eta_R(nR,{\vec p}) \propto \frac{\Delta^2(k_{j_0}, {\vec p})}{\Delta^2(k_{W_0},{\vec p})}, \label{simp} \end{equation}(22)where, quite crudely, kj0 and kW0 can be estimated as the wave vectors where the amplitude of the low pass filters j0(kr) and Ŵ2(kR) falls to one half. The proposed test, therefore, exploits neither the absolute amplitude nor the full shape of the power spectrum, but only its relative strength on two distinct k scales. By choosing to correlate the field on scales r that are not too large compared to the smoothing scale R, it is possible to minimize the impact of any eventual k-dependent bias.

thumbnail Fig. 3

Left: isocontours of the function ηg,R/ηg,Rbestfit1\hbox{$\eta_{g,R}/\eta_{g,R}^{\rm best fit}-1$} displaying the relative variation in the galaxy clustering ratio (with respect to the best-fitting value deduced from the analysis of SDSS DR7 data) induced by a wrong choice of the distance-redshift conversion model (i.e. a wrong guess of the cosmological parameters mX]). The galaxy clustering ratio is reconstructed on the scales (R,r) = (14,42)  h-1 Mpc, and its expression is approximated well by Eq. (25). The best-fitting cosmology inferred from the analysis of SDSS DR7 data is Ωm = 0.28, ΩX = 0.45, w = −1.2. The isocontours measure the strength of the AP geometric distortions. Right: relative variation in the mass clustering ratio ηR/ηRbestfit1\hbox{$\eta_{R}/\eta_{R}^{\rm best fit}-1$} that results from computing the power spectrum in the wrong cosmology.

3.1. Precision and accuracy of the η formalism

The next step is to make sure that systematic effects, whether physical (such as any eventual non-locality in the biasing relation), observational (like systematic errors in estimating the relevant observables from data), or statistical (that, for example, although the estimator in Eq. (5) is a ratio of unbiased estimators, it does not necessarily to be unbiased itself) do not compromise the effectiveness of the formalism in practical applications. We therefore use numerical simulations of the large scale structure to compare the amplitude of the redshift space galaxy clustering ratio (the LHS of Eq. (21)) and the theoretically predicted value of the real space mass clustering ratio (the RHS of Eq. (21)). Figure 1 shows that, on scales R > 10  h-1 Mpc and r = 3R, Eq. (21) is good to better than 1%, and to better than 0.1% on scales larger than 15 h-1 Mpc (for the same correlation length r = 3R). This last figure is two orders of magnitude smaller than the precision achievable when measuring ηg,R(r) using current data (~7% see Sect. 4). Simulations also indicate that, for smoothing scales R < 15  h-1 Mpc, the precision achieved by using a non-linear power spectrum (in our case the phenomenological prescription of Smith et al. 2003) is nearly a factor of five better than the one obtained by adopting a simple linear model.

It is impressive how this remarkable sub-percentage precision in mapping theory onto real-world observations is achieved without introducing any external non-cosmological information, such as any explicit biasing parameter or any model that corrects galaxy positions for non-cosmological redshift distortions. Nonetheless, for very large separations (i.e. r = 6R), theoretical predictions fail to reproduce observations. As a matter of fact, when the field is smoothed on a scale R = 16  h-1 Mpc and correlated on a scale r = 16×6~100  h-1 Mpc, the η indicator becomes extremely sensitive to the specific features of the baryon acoustic oscillations imprinted in the galaxy distributions. Indeed, from a theoretical point of view, the correlation function of the smoothed density field results from convolving the correlation function of galaxies over two smoothing windows of radius R at positions x1 and x2 separated by the distance rξR(r)=1VR2VR(x1)dy1VR(x2)dy1ξ(y1,y2).\begin{equation} \xi_R({\vec r})=\frac{1}{V_R^2}\int_{V_{R({\vec x}_1)}}{\rm d}{\vec y}_1\int_{V_{R({\vec x}_2)}}{\rm d}{\vec y}_1\xi({\vec y}_1,{\vec y}_2). \end{equation}(23)This convolution preserves the position of the BAO peak, but it decreases its amplitude and broadens its width. Notwithstanding, it is still possible to detect the BAO scale in a precise way using the eta estimator. As the poor agreement between theory and measurements of Fig. 1 shows, however, the simple power spectrum models described above are not able to grasp the non-linear physics involved in this phenomenon. The modelling of local peculiar motions is also paramount if we are to predict the BAO profile to the precision required for cosmological purposes.

At the opposite limit, when the field is correlated on small scales (r = 2R), the mass clustering ratio fits observations in a better way if the non-linear prescription of Smith et al. (2003) is considered instead of the linear power spectrum. In other terms, numerical simulations indicate that once the δ-field is coarse grained on a sufficiently large scale R, a linear power spectrum captures the essential physics governing the clustering of galaxies only if the field is correlated on scales 3 ≤ n < 6.

3.2. The clustering ratio as a cosmic probe

Apart from the most immediate use as a statistic to measure the clustering of mass in a universal (sample-independent) way, the clustering ratio can also serve as a tool to set limits to the value of cosmological parameters. The RHS of Eq. (21) relies upon the theory of cosmological perturbations, and it is fully specified (it is essentially analytic) given the shape of the mass power spectrum in real space. Therefore it is directly sensitive to the shape (ns) of the primordial power spectrum of density fluctuations, as well as to the parameters determining the shape of the transfer function of matter perturbations at matter-radiation equality (in particular, the present-day extrapolated value of the matter (Ωmh2) and baryon (Ωbh2) density parameters; the possible contribution of massive neutrinos is neglected in this analysis). By contrast, the LHS term of Eq. (21) probes the structure of the comoving distance-redshift relation, therefore it is fully specified once the homogeneous expansion rate history of the universe is known. On top of Ωm, the LHS term is thus also sensitive to ΩX and w, i.e. to a wide range of DE models. If distances are expressed in units of h = H0/100 km s-1 Mpc-1, the LHS is effectively independent of the value of the Hubble constant H0.

The possibility of constraining the true cosmological model follows from noting that the equivalence expressed by Eq. (21) holds true if and only if the LHS and RHS are both estimated in the correct cosmology. Analytically, the amplitudes of the galaxy clustering ratio ηg,R(r) and of the mass clustering ratio ηR(r) do not match if clustering is analysed by assuming the wrong cosmology; since the derivatives with respect to the parameters p of the LHS and RHS of Eq. (21) are not identical. Physically, two effects contribute to breaking the equivalence expressed by Eq. (21) when a wrong set of parameters p is assumed: geometrical distortions, the Alcock and Paczynski (AP) signal (Alcock & Paczynski 1979; Ballinger et al. 1996; Marinoni & Buzzi 2010), dynamical effects, and distortions in the power spectrum shape. By AP, we mean the artificial anisotropy in the clustering signal that results when the density field is correlated using the wrong distance-redshift relation. In this case, and contrary to what one would expect on the grounds of symmetry considerations, the clustering power in the directions parallel and perpendicular to the observer’s line of sight is not statistically identical. Transfer function shape distortions, the second source of sensitivity to cosmology, are generated by evolving the relevant mass statistics in the wrong cosmological background, and they can be traced primarily to the spurious modification of the size of the horizon scale at the epoch of equivalence.

The amplitude of AP and power spectrum distortions are compared in Fig. 3. Geometric effects (for R = 14  h-1 Mpc and r = 3R), are almost completely degenerate with respect to the Ωm parameter, that is for any given fixed value of ΩX, the amplitude of the AP signal is nearly independent of Ωm. On the contrary, the distortions in the observable ηg,R induced by a wrong guess of the dark energy parameter are an increasing function of ΩX: the amplitude of the clustering ratio is underestimated (/overestimated) by at most 20% when ΩX varies in the interval [0,1]. Power spectrum distortions, instead, which are by definition independent of ΩX, are extremely sensitive to the matter density parameter Ωm. A change of ~± 0.1 of the true value (in this case Ωm = 0.28) results in a change of ηR by nearly ± 50%. We conclude that, given the uncertainty characterizing the SDSS data analysed in this paper, the AP signal is marginal – but it will become an interesting constraint on dark energy with larger datasets, such as Euclid (Laureijs et al. 2011).

thumbnail Fig. 4

Upper panels: distribution of the galaxy clustering ratio ηg,R(r) obtained by analysing 160 LasDamas mock catalogues simulating the distribution of luminous red galaxies (histograms). The observable ηg,R(r) is computed for the three different smoothing radii R shown on top of each plot and by assuming, furthermore, that r = 3R. The solid line displays the best-fitting Gaussian PDF. Lower panels: the relative deviation between the Gaussian model and the actual distribution of the data is shown.

thumbnail Fig. 5

Left panel: two-dimensional confidence limits on Ωm and ΩΛ from a “blind” analysis of the τCDM HVP simulation (Jenkins et al. 2001). Contours are plotted for ℒ/ℒmin < 2.3, 6.17 corresponding to 68 and 95 per cent c.l. for a multivariate Gaussian distribution with 2 degrees of freedom. The relevant one-dimensional marginalized constraint is shown in the legend. The clustering ratio ηg,R(r,p) is estimated using R = 14  h-1  Mpc and r = 3R. For the best-fitting cosmology we measure ηg,R(r,pf) = 0.0294±0.0017 where the error is evaluated via 30 jackknife resampling of the data, excluding each time a sky area of 36×19.5 deg2. Dirac delta priors are taken of Ωbh2, H0, and ns, that are centred on the simulated values. Right panel: same as before, but now contours represent the joint likelihood analysis of 40 independent ΛCDM mock catalogues simulating the SDSS LRG sample. For the best-fitting cosmology we find ηg,R(r,pf) = 0.0573±0.0008, where the error, computed as the s.d. of the mean of 40 measurements, includes the contribution from cosmic variance.

We now illustrate how, in practice, we evaluate P(p | ηg,R), the likelihood of the unknown set of parameters p = (ΩmX,w,H0bh2,ns) given the actual value of the observable ηg,R. The analysis does not require any specification of further model parameters other than those quoted above.

The probability distribution function of the clustering ratio is not immediately obvious since this observable is defined via a ratio of two non-independent random variables. Simulated and real data suggest that it is fairly accurate to assume that the PDF of ηg,R is approximately Gaussian. This is demonstrated in Fig. 4. As a result, the most likely set of cosmological parameters (pf) is the one that minimizes the logarithmic posterior ℒ = −log P=Nlogση,i+χ22logπ+B,\begin{equation} {\mathcal L}= \sum^{N} \log \sigma_{\eta,i} +\frac{\chi^2}{2}- \log \pi +B, \label{logpost} \end{equation}(24)where χ2=Nση,i-2(ηg,RηR)2\hbox{$\chi^2=\sum^{N}\sigma_{\eta,i}^{-2}(\eta_{g,R}-\eta_R)^2$}, where N is the number of estimates of ηg,R, ση,i is the error on the observable, π describes any a-priori information about the PDF of p, and where B is a normalization constant that can be fixed by requiring Pdp = 1.

Since Eq. (21) is free of look-back time effects, the analysis does not require slicing the sample in arbitrary redshift bins. Only one estimation (N = 1) of Eq. (5) is needed across the whole sample volume. We have recalculated the observable ηg,R(r,p) for each comoving distance-redshift model, i.e. on a grid mX,w) of spacing [10-2,5 × 10-2,5 × 10-2]. As a consequence, the posterior does not vary smoothly between different models because the number of galaxies counted in any given cell varies from model to model. However, since computing the observable ηg,R(3R,p) takes a limited amount of time, shot noise is the price we have decided to pay to obtain an unbiased likelihood hyper-surface.

4. Blind analysis of cosmological simulations

We assess the performances of the η test under realistic operating conditions via a “blind” analysis of mock catalogues that are characterized by widely different sets of expansion rate parameters, power spectrum shapes, mass tracers, and radial selection functions. These are the τCDM Hubble Volume Project (HVP) simulation (Jenkins et al. 2001) and the ΛCDM LasDamas simulation (McBride et al. 2009).

The HVP is a synthetic catalogue of clusters of galaxies that covers one octant of sky, extends over the redshift interval 0.1 < z < 0.43, is comprised of ~106 massive haloes with an average space density of 6×10-4  h3  Mpc-3, and it was simulated using pi = (1, 0, −1, 21, 0, 1). An end-to-end analysis of clusters mock data (HVP) allows us to check for any insidious algorithmic biases that could arise when training a method to “recognize” only a fiducial ΛCDM cosmology via a single class of biased tracer of the matter clustering pattern, namely galaxies. The left-hand panel of Fig. 5 shows that the input value ΩM = 1 is statistically retrieved.

The ΛCDM LasDamas simulations are a set of 40 nearly independent galaxy simulations that cover 120×45 deg2, extend over the redshift interval 0.16 < z < 0.43, are comprised of ~60   000 galaxies each, and have an average space density of ~8.9×10-5  h3  Mpc-3. The mock catalogues are obtained from N-body simulations (whose input cosmological parameters are pi = (0.25, 0.75, −1, 70, 0.0196, 1)) by populating dark matter haloes with galaxies according to a halo occupation distribution function. The structural parameters of this statistic, which describes the probability distribution of the number of galaxies in a halo as a function of the host halo mass, are fixed using SDSS data. These catalogues, which incorporate all the observing selections of the SDSS luminous red galaxies survey, i.e. the real data sample used in our analysis (see Sect. 5), allow us to verify that the specific SDSS observing biases do not spoil our cosmological inferences. Another advantage is that we can forecast the statistical (and systematic) fluctuations in the observable and compare this figure, which includes contribution from cosmic variance, with the uncertainty in η directly estimated from SDSS DR7 data via a block-jackknife technique (see Sect. 5).

The encouraging outcome of the analysis of these SDSS like data is presented in Fig. 5, where we show the cosmological bounds that could be obtained from a redshift survey having a volume 40 times larger than is probed by the SDSS DR7 LRG sample (~1/2 of the volume that will be measured by surveys such as EUCLID, Laureijs et al. 2011; or BigBOSS, Schlegel et al. 2011). This figure shows that one can constrain Ωm precisely (~3.7%) and accurately (the input value is recovered within the 95% c.l.). Whereas a proper analysis of an EUCLID-like simulation is beyond the purposes of the present work, our clustering analysis in a comparable volume is already suggestive of the robustness with which future data will allow constraining the matter density parameter. As Fig. 2 indicates, pushing the technique to its limit, at least on a scale R = 14  h-1  Mpc, only requires incorporating predictions from a non-linear matter power spectrum. Finally, the strong sensitivity to the abundance of matter of the clustering ratio essentially arises because the zero-order spherical Bessel function in Eq. (3) filters different portions of Δ2 when Ωm is changed. If we bias high Ωm, the suppression of power in ξR(r,p) on a scale r is more than what is observed when estimating ξg,R(r,p) using the corresponding wrong distance-redshift relation (see Fig. 3).

5. Cosmological constraints from the SDSS DR7 sample

We now present and discuss cosmological constraints inferred from the analysis of the LRG sample extracted from the SDSS DR7.

The geometry of the subsample that we have analysed is dictated by the need of sampling the galaxy distribution with cells of radius R, as well as of correlating cell counts on scales r. The inferior limit on R is set as to simplify our analysis. Although not mandatory, by framing the analysis in the linear domain, i.e. by choosing an inferior threshold for R, we avoid introducing any phenomenological description of the matter power spectrum. It is true that this choice prevents us from extracting the maximum information from the data; however, it makes the data analysis more transparent. For example, by adopting as a model the quasi linear model (halofit) of Smith et al. (2003) for the non-linear power spectrum, we would be forced to add an extra component, σ8(0), to the vector of unknown parameters p, therefore complicating unnecessarily the likelihood analysis and the marginalization procedures. The choice of R is additionally conditioned by the practical requirement of minimizing the shot noise contribution in each cell, i.e. R > (4πρ/3)−1/3. In this work we adopt the scale R = 14  h-1 Mpc. The lower limit r = 3R on the correlation scale, instead, is set to guarantee optimal accuracy in the approximation (21). At the opposite end, the higher the values of R and r, the less the inferred cosmological predictions are informative. The sample that complies with these constraints covers the redshift interval 0.15 < z < 0.43, has a contiguous sky area of 120×45 deg2, and is comprised of 62   652 LRG (corresponding to 55% of the total number of LRG contained in DR7), with a mean density of 9.2×10-5  h3  Mpc-3.

thumbnail Fig. 6

Upper: variance σg,R = ξg,R(0) of the SDSS LRG sample as a function of the filtering scale R for 2 different sets: the full sample (diamonds) and the subsample containing the brightest 50% (in the r band). Middle: the correlation function ξg,R(3R) of the smoothed galaxy density field is shown as a function of R for the same subsets. Both the variance and the correlation function are estimated in the best-fitting cosmological model. Lower: the amplitude of the galaxy clustering ratio ηg,R(3R) is demonstrated to be the same for these two samples of biased tracers of the large scale distribution of matter. Also shown is the theoretically predicted scaling of the mass clustering ratio (dot-dashed line). This last quantity is computed assuming the best-fitting cosmological parameters pf = (0.28, 0.45, −1.2, 73.8, 0.0213, 0.96) deduced from analysis of the whole LRG sample on the specific scale R = 14  h-1 Mpc. In all panels, error bars are derived from 30 block-jackknife resampling of the data, excluding, each time, a sky area of 12×14 deg2.

thumbnail Fig. 7

Upper panels: galaxy clustering ratio ηg,R(r,ΩmX,w) (dotted line) reconstructed from SDSS DR7 data assuming R = 14  h-1 Mpc and r = 3R, plotted as a function of ΩΛ (left), Ωm (centre) and w (right), after fixing the remaining cosmological parameters, shown in each panel, to the values that minimize the logarithmic posterior (24). Grey shading is the 1-sigma deviation around the mean, and it is computed from 30 block-jackknife resampling of the data. This uncertainty depends on cosmology (because of the redshift-distance conversion model) in a stochastic way (± 20% around the mean, for the cosmologies shown in the picture). We also show the predicted scaling of the mass clustering ratio (solid line) in the corresponding cosmology. The best set of cosmological parameters pf = (0.28, 0.45, −1.2, 73.8, 0.0213, 0.96) is the very one that minimizes the difference between theoretical predictions (line) and actual data (points). For SDSS DR7 data, we find ηg,R(3R,pf) = 0.0502±0.0036 on a scale R = 14  h-1 Mpc. Lower panels: two-dimensional marginalized constraints on a curved XCDM model in which both ΩX and w are allowed to vary. Gaussian priors are taken of Ωbh2 = 0.0213±0.0010, of H0 = 73.8±2.4 km s-1 Mpc-1 and of ns = 0.96±0.014 from BBN (Pettini et al. 2008), HST (Riess et al. 2011) and WMAP7 (Larson et al. 2011) determinations, respectively. Contours are plotted for ℒ/ℒmin < 2.3,6.17.

The R-scaling of the variance σg,R and correlation function ξg,R(nR) of the smoothed galaxy density field is shown in Fig. 6. The amplitude of both these observables is not a universal quantity. In addition to the background cosmological model, it also depends on the particular set of luminous objects used to trace the mass density field. That bright galaxies display stronger rms fluctuations and are more strongly correlated than faint ones as shown in Fig. 6. This same figure also shows that the amplitude of ξg,R is nearly one order of magnitude smaller than (σg,R). The lack of power in the two-point statistic is theoretically expected, since only in the limit for r → 0 does the correlation function of a smoothed field converge to the variance, i.e. ξg,R(0)=σg,R2\hbox{$\xi_{g,R}(0)=\sigma^2_{g,R}$}. Given that in a sample of N cells, the number of independent combinations of two cells is smaller than N, we also expect the two-point statistic to be estimated with larger uncertainties than the corresponding one-point statistic of equal order. For example, the SDSS DR7 sample allows us to extract the value of ξg,R(/σR2)\hbox{$\xi_{g,R} (/\sigma^2_R)$} with a precision of 6.5%  (/ 5%) when the field is smoothed on the scale R = 14  h-1 Mpc. As far as the functional dependence on R is concerned, the scaling of both these statistics is essentially featureless and approximated well by a power law over the interval 12 < R < 20  h-1 Mpc. For a given fixed value of n, the relative loss of power resulting from filtering the field on larger and larger scales R is stronger for ξg,R(nR) than σg,R2\hbox{$\sigma^2_{g,R}$}.

The galaxy clustering ratio ηg,R(nR) estimated in the best-fitting cosmology is shown in Fig. 6. This observable is a universal quantity associated to the mass density field, and, as such, it is independent of the specific biasing properties of the mass tracers adopted in this analysis. The precision with which the clustering ratio is determined from SDSS LRG data (ση/η ~ 7% on a scale R = 14  h-1 Mpc) is estimated from 30 jackknife resampling of the data each time excluding a sky area of 12×14 deg2. This figure is in excellent agreement with the one (~8%) deduced from the analysis of the standard deviation displayed by the 40 SDSS-like simulations LasDamas, which include, by definition, the contribution from cosmic variance. This indicates that η, which is defined as a ratio of equal order statistics and thus contains the same stochastic source, is weakly sensitive to this systematic effect. We note that the relative uncertainty on ηg,R(nR) cannot be simply deduced via standard propagation of errors since measurements of σg,R2\hbox{$\sigma^2_{g,R}$} and ξg,R are positively correlated.

Without fixing either the curvature of the universe (flat prior 0<| Ωk |<1) or the quality of the DE component (flat prior −3/2<w<−1/2), but taking (strong) Gaussian priors of Ωbh2, H0, and ns from big bang nucleosynthesis (BBN, Pettini et al. 2008), Hubble Space Telescope (HST, Riess et al. 2011), and WMAP7 (Larson et al. 2011), respectively, we constrain the local abundance of matter Ωm with a precision of 8% (Ωm = 0.283±0.023, see Fig. 7. For the sake of completeness, by choosing a smoothing scale of 18/22  h-1 Mpc, we obtain Ωm=0.270-0.027+0.037/0.255\hbox{$\Omega_{\rm m}=0.270^{+0.037}_{-0.027}/0.255$}±0.038). This figure improves the precision of the estimate (Ωm = 0.259±0039) by a factor of two obtained by analysing, with the same priors, the BAO of the SDSS DR5 sample (which contains 25% fewer galaxies than DR7) (Eisenstein et al. 2005). It also improves by ~20% the precision of the constraint Ωm=0.24-0.024+0.025\hbox{$\Omega_{\rm m}=0.24^{+0.025}_{-0.024}$} that Percival et al. (2010) obtained by combining BAO results from the DR7 sample with the full likelihood of the WMAP5 data (Dunkley et al. 2009) and a strong HST prior (Riess et al. 2009). The value Ωm = 0.294±0.017 (Anderson et al. 2012) obtained by combining the BAO results from the DR9 sample (containing 5 times more LRGs than the DR7 catalogue) with the full likelihood of CMB data (WMAP7, Larson et al. 2011) is nearly 40% more precise than our estimate.

thumbnail Fig. 8

Two-dimensional marginalized constraints on Ωm and H0 obtained by fitting data with a curved XCDM model. Contours are plotted for ℒ/ℒmin < 2.3,6.17 Gaussian priors are taken of Ωbh2 = 0.0213±0.0010 and ns = 0.96±0.014 from BBN (Pettini et al. 2008) and WMAP7 (Larson et al. 2011) determinations, respectively.

The best-fitting value of Ωm still remains within the quoted 68% contour level (cl), even when we relax some of the strong priors. With a (weak) flat prior on ns (in the interval [0.9,1.1]), we obtain Ωm=0.271-0.031+0.030\hbox{$\Omega_{\rm m}=0.271^{+0.030}_{-0.031}$}, while if we go on to weaken the prior on Ωbh2 (flat in the interval [0,0.03]), we obtain Ωm = 0.255±0.040. Although the best-fitting value of Ωm is weakly sensitive to changes in Ωbh2 and ns, Fig. 8 shows that Ωm degenerates with H0 when the HST prior is removed. If we also relax the strong prior on H0 (by assuming a flat prior in the interval [40,100] km s-1 Mpc-1), we find Ωm=0.22-0.04+0.14\hbox{$\Omega_{\rm m}=0.22^{+0.14}_{-0.04}$}. The stability of the best-fitting central value Ωm is of even more interest if contrasted to CMB results showing that it is the combination Ωmh2 that is insensitive to the prior on the curvature of the universe. On the contrary, if on top of the chosen priors we also impose as constraints that the universe is flat and that dark energy is effectively described by a cosmological constant (i.e. we fix w = −1), we obtain Ωm = 0.275±0.020, an estimate that improves the WMAP7 bound by 50% (0.267±0.029, Larson et al. 2011). For comparison, the recent combination of the Planck temperature power spectrum with WMAP polarization gives Ωm,0 = 0.315±0.017.

By switching from precision to accuracy, it is important to point out that the mass-clustering ratio (cf. Eq. (18)) computed using the best parameters p inferred on a scale R = 14  h-1 Mpc correctly predicts the galaxy clustering ratio observed on various different scales R (see lower panel of Fig. 6). This highlights the overall unbiasedness of our cosmological inference.

The Fisher matrix formalism cannot be reliably applied to reproduce constraints on cosmological parameters that are strongly degenerate. A simple way to make sense and reproduce our results consists of working out an effective measure of the observable ηg,R(r,p). The relevant cosmological information contained in the SDSS DR7 data can be effectively retrieved by approximating the galaxy clustering ratio ηg,R via the fitting formula ηg,R(3R,p)=0.07120.0788xp+0.0981xp20.0538xp3\begin{equation} \eta_{g,R}(3R,{\vec p})\!=\!0.0712\!-\!0.0788 x_p\!+\!0.0981 x^2_p\!-\!0.0538x^3_p \label{appeta} \end{equation}(25)where R = 14  h-1 Mpc, xp = 1 + fp/8.48, and where \begin{eqnarray*} \begin{array}{rl} {\mathrm f}_p\!=\!&\!3w\Omega_{\rm X}\!-\!\Omega_k\!+0.042\left\{w\Omega_{\rm X}[1\!-\!\frac{9}{2}(1+w)w]\!+\!\frac{\Omega_k^2-5\Omega_k}{2}\right\}\\ & +0.29\left\{w\Omega_{\rm X}(14+3w)-3w\Omega_{\rm X}\Omega_k+\frac{\Omega_k^2}{2}-\frac{11}{3}\Omega_k\right\}. \end{array} \end{eqnarray*}This phenomenological formula allows a fast computation of the value of the observable ηg,R in any given cosmological model characterized by 0.1<Ωm<1, 0<ΩX<1, and −1.5<w<−0.5. The left-hand panel of Fig. 3 is obtained by setting w = −1.2 and by taking isocontours of the resulting two-dimensional surface in the plane mX]. By inserting this formula in the likelihood expression (24), and by further assuming ση = 0.004 independently of the cosmological model, one can reproduce the cosmological constraints shown in Fig. 7.

6. Conclusions

The paper presents and illustrates the use of a new cosmic standard of clustering, the galaxy clustering ratio ηg,R(r), which is defined as the ratio of the correlation function to the variance of the galaxy overdensity field smoothed on a scale R. The key result of the analysis, the one that a posteriori justifies by itself the introduction of this statistic, is the demonstration that there is no need to model the complex physics underpinning galaxy formation and evolution processes if we are to deduce the amplitude of the corresponding mass statistics (ηR(r)) from the observed value of the galaxy clustering ratio (ηg,R(r)). Indeed, it is straightforward to show that, on large linear scales, ηg,R(r) = ηR(r). In other words, what you see (baryon clustering) is what you get (dark matter clustering).

Interestingly, the magnitude of the mass clustering ratio ηR(r) on the given fixed scales R and r is a number that is neatly related to the relative amplitude of the real-space mass power spectrum in two distinct bandwidths. This implies that, instead of being characteristic of a specific galaxy sample, the galaxy clustering ratio estimated for the pair of linear scales R and r is a universal quantity that only depends on the initial distribution of matter density fluctuations and on the nature of dark matter.

Many other cosmological observables probe information encoded in second-order mass statistics. The distinctiveness of the galaxy clustering ratio resides primarily in its simplicity. On large linear scales, its amplitude is independent of galaxy biasing, peculiar velocity modelling, linear growth rate of structures, and normalization of the matter power spectrum. The advantage over standard clustering analyses is both practical and conceptual. The clustering ratio provides a way to extract information about the mass power spectrum without the need to estimate the galaxy power spectrum or the correlation function of galaxies. Moreover, that ηg,R(r) is estimated via cell-counting techniques on a given fixed pair of scales (R,r) makes the estimation process and the likelihood analysis computationally fast. There is no need, for example, to compute the sample covariance matrix or to speed up calculations by framing the clustering analysis in a given fiducial cosmological model. Additionally, since results do not follow from the reconstruction of the full shape of the galaxy correlation function ξ(s), the cosmological interpretation is in principle less affected by observational biases that act differentially as a function of scale s. The η formalism is also conceptually transparent. Only a minimum number of physical hypotheses, and no astrophysical nuisance parameters at all, condition the cosmological inference: the clustering ratio can be extracted from data and compared to theoretical predictions or numerical simulations in a fairly straightforward way.

The method’s robustness is thoroughly tested via an end-to-end analysis of numerical simulations of the large scale structure of the universe. We have shown that the galaxy density field can be smoothed and correlated on opportunely chosen scales R and r so that the galaxy clustering ratio effectively measured in simulations and the mass clustering ratio predicted by theory differs, at most, by 0.1% on large linear scales R. This remarkable agreement explains both the precision and the accuracy of the method in retrieving, by means of a blind analysis, the values of the matter density parameter Ωm of various numerical simulations of the large scale structure of the universe.

We have also demonstrated the method with real data. Using the LRG sample extracted from the spectroscopic SDSS DR7 galaxy catalogue, no CMB information, weak (flat) priors on the value of the curvature of the universe (Ωk) and the constant value of the dark energy equation of state (w), we find Ωm=0.271-0.031+0.030\hbox{$\Omega_{\rm m}=0.271^{+0.030}_{-0.031}$}. Since one of the main goals of the paper was to illustrate the intrinsic strengths and limitations of the η-test, we did not present cosmological results from a full joint analysis with CMB data. This analysis, which will allow priors to be lifted and a broader range of cosmological parameters to be constrained including h and ns, is left for a future work.

Of strong interest is the flexibility of the method, which can still be improved along several directions. Owing to its scale-free nature, the precision of the technique could be further improved by adopting a non-linear power spectrum in Eqs. (15) and (16), and by smoothing the over-density field on an even smaller scale R than the one adopted throughout this analysis. There are a few caveats to this approach, though. One must verify that mass fluctuations δR in real and redshift spaces are still approximately proportional, as predicted, for example, by the linear Kaiser model. More importantly, one must properly describe the non-linear redshift space distortions induced by virial motions of galaxies, the so called Finger-of-God effect. As pointed out by Neyrinck (2011), who developed a similar statistic in Fourier space, this is a highly non-linear phenomenon that is expected to be significant on small smoothing scales R. A strategy for overcoming these difficulties and implementing the η test even on such extreme regimes is explained and applied to the VIPERS (Guzzo et al. 2013) high-redshift spectroscopic data by Bel et al. (2014). Alternatively, the method could also be applied to large photometric redshift surveys. In this case, the predictions of Eq. (18) must be statistically corrected to account for the line-of-sight distortions introduced when estimating low-resolution distances from photometry. Finally, this probe enriches the arsenal of methods with which the next generation of redshift surveys such as EUCLID and BigBOSS will hunt for new physics by challenging all sectors of the cosmological model.

Acknowledgments

We acknowledge useful discussions with F. Bernardeau, E. Branchini, E. Gaztañaga, B. Granett, L. Guzzo, Y. Mellier, L. Moscardini, A. Nusser, J. A. Peacock, W. Percival, H. Steigerwald, I. Szapudi, and P. Taxil. C.M. is grateful for support from specific project funding of the Institut Universitaire de France and of the Labex OCEVU.

References

  1. Abazajian, K. N., Adelman-McCarthy, J. K., Agueros, M. A., et al. 2009, ApJS, 182, 543 [NASA ADS] [CrossRef] [Google Scholar]
  2. Alcock, C., & Paczynski, B. 1979, Nature, 281, 358 [NASA ADS] [CrossRef] [Google Scholar]
  3. Amendola, L., Appleby, S., Bacon, D., et al. 2013, Liv. Rev. Rel., 16, 6 [Google Scholar]
  4. Anderson, L., Aubourg, E., Bailey, S., et al. 2012, MNRAS, 427, 3435 [Google Scholar]
  5. Ballinger, W. E., Peacock, J. A., & Heavens, A. F. 1996, MNRAS, 282, 877 [NASA ADS] [CrossRef] [Google Scholar]
  6. Bel, J., & Marinoni, C. 2012, MNRAS, 424, 971 [NASA ADS] [CrossRef] [Google Scholar]
  7. Bel, J., Marinoni, C., Granett, B., et al. (the VIPERS team) 2014, A&A, 563, A37 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Bernardeau, F. 1996, A&A, 312, 11 [NASA ADS] [Google Scholar]
  9. Bernardeau, F., Colombi, S., Gaztañaga, E., & Scoccimarro, R. 2002, Phys. Rep., 367, 1 [NASA ADS] [CrossRef] [EDP Sciences] [MathSciNet] [Google Scholar]
  10. Blake, C., Brough, S., Colless, M., et al. 2012, MNRAS, 425, 405 [NASA ADS] [CrossRef] [Google Scholar]
  11. Clifton, T., Ferreira, P. G., Padilla, A., & Skordis, C. 2012, Phys. Rep., 513, 1 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  12. Copeland, E. J., Sami, M., & Tsujikawa, S. 2006, Int. J. Mod. Phys. D, 15, 1753 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  13. Dunkley, J., Spergel, D. N., Komatsu, E., et al. 2009, ApJS, 180, 306 [NASA ADS] [CrossRef] [Google Scholar]
  14. Eisenstein, D., & Hu, W. 1998, ApJ, 496, 605 [NASA ADS] [CrossRef] [Google Scholar]
  15. Eisenstein, D., Zehavi, I., Hogg, D. W., et al. 2005, ApJ, 633, 560 [NASA ADS] [CrossRef] [Google Scholar]
  16. Feng, J. 2010, ARA&A, 48, 495 [Google Scholar]
  17. Frenk, C. S., & White, S. D. M. 2012, Ann. Phys., 524, 507 [CrossRef] [Google Scholar]
  18. Frieman, J. A., Turner, M. S., & Huterer, D. 2008, ARA&A, 46, 385 [NASA ADS] [CrossRef] [Google Scholar]
  19. Fry, J. N., & Gaztañaga, E. 1993, ApJ, 413, 447 [NASA ADS] [CrossRef] [Google Scholar]
  20. Gaztanaga, E., & Cabre, A. 2009, MNRAS, 399, 1663 [NASA ADS] [CrossRef] [Google Scholar]
  21. Guzzo, L., Pierleoni, M., Meneux, B., et al. 2008, Nature, 451, 541 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  22. Guzzo, L., Scodeggio, M., Garilli, B., et al. 2013, A&A, submitted [arXiv:1303:2623] [Google Scholar]
  23. Jenkins, A., Frenk, C. S., White, S. D. M., et al. 2001, MNRAS, 321, 372 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  24. Kaiser, N. 1987, MNRAS, 227, 1 [NASA ADS] [CrossRef] [Google Scholar]
  25. Komatsu, E., Smith, K. M., Dunkley, J., et al. 2011, ApJS, 192, 18 [NASA ADS] [CrossRef] [Google Scholar]
  26. Larson, D., Dunkley, J., Hinshaw, G., et al. 2011, ApJS, 192, 16 [NASA ADS] [CrossRef] [Google Scholar]
  27. Laureijs, R., Amiaux, J., Arduini, S., et al. 2011 [arXiv:1110.3193] [Google Scholar]
  28. Layser, D. 1956, AJ, 61, 1243 [Google Scholar]
  29. Le Fèvre, O., Vettolani, G., Garilli, B., et al. 2005, A&A, 439, 845 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Marinoni, C., & Buzzi, A. 2010, Nature, 468, 539 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  31. Marinoni, C., Le Fèvre, O., Meneux, B., et al. 2005, A&A, 442, 801 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  32. Marinoni, C., Saintonge, A., Giovanelli, R., et al. 2008a, A&A, 478, 43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. Marinoni, C., Guzzo, L., Cappi, A., et al. 2008b, A&A, 487, 7 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  34. Marinoni, C., Bel, J., & Buzzi, A. 2012, JCAP, 10, 036 [NASA ADS] [CrossRef] [Google Scholar]
  35. Martin, J. 2012, Comptes Rendus Physique, 13, 566 [NASA ADS] [CrossRef] [Google Scholar]
  36. McBride, C., Berlind, A., Scoccimarro, R., et al. 2009, AAS, 21342506 [Google Scholar]
  37. Neyrinck, M. 2011, ApJ, 742, 91 [NASA ADS] [CrossRef] [Google Scholar]
  38. Peebles, P. J. E., & Ratra, B. 2003, Rev. Mod. Phys., 75, 559 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  39. Percival, W. J., Reid, B. A., Eisenstein, D. J., et al. 2010, MNRAS, 401, 2148 [NASA ADS] [CrossRef] [Google Scholar]
  40. Pettini, M., Zych, B., Murphy, M. T., Lewis, A., & Steidel, C. C. 2008, MNRAS, 391, 1499 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  41. Planck Collaboration 2014, A&A, in press [arXiv:1303.5076] [Google Scholar]
  42. Reid, B. A., Samushia, L., White, M., et al. 2012, MNRAS, 426, 2719 [NASA ADS] [CrossRef] [Google Scholar]
  43. Riess, A. G., Macri, L., Casertano, S., et al. 2009, 699, 539 [Google Scholar]
  44. Riess, A., Macri, L., Casertano, S., et al. 2011, ApJ, 730, 119 [NASA ADS] [CrossRef] [Google Scholar]
  45. Sanchez, A. G., Scóccola, C. G., Ross, A. J., et al. 2012, MNRAS, 425, 415 [NASA ADS] [CrossRef] [Google Scholar]
  46. Scherrer, R. J., & Weinberg, D. H. 1998, ApJ, 504, 607 [NASA ADS] [CrossRef] [Google Scholar]
  47. Schlegel, D., Abdalla, F., Abraham, T., et al. 2011 [arXiv:1106.1706] [Google Scholar]
  48. Smith, R. E., Peacock, J. A., Jenkins, A., et al. 2003, MNRAS, 341, 1311 [NASA ADS] [CrossRef] [Google Scholar]
  49. Szapudi, I., & Szalay, A. 1997, ApJ, 481, 1 [NASA ADS] [CrossRef] [Google Scholar]
  50. Szapudi, I., & Szalay, A. 1998, ApJ, 494, 41 [Google Scholar]
  51. Szapudi, I., Szalay, A., & Boschán, P. 1992, ApJ, 390, 350 [NASA ADS] [CrossRef] [Google Scholar]
  52. Weinberg, S. 1989, Rev. Mod. Phys., 61, 1 [NASA ADS] [CrossRef] [MathSciNet] [Google Scholar]
  53. Weinberg, S. 2008, Cosmology (Oxford University Press) [Google Scholar]

All Figures

thumbnail Fig. 1

Redshift-space galaxy clustering ratio ηg,R(nR) (LHS of Eq. (21)) extracted from 160 independent LasDamas simulations of the distribution of luminous red galaxies (LRG, see Sect. 4). The observable ηg,R(nR) is shown as a function of the smoothing radius R for four different values of n as indicated in the key. Simulated observations are compared to the predicted amplitudes of the real-space mass clustering ratio ηR(nR) (RHS of Eq. (21)) for the same values of n. Dot-dashed lines show predictions obtained using a linear power spectrum, while solid lines are derived using the non-linear (halofit) model of Smith et al. (2003). Both the galaxy and mass clustering ratios are derived by adopting the same cosmological parameters of the LasDamas simulations. Error bars represent the uncertainties expected from a redshift survey covering a volume 160 times larger than that explored by the LRG sample used in this study.

In the text
thumbnail Fig. 2

Redshift-space galaxy clustering ratio ηg,R(nR) extracted from the LasDamas simulations (see Sect. 4) using R = 14  h-1 Mpc and n = 3 shown as a function of redshift. Markers represent the average measurement from 40 LasDamas mock catalogues, each of them simulating a redshift survey that has the same radial density profile and observational selection effects as the LRG sample of SDSS DR7. Simulated observations are compared to the predicted amplitudes of the real-space mass clustering ratio ηR(nR) (right-hand side of Eq. (21)). Dashed lines show predictions computed using a linear power spectrum, while solid lines are derived assuming the non-linear (halofit) model of Smith et al. (2003). Both the galaxy and mass clustering ratios are estimated by adopting the same cosmological parameters as the LasDamas simulations. Error bars represent the uncertainty from measurements in a single mock catalogue. They are obtained as the standard deviation of 40 independent measurements, therefore including the contribution due to cosmic variance.

In the text
thumbnail Fig. 3

Left: isocontours of the function ηg,R/ηg,Rbestfit1\hbox{$\eta_{g,R}/\eta_{g,R}^{\rm best fit}-1$} displaying the relative variation in the galaxy clustering ratio (with respect to the best-fitting value deduced from the analysis of SDSS DR7 data) induced by a wrong choice of the distance-redshift conversion model (i.e. a wrong guess of the cosmological parameters mX]). The galaxy clustering ratio is reconstructed on the scales (R,r) = (14,42)  h-1 Mpc, and its expression is approximated well by Eq. (25). The best-fitting cosmology inferred from the analysis of SDSS DR7 data is Ωm = 0.28, ΩX = 0.45, w = −1.2. The isocontours measure the strength of the AP geometric distortions. Right: relative variation in the mass clustering ratio ηR/ηRbestfit1\hbox{$\eta_{R}/\eta_{R}^{\rm best fit}-1$} that results from computing the power spectrum in the wrong cosmology.

In the text
thumbnail Fig. 4

Upper panels: distribution of the galaxy clustering ratio ηg,R(r) obtained by analysing 160 LasDamas mock catalogues simulating the distribution of luminous red galaxies (histograms). The observable ηg,R(r) is computed for the three different smoothing radii R shown on top of each plot and by assuming, furthermore, that r = 3R. The solid line displays the best-fitting Gaussian PDF. Lower panels: the relative deviation between the Gaussian model and the actual distribution of the data is shown.

In the text
thumbnail Fig. 5

Left panel: two-dimensional confidence limits on Ωm and ΩΛ from a “blind” analysis of the τCDM HVP simulation (Jenkins et al. 2001). Contours are plotted for ℒ/ℒmin < 2.3, 6.17 corresponding to 68 and 95 per cent c.l. for a multivariate Gaussian distribution with 2 degrees of freedom. The relevant one-dimensional marginalized constraint is shown in the legend. The clustering ratio ηg,R(r,p) is estimated using R = 14  h-1  Mpc and r = 3R. For the best-fitting cosmology we measure ηg,R(r,pf) = 0.0294±0.0017 where the error is evaluated via 30 jackknife resampling of the data, excluding each time a sky area of 36×19.5 deg2. Dirac delta priors are taken of Ωbh2, H0, and ns, that are centred on the simulated values. Right panel: same as before, but now contours represent the joint likelihood analysis of 40 independent ΛCDM mock catalogues simulating the SDSS LRG sample. For the best-fitting cosmology we find ηg,R(r,pf) = 0.0573±0.0008, where the error, computed as the s.d. of the mean of 40 measurements, includes the contribution from cosmic variance.

In the text
thumbnail Fig. 6

Upper: variance σg,R = ξg,R(0) of the SDSS LRG sample as a function of the filtering scale R for 2 different sets: the full sample (diamonds) and the subsample containing the brightest 50% (in the r band). Middle: the correlation function ξg,R(3R) of the smoothed galaxy density field is shown as a function of R for the same subsets. Both the variance and the correlation function are estimated in the best-fitting cosmological model. Lower: the amplitude of the galaxy clustering ratio ηg,R(3R) is demonstrated to be the same for these two samples of biased tracers of the large scale distribution of matter. Also shown is the theoretically predicted scaling of the mass clustering ratio (dot-dashed line). This last quantity is computed assuming the best-fitting cosmological parameters pf = (0.28, 0.45, −1.2, 73.8, 0.0213, 0.96) deduced from analysis of the whole LRG sample on the specific scale R = 14  h-1 Mpc. In all panels, error bars are derived from 30 block-jackknife resampling of the data, excluding, each time, a sky area of 12×14 deg2.

In the text
thumbnail Fig. 7

Upper panels: galaxy clustering ratio ηg,R(r,ΩmX,w) (dotted line) reconstructed from SDSS DR7 data assuming R = 14  h-1 Mpc and r = 3R, plotted as a function of ΩΛ (left), Ωm (centre) and w (right), after fixing the remaining cosmological parameters, shown in each panel, to the values that minimize the logarithmic posterior (24). Grey shading is the 1-sigma deviation around the mean, and it is computed from 30 block-jackknife resampling of the data. This uncertainty depends on cosmology (because of the redshift-distance conversion model) in a stochastic way (± 20% around the mean, for the cosmologies shown in the picture). We also show the predicted scaling of the mass clustering ratio (solid line) in the corresponding cosmology. The best set of cosmological parameters pf = (0.28, 0.45, −1.2, 73.8, 0.0213, 0.96) is the very one that minimizes the difference between theoretical predictions (line) and actual data (points). For SDSS DR7 data, we find ηg,R(3R,pf) = 0.0502±0.0036 on a scale R = 14  h-1 Mpc. Lower panels: two-dimensional marginalized constraints on a curved XCDM model in which both ΩX and w are allowed to vary. Gaussian priors are taken of Ωbh2 = 0.0213±0.0010, of H0 = 73.8±2.4 km s-1 Mpc-1 and of ns = 0.96±0.014 from BBN (Pettini et al. 2008), HST (Riess et al. 2011) and WMAP7 (Larson et al. 2011) determinations, respectively. Contours are plotted for ℒ/ℒmin < 2.3,6.17.

In the text
thumbnail Fig. 8

Two-dimensional marginalized constraints on Ωm and H0 obtained by fitting data with a curved XCDM model. Contours are plotted for ℒ/ℒmin < 2.3,6.17 Gaussian priors are taken of Ωbh2 = 0.0213±0.0010 and ns = 0.96±0.014 from BBN (Pettini et al. 2008) and WMAP7 (Larson et al. 2011) determinations, respectively.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.