A&A 465, 23-33 (2007)
DOI: 10.1051/0004-6361:20065321
F. Sylos Labini1,2 - N. L. Vasilyev3 - Y. V. Baryshev3
1 - "Enrico Fermi Center'', via Panisperna 89 A, Compendio del Viminale, 00184 Rome, Italy
2 - "Istituto dei Sistemi Complessi'' CNR, via dei Taurini 19, 00185 Rome, Italy
3 -
Institute of Astronomy, St. Petersburg State University, Staryj Peterhoff, 198504 St. Petersburg, Russia
Received 30 March 2006 / Accepted 18 October 2006
Abstract
We discuss the estimation of galaxy correlation properties in several
volume limited samples, in different sky regions, obtained from the
Fourth Data Release of the Sloan Digital Sky Survey. The small scale
properties are characterized through the determination of the nearest
neighbor probability distribution. By using a very conservative
statistical analysis, in the range of scales [0.5,
30] Mpc/h we
detect power-law correlations in the conditional density in redshift
space, with an exponent
0.1. This behavior is stable
in all the different samples we considered; thus it does not depend on
galaxy luminosity. In the range of scales [
30,
100] Mpc/hwe find evidence for systematic unaveraged fluctuations and we
discuss in detail the problems induced by finite volume effects on the
determination of the conditional density. We conclude that in such a range of scales there is evidence for a smaller power-law index of
the conditional density. However we cannot distinguish between two possibilities: (i) that a crossover to homogeneity (corresponding to
in the conditional density) occurs before 100 Mpc/h; (ii) that correlations extend to scales of order 100 Mpc/h (with a smaller
exponent
). We emphasize that galaxy distributions in
these samples present large fluctuations at the largest scales probed,
corresponding to the presence of large scale structures extending up
to the boundaries of the present survey. We discuss several
differences between the behavior of the conditional density in mock
galaxy catalogs built from cosmological N-body simulations and real
data. We discuss some theoretical implications of such differences
considering also the super-homogeneous features of primordial density
fields.
Key words: cosmology: observations - large-scale structure of Universe
A major problem in modern cosmology is the statistical characterization and the physical understanding of large scale galaxy structures. The first question in this context concerns the studies of galaxy correlation properties. Two-point properties are particularly useful to determine correlations and their spatial extension. There are different ways of measuring two-point properties and, in general, the most suitable method depends on the type of correlation, strong or weak, characterizing a given point distribution in a sample.
For example, Hogg et al. (2005) recently measured the conditional average density in a sample of Luminous Red Galaxies (LRG) from a data release of the Sloan Digital Sky Survey (SDSS). Such a statistic is very useful to determine correlation properties in the regime of strong clustering and the spatial extension of strong fluctuations in a given sample. This was firstly introduced by Pietronero (1987) and then measured in many samples by Sylos Labini et al. (1998). We refer the reader to Baryshev & Teerikorpi (2006) for a review of the measurements of the reduced and complete correlation functions by different authors in the various angular and three-dimensional samples.
The conditional density gives the average density of points in a spherical volume (or a spherical shell) centered around a galaxy (see Gabrielli et al. 2004, for a discussion of this method). The results obtained by Hogg et al. (2005) can be summarized as follows:
The paper is organized as follows. In Sect. 2 we describe the data and the way we have constructed the VL samples. We also discuss the determination of the nearest neighbor (NN) distribution, and of the average distance between nearest galaxies, which allows us to define the lower cut-off for the studies of correlations. In addition we discuss the determination of the radial counts in different VL samples, emphasizing that large variations for this quantity are found in the different samples. Such fluctuations, which seem to be persistent up to the sample boundaries, correspond to the large scale structures observed in these catalogs. The quantitative characterization of the correlation properties of these fluctuations is presented in Sect. 3, where we discuss the determination of the conditional average density in the different VL samples. In particular we present several tests useful to clarify the effect of systematic fluctuations at scales of the order of the sample size.
In Sect. 4 we discuss the differences between the galaxy conditional density, measured in these samples and the conditional density of point-particles in cosmological N-body simulations. We show that by using these statistics, together with a study of the NN probability distribution, two-point properties of observed galaxies of different luminosity and mock galaxy catalogs constructed using particles lying in region with different local density in cosmological N-body simulations, present different behaviors. In Sect. 5 we draw our main conclusions.
The SDSS (http://www.sdss.org) is currently the largest spectroscopic survey of extragalactic objects and one of the most ambitious observational programs ever undertaken in astronomy. It will measure about 1 million redshifts, giving a complete mapping of the local universe up to a depth of several hundreds of Mpc. In this paper we consider the data from the latest public data release (SDSS DR4) which is accessible at http://www.sdss.org/dr4 (Adelman-McCarthy et al. 2005) containing redshifts for more than 565 thousand galaxies and 67 thousand quasars. There are two independent parts of the galaxy survey in the SDSS: the main galaxy sample and the LRG sample. Here we discuss the former only. The spectroscopic survey covers an area of 4783 square degrees of the celestial sphere. The apparent magnitude limit for the galaxies is 17.77 in the r-filter and photometry for each galaxy is available in five different bands, of which we consider the ones in the r and g filters.
We have used the following criteria to query the SDSS DR4
database. We constrain the flags indicating the type of
object so that we select only the objects from the main galaxy sample.
We then consider galaxies in the redshift interval
and with the redshift confidence parameter larger than 0.95. In addition we apply the filtering condition r < 17.77,
thus taking into account the target magnitude limit for the main
galaxy sample in the SDSS DR4. Thus we have selected 321 516 objects.
The angular coverage of the survey is not uniform but observations
have been done in different sky regions. For this reason we have
considered three rectangular angular fields (named R1, R2 and R3) in
the SDSS internal angular coordinates
:
in such a way
we do not have to consider the irregular boundaries of the survey
mask, as we have cut such boundaries to avoid uneven edges of observed
regions. In Table 1 we report the
parameters of the three angular regions considered. We do not use
corrections for the redshift completeness mask or for fiber collision
effects. Completeness varies most near the current survey edges which
are excluded in our samples. Fiber collisions in general do not
present a problem for measurements of galaxy correlations (see
discussion in, e.g., Strauss et al. 2002).
Table 1:
Main properties of the angular regions considered:
The limits in degrees of the cuts are chosen using the intrinsic
coordinates of the survey
and
(in degrees). The last column
gives the solid angle of three angular regions in steradians.
To construct VL samples that are unbiased for the selection effect
related to the cut in the apparent magnitude, we have applied a standard procedure (see e.g. Zehavi et al. 2004): we compute metric distances as
We use Petrosian apparent magnitudes in the r filter mr which are
corrected for galactic absorption. The absolute magnitudes can be
computed as
We have considered 4 different VL samples (named VL1, VL2, VL3 and VL4) defined by two chosen limits in absolute magnitude and metric distance, whose parameters are reported in
Table 2. While VL1 and VL2 contain
relatively faint galaxies in the local universe, the VL3 sample covers
a wide range of distances, and VL4 consists of bright galaxies at
distances up to 600 Mpc/h. Considering the three different rectangular
areas (described above), we have 4
3 = 12 VL subsamples, whose
characteristics are reported in Table 3.
The comparison between VL samples with the same magnitude and distance
cuts, in different sky regions, will allow us to test the statistical
stationarity of galaxy distributions in these samples and to estimate
sample-to-sample fluctuations.
Table 2:
Main properties of the obtained VL samples:
,
(in Mpc/h) are the chosen limits for the metric
distance;
define the interval for the absolute
magnitude in each sample. The quantity
(in Mpc/h) is the average distance between nearest-neighbor galaxies.
Table 3:
Number of galaxies in each of the VL samples.
Names are given according to the discussion in the text. The scale
(in Mpc/h) is discussed in Sect. 3.2 below.
The NN distance probability distribution depends on the cut
in absolute magnitude of a given VL sample. We expect this function
not to be dependent on the angular sky cuts if the distribution is
statistically stationary in the different VL samples. As discussed in
Vasilyev et al. (2006) space correlations introduce
a deviation from the case of a pure Poisson distribution: the average distance
between NN is expected to be smaller than for the
Poisson case in the same sample and with the same number of
points. The measurements in the data, obtained by simple
pair-counting, are shown in Figs. 1-4. When a VL sample includes fainter galaxies (e.g. VL1,VL2)
is smaller (see Table 2) than for the case when only
brighter galaxies are inside (e.g. VL3, VL4). This is because brighter
galaxies are sparser than fainter ones. This corresponds to the
exponential decay of the galaxy luminosity function at the bright end
(see discussion in Gabrielli et al. 2004)
![]() |
Figure 1:
Nearest Neighbor distribution in the VL1 sample: different symbols
correspond to different angular regions. The average distance between
nearest galaxies is
|
| Open with DEXTER | |
![]() |
Figure 2:
As Fig. 1 but for the
VL2 samples. The average distance between galaxies is
|
| Open with DEXTER | |
![]() |
Figure 3:
As Fig. 1 but for the
VL3 samples. The average distance between nearest galaxies is
|
| Open with DEXTER | |
![]() |
Figure 4:
As Fig. 1 but for the VL4 samples.The average distance
between nearest galaxies is
|
| Open with DEXTER | |
Zehavi et al. (2004) have estimated that at a scale of the order of
Mpc/h there is a departure from a power law behavior in the
reduced correlation function. In the light of the discussion above we
stress that this change occurs over a range of scales where NN correlations are dominant in all samples considered. For the
interpretation of this behavior one may consider the relation between
the conditional density, or the reduced correlation function, and the
NN probability distribution (see Baertschiger & Sylos Labini 2004, for
a discussion of this point). In this respect, in the comparison of
galaxy data with N-body simulations, one has to be careful in that
these small-scale properties can be determined by sampling, sparseness
and other more subtle finite size effects related to the precision of
a given N-body simulation (Baertschiger & Sylos Labini 2004).
We have then studied the effect of the fiber collisions on the NN statistic: about
of galaxies that satisfy the selection criteria
of the main galaxy sample are not observed because they have a companion closer than the 55 arcsec minimum separation of spectroscopic fibers (Strauss et al. 2002). However not all
55-arcsec pairs are affected by fiber collisions, because some of the
SDSS were observed spectroscopically more than once. We have
identified all <=55 arcsec pairs for which both galaxies have
redshifts, and we have randomly removed one of those redshifts in each
case to make a new sample with an even more severe fiber collision
problem than the existing sample. Because of the very small number of
galaxy pairs with angular separation <=55 arcsec (of the order of
a few percent in all the volume limited samples we have considered)
there is no noticeable effect of the results. For galaxies in the main
sample the average redshift
,
and hence the angular
distance 55 arcsec corresponds to the linear separation
Mpc/h which is marginally outside the scale interval in which we
have studied the NN distribution, i.e. r>0.2 Mpc/h. Hence we expect
that the fiber collision effect does not influence our results as
indeed we find.
A simple statistic a value of that can be easily computed in VL samples is the differential number counts. This gives us a first indication about (i) the slope of the counts; and (ii) the nature of
fluctuations (see e.g. Gabrielli et al. 2004). In general we may
write that the number of points counted from a given point chosen as
the origin (in this case the Earth) grows as
![]() |
(4) |
![]() |
Figure 5: Differential number counts as a function of distance in the VL1 sample in different angular regions normalized to their own solid angle. |
| Open with DEXTER | |
![]() |
Figure 6: The same as Fig. 5 but for the VL2 samples. |
| Open with DEXTER | |
Given that a VL sample is defined by two cuts in distance we compute
Results in the samples considered are shown in Figs. 5-8, where for each sample we have normalized the counts to the solid angle of the corresponding angular region. The best fit exponent (reported in the figures) fluctuates, and in several cases it is larger than 2. This means that there are large fluctuations as revealed by the non-smooth behaviors of n(r)in the different samples. Similar evidence for the effect of large scale structures in these samples on other statistical quantities has been pointed out by Nichol et al. (2006).
This is a first rough indication that the question of uniformity at scales of order 100 Mpc/h is not simple to resolve in these samples. These large fluctuations in slope and amplitude correspond to the presence of large scale galaxy structures extending up to the boundaries of the various samples considered. We do not present a more quantitative discussion of these behaviors as the statistics are rather weak.
![]() |
Figure 7: The same as Fig. 5 but for the VL3 samples. |
| Open with DEXTER | |
![]() |
Figure 8: The same as Fig. 5 but for the VL4 samples. |
| Open with DEXTER | |
We now study the behavior of the conditional average density in the
various VL samples discussed in the previous section. We use the
full-shell estimator, discussed extensively in Gabrielli et al. (2004)
and in Vasilyev et al. (2006). This estimator has
the advantage of making no assumptions in the treatment of boundary
conditions and it is the most conservative among estimators of
two-pint correlations (see discussion in Kerscher 1999). The
conditional density in spheres
is defined
for an ensemble of realizations of a given point process, as
This full-shell estimator has an important constraint: it is
measured only in spherical volumes fully included in the sample
volume. In this situation the number of centers
over which
the average Eq. (7) is performed becomes strongly
dependent on the scale r when
,
being the
sample size. In this context such a length scale can be defined as the
radius of the largest sphere fully included in the sample volume: the
center of such a sphere lies in the middle of the sample volume.
![]() |
Figure 9:
Conditional density in spheres in the VL1 sample in the angular region R1, R2, R3. Here and in Figs. 10-12 we report, for each sample, a vertical line corresponding to the distance scale |
| Open with DEXTER | |
Thus, when approaching the scale
there are two sources of
fluctuations which increase the variance of the measurements. On the
one hand the number of points over which the average is performed
decreases very rapidly and on the other hand the remaining points are
concentrated toward the center of the sample. In such a way systematic
fluctuations may affect the estimation, given that these are not
averaged out by the volume average. An estimation of the scale beyond
which systematic effects become strong is thus important.
The following subsection discusses the measurements of
in the different VL samples, while Sect. 3.2 is devoted
to the problem of the determination of the maximum scale up to which
the volume average is properly performed, and thus beyond which
systematic unaveraged fluctuations may affect the behavior of the
conditional density.
The results of the measurements in redshift space of the conditional
density by the full-shell estimator, in VL samples with the same cuts
in absolute magnitude and distance but in different angular regions,
are reported in Figs. 9-12. The formal
statistical error, reported in the figures, for the determination of
at each scale, can be derived from the dispersion of the average
One may note the following behaviors:
![]() |
(9) |
![]() |
Figure 10: As for Fig. 9 but for the VL2 samples. |
| Open with DEXTER | |
Thus the correlation properties are independent of galaxy luminosity
and they are characterized by a power-law index in the behavior of
the conditional density
0.1 up to 30 Mpc/h. At
larger scales, as shown for example in the two samples R1VL4 and R2VL4 the situation is less clear: fluctuations are more important because they are not smoothed out by the volume average. In the next subsection we define the range where the volume average is properly
performed.
![]() |
Figure 11: As for Fig. 9 but for the VL3 samples. |
| Open with DEXTER | |
![]() |
Figure 12: As for Fig. 9 but for the VL4 samples. |
| Open with DEXTER | |
In order to quantify the finite volume effects previously mentioned,
we have divided each of the VL samples of the R1 field into two non-overlapping contiguous angular regions, and we have recomputed the
conditional density in each of the 2
4 samples. The
properties of these subsamples are listed in Table 4. In
Figs. 13-16 we show the results.
Table 4:
Main properties of the different subsamples considered in the R1 region.
The angular limits of the cuts in the intrinsic coordinates of the
survey
and
(in degrees). The last column gives the
number of points in the sample.
![]() |
Figure 13:
Conditional density in spheres in the R1VL1 sample
and in the 2 subsamples defined by the angular cut performed as
discussed in the text. The lines labeled with |
| Open with DEXTER | |
![]() |
Figure 14: As Fig. 13 but for the R1VL2 sample. |
| Open with DEXTER | |
![]() |
Figure 15: As Fig. 13 but for the R1VL3 sample. |
| Open with DEXTER | |
![]() |
Figure 16: As Fig. 13 but for the R1VL4 sample. |
| Open with DEXTER | |
As already mentioned the average computed by Eq. (7) is
made by changing, at each scale r, the number
of points
which contribute. This scale-dependency follows from the requirement
that only those points are chosen for which, when chosen as centers of
a sphere of radius r, the volume does not overlap or intersect the
boundaries of the sample. In this way, in a sample of size
,
when
almost all points will contribute to the average,
while when
only those points lying close to the
center of the volume will be taken into account in the average. Hence
at large scales the average is performed on a number of points that
exponentially decay when
.
In
Figs. 13-16 we show the behavior of the
number of centers
as a function of scale, normalized to an arbitrary factor. The normalization is simple because at small scales
where N is the number of points contained in a given VL sample: at such small scales all points contribute to the statistics.
At a scale comparable with, but smaller than, the sample size there is
an abrupt decay of this quantity: this means that only few points
contribute to the average at large scales.
That systematic fluctuations are more important than statistical ones can be noticed from the behavior of the conditional density in Figs. 13-16 by comparing the behaviors in the original sample (e.g. R1VL1) and in the two separate subsamples (e.g. R1_1VL1 and R1_2VL1). When the distance scale approaches the boundaries of the samples there are systematic variations that are larger than the (small) error bars derived from Eq. (8). As already mentioned, in some cases there is evidence for a more flatter behavior while in other cases instead the conditional density show a decay up to the sample boundaries which is slower than at smaller scales. This situation suggests caution in the interpretation of the large scale tail of the conditional density. The question is how to quantify the regime where systematic fluctuations are important and may affect the behavior of the conditional density.
One may define a criterion for the statistical robustness of the
volume average by imposing for example that
must be larger
than a certain value. While this can certainly give a useful
indication, the problem of the volume average is more subtle. In fact
when
there can be sufficient points for
to be larger than a given pre-defined value: however it may
happen that all these points lie, for example, in a cluster located
close to the sample center. In this situation the volume average is
not properly performed, in the sense that all points "see'' almost
the same volume.
A way to clarify such a situation has been proposed by Joyce et al. (1999). One may compute the average distance between the
centers at the scale r:
![]() |
(10) |
| (11) |
![]() |
Figure 17: Conditional density in spheres in the R1VL3 and R1VL4 samples, normalized to have the same amplitude at 1 Mpc/h. The large scale behavior (r> 30 Mpc/h) is different due to the effect of systematic fluctuations. |
| Open with DEXTER | |
However we note that there is enough evidence that the signal is
smoother on scales >40 Mpc/h and that sample-to-sample fluctuations
or the variations in radial counts (discussed in Sect. 2) are
smaller, thus indicating a tendency toward a more uniform
distribution. However these data do not unambiguously support a clear
evidence in favor of homogeneity at scales of the order of 70 Mpc/h,
as Hogg et al. (2005) found by analyzing the LRG sample, because the
change in correlation properties occurs at scales comparable to the
scales
and
.
We conclude that these data support a change
of slope, with a clear tendency for
,
but with an undefined value.
These tests indicate that the availability of larger samples,
provided, for example, by DR5, will allow one to understand these
systematic variations. To study scales of the order of 100 Mpc/h,
samples with
300 Mpc/h are needed. However the full SDSS data will provide us with such large and complete catalogs.
Gravitational clustering in the regime of strong fluctuations is usually studied through gravitational N-body simulations. The particles are not meant to describe galaxies but collisionless dark-matter mass tracers. During gravitational evolution complex non-linear dynamics make non-linear structures at small scales, while at large scales a linear amplification occurs according to linear perturbation theory. Thus, while on large scales correlation properties do not change from the beginning - except a simple linear scaling of amplitudes - at small scales non-linear correlations occur. Typically in these simulations non-linear clustering is formed up to scales of the order of a few Mpc.
![]() |
Figure 18:
Conditional density for the four samples of
points selected in the simulation: the original dark matter (DM)
field, all "galaxies'' (ALL), blue galaxies (BLUE) and red galaxies
(RED). The conditional density for dark matter particles (DM) has been
normalized arbitrarily. The reference dashed-dotted line has a slope
|
| Open with DEXTER | |
At late times one can identify subsamples of points that trace the high density regions, and these would represent the sites for galaxy formation, whose statistical properties are ultimately compared with the ones found in galaxy samples.
In order to study this problem we consider the GIF galaxy catalog (Kauffmann et al. 1999) constructed from a
CDM simulation run by the Virgo consortium (Jenkins et al. 1998). This is done firstly identifying the halos, which represent almost spherical structures with a power-law
density profile from their center. The number of galaxies belonging to
each halo is set proportional to the total number of points belonging
to the halo to a certain power. This procedure identifies points
lying in high density regions of the dark-matter particles. One may
assign to each point a luminosity and a color on the basis of a certain criterion which is not relevant for what follows (see Sheth et al. 2001, and reference therein).
The resulting catalog is divided into two subsamples based on
"galaxy'' color B-I as in Sheth et al. (2001): (brighter) red
galaxies (for which B-I is redder than 1.8) and (fainter) blue
galaxies (B-I bluer than 1.8).
In summary four samples of points may be considered: (i) the original dark matter particles with N = 2563 particles; (ii) all galaxies with N = 15 445; (iii) blue galaxies with N = 11 023; and (iv) red galaxies with N = 4422.
In order to understand the correlation properties in the sampled point distributions it is useful to study the behavior of the conditional density which, as already discussed, has a straightforward interpretation in terms of correlations; results are shown in Fig. 18. The red galaxies are responsible for the strong correlations observed in the full sample as the conditional density is almost the same as for all galaxies at small scales. At large scales there is instead a fast decrease as the sample average of red galaxies is smaller than the one of all galaxies (there are fewer objects). For red galaxies the sampling is local, i.e. their conditional density is (almost) invariant at small scales. Clearly, as there are globally less objects, the sample density of red galaxies is smaller than that of all galaxies. On the other hand blue galaxies present only some residual correlations at small scales, and they are more numerous than red galaxies.
The small scale properties of these distributions can be studied by analyzing the NN probability distribution (see Fig. 19). Blue galaxies have a bell-shaped distribution, typical of the case where correlation are very weak. Instead red and all galaxies present almost the same function, with a long small-scale tail, which is a typical feature indicating the presence of strong two-point correlations (see discussion in Baertschiger & Sylos Labini 2002). This situation is different from the one detected in the samples of DR4 as shown in Figs. 1-4, where the NN probability distribution has the same shape for all samples considered.
![]() |
Figure 19: Nearest neighbors probability distribution for three point sets selected in the simulation (see discussion in the text): all "galaxies'' (ALL), blue galaxies (BLUE) and red galaxies (RED). |
| Open with DEXTER | |
The main points are the following:
Note that the data are analyzed in redshift space and the simulations in real space. However given that velocities are typically smaller than 500 km s-1 the difference between real and redshift space cannot be accounted by the effects of peculiar velocities on scales larger than 5 Mpc/h. The problem of the relation between real and redshift space, considering the finite size effects present when strong correlations characterize the data, has been discussed in Vasilyev et al. (2006).
Our main results are the following:
We do not confirm the results of Zehavi et al. (2004) who found a departure from a power-law in the galaxy correlation function at a scale of the order of 1 Mpc/h: their analysis was performed in real space while ours is in redshift space. In this range of scale nearest-neighbor correlations dominate the behavior of the conditional density and thus also of the reduced correlation function and for a detailed understanding of this regime a study of the nearest-neighbor is necessary.
We do not find a luminosity or color dependence of the galaxy the
conditional density in the regime of strongly non-linear correlations.
In this respect Zehavi et al. (2005) have considered the behavior of
the reduced two-point correlation function, and concluded that there
is a color (luminosity) dependence of galaxy correlations. This
apparent disagreement can be understood by considering that the
reduced two-point correlation function can be strongly affected by
finite-size effects in the regime where the conditional density
presents power-law correlations (see discussion, e.g., in Joyce et al. 2005). Moreover results by Zehavi et al. (2005) have been obtained in real space: in Vasilyev et al. (2006) we discussed the kind of finite size effects which perturb the estimation of
when the conditional density
has power-law correlations.
Thus in the range
Mpc/h we find
evidence for a more uniform distribution and hence a smaller power law
index (
)
in the conditional density. This is a stable
result in all samples considered. However a detailed analysis of the
behavior of the conditional density in all samples does not allow us
to conclude either that there is definitive crossover to homogeneity
at a scales of order 70 Mpc/h as Hogg et al. (2005) have concluded by
considering the LRG sample, or that there is a change of power-law
index beyond 30 Mpc/h which remains stable up to the samples limit, i.e. up
to 100 Mpc/h. Both possibilities are still open and will be clarified
by forthcoming data releases of SDSS as the solid angle will increase.
We discuss our results in relation to theoretical models of
fluctuations in standard cosmologies. It has been shown (see
e.g. Gabrielli et al. 2004) that the only feature of the primordial
correlations, defined in theoretical models like the cold dark matter
(CDM) one, that can be detected in galaxy data is represented by the
large scale tail of the reduced correlation function. In
terms of correlation function
,
CDM models presents the
following behavior: it is positive at small scales, it crosses zero at
a certain scale and then it is negative approaching zero with a tail
which goes as r-4 in the region corresponding to
(see e.g. Gabrielli et al. 2004). The super-homogeneity (or
Harrison-Zeldovich) condition says that the volume integral over all
space of the correlation function is zero
![]() |
(12) |
In standard models of structure formation, galaxies result from a sampling of the underlying CDM density field: for instance one
selects only the highest fluctuations of the field that would
represent the locations where a galaxy will eventually form. It has been
shown that sampling a super-homogeneous fluctuation field changes the
nature of the correlations (Durrer et al. 2003). The reason for this can
be found in the property of super-homogeneity of such a distribution:
the sampling necessarily destroys the surface nature of the
fluctuations, as it introduces a volume (Poisson-like) term in the
mass fluctuations, giving rise to a Poisson-like PS on large scales
const. The "primordial'' form of the PS is thus not
apparent in that which one would expect to measure from objects
selected in this way. This conclusion should hold for any generic
model of bias and its size has to be established in
any given model (Durrer et al. 2003).
On the other hand, one may show (Durrer et al. 2003) that the negative r-4 tail in the correlation function does not change with
sampling: on large enough scales, where in these models (anti)
correlations are small enough, the biased fluctuation field has a correlation function that is linearly amplified with respect to the
underlying dark matter correlation function. For this reason the
detection of such a negative tail would be the main confirmation of
models of a primordial density field. This will be possible if firstly a clear determination of the homogeneity scale is obtained, and
then if the data are statistically robust enough to allow the
determination of the correlation when it is
.
While
Eiseinstein et al. (2005) claimed to have measured that
at scales of order 100 Mpc/h in a sample of SDSS LRG galaxies, here we cannot confirm these results as our analysis does
not extend to such large scales with robust statistics. However
from the large fluctuations observed, for example in the behavior of
the radial counts and in sample-to-sample variations of the
conditional density at such large scales, we conclude that this result
needs more studies, and perhaps much larger samples, to be confirmed.
Acknowledgements
We thank Andrea Gabrielli, Michael Joyce and Luciano Pietronero for useful discussions and comments. Yu.V.B. and N.L.V. thank the "Istituto dei Sistemi Complessi'' (CNR, Rome, Italy) for the kind hospitality during the writing of this paper. FSL acknowledges the EC grant No. 517588 "Statistical Physics for Cosmic Structures'' and the MIUR-PRIN05 project on "Dynamics and Thermodynamics of systems with long range interactions'' for financial supports. Yu.V.B and N.L.V acknowledge the partial financial support by Russian Federation grants NSh-8542.2006.2 and RNP.2.1.1.2852.