Into the third dimension: stochastic measurements of Stokes parameters within the Poincaré sphere

Jason L. Quinn

doi:10.1051/0004-6361/201423921

Home

All issues

Volume 571 (November 2014)

A&A, 571 (2014) A89

Full HTML

Free Access

Issue		A&A Volume 571, November 2014


Article Number		A89
Number of page(s)		8
Section		Astronomical instrumentation
DOI		https://doi.org/10.1051/0004-6361/201423921
Published online		14 November 2014

A&A 571, A89 (2014)

Into the third dimension: stochastic measurements of Stokes parameters within the Poincaré sphere

Jason L. Quinn

Ul. Sępia 14, 04-512 Warszawa, Poland
e-mail: jason.lee.quinn@gmail.com

Received: 1 April 2014
Accepted: 29 September 2014

Abstract

Inspired by recent use of polarimetry to study the Cosmic Microwave Background and extragalatic supernovae, a foray into the statistical properties of Stokes parameters expressed in spherical coordinates is began, allowing circular polarization and linear polarization to be treated in a unified manner. The use of spherical coordinates is quite necessary as it permits a Stokes polarization state to be expressed in terms of the customary polarization angles and degree of polarization usually needed for human interpretation. As shall be demonstrated, circular and linear polarization are not statistically independent quantities but intertwined in a way that is especially important, for instance, at low signal-to-noise. New distributions, classical estimators, and marginalizations are presented for this “three-dimensional” polarization problem including a generalization of the Rice distribution. The paper concludes with discussion regarding the potential pitfalls of a lower dimensional analysis.

Key words: polarization / methods: data analysis / methods: statistical

© ESO, 2014

1. Introduction

Astronomical polarimetry and spectropolarimetry, which have historically been under-utilized, are now routinely being used in many experiments to search for and test new physics. Besides the diverse application of polarimetric techniques in solar and stellar astronomy, two areas of particular note are polarimetric observations of the cosmic microwave background (CMB) and spectropolarimetric observations of supernovae (SNe).

Detection of inflation signatures on the CMB using polarimetric imaging were recently announced by the BICEP2 team (Ade et al. 2014b,a). This result would confirm one of astrophysics most cherished but experimentally elusive theories. After a review of the public BICEP2 data, some have questioned the interpretation of the BICEP2 data and suggested that the foreground dust contamination had been underestimated or that the signal could have been produced by other mechanisms such as “radio loops” (Liu et al. 2014). The BICEP2 team have stood by their results although they have moderated their language in the peer-reviewed version of their paper (Ade et al. 2014b). If valid, the BICEP2 discovery continues the trend set by all-sky mapping satellites like COBE, WMAP, and Planck and various ground and balloon surveys of the CMB being perhaps the single most fruitful source to test cosmological theory and sometimes even fundamental physics. Further results related to CMB polarization are expected in the near future from BICEP3 (Kuo & BICEP3 and POLAR1 Collaborations 2013), the Keck Array (Staniszewski et al. 2012), Polarbear (The POLARBEAR Collaboration 2014), and the Planck team (Planck Collaboration XIX 2014). It is clear that the CMB will continue to be a vital source of cutting edge science; but, as the controversy around the BICEP2 announcement indicates, polarization work is tricky and extreme care must be made to handle properly statistical and systematic error in the data.

Other research groups have been using spectropolarimetric observations to investigate the systematic behavior and statistical variation of thermonuclear (Type Ia) and core-collapse SNe (all other types) arising from asymmetries. (The Wang & Wheeler 2008 Annual Review is a good introduction.) Spectropolarimetry is the only known way to probe asymmetries of unresolved sources such as extra-galactic supernovae so it will continue to be an effective research tool. The detection of asymmetries has helped place much needed constraints on SNe explosion physics/progenitor scenarios (recent examples include Maund et al. 2010b; Tanaka et al. 2012; Patat et al. 2012; Zelaya et al. 2013; Maund et al. 2013) and environments (Mauerhan et al. 2014). Asymmetries of Type Ia SNe are of particular interest as high-redshift events were famously used to discover cosmic acceleration (Riess et al. 1998; Perlmutter et al. 1999), a result for which leads of two large teams shared the 2011 Nobel Prize. This acceleration implies the existence of so-called “dark energy”, whose physical nature is completely puzzling and is perhaps the biggest unsolved problem in astrophysics. As such, any possible correlations between metrics of asymmetry in Type Ia SNe and other observables (Leonard et al. 2005; Wang et al. 2007; Maund et al. 2010a) are of high importance because correlations may be used to identify contaminating Type Ia sub-classes (Wang et al. 2013) or apply statistical corrections to the maximum brightness, both of which could help refine the measurement of the cosmological constant (Maeda et al. 2011). Spectropolarimetry of SNe is a blooming field but, just as with the CMB, handling the statistical and systematic error of polarimetric data for SNe is challenging.

As both polarimetric imaging and spectropolarimetry require dividing incoming light into bins, researchers in these fields often find themselves in photon-starved situations. Indeed, in both subjects discussed above, photon-limited data at low signal-to-noise must be confronted. (Low signal-to-noise is often the hallmark of forefront science by its very nature.) This problem is compounded in polarimetric imaging when a narrow-band filter is needed. Working at low signal-to-noise will continue to be an issue for polarimetrists.

Polarization data is often measured in Stokes parameters but expressed in human-digestible quantities like degrees of polarization and angles of polarization. Despite this, there has been relatively little attention given to the surprisingly complicated statistics involved when making transformations to these quantities at low signal-to-noise. Most of the published literature of this fundamental, practical topic has focused on the analysis of linear polarization alone that involves marginalizations over a two-dimensional gaussian distribution over the Q-U plane cross section of the Poincaré sphere (the “Poincaré disk” is a useful expression although it is already in use in other mathematical contexts). Marginalization over the linear polarization angle results in the Rice distribution (Rice 1945) while marginalization over the degree of linear polarization results in another distribution given by Vinokur (1965). The authors refer to this marginalized approach as the “one-dimensional problem”. A small number of papers such as Wang et al. (1997), Vaillancourt (2006), and Quinn (2012) have used a non-marginalized distribution over the Poincaré disk (the “two-dimensional problem”), with the latter being the first to put the subject on a rigorous mathematical footing. To date, there has been little to no attention paid to the three-dimensional problem defined by a three-dimensional distribution over the Poincaré sphere. Progress on higher dimensional generalizations of the two-dimensional problem have been made, for instance, by members of the Planck Collaboration (Planck Collaboration XIX 2014; Montier et al. 2014a,b) by including intensity in the analysis (see also Viola et al. 2014) but researchers have thus far generally ignored circular polarization when measuring linear polarization in astronomical sources (and vice versa); however, the measurement of one is not actually independent of the other. This effect is generally negligible except when one is working at very small signal-to-noise ratios or with objects with polarization near 100% (see Quinn 2012 for details on how large polarization introduces complication). The aim of this paper is to elucidate some of this interdependence and to see how it relates to the more common ways of treating linear polarization measurements at low signal-to-noise via the Rice distribution or through Bayesian techniques as outlined in Vaillancourt (2006) and Quinn (2012). The results should be of interest to all polarimetrists and those those studying the CMB or SNe in particular.

2. The sampling distribution

A very practical way of encoding the polarization state of electromagnetic radiation is the Stokes parameter formalism invented by George Stokes in 1852. Stokes parameters consist of four quantities: I (intensity), Q and U (related to linear polarization), and V (related to circular polarization)¹. They are convenient quantities to measure but it is easier for people to think about polarization in terms of percentages and angles. This may be done by introducing a spherical coordinate system (p,θ,φ) on points of the Poincaré sphere (a unit ball in Cartesian coordinates centered on the origin of $\frac{Q}{I_{0}}$ $\hbox{$\frac{Q}{I_0}$}$ , $\frac{U}{I_{0}}$ $\hbox{$\frac{U}{I_0}$}$ , and $\frac{V}{I_{0}}$ $\hbox{$\frac{V}{I_0}$}$ axes), which represents all possible polarization states. Accessible modern introductions to Stokes parameters and their theory are given in del Toro Iniesta (2003) and Landi Degl’Innocenti (2002).

If an astronomical source’s polarization state is measured with gaussian error (Σ_I,Σ_Q,Σ_U,Σ_V) on each Stokes parameter (I,Q,U,V), how do the corresponding spherical coordinates (p,θ,φ) relate to the “true” polarization state (p₀,θ₀,φ₀) determined uniquely from (I₀,Q₀,U₀,V₀)? (Unsubscripted variables will usually refer to measured values while “0”-subscripted values will refer to “true” values. Later, an overbar on some variables and distributions will indicate signal-to-noise related quantities.) As it turns out, a spherical coordinate system introduces non-trivial biases into the measured quantities and the measurement itself must be “corrected” to remove them. This paper extends Quinn (2012), which studied these effects in polar coordinates before discussing the Bayesian approach, to spherical coordinates. The reader is referred to that paper for a more extensive introduction on Stokes parameters.

Under the assumptions that the source intensity is large compared to the measurement error, i.e., I₀ ≫ Σ_I which implies I ≈ I₀ (Quinn 2012)², and the error on a Stokes measurement is described by a three-dimensional uncorrelated ellipsoidal Gaussian distribution (with semi-major axes Σ_Q, Σ_U, Σ_V), the normalized distribution, F_C, of the measured Stokes parameters (Q, U, V) given the “true” Stokes parameters (Q₀, U₀, V₀) is $\begin{matrix} F_{C} (Q,U,V | Q_{0}, U_{0}, V_{0}, Σ_{Q}, Σ_{U}, Σ_{V}) = \frac{1}{(2 π)^{3 / 2} Σ_{Q} Σ_{U} Σ_{V}} e^{- (\frac{(Q - Q_{0})^{2}}{2 Σ_{Q}^{2}} + \frac{(U - U_{0})^{2}}{2 Σ_{U}^{2}} + \frac{(V - V_{0})^{2}}{2 Σ_{V}^{2}})} . \end{matrix}$ $\begin{eqnarray} F_C(Q,U,V|Q_0,U_0,V_0,\Sigma_Q,\Sigma_U,\Sigma_V) = \frac{1}{(2 \pi)^{3/2} \Sigma_Q \Sigma_U \Sigma_V} {\rm e}^{-\left( \frac{(Q-Q_0)^2}{2 \Sigma_Q^2} + \frac{(U-U_0)^2}{2 \Sigma_U^2} + \frac{(V-V_0)^2}{2 \Sigma_V^2} \right)}. \label{starteq} \end{eqnarray}$ (1)These parameters are not usually worked with directly. Instead “normalized Stokes parameters” are defined as q ≡ Q/I₀, u ≡ U/I₀, v ≡ V/I₀ (with also q₀ ≡ Q₀/I₀, u₀ ≡ U₀/I₀, and v₀ ≡ V₀/I₀), and σ_q ≡ Σ_Q/I₀, σ_u ≡ Σ_U/I₀, and σ_v ≡ Σ_V/I₀, yielding a new normalized distribution $\begin{matrix} f_{C} (q,u,v | q_{0}, u_{0}, v_{0}, σ_{q}, σ_{u}, σ_{v}) = \frac{1}{(2 π)^{3 / 2} σ_{q} σ_{u} σ_{v}} e^{- (\frac{(q - q_{0})^{2}}{2 σ_{q}^{2}} + \frac{(u - u_{0})^{2}}{2 σ_{u}^{2}} + \frac{(v - v_{0})^{2}}{2 σ_{v}^{2}})} . \end{matrix}$ $\begin{eqnarray} f_C(q,u,v|q_0,u_0,v_0,\sigma_q,\sigma_u,\sigma_v) = \frac{1}{(2 \pi)^{3/2} \sigma_q \sigma_u \sigma_v} {\rm e}^{-\left(\frac{(q-q_0)^2}{2 \sigma_q^2} + \frac{(u-u_0)^2}{2 \sigma_u^2} + \frac{(v-v_0)^2}{2 \sigma_v^2}\right)}. \end{eqnarray}$ (2)Lastly, one most often works with “signal-to-noise” versions of the Stokes parameters defined as $q \equiv Q / Σ_{Q} = q / σ_{q}$ $\hbox{$\overline{q} \equiv Q/\Sigma_Q =q/\sigma_q$}$ , $u \equiv U / Σ_{U} = u / σ_{u}$ $\hbox{$\overline{u} \equiv U/\Sigma_U =u/\sigma_u$}$ , and $v \equiv V / Σ_{V} = v / σ_{v}$ $\hbox{$\overline{v} \equiv V/\Sigma_V =v/\sigma_v$}$ which gives another distribution $f_{C} (q, u, v | q_{0}, u_{0}, v_{0}, σ_{q}, σ_{u}, σ_{v}) = \frac{1}{(2 π)^{3 / 2}} e^{- \frac{(q - q_{0})^{2} + (u - u_{0})^{2} + (v - v_{0})^{2}}{2}},$ $\begin{equation} \overline{f}_C(\overline{q},\overline{u},\overline{v}|\overline{q}_0,\overline{u}_0,\overline{v}_0,\sigma_q,\sigma_u,\sigma_v) = \frac{1}{(2 \pi)^{3/2}} {\rm e}^{-\frac{(\overline{q}-\overline{q}_0)^2 + (\overline{u}-\overline{u}_0)^2 + (\overline{v}-\overline{v}_0)^2}{2}}, \end{equation}$ (3)which is normalized, ${^{\int}}_{- \infty}^{\infty} {^{\int}}_{- \infty}^{\infty} {^{\int}}_{- \infty}^{\infty} f_{C} d q d u d v = 1$ $\hbox{$\int_{-\infty}^\infty \int_{-\infty}^\infty \int_{-\infty}^\infty \overline{f}_C \, {\rm d}\overline{q} \, {\rm d}\overline{u} \, {\rm d}\overline{v}=1$}$ . This distribution no longer explicitly depends on the σ’s and naturally handles ellipsoidal distributions. The σ’s do, however, play a minor role: they limit the possible values of q₀, u₀, and v₀ so it is important not to forget their significance (Quinn 2012).

Fig. 1

Illustration of the spherical coordinate system used in the paper. The variable θ is used for the azimuthal angle with a range of (−π,π ], and the variable ϕ is measured from the positive v-axis with a range of [ 0,π ].

We wish to now transform our equations to spherical coordinates as the Poincaré sphere suggests. The transformation equations³ are $\begin{matrix} p = \sqrt{q^{2} + u^{2} + v^{2}} \\ θ = \arctan (u,q) \\ ϕ = \arctan (\sqrt{q^{2} + u^{2}},v) \end{matrix}$ $\begin{eqnarray} &&p = \sqrt{q^2+u^2+v^2} \notag \\ &&\theta = \arctan(u, q) \notag \\ &&\varphi = \arctan({\sqrt{q^2+u^2}}, v) \end{eqnarray}$ (4)which has inverse $\begin{matrix} q = p \sin (ϕ) \cos (θ) \\ u = p \sin (ϕ) \sin (θ) \\ v = p \cos (ϕ), \end{matrix}$ $\begin{eqnarray} &&q = p \sin(\varphi) \cos(\theta) \notag \\ &&u = p \sin(\varphi) \sin(\theta) \notag \\ &&v = p \cos(\varphi), \end{eqnarray}$ (5)where θ is the angle in the q-u plane and ϕ is the angle from the positive v-axis (see Fig. 1). The “true values”, p₀, θ₀, and ϕ₀, are similarly related to q₀, u₀, and v₀.

It is crucial not to confuse the degree of polarization, p, with the degree of linear polarization in the following. The degree of linear polarization is a separate quantity equal to $\sqrt{q^{2} + u^{2}}$ $\hbox{$\sqrt{q^2+u^2}$}$ or psin(ϕ) and, although it is certainly important, is not discussed in this paper.

We now restrict ourselves to the case where all three standard deviations are equal (Σ_Q = Σ_U = Σ_V ≡ Σ) so that σ_q = σ_u = σ_v ≡ σ. The distribution in spherical coordinates is then $\begin{matrix} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \frac{p^{2} \sin ϕ}{(2 π)^{3 / 2} σ^{3}} e^{- \frac{p^{2} + p_{0}^{2} - 2 p p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})}{2 σ^{2}}} . \end{matrix}$ $\begin{eqnarray} f'(p,\theta,\varphi|p_0,\theta_0,\varphi_0,\sigma) = \frac{p^2 \sin\varphi}{(2 \pi)^{3/2} \sigma^3} {\rm e}^{-\frac{p^2 + p_0^2 - 2 p p_0 \left( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0 \right)}{2 \sigma^2}}. \label{fprime} \end{eqnarray}$ (6)The p²sinϕ in the numerator of the factor out front is due to the Jacobian of the transformation, i.e., dq du dv = p²sinϕ dp dθ dϕ⁴. It is rather promising that the analytic form of Eq. (6) is similar to and not much more complicated than the two-dimensional polar version. Already one can see that the functional form of the equation is not independent of the true value of the circular polarization.

The barred version is $\begin{matrix} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \frac{p^{2} \sin ϕ}{(2 π)^{3 / 2}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix} - 2 p p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})}{2}}, \end{matrix}$ $\begin{eqnarray} \overline{f}'(\overline{p},\theta,\varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma) = \frac{\overline{p}^2 \sin\varphi}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}^2 + \overline{p}_0^2 - 2 \overline{p} \, \overline{p}_0 \left( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0 \right)}{2}}, \end{eqnarray}$ (7)where $p \equiv \sqrt{Q^{2} + U^{2} + V^{2}} / Σ = p / σ$ $\hbox{$\overline{p} \equiv \sqrt{Q^2+U^2+V^2}/\Sigma = p/\sigma$}$ and $p_{0} \equiv \sqrt{Q_{0}^{2} + U_{0}^{2} + V_{0}^{2}} / Σ = p_{0} / σ$ $\hbox{$\overline{p}_0 \equiv \sqrt{Q_0^2+U_0^2+V_0^2}/\Sigma = p_0/\sigma$}$ . This is still normalized, ${^{\int}}_{0}^{π} {^{\int}}_{- π}^{π} {^{\int}}_{0}^{1 / σ} f^{'} d p d θ d ϕ = 1$ $\hbox{$\int_{0}^\pi \int_{-\pi}^\pi \int_0^{1/\sigma} \overline{f}' \, {\rm d}\overline{p} \, {\rm d}\theta \, {\rm d}\varphi =1$}$ .

3. Classical estimators

Now that $f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ)$ $\hbox{$\overline{f}'(\overline{p},\theta,\varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma)$}$ (the sampling distribution in signal-to-noise variables) is at hand, two important classical estimators may be calculated. These are the so-called “Most Likely” and “Most Probable” estimators. (Note that the prime is not indicating a derivative in $f^{'}$ $\hbox{$\overline{f}'$}$ but serving as a warning that we are using the q-u plane angle, θ, instead of the sky angle for linear polarization.)

3.1. The “Most Likely” solution

The “Most Likely” (ML) solution is obtained by maximizing $f^{'}$ $\hbox{$\overline{f}'$}$ with respect to $p_{0}$ $\hbox{$\overline{p}_0$}$ , θ₀, and ϕ₀. Physically, this corresponds to finding the value of $(p_{0}, θ_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0, \theta_0, \varphi_0)$}$ which makes the measured value of $(p,θ,ϕ)$ $\hbox{$(\overline{p}, \theta, \varphi)$}$ the most statistically likely, hence the name. A general solution may be found by solving the system $\frac{\partial f^{'}}{\partial p_{0}} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \overline{p}_0}=0$}$ , $\frac{\partial f^{'}}{\partial θ_{0}} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \theta_0}=0$}$ , and $\frac{\partial f^{'}}{\partial ϕ_{0}} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \varphi_0}=0$}$ for $p_{0}$ $\hbox{$\overline{p}_0$}$ , θ₀, and ϕ₀, which are then treated as estimators. Technically, a second derivative test must also be done to check that the point is actually a maximum. For functions of more than two variables, this test requires checking that all the eigenvalues of the function’s Hessian matrix are positive. (All negative would be a minima, while mixed values would be inconclusive.) The equations involved in the test are very large and cumbersome. From graphical investigation or intuition, it is obvious, however, that the solution to be found is a maximum.

From the azimuthal equation, $\frac{\partial f^{'}}{\partial θ_{0}} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \theta_0}=0$}$ , it is quickly deduced that θ₀ = θ everywhere except for values that correspond to an origin (where $p_{0}$ $\hbox{$\overline{p}_0$}$ or $p$ $\hbox{$\overline{p}$}$ equals zero) or a circular polarization axis (where ϕ₀ or ϕ equals zero or π) of the true and measured Poincaré spheres. Further tedious but straight-forward algebra allows the full solution to be found, which is just

$\begin{matrix} p_{0, ML} = p \\ θ_{0, ML} = θ \end{matrix}$ $\begin{eqnarray} &&\overline{p}_{\rm 0,ML}= \overline{p}\notag \\ &&\theta_{\rm 0,ML}= \theta\notag\\ &&\varphi_{\rm 0,ML}= \varphi. \label{MLsolution} \end{eqnarray}$ (8)

This solution involves no “bias corrections” at all. What you measure is your best estimate for the “true” polarization.

This solution is consistent with the polar coordinate case (that is, the two-dimensional case corresponding to ϕ → π/ 2, ϕ₀ → π/ 2, and σ_v → 0) presented in Quinn (2012). It is only when the Rice distribution (a one-dimensional distribution arising from marginalization) is used that a non-trivial solution is found for the ML method (Simmons & Stewart 1985). As marginalization intentionally removes information present in the original distribution, the logical basis for the use of non-trivial ML estimators found from a marginalized distribution to perform a “bias correction” is questionable. This is underscored by the fact that the p₀, θ₀, and ϕ₀ are not stochastic variables but input parameters in $f^{'}$ $\hbox{$\overline{f}'$}$ and should not be treated as such as is done in the ML approach. The logically correct way to “invert” the distribution is to use Bayes’ Theorem.

3.2. The “Most Probable” solution

The “Most Probable” (MP) solution is found by maximizing $f^{'}$ $\hbox{$\overline{f}'$}$ with respect to $p$ $\hbox{$\overline{p}$}$ , θ, and ϕ. This corresponds to finding the $(p_{0}, θ_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0,\theta_0,\varphi_0)$}$ point that produces a distribution of measured points with a maximum at the actual measured point $(p,θ,ϕ)$ $\hbox{$(\overline{p},\theta,\varphi)$}$ . A general solution may be found by solving the system $\frac{\partial f^{'}}{\partial p} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \overline{p}}=0$}$ , $\frac{\partial f^{'}}{∂θ} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \theta}=0$}$ , and $\frac{\partial f^{'}}{∂ϕ} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \varphi}=0$}$ for $p_{0}$ $\hbox{$\overline{p}_0$}$ , θ₀, and ϕ₀. (As before, a second derivative test is technically necessary to determine if the solution is actually a maximum as opposed to a minimum or some point of mixed inflection but this test is similarly impractical to perform because the equations end up being quite large. Graphically it can be shown that the solutions are maximums.)

Again, one quickly finds θ₀ = θ from the azimuthal equation, $\frac{\partial f^{'}}{∂θ} = 0$ $\hbox{$\frac{\partial \overline{f}'}{\partial \theta}=0$}$ . Using this condition, the ϕ and $p$ $\hbox{$\overline{p}$}$ equations yield $p^{2} = 2 + p p_{0} \cos (ϕ - ϕ_{0})$ $\hbox{$\overline{p}^2 = 2 + \overline{p} \, \overline{p}_0 \cos(\varphi - \varphi_0)$}$ and $p p_{0} \cos (ϕ_{0}) \sin (ϕ)^{2} = \cos (ϕ) (1 + p p_{0} \sin (ϕ) \sin (ϕ_{0}))$ $\hbox{$\overline{p} \, \overline{p}_0 \cos(\varphi_0) \sin(\varphi)^2 = \cos(\varphi) (1 + \overline{p} \, \overline{p}_0 \sin(\varphi) \sin(\varphi_0))$}$ , respectively. This two-equation system is somewhat difficult to solve and the solution is best accomplished with a computer algebra system. The full analytic solution is

$\begin{matrix} p_{0, MP} = \frac{\sqrt{\cot (ϕ)^{2} + (p^{2} - 2)^{2}}}{p} \\ θ_{0, MP} = θ \end{matrix}$ $\begin{eqnarray} &&\overline{p}_{\rm 0,MP} = \frac{\sqrt{\cot(\varphi)^2+(\overline{p}^2-2)^2}}{\overline{p}} \notag\\ &&\theta_{\rm 0,MP} = \theta \notag\\ &&\varphi_{\rm 0,MP} = \varphi - \arctan\left(\frac{\cot(\varphi)}{\overline{p}^2-2}\right), \label{MPsolution} \end{eqnarray}$ (9)

which is only valid for points satisfying the condition $p > \sqrt{1 + \csc (ϕ)^{2}},$ $\begin{equation} \overline{p} > \sqrt{1 + \csc(\varphi)^2}, \label{condition} \end{equation}$ (10)where csc(x) is the cosecant function. Thus there are two coupled “bias corrections” that must be made to the data: one for $p$ $\hbox{$\overline{p}$}$ and one for ϕ. A graphical representation of the preceding solution is given in Fig. 2. The left-most extension of the region consisting of points satisfying Eq. (10) (shown in blue in the figure) approaches the point $(p,ϕ) = (\sqrt{2},π / 2)$ $\hbox{$(\overline{p},\varphi)=(\sqrt{2},\pi/2)$}$ . In the special case of ϕ = π/ 2 and $p > \sqrt{2}$ $\hbox{$\overline{p}>\sqrt{2}$}$ , the correction is just ϕ_0,MP = π/ 2 and $p_{0, MP} = p - 2 / p$ $\hbox{$\overline{p}_{\rm 0,MP}=\overline{p}-2/\overline{p}$}$ .

Fig. 2

Visualization of the $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ bias correction field of Eq. (9). The tail of each arrow is located at a measured $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ point and the head is attached to the corresponding “Most Probable” estimate of $(p_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0,\varphi_0)$}$ . This correction should only be applied to $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ points lying in the blue region given by Eq. (10). See the text for discussion about $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ points outside the blue region.

Fig. 3

Behavior of the bias correction field (Eq. (9)) on the boundary of the blue valid region (Eq. (10)). The left-most point on the boundary of the blue region is at $(\sqrt{2},π / 2)$ $\hbox{$(\!\sqrt2,\pi/2)$}$ .

The “bias correction field” in Fig. 2 shows that the correction applied to $p$ $\hbox{$\overline{p}$}$ gets larger as $p$ $\hbox{$\overline{p}$}$ gets smaller (the actual magnitude is ϕ-dependent). This is consistent with the known behavior in the two-dimensional case. Figure 2 also shows that ϕ itself must be corrected: it should be made larger when ϕ>π/ 2 and smaller when ϕ<π/ 2. The magnitude of this correction also tends to be larger as $p$ $\hbox{$\overline{p}$}$ gets smaller. The exact magnitude of the ϕ-correction at a given $p$ $\hbox{$\overline{p}$}$ depends on ϕ itself with the magnitude being largest in the “mid-latitudes” of the Poincaré sphere and zero on the equator. Generally, $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ -points not on the equator are “attracted” towards the poles, an effect most prominent at low signal-to-noise (between, say, $\sqrt{2} < p ≲ 5$ $\hbox{$\sqrt2 < \overline{p} \lesssim 5$}$ ). The bias correction vector field diminishes as $p$ $\hbox{$\overline{p}$}$ gets larger but for ϕ values near the poles is seen to remain significant even for $p > 10$ $\hbox{$\overline{p}>10$}$ , a regime typically thought of as having signal-to-noise high enough not to have to worry about such effects.

For points infinitesimally close to the boundary defined by Eq. (10), the bias correction for ϕ estimates ϕ₀ = 0 or ϕ₀ = π, whichever pole is nearest. Figure 3 illustrates this behavior. The special case of ϕ = π/ 2 “corrects” to ϕ₀ = π/ 2. More specifically, points on the boundary with π/ 2 <ϕ<π all bias correct to ϕ₀ = π while those with 0 <ϕ<π/ 2 all bias correct to ϕ₀ = 0. The $p$ $\hbox{$\overline{p}$}$ correction on the boundary acts typically and corrects more and more towards the origin as $p$ $\hbox{$\overline{p}$}$ gets smaller.

From Figs. 2 and 3, it can be seen that every possible $(p_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0,\varphi_0)$}$ with $p_{0} > 0$ $\hbox{$\overline{p}_0>0$}$ and sinϕ₀ ≠ 0 is associated with a $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ point within the blue region and vice versa. Unfortunately, a closed form solution for the inverse of Eq. (9) was not found and may not exist.

Intuitively⁵, the $p$ $\hbox{$\overline{p}$}$ correction may be understood by considering a $p_{0}$ $\hbox{$\overline{p}_0$}$ value of zero. The measured value of $p$ $\hbox{$\overline{p}$}$ cannot be negative and, due to scatter from measurement error, you expect that $p$ $\hbox{$\overline{p}$}$ will almost always be positive (that is, there is a negligibly small chance of zero) even though $p_{0} = 0$ $\hbox{$\overline{p}_0=0$}$ . Thus, the measurement process “biases” $p$ $\hbox{$\overline{p}$}$ to larger values by some amount. Therefore, a correction should be made that subtracts a bit from the measured value to obtain a good estimate of $p_{0}$ $\hbox{$\overline{p}_0$}$ . Even for a non-zero $p_{0}$ $\hbox{$\overline{p}_0$}$ , this biasing occurs although with diminishing effect as $p_{0}$ $\hbox{$\overline{p}_0$}$ gets larger. The same line of reasoning can be used to gain an intuitive understanding of the ϕ bias correction. Consider a source in a state of pure circular polarization. This corresponds to a pole of the Poincaré sphere. Let’s focus on the purely right-circularly polarized case (ϕ₀ = 0). Measurements of this state will be clustered around the pole (including outside the Poincaré sphere) but ϕ is never negative and is almost always positive. Similarly for the left-circularly polarized case (ϕ₀ = π), measurements of ϕ will be clustered near that pole but cannot have ϕ greater than π and almost always will have ϕ less than π. Just as with $p$ $\hbox{$\overline{p}$}$ , a correction is needed to remove this effect from ϕ. It is important to remark that both the $p$ $\hbox{$\overline{p}$}$ and ϕ bias corrections are purely coordinate effects. It may seem at first as if the ϕ-correction is inconsistent with the symmetry of the system (from Σ_Q = Σ_U = Σ_V) but it is not. While conventionally the v-axis is always chosen to isolate circular polarization, the effect would arise regardless of the orientation of the v-axis within the Poincaré sphere, which preserves the overall symmetry (i.e., the symmetry is spontaneously broken by introduction of the coordinate system).

It must be stressed that the solution of Eq. (9) only applies to measured points satisfying Eq. (10) (illustrated by the blue region in the figures). Presumably, the area outside the blue region divides into three other regions: one characterized by $p_{0, MP} = 0$ $\hbox{$\overline{p}_{\rm 0,MP}=0$}$ , another by $p_{0, MP} \neq 0$ $\hbox{$\overline{p}_{\rm 0,MP} \ne 0$}$ and ϕ_0,MP = 0, and still another by $p_{0, MP} \neq 0$ $\hbox{$\overline{p}_{\rm 0,MP} \ne 0$}$ and ϕ_0,MP = π. In the latter two cases, there could still be a $p$ $\hbox{$\overline{p}$}$ bias correction at work. It is unclear how to determine mathematically a “MP solution” in the non-blue region of Fig. 2. A formal solution may simply not exist there. The authors’ best guess as to the best practice is that measurements within the “triangle” defined by points (0,0), (0,π), and $(\sqrt{2},π / 2)$ $\hbox{$(\sqrt{2},\pi/2)$}$ correct to (0,π) if ϕ>π/ 2, (0,0) if ϕ<π/ 2, and (0,π/ 2) if ϕ = π/ 2⁶. The remaining points could correct to the same value as the blue region boundary point whose “correction arrow” passes through the point. Another possibility for these later points is to translate the correction arrow from the boundary of the blue region with the same ϕ to the point and use the intersection of that arrow with the $p$ $\hbox{$\overline{p}$}$ -axis as the corrected value. Formal solutions in these regions where not, however, found.

Equation (9) does indeed provide an estimate of the $(p_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0,\varphi_0)$}$ point that would produce a distribution of measured points with maximum at $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ . It is however just a point estimate. The actual $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ distribution for a small values of $p_{0}$ $\hbox{$\overline{p}_0$}$ tends to be broad, shallow, and nearly flat, especially in the ϕ dimension. In other words, the errors bars associated with each estimate will be large so it is cautioned that the point estimate should not be taken too literally. Using the $f^{'}$ $\hbox{$\overline{f}'$}$ distribution, one could construct confidence regions. These regions are however not uniquely determined, even in the one-dimensional case (Simmons & Stewart 1985). Construction of confidence regions is reserved for future work.

4. Marginalized distributions

Experimenters often work with only a partial description of a physical system. This may be necessary when independent variables of a distribution of interest related to the system are not measurable, or desired when there are so-called nuisance variables of little concern. In a polarization study, for instance, it may be that only the degree of linear polarization is the focus and not the angle of linear polarization. In such situations, one can marginalize over “unwanted” independent variables to find a new distribution which no longer depends on them but at the cost of losing some information present in the original distribution.

The marginalization possibilities for f′ are to marginalize over (I) p only; (II) θ only; (III) ϕ only; (IV) p and θ; (V) p and ϕ; and (VI) θ and ϕ. The first and last cases are perhaps the most interesting and shall now be investigated.

4.1. The p-marginalized angular distribution

The p-marginalized angular distribution is $M (θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \int_{0}^{\infty} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) d p$ $\begin{equation} M(\theta, \varphi|p_0,\theta_0,\varphi_0,\sigma) = \int_{0}^{\infty} f'(p,\theta,\varphi|p_0,\theta_0,\varphi_0,\sigma) \, {\rm d}p \end{equation}$ (11)or, in signal-to-noise variables, $M (θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \int_{0}^{\infty} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) d p .$ $\begin{equation} \overline{M}(\theta, \varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma) = \int_{0}^{\infty} \overline{f}'(\overline{p},\theta,\varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma) \, {\rm d}\overline{p}. \end{equation}$ (12)This equation can be simplified. Start with $\begin{matrix} M (θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \int_{0}^{\infty} \frac{p^{2} \sin ϕ}{(2 π)^{3 / 2}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix} - 2 p p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})}{2}} d p \\ = \frac{\sin ϕ}{(2 π)^{3 / 2}} e^{- \frac{p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \int_{0}^{\infty} p^{2} e^{- \frac{p^{2} - 2 p p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})}{2}} d p \end{matrix}$ $\begin{eqnarray*} \overline{M}(\theta, \varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma)= \qquad\qquad~\int_{0}^{\infty} \frac{\overline{p}^2 \sin\varphi}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}^2 + \overline{p}_0^2 - 2 \overline{p} \, \overline{p}_0 \left( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0 \right)}{2}} {\rm d}\overline{p} \\ =\frac{\sin\varphi}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}_0^2}{2}} \int_{0}^{\infty} \overline{p}^2 {\rm e}^{-\frac{\overline{p}^2 - 2 \overline{p} \, \overline{p}_0 \left( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0 \right)}{2}} {\rm d}\overline{p} \end{eqnarray*}$ and use the definite integral $\int_{0}^{\infty} x^{2} e^{- \frac{x^{2} + ax}{2}} d x = \sqrt{\frac{π}{32}} (a^{2} + 4) e^{a^{2} / 8} erfc (\frac{a}{2 \sqrt{2}}) - \frac{a}{2},$ $\begin{equation} \int_0^\infty \! x^2 {\rm e}^{-\frac{x^2+a x}{2}} {\rm d}x = \sqrt{\frac{\pi}{32}} (a^2+4) {\rm e}^{a^2/8} \operatorname{erfc}\left( \frac{a}{2\sqrt{2}} \right) -\frac{a}{2}, \end{equation}$ (13)where erfc(x) is the complementary error function defined as⁷ $erfc (x) \equiv 1 - \erf (x) = \frac{2}{\sqrt{π}} \int_{x}^{\infty} e^{- t^{2}} d t$ $\begin{equation} \operatorname{erfc}(x) \equiv 1 - \operatorname{erf}(x) = \frac{2}{\sqrt{\pi}} \int_x^\infty {\rm e}^{-t^2} {\rm d}t \end{equation}$ (14)and erf(x) is the error function to reduce this integral. $\begin{matrix} M (θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) = \frac{\sin ϕ}{(2 π)^{3 / 2}} e^{- \frac{p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} (\sqrt{\frac{π}{32}} (A^{2} + 4) e^{A^{2} / 8} erfc (\frac{A}{2 \sqrt{2}}) - \frac{A}{2}) \end{matrix}$ $\begin{eqnarray} \overline{M}(\theta, \varphi|\overline{p}_0,\theta_0, \varphi_0, \sigma) = \frac{\sin\varphi}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}_0^2}{2}} \left( \sqrt{\frac{\pi}{32}} (A^2+4) {\rm e}^{A^2/8} \operatorname{erfc} \left( \frac{A}{2\sqrt{2}} \right) -\frac{A}{2} \right) \label{pmarginalized} \end{eqnarray}$ (15)is obtained where $A \equiv - 2 p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})$ $\begin{equation} A \equiv -2 \, \overline{p}_0 \left( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0 \right) \end{equation}$ (16)which may be viewed as a higher-dimensional analog of the angular distribution presented in Vinokur (1965; see also Clarke & Stewart 1986; Naghizadeh-Khouei & Clarke 1993; Quinn 2012).

Examining a few density plots of Eq. (15) for various values of $p_{0}$ $\hbox{$\overline{p}_0$}$ , θ₀, and ϕ₀ reveals that the probability distribution is attenuated near the poles of the coordinate system, an effect that becomes broader for small values of $p_{0}$ $\hbox{$\overline{p}_0$}$ . In particular, one should notice that even for $p_{0} = 0$ $\hbox{$\overline{p}_0=0$}$ , the probability density still varies with ϕ. This is due to the pole bias effect discussed earlier.

One would like to go further and find the two one-dimensional marginalized angular distributions (cases IV and V) but the integrations seem unable to be completely performed using elementary or even common special functions.

4.2. The angular-marginalized p-distribution

One is often primarily concerned with the degree of polarization, p, and not the value of the angular variables. Marginalization over the angular variables will therefore produce a new distribution, to be called Q ( $Q$ $\hbox{$\overline{\myletter}$}$ in signal-to-noise variables) of high practical importance. In the polar coordinate case, marginalization over the (lone) angular variable, θ, produces the Rice distribution. In the spherical coordinate case, the marginalization must occur over both θ and ϕ. Thus, the function $Q$ $\hbox{$\overline{\myletter}$}$ may be considered a higher-dimensional analog of the Rice distribution. Let $Q (p | p_{0}, θ_{0}, ϕ_{0},σ) = \int_{0}^{π} \int_{- π}^{π} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) d θ d ϕ$ $\begin{equation} \myletter(p|p_0,\theta_0,\varphi_0,\sigma) = \int_{0}^{\pi} \! \int_{-\pi}^{\pi} f'(p,\theta,\varphi|p_0,\theta_0,\varphi_0,\sigma) \, {\rm d}\theta \, {\rm d}\varphi \end{equation}$ (17)or similarly $Q (p | p_{0}, θ_{0}, ϕ_{0},σ) = \int_{0}^{π} \int_{- π}^{π} f^{'} (p,θ,ϕ | p_{0}, θ_{0}, ϕ_{0},σ) d θ d ϕ .$ $\begin{equation} \overline{\myletter}(\overline{p}|\overline{p}_0,\theta_0,\varphi_0,\sigma) = \int_{0}^{\pi} \! \int_{-\pi}^{\pi} \overline{f}'(\overline{p},\theta,\varphi|\overline{p}_0,\theta_0,\varphi_0,\sigma) \, {\rm d}\theta \, {\rm d}\varphi. \end{equation}$ (18)Let’s calculate. $\begin{matrix} Q (p | p_{0}, θ_{0}, ϕ_{0},σ) = \frac{p^{2}}{(2 π)^{3 / 2}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \\ \times \int_{0}^{π} \int_{- π}^{π} \sin ϕ e^{p p_{0} (\sin ϕ \sin ϕ_{0} \cos (θ - θ_{0}) + \cos ϕ \cos ϕ_{0})} d θ d ϕ \\ = \frac{p^{2}}{(2 π)^{3 / 2}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \\ \times \int_{0}^{π} \sin ϕ e^{p p_{0} \cos ϕ \cos ϕ_{0}} \int_{- π}^{π} e^{p p_{0} \sin ϕ \sin ϕ_{0} \cos (θ - θ_{0})} d θ d ϕ . \end{matrix}$ $\begin{eqnarray} &&\overline{\myletter}(\overline{p}|\overline{p}_0, \theta_0,\varphi_0,\sigma) = \frac{\overline{p}^2}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}^2+\overline{p}_0^2}{2}}\notag \\ &&~ ~\quad \times \int_{0}^{\pi} \! \int_{-\pi}^{\pi} \! \sin\varphi \, {\rm e}^{\overline{p} \, \overline{p}_0 ( \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) + \cos\varphi \cos\varphi_0)} \, {\rm d}\theta \, {\rm d}\varphi\notag \\ && \quad\qquad\qquad \qquad=\frac{\overline{p}^2}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}^2+\overline{p}_0^2}{2}}\notag \\ & &~~\quad\times \int_{0}^{\pi} \! \sin\varphi \, {\rm e}^{\overline{p} \, \overline{p}_0 \cos\varphi \cos\varphi_0 } \int_{-\pi}^{\pi} \! {\rm e}^{\overline{p} \, \overline{p}_0 \sin\varphi \sin\varphi_0 \cos(\theta-\theta_0) } \, {\rm d}\theta \, {\rm d}\varphi . \end{eqnarray}$ (19)The θ-integral can be solved using ${^{\int}}_{- π}^{π} e^{a \cos (t)} d t = 2 π ℐ_{0} (a)$ $\hbox{$\int_{-\pi}^{\pi} {\rm e}^{a \cos(t)} {\rm d}t = 2\pi \mathcal{I}_0(a)$}$ , where ℐ₀(x) is the modified Bessel function of the first kind. ℐ₀ is an even, real function when its argument is real. (Because the range of integration is over a full period of θ, the value of θ₀ is immaterial because it appears as a phase shift and does not affect the integral.) Once the θ-integral is performed, the $Q$ $\hbox{$\overline{\myletter}$}$ distribution no longer explicitly depends on θ₀. We shall therefore drop θ₀ from the conditions. Continuing with the calculation, $\begin{matrix} Q (p | p_{0}, ϕ_{0},σ) = \frac{p^{2}}{(2 π)^{3 / 2}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \\ \times \int_{0}^{π} \sin ϕ e^{p p_{0} \cos ϕ \cos ϕ_{0}} (2 π) ℐ_{0} (p p_{0} \sin ϕ \sin ϕ_{0}) d ϕ \\ = \frac{p^{2}}{\sqrt{2 π}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \end{matrix}$ $\begin{eqnarray} &&\overline{\myletter}(\overline{p}|\overline{p}_0,\varphi_0,\sigma) = \frac{\overline{p}^2}{(2 \pi)^{3/2}} {\rm e}^{-\frac{\overline{p}^2+\overline{p}_0^2}{2}} \notag \\ &&~~\quad \times\int_{0}^{\pi} \! \sin\varphi \, {\rm e}^{\overline{p} \, \overline{p}_0 \cos\varphi \cos\varphi_0 } (2 \pi) \mathcal{I}_0(\overline{p} \, \overline{p}_0 \sin\varphi \sin\varphi_0) {\rm d}\varphi \notag \\ &&\qquad \qquad\qquad= \frac{\overline{p}^2}{\sqrt{2 \pi}} {\rm e}^{-\frac{\overline{p}^2+\overline{p}_0^2}{2}} \notag\\ &&~~\quad \times\int_{0}^{\pi} \! \sin\varphi \, {\rm e}^{\overline{p} \, \overline{p}_0 \cos\varphi \cos\varphi_0 } \mathcal{I}_0(\overline{p} \, \overline{p}_0 \sin\varphi \sin\varphi_0) {\rm d}\varphi. \label{bigbad} \end{eqnarray}$ (20)The ϕ-integral in the last expression is non-trivial but it appears it can be solved. Let $ω (a,ϕ, ϕ_{0}) \equiv \sin ϕ e^{a \cos ϕ \cos ϕ_{0}} ℐ_{0} (a \sin ϕ \sin ϕ_{0})$ $\begin{equation} \omega(a,\varphi,\varphi_0) \equiv \sin\varphi \, {\rm e}^{a \cos\varphi \cos\varphi_0} \mathcal{I}_0(a \sin\varphi \sin\varphi_0) \label{hardfunc} \end{equation}$ (21)and $Ω (a, ϕ_{0}) \equiv \int_{0}^{π} ω (a,ϕ, ϕ_{0}) d ϕ,$ $\begin{equation} \Omega(a,\varphi_0) \equiv \int_{0}^{\pi} \! \omega(a,\varphi,\varphi_0) {\rm d}\varphi, \label{hardint} \end{equation}$ (22)where $a \equiv p p_{0} \geq 0$ $\hbox{$a \equiv \overline{p} \, \overline{p}_0 \ge 0$}$ . No computer algebra system tested could solve the integral in Eq. (22) and no applicable integrals were found in tables of integrals. Manual attempts to solve it also failed. From graphs of Ω versus ϕ₀ created numerically⁸ for several values of a, it is noticed that the function may actually be constant in ϕ₀ for a given a. Unfortunately, it is not a simple matter to show $\frac{\partial Ω}{\partial ϕ_{0}} = 0$ $\hbox{$\frac{\partial\Omega}{\partial\varphi_0} = 0$}$ but if $\frac{\partial Ω}{\partial ϕ_{0}}$ $\hbox{$\frac{\partial\Omega}{\partial\varphi_0}$}$ is graphed for 0 ≤ ϕ₀ ≤ π at several sampled values of a it seems to always show residuals about zero with scatter consistent with numerical noise. The previous numerical observations lead one to suspect very strongly that Ω(a,ϕ₀) is independent of the value of ϕ₀. This may be surprising in light of the bias corrections previously discussed; however, the same marginalized formula seems like it should be found regardless of the rotational orientation of the $(q, u, v)$ $\hbox{$(\overline{q},\overline{u},\overline{v})$}$ coordinate system within the Poincaré sphere. If it is true that Ω(a,ϕ₀) is constant in ϕ₀, any value of ϕ₀ may be chosen and it does not alter Ω(a,ϕ₀)’s value at a given a. In that case, a value of ϕ₀ may be judiciously chosen that simplifies the integral. Good choices are ϕ₀ = 0 or π for which $Ω (a, ϕ_{0}) = \frac{2 \sinh (a)}{a}$ $\begin{equation} \Omega(a,\varphi_0) = \frac{2 \sinh(a)}{a} \label{unprovedresult} \end{equation}$ (23)is found using the common integral ${^{\int}}_{0}^{π} e^{x \cos ϕ} \sin ϕ d ϕ = 2 \sinh (x) / x$ $\hbox{$\int_0^\pi \! {\rm e}^{x \cos{\varphi}} \sin{\varphi} \, {\rm d}\varphi = 2\sinh(x)/x$}$ , where sinh(x) is the hyperbolic sine of x. Indeed, simple arguments modifying the factors in the integrand of Eq. (22) can prove this to be a lower limit for Ω(a,ϕ₀) at a given a (see Appendix A). Proof that it is also the upper limit was elusive (except at ϕ₀ = 0, π/ 2, and π). A Taylor series expansion of Ω(a,ϕ₀) in ϕ₀ at ϕ₀ = 0 was found to be consistent with Eq. (23) out to 26-th order using computer algebra systems⁹. Based on physical suspicion corroborated by numerical results, Eq. (23) shall be assumed to be true henceforth.

Returning to the main distribution, $Q$ $\hbox{$\overline{\myletter}$}$ , and using Eq. (23), it is proposed that $Q (p | p_{0},σ) = \sqrt{\frac{2}{π}} \frac{p}{p_{0}} e^{- \frac{p^{2} + p \begin{matrix} 2 \\ 0 \end{matrix}}{2}} \sinh (p p_{0}) .$ $\begin{equation} \overline{\myletter}(\overline{p}|\overline{p}_0,\sigma) = \sqrt{\frac{2}{\pi}} \frac{\overline{p}}{\overline{p}_0} {\rm e}^{-\frac{\overline{p}^2+\overline{p}_0^2}{2}} \sinh(\overline{p} \, \overline{p}_0). \label{mydistribution} \end{equation}$ (24)Since the value of ϕ₀ is irrelevant, $Q$ $\hbox{$\overline{\myletter}$}$ ’s dependence on ϕ₀ has been dropped from the notation. This is the higher-dimensional analog of the Rice distribution. Surprisingly, it is a simpler function than the Rice distribution in the sense that it depends on the hyperbolic sine function rather than a modified Bessel function. The $Q$ $\hbox{$\overline{\myletter}$}$ distribution is normalized, ${^{\int}}_{0}^{\infty} Q d p = 1$ $\hbox{$\int_0^\infty \! \overline{\myletter} \, {\rm d}\overline{p} =1$}$ .

Fig. 4

Contour plot of the angular-marginalized $p$ $\hbox{$\overline{p}$}$ -distribution given by Eq. (24). The contours are at 0.1 intervals with the lighter shades corresponding to larger values. The dotted and dashed curves (Eqs. (25) and (26)) trace maximums along horizontal and vertical slices and end at the red points at $(\sqrt{2}, 0)$ $\hbox{$(\sqrt{2},0)$}$ and $(\sqrt{3}, 0)$ $\hbox{$(\sqrt{3},0)$}$ , respectively. Although the “shape” of the distribution is not affected by the value of σ, the plot assumes that σ ≤ 1/4 so that the plotted values for $p_{0}$ $\hbox{$\overline{p}_0$}$ up to 4 (=1 /σ_max) exist.

Some general properties of $Q$ $\hbox{$\overline{\myletter}$}$ can now be investigated. A contour plot of the function is shown in Fig. 4 with contours at 0.1 intervals. The function achieves a global maximum as $(p, p_{0})$ $\hbox{$(\overline{p},\overline{p}_0)$}$ approaches $(\sqrt{2}, 0)$ $\hbox{$(\!\sqrt{2},0)$}$ of $\frac{2}{e} \sqrt{\frac{2}{π}} \approx 0.587051$ $\hbox{$\frac{2}{e} \sqrt{\frac{2}{\pi}} \approx 0.587051$}$ . Tracing the maximums of horizontal slices (the dotted line in the figure) yields the equation $(p^{2} - 1) \sinh (p p_{0}) - p p_{0} \cosh (p p_{0}) = 0$ $\hbox{$(\overline{p}^2-1) \sinh(\overline{p} \, \overline{p}_0) - \overline{p} \, \overline{p}_0 \cosh(\overline{p} \, \overline{p}_0)=0$}$ or $(p \begin{matrix} 2 \\ 0 \end{matrix} - 1) \tanh (p p_{0}) = p p_{0} .$ $\begin{equation} (\overline{p}_0^2-1) \tanh(\overline{p} \, \overline{p}_0) = \overline{p} \, \overline{p}_0. \label{hslices} \end{equation}$ (25)This implicit curve intersects the $p$ $\hbox{$\overline{p}$}$ -axis at the global maximum at $(\sqrt{2}, 0)$ $\hbox{$(\!\sqrt{2},0)$}$ and is the “Most Probable” estimator curve for $Q$ $\hbox{$\overline{\myletter}$}$ . Tracing the maximums of vertical slices (dashed line) yields $p p_{0} \cosh (p p_{0}) - (p \begin{matrix} 2 \\ 0 \end{matrix} + 1) \sinh (p p_{0}) = 0$ $\hbox{$\overline{p} \, \overline{p}_0 \cosh(\overline{p} \, \overline{p}_0) - (\overline{p}_0^2+1) \sinh(\overline{p} \, \overline{p}_0)=0$}$ or $(p \begin{matrix} 2 \\ 0 \end{matrix} + 1) \tanh (p p_{0}) = p p_{0} .$ $\begin{equation} (\overline{p}_0^2+1) \tanh(\overline{p} \, \overline{p}_0) = \overline{p} \, \overline{p}_0. \label{vslices} \end{equation}$ (26)This curve intersects at the $p$ $\hbox{$\overline{p}$}$ -axis at $(\sqrt{3}, 0)$ $\hbox{$(\sqrt{3},0)$}$ and is the “Most Likely” estimator curve for $Q$ $\hbox{$\overline{\myletter}$}$ .

The MP estimator curve of Eq. (25) (dotted curve in Fig. 4) can be used for all $p > \sqrt{2}$ $\hbox{$\overline{p}>\sqrt2$}$ . Yet it was stressed that the full MP solution given Eqs. (9) and (10) is only valid for some values with $p > \sqrt{2}$ $\hbox{$\overline{p}>\sqrt2$}$ . Marginalization has thus hidden some of the complexity of the problem. The ML estimator given by Eq. (26) (dashed curve) now has non-trivial behavior unlike the full solution in Eq. (8). It is not easy to anticipate the effect that marginalization will have on various estimators.

It is interesting to compare the long form of these estimator curves and the corresponding curves based on the Rice distribution (Wardle & Kronberg 1974; Simmons & Stewart 1985; Quinn 2012). These curves are based on the hyperbolic trigonometric functions whereas the curves in the two-dimensional case are based on Bessel functions. Their points of intersection with the $p$ $\hbox{$\overline{p}$}$ -axis have also been shifted. The differences arise because the Jacobian factor is proportional to p² in spherical coordinates but only p for polar coordinates.

5. Discussion

How do these new results relate to the old results derived using polar coordinates for the “Poincaré disk”? First, the old results are not special cases of the new results when ϕ → π/ 2 and ϕ₀ → π/ 2; for instance, in the special case of ϕ = π/ 2, the $p$ $\hbox{$\overline{p}$}$ equation of Eq. (9) reduces to $p - 2 / p$ $\hbox{$\overline{p}-2/\overline{p}$}$ rather than the Wang estimator $p - 1 / p$ $\hbox{$\overline{p}-1/\overline{p}$}$ (Wang et al. 1997; Quinn 2012) and $M (θ,ϕ)$ $\hbox{$\overline{M}(\theta,\varphi)$}$ does not reduce to angular distribution given by Vinokur (1965). Similarly, $Q$ $\hbox{$\overline{\myletter}$}$ is altogether distinct from the Rice distribution. The new results are therefore not direct extensions of the old functions but they do have somewhat similar forms. The three versus two dimensional nature of the parameter space is responsible for this.

Particularly interesting is a comparison of Fig. 4 with Fig. 2a of Quinn (2012). The intercepts of all the estimator curves have undergone shifts of the form $\sqrt{n} \to \sqrt{n + 1}$ $\hbox{$\sqrt{n} \rightarrow \sqrt{n+1}$}$ . This again is due to the increased dimensionality of the problem. Thus, when circular polarization has equal measurement error to the linear Stokes axes’ measurement error, it takes a bigger measurement of the magnitude of the polarization vector to be significant than the two-dimensional case suggests. The two-dimensional results can perhaps be recovered by some limiting procedure where σ_v → 0 while σ (=σ_q = σ_u) is held constant. This suggests that decreasing the measurement error on one of σ_q, σ_u, or σ_v should decrease the $p$ $\hbox{$\overline{p}$}$ -threshold for a significant measurement. On the other hand, often an instrument measures only linear or only circular polarization. This may correspond to a scenario where the distribution of Stokes parameters is best modeled by assuming a flat (instead of gaussian) distribution for the variables of the undetected type. This would violate our starting assumptions in Eq. (1) and would produce rather different results. As a flat distribution is in an informal sense a “wide” distribution, it is probable that this would raise the detection threshold higher. It should be clear that interpreting polarization data at low signal-to-noise in “human digestible” quantities like degrees of polarization or angle of polarization is difficult.

6. Conclusion

The three-dimensional Stokes sampling distribution in spherical coordinates is given. From it the “Most Likely” and “Most Probable” classical (that is, non-Bayesian) estimators for the “true” Stokes parameters given measured values have been derived. Additionally, a two useful marginalizations of the sampling distribution have been calculated including a higher dimensional analog of the Rice distribution. These results are necessary stepping stones towards a full and proper treatment of polarization measurements.

It is cautioned that a deep understanding of the measurement of Stokes parameters in spherical coordinates like here or the polar coordinates usually used for linear polarization requires a Bayesian approach. The presented results may nonetheless have tolerable accuracy for many experiments.

While the setup of the problem and the presentation of the results has been framed in terms of the study of polarization, the solutions are in fact far more general. They are applicable to the measurement of any “source point” contained in a three-dimensional ball measured with Gaussian error in each Cartesian direction and presented in spherical coordinates. The results could be used to study the location of neutrino production within the Sun or the determination of the epicenter of an earthquake. The results could also influence the interpretation of cosmic microwave background polarization data. Extensions of the work removing the constraint that the error be equal in each direction or that involve correlated variables would be highly desirable.

¹

This notation is fairly common in optical astronomy but other notations are also in use in different disciplines like solar astronomy or optical physics.

²

This somewhat technical assumption does not (especially at low signal-to-noise) imply Q ≈ Q₀, U ≈ U₀, or V ≈ V₀. It is a weaker condition used implicitly (but without recognition) by many important past papers on polarization statistics even though it is crucial for defining the reduced Stokes parameters as, for instance, q ≡ Q/I₀ rather than the problematic q ≡ Q/I. Even more fundamentally, without the condition, studying a distribution over I, Q, U, and V instead of just Q, U, and V would be thrust upon us. That I ≈ I₀ does not imply the other approximations is easy to understand intuitively with back-of-the-envelope calculations by comparing the relative (Poisson) error on the intensity to the relative expected (Poisson) error on the Stokes parameters for some low degree of polarization source and some assumed moderate number of total counts for the measured intensity (like, for instance, a 1% source and 1000 total counts). The cited paper details the proper role of the condition in the theory.

³

In this set of equations, the arctangent function, arctan(y,x), is the two-dimensional version often supported in computer languages. Its value is assumed to vary continuously from −π to + π on the unit circle starting from the negative x-axis counterclockwise again to the negative x-axis. Be extremely careful when implementing this function in computer code. Some languages switch the arguments so that it is arctan(x,y). Make sure your implementation gives valid angles for all quadrants and axes, especially if you use the single argument arctangent.

⁴

The function f′ is a scalar density and therefore gains a factor due to the Jacobian under a coordinate transform.

⁵

This argument, while “intuitive” for understanding the bias correction from a classical statistical perspective, can easily be misapplied in a Bayesian scenario. Nevertheless, when properly used, it is still an important tool to understand Bayesian results. The key difference is that in the classical approach, it applies to a single “true” point producing the measured points, whereas in the Bayesian approach it applies individually to all “possibly true” points (weighted by their likelihood).

⁶

It may seem strange to worry about the ϕ angle’s best estimate if $p_{0, MP} = 0$ $\hbox{$\overline{p}_{\rm 0,MP}=0$}$ but using a “proper” solution may have nicer continuity properties and be better for computer programs than, for instance, simply assuming or defining ϕ_0,MP = π/ 2.

⁷

Some authors define erfc(x) without the $\frac{2}{\sqrt{π}}$ $\hbox{$\frac{2}{\sqrt{\pi}}$}$ factor.

⁸

This is a sensitive result and may require use of a program or library that allows the user to specify a higher level of precision than the native floating point size.

⁹

The zeroth-order term of the expansion is 2sinh(a) /a. If Eq. (23) is true, then it is the only non-zero term. It was proven that all the odd-order coefficients in the expansion are identically zero. It appears that the positive even-order expansion coefficients will also always be identically zero but this remains unproven.

References

Ade, P. A. R., Aikin, R. W., Amiri, M., et al. 2014a, ApJ, 792, 62 [NASA ADS] [CrossRef] [Google Scholar]
Ade, P. A. R., Aikin, R. W., Barkats, D., et al. 2014b, Phys. Rev. Lett., 112, 241101 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
Clarke, D., & Stewart, B. G. 1986, Vistas Astron., 29, 27 [Google Scholar]
del Toro Iniesta, J. C. 2003, Introduction to Spectropolarimetry (Cambridge University Press) [Google Scholar]
Kuo, C.-L., & BICEP3 and POLAR1 Collaborations 2013, in IAU Symp. 288, eds. M. G. Burton, X. Cui, & N. F. H. Tothill, 80 [Google Scholar]
Landi Degl’Innocenti, E. 2002, in Astrophysical Spectropolarimetry, Proc. of the XII Canary Islands Winter Scool of Astrophysics, eds. J. Trujillo-Bueno, F. Moreno-Insertis, & F. Sánchez (Cambridge: Cambridge University Press), 1 [Google Scholar]
Leonard, D. C., Li, W., Filippenko, A. V., Foley, R. J., & Chornock, R. 2005, ApJ, 632, 450 [NASA ADS] [CrossRef] [Google Scholar]
Liu, H., Mertsch, P., & Sarkar, S. 2014, ApJ, 789, L29 [NASA ADS] [CrossRef] [Google Scholar]
Maeda, K., Leloudas, G., Taubenberger, S., et al. 2011, MNRAS, 413, 3075 [NASA ADS] [CrossRef] [Google Scholar]
Mauerhan, J., Williams, G. G., Smith, N., et al. 2014, MNRAS, 442, 1166 [NASA ADS] [CrossRef] [Google Scholar]
Maund, J. R., Höflich, P., Patat, F., et al. 2010a, ApJ, 725, L167 [NASA ADS] [CrossRef] [Google Scholar]
Maund, J. R., Wheeler, J. C., Wang, L., et al. 2010b, ApJ, 722, 1162 [NASA ADS] [CrossRef] [Google Scholar]
Maund, J. R., Spyromilio, J., Höflich, P. A., et al. 2013, MNRAS, 433, L20 [NASA ADS] [CrossRef] [Google Scholar]
Montier, L., Plaszczynski, S., Levrier, F., et al. 2014a [arXiv:1406.6536] [Google Scholar]
Montier, L., Plaszczynski, S., Levrier, F., et al. 2014b [arXiv:1407.0178] [Google Scholar]
Naghizadeh-Khouei, J., & Clarke, D. 1993, A&A, 274, 968 [NASA ADS] [Google Scholar]
Patat, F., Höflich, P., Baade, D., et al. 2012, A&A, 545, A7 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Perlmutter, S., Aldering, G., Goldhaber, G., et al. 1999, ApJ, 517, 565 [NASA ADS] [CrossRef] [Google Scholar]
Planck Collaboration XIX. 2014, A&A, 571, A19 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Quinn, J. L. 2012, A&A, 538, A65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Rice, S. O. 1945, Bell Systems Tech. J., 24, 46 [CrossRef] [Google Scholar]
Riess, A. G., Filippenko, A. V., Challis, P., et al. 1998, AJ, 116, 1009 [NASA ADS] [CrossRef] [Google Scholar]
Simmons, J. F. L., & Stewart, B. G. 1985, A&A, 142, 100 [NASA ADS] [Google Scholar]
Staniszewski, Z., Aikin, R. W., Amiri, M., et al. 2012, J. Low Temp. Phys., 167, 827 [NASA ADS] [CrossRef] [Google Scholar]
Stokes, G. G. 1852, Trans. Cambridge Phil. Soc., 9, 399 [Google Scholar]
Tanaka, M., Kawabata, K. S., Hattori, T., et al. 2012, ApJ, 754, 63 [NASA ADS] [CrossRef] [Google Scholar]
The POLARBEAR Collaboration, Ade, P. A. R., Akiba, Y., Anthony, A. E., et al. 2014, ApJ, 794, 171 [NASA ADS] [CrossRef] [Google Scholar]
Vaillancourt, J. E. 2006, PASP, 118, 1340 [Google Scholar]
Vinokur, M. 1965, Ann. d’Astrophys., 28, 412 [Google Scholar]
Viola, M., Kitching, T. D., & Joachimi, B. 2014, MNRAS, 439, 1909 [NASA ADS] [CrossRef] [Google Scholar]
Wang, L., & Wheeler, J. C. 2008, ARA&A, 46, 433 [NASA ADS] [CrossRef] [Google Scholar]
Wang, L., Wheeler, J. C., & Hoeflich, P. 1997, ApJ, 476, L27 [NASA ADS] [CrossRef] [Google Scholar]
Wang, L., Baade, D., & Patat, F. 2007, Science, 315, 212 [NASA ADS] [CrossRef] [Google Scholar]
Wang, X., Wang, L., Filippenko, A. V., Zhang, T., & Zhao, X. 2013, Science, 340, 170 [NASA ADS] [CrossRef] [Google Scholar]
Wardle, J. F. C., & Kronberg, P. P. 1974, ApJ, 194, 249 [NASA ADS] [CrossRef] [Google Scholar]
Zelaya, P., Quinn, J. R., Baade, D., et al. 2013, AJ, 145, 27 [NASA ADS] [CrossRef] [Google Scholar]

Appendix A: Bounds on Ω(a, ϕ₀)

It helps to have functional bounds on $Ω (a, ϕ_{0}) \equiv \int_{0}^{π} \sin (ϕ) e^{a \cos (ϕ) \cos (ϕ_{0})} I_{0} (a \sin (ϕ) \sin (ϕ_{0})) d ϕ$ $\appendix \setcounter{section}{1} \begin{equation} \Omega(a,\varphi_0) \equiv \int_0^\pi \! \sin(\varphi) {\rm e}^{a \cos(\varphi) \cos(\varphi_0)} I_0(a \sin(\varphi) \sin(\varphi_0)) \, {\rm d}\varphi \label{origeq} \end{equation}$ (A.1)to show, for instance, that it is at least non-infinite over the ϕ₀-domain of interest. There are three factors (sin(ϕ), e^{acos(ϕ)cos(ϕ₀)}, and I₀(asin(ϕ)sin(ϕ₀))) in the integrand, each of which is simple enough that they allow various bounds to be obtained for the total expression.

Appendix A.1: Lower bound

Under variation of ϕ₀, the e^{acos(ϕ)cos(ϕ₀)} factor is minimized when cos(ϕ₀) is the smallest, which occurs for ϕ₀ = π, so that the factor becomes e^−acos(ϕ). Furthermore, the I₀(asin(ϕ)sin(ϕ₀)) factor is the smallest when its argument is zero, which also occurs at ϕ₀ = π (also ϕ₀ = 0) so that the factor become ℐ₀(0) = 1. After these substitutions, the integral for Ω can be performed explicitly: $\int_{0}^{π} \sin (ϕ) e^{- a \cos (ϕ)} d ϕ = \frac{2 \sinh (a)}{a} \cdot$ $\appendix \setcounter{section}{1} \begin{equation} \int_0^\pi \! \sin(\varphi) \, {\rm e}^{-a \cos(\varphi)} {\rm d}\varphi = \frac{2 \sinh(a)}{a}\cdot \end{equation}$ (A.2)For any given value of a, this sets a (constant) global lower bound on the integral: the value of integral must be greater than or equal to $\frac{2 \sinh (a)}{a}$ $\hbox{$\frac{2 \sinh(a)}{a}$}$ for all values of ϕ₀. It is probably also the greatest lower bound for each ϕ₀.

Appendix A.2: Upper bounds

The e^{acos(ϕ)cos(ϕ₀)} factor is maximized when its exponent is the largest while the I₀(asin(ϕ)sin(ϕ₀)) factor is the largest when the magnitude of its argument is as big as possible.

A constant global upper bound is found by modifying the factors in the obvious way: $\int_{0}^{π} \sin (ϕ) e^{a \cos (ϕ)} ℐ_{0} (\pm a) d ϕ = \frac{2 \sinh (a)}{a} ℐ_{0} (a) .$ $\appendix \setcounter{section}{1} \begin{equation} \int_0^\pi \! \sin(\varphi) \, {\rm e}^{a \cos(\varphi)} \mathcal{I}_0(\pm a) \, {\rm d}\varphi = \frac{2 \sinh (a) }{a} \mathcal{I}_0(a). \end{equation}$ (A.3)A stricter (but non-constant) global upper bound can be found merely by allowing sin(ϕ₀) → 1 in the Bessel function factor, $\begin{matrix} \int_{0}^{π} \sin (ϕ) e^{a \cos (ϕ) \cos (ϕ_{0})} ℐ_{0} (a \sin (ϕ)) d ϕ = \frac{2 \sec (ϕ_{0}) \sinh (a \cos (ϕ_{0}))}{a} ℐ_{0} (a) for ϕ_{0} \neq π / 2. \end{matrix}$ $\appendix \setcounter{section}{1} \begin{eqnarray} \int_0^\pi \! \sin(\varphi) \, {\rm e}^{a \cos(\varphi) \cos(\varphi_0)} \mathcal{I}_0(a \sin(\varphi)) \, {\rm d}\varphi = \frac{2 \sec(\varphi_0) \sinh (a \cos(\varphi_0))}{a} \mathcal{I}_0(a) \quad \text{for } \varphi_0 \ne \pi/2. \label{bestbound} \end{eqnarray}$ (A.4)This last solution has functional dependence on ϕ₀ and is not valid for ϕ₀ = π/ 2 because of the sec(ϕ₀) component. For ϕ₀ = 0 this bound is $\frac{2 \sinh (a)}{a} ℐ_{0} (a)$ $\hbox{$\frac{2 \sinh (a) }{a} \mathcal{I}_0(a)$}$ and it decreases smoothly and approaches $\frac{2 \sinh (a)}{a}$ $\hbox{$\frac{2 \sinh(a)}{a}$}$ as ϕ₀ → π/ 2⁺. Similarly, for ϕ₀ = π it is $\frac{2 \sinh (a)}{a} ℐ_{0} (a)$ $\hbox{$\frac{2 \sinh (a) }{a} \mathcal{I}_0(a)$}$ and it decreases smoothly and approaches $\frac{2 \sinh (a)}{a}$ $\hbox{$\frac{2 \sinh(a)}{a}$}$ as ϕ₀ → π/ 2⁻. Of course, if ϕ₀ = π/ 2, Eq. (A.1) can be accomplished directly: $\int_{0}^{π} \sin (ϕ) e^{0} ℐ_{0} (a \sin (ϕ)) d ϕ = \frac{2 \sinh (a)}{a} for ϕ_{0} = π / 2,$ $\appendix \setcounter{section}{1} \begin{equation} \int_0^\pi \sin(\varphi) \, {\rm e}^{0} \mathcal{I}_0(a \sin(\varphi)) \, {\rm d}\varphi = \frac{2 \sinh(a)}{a} \quad \text{for } \varphi_0=\pi/2, \end{equation}$ (A.5)so the discontinuity in Eq. (A.4) at ϕ₀ = π/ 2 can be “removed”. Equation (A.1) can also be performed for ϕ₀ = 0 and ϕ₀ = π and in both cases $\frac{2 \sinh (a)}{a}$ $\hbox{$\frac{2 \sinh(a)}{a}$}$ results, so we have explicit upper bounds at those values too.

Appendix A.3: Best bounds

Combining the previous information, the best bounds are $\frac{2 \sinh (a)}{a} \leq Ω (a, ϕ_{0}) \leq {\begin{matrix} if ϕ_{0} = 0, \frac{π}{2},π \\ otherwise, \end{matrix}$ $\appendix \setcounter{section}{1} \begin{equation} \frac{2 \sinh(a)}{a}\! \le \!\Omega(a,\varphi_0) \!\le\! \begin{cases} \frac{2 \sinh(a)}{a} &\!\!\mbox{if } \varphi_0=0,\frac{\pi}{2},\pi \\ \frac{2 \sec(\varphi_0) \sinh (a \cos(\varphi_0))}{a} \mathcal{I}_0(a) &\!\!\mbox{otherwise,}\vspace{-2mm} \end{cases} \end{equation}$ (A.6)which are finite for all ϕ₀ at a given value of a.

All Figures

	Fig. 1 Illustration of the spherical coordinate system used in the paper. The variable θ is used for the azimuthal angle with a range of (−π,π ], and the variable ϕ is measured from the positive v-axis with a range of [ 0,π ].
In the text

Fig. 2

Visualization of the $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ bias correction field of Eq. (9). The tail of each arrow is located at a measured $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ point and the head is attached to the corresponding “Most Probable” estimate of $(p_{0}, ϕ_{0})$ $\hbox{$(\overline{p}_0,\varphi_0)$}$ . This correction should only be applied to $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ points lying in the blue region given by Eq. (10). See the text for discussion about $(p,ϕ)$ $\hbox{$(\overline{p},\varphi)$}$ points outside the blue region.

In the text

	Fig. 3 Behavior of the bias correction field (Eq. (9)) on the boundary of the blue valid region (Eq. (10)). The left-most point on the boundary of the blue region is at $(\sqrt{2},π / 2)$ $\hbox{$(\!\sqrt2,\pi/2)$}$ .
In the text

Fig. 4

Contour plot of the angular-marginalized $p$ $\hbox{$\overline{p}$}$ -distribution given by Eq. (24). The contours are at 0.1 intervals with the lighter shades corresponding to larger values. The dotted and dashed curves (Eqs. (25) and (26)) trace maximums along horizontal and vertical slices and end at the red points at $(\sqrt{2}, 0)$ $\hbox{$(\sqrt{2},0)$}$ and $(\sqrt{3}, 0)$ $\hbox{$(\sqrt{3},0)$}$ , respectively. Although the “shape” of the distribution is not affected by the value of σ, the plot assumes that σ ≤ 1/4 so that the plotted values for $p_{0}$ $\hbox{$\overline{p}_0$}$ up to 4 (=1 /σ_max) exist.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Ade, P. A. R., Aikin, R. W., Amiri, M., et al. 2014a, ApJ, 792, 62 [NASA ADS] [CrossRef] [Google Scholar]

[2] Ade, P. A. R., Aikin, R. W., Barkats, D., et al. 2014b, Phys. Rev. Lett., 112, 241101 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]

[3] Clarke, D., & Stewart, B. G. 1986, Vistas Astron., 29, 27 [Google Scholar]

[4] del Toro Iniesta, J. C. 2003, Introduction to Spectropolarimetry (Cambridge University Press) [Google Scholar]

[5] Kuo, C.-L., & BICEP3 and POLAR1 Collaborations 2013, in IAU Symp. 288, eds. M. G. Burton, X. Cui, & N. F. H. Tothill, 80 [Google Scholar]

[6] Landi Degl’Innocenti, E. 2002, in Astrophysical Spectropolarimetry, Proc. of the XII Canary Islands Winter Scool of Astrophysics, eds. J. Trujillo-Bueno, F. Moreno-Insertis, & F. Sánchez (Cambridge: Cambridge University Press), 1 [Google Scholar]

[7] Leonard, D. C., Li, W., Filippenko, A. V., Foley, R. J., & Chornock, R. 2005, ApJ, 632, 450 [NASA ADS] [CrossRef] [Google Scholar]

[8] Liu, H., Mertsch, P., & Sarkar, S. 2014, ApJ, 789, L29 [NASA ADS] [CrossRef] [Google Scholar]

[9] Maeda, K., Leloudas, G., Taubenberger, S., et al. 2011, MNRAS, 413, 3075 [NASA ADS] [CrossRef] [Google Scholar]

[10] Mauerhan, J., Williams, G. G., Smith, N., et al. 2014, MNRAS, 442, 1166 [NASA ADS] [CrossRef] [Google Scholar]

[11] Maund, J. R., Höflich, P., Patat, F., et al. 2010a, ApJ, 725, L167 [NASA ADS] [CrossRef] [Google Scholar]

[12] Maund, J. R., Wheeler, J. C., Wang, L., et al. 2010b, ApJ, 722, 1162 [NASA ADS] [CrossRef] [Google Scholar]

[13] Maund, J. R., Spyromilio, J., Höflich, P. A., et al. 2013, MNRAS, 433, L20 [NASA ADS] [CrossRef] [Google Scholar]

[14] Montier, L., Plaszczynski, S., Levrier, F., et al. 2014a [arXiv:1406.6536] [Google Scholar]

[15] Montier, L., Plaszczynski, S., Levrier, F., et al. 2014b [arXiv:1407.0178] [Google Scholar]

[16] Naghizadeh-Khouei, J., & Clarke, D. 1993, A&A, 274, 968 [NASA ADS] [Google Scholar]

[17] Patat, F., Höflich, P., Baade, D., et al. 2012, A&A, 545, A7 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[18] Perlmutter, S., Aldering, G., Goldhaber, G., et al. 1999, ApJ, 517, 565 [NASA ADS] [CrossRef] [Google Scholar]

[19] Planck Collaboration XIX. 2014, A&A, 571, A19 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[20] Quinn, J. L. 2012, A&A, 538, A65 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[21] Rice, S. O. 1945, Bell Systems Tech. J., 24, 46 [CrossRef] [Google Scholar]

[22] Riess, A. G., Filippenko, A. V., Challis, P., et al. 1998, AJ, 116, 1009 [NASA ADS] [CrossRef] [Google Scholar]

[23] Simmons, J. F. L., & Stewart, B. G. 1985, A&A, 142, 100 [NASA ADS] [Google Scholar]

[24] Staniszewski, Z., Aikin, R. W., Amiri, M., et al. 2012, J. Low Temp. Phys., 167, 827 [NASA ADS] [CrossRef] [Google Scholar]

[25] Stokes, G. G. 1852, Trans. Cambridge Phil. Soc., 9, 399 [Google Scholar]

[26] Tanaka, M., Kawabata, K. S., Hattori, T., et al. 2012, ApJ, 754, 63 [NASA ADS] [CrossRef] [Google Scholar]

[27] The POLARBEAR Collaboration, Ade, P. A. R., Akiba, Y., Anthony, A. E., et al. 2014, ApJ, 794, 171 [NASA ADS] [CrossRef] [Google Scholar]

[28] Vaillancourt, J. E. 2006, PASP, 118, 1340 [Google Scholar]

[29] Vinokur, M. 1965, Ann. d’Astrophys., 28, 412 [Google Scholar]

[30] Viola, M., Kitching, T. D., & Joachimi, B. 2014, MNRAS, 439, 1909 [NASA ADS] [CrossRef] [Google Scholar]

[31] Wang, L., & Wheeler, J. C. 2008, ARA&A, 46, 433 [NASA ADS] [CrossRef] [Google Scholar]

[32] Wang, L., Wheeler, J. C., & Hoeflich, P. 1997, ApJ, 476, L27 [NASA ADS] [CrossRef] [Google Scholar]

[33] Wang, L., Baade, D., & Patat, F. 2007, Science, 315, 212 [NASA ADS] [CrossRef] [Google Scholar]

[34] Wang, X., Wang, L., Filippenko, A. V., Zhang, T., & Zhao, X. 2013, Science, 340, 170 [NASA ADS] [CrossRef] [Google Scholar]

[35] Wardle, J. F. C., & Kronberg, P. P. 1974, ApJ, 194, 249 [NASA ADS] [CrossRef] [Google Scholar]

[36] Zelaya, P., Quinn, J. R., Baade, D., et al. 2013, AJ, 145, 27 [NASA ADS] [CrossRef] [Google Scholar]

Into the third dimension: stochastic measurements of Stokes parameters within the Poincaré sphere

1. Introduction

2. The sampling distribution

3. Classical estimators

3.1. The “Most Likely” solution

3.2. The “Most Probable” solution

4. Marginalized distributions

4.1. The p-marginalized angular distribution

4.2. The angular-marginalized p-distribution

5. Discussion

6. Conclusion

References

Appendix A: Bounds on Ω(a, ϕ0)

Appendix A.1: Lower bound

Appendix A.2: Upper bounds

Appendix A.3: Best bounds

All Figures

Appendix A: Bounds on Ω(a, ϕ₀)