System equivalent flux density of a low-frequency polarimetric phased array interferometer

A. T. Sutinjo; D. C. X. Ung; M. Sokolowski; M. Kovaleva; S. McSweeney

doi:10.1051/0004-6361/202142759

Home

All issues

Volume 660 (April 2022)

A&A, 660 (2022) A134

Full HTML

Free Access

Issue		A&A Volume 660, April 2022


Article Number		A134
Number of page(s)		14
Section		Astronomical instrumentation
DOI		https://doi.org/10.1051/0004-6361/202142759
Published online		02 May 2022

A&A 660, A134 (2022)

System equivalent flux density of a low-frequency polarimetric phased array interferometer^★

A. T. Sutinjo, D. C. X. Ung, M. Sokolowski, M. Kovaleva and S. McSweeney

International Centre for Radio Astronomy Research (ICRAR), Curtin University, 6102 Australia
e-mail: adrian.sutinjo@curtin.edu.au

Received: 26 November 2021
Accepted: 30 January 2022

Abstract

Aims. This paper extends the treatment of system equivalent flux density (SEFD), discussed in our earlier paper to interferometric phased array telescopes. The objective is to develop an SEFD formula involving only the most fundamental assumptions that is readily applicable to phased array interferometer radio observations. Our aim is to compare the resultant SEFD expression against the often-used root-mean-square (rms) SEFD approximation, ${SEFD}_{I}^{rms} = \frac{1}{2} \sqrt{{SEFD}_{X X}^{2} + {SEFD}_{Y Y}^{2}}$ ${\rm{SEFD}}_I^{{\rm{rms}}} = {1 \over 2}\sqrt {{\rm{SEFD}}_{XX}^{\rm{2}} + {\rm{SEFD}}_{YY}^{\rm{2}}}$ , to study the inaccuracy of the SEFD_I^rms.

Methods. We take into account all mutual coupling and noise coupling within an array environment (intra-array coupling). This intra-array noise coupling is included in the SEFD expression through the realized noise resistance of the array, which accounts for the system noise. No assumption is made regarding the polarization (or lack thereof) of the sky nor the orthogonality of the antenna elements. The fundamental noise assumption is that, in phasor representation, the real and imaginary components of a given noise source are independent and equally distributed (iid) with zero mean. Noise sources that are mutually correlated and non-iid among themselves are allowed, provided the real and imaginary components of each noise source are iid. The system noise is uncorrelated between array entities separated by a baseline distance, which in the case of the Murchison Widefield Array (MWA) is typically tens of wavelengths or greater. By comparing the resulting SEFD formula to the SEFD_I^rms approximation, we proved that SEFD_I^rms always underestimates the SEFD, which leads to an overestimation of array sensitivity.

Results. We present the resulting SEFD formula that is generalized for the phased array, but has a similar form to the earlier result. Here, the physical meaning of the antenna lengths and the equivalent noise resistances have been generalized such that they are also valid in the array environment. The simulated SEFD was validated using MWA observation of a Hydra-A radio galaxy at 154.88 MHz. The observed SEFD_XX and SEFD_I are on average higher by 9% and 4%, respectively, while the observed SEFD_YY is lower by 4% compared to simulated values for all pixels within the −12 dB beam width. The simulated and observed SEFD errors due to the rms SEFD approximation are nearly identical, with mean difference of images of virtually 0%. This result suggests that the derived SEFD expression, as well as the simulation approach, is correct and may be applied to any pointing. As a result, this method permits identification of phased array telescope pointing angles where the rms approximation underestimates SEFD (overestimates sensitivity). For example, for Hydra-A observation with beam pointing (Az, ZA) = (81°, 46°), the underestimation in SEFD calculation using the rms expression is 7% within the −3 dB beam width, but increases to 23% within the −12 dB beam width. At 199.68 MHz, for the simulated MWA pointing at (Az, ZA) = (45°, 56.96°), the underestimation reached 29% within the −3 dB beam width and 36% within the −12 dB beam width. This underestimation due to rms SEFD approximation at two different pointing angles and frequencies was expected and is consistent with the proof.

Key words: instrumentation: polarimeters / instrumentation: interferometers / techniques: interferometric / telescopes / techniques: polarimetric / methods: observational

^★

The value table is only available at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/cat/J/A+A/660/A134

© ESO 2022

1 Introduction

Our previous work in Sutinjo et al. (2021, hereafter Paper I) discussed a formulation for the sensitivity in terms of system equivalent flux density (SEFD) for a polarimetric interferometer that consists of dual-polarized antennas. In that work we provided an example using a dual-polarized MWA dipole element embedded in an array. In this paper, we extend and generalize the SEFD formulation to polarimetric phased arrays interferometers. This is an important generalization as it is directly applicable to low-frequency interferometric phased array telescopes in operation such as the Murchison Widefield Array (MWA; Tingay et al. 2013) and Low-Frequency Array (LOFAR; van Haarlem et al. 2013), as well as the future Low Frequency Square Kilometre Array (SKA-Low; Labate et al. 2017a,b). In particular regarding the SKA-Low, a clear conceptual understanding of array sensitivity, how it varies over telescope pointing angles, and how to calculate it are crucial for validation against SKA sensitivity requirements (Caiazzo 2017).

The reasons for using SEFD as the valid figure of merit (FoM) for a polarimetric radio interferometer as opposed to antenna effective area on system temperature (A_e/T_sys) was thoroughly reviewed in Paper I. For convenience, we mention the key ideas here. The primary reason is that antenna effective area, A_e, is a number that is defined as matched to the polarization of the incident wave, which is not known in advance for a polarimeter. In contrast, the concept of equivalent flux density is not constrained to the polarization state and it can be readily equated to the system noise. The work in Paper I allowed us to demonstrate that the often-used conversion between A_e/T_sys and SEFD is only an approximation that is valid in certain cases where the Jones matrix is diagonal or anti-diagonal. This is further generalized in the current paper (see Sect. 2.5), where we show that the SEFD approximation is correct only for row vectors in the Jones matrix (see Eq. (1)) that are orthogonal. When the row vectors are not orthogonal, the root-mean-square (rms) SEFD approximation always underestimates the true SEFD.

Furthermore, in our work on this subject we make an explicit connection to observational radio interferometry which is demonstrated by comparison to radio images. We also calculated and conducted a careful review of second-order noise statistics that form the basis for the SEFD formula. These are the main differences between our work and existing work in the radio astronomy phased array sensitivity literature, for example Warnick et al. (2012), Ellingson (2011), Tokarsky et al. (2017), and Sutinjo et al. (2015). However, we acknowledge that there are aspects of our calculations that benefit from the pre-existing collective knowledge in the community, in particular regarding the computation of array response to an incident wave and array noise temperature, as discussed in Sects. 2.1 and 2.3.

The polarimetric phased array under consideration has N dual-polarized elements where each element is connected to a low-noise amplifier (LNA), as is typical in practice. The LNA outputs are connected to the phased array weights and subsequently summed to produce the array output shown in Fig. 1. We consider the array response to an incident electric field from a target direction, e_t. The voltages that represent the response of Array 1 are

$\begin{array}{l} v_{1} |_{t} = {Je}_{t}, \\ [\begin{array}{l} v_{X 1} |_{t} \\ v_{Y 1} |_{t} \end{array}] = [\begin{array}{l} l_{X θ} l_{X ϕ} \\ l_{Y θ} l_{Y ϕ} \end{array}] [\begin{array}{l} E_{t θ} \\ E_{t ϕ} \end{array}], \end{array}$ $\matrix{ {{{\rm{v}}_1}{|_t} = {\bf{J}}{{\bf{e}}_t},} \cr {\left[ {\matrix{ {{V_{X1}}{|_t}} \hfill \cr {{V_{Y1}}{|_t}} \hfill \cr } } \right] = \left[ {\matrix{ {{l_{X\theta }}{l_{x\phi }}} \hfill \cr {{l_{Y\theta }}{l_{Y\phi }}} \hfill \cr } } \right]\left[ {\matrix{ {{E_{t\theta }}} \hfill \cr {{E_{t\phi }}} \hfill \cr } } \right]} \cr }$ (1)

where V_X1 and V_Y1 are the voltages measured by the X and Y arrays, respectively, and J is the Jones matrix of the array; the elements of the Jones matrix are antenna lengths (in meters) that represent the response of the array to each polarization basis of the electric field, E_tθ, E_tϕ (in units of Vm⁻¹). There is a similar equation corresponding to Array 2. For antenna arrays of an identical design, it is reasonable to assume the same Jones matrix as Array 1.

Equation (1) is applicable to an array of antennas. We consider an array of dual-polarized antennas with outputs that are weighted and summed, then connected to a correlator as depicted in Fig. 1, thereby forming an interferometer. The antennas receive noise from the sky which includes a partially polarized target source ${\bar{E}}_{t}$ ${\overline E _{\rm{t}}}$ and a background with noise temperature distribution given by T_sky. For brevity, only four elements are shown per array. The extension to N elements is immediately evident. The LNAs, represented by the triangle gain blocks, produce their own noise which is scattered by the array and picked up by all elements in the array. Similarly, the incident sky signal arriving at the array undergoes the same scattering and coupling process. Therefore, the combined noise voltages seen at the LNA inputs are mutually correlated. We consider these noise sources and the interactions thereof to extend the SEFD formula to the interferometric and polarimetric phased array telescope.

This paper is organized as follows. The SEFD derivation and calculation are presented in Sect. 2. An example SEFD calculation procedure for the MWA is demonstrated in Sect. 3. The SEFD calculation is validated by MWA observations described in Sect. 4, with comparison and discussion given in Sect. 5. The concluding remarks are summarized in Sect. 6. The appendix presents a detailed review of fundamental assumptions and statistical calculations that justify the SEFD formula presented in Sect. 2.

Fig. 1

Arrays of dual-polarized low-frequency antennas observing the sky. The antennas are at fixed positions above the ground, and the z-axis is up. The signal from each element is amplified by an LNA, weighted, and summed. The output of each summation is directed into the correlator. The y-directed elements (black) follow the same signal flow, but are not shown. A four-element array is shown as an example.

2 SEFD of a polarimetric phased array interferometer

2.1 Jones matrix of a dual-polarized array

The calculations in Paper I are reusable for an array, but need reinterpretation and adaptation which we now discuss. The first modification is to take the direction-dependent antenna lengths seen in the array environment (i.e., where each antenna element is connected to the input of an LNA). We call this quantity the antenna’s “realized length” (Ung et al. 2020; Ung 2020) which is different from the open circuit antenna effective length in Paper I.

The realized length has the advantage of having a clear and physical interpretation in an array environment depicted in Fig. 2. It is obtained by summing the embedded element realized lengths to form an overall equivalent realized length for the array. For example, for the X array,

$\begin{array}{l} l_{X θ} = w_{x}^{T} l_{x θ}, \\ l_{X ϕ} = w_{x}^{T} l_{x ϕ}, \end{array}$ $\matrix{ {{l_{X\theta }} = {\bf{w}}_x^T{{\bf{l}}_{x\theta }},} \cr {{l_{X\phi }} = {\bf{w}}_x^T{{\bf{l}}_{x\phi }},} \cr }$ (2)

where $w_{x}^{T} = [w_{1 x}, \dots, w_{4 x}]$ ${\bf{w}}_x^T = [{w_{1x}}, \ldots,{w_{4x}}]$ is the vector of weights and $l_{x θ}^{T} = [l_{1 x θ}, \dots, l_{4 x θ}]$ ${\bf{l}}_{x\theta }^T = [{{\bf{l}}_{1x\theta }}, \ldots,{{\bf{l}}_{4x\theta }}]$ is a vector containing the embedded element realized lengths, and similarly with l_xϕ. The quantities l_1xθ and l_1xϕ are obtained for a dual-polarized embedded element (number 1) with all other elements in the array terminated with the LNA input impedance, Z_LNA, as shown in Fig. 2. The embedded antenna realized length can be obtained through full-wave electromagnetic simulation (Ung et al. 2020) or measurement. Similarly, for the Y array, $\begin{array}{l} l_{Y θ} = w_{y}^{T} l_{y θ}, \\ l_{Y ϕ} = w_{y}^{T} l_{y ϕ} \end{array}$ $\matrix{ {{l_{Y\theta }} = {\bf{w}}_y^T{{\bf{l}}_{y\theta }},} \cr {{l_{Y\phi }} = {\bf{w}}_y^T{{\bf{l}}_{y\phi }},} \cr }$ (3)

where $w_{y}^{T} = [w_{1 y}, \dots, w_{4 y}]$ ${\bf{w}}_y^T = [{w_{1y}}, \ldots,{w_{4y}}]$ is the vector of weights for the Y array, which may differ from w_x. We note that Eqs. (2) and (3) apply to the antenna realized length as we noted earlier, as opposed to the open-circuit length. We assume LNAs of an identical design in this paper, and hence the LNA voltage gains are identical and need not be explicitly shown in Eq. (1).

Given the above discussion regarding the realized length, then for a phased array, the voltages on the left-hand side of Eq. (1) represent those at the outputs of the summation shown in Fig. 1. Therefore, the Jones matrix, J, translates the incident field to the voltages at the outputs of the summation. This is similarly the case for Array 2 in Fig. 1. The entries of J represent the realized lengths of the array (for the X or Y component) to an electric field basis ( $\hat{θ}$ $\widehat \theta$ or $\hat{ϕ}$ $\widehat \phi$ ) such that each entry is a linear combination of the realized lengths of the embedded elements in the array. The Jones matrix then, is $J = [\begin{array}{l} l_{X θ} l_{X ϕ} \\ l_{Y θ} l_{Y ϕ} \end{array}] = [\begin{array}{l} w_{x}^{T} l_{x θ} w_{x}^{T} l_{x ϕ} \\ w_{y}^{T} l_{y θ} w_{y}^{T} l_{y ϕ} \end{array}] .$ ${\bf{J}} = \left[ {\matrix{ {{l_{X\theta }}{l_{X\phi }}} \cr {{l_{Y\theta }}{l_{Y\phi }}} \cr } } \right] = \left[ {\matrix{ {{\bf{w}}_x^T{{\bf{l}}_{x\theta }}{\bf{w}}_x^T{{\bf{l}}_{x\phi }}} \cr {{\bf{w}}_y^T{{\bf{l}}_{y\theta }}{\bf{w}}_y^T{{\bf{l}}_{y\phi }}} \cr } } \right]$ (4)

We adopt the uppercase X or Y to denote the overall array response and lower case x or y for the embedded elements in the array.

Fig. 2

Antenna length calculation for an array, where l_1xθ is the antenna length associated with $\hat{θ}$ $\widehat \theta$ polarized incident field for x-directed antenna in the embedded element number 1 in the array shown, similarly to l_ixθ, l_iyθ, where i is the embedded element number. For brevity we consider a four-element array i = 1, ⋯, 4; the extension to i = 1, ⋯, N presents no complication.

2.2 SEFD expression

The interferometric polarimeter estimates the outer product of the incident electric field in the target direction, which from Paper I is given by $\tilde{e} {\tilde{e}}^{H} = J^{- 1} v_{1} v_{2}^{H} J^{- H}$ $\widetilde {\bf{e}}{\widetilde {\bf{e}}^H} = {{\bf{J}}^{ - 1}}{{\rm{v}}_1}{\rm{v}}_2^H{{\bf{J}}^{ - H}}$ (5)

where J is defined by Eq. (4), which represents the response of the array. The SEFD is proportional to the standard deviation of the flux estimate ${SEFD}_{I} = \frac{SDev (\tilde{I})}{η_{0}} = \frac{SDev [{(\tilde{e} {\tilde{e}}^{H})}_{1, 1} + {(\tilde{e} {\tilde{e}}^{H})}_{2, 2}]}{η_{0}}$ ${\rm{SEF}}{{\rm{D}}_I} = {{{\rm{SDev}}(\tilde I)} \over {{\eta _0}}} = {{{\rm{SDev}}[{{(\widetilde {\rm{e}}{{\widetilde {\rm{e}}}^H})}_{1,1}} + {{(\widetilde {\rm{e}}{{\widetilde {\rm{e}}}^H})}_{2,2}}]} \over {{\eta _0}}}$ (6)

where _1,1,_2,2 indicate the diagonal entries of the matrix in Eq. (5) (hence the subscript I, which refers to Stokes I) and η₀ ≈ 120π Ω is the free space impedance. What is needed is the expression for the standard deviations (SDev) for the phased array interferometer, which we develop next.

Fig. 3

Diagram representing system noise. The array is surrounded by a homogeneous blackbody radiator at temperature T_sys that represents the diffuse sky noise and the LNA noise. The y elements are similarly connected to a separate weight and summing circuit that are not shown for brevity.

2.3 Array system noise

We now compute the standard deviation of the electric field estimates as expressed in Eq. (6) due to the system noise. As shown in Paper I, we expect to express the standard deviation in Eq. (6) in terms of the array realized lengths and the system noise temperatures, T_SyS. Therefore, we need expressions for the mean square voltages for the X and Y arrays due to the system noise.

Figure 3 depicts an array illuminated by an equivalent homogeneous blackbody environment at temperature T_SyS The value of T_SyS is that which equals the noise due to the actual diffuse background sky under observation, T_sky, plus the noise of the LNAs (T_rcv) in the array environment, and the ohmic loss which is a function of the radiation efficiency (η_rad) of the array. The T_SyS value for a phased array can be obtained through computation, measurements, or a combination of the two using well-documented procedures. For example, computation for T_rcv of an array was discussed in the literature (Warnick et al. 2009; Belostotski et al. 2015; Warnick et al. 2018; Ung et al. 2019a, 2020). The computation of T_SyS for the MWA was demonstrated in Ung et al. (2020) and Ung (2020) and was validated by observation. The T_SyS value could also be obtained through measurements by carefully calibrating the array gain via known hot and cold sources (see, e.g., Chippendale et al. 2014).

The T_sys for the X and Y arrays are generally different; this is similarly noted in Paper I for a dual-polarized antenna element. This is expected because for the same sky under observation, the X and Y arrays produce different responses on the sky. Moreover, the x elements and the y elements interact differently depending on the array configuration and weights. Therefore, we anticipate assigning different T_sysX and T_sysY to the respective arrays, which we discuss next.

An equivalent homogeneous sky at T_sys gives rise to partially correlated noise voltages at the antenna ports in the array. A convenient way to quantify this is to start with the case where each antenna port is terminated with an open circuit: Z_LNA → ∞. In this case, the mutual coherence is known from the generalized Nyquist theorem (Twiss 1955; Hillbrand & Russer 1976) $C_{o} |_{T_{sys}} = 〈 v_{o} v_{o}^{H} 〉 |_{T_{sys}} = 4 k T_{sys} Δ f ℜ [Z]$ ${{\rm{C}}_o}{|_{{T_{{\rm{sys}}}}}} = \left\langle {{{\rm{v}}_o}{\rm{v}}_o^H} \right\rangle {|_{{T_{{\rm{sys}}}}}} = 4k{T_{{\rm{sys}}}}\Delta f\Re [{\rm{Z}}]$ (7)

where Δf is the noise bandwidth and ℜ[Z] is the real part of the antenna impedance matrix, which is a real and symmetric matrix. We chose the port numbering convention such that $ℜ [Z] = ℜ [\begin{array}{l} Z_{x x} Z_{x y} \\ Z_{y x} Z_{y y} \end{array}],$ $\Re [{\rm{Z}}] = \Re \left[ {\matrix{ {{{\bf{Z}}_{xx}}{{\bf{Z}}_{xy}}} \cr {{{\bf{Z}}_{yx}}{{\bf{Z}}_{yy}}} \cr } } \right]$ (8)

that is the x_ı, ⋯, x₄ embedded elements ports are numbered 1 to 4 and the y₁, ⋯, y₄ embedded elements ports are numbered 5 to 8, such that the open circuit voltage vector is $v_{o}^{T} = [V_{o x 1}, \dots, V_{o x 4}, V_{o y 1}, \dots, V_{o y 4}]$ ${\bf{v}}_o^T = [{V_{ox1}}, \cdots,{V_{ox4}},{V_{oy1}}, \cdots,{V_{oy4}}]$ . The open circuit voltage vector is related to the voltage seen at the LNA inputs through a transformation matrix (Warnick et al. 2009), which we call T, such that $v = Z_{LNA} {[Z_{LNA} I + Z]}^{- 1} v_{O} = T v_{O}$ ${\rm{v}} = {Z_{{\rm{LNA}}}}{[{Z_{{\rm{LNA}}}}{\bf{I}} + {\bf{Z}}]^{ - 1}}{{\rm{v}}_o} = {\rm{T}}{{\rm{v}}_o}$ (9)

Therefore, the coherence matrix of the voltages seen at the inputs of the LNAs loaded with Z_LNA is $\begin{array}{l} T C_{o} |_{T_{sys}} T^{H} = T 〈 v_{o} v_{o}^{H} 〉 |_{T_{sys}} T^{H} \\ = 4 k T_{sys} Δ f T ℜ [Z] T^{H}, \end{array}$ $\matrix{ {{\rm{T}}{{\rm{C}}_o}{|_{{T_{{\rm{sys}}}}}}{{\rm{T}}^H} = {\bf{T}}\left\langle {{{\rm{v}}_o}{\rm{v}}_o^H} \right\rangle {|_{{T_{{\rm{sys}}}}}}{{\bf{T}}^H}} \cr { = 4k{T_{{\rm{sys}}}}\Delta f{\bf{T}}\Re [{\rm{Z}}]{{\bf{T}}^H}} \cr }$ (10)

which is a Hermitian matrix, since (Tℜ[Z]T^H)^H = Tℜ[Z]T^H.

The sought-after quantities are the mean square voltages after weighting and summing $\begin{matrix} 〈 | V_{X} |^{2} 〉 = [w_{x}^{T}, 0_{4}^{T}] {TC}_{o} |_{T_{sys}_{X}} T^{H} [\begin{array}{l} w_{x}^{*} \\ 0_{4} \end{array}] \\ = 4 k T_{sysX} Δ f w_{x 0}^{T} T ℜ [Z] T^{H} w_{x 0}^{*} \\ 〈 | V_{Y} |^{2} 〉 = [0_{4}^{T}, w_{y}^{T}] T C_{o} |_{T_{sysY}} T^{H} [\begin{array}{l} 0_{4} \\ w_{y}^{*} \end{array}] \\ = 4 k T_{sysY} Δ f w_{y 0}^{T} T ℜ [Z] T^{H} w_{y 0}^{*} \end{matrix}$ $\matrix{ {\left\langle {|{V_X}{|^2}} \right\rangle = [{\bf{w}}_x^T,\;{\bf{0}}_4^T]{\rm{T}}{{\rm{C}}_o}{|_{{T_{{\rm{sys}}}}_{\rm{X}}}}{{\rm{T}}^H}\left[ {\matrix{ {{\bf{w}}_x^*} \hfill \cr {{0_4}} \hfill \cr } } \right]} \cr { = 4k{T_{{\rm{sysX}}}}\Delta f{\bf{w}}_{x0}^T{\bf{T}}\Re [{\rm{Z}}]{{\bf{T}}^H}{\bf{w}}_{x0}^*} \cr {\left\langle {|{V_Y}{|^2}} \right\rangle = [{\bf{0}}_4^T,{\bf{w}}_y^T]{\bf{T}}{{\rm{C}}_o}{|_{{T_{{\rm{sysY}}}}}}{{\bf{T}}^H}\left[ {\matrix{ {{0_4}} \hfill \cr {{\rm{w}}_y^*} \hfill \cr } } \right]} \cr { = 4k{T_{{\rm{sysY}}}}\Delta f{\rm{w}}_{y0}^T{\rm{T}}\Re [{\rm{Z}}]{{\rm{T}}^H}{\bf{w}}_{y0}^*} \cr }$ (11)

where 0₄ is a column vector of four zeros; w_x0, w_y0 reflect the fact that the x and y antennas are summed separately; T_sysX and T_sysY are distinguished as discussed previously; k is the Boltzmann constant; and Δf is the noise bandwidth. We know that Eq. (11) is real because the eigenvalues of a Hermitian matrix are real. For brevity in the SEFD formula, we adopt the following shorthand notations: $\begin{matrix} R_{X} = w_{x 0}^{T} T ℜ [Z] T^{H} w_{x 0}^{*}, \\ R_{Y} = w_{y 0}^{T} T ℜ [Z] T^{H} w_{y 0}^{*} . \end{matrix}$ $\matrix{ {{R_X} = {\bf{w}}_{x0}^T{\bf{T}}\Re [{\rm{Z}}]{{\bf{T}}^H}{\bf{w}}_{x0}^*,} \cr {{R_Y} = {\bf{w}}_{y0}^T{\rm{T}}\Re [{\rm{Z}}]{{\bf{T}}^H}{\bf{w}}_{y0}^*.} \cr }$ (12)

We can think of them as the realized array noise resistances, in units of Ω, representing the noise voltages at the outputs of the summations. A detailed explanation of this quantity and comparison to current literature can be found in Appendix B. We note that Eq. (11) transforms T_sysX, T_sysY from a quantity external to the array to the mean square voltages after weighting and summing by the array. Therefore, the output of the summation is the reference plane at which the SEFD is to be calculated, as we show next.

2.4 SEFD_I formulation

We assume all the antenna arrays in question are of identical design and the coupling between arrays is negligible due to the inter-array distance of tens of wavelengths or larger. The intra-array coupling within each array is accounted for through the process described in Sects. 2.1 and 2.3. This is the same level of assumptions as in Paper I. The main difference between Paper I and this work is that the sensitivity formula is now extended and generalized to the phased array. Following from Paper I, the Stokes I estimate is $\tilde{I} = {(\tilde{e} {\tilde{e}}^{H})}_{1, 1} + {(\tilde{e} {\tilde{e}}^{H})}_{2, 2}$ $\widetilde I = {(\widetilde {\bf{e}}{\widetilde {\bf{e}}^H})_{1,1}} + {(\widetilde {\bf{e}}{\widetilde {\bf{e}}^H})_{2,2}}$ from Eqs. (5) and (6). Expanding the matrix equation, we can write $\begin{matrix} | D |^{2} \tilde{I} = X_{1} X_{2}^{*} (| l_{Y ϕ} |^{2} + | l_{Y θ} |^{2}) \\ + Y_{1} Y_{2}^{*} (| l_{X ϕ} |^{2} + | l_{X θ} |^{2}) \\ - X_{1} Y_{2}^{*} (l_{X ϕ}^{*} l_{Y ϕ} + l_{X θ}^{*} l_{Y θ}) \\ - Y_{1} X_{2}^{*} (l_{X ϕ} l_{Y ϕ}^{*} + l_{X θ} l_{Y θ}^{*}) \end{matrix}$ $\matrix{ {|D{|^2}\widetilde I = {X_1}X_2^*(|{l_{Y\phi }}{|^2} + |{l_{Y\theta }}{|^2})} \cr { + {Y_1}Y_2^*(|{l_{X\phi }}{|^2} + |{l_{X\theta }}{|^2})} \cr { - {X_1}Y_2^*(l_{X\phi }^*{l_{Y\phi }} + l_{X\theta }^*{l_{Y\theta }})} \cr { - {Y_1}X_2^*({l_{X\phi }}l_{Y\phi }^* + {l_{X\theta }}l_{Y\theta }^*)} \cr }$ (13)

where D = l_Xθl_Yϕ − I_Xϕl_Yθ is the determinant of the array Jones matrix; the leading Vs have been suppressed for brevity; for example, X₁ is a complex random variable that refers to V_Xı. The components of the realized lengths (which are direction-dependent complex scalars) now refer to the array, as discussed in Sect. 2.1. Following the statistical calculation described in Appendix A, $\frac{| D |^{4} Var (\tilde{I})}{{(4 k Δ f)}^{2}} = t_{R}^{T} L t_{R}$ ${{|D{|^4}{\rm{Var}}(\tilde I)} \over {{{(4k\Delta f)}^2}}} = {\rm{t}}_R^T{\mathbb{L}}{{\rm{t}}_R}$ (14)

where $t_{R} = [\begin{array}{l} T_{sysX} R_{X} \\ T_{sysY} R_{Y} \end{array}]$ ${{\bf{t}}_R} = \left[ {\matrix{ {{T_{{\rm{sysX}}}}{R_X}} \hfill \cr {{T_{{\rm{sysY}}}}{R_Y}} \hfill \cr } } \right]$ (15)

is a column vector, and the matrix $L = [\begin{matrix} {‖ l_{Y} ‖}^{4} & {| l_{X ϕ}^{*} l_{Y ϕ} + l_{X θ}^{*} l_{Y θ} |}^{2} \\ {| l_{X ϕ}^{*} l_{Y ϕ} + l_{X θ}^{*} l_{Y θ} |}^{2} & {‖ l_{X} ‖}^{4} \end{matrix}],$ ${\mathbb{L}} = \left[ {\matrix{ {{{\left\| {{{\bf{l}}_Y}} \right\|}^4}} & {{{\left| {l_{X\phi }^*{l_{Y\phi }} + l_{X\theta }^*{l_{Y\theta }}} \right|}^2}} \cr {{{\left| {l_{X\phi }^*{l_{Y\phi }} + l_{X\theta }^*{l_{Y\theta }}} \right|}^2}} & {{{\left\| {{{\bf{l}}_X}} \right\|}^4}} \cr } } \right],$ (16)

where the vector norms are ${‖ l_{X} ‖}^{2} = {| l_{X θ} |}^{2} + {| l_{X ϕ} |}^{2}$ ${\left\| {{{\bf{l}}_X}} \right\|^2} = {\left| {{l_{X\theta }}} \right|^2} + {\left| {{l_{X\phi }}} \right|^2}$ and similarly with ‖l_Y‖². We note in Eq. (14) that R_X and R_Y differ and generally cannot be factored out. Again, taking $\sqrt{Var(\tilde{I})}$ $\sqrt {{\rm{Var(}}\widetilde I{\rm{)}}}$ from Eq. (14) and dividing by the free space impedance, η₀, produces the desired formula ${SEFD}_{I} = \frac{SDev (\tilde{I})}{η_{0}} = \frac{4 k Δ f}{η_{0}} \frac{\sqrt{t_{R}^{T} L t_{R}}}{| D |^{2}}$ ${\rm{SEF}}{{\rm{D}}_I} = {{{\rm{SDev}}(\tilde I)} \over {{\eta _0}}} = {{4k\Delta f} \over {{\eta _0}}}{{\sqrt {{\rm{t}}_R^T{\mathbb{L}}{{\bf{t}}_R}} } \over {|D{|^2}}}$ (17)

with units Wm⁻². If SEFD_I is stated in Wm⁻² Hz⁻¹, which is often preferred in radio astronomy, then we remove Δf from the right-hand side.

Fig. 4

Depiction of the vectors, their projections, and the area of the parallelogram.

2.5 rms approximation always underestimates SEFD

As mentioned in Sect. 1, we now demonstrate the claim that the very commonly used rms approximation in radio astronomy (see, e.g., Wrobel & Walker 1999) always underestimates the true SEFD. The rms approximation is given by ${SEFD}_{I}^{rms} = \frac{1}{2} \sqrt{{SEFD}_{X X}^{2} + {SEFD}_{Y Y}^{2},}$ ${\rm{SEFD}}_I^{{\rm{rms}}} = {1 \over 2}\sqrt {{\rm{SEFD}}_{XX}^2 + {\rm{SEFD}}_{YY}^2,}$ (18)

where SEFD_XX, SEFD_YY assume an unpolarized source, as discussed in Paper I. For the notations and approach discussed in the current paper, this approximation can be written as $\begin{matrix} {SEFD}_{I}^{rms} = \frac{4 k}{η_{0}} \sqrt{\frac{T_{sysX}^{2} R_{X}^{2}}{| | l_{X} | |^{4}} + \frac{T_{sysY}^{2} R_{Y}^{2}}{| | l_{Y} | |^{4}}} \\ = \frac{4 k}{η_{0}} \sqrt{t_{R}^{T} D_{rms} t_{R}} \end{matrix}$ $\matrix{ {{\rm{SEFD}}_I^{{\rm{rms}}} = {{4k} \over {{\eta _0}}}\sqrt {{{T_{{\rm{sysX}}}^2R_X^2} \over {||{{\bf{l}}_X}|{|^4}}} + {{T_{{\rm{sysY}}}^2R_Y^2} \over {||{{\bf{l}}_Y}|{|^4}}}} } \cr { = {{4k} \over {{\eta _0}}}\sqrt {{\bf{t}}_R^T{{\bf{D}}_{{\rm{rms}}}}{{\bf{t}}_R}} } \cr }$ (19)

as pointed out in Paper I. In addition, Appendix C shows that Eq. (19) is derivable using A/T. The diagonal matrix D_rms = diag $[{‖ l_{X} ‖}^{- 4}, {‖ l_{Y} ‖}^{- 4}]$ $\left[ {{{\left\| {{{\bf{l}}_X}} \right\|}^{ - 4}},{{\left\| {{{\bf{l}}_Y}} \right\|}^{ - 4}}} \right]$ is different from $L / {| D |}^{4}$ ${\mathbb{L}}/{\left| D \right|^4}$ in Eq. (17). It can be shown that $t_{R}^{T} \frac{L}{| D |^{4}} t_{R} \geq t_{R}^{T} D_{rms} t_{R}$ ${\bf{t}}_R^T{{\mathbb{L}} \over {|D{|^4}}}{{\bf{t}}_R} \ge {\bf{t}}_R^T{{\bf{D}}_{{\rm{rms}}}}{{\bf{t}}_R}$ (20)

The reason for this is explained as follows. It can be shown through the Gram-Schmidt orthogonalization steps or the related QR factorization (Strang 2016, see Chap. 4) of the Jones matrix, that the absolute value of the determinant can be written as $| D | = ‖ l_{Y} ‖ ‖ e ‖$ $\left| D \right| = \left\| {{{\bf{l}}_Y}} \right\|\left\| {\rm{e}} \right\|$ (21)

where e is the projection vector of l_X onto a line which is perpendicular to l_Y (see Fig. 4),

$e = l_{X} - p$ ${\rm{e}} = {{\bf{l}}_X} - {\rm{p}}$ (22)

where $p = (l_{Y}^{H} l_{X}) l_{Y} / {‖ l_{Y} ‖}^{2}$ ${\bf{p}} = ({\bf{l}}_Y^H{{\bf{l}}_X}){{\bf{l}}_Y}/{\left\| {{{\bf{l}}_Y}} \right\|^2}$ is the orthogonal projection vector of l_X onto l_Y (Strang 2016, see Chap. 4), such that e and p are orthogonal, e ⊥ p. Equation (21) is easily verified by substitution of l_X = [l_Xθ, l_Xϕ]^T and l_Y = [l_Yθ, l_Yϕ]^T into Eq. (22) and simplifying the vector algebra. There is also an insightful geometric interpretation of Eq. (21): |D| is the area formed by the parallelogram whose parallel sides are represented by vectors l_X and l_Y, as shown in Fig. 4; this is known from the volume property of determinants (Strang 2016, see Chap. 5) and is useful for understanding Eq. (23).

Since ‖e‖ ≤‖l_X‖, then $\begin{array}{l} Since ‖ e ‖ \leq ‖ l_{X} ‖, then \\ | D |^{2} \leq {‖ l_{X} ‖}^{2} {‖ l_{Y} ‖}^{2} \end{array}$ $\matrix{ {\quad {\rm{Since}}\left\| {\rm{e}} \right\| \le \left\| {{{\bf{l}}_X}} \right\|,{\rm{then}}} \hfill \cr {|D{|^2} \le {{\left\| {{{\bf{l}}_X}} \right\|}^2}{{\left\| {{{\bf{l}}_Y}} \right\|}^2}} \hfill \cr }$ (23)

The equality is fulfilled when the vectors are orthogonal, l_Y ⊥ l_X, which also implies p = 0, e = l_X, and $l_{X}^{H} l_{Y}$ ${\bf{l}}_X^H{{\bf{l}}_Y}$ . This is fully

expected given the above-mentioned geometric interpretation and proves Eq. (20). To see this more clearly, we write $\frac{L}{| D |^{4}} = \frac{1}{{‖ l_{X} - p ‖}^{4} {‖ l_{Y} ‖}^{4}} [\begin{array}{l} {‖ l_{Y} ‖}^{4} & {‖ l_{_{X}}^{H} l_{Y} ‖}^{2} \\ {‖ l_{_{X}}^{H} l_{Y} ‖}^{2} & {‖ l_{X} ‖}^{4} \end{array}]$ ${{\mathbb{L}} \over {|D{|^4}}} = {1 \over {{{\left\| {{{\bf{l}}_X} - {\rm{p}}} \right\|}^4}{{\left\| {{{\bf{l}}_Y}} \right\|}^4}}}\left[ {\matrix{ {{{\left\| {{{\bf{l}}_Y}} \right\|}^4}} \hfill & {{{\left\| {{\bf{l}}_{_X}^H{{\bf{l}}_Y}} \right\|}^2}} \hfill \cr {{{\left\| {{\bf{l}}_{_X}^H{{\bf{l}}_Y}} \right\|}^2}} \hfill & {{{\left\| {{{\bf{l}}_X}} \right\|}^4}} \hfill \cr } } \right]$ (24)

and $D_{rms} = \frac{1}{{‖ l_{X} ‖}^{4} {‖ l_{Y} ‖}^{4}} [\begin{array}{l} {‖ l_{Y} ‖}^{4} & 0 \\ 0 & {‖ l_{X} ‖}^{4} \end{array}]$ ${{\rm{D}}_{{\rm{rms}}}} = {1 \over {{{\left\| {{{\bf{l}}_X}} \right\|}^4}{{\left\| {{{\bf{l}}_Y}} \right\|}^4}}}\left[ {\matrix{ {{{\left\| {{{\bf{l}}_Y}} \right\|}^4}} \hfill & 0 \hfill \cr 0 \hfill & {{{\left\| {{{\bf{l}}_X}} \right\|}^4}} \hfill \cr } } \right]$ (25)

We note that the denominator on the right-hand side of Eq. (24), ${‖ l_{X} - p ‖}^{4} {‖ l_{Y} ‖}^{4} \leq {‖ l_{X} ‖}^{4} {‖ l_{Y} ‖}^{4}$ ${\left\| {{{\bf{l}}_X} - {\bf{p}}} \right\|^4}{\left\| {{{\bf{l}}_Y}} \right\|^4} \le {\left\| {{{\bf{l}}_X}} \right\|^4}{\left\| {{{\bf{l}}_Y}} \right\|^4}$ , which indeed proves Eq. (20). Furthermore, it is evident that the matrix in Eq. (24) converges to D_rms in Eq. (25) when the orthogonality, l_Y ⊥ l_X, is fulfilled. Since in practice orthogonality is approached, but never fulfilled exactly, the statement that the rms approximation always underestimates SEFD is demonstrably correct.

It is important to note that antenna length is a direction-dependent quantity. Therefore orthogonality, l_Y ⊥ l_X, as discussed in this section must be evaluated by taking the inner product, ${|_{X}^{H} |}_{Y}$ ${\left| {_X^H} \right|_Y}$ , in every direction of arrival of interest. This is particularly applicable to a wide field-of-view antenna system. For example, mechanically orthogonal cross dipoles only possess antenna length orthogonality in the directions of arrival on the cardinal planes (as discussed in Paper I). The rms approximation is only exact in the directions of arrivals for which antenna length orthogonality, ${|_{X}^{H} |}_{Y} = 0$ ${\left| {_X^H} \right|_Y} = 0$ , is fulfilled.

Following the proof, we can then quantify this difference by defining a relative percentage difference of the two results as $Δ {SEFD}_{I} = \frac{{SEFD}_{I} - {SEFD}_{I}^{rms}}{{SEFD}_{I}} \times 100 % \geq 0$ $\Delta {\rm{SEF}}{{\rm{D}}_I} = {{{\rm{SEF}}{{\rm{D}}_I} - {\rm{SEFD}}_I^{{\rm{rms}}}} \over {{\rm{SEF}}{{\rm{D}}_I}}} \times 100\% \ge 0$ (26)

We will use Eq. (26) in Sect. 5 to measure the difference between rms approximation and true SEFD based on simulation and observation, the expected value of which are always positive (for observation data, the “expected value” refers to the ensemble mean).

3 SEFD simulation procedure

Following the previous section, the steps to computing the SEFD of a polarimetric phased array interferometer are as follows. The first step involves the computation of the array Jones matrix as shown in Eq. (4). The components of the matrix were constructed from embedded antenna realized lengths as shown in Eqs. (2) and (3), which in our case were obtained from electromagnetic simulation using Altair FEKO¹. Additional information regarding the simulation setup and results used in this paper are discussed in Sokolowski et al. (2017). The embedded antenna realized lengths were found by converting (E_θ, E_ϕ) using (Ung et al. 2020) $\begin{array}{l} l_{n θ} (θ, ϕ) = - j \frac{2 λ}{η_{0} V_{t}} Z_{LNA} E_{θ, n} (θ, ϕ), \\ l_{n ϕ} (θ, ϕ) = - j \frac{2 λ}{η_{0} V_{t}} Z_{LNA} E_{ϕ, n} (θ, ϕ), \end{array}$ $\matrix{ {{l_{n\theta }}(\theta,\;\phi ) = - j{{2\lambda } \over {{\eta _0}{V_{\rm{t}}}}}{Z_{{\rm{LNA}}}}{E_{\theta,n}}(\theta,\;\phi ),} \hfill \cr {{l_{n\phi }}(\theta,\phi ) = - j{{2\lambda } \over {{\eta _0}{V_{\rm{t}}}}}{Z_{{\rm{LNA}}}}{E_{\phi,n}}(\theta,\;\phi ),} \hfill \cr }$ (27)

where λ is the wavelength, Z_LNA is the input impedance of the low-noise amplifier (LNA), and V_t is the port excitation voltage used during the simulation which generates the corresponding electric far-field components E_θ,n and E_ϕ,n of the n-th embedded element as a function of direction in the sky.

In the second step we determine the system temperatures of the array, T_SySX and T_SySY, using $T_{sys, p} = T_{ant, p} + T_{rcv, p} + T_{0} (1 - η_{rad, p})$ ${T_{{\rm{sys}},{\rm{p}}}} = {T_{{\rm{ant}},{\rm{p}}}} + {T_{{\rm{rcv}},{\rm{p}}}} + {T_0}(1 - {\eta _{{\rm{rad}},{\rm{p}}}})$ (28)

where T_ant,p is the antenna temperature due to the sky for each polarization p (p = X or Y); T_rcv,p is the receiver noise temperature due to the LNA calculated using the methodology discussed in Ung et al. (2020) based on measured LNA noise parameters (Sutinjo et al. 2018); η_rad,p is the radiation efficiency of the array; and T₀ = 290 K is the reference temperature (which can be substituted by ambient temperature if known). Radiation efficiency, η_rad, is the ratio of the total radiated power to the total injected power into the array calculated using methodologies presented in Warnick et al. (2010) and Ung et al. (2019b).

In the third step we compute the realized array noise resistances, R_X and R_Y, using Eq. (12). Fourth, we calculate $t_{R}^{T} L t_{R}$ ${\rm{t}}_R^T{\bf{L}}{t_R}$ in Eq. (14). Finally, we calculate the polarimetric and interferomet-ric array SEFD using Eq. (17). As an example, we provide several key parameters required to evaluate Eq. (17) at three selected frequency points for an MWA tile pointed at azimuth and zenith angles, Az = 80.54° and ZA = 46.15°, in Table 1. A full list of these values (excluding the realized length) can be found at the CDS. In the next section, we describe the procedure used to obtain SEFD_I from observational data.

Table 1

Calculated parameters required to compute SEFD_I for the array pointing at the Hydra-A source (Az = 81°, ZA = 46°) on 2014-12-26 at 16:05:43 UTC.

4 SEFD measurement procedure

The data analysis procedure is based on a very similar analysis performed earlier (see Paper I). The relative difference $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ defined by Eq. (26) was measured using MWA observations tracking Hydra-A radio galaxy recorded in a standard observing mode at the central frequency 154.88 MHz. On 2014 December 26 between 16:05:42 and 05:13:42 UTC, the MWA recorded 23 112 s observations centered on the Hydra-A radio galaxy at a position (Az, ZA) = (80.7°, 45.8°) and (Az,ZA) = (290.7°, 32.4°) at the start and end of the observations, respectively. For the purpose of a comparison of ΔSEFD_I between the data and simulations, 52 s of the first 112 s observation (obsID 1103645160 referred to later as ObsA), which started at 201412-26 16:05:42 UTC with Hydra-A at (Az,ZA) = (80.7°, 45.8°), were analyzed. This observation was selected at a pointing direction where the simulation predicted that ΔSEFD_I would be the most prominent due to the lowest elevation of the pointing direction at gridpoint 114 at the elevation of ≈43.85° (zA = 46.15°). We note that MWA observations significantly lower than 45° are generally not routinely performed and were not considered for the presented analysis due to systematic effects which could affect the data quality.

4.1 Calibration and imaging

The MWA data were converted into CASA measurement sets (McMullin et al. 2007) in 40 kHz frequency (total of 768 channels) and 1 s time resolution, and downloaded using the MWA All-Sky Virtual Observatory interface (Sokolowski et al. 2020). In order to avoid aliasing effects at the edges of 1.28 MHz coarse channels (24 in total), 160 kHz (4 fine channels) on each end of the coarse channels were excluded, which reduced the observing bandwidth to R_bw =0.75 fraction of the full recorded band (30.72 MHz).

The same observation ObsA was used for calibration. It was sufficient to use a single-source sky model composed of Hydra-A alone, which was the dominant source in the observed field. However, due to the extended structure of Hydra-A, which is resolved by the longest baselines of the MWA Phase 1, it was necessary to use its model by Lane et al. (2014) converted to the required multi-component format by Hurley-Walker (in prep.)². The total flux density of Hydra-A in this model is within 7% of the most recent and accurate flux measurements of Hydra-A by Perley & Butler (2017) over the entire observing band (139.5 –169.0 MHz). Selecting an appropriate model of Hydra-A was important to ensure the correct flux scale of the calibrated visibilities and images, and ultimately of the SEFD values. The calibration was performed and applied using the CALI-BRATE and APPLYSOLUTIONS software, which are part of the MWA-reduce software package (Offringa et al. 2016) routinely used for MWA data reduction. The CALIBRATE program uses the 2016 MWA beam model (Sokolowski et al. 2017) to calculate the Jones matrices and apparent fluxes of the sources (just one in this particular case) included in the sky model. Baselines shorter than 58 wavelengths were excluded from the calibration. The phases of the resulting calibration solutions were fitted with a linear function, while the amplitudes were fitted with a fifth-order polynomial; the resulting fitted calibration solutions were then applied to the un-calibrated visibilities using the APPLYSOLUTIONS program.

Sky images with 4096 × 4096 pixels of angular size ≈0.6arcmin (image size ≈ 42.2° × 42.2°), in 1 s time resolution, were formed from all correlations products (XX, YY, XY, and YX)³ using the WSCLEAN⁴ program (Offringa et al. 2014). Natural weighting (robust weighting parameter defined by Briggs 1995 set to +2) was used in order to preserve full sensitivity of the array. We note that natural weighting increases the confusion noise, but it was eliminated by measuring the noise from difference images (see Sect. 4.2). The dirty maps were CLEANed with 100 000 iterations and a threshold of 1.2 standard deviations of the local noise. In order to reduce the contribution from extended emission, baselines shorter than 30 wavelengths were excluded, which led to a small reduction in the of number of baselines used in the imaging to 7466, which is R_bs ≈ 0.92 fraction of all the baselines (8128). Corrections for a reduced number of baselines and observing bandwidth were taken into account in the later analysis by applying correction factors (R_bw and R_bs) during the conversion from standard deviation of the noise to SEFD.

The resulting XX and YY images (see left image in Fig. 5) were divided by corresponding images of the beam in X and Y polarizations generated with the 2016 beam model (Sokolowski et al. 2017) at the same pointing direction ((Az, ZA) = (80.5°, 46.2°) at the MWA gridpoint 114⁵). All the resulting images (XX, YY, XY, and YX) were converted to Stokes images (I, Q, U, and V) also using the 2016 beam model. The final products resulting from the above procedure were three sets (XX, YY, and Stokes I) of n_t = 53 primary beam corrected images corresponding to 53 s of the analyzed MWA data.

Fig. 5

Example 0.5 s sky and difference images in X polarization (scale of the color bar is in Jy) used to calculate SEFD. Left: 0.5 s sky image. Center: difference between two consecutive 0.5 s images. Right: difference between two consecutive 0.5 s images corrected for the primary beam (divided by the primary beam in the X polarization). In order to calculate noise in every direction in the sky within the field of view (noise map), the standard deviation of the noise was calculated in small regions around each pixel in the difference image and divided by $\sqrt{2}$ $\sqrt 2$ . These noise maps were later converted into SEFD using Eq. (29). The same procedure was applied to Y and is very similar to Stokes I polarizations (in this case the correction for the primary beam was more complex than the simple division by the corresponding primary beam).

4.2 Measuring the SEFD from the noise in the sky images

Using the three series (corresponding to XX, YY, and Stokes I) of n_t = 53 images, difference images (between the subsequent i-th and i - 1 image) were calculated, resulting in n_d = 52 difference images in each of the XX, YY, and Stokes I polarizations. Difference imaging effectively removes confusion noise resulting in uniform images of thermal noise (entirely due to the system temperature T_sys). The difference images were visually inspected and verified to have a uniform noise-like structure (see examples in the middle and right panels of Fig. 5). Therefore, the resulting standard deviation is purely due to the instrumental and sky noise (T_sys).

The standard deviation of the noise calculated in a small region (here a circular region of radius R_n = 10 pixels corresponding to ≈5 synthesized beams) around pixels in the resulting difference images is distributed around a mean of zero and is not contaminated by the variations in the flux density within the circular regions due to astronomical sources contained inside these regions. The interquartile range divided by 1.35 was used as a robust estimator of standard deviation, which is more robust against outlier data points due to radio-frequency interference (RFI) or residuals of astronomical sources in the difference images. Once this calculation was performed around all the pixels in a difference image, an image of the noise over the field of view (FoV) was created. These images of the noise are later referred to as noise maps or just N (which stands for a 2D image and not a number). This procedure was applied to n_d all-sky difference images resulting in n_d noise maps (N_I, N_XX, and N_YY) for each of the polarizations (Stokes I, XX, and YY, respectively). The corresponding SEFD images (2D maps of SEFD) of the entire FoV were calculated from these noise maps (referred to in general as N) according to the equation $SEFD = N \sqrt{Δ v Δ t N_{b}}$ ${\rm{SEFD}} = N\sqrt {\Delta v\Delta t{N_{\rm{b}}}}$ (29)

where Δt = 1 s is the integration time, Δv = 30.72 × R_bw MHz is the observing bandwidth corrected for reduced number of channels (R_bw) used in the analysis, and N_b = 7466 is the number of baselines used in the imaging.

Noise maps in the three polarizations, N_I, N_XX, and N_YY, were converted into corresponding SEFD_I, SEFD_XX, and SEFD_YY images using Eq. (29), resulting in n_d SEFD images in each polarization (I, XX, and YY). Then, median SEFD images $\bar{{SEFD}_{I},} \bar{S E F D_{X X}}$ $\overline {{\rm{SEF}}{{\rm{D}}_I},} \overline {SEF{D_{XX}}}$ , and $\bar{S E F D_{Y Y}}$ $\overline {SEF{D_{YY}}}$ were calculated out of individual n_d SEFD images (a median image of SEFD_I will be shown in Sect. 5.1). These median images were used to calculate $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ $Δ {SEFD}_{I}^{obs} = \frac{(\bar{{SEFD}_{I}} - \frac{1}{2} \sqrt{{\bar{{SEFD}_{X X}}}^{2} + {\bar{{SEFD}_{Y Y}}}^{2}})}{\bar{{SEFD}_{I}}}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}} = {{\left( {\overline {{\rm{SEF}}{{\rm{D}}_I}} - {1 \over 2}\sqrt {{{\overline {{\rm{SEF}}{{\rm{D}}_{XX}}} }^2} + {{\overline {{\rm{SEF}}{{\rm{D}}_{YY}}} }^2}} } \right)} \over {\overline {{\rm{SEF}}{{\rm{D}}_I}} }}$ (30)

In the next section we provide the results derived from simulated and observed data, and discuss their implications.

4.3 SEFD as a function of frequency

The method described in Sect. 4.1 allows us to determine spatial structure of SEFD by calculating SEFD values in each pixel of 4096 × 4096 sky images. This method was successfully used in Paper I, where a single dipole in each MWA tile was enabled and the remaining 15 dipoles were terminated. However, it has never been applied to measure the SEFD from MWA images resulting from observations in a standard mode with all dipoles in MWA tiles enabled. Therefore, before comparison with the simulations, an initial verification was performed by deriving SEFD values using the well-tested method described in Appendix C (Sutinjo et al. 2015). This method uses the standard deviation of calibrated visibilities to measure SEFD_XX and SEFD_YY as a function of frequency in the direction of Hydra-A source (no spatial information); we note that only the SEFD of instrumental polarizations can be measured using this method, but not SEFD_I. These measurements were used as a reference to confirm that the image-based SEFD values were correct. Furthermore, this method allowed us to compare the frequency dependence of SEFD_XX and SEFD_YY against the simulations. Figure 6 shows the comparison between the observed and simulated results. At the location in the sky corresponding to (Az, ZA) = (80°, 46°), the realized area of the X-polarized dipole is significantly smaller than the Y-polarized dipole, and therefore SEFD_XX is higher than SEFD_YY.

At 154.88 MHz, observed SEFD_XX = 78 ± 7.8 kJy and SEFD_YY = 57.6 ± 3.6 kJy, which is in very good agreement with the values obtained from the imaging method. Overall, there is excellent agreement between simulated and observed SEFD_XX/YY over a wide range of frequencies.

Fig. 6

SEFD in the pointing direction of gridpoint 114 with realized area calculated at (Az, ZA) = (80°, 46°) as a function of frequency. The observed SEFD calculated from calibrated visibilities is compared with simulated SEFD.

5 Results and discussion

5.1 Comparison between the data and simulation

In order to ensure that we are able to draw accurate conclusions regarding the difference in SEFD values calculated using the rms approximation in Eq. (18) and the true SEFD using Eq. (17), we compared the simulated SEFD with the measurements. The SEFD of the X, Y, and I using the difference imaging method probes the SEFD over multiple locations in the sky for a given frequency (154.88 MHz). Similarly, the simulated SEFD_XX/YY images were generated using A/T formulation while the simulated SEFD_I image was generated using Eq. (17). Figure 7 shows the SEFD obtained from simulation (Fig. 7a), the Stokes I image (Fig. 7b), and the difference in percentage between the two (Fig. 7c). For reference, contours of the normalized beam pattern for each polarization are displayed in 3 dB increments down to the –12 dB level. The contours of the beam pattern shown in the SEFD_I are obtained by normalizing the reciprocal of Eq. (17). A similar comparison was done for the X and Y polarization, but for brevity the results are included in Appendix D.

The relative difference in percentage, denoted M_SEFD, between the simulated and observed SEFD in all images (XX, YY, and I) was computed as $M_{SEFD} = \frac{{SEFD}^{obs} - {SEFD}^{sim}}{{SEFD}^{sim}} \times 100 %$ ${M_{{\rm{SEFD}}}} = {{{\rm{SEF}}{{\rm{D}}^{{\rm{obs}}}} - {\rm{SEF}}{{\rm{D}}^{{\rm{sim}}}}} \over {{\rm{SEF}}{{\rm{D}}^{{\rm{sim}}}}}} \times 100\%$ (31)

We analyzed the errors using histograms, and the observed SEFD_XX and SEFD_I are on average higher by 9 and 4%, respectively, while the observed SEFD_YY is lower by 4% compared to simulated values for all pixels within the −12 dB beam width. Overall, the agreement between the simulated and observed values is within the ±10% range which indicates excellent correspondence between simulated and observed SEFD data.

Following the verification of simulated SEFD with observed data, we were able to calculate the error between the rms and proposed method using Eq. (26). Figures 8a,b show this error for an MWA tile for simulated and observed data, respectively. We can see that the difference between ${SEFD}_{I}^{rms}$ ${\rm{SEFD}}_I^{{\rm{rms}}}$ and SEFD_I is not a uniform offset in the image, but has a noticeable structure. For the observation within the −3 dB beam width, the error of SEFD prediction using the rms approximation is only 7%. However, it increases as we move farther away from the beam center and reaches 23% in the −12 dB beam width. We note that for errors in the simulated values, $Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$ is always positive, meaning the rms method underestimates the SEFD, which is consistent with the discussion in Sect. 2.5.

Figure 8c shows the subtraction of $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ and $Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$ calculated as $Δ_{sim / obs} = Δ {SEFD}_{I}^{obs} - Δ {SEFD}_{I}^{sim}$ ${\Delta _{{\rm{sim}}/{\rm{obs}}}} = \Delta {\rm{SEFD}}_I^{{\rm{obs}}} - \Delta {\rm{SEFD}}_I^{{\rm{sim}}}$ (32)

The resulting mean difference for all pixels within the −12 dB beam width is zero, which allows us to draw the following conclusions: (i) the negative values seen in $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ are due to noise and (ii) we can fully predict the amount of underestimation (% errors) in SEFD_I produced by the rms method. This indicates that Eq. (17) more accurately predicts the array’s SEFD_I.

Fig. 7

(a) Simulated SEFD_I, (b) observed SEFD_I, and (c) percentage difference between the simulated and observed SEFD_I at 154.88 MHz calculated by Eq. (31).

5.2 Computation of SEFD_I for diagonal plane and low elevation pointing angle

There was a noticeable structure in the result shown in Fig. 8; however, we note that for that observation time, frequency, and pointing angle the difference in SEFD formulation would probably be hard to notice within the 3 dB beam width, and indeed in some cases only images in the target field down to −3 dB are created. Nevertheless, these results show that the improved expression of SEFD_I provides highly accurate SEFD values, and thus can be used to precisely calculate the sensitivity of a radio telescope for any pointing angle within the operating frequency range using simulations.

To this end, it is reasonable to continue our analysis based only on simulated results. We completed an iterative search aimed to find a case where ΔSEFD_I affects the data within −3 dB beam width. Figure 9 shows one outcome of this iterative search. It was found for Az = 45°, ZA = 56.96° (33.04° elevation angle) at 199.68 MHz that the rms SEFD approximation results in a 29% error within the −3 dB beam width and increases up to 36% within the −12 dB beam width. This example further demonstrates the inaccuracies of the rms approximation, which always predicts lower SEFD than the actual value.

Fig. 8

Relative difference between the (a) simulated and (b) observed SEFD_I and (c) the difference of $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ and $Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$

6 Conclusion

This work, which is an extension of Paper I, provides a generalized expression of SEFD_I for interferometric phased array polarimeters. As in Paper I, this expression was derived by performing statistical analysis on the standard deviation of the flux density estimate. Our current work further clarifies that the SEFD_I expression does not depend on assumptions made regarding the background polarization or orthogonality in the antenna elements. The key array parameters were obtained from the full-wave electromagnetic simulation of the phased array of interest, which is a MWA tile in this example, and was used to compute SEFD_I given by Eq. (17). The mean percentage difference $(M {SEFD}_{I})$ $\left({M{\rm{SEF}}{{\rm{D}}_I}} \right)$ between simulated and observed SEFD_I is 4%, which indicates excellent agreement. Furthermore, we provided proof that the rms approximation always underestimates the SEFD_I. This proof also demonstrates that the equality between the rms approximation and SEFD_I is reached when the row vectors in the Jones matrix are orthogonal, which is a condition that can be approached but never fulfilled in practice.

After verifying the accuracy of our simulation with the observed data, we proceeded to compare the relative percentage difference in SEFD_I, as defined in Eq. (26), calculated by the often-used rms approximation and Eq. (17). A large percentage difference here would indicate that the rms approximation underestimates the SEFD_I. We saw that for our chosen observation, the maximum simulated error $Δ {SEFD}_{I}^{sim} = 7 %$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}} = 7\%$ within the –3 dB beam width and increases to 23% within the –12 dB beam width. This result was also verified against the observed data $(Δ {SEFD}_{I}^{obs})$ $\left({\Delta {\rm{SEFD}}_I^{{\rm{obs}}}} \right)$ by taking the absolute difference between the two images (Figs. 8a,b) defined in Eq. (32). The resulting difference between simulated and observed ΔSEFD_I has a zero mean value, which indicates a remarkable agreement between the results.

As we had thoroughly validated our simulation against observed data, we then performed an iterative search for pointing angles and frequencies that would yield a higher difference between the rms approximation and Eq. (17). We found that for beam pointing (Az, ZA) = (45°, 56.96°) at 199.68 MHz the rms approximation produces an error of 29% in $Δ {SEFD}_{I}^{s i m}$ $\Delta {\rm{SEFD}}_I^{sim}$ within the –3 dB beam width, which increases to 36% within the –12 dB beam width. This outcome is in agreement with the prediction made in Paper I whereby the difference increases at Az = 45^° at low elevation angles.

We conclude that the derived SEFD_I expression improves the fundamental understanding of instrument performance and can be used to accurately calculate sensitivity not only at the principal planes, but also at the diagonal planes (Az = 45^°) and low elevation angles. This enables us to more confidently predict sensitivity for detection of pulsars, fast radio bursts (FRBs), or epoch of reionization (EoR) signal. This is particularly important for the cases where the target sources can only be observed at very low elevations (e.g., in response to alerts about FRBs or any other transient sources).

Fig. 9

$Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$ computed for 199.68 MHz at (Az, ZA) = (45°, 56.96°) showing the impact of the error resulting from the rms expression of SEFD_I.

Acknowledgements

This scientific work makes use of the Murchison Radio-astronomy Observatory (MRO), operated by CSIRO. We acknowledge the Wajarri Yamatji people as the traditional owners of the Observatory site. Support for the operation of the MWA is provided by the Australian Government (NCRIS), under a contract to Curtin University administered by Astronomy Australia Limited. This work was further supported by resources provided by the Pawsey Supercomputing Centre with funding from the Australian Government and the Government of Western Australia. The authors thank A/Prof. R. B. Wayth and Prof. D. Davidson for discussions on this topic and for reviewing the draft manuscript. The authors thank Dr. Natasha Hurley-Walker for providing the model of the Hydra-A calibrator source.

Appendix A Statistical Analysis of Var $(\tilde{I})$ $\left({\tilde I} \right)$

The statistical reasoning and the vanishing covariance are similar those presented in the Appendix in Paper I. In the current paper, we were able to reduce the assumptions only to the most fundamental ones, namely zero mean Gaussian noise with independent and identically distributed real and imaginary parts (in phasor domain) and zero mutual coherence of system noise between two arrays forming a baseline. We do not assume the polarization of the sky, the orthogonality of the elements in the array, or the lack of correlation of noise complex noise sources. The right-hand side of Eq. (13) has the form $W = a X_{1} X_{2}^{*} - z X_{1} Y_{2}^{*} - z^{*} Y_{1} X_{2}^{*} + b Y_{1} Y_{2}^{*}$ $W = a{X_1}X_2^* - z{X_1}Y_2^* - {z^*}{Y_1}X_2^* + b{Y_1}Y_2^*$ (A.1)

where X₁, X₂, Y₁, Y₂ are complex random variables representing the voltages seen at the outputs of the summation; $a = {| l_{Y ϕ} |}^{2} + {| l_{Y θ} |}^{2}, b = {| l_{X ϕ} |}^{2} + {| l_{X θ} |}^{2}$ $a = {\left| {{l_{Y\phi }}} \right|^2} + {\left| {{l_{Y\theta }}} \right|^2},b = {\left| {{l_{X\phi }}} \right|^2} + {\left| {{l_{X\theta }}} \right|^2}$ are real constants and $z = l_{X ϕ}^{*} l_{Y ϕ} + l_{X θ}^{*} l_{Y θ}$ $z = l_{X\phi }^*{l_{Y\phi }} + l_{X\theta }^*{l_{Y\theta }}$ is a complex constant. The variance is $\begin{matrix} Var (W) = Var (a X_{1} X_{2}^{*} - z X_{1} Y_{2}^{*} - z^{*} Y_{1} X_{2}^{*} + b Y_{1} Y_{2}^{*}), \\ = | a |^{2} Var (X_{1} X_{2}^{*}) + | z |^{2} Var (X_{1} Y_{2}^{*}) \\ + | z |^{2} Var (Y_{1} X_{2}^{*}) + | b |^{2} Var (Y_{1} Y_{2}^{*}) + 2 C \end{matrix}$ $\matrix{ {{\rm{Var}}(W) = {\rm{Var}}\left({a{X_1}X_2^* - z{X_1}Y_2^* - {z^*}{Y_1}X_2^* + b{Y_1}Y_2^*} \right),} \cr { = |a{|^2}{\rm{Var}}\left({{X_1}X_2^*} \right) + |z{|^2}{\rm{Var}}\left({{X_1}Y_2^*} \right)} \cr { + |z{|^2}{\rm{Var}}\left({{Y_1}X_2^*} \right) + |b{|^2}{\rm{Var}}\left({{Y_1}Y_2^*} \right) + 2C} \cr }$ (A.2)

where C is the covariance of cross terms. The conjugation signs (_*) have been included here for clarity, for example in the Var $(X_{1} X_{2}^{*})$ $\left({{X_1}X_2^*} \right)$ term. We begin by considering this term $\begin{matrix} Var (X_{1} X_{2}^{*}) = 〈 {| X_{1} X_{2}^{*} - {X_{1} X_{2}^{*}} |}^{2} 〉 \\ = 〈 (X_{1} X_{2}^{*} - 〈 X_{1} X_{2}^{*} 〉) (X_{1}^{*} X_{2} - 〈 X_{1}^{*} X_{2} 〉) 〉 \\ = 〈 X_{1} X_{2}^{*} X_{1}^{*} X_{2} 〉 - 〈 X_{1} X_{2}^{*} 〈 X_{1}^{*} X_{2} 〉 〉 \\ - 〈 〈 X_{1} X_{2}^{*} 〉 X_{1}^{*} X_{2} 〉 + 〈 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉 〉 . \end{matrix}$ $\matrix{ {{\rm{Var}}({{\rm{X}}_1}X_2^*) = \left\langle {{{\left| {{X_1}X_2^* - \left\{ {{X_1}X_2^*} \right\}} \right|}^2}} \right\rangle } \cr { = \left\langle {\left({{X_1}X_2^* - \left\langle {{X_1}X_2^*} \right\rangle } \right)\left({X_1^*{X_2} - \left\langle {X_1^*{X_2}} \right\rangle } \right)} \right\rangle } \cr { = \left\langle {{X_1}X_2^*X_1^*{X_2}} \right\rangle - \left\langle {{X_1}X_2^*\left\langle {X_1^*{X_2}} \right\rangle } \right\rangle } \cr {\quad - \left\langle {\left\langle {{X_1}X_2^*} \right\rangle X_1^*{X_2}} \right\rangle + \left\langle {\left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle } \right\rangle.} \cr }$ (A.3)

We recognize that the last two terms of the last line cancel out because $\begin{matrix} 〈 〈 X_{1} X_{2}^{*} 〉 X_{1}^{*} X_{2} 〉 = 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉 \\ 〈 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉 〉 = 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉 \end{matrix}$ $\matrix{ {\left\langle {\left\langle {{X_1}X_2^*} \right\rangle X_1^*{X_2}} \right\rangle = \left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle } \cr {\left\langle {\left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle } \right\rangle = \left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle } \cr }$ (A.4)

This leaves us with $Var (X_{1} X_{2}^{*}) = 〈 X_{1} X_{2}^{*} X_{1}^{*} X_{2} 〉 - 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉$ ${\rm{Var}}\left({{X_1}X_2^*} \right) = \left\langle {{X_1}X_2^*X_1^*{X_2}} \right\rangle - \left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle$ (A.5)

For the first term on the right-hand side of Eq. (A.5) we apply the formula for the zero-mean joint Gaussian random variable Z_{1,2, 3, 4} (Thompson et al. 2017; Baudin 2015): $\begin{matrix} 〈 Z_{1} Z_{2} Z_{3} Z_{4} 〉 = 〈 Z_{1} Z_{2} 〉 〈 Z_{3} Z_{4} 〉 + 〈 Z_{1} Z_{3} 〉 〈 Z_{2} Z_{4} 〉 \\ + 〈 Z_{1} Z_{4} 〉 〈 Z_{2} Z_{3} 〉 \end{matrix}$ $\matrix{ {\langle {Z_1}{Z_2}{Z_3}{Z_4}\rangle = \langle {Z_1}{Z_2}\rangle \langle {Z_3}{Z_4}\rangle + \langle {Z_1}{Z_3}\rangle \langle {Z_2}{Z_4}\rangle } \cr { + \langle {Z_1}{Z_4}\rangle \langle {Z_2}{Z_3}\rangle } \cr }$ (A.6)

Therefore, $\begin{matrix} 〈 X_{1} X_{2}^{*} X_{1}^{*} X_{2} 〉 = 〈 X_{1} X_{2}^{*} 〉 〈 X_{1}^{*} X_{2} 〉 + 〈 {| X_{1} |}^{2} 〉} 〈 {| X_{2} |}^{2} 〉 \\ + 〈 X_{1} X_{2} 〉 〈 X_{1}^{*} X_{2}^{*} 〉 \end{matrix}$ $\matrix{ {\left\langle {{X_1}X_2^*X_1^*{X_2}} \right\rangle = \left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_1^*{X_2}} \right\rangle + \left\langle {{{\left| {{X_1}} \right|}^2}} \right\rangle \} \left\langle {{{\left| {{X_2}} \right|}^2}} \right\rangle } \cr { + \left\langle {{X_1}{X_2}} \right\rangle \left\langle {X_1^*X_2^*} \right\rangle } \cr }$ (A.7)

The first term on the right-hand side of Eq. (A.7) cancels the last term in Eq. (A.4), leaving us with $Var (X_{1} X_{2}^{*}) = 〈 {| X_{1} |}^{2} 〉 〈 {| X_{2} |}^{2} 〉 + 〈 X_{1} X_{2} 〉 〈 X_{1}^{*} X_{2}^{*} 〉$ ${\rm{Var}}({X_1}X_2^*) = \left\langle {{{\left| {{X_1}} \right|}^2}} \right\rangle \left\langle {{{\left| {{X_2}} \right|}^2}} \right\rangle + \left\langle {{X_1}{X_2}} \right\rangle \left\langle {X_1^*X_2^*} \right\rangle$ (A.8)

The last term on the right-hand side of Eq. (A.8) contains X₁ X₂ that are not conjugated with each other in the expectation operation 〈.〉, where each X₁, X₂ represents complex noise. We now show that we can write this as $\begin{matrix} 〈 X Y 〉 = 〈 (ℜ_{X} + j ℑ_{X}) (ℜ_{Y} + j ℑ_{Y}) 〉 \\ = 〈 ℜ_{X} ℜ_{Y} 〉 - 〈 ℑ_{X} ℑ_{Y} 〉 + j 〈 ℜ_{X} ℑ_{Y} 〉 \\ + j 〈 ℜ_{Y} ℑ_{X} 〉 \end{matrix}$ $\matrix{ {\langle XY\rangle = \langle \left({{\Re _X} + j{\Im _X}} \right)\left({{\Re _Y} + j{\Im _Y}} \right)\rangle } \cr { = \langle {\Re _X}{\Re _Y}\rangle - \langle {\Im _X}{\Im _Y}\rangle + j\langle {\Re _X}{\Im _Y}\rangle } \cr { + j\langle {\Re _Y}{\Im _X}\rangle } \cr }$ (A.9)

For independent real and imaginary parts, the terms $〈 ℜ_{X} ℑ_{Y} 〉, 〈 ℜ_{Y} ℑ_{X} 〉$ $\left\langle {{\Re _X}{\Im _Y}} \right\rangle,\left\langle {{\Re _Y}{\Im _X}} \right\rangle$ vanish. Furthermore, for identical correlation in the real part and in the imaginary part, we have $〈 ℜ_{X} ℜ_{Y} 〉 - 〈 ℑ_{X} ℑ_{Y} 〉 = 0$ $\left\langle {{\Re _X}{\Re _Y}} \right\rangle - \left\langle {{\Im _X}{\Im _Y}} \right\rangle = 0$ . These are consistent with zero-mean Gaussian noise representing thermal noise. Under these conditions, $〈 X_{1} X_{2} 〉 = 〈 X_{1}^{*} X_{2}^{*} 〉 = 0$ $\left\langle {{X_1}{X_2}} \right\rangle = \left\langle {X_1^*X_2^*} \right\rangle = 0$ . This leaves us with the key result $\begin{matrix} Var (X_{1} X_{2}^{*}) = 〈 {| X_{1} |}^{2} 〉 〈 {| X_{2} |}^{2} 〉 \\ = {(4 k Δ f)}^{2} T_{s y s X 1} R_{X 1} T_{s y s X 2} R_{X 2} \end{matrix}$ $\matrix{ {{\rm{Var}}({X_1}X_2^*) = \left\langle {{{\left| {{X_1}} \right|}^2}} \right\rangle \left\langle {{{\left| {{X_2}} \right|}^2}} \right\rangle } \cr { = {{(4k\Delta f)}^2}{T_{sysX1}}{R_{X1}}{T_{sysX2}}{R_{X2}}} \cr }$ (A.10)

With the same reasoning, it can be shown that $\begin{matrix} Var (X_{1} Y_{2}^{*}) = 〈 {| X_{1} |}^{2} 〉 〈 {| Y_{2} |}^{2} 〉 \\ = {(4 k Δ f)}^{2} T_{s y s X 1} R_{X 1} T_{s y s Y 2} R_{Y 2} \\ Var (Y_{1} X_{2}^{*}) = 〈 {| Y_{1} |}^{2} 〉 〈 {| X_{2} |}^{2} 〉 \\ = {(4 k Δ f)}^{2} T_{s y s Y 1} R_{Y 1} T_{s y s X 2} R_{X 2}, \\ Var (Y_{1} Y_{2}^{*}) = 〈 {| Y_{1} |}^{2} 〉 〈 {| Y_{2} |}^{2} 〉 \\ = {(4 k Δ f)}^{2} T_{s y s Y 1} R_{Y 1} T_{s y s Y 2} R_{Y 2} \end{matrix}$ $\matrix{ {{\rm{Var}}({X_1}Y_2^*) = \left\langle {{{\left| {{X_1}} \right|}^2}} \right\rangle \left\langle {{{\left| {{Y_2}} \right|}^2}} \right\rangle } \cr { = {{(4k\Delta f)}^2}{T_{sysX1}}{R_{X1}}{T_{sysY2}}{R_{Y2}}} \cr {{\rm{Var}}({Y_1}X_2^*) = \left\langle {{{\left| {{Y_1}} \right|}^2}} \right\rangle \left\langle {{{\left| {{X_2}} \right|}^2}} \right\rangle } \cr { = {{(4k\Delta f)}^2}{T_{sysY1}}{R_{Y1}}{T_{sysX2}}{R_{X2}},} \cr {{\rm{Var}}({Y_1}Y_2^*) = \left\langle {{{\left| {{Y_1}} \right|}^2}} \right\rangle \left\langle {{{\left| {{Y_2}} \right|}^2}} \right\rangle } \cr { = {{(4k\Delta f)}^2}{T_{sysY1}}{R_{Y1}}{T_{sysY2}}{R_{Y2}}} \cr }$ (A.11)

We note that only the zero-mean joint Gaussian random variable and iid real and imaginary parts are needed to obtain the results thus far. Also, for antenna arrays of an identical design, it is reasonable to let R_X1 = R_X2 = R_X and T_sysX1 = T_sysX2 = T_sysX, and similarly with Y.

In Eq. (A.11) the terms $Var (Y_{1} X_{2}^{*})$ ${\rm{Var}}\left({{Y_1}X_2^*} \right)$ and $Var (Y_{1} Y_{2}^{*})$ ${\rm{Var}}\left({{Y_1}Y_2^*} \right)$ contribute to the overall variance in Eq. (A.2). The terms $Var (Y_{1} X_{2}^{*})$ ${\rm{Var}}\left({{Y_1}X_2^*} \right)$ and $Var (Y_{1} Y_{2}^{*})$ ${\rm{Var}}\left({{Y_1}Y_2^*} \right)$ are as significant as $Var (X_{1} X_{2}^{*})$ ${\rm{Var}}\left({{X_1}X_2^*} \right)$ and $Var (Y_{1} Y_{2}^{*})$ ${\rm{Var}}\left({{Y_1}Y_2^*} \right)$ , but the contributions of the former are scaled by |z|². The |z|² term, in turn, becomes increasingly appreciable in the diagonal scan plane of the phased array with decreasing elevation angles. Moreover, this | z|² term is neglected in the RMS approximation of SEFD, which contributes to the underestimation.

Next, we consider the C term: $\begin{matrix} C = - a z Cov (X_{1} X_{2}^{*}, X_{1} Y_{2}^{*}) - a z^{*} Cov (X_{1} X_{2}^{*}, Y_{1} X_{2}^{*}) \\ + a b Cov (X_{1} X_{2}^{*}, Y_{1} Y_{2}^{*}) + | z |^{2} Cov (X_{1} Y_{*}^{*}, Y_{1} X_{2}^{*}) \\ - b z Cov (X_{1} Y_{2}^{*}, Y_{1} Y_{2}^{*}) - b z^{*} Cov (Y_{1} X_{2}^{*}, Y_{1} Y_{2}^{*}) \end{matrix}$ $\matrix{ {C = - az{\rm{Cov}}({X_1}X_2^*,{X_1}Y_2^*) - a{z^*}{\rm{Cov}}({X_1}X_2^*,\;{Y_1}X_2^*)} \cr { + ab{\rm{Cov}}({X_1}X_2^*,\;{Y_1}Y_2^*) + |z{|^2}{\rm{Cov}}({X_1}Y_*^*,\;{Y_1}X_2^*)} \cr { - bz{\rm{Cov}}({X_1}Y_2^*,\;{Y_1}Y_2^*) - b{z^*}{\rm{Cov}}({Y_1}X_2^*,\;{Y_1}Y_2^*)} \cr }$ (A.12)

So far there is no difference with the corresponding expressions in Paper I. We consider the covariance terms next, starting with $Cov (X_{1} X_{2}^{*}, X_{1} Y_{2}^{*}) = 〈 X_{1} X_{2}^{*} X_{1} Y_{2}^{*} 〉 - 〈 X_{1} X_{2}^{*} 〉 〈 X_{1} Y_{2}^{*} 〉$ ${\rm{Cov}}({{\rm{X}}_1}X_2^*,{X_1}Y_2^*) = \left\langle {{X_1}X_2^*{X_1}Y_2^*} \right\rangle - \left\langle {{X_1}X_2^*} \right\rangle \left\langle {{X_1}Y_2^*} \right\rangle$ (A.13)

Again, applying Eq. (A.6) to the first term on the right-hand side of Eq. (A.13), we get $\begin{matrix} 〈 X_{1} X_{2}^{*} X_{1} Y_{2}^{*} 〉 = 〈 X_{1} X_{2}^{*} 〉 〈 X_{1} Y_{2}^{*} 〉 + 〈 X_{1} X_{1} 〉 〈 X_{2}^{*} Y_{2}^{*} 〉 \\ + 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} X_{1} 〉 \end{matrix}$ $\matrix{ {\left\langle {{X_1}X_2^*{X_1}Y_2^*} \right\rangle = \left\langle {{X_1}X_2^*} \right\rangle \left\langle {{X_1}Y_2^*} \right\rangle + \langle {X_1}{X_1}\rangle \left\langle {X_2^*Y_2^*} \right\rangle } \cr { + \left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{X_1}} \right\rangle } \cr }$ (A.14)

As a result, $\begin{matrix} Cov (X_{1} X_{2}^{*}, X_{1} Y_{2}^{*}) = 〈 X_{1} X_{1} 〉 〈 X_{2}^{*} Y_{2}^{*} 〉 + 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} X_{1} 〉 \\ = 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} X_{1} 〉 \end{matrix}$ $\matrix{ {{\rm{Cov}}({X_1}X_2^*,{X_1}Y_2^*) = \langle {X_1}{X_1}\rangle \left\langle {X_2^*Y_2^*} \right\rangle + \left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{X_1}} \right\rangle } \cr { = \left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{X_1}} \right\rangle } \cr }$ (A.15)

The last line is due to $〈 X_{1} X_{1} 〉 = 〈 X_{2}^{*} Y_{2}^{*} 〉 = 0$ $\left\langle {{X_1}{X_1}} \right\rangle = \left\langle {X_2^*Y_2^*} \right\rangle = 0$ , as discussed with regard to Eq. (A.9). Following the same reasoning, $\begin{matrix} Cov (X_{1} X_{2}^{*}, Y_{1} X_{2}^{*}) = 〈 X_{1} X_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉, \\ Cov (X_{1} X_{2}^{*}, Y_{1} Y_{2}^{*}) = 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉, \\ Cov (X_{1} Y_{2}^{*}, Y_{1} X_{2}^{*}) = {〈 X_{1} X_{2}^{*} 〉} 〈 Y_{2}^{*} Y_{1} 〉, \\ Cov (X_{1} Y_{2}^{*}, Y_{1} Y_{2}^{*}) = 〈 X_{1} Y_{2}^{*} 〉 〈 Y_{2}^{*} Y_{1} 〉, \\ Cov (Y_{1} X_{2}^{*}, Y_{1} Y_{2}^{*}) = 〈 Y_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉 \end{matrix}$ $\matrix{ {{\rm{Cov}}({{\rm{X}}_1}X_2^*,\;{Y_1}X_2^*) = \left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle,} \cr {{\rm{Cov}}({X_1}X_2^*,\;{Y_1}Y_2^*) = \left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle,} \cr {{\rm{Cov}}({X_1}Y_2^*,\;{Y_1}X_2^*) = \{ \left\langle {{X_1}X_2^*} \right\rangle \} \left\langle {Y_2^*{Y_1}} \right\rangle,} \cr {{\rm{Cov}}({X_1}Y_2^*,\;{Y_1}Y_2^*) = \left\langle {{X_1}Y_2^*} \right\rangle \left\langle {Y_2^*{Y_1}} \right\rangle,} \cr {{\rm{Cov}}({Y_1}X_2^*,\;{Y_1}Y_2^*) = \left\langle {{Y_1}Y_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle } \cr }$ (A.16)

Putting these results together into Eq. (A.12) yields $\begin{matrix} C = - a z 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} X_{1} 〉 - a z^{*} 〈 X_{1} X_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉 \\ + a b 〈 X_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉 + | z |^{2} 〈 X_{1} X_{2}^{*} 〉 〈 Y_{2}^{*} Y_{1} 〉 \\ - b z 〈 X_{1} Y_{2}^{*} 〉 〈 Y_{2}^{*} Y_{1} 〉 - b z^{*} 〈 Y_{1} Y_{2}^{*} 〉 〈 X_{2}^{*} Y_{1} 〉 \end{matrix}$ $\matrix{ {C = - az\left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{X_1}} \right\rangle - a{z^*}\left\langle {{X_1}X_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle } \cr { + ab\left\langle {{X_1}Y_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle + |z{|^2}\left\langle {{X_1}X_2^*} \right\rangle \left\langle {Y_2^*{Y_1}} \right\rangle } \cr { - bz\left\langle {{X_1}Y_2^*} \right\rangle \left\langle {Y_2^*{Y_1}} \right\rangle - b{z^*}\left\langle {{Y_1}Y_2^*} \right\rangle \left\langle {X_2^*{Y_1}} \right\rangle } \cr }$ (A.17)

This is the point where the second assumption is introduced. We note in Eq. (A.18) that we are left with factors with differing subscripts: 〈_{_1,_2}〉. These factors vanish assuming independent zero mean noise in Arrays 1 and 2. This is reasonable for low-frequency phased array radio telescopes since the dominant Galactic noise decays rapidly for baselines of tens of wavelengths (Sutinjo et al. 2015).

We validated our statistical reasoning by calculating various mean correlation products using the complex voltages measured at two neighboring MWA tiles (baseline ≈ 14 m), in both polarizations (i.e., X₁, Y₁, X₂, Y₂). The complex voltages were collected using the Voltage Capture System (Tremblay et al. 2015), at a frequency/time resolution of 10 kHz/100μs. The tile beam was pointed at Az/El of 51.3/40.6 deg. Figure A.1 shows the cumulative statistics for one (arbitrary) channel, written out at a one-second cadence. As predicted by the above formalism, the only significantly non-zero products are the autocorrelations; all other combinations converge rapidly to zero. In the final analysis C = 0 in Eq. (A.12) is well justified, which is the same conclusion reached in the Appendix in Paper I. However, the reasoning here only involves the most fundamental assumptions, as reviewed in this Appendix.

Fig. A.1

Magnitudes of various cumulative statistics of data recorded with the MWA Voltage Capture System (Tremblay et al. 2015) at two neighboring tiles. All except the auto-correlations 〈|X|²〉 = 〈X*X〉 converge rapidly to zero.

Appendix B Linking R to Active Reflection Coefficient and τ

The quantity R_X and R_Y are the realized noise resistance of the array such that the noise voltage per $\sqrt{Δ f}$ $\sqrt {\Delta f}$ across the input impedance of the LNA can be calculated using $v_{n, X} = \sqrt{4 k T R_{X}}$ ${v_{n,X}} = \sqrt {4kT{R_X}}$ , and thus the total power delivered per unit bandwidth is given by $P_{d e l, ext} = ℜ {\frac{| v_{n, X} |^{2}}{Z_{_{L N A}}^{*}}} .$ ${P_{del,{\rm{ext}}}} = \Re \left\{ {{{|{v_{n,X}}{|^2}} \over {Z_{_{LNA}}^*}}} \right\}.$ (B.1)

The key point is that all the complex signal paths taken by the noise signal to reach the summing junction that exists due to mutual coupling are completely captured in RX. Additionally, we can show that R_X can be calculated using already-established quantities such as τ_X and the active impedance $z_{a c t u, X}^{'}$ $z_{actu,X}^'$ calculated from the active reflection coefficient.

The quantity τ derived in Ung (2020) is a ratio of the power delivered to the LNA input impedance due to the homogeneous sky to the available power (kT). Therefore, we can easily convert τX to RX and vice versa using $R_{X} = \frac{τ_{X}}{4 ℜ {\frac{1}{Z_{L N A}^{*}}}} .$ ${R_X} = {{{\tau _X}} \over {4\Re \left\{ {{1 \over {Z_{LNA}^*}}} \right\}}}.$ (B.2)

Alternatively, the active reflection coefficient of an antenna element in an array as defined in Belostotski et al. (2015) can be used to calculate R_X. The active reflection coefficient accounts for all the coupling signal paths and combines it into a single equivalent signal path.

Hence, the power for each equivalent branch can be calculated using $P_{d e l, i} = \frac{4 k T}{N} {| \frac{z_{LNA} \sqrt{ℜ {{Z^{'}}_{a c t v i^{'}}}}}{Z_{LNA} + {Z^{'}}_{a c t v, i}} |}^{2} ℜ {\frac{1}{Z_{_{L N A}}^{*}}},$ ${P_{del,i}} = {{4kT} \over N}{\left| {{{{z_{{\rm{LNA}}}}\sqrt {\Re \{ {{Z'}_{actv{i^\prime }}}\} } } \over {{Z_{{\rm{LNA}}}} + {{Z'}_{actv,i}}}}} \right|^2}\Re \left\{ {{1 \over {Z_{_{LNA}}^*}}} \right\},$ (B.3)

and thus $R_{X} = \frac{1}{N} \sum_{i = 1}^{N} | \frac{Z_{LNA}}{Z_{LNA} + {Z^{'}}_{a c t v, i}} | ℜ {{Z^{'}}_{a c t v, i}},$ ${R_X} = {1 \over N}\sum\limits_{i = 1}^N {\left| {{{{Z_{{\rm{LNA}}}}} \over {{Z_{{\rm{LNA}}}} + {{Z'}_{actv,i}}}}} \right|} \Re \{ {Z'_{actv,i}}\},$ (B.4)

where N is the number of antenna elements and $z_{a c t v, i}^{'}$ $z_{actv,i}^'$ is the active impedance for a given polarization obtained from the active reflection coefficient of the i-th element.

Figure B.1 shows R_X as a function of frequency calculated using three methods. Equations (B.2) and (B.4) yield the same quantity as equation (12).

Fig. B1

Comparison of R_x calculated using three methods. The solid curve was obtained using equation (12), while the data points represented by the circle and triangle markers are obtained using (B.2) and (B.4), respectively.

Appendix C Relation Between RMS Approximation and A/T

As demonstrated by Ung (2020), SEFD_XX/YY can be calculated using the realized area, Ar, and thus $\begin{array}{l} {SEFD}_{I}^{r m s} = \frac{1}{2} \sqrt{{SEFD}_{X X}^{2} + {SEFD}_{Y Y}^{2}} \\ = \frac{1}{2} k \sqrt{{(\frac{τ_{X} T_{sysX}}{A_{r X}})}^{2} + {(\frac{τ_{Y} T_{sysY}}{A_{r Y}})}^{2}} \end{array}$ $\matrix{{{\rm{SEFD}}_I^{rms} = {1 \over 2}\sqrt {{\rm{SEFD}}_{XX}^2 + {\rm{SEFD}}_{YY}^2} } \hfill \cr { = {1 \over 2}k\sqrt {{{\left({{{{\tau _X}{T_{{\rm{sysX}}}}} \over {{A_{rX}}}}} \right)}^2} + {{\left({{{{\tau _Y}{T_{{\rm{sysY}}}}} \over {{A_{rY}}}}} \right)}^2}} } \hfill \cr }$ (C.1)

and $A_{r} = \frac{η_{0}}{2} \frac{ℜ {Z_{LNA}}}{| Z_{LNA} |^{2}} ‖ 1 ‖^{2},$ ${A_r} = {{{\eta _0}} \over 2}{{\Re \{ {Z_{{\rm{LNA}}}}\} } \over {|{Z_{{\rm{LNA}}}}{|^2}}}1{^2},$ (C.2)

where A_r can be interpreted as the area of the array such that for a given incident plane wave with power density, P_inc, the power delivered to the load is P_load = A_rP_inc. Alternatively, it can also be expressed as A_r = τA_e.

As demonstrated in Appendix B, the quantity τ relates to R. Substituting Eq. (C.2) and Eq. (B.2) into (C.1) and simplifying yields ${SEFD}_{I}^{r m s} = \frac{4 k}{η_{0}} \sqrt{\frac{T_{sysX}^{2} R_{X}^{2}}{| | 1_{X} ‖^{4}} + \frac{T_{sysY}^{2} R_{Y}^{2}}{| | 1_{Y} ‖^{4}}} .$ ${\rm{SEFD}}_I^{rms} = {{4k} \over {{\eta _0}}}\sqrt {{{T_{{\rm{sysX}}}^2R_X^2} \over {||{1_X}{^4}}} + {{T_{{\rm{sysY}}}^2R_Y^2} \over {||{1_Y}{^4}}}}.$ (C.3)

Appendix D Simulated and observed SEFD_XX and SEFD_YY

Fig. D.1

Comparison of simulated and measured SEFD_XX and SEFD_YY. The relative difference M_SEFD, in percent, is evaluated for each pixel.

References

Baudin, P. 2015, Wireless Transceiver Architecture: Bridging RF and Digital Communications, 1st edn. (Hoboken, NJ: Wiley) [Google Scholar]
Belostotski, L., Veidt, B., Warnick, K.F., & Madanayake, A. 2015, IEEE Trans. Antennas Propag., 63, 2508 [CrossRef] [Google Scholar]
Briggs, D.S. 1995, PhD thesis, The New Mexico Institute of Mining and Technology, USA [Google Scholar]
Caiazzo, M. 2017, SKA Phase 1 System Requirements Specification V11, Technical Report SKA-TEL-SKO-0000008, SKA Organisation [Google Scholar]
Chippendale, A.P., Hayman, D.B., & Hay, S.G. 2014, PASA, 31, e019 [NASA ADS] [CrossRef] [Google Scholar]
Ellingson, S.W. 2011, IEEE Trans. Antennas Propag., 59, 1855 [CrossRef] [Google Scholar]
Hillbrand, H., & Russer, P. 1976, IEEE Trans. Circuits Syst., 23, 235 [CrossRef] [Google Scholar]
Labate, M.G., Braun, R., Dewdney, P., Waterson, M., & Wagg, J. 2017a, in 2017 XXXIInd General Assem. and Scientific Symp. of the Int. Union of Radio Sci. (URSI GASS), 1 [Google Scholar]
Labate, M.G., Dewdney, P., Braun, R., Waterson, M., & Wagg, J. 2017b, in 2017 11th European Conference on Antennas and Propagation (EUCAP), 2259 [CrossRef] [Google Scholar]
Lane, W.M., Cotton, W.D., van Velzen, S., et al. 2014, MNRAS, 440, 327 [NASA ADS] [CrossRef] [Google Scholar]
McMullin, J.P., Waters, B., Schiebel, D., Young, W., & Golap, K. 2007, ASP Conf. Ser., 376, 127 [NASA ADS] [Google Scholar]
Offringa, A.R., McKinley, B., Hurley-Walker, N., et al. 2014, MNRAS, 444, 606 [NASA ADS] [CrossRef] [Google Scholar]
Offringa, A.R., Trott, C.M., Hurley-Walker, et al. 2016, MNRAS, 458, 1057 [NASA ADS] [CrossRef] [Google Scholar]
Perley, R.A., & Butler, B.J. 2017, ApJS, 230, 7 [NASA ADS] [CrossRef] [Google Scholar]
Sokolowski, M., Colegate, T., Sutinjo, A.T., et al. 2017, PASA, 34, e062 [NASA ADS] [CrossRef] [Google Scholar]
Sokolowski, M., Jordan, C.H., Sleap, G., et al. 2020, PASA, 37, e021 [NASA ADS] [CrossRef] [Google Scholar]
Strang, G. 2016, Introduction to Linear Algebra, 5th edn. (Wellesley, MA, USA: Wellesley-Cambridge Press) [Google Scholar]
Sutinjo, A.T., Colegate, T.M., Wayth, R.B., et al. 2015, IEEE Trans. Antennas Propag., 63, 5433 [CrossRef] [Google Scholar]
Sutinjo, A.T., Ung, D.C.X., & Juswardy, B. 2018, IEEE Trans. Antennas Propag, 66, 5511 [CrossRef] [Google Scholar]
Sutinjo, A.T., Sokolowski, M., Kovaleva, M., et al. 2021, A&A, 646, A143 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Thompson, A.R., Moran, J.M., & Swenson, G.W. 2017, Response of the Receiving System (Cham: Springer International Publishing), 207 [Google Scholar]
Tingay, S.J., Goeke, R., Bowman, J.D., et al. 2013, PASA, 30, 7 [NASA ADS] [CrossRef] [Google Scholar]
Tokarsky, P.L., Konovalenko, A.A., & Yerin, S.N. 2017, IEEE Trans. Antennas Propag., 65, 4636 [CrossRef] [Google Scholar]
Tremblay, S.E., Ord, S.M., Bhat, N.D.R., et al. 2015, PASA, 32, e005 [CrossRef] [Google Scholar]
Twiss, R.Q. 1955, J. Appl. Phys., 26, 599 [NASA ADS] [CrossRef] [Google Scholar]
Ung, D.C.X. 2020, MPhil thesis, School of Electrical Engineering, Computing & Mathematical Sciences, Curtin University, Bentley, Western Australia [Google Scholar]
Ung, D., Sutinjo, A., & Davidson, D. 2019a, in 2019 13th European Conference on Antennas and Propagation (EuCAP), 1 [Google Scholar]
Ung, D., Sutinjo, A., Davidson, D., Johnston-Hollitt, M., & Tingay, S. 2019b, in 2019 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, 401 [CrossRef] [Google Scholar]
Ung, D.C.X., Sokolowski, M., Sutinjo, A.T., & Davidson, D.B. 2020, IEEE Trans. Antennas Propag., 68, 5395 [CrossRef] [Google Scholar]
van Haarlem, M.P., Wise, M.W., Gunst, A.W., et al. 2013, A&A, 556, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Warnick, K.F., Woestenburg, B., Belostotski, L., & Russer, P. 2009, IEEE Trans. Antennas Propag., 57, 1634 [CrossRef] [Google Scholar]
Warnick, K.F., Ivashina, M.V., Maaskant, R., & Woestenburg, B. 2010, IEEE Trans. Antennas Propag., 58, 2121 [CrossRef] [Google Scholar]
Warnick, K.F., Ivashina, M.V., Wijnholds, S.J., & Maaskant, R. 2012, IEEE Trans. Antennas Propag., 60, 184 [CrossRef] [Google Scholar]
Warnick, K.F., Maaskant, R., Ivashina, M.V., Davidson, D.B., & Jeffs, B.D. 2018, Phased Arrays for Radio Astronomy, Remote Sensing, and Satellite Communications, EuMA High Frequency Technologies Series (Cambridge: Cambridge University Press) [CrossRef] [Google Scholar]
Wrobel, J.M., & Walker, R.C. 1999, ASP Conf. Ser., 180, 171 [NASA ADS] [Google Scholar]

¹

https://www.altair.com/feko/

²

https://github.com/nhurleywalker/GLEAM-X-pipeline

³

The correlation products are in fact XX^*, YY^*, XY^*, and YX^*, but in the context of this section the conjugate symbols (^*) were dropped for brevity.

⁴

https://sourceforge.net/p/wsclean/wiki/Home/

⁵

MWA gridpoints are pointing directions where delays applied in the analog beamformers are exact.

All Tables

Table 1

Calculated parameters required to compute SEFD_I for the array pointing at the Hydra-A source (Az = 81°, ZA = 46°) on 2014-12-26 at 16:05:43 UTC.

In the text

All Figures

Fig. 1

Arrays of dual-polarized low-frequency antennas observing the sky. The antennas are at fixed positions above the ground, and the z-axis is up. The signal from each element is amplified by an LNA, weighted, and summed. The output of each summation is directed into the correlator. The y-directed elements (black) follow the same signal flow, but are not shown. A four-element array is shown as an example.

In the text

Fig. 2

Antenna length calculation for an array, where l_1xθ is the antenna length associated with $\hat{θ}$ $\widehat \theta$ polarized incident field for x-directed antenna in the embedded element number 1 in the array shown, similarly to l_ixθ, l_iyθ, where i is the embedded element number. For brevity we consider a four-element array i = 1, ⋯, 4; the extension to i = 1, ⋯, N presents no complication.

In the text

	Fig. 3 Diagram representing system noise. The array is surrounded by a homogeneous blackbody radiator at temperature T_sys that represents the diffuse sky noise and the LNA noise. The y elements are similarly connected to a separate weight and summing circuit that are not shown for brevity.
In the text

	Fig. 4 Depiction of the vectors, their projections, and the area of the parallelogram.
In the text

Fig. 5

Example 0.5 s sky and difference images in X polarization (scale of the color bar is in Jy) used to calculate SEFD. Left: 0.5 s sky image. Center: difference between two consecutive 0.5 s images. Right: difference between two consecutive 0.5 s images corrected for the primary beam (divided by the primary beam in the X polarization). In order to calculate noise in every direction in the sky within the field of view (noise map), the standard deviation of the noise was calculated in small regions around each pixel in the difference image and divided by $\sqrt{2}$ $\sqrt 2$ . These noise maps were later converted into SEFD using Eq. (29). The same procedure was applied to Y and is very similar to Stokes I polarizations (in this case the correction for the primary beam was more complex than the simple division by the corresponding primary beam).

In the text

	Fig. 6 SEFD in the pointing direction of gridpoint 114 with realized area calculated at (Az, ZA) = (80°, 46°) as a function of frequency. The observed SEFD calculated from calibrated visibilities is compared with simulated SEFD.
In the text

	Fig. 7 (a) Simulated SEFD_I, (b) observed SEFD_I, and (c) percentage difference between the simulated and observed SEFD_I at 154.88 MHz calculated by Eq. (31).
In the text

	Fig. 8 Relative difference between the (a) simulated and (b) observed SEFD_I and (c) the difference of $Δ {SEFD}_{I}^{obs}$ $\Delta {\rm{SEFD}}_I^{{\rm{obs}}}$ and $Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$
In the text

	Fig. 9 $Δ {SEFD}_{I}^{sim}$ $\Delta {\rm{SEFD}}_I^{{\rm{sim}}}$ computed for 199.68 MHz at (Az, ZA) = (45°, 56.96°) showing the impact of the error resulting from the rms expression of SEFD_I.
In the text

	Fig. A.1 Magnitudes of various cumulative statistics of data recorded with the MWA Voltage Capture System (Tremblay et al. 2015) at two neighboring tiles. All except the auto-correlations 〈\|X\|²〉 = 〈X*X〉 converge rapidly to zero.
In the text

	Fig. B1 Comparison of R_x calculated using three methods. The solid curve was obtained using equation (12), while the data points represented by the circle and triangle markers are obtained using (B.2) and (B.4), respectively.
In the text

	Fig. D.1 Comparison of simulated and measured SEFD_XX and SEFD_YY. The relative difference M_SEFD, in percent, is evaluated for each pixel.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Baudin, P. 2015, Wireless Transceiver Architecture: Bridging RF and Digital Communications, 1st edn. (Hoboken, NJ: Wiley) [Google Scholar]

[2] Belostotski, L., Veidt, B., Warnick, K.F., & Madanayake, A. 2015, IEEE Trans. Antennas Propag., 63, 2508 [CrossRef] [Google Scholar]

[3] Briggs, D.S. 1995, PhD thesis, The New Mexico Institute of Mining and Technology, USA [Google Scholar]

[4] Caiazzo, M. 2017, SKA Phase 1 System Requirements Specification V11, Technical Report SKA-TEL-SKO-0000008, SKA Organisation [Google Scholar]

[5] Chippendale, A.P., Hayman, D.B., & Hay, S.G. 2014, PASA, 31, e019 [NASA ADS] [CrossRef] [Google Scholar]

[6] Ellingson, S.W. 2011, IEEE Trans. Antennas Propag., 59, 1855 [CrossRef] [Google Scholar]

[7] Hillbrand, H., & Russer, P. 1976, IEEE Trans. Circuits Syst., 23, 235 [CrossRef] [Google Scholar]

[8] Labate, M.G., Braun, R., Dewdney, P., Waterson, M., & Wagg, J. 2017a, in 2017 XXXIInd General Assem. and Scientific Symp. of the Int. Union of Radio Sci. (URSI GASS), 1 [Google Scholar]

[9] Labate, M.G., Dewdney, P., Braun, R., Waterson, M., & Wagg, J. 2017b, in 2017 11th European Conference on Antennas and Propagation (EUCAP), 2259 [CrossRef] [Google Scholar]

[10] Lane, W.M., Cotton, W.D., van Velzen, S., et al. 2014, MNRAS, 440, 327 [NASA ADS] [CrossRef] [Google Scholar]

[11] McMullin, J.P., Waters, B., Schiebel, D., Young, W., & Golap, K. 2007, ASP Conf. Ser., 376, 127 [NASA ADS] [Google Scholar]

[12] Offringa, A.R., McKinley, B., Hurley-Walker, N., et al. 2014, MNRAS, 444, 606 [NASA ADS] [CrossRef] [Google Scholar]

[13] Offringa, A.R., Trott, C.M., Hurley-Walker, et al. 2016, MNRAS, 458, 1057 [NASA ADS] [CrossRef] [Google Scholar]

[14] Perley, R.A., & Butler, B.J. 2017, ApJS, 230, 7 [NASA ADS] [CrossRef] [Google Scholar]

[15] Sokolowski, M., Colegate, T., Sutinjo, A.T., et al. 2017, PASA, 34, e062 [NASA ADS] [CrossRef] [Google Scholar]

[16] Sokolowski, M., Jordan, C.H., Sleap, G., et al. 2020, PASA, 37, e021 [NASA ADS] [CrossRef] [Google Scholar]

[17] Strang, G. 2016, Introduction to Linear Algebra, 5th edn. (Wellesley, MA, USA: Wellesley-Cambridge Press) [Google Scholar]

[18] Sutinjo, A.T., Colegate, T.M., Wayth, R.B., et al. 2015, IEEE Trans. Antennas Propag., 63, 5433 [CrossRef] [Google Scholar]

[19] Sutinjo, A.T., Ung, D.C.X., & Juswardy, B. 2018, IEEE Trans. Antennas Propag, 66, 5511 [CrossRef] [Google Scholar]

[20] Sutinjo, A.T., Sokolowski, M., Kovaleva, M., et al. 2021, A&A, 646, A143 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[21] Thompson, A.R., Moran, J.M., & Swenson, G.W. 2017, Response of the Receiving System (Cham: Springer International Publishing), 207 [Google Scholar]

[22] Tingay, S.J., Goeke, R., Bowman, J.D., et al. 2013, PASA, 30, 7 [NASA ADS] [CrossRef] [Google Scholar]

[23] Tokarsky, P.L., Konovalenko, A.A., & Yerin, S.N. 2017, IEEE Trans. Antennas Propag., 65, 4636 [CrossRef] [Google Scholar]

[24] Tremblay, S.E., Ord, S.M., Bhat, N.D.R., et al. 2015, PASA, 32, e005 [CrossRef] [Google Scholar]

[25] Twiss, R.Q. 1955, J. Appl. Phys., 26, 599 [NASA ADS] [CrossRef] [Google Scholar]

[26] Ung, D.C.X. 2020, MPhil thesis, School of Electrical Engineering, Computing & Mathematical Sciences, Curtin University, Bentley, Western Australia [Google Scholar]

[27] Ung, D., Sutinjo, A., & Davidson, D. 2019a, in 2019 13th European Conference on Antennas and Propagation (EuCAP), 1 [Google Scholar]

[28] Ung, D., Sutinjo, A., Davidson, D., Johnston-Hollitt, M., & Tingay, S. 2019b, in 2019 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, 401 [CrossRef] [Google Scholar]

[29] Ung, D.C.X., Sokolowski, M., Sutinjo, A.T., & Davidson, D.B. 2020, IEEE Trans. Antennas Propag., 68, 5395 [CrossRef] [Google Scholar]

[30] van Haarlem, M.P., Wise, M.W., Gunst, A.W., et al. 2013, A&A, 556, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[31] Warnick, K.F., Woestenburg, B., Belostotski, L., & Russer, P. 2009, IEEE Trans. Antennas Propag., 57, 1634 [CrossRef] [Google Scholar]

[32] Warnick, K.F., Ivashina, M.V., Maaskant, R., & Woestenburg, B. 2010, IEEE Trans. Antennas Propag., 58, 2121 [CrossRef] [Google Scholar]

[33] Warnick, K.F., Ivashina, M.V., Wijnholds, S.J., & Maaskant, R. 2012, IEEE Trans. Antennas Propag., 60, 184 [CrossRef] [Google Scholar]

[34] Warnick, K.F., Maaskant, R., Ivashina, M.V., Davidson, D.B., & Jeffs, B.D. 2018, Phased Arrays for Radio Astronomy, Remote Sensing, and Satellite Communications, EuMA High Frequency Technologies Series (Cambridge: Cambridge University Press) [CrossRef] [Google Scholar]

[35] Wrobel, J.M., & Walker, R.C. 1999, ASP Conf. Ser., 180, 171 [NASA ADS] [Google Scholar]

System equivalent flux density of a low-frequency polarimetric phased array interferometer★

1 Introduction

2 SEFD of a polarimetric phased array interferometer

2.1 Jones matrix of a dual-polarized array

2.2 SEFD expression

2.3 Array system noise

2.4 SEFDI formulation

2.5 rms approximation always underestimates SEFD

3 SEFD simulation procedure

4 SEFD measurement procedure

4.1 Calibration and imaging

4.2 Measuring the SEFD from the noise in the sky images

4.3 SEFD as a function of frequency

5 Results and discussion

5.1 Comparison between the data and simulation

5.2 Computation of SEFDI for diagonal plane and low elevation pointing angle

6 Conclusion

Acknowledgements

Appendix A Statistical Analysis of Var(I˜)

Appendix B Linking R to Active Reflection Coefficient and τ

Appendix C Relation Between RMS Approximation and A/T

Appendix D Simulated and observed SEFDXX and SEFDYY

References

All Tables

All Figures

System equivalent flux density of a low-frequency polarimetric phased array interferometer^★

2.4 SEFD_I formulation

5.2 Computation of SEFD_I for diagonal plane and low elevation pointing angle

Appendix A Statistical Analysis of Var $(\tilde{I})$ $\left({\tilde I} \right)$

Appendix D Simulated and observed SEFD_XX and SEFD_YY