Open Access
Issue
A&A
Volume 670, February 2023
Article Number A57
Number of page(s) 14
Section Astronomical instrumentation
DOI https://doi.org/10.1051/0004-6361/202243863
Published online 06 February 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1 Introduction

Optical/mid-infrared nulling interferometry from space has been experiencing a resurgence of interest over the past few years, particularly with regards to detecting Earth-like exoplanets around solar-type stars. Such an idea is not new, having been first proposed by Bracewell (Bracewell 1978), and then through multiple studies resulting in two large missions: the European Space Agency’s Darwin (Léger et al. 1996) and NASA’s Terrestrial Planet Finder – Interferometer (TPF-I; Beichman et al. 1999). However, due to a myriad of reasons, not least concerning the lack of technological readiness, both missions were cancelled in the late 2000s.

Since then, various teams have continued to work on improving nulling interferometry, leading to the formation of the Large Interferometer For Exoplanets (LIFE) initiative. This project is being considered as one of the large-class missions of the European Space Agency’s Voyage 2050 programme (Voyage 2050 Senior Committee 2021): a large space interferometer in the legacy of Darwin, working in the mid-infrared, with a goal to both detect and characterise Earth-like exoplanets that are difficult to access using other techniques such as single aperture coronography and transit spectroscopy. Significant work has already been done to characterise the planet yield of such a mission (Kammerer & Quanz 2018; Quanz et al. 2022), and the spectral requirements of the instrument (Konrad et al. 2022). A simulator tool to simulate observations and signal-to-noise ratio (S/N) requirements has also been developed (Dannert et al. 2022).

The renewal of interest in space interferometry also presents an opportune time to reanalyse the technology behind nulling interferometry. In particular, new technologies such as ‘kernel-nulling’ (Martinache & Ireland 2018; Laugier et al. 2020) have opened up avenues to consider other telescope configurations away from the Emma X-array configuration decided upon in the Darwin/TPF-I era (Lay et al. 2005). The fourth paper in this series (Hansen et al. 2022, hereby LIFE4) conducted a trade study between a number of different configurations, including the X-array, to determine whether other architectures would provide a higher yield and higher S/N. It was found that, in fact, an architecture consisting of five telescopes in a pentagonal shape (e.g. Léger et al. 1996; Mennesson & Mariotti 1997), using a kernel-nulling beam combination scheme, outperformed the X-array in both detection and characterisation.

In this paper, we propose a practical way of implementing the five-telescope beam combination scheme discussed in the previous paper. We also discuss the systematic instrumental errors of such a beam combiner: how these change the dominant sources of photon noise, and how they impact the robustness and sensitivity of the kernel observable. Finally, we discuss a major advantage of this beam combination scheme – that even if a collector telescope is damaged or fails, the interferometer and beam combiner are still able to produce robust transmission maps with fewer telescopes.

2 Implementation of the beam combination scheme

We devised an implementation of the Kernel-5 nuller beam combiner through the method of Guyon et al. (2013). They posit that any predetermined unitary lossless transfer matrix M (denoted U in their notation) of m inputs, can be created through a series of n=m(m1)2$n = {{m\left( {m - 1} \right)} \over 2}$ unequal beam splitters, with a phase shifting plate put in front of one of the inputs of each beam splitter. Such a design for a five-telescope combiner, can be seen in Fig. 1. This design also includes a set of m adaptive nullers (denoted AN) and m – 1 spatial filters (denoted SF) that are used to remove systematic amplitude and phase errors on input. We note here that the additional glass prisms on the top row of beam splitters are to ensure that the path lengths are matched at all wavelengths by passing through equal amounts of air and glass. We have also indicated that three of the fold mirrors should be total internal reflection (TIF) prisms or similar for this same reason. This design implicitly assumes that the path lengths of the beams are matched before entering the first prisms.

The design can be broadly broken down into two parts: the first row of beam splitters before the spatial filters perform the nulling, and the remaining beam splitters perform mixing in order to create kernel outputs. The spatial filters are placed after the first row of beam splitters so that a precise optical alignment is not required to get a deep null. These will be discussed further in Sect. 3. Finally, we also have included two shutters in the design (denoted S); these shutters can be used in the case of a collector telescope failure to reconfigure the beam combiner to produce robust observables with fewer telescopes. This will be discussed in more detail in Sect. 4. The parameters for each beam splitter and phase shifter can be derived through working backwards from the predetermined matrix M.

We start with the general case of a beam combiner with m inputs (labelled V1 through Vm) and m outputs (W1 through Wm) as depicted for m = 5 in Fig. 1. A phase shifting plate is put in front of the second input of each beam splitter (the beam entering from the left in the figure), imposing a phase shift of ϕj for each jth plate. We define each beam splitter through a mixing angle θ, which is related to their reflectance R and transmittance T as follows: R=sinθT=cosθ.$ \matrix{ {R = \sin \theta } \hfill & {T = \cos \theta .} \hfill \cr } $(1)

Hence each phase shifter and beam splitter module can be described by a 2×2 matrix: Cj=[ sinθjeiϕjcosθjcosθjeiϕjsinθj ].$ {{\bf{C}}_j} = \left[ {\matrix{ {\sin {\theta _j}} & {{e^{i{\phi _j}}}\cos {\theta _j}} \cr {\cos {\theta _j}} & { - {{\rm{e}}^{i{\phi _j}}}\sin {\theta _j}} \cr } } \right]. $(2)

As we have m input beams, with only two interfering at any one time, the other m – 2 beams are represented by identity rows and columns. Each beam combining step can thus be represented as an m × m block diagonal matrix Aj, with the diagonals of the rows and columns corresponding to the combining beams being equal to Cj and having ones on the other diagonal terms. For example, for a beam splitter module combining beams two and three out of a five beam combiner, the matrix Aj is given by Aj=[ 10T000Cj0000T1000T01 ],$ {{\bf{A}}_j} = \left[ {\matrix{ 1 \hfill & {{{\bf{0}}^T}} \hfill & 0 \hfill & 0 \hfill \cr {\bf{0}} \hfill & {{{\bf{C}}_j}} \hfill & {\bf{0}} \hfill & {\bf{0}} \hfill \cr 0 \hfill & {{{\bf{0}}^T}} \hfill & 1 \hfill & 0 \hfill \cr 0 \hfill & {{{\bf{0}}^T}} \hfill & 0 \hfill & 1 \hfill \cr } } \right], $(3)

where 0 = [0,0] is a two element zero column vector.

The full beam combiner is hence described by a multiplication of the n Aj matrices. We also note that each of the output electric fields of the beam combiner can have an arbitrary phase shift (ω) relative to the first output, as we only measure the intensity of these beams. We represent this as the matrix B, given by B=[ 10...00eiω2...0.........0000eiωm ].$ {\bf{B}} = \left[ {\matrix{ 1 & 0 & {...} & 0 \cr 0 & {{{\rm{e}}^{i{\omega _2}}}} & {...} & 0 \cr {...} & {...} & {...} & 0 \cr 0 & 0 & 0 & {{{\rm{e}}^{i{\omega _m}}}} \cr } } \right]. $(4)

Therefore, to create a beam combiner for any transfer matrix M, we solve the following equation for parameters θj, ϕj (j = 1, .., n) and ωk (k = 2, …, m). This equation is built from the Aj matrices in ascending order, corresponding to the order in which light traverses the combiner from the top left corner to the bottom right: M=BAnAn1...A2A1.$ {\bf{M}} = {\bf{B}}{{\bf{A}}_n}{{\bf{A}}_{n - 1}}...{{\bf{A}}_2}{{\bf{A}}_1}. $(5)

For the Kernel-5 nuller there are m = 5 inputs and thus n = 10 beam splitters, and we know the transfer matrix M from LIFE4: M=15[ 111111e4πi5e2πi5e2πi5e4πi51e4πi5e2πi5e2πi5e4πi51e2πi5e4πi5e4πi5e2πi51e2πi5e4πi5e4πi5e2πi5 ].$ {\bf{M}} = {1 \over {\sqrt 5 }}\left[ {\matrix{ 1 & 1 & 1 & 1 & 1 \cr 1 & {{{\rm{e}}^{{{ - 4\pi i} \over 5}}}} & {{{\rm{e}}^{{{2\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 2\pi i} \over 5}}}} & {{{\rm{e}}^{{{4\pi i} \over 5}}}} \cr 1 & {{{\rm{e}}^{{{4\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 2\pi i} \over 5}}}} & {{{\rm{e}}^{{{2\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 4\pi i} \over 5}}}} \cr 1 & {{{\rm{e}}^{{{2\pi i} \over 5}}}} & {{{\rm{e}}^{{{4\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 4\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 2\pi i} \over 5}}}} \cr 1 & {{{\rm{e}}^{{{ - 2\pi i} \over 5}}}} & {{{\rm{e}}^{{{ - 4\pi i} \over 5}}}} & {{{\rm{e}}^{{{4\pi i} \over 5}}}} & {{{\rm{e}}^{{{2\pi i} \over 5}}}} \cr } } \right]. $(6)

This combiner design will produce two sets of second-order nulls, forming one kernel, and two sets of fourth-order nulls, forming the other kernel. Here, a second-order null is one where the interferometer throughput scales as |α|2, |α| being the angular separation from the optical axis, and a fourth-order null scaling as |α|4. In this investigation, we have swapped rows two and four from the matrix in LIFE4 so that the two kernel outputs are formed from neighbouring pairs (with the second order kernel being |W4|2 – |W5|2 and the fourth order being |W2|2 – |W3|2). Assuming the pentagonal formation described in LIFE4, this also allows us to form the deeper, fourth-order null, calculated from the difference in rows two and three, using fewer beam splitting modules.

To be consistent between the naming schemes of the this paper and LIFE4, we define kernel 1 as the second-order null and kernel 2 as the fourth-order null. We plot the instrument transmission for the outputs corresponding to each kernel as a function of angular position in Fig. 2, in both a linear (2a) and log–log (2b) scaling. We have plotted this transmission in both the horizontal and vertical directions, assuming the configuration shown in the inset of Fig. 1; as such the vertical slice is symmetric about zero. Here, we can clearly see that kernel 2 has a deeper, broader null and is thus less affected by stellar leakage in comparison to kernel 1. Furthermore, from the slopes of the lines in Fig. 2b, we confirm the second and fourth order relations of the nulls with angular position, and we can generate a mathematical description of the nulls, being | W4 |2=| W5 |237.2(| α |Bλ)2$ {\left| {{W_4}} \right|^2} = {\left| {{W_5}} \right|^2} \approx 37.2{\left( {{{\left| \alpha \right|B} \over \lambda }} \right)^2} $(7) | W2 |2=| W3 |270.8(| α |Bλ)4.$ {\left| {{W_2}} \right|^2} = * {\left| {{W_3}} \right|^2} \approx 70.8{\left( {{{\left| \alpha \right|B} \over \lambda }} \right)^4}. $(8)

Working through element by element, we solve Eq. (5) for this matrix M, resulting in the parameters listed in Table 1. The output phase shifts ω are all zero except for ω5, which has a phase shift of π. We note here that the phase shifts listed do not account for the phase shifts induced by the fold mirrors and TIF prisms. However, these can be compensated for by adding the additional phase shift (nominally π for a simple mirror reflection) before the relevant beam splitters (i.e. modifying ϕ1, ϕ5, ϕ8 and ϕ10) and as this should be a constant offset, it will not interfere with the following analysis.

We note here several remarks on this implementation of the beam combiner. Firstly, the beam splitters and phase shifting plates are required to have achromatic phase shifts and reflection coefficients (with tolerances discussed in Sect. 3.2) over a large wavelength band (nominally 4–19 μm (Quanz et al. 2022)). This could be alleviated by increasing the number of beam trains, splitting the wavelength into a few coarse channels, and implementing multiple versions of the beam combiner. The downside to this method is the increase of optical components and space requirements; a pertinent problem for a space-based mission. We return to this issue when we discuss phase chopping in Sect. 3.6.

Secondly, this design is inherently polarisation dependent. For this reason, we notionally assume that the beam combiner is planar and there is a polarisation split orthogonal to the plane before the telescope light is injected into the combiner unit. Thus we do not explicitly consider polarisation effects in the following discussions.

Thirdly, we have included the spatial filters after the nulling stage of the beam combiners so that aberrations in this first stage can be compensated with upstream corrective optics or the adaptive nullers as described in Sect. 3. If spatial filtering occurred before the nulling stage, then any alignment errors or optical aberrations in the beamsplitters could not be corrected and would result in a decreased null depth.

Finally, this implementation is schematically drawn as bulk optics. While photonics provides multiple advantages in terms of spatial filtering and space requirements (in terms of both footprint and space compatibility), adequate demonstrations of far mid-infrared achromatic directional couplers and phase shifts have not, to the authors’ knowledge, occurred. There is also an inherent tradeoff in throughput due to photonic transmission and component losses. We note however that progress is ongoing in this area, particularly at the shorter end of the mid-infrared band (around 3–4 μm) (e.g. Kenchington Goldsmith et al. 2017; Gretzinger et al. 2019). In principle, all components could be photonic.

thumbnail Fig. 1

Schematic of a Kernel-5 beam combiner, based on the design of Guyon et al. (2013). Inputs V1 through V5 pass through five adaptive nuller units (AN), a series of ten beam splitter modules (A1 through A10) consisting of a beam splitter and phase shifting plate on one input, and four spatial filters (SF). The five outputs consist of one bright output W1 and four nulled outputs W2 through W4. Two shutters (S) can be used in case of a telescope failure (see Sect. 4). The inset shows the telescope configuration and their corresponding inputs.

thumbnail Fig. 2

Instrument transmission per telescope flux as a function of angular separation from the optical axis for each output pair corresponding to the two kernels (i.e. W4 for kernel 1 and W2 for kernel 2). For each output, a slice through the optical axis was taken in the horizontal and vertical directions; due to the symmetry about the horizontal axis shown in the inset in Fig. 1, the vertical component is symmetric about zero.

Table 1

Optical parameters for the beam combiner design discussed in Sect. 2 and displayed in Fig. 1.

3 Systematic instrumental uncertainties

In this section, we analyse the systematic uncertainties associated with this beam combiner implementation, and the effects of phase fluctuations of the input beams more broadly, on the robustness and sensitivity of the kernel-nulling architecture.

3.1 Adaptive nullers and alignment procedure

In this architecture, systematic errors can come from a number of places: errors in the phase and amplitude of the input beams, as well as errors in the optical elements of each beam splitter module. We can, however, eliminate some of these errors immediately through a careful calibration process using an adaptive nuller.

Adaptive nullers, as described by Lay et al. (2003), are compensators that can adjust the phase and amplitude of an input beam of light through the use of a deformable mirror (DM). The light is spectrally dispersed onto a DM, adjusted to tune the wavelength dependent phase and amplitude, before being dedispersed and recollimated for beam combination. Amplitude is tunable through phase tilts orthogonal to the dispersion direction, when combined with spatial filtering. This is an invaluable tool for correction of the beams on input, and has also been shown to provide stable achromatic phase shifts (Peters et al. 2010).

For our purposes, the adaptive nuller can also be used to eliminate any errors in the top nulling row of beam splitter modules (A1 through A4) with a process similar to the following:

  1. Modulate the adaptive nuller on input V1 such that it maximises amplitude for all wavelengths. This input is set as the global phase reference.

  2. Modulate the adaptive nuller on input V2 so that, as well as entering the spatial filter and removing wavelength dependence, it forms a nulled output on output W2.

  3. Repeat stage two with the other three inputs, forming nulls on outputs W3, W4 and W5 respectively.

  4. If necessary, reduce the amplitude of input V1 and repeat steps 2 and 3.

In this manner, the phases of the input beams can be managed to completely remove any phase errors in the top nulling beam splitter optics, and any errors in the amplitudes of the inputs will just result in less overall light; the inputs can be modified such that they all have the amplitude of the dimmest input beam. In other words, the beam splitter aberrations that may have an impact on the null and stellar leakage, including chromatic aberrations, can be removed by the adaptive nuller. This same process can also be used to intermittently remove alignment drifts between the nuller and fringe tracker during observations; this is discussed and proven to be sufficiently efficient in Appendix A.

The spatial filters at the end of the first row are required to remove unwanted spatial modes of the light that would prevent deep nulls. Any optical losses after the spatial filter will not have any impact on stellar leakage terms, and will affect zodiacal and exozodiacal light equally. These latter losses primarily affect the ability to form a good kernel output.

3.2 Beam combiner optical errors

While the adaptive nullers are able to negate the effects of optical errors in the top nulling row of beam splitters, the remaining six modules will still contribute to errors in the kernel-nulls. To simulate this, we apply random fluctuations to the optical parameters based on a predefined root-mean-squared (RMS) error: θ=θ0+| ΔR |cosθ0x$ \theta = {\theta _0} + {{\left| {{\rm{\Delta }}R} \right|} \over {\cos {\theta _0}}}x $(9) ϕ=ϕ0+Δϕx,$ \phi = {\phi _0} + {\rm{\Delta }}\phi x, $(10)

where x ∈ [−1,1] is a random number, which is uniformly distributed to simulate the effect of a typical pass or fail optical specification. The parameters θ0 and ϕ0 are the true values as defined in Table 1, |ΔR| denotes the error in the reflectance of the beam splitter, and Δϕ represents the error in the phase shifter. For the remainder of this analysis, we consider three sets of these uncertainties and refer to the pair by their |ΔR| amount. These uncertainties are chosen for realistic manufacturing tolerances from optics suppliers: | ΔR |=2%Δϕ=1°| ΔR |=5%Δϕ=3°| ΔR |=10%  Δϕ=6°.$\matrix{ {\left| {{\rm{\Delta }}R} \right| = 2\% } \hfill & {{\rm{\Delta }}\phi = 1^\circ } \hfill \cr {\left| {{\rm{\Delta }}R} \right| = 5\% } \hfill & {{\rm{\Delta }}\phi = 3^\circ } \hfill \cr {\left| {{\rm{\Delta }}R} \right| = 10\% } \hfill & {{\rm{ \Delta }}\phi = 6^\circ .} \hfill \cr } $

We ran a Monte-Carlo simulation to find the standard deviation of the kernel maps as a function of angular coordinate when these errors are applied. We assume a pentagonal arrangement of the telescopes equivalent to the inset in Fig. 1. The two maps, assuming |ΔR| = 5%, are shown in Fig. 3. Here we see that the kernels show a maximum standard deviation of 8% of the total telescope flux, with an average standard deviation of 2.7% and 2.2% of kernel 1 and 2 respectively. We also find that the average standard deviations for |ΔR| = 2% are 1% and 0.9%, and for |ΔR| = 10% we have 5.7% and 4.6% respectively.

We also examine the effect of these uncertainties on the modulation efficiency (RMS azimuthal average) of the kernel map; this highlights the fluctuations that may influence the power of the planet signal as the array rotates. We show this in Fig. 4, where we overlay the modulation efficiency as a function of radius with no error, normalised by flux per telescope, on top of the average of twenty random draws with |ΔR| = 5%. What is apparent here is that with this amount of error, the modulation efficiency is not significantly affected, indicating the information in the signal does not significantly change with these optical errors even though the detailed map structure requires additional calibration or modelling. For clarity, we note that Fig. 4 plots the modulation efficiency of the kernel maps (that is, the difference between two outputs), and not the raw nulled outputs themselves; it is the latter that defines the order of the null and the amount of stellar leakage.

thumbnail Fig. 3

Standard deviation of each kernel as a function of angular position, given as a percentage of the total array flux, for |ΔR| = 5%.

thumbnail Fig. 4

Modulation efficiency (RMS azimuthal average) of the kernel maps as a function of radius. The average with no beam combiner optical errors is overlaid on the average of twenty random draws with |ΔR| = 5%. The random draws themselves are also plotted with low opacity.

3.3 Null depth

From the plots in Fig. 4, the most concerning trend induced by errors in the optical elements is the effect on the null; whether the null no longer reaches the desired depths and thus greatly increasing stellar leakage. Using the simulation machinery detailed in LIFE4, we calculated the base-10 logarithm of the ratio of the stellar leakage noise and zodiacal background light as a function of wavelength, assuming a 2 m aperture size. The wavelength range chosen is between 4 and 19 μm, to align with that of Quanz et al. (2022) and LIFE4. We calculated these plots for two stars located at 5 pc: a M5V dwarf based on Proxima Centauri, and a G2V dwarf based on the Sun. The latter was chosen based on the closest stars of F or G stellar type: there are three stars within 6 pc (τ Ceti, e Eri and η Cas) and can be considered on average to be roughly a solar-type star at 5 pc. Hence this can be used as an extreme scenario on stellar leakage. For the local zodiacal light, we once again used the JWST background calculator1, assuming sky coordinates equivalent to Tau Boo (as an average stellar case). This is roughly equivalent to 1 × 10−7 the blackbody radiation of a 300 K source. The simulation was then repeated for |ΔR| values of 2%, 5% and 10%, to see the effect of optical errors on stellar leakage. The plots are shown in Fig. 5.

We can see from these plots that kernel 2 is far more sensitive to these optical errors; as kernel 1 is only a second-order null, it is much more dominated by stellar leakage, and as such the noise floor is above that induced by these beam combiner optical errors. The fourth-order null of kernel 2, however, produces a smaller stellar leakage term and as such, a reduction in the null depth arising from optical errors. We find that an error of 2% results in a six fold increase in stellar leakage for the solar type star, with a similar increase at 10% for the M-dwarf.

One important point to be made here is that these optical errors will not modify the total amount of stellar leakage present in the system. This can be seen in the zoomed-in inset of Fig. 5, where additional optical abberations marginally decreases the amount of stellar leakage present in kernel 1, counteracting the increase in kernel 2. This is due to the optical errors only affecting the mixing component of the beam combiner, and not the nulling stage (the first row of beam splitting units). With 90% optical error (that is, essentially random values for all the mixing beam splitter units), the two kernel curves overlap, converging slightly below the position of the kernel 1 line in Fig. 5. The main effect these errors induce then, is reducing the effectiveness of kernel 2 (which can produce a much higher S/N) and removing the robustness against systematic errors that kernel outputs are designed to have (see Sect. 3.5).

Furthermore, while this error results in quite a shift, particularly for the solar type star, we remind the reader that these stars are extreme scenarios. Most stars in the LIFE catalogue (Quanz et al. 2022) are further than 5 pc away, and the amount by which stellar leakage dominates at the short wavelengths decreases with distance. There are also few non M-dwarfs within 5 pc, and a 2% |ΔR| error impacting an M-dwarf measurement does not result in a large change in the leakage to zodiacal ratio. Nevertheless, this does indicate that optical errors in the beam combiner are important to minimise, particularly in suppressing stellar leakage at the short wavelengths. Conversely, post-nulling optical beam combiner errors at typical commercial specifications of ~ 10% do not matter beyond approximately 8 μm as measurements will be strongly zodiacal background dominated.

thumbnail Fig. 5

Base-10 logarithm of the ratio of the stellar leakage to zodiacal light against wavelength for two different stars and varying amounts of beam combiner optical error. The black dashed line divides the upper region where the combiner is dominated by stellar leakage, and the lower region where the instrument is zodiacal limited. The inset highlights the proximity of the different optical error lines for kernel 1.

3.4 Null stability

In LIFE4, we provided a simple approximation of the RMS fringe tracking requirements to remain limited by photon background noise, rather than fluctuations in the null. We found that the interferometer should aim for < 9 nm RMS when looking at M-dwarfs, and around 2 nm for G-dwarfs, both at about 5 pc. We now look further into this, including the impact of optical beam combiner errors on the fringe tracking requirements and null stability.

From equation 50 in LIFE4, recall that the minimum fringe tracking error needed to remain photon noise limited can be estimated from this equation: ϕ2 Fstar<max[ PzodiacalA,Fleakage ].$ \left\langle {{\phi ^2}} \right\rangle {F_{{\rm{star}}}} &lt; \max \left[ {{{{P_{{\rm{zodiacal}}}}} \over A},{F_{{\rm{leakage}}}}} \right]. $(11)

where Fstar is the stellar flux, Pzodiacal is the zodiacal light per diffraction limited mode (in phots−1), Fleakage is the stellar leakage flux and A is the single telescope aperture.

We simulate a random erroneous phase of all five telescopes, adding a term ϕi=2πλX,$ {\phi _i} = {{2\pi } \over \lambda }X, $(12)

where i represents the index of the telescope, λ the wavelength in nanometers and X ~ 𝒩(0, δ) is a random variable pulled from a normal distribution with zero mean and a standard deviation of δ, the RMS fringe tracking error in nanometers. We then calculate the mean square response over a large number of random phases, ϕi2 $\left\langle {\phi _i^2} \right\rangle $ and multiply by the stellar flux to obtain the noise due to null fluctuations. In Fig. 6, we plot as a function of wavelength the base-10 logarithm of the ratio of the null fluctuations against the maximum background of that given wavelength. This was plotted for the same two stars and optical beam combiner errors as for Fig. 5. An RMS fringe tracking error of δ = 5 nm was used for this plot.

Firstly, we note that as in Fig. 5, kernel 1 is never dominated by the null fluctuations, again due to it not providing as sensitive a null. We can also see the areas for which stellar leakage and zodiacal light dominate as a function of wavelength: dominant stellar leakage is represented by a flat line (as leakage follows the same functional form as the stellar flux), and the downwards curve represents being zodiacal light dominated. As to be expected from the previous discussion, we see that with a greater beam combiner optical error, the stellar leakage dominates for a larger part of the spectrum. The stronger leakage also washes out the effect of null fluctuations: with a high optical error, these terms dominate and as such null stability becomes less important.

In Table 2, we plot the minimum fringe tracking RMS needed to remain dominated by photon noise at all wavelengths for stars of type G, K and M at 5 pc, as well as for differing amounts of beam combiner optical error. Interestingly, the RMS fringe tracking requirement is stricter than the previous calculation, being less than 1 nm when looking at G-dwarfs with no optical errors and about 6 nm for an M-dwarf. A balance needs to be struck with regards to acceptable stellar leakage and the achievable fringe tracking uncertainties. With |ΔR| = 5%, the additional stellar leakage is not increased too much, and the fringe tracking requirement remains manageable at 3 nm; allowing measurements for G, K and M type stars to remain photon noise limited at all wavelengths.

We also note here that the instrument remains zodiacal limited regardless of beam combiner optical errors for λ > 8 μm and δ = 5 nm even for solar-type stars at 5 pc. The teams defining the science goals for the LIFE mission will need to take this into consideration: performance will be unequal throughout the spectral range, with spectral signatures beyond 8 μm requiring less stringent RMS fringe tracking uncertainties due to being zodiacal background dominated in this regime.

thumbnail Fig. 6

Base-10 logarithm of the ratio of the null fluctuation noise to background noise against wavelength for a fringe tracking RMS of δ = 5 nm. The background noise is chosen to be the maximum of stellar leakage and zodiacal light for that given wavelength. Plotted for two different stars and varying amounts of beam combiner optical error. The black dashed line divides the upper region where the combiner is dominated by instrumental errors induced by fluctuations in the null depth, and the lower region where the instrument is photon limited by either stellar leakage or the zodiacal background.

Table 2

Minimum fringe tracking requirements to remain photon limited over 4–19 μm for different stellar types and different optical error amounts at 5 pc.

3.5 Sensitivity and robustness of the kernel

We also looked at the effect of these optical errors on the robustness of the kernel itself. As described in Martinache & Ireland (2018), the kernel operators are designed to be independent of systematic errors in phase, and are analogous to the technique of ‘phase chopping’ in the literature (e.g. Woolf & Angel 1997; Lay 2004; Mennesson et al. 2005) with regards to removing unwanted symmetrical emission such as exozodiacal light and the removal of instrumental errors. However, with the inclusion of beam combiner optical errors the outputs are no longer ‘pure’ kernels and will hence be affected by systematic phase offsets. To analyse this, we once again modified the phase of the telescopes as in Eq. (12) and took the standard deviation of the resulting kernel output from this phase error. We interpret the error introduced in the phase as a systematic piston offset; that is, over multiple exposures, the phase at the centre of the map will average to a non-zero value. We note that these phase errors are separate and fundamentally different to the internal optical phase shift errors in the previous sections, primarily as these will affect the stellar leakage and nulling stage of the combiner.

We plot this systematic piston offset (in nanometers) against the standard deviation of the kernel outputs for multiple beam combiner errors in Fig. 7. We also plotted the curve for two different wavelengths: one at 10 μm, and one at 4 μm. We can identify that the curves follow a quadratic relationship with the systematic piston error, and that kernel 1 is marginally more sensitive to systematic piston error than kernel 2.

In order for a confident planet detection to be made, we need to make sure that the kernel output is sensitive enough to the planet and that systematic errors will not show up as false signals. In the mid-infrared, the contrast for an Earth-like planet against a solar-type star is 1 × 10−7 (Defrère et al. 2018), and so we need to ensure the error in the kernel lies below this amount. This is shown on the plots as a black dotted line. From this, it is apparent that systematic piston error needs to be kept as low as possible, especially at short wavelengths and with large optical errors. At λ = 4 μm, an optical error of |ΔR| = 2% requires a systematic piston error <0.75 nm, whereas |ΔR| = 10% requires a very stringent 0.3 nm or less. The longer wavelength plot at λ = 10 μm is more lenient, suggesting a systematic error of <1.8 nm at |ΔR| = 2% and 0.7 nm for |ΔR| = 10%. This finding emphasises a result that has been consistent across all of these investigations: the shorter wavelengths are much more affected by instrumental errors.

We also fitted a quadratic to each of the curves in Fig. 7b, and plotted the coefficient against the respective error amount |ΔR|. This is shown in Fig. 8. We identify that the coefficient of the quadratic curve scales linearly with beam combiner error, and hence show that the kernel error is overall third order in systematic piston errors.

3.6 Phase chopping

While kernel-nulling allows us to remove on-axis symmetric photon noise sources, as well as being resistant to second-order errors in piston (Martinache & Ireland 2018), there are still residual instrumental noise sources that could be removed through phase chopping, namely differential zodiacal background levels in each output, and detector noise. This technique involves rapidly swapping the rows corresponding to kernel output pairs with each other (i.e., for the five-telescope combiner, swapping rows two and three, and four and five). In doing so, the kernel output remains the same but the signals are being measured on different detector pixels. Hence we can remove any slowly variable detector bias or gain effects in the system.

Furthermore, if there is a different amount of zodiacal light in each output due to different telescope sensitivities or similar effects, this will remain stationary in the output despite phase chopping and can hence be removed. It is this latter point that is the main motivation for phase chopping: for an Earth-like planet around a solar analogue at 5 pc, the zodiacal light can dominate the signal by three orders of magnitude at the upper end of the bandpass. Hence differing coupling of starlight and zodiacal background at the 1% level can cause major difficulties in signal extraction and so should be removed if possible.

Phase chopping with our beam combiner design is theoretically not difficult; all that is required is to flip the signs for each of the phase shifts in front of beam splitter modules A5 through A10. Mathematically: ϕiϕii[ 5,10 ].$ \matrix{ {{\phi _i} \Rightarrow - {\phi _i}} \hfill &amp; {i \in \left[ {5,10} \right].} \hfill \cr } $(13)

This could be made to happen, for example, by putting beam splitters on piezo stages and rapidly moving them by a fraction of a wavelength to induce a rapid phase shift sign flip (and hence a ‘delay chop').

Of course, by chopping in delay, this induces another error term. As we are working over a large wavelength range and these changing phase shifts will only work at specific wavelength, elsewhere in the bandpass will incur a degree of chromatic phase error. This can be minimised by reducing the size of the bandpass (such as the use of multiple beam trains as described in Sect. 2), by making the reference wavelength for phase chopping in the centre of the bandpass (such that the errors on either end of bandpass are equal), and by designing the beamsplitter to have a wavelength independent 0 or π phase shift. This last point allows us to halve the amount of phase shift error that would otherwise occur, as well as ensure that both outputs on either side of the phase chop have symmetrical errors.

To model this error, we split our nominal wavelength range into three sub-bandpasses, evenly spaced with regard to the amount of phase error induced at the end of the sub-bandpasses. These ranges became 4–6.7 μm, 6.7–11.2 μm and 11.2–19 μm. The shift in delay (δ) required to perform the phase chop is given by δi=Δϕiλc2π=min(2ϕi,2π2ϕi)×λc2πi[ 5,10 ],$ \matrix{ {{\delta _i} = {\rm{\Delta }}{\phi _i}{{{\lambda _c}} \over {2\pi }} = \min \left( {2{\phi _i},2\pi - 2{\phi _i}} \right) \times {{{\lambda _c}} \over {2\pi }}} \hfill &amp; {i \in \left[ {5,10} \right],} \hfill \cr } $(14)

where λc is the central (reference) wavelength of the relevant sub-bandpass and Δϕi is the minimum change in phase required to flip the sign of the ith beam splitter phase shift. As previously mentioned, we defined the central wavelength to be such that the chromatic errors at the edges of the sub-bandpasses are equal. For the ranges listed above, these wavelengths are λc = 5.02 μm, 8.43 μm and 14.17 μm. The chromatic error in phase at wavelengths other than the central wavelength can then be calculated by σϕi=12| Δϕi2πδiλ |=12| Δϕi(1λcλ) |.$ {\sigma _{{\phi _i}}} = {1 \over 2}\left| {{\rm{\Delta }}{\phi _i} - {{2\pi {\delta _i}} \over \lambda }} \right| = {1 \over 2}\left| {{\rm{\Delta }}{\phi _i}\left( {1 - {{{\lambda _c}} \over \lambda }} \right)} \right|. $(15)

For the Kernel-5 beam combiner, these phase shifts and the maximum error associated with them at the edge of the sub-bandpass (in radians) are Δϕ5=2.17σϕ5=0.28Δϕ6=0.60σϕ6=0.08Δϕ7=1.62σϕ7=0.21Δϕ8=0.29σϕ8=0.04Δϕ9=0.99σϕ9=0.13Δϕ10=πσϕ10=0.4$\matrix{ {{\rm{\Delta }}{\phi _5} = 2.17} \hfill &amp; {{\sigma _{{\phi _5}}} = 0.28} \hfill \cr {{\rm{\Delta }}{\phi _6} = 0.60} \hfill &amp; {{\sigma _{{\phi _6}}} = 0.08} \hfill \cr {{\rm{\Delta }}{\phi _7} = 1.62} \hfill &amp; {{\sigma _{{\phi _7}}} = 0.21} \hfill \cr {{\rm{\Delta }}{\phi _8} = 0.29} \hfill &amp; {{\sigma _{{\phi _8}}} = 0.04} \hfill \cr {{\rm{\Delta }}{\phi _9} = 0.99} \hfill &amp; {{\sigma _{{\phi _9}}} = 0.13} \hfill \cr {{\rm{\Delta }}{\phi _{10}} = \pi } \hfill &amp; {{\sigma _{{\phi _{10}}}} = 0.4} \hfill \cr } $

To see the effect that this chromatic phase chop error would have on the measurements, we performed the same simulation as in Sect. 3.3, except adding the relevant phase chop error to the Δø of each beam splitter. This is shown in Fig. 9.

We can see in these plots that the added chromatic error indeed makes the stellar leakage considerably worse for kernel 2, with kernel 1 being barely effected for the same reasons as in Sect. 3.3. We also see the effects of chromaticity – the leakage is at a minimum in the centre of each sub-bandpass (where there should be no added error) and increases to a local maximum or inflection point at the edges. The effect is quite strong at the shortest wavelengths, reducing the effect of the null by an order of magnitude. However, the second kernel is still zodiacal dominated beyond 8 μm; hence this will only be a problem for the shortest wavelengths around the closest stars.

If this were deemed to be too great an error to propagate uncorrected, we could add a thin wedge of glass to a second piezo stage in front of each of the effected beam splitters that could act as a corrector for this chromatic effect, though this doubles the number of piezos and would considerably increase the beam combiner’s complexity. Nevertheless, due to the likelihood of varying zodiacal background levels in each output, as well as detector effects, this added complexity is likely a worthwhile tradeoff.

thumbnail Fig. 7

Error in the kernel, plotted against systematic piston error in nanometers for a variety of optical beam combiner errors. The dotted line at 1 × 10−7 represents the point where the kernel should be sensitive enough to detect an Earth-like planet around a solar type star.

thumbnail Fig. 8

Relationship between the quadratic coefficients of Fig. 7b and their associated error in the beam combiner |ΔR|.

thumbnail Fig. 9

Base-10 logarithm of the ratio of the stellar leakage to zodiacal light against wavelength for two different stars, varying amounts of beam combiner optical error. Chromatic phase error has been induced by a delay chop.

4 Redundancy for failed telescopes

4.1 Kernel-5 nuller

One significant advantage of the ‘Guyon'-type beam combiner design for the Kernel-5 nuller described in Sect. 2, on top of the planet yield advantages discussed in LIFE4, is the ability for it to continue producing robust observable measurements even if a collecting spacecraft fails. In other words, the Kernel-5 nuller will still be able to function with only four telescopes. This is not applicable to the traditional X-array beam combiner – if one of the telescopes of that design fails, the main mission objectives for detecting Earth-like exoplanets is severely compromised.

This safeguard against a damaged telescope can be implemented through the use of a well placed shutter in the midst of the beam splitters, shown as S in Fig. 1. If a collector telescope fails, all that is required is for the four operating telescopes to move into input positions two through five (that is, the failed telescope corresponds to input V1), and the shutter to close. We can emulate this in matrix notation through blocking beam one at the start of the relay (representing the failed telescope; F) and then blocking beam two in between beam splitting modules A4 and A7 (representing the shutter; S1). Inserting these into Eq. (5): F=[ 0000001000001000001000001 ]S=[ 1000000000001000001000001 ]$ \matrix{ {{\bf{F}} = \left[ {\matrix{ 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 1 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 1 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 1 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 1 \hfill \cr } } \right]} &amp; {{\bf{S}} = \left[ {\matrix{ 1 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 1 \hfill &amp; 0 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 1 \hfill &amp; 0 \hfill \cr 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 0 \hfill &amp; 1 \hfill \cr } } \right]} \cr } $(16) M˜=BA10A9A8A7A6A5SA4A3A2A1F.$ {\bf{\tilde M}} = {\bf{B}}{{\bf{A}}_{10}}{{\bf{A}}_9}{{\bf{A}}_8}{{\bf{A}}_7}{{\bf{A}}_6}{{\bf{A}}_5}{\bf{S}}{{\bf{A}}_4}{{\bf{A}}_3}{{\bf{A}}_2}{{\bf{A}}_1}{\bf{F}}. $(17)

Using the same parameters of the beam splitters and phase shifters derived in Sect. 2, we can calculate the new transfer matrix (M˜$ {{\bf{\tilde M}}} $) for the damaged system: M˜=15[ 01111054ai54+bi54bi54+ai054+ai54bi54+bi54ai054+bi54+ai54ai54bi054bi54ai54+ai54+bi ],$ {\bf{\tilde M}} = {1 \over {\sqrt 5 }}\left[ {\matrix{ 0 &amp; 1 &amp; 1 &amp; 1 &amp; 1 \cr 0 &amp; { - {{\sqrt 5 } \over 4} - ai} &amp; {{{\sqrt 5 } \over 4} + bi} &amp; {{{\sqrt 5 } \over 4} - bi} &amp; { - {{\sqrt 5 } \over 4} + ai} \cr 0 &amp; { - {{\sqrt 5 } \over 4} + ai} &amp; {{{\sqrt 5 } \over 4} - bi} &amp; {{{\sqrt 5 } \over 4} + bi} &amp; { - {{\sqrt 5 } \over 4} - ai} \cr 0 &amp; {{{\sqrt 5 } \over 4} + bi} &amp; { - {{\sqrt 5 } \over 4} + ai} &amp; { - {{\sqrt 5 } \over 4} - ai} &amp; {{{\sqrt 5 } \over 4} - bi} \cr 0 &amp; {{{\sqrt 5 } \over 4} - bi} &amp; { - {{\sqrt 5 } \over 4} - ai} &amp; { - {{\sqrt 5 } \over 4} + ai} &amp; {{{\sqrt 5 } \over 4} + bi} \cr } } \right], $(18)

where a=52(5+5)  b=1212(5+5).$\matrix{ {a = \sqrt {{5 \over {2\left( {5 + \sqrt 5 } \right)}}} \quad \quad } \hfill &amp; {b = {1 \over 2}\sqrt {{1 \over 2}\left( {5 + \sqrt 5 } \right).} } \hfill \cr } $

It is apparent from this system that output W1 is again the bright output. What is less apparent is that outputs W2 and W3, and W4 and W5 form enantiomorphic pairs; an attribute that allows them to form a kernel-null (Laugier et al. 2020). To demonstrate this, we perform a relative phase shift at the output (that is, change ω) so that the contribution of input 2 is always real; this should result in the kernel-null pairs becoming complex conjugates of each other. We plot the pairs of outputs in a ‘Complex Matrix Plot', akin to Laugier et al. (2020), in Fig. 10, where it is easily seen that the pairs are mirror images of each other. Thus, even with one telescope no longer working, the system is able to produce two kernel-null outputs.

We show the output kernel maps in Fig. 11, where we have assumed that the remaining four telescopes have changed configuration into a 6:1 X-array formation as in LIFE4 and Lay (2006). We find that one kernel produces a maximum transmission of 0.65 single telescope fluxes, and the other producing 2.75 telescope fluxes (together producing an efficiency of 85% compared to the X-array, or 68% with respect to the original five telescopes).

To properly compare the various combiners, let us consider a flux-normalised S/N metric defined as the ratio of the final background-subtracted S/N to that of a single telescope observing the faint planet in a background-limited imaging mode. For an array with m telescopes, the upper bound for this value of this metric is m. If the output is in chopped pairs, then the upper bound is m/2$m/\sqrt 2 $, and we can write for a single chopped pair: S/Ni=msignal2fback,$ {S \mathord{\left/ {\vphantom {S {{N_i} = {{{m_{{\rm{signal}}}}} \over {\sqrt {2{f_{{\rm{back}}}}} }},}}} \right. \kern-\nulldelimiterspace} {{N_i} = {{{m_{{\rm{signal}}}}} \over {\sqrt {2{f_{{\rm{back}}}}} }},}} $(19)

where mSignal is the number of telescopes that direct planet flux out a single output (equivalently the maximum value of the transmission map), and fback is the fraction of a single telescope background directed out a single output. This has an upper limit of 1, as the output is a single spatial mode, fback can be reduced by cold shutters.

For multiple output pairs that have uncorrelated noise, we can make a inverse variance weighted average of the planet flux signals, with a total signal to noise being S/Ntotal=i(S/Ni)2.$ {S \mathord{\left/ {\vphantom {S {{N_{{\rm{total}}}} = \sqrt {\sum\limits_i {{{\left( {{S \mathord{\left/ {\vphantom {S {{N_i}}}} \right. \kern-\nulldelimiterspace} {{N_i}}}} \right)}^2}.} } }}} \right. \kern-\nulldelimiterspace} {{N_{{\rm{total}}}} = \sqrt {\sum\limits_i {{{\left( {{S \mathord{\left/ {\vphantom {S {{N_i}}}} \right. \kern-\nulldelimiterspace} {{N_i}}}} \right)}^2}.} } }} $(20)

This is derived in Appendix B. So, for the ’shuttered’ beam combiner, noting that the background here is reduced to fback = 0.8 due to the shutter, we find that the total S/N metric is S/N=(2.752×0.8)2+(0.652×0.8)2=2.23.$ {S \mathord{\left/ {\vphantom {S {N = \sqrt {{{\left( {{{2.75} \over {2 \times 0.8}}} \right)}^2} + {{\left( {{{0.65} \over {2 \times 0.8}}} \right)}^2}} = 2.23.}}} \right. \kern-\nulldelimiterspace} {N = \sqrt {{{\left( {{{2.75} \over {2 \times 0.8}}} \right)}^2} + {{\left( {{{0.65} \over {2 \times 0.8}}} \right)}^2}} = 2.23.}} $(21)

The relative S/N of the damaged array is therefore 63% of the original five telescope combiner (3.54), or 80% of the equivalent non-damaged four-telescope array architecture (2.83). This infers that this configuration, made out of necessity due to a collector telescope failure, results in a 37% S/N reduction compared to the original beam combination architecture with 100% of its telescopes functioning. Despite this loss in S/N, this is still much better than the 100% reduction that would occur in the X-array architecture due to it not being able to null if a telescope malfunctioned. We note that the reason kernel 2 specifically contains most of the transmission is determined solely by the arrangement and numbering of telescopes in the array; a different arrangement would result in kernel 1 having the maximum transmission.

This idea can be extended further to three telescopes (that is, two telescopes failing) by implementing a second shutter (S2) between A3 and A6. This results in a transfer matrix of M˜=15[ 001110056+ai56ci53+ei0056ai56+ci53ei0056+bi56di53fi0056bi56+di53+fi ],$ {\bf{\tilde M}} = {1 \over {\sqrt 5 }}\left[ {\matrix{ 0 &amp; 0 &amp; 1 &amp; 1 &amp; 1 \cr 0 &amp; 0 &amp; {{{\sqrt 5 } \over 6} + ai} &amp; {{{\sqrt 5 } \over 6} - ci} &amp; { - {{\sqrt 5 } \over 3} + ei} \cr 0 &amp; 0 &amp; {{{\sqrt 5 } \over 6} - ai} &amp; {{{\sqrt 5 } \over 6} + ci} &amp; { - {{\sqrt 5 } \over 3} - ei} \cr 0 &amp; 0 &amp; { - {{\sqrt 5 } \over 6} + bi} &amp; { - {{\sqrt 5 } \over 6} - di} &amp; {{{\sqrt 5 } \over 3} - fi} \cr 0 &amp; 0 &amp; { - {{\sqrt 5 } \over 6} - bi} &amp; { - {{\sqrt 5 } \over 6} + di} &amp; {{{\sqrt 5 } \over 3} + fi} \cr } } \right], $(22)

where numerically a=0.7551280988643292d=0.2707664135274219b=0.9048040910575242e=0.3918568348616487c=1.1469849337259779f=0.6340376775301025.$\matrix{ {a = 0.7551280988643292} &amp; {d = 0.2707664135274219} \cr {b = 0.9048040910575242} &amp; {e = 0.3918568348616487} \cr {c = 1.1469849337259779} &amp; {f = 0.6340376775301025.} \cr } $

Again, we find that there are two sets of enantiomorphic pairs, shown in a CMP in Fig. 12. The resultant maps have a maximum transmission of 0.91 and 1.47 telescope fluxes, and hence an array efficiency of 79% compared to the Kernel-3 design, or 48% compared to the original Kernel-5 design. The shutters also effectively reduce the background of the outputs by a factor of 0.6, which results in an effective S/N 1.58. This is 44% of the S/N of the original five-telescope combiner (3.54), or 74% of the S/N of a three-telescope combiner with all telescopes functioning (2.12). This modified combiner would therefore still be adequate to continue the mission after a failure of two spacecraft.

thumbnail Fig. 10

Complex Matrix Plot of the ‘damaged’ Kernel-5 beam combiner with four telescope inputs.

thumbnail Fig. 11

Kernel maps of the ‘damaged’ Kernel-5 beam combiner with four telescope inputs. It is important to note the different scaling in the colour maps; this is due to a combination of the beam combiner architecture, along with the geometry of the array.

thumbnail Fig. 12

Complex Matrix Plot of the ‘damaged’ Kernel-5 beam combiner with three telescope inputs.

4.2 Modified X-array

While we stated that the X-array design does not allow for this redundancy advantage, this is only the case for the traditional beam combiner design consisting of two combiners with a π phase shift along the nulled baseline, and then a π2${\pi \over 2}$ phase chop of these nulled outputs. The X-array or Bracewell design could in fact be implemented in the same ‘Guyon’ -type beam combiner as described in Sect. 2.

Consider a combiner with m = 4 inputs and n = 6 beam splitter modules, shown in Fig. 13, along with the phase shifts and reflectance parameters found in Table 3. Other than the parameters and number of inputs and outputs, this design in identical to that of the Kernel-5 nuller described in Sect. 2. When the parameters are inserted into Eq. (5), we obtain the following transfer matrix: M=14[ 11111ii1i11i1111 ].$ {\bf{M}} = {1 \over {\sqrt 4 }}\left[ {\matrix{ 1 &amp; 1 &amp; 1 &amp; 1 \cr 1 &amp; i &amp; { - i} &amp; { - 1} \cr i &amp; 1 &amp; { - 1} &amp; { - i} \cr { - 1} &amp; 1 &amp; 1 &amp; { - 1} \cr } } \right]. $(23)

The middle two nulled rows of this matrix is equivalent to the middle two rows of the transfer matrix of the traditional X-array beam combiner found in equation 6 of LIFE4, with an alternative numbering of the telescopes. Hence this beam combiner could be used to produce the properties of the X-array as described in previous works (Quanz et al. 2022; Hansen et al. 2022). The benefit of this combination scheme is two-fold: there is an additional nulled output (albeit not a contribution to the kernel-null), and the same redundancy benefits as the Kernel-5 nuller apply. If a shutter (S1) is placed between modules A3 and A5, and defining the new transfer matrix in the same way as Eq. (17), we obtain the following ‘damaged’ transfer matrix: M˜=14[ 0111013+i13i23013i13+i230232343 ].$ {\bf{\tilde M}} = {1 \over {\sqrt 4 }}\left[ {\matrix{ 0 &amp; 1 &amp; 1 &amp; 1 \cr 0 &amp; {{1 \over 3} + i} &amp; {{1 \over 3} - i} &amp; { - {2 \over 3}} \cr 0 &amp; {{1 \over 3} - i} &amp; {{1 \over 3} + i} &amp; { - {2 \over 3}} \cr 0 &amp; {{2 \over 3}} &amp; {{2 \over 3}} &amp; { - {4 \over 3}} \cr } } \right]. $(24)

Here, we obtain an enantiomorphic pair in outputs two and three as demonstrated in the CMP representation in Fig. 14. Thus, if a telescope was to fail in this variant of the X-array, the remaining telescopes could move into a triangular position (like in the Kernel-3 nuller of LIFE4) and the beam combiner could still produce a robust observable. Such a map is shown in Fig. 15.

This map has a maximum transmission of 1.73 telescope fluxes, an efficiency of 58% compared to a fully functioning three-telescope combiner, or 43% compared to the undamaged X-array. As before, the shutter will reduce the background in the nulled outputs, this time by a factor of 0.75. This results in an effective S/N of 1.41; 50% of the original X-array combiner and 66% of the fully functioning Kernel-3 array. While this is substantially less than the 100% efficiency of the X-array with four telescopes, nonetheless this modified combiner would be adequate to continue on the mission in the event of a collector telescope failure.

thumbnail Fig. 13

Schematic of an X-array beam combiner based on the design of Guyon et al. (2013). The design is the same as in Fig. 1, except with four inputs and outputs, and the optical parameters found in Table 3.

Table 3

Optical parameters for the beam combiner design of the X-array, discussed in Sect. 4.2 and displayed in Fig. 13.

thumbnail Fig. 14

Complex Matrix Plot of the ‘damaged’ X-array type beam combiner with three telescope inputs.

thumbnail Fig. 15

Kernel map of the ‘damaged’ X-array type beam combiner with three telescope inputs.

5 Conclusion

In this work, we have provided a practical method to implement a Kernel-5 beam combiner, using a collection of adaptive nullers, spatial filters, beam splitters and phase shifting plates. Adaptive nullers can be used to negate any phase errors induced by imperfections in four of the beam splitting modules, leaving optical errors in the remaining six modules to contribute to errors in the remaining system, including null depth, null stability and kernel sensitivity. These also influence requirements in systematic phase offset errors of the interferometer, as well as RMS fringe tracking errors.

Taken with a beam splitter reflectance error of |ΔR| = 5%, and associated phase shift error of Δϕ = 3°, we find that in order to be photon limited and not limited by null fluctuations, we require a fringe tracking error less than 3 nm RMS. Furthermore, in order for the kernels to be appropriately sensitive to planets with a contrast of 1 × 10−7 over a bandpass from 4 to 19 μm, we find that the systematic phase error must be less than 0.5 nm.

We do note, however, that these limits are strongly dominated by the shorter wavelengths, and that at longer wavelengths the requirements lessen substantially. Obtaining high signal in the shorter wavelength regions (around 4 μm) will therefore prove to be harder than at longer wavelengths beyond approximately 8 μm.

We have also shown a major benefit of the described beam combiner implementation: in introducing a well placed shutter between a coupler of the beam splitter modules, the Kernel-5 combiner can function as a four-telescope combiner. This is a critical advantage if a collecting telescope were to fail or go offline. If these four telescopes were then placed into an X-array configuration, this modified combiner would produce an identical map to the original X-array architecture, albeit with a total throughput penalty of 15%, split over the two kernel outputs. This is offset by a reduction in the background due to the shutter, resulting in an S/N per telescope equal to 80% of the fully functional X-array, or an S/N reduction of 37% compared to the original array. A further telescope could also be removed with the addition of a second shutter, leading to a Kernel-3 type map with a relative S/N 74% of the equivalent Kernel-3 beam combiner. Finally, we note that the beam combiner of the X-array itself could be designed in a similar way, and providing the same benefits as the Kernel-5 nuller. If one of the X-array telescopes were to fail, a Kernel-3 type map could be created with an efficiency of 58% and relative S/N 66% compared to the Kernel-3 combiner – lower than the equivalent Kernel-5 design, but nonetheless adequate to continue scientific observations.

The next step forward would be to investigate physically constructing such a beam combiner in a laboratory, to test the assumptions about errors and uncertainties in this paper. Further-more, a more detailed study at the opto-mechanics of injection into a beam combiner like the one described would need to be addressed, for example how four telescopes in a rectangular formation could inject into the combiner designed for five in a pentagonal formation.

The advantage of telescope redundancy, along with the sensitivity advantages as discussed in LIFE4, further adds credence to the Kernel-5 beam combiner, with five telescopes in a pentagonal configuration, as the ideal architecture for the LIFE mission. We therefore suggest that future studies consider adopting this architecture in their analysis of future science and technological requirements for space-based mid-infrared nulling interferometry.

Acknowledgements

We acknowledge and celebrate the traditional custodians of the land on which the Australian National University is based, the Ngunnawal and Ngambri peoples, and pay our respects to elders past and present. This research was supported by the ANU Futures scheme and by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (project DP200102383) and the Australian Government Research Training Program. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement no. 866070). We also thank members of the LIFE team for their constructive feedback. Data is available upon request to the author.

Appendix A Tuning the null depth

As mentioned in the main text, we can use the alignment procedure intermittently during observations to correct for alignment drifts. In this appendix, we show that the integration time required to correct this is small enough to realistically calibrate the null.

First, we assume that for a generic nuller, the stellar flux is much larger than the background. We first derive the integration time needed to correct an alignment drift of phase for a single nulled output. To correct for this, we assume that we modulate the adaptive nuller by an amplitude of ±Δϕ such that the intensity in the nulled outputs is significantly higher than the background. In this regime, far away from the null bottom, drifts in phase vary quadratically with the intensity (ϕI)$\left( {\phi \propto \sqrt I } \right)$.

Let the background photon rate per telescope (which can include stellar leakage) be b, and the stellar flux rate per telescope be s. The nulled output photon rate is then n=12sΔϕ$ n = {1 \over 2}s{\rm{\Delta }}\phi $(A.1)

with total number of photons N=12sTΔϕ2.$ N = {1 \over 2}sT{\rm{\Delta }}{\phi ^2}. $(A.2)

For a simplistic requirement on the uncertainty in Δϕ, σ, we require that 12sσ2bσ22bs.$ {1 \over 2}s{\sigma ^2} \ll b \Rightarrow {\sigma ^2} \ll {{2b} \over s}. $(A.3)

We note that there is a more complex question regarding kernel nulling calibration, which is beyond the scope of this discussion. We can derive this uncertainty as follows: Δ(Δϕ)ΔN=12TNs$ {{{\rm{\Delta }}\left( {{\rm{\Delta }}\phi } \right)} \over {{\rm{\Delta }}N}} = \sqrt {{1 \over {2TN\,s}}} $(A.4) Δ(Δϕ)=σ=12Ts,$ {\rm{\Delta }}\left( {{\rm{\Delta }}\phi } \right) = \sigma = \sqrt {{1 \over {2T\,s}}} , $(A.5)

and so the requirement on σ becomes 12Ts2bsT14b.$ {1 \over {2T\,s}} \ll {{2b} \over s} \Rightarrow T \gg {1 \over {4b}}. $(A.6)

The relevant background rate is the zodiacal light in a single channel, which is about 1 photon per second in a 1% bandpass at 4 μm. So, as long as the phase offset between the fringe tracker and nuller does not drift on a timescale shorter than 1 s, there is no problem repeating this calibration. This is also short enough to not greatly impact the amount of integration time spent on science data.

The same derivation applies to drifts in amplitude and for each of the outputs, and so with approximately 8 of these calibrations, we need about 8 s of callibration for each science observing block.

Appendix B Derivation of the S/N metric

In this appendix, we give a brief derivation of the S/N equations introduced in the text. Let f be the flux of the planet and mi be the transmission of the combiner for the ith output, kernel or map (henceforth output). Let the noise of this output be σi. The measured signal for each output is then si=mi×f.$ {s_i} = {m_i} \times f. $(B.1)

An estimator for the flux and uncertainty in the flux can be derived: f^=simi   σ^(f^)=σimi.$ \matrix{ {\hat f = {{{s_i}} \over {{m_i}}}\quad \quad \quad } &amp; {\hat \sigma \left( {\hat f} \right) = {{{\sigma _i}} \over {{m_i}}}.} \cr } $(B.2)

The flux-normalised signal to noise ratio, S/N, for each output can be written as S/Ni=1f^f^σ^(f^)=miσi.$ {S \mathord{\left/ {\vphantom {S {{N_i} = {1 \over {\hat f}}{{\hat f} \over {\hat \sigma \left( {\hat f} \right)}} = {{{m_i}} \over {{\sigma _i}}}.}}} \right. \kern-\nulldelimiterspace} {{N_i} = {1 \over {\hat f}}{{\hat f} \over {\hat \sigma \left( {\hat f} \right)}} = {{{m_i}} \over {{\sigma _i}}}.}} $(B.3)

Now, the inverse variance weighted average flux, fw, can be calculated with f^w= if^σ^(f^)2 i1σ^(f^)2= imisiσi2 imi2σi2,$ {\hat f_w} = {{\sum {_i{{\hat f} \over {\hat \sigma {{\left( {\hat f} \right)}^{\rm{2}}}}}} } \over {\sum {_i{1 \over {\hat \sigma {{\left( {\hat f} \right)}^{\rm{2}}}}}} }} = {{\sum {_i{{{m_i}{s_i}} \over {\sigma _i^{\rm{2}}}}} } \over {\sum {_i{{m_i^{\rm{2}}} \over {\sigma _i^{\rm{2}}}}} }}, $(B.4)

with a variance of σ^(fw)2=1 i1σ^(f^)2=1 imi2σi2.$ \hat \sigma {\left( {{f_w}} \right)^{\rm{2}}} = {1 \over {\sum {_i{1 \over {\hat \sigma {{\left( {\hat f} \right)}^{\rm{2}}}}}} }} = {1 \over {\sum {_i{{m_i^{\rm{2}}} \over {\sigma _i^{\rm{2}}}}} }}. $(B.5)

The weighted flux-normalised S/N is then S/Ntotal=1f^wf^wσ^(fw)=imi2σi2=i(S/Ni)2.$ {S \mathord{\left/ {\vphantom {S {{N_{total}} = {1 \over {{{\hat f}_w}}}{{{{\hat f}_w}} \over {\hat \sigma \left( {{f_w}} \right)}} = \sqrt {\sum\limits_i {{{m_i^{\rm{2}}} \over {\sigma _i^{\rm{2}}}}} } = \sqrt {\sum\limits_i {{{\left( {{S \mathord{\left/ {\vphantom {S {{N_i}}}} \right. \kern-\nulldelimiterspace} {{N_i}}}} \right)}^{\rm{2}}}} } }}} \right. \kern-\nulldelimiterspace} {{N_{total}} = {1 \over {{{\hat f}_w}}}{{{{\hat f}_w}} \over {\hat \sigma \left( {{f_w}} \right)}} = \sqrt {\sum\limits_i {{{m_i^{\rm{2}}} \over {\sigma _i^{\rm{2}}}}} } = \sqrt {\sum\limits_i {{{\left( {{S \mathord{\left/ {\vphantom {S {{N_i}}}} \right. \kern-\nulldelimiterspace} {{N_i}}}} \right)}^{\rm{2}}}} } }}. $(B.6)

References

  1. Beichman, C. A., Woolf, N. J., & Lindensmith, C. A. 1999, The Terrestrial Planet Finder (TPF): A NASA Origins Program to Search for Habitable Planets, Tech. rep., Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California [Google Scholar]
  2. Bracewell, R. N. 1978, Nature, 274, 780 [NASA ADS] [CrossRef] [Google Scholar]
  3. Dannert, F. A., Ottiger, M., Quanz, S. P., et al. 2022, A&A, 664, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Defrère, D., Absil, O., & Beichman, C. A. 2018, in Handbook of Exoplanets, eds. H. J. Deeg, & J. A. Belmonte, 82 [Google Scholar]
  5. Gretzinger, T., Gross, S., Arriola, A., & Withford, M. J. 2019, Opt. Express, 27, 8626 [NASA ADS] [CrossRef] [Google Scholar]
  6. Guyon, O., Mennesson, B., Serabyn, E., & Martin, S. 2013, PASP, 125, 951 [NASA ADS] [CrossRef] [Google Scholar]
  7. Hansen, J. T., Ireland, M. J., & LIFE Collaboration 2022, A&A, 664, A52 (LIFE4) [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  8. Kammerer, J., & Quanz, S. P. 2018, A&A, 609, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Kenchington Goldsmith, H.-D., Cvetojevic, N., Ireland, M., & Madden, S. 2017, Opt. Express, 25, 3038 [NASA ADS] [CrossRef] [Google Scholar]
  10. Konrad, B. S., Alei, E., Quanz, S. R., et al. 2022, A&A, 664, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Laugier, R., Cvetojevic, N., & Martinache, F. 2020, A&A, 642, A202 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. Lay, O. P. 2004, Appl. Opt., 43, 6100 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
  13. Lay, O. P. 2006, SPIE Conf. Ser., 6268, 62681A [NASA ADS] [Google Scholar]
  14. Lay, O. P., Jeganathan, M., & Peters, R. 2003, SPIE Conf. Ser., 5170, 103 [NASA ADS] [Google Scholar]
  15. Lay, O. P., Gunter, S. M., Hamlin, L. A., et al. 2005, SPIE Conf. Ser., 5905, 8 [NASA ADS] [Google Scholar]
  16. Léger, A., Mariotti, J., Mennesson, B., et al. 1996, Icarus, 123, 249 [CrossRef] [Google Scholar]
  17. Martinache, F., & Ireland, M. J. 2018, A&A, 619, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Mennesson, B., & Mariotti, J. M. 1997, Icarus, 128, 202 [NASA ADS] [CrossRef] [Google Scholar]
  19. Mennesson, B., Léger, A., & Ollivier, M. 2005, Icarus, 178, 570 [NASA ADS] [CrossRef] [Google Scholar]
  20. Peters, R. D., Lay, O. P., & Lawson, P. R. 2010, PASP, 122, 85 [CrossRef] [Google Scholar]
  21. Quanz, S. P., Ottiger, M., Fontanet, E., et al. 2022, A&A, 664, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  22. Voyage 2050 Senior Committee 2021, Voyage 2050 - Final Recommendations from the Voyage 2050 Senior Committee [Google Scholar]
  23. Woolf, N. J., & Angel, J. R. P. 1997, in Planets Beyond the Solar System and the Next Generation of Space Missions, Astronomical Society of the Pacific Conference Series, ed. D. Soderblom, 119, 285 [NASA ADS] [Google Scholar]

All Tables

Table 1

Optical parameters for the beam combiner design discussed in Sect. 2 and displayed in Fig. 1.

Table 2

Minimum fringe tracking requirements to remain photon limited over 4–19 μm for different stellar types and different optical error amounts at 5 pc.

Table 3

Optical parameters for the beam combiner design of the X-array, discussed in Sect. 4.2 and displayed in Fig. 13.

All Figures

thumbnail Fig. 1

Schematic of a Kernel-5 beam combiner, based on the design of Guyon et al. (2013). Inputs V1 through V5 pass through five adaptive nuller units (AN), a series of ten beam splitter modules (A1 through A10) consisting of a beam splitter and phase shifting plate on one input, and four spatial filters (SF). The five outputs consist of one bright output W1 and four nulled outputs W2 through W4. Two shutters (S) can be used in case of a telescope failure (see Sect. 4). The inset shows the telescope configuration and their corresponding inputs.

In the text
thumbnail Fig. 2

Instrument transmission per telescope flux as a function of angular separation from the optical axis for each output pair corresponding to the two kernels (i.e. W4 for kernel 1 and W2 for kernel 2). For each output, a slice through the optical axis was taken in the horizontal and vertical directions; due to the symmetry about the horizontal axis shown in the inset in Fig. 1, the vertical component is symmetric about zero.

In the text
thumbnail Fig. 3

Standard deviation of each kernel as a function of angular position, given as a percentage of the total array flux, for |ΔR| = 5%.

In the text
thumbnail Fig. 4

Modulation efficiency (RMS azimuthal average) of the kernel maps as a function of radius. The average with no beam combiner optical errors is overlaid on the average of twenty random draws with |ΔR| = 5%. The random draws themselves are also plotted with low opacity.

In the text
thumbnail Fig. 5

Base-10 logarithm of the ratio of the stellar leakage to zodiacal light against wavelength for two different stars and varying amounts of beam combiner optical error. The black dashed line divides the upper region where the combiner is dominated by stellar leakage, and the lower region where the instrument is zodiacal limited. The inset highlights the proximity of the different optical error lines for kernel 1.

In the text
thumbnail Fig. 6

Base-10 logarithm of the ratio of the null fluctuation noise to background noise against wavelength for a fringe tracking RMS of δ = 5 nm. The background noise is chosen to be the maximum of stellar leakage and zodiacal light for that given wavelength. Plotted for two different stars and varying amounts of beam combiner optical error. The black dashed line divides the upper region where the combiner is dominated by instrumental errors induced by fluctuations in the null depth, and the lower region where the instrument is photon limited by either stellar leakage or the zodiacal background.

In the text
thumbnail Fig. 7

Error in the kernel, plotted against systematic piston error in nanometers for a variety of optical beam combiner errors. The dotted line at 1 × 10−7 represents the point where the kernel should be sensitive enough to detect an Earth-like planet around a solar type star.

In the text
thumbnail Fig. 8

Relationship between the quadratic coefficients of Fig. 7b and their associated error in the beam combiner |ΔR|.

In the text
thumbnail Fig. 9

Base-10 logarithm of the ratio of the stellar leakage to zodiacal light against wavelength for two different stars, varying amounts of beam combiner optical error. Chromatic phase error has been induced by a delay chop.

In the text
thumbnail Fig. 10

Complex Matrix Plot of the ‘damaged’ Kernel-5 beam combiner with four telescope inputs.

In the text
thumbnail Fig. 11

Kernel maps of the ‘damaged’ Kernel-5 beam combiner with four telescope inputs. It is important to note the different scaling in the colour maps; this is due to a combination of the beam combiner architecture, along with the geometry of the array.

In the text
thumbnail Fig. 12

Complex Matrix Plot of the ‘damaged’ Kernel-5 beam combiner with three telescope inputs.

In the text
thumbnail Fig. 13

Schematic of an X-array beam combiner based on the design of Guyon et al. (2013). The design is the same as in Fig. 1, except with four inputs and outputs, and the optical parameters found in Table 3.

In the text
thumbnail Fig. 14

Complex Matrix Plot of the ‘damaged’ X-array type beam combiner with three telescope inputs.

In the text
thumbnail Fig. 15

Kernel map of the ‘damaged’ X-array type beam combiner with three telescope inputs.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.