StarDICE I: sensor calibration bench and absolute photometric calibration of a Sony IMX411 sensor

The Hubble diagram of type-Ia supernovae (SNe-Ia) provides cosmological constraints on the nature of dark energy with an accuracy limited by the flux calibration of currently available spectrophotometric standards. The StarDICE experiment aims at establishing a 5-stage metrology chain from NIST photodiodes to stars, with a targeted accuracy of \SI{1}{mmag} in $griz$ colors. We present the first two stages, resulting in the calibration transfer from NIST photodiodes to a demonstration \SI{150}{Mpixel} CMOS sensor (Sony IMX411ALR as implemented in the QHY411M camera by QHYCCD). As a side-product, we provide full characterization of this camera. A fully automated spectrophotometric bench is built to perform the calibration transfer. The sensor readout electronics is studied using thousands of flat-field images from which we derive stability, high resolution photon transfer curves and estimates of the individual pixel gain. The sensor quantum efficiency is then measured relative to a NIST-calibrated photodiode. Flat-field scans at 16 different wavelengths are used to build maps of the sensor response. We demonstrate statistical uncertainty on quantum efficiency below \SI{0.001}{e^-/\gamma} between \SI{387}{nm} and \SI{950}{nm}. Systematic uncertainties in the bench optics are controlled at the level of \SI{1e-3}{e^-/\gamma}. Uncertainty in the overall normalization of the QE curve is 1\%. Regarding the camera we demonstrate stability in steady state conditions at the level of \SI{32.5}{ppm}. Homogeneity in the response is below \SI{1}{\percent} RMS across the entire sensor area. Quantum efficiency stays above \SI{50}{\percent} in most of the visible range, peaking well above \SI{80}{\percent} between \SI{440}{nm} and \SI{570}{nm}. Differential non-linearities at the level of \SI{1}{\percent} are detected. A simple 2-parameter model is proposed to mitigate the effect.


Introduction
The calibration of wide-field optical surveys is a subject of active research that is driven by requirements from the use of type Ia supernovae to measure the evolution of luminosity distance with redshift (the Hubble diagram), one of the main probes of dark energy.Essentially, this measurement involves the comparison of the apparent fluxes of supernovae at different redshifts.Errors in the differential flux calibration between the bluer photometric bands, in which the low-redshift events are observed, and the redder bands, in which high redshift events fall, translate directly into a systematic error in the Hubble diagram.In recent studies, the contribution of the calibration uncertainty to the total uncertainty on the dark energy equation of state parameter has been A119, page 1 of 18 Fig. 1.Metrology chain of the StarDICE experiment: with light detectors in the left column and light sources in the right column.Each arrow represents a step in the chain, and the label gives the order of magnitude of the beam intensity, irradiance, or spectral irradiance depending if the beam is contained or extended and monochromatic or not.The steps in gray were conducted at NIST (Houston 2008) and result in a silicon photodiode calibrated against an electrical substitution cryogenic radiometer.The two steps in red are covered in this paper and provide calibrated sensors with a much lower dark current than the photodiode.The rest of the chain leading to astrophysical standards will be the subject of forthcoming papers.
brought down to a level comparable to the statistical uncertainty (Betoule et al. 2014;Scolnic et al. 2018;Jones et al. 2019).The next generation of instruments, in particular the Vera C. Rubin observatory, requires a substantial step forward in terms of photometric accuracy to benefit from the one-order-of-magnitude increase in the available statistics.
StarDICE is one of the experiments aimed at establishing a metrology chain between laboratory flux references (silicon photodiodes calibrated by NIST) and stars from the CALSPEC library of spectrophotometric standards (Bohlin et al. 2020).As supernova surveys are calibrated relatively to these standard stars, establishing this chain with sufficient accuracy essentially solves the calibration issue of the Hubble diagram.
With dark currents on the order of 100 fA, however, the reference photodiodes cannot be used to reliably measure irradiances fainter than 1 × 10 −11 W cm −2 , and intermediate steps are required to bridge the remaining gap leading to the irradiance of mag 13 stars on the order of 1 × 10 −19 W nm −1 cm −2 .Experiments vary in the way they achieve and control the flux reduction between the calibration reference and the target instrument.Examples of proposed strategies are diffusion on a screen (Stubbs et al. 2007(Stubbs et al. , 2010;;Tonry et al. 2012;Marshall et al. 2013;Ferguson et al. 2020) and diffusion in a sphere of short light pulses (Coughlin et al. 2018).The most advanced experiment to date (Lombardo et al. 2017;Küsters 2019;Küsters et al. 2020) uses a combination of diffusion in integrating spheres together with extremely sensitive calibrated photodiodes (Küsters et al. 2022).
The StarDICE proposal consists of a five-stage chain depicted in Fig. 1, which relies on the near-field calibration of an ultra-faint (less than 1 µW of optical power) and stable light source, which in turn serves as a distant (∼100 m) in situ calibration reference for a small astronomical telescope.To fulfill this task, sensitive calibrated devices, either cooled complementary metal oxide semiconductor image sensors (CMOS image sensors or CIS) or charged coupled devices (CCD), are required to rapidly and accurately map the radiant intensity of this artificial star at a short distance (∼20 cm).The present paper, the first in a series describing the chain, details the setup of a spectrophotometric test bench built to transfer the NIST photodiode calibration to the nearby calibration sensor with the required ∼0.1% accuracy, and demonstrating the first two steps from the chain.
The choice of a suitable sensor deserves some discussion.Since the early 2000s, many applications of image sensors have gradually shifted from CCD to CIS.In the field of astronomy, however, the important characteristics are quantum efficiency (especially in the near infrared; QE), uniformity, and linearity in the response (provided simple corrections such as flat-fielding), and sensitive area.In the case of these characteristics, the field has been well served by the developments in CCD technology such as infrared sensitive ultra-thick substrates, multilayer coatings, four-sided buttable sensors, and increase in the number of output amplifiers.A recent example is given by the Teledyne e2v CCD250-82 sensor developed for the camera of the Vera C. Rubin Observatory (Juramy et al. 2014;O'Connor et al. 2016), which combines all these recent developments.As a consequence, professional astronomy experiments still mainly rely on CCDs.Nevertheless, the last decade has seen increased availability of QE-boosting technologies such as backside illumination (BSI) in CIS, improvements in the performance and arrangement of column-parallel ADCs allowing larger area and three-sided buttable sensors.Specific applications are already making use of the advantages of CIS, such as the ability to address subsets of individual pixels as in the TAOS-II project (Huang et al. 2021), which uses the Teledyne CIS113 (Wang et al. 2020) with deported ADCs.Another useful characteristic of CIS is the achievement of high frame rates coupled with low readout noise (≲1 e − ) thanks to highly parallel readout.This feature has important potential applications in high-resolution groundbased astronomy, which requires short exposure times (∼10 ms) to freeze the speckle pattern caused by atmospheric turbulence.
In this first study, we characterized a camera equipped with the recent 150 mpixel IMX411ALR Sony CMOS sensor as implemented in the QHY411M camera.With a large 53 mm × 40 mm sensitive area, low dark current without the need for extensive cooling, and negligible read-out noise, this sensor appears very convenient for the direct mapping of low irradiance we are pursuing.
The interest in this camera also extends well beyond the StarDICE experiment.The sampling of the large area by small 3.76 µm side pixels makes the sensor well suited for wide-field 1m-class telescopes.This kind of instrument is particularly useful in the follow-up of gravitational waves and neutrino events or as a survey instrument for the serendipitous discovery of optical transients such as supernovae or kilonovae.This field of research requires a monitoring of multiple sources rapidly fading and poorly localized (>1 deg 2 Abbott et al. 2018).The Global Rapid Advanced Network Devoted to Multi-messenger Addicts (GRANDMA, Antier et al. 2020;Aivazyan et al. 2022) needs telescopes with photometric performance similar to the Zwicky Transient Facility (Bellm et al. 2018) (which has a magnitude limit of 21 for 30 s exposures in the r band) at a significantly lower cost to be replicated at different sites.It is particularly interesting to obtain high efficiency in the near-infrared band since it allows the imaging of the peak of the red kilonova component even if the blue is too weak to be detected by 1m-class telescopes and alert the sensitive spectrographs (Metzger 2019; A119, page 2 of 18 Andreoni et al. 2019;Abbott et al. 2017).The unknown feature here is the absolute quantum efficiency of the camera over the required wavelength range.A secondary aim of the paper is therefore to study the camera and measure its quantum efficiency as a reference for the GRANDMA collaboration or other potential applications in astronomy.
The rest of the paper is organized as follows.Section 2 provides a detailed description of the instrumental setup.Section 3 gives an overview of the measurement campaign and describes the measurement process in detail.The main results and associated uncertainties are discussed in Sect. 4. The results are summarized and discussed in Sect. 5.

Description of the sensor calibration bench
The StarDICE sensor calibration bench is designed to calibrate CCD or CMOS cameras relative to a flux standard established by NIST (Houston 2008).The light illumination system has been designed to provide both flat-fielding and monochromatic local illumination capabilities in order to fully map the camera quantum efficiency and readout gain.
The camera tested in this study is a QHY411M equipped with a back-illuminated monochrome CMOS sensor IMX411ALR.The sensor illuminated area consists of 10 654 pixels × 14 206 pixels (151.3Mpixels) with a pixel width of 3.76 µm and a saturation capacity around 80 ke − .The sensor has a rolling electronic shutter 1 , and its ADC array allows a 16-bit readout of the pixel array at a maximum frame rate of 2 image s −1 , with adjustable gain.The camera thermoelectrically cools the sensor down to −45 °C (adjustable), and the generated heat is extracted by a closed-loop circulation of water at 16 °C.The QHY411 camera is connected to the control computer through a USB3 interface, and commands are sent through a python wrapper on top of the QHY SDK 2 .The data were acquired in exposure mode, not in video mode, which we did not manage to work with from Linux.The tunable electronic bias and amplifier gain of the camera were kept fixed to their respective values of 100 and 60, which delivered a gain on the order of 1.2 e − ADU −1 , effectively mapping the sensor dynamics 3 over the 16 bits.
The test camera is mounted on a two-axis linear stage in an optical test bench, where it can intercept either a wavelength tunable f /10 monochromatic light beam or a flat-field illumination beam.Other light sensors, in particular a NIST calibrated Hamamatsu S2281 photodiode, are mounted in the same plane and can in turn intercept the same beams.A sketch description of the setup is given in Fig. 2.
Intercalibration between detectors is obtained by taking the ratio of their response to illumination by the same monochromatic light beam.Measurements of the detector uniformity and photon transfer curves (PTC) are obtained from flat-field exposures in the second beam.

Monochromatic beam
The monochromatic light beam is obtained by imaging the horizontal output slit of an integrating sphere (IS1) on the vertical 1 See e.g., Pace (2021) for a description of CMOS image sensors shuttering. 2 https://github.com/ebertin/qhpyccd 3The single pixel full well capacity is expected to be 80 ke − according to the manufacturer website: https://www.qhyccd.com/scientific-camera-qhy411-qhy461/ entrance slit of a Czerny-Turner monochromator (SP-DK240) using a relay of two 90°off-axis parabolic mirrors (M1 and M2).
The output slit of the monochromator is then imaged on an aperture stop in front of the detector plane by a second relay (M3 and M4).The focal length of the monochromator is 240 mm and its turret hosts three 68 mm × 68 mm gratings, accepting input beams with a recommended minimal f number of 3.9.The diameter and focal length of the off-axis parabolic mirrors (OAP) are given in Table 1.The first relay shapes an f /4 input beam which is then extended to f /12 by the second relay.The use of mirrors makes the position of the focus point and image shape conveniently achromatic.Illumination in IS1 is provided by an electronic board in the 3 inch port featuring 48 LEDs at different central wavelengths.The flux level of each LED is tunable by software.Typically only one is shining at any given time to minimize pollution of the output beam by out-of-band light.A switchable Ar-Hg arc lamp (not shown in Fig. 2) is fit to the remaining port of the sphere and is used to determine the wavelength calibration and resolution of the monochromator.Fig. 3. Wavelength calibration model for monochromatic calibration beam.Results for the blue-optimized grating in use for wavelengths shorter than 650 nm are presented in the left column, and results for the infrared-optimized grating are presented in the right column.Top row: observed spectrum of the Hg-Ar arc lamp (black dots) and best-fit model (red curve).Second row: Difference in the observed spectrum and the best-fit model.Third row: wavelength-calibration component of the best-fit model, that is, the quantity to be added to the monochromator set wavelength (λ M ) to match the theoretical wavelength of the observed lines (λ).Last row: residuals to the wavelength calibration, that is, the difference in measured and theoretical line wavelength in the air remaining after wavelength calibration for the brightest lines in the lamp spectrum.The RMS in the difference is, respectively, 0.17 nm and 0.16 nm for the blue and red gratings.
Our target wavelength resolution is on the order of 2 nm, with gratings of 1200 g mm −1 and a Ebert angle of 18.7°.The theoretical dispersion relation therefore varies between 2.92 nm mm −1 and 3.47 nm mm −1 over the 1100-300 nm wavelength range.A slit width of 635 µm delivers a full width at half maximum (FWHM) ranging between 1.86 nm and 2.2 nm.We keep both slits at this fixed width across the entire range to avoid changing the image position and the wavelength calibration.The vertical extent of the image is set by the opening of IS1 output slits, which is set around 500 µm.In this setting, the spatial extent of the image is about 2 mm × 3 mm in the sensor plane.Due to the relatively low wavelength resolution it is necessary to build a full forward model of the measured spectrum of the Hg-Ar calibration arc lamp to derive accurate wavelength calibration from unresolved doublets or triplets of the lamp.The observed spectrum is modeled as a series of lines of adjustable amplitude convolved with a triangular wavelength response whose width is allowed to vary linearly with wavelength.The wavelength calibration is adjusted as a second-order polynomial of the wavelength.The background light is developed on a 13-knot B-spline.The model provides a fairly good description of the spectral lamp data as illustrated in Fig. 3 for the two gratings in use.The wavelength correction curves shown in the third row of the figure are applied to all subsequent measurements and bring the uncertainty on the wavelength calibration well below 1 nm.
The 50% reflective beam splitter inserted after the last mirror redirects a fraction of the flux on a CMOS IMX174LLJ sensor that monitors the time stability of the illumination for a given wavelength.To further clean the main beam of stray light and make the image extension on the detector perfectly stable, the light is focused on a fixed aperture stop slightly smaller than the image, behind which the slowly diverging beam reaches the  detectors.Finally, a shutter at the entrance of the monochromator is used to shut the beam off during measurements of the ambient and stray light level.

Flat-field illumination beam
The flat-field light beam is provided by a second integrating sphere (IS2) whose 1 ′′ output port shines directly on the detectors.The light is provided by 16 LEDs in a 2 ′′ port without wavelength filtering, so that the spectrum of the light emitted out of the sphere corresponds roughly to the narrow spectrum of the shining LED.A measurement of the output spectrum for each of the 16 independent channels is given in Fig. 4. The central wavelength ( λ) and spectral width (FWHM), together with the wavelength of the flux maxima (λ peak ) in all channels are provided in Table 2.The intensity of the current flowing in each LED can be tuned independently by software.The integrated surface brightness in the sphere is monitored by a photodiode inserted in the last 1 inch port of the sphere.Over the full extent of the large IMX411 sensor, the illumination varies by 2.4% peak to valley, according to the mapping of the beam reconstructed from the images taken during the uniformity scan (see Sect. 4.4).

Dataset
Throughout this study, we performed five different kinds of measurements: dark current and readout noise measurements for which the sensor is protected from any illumination by an aluminum front cover; stability measurements for which the camera is kept fixed in front of the flat-field beam at four different (stabilized) temperatures to study temperature dependence; photon transfer curve and linearity measurements for which the illumination level is varied by changing either the source brightness or the exposure time; uniformity measurements scanning the sensor sensitive surface; and quantum-efficiency measurements for which we interleaved measurements of the brightness of the monochromatic beam with the camera and the reference photodiode.This last measurement is repeated scanning every nanometer in wavelength.We detail the specifics of each measurement  in the following, except for the dark current measurement, which is straightforward.

Stability measurements
A large series of images were obtained with the camera roughly aligned on the center of the flat-field beam to test the readout electronics.The fluctuation of the illumination during a sequence at a given flux level can be estimated from measurements with the monitoring photodiode installed in IS2.The fluctuations in the illumination typically drop below 10 −4 RMS (root mean square), after a warm-up period of 1000 s, as is shown in Fig. 5.The stability measurement is performed around 510 nm using LED channel 4. A stabilized current of 10 mA is established in the LED and the thermoelectric cooling system of the sensor is set to a target temperature.This was repeated at four set A119, page 5 of 18 A&A 670, A119 (2023) points between −5 and 10 °C.The exposure time is set to 500 ms, which results in an average pixel charge of 36 ke − .A first batch of I = 100 exposures are taken to wait for the cooling and stabilization of the system temperature.The stability measurement is performed on a subsequent batch of I = 100 exposures.Denoting q p,i as the count measured in pixel p in image number i, all images containing the same number of pixels P, the following statistics can be gathered on the fly for each batch of images: V δ p = 1 As described in Appendix A, the readout gain can be reconstructed from the relation between the temporal mean of individual pixels M p and the mean of the squares V 0 p , while the mean of individual images m i tracks fluctuations in the illumination level.The individual images are not stored in this approach.Although the absence of storage prevents the use of robust statistics, it is preferred in this study due to the very large size of individual images.Inter pixel products (V δ p for δ > 0) can be stored to study inter-pixel covariances.This is, however, not needed for the stability study.We keep track of the cooling power of the sensor at the end of image i and I phd (t) the photocurrent delivered by the photodiode during the sequence.

Photon transfer curve and linearity measurements
The measurement of the PTC is similar to that of the stability except that the sensor temperature is now kept fixed at 0 °C, and the exposure time and LED current are varied.Below, we combine data from six different sets whose characteristics are given in Table 3. Sets 1-3 are dedicated to a fine measurement of the gain and linearity on the first third of the scale <15 ke − , which covers the entire dynamic range of the QE measurement.To mitigate potential deviations from Poissonian statistics due to the brighter-fatter effect (see below), the statistics, M p and V p , are computed on binned pixels and divided by the number of pixels in the bins to lie on the same ADU scale as the non-binned statistics.Sets 4-6 sample the full scale without binning.
Outside of the sensor calibration bench, we also gathered two supplemental datasets using a simpler illumination source to enable the experimental study of correlations between neighboring pixels as a signature for inter-pixel capacitance or brighterfatter effects (dataset A) and mapping of the individual pixel gain using a large number of images (datasets B and C).In this configuration, the camera was capped with a 3D-printed hollow cone holding a simple 450 nm LED behind a diffusive screen.Switching to an external illumination source allowed us to free the bench for the calibration of another camera while taking these time-consuming measurements.The illumination stability with this alternate source is, however, one order of magnitude worse than what is achieved on the bench.Therefore, we do not rely on these supplemental datasets for our baseline determination of the gain.For study A, in addition to the statistics presented in the previous section, we also gather V 01 p and V 10 p , the mean of the cross-products between immediately neighboring pixels along a line and a column.

Uniformity measurements
The measurement of the uniformity of the camera response is similar to that of the stability, but the camera is moved through 25 positions in a 5 × 5 grid spanning a square with a 60 mm side with 15 mm resolution in the flat-field beam.The goal is to disentangle variations in the camera response from spatial variations of the illumination.The measurement is only carried at a sensor temperature of 0 °C but is repeated for the sixteen LEDs to look for wavelength dependencies.All LEDs are operated at a stabilized current of 10 mA where they deliver on the order of 60 ke − s −1 in a single camera pixel.The exposure time is set to 0.3 s so that none of the pixels saturate.

Quantum-efficiency measurements
The quantum-efficiency measurement is performed in the 375-1078 nm range, turning on each of the 26 relevant LEDs, one at a time.After turning on one LED, the camera is set to intercept the monochromatic beam, and the corresponding wavelength range is scanned with a 1 nm step size.For each wavelength, two successive camera exposures are taken, one with the beam shutter (S) closed and a second with the beam shutter opened.Simultaneously, the same exposures are obtained on the monitoring sensor.Once the wavelength range is exhausted, the reference photodiode is brought to intercept the monochromatic beam, and a similar scan is performed using the monitoring sensor and the photodiode.The quantum-efficiency estimate is built from these four measurements as where Φ is the measured ADU count in the camera, G is the camera gain estimate in e − ADU −1 , τ exp is the exposure time in seconds, ϵ NIST is the quantum efficiency of the reference sensor in e − /γ, I NIST is the photocurrent delivered by the reference sensor in ampere, e is the elementary charge in coulomb, and is the ratio of the count rate in the monitoring sensor during the reference photodiode and camera exposures.We further detail the computation of the different terms in what follows.

Photodiode operations
Our reference sensor is a Hamamatsu S2281 photodiode, with a circular photosensitive area of 1 cm 2 (5.65 mm radius).The measurement of its quantum-efficiency curve ϵ NIST (λ) performed at NIST (Houston 2008) along with associated uncertainties is reproduced in Fig. 6.
Originally built with a non-cooled camera in mind, the heat generated by the focal plane cooling slightly exceeded the heat dissipation capability of the sensor calibration bench enclosure resulting in a nominal operation temperature in the 26.4 °C to 26.6 °C range.This temperature is slightly higher than the target 23 °C temperature at which the reference NIST sensor was calibrated.This introduces a systematic uncertainty in the infrared region due to the change of the photodiode quantum efficiency with temperature.According to Houston (2008, Fig. 9.1), the change is only significant beyond 1000 nm, where it amounts to 0.2% °C−1 .
Centering of the photodiode in the monochromatic light beam is performed by scanning in x and y at constant illumination past the edges of the photodiode sensitive area.The photodiode is then moved back to the barycenter of the resulting flux map.
The flux level of the monochromatic beam was designed to be relatively low in order to avoid saturation of CCD cameras in relatively long exposures (larger than 1 s), allowing its use to calibrate cameras with mechanical shutters while minimizing exposure time uncertainties.As a consequence, sensitive readings of the photocurrent required us to select a large feedback resistance in the transimpedance amplifier and limited bandwidth.The photocurrent out of the photodiode is sampled at 7.8 Hz by a Keithley 6514 picoammeter.At any given wavelength, the photocurrent is sampled for 30 s, starting 10 s before the opening of the monochromatic beam shutter.The shutter is left open for 10 s and closed again for the end of the operation.The average shape of the photocurrent reading is plotted in Fig. 7.We constructed three numerical averages -I 0 , I 1 , and I 2 -of the readings before, during, and after the light exposure, avoiding the transition regions.We constructed the photocurrent estimate as I NIST = I 1 − (I 0 + I 2 )/2, and estimated uncertainties on I NIST from the standard deviation of the samples in each of the quantities as σ 2

Camera operations during quantum-efficiency measurements
We applied the same settings for the quantum-efficiency measurement as for the PTC measurement, so we can assume the readout gain to be identical.In normal mode, we select 2s exposures for both the "dark" and "open" exposures.
The estimate of the camera count Φ at each wavelength is built in several steps.First we compute the centroid of the slit image in the camera by building a stack of all images from a complete wavelength scan.The stack is built as the sum of all open images minus the sum of all dark images.Second, we perform the photometry of all images in circular apertures centered on the computed centroid with physical radii of 2, 5.65 and 6 mm 4 .The second radius matches the physical size of the NIST photodiode exactly and is very significantly larger than the slit image FWHM of 2.9 mm.This aperture minimizes systematic errors from small amount of light diffused at large angles (see the discussion in Sect.4.5).The flux in an aperture of radius r is denoted Φ r .We also gather the sum of all pixels in the image T and compute a background light proxy as B = T − Φ 6 .
The large apertures, required to match the physical size of the reference sensor, makes the photometry very sensitive to background contamination.We use the "dark" exposures to refine our background estimate as follows.For a given aperture ,r, we adjust a linear relation between B and Φ r on all the dark images in the sequence as Φ r = αB + β.An illustration of the fit is given in Fig. 8.We then use the best-fit parameters to build the background-corrected aperture flux as Φr = Φ r − αB − β.In doing so, the uncertainties on background subtraction are typically on the order of 70 kADU for the full 5.65 mm aperture and drops down to 15 kADU for the smaller 2 mm aperture, where it compares more favorably to the photon noise (see the results section for a more detailed discussion of the various noise contributions).

Monitoring sensor
Lastly, we used an IMX174 sensor to monitor changes in illumination while switching between target and reference sensor.When the camera faces the beam, the exposures of the monitoring sensor are synchronized with the camera exposures and the resulting photometry delivers the quantity M c in Eq. ( 4).With the photodiode in the beam, the monitoring sensor continuously acquires 2 s exposures.Images corresponding to opening and closing of the shutter are discarded, and the photometry of the three fully exposed images is averaged to deliver the quantity M N in Eq. ( 4).In both cases, photometry is performed similarly to what was described for the target sensor; we gather photometry in several apertures centered on the centroid of the slit images and build a background estimator from dark images.The only difference is that we are not constrained to matching the aperture size of the NIST photodiode for this monitoring measurement, the only important point being that M N and M c are derived from the same aperture so that the ratio of the two corresponds to the variation of the illumination.Therefore, we selected a much smaller aperture (1 mm) closer to the extent of the image to optimize the signal-to-noise ratio.

Sensor cosmetics and inter-pixel capacitance
The sensor cosmetics is studied on a stack of 3000 dark frames with 10 s exposures with a stabilized sensor temperature of 0 °C.The fractions of pixels displaying a level of dark current in excess of 1, 10, 100, and 1000 e − s −1 are respectively 5.00 × 10 −4 , 3.69 × 10 −5 , 4.11 × 10 −6 , and 2.08 × 10 −7 .
We can take advantage of the hot pixels to get a handle on the pattern of the inter-pixel capacitance (IPC, Moore et al. 2004;Finger et al. 2006), which is found to be significant only between two adjacent pixels in a column, with a correlation coefficient of 3.69 ± 0.01 × 10 −3 .The effect is not symmetric.Pixels with even line numbers are only coupled to the pixel in the following line (respectively, odd-parity pixels are coupled to the preceding pixels).The effect on photon statistics is thus half what the correlation constant would suggest.The measured 3 × 3 correlation pattern between neighboring pixels as seen by pixels with even-parity coordinates is presented in Fig. 9.

Stability of the photometric response
The stability of the camera response in time and temperature is obtained from the stability dataset described in Sect.3.1.A small trend with sensor temperature is detectable in the camera response.Figure 10 displays the measured response at four different stabilized temperatures between −5 and 10 °C.The measurements are well described by a simple linear trend, with a relative slope of 0.027% °C−1 .The gain estimate derived from the same dataset is in contrast extremely stable and compatible with no gain change at the 2 × 10 −4 level over the 15 °C spanned by the test data.This result suggests that the observed temperature trend is dominated by a change in the sensor quantum efficiency.The test, conceived as a check of the gain stability, was performed at a single wavelength.It would be interesting to repeat the same measurement at a different wavelength, especially in the infrared, where the strongest temperature dependence is expected.
The result shows that sub-millimagnitude photometric stability is easily achievable even without stringent control of the environment, in particular when operating at a steady frame rate.Here, the RMS of the camera average in the second batches of stability exposures (that is, the batch starting after 100 stabilization exposures) is 3.26 × 10 −5 .This is even lower than the RMS of the monitoring photodiode readings (∼10 −4 see Fig. 5), so we cannot distinguish between fluctuations in the camera response or in the illumination.The quoted number can be used as an upper bound on the camera response fluctuation in stable conditions.

Study of the readout electronics
The shape of the relation between the variance and the expectation of the pixel readout across a large range of illumination A119, page 8 of 18 levels, usually called the photon transfer curve, is a welldocumented tool to study the gain of the readout chain of CCD sensors.It was recently extended to the study of the dynamical electrostatic effect, coined as the brighter-fatter effect, affecting thick, deep-depleted CCD sensors (see Astier et al. 2019 for the modification of the PTC shape and reference therein for a description of the effect).
As a large number of pixels share the exact same readout chain, CCD studies typically rely on statistics built from a large number of pixels and a small number of images5 .Different levels of spatial averaging are relevant in the study of CIS: averaging over the entire focal plane gives the best handle over features shared by the entire ADC array, such as reference voltages; averaging over individual columns is expected to show defects specific to a single ADC pipeline.Measuring the gain for individual pixels through this technique is however challenging and requires very large statistics.Around 30 000 images are required to reach percentage-point accuracy on the individual pixel gain.We present limited but promising efforts in this direction in the next section.As no information is available to us regarding the structure of the sensor readout chain we only assume that it follows a standard column parallel structure.
We propose various models for the statistics M p and V 0 p and m i in Appendix A. In the case where perfect linearity of the pixel response to illumination can be assumed, and where the photoconversion is a perfect Poisson process (independent conversion events with constant probability), the readout chain is described by three parameters: its gain (G) in e − /ADU, the readout noise (σ), and the readout bias (b), which can be inferred from fitting relation (Eq.(A.9)) to the empirical statistics.Deviations from the linearity or Poissonian hypothesis will appear as deviations from the straight line in the plot.Disentangling between the different effects requires additional data, either direct linearity measurements or extensive study of the correlation between neighboring pixels.
A simple way to mitigate the effect of correlations between neighboring pixels (either from IPC or brighter-fatter) is to compute statistics on pixels made artificially bigger by binning.We first present the study of the average transfer curve of the sensor, obtained by binning pixel counts in 6 × 6 square superpixels, on the first third of the digital scale, which widely encompasses the dynamic range over which the quantum-efficiency measurements was performed.We then present the measurements of the PTC on non-binned statistics over the full scale, a direct measurement of the integral linearity, and a measurement of nearest neighbors covariances.Those three measurements combined hint toward a consistent picture with good integral linearity and measurable deviation from the Poisson hypothesis.At this stage, however, we cannot provide a complete model satisfactorily describing this dataset.Therefore, we rely on the measurement in binned pixels to infer our baseline value for the camera gain.Finally, we present our measurements of the readout gain uniformity.

Photon transfer curve in 6 × 6 superpixels
For each M p and V p in datasets 1-3, we compute the mean variance V of all the N µ superpixels sharing the same M p = µ value as follows: where the ∆ term corrects the statistics for the effect of small illumination fluctuations during the measurement (see Appendix A.2 for details).This procedure allows us to recover the full resolution of the PTC shape, while a simple averaging over the focal plane would smear the features due to nonuniformity in the illumination level.The result is presented in Fig. 11.
All three datasets are presented on the same plot and present a rather consistent picture.The most striking feature is a periodic differential nonlinearity (DNL), which is very consistent in all datasets, likely corresponding to a fixed bit width error.As striking as it appears on the PTC, the DNL likely has few practical consequences.However, it complicates the analysis of the PTC as statistics obtained with slightly different illumination are no longer directly comparable, and a simple model for the shape does not describe the data.
When independently fitting a simple linear relation to the three datasets, we recover an average gain value of 1.2740 ± 0.0016 e − ADU −1 , where the quoted uncertainty is the RMS among the three datasets.The middle panel in Fig. 11 shows the residuals to this simple model for the three datasets.Fitting instead for the approximate shape of the PTC in presence of BF proposed in Astier et al. (2019, Eq. ( 16)) gives a significantly different average value of 1.2636 ± 0.0023 e − ADU −1 and a 4-σ difference with a moderate preference for a nonzero curvature of the binned PTC: a 00 = −5 ± 2 × 10 −7 .The recovered values for the curvature are not very consistent between the three datasets, and they change when the top of the scale is cut differently (e.g., at 15 kADU), as can be expected from the poor quality of the fit.
A rough modeling of the DNL can be attempted to improve the fit consistency and enable comparison between statistics at different focal plane locations more easily, alleviating the need for matched illumination levels.In Appendix A.3, we present the tools to numerically compute the relevant statistics with arbitrary digital boundaries.We model the DNL as a sine function with a period of 512 bits; that is to say, we assume that the digital scale admits boundaries between codes 512 (n − ϕ DNL ) .Not knowing the details of the architecture of the ADC array, little more than that can be A119, page 9 of 18 A&A 670, A119 (2023) Fig. 11.Photon transfer curve for the IMX411 sensor obtained from statistics build from 6 × 6 binned superpixels over the first third of the scale.Top: the three colors correspond to three independent datasets obtained with different illuminations, exposure times, and numbers of images (see Table 3).The black solid line presents the linear relation from Eq. (A.9) using the best-fit gain value.Middle: residuals to the linear model.The dashed black lines denote 1% variations of the integral gain.Bottom: residuals to the full model, including the effect of a 512-bit periodical DNL with 0.65 ADU of amplitude, and provision for curvature of the PTC due to residual influence of a brighter-fatter effect in the binned superpixels.
done, but this ad hoc model may be a good enough description of the reality for the sake of the present study.Fitting for A DNL and ϕ DNL along with the readout gain and noise delivers a fairly comprehensive description of datasets 1 and 3, although distinctive features remain visible in the residuals as shown in the bottom panel of Fig. 11.The best-fit values are A DNL = 0.651 ± 0.006ADU and ϕ DNL = 44.4± 0.2.The corresponding reduced chi-squared for the two datasets are 1.41 and 1.22.Interestingly, dataset 2, which is obtained from a much higher number of consecutive frames (see Table 3) is at odds with the other two datasets with a much less satisfactory fit quality of χ 2 /d.o.f.= 2.39.The reasons behind this disagreement are for the moment not understood.However, the problem is unlikely to affect the gain determination as all three datasets now deliver a very consistent gain value G = 1.273 ± 0.0008e − ADU −1 , where again the quoted uncertainty is the RMS of the three fits.The improved modeling, however, does not entirely solve the inconsistency between the datasets when allowing for curvature.The mean and RMS of the recovered curvature now settles at a 00 = −1.6 ± 1.3 × 10 −7 , compatible with zero and resulting in a slightly lower mean gain value of G = 1.269 ± 0.0035e − ADU −1 .We adopted this last value and its uncertainty as our baseline determination of the readout gain.Due to the binning, this value is not affected by the IPC.

Distinguishing between nonlinearity and other effects in the full-scale PTC
The PTC measured directly on individual pixels and over the full scale is presented in Fig. 12.The model including both brighter-fatter and 512-bit periodic nonlinearity is a rather poor fit of the data, mainly because other differential features show up beyond the 512-bit periodic feature.The best-fit parameters are, however, fairly compatible with those previously obtained: G = 1.273 ± 0.007 e − ADU −1 , A DNL = 0.665 ± 0.016 ADU, and ϕ DNL = 49.2 ± 1.7.The uncertainty on the PTC curvature remains high: a 00 = −4.0 ± 1.9 × 10 −7 , because of the remaining features.We did not attempt to build a comprehensive model of the remaining features at this stage, but we believe that the general shape of the PTC points toward significant non-null curvature.Correcting the recovered gain value from the expected influence of the IPC measured on hot pixels (see Sect. 4.1), brings the gain estimate on the full statistics even closer to the value recovered in binned statistics.
Both classical nonlinearity and the brighter-fatter effect could cause the observed curvature of the PTC.Behavior similar to the brighter-fatter effect has previously been reported in CMOS (Greffe et al. 2022) and CMOS NIR detectors (Plazas et al. 2018) Fig. 12. Photon transfer curve for the IMX411 sensor obtained from statistics build from individual physical pixels over the full scale.Top: the three colors correspond to three independent datasets obtained with different illumination levels and exposure times (see Table 3).The black solid line presents the linear relation from Eq. (A.9) using the best-fit gain value.Middle: residuals to the linear model.The dashed black lines show 1% variations of the integral gain.Bottom: residuals to the full model, including the effect of a 512-bit periodical DNL with 0.65 ADU of amplitude, and provision for curvature of the PTC due to influence of brighter-fatter-like effect in the physical pixels.
of the depletion region (Plazas et al. 2017) -paralleling the mechanism observed in CCDs.Disentangling between classical nonlinearity or the brighter-fatter effect necessitates either direct linearity measurements or detection of the characteristic correlation between neighboring pixels introduced by the electrostatic influence.We present both measurements in Figs. 13 and 14.
The dataset specifically taken to study correlations (A in Table 3) provides moderate evidence for the presence of positive correlations between the nearest neighbors in a line (C 01 ) and in a column (C 10 ), in excess of what would be expected from measured IPC alone.The resulting curves are shown in Fig. 13.The shape does not readily comply with expectations for BF-induced correlations, and at this stage we do not have a compelling model for the measured shape.The most obvious caveat in the dataset is the need to subtract for the contribution of quite large illumination fluctuations (∆ ∼ 10 −3 ).This is done using the RMS of the mean of all images in the sequence to build an estimate of the illumination fluctuation (see Appendix A.2). We note that the proposed technique would allow investigation of the correlations at larger distances C i j ; however, adequate hardware to perform the required comparatively heavy computations in real time would be required.The flat-field illumination source was built with the idea of providing a direct linearity measurement, but at this stage the linearity of the illumination has not yet been calibrated.The gathered dataset still allows limited investigation of the linearity by interlacing variations of the illumination (through variation of the LED intensity), with variations of the exposure time, and solving for illumination level at a given LED intensity.Doing so gives the linearity check presented in Fig. 14, which does not detect significant integral nonlinearities in the chain up to 55 000 ADU (70 ke − ).We therefore conclude that the curvature observed in the PTC is not attributable to classical nonlinearity and is at least qualitatively compatible with expectations of a small brighter-fatter effect.

Uniformity of the readout gain
Determination of the individual pixel gains using this technique requires a very large number of images, but is in principle achievable.We made an attempt to test the concept with the high-statistics datasets denoted (B) and (C) in Table 3.The result is presented in the left panel of Fig. 15.The main visible feature in the recovered gain map is the imprint of the differential nonlinearity due to the smooth change in illumination level in the flat-field images.Simply correcting the measured variance using the ad hoc model from Fig. 11 gives the per-pixel meanover-variance map presented in the right-panel.Residual features in the corrected map show that the applied correction is not fully adequate across the focal plane, but it at least qualitatively describes the phenomenon.The corrected map has an RMS of 1.7%, dominated by statistical noise.Binning the map by 120 × 120 pixels to suppress the statistical noise brings the RMS down to 1.0%, with residual artifacts from the DNL dominating the spatial features.Therefore, we conclude that we can set an upper bound of 1%RMS on spatial variations of the effective gain value across the focal plane.In addition, the average gain in the region illuminated in the quantum-efficiency measurement differs from the focal-plane average by 0.0008 e − ADU −1 .Therefore, we stick to the baseline gain value to interpret our quantum-efficiency measurement.

Uniformity of the camera response
The camera response (i.e., the product of the quantum efficiency and the readout gain) has been mapped at the 16 wavelengths delivered by the flat-field light source.Given that the spatial extent of the IMX411 sensor is larger than the area over which the flat-field beam can be considered uniform, we resorted to iteratively solving for the intensity of the illumination pattern and the camera response map.The 25 images in a single wavelength dataset are thus interpreted as the product of the illumination pattern B(x) at this wavelength with the camera response map M(x) for 25 known central positions x i of the camera.Image i therefore follows the following equation: The illumination pattern over the full area is developed on a cardinal bicubic B-spline basis with 10 × 10 nodes.A first estimate of the illumination pattern B 0 is obtained assuming a uniform camera response M 0 = 1, and this B 0 is used to solve for a first map of the camera response M 1 .The procedure is iterated once and quickly converges.The model is an accurate description of the dataset except for projective features such as defects on the camera window.
The resulting camera response maps are shown in Fig. 16.All demonstrate uniformity in the response at the level of 4% peakto-peak (<1% RMS), with the position selected for the QE measurement generally being slightly below the average.The pattern shows some evolution with wavelength; however, the evolution appears slow enough to be corrected by filter-dependent flat-fielding, with an accuracy dependent on the chosen photometric system.

Quantum-efficiency curve
Our quantum-efficiency measurement is obtained from Eq. (4) using the baseline gain estimate of 1.269 e − ADU −1 obtained in Sect.4.3.1.The measurement was performed three times and then the optical configuration was varied to look for systematic errors.The three independent measurements are presented in Fig. 17.
The largest systematic uncertainties affecting the overall scale of the curve are i) the uncertainties in the gain determination and ii) the 1% calibration uncertainty of the picoammeter reading the photocurrent of the NIST reference sensor. 6Those uncertainties affect the overall scale of the curve, which therefore peaks at 0.84 ± 0.01 e − ADU −1 for the measured position.The quantum efficiency stays above 80% in the range 440 nm to 570 nm, above 50% in the range from 387 nm to 650 nm and above 20% all the way up to 790 nm.
The breakout of the noise contribution from the various components in Eq. (4) to the statistical uncertainty is presented in the bottom panel of Fig. 17, along with the RMS of three measurements.The quadratic sum of the contributions lines up nicely with the observed RMS of the measurements.The uncertainty is dominated by background noise in the IMX411 sensor because a very large aperture, matching the spatial extent of the NIST sensor exactly, is used in order to minimize aperture correction systematics.The statistical uncertainty on a single measurement with nm resolution is smaller than 0.001 e − /γ over the majority of the range and smaller 0.01 e − /γ everywhere.
Systematic uncertainties associated with the NIST calibration of the reference photodiode are subdominant at this stage.We looked at two other potential sources for systematic chromatic uncertainties.First, a wavelength-dependent error would arise if the sensitive area of the reference sensor and of the photometric aperture in the camera were not perfectly matched, because the position and the shape of the projected slit image changes slightly with wavelength.We tested this possibility by stopping down the physical aperture in front of the sensor by a factor of 2 in radius.This reduces the spatial extent of the illuminated area on the sensor in all directions and makes the measurement less sensitive to alignment issues.The difference between the baseline and the stopped down measurement is presented on the left panel in Fig. 17.Quantum-efficiency curve of the QHY411M camera (window included).Top: three independent measurements at the same location are superimposed.The solid black line corresponds to a smooth B-spline fit to the data.Middle: residuals to the B-spline fit.The three colors correspond to the three independent measurements.Bottom: breakout of the noise in the quantum-efficiency measurement according to the origin of the contribution from the four different terms in Eq. ( 4).The plot also presents the quadratic sum of the four contributions (solid black line) and the RMS of the three independent measurements recalled from the middle panel (dotted black line).
Finally, we looked at the potential impact of the nonlinearities on the QE measurement.Nonlinearities may affect the measurement because the illumination level is not constant across all wavelengths.We tested this by doubling the camera exposure time and redoing the same measurement.The difference between the two measurements is presented in the right panel of Fig. 18.The observed difference reaches 0.005 e − /γ peak-to-peak, dominating the chromatic uncertainty budget.In principle, the effect of nonlinearities could be corrected, but this correction was not attempted at this stage.A specific measurement of the sensor integral linearity and exposure time involving a precisely pulsed light source is planned for the upgrade.

Conclusion
The StarDICE sensor calibration bench presented above delivers quantum-efficiency measurements with statistical uncertainties below 10 −3 e − /γ in the range 387 nm to 950 nm (<10 −2 e − /γ in the range from 375 nm to 1078 nm) and low systematic uncertainties (∼10 −3 e − /γ) in quantum-efficiency ratios between different wavelengths, which is the metric relevant for the measurement of type-Ia supernovae colors.The measurement is sensitive to sensor nonlinearities due to the change in illumination intensity as a function of wavelength in the monochromatic beam.A dedicated measurement of the sensor linearity is therefore required to reach the requirements of StarDICE and the addition of a tunable pulsed light source to the bench is planned for this purpose.The systematic uncertainty on the gray scale is currently dominated by the calibration uncertainty of the picoammeter providing the photocurrent of the reference photodiode.
The sensor calibration bench has been used for the calibration of the quantum efficiency of a QHY411M camera hosting the 150 mpixels IMX411ALR sensor from Sony.The camera was found to provide excellent response stability (<0.0033%RMS in stable conditions).Quantum efficiency is above 80% in the range from 440 nm to 570 nm, above 50% on most of the visible range, above 20% all the way up to 790 nm, and close to 10% at 900 nm.This translates to an average quantum efficiency in each of the photometric bands of the ugrizy system of, respectively, 47%, 79%, 59%, 27%, 10%, and 2%.The uniformity of the sensor response is at or below the 1% RMS level for all tested wavelengths.The sensor cosmetics is also excellent.Degradation of the images from electrostatic influence between pixels is expected to be negligible for most use cases given the small measured size of the linear IPC and brighter-fatter effect.This is especially appreciable for a3.76 µm pixel side.The only adverse effect found at this stage is a rather large periodical differential nonlinearity, despite apparently good integral linearity.Such effects, commonly affecting ADCs, can require correction in applications sensitive to differential linearity, typically those measuring fluctuations around a variable level (see, e.g., Planck Collaboration II 2020 for a successful account of a posteriori corrections for ADC nonlinearities in the context of the measurement of CMB fluctuations).A two-parameter periodic alteration of the scale is sufficient to mitigate the effect on the PTC up to ∼20 ke − .More complex alteration would be required to describe the entire scale, which was not attempted.Also, our integral linearity test relies on the accuracy of the rolling shutter timing, which we did not control.We suggest that applications sensitive to linearity should incorporate adequate mapping of the scale.We conclude that the sensor is very well suited for many astronomical applications in the visible range.For the GRANDMA experiment, this applies to the search of the optical counterparts of transitory events with poor localization from gravitational wave or neutrino detections that can be conducted in bands g and r.The characterization of transients through their strong color evolution will, however, require separated follow-up.Given the relatively low QE of the camera (27%) in the i band), longer exposures would be required to reach adequate sensitivity for hunting kilonovae, necessitating a trade-off with survey size.Additionally, we note that the relatively high QE in the u band (47%) could be useful to capture the earliest light of kilonovae in a particularly important band for enabling model selection, provided that the source is detected early enough.
The goal of the StarDICE experiment to control calibration systematic uncertainties from 375 nm to 1050 nm at the 0.1% level appears difficult to achieve in this first measurement due to the low quantum efficiency beyond 900 nm.The StarDICE calibration sensor bench is therefore being upgraded to increase the flux of the input light source in the infrared and to calibrate a 40 µm deep-depleted CCD sensor that is more sensitive at these wavelengths.Other ongoing improvements include custom dust-protective housing for the reference sensor, heat and temperature management, upgrade of opto-mechanical parts to ease the alignment of the optics, improved calibration of the reference sensor photocurrent reading, and dedicated measurement of the sensor linearity.

Fig. 4 .
Fig. 4. Normalized spectra of the 16 channels of the flat-field light source on a log scale.The UV LED displays some amount of fluorescent light; otherwise, the wavelength range of each channel is quite narrow with respect to the typical width of broadband filters.

Fig. 5 .
Fig. 5. Stability of the flat-field illumination source.Top: evolution of the photocurrent delivered by the monitoring photodiode in the flatfield light source (IS2) with respect to its average over the entire period.Bottom: relative RMS of the photocurrent computed over 500 successive samples.

4Fig. 8 .
Fig. 8. Background subtraction in the camera images.Top: instrumental flux in 5.65 mm apertures measured in dark images as a function of the background light proxy build from the never exposed region of the sensor.The best-fit linear relation (solid black line) gives an estimate of background contribution in exposed images.Bottom: residuals to the best-fit relation.The measured RMS gives the level of the background subtraction noise which is the dominant noise contribution in our QE measurement.

Fig. 9 .
Fig.9.Coupling coefficients between neighboring pixels attributed to interpixel capacitance (IPC).The asymmetry of the effect forces us to distinguish between even and odd column and line numbers.In the figure, the pattern is presented as it would appear for a central pixel with even line and column coordinates.A pixel with odd line coordinates is instead mostly correlated with the pixel immediately above.

Fig. 10 .
Fig. 10.Relative change in camera response (black circle) and gain (blue squares) with sensor temperature.The black line is a linear fit to the change in camera response.
, with a proposed physical mechanism -shrinkage A119, page 10 of 18 M. Betoule et al.: StarDICE.I. Sensor calibration bench and absolute photometric calibration of a Sony IMX411 sensor

Fig. 13 .Fig. 14 .
Fig. 13.Measured covariances between the nearest neighbors in a line (C 01 ) and in a column (C 10 ) as a function of pixel readout.The dashed lines show the corresponding expected covariances from the measured IPC.The black dashed line shows the first-order shape of the correlation expected for an amplitude of the correlation factor corresponding to −a 00 2 , with a 00 being the best-fit value from the PTC.

Fig. 15 .
Fig. 15.Map of the individual pixel gain obtained from 12 000 images at four different illumination levels.Left: raw gain map.The circular features arise from a periodic differential nonlinearity imprinted on the gain estimate due to a center-to-edge gradient in the illumination pattern.Right: gain map after correction of the measured empirical variance for the prediction of the nonlinearity model adjusted in Fig. 11.

Fig. 16 .
Fig. 16.Map of transmission uniformity for the 16 different flat-field channels, corresponding to different illumination wavelength from UV to near-infrared.The scale of the map is relative to the average response over the entire sensor area.The black contour gives the region in which the high-resolution quantum-efficiency curve measurement was performed.
Fig. 18.It suggests that the aperture mismatch systematics are controlled at the 0.001 e − /γ level.

Fig. 18 .
Fig.18.Investigations for systematic errors in the QE measurement.Left: difference between baseline and stopped down measurement of the IMX411 quantum-efficiency curve.Right: difference between two quantum-efficiency measurements using 4s and 2s exposures.

Table 1 .
Diameter and focal length of the OAP mirrors.

Table 2 .
Central wavelength and spectral width in the flat-field light source.

Table 3 .
Parameter range for the photon transfer curve and linearity Datasets.