The Astro-H (Hitomi)1 Soft X-ray Spectrometer (SXS)2 was designed for high-resolution broadband imaging x-ray spectroscopy with a high effective area. The detector34.5.–6 and its cryogenic systems126.96.36.199.12.–13 are described in a number of papers, as is the Soft X-ray Telescope (SXT),14,15 the on-orbit operation of the instrument,16 and the data analysis software and archive.17 The detector18 and telescope19188.8.131.52.24.–25 were the subjects of intensive ground calibration campaigns at the subsystem and instrument level. The ground calibration measurements were used to predict the on-orbit performance of the instrument, including the effective area and energy resolution. The ground calibration measurements also were used to derive a calibration for assignment of photon energies to events based on the measured pulse heights and to assign absolute photon arrival times based on the nominal trigger times.
The instrument calibration must be validated and corrected based on on-orbit calibration measurements. The suite of onboard diagnostics included an “always-on” collimated radioactive source illuminating a calibration pixel installed outside the telescope field of view, an source installed on a rotatable filter wheel above the SXS dewar aperture, and an electron impact x-ray generator with a photocathode activated by a fast UV photodiode [modulated x-ray source (MXS)]. An extensive set of observations of celestial sources was planned to complement diagnostics from these onboard sources. Because of the unfortunate loss of the observatory during the commissioning phase, only a subset of the planned calibration experiments was carried out. Nevertheless, the instrument performance was largely verified by the data that were collected.
The observations performed in the commissioning phase were not initially planned to be fully calibrated, but because of the loss of the observatory, there is a mandate to extract the maximum possible science return from them. The most important additional requirement introduced by this mandate is a procedure for calibrating the pulse height–energy relation early in the mission, when we performed SXS science observations before fiducial calibration sources were available and while the SXS dewar was still coming into thermal equilibrium. The transmission of the SXS dewar gate valve also had to be calibrated more precisely than the available preflight data allowed.
In this article, we give an overview of four key aspects of the instrument calibration and the on-orbit measurements and observations that were used to validate and/or measure these parameters. In Sec. 2, we describe measurements of the instrument resolution and line spread function (LSF). In Sec. 3, we describe the procedure used to establish the pulse height–energy relation. In Sec. 4, we use observations of bright continuum sources to measure the thickness of Be in the SXS dewar gate valve. In Sec. 5, we use observations of the Crab nebula to make corrections to the absolute and relative event timing.
The on-orbit calibration of the SXS is described in greater depth in a series of forthcoming articles: the LSF is discussed by Leutenegger et al.,26 the pulse height–energy relation is discussed by Eckart et al.,27 the effective area is discussed by Tsujimoto et al.,28 and timing is discussed by Koyama et al.29 Two further articles will describe topics not covered here: the SXS background is discussed by Kilbourne et al.30 and the SXT point spread function is discussed by Maeda et al.31
Line Spread Function
The LSF was characterized on the ground using monochromatic x-ray sources augmented by x-ray fluorescence sources emitting characteristic lines. The LSF is well-characterized by a dominant component consisting of a Gaussian with energy-dependent resolution (the “core” LSF), as well as three weak components (the “extended” LSF): an exponential tail with -folding of , an electron loss continuum, and escape peaks.18 The extended LSF components are not expected to vary on orbit, so we do not discuss them further in this article.
The core LSF energy resolution is a function of event grade and pixel. The event grades are described in detail by Ishisaki et al.6 To summarize briefly, events are processed differently depending on their relative arrival time, and the method of processing affects the energy resolution for that event type. The classifications are hi-res, mid-res, and low-res; and within mid-res and low-res, events can be primary (first in a series) or secondary (subsequent events in a series). Both preflight and in-flight calibrations of the core LSF are only valid for primary event grades, and a modeling of secondary event grades will be discussed in a future article.
The so-called “baseline” events are a key tool in modeling the energy resolution of the SXS. These events measure the effects of baseline noise in the detector system by processing strings of samples from the outputs in the absence of an x-ray absorption event using the optimally filtered template pulse as if a real event had been triggered. They provide the resolution at zero pulse height, as well as an estimate of all fast time scale noise contributions to the energy resolution.
We have found that the energy resolution for a given event grade and pixel can be modeled as an excess broadening term added in quadrature with the baseline energy resolution
In Fig. 1(a), we show the energy resolution for a single pixel measured at several energies in instrument-level testing together with the best-fit model for resolution as a function of energy (discussed in more detail by Eckart et al.18) and compare this with on-orbit measurements using the source installed in the filter wheel. In Fig. 1(b), we show the baseline resolution, hi-res event resolution at 5.9 keV, and excess broadening at 5.9 keV for all pixels, comparing measurements from instrument-level testing on the ground with on-orbit measurements using the source installed in the filter wheel.
We find that the baseline resolution is slightly higher on-orbit compared with instrument-level testing, by an average of 0.38 eV. At least some degradation in resolution is expected, given the noise tones observed due to spacecraft interference from the attitude and orbit control system (AOCS) (see Fig. 7 in the work by Porter et al.3). Noise tones generated by operation of the AOCS in ground testing also led to a degradation in baseline resolution.18 Note that although the effect of noise tones can be mitigated by creating a new optimal filter based on the changed noise environment, this method cannot be used here because the noise is nonstationary in frequency space; furthermore, changing the optimal filter would invalidate the ground calibration of the gain scale, which would then have to be recalibrated on orbit. In addition to the spacecraft interference, the baseline resolution is further degraded by the presence of untriggered thermal crosstalk events resulting from so-called frame events, where minimum ionizing particles deposit energy into the silicon frame. This effect results in a slight degradation in resolution, with the magnitude of the effect dependent on the cutoff rigidity, which is a proxy for the rate of minimum ionizing particles.26
We also find that although the baseline resolution is slightly degraded by operation of the AOCS and the untriggered thermal crosstalk from frame events, the excess broadening at averaged over all main array pixels is statistically consistent with that observed in instrument-level testing. We thus conclude that within the statistical sensitivity of our measurements, the on-orbit resolution can be modeled by modifying the preflight model of resolution using the on-orbit baseline resolution [shown as the red curve in Fig. 1(a)]. The one exception to this is the calibration pixel, which has accumulated many more counts than the main array. The calibration pixel shows both increased baseline noise and a small but statistically significant () increase in the excess broadening from to at . The mechanisms responsible for the increase in excess broadening in the calibration pixel will be investigated in a future article.26
We intended to operate the pulse shape processor (PSP) in “forced mid-res mode,” where all hi-res and mid-res triggered events would be processed with a short pulse template appropriate for mid-res events, allowing calibration of mid-res event gain and energy resolution without having to choose a source count rate that would generate a high fraction of true mid-res events, and also giving a measurement of the mid-res baseline energy resolution. Since this experiment was not completed before the end of mission operations, we instead analyzed the small number of mid-res events accumulated in the exposure. We found that the mean resolution was slightly higher than in instrument-level testing, just as for hi-res events. We assumed that the excess broadening for mid-res events was the same in flight as in instrument-level testing, as we found for hi-res events.
For the sake of brevity, in this paragraph, all discussion implicitly pertains to mid-res events. Since we had no on-orbit measurement of the baseline energy resolution, we subtracted the mean excess broadening (averaged over all pixels) at 5.9 keV that was measured in preflight calibration from the mean energy resolution measured in in-flight at 5.9 keV to obtain an estimate of the mean baseline energy resolution
Finally, we computed a new estimated energy resolution as a function of energy using the preflight excess broadening
The calibration of low-res events can be accomplished using the low-res pulse height of any events, which is telemetered for all event grades. We, thus, analyzed the low-res pulse heights of hi-res events in the filter wheel exposure to measure the energy resolution at . We then used these measurements to scale up the preflight model of the low-res energy resolution, under the assumption that the excess broadening was identical, and that the energy resolution at increased due to a larger baseline resolution.
The SXS pulse height–energy relation (or gain scale) is crucial in allowing accurate absolute measurements of photon energy. Because the SXS sensors are extremely sensitive thermometers, the gain scale is very sensitive to the SXS thermal environment. Differential thermal loads between the SXS pixels and the adiabatic demagnetization refrigerator (ADR) control thermometer can lead to substantial gain drift even while nominally controlling the instrument at a constant temperature, and differential thermal loads among pixels can result in differential relative gain drift among pixels.
An extensive campaign of ground calibration created a solid foundation for understanding and measuring gain scales and gain drift in the SXS.18,32 We found that the gain scale for a given stable set of conditions can be well fit with a fourth-order polynomial. We also found that although the polynomial would be different for a different set of thermal conditions, essentially all variation in gain scale can be treated as a variation in the effective temperature of the sensor, which is biased to operate at above the 50-mK stage control thermometer. By measuring gain curves at discrete temperature setpoints on the ADR control thermometer while maintaining constant thermal boundary conditions at the other thermal interfaces, we are able to map the change in shape of the gain curve as a function of control temperature. (The most important thermal interfaces affecting the detector effective temperature are the He tank and the inner vapor cooled shield. The SXS cryogenic design is discussed by Fujimoto et al.12.) We can then interpolate these gain curves for any control temperature setpoint within the range we sampled. When the thermal boundary conditions change, we can measure the gain offset at a single fiducial energy and use this to find an effective temperature (and accompanying gain curve) corresponding to the equivalent control thermometer temperature for the thermal boundary conditions under which the reference gain curves were measured.
The SXS in-flight calibration plan envisioned a threefold approach to measuring and validating the gain scale and monitoring the gain drift. First, a calibration pixel is located outside the instrument field of view and is illuminated by an always-on collimated radioactive x-ray source. This allows for continuous monitoring of short-term gain drift, where all pixels are expected to drift in parallel. Second, onboard x-ray sources are available to illuminate the whole SXS array with photons of known energy. These sources include an source mounted on the SXS filter wheel, four MXSs, electron impact sources using fast photocathodes modulated with UV photodiodes. Third, a number of celestial sources featuring emission lines with known energies were planned to be observed, allowing validation of the gain scale across the SXS science band.
Because of the untimely loss of the mission, the SXS gain was not fully validated as planned. Furthermore, the observations of the Perseus cluster during the commissioning phase of the instrument needed to be calibrated as well as possible in order to extract the maximum possible science. Because these observations occurred without contemporaneous full-array gain drift monitoring, accurate calibration of the gain scale required the development of techniques beyond what was intended for standard operation of the SXS.
In Fig. 2(a), we show the calibration pixel gain with respect to the fiducial gain measured in ground calibration as a function of time, together with the He tank temperature. The calibration pixel gain qualitatively tracks the overall trends in the He tank temperature. Other SXS dewar internal temperatures (not shown) correlate to a lesser degree with the gain drift on the calibration pixel and other pixels.
The main array pixels show a qualitatively similar drift to the calibration pixel. We thus measure the calibration pixel gain and remove this “common-mode” gain drift from the whole array as a linear stretch correction applied to the ground calibration curves. Minor modifications to this procedure were adopted in the final pipeline processing for observations beginning with Perseus Obs 3. These changes will be motivated and explained by Eckart et al.27 Figure 2(b) shows a scatter plot of event energies in the Fe XXV complex for observations of the Perseus cluster using a single pixel after applying this correction based on the calibration pixel. Significant gain drift is still visible, both between the observations, which are separated by 6 days, and even within the observations, especially the earlier one, when dewar internal interface temperatures are more rapidly drifting. We term the residual gain drift as “differential” gain drift, both because it is differential relative to the calibration pixel but also between the different pixels of the main array.
We can quantify the differential gain drift for a given pixel at any point by fitting an empirical model composed of a series of Gaussian lines to the Fe XXV complex. The number of time slices for which the differential gain drift can be measured is limited by the statistical quality of the data. In Fig. 3(a), we show an example of fitting an empirical model to laboratory data33 acquired with a calorimeter34 at the Lawrence Livermore National Laboratory (LLNL) Electron Beam Ion Trap (EBIT) facility.3536.37.38.–39 This laboratory spectrum is very similar to the Fe K spectrum observed in Perseus. In Fig. 3(b), we show the same type of model being fit to data from a time slice of a single pixel in the observations of Perseus, resulting in a measurement of the differential gain drift for that pixel and time slice.
In Fig. 4(a), we show the results of fitting four time slices from two Perseus observations for three representative pixels in the main array. Both of the observations had the same aimpoint, and during each observation, the pointing was stable to a fraction of a pixel; therefore, each pixel measures a constant spectrum over these two observations, and the data can be used to track differential gain drift. We plot these measurements as a function of the calibration pixel gain drift and fit them with a linear function. This reflects the reasonable but significant assumption that the common mode and differential gain drift have a related origin (although not necessarily a common mechanism). The slope for each pixel gives the magnitude of the differential sensitivity of each pixel. In Fig. 4(b), we show a map of this differential sensitivity. The pattern of sensitivity shows that the gain changes in the innermost pixels are more different from the calibration pixel than the outer ring of pixels.
A difference in behavior between inner and outer pixels arises in differential loading from microwave radiation resonating in the CTS cavity, since the HgTe x-ray absorbers of the inner pixels are most shielded from absorption of microwave radiation due to their proximity to four neighboring absorbers; the edge and corner pixels are less shielded, with two or three neighbors; and the calibration pixel is the least shielded, with one neighbor. (The neighbor of the calibration pixel is an unwired alternate calibration pixel.) The radiative loading elevates the true temperature of the pixels with respect to the heat sink, and this effect is strongest in the earliest part of the mission and decreases as the dewar shields cool. In contrast, the common heat sink for all the pixels is depressed during the early parts of the mission because the conductive heat load from the helium tank onto the CTS control thermometer is greater. Thus, these two effects partially cancel in all pixels as the dewar cools, with the common-mode term dominating. The partial cancellation is strongest in the calibration pixel, and it thus shows the smallest gain variations. When the calibration pixel gain is used to correct the array pixel gains, it is an undercorrection, and the magnitude of the undercorrection is largest for the inner pixels, which experience the smallest radiative load and, thus, have the smallest cancellation of the common-mode term by the radiative term.
The slopes derived from the fits, as shown in Fig. 4(a), show the relative differential gain drift per pixel as a function of calibration pixel gain, but because the mean centroid per pixel of the Perseus spectrum is not known a priori, it cannot provide an absolute energy scale calibration. In other words, this figure allows us to measure the slope but not the intercept of the differential gain drift. The filter wheel was rotated to illuminate the full array with a radioactive source after all of the Perseus observations were completed, around the time that the SXS dewar was coming into thermal equilibrium. This measurement of a line at a known energy allows us to fix the intercept of the linear functions as shown in Fig. 4(a) at the green line, which indicates the calibration pixel gain shift at the time of the filter wheel experiment. Figure 5(a) shows the spectra from each pixel after correcting for the calibration pixel gain drift. Figure 5(b) shows the composite spectrum after applying the offsets derived from this experiment.
In Fig. 6, we show the stacked spectrum from coadding all pixels in two observations of the Perseus cluster. The event energies have been corrected using the method described above. The broadening is slightly larger in the earlier observation than in the later one because of greater uncertainty in the extrapolation of the differential gain drift correction from the fiducial experiment, which occurred only a short time after the later observation.
In summary, our gain correction method implements the following steps: (1) we apply the ground gain calibration curves to every pixel to obtain a preliminary energy from the telemetered pulse heights, (2) we apply a time-dependent linear gain correction to all pixels (the “common-mode” correction), and (3) we apply a second time-dependent linear gain correction to each pixel, this time on a per-pixel basis (the “differential-mode” correction), based on the apparent relative drift rates of the fiducial lines on the array and on the cal pixel.
We note that the differential gain drift method described here is essentially the same as that used to measure the bulk velocity field in the Perseus cluster, as reported by the Hitomi collaboration (2016).40 The turbulent velocity measurements reported in that article were performed using a different method, in which the drift for each pixel was corrected relative only to itself, destroying the bulk velocity information, but preserving the broadening.
In pipeline processing, this differential gain drift method was implemented in the task sxsperseus, which was used for data from observations 1 and 2. Observations 3 and 4 were much closer to thermal equilibrium, and thus, the standard gain scale software task was used.
The instrument effective area contains contributions from the telescope (including thermal shield transmission), as well as the detector quantum efficiency and fill factor, and the dewar internal blocking filters. These quantities were all calibrated on the ground and are discussed in detail in several papers.18,19,2122.23.24.–25,41
The dewar gate valve was kept closed to protect the dewar internal filters from potential contamination by outgassing material. This gate valve consists of a nominally -thick beryllium window covered by a 0.2-mm-thick stainless steel mesh with 71% open fraction.
Because of the early termination of mission operations, all of the observations were conducted before the gate valve was opened. Furthermore, because the early observations were intended to evaluate the instrument and observatory capabilities and not to perform key science observations, the transmission of the gate valve was not directly measured preflight but was instead estimated using the midpoint of the manufacturer’s quoted measurement of the beryllium thickness of 274 to .42
To extract the maximum science output from the limited number of observations performed, a better estimate of the beryllium window thickness is required. We estimated the window thickness in two ways: first, we fit the observed spectra of two bright astrophysical continuum sources, and second, we measured the x-ray transmission of the flight spare gate-valve beryllium window using a beamline at the SPring8 synchrotron. The flight spare window is expected to be identical to the flight window.
We jointly modeled the observed spectra of two bright continuum sources, the Crab nebula and G21.5-0.9, using a model with beryllium thickness as a free parameter. The derived thickness is (statistical). We estimate that a systematic error may be introduced by our ad-hoc method of modeling the electron loss continuum in this procedure.
The SPring8 measurements are reported by Hoshino et al.43 The best-fit beryllium thickness is . Edges due to contaminants are detected and included in the model; these include Ni, Fe, Cu, Mn, and Cr. Other weak edge-like features were also detected, which are interpreted as being due to Bragg diffraction from polycrystalline beryllium.
The synchrotron measurements of the spare beryllium window are much more precise than the modeling of the window thickness from spectra of astrophysical calibration targets. Therefore, we adopt a model based on the synchrotron measurements in our final calibration data file for the gate-valve beryllium window.18 The good agreement of the astrophysical calibration spectral fits with the laboratory measurements validates the model to the extent possible.
Because of the presence of the SXS gate valve in all observations, the validity of the gate-valve-open effective area model at low energies could not be tested in flight. The verification of the high-energy effective area model will be discussed in a future article.
For each event, the photon arrival time is calculated from the event trigger time or pulse derivative maximum time. However, the method of determination of the trigger and derivative maximum times is different among event grades, leading to systematic differences that must be calibrated.
The preflight modeling of the pulse timing assumed a time assignment of the form6
In preflight modeling,44 and were not expected to be large and were, thus, assumed to be zero for all pixels and event grades. was found to be 1.625 for hi-res and mid-res graded events on pixel 26, and zero for hi-res and mid-res graded events on all other pixels. The difference in for pixel 26 is because it had a higher trigger threshold setting when its pulse template was created due to ringing behavior,6 resulting in a shifted event trigger time. For low-res graded events, was found to be 11.625.
The timing of the events was evaluated using an observation of the Crab nebula. Its pulse profile is shown in Fig. 7(a). The folding parameters (, , and ) were calculated using a simultaneous radio observation.45
Because RISE_TIME and DERIV_MAX are correlated, and the distribution of RISE_TIME relative to DERIV_MAX is narrow, was kept at zero, and and were evaluated as follows.
For each event grade grouping (hi-res, mid-res, and low-res), events were split into three DERIV_MAX bins (100 to 2600, 2600 to 5100, and 5100 to 7600). For each DERIV_MAX bin, a folded light curve was constructed, and the first pulse peak was fit with a Gaussian model to find the timing centroid. The coefficient was evaluated by performing a linear fit to the relative time shift as a function of DERIV_MAX. The coefficient is small for all event grades, and the statistical constraints on it are not strong.
After correcting the event time with the coefficient, corrections to the coefficients were evaluated by fitting the first pulse peak for each event grouping (hi-res, mid-res, and low-res) and comparing the best-fit peak centroid to that predicted using a contemporary high precision radio ephemeris45 together with the 3- to 20-keV x-ray phase lag relative to the radio peak measured by Molkov et al.46 The results of this procedure are shown in Fig. 7(b). The relative event-to-event timing is validated at the level, while the absolute timing accuracy is estimated to be at the level, with contributions from systematic uncertainty in the radio ephemeris by Molkov et al.,46 as well as statistical and systematic uncertainties in the radio ephemeris by Terada et al.45 We note that the relative pixel-to-pixel timing accuracy is independently confirmed with a similar level of precision by studies of coincident events due to cosmic rays in the pixels and anticoincidence detector.
Although the SXS was not able to finish the full planned suite of in-flight calibration measurements, its expected performance was confirmed. The array-averaged energy resolution was better than 5-eV full width at half maximum at 5.9 keV despite moderate electrical interference from the AOCS. The energy scale was reconstructed to high accuracy for early observations of the Perseus cluster performed during the SXS commissioning phase, even in the absence of regular measurements of calibration fiducials, allowing a measurement of the bulk velocity field in Perseus.40 The outstanding performance of the SXS combined with the high quality of the calibration achieved even in a very limited campaign is a strong indication of the promise of x-ray calorimeter spectrometer technology for future x-ray observatories.
We thank all of the US, Japanese, and European SXS team members who contributed to the years of development and testing at GSFC, Niihama, Tsukuba, and Tanegashima, including the postdocs and students who supported the long series of tests at Niihama and Tsukuba. Part of this work was performed under the auspices of the US DoE by LLNL under Contract No. DE-AC52-07NA27344 and also supported by NASA grants to LLNL.