Quantitative photoacoustic tomography (PAT) is concerned with recovering quantitatively accurate estimates of chromophore concentration distributions, or related quantities such as optical coefficients or blood oxygenation, from photoacoustic images.1 The source of contrast in PAT is optical absorption, which is directly related to the tissue constituents. By obtaining PAT images at multiple optical wavelengths, it may be possible to recover chemically specific information about the tissue. However, such a spectroscopic use of PAT images must consider the effect of the spatially and spectrally varying light fluence distribution. As a photoacoustic image is the product of the optical absorption coefficient distribution, which carries information about the tissue constituents, and the optical fluence, which only acts to distort that information, the challenge in quantitative photoacoustic imaging is to remove the effect of the light fluence.
A common approach is to use a model of the unknown fluence and use it to extract the desired optical properties from the measured data. This has been done analytically23.4.–5 or numerically,6,7 often within a minimization framework.18.104.22.168.13.14.–15 The majority of this literature uses the diffusion approximation to the radiative transfer equation (RTE) to model the light distribution, which is accurate in highly scattering media.16 In PAT, the region of interest often lies close to the tissue surface where the diffusion approximation is not accurate. The RTE, on the other hand, is widely considered to be an accurate model of light transport so long as coherent effects are negligible, which is the case here. Finite element discretizations of the RTE have been developed1718.19.20.–21 and proposed for quantitative PAT reconstructions,12,15 but due to the need to discretize in angle as well as space, they quickly become computationally intensive and their applicability is limited to small- and medium-scale problems. An alternative is Monte Carlo (MC) modeling,22,2324.–25 which is a stochastic technique for modeling light transport that converges to the solution to the RTE. The significant advantage of the MC approach is that it is highly parallelizable, thus it scales well to the large-scale inversions that will be encountered in practice.
MC models of light transport are popular in biomedical optics and have predominantly been applied in the planning of the experimental measurements2627.–28 and in dosimetric studies for a range of light-based therapies.2930.–31 Many of the applications are summarized by Zhu et al.32 One early MC model of light transport, MCML,22 computes the fluence in three-dimensional (3-D) slab geometry. This model was later extended to simulate spherical inclusions in the tissue,33 and later to spheroidal and cylindrical34 inclusions. MC modeling in 3-D heterogeneous media has been shown both for voxelized media,23 which was later GPU-accelerated,24 and using a mesh-based geometry.25,35 Although the RTE is an equation for the radiance, which is a function of angle at every point, the quantity usually calculated by MC models is the fluence rate, which is the radiance integrated over all angles. The reasons are practical: most measurable quantities are related to the fluence rate rather than the radiance, storing just the integrated quantity saves on computational memory, and the estimates for the fluence rate will converge sooner than the underlying estimates for the radiance. In photoacoustics, the measurable signal is related to the fluence (the time-integrated fluence rate) so current MC models can be used in the simulation of photoacoustic signals. However, as will be discussed in Sec. 4, the full angle-dependent radiance is required when tackling the inverse problem of estimating the optical coefficients, specifically the optical scattering. A common approach in estimating the optical coefficients is to use a gradient-based inversion scheme in which functional gradients are used to minimize some objective function. This paper demonstrates the computation of these gradients through the use of forward and adjoint MC models of the radiance.
In this paper, Sec. 2 introduces the inverse problem of quantitative PAT. Sections 3 and 4 present forward and adjoint MC models of the radiance employing a harmonic angular basis. In Sec. 5, it is shown that this choice of basis allows the functional gradients for the inverse problem to be calculated straightforwardly. Inversions for absorption and scattering coefficient distributions are given in Sec. 6.
Quantitative Photoacoustic Tomography
The inverse problem in QPAT can be stated as the minimization12 give expressions for these gradients in terms of the forward and adjoint fields, and :
Monte Carlo Modeling of Light Transport
In PAT, the optical and acoustic propagation times are so different that the optical propagation can be considered instantaneous and the time-dependence of the light transport can be neglected. The time-independent RTE is given by36 The approach used here begins with launching a packet of energy, referred to herein as a “photon,” from a given position in an initial direction . After traveling a distance (using the convention ), where is a real uniform random variable on [0, 1], a fraction of the photon’s “weight” is deposited in the current voxel, where is the current weight (or energy) of the photon packet. The photon weight is updated: . Scattering into a new direction in two-dimensional (2-D) involves sampling the scattering phase function, which describes the probability of a photon scattering from direction into direction . The phase function used here was the 2-D Henyey–Greenstein phase function, commonly used in biomedical optics,37,38
By calculating photon paths through the medium, the MC models presented in the literature2223.–24,35 do in fact simulate the radiance, but this is typically integrated over angle upon deposition of the weights in the voxels. In order to simulate the radiance, a method of depositing the weight in the voxels without losing the angular information is required.
Monte Carlo Modeling of the Radiance
In order to compute the radiance using an MC model, angular as well as spatial discretization is required. One approach is to use discrete ordinates, whereby the unit circle is divided equally into sectors and the weight deposited in a voxel is also assigned to the relevant angular sector. The memory required will scale linearly with the number of sectors, and will slow convergence of the radiance estimate, compared with the fluence estimate, by a factor inversely related to the number of sectors. Here, a harmonic angular basis was used because a sufficiently diffuse field is dense in such a basis, meaning the field can be represented using relatively few orders. Less memory will, therefore, be required.
In 2-D, the expansion for the radiance in a Fourier basis is3940) For a given voxel, the weight is deposited into the relevant Fourier coefficients according to 41
Validation of the Forward Model
Analytical solutions to the RTE are available for the fluence for a range of geometries and source types;23,42,43 however, there are few analytical solutions for the radiance, particularly in 2-D. The RMC model was compared to one such analytic solution for an infinite, homogeneous 2-D domain illuminated by an isotropic point source.4445.–46 An isotropic point source was placed at the center of a domain of size , large compared to the transport mean free path in order to approximate an infinite domain. The absorption and scattering coefficients were 0.01 and , respectively, and the Henyey–Greenstein phase function47 was used with set to 0.9. The pixel size was , and five Fourier harmonics were used. Figure 1 shows the good agreement between the analytical and RMC modeled radiance at radial distances of 2 and 3 mm from the source along the horizontal axis.
Adjoint Monte Carlo Model
The adjoint equation to the RTE is given by3.1). The principle difference is that the light sources typically used in PAT are restricted to the boundary, but the adjoint source will not be, as a consequence of the fact that the “data” in QPAT—the photoacoustic images—is volumetric. The internally distributed sources, , were simulated by uniformly distributing the initial positions of each photon over a source pixel, with their direction randomly sampled from uniform angular distribution. The weight of each photon launched from a source pixel was scaled by , where is the position corresponding to the source pixel. This of course can result in photons whose weight is negative. The adjoint RMC model treats a negative weight photon in the same way as one whose weight is positive; the photon’s weight decays to zero and rather than the deposition of energy into the computational grid, thus negative weight photons produce a reduction in the energy in each voxel it traverses.
Validation of the Adjoint Model
The adjoint model was validated by checking that it satisfied the condition
Substituting these into Eq. (13) yields
Simulations were performed using a () domain, and 10 Fourier harmonics. Each source distribution emitted photons. was set to be the center of the domain with moved along the -direction across the domain. Comparisons are shown in Fig. 2 for case 1 with an isotropic source and detector, Fig. 3 for case 2 with an isotropic source and anisotropic detector with , , and Fig. 4 for case 3 with the same , but with istropic and the distributed shown in Fig. 4(a). Good agreement was obtained in all cases, showing the the RMC adjoint model is an accurate representation of the RTE adjoint. It can be seen in the comparison of with that some noise is present in the latter. As the overall number of photons simulated in the forward and adjoint simulations was the same, the use of spatially distributed sources in the adjoint case resulted in lower photon density in the domain, thereby reducing SNR. This issue was exacerbated in the case where an angularly dependent detector was used because a significant fraction of photons went undetected.
Both the radiance and the adjoint radiance can be expressed as Fourier series as in Eq. (8). By substituting these expressions into Eqs. (3) and (4) for the functional gradients, simple and easily computed expressions for the gradients can be obtained. The fluence is simply given by the isotropic component of the field . The other terms in the expressions for the functional gradients contain integrals of products of the radiance and its adjoint. If , , and are the Fourier coefficients of the adjoint radiance, then the gradient with respect to absorption can be written as
The second term in Eq. (4) is37
Inversions for Absorption and Scattering
The forward and inverse MC models of radiance described above were used with a gradient-descent (GD) scheme to estimate and from simulated PAT images by minimizing the error functional in Eq. (2). As the adjoint source, was independent of angle, photons were launched istropically with the launch position being spread out over the range of a source voxel using a randomly distributed number on the interval [0, 1]. The initial photon weight was scaled according to the source strength with normalization of the output quantity (i.e., radiance, absorbed energy density, and harmonic) being . The adjoint source may be negative in some places, so the initial photon weight is negative and weight deposition is also negative. The photon termination condition was, therefore, set to be the absolute value of the photon weight falling below the threshold value. The gradients were calculated using Eqs. (20) and (26). A GD scheme was chosen for the minimization because it is more robust to the MC noise in the functional gradients and error functional than techniques such as L-BFGS that use second-order information. A line-search algorithm presented by Hager and Zhang48 was used for the reconstruction of . A backtracking line search was implemented for the reconstruction of . This is described in Sec. 6.2.
The termination condition used by the optimization was
Inversion for Absorption Coefficient
The domain used in the estimation of the absorption coefficient consisted of a background absorption coefficient of with a rectangular inclusion equal to [shown in Fig. 5(a)], and a background scattering coefficient of with a rectangular inclusion equal to [shown in Fig. 5(b)]. The anisotropy was a homogeneously distributed value of 0.9. The measured data, , were formed using an MC simulation illuminated by a collimated line source on the boundary at and on the adjacent boundary at , consisting of photons. The inversion for the absorption coefficient only was performed under the assumption that the scattering coefficient was known and the starting estimate of the absorption coefficient was a homogeneous value of . The termination condition in Eq. (27) was satisfied after 11 iterations, having taken 4.1 h to run, and is shown in Fig. 5(b) with profiles through the true and reconstructed distributions of shown in Fig. 5(d).
Very good agreement between the true and reconstructed absorption coefficient is observed, with a value of the error function after 11 iterations being . The optimization routine was stopped as the change in the error function on the 12th iteration was below the function tolerance of , indicating convergence. The average error in the estimate of the absorption coefficient , computed as , was 0.2% over the entire domain.
Inversion for the Scattering Coefficient
Inversions for the scattering coefficient were performed using the same domain as above, shown in Figs. 6(a) and 6(b), with the illumination and number of photons in the forward simulation also being the same. Here, it was assumed that the absorption coefficient was known and the starting estimate of the scattering coefficient was equal to the background value of .
Two modifications were necessary to achieve convergence in the optimization for the scattering coefficient. First, a custom GD algorithm was used in which a backtracking line search was implemented. A backtracking line search49 starts with a large candidate step length and progressively reduces the step size while checking for a sufficient decrease in the error functional. The sufficient decrease condition is expressed asFig. 6(c) with profiles through the true and reconstructed distributions of as shown in Fig. 6(d).
It can be seen from Figs. 6(c) and 6(d) that the inversion has partly reconstructed the inclusion in the scattering coefficient. The inability to reconstruct edges of the inclusion in the scattering coefficient is expected, given the diffusive nature of the scattering. However, the discrepancy in in the inclusion, evident from Fig. 6(d), suggests premature termination of the optimization. This is due to the fact that the gradient with respect to scattering is small and prone to noise in the functional gradients. This low SNR in the gradients has the impact that that search directions in the optimization routine are often suboptimal, which results in little or no progress of the optimization. The progressive reduction in SNR in the gradient means that nondescent steps are likely and can therefore trigger the termination condition.
Discussion and Conclusions
In this paper, an MC model of the RTE was presented. The model computes the radiance in a Fourier basis in 2-D and is straightforward to extend to 3-D using a spherical harmonics basis. The accuracy of the model was demonstrated by comparing the angle-resolved radiance at two positions in the domain to corresponding appropriate analytic solutions.
Sections 5 and 6 presented the application of the RMC algorithm to estimating the absorption and scattering coefficients from simulated PAT images. In Sec. 6.1, it was observed that the absorption coefficient was estimated with an average error of 0.2% over the domain relative to the true value, when the scattering coefficient is known, and in the presence of 0.7% average noise in the data. This is encouraging, particularly because noise is not only present in but also in the Fourier harmonics computed using the forward and adjoint RMC simulations, which is propagated to the estimates of the functional gradients. Consequently, the search direction in the GD algorithm will always be suboptimal. Furthermore, noise in , the estimate of the absorbed energy density at the ’th iteration of the line search, will be also be propagated to the error functional, resulting in a nonsmooth search trajectory for the line search because at every point , the error function will be corrupted by some different noise . In practice, this did not preclude reconstruction of the absorption coefficient since the calculated gradients remained descent directions despite the noise. Furthermore, the error functional in is sufficiently convex that the addition of some noise does not prevent the linsearch from yielding sufficiently large a step length to allow rapid convergence.
Reconstruction of the scattering coefficent correctly located the scattering perturbation in the simulated image; however, the peak value in the reconstruction was lower than the true value. This is a direct consequence of the fact that the scattering coefficient is related to the absorbed energy distribution only through the optical fluence distribution. Consequently, the SNR in is typically much less than that for absorption. This causes termination of the algorithm before the peak magnitude of the parameter has been found in the search space.
R. Hochuli was funded by the EPSRC CDT in Medical Imaging, EP/L016478/1. S. Powell was funded by EPSRC Grant No. EP/J021318/1, and RAEng Fellowship RF1516\15\33. S. Arridge and B. Cox are employed and funded by their host institution, University College London. The authors declare they have no conflicts of interest, financial or otherwise, relevant to this manuscript. The authors acknowledge the contribution of Andre Liemert who kindly provided radiance data used in the validation of RMC in Sec. 3.2. The authors acknowledge the use of the UCL Legion High Performance Computing Facility (Legion@UCL), and associated support services, in the completion of this work. The authors also thank Tanja Tarvainen for the useful discussions regarding the forward model and inversion.