Patients with early stage oral cavity squamous cell carcinoma are generally treated with surgery. Surgeons aim to remove the tumor with a margin of healthy tissue, to ensure optimal local control and prognosis.
The extent of the margin is a trade-off between removing enough tissue to ensure clear margins and sparing as much healthy tissue as possible for good functional outcome. During surgery, information about the extent of the tumor is limited to the surgeon’s palpation of the tumor and preoperative radiological assessment with magnetic resonance (MR) and/or ultrasound (US) imaging. Other than palpation, there is no real-time feedback about tumor borders. In general, the deep resection plane is the most challenging location to achieve well clear margins. Hence, in up to 30% to 85% of the patients, the tumor is removed with an involved or close margin of healthy tissue (i.e., an involved margin, in which tumor cells are present at the resection surface, or a close margin, in which tumor cells are found within 5 mm from the resection surface).1 The majority of these patients need adjuvant treatment with additional surgery, radiotherapy, or sometimes chemo radiation. To reduce the number of patients needing adjuvant treatment, there is an urgent need for a technology that can provide real-time feedback on the presence of tumor cells at the resection margins.
In diffuse reflectance spectroscopy (DRS), diffusely reflected light is measured after illuminating the tissue with a broadband white light source using fiber optics. The reflectance spectrum contains information about the absorption and scattering properties of the illuminated tissue, representing an “optical fingerprint” of the tissue. This technology can be incorporated into a handheld probe for intraoperative tissue characterization.
Within our research group, this technology has been investigated for the ability to discriminate tumor from healthy tissue in lung, liver, breast, and colorectal cancer specimens.2–5 For example, using the fat and water content obtained from the near-infrared (NIR) part of the reflectance spectrum, it was possible to distinguish tumor from healthy fat tissue in breast specimens with a sensitivity and specificity of 100%.4 Similar numbers were obtained in discriminating colorectal cancer from healthy colorectal wall (muscle) using an advanced classification algorithm on both the visual and the NIR part of the spectrum.5
DRS and related technologies have been investigated for use as noninvasive screening tools for early detection of malignancies in the oral cavity.6–13 These studies focused on the discrimination between (pre) malignant, benign, and healthy mucosa. The instruments used measured the reflected light over a wavelength range within 350 to 1000 nm and translated the measured spectra into tissue components. Amelink et al.7 measured a significant decrease in microvascular oxygenation and scattering amplitude and an increase in blood content and scattering slopes in tumor tissue, compared to healthy mucosa. Likewise, Stephen et al.12 showed that tumor tissue reflects more light of the two wavelengths 545 and 575 nm, the wavelengths specifically for the absorption by oxyhemoglobin components, compared to healthy mucosa. In various studies, sensitivities of 82% to 97% and specificities of 87% to 100% have been reported for discriminating oral cavity tumor from healthy mucosa.9,11,12
In contrast to these studies, which concentrate on early tumor detection on the mucosal surface, we aim to use DRS as an intraoperative tool to evaluate the deep resection plane of oral cavity tumors. Hence, we want to differentiate tumor tissue from healthy muscle tissue at the resection plane of the deep resection margin. Furthermore, the main absorber in the visual wavelength range, used by the above-mentioned studies, is blood. They showed that the detection of the microvasculature played a significant role for tumor diagnosis. In contrast to those studies, we would like to use the technology intraoperatively, in an environment where blood is present, and where the blood components will not indicate the tissue type that has been measured. Thus, for intraoperative use of the DRS technology, we extended our wavelength range toward the NIR (1000 to 1700 nm), in which the absorption of light by blood is negligible.2,4
With the aim to assess the deep resection margin during oral cavity cancer surgery, we evaluated whether DRS (400 to 1600 nm) can discriminate tumor from healthy oral muscle tissue, in an ex-vivo setting.
Materials and Methods
Patients undergoing surgery for the removal of oral cavity cancer were considered for this study. Despite different locations of the tumors throughout the oral cavity, all measurements were performed on the same tumor type and on healthy oral muscle tissue. All patients were treated in the Netherlands Cancer Institute-Antoni van Leeuwenhoek, Amsterdam. All ethical guidelines for ex-vivo human studies were followed.
Diffuse Reflectance Spectroscopy
Measurements were obtained with a system consisting of a broadband tungsten halogen light for illumination and two different spectrometers to record the diffuse reflectance spectra from the tissue. The first spectrometer contained a silicon detector resolving the visual light between 400 and 1100 nm (Andor Technology, DU420A-BRDD), and the second spectrometer contained an InGaAs detector resolving light in the NIR region from 800 to 1700 nm (Andor Technology, DU492A-1.7).14 The optical fibers guiding the light from the light source toward the tissue and the light reflected from the tissue toward the spectrometers are in contact with the tissue via a handheld probe. Multiple diffuse reflectance spectra were obtained for different distances between source fiber and detector fiber: 0.3, 0.7, 1, and 2 mm (Fig. 1). Different source–detector distances were used to obtain different sampling depths.
Directly after resection, the specimen was brought to the pathology department, where the resection margins were inked according to standard pathological protocol. The pathologist localized the tumor by palpation and cut the specimen into two parts, right through the middle of the tumor. At the cut surface, suspected tumor and healthy tissue areas were pointed out by the pathologist. Based on this, the measurement locations were chosen in both tumor tissue and healthy muscle tissue. A red, green, and blue (RGB) image was taken from the cut surface for registration purposes. DRS measurements were acquired, and the probe’s position for each measurement was recorded on an RGB image. The spectroscopy system could only measure on two fibers simultaneously. As a consequence, all measurements were first acquired with the 2- and 0.3-mm fiber distance. Subsequently, to acquire the measurements for the 1- and 0.7-mm fiber distances, the probe was positioned at the same location on the basis of the RGB images. Thereafter, the specimen was brought back to the pathology department for further routine pathological processing.
The locations measured with the DRS probe had to be matched with the corresponding locations on the histopathology slide, to confirm the tissue type that was measured. For this, a digital scan was made of the histopathological slide taken from the cut surface, which was registered to the RGB image (Fig. 2). Due to the fact that the specimen was deformed during the histopathological processing, a nonrigid registration algorithm was used for the registration. In this algorithm, obviously matching points in both the histopathological slide and the RGB image were visually selected. On the histopathological slide, the pathologist delineated tumor and healthy muscle tissue. The measurement locations were drawn on the RGB image, based on the images taken from the probe’s position during each measurement. An overlay of the delineated registered histopathological slide on the RGB image containing the measurement locations enabled determination of the measured tissue type for each measurement. Only measurements acquired at locations with an undisputed tissue type were included in the analysis, meaning that the probe was positioned on one tissue type only and not on a boundary between two different tissue types. For algorithm development, such “pure” tissue classes are obligatory.
The diffuse reflectance spectra were acquired from tumor and muscle measurement locations, in the wavelength range of 400 to 1600 nm at a 1-nm interval. To investigate subtle changes in the shape of the reflectance spectra, the first-3 and the second-order derivatives were calculated with the following equations:
The spectral data (the first- and second-order derivatives) were downsampled with a factor 5 to reduce the number of features and prevent the algorithm from overfitting. For the classification of samples, we performed a repeated fivefold cross validation (CV) using linear support vector machine (SVM). In this approach, patients were randomly divided into five equal sized partitions. Each partition was used as a validation set while the other four partitions were used to train the classifier.
The CV procedure was repeated 10 times, each with a different random distribution of the patients over the 5 partitions, yielding 10 tissue type predictions per measurement location. Of these predictions, mean sensitivity, specificity, accuracy (ACC), area under the curve (AUC), and Matthew’s correlation coefficient (MCC) were calculated. The latter gives an indication of the quality of the classification (ranging from to 1, with indicating perfect disagreement, 0 random outcome, and 1 perfect agreement between prediction and observation) and corrects for classification problems with an unequal class (tumor/healthy) size.
This analysis was repeated for each of the four data sets obtained from the different distances between source and detection fiber. Receiver-operating curve (ROC) and MCCs were used to compare results of the different fiber distances.
All data analyses were performed in MATLAB R2018a.
Freshly excised specimens were measured from 28 patients with squamous cell carcinoma in the oral cavity. Nineteen of the tumors were located in the tongue (sixteen lateral tongue, two dorsal tongue, and one tongue base), one in the oropharynx, seven in the floor of the mouth (six paramedial and one anterior), and one in the left cheek. Table 1 shows the total number of spectra measured from tumor and healthy muscle tissue for each distance between source and detection fiber. In Fig. 3, the mean original reflectance spectra are shown with the accompanying standard deviations (STD) for both the tumor and healthy muscle measurement locations. Largest differences between the two tissue types are visible in the wavelength range of 600 to 1000 nm when measuring with 0.7, 1, or 2 mm between source and detection fiber. Especially, the slope of the spectra along these wavelengths differs between tumor and healthy. For the 0.3-mm source–detector distance, there is almost a complete overlap between the STDs of both tissue types. For all source–detector distances, the NIR part of the spectrum (1000 to 1700 nm) does not indicate any difference between measurements acquired on tumor or healthy tissue.
Number of measurements for each distance between source and detection fiber.
|Distance between source and detection fiber (mm)|
|Number of total measurements (patients)||172 (24)||186 (28)||186 (28)||186 (27)|
|Number of tumor measurements||70 (20)||76 (23)||76 (23)||77 (22)|
|Number of muscle measurements||102 (22)||110 (26)||110 (26)||109 (25)|
Figure 4 shows the original reflectance measurement, the slope of the original reflectance measurement (first-order derivative), and slope of the first-order derivative (the second-order derivative) for the complete spectrum and the wavelengths between 1000 and 1400 nm. The first derivative shows differences especially around 800 nm, indicating a large difference in slope between the original tumor and healthy reflectance spectra. Figures 4(d)–4(f) show that although the original reflectance spectrum does not necessarily indicate a difference between tumor and healthy measurements for the NIR range, both the first- and second-order derivatives do show subtle differences between the two tissue types.
Using the first- and the second-order derivatives of the spectra in a linear SVM in the 10 times fivefold CV, MCC ranged from 0.50 to 0.71 for all fiber distances (Table 2). A 1-mm distance between source and detection fiber resulted in the highest value for the MCC (Fig. 5). Mean sensitivity, specificity, and ACC were , , and , respectively, for this distance between source and detection fiber.
Classification results of 10 times fivefold CV with a linear SVM.
|Distance between source and detection fiber (mm)|
Classification outcomes for different factors for data reduction and data processing are shown in Table 3. Reducing the data with a factor 5 did not indicate a significant difference in classification outcomes; it only affected the STD. Thus, using less data, the results were more in line. This shows that, using a factor 5, the data are less subject to overfitting. Also, when comparing the different data processing methods, the classification outcomes are not significantly affected. Using the first- and second-order derivatives resulted in a minor increase in sensitivity, which is clinically most valuable. Furthermore, using the derivatives resulted in the smallest STDs.
Classification results of 1-mm distance between source and detection fiber with different factors for data reduction and data processing.
|Variable downsampling factor used with first- and second-order derivatives||Variable processing of diffuse reflectance spectra used with downsampling factor 5|
|None||Factor 3||Factor 5||None||First-order derivative||Second-order derivative||First- and second-order derivative|
The results obtained with a 0.7-mm distance are comparable to the results obtained with a 1-mm distance, whereas the measurements obtained with 0.3- and 2-mm distances show less accurate discrimination of tumor from healthy tissue.
Aiming for resection margin assessment of the deep resection margin during oral cavity cancer surgery, we evaluated whether DRS (in the wavelength range of 400 to 1700 nm) can accurately discriminate tumor from healthy oral muscle tissue, in an ex-vivo setting. Using a 1-mm distance between source and detection fiber, and a linear SVM, we found a sensitivity, specificity, and ACC of 89%, 82%, and 86%, respectively.
The sampling depth of a DRS probe is defined as the depth reached by 50% of the photons. This depth depends on different factors, such as the optical properties of the tissue, the geometry of the probe, and the wavelength of the light.15 Hennessy et al.15 used a Monte Carlo model of DRS to investigate the effect of the source–detector distance and the wavelength on the sampling depth of photons collected by a DRS probe. The source–detector distance of 0.25 and 1.0 mm measured sampling depths of up to 0.3 and 0.7 mm, respectively. Thus, with a larger source–detector distance, larger sampling depths can be measured. They also showed that the exact sampling depth was wavelength dependent: within the visual range of 400 to 700 nm, the sampling depth varied between 0.4 and 0.7 mm for a source–detector distance of 1.0 mm. According to these data, we estimated that in our study a source–detector distance of 1.0 mm should be able to measure a sampling depth between 0.4 and 0.7 mm, and a source–detector distance of 2 mm should at least be able to measure at a depth of .
We looked at 0.3-, 0.7-, 1.0-, and 2-mm distances between source and detection fiber. In our study, both the 0.7- and 1-mm categories were superior in discriminating tumor from healthy tissue, over the 0.3- and 2-mm categories. This could be explained by the methodology we used to validate the data. We labeled the measurements with a tumor or healthy label based on the first histopathological section of the complete surface of the specimen. Due to the direction of the histopathological slices and the orientation of the specimen in the paraffin block, it is not known from what depth this slice originated. It could have been that the absolute superficial layer, being measured with the 0.3-mm source–detector distance, was not present in our histopathological section, because it was sliced off before the slice including the entire intact surface was reached in the cutting process. On the other hand, in the 2-mm source–detector distance, the category with the largest sampling depth, it could have been the case that, besides traveling through a superficial layer of tumor tissue, the light also could have been traveling through an underlying layer of healthy tissue. Since we did not measure the tumor thickness at the measurement locations, we could not accurately validate the data for this larger sampling depth.
The current clinical practice guidelines in head and neck cancer define a close margin as tumor cells found within from the resection surface. However, there is no consensus with regard to whether is the right definition of “close.” Alicandri-Ciufelli et al.16 reported in their systematic review that a range of 2 to 7 mm is considered close. Also, specifically for the tongue, Zanoni et al.17 recently suggested to redefine the definition of close margins for patients with squamous cell carcinoma. In their study, local recurrence-free survival was significantly affected only with surgical margins of . Thus, to be able to use DRS in the detection of close margins, the required sampling depth is .
In our study, the discriminative power of tumor from healthy tissue was not the strongest for the largest sampling depth obtained with the 2-mm source–detector distance, due to the validation method of this category. Thus, we recommend that future work focus on improving the validation method for larger sampling depths. As an initial approach, US could be used to measure the tumor thickness at the measurement locations. Several studies have shown that the tumor thickness can be accurately measured using US.18,19
As far as we are aware, this is the first study reporting on the use of DRS in discriminating oral cavity tumor from healthy oral muscle tissue, which is essential for the intraoperative application we are aiming at. Other studies have been reporting on the use of DRS for diagnosing oral cavity cancer and are, therefore, focused on discriminating tumor from healthy mucosa, a completely different tissue type.6–13 Therefore, it is not directly possible to compare our results with these studies. At the moment, oral cavity tumor resection is performed without any intraoperative feedback on the presence of tumor cells at the tumor resection margins. So far, there are only studies reporting on the in-vivo use of US to obtain tumor-free margins in tongue surgery.19–21 In these studies, US was used to position needles as an indication for the safety margin during the resection. Also, Songra18 used a metal retractor into the surgical cut, halfway during resection to evaluate the resection margin with US. Other research on this topic focused on ex-vivo evaluation of the resection margins, e.g., by MR. Steens et al.,22 reported in six out of seven patients a resection margin within a 2-mm range of the resection margin reported by histopathology. Both US and MR provide image guidance during surgery, while we aim to use DRS for tissue characterization. Therefore, DRS and US/MR are not mutually exclusive and could be used complementarily to each other.
Clinically, we would like to use DRS as an intraoperative tool to distinguish oral cavity tumor tissue from healthy muscle tissue at the resection plane. The surgeon could position the probe at a suspicious location at the deep resection margin. After measuring the tissue with DRS, the reflectance spectrum will be compared to the database that was used to train the SVM, and a prediction can be made on the type of the measured tissue, real time. A scale on a monitor will indicate the likelihood of measuring tumor tissue. Detection of tumor tissue at the deep resection margin will allow for direct re-excision and thus reduce the involved margins. Intraoperative use of DRS is feasible, because measurements do not delay the surgery and the results can be made available in real time. With the reported sensitivity and specificity in this ex-vivo study, it would be interesting to further develop the technology into a surgical instrument, which can be sterilized for in-vivo use.
This ex-vivo study showed that DRS can discriminate tumor from healthy oral tissue. Using a linear SVM in a 10 times fivefold CV, sensitivity, specificity, and ACC measures of 89%, 82%, and 86% were obtained. Future work should focus on accurate validation methods for larger sampling depths to allow real-time guidance during oral cavity tumor excision.
This study was supported by Philips Research, Eindhoven, Netherlands. The author who is affiliated with Philips Research (BHW) has financial interests in the subject matter, materials, and equipment only in the sense that he is an employee of Philips. None of the other authors has any financial relationship with Philips Research or conflicts of interest.
We would like to thank the NKI-AvL head and neck surgeons and the pathology department for their help in the collection of the data.