|
1.IntroductionLung cancer is responsible for the most cancer-related deaths in the world. In 2017, it is estimated that there will be 222,500 cases of lung cancer and 155,870 deaths from lung cancer in the United States.1 Despite the steady increase in survival for most cancers in recent years, advance has been slow for lung cancer, resulting in only 18% 5-year survival rate.1,2 This low rate is partly due to the asymptomatic nature of lung lesions,3 which prevents lung cancer from being diagnosed at an earlier stage.4 There are two main types of lung cancer: about 80% to 85% of lung cancer is nonsmall cell lung cancer (NSCLC), the remaining forms are small-cell lung cancer. Adenocarcinoma and squamous cell carcinoma are the two major subtypes of NSCLC. Adenocarcinoma is the most common form of lung cancer that accounts for about 40% of total lung cancer incidences. Squamous cell carcinoma accounts for about 30% of total lung cancer incidences, and they are often linked to a history of smoking. Currently, a computed tomography (CT)-guided fine needle biopsy is the standard routine for pathological analysis and definitive diagnosis of lung cancer.5,6 Although CT-guided fine needle aspiration can reduce the amount of tissue taken and complications, because of the limited resolution of CT scans and the respiratory motion of patients, it is sometimes difficult to obtain samples precisely at the site of neoplastic lesions and thus the correct diagnosis.7 Some patients will have to undergo rebiopsy, resulting in increased costs and delay in diagnosis and definitive treatment. Therefore, it would be beneficial to develop techniques that allow medical practitioners to detect and diagnose lung cancer with high speed and accuracy. In recent years, although needle biopsy and pathology analysis still remain the gold standard for definitive lung cancer diagnosis, rapid development in nonlinear optical imaging technologies has contributed to advancing diagnostic strategies in cancer, including autofluorescence bronchoscopy,8–10 two-photon-excited fluorescence microscopy,11,12 and optical coherence tomography.13–15 Our group has previously demonstrated the use of coherent anti-Stokes Raman scattering (CARS) imaging as a fast and accurate differential diagnostic tool for the detection of lung cancer.16–18 CARS is an emerging label-free imaging technique that captures intrinsic molecular vibrations and offers real-time submicron spatial resolution along with information of the biological and chemical properties of samples, without applying any exogenous contrast agents that may be harmful to humans.19 In CARS, two laser beams are temporally and spatially overlapped at the sample to probe designated molecular vibrations and produce signals through a four-wave mixing process19 in a direction determined by the phase-matching conditions.20 When the frequency difference between the pump field and the Stokes field matches the vibrational frequency of Raman active molecules in the sample, the resonant molecules are coherently driven to an excited state and then generate CARS signals that are typically several orders of magnitude stronger than conventional spontaneous Raman scattering.21 We have shown that CARS can offer cellular resolution images of both normal and cancerous lung tissue where pathologically related features can be clearly revealed.16 Based on the extracted quantitative features describing fibrils and cell morphology, we have developed a knowledge-based classification platform to differentiate cancerous from normal lung tissues with 91% sensitivity and 92% specificity. Small-cell carcinoma was distinguished from NSCLC with 100% sensitivity and specificity.18 Since CARS has the capability to provide three-dimensional (3-D) images, we have also designed a superpixel-based 3-D nuclear segmentation and clustering algorithm22–24 to characterize NSCLC subtypes. The result showed greater than 97% accuracy in separating adenocarcinoma and squamous cell carcinoma.17 The above-mentioned classification algorithms, however, require extensive image preprocessing, nuclei segmentation, and feature extraction based on the cellular and fibril structural information inherent in CARS images. A total of 145 features was directly calculated from the CARS images to separate fibril-dominant normal from cell-dominant cancerous lung lesions.18 A semiautomatic nuclei segmentation algorithm was implemented to facilitate the measurement of morphological features that describe both the attributes of individual cells and their relative spatial distribution, such as cell volume, nuclear size, and distance between cells.25 Thirty-five features were extracted based on the segmentation result to distinguish cancerous lung lesions. Partial least squares regression26 and a support vector machine27 were employed as the classification algorithms. Aiming at improving the classification accuracy of the two subtypes of NSCLC, we extended the two-dimensional (2-D) analytical framework to a 3-D nuclear segmentation, feature extraction, and classification system. This diagnostic imaging system involved partitioning the image volume into supervoxels and manually selecting the cell nuclei.28 Another thirty-two features were computed to build a classifier capable of differentiating adenocarcinoma and squamous cell carcinoma lung cancer subtypes.17 To circumvent the complex image processing and feature engineering procedure and avert any possible human interventions, in this paper, we demonstrate the classification of CARS images on lung tissues using a deep convolutional neural network (CNN) that needs no hand-crafted features.29–34 Deep learning algorithms, powered by advances in computation and very large datasets, have recently been shown to be successful in various tasks.35–39 CNN is effective for image classification problems because the nature of weight sharing produces information on spatially correlated features of the image. Since training a deep CNN from scratch is computationally expensive and time-consuming, we apply a transfer learning approach that takes advantage of the weights of a pretrained model named GoogleNet Inception v3, which was pretrained on images (1000 object categories) from the 2014 ImageNet large-scale visual recognition challenge and achieved 5.64% top-5 error.40–45 To the best of our knowledge, the characterization of CARS or other label-free microscopic images using deep CNN has not been attempted. Combining the label-free high-resolution imaging modality and deep learning image classification algorithm will lead to a strategy of automated differential diagnosis of lung cancer. 2.Materials and Methods2.1.Tissue Sample PreparationFresh human lung tissues were obtained from patients undergoing lung surgery at Houston Methodist Hospital, Houston, Texas, approved by the Institutional Review Board. The excised tissues were immediately snap-frozen in liquid nitrogen for storage. Frozen tissue samples were passively thawed for 30 min at room temperature, placed on a cover slide (VWR, Radnor, Pennsylvania), and then reversely placed on an imaging chamber to keep the samples from being pressed. CARS images were acquired ex vivo using a modified confocal microscope. Each lung tissue was imaged at several different locations, and at each location a series of images was acquired at different image depths, called a -stack.46 Each -stack was considered as an independent sample. A total of 388 -stacks (7926 images) was collected, including 83 (1920 images) normal cases, 156 (2618 images) adenocarcinoma cases, 111 (2535 images) squamous cell carcinoma cases, and 38 (853 images) small-cell carcinoma cases. Small-cell carcinoma cases are seldom resected clinically, resulting in a lower number. After experiments, all samples were marked to indicate the imaged locations, fixed with 4% neutral-buffered formaldehyde, embedded in paraffin, sectioned through imaged locations, and stained with hematoxylin and eosin (H&E). Bright-field images of the H&E slides were examined by pathologists with an Olympus BX51 microscope. Pathologists labeled the tissue based on the H&E-sectioning results, which was then used as the ground truth for the CARS images. Each tissue has only one specific label, i.e., we did not use tissues that might have both normal and tumor cells. 2.2.Coherent Anti-Stokes Raman Scattering Imaging SystemThe schematic of the CARS imaging system was previously described.16 The light sources include a mode-locked Nd:YVO4 laser (High-Q Laser, Hohenems, Austria) that delivers a 7-ps, 76-MHz pulse train at 1064 nm used as the Stokes beam, and a frequency-doubled pulse train at 532 nm, which is used to pump a tunable optical parametric oscillator (OPO, Levante, APE, Berlin, Germany) to generate the 5-ps pump beam at 817 nm. The resulting CARS signal is at 663 nm, corresponding to the stretch vibration mode (). The Stokes and pump laser beams are spatially overlapped using a long-pass dichroic mirror (q10201pxr, Chroma, Vermont) and temporally overlapped using an adjustable time-delay line. They are tightly focused by a 1.2-NA water immersion objective lens (, IR UPlanApo, Olympus, Melville, New Jersey), yielding CARS signals with a lateral resolution of and an axial resolution of . A bandpass filter (hq660/40m-2p, Chroma Inc.) is placed before the photomultiplier tube (PMT, R3896, Hamamatsu, Japan) detector to block unwanted background signals. The PMT is designed to be most sensitive in the visible wavelength region. The upright microscope is modified from an FV300 confocal laser scanning microscope (Olympus, Japan) adopting a 2-D galvanometer. A dichroic mirror is used in the microscope to separate the CARS signals from excitation laser beams. The acquisition time is about 3.9 s per imaging frame of . Image is displayed with the Olympus FluoView v5.0 software. The field of view (FOV) that we can normally achieve is . The average laser power is 75 mW for pump beam and 35 mW for Stokes beam, both within the safety range. In the human body, the tolerance to laser power is much higher than that in thawed tissues, therefore less photodamage is expected and the current laser power is expected to be suitable for clinical applications.47 2.3.Data PreparationWhile CARS images corresponding to the same tissue but from different viewpoints look very different, a -stack is likely to contain similar images at different imaging depths, as the smallest -step is only . Therefore, extensive care was taken to ensure that images from the same -stack were not split between the training and testing sets. We randomly picked eight normal lung -stacks (160 images), 16 adenocarcinoma -stacks (257 images), 12 squamous cell carcinoma -stacks (239 images), and four small-cell carcinoma -stacks (102 images) as our holdout test set. The rest of the dataset (7168 images) was used for training and validation. We converted our grayscale CARS images into RGB and resized each image to as required by the pretrained model. To facilitate the training of deep neural networks, a large amount of labeled data is normally needed. Unlike image databases, such as the ImageNet43 and MS-COCO,48 that are publicly available for the computer vision community, we have only a limited number of CARS images on human lung tissues, thus data augmentation was applied to prevent overfitting.49 Overfitting occurs when a model is excessively complex as compared to the number of training data, so the model is not able to learn patterns that can generalize to new observations. Since tissues do not possess a particular orientation, performing augmentation with rotation and mirroring transformation should not alter the pathological information contained on the CARS images. Each image in the cross-validation set was augmented by a factor of 120 by randomly rotating the image 60 times and then flipping the rotated image horizontally. The cross-validation set contains 860,160 images after image augmentation. We did not use other augmentation methods that might involve artificial distortions on pixel values. The largest inscribed rectangle was then cropped from each image, resulting in an image with . 2.4.Transfer LearningTransfer learning refers to the process of using the knowledge learned on a specific task and adapting this knowledge to a different domain, based on the hypothesis that the features learned by the pretrained model are highly transferable.42,50 Transfer learning reduces the time and computational resources needed to train deep neural networks from scratch on a large amount of data because it does not need to take time optimizing millions of parameters. Despite the disparity between natural images and biological images, deep CNN architectures comprehensively trained on the large-scale well-annotated ImageNet can still be transferred to biological and medical domains and have demonstrated their successes in various image classification tasks, such as skin cancer,37 colon polyps,51 thyroid nodules,52 thoraco-abdominal lymph nodes, and interstitial lung disease.53 The GoogleNet Inception v3 model consists of 22 layers stacked on top of each other (see Fig. 1).40 We replaced the final classification layer with four classes instead of the original 1000 ImageNet classes, the previous layers remained unchanged because the existing weights are already valuable at finding and summarizing features that are useful for image classification problems. Generally speaking, earlier layers in the model contain more general features, such as edge detectors or shape detectors, which should be ubiquitous for many image classification tasks, whereas later layers contain higher level features. We retrained the Inception v3 model using only pixels and labels of CARS images as inputs, and fine-tuned the final classification layer using a global learning rate of 0.05. We randomly divided the training and validation set into nine partitions so that we could use ninefold cross validation to validate the performance of the algorithm. We then calculated and cached the bottleneck values for each image to accelerate the following training process. The bottleneck values are the information stored in the penultimate layer, which will be used multiple times during training.36 In each step of training, the model selected a small batch of images at random, found their bottlenecks from the cache, and fed them into the final classification layer to perform prediction. Those predictions were then compared against the ground-truth labels to iteratively update the final layer’s weights through backpropagation.55 We used cross entropy as the cost function to measure the performance of our model. Smaller cross-entropy value means smaller mismatch between the predicted labels and the ground-truth labels. After 40,000 steps, the retrained model was evaluated on the holdout test set that was separated from the training and validation sets. All methods were implemented using Google’s TensorFlow (version r1.1) deep learning framework56 on Amazon Web Services (AWS) EC2 p2.xlarge instance, which contains 4 vCPU with 61 GiB memory and 1 GPU with 12 GiB memory. It took about 3 days to calculate and cache all the bottleneck values for each image in the cross-validation set. After that, each cross-validation round took about half an hour, whereas the prediction on the holdout test set needed only 6 min. 3.Results3.1.Coherent Anti-Stokes Raman Scattering Imaging of Human Lung TissuesFigure 2 represents cellular-level CARS images (upper panels) and the corresponding H&E staining results (lower panels) obtained from normal, adenocarcinoma, squamous cell carcinoma, and small-cell carcinoma human lung tissues. The normal lung mainly consists of well-organized fibrous elastin and collagen structures [shown as bright structures in Fig. 2(a)], and they work together to form a supporting network for the maintenance and integrity of lung tissues. Elastin constitutes up to 50% of the lung connective tissue mass and acts as the elastic protein allowing the lung tissue to resume its original shape after stretching. Collagen fibers are well-aligned proteins that provide support to maintain the tensile strength of lungs during the respiration process.57 In contrast, cancerous lung tissues no longer preserve rich fibrous structures because cancer progression is often associated with the destruction and degradation of the original matrix network at the tumor invasion front.58,59 Therefore, Figs. 2(b)–2(d) show much denser cellularity and significantly less fibrous structures compared with the normal samples. Because cell nuclei (yellow arrow) have less bonds compared to cell membrane and cytoplasm, they appear as dark spots in CARS images. Commonly used pathological features can be identified in the three neoplastic lung types. Adenocarcinoma mostly contains cancer cells that have nested large round shapes and vesicular nuclei, with inhomogeneous cytoplasm forming glandular structures.60 Only a few broken elastin and collagen fibers are present in the tissue. Typical features in squamous cell carcinoma are pleomorphic malignant cells in sheets with abundant dense cytoplasm and formation of intracellular bridges.60 Small-cell carcinoma is noted by round/oval cells with high nuclear–cytoplasmic radio.61 All CARS images illustrate good pathological correlation with H&E staining sections from the same tissue samples. Minor observed discrepancies between CARS and histology imaging and the inconsistent colors of H&E imaging were attributed to tissue fixation, processing, staining, and sectioning artifacts. The difficulty in locating the exact imaging spots due to the small FOV of CARS imaging and small working distance of the objective may also contribute to the mismatch between CARS and H&E imaging. 3.2.Automatic Classification and Differential Diagnosis of Lung TissuesWe first validated the effectiveness of transfer learning on CARS images using a nine-fold cross-validation approach. The entire training and validation dataset was split into nine folds (folds are split into disjoint sets of patients). We held one fold at each time as the test set and trained the model on the remaining eight folds of the data, until each of the nine folds had served as the test set. The overall accuracy that the retrained model achieves is (). The learning curves of training and validation for one cross-validation round displayed by TensorBoard are shown in Fig. 3. The stochastic gradient descent optimization algorithm was used to iteratively minimize the cost function. Sixty-four images were randomly picked in each training step to estimate the gradient over all examples. The cost functions of training and validation in Fig. 3 both incline downward and converge to a low value after 40,000 steps. We then tested the retrained model on the holdout test set that had never been seen by the model. The model also achieves 91.4% prediction accuracy, indicating its robustness to data. Figure 4(a) shows the normalized confusion matrix of our method on the holdout test set. Each row represents the instances in a ground-truth class and each column represents the instances predicted by the model. It can be noted that 100% of the actual normal cases are correctly classified as normal. 95% of the actual small-cell lung carcinoma cases are correctly labeled as small-cell lung carcinoma, only 4% of them are wrongly identified as nonsmall cell lung carcinoma. It is also worth mentioning that the two subtypes of NSCLC are sometimes misclassified as each other due to their morphological similarities, especially in poorly differentiated areas. This is in accordance with the challenges for pathologists in separating adenocarcinoma and squamous cell lung carcinoma.62 Given that the distribution of class labels in our dataset is not balanced, we plotted the receiver operating characteristic (ROC) curves [see Fig. 4(b)] for three conditions: (1) separating cancerous lung images from normal lung images; (2) separating small-cell carcinoma lung images from nonsmall cell carcinoma lung images; and (3) separating the subtypes of nonsmall cell carcinoma lung images, i.e., adenocarcinoma and squamous cell lung carcinoma. ROC curves feature false positive rate (1–specificity) on the axis and true positive rate (sensitivity) on the axis.63 Specifically, “true positive” in each condition is: (1) the number of correctly predicted cancerous lung images; (2) the number of correctly predicted small-cell lung images; and (3) the number of correctly predicted adenocarcinoma lung images. When a test CARS image is fed through the model, the very last classification layer outputs a normalized probability distribution over four classes (normal, small-cell, adenocarcinoma, and squamous cell carcinoma). The ROC curve for each condition is created by plotting the true positive rate against the false positive rate at various discrimination thresholds. The area under the curve (AUC score) is also computed for each condition, whereas an AUC score close to 1 indicates an excellent diagnostic test. From a medical perspective, minimizing false negatives (known as type II error) is more important in cancer diagnosis because missing a cancerous case will lead to patients being untreated. To better understand the internal features learned by the retrained model, we visualized the penultimate layer using t-distributed stochastic neighbor embedding (t-SNE) in Fig. 5.64,65 Each point in Fig. 5 represents a CARS image from the holdout test set and the color represents its ground-truth label. The original 2048-dimensional output of the model’s last hidden layer is reduced and projected into a 2-D plane. t-SNE basically converts similarities between data points to joint probabilities and tries to minimize the Kullback–Leibler divergence between the joint probabilities of the low-dimensional embedding and the high-dimensional data.64 The points that belong to the same class should be clustered close together. As can be seen in Fig. 5, almost all the normal cases are grouped together clearly, which corresponds to the results in the confusion matrix. A few misclassified small-cell cases are located near the squamous cell carcinoma cases. Similarly, there exists some overlap between adenocarcinoma and squamous cell carcinoma as these two lung cancer subtypes have noticeable pathological similarities. Neural networks have long been known as “black boxes” because it is difficult to understand exactly how any particular, trained neural network functions due to the large number of interacting, nonlinear parts.66 In Fig. 6, we fed a normal and small-cell carcinoma CARS images into the model and visualized the first convolutional layer that consists of thirty-two kernels. Certain conventional image descriptors developed for object recognition can be observed in Fig. 6, such as detecting the edges of cell nuclei and detecting round lipid droplets. The visualization of internal convolutional layers thus gives us better intuition on the convolutional operations inside the deep CNN model. Finally, we proposed a majority voting strategy to further improve the performance of the model in predicting lung tissue classes. Since multiple CARS images at different imaging depths were acquired at each sampling point, we label the tissue type with the majority class to adjust the conflicting results among each -stack. In the holdout test set, we correctly predict 8 out of 8 normal lung tissues, 4 out of 4 small-cell carcinoma lung tissues, 15 out of 16 adenocarcinoma lung tissues, and 9 out of 11 squamous cell carcinoma lung tissues. The overall prediction accuracy of this strategy is 92.3%, which is expected to be higher if we have more -stacks for testing. This method gives more confidence in determining the tissue types without sacrificing too much time, as the acquisition of a -stack normally takes less than a minute. 4.DiscussionAutomating image analysis is essential to promoting the diagnostic value of nonlinear optical imaging techniques for clinical applications. Thus far, we have explored the potential of CARS imaging in differentiating lung cancer from nonneoplastic lung tissues and identifying different lung cancer types.16–18 These strategies, however, are based on extraction and calibration of a series of disease-related morphology features. Therefore, they are not fully automatic because the quantitative measurement of the features requires the segmentation of cell nuclei, which involves a manual selection procedure and creates potential bias. In the superpixel-based 3-D image analysis platform, labeling the cell nuclei on all the images of one -stack would take more than five minutes, which is very inefficient for analysis of a large amount of image data. In this study, we exploited the recent advances in deep CNN and demonstrated how we can apply the model pretrained on the ImageNet data to a completely new domain such as CARS images and yet achieve reliable results. Our experimental results indicate that the proposed deep learning method has the following advantages when coupled with label-free CARS images. First, deep CNN can extract useful features automatically,45 thus we can use raw image pixel values directly as the input for the model. The algorithm described in this paper can label a batch of images quickly (0.48 s per image) without any preprocessing or human intervention. Second, we separated normal, small-cell carcinoma, adenocarcinoma, and squamous cell carcinoma lung CARS images using a single retrained model. In comparison, we previously built three different classifiers using three different feature sets in a hierarchical manner to separate normal from cancerous lung, small-cell carcinoma from nonsmall cell carcinoma, and adenocarcinoma from squamous cell carcinoma, respectively. Third, we used a larger dataset (7926 images as compared to previously images) and did not exclude the images that have blurry cell nuclei, as in a previous semiautomatic approach, resulting in a more robust model performance. Yet we still achieved comparable results to the previously reported results, i.e., 98.7% sensitivity and 100% specificity in separating cancerous from normal lung, 96.1% sensitivity and 96.2% specificity in separating small-cell carcinoma from nonsmall cell carcinoma, and 93.0% overall accuracy in separating adenocarcinoma from squamous cell carcinoma. Finally, in our previous works, we validated the performance of the classification algorithm using the leave-one-out method in which we repeatedly reserved one observation for prediction and trained the model on the rest of the dataset. However, this is a suboptimal validation method as compared to the -fold cross validation (in our method ), as it leads to higher variation in testing model effectiveness and it is computationally expensive when the size of the dataset is big. We noticed that our CARS images have uneven background, most likely due to the chromatic aberrations. Interestingly, the traditional approach to remove background information of the microscopic images before feeding images into the deep CNN actually reduced the accuracy of prediction. This may be due to the fact that the background information contains subtle information about the foreground image features of interest, which normally are ignored by human inspectors. Meanwhile, it should be noted that most lung cancers are histologically heterogeneous among different patients, and the same cancer subtype may have different pathological characteristics at different stages, resulting much larger within-class appearance variations.60 Figure 7 shows certain representative misclassified CARS images. Figure 7(a) is a false negative case where the model labels an adenocarcinoma image as normal, which may be caused by the fibrous structure (yellow arrow) that is rarely seen in cancerous tissues. This is called a type II error, which must be minimized in clinical practice. In Fig. 7(b), a squamous cell carcinoma is incorrectly classified as adenocarcinoma with a probability of 76%, indicating that the model is sometimes confused in differentiating adenoma and squamous cell carcinoma cases. In fact, separating NSCLC by visual inspection is also challenging for pathologists. In Fig. 7(c), a small-cell carcinoma is wrongly identified as squamous cell carcinoma. These mistakes might be induced by the fact that these misleading images are minorities in the dataset, and consequently the model has not been well trained to label them correctly. Therefore, the best way to further improve the current performance is collecting more data that can cover a broader morphological spectrum of lung cancer tissues. Fine-tuning the hyperparameters across all the previous layers may also boost the model performance. However, the Inception v3 CNN contains about 5 millions tunable parameters and would thus require sufficiently large numbers of uniquely labeled images than our current method. Given the amount of data (before augmentation) and the computational resources available, we believe that leveraging the pretrained weights would be a more efficient solution than fine-tuning hyperparameters of the previous layers. In the past decades, increasing attention has addressed developing minimized optical fiber-based CARS imaging probes for direct in vivo applications at clinical settings.67–70 Our group has been studying the mechanisms of optical fiber delivered CARS,71–74 and recently reported a fiber-based miniaturized endomicroscope probe design that incorporates a small lens part with microelectro-mechanical systems.75 This device has opened up the possibility of applying CARS in vivo to provide reliable architectural and cellular information without the need for invasive and sometimes repetitive tissue removal. We expect that combing a miniaturized fiber optics probe and a deep learning-based automatic image classification algorithm will have the potential to detect and differentiate lung tissue on-the-spot during image-guided intervention of lung cancer patients and save precious time in sending tissue specimens for frozen sectioning diagnosis in a pathology laboratory. Furthermore, deep learning is agnostic to the type of images, thus it can be applied not only on CARS images, but also some other nonlinear optical imaging and microscopy imaging modalities, such as two-photon-excited autofluorescence (TPEAF) and second harmonic generation (SHG). We have proven that the morphological difference shown in TPEAF and SHG images can be used to differentiate normal and desmoplastic lung tissues.76 A multimodal image classification algorithm can be developed in the future to incorporate with multimodal imaging techniques, such that more tissue structures can be detected and analyzed in real time. In conclusion, we demonstrated the viability of analyzing CARS images of human lung tissues using a deep CNN that was pretrained on ImageNet data. We applied transfer learning to retrain the model and built a classifier that can differentiate normal and cancerous lung tissues as well as different lung cancer types. The average process time of predicting a single image is less than half a second. The reported computerized and label-free imaging strategy holds the potential for substantial clinical impact by offering efficient differential diagnosis of lung cancer, enabling medical practitioners to obtain essential information in real time and accelerate clinical decision-making. When coupled with a fiber-based imaging probe, this strategy would reduce the need for tissue biopsy while facilitating definitive treatment. This method is primarily constrained by data and can be further improved by having a bigger and more generalized dataset, to validate this technique across the full distribution and spectrum of lung lesions encountered in clinical practice. DisclosuresNo conflicts of interest, competing or financial interests, or otherwise, are declared by the authors. AcknowledgmentsWe would like to thank Kelvin Wong, Miguel Valdivia Y Alvarado, Lei Huang, Zhong Xue, Zheng Yin, and Tiancheng He from the Department of Systems Medicine and Bioengineering, Houston Methodist Research Institute for helpful discussions. We would also like to thank Rebecca Danforth for proofreading this manuscript. The funding of this research was supported by NIH UO1 188388, DOD W81XWH-14-1-0537, John S. Dunn Research Foundation, and Cancer Fighters of Houston Foundation. ReferencesR. L. Siegel, K. D. Miller and A. Jemal,
“Cancer statistics, 2017,”
CA. Cancer J. Clin., 67 7
–30
(2017). http://dx.doi.org/10.3322/caac.21387 CAMCAM 0007-9235 Google Scholar
D. R. Youlden, S. M. Cramb and P. D. Baade,
“The international epidemiology of lung cancer: geographical distribution and secular trends,”
J. Thorac. Oncol., 3 819
–831
(2008). http://dx.doi.org/10.1097/JTO.0b013e31818020eb Google Scholar
A. McWilliams et al.,
“Innovative molecular and imaging approaches for the detection of lung cancer and its precursor lesions,”
Oncogene, 21 6949
–6959
(2002). http://dx.doi.org/10.1038/sj.onc.1205831 ONCNES 0950-9232 Google Scholar
P. T. Cagle et al.,
“Revolution in lung cancer: new challenges for the surgical pathologist,”
Arch. Pathol. Lab. Med., 135 110
–116
(2011). http://dx.doi.org/10.1043/2010-0567-RA.1 APLMAS 0003-9985 Google Scholar
C. I. Henschke et al.,
“Early lung cancer action project: overall design and findings from baseline screening,”
Lancet, 354 99
–105
(1999). http://dx.doi.org/10.1016/S0140-6736(99)06093-6 LANCAO 0140-6736 Google Scholar
“Survival of patients with stage I lung cancer detected on CT screening,”
N. Engl. J. Med., 2006 1763
–1771
(2006). http://dx.doi.org/10.1056/NEJMoa060476 Google Scholar
M. Montaudon et al.,
“Factors influencing accuracy of CT-guided percutaneous biopsies of pulmonary lesions,”
Eur. Radiol., 14 1234
–1240
(2004). http://dx.doi.org/10.1007/s00330-004-2250-3 Google Scholar
T. Gabrecht et al.,
“Optimized autofluorescence bronchoscopy using additional backscattered red light,”
J. Biomed. Opt., 12 064016
(2007). http://dx.doi.org/10.1117/1.2811952 JBOPFO 1083-3668 Google Scholar
B. Zaric et al.,
“Advanced bronchoscopic techniques in diagnosis and staging of lung cancer,”
J. Thorac. Dis., 5 S359
–S370
(2013). http://dx.doi.org/10.3978/j.issn.2072-1439.2013.05.15 Google Scholar
A. Tremblay et al.,
“Low prevalence of high-grade lesions detected with autofluorescence bronchoscopy in the setting of lung cancer screening in the Pan-Canadian lung cancer screening study,”
Chest J., 150 1015
–1022
(2016). http://dx.doi.org/10.1016/j.chest.2016.04.019 Google Scholar
W. Denk, J. H. Strickler and W. W. Webb,
“Two-photon laser scanning fluorescence microscopy,”
Science, 248 73
–76
(1990). http://dx.doi.org/10.1126/science.2321027 SCIEAS 0036-8075 Google Scholar
G. Thomas et al.,
“Advances and challenges in label-free nonlinear optical imaging using two-photon excitation fluorescence and second harmonic generation for cancer research,”
J. Photochem. Photobiol. B, 141 128
–138
(2014). http://dx.doi.org/10.1016/j.jphotobiol.2014.08.025 JPPBEG 1011-1344 Google Scholar
D. Huang et al.,
“Optical coherence tomography,”
Science, 254 1178
–1181
(1991). http://dx.doi.org/10.1126/science.1957169 Google Scholar
L. P. Hariri et al.,
“Diagnosing lung carcinomas with optical coherence tomography,”
Ann. Am. Thorac. Soc., 12 193
–201
(2015). http://dx.doi.org/10.1513/AnnalsATS.201408-370OC Google Scholar
H. Pahlevaninezhad et al.,
“Coregistered autofluorescence-optical coherence tomography imaging of human lung sections,”
J. Biomed. Opt., 19 036022
(2014). http://dx.doi.org/10.1117/1.JBO.19.3.036022 JBOPFO 1083-3668 Google Scholar
L. Gao et al.,
“Differential diagnosis of lung carcinoma with coherent anti-Stokes Raman scattering imaging,”
Arch. Pathol. Lab. Med., 136 1502
–1510
(2012). http://dx.doi.org/10.5858/arpa.2012-0238-SA APLMAS 0003-9985 Google Scholar
L. Gao et al.,
“Differential diagnosis of lung carcinoma with three-dimensional quantitative molecular vibrational imaging,”
J. Biomed. Opt., 17 066017
(2012). http://dx.doi.org/10.1117/1.JBO.17.6.066017 JBOPFO 1083-3668 Google Scholar
L. Gao et al.,
“On-the-spot lung cancer differential diagnosis by label-free, molecular vibrational imaging and knowledge-based classification,”
J. Biomed. Opt., 16 096004
(2011). http://dx.doi.org/10.1117/1.3619294 JBOPFO 1083-3668 Google Scholar
J.-X. Cheng and X. S. Xie,
“Coherent anti-Stokes Raman scattering microscopy: instrumentation, theory, and applications,”
J. Phys. Chem. B, 108 827
–840
(2004). http://dx.doi.org/10.1021/jp035693v JPCBFK 1520-6106 Google Scholar
C. L. Evans et al.,
“Chemical imaging of tissue in vivo with video-rate coherent anti-Stokes Raman scattering microscopy,”
Proc. Natl. Acad. Sci. U. S. A, 102 16807
–16812
(2005). http://dx.doi.org/10.1073/pnas.0508282102 Google Scholar
J.-X. Cheng, A. Volkmer and X. S. Xie,
“Theoretical and experimental characterization of coherent anti-Stokes Raman scattering microscopy,”
J. Opt. Soc. Am. B, 19 1363
–1375
(2002). http://dx.doi.org/10.1364/JOSAB.19.001363 JOBPDE 0740-3224 Google Scholar
R. Achanta et al.,
“SLIC superpixels,”
Switzerland
(2010). Google Scholar
A. A. Hammoudi et al.,
“Automated nuclear segmentation of coherent anti-Stokes Raman scattering microscopy images by coupling superpixel context information with artificial neural networks,”
in Proc. of the Second Int. Conf. on Machine Learning in Medical Imaging,
317
–325
(2011). Google Scholar
A. Lucchi et al.,
“Supervoxel-based segmentation of mitochondria in EM image stacks with learned shape features,”
IEEE Trans. Med. Imaging, 31 474
–486
(2012). http://dx.doi.org/10.1109/TMI.2011.2171705 ITMID4 0278-0062 Google Scholar
V. Kumar et al., The Lung, Elsevier Health Sciences(2014). Google Scholar
H. Abdi,
“Partial least square regression (PLS regression),”
Encycl. Res. Methods Soc. Sci., 6 792
–795
(2003). Google Scholar
C. J. Burges,
“A tutorial on support vector machines for pattern recognition,”
Data Min. Knowl. Discovery, 2 121
–167
(1998). http://dx.doi.org/10.1023/A:1009715923555 Google Scholar
L. Gao et al.,
“Label-free high-resolution imaging of prostate glands and cavernous nerves using coherent anti-Stokes Raman scattering microscopy,”
Biomed. Opt. Express, 2 915
–926
(2011). http://dx.doi.org/10.1364/BOE.2.000915 BOEICL 2156-7085 Google Scholar
Y. LeCun, Y. Bengio and G. Hinton,
“Deep learning,”
Nature, 521 436
–444
(2015). http://dx.doi.org/10.1038/nature14539 Google Scholar
J. Schmidhuber,
“Deep learning in neural networks: an overview,”
Neural Networks, 61 85
–117
(2015). http://dx.doi.org/10.1016/j.neunet.2014.09.003 NNETEB 0893-6080 Google Scholar
C. Szegedy et al.,
“Going deeper with convolutions,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
1
–9
(2015). http://dx.doi.org/10.1109/CVPR.2015.7298594 Google Scholar
M. Lin, Q. Chen and S. Yan,
“Network in network,”
(2013). Google Scholar
H. Wang and B. Raj,
“On the origin of deep learning,”
(2017). Google Scholar
M. A. Arbib, The Handbook of Brain Theory and Neural Networks, Part 1: Background: The Elements of Brain Theory and Neural Networks, MIT Press(2003). Google Scholar
J.-Z. Cheng et al.,
“Computer-aided diagnosis with deep learning architecture: applications to breast lesions in US images and pulmonary nodules in CT scans,”
Sci. Rep., 6 24454
(2016). http://dx.doi.org/10.1038/srep24454 SRCEC3 2045-2322 Google Scholar
K. He et al.,
“Deep residual learning for image recognition,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
770
–778
(2016). http://dx.doi.org/10.1109/CVPR.2016.90 Google Scholar
A. Esteva et al.,
“Dermatologist-level classification of skin cancer with deep neural networks,”
Nature, 542 115
–118
(2017). http://dx.doi.org/10.1038/nature21056 Google Scholar
A. Krizhevsky, I. Sutskever and G. E. Hinton,
“ImageNet classification with deep convolutional neural networks,”
in Advances in Neural Information Processing Systems 25,
1097
–1105
(2012). Google Scholar
D. Silver et al.,
“Mastering the game of Go with deep neural networks and tree search,”
Nature, 529 484
–489
(2016). http://dx.doi.org/10.1038/nature16961 Google Scholar
C. Szegedy et al.,
“Rethinking the inception architecture for computer vision,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
2818
–2826
(2016). http://dx.doi.org/10.1109/CVPR.2016.308 Google Scholar
S. Ioffe and C. Szegedy,
“Batch normalization: accelerating deep network training by reducing internal covariate shift,”
in Int. Conf. on Machine Learning,
448
–456
(2015). Google Scholar
S. J. Pan and Q. Yang,
“A survey on transfer learning,”
IEEE Trans. Knowl. Data Eng., 22 1345
–1359
(2010). http://dx.doi.org/10.1109/TKDE.2009.191 ITKEEH 1041-4347 Google Scholar
J. Deng et al.,
“ImageNet: a large-scale hierarchical image database,”
in IEEE Conf. on Computer Vision and Pattern Recognition,
248
–255
(2009). http://dx.doi.org/10.1109/CVPR.2009.5206848 Google Scholar
O. Russakovsky et al.,
“ImageNet large scale visual recognition challenge,”
(2014). Google Scholar
K. Simonyan and A. Zisserman,
“Very deep convolutional networks for large-scale image recognition,”
(2014). Google Scholar
C. L. Evans and X. S. Xie,
“Coherent anti-Stokes Raman scattering microscopy: chemical imaging for biology and medicine,”
Annu. Rev. Anal. Chem., 1 883
–909
(2008). http://dx.doi.org/10.1146/annurev.anchem.1.031207.112754 Google Scholar
Y. Yang et al.,
“Differential diagnosis of breast cancer using quantitative, label-free and molecular vibrational imaging,”
Biomed. Opt. Express, 2 2160
–2174
(2011). http://dx.doi.org/10.1364/BOE.2.002160 BOEICL 2156-7085 Google Scholar
T.-Y. Lin et al.,
“Microsoft coco: common objects in context,”
(2014). Google Scholar
S. Xie et al.,
“Hyper-class augmented and regularized deep learning for fine-grained image classification,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
2645
–2654
(2015). http://dx.doi.org/10.1109/CVPR.2015.7298880 Google Scholar
R. Caruana,
“Multitask learning,”
Mach. Learn., 28 41
–75
(1997). http://dx.doi.org/10.1023/A:1007379606734 MALEEZ 0885-6125 Google Scholar
E. Ribeiro et al.,
“Transfer learning for colonic polyp classification using off-the-shelf CNN features,”
in Int. Workshop on Computer-Assisted and Robotic Endoscopy,
1
–13
(2016). Google Scholar
J. Chi et al.,
“Thyroid nodule classification in ultrasound images by fine-tuning deep convolutional neural network,”
J. Digital Imaging, 30 477
–486
(2017). http://dx.doi.org/10.1007/s10278-017-9997-y JDIMEW Google Scholar
S. Hoo-Chang et al.,
“Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning,”
IEEE Trans. Med. Imaging, 35 1285
–1298
(2016). http://dx.doi.org/10.1109/TMI.2016.2528162 ITMID4 0278-0062 Google Scholar
J. Shlens,
“Train your own image classifier with Inception in TensorFlow,”
(2016) https://research.googleblog.com/2016/03/train-your-own-image-classifier-with.html March ). 2016). Google Scholar
R. Hecht-Nielsen,
“Theory of the backpropagation neural network,”
Neural Networks, 1 445
–448
(1988). http://dx.doi.org/10.1016/0893-6080(88)90469-8 NNETEB 0893-6080 Google Scholar
M. Abadi et al.,
“TensorFlow: large-scale machine learning on heterogeneous distributed systems,”
(2016). Google Scholar
M. Toshima, Y. Ohtani and O. Ohtani,
“Three-dimensional architecture of elastin and collagen fiber networks in the human and rat lung,”
Arch. Histol. Cytol., 67 31
–40
(2004). http://dx.doi.org/10.1679/aohc.67.31 AHCYEZ 0914-9465 Google Scholar
R. A. Cairns, R. Khokha and R. P. Hill,
“Molecular mechanisms of tumor invasion and metastasis: an integrated view,”
Curr. Mol. Med., 3 659
–671
(2003). http://dx.doi.org/10.2174/1566524033479447 Google Scholar
T. Eto et al.,
“The changes of the stromal elastotic framework in the growth of peripheral lung adenocarcinomas,”
Cancer, 77 646
–656
(1996). http://dx.doi.org/10.1002/(ISSN)1097-0142 CANCAR 0008-543X Google Scholar
P. T. Cagle, R. Barrios and T. C. Allen,
“Color atlas and text of pulmonary pathology,”
Carcinomas, Lippincott Williams & Wilkins(2008). Google Scholar
R. Govindan et al.,
“Changing epidemiology of small-cell lung cancer in the United States over the last 30 years: analysis of the surveillance, epidemiologic, and end results database,”
J. Clin. Oncol., 24 4539
–4544
(2006). http://dx.doi.org/10.1200/JCO.2005.04.4859 JCONDN 0732-183X Google Scholar
J. Terry et al.,
“Optimal immunohistochemical markers for distinguishing lung adenocarcinomas from squamous cell carcinomas in small tumor samples,”
Am. J. Surg. Pathol., 34 1805
–1811
(2010). http://dx.doi.org/10.1097/PAS.0b013e3181f7dae3 Google Scholar
T. Fawcett,
“An introduction to ROC analysis,”
Pattern Recognit. Lett., 27 861
–874
(2006). http://dx.doi.org/10.1016/j.patrec.2005.10.010 PRLEDG 0167-8655 Google Scholar
L. van der Maaten and G. Hinton,
“Visualizing data using t-SNE,”
J. Mach. Learn. Res., 9 2579
–2605
(2008). Google Scholar
J. Donahue et al.,
“DeCAF: a deep convolutional activation feature for generic visual recognition,”
in Int. Conf. on Machine Learning,
647
–655
(2014). Google Scholar
J. Yosinski et al.,
“Understanding neural networks through deep visualization,”
(2015). Google Scholar
B. G. Saar et al.,
“Coherent Raman scanning fiber endoscopy,”
Opt. Lett., 36 2396
–2398
(2011). http://dx.doi.org/10.1364/OL.36.002396 OPLEDP 0146-9592 Google Scholar
M. Balu et al.,
“Fiber delivered probe for efficient CARS imaging of tissues,”
Opt. Express, 18 2380
–2388
(2010). http://dx.doi.org/10.1364/OE.18.002380 OPEXFF 1094-4087 Google Scholar
S. Murugkar et al.,
“Miniaturized multimodal CARS microscope based on MEMS scanning and a single laser source,”
Opt. Express, 18 23796
–23804
(2010). http://dx.doi.org/10.1364/OE.18.023796 OPEXFF 1094-4087 Google Scholar
F. Légaré et al.,
“Towards CARS endoscopy,”
Opt. Express, 14 4427
–4432
(2006). http://dx.doi.org/10.1364/OE.14.004427 OPEXFF 1094-4087 Google Scholar
Z. Wang et al.,
“Coherent anti-Stokes Raman scattering microscopy imaging with suppression of four-wave mixing in optical fibers,”
Opt. Express, 19 7960
–7970
(2011). http://dx.doi.org/10.1364/OE.19.007960 OPEXFF 1094-4087 Google Scholar
Z. Wang et al.,
“Delivery of picosecond lasers in multimode fibers for coherent anti-Stokes Raman scattering imaging,”
Opt. Express, 18 13017
–13028
(2010). http://dx.doi.org/10.1364/OE.18.013017 OPEXFF 1094-4087 Google Scholar
S. Weng et al.,
“Dual CARS and SHG image acquisition scheme that combines single central fiber and multimode fiber bundle to collect and differentiate backward and forward generated photons,”
Biomed. Opt. Express, 7 2202
–2218
(2016). http://dx.doi.org/10.1364/BOE.7.002202 BOEICL 2156-7085 Google Scholar
Z. Wang et al.,
“Use of multimode optical fibers for fiber-based coherent anti-Stokes Raman scattering microendoscopy imaging,”
Opt. Lett., 36 2967
–2969
(2011). http://dx.doi.org/10.1364/OL.36.002967 OPLEDP 0146-9592 Google Scholar
X. Chen et al.,
“Multimodal nonlinear endo-microscopy probe design for high resolution, label-free intraoperative imaging,”
Biomed. Opt. Express, 6 2283
–2293
(2015). http://dx.doi.org/10.1364/BOE.6.002283 BOEICL 2156-7085 Google Scholar
X. Xu et al.,
“Multimodal non-linear optical imaging for label-free differentiation of lung cancerous lesions from normal and desmoplastic tissues,”
Biomed. Opt. Express, 4 2855
–2868
(2013). http://dx.doi.org/10.1364/BOE.4.002855 BOEICL 2156-7085 Google Scholar
BiographySheng Weng received his PhD in applied physics from Rice University and his BSc degree in physics from Zhejiang University. He is a data scientist at Amazon. His research interests include translational biophotonics imaging, image processing and recognition, and machine learning. Xiaoyun Xu is a research associate at Houston Methodist Research Institute. She received her PhD in chemistry from the University of Utah. Her research interests include label-free optical imaging for early detection and accurate diagnosis of cancer. Jiasong Li is a postdoctoral fellow at the Houston Methodist Research Institute. He has published more than 30 peer-reviewed journal articles since his PhD study. His research is focused on the developments and applications of optical imaging systems, data analysis, and signal processing. Stephen T. C. Wong is a John S. Dunn presidential distinguished chair of Houston Methodist Research Institute, a chair of the Department of Systems Medicine and Bioengineering at Houston Methodist Research Institute, and a professor at Weill Cornell Medicine. His research applies systems biology, AI, and imaging to solve complex disease problems. |