Open Access Paper
28 December 2022 Multi-attention prior based residual encoder-decoder network for fast and accurate reconstruction in fluorescence molecular tomography
Author Affiliations +
Proceedings Volume 12506, Third International Conference on Computer Science and Communication Technology (ICCSCT 2022); 125063F (2022) https://doi.org/10.1117/12.2662505
Event: International Conference on Computer Science and Communication Technology (ICCSCT 2022), 2022, Beijing, China
Abstract
Fluorescence molecular tomography (FMT) is a powerful modality for resolving the three-dimensional (3D) distribution of fluorescent targets inside biological tissues. However, the inverse problem of the FMT is severely ill-posed due to the strong scattering effects of photons inside biological tissues. Previously, regularization-based methods have been widely used to mitigate the ill-posedness of FMT. Due to the complex iterative computation and time-consuming reconstruction process, the FMT remains an intractable challenge for achieving accurate and fast 3D reconstructions. In this work, we propose a multi-attention prior based residual encoder-decoder network (MAP-REDN) to perform FMT reconstruction. Firstly, the multi-attention mechanism can provide weighted a priori information to the fluorescence source, enabling MAP-REDN to effectively mitigate the ill-posedness and enhance the reconstruction accuracy. Secondly, since the direct reconstruction strategy is adopted, the complex iterative computation process in the traditional regularization-based algorithms can be avoided, thus tremendously accelerating the reconstruction process. The experimental results demonstrate the feasibility of the MAP-REDN in achieving accurate and fast FMT reconstruction.

1.

INTRODUCTION

Fluorescence molecular imaging (FMI) can label specific molecules or cells with targeted fluorescent molecular probes to detect physiological and pathological changes in vivo at the molecular or cellular levels in advance1,2. However, FMI cannot quantitatively detect concentration changes of fluorescent molecular probes. Therefore, fluorescence molecular tomography (FMT) technology has been further presented. Owing to its high sensitivity and specificity, FMT has great application potential in tumor diagnosis, drug development, and treatment evaluation owing to its high sensitivity and specificity3-5.

However, achieving accurate FMT reconstruction is a challenging task, which comes from the following reasons. Firstly, the simplified photon propagation model is difficult to accurately describe the propagation process of photons in biological tissues, which hinders the improvement of detection accuracy. Secondly, the inverse problem of FMT is severely ill-posed due to the strong scattering of photons in biological tissues. In addition, the inverse problem of FMT is also ill-conditioned because the measured values of the tissue surface are much smaller than the internal nodes, which makes the solution process vulnerable to system noise and model errors, resulting in poor noise immunity and low robustness.

In order to address the severe ill-posedness and ill-conditioned of the inverse problem to a large extent and enhance the reconstruction accuracy, various reconstruction algorithms have been presented in recent decades. On the one hand, feasible region methods6-8 are proposed, i.e., the interior distribution of the fluorescent source is inferred from its distribution area on the surface of the object, thus reducing the number of unknowns in the equation to be solved. However, due to the subjective nature of the selection of feasible regions, this approach does not fully address the ill-posedness of the inverse problem. On the other hand, more constraints are added to introduce a priori knowledge to the reconstruction process. The constraints are usually implemented by introducing a regularization term into the objective function9-13. The FMT reconstruction is converted into an objective function optimization problem with constraints, and the optimal solution of the reconstructed image is obtained through the optimization method. The commonly used penalty term includes the L2 norm9, L1 norm10, L0 norm11, TV12, group sparsity13, etc. These methods are usually effective only under certain assumptions; however, these methods typically involve complex iterative computations and time-consuming reconstruction processes. Therefore, it is crucial to explore more efficient methods or solutions to achieve accurate FMT reconstruction.

Recently, deep learning techniques with powerful generalization capabilities have shown great advantages in FMT reconstruction and have obtained more competitive reconstruction performance than traditional algorithms14-16. Inspired by these works, we propose a novel multi-attention prior based residual encoder-decoder network for fast and accurate FMT reconstruction. First, the multi-attention mechanism can provide weighted a priori information to the fluorescence source, enabling MAP-REDN to effectively mitigate the ill-posedness to enhance the reconstruction accuracy. Second, since the direct reconstruction strategy is adopted, the complex iterative computation process in the traditional regularization-based algorithms can be avoided more effectively, thus greatly accelerating the reconstruction process. Experiments demonstrate that the proposed MAP-REDN method achieves faster and more accurate reconstructions than the cutting-edge methods.

2.

METHOD

2.1

Light propagation model

The light propagation process can be represented by the coupling of diffuse equation (DE) and Robin-type boundary conditions17,18, which is defined by,

00126_PSISDG12506_125063F_page_2_1.jpg

The above-detailed explanation can be found in References13,15. Solving equation (1) with the finite element method (FEM) [19] in the discrete domain yields the following linear relationship,

00126_PSISDG12506_125063F_page_2_2.jpg

where W is the system matrix, y is the boundary fluorescence measurement, and x denotes the distribution of fluorescent sources to be reconstructed in biological tissues.

Therefore, the FMT inverse problem is to reconstruct the fluorescence distribution x in equation (3). However, the inverse problem is ill-posed owing to the strong scattering effect and insufficient measurement data (i.e., y<<x). To alleviate the ill-posedness and obtain good reconstruction results in a robust manner, a regularization term is generally introduced as a constraint term to optimize the inverse problem, which is formulated as equation (3),

00126_PSISDG12506_125063F_page_2_3.jpg

where λ denotes the regularization parameter used to balance the regularity term and data term.

2.2

MAP-REDN for FMT

In this section, we develop an deep learning method to establish a mapping relationship between surface photon measurements (y) and internal fluorescence source distribution (x), as shown in equation (4),

00126_PSISDG12506_125063F_page_2_4.jpg

where f:R → R denotes the deep neural network, y denotes the surface fluorescence measurement signals.

The model structure of the MAP-REDN is shown in Figure 1. The designed multi-attention module is that the feature representations extracted from the network encoder are provided to the decoder module. We define the feature map extracted from the network encoder as 00126_PSISDG12506_125063F_page_2_6.jpg, where D denotes the depth (i.e., the number of FMT projections), C represents the number of feature maps, and H, W represents the height and width of the FMT projections or feature maps, respectively. Feature compression and fusion are calculated as follows,

00126_PSISDG12506_125063F_page_2_5.jpg
00126_PSISDG12506_125063F_page_3_1.jpg

Figure 1.

The model structure of the proposed MAP-REDN.

00126_PSISDG12506_125063F_page_3_3.jpg

where 00126_PSISDG12506_125063F_page_3_7.jpg represents the fused features of Z5 and Z4, 00126_PSISDG12506_125063F_page_3_8.jpg represents the fused features of Z4 and Z3, 00126_PSISDG12506_125063F_page_3_9.jpg represent the corresponding compressed convolutions, 00126_PSISDG12506_125063F_page_3_10.jpg represents the fused convolution of Z5 and Z4, and ⊕ represents the feature concatenation. The final output of the multi-branch attention prior module is calculated as follows:

00126_PSISDG12506_125063F_page_3_2.jpg

where 00126_PSISDG12506_125063F_page_3_11.jpg represents the fused convolution of Z4 and Z3, 00126_PSISDG12506_125063F_page_3_12.jpg represents the fused output convolution.

2.3

Image quality evaluation

In this work, we use position error (PE), normalized mean square error (NMSE) and Dice to quantitatively assess the reconstruction performance of the MAP-REDN method,

00126_PSISDG12506_125063F_page_3_4.jpg
00126_PSISDG12506_125063F_page_3_5.jpg
00126_PSISDG12506_125063F_page_3_6.jpg

here Prec and Ptrue represent the reconstructed tumor centers and ground truth centers, respectively, X and 00126_PSISDG12506_125063F_page_3_13.jpg represent the actual and reconstructed values, respectively, and |x| represents the number of voxels in the region X. The larger the Dice, the better the reconstruction performance, with 1 as the upper limit; while the smaller the PE and NMSE, the better the localization performance, with 0 as the lower limit.

3.

EXPERIMENTS AND RESULTS

To comprehensively assess the reconstruction capacity of MAP-REND in terms of accuracy, numerical simulation experiments were performed. The classical iteration-based L2-regularized, L1-regularized, and learning-based 3D-En-Decoder methods were utilized for comparison. The reconstruction results of the four different methods are shown in Figure 2.

Figure 2.

Geometry configuration of the numerical simulations. (a): The 3D display of the cylindrical phantom; (b): The 2D display of the cylindrical fluorfscent targets with the EED of 5mm.

00126_PSISDG12506_125063F_page_4_1.jpg

Figure 2a displays the configurations of the numerical simulation experiments. A cylinder phantom model was put on a rotating stage, the axis of rotation is defined as the Z axis, and the bottom plane is set to Z=0 cm. A excitation light source is placed at a height of Z=0.75 cm to excite the fluorophore to be reconstructed. The fluorescence projection images were collected at 15° intervals for a full 360° view (i.e., 24 projection data were used for reconstruction). The cylindrical phantom had a height of 1.5 cm and a diameter of 3 cm. Two fluorescent targets (Figure 2a) with a diameter of 3 mm, a height of 5 mm and an edge-to-edge (EED) of 5 mm (Figure 2b) were put inside the cylinder phantom (Figure 2b). Furthermore, a level of 5% Gaussian noise was inserted into the measurement data to better mimic the real experimental situations.

Qualitative analysis is shown in Figures 3a-3d, including a 3D view of the entire imaged phantom and a sliced 2D view of the target localization plane. It can be seen that the L2-regularized method obtains a over-smoothed reconstruction result, which makes the reconstructed fluorescent target much larger than the real target with lower spatial resolution. Second, although the L1-regularized and 3D-En-Decoder methods can approximately reconstruct the tumor region, the two methods face the reconstruction problem of small discontinuous tumors located near the ground truth region. In contrast, the proposed MAP-REDN method achieves high morphological similarity and low position error. Furthermore, a series of evaluation metrics are used to quantitatively evaluate the reconstruction performance. As shown in Figure 3e, compared with the L2-regularized, L1-regularized and 3D-En-Decoder methods, the MAP-REDN has lower PE and NMSE and higher Dice. Therefore, quantitative analysis showed that MAP-REDN performed better in tumor localization and morphological similarity. Therefore, it can be concluded that the multi-attention mechanism is able to provide weighted prior information for the fluorescence sources, alleviating the severe ill-posed nature of the inverse problem and further improving the reconstruction performance.

Figure 3.

The qualitative and quantitative analysis of the four different reconstruction methods. (a)-(d): The white dotted circles represent the true fluorescent sources. The red polygons clustered around the red spheres in the 3D view are assumed to be reconstructed tumors; (e): The quantitative analysis results of the four different reconstruction methods.

00126_PSISDG12506_125063F_page_4_2.jpg

4.

CONCLUSION AND DISCUSSION

In this work, we develop a novel MAP-REDN method to achieve fast and accurate FMT reconstruction. The motivation behind this method is that spatial feature mapping and reorganization can be enhanced by introducing a multi-attention mechanism to alleviate the severe ill-conditionedness and enhance the reconstruction accuracy. Experiments show that compared with the traditional L2-regularized, L1-regularized methods and the deep learning-based 3D-En-Decoder method, the MAP-REDN method has improved reconstruction accuracy, morphological similarity and spatial resolution. This method has great potential to provide a potential solution for FMT and alternative imaging models in pursuit of higher reconstruction accuracy.

ACKNOWLEDGMENTS

This work was partially supported by the National Key Research and Development Program of China (No. 2017YFA0700401), the 111 Project (No. B13003), the National Natural Science Foundation of China (No. 61871022), the Beijing Natural Science Foundation (No. 7202102), the Fundamental Research Funds for Central Universities, and the Academic Excellence Foundation of BUAA for PhD Students.

REFERENCES

[1] 

Ntziachristos, V., “Fluorescence molecular imaging,” Annu. Rev. Biomed. Eng, 8 1 –33 (2006). https://doi.org/10.1146/bioeng.2006.8.issue-1 Google Scholar

[2] 

Graves, E. E., Weissleder, R. and Ntziachristos, V., “Fluorescence molecular imaging of small animal tumor models,” Curr. Mol. Med, 4 (4), 419 –430 (2004). https://doi.org/10.2174/1566524043360555 Google Scholar

[3] 

Rudin, M. and Weissleder, R., “Molecular imaging in drug discovery and development,” Nat Rev Drug Discov, 2 (2), 123 –131 (2003). https://doi.org/10.1038/nrd1007 Google Scholar

[4] 

Ale, A., Ermolayev, V., Herzog, E., et al., “FMT-XCT: In vivo animal studies with hybrid fluorescence molecular tomography—X-ray computed tomography,” Nat. Methods, 9 (6), 615 –620 (2012). https://doi.org/10.1038/nmeth.2014 Google Scholar

[5] 

Ntziachristos, V., Tung, C. H., Bremer, C. and Weissleder, R., “Fluorescence molecular tomography resolves protease activity in vivo,” Nat. Med, 8 (7), 757 –761 (2002). https://doi.org/10.1038/nm729 Google Scholar

[6] 

An, Y., Liu, J., Zhang, G., Ye, J., Du, Y., Mao, Y., Chi, C. and Tian, J., “A novel region reconstruction method for fluorescence molecular tomography,” IEEE. Trans. Biomed. Eng, 62 (7), 1818 –1826 (2015). https://doi.org/10.1109/TBME.2015.2404915 Google Scholar

[7] 

He, X., Wang, X., Zhang, H., Yi, H. and Hou, Y., “Limited-projection fluorescence molecular tomography based on smoothed l0 norm and feasible region,” Chinese Journal of Lasers, 45 (9), 0907001 (2018). https://doi.org/10.3788/CJL Google Scholar

[8] 

Cao, X., Zhang, B., Wang, X., Liu, F., Liu, K., Luo, J. and Bai, J., “An adaptive Tikhonov regularization method for fluorescence molecular tomography,” Med. Biol. Eng. Comput, 51 (8), 849 –858 (2013). https://doi.org/10.1007/s11517-013-1054-5 Google Scholar

[9] 

Xie, W., Deng, Y., Wang, K., Yang, X. and Luo, Q., “Reweighted L1 regularization for restraining artifacts in FMT reconstruction images with limited measurements,” Opt. Lett, 39 (14), 4148 –4151 (2014). https://doi.org/10.1364/OL.39.004148 Google Scholar

[10] 

Guo, H., Yu, J., He, X., Hou, Y., Fang, D. and Zhang, S., “Improved sparse reconstruction for fluorescence molecular tomography with L 1/2 regularization,” Biomed. Opt. Express, 6 (5), 1648 –1664 (2015). https://doi.org/10.1364/BOE.6.001648 Google Scholar

[11] 

Zhao, L., Yang, H., Cong, W., Wang, G. and Intes, X., “L(p) regularization for early gate fluorescence molecular tomography,” Opt. Lett, 39 (14), 4156 –4159 (2014). https://doi.org/10.1364/OL.39.004156 Google Scholar

[12] 

Jiang, S., Liu, J., Zhang, G., An, Y., Meng, H. and Tian, J., “Reconstruction of fluorescence molecular tomography via a fused LASSO method based on group sparsity prior,” IEEE. Trans. Biomed. Eng, 66 (5), 1361 –1371 (2018). https://doi.org/10.1109/TBME.10 Google Scholar

[13] 

Zhang, P., Ma, C., Song, F., Fan, G., Sun, Y., Feng, Y., Ma, X., Liu, F. and Zhang, G., “A review of advances in imaging methodology in fluorescence molecular tomography,” Phys. Med. Bio, 67 (10), (2022). https://doi.org/10.1088/1361-6560/ac5ce7 Google Scholar

[14] 

Guo, L., Liu, F., Cai, C., Liu, J. and Zhang, G., “3D deep encoder-decoder network for fluorescence molecular tomography,” Opt Lett, 44 (8), 1892 –1895 (2019). https://doi.org/10.1364/OL.44.001892 Google Scholar

[15] 

Zhang, P., Fan, G., Xing, T., Song, F. and Zhang, G., “UHR-DeepFMT: Ultra-high spatial resolution reconstruction of fluorescence molecular tomography based on 3D fusion dual-sampling deep neural network,” IEEE. Trans. Med. Imaging, 40 (11), 3217 –3228 (2021). https://doi.org/10.1109/TMI.2021.3071556 Google Scholar

[16] 

Huang, C., Meng, H., Gao, Y., Jiang, S., Wang, K. and Tian, J., “Fast and robust reconstruction method for fluorescence molecular tomography based on deep neural network,” in Proc. SPIE 10881, Imaging, Manipulation, and Analysis of Biomolecules, Cells, and Tissues XVII, (2019). https://doi.org/10.1117/12.2508468 Google Scholar

[17] 

Soubret, A., Ripoll, J. and Ntziachristos, V., “Accuracy of fluorescent tomography in the presence of heterogeneities: Study of the normalized Born ratio,” IEEE. Trans. Med. Imaging, 24 (10), 1377 –1386 (2005). https://doi.org/10.1109/TMI.2005.857213 Google Scholar

[18] 

Arridge, S. R., “Optical tomography in medical imaging,” Inverse. Probl, 15 (2), 41 –93 (1999). https://doi.org/10.1088/0266-5611/15/2/022 Google Scholar

[19] 

Zhang, G., Pu, H., He, W., Liu, F., Luo, J. and Bai, J., “Bayesian framework based direct reconstruction of fluorescence parametric images,” IEEE. Trans. Med. Imaging, 34 (6), 1378 –1391 (2015). https://doi.org/10.1109/TMI.42 Google Scholar
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Peng Zhang, Fan Song, Chenbin Ma, Zeyu Liu, and Guanglei Zhang "Multi-attention prior based residual encoder-decoder network for fast and accurate reconstruction in fluorescence molecular tomography", Proc. SPIE 12506, Third International Conference on Computer Science and Communication Technology (ICCSCT 2022), 125063F (28 December 2022); https://doi.org/10.1117/12.2662505
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Reconstruction algorithms

Inverse problems

Fluorescence tomography

Tissues

Tomography

Tumors

Photons

Back to Top