Paper
12 October 2022 LSTM-SENet: category decoding model with an attention mechanism
Penghui Ding, Chi Zhang, Linyuan Wang, Guoen Hu, Bin Yan, Li Tong
Author Affiliations +
Proceedings Volume 12342, Fourteenth International Conference on Digital Image Processing (ICDIP 2022); 123422M (2022) https://doi.org/10.1117/12.2644595
Event: Fourteenth International Conference on Digital Image Processing (ICDIP 2022), 2022, Wuhan, China
Abstract
With the development of deep neural networks (DNN), building visual decoding models based on functional magnetic resonance imaging (fMRI) to simulate the visual system of the human brain and studying visual mechanisms have become a research hotspot. Although existing visual decoding models built using DNNs have achieved a certain accuracy, most models ignore the differences between different voxels. Among them, the BRNN-based category decoding model uses the bidirectional long short term memory (LSTM) network to simulate the visual bidirectional information flow, which improves the decoding accuracy, but it uses the voxels of each brain area as an overall input model. Therefore, we embed the channel attention module, the Squeeze-and-Excitation Networks (SENet), into the LSTM network to construct an LSTM-SENet vision that introduces an attention mechanism The decoding model allows the model to learn by itself and assign different weights to each voxel, focusing on important voxels, thereby improving the classification accuracy of natural images. The experimental results show that our method improves the accuracy of (three-level) category decoding than other methods, and the results further verify the effectiveness of building a visual decoding model based on the visual mechanism.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Penghui Ding, Chi Zhang, Linyuan Wang, Guoen Hu, Bin Yan, and Li Tong "LSTM-SENet: category decoding model with an attention mechanism", Proc. SPIE 12342, Fourteenth International Conference on Digital Image Processing (ICDIP 2022), 123422M (12 October 2022); https://doi.org/10.1117/12.2644595
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Brain

Information visualization

Functional magnetic resonance imaging

Performance modeling

Neural networks

Visual cortex

RELATED CONTENT

Target image search using fMRI signals
Proceedings of SPIE (March 13 2014)
Is the hMT+ V5 complex in the human brain involved...
Proceedings of SPIE (June 07 2004)
Visual neural dynamics
Proceedings of SPIE (June 30 1994)

Back to Top