Poster + Paper
22 November 2024 High-fidelity single-pixel imaging using multihead attention mechanism
Author Affiliations +
Conference Poster
Abstract
Single-pixel imaging has gained prominence for its wide working wavelength and high sensitivity. Deep learning-based single-pixel imaging shows superiority in real-time reconstruction, particularly with limited resources. In this work, we report a novel encoder-decoder method for single-pixel imaging, which aims at enhancing imaging quality from extremely low measurement amounts. First, we encode the high-dimensional target information into one-dimensional measurements using globally optimized modulation patterns, implemented by a fully connected or convolutional layer. Second, we integrate a U-Net neural network with an advanced multi-head self-attention mechanism and a pyramid pooling module to decode the measurements and reconstruct high-fidelity images. Under such a strategy, the skip connections within the U-Net structure enhance the preservation of fine image features, and the incorporation of the multi-head self-attention mechanism and pyramid pooling module effectively captures contextual dependencies among low-dimensional measurements, thereby extracting significant image features and enhancing reconstruction quality. The simulation results conducted on the STL-10 dataset validate the efficiency of the reported technique. With a resolution of 96 × 96 pixels and an ultra-low sampling rate of 1%, we consistently achieved the highest image fidelity compared to traditional single-pixel reconstruction methods for both grayscale and color images.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Hui Lu, Xinrui Zhan, and Liheng Bian "High-fidelity single-pixel imaging using multihead attention mechanism", Proc. SPIE 13239, Optoelectronic Imaging and Multimedia Technology XI, 1323911 (22 November 2024); https://doi.org/10.1117/12.3036102
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image restoration

Sampling rates

Modulation

Feature extraction

Head

Network architectures

Education and training

Back to Top