Visual transformer-based image retrieval with multiple loss fusion
15 August 2023
Huayong Liu, Cong Huang, Hanjun Jin, Xiaosi Fu, Pei Shi
Proceedings Volume 12719, Second International Conference on Electronic Information Technology (EIT 2023); 127191N (2023) https://doi.org/10.1117/12.2685738
Event: Second International Conference on Electronic Information Technology (EIT 2023), 2023, Wuhan, China
Abstract
Deep hashing for image retrieval learns to encode each image as a fixed-length hash code for fast retrieval and matching. However, previous deep hash retrieval models based on convolutional neural networks (CNNs) extract local image information through convolution and pooling, so they need deeper networks to capture long-distance dependencies, which leads to high model complexity and computational cost. In this paper, we propose a visual Transformer model based on self-attention that learns long-range dependencies in images and strengthens image feature extraction. Furthermore, we propose a fused loss function that combines hash contrastive loss, classification loss, and quantization loss, so that image label information is fully used and the quality of the hash codes improves by learning more latent semantic information. Experimental results on two datasets and several hash code lengths show that the proposed method outperforms multiple classical CNN-based deep hash retrieval methods and two Transformer-based hash retrieval methods.
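The abstract names the three loss terms but does not give their formulas or weights. As a rough illustration only, the sketch below fuses one common form of each term for a single pair of images. The function names, the margin-based pairwise contrastive form, and the weights `alpha` and `beta` are all assumptions made for this example and are not taken from the paper.

```python
import math


def quantization_loss(u):
    """Mean squared gap between each continuous output and the nearest of +1/-1."""
    return sum((abs(x) - 1.0) ** 2 for x in u) / len(u)


def classification_loss(logits, label):
    """Softmax cross-entropy for one sample with an integer class label."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def contrastive_loss(u, v, same_class, margin):
    """Assumed pairwise form: pull same-class codes together, push others apart."""
    d2 = sum((a - b) ** 2 for a, b in zip(u, v))  # squared Euclidean distance
    if same_class:
        return 0.5 * d2
    return 0.5 * max(0.0, margin - d2)


def fused_loss(u, v, logits_u, label_u, label_v,
               margin=4.0, alpha=1.0, beta=0.1):
    """Weighted sum of the three terms; alpha and beta are hypothetical weights."""
    same_class = label_u == label_v
    return (contrastive_loss(u, v, same_class, margin)
            + alpha * classification_loss(logits_u, label_u)
            + beta * quantization_loss(u))


# Two 4-bit continuous codes from the same class, with 3-class logits for the first.
u = [0.9, -0.8, 0.7, -0.95]
v = [1.0, -1.0, 0.6, -0.9]
print(fused_loss(u, v, logits_u=[2.0, 0.1, -1.0], label_u=0, label_v=0))
```

In this sketch the quantization term is zero only when every output is exactly +1 or -1, so it penalizes codes that would lose information when binarized with a sign function. The label is used twice, once to decide whether a pair counts as similar and once as the classification target, which is one plausible reading of "fully utilize image label information".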
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Huayong Liu, Cong Huang, Hanjun Jin, Xiaosi Fu, and Pei Shi "Visual transformer-based image retrieval with multiple loss fusion", Proc. SPIE 12719, Second International Conference on Electronic Information Technology (EIT 2023), 127191N (15 August 2023); https://doi.org/10.1117/12.2685738
KEYWORDS
Image retrieval
Transformers
Quantization
Image classification
Visualization
Feature extraction
Image processing