28 September 2022 Dynamic memory network with spatial-temporal feature fusion for visual tracking
Hongchao Zhang, Hua Bao, Yixiang Lu, Dexiang Zhang, Lina Xun
Author Affiliations +
Abstract

Recently, single object tracking has demonstrated great success. However, due to various problems caused by fast motion, occlusion, and deformation, it is still intractable for traditional trackers to adapt to changes in object appearance. In this work, a dynamic memory network is proposed for visual tracking to handle the template matching process globally. More specifically, we first build a memory model, which consists of a memory feature fusion module and a memory bank. By the memory model, the network not only accepts the first frame as initial information but also memorizes the selective frames in the video sequence to provide rich time-domain information. Second, an innovative sampling strategy is adopted in the tracking process. By updating the template and guiding the selection of memory frames, our model can output higher-quality features. In addition, a spatial-channel fused attention module that effectively improves the representational capability and discriminability of the model is introduced. Our proposed method obtains compelling results on six challenging tracking benchmarks, including the OTB100, VOT2019, UAV123, NFS, GOT-10k, and LaSOT datasets. Extensive experiments demonstrate that our approach shows satisfactory robustness and leading application potential in real-time speed.

© 2022 SPIE and IS&T
Hongchao Zhang, Hua Bao, Yixiang Lu, Dexiang Zhang, and Lina Xun "Dynamic memory network with spatial-temporal feature fusion for visual tracking," Journal of Electronic Imaging 31(5), 053023 (28 September 2022). https://doi.org/10.1117/1.JEI.31.5.053023
Received: 13 March 2022; Accepted: 7 September 2022; Published: 28 September 2022
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical tracking

Video

Code division multiplexing

Convolution

Network architectures

Chemical species

Visualization

RELATED CONTENT

Comparison on long-term tracking: methods and benchmarks
Proceedings of SPIE (October 19 2022)

Back to Top