9 September 2023 Motion information supplement for joint detection and embedding tracking
Ensen Mo, Jun Kong, Min Jiang, Tianshan Liu
Abstract

Most existing multi-object tracking models focus only on the spatial information extracted from image-level input and ignore the necessary temporal information. The temporal motion information between consecutive frames effectively reflects the target’s motion status, which is essential for improving the model's handling of occlusion and motion blur. We propose MISTracker, which supplements the original tracking model with motion information. Specifically, we divide the multi-scale feature maps into two categories, one treated from the perspective of spatial information and the other from the perspective of channel information. A spatial-level frame differences processing (SFDP) module and a channel-level frame differences processing (CFDP) module are then proposed to process the differences between consecutive frames for the respective categories. The SFDP module processes the differences from the perspective of spatial information and supplements motion information by perceiving pixel-level changes in the feature maps. The CFDP module processes the differences from the perspective of channels and enhances motion-sensitive channels using the overall pixel differences of each channel. Finally, the temporal and motion information complement each other after upsampling fusion. The whole process is realized with simple convolutions, which keeps the computational cost as low as possible while improving the tracking performance of the model.
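
The distinction between spatial-level and channel-level processing of frame differences can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract states that the modules are built from convolutions, whereas the sketch below substitutes plain averaging followed by a sigmoid, and the function names (`frame_difference`, `spatial_weights`, `channel_weights`) and the toy feature maps are hypothetical. It only shows how one difference tensor can yield either a per-pixel weight map or a per-channel weight vector.

```python
from math import exp


def sigmoid(x):
    """Squash a non-negative difference score into the range [0.5, 1)."""
    return 1.0 / (1.0 + exp(-x))


def frame_difference(prev, curr):
    """Element-wise difference of two feature maps laid out as [channel][row][col]."""
    return [
        [[c - p for p, c in zip(p_row, c_row)] for p_row, c_row in zip(p_ch, c_ch)]
        for p_ch, c_ch in zip(prev, curr)
    ]


def spatial_weights(diff):
    """Assumed spatial-level view: one weight per pixel, taken from the
    mean absolute difference across channels at that pixel."""
    n_ch = len(diff)
    height, width = len(diff[0]), len(diff[0][0])
    return [
        [sigmoid(sum(abs(diff[c][y][x]) for c in range(n_ch)) / n_ch)
         for x in range(width)]
        for y in range(height)
    ]


def channel_weights(diff):
    """Assumed channel-level view: one weight per channel, taken from the
    mean absolute difference over all pixels of that channel."""
    weights = []
    for ch in diff:
        n_pixels = len(ch) * len(ch[0])
        weights.append(sigmoid(sum(abs(v) for row in ch for v in row) / n_pixels))
    return weights


# Toy feature maps with 2 channels of size 2x2.
# Only channel 0, pixel (0, 0) changes between the two frames.
prev = [[[0.0, 0.0], [0.0, 0.0]], [[1.0, 1.0], [1.0, 1.0]]]
curr = [[[4.0, 0.0], [0.0, 0.0]], [[1.0, 1.0], [1.0, 1.0]]]

diff = frame_difference(prev, curr)
s_w = spatial_weights(diff)   # largest weight at pixel (0, 0)
c_w = channel_weights(diff)   # larger weight for channel 0 than channel 1
```

In this toy case the pixel that changed receives the largest spatial weight and the channel that changed receives the larger channel weight, while unchanged positions and channels stay at the neutral value 0.5. How such weights are produced and fused with the original features in MISTracker is described in the paper itself, not here.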

© 2023 SPIE and IS&T
Ensen Mo, Jun Kong, Min Jiang, and Tianshan Liu "Motion information supplement for joint detection and embedding tracking," Journal of Electronic Imaging 32(5), 053007 (9 September 2023). https://doi.org/10.1117/1.JEI.32.5.053007
Received: 27 February 2023; Accepted: 29 August 2023; Published: 9 September 2023
KEYWORDS
Motion models

Education and training

Target detection

Data modeling

Performance modeling

Feature extraction

Convolution
