Mini-transformer with pooling for unsupervised domain adaptation person reidentification

Lei Ma; Min Jiang; Jun Kong

doi:10.1117/1.JEI.31.5.053027

6 October 2022 Mini-transformer with pooling for unsupervised domain adaptation person reidentification

Lei Ma, Min Jiang, Jun Kong

Author Affiliations +

Journal of Electronic Imaging, Vol. 31, Issue 5, 053027 (October 2022). https://doi.org/10.1117/1.JEI.31.5.053027

Abstract

Person reidentification (Re-ID) aims to match specific pedestrians across nonoverlapping camera views. Due to the dramatic disparities between different datasets, transferring a Re-ID model trained on the source domain to the target domain is challenging. Some outstanding unsupervised domain adaptation (UDA) Re-ID methods use clustering to generate pseudolabels, optimizing the model on the target domain, but the pseudolabels are inevitably noisy. To address the above issues, we propose a framework named mini-transformer with pooling (MTP) to facilitate the generation of superior quality pseudolabels by improving the model’s feature representation capability. First, we introduce an effective mini-transformer (MT) that can be placed directly behind the CNNs as a feature extractor to capture long-range dependency. Then, we design two delicate pooling methods named global hybrid pooling (GHP) and global subvalue pooling (GSVP) to suit mini-transformer’s tremendous capability without increasing computational complexity. Specifically, GHP can keep more global information and GSVP can keep more discriminative information. Finally, experiments on four mainstream UDA Re-ID tasks demonstrate that MTP achieves competitive mAP and rank-1 accuracy to the current state-of-the-art methods, suggesting that our technique is simple but effective. In addition to UDA Re-ID, our MTP can be extended to other supervised retrieval tasks.

Citation Download Citation

Lei Ma, Min Jiang, and Jun Kong "Mini-transformer with pooling for unsupervised domain adaptation person reidentification," Journal of Electronic Imaging 31(5), 053027 (6 October 2022). https://doi.org/10.1117/1.JEI.31.5.053027

Received: 19 May 2022; Accepted: 21 September 2022; Published: 6 October 2022

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $24.00

Non-members: $28.00 ADD TO CART

JOURNAL ARTICLE
19 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Transformers

Performance modeling

Data modeling

Cameras

Head

Visual process modeling

Visualization

Show All Keywords

Keywords/Phrases

Search In:

Publication Years