EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection

Xiaobao Yang; Yulong He; Junsheng Wu; Wentao Wang; Wei Sun; Sugang Ma; Zhiqiang Hou

doi:10.1117/1.JEI.33.2.023013

Journal of Electronic Imaging, Vol. 33, Issue 2, 023013 (March 2024). https://doi.org/10.1117/1.JEI.33.2.023013

Abstract

Object detection is a fundamental and long-standing challenge in computer vision. Current object detection models, however, pay insufficient attention to salient features when fusing the lateral connections and top-down information flows in feature pyramid networks (FPNs). To address this, we propose an object detection method based on an enhanced bi-directional attention feature pyramid network (EBiDA-FPN), which aims to strengthen the feature representation capability of the lateral connections and top-down links in FPN. The method adopts the triplet module to attend to salient features of the original multi-scale information in the spatial and channel dimensions, establishing an enhanced triplet attention. In addition, it introduces improved top and down attention to fuse contextual information using the correlation of features between adjacent scales. Furthermore, adaptively spatial feature fusion and self-attention are introduced to expand the receptive field and improve the detection performance of deep levels. Extensive experiments on the PASCAL VOC, MS COCO, KITTI, and CrowdHuman datasets demonstrate that our method achieves performance gains of 1.8%, 0.8%, 0.5%, and 0.2%, respectively. These results indicate that the method is effective and competitive with advanced detectors.
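The abstract names adaptively spatial feature fusion as one ingredient but gives no implementation detail. As a rough illustration of the general idea only (per-pixel softmax weights that blend feature maps from different pyramid levels), the following NumPy sketch may help. It is not the authors' code: the function names, the nearest-neighbor upsampling, the random inputs, and the use of a single weight vector per level as a stand-in for a learned 1x1 convolution are all assumptions made for this example.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def upsample_nearest(feat, factor):
    # feat has shape (C, H, W); repeat rows and columns to enlarge it.
    return feat.repeat(factor, axis=1).repeat(factor, axis=2)

def adaptive_spatial_fusion(feats, weight_vectors):
    # feats: list of (C, H, W) maps already resized to one resolution.
    # weight_vectors: one (C,) vector per level, a hypothetical stand-in
    # for a learned 1x1 convolution that scores each spatial location.
    logits = np.stack([
        np.tensordot(w, f, axes=([0], [0]))  # (H, W) score map per level
        for w, f in zip(weight_vectors, feats)
    ])
    alphas = softmax(logits, axis=0)  # weights sum to 1 across levels
    fused = sum(a[None] * f for a, f in zip(alphas, feats))
    return fused, alphas

# Toy example: fuse a fine level with an upsampled coarse level.
rng = np.random.default_rng(0)
p3 = rng.standard_normal((8, 4, 4))
p4 = upsample_nearest(rng.standard_normal((8, 2, 2)), 2)
weights = [rng.standard_normal(8) for _ in range(2)]
fused, alphas = adaptive_spatial_fusion([p3, p4], weights)
```

In this toy setup every spatial location gets its own mixing weights, so one level can dominate in some regions and the other elsewhere. How the paper combines this step with its attention modules is not stated in the abstract.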

Citation

Xiaobao Yang, Yulong He, Junsheng Wu, Wentao Wang, Wei Sun, Sugang Ma, and Zhiqiang Hou "EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection," Journal of Electronic Imaging 33(2), 023013 (8 March 2024). https://doi.org/10.1117/1.JEI.33.2.023013

Received: 30 September 2023; Accepted: 20 February 2024; Published: 8 March 2024

JOURNAL ARTICLE
16 PAGES

KEYWORDS

Object detection

Sensors

Feature fusion

Education and training

Semantics

Feature extraction

Performance modeling
