Presentation + Paper
14 June 2023 Fusion of deep-learning models for multi-view image classification
Brian Maguire, Eleanor Seminerio
Abstract
Advances in convolutional neural networks (CNNs) have delivered state-of-the-art performance in image classification and object detection over the past decade. While significant progress has been made in developing more efficient networks, their computational and memory requirements still exceed practical limits in many applications. Additionally, the pose variability in such applications requires even larger training datasets for the network to generalize to all possible scenarios. The goal of this work is to develop an architecture that fuses multiple views of a single target to provide robust classification, with a lightweight backbone network used across all agents. Motivated by approaches to ensemble learning, we demonstrate that multiple weak learners built on computationally efficient networks can be combined to enhance classification accuracy. Three methods of fusion are considered: decision fusion, feature fusion, and multi-scale feature fusion. A novel network architecture is developed and implemented for each approach, then trained and evaluated using synthetic data. For the feature fusion models, a custom training scheme is developed to minimize classification error while maintaining a common feature extraction backbone across agents. This conforms to a distributed classification use case in which each agent has no prior knowledge of its position relative to the target. Finally, we discuss the shared-data requirements of each approach in the context of applications with limited communication bandwidth.
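To make the distinction between decision fusion and feature fusion concrete, the following is a minimal sketch and not the authors' architecture. The abstract does not state the fusion rules, so everything here is an assumption for illustration: the one-layer `backbone`, the weight matrices `w_feat` and `w_cls`, averaging of class probabilities for decision fusion, and mean pooling of feature vectors for feature fusion. Mean pooling is chosen only because it is independent of agent order, which fits the stated case where agents do not know their position relative to the target. Multi-scale feature fusion is not sketched.

```python
import numpy as np

def softmax(z, axis=-1):
    """Numerically stable softmax."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def backbone(image, w_feat):
    """Hypothetical shared feature extractor: one linear layer plus ReLU.
    Stands in for the lightweight CNN backbone used by every agent."""
    return np.maximum(image.reshape(-1) @ w_feat, 0.0)

def decision_fusion(views, w_feat, w_cls):
    """Each agent classifies on its own; only class probabilities are shared.
    Assumption: the fused decision is the mean of per-agent probabilities."""
    probs = np.stack([softmax(backbone(v, w_feat) @ w_cls) for v in views])
    return probs.mean(axis=0)

def feature_fusion(views, w_feat, w_cls):
    """Agents share feature vectors; one classifier sees the pooled features.
    Assumption: features are pooled by an order-independent mean."""
    feats = np.stack([backbone(v, w_feat) for v in views])
    return softmax(feats.mean(axis=0) @ w_cls)

# Toy data: 3 agents, each with an 8x8 view of the same target (all made up).
rng = np.random.default_rng(0)
n_views, n_pixels, n_feat, n_classes = 3, 64, 16, 5
w_feat = rng.normal(size=(n_pixels, n_feat)) * 0.1
w_cls = rng.normal(size=(n_feat, n_classes)) * 0.1
views = rng.normal(size=(n_views, 8, 8))

p_dec = decision_fusion(views, w_feat, w_cls)
p_feat = feature_fusion(views, w_feat, w_cls)

# Values each agent must transmit per image under each scheme (toy sizes).
shared_values = {"decision": n_classes, "feature": n_feat}
```

In this toy setting, decision fusion transmits one value per class from each agent, while feature fusion transmits the full feature vector, which illustrates the kind of bandwidth trade-off the abstract refers to. The sizes used are arbitrary and say nothing about the figures in the paper.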
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Brian Maguire and Eleanor Seminerio "Fusion of deep-learning models for multi-view image classification", Proc. SPIE 12547, Signal Processing, Sensor/Information Fusion, and Target Recognition XXXII, 125470E (14 June 2023); https://doi.org/10.1117/12.2665000
KEYWORDS
Education and training
Feature fusion
Image fusion
Image classification
Network architectures
Feature extraction
Object detection