9 September 2024 SMLoc: spatial multilayer perception-guided camera localization
Jingyuan Feng, Shengsheng Wang, Haonan Sun
Author Affiliations +
Abstract

Camera localization is a technique for obtaining the camera’s six degrees of freedom using the camera as a sensor input. It is widely used in augmented reality, autonomous driving, virtual reality, etc. In recent years, with the development of deep-learning technology, absolute pose regression has gained wide attention as an end-to-end learning-based localization method. The typical architecture is constructed by a convolutional backbone and a multilayer perception (MLP) regression header composed of multiple fully connected layers. Typically, the two-dimensional feature maps extracted by the convolutional backbone have to be flattened and passed into the fully connected layer for pose regression. However, this operation will result in the loss of crucial pixel position information carried by the two-dimensional feature map and adversely affect the accuracy of the pose estimation. We propose a parallel structure, termed SMLoc, using a spatial MLP to aggregate position and orientation information from feature maps, respectively, reducing the loss of pixel position information. Our approach achieves superior performance on common indoor and outdoor datasets.

© 2024 SPIE and IS&T
Jingyuan Feng, Shengsheng Wang, and Haonan Sun "SMLoc: spatial multilayer perception-guided camera localization," Journal of Electronic Imaging 33(5), 053013 (9 September 2024). https://doi.org/10.1117/1.JEI.33.5.053013
Received: 30 April 2024; Accepted: 8 August 2024; Published: 9 September 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Cameras

3D modeling

Sun

Education and training

Feature extraction

Head

Ablation

Back to Top