3D reconstruction from a single RGB image is making some progress especially with the advent of deep learning in recent years. In this paper, we focus on 3D reconstruction of indoor scenes taking as input a single image, without point clouds, multi-view images, depth or masks. Our mesh reconstruction is based on an encoder-decoder framework of mesh deformation. Features extracted by the encoder from the input image will be used as supervision for mesh deformation. We propose an inter-point interaction attention in the decoder to exploit the vertexes’ influence on each other. We also use a smooth loss to generate smoother surface for objects. We have evaluated the proposed framework on the Pix3D dataset, and state-of-the-art performance has been achieved with visually appealing 3D geometry.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.