Estimating a scene's depth from image pairs is a problem for many computer vision applications such as autonomous vehicles and 3D object reconstruction. However, traditional methods propose dissimilarity functions that do not solve ambiguity problems. We present a combined siamese convolutional neural network (CSCNN) approach to calculate the costs of disparities in stereo images using patches with different sizes. This approach can learn the patches context, improving the accuracy of the estimated costs. We apply the semi-global matching method and the median filter to increase further the robustness of matching costs. We trained and evaluated our approach using the Middlebury database, and, through the bad pixel evaluation, we demonstrate that our approach achieved an accuracy improvement of approximately 22%, compared to the results obtained by single-window CNN-based approaches.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.