Abstract
In the past few years, depth estimation from a single image has received increased attentions due to its wide applicability in image and video understanding. For realizing these tasks, many approaches have been developed for estimating depth from a single image based on various depth cues such as shading, motion, etc. However, they failed to estimate plausible depth map when input color image is derived from different category in training images. To alleviate these problems, data-driven approaches have been popularly developed by leveraging the discriminative power of a large scale RGB-D database. These approaches assume that there exists appearance- depth correlation in natural scenes. However, this assumption is likely to be ambiguous when local image regions have similar appearance but different geometric placement within the scene. Recently, a depth analogy (DA) has been developed by using the correlation between color image and depth gradient. DA addresses depth ambiguity problem effectively and shows reliable performance. However, no experiments are conducted to investigate the relationship between database scale and the quality of the estimated depth map. In this paper, we extensively examine the effects of database scale and quality on the performance of DA method. In order to compare the quality of DA, we collect a large scale RGB-D database using Microsoft Kinect v1 and Kinect v2 on indoor and ZED stereo camera on outdoor environments. Since the depth map obtained by Kinect v2 has high quality compared to that of Kinect v1, the depth maps from the database from Kinect v2 are more reliable. It represents that the high quality and large scale RGB-D database guarantees the high quality of the depth estimation. The experimental results show that the high quality and large scale training database leads high quality estimated depth map in both indoor and outdoor scenes.
Original language | English |
---|---|
Title of host publication | Three-Dimensional Imaging, Visualization, and Display 2016 |
Editors | Bahram Javidi, Jung-Young Son |
Publisher | SPIE |
ISBN (Electronic) | 9781510601086 |
DOIs | |
Publication status | Published - 2016 |
Event | Three-Dimensional Imaging, Visualization, and Display 2016 Conference - Baltimore, United States Duration: 2016 Apr 18 → 2016 Apr 20 |
Publication series
Name | Proceedings of SPIE - The International Society for Optical Engineering |
---|---|
Volume | 9867 |
ISSN (Print) | 0277-786X |
ISSN (Electronic) | 1996-756X |
Other
Other | Three-Dimensional Imaging, Visualization, and Display 2016 Conference |
---|---|
Country/Territory | United States |
City | Baltimore |
Period | 16/4/18 → 16/4/20 |
Bibliographical note
Publisher Copyright:© 2016 SPIE.
All Science Journal Classification (ASJC) codes
- Electronic, Optical and Magnetic Materials
- Condensed Matter Physics
- Computer Science Applications
- Applied Mathematics
- Electrical and Electronic Engineering