Publication details for Professor Toby BreckonAtapour-Abarghouei, A. & Breckon, T.P. (2018), Real-Time Monocular Depth Estimation using Synthetic Data with Domain Adaptation via Image Style Transfer, 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Salt Lake City, Utah, IEEE, Piscataway, NJ, 2800-2810.
- Publication type: Conference Paper
- ISSN/ISBN: 2575-7075, 9781538664209
- DOI: 10.1109/CVPR.2018.00296
- Keywords: monocular depth, generative adversarial network, GAN, depth map, disparity, depth from single image
- Further publication details on publisher web site
- Durham Research Online (DRO) - may include full text
Author(s) from Durham
Monocular depth estimation using learning-based approaches has become promising in recent years. However, most monocular depth estimators either need to rely on large quantities of ground truth depth data, which is extremely expensive and difficult to obtain, or predict disparity as an intermediary step using a secondary supervisory signal leading to blurring and other artefacts. Training a depth estimation model using pixel-perfect synthetic data can resolve most of these issues but introduces the problem of domain bias. This is the inability to apply a model trained on synthetic data to real-world scenarios. With advances in image style transfer and its connections with domain adaptation (Maximum Mean Discrepancy), we take advantage of style transfer and adversarial training to predict pixel perfect depth from a single real-world color image based on training over a large corpus of synthetic environment data. Experimental results indicate the efficacy of our approach compared to contemporary state-of-the-art techniques.