We study the notion of consistency between a 3D shape and a 2D observation and propose a differentiable formulation which allows computing gradients of the 3D shape given an observation from an arbitrary view. We do so by reformulating view consistency using a differentiable ray consistency (DRC) term. We show that this formulation can be incorporated in a learning framework to leverage different types of multi-view observations e.g. foreground masks, depth, color images, semantics etc. as supervision for learning single-view 3D prediction. We present empirical analysis of our technique in a controlled setting. We also show that this approach allows us to improve over existing techniques for single-view reconstruction of objects from the PASCAL VOC dataset.
Submitted 20 Apr 2017 to Computer Vision and Pattern Recognition
Published 21 Apr 2017
Author comments: To appear at CVPR 2017. Project webpage : https://shubhtuls.github.io/drc/http://arxiv.org/abs/1704.06254http://arxiv.org/pdf/1704.06254.pdf