Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion

I. Vasiljevic, V. Guizilini, R. Ambrus, S. Pillai, W. Burgard, G. Shakhnarovich, A. Gaidon

Published in 3DV 2020 (oral) - August 2020

Links: arxiv, video, bibtex

Figure: Overview of the Neural Ray Surfaces (NRS) model.


Self-supervised learning has emerged as a powerful tool for depth and ego-motion estimation, leading to state-of-the-art results on benchmark datasets. However, one significant limitation shared by current methods is the assumption of a known parametric camera model – usually the standard pinhole geometry – leading to failure when applied to imaging systems that deviate significantly from this assumption (e.g., catadioptric cameras or underwater imaging). In this work, we show that self-supervision can be used to learn accurate depth and ego-motion estimation without prior knowledge of the camera model. Inspired by the geometric model of Grossberg and Nayar, we introduce Neural Ray Surfaces (NRS), convolutional networks that represent pixel-wise projection rays, approximating a wide range of cameras. NRS are fully differentiable and can be learned end-to-end from unlabeled raw videos. We demonstrate the use of NRS for self-supervised learning of visual odometry and depth estimation from raw videos obtained using a wide variety of camera systems, including pinhole, fisheye, and catadioptric.
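The key idea above is that, instead of unprojecting pixels through a fixed parametric camera (e.g., a pinhole intrinsics matrix), each pixel is assigned its own learned projection ray, and a 3D point is obtained by scaling that ray by the predicted depth. The following minimal NumPy sketch illustrates this lifting step; it is a conceptual illustration, not the paper's implementation, and the function name `lift_to_3d` is hypothetical. The pinhole model is recovered as the special case where the rays come from the inverse intrinsics.

```python
import numpy as np

def lift_to_3d(depth, rays):
    """Lift per-pixel depths to 3D points along per-pixel ray directions.

    depth: (H, W) array of scalar depths.
    rays:  (H, W, 3) array of per-pixel ray directions, as a generic
           ray surface (here, any array; in NRS, a network output) would
           provide; normalized to unit length before scaling.
    Returns an (H, W, 3) array of 3D points.
    """
    rays = rays / np.linalg.norm(rays, axis=-1, keepdims=True)
    return depth[..., None] * rays

# Pinhole geometry as a special case: rays proportional to K^{-1} [u, v, 1]^T
H, W, f = 4, 6, 2.0
u, v = np.meshgrid(np.arange(W, dtype=float), np.arange(H, dtype=float))
pinhole_rays = np.stack(
    [(u - W / 2) / f, (v - H / 2) / f, np.ones_like(u)], axis=-1
)
points = lift_to_3d(np.full((H, W), 5.0), pinhole_rays)
```

Because the ray array is an unconstrained per-pixel quantity, the same lifting applies unchanged to fisheye or catadioptric geometries, where the rays no longer follow the pinhole pattern; in NRS this array is produced by a convolutional network and trained end-to-end through the differentiable view-synthesis loss.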



@inproceedings{vasiljevic2020neural,
    title={Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion},
    author={Igor Vasiljevic and Vitor Guizilini and Rares Ambrus and Sudeep Pillai
        and Wolfram Burgard and Greg Shakhnarovich and Adrien Gaidon},
    booktitle={International Conference on 3D Vision (3DV)},
    year={2020}
}