I am the head of Machine Learning at the Toyota Research Institute (TRI) and an Adjunct Professor at Stanford, CA, USA. My work focuses on Principle-centric ML (combining knowledge and data) for Embodied Intelligence (physical ML systems), spanning Scene and Behavior Understanding, Simulation for Deep Learning, 3D Computer Vision, and Self-Supervised Learning (cf. a short bio and my CV for more info). Outside of work, I spend time with my wife, daughter, and the mountains (I love camping, climbing, and snowboarding).
- July 2023: Robust Self-Supervised Extrinsic Self-Calibration accepted at IROS’23, and 3 papers accepted at ICCV’23: ZeroDepth (scale-aware monocular depth that generalizes zero-shot across domains, including indoors and outdoors!), DeLiRa (self-supervised depth, light, and radiance fields), and NeO 360 (outdoor NeRFs from sparse views).
- June 2023: we received the best paper award at L4DC’23 for iDBF: Self-Supervised Policy Filters that Avoid Out-of-Distribution States! Congrats Fernando and Haruki!
- May 2023: 3 papers published at CVPR’23: a new 3D detector using view-point equivariance (VEDet), a new video object segmentation benchmark (VOST), a new video model for object discovery from motion-guided tokens (MoTok), and more from the team as mentioned in our TRI @ CVPR 2023 blog.
- January 2023: Neural Groundplans published at ICLR’23, Depth Is All You Need for 3D Detection at ICRA’23, and Active Sampling to reduce Causal Confusion at the NeurIPS’23 CML4Impact workshop.
- September 2022: 3 papers accepted at CoRL’22: ROAD: Learning an Implicit Recursive Octree Auto-Decoder to Efficiently Encode 3D Shapes, Representation Learning for Object Detection from Unlabeled Point Cloud Sequences, and RAP: Risk-Aware Prediction for Robust Planning (oral).
- July 2022: 4 papers accepted at ECCV’22: Depth Field Networks (DeFiNe), implicit representations of shape and appearance (ShAPO), Photo-realistic Neural Domain Randomization (PNDR), Stochastic Sequential Pointcloud Forecasting (S2Net).
- June 2022: I am proposing a new paradigm for Embodied Intelligence: Principle-centric AI. See an intro in my TWiMLAI podcast with Sam Charrington and a technical presentation at ICRA’22.
- June 2022: one paper accepted at IROS’22 on uncertainty modeling for trajectory forecasting (HAICU)
- May 2022: new paper accepted at ICML’22: Object Permanence Emerges in a Random Walk along Memory.
- March 2022: 3 papers accepted at CVPR’22: Revisiting the “Video” in Video-Language Understanding (oral), Multi-Frame Self-Supervised Depth with Transformers, Discovering Objects that Can Move.
- February 2022: 4 papers accepted at ICRA’22: Full Surround Monodepth from Multiple Cameras (FSM), Self-Supervised Camera Self-Calibration from Video, Learning Optical Flow, Depth, and Scene Flow without Real-World Labels, and Control-Aware Prediction Objectives for Autonomous Driving.
- January 2022: 2 papers accepted as spotlights at ICLR’22: Self-supervised Learning is More Robust to Dataset Imbalance and Dynamics-Aware Comparison of Learned Reward Functions (DARD).
- Fall 2021: co-teaching CS131: Computer Vision: Foundations and Applications at Stanford with Juan Carlos Niebles.
- October 2021: presenting our papers at ICCV’21, including work on Self-supervised sim-to-real transfer (GUDA), Learning to Track with Object Permanence (PermaTrack), self-supervised pre-training for monocular 3D object detection (DD3D), and video auto-labeling (Warp-Refine Propagation). I also gave 3 workshop talks on 3D detection at 3DODI, scene understanding at ROAD, and trajectory forecasting at BTFM. See our medium post for an overview.
- September 2021: one oral at NeurIPS’21 (top 1% of ~10k submissions!) on Provable Guarantees for Self-Supervised Learning and one paper accepted at CoRL’21 on Single Shot Scene Reconstruction.
- August 2021: my team is growing! We are hiring 4 new Research Scientists in areas including Perception, Reconstruction / Inverse Graphics, Computer Vision Safety, and RL Safety. Please apply if you are interested in joining a talented team on a mission to discover the learning principles for safe robot autonomy and human amplification at scale!
- July 2021: I gave a talk on Bridging the Perception-Control gap with Prediction at RSS’21.
- July 2021: I did a fun interview with the founder & CEO of Parallel Domain, Kevin McNamara, on how we use Synthetic Data to advance the state of the art in Computer Vision. PD also made a cool blog covering our recent PermaTrack work.
- June 2021: we are organizing the CVPR’21 DDAD depth estimation challenge! Try your ideas on public data from our TRI fleet! Winners will win prizes and present, along with prestigious invited speakers, at our Mono3D CVPR’21 workshop on the Frontiers of Monocular 3D Perception.
- May 2021: got an outstanding reviewer award at CVPR!
- May 2021: together with Vitor and Rares, we wrote a blog post summarizing a lot of our research results in self-supervised learning for depth estimation.
- May 2021: I did a fun interview about ML for Autonomous Driving with Lukas Biewald for the Gradient Dissent podcast. Check it out!
- April 2021: new preprints on Full Surround Monodepth (FSM) and trajectory forecasting (HAICU).
- March 2021: 2 multi-task learning papers accepted at CVPR’21, one on joint depth prediction and completion (PackNet-SAN) and another on Hierarchical Lovász Embeddings for panoptic segmentation.
- March 2021: Blake Wulfe and I won the NeurIPS 2020 ProcGen RL Competition! Find the full report here.
- February 2021: 1 paper at RoboSoft in collaboration with the brilliant TRI Robotics team on monocular depth estimation inside visuotactile sensors.
- January 2021: 1 paper on distributionally robust control accepted at RA-L/ICRA and another at ICLR on regularization for heteroskedastic and imbalanced Deep Learning. Both on robustness and with great Stanford collaborators from Mac Schwager and Tengyu Ma’s labs.
- November 2020: 1 paper (oral) at 3DV on Neural Ray Surfaces in collaboration with Greg Shakhnarovich at TTIC.
- October 2020: got a “top 10% reviewer award” at NeurIPS!
- October 2020: 2 papers accepted at CoRL 2020, including one oral on self-supervised 3D keypoints and a poster on interpretable trajectory forecasting in collaboration with Marco Pavone’s lab at Stanford
- October 2020: I gave an invited talk at IPAM covering a lot of our recent results across the full AV stack.
- August 2020: we are organizing the ECCV 2020 workshop on Perception for Autonomous Driving (PAD)
- July 2020: we are organizing the ICML 2020 workshop on AI for Autonomous Driving (AIAD)
- July 2020: 2 papers accepted at ECCV 2020, including an oral on trajectory prediction and a poster on differentiable rendering
- July 2020: 5 papers accepted at IROS 2020 and 1 at ITSC on scene flow, imitation learning, game-theoretic planning, risk sensitive control, traffic simulation, and planner testing.
- July 2020: 1 paper with Stanford on hierarchical RL and imitation in near-accidents accepted at RSS 2020
- June 2020: Together with colleagues from PFN and TRI-AD, we have released a survey on Differentiable Rendering.
- June 2020: 4 papers (3 orals!) accepted at CVPR 2020 (PackNet pseudo-lidar, real-time panoptic segmentation, auto-labeling via Differentiable Rendering, spatio-temporal graph distillation). See also our new DDAD dataset for depth estimation!
- February 2020: 1 paper with Stanford on Pedestrian Intent Prediction accepted at RA-L/ICRA 2020. See also our new STIP dataset!
- January 2020: I got promoted to Senior Manager! Super grateful to my team and excited for our next steps together!
- December 2019: 1 paper accepted at ICLR 2020 and 1 (oral) at WACV 2020
- October 2019: 1 paper accepted at the International Journal of Computer Vision (IJCV)
- September 2019: 1 paper accepted at NeurIPS 2019 (also oral at BayLearn 2019) and 2 papers (spotlights) accepted at CoRL 2019
- July 2019: 1 paper accepted (oral) at ICCV 2019
- May 2019: I did an interview with the wonderful Sam Charrington for the TWIML AI podcast!
Dr. Adrien Gaidon is the Director of Machine Learning at Toyota Research Institute (TRI) and an Adjunct Professor of computer science at Stanford University. Dr. Gaidon’s research focuses on discovering machine learning principles at the foundation of embodied intelligence. He has over 80 publications at top AI venues covering fundamental topics like Computer Vision and Learning for Robotics (cf. Google Scholar). His research has led to more than 80 patents and applications in areas including safe and autonomous driving. Dr. Gaidon received his Ph.D. from Microsoft Research - Inria Paris and has received multiple scientific awards at top conferences like CVPR and international AI competitions like PASCAL VOC and NeurIPS ProcGen.