VoxNet | 3D Convolutional Neural Networks


Convolutional Neural Networks have largely replaced the traditional "preprocessing -> features -> classifier" pipeline for object recognition and other tasks in computer vision. Can we do the same for analogous tasks using range sensors such as LiDAR? We propose a framework integrating columetric Occupancy Grids with 3D Convolutional Neural Networks. Unlike previous 2.5D approaches based on depth images, our approach is fully volumetric. Our results show that this framework obtains state-of-the-art accuracy at high speeds, mirroring results in computer vision.

Code: github.com/dimatura/voxnet


  • D. Maturana and S. Scherer. 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR. In ICRA. 2015. [ .bib ] [ .pdf ]
  • D. Maturana and S. Scherer. VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition. In IROS. 2015. [ .bib ] [ .pdf ]