In April, I visited Prof. Bernt Schiele’s Computer Vision and Multimodal Computing Department at the Max Planck Institute for Informatics in Saarbrücken. Aside from a presentation on my recent superpixel benchmark, I also met many interesting people and learned a lot about a career in research.
Lifting convolutional neural networks to 3D data is challenging due to different data modalities (videos, image volumes, CAD models, LiDAR data etc.) as well as computational limitations (regarding runtime and memory). In this article, I want to summarize several recent papers addressing these problems and tackling different applications such as shape recognition, shape retrieval, medical image segmentation or object detection.