HI! I'm Robin Baumann
Computer Vision and Deep Learning Researcher
from Karlsruhe, Germany.

About Me

Profile

I received my MSc degree from Karlsruhe University of Applied Sciences, where I specialized in Machine Learning. I am particularly interested in Computer Vision and its intersection with Computer Graphics. I like to spend my free time doing all kinds of sports. To expand my knowledge, I read non-fiction books about different topics and enjoy spending time outdoors with my friends. My other passion besides sports is music. As such, I play the Saxophone and enjoy going to festivals and concerts. Recently, I also started DJing and producing electronic music.

Picture of me in black/white
Detail
Name: Robin Baumann
Age: 27
Location: Karlsruhe, Germany
Hobbies: DJing, all kinds of sports

Publications

Classification of 3D structures based on an object detection for facade elements in multiple views during the reconstruction process

View on IEEE Xplore!

Classification of 3D structures based on an object detection for facade elements in multiple views during the reconstruction process

Robin Baumann, Jan-Friso Evers-Senne, Marcus Strand
Published in: 2019 IEEE International Conference on Industrial Cyber Physical Systems (ICPS)

The objective of this paper is to propose an approach which combines the multi view 3D reconstruction process of a building's facade with semantic information obtained by a deep neural network on the 2D observations of the considered scene. The object detector was used to annotate precise 2D features with semantic labels based on the object proposals. In a next step, spatially connected 2D features were evaluated to extend the initially computed labels to the whole facade element. The set of annotated 2D precise features was used to derive a classification of reconstructed 3D features. The a priori probabilities of the object detection model were used to compensate for biased predictions. The described approach was evaluated thoroughly with a view to different strategies for the annotation correction in 2D. Several failure patterns occurring in the considered application of facade reconstruction were observed and suggested solutions were given.

Analyzing CNN Architectures for Secure and Private Image Classification with Homomorphic Encryption and Differential Privacy

Read the PDF

Analyzing CNN Architectures for Secure and Private Image Classification with Homomorphic Encryption and Differential Privacy

Robin Baumann, David Böhmb, Raoul Saiptb and Astrid Laubenheimer
Presentes at the 8th Collaborative European Research Conference (CERC) June 9-10, 2023, Barcelona, Spain, 2023. Read the PDF.

Homomorphic encryption (HE) enables privacy-preserving deep learning, but it comes with significant perfor- mance overheads. In this study, we evaluate the impact of model architectures on the utility and efficiency of deep learning models under differential privacy (DP) and HE settings. Our experiments reveal that dedicated model architectures are crucial for maintaining model utility when using DP. Moreover, we observe that align- ing complex model architectures like ResNets for HE by replacing ReLU with square activation, max pooling with average pooling, and group norm with batch norm strongly deteriorates model utility and results in archi- tectures with sharp minima that fail to generalize. Training such models with DP, however, yields a regularizing effect that improves model utility. Our study contributes to the understanding of the role of model architecture on the applicability of DP and HE.

Work

Deep Learning for Mobile Devices with TensorFlow Lite:
Train Your Custom Object Detector

View Blogpost!

Deep Learning for Mobile Devices with TensorFlow Lite:
Train Your Custom Object Detector

This is a follow-up to my first article on concepts and architectures of Deep Learning for mobile devices. This article provides a step-by-step tutorial on how you can train your custom object detecor with the TensorFlow Object Detection API and prepare the model for the integration into an Android App.

Google Summer of Code
@ TensorFlow

View Documentation and Code!

Implement Mesh R-CNN in TensorFlow

I participated in the Google Summer of Code 2020 for TensorFlow. During the three month period, I implemented Mesh R-CNN with TensorFlow 2 and a TensorFlow Datasets Provider for Pix3D for the TensorFlow Graphics team.

TensorFlow AtlasNet V2 Implementation

View Code on GitHub!

Implement AtlasNet V2 in TensorFlow

In a university project, I implemented AtlasNet V2 in TensorFlow and conducted several experiments regarding the 3D reconstruction of articulated and non-articulated shapes.

Deep Learning for Mobile Devices with TensorFlow Lite:
Concepts and Architectures

View Blogpost!

Deep Learning for Mobile Devices with TensorFlow Lite:
Concepts and Architectures

In this article, I cover several Deep Learning Architectures that are well suited to be run on mobile devices, like smartphones. I also tackle other concepts, like quantization.

3D Deep Learning with TensorFlow 2
 

View Blogpost!

3D Deep Learning with TensorFlow 2

I published a Blockpost on the inovex blog about this topic. Below, you can find the abstract. The world that we interact with each and every day is three-dimensional, but the majority of deep learning models process visual data as 2D images. However, there are some neural network architectures that are capable of processing 3D structures directly. An early approach was presented at the Conference on Computer Vision and Pattern Recognition (CVPR) in 2017 and is called PointNet.

Self Organizing Map
 

View on GitHub!

PySOM

This is a Python implementation of a Self Organizing Map. I developed it as a homework for my computer science studies.