I am a Postdoctoral Computer Vision Researcher at the Vrije Universiteit Brussel. I completed my PhD in Engineering Sciences in June 2024 under the supervision of Prof. Adrian Munteanu. My doctoral research included important practical applications and involved close collaborations with companies such as Xenomatix, Sabca and VoxelSensors. Our work with VoxelSensors resulted in a patent. I was also awarded a PhD fellowship for strategic basic research by the Research Foundation Flanders (FWO).
Research interests are in 3D computer vision, deep learning, and point cloud processing. Most of my research has been focused on 3D scene understanding and the introduction of scalability to deep learning. Recently, I have also developed an interest in the impressive development of view synthesis methods and its connection with existing 3D processing methods.
[09/2024] My PhD thesis has been selected for presentation at BMVC 2024 Doctoral Consortium.
[07/2024] Our paper, RT-GS2, has been accepted for presentation at BMVC 2024.
[06/2024] I have succesfully defended my PhD thesis.
Education
Vrije Universiteit Brussel
PhD in Engineering Sciences (awarded summa cum laude)
2019-2024
Vrije Universiteit Brussel
Master in Electrical Engineering (awarded summa cum laude)
2017-2019
Université libre de Bruxelles
Master in Electrical Engineering (awarded summa cum laude)
2017-2019
Sapienza Università di Roma
Master in Artificial Intelligence and Robotics (Exchange program)
2018-2019
The first generalizable semantic segmentation method employing Gaussian Splatting. Our method is not only superior in segmentation quality, but also achieves real-time performance of 27 FPS, marking an astonishing 901 times speedup compared to the SOTA.
A 3D instance segmentation method which simultaneously learns coefficients and prototypes. The obtained prototypes are visualizable and interpretable. Experiments on S3DIS-blocks and PartNet.
A significant improvement in performance over RESSCAL3D: resolution scalable 3D semantic segmentation + a new dataset VX-S3DIS exhibiting resolution scalable point streams with semantic labels.
A 3D instance segmentation method which simultaneously learns coefficients and prototypes. The obtained prototypes are visualizable and interpretable.
W6dnet: Weakly-supervised domain adaptation for monocular vehicle 6d pose estimation with 3d priors and synthetic data Yangxintong Lyu,
Remco Royen,
Adrian Munteanu IEEE Transactions on Instrumentation and Measurement, 2024
[Paper][Code]
1) The presentation of a new highly realistic synthetic traffic dataset, SynthV6D, for 6D pose estimation. 2) A weakly-supervised domain adaptation approach for 6D pose estimation of vehicles.
A fully parallelizable vector-quantized variational autoencoder model (VQVAE) that generates high-quality point clouds in milliseconds.
RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds Remco Royen,
Adrian Munteanu IEEE International Conference on Image Processing (ICIP), 2023
[ArXiv][Paper]
Resolution-scalable 3D semantic segmentation of point clouds. By enabling the processing of different resolution scales in parallel, RESSCAL3D is 31-62% faster than the non-scalable baseline and allows early predictions.
A GPU-accelerated simulator that enables the generation of high-quality, perfectly labelled data for any Time-of-Flight sensor, including LiDAR.
Mono6D++: Learning Point Cloud Visibility for 3D Prior-based Vehicle 6D Pose Estimation Yangxintong Lyu,
Olivier Ducastel,
Remco Royen,
Adrian Munteanu European Workshop on Visual Information Processing (EUVIP), 2023
[Paper]
An improvement over Mono6D by the introduction of point cloud visibility prediction.
Improved Block Merging for 3D Point Cloud Instance Segmentation Leon Denis,
Remco Royen,
Adrian Munteanu International Conference on Digital Signal Processing (DSP), 2023
[ArXiv][Paper][Code]
A new block merging algorithm suitable for any block-based 3D instance segmentation technique. It significantly and consistently improves the obtained accuracy for all evaluation metrics employed in literature, regardless of the underlying network architecture.
The first deep-learning-based approach to automatic foot measurement extraction from a single 3D scan.
MONO6D: Monocular Vehicle 6D Pose Estimation with 3D Priors Yangxintong Lyu,
Remco Royen,
Adrian Munteanu IEEE International Conference on Image Processing (ICIP), 2022
[Paper]
A monocular approach for vehicle pose estimation using vehicle 3D priors provided by vehicle make-and-model recognition methods to estimate the 6D pose.