Details
Citation
Swingler K, Rumble T, Goutcher R, Hibbard P, Donoghue M & Harvey D (2024) Combined Depth and Semantic Segmentation from Synthetic Data and a W-Net Architecture. In: volume 1. 16th International Conference on Neural Computation Theory and Applications, Porto, Portugal, 20.11.2024-22.11.2024. SCITEPRESS - Science and Technology Publications, pp. 413-422. https://doi.org/10.5220/0012877500003837
Abstract
Monocular pixel level depth estimation requires an algorithm to label every pixel in an image with its estimated distance from the camera. The task is more challenging than binocular depth estimation, where two cameras fixed a small distance apart are used. Algorithms that combine depth estimation with pixel level semantic segmentation show improved performance but present the practical challenge of requiring a dataset that is annotated at pixel level with both class labels and depth values. This paper presents a new convolutional neural network architecture capable of simultaneous monocular depth estimation and semantic segmentation and shows how synthetic data generated using computer games technology can be used to train such models. The algorithm performs at over 98% accuracy on the segmentation task and 88% on the depth estimation task.
Status | Published |
---|---|
Funders | |
Publication date | 30/11/2024 |
Publication date online | 30/11/2024 |
Publisher | SCITEPRESS - Science and Technology Publications |
ISBN | 9789897587214 |
Conference | 16th International Conference on Neural Computation Theory and Applications |
Conference location | Porto, Portugal |
Dates |
People (6)
Technical Specialist (Cognition), Psychology
Associate Professor, Psychology
Research Fellow, Psychology
Professor in Psychology, Psychology
Tutor, Psychology
Professor, Computing Science