JD-SLAM: Joint camera pose estimation and moving object segmentation for simultaneous localization and mapping in dynamic scenes
Author(s): Zhai, YJ (Zhai, Yujia); Lu, BL (Lu, Baoli); Li, WJ (Li, Weijun); Xu, J (Xu, Jian); Ma, SY (Ma, Shuangyi)
Source: INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS Volume: 18 Issue: 1 Article Number: 1729881421994447 DOI: 10.1177/1729881421994447 Published: JAN 2021
Abstract: As a fundamental assumption in simultaneous localization and mapping, the static scenes hypothesis can be hardly fulfilled in applications of indoor/outdoor navigation or localization. Recent works about simultaneous localization and mapping in dynamic scenes commonly use heavy pixel-level segmentation net to distinguish dynamic objects, which brings enormous calculations and limits the real-time performance of the system. That restricts the application of simultaneous localization and mapping on the mobile terminal. In this article, we present a lightweight system for monocular simultaneous localization and mapping in dynamic scenes, which can run in real time on central processing unit (CPU) and generate a semantic probability map. The pixel-wise semantic segmentation net is replaced with a lightweight object detection net combined with three-dimensional segmentation based on motion clustering. And a framework integrated with an improved weighted-random sample consensus solver is proposed to jointly solve the camera pose and perform three-dimensional object segmentation, which enables high accuracy and efficiency. Besides, the prior information of the generated map and the object detection results is introduced for better estimation. The experiments on the public data set, and in the real-world demonstrate that our method obtains an outstanding improvement in both accuracy and speed compared to state-of-the-art methods.
Accession Number: WOS:000623481800001
ISSN: 1729-8814
Full Text: https://journals.sagepub.com/doi/10.1177/1729881421994447