Project page template is borrowed from DreamBooth.
[Arxiv Report] [Github] [BibTeX]
Maps provide robots with crucial environmental knowledge, thereby enabling them to perform interactive tasks effectively. Easily accessing accurate abstract-to-detailed geometric and semantic concepts from maps is crucial for robots to make informed and efficient decisions.
We propose DHP-Mapping, a dense mapping system that represented the environment as a collection of TSDF submaps, with each submap representing a unique object. The map structure enable hierarchical modeling. It stores the voxel level geometry and label information into the TSDF and label layer of each submap and maintain the instance-level information via the submap collection.
Our system converts sensor data into point segments. (A): A data association process assign each segment a submap ID. (B): Segment information is integrated into the assigned submap's TSDF and label layers through ray-tracing. Two modules are proposed to enhance mapping quality. (C): Fusing information among voxels sharing identical spatila inforamtion to avoid submap overlaping. (D): A CRF algorithm encourages label consistency among voxels exhibiting similar color and nearby position.
We conduct experiments on indoor simulation and outdoor real-world datasets. (flat SemanticKITTI) Qualitative results demonstrate our system's advancement in comprehensively reconstructing scenes. Compared to panmap, our system can categorize semantic classes and track objects IDs with higher accuracy, and can produce denser map with high precision.
The exclusion of the refinement module results in an obvious decline in performance. Directly integrating imperfect panoptic segmentation results into the map leads to unclear submap boundaries, causing submaps to mix with others.
Quantatitive results show our method provides more accurate metric-semantic map. It categorize semantic classes and track objects IDs with higher accuracy, and produce denser map with high precision.
The use of a comprehensive labeling system in our mapping system greatly enriches scene representation Besides, the hierarchical submap-based data structure facilitates submap-level manipulation and ensures rapid information retrieval.
Conclusion
In this work, we design a dense volumetric mapping system that uses multiple TSDF submaps and panoptic labels to represent the scene hierarchically and holistically, while maintaining voxel-level and submap-level metric and label information. The proposed inter-submaps label management module ensures the disjoint of spatial information in each submap. The label refinement module improves the accuracy of panoptic labels by taking advantage of the inherent cohesion of objects and incorporating contextual information from the entire scene. This hierarchical TSDF submaps with panoptic labels data structure enable high-level interactive tasks and dynamic environment modeling. In future work, more abstract and high-level representation can be further extracted and integrated above this data structure. This includes establishing topological connections between entities and integrating language-based expressions that go beyond metric and symbolic representations.
BibTeX
@misc{hu2024dhpmapping,
title={DHP-Mapping: A Dense Panoptic Mapping System with Hierarchical World Representation and Label Optimization Techniques},
author={Tianshuai Hu and Jianhao Jiao and Yucheng Xu and Hongji Liu and Sheng Wang and Ming Liu},
year={2024},
eprint={2403.16880},
archivePrefix={arXiv},
primaryClass={cs.RO}
}
Project page template is borrowed from DreamBooth.