InteractionMap<br>Improving Online Vectorized HDMap Construction with Interaction

Abstract

Vectorized high-definition (HD) maps are essential for an autonomous driving system. Recently, state-of-the-art map vectorization methods are mainly based on DETR-like framework to generate HD maps in an end-to-end manner. In this paper, we propose InteractionMap, which improves previous map vectorization methods by fully leveraging local-to-global information interaction in both time and space. Firstly, we explore enhancing DETR-like detectors by explicit position relation prior from point-level to instance-level, since map elements contain strong shape priors. Secondly, we propose a key-frame-based hierarchical temporal fusion module, which interacts temporal information from local to global. Lastly, the separate classification branch and regression branch lead to the problem of misalignment in the output distribution. We interact semantic information with geometric information by introducing a novel geometric-aware classification loss in optimization and a geometric-aware matching cost in label assignment. InteractionMap achieves state-of-the-art performance on both nuScenes and Argoverse2 benchmarks.

Motivation

Vectorized high-definition (HD) maps are essential for an autonomous driving system.
The map elements predicted by existing HD map models often suffer from distortion and jitter, which directly impact downstream tasks. There are mainly three reasons:

1. The inconsistency between the classification probability and the geometric quality of the map elements.
2. The lack of inter-instance and intra-instance interactions.
3. The long-duration map element occlusion caused by moving vehicles.

compare

Method

We propose InteractionMap, which improves previous map vectorization methods by fully leveraging local-to-global information interaction in both time and space.

Pipeline

Geometry-aware alignment module is designed to solve the misalignment problem of classification and position output by leveraging the interaction between semantic information and geometry information.
Relation map decoder utilizes explicit position relation embedding method from point to instance, efficiently employing progressive interaction of point-wise information and instance-wise information.
Key-frame-based temporal fusion module leverage temporal information interaction from local to global.

stream

Result

Comparison with SOTA methods on the nuScenes validation set

nus

We visualize results of InteractionMap in sequential frames under the condition of cloudy, sunny, rainy and nighttime respectively.

cloudy1 cloudy2 sunny1 sunny2 rainy1 rainy2 night1 night2

Citation

@article{wu2025interactionmap,
  title={InteractionMap: Improving Online Vectorized HDMap Construction with Interaction},
  author={Wu, Kuang and Yang, Chuan and Li, Zhanbin},
  journal={arXiv preprint arXiv:2503.21659},
  year={2025}
}

InteractionMapImproving Online Vectorized HDMap Construction with Interaction

Abstract

Motivation

Method

Result

Citation

InteractionMap
Improving Online Vectorized HDMap Construction with Interaction