VADv2 takes a multi-view image sequence as input in a streaming manner, transforms the sensor data into environmental token embeddings, outputs a probabilistic distribution over actions, and samples one action to control the vehicle. This action distribution is learned from large-scale driving demonstrations. VADv2 is trained on Town03, Town04, Town06, Town07, and Town10, and evaluated on the unseen Town05. It runs stably in a fully end-to-end manner, even without a rule-based wrapper.
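
As a rough illustration of this pipeline, the PyTorch sketch below encodes each camera view into environmental tokens, lets a set of discretized action queries attend to them, and samples one action from the resulting softmax distribution. Everything here is an assumption for illustration only: `VADv2Sketch`, the tiny CNN encoder, `num_actions`, `embed_dim`, and the cross-attention layout are placeholders, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class VADv2Sketch(nn.Module):
    """Illustrative sketch: multi-view images -> environmental tokens
    -> distribution over a discretized action vocabulary -> sampled
    action. All module choices and sizes are placeholders."""

    def __init__(self, num_actions=4096, embed_dim=256):
        super().__init__()
        # Placeholder per-view image encoder (the real model uses a
        # full perception stack; a tiny CNN stands in here).
        self.image_encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=7, stride=4), nn.ReLU(),
            nn.Conv2d(32, embed_dim, kernel_size=5, stride=4), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        # Learnable embeddings for the discretized action vocabulary.
        self.action_embed = nn.Embedding(num_actions, embed_dim)
        # Action queries attend to the environmental tokens.
        self.cross_attn = nn.MultiheadAttention(
            embed_dim, num_heads=8, batch_first=True)
        # Scores each action; softmax gives the action distribution.
        self.score_head = nn.Linear(embed_dim, 1)

    def forward(self, images):
        # images: (batch, views, 3, H, W) for the current frame.
        b, v, c, h, w = images.shape
        feats = self.image_encoder(images.flatten(0, 1))   # (b*v, D, 1, 1)
        env_tokens = feats.flatten(1).view(b, v, -1)       # (b, v, D)
        queries = self.action_embed.weight.unsqueeze(0).expand(b, -1, -1)
        attended, _ = self.cross_attn(queries, env_tokens, env_tokens)
        logits = self.score_head(attended).squeeze(-1)     # (b, num_actions)
        probs = logits.softmax(dim=-1)                     # action distribution
        action = torch.multinomial(probs, num_samples=1)   # sampled action
        return probs, action

model = VADv2Sketch()
probs, action = model(torch.randn(1, 6, 3, 224, 224))
print(probs.shape, action.item())
```

In deployment one might take the argmax or a filtered sample instead of raw multinomial sampling; the sketch only shows the distribution-then-sample structure described above.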