Whether autonomous driving can effectively handle challenging scenarios such as bad weather and complex traffic
environments remains in doubt. One critical difficulty is that single-agent perception struggles to obtain
complementary perceptual information in multi-condition scenes, for example under occlusion. To investigate the
advantages of collaborative perception in high-risk driving scenarios, we construct a multi-condition challenging
dataset for large-range vehicle-infrastructure cooperative perception, called V2XScenes, which includes seven typical
multi-modal sensor layouts along a successive road section. In particular, each selected scene is labeled with a specific
condition description, and we provide unique global object tracking IDs across the entire road section and
sequential frames to ensure consistency. Comprehensive cooperative perception benchmarks for 3D object detection and
tracking are provided, and quantitative results based on state-of-the-art methods demonstrate the effectiveness of
collaborative perception in corner-case conditions.