OpenCodePapers
bird-s-eye-view-semantic-segmentation-on
Bird's-Eye View Semantic Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
IoU veh - 224x480 - Vis filter. - 100x100 at 0.5
↕
IoU veh - 448x800 - Vis filter. - 100x100 at 0.5
↕
IoU veh - 224x480 - No vis filter - 100x100 at 0.5
↕
IoU veh - 448x800 - No vis filter - 100x100 at 0.5
↕
IoU ped - 224x480 - Vis filter. - 100x100 at 0.5
↕
IoU lane - 224x480 - 100x100 at 0.5
↕
IoU veh - 224x480 - No vis filter - 100x50 at 0.25
↕
IoU vehicle - Setting 3
↕
ModelName
ReleaseDate
↕
PointBeV: A Sparse Approach to BeV Predictions
✓ Link
44.7
48.7
39.9
43.2
19.9
PointBeV
2023-12-01
PointBeV: A Sparse Approach to BeV Predictions
✓ Link
44.0
47.6
38.7
42.1
18.5
49.6
PointBeV (static)
2023-12-01
Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?
✓ Link
43.0
46.6
36.9
40.9
Simple-BEV
2022-06-16
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
✓ Link
42.0
45.5
35.8
39.0
25.7
BEVFormer
2022-03-31
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
✓ Link
39.8
35.8
17.2
FIERY (static)
2021-04-21
BAEFormer: Bi-Directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation
38.9
41.0
36
37.8
BAEFormer
2023-01-01
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
✓ Link
38.9
35.4
LaRa
2022-06-27
Cross-view Transformers for real-time Map-view Semantic Segmentation
✓ Link
36.0
37.7
31.4
32.5
CVT
2022-05-05
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
✓ Link
38.2
41.1
58.5
FIERY
2021-04-21
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving
18.6
TBP-Former
2023-03-17
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving
17.2
TBP-Former (static)
2023-03-17
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D
✓ Link
15.0
Lift-Splat-Shoot
2020-08-13
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
✓ Link
14.5
ST-P3
2022-07-15
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
✓ Link
44.8
PETRv2
2022-06-02
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
✓ Link
44.8
MatrixVT
2022-11-19
[]()
38.0
M^2BEV
Monocular Semantic Occupancy Grid Mapping with Convolutional Variational Encoder-Decoder Networks
8.8
VED
2018-04-06