OpenCodePapers

Bird's-Eye View Semantic Segmentation

Leaderboard

Paper	Code	IoU veh - 224x480 - Vis. filter - 100x100 at 0.5	IoU veh - 448x800 - Vis. filter - 100x100 at 0.5	IoU veh - 224x480 - No vis. filter - 100x100 at 0.5	IoU veh - 448x800 - No vis. filter - 100x100 at 0.5	IoU ped - 224x480 - Vis. filter - 100x100 at 0.5	IoU lane - 224x480 - 100x100 at 0.5	IoU veh - 224x480 - No vis. filter - 100x50 at 0.25	IoU veh - Setting 3	ModelName	ReleaseDate
PointBeV: A Sparse Approach to BeV Predictions	✓ Link	44.7	48.7	39.9	43.2	19.9				PointBeV	2023-12-01
PointBeV: A Sparse Approach to BeV Predictions	✓ Link	44.0	47.6	38.7	42.1	18.5	49.6			PointBeV (static)	2023-12-01
Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?	✓ Link	43.0	46.6	36.9	40.9					Simple-BEV	2022-06-16
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers	✓ Link	42.0	45.5	35.8	39.0		25.7			BEVFormer	2022-03-31
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras	✓ Link	39.8		35.8		17.2				FIERY (static)	2021-04-21
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation	✓ Link	38.9		35.4						LaRa	2022-06-27
BAEFormer: Bi-Directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation		38.9	41.0	36.0	37.8					BAEFormer	2023-01-01
Cross-view Transformers for real-time Map-view Semantic Segmentation	✓ Link	36.0	37.7	31.4	32.5					CVT	2022-05-05
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras	✓ Link			38.2				41.1	58.5	FIERY	2021-04-21
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving						18.6				TBP-Former	2023-03-17
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving						17.2				TBP-Former (static)	2023-03-17
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D	✓ Link					15.0				Lift-Splat-Shoot	2020-08-13
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning	✓ Link					14.5				ST-P3	2022-07-15
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images	✓ Link						44.8			PETRv2	2022-06-02
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception	✓ Link						44.8			MatrixVT	2022-11-19
[]()							38.0			M^2BEV
Monocular Semantic Occupancy Grid Mapping with Convolutional Variational Encoder-Decoder Networks								8.8		VED	2018-04-06