OpenCodePapers

semantic-segmentation-on-deliver

Semantic Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodemIoUModelNameReleaseDate
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes✓ Link68.6CAFuser-CAA2024-10-14
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link68.18StitchFusion(RGB-D-E-LiDAR)2024-08-02
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer✓ Link66.9GeminiFusion2024-06-03
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link66.65StitchFusion (RGB-D-LiDAR)2024-08-02
Delivering Arbitrary-Modal Semantic Segmentation✓ Link66.30CMNeXt (RGB-D-E-LiDAR)2023-03-02
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link66.03StitchFusion (RGB-D-Event)2024-08-02
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link65.75StitchFusion (RGB-Depth)2024-08-02
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation✓ Link65.38MemorySAM-B+(R-D-E-L)2025-03-09
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation✓ Link63.48MemorySAM-B+(R-D)2025-03-09
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers✓ Link62.67CMX (RGB-Depth)2022-03-09
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation✓ Link62.42MemorySAM-B+(R-D-E)2025-03-09
Multimodal Token Fusion for Vision Transformers✓ Link60.25TokenFusion (RGB-Depth)2022-04-19
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link58.03StitchFusion (RGB-LiDAR)2024-08-02
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation✓ Link57.44StitchFusion (RGB-Event)2024-08-02
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers✓ Link56.52CMX (RGB-Event)2022-03-09
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers✓ Link56.37CMX (RGB-LiDAR)2022-03-09
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation✓ Link53.22MemorySAM-B+(RGB)2025-03-09
Multimodal Token Fusion for Vision Transformers✓ Link53.01TokenFusion (RGB-LiDAR)2022-04-19
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link52.97HRFuser (RGB-D-E-Li)2022-06-30
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link52.72HRFuser (RGB-D-LiDAR)2022-06-30
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link51.88HRFuser (RGB-Depth)2022-06-30
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link51.83HRFuser (RGB-D-Event)2022-06-30
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link47.95HRFuser (RGB)2022-06-30
Multimodal Token Fusion for Vision Transformers✓ Link45.63TokenFusion (RGB-Event)2022-04-19
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link43.13HRFuser (RGB-LiDAR)2022-06-30
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection✓ Link42.22HRFuser (RGB-Event)2022-06-30