OpenCodePapers

Semantic Segmentation: Speech Prompted Semantic Segmentation
Leaderboard
| Paper | Code | mAP | mIoU | Model Name | Release Date |
|---|---|---|---|---|---|
| Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language | ✓ Link | 48.7 | 36.8 | DenseAV | 2024-06-09 |
| Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input | | 32.2 | 26.3 | DAVENet | 2018-04-04 |
| Contrastive Audio-Visual Masked Autoencoder | ✓ Link | 27.2 | 19.9 | CAVMAE | 2022-10-02 |
| ImageBind: One Embedding Space To Bind Them All | ✓ Link | 20.2 | 19.7 | ImageBIND | 2023-05-09 |