OpenCodePapers

audio-classification-on-epic-sounds

ClassificationAudio Classification
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyModelNameReleaseDate
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities78.2Mirasol3B2023-11-09
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition61CA2ST(B/16)2025-03-30
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition60.3CAVA(B/16)2025-03-30