OpenCodePapers


Audio-Visual Question Answering (MUSIC-AVQA-v2.0)
Leaderboard
| Paper | Code | Accuracy (%) | Model Name | Release Date |
|---|---|---|---|---|
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | ✓ | 79.15 | Meerkat | 2024-07-01 |
| Question-Aware Gaussian Experts for Audio-Visual Question Answering | ✓ | 76.43 | QA-TIGER | 2025-03-06 |
| Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering | ✓ | 75.44 | LAST-Att | 2023-10-10 |
| Vision Transformers are Parameter-Efficient Audio-Visual Learners | ✓ | 73.18 | LAVISH | 2022-12-15 |
| Learning to Answer Questions in Dynamic Audio-Visual Scenarios | ✓ | 71.02 | AVST | 2022-03-26 |