Paper | Code | Mean AP | Model Name | Release Date |
---|---|---|---|---|
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework | ✓ Link | 48.5 | M2D-AS/0.7 | 2024-04-09 |
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | | 46.6 | LHGNN | 2025-01-07 |
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation | ✓ Link | 38.7 | VAB-Encodec (Ours) | 2024-09-27 |