Paper | Code | Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|
AudioCLIP: Extending CLIP to Image, Text and Audio | ✓ Link | 90.07 | AudioCLIP | 2021-06-24 |
Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning | ✓ Link | 89.4 | MATPAC (SSL, linear eval) | 2025-02-17 |
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network | ✓ Link | 89 | 1DCNN | 2019-04-18 |