Paper | Code | CIDEr | ModelName | ReleaseDate |
---|---|---|---|---|
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | ✓ Link | 0.518 | Audio Flamingo (4-shot) | 2024-02-02 |
RECAP: Retrieval-Augmented Audio Captioning | ✓ Link | 0.359 | RECAP (4-shot) | 2023-09-18 |
Prefix tuning for automated audio captioning | ✓ Link | 0.211 | Prefix tuning for automated audio captioning | 2023-03-30 |
Audio Captioning Transformer | ✓ Link | 0.149 | Audio captioning transformer | 2021-07-21 |
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS | ✓ Link | 0.147 | Automated audio captioning by fine-tuning bart with audioset tags | 2021-11-15 |