Paper | Code | HM | ZSL | ModelName | ReleaseDate |
---|---|---|---|---|---|
Boosting Audio-visual Zero-shot Learning with Large Language Models | ✓ Link | 41.10 | 28.05 | KDA | 2023-11-21 |
Temporal and cross-modal attention for audio-visual zero-shot learning | ✓ Link | 31.72 | 24.81 | TCaF | 2022-07-20 |
Hyperbolic Audio-visual Zero-shot Learning | 29.32 | 22.24 | Hyper-multiple | 2023-08-24 | |
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language | ✓ Link | 27.15 | 20.0 | AVCA | 2022-03-07 |
Attribute Prototype Network for Any-Shot Learning | 20.61 | 16.44 | APN | 2022-04-04 | |
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings | 18.05 | 13.65 | AVGZSLNet | 2020-05-27 | |
Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos | 12.48 | 8.29 | CJME | 2019-10-19 |