Paper | Code | Top-1 Accuracy | PRE-TRAINING DATASET | ModelName | ReleaseDate |
---|---|---|---|---|---|
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | ✓ Link | 97 | AudioSet | CrissCross (AudioSet) | 2021-11-09 |
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | ✓ Link | 96 | Kinetics-400 | CrissCross (Kinetics-400) | 2021-11-09 |
Self-Supervised Learning by Cross-Modal Audio-Video Clustering | ✓ Link | 95 | IG-Random | XDC | 2019-11-28 |
Self-Supervised Learning by Cross-Modal Audio-Video Clustering | ✓ Link | 95 | AudioSet | XDC | 2019-11-28 |
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity | ✓ Link | 93 | Kinetics-Sound | CrissCross (Kinetics-Sound) | 2021-11-09 |