Paper | Code | Top-1 accuracy % | Top-5 Accuracy % | ModelName | ReleaseDate |
---|---|---|---|---|---|
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | ✓ Link | 77.6 | 92.9 | XKD (ViT-B/112/16) | 2022-11-25 |
Spatiotemporal Contrastive Video Representation Learning | ✓ Link | 71.6 | CVRL (R3D-152 2x; K600 pretrain) | 2020-08-09 | |
Spatiotemporal Contrastive Video Representation Learning | ✓ Link | 67.6 | CVRL (R3D-101) | 2020-08-09 | |
Spatiotemporal Contrastive Video Representation Learning | ✓ Link | 66.1 | CVRL (R3D-50) | 2020-08-09 |