OpenCodePapers

action-segmentation-on-coin

Action Segmentation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeFrame accuracyModelNameReleaseDate
UnLoc: A Unified Framework for Video Localization Tasks✓ Link72.8UnLoc-L2023-08-21
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation✓ Link70.0Univl2020-02-15
Multi-granularity Correspondence Learning from Long-term Noisy Videos✓ Link69.8Norton2024-01-30
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding✓ Link68.7VideoClip2021-09-28
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding✓ Link68.4VLM2021-05-20
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment68.4TACo2021-08-23
End-to-End Learning of Visual Representations from Uncurated Instructional Videos✓ Link61.0MIL-NCE2019-12-13
ActBERT: Learning Global-Local Video-Text Representations✓ Link57.0ActBERT2020-11-14
End-to-End Learning of Visual Representations from Uncurated Instructional Videos✓ Link53.9CBT2019-12-13