Paper | Code | Accuracy (%) | ModelName | ReleaseDate |
---|---|---|---|---|
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics | ✓ Link | 95.2 | HERMES | 2024-08-30 |
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | ✓ Link | 93.0 | MA-LMM | 2024-04-08 |
Selective Structured State-Spaces for Long-Form Video Understanding | 90.7 | S5 | 2023-03-25 | |
Efficient Movie Scene Detection using State-Space Transformers | ✓ Link | 90.27 | TranS4mer | 2022-12-29 |
Learning To Recognize Procedural Activities with Distant Supervision | ✓ Link | 89.9 | D-Sprv. | 2022-01-26 |
Long Movie Clip Classification with State-Space Video Models | ✓ Link | 88.2 | ViS4mer | 2022-04-04 |
Graph-Based High-Order Relation Modeling for Long-Term Action Recognition | 75.5 | GHRM | 2021-06-19 | |
Timeception for Complex Action Recognition | ✓ Link | 71.3 | Timeception | 2018-12-04 |
VideoGraph: Recognizing Minutes-Long Human Activities in Videos | 69.5 | VideoGraph | 2019-05-13 |