Paper | Code | recall@5 | ModelName | ReleaseDate |
---|---|---|---|---|
Interaction Region Visual Transformer for Egocentric Action Anticipation | ✓ Link | 23.75 | InAViT | 2022-11-25 |
Anticipative Video Transformer | ✓ Link | 16.7 | AVT++ | 2021-06-03 |
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation | ✓ Link | 14.9 | AFFT | 2022-10-23 |
Predicting the Next Action by Modeling the Abstract Goal | 14.29 | Abstract Goal | 2022-09-12 | |
Technical Report: Temporal Aggregate Representations | ✓ Link | 12.6 | TempAgg | 2021-06-06 |
Anticipative Video Transformer | ✓ Link | 12.6 | AVT+ | 2021-06-03 |
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video | ✓ Link | 11.2 | RULSTM | 2020-05-04 |
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos | 11.0 | TBN | 2021-07-18 |