| Paper | Code | recall@5 | ModelName | ReleaseDate |
|---|---|---|---|---|
| Interaction Region Visual Transformer for Egocentric Action Anticipation | ✓ Link | 23.75 | InAViT | 2022-11-25 |
| Anticipative Video Transformer | ✓ Link | 16.7 | AVT++ | 2021-06-03 |
| Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation | ✓ Link | 14.9 | AFFT | 2022-10-23 |
| Predicting the Next Action by Modeling the Abstract Goal | 14.29 | Abstract Goal | 2022-09-12 | |
| Technical Report: Temporal Aggregate Representations | ✓ Link | 12.6 | TempAgg | 2021-06-06 |
| Anticipative Video Transformer | ✓ Link | 12.6 | AVT+ | 2021-06-03 |
| Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video | ✓ Link | 11.2 | RULSTM | 2020-05-04 |
| Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos | 11.0 | TBN | 2021-07-18 |