Paper | Code | CIDEr | SPICE | ROUGE-L | ModelName | ReleaseDate |
---|---|---|---|---|---|---|
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative | 49.87 | 15.76 | 27.90 | CEN | 2024-06-10 | |
GiT: Towards Generalist Vision Transformer through Universal Language Interface | ✓ Link | 32.43 | 13.70 | 24.51 | GIT | 2024-03-14 |
SEM-POS: Grammatically and Semantically Correct Video Captioning | 26.01 | 12.09 | 20.11 | SEM-POS | 2023-03-26 | |
Action knowledge for video captioning with graph neural networks | ✓ Link | 25.90 | 11.99 | 21.42 | AKGNN | 2023-03-16 |