Paper | Code | R@1 | R@5 | Sum(R@1,5) | Model Name | Release Date |
---|---|---|---|---|---|---|
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | ✓ Link | 51.9 | 76.8 | 128.7 | PaCE | 2023-05-24 |
Image Chat: Engaging Grounded Conversations | ✓ Link | 50.3 | 75.4 | 125.7 | TransResNet | 2018-11-02 |
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts | ✓ Link | 46.8 | 67.5 | 114.3 | VLMo | 2021-11-03 |
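
The Sum column appears to be the plain sum of the R@1 and R@5 scores in each row, and the rows appear to be ordered by that sum. A minimal sketch that checks both, using only the values shown in the table; the helper name `recall_sum` and the rounding to one decimal place are assumptions made for this illustration, not part of the benchmark's tooling:

```python
# Rows copied from the table above: (model name, R@1, R@5, reported sum).
rows = [
    ("PaCE", 51.9, 76.8, 128.7),
    ("TransResNet", 50.3, 75.4, 125.7),
    ("VLMo", 46.8, 67.5, 114.3),
]


def recall_sum(r_at_1, r_at_5):
    """Hypothetical helper: add the two recall scores, rounded to one decimal
    place to match the table's precision and avoid float noise."""
    return round(r_at_1 + r_at_5, 1)


# Models whose reported sum does not match R@1 + R@5.
mismatches = [
    name for name, r1, r5, reported in rows if recall_sum(r1, r5) != reported
]

# Model names ordered by the recomputed sum, highest first.
ranking = [
    name
    for name, r1, r5, _ in sorted(
        rows, key=lambda row: recall_sum(row[1], row[2]), reverse=True
    )
]

print("mismatches:", mismatches)
print("ranking:", ranking)
```

With the three rows above, no mismatches should be reported, and the recomputed ranking should follow the table's row order.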