Paper | Code | F1 | ModelName | ReleaseDate |
---|---|---|---|---|
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | ✓ Link | 77.6 | PaCE | 2023-05-24 |
MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | ✓ Link | 75.5 | Divter | 2022-11-10 |
MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | ✓ Link | 59.0 | DE++ | 2022-11-10 |
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | ✓ Link | 55.8 | ViLT | 2021-02-05 |