Paper | Code | BLEU-1 | ModelName | ReleaseDate |
---|---|---|---|---|
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering | ✓ Link | 23.2 | MedVInT | 2023-05-17 |
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | ✓ Link | 7.6 | BLIP-2 | 2023-01-30 |
Flamingo: a Visual Language Model for Few-Shot Learning | ✓ Link | 4.1 | Open-Flamingo | 2022-04-29 |