visual-question-answering-on-mmbench

Visual Question Answering

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	GPT-3.5 score	ModelName	ReleaseDate
Mixture-of-Subspaces in Low-Rank Adaptation	✓ Link	73.8	LLaVA-InternLM2-ViT + MoSLoRA	2024-06-16
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts	✓ Link	73.0	CuMo-7B	2024-05-09
Mixture-of-Subspaces in Low-Rank Adaptation	✓ Link	73.0	LLaVA-LLaMA3-8B-ViT + MoSLoRA	2024-06-16
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization	✓ Link	67.3	Video-LaVIT	2024-02-05
DreamLLM: Synergistic Multimodal Comprehension and Creation	✓ Link	49.9	DreamLLM-7B	2023-09-20

OpenCodePapers