OpenCodePapers

mmr-total-on-mrr-benchmark

MMR total
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeTotal Column ScoreModelNameReleaseDate
Claude 3.5 Sonnet Model Card Addendum463Claude 3.5 Sonnet2024-06-24
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding457GPT-4o2024-06-14
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)✓ Link415GPT-4V2023-09-29
Visual Instruction Tuning✓ Link412LLaVA-NEXT-34B2023-04-17
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone397Phi-3-Vision2024-04-22
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks✓ Link368InternVL2-8B2023-12-21
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond✓ Link366Qwen-vl-max2023-08-24
Visual Instruction Tuning✓ Link335LLaVA-NEXT-13B2023-04-17
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond✓ Link310Qwen-vl-plus2023-08-24
What matters when building vision-language models?256Idefics-2-8B2024-05-03
Visual Instruction Tuning✓ Link243LLaVA-1.5-13B2023-04-17
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks✓ Link237InternVL2-1B2023-12-21
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models✓ Link214Monkey-Chat-7B2023-11-11
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents✓ Link139Idefics-80B2023-06-21