OpenCodePapers

image-classification-on-coloninst-v1-seen

Image Classification
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccurayModelNameReleaseDate
Frontiers in Intelligent Colonoscopy✓ Link94.06ColonGPT (w/ LoRA, w/o extra data)2024-10-22
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link93.84LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)2023-06-01
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link93.64MobileVLM-1.7B (w/ LoRA, w/ extra data)2023-12-28
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link93.62LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link93.52LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)2023-06-01
Improved Baselines with Visual Instruction Tuning✓ Link93.33LLaVA-v1.5 (w/ LoRA, w/ extra data)2023-10-05
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link93.24MGM-2B (w/o LoRA, w/ extra data)2024-03-27
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link93.02MobileVLM-1.7B (w/o LoRA, w/ extra data)2023-12-28
Improved Baselines with Visual Instruction Tuning✓ Link92.97LLaVA-v1.5 (w/ LoRA, w/o extra data)2023-10-05
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link92.97MGM-2B (w/o LoRA, w/o extra data)2024-03-27
Efficient Multimodal Learning from Data-centric Perspective✓ Link92.47Bunny-v1.0-3B (w/ LoRA, w/ extra data)2024-02-18
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link91.49MiniGPT-v2 (w/ LoRA, w/o extra data)2023-10-14
Efficient Multimodal Learning from Data-centric Perspective✓ Link91.16Bunny-v1.0-3B (w/ LoRA, w/o extra data)2024-02-18
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link90.00MiniGPT-v2 (w/ LoRA, w/ extra data)2023-10-14
Visual Instruction Tuning✓ Link89.61LLaVA-v1 (w/ LoRA, w/ extra data)2023-04-17
Visual Instruction Tuning✓ Link87.86LLaVA-v1 (w/ LoRA, w/o extra data)2023-04-17
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link87.22LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)2023-06-01