OpenCodePapers

image-classification-on-coloninst-v1-unseen

Image Classification
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccurayModelNameReleaseDate
Frontiers in Intelligent Colonoscopy✓ Link83.24ColonGPT (w/ LoRA, w/o extra data)2024-10-22
Improved Baselines with Visual Instruction Tuning✓ Link80.89LLaVA-v1.5 (w/ LoRA, w/ extra data)2023-10-05
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link80.44MobileVLM-1.7B (w/ LoRA, w/ extra data)2023-12-28
Efficient Multimodal Learning from Data-centric Perspective✓ Link79.50Bunny-v1.0-3B (w/ LoRA, w/ extra data)2024-02-18
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link79.24LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)2023-06-01
Improved Baselines with Visual Instruction Tuning✓ Link79.10LLaVA-v1.5 (w/ LoRA, w/o extra data)2023-10-05
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link78.99MGM-2B (w/o LoRA, w/o extra data)2024-03-27
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link78.75MobileVLM-1.7B (w/o LoRA, w/ extra data)2023-12-28
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link78.69MGM-2B (w/o LoRA, w/ extra data)2024-03-27
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link78.04LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)2023-06-01
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link77.93MiniGPT-v2 (w/ LoRA, w/o extra data)2023-10-14
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link77.38LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)2023-06-01
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link76.82MiniGPT-v2 (w/ LoRA, w/ extra data)2023-10-14
Efficient Multimodal Learning from Data-centric Perspective✓ Link75.50Bunny-v1.0-3B (w/ LoRA, w/o extra data)2024-02-18
Visual Instruction Tuning✓ Link72.08LLaVA-v1 (w/ LoRA, w/o extra data)2023-04-17
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link66.51LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)2023-06-01
Visual Instruction Tuning✓ Link42.17LLaVA-v1 (w/ LoRA, w/ extra data)2023-04-17