OpenCodePapers

referring-expression-generation-on-coloninst

Referring expression generation
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccurayModelNameReleaseDate
Frontiers in Intelligent Colonoscopy✓ Link99.96ColonGPT (w/ LoRA, w/o extra data)2024-10-22
Improved Baselines with Visual Instruction Tuning✓ Link99.32LLaVA-v1.5 (w/ LoRA, w/ extra data)2023-10-05
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link99.3LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)2023-06-01
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link98.75MGM-2B (w/o LoRA, w/ extra data)2024-03-27
Improved Baselines with Visual Instruction Tuning✓ Link98.58LLaVA-v1.5 (w/ LoRA, w/o extra data)2023-10-05
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models✓ Link98.17MGM-2B (w/o LoRA, w/o extra data)2024-03-27
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link97.87MobileVLM-1.7B (w/ LoRA, w/ extra data)2023-12-28
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices✓ Link97.78MobileVLM-1.7B (w/o LoRA, w/ extra data)2023-12-28
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link97.74LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)2023-06-01
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link97.35LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)2023-06-01
Efficient Multimodal Learning from Data-centric Perspective✓ Link96.61Bunny-v1.0-3B (w/ LoRA, w/o extra data)2024-02-18
Efficient Multimodal Learning from Data-centric Perspective✓ Link96.02Bunny-v1.0-3B (w/ LoRA, w/ extra data)2024-02-18
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link94.69MiniGPT-v2 (w/ LoRA, w/o extra data)2023-10-14
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day✓ Link90.4LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)2023-06-01
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning✓ Link87.65MiniGPT-v2 (w/ LoRA, w/ extra data)2023-10-14
Visual Instruction Tuning✓ Link86.87LLaVA-v1 (w/ LoRA, w/ extra data)2023-04-17
Visual Instruction Tuning✓ Link84.55LLaVA-v1 (w/ LoRA, w/o extra data)2023-04-17