OpenCodePapers

referring-expression-generation-on-coloninst-1

Referring expression generation
Leaderboard
| Paper | Code | Accuracy | Model Name | Release Date |
|---|---|---|---|---|
| Frontiers in Intelligent Colonoscopy | ✓ Link | 80.18 | ColonGPT (w/ LoRA, w/o extra data) | 2024-10-22 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | ✓ Link | 78.03 | MobileVLM-1.7B (w/ LoRA, w/ extra data) | 2023-12-28 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | ✓ Link | 75.25 | LLaVA-Med-v1.0 (w/o LoRA, w/ extra data) | 2023-06-01 |
| Efficient Multimodal Learning from Data-centric Perspective | ✓ Link | 75.08 | Bunny-v1.0-3B (w/ LoRA, w/ extra data) | 2024-02-18 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | ✓ Link | 75.07 | LLaVA-Med-v1.0 (w/o LoRA, w/o extra data) | 2023-06-01 |
| Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | ✓ Link | 74.30 | MGM-2B (w/o LoRA, w/ extra data) | 2024-03-27 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | ✓ Link | 73.14 | MobileVLM-1.7B (w/o LoRA, w/ extra data) | 2023-12-28 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | ✓ Link | 73.05 | LLaVA-Med-v1.5 (w/ LoRA, w/o extra data) | 2023-06-01 |
| Improved Baselines with Visual Instruction Tuning | ✓ Link | 72.88 | LLaVA-v1.5 (w/ LoRA, w/ extra data) | 2023-10-05 |
| MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | ✓ Link | 72.05 | MiniGPT-v2 (w/ LoRA, w/o extra data) | 2023-10-14 |
| Improved Baselines with Visual Instruction Tuning | ✓ Link | 70.38 | LLaVA-v1.5 (w/ LoRA, w/o extra data) | 2023-10-05 |
| MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | ✓ Link | 70.23 | MiniGPT-v2 (w/ LoRA, w/ extra data) | 2023-10-14 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | ✓ Link | 70.00 | LLaVA-Med-v1.5 (w/ LoRA, w/ extra data) | 2023-06-01 |
| Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | ✓ Link | 69.81 | MGM-2B (w/o LoRA, w/o extra data) | 2024-03-27 |
| Efficient Multimodal Learning from Data-centric Perspective | ✓ Link | 69.45 | Bunny-v1.0-3B (w/ LoRA, w/o extra data) | 2024-02-18 |
| Visual Instruction Tuning | ✓ Link | 68.11 | LLaVA-v1 (w/ LoRA, w/o extra data) | 2023-04-17 |
| Visual Instruction Tuning | ✓ Link | 46.85 | LLaVA-v1 (w/ LoRA, w/ extra data) | 2023-04-17 |