Paper | Code | GPT-3.5 score | ModelName | ReleaseDate |
---|---|---|---|---|
Mixture-of-Subspaces in Low-Rank Adaptation | ✓ Link | 73.8 | LLaVA-InternLM2-ViT + MoSLoRA | 2024-06-16 |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | ✓ Link | 73.0 | CuMo-7B | 2024-05-09 |
Mixture-of-Subspaces in Low-Rank Adaptation | ✓ Link | 73.0 | LLaVA-LLaMA3-8B-ViT + MoSLoRA | 2024-06-16 |
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization | ✓ Link | 67.3 | Video-LaVIT | 2024-02-05 |
DreamLLM: Synergistic Multimodal Comprehension and Creation | ✓ Link | 49.9 | DreamLLM-7B | 2023-09-20 |