OpenCodePapers

spatial-reasoning-on-6-dof-spatialbench

Visual Question Answering, Spatial Reasoning
Leaderboard
| Paper | Code | Total | Position-rel | Position-abs | Orientation-rel | Orientation-abs | Model Name | Release Date |
|---|---|---|---|---|---|---|---|---|
| SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation | ✓ Link | 43.9 | 59.6 | 33.8 | 54.6 | 31.3 | SoFar | 2025-02-18 |
| GPT-4o System Card | | 36.2 | 49.4 | 28.4 | 44.2 | 25.8 | GPT-4o | 2024-10-25 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | | 33.5 | 43.8 | 30.8 | 33.8 | 25.8 | RoboPoint | 2024-06-15 |
| SpatialBot: Precise Spatial Understanding with Vision Language Models | ✓ Link | 32.7 | 50.9 | 21.6 | 39.6 | 22.9 | SpatialBot | 2024-06-19 |
| SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities | | 28.9 | 33.6 | 29.2 | 27.2 | 25.0 | SpaceMantis | 2024-01-22 |
| SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities | | 28.2 | 32.4 | 30.5 | 30.9 | 24.9 | SpaceLLaVA | 2024-01-22 |
| Improved Baselines with Visual Instruction Tuning | ✓ Link | 27.2 | 30.9 | 24.5 | 28.3 | 25.8 | LLaVA-1.5 | 2023-10-05 |