OpenCodePapers

Mathematical Reasoning on AIME24

Mathematical Reasoning
Leaderboard
| Paper | Code | Acc | Model name | Release date |
|---|---|---|---|---|
| Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | ✓ | 94.4 | Xolver | 2025-06-17 |
| DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | ✓ | 79.8 | DeepSeek-r1 | 2025-01-22 |
| - | - | 74.4 | Openai-o1 | - |
| - | - | 70.0 | Openai-o1-mini | - |
| Search-o1: Agentic Search-Enhanced Large Reasoning Models | ✓ | 56.7 | Search-o1 | 2025-01-09 |
| s1: Simple test-time scaling | ✓ | 56.7 | s1-32B | 2025-01-31 |
| - | - | 44.6 | Openai-o1-preview | - |
| Qwen2.5 Technical Report | ✓ | 23.3 | Qwen2.5-72B-Instruct | 2024-12-19 |
| - | - | 16 | Claude3.5-Sonnet | - |