Paper | Code | Acc | ModelName | ReleaseDate |
---|---|---|---|---|
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | ✓ Link | 94.4 | Xolver | 2025-06-17 |
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | ✓ Link | 79.8 | DeepSeek-r1 | 2025-01-22 |
[]() | 74.4 | Openai-o1 | ||
[]() | 70.0 | Openai-o1-mini | ||
Search-o1: Agentic Search-Enhanced Large Reasoning Models | ✓ Link | 56.7 | Search-o1 | 2025-01-09 |
s1: Simple test-time scaling | ✓ Link | 56.7 | s1-32B | 2025-01-31 |
[]() | 44.6 | Openai-o1-preview | ||
Qwen2.5 Technical Report | ✓ Link | 23.3 | Qwen2.5-72B-Instruct | 2024-12-19 |
[]() | 16 | Claude3.5-Sonnet |