| Paper | Code | Acc | ModelName | ReleaseDate |
|---|---|---|---|---|
| Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | ✓ Link | 94.4 | Xolver | 2025-06-17 |
| DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | ✓ Link | 79.8 | DeepSeek-r1 | 2025-01-22 |
| []() | 74.4 | Openai-o1 | ||
| []() | 70.0 | Openai-o1-mini | ||
| Search-o1: Agentic Search-Enhanced Large Reasoning Models | ✓ Link | 56.7 | Search-o1 | 2025-01-09 |
| s1: Simple test-time scaling | ✓ Link | 56.7 | s1-32B | 2025-01-31 |
| []() | 44.6 | Openai-o1-preview | ||
| Qwen2.5 Technical Report | ✓ Link | 23.3 | Qwen2.5-72B-Instruct | 2024-12-19 |
| []() | 16 | Claude3.5-Sonnet |