OpenCodePapers
math-word-problem-solving-on-asdiv-a
Mathematical Reasoning
Math Word Problem Solving
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Execution Accuracy
↕
ModelName
ReleaseDate
↕
ATHENA: Mathematical Reasoning with Thought Expansion
✓ Link
91
ATHENA (roberta-large)
2023-11-02
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
✓ Link
87.6
MMOS-DeepSeekMath-7B(0-shot)
2024-02-23
ATHENA: Mathematical Reasoning with Thought Expansion
✓ Link
86.4
ATHENA (roberta-base)
2023-11-02
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
✓ Link
85.1
MMOS-CODE-34B(0-shot)
2024-02-23
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
✓ Link
84.7
OpenMath-CodeLlama-70B (w/ code)
2024-02-15
Are NLP Models really able to Solve Simple Math Word Problems?
✓ Link
82.2
Graph2Tree with RoBERTa
2021-03-12
Are NLP Models really able to Solve Simple Math Word Problems?
✓ Link
81.2
GTS with RoBERTa
2021-03-12
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
✓ Link
78.6
MMOS-CODE-7B(0-shot)
2024-02-23
Are NLP Models really able to Solve Simple Math Word Problems?
✓ Link
76.9
LSTM Seq2Seq with RoBERTa
2021-03-12