OpenCodePapers

Math Word Problem Solving on ASDiv-A

Mathematical Reasoning / Math Word Problem Solving
Leaderboard
| Paper | Code | Execution Accuracy | Model Name | Release Date |
|---|---|---|---|---|
| ATHENA: Mathematical Reasoning with Thought Expansion | ✓ | 91 | ATHENA (roberta-large) | 2023-11-02 |
| An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | ✓ | 87.6 | MMOS-DeepSeekMath-7B(0-shot) | 2024-02-23 |
| ATHENA: Mathematical Reasoning with Thought Expansion | ✓ | 86.4 | ATHENA (roberta-base) | 2023-11-02 |
| An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | ✓ | 85.1 | MMOS-CODE-34B(0-shot) | 2024-02-23 |
| OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | ✓ | 84.7 | OpenMath-CodeLlama-70B (w/ code) | 2024-02-15 |
| Are NLP Models really able to Solve Simple Math Word Problems? | ✓ | 82.2 | Graph2Tree with RoBERTa | 2021-03-12 |
| Are NLP Models really able to Solve Simple Math Word Problems? | ✓ | 81.2 | GTS with RoBERTa | 2021-03-12 |
| An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | ✓ | 78.6 | MMOS-CODE-7B(0-shot) | 2024-02-23 |
| Are NLP Models really able to Solve Simple Math Word Problems? | ✓ | 76.9 | LSTM Seq2Seq with RoBERTa | 2021-03-12 |