OpenCodePapers

math-word-problem-solving-on-mawps

Mathematical ReasoningMath Word Problem Solving
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracy (%)ModelNameReleaseDate
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset✓ Link95.7OpenMath-CodeLlama-70B (w/ code)2024-02-15
Learning Multi-Step Reasoning by Solving Arithmetic Tasks✓ Link94.3MsAT-DeductReasoner2023-06-02
ATHENA: Mathematical Reasoning with Thought Expansion✓ Link93ATHENA (roberta-large)2023-11-02
Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem✓ Link92.3Multi-view2022-10-21
An Expression Tree Decoding Strategy for Mathematical Equation Generation✓ Link92.3Exp-Tree2023-10-14
ATHENA: Mathematical Reasoning with Thought Expansion✓ Link92.2ATHENA (roberta-base)2023-11-02
Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction✓ Link92Roberta-DeductReasoner2022-03-19
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements✓ Link91.0DeBERTa (PM + VM)2023-06-24
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers✓ Link88.7EPT
Are NLP Models really able to Solve Simple Math Word Problems?✓ Link88.7Graph2Tree with RoBERTa2021-03-12
Are NLP Models really able to Solve Simple Math Word Problems?✓ Link88.5GTS with RoBERTa2021-03-12
Generating Equation by Utilizing Operators : GEO model85.1GEO2020-12-01
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers✓ Link84.57EPT-X
Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer Model✓ Link84.51EPT
Graph-to-Tree Learning for Solving Math Word Problems✓ Link83.7Graph2Tree2020-07-01
Llama 2: Open Foundation and Fine-Tuned Chat Models✓ Link82.4LLaMA 2-Chat2023-07-18
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements✓ Link80.3GPT-3.5 turbo (175B)2023-06-24
[]()44.0Toolformer
[]()19.8GPT-3 (175B)
[]()15.0Toolformer (disabled)
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements✓ Link9.9GPT-J2023-06-24
[]()9.3GPT-J + CC
[]()7.9OPT (66B)
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements✓ Link4.09GPT-3 text-curie-001 (13B)2023-06-24
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements✓ Link2.76GPT-3 text-babbage-001 (6.7B)2023-06-24