OpenCodePapers

Math Word Problem Solving on Math23K

Leaderboard
| Paper | Code | Accuracy (5-fold) | Accuracy (training-test) | Weakly-supervised | Model name | Release date |
| --- | --- | --- | --- | --- | --- | --- |
| Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models | ✓ | 94.3 | - | - | GPT-4 (Teaching-Inspired) | 2024-10-10 |
| Multi-View Reasoning: Consistent Contrastive Learning for Math Word Problem | ✓ | 85.2 | 87.1 | - | Multi-view* (ours) | 2022-10-21 |
| Generate & Rank: A Multi-task Framework for Math Word Problems | - | 84.3 | 85.4 | - | Generate and Rank | 2021-09-07 |
| An Expression Tree Decoding Strategy for Mathematical Equation Generation | ✓ | 84.1 | 86.2 | - | Exp-Tree | 2023-10-14 |
| - | - | 83.18 | 85.2 | - | REAL2: Memory-augmented Solver | - |
| Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction | ✓ | 83 | - | - | Roberta-DeductReasoner | 2022-03-19 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | ✓ | 82.4 | 84.7 | - | MWP-BERT | 2021-07-28 |
| Recall and Learn: A Memory-augmented Solver for Math Word Problems | ✓ | 80.8 | 82.3 | - | Recall and Learn | 2021-09-27 |
| MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers | ✓ | 76.6 | - | - | RoBERTaGen | 2021-09-02 |
| - | - | 75.9 | - | - | GTS w/ Data Augmentation | - |
| Graph-to-Tree Learning for Solving Math Word Problems | ✓ | 75.5 | 77.4 | - | Graph2Tree | 2020-07-01 |
| Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems | ✓ | 74.84 | - | - | SAU-Solver | 2020-10-14 |
| A Goal-Driven Tree-Structured Neural Model for Math Word Problems | ✓ | 74.3 | - | - | GTS | 2019-08-10 |
| Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions | ✓ | 66.9 | 69.5 | - | GROUP-ATT | 2019-07-01 |
| Deep Neural Solver for Math Word Problems | - | 64.7 | - | - | Hybrid model w/ SNI | 2017-09-01 |
| ATHENA: Mathematical Reasoning with Thought Expansion | ✓ | - | 86.5 | - | ATHENA (roberta-large) | 2023-11-02 |
| ATHENA: Mathematical Reasoning with Thought Expansion | ✓ | - | 84.4 | - | ATHENA (roberta-base) | 2023-11-02 |
| - | - | - | 66.9 | - | T-RNN | - |
| Learning by Fixing: Solving Math Word Problems with Weak Supervision | ✓ | - | - | 59.8 | LBF | 2020-12-19 |