OpenCodePapers

Mathematical Reasoning on Lila (OOD)

Mathematical Reasoning
Leaderboard
| Paper | Code | Accuracy | Model Name | Release Date |
|---|---|---|---|---|
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.586 | Codex (Few-Shot, 175B) | 2022-10-31 |
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.448 | Bhāskara-P (Fine-tuned, 2.7B) | 2022-10-31 |
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.384 | GPT-3 (Few-Shot, 175B) | 2022-10-31 |
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.268 | Bhāskara-A (Fine-tuned, 2.7B) | 2022-10-31 |
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.238 | Neo-P (Fine-tuned, 2.7B) | 2022-10-31 |
| Lila: A Unified Benchmark for Mathematical Reasoning | ✓ | 0.177 | Neo-A (Fine-tuned, 2.7B) | 2022-10-31 |