OpenCodePapers

multi-task-language-understanding-on-mgsm

Multi-Task LearningMulti-task Language Understanding

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Average (%)	ModelName	ReleaseDate
PaLM 2 Technical Report	✓ Link	87.0	PaLM 2 (few-shot, k=8, SC)	2023-05-17
PaLM 2 Technical Report	✓ Link	72.2	PaLM 2 (8-shot, CoT)	2023-05-17
Scaling Instruction-Finetuned Language Models	✓ Link	72.0	Flan-PaLM 540B (8-shot, fine-tuned, CoT + SC)	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	60.4	Flan-U-PaLM 540B (CoT)	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	57.0	Flan-PaLM 540B (8-shot, fine-tuned, CoT)	2022-10-20
PaLM: Scaling Language Modeling with Pathways	✓ Link	55.0	PaLM 540B	2022-04-05
Transcending Scaling Laws with 0.1% Extra Compute		49.9	U-PaLM 540B (CoT)	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	36	text-davinci-003	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	35	code-davinci-002	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	23.7	text-davinci-002	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	21.2	Flan-PaLM 540B (8-shot, fine-tuned)	2022-10-20
Scaling Instruction-Finetuned Language Models	✓ Link	5.7	GPT-3 Davinci 175B	2022-10-20