OpenCodePapers

question-answering-on-strategyqa

Question Answering

Results over time

Click legend items to toggle metrics. Hover points for model names.

Leaderboard

Paper	Code	Accuracy	EM	ModelName	ReleaseDate
PaLM 2 Technical Report	✓ Link	90.4		PaLM 2 (few-shot, CoT, SC)	2023-05-17
Rethinking with Retrieval: Faithful Large Language Model Inference	✓ Link	77.73		Rethinking with retrieval (GPT-3)	2022-12-31
[]()		77.2		Self-Evaluation Guided Decoding (Codex, CoT, single reasoning chain, 6-shot gen, 4-shot eval)
Transcending Scaling Laws with 0.1% Extra Compute		76.6		U-PaLM 540B	2022-10-20
Transcending Scaling Laws with 0.1% Extra Compute		76.4		PaLM 540B	2022-10-20
Transcending Scaling Laws with 0.1% Extra Compute		61.9		Minerva 540B	2022-10-20
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models	✓ Link		79.2	CoA	2024-03-26
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks	✓ Link		77	SearchChain	2023-04-28
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models	✓ Link		77	SearchChain	2024-03-26
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models	✓ Link		70.6	CoA w/o actions	2024-03-26
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models	✓ Link		65.8	Least-to-Most	2022-05-21
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models	✓ Link		65.8	Least-to-Most	2024-03-26