OpenCodePapers
Question Answering on StrategyQA
Leaderboard
| Paper | Code | Accuracy | EM | Model Name | Release Date |
|---|---|---|---|---|---|
| PaLM 2 Technical Report | ✓ | 90.4 | | PaLM 2 (few-shot, CoT, SC) | 2023-05-17 |
| Rethinking with Retrieval: Faithful Large Language Model Inference | ✓ | 77.73 | | Rethinking with retrieval (GPT-3) | 2022-12-31 |
| | | 77.2 | | Self-Evaluation Guided Decoding (Codex, CoT, single reasoning chain, 6-shot gen, 4-shot eval) | |
| Transcending Scaling Laws with 0.1% Extra Compute | | 76.6 | | U-PaLM 540B | 2022-10-20 |
| Transcending Scaling Laws with 0.1% Extra Compute | | 76.4 | | PaLM 540B | 2022-10-20 |
| Transcending Scaling Laws with 0.1% Extra Compute | | 61.9 | | Minerva 540B | 2022-10-20 |
| Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | ✓ | | 79.2 | CoA | 2024-03-26 |
| Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks | ✓ | | 77 | SearchChain | 2023-04-28 |
| Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | ✓ | | 77 | SearchChain | 2024-03-26 |
| Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | ✓ | | 70.6 | CoA w/o actions | 2024-03-26 |
| Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | ✓ | | 65.8 | Least-to-Most | 2022-05-21 |
| Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | ✓ | | 65.8 | Least-to-Most | 2024-03-26 |