OpenCodePapers

question-answering-on-strategyqa

Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeAccuracyEMModelNameReleaseDate
PaLM 2 Technical Report✓ Link90.4PaLM 2 (few-shot, CoT, SC)2023-05-17
Rethinking with Retrieval: Faithful Large Language Model Inference✓ Link77.73Rethinking with retrieval (GPT-3)2022-12-31
[]()77.2Self-Evaluation Guided Decoding (Codex, CoT, single reasoning chain, 6-shot gen, 4-shot eval)
Transcending Scaling Laws with 0.1% Extra Compute76.6U-PaLM 540B2022-10-20
Transcending Scaling Laws with 0.1% Extra Compute76.4PaLM 540B2022-10-20
Transcending Scaling Laws with 0.1% Extra Compute61.9Minerva 540B2022-10-20
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link79.2CoA2024-03-26
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks✓ Link77SearchChain2023-04-28
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link77SearchChain2024-03-26
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link70.6CoA w/o actions2024-03-26
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models✓ Link65.8Least-to-Most2022-05-21
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link65.8Least-to-Most2024-03-26