OpenCodePapers

question-answering-on-webquestions

Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeEMF1ModelNameReleaseDate
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link70.7CoA2024-03-26
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link64.7CoA w/o actions2024-03-26
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines✓ Link59.4DSP2023-10-05
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link59.4DSP2024-03-26
FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering56.3FiE+PAQ2022-11-18
FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering52.4FiE2022-11-18
FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference51.1FiDO2022-12-15
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks✓ Link45.2RAG2020-05-22
Language Models are Few-Shot Learners✓ Link44.7Few-shot2020-05-28
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link44.7Few-shot2024-03-26
PaLM: Scaling Language Modeling with Pathways✓ Link43.5PaLM-540B (Few-Shot)2022-04-05
Language Models are Unsupervised Multitask Learners✓ Link43Zero-shot2019-02-14
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link43Zero-shot2024-03-26
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer✓ Link42.8T5.1.1-XXL+SSM2019-10-23
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models✓ Link42.5CoT2022-01-28
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link42.5CoT2024-03-26
Dense Passage Retrieval for Open-Domain Question Answering✓ Link42.4DPR2020-04-10
Language Models are Few-Shot Learners✓ Link41.5GPT-3-175B (Few-Shot)2020-05-28
REALM: Retrieval-Augmented Language Model Pre-Training✓ Link40.7REALM2020-02-10
ReAct: Synergizing Reasoning and Acting in Language Models✓ Link38.3React2022-10-06
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link38.3React2024-03-26
Latent Retrieval for Weakly Supervised Open Domain Question Answering✓ Link36.4ORQA2019-06-01
Measuring and Narrowing the Compositionality Gap in Language Models✓ Link31.1Self-Ask2022-10-07
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link31.1Self-Ask2024-03-26
PaLM 2 Technical Report✓ Link28.2PaLM 2-L (one-shot)2023-05-17
PaLM 2 Technical Report✓ Link26.9PaLM 2-M (one-shot)2023-05-17
Tree of Thoughts: Deliberate Problem Solving with Large Language Models✓ Link26.3ToT2023-05-17
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models✓ Link26.3ToT2024-03-26
Language Models are Few-Shot Learners✓ Link25.3GPT-3-175B (One-Shot)2020-05-28
PaLM: Scaling Language Modeling with Pathways✓ Link22.6PaLM-540B (One-Shot)2022-04-05
PaLM 2 Technical Report✓ Link21.8PaLM 2-S (one-shot)2023-05-17
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts15.5GLaM 62B/64E (Zero-Shot)2021-12-13
Language Models are Few-Shot Learners✓ Link14.4GPT-3-175B (Zero-Shot)2020-05-28
PaLM: Scaling Language Modeling with Pathways✓ Link10.6PaLM-540B (Zero-Shot)2022-04-05
Large-scale Simple Question Answering with Memory Networks✓ Link42.2%Memory Networks (ensemble)2015-06-05
Question Answering with Subgraph Embeddings✓ Link39.2%Subgraph embeddings2014-06-14
Open Question Answering with Weakly Supervised Embedding Models29.7%Weakly Supervised Embeddings2014-04-16