OpenCodePapers

question-answering-on-newsqa

Question Answering
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeEMF1ModelNameReleaseDate
o3-mini vs DeepSeek-R1: Which One is Safer?✓ Link92.5293.13OpenAI/o3-2025-01-31-high2025-01-30
DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing✓ Link92.1494.01Riple/Saanvi-v0.5-DeepAnalysis2016-11-07
Thinking Like Transformers✓ Link88.2491.31OpenAI/o4-mini-2025-05-01-high2021-06-13
0/1 Deep Neural Networks via Block Coordinate Descent81.4488.72OpenAI/o1-2024-12-17-high2022-06-19
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning✓ Link80.5786.13deepseek-r12025-01-22
Claude 3.5 Sonnet Model Card Addendum74.2382.3Anthropic/claude-3-7-sonnet2024-06-24
Time-series Transformer Generative Adversarial Networks✓ Link72.6185.44Riple/Saanvi-v0.12022-05-23
XAI for Transformers: Better Explanations through Conservative Propagation✓ Link70.5788.24xAI/grok-3-12122022-02-15
GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data70.2181.74OpenAI/GPT-4o2024-10-03
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context✓ Link68.7579.91Google/Gemini 2.5 Pro2024-03-08
Learning to Generate Questions by Learning to Recover Answer-containing Sentences54.764.5BERT+ASGen
Densely Connected Attention Propagation for Reading Comprehension✓ Link53.166.3DecaProp2018-11-10
Efficient and Robust Question Answering from Minimal Context over Documents✓ Link50.163.2MINIMAL(Dyn)2018-05-21
A Question-Focused Multi-Factor Attention Network for Question Answering✓ Link48.463.7AMANDA2018-01-25
Making Neural QA as Simple as Possible but not Simpler✓ Link43.756.1FastQAExt2017-03-14
SpanBERT: Improving Pre-training by Representing and Predicting Spans✓ Link73.6SpanBERT2019-07-24
LinkBERT: Pretraining Language Models with Document Links✓ Link72.6LinkBERT (large)2022-03-29
DyREx: Dynamic Query Representation for Extractive Question Answering✓ Link68.53DyREX2022-10-26