OpenCodePapers

fact-verification-on-kilt-fever

Fact Verification
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeKILT-ACR-PrecRecall@5AccuracyModelNameReleaseDate
Re2G: Retrieve, Rerank, Generate✓ Link78.5388.9292.5289.55Re2G2022-07-13
[]()71.2881.4589.5689.54intersect
[]()65.6874.7787.8988.99Wikipedia
[]()64.4175.684.9585.58KGI
[]()63.9474.4887.5286.32Multitask DPR + BART
[]()58.5872.9373.5269.68BERT + DPR
KILT: a Benchmark for Knowledge Intensive Language Tasks✓ Link53.4561.9475.5586.31RAG2020-09-04
[]()47.6855.3374.2986.74BART + DPR
[]()41.8849.2470.1666.1NSMN
[]()0.084.4588.620.0TABi
[]()0.084.0789.410.0chriskuei
[]()0.083.6488.150.0GENRE
[]()0.074.4887.520.0Multi-task DPR
[]()0.00.00.089.12Sphere
[]()0.00.00.088.45aa_evalai
[]()0.00.00.078.93BART
KILT: a Benchmark for Knowledge Intensive Language Tasks✓ Link0.00.00.076.3T5-base2020-09-04
[]()0.00.00.076.26GENRE+roBERTa finetuning
[]()0.00.00.072.34SVM with rbf kernel
[]()0.00.00.071.58ElefPav
[]()0.00.00.071.42Alessandro_Tansel
[]()0.00.00.071.38JuanTran
[]()0.00.00.071.24Logistic Regression
[]()0.00.00.071.12QDA
[]()0.00.00.070.71SVM
[]()0.00.00.069.71stupidTeam
[]()0.00.00.069.41QDA_EMB2
[]()0.00.00.068.43SVM
[]()0.00.00.067.98Marco Aurelio Sterpa
[]()0.00.00.061.6its_all_greek_to_me
[]()0.00.00.033.58multi-task small
[]()0.00.00.023.01LogisticRegression
[]()0.00.00.012.57galimaldo