OpenCodePapers
logical-reasoning-on-lingoly
Logical Reasoning
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
Show papers without code
Paper
Code
Delta_NoContext
↕
Exact Match Accuracy
↕
ModelName
ReleaseDate
↕
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
28.8%
46.3%
Claude Opus
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
25.1%
37.6%
GPT-4o
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
23.4%
32.1%
Gemini 1.5 Pro
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
21.5%
33.4%
GPT-4
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
11.6%
21.5%
Command R+
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
11.2%
21.2%
GPT-3.5
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
6.4%
14.2%
Mixtral 8x7B
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
4.9%
11.4%
Llama 3 8B
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
2.9%
10.3%
Llama 3 70B
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
2.2%
4.9%
Gemma 7B
2024-06-10
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
✓ Link
1.1%
6.4%
Llama 2 70B
2024-06-10