Paper | Code | Accuracy | ModelName | ReleaseDate |
---|---|---|---|---|
Training Compute-Optimal Large Language Models | ✓ Link | 69.1 | Chinchilla-70B (few-shot, k=5) | 2022-03-29 |
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | ✓ Link | 56.4 | Gopher-280B (few-shot, k=5) | 2021-12-08 |
Galactica: A Large Language Model for Science | ✓ Link | 49.1 | OPT 175B | 2022-11-16 |
Galactica: A Large Language Model for Science | ✓ Link | 48.7 | GAL 120B (few-shot, k=5) | 2022-11-16 |
Galactica: A Large Language Model for Science | ✓ Link | 47.0 | GAL 30B (few-shot, k=5) | 2022-11-16 |
Galactica: A Large Language Model for Science | ✓ Link | 1.3 | BLOOM 176B | 2022-11-16 |