OpenCodePapers
Code Generation
Leaderboard
| Paper | Code | pass@1 (%) | Model | Release date |
|---|---|---|---|---|
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 58.0 | QurrentOS-coder + Claude 3.5 Sonnet | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 46.0 | QurrentOS-coder + GPT-4o | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 37.0 | QurrentOS-coder + GPT-4 Turbo | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 36.0 | QurrentOS-coder + Claude 3 Opus | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 30.0 | QurrentOS-coder + GPT-4 | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 30.0 | QurrentOS-coder + Gemini 1.5 Pro | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 29.0 | QurrentOS-coder + DeepSeek-Coder-V2 | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 20.0 | QurrentOS-coder + Llama 3 70b | 2024-06-24 |
| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | ✓ Link | 18.0 | QurrentOS-coder + Qwen-72B-Instruct | 2024-06-24 |