OpenCodePapers

code-generation-on-apps

Code Generation
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeIntroductory Pass@1Interview Pass@1Competition Pass@1Competition Pass@anyInterview Pass@anyIntroductory Pass@anyCompetition Pass@5Interview Pass@5Introductory Pass@5Competition Pass@1000Interview Pass@1000Introductory Pass@1000Pass@1ModelNameReleaseDate
Planning-Driven Programming: A Large Language Model Programming Workflow✓ Link87.265.234.862.6LPW (GPT-4o)2024-11-21
MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks✓ Link68.4444.4927.84MoTCoder-32B-V1.52023-12-26
MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks✓ Link54.2632.6321.18MoTCoder-7B-V1.52023-12-26
CodeT: Code Generation with Generated Tests✓ Link47.3%14.3%6.2%code-davinci-002 175B (CodeT)2022-07-21
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence✓ Link33.8019.7011.09deepseek-ai/deepseek-coder-6.7b-instruct2024-01-25
CodeT: Code Generation with Generated Tests✓ Link31.92code-davinci-002 175B2022-07-21
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules✓ Link29.3%6.4%2.5%14.5%25.4%60.9%CodeChain+WizardCoder-15b2023-10-13
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules✓ Link26.297.493.75WizardCoder-15b2023-10-13
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging✓ Link26.044.210.81CodeSim (GPT4)2025-02-08
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning✓ Link2013.533.3CodeRL+CodeT52022-07-05
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning✓ Link6.77%1.80%0.69%15.70%14.33%38.10%2.36%4.48%15.27%15.70%14.33%38.10%GPT-J 6B (Finetuned)2022-07-05
Evaluating Large Language Models Trained on Code✓ Link5.60%1.00%0.50%13.51%13.15%35.20%1.00%1.73%9.20%13.51%13.15%35.20%Codex 12B (Raw)2021-07-07
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning✓ Link4.14%0.14%0.02%3.32%3.70%25.02%0.09%0.51%9.65%3.23%3.70%25.02%GPT-Neo 2.7B (Finetuned)2022-07-05
Measuring Coding Challenge Competence With APPS✓ Link3.90%0.57%0.00%11.40%9.83%27.90%0.00%0.80%5.50%11.40%9.83%27.90%GPT-Neo 2.7B2021-05-20
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning✓ Link3.90%0.57%0.00%0.0%0.80%5.50%0.00%0.80%5.50%GPT2 1.5B (Finetuned)2022-07-05
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving✓ Link1.30%0.70%0.00%8.80%9.27%25.00%0.00%1.03%3.60%8.80%9.27%25.00%MapCoder APPS-150-cherrypicked (GPT-4)2024-05-18
Competition-Level Code Generation with AlphaCode✓ Link22.0AlphaCode 1B Filtered from 500002022-02-08
Competition-Level Code Generation with AlphaCode✓ Link7.75%9.66%20.36%7.75%9.66%20.36%AlphaCode 1B2022-02-08