Paper | Code | pass@1 | Model Name | Release Date |
---|---|---|---|---|
A Case Study of Web App Coding with OpenAI Reasoning Models | ✓ Link | 0.952 | o1-preview | 2024-09-19 |
A Case Study of Web App Coding with OpenAI Reasoning Models | ✓ Link | 0.939 | o1-mini | 2024-09-19 |
Insights from Benchmarking Frontier Language Models on Web App Code Generation | ✓ Link | 0.885 | gpt-4o-2024-08-06 | 2024-09-08 |
Insights from Benchmarking Frontier Language Models on Web App Code Generation | ✓ Link | 0.8808 | claude-3.5-sonnet | 2024-09-08 |
A Case Study of Web App Coding with OpenAI Reasoning Models | ✓ Link | 0.834 | deepseek-v2.5 | 2024-09-19 |
Insights from Benchmarking Frontier Language Models on Web App Code Generation | ✓ Link | 0.7804 | mistral-large-2 | 2024-09-08 |
Insights from Benchmarking Frontier Language Models on Web App Code Generation | ✓ Link | 0.7002 | deepseek-coder-v2-instruct | 2024-09-08 |
Insights from Benchmarking Frontier Language Models on Web App Code Generation | ✓ Link | 0.302 | llama-v3p1-405b-instruct | 2024-09-08 |