Paper | Code | CorrSc | ModelName | ReleaseDate |
---|---|---|---|---|
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code | ✓ Link | 0.848 | GPT-4 | 2023-12-22 |
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code | ✓ Link | 0.617 | GPT-3.5-Turbo | 2023-12-22 |
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code | ✓ Link | 0.327 | CodeLlama:13B-4bit-quantised | 2023-12-22 |
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code | ✓ Link | 0.289 | CodeLlama:7B-4bit-quantised | 2023-12-22 |
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code | ✓ Link | 0.063 | Command | 2023-12-22 |