OpenCodePapers

instruction-following-on-ifeval

Instruction Following
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodeInst-level loose-accuracyInst-level strict-accuracyPrompt-level loose-accuracyPrompt-level strict-accuracyModelNameReleaseDate
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models✓ Link90.486.785.680.2AutoIF (Llama3 70B)2024-06-19
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models✓ Link8886.182.380.2AutoIF (Qwen2 72B)2024-06-19
Instruction-Following Evaluation for Large Language Models✓ Link85.3783.5779.376.89GPT-42023-11-14
Instruction-Following Evaluation for Large Language Models✓ Link59.1155.7646.9543.07PaLM 2 S2023-11-14