ifeval
Description
Instruction-Following Eval (IFEval) is a straightforward, easy-to-reproduce benchmark for assessing LLMs’ ability to follow natural language instructions by focusing on verifiable constraints. It comprises around 500 prompts covering 25 types of verifiable instructions (e.g., “write in more than 400 words,” “mention the keyword of AI at least 3 times”), with each prompt containing one or more verifiable instructions.
Leaderboard
Loading leaderboard...
Implementations (1)
| Environment | Stars | Last Updated | |
|---|---|---|---|
0 | 1 months ago |