mathcamps

Description

MathCAMPS is a benchmark for evaluating LLM mathematical problem-solving and reasoning that synthesizes high-quality math problems grounded in 44 fine-grained K–8 Common Core standards. It encodes each standard in a formal grammar to sample symbolic problems, uses LLMs to realize them as word problems with cycle-consistency validation, and derives follow-up question dialogues to probe robustness and the development of specific skills.

Leaderboard
Loading leaderboard...
Implementations

No implementations linked yet. Add one to showcase related work.

arXiv/mathcamps | OpenReward