mathcanvas-bench

Description

MathCanvas-Bench is a benchmark for evaluating how well large multimodal models perform visually aided mathematical reasoning. Models must produce interleaved visual–textual solutions. Its 3,000 problems require generating diagrams and using them strategically alongside textual reasoning.

Implementations (1)

Environment: GeneralReasoning/MathCanvas
Stars: 0
Last Updated: 1 month ago