emma

Description

EMMA (Enhanced MultiModal reAsoning) is a benchmark for assessing organic multimodal reasoning in MLLMs across mathematics, physics, chemistry, and coding. It comprises tasks that require integrated cross-modal, multi-step reasoning that cannot be solved by independent unimodal reasoning, revealing significant limitations of state-of-the-art MLLMs even with Chain-of-Thought prompting and test-time compute scaling.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/EMMA
0
1 months ago
arXiv/emma | OpenReward