GeneralReasoning/MMLU-Redux-2 | OpenReward