GeneralReasoning/PaperSearchQA | OpenReward