heal-medvqa

Description

HEAL-MedVQA (Hallucination Evaluation via Localization MedVQA) is a benchmark for evaluating medical large multi-modal models' localization abilities and robustness to hallucinations in visual question answering. It consists of two evaluation protocols to assess visual and textual shortcut learning and a 67K VQA-pair dataset with doctor-annotated anatomical segmentation masks for pathological regions.

Leaderboard
Loading leaderboard...
Implementations

No implementations linked yet. Add one to showcase related work.

arXiv/heal-medvqa | OpenReward