SciHorizon-GENE
Description
SciHorizon-GENE is a large-scale, gene-centric benchmark constructed from authoritative biological databases. It integrates curated knowledge for over 190K human genes and comprises more than 540K questions covering diverse gene-to-function reasoning scenarios relevant to cell type annotation, functional interpretation, and mechanism-oriented analysis. It evaluates LLMs along four biologically critical perspectives (research attention sensitivity, hallucination tendency, answer completeness, and literature influence) to probe failure modes that hinder faithful, complete, and literature-grounded gene-level reasoning.