sciclaimeval

Description

SciClaimEval is a benchmark for scientific claim verification. Its claims, including the refuted ones, are authentic: they are extracted directly from published papers in machine learning, natural language processing, and medicine. Refuted claims are generated by modifying the supporting evidence (figures and tables) rather than by altering the claims or using LLMs. The evidence is cross-modal: figures are provided as images, and tables are provided in image, LaTeX, HTML, and JSON formats. The benchmark covers 1,664 expert-annotated samples from 180 papers.
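To make the evidence-modification idea concrete, here is a minimal Python sketch. It is only an illustration: the `Sample` class, its field names, the `make_refuted` helper, and the toy claim and numbers are all hypothetical and are not taken from the dataset. The actual schema and modification procedure may differ.

```python
from dataclasses import dataclass, replace
from typing import Optional


# Hypothetical schema: field names are illustrative, not the dataset's real columns.
@dataclass(frozen=True)
class Sample:
    claim: str                         # claim text, kept exactly as in the paper
    label: str                         # "supported" or "refuted"
    evidence_type: str                 # "figure" or "table"
    table_json: Optional[dict] = None  # JSON form of a table; figures would be images


def make_refuted(sample: Sample, row: str, column: str, new_value: float) -> Sample:
    """Build a refuted sample by editing the evidence, leaving the claim untouched."""
    if sample.table_json is None:
        raise ValueError("this sketch only handles table evidence in JSON form")
    # Copy the nested table so the original supported sample is not mutated.
    edited = {r: dict(cols) for r, cols in sample.table_json.items()}
    edited[row][column] = new_value
    return replace(sample, label="refuted", table_json=edited)


# Toy values, invented for illustration only.
supported = Sample(
    claim="Model A outperforms Model B in accuracy.",
    label="supported",
    evidence_type="table",
    table_json={"Model A": {"accuracy": 0.91}, "Model B": {"accuracy": 0.87}},
)
refuted = make_refuted(supported, "Model A", "accuracy", 0.80)
```

In this sketch the claim string is identical in both samples, and only the table and the label change. That mirrors the description's point that the claim text itself stays authentic.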

Implementations (1)
Environment | Stars | Last Updated
XanhXanh/SciClaimEval | 1 | 2 months ago