sciclaimeval

Name: arXiv/sciclaimeval
Author: arXiv

arXiv/sciclaimeval

Description

SciClaimEval is a benchmark for scientific claim verification composed of authentic claims (including refuted ones) directly extracted from published papers across machine learning, natural language processing, and medicine. It generates refuted claims by modifying supporting evidence (figures and tables) rather than altering claims or using LLMs, and provides cross-modal evidence—figures as images and tables in image, LaTeX, HTML, and JSON formats—covering 1,664 expert-annotated samples from 180 papers.

arXiv

Leaderboard

Loading leaderboard...

Implementations (1)

Environment	Stars	Last Updated
Xanh/SciClaimEval	1	3 months ago