yearguessr

Description

YearGuessr is the largest open benchmark for evaluating vision-language models’ ability to predict buildings’ construction years and to expose popularity-driven memorization bias. It contains 55,546 building images from 157 countries with multimodal attributes (continuous ordinal construction-year labels 1001–2024, GPS, and page-view counts as a popularity proxy), frames year prediction as ordinal regression, and provides popularity-aware interval accuracy metrics evaluated across 30+ models.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/YearGuessr
0
1 months ago
arXiv/yearguessr | OpenReward