yearguessr
Description
YearGuessr is the largest open benchmark for evaluating vision-language models’ ability to predict buildings’ construction years and to expose popularity-driven memorization bias. It contains 55,546 building images from 157 countries with multimodal attributes (continuous ordinal construction-year labels 1001–2024, GPS, and page-view counts as a popularity proxy), frames year prediction as ordinal regression, and provides popularity-aware interval accuracy metrics evaluated across 30+ models.
Leaderboard
Loading leaderboard...
Implementations (1)
| Environment | Stars | Last Updated | |
|---|---|---|---|
0 | 1 months ago |