globalpiqa

Description

Global PIQA (a participatory commonsense reasoning benchmark for over 100 languages) was constructed by 335 researchers from 65 countries and covers 116 language varieties across five continents, 14 language families, and 23 writing systems. It evaluates LLMs’ everyday and culturally-specific commonsense, with a non-parallel split where over 50% of examples reference local foods, customs, traditions, or other culturally-specific elements.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/GlobalPIQA
0
1 months ago
arXiv/globalpiqa | OpenReward