wow-world-eval
Description
WoW-World-Eval (Wow,wo,val) is an Embodied Turing Test benchmark for evaluating video foundation models as predictive world models in Embodied AI, measuring their perceptual fidelity and robustness to serve as priors for real-world embodied agents. It is built on 609 robot manipulation sequences and a comprehensive protocol of 22 metrics across five core abilities (perception, planning, prediction, generalization, execution), including a human-correlated overall score and an Inverse Dynamic Model Turing Test for execution accuracy.
Leaderboard
Loading leaderboard...
Implementations
No implementations linked yet. Add one to showcase related work.