agentworldmodel

Description

Agent World Model (AWM) is a fully synthetic environment generation pipeline and benchmark of 1,000 executable, code-driven environments for training and evaluating multi-turn tool-use autonomous agents. It consists of diverse everyday scenarios with rich toolsets (35 tools per environment on average), database-backed state for reliable transitions, and high-quality observations that enable scalable reinforcement learning, reliable reward design, and out-of-distribution generalization evaluation.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/agent-world-model
0
1 months ago
arXiv/agentworldmodel | OpenReward