productqa

Description

ProductQA is a benchmark for training and evaluating LLM agents on difficult online-shopping question-answering tasks. The tasks probe capabilities such as memory, tool use, reflection, and expert consultation. The dataset consists of product-related questions designed for agent interaction and reinforcement-learning fine-tuning within the AGILE framework.

