financeagentbenchmark

Description

Finance Agent Benchmark is a benchmark for evaluating LLM-driven finance agents on challenging, diverse real-world research problems that require complex analysis of recent SEC filings. It comprises 537 expert-authored, validated questions across nine financial task categories and includes an agentic harness with tools (e.g., Google Search and EDGAR) to measure model performance and practical deployment readiness.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/financeagent-terminal
0
1 months ago
arXiv/financeagentbenchmark | OpenReward