API Endpoint

Leaderboard

Loading leaderboard...

Implementation of

arXiv/secque

README

SecQue

Description

SecQue is an environment for evaluating financial analysis capabilities on SEC filings. It contains 565 expert-written questions covering comparison analysis, ratio calculation, risk assessment, and financial insight generation. Agents receive questions with relevant SEC filing context and must provide detailed analytical answers.

Capabilities

Financial document analysis
SEC filing interpretation
Ratio calculation and comparison analysis
Risk assessment and financial insight generation

Compute Requirements

Agents are given a standard environment with no sandbox or file system access.

License

MIT.

Tasks

There is one split in this environment:

test: 565 tasks

Questions cover companies including Discover Financial Services, NextEra Energy, Apple Inc., and NVIDIA. Context is provided as markdown-formatted SEC filing excerpts.

Reward Structure

This is a single-turn environment. The agent submits an answer via the answer tool. An LLM grader (gpt-5-mini) evaluates the response against expert reference answers, allowing for reasonable numerical rounding and minor formatting variations. Reward is binary: 1.0 if correct, 0.0 if incorrect.

Data

Data consists of a Parquet file (secque.parquet) sourced from HuggingFace nogabenyoash/SecQue. Each row contains a question, SEC filing context in markdown format, and expert reference answer. Data is stored on the OpenReward platform.

Tools

Tool	Description
`answer`	Submit your detailed financial analysis answer. Ends the episode.

Time Horizon

Single-turn. The agent reads the question and SEC filing context, then submits one answer.

Environment Difficulty

Model performance on SECQUE:

Model	Accuracy
GPT-4o	69%
Llama-3.3-70B-Instruct	65%
GPT-4o-mini	64%
Qwen2.5-32B-Instruct	61%
Phi-4	56%
Meta-Llama-3.1-8B-Instruct	48%

Ratio analysis and analyst insights are the most challenging categories, requiring complex numerical reasoning combined with financial understanding.

Other Environment Requirements

OpenAI API key required for LLM-based grading. Pass via secrets={"openai_api_key": "..."}.

Safety

Agents in SecQue analyze SEC filings in a standard environment. The environment does not present direct safety risks.

Citation

@article{benyoash2025secque,
  title={SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities},
  author={Ben Yoash, Noga and Brief, Meni and Ovadia, Oded and Levin-Schwartz, Yael and Eldan, Ronen},
  journal={arXiv preprint arXiv:2504.04596},
  year={2025}
}

Repository

Source repository

EnvCommons/SECQUE

Clone Repository

Tools

Tools available in the environment

No tools available for this environment, it probably hasn't been indexed yet.

Compute Configuration

Resource allocation for this environment.

Component	Configuration
Environment Server	1 vCPU / 4 GB RAM
Sandbox Machine	Not configured

Estimated Cost

Pay per second of active session usage. Billing starts when your session begins and stops when it ends.

Component	Cost / second
Environment	$0.0000320
Sandbox	Not configured
Total	$0.0000320

Examples

5-minute session$0.0096

1-hour session$0.1152

SECQUE

GeneralReasoning/SECQUE

SecQue

Description

Capabilities

Compute Requirements

License

Tasks

Reward Structure

Data

Tools

Time Horizon

Environment Difficulty

Other Environment Requirements

Safety

Citation

Repository

Clone Repository

Tools

Compute Configuration

Estimated Cost

Examples