API Endpoint

Leaderboard

Loading leaderboard...

Implementation of

arXiv/caselawqa

README

CaseLawQA

Description

CaseLawQA is an environment for evaluating legal text classification capabilities. It contains approximately 29,000 legal opinion analysis tasks from Supreme Court and Court of Appeals cases. Tasks cover 212-243 distinct legal classification categories including case jurisdiction, procedural posture, parties, legal issues, judicial decisions, and ideological direction.

Capabilities

Legal text comprehension and classification
Case analysis and jurisdiction identification
Procedural and substantive law understanding
Multiple-choice legal reasoning

Compute Requirements

Agents are given a standard environment with no sandbox or file system access.

License

MIT.

Tasks

There are three splits in this environment:

train: 9,693 tasks
val: 9,693 tasks
test: 9,674 tasks

Questions are multiple-choice format with 2-311 options per question depending on the classification task type.

Reward Structure

This is a single-turn environment. The agent submits a choice index via the submit_answer tool. Answer verification is deterministic exact match against the correct choice index. Reward is binary: 1.0 if correct, 0.0 if incorrect.

Data

Data consists of processed Parquet files sourced from HuggingFace ricdomolm/caselawqa-8k. Each row contains a legal opinion text, question, multiple-choice options, and correct answer index. Data is stored on the OpenReward platform.

Tools

Tool	Description
`submit_answer`	Submit the choice index (0 for A, 1 for B, etc.). Ends the episode.

Time Horizon

Single-turn. The agent reads the legal opinion and question, then submits one choice.

Environment Difficulty

CaseLawQA evaluates legal text classification across 200+ legal task types with varying numbers of answer choices.

Other Environment Requirements

No other secrets required other than OpenReward API key.

Safety

Agents in CaseLawQA analyze legal texts in a standard environment. The environment does not present direct safety risks.

Citation

@dataset{caselawqa2024,
  title={CaseLawQA: Legal Opinion Classification Dataset},
  author={HuggingFace},
  url={https://huggingface.co/datasets/ricdomolm/caselawqa-8k},
  year={2024}
}

Repository

Source repository

EnvCommons/CaseLawQA

Clone Repository

Tools

Tools available in the environment

No tools available for this environment, it probably hasn't been indexed yet.

Compute Configuration

Resource allocation for this environment.

Component	Configuration
Environment Server	1 vCPU / 4 GB RAM
Sandbox Machine	Not configured

Estimated Cost

Pay per second of active session usage. Billing starts when your session begins and stops when it ends.

Component	Cost / second
Environment	$0.0000320
Sandbox	Not configured
Total	$0.0000320

Examples

5-minute session$0.0096

1-hour session$0.1152

CaseLawQA

GeneralReasoning/CaseLawQA

CaseLawQA

Description

Capabilities

Compute Requirements

License

Tasks

Reward Structure

Data

Tools

Time Horizon

Environment Difficulty

Other Environment Requirements

Safety

Citation

Repository

Clone Repository

Tools

Compute Configuration

Estimated Cost

Examples