plawbench

Description

PLawBench (Practical Law Benchmark) evaluates how well LLMs perform realistic legal practice tasks. It assesses fine-grained legal reasoning, issue identification, fact spotting, and the generation of legally coherent documents. The benchmark contains 850 questions across 13 practical legal scenarios, graded against expert-designed rubrics that total about 12,500 rubric items, and it uses an LLM-based evaluator aligned with human expert judgments for detailed assessment.
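
The figures in the description imply roughly 14.7 rubric items per question on average (12,500 / 850). To illustrate how rubric-based grading of this kind can be aggregated, here is a minimal sketch. It is not PLawBench's actual scoring code: the `RubricItem` class, the `points` weights, the `satisfied` verdicts, and the plain averaging across questions are all hypothetical assumptions, and the example rubric texts are invented for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RubricItem:
    """One rubric line for a question (hypothetical structure)."""
    description: str
    points: float    # assumed weight of this item
    satisfied: bool  # assumed verdict from an LLM-based evaluator


def rubric_score(items: List[RubricItem]) -> float:
    """Fraction of available rubric points earned on one question."""
    total = sum(item.points for item in items)
    if total == 0:
        return 0.0
    earned = sum(item.points for item in items if item.satisfied)
    return earned / total


def benchmark_score(per_question: List[List[RubricItem]]) -> float:
    """Unweighted mean of per-question scores (an assumed aggregation)."""
    scores = [rubric_score(items) for items in per_question]
    return sum(scores) / len(scores) if scores else 0.0


# Invented example data: two questions with made-up rubric items.
example = [
    [
        RubricItem("Identifies the limitation period issue", 2.0, True),
        RubricItem("Spots the missing signature fact", 1.0, False),
    ],
    [
        RubricItem("Drafts a coherent demand letter", 3.0, True),
    ],
]

overall = benchmark_score(example)

# Average rubric items per question implied by the description's figures.
avg_items_per_question = 12_500 / 850
```

Scoring each question as a fraction of its own rubric points keeps questions with long rubrics from dominating the overall mean. A real evaluator may weight items or scenarios differently.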

Implementations (1)
Environment | Stars | Last Updated
GeneralReasoning/PLawBench | 0 | 1 month ago