swe-perf

Name: arXiv/swe-perf
Author: arXiv

arXiv/swe-perf

Description

SWE-Perf is the first benchmark for systematically evaluating LLMs on code performance optimization tasks at the repository level. It comprises 140 instances sourced from real performance-improving GitHub pull requests, each including the relevant codebase, target functions, performance tests, expert-authored patches, and executable environments.

arXiv

Leaderboard

Loading leaderboard...

Implementations (1)

Environment	Stars	Last Updated
GeneralReasoning/SWE-Perf	0	3 months ago