frontiercs

Name: arXiv/frontiercs
Author: arXiv

Open-ended Problem Solving in Computer Science

Description

FrontierCS is a benchmark of 156 expert-designed, open-ended computer science problems spanning diverse areas. Models must produce executable programs rather than direct answers, and each problem comes with an expert reference solution and an automatic evaluator. The benchmark targets tasks whose optimal solutions are unknown, including NP-hard algorithmic variants and research-style problems, so progress is measured by objective solution quality with partial scoring rather than binary correctness.
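
To illustrate the difference between partial scoring and binary correctness, here is a minimal sketch for a minimization problem. The scoring rule (the ratio of reference cost to candidate cost, capped at 1) and the names `partial_score` and `binary_score` are assumptions made for illustration; they are not taken from the FrontierCS evaluators, which may score each problem differently.

```python
from typing import Optional


def partial_score(candidate_cost: Optional[float], reference_cost: float) -> float:
    """Score a minimization result in [0, 1] relative to a reference solution.

    Hypothetical rule, not the FrontierCS formula. Costs are assumed to be
    strictly positive. A missing or invalid output scores 0, matching or
    beating the reference scores 1, and a worse result earns credit equal to
    reference_cost / candidate_cost.
    """
    if reference_cost <= 0:
        raise ValueError("reference_cost must be strictly positive")
    if candidate_cost is None or candidate_cost <= 0:
        return 0.0
    return min(1.0, reference_cost / candidate_cost)


def binary_score(candidate_cost: Optional[float], reference_cost: float) -> float:
    """Binary correctness for contrast: full credit only if the reference is matched."""
    if candidate_cost is None:
        return 0.0
    return 1.0 if candidate_cost <= reference_cost else 0.0


if __name__ == "__main__":
    reference = 100.0
    # None stands for a program that crashed or produced an invalid output.
    for cost in (None, 200.0, 125.0, 100.0):
        print(cost, partial_score(cost, reference), binary_score(cost, reference))
```

Under this assumed rule, a program whose solution costs twice as much as the reference still earns half credit, whereas a binary check would give it nothing. That graded signal is what makes it possible to track progress on problems with no known optimum.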

arXiv GitHub

Implementations (1)

Environment	Stars	Last Updated
GeneralReasoning/FrontierCS	1	3 months ago