MLE-bench

Name: OpenAI/MLE-bench
Author: OpenAI

Description

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering. It consists of 75 Kaggle competitions related to ML engineering.

arXiv

Implementations (1)

Environment	Stars	Last Updated
GeneralReasoning/MLE-Bench	0	3 months ago