MLE-bench

Description

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering. It consists of 75 ML engineering-related competitions from Kaggle.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/MLE-Bench
0
4 weeks ago
OpenAI/MLE-bench | OpenReward