MLE-bench

Description

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering. It consists of 75 Kaggle competitions related to ML engineering.

Implementations (1)

Environment: GeneralReasoning/MLE-Bench
Stars: 0
Last updated: 1 month ago