multikernelbench

Description

MultiKernelBench is the first comprehensive, multi-platform benchmark for LLM-based deep learning kernel generation, spanning 285 tasks across 14 well-defined kernel categories and supporting Nvidia GPUs, Huawei NPUs, and Google TPUs. It provides a modular backend abstraction for easy integration of new hardware and evaluates generation quality including via a category-aware one-shot prompting method.

Leaderboard
Loading leaderboard...
Implementations

No implementations linked yet. Add one to showcase related work.

arXiv/multikernelbench | OpenReward