MultiKernelBench
Description
MultiKernelBench is the first comprehensive, multi-platform benchmark for LLM-based deep learning kernel generation. It spans 285 tasks across 14 well-defined kernel categories and supports NVIDIA GPUs, Huawei NPUs, and Google TPUs. A modular backend abstraction makes it easy to integrate new hardware platforms. Its evaluation of generation quality includes a category-aware one-shot prompting method, in which the single in-context example given to the model is chosen to match the category of the target kernel.
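To make the two ideas in the description concrete, here is a minimal sketch of how a modular backend registry and category-aware one-shot prompting could fit together. It is not MultiKernelBench's actual code: the names `Backend`, `register_backend`, `BACKENDS`, `ONE_SHOT_EXAMPLES`, and `build_prompt`, the category labels, and the prompt wording are all assumptions made for illustration.

```python
from typing import Callable, Dict, Tuple, Type


class Backend:
    """Hypothetical interface that each hardware platform would implement."""

    language = "unknown"

    def check(self, kernel_source: str) -> bool:
        raise NotImplementedError


# Hypothetical registry mapping a platform name to its backend class.
BACKENDS: Dict[str, Type[Backend]] = {}


def register_backend(name: str) -> Callable[[Type[Backend]], Type[Backend]]:
    """Class decorator that records a backend under a platform name."""

    def decorator(cls: Type[Backend]) -> Type[Backend]:
        BACKENDS[name] = cls
        return cls

    return decorator


@register_backend("cuda")
class CudaBackend(Backend):
    language = "CUDA"

    def check(self, kernel_source: str) -> bool:
        # Placeholder check only; a real backend would compile and run the kernel.
        return "__global__" in kernel_source


# Hypothetical one-shot exemplars keyed by (platform, category).
# The category labels here are illustrative, not the benchmark's own list.
ONE_SHOT_EXAMPLES: Dict[Tuple[str, str], str] = {
    ("cuda", "activation"): "__global__ void relu(const float* x, float* y, int n) { /* body omitted */ }",
    ("cuda", "reduction"): "__global__ void sum(const float* x, float* out, int n) { /* body omitted */ }",
}


def build_prompt(platform: str, category: str, task: str) -> str:
    """Build a prompt whose single example comes from the task's own category."""
    backend = BACKENDS[platform]()
    example = ONE_SHOT_EXAMPLES[(platform, category)]
    return (
        f"Write a {backend.language} kernel.\n"
        f"Example from the '{category}' category:\n{example}\n"
        f"Task: {task}\n"
    )


prompt = build_prompt("cuda", "reduction", "Compute the maximum of a float array.")
print(prompt)
```

Under this sketch, supporting another platform would only require registering one more `Backend` subclass and adding exemplars for it; `build_prompt` itself would stay unchanged.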