cns-bench

Description

CNS-Bench (Continuous Nuisance Shift Benchmark) quantifies the out-of-distribution robustness of image classifiers under continuous, realistic, generative nuisance shifts. It applies LoRA adapters to diffusion models to generate a wide range of individual nuisance shifts at continuous severities, and it uses a novel filtering mechanism to keep the benchmark reliable. Together, these enable large-scale evaluation and the identification of model failure points.
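
To make the idea of evaluating over continuous severities and locating failure points concrete, here is a minimal sketch in Python. It is not the official CNS-Bench evaluation code: the record layout `(image_id, severity, is_correct)`, the function names, and the severity values are all assumptions made for illustration. It computes accuracy per severity level and, for each image, the lowest severity at which the classifier's prediction is wrong.

```python
from collections import defaultdict


def accuracy_by_severity(records):
    """Return {severity: accuracy} from (image_id, severity, is_correct) tuples.

    The record layout is a hypothetical one chosen for this sketch.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for _image_id, severity, is_correct in records:
        totals[severity] += 1
        hits[severity] += int(is_correct)
    return {s: hits[s] / totals[s] for s in sorted(totals)}


def failure_points(records):
    """Return {image_id: lowest severity with a wrong prediction, or None}."""
    first_fail = {}
    for image_id, severity, is_correct in records:
        first_fail.setdefault(image_id, None)
        if not is_correct:
            prev = first_fail[image_id]
            if prev is None or severity < prev:
                first_fail[image_id] = severity
    return first_fail


# Toy data: two images, each evaluated at three made-up severity levels.
records = [
    ("img_a", 0.0, True), ("img_a", 1.0, True), ("img_a", 2.0, False),
    ("img_b", 0.0, True), ("img_b", 1.0, False), ("img_b", 2.0, False),
]

acc = accuracy_by_severity(records)
fails = failure_points(records)
print(acc)    # accuracy at each severity level
print(fails)  # per-image failure point
```

On the toy data, accuracy drops as severity rises, and the two images fail at different severities. That per-image breakdown is the kind of information a single aggregate accuracy number would hide.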
