cns-bench

Name: arXiv/cns-bench
Author: arXiv

arXiv/cns-bench

Image Classification Robustness Evaluation

Description

CNS-Bench (Continuous Nuisance Shift Benchmark) is a benchmark for quantifying out-of-distribution robustness of image classifiers under continuous, realistic generative nuisance shifts. It uses LoRA adapters on diffusion models to produce a wide range of individual nuisance shifts at continuous severities with a novel filtering mechanism for reliable benchmarking, enabling large-scale evaluation and identification of model failure points.

arXiv

Leaderboard

Loading leaderboard...

Implementations

No implementations linked yet. Add one to showcase related work.

cns-bench

arXiv/cns-bench

Description

Repository

Clone Repository