asr-germanmed

Description

A curated dataset of simulated doctor-patient conversations (German medical ASR benchmark) is a benchmark for evaluating German-language automatic speech recognition systems in clinical contexts, explicitly covering dialectal variations and medical terminology. It evaluates 29 ASR models—including open-weight Whisper, Voxtral, and Wav2Vec2 families as well as commercial APIs (AssemblyAI, Deepgram)—using WER, CER, and BLEU to reveal large performance differences, particularly on medical terms and dialect-influenced speech.

Leaderboard
Loading leaderboard...
Implementations

No implementations linked yet. Add one to showcase related work.

arXiv/asr-germanmed | OpenReward