emobox

Description

EmoBox is an out-of-the-box multilingual multi-corpus speech emotion recognition toolkit and benchmark for both intra-corpus and cross-corpus evaluation. It provides carefully designed dataset splits for intra-corpus tests, uses the foundation SER model emotion2vec to create fully balanced speaker-and-emotion cross-corpus test sets, and includes baseline results from 10 pre-trained models across 32 emotion datasets in 14 languages (intra-corpus) and 4 datasets (cross-corpus), forming the largest multilingual multi-corpus SER benchmark to date.

Leaderboard
Loading leaderboard...
Implementations

No implementations linked yet. Add one to showcase related work.

arXiv/emobox | OpenReward