notesum.ai

Published at November 5

Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT

cs.SD
cs.AI
cs.LG
eess.AS

Released Date: November 5, 2024

Authors: Pourya Jafarzadeh1, Amir Mohammad Rostami1, Padideh Choobdar1

Aff.: 1Lab of Artin, TOSAN TECHNO

Arxiv: http://arxiv.org/abs/2411.02964v1