notesum.ai

Published at November 12

SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model

cs.SD
cs.AI
cs.CV
cs.MM
eess.AS

Released Date: November 12, 2024

Authors: Xinyuan Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola Garcia, Haizhou Li

Arxiv: http://arxiv.org/abs/2411.07751v1