notesum.ai

Published at October 30

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection

cs.SD
cs.AI
cs.LG
cs.MM
eess.AS

Released Date: October 30, 2024

Authors: Yoto Fujita1, Yoshiaki Bando2, Keisuke Imoto3, Masaki Onishi2, Kazuyoshi Yoshii1

Aff.: 1Graduate School of Informatics, Kyoto University, Japan; 2National Institute of Advanced Industrial Science and Technology, Japan; 3Faculty of Science and Engineering, Doshisha University, Japan

Arxiv: http://arxiv.org/abs/2410.22803v1