notesum.ai

Published at November 27

SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

eess.AS
cs.AI
cs.CL
cs.SD

Released Date: November 27, 2024

Authors: Wenyi Yu1, Siyin Wang1, Xiaoyu Yang2, Xianzhao Chen3, Xiaohai Tian3, Jun Zhang3, Guangzhi Sun2, Lu Lu3, Yuxuan Wang3, Chao Zhang1

Aff.: 1Tsinghua University; 2University of Cambridge; 3ByteDance

Arxiv: http://arxiv.org/abs/2411.18138v1