notesum.ai

Published at November 1

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

cs.SD
cs.AI
cs.CL
eess.AS

Released Date: November 1, 2024

Authors: Xiong Wang1, Yangze Li2, Chaoyou Fu3, Lei Xie2, Ke Li1, Xing Sun1, Long Ma1

Aff.: 1Tencent Youtu Lab; 2Audio, Speech and Language Processing Group (ASLP@NPU); 3Nanjing University

Arxiv: http://arxiv.org/abs/2411.00774v1