notesum.ai

Published at December 9

MuMu-LLaMA: Multi-modal Music Understanding and Generation via Large Language Models

cs.SD
cs.MM
eess.AS

Released Date: December 9, 2024

Authors: Shansong Liu1, Atin Sakkeer Hussain2, Qilong Wu2, Chenshuo Sun2, Ying Shan1

Aff.: 1ARC Lab, Tencent PCG; 2National University of Singapore

Arxiv: http://arxiv.org/pdf/2412.06660v1