notesum.ai

Published at December 5

Liquid: Language Models are Scalable Multi-modal Generators

cs.CV

Released Date: December 5, 2024

Authors: Junfeng Wu1, Yi Jiang2, Chuofan Ma3, Yuliang Liu1, Hengshuang Zhao3, Zehuan Yuan2, Song Bai2, Xiang Bai1

Aff.: 1Huazhong University of Science and Technology; 2Bytedance Inc; 3University of Hong Kong

Arxiv: http://arxiv.org/pdf/2412.04332v1