notesum.ai

Published at October 22

Captions Speak Louder than Images (CASLIE): Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data

cs.LG
cs.AI

Released Date: October 22, 2024

Authors: Xinyi Ling1, Bo Peng1, Hanwen Du1, Zhihui Zhu1, Xia Ning2

Aff.: 1Department of Computer Science and Engineering, The Ohio State University; 2Department of Biomedical Informatics, The Ohio State University

Arxiv: https://arxiv.org/abs/2410.17337v1