notesum.ai

Published on November 4

MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs

Categories: cs.CL, cs.AI, cs.CV, cs.IR, cs.LG

Released Date: November 4, 2024

Authors: Sheng-Chieh Lin¹, Chankyu Lee¹, Mohammad Shoeybi¹, Jimmy Lin², Bryan Catanzaro¹, Wei Ping¹

Affiliations: ¹NVIDIA; ²University of Waterloo

arXiv: http://arxiv.org/abs/2411.02571v1