notesum.ai

Published at November 13

Enhancing Multimodal Query Representation via Visual Dialogues for End-to-End Knowledge Retrieval

cs.CV
cs.AI
cs.IR
cs.MM

Released Date: November 13, 2024

Authors: Yeong-Joon Ju1, Ho-Joong Kim1, Seong-Whan Lee1

Aff.: 1Department of Artificial Intelligence, Korea University

Arxiv: http://arxiv.org/abs/2411.08334v1