notesum.ai

Published at November 25

Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines

cs.CL

Released Date: November 25, 2024

Authors: Zi-Ao Ma1, Tian Lan1, Rong-Cheng Tu2, Yong Hu3, Heyan Huang1, Xian-Ling Mao1

Aff.: 1School of Computer Science and Technology, Beijing Institute of Technology, China; 2Nanyang Technological University, Singapore; 3WeChat AI, Tencent Inc., China

Arxiv: http://arxiv.org/abs/2411.16365v1