notesum.ai

Published on December 9

Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels

cs.CV

Release Date: December 9, 2024

Authors: Weijie Tu¹, Weijian Deng¹, Dylan Campbell¹, Yu Yao², Jiyang Zheng², Tom Gedeon³, Tongliang Liu²

Affiliations: ¹Australian National University; ²University of Sydney; ³Curtin University

arXiv: http://arxiv.org/pdf/2412.06461v1