Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels
cs.CV
Released Date: December 9, 2024
Authors: Weijie Tu¹, Weijian Deng¹, Dylan Campbell¹, Yu Yao², Jiyang Zheng², Tom Gedeon³, Tongliang Liu²
Aff.: ¹Australian National University; ²University of Sydney; ³Curtin University

| Method | MCVQ: AI2D | MCVQ: MMMU | VQA: TextVQA | VQA: ChartQA | Average |
| --- | --- | --- | --- | --- | --- |
| AoL | | | | | |
| NLLmin | | | | | |
| NLLavg | | | | | |
| Entmax | | | | | |
| Entavg | | | | | |
| SampleBLEU | | | | | |
| ATC | | | | | |