notesum.ai

Published at November 27

From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects

cs.CV
cs.AI

Released Date: November 27, 2024

Authors: Zizhao Li1, Zhengkang Xiang1, Joseph West1, Kourosh Khoshelham1

Aff.: 1The University of Melbourne

Arxiv: http://arxiv.org/abs/2411.18207v1