notesum.ai

Published on November 26

What's in the Image? A Deep-Dive into the Vision of Vision Language Models

cs.CV
cs.AI

Release Date: November 26, 2024

Authors: Omri Kaduri¹, Shai Bagon¹, Tali Dekel¹

Aff.: ¹Weizmann Institute of Science

arXiv: http://arxiv.org/abs/2411.17491v1