notesum.ai

Published at December 9

Visual Lexicon: Rich Image Features in Language Space

cs.CV
cs.AI
cs.LG

Released Date: December 9, 2024

Authors: XuDong Wang1, Xingyi Zhou1, Alireza Fathi1, Trevor Darrell2, Cordelia Schmid1

Aff.: 1Google DeepMind; 2UC Berkeley

Arxiv: http://arxiv.org/pdf/2412.06774v1