notesum.ai

Published at December 5

MegaCOIN: Enhancing Medium-Grained Color Perception for Vision-Language Models

cs.CV
cs.LG

Released Date: December 5, 2024

Authors: Ming-Chang Chiu1, Shicheng Wen2, Pin-Yu Chen3, Xuezhe Ma1

Aff.: 1USC; 2UC Davis; 3IBM Research

Arxiv: http://arxiv.org/pdf/2412.03927v1