notesum.ai

Published at November 5

HumanVLM: Foundation for Human-Scene Vision-Language Model

cs.AI
cs.MM

Released Date: November 5, 2024

Authors: Dawei Dai1, Xu Long1, Li Yutang1, Zhang Yuanhui1, Shuyin Xia1

Aff.: 1Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, 400065, Chongqing, China

Arxiv: http://arxiv.org/abs/2411.03034v1