notesum.ai

Published at November 21

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI

cs.CV

Released Date: November 21, 2024

Authors: Tianbin Li1, Yanzhou Su1, Wei Li2, Bin Fu2, Zhe Chen3, Ziyan Huang2, Guoan Wang4, Chenglong Ma5, Ying Chen6, Ming Hu7, Yanjun Li4, Pengcheng Chen8, Xiaowei Hu1, Zhongying Deng9, Yuanfeng Ji10, Jin Ye7, Yu Qiao, Junjun He1

Aff.: 1Shanghai AI Laboratory; 2Shanghai AI Laboratory, Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences; 3Shanghai AI Laboratory, Nanjing University; 4Shanghai AI Laboratory, East China Normal University; 5Shanghai AI Laboratory, Fudan University; 6Shanghai AI Laboratory, Xiamen University; 7Shanghai AI Laboratory, Monash University; 8Shanghai AI Laboratory, University of Washington; 9Shanghai AI Laboratory, University of Cambridge; 10Stanford University

Arxiv: http://arxiv.org/abs/2411.14522v1