notesum.ai

Published at December 5

NVILA: Efficient Frontier Visual Language Models

cs.CV

Released Date: December 5, 2024

Authors: Zhijian Liu1, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao Lu

Aff.: 1NVIDIA

Arxiv: http://arxiv.org/pdf/2412.04468v1