notesum.ai
Published on April 22
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models
NeurIPS
Release Date: April 22, 2024
Authors: An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang, Sifei Liu
Paper (OpenReview): https://openreview.net/pdf/72621ba2893cdc746a92fa241286edca2ca9aab0.pdf