notesum.ai

Published on November 25

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics

cs.CV
cs.AI
cs.CL
cs.RO

Release Date: November 25, 2024

Authors: Chan Hee Song¹, Valts Blukis², Jonathan Tremblay², Stephen Tyree², Yu Su¹, Stan Birchfield²

Affiliations: ¹The Ohio State University; ²NVIDIA

arXiv: http://arxiv.org/abs/2411.16537v1