notesum.ai

Published at November 20

LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement

cs.CV
cs.AI

Released Date: November 20, 2024

Authors: Siwen Jiao1, Yangyi Fang2

Aff.: 1National University of Singapore, Agency for Science, Technology and Research, Singapore; 2Tsinghua University

Arxiv: http://arxiv.org/abs/2411.12980v1