notesum.ai
Published at November 8Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
cs.CV
cs.AI
cs.RO
Released Date: November 8, 2024
Authors: Linfeng He1, Yiming Sun1, Sihao Wu2, Jiaxu Liu2, Xiaowei Huang2
Aff.: 1School of Computer Science, University of Nottingham, United Kingdom; 2Department of Computer Science, University of Liverpool, United Kingdom
| Experiment | Accuracy | ChatGPT | Match | Bleu_1 | Bleu_2 | Bleu_3 | Bleu_4 | ROUGE_l | CIDEr | Final_score |
| DriveLM-Agent | - | - | - | - | - | - | - | - | - | - |
| Our Method (Llama-Adapter) | 0.0 | 65.55 | 18.59 | 0.041 | 0.0002 | 0.000034 | 0.000014 | 0.076 | 0.082 | 0.3057 |
| Our Method (Yolos) | 58.243 | 0.0093 |