notesum.ai
Published at October 21Reducing Hallucinations in Vision-Language Models via Latent Space Steering
cs.CV
cs.AI
Released Date: October 21, 2024
Authors: Sheng Liu1, Haotian Ye1, James Zou1
Aff.: 1Stanford University

| Model | LLaVA-1.5 | InstructBLIP | Qwen-VL | |||
| Method | Accuracy | F1 Score | Accuracy | F1 Score | Accuracy | F1 Score |
| Vanilla | 79.8 | 79.4 | 76.3 | 78.0 | 83.5 | 81.2 |
| VCD | 82.3 | 83.4 | 80.1 | 81.0 | 84.5 | 83.3 |
| OPERA | 84.2 | 83.7 | 79.6 | 80.9 | 84.3 | 82.6 |
| VTI | 86.5 | 85.9 | 81.8 | 83.2 | 85.2 | 84.1 |