Ranking-aware adapter for text-driven image ordering with CLIP
cs.CV
Release Date: December 9, 2024
Authors: Wei-Hsiang Yu¹, Yen-Yu Lin¹, Ming-Hsuan Yang², Yi-Hsuan Tsai³
Affiliations: ¹National Yang Ming Chiao Tung University; ²UC Merced; ³Atmanity Inc.

| Method | Fine-tuning | PLCC | SRCC |
| --- | --- | --- | --- |
| BLIP-2 | | | |
| Flamingo (10-shot) | | | |
| InstructBLIP | | | |
| VLM-VILA | | | |
| Zero-shot CLIP | | | |
| CountingCLIP (Paiss et al., 2023) | ✓ | | |
| Ours | ✓ | | |
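
The table compares methods by PLCC (Pearson linear correlation coefficient) and SRCC (Spearman rank correlation coefficient) between predicted and ground-truth scores. The source does not show how these values were computed, so the following is only a minimal sketch of the standard definitions in plain Python. The function names and the sample scores are hypothetical and are not taken from the paper.

```python
import math


def pearson(x, y):
    """PLCC: linear correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """SRCC: Pearson correlation computed on the ranks of x and y."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical scores, for illustration only (not results from the paper).
predicted = [0.10, 0.40, 0.35, 0.80]
ground_truth = [1.0, 3.0, 2.0, 4.0]
plcc = pearson(predicted, ground_truth)
srcc = spearman(predicted, ground_truth)
```

In this illustrative example the predictions order the four items exactly as the ground truth does, so SRCC is 1 up to floating-point error, while PLCC is lower because the relationship is not perfectly linear. Higher values are better for both metrics.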