notesum.ai
Published at November 26DepthCues: Evaluating Monocular Depth Perception in Large Vision Models
cs.CV
Released Date: November 26, 2024
Authors: Duolikun Danier1, Mehmet Aygün, Changjian Li1, Hakan Bilen1, Oisin Mac Aodha1
Aff.: 1University of Edinburgh

| Model | NYUv2 Acc. (%) | DIW WHDR (%) |
|---|---|---|
| DINOv2 | 87.78 | 11.99 |
| DINOv2+DC | 87.06 | 11.95 |
| concat(DINOv2, noise) | 87.56 | 12.20 |
| concat(DINOv2, DINOv2+DC ) | 88.46 | 11.72 |
| CLIP | 43.78 | 35.25 |
| CLIP+DC | 43.59 | 35.45 |
| concat(CLIP, noise) | 43.38 | 35.39 |
| concat(CLIP, CLIP+DC ) | 44.32 | 33.53 |