SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning
cs.CV
Released Date: December 5, 2024
Authors: Seokju Yun¹, Seunghye Chae, Dongheon Lee, Youngmin Ro
Aff.: ¹Machine Intelligence Laboratory, University of Seoul, Korea
![[Uncaptioned image]](https://arxiv.org/html/2412.04077v1/x1.png)
| Synthetic-to-Real Generalization | | | Test Domains (mIoU in %) | | | |
|---|---|---|---|---|---|---|
| Methods | Backbone | Params. | Citys. | BDD | Map. | Avg. |
| Single-source DGSS Trained on GTAV | ||||||
| CLOUDS [3] | CLIP-CN-L | 0.0M | 60.20 | 57.40 | 67.00 | 61.50 |
| VLTSeg [36] | EVA02-L | 304.2M | 65.30 | 58.30 | 66.00 | 63.20 |
| Rein [80] | EVA02-L | 3.0M | 65.30 | 60.50 | 64.90 | 63.60 |
| FADA [4] | EVA02-L | 11.7M | 66.70 | 61.90 | 66.10 | 64.90 |
| tqdm [59] | EVA02-L | 304.2M | 68.88 | 59.18 | 70.10 | 66.05 |
| SoRA (Ours) | EVA02-L | 5.1M | 68.05 | 60.81 | 68.33 | 65.73 |
| SoRA (Ours) | EVA02-L | 5.1M | 69.94 | 62.48 | 68.33 | 66.92 |
| DoRA [52] | DINOv2-L | 7.5M | 66.12 | 59.31 | 67.07 | 64.17 |
| VPT [37] | DINOv2-L | 3.7M | 68.75 | 58.64 | 68.32 | 65.24 |
| SET [86] | DINOv2-L | 6.1M | 68.06 | 61.64 | 67.68 | 65.79 |
| FADA [4] | DINOv2-L | 11.7M | 68.23 | 61.94 | 68.09 | 66.09 |
| AdaptFormer [12] | DINOv2-L | 6.3M | 70.10 | 59.81 | 68.77 | 66.23 |
| SSF [51] | DINOv2-L | 0.5M | 68.97 | 61.30 | 68.77 | 66.35 |
| LoRA [33] | DINOv2-L | 7.3M | 70.13 | 60.13 | 70.42 | 66.89 |
| Rein [80] | DINOv2-L | 3.0M | 70.68 | 62.51 | 69.61 | 67.60 |
| SoRA (Ours) | DINOv2-L | 4.9M | 71.82 | 61.31 | 71.67 | 68.27 |
| SoRA (Ours) | DINOv2-L | 4.9M | 73.63 | 63.33 | 70.98 | 69.31 |
| Multi-source DGSS Trained on GTAV + SYNTHIA | ||||||
| Rein [80] | DINOv2-L | 3.0M | 72.17 | 61.53 | 70.69 | 68.13 |
| SoRA (Ours) | DINOv2-L | 4.9M | 73.16 | 61.90 | 72.73 | 69.26 |
| SoRA (Ours) | DINOv2-L | 4.9M | 74.85 | 63.59 | 73.92 | 70.79 |
| Multi-source DGSS Trained on GTAV + SYNTHIA + UrbanSyn | ||||||
| FFT | DINOv2-L | 304.2M | 75.90 | 60.93 | 72.80 | 69.88 |
| SoRA (Ours) | DINOv2-L | 4.9M | 77.33 | 62.78 | 74.93 | 71.68 |
| | DINOv2-L | 307.3M | 77.06 | 61.81 | 75.09 | 71.32 |
| Rein [80] | DINOv2-L | 3.0M | 78.42 | 62.20 | 74.49 | 71.70 |
| SoRA (Ours) | DINOv2-L | 4.9M | 79.22 | 63.84 | 76.30 | 73.12 |
| Freeze | DINOv2-G | 0.0M | 76.08 | 61.98 | 72.23 | 70.10 |
| FFT | DINOv2-G | 1.1B | 76.90 | 61.69 | 73.53 | 70.71 |
| SoRA (Ours) | DINOv2-G | 6.6M | 78.39 | 63.75 | 75.16 | 72.43 |
| SoRA (Ours) | DINOv2-G | 6.6M | 80.37 | 65.67 | 76.18 | 74.07 |