notesum.ai
Published at November 22Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation
cs.CV
cs.AI
Released Date: November 22, 2024
Authors: Jeongsol Kim1, Beomsu Kim2, Jong Chul Ye2
Aff.: 1Department of Bio and Brain Engineering, KAIST; 2Kim Jaechul Graduate School of AI, KAIST
![[Uncaptioned image]](https://arxiv.org/html/2411.14863v1/extracted/6017942/figures/overview.jpg)
| Cat Dog | Horse Zebra | Dog Wild* | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Source | DDIB | PnP | SDEdit | Ours | Source | DDIB | PnP | SDEdit | Ours | Source | DDIB | PnP | SDEdit | Ours | |
| FID | 189.7 | 135.6 | 167.1 | 123.2 | 113.2 | 258.8 | 104.5 | 124.4 | 160.0 | 96.18 | 177.0 | 114.4 | 239.8 | 156.3 | 94.79 |
| KD | 3.296 | 5.993 | 4.109 | 5.187 | 4.297 | 14.66 | 2.505 | 2.798 | 3.454 | 1.910 | 4.909 | 6.350 | 5.934 | 5.505 | 4.841 |
| DINOv2 FD | 3001.0 | 2329.7 | 2665.5 | 2068.8 | 1813.3 | 2945.5 | 902.5 | 999.4 | 1090.0 | 764.1 | 3299.8 | 2973.4 | 2991.4 | 3354.3 | 2151.8 |
| LPIPS | - | 0.444 | 0.357 | 0.434 | 0.418 | - | 0.479 | 0.284 | 0.448 | 0.432 | - | 0.434 | 0.354 | 0.424 | 0.396 |
| Structure Dist. | - | 0.081 | 0.019 | 0.077 | 0.077 | - | 0.184 | 0.022 | 0.180 | 0.175 | - | 0.163 | 0.018 | 0.169 | 0.161 |
| CLIP Score | 24.30 | 24.71 | 24.58 | 24.79 | 24.78 | 22.33 | 22.94 | 22.19 | 23.08 | 23.14 | 20.52 | 20.77 | 20.64 | 20.42 | 20.62 |
| ImageReward | -2.147 | 0.097 | -1.359 | -0.020 | 0.186 | -2.231 | -0.354 | -0.343 | -0.621 | -0.280 | -0.509 | -0.316 | -0.264 | -0.642 | -0.167 |