DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Categories: cs.CV, cs.AI, cs.GR, cs.RO
Released Date: October 31, 2024
Authors: Weicai Ye¹, Chenhao Ji², Zheng Chen³, Junyao Gao², Xiaoshui Huang⁴, Song-Hai Zhang³, Wanli Ouyang⁴, Tong He⁴, Cairong Zhao², Guofeng Zhang¹
Affiliations: ¹State Key Lab of CAD&CG, Zhejiang University; ²Tongji University; ³Tsinghua University; ⁴Shanghai AI Laboratory

| | FID | IS | CS | PSNR | SSIM | Inference time (s) |
|---|---|---|---|---|---|---|
| = 1, = 10 | 73.30 | 3.40 | 22.14 | 29.99 | 0.69 | 30.06 |
| = 2, = 10 | 66.02 | 3.34 | 22.92 | 32.04 | 0.79 | 33.12 |
| = 3, = 10 | 69.89 | 3.58 | 22.76 | 32.29 | 0.81 | 35.79 |
| = 4, = 6 | 68.39 | 3.57 | 22.74 | 33.32 | 0.86 | 26.61 |
| = 4, = 8 | 67.30 | 3.54 | 22.66 | 33.00 | 0.84 | 32.23 |
| = 4, = 10 | 65.98 | 3.37 | 22.59 | 33.39 | 0.87 | 37.91 |
| = 4, = 12 | 62.79 | 3.26 | 22.65 | 32.89 | 0.83 | 43.72 |