Published on November 7
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
cs.CV
cs.AI
cs.GR
Released Date: November 7, 2024
Authors: Wenqiang Sun¹, Shuo Chen², Fangfu Liu², Zilong Chen², Yueqi Duan², Jun Zhang¹, Yikai Wang²
Affiliations: ¹HKUST; ²Tsinghua University
![[Uncaptioned image]](https://arxiv.org/html/2411.04928v1/x1.png)
| Setting | Method | Tanks and Temples | | | MipNeRF360 | | | LLFF | | | DL3DV | | |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| | | PSNR | SSIM | LPIPS | PSNR | SSIM | LPIPS | PSNR | SSIM | LPIPS | PSNR | SSIM | LPIPS |
| Single-View | ZeroNVS [38] | 12.31 | 0.301 | 0.567 | 15.84 | 0.327 | 0.536 | 15.62 | 0.497 | 0.354 | 12.39 | 0.251 | 0.559 |
| Single-View | ViewCrafter [61] | 15.18 | 0.499 | 0.319 | 15.65 | 0.404 | 0.378 | 17.56 | 0.620 | 0.337 | 14.78 | 0.422 | 0.417 |
| Single-View | Ours | 17.11 | 0.613 | 0.199 | 18.91 | 0.527 | 0.333 | 20.38 | 0.744 | 0.200 | 18.28 | 0.642 | 0.215 |
| Sparse-View | DNGaussian [22] | 12.13 | 0.292 | 0.511 | 15.21 | 0.127 | 0.632 | 17.51 | 0.586 | 0.409 | 14.99 | 0.286 | 0.432 |
| Sparse-View | InstantSplat [10] | 18.70 | 0.634 | 0.258 | 16.80 | 0.574 | 0.296 | 22.33 | 0.818 | 0.149 | 18.30 | 0.691 | 0.222 |
| Sparse-View | ViewCrafter [61] | 18.76 | 0.637 | 0.216 | 18.49 | 0.691 | 0.212 | 21.60 | 0.823 | 0.155 | 19.19 | 0.686 | 0.196 |
| Sparse-View | Ours | 20.42 | 0.668 | 0.185 | 20.21 | 0.713 | 0.184 | 25.11 | 0.913 | 0.067 | 21.69 | 0.780 | 0.124 |
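
For readers unfamiliar with the PSNR columns in the table, the sketch below shows the standard definition, PSNR = 10 · log10(MAX² / MSE), where higher values indicate a rendered view closer to the ground truth. It is only an illustration: the function name `psnr`, the flat-list image representation, and the assumption that pixel values lie in [0, 1] are not taken from the paper, whose evaluation code is not shown here.

```python
import math


def psnr(reference, rendered, max_value=1.0):
    """Peak signal-to-noise ratio (dB) between two equally sized pixel sequences.

    Hypothetical helper for illustration; assumes pixel values in [0, max_value].
    """
    if len(reference) != len(rendered) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - p) ** 2 for r, p in zip(reference, rendered)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    ground_truth = [0.0, 0.5, 1.0, 0.25]
    prediction = [0.1, 0.4, 0.9, 0.35]
    print(f"PSNR: {psnr(ground_truth, prediction):.2f} dB")
```

Because PSNR is logarithmic in the mean squared error, a gain of about 3 dB corresponds to roughly halving the error.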