notesum.ai
Published at December 9MoViE: Mobile Diffusion for Video Editing
cs.CV
Released Date: December 9, 2024
Authors: Adil Karjauv1, Noor Fathima1, Ioannis Lelekas1, Fatih Porikli1, Amir Ghodrati1, Amirhossein Habibian1
Aff.: 1Qualcomm AI Research
![[Uncaptioned image]](https://arxiv.org/html/2412.06578v1/x1.png)
| Method | Steps | PickScore | CLIPFrame | TFLOPs (per frame) | Latency (GPU) | Latency (Phone) |
|---|---|---|---|---|---|---|
| Fairy | 10 | 19.80 | 0.933 | - | - | - |
| TokenFlow | 50 | 20.49 | 0.940 | 109.35 | 2.45s | - |
| Rerender-A-Video | 20 | 19.58 | 0.909 | 107.52 | 2.13s | - |
| ControlVideo | 50 | 20.06 | 0.930 | 89.49 | 5.63s | - |
| InsV2V | 20 | 20.76 | 0.911 | 52.21 | 2.70s | - |
| RAVE | 50 | 20.35 | 0.932 | 83.09 | 4.31s | - |
| EVE | - | 20.76 | 0.922 | - | - | - |
| Base Model | 10 | 20.34 | 0.943 | 21.31 | 1.37s | 7s |
| + Mobile-Pix2Pix | 10 | 19.43 | 0.922 | 16.10 | 1.06s | 1.9s |
| + Multi-Guidance Dist. | 10 | 19.60 | 0.919 | 5.50 | 0.82s | 0.6s |
| + Adversarial Distillation | 1 | 19.40 | 0.913 | 0.76 | 0.11s | 0.08s |