notesum.ai
Published at November 29Fleximo: Towards Flexible Text-to-Human Motion Video Generation
cs.CV
cs.LG
Released Date: November 29, 2024
Authors: Yuhang Zhang1, Yuan Zhou2, Zeyu Liu3, Yuxuan Cai2, Qiuyue Wang2, Aidong Men1, Huan Yang2
Aff.: 1Beijing University of Posts and Telecommunications; 201.AI; 3Tsinghua University

| Methods | PSNR | SSIM | LPIPS | DreamSim | FID | FVD | MotionScore |
| I2VGen-XL [42] | 7.931 | 0.3684 | 0.559 | 0.537 | 220.221 | 1905.26 | 0.6806 |
| VideoCrafter [3] | 6.727 | 0.5190 | 0.608 | 0.211 | 149.534 | 1536.00 | 0.6866 |
| DynamiCrafter [35] | 9.607 | 0.6800 | 0.407 | 0.699 | 99.644 | 1462.42 | 0.6868 |
| Fleximo | 16.647 | 0.7148 | 0.284 | 0.879 | 76.181 | 1360.12 | 0.6990 |