notesum.ai
Published at December 3It Takes Two: Real-time Co-Speech Two-person's Interaction Generation via Reactive Auto-regressive Diffusion Model
cs.SD
cs.CV
cs.GR
cs.MM
eess.AS
Released Date: December 3, 2024
Authors: Mingyi Shi1, Dafei Qin1, Leo Ho1, Zhouyingcheng Liao1, Yinghao Huang2, Junichi Yamagishi3, Taku Komura1
Aff.: 1The University of Hong Kong; 2Great Bay University; 3National Institute of Informatics, Tokyo
![[Uncaptioned image]](https://arxiv.org/html/2412.02419v1/extracted/6041954/figures/teaser.jpg)
| Motion Quality | Interaction | ||||
|---|---|---|---|---|---|
| FPD | Div. | Foot.Slid. | FDD | Div. | |
| ReMoS [19] | 475.32 | 111.90 | 0.3050 | 394.7 | 34.0 |
| Ours | 103.19 | 10.42 | 0.0109 | 133.72 | 14.13 |
| Ours(w/t audio) | 86.79 | 14.84 | 0.0141 | 104.43 | 19.90 |