notesum.ai
Published at November 4xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
cs.DC
cs.AI
Released Date: November 4, 2024
Authors: Jiarui Fang1, Jinzhe Pan2, Xibo Sun1, Aoyu Li1, Jiannan Wang3
Aff.: 1Tencent; 2Tencent & Huazhong University of Science and Technology; 3Tencent & The University of Hong Kong

| Method | Communication | Memory Cost | ||
| Cost | Overlap | Model | KV Activations | |
| Tensor Parallelism | ✗ | |||
| DistriFusion | ✓ | |||
| SP-Ring | ✓ | |||
| SP-Ulysses | ✗ | |||
| PipeFusion | 2 | ✓ | ||