notesum.ai

Published at May 13

DiTFastAttn: Attention Compression for Diffusion Transformer Models

NeurIPS

Released Date: May 13, 2024

Authors: Zhihang Yuan1, Hanling Zhang1, Lu Pu, Xuefei Ning1, Linfeng Zhang2, Tianchen Zhao1, Shengen Yan3, Guohao Dai2, Yu Wang1

Aff.: 1Tsinghua University; 2Shanghai Jiao Tong University; 3Infinigence AI

Arxiv: https://openreview.net/pdf/f04f7cb65b98f0bf20c13bbd3cb6d0ecc0432d01.pdf