notesum.ai
Published at October 30MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts
cs.CV
cs.AI
cs.LG
Released Date: October 30, 2024
Authors: Jie Zhu1, Yixiong Chen2, Mingyu Ding3, Ping Luo4, Leye Wang1, Jingdong Wang5
Aff.: 1Key Lab of High Confidence Software Technologies (Peking University), Ministry of Education, China and School of Computer Science, Peking University, Beijing, China; 2Johns Hopkins University; 3UC Berkeley; 4The University of Hong Kong; 5Baidu
![[Uncaptioned image]](https://arxiv.org/html/2410.23332v1/extracted/5964680/title.png)
| Model | COCO Human Prompts | |
|---|---|---|
| HPS (%) | IR (%) | |
| VQ-Diffision | ||
| Versatile Diffusion | ||
| SDXL | ||
| SD v1.5 | ||
| MoLE (SD v1.5) | ||
| MoLE (SDXL) | ||
| Model | DiffusionDB Human Prompts | |
| HPS (%) | IR (%) | |
| VQ-Diffision | ||
| Versatile Diffusion | ||
| SDXL | ||
| SD v1.5 | ||
| MoLE (SD v1.5) | ||
| MoLE (SDXL) | ||