HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
cs.CV
cs.AI
Released Date: November 26, 2024
Authors: Fan Yang¹, Ru Zhen², Jianing Wang³, Yanhao Zhang², Haoxiang Chen⁴, Haonan Lu², Sicheng Zhao¹, Guiguang Ding¹
Affiliations: ¹Tsinghua University; ²OPPO AI Center; ³Peking University; ⁴Fudan University

| Method | Heatmap MSE (All Data)↓ | Heatmap MSE (GT=0)↓ | Heatmap KLD↓ | Heatmap CC↑ | Heatmap SIM↑ | Heatmap AUC-Judd↑ | Plausibility Score PLCC↑ | Plausibility Score SRCC↑ |
|---|---|---|---|---|---|---|---|---|
| PickScore (off-the-shelf) | - | - | - | - | - | - | 0.010 | 0.028 |
| EVA-CLIP encoder (fine-tuned) | 0.01614 | 0.00512 | 2.835 | 0.350 | 0.082 | 0.549 | 0.157 | 0.143 |
| CLIP encoder (fine-tuned) | 0.01437 | 0.00425 | 2.462 | 0.251 | 0.122 | 0.747 | 0.390 | 0.378 |
| RAHF (multi-head) | 0.01216 | 0.00141 | 1.971 | 0.425 | 0.302 | 0.877 | 0.666 | 0.654 |
| RAHF (augmented prompt) | 0.00920 | 0.00095 | 1.652 | 0.556 | 0.409 | 0.913 | 0.693 | 0.681 |
| HEIE (ours) | 0.00825 | 0.00014 | 1.634 | 0.574 | 0.417 | 0.915 | 0.697 | 0.683 |