notesum.ai
Published at November 18Dissecting Misalignment of Multimodal Large Language Models via Influence Function
cs.AI
cs.CV
Released Date: November 18, 2024
Authors: Lijie Hu1, Chenyang Ren2, Huanyi Xie3, Khouloud Saadi1, Shu Yang1, Jingfeng Zhang4, Di Wang1
Aff.: 1King Abdullah University of Science and Technology; 2Shanghai Jiao Tong University; 3KTH Royal Institute of Technology; 4The University of Auckland

| Sample | Method | FGVCAircraft | Food101 | Flowers102 | |||
|---|---|---|---|---|---|---|---|
| Accuracy(%) | RT (second) | Accuracy(%) | RT (second) | Accuracy(%) | RT (second) | ||
| Random | Retrain | 23.070.29 | 19.57 | 84.930.17 | 14.59 | 68.160.22 | 16.59 |
| ECIF | 22.770.09 | 7.60 | 84.870.24 | 7.28 | 68.530.12 | 7.29 | |
| Valuable | Retrain | 22.930.33 | 15.56 | 84.800.16 | 15.88 | 68.230.33 | 16.43 |
| ECIF | 22.730.09 | 5.95 | 84.860.05 | 6.27 | 68.260.12 | 6.52 | |
| Harmful | Retrain | 23.500.11 | 22.40 | 84.830.05 | 14.59 | 68.000.16 | 16.09 |
| ECIF | 23.020.07 | 6.26 | 84.900.01 | 6.22 | 68.300.01 | 6.27 | |