DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET
cs.CV
cs.AI
Release Date: October 30, 2024
Authors: Yitong Li¹, Morteza Ghahremani¹, Youssef Wally¹, Christian Wachinger¹
Affiliation: ¹Lab for Artificial Intelligence in Medical Imaging, Technical University of Munich (TUM), Germany

| Data | Method | Modality | CN vs. AD: BACC | CN vs. AD: AUC | CN vs. MCI vs. AD: BACC | CN vs. MCI vs. AD: F1-Score |
|---|---|---|---|---|---|---|
| ADNI | 3D-ResNet [26] | | 86.59 ± 4.22 | 93.67 ± 2.43 | 53.58 ± 6.20 | 53.70 ± 6.16 |
| ADNI | 3D-ResNet [26] | | 89.07 ± 2.67 | 95.19 ± 2.22 | 54.84 ± 3.40 | 55.79 ± 4.22 |
| ADNI | 3D-ViT [28] | | 86.22 ± 3.83 | 93.66 ± 2.50 | 60.83 ± 7.41 | 60.03 ± 7.45 |
| ADNI | 3D-ViT [28] | | 88.75 ± 1.69 | 93.53 ± 1.39 | 58.24 ± 4.64 | 53.20 ± 4.41 |
| ADNI | ResNet-based early-fusion [30] | MRI + PET | 82.66 ± 2.40 | 85.58 ± 7.29 | 57.30 ± 2.30 | 55.79 ± 1.02 |
| ADNI | ResNet-based middle-fusion [26] | MRI + PET | 82.63 ± 7.04 | 88.17 ± 6.77 | 53.01 ± 3.40 | 53.70 ± 6.16 |
| ADNI | ResNet-based late-fusion [26] | MRI + PET | 89.74 ± 1.98 | 96.73 ± 0.92 | 57.71 ± 2.90 | 58.89 ± 1.73 |
| ADNI | ViT-based early-fusion | MRI + PET | 89.20 ± 3.29 | 94.87 ± 1.80 | 62.89 ± 2.08 | 59.84 ± 0.73 |
| ADNI | ViT-based late-fusion | MRI + PET | 90.60 ± 3.57 | 96.48 ± 1.24 | 61.86 ± 3.00 | 59.60 ± 5.83 |
| ADNI | Mul-T [8] | MRI + PET | 86.37 ± 3.25 | 93.59 ± 1.54 | 56.49 ± 3.82 | 55.54 ± 3.94 |
| ADNI | MMTFN [25] | MRI + PET | 88.76 ± 1.98 | 93.69 ± 1.95 | 63.11 ± 4.51 | 60.51 ± 3.49 |
| ADNI | DiaMond (Ours) | MRI + PET | 92.42 ± 2.63 | 97.11 ± 1.47 | 65.18 ± 1.57 | 64.89 ± 2.78 |
| J-ADNI | 3D-ResNet [26] | | 84.66 ± 2.92 | 91.40 ± 3.16 | 56.89 ± 1.64 | 54.87 ± 1.83 |
| J-ADNI | 3D-ResNet [26] | | 85.48 ± 9.87 | 92.22 ± 4.51 | 50.87 ± 12.8 | 46.16 ± 18.7 |
| J-ADNI | 3D-ViT [28] | | 83.65 ± 4.15 | 92.26 ± 1.29 | 55.11 ± 1.62 | 47.71 ± 5.23 |
| J-ADNI | 3D-ViT [28] | | 89.42 ± 3.08 | 96.14 ± 1.16 | 51.77 ± 5.45 | 51.32 ± 4.74 |
| J-ADNI | ResNet-based early-fusion [30] | MRI + PET | 86.02 ± 6.85 | 91.61 ± 3.82 | 53.46 ± 3.92 | 53.28 ± 3.70 |
| J-ADNI | ResNet-based middle-fusion [26] | MRI + PET | 77.73 ± 8.97 | 74.12 ± 6.78 | 49.48 ± 2.85 | 43.94 ± 7.39 |
| J-ADNI | ResNet-based late-fusion [26] | MRI + PET | 88.81 ± 3.36 | 94.85 ± 1.79 | 50.20 ± 6.23 | 48.90 ± 6.50 |
| J-ADNI | ViT-based early-fusion | MRI + PET | 89.59 ± 3.68 | 93.86 ± 2.98 | 54.68 ± 5.32 | 52.25 ± 3.42 |
| J-ADNI | ViT-based late-fusion | MRI + PET | 86.70 ± 7.73 | 96.34 ± 2.42 | 54.07 ± 5.80 | 50.83 ± 4.89 |
| J-ADNI | Mul-T [8] | MRI + PET | 81.02 ± 6.08 | 88.45 ± 2.96 | 50.04 ± 7.28 | 45.66 ± 8.46 |
| J-ADNI | MMTFN [25] | MRI + PET | 86.74 ± 6.05 | 84.27 ± 7.91 | 57.55 ± 2.81 | 53.45 ± 6.97 |
| J-ADNI | DiaMond (Ours) | MRI + PET | 91.72 ± 2.52 | 96.20 ± 2.50 | 58.44 ± 3.61 | 58.88 ± 1.73 |
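
For readers unfamiliar with the BACC and F1-Score columns, the following is a minimal sketch of how such metrics are commonly computed for a multi-class task such as CN vs. MCI vs. AD. It assumes BACC means the mean of per-class recall and that the F1-Score is macro-averaged; the table does not state the averaging, and this is not the authors' evaluation code. The function names and the toy labels are hypothetical.

```python
def per_class_counts(y_true, y_pred):
    """Return {label: (tp, fp, fn)} for every label seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    counts = {}
    for c in labels:
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        counts[c] = (tp, fp, fn)
    return counts


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, over classes that occur in y_true."""
    counts = per_class_counts(y_true, y_pred)
    recalls = [tp / (tp + fn) for tp, fp, fn in counts.values() if tp + fn > 0]
    return sum(recalls) / len(recalls)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging, see lead-in)."""
    counts = per_class_counts(y_true, y_pred)
    f1s = []
    for tp, fp, fn in counts.values():
        denom = 2 * tp + fp + fn
        f1s.append(2 * tp / denom if denom else 0.0)
    return sum(f1s) / len(f1s)


# Toy labels for illustration only; these are not data from the paper.
y_true = ["CN", "CN", "CN", "CN", "MCI", "MCI", "AD", "AD"]
y_pred = ["CN", "CN", "CN", "MCI", "MCI", "AD", "AD", "AD"]
print(f"BACC: {balanced_accuracy(y_true, y_pred):.4f}")
print(f"Macro F1: {macro_f1(y_true, y_pred):.4f}")
```

Because balanced accuracy weights each class equally, it is less sensitive to class imbalance than plain accuracy, which may be why it is reported alongside AUC and F1 here.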