Equivariant Representation Learning for Augmentation-based Self-Supervised Learning via Image Reconstruction
cs.CV
Released Date: December 4, 2024
Authors: Qin Wang¹, Kai Krajsek¹, Hanno Scharr¹
Affiliation: ¹Forschungszentrum Jülich, Germany

| ImageNet | Rotation | Color | Blur radius | Translation | Crop prediction | Flip |
| --- | --- | --- | --- | --- | --- | --- |
| SIE(rot) | 0.990 | 0.867 | 0.042 | 0.540 | 0.266 | 0.532 |
| SIE(color) | 0.078 | 0.890 | 0.097 | 0.355 | 0.178 | 0.333 |
| SIE(blur) | 0.153 | 0.883 | 0.941 | 0.189 | 0.412 | 0.415 |
| SIE(trans) | 0.213 | 0.885 | 0.023 | 0.978 | 0.368 | 0.511 |
| SIE(crop) | 0.273 | 0.819 | 0.018 | 0.450 | 0.922 | 0.485 |
| SIE(flip) | 0.155 | 0.798 | 0.056 | 0.312 | 0.266 | 0.993 |
| VICReg [4] | 0.318 ± 0.005 | 0.804 ± 0.016 | 0.101 ± 0.023 | 0.333 ± 0.008 | 0.423 ± 0.140 | 0.872 ± 0.070 |
| SIE(all) | 0.331 ± 0.007 | 0.899 ± 0.003 | 0.211 ± 0.005 | 0.925 ± 0.002 | 0.835 ± 0.008 | 0.945 ± 0.004 |
| SIE(all, single each time) | 0.435 ± 0.011 | 0.907 ± 0.009 | 0.377 ± 0.004 | 0.922 ± 0.010 | 0.829 ± 0.005 | 0.939 ± 0.007 |
| Ours | 0.862 ± 0.004 | 0.921 ± 0.006 | 0.823 ± 0.003 | 0.853 ± 0.005 | 0.912 ± 0.002 | 0.952 ± 0.008 |