notesum.ai
Published at October 21FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
cs.CV
cs.AI
cs.CL
cs.LG
Released Date: October 21, 2024
Authors: Woosung Koh1, Wonbeen Oh1, Siyeol Kim1, Suhin Shin1, Hyeongjin Kim1, Jaein Jang1, Junghyun Lee2, Se-Young Yun2
Aff.: 1Yonsei University; 2KAIST AI

| MPEv2 Environment | Spread | Repel | Tag | |||
|---|---|---|---|---|---|---|
| Method | OOD1 | OOD2 | OOD1 | OOD2 | OOD1 | OOD2 |
| FlickerFusion-Attention | -198.4 | -230.1 | 845.6 | 1070.2 | -2.1 | -9.4 |
| FlickerFusion-MLP | -183.7 | -209.0 | 906.1 | 1189.0 | -18.8 | -33.7 |
| ACORM (Hu et al., 2024) | -244.4 | -330.3 | 153.6 | 98.1 | -247.9 | -578.2 |
| ODIS (Zhang et al., 2023) | -257.1 | -376.5 | -3532.8 | -2982.1 | -1992.0 | -1723.4 |
| CAMA (Shao et al., 2023) | -262.2 | -464.1 | 698.2 | 928.1 | -29.6 | -31.2 |
| REFIL (Iqbal et al., 2021) | -205.0 | -289.2 | 404.6 | 822.4 | -9.3 | -15.2 |
| UPDeT (Hu et al., 2021) | -222.0 | -273.9 | -2645.2 | -1929.0 | -129.5 | -128.8 |
| Meta DotProd (Kedia et al., 2021) | -367.8 | -555.9 | -12538.6 | -9365.7 | -7611.4 | -7302.5 |
| DG-MAML (Wang et al., 2021a) | -217.3 | -254.0 | 343.2 | 623.0 | -39.2 | -50.8 |
| SMLDG (Li et al., 2020) | -258.2 | -290.4 | -1711.0 | -1211.7 | -210.6 | -185.9 |
| MLDG (Li et al., 2018) | -413.1 | -610.7 | -12763.4 | -9711.1 | -2625.9 | -2333.2 |
| QMIX-Attention (Iqbal et al., 2021) | -190.1 | -251.5 | 719.0 | 927.4 | -14.4 | -26.6 |
| QMIX-MLP (Rashid et al., 2018) | -209.1 | -242.8 | 380.0 | 883.7 | -41.6 | -62.5 |
| MPEv2 Environment | Guard | Adversary | Hunt | |||
| Method | OOD1 | OOD2 | OOD1 | OOD2 | OOD1 | OOD2 |
| FlickerFusion-Attention | -1258.3 | -1160.0 | 60.9 | 9.9 | -297.1 | -337.5 |
| FlickerFusion-MLP | -1127.2 | -962.9 | 56.3 | 9.3 | -278.5 | -305.3 |
| ACORM (Hu et al., 2024) | -2074.3 | -2049.6 | 56.1 | 13.1 | -367.2 | -397.3 |
| ODIS (Zhang et al., 2023) | -1449.3 | -1442.8 | -56.8 | -188.7 | -667.5 | -657.8 |
| CAMA (Shao et al., 2023) | -5002.5 | -4656.9 | 50.7 | 26.5 | -1063.1 | -1131.3 |
| REFIL (Iqbal et al., 2021) | -1445.8 | -1294.7 | 14.5 | -11.7 | -305.0 | -347.9 |
| UPDeT (Hu et al., 2021) | -2845.5 | -2637.0 | -64.6 | -0.6 | -340.8 | -383.5 |
| Meta DotProd (Kedia et al., 2021) | -7507.4 | -7620.4 | 82.3 | 34.4 | -956.0 | -1087.9 |
| DG-MAML (Wang et al., 2021a) | -1885.3 | -1989.1 | 18.7 | 6.1 | -384.1 | -416 |
| SMLDG (Li et al., 2020) | -2765.9 | -2591.8 | -12.7 | -134.8 | -601.8 | -623.0 |
| MLDG (Li et al., 2018) | -10509.2 | -9508.4 | -76.2 | -162.4 | -1164.1 | -1386.9 |
| QMIX-Attention (Iqbal et al., 2021) | -1252.3 | -1202.8 | 34.7 | 2.1 | -305.3 | -347.9 |
| QMIX-MLP (Rashid et al., 2018) | -1464.6 | -1325.1 | 17.3 | -8.1 | -337.8 | -357.5 |