Published at November 12
Doubly Mild Generalization for Offline Reinforcement Learning
cs.LG
cs.AI
Released Date: November 12, 2024
Authors: Yixiu Mao¹, Qi Wang¹, Yun Qu¹, Yuhang Jiang¹, Xiangyang Ji¹
Aff.: ¹Department of Automation, Tsinghua University

| Dataset-v2 | QL | QL+DMG | SQL | SQL+DMG |
|---|---|---|---|---|
| halfcheetah-m | 47.7 | 55.3 | 48.3 | 54.5 |
| hopper-m | 71.1 | 90.1 | 75.5 | 97.7 |
| walker2d-m | 81.5 | 88.7 | 84.2 | 89.8 |
| halfcheetah-m-r | 44.8 | 51.1 | 44.8 | 51.8 |
| hopper-m-r | 97.3 | 102.5 | 101.7 | 101.8 |
| walker2d-m-r | 75.9 | 90.0 | 77.2 | 95.2 |
| halfcheetah-m-e | 89.8 | 92.5 | 94.0 | 93.5 |
| hopper-m-e | 107.1 | 111.1 | 111.8 | 110.4 |
| walker2d-m-e | 110.1 | 111.3 | 110.0 | 109.6 |
| total | 725.3 | 792.7 | 747.5 | 804.2 |
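
The totals and the per-dataset effect of adding DMG can be checked directly from the table entries. The following is a minimal sketch, not the authors' evaluation code: the dictionary layout and the helper names `column_totals` and `dmg_gains` are assumptions made for illustration, and the scores are transcribed from the table above, with the first value of each pair taken as the base method and the second as the base method plus DMG.

```python
# Scores transcribed from the table: (base method, base method + DMG).
# The "QL" label is kept exactly as it appears in the table header.
scores = {
    "halfcheetah-m":   {"QL": (47.7, 55.3),   "SQL": (48.3, 54.5)},
    "hopper-m":        {"QL": (71.1, 90.1),   "SQL": (75.5, 97.7)},
    "walker2d-m":      {"QL": (81.5, 88.7),   "SQL": (84.2, 89.8)},
    "halfcheetah-m-r": {"QL": (44.8, 51.1),   "SQL": (44.8, 51.8)},
    "hopper-m-r":      {"QL": (97.3, 102.5),  "SQL": (101.7, 101.8)},
    "walker2d-m-r":    {"QL": (75.9, 90.0),   "SQL": (77.2, 95.2)},
    "halfcheetah-m-e": {"QL": (89.8, 92.5),   "SQL": (94.0, 93.5)},
    "hopper-m-e":      {"QL": (107.1, 111.1), "SQL": (111.8, 110.4)},
    "walker2d-m-e":    {"QL": (110.1, 111.3), "SQL": (110.0, 109.6)},
}


def column_totals(table):
    """Sum each column over all datasets, rounded to one decimal place."""
    totals = {}
    for row in table.values():
        for method, (base, with_dmg) in row.items():
            totals[method] = totals.get(method, 0.0) + base
            key = method + "+DMG"
            totals[key] = totals.get(key, 0.0) + with_dmg
    return {name: round(total, 1) for name, total in totals.items()}


def dmg_gains(table, method):
    """Per-dataset score change from adding DMG to the given base method."""
    return {
        dataset: round(row[method][1] - row[method][0], 1)
        for dataset, row in table.items()
    }


if __name__ == "__main__":
    print(column_totals(scores))
    for method in ("QL", "SQL"):
        print(method, dmg_gains(scores, method))
```

Summing the rounded entries gives 725.3, 792.6, 747.5, and 804.3. The two +DMG sums differ from the reported totals (792.7 and 804.2) by 0.1, which would be consistent with the totals having been computed from unrounded scores. The gains are positive on every dataset for QL, while for SQL they are slightly negative on the three medium-expert datasets.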