Published on November 11
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
cs.LG
cs.AI
Released Date: November 11, 2024
Authors: Xingrui Yu¹, Zhenglin Wan², David Mark Bossens¹, Yueming Lyu¹, Qing Guo¹, Ivor W. Tsang¹
Affiliations: ¹ Centre for Frontier AI Research (CFAR), A*STAR, Singapore; ² Centre for Frontier AI Research (CFAR), A*STAR, Singapore, and School of Data Science, The Chinese University of Hong Kong, Shenzhen, China

| Method | HalfCheetah QD-Score | HalfCheetah Cov(%) | HalfCheetah Best | HalfCheetah Avg | Walker2d QD-Score | Walker2d Cov(%) | Walker2d Best | Walker2d Avg | Humanoid QD-Score | Humanoid Cov(%) | Humanoid Best | Humanoid Avg |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PPGA-trueReward | | 94.08 | 8,942 | 2,871 | | 77.04 | 5,588 | 1,891 | | 49.96 | 9,691 | 4,570 |
| mCWAE-WGAIL-Bonus | | 98.28 | 4,553 | 1,547 | | 87.39 | 4,142 | 1,407 | | 75.05 | 6,875 | 3,447 |
| WAE-WGAIL-Bonus | | 76.89 | 4,788 | 1,246 | | 73.93 | 3,310 | 1,381 | | 67.27 | 8,620 | 3,09 |
| mCWAE-WGAIL | | 87.76 | 5,728 | 1,623 | | 71.61 | 3,611 | 1,326 | | 66.96 | 8,485 | 2,114 |
| WAE-WGAIL | | 93.77 | 5,463 | 1,615 | | 73.20 | 3,327 | 1,148 | | 68.08 | 8,553 | 2,359 |
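
For readers unfamiliar with the four metrics in the table, the following is a minimal sketch of how QD-Score, Cov(%), Best, and Avg are commonly computed from a quality-diversity archive. It is not taken from the paper: the function name `archive_metrics`, the dictionary-based archive layout, and the toy values are all hypothetical, and it assumes non-negative fitness values so that the QD-Score can be a plain sum over occupied cells.

```python
from statistics import mean


def archive_metrics(archive: dict, num_cells: int) -> dict:
    """Summarize a QD archive (hypothetical layout, for illustration only).

    archive   -- maps an occupied cell index to the fitness of its elite
    num_cells -- total number of cells in the archive, occupied or not
    """
    if num_cells <= 0:
        raise ValueError("num_cells must be positive")
    fitnesses = list(archive.values())
    if not fitnesses:
        # An empty archive has no elites, so Best and Avg are undefined.
        return {"qd_score": 0.0, "coverage_pct": 0.0, "best": None, "avg": None}
    return {
        # Sum of elite fitness over occupied cells (assumes fitness >= 0).
        "qd_score": float(sum(fitnesses)),
        # Share of cells holding an elite, as a percentage.
        "coverage_pct": 100.0 * len(fitnesses) / num_cells,
        # Highest and mean fitness among the stored elites.
        "best": float(max(fitnesses)),
        "avg": float(mean(fitnesses)),
    }


# Toy archive: 3 of 4 cells are occupied.
toy_archive = {0: 1000.0, 2: 3000.0, 3: 2000.0}
metrics = archive_metrics(toy_archive, num_cells=4)
print(metrics)
```

Under these definitions a method can reach high coverage while its best and average rewards stay well below the true-reward baseline, which is the pattern to look for when comparing rows of the table.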