notesum.ai
Published at November 26Spatially Visual Perception for End-to-End Robotic Learning
cs.CV
cs.AI
cs.RO
Released Date: November 26, 2024
Authors: Travis Davies1, Jiahuan Yan1, Xiang Chen2, Yu Tian3, Yueting Zhuang4, Yiqi Huang1, Luhui Hu1
Aff.: 1ZhiCheng AI; 2Peking University; 3Harvard University; 4Zhejiang University

| Task | Exposure | 10 | 20 | 40 | 60 | 80 | 100 | 120 | 140 | 160 | 170 | AVG | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Success Rate (%) | |||||||||||||
| CupStack | DP (baseline) | 0 | 0 | 0 | 0 | 0 | 97 | 75 | 40 | 17 | 0 | 23 | |
| DP+Depth | 0 | 0 | 0 | 0 | 0 | 90 | 91 | 78 | 0 | 0 | 26 | ||
| DP+AugBlender | 0 | 0 | 0 | 66 | 72 | 100 | 66 | 92 | 0 | 0 | 40 | ||
| DP+Varied Data | 0 | 0 | 0 | 0 | 0 | 50 | 40 | 50 | 10 | 0 | 15 | ||
| Ours | 62 | 88 | 93 | 91 | 91 | 93 | 91 | 88 | 91 | 90 | 88 | ||
| PickSmall | DP (baseline) | 0 | 0 | 0 | 0 | 0 | 66 | 78 | 100 | 100 | 90 | 43 | |
| DP+Depth | 0 | 71 | 82 | 85 | 91 | 93 | 79 | 86 | 65 | 71 | 72 | ||
| DP+AugBlender | 0 | 66 | 50 | 73 | 92 | 100 | 72 | 80 | 75 | 68 | 68 | ||
| DP+Varied Data | 0 | 0 | 0 | 41 | 42 | 55 | 65 | 67 | 71 | 65 | 41 | ||
| Ours | 0 | 84 | 82 | 92 | 92 | 100 | 92 | 100 | 87 | 92 | 82 | ||
| PickBig | DP (baseline) | 10 | 22 | 31 | 61 | 67 | 100 | 59 | 45 | 38 | 32 | 47 | |
| DP+Depth | 53 | 82 | 78 | 75 | 83 | 82 | 90 | 75 | 70 | 68 | 78 | ||
| DP+AugBlender | 51 | 65 | 72 | 75 | 80 | 89 | 51 | 61 | 60 | 58 | 66 | ||
| DP+Varied Data | 0 | 0 | 21 | 55 | 51 | 82 | 65 | 62 | 55 | 45 | 44 | ||
| Ours | 61 | 83 | 85 | 83 | 100 | 84 | 82 | 83 | 81 | 83 | 83 | ||