notesum.ai
Published at November 21Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning
cs.RO
Released Date: November 21, 2024
Authors: Yafei Ou1, Mahdi Tavakoli2
Aff.: 1Graduate Student Member, IEEE; 2Senior Member, IEEE

| Hyperparameter | Value |
|---|---|
| Common | |
| Rollout buffer size | 32768 |
| Batch size | 2048 |
| Learning rate (linear decay) | 3e-4 |
| Entropy regularization | 1e-2 |
| Clipping parameter (linear decay) | 0.2 |
| Generalized advantage estimation | 0.95 |
| Epochs per update | 3 |
| Visual encoder type | simple (2 layers of CNN) |
| MLP layers | 3 |
| MLP hidden units | 128 |
| Imitation learning (if applicable) | |
| BC loss strength (linear decay) | 0.2 |
| BC steps | 1e4 |
| GAIL reward strength | 5e-2 |