Published on November 14
Iterative Batch Reinforcement Learning via Safe Diversified Model-based Policy Search
cs.LG
cs.AI
stat.ML
Released Date: November 14, 2024
Authors: Amna Najib¹, Stefan Depeweg¹, Phillip Swazinna¹
Affiliation: ¹Siemens AG

Cost (-Reward) over Iterations

| Iteration | Policy | Initial data |
|---|---|---|
| Iteration 0 | x | x |
| Iteration 1 | x | |
| Iteration 2 | x | |
| Iteration 3 | x | |
| Iteration 4 | x | |