notesum.ai
Published at December 9Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
cs.LG
cs.AI
Released Date: December 9, 2024
Authors: Egor Cherepanov1, Nikita Kachaev1, Artem Zholus2, Alexey K. Kovalev1, Aleksandr I. Panov1
Aff.: 1AIRI, Moscow, Russia; 2Chandar Research Lab

| Hyperparameter | Value |
|---|---|
| Number of layers | 2 |
| Number of attention heads | 2 |
| Hidden dimension | 256 |
| Batch size | 64 |
| Optimizer | Adam |
| Learning rate | 3e-4 |
| Dropout | 0.1 |
| Replay buffer size | 1e6 |
| Discount () | 0.99 |
| Entropy temperature | 0.1 |