
Published on December 10

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization

cs.AI
cs.LG

Release Date: December 10, 2024

Authors: Zongkai Liu¹, Qian Lin, Chao Yu, Xiawei Wu, Yile Liang, Donghui Li, Xuetao Ding

Aff.: ¹School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China

arXiv: http://arxiv.org/pdf/2412.07639v1