notesum.ai

Published at December 9

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone

cs.LG
cs.AI

Released Date: December 9, 2024

Authors: Max Sobol Mark1, Tian Gao2, Georgia Gabriela Sampaio2, Mohan Kumar Srirama1, Archit Sharma2, Chelsea Finn2, Aviral Kumar1

Aff.: 1Carnegie Mellon University; 2Stanford University

Arxiv: http://arxiv.org/pdf/2412.06685v1