Published on May 7
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
NeurIPS
Released Date: May 7, 2024
Authors: Asaf Cassel¹, Aviv Rosenberg²
Aff.: ¹Tel Aviv University; ²Google Research
OpenReview: https://openreview.net/pdf/306830e11a9e3b0046b7c85d29e7cb963f283c26.pdf