notesum.ai
Published at November 10A Variance Minimization Approach to Temporal-Difference Learning
cs.LG
cs.AI
Released Date: November 10, 2024
Authors: Xingguo Chen1, Yu Gong, Shangdong Yang1, Wenhao Wang2
Aff.: 1Jiangsu Key Laboratory of Big Data Security and Intelligent Processing, Nanjing University of Posts and Telecommunications, Nanjing 210023, China; 2College of Electronic Engineering, National University of Defense Technology, China
![[Uncaptioned image]](https://arxiv.org/html/2411.06396v1/x1.png)
| algorithm | TD | VMTD | TDC | VMTDC | ETD | VMETD |
|---|---|---|---|---|---|---|
| ON-POLICY | ||||||
| OFF-POLICY |