Topic: "etd-lambda"
plopd/on-policy-experiments-td-and-etd
An Empirical Comparison of Temporal-Differences Learning Methods with Emphatic Temporal-Differences Learning Methods in the On-Policy Case.
Language: Python - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
plopd/plop-msc-thesis
A Comparison of Temporal-Difference Learning with Emphatic Temporal-Difference Learning
Language: Python - Size: 361 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0