Topic: "td-lambda"
adik993/reinforcement-learning-sutton
Language: Python - Size: 75.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 3

khanhvu207/ddrl
Distributed Deep Reinforcement Learning Framework
Language: Jupyter Notebook - Size: 831 KB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

Pegah-Ardehkhani/Reinforcement-Learning-Algorithms-from-Scratch
Explore key RL algorithms with detailed explanations and fully commented Python code implementations
Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

PeeteKeesel/Basic-RL-Algorithms
Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Karpathy's REINFORCEjs library.
Language: Python - Size: 18.8 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

TomGeorge1234/ThetaSequencesAreEligibilityTraces
Code for my paper: "Theta sequences as eligibility traces: a biological solution to credit assignment"
Language: Jupyter Notebook - Size: 2.05 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

MaviVestini/RL_HW2
Second homework for the Reinforcement Learning course
Language: Python - Size: 332 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

plopd/on-policy-experiments-td-and-etd
An Empirical Comparison of Temporal-Difference Learning Methods with Emphatic Temporal-Difference Learning Methods in the On-Policy Case.
Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

plopd/plop-msc-thesis
A Comparison of Temporal-Difference Learning with Emphatic Temporal-Difference Learning
Language: Python - Size: 361 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

giulio-derasmo/Reinforcement-Learning-Projects
Repository of Reinforcement Learning projects done during the course @Sapienza
Language: Python - Size: 27.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Anjali001/Reinforcement-Learning
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

jolares/replicate-sutton-1998-td-lambda-experiments
Replicates the random walk experiments from Sutton's 1988 paper "Learning to Predict by the Methods of Temporal Differences"
Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rabieifk/Prison_Break_Machine_Learning
Machine-learning application in path finding using the n-step TD(lambda) algorithm
Language: Python - Size: 165 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dyth/Juno
Tic-Tac-Toe agent trained by Deep Reinforcement Learning
Language: Python - Size: 87.9 KB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1
