An open API service providing repository metadata for many open source software ecosystems.

Topic: "td-lambda"

adik993/reinforcement-learning-sutton

Language: Python - Size: 75.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 3

khanhvu207/ddrl

Distributed Deep Reinforcement Learning Framework

Language: Jupyter Notebook - Size: 831 KB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

Pegah-Ardehkhani/Reinforcement-Learning-Algorithms-from-Scratch

Explore key RL algorithms with detailed explanations and fully commented Python code implementations

Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

PeeteKeesel/Basic-RL-Algorithms

:robot: Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.

Language: Python - Size: 18.8 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

TomGeorge1234/ThetaSequencesAreEligibilityTraces

Code for my paper: "Theta sequences as eligibility traces: a biological solution to credit assignment"

Language: Jupyter Notebook - Size: 2.05 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

MaviVestini/RL_HW2

Second homework for the Reinforcement Learning course

Language: Python - Size: 332 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

plopd/on-policy-experiments-td-and-etd

An Empirical Comparison of Temporal-Differences Learning Methods with Emphatic Temporal-Differences Learning Methods in the On-Policy Case.

Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

plopd/plop-msc-thesis

A Comparison of Temporal-Difference Learning with Emphatic Temporal-Difference Learning

Language: Python - Size: 361 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

giulio-derasmo/Reinforcement-Learning-Projects

Repository of Reinforcement Learning projects done during the course @Sapienza

Language: Python - Size: 27.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Anjali001/Reinforcement-Learning

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

jolares/replicate-sutton-1998-td-lambda-experiments

Replicates the Random Walk Experiments from Sutton's 1998 paper "Learning to predict by the methods of Temporal Differences"

Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rabieifk/Prison_Break_Machine_Learning

Machine-learning application in path finding using the n-step TD(lambda) algorithm

Language: Python - Size: 165 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dyth/Juno

Tic-Tac-Toe agent trained by Deep Reinforcement Learning

Language: Python - Size: 87.9 KB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1