td-lambda | Sujet | Ecosyste.ms: Repos

Sujet: "td-lambda"

adik993/reinforcement-learning-sutton

langage: Python - taille: 75,2 ko - dernière synchronisation: il y a presque 3 ans - enregistré: il y a presque 6 ans - étoiles: 13 - forks: 3

khanhvu207/ddrl

Distributed Deep Reinforcement Learning Framework

langage: Jupyter Notebook - taille: 831 ko - dernière synchronisation: il y a 2 mois - enregistré: il y a presque 4 ans - étoiles: 5 - forks: 0

Pegah-Ardehkhani/Reinforcement-Learning-Algorithms-from-Scratch

Explore key RL algorithms with detailed explanations and fully commented Python code implementations

langage: Jupyter Notebook - taille: 2,36 Mo - dernière synchronisation: il y a 8 mois - enregistré: il y a environ un an - étoiles: 4 - forks: 0

PeeteKeesel/Basic-RL-Algorithms

:robot: Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.

langage: Python - taille: 18,8 Mo - dernière synchronisation: il y a plus de 2 ans - enregistré: il y a presque 3 ans - étoiles: 4 - forks: 0

TomGeorge1234/ThetaSequencesAreEligibilityTraces

Code for my paper: "Theta sequences as eligibility traces: a biological solution to credit assignment"

langage: Jupyter Notebook - taille: 2,05 Mo - dernière synchronisation: il y a 6 mois - enregistré: il y a plus de 2 ans - étoiles: 3 - forks: 0

MaviVestini/RL_HW2

Second homework for the Reinforcement Learning course

langage: Python - taille: 332 ko - dernière synchronisation: il y a environ 2 ans - enregistré: il y a environ 2 ans - étoiles: 0 - forks: 0

plopd/on-policy-experiments-td-and-etd

An Empirical Comparison of Temporal-Differences Learning Methods with Emphatic Temporal-Differences Learning Methods in the On-Policy Case.

langage: Python - taille: 35,2 ko - dernière synchronisation: il y a plus de 2 ans - enregistré: il y a plus de 2 ans - étoiles: 0 - forks: 0

plopd/plop-msc-thesis

A Comparison of Temporal-Difference Learning with Emphatic Temporal-Difference Learning

langage: Python - taille: 361 ko - dernière synchronisation: il y a plus de 2 ans - enregistré: il y a plus de 2 ans - étoiles: 0 - forks: 0

giulio-derasmo/Reinforcement-Learning-Projects

Repository of Reinforcement Learning projects done during the course @Sapienza

langage: Python - taille: 27,3 ko - dernière synchronisation: il y a presque 3 ans - enregistré: il y a presque 3 ans - étoiles: 0 - forks: 0

Anjali001/Reinforcement-Learning

langage: Jupyter Notebook - taille: 1,05 Mo - dernière synchronisation: il y a presque 3 ans - enregistré: il y a plus de 3 ans - étoiles: 0 - forks: 0

jolares/replicate-sutton-1998-td-lambda-experiments

Replicates the Random Walk Experiments from Sutton's 1998 paper "Learning to predict by the methods of Temporal Differences"

taille: 9,77 ko - dernière synchronisation: il y a 9 mois - enregistré: il y a plus de 4 ans - étoiles: 0 - forks: 0

rabieifk/Prison_Break_Machine_Learning

Machine-learning application in path finding using the n-step TD(lambda) algorithm

langage: Python - taille: 165 ko - dernière synchronisation: il y a plus de 2 ans - enregistré: il y a presque 5 ans - étoiles: 0 - forks: 0

dyth/Juno

Tic-Tac-Toe agent trained by Deep Reinforcement Learning

langage: Python - taille: 87,9 ko - dernière synchronisation: il y a 10 mois - enregistré: il y a plus de 7 ans - étoiles: 0 - forks: 1

Sujets associés

reinforcement-learning 8 policy-iteration 3 td-learning 3 q-learning 3 sarsa 3 monte-carlo 2 reinforcement-learning-algorithms 2 rl 2 etd-lambda 2 value-iteration 2 sarsa-lambda 2 machine-learning 2 theta 1 theoretical-neuroscience 1 etd-learning 1 sequences 1 reinforcement 1 markov-decision-process 1 neuroscience 1 multi-step-ahead-forecasting 1 deep-reinforcement-learning 1 value-network 1 epsilon-greedy-exploration 1 exploration-exploitation 1 greedy-algorithm 1 policy-gradient 1 reinforce 1 sarsa-learning 1 ucb-algorithm 1 etd 1 td 1 n-step 1 rbf 1 a2c 1 ilqr 1 bandit-algorithm 1 cliffwalking 1 dyna-q 1 gridworld 1 multi-armed-bandits 1 racecar 1 random-walk 1 sutton-book 1 distributed-reinforcement-learning 1 openai-gym 1 ppo 1 pytorch 1 vtrace 1 deep-q-learning 1 epsilon-greedy 1 iterative-policy-evaluation 1 optimistic-inital-values 1 reinforcement-learning-agent 1 reinforcement-learning-environments 1 td-0 1 thompson-sampling 1 ucb1 1 algorithms 1 artficial-intelligence 1 computational-neuroscience 1 hippocampus 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos