GitHub topics: dataset-aggregation
kwk2696/sb3-jax-haiku
stable-baselines with JAX & Haiku
Language: Python - Size: 297 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 4

hartikainen/berkeley-cs294
Berkeley CS 294: Deep Reinforcement Learning
Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 2

Hilton-AH/Imitation_Learning-Behavioral_Cloning-for-Robot-Learning
Lunar Lander game from OpenAI Gym using behavioral cloning, DAgger methods, and POMDP(Partially-Observable Markov Decision Processes)
Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Hilton-AH/YODO-novel-RL-algorithm
Using DAgger with our MPC treated as the expert, we are able to effectively distill knowledge into relatively simple networks while still being able to retain a large fraction of the performance. (Please see paper for full description).
Language: Jupyter Notebook - Size: 14.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
