GitHub topics: dataset-aggregation

Repositories

kwk2696/sb3-jax-haiku

stable-baselines with JAX & Haiku

Language: Python - Size: 297 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 4

hartikainen/berkeley-cs294

Berkeley CS 294: Deep Reinforcement Learning

Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 2

Hilton-AH/Imitation_Learning-Behavioral_Cloning-for-Robot-Learning

Lunar Lander game from OpenAI Gym using behavioral cloning, DAgger, and POMDPs (partially observable Markov decision processes)

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Hilton-AH/YODO-novel-RL-algorithm

Using DAgger with our MPC as the expert, we distill its knowledge into relatively simple networks while retaining a large fraction of its performance. (Please see the paper for a full description.)

Language: Jupyter Notebook - Size: 14.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Related Keywords

dataset-aggregation (4), reinforcement-learning (4), robot-learning (2), imitation-learning (2), behavioral-cloning (2), model-predictive-control (1), gym-environment (1), dqn (1), deep-reinforcement-learning (1), deep-learning (1), cs294 (1), berkeley-reinforcement-learning (1), a3c (1), soft-actor-critic (1), proximal-policy-optimization (1), jax (1), haiku (1), dm-haiku (1), diffusion (1), decision-transformers (1), behavior-cloning (1)
