An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataset-aggregation

kwk2696/sb3-jax-haiku

stable-baselines with JAX & Haiku

Language: Python - Size: 297 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 4

hartikainen/berkeley-cs294

Berkeley CS 294: Deep Reinforcement Learning

Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 2

Hilton-AH/Imitation_Learning-Behavioral_Cloning-for-Robot-Learning

Lunar Lander game from OpenAI Gym using behavioral cloning, DAgger methods, and POMDP(Partially-Observable Markov Decision Processes)

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Hilton-AH/YODO-novel-RL-algorithm

Using DAgger with our MPC treated as the expert, we are able to effectively distill knowledge into relatively simple networks while still being able to retain a large fraction of the performance. (Please see paper for full description).

Language: Jupyter Notebook - Size: 14.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0