An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: deep-rl

kengz/awesome-deep-rl

A curated list of awesome Deep Reinforcement Learning resources.

Size: 118 KB - Last synced at: about 18 hours ago - Pushed at: 9 months ago - Stars: 747 - Forks: 75

lexfridman/mit-deep-learning

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Language: Jupyter Notebook - Size: 62.4 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 10,271 - Forks: 2,213

epignatelli/navix

Accelerated minigrid environments with JAX

Language: Python - Size: 4.53 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 133 - Forks: 17

lexfridman/deeptraffic

DeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.

Language: JavaScript - Size: 5.92 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,774 - Forks: 262

MishaLaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Language: Python - Size: 2.16 MB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 586 - Forks: 87

devendrachaplot/Neural-SLAM

Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"

Language: Python - Size: 5.97 MB - Last synced at: 21 days ago - Pushed at: 10 months ago - Stars: 787 - Forks: 145

OmniTitanAI/OmniTitan-RL-AI

A universal RL engine transcending modality barriers, empowering cross-industry intelligence with superhuman decision efficiency. Created by @sudip_royedu

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

jakegrigsby/super_sac

A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data augmentation, offline learning and behavioral cloning.

Language: Python - Size: 487 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 37 - Forks: 6

reza-ebrahimi/awesome-autonomous-systems

A curated list of awesome intelligent autonomous systems ecosystem

Size: 6.84 KB - Last synced at: about 17 hours ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

gxywy/rl-plotter

:sparkles: A plotter for reinforcement learning (RL)

Language: Python - Size: 2.89 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 213 - Forks: 30

KanishkNavale/AI-for-Robot-Tasks

This repository implements the use of AI for robot tasks.

Language: Python - Size: 265 MB - Last synced at: about 7 hours ago - Pushed at: 4 days ago - Stars: 10 - Forks: 1

aminkhani/Deep-RL

You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.

Language: Jupyter Notebook - Size: 21.5 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 0

mike-gimelfarb/deep-successor-features-for-transfer

A reusable framework for successor features for transfer in deep reinforcement learning using keras.

Language: Python - Size: 1 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 40 - Forks: 11

ethanmclark1/rl_toolkit

Implementation of core reinforcement learning algorithms with PyTorch

Language: Python - Size: 233 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

leolellisr/poke_RL

Code repository with classical reinforcement learning and deep reinforcement learning methods for Pokémon battles in Pokémon Showdown.

Language: Jupyter Notebook - Size: 39.5 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 10 - Forks: 2

lgvaz/rlbox

RLbox: Solving OpenAI Gym with TensorFlow

Language: Python - Size: 2.62 MB - Last synced at: 22 days ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 6

aai-institute/tfl-training-rl

Language: Jupyter Notebook - Size: 223 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 0

zhihanyang2022/deeprl_notes

Demos of deep RL algorithms.

Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

victor-iyi/deep-RL

Exploration of deep reinforcement learning and various state-of-the-art techniques to create a turely autonomous agent.

Language: Python - Size: 61.5 KB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 1

hiwonjoon/cs294-deep-rl-hw Fork of berkeleydeeprlcourse/homework

Berkeley DeepRL Homework

Language: Jupyter Notebook - Size: 7.27 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 3

CapsuleEndoscope/VirtualCapsuleEndoscopy

VR-Caps: A Virtual Environment for Active Capsule Endoscopy

Language: C# - Size: 460 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 133 - Forks: 39

qiwihui/spinningup Fork of openai/spinningup

OpenAI团队的深度强化学习教程中文版

Language: Python - Size: 29.2 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 24 - Forks: 8

FeiyuYangAIOptimPlanning/DRLA-eTGM

DRLA-eTGM

Size: 105 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

ashutoshtiwari13/Demystifying-Deep-Reinforcement-Learning

⚡️ Code and Notes 📝 for Grokking Deep RL and RL: An Introduction by Sutton & Barto(2nd edition, 2018) 🤘

Language: Jupyter Notebook - Size: 73.3 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

holarissun/Accountable-Offline-RL

Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Language: Python - Size: 74.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

YiqinYang/ICQ

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400)

Language: Python - Size: 1.05 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 7

devendrachaplot/Object-Goal-Navigation

Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"

Language: Python - Size: 8.59 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 209 - Forks: 48

Tsaousis/Reinforcement-Learning-Trading-GME

$GME Trading using Deep Reinforcement Learning algorithms

Language: Jupyter Notebook - Size: 119 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

albjerto/moon-landing

Deep Q-Learning algorithms to solve LunarLander-v2.

Language: Python - Size: 1.38 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

kkhetarpal/unrealwithattention

Attend Before you Act: Leveraging human visual attention for continual learning

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 1

Yangyangii/Curiosity-Driven-A2C

Implementation of Curiosity-Driven Exploration with PyTorch

Language: Jupyter Notebook - Size: 329 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 2

ebrahimpichka/vanilla-pg

Simple PyTorch implementation of the Vanilla Policy Gradient algorithm.

Language: Python - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

LJ147/spinningup Fork of openai/spinningup

OpenAI团队的深度强化学习教程中文版

Language: Python - Size: 30.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 56 - Forks: 18

amtoine/dqn 📦

An application of Reinforcement Learning (RL) from the SDD RL class to function approximators and deep learning.

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

manantomar/Mirror-Descent-Policy-Optimization

Mirror Descent Policy Optimization

Language: Python - Size: 44.9 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 3

dbobrenko/reinforceflow

Deep Reinforcement Learning framework based on TensorFlow and OpenAI Gym

Language: Python - Size: 259 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 12 - Forks: 3

JojiJoseph/PPO

Language: Python - Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

gbartyzel/yadrl

Yet another deep reinforcement learning

Language: Python - Size: 337 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

NUS-LID/RENAULT

Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning

Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ishanshukla97/Deep-Q-Network

Simple DQN implementation in jupyter notebook

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Related Keywords
deep-rl 40 reinforcement-learning 26 deep-reinforcement-learning 24 deep-learning 11 pytorch 9 rl 8 dqn 7 machine-learning 7 openai-gym 6 ppo 5 double-dqn 4 policy-gradient 4 artificial-intelligence 4 tensorflow 4 sac 4 reinforcement-learning-algorithms 4 dueling-dqn 3 ddqn 3 a3c 3 actor-critic 3 model-free-rl 3 proximal-policy-optimization 3 robotics 3 exploration 2 navigation 2 self-driving-cars 2 visual-navigation 2 ai 2 keras 2 control 2 auxiliary-tasks 2 openai 2 dueling-ddqn 2 python 2 spinningup 2 mit 2 deeplearning 2 deep-learning-algorithms 2 deep-q-learning 2 a2c 2 off-policy 2 multi-agent-reinforcement-learning 1 xai 1 offlinerl 1 mutli-agent 1 policy-network 1 offline-rl 1 offline-control 1 deeplearning-ai 1 temporal-difference 1 interpretable-reinforcement-learning 1 interpretable-machine-learning 1 berkeley 1 behavioral-cloning 1 batched-control 1 cs294 1 temporal-difference-algorithms 1 active-capsule-endoscopy 1 area-coverage 1 disease-classification 1 domain-adaptation 1 sim2real 1 unity 1 single-track-rail 1 sutton-book 1 train-scheduling 1 sutton-barto-book 1 grokking-algorithms 1 monte-carlo-methods 1 deep-learning-ai 1 mdpo 1 mirror-descent 1 policy-optimization 1 stable-baselines 1 trpo 1 async-dqn 1 prioritized-replay 1 categorical-dqn 1 ddpg 1 distributional-ddpg 1 multi-step-dqn 1 noisynet-dqn 1 qr-dqn 1 td3 1 data-efficient-learning 1 ensemble-learning 1 multi-task-learning 1 dqn-tensorflow 1 notebook-jupyter 1 offline 1 object-goal-navigation 1 sem-exp 1 semantic-navigation 1 gme-stock 1 trading 1 trading-strategies 1 lunarlander-v2 1 attend-before-you-act 1 human-visual-attention 1 unreal 1