GitHub topics: reinforcement-learning-algorithms
I-Halder/reinforcing-higher-order-statistics-in-large-language-model-training
New algorithm for training large languge models taking advantage of high order statistics of the logit distribution
Language: Python - Size: 4.34 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

ronniross/symbiotic-core-library
The Symbiotic Core Library provides a framework of ethical principles, practical modules, and grounded research to guide AI development, deployment and inferencing.
Size: 26.1 MB - Last synced at: about 15 hours ago - Pushed at: about 15 hours ago - Stars: 12 - Forks: 3

KunjShah01/RL-A2A
Language: Python - Size: 449 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 2

Skylark0924/Rofunc
๐ค The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
Language: Python - Size: 1.01 GB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 644 - Forks: 59

amin-sharifi-github/quant-rl-trading-agent
End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.
Language: Python - Size: 4.01 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

JuliaPOMDP/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
Language: Julia - Size: 11.1 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 723 - Forks: 107

asieradzk/RL_Matrix
Deep Reinforcement Learning in C#
Language: C# - Size: 46.2 MB - Last synced at: 2 days ago - Pushed at: 30 days ago - Stars: 277 - Forks: 24

filippottaviani/ObstacleAvoidanceG1
Progetto di tesi magistrale in Ingegneria Robotica. Progettazione e implementazione di un algoritmo di obstacle avoidance per un robot umanoide (Unitree G1) attraverso tecniche di reinforcement learning e addestramento in ambiente simulato Isaac Sim.
Language: Python - Size: 61.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

shehio/rl
Implementing RL agents, one algorithm at a time
Language: Python - Size: 226 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
Size: 191 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1,174 - Forks: 69

sozelfist/annotated_deep_learning_paper_implementations Fork of labmlai/annotated_deep_learning_paper_implementations
๐งโ๐ซ 50! Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
Language: Python - Size: 147 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

linhlpv/awesome-offline-to-online-RL-papers
A list of Offline to Online RL papers (continually updated)
Size: 16.6 KB - Last synced at: about 16 hours ago - Pushed at: 11 months ago - Stars: 47 - Forks: 0

binary-husky/hmp2g
Multiagent Reinforcement Learning Research Project
Language: Python - Size: 141 MB - Last synced at: 4 days ago - Pushed at: 24 days ago - Stars: 212 - Forks: 37

WindJammer6/35.-Star-Wars-Reinforcement-Learning
A series of Star Wars-inspired Gymnasium custom-made Reinforcement Learning (RL) Environments in grid-world style.
Language: Jupyter Notebook - Size: 63.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

coax-dev/coax
Modular framework for Reinforcement Learning in python
Language: Python - Size: 17.6 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 174 - Forks: 18

opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language: Python - Size: 293 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,497 - Forks: 407

dimitarpg13/reinforcement_learning_and_game_theory
Collection of materials and code samples on reinforcement learning / optimal control and game theory
Size: 884 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 18 - Forks: 4

EdanToledo/Stoix
๐๏ธA research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX โข End-to-End JAX RL
Language: Python - Size: 14.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 344 - Forks: 37

balarcode/reinforcement-learning
Practical implementation of selected algorithms, concepts and techniques in reinforcement learning.
Language: Python - Size: 1.56 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Faycal214/RL-algorithms
๐ Reinforcement learning from scratch โ including value-based and policy-based methods, alongside search-driven approaches from evolutionary strategies like Genetic Algorithms to adaptive techniques like Simulated Annealing, PSO, and CMA-ES
Language: Python - Size: 716 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

sushant1827/Reinforcement-Learning-with-OpenAI-Gymnasium
This repository contains basic examples demonstrating how to use the gymnasium library for setting up and interacting with Reinforcement Learning environments.
Language: Python - Size: 29.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

ajaykr2712/ML_DS
Dialy Curated Open Source Learnings of ML ๐ค
Language: Python - Size: 74.3 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

jianzhnie/llmtech
LLMTechSite, ไธๆณจไบ้็จไบบๅทฅๆบ่ฝ้ขๅ็ๆๆฏ็ๆใ
Language: Python - Size: 6.59 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11 - Forks: 4

rust-control/qtable
A simple Q-Table implementation for Rust
Language: Rust - Size: 25.4 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python - Size: 4.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 11,188 - Forks: 1,891

Omegastick/pytorch-cpp-rl ๐ฆ
PyTorch C++ Reinforcement Learning
Language: C++ - Size: 540 KB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 524 - Forks: 89

nicrusso7/rex-gym
OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
Language: Python - Size: 319 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 1,056 - Forks: 139

Zihao-Felix-Zhou/UavNetSim-v1
"UavNetSim-v1" can be used for designing and testing routing protocols (e.g., DSDV, AODV), MAC protocols, UAV 3D path planning (e.g., A*, Dijkstra) and topology control algorithms (e.g., virtual force). Additionally, it models the wireless channel, mobility, and energy consumption, and provides accurate performance analysis and visualization.
Language: Python - Size: 46.7 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 73 - Forks: 16

Silidrone/aiplane
AI agent that uses DQN to learn how to land an airplane in XPLANE-12 flight simulator.
Language: Python - Size: 74.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

EricSteinberger/PokerRL
Framework for Multi-Agent Deep Reinforcement Learning in Poker
Language: Python - Size: 301 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 476 - Forks: 99

pockerman/cuberl
Library for reinforcement learning with c++
Language: C++ - Size: 4.2 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 1

cpnota/autonomous-learning-library
A PyTorch library for building deep reinforcement learning agents.
Language: Python - Size: 6.24 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 649 - Forks: 72

leehe228/LogisticsEnv
UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D
Language: C# - Size: 1.31 GB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 101 - Forks: 14

Rasso555/eco-benchmark
Eco-Benchmark offers a new framework for evaluating AI models, focusing on societal outcomes and ethical data sourcing. Join the movement towards responsible AI! ๐ฑ๐ ๏ธ
Size: 25.4 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

syed-bilal-bukhari/AWSDeepRacer
Some of my reward functions and analysis that helped me throughout my learning
Language: Python - Size: 143 KB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 4

Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Language: Python - Size: 1.44 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 622 - Forks: 207

udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 5,075 - Forks: 2,376

WeiHanTu/expectimax-2048
This project implements an AI agent for the 2048 game using the Expectimax algorithm. The AI agent is designed to make optimal moves by evaluating possible future game states. The project includes a game engine to handle the game mechanics, a user interface for manual play, and testing to verify the AI's performance and correctness.
Language: Python - Size: 6.06 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

zli12321/free-form-grpo
grpo to train long form QA and instructions with long-form reward model
Language: Python - Size: 25.9 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 10 - Forks: 0

benedekrozemberczki/awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
Language: Python - Size: 238 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 682 - Forks: 76

Gen-Verse/CURE
Open-Source LLM Coders with Co-Evolving Reinforcement Learning
Language: Python - Size: 1.72 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 94 - Forks: 12

alejotoro-o/rlforge
RL Forge is an open source reinforcement learning library that aims to provide the users with useful functions for the development of Reinforcement Learning Agents. The library also includes multiple popular reinforcement learning agents and environments, in addition, it is designed to be compatible with the gymnasium library (previous OpenAI Gym).
Language: Python - Size: 4.03 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

xujiuqing2023/marl-ppo-suite
Clean, documented implementations of PPO-based algorithms for cooperative multi-agent reinforcement learning, focusing on SMAC environments. Features MLP and RNN-based MAPPO with various normalization techniques.
Language: Python - Size: 33.2 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

legalaspro/marl-ppo-suite
Clean, documented implementations of PPO-based algorithms for cooperative multi-agent reinforcement learning, focusing on SMAC environments. Features MLP and RNN-based MAPPO and HAPPO with various techniques.
Language: Python - Size: 4.17 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

namoshizun/PyPOMDP
Python implementation of POMDP framework and PBVI & POMCP algorithms.
Language: Python - Size: 1.81 MB - Last synced at: 7 days ago - Pushed at: almost 4 years ago - Stars: 114 - Forks: 26

XuehaiPan/mate
MATE: the Multi-Agent Tracking Environment.
Language: Python - Size: 492 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 22

wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
Size: 132 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 352 - Forks: 42

HectorMozo3110/meta_cognitive_self_model_agents
Modular framework for building self-modeling artificial agents with explicit internal state representation and meta-cognitive capabilities. Includes RL, hybrid, and dummy policies with integrated SelfModel monitoring and scientific metrics.
Language: Python - Size: 4.03 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
Size: 884 KB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 809 - Forks: 35

TarunNagarajan/RL
Tell me and I forget. Teach me and I may remember. Involve me and I learn. Welcome to the Reinforcement Learning Repository.
Language: Python - Size: 405 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

MouseTrap-codes/rlmathtoolkit
A math-faithful Python toolkit implementing reinforcement learning algorithms chapter-by-chapter from Sutton & Barto.
Language: Python - Size: 135 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

imagry/aleph_star
Reinforcement learning with A* and a deep heuristic
Language: Jupyter Notebook - Size: 55.7 MB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 294 - Forks: 36

MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Language: Python - Size: 2.16 MB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 592 - Forks: 90

NeuromatchAcademy/course-content-dl
NMA deep learning course
Language: Jupyter Notebook - Size: 594 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 787 - Forks: 281

uma-dev/chihuahua-game
Train your custom sprite-based character to hit the correct tile using RL
Language: Python - Size: 510 KB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

datascienceid/reinforcement-learning-resources
A curated list of awesome reinforcement courses, video lectures, books, library and many more.
Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 71 - Forks: 39

rlberry-py/rlberry
An easy-to-use reinforcement learning library for research and education.
Language: Python - Size: 18.6 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 169 - Forks: 30

fadeldnswr/learn-reinforcement-learning
Holds a repository for my journey of learning reinforcement learning algorithm
Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 1 - Forks: 0

david-abel/simple_rl
A simple framework for experimenting with Reinforcement Learning in Python.
Language: Python - Size: 3.92 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 316 - Forks: 104

ddihora1604/QuestX Fork of AyaanZ30/AIchemists_Datahack
QuestX is an AI-powered adaptive quiz platform that generates dynamic multimodal flashcards (text, audio, video) from unstructured inputs. Built with Python, React + Vite, and Flask, it uses reinforcement learning to adjust quiz difficulty in real-time and tailors study resource recommendations based on user performance.
Language: HTML - Size: 39.1 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
Size: 2.51 MB - Last synced at: 24 days ago - Pushed at: 30 days ago - Stars: 513 - Forks: 15

XinJingHao/TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
Language: Python - Size: 15.3 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 17

Wu-Zongyu/Awesome-Large-Search-Models
Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search (Large search models).
Size: 89.8 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 108 - Forks: 4

nsidn98/InforMARL
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
Language: Python - Size: 18.5 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 115 - Forks: 22

FerhatAkalan/DroneDeliverySystemDQN
This project is an intelligent drone simulator that optimizes urban package deliveries using the Deep Q-Network (DQN) algorithm. Drones pick up packages from the cargo depot and learn the most efficient routes to deliver them to multiple delivery points through neural network-based decision making.
Language: Python - Size: 33.2 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

lisamandro/News-Recommendation-System-using-Reinforcement-Learning
Multi Armed Bandits implementation on MIND dataset.
Language: Python - Size: 49.4 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

yassinhawari/FreeHoopRL
# FreeHoopRLThis project uses a Deep Q-Network (DQN) algorithm to train an AI agent for shooting basketballs in a simple 2D environment. The agent learns to choose the right angle and force to score points, with results visualized through training and analysis graphs. ๐๐ค
Language: Python - Size: 17.6 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

paitayon/Awesome-Large-Search-Models
Awesome-Large-Search-Models is a collection of key papers and resources focused on search-oriented large language models. Explore methods, datasets, and tools to enhance your understanding and development of large search models. ๐๐
Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Evan09064/Thesis---3D-Wave-Shooter
Thesis project exploring an AI system that dynamically adapts game difficulty in real time by analyzing key player performance indicators (reaction times, accuracy, strategic choices). The system tailors the experience to each player's unique playstyle and counteracts it to encourage adaptation and skill evolution.
Language: C# - Size: 6.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

gseismic/rlearn_dev.py
A Reinforcement Learning Library [dev] DDPG,TD3,SAC,PPO and more
Language: Python - Size: 4.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Jason-CKY/DeepRL-pytorch
Pytorch implementations of various Deep Reinforcement Learning algorithms on pybullet environments.
Language: Python - Size: 258 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 30 - Forks: 6

gokulp01/bluerov2_gym
A Gymnasium environment for simulating and training reinforcement learning agents on the BlueROV2 underwater vehicle.
Language: Python - Size: 13.7 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 16 - Forks: 3

davidgeorgewilliams/JessicaRabbit-QLoRA-Axolotl
This comprehensive technical guide, developed at the request of OnlyFans founder, demonstrates advanced AI model fine-tuning methodologies to transform Qwen2-72b into a Jessica Rabbit personality emulation using cutting-edge QLoRA and ORPO techniques.
Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

yunfeng-net/RL
Language: Python - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

danmcleran/tinymind Fork of intel/cppnnml
Tinymind is a Neural Network and Machine Learning project intended to provide a C++ template library for neural nets and machine learning algorithms within embedded systems.
Language: C++ - Size: 1.44 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 2

arianahejazyan/RL
Simple implementations of RL algorithms
Language: Jupyter Notebook - Size: 423 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

ChengshuLi/HRL4IN
Code for CoRL 2019 paper: HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators
Language: Python - Size: 83 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 63 - Forks: 13

Pyoussefpour/Self-Driving-AI-Car-No-Libraries-
DQN to drive a car through traffic - no libraries everything implemented from scratch
Language: JavaScript - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

xiangwang1223/kgpolicy
Reinforced Negative Sampling over Knowledge Graph for Recommendation, WWW2020
Language: Python - Size: 92.8 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 136 - Forks: 37

ahmadkh1995/Trash_Collector_Robot_RL
A small demo of RL algorithms on a trash collector robot
Language: Python - Size: 1.16 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Nara-On/DDPG
An implementation of DeepDeterministic Policy Gradient
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

smt970913/OCRL_discharge_decision_making
This is the Github code repository for the discharge decision-making project.
Language: Jupyter Notebook - Size: 7.77 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dvalenciar/ReinforceUI-Studio
ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC
Language: Python - Size: 21.2 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 65 - Forks: 3

varejad/behavioral_flow
Reinforcement learning library inspired by Behavioral Psychology, focused on simplicity. It uses Python dictionaries instead of complex tables, making implementation intuitive. Agents learn through reinforcement and punishment, considering context and consequences. They can also combine actions to create new behaviors.
Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

1jamesthompson1/dsunrise
D-Sunrise: A Dynamic Ensemble Algorithm for Deep Reinforcement Learning
Language: Python - Size: 376 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alison-carrera/onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Language: Python - Size: 66.4 KB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 187 - Forks: 46

Allenpandas/2020-Reinforcement-Learning-Conferences-Papers ๐ฆ
The proceedings of top conference in 2020 on the topic of Reinforcement Learning (RL), including: AAAI, IJCAI, NeurIPS, ICML, ICLR, ICRA, AAMAS and more.
Size: 94.7 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

zhao-kun/rl-grpo
A reinforcement learning project demonstrate using GRPO to train an AI agent to play a fruits catching game.
Language: Python - Size: 5.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

maomao-zm/2D_Packing
2D_Packing via Deep Learning
Language: Python - Size: 50.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sozelfist/handson-ml3 Fork of ageron/handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
Language: Jupyter Notebook - Size: 36.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
Language: Python - Size: 13.1 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 154 - Forks: 34

BY571/Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
Language: Python - Size: 5.99 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 288 - Forks: 33

uzumstanley/DEEP-LEARNING
UNIVERSITY OF ROEHAMPTON LONDON
Language: Jupyter Notebook - Size: 69.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ardaegeunlu/Contextual-Gaussian-Process-Bandit-Optimization
Simple implementation of the CGP-UCB algorithm.
Language: Python - Size: 2.72 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 36 - Forks: 6

amoghj98/SHIRE
This repository contains code associated with SHIRE: Enhancing Sample Efficiency using Human Intuition in Reinforcement Learning (ICRA 2025)
Language: Python - Size: 155 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

SrajanByndoor/DQN-for-Lunar-Lander
A deep reinforcement learning implementation using Deep Q-Network (DQN) to train an agent to successfully land a lunar module in OpenAI Gymnasium's LunarLander-v3 environment.
Language: Jupyter Notebook - Size: 242 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sanatren/Legal-Document-Analyzer
This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.
Language: Python - Size: 82 KB - Last synced at: 21 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sail-sg/optim4rl
Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
Language: Python - Size: 135 KB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 26 - Forks: 2

rutubhanderi/Dynamic_Pricing_Strategy Fork of AtharvRaje33/Dynamic_Pricing_Strategy
Reinforcement Learning Project
Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

FortsAndMills/RL-Theory-book
Reinforcement learning theory book about foundations of deep RL algorithms with proofs.
Language: TeX - Size: 80.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 320 - Forks: 20
