GitHub topics: reinforcement-learning-algorithms

Repositories

I-Halder/reinforcing-higher-order-statistics-in-large-language-model-training

New algorithm for training large languge models taking advantage of high order statistics of the logit distribution

Language: Python - Size: 4.34 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

ronniross/symbiotic-core-library

The Symbiotic Core Library provides a framework of ethical principles, practical modules, and grounded research to guide AI development, deployment and inferencing.

Size: 26.1 MB - Last synced at: about 15 hours ago - Pushed at: about 15 hours ago - Stars: 12 - Forks: 3

KunjShah01/RL-A2A

Language: Python - Size: 449 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 2

Skylark0924/Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language: Python - Size: 1.01 GB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 644 - Forks: 59

amin-sharifi-github/quant-rl-trading-agent

End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.

Language: Python - Size: 4.01 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

JuliaPOMDP/POMDPs.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

Language: Julia - Size: 11.1 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 723 - Forks: 107

asieradzk/RL_Matrix

Deep Reinforcement Learning in C#

Language: C# - Size: 46.2 MB - Last synced at: 2 days ago - Pushed at: 30 days ago - Stars: 277 - Forks: 24

filippottaviani/ObstacleAvoidanceG1

Progetto di tesi magistrale in Ingegneria Robotica. Progettazione e implementazione di un algoritmo di obstacle avoidance per un robot umanoide (Unitree G1) attraverso tecniche di reinforcement learning e addestramento in ambiente simulato Isaac Sim.

Language: Python - Size: 61.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

shehio/rl

Implementing RL agents, one algorithm at a time

Language: Python - Size: 226 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

opendilab/awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

Size: 191 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1,174 - Forks: 69

sozelfist/annotated_deep_learning_paper_implementations Fork of labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python - Size: 147 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

linhlpv/awesome-offline-to-online-RL-papers

A list of Offline to Online RL papers (continually updated)

Size: 16.6 KB - Last synced at: about 16 hours ago - Pushed at: 11 months ago - Stars: 47 - Forks: 0

binary-husky/hmp2g

Multiagent Reinforcement Learning Research Project

Language: Python - Size: 141 MB - Last synced at: 4 days ago - Pushed at: 24 days ago - Stars: 212 - Forks: 37

WindJammer6/35.-Star-Wars-Reinforcement-Learning

A series of Star Wars-inspired Gymnasium custom-made Reinforcement Learning (RL) Environments in grid-world style.

Language: Jupyter Notebook - Size: 63.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

coax-dev/coax

Modular framework for Reinforcement Learning in python

Language: Python - Size: 17.6 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 174 - Forks: 18

opendilab/DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python - Size: 293 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,497 - Forks: 407

dimitarpg13/reinforcement_learning_and_game_theory

Collection of materials and code samples on reinforcement learning / optimal control and game theory

Size: 884 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 18 - Forks: 4

EdanToledo/Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Language: Python - Size: 14.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 344 - Forks: 37

balarcode/reinforcement-learning

Practical implementation of selected algorithms, concepts and techniques in reinforcement learning.

Language: Python - Size: 1.56 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Faycal214/RL-algorithms

🚀 Reinforcement learning from scratch — including value-based and policy-based methods, alongside search-driven approaches from evolutionary strategies like Genetic Algorithms to adaptive techniques like Simulated Annealing, PSO, and CMA-ES

Language: Python - Size: 716 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

sushant1827/Reinforcement-Learning-with-OpenAI-Gymnasium

This repository contains basic examples demonstrating how to use the gymnasium library for setting up and interacting with Reinforcement Learning environments.

Language: Python - Size: 29.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

ajaykr2712/ML_DS

Dialy Curated Open Source Learnings of ML 🤖

Language: Python - Size: 74.3 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

jianzhnie/llmtech

LLMTechSite, 专注于通用人工智能领域的技术生态。

Language: Python - Size: 6.59 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11 - Forks: 4

rust-control/qtable

A simple Q-Table implementation for Rust

Language: Rust - Size: 25.4 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python - Size: 4.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 11,188 - Forks: 1,891

Omegastick/pytorch-cpp-rl 📦

PyTorch C++ Reinforcement Learning

Language: C++ - Size: 540 KB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 524 - Forks: 89

nicrusso7/rex-gym

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

Language: Python - Size: 319 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 1,056 - Forks: 139

Zihao-Felix-Zhou/UavNetSim-v1

"UavNetSim-v1" can be used for designing and testing routing protocols (e.g., DSDV, AODV), MAC protocols, UAV 3D path planning (e.g., A*, Dijkstra) and topology control algorithms (e.g., virtual force). Additionally, it models the wireless channel, mobility, and energy consumption, and provides accurate performance analysis and visualization.

Language: Python - Size: 46.7 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 73 - Forks: 16

Silidrone/aiplane

AI agent that uses DQN to learn how to land an airplane in XPLANE-12 flight simulator.

Language: Python - Size: 74.6 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

EricSteinberger/PokerRL

Framework for Multi-Agent Deep Reinforcement Learning in Poker

Language: Python - Size: 301 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 476 - Forks: 99

pockerman/cuberl

Library for reinforcement learning with c++

Language: C++ - Size: 4.2 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 1

cpnota/autonomous-learning-library

A PyTorch library for building deep reinforcement learning agents.

Language: Python - Size: 6.24 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 649 - Forks: 72

leehe228/LogisticsEnv

UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D

Language: C# - Size: 1.31 GB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 101 - Forks: 14

Rasso555/eco-benchmark

Eco-Benchmark offers a new framework for evaluating AI models, focusing on societal outcomes and ethical data sourcing. Join the movement towards responsible AI! 🌱🛠️

Size: 25.4 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

syed-bilal-bukhari/AWSDeepRacer

Some of my reward functions and analysis that helped me throughout my learning

Language: Python - Size: 143 KB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 4

Stable-Baselines-Team/stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Language: Python - Size: 1.44 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 622 - Forks: 207

udacity/deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 5,075 - Forks: 2,376

WeiHanTu/expectimax-2048

This project implements an AI agent for the 2048 game using the Expectimax algorithm. The AI agent is designed to make optimal moves by evaluating possible future game states. The project includes a game engine to handle the game mechanics, a user interface for manual play, and testing to verify the AI's performance and correctness.

Language: Python - Size: 6.06 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

zli12321/free-form-grpo

grpo to train long form QA and instructions with long-form reward model

Language: Python - Size: 25.9 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 10 - Forks: 0

benedekrozemberczki/awesome-monte-carlo-tree-search-papers

A curated list of Monte Carlo tree search papers with implementations.

Language: Python - Size: 238 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 682 - Forks: 76

Gen-Verse/CURE

Open-Source LLM Coders with Co-Evolving Reinforcement Learning

Language: Python - Size: 1.72 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 94 - Forks: 12

alejotoro-o/rlforge

RL Forge is an open source reinforcement learning library that aims to provide the users with useful functions for the development of Reinforcement Learning Agents. The library also includes multiple popular reinforcement learning agents and environments, in addition, it is designed to be compatible with the gymnasium library (previous OpenAI Gym).

Language: Python - Size: 4.03 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

xujiuqing2023/marl-ppo-suite

Clean, documented implementations of PPO-based algorithms for cooperative multi-agent reinforcement learning, focusing on SMAC environments. Features MLP and RNN-based MAPPO with various normalization techniques.

Language: Python - Size: 33.2 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

legalaspro/marl-ppo-suite

Language: Python - Size: 4.17 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

namoshizun/PyPOMDP

Python implementation of POMDP framework and PBVI & POMCP algorithms.

Language: Python - Size: 1.81 MB - Last synced at: 7 days ago - Pushed at: almost 4 years ago - Stars: 114 - Forks: 26

XuehaiPan/mate

MATE: the Multi-Agent Tracking Environment.

Language: Python - Size: 492 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 22

wwxFromTju/awesome-reinforcement-learning-lib

GitHub's code repository is all you need

Size: 132 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 352 - Forks: 42

HectorMozo3110/meta_cognitive_self_model_agents

Modular framework for building self-modeling artificial agents with explicit internal state representation and meta-cognitive capabilities. Includes RL, hybrid, and dummy policies with integrated SelfModel monitoring and scientific metrics.

Language: Python - Size: 4.03 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

opendilab/awesome-decision-transformer

A curated list of Decision Transformer resources (continually updated)

Size: 884 KB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 809 - Forks: 35

TarunNagarajan/RL

Tell me and I forget. Teach me and I may remember. Involve me and I learn. Welcome to the Reinforcement Learning Repository.

Language: Python - Size: 405 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

MouseTrap-codes/rlmathtoolkit

A math-faithful Python toolkit implementing reinforcement learning algorithms chapter-by-chapter from Sutton & Barto.

Language: Python - Size: 135 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

imagry/aleph_star

Reinforcement learning with A* and a deep heuristic

Language: Jupyter Notebook - Size: 55.7 MB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 294 - Forks: 36

MishaLaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Language: Python - Size: 2.16 MB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 592 - Forks: 90

NeuromatchAcademy/course-content-dl

NMA deep learning course

Language: Jupyter Notebook - Size: 594 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 787 - Forks: 281

uma-dev/chihuahua-game

Train your custom sprite-based character to hit the correct tile using RL

Language: Python - Size: 510 KB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

datascienceid/reinforcement-learning-resources

A curated list of awesome reinforcement courses, video lectures, books, library and many more.

Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 71 - Forks: 39

rlberry-py/rlberry

An easy-to-use reinforcement learning library for research and education.

Language: Python - Size: 18.6 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 169 - Forks: 30

fadeldnswr/learn-reinforcement-learning

Holds a repository for my journey of learning reinforcement learning algorithm

Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 1 - Forks: 0

david-abel/simple_rl

A simple framework for experimenting with Reinforcement Learning in Python.

Language: Python - Size: 3.92 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 316 - Forks: 104

ddihora1604/QuestX Fork of AyaanZ30/AIchemists_Datahack

QuestX is an AI-powered adaptive quiz platform that generates dynamic multimodal flashcards (text, audio, video) from unstructured inputs. Built with Python, React + Vite, and Flask, it uses reinforcement learning to adjust quiz difficulty in real-time and tailors study resource recommendations based on user performance.

Language: HTML - Size: 39.1 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

opendilab/awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

Size: 2.51 MB - Last synced at: 24 days ago - Pushed at: 30 days ago - Stars: 513 - Forks: 15

XinJingHao/TD3-BipedalWalkerHardcore-v2

Solve BipedalWalkerHardcore-v2 with TD3

Language: Python - Size: 15.3 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 17

Wu-Zongyu/Awesome-Large-Search-Models

Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search (Large search models).

Size: 89.8 KB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 108 - Forks: 4

nsidn98/InforMARL

Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation

Language: Python - Size: 18.5 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 115 - Forks: 22

FerhatAkalan/DroneDeliverySystemDQN

This project is an intelligent drone simulator that optimizes urban package deliveries using the Deep Q-Network (DQN) algorithm. Drones pick up packages from the cargo depot and learn the most efficient routes to deliver them to multiple delivery points through neural network-based decision making.

Language: Python - Size: 33.2 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

lisamandro/News-Recommendation-System-using-Reinforcement-Learning

Multi Armed Bandits implementation on MIND dataset.

Language: Python - Size: 49.4 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

yassinhawari/FreeHoopRL

# FreeHoopRLThis project uses a Deep Q-Network (DQN) algorithm to train an AI agent for shooting basketballs in a simple 2D environment. The agent learns to choose the right angle and force to score points, with results visualized through training and analysis graphs. 🎉🤖

Language: Python - Size: 17.6 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

paitayon/Awesome-Large-Search-Models

Awesome-Large-Search-Models is a collection of key papers and resources focused on search-oriented large language models. Explore methods, datasets, and tools to enhance your understanding and development of large search models. 🌟🐙

Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Evan09064/Thesis---3D-Wave-Shooter

Thesis project exploring an AI system that dynamically adapts game difficulty in real time by analyzing key player performance indicators (reaction times, accuracy, strategic choices). The system tailors the experience to each player's unique playstyle and counteracts it to encourage adaptation and skill evolution.

Language: C# - Size: 6.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

gseismic/rlearn_dev.py

A Reinforcement Learning Library [dev] DDPG,TD3,SAC,PPO and more

Language: Python - Size: 4.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Jason-CKY/DeepRL-pytorch

Pytorch implementations of various Deep Reinforcement Learning algorithms on pybullet environments.

Language: Python - Size: 258 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 30 - Forks: 6

gokulp01/bluerov2_gym

A Gymnasium environment for simulating and training reinforcement learning agents on the BlueROV2 underwater vehicle.

Language: Python - Size: 13.7 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 16 - Forks: 3

davidgeorgewilliams/JessicaRabbit-QLoRA-Axolotl

This comprehensive technical guide, developed at the request of OnlyFans founder, demonstrates advanced AI model fine-tuning methodologies to transform Qwen2-72b into a Jessica Rabbit personality emulation using cutting-edge QLoRA and ORPO techniques.

Size: 1.39 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

yunfeng-net/RL

Language: Python - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

danmcleran/tinymind Fork of intel/cppnnml

Tinymind is a Neural Network and Machine Learning project intended to provide a C++ template library for neural nets and machine learning algorithms within embedded systems.

Language: C++ - Size: 1.44 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 2

arianahejazyan/RL

Simple implementations of RL algorithms

Language: Jupyter Notebook - Size: 423 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

ChengshuLi/HRL4IN

Code for CoRL 2019 paper: HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators

Language: Python - Size: 83 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 63 - Forks: 13

Pyoussefpour/Self-Driving-AI-Car-No-Libraries-

DQN to drive a car through traffic - no libraries everything implemented from scratch

Language: JavaScript - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

xiangwang1223/kgpolicy

Reinforced Negative Sampling over Knowledge Graph for Recommendation, WWW2020

Language: Python - Size: 92.8 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 136 - Forks: 37

ahmadkh1995/Trash_Collector_Robot_RL

A small demo of RL algorithms on a trash collector robot

Language: Python - Size: 1.16 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Nara-On/DDPG

An implementation of DeepDeterministic Policy Gradient

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

smt970913/OCRL_discharge_decision_making

This is the Github code repository for the discharge decision-making project.

Language: Jupyter Notebook - Size: 7.77 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dvalenciar/ReinforceUI-Studio

ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC

Language: Python - Size: 21.2 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 65 - Forks: 3

varejad/behavioral_flow

Reinforcement learning library inspired by Behavioral Psychology, focused on simplicity. It uses Python dictionaries instead of complex tables, making implementation intuitive. Agents learn through reinforcement and punishment, considering context and consequences. They can also combine actions to create new behaviors.

Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

1jamesthompson1/dsunrise

D-Sunrise: A Dynamic Ensemble Algorithm for Deep Reinforcement Learning

Language: Python - Size: 376 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alison-carrera/onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

Language: Python - Size: 66.4 KB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 187 - Forks: 46

Allenpandas/2020-Reinforcement-Learning-Conferences-Papers 📦

The proceedings of top conference in 2020 on the topic of Reinforcement Learning (RL), including: AAAI, IJCAI, NeurIPS, ICML, ICLR, ICRA, AAMAS and more.

Size: 94.7 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

zhao-kun/rl-grpo

A reinforcement learning project demonstrate using GRPO to train an AI agent to play a fruits catching game.

Language: Python - Size: 5.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

maomao-zm/2D_Packing

2D_Packing via Deep Learning

Language: Python - Size: 50.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sozelfist/handson-ml3 Fork of ageron/handson-ml3

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Language: Jupyter Notebook - Size: 36.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

Language: Python - Size: 13.1 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 154 - Forks: 34