GitHub topics: q-learning
pintamonas4575/RL-model-The-Simpsons
Implementation of a Reinforcement Learning (RL) model that learns to scratch the smallest possible surface in the scratch-card game of the Badulaque in the app "The Simpsons Springfield".
Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: about 4 hours ago - Pushed at: about 4 hours ago - Stars: 1 - Forks: 0

khteh/pAIthon
Exploration of Python language features and AI/ML.
Language: Python - Size: 231 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

sanatren/SIMPLANG-q-LR
This repository contains an implementation of a neural text simplification model that combines sequence-to-sequence learning with reinforcement learning and lexical-semantic loss. The model aims to simplify complex text while maintaining meaning and grammatical correctness.
Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

qlan3/Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Language: Python - Size: 914 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 92 - Forks: 14

havva-nur-ezginci/ML-Applications
💡 This project involves the implementation of machine learning (ML) algorithms 💻🔍 on various datasets.
Language: Jupyter Notebook - Size: 4.08 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

mgiannopoulos24/Artificial-Intelligence
Solutions for the Projects of the Artificial Intelligence (CS 188) course of UC Berkeley
Language: Python - Size: 22.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 12 - Forks: 9

ivotints/Learn2Slither
A reinforcement learning project where a snake learns to navigate and survive in a dynamic environment through Q-learning.
Language: Python - Size: 17.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

ostad-ai/Reinforcement-Learning
This repository is about Reinforcement Learning (RL) and related topics
Language: Jupyter Notebook - Size: 174 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Prayag2003/across-protocol-discord-bot
R.O.S.S - Discord Bot for Across Protocol
Language: Python - Size: 21.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

AminHP/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Language: Python - Size: 3.64 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 2,246 - Forks: 483

XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language: Python - Size: 55.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2,216 - Forks: 282

Morri3/GymCar
This is the coursework of the COMP4125 in 2025.
Language: Python - Size: 99.5 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

shasankp000/AI-Player
A Minecraft mod which aims to add a "second player" into the game which will actually be intelligent.
Language: Java - Size: 2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 51 - Forks: 4

benedekrozemberczki/awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
Language: Python - Size: 238 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 673 - Forks: 74

pdoup/Q-Cobra
Advanced DQN Agent for Snake 🐍
Language: Python - Size: 49.8 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

datawhalechina/easy-rl
Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄); read online at: https://datawhalechina.github.io/easy-rl/
Language: Jupyter Notebook - Size: 516 MB - Last synced at: 12 days ago - Pushed at: 23 days ago - Stars: 10,882 - Forks: 2,000

vmayoral/basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 1,142 - Forks: 362

MorvanZhou/Reinforcement-learning-with-tensorflow
Simple reinforcement learning tutorials; 莫烦Python (Morvan Python) AI teaching in Chinese
Language: Python - Size: 428 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 9,143 - Forks: 5,029

utkuatasoy/Othello-Reversi-AI
This project presents an AI-driven Othello (Reversi) game designed to compare multiple algorithms, including Minimax, MCTS, A*, Q-Learning, and Negamax. It demonstrates various difficulty levels, which introduce controlled randomness to the AI for a more dynamic gameplay experience.
Language: Python - Size: 1010 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

Guille-ux/ux-framework
a framework with a lot of things, and they work
Language: Python - Size: 20.4 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

reshalfahsi/swinging-up-acrobot
Swinging Up Acrobot with n-Step Q-Learning
Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

sudharsan13296/Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: 15 days ago - Pushed at: about 4 years ago - Stars: 409 - Forks: 136

zw007981/BasicRLAlgo
A Python library that implements various reinforcement learning algorithms using PyTorch and Gymnasium
Language: Python - Size: 32.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

nuno-faria/tetris-ai
A deep reinforcement learning bot that plays tetris
Language: Python - Size: 8.73 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 294 - Forks: 72

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

Ofekirsh/RL-FlappyBird
Reinforcement Learning agent that plays Flappy Bird using Q-Learning and SARSA, implemented from scratch. Includes a custom environment, tabular Q-table updates, and performance visualization.
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

odinhg/Q-Learning-Tutorial
This is an introductory tutorial to tabular Q-learning in Norwegian.
Language: TeX - Size: 3.38 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Paperspace/DinoRunTutorial
Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"
Language: Jupyter Notebook - Size: 405 KB - Last synced at: 14 days ago - Pushed at: almost 5 years ago - Stars: 327 - Forks: 103

lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
Language: Python - Size: 1.56 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 370 - Forks: 21

PeterBaptista/Q-learn-training
This project implements a reinforcement learning agent that uses Q-Learning to learn to navigate a platform-based environment. The agent interacts with the game through a socket connection and tries to maximize its score over the episodes.
Language: ASP.NET - Size: 0 Bytes - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

stratisMarkou/sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
Language: Jupyter Notebook - Size: 44.8 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 25 - Forks: 15

suraj5424/Q-Learning-for-Blackjack-in-different-environments
This repository implements Q-Learning in Blackjack, comparing it with random action selection and basic strategies. Includes experiments with various strategies, rule variations, and deck numbers to evaluate performance.
Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

Chaoukia/Reinforcement-Learning-course
A Reinforcement Learning course with classic examples of agents trained on gym environments.
Language: Python - Size: 1.69 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 3 - Forks: 1

NeymarL/Pacman-RL
Implement some reinforcement learning algorithms, test and visualize on Pacman.
Language: Python - Size: 7.26 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 2

MatiasCarabella/self-driving-ai
2D self-driving car simulation developed in Python using Pygame, featuring a Q-learning agent that learns to navigate a circuit.
Language: Python - Size: 374 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

HosseinAtrsaei/Reinforcement-Learning-Schedulers
A reinforcement learning project for centralized scheduling in communication networks. Implements classical dynamic programming (Value Iteration, Policy Iteration) and modern RL methods (Q-Learning, Deep Q-Networks) to optimize user transmission scheduling with constraints on delay, energy, and communication quality.
Language: Python - Size: 447 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Language: Jupyter Notebook - Size: 151 KB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 751 - Forks: 174

kkm24132/ReinforcementLearning
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
Language: Jupyter Notebook - Size: 7.55 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 7 - Forks: 3

chen0040/java-reinforcement-learning
Package provides Java implementations of reinforcement learning algorithms such as Q-Learn, R-Learn, SARSA, Actor-Critic
Language: Java - Size: 154 KB - Last synced at: 17 days ago - Pushed at: almost 6 years ago - Stars: 126 - Forks: 41

StarLight1212/self_play
Self play strategy for all interesting games.
Language: Jupyter Notebook - Size: 206 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

aryankeluskar/irina.minesweeper
AI that solves Minesweeper. Uses a probabilistic approach, scans the grid, trains a Reinforcement Learning model to take decisions based on the existing grid, and marks the grid with mines to complete the game.
Language: TypeScript - Size: 14.6 MB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

AlinaBaber/Robotic-Path-Tracking-with-Reinforcement-Learning-Q-Learning-and-SARSA
The project titled "Robotic Path Tracking with Q-Learning and SARSA" focuses on the application of reinforcement learning to the task of robotic path tracking.
Language: Python - Size: 11.2 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

MehdiShahbazi/DQN-Mountain-Car-Gymnasium
This repo implements Deep Q-Network (DQN) for solving the Mountain Car v0 environment (discrete version) of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with a custom reward function for faster convergence.
Language: Python - Size: 1020 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

MehdiShahbazi/DQN-Cliff-Walking-Gymnasium
This repo implements Deep Q-Network (DQN) for solving the Cliff Walking v0 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with the finest tuning.
Language: Python - Size: 2.88 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

MehdiShahbazi/DQN-Frozenlake-Gymnasium
This repo implements Deep Q-Network (DQN) for solving the Frozenlake-v1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 in both 4x4 and 8x8 map sizes.
Language: Python - Size: 1.32 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 2

ductaingn/Power-Allocation
Power Allocation in Sub6GHz/mmWave Networks with Risk-Averse Reinforcement Learning
Language: Python - Size: 601 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5 - Forks: 1

cool-RR/marley
A framework for multi-agent reinforcement learning.
Language: Python - Size: 30.7 MB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 272 - Forks: 26

gsurma/deep_traffic
MIT DeepTraffic top 2% solution (75.01 mph) 🚗.
Language: JavaScript - Size: 5.81 MB - Last synced at: 20 days ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 11

Urinx/ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
Language: Python - Size: 118 MB - Last synced at: 11 days ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

ravi72munde/Chrome-Dino-Reinforcement-Learning
An RL implementation in Keras
Language: Jupyter Notebook - Size: 138 MB - Last synced at: 18 days ago - Pushed at: almost 7 years ago - Stars: 102 - Forks: 21

accel-brain/accel-brain-code
The purpose of this repository is to build prototypes as case studies in the context of proof of concept (PoC) and research and development (R&D) that I have written about on my website. The main research topics are Auto-Encoders in relation to representation learning, statistical machine learning for energy-based models, generative adversarial networks (GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language models for natural language processing.
Language: Python - Size: 98.3 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 314 - Forks: 91

lufficc/dqn
Implementation of q-learning using TensorFlow
Language: Python - Size: 508 KB - Last synced at: 28 days ago - Pushed at: almost 8 years ago - Stars: 58 - Forks: 20

mpatacchiola/dissecting-reinforcement-learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Language: Python - Size: 28.1 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 615 - Forks: 179

VinF/deer
DEEp Reinforcement learning framework
Language: Python - Size: 12.6 MB - Last synced at: 28 days ago - Pushed at: 12 months ago - Stars: 484 - Forks: 124

rfeinman/tictactoe-reinforcement-learning
Train a tic-tac-toe agent using reinforcement learning.
Language: Python - Size: 549 KB - Last synced at: 15 days ago - Pushed at: about 4 years ago - Stars: 59 - Forks: 24

djhepker/java-q-learning
Q-learning dependency for Java projects. Made to be lightweight & easy to use. Instructions in README for how to utilize the Agent. Open source.
Language: Java - Size: 124 KB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

PimLb/olfactory-navigation
A repository for everything olfactory navigation related.
Language: Python - Size: 472 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

pskrunner14/trading-bot
Stock Trading Bot using Deep Q-Learning
Language: Jupyter Notebook - Size: 9.42 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 1,032 - Forks: 349

pcluteijn/MazeLearning
Q-Learning algorithm solves simple mazes.
Language: MATLAB - Size: 230 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

AbdullahTariqCS/QRouting-implementation
A simulation of AI-powered Q-routing for UAV swarm networks, designed to optimize communication, reduce latency, and adapt to dynamic network conditions.
Language: Python - Size: 402 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

flakas/bubble-shooter-bot
Tensorflow and Q-Learning based Python bot for the Bubble Shooter flash game
Language: Python - Size: 61.5 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

djhepker/ai-checkers
Checkers with Data Structures and AI integration. Q-Learning. Custom linked lists, Java records
Language: Java - Size: 44.4 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

ZenSam7/Easy_My_AI
A library for creating AI, written ENTIRELY from scratch
Language: Python - Size: 292 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

KyleCummins/Mancala
A new version of my mancala AI project, rebuilt from the ground up in C++ utilizing object oriented design principles.
Language: C++ - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

TimKoornstra/automatic-piano-fingering
This repository contains the code for automatically generating piano fingerings using a reinforcement learning agent that uses Q-Learning.
Language: Python - Size: 56.6 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++ - Size: 20.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 58 - Forks: 9

byrnesy924/Yahtzee_Deep_Learning_Agent
This project is a Double Deep Q learning Agent that learns to play the dice game Yahtzee
Language: Python - Size: 1.02 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

WilliamZhang20/Cart-Pole-DQN
A Deep-Q Network (DQN) agent trained to balance a cartpole in the Gymnasium simulation environment
Language: Python - Size: 651 KB - Last synced at: 26 days ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

DavideDevetak24/Reinforcement-Learning-for-Algorithmic-Trading-A-Q-Learning-Approach-to-Strategy-Optimization
This project analyzes two algorithmic trading strategies: the first one is a simple rule-based strategy, the second one leverages a Reinforcement Learning (RL) approach using Q-Learning
Language: R - Size: 332 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Lucien-MG/deep-rl-hugging-face
Python implementation of Hugging Face's Reinforcement Learning tutorial.
Language: Python - Size: 1.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

kvba1337/FlappyBirdAI
FlappyBirdAI - AI agents master Flappy Bird using Q-Learning and Neural Networks with genetic algorithms
Language: Python - Size: 7.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Anroshka/snake-ai
🐍 A Snake game AI that learns to play through Deep Q-Learning. Built with PyTorch and Pygame, featuring CUDA acceleration and real-time visualization of the learning process.
Language: Python - Size: 166 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 0

Devinterview-io/q-learning-interview-questions
🟣 Q-Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Size: 16.6 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 4

Sea-Snell/Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
Language: Python - Size: 1.14 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 205 - Forks: 18

FelipeMarcelino/2048-DDQN-PER-Reinforcement-Learning
This project aims to apply Dueling Deep Q-Learning with Prioritized Experience Replay to play the game 2048
Language: Python - Size: 191 KB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 1

JiguangLi/deep_CAT
Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

AxelBcr/Reinforcement-Learning-Navigating-Drone
Drone Project Using Q-Learning: Helping a Drone Find a Target
Language: Python - Size: 147 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

fahmizainal17/Reinforcement_Learning_Project
This project runs on Gradio under the name Q-Learning GridWorld Simulator
Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

onekyn/qsnake
An AI agent that learns to play snake using the Q-learning algorithm
Language: C - Size: 31.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

TimHanewich/tetris-ai-mini
Training a neural network (AI) to play a very simplified game of 4x4 Tetris using Q-Learning.
Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

Echo24h/Learn2Slither
An innovative Q-learning project focused on autonomous decision-making in a dynamic environment.
Language: Python - Size: 2.69 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

avkhimen/Electric_Vehicle_Charging_Simulation
This project implements Q-Learning to find the optimal policy for charging and discharging electric vehicles in a V2G scheme under conditions of uncertain commitment of EV owners. The problem is modelled as a multi-objective multi-agent cooperative game. The project is part of the fulfillment criteria for the ECE 730 course at the University of Alberta.
Language: Python - Size: 1.09 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 47 - Forks: 5

laroccacharly/reinforcement_learning_adventure
A compilation of reinforcement learning algorithms.
Language: Python - Size: 113 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

jihoonerd/rl-maze
Simple maze solver by reinforcement learning
Language: Python - Size: 1.54 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 7 - Forks: 1

taka-rl/tic-tac-toe_q_learning
tic-tac-toe with q-learning
Language: Python - Size: 1.31 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

entrpn/fingym
A tool for developing reinforcement learning algorithms focused on stock prediction
Language: Python - Size: 20.7 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 5

lucidrains/llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
Language: Python - Size: 26.4 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 163 - Forks: 8

Harish-Balaji-B/Music-Recommendation-Using-Q-Learning-And-Deep-Reinforcement-Learning
This repository compares two methodologies for music recommendation: Q-learning and Deep Reinforcement Learning (Dueling DQN), applied to a dataset of music tracks with features like genre, artist, and danceability. The goal is to build a system that recommends music based on user preferences.
Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

Pirito10/CannonDuel-PSI-UVigo
Android Turn-Based Game project for the course "Programación de Sistemas Inteligentes" in the Telecommunications Engineering Degree at the Universidad de Vigo (2024 - 2025)
Language: Kotlin - Size: 3.24 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

misaghsoltani/DeepCubeAI
Learning Discrete World Models for Heuristic Search
Language: Python - Size: 16.6 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 1

javrui/nim
Q-Learning-based Python script for playing the Nim game
Language: Python - Size: 655 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Ashad001/2Q48
This project compares heuristic, classical reinforcement learning (RL), and quantum RL approaches to evaluate their efficiency, decision-making strategies, and performance in a grid-based environment.
Language: Python - Size: 1.47 MB - Last synced at: 23 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

viznuv/Pricing_models_dynamic
This project simulates driver allocation and surge pricing using Reinforcement Learning (Q-learning). The simulation models driver behavior, demand fluctuations, and pricing dynamics across multiple routes.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

deepbiolab/drl
Implementation of deep reinforcement learning
Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

PsorTheDoctor/ludo-rl
Q-learning and SARSA playing ludo.
Language: Python - Size: 544 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

MarioFiorino/Tutorial-Reinforcement-Learning-ITA-Python
This repository contains a collection of tutorials on the basics of Reinforcement Learning, developed in Python, entirely in Italian.
Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 2

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning
Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 25 days ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

Sahilgill24/ArbiCredit
On-chain AI agent built for dynamic collateralization on Arbitrum Stylus
Language: TypeScript - Size: 3.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

data-intelligence-for-health-lab/RL4CAD
RL4CAD: Personalized Decision Making for Coronary Artery Disease Treatment using Offline Reinforcement Learning
Language: Python - Size: 113 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

Paraskevi-KIvroglou/rl-pong-agent
A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.
Language: Jupyter Notebook - Size: 92.6 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0
