An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: q-learning

pintamonas4575/RL-model-The-Simpsons

Implementation of a Reinforcement Learning (RL) model to learn to scratch the less possible surface on the scratch game of the Badulaque of the app "The Simpsons Springfield".

Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: about 4 hours ago - Pushed at: about 4 hours ago - Stars: 1 - Forks: 0

khteh/pAIthon

Exploration of python language features and AI/ML.

Language: Python - Size: 231 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

sanatren/SIMPLANG-q-LR

Overview This repository contains an implementation of a neural text simplification model that combines sequence-to-sequence learning with reinforcement learning and lexical-semantic loss. The model aims to simplify complex text while maintaining meaning and grammatical correctness.

Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

qlan3/Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Language: Python - Size: 914 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 92 - Forks: 14

havva-nur-ezginci/ML-Applications

💡This project involves the implementation of machine learning (ML) algorithms 💻🔍on various datasets.

Language: Jupyter Notebook - Size: 4.08 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

mgiannopoulos24/Artificial-Intelligence

Solutions for the Projects of the Artificial Intelligence (CS 188) course of UC Berkeley

Language: Python - Size: 22.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 12 - Forks: 9

ivotints/Learn2Slither

A reinforcement learning project where a snake learns to navigate and survive in a dynamic environment through Q-learning.

Language: Python - Size: 17.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

ostad-ai/Reinforcement-Learning

This repository is about Reinforcement Learning (RL) and related topics

Language: Jupyter Notebook - Size: 174 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Prayag2003/across-protocol-discord-bot

R.O.S.S - Discord Bot for Across Protocol

Language: Python - Size: 21.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

AminHP/gym-anytrading

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

Language: Python - Size: 3.64 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 2,246 - Forks: 483

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Language: Python - Size: 55.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2,216 - Forks: 282

Morri3/GymCar

This is the coursework of the COMP4125 in 2025.

Language: Python - Size: 99.5 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

shasankp000/AI-Player

A minecraft mod which aims to add a "second player" into the game which will actually be intelligent.

Language: Java - Size: 2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 51 - Forks: 4

benedekrozemberczki/awesome-monte-carlo-tree-search-papers

A curated list of Monte Carlo tree search papers with implementations.

Language: Python - Size: 238 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 673 - Forks: 74

pdoup/Q-Cobra

Advanced DQN Agent for Snake 🐍

Language: Python - Size: 49.8 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

datawhalechina/easy-rl

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Language: Jupyter Notebook - Size: 516 MB - Last synced at: 12 days ago - Pushed at: 23 days ago - Stars: 10,882 - Forks: 2,000

vmayoral/basic_reinforcement_learning

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 1,142 - Forks: 362

MorvanZhou/Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Language: Python - Size: 428 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 9,143 - Forks: 5,029

utkuatasoy/Othello-Reversi-AI

This project presents an AI-driven Othello (Reversi) game designed to compare multiple algorithms, including Minimax, MCTS, A*, Q-Learning, and Negamax. It demonstrates various difficulty levels, which introduce controlled randomness to the AI for a more dynamic gameplay experience.

Language: Python - Size: 1010 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

Guille-ux/ux-framework

a framework with a lot of things, and they work

Language: Python - Size: 20.4 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

reshalfahsi/swinging-up-acrobot

Swinging Up Acrobot with n-Step Q-Learning

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

sudharsan13296/Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: 15 days ago - Pushed at: about 4 years ago - Stars: 409 - Forks: 136

zw007981/BasicRLAlgo

a Python library that implements various reinforcement learning algorithms using PyTorch and Gymnasium

Language: Python - Size: 32.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

nuno-faria/tetris-ai

A deep reinforcement learning bot that plays tetris

Language: Python - Size: 8.73 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 294 - Forks: 72

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 847 - Forks: 325

Ofekirsh/RL-FlappyBird

Reinforcement Learning agent that plays Flappy Bird using Q-Learning and SARSA, implemented from scratch. Includes a custom environment, tabular Q-table updates, and performance visualization.

Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

odinhg/Q-Learning-Tutorial

This is an introductory tutorial to tabular Q-learning in Norwegian.

Language: TeX - Size: 3.38 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Paperspace/DinoRunTutorial

Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"

Language: Jupyter Notebook - Size: 405 KB - Last synced at: 14 days ago - Pushed at: almost 5 years ago - Stars: 327 - Forks: 103

lucidrains/q-transformer

Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind

Language: Python - Size: 1.56 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 370 - Forks: 21

PeterBaptista/Q-learn-training

Este projeto implementa um agente de aprendizado por reforço usando Q-Learning para aprender a navegar por um ambiente baseado em plataformas. O agente interage com o jogo por meio de uma conexão via socket e tenta maximizar sua pontuação ao longo dos episódios.

Language: ASP.NET - Size: 0 Bytes - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

stratisMarkou/sample-efficient-bayesian-rl

Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL

Language: Jupyter Notebook - Size: 44.8 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 25 - Forks: 15

suraj5424/Q-Learning-for-Blackjack-in-different-environments

This repository implements Q-Learning in Blackjack, comparing it with random action selection and basic strategies. Includes experiments with various strategies, rule variations, and deck numbers to evaluate performance.

Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

Chaoukia/Reinforcement-Learning-course

A Reinforcement Learning course with classic examples of agents trained on gym environments.

Language: Python - Size: 1.69 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 3 - Forks: 1

NeymarL/Pacman-RL

Implement some reinforcement learning algorithms, test and visualize on Pacman.

Language: Python - Size: 7.26 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 2

MatiasCarabella/self-driving-ai

2D self-driving car simulation developed in Python using Pygame, featuring a Q-learning agent that learns to navigate a circuit.

Language: Python - Size: 374 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

HosseinAtrsaei/Reinforcement-Learning-Schedulers

A reinforcement learning project for centralized scheduling in communication networks. Implements classical dynamic programming (Value Iteration, Policy Iteration) and modern RL methods (Q-Learning, Deep Q-Networks) to optimize user transmission scheduling with constraints on delay, energy, and communication quality.

Language: Python - Size: 447 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

omerbsezer/Reinforcement_learning_tutorial_with_demo 📦

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

Language: Jupyter Notebook - Size: 151 KB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 751 - Forks: 174

kkm24132/ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

Language: Jupyter Notebook - Size: 7.55 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 7 - Forks: 3

chen0040/java-reinforcement-learning

Package provides java implementation of reinforcement learning algorithms such Q-Learn, R-Learn, SARSA, Actor-Critic

Language: Java - Size: 154 KB - Last synced at: 17 days ago - Pushed at: almost 6 years ago - Stars: 126 - Forks: 41

StarLight1212/self_play

Self play strategy for all interesting games.

Language: Jupyter Notebook - Size: 206 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

aryankeluskar/irina.minesweeper

AI that solves Minesweeper. Uses a probabilistic approach, scans the grid, trains a Reinforcement Learning model to take decisions based on the existing grid, and marks the grid with mines to complete the game.

Language: TypeScript - Size: 14.6 MB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

AlinaBaber/Robotic-Path-Tracking-with-Reinforcement-Learning-Q-Learning-and-SARSA

The project titled "Robotic Path Tracking with Q-Learning and SARSA" focuses on the application of reinforcement learning to the task of robotic path tracking.

Language: Python - Size: 11.2 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

MehdiShahbazi/DQN-Mountain-Car-Gymnasium

This repo implements Deep Q-Network (DQN) for solving the Mountain Car v0 environment (discrete version) of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with a custom reward function for faster convergence.

Language: Python - Size: 1020 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

MehdiShahbazi/DQN-Cliff-Walking-Gymnasium

This repo implements Deep Q-Network (DQN) for solving the Cliff Walking v0 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with the finest tuning.

Language: Python - Size: 2.88 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

MehdiShahbazi/DQN-Frozenlake-Gymnasium

This repo implements Deep Q-Network (DQN) for solving the Frozenlake-v1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 in both 4x4 and 8x8 map sizes.

Language: Python - Size: 1.32 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 2

ductaingn/Power-Allocation

Power Allocation in Sub6GHz/mmWave Networks with Risk-Averse Reinforcement Learning

Language: Python - Size: 601 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5 - Forks: 1

cool-RR/marley

A framework for multi-agent reinforcement learning.

Language: Python - Size: 30.7 MB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 272 - Forks: 26

gsurma/deep_traffic

MIT DeepTraffic top 2% solution (75.01 mph) 🚗.

Language: JavaScript - Size: 5.81 MB - Last synced at: 20 days ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 11

Urinx/ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

Language: Python - Size: 118 MB - Last synced at: 11 days ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

ravi72munde/Chrome-Dino-Reinforcement-Learning

An RL implementation in Keras

Language: Jupyter Notebook - Size: 138 MB - Last synced at: 18 days ago - Pushed at: almost 7 years ago - Stars: 102 - Forks: 21

accel-brain/accel-brain-code

The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.

Language: Python - Size: 98.3 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 314 - Forks: 91

lufficc/dqn

Implementation of q-learning using TensorFlow

Language: Python - Size: 508 KB - Last synced at: 28 days ago - Pushed at: almost 8 years ago - Stars: 58 - Forks: 20

mpatacchiola/dissecting-reinforcement-learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Language: Python - Size: 28.1 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 615 - Forks: 179

VinF/deer

DEEp Reinforcement learning framework

Language: Python - Size: 12.6 MB - Last synced at: 28 days ago - Pushed at: 12 months ago - Stars: 484 - Forks: 124

rfeinman/tictactoe-reinforcement-learning

Train a tic-tac-toe agent using reinforcement learning.

Language: Python - Size: 549 KB - Last synced at: 15 days ago - Pushed at: about 4 years ago - Stars: 59 - Forks: 24

djhepker/java-q-learning

Q-learning dependency for Java projects. Made to be lightweight & easy to use. Instructions in README for how to utilize the Agent. Open source.

Language: Java - Size: 124 KB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

PimLb/olfactory-navigation

A repository for everything olfactory navigation related.

Language: Python - Size: 472 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

pskrunner14/trading-bot

Stock Trading Bot using Deep Q-Learning

Language: Jupyter Notebook - Size: 9.42 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 1,032 - Forks: 349

pcluteijn/MazeLearning

Q-Learing algorithm solves simple mazes.

Language: MATLAB - Size: 230 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

AbdullahTariqCS/QRouting-implementation

A simulation of AI-powered Q-routing for UAV swarm networks, designed to optimize communication, reduce latency, and adapt to dynamic network conditions.

Language: Python - Size: 402 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

flakas/bubble-shooter-bot

Tensorflow and Q-Learning based Python bot for the Bubble Shooter flash game

Language: Python - Size: 61.5 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

djhepker/ai-checkers

Checkers with Data Structures and AI integration. Q-Learning. Custom linked lists, Java records

Language: Java - Size: 44.4 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

ZenSam7/Easy_My_AI

Библиотека для создания ИИ, написана ПОЛНОСТЬЮ с нуля

Language: Python - Size: 292 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

KyleCummins/Mancala

A new version of my mancala AI project, rebuilt from the ground up in C++ utilizing object oriented design principles.

Language: C++ - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

TimKoornstra/automatic-piano-fingering

This repository contains the code for automatically generating piano fingerings using a reinforcement learning agent that uses Q-Learning.

Language: Python - Size: 56.6 KB - Last synced at: 22 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

iisys-hof/map-matching-2

High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).

Language: C++ - Size: 20.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 58 - Forks: 9

byrnesy924/Yahtzee_Deep_Learning_Agent

This project is a Double Deep Q learning Agent that learns to play the dice game Yahtzee

Language: Python - Size: 1.02 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

WilliamZhang20/Cart-Pole-DQN

A Deep-Q Network (DQN) agent trained to balance a cartpole in the Gymnasium simulation environment

Language: Python - Size: 651 KB - Last synced at: 26 days ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

DavideDevetak24/Reinforcement-Learning-for-Algorithmic-Trading-A-Q-Learning-Approach-to-Strategy-Optimization

This project analyzes two algorithmic trading strategies: the first one is a simple rule-based strategy, the second one leverages a Reinforcement Learning (RL) approach using Q-Learning

Language: R - Size: 332 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Lucien-MG/deep-rl-hugging-face

Python implementation of the Hugging Face's Reinforcement Learning tutorial.

Language: Python - Size: 1.59 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

kvba1337/FlappyBirdAI

FlappyBirdAI - AI agents master Flappy Bird using Q-Learning and Neural Networks with genetic algorithms

Language: Python - Size: 7.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Anroshka/snake-ai

🐍 A Snake game AI that learns to play through Deep Q-Learning. Built with PyTorch and Pygame, featuring CUDA acceleration and real-time visualization of the learning process.

Language: Python - Size: 166 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 0

Devinterview-io/q-learning-interview-questions

🟣 Q-Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.

Size: 16.6 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 4

Sea-Snell/Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language: Python - Size: 1.14 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 205 - Forks: 18

FelipeMarcelino/2048-DDQN-PER-Reinforcement-Learning

This project aims apply Dueling Deep Q Learning with Prioritized experience to play game 2048

Language: Python - Size: 191 KB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 1

JiguangLi/deep_CAT

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

AxelBcr/Reinforcement-Learning-Navigating-Drone

Drone Project Using Q-Learning : Helping a Drone find a target

Language: Python - Size: 147 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

fahmizainal17/Reinforcement_Learning_Project

This project are running on Gradio under the name of Q-Learning GridWorld Simulator

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

onekyn/qsnake

An AI agent that learns to play snake using the Q-learning algorithm

Language: C - Size: 31.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

TimHanewich/tetris-ai-mini

Training a neural network (AI) to play a very simplified game of 4x4 Tetris using Q-Learning.

Language: Python - Size: 41 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

Echo24h/Learn2Slither

Un projet innovant de Q-learning axé sur la prise de décision autonome dans un environnement dynamique.

Language: Python - Size: 2.69 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

avkhimen/Electric_Vehicle_Charging_Simulation

This project implements Q-Learning to find the optimal policy for charging and discharging electric vehicles in a V2G scheme under conditions of uncertain commitment of EV owners. The problem is modelled as a multi-objective multi-agent cooperative game. Project is part of fulfillment criteria for ECE 730 course at the University of Alberta.

Language: Python - Size: 1.09 MB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 47 - Forks: 5

laroccacharly/reinforcement_learning_adventure

A compilation of reinforcement learning algorithms.

Language: Python - Size: 113 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

jihoonerd/rl-maze

Simple maze solver by reinforcement learning

Language: Python - Size: 1.54 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 7 - Forks: 1

taka-rl/tic-tac-toe_q_learning

tic-tac-toe with q-learning

Language: Python - Size: 1.31 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

entrpn/fingym

A tool for developing reinforcement learning algorithms focused in stock prediction

Language: Python - Size: 20.7 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 5

lucidrains/llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning

Language: Python - Size: 26.4 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 163 - Forks: 8

Harish-Balaji-B/Music-Recommendation-Using-Q-Learning-And-Deep-Reinforcement-Learning

This repository compares two methodologies for music recommendation: Q-learning and Deep Reinforcement Learning (Dueling DQN), applied to a dataset of music tracks with features like genre, artist, and danceability. The goal is to build a system that recommends music based on user preferences.

Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

Pirito10/CannonDuel-PSI-UVigo

Android Turn-Based Game project for the course "Programación de Sistemas Inteligentes" in the Telecommunications Engineering Degree at the Universidad de Vigo (2024 - 2025)

Language: Kotlin - Size: 3.24 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

misaghsoltani/DeepCubeAI

Learning Discrete World Models for Heuristic Search

Language: Python - Size: 16.6 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 5 - Forks: 1

javrui/nim

Q-Learning-based Python script for playing the Nim game

Language: Python - Size: 655 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Ashad001/2Q48

This project compares heuristic, classical reinforcement learning (RL), and quantum RL approaches to evaluate their efficiency, decision-making strategies, and performance in a grid-based environment.

Language: Python - Size: 1.47 MB - Last synced at: 23 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

viznuv/Pricing_models_dynamic

This project simulates driver allocation and surge pricing using Reinforcement Learning (Q-learning). The simulation models driver behavior, demand fluctuations, and pricing dynamics across multiple routes.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

deepbiolab/drl

Implementation of deep reinforcement learning

Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

PsorTheDoctor/ludo-rl

Q-learning and SARSA playing ludo.

Language: Python - Size: 544 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

MarioFiorino/Tutorial-Reinforcement-Learning-ITA-Python

In questa repository una collezione di tutorial sulle basi del Reinforcement Learning, sviluppati in Python, interamente in italiano.

Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 2

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning

Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 25 days ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

Sahilgill24/ArbiCredit

On chain AI agent built for Dynamic Collateralization on Arbitrum stylus

Language: TypeScript - Size: 3.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

data-intelligence-for-health-lab/RL4CAD

RL4CAD: Personalized Decision Making for Coronary Artery Disease Treatment using Offline Reinforcement Learning

Language: Python - Size: 113 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

Paraskevi-KIvroglou/rl-pong-agent

A project by Paraskevi Kivroglou as part of exploring deep reinforcement learning applications.

Language: Jupyter Notebook - Size: 92.6 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Related Keywords
q-learning 1,180 reinforcement-learning 811 machine-learning 229 python 218 deep-learning 131 artificial-intelligence 129 deep-reinforcement-learning 127 sarsa 120 ai 108 dqn 97 openai-gym 83 reinforcement-learning-algorithms 75 pytorch 73 deep-q-learning 69 neural-network 64 value-iteration 60 policy-gradient 55 tensorflow 55 deep-q-network 55 policy-iteration 54 markov-decision-processes 50 python3 47 gym 41 pygame 39 q-learning-algorithm 37 sarsa-learning 37 rl 35 actor-critic 33 genetic-algorithm 32 tic-tac-toe 31 monte-carlo 28 epsilon-greedy 28 keras 28 dynamic-programming 27 game 27 neural-networks 25 reinforcement-learning-agent 23 gym-environment 21 numpy 21 agent 21 frozenlake 20 ppo 20 temporal-differencing-learning 20 mdp 19 qlearning 18 machine-learning-algorithms 17 simulation 17 java 17 sarsa-lambda 17 double-dqn 17 ddpg 17 jupyter-notebook 17 reinforcement-learning-environments 17 maze-solver 16 openai 16 snake-game 16 alpha-beta-pruning 15 double-q-learning 15 minimax 15 robotics 14 bellman-equation 14 td-learning 14 q-learning-vs-sarsa 14 dqn-pytorch 14 td3 13 atari 13 flappy-bird 13 cnn 13 maze 13 deep-neural-networks 13 monte-carlo-methods 12 monte-carlo-tree-search 12 tictactoe 12 dueling-dqn 12 expected-sarsa 12 gymnasium 12 multi-agent-reinforcement-learning 12 artificial-neural-networks 12 javascript 11 reinforce 11 convolutional-neural-networks 11 cpp 11 prioritized-experience-replay 11 grid-world 11 ml 11 minimax-algorithm 11 game-theory 10 cartpole 10 keras-tensorflow 10 gridworld 10 mountain-car 10 snake 10 optimization 9 matplotlib 9 atari-games 9 self-driving-car 9 q-table 9 c 9 cartpole-v1 9 blackjack 9