gitlab.com topics: reinforcement learning
DracoStriker/pokemon-vgc-engine
The VGC AI Framework aims to emulate the Esports scenario of human video game championships of Pokémon with AI agents, including the game balance aspect.
Last synced at: almost 2 years ago - Stars: 0 - Forks: 0

DracoStriker/multiplayer-difficulty-balance-rl
An RL framework for multiplayer dynamic difficulty adjustment.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0
da_doomer/pylic
Program analysis-based optimization in differentiable simulators.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0
moda707/vrpvcsd-of
This project adopts Reinforcement Learning to solve a variant of the stochastic VRP, where customers are highly variable, demands are stochastic, and vehicles have a duration limit.
Last synced at: about 2 years ago - Stars: 1 - Forks: 0
DracoStriker/simplified-pokemon-environment
The Simple Pokémon Environment is an AI environment compliant with gym that allows for training Reinforcement Learning agents in a simplified version of Pokémon Battles.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0

z00logist/rugpt3-movie-critic
Fine-tuning several language models to generate sentiment-controlled (positive, negative, neutral) movie reviews in Russian.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0
MariusLC/internship-ethical-morl
End-of-studies internship on developing a multi-objective reinforcement learning model to address ethical problems. Supervisors: Aurélie Beynier, Nicolas Maudet, Paolo Viappiani. Dates: March - September 2022.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0

IlyaOrson/BatchReactor
Part of the code experiments related to this publication: https://www.sciencedirect.com/science/article/pii/B9780128186343501545
Last synced at: over 2 years ago - Stars: 1 - Forks: 0

Ch_V/cart_pole
Example of CartPole task solution with random search, noise hill-climbing, Q-net training and policy gradient approaches. Use `play_result.py` to visualize the result.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/cliff_walking
Solution of Cliff Walking problem with different variants of Q-learning and SARSA algorithms.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

renke.kuhlmann/reinforcement-learning-for-nonlinear-programming
This project aims to learn adaptive parameter updates of nonlinear programming solvers by using reinforcement learning techniques.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0


CavenaghiEmanuele/reinforcementlearninglibrary
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/frozen_lake
Simple example of FrozenLake task solution with value/strategy iteration, Monte-Carlo methods and a genetic algorithm.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

vg_tum/graph-boc
Source Code for Gabler et al. "Bayesian Optimization with Unknown Constraints in Graphical Skill-Models for Compliant Manipulation Tasks Using an Industrial Robot", Frontiers in Robotics and AI, 2022
Last synced at: almost 2 years ago - Stars: 0 - Forks: 0

ComputationalScience/epidemic-control
This project provides optimal control and reinforcement learning implementations to optimize vaccination and testing resource allocation problems on networks.
Last synced at: about 1 year ago - Stars: 0 - Forks: 0
netmode/arrle
RL-driven Interactive Recommender based on SocioEmotional Behavioural Models
Last synced at: 4 months ago - Stars: 3 - Forks: 0

TING2938/rlcpp
Reinforcement Learning Algorithms with C++
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
clengele/egt-presentation-commonpoolresources
This project was an attempt to reproduce the work published by DeepMind for the paper https://arxiv.org/pdf/1707.06600.pdf . This reproduction was part of the course INFO-F409: Learning Dynamics.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

TING2938/gym_cpp
C++ high-performance gym environment framework
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
omidmohaddesi/pomdp_belief_learning
Imitating the behavior of Gamette players in their supply chain decisions.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
alesniewski123/al_frozenlake
Reinforcement Learning (Artificial Intelligence) Algorithm for solving OpenCV problem
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

team4hire-open-source/thesis_2
Multi-agent implementations of the Reinforcement Learning algorithm Proximal Policy Optimization.
Last synced at: over 2 years ago - Stars: 2 - Forks: 0

schalappe/deep-double-q-learning-minigrid
Minigrid is a simulation of the world in the form of a grid. The goal is to train an agent to understand textual instructions.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

ai-in-game/monsterai
An attempt at developing a learning agent to train an enemy mob in a Mario-like game.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
Everything-Machine-Learning/keras-rl-dql-pong
A Keras-RL based DQL agent that can learn to play Pong.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

DracoStriker/unity_gym_wrapper_example
Demo showing how to run an external Python trainer over an ML-Agents Unity environment using a gym wrapper as the communication interface.
Last synced at: about 2 years ago - Stars: 0 - Forks: 0

team4hire-open-source/muliti-agent-particle-env-fork
A simple harbour simulation to test multi-agent reinforcement learning algorithms, based on multi-agent-particle-envs from OpenAI.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

tuyen.ta/lorawan-ml
Machine Learning (ML) algorithm for the resource allocation problems in the Low Power Wide Area Network (LPWAN).
Last synced at: over 2 years ago - Stars: 0 - Forks: 0

dim4gg/AI2RL
Reinforcement learning algorithm implementation project for "Artificial Intelligence" course, La Sapienza, Rome, Italy, 2018
Last synced at: over 2 years ago - Stars: 1 - Forks: 0
maximgaina/reinforcement-learning-atari
Understanding why DeepMind's DQN is not good enough for some Atari 2600 games.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
sylvanix/Q-learning-Intersection-Crossing
Using a Deep Q Network (a kind of Reinforcement Learning agent) to learn a policy to cross a busy intersection
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
danielegrattarola/deep-q-snake
Application of deep Q-Learning to play Snake. Project for the Machine Learning course, A.Y. 2015/2016, Politecnico di Milano
Last synced at: over 2 years ago - Stars: 0 - Forks: 0