gitlab.com topics: reinforcement learning

Repositories

DracoStriker/pokemon-vgc-engine

The VGC AI Framework aims to emulate the Esports scenario of human video game championships of Pokémon with AI agents, including the game balance aspect.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

DracoStriker/multiplayer-difficulty-balance-rl

An RL framework for multiplayer dynamic difficulty adjustment.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

da_doomer/pylic

Program analysis-based optimization in differentiable simulators.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

moda707/vrpvcsd-of

This project adopts Reinforcement Learning to solve a variant of the stochastic VRP, where customers are highly variable, demands are stochastic, and vehicles have a duration limit.

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

DracoStriker/simplified-pokemon-environment

The Simple Pokémon Environment is an AI environment complient with gym that allows for training Reinforcement Learning agents in a simplified version of Pokémon Battles.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

z00logist/rugpt3-movie-critic

Finetuning several language models in order to generate sentiment-controlled (positive, negative, neutral) movie reviews in Russian

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

MariusLC/internship-ethical-morl

Stage de fin d'étude sur le développement d'un modèle d'apprentissage par renforcement multi-objectifs pour répondre à des problèmes éthiques. Encadrants : Aurélie Beynier, Nicolas Maudet, Paolo Viappiani Date : Mars - Septembre 2022

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

IlyaOrson/BatchReactor

Part of the code experimets related to this publication: https://www.sciencedirect.com/science/article/pii/B9780128186343501545

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

Ch_V/cart_pole

Example of CartPole task solution with random search, noise hill-climbing, Q-net training and policy gradient approaches. Use `play_result.py` to visualize result.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/cliff_walking

Solution of Cliff Walking problem with different variants of Q-learning and SARSA algorithms.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

renke.kuhlmann/reinforcement-learning-for-nonlinear-programming

This project aims is to learn adaptive parameter updates of nonlinear programming solvers by using reinforcement learning techniques.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

webermarci/neural-flappybird

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

CavenaghiEmanuele/reinforcementlearninglibrary

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/frozen_lake

Simple example of FrozenLake task solution with value/strategy iteration, Monte-Carlo methods and with genetic algorithm.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

sdpkjc/abcdrl

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

vg_tum/graph-boc

Source Code for Gabler et al. "Bayesian Optimization with Unknown Constraints in Graphical Skill-Models for Compliant Manipulation Tasks Using an Industrial Robot", Frontiers in Robotics and AI, 2022

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

ComputationalScience/epidemic-control

This project provides optimal control and reinforcement learning implementations to optimize vaccination and testing resource allocation problems on networks.

Last synced at: over 1 year ago - Stars: 0 - Forks: 0

netmode/arrle

RL-driven Interactive Recommender based on SocioEmotional Behavioural Models

Last synced at: 5 months ago - Stars: 3 - Forks: 0

TING2938/rlcpp

Reinforcement Learning Algorithms with C++

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

alex-rep-perso/aigym

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

clengele/egt-presentation-commonpoolresources

This project was an attempt to reproduce the work published by DeepMind for the paper https://arxiv.org/pdf/1707.06600.pdf . This reproduction was part of the course INFO-F409 : Learning Dynamics.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

TING2938/gym_cpp

C++ high-performance gym environment framework

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

omidmohaddesi/pomdp_belief_learning

Imitating the behavior of Gamette players in their supply chain decisions.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

alesniewski123/al_frozenlake

Reinforcement Learning (Artificial Intelligence) Algorithm for solving OpenCV problem

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

Thieso/Easy21

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

Artificiele_Intelligentie/chess

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

team4hire-open-source/thesis_2

Multi-agent implementations of Reinforcement Learning algorithm Proximal Policy Optimization.

Last synced at: over 2 years ago - Stars: 2 - Forks: 0

schalappe/deep-double-q-learning-minigrid

Minigrid is a simulation of the world in the form of a grid. The target is to train an agent to understand textual instructions.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

tomasz.cichy98/minigrid-rl

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

clansfear/ml_agents_car

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

ai-in-game/monsterai

An attempt at developing a learning agent to train an enemy mob in a mario like game

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

anirban.sen/scripts_shivam

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Everything-Machine-Learning/keras-rl-dql-pong

A Keras-RL based DQL agent that can learn to play pong.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

DracoStriker/unity_gym_wrapper_example

Demo showing how to run an external python trainer over a ml-agents unity environment using gym wrapper as communication interface.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

team4hire-open-source/muliti-agent-particle-env-fork

A simple harbour simulation to test multi-agent reinforcement learning algorithms based on multi-agent-particle-envs from OpenAi

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

tuyen.ta/lorawan-ml

Machine Learning (ML) algorithm for the resource allocation problems in the Low Power Wide Area Network (LPWAN).

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

dim4gg/AI2RL

Reinforcement learning algorithm implementation project for "Artificial Intelligence" course, La Sapienza, Rome, Italy, 2018

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

maximgaina/reinforcement-learning-atari

Understanding why DeepMind's DQN is not good enough for some Atari 2600 games.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

sylvanix/Q-learning-Intersection-Crossing

Using a Deep Q Network (a kind of Reinforcement Learning agent) to learn a policy to cross a busy intersection

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

danielegrattarola/deep-q-snake

Application of deep Q-Learning to play Snake. Project for the Machine Learning course, A.Y. 2015/2016, Politecnico di Milano

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords

reinforcement learning 40 AI 6 Q-learning 6 python 6 machine learning 5 ppo 4 deep reinforcement learning 3 pytorch 3 deep learning 3 neural network 3 optimal control 2 Q Learning 2 C++ 2 multi-agent 2 python3 2 data science 2 gym 2 sarsa 2 opencv 1 Monte Carlo 1 function approximation 1 chess 1 proximal policy optimization 1 artificial intelligence 1 POMDP 1 Imitation Learning 1 Cython 1 DQN 1 socio-emotional competences 1 recommendation system 1 optimization 1 transformers 1 snake 1 Deep Q 1 DeepMind 1 Atari 1 mountain car 1 agent 1 Q-function 1 ML 1 lpwan 1 energy consumption 1 LoRaWAN 1 openai 1 environment 1 unity game engine 1 stable baselines 1 headless server 1 Deep Q Learning 1 DQL 1 tensorforce 1 racing 1 Unity 1 Car driving 1 Autonomous agents 1 NLP 1 sentiment classification 1 rugpt 1 rubert 1 dataset 1 GPT-3 1 BERT 1 Pokémon Simulator 1 Compatitive Learning 1 stochastic optimization 1 dynamic programming 1 VRP 1 Markov Decision Process 1 program analysis 1 marble 1 differentiable simulators 1 control 1 game-design 1 MLAgents 1 C# 1 competitive games 1 infectious diseases 1 epidemic-model 1 robotics 1 Gaussian Processes 1 docker 1 value-iteration 1 strategy-iteration 1 genetic algorithm 1 Monte-Carlo methods 1 p5js 1 javascript 1 WORHP 1 Nonlinear Programming 1 random search 1 hill-climbing 1 reinforce 1 dynamical systems 1 neural networks 1 multi-objective optimization 1 Internship Project 1 Generative Adversarial Networks 1