An open API service providing repository metadata for many open source software ecosystems.

gitlab.com topics: reinforcement learning

DracoStriker/pokemon-vgc-engine

The VGC AI Framework aims to emulate the Esports scenario of human video game championships of Pokémon with AI agents, including the game balance aspect.

Last synced at: almost 2 years ago - Stars: 0 - Forks: 0

DracoStriker/multiplayer-difficulty-balance-rl

An RL framework for multiplayer dynamic difficulty adjustment.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

da_doomer/pylic

Program analysis-based optimization in differentiable simulators.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

moda707/vrpvcsd-of

This project adopts Reinforcement Learning to solve a variant of the stochastic VRP, where customers are highly variable, demands are stochastic, and vehicles have a duration limit.

Last synced at: about 2 years ago - Stars: 1 - Forks: 0

DracoStriker/simplified-pokemon-environment

The Simple Pokémon Environment is an AI environment complient with gym that allows for training Reinforcement Learning agents in a simplified version of Pokémon Battles.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

z00logist/rugpt3-movie-critic

Finetuning several language models in order to generate sentiment-controlled (positive, negative, neutral) movie reviews in Russian

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

MariusLC/internship-ethical-morl

Stage de fin d'étude sur le développement d'un modèle d'apprentissage par renforcement multi-objectifs pour répondre à des problèmes éthiques. Encadrants : Aurélie Beynier, Nicolas Maudet, Paolo Viappiani Date : Mars - Septembre 2022

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

IlyaOrson/BatchReactor

Part of the code experimets related to this publication: https://www.sciencedirect.com/science/article/pii/B9780128186343501545

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

Ch_V/cart_pole

Example of CartPole task solution with random search, noise hill-climbing, Q-net training and policy gradient approaches. Use `play_result.py` to visualize result.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/cliff_walking

Solution of Cliff Walking problem with different variants of Q-learning and SARSA algorithms.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

renke.kuhlmann/reinforcement-learning-for-nonlinear-programming

This project aims is to learn adaptive parameter updates of nonlinear programming solvers by using reinforcement learning techniques.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

webermarci/neural-flappybird

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

CavenaghiEmanuele/reinforcementlearninglibrary

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Ch_V/frozen_lake

Simple example of FrozenLake task solution with value/strategy iteration, Monte-Carlo methods and with genetic algorithm.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

sdpkjc/abcdrl

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

vg_tum/graph-boc

Source Code for Gabler et al. "Bayesian Optimization with Unknown Constraints in Graphical Skill-Models for Compliant Manipulation Tasks Using an Industrial Robot", Frontiers in Robotics and AI, 2022

Last synced at: almost 2 years ago - Stars: 0 - Forks: 0

ComputationalScience/epidemic-control

This project provides optimal control and reinforcement learning implementations to optimize vaccination and testing resource allocation problems on networks.

Last synced at: about 1 year ago - Stars: 0 - Forks: 0

netmode/arrle

RL-driven Interactive Recommender based on SocioEmotional Behavioural Models

Last synced at: 4 months ago - Stars: 3 - Forks: 0

TING2938/rlcpp

Reinforcement Learning Algorithms with C++

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

alex-rep-perso/aigym

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

clengele/egt-presentation-commonpoolresources

This project was an attempt to reproduce the work published by DeepMind for the paper https://arxiv.org/pdf/1707.06600.pdf . This reproduction was part of the course INFO-F409 : Learning Dynamics.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

TING2938/gym_cpp

C++ high-performance gym environment framework

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

omidmohaddesi/pomdp_belief_learning

Imitating the behavior of Gamette players in their supply chain decisions.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

alesniewski123/al_frozenlake

Reinforcement Learning (Artificial Intelligence) Algorithm for solving OpenCV problem

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

Thieso/Easy21

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Artificiele_Intelligentie/chess

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

team4hire-open-source/thesis_2

Multi-agent implementations of Reinforcement Learning algorithm Proximal Policy Optimization.

Last synced at: over 2 years ago - Stars: 2 - Forks: 0

schalappe/deep-double-q-learning-minigrid

Minigrid is a simulation of the world in the form of a grid. The target is to train an agent to understand textual instructions.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

tomasz.cichy98/minigrid-rl

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

clansfear/ml_agents_car

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

ai-in-game/monsterai

An attempt at developing a learning agent to train an enemy mob in a mario like game

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

anirban.sen/scripts_shivam

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

Everything-Machine-Learning/keras-rl-dql-pong

A Keras-RL based DQL agent that can learn to play pong.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

DracoStriker/unity_gym_wrapper_example

Demo showing how to run an external python trainer over a ml-agents unity environment using gym wrapper as communication interface.

Last synced at: about 2 years ago - Stars: 0 - Forks: 0

team4hire-open-source/muliti-agent-particle-env-fork

A simple harbour simulation to test multi-agent reinforcement learning algorithms based on multi-agent-particle-envs from OpenAi

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

tuyen.ta/lorawan-ml

Machine Learning (ML) algorithm for the resource allocation problems in the Low Power Wide Area Network (LPWAN).

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

dim4gg/AI2RL

Reinforcement learning algorithm implementation project for "Artificial Intelligence" course, La Sapienza, Rome, Italy, 2018

Last synced at: over 2 years ago - Stars: 1 - Forks: 0

maximgaina/reinforcement-learning-atari

Understanding why DeepMind's DQN is not good enough for some Atari 2600 games.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

sylvanix/Q-learning-Intersection-Crossing

Using a Deep Q Network (a kind of Reinforcement Learning agent) to learn a policy to cross a busy intersection

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

danielegrattarola/deep-q-snake

Application of deep Q-Learning to play Snake. Project for the Machine Learning course, A.Y. 2015/2016, Politecnico di Milano

Last synced at: over 2 years ago - Stars: 0 - Forks: 0