An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: renforcement-learning

Aisuko/notebooks

Implementations of different ML tasks on the Kaggle platform with GPUs.

Language: Jupyter Notebook - Size: 160 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 20 - Forks: 3

gzbin365/map

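Algorithms, mathematics, science. This is a web-wide bookmark collection; a memo pad; a to-do list; future skill points; a personal knowledge base; and also a site directory for algorithm engineers. Love life, keep exploring. Have fun : )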

Size: 176 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 57 - Forks: 14

Gepetto/constraints-as-terminations

Integrating Constraints in PPO (using Isaac Gym or Isaac Lab)

Language: Python - Size: 3.09 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 69 - Forks: 10

RainbowC0/JacksCarRental

Dynamic programming solution to the Jack's Car Rental problem, implemented in C

Language: C - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

dimitri009/Deep-Learning-Applications

This repository contains the code and report for the final evaluation of the Deep Learning Applications module. It includes three exercises on Convolutional Neural Networks (CNNs), Reinforcement Learning, and Adversarial Training. Each exercise is designed to showcase different aspects of deep learning techniques and their applications.

Language: Jupyter Notebook - Size: 3.93 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

omarkhaled00/robot-vacuum-cleaner-with-Reinforcement-learning-

In this project I created a virtual environment of a house with obstacles, walls, and dirt that the vacuum cleaner has to clean using the most efficient movements, which was done with reinforcement learning

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mathias-kinninkpo/morabaraba-game

Morabaraba implemented in Python as part of the MIFY Artificial Intelligence Context (MAIC) competition organized by Machine Intelligence For You (MIFY)

Language: Python - Size: 1.36 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

YarickVodila/TinkoffRobotRL

Example project for the Tinkoff Invest Robot Contest #2 competition, using reinforcement learning

Language: Jupyter Notebook - Size: 5.69 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Ruben-2828/RL-Traffic-Control

Application of reinforcement learning to the management of traffic-light intersections

Language: Python - Size: 122 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

Drackass/ChatPY

💬 This repository contains ChatPY, an AI chatbot

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

idthanm/mpg

MPG originates from the paper "Mixed policy gradient"; the repository also contains a collection of high-quality implementations of deep reinforcement learning algorithms.

Language: Python - Size: 13.5 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 16 - Forks: 9

Raffaelbdl/minimalistic-rl

RL algorithms made simple in JAX

Language: Python - Size: 125 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

ludel/AutoTrading

Algorithmic trading (postgraduate dissertation)

Language: Jupyter Notebook - Size: 7.94 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

iRaneem/AI-fundmental-CCAI221

My work from 2021/2022, including lab, assignment, and project solutions

Language: Prolog - Size: 8.53 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

AnuragSharma5893/DeepRace_Model

Trains a reinforcement learning (RL) model and simulates how the model performs on a task, using the PPO (Proximal Policy Optimization) algorithm.

Size: 20.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rishik-J/NLP-RLE

This repository contains a range of Machine Learning projects utilizing Natural Language Processing and reinforcement learning

Language: Jupyter Notebook - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

yashodeepchikte/Machine-Learning

A collection of templates of various machine learning and deep learning algorithms

Language: Python - Size: 353 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

olmoulin/Research

This repository stores the code related to the scientific papers published by Olivier Moulin (VU Amsterdam).

Language: Python - Size: 2.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Pyrofoux/vivarium

Multi-agent simulation environment with an ecologically valid paradigm. Based on SimplePlaygrounds.

Language: Python - Size: 3.22 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Related Keywords
renforcement-learning 19 python 6 machine-learning 5 artificial-intelligence 4 computer-vision 3 ai 3 data-science 2 trading 2 trading-algorithms 2 nlp 2 natural-language-processing 2 machine-learning-algorithms 2 deep-reinforcement-learning 2 deep-learning 2 mdps 1 markov-decision-processes 1 lab 1 minmax-algorithm 1 project 1 hill-climbing-search 1 heuristic-search-algorithms 1 expert-system 1 expectimax 1 csp 1 beam-search 1 assignment 1 alpha-beta-pruning 1 adversarial-search 1 advanced-search 1 sckiit-learn 1 pandas 1 jax 1 dm-haiku 1 simulation 1 ecological-niche-modelling 1 ecological-models 1 ecological-emergency 1 reinforcement-learning-algorithms 1 multi-agent-systems 1 multi-agent-reinforcement-learning 1 xgboost 1 tensorflow2 1 tensorflow 1 recurrent-neural-networks 1 recommender-system 1 model-evaluation 1 keras 1 deep-learning-algorithms 1 clustering 1 classification 1 sentiment-analysis 1 rle 1 natual-language-processing 1 robotics-simulation 1 uniform-cost-search 1 search-algorithm 1 rational-agent 1 prolog-programming-language 1 constraints 1 science 1 rl 1 mathematics 1 linux 1 gpu 1 cv 1 computer-science 1 algorithms 1 wandb 1 visulization 1 transformers 1 tensorboard 1 quantization 1 pytorch 1 peft 1 neural-network 1 multimodal 1 large-language-models 1 kaggle 1 fine-tuning 1 accelerator 1 policy-gradient 1 model-driven 1 data-driven 1 asynchronous-learning 1 supervised-learning 1 sumo-rl 1 sumo 1 stablebaselines3 1 sarsa-learning 1 qlearning 1 dqn 1 trading-strategies 1 game-development 1 q-learning-algorithm 1 pygame 1 residual-neural-network 1 adversarial-learning 1 value-iteration 1 policy-iteration 1 dynamic-programming 1