An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ppo-algorithm

VachanVY/Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.

Language: Python - Size: 87.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 160 - Forks: 10

unaizaahmedk/Balancing-Inverted-Pendulum-using-RL

Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.

Language: Python - Size: 1.98 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

Gummala-vinaykumar/AWS-DeepRacer-Autonomous-Racing-Model

Developed an AWS DeepRacer model using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.

Size: 21.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

imjeasung/Production-Line-RL-PPO

AI-powered production line optimization using reinforcement learning (PPO).

Language: Python - Size: 139 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

sanatren/Legal-Document-Analyzer

This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.

Language: Python - Size: 82 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mohammadzainabbas/Reinforcement-Learning-CS

💡 Grasp - Pick-and-place with a robotic hand 👨🏻‍💻

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 4

sayeang/AWS-DeepRacer-Autonomous-Racing-Model

Developed-an-AWS-DeepRacer-model-using-Python-&-the-PPO-algorithm,-leveraging-TensorFlow-to-train-&-fine-tune-a-deep-reinforcement-learning-model.-Designed-a-custom-reward-function-&-optimized-hyperparameters-to-improve-policy-learning-&-navigation-performance.-Utilized-AWS-infrastructure-for-scalable-training-&-deployment.

Size: 1.95 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

negarhonarvar/DeepReinforcementLearning

A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments

Language: Python - Size: 6.42 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

rohit123-wq/AWS-DeepRacer-Autonomous-Racing-Model

Developed an AWS DeepRacer model using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.

Language: JavaScript - Size: 329 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Scorpionwarrior/AWS-DeepRacer-Autonomous-Racing-Model

el using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.

Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Paul7513/AWS-DeepRacer-Autonomous-Racing-Model

el using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.

Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

haimanm3/AWS-DeepRacer-Autonomous-Racing-Model

Developed an AWS DeepRacer model using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.

Size: 24.6 MB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

RhizomaticRobin/LibreGrabbie

LibreGrabbe 16-DOF Robot Hand

Language: Roff - Size: 158 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Degik/ClimatePredictor

ClimatePredictor implemented by using Proximal policy optimization (PPO) with ray framework for the FederatedLearning approach

Language: Python - Size: 14.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

nikhilgrad/Reinforcement-Learning-Model-for-Super-Mario

An RL based model using PPO algorithm leveraging OpenAI Gym environment to play the popular Super Mario game.

Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Anca-Mt/TrackmaniaRL-AI

AI agents for Trackmania using the TMRL package. Implemented DDPG, PPO, and used two SAC algorithms (with one or two critics) to train cars to navigate custom-built tracks.

Language: Python - Size: 15.9 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

girishkolli/Lunar-Lander

Training a lunar lander to land using the OpenAI "gym" library and Stable Baselines3 "PPO" reinforcement learning algorithm

Language: Jupyter Notebook - Size: 295 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mafaldaaires/Reinforcement-Learning

Stable Baselines3

Language: Python - Size: 20.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kochlisGit/TraderNet-CRv2

TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets

Language: Jupyter Notebook - Size: 157 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 3

Related Keywords
ppo-algorithm 19 python 11 machine-learning 9 tensorflow 7 reinforcement-learning 7 deep-learning 7 training 6 aws-infrastructure 6 deployment 6 hyperparameter-tuning 6 scalable 6 deepracer 5 aws-deepracer 5 rl 5 aws 5 pytorch 4 reinforcement-learning-algorithms 3 openai-gym 2 ppo 2 proximal-policy-optimization 2 gymnasium 2 dqn 2 deep-reinforcement-learning 2 ddpg-algorithm 2 simulation 2 robots 1 ros2-humble 1 distributed-systems 1 federated-learning 1 noaa-data 1 raylib 1 ai 1 game-ai 1 robotics 1 robothand 1 robot 1 opensource 1 open-source-hardware 1 open-source 1 isaac-lab 1 isaac 1 swimmer 1 softmax-exploration 1 smart-factory 1 trading-bot 1 trading-algorithms 1 tf-agents 1 technical-indicators 1 technical-analysis 1 risk-adjusted-return 1 reward-shaping 1 google-trends-api 1 google-trends 1 explainability 1 double-dqn-algorithm 1 deep-neural-networks 1 ddqn 1 cryptocurrency-trading 1 car-racing-environment 1 a2c-algorithm 1 tmrl-package 1 tmrl 1 sac-algorithm 1 modern-game-ai 1 simpy 1 production-optimization 1 optimization 1 manufacturing 1 industrial-ai 1 digital-twin 1 automation 1 inverted-pendulum 1 sutton-barto-book 1 soft-actor-critic-continuous 1 rl-book 1 reinforcement-learning-an-introduction 1 policy-gradient-with-baseline 1 policy-gradient 1 dqn-pytorch 1 deep-deterministic-policy-gradient 1 artificial-intelligence 1 actor-critic-pytorch 1 actor-critic-algorithm 1 sarsa 1 lunar-lander 1 gymnasium-environment 1 drl-algorithms 1 d3qn 1 cartpole-v1 1 boltzmann-exploration 1 sac 1 ppo-agent 1 physics-engine 1 model-free-rl 1 mamba 1 gym-environment 1 brax 1 transformer 1 huggingface 1 finetuning-transformers 1