An open API service providing repository metadata for many open source software ecosystems.

Topic: "ppo-agent"

pythonlessons/Reinforcement_Learning

Reinforcement learning tutorials

Language: Python - Size: 87 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 384 - Forks: 155

bitsauce/Carla-ppo

This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.

Language: Python - Size: 1.25 GB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 223 - Forks: 57

davide97l/rl-policies-attacks-defenses

Adversarial attacks on Deep Reinforcement Learning (RL)

Language: Jupyter Notebook - Size: 346 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 72 - Forks: 12

abhilash1910/Deep_Reinforcement_Learning_Trading

Deep Reinforcement Learning for Trading

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 27 - Forks: 6

Atharv24/SnakeGym

Multi agent gym environment based on the classic Snake game with implementations of various reinforcement learning algorithms in pytorch

Language: Python - Size: 2.75 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 13 - Forks: 1

mohammadzainabbas/Reinforcement-Learning-CS

💡 Grasp - Pick-and-place with a robotic hand 👨🏻‍💻

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 4

7enTropy7/Racer_AI

Developed an highly customizable OpenAI gym environment and trained a stable_baselines3 PPO agent. Used the expert agent for Imitation Learning with DAgger

Language: Python - Size: 1.7 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

jookie/jojoBot

Financial trading strategies using deep reinforcement learning (DRL). It offers a frameworks for quantitative finance, enabling practitioners to create, test, and implement investments strategies.

Language: TypeScript - Size: 86.6 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

jfpettit/flare

Modular Reinforcement Learning in PyTorch.

Language: Python - Size: 27.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

c2d08y/LearningBot

A deep reinforcement learning Bot for https://kana.byha.top:444/

Language: Python - Size: 126 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

pranshurastogi29/BTC_mining_fees_optimization_RL

In this project, I have tried to use DeepRL for optimizing the selection of transactions done by the miner to increase the fee when they execute a block on the chain

Language: Jupyter Notebook - Size: 471 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

dschori/Ackerbot

Reinforcement Learning based navigation

Language: Jupyter Notebook - Size: 107 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

QuantDevJayson/robo-credit-underwriter-multi-rl

AI-driven credit underwriting system combining Machine Learning (ML) & Reinforcement Learning (RL) to optimize loan approvals while managing risk: Credit Risk Prediction via Random Forest model; PPO & DQN for dynamic risk control; Custom OpenAI Gym Environment for simulating real-world lending scenarios & FastAPI real-time processing.

Language: Jupyter Notebook - Size: 3.01 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

fracapuano/Quinto

Repository for the final project of the "Computational Intelligence" course @ PoliTo, 2022/2023

Language: Python - Size: 20.1 MB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

JulianCatnip/atst-walker-agent

Concept and development of a walking AT-ST Walker (Starwars) ML-agent.

Language: C# - Size: 20.8 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

RsGoksel/Snake-Game_PPO-Solution

Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy to understand code.

Language: Jupyter Notebook - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

rohmatmret/microk8s-autoscaling

This project integrates MicroK8s (lightweight Kubernetes) with Reinforcement Learning (RL) for adaptive autoscaling in startups, reducing cloud costs by up to 30% compared to traditional solutions (HPA/CA).

Language: Python - Size: 514 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

emanuelegreco29/Toy_Model_RL

Toy model implementing various architectures to teach a generic point of mass to reach a static and/or dynamic target in a 3D space.

Language: Python - Size: 409 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

jookie/jojostock1

An adaptive Machine Reinforcement Learning (MRL) system is being developed to gather and analyze media data using web scraping, training models to predict outcomes in areas like stock market trends, sports events, and other performance domains. It continuously refines its strategies based on real-time data and evolving patterns.

Language: HTML - Size: 793 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 1

master-moose/RLTrader

Personal project - attempting to train an RL model to trade crypto/other markets

Language: Python - Size: 192 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

su1phurd/UAV-WRF-LES-PPO-LSTM

无人机自主溯源甲烷羽流系统/Autonomous UAV Methane Plume Tracing System

Language: Python - Size: 14.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

andrewimpellitteri/floor-cleaner-agent

An RL agent that finds an optimal policy of cleaning dirt off a floor with a power washer.

Language: Python - Size: 28.2 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Dmoore628/Autonomous_Reinforcement_Learning_Trader

Autonomous Reinforcement Learning Trader (ARLT) is a high-performance, PPO-based reinforcement learning system designed to consistently achieve 10x profit growth within a single trading day, using RL to ensure high-frequency execution, precision trading, and adaptive risk management.

Language: Python - Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

AnantVerma-58/Flappy-Bird

Flappy Bird using Reinforcement Learning

Language: Jupyter Notebook - Size: 316 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

roeey777/Splendor-AI

AI agents for the boardgame Splendor

Language: Python - Size: 103 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

PetropoulakisPanagiotis/igae

State Representations as Incentives for Reinforcement Learning Agents: A Sim2Real Analysis on Robotic Grasping

Language: Python - Size: 45.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

strcoder4007/Mario-Reinforcement-Learning

Training a Mario reinforcement learning agent using Open AI Gym and Stable Baselines 3 PPO algorithm.

Language: Python - Size: 2.16 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ellkrauze/gc-ml

Performance Tuning using Reinforcement Learning

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ImSOLty/On-The-Waves

🚤🏖️BOATS DO VZHHHHH BBBDROOM, BEEEEP, BEEEP, GNAA, HONK, VZHHHHHHHHHHHHHH🏖️🚤

Language: C# - Size: 346 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

navneet1083/textsum-tune

This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

GerTheMessiah/Snake-AI

Short own implementation of the game snake. In this project I'am using the ray library together with ray tune and a custom PPO model.

Language: Python - Size: 11.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

00Utkarsh00/ML-DOOM

Automated gaming using machine learning

Language: Jupyter Notebook - Size: 336 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

harikris001/Super-Mario-Reinforcement_Learning

Reinforcement Learning in Super Mario using Pytorch and PPO

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

iamvigneshwars/ai-walkers-ppo-pytorch

AI agent learns to walk, run, hop and crawl with out any given data using proximal policy optimisation.

Language: Python - Size: 152 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning 23 ppo 13 deep-reinforcement-learning 9 pytorch 9 python 9 deep-learning 6 ppo-pytorch 6 a2c 5 gym-environment 5 reinforcement-learning-agent 5 machine-learning 4 stable-baselines3 4 neural-network 3 dqn-agents 3 sac 3 unity 3 gym 3 rl 2 ml-agents 2 stable-baselines 2 imitation-learning 2 artificial-intelligence 2 lstm 2 reinforcement-learning-environments 2 cryptocurrency 2 actor-critic 2 snake-game 2 ppo2 2 proximal-policy-optimization 2 d3qn 2 ddqn 2 a2c-agent 2 dqn 2 dueling-dqn 2 reinforcement-learning-algorithms 2 tensorflow2 1 actor-critic-methods 1 recurrent-neural-networks 1 recurrent-ppo 1 a3c-agent 1 splendor 1 torch 1 board-game 1 bitcoin-wallet 1 drl-algorithms 1 options-trading 1 state-representation 1 state-randomization 1 sharpe-ratio 1 vae 1 sharpe-ratios 1 stock-market 1 algorithmic-trading 1 time-series 1 trading 1 soft 1 trading-strategies 1 sarimax 1 trpo 1 prophet-model 1 double-dqn 1 ddpg 1 ai 1 custom-gym-environment 1 genetic-algorithms 1 gymnasium 1 maskable-ppo 1 cnn-lstm-models 1 ppo-gru 1 ppo-lstm 1 arima-model 1 ppo-self-attention 1 data-science 1 dynamic-programming 1 mining 1 optimization 1 pytorch-rl 1 a3c 1 actor-critic-algorythm 1 bipedalwalker 1 lunarlander 1 policy-gradient 1 ai-driven-chatbot 1 credit-risk 1 cvar-optimization 1 deep-q-learning 1 fastapi 1 risk-underwriting 1 robotics-simulation 1 streamlit-webapp 1 synthetic-data 1 stable-baseline3 1 toy-models 1 futures-market 1 gymnasium-environment 1 optimization-algorithms 1 visualization-library 1 mario 1 openai-gym 1 atmosphere 1