Topic: "partially-observable-markov-decision-process"
acforvs/dhc-robust-mapf
Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and benchmarked in out-of-distribution (OOD) scenarios. A new robust training loop to handle communication failures is introduced.
Language: Python - Size: 38 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

ImanRHT/MultiAgentDRL4CollaborativeMEC
Multi-Agent Deep Reinforcement Learning for Collaborative Computation Offloading in Mobile Edge-Computing
Size: 858 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 8 - Forks: 1

aygong/aoi-scheduling-pomdp
Code for the paper "Age-of-Information-based Scheduling in Multiuser Uplinks with Stochastic Arrivals: A POMDP Approach"
Language: MATLAB - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

DongmingShenDS/POMDP-RL
This repository contains Dongming Shen's code and documentation for the research projects conducted at the AIDyS Lab, USC. The project focuses on integrating Reinforcement Learning (RL) to solve partially observable Markov decision processes (POMDP) under finite linear temporal logic (LTL) constraints.
Language: C++ - Size: 15.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

asl-epfl/DecPOMDP_Policy_Evaluation_w-Belief_Sharing
Language: Python - Size: 3.17 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

LEAP-HI-ClimACT/Coastal-Infrastructure-Planning
Climate change-related risk mitigation for infrastructure systems often requires adaptation. A computational framework for optimal decision-making under uncertainty based on dynamically changing conditions observed in time is developed in response.
Language: MATLAB - Size: 4.56 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

pavlosdais/MountainCar-v0
Mastering MountainCar-v0: A Comprehensive Exploration of Reinforcement Learning Algorithms
Language: Jupyter Notebook - Size: 6.06 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

gsmithline/RL_Project
Project exploring Policy Space Response Oracles (PSRO) in a Normative POMDPs
Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rohankalbag/deep-recurrent-q-learning-for-pomdps
Course Project - Advanced Topics in Machine Learning - Autumn Semester 2023 - Indian Institute of Technology Bombay
Language: Jupyter Notebook - Size: 9.93 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1
