Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pomdp

Repositories

moves-rwth/storm

A Modern Probabilistic Model Checker

Language: C++ - Size: 199 MB - Last synced: about 17 hours ago - Pushed: about 22 hours ago - Stars: 124 - Forks: 71

MarcoMeter/endless-memory-gym

Challenging Memory-based Deep Reinforcement Learning Agents

Language: Python - Size: 6.25 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 60 - Forks: 3

proroklab/popgym

Partially Observable Process Gym

Language: Python - Size: 255 KB - Last synced: 1 day ago - Pushed: about 2 months ago - Stars: 146 - Forks: 8

h2r/pomdp-py

A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/

Language: Python - Size: 6.85 MB - Last synced: 4 days ago - Pushed: 6 days ago - Stars: 197 - Forks: 47

Abdoulaye27/falco-rgb

This repo contains the work done on building the rgb component of our autonomous target detection framework named FALCO at the COHRINT LAB.

Language: Python - Size: 68.4 KB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 1 - Forks: 1

MarcoMeter/episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

Language: Python - Size: 23.9 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 111 - Forks: 12

hai-h-nguyen/equi-rl-for-pomdps

[CoRL2023] Equivariant Reinforcement Learning under Partial Observability

Language: Python - Size: 43.1 MB - Last synced: 19 days ago - Pushed: 6 months ago - Stars: 2 - Forks: 0

laurimi/multiagent-prediction-reward

Multi-agent active perception with prediction rewards

Language: C++ - Size: 312 KB - Last synced: 25 days ago - Pushed: over 3 years ago - Stars: 10 - Forks: 0

chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Size: 784 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 412 - Forks: 91

swarna-kpaul/neoplanner

Sequential planner for large text based environments

Language: Python - Size: 221 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 5 - Forks: 0

hai-h-nguyen/cosil-corl22

[CoRL 22] Code for "Leveraging Fully Observable Policies for Learning under Partial Observability"

Language: Python - Size: 69.3 KB - Last synced: 20 days ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

mKabouri/grid-pouct

Partially Observable Monte Carlo Planning algorithm (POMCP)

Language: Python - Size: 45.9 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

C0PEP0D/otto

a Python package to simulate, solve and visualize the source-tracking POMDP

Language: Python - Size: 109 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 11 - Forks: 3

yujianyuanhaha/DQN-DSA

DQN in Dynamic Channel Access

Language: Python - Size: 7.73 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 2 - Forks: 2

twni2016/Memory-RL

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

Language: Python - Size: 153 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 30 - Forks: 3

A method of collaborative decision making using action suggestions by using the agent's policy to estimate the distribution over suggestions and treating a suggested action as an observation of the environment to update the agent's belief.

Language: Julia - Size: 32.4 MB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 10 - Forks: 0

agentmodels/agentmodels.org

Modeling agents with probabilistic programs

Language: TeX - Size: 16.6 MB - Last synced: 12 days ago - Pushed: over 4 years ago - Stars: 65 - Forks: 17

Abdoulaye27/falco-ir

This repo contains the work done on building the infrared component of our autonomous target detection framework named FALCO at the COHRINT LAB.

Language: Python - Size: 5.5 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

DevSlem/AINE-DRL

AINE-DRL is a deep reinforcement learning (DRL) baseline framework. AINE means "Agent IN Environment".

Language: Python - Size: 39.3 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 7 - Forks: 2

twni2016/pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Language: Python - Size: 6.34 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 234 - Forks: 33

jerrodparker20/adaptive-transformers-in-rl

Adaptive Attention Span for Reinforcement Learning

Language: Python - Size: 5.78 MB - Last synced: 7 months ago - Pushed: about 4 years ago - Stars: 122 - Forks: 14

catohaste/POMDP

Implementing a RL algorithm based upon a partially observable Markov decision process.

Language: MATLAB - Size: 420 KB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 41 - Forks: 21

AdaCompNUS/despot

The DESPOT online POMDP solver

Language: C++ - Size: 5.06 MB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 213 - Forks: 94

mobeets/value-rnn-td

train an RNN to estimate value in a POMDP using TD learning

Language: Python - Size: 324 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 5 - Forks: 1

glambrechts/informed-dreamer Fork of danijar/dreamerv3

Official implementation of the Informed Dreamer algorithm, based on DreamerV3

Language: Python - Size: 19.8 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 2 - Forks: 0

MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

Language: Jupyter Notebook - Size: 17.7 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 82 - Forks: 9

biy001/UAV-autonomous-landing

A UAV autonomously lands on a moving ground platform.

Language: Julia - Size: 4.17 MB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 4 - Forks: 0

glimow/bayesian-pomdp

Partially observable markov decision process solver in python

Language: Python - Size: 482 KB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 0

ppartha03/MACA

Goal Oriented Dialog System

Language: Python - Size: 534 KB - Last synced: 10 months ago - Pushed: about 7 years ago - Stars: 14 - Forks: 4

rvdweerd/simmodel

Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs

Language: Python - Size: 364 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 9 - Forks: 1

YC-Liang/Recurrent-Macro-Action-Generator

Proposing better macro actions set using recurrent neural networks conditioned on encoded environmental contexts

Language: C++ - Size: 342 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

jmuchovej/BoxesWorld.jl

A box-picking POMDP created using POMDPs.jl

Language: Julia - Size: 235 KB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0

HaiyinPiao/pytorch-a2clstm-DRQN

using recurrent networks(LSTM) to solve POMDPs

Language: Python - Size: 12.7 KB - Last synced: 11 months ago - Pushed: over 5 years ago - Stars: 29 - Forks: 2

1391819/MA-seek

A multi agent reinforcement learning environment where two agents controlled by DRQNs play a custom version of the pursuit-evasion game.

Language: Python - Size: 4.75 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 2 - Forks: 0

dityas/Protos

Factored Interactive POMDP solver based on symbolic Perseus.

Language: Java - Size: 217 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 10 - Forks: 1

TGDivy/Language-Evolution

We use reinforcement learning to study how language can be used as a tool for agents to accomplish tasks in their environment, and show that structure in the evolved language emerges naturally through iterated learning, leading to the development of compositional language for describing and generalising about unseen objects.

Language: Python - Size: 429 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

namoshizun/PyPOMDP

Python implementation of POMDP framework and PBVI & POMCP algorithms.

Language: Python - Size: 1.81 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 76 - Forks: 23

laurimi/npgi

Non-linear policy graph improvement - planning for Dec-POMDPs

Language: C++ - Size: 197 KB - Last synced: 25 days ago - Pushed: about 3 years ago - Stars: 15 - Forks: 2

caelan/SS-Replan

Online Replanning in Belief Space for Partially Observable Task and Motion Problems

Language: Python - Size: 9.07 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 48 - Forks: 19

chauvinSimon/Hierarchical-Decision-Making-for-Autonomous-Driving

Rich literature review and discussion on the implementation of "Hierarchical Decision-Making for Autonomous Driving"

Size: 10.4 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 48 - Forks: 13

iciac/POMDP

POMDPX converter and simulator for testing DRL algorithms on classic POMDP problems.

Language: Python - Size: 10.9 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1

AdaCompNUS/sarsop

Efficient Point-Based POMDP Planning by Approximating

Language: C++ - Size: 2.83 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 76 - Forks: 35

d3sm0/gym_pomdp

Gym-like extensions for POMDP

Language: Python - Size: 174 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 45 - Forks: 13

karimn/FundingPOMDPs.jl

POMDP for a programs funder and evaluator

Language: Julia - Size: 557 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

caelan/planning-algorithms

MIT Planning Algorithms Class Implementations

Language: Python - Size: 10.5 MB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 6 - Forks: 3

vincentberaud/Minecraft-Reinforcement-Learning

Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft

Language: Jupyter Notebook - Size: 1.55 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 41 - Forks: 6

ollema/purl

Pathfinding Using Reinforcement Learning

Language: Python - Size: 78.1 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 11 - Forks: 0

martcram/hrc_pomdp

POMDP planning of collaborative robot tasks in assembly

Language: C++ - Size: 164 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

mynkpl1998/Recurrent-Deep-Q-Learning

Solving POMDP using Recurrent networks

Language: Jupyter Notebook - Size: 46.1 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 62 - Forks: 26

yangyou95/POMDP_PBVI_solver

PBVI C++ Implementation for solving POMDPs

Language: C++ - Size: 101 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 20 - Forks: 3

info-structures/ais

This repository contains the code for RL for POMDPs through learning an Approximate Information State.

Language: Python - Size: 2.16 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 14 - Forks: 5

aijunbai/hplanning

Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation

Language: C++ - Size: 27.1 MB - Last synced: about 1 year ago - Pushed: almost 8 years ago - Stars: 11 - Forks: 3

Megha-Bose/Partially-Observable-MDP

Solving Partially Observable MDP (POMDP) problems

Language: HTML - Size: 215 KB - Last synced: 12 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0

boettiger-lab/sarsop

:package: A library for solving POMDPs

Language: C++ - Size: 6.75 MB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 9 - Forks: 8

rmoehn/piglet_pbvi

Implementation of point-based value iteration (for POMDPs)

Language: Python - Size: 33.2 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 12 - Forks: 2

Johny-kann/Path-Planning-in-iRobot-create-using-POMDP

The goal of the project is to make a robot plan its path from a source to the destination and reach the destination only by evidence and its previous transition.

Language: Python - Size: 38.1 KB - Last synced: about 1 year ago - Pushed: about 8 years ago - Stars: 5 - Forks: 1

alexscarlatos/CORPP

An implementation of the CORPP algorithm, described in a paper by Shiqi Zhang and Peter Stone

Language: Python - Size: 147 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 2 - Forks: 2

ScazLab/task-models

Task models for human robot collaboration

Language: Python - Size: 690 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 10 - Forks: 3

shanigu/CostSensitive

This project implements several methods for cost sensitive classification, based on a POMDP formalization, and an MDP formalization of the problem

Language: C# - Size: 44.3 MB - Last synced: 12 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

veeral-agarwal/POMDPs

policy generation using SARSOP, in Machine Learning, MDL | Spring 2021

Language: Python - Size: 5.1 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

kage08/qmdp-net

QMDP Net fork (python 3 compatible)

Language: Python - Size: 9.76 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 2

WhiffleFish/CovidPOMDP.jl

POMDP formulation and solution of the COVID-19 epidemic

Language: Julia - Size: 27.9 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

emuskardin/gridworld-gym

Scallable partially observable and/or non-Markovian gridworld for planning or reinforcement learning

Language: Python - Size: 43.9 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1

yunfanjiang/drl-rws

CS 238 Final Project: Deep Reinforcement Learning Agents that Run with Scissors

Language: Python - Size: 9.6 MB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

JagtapSagar/NAO-humanoid

Trained Probabilistic Models for the NAO Robot in a Labyrinth

Language: Python - Size: 3.6 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

LorenRd/POMDP

Planificación en Entornos con Incertidumbre

Language: Python - Size: 3.61 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 1

boettiger-lab/future-of-fish

:notebook:

Language: R - Size: 30.2 MB - Last synced: 4 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

rmoehn/pomdp2json

Python script to convert Tony Cassandra's POMDP files to JSON

Language: C - Size: 106 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 2 - Forks: 0

AniketBajpai/lift-agent

Effective policy for running a set of elevators formulated using a POMDP

Language: C++ - Size: 345 KB - Last synced: about 1 year ago - Pushed: over 7 years ago - Stars: 0 - Forks: 0

AssistiveRoboticsUNH/asd_pomdp

A POMDP model of an ABA styel social greeting behavioral intervention

Language: C++ - Size: 77.1 KB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 1 - Forks: 1

Related Keywords

pomdp 70 reinforcement-learning 14 mdp 9 deep-reinforcement-learning 8 planning 7 pytorch 7 reinforcement-learning-algorithms 6 ppo 5 dqn 5 ai 5 decision-making 5 transformer 4 python 4 markov-decision-processes 4 deep-learning 4 recurrent-neural-networks 3 planning-algorithms 3 pomdps 3 lstm 3 drqn 3 sarsop 3 rl 3 multi-agent-systems 2 information-gathering 2 bibliography 2 decentralized 2 dec-pomdp 2 gtrxl 2 decision-making-under-uncertainty 2 imitation-learning 2 mcts 2 fisheries 2 decision-theory 2 pathfinding 2 credit-assignment 2 value-iteration 2 probabilistic-programming 2 machine-learning 2 drl 2 gym 2 recurrence 2 gru 2 partially-observable-environment 2 computer-vision 2 nao-robot 2 decision-support-system 2 trxl 2 actor-critic 2 transformer-xl 2 lstm-neural-networks 2 partial-observability 2 on-policy 2 policy-gradient 2 proximal-policy-optimization 2 pomdpx 1 appl 1 approximate 1 self-driving-cars 1 autonomous-driving 1 a2clstm 1 learning 1 reinforcement 1 ea 1 a2c 1 robotics-algorithms 1 funding 1 research-paper 1 macro-actions 1 pytorch-geometric 1 iterated-learning 1 language-evolution 1 probabilistic-planning 1 multiagent-systems 1 multiagent-planning 1 multiagent-reinforcement-learning 1 natural-language-generation 1 student-teacher-learning 1 wordembeddings 1 ipomdp 1 educational 1 belief-space 1 tensorflow2 1 manipulation 1 motion-planning 1 pddl 1 pddlstream 1 marl 1 pybullet 1 robotics 1 stochasticity 1 shopping-requests 1 htm 1 human-robot-interaction 1 clasisfication 1 cost-sensitive-classification 1 machine-learning-algorithms 1 covid-19 1 gridworld 1 non-markovian-rl 1 scalable-gridworld 1