GitHub topics: offline-rl
Farama-Foundation/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Language: Python - Size: 1.86 GB - Last synced at: about 20 hours ago - Pushed at: 4 days ago - Stars: 382 - Forks: 56

Sea-Snell/Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
Language: Python - Size: 1.14 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 207 - Forks: 18

hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
Size: 293 KB - Last synced at: 4 days ago - Pushed at: 12 months ago - Stars: 977 - Forks: 89

opendilab/GenerativeRL
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Language: Python - Size: 8.87 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 129 - Forks: 9

Cryolite/kanachan
A Japanese (Riichi) Mahjong AI Framework
Language: Python - Size: 996 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 308 - Forks: 40

takuseno/d3rlpy
An offline deep reinforcement learning library
Language: Python - Size: 21.3 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 1,445 - Forks: 246

denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
Language: Python - Size: 59.6 KB - Last synced at: about 20 hours ago - Pushed at: over 3 years ago - Stars: 113 - Forks: 9

opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language: Python - Size: 292 MB - Last synced at: 18 days ago - Pushed at: 25 days ago - Stars: 3,379 - Forks: 398

liuzuxin/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
Language: Python - Size: 1.44 MB - Last synced at: 19 days ago - Pushed at: 8 months ago - Stars: 198 - Forks: 13

hakuhodo-technologies/scope-rl
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
Language: Python - Size: 580 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 122 - Forks: 12

mbreuss/diffusion-literature-for-robotics
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
Size: 220 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 803 - Forks: 41

opendilab/DI-engine-docs
DI-engine docs (Chinese and English)
Language: Python - Size: 178 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 297 - Forks: 63

Div99/XQL
Extreme Q-Learning: Max Entropy RL without Entropy
Language: Python - Size: 46.8 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 86 - Forks: 10

takuseno/minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
Language: JavaScript - Size: 4.25 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 100 - Forks: 9

yingchengyang/Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
Size: 2.87 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 386 - Forks: 27

LAMDA-RL/OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
Language: Python - Size: 4.56 MB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 72 - Forks: 7

junming-yang/mopo
A full PyTorch re-implementation of Model-based Offline Policy Optimization (MOPO)
Language: Python - Size: 444 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 8

young-geng/JaxCQL
Conservative Q-learning in JAX
Language: Python - Size: 1020 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 53 - Forks: 6

liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
Language: Python - Size: 6.79 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 88 - Forks: 6

MLforHealth/rl_representations
Learning representations for RL in Healthcare under a POMDP assumption
Language: Jupyter Notebook - Size: 65.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 53 - Forks: 12

XanderJC/medkit-learn
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.
Language: Python - Size: 70.2 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 1

callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

nissymori/JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
Language: Python - Size: 423 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 35 - Forks: 0

nikisim/ReBRAC_for_robotics_tasks (fork of tinkoff-ai/CORL)
Implementation of CORL for Fetch and Unitree A1 tasks
Language: Python - Size: 95.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

elated-sawyer/RL-in-Federated-Setting
Summarising the research of Offline RL in Federated Setting.
Size: 16.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

acforvs/awac_iql
Offline to Online RL: AWAC & IQL PyTorch Implementation
Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ethanluoyc/td3_bc_jax
Direct port of TD3_BC to JAX using Haiku and optax.
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

samholt/NeuralLaplaceControl
Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner to achieve near-expert policy performance in environments with irregular time intervals and an unknown constant delay.
Language: Python - Size: 203 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3

ebenezergelo/offline-wvf
🧠 Learning World Value Functions without Exploration
Size: 2.14 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nickmarks00/Need-4-Speed
Need 4 Speed, FYP 2023-24 @ Monash.
Language: Python - Size: 44.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

BY571/Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
Language: Python - Size: 44.3 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 3

holarissun/Prompt-OIRL
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
Language: Python - Size: 186 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 3

DT6A/GSM8K-AI-SubQ
Author's repository for GSM8K-AI-SubQ reasoning dataset
Language: Python - Size: 14.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sql-hkr/offline-rl-research
A framework for offline reinforcement learning, with implementations of SCQL and SCQL+D
Language: Python - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

xionghuichen/MAPLE
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
Language: Python - Size: 5.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 5

christopher-beckham/coms-are-energy-models
Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model
Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 0

holarissun/Accountable-Offline-RL
Code for the NeurIPS 2023 paper "Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples"
Language: Python - Size: 74.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

YiqinYang/VEM
Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)
Language: Python - Size: 548 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 4

YiqinYang/Offline-Gamma
Code accompanying the paper "On the Role of Discount Factor in Offline Reinforcement Learning" (ICML 2022)
Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

AIR-DI/D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
Language: Python - Size: 6.78 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

sail-sg/rosmo
Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Language: Python - Size: 74.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 0

siemens-research/lion
Code to reproduce experiments from "User-Interactive Offline Reinforcement Learning" (ICLR 2023)
Language: Python - Size: 3.32 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

Shanghai-Digital-Brain-Laboratory/BDM-DB1
A large-scale multi-modal pre-trained model
Language: Python - Size: 789 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 7

hari-sikchi/offline_rl
PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.
Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

tinkoff-ai/eop
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
Language: Jupyter Notebook - Size: 48.3 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 26 - Forks: 3

amazon-science/cdc-batch-rl
Code for Continuous Doubly Constrained Batch Reinforcement Learning, NeurIPS 2021.
Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 1

SaminYeasar/Offline-Reinforcement-Learning-Algorithms
PyTorch Implementation of Offline Reinforcement Learning algorithms
Language: Python - Size: 203 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MLD3/OfflineRL_ModelSelection
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
Language: Jupyter Notebook - Size: 608 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 4

alxschwrz/gym_dataset_recorder
Package for recording Transitions in OpenAI Gym Environments.
Language: Python - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0
