An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: offline-rl

Farama-Foundation/Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language: Python - Size: 1.86 GB - Last synced at: about 20 hours ago - Pushed at: 4 days ago - Stars: 382 - Forks: 56

Sea-Snell/Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language: Python - Size: 1.14 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 207 - Forks: 18

hanjuku-kaso/awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

Size: 293 KB - Last synced at: 4 days ago - Pushed at: 12 months ago - Stars: 977 - Forks: 89

opendilab/GenerativeRL

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Language: Python - Size: 8.87 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 129 - Forks: 9

Cryolite/kanachan

A Japanese (Riichi) Mahjong AI Framework

Language: Python - Size: 996 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 308 - Forks: 40

takuseno/d3rlpy

An offline deep reinforcement learning library

Language: Python - Size: 21.3 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 1,445 - Forks: 246

denisyarats/exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

Language: Python - Size: 59.6 KB - Last synced at: about 20 hours ago - Pushed at: over 3 years ago - Stars: 113 - Forks: 9

opendilab/DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python - Size: 292 MB - Last synced at: 18 days ago - Pushed at: 25 days ago - Stars: 3,379 - Forks: 398

liuzuxin/OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch

Language: Python - Size: 1.44 MB - Last synced at: 19 days ago - Pushed at: 8 months ago - Stars: 198 - Forks: 13

hakuhodo-technologies/scope-rl

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

Language: Python - Size: 580 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 122 - Forks: 12

mbreuss/diffusion-literature-for-robotics

Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.

Size: 220 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 803 - Forks: 41

opendilab/DI-engine-docs

DI-engine docs (Chinese and English)

Language: Python - Size: 178 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 297 - Forks: 63

Div99/XQL

Extreme Q-Learning: Max Entropy RL without Entropy

Language: Python - Size: 46.8 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 86 - Forks: 10

takuseno/minerva

An out-of-the-box GUI tool for offline deep reinforcement learning

Language: JavaScript - Size: 4.25 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 100 - Forks: 9

yingchengyang/Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

Size: 2.87 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 386 - Forks: 27

LAMDA-RL/OfflineRL-Lib

Benchmarked implementations of Offline RL Algorithms.

Language: Python - Size: 4.56 MB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 72 - Forks: 7

junming-yang/mopo

Model-based Offline Policy Optimization re-implement all by pytorch

Language: Python - Size: 444 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 8

young-geng/JaxCQL

Conservative Q learning in Jax

Language: Python - Size: 1020 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 53 - Forks: 6

liuzuxin/DSRL

🔥 Datasets and env wrappers for offline safe reinforcement learning

Language: Python - Size: 6.79 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 88 - Forks: 6

MLforHealth/rl_representations

Learning representations for RL in Healthcare under a POMDP assumption

Language: Jupyter Notebook - Size: 65.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 53 - Forks: 12

XanderJC/medkit-learn

The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.

Language: Python - Size: 70.2 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 1

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 95.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 53 - Forks: 18

nissymori/JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

Language: Python - Size: 423 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 35 - Forks: 0

nikisim/ReBRAC_for_robotics_tasks Fork of tinkoff-ai/CORL

Implemenation of CORL for Fetch and Unitree A1 tasks

Language: Python - Size: 95.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

elated-sawyer/RL-in-Federated-Setting

Summarising the research of Offline RL in Federated Setting.

Size: 16.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

acforvs/awac_iql

Offline to Online RL: AWAC & IQL PyTorch Implementation

Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ethanluoyc/td3_bc_jax

Direct port of TD3_BC to JAX using Haiku and optax.

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

samholt/NeuralLaplaceControl

Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner to achieve near-expert policy performance in environments with irregular time intervals and an unknown constant delay.

Language: Python - Size: 203 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3

ebenezergelo/offline-wvf

🧠 Learning World Value Functions without Exploration

Size: 2.14 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nickmarks00/Need-4-Speed

Need 4 Speed, FYP 2023-24 @ Monash.

Language: Python - Size: 44.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

BY571/Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language: Python - Size: 44.3 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 3

holarissun/Prompt-OIRL

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

Language: Python - Size: 186 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 3

DT6A/GSM8K-AI-SubQ

Author's repository for GSM8K-AI-SubQ reasoning dataset

Language: Python - Size: 14.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sql-hkr/offline-rl-research

オフライン強化学習用フレームワーク及びSCQL,SCQL+Dの実装

Language: Python - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

xionghuichen/MAPLE

The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)

Language: Python - Size: 5.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 5

christopher-beckham/coms-are-energy-models

Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model

Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 0

holarissun/Accountable-Offline-RL

Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Language: Python - Size: 74.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

YiqinYang/VEM

Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)

Language: Python - Size: 548 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 4

YiqinYang/Offline-Gamma

Codes accompanying the paper "On the Role of Discount Factor in Offline Reinforcement Learning" (ICML 2022)

Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

AIR-DI/D2C

D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.

Language: Python - Size: 6.78 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

sail-sg/rosmo

Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

Language: Python - Size: 74.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 0

siemens-research/lion

Code to reproduce experiments from "User-Interactive Offline Reinforcement Learning" (ICLR 2023)

Language: Python - Size: 3.32 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

Shanghai-Digital-Brain-Laboratory/BDM-DB1

A large-scale multi-modal pre-trained model

Language: Python - Size: 789 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 7

hari-sikchi/offline_rl

Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.

Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

tinkoff-ai/eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

Language: Jupyter Notebook - Size: 48.3 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 26 - Forks: 3

amazon-science/cdc-batch-rl

Code for Continuous Doubly Constrained Batch Reinforcement Learning, NeurIPS 2021.

Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 1

SaminYeasar/Offline-Reinforcement-Learning-Algorithms

PyTorch Implementation of Offline Reinforcement Learning algorithms

Language: Python - Size: 203 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MLD3/OfflineRL_ModelSelection

[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003

Language: Jupyter Notebook - Size: 608 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 4

alxschwrz/gym_dataset_recorder

Package for recording Transitions in OpenAI Gym Environments.

Language: Python - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Related Keywords
offline-rl 49 reinforcement-learning 37 pytorch 11 imitation-learning 8 deep-learning 8 iql 5 offline-reinforcement-learning 4 deep-reinforcement-learning 4 python 4 jax 4 rl 4 robotics 4 model-based-rl 4 inverse-reinforcement-learning 3 model-based-reinforcement-learning 3 off-policy-evaluation 3 q-learning 3 datasets 3 batch-rl 2 bc 2 safe-rl 2 reinforcement-learning-algorithms 2 pytorch-rl 2 model-free-rl 2 benchmark 2 dataset 2 artificial-intelligence 2 atari 2 mujoco 2 dm-haiku 2 transformers 2 behavioral-cloning 2 diffusion-models 2 machine-learning 2 research 2 implicit-q-learning 2 neural-laplace 1 neural-laplace-control 1 unknown-delays 1 d4rl 1 compositional-rl 1 autonomous-driving 1 raspberry-pi 1 bcq 1 irl 1 large-language-models 1 llm 1 offline-irl 1 prompt-engineering 1 rlaif 1 rlhf 1 dataset-generation 1 reasoning 1 ridesharing 1 temporal-differencing-learning 1 value-iteration 1 flax 1 single-file 1 corl 1 federated-learning 1 awac 1 gym 1 optax 1 td3 1 continuous-time 1 delay-differential-equations 1 delayed-systems 1 irregular-time-intervals 1 model-selection 1 mlhc-2021 1 healthcare-application 1 laplace-transform 1 model-predictive-control 1 batched-control 1 deep-rl 1 interpretable-machine-learning 1 interpretable-reinforcement-learning 1 offline-control 1 multi-modal 1 offlinerl 1 xai 1 decision-model 1 offline 1 doge 1 h2o 1 user-interaction 1 arcade-learning-environment 1 bsuite 1 muzero 1 muzero-unplugged 1 rl-unplugged 1 adaptive-policies 1 cql 1 framework 1 scql 1 scql-plus-d 1 icml-2022 1 icml 1 paper 1 tensorflow 1