An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: offline-rl

Farama-Foundation/Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language: Python - Size: 1.86 GB - Last synced at: 3 days ago - Pushed at: 17 days ago - Stars: 429 - Forks: 62

takuseno/d3rlpy

An offline deep reinforcement learning library

Language: Python - Size: 21.3 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 1,539 - Forks: 258

hakuhodo-technologies/scope-rl

SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection

Language: Python - Size: 580 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 128 - Forks: 12

opendilab/DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python - Size: 293 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 3,513 - Forks: 409

opendilab/GenerativeRL

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

Language: Python - Size: 8.87 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 149 - Forks: 12

hanjuku-kaso/awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

Size: 293 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1,016 - Forks: 93

callmespring/RL-short-course

Reinforcement Learning Short Course

Language: Jupyter Notebook - Size: 138 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 76 - Forks: 19

takuseno/minerva

An out-of-the-box GUI tool for offline deep reinforcement learning

Language: JavaScript - Size: 4.25 MB - Last synced at: 19 days ago - Pushed at: over 4 years ago - Stars: 100 - Forks: 11

Binny-Shukla/OFFLINE-RL

A repo covering most offline RL algorithms

Size: 2.16 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 0

sql-hkr/offline-rl-research

A framework for offline reinforcement learning

Language: Python - Size: 57.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

liuzuxin/DSRL

🔥 Datasets and env wrappers for offline safe reinforcement learning

Language: Python - Size: 6.79 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 98 - Forks: 6

igopalakrishna/sbsim (fork of google/sbsim)

Google Open-Source Project: Stochastic building simulator and real-world dataset for training and benchmarking reinforcement learning agents in energy-efficient smart control environments. Built with Gym, TensorFlow Agents, and 6+ years of real building data.

Size: 6.89 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

liuzuxin/OSRL

🤖 Elegant implementations of offline safe RL algorithms in PyTorch

Language: Python - Size: 1.44 MB - Last synced at: 19 days ago - Pushed at: 12 months ago - Stars: 207 - Forks: 13

LAMDA-RL/OfflineRL-Lib

Benchmarked implementations of Offline RL Algorithms.

Language: Python - Size: 4.56 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 75 - Forks: 7

junming-yang/mopo

Model-based Offline Policy Optimization, re-implemented entirely in PyTorch

Language: Python - Size: 444 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 33 - Forks: 8

opendilab/DI-engine-docs

DI-engine docs (Chinese and English)

Language: Python - Size: 178 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 299 - Forks: 64

young-geng/JaxCQL

Conservative Q-learning in JAX

Language: Python - Size: 1020 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 6

Sea-Snell/Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language: Python - Size: 1.14 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 207 - Forks: 18

sail-sg/rosmo

Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023

Language: Python - Size: 66.4 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 0

Cryolite/kanachan

A Japanese (Riichi) Mahjong AI Framework

Language: Python - Size: 996 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 308 - Forks: 40

denisyarats/exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

Language: Python - Size: 59.6 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 113 - Forks: 9

mbreuss/diffusion-literature-for-robotics

Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.

Size: 220 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 803 - Forks: 41

Div99/XQL

Extreme Q-Learning: Max Entropy RL without Entropy

Language: Python - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 86 - Forks: 10

yingchengyang/Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

Size: 2.87 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 386 - Forks: 27

MLforHealth/rl_representations

Learning representations for RL in Healthcare under a POMDP assumption

Language: Jupyter Notebook - Size: 65.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 53 - Forks: 12

XanderJC/medkit-learn

The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.

Language: Python - Size: 70.2 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 1

nissymori/JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

Language: Python - Size: 423 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 0

nikisim/ReBRAC_for_robotics_tasks (fork of tinkoff-ai/CORL)

Implementation of CORL for Fetch and Unitree A1 tasks

Language: Python - Size: 95.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elated-sawyer/RL-in-Federated-Setting

Summarising the research of Offline RL in Federated Setting.

Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

acforvs/awac_iql

Offline to Online RL: AWAC & IQL PyTorch Implementation

Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ethanluoyc/td3_bc_jax

Direct port of TD3_BC to JAX using Haiku and optax.

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

samholt/NeuralLaplaceControl

Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner to achieve near-expert policy performance in environments with irregular time intervals and an unknown constant delay.

Language: Python - Size: 203 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 3

ebenezergelo/offline-wvf

🧠 Learning World Value Functions without Exploration

Size: 2.14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

chumb3rs/Need-4-Speed

Need 4 Speed, FYP 2023-24 @ Monash.

Language: Python - Size: 44.6 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BY571/Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Language: Python - Size: 44.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 3

holarissun/Prompt-OIRL

Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"

Language: Python - Size: 186 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 3

DT6A/GSM8K-AI-SubQ

Author's repository for GSM8K-AI-SubQ reasoning dataset

Language: Python - Size: 14.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

xionghuichen/MAPLE

The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)

Language: Python - Size: 5.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 5

christopher-beckham/coms-are-energy-models

Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model

Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 0

holarissun/Accountable-Offline-RL

Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Language: Python - Size: 74.8 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

YiqinYang/VEM

Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022, https://arxiv.org/abs/2110.09796)

Language: Python - Size: 548 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 4

YiqinYang/Offline-Gamma

Code accompanying the paper "On the Role of Discount Factor in Offline Reinforcement Learning" (ICML 2022)

Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

AIR-DI/D2C

D2C (Data-driven Control Library) is a library for data-driven control based on reinforcement learning.

Language: Python - Size: 6.78 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

siemens-research/lion

Code to reproduce experiments from "User-Interactive Offline Reinforcement Learning" (ICLR 2023)

Language: Python - Size: 3.32 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Shanghai-Digital-Brain-Laboratory/BDM-DB1

A large-scale multi-modal pre-trained model

Language: Python - Size: 789 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 7

hari-sikchi/offline_rl

PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.

Language: Python - Size: 123 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 2

tinkoff-ai/eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

Language: Jupyter Notebook - Size: 48.3 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 3

amazon-science/cdc-batch-rl

Code for Continuous Doubly Constrained Batch Reinforcement Learning, NeurIPS 2021.

Language: Python - Size: 25.4 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 1

SaminYeasar/Offline-Reinforcement-Learning-Algorithms

PyTorch Implementation of Offline Reinforcement Learning algorithms

Language: Python - Size: 203 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

MLD3/OfflineRL_ModelSelection

[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003

Language: Jupyter Notebook - Size: 608 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 4

alxschwrz/gym_dataset_recorder

Package for recording transitions in OpenAI Gym environments.

Language: Python - Size: 14.6 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Related Keywords
offline-rl (51), reinforcement-learning (39), pytorch (11), imitation-learning (8), deep-learning (8), iql (5), python (5), rl (4), model-based-rl (4), robotics (4), jax (4), deep-reinforcement-learning (4), offline-reinforcement-learning (4), dataset (3), q-learning (3), model-based-reinforcement-learning (3), inverse-reinforcement-learning (3), datasets (3), off-policy-evaluation (3), research (2), transformers (2), safe-rl (2), dm-haiku (2), machine-learning (2), implicit-q-learning (2), diffusion-models (2), model-free-rl (2), artificial-intelligence (2), bc (2), reinforcement-learning-algorithms (2), benchmark (2), pytorch-rl (2), batch-rl (2), mujoco (2), atari (2), behavioral-cloning (2), continuous-time (1), td3 (1), optax (1), gymnasium (1), delay-differential-equations (1), delayed-systems (1), gym (1), irregular-time-intervals (1), laplace-transform (1), model-predictive-control (1), neural-laplace (1), neural-laplace-control (1), unknown-delays (1), batched-control (1), generalization-reinforcement-learning (1), iclr23 (1), iclr24 (1), icml22 (1), icml23 (1), icml24 (1), meta-reinforcement-learning (1), neurips22 (1), neurips23 (1), neurips24 (1), reinforcement-learning-papers (1), unsupervised-reinforcement-learning (1), healthcare (1), representation-learning (1), sequential-decision-making-problems (1), clinical-data (1), medicine (1), synthetic-data (1), flax (1), single-file (1), corl (1), federated-learning (1), awac (1), deep-rl (1), mlhc-2021 (1), interpretable-machine-learning (1), interpretable-reinforcement-learning (1), offline-control (1), offlinerl (1), xai (1), offline (1), healthcare-application (1), doge (1), h2o (1), adaptive-policies (1), user-interaction (1), decision-model (1), multi-modal (1), multi-task (1), offline-reinforcement (1), d4rl (1), evaluation (1), bcq (1), icml (1), icml-2022 (1), compositional-rl (1), autonomous-driving (1), raspberry-pi (1), irl (1), large-language-models (1)