An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: preference-based-reinforcement-learning

kmk0224/RLStudy

Share RL study material

Size: 7.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

snu-mllab/DPPO

Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)

Language: Python - Size: 26.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 35 - Forks: 1

aleksa-sukovic/iclr2024-reward-design-for-justifiable-rl

Code for the paper "Reward Design for Justifiable Sequential Decision-Making" (ICLR 2024)

Language: Jupyter Notebook - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0