GitHub topics: preference-based-reinforcement-learning
kmk0224/RLStudy
Share RL study material
Size: 7.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

snu-mllab/DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
Language: Python - Size: 26.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 35 - Forks: 1

aleksa-sukovic/iclr2024-reward-design-for-justifiable-rl
Code for the paper "Reward Design for Justifiable Sequential Decision-Making" (ICLR 2024)
Language: Jupyter Notebook - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
