GitHub topics: behavior-regularization
thu-ml/SRPO
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
Language: Python - Size: 592 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 2
