Topic: "reinforcement-learning-with-verifiable-rewards"
thuml/RLVR-World
Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934
Language: Python - Size: 13.5 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 45 - Forks: 2

yflyzhang/simpleR1
simpleR1: A Simple Framework for Training R1-like Models
Language: Python - Size: 1020 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 23 - Forks: 2
