reinforcement-learning-with-verifiable-rewards | Topic | Ecosyste.ms: Repos

Topic: "reinforcement-learning-with-verifiable-rewards"

thuml/RLVR-World

Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934

Language: Python - Size: 13.5 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 45 - Forks: 2

yflyzhang/simpleR1

simpleR1: A Simple Framework for Training R1-like Models

Language: Python - Size: 1020 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 23 - Forks: 2

Related Topics

grpo 2 real2sim 1 rlvr 1 robotic-manipulation 1 text-game 1 verl 1 video-generation 1 video-gpt 1 video-prediction 1 web-agent 1 world-model 1 deepseek-r1 1 grpotrainer 1 ppo 1 r1-zero 1 reinforcement-learning 1 trl 1