ecosyste.ms

Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llm-exploration

Repositories

sail-sg/oat

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Language: Python - Size: 2.29 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 338 - Forks: 23

Related Keywords

alignment 1 distributed-rl 1 distributed-training 1 dpo 1 dueling-bandits 1 grpo 1 llm 1 llm-aligment 1 llm-exploration 1 online-alignment 1 online-rl 1 ppo 1 r1-zero 1 reasoning 1 rlhf 1 thompson-sampling 1