GitHub topics: llm-exploration
sail-sg/oat
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Language: Python - Size: 2.29 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 338 - Forks: 23
