Topic: "reinforcement-learning-fine-tuning"
BY571/DistRL-LLM
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
Language: Python - Size: 374 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0
