reinforcement-learning-fine-tuning | Topic

Topic: "reinforcement-learning-fine-tuning"

Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization

Language: Python - Size: 374 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0