An open API service providing repository metadata for many open-source software ecosystems.

Topic: "reinforcement-learning-fine-tuning"

BY571/DistRL-LLM

Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization

Language: Python - Size: 374 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0