Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / dumpmemory / LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dumpmemory%2FLoL-RL
Fork of abaheti95/LoL-RL
Stars: 0
Forks: 0
Open Issues: 0
License: None
Language: Python
Repo Size: 3.98 MB
Dependencies:
23
Created: 7 months ago
Updated: 7 months ago
Last pushed: about 2 months ago
Last synced: about 1 month ago
Files
Loading...
Readme
Loading...
Dependencies
requirements.txt
pypi
- accelerate ==0.22.0
- bitsandbytes ==0.41.1
- datasets *
- ftfy *
- matplotlib *
- names *
- nltk *
- numba *
- numpy *
- nvidia-ml-py3 *
- openai *
- pandas *
- peft ==0.5.0
- scikit-learn *
- scipy *
- seaborn *
- sentence_transformers *
- sentencepiece *
- torch *
- tqdm *
- transformers ==4.32.1
- trl *
- wandb *