Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
Package Usage: pypi: trl
A Pytorch implementation of Proximal Policy Optimization for transfomer language models.
16 versions
Latest release: 11 months ago
33,573 downloads last month
View more package details: https://packages.ecosyste.ms/registries/pypi.org/packages/trl
Dependent Repos 52
dmmagdal/ScaleLLMs
Look at scaling and exporting LLMs to smaller consumer devices (mobile/web/desktop)- ==0.7.1 QLora-Falcon7B/environment.yml
Size: 2.69 MB - Last synced: 5 months ago - Pushed: 5 months ago
dumpmemory/LoL-RL Fork of abaheti95/LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients- * requirements.txt
Size: 3.98 MB - Last synced: about 2 months ago - Pushed: 2 months ago
Jasonqi146/AMEFT
AMEFT: Aggressive Memory Efficient Fine Tuning- >=0.8.1 LLaMA-Factory/requirements.txt
Size: 17.4 MB - Last synced: 21 days ago - Pushed: 22 days ago
underwoodnoble/llm_codebase
A codebase for llm training- ==0.7.10 requirements/dpo.txt
Size: 80 MB - Last synced: 1 day ago - Pushed: 2 days ago
josem7/GraphGPT-blar
copy of graphGPT for NL2SQL- * requirements.txt
Size: 32.2 MB - Last synced: 24 days ago - Pushed: 3 months ago
tcmaps/autotrain-advanced Fork of huggingface/autotrain-advanced
🤗 AutoTrain Advanced- * requirements.txt
Size: 5.53 MB - Last synced: 10 months ago - Pushed: 10 months ago
geronimi73/3090_shorts
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever- * requirements.txt
Size: 485 KB - Last synced: about 1 month ago - Pushed: about 1 month ago
harpreetmann24/runpod-container
- ==0.8.5 builder/requirements.txt
Size: 59.6 KB - Last synced: about 1 month ago - Pushed: about 1 month ago
WangRongsheng/CareGPT
🌞 CareGPT (关怀GPT)是一个医疗大语言模型,同时它集合了数十个公开可用的医疗微调数据集和开放可用的医疗大语言模型,包含LLM的训练、测评、部署等以促进医疗LLM快速发展。Medical LLM, Open Source Driven for a Healthy Future.- >=0.5.0 requirements.txt
Size: 34 MB - Last synced: 14 days ago - Pushed: 14 days ago
ritikamangla/QSalience
https://arxiv.org/abs/2404.10917- ==0.8.4 code/requirements.txt
Size: 1.03 MB - Last synced: about 12 hours ago - Pushed: 9 days ago
AmourWaltz/FactDial
- * requirements.txt
Size: 1.67 MB - Last synced: about 1 month ago - Pushed: about 1 month ago
SinghJagpreet096/text-sql
- ==0.7.10 requirements.txt
Size: 10.7 MB - Last synced: 12 days ago - Pushed: about 1 month ago
msh2481/DenseGPT
- >=0.7.10 requirements.txt
Size: 34.2 KB - Last synced: 4 months ago - Pushed: 4 months ago
Zuhashaik/HOLD-Z
- ==0.7.4 envs/requirements.txt
Size: 663 KB - Last synced: about 1 month ago - Pushed: 2 months ago
prabodhw96/Fairness-Attention-Regularization
- ==0.0.3 requirements.txt
Size: 820 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago
aakashveera/Finance-Chatbot
- * training_pipeline/requirements.txt
Size: 128 KB - Last synced: 2 months ago - Pushed: 2 months ago
davche163/DB-GPT-Hub Fork of eosphoros-ai/DB-GPT-Hub
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance, especially in Text-to-SQL.- >=0.5.0 requirements.txt
Size: 24.5 MB - Last synced: 7 months ago - Pushed: 7 months ago
rankun203/axolotl
- ==0.8.5 requirements.txt
Size: 0 Bytes - Last synced: about 1 month ago - Pushed: about 1 month ago
hyunwoongko/instruct-tuning-example
Instruct tuning example using Hugging Face Transformers and TRL- ==0.2.1 requirements.txt
Size: 2.93 KB - Last synced: about 1 year ago - Pushed: over 1 year ago
chrisliu298/tapt
Data augmentation by generating new samples- * requirements.txt
Size: 140 MB - Last synced: 10 months ago - Pushed: almost 4 years ago
sungeuns/llm-fine-tuning-sagemaker
- * rlhf-src/requirements.txt
- * stack-dolly-src/requirements.txt
Size: 39.1 KB - Last synced: about 1 year ago - Pushed: about 1 year ago
HongzheBi/DocQA
基于本地知识库检索和 LLM 轻量化微调的问答系统- >=0.4.1 finetune/requirements.txt
Size: 29.3 MB - Last synced: 11 months ago - Pushed: 11 months ago
PyThaiNLP/WangChanGLM
WangChanGLM 🐘 - The Multilingual Instruction-Following Model- ==0.2.2.dev0 script/wandb/run-20230306_093248-cecitdkp/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-194hr5ah/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-ew7d7ux2/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-rqxbol0a/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-s3dy6nic/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-vu2s1ocu/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102757-wlwgu8gt/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102758-6h5lc3p2/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_102758-lfg0iq0c/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103451-6ixdogh9/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103451-8qrks0b2/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103451-9xddknyz/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103451-a7mepyw3/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103451-wn15qv7f/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103452-cvdmdn02/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103452-drqksnme/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103452-lnncgs0d/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-8ivycg16/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-8wygst9y/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-ek2bzqfq/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-glrwienz/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-hmt82qho/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-nocmn2e3/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-tkfb7muz/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103612-v2ne5fpc/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-2hfh7d7k/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-az66139f/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-mwdq0ftw/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-ppohkhut/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-rzd1u84o/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-uq2qm49i/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103715-yiiqi768/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_103716-j5bdj4mi/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-0tg8de7j/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-9h35b2at/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-krzacj07/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-lsa5p812/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-rj8zg7ki/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-s6ybzyog/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112144-vq0189pj/files/requirements.txt
- ==0.2.2.dev0 script/wandb/run-20230306_112145-c1xio8n9/files/requirements.txt
Size: 3.02 MB - Last synced: 22 days ago - Pushed: 6 months ago
X-D-Lab/Sunsimiao
🌿孙思邈中文医疗大模型(Sunsimiao):提供安全、可靠、普惠的中文医疗大模型- >=0.4.4 requirements.txt
Size: 880 KB - Last synced: 2 days ago - Pushed: about 1 month ago
sayakpaul/personal-coding-assistant
Shows how to create a personal coding assistant by fine-tuning StarCoder on a custom code corpus.- * requirements.txt
Size: 143 KB - Last synced: 11 months ago - Pushed: 11 months ago
jimux/ShellShaper
- * requirements.txt
Size: 5.17 MB - Last synced: 29 days ago - Pushed: 2 months ago
shuishen112/rl_transformer
- ==0.0.3 nbs/wandb/run-20220221_152047-2i72vyve/files/requirements.txt
Size: 2.22 MB - Last synced: about 1 year ago - Pushed: about 2 years ago
harrywang/falcon-huggingface
code for https://huggingface.co/blog/falcon- * requirements.txt
Size: 32.2 KB - Last synced: 29 days ago - Pushed: 11 months ago
haoranD/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.67 MB - Last synced: 11 months ago - Pushed: 11 months ago
syncdoth/comet-rl
A WIP project of training COMET model using RL.- * requirements.txt
Size: 12.5 MB - Last synced: 29 days ago - Pushed: 11 months ago
3dot141/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.73 MB - Last synced: 10 months ago - Pushed: 10 months ago
MaxMax2016/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.52 MB - Last synced: 7 months ago - Pushed: 11 months ago
seanzhang-zhichen/ziya-pretrain
用子牙模型来做增量预训练,注入领域知识- * requirements.txt
Size: 0 Bytes - Last synced: 11 months ago - Pushed: 11 months ago
newlxj/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.52 MB - Last synced: 10 months ago - Pushed: 10 months ago
goggeryang/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF with QLoRA)- >=0.4.4 requirements.txt
Size: 35.9 MB - Last synced: 10 months ago - Pushed: 11 months ago
goggeryang/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.79 MB - Last synced: 10 months ago - Pushed: 11 months ago
SavarusAlbert/ChatGLM-LoRA-RLHF-from-trl
- * requirements.txt
Size: 68.4 MB - Last synced: 10 months ago - Pushed: 10 months ago
SwarmKit/ChatGLM-Efficient-Tuning Fork of hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调- >=0.4.4 requirements.txt
Size: 86.1 MB - Last synced: about 1 month ago - Pushed: 10 months ago
codemayq/ChatGLM-Efficient-Tuning Fork of hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调- >=0.4.4 requirements.txt
Size: 86.1 MB - Last synced: 10 months ago - Pushed: 10 months ago
iioSnail/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.52 MB - Last synced: 10 months ago - Pushed: 10 months ago
xusenlinzy/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)- >=0.4.4 requirements.txt
Size: 54.9 MB - Last synced: about 1 month ago - Pushed: 10 months ago
cuican1432/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)- >=0.4.4 requirements.txt
Size: 55.4 MB - Last synced: about 1 month ago - Pushed: 10 months ago
away-star/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)- >=0.4.4 requirements.txt
Size: 55 MB - Last synced: 8 months ago - Pushed: 10 months ago
zouchl/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 3.52 MB - Last synced: about 1 month ago - Pushed: 10 months ago
nowadays0421/ChatGLM-Efficient-Tuning Fork of hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调- >=0.4.4 requirements.txt
Size: 86.1 MB - Last synced: 10 months ago - Pushed: 10 months ago
Yang-HangWA/ChatGLM-Efficient-Tuning Fork of hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调- >=0.4.4 requirements.txt
Size: 185 MB - Last synced: about 1 month ago - Pushed: 10 months ago
statelesshz/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF with QLoRA)- ==0.4.4 requirements.txt
Size: 170 MB - Last synced: 8 months ago - Pushed: 8 months ago
ArtificialZeng/ChatGLM-Efficient-Tuning-New
ChatGLM-Efficient-Tuning-New-explained- >=0.4.7 requirements.txt
Size: 170 MB - Last synced: 29 days ago - Pushed: 10 months ago
FanHengbo/ActiveLLM
- >=0.4.1 requirements.txt
Size: 69.9 MB - Last synced: 29 days ago - Pushed: 10 months ago
Hongze-Wang/ChatGLM-Efficient-Tuning Fork of hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调- >=0.4.7 requirements.txt
Size: 175 MB - Last synced: 10 months ago - Pushed: 10 months ago
Hongze-Wang/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)- >=0.4.4 requirements.txt
Size: 54.8 MB - Last synced: 10 months ago - Pushed: 11 months ago
hydrallm/llama-moe-v1
- * requirements.txt
Size: 29.8 MB - Last synced: 29 days ago - Pushed: 10 months ago
brewswang/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 4.87 MB - Last synced: about 1 month ago - Pushed: 10 months ago
loganamcnichols/autotrain-advanced Fork of huggingface/autotrain-advanced
🤗 AutoTrain Advanced- * requirements.txt
Size: 5.64 MB - Last synced: about 1 month ago - Pushed: 9 months ago
zhengr/MedicalGPT Fork of shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现包括二次预训练、有监督微调、奖励建模、强化学习训练。- * requirements.txt
Size: 11.3 MB - Last synced: about 1 month ago - Pushed: 7 months ago
charlieoneill11/diverse-llm
Improving the diversity of large language models.- ==0.4.7 requirements.txt
Size: 77 MB - Last synced: 8 months ago - Pushed: 8 months ago
alpayariyak/llama-moe-v1 Fork of hydrallm/llama-moe-v1
- ==0.4.7 requirements.txt
Size: 29.8 MB - Last synced: about 1 month ago - Pushed: 10 months ago
hcd233/LLaMA-Efficient-Tuning Fork of hiyouga/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)- >=0.4.7 requirements.txt
Size: 144 MB - Last synced: 10 months ago - Pushed: 10 months ago