GitHub topics: finetuning
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Language: Python - Size: 16.9 MB - Last synced at: about 7 hours ago - Pushed at: about 9 hours ago - Stars: 5,601 - Forks: 394

wang8740/MAP
Documentation at
Language: Python - Size: 6.87 MB - Last synced at: about 13 hours ago - Pushed at: 5 months ago - Stars: 11 - Forks: 3

git-disl/awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attack for large language model
Size: 3.89 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 205 - Forks: 7

Raumberg/myllm
Multi-node distributed LLM training framework
Language: Python - Size: 1.66 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 17 - Forks: 1

Lisa-Baumgaertner/cybdd
Repository containing the code for a finetuning project.
Language: Python - Size: 16.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Language: Python - Size: 54.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 4,607 - Forks: 488

yahya-ben/mplug2-vp-for-nriqa
Parameter-Efficient Adaptation of mPLUG-Owl2 via Pixel-Level Visual Prompts for NR-IQA
Language: Python - Size: 738 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
Language: Jupyter Notebook - Size: 266 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 17,792 - Forks: 2,593

stochasticai/xTuring
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
Language: Python - Size: 18.4 MB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 2,660 - Forks: 202

glassesholder/2025_LLM_Study
I want to share how to utilize the latest open-source LLMs.
Language: Jupyter Notebook - Size: 62.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

Armaggheddon/BricksFinder
BricksFinder is your ultimate LEGO sidekick ๐งฑ๐โa magical tool that lets you search for LEGO minifigures and bricks using text or images. Whether you're hunting for that elusive piece or just geeking out, weโve got you covered! ๐โจ
Language: Jupyter Notebook - Size: 59.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

minosvasilias/godot-dodo
Finetuning large language models for GDScript generation.
Language: Python - Size: 8.01 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 554 - Forks: 25

Koratahiu/MLorc
Unofficial implementation of "MLorc: Momentum Low-rank Compression for Large Language Model Adaptation"
Language: Python - Size: 43.9 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

hyeonsangjeon/PDF2LLM-Tuning-Studio
PDF ๋ฌธ์์์ GPU ๊ฐ์ ์ฒ๋ฆฌ๋ก ๊ณ ํ์ง ์ง์์๋ต(QA) ๋ฐ์ดํฐ๋ฅผ ์๋ ์์ฑํ๊ณ LLM์ ํจ์จ์ ์ผ๋ก ํ์ธํ๋ํ๋ ์๋ฃจ์ ์ ๋๋ค. Unstructured ๋ผ์ด๋ธ๋ฌ๋ฆฌ์ AWS Bedrock Claude๋ก ๋๋ฉ์ธ ํนํ QA ์์ ์์ฑํ๊ณ , LoRA ๊ธฐ๋ฒ์ผ๋ก ๊ฒฝ๋ ๋ชจ๋ธ์ ํ๋ จํฉ๋๋ค.
Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 1

helixml/helix
โพ๏ธ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.
Language: Go - Size: 59.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 517 - Forks: 57

iiis-ai/TemplateMath
Official implementation of ICLR 2025 DATA-FM paper "Training and Evaluating Language Models with Template-based Data Generation" (https://arxiv.org/abs/2411.18104)
Language: Python - Size: 8.13 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 0

georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Language: Python - Size: 32.7 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 854 - Forks: 101

NVIDIA-NeMo/Automodel
Fine-tune any Hugging Face LLM or VLM on day-0 using PyTorch-native features for GPU-accelerated distributed training with superior performance and memory efficiency.
Language: Python - Size: 4.09 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 56 - Forks: 8

sydverma123/awesome-ai-repositories
A curated list of open source repositories for AI Engineers
Size: 178 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 116 - Forks: 20

eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSLใText2APIใText2Vis and more.
Size: 317 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3,076 - Forks: 217

kaito-project/aikit
๐๏ธ Fine-tune, build, and deploy open-source LLMs easily!
Language: Go - Size: 4.91 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 469 - Forks: 46

sweefoo/midjourney-prompt-generator
โจ Generate diverse Midjourney prompts effortlessly with this open-source tool built using Next.js, TypeScript, and Tailwind CSS.
Language: TypeScript - Size: 185 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

shivendrra/Seeker
Research Application based on AI Agentic workflow
Language: Python - Size: 35.8 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

JosefAlbers/Phi-3-Vision-MLX
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: about 17 hours ago - Pushed at: 12 months ago - Stars: 273 - Forks: 22

LazyAGI/LazyLLM
Easiest and laziest way for building multi-agent LLMs applications.
Language: Python - Size: 10.9 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2,483 - Forks: 193

KaviduIsura/Web3-AI-Trading-Agent
๐ค Build autonomous AI trading agents for Solana and Bitcoin, leveraging machine learning for cross-chain trading and automated strategies.
Language: Python - Size: 1.04 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

CogitoNTNU/TutorAI
TutorAI is a RAG system capable of assisting with learning academic subjects and using the curriculum and citing it. The project revolves around building an application that ingests a textbook in most formats and facilitates efficient learning of the course material.
Language: Python - Size: 20.7 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 30 - Forks: 11

Tommaso-Sgroi/VojoLe-LM
DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect.
Language: Python - Size: 514 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Cre4T3Tiv3/unsloth-llama3-alpaca-lora
Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.
Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 25 - Forks: 0

microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Language: Jupyter Notebook - Size: 209 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4,199 - Forks: 542

ruimalheiro/training-custom-llama
Llama-style transformer in PyTorch with multi-node DDP. Includes SFT, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.
Language: Python - Size: 1.17 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 14 - Forks: 1

baselinerepo/llm
Building Language Models
Language: CSS - Size: 36.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

adithya-s-k/CompanionLLM
CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion
Language: Jupyter Notebook - Size: 40.1 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 5

ServiceNow/TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
Language: Python - Size: 188 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 294 - Forks: 36

learnables/learn2learn
A PyTorch Library for Meta-learning Research
Language: Python - Size: 9.52 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2,825 - Forks: 362

Datalore-ai/datalore-localgen-cli
synthetic dataset generation workflow using local file resources for finetuning llms.
Language: Python - Size: 2.77 MB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 71 - Forks: 7

GURPREETKAURJETHRA/Generative-AI-LLM-Projects
Gen AI Large Language Model Projects
Language: Jupyter Notebook - Size: 23 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 69 - Forks: 22

Pavansomisetty21/Supervised-Fine-Tuning-of-GPT-OSS-20B-on-OpenAI-s-gsm8k-reasoning-with-LoRA
In this we finetune GPT-OSS-20B on OpenAI's gsm8k dataset
Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

bet0x/unsloth-docker
Unsloth Training Environment
Language: Python - Size: 14.6 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

Shaurya-Sethi/transqlate
End-to-end natural language to SQL system: schema-aware model fine-tuning, retrieval-augmented prompting, and production-grade CLI, powered by a custom fine-tuned Phi-4 Mini.
Language: Python - Size: 1.7 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 1

kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
Language: Python - Size: 61.5 KB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 113 - Forks: 11

Vini09-cpu/agentin
AI Agents for Technology Services
Size: 1000 Bytes - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

jina-ai/finetuner ๐ฆ
:dart: Task-oriented embedding tuning for BERT, CLIP, etc.
Language: Python - Size: 71.5 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1,507 - Forks: 70

SerdarHelli/TuneCraft
A fun collection of notebooks for finetuning AI models... Share ready-to-run notebooksโฆ tips & tricks for finetuning... Hugging Face Transformers, Unsloth, vLLM, PyTorch / CUDA magic
Language: Jupyter Notebook - Size: 500 KB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

Cobbson12gh/D-FINE
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
Language: Python - Size: 403 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

DavidLabrin/claude_proxy
Deploy a TypeScript proxy on Cloudflare Workers to convert Claude API requests to OpenAI API format. Seamlessly integrate compatible clients. ๐๐
Language: TypeScript - Size: 21.5 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 2

soulking42/web3-ai-trading-agent
Build a Web3 AI trading agent for ETH-USDC on BASE using Uniswap V4. Follow our hands-on guide for deep insights into autonomous trading. ๐๐ป
Language: Python - Size: 1.03 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

DJEZEQUIELOK/agentin
AI Agents for Technology Services
Size: 1000 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

CodeWizardWalter/AI-Studio
AI-Studio ๐ Streamlit toolkit for devs & creators with summarization, README & blog writer, code explainer, commit message and image-prompt generators.
Language: Python - Size: 11.7 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

gloryodeyemi/Product_Review_Class_Label_Prediction
Development and comparison of five NLP models, FastText, BERT, DistilBERT, RoBERTa, and XLNet to classify product reviews as positive or negative, using pre-trained transformer architectures and fine-tuning techniques.
Language: Jupyter Notebook - Size: 439 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

divK12/Industry-Project
Experiments on post-facto methods inspired by Differential Privacy to protect BERT embeddings from inversion attacks while keeping the utility intact. The project explores the tradeoff between privacy and utility .
Size: 260 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
Language: HTML - Size: 224 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 765 - Forks: 212

raghavbali/mastering_llms_workshop
Full Day Workshop on Mastering LLMs
Size: 22.1 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

aakriti1318/GenAI
GenAI Series - RAG, Fine tuning, Agents, Knowledge Graph
Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 13 - Forks: 4

MohamedSebaie/Fight_Detection_From_Surveillance_Cameras-PyTorch_Project
Fight Detection From Surveillance Cameras by fine-tuning a PyTorch Pretrained Model
Language: Jupyter Notebook - Size: 208 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 50 - Forks: 13

6Morpheus6/alltalk-tts
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC. It supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Language: JavaScript - Size: 5.43 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 0

chainstacklabs/web3-ai-trading-agent
Build an Autonomous Web3 AI Trading Agent (BASE + Uniswap V4 example)
Language: Python - Size: 1.11 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 23 - Forks: 5

LHRLAB/ChatKBQA
[ACL 2024] Official resources of "ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models".
Language: Python - Size: 18.5 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 316 - Forks: 27

speediedan/finetuning-scheduler
A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.
Language: Python - Size: 2.66 MB - Last synced at: 3 days ago - Pushed at: 19 days ago - Stars: 66 - Forks: 6

microsoft/AzureML-BERT
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Language: Jupyter Notebook - Size: 314 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 400 - Forks: 125

junxia97/awesome-pretrain-on-molecules
[IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models
Size: 565 KB - Last synced at: about 2 hours ago - Pushed at: about 2 years ago - Stars: 531 - Forks: 59

Azure-Samples/azureai-foundry-finetuning-raft
A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic dataset using UC Berkeley's Gorilla project RAFT method.
Language: Jupyter Notebook - Size: 41.2 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 68 - Forks: 23

dat-adi/llm_synth_boost
Investigating the impact of synthetic data on LLM perplexity via QLoRA finetuning
Language: Python - Size: 16 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

microsoft/Build25-LAB329
Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 2 days ago - Pushed at: 24 days ago - Stars: 27 - Forks: 12

git-cloner/llama-lora-fine-tuning
llama fine-tuning with lora
Language: Python - Size: 109 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 138 - Forks: 15

sparklerz/hivemind-qwen2-0.5b
Internet-scale data-parallel fine-tuning of Qwen2-0.5B-Instruct using Hivemind + TorchTune. Initial peer on public IP; second peers on free GPUs (e.g., Kaggle).
Size: 1.95 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

Lannuela/efficient-domain-tuning
Efficiently fine-tune small language models for financial risk management tasks using QLoRA, LoRA, and AdaLoRA. Explore datasets and experiments. ๐
Size: 13.7 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

kuutsav/llm-toys
Small finetuned LLMs for a diverse set of useful tasks
Language: Python - Size: 72.6 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 128 - Forks: 6

baidubce/bce-qianfan-sdk
Provide best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform. (ๆไพๅคงๆจกๅๅทฅๅ ท้พๆไฝณๅฎ่ทต๏ผไปฅๅไผ้ ไธไพฟๆทๅฐ่ฎฟ้ฎๅๅธๅคงๆจกๅๅนณๅฐ๏ผ
Language: Jupyter Notebook - Size: 75.2 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 372 - Forks: 58

adithya-s-k/AI-Engineering.academy
Mastering Applied AI, One Concept at a Time
Language: Jupyter Notebook - Size: 96.1 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 1,033 - Forks: 113

paulocoutinhox/mini-llm
Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo) using your own data โ built with Python and Transformers. Adapt powerful language models to your domain with ease.
Language: Python - Size: 89.8 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 0

sarabesh/Finetuning
Repo to serve as a baseline/guide for performing post training(SFT/RLHF) of modern LLM models, and evaluating them with baseline datasets.
Language: Jupyter Notebook - Size: 25.4 KB - Last synced at: 23 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

Baijiong-Lin/LoRA-Torch
PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention in OpenCLIP)
Language: Python - Size: 60.5 KB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 67 - Forks: 7

ayperiKhudaybergenova/bert-distilbert-comparison-WNLI-NER
Comparative analysis of BERT and DistilBERT on WNLI and NER tasks
Language: Python - Size: 106 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

HenryNdubuaku/super-lazy-autograd
Hand-derived memory-efficient super lazy PyTorch VJPs for training LLMs on laptop, all using one op (bundled scaled matmuls).
Language: Python - Size: 1.32 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 73 - Forks: 1

MaxiDonkey/DelphiMistralAI
DelphiMistralAI wrapper brings Mistralโs text-vision-audio models and agentic Conversations to Delphi, with chat, embeddings, Codestral codegen, fine-tuning, batching, moderation, async/await helpers and live request monitoring.
Language: Pascal - Size: 1.76 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 5

codelion/ellora
Enhancing LLMs with LoRA
Language: Jupyter Notebook - Size: 2.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 1

yifanzhang-pro/StackMathQA
StackMathQA: A Curated Collection of 2 Million Mathematical Questions and Answers Sourced from Stack Exchange
Size: 48.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

omerbsezer/Fast-LLM-Agent-MCP
This repo covers LLM, Agents concepts both theoretically and practically: LLMs, RAG, Fine Tuning, Agents, Tools, MCP, AWS Strands Agents, Google Agent Development Kit, ADK, Reference Documents, etc.
Language: Python - Size: 65.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 6

Abeshith/FineTuning_LanguageModels
๐ฏ Fine-tune large language models and use them for text-related tasks. This repository provides a straightforward approach to fine-tuning models like Gemma, Llama ๐ฆ, and Mistral ๐ช๏ธ for various NLP tasks. ๐ง It includes training ๐, fine-tuning ๐ ๏ธ, and inference pipelines โ๏ธ. ๐
Language: Jupyter Notebook - Size: 454 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

linhlpv/awesome-offline-to-online-RL-papers
A list of Offline to Online RL papers (continually updated)
Size: 16.6 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 47 - Forks: 0

natserract/nokia-rag-finetuning
RAG and fine-tuning strategy for Nokia guide PDF using internal dataset
Language: Python - Size: 658 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

ThomasRochefortB/open-agentinstruct
An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation
Language: Python - Size: 372 KB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 19 - Forks: 3

hearmeneigh/dataset-rising
Toolchain for creating custom datasets and training Stable Diffusion (1.x, 2.x, XL) models and LoRAs
Language: Python - Size: 234 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

promptslab/LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
Language: Python - Size: 591 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 240 - Forks: 15

Vimalnegi03/GenerativeAI
This resource offers a comprehensive exploration of Generative AI, guiding you from foundational principles through the latest advanced concepts and practical skills. Whether you're a newcomer or aiming for mastery, you'll find curated content to build both theoretical understanding and hands-on expertise.
Language: Python - Size: 7.91 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

LongxingTan/open-retrievals
All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers
Language: Python - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 63 - Forks: 13

acceleratedscience/finetune-controller
Job scheduling api for finetuning ML models on clusters
Language: Python - Size: 211 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

soufiane001/plop
Official code for PLoP
Language: Python - Size: 55.7 KB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 15 - Forks: 4

krish1925/Persona-Chatbot-G28
Fine-tuning GPT-3.5 and Llama3 LLMs for enhanced persona consistency in chatbots using Google's Synthetic Persona Chat dataset
Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

sapritanand/Code-Generation-using-LLM
This project extracts Python code from the OpenAI Gym GitHub repository, creates a dataset of functions, and fine-tunes a code generation model (codegen-350M-mono) using Hugging Face Transformers to generate new code snippets.
Language: Jupyter Notebook - Size: 101 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

git-cloner/llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
Language: Python - Size: 22.6 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 176 - Forks: 14

veralvx/xtts-gradio Fork of coqui-ai/TTS
Run XTTS within Docker/Podman for voice fine-tuning in a Web UI
Language: Python - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

agiornot/gpu-math
A mini tool that helps estimate the resources needed for training/finetuning/inference with Hugging Face models.
Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

nicolay-r/distil-tuning-llm
Disillation-Tuning implementation for decoder based LM models (Qwen2.5) adapted for text summarization (BioASQ-2025 workshop)
Language: Python - Size: 3.35 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gruporaia/TTS-AutoTuning
Pipeline para finetuning automรกtico de modelos de Text to Speech.
Language: Python - Size: 2.65 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

woctezuma/finetune-detr
Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.
Language: Jupyter Notebook - Size: 79.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 150 - Forks: 24

SpringPixels/PawNet-Classifier
Paws and pixels: Classifying dogs and cats with deep learning and transfer learning magic.
Language: Jupyter Notebook - Size: 193 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

aalizelau/Clone-Yourself
Clone yourself through WhatsApp chat history and fine tuning model.
Language: Jupyter Notebook - Size: 104 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

zou-group/sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
Language: Python - Size: 70.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 5
