An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: finetuning

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python - Size: 16.9 MB - Last synced at: about 7 hours ago - Pushed at: about 9 hours ago - Stars: 5,601 - Forks: 394

wang8740/MAP

Documentation at

Language: Python - Size: 6.87 MB - Last synced at: about 13 hours ago - Pushed at: 5 months ago - Stars: 11 - Forks: 3

git-disl/awesome_LLM-harmful-fine-tuning-papers

A survey on harmful fine-tuning attack for large language model

Size: 3.89 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 205 - Forks: 7

Raumberg/myllm

Multi-node distributed LLM training framework

Language: Python - Size: 1.66 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 17 - Forks: 1

Lisa-Baumgaertner/cybdd

Repository containing the code for a finetuning project.

Language: Python - Size: 16.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Language: Python - Size: 54.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 4,607 - Forks: 488

yahya-ben/mplug2-vp-for-nriqa

Parameter-Efficient Adaptation of mPLUG-Owl2 via Pixel-Level Visual Prompts for NR-IQA

Language: Python - Size: 738 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

Language: Jupyter Notebook - Size: 266 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 17,792 - Forks: 2,593

stochasticai/xTuring

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Language: Python - Size: 18.4 MB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 2,660 - Forks: 202

glassesholder/2025_LLM_Study

I want to share how to utilize the latest open-source LLMs.

Language: Jupyter Notebook - Size: 62.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

Armaggheddon/BricksFinder

BricksFinder is your ultimate LEGO sidekick ๐Ÿงฑ๐Ÿ”โ€”a magical tool that lets you search for LEGO minifigures and bricks using text or images. Whether you're hunting for that elusive piece or just geeking out, weโ€™ve got you covered! ๐Ÿš€โœจ

Language: Jupyter Notebook - Size: 59.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

minosvasilias/godot-dodo

Finetuning large language models for GDScript generation.

Language: Python - Size: 8.01 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 554 - Forks: 25

Koratahiu/MLorc

Unofficial implementation of "MLorc: Momentum Low-rank Compression for Large Language Model Adaptation"

Language: Python - Size: 43.9 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

hyeonsangjeon/PDF2LLM-Tuning-Studio

PDF ๋ฌธ์„œ์—์„œ GPU ๊ฐ€์† ์ฒ˜๋ฆฌ๋กœ ๊ณ ํ’ˆ์งˆ ์งˆ์˜์‘๋‹ต(QA) ๋ฐ์ดํ„ฐ๋ฅผ ์ž๋™ ์ƒ์„ฑํ•˜๊ณ  LLM์„ ํšจ์œจ์ ์œผ๋กœ ํŒŒ์ธํŠœ๋‹ํ•˜๋Š” ์†”๋ฃจ์…˜์ž…๋‹ˆ๋‹ค. Unstructured ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์™€ AWS Bedrock Claude๋กœ ๋„๋ฉ”์ธ ํŠนํ™” QA ์Œ์„ ์ƒ์„ฑํ•˜๊ณ , LoRA ๊ธฐ๋ฒ•์œผ๋กœ ๊ฒฝ๋Ÿ‰ ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•ฉ๋‹ˆ๋‹ค.

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 1

helixml/helix

โ™พ๏ธ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.

Language: Go - Size: 59.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 517 - Forks: 57

iiis-ai/TemplateMath

Official implementation of ICLR 2025 DATA-FM paper "Training and Evaluating Language Models with Template-based Data Generation" (https://arxiv.org/abs/2411.18104)

Language: Python - Size: 8.13 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 0

georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Language: Python - Size: 32.7 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 854 - Forks: 101

NVIDIA-NeMo/Automodel

Fine-tune any Hugging Face LLM or VLM on day-0 using PyTorch-native features for GPU-accelerated distributed training with superior performance and memory efficiency.

Language: Python - Size: 4.09 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 56 - Forks: 8

sydverma123/awesome-ai-repositories

A curated list of open source repositories for AI Engineers

Size: 178 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 116 - Forks: 20

eosphoros-ai/Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSLใ€Text2APIใ€Text2Vis and more.

Size: 317 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 3,076 - Forks: 217

kaito-project/aikit

๐Ÿ—๏ธ Fine-tune, build, and deploy open-source LLMs easily!

Language: Go - Size: 4.91 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 469 - Forks: 46

sweefoo/midjourney-prompt-generator

โœจ Generate diverse Midjourney prompts effortlessly with this open-source tool built using Next.js, TypeScript, and Tailwind CSS.

Language: TypeScript - Size: 185 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

shivendrra/Seeker

Research Application based on AI Agentic workflow

Language: Python - Size: 35.8 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

JosefAlbers/Phi-3-Vision-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: about 17 hours ago - Pushed at: 12 months ago - Stars: 273 - Forks: 22

LazyAGI/LazyLLM

Easiest and laziest way for building multi-agent LLMs applications.

Language: Python - Size: 10.9 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2,483 - Forks: 193

KaviduIsura/Web3-AI-Trading-Agent

๐Ÿค– Build autonomous AI trading agents for Solana and Bitcoin, leveraging machine learning for cross-chain trading and automated strategies.

Language: Python - Size: 1.04 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

CogitoNTNU/TutorAI

TutorAI is a RAG system capable of assisting with learning academic subjects and using the curriculum and citing it. The project revolves around building an application that ingests a textbook in most formats and facilitates efficient learning of the course material.

Language: Python - Size: 20.7 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 30 - Forks: 11

Tommaso-Sgroi/VojoLe-LM

DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect.

Language: Python - Size: 514 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Cre4T3Tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.

Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 25 - Forks: 0

microsoft/FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook - Size: 209 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4,199 - Forks: 542

ruimalheiro/training-custom-llama

Llama-style transformer in PyTorch with multi-node DDP. Includes SFT, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.

Language: Python - Size: 1.17 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 14 - Forks: 1

baselinerepo/llm

Building Language Models

Language: CSS - Size: 36.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

adithya-s-k/CompanionLLM

CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion

Language: Jupyter Notebook - Size: 40.1 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 5

ServiceNow/TapeAgents

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Language: Python - Size: 188 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 294 - Forks: 36

learnables/learn2learn

A PyTorch Library for Meta-learning Research

Language: Python - Size: 9.52 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2,825 - Forks: 362

Datalore-ai/datalore-localgen-cli

synthetic dataset generation workflow using local file resources for finetuning llms.

Language: Python - Size: 2.77 MB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 71 - Forks: 7

GURPREETKAURJETHRA/Generative-AI-LLM-Projects

Gen AI Large Language Model Projects

Language: Jupyter Notebook - Size: 23 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 69 - Forks: 22

Pavansomisetty21/Supervised-Fine-Tuning-of-GPT-OSS-20B-on-OpenAI-s-gsm8k-reasoning-with-LoRA

In this we finetune GPT-OSS-20B on OpenAI's gsm8k dataset

Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

bet0x/unsloth-docker

Unsloth Training Environment

Language: Python - Size: 14.6 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

Shaurya-Sethi/transqlate

End-to-end natural language to SQL system: schema-aware model fine-tuning, retrieval-augmented prompting, and production-grade CLI, powered by a custom fine-tuned Phi-4 Mini.

Language: Python - Size: 1.7 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 1

kyegomez/Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Language: Python - Size: 61.5 KB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 113 - Forks: 11

Vini09-cpu/agentin

AI Agents for Technology Services

Size: 1000 Bytes - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

jina-ai/finetuner ๐Ÿ“ฆ

:dart: Task-oriented embedding tuning for BERT, CLIP, etc.

Language: Python - Size: 71.5 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1,507 - Forks: 70

SerdarHelli/TuneCraft

A fun collection of notebooks for finetuning AI models... Share ready-to-run notebooksโ€ฆ tips & tricks for finetuning... Hugging Face Transformers, Unsloth, vLLM, PyTorch / CUDA magic

Language: Jupyter Notebook - Size: 500 KB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

Cobbson12gh/D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Language: Python - Size: 403 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

DavidLabrin/claude_proxy

Deploy a TypeScript proxy on Cloudflare Workers to convert Claude API requests to OpenAI API format. Seamlessly integrate compatible clients. ๐Ÿš€๐Ÿ™

Language: TypeScript - Size: 21.5 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 2

soulking42/web3-ai-trading-agent

Build a Web3 AI trading agent for ETH-USDC on BASE using Uniswap V4. Follow our hands-on guide for deep insights into autonomous trading. ๐Ÿš€๐Ÿ’ป

Language: Python - Size: 1.03 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

DJEZEQUIELOK/agentin

AI Agents for Technology Services

Size: 1000 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

CodeWizardWalter/AI-Studio

AI-Studio ๐Ÿ™ Streamlit toolkit for devs & creators with summarization, README & blog writer, code explainer, commit message and image-prompt generators.

Language: Python - Size: 11.7 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

gloryodeyemi/Product_Review_Class_Label_Prediction

Development and comparison of five NLP models, FastText, BERT, DistilBERT, RoBERTa, and XLNet to classify product reviews as positive or negative, using pre-trained transformer architectures and fine-tuning techniques.

Language: Jupyter Notebook - Size: 439 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

divK12/Industry-Project

Experiments on post-facto methods inspired by Differential Privacy to protect BERT embeddings from inversion attacks while keeping the utility intact. The project explores the tradeoff between privacy and utility .

Size: 260 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

data-prep-kit/data-prep-kit

Open source project for data preparation for GenAI applications

Language: HTML - Size: 224 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 765 - Forks: 212

raghavbali/mastering_llms_workshop

Full Day Workshop on Mastering LLMs

Size: 22.1 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

aakriti1318/GenAI

GenAI Series - RAG, Fine tuning, Agents, Knowledge Graph

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 13 - Forks: 4

MohamedSebaie/Fight_Detection_From_Surveillance_Cameras-PyTorch_Project

Fight Detection From Surveillance Cameras by fine-tuning a PyTorch Pretrained Model

Language: Jupyter Notebook - Size: 208 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 50 - Forks: 13

6Morpheus6/alltalk-tts

[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC. It supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.

Language: JavaScript - Size: 5.43 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 0

chainstacklabs/web3-ai-trading-agent

Build an Autonomous Web3 AI Trading Agent (BASE + Uniswap V4 example)

Language: Python - Size: 1.11 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 23 - Forks: 5

LHRLAB/ChatKBQA

[ACL 2024] Official resources of "ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models".

Language: Python - Size: 18.5 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 316 - Forks: 27

speediedan/finetuning-scheduler

A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.

Language: Python - Size: 2.66 MB - Last synced at: 3 days ago - Pushed at: 19 days ago - Stars: 66 - Forks: 6

microsoft/AzureML-BERT

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

Language: Jupyter Notebook - Size: 314 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 400 - Forks: 125

junxia97/awesome-pretrain-on-molecules

[IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models

Size: 565 KB - Last synced at: about 2 hours ago - Pushed at: about 2 years ago - Stars: 531 - Forks: 59

Azure-Samples/azureai-foundry-finetuning-raft

A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic dataset using UC Berkeley's Gorilla project RAFT method.

Language: Jupyter Notebook - Size: 41.2 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 68 - Forks: 23

dat-adi/llm_synth_boost

Investigating the impact of synthetic data on LLM perplexity via QLoRA finetuning

Language: Python - Size: 16 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

microsoft/Build25-LAB329

Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 2 days ago - Pushed at: 24 days ago - Stars: 27 - Forks: 12

git-cloner/llama-lora-fine-tuning

llama fine-tuning with lora

Language: Python - Size: 109 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 138 - Forks: 15

sparklerz/hivemind-qwen2-0.5b

Internet-scale data-parallel fine-tuning of Qwen2-0.5B-Instruct using Hivemind + TorchTune. Initial peer on public IP; second peers on free GPUs (e.g., Kaggle).

Size: 1.95 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

Lannuela/efficient-domain-tuning

Efficiently fine-tune small language models for financial risk management tasks using QLoRA, LoRA, and AdaLoRA. Explore datasets and experiments. ๐Ÿ™

Size: 13.7 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

kuutsav/llm-toys

Small finetuned LLMs for a diverse set of useful tasks

Language: Python - Size: 72.6 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 128 - Forks: 6

baidubce/bce-qianfan-sdk

Provide best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform. (ๆไพ›ๅคงๆจกๅž‹ๅทฅๅ…ท้“พๆœ€ไฝณๅฎž่ทต๏ผŒไปฅๅŠไผ˜้›…ไธ”ไพฟๆทๅœฐ่ฎฟ้—ฎๅƒๅธ†ๅคงๆจกๅž‹ๅนณๅฐ๏ผ‰

Language: Jupyter Notebook - Size: 75.2 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 372 - Forks: 58

adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

Language: Jupyter Notebook - Size: 96.1 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 1,033 - Forks: 113

paulocoutinhox/mini-llm

Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo) using your own data โ€” built with Python and Transformers. Adapt powerful language models to your domain with ease.

Language: Python - Size: 89.8 KB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 0

sarabesh/Finetuning

Repo to serve as a baseline/guide for performing post training(SFT/RLHF) of modern LLM models, and evaluating them with baseline datasets.

Language: Jupyter Notebook - Size: 25.4 KB - Last synced at: 23 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

Baijiong-Lin/LoRA-Torch

PyTorch Reimplementation of LoRA (featuring with supporting nn.MultiheadAttention in OpenCLIP)

Language: Python - Size: 60.5 KB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 67 - Forks: 7

ayperiKhudaybergenova/bert-distilbert-comparison-WNLI-NER

Comparative analysis of BERT and DistilBERT on WNLI and NER tasks

Language: Python - Size: 106 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

HenryNdubuaku/super-lazy-autograd

Hand-derived memory-efficient super lazy PyTorch VJPs for training LLMs on laptop, all using one op (bundled scaled matmuls).

Language: Python - Size: 1.32 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 73 - Forks: 1

MaxiDonkey/DelphiMistralAI

DelphiMistralAI wrapper brings Mistralโ€™s text-vision-audio models and agentic Conversations to Delphi, with chat, embeddings, Codestral codegen, fine-tuning, batching, moderation, async/await helpers and live request monitoring.

Language: Pascal - Size: 1.76 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 5

codelion/ellora

Enhancing LLMs with LoRA

Language: Jupyter Notebook - Size: 2.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 1

yifanzhang-pro/StackMathQA

StackMathQA: A Curated Collection of 2 Million Mathematical Questions and Answers Sourced from Stack Exchange

Size: 48.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

omerbsezer/Fast-LLM-Agent-MCP

This repo covers LLM, Agents concepts both theoretically and practically: LLMs, RAG, Fine Tuning, Agents, Tools, MCP, AWS Strands Agents, Google Agent Development Kit, ADK, Reference Documents, etc.

Language: Python - Size: 65.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 6

Abeshith/FineTuning_LanguageModels

๐ŸŽฏ Fine-tune large language models and use them for text-related tasks. This repository provides a straightforward approach to fine-tuning models like Gemma, Llama ๐Ÿฆ™, and Mistral ๐ŸŒช๏ธ for various NLP tasks. ๐Ÿ”ง It includes training ๐Ÿ“š, fine-tuning ๐Ÿ› ๏ธ, and inference pipelines โš™๏ธ. ๐Ÿš€

Language: Jupyter Notebook - Size: 454 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

linhlpv/awesome-offline-to-online-RL-papers

A list of Offline to Online RL papers (continually updated)

Size: 16.6 KB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 47 - Forks: 0

natserract/nokia-rag-finetuning

RAG and fine-tuning strategy for Nokia guide PDF using internal dataset

Language: Python - Size: 658 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

ThomasRochefortB/open-agentinstruct

An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation

Language: Python - Size: 372 KB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 19 - Forks: 3

hearmeneigh/dataset-rising

Toolchain for creating custom datasets and training Stable Diffusion (1.x, 2.x, XL) models and LoRAs

Language: Python - Size: 234 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

promptslab/LLMtuner

FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)

Language: Python - Size: 591 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 240 - Forks: 15

Vimalnegi03/GenerativeAI

This resource offers a comprehensive exploration of Generative AI, guiding you from foundational principles through the latest advanced concepts and practical skills. Whether you're a newcomer or aiming for mastery, you'll find curated content to build both theoretical understanding and hands-on expertise.

Language: Python - Size: 7.91 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

LongxingTan/open-retrievals

All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers

Language: Python - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 63 - Forks: 13

acceleratedscience/finetune-controller

Job scheduling api for finetuning ML models on clusters

Language: Python - Size: 211 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

soufiane001/plop

Official code for PLoP

Language: Python - Size: 55.7 KB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 15 - Forks: 4

krish1925/Persona-Chatbot-G28

Fine-tuning GPT-3.5 and Llama3 LLMs for enhanced persona consistency in chatbots using Google's Synthetic Persona Chat dataset

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

sapritanand/Code-Generation-using-LLM

This project extracts Python code from the OpenAI Gym GitHub repository, creates a dataset of functions, and fine-tunes a code generation model (codegen-350M-mono) using Hugging Face Transformers to generate new code snippets.

Language: Jupyter Notebook - Size: 101 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

git-cloner/llama2-lora-fine-tuning

llama2 finetuning with deepspeed and lora

Language: Python - Size: 22.6 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 176 - Forks: 14

veralvx/xtts-gradio Fork of coqui-ai/TTS

Run XTTS within Docker/Podman for voice fine-tuning in a Web UI

Language: Python - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

agiornot/gpu-math

A mini tool that helps estimate the resources needed for training/finetuning/inference with Hugging Face models.

Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

nicolay-r/distil-tuning-llm

Disillation-Tuning implementation for decoder based LM models (Qwen2.5) adapted for text summarization (BioASQ-2025 workshop)

Language: Python - Size: 3.35 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

gruporaia/TTS-AutoTuning

Pipeline para finetuning automรกtico de modelos de Text to Speech.

Language: Python - Size: 2.65 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

woctezuma/finetune-detr

Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.

Language: Jupyter Notebook - Size: 79.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 150 - Forks: 24

SpringPixels/PawNet-Classifier

Paws and pixels: Classifying dogs and cats with deep learning and transfer learning magic.

Language: Jupyter Notebook - Size: 193 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

aalizelau/Clone-Yourself

Clone yourself through WhatsApp chat history and fine tuning model.

Language: Jupyter Notebook - Size: 104 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

zou-group/sirius

SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning

Language: Python - Size: 70.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 5