Topic: "peft-fine-tuning-llm"
peremartra/Large-Language-Model-Notebooks-Course
Practical course about Large Language Models.
Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,572 - Forks: 399

liuqidong07/MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
Language: Python - Size: 10.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 105 - Forks: 11

nbasyl/DoRA 📦
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
Size: 557 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 104 - Forks: 2

TUDB-Labs/MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
Language: Python - Size: 7.18 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 84 - Forks: 11

UCDvision/NOLA
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
Language: Python - Size: 60.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 45 - Forks: 1

ROIM1998/APT
[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
Language: Python - Size: 4.08 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 38 - Forks: 1

dvgodoy/FineTuningLLMs
Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"
Language: Jupyter Notebook - Size: 7.48 MB - Last synced at: 19 days ago - Pushed at: 29 days ago - Stars: 31 - Forks: 7

brown-palm/AntGPT
Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
Language: Python - Size: 28.8 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 21 - Forks: 2

misonsky/HiFT
memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B
Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 20 - Forks: 2

PRITHIVSAKTHIUR/GALLO-3XL
High Quality Image Generation Model - Powered with NVIDIA A100
Language: Python - Size: 11.2 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1

wrmthorne/cycleformers
A Python library for efficient and flexible cycle-consistency training of transformer models via iteratie back-translation. Memory and compute efficient techniques such as PEFT adapter switching allow for 7.5x larger models to be trained on the same hardware.
Language: Python - Size: 2.43 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 0

StarLight1212/LLM-and-Generative-Models-Community
AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2 from scratch, Generative Model Architecture, Content safety and control implementation, Model distillation techniques, Dreambooth techniques, Transfer learning, etc for practice with real project!
Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

sayan112207/Text2SQL
Fine-tune StarCoder2-3b for SQL tasks on limited resources with LORA. LORA reduces model size for faster training on smaller datasets. StarCoder2 is a family of code generation models (3B, 7B, and 15B), trained on 600+ programming languages from The Stack v2 and some natural language text such as Wikipedia, Arxiv, and GitHub issues.
Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 1

yuki-2025/llama3-8b-fine-tuning-math
Fine-tuning Llama3 8b to generate JSON formats for arithmetic questions and process the output to perform calculations.
Language: Python - Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 4 - Forks: 1

himanshuvnm/Foundation-Model-Large-Language-Model-FM-LLM
This repository was commited under the action of executing important tasks on which modern Generative AI concepts are laid on. In particular, we focussed on three coding actions of Large Language Models. Extra and necessary details are given in the README.md file.
Language: Jupyter Notebook - Size: 431 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 2

zeyadusf/Finetuning-LLMs
Finetuning Large Language Models
Size: 28.3 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

zeyadusf/topics-in-nlp-llm
In this repo I will share different topics on anything I want to know in nlp and llms
Size: 51.8 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

aman-17/MediSOAP
FineTuning LLMs on conversational medical dataset.
Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

DongmingShenDS/Mistral_From_Scratch
Mistral and Mixtral (MoE) from scratch
Language: Python - Size: 5.13 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

RETR0-OS/ModelForge
A no-code toolkit to finetune LLMs on your local GPU—just upload data, pick a task, and export to GGUF. Perfect for hackathons or prototyping, with automatic hardware detection and a guided Gradio interface.
Language: JavaScript - Size: 34.2 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

gabe-zhang/paper2summary
LoRA fine-tuning scripts with Llama-3.2-1B-Instruct on scientific paper summarization
Language: Python - Size: 22.5 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

03chrisk/PEFT-T5-on-CNN-dailynews
Fine tuning the T5 model on the CNN daily-news dataset
Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

erdemormann/kanarya-and-trendyol-classification-tests
Test results of Kanarya and Trendyol models with and without fine-tuning techniques on the Turkish tweet hate speech detection dataset.
Language: Jupyter Notebook - Size: 293 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

zeyadusf/Summarization-by-Finetuning-FlanT5-LoRA
Language: Jupyter Notebook - Size: 90.8 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Arya920/Natural_Language_To_SQL_Queries
The task of this project is to Convert Natural Language to SQL Queries
Language: Python - Size: 2.64 MB - Last synced at: 27 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

eshan1347/GPT-NEO-LORA
A GPT-Neo model is fine tuned on a custom dataset using huggingface transformers package
Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

swastikmaiti/Llama-2-7B-Chat-PEFT
PEFT is a wonderful tool that enables training a very large model in a low resource environment. Quantization and PEFT will enable widespread adoption of LLM.
Language: Jupyter Notebook - Size: 123 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

AnishJoshi13/Bash-Scripting-Assistant
A bash scripting assistant that helps you automate tasks. Powered by a streamlit chat interface, A finetuned nl2bash model generates bash code from natural language descriptions provided by the user
Language: Jupyter Notebook - Size: 43.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

shoryasethia/ConversationSummarizerLLM
Fine Tuning pegasus and flan-t5 pre-trained language model on dialogsum datasets for conversation summarization to to optimize context window in RAG-LLMs
Language: Jupyter Notebook - Size: 38.1 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

pankajrawat9075/Dialog-Summarization-with-Generative-AI
Using Open-Source LLMs like FLAN-T5, built a Dialog Summarization model and did fine-tuning with DialogSum HF Dataset
Language: Jupyter Notebook - Size: 78.1 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

MyriamBA/Dialogue-Summarizer
An End-to-End Dialogue Summarization Project using LLM.
Language: Jupyter Notebook - Size: 74.2 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 0 - Forks: 0

RATHOD-SHUBHAM/Finetuning-LLMs
This repository contains experiments on fine-tuning LLMs (Llama, Llama3.1, Gemma). It includes notebooks for model tuning, data preprocessing, and hyperparameter optimization to enhance model performance.
Language: Jupyter Notebook - Size: 5.12 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

matthewapeters/newsies
AI-assisted news aggregator
Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Dhanush-R-git/MH-Analysis
The MHRoberta is Mental Health Roberta model. The pretrained Roberta transformer based model fine-tunned on Mental Health dataset by adopting PEFT method.
Language: Jupyter Notebook - Size: 3.42 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Wb-az/peft_lora_opt_roberta_modernbert_llm
This repository contains code to fine llm with diverse peft techniques with custom datasets.
Language: Jupyter Notebook - Size: 111 KB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

yuxuan-z19/peft-animation
Animations of the PEFT algorithms
Language: HTML - Size: 38.6 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

prateeknigam9/Fine-Tuning-LLM
Fine-tuning LLM for improved task-specific performance.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kleo-Karap/KPA_thesis
Thesis project for the MSc "Language Technology" of the National and Kapodistrian University of Athens (NKUA)
Language: HTML - Size: 9.75 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle
🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom MTL-LoRA framework in PyTorch, enabling efficient multi-task learning for medical NLP! 🏥💡
Language: Jupyter Notebook - Size: 180 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

Zarharan/FarExStance
FarExStance: The first and largest claim-based explainable stance detection dataset on Farsi
Language: Python - Size: 149 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ohmatheus/Kaggle_WSDMCup_Multilingual_Chatbot_Arena
My project for Kaggle's WSDM 2025 cup. Big project where I fuze gemma2-9b and some feature engineering for sequence classification.
Language: Jupyter Notebook - Size: 3.71 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AnanthaPadmanaban-KrishnaKumar/EffiLLaMA
Finetuning LLaMA 3.2-1B-Instruct model using qLoRA and LoRA quantization PEFT methods
Language: Python - Size: 44.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SathvikNayak123/Agentic-RAG
Agentic RAG with Llama-3.1-8b model Fine-tuned on medical conversational dataset
Language: Jupyter Notebook - Size: 215 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Hamid-Nasiri/EDoRA
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
Language: Python - Size: 1.69 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

david-thrower/DoRA-fine-tuning-gemma-2-2b-it
A simple example of fine tuning Gemma 2 2B instruct using DoRA / LoRA
Language: Jupyter Notebook - Size: 103 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

etechoptimist/generative_ai
This repository contains a collection of generative AI models and applications designed for various tasks such as text generation, image synthesis, and style transfer. The models leverage cutting-edge architectures like GPT, GANs, and VAEs, enabling users to explore different generative tasks.
Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

paraglondhe098/sentiment-classification-llm
Implemented and fine-tuned BERT for a custom sequence classification task, leveraging LoRA adapters for efficient parameter updates and 4-bit quantization to optimize performance and resource utilization.
Language: Jupyter Notebook - Size: 6.66 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

1Preusse/exprep_LLM
LLM fine tuning for text classification with DistilBERT and MiniLM.
Language: Jupyter Notebook - Size: 12.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

0xZee/finetune_model_dataset
Fine-Tunning Model llama3.2 on local Dataset using LoRA, QLoRA and PEFT..
Size: 5.86 KB - Last synced at: 25 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ajithvcoder/emlo4-session-09-ajithvcoder
Deploying a Vision model with LitServe and a LLM - llama3.2 model with litserve from The School of AI EMLO-V4 course assignment https://theschoolof.ai/#programs
Language: Python - Size: 1.82 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

afondiel/Finetuning-LLMs-Crash-Course-DLAI
Notes & Resources of LLMs Finetuning Crash Course from LAMINI.AI & DeepLearning.AI.
Language: Jupyter Notebook - Size: 8.41 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

chatterjeesaurabh/Dialogue-Summarization-with-Large-Language-Model
Explored In-Context prompt learning, Full Fine-Tuning, Parameter-Efficient Fine-Tuning (PEFT) with LoRA, and Fine-tune with Reinforcement Learning (PPO) to generate less-toxic summaries.
Language: Jupyter Notebook - Size: 63.5 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

mafda/lightweight_fine_tuning_project
This repository provides a Jupyter notebook demonstrating parameter-efficient fine-tuning (PEFT) with LoRA on Hugging Face models.
Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

satyampurwar/large-language-models
Unlocking the Power of Generative AI: In-Context Learning, Instruction Fine-Tuning and Reinforcement Learning Fine-Tuning.
Language: Jupyter Notebook - Size: 170 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akthammomani/Casual_Conversation_Chatbot
Build a Multi-turn Conversations Chit-Chat Bot
Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SahilBarbade1203/Generative_AI_and_Large_Language_Models Fork of Sdmr12012003/Generative_AI_and_Large_Language_Models
Institute Technical Summer Project -23/24
Language: Jupyter Notebook - Size: 11.7 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

CZboop/Micro-Genre-Generator
Fine-tuning an LLM to generate musical micro-genres
Language: Python - Size: 22.5 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

doguilmak/FineTune-DiaSum-PEFT-LoRA
PEFT and LoRA to fine-tune large language models for dialogue summarization, reducing computational resources for broader application.
Language: Jupyter Notebook - Size: 5.66 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

qiqinyi/GenAI-with-LLMs
My lab work of “Generative AI with Large Language Models” course offered by DeepLearning.AI and Amazon Web Services on coursera.
Language: Jupyter Notebook - Size: 28.9 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

thekaranacharya/llm-fine-tuning
Comparing popular Parameter Efficient Fine-Tuning (PEFT) techniques for Large Language Models
Language: Python - Size: 402 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

kconstable/LLM-fine-tuning
For this project, I fine-tuned two separate models for three tasks: document summarization, dialogue summarization and text classification
Language: Jupyter Notebook - Size: 195 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

kHarshit/llm-projects
LLM projects
Language: Jupyter Notebook - Size: 4.69 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

nikhil-chigali/AdapterBERT
This project is an implementation of the paper: Parameter-Efficient Transfer Learning for NLP, Houlsby [Google], ICML 2019.
Language: Python - Size: 130 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

polarbeargo/GenAIND-Apply-Lightweight-Fine-Tuning-LLMs
Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

babadue/unfathomableAI
Stumble upon a fine tuning that is unfathomable.
Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
