peft-fine-tuning-llm | Topic | Ecosyste.ms: Repos

Topic: "peft-fine-tuning-llm"

peremartra/Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,572 - Forks: 399

liuqidong07/MOELoRA-peft

[SIGIR'24] The official implementation code of MOELoRA.

Language: Python - Size: 10.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 105 - Forks: 11

nbasyl/DoRA 📦

Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"

Size: 557 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 104 - Forks: 2

TUDB-Labs/MoE-PEFT

An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT

Language: Python - Size: 7.18 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 84 - Forks: 11

UCDvision/NOLA

Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"

Language: Python - Size: 60.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 45 - Forks: 1

ROIM1998/APT

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

Language: Python - Size: 4.08 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 38 - Forks: 1

dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Language: Jupyter Notebook - Size: 7.48 MB - Last synced at: 19 days ago - Pushed at: 29 days ago - Stars: 31 - Forks: 7

brown-palm/AntGPT

Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Language: Python - Size: 28.8 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 21 - Forks: 2

misonsky/HiFT

memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B

Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 20 - Forks: 2

PRITHIVSAKTHIUR/GALLO-3XL

High Quality Image Generation Model - Powered with NVIDIA A100

Language: Python - Size: 11.2 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1

A Python library for efficient and flexible cycle-consistency training of transformer models via iteratie back-translation. Memory and compute efficient techniques such as PEFT adapter switching allow for 7.5x larger models to be trained on the same hardware.

Language: Python - Size: 2.43 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 0

StarLight1212/LLM-and-Generative-Models-Community

AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2 from scratch, Generative Model Architecture, Content safety and control implementation, Model distillation techniques, Dreambooth techniques, Transfer learning, etc for practice with real project!

Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

sayan112207/Text2SQL

Fine-tune StarCoder2-3b for SQL tasks on limited resources with LORA. LORA reduces model size for faster training on smaller datasets. StarCoder2 is a family of code generation models (3B, 7B, and 15B), trained on 600+ programming languages from The Stack v2 and some natural language text such as Wikipedia, Arxiv, and GitHub issues.

Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 1

yuki-2025/llama3-8b-fine-tuning-math

Fine-tuning Llama3 8b to generate JSON formats for arithmetic questions and process the output to perform calculations.

Language: Python - Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 4 - Forks: 1

himanshuvnm/Foundation-Model-Large-Language-Model-FM-LLM

This repository was commited under the action of executing important tasks on which modern Generative AI concepts are laid on. In particular, we focussed on three coding actions of Large Language Models. Extra and necessary details are given in the README.md file.

Language: Jupyter Notebook - Size: 431 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 2

zeyadusf/Finetuning-LLMs

Finetuning Large Language Models

Size: 28.3 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

zeyadusf/topics-in-nlp-llm

In this repo I will share different topics on anything I want to know in nlp and llms

Size: 51.8 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

aman-17/MediSOAP

FineTuning LLMs on conversational medical dataset.

Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

DongmingShenDS/Mistral_From_Scratch

Mistral and Mixtral (MoE) from scratch

Language: Python - Size: 5.13 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

RETR0-OS/ModelForge

A no-code toolkit to finetune LLMs on your local GPU—just upload data, pick a task, and export to GGUF. Perfect for hackathons or prototyping, with automatic hardware detection and a guided Gradio interface.

Language: JavaScript - Size: 34.2 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

gabe-zhang/paper2summary

LoRA fine-tuning scripts with Llama-3.2-1B-Instruct on scientific paper summarization

Language: Python - Size: 22.5 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

03chrisk/PEFT-T5-on-CNN-dailynews

Fine tuning the T5 model on the CNN daily-news dataset

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

erdemormann/kanarya-and-trendyol-classification-tests

Test results of Kanarya and Trendyol models with and without fine-tuning techniques on the Turkish tweet hate speech detection dataset.

Language: Jupyter Notebook - Size: 293 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

zeyadusf/Summarization-by-Finetuning-FlanT5-LoRA

Language: Jupyter Notebook - Size: 90.8 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Arya920/Natural_Language_To_SQL_Queries

The task of this project is to Convert Natural Language to SQL Queries

Language: Python - Size: 2.64 MB - Last synced at: 27 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

eshan1347/GPT-NEO-LORA

A GPT-Neo model is fine tuned on a custom dataset using huggingface transformers package

Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

swastikmaiti/Llama-2-7B-Chat-PEFT

PEFT is a wonderful tool that enables training a very large model in a low resource environment. Quantization and PEFT will enable widespread adoption of LLM.

Language: Jupyter Notebook - Size: 123 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

AnishJoshi13/Bash-Scripting-Assistant

A bash scripting assistant that helps you automate tasks. Powered by a streamlit chat interface, A finetuned nl2bash model generates bash code from natural language descriptions provided by the user

Language: Jupyter Notebook - Size: 43.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

shoryasethia/ConversationSummarizerLLM

Fine Tuning pegasus and flan-t5 pre-trained language model on dialogsum datasets for conversation summarization to to optimize context window in RAG-LLMs

Language: Jupyter Notebook - Size: 38.1 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

pankajrawat9075/Dialog-Summarization-with-Generative-AI

Using Open-Source LLMs like FLAN-T5, built a Dialog Summarization model and did fine-tuning with DialogSum HF Dataset

Language: Jupyter Notebook - Size: 78.1 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

MyriamBA/Dialogue-Summarizer

An End-to-End Dialogue Summarization Project using LLM.

Language: Jupyter Notebook - Size: 74.2 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 0 - Forks: 0

RATHOD-SHUBHAM/Finetuning-LLMs

This repository contains experiments on fine-tuning LLMs (Llama, Llama3.1, Gemma). It includes notebooks for model tuning, data preprocessing, and hyperparameter optimization to enhance model performance.

Language: Jupyter Notebook - Size: 5.12 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

matthewapeters/newsies

AI-assisted news aggregator

Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Dhanush-R-git/MH-Analysis

The MHRoberta is Mental Health Roberta model. The pretrained Roberta transformer based model fine-tunned on Mental Health dataset by adopting PEFT method.

Language: Jupyter Notebook - Size: 3.42 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Wb-az/peft_lora_opt_roberta_modernbert_llm

This repository contains code to fine llm with diverse peft techniques with custom datasets.

Language: Jupyter Notebook - Size: 111 KB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

yuxuan-z19/peft-animation

Animations of the PEFT algorithms

Language: HTML - Size: 38.6 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

prateeknigam9/Fine-Tuning-LLM

Fine-tuning LLM for improved task-specific performance.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kleo-Karap/KPA_thesis

Thesis project for the MSc "Language Technology" of the National and Kapodistrian University of Athens (NKUA)

Language: HTML - Size: 9.75 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nikisetti01/MTL-LORA-for-PubMedQA-and-Riddle

🚀 Fine-tuning LLaMA 1B for a medical chatbot using LoRA and a custom MTL-LoRA framework in PyTorch, enabling efficient multi-task learning for medical NLP! 🏥💡

Language: Jupyter Notebook - Size: 180 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

Zarharan/FarExStance

FarExStance: The first and largest claim-based explainable stance detection dataset on Farsi

Language: Python - Size: 149 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ohmatheus/Kaggle_WSDMCup_Multilingual_Chatbot_Arena

My project for Kaggle's WSDM 2025 cup. Big project where I fuze gemma2-9b and some feature engineering for sequence classification.

Language: Jupyter Notebook - Size: 3.71 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AnanthaPadmanaban-KrishnaKumar/EffiLLaMA

Finetuning LLaMA 3.2-1B-Instruct model using qLoRA and LoRA quantization PEFT methods

Language: Python - Size: 44.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SathvikNayak123/Agentic-RAG

Agentic RAG with Llama-3.1-8b model Fine-tuned on medical conversational dataset

Language: Jupyter Notebook - Size: 215 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Hamid-Nasiri/EDoRA

EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition

Language: Python - Size: 1.69 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

david-thrower/DoRA-fine-tuning-gemma-2-2b-it

A simple example of fine tuning Gemma 2 2B instruct using DoRA / LoRA

Language: Jupyter Notebook - Size: 103 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

etechoptimist/generative_ai

This repository contains a collection of generative AI models and applications designed for various tasks such as text generation, image synthesis, and style transfer. The models leverage cutting-edge architectures like GPT, GANs, and VAEs, enabling users to explore different generative tasks.

Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0