GitHub topics: supervised-finetuning
GaryYufei/AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
Size: 335 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 727 - Forks: 32

InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language: Python - Size: 2.09 MB - Last synced at: about 24 hours ago - Pushed at: 9 days ago - Stars: 4,487 - Forks: 341

magpie-align/magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
Language: Python - Size: 1.08 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 673 - Forks: 60

Kh0uloud/Modeling-Mental-Health-Trends-Using-Social-Media-Data
This project explores the intersection of social media analytics, user behavior modeling, and mental health assessment using a data-driven AI approach. Leveraging both structured metadata and unstructured textual data, a robust regression model is developed to estimate depression intensity from Twitter user profiles, interactions, and content.
Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language: Python - Size: 199 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 2,805 - Forks: 171

Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
Size: 18.6 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 979 - Forks: 56

NVlabs/catk
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR 2025 Oral.
Language: Python - Size: 7.89 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 73 - Forks: 3

BUAADreamer/MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
Language: Python - Size: 61.5 KB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 32 - Forks: 2

liziniu/GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
Language: Python - Size: 279 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 17 - Forks: 0

KodCode-AI/kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
Language: Python - Size: 40.6 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 180 - Forks: 10

ShiZhengyan/InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
Language: Python - Size: 22.9 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 36 - Forks: 8

asifhaider/LLM-Finetuning-Prompting-Project
Python Project Sample for Demonstration
Language: Jupyter Notebook - Size: 60.8 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

DeepLearn1998/My_RAG
My first RAG
Language: Python - Size: 5.86 KB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

mirabdullahyaser/LLaMA3-Financial-Analyst
LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
Language: Jupyter Notebook - Size: 44.9 KB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

quanshr/AugCon
[AAAI 2025]Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Language: Python - Size: 414 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 1

guanwei49/LogLLM
LogLLM: Log-based Anomaly Detection Using Large Language Models (system log anomaly detection)
Language: Python - Size: 140 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 6

AstraZeneca/vlm
Official implementation for "Diffusion Instruction Tuning"
Size: 5.57 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 0

inst-it/inst-it
Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
Language: Python - Size: 2.66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 0

BUAADreamer/Qwen2-VL-History
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
Size: 73.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 6 - Forks: 2

SSahas/python-code-assistant
Finetuning salesforce/codegen model in to a python code assistant
Language: Jupyter Notebook - Size: 124 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

artaasd95/rap-music-generator
The Rap Music Generator project is an innovative LLM-based tool designed to create rap lyrics. It offers multiple fine-tuning approaches to accommodate diverse rap generation techniques, providing users with a versatile platform for generating unique and stylistically varied content.
Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

LIN-SHANG/InstructERC
The offical realization of InstructERC
Language: Python - Size: 9.02 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 123 - Forks: 8

rasyosef/phi-2-sft-and-dpo
Notebooks to create an instruction following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

tien02/llm-math
Fine tune Large Language Model on Mathematic dataset
Language: Python - Size: 7.81 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

chaoswork/sft_datasets
开源SFT数据集整理,随时补充
Size: 3.91 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 440 - Forks: 34

sail-sg/sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Language: Shell - Size: 18.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 94 - Forks: 4

eliashornberg/EPFLLaMA
EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.
Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

rasyosef/phi-1_5-instruct
Notebooks to create an instruction following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

KwokHing/AI-Planet-LLM-Bootcamp-Challenge
An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
Language: Jupyter Notebook - Size: 874 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

AliBakly/EPFLLaMA
EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.
Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

quazirab/fine-tuning-llama-3.1-on-medical-questionnaires
Llama 3.1 Fine Tuning
Language: Jupyter Notebook - Size: 104 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 4

sovit-123/lm_sft
Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised Fine Tuning) for several downstream tasks
Language: Jupyter Notebook - Size: 1.19 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

ChryssaNab/ECG-Heartbeat-Classification
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
Language: Python - Size: 140 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

18907305772/KCA
Knowledge Verification to Nip Hallucination in the Bud
Language: Python - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

fanqiwan/KCA Fork of 18907305772/KCA
Knowledge Verification to Nip Hallucination in the Bud
Language: Python - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 0

nsrinidhibhat/fine-tune-llama-2
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jmaczan/c-137
🦙 Llama 2 7B fine-tuned to revive Rick
Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
