GitHub topics: supervised-finetuning

Repositories

GaryYufei/AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

Size: 335 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 727 - Forks: 32

InternLM/xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python - Size: 2.09 MB - Last synced at: about 24 hours ago - Pushed at: 9 days ago - Stars: 4,487 - Forks: 341

magpie-align/magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Language: Python - Size: 1.08 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 673 - Forks: 60

Kh0uloud/Modeling-Mental-Health-Trends-Using-Social-Media-Data

This project explores the intersection of social media analytics, user behavior modeling, and mental health assessment using a data-driven AI approach. Leveraging both structured metadata and unstructured textual data, a robust regression model is developed to estimate depression intensity from Twitter user profiles, interactions, and content.

Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

InternLM/InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Language: Python - Size: 199 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 2,805 - Forks: 171

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

Size: 18.6 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 979 - Forks: 56

NVlabs/catk

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR 2025 Oral.

Language: Python - Size: 7.89 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 73 - Forks: 3

BUAADreamer/MLLM-Finetuning-Demo

Demo code for fine-tuning multimodal large language models with LLaMA-Factory

Language: Python - Size: 61.5 KB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 32 - Forks: 2

liziniu/GEM

Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)

Language: Python - Size: 279 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 17 - Forks: 0

KodCode-AI/kodcode

✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework

Language: Python - Size: 40.6 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 180 - Forks: 10

ShiZhengyan/InstructionModelling

[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"

Language: Python - Size: 22.9 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 36 - Forks: 8

asifhaider/LLM-Finetuning-Prompting-Project

Python Project Sample for Demonstration

Language: Jupyter Notebook - Size: 60.8 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

DeepLearn1998/My_RAG

My first RAG

Language: Python - Size: 5.86 KB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

mirabdullahyaser/LLaMA3-Financial-Analyst

LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.

Language: Jupyter Notebook - Size: 44.9 KB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

quanshr/AugCon

[AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

Language: Python - Size: 414 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 1

guanwei49/LogLLM

LogLLM: Log-based Anomaly Detection Using Large Language Models (system log anomaly detection)

Language: Python - Size: 140 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 6

AstraZeneca/vlm

Official implementation for "Diffusion Instruction Tuning"

Size: 5.57 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 14 - Forks: 0

inst-it/inst-it

Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"

Language: Python - Size: 2.66 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 0

BUAADreamer/Qwen2-VL-History

A LLaMA-Factory fine-tuning case study for Qwen2-VL in the culture and tourism domain (historical literature and museums)

Size: 73.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 6 - Forks: 2

SSahas/python-code-assistant

Fine-tuning the salesforce/codegen model into a Python code assistant

Language: Jupyter Notebook - Size: 124 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

artaasd95/rap-music-generator

The Rap Music Generator project is an innovative LLM-based tool designed to create rap lyrics. It offers multiple fine-tuning approaches to accommodate diverse rap generation techniques, providing users with a versatile platform for generating unique and stylistically varied content.

Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

LIN-SHANG/InstructERC

The official implementation of InstructERC

Language: Python - Size: 9.02 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 123 - Forks: 8

rasyosef/phi-2-sft-and-dpo

Notebooks to create an instruction-following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

tien02/llm-math

Fine-tune a large language model on a mathematics dataset

Language: Python - Size: 7.81 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

chaoswork/sft_datasets

A curated collection of open-source SFT datasets, updated continually

Size: 3.91 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 440 - Forks: 34

sail-sg/sdft

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Language: Shell - Size: 18.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 94 - Forks: 4

eliashornberg/EPFLLaMA

EPFLLaMA: A lightweight language model fine-tuned on EPFL curriculum content. Specialized for STEM education and multiple-choice question answering. Implements advanced techniques like SFT, DPO, and quantization.

Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

rasyosef/phi-1_5-instruct

Notebooks to create an instruction-following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)

Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

KwokHing/AI-Planet-LLM-Bootcamp-Challenge

An LLM challenge to (i) fine-tune a pre-trained HuggingFace transformer model to build a code generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain

Language: Jupyter Notebook - Size: 874 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

AliBakly/EPFLLaMA

Language: Jupyter Notebook - Size: 28.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

quazirab/fine-tuning-llama-3.1-on-medical-questionnaires

Llama 3.1 Fine Tuning

Language: Jupyter Notebook - Size: 104 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

bhattbhavesh91/google-gemma-finetuning-n2sql

Finetuning Google's Gemma Model for Translating Natural Language into SQL

Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 4

sovit-123/lm_sft

Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised Fine Tuning) for several downstream tasks

Language: Jupyter Notebook - Size: 1.19 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

ChryssaNab/ECG-Heartbeat-Classification

Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch

Language: Python - Size: 140 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

18907305772/KCA

Knowledge Verification to Nip Hallucination in the Bud

Language: Python - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

fanqiwan/KCA Fork of 18907305772/KCA

Knowledge Verification to Nip Hallucination in the Bud

Language: Python - Size: 4.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 0

nsrinidhibhat/fine-tune-llama-2

This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jmaczan/c-137

🦙 Llama 2 7B fine-tuned to revive Rick

Language: Jupyter Notebook - Size: 3.27 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Keywords

supervised-finetuning (38), large-language-models (14), llm (11), lora (6), direct-preference-optimization (5), llama2 (5), huggingface (5), language-model (5), fine-tuning (5), machine-learning (4), natural-language-processing (4), llama3 (4), nlp (4), pytorch (4), instruction-tuning (3), large-language-model (3), mllm (3), transformers (3), gemma (3), llms (3), multimodal-large-language-models (3), synthetic-data (3), gpt (2), chatgpt (2), multimodal (2), vision-language-model (2), finetuning (2), huggingface-transformers (2), instruction-following (2), self-distillation (2), artificial-intelligence (2), datasets (2), llama-factory (2), trl (2), llama2-7b (2), unsloth (2), gpt-4 (2), llama (2), survey (2), chatbot (2), retrieval-augmented-generation (2), llava (2), llm-training (2), llama-2 (2), hallucination (2), synthetic-dataset-generation (2), phi3 (2), qlora (2), alignment (2), paper (2), dataset (2), rag (1), post-training (1), large-multimodal-models (1), llama-7b (1), beauty (1), vector-database (1), emotion-recognition-in-conversation (1), anomaly-detection (1), chatglm2-6b (1), system-logs (1), chatglm-6b (1), python (1), system-security (1), history (1), museum (1), qwen2-vl (1), multimodal-alignment (1), sft (1), rickandmorty (1), rick-sanchez (1), rick-and-morty (1), google-colab (1), deep-learning (1), c-137 (1), apple-m2 (1), open (1), transfer-learning (1), pre-trained-model (1), heartbeat-classification (1), ecg-signals (1), binary-classification (1), 1d-cnns (1), gpt2 (1), natural-language-to-sql (1), google (1), finetuning-llms (1), pandas (1), llama3-1-8b-finetuning (1), transformer-models (1), sentence-embeddings (1), ocra-mini-3b (1), mistral-7b (1), langchain (1), embeddings-model (1), chinese-dataset (1), transformer (1), mathematics (1), unified-data-processing (1), multi-modal (1)