GitHub topics: instruction-tuning
Cre4T3Tiv3/unsloth-llama3-alpaca-lora
Fine-tuned 4-bit LoRA adapter for LLaMA 3 using Alpaca-style and QLoRA-grounded instructions, built with Unsloth for fast local training.
Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: about 4 hours ago - Pushed at: about 10 hours ago - Stars: 1 - Forks: 0

sileod/tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
Language: Python - Size: 376 KB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 184 - Forks: 10

rafsid/e2e-llm-v1
🚀 Optimized LLM Training Environment Comprehensive setup for high-performance LLM training with automated Git integration, real-time feedback, and memory optimization. Features timestamped versioning and accelerated training pipelines. ⚡️ Features: Automated Git workflow Visual feedback system Memory optimization Accelerated training 🛠️
Language: Jupyter Notebook - Size: 3.18 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Size: 82.9 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 15,751 - Forks: 1,022

DSXiangLi/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
Size: 2.2 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,113 - Forks: 308

yassinelahdiy/page-language-model
Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.
Language: Python - Size: 26.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python - Size: 13.4 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 22,955 - Forks: 2,537

RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language: Python - Size: 43.1 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 11,642 - Forks: 911

X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language: Python - Size: 33.5 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 2,491 - Forks: 184

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language: Python - Size: 50.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 53,543 - Forks: 6,560

modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Language: Python - Size: 271 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4,713 - Forks: 245

zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
Language: Python - Size: 38.7 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 1,327 - Forks: 132

dhranidharan/Llama
Llama is a browser-based tool that lets you run GGUF models using JavaScript and WebAssembly. Explore its features and supported models on GitHub! 🐱💻🌐
Language: HTML - Size: 964 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

EvolvingLMMs-Lab/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language: Python - Size: 7.39 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,260 - Forks: 212

tamlhp/awesome-instruction-editing
Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing.
Size: 619 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 70 - Forks: 2

HKUDS/HiGPT
[KDD'2024] "HiGPT: Heterogenous Graph Language Models"
Language: Python - Size: 6.17 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 132 - Forks: 7

princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Language: Jupyter Notebook - Size: 366 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 458 - Forks: 46

PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language: Python - Size: 113 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 3,294 - Forks: 235

ictnlp/BayLing
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.
Language: Python - Size: 67.2 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 316 - Forks: 19

mindspore-courses/step_into_llm
MindSpore online courses: Step into LLM
Language: Jupyter Notebook - Size: 246 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 472 - Forks: 121

HKUDS/RecLM
"RecLM: Recommendation Instruction Tuning"
Language: Python - Size: 212 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 105 - Forks: 12

PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
Language: Jupyter Notebook - Size: 137 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,752 - Forks: 254

FSoft-AI4Code/CodeCapybara
Open-source Self-Instruction Tuning Code LLM
Language: Python - Size: 922 KB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 170 - Forks: 11

ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language: Python - Size: 12.7 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 326 - Forks: 35

yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Size: 11.5 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 669 - Forks: 20

HKUDS/UrbanGPT
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
Language: Python - Size: 15.5 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 369 - Forks: 47

Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
Language: HTML - Size: 82.7 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 4,312 - Forks: 305

yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
Size: 33.2 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 1,124 - Forks: 56

HKUDS/GraphGPT
[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"
Language: Python - Size: 36.5 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 755 - Forks: 78

zjr2000/Awesome-Multimodal-Chatbot
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a seamless and versatile user experience.
Size: 17.6 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 78 - Forks: 7

wxjiao/ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
Language: Python - Size: 48.1 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 176 - Forks: 22

ContextualAI/gritlm
Generative Representational Instruction Tuning
Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 654 - Forks: 47

zhuang-li/SCAR
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models
Language: Python - Size: 125 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 18 - Forks: 3

huggingface/instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
Language: Python - Size: 105 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 235 - Forks: 19

mlpc-ucsd/BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Language: Python - Size: 12.3 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 261 - Forks: 24

RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Language: Python - Size: 6.25 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 497 - Forks: 24

shure-dev/Awesome-LLM-Papers-Comprehensive-Topics
Awesome LLM Papers and repos on very comprehensive topics.
Size: 450 KB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 220 - Forks: 22

nlp-uoregon/Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Language: Python - Size: 262 MB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 97 - Forks: 3

OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python - Size: 53.2 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 1,915 - Forks: 112

datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Language: Python - Size: 895 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 1,026 - Forks: 54

bespokelabsai/curator
Synthetic data curation for post-training and structured data extraction
Language: Python - Size: 62.6 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1,391 - Forks: 109

NVlabs/DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Language: Python - Size: 3.06 MB - Last synced at: 28 days ago - Pushed at: 9 months ago - Stars: 797 - Forks: 57

Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
Language: Python - Size: 552 KB - Last synced at: 26 days ago - Pushed at: 9 months ago - Stars: 39 - Forks: 0

YuanheZ/LoRA-One
LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)
Language: Jupyter Notebook - Size: 4.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 0

ChanLiang/PEARL
[ICLR 2025] PEARL: Towards Permutation-Resilient LLMs
Language: Python - Size: 728 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

ashleykleynhans/llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
Language: Shell - Size: 97.7 KB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

Abbey4799/CuteGPT
An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.
Language: Python - Size: 276 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 3

simplifine-llm/Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
Language: Python - Size: 844 KB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 93 - Forks: 4

crux82/BISS-2024
This repository hosts materials from the Bertinoro International Spring School 2024 course
Language: Jupyter Notebook - Size: 20 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 3

HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
Size: 17.6 KB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 358 - Forks: 16

yuanze-lin/Olympus
[CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
Language: Python - Size: 3.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 425 - Forks: 71

yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language: Python - Size: 58.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 4,381 - Forks: 507

ntphuc149/ViAG
ViAG: A Novel Framework for Fine-tuning Answer Generation models ultilizing Encoder-Decoder and Decoder-only Transformers's architecture
Language: Python - Size: 110 KB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language: Python - Size: 1.99 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1,905 - Forks: 132

kavya4411/tune
Flutter Piano is a simple and educational music application that allows users to play black and white piano keys that produce realistic sounds upon tapping. It is built with Flutter and designed with a clean, intuitive interface that offers an authentic piano playing experience.
Language: C++ - Size: 2.82 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

zhilizju/Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and repositories.
Language: Python - Size: 6.01 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 335 - Forks: 14

AdrianBZG/llama-multimodal-vqa
Multimodal Instruction Tuning for Llama 3
Language: Python - Size: 31.3 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 11

bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning
Language: Jupyter Notebook - Size: 28.6 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 533 - Forks: 39

juyongjiang/CodeUp
CodeUp: A Multilingual Code Generation Llama-X Model with Parameter-Efficient Instruction-Tuning
Language: Python - Size: 18.6 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 126 - Forks: 9

salesforce/DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
Language: Python - Size: 13 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 500 - Forks: 34

InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language: Python - Size: 200 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2,834 - Forks: 172

HKUDS/GraphEdit
"GraphEdit: Large Language Models for Graph Structure Learning"
Language: Python - Size: 2.31 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 134 - Forks: 15

OpenSparseLLMs/LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Language: Python - Size: 2.21 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 84 - Forks: 12

NExT-GPT/NExT-GPT
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
Language: Python - Size: 125 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3,499 - Forks: 352

daniel-furman/sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
Language: Jupyter Notebook - Size: 9.64 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 77 - Forks: 9

yichengchen24/MIG
Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
Language: Python - Size: 10.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 19 - Forks: 1

YJiangcm/WebR
[ACL 2025] Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction
Language: Python - Size: 4.38 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 3

zjysteven/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Language: Python - Size: 13 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 296 - Forks: 33

longday1102/VietAI-experiment-LLaMA2
⚡ LLaMA-2 model experiment
Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Language: Python - Size: 240 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 554 - Forks: 29

InquestGeronimo/tllm
An LLM training library for instruction-tuning.
Language: Python - Size: 785 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 3

reshalfahsi/gpt2chat
Creating a GPT-2-Based Chatbot with Human Preferences
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

chiffonng/mnemonic-gen
[WIP] Mnemonic Generation for English Language Learning
Language: Python - Size: 6.57 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

ShareGPT4Omni/ShareGPT4V
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
Language: Python - Size: 644 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 217 - Forks: 6

kosaokis/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language: Python - Size: 40.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

reshalfahsi/gpt2moe-instruct
Instruction Fine-tuning of the GPT2MoE Model: GPT-2 with Mixture-of-Experts
Language: Jupyter Notebook - Size: 12 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ShiZhengyan/PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"
Language: Python - Size: 34.2 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 74 - Forks: 18

mittapallynitin/InstructAI
Instruction Fine-Tuning of DistilGPT2 on Alpaca Dataset. This project demonstrates full instruction fine-tuning of the distilgpt2 model on the Alpaca dataset using Hugging Face Transformers. It covers the complete pipeline from dataset preparation to training and inference, following the theory from the Self-Instruct paper.
Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

zjukg/KnowPAT
[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
Language: Python - Size: 9.03 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 193 - Forks: 17

blazerye/DrugAssist
[Briefings In Bioinformatics] DrugAssist: A Large Language Model for Molecule Optimization
Language: Python - Size: 7.03 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 129 - Forks: 13

OFA-Sys/DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
Language: Python - Size: 62 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 79 - Forks: 2

Kh0uloud/Fine-Grained-Sentiment-Analysis-for-Gym-Customer-Feedback
This repository provides a deep learning–driven solution for automatically extracting aspects and sentiments from gym reviews—transforming unstructured feedback into actionable insights that are adaptable to any customer review domain.
Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pipixin321/HolmesVAU
[CVPR 2025] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"
Language: Python - Size: 60.1 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 39 - Forks: 2

ShiZhengyan/InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
Language: Python - Size: 22.9 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 36 - Forks: 8

FudanDISC/DISC-FinLLM
DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.
Language: Python - Size: 36 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 691 - Forks: 80

Lintianqianjin/LangGFM
A Large Language Model Alone Can be a Powerful Graph Foundation Model
Language: Python - Size: 74.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 7 - Forks: 0

LostXine/LLaRA
🔥[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Language: Python - Size: 38.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 199 - Forks: 6

baaihealth/opi
This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instruction dataset with which LLMs can be adapted to protein-related tasks via instruction tuning and evaluated on these tasks.
Language: Python - Size: 52.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

FudanDISC/ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs)
Language: Python - Size: 10 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 46 - Forks: 4

UCSC-REAL/DS2
[ICLR 2025] Improving Data Efficiency via Curating LLM-Driven Rating Systems
Language: Python - Size: 18 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 49 - Forks: 5

patrick-tssn/LM-Research-Hub
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)
Language: Python - Size: 5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 18 - Forks: 3

ShinoharaHare/LLM-Training
A distributed training framework for large language models powered by Lightning.
Language: Python - Size: 281 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 19 - Forks: 4

hulkiciray/llm_from_scratch
I am using here just to share what I have been struggling to understand
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

DaehanKim/EasyRLHF
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
Language: Python - Size: 73.9 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

inst-it/inst-it
Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
Language: Python - Size: 2.66 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 27 - Forks: 0

OSU-NLP-Group/QA4RE
[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"
Language: Python - Size: 50.8 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 39 - Forks: 5

yuzhimanhua/CoF
Chain-of-Factors Paper-Reviewer Matching (WWW'25)
Language: Python - Size: 498 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

Marker-Inc-Korea/KoNEFTune
Random Noisy Embeddings with fine-tuning 방법론을 한국어 LLM에 간단히 적용할 수 있는 Kosy🍵llama
Language: Python - Size: 1.53 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

Logisx/LLMath-QLoRA
🧮 End-to-end LLM instruction finetuning based on PEFT & QLoRA to solve math problems.
Language: Jupyter Notebook - Size: 289 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ippolito-cmu/ChasingRandom
Official Repository for the Paper: Chasing Random: Instruction Selection Strategies Fail to Generalize
Language: Python - Size: 4.98 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0
