An open API service providing repository metadata for many open source software ecosystems.

Topic: "finetuning"

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems with the Llama model family and run these models on various provider services.

Language: Jupyter Notebook - Size: 265 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 17,977 - Forks: 2,634

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python - Size: 18.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,772 - Forks: 419
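
As a rough illustration of how such kernel libraries are typically used, here is a minimal sketch of patching a Llama-architecture model with Liger's fused Triton kernels before training; the import path and patch function name (`apply_liger_kernel_to_llama`) are assumed to match the project's documented API, so verify them against the installed version.

```python
# Hedged sketch: swap in Liger's fused Triton kernels, then train as usual.
# The liger_kernel import and function name below are assumed from the project's
# docs and may differ between versions; verify before use.
import torch
from transformers import AutoModelForCausalLM

from liger_kernel.transformers import apply_liger_kernel_to_llama  # assumed API

apply_liger_kernel_to_llama()  # monkey-patches Llama modules (RMSNorm, RoPE, CE loss, ...)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",   # any Llama-architecture checkpoint (illustrative choice)
    torch_dtype=torch.bfloat16,
)
# ...continue with a standard Hugging Face training loop or Trainer.
```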

h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Language: Python - Size: 54.4 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 4,702 - Forks: 499

microsoft/FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook - Size: 209 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 4,206 - Forks: 544

Dataherald/dataherald

Interact with your SQL database using natural language: Natural Language to SQL powered by LLMs

Language: Python - Size: 4.34 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 3,508 - Forks: 254

eosphoros-ai/Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis, and more.

Size: 187 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 3,201 - Forks: 224

LazyAGI/LazyLLM

The easiest and laziest way to build multi-agent LLM applications.

Language: Python - Size: 11.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3,100 - Forks: 289

learnables/learn2learn

A PyTorch Library for Meta-learning Research

Language: Python - Size: 9.52 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 2,837 - Forks: 365
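
For context on what a meta-learning loop looks like, here is a minimal, hedged MAML-style sketch following learn2learn's documented pattern (the MAML wrapper with clone() and adapt()); the toy model and the make_task sampler are hypothetical stand-ins, not part of the library.

```python
# Minimal MAML-style meta-learning loop; make_task() is a hypothetical task sampler.
import torch
import learn2learn as l2l

def make_task():
    # Random tensors standing in for a few-shot support/query split.
    x_s, y_s = torch.randn(5, 10), torch.randint(0, 2, (5,))
    x_q, y_q = torch.randn(5, 10), torch.randint(0, 2, (5,))
    return x_s, y_s, x_q, y_q

model = torch.nn.Linear(10, 2)                       # toy base learner
maml = l2l.algorithms.MAML(model, lr=0.1)            # inner-loop learning rate
opt = torch.optim.Adam(maml.parameters(), lr=1e-3)   # outer-loop optimizer
loss_fn = torch.nn.CrossEntropyLoss()

for step in range(100):
    opt.zero_grad()
    learner = maml.clone()                           # task-specific copy of the learner
    x_s, y_s, x_q, y_q = make_task()
    learner.adapt(loss_fn(learner(x_s), y_s))        # inner-loop update on support data
    meta_loss = loss_fn(learner(x_q), y_q)           # evaluate the adapted learner on query data
    meta_loss.backward()                             # backpropagate through the adaptation step
    opt.step()
```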

stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Language: Python - Size: 18.3 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 2,660 - Forks: 207

SocialAI-tianji/Tianji

Building a large language model that understands human relationships and social etiquette | Tutorials covering prompt engineering, RAG, agents, and LLM fine-tuning

Language: Python - Size: 8.3 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 1,568 - Forks: 127

jina-ai/finetuner 📦

:dart: Task-oriented embedding tuning for BERT, CLIP, etc.

Language: Python - Size: 71.5 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 1,505 - Forks: 70

adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

Language: Jupyter Notebook - Size: 96.9 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1,069 - Forks: 118

georgian-io/LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Language: Python - Size: 32.7 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 858 - Forks: 104

data-prep-kit/data-prep-kit

Open source project for data preparation for GenAI applications

Language: HTML - Size: 237 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 832 - Forks: 222

daswer123/xtts-webui

Web UI for using XTTS and for fine-tuning it

Language: Python - Size: 2.76 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 808 - Forks: 158

minosvasilias/godot-dodo

Finetuning large language models for GDScript generation.

Language: Python - Size: 8.01 MB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 556 - Forks: 26

junxia97/awesome-pretrain-on-molecules

[IJCAI 2023 survey track] A curated list of resources for chemical pre-trained models

Size: 565 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 532 - Forks: 59

dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Language: Jupyter Notebook - Size: 7.81 MB - Last synced at: 24 days ago - Pushed at: 28 days ago - Stars: 527 - Forks: 70

helixml/helix

♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.

Language: Go - Size: 104 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 522 - Forks: 58

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Language: Go - Size: 5.32 MB - Last synced at: about 10 hours ago - Pushed at: 3 days ago - Stars: 486 - Forks: 49

xing61/xiaoyi-robot

A stable, high-quality OpenAI API proxy for enterprises and developers. Supports ChatGPT API calls and the OpenAI API (gpt-4, gpt-3.5) without an OpenAI key, an OpenAI account, or a USD bank card: just call it directly. Stable and easy to use! 智增增

Language: PHP - Size: 384 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 465 - Forks: 35

Xirider/finetune-gpt2xl

Guide: Fine-tune GPT-2 XL (1.5 billion parameters) and GPT-Neo (2.7B) on a single GPU with Hugging Face Transformers using DeepSpeed

Language: Python - Size: 5.44 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 408 - Forks: 70
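
The general recipe the guide automates looks roughly like the sketch below: a standard Hugging Face Trainer run with a DeepSpeed (ZeRO/offload) config so the 1.5B-parameter model fits on one GPU. The paths ds_config.json and train.txt are hypothetical placeholders; the repo ships its own configs and scripts.

```python
# Hedged sketch of causal-LM fine-tuning with Hugging Face Trainer + DeepSpeed.
# "ds_config.json" and "train.txt" are placeholder paths, not files from the repo.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("gpt2-xl")
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2-xl")

dataset = load_dataset("text", data_files={"train": "train.txt"})["train"]
dataset = dataset.map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=512),
    batched=True, remove_columns=["text"],
)

args = TrainingArguments(
    output_dir="gpt2xl-finetuned",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    fp16=True,
    deepspeed="ds_config.json",   # ZeRO / CPU-offload settings live in this config
)
Trainer(
    model=model, args=args, train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```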

microsoft/AzureML-BERT

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

Language: Jupyter Notebook - Size: 314 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 400 - Forks: 126

baidubce/bce-qianfan-sdk

Provides best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform.

Language: Jupyter Notebook - Size: 75.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 372 - Forks: 58

zjysteven/lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Language: Python - Size: 12.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 345 - Forks: 39

LHRLAB/ChatKBQA

[ACL 2024] Official resources of "ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models".

Language: Python - Size: 18.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 316 - Forks: 27

ServiceNow/TapeAgents

TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle

Language: Python - Size: 188 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 297 - Forks: 37

JosefAlbers/Phi-3-Vision-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 273 - Forks: 22

promptslab/LLMtuner

Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)

Language: Python - Size: 591 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 240 - Forks: 15

gyunggyung/KoGPT2-FineTuning

🔥 Korean GPT-2 (KoGPT2) fine-tuning, trained on Korean song-lyric data 🔥

Language: Python - Size: 24.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 228 - Forks: 56

git-disl/awesome_LLM-harmful-fine-tuning-papers

A survey on harmful fine-tuning attacks against large language models

Size: 3.77 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 215 - Forks: 6

babycommando/neuralgraffiti

Live-bending a foundation model's output at the neural-network level.

Language: Jupyter Notebook - Size: 31.3 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 212 - Forks: 16

rasbt/dora-from-scratch

LoRA and DoRA from Scratch Implementations

Language: Jupyter Notebook - Size: 41 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 199 - Forks: 15
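
To show the core idea such from-scratch implementations build on, here is a minimal LoRA linear layer in plain PyTorch (an illustrative sketch, not the repository's code): the pretrained weight is frozen and a trainable low-rank update B·A, scaled by alpha/r, is added to its output.

```python
# Minimal from-scratch LoRA layer: freeze W, train only the low-rank factors A and B.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, linear: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.linear = linear
        for p in self.linear.parameters():
            p.requires_grad_(False)            # keep the pretrained weight (and bias) frozen
        self.A = nn.Parameter(torch.randn(r, linear.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(linear.out_features, r))  # zero init: no change at start
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.linear(x) + self.scaling * (x @ self.A.T @ self.B.T)

layer = LoRALinear(nn.Linear(768, 768))
out = layer(torch.randn(2, 768))               # only A and B receive gradients during training
```

DoRA goes one step further by decomposing the weight into magnitude and direction components before applying the low-rank update; the repository covers that variant as well.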

git-cloner/llama2-lora-fine-tuning

Llama 2 fine-tuning with DeepSpeed and LoRA

Language: Python - Size: 22.6 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 176 - Forks: 14

codelion/ellora

Enhancing LLMs with LoRA

Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 169 - Forks: 13

Snap-gen/Snapgen

🏗️ Build, fine-tune, and run generative models locally!

Language: Go - Size: 3.56 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 159 - Forks: 42

woctezuma/finetune-detr

Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.

Language: Jupyter Notebook - Size: 79.5 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 150 - Forks: 24

NVIDIA-NeMo/Automodel

PyTorch DTensor-native training library for LLMs/VLMs with out-of-the-box Hugging Face support

Language: Python - Size: 8.05 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 141 - Forks: 17

git-cloner/llama-lora-fine-tuning

LLaMA fine-tuning with LoRA

Language: Python - Size: 109 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 140 - Forks: 15

kuutsav/llm-toys

Small finetuned LLMs for a diverse set of useful tasks

Language: Python - Size: 72.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 128 - Forks: 6

altengineer/awesome-ai-repositories

A curated list of open source repositories for AI Engineers

Size: 178 KB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 117 - Forks: 21

kyegomez/Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Language: Python - Size: 52.7 KB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 111 - Forks: 11

Trainy-ai/llm-atc 📦

Fine-tuning and serving LLMs on any cloud

Language: Python - Size: 1.71 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 89 - Forks: 2

yifanzhang-pro/AutoMathText

Official implementation of the ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts" (featured in Hugging Face Daily Papers: https://huggingface.co/papers/2402.07625)

Language: Python - Size: 1.84 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 84 - Forks: 5

Oqura-ai/local-datagen-cli

Synthetic dataset generation workflow using local file resources for fine-tuning LLMs.

Language: Python - Size: 2.82 MB - Last synced at: 8 days ago - Pushed at: 24 days ago - Stars: 80 - Forks: 8

kamalkraj/e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

Language: Python - Size: 34.2 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 80 - Forks: 18
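
For orientation, the sketch below shows the usual way a decoder-only model such as Mistral is used as a sentence-embedding model after this kind of fine-tuning: take the hidden state of the last non-padding token and L2-normalize it. The checkpoint name is the existing public e5-mistral model used purely for illustration, and the pooling shown is the common pattern, not necessarily this repository's exact code.

```python
# Hedged sketch: last-token pooling to get sentence embeddings from a decoder-only model.
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

name = "intfloat/e5-mistral-7b-instruct"          # public checkpoint, used here for illustration
tokenizer = AutoTokenizer.from_pretrained(name)
tokenizer.padding_side = "right"                  # keeps the last-token index computation simple
model = AutoModel.from_pretrained(name, torch_dtype=torch.bfloat16)

texts = ["query: how to fine-tune an LLM for sentence embeddings"]
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    hidden = model(**batch).last_hidden_state     # (batch, seq_len, dim)

last = batch["attention_mask"].sum(dim=1) - 1     # index of the last non-padding token
emb = hidden[torch.arange(hidden.size(0)), last]  # last-token pooling
emb = F.normalize(emb, p=2, dim=1)                # unit-length embeddings for cosine similarity
```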

924973292/MambaPro

[AAAI 2025] MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Language: Python - Size: 24.2 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 77 - Forks: 3

GURPREETKAURJETHRA/Generative-AI-LLM-Projects

Gen AI Large Language Model Projects

Language: Jupyter Notebook - Size: 23 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 77 - Forks: 22

LennartPurucker/finetune_tabpfn_v2

Code for finetuning TabPFN on one downstream tabular dataset.

Language: Python - Size: 67.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 76 - Forks: 14

Azure-Samples/azureai-foundry-finetuning-raft

A recipe that walks you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic dataset with the RAFT method from UC Berkeley's Gorilla project.

Language: Jupyter Notebook - Size: 41.2 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 74 - Forks: 26

Baijiong-Lin/LoRA-Torch

PyTorch reimplementation of LoRA (with support for nn.MultiheadAttention in OpenCLIP)

Language: Python - Size: 60.5 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 72 - Forks: 7

LongxingTan/open-retrievals

All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers

Language: Python - Size: 1.42 MB - Last synced at: 26 days ago - Pushed at: 3 months ago - Stars: 69 - Forks: 13

goodreasonai/praetor-data

Praetor is a lightweight finetuning data and prompt management tool

Language: Python - Size: 6.89 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 67 - Forks: 0

speediedan/finetuning-scheduler

A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.

Language: Python - Size: 2.74 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 65 - Forks: 6
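
As a rough illustration of the idea behind scheduled fine-tuning (not this extension's API, which is Lightning-specific), the plain-PyTorch sketch below unfreezes deeper parameter groups at later epochs so training starts at the head and gradually reaches the backbone.

```python
# Illustrative staged-unfreezing schedule in plain PyTorch (toy model, hypothetical phases).
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(128, 128), nn.ReLU(),
    nn.Linear(128, 128), nn.ReLU(),
    nn.Linear(128, 10),
)

# epoch -> module to unfreeze at that epoch (last layer first)
schedule = {0: model[4], 2: model[2], 4: model[0]}

for p in model.parameters():
    p.requires_grad = False                      # start fully frozen

for epoch in range(6):
    if epoch in schedule:
        for p in schedule[epoch].parameters():
            p.requires_grad = True               # unfreeze this phase's group
    trainable = [p for p in model.parameters() if p.requires_grad]
    # ...rebuild the optimizer over `trainable` and run one epoch of training here.
```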

zou-group/sirius

SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning

Language: Python - Size: 70.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 60 - Forks: 5

kyegomez/Finetuning-Suite

Finetune any model on HF in less than 30 seconds

Language: Jupyter Notebook - Size: 95.4 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 58 - Forks: 7

unit-mesh/unit-gen

UnitGen is a code fine-tuning data framework that generates fine-tuning data directly from your existing codebase: code completion, test generation, documentation generation, and more.

Language: Kotlin - Size: 1.26 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 11

sayedmohamedscu/Vision-language-models-VLM

Vision-language model (VLM) fine-tuning notebooks and use cases (MedGemma, PaliGemma, Florence, ...)

Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 52 - Forks: 10

linhlpv/awesome-offline-to-online-RL-papers

A list of Offline to Online RL papers (continually updated)

Size: 16.6 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 52 - Forks: 0

MohamedSebaie/Fight_Detection_From_Surveillance_Cameras-PyTorch_Project

Fight detection from surveillance cameras by fine-tuning a pretrained PyTorch model

Language: Jupyter Notebook - Size: 208 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 51 - Forks: 14
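
As a generic illustration of the fine-tuning pattern involved (not this project's exact architecture or data), the sketch below loads a pretrained torchvision video model, freezes the backbone, and replaces the classification head with a two-class fight / no-fight output; weights="DEFAULT" assumes a recent torchvision release.

```python
# Generic transfer-learning sketch: frozen pretrained backbone, new 2-class head.
import torch
import torch.nn as nn
from torchvision.models.video import r3d_18

model = r3d_18(weights="DEFAULT")                 # pretrained on Kinetics-400
for p in model.parameters():
    p.requires_grad = False                       # freeze the backbone
model.fc = nn.Linear(model.fc.in_features, 2)     # new trainable fight / no-fight head

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
clips = torch.randn(4, 3, 16, 112, 112)           # dummy clips: (batch, channels, frames, H, W)
labels = torch.randint(0, 2, (4,))
loss = nn.CrossEntropyLoss()(model(clips), labels)
loss.backward()
optimizer.step()
```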

deshwalmahesh/PHUDGE

Official repo for the paper "PHUDGE: Phi-3 as Scalable Judge". Evaluate your LLMs with or without a custom rubric or reference answer, in absolute or relative mode, and much more. Also collects the available tools, methods, repos, and code for hallucination detection, LLM evaluation, grading, and more.

Language: Jupyter Notebook - Size: 13.1 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 7

chuangchuangtan/LLaVA-NeXT-Image-Llama3-Lora

LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft

Language: Python - Size: 11.6 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 4

adithya-s-k/CompanionLLM

CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion

Language: Jupyter Notebook - Size: 40.1 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 44 - Forks: 5

poloclub/Fine-tuning-LLMs

Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial

Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 22

HenryNdubuaku/super-lazy-autograd

Hand-derived, memory-efficient, super-lazy PyTorch VJPs for training LLMs on a laptop, all using one op (bundled scaled matmuls).

Language: Python - Size: 1.32 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 41 - Forks: 0

conneroisu/Text-Dataset-Aid-Plugin

An Obsidian plugin that helps create personal JSONL datasets for text-generation models.

Language: TypeScript - Size: 157 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 38 - Forks: 3

git-cloner/Llama2-chinese

Llama 2 Chinese fine-tuning

Language: Python - Size: 65.4 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 37 - Forks: 8

CogitoNTNU/TutorAI

TutorAI is a RAG system that assists with learning academic subjects while drawing on the curriculum and citing it. The project centers on an application that ingests a textbook in most common formats and facilitates efficient learning of the course material.

Language: Python - Size: 20.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 36 - Forks: 14

MaxiDonkey/DelphiGemini

The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.

Language: Pascal - Size: 216 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 36 - Forks: 4

machinelearningnuremberg/QuickTune

[ICLR2024] Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How

Language: Python - Size: 5.86 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 33 - Forks: 4

avocardio/Zicklein

Fine-tuning instruct-LLaMA on German datasets.

Language: Python - Size: 9.78 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 33 - Forks: 5

VatsaDev/nanoChatGPT

nanoGPT turned into a chat model

Language: Python - Size: 266 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 33 - Forks: 5

omerbsezer/Fast-LLM-Agent-MCP

This repo covers LLM and agent concepts both theoretically and practically: LLMs, RAG, fine-tuning, agents, tools, MCP, AWS Strands Agents, the Google Agent Development Kit (ADK), reference documents, etc.

Language: Python - Size: 65.3 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 30 - Forks: 8

Cre4T3Tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.

Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 30 - Forks: 0
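
A minimal sketch of the underlying 4-bit QLoRA setup with transformers, bitsandbytes, and peft is shown below; the base checkpoint name, rank, and target modules are illustrative assumptions, and the Unsloth-specific integration the repo demonstrates is omitted.

```python
# Hedged sketch: load a base model in 4-bit (NF4) and attach trainable LoRA adapters.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",                 # NormalFloat4 quantization
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",     # assumed base checkpoint
    quantization_config=bnb,
    device_map="auto",
)
lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,    # illustrative hyperparameters
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()             # only the adapter weights are trainable
```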

ssbuild/chatglm_rlhf

chatglm_rlhf_finetuning

Language: Python - Size: 149 KB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 1

paulocoutinhox/mini-llm

Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo) using your own data — built with Python and Transformers. Adapt powerful language models to your domain with ease.

Language: Python - Size: 89.8 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 28 - Forks: 1

francoislanc/midistral

LLM finetuned for generating symbolic music

Language: Python - Size: 1.52 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 2

microsoft/Build25-LAB329

Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 27 - Forks: 12

neph1/finetrainers-ui

Gradio UI for training video models using finetrainers

Language: Python - Size: 103 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 27 - Forks: 2

dannylee1020/openpo

Building synthetic data for preference tuning

Language: Python - Size: 10.7 MB - Last synced at: 23 days ago - Pushed at: 10 months ago - Stars: 27 - Forks: 0

computational-cell-analytics/peft-sam

Parameter Efficient Fine-Tuning of Segment Anything Model

Language: Python - Size: 416 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 24 - Forks: 2

MaxiDonkey/DelphiMistralAI

The DelphiMistralAI wrapper brings Mistral's text, vision, and audio models and agentic Conversations to Delphi, with chat, embeddings, Codestral codegen, fine-tuning, batching, moderation, async/await helpers, and live request monitoring.

Language: Pascal - Size: 1.79 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 5

Shaurya-Sethi/transqlate

End-to-end natural language to SQL system: schema-aware model fine-tuning, retrieval-augmented prompting, and production-grade CLI, powered by a custom fine-tuned Phi-4 Mini.

Language: Python - Size: 1.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 23 - Forks: 1

chainstacklabs/web3-ai-trading-agent

Build an Autonomous Web3 AI Trading Agent (BASE + Uniswap V4 example)

Language: Python - Size: 1.11 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 23 - Forks: 5

shaheennabi/Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

Language: Jupyter Notebook - Size: 692 KB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 23 - Forks: 6

meaningalignment/dft

Democratic Fine-tuning with a Moral Graph

Language: TypeScript - Size: 10 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 9

zhaoyl18/SEIKO

SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. It outperforms all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.

Language: Python - Size: 3.85 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 0

adithya-s-k/Indic-llm

An open-source framework designed to adapt pre-trained language models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.

Language: Python - Size: 171 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 1

SIC98/GPT2-python-code-generator

GPT2 finetuning with transformers 🤗

Language: Jupyter Notebook - Size: 185 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 2

IBM/AutoVP

[ICLR24] AutoVP: An Automated Visual Prompting Framework and Benchmark

Language: Python - Size: 579 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 20 - Forks: 2

Hemanthkumar2112/Reward-Modeling-RLHF-Finetune-and-RAG

Gemma 2 (9B) and Llama 3 8B fine-tuning and RAG sample codebase, implemented on the Kaggle platform

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 20 - Forks: 6

lzzcd001/nabla-gfn

Official Implementation of Nabla-GFlowNet (ICLR 2025)

Language: Python - Size: 4.27 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 0

ThomasRochefortB/open-agentinstruct

An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation

Language: Python - Size: 372 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 19 - Forks: 3

git-disl/Booster

This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025).

Language: Shell - Size: 293 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 19 - Forks: 0

git-disl/Vaccine

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)

Language: Shell - Size: 730 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 0

HomoScriptor-Project/HomoScriptor

Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.

Size: 63.5 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 3

Raumberg/myllm

Multi-node distributed LLM training framework

Language: Python - Size: 1.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 17 - Forks: 1

itspranavajay/Merge-Diffusion-Tool

Merge Diffusion Tool is an open-source solution for merging LoRA models, integrating LoRA into checkpoints, and blending Flux and Stable Diffusion models (SD1.5, SD2, SD3, SDXL). Optimize your AI workflows with ease.

Language: Python - Size: 24.4 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 0

ruimalheiro/training-custom-llama

Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, SFT, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.

Language: Python - Size: 1.32 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 16 - Forks: 3

utahnlp/structured_tuning_srl

Implementation of our ACL 2020 paper: Structured Tuning for Semantic Role Labeling

Language: Python - Size: 802 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 2