llama | Topic | Ecosyste.ms: Repos

ollama/ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Language: Go - Size: 46.8 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 151,303 - Forks: 12,981

hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Language: Python - Size: 53 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 56,995 - Forks: 6,986

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python - Size: 76.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 56,958 - Forks: 9,835

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Language: Python - Size: 7.34 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 44,982 - Forks: 3,648

Aider-AI/aider

aider is AI pair programming in your terminal

Language: Python - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 37,143 - Forks: 3,440

chatchat-space/Langchain-Chatchat

Langchain-Chatchat (formerly langchain-ChatGLM): a local knowledge-based RAG and Agent app built with Langchain and LLMs such as ChatGLM, Qwen, and Llama

Language: TypeScript - Size: 138 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 36,013 - Forks: 6,015

mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Language: Go - Size: 25.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 35,056 - Forks: 2,737

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python - Size: 13.4 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 23,432 - Forks: 2,590

fishaudio/fish-speech

SOTA Open Source TTS

Language: Python - Size: 18.5 MB - Last synced at: about 1 hour ago - Pushed at: 5 days ago - Stars: 22,878 - Forks: 1,884

HqWu-HITCS/Awesome-Chinese-LLM

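A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.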

Size: 11.1 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 21,076 - Forks: 2,011

yamadashy/repomix

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

Language: TypeScript - Size: 9.1 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 18,944 - Forks: 835

ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment

Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 18,913 - Forks: 1,880

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services

Language: Jupyter Notebook - Size: 266 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 17,792 - Forks: 2,593

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python - Size: 32.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17,596 - Forks: 2,822

GaiZhenbiao/ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Language: Python - Size: 3.06 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 15,437 - Forks: 2,273

LlamaFamily/Llama-Chinese

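Llama Chinese community: continuously aggregates the latest Llama learning resources and aims to build the best open-source ecosystem for Chinese Llama models; fully open source and available for commercial use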

Language: Python - Size: 19.2 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 14,685 - Forks: 1,304

cocktailpeanut/dalai

The simplest way to run LLaMA on your local machine

Language: CSS - Size: 11.7 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 13,058 - Forks: 1,379

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Language: Python - Size: 111 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 12,754 - Forks: 3,069

AstrBotDevs/AstrBot

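✨ An easy-to-use multi-platform LLM chatbot and development framework ✨ Supports QQ, QQ Channels, Telegram, WeCom, Feishu, and DingTalk | knowledge base, MCP server, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify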

Language: Python - Size: 31.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 11,809 - Forks: 843

bentoml/OpenLLM

Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.

Language: Python - Size: 41.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11,738 - Forks: 763

ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language: Python - Size: 31.8 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 11,577 - Forks: 1,217

TheR1D/shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4 that will help you accomplish your tasks faster and more efficiently.

Language: Python - Size: 249 KB - Last synced at: about 2 hours ago - Pushed at: about 2 months ago - Stars: 11,311 - Forks: 908

getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Language: TypeScript - Size: 1.71 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 11,000 - Forks: 711

tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Language: Rust - Size: 96.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9,909 - Forks: 663

bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language: Python - Size: 4.06 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 9,781 - Forks: 570

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Language: Python - Size: 67.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9,686 - Forks: 854

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

Language: TypeScript - Size: 79 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,560 - Forks: 1,556

langchain4j/langchain4j

Java version of LangChain

Language: Java - Size: 17.2 MB - Last synced at: 2 days ago - Pushed at: 7 days ago - Stars: 8,898 - Forks: 1,638

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python - Size: 47.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,482 - Forks: 735

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Language: Python - Size: 30.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 8,450 - Forks: 640

SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving for Local Deployment

Language: C++ - Size: 21.7 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 8,319 - Forks: 443

reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

Language: JavaScript - Size: 93.7 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 8,225 - Forks: 501

LianjiaTech/BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational LLM)

Language: HTML - Size: 18 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 8,221 - Forks: 767

LostRuins/koboldcpp (fork of ggml-org/llama.cpp)

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language: C++ - Size: 301 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,132 - Forks: 525

zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language: Python - Size: 22.2 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 7,707 - Forks: 554

ymcui/Chinese-LLaMA-Alpaca-2

Phase 2 of the Chinese LLaMA & Alpaca project: Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long-context models

Language: Python - Size: 8.18 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 7,177 - Forks: 569

InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python - Size: 9.01 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 6,942 - Forks: 599

k8sgpt-ai/k8sgpt

Giving Kubernetes Superpowers to everyone

Language: Go - Size: 27.5 MB - Last synced at: about 1 hour ago - Pushed at: about 22 hours ago - Stars: 6,925 - Forks: 864

yangjianxin1/Firefly

Firefly: an LLM training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other LLMs

Language: Python - Size: 6.24 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 6,531 - Forks: 583

arcee-ai/mergekit

Tools for merging pretrained large language models.

Language: Python - Size: 911 KB - Last synced at: about 1 hour ago - Pushed at: 20 days ago - Stars: 6,258 - Forks: 607

WangRongsheng/awesome-LLM-resources

🧑‍🚀 Summary of the world's best LLM resources (speech and video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).

Size: 36.2 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 6,103 - Forks: 599

mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

Language: TypeScript - Size: 127 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 5,985 - Forks: 355

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: about 6 hours ago - Pushed at: 3 days ago - Stars: 5,912 - Forks: 460

serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Language: Svelte - Size: 3 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 5,751 - Forks: 402

baichuan-inc/Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 5,685 - Forks: 504

gluonfield/enchanted

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Language: Swift - Size: 5.38 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 5,618 - Forks: 378

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python - Size: 16.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5,601 - Forks: 394

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

Language: Python - Size: 349 KB - Last synced at: about 19 hours ago - Pushed at: about 21 hours ago - Stars: 5,221 - Forks: 452

multimodal-art-projection/YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Language: Python - Size: 32.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5,052 - Forks: 559

SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)]: instruction-tuning large language models with Chinese medical knowledge.

Language: Python - Size: 10.5 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 4,855 - Forks: 487

h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Language: Python - Size: 54.5 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4,607 - Forks: 488

Instruction-Tuning-with-GPT-4/GPT-4-LLM

Instruction Tuning with GPT-4

Language: HTML - Size: 82.7 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 4,327 - Forks: 306

CrazyBoyM/llama3-Chinese-chat

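Repository for Chinese post-trained versions of Llama3 and Llama3.1: interesting fine-tuned and modified weights, plus tutorial videos and docs on training, inference, evaluation, and deployment.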

Language: Python - Size: 305 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 4,161 - Forks: 338

clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents.

Language: JavaScript - Size: 14.3 MB - Last synced at: about 1 hour ago - Pushed at: 7 days ago - Stars: 4,156 - Forks: 178

Facico/Chinese-Vicuna

Chinese-Vicuna: A Chinese instruction-following LLaMA-based model; a low-resource Chinese llama+lora approach, with a structure modeled on alpaca

Language: C - Size: 252 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 4,147 - Forks: 414

shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical LLMs, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Language: Python - Size: 13.3 MB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 4,070 - Forks: 598

transformerlab/transformerlab-app

Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

Language: TypeScript - Size: 10.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 4,044 - Forks: 372

casibase/casibase

⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com

Language: Go - Size: 21.8 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 3,985 - Forks: 474

langroid/langroid

Harness LLMs with Multi-Agent Programming

Language: Python - Size: 110 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 3,672 - Forks: 345

yuanzhoulvpi2017/zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook - Size: 51 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 3,636 - Forks: 431

datawhalechina/tiny-universe

"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe

Language: Jupyter Notebook - Size: 21 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3,617 - Forks: 365

gpustack/gpustack

Simple, scalable AI model deployment on GPU clusters

Language: Python - Size: 132 MB - Last synced at: 9 days ago - Pushed at: 12 days ago - Stars: 3,579 - Forks: 364

ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python - Size: 8.01 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,577 - Forks: 276

datawhalechina/llms-from-scratch-cn

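Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand how large models work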

Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3,489 - Forks: 483

higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Language: Jupyter Notebook - Size: 4.83 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,469 - Forks: 573

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python - Size: 6.62 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 3,401 - Forks: 262

SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

Language: C# - Size: 393 MB - Last synced at: about 2 hours ago - Pushed at: 5 days ago - Stars: 3,347 - Forks: 465

X-D-Lab/LangChain-ChatGLM-Webui

Automated question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series

Language: Python - Size: 18.7 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,288 - Forks: 493

OpenGVLab/InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python - Size: 41.9 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 3,218 - Forks: 231

wenge-research/YAYI 📦

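YaYi LLMs: secure, reliable custom large models for customers; LLaMA 2 & BLOOM series models trained on large-scale Chinese and English multi-domain instruction data, developed by the Zhongke Wenge algorithm team. (Repo for YaYi Chinese LLMs based on LLaMA 2 & BLOOM)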

Language: Python - Size: 153 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,174 - Forks: 44

strands-agents/sdk-python

A model-driven approach to building AI agents in just a few lines of code.

Language: Python - Size: 1.15 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3,101 - Forks: 339

Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

Language: Python - Size: 168 MB - Last synced at: about 6 hours ago - Pushed at: about 8 hours ago - Stars: 3,075 - Forks: 428

DAMO-NLP-SG/Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python - Size: 19.6 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,061 - Forks: 280

CVI-SZU/Linly

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese conversational model; Chinese OpenLLaMA model; NLP pretraining and instruction-tuning datasets

Language: Python - Size: 7.27 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,057 - Forks: 232

SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

Language: Rust - Size: 1.61 MB - Last synced at: 6 days ago - Pushed at: 8 months ago - Stars: 2,962 - Forks: 104

johnbean393/Sidekick

A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.

Language: Swift - Size: 401 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,956 - Forks: 119

run-llama/LlamaIndexTS

Data framework for your LLM applications. Focused on server-side solutions

Language: TypeScript - Size: 79.1 MB - Last synced at: about 20 hours ago - Pushed at: 1 day ago - Stars: 2,854 - Forks: 468

aandrew-me/tgpt

AI Chatbots in terminal without needing API keys

Language: Go - Size: 3.22 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 2,822 - Forks: 284

PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use, building a fine-tuning platform that makes it easy for researchers to get started with and use large models. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible.

Language: Jupyter Notebook - Size: 137 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,767 - Forks: 252

stochasticai/xTuring

Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Language: Python - Size: 18.4 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 2,660 - Forks: 202

ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2,620 - Forks: 682

om-ai-lab/OmAgent

Build multimodal language agents for fast prototyping and production

Language: Python - Size: 11.4 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 2,542 - Forks: 283

zjunlp/EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language: Jupyter Notebook - Size: 84 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,529 - Forks: 313

X-PLUG/mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Language: Python - Size: 33.5 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 2,511 - Forks: 187

young-geng/EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language: Python - Size: 378 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 2,497 - Forks: 261

xusenlinzy/api-for-open-llm

OpenAI-style API for open large language models; use LLMs just like ChatGPT! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. A unified backend interface for open-source LLMs

Language: Python - Size: 17.8 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 2,455 - Forks: 279

pytorch/ao

PyTorch native quantization and sparsity for training and inference

Language: Python - Size: 41.5 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,309 - Forks: 328

darrenburns/elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Language: Python - Size: 567 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 2,275 - Forks: 143

Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Language: Dart - Size: 124 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2,167 - Forks: 218

tairov/llama2.mojo

Inference Llama 2 in one file of pure 🔥

Language: Mojo - Size: 2.61 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,117 - Forks: 139

camenduru/text-generation-webui-colab

A colab gradio web UI for running Large Language Models

Language: Jupyter Notebook - Size: 161 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,103 - Forks: 370

lxe/simple-llm-finetuner

Simple UI for LLM Model Finetuning

Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,066 - Forks: 132

MetaGLM/FinGLM

FinGLM: dedicated to building an open, public-interest, long-lasting financial LLM project, using open source and openness to advance "AI + finance".

Language: HTML - Size: 581 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,064 - Forks: 304

chenking2020/FindTheChatGPTer

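ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including text LLMs and multimodal LLMs, as a convenience for everyone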

Size: 5 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 2,033 - Forks: 200

floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

Language: Rust - Size: 259 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,005 - Forks: 111

vitoplantamura/OnnxStream

Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK.

Language: C++ - Size: 34.5 MB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 1,976 - Forks: 88

ymcui/Chinese-LLaMA-Alpaca-3

Phase 3 of the Chinese LLaMA & Alpaca project (Chinese Llama-3 LLMs), developed from Meta Llama 3

Language: Python - Size: 2.25 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 1,938 - Forks: 166

guinmoon/LLMFarm

Run llama and other large language models offline on iOS and macOS using the GGML library.

Language: C - Size: 355 MB - Last synced at: 5 days ago - Pushed at: 29 days ago - Stars: 1,857 - Forks: 152

heshengtao/comfyui_LLM_party

LLM Agent Framework in ComfyUI. Includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, offers access to Feishu and Discord, and adapts to all LLMs with OpenAI- or aisuite-style interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, kimi, doubao. Adapted to local LLMs, VLMs, and GGUF models such as llama-3.3 and Janus-Pro. Linkage with graphRAG

Language: Python - Size: 136 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1,854 - Forks: 150

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python - Size: 5.35 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1,850 - Forks: 84
