Topic: "llama"
ollama/ollama
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Language: Go - Size: 46.8 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 151,303 - Forks: 12,981

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language: Python - Size: 53 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 56,995 - Forks: 6,986

vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python - Size: 76.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 56,958 - Forks: 9,835

unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Language: Python - Size: 7.34 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 44,982 - Forks: 3,648

Aider-AI/aider
aider is AI pair programming in your terminal
Language: Python - Size: 133 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 37,143 - Forks: 3,440

chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and LLMs such as ChatGLM, Qwen and Llama
Language: TypeScript - Size: 138 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 36,013 - Forks: 6,015

mudler/LocalAI
The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Language: Go - Size: 25.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 35,056 - Forks: 2,737

haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python - Size: 13.4 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 23,432 - Forks: 2,590

fishaudio/fish-speech
SOTA Open Source TTS
Language: Python - Size: 18.5 MB - Last synced at: about 1 hour ago - Pushed at: 5 days ago - Stars: 22,878 - Forks: 1,884

HqWu-HITCS/Awesome-Chinese-LLM
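A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.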
Size: 11.1 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 21,076 - Forks: 2,011

yamadashy/repomix
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
Language: TypeScript - Size: 9.1 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 18,944 - Forks: 835

ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 18,913 - Forks: 1,880

meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services
Language: Jupyter Notebook - Size: 266 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 17,792 - Forks: 2,593

sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language: Python - Size: 32.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17,596 - Forks: 2,822

GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language: Python - Size: 3.06 MB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 15,437 - Forks: 2,273

LlamaFamily/Llama-Chinese
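Llama Chinese community: a continuously updated collection of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama models. Fully open source and available for commercial use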
Language: Python - Size: 19.2 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 14,685 - Forks: 1,304

cocktailpeanut/dalai
The simplest way to run LLaMA on your local machine
Language: CSS - Size: 11.7 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 13,058 - Forks: 1,379

PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
Language: Python - Size: 111 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 12,754 - Forks: 3,069

AstrBotDevs/AstrBot
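✨ Easy-to-use multi-platform LLM chatbot and development framework ✨ Supports QQ, QQ Channel, Telegram, WeCom, Feishu, DingTalk | Knowledge base, MCP server, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify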
Language: Python - Size: 31.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 11,809 - Forks: 843

bentoml/OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Language: Python - Size: 41.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 11,738 - Forks: 763

ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
Language: Python - Size: 31.8 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 11,577 - Forks: 1,217

TheR1D/shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.
Language: Python - Size: 249 KB - Last synced at: about 2 hours ago - Pushed at: about 2 months ago - Stars: 11,311 - Forks: 908

getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Language: TypeScript - Size: 1.71 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 11,000 - Forks: 711

tensorzero/tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
Language: Rust - Size: 96.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9,909 - Forks: 663

bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Language: Python - Size: 4.06 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 9,781 - Forks: 570

modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
Language: Python - Size: 67.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9,686 - Forks: 854

dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language: TypeScript - Size: 79 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,560 - Forks: 1,556

langchain4j/langchain4j
Java version of LangChain
Language: Java - Size: 17.2 MB - Last synced at: 2 days ago - Pushed at: 7 days ago - Stars: 8,898 - Forks: 1,638

xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language: Python - Size: 47.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,482 - Forks: 735

oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Language: Python - Size: 30.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 8,450 - Forks: 640

SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving for Local Deployment
Language: C++ - Size: 21.7 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 8,319 - Forks: 443

reorproject/reor
Private & local AI personal knowledge management app for high entropy people.
Language: JavaScript - Size: 93.7 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 8,225 - Forks: 501

LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational LLM)
Language: HTML - Size: 18 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 8,221 - Forks: 767

LostRuins/koboldcpp Fork of ggml-org/llama.cpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Language: C++ - Size: 301 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8,132 - Forks: 525

zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Language: Python - Size: 22.2 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 7,707 - Forks: 554

ymcui/Chinese-LLaMA-Alpaca-2
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, with 64K long-context models
Language: Python - Size: 8.18 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 7,177 - Forks: 569

InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python - Size: 9.01 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 6,942 - Forks: 599

k8sgpt-ai/k8sgpt
Giving Kubernetes Superpowers to everyone
Language: Go - Size: 27.5 MB - Last synced at: about 1 hour ago - Pushed at: about 22 hours ago - Stars: 6,925 - Forks: 864

yangjianxin1/Firefly
Firefly: an LLM training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom and other large models
Language: Python - Size: 6.24 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 6,531 - Forks: 583

arcee-ai/mergekit
Tools for merging pretrained large language models.
Language: Python - Size: 911 KB - Last synced at: about 1 hour ago - Pushed at: 20 days ago - Stars: 6,258 - Forks: 607

WangRongsheng/awesome-LLM-resources
🧑‍🚀 Summary of the world's best LLM resources (speech and video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).
Size: 36.2 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 6,103 - Forks: 599

mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
Language: TypeScript - Size: 127 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 5,985 - Forks: 355

lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: about 6 hours ago - Pushed at: 3 days ago - Stars: 5,912 - Forks: 460

serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Language: Svelte - Size: 3 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 5,751 - Forks: 402

baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language: Python - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 5,685 - Forks: 504

gluonfield/enchanted
Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral or Vicuna using Ollama.
Language: Swift - Size: 5.38 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 5,618 - Forks: 378

linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Language: Python - Size: 16.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5,601 - Forks: 394

cheahjs/free-llm-api-resources
A list of free LLM inference resources accessible via API.
Language: Python - Size: 349 KB - Last synced at: about 19 hours ago - Pushed at: about 21 hours ago - Stars: 5,221 - Forks: 452

multimodal-art-projection/YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Language: Python - Size: 32.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5,052 - Forks: 559

SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.
Language: Python - Size: 10.5 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 4,855 - Forks: 487

h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Language: Python - Size: 54.5 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4,607 - Forks: 488

Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
Language: HTML - Size: 82.7 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 4,327 - Forks: 306

CrazyBoyM/llama3-Chinese-chat
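Repository of Chinese post-trained versions of Llama3 and Llama3.1: interesting fine-tuned and modified weights, plus tutorial videos and docs on training, inference, evaluation, and deployment.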
Language: Python - Size: 305 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 4,161 - Forks: 338

clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents.
Language: JavaScript - Size: 14.3 MB - Last synced at: about 1 hour ago - Pushed at: 7 days ago - Stars: 4,156 - Forks: 178

Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese llama+lora approach, with a structure modeled on alpaca
Language: C - Size: 252 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 4,147 - Forks: 414

shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical LLMs, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
Language: Python - Size: 13.3 MB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 4,070 - Forks: 598

transformerlab/transformerlab-app
Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
Language: TypeScript - Size: 10.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 4,044 - Forks: 372

casibase/casibase
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
Language: Go - Size: 21.8 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 3,985 - Forks: 474

langroid/langroid
Harness LLMs with Multi-Agent Programming
Language: Python - Size: 110 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 3,672 - Forks: 345

yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)
Language: Jupyter Notebook - Size: 51 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 3,636 - Forks: 431

datawhalechina/tiny-universe
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
Language: Jupyter Notebook - Size: 21 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3,617 - Forks: 365

gpustack/gpustack
Simple, scalable AI model deployment on GPU clusters
Language: Python - Size: 132 MB - Last synced at: 9 days ago - Pushed at: 12 days ago - Stars: 3,579 - Forks: 364

ModelTC/LightLLM
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language: Python - Size: 8.01 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,577 - Forks: 276

datawhalechina/llms-from-scratch-cn
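Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch for a deep understanding of how large models work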
Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3,489 - Forks: 483

higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language: Jupyter Notebook - Size: 4.83 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,469 - Forks: 573

predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Language: Python - Size: 6.62 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 3,401 - Forks: 262

SciSharp/LLamaSharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.
Language: C# - Size: 393 MB - Last synced at: about 2 hours ago - Pushed at: 5 days ago - Stars: 3,347 - Forks: 465

X-D-Lab/LangChain-ChatGLM-Webui
Automatic question answering over local knowledge bases, based on LangChain and LLMs such as the ChatGLM-6B series
Language: Python - Size: 18.7 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,288 - Forks: 493

OpenGVLab/InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
Language: Python - Size: 41.9 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 3,218 - Forks: 231

wenge-research/YAYI 📦
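YaYi LLMs: secure and reliable custom large models for customers. A series of LlaMA 2 & BLOOM models trained on large-scale Chinese and English multi-domain instruction data, developed by the Zhongke Wenge algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)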
Language: Python - Size: 153 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,174 - Forks: 44

strands-agents/sdk-python
A model-driven approach to building AI agents in just a few lines of code.
Language: Python - Size: 1.15 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3,101 - Forks: 339

Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Language: Python - Size: 168 MB - Last synced at: about 6 hours ago - Pushed at: about 8 hours ago - Stars: 3,075 - Forks: 428

DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language: Python - Size: 19.6 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,061 - Forks: 280

CVI-SZU/Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pretraining and instruction fine-tuning datasets
Language: Python - Size: 7.27 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 3,057 - Forks: 232

SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
Language: Rust - Size: 1.61 MB - Last synced at: 6 days ago - Pushed at: 8 months ago - Stars: 2,962 - Forks: 104

johnbean393/Sidekick
A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.
Language: Swift - Size: 401 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2,956 - Forks: 119

run-llama/LlamaIndexTS
Data framework for your LLM applications. Focused on server-side solutions
Language: TypeScript - Size: 79.1 MB - Last synced at: about 20 hours ago - Pushed at: 1 day ago - Stars: 2,854 - Forks: 468

aandrew-me/tgpt
AI Chatbots in terminal without needing API keys
Language: Go - Size: 3.22 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 2,822 - Forks: 284

PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use, building a fine-tuning platform that researchers can easily get started with. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible.
Language: Jupyter Notebook - Size: 137 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,767 - Forks: 252

stochasticai/xTuring
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
Language: Python - Size: 18.4 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 2,660 - Forks: 202

ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2,620 - Forks: 682

om-ai-lab/OmAgent
Build multimodal language agents for fast prototype and production
Language: Python - Size: 11.4 MB - Last synced at: 22 days ago - Pushed at: 6 months ago - Stars: 2,542 - Forks: 283

zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Language: Jupyter Notebook - Size: 84 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,529 - Forks: 313

X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language: Python - Size: 33.5 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 2,511 - Forks: 187

young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language: Python - Size: 378 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 2,497 - Forks: 261

xusenlinzy/api-for-open-llm
OpenAI-style API for open large language models: use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large models
Language: Python - Size: 17.8 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 2,455 - Forks: 279

pytorch/ao
PyTorch native quantization and sparsity for training and inference
Language: Python - Size: 41.5 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,309 - Forks: 328

darrenburns/elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
Language: Python - Size: 567 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 2,275 - Forks: 143

Mobile-Artificial-Intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Language: Dart - Size: 124 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2,167 - Forks: 218

tairov/llama2.mojo
Inference Llama 2 in one file of pure 🔥
Language: Mojo - Size: 2.61 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,117 - Forks: 139

camenduru/text-generation-webui-colab
A colab gradio web UI for running Large Language Models
Language: Jupyter Notebook - Size: 161 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,103 - Forks: 370

lxe/simple-llm-finetuner
Simple UI for LLM Model Finetuning
Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,066 - Forks: 132

MetaGLM/FinGLM
FinGLM: dedicated to building an open, public-interest, long-lasting financial LLM project, using open source and openness to advance "AI + finance".
Language: HTML - Size: 581 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,064 - Forks: 304

chenking2020/FindTheChatGPTer
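ChatGPT's explosive popularity marked a key step toward AGI. This project aims to collect open-source alternatives to ChatGPT, including text LLMs, multimodal LLMs, and more, as a convenience for everyone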
Size: 5 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 2,033 - Forks: 200

floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
Language: Rust - Size: 259 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,005 - Forks: 111

vitoplantamura/OnnxStream
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK.
Language: C++ - Size: 34.5 MB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 1,976 - Forks: 88

ymcui/Chinese-LLaMA-Alpaca-3
Phase 3 of the Chinese Alpaca LLM project (Chinese Llama-3 LLMs) developed from Meta Llama 3
Language: Python - Size: 2.25 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 1,938 - Forks: 166

guinmoon/LLMFarm
llama and other large language models on iOS and MacOS offline using GGML library.
Language: C - Size: 355 MB - Last synced at: 5 days ago - Pushed at: 29 days ago - Stars: 1,857 - Forks: 152

heshengtao/comfyui_LLM_party
LLM Agent Framework in ComfyUI includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, access to Feishu, discord, and adapts to all llms with similar openai / aisuite interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, kimi, doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG
Language: Python - Size: 136 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1,854 - Forks: 150

FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python - Size: 5.35 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1,850 - Forks: 84
