Topic: "qwq"
johnbean393/Sidekick
A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.
Language: Swift - Size: 309 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,819 - Forks: 109

adysec/OllamaR
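Ollama load-balancing server | A high-performance, easy-to-configure open-source load-balancing server that optimizes Ollama workloads. It helps you improve your application's availability and response speed while ensuring efficient use of system resources.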
Size: 1.48 MB - Last synced at: 24 days ago - Pushed at: about 2 months ago - Stars: 132 - Forks: 124

NetEase-Media/grps_trtllm
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.
Language: Python - Size: 126 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 130 - Forks: 8

eqimp/hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
Language: Python - Size: 1.7 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 79 - Forks: 4

aws-samples/easy-model-deployer
A user-friendly command-line/SDK tool that makes it quicker and easier to deploy open-source LLMs on AWS
Language: Python - Size: 39.8 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 37 - Forks: 6

zihao-ai/BoT
🔥🔥🔥Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ
Language: Python - Size: 13.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 19 - Forks: 0

userElaina/Multiple-Keys-for-1-File
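Encrypts multiple files with multiple keys into one large file. Given a correct key, the corresponding file can be extracted. Intended for hiding files under coercion.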
Language: C++ - Size: 760 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

TeamVastsea/QwQUI
A modern, silky-smooth UI framework built on RsPack.
Language: TypeScript - Size: 1.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 2

userElaina/big-file-2-small-bmp
Splits a large file into volumes and obfuscates them as small images.
Language: Python - Size: 204 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Moha111-h/Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Language: Shell - Size: 3.07 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

mahshid1378/Bot-LLM
Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ
Language: Python - Size: 13.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

CheapNightbot/Doors-11
I created a better operating system, except it doesn't work...
Language: HTML - Size: 384 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

userElaina/naive-confuse
Simple obfuscation that can be used to keep files on cloud drives from being blocked.
Language: C++ - Size: 45.9 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

userElaina/m-of-n-keys
Generates n keyfiles from one key/file; any m of them are enough to restore the key/decrypt the original file.
Language: Python - Size: 16.1 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

userElaina/naive-Huffman 📦
Can only compress ASCII data...
Language: C++ - Size: 6.84 KB - Last synced at: 2 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0
