GitHub topics: gguf-models
controlecidadao/samantha_ia
Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and Gradio.
Language: Python - Size: 24.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8 - Forks: 1

ashioyajotham/fingpt_trader
An algorithmic trading system based on FinGPT, demonstrating new applications of large pre-trained Language Models in quantitative finance.
Language: Python - Size: 823 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 3

pollockjj/ComfyUI-MultiGPU Fork of neuratech-ai/ComfyUI-MultiGPU
This custom_node for ComfyUI adds one-click "Virtual VRAM" for any GGUF UNet and CLIP loader, managing the offload of layers to DRAM or VRAM to maximize the latent space of your card. Also includes nodes for directly loading entire components (UNet, CLIP, VAE) onto the device you choose. Includes 16 examples covering common use cases.
Language: Python - Size: 14.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 212 - Forks: 9

testli-ai/outlines-llama-cpp-python-streaming-output
This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.
Language: Python - Size: 98.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

LuisMiSanVe/GGUF-to-PyTorchTensor
Simple Python Script that converts the Weight of a GGUF Model to a PyTorch Tensor
Language: Python - Size: 28.3 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

mili-tan/Onllama.GGUFLinkOut
Create out symbolic links for the GGUF Models in Ollama Blobs. for use in other applications such as Llama.cpp/Jan/LMStudio etc. / 将 Ollama GGUF 模型文件软链接出,以便其他应用使用。
Language: C# - Size: 15.6 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 5 - Forks: 1
