Topic: "prompt-compression"
atjsh/llmlingua-2-js
JavaScript/TypeScript implementation of LLMLingua-2 (Experimental)
Language: TypeScript - Size: 2.39 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 16 - Forks: 1
centminmod/or-cli
Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally support Microsoft LLMLingua prompt token compression
Size: 15.4 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 4
kaistAI/GenPI
This repository is the official implementation of Generative Context Distillation.
Language: Python - Size: 2.57 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0
contextcrunch-ai/contextcrunch-python
Compress LLM Prompts and save 80%+ on GPT-4 in Python
Language: Python - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1
sidedwards/tinyprompt
A fast, Unix-style CLI tool for semantic prompt compression. Cuts LLM prompt tokens by 10-20x with >90% fidelity, saving costs and latency.
Language: Python - Size: 164 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0
ksm26/Prompt-Compression-and-Query-Optimization
Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.
Language: Jupyter Notebook - Size: 88.9 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
SreeyaSrikanth/RL-Prompt-Compression
RL-Prompt-Compression employs graph-enhanced reinforcement learning with a Phi-3 compressor trained via GRPO using a TinyLlama evaluator and a MiniLM cross-encoder feedback model, to optimize prompt compression and improve model efficiency.
Language: Jupyter Notebook - Size: 1020 KB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1
simplemindedbot/crewai-test
Self-hosted, open-source swarm of AI agents for end-to-end research on arbitrary topics using CrewAI orchestration, LLMLingua prompt compression, and dynamic MCP integration. Features specialized research agents, cost-optimized workflows, and runtime tool discovery for comprehensive automated research systems.
Language: Python - Size: 696 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1