An open API service providing repository metadata for many open source software ecosystems.

Topic: "prompt-compression"

atjsh/llmlingua-2-js

JavaScript/TypeScript implementation of LLMLingua-2 (Experimental)

Language: TypeScript - Size: 2.39 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 16 - Forks: 1

centminmod/or-cli

Python command-line tool for interacting with AI models through the OpenRouter API/Cloudflare AI Gateway, or local self-hosted Ollama. Optionally support Microsoft LLMLingua prompt token compression

Size: 15.4 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 12 - Forks: 4

kaistAI/GenPI

This repository is the official implementation of Generative Context Distillation.

Language: Python - Size: 2.57 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

contextcrunch-ai/contextcrunch-python

Compress LLM Prompts and save 80%+ on GPT-4 in Python

Language: Python - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

sidedwards/tinyprompt

A fast, Unix-style CLI tool for semantic prompt compression. Cuts LLM prompt tokens by 10-20x with >90% fidelity, saving costs and latency.

Language: Python - Size: 164 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

ksm26/Prompt-Compression-and-Query-Optimization

Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.

Language: Jupyter Notebook - Size: 88.9 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SreeyaSrikanth/RL-Prompt-Compression

RL-Prompt-Compression employs graph-enhanced reinforcement learning with a Phi-3 compressor trained via GRPO using a TinyLlama evaluator and a MiniLM cross-encoder feedback model, to optimize prompt compression and improve model efficiency.

Language: Jupyter Notebook - Size: 1020 KB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1

simplemindedbot/crewai-test

Self-hosted, open-source swarm of AI agents for end-to-end research on arbitrary topics using CrewAI orchestration, LLMLingua prompt compression, and dynamic MCP integration. Features specialized research agents, cost-optimized workflows, and runtime tool discovery for comprehensive automated research systems.

Language: Python - Size: 696 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1