Topic: "prompt-evaluation"
abilzerian/LLM-Prompt-Library
A playground of highly experimental prompts, tools & scripts for machine intelligence models from DeepSeek, OpenAI, Anthropic, Meta, Mistral, Google, xAI & others.
Language: Python - Size: 143 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 1,116 - Forks: 116

prompt-foundry/python-sdk
The prompt engineering, prompt management, and prompt evaluation tool for Python
Language: Python - Size: 20.7 MB - Last synced at: 25 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 0

prompt-foundry/typescript-sdk
The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and NodeJS.
Language: TypeScript - Size: 20.9 MB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1

prompt-foundry/ruby-sdk
The prompt engineering, prompt management, and prompt evaluation tool for Ruby.
Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

prompt-foundry/java-sdk
The prompt engineering, prompt management, and prompt evaluation tool for Java.
Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

thunderous77/GLaPE
Official implementation for "GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Models" (stay tuned & more will be updated)
Language: Python - Size: 2.08 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

michellepace/anthropic-model-compare
Runs two simple test prompts against 5 Anthropic models. Visually compares speed, capability, costs.
Language: Jupyter Notebook - Size: 405 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Rickcau/ConsoleApp-Prompt-Testing
Language: C# - Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

danielrosehill/LLM-Evaluation-Prompts
A few prompts that I am storing in a repo for the purpose of running controlled experiments comparing and benchmarking different LLMs for defined use-cases
Language: Python - Size: 435 KB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

prompt-foundry/dotnet-sdk
The prompt engineering, prompt management, and prompt evaluation tool for C# and .NET
Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

prompt-foundry/kotlin-sdk
The prompt engineering, prompt management, and prompt evaluation tool for Kotlin.
Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0
