GitHub topics: deepgram
bolna-ai/bolna
Conversational voice AI agents
Language: Python - Size: 32 MB - Last synced at: about 8 hours ago - Pushed at: 1 day ago - Stars: 379 - Forks: 152

agentvoiceresponse/avr-sts-deepgram
This repository showcases the integration between Agent Voice Response and Deepgram's Speech-to-Speech API. The application leverages Deepgram's powerful speech processing capabilities to provide intelligent, context-aware responses in real-time audio format.
Language: JavaScript - Size: 29.3 KB - Last synced at: about 13 hours ago - Pushed at: about 15 hours ago - Stars: 0 - Forks: 1

autoshow/autoshow
End-to-end workflow to automatically generate show notes from audio/video transcripts
Language: TypeScript - Size: 5.25 MB - Last synced at: about 11 hours ago - Pushed at: 4 months ago - Stars: 89 - Forks: 10

kaymen99/AI-Voice-assistant
AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web searches using simple voice commands
Language: Python - Size: 22.5 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 54 - Forks: 24

voxos-ai/bolna
End-to-end platform for building voice first multimodal agents
Language: Python - Size: 18.4 MB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 422 - Forks: 114

jeffo777/input-right
An open-source AI voice agent platform that turns conversations into 100% accurate, user-verified data via a visual form.
Language: TypeScript - Size: 903 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 18 - Forks: 5

deepgram/deepgram-dotnet-sdk
Official .NET SDK for Deepgram.
Language: C# - Size: 6.82 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 44 - Forks: 36

deepgram/deepgram-go-sdk
Official Go SDK for Deepgram.
Language: Go - Size: 8.54 MB - Last synced at: 4 days ago - Pushed at: 24 days ago - Stars: 57 - Forks: 39

RodneyFinkel/groq_deepgram_agent
STT-LLM-TTS (websockets/asynchronous) Agent using Deepgram and Groq LPU's and Bert for Vector Embeddings, Chroma for persistent vector db storage and simiarity search for RAG context management
Language: Python - Size: 13 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

wiljansantiago/livekit-voice-agent
A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration.
Language: Python - Size: 9.77 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

deepgram-devs/deepgram-ai-agent-demo Fork of deepgram-starters/nextjs-live-transcription
Deepgram Conversational AI demo
Language: TypeScript - Size: 10.3 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 396 - Forks: 114

deepgram/deepgram-api-specs
Deepgram's API Specs
Language: JavaScript - Size: 309 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 2

Spac5y/Vocal-Agent
A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
Language: Python - Size: 433 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 2

spences10/audiomind
An MP3 to AI Chat Assistant - A configurable AI chat assistant that can be customized for your content and use case. Transform audio content into interactive, searchable conversations.
Language: TypeScript - Size: 577 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 2

deepgram/deepgram-python-sdk
Official Python SDK for Deepgram.
Language: Python - Size: 17.1 MB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 339 - Forks: 94

deepgram-devs/deepgram-voice-agent-demo
Deepgram Voice Agent Demo
Language: TypeScript - Size: 712 KB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 41 - Forks: 38

wiljansantiago/Youtube-video-analyzer
A sophisticated Node.js application that analyzes YouTube videos for legal compliance. It transcribes the audio content of the videos using the Deepgram API and then compares it against predefined legal rules using the GPT-4 language model.
Language: TypeScript - Size: 105 MB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 41 - Forks: 25

deepgram-starters/django-voice-agent
Get started using Deepgram's Voice Agent with this Django demo app
Language: Python - Size: 45.9 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 1

deepgram-starters/node-live-text-to-speech
Get started using Deepgram's Live Text-to-Speech with this Node demo app
Language: JavaScript - Size: 134 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 3

Yunichi/livekit-voice-ai-agent-setup
The "livekit-voice-ai-agent-setup" repository provides a comprehensive guide and resources for setting up a voice-enabled AI agent using the LiveKit platform. It includes step-by-step instructions, code samples, and documentation to help users quickly deploy their own voice AI agent solutions.
Size: 1.95 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 10 - Forks: 1

sj0n/heepno
A CLI program to transcribe audio file using Deepgram, OpenAI and AssemblyAI models.
Language: Go - Size: 62.5 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

deepgram-devs/deepgram-twilio-streaming-python
a Demo of Deepgram & Twilio that allows multiple client subscribers to watch live transcripts from ongoing Twilio calls.
Language: Python - Size: 9.77 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 9

kaloprojects/KALO-ESP32-Voice-Chat-AI-Friends
ESP32-based voice device for chatting with multiple custom AI bots. Recording questions with I2S microphone, transcribing via ElevenLabs or Deepgram STT, creating response with Groq or Open AI LLM. TTS audio output with custom AI voices via I2S & speaker. Supporting ongoing dialogues, calling bots ‘by name’, real-time web search via keyword.
Language: C++ - Size: 14.4 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 19 - Forks: 0

deepgram-devs/sts-twilio
Enable calls made to your Twilio phone number to pass through to Deepgram's Voice Agent API, enabling the caller to talk to a voice agent/bot.
Language: Python - Size: 51.8 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 8

bitcointranscripts/tstbtc
This cli app transcribe audio and videos for submission to the bitcointranscripts repo
Language: Python - Size: 6.38 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 9

deepgram/deepgram-js-sdk
Official JavaScript SDK for Deepgram.
Language: TypeScript - Size: 24.2 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 214 - Forks: 74

deepgram-starters/java-transcription
Get started using Deepgram's Transcription with this Java demo app
Language: Java - Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

AppajiDheeraj/AURA
Aura is a multimodal, voice-first AI assistant designed to be your personal command center. It provides an intelligent, responsive interface to control your computer, access real-time information, and even understand the world through visual input.
Language: TypeScript - Size: 353 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Nakul2401/AI_Excel_Mock_Interviewer
This is the AI agents-powered Excel interview platform that streamlines the technical evaluation of the candidate’s Excel proficiency through a voice-enabled, agentic interview experience. It automates question generation, real-time interview conduction, answer evaluation and feedback report generation – reducing time and bias in manual screenings
Language: JavaScript - Size: 425 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

faraaz-baig/spill-with-voice
A simple, open-source mac app for freewriting and more
Language: Swift - Size: 72.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AppajiDheeraj/Kairos
Kairos is an AI-powered companion designed for mindful conversation and self-reflection. It provides an empathetic, non-judgmental space for users to express their feelings, leveraging a sophisticated backend agent and a sleek, modern frontend.
Language: TypeScript - Size: 1.58 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

deepgram/deepgram-rust-sdk
Community Rust SDK for Deepgram.
Language: Rust - Size: 2.43 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 54 - Forks: 32

danieladdisonorg/livekit-voice-agent
A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration. It provides intelligent turn detection, function calling, comprehensive logging, telephony integration, and audio enhancement.
Language: Python - Size: 18.6 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 46 - Forks: 21

deepgram-starters/csharp-transcription
Get started using Deepgram's Transcription with this C# demo app
Language: C# - Size: 984 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

deepgram-starters/csharp-live-text-to-speech
Get started using Deepgram's Live Text-to-Speech with this C# demo app
Language: C# - Size: 133 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/php-transcription
Get started using Deepgram's Transcription with this PHP demo app
Language: PHP - Size: 288 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

deepgram-starters/go-transcription
Get started using Deepgram's PreRecorded Transcription with this Go demo app
Language: Go - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

deepgram-starters/go-live-transcription
Get started using Deepgram's Live Transcription with this Go demo app
Language: Go - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 6

deepgram-starters/csharp-voice-agent
Get started using Deepgram's Voice Agent with this C# demo app
Language: C# - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/sinatra-transcription
Get started using Deepgram's PreRecorded Transcription with this Sinatra demo app
Language: Ruby - Size: 69.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/flask-transcription
Get started using Deepgram's Pre-Recorded Transcription with this Flask demo apps
Language: Python - Size: 3.89 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 15 - Forks: 11

deepgram-starters/django-transcription
Get started using Deepgram's PreRecorded Transcription with this Django demo app
Language: Python - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 3

iib7rii/event-management-agent
AI event management agent that schedules and manages events using natural language. Built with Python, LangChain, and OpenAI GPT. 🌟📅
Language: Python - Size: 284 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-devs/flask-agent-function-calling-demo
Function calling with Deepgram's Voice Agent API using Python Flask
Language: Python - Size: 135 KB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 13 - Forks: 17

deepgram-devs/node-live-example
A simple express server setup for live audio transcriptions using Deepgram.
Language: JavaScript - Size: 142 KB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 32 - Forks: 21

xRiddin/Real-Time-AI-Voice-Assistant
Real Time AI Voice Assistant using nodejs
Language: JavaScript - Size: 82 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 15

danieladdisonorg/AI-Agent-for-Telephony-voice-bot
AI-powered telephony solution that enables businesses to deploy intelligent voice agents for various use cases such as customer support, appointment scheduling, lead qualification, and information collection.
Language: Python - Size: 198 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 40 - Forks: 6

deepgram-devs/livestream-audio-notebook
A Python notebook that walks you through how to transcribe live-streamed audio into text using the Deepgram API.
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 8

harsha-yuvaraj/Iris-Voice-AI
A voice-to-voice conversational AI built with Django, Deepgram, OpenAI, and Twilio—designed with smart time-wasting capabilities. Live now! Call & Chat at +1 956 952 7270!
Language: Python - Size: 63.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

zehraseren/AIMasteryLab
Advanced C# AI integrations — 20 real-world projects with multiple AI services. 🤖
Language: C# - Size: 7.95 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Divyansh723/InstructBot
🎙️ InstructBot – A voice-controlled AI assistant built with Flask. Listens to your commands, decides if they’re system tasks or questions, executes or responds intelligently, and speaks back in real-time.
Language: Python - Size: 39.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ebowwa-archive/LLM_telecenter 📦
A fastapi wrapper of babca / python-gsmmodem for a waveshare sim7600x. Not an exact copy of the 'python-gsmmodem' so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries
Language: Python - Size: 179 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

itsRares/react-native-deepgram
Brings Deepgram's capabilities to React Native applications, with a focus on performance and ease of use.
Language: TypeScript - Size: 2.12 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

UltiRequiem/bun-voice-agent-deepgram Fork of deepgram-starters/node-voice-agent
A simple voice agent using deepgram and bun
Language: JavaScript - Size: 204 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

deepgram-devs/deepgram-deno-sdk 📦
Deno SDK for Deepgram's automated speech recognition APIs
Language: TypeScript - Size: 56.6 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

deepgram-starters/nextjs-text-to-speech
Get started using Deepgram's Text-to-Speech with this Next.js demo app
Language: TypeScript - Size: 347 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 6

S4mpl3r/youtube2blog
Turn any Youtube video into a nice blogpost, using Groq and Deepgram.
Language: Python - Size: 44.9 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3

hardador2/AI-Voice-Agent
AI Voice Agent is a real-time voice interaction system that leverages LiveKit for seamless communication. It integrates Speech-to-Text, a Large Language Model, and Text-to-Speech to deliver an engaging user experience. 🐙🌟
Language: Python - Size: 807 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

deepgram-starters/nextjs-live-transcription
Get started using Deepgram's Live Transcription with this Next.js demo app
Language: TypeScript - Size: 363 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 221 - Forks: 249

prakharbhardwaj/voice-agent-mcp-server
A Model Context Protocol (MCP) server that integrates Twilio Voice, Deepgram AI, and OpenAI to create intelligent voice-based HR automation tools.
Language: JavaScript - Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

digispect-intel/business_voice_agent_frontend
A FastHTML-based frontend for a Business Voice Agent, an AI assistant for a business website. This frontend provides a user interface for interacting with business_voice_agent_backend.
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

digispect-intel/business_voice_agent_backend
A voice-enabled AI assistant backend for a business website. This backend powers business_voice_agent_frontend, providing real-time voice interaction capabilities using Restack AI.
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

deepgram-starters/flask-voice-agent
Get started using Deepgram's Voice Agent with this Flask demo app
Language: Python - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

panubhav2001/voice_agent
A real-time voice-enabled assistant that understands speech, verifies identity, handles booking-related queries, and speaks natural responses using AI—ideal for modern customer service scenarios.
Language: Python - Size: 1.53 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Nico2603/MarIA
MarIA es un chatbot de salud mental potenciado por IA que, desde una web en Next.js/TypeScript, combina GPT-4, análisis de voz Deepgram y chat en vivo (LiveKit) para ofrecer apoyo emocional y técnicas de relajación con total privacidad y seguridad.
Language: TypeScript - Size: 37.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

danieladdisonorg/AI-Voice-Assitant
An advanced AI-powered voice assistant that combines speech-to-text and text-to-speech capabilities with intelligent tool integration for seamless digital interactions.
Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 27 - Forks: 6

ARK018/multi-voice-sdk
A universal Text-to-Speech (TTS) SDK . Easily generate and manage audio content with a unified API.
Language: JavaScript - Size: 57.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hosuaby/Transcriptionist
Tool to transcribe videos using AI.
Language: TypeScript - Size: 256 KB - Last synced at: about 18 hours ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

ZanSara/real-life-subtitles
Language: HTML - Size: 3.25 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

saurabhpandey33301/Snip_Ai
A SaaS application that generates stunning short videos in one click — powered by AI-written scripts (Gemini & ChatGPT APIs), dynamic captions, realistic voiceovers (Deepgram & Vapi SDK), and visually engaging animations (Remotion). Orchestrated with Inngest for seamless background processing and smooth user experience.
Language: JavaScript - Size: 43 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

deepgram-devs/video-chat
Sample app to display live captioning to a WebRTC video session with the Deepgram API.
Language: JavaScript - Size: 392 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 37 - Forks: 14

deepgram/gnosis
Gnosis is a lightweight proxy for chat completions and Deepgram's voice agent, injecting in-depth knowledge into knowledge using RAG and function calling
Language: Python - Size: 2.48 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Just-Moh-it/Spity-Sense 📦
Next-JS interface for 🤖 Open-AI based 🕷 spider-man conversation simulator ⚡️
Language: JavaScript - Size: 1.86 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 5

mrkkvnsndvl/kalma-copilot-extension
Kalma Copilot is a Chrome extension that provides real-time AI-powered assistance during online job interviews on platforms like Google Meet, Zoom, and Microsoft Teams. It offers features such as real-time audio capture, interview setup, and a draggable/minimizable interface to help users navigate their virtual interviews with confidence.
Language: JavaScript - Size: 472 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

videosdk-community/ai-agent
Build realtime AI interviewer voice agent that joins meetings. It demonstrates integrating Deepgram (STT), OpenAI (LLM), and Eleven Labs (TTS) via WebRTC for natural conversations.
Language: Python - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Vince-0/AI-Voice-Connector
Connect VOIP SIP calls to a conversational AI
Size: 208 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

spandan114/AI-realtime-voice-agent
A Python-based real-time voice-to-voice conversation system that lets you have natural conversations with very low latency, plug & play multiple llm based on your requirement.
Language: Python - Size: 2.93 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 3

AlexandreSajus/JARVIS
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
Language: Python - Size: 1.16 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 495 - Forks: 91

deepgram-starters/flask-live-text-to-speech
Get started using Deepgram's Live Text-to-Speech with this Flask demo app
Language: Python - Size: 123 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

deepgram-starters/go-live-text-to-speech
Get started using Deepgram's Live Text-to-Speech with this Go demo app
Language: Go - Size: 125 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

deepgram-starters/flask-live-chatgpt-text-to-speech
Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app
Language: Python - Size: 123 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 1

Dhravya/discord-voice-transcript-for-teams
A simple discord bot that listens to voice channel and generates a transcript, then assigns tasks and summarises the conversation
Language: Python - Size: 563 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 3

nickytonline/deepgram-speech-to-text-stream
Bekah Hawrot Weigel joins Nick to show how you can transcribe text using Deepgram's Node.js SDK. They go through the demo code all the way to building out an app with Express that allows you to submit a URL for transcription.
Language: JavaScript - Size: 185 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

deepgram-starters/node-voice-agent
Get started using Deepgram's Voice Agent with this Node demo app
Language: HTML - Size: 77.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 3

Agentic-Insights/voice-bot
AI Agent for Telephony voice bot - based on vocode, twilio, deepgram, and elevenlabs. Just add your own keys and prompt.
Language: Python - Size: 379 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 8

ryanlevee/medication-reminder-system
Voice-driven, Node.js-based medication reminder system utilizing real-time communication technologies, along with Text-to-Speech (TTS), Speech-to-Text (STT), and a Large Language Model (LLM).
Language: JavaScript - Size: 1.13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

Victoran0/chat-pal
Experience seamless, real-time speech-to-speech conversations with an AI assistant. Engage in natural dialogues, ask questions, and receive instant spoken responses, creating a truly immersive and interactive experience.
Language: TypeScript - Size: 1.01 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

OoLunar/HarmonyInSilence 📦
Harmony in Silence: A Speech-to-Text Empowerment Initiative for the Hard of Hearing Community.
Language: C# - Size: 1.06 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

navjotdhanawat/py-ai-voice-agent
PipeCat Voice Agent is an AI-powered voice communication system that enables intelligent, real-time phone conversations through WebSocket connections. It combines multiple technologies including speech recognition (Deepgram), natural language processing (GPT-4), tts (Cartesia), and Telephony (Plivo) to create seamless voice inte
Language: Python - Size: 9.04 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 1

brunogaliati/speech2text-investments
This project automates the download, transcription, and summarization of audio from YouTube videos. Using Deepgram Nova and GPT 4o models, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.
Language: Python - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

deepgram-starters/go-text-to-speech
Get started using Deepgram's Text-to-Speech with this Go demo app
Language: Go - Size: 146 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

sinanuozdemir/oreilly-multimodal-ai
Learn how multimodal AI merges text, image, and audio for smarter models
Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 23 - Forks: 8

deepgram-devs/prerecorded-audio-notebook
A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.
Language: Jupyter Notebook - Size: 644 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

smartdev00/Touch-Base
Record the voice call and extract the name, summary, and follow-up date. Then, save this information to Firebase.
Language: TypeScript - Size: 86.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

pankaj-raikar/AI-Subtitler
AI-Subtitler is a web app using OpenAI and Deepgram AI to automatically generate subtitles for videos in multiple languages, featuring a user dashboard and SRT output.
Language: TypeScript - Size: 308 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

craigsdennis/genai-phone-call
WIP exploration using Twilio Media Streams and Generative AI
Language: JavaScript - Size: 21.5 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 13

spark-engine-ai/ALICE
A voice AI named ALICE (Audio Language Interface and Communication Engine) which uses Deepgram, Groq and Neets APIs
Language: TypeScript - Size: 10.6 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

maihoangbichtram/virtual-voice-agent Fork of dsa/multi-agent-meeting
Multimodal Virtual General Practitioner Voice Agent
Language: Python - Size: 208 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

deepgram/deepgram-js-captions
This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Language: TypeScript - Size: 206 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 2

kaloprojects/KALO-ESP32-Voice-Assistant
Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.
Language: C++ - Size: 287 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 28 - Forks: 8
