An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: deepgram

bolna-ai/bolna

Conversational voice AI agents

Language: Python - Size: 32 MB - Last synced at: about 8 hours ago - Pushed at: 1 day ago - Stars: 379 - Forks: 152

agentvoiceresponse/avr-sts-deepgram

This repository showcases the integration between Agent Voice Response and Deepgram's Speech-to-Speech API. The application leverages Deepgram's powerful speech processing capabilities to provide intelligent, context-aware responses in real-time audio format.

Language: JavaScript - Size: 29.3 KB - Last synced at: about 13 hours ago - Pushed at: about 15 hours ago - Stars: 0 - Forks: 1

autoshow/autoshow

End-to-end workflow to automatically generate show notes from audio/video transcripts

Language: TypeScript - Size: 5.25 MB - Last synced at: about 11 hours ago - Pushed at: 4 months ago - Stars: 89 - Forks: 10

kaymen99/AI-Voice-assistant

AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web searches using simple voice commands

Language: Python - Size: 22.5 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 54 - Forks: 24

voxos-ai/bolna

End-to-end platform for building voice first multimodal agents

Language: Python - Size: 18.4 MB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 422 - Forks: 114

jeffo777/input-right

An open-source AI voice agent platform that turns conversations into 100% accurate, user-verified data via a visual form.

Language: TypeScript - Size: 903 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 18 - Forks: 5

deepgram/deepgram-dotnet-sdk

Official .NET SDK for Deepgram.

Language: C# - Size: 6.82 MB - Last synced at: 7 days ago - Pushed at: 24 days ago - Stars: 44 - Forks: 36

deepgram/deepgram-go-sdk

Official Go SDK for Deepgram.

Language: Go - Size: 8.54 MB - Last synced at: 4 days ago - Pushed at: 24 days ago - Stars: 57 - Forks: 39

RodneyFinkel/groq_deepgram_agent

STT-LLM-TTS (websockets/asynchronous) agent using Deepgram and Groq LPUs, BERT for vector embeddings, and Chroma for persistent vector DB storage and similarity search for RAG context management

Language: Python - Size: 13 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

wiljansantiago/livekit-voice-agent

A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration.

Language: Python - Size: 9.77 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

deepgram-devs/deepgram-ai-agent-demo (fork of deepgram-starters/nextjs-live-transcription)

Deepgram Conversational AI demo

Language: TypeScript - Size: 10.3 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 396 - Forks: 114

deepgram/deepgram-api-specs

Deepgram's API Specs

Language: JavaScript - Size: 309 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 2

Spac5y/Vocal-Agent

A cutting-edge cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.

Language: Python - Size: 433 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 6 - Forks: 2

spences10/audiomind

An MP3 to AI Chat Assistant - A configurable AI chat assistant that can be customized for your content and use case. Transform audio content into interactive, searchable conversations.

Language: TypeScript - Size: 577 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 2

deepgram/deepgram-python-sdk

Official Python SDK for Deepgram.

Language: Python - Size: 17.1 MB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 339 - Forks: 94

deepgram-devs/deepgram-voice-agent-demo

Deepgram Voice Agent Demo

Language: TypeScript - Size: 712 KB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 41 - Forks: 38

wiljansantiago/Youtube-video-analyzer

A sophisticated Node.js application that analyzes YouTube videos for legal compliance. It transcribes the audio content of the videos using the Deepgram API and then compares it against predefined legal rules using the GPT-4 language model.

Language: TypeScript - Size: 105 MB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 41 - Forks: 25

deepgram-starters/django-voice-agent

Get started using Deepgram's Voice Agent with this Django demo app

Language: Python - Size: 45.9 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 1

deepgram-starters/node-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this Node demo app

Language: JavaScript - Size: 134 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 3

Yunichi/livekit-voice-ai-agent-setup

The "livekit-voice-ai-agent-setup" repository provides a comprehensive guide and resources for setting up a voice-enabled AI agent using the LiveKit platform. It includes step-by-step instructions, code samples, and documentation to help users quickly deploy their own voice AI agent solutions.

Size: 1.95 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 10 - Forks: 1

sj0n/heepno

A CLI program to transcribe audio files using Deepgram, OpenAI, and AssemblyAI models.

Language: Go - Size: 62.5 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

deepgram-devs/deepgram-twilio-streaming-python

A demo of Deepgram & Twilio that allows multiple client subscribers to watch live transcripts from ongoing Twilio calls.

Language: Python - Size: 9.77 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 9

kaloprojects/KALO-ESP32-Voice-Chat-AI-Friends

ESP32-based voice device for chatting with multiple custom AI bots. Records questions with an I2S microphone, transcribes them via ElevenLabs or Deepgram STT, and creates responses with a Groq or OpenAI LLM. TTS audio output with custom AI voices via I2S and a speaker. Supports ongoing dialogues, calling bots ‘by name’, and real-time web search via keyword.

Language: C++ - Size: 14.4 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 19 - Forks: 0

deepgram-devs/sts-twilio

Enable calls made to your Twilio phone number to pass through to Deepgram's Voice Agent API, enabling the caller to talk to a voice agent/bot.

Language: Python - Size: 51.8 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 8

bitcointranscripts/tstbtc

This CLI app transcribes audio and video for submission to the bitcointranscripts repo

Language: Python - Size: 6.38 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 9

deepgram/deepgram-js-sdk

Official JavaScript SDK for Deepgram.

Language: TypeScript - Size: 24.2 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 214 - Forks: 74

deepgram-starters/java-transcription

Get started using Deepgram's Transcription with this Java demo app

Language: Java - Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

AppajiDheeraj/AURA

Aura is a multimodal, voice-first AI assistant designed to be your personal command center. It provides an intelligent, responsive interface to control your computer, access real-time information, and even understand the world through visual input.

Language: TypeScript - Size: 353 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Nakul2401/AI_Excel_Mock_Interviewer

An AI-agent-powered Excel interview platform that streamlines the technical evaluation of a candidate’s Excel proficiency through a voice-enabled, agentic interview experience. It automates question generation, real-time interviewing, answer evaluation, and feedback report generation, reducing the time and bias of manual screenings.

Language: JavaScript - Size: 425 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

faraaz-baig/spill-with-voice

A simple, open-source mac app for freewriting and more

Language: Swift - Size: 72.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AppajiDheeraj/Kairos

Kairos is an AI-powered companion designed for mindful conversation and self-reflection. It provides an empathetic, non-judgmental space for users to express their feelings, leveraging a sophisticated backend agent and a sleek, modern frontend.

Language: TypeScript - Size: 1.58 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

deepgram/deepgram-rust-sdk

Community Rust SDK for Deepgram.

Language: Rust - Size: 2.43 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 54 - Forks: 32

danieladdisonorg/livekit-voice-agent

A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration. It provides intelligent turn detection, function calling, comprehensive logging, telephony integration, and audio enhancement.

Language: Python - Size: 18.6 KB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 46 - Forks: 21

deepgram-starters/csharp-transcription

Get started using Deepgram's Transcription with this C# demo app

Language: C# - Size: 984 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

deepgram-starters/csharp-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this C# demo app

Language: C# - Size: 133 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/php-transcription

Get started using Deepgram's Transcription with this PHP demo app

Language: PHP - Size: 288 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

deepgram-starters/go-transcription

Get started using Deepgram's PreRecorded Transcription with this Go demo app

Language: Go - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

deepgram-starters/go-live-transcription

Get started using Deepgram's Live Transcription with this Go demo app

Language: Go - Size: 79.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 6

deepgram-starters/csharp-voice-agent

Get started using Deepgram's Voice Agent with this C# demo app

Language: C# - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/sinatra-transcription

Get started using Deepgram's PreRecorded Transcription with this Sinatra demo app

Language: Ruby - Size: 69.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-starters/flask-transcription

Get started using Deepgram's Pre-Recorded Transcription with this Flask demo app

Language: Python - Size: 3.89 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 15 - Forks: 11

deepgram-starters/django-transcription

Get started using Deepgram's PreRecorded Transcription with this Django demo app

Language: Python - Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 3

iib7rii/event-management-agent

AI event management agent that schedules and manages events using natural language. Built with Python, LangChain, and OpenAI GPT. 🌟📅

Language: Python - Size: 284 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deepgram-devs/flask-agent-function-calling-demo

Function calling with Deepgram's Voice Agent API using Python Flask

Language: Python - Size: 135 KB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 13 - Forks: 17

deepgram-devs/node-live-example

A simple express server setup for live audio transcriptions using Deepgram.

Language: JavaScript - Size: 142 KB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 32 - Forks: 21

xRiddin/Real-Time-AI-Voice-Assistant

Real-time AI voice assistant using Node.js

Language: JavaScript - Size: 82 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 15

danieladdisonorg/AI-Agent-for-Telephony-voice-bot

AI-powered telephony solution that enables businesses to deploy intelligent voice agents for various use cases such as customer support, appointment scheduling, lead qualification, and information collection.

Language: Python - Size: 198 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 40 - Forks: 6

deepgram-devs/livestream-audio-notebook

A Python notebook that walks you through how to transcribe live-streamed audio into text using the Deepgram API.

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 8

harsha-yuvaraj/Iris-Voice-AI

A voice-to-voice conversational AI built with Django, Deepgram, OpenAI, and Twilio—designed with smart time-wasting capabilities. Live now! Call & Chat at +1 956 952 7270!

Language: Python - Size: 63.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

zehraseren/AIMasteryLab

Advanced C# AI integrations — 20 real-world projects with multiple AI services. 🤖

Language: C# - Size: 7.95 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Divyansh723/InstructBot

🎙️ InstructBot – A voice-controlled AI assistant built with Flask. Listens to your commands, decides if they’re system tasks or questions, executes or responds intelligently, and speaks back in real-time.

Language: Python - Size: 39.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ebowwa-archive/LLM_telecenter 📦

A FastAPI wrapper of babca/python-gsmmodem for a Waveshare SIM7600X. Not an exact copy of 'python-gsmmodem', so be sure to uninstall that library (or use a separate venv) before running | Open-source Twilio with LLM batteries

Language: Python - Size: 179 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

itsRares/react-native-deepgram

Brings Deepgram's capabilities to React Native applications, with a focus on performance and ease of use.

Language: TypeScript - Size: 2.12 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

UltiRequiem/bun-voice-agent-deepgram (fork of deepgram-starters/node-voice-agent)

A simple voice agent using Deepgram and Bun

Language: JavaScript - Size: 204 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

deepgram-devs/deepgram-deno-sdk 📦

Deno SDK for Deepgram's automated speech recognition APIs

Language: TypeScript - Size: 56.6 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

deepgram-starters/nextjs-text-to-speech

Get started using Deepgram's Text-to-Speech with this Next.js demo app

Language: TypeScript - Size: 347 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 6

S4mpl3r/youtube2blog

Turn any YouTube video into a nice blog post, using Groq and Deepgram.

Language: Python - Size: 44.9 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3

hardador2/AI-Voice-Agent

AI Voice Agent is a real-time voice interaction system that leverages LiveKit for seamless communication. It integrates Speech-to-Text, a Large Language Model, and Text-to-Speech to deliver an engaging user experience. 🐙🌟

Language: Python - Size: 807 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

deepgram-starters/nextjs-live-transcription

Get started using Deepgram's Live Transcription with this Next.js demo app

Language: TypeScript - Size: 363 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 221 - Forks: 249

prakharbhardwaj/voice-agent-mcp-server

A Model Context Protocol (MCP) server that integrates Twilio Voice, Deepgram AI, and OpenAI to create intelligent voice-based HR automation tools.

Language: JavaScript - Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

digispect-intel/business_voice_agent_frontend

A FastHTML-based frontend for a Business Voice Agent, an AI assistant for a business website. This frontend provides a user interface for interacting with business_voice_agent_backend.

Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

digispect-intel/business_voice_agent_backend

A voice-enabled AI assistant backend for a business website. This backend powers business_voice_agent_frontend, providing real-time voice interaction capabilities using Restack AI.

Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

deepgram-starters/flask-voice-agent

Get started using Deepgram's Voice Agent with this Flask demo app

Language: Python - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

panubhav2001/voice_agent

A real-time voice-enabled assistant that understands speech, verifies identity, handles booking-related queries, and speaks natural responses using AI—ideal for modern customer service scenarios.

Language: Python - Size: 1.53 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Nico2603/MarIA

MarIA is an AI-powered mental health chatbot that, from a Next.js/TypeScript web app, combines GPT-4, Deepgram voice analysis, and live chat (LiveKit) to offer emotional support and relaxation techniques with full privacy and security.

Language: TypeScript - Size: 37.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

danieladdisonorg/AI-Voice-Assitant

An advanced AI-powered voice assistant that combines speech-to-text and text-to-speech capabilities with intelligent tool integration for seamless digital interactions.

Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 27 - Forks: 6

ARK018/multi-voice-sdk

A universal Text-to-Speech (TTS) SDK. Easily generate and manage audio content with a unified API.

Language: JavaScript - Size: 57.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hosuaby/Transcriptionist

Tool to transcribe videos using AI.

Language: TypeScript - Size: 256 KB - Last synced at: about 18 hours ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

ZanSara/real-life-subtitles

Language: HTML - Size: 3.25 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

saurabhpandey33301/Snip_Ai

A SaaS application that generates stunning short videos in one click — powered by AI-written scripts (Gemini & ChatGPT APIs), dynamic captions, realistic voiceovers (Deepgram & Vapi SDK), and visually engaging animations (Remotion). Orchestrated with Inngest for seamless background processing and smooth user experience.

Language: JavaScript - Size: 43 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

deepgram-devs/video-chat

Sample app to display live captioning to a WebRTC video session with the Deepgram API.

Language: JavaScript - Size: 392 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 37 - Forks: 14

deepgram/gnosis

Gnosis is a lightweight proxy for chat completions and Deepgram's voice agent, injecting in-depth knowledge using RAG and function calling

Language: Python - Size: 2.48 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Just-Moh-it/Spity-Sense 📦

Next-JS interface for 🤖 Open-AI based 🕷 spider-man conversation simulator ⚡️

Language: JavaScript - Size: 1.86 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 5

mrkkvnsndvl/kalma-copilot-extension

Kalma Copilot is a Chrome extension that provides real-time AI-powered assistance during online job interviews on platforms like Google Meet, Zoom, and Microsoft Teams. It offers features such as real-time audio capture, interview setup, and a draggable/minimizable interface to help users navigate their virtual interviews with confidence.

Language: JavaScript - Size: 472 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

videosdk-community/ai-agent

Build a real-time AI interviewer voice agent that joins meetings. It demonstrates integrating Deepgram (STT), OpenAI (LLM), and ElevenLabs (TTS) via WebRTC for natural conversations.

Language: Python - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Vince-0/AI-Voice-Connector

Connect VOIP SIP calls to a conversational AI

Size: 208 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

spandan114/AI-realtime-voice-agent

A Python-based real-time voice-to-voice conversation system that lets you have natural conversations with very low latency and plug in multiple LLMs based on your requirements.

Language: Python - Size: 2.93 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 3

AlexandreSajus/JARVIS

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

Language: Python - Size: 1.16 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 495 - Forks: 91

deepgram-starters/flask-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this Flask demo app

Language: Python - Size: 123 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

deepgram-starters/go-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this Go demo app

Language: Go - Size: 125 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

deepgram-starters/flask-live-chatgpt-text-to-speech

Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app

Language: Python - Size: 123 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 1

Dhravya/discord-voice-transcript-for-teams

A simple Discord bot that listens to a voice channel and generates a transcript, then assigns tasks and summarises the conversation

Language: Python - Size: 563 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 3

nickytonline/deepgram-speech-to-text-stream

Bekah Hawrot Weigel joins Nick to show how you can transcribe text using Deepgram's Node.js SDK. They go through the demo code all the way to building out an app with Express that allows you to submit a URL for transcription.

Language: JavaScript - Size: 185 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

deepgram-starters/node-voice-agent

Get started using Deepgram's Voice Agent with this Node demo app

Language: HTML - Size: 77.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 3

Agentic-Insights/voice-bot

AI Agent for Telephony voice bot - based on vocode, twilio, deepgram, and elevenlabs. Just add your own keys and prompt.

Language: Python - Size: 379 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 8

ryanlevee/medication-reminder-system

Voice-driven, Node.js-based medication reminder system utilizing real-time communication technologies, along with Text-to-Speech (TTS), Speech-to-Text (STT), and a Large Language Model (LLM).

Language: JavaScript - Size: 1.13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

Victoran0/chat-pal

Experience seamless, real-time speech-to-speech conversations with an AI assistant. Engage in natural dialogues, ask questions, and receive instant spoken responses, creating a truly immersive and interactive experience.

Language: TypeScript - Size: 1.01 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

OoLunar/HarmonyInSilence 📦

Harmony in Silence: A Speech-to-Text Empowerment Initiative for the Hard of Hearing Community.

Language: C# - Size: 1.06 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

navjotdhanawat/py-ai-voice-agent

PipeCat Voice Agent is an AI-powered voice communication system that enables intelligent, real-time phone conversations through WebSocket connections. It combines multiple technologies including speech recognition (Deepgram), natural language processing (GPT-4), tts (Cartesia), and Telephony (Plivo) to create seamless voice inte

Language: Python - Size: 9.04 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 1

brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using Deepgram Nova and GPT 4o models, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

Language: Python - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

deepgram-starters/go-text-to-speech

Get started using Deepgram's Text-to-Speech with this Go demo app

Language: Go - Size: 146 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

sinanuozdemir/oreilly-multimodal-ai

Learn how multimodal AI merges text, image, and audio for smarter models

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 23 - Forks: 8

deepgram-devs/prerecorded-audio-notebook

A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.

Language: Jupyter Notebook - Size: 644 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

smartdev00/Touch-Base

Record the voice call and extract the name, summary, and follow-up date. Then, save this information to Firebase.

Language: TypeScript - Size: 86.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

pankaj-raikar/AI-Subtitler

AI-Subtitler is a web app using OpenAI and Deepgram AI to automatically generate subtitles for videos in multiple languages, featuring a user dashboard and SRT output.

Language: TypeScript - Size: 308 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

craigsdennis/genai-phone-call

WIP exploration using Twilio Media Streams and Generative AI

Language: JavaScript - Size: 21.5 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 13

spark-engine-ai/ALICE

A voice AI named ALICE (Audio Language Interface and Communication Engine) which uses Deepgram, Groq and Neets APIs

Language: TypeScript - Size: 10.6 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

maihoangbichtram/virtual-voice-agent (fork of dsa/multi-agent-meeting)

Multimodal Virtual General Practitioner Voice Agent

Language: Python - Size: 208 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

deepgram/deepgram-js-captions

This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

Language: TypeScript - Size: 206 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 2

kaloprojects/KALO-ESP32-Voice-Assistant

Code snippets showing how to record I2S audio and store it as a .wav file on an ESP32 with an SD card, how to transcribe pre-recorded audio via the Deepgram Speech-to-Text (STT) API, and how to generate audio from text via Text-to-Speech (TTS) APIs from OpenAI, SpeechGen, and/or Google TTS. Triggering ESP32 actions via voice.

Language: C++ - Size: 287 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 28 - Forks: 8