GitHub / kmesiab / go-script-tokenizer
Go-Script-Tokenizer efficiently processes AWS Transcription JSON, converting speech to tokens for LLMs. It handles timestamps, speaker differentiation, and formats data for NLP tasks, streamlining the path from audio to AI-ready text.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kmesiab%2Fgo-script-tokenizer
Stars: 1
Forks: 0
Open issues: 0
License: None
Language: Go
Size: 164 KB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: over 1 year ago
Pushed at: over 1 year ago
Last synced at: 11 months ago