An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-database

mborsdorf/TargetLanguageExtraction

Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Azariagmt/Speech-to-text-data_collection Fork of Morawetz/Speech-to-text-data_collection

Speech-to-text data collection with Kafka, Airflow, and Spark, building a pipeline that can be deployed to process posting and receiving text and audio files from and into a data lake, apply transformation in a distributed manner, and load it into a warehouse in a suitable format to train a speech-to-text model.

Size: 38.5 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0