Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-translation

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python - Size: 68.4 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 10,329 - Forks: 1,799

hlt-mt/FBK-fairseq

Repository containing the open source code of works published at the FBK MT unit.

Language: Python - Size: 7.44 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 33 - Forks: 0

Dadangdut33/Speech-Translate

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

Language: Python - Size: 14.5 MB - Last synced: 2 days ago - Pushed: 5 months ago - Stars: 404 - Forks: 53

Rongjiehuang/awesome-speech-to-speech-translation

List of direct speech-to-speech translation papers.

Size: 4.88 KB - Last synced: 4 days ago - Pushed: over 1 year ago - Stars: 23 - Forks: 1

echogarden-project/echogarden

Integrated speech toolset designed to be accessible to end-users. Fully open-source.

Language: TypeScript - Size: 1.46 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 75 - Forks: 9

KevKibe/African-Whisper

🚀 Seamlessly fine-tune and deploy Whisper model on a multi-lingual dataset.

Language: Python - Size: 59.3 MB - Last synced: 20 days ago - Pushed: 22 days ago - Stars: 11 - Forks: 2

zhangshaolei1998/Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

Size: 1.91 MB - Last synced: 9 days ago - Pushed: 5 months ago - Stars: 544 - Forks: 6

George0828Zhang/torch_cif

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.

Language: Python - Size: 167 KB - Last synced: 22 days ago - Pushed: 4 months ago - Stars: 29 - Forks: 3

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language: Python - Size: 17.8 MB - Last synced: 23 days ago - Pushed: about 1 month ago - Stars: 1,037 - Forks: 108

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python - Size: 241 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 10,121 - Forks: 2,154

espnet/espnet

End-to-End Speech Processing Toolkit

Language: Python - Size: 920 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,825 - Forks: 2,083

dqqcasia/awesome-speech-translation Fork of ucaslyc/speech_translation-papers

Size: 296 KB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 169 - Forks: 0

mt-upc/ZeroSwot

Pushing the Limits of Zero-shot End-to-End Speech Translation

Language: Python - Size: 4.88 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 14 - Forks: 1

csikasote/bigc

This repository contains the data resources for the LacunaFund supported project, Multimodal datasets for the Bemba Language of Zambia.

Size: 17.4 GB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 5 - Forks: 3

double22a/speech_dataset

The dataset of Speech Recognition

Size: 62.5 KB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 333 - Forks: 66

ksquarekumar/whisper-stream

Whisper Transcription Service

Language: Jupyter Notebook - Size: 6.21 MB - Last synced: 3 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

ictnlp/STEMM

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Language: Python - Size: 3.44 MB - Last synced: 18 days ago - Pushed: 7 months ago - Stars: 34 - Forks: 6

JeffWang0325/Microsoft-Azure-Cognitive-Services

🖍️ This project combines multiple operations in Microsoft Azure Cognitive Services into one GUI, including QnA Maker, LUIS, Computer Vision, Custom Vision, Face, Form Recognizer, Text To Speech, Speech To Text and Speech Translation. It's very user-friendly for users to implement any operation mentioned above.

Language: C# - Size: 17.4 MB - Last synced: 26 days ago - Pushed: over 2 years ago - Stars: 7 - Forks: 6

ictnlp/DASpeech

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

Language: Python - Size: 15.2 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 40 - Forks: 4

mt-upc/SegAugment

SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Language: Python - Size: 122 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

ictnlp/DiSeg

Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"

Language: Python - Size: 1.73 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 35 - Forks: 2

bzhangGo/zero

Zero -- A neural machine translation system

Language: Python - Size: 1.93 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 140 - Forks: 21

kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

Size: 121 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 218 - Forks: 26

ictnlp/BT4ST

Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".

Language: Python - Size: 64.5 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 9 - Forks: 2

xuchennlp/S2T

The project for speech translation

Language: Python - Size: 3.47 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 2 - Forks: 0

ictnlp/CRESS

Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".

Language: Python - Size: 56.6 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 8 - Forks: 0

liamdugan/speech-to-speech

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

Language: Python - Size: 1.68 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 12 - Forks: 3

yousef0sa/Speech-To-Text

Speech-To-Text is a C# desktop app that uses Azure Cognitive Services to convert and translate speech. You can copy or show the text on the screen, and choose the language of the speech or the translation.

Language: C# - Size: 50.8 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

JadenChun/real-time-caption-generator

Real time caption generator using Microsoft Azure speech services

Language: C++ - Size: 86.9 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

ictnlp/ITST

Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"

Language: Python - Size: 1.62 MB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 11 - Forks: 1

yaya-sy/speechscorer

speechscorer: unsupervised spoken utterances scorer

Language: Python - Size: 1.31 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

mt-upc/s2t-perceiver

Efficient Speech Translation with Dynamic Latent Perceivers

Language: Shell - Size: 380 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1

jojijacobk/Translator

A hobby project. Online translator service. This service helps you to translate a text or speech from any languages in the world to any other.

Language: JavaScript - Size: 512 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 1

bzhangGo/st_from_scratch

Revisiting End-to-End Speech-to-Text Translation From Scratch

Language: Python - Size: 652 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 11 - Forks: 2

ReneeYe/ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Language: Python - Size: 3.62 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 38 - Forks: 3

hagarz/Speech-to-text-translator

Speech to text and translation client-server using Google cloud

Language: Python - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

mt-upc/SHAS

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Language: Python - Size: 368 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 24 - Forks: 2

mt-upc/iwslt-2022

Systems submitted to IWSLT 2022 by the MT-UPC group.

Language: Python - Size: 130 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0

VinAIResearch/PhoST

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)

Size: 10.7 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 14 - Forks: 0

ReneeYe/XSTNet

This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)

Language: Python - Size: 988 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 17 - Forks: 3

George0828Zhang/simulst

PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.

Language: Python - Size: 988 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 8 - Forks: 3

tran-khoa/joint-training-cascaded-st

Code for the paper "Does Joint Training Really Help Cascaded Speech Translation?" (EMNLP 2022)

Language: Python - Size: 5.22 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

TuAnh23/MultiModalST

Limit the use of end-to-end data for Speech Translation (by leveraging Automatic Speech Recognition and Machine Translation data instead) using zero-shot multilingual text translation techniques.

Language: Python - Size: 58.9 MB - Last synced: 10 months ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

Related Keywords
speech-translation 43 speech-to-text 15 machine-translation 15 speech-recognition 14 speech 9 speech-synthesis 8 simultaneous-translation 7 asr 5 pytorch 5 translation 5 whisper 5 speech-transcription 4 deep-learning 4 speech-to-speech 4 text-to-speech 4 transformer 4 tts 3 natural-language-processing 3 speech-processing 3 simultaneous-machine-translation 3 streaming 3 text-translation 3 neural-machine-translation 3 spoken-language-processing 3 automatic-speech-recognition 3 nlp 2 speech-to-speech-translation 2 awesome 2 python 2 voice-conversion 2 punctuation-restoration 2 wav2vec2 2 speech-separation 2 speech-enhancement 2 audio-segmentation 2 self-supervised-learning 2 end-to-end-speech-translation 2 speech-alignment 2 formrecognizer 1 pyaudio 1 gui-application 1 cpp 1 artificial-intelligence 1 azure-speech-service 1 dotnet 1 natural-language-generation 1 desktop-application 1 spoken-language-translation 1 azure-cognitive-services 1 c-sharp 1 luis 1 luis-ai 1 microsoft 1 qna-maker 1 qnamaker 1 data-augmentation 1 segment 1 segmentation 1 sequence-segmentation 1 streaming-speech-to-text 1 aan 1 adaptive-feature-selection 1 average-attention-network 1 deep-transformer 1 depth-scaled-initialization 1 fast-bidirectional-decoder 1 l0drop 1 massively-multilingual-translation 1 opus-100 1 python3 1 socket 1 stream-audio 1 tcp-ip 1 adapters 1 fine-tuning 1 pretrained-models 1 benchmark-dataset 1 english 1 english-to-vietnamese 1 phost 1 vietnamese 1 interspeech2021 1 tensorflow2 1 emnlp2022 1 fairseq 1 few-shot 1 multi-modal 1 zero-shot 1 qt-widgets 1 real-time-caption 1 windows-application 1 hubert 1 efficiency 1 perceiver 1 css3 1 flexbox 1 html5 1 translator 1 vanilla-javascript 1 speech-to-text-translation 1