GitHub topics: g2p
thewh1teagle/phonikud
Hebrew grapheme to phoneme (g2p)
Language: Python - Size: 1.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16 - Forks: 0

open-dict-data/ipa-dict
Monolingual wordlists with pronunciation information in IPA
Size: 42.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 620 - Forks: 94

Kyubyong/g2pK
g2pK: g2p module for Korean
Language: Python - Size: 60.5 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 251 - Forks: 45

humair-m/zuhri
Language: Jupyter Notebook - Size: 271 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

MahtaFetrat/Persian-G2P-Tools-Benchmark
Benchmarking notebooks for various Persian G2P models, comparing their performance on the SentenceBench dataset, including Homo-GE2PE and Homo-T5.
Language: Jupyter Notebook - Size: 229 KB - Last synced at: 5 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

tenebo/g2pk2 Fork of harmlessman/g2pkk
Updated fork of g2pK
Language: Python - Size: 66.4 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 3

MahtaFetrat/Homo-GE2PE-Persian
A Persian grapheme-to-phoneme (G2P) model designed for homograph disambiguation, fine-tuned using the HomoRich dataset to improve pronunciation accuracy.
Language: Jupyter Notebook - Size: 213 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 4 - Forks: 0

xinjli/transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
Language: Python - Size: 342 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 161 - Forks: 16

MahtaFetrat/HomoRich-G2P-Persian
HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.
Language: Jupyter Notebook - Size: 49.3 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

CUNY-CL/wikipron
Massively multilingual pronunciation mining
Language: Python - Size: 172 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 340 - Forks: 73

v-nhandt21/Viphoneme
Vi_G2P or ViG2P: G2P package for Vietnamese, based on vPhon and phonological knowledge, that converts raw text (graphemes) to IPA
Language: Python - Size: 1.06 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 86 - Forks: 18

alphacep/awesome-russian-speech
Russian speech technology links
Size: 134 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 296 - Forks: 21

tachi-hi/jamorasep
A module to separate Japanese kana (hiragana and katakana) text into a list of mora.
Language: Python - Size: 29.3 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 9 - Forks: 0

Kyubyong/g2p
g2p: English Grapheme To Phoneme Conversion
Language: Python - Size: 7.12 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 850 - Forks: 129

cmusphinx/g2p-seq2seq
G2P with Tensorflow
Language: Python - Size: 866 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 674 - Forks: 192

seanghay/awesome-khmer-language
A large collection of Khmer language resources. Khmer is the language spoken in Cambodia.
Language: Python - Size: 5.38 MB - Last synced at: 20 days ago - Pushed at: 25 days ago - Stars: 113 - Forks: 24

GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)
Language: Python - Size: 347 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 329 - Forks: 40

hamanlp/hama
🦛 Hangul Morphological Analyzer
Language: Zig - Size: 2.66 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 1

ArseniiBuhaiev/phonetics-lab-UA
A Python package and a desktop app designed to automatically generate phonetic and phonematic transcription of text in Ukrainian
Language: Python - Size: 683 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

Mobile-Artificial-Intelligence/babylon.cpp
Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model, and speech synthesis uses VITS models. Piper models are compatible after running a conversion script.
Language: Python - Size: 422 MB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 19 - Forks: 3

p1an-lin-jung/teochew-g2p
A text-processing tool and orthography standard for Teochew, mainly intended to support speech synthesis for the Teochew dialect
Language: Python - Size: 924 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

ExpressiveLabs/deepphonemizer-rs
Pure Rust implementation of the DeepPhonemizer G2P model.
Language: Rust - Size: 132 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 2

neurlang/goruut
IPA Phonemizer/Dephonemizer for 139 human languages
Language: Go - Size: 406 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 2

lastleon/phonetisaurus-g2p-rs
Using Phonetisaurus models for quick phonemization in Rust.
Language: Rust - Size: 7.81 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

spring-media/DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
Language: Python - Size: 1.34 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 381 - Forks: 45

seanghay/automatic-phonemic-and-phonetic-transcription
A mirror from https://gitlab.com/mkrlab/automatic-phonemic-and-phonetic-transcription by @MakaraSok
Language: Ruby - Size: 191 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

CiscoDevNet/g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
Language: Python - Size: 1.48 MB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 41 - Forks: 11

bookbot-kids/g2p_id
g2p ID: Indonesian Grapheme-to-Phoneme Converter
Language: Python - Size: 7.47 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 20 - Forks: 9

seanghay/khmerpronounce
Khmer Pronunciation Toolkit
Language: Python - Size: 5.01 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

CyboBrown/Cebuano-G2P
A rule-based grapheme-to-phoneme conversion system for Cebuano with stress prediction and dictionary lookup.
Language: Python - Size: 901 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

MahtaFetrat/LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

ftyers/commonvoice-utils
Linguistic processing for Common Voice
Language: Python - Size: 445 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 17

wannaphong/thai-grapheme-to-phoneme
Thai Grapheme-to-Phoneme (Thai G2P)
Language: Python - Size: 337 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

Kyubyong/g2pC
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
Language: Python - Size: 21.8 MB - Last synced at: 12 days ago - Pushed at: almost 6 years ago - Stars: 240 - Forks: 31

Wikidepia/g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation)
Language: Python - Size: 8.7 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 11

LGirrbach/EM-G2P-Aligner
Python implementation of the many-to-many aligner proposed by Jiampojamarn et al. (2007): Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion
Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rickynags/LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

svedev0/g2p-dotnet
🔠 A grapheme to phoneme (G2P) tool for phonemicizing text for Mel spectrogram generation
Language: C# - Size: 860 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

seanghay/sosap
🗣️ sosap(សូរសព្ទ) Python binding for Phonetisaurus
Language: C++ - Size: 2.36 MB - Last synced at: 27 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 2

PasaOpasen/PersianG2P Fork of AzamRabiee/Persian_G2P
Persian Grapheme-to-Phoneme (G2P) converter
Language: Python - Size: 28.8 MB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 20 - Forks: 2

ionite34/Aquila-Resolve
Augmented Recurrent Neural Grapheme-to-Phoneme conversion with Inflectional Orthography.
Language: Python - Size: 1.95 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

ionite34/h2p-parser
Heteronym to Phoneme Parser
Language: Python - Size: 1.9 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 5

seanghay/phonetisaurus-js
Grapheme to Phoneme on the Web powered by WebAssembly.
Language: C++ - Size: 3.29 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

vlomme/Multi-Tacotron-Voice-Cloning 📦
Phoneme multilingual (Russian-English) voice cloning based on
Language: Python - Size: 985 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 390 - Forks: 96

NikiPshg/Grapheme-to-Phoneme-G2P-with-Stress
G2P_en_lex
Language: Python - Size: 42.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

traderpedroso/xphoneBR
XphoneBR is a Brazilian Portuguese transformer-based grapheme-to-phoneme and normalization library that leverages recent deep learning technology and is optimized for use in production systems such as TTS. In particular, the library should be accurate, fast, and easy to use
Language: Python - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ye-kyaw-thu/myG2P
Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Language: Perl - Size: 6.58 MB - Last synced at: 10 months ago - Pushed at: about 4 years ago - Stars: 52 - Forks: 9

AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P
Language: Shell - Size: 2.24 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 440 - Forks: 122

bookbot-hive/lexikos
Lexikos - λεξικός /lek.si.kós/ - A collection of pronunciation dictionaries and neural grapheme-to-phoneme models.
Language: Jupyter Notebook - Size: 42 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0

egorsmkv/g2p-uk
SHA-RNN Grapheme-to-Phoneme for Ukrainian
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

uiuc-sst/g2ps
Data and code for grapheme-to-phoneme transducers in lots of languages
Language: HTML - Size: 287 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 109 - Forks: 20

kotlinguistics/IPA-Transcribers
Convert native orthographies to the International Phonetic Alphabet
Language: Kotlin - Size: 1.04 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 2

fquirin/kaldi-adapt-lm Fork of gooofy/kaldi-adapt-lm
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 2

mohamad-hasan-sohan-ajini/G2P
Grapheme To Phoneme
Language: Python - Size: 7.72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 16

jcsilva/multilingual-g2p
Multilingual Grapheme to Phoneme
Language: Shell - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 9 years ago - Stars: 45 - Forks: 5

mdm-code/prg2p
Grapheme-to-phoneme rule-based converter for Polish in Go.
Language: Go - Size: 51.8 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sajadalipour7/Persian-Grapheme-To-Phoneme-With-Transformer
Persian Grapheme To Phoneme with Transformer in Pytorch
Language: Jupyter Notebook - Size: 208 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

jacksonllee/wikipron Fork of CUNY-CL/wikipron
Scraping grapheme-to-phoneme data from Wiktionary
Language: Python - Size: 74.2 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

uiuc-sst/asr24
24-hour Automatic Speech Recognition
Language: C++ - Size: 962 KB - Last synced at: 27 days ago - Pushed at: almost 4 years ago - Stars: 27 - Forks: 7

harmlessman/g2pkk
This is a cross-platform g2p for Korean.
Language: Python - Size: 41 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 7

juletx/writing-systems
Comparing Writing Systems with Multilingual Grapheme-to-Phoneme and Phoneme-to-Grapheme Conversion
Language: Jupyter Notebook - Size: 68.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

dangvansam/phoneme2grapheme-vietnamese
Convert phonemes to graphemes for Vietnamese
Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

AsoSoft/Kurdish-G2P-dataset
Datasets for evaluation of Central Kurdish Grapheme-to-Phoneme Conversion systems
Size: 85 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

Bing-su/g2pkiwi Fork of Kyubyong/g2pK
a fork of g2pK, using kiwipiepy
Language: Python - Size: 120 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

grammatek/g2p-thrax
This project provides a grapheme-to-phoneme (g2p) tool based on Thrax-compiled g2p grammars.
Language: C++ - Size: 1.95 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

scarletcho/KoG2P
Korean grapheme-to-phone conversion in Python
Language: Python - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 99 - Forks: 24

lifefeel/Grapheme-to-Phoneme
A collection of Grapheme-to-Phoneme (G2P) related resources
Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 16 - Forks: 0

newlogic/newlogic-g2p 📦
Newlogic G2P - Social Protection
Language: Python - Size: 6.86 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 0

bhashini-ai/g2p
Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a building block for Indic text-to-speech (TTS) systems
Language: Java - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 5

grammatek/simaromur
Icelandic TTS (text-to-speech) service for Android
Language: Java - Size: 49.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 2

mura4k/transcription
course work
Language: Python - Size: 476 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

peresolb/g2p-no
Grapheme-to-Phoneme models for Norwegian
Language: Python - Size: 15.9 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

cassiotbatista/g2p-decision-trees
Grapheme-to-Phoneme conversion for Brazilian Portuguese Using Decision Trees with Python Scikit Learn
Language: Python - Size: 5.1 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

cadia-lvl/g2p-service Fork of rkjaran/g2p-service
REST wrapper for Sequitur and Fairseq G2P
Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Bangla-Language-Processing/Bangla-pronunciation
Lexicon and machine learning based Bangla pronunciation system development
Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0
