An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: g2p

thewh1teagle/phonikud

Hebrew grapheme to phoneme (g2p)

Language: Python - Size: 1.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16 - Forks: 0

open-dict-data/ipa-dict

Monolingual wordlists with pronunciation information in IPA

Size: 42.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 620 - Forks: 94

Kyubyong/g2pK

g2pK: g2p module for Korean

Language: Python - Size: 60.5 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 251 - Forks: 45

humair-m/zuhri

Language: Jupyter Notebook - Size: 271 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

MahtaFetrat/Persian-G2P-Tools-Benchmark

Benchmarking notebooks for various Persian G2P models, comparing their performance on the SentenceBench dataset, including Homo-GE2PE and Homo-T5.

Language: Jupyter Notebook - Size: 229 KB - Last synced at: 5 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

tenebo/g2pk2 Fork of harmlessman/g2pkk

Updated folk of g2pk

Language: Python - Size: 66.4 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 3

MahtaFetrat/Homo-GE2PE-Persian

A Persian grapheme-to-phoneme (G2P) model designed for homograph disambiguation, fine-tuned using the HomoRich dataset to improve pronunciation accuracy.

Language: Jupyter Notebook - Size: 213 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 4 - Forks: 0

xinjli/transphone

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Language: Python - Size: 342 KB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 161 - Forks: 16

MahtaFetrat/HomoRich-G2P-Persian

HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.

Language: Jupyter Notebook - Size: 49.3 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

CUNY-CL/wikipron

Massively multilingual pronunciation mining

Language: Python - Size: 172 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 340 - Forks: 73

v-nhandt21/Viphoneme

Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA

Language: Python - Size: 1.06 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 86 - Forks: 18

alphacep/awesome-russian-speech

Russian speech technology links

Size: 134 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 296 - Forks: 21

tachi-hi/jamorasep

A module to separate Japanese kana (hiragana and katakana) text into a list of mora.

Language: Python - Size: 29.3 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 9 - Forks: 0

Kyubyong/g2p

g2p: English Grapheme To Phoneme Conversion

Language: Python - Size: 7.12 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 850 - Forks: 129

cmusphinx/g2p-seq2seq

G2P with Tensorflow

Language: Python - Size: 866 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 674 - Forks: 192

seanghay/awesome-khmer-language

A large collection of Khmer language resources. Khmer is a language used by Cambodia.

Language: Python - Size: 5.38 MB - Last synced at: 20 days ago - Pushed at: 25 days ago - Stars: 113 - Forks: 24

GitYCC/g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Language: Python - Size: 347 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 329 - Forks: 40

hamanlp/hama

🦛 Hangul Morphological Analyzer

Language: Zig - Size: 2.66 MB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 1

ArseniiBuhaiev/phonetics-lab-UA

A Python package and a desktop app designed to automatically generate phonetic and phonematic transcription of text in Ukrainian

Language: Python - Size: 683 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

Mobile-Artificial-Intelligence/babylon.cpp

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

Language: Python - Size: 422 MB - Last synced at: 25 days ago - Pushed at: 9 months ago - Stars: 19 - Forks: 3

p1an-lin-jung/teochew-g2p

这是一个潮州话文本端的处理工具和正字标准,主要为潮州方言的语音合成服务

Language: Python - Size: 924 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

ExpressiveLabs/deepphonemizer-rs

Pure Rust implementation of the DeepPhonemizer G2P model.

Language: Rust - Size: 132 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 2

neurlang/goruut

IPA Phonemizer/Dephonemizer for 139 human languages

Language: Go - Size: 406 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 22 - Forks: 2

lastleon/phonetisaurus-g2p-rs

Using Phonetisaurus models for quick phonemization in Rust.

Language: Rust - Size: 7.81 KB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

spring-media/DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Language: Python - Size: 1.34 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 381 - Forks: 45

seanghay/automatic-phonemic-and-phonetic-transcription

A mirror from https://gitlab.com/mkrlab/automatic-phonemic-and-phonetic-transcription by @MakaraSok

Language: Ruby - Size: 191 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

CiscoDevNet/g2p_seq2seq_pytorch

Grapheme to phoneme model for PyTorch

Language: Python - Size: 1.48 MB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 41 - Forks: 11

bookbot-kids/g2p_id

g2p ID: Indonesian Grapheme-to-Phoneme Converter

Language: Python - Size: 7.47 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 20 - Forks: 9

seanghay/khmerpronounce

Khmer Pronounciation Toolkit

Language: Python - Size: 5.01 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

CyboBrown/Cebuano-G2P

A rule-based grapheme-to-phoneme conversion system for Cebuano with stress prediction and dictionary lookup.

Language: Python - Size: 901 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

MahtaFetrat/LLM-Powered-G2P

Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.

Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

ftyers/commonvoice-utils

Linguistic processing for Common Voice

Language: Python - Size: 445 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 17

wannaphong/thai-grapheme-to-phoneme

Thai Grapheme-to-Phoneme (Thai G2P)

Language: Python - Size: 337 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 3

Kyubyong/g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Language: Python - Size: 21.8 MB - Last synced at: 12 days ago - Pushed at: almost 6 years ago - Stars: 240 - Forks: 31

Wikidepia/g2p-id

Indonesian Grapheme-to-Phoneme (IPA notation)

Language: Python - Size: 8.7 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 11

LGirrbach/EM-G2P-Aligner

Python implementation of the many-to-many aligner proposed by Jiampojamarn et al. (2007): Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion

Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rickynags/LLM-Powered-G2P

Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

svedev0/g2p-dotnet

🔠 A grapheme to phoneme (G2P) tool for phonemicizing text for Mel spectrogram generation

Language: C# - Size: 860 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

seanghay/sosap

🗣️ sosap(សូរសព្ទ) Python binding for Phonetisaurus

Language: C++ - Size: 2.36 MB - Last synced at: 27 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 2

PasaOpasen/PersianG2P Fork of AzamRabiee/Persian_G2P

Persian Grapheme-to-Phoneme (G2P) converter

Language: Python - Size: 28.8 MB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 20 - Forks: 2

ionite34/Aquila-Resolve

Augmented Recurrent Neural Grapheme-to-Phoneme conversion with Inflectional Orthography.

Language: Python - Size: 1.95 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

ionite34/h2p-parser

Heteronym to Phoneme Parser

Language: Python - Size: 1.9 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 5

seanghay/phonetisaurus-js

Grapheme to Phoneme on the Web powered by WebAssembly.

Language: C++ - Size: 3.29 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

vlomme/Multi-Tacotron-Voice-Cloning 📦

Phoneme multilingual(Russian-English) voice cloning based on

Language: Python - Size: 985 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 390 - Forks: 96

NikiPshg/Grapheme-to-Phoneme-G2P-with-Stress

G2P_en_lex

Language: Python - Size: 42.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

traderpedroso/xphoneBR

XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and normalization tool modeling library that leverages recent deep learning technology and is optimized for usage in production systems such as TTS. In particular, the library should be accurate, fast, easy to use

Language: Python - Size: 33.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ye-kyaw-thu/myG2P

Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).

Language: Perl - Size: 6.58 MB - Last synced at: 10 months ago - Pushed at: about 4 years ago - Stars: 52 - Forks: 9

AdolfVonKleist/Phonetisaurus

Phonetisaurus G2P

Language: Shell - Size: 2.24 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 440 - Forks: 122

bookbot-hive/lexikos

Lexikos - λεξικός /lek.si.kós/ - A collection of pronunciation dictionaries and neural grapheme-to-phoneme models.

Language: Jupyter Notebook - Size: 42 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0

egorsmkv/g2p-uk

SHA-RNN Grapheme-to-Phoneme for Ukrainian

Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

uiuc-sst/g2ps

Data and code for grapheme-to-phoneme transducers in lots of languages

Language: HTML - Size: 287 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 109 - Forks: 20

kotlinguistics/IPA-Transcribers

Convert native orthographies to the International Phonetic Alphabet

Language: Kotlin - Size: 1.04 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 2

fquirin/kaldi-adapt-lm Fork of gooofy/kaldi-adapt-lm

Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech

Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 2

mohamad-hasan-sohan-ajini/G2P

Grapheme To Phoneme

Language: Python - Size: 7.72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 16

jcsilva/multilingual-g2p

Multilingual Grapheme to Phoneme

Language: Shell - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 9 years ago - Stars: 45 - Forks: 5

mdm-code/prg2p

Grapheme-to-phoneme rule-based converter for Polish in Go.

Language: Go - Size: 51.8 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sajadalipour7/Persian-Grapheme-To-Phoneme-With-Transformer

Persian Grapheme To Phoneme with Transformer in Pytorch

Language: Jupyter Notebook - Size: 208 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

jacksonllee/wikipron Fork of CUNY-CL/wikipron

Scraping grapheme-to-phoneme data from Wiktionary

Language: Python - Size: 74.2 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

uiuc-sst/asr24

24-hour Automatic Speech Recognition

Language: C++ - Size: 962 KB - Last synced at: 27 days ago - Pushed at: almost 4 years ago - Stars: 27 - Forks: 7

harmlessman/g2pkk

This is a cross-platform g2p for Korean.

Language: Python - Size: 41 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 7

juletx/writing-systems

Comparing Writing Systems with Multilingual Grapheme-to-Phoneme and Phoneme-to-Grapheme Conversion

Language: Jupyter Notebook - Size: 68.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

dangvansam/phoneme2grapheme-vietnamese

convert phoneme to grapheme vietnames

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

AsoSoft/Kurdish-G2P-dataset

Datasets for evaluation of Central Kurdish Grapheme-to-Phoneme Conversion systems

Size: 85 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

Bing-su/g2pkiwi Fork of Kyubyong/g2pK

a fork of g2pK, using kiwipiepy

Language: Python - Size: 120 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

grammatek/g2p-thrax

This project provides a grapheme-to-phoneme (g2p) tool based on Thrax-compiled g2p grammars.

Language: C++ - Size: 1.95 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

scarletcho/KoG2P

Korean grapheme-to-phone conversion in Python

Language: Python - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 99 - Forks: 24

lifefeel/Grapheme-to-Phoneme

Grapheme-to-Phoneme(G2P) 관련자료 모음

Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 16 - Forks: 0

newlogic/newlogic-g2p 📦

Newlogic G2P - Social Protection

Language: Python - Size: 6.86 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 0

bhashini-ai/g2p

Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a building block for Indic text-to-speech (TTS) systems

Language: Java - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 5

grammatek/simaromur

Icelandic TTS (text-to-speech) service for Android

Language: Java - Size: 49.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 2

mura4k/transcription

course work

Language: Python - Size: 476 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

peresolb/g2p-no

Grapheme-to-Phoneme models for Norwegian

Language: Python - Size: 15.9 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

cassiotbatista/g2p-decision-trees

Grapheme-to-Phoneme conversion for Brazilian Portuguese Using Decision Trees with Python Scikit Learn

Language: Python - Size: 5.1 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

cadia-lvl/g2p-service Fork of rkjaran/g2p-service

REST wrapper for Sequitur and Fairseq G2P

Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Bangla-Language-Processing/Bangla-pronunciation

Lexicon and machine learning based Bangla pronunciation system development

Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0