Topic: "punctuation"
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language: Python - Size: 100 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 10,292 - Forks: 1,036

ottokart/punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
Language: Python - Size: 55.7 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 677 - Forks: 194

notAI-tech/deepsegment 📦
A sentence segmenter that actually works!
Language: Python - Size: 81.1 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 305 - Forks: 56

notAI-tech/fastPunct 📦
Punctuation restoration and spell correction experiments.
Language: Python - Size: 35.2 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 252 - Forks: 39

26hzhang/neural_sequence_labeling
A TensorFlow implementation of a neural sequence labeling model that can tackle sequence labeling tasks such as POS tagging, chunking, NER, and punctuation restoration.
Language: Python - Size: 136 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 234 - Forks: 46

davidmogar/cucco
Text normalization library for Python
Language: Python - Size: 188 KB - Last synced at: 6 days ago - Pushed at: about 7 years ago - Stars: 204 - Forks: 27

yeyupiaoling/PunctuationModel
A Chinese punctuation model that can add punctuation marks to text.
Language: Python - Size: 6.74 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 140 - Forks: 21

bedapudi6788/deepcorrect 📦
Text and Punctuation correction with Deep Learning
Language: Python - Size: 32.2 KB - Last synced at: 9 months ago - Pushed at: about 5 years ago - Stars: 129 - Forks: 33

motazsaad/process-arabic-text
Pre-process Arabic text (remove diacritics, punctuation and repeating characters)
Language: Python - Size: 16.6 KB - Last synced at: 4 days ago - Pushed at: about 8 years ago - Stars: 106 - Forks: 37

LanguageMachines/ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps, such as changing case, all of which you can use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto
Language: C++ - Size: 6.17 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 68 - Forks: 14

kaituoxu/X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation to unpunctuated text.
Language: Python - Size: 158 KB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 62 - Forks: 20

mbejda/Node-OpenNLP
Apache OpenNLP wrapper for Nodejs
Language: JavaScript - Size: 19.7 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 56 - Forks: 17

FerdinandZhong/punctuator
A small seq2seq punctuator tool based on DistilBERT
Language: Python - Size: 119 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 51 - Forks: 8

gleb-skobinsky/ru_punct
A neural network for restoring punctuation in Russian text.
Language: Python - Size: 642 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 18 - Forks: 5

ZarahShibli/Arabic_Punctuation_Prediction
Sequence to sequence model for Arabic punctuation prediction.
Language: Jupyter Notebook - Size: 58.9 MB - Last synced at: 18 days ago - Pushed at: about 5 years ago - Stars: 12 - Forks: 1

rewired-gh/tep
A blazingly fast tool for converting to English punctuation
Language: Rust - Size: 326 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 1

regexhq/punctuation-regex
Regular expression for matching punctuation characters.
Language: JavaScript - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 10 - Forks: 2

ChristophLabacher/fix-punctuation
Regular Expressions for finding wrong punctuation before publishing.
Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: about 8 years ago - Stars: 10 - Forks: 1

ZNClub-PA-ML-AI/Sentiment-analysis-using-Business-News
Sentiment analytics
Language: CSS - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 5

dotland/mnemonic-kb-ru
Russian mnemonic keyboard layout
Language: HTML - Size: 2.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 7 - Forks: 2

VishwaGauravIn/string-tools-pro
🤏 Tiny & versatile 🔥 Node.js library for in-depth text analysis, manipulation and data extraction.
Language: JavaScript - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

Fusyong/zhpunc
a ConTeXt LMTX module to support Chinese punctuation
Language: Lua - Size: 1.74 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 4

linto-ai/linto-punctuation
LinTO Platform punctuation service.
Language: Python - Size: 72.3 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 1

dotland/mnemonic-kb-hy
Armenian mnemonic keyboard layout
Size: 3.22 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 4 - Forks: 2

cgnieder/fnpct
Manage interaction between footnotes and punctuation
Language: TeX - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

veltzer/gae-nikuda
Nikuda web site
Language: JavaScript - Size: 11 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

aidayang/FunASR-OneClick
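Real-time speech recognition edition of FunASR: recognizes audio from the microphone and sound played on the computer. Voice typing software for PCs.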
Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

dotland/mnemonic-kb-hy-r
Armenian Mnemonic R keyman keyboard layout
Language: HTML - Size: 800 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

bigcash/awesome-punctuator
A curated list of awesome punctuator
Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

callforpapers-source/doc2term
A fast sentence/word tokenizer, and punctuation remover.
Language: C - Size: 108 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

bryanchw/Traditional-Chinese-Stopwords-and-Punctuations-Library
Created a Python library specifically for Traditional Chinese stopwords and punctuations removal
Language: Python - Size: 43.9 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

Guevara-chan/Unicide
⋮Forced evolution for unicellular entities⋮
Language: HTML - Size: 400 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

hamedzarei/nlp-simple-punctuation-correction
Simple regex for correcting punctuation
Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

gitfaf/node-punctuation-stats
A small library for getting stats on punctuation in files. - Node Module
Language: JavaScript - Size: 6.84 KB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

dotland/mnemonic-kb-hy-km
Armenian Mnemonic keyman keyboard layout
Language: HTML - Size: 774 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 1

stdlib-js/string-remove-punctuation
Remove punctuation characters from a string.
Language: JavaScript - Size: 987 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

snowdreams1006/gitbook-plugin-punctuation-converter
A Gitbook plugin that uses regular expressions to globally convert English punctuation marks to Chinese punctuation marks
Language: JavaScript - Size: 884 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

appellj/Writing-Fundamentals-Guide
A guide to the fundamentals of technical writing in American English
Size: 80.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

KOUISAmine/punctuation-remover
Remove Punctuation is a tool that helps you strip all punctuation marks and symbols from a text document or input string.
Language: HTML - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

WhistlingZephyr/espanso-package-quotes
Type different types of quotes from many languages using espanso.
Language: TypeScript - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

rbardini/dashes
A quick reference guide to the use of dashes
Language: HTML - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

TurekBot/AutoDash
Want to type an Em Dash—now you can. Just type "--".
Language: AutoHotkey - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

artivilla/progressive-punctuation-open 📦
Punctuation Marks for the Open Web
Language: JavaScript - Size: 2.67 MB - Last synced at: 15 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

tonyjurg/John-punctuation-browser
Alternative punctuations for the N1904 Gospel of John
Language: Jupyter Notebook - Size: 6.65 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

googlefonts/exemplar
JSON endpoints for CLDR exemplar data by locale tag
Language: Python - Size: 2.8 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 1

demondehellis/corrector
External Tool for JetBrains IDEs to correct grammar and punctuation in selected lines.
Language: Python - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jparkerweb/punctuation-restore
🧑🏭 Node.js package for restoring punctuation and casing to strings via ONNX Model `punctuation_fullstop_truecase_english`
Language: JavaScript - Size: 1.67 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

wyattscarpenter/golgotha
Define new operators in any language. Do it, coward! Do it now!
Language: Python - Size: 32.2 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

harmanveer-2546/Tweets-Cleaning-with-Python
Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version. Besides, it’s pretty simple to collect data from it.
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

awatts/depunctuator
Language: Vue - Size: 1.33 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

tracyreuter/NLP-speech-to-text
Convert speech to text using HuggingFace, comparing Wav2Vec2 versus OpenAI Whisper
Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

tobah59x/FakeRealNews
Training a model to detect fake news articles, then identifying the text features that indicate fake news.
Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

super16/punct
SPA that strips the words from input text, leaving only punctuation
Language: HTML - Size: 521 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EmamulHossen/Spam_Email_Detection
Spam mail detection is the process of identifying and filtering out unwanted or unsolicited emails, commonly referred to as "spam," from a user's inbox.
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

TeachSolution/interview-punctuation-marks
☪ Useful Punctuation marks or symbols for live coding interview
Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Rotten-LKZ/nopun
Try to read writings in classical Chinese without punctuation! (Like the ancients, try working out the sentence breaks yourself while reading unpunctuated classical Chinese.)
Language: Vue - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

DuncanRitchie/KeyboardLayouts
Windows keyboard layouts (made with Microsoft Keyboard Layout Creator) for macrons (ā), breves (ă), and punctuation that I find useful.
Size: 164 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Rairye/mnl-punct-norm
Lightweight tool for removing punctuation. Supports multiple natural languages. Useful for scraping, machine learning, and data analysis.
Language: Python - Size: 55.7 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

sholladay/denizen
Username validation and processing utilities
Language: JavaScript - Size: 13.7 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Smlep/SpacingChecker
Simple shell program to check spacing around punctuation
Language: Shell - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

dankondr/PunctuationLearn
An application written in C++ that helps users practice punctuation
Language: C++ - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

JoeKarlsson/punc (fork of sgnl/punc)
Ever wonder what your favorite books look like without words?
Language: JavaScript - Size: 202 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

mremad/SpokenInputTopicDetection
Language: Python - Size: 46.8 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1
