An open API service providing repository metadata for many open source software ecosystems.

Topic: "punctuation"

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python - Size: 100 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 10,292 - Forks: 1,036

ottokart/punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

Language: Python - Size: 55.7 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 677 - Forks: 194

notAI-tech/deepsegment 📦

A sentence segmenter that actually works!

Language: Python - Size: 81.1 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 305 - Forks: 56

notAI-tech/fastPunct 📦

Punctuation restoration and spell correction experiments.

Language: Python - Size: 35.2 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 252 - Forks: 39

26hzhang/neural_sequence_labeling

A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.

Language: Python - Size: 136 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 234 - Forks: 46

davidmogar/cucco

Text normalization library for Python

Language: Python - Size: 188 KB - Last synced at: 6 days ago - Pushed at: about 7 years ago - Stars: 204 - Forks: 27

yeyupiaoling/PunctuationModel

中文标点符号模型,可以给文本添加标点符号。

Language: Python - Size: 6.74 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 140 - Forks: 21

bedapudi6788/deepcorrect 📦

Text and Punctuation correction with Deep Learning

Language: Python - Size: 32.2 KB - Last synced at: 9 months ago - Pushed at: about 5 years ago - Stars: 129 - Forks: 33

motazsaad/process-arabic-text

Pre-process arabic text (remove diacritics, punctuations and repeating characters)

Language: Python - Size: 16.6 KB - Last synced at: 4 days ago - Pushed at: about 8 years ago - Stars: 106 - Forks: 37

LanguageMachines/ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

Language: C++ - Size: 6.17 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 68 - Forks: 14

kaituoxu/X-Punctuator

A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.

Language: Python - Size: 158 KB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 62 - Forks: 20

mbejda/Node-OpenNLP

Apache OpenNLP wrapper for Nodejs

Language: JavaScript - Size: 19.7 MB - Last synced at: 16 days ago - Pushed at: about 6 years ago - Stars: 56 - Forks: 17

FerdinandZhong/punctuator

A small seq2seq punctuator tool based on DistilBERT

Language: Python - Size: 119 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 51 - Forks: 8

gleb-skobinsky/ru_punct

Нейронная сеть для восстановления пунктуации на русском языке.

Language: Python - Size: 642 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 18 - Forks: 5

ZarahShibli/Arabic_Punctuation_Prediction

Sequence to sequence model for Arabic punctuation prediction.

Language: Jupyter Notebook - Size: 58.9 MB - Last synced at: 18 days ago - Pushed at: about 5 years ago - Stars: 12 - Forks: 1

rewired-gh/tep

A blazingly fast tool for converting to English punctuations

Language: Rust - Size: 326 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 1

regexhq/punctuation-regex

Regular expression for matching punctuation characters.

Language: JavaScript - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 10 - Forks: 2

ChristophLabacher/fix-punctuation

Regular Expressions for finding wrong punctuation before publishing.

Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: about 8 years ago - Stars: 10 - Forks: 1

ZNClub-PA-ML-AI/Sentiment-analysis-using-Business-News

#Sentimental Analytics

Language: CSS - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 5

dotland/mnemonic-kb-ru

Russian mnemonic keyboard layout

Language: HTML - Size: 2.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 7 - Forks: 2

VishwaGauravIn/string-tools-pro

🤏 Tiny & versatile 🔥 Node.js library for in-depth text analysis, manipulation and data extraction.

Language: JavaScript - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

Fusyong/zhpunc

a ConTeXt LMTX module to support Chinese punctuation

Language: Lua - Size: 1.74 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 4

linto-ai/linto-punctuation

LinTO Platform punctuation service.

Language: Python - Size: 72.3 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 1

dotland/mnemonic-kb-hy

Armenian mnemonic keyboard layout

Size: 3.22 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 4 - Forks: 2

cgnieder/fnpct

Manage interaction between footnotes and punctuation

Language: TeX - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

veltzer/gae-nikuda

Nikuda web site

Language: JavaScript - Size: 11 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

aidayang/FunASR-OneClick

FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件

Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

dotland/mnemonic-kb-hy-r

Armenian Mnemonic R keyman keyboard layout

Language: HTML - Size: 800 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

bigcash/awesome-punctuator

A curated list of awesome punctuator

Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

callforpapers-source/doc2term

A fast sentence/word tokenizer, and punctuation remover.

Language: C - Size: 108 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

bryanchw/Traditional-Chinese-Stopwords-and-Punctuations-Library

Created a Python library specifically for Traditional Chinese stopwords and punctuations removal

Language: Python - Size: 43.9 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

Guevara-chan/Unicide

⋮Forced evolution for unicellular entites⋮

Language: HTML - Size: 400 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

hamedzarei/nlp-simple-punctuation-correction

simple regex for correcting punctuations

Language: Python - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

gitfaf/node-punctuation-stats

A small library for getting stats on punctuation in files. - Node Module

Language: JavaScript - Size: 6.84 KB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

dotland/mnemonic-kb-hy-km

Armenian Mnemonic keyman keyboard layout

Language: HTML - Size: 774 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 1

stdlib-js/string-remove-punctuation

Remove punctuation characters from a string.

Language: JavaScript - Size: 987 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

snowdreams1006/gitbook-plugin-punctuation-converter

基于正则表达式实现全局英文标点符号转换成中文标点符号的 Gitbook 插件

Language: JavaScript - Size: 884 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

appellj/Writing-Fundamentals-Guide

A guide to the fundamentals of technical writing in American English

Size: 80.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

KOUISAmine/punctuation-remover

Remove Punctuation is a tool that help you to strip all punctuation marks and symbols from a text document or input string.

Language: HTML - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

WhistlingZephyr/espanso-package-quotes

Type different type of quotes from many languages using espanso.

Language: TypeScript - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

rbardini/dashes

A quick reference guide to the use of dashes

Language: HTML - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

TurekBot/AutoDash

Want to type an Em Dash—now you can. Just type "--".

Language: AutoHotkey - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

artivilla/progressive-punctuation-open 📦

Punctuation Marks for the Open Web

Language: JavaScript - Size: 2.67 MB - Last synced at: 15 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

tonyjurg/John-punctuation-browser

Alternative puntuations for the N1904 Gospel of John

Language: Jupyter Notebook - Size: 6.65 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

googlefonts/exemplar

JSON endpoints for CLDR exemplar data by locale tag

Language: Python - Size: 2.8 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 1

demondehellis/corrector

External Tool for JetBrains IDEs to correct grammar and punctuation in selected lines.

Language: Python - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jparkerweb/punctuation-restore

🧑‍🏭 Node.js package for restoring punctuation and casing to strings via ONNX Model `punctuation_fullstop_truecase_english`

Language: JavaScript - Size: 1.67 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

wyattscarpenter/golgotha

Define new operators in any language. Do it, coward! Do it now!

Language: Python - Size: 32.2 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

harmanveer-2546/Tweets-Cleaning-with-Python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version one. Besides, it’s pretty simple to collect data from it.

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

awatts/depunctuator

Language: Vue - Size: 1.33 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

tracyreuter/NLP-speech-to-text

Convert speech to text using HuggingFace, comparing Wav2Vec2 versus OpenAI Whisper

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

tobah59x/FakeRealNews

Training a model to detect fake news articles, then Identifying the text features that indicate fake news.

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

super16/punct

SPA that clears input text from words, leaves only punctuation

Language: HTML - Size: 521 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

EmamulHossen/Spam_Email_Detection

Spam mail detection is the process of identifying and filtering out unwanted or unsolicited emails, commonly referred to as "spam," from a user's inbox.

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

TeachSolution/interview-punctuation-marks

☪ Useful Punctuation marks or symbols for live coding interview

Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Rotten-LKZ/nopun

Try to read writings in classical Chinese without punctuation!(学古人尝试知句读,读没有标点的文言文!)

Language: Vue - Size: 39.1 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

DuncanRitchie/KeyboardLayouts

Windows keyboard layouts (made with Microsoft Keyboard Layout Creator) for macrons (ā), breves (ă), and punctuation that I find useful.

Size: 164 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Rairye/mnl-punct-norm

Light-weight tool for removing punctuation. Supports multiple natural languages. Useful for scrapping, machine learning, and data analysis.

Language: Python - Size: 55.7 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

sholladay/denizen

Username validation and processing utilities

Language: JavaScript - Size: 13.7 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Smlep/SpacingChecker

Simple shell program to check spacing around punctuation

Language: Shell - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

dankondr/PunctuationLearn

An application made on C++ which helps user train punctuation

Language: C++ - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

JoeKarlsson/punc Fork of sgnl/punc

Ever wonder what your favorite books look like without words?

Language: JavaScript - Size: 202 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

mremad/SpokenInputTopicDetection

Language: Python - Size: 46.8 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1