GitHub topics: unicode-normalization
obinexus/nlink
NexusLink is a revolutionary modular build orchestrator that fundamentally reimagines build system architecture through the application of automaton theory and state machine minimization principles. Developed by Nnamdi Michael Okpala at OBINexus Computing.
Language: C - Size: 6.8 MB - Last synced at: 40 minutes ago - Pushed at: about 1 hour ago - Stars: 1 - Forks: 0

mlodewijck/pyunormalize
Unicode normalization forms (NFC, NFKC, NFD, NFKD). A library independent of the Python core Unicode database. This library supports version 16.0 of the Unicode Standard.
Language: Python - Size: 1.87 MB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 9 - Forks: 2

railgunlabs/unicorn
Unicode® algorithms on a chip. Compliant with MISRA C:2012.
Language: Python - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 62 - Forks: 3

google-research/nisaba
Finite-state script normalization and processing utilities
Language: Python - Size: 2.41 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 40 - Forks: 4

AsoSoft/AsoSoft-Library-py
AsoSoft's Library for Kurdish language processing tasks in python
Language: Python - Size: 49.8 KB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 15 - Forks: 2

iseki0/ktunstrnorm
Lightweight Unicode normalization for Kotlin Multiplatform. Uses system APIs (like ICU, Java Normalizer, JS normalize) with zero bundled data.
Language: Kotlin - Size: 74.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

seanghay/khnormal.cpp
Khmer encoding normalization implementation in C++.
Language: C++ - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

elgar328/nfd2nfc
nfd2nfc is a macOS CLI tool that converts filenames between NFD and NFC for consistent cross-platform compatibility. It also includes a background service that continuously monitors specified folders and automatically applies the necessary conversions in real time.
Language: Rust - Size: 38.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

composewell/unicode-transforms
Fast Unicode normalization in Haskell
Language: Haskell - Size: 29.5 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 47 - Forks: 16

nitely/nim-normalize
Unicode normalization forms (tr15) in linear time
Language: Nim - Size: 1.08 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 20 - Forks: 0

AsoSoft/AsoSoft-Library
AsoSoft's Library for Kurdish language processing tasks
Language: C# - Size: 349 KB - Last synced at: 25 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 1

tanaikech/ConvertNFDtoNFC
This is a script for converting strings from NFD (Normalization Form Decomposition) to NFC (Normalization Form Composition) using Google Apps Script.
Language: JavaScript - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

ClaireCJS/fix_unicode_filenames
remove all emoji, unicode, and special/problematic characters from filenames and strings with this standalone tool / importable module
Language: Python - Size: 2.75 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

eldstal/strinvader
Unicode denormalization tool
Language: Python - Size: 195 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

Kurdinus/kurdinusLibrary
JavaScript tools for normalization and transliteration of Kurdish texts
Language: JavaScript - Size: 2.3 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 2

sile/unf
Unicode Normalization Forms
Language: C++ - Size: 2.93 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

Erutuon/lutf8proc
Lua bindings to utf8proc
Language: C - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

brynne8/ccnorm
Lua Unicode normalization data
Language: Lua - Size: 414 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

coarchive/hangul-unicode 📦
A library to process and standardize hangul characters
Language: JavaScript - Size: 2.34 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

ssrathi/text_styler
Convert ASCII alphanumeric text to a random style using Unicode character normalization.
Language: Python - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

isawnyu/textnorm
Normalize whitespace and Unicode forms in Python 3.
Language: Python - Size: 37.1 KB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 0

rohankhayech/UnicodeVisualSpoofing
Unicode visual spoofing demo program showcasing the vulnerability and a patch.
Language: Java - Size: 67.8 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

varnishcache-friends/libvmod-utf8
A Varnish VMOD for Unicode normalization, case-folding, and other operations for data in the UTF-8 encoding
Language: C - Size: 67.4 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

johannesl/RealWorldPHP
Tools useful for real world PHP projects, such as a nice MySQL API and Unicode precomposer / NFC normalizer.
Language: PHP - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

jarrodldavis/normalize-filenames 📦
A quick-and-dirty Ruby script to perform Unicode Normalization on filesystem paths
Language: Ruby - Size: 14.6 KB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

linux-apfs/apfs-ucd-parser
Unicode data parser for the linux-apfs module
Language: C - Size: 681 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

sabracrolleton/saslprep
A common lisp implementation of unicode normalization and saslprep as required by RFC 4013 for usernames and passwords
Language: Common Lisp - Size: 1.17 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

jleeothon/fish-normalise-unicode
Normalise Unicode in files provided using Node
Language: Shell - Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

sjorek/typo3-unicode-normalization
A TYPO3 extension adding unicode-normalization capabilities to TYPO3.
Language: PHP - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

azyobuzin/UnicodeNormalizationSample
Language: C# - Size: 791 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 1
