Topic: "utf-8"
mpdf/mpdf
PHP library generating PDF files from UTF-8 encoded HTML
Language: PHP - Size: 93.4 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 4,498 - Forks: 1,077

danielstjules/Stringy
A PHP string manipulation library with multibyte support
Language: PHP - Size: 1.39 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 2,457 - Forks: 217

magiblot/tvision
A modern port of Turbo Vision 2.0, the classical framework for text-based user interfaces. Now cross-platform and with Unicode support.
Language: C++ - Size: 8.25 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 2,168 - Forks: 166

symfony/string
Provides an object-oriented API to strings and deals with bytes, UTF-8 code points and grapheme clusters in a unified way
Language: PHP - Size: 529 KB - Last synced at: 3 days ago - Pushed at: 8 days ago - Stars: 1,759 - Forks: 20

marzer/tomlplusplus
Header-only TOML config file parser and serializer for C++17.
Language: C++ - Size: 20.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,728 - Forks: 169

nemtrif/utfcpp
UTF-8 with C++ in a Portable Way
Language: C++ - Size: 173 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 1,685 - Forks: 211

ilai-deutel/kibi
A text editor in ≤1024 lines of code, written in Rust
Language: Rust - Size: 1.58 MB - Last synced at: 12 days ago - Pushed at: 18 days ago - Stars: 1,661 - Forks: 96

BalazsJako/ImGuiColorTextEdit
Colorizing text editor for ImGui
Language: C++ - Size: 1.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,525 - Forks: 263

bitcookies/winrar-keygen
Principle of WinRAR key generation.
Language: C++ - Size: 233 MB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 1,378 - Forks: 2,516

BurntSushi/bstr
A string type for Rust that is not required to be valid UTF-8.
Language: Rust - Size: 2.37 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 929 - Forks: 59

jagracey/Awesome-Unicode
:joy: :ok_hand: A curated list of delightful Unicode tidbits, packages and resources.
Language: JavaScript - Size: 225 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 925 - Forks: 67

rhysd/kiro-editor
A small terminal UTF-8 text editor written in Rust 📝🦀
Language: Rust - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 745 - Forks: 32

AmokHuginnsson/replxx
A readline and libedit replacement that supports UTF-8, syntax highlighting, hints and Windows and is BSD licensed.
Language: C++ - Size: 891 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 707 - Forks: 114

polygonplanet/encoding.js
Convert and detect character encoding in JavaScript
Language: JavaScript - Size: 1.76 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 600 - Forks: 125

yf-hk/transliteration
UTF-8 to ASCII transliteration / slugify module for node.js, browser, Web Worker, React Native, Electron and CLI.
Language: TypeScript - Size: 1.9 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 583 - Forks: 53

rusticstuff/simdutf8
SIMD-accelerated UTF-8 validation for Rust.
Language: Rust - Size: 2.97 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 548 - Forks: 29

DuffsDevice/tiny-utf8
Unicode (UTF-8) capable std::string
Language: C++ - Size: 854 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 548 - Forks: 44

voku/portable-utf8
🉑 Portable UTF-8 library - performance optimized (unicode) string functions for PHP.
Language: PHP - Size: 8.74 MB - Last synced at: 1 day ago - Pushed at: 24 days ago - Stars: 516 - Forks: 86

magiblot/turbo
An experimental text editor based on Scintilla and Turbo Vision.
Language: C++ - Size: 1.29 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 516 - Forks: 36

JakubSzark/zig-string
A String Library made for Zig
Language: Zig - Size: 107 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 460 - Forks: 35

InstantWebP2P/peer-vnc
Secure Access VNC from anywhere based on noVNC
Language: JavaScript - Size: 28 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 337 - Forks: 74

adalkiran/llama-nuts-and-bolts
A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.
Language: Go - Size: 21.8 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 303 - Forks: 15

whatwg/encoding
Encoding Standard
Language: HTML - Size: 6.83 MB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 295 - Forks: 83

uni-algo/uni-algo
Unicode Algorithms Implementation for C/C++
Language: C++ - Size: 2.32 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 294 - Forks: 26

sallar/stringz
:100: Super fast unicode-aware string manipulation Javascript library
Language: TypeScript - Size: 1.03 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 237 - Forks: 11

ehmicky/cross-platform-terminal-characters
All the characters that work on most terminals
Language: JavaScript - Size: 4.64 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 228 - Forks: 6

Lichtso/netLink
Socket and Networking Library using msgpack.org[C++11]
Language: C++ - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 216 - Forks: 48

jecolon/ziglyph
Unicode text processing for the Zig programming language.
Size: 32.7 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 209 - Forks: 7

end2endzone/ShellAnything
ShellAnything is a C++ open-source software which allow one to easily customize and add new options to *Windows Explorer* context menu. Define specific actions when a user right-click on a file or a directory.
Language: C++ - Size: 6.25 MB - Last synced at: 27 days ago - Pushed at: 5 months ago - Stars: 198 - Forks: 29

stuartcarnie/go-simd
Optimized functions for Go using SIMD
Language: Assembly - Size: 88.9 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 194 - Forks: 9

fe3dback/str
A fast, solid and strong typed string manipulation library with multibyte support
Language: PHP - Size: 354 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 189 - Forks: 12

a-merezhanyi/voca_rs
Voca_rs is the ultimate Rust [unicode] string library, implemented as independent functions and on Foreign Types (String and str).
Language: Rust - Size: 3.56 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 182 - Forks: 11

U8String/U8String
[work-in-progress] Highly functional and performant UTF-8 string primitive for C#
Language: C# - Size: 2.96 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 173 - Forks: 2

anonyco/FastestSmallestTextEncoderDecoder
The fastest smallest Javascript polyfill for encodeInto of TextEncoder, encode of TextEncoder, and decode of TextDecoder for UTF-8 only.
Language: JavaScript - Size: 56.4 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 149 - Forks: 36

ww898/utf-cpp
UTF-8/16/32 C++11 header only library for Windows / Linux / macOS
Language: C++ - Size: 89.8 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 132 - Forks: 19

janlelis/unibits
Visualize different Unicode encodings in the terminal
Language: Ruby - Size: 1.51 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 129 - Forks: 3

MitchTalmadge/ASCII-Data
A small Java library for producing nice looking text-based line-graphs and tables.
Language: Java - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 124 - Forks: 14

websockets/utf-8-validate
Check if a buffer contains valid UTF-8
Language: JavaScript - Size: 118 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 119 - Forks: 36

jecolon/zigstr
Zigstr is a UTF-8 string type for Zig programs.
Size: 1.86 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 111 - Forks: 4

samthor/fast-text-encoding
Fast polyfill for TextEncoder and TextDecoder, only supports UTF-8
Language: JavaScript - Size: 117 KB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 106 - Forks: 31

thpatch/win32_utf8
Transparent UTF-8 support for native Win32 ANSI applications
Language: C - Size: 398 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 101 - Forks: 6

kayon/iploc 📦
每秒百万高性能IP查询库,使用纯真IP库,国内省、市、县,qqwry.dat转换工具:GBK转为UTF-8
Language: Go - Size: 24.3 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 95 - Forks: 37

anthonynsimon/jurl
Fast and simple URL parsing for Java, with UTF-8 and path resolving support
Language: Java - Size: 211 KB - Last synced at: 3 days ago - Pushed at: about 6 years ago - Stars: 86 - Forks: 11

life4/homoglyphs 📦
Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.
Language: Python - Size: 563 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 22

cyb70289/utf8
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Language: C - Size: 270 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 78 - Forks: 10

MJVL/UniObfuscator
Java obfuscator that hides code in comment tags and Unicode garbage by making use of Java's Unicode escapes.
Language: Java - Size: 3.1 MB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 6

ws-garcia/VBA-CSV-interface
The power you need to cleanse, filter, sort, reshape, manage and analyze data from CSV files.
Language: VBA - Size: 146 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 73 - Forks: 9

gaborcsardi/rencfaq
The R Encoding FAQ
Size: 177 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 3

sunxfancy/flex-bison-examples
a list of flex/bison examples to show reentrant/C++/error-handling
Language: C - Size: 97.7 KB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 62 - Forks: 6

zrax/string_theory
Flexible modern C++ string library with type-safe formatting
Language: C++ - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 12

BobSteagall/utf_utils
My work on high-speed conversion of UTF-8 to UTF-32/UTF-16
Language: C++ - Size: 2.37 MB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 12

AidanSun05/ImGuiTextSelect
Text selection implementation for Dear ImGui
Language: C++ - Size: 554 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 58 - Forks: 7

figsoda/utf8
UTF-8 support for Nix
Language: Nix - Size: 33.2 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 57 - Forks: 0

eddieantonio/ocreval
Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Language: C - Size: 403 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 57 - Forks: 14

danielkrupinski/StringPool
A performant and memory efficient storage for immutable strings with C++17. Supports all standard char types: char, wchar_t, char16_t, char32_t and C++20's char8_t.
Language: C++ - Size: 96.7 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 56 - Forks: 8

benkasminbullock/unicode-c
A C library for handling Unicode, UTF-8, surrogate pairs, etc.
Language: C - Size: 189 KB - Last synced at: 16 days ago - Pushed at: almost 4 years ago - Stars: 51 - Forks: 8

sugawarayuuta/charcoal
Faster utf8.Valid using multi-byte processing without SIMD.
Language: Go - Size: 232 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 1

gpakosz/UnicodeBOMInputStream
Doing things right, in the name of Sun / Oracle
Language: Java - Size: 11.7 KB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 38 - Forks: 12

stimulsoft/Samples-Reports.WEB-for-ASP.NET-MVC
ASP.NET MVC samples for Reports.WEB embedded report components, Visual Studio C# projects, and .NET Framework 4.5.2, 4.6, 4.7, 4.8 report engine
Language: JavaScript - Size: 59 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 40

Aternus/csv-to-xlsx
Convert CSV files to XLSX (Excel 2007+ XML Format) files.
Language: TypeScript - Size: 382 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 14

Acceis/unisec
Unicode Security Toolkit
Language: Ruby - Size: 701 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 34 - Forks: 2

jasonlam604/Stringizer
String Manipulation Library for PHP with MultiByte support
Language: PHP - Size: 231 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 34 - Forks: 1

rick-de-water/Lingo
Text encoding for modern C++
Language: C++ - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 29 - Forks: 2

artichoke/intaglio
🗃 UTF-8 string, byte string, and C string interner
Language: Rust - Size: 2.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 27 - Forks: 1

janlelis/characteristics
Character info under different encodings
Language: Ruby - Size: 68.4 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 27 - Forks: 1

devstein/unicode-eth
The Unicode Ethereum Project is an initiative to provide libraries and contracts for Unicode data, algorithms, and utilities for Ethereum developers.
Language: Solidity - Size: 1.47 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 26 - Forks: 1

eriknyquist/boyermoore
Boyer-moore in pure python, search for unicode strings in large files quickly
Language: Python - Size: 2.79 MB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 0

p-ranav/lexer
Hackable Lexer with UTF-8 support
Language: C++ - Size: 124 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 22 - Forks: 0

myfreeer/nginx-build-msys2
static nginx build scripts on msys2 mingw with dependencies and custom patches for windows
Language: Shell - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 16

hcodes/isutf8
Quick check if a Node.js Buffer or Uint8Array is UTF-8
Language: TypeScript - Size: 1.09 MB - Last synced at: 24 days ago - Pushed at: 8 months ago - Stars: 21 - Forks: 3

janlelis/unicopy
Unicode command-line codepoint dumper
Language: Ruby - Size: 20.5 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 1

detiam/Linux-TTY-UTF-8-Patch
Let TTY of the Linux kernel support UTF-8 (like CJKTTY
Language: C - Size: 5.29 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 20 - Forks: 8

digital-preservation/utf8-validator
UTF-8 Validator
Language: Java - Size: 120 KB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 7

sauce-code/cuckoo
This is an adaption of Peter Österlund's CuckooChess 1.12. The source code provided is a Java Maven project in UTF-8. The program, except for the chess font, is copyrighted by Peter Österlund, and is available as open source under the GNU GPL v3 license.
Language: Java - Size: 332 KB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 24

hkspirt/ahocorasick
基于ahocorasick算法的敏感词过滤,支持中文、线程安全
Language: Go - Size: 94.7 KB - Last synced at: 11 months ago - Pushed at: over 6 years ago - Stars: 18 - Forks: 3

ThinkR-open/utf8splain
Explain utf-8 encoded strings
Language: R - Size: 222 KB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 1

m2osw/libutf8
C++ UTF-8 string handling utilities with conversions and a simple to use iterator
Language: C++ - Size: 992 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 16 - Forks: 1

KaiHuaDou/calibre Fork of kovidgoyal/calibre
Calibre that doesn't enforce ASCII filenames 不强制使用 ASCII 文件名的 Calibre
Language: Python - Size: 290 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 16 - Forks: 1

krlmlr/enc 📦
A simple class for storing UTF-8 strings
Language: R - Size: 428 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 3

anonyco/BestBase64EncoderDecoder
The most standard, most cross-browser, most compact, and fastest possible btoa and atob solution for unicode strings with high code points.
Language: JavaScript - Size: 84 KB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 5

BobSteagall/CppCon2018
Materials from my talks from CppCon 2018
Language: C++ - Size: 3.81 MB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 2

m13253/libWinTF8
The library handling things related to UTF-8 and Unicode when you want to port your program to Windows
Language: C++ - Size: 142 KB - Last synced at: 25 days ago - Pushed at: over 8 years ago - Stars: 16 - Forks: 3

sanette/ubase
remove accents from utf8 strings
Language: OCaml - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 1

akoweb/tcpdf
persian and arabic fonts for TCPDF - PHP -فونت فارسی برای tcpdf
Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 8

jianboy/gbk2utf8
其他编码文件批量转换为utf-8编码工具。http://git.yoqi.me/lyq/gbk2utf8
Language: Python - Size: 5.86 KB - Last synced at: 25 days ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 11

rokups/nim-ustring
utf-8 string for Nim
Language: C - Size: 146 KB - Last synced at: 7 days ago - Pushed at: almost 6 years ago - Stars: 15 - Forks: 1

softlandia/cpd
code page detect
Language: Go - Size: 2.89 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

contrebande-labs/charred
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell
Language: Python - Size: 264 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 3

Lekensteyn/lua-unicode
Patched Lua library to add UTF-8 support on Windows.
Language: CMake - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 5

Fourmilab/unum
Utility for looking up Unicode characters and HTML entities by code, name, block, or description. Written in Perl, compatible with almost any system that runs Perl.
Language: Perl - Size: 4.18 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 0

buildthomas/Demojify
Remove emoji characters from a string in Roblox
Language: Lua - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 3

ehmicky/string-byte-length
Get the UTF-8 byte length of a string.
Language: JavaScript - Size: 7.96 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 1

jfcherng/php-mb-string
An implementation targeting high performance for frequently reading/writing operations for multi-byte string.
Language: PHP - Size: 217 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 0

3urobeat/arduino-lcdHelper-library
Make working with LCD displays easier and improve UTF-8 support
Language: C++ - Size: 94.7 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 0

katahiromz/mcpp
UTF-16 readable C preprocessor (A fork of mcpp 2.7.2)
Language: C - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 12 - Forks: 0

devatrun/sutfcpplib
Simple UTF library for C++
Language: C++ - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

uetchy/binyl
🔬 Bitwise UTF-8 string inspector
Language: Rust - Size: 1 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 11 - Forks: 0

rfivet/uemacs
µEMACS (ue) on Cygwin/Linux/NetBSD, based on uEmacs/PK (em) from kernel.org.
Language: C - Size: 1.16 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 10 - Forks: 7

mnemnion/runeset
Fast UTF-8 codepoint sets for Zig.
Language: Zig - Size: 46.4 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 0

huanz/vscode-GBKtoUTF8
a vscode extension to convert gbk to utf8
Language: TypeScript - Size: 44.9 KB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 4
