Topic: "utf8"
sheredom/utf8.h
📚 single header utf8 string functions for C and C++
Language: C - Size: 275 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 1,815 - Forks: 132

symfony/string
Provides an object-oriented API to strings and deals with bytes, UTF-8 code points and grapheme clusters in a unified way
Language: PHP - Size: 528 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 1,757 - Forks: 21

BalazsJako/ImGuiColorTextEdit
Colorizing text editor for ImGui
Language: C++ - Size: 1.4 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 1,525 - Forks: 263

simdutf/simdutf
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.
Language: C++ - Size: 7.76 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,346 - Forks: 87

d99kris/rapidcsv
C++ CSV parser library
Language: C++ - Size: 15.2 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 948 - Forks: 190

jagracey/Awesome-Unicode
😂 👌 A curated list of delightful Unicode tidbits, packages and resources.
Language: JavaScript - Size: 225 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 924 - Forks: 67

moononournation/Arduino_GFX
Arduino GFX developing for various color displays and various data bus interfaces
Language: C - Size: 39.4 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 922 - Forks: 179

mathiasbynens/utf8.js
A robust JavaScript implementation of a UTF-8 encoder/decoder, as defined by the Encoding Standard.
Language: JavaScript - Size: 59.6 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 562 - Forks: 115

DuffsDevice/tiny-utf8
Unicode (UTF-8) capable std::string
Language: C++ - Size: 854 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 548 - Forks: 44

voku/portable-utf8
🉑 Portable UTF-8 library - performance optimized (unicode) string functions for PHP.
Language: PHP - Size: 8.74 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 516 - Forks: 85

haskell/text
Haskell library for space- and time-efficient operations over Unicode text.
Language: Haskell - Size: 3.43 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 412 - Forks: 158

cytopia/awesome-ci
Awesome Continuous Integration - Lots of tools for git, file and static source code analysis.
Language: Shell - Size: 318 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 334 - Forks: 20

anyascii/anyascii
Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET
Language: Kotlin - Size: 69.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 305 - Forks: 25

uni-algo/uni-algo
Unicode Algorithms Implementation for C/C++
Language: C++ - Size: 2.32 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 294 - Forks: 26

p-ranav/hypergrep
Recursively search directories for a regex pattern
Language: C++ - Size: 9.31 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 213 - Forks: 7

a-merezhanyi/voca_rs
Voca_rs is the ultimate Rust [unicode] string library, implemented as independent functions and on Foreign Types (String and str).
Language: Rust - Size: 3.56 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 182 - Forks: 11

Code-Hex/firebase-auth-cloudflare-workers
Language: TypeScript - Size: 635 KB - Last synced at: 21 days ago - Pushed at: 4 months ago - Stars: 150 - Forks: 6

anonyco/FastestSmallestTextEncoderDecoder
The fastest smallest Javascript polyfill for encodeInto of TextEncoder, encode of TextEncoder, and decode of TextDecoder for UTF-8 only.
Language: JavaScript - Size: 56.4 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 149 - Forks: 36

Stepets/utf8.lua
pure-lua 5.3 regex library
Language: Lua - Size: 138 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 143 - Forks: 27

ww898/utf-cpp
UTF-8/16/32 C++11 header only library for Windows / Linux / macOS
Language: C++ - Size: 89.8 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 132 - Forks: 19

websockets/utf-8-validate
Check if a buffer contains valid UTF-8
Language: JavaScript - Size: 118 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 119 - Forks: 36

x1angli/cvt2utf
This lightweight tool converts non-UTF-encoded (such as GB2312, GBK, BIG5 encoded) files to UTF-8 encoding.
Language: Python - Size: 84 KB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 27

life4/homoglyphs 📦
Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.
Language: Python - Size: 563 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 22

moehriegitt/vastringify
Type-safe Printf in C
Language: C - Size: 238 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 72 - Forks: 5

ivanseidel/unicute
💙 Cute Unicode symbols. Make the terminal GREAT AGAIN
Size: 1000 Bytes - Last synced at: 18 days ago - Pushed at: about 8 years ago - Stars: 59 - Forks: 2

soasis/cuneicode
A C library for converting between two different encodings in a simple, easy, and powerful way.
Language: C++ - Size: 2.24 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 57 - Forks: 7

serokell/haskell-with-utf8
Get your IO right on the first try
Language: Haskell - Size: 159 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 54 - Forks: 3

SynchronetBBS/sbbs
Mirror of gitlab.synchro.net/sbbs (don't submit pull requests here)
Language: C - Size: 209 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 52 - Forks: 14

figsoda/utf8
UTF-8 support for Nix
Language: Nix - Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 0

p-ranav/unicode_display_width
Displayed width of UTF-8 strings in Modern C++
Language: C++ - Size: 498 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 47 - Forks: 6

vilien/base64-utf8
Dependency-free base64 encoding/decoding module for UTF-8 characters; safe to use in WeChat Mini Programs
Language: JavaScript - Size: 13.7 KB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 16

jwerle/libutf8
A whatwg compliant UTF8 encoding and decoding library
Language: C - Size: 14.6 KB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 35 - Forks: 4

patch/unicode-programming
Unicode programming examples
Size: 38.1 KB - Last synced at: 28 days ago - Pushed at: over 8 years ago - Stars: 35 - Forks: 2

Gumichan01/utf8_string
A simple implementation of utf8 strings for C++
Language: C++ - Size: 252 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 6

maoxuepeng/mysqlutf8
MySQL image that supports UTF-8 encoding by default
Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 28 - Forks: 16

maxlath/fix-utf8
Fix Unicode encoding errors
Language: JavaScript - Size: 62.5 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 26 - Forks: 8

kimsehwan96/pyjosa
A simple Python 🇰🇷 library for handling Korean particles (josa); it handles 은/는, 와/과, 이/가, and others. An open-source project published on PyPI.
Language: Python - Size: 2.91 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 23 - Forks: 1

zzzsochi/trans
National characters transcription module.
Language: Python - Size: 55.7 KB - Last synced at: 2 days ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 10

saberzero1/unzip-jp-gui
Unzip Japanese Shift-JIS zip archives on non-Japanese systems.
Language: Python - Size: 76.2 KB - Last synced at: 21 days ago - Pushed at: 11 months ago - Stars: 22 - Forks: 4

eriknyquist/boyermoore
Boyer-moore in pure python, search for unicode strings in large files quickly
Language: Python - Size: 2.79 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 0

msztolcman/subst
Search and des... argh... replace in many files at once. Use regexp and power of Python to replace what you want.
Language: Python - Size: 165 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 3

sauce-code/cuckoo
This is an adaptation of Peter Österlund's CuckooChess 1.12. The source code provided is a Java Maven project in UTF-8. The program, except for the chess font, is copyrighted by Peter Österlund, and is available as open source under the GNU GPL v3 license.
Language: Java - Size: 332 KB - Last synced at: 13 days ago - Pushed at: 23 days ago - Stars: 18 - Forks: 24

anonyco/BestBase64EncoderDecoder
The most standard, most cross-browser, most compact, and fastest possible btoa and atob solution for unicode strings with high code points.
Language: JavaScript - Size: 84 KB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 5

mediamonks/symfony-mssql-bundle
Greatly improves mssql support for Symfony on Unix using pdo_dblib
Language: PHP - Size: 35.2 KB - Last synced at: 19 days ago - Pushed at: almost 8 years ago - Stars: 16 - Forks: 2

BassLC/idUTF8lib
Idiot's UTF-8 Library
Language: C++ - Size: 6.67 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 5

nigels-com/tutf8e
Tiny UTF-8 Encoder for C
Language: C - Size: 126 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 8

cytopia/docker-file-lint
Alpine-based Docker image to perform generic file checks on your source code in order to improve consistency within your repository (e.g. for easy usage in CI).
Language: Shell - Size: 87.9 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 4

Stenway/RSV-Specification
Rows of String Values (RSV Data Format) Specification - A Simple Binary Alternative to CSV
Size: 477 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

svenvc/UTF8String
A proof of concept / prototype alternative String implementation for Pharo using a variable length UTF8 encoded internal representation
Language: Smalltalk - Size: 44.9 KB - Last synced at: 19 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 0

apollo008/orchid-fst
Orchid-Fst implements a finite state transducer (FST), a fast data structure for text string dictionary search, in C++. This open-source C++ FST project has many significant advantages.
Language: C++ - Size: 7.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 2

rastapasta/invisible-attachment
🙈 Utilize invisible UTF8-characters to encode and attach any integer to a string without changing its visual appearance
Language: JavaScript - Size: 78.1 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

devatrun/sutfcpplib
Simple UTF library for C++
Language: C++ - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

ikozyris/kri
simple, compact & very fast text editor
Language: C++ - Size: 769 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 10 - Forks: 1

stgatilov/utf8lut
Vectorized UTF-8 conversion with LookUp Tables
Language: C++ - Size: 1.08 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

huanz/vscode-GBKtoUTF8
a vscode extension to convert gbk to utf8
Language: TypeScript - Size: 44.9 KB - Last synced at: 25 days ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 4

gene-hightower/ghsmtp
Gene's SMTP server — receive Internet mail with less fuss
Language: C++ - Size: 2.32 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 3

sindresorhus/gulp-bom
Add a UTF-8 BOM to files
Language: JavaScript - Size: 11.7 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1

mirage/uuuu
Language: OCaml - Size: 97.7 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 2

sasairc/clangsay
The classic cowsay program, written in C.
Language: C - Size: 364 KB - Last synced at: about 1 hour ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 3

grantm/encoding-fixlatin
CPAN module: Fixes Latin-1 and CP1252 characters in UTF8 data
Language: Perl - Size: 221 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 4

LB--/utf
My personal C++14 UTF-8 library in the public domain.
Language: C++ - Size: 55.7 KB - Last synced at: 7 days ago - Pushed at: over 8 years ago - Stars: 9 - Forks: 3

vampirefrog/x68ksjis
X68000 specific Shift-JIS to/from Unicode conversion code
Language: C - Size: 132 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2

duzun/string-encode.js
Convert different types of JavaScript String to/from Uint8Array
Language: JavaScript - Size: 177 KB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 0

hayk314/LaTex-handler
functionality for working with LaTex files
Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 5

guzba/unicody
An alternative / companion to Nim's std/unicode and std/strutils.
Language: Nim - Size: 109 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

lambdalisue/rs-async-utf8-decoder
🦀 Utf8 decoder for AsyncRead in futures-rs
Language: Rust - Size: 69.3 KB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

rvncerr/u8
Do not like ICU? Here is a basic UTF-8 library in C.
Language: C - Size: 6.84 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 0

mirage/yuscii
UTF-7 decoder to Unicode
Language: C - Size: 59.6 KB - Last synced at: 5 days ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 2

hermanzdosilovic/petiteutf8
Petite C++17 UTF-8 library
Language: C++ - Size: 28.3 KB - Last synced at: about 6 hours ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 1

hetao29/sphinx Fork of sphinxsearch/sphinx
Sphinx for Chinese with scws. For usage, see https://github.com/hetao29/sphinx-chinese
Language: C++ - Size: 23.1 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 3

rmawatson/utf
utf iterators & converters for modern c++
Language: C++ - Size: 175 KB - Last synced at: about 8 hours ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

ivantcholakov/codeigniter-utf8
UTF-8 string support for CodeIgniter based on Kohana's implementation.
Language: PHP - Size: 33.2 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

sauce-code/chessy
Chessy is a simple Chess A.I. using a look-ahead strategy.
Language: Java - Size: 2.48 MB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

copriwolf/serverless-transitcode
A Tencent Serverless cloud function for transcoding and encryption/decryption; converts between encodings (base64, URL, HTML, Unicode, UTF-8<>GBK)
Language: Go - Size: 5.6 MB - Last synced at: 28 days ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 1

haliphax/x84-dockerfile 📦
Docker image for the x/84 Python BBS software
Language: Shell - Size: 9.77 KB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 1

Fdawgs/fix-latin1-to-utf8
Node.js module to fix errors when converting Latin-1 encoded text to UTF-8
Language: JavaScript - Size: 95.7 KB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 5 - Forks: 0

fab2s/OpinHelpers
A collection of simple, opinionated yet hopefully helpful PHP Helpers
Language: PHP - Size: 129 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

pweerd/bigfile
Viewer for very large logfiles and JSON/CSV/XML dumps
Language: C# - Size: 1.31 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Auties00/QrToTerminal
Prints a qr code generated from zxing to the terminal
Language: Java - Size: 8.79 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

mbarnig/cornerstoneDicomParserUTF8
Fork of Chris Hafey's Javascript DicomParser with added UTF8 support for DICOM Part 10 data
Language: JavaScript - Size: 1.5 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

jasmcaus/cstl
The neatest (mini)rewrite of the C/C++ Standard Library
Language: C - Size: 1.06 MB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

ljungloef/Pansar
A high performance, low memory allocation focused F# parser combinator library
Language: F# - Size: 38.1 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

berdav/zsh-digraphs
Insert VIM digraphs with ZSH
Language: Shell - Size: 55.7 KB - Last synced at: 20 days ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

novi/nkf-swift
nkf(Network Kanji Filter) for Swift
Language: C - Size: 224 KB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

Erutuon/lua-utf8-identifiers
Version of Lua with UTF-8 identifiers
Language: C - Size: 307 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

ianthehenry/utf8-parser
A fun way to learn about UTF-8 and parser combinators
Language: Haskell - Size: 168 KB - Last synced at: about 1 year ago - Pushed at: over 10 years ago - Stars: 5 - Forks: 0

SgtSilvio/gradle-defaults
Gradle plugin that configures sensible defaults
Language: Kotlin - Size: 378 KB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

original-birdman/nuemacs
An extension to uemacs p/K 4.015 which makes the bottom line a minibuffer. Plus more...
Language: C - Size: 1.88 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

nyaosorg/go-windows-mbcs
Convert between UTF-8 and non-UTF-8 character codes (ANSI) using the Windows APIs MultiByteToWideChar and WideCharToMultiByte
Language: Go - Size: 70.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

Krezalis/Cisco-79xx-Ukrainian
Ukrainian localization of the Cisco phone interface
Language: PHP - Size: 207 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Im-Rises/cUnicodeLib
Header-only C library for writing UTF-8 text to the console on Windows, macOS and Linux.
Language: C - Size: 61.5 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

fel88/GFXFontTool
GFX font viewer/generator for Arduino TFT
Language: C# - Size: 207 KB - Last synced at: 26 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

dnsquery/utf8-codec
utf8 to/from bytes codec (esm/cjs)
Language: JavaScript - Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

Prashoon123/base64-encoder-decoder
Helps to encode a string to base64 and decode a base64 string to a normal string.
Language: JavaScript - Size: 9.77 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

ssor/bom
Small tools for stripping the BOM from a byte array or reader
Language: Go - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4

xdevelnet/untranslable
Make texts harder to translate without a human
Language: C - Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

launchcodedev/ninja-print
Finding those ninja characters: Utility for printing human-readable special characters
Language: JavaScript - Size: 13.7 KB - Last synced at: 4 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

davidkennedydev/utf8_string_view
A string_view addressed to UTF-8 encoded characters.
Language: C++ - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

MatheusPrudente/special-character-codes
Table of special UTF-8 characters in HTML and JS
Size: 41 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

efmsoft/utf8
This library contains a set of classes for working with strings in utf8 format, as well as functions for converting strings in utf8, ANSI, utf16, utf32 formats. The most commonly used format conversion operations are converting from ANSI encoding (on Windows), as well as from a Unicode string
Language: C++ - Size: 194 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 2
