Topic: "utf-16"
nemtrif/utfcpp
UTF-8 with C++ in a Portable Way
Language: C++ - Size: 173 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 1,693 - Forks: 212

jagracey/Awesome-Unicode
:joy: :ok_hand: A curated list of delightful Unicode tidbits, packages and resources.
Language: JavaScript - Size: 225 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 926 - Forks: 67

uni-algo/uni-algo
Unicode Algorithms Implementation for C/C++
Language: C++ - Size: 2.32 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 294 - Forks: 26

ww898/utf-cpp
UTF-8/16/32 C++11 header only library for Windows / Linux / macOS
Language: C++ - Size: 89.8 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 132 - Forks: 19

janlelis/unibits
Visualize different Unicode encodings in the terminal
Language: Ruby - Size: 1.51 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 129 - Forks: 3

VoidStarKat/widestring-rs
A wide string Rust library for converting to and from wide-character strings, including UTF-16 and UTF-32 encoding.
Language: Rust - Size: 591 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 88 - Forks: 26

BobSteagall/utf_utils
My work on high-speed conversion of UTF-8 to UTF-32/UTF-16
Language: C++ - Size: 2.37 MB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 12

danielkrupinski/StringPool
A performant and memory efficient storage for immutable strings with C++17. Supports all standard char types: char, wchar_t, char16_t, char32_t and C++20's char8_t.
Language: C++ - Size: 96.7 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 56 - Forks: 8

Bodigrim/text-rope
A wrapper around Text for fast line/column navigation and logarithmic concatenation
Language: Haskell - Size: 124 KB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 37 - Forks: 3

Acceis/unisec
Unicode Security Toolkit
Language: Ruby - Size: 701 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 34 - Forks: 2

TheFel0x/img2braille
Turns image into Unicode Braille art. (Now also with color!)
Language: Python - Size: 81.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 34 - Forks: 3

rick-de-water/Lingo
Text encoding for modern C++
Language: C++ - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 29 - Forks: 2

LeeeeeeM/bilibili-web-socket
弹幕 web websocket
Language: TypeScript - Size: 808 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 20 - Forks: 9

Juici/wchar-rs
A procedural macro for compile time UTF-16 strings
Language: Rust - Size: 105 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 20 - Forks: 9

softlandia/cpd
code page detect
Language: Go - Size: 2.89 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

contrebande-labs/charred
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell
Language: Python - Size: 264 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 3

JuliaStrings/LegacyStrings.jl
Legacy Unicode string types
Language: Julia - Size: 93.8 KB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 17

ehmicky/string-byte-length
Get the UTF-8 byte length of a string.
Language: JavaScript - Size: 7.96 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 1

katahiromz/mcpp
UTF-16 readable C preprocessor (A fork of mcpp 2.7.2)
Language: C - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 12 - Forks: 0

devatrun/sutfcpplib
Simple UTF library for C++
Language: C++ - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

screepers/utf15
Library (codec generator) for packing integers to JavaScript UTF-16 strings
Language: JavaScript - Size: 18.6 KB - Last synced at: 16 days ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 2

getsentry/utf16string
UTF-16 string types
Language: Rust - Size: 43 KB - Last synced at: 2 days ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 7

noraj/unisec Fork of Acceis/unisec
Unicode Security Toolkit
Language: Ruby - Size: 761 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 8 - Forks: 0

ehmicky/string-byte-slice
Like `string.slice()` but bytewise.
Language: JavaScript - Size: 7.85 MB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 0

danielaparker/unicode_traits
The C++ unicode_traits class template makes using unicode easier
Language: C++ - Size: 303 KB - Last synced at: 25 days ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 2

schollz/utfdecode
A simple library to decode a string that contains escaped UTF-16/32 characters into unescaped UTF-16.
Language: Go - Size: 12.7 KB - Last synced at: 1 day ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 1

katahiromz/UTF
C/C++ UTF-8, UTF-16, UTF-32 conversion library
Language: C++ - Size: 35.2 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 6 - Forks: 2

fornwall/utfdecode
Unicode decoder, encoder and debugger command-line tool.
Language: C++ - Size: 1.5 MB - Last synced at: 5 days ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 3

nathancorvussolis/cveuc
character encodings converter for Windows. EUC-JIS-2004, EUC-JP, UTF-16 and UTF-8 supported
Language: C++ - Size: 24.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

Loara/StringSuite
C++ library to manage strings with different encodings
Language: C++ - Size: 864 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

Photosounder/fopen_utf8
Like fopen() except it always works with UTF-8 paths
Language: C - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

efmsoft/utf8
This library contains a set of classes for working with strings in utf8 format, as well as functions for converting strings in utf8, ANSI, utf16, utf32 formats. The most commonly used format conversion operations are converting from ANSI encoding (on Windows), as well as from a Unicode string
Language: C++ - Size: 194 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 2

mbits-libs/utfconv
Conversion library between string, u16string, u32string and u8string
Language: C++ - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 2

stdlib-js/string-from-code-point
Create a string from a sequence of Unicode code points.
Language: JavaScript - Size: 920 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

stdlib-js/constants-unicode-max
Maximum Unicode code point.
Language: JavaScript - Size: 308 KB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

i2van/UnicodeCharsetDetector Fork of posledam/UnicodeCharsetDetector
Text files Unicode charset detector w/wo BOM.
Language: C# - Size: 523 KB - Last synced at: 21 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

stdlib-js/string-utf16-to-utf8-array
Convert a UTF-16 encoded string to an array of integers using UTF-8 encoding.
Language: JavaScript - Size: 839 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

stdlib-js/string-code-point-at
Return a Unicode code point from a string at a specified position.
Language: JavaScript - Size: 861 KB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

JDanielSmith/Base16k
More efficient encoding of binary data in a Unicode (i.e., UTF-16) string
Language: C# - Size: 1.19 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

railgunlabs/charisma
Secure Unicode® character decoders and encoders.
Language: C - Size: 67.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

stdlib-js/constants-unicode-max-bmp
Maximum Unicode code point in the Basic Multilingual Plane (BMP).
Language: JavaScript - Size: 314 KB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

stdlib-js/assert-has-utf16-surrogate-pair-at
Test if a position in a string marks the start of a UTF-16 surrogate pair.
Language: JavaScript - Size: 769 KB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

stdlib-js/string-next-grapheme-cluster-break
Return the next extended grapheme cluster break in a string after a specified position.
Language: JavaScript - Size: 1.06 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

stdlib-js/regexp-utf16-surrogate-pair
Return a regular expression to match a UTF-16 surrogate pair.
Language: JavaScript - Size: 549 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

stdlib-js/regexp-utf16-unpaired-surrogate
Return a regular expression to match an unpaired UTF-16 surrogate.
Language: JavaScript - Size: 463 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

stdlib-js/string-tools-grapheme-cluster-break
Grapheme cluster break tooling.
Language: JavaScript - Size: 720 KB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

maxdz-gmbh/mdz_unicode
Very lightweight and portable ANSI C 89/90 compliant library for handling Unicode strings.
Language: C - Size: 2.01 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

NuloSois/Tecelan
Program for Analyzing, Sorting, Encoding and Decoding Strings.
Language: C# - Size: 226 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

loSpaccaBit/color
Color in c++, How to use color and resize console in c++, Come usare i colori e ridimensionare la console in c++ Libreria grafica.
Language: C++ - Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

GeoRouv/UTF8-to-UTF16
🔁 Another simple converter!
Language: C - Size: 23.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

glukianets/Strings
Additional materials for my 2022 Mobius Presentation
Language: Swift - Size: 10.1 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

fab2s/Bom
A purely static zero dependencies Helper to handle unicode BOMs
Language: PHP - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

DevelopersTree/Encoding
Convert text to UTF-8, UTF-16 and UTF-32 online
Language: JavaScript - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

stdlib-js/string-base-for-each
Invoke a function for each UTF-16 code unit in a string.
Language: JavaScript - Size: 259 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 1 - Forks: 0

stdlib-js/string-base-code-point-at
Return a Unicode code point from a string at a specified position.
Language: JavaScript - Size: 288 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

stdlib-js/string-next-code-point-index
Return the position of the next Unicode code point in a string after a specified position.
Language: JavaScript - Size: 302 KB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

xavetar/COXave
Instruments for codings
Language: Rust - Size: 2.79 MB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

stdlib-js/string-to-well-formed
Convert a string to a well formed string.
Language: JavaScript - Size: 182 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

stdlib-js/string-prev-grapheme-cluster-break
Return the previous extended grapheme cluster break in a string before a specified position.
Language: JavaScript - Size: 1.19 MB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

stdlib-js/string-base-for-each-right
Invoke a function for each UTF-16 code unit in a string iterating from right to left.
Language: JavaScript - Size: 144 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

peelonet/peelo-unicode
Simple Unicode utilities for C++
Language: C++ - Size: 616 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

stdlib-js/string-base-percent-encode
Percent-encode a UTF-16 encoded string according to RFC 3986.
Language: JavaScript - Size: 524 KB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

stdlib-js/constants-unicode
Standard string unicode constants.
Language: JavaScript - Size: 402 KB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

stdlib-js/string-percent-encode
Percent-encode a UTF-16 encoded string according to RFC 3986.
Language: JavaScript - Size: 896 KB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

skial/seri
Unicode information. Blocks, scripts, classes and ranges for Haxe. [Experimental]
Language: Haxe - Size: 4.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

skial/regex
Helper methods to cross-compile Unicode regular expressions in Haxe.
Language: Haxe - Size: 44.9 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

AntonV0/mixed-messages
Project #35 of Codecademy's Full Stack Engineer Career Path
Language: JavaScript - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

merveeDogan/Bin2XML
Language: C - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

GeoRouv/UTF16-to-UTF8
🔁 A simple converter
Language: C - Size: 25.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

monomonedula/temeco
Telegram Bot API message entities to HTML translator
Language: Python - Size: 22.5 KB - Last synced at: 15 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

pRizz/Unicode-Converter Fork of r12a/app-conversion
Language: HTML - Size: 88.9 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

visualdoj/dunicode
Unicode UTF-8 and UTF-16 units for Free Pascal
Language: Pascal - Size: 44.9 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

circulosmeos/Perl-with-Unicode-for-Windows
recompilation of Perl for Windows with Unicode support on command-line
Language: C - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

TeamTest57/utflib 📦
🗜️: UTF-8 UTF-16 UTF-32
Language: C++ - Size: 40 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

subratamal/unicode-converter
Converts unicode strings/code points across languages/encodings.
Size: 1000 Bytes - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

leoovs/unipp
Iterator-based Unicode conversion
Language: C++ - Size: 48.8 KB - Last synced at: about 4 hours ago - Pushed at: about 4 hours ago - Stars: 0 - Forks: 0

paulhuggett/icubaby
A C++ header-only library for converting between Unicode encodings
Language: C++ - Size: 1.86 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 1

mulle-c/mulle-utf
🔤 UTF8-16-32 analysis and manipulation library
Language: C - Size: 1.48 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 1

Borewit/read-next-line
Utility for reading lines from a `ReadableStream` in JavaScript
Language: JavaScript - Size: 269 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

shimoning/php-bom
BOM を取得するためだけのユーティリティ
Language: PHP - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

PranoSA/LearnUTF8Web
Frontend For Unicode Calculator Application
Language: TypeScript - Size: 203 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

cs127/unicorn
lightweight implementation of wide character functions for C
Language: C - Size: 22.5 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

maxdz-gmbh/mdz_string
Very lightweight and portable ANSI C 89/90 compliant library for handling single-byte (ASCII/ANSI) strings and Unicode strings.
Language: C - Size: 240 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

jurgen-kluft/cstring
string library (WIP = work in progress)
Language: C++ - Size: 275 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

vinser/u8xml
u8xml NewDecoder func can be used to parse XML files with IANA character encodings instead of Go Standard Library xml package Decoder
Language: Go - Size: 13.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

FedericoBruzzone/ende
A rust library for encoding/decoding unicode/utf-8/utf-16(ucs-2) code points. 🦀
Language: Rust - Size: 59.6 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

szawrowski/unicore
Lightweight Unicode Library
Language: C++ - Size: 62.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

max-and-me/utf-8-everywhere
UTF-8 Everywhere
Language: C++ - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

matgat/unicode_text
A small c++ library for unicode encoding and decoding
Language: C++ - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

PranoSA/LearnUTF8Sam
Backend For Unicode Calculator
Language: Python - Size: 24.4 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kaan-yalcinkaya/CUnicode
An Unicode library for C and C++.
Language: C - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dfint/utf8-decoder
A prototype of a minimalistic utf-8 to utf-16 converter. Includes implementations in Python and C.
Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rampaa/UnicodeRangeToUtf16CompliantRegex
Non-Basic Multilingual Plane Regex Ranges to UTF-16 Compliant Regex
Language: C# - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

blackoutroulette/PyBase32k
Python3 port of https://github.com/qntm/base32768
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KMX-Systems/conversion-to-utf8
Converter from UTF-16 and UTF-32 to UTF-8 (header only)
Language: C++ - Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SecOpsBear/sb-shells
Generate reverse shells
Language: Go - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bmarlo/utf-8
UTF-8 encoder/decoder for C++17
Language: C++ - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

pixa-pics/UTFzap
UTF-16 up to half lighter and 100-175% Faster!
Language: JavaScript - Size: 2.94 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

thatisuday/utf-info
Get encoding information of a character in UTF-8, UTF-16 and UTF-32 encodings.
Language: JavaScript - Size: 75.2 KB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mulekick/bitwise-escape
Single HTML page for dynamic display of two unsigned integer numbers in base 10, 2 and 16, as well as base 10, 2 and 16 dynamic computation of the results of bitwise operations on those numbers. Easy to use and intuitive.
Language: HTML - Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
