An open API service providing repository metadata for many open source software ecosystems.

Topic: "utf-8"

mpdf/mpdf

PHP library generating PDF files from UTF-8 encoded HTML

Language: PHP - Size: 93.4 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 4,498 - Forks: 1,077

danielstjules/Stringy

A PHP string manipulation library with multibyte support

Language: PHP - Size: 1.39 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 2,457 - Forks: 217

magiblot/tvision

A modern port of Turbo Vision 2.0, the classical framework for text-based user interfaces. Now cross-platform and with Unicode support.

Language: C++ - Size: 8.25 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 2,168 - Forks: 166

symfony/string

Provides an object-oriented API to strings and deals with bytes, UTF-8 code points and grapheme clusters in a unified way

Language: PHP - Size: 529 KB - Last synced at: 3 days ago - Pushed at: 8 days ago - Stars: 1,759 - Forks: 20

marzer/tomlplusplus

Header-only TOML config file parser and serializer for C++17.

Language: C++ - Size: 20.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,728 - Forks: 169

nemtrif/utfcpp

UTF-8 with C++ in a Portable Way

Language: C++ - Size: 173 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 1,685 - Forks: 211

ilai-deutel/kibi

A text editor in ≤1024 lines of code, written in Rust

Language: Rust - Size: 1.58 MB - Last synced at: 12 days ago - Pushed at: 18 days ago - Stars: 1,661 - Forks: 96

BalazsJako/ImGuiColorTextEdit

Colorizing text editor for ImGui

Language: C++ - Size: 1.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,525 - Forks: 263

bitcookies/winrar-keygen

Principle of WinRAR key generation.

Language: C++ - Size: 233 MB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 1,378 - Forks: 2,516

BurntSushi/bstr

A string type for Rust that is not required to be valid UTF-8.

Language: Rust - Size: 2.37 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 929 - Forks: 59

jagracey/Awesome-Unicode

😂 👌 A curated list of delightful Unicode tidbits, packages and resources.

Language: JavaScript - Size: 225 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 925 - Forks: 67

rhysd/kiro-editor

A small terminal UTF-8 text editor written in Rust 📝🦀

Language: Rust - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 745 - Forks: 32

AmokHuginnsson/replxx

A readline and libedit replacement that supports UTF-8, syntax highlighting, hints and Windows and is BSD licensed.

Language: C++ - Size: 891 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 707 - Forks: 114

polygonplanet/encoding.js

Convert and detect character encoding in JavaScript

Language: JavaScript - Size: 1.76 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 600 - Forks: 125

yf-hk/transliteration

UTF-8 to ASCII transliteration / slugify module for node.js, browser, Web Worker, React Native, Electron and CLI.

Language: TypeScript - Size: 1.9 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 583 - Forks: 53

rusticstuff/simdutf8

SIMD-accelerated UTF-8 validation for Rust.

Language: Rust - Size: 2.97 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 548 - Forks: 29

DuffsDevice/tiny-utf8

Unicode (UTF-8) capable std::string

Language: C++ - Size: 854 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 548 - Forks: 44

voku/portable-utf8

🉑 Portable UTF-8 library - performance optimized (unicode) string functions for PHP.

Language: PHP - Size: 8.74 MB - Last synced at: 1 day ago - Pushed at: 24 days ago - Stars: 516 - Forks: 86

magiblot/turbo

An experimental text editor based on Scintilla and Turbo Vision.

Language: C++ - Size: 1.29 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 516 - Forks: 36

JakubSzark/zig-string

A String Library made for Zig

Language: Zig - Size: 107 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 460 - Forks: 35

InstantWebP2P/peer-vnc

Secure Access VNC from anywhere based on noVNC

Language: JavaScript - Size: 28 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 337 - Forks: 74

adalkiran/llama-nuts-and-bolts

A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.

Language: Go - Size: 21.8 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 303 - Forks: 15

whatwg/encoding

Encoding Standard

Language: HTML - Size: 6.83 MB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 295 - Forks: 83

uni-algo/uni-algo

Unicode Algorithms Implementation for C/C++

Language: C++ - Size: 2.32 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 294 - Forks: 26

sallar/stringz

💯 Super fast Unicode-aware string manipulation JavaScript library

Language: TypeScript - Size: 1.03 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 237 - Forks: 11

ehmicky/cross-platform-terminal-characters

All the characters that work on most terminals

Language: JavaScript - Size: 4.64 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 228 - Forks: 6

Lichtso/netLink

Socket and Networking Library using msgpack.org[C++11]

Language: C++ - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 216 - Forks: 48

jecolon/ziglyph

Unicode text processing for the Zig programming language.

Size: 32.7 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 209 - Forks: 7

end2endzone/ShellAnything

ShellAnything is open-source C++ software that lets you easily customize and add new options to the *Windows Explorer* context menu. Define specific actions for when a user right-clicks a file or a directory.

Language: C++ - Size: 6.25 MB - Last synced at: 27 days ago - Pushed at: 5 months ago - Stars: 198 - Forks: 29

stuartcarnie/go-simd

Optimized functions for Go using SIMD

Language: Assembly - Size: 88.9 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 194 - Forks: 9

fe3dback/str

A fast, solid and strongly typed string manipulation library with multibyte support

Language: PHP - Size: 354 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 189 - Forks: 12

a-merezhanyi/voca_rs

Voca_rs is the ultimate Rust [unicode] string library, implemented as independent functions and on Foreign Types (String and str).

Language: Rust - Size: 3.56 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 182 - Forks: 11

U8String/U8String

[work-in-progress] Highly functional and performant UTF-8 string primitive for C#

Language: C# - Size: 2.96 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 173 - Forks: 2

anonyco/FastestSmallestTextEncoderDecoder

The fastest smallest Javascript polyfill for encodeInto of TextEncoder, encode of TextEncoder, and decode of TextDecoder for UTF-8 only.

Language: JavaScript - Size: 56.4 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 149 - Forks: 36

ww898/utf-cpp

UTF-8/16/32 C++11 header only library for Windows / Linux / macOS

Language: C++ - Size: 89.8 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 132 - Forks: 19

janlelis/unibits

Visualize different Unicode encodings in the terminal

Language: Ruby - Size: 1.51 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 129 - Forks: 3

MitchTalmadge/ASCII-Data

A small Java library for producing nice looking text-based line-graphs and tables.

Language: Java - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 124 - Forks: 14

websockets/utf-8-validate

Check if a buffer contains valid UTF-8

Language: JavaScript - Size: 118 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 119 - Forks: 36

jecolon/zigstr

Zigstr is a UTF-8 string type for Zig programs.

Size: 1.86 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 111 - Forks: 4

samthor/fast-text-encoding

Fast polyfill for TextEncoder and TextDecoder, only supports UTF-8

Language: JavaScript - Size: 117 KB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 106 - Forks: 31

thpatch/win32_utf8

Transparent UTF-8 support for native Win32 ANSI applications

Language: C - Size: 398 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 101 - Forks: 6

kayon/iploc 📦

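High-performance IP lookup library handling millions of queries per second, using the Chunzhen (qqwry) IP database with province, city, and county data for China; includes a qqwry.dat conversion tool (GBK to UTF-8)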

Language: Go - Size: 24.3 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 95 - Forks: 37

anthonynsimon/jurl

Fast and simple URL parsing for Java, with UTF-8 and path resolving support

Language: Java - Size: 211 KB - Last synced at: 3 days ago - Pushed at: about 6 years ago - Stars: 86 - Forks: 11

life4/homoglyphs 📦

Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.

Language: Python - Size: 563 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 22

cyb70289/utf8

Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)

Language: C - Size: 270 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 78 - Forks: 10

MJVL/UniObfuscator

Java obfuscator that hides code in comment tags and Unicode garbage by making use of Java's Unicode escapes.

Language: Java - Size: 3.1 MB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 6

ws-garcia/VBA-CSV-interface

The power you need to cleanse, filter, sort, reshape, manage and analyze data from CSV files.

Language: VBA - Size: 146 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 73 - Forks: 9

gaborcsardi/rencfaq

The R Encoding FAQ

Size: 177 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 3

sunxfancy/flex-bison-examples

A list of flex/bison examples demonstrating reentrancy, C++ integration, and error handling

Language: C - Size: 97.7 KB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 62 - Forks: 6

zrax/string_theory

Flexible modern C++ string library with type-safe formatting

Language: C++ - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 12

BobSteagall/utf_utils

My work on high-speed conversion of UTF-8 to UTF-32/UTF-16

Language: C++ - Size: 2.37 MB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 12

AidanSun05/ImGuiTextSelect

Text selection implementation for Dear ImGui

Language: C++ - Size: 554 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 58 - Forks: 7

figsoda/utf8

UTF-8 support for Nix

Language: Nix - Size: 33.2 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 57 - Forks: 0

eddieantonio/ocreval

Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support

Language: C - Size: 403 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 57 - Forks: 14

danielkrupinski/StringPool

A performant and memory efficient storage for immutable strings with C++17. Supports all standard char types: char, wchar_t, char16_t, char32_t and C++20's char8_t.

Language: C++ - Size: 96.7 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 56 - Forks: 8

benkasminbullock/unicode-c

A C library for handling Unicode, UTF-8, surrogate pairs, etc.

Language: C - Size: 189 KB - Last synced at: 16 days ago - Pushed at: almost 4 years ago - Stars: 51 - Forks: 8

sugawarayuuta/charcoal

Faster utf8.Valid using multi-byte processing without SIMD.

Language: Go - Size: 232 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 1

gpakosz/UnicodeBOMInputStream

Doing things right, in the name of Sun / Oracle

Language: Java - Size: 11.7 KB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 38 - Forks: 12

stimulsoft/Samples-Reports.WEB-for-ASP.NET-MVC

ASP.NET MVC samples for Reports.WEB embedded report components, Visual Studio C# projects, and .NET Framework 4.5.2, 4.6, 4.7, 4.8 report engine

Language: JavaScript - Size: 59 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 40

Aternus/csv-to-xlsx

Convert CSV files to XLSX (Excel 2007+ XML Format) files.

Language: TypeScript - Size: 382 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 14

Acceis/unisec

Unicode Security Toolkit

Language: Ruby - Size: 701 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 34 - Forks: 2

jasonlam604/Stringizer

String Manipulation Library for PHP with MultiByte support

Language: PHP - Size: 231 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 34 - Forks: 1

rick-de-water/Lingo

Text encoding for modern C++

Language: C++ - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 29 - Forks: 2

artichoke/intaglio

🗃 UTF-8 string, byte string, and C string interner

Language: Rust - Size: 2.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 27 - Forks: 1

janlelis/characteristics

Character info under different encodings

Language: Ruby - Size: 68.4 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 27 - Forks: 1

devstein/unicode-eth

The Unicode Ethereum Project is an initiative to provide libraries and contracts for Unicode data, algorithms, and utilities for Ethereum developers.

Language: Solidity - Size: 1.47 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 26 - Forks: 1

eriknyquist/boyermoore

Boyer-Moore in pure Python: search for Unicode strings in large files quickly

Language: Python - Size: 2.79 MB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 0

p-ranav/lexer

Hackable Lexer with UTF-8 support

Language: C++ - Size: 124 KB - Last synced at: 5 days ago - Pushed at: about 6 years ago - Stars: 22 - Forks: 0

myfreeer/nginx-build-msys2

static nginx build scripts on msys2 mingw with dependencies and custom patches for windows

Language: Shell - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 16

hcodes/isutf8

Quick check if a Node.js Buffer or Uint8Array is UTF-8

Language: TypeScript - Size: 1.09 MB - Last synced at: 24 days ago - Pushed at: 8 months ago - Stars: 21 - Forks: 3

janlelis/unicopy

Unicode command-line codepoint dumper

Language: Ruby - Size: 20.5 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 1

detiam/Linux-TTY-UTF-8-Patch

Let the Linux kernel TTY support UTF-8 (like CJKTTY)

Language: C - Size: 5.29 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 20 - Forks: 8

digital-preservation/utf8-validator

UTF-8 Validator

Language: Java - Size: 120 KB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 7

sauce-code/cuckoo

This is an adaption of Peter Österlund's CuckooChess 1.12. The source code provided is a Java Maven project in UTF-8. The program, except for the chess font, is copyrighted by Peter Österlund, and is available as open source under the GNU GPL v3 license.

Language: Java - Size: 332 KB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 18 - Forks: 24

hkspirt/ahocorasick

Sensitive-word filtering based on the Aho-Corasick algorithm; supports Chinese and is thread-safe

Language: Go - Size: 94.7 KB - Last synced at: 11 months ago - Pushed at: over 6 years ago - Stars: 18 - Forks: 3

ThinkR-open/utf8splain

Explain utf-8 encoded strings

Language: R - Size: 222 KB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 1

m2osw/libutf8

C++ UTF-8 string handling utilities with conversions and a simple to use iterator

Language: C++ - Size: 992 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 16 - Forks: 1

KaiHuaDou/calibre (fork of kovidgoyal/calibre)

Calibre that doesn't enforce ASCII filenames

Language: Python - Size: 290 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 16 - Forks: 1

krlmlr/enc 📦

A simple class for storing UTF-8 strings

Language: R - Size: 428 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 3

anonyco/BestBase64EncoderDecoder

The most standard, most cross-browser, most compact, and fastest possible btoa and atob solution for unicode strings with high code points.

Language: JavaScript - Size: 84 KB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 5

BobSteagall/CppCon2018

Materials from my talks from CppCon 2018

Language: C++ - Size: 3.81 MB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 2

m13253/libWinTF8

The library handling things related to UTF-8 and Unicode when you want to port your program to Windows

Language: C++ - Size: 142 KB - Last synced at: 25 days ago - Pushed at: over 8 years ago - Stars: 16 - Forks: 3

sanette/ubase

remove accents from utf8 strings

Language: OCaml - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 1

akoweb/tcpdf

Persian and Arabic fonts for TCPDF (PHP)

Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 8

jianboy/gbk2utf8

Tool for batch-converting files in other encodings to UTF-8. http://git.yoqi.me/lyq/gbk2utf8

Language: Python - Size: 5.86 KB - Last synced at: 25 days ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 11

rokups/nim-ustring

utf-8 string for Nim

Language: C - Size: 146 KB - Last synced at: 7 days ago - Pushed at: almost 6 years ago - Stars: 15 - Forks: 1

softlandia/cpd

code page detect

Language: Go - Size: 2.89 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

contrebande-labs/charred

CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell

Language: Python - Size: 264 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 3

Lekensteyn/lua-unicode

Patched Lua library to add UTF-8 support on Windows.

Language: CMake - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 5

Fourmilab/unum

Utility for looking up Unicode characters and HTML entities by code, name, block, or description. Written in Perl, compatible with almost any system that runs Perl.

Language: Perl - Size: 4.18 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 0

buildthomas/Demojify

Remove emoji characters from a string in Roblox

Language: Lua - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 3

ehmicky/string-byte-length

Get the UTF-8 byte length of a string.

Language: JavaScript - Size: 7.96 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 1

jfcherng/php-mb-string

An implementation targeting high performance for frequent read/write operations on multibyte strings.

Language: PHP - Size: 217 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 0

3urobeat/arduino-lcdHelper-library

Make working with LCD displays easier and improve UTF-8 support

Language: C++ - Size: 94.7 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 0

katahiromz/mcpp

UTF-16 readable C preprocessor (A fork of mcpp 2.7.2)

Language: C - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 12 - Forks: 0

devatrun/sutfcpplib

Simple UTF library for C++

Language: C++ - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

uetchy/binyl

🔬 Bitwise UTF-8 string inspector

Language: Rust - Size: 1 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 11 - Forks: 0

rfivet/uemacs

µEMACS (ue) on Cygwin/Linux/NetBSD, based on uEmacs/PK (em) from kernel.org.

Language: C - Size: 1.16 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 10 - Forks: 7

mnemnion/runeset

Fast UTF-8 codepoint sets for Zig.

Language: Zig - Size: 46.4 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 0

huanz/vscode-GBKtoUTF8

A VS Code extension to convert GBK to UTF-8

Language: TypeScript - Size: 44.9 KB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 4