An open API service providing repository metadata for many open source software ecosystems.

Topic: "utf8"

sheredom/utf8.h

📚 single header utf8 string functions for C and C++

Language: C - Size: 275 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 1,815 - Forks: 132

symfony/string

Provides an object-oriented API to strings and deals with bytes, UTF-8 code points and grapheme clusters in a unified way

Language: PHP - Size: 528 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 1,757 - Forks: 21

BalazsJako/ImGuiColorTextEdit

Colorizing text editor for ImGui

Language: C++ - Size: 1.4 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 1,525 - Forks: 263

simdutf/simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.

Language: C++ - Size: 7.76 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,346 - Forks: 87

d99kris/rapidcsv

C++ CSV parser library

Language: C++ - Size: 15.2 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 948 - Forks: 190

jagracey/Awesome-Unicode

:joy: :ok_hand: A curated list of delightful Unicode tidbits, packages and resources.

Language: JavaScript - Size: 225 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 924 - Forks: 67

moononournation/Arduino_GFX

Arduino GFX developing for various color displays and various data bus interfaces

Language: C - Size: 39.4 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 922 - Forks: 179

mathiasbynens/utf8.js

A robust JavaScript implementation of a UTF-8 encoder/decoder, as defined by the Encoding Standard.

Language: JavaScript - Size: 59.6 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 562 - Forks: 115

DuffsDevice/tiny-utf8

Unicode (UTF-8) capable std::string

Language: C++ - Size: 854 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 548 - Forks: 44

voku/portable-utf8

🉑 Portable UTF-8 library - performance optimized (unicode) string functions for PHP.

Language: PHP - Size: 8.74 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 516 - Forks: 85

haskell/text

Haskell library for space- and time-efficient operations over Unicode text.

Language: Haskell - Size: 3.43 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 412 - Forks: 158

cytopia/awesome-ci

Awesome Continuous Integration - Lot's of tools for git, file and static source code analysis.

Language: Shell - Size: 318 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 334 - Forks: 20

anyascii/anyascii

Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET

Language: Kotlin - Size: 69.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 305 - Forks: 25

uni-algo/uni-algo

Unicode Algorithms Implementation for C/C++

Language: C++ - Size: 2.32 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 294 - Forks: 26

p-ranav/hypergrep

Recursively search directories for a regex pattern

Language: C++ - Size: 9.31 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 213 - Forks: 7

a-merezhanyi/voca_rs

Voca_rs is the ultimate Rust [unicode] string library, implemented as independent functions and on Foreign Types (String and str).

Language: Rust - Size: 3.56 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 182 - Forks: 11

Code-Hex/firebase-auth-cloudflare-workers

Language: TypeScript - Size: 635 KB - Last synced at: 21 days ago - Pushed at: 4 months ago - Stars: 150 - Forks: 6

anonyco/FastestSmallestTextEncoderDecoder

The fastest smallest Javascript polyfill for encodeInto of TextEncoder, encode of TextEncoder, and decode of TextDecoder for UTF-8 only.

Language: JavaScript - Size: 56.4 MB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 149 - Forks: 36

Stepets/utf8.lua

pure-lua 5.3 regex library

Language: Lua - Size: 138 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 143 - Forks: 27

ww898/utf-cpp

UTF-8/16/32 C++11 header only library for Windows / Linux / macOS

Language: C++ - Size: 89.8 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 132 - Forks: 19

websockets/utf-8-validate

Check if a buffer contains valid UTF-8

Language: JavaScript - Size: 118 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 119 - Forks: 36

x1angli/cvt2utf

This lightweight tool converts non-UTF-encoded (such as GB2312, GBK, BIG5 encoded) files to UTF-8 encoding.

Language: Python - Size: 84 KB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 27

life4/homoglyphs 📦

Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.

Language: Python - Size: 563 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 22

moehriegitt/vastringify

Type-safe Printf in C

Language: C - Size: 238 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 72 - Forks: 5

ivanseidel/unicute

💙 Cute Unicode symbols. Make the terminal GREAT AGAIN

Size: 1000 Bytes - Last synced at: 18 days ago - Pushed at: about 8 years ago - Stars: 59 - Forks: 2

soasis/cuneicode

A C library for converting between two different encodings in a simple, easy, and powerful way.

Language: C++ - Size: 2.24 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 57 - Forks: 7

serokell/haskell-with-utf8

Get your IO right on the first try

Language: Haskell - Size: 159 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 54 - Forks: 3

SynchronetBBS/sbbs

Mirror of gitlab.synchro.net/sbbs (don't submit pull requests here)

Language: C - Size: 209 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 52 - Forks: 14

figsoda/utf8

UTF-8 support for Nix

Language: Nix - Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 0

p-ranav/unicode_display_width

Displayed width of UTF-8 strings in Modern C++

Language: C++ - Size: 498 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 47 - Forks: 6

vilien/base64-utf8

无依赖utf8字符base64编/解码模块,可安全用于微信小程序

Language: JavaScript - Size: 13.7 KB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 16

jwerle/libutf8

A whatwg compliant UTF8 encoding and decoding library

Language: C - Size: 14.6 KB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 35 - Forks: 4

patch/unicode-programming

Unicode programming examples

Size: 38.1 KB - Last synced at: 28 days ago - Pushed at: over 8 years ago - Stars: 35 - Forks: 2

Gumichan01/utf8_string

A simple implementation of utf8 strings for C++

Language: C++ - Size: 252 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 6

maoxuepeng/mysqlutf8

默认支持utf8编码的MySQL镜像

Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 28 - Forks: 16

maxlath/fix-utf8

Fix Unicode encoding errors

Language: JavaScript - Size: 62.5 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 26 - Forks: 8

kimsehwan96/pyjosa

간단한 파이썬 🇰🇷 한글 조사처리 라이브러리 은/는 와/과 이/가 등을 처리합니다. PyPI에 배포한 오픈소스 프로젝트입니다.

Language: Python - Size: 2.91 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 23 - Forks: 1

zzzsochi/trans

National characters transcription module.

Language: Python - Size: 55.7 KB - Last synced at: 2 days ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 10

saberzero1/unzip-jp-gui

Unzip Japanese Shift-JIS zip archives on non-Japanese systems.

Language: Python - Size: 76.2 KB - Last synced at: 21 days ago - Pushed at: 11 months ago - Stars: 22 - Forks: 4

eriknyquist/boyermoore

Boyer-moore in pure python, search for unicode strings in large files quickly

Language: Python - Size: 2.79 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 0

msztolcman/subst

Search and des... argh... replace in many files at once. Use regexp and power of Python to replace what you want.

Language: Python - Size: 165 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 3

sauce-code/cuckoo

This is an adaption of Peter Österlund's CuckooChess 1.12. The source code provided is a Java Maven project in UTF-8. The program, except for the chess font, is copyrighted by Peter Österlund, and is available as open source under the GNU GPL v3 license.

Language: Java - Size: 332 KB - Last synced at: 13 days ago - Pushed at: 23 days ago - Stars: 18 - Forks: 24

anonyco/BestBase64EncoderDecoder

The most standard, most cross-browser, most compact, and fastest possible btoa and atob solution for unicode strings with high code points.

Language: JavaScript - Size: 84 KB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 5

mediamonks/symfony-mssql-bundle

Greatly improves mssql support for Symfony on Unix using pdo_dblib

Language: PHP - Size: 35.2 KB - Last synced at: 19 days ago - Pushed at: almost 8 years ago - Stars: 16 - Forks: 2

BassLC/idUTF8lib

Idiot's UTF-8 Library

Language: C++ - Size: 6.67 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 5

nigels-com/tutf8e

Tiny UTF-8 Encoder for C

Language: C - Size: 126 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 8

cytopia/docker-file-lint

Alpine-based Docker image to perform generic file checks on your source code in order to improve consistency within your repository (e.g. for easy usage in CI).

Language: Shell - Size: 87.9 KB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 4

Stenway/RSV-Specification

Rows of String Values (RSV Data Format) Specification - A Simple Binary Alternative to CSV

Size: 477 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

svenvc/UTF8String

A proof of concept / prototype alternative String implementation for Pharo using a variable length UTF8 encoded internal representation

Language: Smalltalk - Size: 44.9 KB - Last synced at: 19 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 0

apollo008/orchid-fst

This project Orchid-Fst implements a fast text string dictionary search data structure: Finite state transducer (short for FST) in c++ language.This FST C++ open source project has much significant advantages.

Language: C++ - Size: 7.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 2

rastapasta/invisible-attachment

🙈 Utilize invisible UTF8-characters to encode and attach any integer to a string without changing its visual appearance

Language: JavaScript - Size: 78.1 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

devatrun/sutfcpplib

Simple UTF library for C++

Language: C++ - Size: 20.5 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

ikozyris/kri

simple, compact & very fast text editor

Language: C++ - Size: 769 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 10 - Forks: 1

stgatilov/utf8lut

Vectorized UTF-8 conversion with LookUp Tables

Language: C++ - Size: 1.08 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

huanz/vscode-GBKtoUTF8

a vscode extension to convert gbk to utf8

Language: TypeScript - Size: 44.9 KB - Last synced at: 25 days ago - Pushed at: over 8 years ago - Stars: 10 - Forks: 4

gene-hightower/ghsmtp

Gene's SMTP server — receive Internet mail with less fuss

Language: C++ - Size: 2.32 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 3

sindresorhus/gulp-bom

Add a UTF-8 BOM to files

Language: JavaScript - Size: 11.7 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1

mirage/uuuu

Language: OCaml - Size: 97.7 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 2

sasairc/clangsay

The classic cowsay program, written in C.

Language: C - Size: 364 KB - Last synced at: about 1 hour ago - Pushed at: about 4 years ago - Stars: 9 - Forks: 3

grantm/encoding-fixlatin

CPAN module: Fixes Latin-1 and CP1252 characters in UTF8 data

Language: Perl - Size: 221 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 4

LB--/utf

My personal C++14 UTF-8 library in the public domain.

Language: C++ - Size: 55.7 KB - Last synced at: 7 days ago - Pushed at: over 8 years ago - Stars: 9 - Forks: 3

vampirefrog/x68ksjis

X68000 specific Shift-JIS to/from Unicode conversion code

Language: C - Size: 132 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2

duzun/string-encode.js

Convert different types of JavaScript String to/from Uint8Array

Language: JavaScript - Size: 177 KB - Last synced at: 26 days ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 0

hayk314/LaTex-handler

functionality for working with LaTex files

Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 5

guzba/unicody

An alternative / companion to Nim's std/unicode and std/strutils.

Language: Nim - Size: 109 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 0

lambdalisue/rs-async-utf8-decoder

🦀 Utf8 decoder for AsyncRead in futures-rs

Language: Rust - Size: 69.3 KB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

rvncerr/u8

Do not like ICU? Here is a basic UTF-8 library on C.

Language: C - Size: 6.84 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 0

mirage/yuscii

UTF-7 decoder to Unicode

Language: C - Size: 59.6 KB - Last synced at: 5 days ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 2

hermanzdosilovic/petiteutf8

Petite C++17 UTF-8 library

Language: C++ - Size: 28.3 KB - Last synced at: about 6 hours ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 1

hetao29/sphinx Fork of sphinxsearch/sphinx

Sphinx for Chinese with scws 使用方法参考 https://github.com/hetao29/sphinx-chinese

Language: C++ - Size: 23.1 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 3

rmawatson/utf

utf iterators & converters for modern c++

Language: C++ - Size: 175 KB - Last synced at: about 8 hours ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

ivantcholakov/codeigniter-utf8

UTF-8 string support for CodeIgniter based on Kohana's implementation.

Language: PHP - Size: 33.2 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

sauce-code/chessy

Chessy is a simple Chess A.I. using a look-ahead strategy.

Language: Java - Size: 2.48 MB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

copriwolf/serverless-transitcode

一个用于转码/加解密的 Tencent Serverless 云函数。a tencent serverless application for converting codes(base64, url, html, unicode, utf8<>GBK)

Language: Go - Size: 5.6 MB - Last synced at: 28 days ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 1

haliphax/x84-dockerfile 📦

Docker image for the x/84 Python BBS software

Language: Shell - Size: 9.77 KB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 1

Fdawgs/fix-latin1-to-utf8

Node.js module to fix errors when converting Latin-1 encoded text to UTF-8

Language: JavaScript - Size: 95.7 KB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 5 - Forks: 0

fab2s/OpinHelpers

A collection of simple, opinionated yet hopefully helpful PHP Helpers

Language: PHP - Size: 129 KB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

pweerd/bigfile

Viewer for very large logfiles and JSON/CSV/XML dumps

Language: C# - Size: 1.31 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Auties00/QrToTerminal

Prints a qr code generated from zxing to the terminal

Language: Java - Size: 8.79 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

mbarnig/cornerstoneDicomParserUTF8

Fork of Chris Hafey's Javascript DicomParser with added UTF8 support for DICOM Part 10 data

Language: JavaScript - Size: 1.5 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 2

jasmcaus/cstl

The neatest (mini)rewrite of the C/C++ Standard Library

Language: C - Size: 1.06 MB - Last synced at: 22 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

ljungloef/Pansar

A high performance, low memory allocation focused F# parser combinator library

Language: F# - Size: 38.1 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

berdav/zsh-digraphs

Insert VIM digraphs with ZSH

Language: Shell - Size: 55.7 KB - Last synced at: 20 days ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 0

novi/nkf-swift

nkf(Network Kanji Filter) for Swift

Language: C - Size: 224 KB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 2

Erutuon/lua-utf8-identifiers

Version of Lua with UTF-8 identifiers

Language: C - Size: 307 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

ianthehenry/utf8-parser

A fun way to learn about UTF-8 and parser combinators

Language: Haskell - Size: 168 KB - Last synced at: about 1 year ago - Pushed at: over 10 years ago - Stars: 5 - Forks: 0

SgtSilvio/gradle-defaults

Gradle plugin that configures sensible defaults

Language: Kotlin - Size: 378 KB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

original-birdman/nuemacs

An extension to uemacs p/K 4.015 which makes the bottom line a minibuffer. Plus more...

Language: C - Size: 1.88 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

nyaosorg/go-windows-mbcs

Convert between UTF8 and non-UTF8 character codes(ANSI) using Windows APIs: MultiByteToWideChar and WideCharToMultiByte

Language: Go - Size: 70.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

Krezalis/Cisco-79xx-Ukrainian

Українізація інтерфейсту телефону Cisco

Language: PHP - Size: 207 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Im-Rises/cUnicodeLib

C header only Library to write UTF8 text to the console for Windows, macOs and Linux.

Language: C - Size: 61.5 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

fel88/GFXFontTool

GFX font viewer/generator for Arduino TFT

Language: C# - Size: 207 KB - Last synced at: 26 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

dnsquery/utf8-codec

utf8 to/from bytes codec (esm/cjs)

Language: JavaScript - Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

Prashoon123/base64-encoder-decoder

Helps to encode a string to base64 and decode a base64 string to a normal string.

Language: JavaScript - Size: 9.77 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

ssor/bom

small tools for cleaning bom from byte array or reader

Language: Go - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4

xdevelnet/untranslable

Make texts harder to translate without human

Language: C - Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

launchcodedev/ninja-print

Finding those ninja characters: Utility for printing human-readable special characters

Language: JavaScript - Size: 13.7 KB - Last synced at: 4 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

davidkennedydev/utf8_string_view

A string_view addressed to UTF-8 encoded characters.

Language: C++ - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

MatheusPrudente/special-character-codes

Tabela de caracteres UTF-8 especiais no HTML e JS

Size: 41 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

efmsoft/utf8

This library contains a set of classes for working with strings in utf8 format, as well as functions for converting strings in utf8, ANSI, utf16, utf32 formats. The most commonly used format conversion operations are converting from ANSI encoding (on Windows), as well as from a Unicode string

Language: C++ - Size: 194 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 2