Topic: "character-encoding"
paragonie/constant_time_encoding
Constant-Time Character Encoding in PHP Projects
Language: PHP - Size: 180 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 850 - Forks: 34

aadsm/jschardet
Character encoding auto-detection in JavaScript (port of Python's chardet)
Language: JavaScript - Size: 1.58 MB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 728 - Forks: 97

polygonplanet/encoding.js
Convert and detect character encoding in JavaScript
Language: JavaScript - Size: 1.76 MB - Last synced at: 2 days ago - Pushed at: 15 days ago - Stars: 601 - Forks: 125

SheetJS/js-codepage
💱 Codepages for JS
Language: JavaScript - Size: 13.5 MB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 152 - Forks: 38

sonicdoe/detect-character-encoding
Detect character encoding using ICU
Language: C++ - Size: 57 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 83 - Forks: 15

kanasimi/CeJS
The project aims to develop a JavaScript module framework that is simple to use and has some interesting features. The goal is a JavaScript module architecture that is easy to get started with.
Language: JavaScript - Size: 34.1 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 62 - Forks: 12

JuliaStrings/StringEncodings.jl
String encoding conversion in Julia using iconv
Language: Julia - Size: 135 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 42 - Forks: 8

t-sin/inquisitor
Encoding/end-of-line detection and external-format abstraction for Common Lisp
Language: Common Lisp - Size: 1.13 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 33 - Forks: 3

timakin/gonvert
Go character encoding converter with automatic encoding estimation.
Language: Go - Size: 46.9 KB - Last synced at: 17 days ago - Pushed at: over 8 years ago - Stars: 26 - Forks: 2

xiaooloong/lua-resty-iconv
LuaJIT FFI bindings for libiconv - Character encoding conversion in OpenResty
Language: Lua - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 22 - Forks: 7

MacPaw/universal-detector
A Library for character set autodetection
Language: C++ - Size: 788 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 16

vaiorabbit/everyday_use_kanji
Tables of jōyō kanji and jinmeiyō kanji, plus a code-generation script for ImGui
Language: C++ - Size: 5.82 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 4

krlmlr/enc 📦
A simple class for storing UTF-8 strings
Language: R - Size: 428 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3

goark/gnkf
Network Kanji Filter in Go
Language: Go - Size: 960 KB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 12 - Forks: 2

TwistAtom/ZWSP-Tool
ZWSP-Tool is a toolkit for manipulating zero-width spaces quickly and easily. In particular, it can detect, clean, hide, extract, and brute-force text containing zero-width spaces.
Language: Python - Size: 147 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 0

fumoboy007/UniversalCharsetDetection
A Swift wrapper around the `uchardet` library to detect the character encoding of a sequence of bytes.
Language: C++ - Size: 242 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 6

anshul1004/LyricsGenerator
Song lyrics generation using Recurrent Neural Networks (RNNs)
Language: Python - Size: 47.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

MuntahaShams/Character-level-LSTM-Pytorch
In this notebook, I'll construct a character-level LSTM with PyTorch. The network will train character by character on some text, then generate new text character by character. As an example, I will train on Anna Karenina. This model will be able to generate new text based on the text from the book!
Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

Emoun/racp
Revised ASCII Codes for Programming - A new character set for the modern programming environment
Language: Java - Size: 169 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

MarkMaldaba/PHPSerializationFixer
A tool to fix corrupted PHP serialised strings.
Language: PHP - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

KotlinNLP/LanguageModel
LanguageModel contains a number of methods for language modeling based on the SimpleDNN library.
Language: Kotlin - Size: 112 KB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

exponential-decay/fndec
Guess character encoding in your filenames
Language: Go - Size: 121 KB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

kshetline/by-request
Simple Node http client for use with promises, async/await.
Language: TypeScript - Size: 673 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dahlia/html-charset
Determine character encoding of HTML documents/fragments
Language: Haskell - Size: 19.5 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

SwamiKannan/Automatic-Shakespeare
Auto-generated text based on Shakespeare poetry
Language: Jupyter Notebook - Size: 37.2 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

m7modg97/dotmatrix Fork of stefangordon/dotmatrixtool
A web application for generating character or image byte arrays for dot matrix style OLED or LCD displays.
Language: JavaScript - Size: 344 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

marschall/writers
implementations of java.io.Writer with different trade-offs
Language: Java - Size: 50.8 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

simplepie/text-encoder
Don't use this yet. Modern character encoding detection and conversion using Intl and ICU.
Language: PHP - Size: 367 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

busterc/x-base-32
🔣 Base32 encoding and decoding that optimizes byte size based on input characters and supports Unicode.
Language: JavaScript - Size: 86.9 KB - Last synced at: 21 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

shiyou0130011/Numeric-Character-Reference-js
Language: TypeScript - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

curegit/unicodecheck
Simple tool to check if Unicode text files are Unicode-normalized
Language: Python - Size: 52.7 KB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

MarjovanLier/StringManipulation
A comprehensive PHP library for efficient string manipulation, providing functions for search word transformation, name fixing, UTF-8 to ANSI conversion, accent removal, and date & time validation.
Language: PHP - Size: 354 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

zjimn/EncodingConverter
A user-friendly desktop application designed to efficiently convert the character encoding of text files. Whether you're dealing with different languages' encodings or need to standardize file formats, this tool simplifies your workflow with an intuitive graphical interface.
Language: Python - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

raminsharifian/ASCII-Character-Set
This repository contains a comprehensive PDF guide on the ASCII character set in C programming, including a detailed ASCII chart with decimal, octal, hexadecimal, and character representations.
Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Uuwai/CharacterEncoding-form
Changes the string to another encoding
Language: C# - Size: 10.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

KosmX/utf-8_transcoder
Convert legacy Windows encodings to UTF-8, but leave UTF-8 files untouched
Language: Java - Size: 178 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

a-t-em/spam-text-classification
LSTM models for text classification on character embeddings.
Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

trwyant/perl-Encode-RAD50
Convert to and from the Rad50 character set
Language: Perl - Size: 128 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sean0505tsai/ct2023s Fork of tjhsieh/ct2023s
Computer Typography and Character Encoding course assignments, 2023 Spring, NTUT
Language: HTML - Size: 111 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

exponential-decay/pathlesstaken
Profile strings, such as file paths, for digital preservation considerations, e.g. characters that you do or don't want to preserve.
Language: Python - Size: 254 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

lupuionut/html-ref-encode
Encode HTML text using numeric character references.
Language: Go - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

vudaoanhtuan/character_language_model
Character language model for Vietnamese names, LSTM, character embedding
Language: Jupyter Notebook - Size: 3.66 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

bballamudi/Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs Fork of kamalkraj/Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs
Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs
Size: 1.09 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

robert-d-schultz/encoding-recurrent
LSTM neural networks that classify byte sequences by their encoding.
Language: Python - Size: 5.3 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

ggthedev/chardet Fork of chardet/chardet
Python 2/3 compatible character encoding detector.
Language: Python - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

katmore/intl-rand-string
generate internationalized random alpha-numeric strings
Language: PHP - Size: 77.1 KB - Last synced at: 22 days ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

gungwald/find-non-ascii-c
Finds non-ASCII characters in a text file, C implementation, Windows and Linux
Language: C - Size: 50.8 KB - Last synced at: 12 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

gunim/cp857
CP857 package for Go
Language: Go - Size: 8.79 KB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

ajmd17/simple-utf-8
Useful UTF-8 handling utilities for C++
Language: C++ - Size: 220 KB - Last synced at: 4 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

sp1ff/numblr
IOStreams & character encodings
Language: Makefile - Size: 167 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

sheetjs/js-codepage
💱 Codepages for JS
Language: JavaScript - Size: 13.5 MB - Last synced at: over 1 year ago - Stars: 0 - Forks: 0