An open API service providing repository metadata for many open source software ecosystems.

Topic: "character-encoding"

paragonie/constant_time_encoding

Constant-Time Character Encoding in PHP Projects

Language: PHP - Size: 180 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 850 - Forks: 34

aadsm/jschardet

Character encoding auto-detection in JavaScript (port of Python's chardet)

Language: JavaScript - Size: 1.58 MB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 728 - Forks: 97

polygonplanet/encoding.js

Convert and detect character encoding in JavaScript

Language: JavaScript - Size: 1.76 MB - Last synced at: 2 days ago - Pushed at: 15 days ago - Stars: 601 - Forks: 125

SheetJS/js-codepage

💱 Codepages for JS

Language: JavaScript - Size: 13.5 MB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 152 - Forks: 38

sonicdoe/detect-character-encoding

Detect character encoding using ICU

Language: C++ - Size: 57 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 83 - Forks: 15

kanasimi/CeJS

The project aims to develop a simple-to-use JavaScript module framework with some interesting features.

Language: JavaScript - Size: 34.1 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 62 - Forks: 12

JuliaStrings/StringEncodings.jl

String encoding conversion in Julia using iconv

Language: Julia - Size: 135 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 42 - Forks: 8

t-sin/inquisitor

Encoding/end-of-line detection and external-format abstraction for Common Lisp

Language: Common Lisp - Size: 1.13 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 33 - Forks: 3

timakin/gonvert

Go character encoding converter with automatic encoding estimation.

Language: Go - Size: 46.9 KB - Last synced at: 17 days ago - Pushed at: over 8 years ago - Stars: 26 - Forks: 2

xiaooloong/lua-resty-iconv

LuaJIT FFI bindings for libiconv - Character encoding conversion in OpenResty

Language: Lua - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 22 - Forks: 7

MacPaw/universal-detector

A library for character set autodetection

Language: C++ - Size: 788 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 16

vaiorabbit/everyday_use_kanji

Tables of jōyō kanji and jinmeiyō kanji, plus a code-generation script for ImGui

Language: C++ - Size: 5.82 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 4

krlmlr/enc 📦

A simple class for storing UTF-8 strings

Language: R - Size: 428 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 3

goark/gnkf

Network Kanji Filter in Go

Language: Go - Size: 960 KB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 12 - Forks: 2

TwistAtom/ZWSP-Tool

ZWSP-Tool is a toolkit for manipulating zero-width spaces quickly and easily. In particular, it can detect, clean, hide, extract and brute-force text containing zero-width spaces.

Language: Python - Size: 147 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 0

fumoboy007/UniversalCharsetDetection

A Swift wrapper around the `uchardet` library to detect the character encoding of a sequence of bytes.

Language: C++ - Size: 242 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 6

anshul1004/LyricsGenerator

Song lyrics generation using Recurrent Neural Networks (RNNs)

Language: Python - Size: 47.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

MuntahaShams/Character-level-LSTM-Pytorch

In this notebook, I'll construct a character-level LSTM with PyTorch. The network will train character by character on some text, then generate new text character by character. As an example, I will train on Anna Karenina. This model will be able to generate new text based on the text from the book!

Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

Emoun/racp

Revised ASCII Codes for Programming - A new character set for the modern programming environment

Language: Java - Size: 169 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

MarkMaldaba/PHPSerializationFixer

A tool to fix corrupted PHP serialised strings.

Language: PHP - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

KotlinNLP/LanguageModel

LanguageModel contains a number of methods for language modeling based on the SimpleDNN library.

Language: Kotlin - Size: 112 KB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

exponential-decay/fndec

Guess character encoding in your filenames

Language: Go - Size: 121 KB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

kshetline/by-request

Simple Node http client for use with promises, async/await.

Language: TypeScript - Size: 673 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dahlia/html-charset

Determine character encoding of HTML documents/fragments

Language: Haskell - Size: 19.5 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

SwamiKannan/Automatic-Shakespeare

Auto-generated text based on Shakespeare poetry

Language: Jupyter Notebook - Size: 37.2 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

m7modg97/dotmatrix Fork of stefangordon/dotmatrixtool

A web application for generating character or image byte arrays for dot matrix style OLED or LCD displays.

Language: JavaScript - Size: 344 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

marschall/writers

Implementations of java.io.Writer with different trade-offs

Language: Java - Size: 50.8 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

simplepie/text-encoder

Don't use this yet. Modern character encoding detection and conversion using Intl and ICU.

Language: PHP - Size: 367 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

busterc/x-base-32

🔣 Base32 encoding and decoding that optimizes byte size based on input characters and supports Unicode.

Language: JavaScript - Size: 86.9 KB - Last synced at: 21 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

shiyou0130011/Numeric-Character-Reference-js

Language: TypeScript - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

curegit/unicodecheck

Simple tool to check if Unicode text files are Unicode-normalized

Language: Python - Size: 52.7 KB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

MarjovanLier/StringManipulation

A comprehensive PHP library for efficient string manipulation, providing functions for search word transformation, name fixing, UTF-8 to ANSI conversion, accent removal, and date & time validation.

Language: PHP - Size: 354 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

zjimn/EncodingConverter

A user-friendly desktop application designed to efficiently convert the character encoding of text files. Whether you're dealing with different languages' encodings or need to standardize file formats, this tool simplifies your workflow with an intuitive graphical interface.

Language: Python - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

raminsharifian/ASCII-Character-Set

This repository contains a comprehensive PDF guide on the ASCII character set in C programming, including a detailed ASCII chart with decimal, octal, hexadecimal, and character representations.

Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Uuwai/CharacterEncoding-form

Changes the string to another encoding

Language: C# - Size: 10.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

KosmX/utf-8_transcoder

Convert legacy Windows encodings to UTF-8, but leave UTF-8 files untouched

Language: Java - Size: 178 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

a-t-em/spam-text-classification

LSTM models for text classification on character embeddings.

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

trwyant/perl-Encode-RAD50

Convert to and from the Rad50 character set

Language: Perl - Size: 128 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sean0505tsai/ct2023s Fork of tjhsieh/ct2023s

Computer Typography and Character Encoding course assignments, 2023 Spring, NTUT

Language: HTML - Size: 111 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

exponential-decay/pathlesstaken

Profile strings, such as file paths, for digital preservation considerations, e.g. characters that you want to preserve or characters that you don't.

Language: Python - Size: 254 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

lupuionut/html-ref-encode

Encode HTML text using numeric character references.

Language: Go - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

vudaoanhtuan/character_language_model

Character language model for Vietnamese names, LSTM, character embedding

Language: Jupyter Notebook - Size: 3.66 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

bballamudi/Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs Fork of kamalkraj/Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs

Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs

Size: 1.09 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

robert-d-schultz/encoding-recurrent

LSTM neural networks that classify byte sequences by their encoding.

Language: Python - Size: 5.3 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

ggthedev/chardet Fork of chardet/chardet

Python 2/3 compatible character encoding detector.

Language: Python - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

katmore/intl-rand-string

Generate internationalized random alphanumeric strings

Language: PHP - Size: 77.1 KB - Last synced at: 22 days ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

gungwald/find-non-ascii-c

Finds non-ASCII characters in a text file, C implementation, Windows and Linux

Language: C - Size: 50.8 KB - Last synced at: 12 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

gunim/cp857

CP857 package for Go

Language: Go - Size: 8.79 KB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

ajmd17/simple-utf-8

Useful UTF-8 handling utilities for C++

Language: C++ - Size: 220 KB - Last synced at: 4 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

sp1ff/numblr

IOStreams & character encodings

Language: Makefile - Size: 167 KB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

sheetjs/js-codepage

💱 Codepages for JS

Language: JavaScript - Size: 13.5 MB - Last synced at: over 1 year ago - Stars: 0 - Forks: 0