Topic: "data-format"
toon-format/toon
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
Language: TypeScript - Size: 1.65 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 20,377 - Forks: 896
lance-format/lance
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming.
Language: Rust - Size: 38.1 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 5,842 - Forks: 501
ron-rs/ron
Rusty Object Notation
Language: Rust - Size: 7.77 MB - Last synced at: about 3 hours ago - Pushed at: 6 days ago - Stars: 3,790 - Forks: 143
apache/carbondata
High performance data store solution
Language: Scala - Size: 82.8 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1,439 - Forks: 705
bevry/cson
CoffeeScript-Object-Notation. Same as JSON but for CoffeeScript objects.
Language: CoffeeScript - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,344 - Forks: 55
securisec/chepy
Chepy is a python lib/cli equivalent of the awesome CyberChef tool.
Language: Python - Size: 4.98 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1,016 - Forks: 60
kinverarity1/lasio
Python library for reading and writing well data using Log ASCII Standard (LAS) files
Language: Lasso - Size: 5.03 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 375 - Forks: 160
couchbase/fleece
A super-fast, compact, JSON-equivalent binary data format
Language: C++ - Size: 4.56 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 324 - Forks: 34
ScrapeGraphAI/toonify
Toonify: Compact data format reducing LLM token usage by 30-60%
Language: Python - Size: 1.66 MB - Last synced at: 27 days ago - Pushed at: 30 days ago - Stars: 248 - Forks: 16
NeurodataWithoutBorders/pynwb
A Python API for working with Neurodata stored in the NWB Format
Language: Python - Size: 45.5 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 205 - Forks: 90
cheminfo/netcdfjs
Read and explore NetCDF files
Language: TypeScript - Size: 5.31 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 161 - Forks: 31
svenvc/ston
STON - Smalltalk Object Notation - A lightweight text-based, human-readable data interchange format for class-based object-oriented languages like Smalltalk.
Language: Smalltalk - Size: 565 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 140 - Forks: 32
jorinvo/edn-data
EDN parser and generator that works with plain JS data, with support for TS and node streams
Language: TypeScript - Size: 536 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 6
HelgeSverre/toon-php
Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)
Language: PHP - Size: 526 KB - Last synced at: 17 days ago - Pushed at: 20 days ago - Stars: 97 - Forks: 7
DahnJ/Awesome-Zarr
🎀 Awesome Zarr resources
Size: 201 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 93 - Forks: 1
NaturalIntelligence/nimn-spec
Just Data. Save up to 85% network bandwidth and storage.
Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 89 - Forks: 5
gmggroup/omf-python
Python library for working with OMF files
Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 77 - Forks: 19
bevry/envfile
Parse and write environment files with Node.js
Language: TypeScript - Size: 955 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 62 - Forks: 10
MiraGeoscience/geoh5py
Python API for geoh5, an open file format for geoscientific data.
Language: Python - Size: 35.2 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 60 - Forks: 12
NeurodataWithoutBorders/nwb-schema
Data format specification schema for the NWB neurophysiology data format
Size: 67.2 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 60 - Forks: 16
PEtab-dev/PEtab
PEtab - an SBML and TSV based data format for parameter estimation problems in systems biology
Size: 11.4 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 60 - Forks: 12
eternal-io/keon
A human-readable object notation / serialization format that is syntactically similar to Rust and completely supports Serde's data model.
Language: Rust - Size: 221 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 59 - Forks: 1
reflektone-games/SimaiSharp
A serializer/deserializer for the rhythm game chart format simai.
Language: C# - Size: 264 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 46 - Forks: 8
ViliOrg/Vili
A nice and readable data format!
Language: C++ - Size: 834 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 3
open-gamma-ray-astro/gamma-astro-data-formats
Data formats for gamma-ray astronomy
Language: Python - Size: 9.23 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 27
BAMWelDX/weldx
The welding data exchange format
Language: Python - Size: 7.7 MB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 23 - Forks: 9
vim89/toon4s
toon4s: Token-Oriented Object Notation for JVM
Language: Scala - Size: 572 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 23 - Forks: 3
hltcoe/concrete-python
Python modules and scripts for working with Concrete, a data serialization format for NLP
Language: Python - Size: 1.96 MB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 8
JohnnyBravo75/DataBridge.NET
Configurable data bridge for permanent ETL jobs
Language: C# - Size: 11.1 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 10
ready-steady/hdf5
Reader and writer of HDF5 files
Language: Go - Size: 28.3 KB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 18 - Forks: 2
theobori/nix-converter
All-in-one converter configuration language to Nix and vice versa
Language: Go - Size: 88.9 KB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 17 - Forks: 2
hltcoe/concrete
Thrift definitions, making HLT data specifications concrete
Language: Thrift - Size: 4.61 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 5
LorisYounger/LinePutScript
LinePutScript is a data interchange format: a standard language that defines a line-based reading structure and describes its content
Language: C# - Size: 24 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 13 - Forks: 2
insightindustry/spss-converter
A simple utility that converts SPSS data to / from Pandas DataFrames, CSV, Excel, JSON, YAML, and dict.
Language: Python - Size: 69.3 KB - Last synced at: 29 days ago - Pushed at: almost 3 years ago - Stars: 13 - Forks: 4
iddev5/inon
💾 Data serialization format in Zig
Language: Zig - Size: 169 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0
fundsxml/schema
FundsXML XSD Files
Size: 2.12 MB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 2
tkarabela/ensight-reader
A pure Python reader for the EnSight Gold format
Language: Python - Size: 438 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 4
dnaka91/mabo
Data format and schema, with a type system as strong as Rust's.
Language: Rust - Size: 2.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 1
burtonageo/bvh_anim
Loader for bvh animation files
Language: Rust - Size: 534 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 7
monsterkodi/noon
no ordinary object notation
Language: JavaScript - Size: 1.72 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 1
vimaec/vim
VIM - Runtime 3D BIM Data Format for AEC
Language: C# - Size: 153 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 3
erikw/nestedtext-ruby
A ruby implementation of NestedText https://nestedtext.org/
Language: Ruby - Size: 1.96 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 1
ason-format/ason
ASON (Aliased Serialization Object Notation) is a serialization format designed to optimize token consumption in LLM (Large Language Model) contexts while maintaining human readability and guaranteeing complete round-trip fidelity.
Language: JavaScript - Size: 1.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0
reflektone-games/simai.js
A serializer/deserializer for the rhythm game chart format simai.
Language: TypeScript - Size: 206 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 2
RNABioInfo/rna-interaction-format
RNA Interaction Format (RIF) with reference implementations in C++, JavaScript, and Python
Language: C++ - Size: 298 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0
nexusformat/exampledata
Examples of (mostly) real world NeXus files to inspect, test and train reading software with.
Language: Python - Size: 45.8 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 7
cs91chris/flask_response_builder
Implementations of flask response in many formats like: json, xml, html...
Language: Python - Size: 83 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0
OGRECave/DotSceneFormat 📦
Now in the main repository
Language: C++ - Size: 67.4 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2
HeavyIonAnalysis/AnalysisTree
AnalysisTree data format
Language: C++ - Size: 29.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 8
metayeti/Lime
Game data packer
Language: C++ - Size: 28.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0
JoiLa/cdt
Convert between different data types easily and conveniently in Go. Supports JSON and SQL serialization.
Language: Go - Size: 156 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0
mamantoha/toon-crystal
Crystal implementation of the Token-Oriented Object Notation (TOON) serialization format
Language: Crystal - Size: 93.8 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0
jimmystridh/toon-rs
🎨 Rust implementation of TOON (Token-Oriented Object Notation) - A human-readable, token-efficient serialization format for LLMs with full serde integration
Language: Rust - Size: 332 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2
lifs-tools/rmzTabM
The R-language bindings for mzTab-M
Language: R - Size: 1.86 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 3
hltcoe/concrete-js
JavaScript library for working with Concrete, a data serialization format for NLP
Language: JavaScript - Size: 4.26 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2
master-co/business
A business data model for quick verification, access and output of specific data formats.
Language: TypeScript - Size: 332 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0
DFASDL/dfasdl-core 📦
Moved: https://codeberg.org/tensei-data/dfasdl-core
Language: Scala - Size: 93.8 KB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2
Hihaheho/eure
A minimalist, schema- and editor-friendly data language for algebraic data types and concise descriptions of deeply nested data.
Language: Rust - Size: 2.92 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0
crawlcore/scp-protocol
Site Content Protocol (SCP) reduces waste of bandwidth & processing power during site crawling.
Language: Python - Size: 49.8 KB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 0
AndreaIannoli/TOONIFY
Universal converter for JSON, YAML, XML, and CSV into the TOON format, written in Rust.
Language: Rust - Size: 126 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0
NeuroJSON/jsnirf
A JSON/binary JSON extension to the SNIRF format
Size: 84 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 7
deshima-dev/demerge
🚚 DESHIMA merge code for observed datasets
Language: Python - Size: 262 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1
kesh-lang/sode
kesh tree structured data format
Size: 93.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0
nuald/glSatellite-Demo
Technology demo covering areas such as using OpenGL ES with the Android NDK, custom message queues, interaction between JNI and Java activities, and using NORAD databases and parsing their data.
Language: C++ - Size: 6.21 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1
twesterhout/hdf5-hs
High-level Haskell bindings to HDF5
Language: Haskell - Size: 170 KB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0
robianchini/react-data-formatter
react-data-formatter is a JavaScript library for formatting Brazilian data such as CPF, CNPJ, CEP, phone number, currency, license plate, and gender.
Language: JavaScript - Size: 616 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
ALANVF/reon
REON is an expressive data format based on Red/Rebol syntax that can be converted to/from JSON
Language: CoffeeScript - Size: 176 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0
antgon/edfrw
A Python library for reading and writing European Data Format (EDF) files
Language: Python - Size: 2.86 MB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0
abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
Language: APL - Size: 3.91 KB - Last synced at: 9 months ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0
kikones34/dsmap-serial-format
Low-level description of the format in which GameMaker Studio serializes ds_map objects.
Size: 1.95 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0
DFASDL/dfasdl-utils 📦
Moved: https://codeberg.org/tensei-data/dfasdl-utils
Language: Scala - Size: 705 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2
matthewdeanmartin/hissbytenotation
Library to make it easy to use python literal syntax as a data format
Language: Python - Size: 211 KB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0
poseidon-framework/poseidon-schema
An archaeogenetic genotype data organisation file format
Size: 1.03 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 2
rhinoceros7/flash
Flash is a C library and CLI for append-only, verifiable event streams stored in ".flsh" files.
Language: C - Size: 203 KB - Last synced at: 17 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0
iamgerwin/toon-php
A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.
Language: PHP - Size: 96.7 KB - Last synced at: 12 days ago - Pushed at: 24 days ago - Stars: 2 - Forks: 0
biomarkersParkinson/tsdf
A package to read, modify and write TSDF data in Python.
Language: Python - Size: 4.5 MB - Last synced at: 27 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0
fezcode/piml
Parenthesis Intended Markup Language
Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0
fezcode/piml.js
PIML (Parenthesis Intended Markup Language) encoder/decoder for Javascript
Language: JavaScript - Size: 42 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0
fezcode/go-piml
PIML library for go
Language: Go - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0
Cactus-minecraft-server/nbt
[CactusMC] Module component responsible for NBT parsing.
Language: Rust - Size: 13.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0
deshima-dev/dems
🚚 DESHIMA Measurement Set
Language: Python - Size: 180 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1
zarr-developers/blog
Zarr Official Blog
Language: Ruby - Size: 436 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 7
digitallinguistics/DFT
Discourse Functional Transcription
Size: 23.4 KB - Last synced at: 9 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1
tomhodgins/htmlforever
HTML templating helper functions written in a Continuation-Passing Style for JavaScript, Python, and Ruby
Language: Python - Size: 21.5 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1
CHollasch/LSON4J 📦
Java LSON parser.
Language: Java - Size: 697 KB - Last synced at: 9 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0
ready-steady/mat
Reader and writer of MATLAB MAT-files
Language: Go - Size: 602 KB - Last synced at: 3 months ago - Pushed at: almost 11 years ago - Stars: 2 - Forks: 0
MiraGeoscience/las-geoh5
Import/Export LAS files to/from GEOH5 format for geoscientific data
Language: Python - Size: 3.37 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 1
TangYewLabs/corpus-informaticus
CIVD (Corpus-Informaticus Volumetric Data): experimental 3D container + ROI engine for robotics/AI payloads (maps, logs, sensor grids) with capsule metadata.
Language: Python - Size: 21.3 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0
naufalhanif25/hron-format
HRON (Hierarchical Reference Object Notation) is a structured text format focused on being readable, compact, and fast to parse.
Language: TypeScript - Size: 348 KB - Last synced at: 18 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0
devcom33/json-toon-converter
Convert JSON to TOON format and back - reduce LLM token usage by up to 40%.
Language: JavaScript - Size: 894 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
DigitalCoreHub/laravel-toon
A friendly JSON transformer for Laravel — Convert JSON ↔ TOON, an ultra-minimal, line-based data format.
Language: PHP - Size: 118 KB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
supervisely-ecosystem/convert-yolov5-to-supervisely-format
YOLOv5 to Supervisely format
Language: Python - Size: 2.19 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2
vimalnathnambiar/exfilms
A command-line interface tool to extract, filter, and standardise MS data.
Language: JavaScript - Size: 80 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0
Amnicastro98/JSON-Reference
Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0
peterkuma/pst
Plain Structured Text (PST)
Language: Python - Size: 110 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0
trilobite-stdlib/trilo-xfile-c
That format framework from the laboratory. Basically, the data format handling functionality for JSON, CSV, XML, and more written in C.
Language: C - Size: 79.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1
AvailLang/avail-json
Read and write freeform JSON with precise error checking.
Language: Kotlin - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
AmateurPanda92/game-data-schema
🗂️ A TypeScript data model and JSON schema for encapsulating game data, with a focus on flexibility and human readability.
Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0
Naira0/datax
a custom data format parser.
Language: C++ - Size: 6.84 KB - Last synced at: almost 3 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0
andrevdl/SmartIO
Transforms common data formats from one type to another, such as JSON, XML and datasets
Language: C++ - Size: 220 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0